CA2201885C - Dna polymerases having modified nucleotide binding site for dna sequencing - Google Patents
Dna polymerases having modified nucleotide binding site for dna sequencing Download PDFInfo
- Publication number
- CA2201885C CA2201885C CA002201885A CA2201885A CA2201885C CA 2201885 C CA2201885 C CA 2201885C CA 002201885 A CA002201885 A CA 002201885A CA 2201885 A CA2201885 A CA 2201885A CA 2201885 C CA2201885 C CA 2201885C
- Authority
- CA
- Canada
- Prior art keywords
- dna polymerase
- dna
- polymerase
- residue
- amino acid
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/12—Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
- C12N9/1241—Nucleotidyltransferases (2.7.7)
- C12N9/1252—DNA-directed DNA polymerase (2.7.7.7), i.e. DNA replicase
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6844—Nucleic acid amplification reactions
- C12Q1/686—Polymerase chain reaction [PCR]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Engineering & Computer Science (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Molecular Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Genetics & Genomics (AREA)
- Microbiology (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Immunology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Analytical Chemistry (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Biomedical Technology (AREA)
- Medicinal Chemistry (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Enzymes And Modification Thereof (AREA)
- Saccharide Compounds (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
Modified gene encoding a modified DNA polymerase wherein the modified polymerase incorporates dideoxynucleotides at least 20--fold better compared to the corresponding deoxynucleotides as compared with the corresponding naturally-occurring DNA polymerase.
Description
I
DESCRIPTION
DNA Polyrnerases Havinq Mo ifi d Nucleotid Bindinc~ Site for DNA Seguencina This application is a continuation-in-part of Tabor and Richardson, entitled "DNA POLYMERASES HAVING MODIFIED
NUCLEOTIDE BINDING SITE FOR DNA SEQUENCING," filed October 17, 1994.
Background of the Invention This invention was made with government support including a grant from the U.S. Dept of Energy, contract number DE-FG02-88ER60688. The U.S. government may have certain rights in the invention.
This invention relates to DNA polymerases suitable for DNA sequencing and to automated and manual methods for DNA sequencing.
The following is a brief description of art relevant to DNA sequencing techniques. This is provided only to give general guidance to those reading the application, and is not an admission that any art cited herein or referred to explicitly or implicitly is prior art to the appended claims.
DNA sequencing generally involves the generation of four populations of single-stranded DNA fragments having one defined terminus and one variable terminus. The variable terminus generally terminates at specific nucleotide bases (either guanine (G), adenine (A), thymine (T), or cytosine (C)). The four different sets of fragments are each separated on the basis of their length, one procedure being on a high resolution polyacrylamide gel; each band on the gel corresponds colinearly to a specific nucleotide in the DNA sequence, thus identifying the positions in the sequence of the given nucleotide base. See Tabor and Richardson, U.S.
Patents 4,942,130 and 4,962,020.
DESCRIPTION
DNA Polyrnerases Havinq Mo ifi d Nucleotid Bindinc~ Site for DNA Seguencina This application is a continuation-in-part of Tabor and Richardson, entitled "DNA POLYMERASES HAVING MODIFIED
NUCLEOTIDE BINDING SITE FOR DNA SEQUENCING," filed October 17, 1994.
Background of the Invention This invention was made with government support including a grant from the U.S. Dept of Energy, contract number DE-FG02-88ER60688. The U.S. government may have certain rights in the invention.
This invention relates to DNA polymerases suitable for DNA sequencing and to automated and manual methods for DNA sequencing.
The following is a brief description of art relevant to DNA sequencing techniques. This is provided only to give general guidance to those reading the application, and is not an admission that any art cited herein or referred to explicitly or implicitly is prior art to the appended claims.
DNA sequencing generally involves the generation of four populations of single-stranded DNA fragments having one defined terminus and one variable terminus. The variable terminus generally terminates at specific nucleotide bases (either guanine (G), adenine (A), thymine (T), or cytosine (C)). The four different sets of fragments are each separated on the basis of their length, one procedure being on a high resolution polyacrylamide gel; each band on the gel corresponds colinearly to a specific nucleotide in the DNA sequence, thus identifying the positions in the sequence of the given nucleotide base. See Tabor and Richardson, U.S.
Patents 4,942,130 and 4,962,020.
2 ~ 2 018 8 5 PCT/US95/12920 There are two general methods of DNA sequencing.
One method (Maxam and Gilbert sequencing) involves the chemical degradation of isolated DNA fragments, each labeled with a single radiolabel at its defined terminus, each reaction yielding a limited cleavage specifically at one or more of the four bases (G, A, T or C). The other ~
method (dideoxy or chain-termination sequencing) involves the enzymatic synthesis of a DNA strand. Sanger et al.
(Proc. Nat. Acad. Sci. USA 74:5463, 1977). Four separate syntheses are generally run, each reaction being caused to terminate at a specific base (G, A, T or C) via incorporation of an appropriate chain terminating nucleotide, such as a dideoxynucleotide. The latter method is preferred since the DNA fragments can be uniformly labelled (instead of end labelled) by the inclusion of a radioactively labeled nucleoside triphosphate and thus the larger DNA fragments contain increasingly more radioactivity. Further, 35S-labelled nucleotides can be used in place of 32P-labelled nucleotides, resulting in sharper definition; the reaction products are easier to interpret since each lane corresponds only to either G, A, T or C. The enzymes used for most dideoxy sequencing include T7 DNA
polymerase and DNA polymerases isolated from thermophilic organisms such as Taq, Vent, Tth, and others. Other polymerases used to a lesser extent include AMV reverse transcriptase and Klenow fragment of E. coli DNA
polymerase I.
In the dideoxy chain terminating method a short single-stranded primer is annealed to a single-stranded template. The primer is elongated at its 3'-end by the incorporation of deoxynucleotides (dNMPs) until a dideoxynucleotide (ddNMP) is incorporated. When a ddNMP
is incorporated elongation ceases at that base. Other chain terminating agents can be used in place of a ddNTP
and the ddNTP can be labelled as discussed below.
Using the above methodology, automated systems for DNA sequence analysis have been developed. One instrument, which was manufactured by EG&G, makes use of SUBSTITUTE SHEET (RULE 26) conventional dideoxy chain terminating reactions with a radioactively labeled nucleotide. The resulting DNA
products are separated by gel electrophoresis. Toneguzzo et al , 6 Biotechniques 460, 1988. A detector scans for radioactivity as it passes through the bottom of the gel.
' Four synthesis reactions are required for each template to be sequenced, as well as four lanes on each gel, a separate lane being used for products terminated by each specific chain terminating agent.
Kambara et al , 6 Biotechnology 816, 1988, have used a fluorescent-labelled primer. The resulting fluorescently labelled products are excited with a laser at the bottom of the gel and the fluorescence detected with a CRT monitor. This procedure also requires four synthesis reactions and four lanes on the gel for each template to be sequenced.
Applied Biosystems manufactures an instrument in which four different primers are used, each labelled with a different fluorescent marker. Smith et al., 13 Nuc.
Acid. Res. 2399, 1985; and 321 Nature 674, 1986. Each primer is used in a separate reaction containing one of four dideoxynucleotides. After the four reactions have been carried out the mixtures are combined and the DNA
fragments are fractionated in a single lane on a gel. A
laser at the bottom of the gel is. used to detect fluorescent products after they have been electrophoresed through the gel. This system requires four separate annealing reactions and four separate synthesis reactions for each template, but only a single lane on the gel.
Computer analysis of the sequence is made easier by having all four bands in a single lane.
DuPont used to provide an instrument in which a different fluorescent marker was attached to each of four dideoxynucleoside triphosphates. Prober et al., 238 Science 336, 1987. A single annealing step, a single polymerase reaction (containing each of the four labelled dideoxynucleoside triphosphates) and a single lane in the sequencing gel are required. The four different fluorescent markers in the DNA products are detected SUBSTilUBf SMEET (RtU 20 separately as they are electrophoresed through the gel.
Englert et al., U.S. Pat. No. 4,707,235 (1987), describes a multichannel electrophoresis apparatus having a detection means, disposed substantially across the 5 whole width of the gel, that can detect labelled DNA
products as they migrate past the detector means in four separate lanes, and identifies the channel or lane in which the sample is located. Preferably, radioisotopic labels are used.
Inherent to procedures currently used for DNA
sequence analysis is the necessity to separate either radioactively or fluorescently-labelled DNA products by a gel permeation procedure such as polyacrylamide gel electrophoresis, and then detect their locations relative to one another along the axis of movement through the gel. The accuracy of this procedure is determined in part by the uniformity of the signal in bands which have permeated approximately the same distance through the gel. Differences or variations in signal intensities between nearby bands create several problems. First, they decrease the sensitivity of the method, which is limited by the ability to detect the bands containing the weakest signals. Second, they create difficulties in determining whether a band with a weak signal is a true signal due to the incorporation of a chain terminating agent, or an artifact due to a pause site in the DNA
where the polymerase has dissociated. Third, they decrease the accuracy in determining the DNA sequence between closely spaced bands since the strong signal of one band may mask the weak signal of its neighbor. See Tabor and Richardson, supra.
Variation in band intensity can arise from an inherent property of most DNA polymerases. Most DNA
polymerases discriminate against the chain terminating 35 dideoxynucleotides used in DNA sequence analysis. T4 DNA
polymerase discriminates against ddNTPs to such an extent that it cannot be used for DNA sequencing. E. coli DNA
polymerase I, Taq, and Vent DNA polymerase also discriminate strongly against ddNTPs, each incorporating SUBSTCRM SNEfT (RUL,E 26) a ddNNP a thousand times slower than the corresponding dNTP. Tabor and Richardson supra, have shown that T7 DNA polymerase lies at the other end of the spectrum, 5 discriminating against ddNTPs only several fold . If a DNA polymerase discriminated against a ddNTP to the same extent at all sequences, this problem could be overcome by simply altering the ratio of ddNTPs to dNTPs. Such an approach has been used with E. coli DNA polymerase I and Taq DNA polymerase. However, the extent of discrimination varies with the adjacent DNA sequences, which leads to wide variation in the intensity of adjacent radioactive fragments. The intensity of specific fragments can vary by 50-fold for E. coli DNA
polymerase I but only several fold for T7 DNA polymerase.
Consequently, the intensity of: bands on a DNA sequencing gel produced by T7 DNA polymerase are of similar intensity thus facilitating their detection and analysis by automated procedures. In addition, procedures that even further reduce the discrimination against dideoxynucleotides by T7 DNA polymerase are described such that it incorporates dideoxynucleotides equally as well as deoxynucleotides. These procedures and conditions also reduce but do not eliminate discrimination by other DNA polymerases such as Klenow and Taq DNA polymerases. For example, the use of manganese in place of, or in addition to, magnesium in the reaction mixture may reduce or eliminate discrimination against dideoxynucleotides. Under such conditions, T7 DNA polymerase does not differentiate between the two molecules whereas other DNA polymerases such as Klenow fragment, Taq and Vent still discriminate to some degree. For example, Klenow still discriminates against ddNTPs by as much as four-fold in the presence of manganese. More important, even though the overall degree of discrimination by such enzymes as Klenow and Taq DNA polymerases is reduced, the intensity of specific fragments can vary by much more than four fold due to high discrimination at certain sequences in DNA. These WO 96/12042 22018 8 5 PCT/US95/129,0 polymerases and procedures are now almost universally used in manual DNA sequencing (i.e., without aid of sequencing machines such as described above) and are extensively used in automated methods. The use of manganese and the lack of discrimination against ddNTPs at all sites results in bands of uniform intensities, thus facilitating the reading of sequencing gels, either by manual or automated procedures. Moreover, the lack of discrimination enables the use of novel procedures for sequence analysis (Tabor and Richardson, supra) . A
method based on this finding is provided to determine a ,DNA sequence in a single reaction that contains all four ddNTPs at different ratios, by measuring the relative intensity of each peak after gel electrophoresis. The authors indicate:
The DNA polymerases of this invention do not discriminate significantly between dideoxy-nucleotide analogs and normal nucleotides. That is, the chance of incorporation of an analog is approximately the same as that of a normal nucleotide or at least incorporates the analog with at least 1/10 the efficiency that of a normal analog. The polymerases of this invention also do not discriminate significantly against some other analogs.
This is important since,=in addition to the four normal deoxynucleoside triphosphates (dGTP, dATP, dTTP and dCTP), sequencing reactions require the incorporation of other types of nucleotide derivatives such as:
radioactively- or fluorescently-labelled nucleoside triphosphates, usually for labeling the synthesized strands with 35S, 32P, or other chemical agents. When a DNA polymerase does not discriminate against analogs the same probability will exist for the incorporation of an analog as for a normal nucleotide. For labelled nucleoside triphosphates this is important in order to efficiently label the synthesized DNA strands using a minimum of radioactivity. [4,942,130, COL 5:5]
They also state:
The ability to produce nearby bands of approximately the same intensity is useful since it permits the SUBS"fITtlTF SHET MM 2M
One method (Maxam and Gilbert sequencing) involves the chemical degradation of isolated DNA fragments, each labeled with a single radiolabel at its defined terminus, each reaction yielding a limited cleavage specifically at one or more of the four bases (G, A, T or C). The other ~
method (dideoxy or chain-termination sequencing) involves the enzymatic synthesis of a DNA strand. Sanger et al.
(Proc. Nat. Acad. Sci. USA 74:5463, 1977). Four separate syntheses are generally run, each reaction being caused to terminate at a specific base (G, A, T or C) via incorporation of an appropriate chain terminating nucleotide, such as a dideoxynucleotide. The latter method is preferred since the DNA fragments can be uniformly labelled (instead of end labelled) by the inclusion of a radioactively labeled nucleoside triphosphate and thus the larger DNA fragments contain increasingly more radioactivity. Further, 35S-labelled nucleotides can be used in place of 32P-labelled nucleotides, resulting in sharper definition; the reaction products are easier to interpret since each lane corresponds only to either G, A, T or C. The enzymes used for most dideoxy sequencing include T7 DNA
polymerase and DNA polymerases isolated from thermophilic organisms such as Taq, Vent, Tth, and others. Other polymerases used to a lesser extent include AMV reverse transcriptase and Klenow fragment of E. coli DNA
polymerase I.
In the dideoxy chain terminating method a short single-stranded primer is annealed to a single-stranded template. The primer is elongated at its 3'-end by the incorporation of deoxynucleotides (dNMPs) until a dideoxynucleotide (ddNMP) is incorporated. When a ddNMP
is incorporated elongation ceases at that base. Other chain terminating agents can be used in place of a ddNTP
and the ddNTP can be labelled as discussed below.
Using the above methodology, automated systems for DNA sequence analysis have been developed. One instrument, which was manufactured by EG&G, makes use of SUBSTITUTE SHEET (RULE 26) conventional dideoxy chain terminating reactions with a radioactively labeled nucleotide. The resulting DNA
products are separated by gel electrophoresis. Toneguzzo et al , 6 Biotechniques 460, 1988. A detector scans for radioactivity as it passes through the bottom of the gel.
' Four synthesis reactions are required for each template to be sequenced, as well as four lanes on each gel, a separate lane being used for products terminated by each specific chain terminating agent.
Kambara et al , 6 Biotechnology 816, 1988, have used a fluorescent-labelled primer. The resulting fluorescently labelled products are excited with a laser at the bottom of the gel and the fluorescence detected with a CRT monitor. This procedure also requires four synthesis reactions and four lanes on the gel for each template to be sequenced.
Applied Biosystems manufactures an instrument in which four different primers are used, each labelled with a different fluorescent marker. Smith et al., 13 Nuc.
Acid. Res. 2399, 1985; and 321 Nature 674, 1986. Each primer is used in a separate reaction containing one of four dideoxynucleotides. After the four reactions have been carried out the mixtures are combined and the DNA
fragments are fractionated in a single lane on a gel. A
laser at the bottom of the gel is. used to detect fluorescent products after they have been electrophoresed through the gel. This system requires four separate annealing reactions and four separate synthesis reactions for each template, but only a single lane on the gel.
Computer analysis of the sequence is made easier by having all four bands in a single lane.
DuPont used to provide an instrument in which a different fluorescent marker was attached to each of four dideoxynucleoside triphosphates. Prober et al., 238 Science 336, 1987. A single annealing step, a single polymerase reaction (containing each of the four labelled dideoxynucleoside triphosphates) and a single lane in the sequencing gel are required. The four different fluorescent markers in the DNA products are detected SUBSTilUBf SMEET (RtU 20 separately as they are electrophoresed through the gel.
Englert et al., U.S. Pat. No. 4,707,235 (1987), describes a multichannel electrophoresis apparatus having a detection means, disposed substantially across the 5 whole width of the gel, that can detect labelled DNA
products as they migrate past the detector means in four separate lanes, and identifies the channel or lane in which the sample is located. Preferably, radioisotopic labels are used.
Inherent to procedures currently used for DNA
sequence analysis is the necessity to separate either radioactively or fluorescently-labelled DNA products by a gel permeation procedure such as polyacrylamide gel electrophoresis, and then detect their locations relative to one another along the axis of movement through the gel. The accuracy of this procedure is determined in part by the uniformity of the signal in bands which have permeated approximately the same distance through the gel. Differences or variations in signal intensities between nearby bands create several problems. First, they decrease the sensitivity of the method, which is limited by the ability to detect the bands containing the weakest signals. Second, they create difficulties in determining whether a band with a weak signal is a true signal due to the incorporation of a chain terminating agent, or an artifact due to a pause site in the DNA
where the polymerase has dissociated. Third, they decrease the accuracy in determining the DNA sequence between closely spaced bands since the strong signal of one band may mask the weak signal of its neighbor. See Tabor and Richardson, supra.
Variation in band intensity can arise from an inherent property of most DNA polymerases. Most DNA
polymerases discriminate against the chain terminating 35 dideoxynucleotides used in DNA sequence analysis. T4 DNA
polymerase discriminates against ddNTPs to such an extent that it cannot be used for DNA sequencing. E. coli DNA
polymerase I, Taq, and Vent DNA polymerase also discriminate strongly against ddNTPs, each incorporating SUBSTCRM SNEfT (RUL,E 26) a ddNNP a thousand times slower than the corresponding dNTP. Tabor and Richardson supra, have shown that T7 DNA polymerase lies at the other end of the spectrum, 5 discriminating against ddNTPs only several fold . If a DNA polymerase discriminated against a ddNTP to the same extent at all sequences, this problem could be overcome by simply altering the ratio of ddNTPs to dNTPs. Such an approach has been used with E. coli DNA polymerase I and Taq DNA polymerase. However, the extent of discrimination varies with the adjacent DNA sequences, which leads to wide variation in the intensity of adjacent radioactive fragments. The intensity of specific fragments can vary by 50-fold for E. coli DNA
polymerase I but only several fold for T7 DNA polymerase.
Consequently, the intensity of: bands on a DNA sequencing gel produced by T7 DNA polymerase are of similar intensity thus facilitating their detection and analysis by automated procedures. In addition, procedures that even further reduce the discrimination against dideoxynucleotides by T7 DNA polymerase are described such that it incorporates dideoxynucleotides equally as well as deoxynucleotides. These procedures and conditions also reduce but do not eliminate discrimination by other DNA polymerases such as Klenow and Taq DNA polymerases. For example, the use of manganese in place of, or in addition to, magnesium in the reaction mixture may reduce or eliminate discrimination against dideoxynucleotides. Under such conditions, T7 DNA polymerase does not differentiate between the two molecules whereas other DNA polymerases such as Klenow fragment, Taq and Vent still discriminate to some degree. For example, Klenow still discriminates against ddNTPs by as much as four-fold in the presence of manganese. More important, even though the overall degree of discrimination by such enzymes as Klenow and Taq DNA polymerases is reduced, the intensity of specific fragments can vary by much more than four fold due to high discrimination at certain sequences in DNA. These WO 96/12042 22018 8 5 PCT/US95/129,0 polymerases and procedures are now almost universally used in manual DNA sequencing (i.e., without aid of sequencing machines such as described above) and are extensively used in automated methods. The use of manganese and the lack of discrimination against ddNTPs at all sites results in bands of uniform intensities, thus facilitating the reading of sequencing gels, either by manual or automated procedures. Moreover, the lack of discrimination enables the use of novel procedures for sequence analysis (Tabor and Richardson, supra) . A
method based on this finding is provided to determine a ,DNA sequence in a single reaction that contains all four ddNTPs at different ratios, by measuring the relative intensity of each peak after gel electrophoresis. The authors indicate:
The DNA polymerases of this invention do not discriminate significantly between dideoxy-nucleotide analogs and normal nucleotides. That is, the chance of incorporation of an analog is approximately the same as that of a normal nucleotide or at least incorporates the analog with at least 1/10 the efficiency that of a normal analog. The polymerases of this invention also do not discriminate significantly against some other analogs.
This is important since,=in addition to the four normal deoxynucleoside triphosphates (dGTP, dATP, dTTP and dCTP), sequencing reactions require the incorporation of other types of nucleotide derivatives such as:
radioactively- or fluorescently-labelled nucleoside triphosphates, usually for labeling the synthesized strands with 35S, 32P, or other chemical agents. When a DNA polymerase does not discriminate against analogs the same probability will exist for the incorporation of an analog as for a normal nucleotide. For labelled nucleoside triphosphates this is important in order to efficiently label the synthesized DNA strands using a minimum of radioactivity. [4,942,130, COL 5:5]
They also state:
The ability to produce nearby bands of approximately the same intensity is useful since it permits the SUBS"fITtlTF SHET MM 2M
results of any sequencing reaction to be read more easily and with greater certainty. Further, since the DNA
products from a sequencing reaction with a specific chain terminating agent form bands which are of approximately the same intensity as that of nearby bands, band intensity itself provides a specific label for the series of bands so formed. The number of DNA products of approximately the same molecular weight produced by a given chain terminating agent varies depending upon the concentration of the chain terminating agent. Thus, by using a different concentration of each of the four chain terminating agents for the synthesis the DNA products incorporating one chain terminating agent are distinguished from DNA products of approximately the same molecular weight incorporating other chain terminating agents in that they differ in number or amount;
consequently, the bands of DNA products can be identified as to chain terminating agent simply by their intensity as compared to the intensities of nearby bands. As a result, two or more series of DNA products, each series having a different chain terminating agent, can be subjected to gel permeation in a single lane and identified, i.e., distinguished from each other, by the intensity of each band as compared to the intensity of nearby bands. Moreover, the syntheses of DNA products incorporating different chain terminating agents need not be carried out separately, in separate containers, but may all be carried out simultaneously in a single reaction vessel, and the same label, e.g., radioisotopic, fluorescent, etc. can, if desired, be used for all chain terminating agents instead of a different label for each, thus simplifying the procedure. [4,962,020, col 3:1-35]
See also Tabor and Richardson Proc. Nat1. Acad.
Sci. USA 86, 4076 -4080 (1989) which indicates that substitution of manganese ions for magnesium ions for catalysis by T7 DNA polymerase or E. coli DNA polymerase reduces the discrimination of these polymerases for ddNTPs by 4-100 fold, and Tabor and Richardson J. Bio1.
Chem. 265, 8322-8328 (1990) which describes the use of SUBS`T11UrE SiiEET (RW 26) WO 96/12042 2201885 pCT/US95/1294D
products from a sequencing reaction with a specific chain terminating agent form bands which are of approximately the same intensity as that of nearby bands, band intensity itself provides a specific label for the series of bands so formed. The number of DNA products of approximately the same molecular weight produced by a given chain terminating agent varies depending upon the concentration of the chain terminating agent. Thus, by using a different concentration of each of the four chain terminating agents for the synthesis the DNA products incorporating one chain terminating agent are distinguished from DNA products of approximately the same molecular weight incorporating other chain terminating agents in that they differ in number or amount;
consequently, the bands of DNA products can be identified as to chain terminating agent simply by their intensity as compared to the intensities of nearby bands. As a result, two or more series of DNA products, each series having a different chain terminating agent, can be subjected to gel permeation in a single lane and identified, i.e., distinguished from each other, by the intensity of each band as compared to the intensity of nearby bands. Moreover, the syntheses of DNA products incorporating different chain terminating agents need not be carried out separately, in separate containers, but may all be carried out simultaneously in a single reaction vessel, and the same label, e.g., radioisotopic, fluorescent, etc. can, if desired, be used for all chain terminating agents instead of a different label for each, thus simplifying the procedure. [4,962,020, col 3:1-35]
See also Tabor and Richardson Proc. Nat1. Acad.
Sci. USA 86, 4076 -4080 (1989) which indicates that substitution of manganese ions for magnesium ions for catalysis by T7 DNA polymerase or E. coli DNA polymerase reduces the discrimination of these polymerases for ddNTPs by 4-100 fold, and Tabor and Richardson J. Bio1.
Chem. 265, 8322-8328 (1990) which describes the use of SUBS`T11UrE SiiEET (RW 26) WO 96/12042 2201885 pCT/US95/1294D
pyrophosphatase and manganese ions to generate dideoxy-terminated fragments of uniform intensity using T7 DNA
polymerase.
Summary of the Invention Applicant believes that the lesser utility of some DNA polymerases for dideoxy DNA sequencing is due in part to the reduced ability of those polymerases to incorporate a ddNMP (or other nucleotide analog) in place of a dNMP. As noted above, the ability not to discriminate allows use of lower concentrations of ddNTPs than with enzymes that do discriminate, and most important it provides banding patterns in sequencing gels that are of more uniform intensity along their length.
Both these results make automated sequencing with the enzyme easier and more profitable in that longer DNA
sequences can be determined with greater confidence.
The present invention provides a method by which an otherwise discriminatory DNA polymerase can be altered to discriminate less against ddNTPs than the corresponding naturally occurring enzyme. This method involves the modification of the polymerase genetically to provide amino acid residues at key locations that enhance the ability of the polymerase to incorporate analogs of dNMPs. Applicant has determined that amino acid changes in a specific region of DNA polymerases have a dramatic effect on the ability of those DNA polymerases to incorporate dideoxynucleotides; the specific amino acid residue inserted determines whether the polymerase is more or less discriminatory towards ddNTPs. Applicant has determined that modifying DNA polymerases so that they incorporate dideoxynucleotides more efficiently has a tremendous effect on their utility in DNA sequencing.
It is possible that such modified DNA polymerases will also prove useful for other common molecular biology procedures such as the amplification of DNA (for example by the polymerase chain reaction), in vitro mutagenesis and filling in the ends of DNA fragments. The combination of this technology with existing knowledge S'UBSTlTUFE SMEET (RULE 2M
polymerase.
Summary of the Invention Applicant believes that the lesser utility of some DNA polymerases for dideoxy DNA sequencing is due in part to the reduced ability of those polymerases to incorporate a ddNMP (or other nucleotide analog) in place of a dNMP. As noted above, the ability not to discriminate allows use of lower concentrations of ddNTPs than with enzymes that do discriminate, and most important it provides banding patterns in sequencing gels that are of more uniform intensity along their length.
Both these results make automated sequencing with the enzyme easier and more profitable in that longer DNA
sequences can be determined with greater confidence.
The present invention provides a method by which an otherwise discriminatory DNA polymerase can be altered to discriminate less against ddNTPs than the corresponding naturally occurring enzyme. This method involves the modification of the polymerase genetically to provide amino acid residues at key locations that enhance the ability of the polymerase to incorporate analogs of dNMPs. Applicant has determined that amino acid changes in a specific region of DNA polymerases have a dramatic effect on the ability of those DNA polymerases to incorporate dideoxynucleotides; the specific amino acid residue inserted determines whether the polymerase is more or less discriminatory towards ddNTPs. Applicant has determined that modifying DNA polymerases so that they incorporate dideoxynucleotides more efficiently has a tremendous effect on their utility in DNA sequencing.
It is possible that such modified DNA polymerases will also prove useful for other common molecular biology procedures such as the amplification of DNA (for example by the polymerase chain reaction), in vitro mutagenesis and filling in the ends of DNA fragments. The combination of this technology with existing knowledge S'UBSTlTUFE SMEET (RULE 2M
for alteration of 3'-5' exonuclease activity of a DNA
polymerase (such as T7 DNA polymerase as described by Tabor and Richardson, supra), and the use of manganese and pyrophosphatase in sequencing reactions will allow the production of significantly superior enzymes to those presently known. Modification of the DNA polymerase is preferably by substitution of one or more amino acids but may be by the insertion of one or more additional amino acids or the deletion of one or more amino acids.
In one particular aspect, namely, DNA sequencing, thermophilic enzymes having an ability to catalyze the polymerization of nucleotides at temperatures above 50 C
and, in particular, above 60 C, 70 C, or even 80 C, under conditions used for DNA sequencing, are well known in the art. Such enzymes are generally present in organisms that grow at these temperatures. However, applicant believes that many of these enzymes suffer from limitations, including the limited ability to incorporate dideoxynucleotides. By modification of these enzymes using methods shown below, those in the art can now modify any desired thermophilic DNA polymerase to make it incorporate dideoxynucleotides more efficiently. Such enzymes will be superior to those existing in the present day for DNA sequencing both in automated machines and in manual sequencing, especially in procedures known as cycle sequencing. In cycle sequencing, multiple rounds of DNA synthesis are carried out from the same template, with the synthesized strand removed after each cycle by heat denaturation; this allows much smaller amounts of DNA template to be used in a sequencing reaction Applicanthas determined experimentally that amino acid residue 526 in the relatively non-discriminating enzyme T7 DNA polymerase provides it with this property.
Applicant has determined that by modification of residue 526 it is possible to increase the ability of T7 DNA
polymerase to discriminate many fold. Based on the amino acid homologies between T7 DNA polymerase and other DNA
polymerases, applicant has determined that altering the residue at the homologous site in other DNA polymerases St=TM SKET (RUU 2%
WO 96/12042 22 018 8 5 PCT/US95/129*
likewise affects their ability to discriminate against dideoxnucleotides. Examples of such homologous sites are residue 762 of E. coli DNA polymerase I and residue 667 of Taq DNA polymerase. In all three of these examples, 5 applicant has shown that it is important that the residue at this site is different from phenylalanine (F), e.g., =
it may be tyrosine (Y) as in T7 DNA polymerase.
Surprisingly, modification of this single amino acid residue, even by the addition of a single hydroxyl group, 10 provides a very large alteration (250 - 8,000 fold) in discrimination levels. Those in the art will recognize that changes at this one site are not limiting in this invention, and changes at other sites that decrease the ability of a polymerase to discriminate can now be readily found by routine experimentation. For example, applicant has found that modification of T7 DNA
polymerase at 13 other sites also results in an increased ability of the enzyme to discriminate against ddNTPs, however the effect of alterations at these sites is much less, only 5 - 20 fold. By use of analogous procedures, other sites that effect discrimination against ddNTPs can be readily identified in other DNA
po.lymerase to make them more useful for DNA sequencing.
Such other sites include amino acid residues in regions highly homologous between E. coli DNA polymerase I and T7 DNA polymerase, since these regions are likely to make up in part the binding domain for ddNTPs; in E. coli DNA
polymerase I, these regions include amino acids in regions analogous to those in T7 DNA polymerase'from conserved or non-conserved amino acids in regions 665 -681 and 754 - 783, and possibly in regions 709 - 734, 797 - 866, and 913 - 927. The amino acid change to provide the desired function can be chosen to be identical to the corresponding amino acids of a non- .
discriminatory enzyme like T7 DNA polymerase, or other functionally equivalent amino acids which can be chosen by routine experimentation. By changing non-conserved amino acids more profound alteration of the ability to discrimination is obtained. Non-conserved amino acids $UBSTlTUIE SMHT (RULE 2%
220108~
polymerase (such as T7 DNA polymerase as described by Tabor and Richardson, supra), and the use of manganese and pyrophosphatase in sequencing reactions will allow the production of significantly superior enzymes to those presently known. Modification of the DNA polymerase is preferably by substitution of one or more amino acids but may be by the insertion of one or more additional amino acids or the deletion of one or more amino acids.
In one particular aspect, namely, DNA sequencing, thermophilic enzymes having an ability to catalyze the polymerization of nucleotides at temperatures above 50 C
and, in particular, above 60 C, 70 C, or even 80 C, under conditions used for DNA sequencing, are well known in the art. Such enzymes are generally present in organisms that grow at these temperatures. However, applicant believes that many of these enzymes suffer from limitations, including the limited ability to incorporate dideoxynucleotides. By modification of these enzymes using methods shown below, those in the art can now modify any desired thermophilic DNA polymerase to make it incorporate dideoxynucleotides more efficiently. Such enzymes will be superior to those existing in the present day for DNA sequencing both in automated machines and in manual sequencing, especially in procedures known as cycle sequencing. In cycle sequencing, multiple rounds of DNA synthesis are carried out from the same template, with the synthesized strand removed after each cycle by heat denaturation; this allows much smaller amounts of DNA template to be used in a sequencing reaction Applicanthas determined experimentally that amino acid residue 526 in the relatively non-discriminating enzyme T7 DNA polymerase provides it with this property.
Applicant has determined that by modification of residue 526 it is possible to increase the ability of T7 DNA
polymerase to discriminate many fold. Based on the amino acid homologies between T7 DNA polymerase and other DNA
polymerases, applicant has determined that altering the residue at the homologous site in other DNA polymerases St=TM SKET (RUU 2%
WO 96/12042 22 018 8 5 PCT/US95/129*
likewise affects their ability to discriminate against dideoxnucleotides. Examples of such homologous sites are residue 762 of E. coli DNA polymerase I and residue 667 of Taq DNA polymerase. In all three of these examples, 5 applicant has shown that it is important that the residue at this site is different from phenylalanine (F), e.g., =
it may be tyrosine (Y) as in T7 DNA polymerase.
Surprisingly, modification of this single amino acid residue, even by the addition of a single hydroxyl group, 10 provides a very large alteration (250 - 8,000 fold) in discrimination levels. Those in the art will recognize that changes at this one site are not limiting in this invention, and changes at other sites that decrease the ability of a polymerase to discriminate can now be readily found by routine experimentation. For example, applicant has found that modification of T7 DNA
polymerase at 13 other sites also results in an increased ability of the enzyme to discriminate against ddNTPs, however the effect of alterations at these sites is much less, only 5 - 20 fold. By use of analogous procedures, other sites that effect discrimination against ddNTPs can be readily identified in other DNA
po.lymerase to make them more useful for DNA sequencing.
Such other sites include amino acid residues in regions highly homologous between E. coli DNA polymerase I and T7 DNA polymerase, since these regions are likely to make up in part the binding domain for ddNTPs; in E. coli DNA
polymerase I, these regions include amino acids in regions analogous to those in T7 DNA polymerase'from conserved or non-conserved amino acids in regions 665 -681 and 754 - 783, and possibly in regions 709 - 734, 797 - 866, and 913 - 927. The amino acid change to provide the desired function can be chosen to be identical to the corresponding amino acids of a non- .
discriminatory enzyme like T7 DNA polymerase, or other functionally equivalent amino acids which can be chosen by routine experimentation. By changing non-conserved amino acids more profound alteration of the ability to discrimination is obtained. Non-conserved amino acids $UBSTlTUIE SMHT (RULE 2%
220108~
are those which vary from one species of polymerase to another (i.e., are found in less than 500 of polymerases). The term "analogous" is used in its commonly recognized manner. Thus, an analogue of a Pol I polymerase is one having an amino acid sequence as ' described by Braithwaite and Ito, infra, which is preferably as related to the other members of the Pol I
family of polymerases described therein as is Spo2 DNA
polymerase. Such analyses can be performed using Felsenstein's PHYLIP program. Id.
Thus, in a first aspect the invention features a modified gene encoding a modified DNA polymerase. The gene is modified to produce a modified DNA polymerase that, compared to the corresponding naturally-occurring or unmodified DNA polymerase, has an increased ability to incorporate a dideoxynucleotide compared to a deoxynucleotide.
By "increased ability" is meant that the DNA
polymerase is able to better incorporate a dideoxynucleotide. That is, it discriminates to a lesser extent than a corresponding naturally-occurring DNA
polymerase against a dideoxynucleotide compared to a deoxynucleotide. Specific methods for measuring such discrimination are provided below. The term "increased"
means to provide a measurable difference in ability to incorporate such dideoxynucleotides. In preferred embodiments, this is a increase of at least 10a compared to the naturally-occurring enzyme, although it is preferred that the level of discrimination against a dideoxynucleotide is reduced by at least 10 to 100 fold and preferably by 100 - 500 fold. One example of such an enzyme is E. coli DNA polymerase I which (as noted . herein) discriminates approximately 140 - 1,100 fold against incorporation of dideoxynucleotides compared to . 35 deoxynucleotides. By the method of this invention an enzyme can be derived (by alteration of only one or two amino acids) that actually prefers ddNTPs over dNTPS -that is, the ability of the polymerase to incorporate dideoxynucleotides has been increased by an average of 1,000 fold.
SUBSTlTItTE Sa'IEET (RUU 261 WO 96/12042 A-2018C3 .) PCT/US95/12910 The phrase "corresponding naturally-occurring DNA
polymerase" is one well known in the art and refers to the polymerase found in nature, and which preferably is not altered either by in vitro or in vivo manipulations in the laboratory. Similarly the corresponding nucleic acid is that nucleic acid encoding a DNA polymerase found in nature. This is simply used as a base line to compare modified nucleic acids encoding such polymerases. Thus, a base line for the DNA polymerase of Thermus aquaticus (also termed "Taq") is the nucleic acid which naturally encodes Taq DNA polymerase present in the bacterium Thermus aquaticus. Applicant provides at least one site which can be changed in such polymerases to alter the ability of the polymerase to incorporate a dideoxynucleotide. These sites are merely examples and are not limiting in this invention since those in the art armed with the knowledge that the ability of a DNA
polymerase can be usefully altered in this property are now provided with the methodology by which to alter such enzymes either at these specific sites or at other equivalent sites.
In embodiments of the invention described herein, modification of the DNA polymerase is by substitution of one or more amino acids. It may however be found that the modification may take the form of the insertion of one or more additional amino acids or the deletion of one or more amino acids.
DNA polymerases of this invention may also be modified to remove or alter an exonuclease domain, such as the 3' - 5' exonuclease activity described by Tabor and Richardson, supra, or the 5' - 3' exonuclease activity in Taq described by Barnes (WO 92/06188). The mutations that alter the ability of DNA polymerases of this invention to discriminate against ddNTPs preferably do not affect the exonuclease activity substantially; by this it is meant that the mutations are in the polymerase domain of the enzyme, near the active site for polymerization, and are not decreasing discrimination merely by reducing the ability of the polymerase to S=NfE $HEET (RULE 2%
family of polymerases described therein as is Spo2 DNA
polymerase. Such analyses can be performed using Felsenstein's PHYLIP program. Id.
Thus, in a first aspect the invention features a modified gene encoding a modified DNA polymerase. The gene is modified to produce a modified DNA polymerase that, compared to the corresponding naturally-occurring or unmodified DNA polymerase, has an increased ability to incorporate a dideoxynucleotide compared to a deoxynucleotide.
By "increased ability" is meant that the DNA
polymerase is able to better incorporate a dideoxynucleotide. That is, it discriminates to a lesser extent than a corresponding naturally-occurring DNA
polymerase against a dideoxynucleotide compared to a deoxynucleotide. Specific methods for measuring such discrimination are provided below. The term "increased"
means to provide a measurable difference in ability to incorporate such dideoxynucleotides. In preferred embodiments, this is a increase of at least 10a compared to the naturally-occurring enzyme, although it is preferred that the level of discrimination against a dideoxynucleotide is reduced by at least 10 to 100 fold and preferably by 100 - 500 fold. One example of such an enzyme is E. coli DNA polymerase I which (as noted . herein) discriminates approximately 140 - 1,100 fold against incorporation of dideoxynucleotides compared to . 35 deoxynucleotides. By the method of this invention an enzyme can be derived (by alteration of only one or two amino acids) that actually prefers ddNTPs over dNTPS -that is, the ability of the polymerase to incorporate dideoxynucleotides has been increased by an average of 1,000 fold.
SUBSTlTItTE Sa'IEET (RUU 261 WO 96/12042 A-2018C3 .) PCT/US95/12910 The phrase "corresponding naturally-occurring DNA
polymerase" is one well known in the art and refers to the polymerase found in nature, and which preferably is not altered either by in vitro or in vivo manipulations in the laboratory. Similarly the corresponding nucleic acid is that nucleic acid encoding a DNA polymerase found in nature. This is simply used as a base line to compare modified nucleic acids encoding such polymerases. Thus, a base line for the DNA polymerase of Thermus aquaticus (also termed "Taq") is the nucleic acid which naturally encodes Taq DNA polymerase present in the bacterium Thermus aquaticus. Applicant provides at least one site which can be changed in such polymerases to alter the ability of the polymerase to incorporate a dideoxynucleotide. These sites are merely examples and are not limiting in this invention since those in the art armed with the knowledge that the ability of a DNA
polymerase can be usefully altered in this property are now provided with the methodology by which to alter such enzymes either at these specific sites or at other equivalent sites.
In embodiments of the invention described herein, modification of the DNA polymerase is by substitution of one or more amino acids. It may however be found that the modification may take the form of the insertion of one or more additional amino acids or the deletion of one or more amino acids.
DNA polymerases of this invention may also be modified to remove or alter an exonuclease domain, such as the 3' - 5' exonuclease activity described by Tabor and Richardson, supra, or the 5' - 3' exonuclease activity in Taq described by Barnes (WO 92/06188). The mutations that alter the ability of DNA polymerases of this invention to discriminate against ddNTPs preferably do not affect the exonuclease activity substantially; by this it is meant that the mutations are in the polymerase domain of the enzyme, near the active site for polymerization, and are not decreasing discrimination merely by reducing the ability of the polymerase to S=NfE $HEET (RULE 2%
remove incorporated analogs via its exonuclease activity.
Particularly suitable DNA polymerases of this invention are Pol I- type polymerases as described by Braithwaite and Ito, 21 Nuc. Acid. Res. 787, 1993, s and referred to as Family A, and polymerase alpha or polymerase II - type DNA polymerases described by Braithwaite and Ito, and referred to.as family B. The other polymerase families described by Braithwaite and Ito may also be used in this invention. In particular, applicant has found that the presence of a polar, hydroxyl containing amino acid residue at a position near the binding site for the dNTP
substrate is important for the polymerase being able to efficiently incorporate a dideoxynucleotide. Without being bound by any theory, Applicant believes that this finding is contrary to the expected result that high discrimination against a nucleotide without a hydroxyl group at the 3' position of the ribose moiety (i.e. a ddNTP) requires the simultaneous absence of a hydroxyl group on the amino acid resic3ue at this critical site.
In other words, the presence of the gap, or hole created by the absence of both hydroxyl groups leads to discrimination against the analog. Knowledge of this result provides an approach to finding the critical residue in even distantly re:Lated DNA polymerases; the addition of a residue with a polar group for a non polar one in the region where the dNTP binds is a useful candidate amino acid change for decreasing the ability of the polymerase to discriminate against ddNTPs. For example, the phenylalanine at position 272 of rat DNA
polymerase b, a DNA polymerase with little if any homology to polymerases of Farnily A or B, has been shown by X-ray diffraction studies to be in contact with the 3' position of the ddCTP residue in a ternary complex with a primer-template (Pelletier et al., 264 Science 189, 1994). Knowledge of the Y-esults described in this invention make the modification of this residue to tyrosine a logical choice in a screen of mutants of rat DNA polymerase b that incorporate dideoxynucleotides more '20i885 efficiently. Those in the art will thus be likely to be able to alter the discriminatory phenotype of any DNA
polymerase using the information provided herein.
The ability of some polymerases of this invention to incorporate dideoxynucleotides more efficiently can be specific (i.e., the effect on dideoxynucleotide analogs is much greater than on other analogs) . However, some of the polymerases are also useful to aid in incorporation of other base modified analogs (e.g., deoxyinosine triphosphate (dITP) and 2'-deoxy-7-deazaguanosine 51-triphosphate (dc'GTP) to remove compression of bands during electrophoresis, and fluorescently labeled deoxynucleotides or dideoxynucleotides for use in automated procedures) In addition, such polymerases may be able to incorporate ribonucleotides more efficiently thus allowing synthesis of RNA without need for a promoter. Specifically, the conserved motifs between single-subunit DNA-dependent RNA polymerases such as T7 RNA polymerase and DNA polymerases of Family A (Pol I -type DNA polymerase) suggest that mutations in this region (residues 758 to 767 of E. coli DNA polymerase I) are likely to change the specificity towards rNTPs. This permits the engineering of RNA polymerases that efficiently initiated synthesis from a primer, eliminating the requirement for a promoter for the synthesis of RNA. Analogously, the data provided herein suggest that modifying residues 631 to 640 of T7 RNA
polymerase will alter its specificity towards dNTPs.
This permits the engineering of a new DNA polymerase that initiates DNA synthesis de novo from a promoter sequence, and cannot use a primer.
In preferred embodiments, the modified DNA
polymerase has sufficient DNA polymerase activity (e.g., at least that used in a standard sequencing reaction, and preferably at least 100 units/mg of enzyme as defined in the art; preferably the mutation in the polymerase does not alter the prior level by more than 5-10 fold) for use in DNA sequencing (when combined with any host factor necessary for that DNA polymerase activity) ; and has Sft-fMUTE SHEET (RULE 26) sufficiently low exonuclease activity (e.g., less than 500 units/mg, see Tabor and Richardson, supra) to allow the polymerase to be used in DNA sequencing; the DNA
polymerase has one or more of the amino acids at the 5 dideoxynucleotide binding site of a T7-type DNA
polymerase (e.g., one selected from the group consisting of T7, T3, 01, 011 H, W31, gh-1, Y, A1122, and SP6).
Preferably the modified DNA polymerase is modified from a thermostable enzyme, such as the DNA polymerase encoded 10 by Thermus aquaticus, Thermus ther.mophilus, Thermus flavus, Bacillus sterothermophilus, and Vent bacteria;
and the ability of the polymerase to incorporate a dideoxynucleotide is increased at least 10-fold, 50-fold or most preferably at least 100-fold compared to the 15 corresponding naturally-occurring DNA polymerase, e.g., by a change at just one amino acid.
In a second aspect,_the invention features a method for production of a modified DNA polymerase having an increased ability to incorporate a dideoxynucleotide compared to the ability of a corresponding naturally-occurring DNA polymerase. The method includes providing a nucleic acid molecule encoding a DNA polymerase and mutagenizing or otherwise altering the nucleotide base sequence of the nucleic acid molecule to incorporate one or more base changes in the nucleotide base sequence at one or more sites which significantly (i.e., at least 10, 50 or most preferably 100-500 fold) alter the ability of the polymerase encoded by the nucleic acid to incorporate a dideoxynucleotide.
In a third aspect, the invention features a method for determining the nucleotide base sequence of a DNA
molecule. The method includes providing a DNA molecule annealed with a primer molecule able to hybridize to the DNA molecule; and incubating the annealed molecules in a vessel containing at least one deoxynucleotide triphosphate, a DNA polymerase modified from a naturally-occurring DNA polymerase to have an increased ability to incorporate a dideoxynucleoti-de compared to the naturally-occurring polymerase. (The polymerase has SU65fiTUTE SHEET (RULE 26) sufficient DNA polymerase activity and sufficiently low exonuclease activity to be useful for DNA sequencing.) Also provided is at least one DNA synthesis terminating agent which terminates DNA synthesis at a specific nucleotide base. The method further includes separating the DNA products of the incubating reaction according to size, whereby at least a part of the nucleotide base sequence of the DNA molecule can be determined.
In preferred embodiments, the DNA polymerase is a thermostable DNA polymerase and the sequencing is performed at a temperature above 50 C, 60 C, or 70 C, and the DNA polymerase is derived (i.e., has at least 50%
identity in amino acid residues) from one encoded by Thermus aquaticus, Thermus thermophilus, Thermus flavus, Bacillus sterothermophilus, Thermococcus litoralis (Vent), Pyrococcus furiosus (Pfu) or Sulfolobus sol fa tari cus .
In other preferred embodiments, the DNA polymerase has less than 1000, 250, 100, 50, 10 or even 2 units of exonuclease activity per mg of polymerase and is able to utilize primers having only 4, 6 or 10 bases; and the concentration of all four deoxynucleoside triphosphates at the start of the incubating step is sufficient to allow DNA synthesis to continue until terminated by the agent, e.g., a ddNTP.
For cycle sequencing, the polymerases of the present invention now make it possible to use significantly lower amounts of dideoxynucleotides compared to other enzymes.
That is, the method involves providing an excess amount of deoxynucleotides to all four dideoxynucleotides in a cycle sequencing reaction, and performing the cycle sequencing reaction. For other enzymes, it was necassary to add an excess of at least one of the ddNTPs to such reactions. For example, Sears et al., 13 BioTechniques 626, 1992 describe use of about a 10 fold excess of ddNTPs to dNTPs with Vent polymerase, and Carothers et al., 7 BioTechniques 494, 1989 describe use of at least 2 fold excess of ddNTPs to dNTPs for Taq polymerase. In the present invention, such excess is not needed.
$lwilUTF SKET (RULE 20 Preferably, more than 2, 5, or even 10 fold excess of a dNTP is provided to the corresponding ddNTP. In a specific example, less than 10 M ddNTP is used with a modified Taq of this invention.
In a related aspect, the invention features a kit or solution for DNA sequencing including a modified DNA
polymerase as described above and a reagent necessary for the sequencing selected from the group consisting of dITP, deaza GTP, a chain terminating agent such as a ddNTP, and a manganese-containing solution or powder.
In another aspect, the invention features a method for providing a modified DNA polymerase having an increased ability compared to the corresponding naturally-occurring DNA polymerase to incorporate a dideoxynucleotide by providing a nucleic acid sequence encoding the modified DNA polymerase, expressing the nucleic acid within a host cell, and purifying the DNA
polymerase from the host cell.
In another related aspect, the invention features a method for sequencing a strand of DNA essentially as described above with one or more (preferably 2, 3 or 4) deoxyribonucleoside triphosphates, a DNA polymerase as described above, and a first chain terminating agent.
The DNA polymerase causes the primer to be elongated to form a first series of first DNA products differing in the length of the elongated primer, each first DNA
product having a chain terminating agent at its elongated end, and the number of molecules of each first DNA
products being approximately the same for substantially all DNA products differing in length by no more than 20 bases. The method also features providing a second chain terminating agent in the hybridized mixture at a concentration different from the first chain terminating agent, wherein the DNA polymerase causes production of a second series of second DNA products differing in the length of the elongated primer, with each second DNA
product having the second chain terminating agent at its elongated end. The number of molecules of each second DNA product is approximately the same for substantially SIWTCME DtEFT (AU1.E 2) all second DNA products differing in length from each other by from 1 to 20 bases, and is distinctly different from the number of molecules of all the first DNA
products having a length differing by no more than 20 bases from that of said second DNA products.
In preferred embodiments, three or four such chain terminating agents can be used to make different products as described in Tabor and Richardson, supra; and the sequencing reaction is provided with a magnesium ion, or even a manganese or iron ion (e.g., at a concentration between 0.05 and 100 mM, preferably between 1 and 10 mM);
and the DNA products are separated according to molecular weight in less than four lanes of a gel.
In another related aspect, the invention features a method for sequencing a nucleic acid by combining an oligonucleotide primer, a nucleic acid to be sequenced, between one and four deoxyribonucleoside triphosphates, a polymerase as described above, and at least two chain terminating agents in different amounts, under conditions favoring extension of the oligonucleotide primer to form nucleic acid fragments complementary to the nucleic acid to be sequenced. The method further includes separating the nucleic acid fragments by size and determining the nucleic acid sequence. The agents are differentiated from each other by intensity of a label in the primer extension products.
While it is common to use gel electrophoresis to separate DNA products of a DNA sequencing reaction, those in the art will recognize that other methods may also be used. Thus, it is possible to detect each of the different fragments using procedures such as time of flight mass spectrometry, electron microscopy, and single molecule detection methods.
The invention also features an automated DNA
sequencing apparatus having a reactor including reagents which provide at least two series of DNA products formed from a single primer and a DNA strand. Each DNA product of a series differs in molecular weight and has a chain terminating agent at one end. The reagents include a DNA
SUBSTIME SHEET (RIH.E 261 1220 188:~
Particularly suitable DNA polymerases of this invention are Pol I- type polymerases as described by Braithwaite and Ito, 21 Nuc. Acid. Res. 787, 1993, s and referred to as Family A, and polymerase alpha or polymerase II - type DNA polymerases described by Braithwaite and Ito, and referred to.as family B. The other polymerase families described by Braithwaite and Ito may also be used in this invention. In particular, applicant has found that the presence of a polar, hydroxyl containing amino acid residue at a position near the binding site for the dNTP
substrate is important for the polymerase being able to efficiently incorporate a dideoxynucleotide. Without being bound by any theory, Applicant believes that this finding is contrary to the expected result that high discrimination against a nucleotide without a hydroxyl group at the 3' position of the ribose moiety (i.e. a ddNTP) requires the simultaneous absence of a hydroxyl group on the amino acid resic3ue at this critical site.
In other words, the presence of the gap, or hole created by the absence of both hydroxyl groups leads to discrimination against the analog. Knowledge of this result provides an approach to finding the critical residue in even distantly re:Lated DNA polymerases; the addition of a residue with a polar group for a non polar one in the region where the dNTP binds is a useful candidate amino acid change for decreasing the ability of the polymerase to discriminate against ddNTPs. For example, the phenylalanine at position 272 of rat DNA
polymerase b, a DNA polymerase with little if any homology to polymerases of Farnily A or B, has been shown by X-ray diffraction studies to be in contact with the 3' position of the ddCTP residue in a ternary complex with a primer-template (Pelletier et al., 264 Science 189, 1994). Knowledge of the Y-esults described in this invention make the modification of this residue to tyrosine a logical choice in a screen of mutants of rat DNA polymerase b that incorporate dideoxynucleotides more '20i885 efficiently. Those in the art will thus be likely to be able to alter the discriminatory phenotype of any DNA
polymerase using the information provided herein.
The ability of some polymerases of this invention to incorporate dideoxynucleotides more efficiently can be specific (i.e., the effect on dideoxynucleotide analogs is much greater than on other analogs) . However, some of the polymerases are also useful to aid in incorporation of other base modified analogs (e.g., deoxyinosine triphosphate (dITP) and 2'-deoxy-7-deazaguanosine 51-triphosphate (dc'GTP) to remove compression of bands during electrophoresis, and fluorescently labeled deoxynucleotides or dideoxynucleotides for use in automated procedures) In addition, such polymerases may be able to incorporate ribonucleotides more efficiently thus allowing synthesis of RNA without need for a promoter. Specifically, the conserved motifs between single-subunit DNA-dependent RNA polymerases such as T7 RNA polymerase and DNA polymerases of Family A (Pol I -type DNA polymerase) suggest that mutations in this region (residues 758 to 767 of E. coli DNA polymerase I) are likely to change the specificity towards rNTPs. This permits the engineering of RNA polymerases that efficiently initiated synthesis from a primer, eliminating the requirement for a promoter for the synthesis of RNA. Analogously, the data provided herein suggest that modifying residues 631 to 640 of T7 RNA
polymerase will alter its specificity towards dNTPs.
This permits the engineering of a new DNA polymerase that initiates DNA synthesis de novo from a promoter sequence, and cannot use a primer.
In preferred embodiments, the modified DNA
polymerase has sufficient DNA polymerase activity (e.g., at least that used in a standard sequencing reaction, and preferably at least 100 units/mg of enzyme as defined in the art; preferably the mutation in the polymerase does not alter the prior level by more than 5-10 fold) for use in DNA sequencing (when combined with any host factor necessary for that DNA polymerase activity) ; and has Sft-fMUTE SHEET (RULE 26) sufficiently low exonuclease activity (e.g., less than 500 units/mg, see Tabor and Richardson, supra) to allow the polymerase to be used in DNA sequencing; the DNA
polymerase has one or more of the amino acids at the 5 dideoxynucleotide binding site of a T7-type DNA
polymerase (e.g., one selected from the group consisting of T7, T3, 01, 011 H, W31, gh-1, Y, A1122, and SP6).
Preferably the modified DNA polymerase is modified from a thermostable enzyme, such as the DNA polymerase encoded 10 by Thermus aquaticus, Thermus ther.mophilus, Thermus flavus, Bacillus sterothermophilus, and Vent bacteria;
and the ability of the polymerase to incorporate a dideoxynucleotide is increased at least 10-fold, 50-fold or most preferably at least 100-fold compared to the 15 corresponding naturally-occurring DNA polymerase, e.g., by a change at just one amino acid.
In a second aspect,_the invention features a method for production of a modified DNA polymerase having an increased ability to incorporate a dideoxynucleotide compared to the ability of a corresponding naturally-occurring DNA polymerase. The method includes providing a nucleic acid molecule encoding a DNA polymerase and mutagenizing or otherwise altering the nucleotide base sequence of the nucleic acid molecule to incorporate one or more base changes in the nucleotide base sequence at one or more sites which significantly (i.e., at least 10, 50 or most preferably 100-500 fold) alter the ability of the polymerase encoded by the nucleic acid to incorporate a dideoxynucleotide.
In a third aspect, the invention features a method for determining the nucleotide base sequence of a DNA
molecule. The method includes providing a DNA molecule annealed with a primer molecule able to hybridize to the DNA molecule; and incubating the annealed molecules in a vessel containing at least one deoxynucleotide triphosphate, a DNA polymerase modified from a naturally-occurring DNA polymerase to have an increased ability to incorporate a dideoxynucleoti-de compared to the naturally-occurring polymerase. (The polymerase has SU65fiTUTE SHEET (RULE 26) sufficient DNA polymerase activity and sufficiently low exonuclease activity to be useful for DNA sequencing.) Also provided is at least one DNA synthesis terminating agent which terminates DNA synthesis at a specific nucleotide base. The method further includes separating the DNA products of the incubating reaction according to size, whereby at least a part of the nucleotide base sequence of the DNA molecule can be determined.
In preferred embodiments, the DNA polymerase is a thermostable DNA polymerase and the sequencing is performed at a temperature above 50 C, 60 C, or 70 C, and the DNA polymerase is derived (i.e., has at least 50%
identity in amino acid residues) from one encoded by Thermus aquaticus, Thermus thermophilus, Thermus flavus, Bacillus sterothermophilus, Thermococcus litoralis (Vent), Pyrococcus furiosus (Pfu) or Sulfolobus sol fa tari cus .
In other preferred embodiments, the DNA polymerase has less than 1000, 250, 100, 50, 10 or even 2 units of exonuclease activity per mg of polymerase and is able to utilize primers having only 4, 6 or 10 bases; and the concentration of all four deoxynucleoside triphosphates at the start of the incubating step is sufficient to allow DNA synthesis to continue until terminated by the agent, e.g., a ddNTP.
For cycle sequencing, the polymerases of the present invention now make it possible to use significantly lower amounts of dideoxynucleotides compared to other enzymes.
That is, the method involves providing an excess amount of deoxynucleotides to all four dideoxynucleotides in a cycle sequencing reaction, and performing the cycle sequencing reaction. For other enzymes, it was necassary to add an excess of at least one of the ddNTPs to such reactions. For example, Sears et al., 13 BioTechniques 626, 1992 describe use of about a 10 fold excess of ddNTPs to dNTPs with Vent polymerase, and Carothers et al., 7 BioTechniques 494, 1989 describe use of at least 2 fold excess of ddNTPs to dNTPs for Taq polymerase. In the present invention, such excess is not needed.
$lwilUTF SKET (RULE 20 Preferably, more than 2, 5, or even 10 fold excess of a dNTP is provided to the corresponding ddNTP. In a specific example, less than 10 M ddNTP is used with a modified Taq of this invention.
In a related aspect, the invention features a kit or solution for DNA sequencing including a modified DNA
polymerase as described above and a reagent necessary for the sequencing selected from the group consisting of dITP, deaza GTP, a chain terminating agent such as a ddNTP, and a manganese-containing solution or powder.
In another aspect, the invention features a method for providing a modified DNA polymerase having an increased ability compared to the corresponding naturally-occurring DNA polymerase to incorporate a dideoxynucleotide by providing a nucleic acid sequence encoding the modified DNA polymerase, expressing the nucleic acid within a host cell, and purifying the DNA
polymerase from the host cell.
In another related aspect, the invention features a method for sequencing a strand of DNA essentially as described above with one or more (preferably 2, 3 or 4) deoxyribonucleoside triphosphates, a DNA polymerase as described above, and a first chain terminating agent.
The DNA polymerase causes the primer to be elongated to form a first series of first DNA products differing in the length of the elongated primer, each first DNA
product having a chain terminating agent at its elongated end, and the number of molecules of each first DNA
products being approximately the same for substantially all DNA products differing in length by no more than 20 bases. The method also features providing a second chain terminating agent in the hybridized mixture at a concentration different from the first chain terminating agent, wherein the DNA polymerase causes production of a second series of second DNA products differing in the length of the elongated primer, with each second DNA
product having the second chain terminating agent at its elongated end. The number of molecules of each second DNA product is approximately the same for substantially SIWTCME DtEFT (AU1.E 2) all second DNA products differing in length from each other by from 1 to 20 bases, and is distinctly different from the number of molecules of all the first DNA
products having a length differing by no more than 20 bases from that of said second DNA products.
In preferred embodiments, three or four such chain terminating agents can be used to make different products as described in Tabor and Richardson, supra; and the sequencing reaction is provided with a magnesium ion, or even a manganese or iron ion (e.g., at a concentration between 0.05 and 100 mM, preferably between 1 and 10 mM);
and the DNA products are separated according to molecular weight in less than four lanes of a gel.
In another related aspect, the invention features a method for sequencing a nucleic acid by combining an oligonucleotide primer, a nucleic acid to be sequenced, between one and four deoxyribonucleoside triphosphates, a polymerase as described above, and at least two chain terminating agents in different amounts, under conditions favoring extension of the oligonucleotide primer to form nucleic acid fragments complementary to the nucleic acid to be sequenced. The method further includes separating the nucleic acid fragments by size and determining the nucleic acid sequence. The agents are differentiated from each other by intensity of a label in the primer extension products.
While it is common to use gel electrophoresis to separate DNA products of a DNA sequencing reaction, those in the art will recognize that other methods may also be used. Thus, it is possible to detect each of the different fragments using procedures such as time of flight mass spectrometry, electron microscopy, and single molecule detection methods.
The invention also features an automated DNA
sequencing apparatus having a reactor including reagents which provide at least two series of DNA products formed from a single primer and a DNA strand. Each DNA product of a series differs in molecular weight and has a chain terminating agent at one end. The reagents include a DNA
SUBSTIME SHEET (RIH.E 261 1220 188:~
polymerase as described above. The apparatus includes a separating means for separating the DNA product along one axis of the separator to form a series of bands. It also includes a band reading means for determining the position and intensity of each band after separation along the axis, and a computing means that determines the DNA sequence of the DNA strand solely from the position and intensity of the bands along the axis and not from the wavelength of emission of light from any label that may be present in the separating means.
In other aspects, the invention features: (a) a method for in vitro mutagenesis of a cloned DNA fragment by providing the cloned fragment and a DNA polymerase described above, contacting the cloned fragment with the polymerase under conditions for synthesizing a DNA strand from the fragment. The conditions cause formation of the DNA strand by incorporation of a plurality of individual contiguous bases able to base-pair with the fragment and incorporation of a nucleotide base unable to base pair with the fragment; (b) a method for in vitro mutagenesis of a template DNA fragment by providing a primer and template, the primer having contiguous bases able to base-pair with contiguous bases of the template, except at least one base which is unable to base-pair with the template. The method involves extending the primer with a DNA polymerase as described above; (c) a method for producing blunt-ended double-stranded DNA from a linear DNA molecule having a 5' end having a single-stranded region. The 3' end of the molecule is double-stranded and has no 3' protruding termini. The method includes incubating the DNA molecule with a DNA polymerase as described above which acts on the single-stranded region to produce a blunt-ended double-stranded DNA molecule;
(d) a method for labeling the 3' end of a DNA fragment by incubating the DNA fragment with a DNA polymerase as described above, and a labelled deoxynucleotide species under conditions in which the 3' end of the DNA fragment is extended by the polymerase and thereby labelled by addition of the labelled deoxynucleotide to the DNA
M8Sf1111Tf WEET (Rttt.E 28f 2)1385 WO 96/12042 PCT/US95/129*
fragment; (e) a method of amplification of a DNA sequence by annealing a first and second primer to opposite strands of a double-stranded DNA sequence and incubating the annealed mixture with a DNA polymerase as described 5 above. The first and second primers anneal to opposite strands of the DNA sequence with their 3' ends directed towards each other after annealing, and with the DNA
sequence to be amplified located between the two annealed primers.
10 In yet other aspects, the invention features specific DNA polymerases, such asThermus aquaticus DNA
polymerase having a tyrosine at residue 667, E. co1.i DNA
polymerase I having a tyrosine at residue 762, and any Pol I type DNA polymerases having a tyrosine residue at 15 the analogous location to E. coli DNA polymerase residue 762, e.g., at the N4 position of the amino acid sequence K N,, Na N3 N4 NS N6 N, Y G, wherein each N is independently any amino acid. Furthermore, this invention features specific polymerases of the DNA polymerase alpha family 20 having the sequence K Nl N2 N3 N4 N5 N6 Y G/Q, wherein each N is independently any amino acid, and where one of the residues N1 to N, has been mutated to produce a polymerase that has reduced discrimination against ddNTPs (preferably reduced by at least 20-fold compared to the non-mutated sequence) . The invention also features nucleic acid encoding any of these DNA polymerases.
In related aspects, the invention features DNA
polymerases except reverse transcriptase which in the presence of magnesium as the only added divalent cation have an average processivity of less than 100 and discriminate less than 100 times against incorporation of a ddNMP compared to a dNMP, or which in the presence of magnesium as the only added divalent cation have an average processivity of less than 50 and discriminate less than 50 or 5 times against incorporation of a ddNMP
compared to a dNMP. Those in the art will recognize that processivity can be measured by any standard procedure that will indicate that the average processivity of T7 DNA polymerase is at least 500, that of Klenow fragment S'USSTPTUTE SHEET (ftU.E 26) is about 4 - 40, and for reverse transcriptase it is about 150 - 200. Such measurements can be performed as described by Tabor et a1., J. B_o1. Chem. 262: 16212, 1987. The average processivity of Taq DNA polymerase under t'nese conditions 1s eypE:cted to be less than 100.
In particularly preferrec3 aspects, the invention features thermophilic DNA polymerases that discriminate, e.g., in the presence of macTnesium, against a ddNMP
compared to a dNMP by less than a factor of 100, and which preferably have an average processivity less than 100, and cycle from one primer-template to another more than once per one or even ten seconds. Such cycling can be measured by standard procedures.
The invention also features a method for cycle sequencing using a DNA polymerase as described above, and also features cellular (as opposed to viral or mitochondrial) DNA polymerases:having a tyrosine in place of the naturally occurring amino acid at a location which causes the polymerase not to discriminate against a ddNMP
compared to a dNMP by more than 50 fold.
In other aspects, substitution of the amino acid at the noted sites will result in alteration of other properties of the corresponding natural polymerase. ?n addition, polymerases of this invention may be combined with other polymerases in the methods described herein to take advantage of the supes-ior properties of each polymerase in the mixture.
Other features and advantages of the invention will be apparent from the following description of the prefer.y_ec6 embodiments thereof, and from the claims.
21a According to one aspect of the present invention, there is provided modified nucleic acid encoding a modified DNA polymerase wherein said modified nucleic acid is modified to encode a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA
polymerase residue 762 in its dNMP binding site to increase the ability of said modified DNA polymerase to incorporate a dideoxynucleotide relative to the corresponding deoxynucleotide compared to the ability of the corresponding naturally-occurring unmodified DNA polymerase by at least 20-fold.
According to another aspect of the present invention, there is provided a modified DNA polymerase encoded by the modified nucleic acid as described above.
According to still another aspect of the present invention, there is provided a method for production of a modified DNA polymerase having an increased ability to incorporate a dideoxynucleotide relative to a corresponding deoxynucleotide compared to ability of a corresponding naturally-occurring unmodified DNA polymerase comprising steps of: providing a nucleic acid molecule encoding a DNA polymerase and mutagenizing said nucleic acid molecule to incorporate one or more base changes in nucleotide base sequence at a region that encodes its dNMP binding site to encode a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 at an amino acid position corresponding to E. coli DNA polymerase residue 762 in the dNMP binding site to alter ability of said polymerase encoded by said nucleic acid to incorporate a dideoxynucleotide by at least 20-fold.
21b According to yet another aspect of the present invention, there is provided a method for determining a nucleotide base sequence of a DNA inolecule comprising the steps of: incubating a DNA molecule annealed with a primer molecule able to hybridize to said DNA molecule in a vessel containing at least one deoxynucleoside triphosphate, a DNA polymerase modified by having a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site compared to a naturally-occurring unmodified DNA polymerase to have at least a 20-fold increased ability to incorporate a dideoxynucleotide relative to a corresponding deoxynucleotide compared to said naturally-occurring unmodified polymerase, said polymerase having sufficient DNA polymerase activity and less than 1000 units exonuclease activity per mg DNA polymerases, and at least one DNA synthesis terminating agent which terminates DNA synthesis at a specific nucleotide base, in an incubating reaction; and separating the DNA products of the incubating reaction according to size whereby at least a part of the nucleotide base sequence of said DNA molecule can be determined.
According to a further aspect of the present invention, there is provided a kit for DNA sequencing comprising a modified DNA polymerase modified to include a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site of the DNA polymerase to increase the ability of said DNA polymerase to i_ncorporate a dideoxynucleotide compared to ability of a corresponding naturally-occurring unmodified DNA polymerase by at least 21c 20-fold relative to a correspondincl deoxynucleotide; and a reagent necessary for said sequencing selected from the group consisting of dITP, a chain terminating agent, deaza-GTP and a manganese-containing compound.
According to yet a further aspect of the present invention, there is provided a method for sequencing a strand of DNA comprising the steps of: providing said strand hybridized with a primer able to hybridize to said strand, to form an hybridized mixture, incubating said hybridized mixture with one or more deoxyriboriucleoside triphosphates, a modified DNA polymerase modified by having a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site to increase ability of said polymerase to incorporate a dideoxynucleotide at least 20-fold relative to a corresponding deoxynucleotide compared to ability of a corresponding naturally-occurring unmodified DNA polymerase, and a first chain terminating agent, wherein said DNA polymerase causes said primer to be elongated to form a first series of first DNA products differing in the length of the elongated primer, each said first DNA product having a said chain terminating agent at its elongated end, the number of molecules of each said first DNA products being approximately the same for substantially all DNA products differing in length by no more thari 20 bases, and providing a second chain terminating agent iri said hybridized mixture at a concentration different from said first chain terminating agent, wherein said DNA polymerase causes production of a second series of second DNA products differing in the length of the elorlgated primer, each said second DNA product having said second chain terminating agent at its elongated end, the nurnber of molecules of each 21d said second DNA products being approximately the same for substantially all second DNA products differing in length from each other by from 1 to 20 bases, and being distinctly different from the number of molecules of all said first DNA products having a length differ_ing by no more than 20 bases from that of said second DNA products.
According to still a further aspect of the present invention, there is provided a method for sequencing a nucleic acid comprising: combining an oligonucleotide primer, a nucleic acid to be sequenced, between one and four deoxyribonucleoside triphosphates, a modified DNA polymerase modified by having a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresporlding to E. coli DNA polymerase residue 762 in its dNMP binding site to increase ability of said polymerase to incorporate a dideoxynucleotide at least 20-fold relative to a corresponding deoxynucleotide compared to ability of a corresponding naturally-occurring unmodified DNA polymerase, and at least two chain terminating agents in different amounts, under conditions favoring extension of said oligonucleotide primer to form nucleic acid fragments complementary to the nucleic acid to be sequenced;
separating the nucleic acid fragmerits by size; and determining nucleic acid sequence wherein said agents are differentiated from each other by _Lntensity of a label in the primer extension products.
According to another aspect of the present invention, there is provided a method for in vitro mutagenesis of a cloned DNA fragment comprising providing said cloned fragment and a modified DNA polymerase modified by having a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an 21e amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site to increase ability of said polymerase to incorporate a d__deoxynucleotide at least 20-fold relative to a correspondinq deoxynucleotide compared to ability of a corresponding naturally-occurring unmodified DNA polymerase, contacting said cloned fragment with said polymerase under conditions for syrlthesizing a DNA strand from said fragment, wherein said conditions cause formation of said DNA strand by incorporation of individual contiguous bases, which are different or the same able to base-pair with said fragment and incorporation of a nucleotide base unable to base pair with said fragraent.
According to yet another aspect of the present invention, there is provided a method for in vitro mutagenesis of a template DNA fragraent comprising providing a primer and said template, said pr_imer having contiguous bases able to base-pair with contiguous bases of said template except at least one base which is unable to base-pair with said template, and extending said primer with a modified DNA polymerase modified by having a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymE:rase residue 762 in its dNMP binding site to increase abil__ty of said polymerase at least 20-fold relative to a corresponding deoxynucleotide to incorporate a dideoxynucleotide compared to ability of a corresponding naturally-occurring unmodified DNA polymerase.
According to yet another aspect of the present invention, there is provided a method for producing blunt-ended double-stranded DNA from a linear DNA molecule having a 5' end comprising a single-stranded region, wherein 3' end of said molecule is double stranded and has no 3' protruding termini, comprising incubating said DNA molecule with a 21f modified DNA polymerase modified by having a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site to increase ability of said polymerase at least 20-fold to incorporate a dideoxynucleotide relative to a corresponding deoxynucleotide cornpared to ability of a corresponding naturally-occurring unmodified DNA polymerase, wherein said polymerase acts on said single-stranded region to produce a blunt-ended double stranded DNA molecule.
According to yet another aspect of the present invention, there is provided a method for labeling a 3' end of a DNA fragment comprising incubating said DNA fragment with a modified DNA polymerase modified by having a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site to increase abil_Lty of said polymerase at least 20-fold to incorporate a dideoxynucleotide relative to a corresponding deoxynucleotide cornpared to ability of a corresponding naturally-occurring unmodified DNA polymerase, and a labelled deoxynucleotide species under conditions in which said 3' end of said DNA fragrnent is extended by said polymerase and thereby labelled by addition of said labelled deoxynucleotide to said DNA fragmerlt.
According to yet another aspect of the present invention, there is provided a method of amplification of a DNA sequence comprising annealing a first and second primer to opposite strands of a double-str_anded DNA sequence to form an annealed mixture and incubating the annealed mixture with a modified DNA polymerase modified by having a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position 21g corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site to increase ability at least 20-fold of said polymerase to incorporate a dideoxynucleotide relative to a corresponding deoxynucleotide compared to ability of a corresponding naturally-occurring unmodified DNA polymerase, wherein said first and second primers anneal to opposite strands of said DNA sequence with their 3' ends directed towards each other after annealing, and with the DNA sequence to be amplified located between the first and second annealed primers.
According to yet another aspect of the present invention, there is provided a Thermus aquaticus DNA
polymerase comprising a tyrosine at residue 667, wherein said Thermus aquaticus DNA polymerase has an increased ability to incorporate a dideoxynucleotide relative to the corresponding deoxynucleotide compared to the ability of the corresponding naturally-occurring unmodified Thermus aquaticus DNA polymerase by at least 20-fold.
According to yet another aspect of the present invention, there is provided an E. coli DNA polymerase I
comprising a tyrosine at residue 762, wherein said E. coli DNA polymerase I has an increased ability to incorporate a dideoxynucleotide relative to the corresponding deoxynucleotide compared to the ability of the corresponding naturally-occurring unmodified E. coli DNA polymerase I by at least 20-fold.
According to yet another aspect of the present invention, there is provided a purified DNA polymerase having a tyrosine residue at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site provided that said polymerase is not a T7-type DNA polymerase or a mitochondrial DNA polymerase.
21h According to yet another aspect of the present invention, there is provided a modified DNA polymerase comprising the sequence K Nl N2 N3 N4 N5 N6 Y G, at its dNMP binding site wherein each N is independently any amino acid, and wherein one said N is mutated to have a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site to provide a polymerase which does not discriminate against ddNMP incorporation more than 50 fold relative to the corresponding dNMP incorporation.
According to yet another aspect of the present invention, there is provided nucleic acid encoding any of the DNA polymerases as described above.
According to yet another aspect of the present invention, there is provided a recombinant DNA polymerase which in the presence of magnesium as the only added divalent cation has an average processivity of less than 100 but more than 0 and discriminates less than 100 times but more than 0 time against incorporation of a ddNMP
relative to a corresponding deoxynucleotide, wherein said DNA polymerase is not reverse transcriptase and wherein said DNA polymerase includes a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site.
According to yet another aspect of the present invention, there is provided a recombinant DNA polymerase which in the presence of magnesium as the only divalent cation has an average processivity of less than 50 but more than 0 and discriminates less than 50 times but more than 0 time against incorporation of a ddNMP relative to a 21i corresponding deoxynucleotide and wherein said DNA polymerase includes a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site.
According to yet another aspect of the present invention, there is provided a recombinant DNA polymerase which in the presence of magnesium as the only divalent cation has an average processivity of less than 50 but more than 0 and discriminates less than 5 times but more than 0 time against incorporation of a ddNMP relative to a corresponding deoxynucleotide and wherein said DNA polymerase includes a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site.
According to yet another aspect of the present invention, there is provided a recombinant Thermophilic DNA polymerase that discriminates against a ddNMP relative to a corresponding deoxynucleotide by less than a factor of 100 but more than a factor of 0 and wherein said DNA polymerase includes a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site.
According to yet another aspect of the present invention, there is provided a recombinant Thermophilic DNA polymerase that discriminates in the presence of magnesium as the only divalent cation against a ddNMP relative to a corresponding deoxynucleotide by less than a factor of 100 but more than a factor of 0 and wherein said DNA polymerase includes a tyrosine residue at an amino 21j acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site.
According to yet another aspect of the present invention, there is provided a nucleic acid encoding the polymerase as described above.
According to yet another aspect of the present invention, there is provided a method for cycle sequencing using a DNA polymerase as described above wherein said method comprises mixing said DNA polymerase with a DNA molecule to be sequenced, a primer, dNTPs, and at least one chain terminating agent to form a mixture and cycling temperature of the mixture to allow alternate extension of the primer and denaturation of the primer and DNA molecule.
According to yet another aspect of the present invention, there is provided a purified cellular DNA polymerase having a tyrosine residue in place of the naturally occurring amino acid residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site which causes said polymerase not to discriminate against a ddNMP relative to a corresponding deoxynucleotide by more than 50 fold.
According to yet another aspect of the present invention, there is provided a method for cycle sequencing, comprising the step of providing an excess or equal amount of all four dNTPs than each of the four ddNTPs in a cycle sequencing reaction and performing said cycle sequencing reaction with the modified DNA polymerase as described above.
.. i . . . . . . . . .
21k According to yet another aspect of the present invention, there is provided method for cycle sequencing comprising the step of providing less than 10 times the amount of all four fluorescently labelled dideoxynucleotide than the corresponding deoxynucleotide in a cycle sequencing reaction and performing said cycle sequencing reaction with the modified DNA polymerase as described above.
According to yet another aspect of the present invention, there is provided a modified nucleic acid encoding a modified DNA polymerase wherein said modified nucleic acid is modified to encode a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E.
coli DNA polymerase residue 762 in its dNMP binding site to increase the ability of said modified DNA polymerase to incorporate a dideoxynucleotide relative to the corresponding deoxynucleotide compared to the ability of the corresponding naturally-occurring unmodified DNA polymerase.
According to yet another aspect of the present invention, there is provided a modified DNA polymerase encoded by the modified nucleic acid as described above.
According to yet another aspect of the present invention, there is provided a modified DNA polymerase having a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site, wherein said modified DNA polymerase is modified to increase the ability of said polymerase to incorporate a dideoxynucleotide relative to the corresponding deoxynucleotide compared to the ability of the corresponding naturally-occurring DNA polymerase.
Description of the Preferred Embodiments The drawings will first briefly be described.
Drawings FIG. 1 is a diagrammatic representation of the amino acid sequence of the DNA polymerase encoded by gene 5 of bacteriophage T7 indicating the palm and finger domains, and the location of various dideoxy resistant (DR) mutants, the location of regions labelled A-E, and the location of one site involved in ddNTP discrimination;
FIG. 2 is a three dimensional representation of the structure of DNA polymerase I showing the locations of regions A-E;
FIG. 3 is a diagrammatic representation of a riboselectivity region of pol I-type DNA polymerases with amino acids indicated by the universal single letter code. The initial amino acid number is indicated on the left of the figure and the amount of discrimination against dideoxynucleotides compared to deoxynucleotides is indicated on the right;
FIGS. 4, 5, and 6 are diagrammatic representations showing modifications of the riboselectivity region of E.
coli DNA polymerase I, T7 DNA polymerase, and Taq DNA
polymerase, respectively.
Dideoxy Resistant Mutants The following is a brief discussion of publications of some relevance. None are admitted to be prior art to the pending claims but are provided to aid in understanding the present invention.
Reha-Krantz et al., Mutational Analysis of Bacteriophage T4 DNA Polymerase, from Abstracts for Poster Presentations, presented at a meeting entitled "The Fidelity of DNA Synthesis: Structural and.
Mechanistic Perspectives, Beaufort, North Carolina, September 24-29, 1989 describe C-terminal mutants with increased utilization of ddNTPs. However, Reha-Krantz et al., J. Virology 67, 60-66 (1993), indicate that while the Ki for ddGTP was 50 times lower with the mutant L412M
compared to wild-type T4 DNA polymerase, no difference in efficiency of incorporation of ddGTP was found between the mutant and wild-type T4 DNA polymerase. On page 63, it states that "Despite the sensitivity of the L412M DNA
polymerase to ddGTP, there was no difference found in the incorporation of ddNTPs by wild-type and L412M DNA
polymerases". It also states "it does not appear that any single region is the sole binding site for either PPi SUBSTlTUTE SHEET (AUlE 26f 2201 sg5-or nucleotides". In addition, Reha-Krantz and Nonay, J.
Biol. Chem. 269, 5635-5643 (1994), provide a study of the mutant L412M and other mutant T4 DNA polymerases.
Gibbs et al., Proc. Natl. Acad. Sci. USA 85, 6672-6676 (1988) and Larder et al., the EMBO Journal 6, 169-175 (1987) describe the spectrum of mutations obtained in Herpes DNA polymerase when selected for resistance to a number of nucleotide analogs: pyrophosphate, phosphonoacetic acid, and phosphonoformic acid, acyclovir, vidarabine, ganciclovir and bromovinyldeoxyuridine. It indicates that many of the mutants resistant to one drug are also resistant to other drugs, even when they are analogs to different regions of the substrate.
Derse et al., J. Biol. Chem. 257, 10251-10260 (1982), describe five classes of mutants in Herpes Simplex DNA polymerase isolated by selection for growth in the presence of phosphonoformic acid, a pyrophosphate inhibitor. For mutants in each class, they compare resistance to ddGTP (page 10256, Table III). All increase the Ki for ddGTP by 20 to 100 fold.
Prasad et al., Proc. Natl. Acad. Sci. USA 88, 11363-11367 (1991) use a direct screening strategy and show that a single mutation in HIV reverse transcriptase (a change of Glu 89 to Glycine) renders the polymerase more resistant to ddGTP (requires about 10 times more ddGTP to obtain same extent of inhibition). This mutation confers a broad resistance to a number of analogs, including phosphonoformic acid, a pyrophosphate analog. While the mutant was equally resistant to ddTTP, ddCTP and ddGTP, it was much less resistant to ddATP.
Song et al., J. Virol. 66, 7568-7571 (1992) mutate glu-89 of human immunodeficiency virus type 1 reverse transcriptase to 9 different amino acid residues, and measure the resistance of each mutant enzyme to ddGTP and phosphonoformic acid, a pyrophosphate analog. The mutations fell into two classes; replacement of Glu-89 with alanine, glycine, valine or threonine resulted in enzymes highly resistant to both ddGTP and "SfiME SHEET (RULE 26) phosphonoformic acid compared to the wild-type enzyme, while mutation to serine, glutamine, asparagine, aspartic acid, and lysine resulted in enzymes with only moderate or no resistance to ddGTP. None of the mutants made the enzyme less resistant to ddGTP than the wild-type enzyme (Table I, page 7569). The authors speculate that the 89th and 90th residues of reverse transcriptase form a portion of the dNTP-binding pocket based on their results and the crystal structure of reverse transcriptase.
Papers concerned with the properties of E. coli DNA
polymerase I mutant proteins with mutations in the .vicinity of the mutation that result in ddNTP selectivity including the following:
Carroll et al., Biochemistry 30, 804-813 (1991) study two mutants: Tyr766Ser and Tyr766Phe for misincorporation of normal deoxynucleotides. Polesky et al., J. Biol.Chem. 265, 14579-14591 (1990), characterizes mutations that have two different properties: (1) Tyrosine 766, arginine 841, and asparagine 845; which the authors suggest that these residues contact the incoming dNTP. (2) Glutamine 849, arginine 668, and aspartic acid 882, which the authors suggest are involved in catalysis.
Polesky et al., J. Biol. Chem. 267:8417 (1992), further characterizes mutations in arginine 668, glutamine 849, and aspartic acid 882, and also mutations at aspartic acid 705 and glutamic acid 710. In this study the authors look at the incorporation of alpha-thio-substituted dNTPs, i.e. analogs in the phosphate moiety.
Pandey et al., J. Biol.Chem. 269, 13259-13265 (1994) look at two mutants in E. coli DNA polymerase I that change lysine 758 to alanine and arginine. The authors indicate that Basu et al., Biochemistry 26, 1704-1709 (1987) implicate the same lysine 758 in dNTP binding. This was shown chemically; DNA polymerase I was covalently modified using pyridoxal 5'-phosphate, a nucleotide analog, and lysine 758 was said to be the residue modified.
Beese et al., Biochemistry 321: 14095-14101 (1993) describe the structure of a cocrystal of Klenow fragment SUBSTIiUTE SHEET (RULE 26) of DNA polymerase I complexed with a dNTP or with pyrophosphate. The authors state that the dNTP binds adjacent to helix O. The authors make the following statements: (a) "In the Mg-dCTP complex, cytosine 5 interacts with His 881, while the sugar interacts with Phe 764 [sic., 762] (Figures 3 and5)." (b) "However, we conclude that the position of at least the dNMP moiety of dNTP in the binary complex is not likely to be the same as in its catalytically relevant complex with primer-10 template DNA." (c) "Since the entire binding site for the base of dNTP is formed by its Watson-Crick hydrogen bonding to the template strand and its stacking on the 3' base of the primer strand, it is not unlikely that the binding site for the base in the binary complex is 15 completely adventitious, consistent with our observation that it can bind in several locations dependent on conditions." (d) "The binding site for dNTP observed in crystals of the binary complex is the same as observed in solution studies. However, extrapolating from this 20 binary complex to a model for the complex with dNTP in the presence of primer and template DNA requires considerable caution. We presume that the sugar and base moiety of the dNTP require primer-template DNA to bind in correct conformation."
In other aspects, the invention features: (a) a method for in vitro mutagenesis of a cloned DNA fragment by providing the cloned fragment and a DNA polymerase described above, contacting the cloned fragment with the polymerase under conditions for synthesizing a DNA strand from the fragment. The conditions cause formation of the DNA strand by incorporation of a plurality of individual contiguous bases able to base-pair with the fragment and incorporation of a nucleotide base unable to base pair with the fragment; (b) a method for in vitro mutagenesis of a template DNA fragment by providing a primer and template, the primer having contiguous bases able to base-pair with contiguous bases of the template, except at least one base which is unable to base-pair with the template. The method involves extending the primer with a DNA polymerase as described above; (c) a method for producing blunt-ended double-stranded DNA from a linear DNA molecule having a 5' end having a single-stranded region. The 3' end of the molecule is double-stranded and has no 3' protruding termini. The method includes incubating the DNA molecule with a DNA polymerase as described above which acts on the single-stranded region to produce a blunt-ended double-stranded DNA molecule;
(d) a method for labeling the 3' end of a DNA fragment by incubating the DNA fragment with a DNA polymerase as described above, and a labelled deoxynucleotide species under conditions in which the 3' end of the DNA fragment is extended by the polymerase and thereby labelled by addition of the labelled deoxynucleotide to the DNA
M8Sf1111Tf WEET (Rttt.E 28f 2)1385 WO 96/12042 PCT/US95/129*
fragment; (e) a method of amplification of a DNA sequence by annealing a first and second primer to opposite strands of a double-stranded DNA sequence and incubating the annealed mixture with a DNA polymerase as described 5 above. The first and second primers anneal to opposite strands of the DNA sequence with their 3' ends directed towards each other after annealing, and with the DNA
sequence to be amplified located between the two annealed primers.
10 In yet other aspects, the invention features specific DNA polymerases, such asThermus aquaticus DNA
polymerase having a tyrosine at residue 667, E. co1.i DNA
polymerase I having a tyrosine at residue 762, and any Pol I type DNA polymerases having a tyrosine residue at 15 the analogous location to E. coli DNA polymerase residue 762, e.g., at the N4 position of the amino acid sequence K N,, Na N3 N4 NS N6 N, Y G, wherein each N is independently any amino acid. Furthermore, this invention features specific polymerases of the DNA polymerase alpha family 20 having the sequence K Nl N2 N3 N4 N5 N6 Y G/Q, wherein each N is independently any amino acid, and where one of the residues N1 to N, has been mutated to produce a polymerase that has reduced discrimination against ddNTPs (preferably reduced by at least 20-fold compared to the non-mutated sequence) . The invention also features nucleic acid encoding any of these DNA polymerases.
In related aspects, the invention features DNA
polymerases except reverse transcriptase which in the presence of magnesium as the only added divalent cation have an average processivity of less than 100 and discriminate less than 100 times against incorporation of a ddNMP compared to a dNMP, or which in the presence of magnesium as the only added divalent cation have an average processivity of less than 50 and discriminate less than 50 or 5 times against incorporation of a ddNMP
compared to a dNMP. Those in the art will recognize that processivity can be measured by any standard procedure that will indicate that the average processivity of T7 DNA polymerase is at least 500, that of Klenow fragment S'USSTPTUTE SHEET (ftU.E 26) is about 4 - 40, and for reverse transcriptase it is about 150 - 200. Such measurements can be performed as described by Tabor et a1., J. B_o1. Chem. 262: 16212, 1987. The average processivity of Taq DNA polymerase under t'nese conditions 1s eypE:cted to be less than 100.
In particularly preferrec3 aspects, the invention features thermophilic DNA polymerases that discriminate, e.g., in the presence of macTnesium, against a ddNMP
compared to a dNMP by less than a factor of 100, and which preferably have an average processivity less than 100, and cycle from one primer-template to another more than once per one or even ten seconds. Such cycling can be measured by standard procedures.
The invention also features a method for cycle sequencing using a DNA polymerase as described above, and also features cellular (as opposed to viral or mitochondrial) DNA polymerases:having a tyrosine in place of the naturally occurring amino acid at a location which causes the polymerase not to discriminate against a ddNMP
compared to a dNMP by more than 50 fold.
In other aspects, substitution of the amino acid at the noted sites will result in alteration of other properties of the corresponding natural polymerase. ?n addition, polymerases of this invention may be combined with other polymerases in the methods described herein to take advantage of the supes-ior properties of each polymerase in the mixture.
Other features and advantages of the invention will be apparent from the following description of the prefer.y_ec6 embodiments thereof, and from the claims.
21a According to one aspect of the present invention, there is provided modified nucleic acid encoding a modified DNA polymerase wherein said modified nucleic acid is modified to encode a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA
polymerase residue 762 in its dNMP binding site to increase the ability of said modified DNA polymerase to incorporate a dideoxynucleotide relative to the corresponding deoxynucleotide compared to the ability of the corresponding naturally-occurring unmodified DNA polymerase by at least 20-fold.
According to another aspect of the present invention, there is provided a modified DNA polymerase encoded by the modified nucleic acid as described above.
According to still another aspect of the present invention, there is provided a method for production of a modified DNA polymerase having an increased ability to incorporate a dideoxynucleotide relative to a corresponding deoxynucleotide compared to ability of a corresponding naturally-occurring unmodified DNA polymerase comprising steps of: providing a nucleic acid molecule encoding a DNA polymerase and mutagenizing said nucleic acid molecule to incorporate one or more base changes in nucleotide base sequence at a region that encodes its dNMP binding site to encode a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 at an amino acid position corresponding to E. coli DNA polymerase residue 762 in the dNMP binding site to alter ability of said polymerase encoded by said nucleic acid to incorporate a dideoxynucleotide by at least 20-fold.
21b According to yet another aspect of the present invention, there is provided a method for determining a nucleotide base sequence of a DNA inolecule comprising the steps of: incubating a DNA molecule annealed with a primer molecule able to hybridize to said DNA molecule in a vessel containing at least one deoxynucleoside triphosphate, a DNA polymerase modified by having a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site compared to a naturally-occurring unmodified DNA polymerase to have at least a 20-fold increased ability to incorporate a dideoxynucleotide relative to a corresponding deoxynucleotide compared to said naturally-occurring unmodified polymerase, said polymerase having sufficient DNA polymerase activity and less than 1000 units exonuclease activity per mg DNA polymerases, and at least one DNA synthesis terminating agent which terminates DNA synthesis at a specific nucleotide base, in an incubating reaction; and separating the DNA products of the incubating reaction according to size whereby at least a part of the nucleotide base sequence of said DNA molecule can be determined.
According to a further aspect of the present invention, there is provided a kit for DNA sequencing comprising a modified DNA polymerase modified to include a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site of the DNA polymerase to increase the ability of said DNA polymerase to i_ncorporate a dideoxynucleotide compared to ability of a corresponding naturally-occurring unmodified DNA polymerase by at least 21c 20-fold relative to a correspondincl deoxynucleotide; and a reagent necessary for said sequencing selected from the group consisting of dITP, a chain terminating agent, deaza-GTP and a manganese-containing compound.
According to yet a further aspect of the present invention, there is provided a method for sequencing a strand of DNA comprising the steps of: providing said strand hybridized with a primer able to hybridize to said strand, to form an hybridized mixture, incubating said hybridized mixture with one or more deoxyriboriucleoside triphosphates, a modified DNA polymerase modified by having a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site to increase ability of said polymerase to incorporate a dideoxynucleotide at least 20-fold relative to a corresponding deoxynucleotide compared to ability of a corresponding naturally-occurring unmodified DNA polymerase, and a first chain terminating agent, wherein said DNA polymerase causes said primer to be elongated to form a first series of first DNA products differing in the length of the elongated primer, each said first DNA product having a said chain terminating agent at its elongated end, the number of molecules of each said first DNA products being approximately the same for substantially all DNA products differing in length by no more thari 20 bases, and providing a second chain terminating agent iri said hybridized mixture at a concentration different from said first chain terminating agent, wherein said DNA polymerase causes production of a second series of second DNA products differing in the length of the elorlgated primer, each said second DNA product having said second chain terminating agent at its elongated end, the nurnber of molecules of each 21d said second DNA products being approximately the same for substantially all second DNA products differing in length from each other by from 1 to 20 bases, and being distinctly different from the number of molecules of all said first DNA products having a length differ_ing by no more than 20 bases from that of said second DNA products.
According to still a further aspect of the present invention, there is provided a method for sequencing a nucleic acid comprising: combining an oligonucleotide primer, a nucleic acid to be sequenced, between one and four deoxyribonucleoside triphosphates, a modified DNA polymerase modified by having a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresporlding to E. coli DNA polymerase residue 762 in its dNMP binding site to increase ability of said polymerase to incorporate a dideoxynucleotide at least 20-fold relative to a corresponding deoxynucleotide compared to ability of a corresponding naturally-occurring unmodified DNA polymerase, and at least two chain terminating agents in different amounts, under conditions favoring extension of said oligonucleotide primer to form nucleic acid fragments complementary to the nucleic acid to be sequenced;
separating the nucleic acid fragmerits by size; and determining nucleic acid sequence wherein said agents are differentiated from each other by _Lntensity of a label in the primer extension products.
According to another aspect of the present invention, there is provided a method for in vitro mutagenesis of a cloned DNA fragment comprising providing said cloned fragment and a modified DNA polymerase modified by having a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an 21e amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site to increase ability of said polymerase to incorporate a d__deoxynucleotide at least 20-fold relative to a correspondinq deoxynucleotide compared to ability of a corresponding naturally-occurring unmodified DNA polymerase, contacting said cloned fragment with said polymerase under conditions for syrlthesizing a DNA strand from said fragment, wherein said conditions cause formation of said DNA strand by incorporation of individual contiguous bases, which are different or the same able to base-pair with said fragment and incorporation of a nucleotide base unable to base pair with said fragraent.
According to yet another aspect of the present invention, there is provided a method for in vitro mutagenesis of a template DNA fragraent comprising providing a primer and said template, said pr_imer having contiguous bases able to base-pair with contiguous bases of said template except at least one base which is unable to base-pair with said template, and extending said primer with a modified DNA polymerase modified by having a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymE:rase residue 762 in its dNMP binding site to increase abil__ty of said polymerase at least 20-fold relative to a corresponding deoxynucleotide to incorporate a dideoxynucleotide compared to ability of a corresponding naturally-occurring unmodified DNA polymerase.
According to yet another aspect of the present invention, there is provided a method for producing blunt-ended double-stranded DNA from a linear DNA molecule having a 5' end comprising a single-stranded region, wherein 3' end of said molecule is double stranded and has no 3' protruding termini, comprising incubating said DNA molecule with a 21f modified DNA polymerase modified by having a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site to increase ability of said polymerase at least 20-fold to incorporate a dideoxynucleotide relative to a corresponding deoxynucleotide cornpared to ability of a corresponding naturally-occurring unmodified DNA polymerase, wherein said polymerase acts on said single-stranded region to produce a blunt-ended double stranded DNA molecule.
According to yet another aspect of the present invention, there is provided a method for labeling a 3' end of a DNA fragment comprising incubating said DNA fragment with a modified DNA polymerase modified by having a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site to increase abil_Lty of said polymerase at least 20-fold to incorporate a dideoxynucleotide relative to a corresponding deoxynucleotide cornpared to ability of a corresponding naturally-occurring unmodified DNA polymerase, and a labelled deoxynucleotide species under conditions in which said 3' end of said DNA fragrnent is extended by said polymerase and thereby labelled by addition of said labelled deoxynucleotide to said DNA fragmerlt.
According to yet another aspect of the present invention, there is provided a method of amplification of a DNA sequence comprising annealing a first and second primer to opposite strands of a double-str_anded DNA sequence to form an annealed mixture and incubating the annealed mixture with a modified DNA polymerase modified by having a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position 21g corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site to increase ability at least 20-fold of said polymerase to incorporate a dideoxynucleotide relative to a corresponding deoxynucleotide compared to ability of a corresponding naturally-occurring unmodified DNA polymerase, wherein said first and second primers anneal to opposite strands of said DNA sequence with their 3' ends directed towards each other after annealing, and with the DNA sequence to be amplified located between the first and second annealed primers.
According to yet another aspect of the present invention, there is provided a Thermus aquaticus DNA
polymerase comprising a tyrosine at residue 667, wherein said Thermus aquaticus DNA polymerase has an increased ability to incorporate a dideoxynucleotide relative to the corresponding deoxynucleotide compared to the ability of the corresponding naturally-occurring unmodified Thermus aquaticus DNA polymerase by at least 20-fold.
According to yet another aspect of the present invention, there is provided an E. coli DNA polymerase I
comprising a tyrosine at residue 762, wherein said E. coli DNA polymerase I has an increased ability to incorporate a dideoxynucleotide relative to the corresponding deoxynucleotide compared to the ability of the corresponding naturally-occurring unmodified E. coli DNA polymerase I by at least 20-fold.
According to yet another aspect of the present invention, there is provided a purified DNA polymerase having a tyrosine residue at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site provided that said polymerase is not a T7-type DNA polymerase or a mitochondrial DNA polymerase.
21h According to yet another aspect of the present invention, there is provided a modified DNA polymerase comprising the sequence K Nl N2 N3 N4 N5 N6 Y G, at its dNMP binding site wherein each N is independently any amino acid, and wherein one said N is mutated to have a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site to provide a polymerase which does not discriminate against ddNMP incorporation more than 50 fold relative to the corresponding dNMP incorporation.
According to yet another aspect of the present invention, there is provided nucleic acid encoding any of the DNA polymerases as described above.
According to yet another aspect of the present invention, there is provided a recombinant DNA polymerase which in the presence of magnesium as the only added divalent cation has an average processivity of less than 100 but more than 0 and discriminates less than 100 times but more than 0 time against incorporation of a ddNMP
relative to a corresponding deoxynucleotide, wherein said DNA polymerase is not reverse transcriptase and wherein said DNA polymerase includes a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site.
According to yet another aspect of the present invention, there is provided a recombinant DNA polymerase which in the presence of magnesium as the only divalent cation has an average processivity of less than 50 but more than 0 and discriminates less than 50 times but more than 0 time against incorporation of a ddNMP relative to a 21i corresponding deoxynucleotide and wherein said DNA polymerase includes a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site.
According to yet another aspect of the present invention, there is provided a recombinant DNA polymerase which in the presence of magnesium as the only divalent cation has an average processivity of less than 50 but more than 0 and discriminates less than 5 times but more than 0 time against incorporation of a ddNMP relative to a corresponding deoxynucleotide and wherein said DNA polymerase includes a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site.
According to yet another aspect of the present invention, there is provided a recombinant Thermophilic DNA polymerase that discriminates against a ddNMP relative to a corresponding deoxynucleotide by less than a factor of 100 but more than a factor of 0 and wherein said DNA polymerase includes a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site.
According to yet another aspect of the present invention, there is provided a recombinant Thermophilic DNA polymerase that discriminates in the presence of magnesium as the only divalent cation against a ddNMP relative to a corresponding deoxynucleotide by less than a factor of 100 but more than a factor of 0 and wherein said DNA polymerase includes a tyrosine residue at an amino 21j acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site.
According to yet another aspect of the present invention, there is provided a nucleic acid encoding the polymerase as described above.
According to yet another aspect of the present invention, there is provided a method for cycle sequencing using a DNA polymerase as described above wherein said method comprises mixing said DNA polymerase with a DNA molecule to be sequenced, a primer, dNTPs, and at least one chain terminating agent to form a mixture and cycling temperature of the mixture to allow alternate extension of the primer and denaturation of the primer and DNA molecule.
According to yet another aspect of the present invention, there is provided a purified cellular DNA polymerase having a tyrosine residue in place of the naturally occurring amino acid residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site which causes said polymerase not to discriminate against a ddNMP relative to a corresponding deoxynucleotide by more than 50 fold.
According to yet another aspect of the present invention, there is provided a method for cycle sequencing, comprising the step of providing an excess or equal amount of all four dNTPs than each of the four ddNTPs in a cycle sequencing reaction and performing said cycle sequencing reaction with the modified DNA polymerase as described above.
.. i . . . . . . . . .
21k According to yet another aspect of the present invention, there is provided method for cycle sequencing comprising the step of providing less than 10 times the amount of all four fluorescently labelled dideoxynucleotide than the corresponding deoxynucleotide in a cycle sequencing reaction and performing said cycle sequencing reaction with the modified DNA polymerase as described above.
According to yet another aspect of the present invention, there is provided a modified nucleic acid encoding a modified DNA polymerase wherein said modified nucleic acid is modified to encode a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E.
coli DNA polymerase residue 762 in its dNMP binding site to increase the ability of said modified DNA polymerase to incorporate a dideoxynucleotide relative to the corresponding deoxynucleotide compared to the ability of the corresponding naturally-occurring unmodified DNA polymerase.
According to yet another aspect of the present invention, there is provided a modified DNA polymerase encoded by the modified nucleic acid as described above.
According to yet another aspect of the present invention, there is provided a modified DNA polymerase having a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site, wherein said modified DNA polymerase is modified to increase the ability of said polymerase to incorporate a dideoxynucleotide relative to the corresponding deoxynucleotide compared to the ability of the corresponding naturally-occurring DNA polymerase.
Description of the Preferred Embodiments The drawings will first briefly be described.
Drawings FIG. 1 is a diagrammatic representation of the amino acid sequence of the DNA polymerase encoded by gene 5 of bacteriophage T7 indicating the palm and finger domains, and the location of various dideoxy resistant (DR) mutants, the location of regions labelled A-E, and the location of one site involved in ddNTP discrimination;
FIG. 2 is a three dimensional representation of the structure of DNA polymerase I showing the locations of regions A-E;
FIG. 3 is a diagrammatic representation of a riboselectivity region of pol I-type DNA polymerases with amino acids indicated by the universal single letter code. The initial amino acid number is indicated on the left of the figure and the amount of discrimination against dideoxynucleotides compared to deoxynucleotides is indicated on the right;
FIGS. 4, 5, and 6 are diagrammatic representations showing modifications of the riboselectivity region of E.
coli DNA polymerase I, T7 DNA polymerase, and Taq DNA
polymerase, respectively.
Dideoxy Resistant Mutants The following is a brief discussion of publications of some relevance. None are admitted to be prior art to the pending claims but are provided to aid in understanding the present invention.
Reha-Krantz et al., Mutational Analysis of Bacteriophage T4 DNA Polymerase, from Abstracts for Poster Presentations, presented at a meeting entitled "The Fidelity of DNA Synthesis: Structural and.
Mechanistic Perspectives, Beaufort, North Carolina, September 24-29, 1989 describe C-terminal mutants with increased utilization of ddNTPs. However, Reha-Krantz et al., J. Virology 67, 60-66 (1993), indicate that while the Ki for ddGTP was 50 times lower with the mutant L412M
compared to wild-type T4 DNA polymerase, no difference in efficiency of incorporation of ddGTP was found between the mutant and wild-type T4 DNA polymerase. On page 63, it states that "Despite the sensitivity of the L412M DNA
polymerase to ddGTP, there was no difference found in the incorporation of ddNTPs by wild-type and L412M DNA
polymerases". It also states "it does not appear that any single region is the sole binding site for either PPi SUBSTlTUTE SHEET (AUlE 26f 2201 sg5-or nucleotides". In addition, Reha-Krantz and Nonay, J.
Biol. Chem. 269, 5635-5643 (1994), provide a study of the mutant L412M and other mutant T4 DNA polymerases.
Gibbs et al., Proc. Natl. Acad. Sci. USA 85, 6672-6676 (1988) and Larder et al., the EMBO Journal 6, 169-175 (1987) describe the spectrum of mutations obtained in Herpes DNA polymerase when selected for resistance to a number of nucleotide analogs: pyrophosphate, phosphonoacetic acid, and phosphonoformic acid, acyclovir, vidarabine, ganciclovir and bromovinyldeoxyuridine. It indicates that many of the mutants resistant to one drug are also resistant to other drugs, even when they are analogs to different regions of the substrate.
Derse et al., J. Biol. Chem. 257, 10251-10260 (1982), describe five classes of mutants in Herpes Simplex DNA polymerase isolated by selection for growth in the presence of phosphonoformic acid, a pyrophosphate inhibitor. For mutants in each class, they compare resistance to ddGTP (page 10256, Table III). All increase the Ki for ddGTP by 20 to 100 fold.
Prasad et al., Proc. Natl. Acad. Sci. USA 88, 11363-11367 (1991) use a direct screening strategy and show that a single mutation in HIV reverse transcriptase (a change of Glu 89 to Glycine) renders the polymerase more resistant to ddGTP (requires about 10 times more ddGTP to obtain same extent of inhibition). This mutation confers a broad resistance to a number of analogs, including phosphonoformic acid, a pyrophosphate analog. While the mutant was equally resistant to ddTTP, ddCTP and ddGTP, it was much less resistant to ddATP.
Song et al., J. Virol. 66, 7568-7571 (1992) mutate glu-89 of human immunodeficiency virus type 1 reverse transcriptase to 9 different amino acid residues, and measure the resistance of each mutant enzyme to ddGTP and phosphonoformic acid, a pyrophosphate analog. The mutations fell into two classes; replacement of Glu-89 with alanine, glycine, valine or threonine resulted in enzymes highly resistant to both ddGTP and "SfiME SHEET (RULE 26) phosphonoformic acid compared to the wild-type enzyme, while mutation to serine, glutamine, asparagine, aspartic acid, and lysine resulted in enzymes with only moderate or no resistance to ddGTP. None of the mutants made the enzyme less resistant to ddGTP than the wild-type enzyme (Table I, page 7569). The authors speculate that the 89th and 90th residues of reverse transcriptase form a portion of the dNTP-binding pocket based on their results and the crystal structure of reverse transcriptase.
Papers concerned with the properties of E. coli DNA
polymerase I mutant proteins with mutations in the .vicinity of the mutation that result in ddNTP selectivity including the following:
Carroll et al., Biochemistry 30, 804-813 (1991) study two mutants: Tyr766Ser and Tyr766Phe for misincorporation of normal deoxynucleotides. Polesky et al., J. Biol.Chem. 265, 14579-14591 (1990), characterizes mutations that have two different properties: (1) Tyrosine 766, arginine 841, and asparagine 845; which the authors suggest that these residues contact the incoming dNTP. (2) Glutamine 849, arginine 668, and aspartic acid 882, which the authors suggest are involved in catalysis.
Polesky et al., J. Biol. Chem. 267:8417 (1992), further characterizes mutations in arginine 668, glutamine 849, and aspartic acid 882, and also mutations at aspartic acid 705 and glutamic acid 710. In this study the authors look at the incorporation of alpha-thio-substituted dNTPs, i.e. analogs in the phosphate moiety.
Pandey et al., J. Biol.Chem. 269, 13259-13265 (1994) look at two mutants in E. coli DNA polymerase I that change lysine 758 to alanine and arginine. The authors indicate that Basu et al., Biochemistry 26, 1704-1709 (1987) implicate the same lysine 758 in dNTP binding. This was shown chemically; DNA polymerase I was covalently modified using pyridoxal 5'-phosphate, a nucleotide analog, and lysine 758 was said to be the residue modified.
Beese et al., Biochemistry 321: 14095-14101 (1993) describe the structure of a cocrystal of Klenow fragment SUBSTIiUTE SHEET (RULE 26) of DNA polymerase I complexed with a dNTP or with pyrophosphate. The authors state that the dNTP binds adjacent to helix O. The authors make the following statements: (a) "In the Mg-dCTP complex, cytosine 5 interacts with His 881, while the sugar interacts with Phe 764 [sic., 762] (Figures 3 and5)." (b) "However, we conclude that the position of at least the dNMP moiety of dNTP in the binary complex is not likely to be the same as in its catalytically relevant complex with primer-10 template DNA." (c) "Since the entire binding site for the base of dNTP is formed by its Watson-Crick hydrogen bonding to the template strand and its stacking on the 3' base of the primer strand, it is not unlikely that the binding site for the base in the binary complex is 15 completely adventitious, consistent with our observation that it can bind in several locations dependent on conditions." (d) "The binding site for dNTP observed in crystals of the binary complex is the same as observed in solution studies. However, extrapolating from this 20 binary complex to a model for the complex with dNTP in the presence of primer and template DNA requires considerable caution. We presume that the sugar and base moiety of the dNTP require primer-template DNA to bind in correct conformation."
25 Joyce and Steitz, 63 Ann. Rev. Bioc. 777, 1994 (not admitted to be prior art to the present invention).
discuss the relationship of various DNA and RNA
polymerases. It indicates three functions for the "palm"
(rather than the "finger") subdomain of DNA polymerase I
- namely, the catalytic center, the binding site for the 3' terminus of the primer, and the dNTP binding site. In HIV-1 reverse transcriptase it indicates that mutations that influence binding of DNA polymerase inhibitors are around residues 67 - 70. It also states that "[a]lthough no useful conclusions could be drawn from the positions of the nucleotide base of sugar, it is possible that the crystalline binary complex may be informative in identifying contacts between Klenow fragment and the dNTP
phosphate groups." In the preceding paragraph, it states SjBSTITUTE SHEET (RULE 261 22,018,8~
discuss the relationship of various DNA and RNA
polymerases. It indicates three functions for the "palm"
(rather than the "finger") subdomain of DNA polymerase I
- namely, the catalytic center, the binding site for the 3' terminus of the primer, and the dNTP binding site. In HIV-1 reverse transcriptase it indicates that mutations that influence binding of DNA polymerase inhibitors are around residues 67 - 70. It also states that "[a]lthough no useful conclusions could be drawn from the positions of the nucleotide base of sugar, it is possible that the crystalline binary complex may be informative in identifying contacts between Klenow fragment and the dNTP
phosphate groups." In the preceding paragraph, it states SjBSTITUTE SHEET (RULE 261 22,018,8~
"although a polymerase-dNTP binary complex can be formed, such a complex is not catalytically competent." It further indicates that data "would place the deoxyribose close to Phe 762" and that "Mutation of Tyr 766 [in Klenow fragment helix O], which is located in the fingers domain in the vicinity of the model-built template strand, affects the discrimination between deoxy and dideoxy nucleotide substrates ..." However, it also states "In Klenow fragment, mutations that have been found to affect the binding of dNTP in the ternary complex (as reflected in Km(&;TP)) are located on one side .of the polymerase cleft within or close to the fingers subdomain. Positions identified thus far encompass the N terminus of helix Q (Arg 841 and Asn 845), the exposed face of helix 0 (Tyr 766, Phe 762, and Arg 754), and neighboring residues closer to the catalytic center (Asp 705 and Glu 710) . . . An advantage of the kinetic approach is that it probes the ternary complex; however, as discussed above, it is impossible, in the absence of other structural evidence, to distinguish direct effects from those mediated via template interactions. Moreover, the side chains listed above encompass an area much larger than the dNTP molecule and therefore cannot all be in direct contact with it. Since the region of Klenow fragment implicated by these studies is thought to make extensive contact with the template strand, a reasonable interpretation is that a subset of the residues mentioned above are in direct contact with the dNTP, while the remainder bind the template DNA."
Pelletier et al., 264 Science 1891, and Sawaya et al., 264 Science 1930, 1994 (not admitted to be prior art to the pending claims) in contrast, indicate that residues 271 - 274 in helices M-N of Polf3 (which are analogous to helices J-K of Klenow) "perform a common function, nucleotide discrimination."
Sousa et al. 364 Nature 593, 1993 (not admitted to be prior art to the pending claims) describe the three dimensional structure of T7 RNA polymerase and its homology to E. coli DNA polymerase I. They state that SUBSTITUTE SHEET (FtIILE Z%
Pelletier et al., 264 Science 1891, and Sawaya et al., 264 Science 1930, 1994 (not admitted to be prior art to the pending claims) in contrast, indicate that residues 271 - 274 in helices M-N of Polf3 (which are analogous to helices J-K of Klenow) "perform a common function, nucleotide discrimination."
Sousa et al. 364 Nature 593, 1993 (not admitted to be prior art to the pending claims) describe the three dimensional structure of T7 RNA polymerase and its homology to E. coli DNA polymerase I. They state that SUBSTITUTE SHEET (FtIILE Z%
their observations suggest the "C-terminal elements of KF
[Klenow fragment] (b-strand 14 [residues 916 to 928] and the C terminal) contact the deoxyribose moiety of the dNTP during polymerization to discriminate between rNTP
and dNTP substrates."
Dideoxynucleosides, such as dideoxythymidine, are potent inhibitors of T7 phage growth. Experiments indicate that the inhibition of DNA synthesis is a consequence of the incorporation of the dideoxynucleotide into T7 DNA. Dideoxynucleosides are not inhibitory to uninfected E. coli. We do not know the explanation for the lack of inhibition of E. coli DNA synthesis but it could be explained by cellular uptake, a high level of discrimination against their incorporation by E. coli DNA
polymerase III, inefficient phosphorylation to a triphosphate, or efficient removal. In any case we find that T7 mutant phage arise that can yield normal plaques on agar plates containing dideoxynucleosides with a frequency of approximately 10-3. The location of many of these mutations are shown in Fig. 1. They reside within the gene 5 protein. The mutant gene 5 proteins are more discriminatory (by a few fold) against ddNTPs than the native gene 5 protein. Some members of this class of mutants may delineate the region of the polymerase that is important in recognition of the ribose moiety of the dNTP.
It is important to note that the mutations obtained by this selection using dideoxynucleosides are based on an alteration of the region of the gene 5 protein that recognizes the ribose moiety. However, it is possible that such mutations will also have dramatic effects on other nucleotide analogs. In addition, it is possible to use the same procedure to select for other T7 mutants that discriminate strongly against other nucleotide analogs on the basis of the growth of phage in the presence of the other analogs.
Referring to Table I, various DR mutants are indicated with the amino acid substitution noted in the table. The amino acid substitution is further SUBSTITUTE SHEET (RULE 26) characterized on the right hand side of the table. The location of these mutants is shown in FIG. 1 throughout the palm and finger regions of T7 DNA polymerase. The thumb region is a flexible region that interacts with minor groove of product duplex DNA; the palm region is the catalytic center, the binding site for the 3' end of the primer and contributes to dNTP binding; the finger regions interact with ss template close to the site of synthesis and contribute to dNTP binding. These mutants are highly scattered throughout the polymerase and all have a relatively minor effect on incorporation of dideoxynucleotides, decreasing the ability to incorporate a dideoxynucleotide by only 5-20 fold. In addition, some of these mutants are located in regions of comparative non-homology to DNA polymerase I. Thus, they do not provide an indication of the location of sites in other Pol I enzymes involved in ddNTP discrimination.
Table 1 Summary of T7 DNA Polymerase Dideoxy-Resistant Mutants Mutanr, No. isolates Modification DR1 1 Ala 425 Thr Hydrophobic Polar DR2 1 Phe 434 Ser Hydrophobic Polar Gly 442 Ser Hydrophobic Polar DR3 1 Val 443 Ile Hydrophobic Hydrophobic DR4 2 Arg 444 His Strong basic Weak basic DRS 1 Arg 444 Cys Strong basic Neutral, polar DR6 8 Ser 477 Phe Polar Hydrophobic DR7 4 Asp 504 Asn Basic Neutral DR8 2 Ala 513 Thr Hydrophobic Polar DR9 2 Thr 517 Ile Polar Hydrophobic DR10 1 Ala 532 Ser Hydrophobic Polar DR11 1 Arg 566 Cys Strong basic Neutral, polar DR12 1 Ala 619 Thr Hydrophobic Polar DR13 3 Ala 700 Thr Hydrophobic Polar SUBSTITUTE SHEET (RULE 26) e2o Dumarv 7 Hydrophobic Polar 3 Strong basic Neutral, polar or weak basic 2 Polar Hydrophobic 1 Hydrophobic Hydrophobic In vitro MLtagenesis In vitro mutagenesis of the cloned gene 5 of T7 was used to construct gene 5 proteins in which different regions of E. coli DNA polymerase I were substituted for the analogous or homologous regions in T7 gene 5 protein.
As discussed, we were particularly interested in determining the ability of these enzymes to incorporate nucleotide analogs and the extent to which they discriminate against these analogs. Referring to FIG. 2, the regions within which hybrids between T7 DNA
polymerase and E. coli DNA polymerase I were made are shown as marked as regions A-E.
Referring to FIG. 3, Applicant has determined that region C provides a riboselectivity region having a significantly greater effect than other regions in the polymerase. Some of these other regions are specifically indicated in FIG. 3.
Referring to FIGS. 4-6 (and Table 2) , it was determined that substitution of amino acids in this region allowed conversion of the riboselectivity of a polymerase from a E. coli DNA pol I-type to a T7 DNA
polymerase type and vice versa. Thus, by targeted mutagenesis of this region of polI-type polymerases the riboselectivity of a polymerase can be significantly altered. The level of effect is at least 50-100 fold, and generally more than 500-fold.
DNA Polymerases DNA polymerases useful in this invention include those belonging to the class of homologous polymerases Stt85TiTUTE SHEET (RULE 2M
termed "Pol I-type DNA polymerases" including T7-type DNA
polymerases, the large fragment of E. coli DNA polymerase I, and Taq polymerase.
DNA polymerase useful in this invention include those belonging to a class of homologous polymerases including T7 type DNA polymerases (such as T7, T3, 01, 011, H W31, gh-1, Y, A1122, or SP6). By homologous polymerases is meant an enzyme such as those described by Delarue et al., Protein Engineering 3, 461-467 (1990), that presents an alignment of the Pol I family of DNA
polymerases. It also presents the alignment of Table 2.
Effect of domain exchanges between E. coli DNA polymerase I, T7 DNA polymerase, and Taq DNA polymerase within helix 0 on discrimination against ddNTPs. The sequence of the three polymerases are shown at the top, with the number of the first residue indicated. Below the consensus sequence for these three polymerases the mutants characterized in T7 DNA polymerase (T7), E. coli DNA
polymerase I (Pol) and Taq DNA polymerase (Taq) are shown, with the mutated residues underlined. Each mutant was tested for its relative rate of incorporation of ddNMP to dNMP by SDS activity gel analysis as described in Example 2 and indicated on the right. Mutants T7 C-T8, Pol I C-K6, and Taq C-Q5 were purified along with the wild-type proteins for further analysis.
ddNTP
Enzyme Sequence Discrimination Pol I 754 R R S A K A I N F G L I Y G High $U6$fINTE SHEET (RULE 26) 220)88~) Taq 658 R R A A K T I N F G V L Y G High T7 517 R D N A K T F I Y G F L Y G Low Consensus R A K G Y G
T7 WT R D N A K T F I Y G F L Y G Low T7 C-T2 R$_E A K A I N F G,L.__Z Y G High T7 C-T3 R$_$ A K T F I Y G F L Y G Low T7 C-T4 R D N A K A I N F G F L Y G High T7 C-T5 R D N A K I Y G F L Y G Low T7 C-T6 R D N A K T F N F G F L Y G High T7 C-T7 R D N A K T FN Y G F L Y G Low T7 C-T8 R D N A K T F IF G F L Y G High Pol I WT R R S A K A I N F G L I Y G High Pol I C-Kl R D A K T F I Y GF _L Y G Low Pol I C-K2 R R S A K T F I Y G L I Y G Low Pol I C-K3 R R S A K I N F G L I Y G High Pol I C-K4 R R S A K A I G L I Y G Low Pol I C-K5 R R S A K A I,l. F G L I Y G High Pol I C-K6 R R S A K A I N y G L I Y G Low Taq WT R R A A K T I N F G V L Y G High Taq C-Q1 R A K T I N F G V L Y G High Taq C-Q2 R R A A K T F I Y G L Y G Low Taq C-Q3 R R A A K T I 1_X G V L Y G Low Taq C-Q4 R R A A K T RRAAKTIIFGVLYG F G V L Y G High Tqa C-Q5 R R A A K T I N Y G V L Y G Low Specificity residue conserved sequence motifs from six families of polymerases: DNA polymerases from Pol I, Pol alpha, and Pol beta families, DNA-dependent RNA polymerases, reverse SUBSTlTlJTE StEET (F1U1.E 26) transcriptases, and RNA-dependent RNA polymerases; their results suggest a few residues are conserved between all polymerases. According to their alignment (Figure 3, page 463), the selectivity residue identified herein (phenylalanine 762 in E. coli DNA polymerase I) is in "Motif B". In addition to the pol I family of DNA
polymerases, Motif B is found in the pol alpha family of DNA polymerases and the T7 family of DNA-dependent RNA
polymerases; thus this alignment strongly suggests the residues one should mutate in these other families of polymerases, that include T4 DNA polymerase, Herpes DNA
polymerase, 029 DNA polymerase, Vent DNA polymerase and Pfu DNA polymerase.
In addition, Joyce, Current Opinion in Structural Biology 1, 123-129 (1991) compares the DNA sequences from many polymerases, and suggests that a small number of important active-site residues are conserved. In particular, there is a discussion of the similarities between polymerases of the pol I family (T7, pol I, Taq) and of the pol alpha family (T4, 029, Herpes). In Figure 1 (page 124) Joyce indicates the positions of 5 invariant residues found in these two families; they include lysine 758, tyrosine 766, and glycine 767; these are all very close to the selectivity residue identified herein, phenylalanine 762.
These polymerases are used in a DNA sequencing reaction under conditions in which they preferably produce nearby bands of appro}:imately uniform intensity (with about a 1.5- to 2.0-fold variation in intensity) when the DNA products of the sequencing reaction are run in a gel. By nearby is meant to include bands representing DNA products of molecular weight differing by as much as 6000, i.e., 20 bases. The actual value of this intensity will decrease along the length of the gel, as described in Tabor and Richardson, supra. Band intensity reflects the number of DNA products within a certain band. Labels such as fluorophores or radioisotopes, are used to produce a readily detectable band of intensity reflective of the number of such DNA
StJB.STrfl1TE WXT (Rlli-E 26) products. Thus, in this invention, nearby bands derived from one sequencing reaction with one chain terminating agent have approximately the same number of DNA products and thus a uniform band intensity. The sequencing conditions include incubation of the polymerase in the presence of specific divalent or trivalent cations such as manganese (II and III), ferrous and ferric ions;
monovalent and divalent cations which have no detectable effect, or are detrimental to DNA synthesis, include: K, Na, Ba, Be, Ca, Cc, Cr, Co, Cu, Ni, Si and Zn. The anion is unimportant, for example, chloride, acetate, and sulfate are suitable. Under these conditions the requirement for chain terminating agents, such as dideoxynucleosides, may be lessened by several-fold for enzymes for this invention. A chelator may also be provided in this solution in order to help regulate the concentration of available divalent metal ions. For example, citrate or isocitrate may be provided. These chelates are thought to maintain the level of, for example, free manganese ions at a concentration of between 10 and 100 uM over a wide range of manganese concentrations. That is, the chelator acts as a buffer.
The DNA polymerases of this invention do not discriminate significantly between dideoxynucleotide analogs and deoxynucleotides along the length of the DNA
template. That is, these polymerases are unable to discriminate significantly between a nucleotide that has a 3' hydroxyl group versus one that does not (i.e., has two hydrogens at the 3' position of the ribose).
However, these polymerases may discriminate against modifications at other positions on the nucleosides, even in the presence of manganese or iron. For example, the polymerases may discriminate against some dideoxynucleotide analogs which have fluorescent groups attached compared to deoxynucleotides. However, the polymerases do not discriminate to a different extent at neighboring, or nearby nucleotides, on the basis of the presence or absence of the modification to the dideoxynucleotide. Thus, while they discriminate M5fITUTE $HEET (RULf 26) ~ . ..
WO 96/12042 2201885 pCTIUS95/12910 strongly against these analogs, requiring higher concentrations for a DNA sequencing reaction compared to unmodified dideoxynucleosides, the intensity of nearby bands will still be uniform.
Thus, the polymerases of this invention provide a uniform efficiency of incorporation of chain terminating agents, even if they discriminate against overall incorporation. In addition, other polymerases of this invention will give more uniform bands with fluorescent ddNTPs than the corresponding naturally occuring enzyme, although not as uniform as with unlabelled or radioactively labelled ddNTPs.
Chain terminating agents useful in this invention include dideoxynucleosides having a 2', 3'dideoxy structure. Other agents useful in the invention are those able to specifically terminate a DNA sequencing reaction at a specific base, and are not discriminated against by the polymerase under the above conditions.
In order to determine whether any particular DNA
polymerase, in combination with any particular chain terminating agent, or other component of a sequencing reaction mixture, is useful in this invention, a standard sequencing reaction is performed, as described in Tabor and Richardson, supra, and the extent of band formation, and the uniformity of nearby bands in a sequencing gel, reviewed. If the polymerase reaction does not extend the primer by at least 20 bases, it is not suitable under the conditions used. Adjacent band uniformity within a two-fold or less range is useful in this invention, preferably the uniformity is within a 1.0-1.5 fold range.
Similarly, determination of optimum cation concentration, or of other potential cations useful in the invention, is determined by use of this sequencing reaction under various conditions. For example, cations are tested in ranges from 0.005-100 mM. An example of such an experiment follows:
The ability to incorporate a given ddNMP compared to the corresponding dNMP for any one enzyme is measured as the ratio of ddNTP to dNTP necessary to allow DNA
SUBST11UfE SHEET (RULE 26) 2201 88~) synthesis that terminates in a fixed range, detected as producing bands of no greater than a fixed molecular weight. That is, the bands produced in the reaction end within a specified range in the sequencing gel. Thus, if 5 one enzyme discriminates 1000-fold greater against a given ddNTP compared to another enzyme, a 1000-fold higher ratio of ddNTP to dNTP will be necessary to obtain bands terminating at the corresponding sites in the same range of the gel.
Exonuclease Activity The DNA polymerases of the invention preferably have less than 50%, preferably less than 1%, and most preferably less than 0.1%, of the normal or naturally associated level of exonuclease activity (amount of activity per polymerase molecule). By normal or naturally associated level is meant the exonuclease activity of e.g., an unmodified T7-type polymerase.
Normally the associated activity is about 5,000 units of exonuclease activity per mg of polymerase, measured as described below by a modification of the procedure of Chase et al. (249 J. Biol. Chem. 4545, 1974).
Exonucleases increase the fidelity of DNA synthesis by excising any newly synthesized bases which are incorrectly basepaired to the template. Such associated exonuclease activities can be detrimental to the quality of DNA sequencing reactions. They raise the minimal required concentration of nucleotide precursors which must be added to the reaction since, when the nucleotide concentration falls, the polymerase activity slows to a rate comparable with the exonuclease activity, resulting in no net DNA synthesis, or even degradation of the synthesized DNA.
More importantly, associated exonuclease activity may cause a DNA polymerase to idle at regions in the template with secondary structure impediments. When a polymerase approaches such a structure its rate of synthesis decreases as it attempts to pass. An associated exonuclease will excise the newly synthesized SUBSTITUTE SHEET (R1LE 26) WO 96/12042 PCT/US95/129*
DNA when the polymerase stalls. As a consequence numerous cycles of synthesis and excision will occur.
This may result in the polymerase eventually synthesizing past the hairpin (with no detriment to the quality of the sequencing reaction) ; or the polymerase may dissociate from the synthesized strand (resulting in an artifactual band at the same position in all four sequencing reactions); or, a chain terminating agent may be incorporated at a high frequency and produce a wide variability in the intensity of different fragments in a sequencing gel. This happens because the frequency of incorporation of a chain terminating agent at any given site increases with the number of opportunities the polymerase has to incorporate the chain terminating nucleotide.
An ideal sequencing reaction will produce fragments that give bands of uniform intensity throughout the gel.
This is essential for obtaining the optimal exposure of the X-ray film for every radioactive fragment. If there is variable intensity of radioactive bands, then fainter bands may go undetected. To obtain uniform radioactive intensity of all fragments, the DNA polymerase should spend the same interval of time at each position on the DNA, showing no preference for either the addition or removal of nucleotides at any given site. This occurs if the DNA polymerase lacks any associated exonuclease, so that it will have only one opportunity to incorporate a chain terminating nucleotide at each position along the template.
~hort Primers The DNA polymerase of the invention is preferably able to utilize primers of 10 bases or less, as well as longer ones, most preferably of 4-20 bases e.g., 6 bases (which can be used in groups of three to form an equivalent of an 18-mer). The ability to utilize short primers offers a number of important advantages to DNA
sequencing. The shorter primers are less expensive and easier to synthesize than the usual 17-mer primers. They MSTITUTE -S`HEtT {RULE 28) also anneal faster to complementary sites on a DNA
template, thus making the sequencing reaction faster.
Further, the ability to utilize small (e.g., six or seven base) oligonucleotide primers for DNA sequencing permits strategies not otherwise possible for sequencing long DNA
fragments. For example, a kit containing 80-4000 random hexamers could be generated, none of which are complementary to any sites in the cloning vector.
Statistically, one of the 80 hexamer sequences will occur an average of every 50 bases along the DNA fragment to be sequenced. The determination of a sequence of 3000 bases would require only five sequencing cycles. First, a "universal" primer (e.g., New England Biolabs #1211, sequence 5' GTAAAACGAACGGCCAGT 3') would be used to sequence about 600 bases at one end of the insert. Using the results from this sequencing reaction, a new primer would be picked from the kit homologous to a region near the end of the determined sequence. In the second cycle, the sequence of the next 600 bases would be determined using this primer. Repetition of this process five times would determine the complete sequence of the 3000 bases, without necessitating any subcloning, and without the chemical synthesis of any new oligonucleotide primers.
The use of such short primers may be enhanced by including gene 2.5 and gene 4 protein of T7 in the sequencing reaction.
In vitro Mutagenesis Mutagenesis of the polymerase genes was carried out using standard PCR techniques (see below).
Discrimination against ddNTPs In the presence of magnesium as the only divalent cation, T7 DNA polymerase discriminates about 3 - 4 fold against ddNTPs, less than any other known polymerase.
The next closest is reverse transcriptase, that discriminates about 10 - 50-fold against ddNTPs (3 - 10 times more than T7 DNA polymerase). After these two, all other known DNA polymerases characterized in the SUBSTINt'E SHEET (RULE 26) ; 2 0 1 ~8 J' literature discriminate at least 100-fold against ddNTPs, and often 10,000 fold or more.
In the presence of manganese discrimination by T7 DNA polymerase and E. coli DNA polymerase I are reduced;
with T7 DNA polymerase, it is reduced from 3.7 to 1, and with E. coli DNA polymerase I it is reduced from 550 to 3.9 (for ddATP). Applicant is the first to provide a DNA
polymerase that in the presence of magnesium ions as the only divalent cations has a processivity of less than 100 (defined as the average length of extension from a given primer before dissociating from the primer-template;
.Reverse transcriptase has a processivity by this definition of about 150 - 200, and T7 DNA polymerase has a processivity greater than this) and that discriminates less than 100 fold against incorporation of a ddNMP. In contrast, most of the known DNA polymerases, such a Taq that have a processivity less than 100 discriminate more than 100 fold against incorporation of a ddNMP.
Previously it was believed that whereas polymerases with high processivity such as T7 DNA polymerase remain bound to a primer-template for up to several minutes, polymerases with low processivity such as E. coli DNA
polymerase I cycle from one primer template to another every few seconds (or over one hundred times more frequently). See for example Tabor,et al. J. Biol.
Chem. 262, 16212-16223 (1987)). While processivity is advantageous for DNA sequencing, as in reducing the background due to terminations not from dideoxy incorporation, the slow cycling time is a disadvantage.
For example, if the polymerase does dissociate at specific sequences, it will result in strong artifactual bands on a sequencing gel unless there is a large excess of polymerase present. On the other hand, with a polymerase that cycles rapidly, one can use much less polymerase since a single enzyme molecule will extend many different primers during the course of a sequencing reaction, and any given primer end will have the opportunity to be extended by many different polymerase molecules, decreasing the chance of strong specific stops SUBSTlnrrE SHEET (RULE 26) ' 220188-from occurring.
However, it is better that a polymerase that cycles rapidly also incorporates ddNMPs efficiently, in order to give bands of uniform intensity and allow one to use less ddNTPs. It is also preferred that such a polymerase have low or no exonuclease activity, and that one add pyrophosphatase to prevent degradation of bands by pyrophosphorolysis.
It is also preferred that one be able to carry out DNA sequencing reactions with magnesium as the only divalent cation (i.e., the absence of manganese). First, polymerases tend to be less active with manganese compared with magnesium (see for example Tabor and Richardson. Proc. Natl. Acad. Sci. USA 86, 4076-4080 (1989)). Second, while polymerases tend to be active over wide ranges of magnesium concentrations, there is a very sharp, low optimum manganese concentration required in most cases for optimum activity (id.). And at the optimum manganese concentrations there is much less effect on reduction of discrimination against ddNTPs as at much higher concentrations, where the polymerase is much less active. Third, manganese is not as convenient a metal ion to have in a kit; it readily forms precipitates, particularly at higher pH's. Fourth, it is not clear whether manganese will be effective as a metal ion for reducing discrimination against ddNTPs with any thermophilic polymerase (i.e., at higher temperatures).
Prior to the present invention, we stated (Tabor and Richardson Proc. Natl. Acad. Sci. USA 86, 4076-4080 (1989)) that there was a correlation between discrimination and processivity:
"DNA polymerase I has a low processivity, dissociating after the incorporation of less than 10 nucleotides. There is a strong correlation between the frequency at which the enzyme dissociates from a site during DNA synthesis in the absence of ddNTPs, and the extent of discrimination against the incorporation of a ddNMP at that site (unpublished results). This suggests that DNA polymerase I
SUBSflTM SHEET (RULE 26) WO 96/12042 2201885 PCTfUS95/129 incorporates dNMPs and ddNMPs at similar rates during processive synthesis; however, when synthesis is nonprocessive, dNMPs are incorporated preferentially over ddNMPs. This model could 5 account for the greater variabiliy in ddNMP
incoporation byDNA polymerase I complared to T7 DNA
polymerase, since the latter has a processivity two orders of magnitude greater than the former."
[Citations omitted.]
10 Thus the results of this invention with E. coli DNA
polymerase I and Taq DNA polymerase are surprising, since we find no evidence that the mutants described herein do increase the processivity of the mutant enzymes.
15 Thermophilic polymerases Thermophilic polymerases that discriminate against a ddNTP by less than a factor of 100 are particularly useful in this invention. In, addition, those that discriminate against a ddNTP by less than a factor of 100 20 in the presence of magnesium as the only divalent cation and preferably cycle from one primer-template to another more than once per second are useful. Thermophilic polymerases are defined as polymerases that have optimum DNA polymerase activity in a 15 min. reaction at a 25 temperature above 60 C.
Uniform band intensities Even though manganese reduces the discrimination of Klenow fragment against ddATP from 550 to 3.9 fold Tabor 30 and Richardson (Proc. Natl. Acad. Sci. USA 86, 4076-4080 (1989)) show that there is still wide variability in the intensity of individual bands (see Figure 2 id.). Thus, apart from T7 DNA polymerase, this invention is the first to provide polymerases that cycle rapidly such as Klenow 35 fragment and those that are derived from thermophilic organisms to produce bands that have uniform intensities, even in the presence of magnesium as the only divalent cation, conditions that are preferable for the activity of most polymerases (see above). Enzymes which cycle SUBSTCt11TE SHEET (Rt1LE 26) = WO 96/12042 PCT/US95/12928 rapidly can be determined by methods known in the art as described below.
' Specific Polymerases From the above information it is possible to readily make the following polymerases which will have the desired properties discussed above. Each of these polymerases can be used for sequencing procedures if the level of exonuclease is low enough and the activity of the polymerase is sufficient (both of which are well known in the art) See Braithwaite and Ito, supra for reference to each amino acid site.
1. Pol I family E. coli DNA polymerase I with altered Phe762 (altered means replaced with e.g., Tyr, or an equivalent amino acid to give the non-discriminatory property).
Streptococcus pneumoniae DNA polymerase I with altered Phe711.
Thermus aquaticus DNA polymerase I with altered Phe667.
Thermus flavus DNA polymerase I with altered Phe666.
Bacteriophage T5 DNA polymerase with altered Phe570.
Bacteriophage Spo 1 DNA polymerase with altered Leu526.
Bacteriophage Spo 2 DNA polymerase with altered Phe690.
Mitochondrial DNA polymerase with natural Tyr753 or altered at this site without reducing the non-discriminatory activity. Such a polymerase has not previously been used for DNA sequencing. Applicant believes that it will be useful in such a procedure because of its expected low level of ddNTP
discrimination. If needed, it can be modified to reduce exonuclease activity associated with the polymerase activity.
SUBSTITUTE SHEET (RULE 26) 220 i 8U5 2. Polymerase alpha family (also called polymerase II
family) Delarue et al., Protein Engineering 3, 461-467 (1990) show that the two families of polymerases (polymerase I family and polymerase alpha family) share three common motifs. The region they call "Motif B"
contains the residue we have identified as responsible for specificity for the dideoxyribose moiety. This region is characterized by the sequence K Nl N2 N3 N4 N5 N6 N7 Y G in the polymerase I family, whe,,rre N is the specificity residue: if N4 is a phenylalanine there is high discrimination, if N4 is tyrosine there is low discrimination. In the polymerase alpha family, the sequence is K Nl N2 q N 1aJ tT Y G (there is one less base between the conserved residues). We predict therefore that just as with polymerase I type enzymes, changes to the residue(s) in this motif (betwen the lysine (K) and the tyrosine (Y)) will reduce the discrimination of these polymerases to ddNTPs. These residues are as follows:
Escherichia coli DNA polymerase II I1e494-Phe499 PRD1 DNA polymerase Leu341-Ser346 0 29 DNA polymerase Leu384-Leu389 M2 DNA polymerase Leu381-Leu386 T4 DNA polymerase Ile558-Leu563 Thermuococcus litoralis DNA polymerase (Vent) Leu492-Tyr497 Pyrococcus furiosus DNA polymerase Leu489-Phe494 Sulfolobus solfataricus DNA polymerase Va1604-Thr609 Human DNA polymerase alpha Leu951-His956 S. cerevisiae DNA polymerase I (alpha) Leu945-His950 S. pombe DNA polymerase I (alpha) Leu931-His936 Drosophila melanogaster DNA polymerase alpha Leu960-His965 Trypanosoma brucei DNA polymerase alpha Leu845-His850 Human DNA polymerase delta Val695-Va1700 Bovine DNA polymerase delta Va1694-Va1699 "$TtNTE SHEET (RULE 26) 2Z01g8J
= WO 96/12042 PCT/US95/12928 S. cerevisiae DNA polymerase III
(delta) I1e702-Va1707 S. pombe DNA polymerase III
(delta) Va1681-Va1686 Plasmodium falciparum DNA polymerase delta I1e692-Va1697 S. cerevisi.ae DNA polymerase II
(epsilon) Va182-Phe830 S. cerevisiae DNA polymerase Rev3 Leu1087-Thr1092 Herpes Simplex virus type 1 DNA
polymerase Va1812-Va1817 Equine herpes virus type 1 DNA polymerase Va1813-Va1818 Varicella-Zoster virus DNA polymerase Val776-Val781 Epstein-Barr virus DNa polymerase Cys682-Va1687 Herpesvirus saimiri DNA polymerase Val671-Va1676 Human cytomegalovirus DNA polymerase Val811-Phe816 Murine cytomegalovirus DNa polymerase Va1717-Phe722 Human herpes virus type 6 DNA polymerase Ile667-Va1672 Channel Catfish virus DNA polymerase I1e750-His755 Chlorella virus DNA polymerase Ile586-Va1591 Fowlpox virus DNA polymerase I1e648-Va1653 Vaccinia virus DNA polymerase I1e637-Val642 Choristoneura biennis DNA polymerase Ile669-Leu674 Autographa californica nuclear polyhedrosis virus (AcMNPV) DNA polymerase Arg606-Ile611.
Lymantria dispar nuclear polyhedrosis virus DNA polymerase Arg624-I1e629 Adenovirus-2 DNA polymerase Leu696-Leu701 Adenovirus-7 DNA polymerase Leu762-Leu767 Adenovirus-12 DNA polymerase Leu694-Leu699 S-1 maize DNA polymerase Leu6l8-Leu623 kalilo neurospora intermedia DNA
polymerase Leu776-Leu777 pAI2 Ascobolus immersus DNA polymerase Leu951-Leu956 pCLKl Claviceps purpurea DNA polymerase Leu831-Leu836 Maranhar neurospora crassa DNA polymerase Leu752-Leu757 pEM Agaricus bitorquis DNA polymerase Leu573-Leu578 SUBSTITUiF SHEET (RULE 26) ~.
220 1,88 WO 96/12042 PCT/US95/129=
pGKL1 Kluyveromyces lactis DNA polymerase I1e785-Leu790 pGKL2 Kluyveromyces lactis DNA polymerase I1e770-G1y776 pSKL Saccaromyces kluyveri DNA polymerase I1e775-G1y781 1.'.xam~nlaG
The following are examples of methods for determining the processivity and cycle times for various polymerases.
Also provided are examples for determining the level of discrimination by a polymerase, and other methods useful in this invention.
Example 1. Mutagenesis of DNA polymerase aenes and overaroduction of mutant DNA polymerases Standard techniques are used for the cloning and expression of mutant DNA polymerase genes. The genes for the large fragment of E. coli DNA polymerase I(Klenow fragment) and the large fragment of Taq DNA polymerase (KlenTaq or _Taq DNA polymerase, see Barnes 112 Gene 29, 1992 or Stoffel fragment, see Lawyer et al. 2 PCR Methods Appl 275, 1993), the starting materials for the generation of mutants in E. coli DNA polymerase I and Taq DNA polymerase, are expressed under the control of the T7 RNA polymerase promoter. The gene for the _28 amino acid deletion of T7 DNA polymerase (see Tabor and Richardson 264, J. Biol. Chem. 6447, 1989), the starting material for the generation of mutants in T7 DNA polymerase, is expressed under the control of the lac promoter in a strain that produces E. coli thioredoxin, a necessary factor for processive DNA synthesis by T7 DNA polymerase (Tabor and Richardson, supra). The gene for Taq DNA
polymerase mutant F667Y is transferred from the gene that produces _Taq DNA polymerase to the gene that produces the full length Taq DNA polymerase by standard techniques using PCR and restriction digestion followed by ligation.
Mutagenesis to construct the mutant DNA polymerases is carried out using standard mutagenesis techniques by PCR similar to the method described by Sarkar and Sommer 8 BioTechniques 404, 1990. To construct hybrids in which more than four amino acid residues are being exchanged, SUBSTtTUTE SHEET (RULE 26) Wo 96/12042 PCT/US95/12928 two hybrid primers are constructed whereby PCR is first carried out on the donor DNA, and then that product is used for PCR on the recipient DNA, generating the hybrid molecule. For the construction of hybrids in which the 5 exchange of domains is four amino acid residues or less, single PCR primers are synthesized that contain the entire region to be transferred as well as the proper flanking sequences of the recipient, and that primer is used to construct the hybrid molecule directly.
10 Overproduction of the mutant DNA polymerases is carried out using standard techniques (see for example Current Protocols in Molecular Biology, Ausubel et al, eds., Chapter 16, 1994). Mutant proteins are purified by standard procedures including ion exchange 15 chromatography. For the purification of E. coli DNA
polymerase I mutants, see for example Joyce and Grindley 80 Proc. Natl. Acad. Sci. 1830, 1983. For the purification of Taq DNA polymerase mutants, see for example Engelke et al. 191 Analytical Biochemistry 396, 20 1990. For the purification of T7 DNA polymerase mutants, see for example Tabor and Richardson 264, J.
Biol. Chem. 6447, 1989. Polymerase specific activities of each of the purified mutant proteins are determined by standard procedures described in these references.
25 Example 2. Rapid screen of DNA polymerases for mutants that are improved in their efficiency of incorporating a dideoxynucleotide relative to a deoxynucleotide Mutant DNA polymerases are screened for their ability to incorporate dideoxynucleotides by SDS activity gel 30 analysis. The procedure is a modification of that described by Spanos and Hiibscher 91 Methods in Enzymology 263, 1983 and Karawya et al. 135 Analytical Biochemistry 318, 1983. Briefly, 10 ml of cells are induced for 4 to 6 hours and then pelleted. The cell pellet is 35 resuspended in 0.3 ml 25 mM Tris-HCl, pH 7.0, 5 mM EDTA.
20 1 of the resuspended cells are mixed with 40 1 of a solution of 11; SDS (sodium dodecyl sulfate), 2%
mercaptoethanol, 30 % glycerol, 0.04% bromphenol blue, and 100 mM Tris=HCl, pH 6.8. The mixtures are incubated SLi6STiNTE SHFET (RULE 26) at 37 C for 5 min, and then 20 1 aliquots are loaded in duplicate onto two SDS polyacrylamide gels. The SDS
polyacrylamide gels consist of a resolving gel that contains 8!k polyacrylamide, 0.27% bisacrylamide, 190 mM
Tris=HCl, pH 8.8, 0.05%- SDS, and 25 g/ml denatured salmon sperm DNA, and a stacking gel that consists of 5%
polyacrylamide, 0.17% bisacrylamide, 150 mM Tris=HCl, pH
6.8, and 0.1% SDS. The two gels are electrophoresed at 100 V for 13 hr at a constant temperature of 13 C in an electrophoresis buffer consisting of 190 mM Tris=HCl and pH 8.8, 0.05%- SDS.
After electrophoresis, the gels are washed over 8 hr in 4 changes of 500 ml each of Renaturation Buffer (50 mM
Tris.HC1, pH 7.5, 5 mM magnesium acetate, 1 mM
dithiothreitol, 40 mM KC1, 400 g/ml bovine serum albumin, 16% glycerol and 0.95 mM EDTA) at 4 C.
The renatured proteins are assayed for DNA polymerase activity by incubating each of the two gels in 6 ml of Renaturation Buffer, 1.5 M 4dNTPs, 4 1 of [a-32P] dATP
(800 Ci/mmol, 10 mCi/ml), and 80 g of purified thioredoxin. One of the mixtures also contains 30 M
ddTTP (a 20-fold molar excess over dTTP). The mixtures are incubated for 4 hr at 37 O C (70 C for 2 hr for thermophilic DNA polymerases).
After incubation the gels are washed for 8 hr against four changes of 5% trichloracetic acid and 1% sodium pyrophosphate. The gels are then dried and autoradiographed.
To determine whether a mutant DNA polymerase is discriminating more or less against ddTTP, the intensities of the radioactive bands are compared on the two gels that were incubated in the presence and absence of ddTTP, and the ratio of the signal in the two bands for the unmodified DNA polymerase is compared with the ratio of the signal in the two bands for each of the mutants. If a mutation results in a DNA polymerase being less discriminatory towards ddTTP, then there will be a greater percentage decrease in radioactivity in the band in which ddTTP was present for the mutant DNA polymerase SUBSTINTE SFiEET (RULE 26) compared with the unmodified DNA polymerase. For example, under these conditions the radioactive bands observed for cells containing induced E. coli DNA
polymerase I or T7 DNA polymerase mutant Y526F are approximately the same intensity (within a factor of two) in reactions carried out in the presence versus the absence of ddTTP. In contrast, for cells containing induced E. coli DNA polymerase I mutant F762Y or T7 DNA
polymerase, the bands on the gel in which the reactions are carried out in the presence of ddTTP are less that 50 the intensity of the bands corresponding to the reactions carried out in the absence of ddTTP.
This assay represents a rapid method of screening a large number of DNA polymerase mutants for their ability to discriminate against dideoxynucleotides. It can detect changes of at least 5-fold in the relative rate of discrimination. However, this assay should be followed up with purification of potentially interesting mutant DNA polymerases and more rigorous assays of the purified proteins similar to those described below in order to determine precisely the effect of each mutation on discrimination against dideoxynucleotides.
The following examples are methods for determining the processivity and cycle times for various polymerases, determining the level of discrimination against ddNTPs by a polymerase, determining the uniformity of bands generated by dideoxy-terminated fragments on a DNA
sequencing gel; and using the DNA polymerases of this invention for DNA sequence analysis.
Example 3.Preparation and purification of a sinale-stranded M13 DNA - 5' 32P-labeled 40-mer primer complex The template is M13 mGPl-2 single-stranded DNA, 9950 nucleotides in length, as described in U.S. Patent 4,795,699 (Figure 9). The phage M13 mGPl-2 is deposited in ATCC as number 40303. M13 mGPl-2 single-stranded DNA
is purified as described in Tabor et al., 262 J. Biol.
Chem. 16212, 1987. Briefly, the phage is purified through two CsCl gradient centrifugations, the CsCl is 'SUBSfITUTE SHEET (RU1.E 26) removed by dialysis, the DNA is removed f rom the phage by extraction with phenol and chloroform in the presence of 0.1% sodium dodecyl sulfate, and the extracted DNA is dialyzed extensively against 20 mM Tris=HC1, pH 7.5, 2 mM
EDTA and stored at 4 C. The concentration of M13 mGPl-2 single-stranded DNA is determined spectrophotometrically using an extinction coefficient of 8.1 A260 units = 1 mg/ml, or 0.3 pmoles of M13 mGPI-2 template molecules per microliter.
The primer is a synthetic 40-mer having the sequence 5' d(TTTTCCCAGTCACGACGTTGTAAA.ACGACGGCCAGTGCCA)3' synthesized by standard procedures. It is complementary to M13 mGP1-2 DNA at nucleotid.es 9031 to 8992 (see 1699 patent supra for sequence). The primer is purified by ion exchange chromatography or denaturing polyacrylamide gel electrophoresis prior to end labeling.
The primer is labeled and annealed to the template essentially as described in T'abor et al (supra). The primer is 5' end labeled in a reaction mixture (15 1) containing 40 mM Tris=HC1, pH 7.5, 10 mM MgC121 5 mM
dithiothreitol, 100 mM NaC1, 50 g/ml BSA, 50 Ci [g32P]ATP, 6000 Ci/mmol, 5 pmoles of primer, and 10 units of T4 polynucleotide kinase prepared from the PseTl mutant, which is deficient in phosphatase activity. The mixture is incubated at 37 C for 15, min, followed by 70 C for 15 min to inactivate the kinase. 60 1 of single-stranded M13 mGPl-2 DNA (0.25 mg/ml), 6 1 of 1M
NaCl and 3 1 of 0.2 M MgC12 are added, and the mixture is slowly cooled from 70 C to room temperature (about 20 - 250C) over a period of 30 inin. The mixture is then extracted once with a 1:1 mixture of phenol and chloroform, and after cennrifugation for 30 sec in a microfuge, the aqueous phase (70 1) is placed on a 1 ml column of SepharoseM CL-GB equilibrated in 20 mM
Tris=HC1, pH 7.5, 2 mM EDTA, and 100 mM NaCl. The labeled primer-template complex is eluted from the column with the same buffer used for equilibration; the labeled complex elutes in the void volume. After elution, the complex is at a concentration of approximately 50 g/ml (0.015 pmoles of molecules per 1) with a specific activity of approximately 200,000 cpm/ 1.
Example 4. Determination of the processivity of a DNA
Fo1ynerase by dilution test Processivity is determined by enzyme dilution essentially as described in Tabor et al (supra) and Tabor and Richardson, 84 Proc. Nat1. Acad. Sci USA 4767, 1987.
The reactions are carried out under the same conditions as in the extension/termination reactions used in DNA
sequencing (Tabor and Richardson, supra), except that ddNTPs are omitted and the polymerase concentration is reduced in order to have an excess of primer-template molecules over polymerase molecules in some of the reactions. The primer-template consists of the single 5' end-labeled primer annealed to a single-stranded M13 DNA
molecule as described in Example 3.
The extension reaction mixtures are prepared substantially as described in Tabor et al (supra). Each reaction mixture (18 1) contains 1.0 1 annealed 32P-labeled primer-M13 DNA as described in Example 3(-=0.015 pmoles, -200,000 cpm), 40 mM Tris-HCl, pH 8.0, 5 mM MgC1z1 5 mM dithiothreitol, and 300 M 4dNTPs. The mixtures are incubated at 37 C for 1 min (70 C for thermophilic DNA
polymerases). The reactions are initiated by the addition of 2 1 aliquots of dilutions of the DNA
polymerase being analyzed, diluted in 20 mM Tris-HCl, pH
7.5, 10 mM 2-mercaptoethanol and 0.0596 bovine serum albumin. The reaction mixtures are further incubated at 37 C (70 C for thermophilic DNA polymerases) for either 30 sec or 3 min. At the indicated times, 8 1 aliquots are removed and added to either 8 1 of 90% formamide, 20 mM EDTA, 0.051-1; bromphenol blue for denaturing polyacrylamide gel electrophoresis, or 2 1 of 100 mM
EDTA, 2%- sodium dodecyl sulfate for alkaline agarose gel electrophoresis.
The samples are analyzed by either denaturing polyacrylamide gel electrophoresis or alkaline agarose gel electrophoresis. Denaturing polyacrylamide gel &Sf ITUTE SHEET (RULE 26) WO 96/12042 PCT/US95112*
electrophoresis is most suitable for analyzing polymerases with an average processivity of less than 500 nucleotides, while alkaline agarose gel electrophoresis provides a more sensitive estimate of the processivity of 5 DNA polymerases with an average processivity greater than 500 nucleotides; however, either method can be used successfully to determine the average processivity of any DNA polymerase.
To determine the processivity by denaturing 10 polyacrylamide gel electrophoresis, aliquots in formamide are heated at 90 C for 2 min immediately prior to loading 6 1 of each sample onto a gel consisting of 8%-polyacrylamide, 0.4%, N,N'-methylenebisacrylamide, 7 M
urea in 100 mM Tris=borate, pH 8.3, 1 mM EDTA.
15 Electrophoresis is at 2000 Volts for 90 min (until the bromphenol blue has just run off the bottom of the gel).
Suitable 5' 32P-end labeled molecular weight markers are also loaded onto the gel that allow the determination of fragment sizes 100 to 500 nucleotides in length. An 20 example of such suitable markers are T7 HpaI fragments that have been dephosphorylated with alkaline phosphatase and then 5' 32P-end labeled using [~J- P] ATP and T4 polynucleotide kinase using standard procedures (Sambrook et al., 1989, Molecular Cloning A Laboratory Manual, Cold 25 Spring Harbor Laboratory, N.Y. pp. 6.20-6.21 and Ausubel et al., 1989, Current Protocols in Molecular Biology, Greene Publishing Associates and Wiley Interscience, N.Y.). After electrophoresis, the gel is dried under vacuum, and autoradiographed. After autoradiography, the 30 distribution of radioactively labeled fragments is determined by Phosphorimager analysis (Molecular Dynamics ) .
Products are analyzed by alkaline agarose gel electrophoresis as described by (1) Villani et al., 256 35 J. Biol. Chem. 8202, 1981, by (2) Sabatino et al., 27 Biochemistry 2998, 1988, and by (3) Sambrook, et al., supra . To prepare the agarose gel, 250 ml of a 1.5%
agarose solution in 2 mM EDTA, pH 8.0 is heated in a microwave oven to dissolve the agarose, and then allowed ~}BSfINiE SHEET (RULE 26) to cool to 60 C. 8.75 ml of 1 N NaOH (finai concentration 35 mM NaOH) is added, the gel is poured into an agarose gel electrophoresis mold. The gel is allowed to set for 2 hours before use. The electrophoresis buffer is 35 mM NaOH and 2 mM EDTA. The samples are prepared by taking the 10 1 aliquots described above (in 20 mM EDTA, 0.4!~ sodium dodecyl sulfate) and denatured by the addition of 1 1 of 1 N
NaOH and heating at 60 C for 10 min. 7 1 of 75 ~
glycerol, 0.2 o bromcresol green are added to each sample, and then the samples are loaded onto the alkaline agarose gel. Electrophoresis is carried out at 4 C at a constant current of 150 mA for 15 hours (until the bromcresol green has migrated about 14 cm). The electrophoresis chamber has dimensions of 26 cm (length) x 20 cm (width) by 2 cm (height). After electrophoresis, the gel is soaked for 2 hours in 10% trichloracetic acid, dried under vacuum, and autoradiographed. After autoradiography, the distribution of radioactively labeled fragments is determined by Phosphorimager analysis (Molecular Dynamics).
To test the processivity of a given DNA polymerase, the concentration of that polymerase in the reaction-mixture is diluted by 2-fold increments until only a fraction (e.g. 25%) of the primers are extended, while the majority (e.g. 75a) remained unchanged, 40 nucleotides in length. Under these conditions, a two-fold increase or decrease in DNA polymerase concentration should result in about a two fold increase or decrease, respectively, in the fraction of primers extended. The average length of the labeled fragments that are extended is determined either by visual inspection of the autoradiograph or by quantitation using a phosphoimager.
For example, using this test, the exonuclease-deficient T7 DNA polymerase complexed to its processivity factor thioredoxin (e.g. SEQUENASE Version 2.0, United States Biochemical Corporation) has an average processivity of more than 500 nucleotides, while Klenow fragment of E.
coli DNA polymerase I has a processivity of less than 50 SUgSTiTTE SHEET (#tULE 26) nucleotides.
Example 5. Determination of the rate of cyrclina of a DNA
nol;,merase The rate of cycling is determined essentially as described Tabor et al (supra). The reactions are carried out under the same conditions as in the extension/termination reactions used in DNA sequencing (Tabor and Richardson, supra), except that ddNTPs are omitted and the polymerase concentration is reduced in order to have an excess of primer-template molecules over polymerase molecules. The primer-template consists of the 5'[32P]end-labeled primer annealed to single-stranded M13 DNA molecule as described in Example 3.
First a test is carried out to determine the functional ratio of primer-template molecules to polymerase molecules, for example, using the large fragment of E. coli DNA polymerase I (Klenow fragment).
A dilution experiment is carried out as described in Example 4, to determine the concentration of polymerase molecules necessary to extend 20% of the labeled primer-template molecules in 10 sec. This ratio of polymerase to primer-template is defined as being less than or equal to 1:5, and is used to determine the maximum rate of cycling of the polymerase as described below.
Extension reactions are carried out as described in Example 4 using the ratio of DNA polymerase to primer-template less than or equal to 1:5, as defined in the previous paragraph. Reactions are carried out under conditions (i.e. buffer, pH, salt, temperature) optimal for the polymerase being tested. Aliquots are removed at 10 sec, 20 sec, 40 sec and 80 sec, and the reactions are terminated and the products analyzed as described in Example 4. For DNA polymerases with low processivity (less than 100 nucleotides), such as the large fragment of E. coli DNA polymerase I, samples are analyzed preferably by denaturing polyacrylamide gel electrophoresis. For DNA polymerases with high processivity (greater than 100 nucleotides), such as T7 SBTITUTE SHEET (RULE 26) DNA polymerase, samples are analyzed preferably by alkaline agarose gel electrophoresis. After electrophoresis, the gels are dried under vacuum and analyzed either by autoradiography or with a Phosphoimager.
When reactions are carried out with a polymerase that cycles very slowly, such as T7 DNA polymerase, there is not a significant decrease (i.e. less than a factor of two) in the number of unextended primers between 10 sec and 80 sec. Thus, if the number of unextended labeled primers does not decrease by more than two fold between ,the 10 sec and 80 sec time points, then the DNA
polymerase cycles slower than once per 70 sec. For polymerases that cycle rapidly, the number of unextended primers will decrease by a significant fraction (i.e. by more than 2 fold) between the 10 and 80 sec time points.
To determine the rate of cycling for these polymerases, the following equation is used:
R = Nl x L (t2) /{L (tl) x(ta - tl) }
where:
R = minimum rate of cycling in cycles per sec.
N1 = ratio of primer-template molecules over, functional DNA polymerase molecules. (N1 = 5 in the example above.) L(tl) = the maximum processivity of the DNA polymerase being investigated, in nucleotides. This is defined as the maximum number of nucleotides extended from the labeled primers under conditions of limiting DNA
polymerase (where only 20% of the primers are extended in the 10 sec time point), as described in Example 3.
L(t2) = the maximum length of extension (in nucleotides) of the labeled primers at time t2, which is 80 sec in this example.
t,.: the shortest time at which an aliquot is taken, or 10 sec in this example.
SUBSTITUCE SHEET (RU!_E 261 WO 96/12042 22O18U J PCT/US95/129,~
t2: the longest time the reaction is allowed to precede before removing an aliquot, or 80 sec in this example.
When this test is carried out for the large fragment of E. coli DNA polymerase I, a value of more than 0.2 cycles per sec is obtained.
Examples 6 and 7 provide tests to determine the efficiency of incorporation of dideoxynucleotides for an unknown DNA polymerase using a 5' 32P-labeled 40-mer primer annealed to a single-stranded M13 DNA template and a gel electrophoresis-based analysis. Example 6 is best for DNA polymerases that efficiently incorporates dideoxynucleotides (e.g. wild-type T7 DNA polymerase and Taq DNA polymerase F667Y) while Example 7 is best for DNA
polymerases that discriminate strongly against the incorporation of dideoxynucleotides (e.g. T7 DNA
polymerase Y526F and wild-type Taq DNA polymerase).
Examnle 6. Gel electrnphorPs;.-ba d determination of the rate of incormoration of dideoxvnucjPo+-idPs relative to deoxvnuc leotides using a 1:1 ratio of dNTPs to ddNTPs The primary application of this test is to determine the absolute ratio of incorporation of a ddNMP to a dNMP
for a DNA polymerase that efficiently incorporates dideoxynucleotides, such as T7 DNA polymerase, E. coli DNA polymerase I mutant F762Y or Taq DNA polymerase mutant F667Y. It can also indicate the level of discrimination.against ddNTPs of any DNA polymerase;
however, for DNA polymerases that discriminate strongly against ddNTPs, such as T7 DNA polymerase mutant Y526F, E. coli DNA polymerase I or Taq DNA polymerase, higher ratios of ddNTP to dNTP are necessary to determine precisely their level of discrimination, which is described in detail in Example 7.
DNA synthesis reactions are carried out on the 32P-end labeled 40 mer-M13 mGPl-2 DNA template complex prepared as described in Example 3. Reaction conditions are used that are optimum for the DNA poiymerase being tested with regard to buffer, pH, salts, and temperature of the siBsnTUTE SHEET (RULE 26) reaction. A concentration of DNA polymerase is chosen whereby most of the primers are extended in a 10 min reaction and are terminated by the incorporation of a dideoxynucleotide. The reaction mixture contains 100 M
5 4dNTPs and 100 M of one of the four ddNTPs.
We used this test to compare the ability of six DNA
polymerases to incorporate each of the four ddNMPs. The DNA polymerases tested were (1) T7 DNA polymerase with a 28 amino acid deletion in the exonuclease domain and 10 complexed in a one-to-one ratio with thioredoxin (Tabor and Richardson 264 J. Bio1. Chem. 6447, 1989) (referred to here as "T7 DNA polymerase"), (2) The large fragment of E. coli DNA polymerase I, commonly called the Klenow fragment (referred to here as "E. coli DNA polymerase 15 I"), (3) Unmodified DNA polymerase from Thermus aquaticus (referred to here as "Taq DNA polymerase"), (4) T7 DNA
polymerase as described above in which the tyrosine at residue 526 has been changed to a phenylalanine (referred to here as "T7 DNA polymerase Y526F"), (5) E. coli DNA
20 polymerase I as described above in which the phenylalanine at residue 762 has been changed to a tyrosine (referred to here as "E. coli DNA polymerase I
F762Y"), and (6) Taq DNA polymerase as described above in which the phenylalanine at residue 667 has been changed 25 to a tyrosine (referred to here as "Taq DNA polymerase F667Y").
To test the relative rate of use of each of the four ddNTPs compared to the comparable dNTPs for each of the DNA polymerases indicated above, the reaction mixtures (8 30 1) contained 1.0 1 annealed 32P-labeled primer-M13 DNA
as described in Example 3 (-0.015 pmoles, -200,000 cpm), 40 mM Tris=HC1, pH 8.0, 5 mM MgC12, 5 mM dithiothreitol, 50 mM NaCl, 100 M 4dNTPs, and 100 M ddCTP. The reaction mixtures also contained 10 ng of yeast inorganic 35 pyrophosphatase to inhibit pyrophosphorolysis that could otherwise increase the apparent discrimination by the DNA
polymerase (Tabor and Richardson 265 j. Biol. Chezn. 8322, 1990). The reactions were initiated by the addition of 2 1 of each DNA polymerase, diluted in 20 mM Tris=HC1, SUBSTITM $HEET (RULE 261 2201885, pH 7.5, 10 mM 2-mercaptoethanol and 0.05% bovine serum albumin to a concentration of approximately 0.025 units/
1. The concentration of each DNA polymerase was sufficient to extend most of the labeled primers by more than 500 nucleotides in the absence of ddNTPs in a 15 min reaction. The reaction mixtures were incubated for 15 min at either 37 C (T7 DNA polymerase, T7 DNA polymerase Y526F, E. coli DNA polymerase I, and E. coli DNA
polymerase I F762Y) or 70 C (Taq DNA polymerase and Taq DNA polymerase F667Y). The reactions were terminated by the addition of 10 1 90% formamide, 20 mM EDTA, 0.050 bromphenol blue. Each sample was heated at 90 C for 2 min immediately prior to loading 6 1 of each sample onto a gel consisting of 8%- polyacrylamide, 0.4= s N,N'-methylenebisacrylamide, 7 M urea in 100 mM Tris-borate, pH 8.3, 1 mM EDTA. Electrophoresis was at 2000 Volts for 90 min (until the bromphenol blue had just run off the bottom of the gel). After electrophoresis, the gel was dried under vacuum, and autoradiographed. After autoradiography, the distribution of radioactively labeled fragments was determined by Phosphorimager analysis (Molecular Dynamics). Alternatively, the relative intensities of dideoxy-terminated bands can be determined by scanning the autoradiograph using an instrument such as the SciScan 5000 imaging densitometer (United States Biochemical Corp).
When the set of four reactions (each containing a single ddNTP at an equimolar concentration as the dNTP) was carried out with each of the six DNA polymerases described above, the reactions with three of the DNA
polymerases (T7 DNA polymerase Y526F, E. coli DNA
polymerase I and Taq DNA polymerase) resulted in most (>50%) of the radioactivity in the primers that had been extended migrating at the top of the gel, corresponding to fragments greater than 300 bases in length. Based on the predicted exponential decay of signal with increasing fragment size, this corresponds to discrimination by these three DNA polymerases of more than l00 fold against all four ddNTPs. A more precise measurement of the SU5TIM'E SHEET (RULE 26) discrimination against ddNTPs by these three DNA
polymerases is obtained using the test in Example 7 below.
For the other three DNA polymerases (T7 DNA
polymerase, E. coli DNA polymerase I F762Y and Taq DNA
polymerase F667Y) the autoradiograph showed a series of dideoxy-terminated fragments with all of the reactions.
In general, the average lengths of the labeled synthesized fragments were lowest for Taq DNA polymerase F667Y, with only about six radioactively labeled dideoxy-terminated fragments visible with even a several day exposure of the film. The average lengths of labeled fragments with E. coli DNA polymerase I F762Y are slightly longer than with Taq DNA polymerase F667Y, while the average lengths are significantly longer with T7 DNA
polymerase. The fragments are more uniform in intensity when synthesized by E. coli DNA polymerase I F762Y and Taq DNA polymerase F667Y that by T7 DNA polymerase.
The distribution of radioactivity in the fragments was quantitated by Phosphoimager analysis (Molecular Dynamics). The total amount of labeled primers in each lane was determined by running three control reactions in which no DNA polymerase was present, and the radioactivity in each of the corresponding radioactive bands on the gel at the position of the unextended primer was determined. With some preparations of radioactively labeled primers, a certain percentage (<100) is not extended by any of the DNA polymerases, regardless of the concentration of DNA polymerase used; this background level is determined by measuring the percentage of radioactivity remaining at the position of unextended primer for a series of four reactions containing ddNTPs, and subtracting the average of these four values from the total number of counts determined previously. This value is defined as the total number of counts in primers that are capable of being extended by a DNA polymerase.
The total number of counts (i.e. radioactivity) in the first three dideoxy-terminated fragments were determined for T7 DNA polymerase, E. coli DNA polymerase SUBSTIME SKET (NllE 26i 2201,8g5-WO 96/12042 PCT/US95/12,4 I F762Y and Taq DNA polymerase F667Y for each of the four ddNTP reactions. The values are presented in the Table below as the percentage of counts in the first three dideoxy-terminated fragments to the total number of counts in the primers capable of being extended by a DNA
polymerase.
Polymerase Reaction ddGTP ddATP ddTTP ddCTP
polymerase 67% 66% 76% 61%
E. coli DNA
polymerase I F762Y 95% 92% 96% 92%
Taq DNA
polymerase F667Y 97% 95% 95% 99%
As a further test of the efficiency of each DNA
polymerase to incorporate dideoxynucleotides, the number of counts in each fragment with a significant signal was determined for each reaction, and the data were plotted as a function of the fragment number using the Macintosh program Kaleidograph Version 3.0 (Synergy Software). The resulting plots were fit to an exponential decay curve using the Kaleidograph library routine for this function.
The decay curve is given by the equation:
Y = e M"X
where:
Y = 1 - (the fraction of labeled primers in fragments 1 to X compared to the total number of primers that can be extended) X the fragment number (the first dideoxy-terminated fragment is 1) M = the exponential decay function calculated for the data by the Kaleidograph library routine.
SU85TITUTE SHEET (RILE 26j In the Table below, the following data are provided for each of the four ddNTP reactions using T7 DNA
polymerase, E. coli DNA polymerase I F762Y and Taq DNA
polymerase F667Y:
N, the number of fragments used to fit each exponential curve.
M, the calculated exponential decay function as described above.
D, the discrimination factor given as the ratio of the use of a specific dNTP to the use of the comparable ddNTP when both nucleotides are present at equal concentrations. D is calculated from the equation above using the calculated value of M to determine Y when X =
1, and defining D, the ratio of preference of the dNTP to the ddNTP, as Y/(1-Y).
R2, the correlation index for the data, was calculated by the Kaleidograph library routine. This is a measure of the variability in band intensities, or the sequence-specific variability in the ability of a DNA polymerase to incorporate the specific dideoxynucleotide.
SLOSTffM SHM (ME 26) WO 96/12042 220188 j PCT/US95/120 PolymeraseddNTP N M D R~
T7 DNA ddGTP 8 -0.375 2.2 0.813 poly-merase ddATP 6 -0.356 2.3 0.996 ddTTP 5 -0.450 1.8 0.997 ddCTP 8 -0.317 2.7 0.889 E. Qoli ddGTP 5 -1.03 0.56 0.999 .DNA poly-merase I
ddATP 5 -0.860 0.72 0.998 ddTTP 5 -1.06 0.54 1.000 ddCTP 6 -0.842 0.75 1.000 Taq DNA ddGTP 5 -1.18 0.45 0.995 poly-merase ddATP 6 -0.997 0.59 0.997 ddTTP 6 -1.01 0.56 0.996 ddCTP 4 -1.44 0.32 0.996 Averages:
T7 DNA 4 ddNTP 2.3 .924 poly-merase E. coli 4 ddNTP 0.64 .999 DNA poly-merase I
Taq DNA 4 ddNTP 0.48 .996 poly-merase In summary, T7 DNA polymerase discriminates an average of 2.3 fold against ddNTPs, while E. coli DNA
Si16SnTtlTE SHEET (RULE 26) polymerase I F762Y and Taq DNA polymerase F667Y actually prefer ddNTPs over dNTPs an average of 1.6 fold (1/0.64) and 2.1 fold (1/0.48), respectively. A comparison of Rz indicates that the intensity of neighboring fragments are more uniform with E. coli DNA polymerase I F762Y and Taq DNA polymerase F667Y than with T7 DNA polymerase. For a more accurate measure of uniformity, a greater number of fragments could be included in the analysis by reducing the level of ddNTPs (for example by 5 fold) in each reaction, reducing the decay in intensity at each position (see Example 13).
To determine the amount of discrimination against ddNTPs by a new DNA polymerase, reactions analogous to those described above would be carried out, and identical reactions would be carried out in parallel using T7 DNA
polymerase (SEQUENASE Version 2.0, United States Biochemical Corporation), with all reactions analyzed on the same gel. An initial comparison of the distribution of dideoxy-terminated bands obtained with the new DNA
polymerase compared with those obtained with T7 DNA
polymerase would indicate whether the new DNA polymerase discriminated more or less against ddNTPs than T7 DNA
polymerase. For example, such a visual inspection using E. coli DNA polymerase I F762Y clearly shows that for reactions with each of the 4 ddNTPs, the number of fragments visible on the gel in reactions using E. coli DNA polymerase I F762Y are less (and smaller in average size) than those using T7 DNA polymerase. A more quantitative analysis could then be carried out analogous to that described above in order to calculate the exponential decay factor (M), average relative rate of utilization of dNTPs relative to dNTPs (D) and variability in intensity (R2) for the new DNA polymerase as described above.
One complication that can occur in this test is when the DNA polymerase has an associated exonuclease activity, such as the 5' to 3' exonuclease activity associated with Taq DNA polymerase and the 3' to 5' exonuclease activity associated with E. coli DNA
SUBS11TM SHEET (RULE 26) 220188~
polymerase I and native T7 DNA polymerase (not the 28 T7 DNA polymerase deletion mutant used in the experiments above). A 5' to 3' exonuclease activity is detrimental since it can remove the label on the 5' end of the primer, reducing the radioactivity signal being detected.
This problem can be partially avoided by reducing the amount of DNA polymerase in the reaction mixture. In the example above, 0.025 units of Taq DNA polymerase resulted in virtually all of the primers being extended until terminated by incorporation of a dideoxynucleotide, without appreciable loss in radioactivity due to 5' to 3' .exonuclease activity, whereas a 40 fold increase in Taq DNA polymerase activity, or 1 unit per reaction, resulted in the loss of virtually all 32P from the 5' ends of the primer. An alternative approach to measuring the extent of discrimination for a DNA polymerase with a 5' to 3' exonuclease activity is to use a different assay such as those described in Examples 8-10.
A 3' to 5' exonuclease activity can complicate the assay described above by making the DNA polymerase appear to discriminate more against a ddNTP than it actually does (for example, see Tabor and Richardson, 86 Proc.
Nat1. Acad. Sci. 4076, 1987) This is because once a dideoxynucleotide has been incorporated, the exonuclease activity can preferentially remove the dideoxynucleotide so that DNA synthesis can continue, resulting in an increase in the length of the fragment. Preferably, the enzymes assayed in the test described above are devoid of such 3' to 5' exonuclease activity; examples are modified T7 DNA polymerase (SequenaseO, United States Biochemical Corporation), Taq DNA polymerase, exonuclease-deficient Vent (Thermococcus litoralis) DNA polymerase (New England Biolabs catalog number 257), exonuclease-deficient Deep Vent (Pyrococcus GB-i) DNA polymerase (New England Biolabs catalog number 259 ), exonuclease-deficient Pfu (Pyrococcus furiosus) DNA polymerase (Stratagene catalog number 600163), and exonuclease-deficient Klenow fragment (E. coli DNA polymerase I, United States Biochemical Corporation, catalog number 70057). In some instances, StJB5T1ME SHEET (RUI.E 26) such as with E. coli DNA polymerase I(Klenow fragment) the 3' to 5' exonuclease activity is weak and does not interfere significantly with this assay (see for example Tabor and Richardson 264 J. Biol. Chem. 6447, 1989). One method to determine whether a new DNA polymerase being tested has a 3' to 5' exonuclease activity that is interfering with the ability to accurately measure the discrimination against ddNTPs is to carry out the experiment described above, removing aliquots at different time points up to 60 min. If the size distribution of the dideoxy-terminated fragments increases with time, then it is likely that such a 3' to 5' exonuclease activity is interfering with the assay, while if the distribution of fragments is constant over time then such an activity is not having a significant effect. If the average fragment length is increasing with time, then one should use a shorter incubation time and/or decrease the DNA polymerase concentration to a range in which the fragment sizes remain constant with time.
Pyrophosphorolysis, or the reversal of the polymerase reaction, can have a similar effect as the 3' to 5' exonuclease activity, allowing the DNA polymerase to remove the chain terminating dideoxynucleotide and further increase the length of the fragments (see Tabor and Richardson, 265 J. Biol. Chem. 8322, 1990). This activity is readily avoided by including pyrophosphatase in the reaction mixture, in order to remove the pyrophosphate that accumulates during DNA synthesis and is a necessary substrate for pyrophosphorolysis.
Fxam,nlP 7. Gel electroyhoresis-based determination of the rate of incorporation of dideoxvnucleotides relative to deoxynucleotides by varving the ratio of dNTPs to dd_NTPs This example is similar to that described in Example 6. While it is the preferred test for DNA polymerases SUBSTITUTE SHEET (RULE 26) 88~
WO 96/12042 22a 1PCT/US95/12-0 that discriminate strongly against the incorporation of dideoxynucleotides (e.g. T7 DNA polymerase Y526F, E. coli DNA polymerase I and Taq DNA polymerase), it also works well with DNA polymerases that efficiently incorporate ddNMPs (e.g. T7 DNA polymerase, E. coli DNA polymerase I mutant F762Y and Taq DNA polymerase mutant F667Y). In this test, the ratio of ddNTP to dNTP is varied for two different DNA polymerase preparations, keeping all other aspects of the reactions identical, and the distributions of dideoxy-terminated radioactively labeled fragments are compared to determine the ratios required for the two DNA
polymerases being tested to obtain fragments of comparable average length.
The average length of a series of fragments is determined in one of two ways. In the first, which is best for DNA polymerases that incorporate ddNMPs efficiently, one inspects the autoradiograph and determines the position of the largest fragments visible on a given exposure for a series of reactions containing ddNTP:dNTP ratios that vary by two-fold increments using one DNA polymerase, and compares that to an analogous series using the second DNA polymerase, to determine the ratios required to generate fragments of comparable size for the two DNA polymerases. The position of the front marking the appearance of visible radioactive bands is usually relatively sharp and readily observed by eye.
However, it is also possible to determine such positions more precisely using the Phosphoimager to locate the position in each lane where a certain threshold of radioactivity per unit area occurs, starting at the top of the gel and moving down the gel.
Some DNA polymerases discriminate very strongly against the incorporation of dideoxynucleotides, in which case it is difficult to add sufficient ddNTPs to the reaction to clearly detect the position of the largest dideoxy-terminated fragments on a denaturing polyacrylamide gel. For such DNA polymerases, one can use an alkaline agarose gel electrophoresis to compare the lengths of the dideoxy-terminated fragments in the "SpTUTE SKET (RUt.E M
different series. If one uses a denaturing polyacrylamide gel, then an alternative method to determine the ratios of ddNTP:dNTP required for the two DNA polymerases to generate dideoxy-terminated fragments 5 of comparable average lengths is to focus on one or several bands and determine the ratio of ddNTPs to dNTPs required to obtain a specific level of radioactivity in those fragments, as analyzed by the Phosphoimager, for the two DNA polymerases being tested.
10 These tests were carried out using the six DNA
polymerases described in Example 6. The reaction conditions were identical to that described in Example 6 except for the concentrations of dNTPs and ddNTPs. All reaction mixtures contained 10 M 4 dNTPs. Each of the 15 four ddNTP concentrations were varied by two-fold increments in the following ranges for the six DNA
polymerases as follows: T7 DNA polymerase, E. coli DNA
polymerase I F762Y, and Taq DNA polymerase F667Y , 0.02 M to 1 M, and T7 DNA polymerase Y526F, E. coli DNA
20 polymerase I and Taq DNA polymerase F667Y, 100 to 2,000 M. The reactions were carried out and the samples were analyzed by denaturing polyacrylamide gel electrophoresis as, described in Example 6. Drying of the gel, autoradiography, and Phosphoimager analysis were as 25 described in Example 6. The Table below summarizes the results from this experiment; the values shown for T7 DNA
polymerase, E. coli DNA polymerase I F762Y, and Taq DNA
polymerase F667Y are the absolute ratios obtained in Example 6 by statistical analysis of the rate of 30 exponential decay in intensity of dideoxy-terminated fragments obtained using a 1:1 ratio of dNTPs to ddNTPs.
The values obtained for T7 DNA polymerase Y526F, E. coli DNA polymerase I and Taq DNA polymerase were obtained by determining the ratios of ddNTP to dNTP required to 35 generate a series of dideoxy-terminated fragments of comparable average length to a series generated using T7 DNA polymerase, E. coli DNA polymerase I F762Y, and Taq DNA polymerase F667Y, respectively; i.e., for each pair of wild-type and mutant DNA polymerases the ratios of SUgST1TtlTE SHEET (RULE 26) ddNTPs:dNTPs were determined that give a comparable distribution of dideoxy-terminated fragments. The ddNTP:dNTP ratio used in the reaction with the strongly discriminating enzyme (i.e. the one that contains phenylalanine at the critical position) divided by the ddNTP:dNTP ratio used to obtain a comparable distribution of dideoxy-terminated fragments with the relatively non-discriminating enzyme (i.e. the one that contains tyrosine at the critical position) gives a factor that corresponds to the difference in efficiency between the two DNA polymerases in their use of ddNTPs relative to the comparable dNTP. This factor was multiplied by the absolute ratios obtained for T7 DNA polymerase, E. coli DNA polymerase I F762Y, and Taq DNA polymerase F667Y in Example 6 in order to obtain the values shown below for T7 DNA polymerase Y526F, E. coli DNA polymerase I and Taq DNA polymerase, respectively.
SUBSTITUTE SHEET (RULE 26j Polymerase Incorporation Rate Ratios dG/ddG dA/ddA dT/ddT dC/ddC
polymerase 3.2 3.3 2.8 3.7 polymerase Y526F 6,400 7,300 8,400 11,000 E. coli DNA
polymerase I 140 720 1,100 2S0 E. coli DNA
polymerase I
F762Y 0.56 0.72 0.54 0.75 Taq DNA
polymerase 1,400 4,700 4,500 2,600 Taq DNA
polymerase F667Y 0.45 0.59 0.56 0.32 The Table below summarizes the effect of having tyrosine in place of phenylalanine at the critical selectivity residue of T7 DNA polymerase, E. coli DNA
polymerase I and Taq DNA polymerase on the use of ddNTPs relative to dNTPs.
Residue Average Rate Improvement use of in dN/ddN ddNTPs T7 DNA Tyrosine (WT) 3.0 3,000 X
polymerase Phenylalanine 8,000 E. coli DNA Phenylalanine (WT~ 600 polymerase I Tyrosine 0.6 1000 X
$tMTUTE SHEET (RULE 2S, 2201,8.-85' Taq DNA Phenylalanine (WT) 3,000 polymerase Tyrosine 0.5 6,000 X
To use this test to determine the extent of discrimination of a new DNA polymerase, reactions would be carried out as described above initially using a wide range of ratios of ddNTPs to dNTPs, and comparing the distribution of dideoxy-terminated fragments on a denaturing polyacrylamide gel to those of a standard, e.g. T7 DNA polymerase. Matching the lanes that have comparable average lengths of DNA fragments, the ratio of ddNTPs:dNTPs of the new DNA polymerase is divided by the ratio used with T7 DNA polymerase to give the level of discrimination against ddNTPs by the new DNA polymerase relative to T7 DNA polymerase.
To use this test to determine whether the modification of a DNA polymerase results in a decrease in its ability to discriminate against ddNTPs (i.e.
incorporate dideoxynucleotides more efficiently), an identical number of units of modified and unmodified DNA
polymerases would be used in a series of reactions containing varying ratios of ddNTPs to dNTPs as described above. The average length of dideoxy-terminated fragments are compared for identical ratios of ddNTPs to dNTPs for the two enzymes. If the modification has resulted in a DNA polymerase that incorporates dideoxynucleotides more efficiently, the average length of dideoxy-terminated fragments will be shorter for reactions using the modified DNA polymerase compared with those using the unmodified DNA polymerase at the same ratios of ddNTP to dNTP, while the average length will be longer for reactions using the modified DNA polymerase if the modification resulted in a DNA polymerase that is more discriminatory towards ddNTPs.
This test can also be used to determine whether a modification of a DNA polymerase results in a decrease in its ability to discriminates against analogs of ddNTPs, for example fluorescently tagged ddNTPs. This is SUBSTIME SHFET (RUl.E 26) possible even if one does not: know the concentration of the analogs being tested. As an example of this, we compared the ability of Taq DNA polymerase and Taq DNA
polymerase F667Y to use each of the four DyeDeoxy Terminators manufactured by Applied BiosystemsTN (part number 401150). These DyeDeoxy Terminators have four different fluorescent moieties covalently bound to each of the four ddNTPs (see Example 12 for more detail). For each of the DyeDeoxy Terminat:.ors, the ratio of dNTPs to DyeDeoxy Terminators was varied over a 16,000 fold range by intervals of two-fold, and the pattern of dideoxy-terminated fragments was compared on the autoradiograph to determine the ratios required for each of the two enzymes to obtain the same average length of dideoxy-terminated fragments. The Table below summarizes these results. For each Terminator,, the column labeled "Rat.io"
represents the ratio of the ratios of ddNTP to dNTP
required to give fragments of identical average length for Taq DNA polymerase versus Taq DNA polymerase F667Y.
As with normal ddNTPs, I'aq DNA polymerase F667Y
incorporates the fluorescent ddNTP derivatives much more efficiently that does the unmodified Taq DNA polymerase, by at least a factor of 400.
DyeDeoxy Terminator Ratio G Terminator >400 A Terminator >2,000 T Terminator >2,000 C Terminator >2,000 As discussed previously, one complication that can arise in the use of this test. is when the DNA polymerase being tested has an associazed exonuclease activity. The problems that 5' to 3' and 3' to 5' exonucleases can cause and ways to minimize their effects when present are discussed in Example 6. When testing to determine whether the modification of a polymerase decreases its ability to incorporate dideoxynucleotides, one class of WO 96/12042 2201 U~ j PCT/US95/120 mutants that can have this effect are ones that inactivate a normally very active 3' to 5' exonuclease activity (see for example Tabor and Richardson 84, Proc.
Natl. Acad. Sci. 4767, 1987). This class of mutants are 5 not claimed in this patent. If one has a modified DNA
polymerase that gives an apparent increase in the ability of the DNA polymerase to incorporate dideoxynucleotides, and one wants to determine whether it is in the polymerase domain or the exonuclease domain, it is 10 necessary to carry out an exonuclease assay on the modified and unmodified forms of the enzyme; a mutation .that affects primarily the exonuclease activity of the enzyme will have a greater effect on the exonuclease activity of the enzyme than on the polymerase activity.
15 Preferably, one would measure the exonuclease activity on a DNA substrate labeled at its 3' end with 32P-ddAMP (see Example 21). As in Example 6, it is important to inhibit pyrophosphorolysis in these reactions in order to avoid it increasing the apparent discrimination against ddNTPs 20 by a DNA polymerase. This is readily accomplished by including pyrophosphatase in the reaction.
Example Q. Determination of the efficiency of incorporation of dideoxynucleotides by inhibition of DNA
25 synthesis on a single-stranded M13 DNA - unl beled 40-mer primer complex In this example the sensitivity of a DNA polymerase to a ddNTP is determined by measuring the ability of varying concentrations of the ddNTP to inhibit a standard 30 DNA synthesis reaction. The DNA synthesis assay is a modification of that described in Tabor and Richardson 264 J. Biol. Chem. 6447, 1989. The 40-mer primer and the M13 mGPl-2 template are as described in Example 3.
The primer is annealed to the M13 mGPl-2 single-stranded 35 DNA template in a reaction mixture (1X = 25 1) containing 2 g of M13 mGPl-2 DNA, 6 pmoles of primer (a 10-fold molar excess to template), 40 mM Tris-HCl, pH
8.0, 10 mM MgC12 and 100 mM NaCl. The mixture is incubated 65 C for 2 min and then cooled to room SUBSTiTUTE SHEET (R111.E 26) '2~2U1885 temperature over 30 min. The standard reaction mixture (45 1) contained 22 mM Tris-HCl, pH 8.0, 5.5 mM MgC1a1 55 mM NaCl, 300 M dGTP, dATP, dCTP and [3H] TTP (30 cpm/pmol), and varying concentrations of one of the four or all four ddNTPs. The reaction mixtures also contained ng of yeast inorganic pyrophosphatase to inhibit pyrophosphorolysis that could otherwise increase the apparent discrimination by the DNA polymerase (Tabor and Richardson 265 J. Biol. Chem. 8322, 1990). The mixtures 10 are incubated at 37 C for 1 min (70 C for thermophilic DNA polymerases), and the reactions are initiated by the addition of 5 1 aliquots of dilutions (0.01 to 1 unit) of the DNA polymerase being analyzed, diluted in 20 mM
Tris=HC1, pH 7.5, 10 mM 2-mercaptoethanol and 0.05%
bovine serum albumin. The reaction mixtures are further incubated at 37 C for 10 min (70 C for thermophilic DNA
polymerases). The reactions are terminated by the addition of 5 1 of 100 mM EDTA, and 45 1 is spotted onto Whatman DE81 filter discs. The discs are washed in 4 changes of 150 ml of 0.3 M ammonium formate, p H 8.0, followed by 2 changes of 150 ml of 90% ethanol, each of 5-10 min duration. The disks are then dried under a heat lamp and counted in a scintillation counter in the presence of 5 ml of fluor (Opti-Fluor 0, Packard). From the amount of radioactivity on each disk, the amount of total DNA synthesis is calculated.
Specific DNA polymerases being tested may have optimum buffer, pH, salt, or temperature conditions that differ from those suggested above. Each DNA polymerase should be tested under the conditions that give optimum specific polymerase activity for that enzyme.
To determine whether a modification of a given DNA
polymerase results in a decrease in its ability to discriminate against dideoxynucleotides, first a series of reactions are carried out in the absence of ddNTPs, varying the DNA polymerase concentration to determine the range where the activity varies approximately linearly with enzyme concentration for both the modified and unmodified forms of the enzyme. An enzyme concentration SUBSTIME %IEET f RUl.E 261 WO 96/12042 2201885 PCT/US95/12*
is chosen that is in this linear range for both forms of the enzyme; e.g., an enzyme concentration whereby about 30!k of the template is replicated in the 10 min reaction is likely to be in such a linear range.
Once a proper enzyme concentration is chosen, a series of reactions are carried out varying the amount of either one ddNTP or preferably all four ddNTPs in the mixture, in order to determine the concentration required to inhibit 50; of the DNA synthesis. For example, under the conditions stated above (300 M 4dNTPs), the following concentrations of a mixture of 4 ddNTPs are required to inhibit 50%- of the DNA synthesis for the following six DNA polymerases:
Polymerase [4ddNTP] for 50o inhibition T7 DNA polymerase 0.1 M
T7 DNA polymerase Y526F 300 AM
E. coli DNA polymerase I 20 M
E. coli DNA polymerase I F762Y 0.04 AM
Taq DNA polymerase 150 M
Taq DNA polymerase F667Y 0.4 M
This test can be used to determine if a modification in a new DNA polymerase results in a decrease in its 5 ability to discriminate against ddNTPs; if the mutation does have this effect, then a higher concentration of 4ddNTPs will be required to inhibit 50% of the DNA
synthesis in the assay described above.
10 Example 9. Determination of the efficiency of incor-poration of dideoxynucleotides by measurina the incorporation of [a-32 P1 dAMP into synthetic iprimer-template complexes S1IBSTITUTE SHEET (RULE 26) :~ 201885 In this example the competition between a dNTP and a ddNTP is assayed for use at a single site in a synthetic primer-template. This assay differs from the others in that it limits the comparison of the use of the two substrates to a single site, avoiding the complication of sequence-specific variability in discrimination. While this relatively simple assay is suitable for a preliminary screen of DNA polymerases for their ability to discriminate against ddNTPs, it should not be used to the exclusion of the assays presented in Examples 6-8 since often the discrimination against ddNTPs is strongly influenced by the neighboring sequences, an important problem for DNA sequence analysis (see for example Tabor and Richardson 265 J. Biol. Chem. 8322, 1990).
The two primer-templates shown below are used in this example. The first is used to determine the discrimination between dATP versus ddATP, while the second is used to determine the discrimination between dCTP versus ddCTP, dTTP versus ddTTP, and dGTP versus ddGTP.
Primer-Template A:
5' GGCGACGTTGTAAAACGACGGCCAGTGCCA 3' 3' GCTGCAACATTTTGCTGCCGGTCACGGTTCCCC 5' Primer-Template B:
5' GGCGACGTTGTAAAACGACGGCCAGTGCCA 3' 3' GCTGCAACATTTTGCTGCCGGTCACGGTCAGTTTT 5' Each reaction mixture contains 25 pmoles each of primer and template. The primer and template are mixed together and annealed in a reaction mixture (1X = 10 1) containing 40 mM Tris=HCl, pH 8.0, 10 mM MgC12 and 100 mM
NaCl. The mixture is incubated 65 C for 2 min and then SUSSTIME SHEET (RULE 26) WO 96112042 2201885 PCT/US95/12*
cooled to room temperature over 30 min. The standard reaction mixture (45 1) for reactions carried out with Primer-Template A contains 22 mM Tris=HC1, pH 8.0, 5.5 mM
MgC12, 55 mM NaCl, 25 pmoles of the Primer-Template A
complex, 5 M[a-32P] dGTP (4,000 cpm/pmole) and varying concentrations of dATP and ddATP. The reaction mixtures also contained 10 ng of yeast inorganic pyrophosphatase to inhibit pyrophosphorolysis that could otherwise increase the apparent discrimination by the DNA
polymerase (Tabor and Richardson 265 J. Biol. Chern. 8322, 1990). The mixtures are incubated at 37 C for 1 min (70 C for thermophilic DNA polymerases), and the reactions are initiated by the addition of 5 1 aliquots (0.01 to 1 unit) of the DNA polymerase being analyzed diluted in 20 mM Tris-HCl, pH 7.5, 10 mM 2-mercaptoethanol and 0.05% bovine serum albumin. The reaction mixtures are further incubated at 37 C for 10 min (70 C for thermophilic DNA polymerases). The reactions are terminated by the addition of 5 1 of 100 mM EDTA, and 45 1 is spotted onto Whatman DE81 filter discs. The discs are washed in 4 changes of 150 ml of 0.3 M ammonium formate, p H 8.0, followed by 2 changes of 150 ml of 90% ethanol, each of 5-10 min duration. The disks are then dried under a heat lamp and counted in a scintillation counter in 5 ml of fluor (Opti-Fluor 0, Packard). From the amount of radioactivity on each disk, the amount [31P]dGMP incorporated was determined. The assumption is made once a single dAMP residue has been incorporated to remove the block for the incorporation of dGMP residues, four [32PJdGMPs will be incorporated into each primer, and thus the number of dAMPs incorporated are one fourth the number of dGMPs incorporated.
All reactions are carried out with a constant amount of the DNA polymerase being analyzed; the amount of DNA
polymerase should be sufficient to replicate 50% of the total dCMP residues in the single-stranded region of the template in the 10 min incubation in the presence of 10 M dATP and the absence of ddATP. Specific DNA
polymerases being tested may have optimum buffer, pH, SUSSf INTE $HEET (RULE 2%
salt, or temperature conditions that differ from those suggested above. Each DNA polymerase should be tested under the conditions that give optimum specific polymerase activity for that enzyme. Control reactions 5 should also be carried out in which neither dATP nor ddATP are present; this defines the background DNA
synthesis that should be subtracted from each sample. In general this is less than 10% of the DNA synthesis obtained when dATP is present.
10 Reactions are then carried out with 10 M dATP and varying concentrations of ddATP, to determine the amount of ddATP required to inhibit DNA synthesis by 50%.
Examples are shown in the Table below for the concentration of ddATP required to inhibit 50% of the DNA
15 synthesis in the presence of 10 M dATP. The polymerases are defined as in Example 6.
Polymerase [ddATP] for 50o inhibition T7 DNA polymerase 30 M
T7 DNA polymerase Y526F >500 M
E. coli DNA polymerase I >500 M
E. coli DNA polymerase I F762Y 6 M
Taq DNA polymerase >500 M
Taq DNA polymerase F667Y 5 M
In order to carry out an analogous test measuring the discrimination against ddGTP, ddTTP or ddCTP, reactions are carried out identical to that described above except that Primer-Template B is used instead of Primer-Template A, and the reactions containing 10 M dGTP, dTTP and dCTP
and 5 M [a-32P] dATP (4,000 cpm/pmole) and varying concentrations of either ddGTP, ddTTP or ddCTP.
As with the other examples, DNA polymerases with a 3' to 5' exonuclease activity can interfere with this assay, MOTM SHEET (RU1.E 26) .
WO 96/12042 22" 16r p ~ J CT/iJS95/1290 making an enzyme more discriminatory against ddNTPs than that due to discrimination at the level of incorporation of the analog. In addition, enzymes with high levels of exonuclease activity can use up all the dNTPs in the reaction mixture (especially with the relatively low level of dNTPs present in these reactions), resulting in no net DNA synthesis (e.g. the native T7 DNA polymerase, see Tabor and Richardson 264 J. Biol. Chem. 6447, 1989).
In these cases the concentration of DNA polymerase and the incubation time of the reaction should be adjusted to obtain the maximum level of DNA synthesis in the absence of ddNTPs.
Example 10, De rmi nat-i nn of the effiCi E'nr'v of incorporation [a-32 Pl ddNMps into a ynthPt i c- primer-template complex In this example the competition between a dNMP and a ddNMP is assayed for incorporation at a single site in a synthetic primer-template. This assay differs from that in Example 9 in that the label is in [a-32P] ddATP, and thus the incorporation of ddAMP is being measured. This assay can be used to test whether a ddNTP is inhibiting a DNA polymerase by acting as a chain terminator, being incorporated into the 3'.end of the primer, or simply by binding the DNA polymerase and preventing further DNA
synthesis without actually being incorporated into the primer.
In the example below the incorporation of ddAMP is measured using [a-32P] ddATP and Primer-Template A (Example 9) :
Primer-Template A:
5' GGCGACGTTGTAAAACGACGGCCAGTGCCA 3' 3' GCTGCAACATTTTGCTGCCGGTCACGGTTCCCC 5' Incorporation of [a-32P] ddGMP, [a- 3:p ] ddCMP and .[a-32P]ddTMP could be similarly tested on the appropriate template (for example, Primer-Template B in Example 9).
StJMIME SHEET (RUIf 26) Each reaction mixture contains 25 pmoles each of primer and template (Primer-Template A, see above). The primer and template are mixed together and annealed in a reaction mixture (1X = 10 1) containing 40 mM Tris=HCl, pH 8.0, 10 mM MgClz and 100 mM NaCl. The mixture is incubated 65 C for 2 min and then cooled to room temperature over 30 min. The standard reaction mixture (45 1) contains 22 mM Tris=HCl, pH 8.0, 5.5 mM MgC12, 55 mM NaCl, 25 pmoles of the Primer-Template A complex, 2.5 M [a-32P]ddATP (Amersham PB10235, >5000 Ci/mmol diluted with cold ddATP to a specific activity of 4,000 cpm/pmole) and varying concentrations of dATP. The reaction mixtures also contained 10 ng of yeast inorganic pyrophosphatase to inhibit pyrophosphorolysis that could otherwise increase the apparent discrimination by the DNA
polymerase (Tabor and Richardson 265 J. Biol. Chem. 8322, 1990). The mixtures are incubated at 37 C for 1 min (70 C for thermophilic DNA polymerases), and the reactions are initiated by the addition of 5 1 aliquots (0.01 to 1 unit) of the DNA polymerase being analyzed diluted in 20 mM Tris-HC1, pH 7.5, 10 mM 2-mercaptoethanol and 0.05%- bovine serum albumin. The reaction mixtures are further incubated at 37 C for 10 min (70 C for thermophilic DNA polymerases). The reactions are terminated by the addition of 5 1 of 100 mM EDTA, and 45 1 is spotted onto Whatman DE81 filter discs. The discs are washed in 4 changes of 150 ml of 0.3 M ammonium formate, p H 8.0, followed by 2 changes of 150 ml of 90 s ethanol, each of 5-10 min duration. The disks are then dried under a heat lamp and counted in a scintillation counter 5 ml of fluor (Opti-Fluor 0, Packard). From the amount of radioactivity on each disk, the amount [32P]ddAMP incorporated is determined.
All reactions are carried out with a constant amount of the DNA polymerase being tested; the amount of DNA
polymerase should be that concentration which gives the highest level of incorporation of [32P] ddAMP into Primer-Template A in the 10 min incubation in the absence of dATP. Specific DNA polymerases being tested may have SI~SfITIJjE SHEET (RUL.E 2%
WO 96/12042 PCT/US95/1291&
optimum buffer, pH, salt, or temperature conditions that differ from those suggested above. Each DNA polymerase should be tested under the conditions that give optimum specific polymerase activity for that enzyme.
In order to use this assay to determine the level of discrimination against a ddNTP, reactions are carried out with a constant amount of the DNA polymerase and [32P]ddATP, in the presence or absence of 2.5 M dATP (an equimolar concentration to [32P]ddATP) and the effect the presence of dATP has on the incorporation of [32P]ddAMP
is determined. If a DNA polymerase does not discriminate between the incorporation of ddAMP and dAMP, and it has no 3' to 5' exonuclease activity, then the addition of dATP will inhibit incorporation of [32P]ddAMP by 501.1r.
This test is best for a DNA polymerase that efficiently incorporates ddNMPs, such as Taq DNA
polymerase F667Y. For DNA polymerases that discriminate strongly against ddNMPs, the previous assay is preferred, in which the label is in a dNTP other than the one being used in competition with the ddNTP, since in that case much higher concentrations of the ddNTP can be used.
However, with DNA polymerases that discriminate strongly against ddNMPs, if one is interested in testing whether a given mutation is reducing the level of discrimination against ddNMPs, this assay could be used by assaying the unmodified DNA polymerase on this substrate in the absence of dATP (measuring the incorporation of [32P]ddAMP
as a function of DNA polymerase concentration), and comparing the rate of incorporation to that of the mutant enzyme. If the mutation is reducing the discrimination against ddATP, then the mutant enzyme should have a higher specific activity for incorporation of [32P] ddAMP .
As with the other examples, DNA polymerases with a 3' to 5' exonuclease activity can interfere with this assay, making an enzyme more discriminatory against ddNTPs than that due to discrimination at the level of incorporation of the analog. And as in Example 9, enzymes with high levels of exonuclease activity can deplete all the dNTPs, resulting in no net incorporation of [32P]ddAMP. In SUBSTIME SHEET (RUi.E 26) these cases the concentration of DNA polymerase and the incubation time of the reaction should beadjusted to obtain the maximum level of incorporation of [32P] ddAMP
by the DNA polymerase being tested.
All of the above methods are based on radioactivity to detect either the length of the extended primer or the amount of DNA synthesis on the primer. The efficiency of incorporation of dideoxynucleotides by a DNA polymerase can also be measured nonradioactively. Two examples are presented below that make use of either fluorescent primers or fluorescent dye-dideoxy terminators that are detected on an Applied Biosystems Model 373A DNA
Sequencing System.
Example 11. Determination of the efficiency of incorporation of dideoxynucleotides using a fluorescent primer annealed to single-stranded DNA and gel electrophoresis In this example a fluorescently labeled primer is annealed to single-stranded DNA and DNA synthesis reactions are carried out using varying ratios of dNTPs to ddNTPs. The samples are then loaded onto an Applied Biosystems Model 373A DNA Sequencing System, and the length of each fluorescent fragment is determined by direct fluorescent detection during gel electrophoresis.
Reactions are carried out as described in Tabor and Richardson 265 J. Biol. Chem. 8322, 1990. The fluorescent primer used is "Fam" primer (Applied Biosystems). The DNA used is single-stranded M13 mGPl-2 DNA as described in Example 3. The primer is annealed to the M13 mGPl-2 single-stranded DNA template in a reaction mixture (1X = 10 1) containing 2 g of M13 mGPl-2 DNA, 5 ng of primer, 40 mM Tris-HCl, pH 8.0, 10 mM
MgC12 and 100 mM NaCl. The mixture is incubated 65 C
for 2 min and then cooled to room temperature over 30 min. The standard reaction mixture (18 1) contains 22 mM Tris=HC1, pH 8.0, 5.5 mM MgC12, 55 mM NaCl, and varying concentrations of the 4dNTPs and one of the four ddNTPs.
The reaction mixtures also contained 10 ng of yeast SIBSTINfE SHEET (RULE 26) 9(l inorganic pyrophosphatase to inhibit pyrophosphorolysis that could otherwise increase the apparent discrimination by the DNA polymerase (Tabor and Richardson 265 J. Biol.
Chem. 8322, 1990). The mixtures are incubated at 37 C
5 for 1 min (70 C for thermophilic DNA polymerases), and the reactions are initiated by the addition of 2 1 aliquots (0.01 to 1 unit) of the DNA polymerase being analyzed diluted in 20 mM Tris-HC1, pH 7.5, 10 mM 2-mercaptoethanol and 0.05% bovine serum albumin. The 10 reaction mixtures are further incubated at 37 C for 10 min (70 C for thermophilic DNA polymerases). The reactions are by the addition of 8 1 of 20 mM DTPA, 1 M potassium acetate, pH 5.0, and 60 1 of ethanol. After centrifugation, the DNA is resuspended in 6 1 of 80~;
15 formamide, 10 mM Tris=HC1, pH 8.0, and 1 mM EDTA, and heated at 80 C for 2 min immediately prior to loading on the polyacrylamide gel on the Applied Biosystems 373A DNA
Sequencing System following the manufacturer's procedures.
20 Specific DNA polymerases being tested may have optimum buffer, pH, salt, or temperature conditions that differ from those suggested above. Each DNA polymerase should be tested under the conditions that give optimum specific polymerase activity for that enzyme. The 25 concentration of DNA polymerase should be sufficient to extend most of the primers at least several hundred nucleotides or until a dideoxynucleotide has been incorporated in the 10 min reaction.
The ratio of dNTPs to ddNTPs is adjusted to obtain 30 optimum peak intensities for approximately three hundred bases. For example, for Taq DNA polymerase approximately 10 M 4dNTPs and 200-600 M ddNTPs is optimal, while for Taq DNA polymerase F667Y approximately 300 M 4dNTPs and 0.5-5 M ddNTPs is optimal.
35 To determine whether a modification of a given DNA
polymerase results in a decrease in its ability to discriminate against dideoxynucleotides, reactions should be carried out at varying ratios of dNTPs to ddNTPs for both the unmodified and modified DNA polymerases, and the MSTITt1TE SHEET (RULE 26) intensities of dideoxy-terminated fragments of different lengths are compared to determine whether the modified DNA polymerase is using ddNTPs more efficiently than the unmodified enzyme.
.xa le 12. Determination of the efficiency of incorporation of fluorescent dideoxynucleotides by gel electrophoresis In this example a nonfluorescent primer is annealed to single-stranded DNA and DNA synthesis reactions are carried out using varying ratios of dNTPs to a single fluorescently labeled-ddNTP. The samples are then loaded onto an Applied Biosystems Model 373A DNA Sequencing System, and the length of each fluorescent fragment is determined by direct fluorescent detection during gel electrophoresis. The primer used in this example is the 40-mer described in Example 3, and the template is the single-stranded M13 mGPl-2 described in Example 3. The primer is annealed to the M13 mGPl-2 single-stranded DNA
template in a reaction mixture (1X = 10 1) containing 2 g of M13 mGPl-2 DNA, 6 pmoles of primer (a 10-fold molar excess to template), 40 mM Tris=HC1, pH 8.0, 10 mM MgC12 and 100 mM NaCl. The mixture is incubated 65 C for 2 min and then cooled to room temperature over 30 min. The standard reaction mixture (18 1) contains 22 mM
Tris-HCl, pH 8.0, 5.5 mM MgC12, 55 mM NaCl, and varying concentrations of the 4dNTPs plus one of the four fluorescently-labeled ddNTPs. The four fluorescently labeled ddNTPs are from Applied Biosystems (Taq DyeDeoxy Terminator Cycle Qequencing Kit, part number 401150), and are referred to as G, A, T or C "DyeDeoxy Terminators"
(manual for Taq DyeDeoxy Terminator Cycle Sequencing Kit, part number 901497, Rev. E). The reaction mixtures also contained 10 ng of yeast inorganic pyrophosphatase to inhibit pyrophosphorolysis that could otherwise increase the apparent discrimination by the DNA polymerase (Tabor and Richardson 265 J. Bio1. Chem. 8322, 1990). The mixtures are incubated at 37 C for 1 min (70 C for thermophilic DNA polymerases), and the reactions are gMSTiNfE SHEET (RULE 26) ~~;~
WO 96/12042 2201PCT/US95/1291&
initiated by the addition of 2 1 aliquots (0. 01 to 1 unit) of the DNA polymerase being analyzed diluted in 20 mM Tris-HC1, pH 7.5, 10 mM 2-mercaptoethanol and 0.05%
bovine serum albumin. The reaction mixtures are further incubated at 37 C for 10 min (70 C for thermophilic DNA
polymerases). The reactions are by the addition of 8 1 of 20 mM EDTA, 1 M potassium acetate, pH 5.0, and 60 1 of ethanol. After centrifugation, the DNA is resuspended in 6 1 of 80o formamide, 10 mM Tris-HCl, pH
8.0, and 1 mM DTPA, and heated at 80 C for 2 min immediately prior to loading on the polyacrylamide gel on the Applied Biosystems 373A DNA Sequencing System following the manufacturer's procedures .
Specific DNA polymerases being tested may have optimum buffer, pH, salt, or temperature conditions that differ from those suggested above. Each DNA polymerase should be tested under the conditions that give optimum specific polymerase activity for that enzyme. The concentration of DNA polymerase used in these reactions should be that concentration that is sufficient to extend most of the primers at least several hundred nucleotides or until a dideoxynucleotide has been incorporated in the 10 min reaction. For DNA polymerases that have a 5' to 3' exonuclease activity, such as Taq DNA polymerase, the DNA polymerase concentration must be kept low enough to avoid this activity degrading a significant percentage of the 5' ends of the fragments.
To determine whether a DNA polymerase discriminates strongly or weakly against a fluorescent ddNTP, reactions are carried out using 20 M 4dNTPs and 0.01 1 of each DyeDeoxy Terminator provided by Applied Biosystems (part number 401150). When Taq DNA polymerase is used under these conditions, most of the fluorescence is either in unincorporated dye-ddNTPs at the leading front of the gel, or in fragments greater than several hundred bases in length. In contrast, with the Taq DNA polymerase mutant F667Y, under these conditions most of the fluorescence is in fragments that are less than several hundred bases in length, and a significantly lower SIBSTITUTE SHEET (RULE 26) = WO 96/12042 PCTIUS95/12928 percentage of the total fluorescence is in unincorporated dye-ddNTPs at the leading front of the gel.
To determine whether a modification of a given DNA
polymerase results in a decrease in its ability to discriminate against dideoxynucleotides, reactions are carried out at varying ratios of dNTPs to DyeDeoxy Terminators for both the unmodified and modified DNA
polymerases, and the average length of the resulting fluorescent fragments are compared to determine whether the modified DNA polymerase is using the DyeDeoxy Terminators more efficiently than the unmodified enzyme.
The following examples provide tests for determining the uniformity of band intensities produced from dideoxy-terminated fragments synthesized by different DNA
polymerases.
Exam,nl e 13. DetPr,T,; nat; nn of uniformity of incorporat inn of dideoxynuclPnridPs using a single-stranded M13 DNA -5' 32P-labeled 40-mer primer comipleX and gel electrophoresis In this example the uniformity of dideoxynucleotide incorporation is measured on a 5' 32P-end labeled primer extended on a single-stranded M13 DNA template. Three activities can cause variability in band intensity of dideoxy-terminated fragments. One is exonuclease activity that can preferentially at some sequences; this is avoided by removal of the activity selectively either by chemical or genetic means (see for example Tabor and Richardson 264, J. Biol. Chem. 6447, 1989). The second is pyrophosphorolysis; this is readily avoided by including pyrophosphatase in the reaction mixture, which degrades the pyrophosphate that accumulates during DNA
synthesis and is a necessary substrate for pyrophosphorolysis. The third is sequence-specific variability in the incorporation of dideoxynucleotides.
Variability in band intensity is detrimental to DNA
sequence analysis, decreasing the accuracy of the DNA
sequence determined. This test is designed to compare the degree of variability in band intensities in SUBSTiTUTE SHEET (RULE 26) -20,1885 fragments synthesized by different DNA polymerases, including mutant DNA polymerases that may incorporate dideoxynucleotides more efficiently.
The primer, template and reaction conditions in this Example are identical to that described in Examples 6 and 7. The template is M13 mGPl-2 single-stranded DNA
described in Example 3, and the primer is the 40 mer also described in Example 3. Reaction conditions used are those that are optimum for the DNA polymerase being tested with regard to buffer, pH, salts, and temperature of the reaction. It is preferred that magnesium is the only metal ion present in the reaction mixture (i.e. the reactions are carried out in the absence of added manganese ions). A concentration of DNA polymerase is chosen whereby most of the primers are extended in a 10 min reaction and are terminated by the incorporation of a dideoxynucleotide. The ratios of dNTPs to ddNTPs are adjusted for the specific DNA polymerase being tested so that the average fragment size is approximately 100-300 nucleotides. ddCTP is the preferred ddNTP to use for the test of uniformity since fragments terminated with this dideoxynucleotide tend to have the most variability in intensities (see for example Tabor and Richardson 86 Proc. Natl. Acad. Sci. 4076, 1989). Gel electrophoresis, autoradiography, and analysis of the band intensities by either scanning of the gel or phosphoimager analysis are as described in Example 6. Electrophoresis is carried out until fragments of approximately 55 nucleotides in length are at the bottom of the gel (the dye bromphenol blue has run off the bottom of the gel and the dye xylene cyanol is approximately 8 cm from the bottom of the gel).
For a given series of ddNMP-terminated fragments, for example a series of ddCMP-terminated fragments, the intensities of the first 20 fragments from the bottom of the gel are determined, preferably by Phosphoimager analysis. Alternatively, the autoradiograph can be scanned by an imaging densitometer to determine the relative intensities of the first 20 fragments. These intensities are then analyzed statistically as described SUBSTITUTE SHEET (RULE 26) = g WO 96/12042 2201885 pCT/US95/12928 in Example 6 in order to determine their variability.
For example, the values can be plotted using the Macintosh program Kaleidograph Version 3.0 (Synergy Software). The resulting plots are fit to an exponential 5 decay curve using the Kaleidograph library routine for this function. R2, the correlation index for the data, is calculated by the Kaleidograph library routine. This is a measure of the variability in band intensities. The values obtained for R2 using a new DNA polymerase are 10 compared to those obtained using known DNA polymerases, for example _28 T7 DNA polymerase (Sequenase Version 2.0, United States Biochemical Corporation) in the presence of magnesium or manganese (see Tabor and Richardson 265 J.
Biol. Chem. 8322, 1990), E. coli DNA polymerase I
15 (either Klenow fragment or Klenow fragment with the mutation F762Y) or Taq DNA polymerase (either wild-type or the mutant F667Y). The R 2 values obtained with these known DNA polymerases are used as standards by which to compare a new DNA polymerase for its uniformity.
Example 14 Determination of uniformity of incorporation of fa-32P1 ddNMPs using a sincrle-stranded M13 DNA -unlabeled arimer complex and gel electrophoresis In this example the uniformity of dideoxynucleotide incorporation is measured using an unlabeled primer annealed to a single-stranded M13 DNA template and carrying out DNA synthesis in the presence of [a-32P]ddATP. The test described in Example 13 is preferred over this one for measuring uniformity of dideoxynucleotide-terminated fragments, since it is more amenable to use of high concentrations of ddNTPs, which are required for use in enzymes that discriminate strongly against ddNTPs, such as E. coli DNA polymerase I, Taq DNA polymerase and T7 DNA polymerase Y526F. The test in this example is most suited for use with enzymes that incorporate dideoxynucleotides efficiently, such as T7 DNA polymerase, E. coli DNA polymerase I F762Y and Taq DNA polymerase F667Y.
The primer, template and general reaction conditions SUBSTITUTE SHEET (RULE 26) : J[~O~
WO 96/12042 :. PCT/US95/1290 in this example are similar to those described in Example 8, with the following exceptions. The template is M13 mGPl-2 single-stranded DNA described in Example 3, and the primer is the 40 mer also described in Example 3.
Reaction conditions used are those that are optimum for the DNA polymerase being tested with regard to buffer, pH, salts, and temperature of the reaction. It is preferred that magnesium is the only metal ion present in the reaction mixture (i.e. the reactions are carried out in the absence of added manganese ions). The reactions are carried out with 50 M dGTP, dCTP and dTTP, and varying concentrations of dATP and [a-32P)ddATP. The concentrations of dATP and [a-32P]ddATP are chosen to maximize the amount of radioactivity in fragments approximately 100 nucleotides in length. All other aspects with respect to gel electrophoresis and analysis of the radioactive fragments are as described in Example 13.
F:xampl e 15. D rmi nat-i nn of uniformity of incorporati nn of dideoxvnucleotidPs using a singlP-stranded M13 DNA -fluorescently-labeled primer comolex and gel electrophoresis In this example, reactions are carried out as described in Example 11. The template is M13 mGPl-2 single-stranded DNA described in Example 3, and the primer is the 40 mer also described in Example 3.
Reaction conditions used are those that are optimum for the DNA polymerase being tested with regard to buffer, pH, salts, and temperature of the reaction. It is preferred that magnesium is the only metal ion present in the reaction mixture (i.e. the reactions are carried out in the absence of added manganese ions) . A concentration of DNA polymerase is chosen whereby most of the primers are extended in a 10 min reaction and are terminated by the incorporation of a dideoxynucleotide. The ratios of dNTPs to ddNTPs are adjusted for the specific DNA
polymerase being tested so that the average fragment size is approximately 100-200 nucleotides. ddCTP is the SUBSTITUTE SHEET (RULE 26) preferred ddNTP to use since fragments terminated with this dideoxynucleotide tend to have the most variability in intensities (see for example Tabor and Richardson 86 Proc. Natl. Acad. Sci. 4076, 1989). The intensities of up to the first 50 dideoxy-terminated fragments from the primer (approximately 200 nucleotides) are determined, and they are analyzed statistically as described in Example 13. The correlation index R2 is determined for the DNA polymerase being tested, and compared with that obtained with known DNA polymerases such as those described in Example 13. Alternatively, the heights of the first 50 bands are determined and =the ratio of heights of adjacent bands are calculated and used as a measure of variability; the maximum and mean of these ratios obtained from reactions carried out with the DNA
polymerase being tested are compared with the values obtained for reactions carried out using known DNA
polymerases such as those described in Example 13.
Example 16. Determination of uniformity of incorooration of fluorescent dideoxvnucleotides by gel electrophoresis In this example, reactions are carried out as described in Example 12. In order to determine the uniformity of incorporation of the DyeDeoxy Terminators for a specific DNA polymerase, the concentration of dNTPs and the specific DyeDeoxy Terminator are chosen to obtain fluorescently labeled fragments that average 100-200 nucleotides in length. The intensity of fluorescence is determined for fragments 10 to 40 from the primer (the first 10 fragments near the fluorescently labeled primer are ignored). The fragments are analyzed statistically as described in Example 13, and the average variability is defined by R2, the correlation index for the data fitted to an exponential decay curve. The values obtained for R2 are compared with those obtained using known DNA polymerases as described in Example 13. To determine if a specific mutation in a DNA polymerase is resulting in that DNA polymerase producing bands that have less variability, the R2 value obtained for the SUBSTITUTE SHEET (RULE 26) WO 96/12042 PCT/US95/129.0 mutant DNA polymerase is compared to that obtained for the unmodified DNA polymerase.
Example 17: DNA secTuence analysis using a DNA polymerase that efficiently incorporates dideoxynuclPor;dPG
DNA sequence analysis with a DNA polymerase of this invention is carried out using standard procedures, with the ratio of dNTPs to ddNTPs adjusted to obtain dideoxy-terminated fragments of an average length appropriate for separation by electrophoresis. For the mutant in the large fragment of E. coli DNA polymerase I, "Klenow .fragment F762Y", reactions are carried out essentially as with modified T7 DNA polymerase and described in Tabor and Richardson U. S. patent number 4,795,699, Tabor and Richardson 84, Proc. Natl. Acad. Sci. USA 4767, 1987, and SEQUENASE manual, "Step-By-Step Protocols For DNA
Sequencing With SEQUENASE" 3rd Edition, United States Biochemical Corporation. Since Klenow fragment F762Y
incorporates dideoxynucleotides approximately 5 times more efficiently than modified T7 DNA polymerase, the concentration of ddNTPs in the extension-termination mixtures should be reduced by a factor of five compared with the standard mixtures recommended for modified T7 DNA polymerase (Sequenase manual, supra).
DNA sequence analysis with a thermostable DNA
polymerase such as Taq DNA polymerase F667Y is as described by Innis et al. 85, Proc. Natl. Acad. Sci. USA
9436, 1988, with the following modification. Whereas Innis et al. recommend, dNTP/ddNTP ratios of 1:6 dGTP:ddGTP, 1:32 dATP:ddATP, 1:48 dTTP:ddTTP and 1:16 dCTP:ddCTP, these ratios must be adjusted to take into account the 3,000- 8,000 fold more efficient use of the 4 ddNTPs by the Taq DNA polymerase F667Y compared with the wild-type Taq DNA polymerase. Thus the extension-termination reactions with Taq DNA polymerase F667Y
should contain 100 M 4dNTPs and 0.1 - 5 M each of the four ddNTPs; the exact amount of each ddNTP should be adjusted based on the desired average fragment size for optimum determination of DNA sequence. All other aspects SUBSTIME SHEET (RULE 26) . ~
of the DNA sequencing reactions and denaturing gel electrophoresis are as described in Innis et al. (supra).
ExamialP 18: Cycle DNA seauence analysis usinq a +-Ho,-.,,os,-ahlP DNApolYmerase that efficiently incornorates di o ynucleoti des Cycle DNA sequencing with a thermostable DNA
polymerase such as Taq DNA polymerase F667Y is carried out as described in Carothers et al. 7 BioTechniques 494, 1989 except that: (1) The four deoxy/dideoxy NTP mixes contain 250 M of all four dNTPs and 0.1 - 10 M of either ddGTP, ddATP, ddTTP or ddCTP, the exact amount of each ddNTP adjusted empirically based on the desired average fragment size for optimum determination of DNA
sequence. (2) Taq DNA polymerase F667Y is used in place of Taq DNA polymerase, using the same number of units of DNA polymerase as recommended by Carothers et al.
(supra). (3) The reaction mixtures contain 10 ng of inorganic pyrophosphatase to inhibit pyrophosphorolysis that could otherwise increase the apparent discrimination by the DNA polymerase at specific sequences, reducing the uniformity of band intensities (Tabor and Richardson 265 j. Biol. Chem. 8322, 1990). Preferably this pyrophosphatase is purified from a thermophilic organism, for example from Thermus thermophilus .(Hohne et al. 47 Biomed. Biochim. Acta 941, 1988).
ExanwlP 19. = Automated cycle DNA seguencina usina modified Taa DNA polymerase and fluorescent primers Cycle DNA sequencing with a thermostable DNA
polymerase such as Taq DNA polymerase F667Y and the Applied Biosystems Dye Primers is a modification of the procedure described in the Applied Biosystems manual (Part Number 901482, Rev. B). The procedure is identical to that described in the manual with the following modifications: (1) The dNTP/ddNTP mixes must be modified to take into account the more efficient use of ddNTPs by Taq DNA polymerase F667Y compared to Taq DNA
polymerase. The new mixes that should be used in place SIIBSTITUTE SHEET (RULE 26) WO 96/12042 ~2U 1885 PCT/US95/120 of the ones listed in the Applied Biosystems manual are as follows:
dG/ddG mix = 100 M c'dGTP, dATP, dTTP and dCTP, and 5 0.05 M ddGTP
dA/ddT mix = 100 M c'dGTP, dATP, dTTP and dCTP, and 0.3 M ddATP
dT/ddT mix = 100 M c'dGTP, dATP, dTTP and dCTP, and 0.25 M ddTTP
10 dC/ddC mix = 100 M c'dGTP, dATP, dTTP and dCTP, and 0.15 M ddCTP
The concentrations of ddNTPs should be varied to optimize the intensity of fluorescence in fragments of 15 specific size ranges, depending on the application. For example, increasing the concentration of ddNTPs will increase the fluorescence of fragments of shorter length.
(2) Taq DNA polymerase F667Y is used in place of Taq DNA
polymerase. The same number of units of enzyme are used 20 in both cases, when assayed under standard DNA polymerase assay conditions. Alternatively, the same number of DNA
polymerase molecules can be used. (3) The reaction mixtures contain 10 ng of inorganic pyrophosphatase to inhibit pyrophosphorolysis that could otherwise increase 25 the apparent discrimination by the DNA polymerase at specific sequences, reducing the uniformity of band intensities (Tabor and Richardson 265 J. Biol. Chem.
8322, 1990). Preferably this pyrophosphatase is purified from a thermophilic organism, for example from Thermus 30 thermophilus (Hohne et al. 47 Biomed. Biochim. Acta 941, 1988). All other aspects of the procedures are identical to that described in the Applied Biosystems manual (supra).
35 Example 20. Automated cycle DNA seguencina usina mod; f; Prd Taa DNA y?olymerase and fluorescent dye terminators Cycle DNA sequencing with a thermostable DNA
polymerase such as Taq DNA polymerase F667Y and the SUBSTITUTE SMEET (RULE 26) Applied Biosystems DyeDeoxy Terminators is a modification of the procedure in the Applied Biosystems manual (Part Number 901497, Rev. E). The procedure is identical to that described in the manual with the following modifications: (1) The manual calls for the use of 1 1 of each of the 4 DyeDeoxy Terminators undiluted in each sequencing reaction (20 1 reaction). In this Example the Terminators should be diluted prior to addition to the sequencing reaction mixture, since they are incorporated more than several hundred fold more efficiently with Taq DNA polymerase F667Y than with Taq DNA polymerase. 1 1 of each of the following dilutions are added to each sequencing reaction in place of 1 1 of the undiluted Terminator solutions:
DyeDeoxy G Terminator, 1 to 500 in H20 DyeDeoxy A Terminator, 1 to 1,500 in HZO
DyeDeoxy T Terminator, 1 to 1,500 in H20 DyeDeoxy C Terminator, 1 to 1,000 in H20 These dilutions are approximations only; the precise dilution of each DyeDeoxy Terminator should be determined empirically depending on the number of bases of DNA
sequence to be determined from the primer. (2) Taq DNA
polymerase F667Y is used in place of Taq DNA polymerase.
The same number of units of enzyme are used in both cases, when assayed under standard DNA polymerase assay conditions. Alternatively, the same number of DNA
polymerase molecules can be used. (3) The reaction mixtures contain 10 ng of inorganic pyrophosphatase to inhibit pyrophosphorolysis that could otherwise increase the apparent discrimination by the DNA polymerase at specific sequences, reducing the uniformity of band intensities (Tabor and Richardson 265 J. Biol. Chem.
8322, 1990). Preferably this pyrophosphatase is purified from a thermophilic organism, for example from Thermus thermophilus (Hohne et al. 47 Biomed. Biochim. Acta 941, 1988).
SU85TiTi1TE SHEET (RULE 26) WO 96/12042 12018U J PCT/US95/129*
Since this procedure uses over 500 times less DyeDeoxy Terminators that previous procedures, there is much less of a problem with unincorporated DyeDeoxy Terminators after the reactions is complete. Thus it is not necessary to remove the unincorporated DyeDeoxy Terminators by.passing the sample over a spin column, as recommended in the Applied Biosystems manual (supra).
Rather, the sample can be precipitated with ethanol, taken up in 5 1 deionized formamide and 1 1 50 mM EDTA, pH 8.0, heated at 90 C for 2 min, and loaded onto the Applied Biosystems 373A DNA Sequencing System according to the instructions in the 373A User's Manual. It is possible that with a DNA polymerase that incorporates the DyeDeoxy Terminators efficiently (such as Taq DNA
polymerase F667Y) that concentration of the DNA
sequencing reaction by precipitation with ethanol is not necessary; the DNA cycle sequencing reactions, carried out using preferably a high concentration of primer and dNTPs (see below), can be terminated by the addition of an equal volume of deionized formamide, heated at 90 C
for 2 min, and loaded immediately onto the Applied Biosystems 373A DNA Sequencing System. This represents a.major savings in time to the researcher preparing the samples for DNA sequence determination.
The procedure described above uses a relatively low concentration of dNTPs at the outset (7.5 M dATP, dTTP
and dCTP and 37.5 M dITP). The concentration of dNTPs decrease during the cycle DNA sequencing reaction as the dNTPs are used. This concentration of dNTPs (less than 7.5 M) is less than optimal for maximum DNA polymerase activity for most DNA polymerases. This low concentration was necessary with previous protocols because the DNA polymerase that was used discriminated strongly against ddNTPs, requiring a high ratio of ddNTPs to dNTPs. The use of a DNA polymerase of this invention that discriminates much less against the DyeDeoxy Terminators now allows the use of higher concentrations of dNTPs. For example, in the protocol described above a 10-fold higher concentration of dNTPs and DyeDeoxy SU85TITUTE SHEET (RULE 26) Terminators could be used; i.e. 75 M dATP, dTTP and dCTP, 375 M dITP, and the following dilutions of each of the four DyeDeoxy Terminators: DyeDeoxy G Terminator, 1 to 50 in H20; DyeDeoxy A Terminator, 1 to 150 in 4 0;
DyeDeoxy T Terminator, 1 to 150 in H20; DyeDeoxy C
Terminator, 1 to 100 in H20. Thus in this example the DyeDeoxy Terminator concentrations are still at least lower than that in previous protocols by a factor of at least 50.
F_x_amnle 21. Exonuclease assay Lsing a f32P1 dd_AMP
terminated DNA substrate The 3'[32P]ddAMP-terminated DNA substrate is prepared by digesting native calf thymus DNA with HindiIl in a reaction mixture (50 1) that contains 10 g double-stranded calf thymus DNA, 40 imM Tris.HC1, pH 7.5, 10 mM
MgCl2, 5 mM dithiothreitol, 50 mM NaCl, 50 g/ml bovine serum albumin, and 10 units HindIII. After incubation at 37 C for 60 min, 5 1 of [a-32P]ddATP (Amersham PB10235, >5000 Ci/mmol) and 5 units Sec"uenase Version 2.0 (United States Biochemical Corporation, catalog.number 70775) are added, and the mixture is incubated at 20 C for 15 mir_.
The reaction mixture is extracted once with an equal volume phenol:chloroform:isoamyl alcohol (24:24:1), and fractionated on a 1 ml column of Sephadex'M G1100 (Pharmacia) in 20 mM Tris=HC1, pH 7.5, 2 mM EDTA, 100 mM
NaCl. The 3' 32P-labeled DNA, that elutes in the void volume, has a specific activity of approximately 500 cpm/ng of total DNA.
Reaction mixtures for exonuclease assays (90 1) contain 40 mM Tris-HCl, pH 7.5, 10 mM MgC1Z1 10 mM
dithiothreitol, 50 mM KC1 and 1 nmol 3' 32P-labeled DNA.
The reaction mixtures also contain 10 ng of yeast inorganic pyrophosphatase to remove trace amounts of pyrophosphate and thus prevent pyrophosphorolysis (Tabor and Richardson 265 j. Biol. Chem. 8322, 1990). This mixture is preincubated at 20 C for 1 min, then 10 1 of the appropriate enzyme dillstion is added. After incubation at 37 C for the indicated times, the WO 96/12042 22U1g8'J-~ PCT/US95/1290 reactions are stopped by the addition of 30 1 of bovine serum albumin (10 mg/ml) and 30 1 of trichloracetic acid (100% w/v). The precipitated DNA is incubated at 0 C
for 15 min, and then pelleted by centrifugation at 12,000 g for 30 min. The acid-soluble radioactivity is measured in 100 1 of the supernatant. One unit of 3'[32P] ddAMP-DNA exonuclease activity catalyzes the acid solubilization of one pmol of [32P] ddAMP in 15 min.
Other embodiments are within the following claims.
~$Z~~~ SKET 1~U 2M
(1) GENERAL INFORMATION:
(i) APPLICANT:
(A) NAME President & Fellows of Harvard College (B) STREET 17 Quincy Street (C) CITY Cambridge (D) STATE Massachusetts (E) COUNTRY United States of America (F) POSTAL CODE 02138 (ii) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE Richard J. Warburg Lyon & Lyon (B) STREET 633 West Fifth Street Suite 4700 (C) CITY Los Angeles, CA 90071 (D) COUNTRY USA
(E) POSTAL CODE 90071 (A) TELEPHONE: (213) 955-0440 (B) TELEFAX: (213) 489-1600 (C) TELEX: 67-3510 (iii) NUMBER OF SEQUENCES: 24 (iv) TITLE OF INVENTION: DNA POLYMERASES HAVING
MODIFIED NUCLEOTIDE BINDING
SITE FOR DNA SEQUENCING
(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: 3.5" Diskette, 1.44 Mb storage (B) COMPUTER: IBM PS/2 Model 50Z or 55SX
(C) OPERATING SYSTEM: IBM P.C. DOS (Version 3.30) (D) SOFTWARE: WordPerfect (Version 5.0) (vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: NOT YET ASSIGNED
(B) FILING DATE:
SWSTITUU SHEU (RULE 2S) 220 188a (vii) PRIOR APPLICATION DATA:
Prior applications total, including application described below: TWO
(A) APPLICATION NUMBER: 08/324,437 (B) FILING DATE: 17-OCT-1994 (A) APPLICATION NUMBER: US NOT YET ASSIGNED
(B) FILING DATE: 10-NOV-1994 '(2) INFORMATION FOR SEQ ID NO: 1:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 1:
Arg Arg Ser Ala Lys Ala Ile Asn Phe Gly Leu Ile Tyr Gly (2) INFORMATION FOR SEQ ID NO: 2:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 2:
Arg Arg Ala Ala Lys Thr Ile Asn Phe Gly Val Leu Tyr Gly (2) INFORMATION FOR SEQ ID NO: 3:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 "SME~~~U 2M
= WO 96/12042 PCTIUS95/12928 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii),SEQUENCE DESCRIPTION : SEQ ID NO: 3:
Arg Asp Asn Ala Lys Thr Phe Ile Tyr Gly Phe Leu Tyr Gly SUBSTiTtM SHEET {RULE 26J
~~2~ 1 ~~~, (2) INFORMATION FOR SEQ ID NO: 4:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 4:
Arg Asp Asn Ala Lys Thr Phe Ile Tyr Gly Phe Leu Tyr Gly (2) INFORMATION FOR SEQ ID NO: 5:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 5:
Arg Arg Ser Ala Lys Ala Ile Asn Phe Gly Leu Ile Tyr Gly (2) INFORMATION FOR SEQ ID NO: 6:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 6:
Arg Arg Ser Ala Lys Thr Phe Ile Tyr Gly Phe Leu Tyr Gly SU8ST1TM SHM (RUtf 261 (2) INFORMATION FOR SEQ ID NO: 7:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 7:
Arg Asp Asn Ala Lys Ala Ile Asn Phe Gly Phe Leu Tyr Gly (2) INFORMATION FOR SEQ ID NO: 8:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 8:
Arg Asp Asn Ala Lys Ala Ile Ile Tyr Gly Phe Leu Tyr Gly (2) INFORMATION FOR SEQ ID NO: 9:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 9:
Arg Asp Asn Ala Lys Thr Phe Asn Phe Gly Phe Leu Tyr Gly (2) INFORMATION FOR SEQ ID NO: 10:
(i) SEQUENCE CHARACTERISTICS:
SUBSfITUTE SHW (MU 2M
WO 96/12042 Z20 188" PCT/US95/129 (A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 10:
Arg Asp Asn Ala Lys Thr Phe Asn Tyr Gly Phe Leu Tyr Gly (2) INFORMATION FOR SEQ ID NO: 11:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 11:
Arg Asp Asn Ala Lys Thr Phe Ile Phe Gly Phe Leu Tyr Gly (2) INFORMATION FOR SEQ ID NO: 12:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 12:
Arg Arg Ser Ala Lys Ala Ile Asn Phe Gly Leu Ile Tyr Gly (2) INFORMATION FOR SEQ ID NO: 13:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear SW1ME SMW. (AJ1.f 26) (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 13:
Arg Asp Asn Ala Lys Thr Phe Ile Tyr Gly Phe Leu Tyr Gly (2) INFORMATION FOR SEQ ID NO: 14:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 14:
Arg Arg Ser Ala Lys Thr Phe Ile Tyr Gly Leu Ile Tyr Gly (2) INFORMATION FOR SEQ ID NO: 15:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 15:
Arg Arg Ser Ala Lys Thr Phe Asn Phe Gly Leu Ile Tyr Gly (2) INFORMATION FOR SEQ ID NO: 16:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 16:
StJBSTITUTE SHEET (RULE 26) Arg Arg Ser Ala Lys Ala Ile Ile Tyr Gly Leu Ile Tyr Gly (2) INFORMATION FOR SEQ ID NO: 17:
U) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 17:
Arg Arg Ser Ala Lys Ala Ile Ile Phe Gly Leu Ile Tyr Gly (2) INFORMATION FOR SEQ ID NO: 18:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 18:
Arg Arg Ser Ala Lys Ala Ile Asn Tyr Gly Leu Ile Tyr Gly (2) INFORMATION FOR SEQ ID NO: 19:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 19:
Arg Arg Ala Ala Lys Thr Ile Asn Phe Gly Val Leu Tyr Gly (2) INFORMATION FOR SEQ ID NO: 20:
(i) SEQUENCE CHARACTERISTICS:
SUBSiTME SHEET (RULE 2M
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 20:
Arg Asp Asn Ala Lys Thr Ile Asn Phe Gly Val Leu Tyr Gly (2) INFORMATION FOR SEQ ID NO: 21:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 21:
Arg Arg Ala Ala Lys Thr Phe Ile Tyr Gly Phe Leu Tyr Gly (2) INFORMATION FOR SEQ ID NO: 22:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 22:
Arg Arg Ala Ala Lys Thr Ile Ile Tyr Gly Val Leu Tyr Gly (2) INFORMATION FOR SEQ ID NO: 23:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear "nTM SHEET (RU 2M
(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 23:
Arg Arg Ala Ala Lys Thr Ile Ile Phe Gly Val Leu Tyr Gly (2) INFORMATION FOR SEQ ID NO: 24:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 24:
Arg Arg Ala Ala Lys Thr Ile Asn Tyr Gly Val Leu Tyr Gly SUBS'17TUi'E SHEET (Fi)l.E 26j
[Klenow fragment] (b-strand 14 [residues 916 to 928] and the C terminal) contact the deoxyribose moiety of the dNTP during polymerization to discriminate between rNTP
and dNTP substrates."
Dideoxynucleosides, such as dideoxythymidine, are potent inhibitors of T7 phage growth. Experiments indicate that the inhibition of DNA synthesis is a consequence of the incorporation of the dideoxynucleotide into T7 DNA. Dideoxynucleosides are not inhibitory to uninfected E. coli. We do not know the explanation for the lack of inhibition of E. coli DNA synthesis but it could be explained by cellular uptake, a high level of discrimination against their incorporation by E. coli DNA
polymerase III, inefficient phosphorylation to a triphosphate, or efficient removal. In any case we find that T7 mutant phage arise that can yield normal plaques on agar plates containing dideoxynucleosides with a frequency of approximately 10-3. The location of many of these mutations are shown in Fig. 1. They reside within the gene 5 protein. The mutant gene 5 proteins are more discriminatory (by a few fold) against ddNTPs than the native gene 5 protein. Some members of this class of mutants may delineate the region of the polymerase that is important in recognition of the ribose moiety of the dNTP.
It is important to note that the mutations obtained by this selection using dideoxynucleosides are based on an alteration of the region of the gene 5 protein that recognizes the ribose moiety. However, it is possible that such mutations will also have dramatic effects on other nucleotide analogs. In addition, it is possible to use the same procedure to select for other T7 mutants that discriminate strongly against other nucleotide analogs on the basis of the growth of phage in the presence of the other analogs.
Referring to Table I, various DR mutants are indicated with the amino acid substitution noted in the table. The amino acid substitution is further SUBSTITUTE SHEET (RULE 26) characterized on the right hand side of the table. The location of these mutants is shown in FIG. 1 throughout the palm and finger regions of T7 DNA polymerase. The thumb region is a flexible region that interacts with minor groove of product duplex DNA; the palm region is the catalytic center, the binding site for the 3' end of the primer and contributes to dNTP binding; the finger regions interact with ss template close to the site of synthesis and contribute to dNTP binding. These mutants are highly scattered throughout the polymerase and all have a relatively minor effect on incorporation of dideoxynucleotides, decreasing the ability to incorporate a dideoxynucleotide by only 5-20 fold. In addition, some of these mutants are located in regions of comparative non-homology to DNA polymerase I. Thus, they do not provide an indication of the location of sites in other Pol I enzymes involved in ddNTP discrimination.
Table 1 Summary of T7 DNA Polymerase Dideoxy-Resistant Mutants Mutanr, No. isolates Modification DR1 1 Ala 425 Thr Hydrophobic Polar DR2 1 Phe 434 Ser Hydrophobic Polar Gly 442 Ser Hydrophobic Polar DR3 1 Val 443 Ile Hydrophobic Hydrophobic DR4 2 Arg 444 His Strong basic Weak basic DRS 1 Arg 444 Cys Strong basic Neutral, polar DR6 8 Ser 477 Phe Polar Hydrophobic DR7 4 Asp 504 Asn Basic Neutral DR8 2 Ala 513 Thr Hydrophobic Polar DR9 2 Thr 517 Ile Polar Hydrophobic DR10 1 Ala 532 Ser Hydrophobic Polar DR11 1 Arg 566 Cys Strong basic Neutral, polar DR12 1 Ala 619 Thr Hydrophobic Polar DR13 3 Ala 700 Thr Hydrophobic Polar SUBSTITUTE SHEET (RULE 26) e2o Dumarv 7 Hydrophobic Polar 3 Strong basic Neutral, polar or weak basic 2 Polar Hydrophobic 1 Hydrophobic Hydrophobic In vitro MLtagenesis In vitro mutagenesis of the cloned gene 5 of T7 was used to construct gene 5 proteins in which different regions of E. coli DNA polymerase I were substituted for the analogous or homologous regions in T7 gene 5 protein.
As discussed, we were particularly interested in determining the ability of these enzymes to incorporate nucleotide analogs and the extent to which they discriminate against these analogs. Referring to FIG. 2, the regions within which hybrids between T7 DNA
polymerase and E. coli DNA polymerase I were made are shown as marked as regions A-E.
Referring to FIG. 3, Applicant has determined that region C provides a riboselectivity region having a significantly greater effect than other regions in the polymerase. Some of these other regions are specifically indicated in FIG. 3.
Referring to FIGS. 4-6 (and Table 2) , it was determined that substitution of amino acids in this region allowed conversion of the riboselectivity of a polymerase from a E. coli DNA pol I-type to a T7 DNA
polymerase type and vice versa. Thus, by targeted mutagenesis of this region of polI-type polymerases the riboselectivity of a polymerase can be significantly altered. The level of effect is at least 50-100 fold, and generally more than 500-fold.
DNA Polymerases DNA polymerases useful in this invention include those belonging to the class of homologous polymerases Stt85TiTUTE SHEET (RULE 2M
termed "Pol I-type DNA polymerases" including T7-type DNA
polymerases, the large fragment of E. coli DNA polymerase I, and Taq polymerase.
DNA polymerase useful in this invention include those belonging to a class of homologous polymerases including T7 type DNA polymerases (such as T7, T3, 01, 011, H W31, gh-1, Y, A1122, or SP6). By homologous polymerases is meant an enzyme such as those described by Delarue et al., Protein Engineering 3, 461-467 (1990), that presents an alignment of the Pol I family of DNA
polymerases. It also presents the alignment of Table 2.
Effect of domain exchanges between E. coli DNA polymerase I, T7 DNA polymerase, and Taq DNA polymerase within helix 0 on discrimination against ddNTPs. The sequence of the three polymerases are shown at the top, with the number of the first residue indicated. Below the consensus sequence for these three polymerases the mutants characterized in T7 DNA polymerase (T7), E. coli DNA
polymerase I (Pol) and Taq DNA polymerase (Taq) are shown, with the mutated residues underlined. Each mutant was tested for its relative rate of incorporation of ddNMP to dNMP by SDS activity gel analysis as described in Example 2 and indicated on the right. Mutants T7 C-T8, Pol I C-K6, and Taq C-Q5 were purified along with the wild-type proteins for further analysis.
ddNTP
Enzyme Sequence Discrimination Pol I 754 R R S A K A I N F G L I Y G High $U6$fINTE SHEET (RULE 26) 220)88~) Taq 658 R R A A K T I N F G V L Y G High T7 517 R D N A K T F I Y G F L Y G Low Consensus R A K G Y G
T7 WT R D N A K T F I Y G F L Y G Low T7 C-T2 R$_E A K A I N F G,L.__Z Y G High T7 C-T3 R$_$ A K T F I Y G F L Y G Low T7 C-T4 R D N A K A I N F G F L Y G High T7 C-T5 R D N A K I Y G F L Y G Low T7 C-T6 R D N A K T F N F G F L Y G High T7 C-T7 R D N A K T FN Y G F L Y G Low T7 C-T8 R D N A K T F IF G F L Y G High Pol I WT R R S A K A I N F G L I Y G High Pol I C-Kl R D A K T F I Y GF _L Y G Low Pol I C-K2 R R S A K T F I Y G L I Y G Low Pol I C-K3 R R S A K I N F G L I Y G High Pol I C-K4 R R S A K A I G L I Y G Low Pol I C-K5 R R S A K A I,l. F G L I Y G High Pol I C-K6 R R S A K A I N y G L I Y G Low Taq WT R R A A K T I N F G V L Y G High Taq C-Q1 R A K T I N F G V L Y G High Taq C-Q2 R R A A K T F I Y G L Y G Low Taq C-Q3 R R A A K T I 1_X G V L Y G Low Taq C-Q4 R R A A K T RRAAKTIIFGVLYG F G V L Y G High Tqa C-Q5 R R A A K T I N Y G V L Y G Low Specificity residue conserved sequence motifs from six families of polymerases: DNA polymerases from Pol I, Pol alpha, and Pol beta families, DNA-dependent RNA polymerases, reverse SUBSTlTlJTE StEET (F1U1.E 26) transcriptases, and RNA-dependent RNA polymerases; their results suggest a few residues are conserved between all polymerases. According to their alignment (Figure 3, page 463), the selectivity residue identified herein (phenylalanine 762 in E. coli DNA polymerase I) is in "Motif B". In addition to the pol I family of DNA
polymerases, Motif B is found in the pol alpha family of DNA polymerases and the T7 family of DNA-dependent RNA
polymerases; thus this alignment strongly suggests the residues one should mutate in these other families of polymerases, that include T4 DNA polymerase, Herpes DNA
polymerase, 029 DNA polymerase, Vent DNA polymerase and Pfu DNA polymerase.
In addition, Joyce, Current Opinion in Structural Biology 1, 123-129 (1991) compares the DNA sequences from many polymerases, and suggests that a small number of important active-site residues are conserved. In particular, there is a discussion of the similarities between polymerases of the pol I family (T7, pol I, Taq) and of the pol alpha family (T4, 029, Herpes). In Figure 1 (page 124) Joyce indicates the positions of 5 invariant residues found in these two families; they include lysine 758, tyrosine 766, and glycine 767; these are all very close to the selectivity residue identified herein, phenylalanine 762.
These polymerases are used in a DNA sequencing reaction under conditions in which they preferably produce nearby bands of appro}:imately uniform intensity (with about a 1.5- to 2.0-fold variation in intensity) when the DNA products of the sequencing reaction are run in a gel. By nearby is meant to include bands representing DNA products of molecular weight differing by as much as 6000, i.e., 20 bases. The actual value of this intensity will decrease along the length of the gel, as described in Tabor and Richardson, supra. Band intensity reflects the number of DNA products within a certain band. Labels such as fluorophores or radioisotopes, are used to produce a readily detectable band of intensity reflective of the number of such DNA
StJB.STrfl1TE WXT (Rlli-E 26) products. Thus, in this invention, nearby bands derived from one sequencing reaction with one chain terminating agent have approximately the same number of DNA products and thus a uniform band intensity. The sequencing conditions include incubation of the polymerase in the presence of specific divalent or trivalent cations such as manganese (II and III), ferrous and ferric ions;
monovalent and divalent cations which have no detectable effect, or are detrimental to DNA synthesis, include: K, Na, Ba, Be, Ca, Cc, Cr, Co, Cu, Ni, Si and Zn. The anion is unimportant, for example, chloride, acetate, and sulfate are suitable. Under these conditions the requirement for chain terminating agents, such as dideoxynucleosides, may be lessened by several-fold for enzymes for this invention. A chelator may also be provided in this solution in order to help regulate the concentration of available divalent metal ions. For example, citrate or isocitrate may be provided. These chelates are thought to maintain the level of, for example, free manganese ions at a concentration of between 10 and 100 uM over a wide range of manganese concentrations. That is, the chelator acts as a buffer.
The DNA polymerases of this invention do not discriminate significantly between dideoxynucleotide analogs and deoxynucleotides along the length of the DNA
template. That is, these polymerases are unable to discriminate significantly between a nucleotide that has a 3' hydroxyl group versus one that does not (i.e., has two hydrogens at the 3' position of the ribose).
However, these polymerases may discriminate against modifications at other positions on the nucleosides, even in the presence of manganese or iron. For example, the polymerases may discriminate against some dideoxynucleotide analogs which have fluorescent groups attached compared to deoxynucleotides. However, the polymerases do not discriminate to a different extent at neighboring, or nearby nucleotides, on the basis of the presence or absence of the modification to the dideoxynucleotide. Thus, while they discriminate M5fITUTE $HEET (RULf 26) ~ . ..
WO 96/12042 2201885 pCTIUS95/12910 strongly against these analogs, requiring higher concentrations for a DNA sequencing reaction compared to unmodified dideoxynucleosides, the intensity of nearby bands will still be uniform.
Thus, the polymerases of this invention provide a uniform efficiency of incorporation of chain terminating agents, even if they discriminate against overall incorporation. In addition, other polymerases of this invention will give more uniform bands with fluorescent ddNTPs than the corresponding naturally occuring enzyme, although not as uniform as with unlabelled or radioactively labelled ddNTPs.
Chain terminating agents useful in this invention include dideoxynucleosides having a 2', 3'dideoxy structure. Other agents useful in the invention are those able to specifically terminate a DNA sequencing reaction at a specific base, and are not discriminated against by the polymerase under the above conditions.
In order to determine whether any particular DNA
polymerase, in combination with any particular chain terminating agent, or other component of a sequencing reaction mixture, is useful in this invention, a standard sequencing reaction is performed, as described in Tabor and Richardson, supra, and the extent of band formation, and the uniformity of nearby bands in a sequencing gel, reviewed. If the polymerase reaction does not extend the primer by at least 20 bases, it is not suitable under the conditions used. Adjacent band uniformity within a two-fold or less range is useful in this invention, preferably the uniformity is within a 1.0-1.5 fold range.
Similarly, determination of optimum cation concentration, or of other potential cations useful in the invention, is determined by use of this sequencing reaction under various conditions. For example, cations are tested in ranges from 0.005-100 mM. An example of such an experiment follows:
The ability to incorporate a given ddNMP compared to the corresponding dNMP for any one enzyme is measured as the ratio of ddNTP to dNTP necessary to allow DNA
SUBST11UfE SHEET (RULE 26) 2201 88~) synthesis that terminates in a fixed range, detected as producing bands of no greater than a fixed molecular weight. That is, the bands produced in the reaction end within a specified range in the sequencing gel. Thus, if 5 one enzyme discriminates 1000-fold greater against a given ddNTP compared to another enzyme, a 1000-fold higher ratio of ddNTP to dNTP will be necessary to obtain bands terminating at the corresponding sites in the same range of the gel.
Exonuclease Activity The DNA polymerases of the invention preferably have less than 50%, preferably less than 1%, and most preferably less than 0.1%, of the normal or naturally associated level of exonuclease activity (amount of activity per polymerase molecule). By normal or naturally associated level is meant the exonuclease activity of e.g., an unmodified T7-type polymerase.
Normally the associated activity is about 5,000 units of exonuclease activity per mg of polymerase, measured as described below by a modification of the procedure of Chase et al. (249 J. Biol. Chem. 4545, 1974).
Exonucleases increase the fidelity of DNA synthesis by excising any newly synthesized bases which are incorrectly basepaired to the template. Such associated exonuclease activities can be detrimental to the quality of DNA sequencing reactions. They raise the minimal required concentration of nucleotide precursors which must be added to the reaction since, when the nucleotide concentration falls, the polymerase activity slows to a rate comparable with the exonuclease activity, resulting in no net DNA synthesis, or even degradation of the synthesized DNA.
More importantly, associated exonuclease activity may cause a DNA polymerase to idle at regions in the template with secondary structure impediments. When a polymerase approaches such a structure its rate of synthesis decreases as it attempts to pass. An associated exonuclease will excise the newly synthesized SUBSTITUTE SHEET (R1LE 26) WO 96/12042 PCT/US95/129*
DNA when the polymerase stalls. As a consequence numerous cycles of synthesis and excision will occur.
This may result in the polymerase eventually synthesizing past the hairpin (with no detriment to the quality of the sequencing reaction) ; or the polymerase may dissociate from the synthesized strand (resulting in an artifactual band at the same position in all four sequencing reactions); or, a chain terminating agent may be incorporated at a high frequency and produce a wide variability in the intensity of different fragments in a sequencing gel. This happens because the frequency of incorporation of a chain terminating agent at any given site increases with the number of opportunities the polymerase has to incorporate the chain terminating nucleotide.
An ideal sequencing reaction will produce fragments that give bands of uniform intensity throughout the gel.
This is essential for obtaining the optimal exposure of the X-ray film for every radioactive fragment. If there is variable intensity of radioactive bands, then fainter bands may go undetected. To obtain uniform radioactive intensity of all fragments, the DNA polymerase should spend the same interval of time at each position on the DNA, showing no preference for either the addition or removal of nucleotides at any given site. This occurs if the DNA polymerase lacks any associated exonuclease, so that it will have only one opportunity to incorporate a chain terminating nucleotide at each position along the template.
~hort Primers The DNA polymerase of the invention is preferably able to utilize primers of 10 bases or less, as well as longer ones, most preferably of 4-20 bases e.g., 6 bases (which can be used in groups of three to form an equivalent of an 18-mer). The ability to utilize short primers offers a number of important advantages to DNA
sequencing. The shorter primers are less expensive and easier to synthesize than the usual 17-mer primers. They MSTITUTE -S`HEtT {RULE 28) also anneal faster to complementary sites on a DNA
template, thus making the sequencing reaction faster.
Further, the ability to utilize small (e.g., six or seven base) oligonucleotide primers for DNA sequencing permits strategies not otherwise possible for sequencing long DNA
fragments. For example, a kit containing 80-4000 random hexamers could be generated, none of which are complementary to any sites in the cloning vector.
Statistically, one of the 80 hexamer sequences will occur an average of every 50 bases along the DNA fragment to be sequenced. The determination of a sequence of 3000 bases would require only five sequencing cycles. First, a "universal" primer (e.g., New England Biolabs #1211, sequence 5' GTAAAACGAACGGCCAGT 3') would be used to sequence about 600 bases at one end of the insert. Using the results from this sequencing reaction, a new primer would be picked from the kit homologous to a region near the end of the determined sequence. In the second cycle, the sequence of the next 600 bases would be determined using this primer. Repetition of this process five times would determine the complete sequence of the 3000 bases, without necessitating any subcloning, and without the chemical synthesis of any new oligonucleotide primers.
The use of such short primers may be enhanced by including gene 2.5 and gene 4 protein of T7 in the sequencing reaction.
In vitro Mutagenesis Mutagenesis of the polymerase genes was carried out using standard PCR techniques (see below).
Discrimination against ddNTPs In the presence of magnesium as the only divalent cation, T7 DNA polymerase discriminates about 3 - 4 fold against ddNTPs, less than any other known polymerase.
The next closest is reverse transcriptase, that discriminates about 10 - 50-fold against ddNTPs (3 - 10 times more than T7 DNA polymerase). After these two, all other known DNA polymerases characterized in the SUBSTINt'E SHEET (RULE 26) ; 2 0 1 ~8 J' literature discriminate at least 100-fold against ddNTPs, and often 10,000 fold or more.
In the presence of manganese discrimination by T7 DNA polymerase and E. coli DNA polymerase I are reduced;
with T7 DNA polymerase, it is reduced from 3.7 to 1, and with E. coli DNA polymerase I it is reduced from 550 to 3.9 (for ddATP). Applicant is the first to provide a DNA
polymerase that in the presence of magnesium ions as the only divalent cations has a processivity of less than 100 (defined as the average length of extension from a given primer before dissociating from the primer-template;
.Reverse transcriptase has a processivity by this definition of about 150 - 200, and T7 DNA polymerase has a processivity greater than this) and that discriminates less than 100 fold against incorporation of a ddNMP. In contrast, most of the known DNA polymerases, such a Taq that have a processivity less than 100 discriminate more than 100 fold against incorporation of a ddNMP.
Previously it was believed that whereas polymerases with high processivity such as T7 DNA polymerase remain bound to a primer-template for up to several minutes, polymerases with low processivity such as E. coli DNA
polymerase I cycle from one primer template to another every few seconds (or over one hundred times more frequently). See for example Tabor,et al. J. Biol.
Chem. 262, 16212-16223 (1987)). While processivity is advantageous for DNA sequencing, as in reducing the background due to terminations not from dideoxy incorporation, the slow cycling time is a disadvantage.
For example, if the polymerase does dissociate at specific sequences, it will result in strong artifactual bands on a sequencing gel unless there is a large excess of polymerase present. On the other hand, with a polymerase that cycles rapidly, one can use much less polymerase since a single enzyme molecule will extend many different primers during the course of a sequencing reaction, and any given primer end will have the opportunity to be extended by many different polymerase molecules, decreasing the chance of strong specific stops SUBSTlnrrE SHEET (RULE 26) ' 220188-from occurring.
However, it is better that a polymerase that cycles rapidly also incorporates ddNMPs efficiently, in order to give bands of uniform intensity and allow one to use less ddNTPs. It is also preferred that such a polymerase have low or no exonuclease activity, and that one add pyrophosphatase to prevent degradation of bands by pyrophosphorolysis.
It is also preferred that one be able to carry out DNA sequencing reactions with magnesium as the only divalent cation (i.e., the absence of manganese). First, polymerases tend to be less active with manganese compared with magnesium (see for example Tabor and Richardson. Proc. Natl. Acad. Sci. USA 86, 4076-4080 (1989)). Second, while polymerases tend to be active over wide ranges of magnesium concentrations, there is a very sharp, low optimum manganese concentration required in most cases for optimum activity (id.). And at the optimum manganese concentrations there is much less effect on reduction of discrimination against ddNTPs as at much higher concentrations, where the polymerase is much less active. Third, manganese is not as convenient a metal ion to have in a kit; it readily forms precipitates, particularly at higher pH's. Fourth, it is not clear whether manganese will be effective as a metal ion for reducing discrimination against ddNTPs with any thermophilic polymerase (i.e., at higher temperatures).
Prior to the present invention, we stated (Tabor and Richardson Proc. Natl. Acad. Sci. USA 86, 4076-4080 (1989)) that there was a correlation between discrimination and processivity:
"DNA polymerase I has a low processivity, dissociating after the incorporation of less than 10 nucleotides. There is a strong correlation between the frequency at which the enzyme dissociates from a site during DNA synthesis in the absence of ddNTPs, and the extent of discrimination against the incorporation of a ddNMP at that site (unpublished results). This suggests that DNA polymerase I
SUBSflTM SHEET (RULE 26) WO 96/12042 2201885 PCTfUS95/129 incorporates dNMPs and ddNMPs at similar rates during processive synthesis; however, when synthesis is nonprocessive, dNMPs are incorporated preferentially over ddNMPs. This model could 5 account for the greater variabiliy in ddNMP
incoporation byDNA polymerase I complared to T7 DNA
polymerase, since the latter has a processivity two orders of magnitude greater than the former."
[Citations omitted.]
10 Thus the results of this invention with E. coli DNA
polymerase I and Taq DNA polymerase are surprising, since we find no evidence that the mutants described herein do increase the processivity of the mutant enzymes.
15 Thermophilic polymerases Thermophilic polymerases that discriminate against a ddNTP by less than a factor of 100 are particularly useful in this invention. In, addition, those that discriminate against a ddNTP by less than a factor of 100 20 in the presence of magnesium as the only divalent cation and preferably cycle from one primer-template to another more than once per second are useful. Thermophilic polymerases are defined as polymerases that have optimum DNA polymerase activity in a 15 min. reaction at a 25 temperature above 60 C.
Uniform band intensities Even though manganese reduces the discrimination of Klenow fragment against ddATP from 550 to 3.9 fold Tabor 30 and Richardson (Proc. Natl. Acad. Sci. USA 86, 4076-4080 (1989)) show that there is still wide variability in the intensity of individual bands (see Figure 2 id.). Thus, apart from T7 DNA polymerase, this invention is the first to provide polymerases that cycle rapidly such as Klenow 35 fragment and those that are derived from thermophilic organisms to produce bands that have uniform intensities, even in the presence of magnesium as the only divalent cation, conditions that are preferable for the activity of most polymerases (see above). Enzymes which cycle SUBSTCt11TE SHEET (Rt1LE 26) = WO 96/12042 PCT/US95/12928 rapidly can be determined by methods known in the art as described below.
' Specific Polymerases From the above information it is possible to readily make the following polymerases which will have the desired properties discussed above. Each of these polymerases can be used for sequencing procedures if the level of exonuclease is low enough and the activity of the polymerase is sufficient (both of which are well known in the art) See Braithwaite and Ito, supra for reference to each amino acid site.
1. Pol I family E. coli DNA polymerase I with altered Phe762 (altered means replaced with e.g., Tyr, or an equivalent amino acid to give the non-discriminatory property).
Streptococcus pneumoniae DNA polymerase I with altered Phe711.
Thermus aquaticus DNA polymerase I with altered Phe667.
Thermus flavus DNA polymerase I with altered Phe666.
Bacteriophage T5 DNA polymerase with altered Phe570.
Bacteriophage Spo 1 DNA polymerase with altered Leu526.
Bacteriophage Spo 2 DNA polymerase with altered Phe690.
Mitochondrial DNA polymerase with natural Tyr753 or altered at this site without reducing the non-discriminatory activity. Such a polymerase has not previously been used for DNA sequencing. Applicant believes that it will be useful in such a procedure because of its expected low level of ddNTP
discrimination. If needed, it can be modified to reduce exonuclease activity associated with the polymerase activity.
SUBSTITUTE SHEET (RULE 26) 220 i 8U5 2. Polymerase alpha family (also called polymerase II
family) Delarue et al., Protein Engineering 3, 461-467 (1990) show that the two families of polymerases (polymerase I family and polymerase alpha family) share three common motifs. The region they call "Motif B"
contains the residue we have identified as responsible for specificity for the dideoxyribose moiety. This region is characterized by the sequence K Nl N2 N3 N4 N5 N6 N7 Y G in the polymerase I family, whe,,rre N is the specificity residue: if N4 is a phenylalanine there is high discrimination, if N4 is tyrosine there is low discrimination. In the polymerase alpha family, the sequence is K Nl N2 q N 1aJ tT Y G (there is one less base between the conserved residues). We predict therefore that just as with polymerase I type enzymes, changes to the residue(s) in this motif (betwen the lysine (K) and the tyrosine (Y)) will reduce the discrimination of these polymerases to ddNTPs. These residues are as follows:
Escherichia coli DNA polymerase II I1e494-Phe499 PRD1 DNA polymerase Leu341-Ser346 0 29 DNA polymerase Leu384-Leu389 M2 DNA polymerase Leu381-Leu386 T4 DNA polymerase Ile558-Leu563 Thermuococcus litoralis DNA polymerase (Vent) Leu492-Tyr497 Pyrococcus furiosus DNA polymerase Leu489-Phe494 Sulfolobus solfataricus DNA polymerase Va1604-Thr609 Human DNA polymerase alpha Leu951-His956 S. cerevisiae DNA polymerase I (alpha) Leu945-His950 S. pombe DNA polymerase I (alpha) Leu931-His936 Drosophila melanogaster DNA polymerase alpha Leu960-His965 Trypanosoma brucei DNA polymerase alpha Leu845-His850 Human DNA polymerase delta Val695-Va1700 Bovine DNA polymerase delta Va1694-Va1699 "$TtNTE SHEET (RULE 26) 2Z01g8J
= WO 96/12042 PCT/US95/12928 S. cerevisiae DNA polymerase III
(delta) I1e702-Va1707 S. pombe DNA polymerase III
(delta) Va1681-Va1686 Plasmodium falciparum DNA polymerase delta I1e692-Va1697 S. cerevisi.ae DNA polymerase II
(epsilon) Va182-Phe830 S. cerevisiae DNA polymerase Rev3 Leu1087-Thr1092 Herpes Simplex virus type 1 DNA
polymerase Va1812-Va1817 Equine herpes virus type 1 DNA polymerase Va1813-Va1818 Varicella-Zoster virus DNA polymerase Val776-Val781 Epstein-Barr virus DNa polymerase Cys682-Va1687 Herpesvirus saimiri DNA polymerase Val671-Va1676 Human cytomegalovirus DNA polymerase Val811-Phe816 Murine cytomegalovirus DNa polymerase Va1717-Phe722 Human herpes virus type 6 DNA polymerase Ile667-Va1672 Channel Catfish virus DNA polymerase I1e750-His755 Chlorella virus DNA polymerase Ile586-Va1591 Fowlpox virus DNA polymerase I1e648-Va1653 Vaccinia virus DNA polymerase I1e637-Val642 Choristoneura biennis DNA polymerase Ile669-Leu674 Autographa californica nuclear polyhedrosis virus (AcMNPV) DNA polymerase Arg606-Ile611.
Lymantria dispar nuclear polyhedrosis virus DNA polymerase Arg624-I1e629 Adenovirus-2 DNA polymerase Leu696-Leu701 Adenovirus-7 DNA polymerase Leu762-Leu767 Adenovirus-12 DNA polymerase Leu694-Leu699 S-1 maize DNA polymerase Leu6l8-Leu623 kalilo neurospora intermedia DNA
polymerase Leu776-Leu777 pAI2 Ascobolus immersus DNA polymerase Leu951-Leu956 pCLKl Claviceps purpurea DNA polymerase Leu831-Leu836 Maranhar neurospora crassa DNA polymerase Leu752-Leu757 pEM Agaricus bitorquis DNA polymerase Leu573-Leu578 SUBSTITUiF SHEET (RULE 26) ~.
220 1,88 WO 96/12042 PCT/US95/129=
pGKL1 Kluyveromyces lactis DNA polymerase I1e785-Leu790 pGKL2 Kluyveromyces lactis DNA polymerase I1e770-G1y776 pSKL Saccaromyces kluyveri DNA polymerase I1e775-G1y781 1.'.xam~nlaG
The following are examples of methods for determining the processivity and cycle times for various polymerases.
Also provided are examples for determining the level of discrimination by a polymerase, and other methods useful in this invention.
Example 1. Mutagenesis of DNA polymerase aenes and overaroduction of mutant DNA polymerases Standard techniques are used for the cloning and expression of mutant DNA polymerase genes. The genes for the large fragment of E. coli DNA polymerase I(Klenow fragment) and the large fragment of Taq DNA polymerase (KlenTaq or _Taq DNA polymerase, see Barnes 112 Gene 29, 1992 or Stoffel fragment, see Lawyer et al. 2 PCR Methods Appl 275, 1993), the starting materials for the generation of mutants in E. coli DNA polymerase I and Taq DNA polymerase, are expressed under the control of the T7 RNA polymerase promoter. The gene for the _28 amino acid deletion of T7 DNA polymerase (see Tabor and Richardson 264, J. Biol. Chem. 6447, 1989), the starting material for the generation of mutants in T7 DNA polymerase, is expressed under the control of the lac promoter in a strain that produces E. coli thioredoxin, a necessary factor for processive DNA synthesis by T7 DNA polymerase (Tabor and Richardson, supra). The gene for Taq DNA
polymerase mutant F667Y is transferred from the gene that produces _Taq DNA polymerase to the gene that produces the full length Taq DNA polymerase by standard techniques using PCR and restriction digestion followed by ligation.
Mutagenesis to construct the mutant DNA polymerases is carried out using standard mutagenesis techniques by PCR similar to the method described by Sarkar and Sommer 8 BioTechniques 404, 1990. To construct hybrids in which more than four amino acid residues are being exchanged, SUBSTtTUTE SHEET (RULE 26) Wo 96/12042 PCT/US95/12928 two hybrid primers are constructed whereby PCR is first carried out on the donor DNA, and then that product is used for PCR on the recipient DNA, generating the hybrid molecule. For the construction of hybrids in which the 5 exchange of domains is four amino acid residues or less, single PCR primers are synthesized that contain the entire region to be transferred as well as the proper flanking sequences of the recipient, and that primer is used to construct the hybrid molecule directly.
10 Overproduction of the mutant DNA polymerases is carried out using standard techniques (see for example Current Protocols in Molecular Biology, Ausubel et al, eds., Chapter 16, 1994). Mutant proteins are purified by standard procedures including ion exchange 15 chromatography. For the purification of E. coli DNA
polymerase I mutants, see for example Joyce and Grindley 80 Proc. Natl. Acad. Sci. 1830, 1983. For the purification of Taq DNA polymerase mutants, see for example Engelke et al. 191 Analytical Biochemistry 396, 20 1990. For the purification of T7 DNA polymerase mutants, see for example Tabor and Richardson 264, J.
Biol. Chem. 6447, 1989. Polymerase specific activities of each of the purified mutant proteins are determined by standard procedures described in these references.
25 Example 2. Rapid screen of DNA polymerases for mutants that are improved in their efficiency of incorporating a dideoxynucleotide relative to a deoxynucleotide Mutant DNA polymerases are screened for their ability to incorporate dideoxynucleotides by SDS activity gel 30 analysis. The procedure is a modification of that described by Spanos and Hiibscher 91 Methods in Enzymology 263, 1983 and Karawya et al. 135 Analytical Biochemistry 318, 1983. Briefly, 10 ml of cells are induced for 4 to 6 hours and then pelleted. The cell pellet is 35 resuspended in 0.3 ml 25 mM Tris-HCl, pH 7.0, 5 mM EDTA.
20 1 of the resuspended cells are mixed with 40 1 of a solution of 11; SDS (sodium dodecyl sulfate), 2%
mercaptoethanol, 30 % glycerol, 0.04% bromphenol blue, and 100 mM Tris=HCl, pH 6.8. The mixtures are incubated SLi6STiNTE SHFET (RULE 26) at 37 C for 5 min, and then 20 1 aliquots are loaded in duplicate onto two SDS polyacrylamide gels. The SDS
polyacrylamide gels consist of a resolving gel that contains 8!k polyacrylamide, 0.27% bisacrylamide, 190 mM
Tris=HCl, pH 8.8, 0.05%- SDS, and 25 g/ml denatured salmon sperm DNA, and a stacking gel that consists of 5%
polyacrylamide, 0.17% bisacrylamide, 150 mM Tris=HCl, pH
6.8, and 0.1% SDS. The two gels are electrophoresed at 100 V for 13 hr at a constant temperature of 13 C in an electrophoresis buffer consisting of 190 mM Tris=HCl and pH 8.8, 0.05%- SDS.
After electrophoresis, the gels are washed over 8 hr in 4 changes of 500 ml each of Renaturation Buffer (50 mM
Tris.HC1, pH 7.5, 5 mM magnesium acetate, 1 mM
dithiothreitol, 40 mM KC1, 400 g/ml bovine serum albumin, 16% glycerol and 0.95 mM EDTA) at 4 C.
The renatured proteins are assayed for DNA polymerase activity by incubating each of the two gels in 6 ml of Renaturation Buffer, 1.5 M 4dNTPs, 4 1 of [a-32P] dATP
(800 Ci/mmol, 10 mCi/ml), and 80 g of purified thioredoxin. One of the mixtures also contains 30 M
ddTTP (a 20-fold molar excess over dTTP). The mixtures are incubated for 4 hr at 37 O C (70 C for 2 hr for thermophilic DNA polymerases).
After incubation the gels are washed for 8 hr against four changes of 5% trichloracetic acid and 1% sodium pyrophosphate. The gels are then dried and autoradiographed.
To determine whether a mutant DNA polymerase is discriminating more or less against ddTTP, the intensities of the radioactive bands are compared on the two gels that were incubated in the presence and absence of ddTTP, and the ratio of the signal in the two bands for the unmodified DNA polymerase is compared with the ratio of the signal in the two bands for each of the mutants. If a mutation results in a DNA polymerase being less discriminatory towards ddTTP, then there will be a greater percentage decrease in radioactivity in the band in which ddTTP was present for the mutant DNA polymerase SUBSTINTE SFiEET (RULE 26) compared with the unmodified DNA polymerase. For example, under these conditions the radioactive bands observed for cells containing induced E. coli DNA
polymerase I or T7 DNA polymerase mutant Y526F are approximately the same intensity (within a factor of two) in reactions carried out in the presence versus the absence of ddTTP. In contrast, for cells containing induced E. coli DNA polymerase I mutant F762Y or T7 DNA
polymerase, the bands on the gel in which the reactions are carried out in the presence of ddTTP are less that 50 the intensity of the bands corresponding to the reactions carried out in the absence of ddTTP.
This assay represents a rapid method of screening a large number of DNA polymerase mutants for their ability to discriminate against dideoxynucleotides. It can detect changes of at least 5-fold in the relative rate of discrimination. However, this assay should be followed up with purification of potentially interesting mutant DNA polymerases and more rigorous assays of the purified proteins similar to those described below in order to determine precisely the effect of each mutation on discrimination against dideoxynucleotides.
The following examples are methods for determining the processivity and cycle times for various polymerases, determining the level of discrimination against ddNTPs by a polymerase, determining the uniformity of bands generated by dideoxy-terminated fragments on a DNA
sequencing gel; and using the DNA polymerases of this invention for DNA sequence analysis.
Example 3.Preparation and purification of a sinale-stranded M13 DNA - 5' 32P-labeled 40-mer primer complex The template is M13 mGPl-2 single-stranded DNA, 9950 nucleotides in length, as described in U.S. Patent 4,795,699 (Figure 9). The phage M13 mGPl-2 is deposited in ATCC as number 40303. M13 mGPl-2 single-stranded DNA
is purified as described in Tabor et al., 262 J. Biol.
Chem. 16212, 1987. Briefly, the phage is purified through two CsCl gradient centrifugations, the CsCl is 'SUBSfITUTE SHEET (RU1.E 26) removed by dialysis, the DNA is removed f rom the phage by extraction with phenol and chloroform in the presence of 0.1% sodium dodecyl sulfate, and the extracted DNA is dialyzed extensively against 20 mM Tris=HC1, pH 7.5, 2 mM
EDTA and stored at 4 C. The concentration of M13 mGPl-2 single-stranded DNA is determined spectrophotometrically using an extinction coefficient of 8.1 A260 units = 1 mg/ml, or 0.3 pmoles of M13 mGPI-2 template molecules per microliter.
The primer is a synthetic 40-mer having the sequence 5' d(TTTTCCCAGTCACGACGTTGTAAA.ACGACGGCCAGTGCCA)3' synthesized by standard procedures. It is complementary to M13 mGP1-2 DNA at nucleotid.es 9031 to 8992 (see 1699 patent supra for sequence). The primer is purified by ion exchange chromatography or denaturing polyacrylamide gel electrophoresis prior to end labeling.
The primer is labeled and annealed to the template essentially as described in T'abor et al (supra). The primer is 5' end labeled in a reaction mixture (15 1) containing 40 mM Tris=HC1, pH 7.5, 10 mM MgC121 5 mM
dithiothreitol, 100 mM NaC1, 50 g/ml BSA, 50 Ci [g32P]ATP, 6000 Ci/mmol, 5 pmoles of primer, and 10 units of T4 polynucleotide kinase prepared from the PseTl mutant, which is deficient in phosphatase activity. The mixture is incubated at 37 C for 15, min, followed by 70 C for 15 min to inactivate the kinase. 60 1 of single-stranded M13 mGPl-2 DNA (0.25 mg/ml), 6 1 of 1M
NaCl and 3 1 of 0.2 M MgC12 are added, and the mixture is slowly cooled from 70 C to room temperature (about 20 - 250C) over a period of 30 inin. The mixture is then extracted once with a 1:1 mixture of phenol and chloroform, and after cennrifugation for 30 sec in a microfuge, the aqueous phase (70 1) is placed on a 1 ml column of SepharoseM CL-GB equilibrated in 20 mM
Tris=HC1, pH 7.5, 2 mM EDTA, and 100 mM NaCl. The labeled primer-template complex is eluted from the column with the same buffer used for equilibration; the labeled complex elutes in the void volume. After elution, the complex is at a concentration of approximately 50 g/ml (0.015 pmoles of molecules per 1) with a specific activity of approximately 200,000 cpm/ 1.
Example 4. Determination of the processivity of a DNA
Fo1ynerase by dilution test Processivity is determined by enzyme dilution essentially as described in Tabor et al (supra) and Tabor and Richardson, 84 Proc. Nat1. Acad. Sci USA 4767, 1987.
The reactions are carried out under the same conditions as in the extension/termination reactions used in DNA
sequencing (Tabor and Richardson, supra), except that ddNTPs are omitted and the polymerase concentration is reduced in order to have an excess of primer-template molecules over polymerase molecules in some of the reactions. The primer-template consists of the single 5' end-labeled primer annealed to a single-stranded M13 DNA
molecule as described in Example 3.
The extension reaction mixtures are prepared substantially as described in Tabor et al (supra). Each reaction mixture (18 1) contains 1.0 1 annealed 32P-labeled primer-M13 DNA as described in Example 3(-=0.015 pmoles, -200,000 cpm), 40 mM Tris-HCl, pH 8.0, 5 mM MgC1z1 5 mM dithiothreitol, and 300 M 4dNTPs. The mixtures are incubated at 37 C for 1 min (70 C for thermophilic DNA
polymerases). The reactions are initiated by the addition of 2 1 aliquots of dilutions of the DNA
polymerase being analyzed, diluted in 20 mM Tris-HCl, pH
7.5, 10 mM 2-mercaptoethanol and 0.0596 bovine serum albumin. The reaction mixtures are further incubated at 37 C (70 C for thermophilic DNA polymerases) for either 30 sec or 3 min. At the indicated times, 8 1 aliquots are removed and added to either 8 1 of 90% formamide, 20 mM EDTA, 0.051-1; bromphenol blue for denaturing polyacrylamide gel electrophoresis, or 2 1 of 100 mM
EDTA, 2%- sodium dodecyl sulfate for alkaline agarose gel electrophoresis.
The samples are analyzed by either denaturing polyacrylamide gel electrophoresis or alkaline agarose gel electrophoresis. Denaturing polyacrylamide gel &Sf ITUTE SHEET (RULE 26) WO 96/12042 PCT/US95112*
electrophoresis is most suitable for analyzing polymerases with an average processivity of less than 500 nucleotides, while alkaline agarose gel electrophoresis provides a more sensitive estimate of the processivity of 5 DNA polymerases with an average processivity greater than 500 nucleotides; however, either method can be used successfully to determine the average processivity of any DNA polymerase.
To determine the processivity by denaturing 10 polyacrylamide gel electrophoresis, aliquots in formamide are heated at 90 C for 2 min immediately prior to loading 6 1 of each sample onto a gel consisting of 8%-polyacrylamide, 0.4%, N,N'-methylenebisacrylamide, 7 M
urea in 100 mM Tris=borate, pH 8.3, 1 mM EDTA.
15 Electrophoresis is at 2000 Volts for 90 min (until the bromphenol blue has just run off the bottom of the gel).
Suitable 5' 32P-end labeled molecular weight markers are also loaded onto the gel that allow the determination of fragment sizes 100 to 500 nucleotides in length. An 20 example of such suitable markers are T7 HpaI fragments that have been dephosphorylated with alkaline phosphatase and then 5' 32P-end labeled using [~J- P] ATP and T4 polynucleotide kinase using standard procedures (Sambrook et al., 1989, Molecular Cloning A Laboratory Manual, Cold 25 Spring Harbor Laboratory, N.Y. pp. 6.20-6.21 and Ausubel et al., 1989, Current Protocols in Molecular Biology, Greene Publishing Associates and Wiley Interscience, N.Y.). After electrophoresis, the gel is dried under vacuum, and autoradiographed. After autoradiography, the 30 distribution of radioactively labeled fragments is determined by Phosphorimager analysis (Molecular Dynamics ) .
Products are analyzed by alkaline agarose gel electrophoresis as described by (1) Villani et al., 256 35 J. Biol. Chem. 8202, 1981, by (2) Sabatino et al., 27 Biochemistry 2998, 1988, and by (3) Sambrook, et al., supra . To prepare the agarose gel, 250 ml of a 1.5%
agarose solution in 2 mM EDTA, pH 8.0 is heated in a microwave oven to dissolve the agarose, and then allowed ~}BSfINiE SHEET (RULE 26) to cool to 60 C. 8.75 ml of 1 N NaOH (finai concentration 35 mM NaOH) is added, the gel is poured into an agarose gel electrophoresis mold. The gel is allowed to set for 2 hours before use. The electrophoresis buffer is 35 mM NaOH and 2 mM EDTA. The samples are prepared by taking the 10 1 aliquots described above (in 20 mM EDTA, 0.4!~ sodium dodecyl sulfate) and denatured by the addition of 1 1 of 1 N
NaOH and heating at 60 C for 10 min. 7 1 of 75 ~
glycerol, 0.2 o bromcresol green are added to each sample, and then the samples are loaded onto the alkaline agarose gel. Electrophoresis is carried out at 4 C at a constant current of 150 mA for 15 hours (until the bromcresol green has migrated about 14 cm). The electrophoresis chamber has dimensions of 26 cm (length) x 20 cm (width) by 2 cm (height). After electrophoresis, the gel is soaked for 2 hours in 10% trichloracetic acid, dried under vacuum, and autoradiographed. After autoradiography, the distribution of radioactively labeled fragments is determined by Phosphorimager analysis (Molecular Dynamics).
To test the processivity of a given DNA polymerase, the concentration of that polymerase in the reaction-mixture is diluted by 2-fold increments until only a fraction (e.g. 25%) of the primers are extended, while the majority (e.g. 75a) remained unchanged, 40 nucleotides in length. Under these conditions, a two-fold increase or decrease in DNA polymerase concentration should result in about a two fold increase or decrease, respectively, in the fraction of primers extended. The average length of the labeled fragments that are extended is determined either by visual inspection of the autoradiograph or by quantitation using a phosphoimager.
For example, using this test, the exonuclease-deficient T7 DNA polymerase complexed to its processivity factor thioredoxin (e.g. SEQUENASE Version 2.0, United States Biochemical Corporation) has an average processivity of more than 500 nucleotides, while Klenow fragment of E.
coli DNA polymerase I has a processivity of less than 50 SUgSTiTTE SHEET (#tULE 26) nucleotides.
Example 5. Determination of the rate of cyrclina of a DNA
nol;,merase The rate of cycling is determined essentially as described Tabor et al (supra). The reactions are carried out under the same conditions as in the extension/termination reactions used in DNA sequencing (Tabor and Richardson, supra), except that ddNTPs are omitted and the polymerase concentration is reduced in order to have an excess of primer-template molecules over polymerase molecules. The primer-template consists of the 5'[32P]end-labeled primer annealed to single-stranded M13 DNA molecule as described in Example 3.
First a test is carried out to determine the functional ratio of primer-template molecules to polymerase molecules, for example, using the large fragment of E. coli DNA polymerase I (Klenow fragment).
A dilution experiment is carried out as described in Example 4, to determine the concentration of polymerase molecules necessary to extend 20% of the labeled primer-template molecules in 10 sec. This ratio of polymerase to primer-template is defined as being less than or equal to 1:5, and is used to determine the maximum rate of cycling of the polymerase as described below.
Extension reactions are carried out as described in Example 4 using the ratio of DNA polymerase to primer-template less than or equal to 1:5, as defined in the previous paragraph. Reactions are carried out under conditions (i.e. buffer, pH, salt, temperature) optimal for the polymerase being tested. Aliquots are removed at 10 sec, 20 sec, 40 sec and 80 sec, and the reactions are terminated and the products analyzed as described in Example 4. For DNA polymerases with low processivity (less than 100 nucleotides), such as the large fragment of E. coli DNA polymerase I, samples are analyzed preferably by denaturing polyacrylamide gel electrophoresis. For DNA polymerases with high processivity (greater than 100 nucleotides), such as T7 SBTITUTE SHEET (RULE 26) DNA polymerase, samples are analyzed preferably by alkaline agarose gel electrophoresis. After electrophoresis, the gels are dried under vacuum and analyzed either by autoradiography or with a Phosphoimager.
When reactions are carried out with a polymerase that cycles very slowly, such as T7 DNA polymerase, there is not a significant decrease (i.e. less than a factor of two) in the number of unextended primers between 10 sec and 80 sec. Thus, if the number of unextended labeled primers does not decrease by more than two fold between ,the 10 sec and 80 sec time points, then the DNA
polymerase cycles slower than once per 70 sec. For polymerases that cycle rapidly, the number of unextended primers will decrease by a significant fraction (i.e. by more than 2 fold) between the 10 and 80 sec time points.
To determine the rate of cycling for these polymerases, the following equation is used:
R = Nl x L (t2) /{L (tl) x(ta - tl) }
where:
R = minimum rate of cycling in cycles per sec.
N1 = ratio of primer-template molecules over, functional DNA polymerase molecules. (N1 = 5 in the example above.) L(tl) = the maximum processivity of the DNA polymerase being investigated, in nucleotides. This is defined as the maximum number of nucleotides extended from the labeled primers under conditions of limiting DNA
polymerase (where only 20% of the primers are extended in the 10 sec time point), as described in Example 3.
L(t2) = the maximum length of extension (in nucleotides) of the labeled primers at time t2, which is 80 sec in this example.
t,.: the shortest time at which an aliquot is taken, or 10 sec in this example.
SUBSTITUCE SHEET (RU!_E 261 WO 96/12042 22O18U J PCT/US95/129,~
t2: the longest time the reaction is allowed to precede before removing an aliquot, or 80 sec in this example.
When this test is carried out for the large fragment of E. coli DNA polymerase I, a value of more than 0.2 cycles per sec is obtained.
Examples 6 and 7 provide tests to determine the efficiency of incorporation of dideoxynucleotides for an unknown DNA polymerase using a 5' 32P-labeled 40-mer primer annealed to a single-stranded M13 DNA template and a gel electrophoresis-based analysis. Example 6 is best for DNA polymerases that efficiently incorporates dideoxynucleotides (e.g. wild-type T7 DNA polymerase and Taq DNA polymerase F667Y) while Example 7 is best for DNA
polymerases that discriminate strongly against the incorporation of dideoxynucleotides (e.g. T7 DNA
polymerase Y526F and wild-type Taq DNA polymerase).
Examnle 6. Gel electrnphorPs;.-ba d determination of the rate of incormoration of dideoxvnucjPo+-idPs relative to deoxvnuc leotides using a 1:1 ratio of dNTPs to ddNTPs The primary application of this test is to determine the absolute ratio of incorporation of a ddNMP to a dNMP
for a DNA polymerase that efficiently incorporates dideoxynucleotides, such as T7 DNA polymerase, E. coli DNA polymerase I mutant F762Y or Taq DNA polymerase mutant F667Y. It can also indicate the level of discrimination.against ddNTPs of any DNA polymerase;
however, for DNA polymerases that discriminate strongly against ddNTPs, such as T7 DNA polymerase mutant Y526F, E. coli DNA polymerase I or Taq DNA polymerase, higher ratios of ddNTP to dNTP are necessary to determine precisely their level of discrimination, which is described in detail in Example 7.
DNA synthesis reactions are carried out on the 32P-end labeled 40 mer-M13 mGPl-2 DNA template complex prepared as described in Example 3. Reaction conditions are used that are optimum for the DNA poiymerase being tested with regard to buffer, pH, salts, and temperature of the siBsnTUTE SHEET (RULE 26) reaction. A concentration of DNA polymerase is chosen whereby most of the primers are extended in a 10 min reaction and are terminated by the incorporation of a dideoxynucleotide. The reaction mixture contains 100 M
5 4dNTPs and 100 M of one of the four ddNTPs.
We used this test to compare the ability of six DNA
polymerases to incorporate each of the four ddNMPs. The DNA polymerases tested were (1) T7 DNA polymerase with a 28 amino acid deletion in the exonuclease domain and 10 complexed in a one-to-one ratio with thioredoxin (Tabor and Richardson 264 J. Bio1. Chem. 6447, 1989) (referred to here as "T7 DNA polymerase"), (2) The large fragment of E. coli DNA polymerase I, commonly called the Klenow fragment (referred to here as "E. coli DNA polymerase 15 I"), (3) Unmodified DNA polymerase from Thermus aquaticus (referred to here as "Taq DNA polymerase"), (4) T7 DNA
polymerase as described above in which the tyrosine at residue 526 has been changed to a phenylalanine (referred to here as "T7 DNA polymerase Y526F"), (5) E. coli DNA
20 polymerase I as described above in which the phenylalanine at residue 762 has been changed to a tyrosine (referred to here as "E. coli DNA polymerase I
F762Y"), and (6) Taq DNA polymerase as described above in which the phenylalanine at residue 667 has been changed 25 to a tyrosine (referred to here as "Taq DNA polymerase F667Y").
To test the relative rate of use of each of the four ddNTPs compared to the comparable dNTPs for each of the DNA polymerases indicated above, the reaction mixtures (8 30 1) contained 1.0 1 annealed 32P-labeled primer-M13 DNA
as described in Example 3 (-0.015 pmoles, -200,000 cpm), 40 mM Tris=HC1, pH 8.0, 5 mM MgC12, 5 mM dithiothreitol, 50 mM NaCl, 100 M 4dNTPs, and 100 M ddCTP. The reaction mixtures also contained 10 ng of yeast inorganic 35 pyrophosphatase to inhibit pyrophosphorolysis that could otherwise increase the apparent discrimination by the DNA
polymerase (Tabor and Richardson 265 j. Biol. Chezn. 8322, 1990). The reactions were initiated by the addition of 2 1 of each DNA polymerase, diluted in 20 mM Tris=HC1, SUBSTITM $HEET (RULE 261 2201885, pH 7.5, 10 mM 2-mercaptoethanol and 0.05% bovine serum albumin to a concentration of approximately 0.025 units/
1. The concentration of each DNA polymerase was sufficient to extend most of the labeled primers by more than 500 nucleotides in the absence of ddNTPs in a 15 min reaction. The reaction mixtures were incubated for 15 min at either 37 C (T7 DNA polymerase, T7 DNA polymerase Y526F, E. coli DNA polymerase I, and E. coli DNA
polymerase I F762Y) or 70 C (Taq DNA polymerase and Taq DNA polymerase F667Y). The reactions were terminated by the addition of 10 1 90% formamide, 20 mM EDTA, 0.050 bromphenol blue. Each sample was heated at 90 C for 2 min immediately prior to loading 6 1 of each sample onto a gel consisting of 8%- polyacrylamide, 0.4= s N,N'-methylenebisacrylamide, 7 M urea in 100 mM Tris-borate, pH 8.3, 1 mM EDTA. Electrophoresis was at 2000 Volts for 90 min (until the bromphenol blue had just run off the bottom of the gel). After electrophoresis, the gel was dried under vacuum, and autoradiographed. After autoradiography, the distribution of radioactively labeled fragments was determined by Phosphorimager analysis (Molecular Dynamics). Alternatively, the relative intensities of dideoxy-terminated bands can be determined by scanning the autoradiograph using an instrument such as the SciScan 5000 imaging densitometer (United States Biochemical Corp).
When the set of four reactions (each containing a single ddNTP at an equimolar concentration as the dNTP) was carried out with each of the six DNA polymerases described above, the reactions with three of the DNA
polymerases (T7 DNA polymerase Y526F, E. coli DNA
polymerase I and Taq DNA polymerase) resulted in most (>50%) of the radioactivity in the primers that had been extended migrating at the top of the gel, corresponding to fragments greater than 300 bases in length. Based on the predicted exponential decay of signal with increasing fragment size, this corresponds to discrimination by these three DNA polymerases of more than l00 fold against all four ddNTPs. A more precise measurement of the SU5TIM'E SHEET (RULE 26) discrimination against ddNTPs by these three DNA
polymerases is obtained using the test in Example 7 below.
For the other three DNA polymerases (T7 DNA
polymerase, E. coli DNA polymerase I F762Y and Taq DNA
polymerase F667Y) the autoradiograph showed a series of dideoxy-terminated fragments with all of the reactions.
In general, the average lengths of the labeled synthesized fragments were lowest for Taq DNA polymerase F667Y, with only about six radioactively labeled dideoxy-terminated fragments visible with even a several day exposure of the film. The average lengths of labeled fragments with E. coli DNA polymerase I F762Y are slightly longer than with Taq DNA polymerase F667Y, while the average lengths are significantly longer with T7 DNA
polymerase. The fragments are more uniform in intensity when synthesized by E. coli DNA polymerase I F762Y and Taq DNA polymerase F667Y that by T7 DNA polymerase.
The distribution of radioactivity in the fragments was quantitated by Phosphoimager analysis (Molecular Dynamics). The total amount of labeled primers in each lane was determined by running three control reactions in which no DNA polymerase was present, and the radioactivity in each of the corresponding radioactive bands on the gel at the position of the unextended primer was determined. With some preparations of radioactively labeled primers, a certain percentage (<100) is not extended by any of the DNA polymerases, regardless of the concentration of DNA polymerase used; this background level is determined by measuring the percentage of radioactivity remaining at the position of unextended primer for a series of four reactions containing ddNTPs, and subtracting the average of these four values from the total number of counts determined previously. This value is defined as the total number of counts in primers that are capable of being extended by a DNA polymerase.
The total number of counts (i.e. radioactivity) in the first three dideoxy-terminated fragments were determined for T7 DNA polymerase, E. coli DNA polymerase SUBSTIME SKET (NllE 26i 2201,8g5-WO 96/12042 PCT/US95/12,4 I F762Y and Taq DNA polymerase F667Y for each of the four ddNTP reactions. The values are presented in the Table below as the percentage of counts in the first three dideoxy-terminated fragments to the total number of counts in the primers capable of being extended by a DNA
polymerase.
Polymerase Reaction ddGTP ddATP ddTTP ddCTP
polymerase 67% 66% 76% 61%
E. coli DNA
polymerase I F762Y 95% 92% 96% 92%
Taq DNA
polymerase F667Y 97% 95% 95% 99%
As a further test of the efficiency of each DNA
polymerase to incorporate dideoxynucleotides, the number of counts in each fragment with a significant signal was determined for each reaction, and the data were plotted as a function of the fragment number using the Macintosh program Kaleidograph Version 3.0 (Synergy Software). The resulting plots were fit to an exponential decay curve using the Kaleidograph library routine for this function.
The decay curve is given by the equation:
Y = e M"X
where:
Y = 1 - (the fraction of labeled primers in fragments 1 to X compared to the total number of primers that can be extended) X the fragment number (the first dideoxy-terminated fragment is 1) M = the exponential decay function calculated for the data by the Kaleidograph library routine.
SU85TITUTE SHEET (RILE 26j In the Table below, the following data are provided for each of the four ddNTP reactions using T7 DNA
polymerase, E. coli DNA polymerase I F762Y and Taq DNA
polymerase F667Y:
N, the number of fragments used to fit each exponential curve.
M, the calculated exponential decay function as described above.
D, the discrimination factor given as the ratio of the use of a specific dNTP to the use of the comparable ddNTP when both nucleotides are present at equal concentrations. D is calculated from the equation above using the calculated value of M to determine Y when X =
1, and defining D, the ratio of preference of the dNTP to the ddNTP, as Y/(1-Y).
R2, the correlation index for the data, was calculated by the Kaleidograph library routine. This is a measure of the variability in band intensities, or the sequence-specific variability in the ability of a DNA polymerase to incorporate the specific dideoxynucleotide.
SLOSTffM SHM (ME 26) WO 96/12042 220188 j PCT/US95/120 PolymeraseddNTP N M D R~
T7 DNA ddGTP 8 -0.375 2.2 0.813 poly-merase ddATP 6 -0.356 2.3 0.996 ddTTP 5 -0.450 1.8 0.997 ddCTP 8 -0.317 2.7 0.889 E. Qoli ddGTP 5 -1.03 0.56 0.999 .DNA poly-merase I
ddATP 5 -0.860 0.72 0.998 ddTTP 5 -1.06 0.54 1.000 ddCTP 6 -0.842 0.75 1.000 Taq DNA ddGTP 5 -1.18 0.45 0.995 poly-merase ddATP 6 -0.997 0.59 0.997 ddTTP 6 -1.01 0.56 0.996 ddCTP 4 -1.44 0.32 0.996 Averages:
T7 DNA 4 ddNTP 2.3 .924 poly-merase E. coli 4 ddNTP 0.64 .999 DNA poly-merase I
Taq DNA 4 ddNTP 0.48 .996 poly-merase In summary, T7 DNA polymerase discriminates an average of 2.3 fold against ddNTPs, while E. coli DNA
Si16SnTtlTE SHEET (RULE 26) polymerase I F762Y and Taq DNA polymerase F667Y actually prefer ddNTPs over dNTPs an average of 1.6 fold (1/0.64) and 2.1 fold (1/0.48), respectively. A comparison of Rz indicates that the intensity of neighboring fragments are more uniform with E. coli DNA polymerase I F762Y and Taq DNA polymerase F667Y than with T7 DNA polymerase. For a more accurate measure of uniformity, a greater number of fragments could be included in the analysis by reducing the level of ddNTPs (for example by 5 fold) in each reaction, reducing the decay in intensity at each position (see Example 13).
To determine the amount of discrimination against ddNTPs by a new DNA polymerase, reactions analogous to those described above would be carried out, and identical reactions would be carried out in parallel using T7 DNA
polymerase (SEQUENASE Version 2.0, United States Biochemical Corporation), with all reactions analyzed on the same gel. An initial comparison of the distribution of dideoxy-terminated bands obtained with the new DNA
polymerase compared with those obtained with T7 DNA
polymerase would indicate whether the new DNA polymerase discriminated more or less against ddNTPs than T7 DNA
polymerase. For example, such a visual inspection using E. coli DNA polymerase I F762Y clearly shows that for reactions with each of the 4 ddNTPs, the number of fragments visible on the gel in reactions using E. coli DNA polymerase I F762Y are less (and smaller in average size) than those using T7 DNA polymerase. A more quantitative analysis could then be carried out analogous to that described above in order to calculate the exponential decay factor (M), average relative rate of utilization of dNTPs relative to dNTPs (D) and variability in intensity (R2) for the new DNA polymerase as described above.
One complication that can occur in this test is when the DNA polymerase has an associated exonuclease activity, such as the 5' to 3' exonuclease activity associated with Taq DNA polymerase and the 3' to 5' exonuclease activity associated with E. coli DNA
SUBS11TM SHEET (RULE 26) 220188~
polymerase I and native T7 DNA polymerase (not the 28 T7 DNA polymerase deletion mutant used in the experiments above). A 5' to 3' exonuclease activity is detrimental since it can remove the label on the 5' end of the primer, reducing the radioactivity signal being detected.
This problem can be partially avoided by reducing the amount of DNA polymerase in the reaction mixture. In the example above, 0.025 units of Taq DNA polymerase resulted in virtually all of the primers being extended until terminated by incorporation of a dideoxynucleotide, without appreciable loss in radioactivity due to 5' to 3' .exonuclease activity, whereas a 40 fold increase in Taq DNA polymerase activity, or 1 unit per reaction, resulted in the loss of virtually all 32P from the 5' ends of the primer. An alternative approach to measuring the extent of discrimination for a DNA polymerase with a 5' to 3' exonuclease activity is to use a different assay such as those described in Examples 8-10.
A 3' to 5' exonuclease activity can complicate the assay described above by making the DNA polymerase appear to discriminate more against a ddNTP than it actually does (for example, see Tabor and Richardson, 86 Proc.
Nat1. Acad. Sci. 4076, 1987) This is because once a dideoxynucleotide has been incorporated, the exonuclease activity can preferentially remove the dideoxynucleotide so that DNA synthesis can continue, resulting in an increase in the length of the fragment. Preferably, the enzymes assayed in the test described above are devoid of such 3' to 5' exonuclease activity; examples are modified T7 DNA polymerase (SequenaseO, United States Biochemical Corporation), Taq DNA polymerase, exonuclease-deficient Vent (Thermococcus litoralis) DNA polymerase (New England Biolabs catalog number 257), exonuclease-deficient Deep Vent (Pyrococcus GB-i) DNA polymerase (New England Biolabs catalog number 259 ), exonuclease-deficient Pfu (Pyrococcus furiosus) DNA polymerase (Stratagene catalog number 600163), and exonuclease-deficient Klenow fragment (E. coli DNA polymerase I, United States Biochemical Corporation, catalog number 70057). In some instances, StJB5T1ME SHEET (RUI.E 26) such as with E. coli DNA polymerase I(Klenow fragment) the 3' to 5' exonuclease activity is weak and does not interfere significantly with this assay (see for example Tabor and Richardson 264 J. Biol. Chem. 6447, 1989). One method to determine whether a new DNA polymerase being tested has a 3' to 5' exonuclease activity that is interfering with the ability to accurately measure the discrimination against ddNTPs is to carry out the experiment described above, removing aliquots at different time points up to 60 min. If the size distribution of the dideoxy-terminated fragments increases with time, then it is likely that such a 3' to 5' exonuclease activity is interfering with the assay, while if the distribution of fragments is constant over time then such an activity is not having a significant effect. If the average fragment length is increasing with time, then one should use a shorter incubation time and/or decrease the DNA polymerase concentration to a range in which the fragment sizes remain constant with time.
Pyrophosphorolysis, or the reversal of the polymerase reaction, can have a similar effect as the 3' to 5' exonuclease activity, allowing the DNA polymerase to remove the chain terminating dideoxynucleotide and further increase the length of the fragments (see Tabor and Richardson, 265 J. Biol. Chem. 8322, 1990). This activity is readily avoided by including pyrophosphatase in the reaction mixture, in order to remove the pyrophosphate that accumulates during DNA synthesis and is a necessary substrate for pyrophosphorolysis.
Fxam,nlP 7. Gel electroyhoresis-based determination of the rate of incorporation of dideoxvnucleotides relative to deoxynucleotides by varving the ratio of dNTPs to dd_NTPs This example is similar to that described in Example 6. While it is the preferred test for DNA polymerases SUBSTITUTE SHEET (RULE 26) 88~
WO 96/12042 22a 1PCT/US95/12-0 that discriminate strongly against the incorporation of dideoxynucleotides (e.g. T7 DNA polymerase Y526F, E. coli DNA polymerase I and Taq DNA polymerase), it also works well with DNA polymerases that efficiently incorporate ddNMPs (e.g. T7 DNA polymerase, E. coli DNA polymerase I mutant F762Y and Taq DNA polymerase mutant F667Y). In this test, the ratio of ddNTP to dNTP is varied for two different DNA polymerase preparations, keeping all other aspects of the reactions identical, and the distributions of dideoxy-terminated radioactively labeled fragments are compared to determine the ratios required for the two DNA
polymerases being tested to obtain fragments of comparable average length.
The average length of a series of fragments is determined in one of two ways. In the first, which is best for DNA polymerases that incorporate ddNMPs efficiently, one inspects the autoradiograph and determines the position of the largest fragments visible on a given exposure for a series of reactions containing ddNTP:dNTP ratios that vary by two-fold increments using one DNA polymerase, and compares that to an analogous series using the second DNA polymerase, to determine the ratios required to generate fragments of comparable size for the two DNA polymerases. The position of the front marking the appearance of visible radioactive bands is usually relatively sharp and readily observed by eye.
However, it is also possible to determine such positions more precisely using the Phosphoimager to locate the position in each lane where a certain threshold of radioactivity per unit area occurs, starting at the top of the gel and moving down the gel.
Some DNA polymerases discriminate very strongly against the incorporation of dideoxynucleotides, in which case it is difficult to add sufficient ddNTPs to the reaction to clearly detect the position of the largest dideoxy-terminated fragments on a denaturing polyacrylamide gel. For such DNA polymerases, one can use an alkaline agarose gel electrophoresis to compare the lengths of the dideoxy-terminated fragments in the "SpTUTE SKET (RUt.E M
different series. If one uses a denaturing polyacrylamide gel, then an alternative method to determine the ratios of ddNTP:dNTP required for the two DNA polymerases to generate dideoxy-terminated fragments 5 of comparable average lengths is to focus on one or several bands and determine the ratio of ddNTPs to dNTPs required to obtain a specific level of radioactivity in those fragments, as analyzed by the Phosphoimager, for the two DNA polymerases being tested.
10 These tests were carried out using the six DNA
polymerases described in Example 6. The reaction conditions were identical to that described in Example 6 except for the concentrations of dNTPs and ddNTPs. All reaction mixtures contained 10 M 4 dNTPs. Each of the 15 four ddNTP concentrations were varied by two-fold increments in the following ranges for the six DNA
polymerases as follows: T7 DNA polymerase, E. coli DNA
polymerase I F762Y, and Taq DNA polymerase F667Y , 0.02 M to 1 M, and T7 DNA polymerase Y526F, E. coli DNA
20 polymerase I and Taq DNA polymerase F667Y, 100 to 2,000 M. The reactions were carried out and the samples were analyzed by denaturing polyacrylamide gel electrophoresis as, described in Example 6. Drying of the gel, autoradiography, and Phosphoimager analysis were as 25 described in Example 6. The Table below summarizes the results from this experiment; the values shown for T7 DNA
polymerase, E. coli DNA polymerase I F762Y, and Taq DNA
polymerase F667Y are the absolute ratios obtained in Example 6 by statistical analysis of the rate of 30 exponential decay in intensity of dideoxy-terminated fragments obtained using a 1:1 ratio of dNTPs to ddNTPs.
The values obtained for T7 DNA polymerase Y526F, E. coli DNA polymerase I and Taq DNA polymerase were obtained by determining the ratios of ddNTP to dNTP required to 35 generate a series of dideoxy-terminated fragments of comparable average length to a series generated using T7 DNA polymerase, E. coli DNA polymerase I F762Y, and Taq DNA polymerase F667Y, respectively; i.e., for each pair of wild-type and mutant DNA polymerases the ratios of SUgST1TtlTE SHEET (RULE 26) ddNTPs:dNTPs were determined that give a comparable distribution of dideoxy-terminated fragments. The ddNTP:dNTP ratio used in the reaction with the strongly discriminating enzyme (i.e. the one that contains phenylalanine at the critical position) divided by the ddNTP:dNTP ratio used to obtain a comparable distribution of dideoxy-terminated fragments with the relatively non-discriminating enzyme (i.e. the one that contains tyrosine at the critical position) gives a factor that corresponds to the difference in efficiency between the two DNA polymerases in their use of ddNTPs relative to the comparable dNTP. This factor was multiplied by the absolute ratios obtained for T7 DNA polymerase, E. coli DNA polymerase I F762Y, and Taq DNA polymerase F667Y in Example 6 in order to obtain the values shown below for T7 DNA polymerase Y526F, E. coli DNA polymerase I and Taq DNA polymerase, respectively.
SUBSTITUTE SHEET (RULE 26j Polymerase Incorporation Rate Ratios dG/ddG dA/ddA dT/ddT dC/ddC
polymerase 3.2 3.3 2.8 3.7 polymerase Y526F 6,400 7,300 8,400 11,000 E. coli DNA
polymerase I 140 720 1,100 2S0 E. coli DNA
polymerase I
F762Y 0.56 0.72 0.54 0.75 Taq DNA
polymerase 1,400 4,700 4,500 2,600 Taq DNA
polymerase F667Y 0.45 0.59 0.56 0.32 The Table below summarizes the effect of having tyrosine in place of phenylalanine at the critical selectivity residue of T7 DNA polymerase, E. coli DNA
polymerase I and Taq DNA polymerase on the use of ddNTPs relative to dNTPs.
Residue Average Rate Improvement use of in dN/ddN ddNTPs T7 DNA Tyrosine (WT) 3.0 3,000 X
polymerase Phenylalanine 8,000 E. coli DNA Phenylalanine (WT~ 600 polymerase I Tyrosine 0.6 1000 X
$tMTUTE SHEET (RULE 2S, 2201,8.-85' Taq DNA Phenylalanine (WT) 3,000 polymerase Tyrosine 0.5 6,000 X
To use this test to determine the extent of discrimination of a new DNA polymerase, reactions would be carried out as described above initially using a wide range of ratios of ddNTPs to dNTPs, and comparing the distribution of dideoxy-terminated fragments on a denaturing polyacrylamide gel to those of a standard, e.g. T7 DNA polymerase. Matching the lanes that have comparable average lengths of DNA fragments, the ratio of ddNTPs:dNTPs of the new DNA polymerase is divided by the ratio used with T7 DNA polymerase to give the level of discrimination against ddNTPs by the new DNA polymerase relative to T7 DNA polymerase.
To use this test to determine whether the modification of a DNA polymerase results in a decrease in its ability to discriminate against ddNTPs (i.e.
incorporate dideoxynucleotides more efficiently), an identical number of units of modified and unmodified DNA
polymerases would be used in a series of reactions containing varying ratios of ddNTPs to dNTPs as described above. The average length of dideoxy-terminated fragments are compared for identical ratios of ddNTPs to dNTPs for the two enzymes. If the modification has resulted in a DNA polymerase that incorporates dideoxynucleotides more efficiently, the average length of dideoxy-terminated fragments will be shorter for reactions using the modified DNA polymerase compared with those using the unmodified DNA polymerase at the same ratios of ddNTP to dNTP, while the average length will be longer for reactions using the modified DNA polymerase if the modification resulted in a DNA polymerase that is more discriminatory towards ddNTPs.
This test can also be used to determine whether a modification of a DNA polymerase results in a decrease in its ability to discriminates against analogs of ddNTPs, for example fluorescently tagged ddNTPs. This is SUBSTIME SHFET (RUl.E 26) possible even if one does not: know the concentration of the analogs being tested. As an example of this, we compared the ability of Taq DNA polymerase and Taq DNA
polymerase F667Y to use each of the four DyeDeoxy Terminators manufactured by Applied BiosystemsTN (part number 401150). These DyeDeoxy Terminators have four different fluorescent moieties covalently bound to each of the four ddNTPs (see Example 12 for more detail). For each of the DyeDeoxy Terminat:.ors, the ratio of dNTPs to DyeDeoxy Terminators was varied over a 16,000 fold range by intervals of two-fold, and the pattern of dideoxy-terminated fragments was compared on the autoradiograph to determine the ratios required for each of the two enzymes to obtain the same average length of dideoxy-terminated fragments. The Table below summarizes these results. For each Terminator,, the column labeled "Rat.io"
represents the ratio of the ratios of ddNTP to dNTP
required to give fragments of identical average length for Taq DNA polymerase versus Taq DNA polymerase F667Y.
As with normal ddNTPs, I'aq DNA polymerase F667Y
incorporates the fluorescent ddNTP derivatives much more efficiently that does the unmodified Taq DNA polymerase, by at least a factor of 400.
DyeDeoxy Terminator Ratio G Terminator >400 A Terminator >2,000 T Terminator >2,000 C Terminator >2,000 As discussed previously, one complication that can arise in the use of this test. is when the DNA polymerase being tested has an associazed exonuclease activity. The problems that 5' to 3' and 3' to 5' exonucleases can cause and ways to minimize their effects when present are discussed in Example 6. When testing to determine whether the modification of a polymerase decreases its ability to incorporate dideoxynucleotides, one class of WO 96/12042 2201 U~ j PCT/US95/120 mutants that can have this effect are ones that inactivate a normally very active 3' to 5' exonuclease activity (see for example Tabor and Richardson 84, Proc.
Natl. Acad. Sci. 4767, 1987). This class of mutants are 5 not claimed in this patent. If one has a modified DNA
polymerase that gives an apparent increase in the ability of the DNA polymerase to incorporate dideoxynucleotides, and one wants to determine whether it is in the polymerase domain or the exonuclease domain, it is 10 necessary to carry out an exonuclease assay on the modified and unmodified forms of the enzyme; a mutation .that affects primarily the exonuclease activity of the enzyme will have a greater effect on the exonuclease activity of the enzyme than on the polymerase activity.
15 Preferably, one would measure the exonuclease activity on a DNA substrate labeled at its 3' end with 32P-ddAMP (see Example 21). As in Example 6, it is important to inhibit pyrophosphorolysis in these reactions in order to avoid it increasing the apparent discrimination against ddNTPs 20 by a DNA polymerase. This is readily accomplished by including pyrophosphatase in the reaction.
Example Q. Determination of the efficiency of incorporation of dideoxynucleotides by inhibition of DNA
25 synthesis on a single-stranded M13 DNA - unl beled 40-mer primer complex In this example the sensitivity of a DNA polymerase to a ddNTP is determined by measuring the ability of varying concentrations of the ddNTP to inhibit a standard 30 DNA synthesis reaction. The DNA synthesis assay is a modification of that described in Tabor and Richardson 264 J. Biol. Chem. 6447, 1989. The 40-mer primer and the M13 mGPl-2 template are as described in Example 3.
The primer is annealed to the M13 mGPl-2 single-stranded 35 DNA template in a reaction mixture (1X = 25 1) containing 2 g of M13 mGPl-2 DNA, 6 pmoles of primer (a 10-fold molar excess to template), 40 mM Tris-HCl, pH
8.0, 10 mM MgC12 and 100 mM NaCl. The mixture is incubated 65 C for 2 min and then cooled to room SUBSTiTUTE SHEET (R111.E 26) '2~2U1885 temperature over 30 min. The standard reaction mixture (45 1) contained 22 mM Tris-HCl, pH 8.0, 5.5 mM MgC1a1 55 mM NaCl, 300 M dGTP, dATP, dCTP and [3H] TTP (30 cpm/pmol), and varying concentrations of one of the four or all four ddNTPs. The reaction mixtures also contained ng of yeast inorganic pyrophosphatase to inhibit pyrophosphorolysis that could otherwise increase the apparent discrimination by the DNA polymerase (Tabor and Richardson 265 J. Biol. Chem. 8322, 1990). The mixtures 10 are incubated at 37 C for 1 min (70 C for thermophilic DNA polymerases), and the reactions are initiated by the addition of 5 1 aliquots of dilutions (0.01 to 1 unit) of the DNA polymerase being analyzed, diluted in 20 mM
Tris=HC1, pH 7.5, 10 mM 2-mercaptoethanol and 0.05%
bovine serum albumin. The reaction mixtures are further incubated at 37 C for 10 min (70 C for thermophilic DNA
polymerases). The reactions are terminated by the addition of 5 1 of 100 mM EDTA, and 45 1 is spotted onto Whatman DE81 filter discs. The discs are washed in 4 changes of 150 ml of 0.3 M ammonium formate, p H 8.0, followed by 2 changes of 150 ml of 90% ethanol, each of 5-10 min duration. The disks are then dried under a heat lamp and counted in a scintillation counter in the presence of 5 ml of fluor (Opti-Fluor 0, Packard). From the amount of radioactivity on each disk, the amount of total DNA synthesis is calculated.
Specific DNA polymerases being tested may have optimum buffer, pH, salt, or temperature conditions that differ from those suggested above. Each DNA polymerase should be tested under the conditions that give optimum specific polymerase activity for that enzyme.
To determine whether a modification of a given DNA
polymerase results in a decrease in its ability to discriminate against dideoxynucleotides, first a series of reactions are carried out in the absence of ddNTPs, varying the DNA polymerase concentration to determine the range where the activity varies approximately linearly with enzyme concentration for both the modified and unmodified forms of the enzyme. An enzyme concentration SUBSTIME %IEET f RUl.E 261 WO 96/12042 2201885 PCT/US95/12*
is chosen that is in this linear range for both forms of the enzyme; e.g., an enzyme concentration whereby about 30!k of the template is replicated in the 10 min reaction is likely to be in such a linear range.
Once a proper enzyme concentration is chosen, a series of reactions are carried out varying the amount of either one ddNTP or preferably all four ddNTPs in the mixture, in order to determine the concentration required to inhibit 50; of the DNA synthesis. For example, under the conditions stated above (300 M 4dNTPs), the following concentrations of a mixture of 4 ddNTPs are required to inhibit 50%- of the DNA synthesis for the following six DNA polymerases:
Polymerase [4ddNTP] for 50o inhibition T7 DNA polymerase 0.1 M
T7 DNA polymerase Y526F 300 AM
E. coli DNA polymerase I 20 M
E. coli DNA polymerase I F762Y 0.04 AM
Taq DNA polymerase 150 M
Taq DNA polymerase F667Y 0.4 M
This test can be used to determine if a modification in a new DNA polymerase results in a decrease in its 5 ability to discriminate against ddNTPs; if the mutation does have this effect, then a higher concentration of 4ddNTPs will be required to inhibit 50% of the DNA
synthesis in the assay described above.
10 Example 9. Determination of the efficiency of incor-poration of dideoxynucleotides by measurina the incorporation of [a-32 P1 dAMP into synthetic iprimer-template complexes S1IBSTITUTE SHEET (RULE 26) :~ 201885 In this example the competition between a dNTP and a ddNTP is assayed for use at a single site in a synthetic primer-template. This assay differs from the others in that it limits the comparison of the use of the two substrates to a single site, avoiding the complication of sequence-specific variability in discrimination. While this relatively simple assay is suitable for a preliminary screen of DNA polymerases for their ability to discriminate against ddNTPs, it should not be used to the exclusion of the assays presented in Examples 6-8 since often the discrimination against ddNTPs is strongly influenced by the neighboring sequences, an important problem for DNA sequence analysis (see for example Tabor and Richardson 265 J. Biol. Chem. 8322, 1990).
The two primer-templates shown below are used in this example. The first is used to determine the discrimination between dATP versus ddATP, while the second is used to determine the discrimination between dCTP versus ddCTP, dTTP versus ddTTP, and dGTP versus ddGTP.
Primer-Template A:
5' GGCGACGTTGTAAAACGACGGCCAGTGCCA 3' 3' GCTGCAACATTTTGCTGCCGGTCACGGTTCCCC 5' Primer-Template B:
5' GGCGACGTTGTAAAACGACGGCCAGTGCCA 3' 3' GCTGCAACATTTTGCTGCCGGTCACGGTCAGTTTT 5' Each reaction mixture contains 25 pmoles each of primer and template. The primer and template are mixed together and annealed in a reaction mixture (1X = 10 1) containing 40 mM Tris=HCl, pH 8.0, 10 mM MgC12 and 100 mM
NaCl. The mixture is incubated 65 C for 2 min and then SUSSTIME SHEET (RULE 26) WO 96112042 2201885 PCT/US95/12*
cooled to room temperature over 30 min. The standard reaction mixture (45 1) for reactions carried out with Primer-Template A contains 22 mM Tris=HC1, pH 8.0, 5.5 mM
MgC12, 55 mM NaCl, 25 pmoles of the Primer-Template A
complex, 5 M[a-32P] dGTP (4,000 cpm/pmole) and varying concentrations of dATP and ddATP. The reaction mixtures also contained 10 ng of yeast inorganic pyrophosphatase to inhibit pyrophosphorolysis that could otherwise increase the apparent discrimination by the DNA
polymerase (Tabor and Richardson 265 J. Biol. Chern. 8322, 1990). The mixtures are incubated at 37 C for 1 min (70 C for thermophilic DNA polymerases), and the reactions are initiated by the addition of 5 1 aliquots (0.01 to 1 unit) of the DNA polymerase being analyzed diluted in 20 mM Tris-HCl, pH 7.5, 10 mM 2-mercaptoethanol and 0.05% bovine serum albumin. The reaction mixtures are further incubated at 37 C for 10 min (70 C for thermophilic DNA polymerases). The reactions are terminated by the addition of 5 1 of 100 mM EDTA, and 45 1 is spotted onto Whatman DE81 filter discs. The discs are washed in 4 changes of 150 ml of 0.3 M ammonium formate, p H 8.0, followed by 2 changes of 150 ml of 90% ethanol, each of 5-10 min duration. The disks are then dried under a heat lamp and counted in a scintillation counter in 5 ml of fluor (Opti-Fluor 0, Packard). From the amount of radioactivity on each disk, the amount [31P]dGMP incorporated was determined. The assumption is made once a single dAMP residue has been incorporated to remove the block for the incorporation of dGMP residues, four [32PJdGMPs will be incorporated into each primer, and thus the number of dAMPs incorporated are one fourth the number of dGMPs incorporated.
All reactions are carried out with a constant amount of the DNA polymerase being analyzed; the amount of DNA
polymerase should be sufficient to replicate 50% of the total dCMP residues in the single-stranded region of the template in the 10 min incubation in the presence of 10 M dATP and the absence of ddATP. Specific DNA
polymerases being tested may have optimum buffer, pH, SUSSf INTE $HEET (RULE 2%
salt, or temperature conditions that differ from those suggested above. Each DNA polymerase should be tested under the conditions that give optimum specific polymerase activity for that enzyme. Control reactions 5 should also be carried out in which neither dATP nor ddATP are present; this defines the background DNA
synthesis that should be subtracted from each sample. In general this is less than 10% of the DNA synthesis obtained when dATP is present.
10 Reactions are then carried out with 10 M dATP and varying concentrations of ddATP, to determine the amount of ddATP required to inhibit DNA synthesis by 50%.
Examples are shown in the Table below for the concentration of ddATP required to inhibit 50% of the DNA
15 synthesis in the presence of 10 M dATP. The polymerases are defined as in Example 6.
Polymerase [ddATP] for 50o inhibition T7 DNA polymerase 30 M
T7 DNA polymerase Y526F >500 M
E. coli DNA polymerase I >500 M
E. coli DNA polymerase I F762Y 6 M
Taq DNA polymerase >500 M
Taq DNA polymerase F667Y 5 M
In order to carry out an analogous test measuring the discrimination against ddGTP, ddTTP or ddCTP, reactions are carried out identical to that described above except that Primer-Template B is used instead of Primer-Template A, and the reactions containing 10 M dGTP, dTTP and dCTP
and 5 M [a-32P] dATP (4,000 cpm/pmole) and varying concentrations of either ddGTP, ddTTP or ddCTP.
As with the other examples, DNA polymerases with a 3' to 5' exonuclease activity can interfere with this assay, MOTM SHEET (RU1.E 26) .
WO 96/12042 22" 16r p ~ J CT/iJS95/1290 making an enzyme more discriminatory against ddNTPs than that due to discrimination at the level of incorporation of the analog. In addition, enzymes with high levels of exonuclease activity can use up all the dNTPs in the reaction mixture (especially with the relatively low level of dNTPs present in these reactions), resulting in no net DNA synthesis (e.g. the native T7 DNA polymerase, see Tabor and Richardson 264 J. Biol. Chem. 6447, 1989).
In these cases the concentration of DNA polymerase and the incubation time of the reaction should be adjusted to obtain the maximum level of DNA synthesis in the absence of ddNTPs.
Example 10, De rmi nat-i nn of the effiCi E'nr'v of incorporation [a-32 Pl ddNMps into a ynthPt i c- primer-template complex In this example the competition between a dNMP and a ddNMP is assayed for incorporation at a single site in a synthetic primer-template. This assay differs from that in Example 9 in that the label is in [a-32P] ddATP, and thus the incorporation of ddAMP is being measured. This assay can be used to test whether a ddNTP is inhibiting a DNA polymerase by acting as a chain terminator, being incorporated into the 3'.end of the primer, or simply by binding the DNA polymerase and preventing further DNA
synthesis without actually being incorporated into the primer.
In the example below the incorporation of ddAMP is measured using [a-32P] ddATP and Primer-Template A (Example 9) :
Primer-Template A:
5' GGCGACGTTGTAAAACGACGGCCAGTGCCA 3' 3' GCTGCAACATTTTGCTGCCGGTCACGGTTCCCC 5' Incorporation of [a-32P] ddGMP, [a- 3:p ] ddCMP and .[a-32P]ddTMP could be similarly tested on the appropriate template (for example, Primer-Template B in Example 9).
StJMIME SHEET (RUIf 26) Each reaction mixture contains 25 pmoles each of primer and template (Primer-Template A, see above). The primer and template are mixed together and annealed in a reaction mixture (1X = 10 1) containing 40 mM Tris=HCl, pH 8.0, 10 mM MgClz and 100 mM NaCl. The mixture is incubated 65 C for 2 min and then cooled to room temperature over 30 min. The standard reaction mixture (45 1) contains 22 mM Tris=HCl, pH 8.0, 5.5 mM MgC12, 55 mM NaCl, 25 pmoles of the Primer-Template A complex, 2.5 M [a-32P]ddATP (Amersham PB10235, >5000 Ci/mmol diluted with cold ddATP to a specific activity of 4,000 cpm/pmole) and varying concentrations of dATP. The reaction mixtures also contained 10 ng of yeast inorganic pyrophosphatase to inhibit pyrophosphorolysis that could otherwise increase the apparent discrimination by the DNA
polymerase (Tabor and Richardson 265 J. Biol. Chem. 8322, 1990). The mixtures are incubated at 37 C for 1 min (70 C for thermophilic DNA polymerases), and the reactions are initiated by the addition of 5 1 aliquots (0.01 to 1 unit) of the DNA polymerase being analyzed diluted in 20 mM Tris-HC1, pH 7.5, 10 mM 2-mercaptoethanol and 0.05%- bovine serum albumin. The reaction mixtures are further incubated at 37 C for 10 min (70 C for thermophilic DNA polymerases). The reactions are terminated by the addition of 5 1 of 100 mM EDTA, and 45 1 is spotted onto Whatman DE81 filter discs. The discs are washed in 4 changes of 150 ml of 0.3 M ammonium formate, p H 8.0, followed by 2 changes of 150 ml of 90 s ethanol, each of 5-10 min duration. The disks are then dried under a heat lamp and counted in a scintillation counter 5 ml of fluor (Opti-Fluor 0, Packard). From the amount of radioactivity on each disk, the amount [32P]ddAMP incorporated is determined.
All reactions are carried out with a constant amount of the DNA polymerase being tested; the amount of DNA
polymerase should be that concentration which gives the highest level of incorporation of [32P] ddAMP into Primer-Template A in the 10 min incubation in the absence of dATP. Specific DNA polymerases being tested may have SI~SfITIJjE SHEET (RUL.E 2%
WO 96/12042 PCT/US95/1291&
optimum buffer, pH, salt, or temperature conditions that differ from those suggested above. Each DNA polymerase should be tested under the conditions that give optimum specific polymerase activity for that enzyme.
In order to use this assay to determine the level of discrimination against a ddNTP, reactions are carried out with a constant amount of the DNA polymerase and [32P]ddATP, in the presence or absence of 2.5 M dATP (an equimolar concentration to [32P]ddATP) and the effect the presence of dATP has on the incorporation of [32P]ddAMP
is determined. If a DNA polymerase does not discriminate between the incorporation of ddAMP and dAMP, and it has no 3' to 5' exonuclease activity, then the addition of dATP will inhibit incorporation of [32P]ddAMP by 501.1r.
This test is best for a DNA polymerase that efficiently incorporates ddNMPs, such as Taq DNA
polymerase F667Y. For DNA polymerases that discriminate strongly against ddNMPs, the previous assay is preferred, in which the label is in a dNTP other than the one being used in competition with the ddNTP, since in that case much higher concentrations of the ddNTP can be used.
However, with DNA polymerases that discriminate strongly against ddNMPs, if one is interested in testing whether a given mutation is reducing the level of discrimination against ddNMPs, this assay could be used by assaying the unmodified DNA polymerase on this substrate in the absence of dATP (measuring the incorporation of [32P]ddAMP
as a function of DNA polymerase concentration), and comparing the rate of incorporation to that of the mutant enzyme. If the mutation is reducing the discrimination against ddATP, then the mutant enzyme should have a higher specific activity for incorporation of [32P] ddAMP .
As with the other examples, DNA polymerases with a 3' to 5' exonuclease activity can interfere with this assay, making an enzyme more discriminatory against ddNTPs than that due to discrimination at the level of incorporation of the analog. And as in Example 9, enzymes with high levels of exonuclease activity can deplete all the dNTPs, resulting in no net incorporation of [32P]ddAMP. In SUBSTIME SHEET (RUi.E 26) these cases the concentration of DNA polymerase and the incubation time of the reaction should beadjusted to obtain the maximum level of incorporation of [32P] ddAMP
by the DNA polymerase being tested.
All of the above methods are based on radioactivity to detect either the length of the extended primer or the amount of DNA synthesis on the primer. The efficiency of incorporation of dideoxynucleotides by a DNA polymerase can also be measured nonradioactively. Two examples are presented below that make use of either fluorescent primers or fluorescent dye-dideoxy terminators that are detected on an Applied Biosystems Model 373A DNA
Sequencing System.
Example 11. Determination of the efficiency of incorporation of dideoxynucleotides using a fluorescent primer annealed to single-stranded DNA and gel electrophoresis In this example a fluorescently labeled primer is annealed to single-stranded DNA and DNA synthesis reactions are carried out using varying ratios of dNTPs to ddNTPs. The samples are then loaded onto an Applied Biosystems Model 373A DNA Sequencing System, and the length of each fluorescent fragment is determined by direct fluorescent detection during gel electrophoresis.
Reactions are carried out as described in Tabor and Richardson 265 J. Biol. Chem. 8322, 1990. The fluorescent primer used is "Fam" primer (Applied Biosystems). The DNA used is single-stranded M13 mGPl-2 DNA as described in Example 3. The primer is annealed to the M13 mGPl-2 single-stranded DNA template in a reaction mixture (1X = 10 1) containing 2 g of M13 mGPl-2 DNA, 5 ng of primer, 40 mM Tris-HCl, pH 8.0, 10 mM
MgC12 and 100 mM NaCl. The mixture is incubated 65 C
for 2 min and then cooled to room temperature over 30 min. The standard reaction mixture (18 1) contains 22 mM Tris=HC1, pH 8.0, 5.5 mM MgC12, 55 mM NaCl, and varying concentrations of the 4dNTPs and one of the four ddNTPs.
The reaction mixtures also contained 10 ng of yeast SIBSTINfE SHEET (RULE 26) 9(l inorganic pyrophosphatase to inhibit pyrophosphorolysis that could otherwise increase the apparent discrimination by the DNA polymerase (Tabor and Richardson 265 J. Biol.
Chem. 8322, 1990). The mixtures are incubated at 37 C
5 for 1 min (70 C for thermophilic DNA polymerases), and the reactions are initiated by the addition of 2 1 aliquots (0.01 to 1 unit) of the DNA polymerase being analyzed diluted in 20 mM Tris-HC1, pH 7.5, 10 mM 2-mercaptoethanol and 0.05% bovine serum albumin. The 10 reaction mixtures are further incubated at 37 C for 10 min (70 C for thermophilic DNA polymerases). The reactions are by the addition of 8 1 of 20 mM DTPA, 1 M potassium acetate, pH 5.0, and 60 1 of ethanol. After centrifugation, the DNA is resuspended in 6 1 of 80~;
15 formamide, 10 mM Tris=HC1, pH 8.0, and 1 mM EDTA, and heated at 80 C for 2 min immediately prior to loading on the polyacrylamide gel on the Applied Biosystems 373A DNA
Sequencing System following the manufacturer's procedures.
20 Specific DNA polymerases being tested may have optimum buffer, pH, salt, or temperature conditions that differ from those suggested above. Each DNA polymerase should be tested under the conditions that give optimum specific polymerase activity for that enzyme. The 25 concentration of DNA polymerase should be sufficient to extend most of the primers at least several hundred nucleotides or until a dideoxynucleotide has been incorporated in the 10 min reaction.
The ratio of dNTPs to ddNTPs is adjusted to obtain 30 optimum peak intensities for approximately three hundred bases. For example, for Taq DNA polymerase approximately 10 M 4dNTPs and 200-600 M ddNTPs is optimal, while for Taq DNA polymerase F667Y approximately 300 M 4dNTPs and 0.5-5 M ddNTPs is optimal.
35 To determine whether a modification of a given DNA
polymerase results in a decrease in its ability to discriminate against dideoxynucleotides, reactions should be carried out at varying ratios of dNTPs to ddNTPs for both the unmodified and modified DNA polymerases, and the MSTITt1TE SHEET (RULE 26) intensities of dideoxy-terminated fragments of different lengths are compared to determine whether the modified DNA polymerase is using ddNTPs more efficiently than the unmodified enzyme.
.xa le 12. Determination of the efficiency of incorporation of fluorescent dideoxynucleotides by gel electrophoresis In this example a nonfluorescent primer is annealed to single-stranded DNA and DNA synthesis reactions are carried out using varying ratios of dNTPs to a single fluorescently labeled-ddNTP. The samples are then loaded onto an Applied Biosystems Model 373A DNA Sequencing System, and the length of each fluorescent fragment is determined by direct fluorescent detection during gel electrophoresis. The primer used in this example is the 40-mer described in Example 3, and the template is the single-stranded M13 mGPl-2 described in Example 3. The primer is annealed to the M13 mGPl-2 single-stranded DNA
template in a reaction mixture (1X = 10 1) containing 2 g of M13 mGPl-2 DNA, 6 pmoles of primer (a 10-fold molar excess to template), 40 mM Tris=HC1, pH 8.0, 10 mM MgC12 and 100 mM NaCl. The mixture is incubated 65 C for 2 min and then cooled to room temperature over 30 min. The standard reaction mixture (18 1) contains 22 mM
Tris-HCl, pH 8.0, 5.5 mM MgC12, 55 mM NaCl, and varying concentrations of the 4dNTPs plus one of the four fluorescently-labeled ddNTPs. The four fluorescently labeled ddNTPs are from Applied Biosystems (Taq DyeDeoxy Terminator Cycle Qequencing Kit, part number 401150), and are referred to as G, A, T or C "DyeDeoxy Terminators"
(manual for Taq DyeDeoxy Terminator Cycle Sequencing Kit, part number 901497, Rev. E). The reaction mixtures also contained 10 ng of yeast inorganic pyrophosphatase to inhibit pyrophosphorolysis that could otherwise increase the apparent discrimination by the DNA polymerase (Tabor and Richardson 265 J. Bio1. Chem. 8322, 1990). The mixtures are incubated at 37 C for 1 min (70 C for thermophilic DNA polymerases), and the reactions are gMSTiNfE SHEET (RULE 26) ~~;~
WO 96/12042 2201PCT/US95/1291&
initiated by the addition of 2 1 aliquots (0. 01 to 1 unit) of the DNA polymerase being analyzed diluted in 20 mM Tris-HC1, pH 7.5, 10 mM 2-mercaptoethanol and 0.05%
bovine serum albumin. The reaction mixtures are further incubated at 37 C for 10 min (70 C for thermophilic DNA
polymerases). The reactions are by the addition of 8 1 of 20 mM EDTA, 1 M potassium acetate, pH 5.0, and 60 1 of ethanol. After centrifugation, the DNA is resuspended in 6 1 of 80o formamide, 10 mM Tris-HCl, pH
8.0, and 1 mM DTPA, and heated at 80 C for 2 min immediately prior to loading on the polyacrylamide gel on the Applied Biosystems 373A DNA Sequencing System following the manufacturer's procedures .
Specific DNA polymerases being tested may have optimum buffer, pH, salt, or temperature conditions that differ from those suggested above. Each DNA polymerase should be tested under the conditions that give optimum specific polymerase activity for that enzyme. The concentration of DNA polymerase used in these reactions should be that concentration that is sufficient to extend most of the primers at least several hundred nucleotides or until a dideoxynucleotide has been incorporated in the 10 min reaction. For DNA polymerases that have a 5' to 3' exonuclease activity, such as Taq DNA polymerase, the DNA polymerase concentration must be kept low enough to avoid this activity degrading a significant percentage of the 5' ends of the fragments.
To determine whether a DNA polymerase discriminates strongly or weakly against a fluorescent ddNTP, reactions are carried out using 20 M 4dNTPs and 0.01 1 of each DyeDeoxy Terminator provided by Applied Biosystems (part number 401150). When Taq DNA polymerase is used under these conditions, most of the fluorescence is either in unincorporated dye-ddNTPs at the leading front of the gel, or in fragments greater than several hundred bases in length. In contrast, with the Taq DNA polymerase mutant F667Y, under these conditions most of the fluorescence is in fragments that are less than several hundred bases in length, and a significantly lower SIBSTITUTE SHEET (RULE 26) = WO 96/12042 PCTIUS95/12928 percentage of the total fluorescence is in unincorporated dye-ddNTPs at the leading front of the gel.
To determine whether a modification of a given DNA
polymerase results in a decrease in its ability to discriminate against dideoxynucleotides, reactions are carried out at varying ratios of dNTPs to DyeDeoxy Terminators for both the unmodified and modified DNA
polymerases, and the average length of the resulting fluorescent fragments are compared to determine whether the modified DNA polymerase is using the DyeDeoxy Terminators more efficiently than the unmodified enzyme.
The following examples provide tests for determining the uniformity of band intensities produced from dideoxy-terminated fragments synthesized by different DNA
polymerases.
Exam,nl e 13. DetPr,T,; nat; nn of uniformity of incorporat inn of dideoxynuclPnridPs using a single-stranded M13 DNA -5' 32P-labeled 40-mer primer comipleX and gel electrophoresis In this example the uniformity of dideoxynucleotide incorporation is measured on a 5' 32P-end labeled primer extended on a single-stranded M13 DNA template. Three activities can cause variability in band intensity of dideoxy-terminated fragments. One is exonuclease activity that can preferentially at some sequences; this is avoided by removal of the activity selectively either by chemical or genetic means (see for example Tabor and Richardson 264, J. Biol. Chem. 6447, 1989). The second is pyrophosphorolysis; this is readily avoided by including pyrophosphatase in the reaction mixture, which degrades the pyrophosphate that accumulates during DNA
synthesis and is a necessary substrate for pyrophosphorolysis. The third is sequence-specific variability in the incorporation of dideoxynucleotides.
Variability in band intensity is detrimental to DNA
sequence analysis, decreasing the accuracy of the DNA
sequence determined. This test is designed to compare the degree of variability in band intensities in SUBSTiTUTE SHEET (RULE 26) -20,1885 fragments synthesized by different DNA polymerases, including mutant DNA polymerases that may incorporate dideoxynucleotides more efficiently.
The primer, template and reaction conditions in this Example are identical to that described in Examples 6 and 7. The template is M13 mGPl-2 single-stranded DNA
described in Example 3, and the primer is the 40 mer also described in Example 3. Reaction conditions used are those that are optimum for the DNA polymerase being tested with regard to buffer, pH, salts, and temperature of the reaction. It is preferred that magnesium is the only metal ion present in the reaction mixture (i.e. the reactions are carried out in the absence of added manganese ions). A concentration of DNA polymerase is chosen whereby most of the primers are extended in a 10 min reaction and are terminated by the incorporation of a dideoxynucleotide. The ratios of dNTPs to ddNTPs are adjusted for the specific DNA polymerase being tested so that the average fragment size is approximately 100-300 nucleotides. ddCTP is the preferred ddNTP to use for the test of uniformity since fragments terminated with this dideoxynucleotide tend to have the most variability in intensities (see for example Tabor and Richardson 86 Proc. Natl. Acad. Sci. 4076, 1989). Gel electrophoresis, autoradiography, and analysis of the band intensities by either scanning of the gel or phosphoimager analysis are as described in Example 6. Electrophoresis is carried out until fragments of approximately 55 nucleotides in length are at the bottom of the gel (the dye bromphenol blue has run off the bottom of the gel and the dye xylene cyanol is approximately 8 cm from the bottom of the gel).
For a given series of ddNMP-terminated fragments, for example a series of ddCMP-terminated fragments, the intensities of the first 20 fragments from the bottom of the gel are determined, preferably by Phosphoimager analysis. Alternatively, the autoradiograph can be scanned by an imaging densitometer to determine the relative intensities of the first 20 fragments. These intensities are then analyzed statistically as described SUBSTITUTE SHEET (RULE 26) = g WO 96/12042 2201885 pCT/US95/12928 in Example 6 in order to determine their variability.
For example, the values can be plotted using the Macintosh program Kaleidograph Version 3.0 (Synergy Software). The resulting plots are fit to an exponential 5 decay curve using the Kaleidograph library routine for this function. R2, the correlation index for the data, is calculated by the Kaleidograph library routine. This is a measure of the variability in band intensities. The values obtained for R2 using a new DNA polymerase are 10 compared to those obtained using known DNA polymerases, for example _28 T7 DNA polymerase (Sequenase Version 2.0, United States Biochemical Corporation) in the presence of magnesium or manganese (see Tabor and Richardson 265 J.
Biol. Chem. 8322, 1990), E. coli DNA polymerase I
15 (either Klenow fragment or Klenow fragment with the mutation F762Y) or Taq DNA polymerase (either wild-type or the mutant F667Y). The R 2 values obtained with these known DNA polymerases are used as standards by which to compare a new DNA polymerase for its uniformity.
Example 14 Determination of uniformity of incorporation of fa-32P1 ddNMPs using a sincrle-stranded M13 DNA -unlabeled arimer complex and gel electrophoresis In this example the uniformity of dideoxynucleotide incorporation is measured using an unlabeled primer annealed to a single-stranded M13 DNA template and carrying out DNA synthesis in the presence of [a-32P]ddATP. The test described in Example 13 is preferred over this one for measuring uniformity of dideoxynucleotide-terminated fragments, since it is more amenable to use of high concentrations of ddNTPs, which are required for use in enzymes that discriminate strongly against ddNTPs, such as E. coli DNA polymerase I, Taq DNA polymerase and T7 DNA polymerase Y526F. The test in this example is most suited for use with enzymes that incorporate dideoxynucleotides efficiently, such as T7 DNA polymerase, E. coli DNA polymerase I F762Y and Taq DNA polymerase F667Y.
The primer, template and general reaction conditions SUBSTITUTE SHEET (RULE 26) : J[~O~
WO 96/12042 :. PCT/US95/1290 in this example are similar to those described in Example 8, with the following exceptions. The template is M13 mGPl-2 single-stranded DNA described in Example 3, and the primer is the 40 mer also described in Example 3.
Reaction conditions used are those that are optimum for the DNA polymerase being tested with regard to buffer, pH, salts, and temperature of the reaction. It is preferred that magnesium is the only metal ion present in the reaction mixture (i.e. the reactions are carried out in the absence of added manganese ions). The reactions are carried out with 50 M dGTP, dCTP and dTTP, and varying concentrations of dATP and [a-32P)ddATP. The concentrations of dATP and [a-32P]ddATP are chosen to maximize the amount of radioactivity in fragments approximately 100 nucleotides in length. All other aspects with respect to gel electrophoresis and analysis of the radioactive fragments are as described in Example 13.
F:xampl e 15. D rmi nat-i nn of uniformity of incorporati nn of dideoxvnucleotidPs using a singlP-stranded M13 DNA -fluorescently-labeled primer comolex and gel electrophoresis In this example, reactions are carried out as described in Example 11. The template is M13 mGPl-2 single-stranded DNA described in Example 3, and the primer is the 40 mer also described in Example 3.
Reaction conditions used are those that are optimum for the DNA polymerase being tested with regard to buffer, pH, salts, and temperature of the reaction. It is preferred that magnesium is the only metal ion present in the reaction mixture (i.e. the reactions are carried out in the absence of added manganese ions) . A concentration of DNA polymerase is chosen whereby most of the primers are extended in a 10 min reaction and are terminated by the incorporation of a dideoxynucleotide. The ratios of dNTPs to ddNTPs are adjusted for the specific DNA
polymerase being tested so that the average fragment size is approximately 100-200 nucleotides. ddCTP is the SUBSTITUTE SHEET (RULE 26) preferred ddNTP to use since fragments terminated with this dideoxynucleotide tend to have the most variability in intensities (see for example Tabor and Richardson 86 Proc. Natl. Acad. Sci. 4076, 1989). The intensities of up to the first 50 dideoxy-terminated fragments from the primer (approximately 200 nucleotides) are determined, and they are analyzed statistically as described in Example 13. The correlation index R2 is determined for the DNA polymerase being tested, and compared with that obtained with known DNA polymerases such as those described in Example 13. Alternatively, the heights of the first 50 bands are determined and =the ratio of heights of adjacent bands are calculated and used as a measure of variability; the maximum and mean of these ratios obtained from reactions carried out with the DNA
polymerase being tested are compared with the values obtained for reactions carried out using known DNA
polymerases such as those described in Example 13.
Example 16. Determination of uniformity of incorooration of fluorescent dideoxvnucleotides by gel electrophoresis In this example, reactions are carried out as described in Example 12. In order to determine the uniformity of incorporation of the DyeDeoxy Terminators for a specific DNA polymerase, the concentration of dNTPs and the specific DyeDeoxy Terminator are chosen to obtain fluorescently labeled fragments that average 100-200 nucleotides in length. The intensity of fluorescence is determined for fragments 10 to 40 from the primer (the first 10 fragments near the fluorescently labeled primer are ignored). The fragments are analyzed statistically as described in Example 13, and the average variability is defined by R2, the correlation index for the data fitted to an exponential decay curve. The values obtained for R2 are compared with those obtained using known DNA polymerases as described in Example 13. To determine if a specific mutation in a DNA polymerase is resulting in that DNA polymerase producing bands that have less variability, the R2 value obtained for the SUBSTITUTE SHEET (RULE 26) WO 96/12042 PCT/US95/129.0 mutant DNA polymerase is compared to that obtained for the unmodified DNA polymerase.
Example 17: DNA secTuence analysis using a DNA polymerase that efficiently incorporates dideoxynuclPor;dPG
DNA sequence analysis with a DNA polymerase of this invention is carried out using standard procedures, with the ratio of dNTPs to ddNTPs adjusted to obtain dideoxy-terminated fragments of an average length appropriate for separation by electrophoresis. For the mutant in the large fragment of E. coli DNA polymerase I, "Klenow .fragment F762Y", reactions are carried out essentially as with modified T7 DNA polymerase and described in Tabor and Richardson U. S. patent number 4,795,699, Tabor and Richardson 84, Proc. Natl. Acad. Sci. USA 4767, 1987, and SEQUENASE manual, "Step-By-Step Protocols For DNA
Sequencing With SEQUENASE" 3rd Edition, United States Biochemical Corporation. Since Klenow fragment F762Y
incorporates dideoxynucleotides approximately 5 times more efficiently than modified T7 DNA polymerase, the concentration of ddNTPs in the extension-termination mixtures should be reduced by a factor of five compared with the standard mixtures recommended for modified T7 DNA polymerase (Sequenase manual, supra).
DNA sequence analysis with a thermostable DNA
polymerase such as Taq DNA polymerase F667Y is as described by Innis et al. 85, Proc. Natl. Acad. Sci. USA
9436, 1988, with the following modification. Whereas Innis et al. recommend, dNTP/ddNTP ratios of 1:6 dGTP:ddGTP, 1:32 dATP:ddATP, 1:48 dTTP:ddTTP and 1:16 dCTP:ddCTP, these ratios must be adjusted to take into account the 3,000- 8,000 fold more efficient use of the 4 ddNTPs by the Taq DNA polymerase F667Y compared with the wild-type Taq DNA polymerase. Thus the extension-termination reactions with Taq DNA polymerase F667Y
should contain 100 M 4dNTPs and 0.1 - 5 M each of the four ddNTPs; the exact amount of each ddNTP should be adjusted based on the desired average fragment size for optimum determination of DNA sequence. All other aspects SUBSTIME SHEET (RULE 26) . ~
of the DNA sequencing reactions and denaturing gel electrophoresis are as described in Innis et al. (supra).
ExamialP 18: Cycle DNA seauence analysis usinq a +-Ho,-.,,os,-ahlP DNApolYmerase that efficiently incornorates di o ynucleoti des Cycle DNA sequencing with a thermostable DNA
polymerase such as Taq DNA polymerase F667Y is carried out as described in Carothers et al. 7 BioTechniques 494, 1989 except that: (1) The four deoxy/dideoxy NTP mixes contain 250 M of all four dNTPs and 0.1 - 10 M of either ddGTP, ddATP, ddTTP or ddCTP, the exact amount of each ddNTP adjusted empirically based on the desired average fragment size for optimum determination of DNA
sequence. (2) Taq DNA polymerase F667Y is used in place of Taq DNA polymerase, using the same number of units of DNA polymerase as recommended by Carothers et al.
(supra). (3) The reaction mixtures contain 10 ng of inorganic pyrophosphatase to inhibit pyrophosphorolysis that could otherwise increase the apparent discrimination by the DNA polymerase at specific sequences, reducing the uniformity of band intensities (Tabor and Richardson 265 j. Biol. Chem. 8322, 1990). Preferably this pyrophosphatase is purified from a thermophilic organism, for example from Thermus thermophilus .(Hohne et al. 47 Biomed. Biochim. Acta 941, 1988).
ExanwlP 19. = Automated cycle DNA seguencina usina modified Taa DNA polymerase and fluorescent primers Cycle DNA sequencing with a thermostable DNA
polymerase such as Taq DNA polymerase F667Y and the Applied Biosystems Dye Primers is a modification of the procedure described in the Applied Biosystems manual (Part Number 901482, Rev. B). The procedure is identical to that described in the manual with the following modifications: (1) The dNTP/ddNTP mixes must be modified to take into account the more efficient use of ddNTPs by Taq DNA polymerase F667Y compared to Taq DNA
polymerase. The new mixes that should be used in place SIIBSTITUTE SHEET (RULE 26) WO 96/12042 ~2U 1885 PCT/US95/120 of the ones listed in the Applied Biosystems manual are as follows:
dG/ddG mix = 100 M c'dGTP, dATP, dTTP and dCTP, and 5 0.05 M ddGTP
dA/ddT mix = 100 M c'dGTP, dATP, dTTP and dCTP, and 0.3 M ddATP
dT/ddT mix = 100 M c'dGTP, dATP, dTTP and dCTP, and 0.25 M ddTTP
10 dC/ddC mix = 100 M c'dGTP, dATP, dTTP and dCTP, and 0.15 M ddCTP
The concentrations of ddNTPs should be varied to optimize the intensity of fluorescence in fragments of 15 specific size ranges, depending on the application. For example, increasing the concentration of ddNTPs will increase the fluorescence of fragments of shorter length.
(2) Taq DNA polymerase F667Y is used in place of Taq DNA
polymerase. The same number of units of enzyme are used 20 in both cases, when assayed under standard DNA polymerase assay conditions. Alternatively, the same number of DNA
polymerase molecules can be used. (3) The reaction mixtures contain 10 ng of inorganic pyrophosphatase to inhibit pyrophosphorolysis that could otherwise increase 25 the apparent discrimination by the DNA polymerase at specific sequences, reducing the uniformity of band intensities (Tabor and Richardson 265 J. Biol. Chem.
8322, 1990). Preferably this pyrophosphatase is purified from a thermophilic organism, for example from Thermus 30 thermophilus (Hohne et al. 47 Biomed. Biochim. Acta 941, 1988). All other aspects of the procedures are identical to that described in the Applied Biosystems manual (supra).
35 Example 20. Automated cycle DNA seguencina usina mod; f; Prd Taa DNA y?olymerase and fluorescent dye terminators Cycle DNA sequencing with a thermostable DNA
polymerase such as Taq DNA polymerase F667Y and the SUBSTITUTE SMEET (RULE 26) Applied Biosystems DyeDeoxy Terminators is a modification of the procedure in the Applied Biosystems manual (Part Number 901497, Rev. E). The procedure is identical to that described in the manual with the following modifications: (1) The manual calls for the use of 1 1 of each of the 4 DyeDeoxy Terminators undiluted in each sequencing reaction (20 1 reaction). In this Example the Terminators should be diluted prior to addition to the sequencing reaction mixture, since they are incorporated more than several hundred fold more efficiently with Taq DNA polymerase F667Y than with Taq DNA polymerase. 1 1 of each of the following dilutions are added to each sequencing reaction in place of 1 1 of the undiluted Terminator solutions:
DyeDeoxy G Terminator, 1 to 500 in H20 DyeDeoxy A Terminator, 1 to 1,500 in HZO
DyeDeoxy T Terminator, 1 to 1,500 in H20 DyeDeoxy C Terminator, 1 to 1,000 in H20 These dilutions are approximations only; the precise dilution of each DyeDeoxy Terminator should be determined empirically depending on the number of bases of DNA
sequence to be determined from the primer. (2) Taq DNA
polymerase F667Y is used in place of Taq DNA polymerase.
The same number of units of enzyme are used in both cases, when assayed under standard DNA polymerase assay conditions. Alternatively, the same number of DNA
polymerase molecules can be used. (3) The reaction mixtures contain 10 ng of inorganic pyrophosphatase to inhibit pyrophosphorolysis that could otherwise increase the apparent discrimination by the DNA polymerase at specific sequences, reducing the uniformity of band intensities (Tabor and Richardson 265 J. Biol. Chem.
8322, 1990). Preferably this pyrophosphatase is purified from a thermophilic organism, for example from Thermus thermophilus (Hohne et al. 47 Biomed. Biochim. Acta 941, 1988).
SU85TiTi1TE SHEET (RULE 26) WO 96/12042 12018U J PCT/US95/129*
Since this procedure uses over 500 times less DyeDeoxy Terminators that previous procedures, there is much less of a problem with unincorporated DyeDeoxy Terminators after the reactions is complete. Thus it is not necessary to remove the unincorporated DyeDeoxy Terminators by.passing the sample over a spin column, as recommended in the Applied Biosystems manual (supra).
Rather, the sample can be precipitated with ethanol, taken up in 5 1 deionized formamide and 1 1 50 mM EDTA, pH 8.0, heated at 90 C for 2 min, and loaded onto the Applied Biosystems 373A DNA Sequencing System according to the instructions in the 373A User's Manual. It is possible that with a DNA polymerase that incorporates the DyeDeoxy Terminators efficiently (such as Taq DNA
polymerase F667Y) that concentration of the DNA
sequencing reaction by precipitation with ethanol is not necessary; the DNA cycle sequencing reactions, carried out using preferably a high concentration of primer and dNTPs (see below), can be terminated by the addition of an equal volume of deionized formamide, heated at 90 C
for 2 min, and loaded immediately onto the Applied Biosystems 373A DNA Sequencing System. This represents a.major savings in time to the researcher preparing the samples for DNA sequence determination.
The procedure described above uses a relatively low concentration of dNTPs at the outset (7.5 M dATP, dTTP
and dCTP and 37.5 M dITP). The concentration of dNTPs decrease during the cycle DNA sequencing reaction as the dNTPs are used. This concentration of dNTPs (less than 7.5 M) is less than optimal for maximum DNA polymerase activity for most DNA polymerases. This low concentration was necessary with previous protocols because the DNA polymerase that was used discriminated strongly against ddNTPs, requiring a high ratio of ddNTPs to dNTPs. The use of a DNA polymerase of this invention that discriminates much less against the DyeDeoxy Terminators now allows the use of higher concentrations of dNTPs. For example, in the protocol described above a 10-fold higher concentration of dNTPs and DyeDeoxy SU85TITUTE SHEET (RULE 26) Terminators could be used; i.e. 75 M dATP, dTTP and dCTP, 375 M dITP, and the following dilutions of each of the four DyeDeoxy Terminators: DyeDeoxy G Terminator, 1 to 50 in H20; DyeDeoxy A Terminator, 1 to 150 in 4 0;
DyeDeoxy T Terminator, 1 to 150 in H20; DyeDeoxy C
Terminator, 1 to 100 in H20. Thus in this example the DyeDeoxy Terminator concentrations are still at least lower than that in previous protocols by a factor of at least 50.
F_x_amnle 21. Exonuclease assay Lsing a f32P1 dd_AMP
terminated DNA substrate The 3'[32P]ddAMP-terminated DNA substrate is prepared by digesting native calf thymus DNA with HindiIl in a reaction mixture (50 1) that contains 10 g double-stranded calf thymus DNA, 40 imM Tris.HC1, pH 7.5, 10 mM
MgCl2, 5 mM dithiothreitol, 50 mM NaCl, 50 g/ml bovine serum albumin, and 10 units HindIII. After incubation at 37 C for 60 min, 5 1 of [a-32P]ddATP (Amersham PB10235, >5000 Ci/mmol) and 5 units Sec"uenase Version 2.0 (United States Biochemical Corporation, catalog.number 70775) are added, and the mixture is incubated at 20 C for 15 mir_.
The reaction mixture is extracted once with an equal volume phenol:chloroform:isoamyl alcohol (24:24:1), and fractionated on a 1 ml column of Sephadex'M G1100 (Pharmacia) in 20 mM Tris=HC1, pH 7.5, 2 mM EDTA, 100 mM
NaCl. The 3' 32P-labeled DNA, that elutes in the void volume, has a specific activity of approximately 500 cpm/ng of total DNA.
Reaction mixtures for exonuclease assays (90 1) contain 40 mM Tris-HCl, pH 7.5, 10 mM MgC1Z1 10 mM
dithiothreitol, 50 mM KC1 and 1 nmol 3' 32P-labeled DNA.
The reaction mixtures also contain 10 ng of yeast inorganic pyrophosphatase to remove trace amounts of pyrophosphate and thus prevent pyrophosphorolysis (Tabor and Richardson 265 j. Biol. Chem. 8322, 1990). This mixture is preincubated at 20 C for 1 min, then 10 1 of the appropriate enzyme dillstion is added. After incubation at 37 C for the indicated times, the WO 96/12042 22U1g8'J-~ PCT/US95/1290 reactions are stopped by the addition of 30 1 of bovine serum albumin (10 mg/ml) and 30 1 of trichloracetic acid (100% w/v). The precipitated DNA is incubated at 0 C
for 15 min, and then pelleted by centrifugation at 12,000 g for 30 min. The acid-soluble radioactivity is measured in 100 1 of the supernatant. One unit of 3'[32P] ddAMP-DNA exonuclease activity catalyzes the acid solubilization of one pmol of [32P] ddAMP in 15 min.
Other embodiments are within the following claims.
~$Z~~~ SKET 1~U 2M
(1) GENERAL INFORMATION:
(i) APPLICANT:
(A) NAME President & Fellows of Harvard College (B) STREET 17 Quincy Street (C) CITY Cambridge (D) STATE Massachusetts (E) COUNTRY United States of America (F) POSTAL CODE 02138 (ii) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE Richard J. Warburg Lyon & Lyon (B) STREET 633 West Fifth Street Suite 4700 (C) CITY Los Angeles, CA 90071 (D) COUNTRY USA
(E) POSTAL CODE 90071 (A) TELEPHONE: (213) 955-0440 (B) TELEFAX: (213) 489-1600 (C) TELEX: 67-3510 (iii) NUMBER OF SEQUENCES: 24 (iv) TITLE OF INVENTION: DNA POLYMERASES HAVING
MODIFIED NUCLEOTIDE BINDING
SITE FOR DNA SEQUENCING
(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: 3.5" Diskette, 1.44 Mb storage (B) COMPUTER: IBM PS/2 Model 50Z or 55SX
(C) OPERATING SYSTEM: IBM P.C. DOS (Version 3.30) (D) SOFTWARE: WordPerfect (Version 5.0) (vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: NOT YET ASSIGNED
(B) FILING DATE:
SWSTITUU SHEU (RULE 2S) 220 188a (vii) PRIOR APPLICATION DATA:
Prior applications total, including application described below: TWO
(A) APPLICATION NUMBER: 08/324,437 (B) FILING DATE: 17-OCT-1994 (A) APPLICATION NUMBER: US NOT YET ASSIGNED
(B) FILING DATE: 10-NOV-1994 '(2) INFORMATION FOR SEQ ID NO: 1:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 1:
Arg Arg Ser Ala Lys Ala Ile Asn Phe Gly Leu Ile Tyr Gly (2) INFORMATION FOR SEQ ID NO: 2:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 2:
Arg Arg Ala Ala Lys Thr Ile Asn Phe Gly Val Leu Tyr Gly (2) INFORMATION FOR SEQ ID NO: 3:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 "SME~~~U 2M
= WO 96/12042 PCTIUS95/12928 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii),SEQUENCE DESCRIPTION : SEQ ID NO: 3:
Arg Asp Asn Ala Lys Thr Phe Ile Tyr Gly Phe Leu Tyr Gly SUBSTiTtM SHEET {RULE 26J
~~2~ 1 ~~~, (2) INFORMATION FOR SEQ ID NO: 4:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 4:
Arg Asp Asn Ala Lys Thr Phe Ile Tyr Gly Phe Leu Tyr Gly (2) INFORMATION FOR SEQ ID NO: 5:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 5:
Arg Arg Ser Ala Lys Ala Ile Asn Phe Gly Leu Ile Tyr Gly (2) INFORMATION FOR SEQ ID NO: 6:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 6:
Arg Arg Ser Ala Lys Thr Phe Ile Tyr Gly Phe Leu Tyr Gly SU8ST1TM SHM (RUtf 261 (2) INFORMATION FOR SEQ ID NO: 7:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 7:
Arg Asp Asn Ala Lys Ala Ile Asn Phe Gly Phe Leu Tyr Gly (2) INFORMATION FOR SEQ ID NO: 8:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 8:
Arg Asp Asn Ala Lys Ala Ile Ile Tyr Gly Phe Leu Tyr Gly (2) INFORMATION FOR SEQ ID NO: 9:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 9:
Arg Asp Asn Ala Lys Thr Phe Asn Phe Gly Phe Leu Tyr Gly (2) INFORMATION FOR SEQ ID NO: 10:
(i) SEQUENCE CHARACTERISTICS:
SUBSfITUTE SHW (MU 2M
WO 96/12042 Z20 188" PCT/US95/129 (A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 10:
Arg Asp Asn Ala Lys Thr Phe Asn Tyr Gly Phe Leu Tyr Gly (2) INFORMATION FOR SEQ ID NO: 11:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 11:
Arg Asp Asn Ala Lys Thr Phe Ile Phe Gly Phe Leu Tyr Gly (2) INFORMATION FOR SEQ ID NO: 12:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 12:
Arg Arg Ser Ala Lys Ala Ile Asn Phe Gly Leu Ile Tyr Gly (2) INFORMATION FOR SEQ ID NO: 13:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear SW1ME SMW. (AJ1.f 26) (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 13:
Arg Asp Asn Ala Lys Thr Phe Ile Tyr Gly Phe Leu Tyr Gly (2) INFORMATION FOR SEQ ID NO: 14:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 14:
Arg Arg Ser Ala Lys Thr Phe Ile Tyr Gly Leu Ile Tyr Gly (2) INFORMATION FOR SEQ ID NO: 15:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 15:
Arg Arg Ser Ala Lys Thr Phe Asn Phe Gly Leu Ile Tyr Gly (2) INFORMATION FOR SEQ ID NO: 16:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 16:
StJBSTITUTE SHEET (RULE 26) Arg Arg Ser Ala Lys Ala Ile Ile Tyr Gly Leu Ile Tyr Gly (2) INFORMATION FOR SEQ ID NO: 17:
U) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 17:
Arg Arg Ser Ala Lys Ala Ile Ile Phe Gly Leu Ile Tyr Gly (2) INFORMATION FOR SEQ ID NO: 18:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 18:
Arg Arg Ser Ala Lys Ala Ile Asn Tyr Gly Leu Ile Tyr Gly (2) INFORMATION FOR SEQ ID NO: 19:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 19:
Arg Arg Ala Ala Lys Thr Ile Asn Phe Gly Val Leu Tyr Gly (2) INFORMATION FOR SEQ ID NO: 20:
(i) SEQUENCE CHARACTERISTICS:
SUBSiTME SHEET (RULE 2M
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 20:
Arg Asp Asn Ala Lys Thr Ile Asn Phe Gly Val Leu Tyr Gly (2) INFORMATION FOR SEQ ID NO: 21:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 21:
Arg Arg Ala Ala Lys Thr Phe Ile Tyr Gly Phe Leu Tyr Gly (2) INFORMATION FOR SEQ ID NO: 22:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 22:
Arg Arg Ala Ala Lys Thr Ile Ile Tyr Gly Val Leu Tyr Gly (2) INFORMATION FOR SEQ ID NO: 23:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear "nTM SHEET (RU 2M
(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 23:
Arg Arg Ala Ala Lys Thr Ile Ile Phe Gly Val Leu Tyr Gly (2) INFORMATION FOR SEQ ID NO: 24:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) SEQUENCE DESCRIPTION : SEQ ID NO: 24:
Arg Arg Ala Ala Lys Thr Ile Asn Tyr Gly Val Leu Tyr Gly SUBS'17TUi'E SHEET (Fi)l.E 26j
Claims (50)
1. Modified nucleic acid encoding a modified DNA
polymerase wherein said modified nucleic acid is modified to encode a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site to increase the ability of said modified DNA polymerase to incorporate a dideoxynucleotide relative to the corresponding deoxynucleotide compared to the ability of the corresponding naturally-occurring unmodified DNA polymerase by at least 20-fold.
polymerase wherein said modified nucleic acid is modified to encode a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site to increase the ability of said modified DNA polymerase to incorporate a dideoxynucleotide relative to the corresponding deoxynucleotide compared to the ability of the corresponding naturally-occurring unmodified DNA polymerase by at least 20-fold.
2. The modified nucleic acid of claim 1 wherein said modified DNA polymerase has sufficient DNA polymerase activity for use in DNA sequencing when combined with any factor necessary for said DNA polymerase activity.
3. The modified nucleic acid of claim 1 or claim 2 wherein said modified DNA polymerase has sufficiently low exonuclease activity to allow said polymerase to be used in DNA sequencing.
4. The modified nucleic acid of any one of claims 1 to 3 wherein said modified DNA polymerase has a dideoxynucleotide binding site comprising a tyrosine residue.
5. The modified nucleic acid of claim 4 wherein said polymerase is a T7-type DNA polymerase is selected from the group consisting of T7, T3, 0I, 0II, H, W31, gh-1, Y, A1122, and SP6.
6. A modified DNA polymerase encoded by the modified nucleic acid of any one of claims 1 to 5.
7. The modified nucleic acid of any one of claims 1 to 4 wherein said modified DNA polymerase is a thermostable enzyme.
8. The modified nucleic acid of claim 7 wherein said thermostable enzyme is selected from the group consisting of the DNA polymerase encoded by Thermus aquaticus, Thermus thermophilis, Thermus flavus, Bacillus sterothermophilus, Thermococcus litoralis, and Pyrococcus furiosis.
9. The modified nucleic acid of claim 1, 2, 3, 4, 5, 7 or 8 wherein said ability of said polymerase to incorporate a dideoxynucleotide relative to the corresponding deoxynucleotide is increased at least 25-fold compared to the corresponding naturally-occurring DNA polymerase compared with the corresponding deoxynucleotide.
10. The modified nucleic acid of claim 9 wherein said ability is increased at least 50-fold.
11. The modified nucleic acid of claim 9 wherein said ability is increased at least 100-fold.
12. The modified nucleic acid of claim 9 wherein said ability is increased at least 500-fold.
13. A method for production of a modified DNA polymerase having an increased ability to incorporate a dideoxynucleotide relative to a corresponding deoxynucleotide compared to ability of a corresponding naturally-occurring unmodified DNA polymerase comprising steps of:
providing a nucleic acid molecule encoding a DNA polymerase and mutagenizing said nucleic acid molecule to incorporate one or more base changes in nucleotide base sequence at a region that encodes its dNMP binding site to encode a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 at an amino acid position corresponding to E. coli DNA polymerase residue 762 in the dNMP binding site to alter ability of said polymerase encoded by said nucleic acid to incorporate a dideoxynucleotide by at least 20-fold.
providing a nucleic acid molecule encoding a DNA polymerase and mutagenizing said nucleic acid molecule to incorporate one or more base changes in nucleotide base sequence at a region that encodes its dNMP binding site to encode a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 at an amino acid position corresponding to E. coli DNA polymerase residue 762 in the dNMP binding site to alter ability of said polymerase encoded by said nucleic acid to incorporate a dideoxynucleotide by at least 20-fold.
14. A method for determining a nucleotide base sequence of a DNA molecule comprising the steps of:
incubating a DNA molecule annealed with a primer molecule able to hybridize to said DNA molecule in a vessel containing at least one deoxynucleoside triphosphate, a DNA polymerase modified by having a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site compared to a naturally-occurring unmodified DNA polymerase to have at least a 20-fold increased ability to incorporate a dideoxynucleotide relative to a corresponding deoxynucleotide compared to said naturally-occurring unmodified polymerase, said polymerase having sufficient DNA polymerase activity and less than 1000 units exonuclease activity per mg DNA polymerases, and at least one DNA synthesis terminating agent which terminates DNA synthesis at a specific nucleotide base, in an incubating reaction; and separating the DNA products of the incubating reaction according to size whereby at least a part of the nucleotide base sequence of said DNA molecule can be determined.
incubating a DNA molecule annealed with a primer molecule able to hybridize to said DNA molecule in a vessel containing at least one deoxynucleoside triphosphate, a DNA polymerase modified by having a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site compared to a naturally-occurring unmodified DNA polymerase to have at least a 20-fold increased ability to incorporate a dideoxynucleotide relative to a corresponding deoxynucleotide compared to said naturally-occurring unmodified polymerase, said polymerase having sufficient DNA polymerase activity and less than 1000 units exonuclease activity per mg DNA polymerases, and at least one DNA synthesis terminating agent which terminates DNA synthesis at a specific nucleotide base, in an incubating reaction; and separating the DNA products of the incubating reaction according to size whereby at least a part of the nucleotide base sequence of said DNA molecule can be determined.
15. The method of claim 14 wherein said DNA polymerase is a thermostable DNA polymerase and said sequencing is performed at a temperature above 50°C.
16. The method of claim 15 wherein said DNA polymerase is a thermostable DNA polymerase and said sequencing is performed at a temperature above 60°C.
17. The method of claim 16 wherein said thermostable enzyme is selected from the group consisting of the DNA polymerase encoded by Thermus aquaticus, Thermus thermophilis, Thermus flavus, Bacillus sterothermophilus, Thermococcus litoralis, and Pyrococcus furiosis.
18. The method of any one of claims 14 to 16 wherein said DNA polymerase distinguishes between deoxynucleotides and dideoxynucleotides by less than a factor of 100.
19. The method of any one of claims 14 to 16 wherein said DNA polymerase distinguishes between deoxynucleotides and dideoxynucleotides by less than a factor of 50.
20. The method of any one of claims 14 to 19 wherein said DNA polymerase has less than 1000 units of exonuclease activity per mg of polymerase.
21. A kit for DNA sequencing comprising a modified DNA polymerase modified to include a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site of the DNA polymerase to increase the ability of said DNA polymerase to incorporate a dideoxynucleotide compared to ability of a corresponding naturally-occurring unmodified DNA polymerase by at least 20-fold relative to a corresponding deoxynucleotide; and a reagent necessary for said sequencing selected from the group consisting of dITP, a chain terminating agent, deaza-GTP and a manganese-containing compound.
22. A method for sequencing a strand of DNA comprising the steps of:
providing said strand hybridized with a primer able to hybridize to said strand, to form an hybridized mixture, incubating said hybridized mixture with one or more deoxyribonucleoside triphosphates, a modified DNA polymerase modified by having a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site to increase ability of said polymerase to incorporate a dideoxynucleotide at least 20-fold relative to a corresponding deoxynucleotide compared to ability of a corresponding naturally-occurring unmodified DNA polymerase, and a first chain terminating agent, wherein said DNA polymerase causes said primer to be elongated to form a first series of first DNA products differing in the length of the elongated primer, each said first DNA product having a said chain terminating agent at its elongated end, the number of molecules of each said first DNA products being approximately the same for all DNA products differing in length by no more than 20 bases, and providing a second chain terminating agent in said hybridized mixture at a concentration different from said first chain terminating agent, wherein said DNA polymerase causes production of a second series of second DNA products differing in the length of the elongated primer, each said second DNA product having said second chain terminating agent at its elongated end, the number of molecules of each said second DNA products being approximately the same for all second DNA products differing in length from each other by from 1 to 20 bases, and being distinctly different from the number of molecules of all said first DNA products having a length differing by no more than 20 bases from that of said second DNA products.
providing said strand hybridized with a primer able to hybridize to said strand, to form an hybridized mixture, incubating said hybridized mixture with one or more deoxyribonucleoside triphosphates, a modified DNA polymerase modified by having a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site to increase ability of said polymerase to incorporate a dideoxynucleotide at least 20-fold relative to a corresponding deoxynucleotide compared to ability of a corresponding naturally-occurring unmodified DNA polymerase, and a first chain terminating agent, wherein said DNA polymerase causes said primer to be elongated to form a first series of first DNA products differing in the length of the elongated primer, each said first DNA product having a said chain terminating agent at its elongated end, the number of molecules of each said first DNA products being approximately the same for all DNA products differing in length by no more than 20 bases, and providing a second chain terminating agent in said hybridized mixture at a concentration different from said first chain terminating agent, wherein said DNA polymerase causes production of a second series of second DNA products differing in the length of the elongated primer, each said second DNA product having said second chain terminating agent at its elongated end, the number of molecules of each said second DNA products being approximately the same for all second DNA products differing in length from each other by from 1 to 20 bases, and being distinctly different from the number of molecules of all said first DNA products having a length differing by no more than 20 bases from that of said second DNA products.
23. A method for sequencing a nucleic acid comprising:
combining an oligonucleotide primer, a nucleic acid to be sequenced, between one and four deoxyribonucleoside triphosphates, a modified DNA polymerase modified by having a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site to increase ability of said polymerase to incorporate a dideoxynucleotide at least 20-fold relative to a corresponding deoxynucleotide compared to ability of a corresponding naturally-occurring unmodified DNA polymerase, and at least two chain terminating agents in different amounts, under conditions favoring extension of said oligonucleotide primer to form nucleic acid fragments complementary to the nucleic acid to be sequenced;
separating the nucleic acid fragments by size; and determining nucleic acid sequence wherein said agents are differentiated from each other by intensity of a label in the primer extension products.
combining an oligonucleotide primer, a nucleic acid to be sequenced, between one and four deoxyribonucleoside triphosphates, a modified DNA polymerase modified by having a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site to increase ability of said polymerase to incorporate a dideoxynucleotide at least 20-fold relative to a corresponding deoxynucleotide compared to ability of a corresponding naturally-occurring unmodified DNA polymerase, and at least two chain terminating agents in different amounts, under conditions favoring extension of said oligonucleotide primer to form nucleic acid fragments complementary to the nucleic acid to be sequenced;
separating the nucleic acid fragments by size; and determining nucleic acid sequence wherein said agents are differentiated from each other by intensity of a label in the primer extension products.
24. A method for in vitro mutagenesis of a cloned DNA fragment comprising providing said cloned fragment and a modified DNA polymerase modified by having a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site to increase ability of said polymerase to incorporate a dideoxynucleotide at least 20-fold relative to a corresponding deoxynucleotide compared to ability of a corresponding naturally-occurring unmodified DNA polymerase, contacting said cloned fragment with said polymerase under conditions for synthesizing a DNA strand from said fragment, wherein said conditions cause formation of said DNA strand by incorporation of individual contiguous bases, which are different or the same able to base-pair with said fragment and incorporation of a nucleotide base unable to base pair with said fragment.
25. A method for in vitro mutagenesis of a template DNA fragment comprising providing a primer and said template, said primer having contiguous bases able to base-pair with contiguous bases of said template except at least one base which is unable to base-pair with said template, and extending said primer with a modified DNA polymerase modified by having a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site to increase ability of said polymerase at least 20-fold relative to a corresponding deoxynucleotide to incorporate a dideoxynucleotide compared to ability of a corresponding naturally-occurring unmodified DNA polymerase.
26. A method for producing blunt-ended double-stranded DNA from a linear DNA molecule having a 5' end comprising a single-stranded region, wherein 3' end of said molecule is double stranded and has no 3' protruding termini, comprising incubating said DNA molecule with a modified DNA polymerase modified by having a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site to increase ability of said polymerase at least 20-fold to incorporate a dideoxynucleotide relative to a corresponding deoxynucleotide compared to ability of a corresponding naturally-occurring unmodified DNA polymerase, wherein said polymerase acts on said single-stranded region to produce a blunt-ended double stranded DNA molecule.
27. A method for labeling a 3' end of a DNA fragment comprising incubating said DNA fragment with a modified DNA polymerase modified by having a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site to increase ability of said polymerase at least 20-fold to incorporate a dideoxynucleotide relative to a corresponding deoxynucleotide compared to ability of a corresponding naturally-occurring unmodified DNA polymerase, and a labelled deoxynucleotide species under conditions in which said 3' end of said DNA fragment is extended by said polymerase and thereby labelled by addition of said labelled deoxynucleotide to said DNA fragment.
28. A method of amplification of a DNA sequence comprising annealing a first and second primer to opposite strands of a double-stranded DNA sequence to form an annealed mixture and incubating the annealed mixture with a modified DNA polymerase modified by having a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site to increase ability at least 20-fold of said polymerase to incorporate a dideoxynucleotide relative to a corresponding deoxynucleotide compared to ability of a corresponding naturally-occurring unmodified DNA polymerase, wherein said first and second primers anneal to opposite strands of said DNA sequence with their 3' ends directed towards each other after annealing, and with the DNA sequence to be amplified located between the first and second annealed primers.
29. A Thermus aquaticus DNA polymerase comprising a tyrosine at residue 667, wherein said Thermus aquaticus DNA
polymerase has an increased ability to incorporate a dideoxynucleotide relative to the corresponding deoxynucleotide compared to the ability of the corresponding naturally-occurring unmodified Thermus aquaticus DNA
polymerase by at least 20-fold.
polymerase has an increased ability to incorporate a dideoxynucleotide relative to the corresponding deoxynucleotide compared to the ability of the corresponding naturally-occurring unmodified Thermus aquaticus DNA
polymerase by at least 20-fold.
30. An E. coli DNA polymerase I comprising a tyrosine at residue 762, wherein said E. coli DNA polymerase I has an increased ability to incorporate a dideoxynucleotide relative to the corresponding deoxynucleotide compared to the ability of the corresponding naturally-occurring unmodified E. coli DNA polymerase I by at least 20-fold.
31. A purified DNA polymerase comprising a tyrosine residue at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site provided that said polymerase is not a T7-type DNA polymerase or a mitochondrial DNA polymerase.
32. The DNA polymerase of claim 31 wherein its dNMP binding site comprises the amino acid sequence K N1 N2 N3 N4 N5 N6 N7 N Y G/Q, wherein each N is independently any amino acid wherein the K corresponds to amino acid residue 758 of E. coli DNA Polymerase I and said position N4 is tyrosine.
33. A modified DNA polymerase comprising the sequence K N1 N2 N3 N4 N5 N6 Y G, at its dNMP binding site wherein each N is independently any amino acid, and wherein one said N is mutated to have a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site to provide a polymerase which does not discriminate against ddNMP incorporation more than 50 fold relative to the corresponding dNMP
incorporation.
incorporation.
34. Nucleic acid encoding any one of the DNA
polymerases of claims 29 to 33.
polymerases of claims 29 to 33.
35. A recombinant DNA polymerase which in the presence of magnesium as the only added divalent cation has an average processivity of less than 100 but more than 0 and discriminates less than 100 times but more than 0 time against incorporation of a ddNMP relative to a corresponding deoxynucleotide, wherein said DNA polymerase is not reverse transcriptase and wherein said DNA polymerase includes a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site.
36. A recombinant DNA polymerase which in the presence of magnesium as the only divalent cation has an average processivity of less than 50 but more than 0 and discriminates less than 50 times but more than 0 time against incorporation of a ddNMP relative to a corresponding deoxynucleotide and wherein said DNA polymerase includes a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site.
37. A recombinant DNA polymerase which in the presence of magnesium as the only divalent cation has an average processivity of less than 50 but more than 0 and discriminates less than 5 times but more than 0 time against incorporation of a ddNMP relative to a corresponding deoxynucleotide and wherein said DNA polymerase includes a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site.
38. A recombinant Thermophilic DNA polymerase that discriminates against a ddNMP relative to a corresponding deoxynucleotide by less than a factor of 100 but more than a factor of 0 and wherein said DNA polymerase includes a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site.
39. A recombinant Thermophilic DNA polymerase that discriminates in the presence of magnesium as the only divalent cation against a ddNMP relative to a corresponding deoxynucleotide by less than a factor of 100 but more than a factor of 0 and wherein said DNA polymerase includes a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site.
40. The DNA polymerase of claim 39 wherein said polymerase has an average processivity less than 100 but more than 0.
41. The polymerase of claim 40 wherein said polymerase cycles from one primer-template to another more than once per five seconds.
42. A nucleic acid encoding the polymerase of any one of claims 35 to 41.
43. A method for cycle sequencing using a DNA polymerase of any one of claims 29 to 33 and 35 to 41 wherein said method comprises mixing said DNA polymerase with a DNA molecule to be sequenced, a primer, dNTPs, and at least one chain terminating agent to form a mixture and cycling temperature of the mixture to allow alternate extension of the primer and denaturation of the primer and DNA molecule.
44. A purified cellular DNA polymerase comprising a tyrosine residue in place of the naturally occurring amino acid residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site which causes said polymerase not to discriminate against a ddNMP relative to a corresponding deoxynucleotide by more than 50 fold.
45. The DNA polymerase of claim 33 mutated at any of N1 to N6 to give a polymerase that incorporates ddNMPs at least 20 times more efficiently than the non-mutated polymerase.
46. A method for cycle sequencing, comprising the step of providing an excess or equal amount of all four dNTPs than each of the four ddNTPs in a cycle sequencing reaction and performing said cycle sequencing reaction with the modified DNA polymerase of claim 6.
47. Method for cycle sequencing comprising the step of providing less than 10 times the amount of all four fluorescently labelled dideoxynucleotide than the corresponding deoxynucleotide in a cycle sequencing reaction and performing said cycle sequencing reaction with the modified DNA polymerase of claim 6.
48. A modified nucleic acid encoding a modified DNA
polymerase wherein said modified nucleic acid is modified to encode a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site to increase the ability of said modified DNA polymerase to incorporate a dideoxynucleotide relative to the corresponding deoxynucleotide compared to the ability of the corresponding naturally-occurring unmodified DNA polymerase.
polymerase wherein said modified nucleic acid is modified to encode a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site to increase the ability of said modified DNA polymerase to incorporate a dideoxynucleotide relative to the corresponding deoxynucleotide compared to the ability of the corresponding naturally-occurring unmodified DNA polymerase.
49. A modified DNA polymerase encoded by the modified nucleic acid of claim 48.
50. A modified DNA polymerase having a tyrosine residue at an amino acid position corresponding to T7 DNA polymerase residue 526 or at an amino acid position corresponding to E. coli DNA polymerase residue 762 in its dNMP binding site, wherein said modified DNA polymerase is modified to increase the ability of said polymerase to incorporate a dideoxynucleotide relative to the corresponding deoxynucleotide compared to the ability of the corresponding naturally-occurring DNA polymerase.
Applications Claiming Priority (5)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US32443794A | 1994-10-17 | 1994-10-17 | |
| US08/324,437 | 1994-10-17 | ||
| US08/337,615 US5614365A (en) | 1994-10-17 | 1994-11-10 | DNA polymerase having modified nucleotide binding site for DNA sequencing |
| US08/337,615 | 1994-11-10 | ||
| PCT/US1995/012928 WO1996012042A2 (en) | 1994-10-17 | 1995-10-16 | Dna polymerases having modified nucleotide binding site for dna sequencing |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CA2201885A1 CA2201885A1 (en) | 1996-04-25 |
| CA2201885C true CA2201885C (en) | 2009-12-29 |
Family
ID=26984456
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CA002201885A Expired - Lifetime CA2201885C (en) | 1994-10-17 | 1995-10-16 | Dna polymerases having modified nucleotide binding site for dna sequencing |
Country Status (12)
| Country | Link |
|---|---|
| EP (1) | EP0655506B1 (en) |
| CN (1) | CN100335621C (en) |
| AT (1) | ATE143057T1 (en) |
| AU (1) | AU4193396A (en) |
| CA (1) | CA2201885C (en) |
| DE (2) | DE655506T1 (en) |
| DK (1) | DK0655506T3 (en) |
| ES (1) | ES2072238T3 (en) |
| FI (1) | FI120354B (en) |
| GR (1) | GR950300040T1 (en) |
| HU (1) | HU220839B1 (en) |
| WO (1) | WO1996012042A2 (en) |
Families Citing this family (55)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5912155A (en) * | 1994-09-30 | 1999-06-15 | Life Technologies, Inc. | Cloned DNA polymerases from Thermotoga neapolitana |
| US6015668A (en) * | 1994-09-30 | 2000-01-18 | Life Technologies, Inc. | Cloned DNA polymerases from thermotoga and mutants thereof |
| US5885813A (en) * | 1995-05-31 | 1999-03-23 | Amersham Life Science, Inc. | Thermostable DNA polymerases |
| US6001645A (en) * | 1995-06-07 | 1999-12-14 | Promega Corporation | Thermophilic DNA polymerases from thermotoga neapolitana |
| US6077664A (en) * | 1995-06-07 | 2000-06-20 | Promega Corporation | Thermophilic DNA polymerases from Thermotoga neapolitana |
| US5773258A (en) * | 1995-08-25 | 1998-06-30 | Roche Molecular Systems, Inc. | Nucleic acid amplification using a reversibly inactivated thermostable enzyme |
| EP0871775A4 (en) * | 1995-09-08 | 2002-07-31 | Life Technologies Inc | Cloned dna polymerases from thermotoga and mutants thereof |
| GB9524974D0 (en) * | 1995-12-06 | 1996-02-07 | Amersham Int Plc | An improved method for dye-terminator (cycle) sequencing |
| ATE203056T1 (en) * | 1995-12-15 | 2001-07-15 | Amersham Life Science Inc | THERMOSTABLE DNA POLYMERASE FROM THERMOANAEROBACTER THERMOHYDROSULFURICUS AND MUTATED ENZYMES DERIVED THEREOF WITH REMOVED EXONUCLEASE ACTIVITY |
| US5830657A (en) * | 1996-05-01 | 1998-11-03 | Visible Genetics Inc. | Method for single-tube sequencing of nucleic acid polymers |
| US5888736A (en) * | 1995-12-22 | 1999-03-30 | Visible Genetics, Inc. | Method, compositions and kit for detection and identification of microorganisms |
| US5789168A (en) * | 1996-05-01 | 1998-08-04 | Visible Genetics Inc. | Method for amplification and sequencing of nucleic acid polymers |
| DE19612684A1 (en) * | 1996-03-29 | 1997-10-02 | Gsf Forschungszentrum Umwelt | New use of extremely thermophilic DNA polymerases |
| US6214555B1 (en) | 1996-05-01 | 2001-04-10 | Visible Genetics Inc. | Method compositions and kit for detection |
| US6413718B1 (en) | 1996-05-01 | 2002-07-02 | Visible Genetics Inc. | Method for sequencing of nucleic acid polymers |
| US6083699A (en) * | 1996-05-01 | 2000-07-04 | Visible Genetics Inc. | Method for bi-directional sequencing of nucleic acid polymers |
| CA2252487A1 (en) * | 1996-05-01 | 1997-11-06 | Visible Genetics Inc. | Method, compositions and kit for detection and identification of microorganisms |
| US5827716A (en) * | 1996-07-30 | 1998-10-27 | Amersham Life Science, Inc. | Modified Pol-II type DNA polymerases |
| CZ293215B6 (en) * | 1996-08-06 | 2004-03-17 | F. Hoffmann-La Roche Ag | Enzyme of thermally stable DNA polymerase, process of its preparation and a pharmaceutical composition and a kit containing thereof |
| WO1998023733A2 (en) | 1996-11-27 | 1998-06-04 | University Of Washington | Thermostable polymerases having altered fidelity |
| US6306588B1 (en) | 1997-02-07 | 2001-10-23 | Invitrogen Corporation | Polymerases for analyzing or typing polymorphic nucleic acid fragments and uses thereof |
| DE69841023D1 (en) * | 1997-03-12 | 2009-09-10 | Applied Biosystems Llc | DNA polymerases with improved ability to incorporate labeled nucleotides |
| JP3172710B2 (en) | 1997-07-07 | 2001-06-04 | 理化学研究所 | RNA polymerase |
| JP3318579B2 (en) | 1997-07-07 | 2002-08-26 | 理化学研究所 | DNA sequencing method |
| US6228628B1 (en) * | 1997-07-09 | 2001-05-08 | Roche Molecular Systems | Mutant chimeric DNA polymerase |
| US5882904A (en) * | 1997-08-04 | 1999-03-16 | Amersham Pharmacia Biotech Inc. | Thermococcus barossii DNA polymerase mutants |
| CA2243985C (en) | 1997-09-11 | 2004-08-10 | F. Hoffmann-La Roche Ag | Thermostable dna polymerases incorporating nucleoside triphosphates labeled with fluorescein family dyes |
| WO1999037810A1 (en) * | 1998-01-23 | 1999-07-29 | Amersham Pharmacia Biotech, Inc. | A method, reagent solution and kits for dna sequencing |
| CA2266755A1 (en) * | 1998-04-24 | 1999-10-24 | James M. Dunn | Method for sequencing of nucleic acid polymers |
| US6096875A (en) | 1998-05-29 | 2000-08-01 | The Perlein-Elmer Corporation | Nucleotide compounds including a rigid linker |
| GB9907813D0 (en) * | 1999-04-06 | 1999-06-02 | Medical Biosystems Ltd | Synthesis |
| ATE299940T1 (en) * | 1999-08-21 | 2005-08-15 | Amersham Biosciences Corp | TAQ DNA POLYMERASE WITH AN AMINO ACID SUBSTITUTION IN POSITION E681 AND CORRESPONDING HOMOLOGUES WITH IMPROVED SALT TOLERANCE, USEFUL FOR SEQUENCING DIFFICULT TEMPLATING SEQUENCES USING CHARGED OR UNCHARGED TERMINATORS |
| WO2001032887A1 (en) * | 1999-10-29 | 2001-05-10 | Stratagene | Compositions and methods utilizing dna polymerases |
| US20040009486A1 (en) | 1999-10-29 | 2004-01-15 | Sorge Joseph A. | Compositions and methods utilizing DNA polymerases |
| WO2001038546A1 (en) * | 1999-11-23 | 2001-05-31 | Amersham Biosciences Corp | Improving dideoxynucleotide-triphosphate utilization by the hyper-thermophilic dna polymerase from the archaeon pyrococcus furiosus |
| US6329178B1 (en) | 2000-01-14 | 2001-12-11 | University Of Washington | DNA polymerase mutant having one or more mutations in the active site |
| US7179590B2 (en) | 2000-04-18 | 2007-02-20 | Roche Molecular Systems, Inc | High temperature reverse transcription using mutant DNA polymerases |
| AU2007201277B2 (en) * | 2000-05-24 | 2011-10-13 | Third Wave Technologies, Inc. | Detection of RNA |
| US7198924B2 (en) | 2000-12-11 | 2007-04-03 | Invitrogen Corporation | Methods and compositions for synthesis of nucleic acid molecules using multiple recognition sites |
| US20050042629A1 (en) | 2002-09-13 | 2005-02-24 | Applera Corporation | Thermus scotoductus nucleic acid polymerases |
| US7222059B2 (en) | 2001-11-15 | 2007-05-22 | Siemens Medical Solutions Diagnostics | Electrophoretic trace simulator |
| WO2003048309A2 (en) | 2001-11-30 | 2003-06-12 | Applera Corporation | Thermus thermophilus nucleic acid polymerases |
| WO2003048308A2 (en) | 2001-11-30 | 2003-06-12 | Applera Corporation | Thermus brockianus nucleic acid polymerases |
| WO2003048310A2 (en) * | 2001-11-30 | 2003-06-12 | Applera Corporation | Thermus oshimai nucleic acid polymerases |
| JP2005528114A (en) * | 2002-05-31 | 2005-09-22 | ユニバーシティー・オブ・ワシントン | Error-prone DNA polymerase I mutants and methods for targeted random mutagenesis using error-prone DNA polymerase I mutants in continuous culture - Patents.com |
| US7309570B2 (en) | 2002-10-21 | 2007-12-18 | University Of Texas | Methods for using double-mutant RNA polymerases with reduced discrimination between non-canonical and canonical nucleoside triphosphates |
| ATE469984T1 (en) | 2003-12-01 | 2010-06-15 | Life Technologies Corp | NUCLEIC ACID MOLECULES CONTAINING RECOMBINATION SITE AND METHOD FOR USE THEREOF |
| KR20080082662A (en) | 2005-12-23 | 2008-09-11 | 지멘스 헬쓰케어 다이아그노스틱스 인크. | Methods and reagents for determining genotype of HBC |
| US20100009412A1 (en) | 2006-05-01 | 2010-01-14 | Siemens Healthcare Diagnostics Inc. | Novel Oligonucleotide Primers and Methods for DNA Replication |
| US10131898B2 (en) * | 2014-07-22 | 2018-11-20 | Bio-Rad Laboratories, Inc. | Buffers for use with polymerases |
| CN106318924B (en) * | 2016-10-26 | 2018-10-16 | 江南大学 | A kind of archaeal dna polymerase that catalytic dna synthesis extension ability improves |
| CN109652499B (en) * | 2017-10-12 | 2022-06-03 | 深圳华大智造科技股份有限公司 | Method and kit for rapidly detecting 3'-5' exoactivity or mismatch of DNA polymerase |
| CN110093410A (en) * | 2019-05-21 | 2019-08-06 | 通用生物系统(安徽)有限公司 | A kind of DNA sequencing reaction reagent and preparation method thereof |
| CN116218814A (en) * | 2019-11-13 | 2023-06-06 | 深圳市真迈生物科技有限公司 | Engineered Klenow fragments and uses |
| CN111826366B (en) * | 2020-06-05 | 2022-08-02 | 广州英赞生物科技有限公司 | Direct-amplification type hot-start DNA polymerase and preparation method and application thereof |
Family Cites Families (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CA1338457C (en) * | 1986-08-22 | 1996-07-16 | Henry A. Erlich | Purified thermostable enzyme |
| US4942130A (en) * | 1987-01-14 | 1990-07-17 | President & Fellows Of Harvard College | T7 DNA polymerase |
| US4962020A (en) * | 1988-07-12 | 1990-10-09 | President And Fellows Of Harvard College | DNA sequencing |
| DE69002826T2 (en) * | 1989-02-06 | 1994-03-31 | Eastman Kodak Co | REACTION CONCENTRATE FOR DNA SEQUENCING WITH THERMOSTABLE DNA POLYMERASE. |
| US5001050A (en) * | 1989-03-24 | 1991-03-19 | Consejo Superior Investigaciones Cientificas | PHφ29 DNA polymerase |
| US5108892A (en) * | 1989-08-03 | 1992-04-28 | Promega Corporation | Method of using a taq dna polymerase without 5'-3'-exonuclease activity |
| JPH07500004A (en) * | 1991-07-24 | 1995-01-05 | ユニバーシティ パートナーシップス プロプライエタリー リミテッド | One-step amplification and sequencing of nucleic acids |
-
1994
- 1994-11-24 DE DE0655506T patent/DE655506T1/en active Pending
- 1994-11-24 AT AT94203433T patent/ATE143057T1/en active
- 1994-11-24 ES ES94203433T patent/ES2072238T3/en not_active Expired - Lifetime
- 1994-11-24 DE DE69400567T patent/DE69400567T2/en not_active Expired - Lifetime
- 1994-11-24 EP EP94203433A patent/EP0655506B1/en not_active Expired - Lifetime
- 1994-11-24 DK DK94203433.1T patent/DK0655506T3/en active
-
1995
- 1995-07-31 GR GR950300040T patent/GR950300040T1/en unknown
- 1995-10-16 HU HU9702016A patent/HU220839B1/en unknown
- 1995-10-16 WO PCT/US1995/012928 patent/WO1996012042A2/en active IP Right Grant
- 1995-10-16 CA CA002201885A patent/CA2201885C/en not_active Expired - Lifetime
- 1995-10-16 CN CNB951968297A patent/CN100335621C/en not_active Expired - Lifetime
- 1995-10-16 AU AU41933/96A patent/AU4193396A/en not_active Abandoned
-
1997
- 1997-04-16 FI FI971611A patent/FI120354B/en not_active IP Right Cessation
Also Published As
| Publication number | Publication date |
|---|---|
| EP0655506B1 (en) | 1996-09-18 |
| DE69400567D1 (en) | 1996-10-24 |
| CN100335621C (en) | 2007-09-05 |
| CN1170439A (en) | 1998-01-14 |
| DE69400567T2 (en) | 1997-02-06 |
| GR950300040T1 (en) | 1995-07-31 |
| HUT77037A (en) | 1998-03-02 |
| AU4193396A (en) | 1996-05-06 |
| FI120354B (en) | 2009-09-30 |
| DE655506T1 (en) | 1995-09-28 |
| DK0655506T3 (en) | 1997-03-10 |
| FI971611L (en) | 1997-06-13 |
| ATE143057T1 (en) | 1996-10-15 |
| ES2072238T1 (en) | 1995-07-16 |
| EP0655506A1 (en) | 1995-05-31 |
| FI971611A0 (en) | 1997-04-16 |
| ES2072238T3 (en) | 1997-01-16 |
| HU220839B1 (en) | 2002-06-29 |
| WO1996012042A2 (en) | 1996-04-25 |
| WO1996012042A3 (en) | 1996-08-01 |
| CA2201885A1 (en) | 1996-04-25 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CA2201885C (en) | Dna polymerases having modified nucleotide binding site for dna sequencing | |
| US5614365A (en) | DNA polymerase having modified nucleotide binding site for DNA sequencing | |
| WO1996012042A9 (en) | Dna polymerases having modified nucleotide binding site for dna sequencing | |
| US5939292A (en) | Thermostable DNA polymerases having reduced discrimination against ribo-NTPs | |
| US5001050A (en) | PHφ29 DNA polymerase | |
| JP4417452B2 (en) | Modified thermostable DNA polymerase for sequencing | |
| US5576204A (en) | φ29 DNA polymerase | |
| AU614245B2 (en) | Dna sequencing | |
| US6333159B1 (en) | Cold sensitive mutant DNA polymerases and methods of use thereof | |
| KR101958660B1 (en) | Pcr buffer compositions for increasing activity of dna polymerases with increased mutation specificity | |
| KR20190007214A (en) | Dna polymerases with increased mutation specific amplification | |
| KR100484124B1 (en) | DNA Polymerases Having Modified Nucleotide Binding Site for DNA Sequencing | |
| Hjorleifsdottir et al. | Thermostabilities of DNA ligases and DNA polymerases from four genera of thermophilic eubacteria | |
| AU2023354159A1 (en) | Amplification compositions and methods |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| EEER | Examination request | ||
| MKEX | Expiry |
Effective date: 20151016 |