WO2009124254A1 - Nucleotide analogs - Google Patents

Nucleotide analogs Download PDF

Info

Publication number
WO2009124254A1
WO2009124254A1 PCT/US2009/039475 US2009039475W WO2009124254A1 WO 2009124254 A1 WO2009124254 A1 WO 2009124254A1 US 2009039475 W US2009039475 W US 2009039475W WO 2009124254 A1 WO2009124254 A1 WO 2009124254A1
Authority
WO
WIPO (PCT)
Prior art keywords
nucleotide analog
inhibitor
group
analog
nucleotide
Prior art date
Application number
PCT/US2009/039475
Other languages
French (fr)
Inventor
J. William Efcavitch
Suhaib Siddiqi
Philip Buzby
Judith Mitchell
Edyta Krzymanska-Olejnik
Subramanian Marappan
Xiaopeng Bai
Atanu Roy
Mirna Jarosz
Jayson Bowers
Original Assignee
Helicos Biosciences Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from PCT/US2008/059446 external-priority patent/WO2009123642A1/en
Priority claimed from US12/098,196 external-priority patent/US8071755B2/en
Priority claimed from US12/244,698 external-priority patent/US8114973B2/en
Application filed by Helicos Biosciences Corporation filed Critical Helicos Biosciences Corporation
Publication of WO2009124254A1 publication Critical patent/WO2009124254A1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07HSUGARS; DERIVATIVES THEREOF; NUCLEOSIDES; NUCLEOTIDES; NUCLEIC ACIDS
    • C07H19/00Compounds containing a hetero ring sharing one ring hetero atom with a saccharide radical; Nucleosides; Mononucleotides; Anhydro-derivatives thereof
    • C07H19/02Compounds containing a hetero ring sharing one ring hetero atom with a saccharide radical; Nucleosides; Mononucleotides; Anhydro-derivatives thereof sharing nitrogen
    • C07H19/04Heterocyclic radicals containing only nitrogen atoms as ring hetero atom
    • C07H19/06Pyrimidine radicals
    • C07H19/073Pyrimidine radicals with 2-deoxyribosyl as the saccharide radical
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07HSUGARS; DERIVATIVES THEREOF; NUCLEOSIDES; NUCLEOTIDES; NUCLEIC ACIDS
    • C07H19/00Compounds containing a hetero ring sharing one ring hetero atom with a saccharide radical; Nucleosides; Mononucleotides; Anhydro-derivatives thereof
    • C07H19/02Compounds containing a hetero ring sharing one ring hetero atom with a saccharide radical; Nucleosides; Mononucleotides; Anhydro-derivatives thereof sharing nitrogen
    • C07H19/04Heterocyclic radicals containing only nitrogen atoms as ring hetero atom
    • C07H19/16Purine radicals
    • C07H19/173Purine radicals with 2-deoxyribosyl as the saccharide radical

Definitions

  • the invention relates to nucleotide analogs and methods for sequencing a nucleic acid using the nucleotide analogs.
  • Sequencing-by-synthesis involves the template-dependent addition of nucleotides to a template/primer duplex.
  • Traditional sequencing-by-synthesis is performed using dye- labeled terminators and gel electrophoresis (so-called "Sanger sequencing”). See, e.g., Sanger, F. and Coulson, A.R., 1975, J. MoI. Biol. 94: 441-448; Sanger, F. et al, 1977, Nature. 265(5596): 687-695; and Sanger, F. et al, 1977, Proc. Natl. Acad. ScL U.S.A. 75: 5463-5467.
  • a challenge that has arisen in single molecule sequencing involves the ability to sequence through homopolymer regions (i.e., portions of the template that contain consecutive identical nucleotides). Often the number of bases present in a homopolymer region is important from the point of view of genetic function. Many polymerase enzymes used in sequencing-by- synthesis reactions are highly-processive and tend to add bases continuously in a homopolymer region. It is often difficult to resolve the number of nucleotides in a homopolymer due to the difficulty in distinguishing between the incorporation of one or two labeled nucleotides and the incorporation of a greater number of nucleotides.
  • the invention provides nucleotide analogs and methods of using them to allow sequencing-by-synthesis to occur such that, on average, a single nucleotide is incorporated into the 3' end of a primer portion of a template/primer duplex per sequencing cycle.
  • the invention is based, in part, on the discovery that nucleotide analogs having an attached inhibitory region with one or more charged groups provide good incorporation of a single nucleotide into the duplex without allowing a significant, or any, amount of second, third, etc. base incorporation.
  • the invention generally provides nucleotide analogs and methods of using nucleotide analogs in sequencing. More particularly, the invention provides compounds, methods and compositions useful in introduction of a single base at a time in a template- dependent sequencing-by-synthesis reaction. The invention allows template-dependent sequencing-by-synthesis through all regions of a target nucleic acid, including homopolymer regions, and provides methods for the determination of the number of nucleotides present in a homopolymer region.
  • the invention provides nucleotide analogs that comprise a nucleotide (or nucleotide analog), a detectable label, and an inhibitor group.
  • the inhibitor prevents subsequent nucleotide incorporation into the same duplex.
  • the nucleotide analog does not substantially hinder subsequent nucleotide (or nucleotide analog) incorporation.
  • a method for sequencing a nucleic acid includes the steps of: exposing a nucleic acid duplex comprising a template portion and a primer portion to a nucleotide analog comprising an inhibitor that is charged or capable of becoming charged, and a polymerase, under conditions that permit template-dependent incorporation of the analog into the primer; detecting incorporation of the analog; removing or neutralizing the inhibitor; and repeating the exposing, detecting, and removing steps at least once, thereby to determine the sequence of the template.
  • the invention in another aspect, relates to a nucleotide analog that includes: a nucleoside triphosphate; an inhibitor comprising (a) one or more multiply charged groups or groups capable of becoming multiply charged, or (b) two or more (i.e., a plurality of) singly charged groups or two or more groups capable of becoming singly charged; a detectable label; and a linker connecting the inhibitor and the label to the nucleoside triphosphate. It should be noted that in some embodiments, one or a single charged group may be sufficient to provide the desired inhibitory effect.
  • the invention relates to nucleotide analogs of the formula:
  • NTP is a nucleoside or nucleotide triphosphate or an analog thereof capable of template- dependent incorporation into the 3' end of a polynucleotide strand hybridized to a template.
  • Inhibitor comprises a moiety that is charged or capable of becoming charged and that inhibits subsequent nucleotide incorporation once the first nucleotide is incorporated.
  • Tether is a bond or a group linking the NTP to the Inhibitor group.
  • the inhibitor is a non-steric inhibitor.
  • the invention relates to nucleotide analogs of Formula II:
  • NTP is a nucleoside or nucleotide triphosphate or an analog of either capable of template- dependent incorporation into the 3' end of a polynucleotide strand hybridized to a template presenting the complement of the NTP.
  • L is a detectable label that facilitates the identification of the nucleotide analog.
  • Inhibitor comprises (a) one or more multiply charged groups or groups capable of becoming multiply charged, or (b) two or more singly charged groups or two or more groups capable of becoming singly charged.
  • Ri and R 2 are independently a bond or a group, wherein at least one of Ri and R 2 comprises a cleavable bond, which upon cleavage results in de- association of NTP from both Label and Inhibitor.
  • R 3 is a bond or group linking R 2 to the Inhibitor.
  • R 4 is a bond or group linking R 2 to a Label.
  • NTP is a nucleoside or nucleotide triphosphate or an analog of either capable of incorporating onto the 3' end of a polynucleotide strand hybridized to a template presenting the complement of the NTP;
  • L is a detectable label that facilitates the identification of the nucleotide analog;
  • Inhibitor comprises (a) one or more multiply charged groups or groups capable of becoming multiply charged, or (b) two or more singly charged groups or two or more groups capable of becoming singly charged;
  • Ri comprises a cleavable bond, which upon cleavage results in de-association of NTP from both L and Inhibitor;
  • R 2 is a tri-valent radical having the formula:
  • each of R 2 ' and R 2 " is a bi-valent or tri-valent radical selected from:
  • the invention relates to a method for sequencing a nucleic acid.
  • the method includes: (a) anchoring a nucleic acid duplex, or portion thereof, to a surface, the duplex comprising a template portion and a primer portion hybridized thereto; (b) exposing the duplex to nucleotide analog of Formula I or II (as defined herein) in the presence of a polymerase capable of catalyzing the addition of the nucleotide analog to the primer portion in a template- dependent manner; (c) removing unincorporated nucleotide analog and polymerase; (d) detecting incorporation of the nucleotide analog into the primer portion; and repeating the exposing, removing, and detecting steps at least once.
  • the invention relates to a method for sequencing a nucleic acid, the method comprising the steps of: (a) exposing a nucleic acid duplex comprising a template portion and a primer portion to a nucleotide analog of the following Formula II:
  • NTP is a nucleoside or nucleotide triphosphate or an analog of either capable of incorporating onto the 3 ' end of a polynucleotide strand hybridized to a template presenting the complement of the NTP
  • L is a detectable label that facilitates the identification of the nucleotide analog
  • Inhibitor comprises (a) one or more multiply charged groups or groups capable of becoming multiply charged, or (b) two or more singly charged groups or two or more groups capable of becoming singly charged
  • Ri comprises a cleavable bond, which upon cleavage results in de-association of NTP from both L and Inhibitor
  • R 2 is a tri-valent radical having the formula:
  • each of R 2 ' and R 2 " is a bi-valent or tri-valent radical selected from:
  • the invention provides methods and nucleotide analogs for selectively inhibiting the catalytic function of a polymerase enzyme.
  • nucleotide analogs comprise an inhibitory portion, such that the nucleotide analog is capable of being incorporated into a nucleic acid duplex but then inhibits subsequent nucleotide incorporation until the inhibitory portion is removed.
  • the inhibitory portion of an analog of the invention preferably is a charged group.
  • the charged group can take any appropriate form as long as it carries a charge.
  • the charge group is selected from a phosphate, a carboxylic acid (or carboxylate), a sulfate, caproic acid (or a caproic acid derivative), a charged amino acid, -SO3, -SO 2 , and - NR W R V , where R w and R v independently is H, an alkyl or aryl group.
  • the charged group can convey a negative or positive charge, but negative charged groups are preferred.
  • the charge group contains multiple charged portions.
  • the charge group can be a dipeptide, a di-phosphate, disulfate, or other multiples of charged moieties.
  • amino acid inhibitors are preferably selected from aspartic acid, glutamic acid, arginine, lysine, and histidine.
  • the invention provides charged inhibitors of subsequent base incorporation in a sequencing-by-synthesis reaction.
  • subsequent base incorporation it is intended that a first nucleotide (or analog) is incorporated in a template-dependent manner, but second, third, etc. base incorporation is inhibited by the inhibitor group.
  • inhibition occurs by positioning a charged group in proximity to the active site of a polymerase enzyme, thus disabling the ability of the polymerase to make subsequent incorporations.
  • analogs of the invention interfere with magnesium present in the active site of the polymerase, resulting in a reduced ability of the active site to catalyze subsequent nucleotide incorporation.
  • an analog of the invention comprises a nucleoside triphosphate, an inhibitor comprising a plurality of charged groups, a detectable label, and a linker connecting the charged groups and the label to the nucleoside triphosphate.
  • Preferred inhibitors comprise a plurality of charged groups and may be selected from any charged group capable of conferring a charge in a local area.
  • the inhibitor does not sterically inhibit a polymerase.
  • the linker is cleavable. Multiple cleavable groups, such as enzymatically-cleavable group, such as disulfide bonds and the like.
  • the invention provides methods and compositions that facilitate the addition of a single nucleotide to a template/primer duplex per reaction cycle (i.e., the addition of nucleotides and polymerase enzyme under conditions that result in template-dependent nucleotide incorporation into the primer).
  • Analogs of the invention comprise a charged inhibitory group that, upon incorporation of a nucleotide in a template-dependent manner, prevents subsequent nucleotide incorporation until the inhibitory group is removed.
  • an analog of the invention comprises a nucleotide triphosphate, a linker (or tether), a detectable label, and a charged inhibitory group, wherein the label and the inhibitory group are removable.
  • the invention generally provides nucleotide analogs of the following Formula I:
  • NTP is a nucleoside triphosphate or an analog thereof capable of incorporating onto the 3' end of a polynucleotide strand hybridized to a template presenting the complement of the NTP;
  • Inhibitor comprises a group that is charged or capable of becoming charged, e.g., under reaction conditions, and that inhibits a subsequent incorporation of a nucleotide (or analog thereof), and
  • Tether is a bond or a group linking the NTP to the Inhibitor moiety.
  • a group is considered capable of becoming charged if the group is capable of becoming electrically non-neutral, e.g., under reaction or buffer conditions. Examples of such groups include -COOH and -NR W R V , where R w and R v independently is H, an alkyl or aryl group.
  • the inhibitor group can cause inhibition of subsequent nucleotide incorporation without steric hinderance. In other words, the inhibition is caused by chemical or charge interaction with the enzyme and not be a physical blocking of the enzyme.
  • the charged inhibitor also provides steric inhibition of enzyme activity. However, in either case, the inhibitor group is charged.
  • Natural NTPs include nucleoside triphosphates, adenosine triphosphate (ATP), guanosine triphosphate (GTP), cytidine triphosphate (CTP), thymidine triphosphate (TTP) and uridine triphosphate (UTP); and nucleotide triphosphates, deoxyadenosine triphosphate (dATP), deoxyguanosine triphosphate (dGTP), deoxycytidine triphosphate (dCTP), deoxythimidine triphosphate (dTTP) and deoxyuridine triphosphate (dUTP).
  • NTPs useful in this invention include non-nature nucleosides and nucleotides, and analogs and derivatives thereof.
  • the inhibitor may include a moiety that is negatively charged or capable of becoming a negatively charged. In other embodiments, the inhibitor group is positively charged or capable of becoming positively charged.
  • the inhibitor is an amino acid or an amino acid analog.
  • the Inhibitor may be a peptide of 2 to 20 units of amino acids or analogs, a peptide of 2 to 10 units of amino acids or analogs, a peptide of 3 to 7 units of amino acids or analogs, a peptide of 3 to 5 units of amino acids or analogs.
  • the Inhibitor includes a group selected from the group consisting of GIu, Asp, Arg, His, and Lys, and a combination thereof (e.g., Arg, Arg- Arg, Asp, Asp- Asp, GIu, Glu-Glu, Asp-Glu-Asp, Asp-Asp-Glu or Asp Asp Asp Asp).
  • Peptides or groups may be combinations of the same or different amino acids or analogs.
  • the invention relates to an oligonucleotide with at least one nucleotide analog of the invention incorporated therein.
  • the Tether comprises
  • L is detectable label that facilitates the identification of the nucleotide analog after incorporation onto a template
  • Ri and R 2 are independently a bond or a group, wherein at least one of Ri and R 2 comprises a cleavable bond, which upon cleavage results in de-association of NTP from both L and Inhibitor;
  • R3 is a bond or group linking R 2 to the Inhibitor moiety; and R4 is a bond or group linking R 2 to a L.
  • the present invention is directed to nucleotide analogs of Formula II:
  • NTP is a nucleoside triphosphate or an analog thereof capable of incorporating onto the 3' end of a polynucleotide strand hybridized to a template presenting the complement of the NTP;
  • L is a detectable label to facilitate the identification of the nucleotide analog after incorporation onto the template
  • Inhibitor is a moiety that substantially inhibits a subsequent incorporation of a nucleotide (or analog thereof).
  • the Inhibitor moiety includes a nucleotide or nucleoside or analogs thereof, in other embodiments, the inhibitor is not a nucleotide or analog thereof;
  • Ri and R 2 are independently a bond or a group, wherein at least one of Ri and R 2 comprises a cleavable bond, which upon cleavage results in de-association of NTP from both Label and Inhibitor;
  • R 3 is a bond or group linking R 2 to the Inhibitor moiety; and R 4 is a bond or group linking R 2 to L.
  • NTP is a compound having the following formula: wherein B is selected from the group consisting of purine or pyrimidine bases, as well as derivatives of purine and pyrimidine bases; R' is independently selected from the group consisting of-OH, -0-P(O)(OH) 2 , -O-C(O)-R X , -NHR y , and an -O-blocking agent, where R x and R y are alkyl groups; R" is independently selected from the group consisting of H and -OH.
  • Non-limiting examples of representative purine and pyrimidine bases include adenine, cytosine, guanine, thymine, uracil, or hypoxanthine.
  • Non-limiting examples of derivatives of purine and pyrimidine bases include naturally-occurring and synthetic derivatives of a base, including pyrazolo[3,4-d]pyrimidines, 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine, xanthine, hypoxanthine, 2-aminoadenine, 6-methyl and other alkyl derivatives of adenine and guanine, 2-propyl and other alkyl derivatives of adenine and guanine, 2-thiouracil, 2-thiothymine and 2-thiocytosine, 5-propynyl uracil and cytosine, 6-azo uracil, cytosine and thymine, 5-uracil (pseudouracil), 4-thiouracil, 8-
  • Base B 1 of the invention permits a nucleotide to be incorporated into a polynucleotide chain by a polymerase and forms base pairs with a base on an antiparallel nucleic acid strand.
  • the term base pair encompasses not only the standard AT, AU or GC base pairs, but also base pairs formed between nucleotides and/or nucleotide analogs comprising non-standard or modified bases, wherein the arrangement of hydrogen bond donors and hydrogen bond acceptors permits hydrogen bonding between a nonstandard base and a standard base or between two complementary non-standard base structures.
  • non-standard base pairing is the base pairing between the nucleotide analog inosine and adenine, cytosine or uracil, where two hydrogen bonds are formed.
  • the Inhibitor may include a charged moiety (e.g., a negatively charged moiety, a positively charged moiety, or both) or a moiety that is capable of becoming charged.
  • the Inhibitor can include two or more charged groups.
  • the Inhibitor may have a charged group selected from the group consisting of -COOH, -PO 4 , -SO 4 , -SO3, -SO 2 , -NR W R V , where R w and R v independently is H, an alkyl or aryl group.
  • the Inhibitor moiety does not comprise a -PO 4 group.
  • the Inhibitor moiety does not comprise an aryl group.
  • the Inhibitor does not include a nucleotide or nucleoside or analogs thereof.
  • Inhibitor may be a compound having the following formula:
  • each Ai and each A 2 is independently an amino acid moiety;
  • Rs and R 9 independently is a H or an alkyl group;
  • each of x and y is an integer from 0 to about 10.
  • R 3 of a nucleotide analog of Formula II may include a group having the formula of
  • R 5 is a H or an alkyl group
  • p is an integer from 0 to about 10. In some embodiments, p is 5 or 6.
  • R3 of a nucleotide analog of Formula II may include a group having the formula of
  • k is an integer from about 1 to about 5. In some embodiments, k is an integer from about 2 to about 4. In some embodiments, k is 3.
  • R3 of a nucleotide analog of Formula II may include a group having the formula of
  • R , R are independently H or alkyl groups, and may together form one or more 3, 4, 5, or 6-member rings, and j is an integer from about 1 to about 5.
  • R 3 of include a group having the formula of
  • R3 of a nucleotide analog of Formula II may include a group having the formula of wherein R 1 , R 2 , R 3 , and R 4 are independently H or alkyl groups, and two or more of which may together form one or more 3, 4, 5, or 6-member rings, and j is an integer from about 1 to about 3.
  • R3 of include a group having the formula of
  • Ri of a nucleotide analog of Formula II may include a C-C triple bond, a S-S bond, or both a C-C triple bond and a S-S bond.
  • Ri in the nucleotide analog of Formula II includes a group having the formula of
  • R 6 is a H or an alkyl group; q and r independently is an integer from about 1 to about 10. [0040] In some embodiments, q is 1 or 2 and r is 1, 2 or 3. [0041] In some embodiments, R 2 is a tri-valent radical having the formula: wherein each Of R 2 ' and R 2 " is a bi-valent or tri-valent radical selected from:
  • R 2 " is -(CH 2 ) x - or -(CH 2 -O) x -, where x is 2, 3, 4, 5, or 6.
  • R 2 " is -(CH 2 -O) Z -(CH 2 ) y - or -(CH 2 ) Z -(CH 2 -O) y -, , where y+z is 2, 3, 4, 5, or 6.
  • Advantages of these analogs include increased stability and enhanced level of inhibition, allowing more optimal spacing of the inhibitor moiety within/on the polymerase to increase effective inhibition.
  • Exemplary compounds include:
  • Atto647N is typically used in the form of a carboxylic acid or ester
  • Atto647N (and other Atto dyes) is reacted with an amine group to for the above molecule - hence to amide moiety between the Atto647N and the back bone of the molecule.
  • Atto dyes may be couple to the rest of the molecule by other linkages than an amide linkage, although amide is often preferred for convenience of preparation.
  • the analogs of the invention may also be represented as follows, for example.
  • the location of the charged moiety within the inhibitor group and/or the distance of the charged group to the NTP plays an important role in the effectiveness of inhibiting a subsequent nucleotide incorporation.
  • the charged moiety of the inhibitor is from about 5 to about 60 bonds away from the NTP. In some other embodiments, the charged moiety of the inhibitor is from about 10 to about 40 bonds away from the NTP. In some other embodiments, the charged moiety of the inhibitor is from about 10 to about 35 bonds away from the NTP. In some other embodiments, the charged moiety of the inhibitor is from about 10 to about 30 bonds away from the NTP. In some other embodiments, the charged moiety of the inhibitor is from about 10 to about 20 bonds away from the NTP.
  • the above compound (about 17X fold inhibition) exhibits an inhibiting effect that is much less than the following compound (about 7OX fold inhibition).
  • the label may be any moiety that can be attached to or associated with, e.g., directly or via a linker or spacer, an oligonucleotide and that functions to provide a detectable signal, and/or to interact with a second label to modify the detectable signal provided by the first or second label, e.g. fluorescence resonance energy transfer (FRET).
  • FRET fluorescence resonance energy transfer
  • the label is an optically-detectable moiety (e.g., a fluorophore).
  • types of optically-detectable labels include a fluorescent, chemiluminescence, or electrochemically luminescent label.
  • fluorescent labels include, but are not limited to, 4-acetamido-4'-isothiocyanatostilbene-2,2'disulfonic acid; acridine and derivatives thereof such as acridine, acridine isothiocyanate; 5-(2'-aminoethyl)aminonaphthalene-l -sulfonic acid (EDANS); 4-amino-N-[3-vinylsulfonyl)phenyl]naphthalimide-3,5disulfonate; N-(4-anilino-l- naphthyl)maleimide; anthranilamide; BODIPY; Brilliant Yellow; coumarin and derivatives; coumarin, 7-amino-4-methylcoumarin (AMC, Coumarin 120), 7-amino-4- trifluoromethylcouluarin (Coumaran 15 1); cyanine dyes; cyanosine; 4',6-diaminidino
  • each R x is independently selected from the group consisting of H, alkyl, and substituted alkyl.
  • the above exemplary label moieties include any derivatives containing the chromophore of any of the labeling moieties exemplified or described herein, attached to the nucleotide analog by means of any suitable chemical linking group.
  • the chromophore can be attached to the nucleotide analog via an alkyl chain bonded to the nucleotide analog by a functional group such as an amide, ester, ether, amine, thiol, disulfide, urea, urethane, carbonate, etc.
  • the label is a fluorescent label such as cyanine-3 and cyanine-5.
  • Labels other than fluorescent labels are contemplated as part of the invention, including other optically-detectable labels. Any appropriate detectable label can be used according to the invention, and numerous other labels are known to those skilled in the art.
  • the invention also relates to methods for nucleic acid sequence determination using the nucleotide analogs described herein.
  • the nucleotide analogs of the invention are particularly suitable for use in single molecule sequencing techniques. Such techniques are described for example in U.S. Patent Application Serial Nos. 10/831,214 filed April 2004; 10/852,028 filed May 24,2004; 10/866,388 filed June 10,2005; 10/099,459 filed March 12,2002; and U.S. Published Application 2003/013880 published July 24, 2003, each of which is herein incorporated in its entirety for all purposes.
  • methods for nucleic acid sequence determination include exposing a target nucleic acid (also referred to herein as template nucleic acid or template) to a primer that is complementary to at least a portion of the target nucleic acid, under conditions suitable for hybridizing the primer to the target nucleic acid, forming a template/primer duplex.
  • a target nucleic acid also referred to herein as template nucleic acid or template
  • primer that is complementary to at least a portion of the target nucleic acid
  • the invention also relates to methods for nucleic acid sequence determination using the nucleotide analogs described herein.
  • the nucleotide analogs of the invention are particularly suitable for use in single molecule sequencing techniques. Such techniques are described for example in U.S. Patent Application Serial Nos. 10/831,214 filed April 2004; 10/852,028 filed May 24,2004; 10/866,388 filed June 10,2005; 10/099,459 filed March 12,2002; and U.S. Published Application 2003/013880 published July 24, 2003, each of which is herein incorporated in its entirety for all purposes.
  • methods for nucleic acid sequence determination include exposing a target nucleic acid (also referred to herein as template nucleic acid or template) to a primer that is complementary to at least a portion of the target nucleic acid, under conditions suitable for hybridizing the primer to the target nucleic acid, forming a template/primer duplex.
  • a target nucleic acid also referred to herein as template nucleic acid or template
  • primer that is complementary to at least a portion of the target nucleic acid
  • the invention in another aspect, relates to a method for sequencing a nucleic acid.
  • the method includes: (a) anchoring a nucleic acid duplex to a surface, the duplex comprising a template portion and a primer portion hybridized thereto; (b) exposing the duplex to nucleotide analog of Formula I or Formula II in the presence of a polymerase capable of catalyzing the addition of the nucleotide analog to the primer portion in a template- dependent manner; (c) removing unincorporated nucleotide analog and polymerase; (d) detecting incorporation of the nucleotide analog into the primer portion; and (e) repeating said exposing, removing, and detecting steps at least once.
  • the method may further include cleaving L from the nucleotide analog after the detecting step.
  • the invention in another aspect, relates to a method for inhibiting the catalytic function of a polymerase enzyme in a sequencing-by-synthesis reaction comprising introducing a nucleotide attached to an inhibitory group.
  • the invention comprises attaching one or both members of a template/primer duplex to a surface, introducing a polymerase and a nucleotide analog comprising a charged inhibitor under conditions sufficient for template- dependent incorporation of the nucleotide and inhibition of subsequent incorporation.
  • Such methods further comprise removing or neutralizing the inhibitor in order to facilitate further nucleotide incorporation.
  • nucleotides of the invention can be detectably labeled to monitor incorporation.
  • Target nucleic acids include deoxyribonucleic acid (DNA) and/or ribonucleic acid (RNA).
  • Target nucleic acid molecules can be obtained from any cellular material obtained from an animal, plant, bacterium, virus, fungus, or any other cellular organism, or may be synthetic DNA.
  • Target nucleic acids may be obtained directly from an organism or from a biological sample obtained from an organism, e.g., from blood, urine, cerebrospinal fluid, seminal fluid, saliva, sputum, stool and tissue. Any tissue or body fluid specimen may be used as a source for nucleic acid for use in the invention.
  • Nucleic acid molecules may also be isolated from cultured cells, such as a primary cell culture or a cell line.
  • the cells from which target nucleic acids are obtained can be infected with a virus or other intracellular pathogen.
  • Nucleic acid molecules may also include those of animal (including human), wild type or engineered prokaryotic or eukaryotic cells, viruses or completely or partially synthetic RNAs or DNAs.
  • a sample can also be total RNA extracted from a biological specimen, a cDNA library, or genomic DNA.
  • Nucleic acid typically is fragmented to produce suitable fragments for analysis.
  • nucleic acid from a biological sample is fragmented by sonication.
  • Test samples can be obtained as described in U.S. Patent Application 2002/0190663 Al, published October 9, 2003, herein incorporated by reference in its entirety for all purposes.
  • nucleic acid can be extracted from a biological sample by a variety of techniques such as those described by Maniatis, et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, N.Y., pp. 280-281 (1982).
  • target nucleic acid molecules can be from about 5 bases to about 20 kb, about 30 kb, or even about 40 kb or more.
  • Nucleic acid molecules may be single- stranded, double-stranded, or double-stranded with single-stranded regions (for example, stem- and loop-structures)
  • Single molecule sequencing includes a template nucleic acid molecule/primer duplex that is immobilized on a surface such that the duplex and/or the nucleotides (or nucleotide analogs) added to the immobilized primer are individually optically resolvable.
  • the primer, template and/or nucleotide analogs are detectably labeled such that the position of an individual duplex molecule is individually optically resolvable.
  • Either the primer or the template is immobilized to a solid support.
  • the primer and template can be hybridized to each other and optionally covalently cross-linked prior to or after attachment of either the template or the primer to the solid support.
  • methods for facilitating the incorporation of a nucleotide analog as an extension of a primer include exposing a target nucleic acid/primer duplex to one or more nucleotide analogs disclosed herein and a polymerase under conditions suitable to extend the primer in a template dependent manner.
  • the primer is sufficiently complementary to at least a portion of the target nucleic acid to hybridize to the target nucleic acid and allow template-dependent nucleotide polymerization.
  • the primer extension process can be repeated to identify additional nucleotide analogs in the template.
  • the sequence of the template is determined by compiling the detected nucleotides, thereby determining the complementary sequence of the target nucleic acid molecule.
  • Any polymerase and/or polymerizing enzyme may be employed.
  • a preferred polymerase is Klenow with reduced exonuclease activity.
  • Nucleic acid polymerases generally useful in the invention include DNA polymerases, RNA polymerases, reverse transcriptases, and mutant or altered forms of any of the foregoing. DNA polymerases and their properties are described in detail in, among other places, DNA Replication 2nd edition, Komberg and Baker, W. H. Freeman, New York, N.Y. (1991).
  • Known conventional DNA polymerases useful in the invention include, but are not limited to, Pyrococcus furiosus (Pfu) DNA polymerase (Lundberg et al., 1991, Gene, 108: 1, Stratagene), Pyrococcus woesei (Pwo) DNA polymerase (Hinnisdaels et al., 1996, Biotechniques, 20: 186-8, Boehringer Mannheim), Thermus thermophilus (Tth) DNA polymerase (Myers and Gelfand 1991, Biochemistry 30:7661), Bacillus stearothermophilus DNA polymerase (Stenesh and McGowan, 1977, Biochim Biophys Acta 475:32), Thermococcus litoralis (TIi) DNA polymerase (also referred to as VentTM DNA polymerase, Cariello et al., 1991, Polynucleotides Res, 19: 4 193, New England Biolabs), 9"Nm DNA polymerase (New England Biolabs), Stoff
  • thermococcus sp Thermus aquaticus (T aq) DNA polymerase (Chien et al., 1976, J. Bacteoriol, 127: 1550), DNA polymerase, Pyrococcus kodakaraensis KOD DNA polymerase (Takagi et al., 1997, Appl. Environ. Microbiol. 63:4504), JDF-3 DNA polymerase (from thermococcus sp.
  • DNA polymerases include, but are not limited to, ThermoSequenase ® , 9°NmTM, TherminatorTM, Taq, Tne, Tma, Pfu, TfI, Tth, TIi, Stoffel fragment, VentTM and Deep VentTM DNA polymerase, KOD DNA polymerase, Tgo, JDF-3, and mutants, variants and derivatives thereof.
  • Reverse transcriptases useful in the invention include, but are not limited to, reverse transcriptases from HIV, HTLV-I, HTLV-11, FeLV, FIV, SIV, AMV, MMTV, MoMuLV and other retroviruses (see Levin, Cell 88:5-8 (1997); Verma, Biochim Biophys Acta. 473:1-38 (1977); Wu et al., CRC Crit Rev Biochem. 3:289-347(1975)).
  • Unincorporated nucleotide analog molecules may be removed prior to or after detecting. Unincorporated nucleotide analog molecules may be removed by washing.
  • a template/primer duplex is treated to remove the label and/or to cleave the molecular chain attaching the label to the nucleotide.
  • nucleotide analog after removal of the label and portions of the molecular chain connecting the label to the nucleotide can be represented by:
  • R is a N-containing group such as a primary amino group, a secondary amino group, a tertiary amino group, an amide group,
  • R is a phosphodiester linkage connecting the nucleotide analog to a sugar of an adjacent nucleotide in the nucleic acid, or a phosphoryl group.
  • z is an integer from about 1 to about 5. In some other embodiments, z is an integer from about 1 to about 3.
  • the invention also provides for a method of removing a label from a labeled base, comprising(a) exposing a base of Formula I or Formula II:
  • B 1 is a part of the NTP of a nucleotide analog in Formula I or Formula II, and n is an integer from about 1 to about 12.
  • the reducing agent is tris (2-carboxyl ethyl) phosphine.
  • the base is linked to a sugar selected from the group consisting of ribose, deoxyribose, and analogs thereof, where the base and sugar together may be present in a nucleotide in a nucleic acid.
  • One embodiment of a method for sequencing a nucleic acid template includes exposing a nucleic acid template to a primer capable of hybridizing to the template , a polymerase capable of catalyzing nucleotide addition to the primer, and a labeled nucleotide analog disclosed herein under conditions to permit the polymerase to add the nucleotide analog to the primer.
  • a method for sequencing may further include identifying or detecting the incorporated labeled nucleotide.
  • a cleavable bond may then be cleaved, removing at least the label from the nucleotide analog.
  • the exposing, detecting, and removing steps are repeated at least once. In certain embodiments, the exposing, detecting, and removing steps are repeated at least three, five, ten or even more times.
  • the sequence of the template can be determined based upon the order of incorporation of the labeled nucleotides.
  • a method for sequencing a nucleic acid template includes exposing a nucleic acid template to a primer capable of hybridizing to the template and a polymerase capable of catalyzing nucleotide addition to the primer.
  • the polymerase is, for example, Klenow with reduced exonuclease activity.
  • the polymerase adds a labeled nucleotide analog disclosed herein.
  • the method may include identifying the incorporated labeled nucleotide. Once the labeled nucleotide is identified, the label and at least a portion of a molecular chain connecting the label to the nucleotide analog are removed and the remaining portion of the molecular chain includes a free hydroxyl group.
  • the exposing, incorporating, identifying, and removing steps are repeated at least once, preferably multiple times depending on the application.
  • the sequence of the template is determined based upon the order of incorporation of the labeled nucleotides.
  • Removal of a label from a labeled nucleotide analog and/or cleavage of the molecular chain linking a nucleotide analog to a label may include contacting or exposing the labeled nucleotide with a reducing agent.
  • Such reducing agents include, for example, dithiothreitol (DTT), tris(2-carboxyethyl)phosphine (TCEP), tris(3-hydroxy -propyl) phosphine, tris(2-chloropropyl) phosphate (TCPP), 2-mercaptoethanol, 2-mercaptoethylarnine, cystein and ethylmaleimide.
  • DTT dithiothreitol
  • TCEP tris(2-carboxyethyl)phosphine
  • TCPP tris(2-chloropropyl) phosphate
  • 2-mercaptoethanol 2-mercaptoethylarnine
  • cystein cystein
  • ethylmaleimide Such contacting or exposing the reducing agent to a labeled nucleotide analog may occur at a range of pH values, for example at a pH of about 5 to about 10, or about 7 to about 9.
  • the above-described methods for sequencing a nucleic acid template can further include a step of capping a molecular chain, for example, after the label has been removed.
  • any optional 3' phosphate moiety can be removed enzymatically.
  • an optional phosphate can be removed using alkaline phosphatase or T4 polynucleotide kinase.
  • Suitable enzymes for removing optional phosphate include, any phosphatase, for example, alkaline phosphatase such as shrimp alkaline phosphatase, bacterial alkaline phosphatase, or calf intestinal alkaline phosphatase.
  • any suitable detection method may be used to identify an incorporated nucleotide analog.
  • exemplary detection methods include radioactive detection, optical absorbance detection, e.g., UV-visible absorbance detection, optical emission detection, e.g., fluorescence or chemiluminescence.
  • Single-molecule fluorescence can be carried out using a conventional microscope equipped with total internal reflection (TIR) objective.
  • TIR total internal reflection
  • the detectable moiety associated with the extended primers can be detected on a substrate by scanning all or portions of each substrate simultaneously or serially, depending on the scanning method used.
  • fluorescence labeling selected regions on a substrate may be serially scanned one-by-one or row-by-row using a fluorescence microscope apparatus, such as described in Fodor (U.S. Patent No.
  • a phosphorimager device can be used (Johnston et al., Electrophoresis, 13566, 1990; Drmanac et al., Electrophoresis, 13:566, 1992; 1993).
  • Other commercial suppliers of imaging instruments include General Scanning Inc., (Watertown, Mass. on the World Wide Web at genscan.com), Genix Technologies (Waterloo, Ontario, Canada; on the World Wide Web at confocal.com), and Applied Precision Inc. Such detection methods are particularly useful to achieve simultaneous scanning of multiple attached target nucleic acids.
  • the present invention provides for detection of molecules ranging from a single nucleotide to a single target nucleic acid molecule.
  • a number of methods are available for this purpose.
  • Methods for visualizing single molecules within nucleic acids labeled with an intercalating dye include, for example, fluorescence microscopy. For example, the fluorescent spectrum and lifetime of a single molecule excited-state can be measured. Standard detectors such as a photomultiplier tube or avalanche photodiode can be used. Full field imaging with a two-stage image intensified CCD camera also can be used. Additionally, low noise cooled CCD can also be used to detect single fluorescent molecules.
  • the detection system for the signal may depend upon the labeling moiety used.
  • a combination of an optical fiber or charge coupled device (CCD) can be used in the detection step.
  • CCD charge coupled device
  • the substrate is itself transparent to the radiation used, it is possible to have an incident light beam pass through the substrate with the detector located opposite the substrate from the target nucleic acid.
  • various forms of spectroscopy systems can be used.
  • Various physical orientations for the detection system are available and discussion of design parameters is provided in the art.
  • Optical setups include near-field scanning microscopy, far-field confocal microscopy, wide-field epi-illumination, but are not limited to, light scattering, dark field microscopy, photoconversion, single and/or multiphoton excitation, spectral wavelength discrimination, fluorophore identification, evanescent wave illumination, and total internal reflection fluorescence (TIRF) microscopy.
  • TIRF total internal reflection fluorescence
  • certain methods involve detection of laser-activated fluorescence using a microscope equipped with a camera.
  • Suitable photon detection systems include, but are not limited to, photodiodes and intensified CCD cameras.
  • an intensified charge couple device (ICCD) camera can be used.
  • ICCD intensified charge couple device
  • the use of an ICCD camera to image individual fluorescent dye molecules in a fluid near a surface provides numerous advantages. For example, with an ICCD optical setup, it is possible to acquire a sequence of images (movies) of fluorophores.
  • TIRF microscopy uses totally internally reflected excitation light and is well known in the art. See, e g., the World Wide Web at nikoninstrurnents.jp/eng/page/products/tirf.aspx.
  • detection is carried out using evanescent wave illumination and total internal reflection fluorescence microscopy.
  • An evanescent light field can be set up at the surface, for example, to image fluorescently-labeled nucleic acid molecules.
  • the excitation light beam penetrates only a short distance into the liquid.
  • the optical field does not end abruptly at the reflective interface, but its intensity falls off exponentially with distance.
  • This surface electromagnetic field called the "evanescent wave”
  • the thin evanescent optical field at the interface provides low background and facilitates the detection of single molecules with high signal-to-noise ratio at visible wavelengths.
  • the evanescent field also can image fluorescently-labeled nucleotides upon their incorporation into the attached target nucleic acid target molecule/primer complex in the presence of a polymerase. Total internal reflectance fluorescence microscopy is then used to visualize the attached target nucleic acid target molecule/primer complex and/or the incorporated nucleotides with single molecule resolution.
  • Fluorescence resonance energy transfer can be used as a detection scheme. FRET in the context of sequencing is described generally in Braslavasky, et al., Proc. Nat'l Acad. ScL, 100: 3960-3964 (2003), incorporated by reference herein.
  • a donor fluorophore is attached to the primer, polymerase, or template. Nucleotides added for incorporation into the primer comprise an acceptor fluorophore that is activated by the donor when the two are in proximity.
  • Measured signals can be analyzed manually or preferably by appropriate computer methods to tabulate results.
  • the signals of millions of analogs are read in parallel and then deconvoluted to ascertain a sequence.
  • the substrates and reaction conditions can include appropriate controls for verifying the integrity of hybridization and extension conditions, and for providing standard curves for quantification, if desired.
  • a control nucleic acid can be added to the sample. The absence of the expected extension product is an indication that there is a defect with the sample or assay components requiring correction.
  • the described nucleotide analogs can be used to facilitate "four color" sequencing by synthesis if each base (A, C, G, T) is labeled with a dye emitting and/or absorbing at a different and resolvable wavelength.
  • the sequencing procedure can be shortened from four separate addition cycles (i.e., one for each base) to the following: add A, C, G, T (each differently labeled) with polymerase and an appropriate reaction buffer, rinse, image the four resolvable dyes and record which base (if any) was incorporated, cleave and cap the nucleotides, and repeat.
  • the described nucleotide analogs facilitate this kind of sequencing because of their ability to incorporate one and only one base at a time. Without that ability, if all four bases are added to the incorporation reaction at once multiple bases would be added to a given strand and the interactions between the proximate dyes would hinder the ability to resolve the sequence information correctly.
  • the nucleotide analogs described herein can facilitate sequencing nucleic acids containing homopolymer sequences, using sequencing by synthesis methodology (e.g., using the methods of US 2007/0190546, herein incorporated by reference in its entirety for all purpose.
  • sequencing by synthesis methodology e.g., using the methods of US 2007/0190546, herein incorporated by reference in its entirety for all purpose.
  • nucleotide analog, and reaction buffer combination that allows for only a single nucleotide analog incorporation allows for each base in the homopolymer to be sequenced sequentially. After one base is incorporated into the homopolymer and detected, the portion of the analog that inhibits subsequent base incorporation and that contains the fluorescent label is removed, making incorporation of the next base in the homopolymer possible during the next addition cycle of the correct base.
  • ⁇ - N-Fmoc-S-tert-butylthio-L-cysteine (1 g, 2.32 mmol) was dissolved in anhydrous acetonitrile and solution of dicyclohexylcarbodiimide (DCC) (573 mg, 2.78 mmol in CH 3 CN) was added followed by solution of NHS (345 mg, 3.01 mmol in CH 3 CN). After 1 hr. dicyclohexylurea was spun down and active ester used without purification in coupling with ⁇ - amino-hexanoic acid (304 mg, 2.32 mmol) dissolved in 50% aq. DMF.
  • DCC dicyclohexylcarbodiimide
  • DIPEA N,N'- Diisopropylethylamine
  • dATP-AP3 and dCTP-AP3 were prepared by a modified procedure of Hobbs and Cocuzza: a) Pyrophosphate and tributylamine were added to the reaction mixture rather than vice versa; b) After pyrophosphate addition the reaction was quenched with 50 mM TEAB within 15 min.; c) DEAE-Sephadex chromatography was replaced by preparative HPLC.
  • reaction mixture was purified with HPLC (Waters Delta 600 pump and 2487 Dual ⁇ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.00 mm 10 micron, gradient: 100% A for 5 min, then 1% B/min, buffer A 0.05 M TEAB, buffer B CH 3 CN, 10 mL/min flow).
  • HPLC Waters Delta 600 pump and 2487 Dual ⁇ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.00 mm 10 micron, gradient: 100% A for 5 min, then 1% B/min, buffer A 0.05 M TEAB, buffer B CH 3 CN, 10 mL/min flow).
  • the NHS ester of the acid 32 was prepared by dissolving the acid 32 (3.0 ⁇ mol) in DMF (500.0 ⁇ L) and N,N,N',N'-Tetramethyl-O-(N-succinimidyl)uronium hexafluorophosphate (SbTMU) (4.3 mg, 12 ⁇ mol) in 100 ⁇ L DMF was added to the acid solution followed by the addition of DIPEA (80 ⁇ L). After stirring at RT for 1 hr., the reaction mixture was used immediately for peptide coupling without any purification.
  • the peptide Arg- Arg-Arg-OH (14.5 mg, 30 ⁇ mol) was dissolved in 160 ⁇ L 0.5M phosphate buffer, and added to the freshly prepared NHS ester of the acid 32. The reaction mixture was stirred for 30 minutes and then the crude reaction mixture was purified with HPLC (Waters Delta 600 pump and 2487 Dual ⁇ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.00 mm 10 micron, gradient: 100% A for 5 min, then 1% B/min, buffer A 0.05 M TEAB, buffer B MeCN, 10 mL/min flow).
  • HPLC Waters Delta 600 pump and 2487 Dual ⁇ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.00 mm 10 micron, gradient: 100% A for 5 min, then 1% B/min, buffer A 0.05 M TEAB, buffer B MeCN, 10 mL/min flow).
  • the NHS ester of the acid 45 was prepared by dissolving the acid 45 (4.0 ⁇ mol, 1 eqv.) in DMF (700.0 ⁇ L) and the SbTMU 5.93 mg, 16.5 ⁇ mol, in 200 ⁇ L DMF, 4.0 eqv.) was added, to the acid solution followed by the addition of DIPEA (103.0 ⁇ L). After stirring at RT for 1 hour, the reaction mixture was used immediately for peptide coupling without any purification. The peptide (Asp- Asp-Asp- Asp) was dissolved in DMFB 2 O (400.0 ⁇ L, 1 :1), basif ⁇ ed using DIPEA (50.0 ⁇ L).
  • HPLC fractions containing the thiol 7 (0.34 ⁇ mol, 1 eqv.) were mixed with HPLC fractions containing dCTP-SPDP (0.41 ⁇ mol, 1.25 eqv.) in an aluminum foil covered flask. After 15 min. LCMS analysis indicated that the completion of the reaction and it was then partially concentrated under reduced pressure to remove CH 3 CN, then HPLC purified (Waters Delta 600 pump and 2487 Dual ⁇ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 10.00 mm 10 micron, gradient: 100% A for 5 min, then 1% B/min, buffer A 0.05 M TEAB, buffer B CH 3 CN, 5 mL/min flow).
  • HPLC fractions containing thiol 47 were mixed with HPLC fractions containing dATP-SPDP (0.6 ⁇ mol, 1.2 eqv.) in an aluminum foil covered flask. After 15 min. LCMS analysis indicated that the completion of the reaction and it was then partially concentrated under reduced pressure to remove CH 3 CN, then HPLC purified (Waters Delta 600 pump and 2487 Dual ⁇ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 10.00 mm 10 micron, gradient: 100% A for 5 min., then 1% B/min, buffer A 0.05M TEAB, buffer B CH 3 CN, 5 mL/min flow).
  • Fmoc-Cys(S ⁇ Bu)-OH (2.0 g, 4.63 mmol, 1 eqv.) was dissolved in CH 3 CN (10 mL).
  • DCC 1.2 g, 5.81 mmol, 1.26 eqv.
  • NHS 0.1 g, 6.08 mmol, 1.31 eqv.
  • DCU White precipitate
  • 6-Aminohexanoic acid (0.60 g, 4.57 mmol, 1 eqv.) was dissolved in 1 :1 H 2 O:DMF (6 mL total). DIPEA (0.016 mL) was added to keep the pH about 8. NHS ester (4.63 mmol in 10 mL CH3CN, 1.01 eqv.) was added to the reaction mixture in 1 mL aliquots over aboutlO min. DIPEA (0.02 mL) was added after each aliquot to keep the reaction basic. After the first aliquot of NHS ester was added, the reaction became cloudy, and addition of extra H 2 O (0.2 mL) was needed to clear up the solution.
  • HPLC fractions containing the thiol were mixed with HPLC fractions containing SPDP-dATP (5 ⁇ mol, 1 equiv). After the SPDP-dATP was consumed based on LCMS analysis (about 10 min), the reaction was partially concentrated under reduced pressure to remove CH 3 CN and then HPLC purified (Waters Delta 600 pump and 2487 Dual ⁇ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 15.0 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min., buffer A 0.05M TEAB, buffer B CH 3 CN, 10 mL/min. flow). Fractions containing the desired were pooled and lyophilized, then used for subsequent reactions without quantifying.
  • Atto647N-NHS ester (0.030 mL, 1.8 ⁇ mol, 0.06M in anhydrous DMF, 3.6 eqv.) was added to a solution of amine (0.5 ⁇ mol, 1 eqv.) in H 2 O (0.25 mL) in 10 ⁇ L aliquots. The reaction was monitored by LCMS to determine how much dye was needed to consume the starting amine.
  • HPLC fractions containing the thiol were mixed with HPLC fractions containing SPDP-dGTP (1.5 ⁇ mol, 1 eqv.). After the SPDP-dGTP was consumed based on LCMS analysis (about 10 min), the reaction was partially concentrated under reduced pressure to remove CH 3 CN and then HPLC purified (Waters Delta 600 pump and 2487 Dual ⁇ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 10.0 mm 10 micron, gradient: 100% A for 3 min., then 1% B/min, buffer A 0.05M TEAB, buffer B CH 3 CN, 5 mL/min. flow). Fractions containing the desired were pooled and lyophilized, then used for subsequent reactions without quantifying.
  • Atto647N-NHS ester (0.011 mL, 0.66 ⁇ mol, 0.06 M in anhydrous DMF, 2.5 eqv.) was added to a solution of amine (0.26 ⁇ mol, 1 equiv) in H2O (0.50 mL) in small aliquots. The reaction was monitored by LCMS to determine how much dye was needed to consume the starting amine.
  • HPLC fractions containing the thiol were mixed with SPDP-dCTP (1 ⁇ mol, 1 eqv.) in H 2 O (0.20 mL). After the SPDP-dCTP was consumed based on LCMS analysis (about 10 min.), the reaction was partially concentrated under reduced pressure to remove CH 3 CN and then HPLC purified (Waters Delta 600 pump and 2487 Dual ⁇ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min, then 3% B/min., buffer A 0.05M TEAB, buffer B CH 3 CN, 5 mL/min. flow). Fractions containing the desired were pooled and lyophilized, then used for subsequent reactions without quantifying.
  • Atto647N-NHS ester (0.012 mL, 0.72 ⁇ mol, 0.06M in anhydrous DMF, 3.6 eqv.) was added to a solution of amine (0.15 ⁇ mol, 1 eqv.) in H 2 O (0.20 mL) in 5 ⁇ L aliquots. The reaction was monitored by LCMS to determine how much dye was needed to consume the starting amine.
  • Atto647N-NHS ester (0.010 mL, 0.68 ⁇ mol, 0.06M in anhydrous DMF, 3.6 eqv.) was added to a solution of amine (0.19 ⁇ mol, 1 eqv.) in H 2 O (0.40 mL) in small aliquots.
  • IM K 2 HPO 4 (0.40 mL) was also added to accelerate the reaction after there was little product formed within an hour. The reaction was monitored by LCMS to determine how much dye was needed to consume the starting amine.
  • HPLC fractions containing thiol 57 were mixed with HPLC fractions containing SPDP-dATP (20 ⁇ mol). After -15 minutes the reaction was lyophilized, then HPLC purified (Waters Delta 600 pump and 2487 Dual ⁇ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield 58, which was used for the subsequent reaction without quantifying.
  • HPLC fractions containing thiol 57 were mixed with HPLC fractions containing SPDP-dCTP (45 ⁇ mol). After -30 minutes the reaction was lyophilized, then HPLC purified (Waters Delta 600 pump and 2487 Dual ⁇ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield 61, which was used for the subsequent reaction without quantifying.
  • Acid 54 (0.14 g, 0.34 mmol) was dissolved in MeCN (0.6 mL). DCC (0.085 g, 0.41 mmol) was added, followed by NHS (0.051 g, 0.44 mmol) and the reaction was stirred at RT for an hour. White precipitate (DCU) began forming within five minutes. The reaction mixture was transferred to eppendorf tubes and centrifuged to remove the white precipitate. The supernatant containing NHS ester 55 was then added to a solution of 6-aminocaproic acid (0.049 g, 0.37 mmol) in H 2 O (0.4 mL) and DMF (0.4 mL). DIPEA (0.05 mL) was added to keep the pH ⁇ 8.
  • Acid 64 (0.12 g, 0.23 mmol) was dissolved in MeCN (0.6 mL). DCC (0.058 g, 0.28 mmol) was added, followed by NHS (0.035 g, 0.3 mmol) and the reaction was stirred at RT for an hour. White precipitate (DCU) began forming within twenty minutes. The reaction mixture was transferred to eppendorf tubes and centrifuged to remove the white precipitate. The supernatant containing NHS ester 65 was then added to a solution of H-Asp-Asp-OH (0.075 g, 0.30 mmol) in 0.1 M K 2 HPO 4 (0.5 mL) and MeCN (0.5 mL).
  • HPLC fractions containing thiol 67 were mixed with HPLC fractions containing SPDP-dUTP (50 ⁇ mol). After ⁇ 1 hr the reaction was concentrated to remove MeCN, then HPLC purified (Waters Delta 600 pump and 2487 Dual ⁇ Absorbance Detector, Phenomenex C18 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 5 min, then 2% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield 68, which was used for the subsequent reaction without quantifying.
  • Carbamate 68 (unqualified, -50 ⁇ mol) was treated with 20% piperidine/MeCN (2 mL) for 30 minutes to remove the Fmoc protecting group. The solvent was removed under reduced pressure, and the residue was dissolved in 50 mM TEAB buffer ( ⁇ 3mL), causing formation of copious white precipitate (dibenzylfulvene). The mixture was transferred to eppendorf tubes and centrifuged to remove the precipitate.
  • HPLC fractions containing thiol 73 were mixed with HPLC fractions containing SPDP-dGTP (58 ⁇ mol). After -30 minutes the reaction was concentrated to remove MeCN, then HPLC purified (Waters Delta 600 pump and 2487 Dual ⁇ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield 74, which was used for the subsequent reaction without quantifying.
  • nucleotide analogs disclosed here include compounds which otherwise correspond thereto, and which have the same general properties thereof, wherein one or more simple variations of substituents or components are made which do not adversely affect the characteristics of the nucleotide analogs of interest.
  • the components of the nucleotide analogs disclosed herein may be prepared by the methods illustrated in the general reaction schema as described herein or by modifications thereof, using readily available starting materials, reagents, and conventional synthesis procedures.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Biochemistry (AREA)
  • Biotechnology (AREA)
  • General Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Molecular Biology (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The invention provides for novel nucleotide analogs and methods of using the same, e.g., for sequencing nucleic acids.

Description

NUCLEOTIDE ANALOGS
Related Applications
[0001 ] This application claims priority to PCT International Application No. PCT/US08/59446 filed April 4, 2008 and to U.S. Application Serial No. 12/098,196 filed on April 4, 2008, and to U.S. Application Serial No. 12/244,698 filed October 2, 2008, the entire contents of each of which are expressly incorporated herein by reference for all purposes.
Field of the Invention
[0002] The invention relates to nucleotide analogs and methods for sequencing a nucleic acid using the nucleotide analogs.
Background
[0003] Sequencing-by-synthesis involves the template- dependent addition of nucleotides to a template/primer duplex. Traditional sequencing-by-synthesis is performed using dye- labeled terminators and gel electrophoresis (so-called "Sanger sequencing"). See, e.g., Sanger, F. and Coulson, A.R., 1975, J. MoI. Biol. 94: 441-448; Sanger, F. et al, 1977, Nature. 265(5596): 687-695; and Sanger, F. et al, 1977, Proc. Natl. Acad. ScL U.S.A. 75: 5463-5467. Recently, single molecule sequencing methods have been proposed that provide increased resolution, throughput, and speed at reduced cost. For example, a sequencing-by-synthesis method that results in sequence determination without consecutive base incorporation, has been proposed by Braslavsky, et al., Proc. Nat'lAcad. Sd., 100: 3960-3964 (2003). These methods do not rely on the user of terminator nucleotides as in Sanger sequencing. Instead, template/primer duplex is anchored directly, or indirectly (e.g., via a polymerase enzyme) to a surface and labeled nucleotides are added in a template-dependent manner.
[0004] A challenge that has arisen in single molecule sequencing involves the ability to sequence through homopolymer regions (i.e., portions of the template that contain consecutive identical nucleotides). Often the number of bases present in a homopolymer region is important from the point of view of genetic function. Many polymerase enzymes used in sequencing-by- synthesis reactions are highly-processive and tend to add bases continuously in a homopolymer region. It is often difficult to resolve the number of nucleotides in a homopolymer due to the difficulty in distinguishing between the incorporation of one or two labeled nucleotides and the incorporation of a greater number of nucleotides.
[0005] A need therefore exists for nucleotide analogs that promote accurate base-over- base incorporation in sequencing-by-synthesis reactions.
Summary of the Invention
[0006] The invention provides nucleotide analogs and methods of using them to allow sequencing-by-synthesis to occur such that, on average, a single nucleotide is incorporated into the 3' end of a primer portion of a template/primer duplex per sequencing cycle. The invention is based, in part, on the discovery that nucleotide analogs having an attached inhibitory region with one or more charged groups provide good incorporation of a single nucleotide into the duplex without allowing a significant, or any, amount of second, third, etc. base incorporation.
[0007] The invention generally provides nucleotide analogs and methods of using nucleotide analogs in sequencing. More particularly, the invention provides compounds, methods and compositions useful in introduction of a single base at a time in a template- dependent sequencing-by-synthesis reaction. The invention allows template-dependent sequencing-by-synthesis through all regions of a target nucleic acid, including homopolymer regions, and provides methods for the determination of the number of nucleotides present in a homopolymer region.
[0008] The invention provides nucleotide analogs that comprise a nucleotide (or nucleotide analog), a detectable label, and an inhibitor group. Upon incorporation of the nucleotide, the inhibitor prevents subsequent nucleotide incorporation into the same duplex. However, upon removal of the detectable label and the inhibitor group, the nucleotide analog does not substantially hinder subsequent nucleotide (or nucleotide analog) incorporation.
[0009] In one aspect, A method for sequencing a nucleic acid. The method includes the steps of: exposing a nucleic acid duplex comprising a template portion and a primer portion to a nucleotide analog comprising an inhibitor that is charged or capable of becoming charged, and a polymerase, under conditions that permit template-dependent incorporation of the analog into the primer; detecting incorporation of the analog; removing or neutralizing the inhibitor; and repeating the exposing, detecting, and removing steps at least once, thereby to determine the sequence of the template.
[0010] In another aspect, the invention relates to a nucleotide analog that includes: a nucleoside triphosphate; an inhibitor comprising (a) one or more multiply charged groups or groups capable of becoming multiply charged, or (b) two or more (i.e., a plurality of) singly charged groups or two or more groups capable of becoming singly charged; a detectable label; and a linker connecting the inhibitor and the label to the nucleoside triphosphate. It should be noted that in some embodiments, one or a single charged group may be sufficient to provide the desired inhibitory effect.
[0011] In yet another aspect, the invention relates to nucleotide analogs of the formula:
Figure imgf000004_0001
NTP is a nucleoside or nucleotide triphosphate or an analog thereof capable of template- dependent incorporation into the 3' end of a polynucleotide strand hybridized to a template. Inhibitor comprises a moiety that is charged or capable of becoming charged and that inhibits subsequent nucleotide incorporation once the first nucleotide is incorporated. Tether is a bond or a group linking the NTP to the Inhibitor group. In a preferred embodiment, the inhibitor is a non-steric inhibitor.
[0012] In yet another aspect, the invention relates to nucleotide analogs of Formula II:
Figure imgf000004_0002
NTP is a nucleoside or nucleotide triphosphate or an analog of either capable of template- dependent incorporation into the 3' end of a polynucleotide strand hybridized to a template presenting the complement of the NTP. L is a detectable label that facilitates the identification of the nucleotide analog. Inhibitor comprises (a) one or more multiply charged groups or groups capable of becoming multiply charged, or (b) two or more singly charged groups or two or more groups capable of becoming singly charged. Ri and R2 are independently a bond or a group, wherein at least one of Ri and R2 comprises a cleavable bond, which upon cleavage results in de- association of NTP from both Label and Inhibitor. R3 is a bond or group linking R2 to the Inhibitor. R4 is a bond or group linking R2 to a Label.
[0013] In yet another aspect, the invention relates to nucleotide analogs of nucleotide analog of the following Formula II:
Figure imgf000005_0002
wherein NTP is a nucleoside or nucleotide triphosphate or an analog of either capable of incorporating onto the 3' end of a polynucleotide strand hybridized to a template presenting the complement of the NTP; L is a detectable label that facilitates the identification of the nucleotide analog; Inhibitor comprises (a) one or more multiply charged groups or groups capable of becoming multiply charged, or (b) two or more singly charged groups or two or more groups capable of becoming singly charged; Ri comprises a cleavable bond, which upon cleavage results in de-association of NTP from both L and Inhibitor; R2 is a tri-valent radical having the formula:
Figure imgf000005_0001
wherein each of R2' and R2" is a bi-valent or tri-valent radical selected from:
-(CH2) X-, -(CH2-O) x-, -(CH2-O) Z-(CH2) y-, -(CH2) Z-(CH2-O) y-, and the same substituted with one or more groups selected from hydroxyl, halogen, amino, thiol, (Ci-C6) alkyl, wherein x, y and z are each integers with x and y+z are each from 2 to 10; R3 is a bond or group linking R2 to the Inhibitor moiety; and R4 is a bond or group linking R2 to a L.
[0014] In yet another aspect, the invention relates to a method for sequencing a nucleic acid. The method includes: (a) anchoring a nucleic acid duplex, or portion thereof, to a surface, the duplex comprising a template portion and a primer portion hybridized thereto; (b) exposing the duplex to nucleotide analog of Formula I or II (as defined herein) in the presence of a polymerase capable of catalyzing the addition of the nucleotide analog to the primer portion in a template- dependent manner; (c) removing unincorporated nucleotide analog and polymerase; (d) detecting incorporation of the nucleotide analog into the primer portion; and repeating the exposing, removing, and detecting steps at least once.
[0015] In yet another aspect, the invention relates to a method for sequencing a nucleic acid, the method comprising the steps of: (a) exposing a nucleic acid duplex comprising a template portion and a primer portion to a nucleotide analog of the following Formula II:
Figure imgf000006_0001
(b) detecting incorporation of the analog; (c) removing or neutralizing the inhibitor; and (d) repeating the exposing, detecting, and removing steps at least once, thereby to determine the sequence of the template, wherein NTP is a nucleoside or nucleotide triphosphate or an analog of either capable of incorporating onto the 3 ' end of a polynucleotide strand hybridized to a template presenting the complement of the NTP; L is a detectable label that facilitates the identification of the nucleotide analog; Inhibitor comprises (a) one or more multiply charged groups or groups capable of becoming multiply charged, or (b) two or more singly charged groups or two or more groups capable of becoming singly charged; Ri comprises a cleavable bond, which upon cleavage results in de-association of NTP from both L and Inhibitor; R2 is a tri-valent radical having the formula:
Figure imgf000006_0002
wherein each of R2' and R2" is a bi-valent or tri-valent radical selected from:
-(CH2) X-, -(CH2-O) x-, -(CH2-O) Z-(CH2) y-, -(CH2) Z-(CH2-O) y-, and the same substituted with one or more groups selected from hydroxyl, halogen, amino, thiol, (Ci-C6) alkyl, wherein x, y and z are each integers with x and y+z are each from 2 to 10; R3 is a bond or group linking R2 to the Inhibitor moiety; and R4 is a bond or group linking R2 to a L.
[0016] In yet another aspect, the invention provides methods and nucleotide analogs for selectively inhibiting the catalytic function of a polymerase enzyme. As such, nucleotide analogs comprise an inhibitory portion, such that the nucleotide analog is capable of being incorporated into a nucleic acid duplex but then inhibits subsequent nucleotide incorporation until the inhibitory portion is removed.
[0017] The inhibitory portion of an analog of the invention preferably is a charged group. The charged group can take any appropriate form as long as it carries a charge. Preferably, the charge group is selected from a phosphate, a carboxylic acid (or carboxylate), a sulfate, caproic acid (or a caproic acid derivative), a charged amino acid, -SO3, -SO2, and - NRWRV, where Rw and Rv independently is H, an alkyl or aryl group. The charged group can convey a negative or positive charge, but negative charged groups are preferred. In another preferred embodiment, the charge group contains multiple charged portions. For example, the charge group can be a dipeptide, a di-phosphate, disulfate, or other multiples of charged moieties. For example, amino acid inhibitors are preferably selected from aspartic acid, glutamic acid, arginine, lysine, and histidine.
[0018] The invention provides charged inhibitors of subsequent base incorporation in a sequencing-by-synthesis reaction. By subsequent base incorporation it is intended that a first nucleotide (or analog) is incorporated in a template-dependent manner, but second, third, etc. base incorporation is inhibited by the inhibitor group. In a preferred embodiment, inhibition occurs by positioning a charged group in proximity to the active site of a polymerase enzyme, thus disabling the ability of the polymerase to make subsequent incorporations. Without being limited to theory, analogs of the invention, interfere with magnesium present in the active site of the polymerase, resulting in a reduced ability of the active site to catalyze subsequent nucleotide incorporation.
[0019] In a preferred embodiment, an analog of the invention comprises a nucleoside triphosphate, an inhibitor comprising a plurality of charged groups, a detectable label, and a linker connecting the charged groups and the label to the nucleoside triphosphate. Preferred inhibitors comprise a plurality of charged groups and may be selected from any charged group capable of conferring a charge in a local area. Preferably, the inhibitor does not sterically inhibit a polymerase. Also in a preferred embodiment, the linker is cleavable. Multiple cleavable groups, such as enzymatically-cleavable group, such as disulfide bonds and the like.
Detailed Description of the Invention
[0020] The invention provides methods and compositions that facilitate the addition of a single nucleotide to a template/primer duplex per reaction cycle (i.e., the addition of nucleotides and polymerase enzyme under conditions that result in template-dependent nucleotide incorporation into the primer). Analogs of the invention comprise a charged inhibitory group that, upon incorporation of a nucleotide in a template-dependent manner, prevents subsequent nucleotide incorporation until the inhibitory group is removed. Thus, an analog of the invention comprises a nucleotide triphosphate, a linker (or tether), a detectable label, and a charged inhibitory group, wherein the label and the inhibitory group are removable.
[0021] In one aspect, the invention generally provides nucleotide analogs of the following Formula I:
Figure imgf000008_0001
wherein
NTP is a nucleoside triphosphate or an analog thereof capable of incorporating onto the 3' end of a polynucleotide strand hybridized to a template presenting the complement of the NTP;
Inhibitor comprises a group that is charged or capable of becoming charged, e.g., under reaction conditions, and that inhibits a subsequent incorporation of a nucleotide (or analog thereof), and
Tether is a bond or a group linking the NTP to the Inhibitor moiety. A group is considered capable of becoming charged if the group is capable of becoming electrically non-neutral, e.g., under reaction or buffer conditions. Examples of such groups include -COOH and -NRWRV, where Rw and Rv independently is H, an alkyl or aryl group. [0022] In one embodiment, the inhibitor group can cause inhibition of subsequent nucleotide incorporation without steric hinderance. In other words, the inhibition is caused by chemical or charge interaction with the enzyme and not be a physical blocking of the enzyme. In another embodiment, the charged inhibitor also provides steric inhibition of enzyme activity. However, in either case, the inhibitor group is charged.
[0023] Natural NTPs include nucleoside triphosphates, adenosine triphosphate (ATP), guanosine triphosphate (GTP), cytidine triphosphate (CTP), thymidine triphosphate (TTP) and uridine triphosphate (UTP); and nucleotide triphosphates, deoxyadenosine triphosphate (dATP), deoxyguanosine triphosphate (dGTP), deoxycytidine triphosphate (dCTP), deoxythimidine triphosphate (dTTP) and deoxyuridine triphosphate (dUTP). NTPs useful in this invention include non-nature nucleosides and nucleotides, and analogs and derivatives thereof.
[0024] In some embodiments, the inhibitor may include a moiety that is negatively charged or capable of becoming a negatively charged. In other embodiments, the inhibitor group is positively charged or capable of becoming positively charged.
[0025] In some other embodiments, the inhibitor is an amino acid or an amino acid analog. The Inhibitor may be a peptide of 2 to 20 units of amino acids or analogs, a peptide of 2 to 10 units of amino acids or analogs, a peptide of 3 to 7 units of amino acids or analogs, a peptide of 3 to 5 units of amino acids or analogs. In some embodiments, the Inhibitor includes a group selected from the group consisting of GIu, Asp, Arg, His, and Lys, and a combination thereof (e.g., Arg, Arg- Arg, Asp, Asp- Asp, GIu, Glu-Glu, Asp-Glu-Asp, Asp-Asp-Glu or Asp Asp Asp Asp). Peptides or groups may be combinations of the same or different amino acids or analogs.
[0026] In one embodiment, the invention relates to an oligonucleotide with at least one nucleotide analog of the invention incorporated therein.
[0027] In some embodiments, the Tether comprises
Figure imgf000009_0001
wherein L is detectable label that facilitates the identification of the nucleotide analog after incorporation onto a template;
Ri and R2 are independently a bond or a group, wherein at least one of Ri and R2 comprises a cleavable bond, which upon cleavage results in de-association of NTP from both L and Inhibitor;
R3 is a bond or group linking R2 to the Inhibitor moiety; and R4 is a bond or group linking R2 to a L.
[0028] In another aspect, the present invention is directed to nucleotide analogs of Formula II:
Figure imgf000010_0001
wherein
NTP is a nucleoside triphosphate or an analog thereof capable of incorporating onto the 3' end of a polynucleotide strand hybridized to a template presenting the complement of the NTP;
L is a detectable label to facilitate the identification of the nucleotide analog after incorporation onto the template;
Inhibitor is a moiety that substantially inhibits a subsequent incorporation of a nucleotide (or analog thereof). In some embodiments, the Inhibitor moiety includes a nucleotide or nucleoside or analogs thereof, in other embodiments, the inhibitor is not a nucleotide or analog thereof;
Ri and R2 are independently a bond or a group, wherein at least one of Ri and R2 comprises a cleavable bond, which upon cleavage results in de-association of NTP from both Label and Inhibitor;
R3 is a bond or group linking R2 to the Inhibitor moiety; and R4 is a bond or group linking R2 to L.
[0029] In some embodiments, NTP is a compound having the following formula:
Figure imgf000011_0001
wherein B is selected from the group consisting of purine or pyrimidine bases, as well as derivatives of purine and pyrimidine bases; R' is independently selected from the group consisting of-OH, -0-P(O)(OH)2, -O-C(O)-RX, -NHRy, and an -O-blocking agent, where Rx and Ry are alkyl groups; R" is independently selected from the group consisting of H and -OH.
[0030] Non-limiting examples of representative purine and pyrimidine bases include adenine, cytosine, guanine, thymine, uracil, or hypoxanthine. Non-limiting examples of derivatives of purine and pyrimidine bases include naturally-occurring and synthetic derivatives of a base, including pyrazolo[3,4-d]pyrimidines, 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine, xanthine, hypoxanthine, 2-aminoadenine, 6-methyl and other alkyl derivatives of adenine and guanine, 2-propyl and other alkyl derivatives of adenine and guanine, 2-thiouracil, 2-thiothymine and 2-thiocytosine, 5-propynyl uracil and cytosine, 6-azo uracil, cytosine and thymine, 5-uracil (pseudouracil), 4-thiouracil, 8-halo (e.g., 8-bromo), 8-amino, 8-thiol, 8- thioalkyl, 8-hydroxyl and other 8-substituted adenines and guanines, 5-halo particularly 5- bromo, 5-trifluoromethyl and other 5-substituted uracils and cytosines, 7-methylguanine and 7- methyladenine, 8-azaguanine and 8-azaadenine, deazaguanine, 7-deazaguanine, 3-deazaguanine, deazaadenine, 7-deazaadenine, 3-deazaadenine, pyrazolo[3,4-d]pyrimidine, imidazo[l,5-a] 1,3,5 triazinones, 9-deazapurines, imidazo[4,5-d]pyrazines, thiazo Io [4,5-d]pyrimi dines, pyrazin-2- ones, 1 ,2,4-triazine, pyridazine; and 1,3,5 triazine.
[0031] Base B1 of the invention permits a nucleotide to be incorporated into a polynucleotide chain by a polymerase and forms base pairs with a base on an antiparallel nucleic acid strand. The term base pair encompasses not only the standard AT, AU or GC base pairs, but also base pairs formed between nucleotides and/or nucleotide analogs comprising non-standard or modified bases, wherein the arrangement of hydrogen bond donors and hydrogen bond acceptors permits hydrogen bonding between a nonstandard base and a standard base or between two complementary non-standard base structures. One example of such non-standard base pairing is the base pairing between the nucleotide analog inosine and adenine, cytosine or uracil, where two hydrogen bonds are formed.
[0032] The Inhibitor may include a charged moiety (e.g., a negatively charged moiety, a positively charged moiety, or both) or a moiety that is capable of becoming charged. The Inhibitor can include two or more charged groups. The Inhibitor may have a charged group selected from the group consisting of -COOH, -PO4, -SO4, -SO3, -SO2, -NRWRV, where Rw and Rv independently is H, an alkyl or aryl group. In other embodiments, the Inhibitor moiety does not comprise a -PO4 group. In some other embodiments, the Inhibitor moiety does not comprise an aryl group. In certain other embodiments, the Inhibitor does not include a nucleotide or nucleoside or analogs thereof.
[0033] Inhibitor may be a compound having the following formula:
Figure imgf000012_0001
wherein each Ai and each A2 is independently an amino acid moiety; Rs and R9 independently is a H or an alkyl group; each of x and y is an integer from 0 to about 10. In some embodiments, R8 and R9 are H atoms and x = 1 and y = 2.
[0034] R3 of a nucleotide analog of Formula II may include a group having the formula of
Figure imgf000012_0002
wherein R5 is a H or an alkyl group; p is an integer from 0 to about 10. In some embodiments, p is 5 or 6. [0035] In some embodiments, R3 of a nucleotide analog of Formula II may include a group having the formula of
Figure imgf000013_0001
wherein k is an integer from about 1 to about 5. In some embodiments, k is an integer from about 2 to about 4. In some embodiments, k is 3.
[0036] In some embodiments, R3 of a nucleotide analog of Formula II may include a group having the formula of
Figure imgf000013_0002
wherein R , R are independently H or alkyl groups, and may together form one or more 3, 4, 5, or 6-member rings, and j is an integer from about 1 to about 5. In some embodiments, R3 of include a group having the formula of
Figure imgf000013_0003
[0037] In some embodiments, R3 of a nucleotide analog of Formula II may include a group having the formula of
Figure imgf000014_0001
wherein R1, R2, R3, and R4 are independently H or alkyl groups, and two or more of which may together form one or more 3, 4, 5, or 6-member rings, and j is an integer from about 1 to about 3. In some embodiments, R3 of include a group having the formula of
Figure imgf000014_0002
[0038] Ri of a nucleotide analog of Formula II may include a C-C triple bond, a S-S bond, or both a C-C triple bond and a S-S bond.
[0039] In some embodiments, Ri in the nucleotide analog of Formula II includes a group having the formula of
Figure imgf000014_0003
wherein R6 is a H or an alkyl group; q and r independently is an integer from about 1 to about 10. [0040] In some embodiments, q is 1 or 2 and r is 1, 2 or 3. [0041] In some embodiments, R2 is a tri-valent radical having the formula:
Figure imgf000015_0001
wherein each Of R2' and R2" is a bi-valent or tri-valent radical selected from:
-(CH2) X-, -(CH2-O) x-, -(CH2-O) Z-(CH2) y-, -(CH2) Z-(CH2-O) y-, and the same substituted with one or more groups selected from hydroxyl, halogen, amino, thiol, (Ci-C6) alkyl, wherein x, y and z are each integers with x and y+z are each from 2 to 10.
[0042] In some detailed embodiments, R2" is -(CH2) x- or -(CH2-O) x-, where x is 2, 3, 4, 5, or 6. In some other detailed embodiments, R2" is -(CH2-O) Z-(CH2) y- or -(CH2) Z-(CH2-O)y-, , where y+z is 2, 3, 4, 5, or 6. Advantages of these analogs include increased stability and enhanced level of inhibition, allowing more optimal spacing of the inhibitor moiety within/on the polymerase to increase effective inhibition. Exemplary compounds include:
Table 1
Figure imgf000016_0001
[0043] As Atto647N is typically used in the form of a carboxylic acid or ester, Atto647N (and other Atto dyes) is reacted with an amine group to for the above molecule - hence to amide moiety between the Atto647N and the back bone of the molecule. It is noted here that Atto dyes may be couple to the rest of the molecule by other linkages than an amide linkage, although amide is often preferred for convenience of preparation. In this regard, the analogs of the invention may also be represented as follows, for example.
Figure imgf000017_0001
Figure imgf000017_0002
[0044] In some embodiments of the invention, the location of the charged moiety within the inhibitor group and/or the distance of the charged group to the NTP plays an important role in the effectiveness of inhibiting a subsequent nucleotide incorporation. In some embodiments, the charged moiety of the inhibitor is from about 5 to about 60 bonds away from the NTP. In some other embodiments, the charged moiety of the inhibitor is from about 10 to about 40 bonds away from the NTP. In some other embodiments, the charged moiety of the inhibitor is from about 10 to about 35 bonds away from the NTP. In some other embodiments, the charged moiety of the inhibitor is from about 10 to about 30 bonds away from the NTP. In some other embodiments, the charged moiety of the inhibitor is from about 10 to about 20 bonds away from the NTP.
[0045] For example, the above compound (about 17X fold inhibition) exhibits an inhibiting effect that is much less than the following compound (about 7OX fold inhibition).
Figure imgf000019_0001
[0046] The label (or "L") may be any moiety that can be attached to or associated with, e.g., directly or via a linker or spacer, an oligonucleotide and that functions to provide a detectable signal, and/or to interact with a second label to modify the detectable signal provided by the first or second label, e.g. fluorescence resonance energy transfer (FRET). In one embodiment, the label is an optically-detectable moiety (e.g., a fluorophore). Non-limiting examples of types of optically-detectable labels include a fluorescent, chemiluminescence, or electrochemically luminescent label. Examples of fluorescent labels include, but are not limited to, 4-acetamido-4'-isothiocyanatostilbene-2,2'disulfonic acid; acridine and derivatives thereof such as acridine, acridine isothiocyanate; 5-(2'-aminoethyl)aminonaphthalene-l -sulfonic acid (EDANS); 4-amino-N-[3-vinylsulfonyl)phenyl]naphthalimide-3,5disulfonate; N-(4-anilino-l- naphthyl)maleimide; anthranilamide; BODIPY; Brilliant Yellow; coumarin and derivatives; coumarin, 7-amino-4-methylcoumarin (AMC, Coumarin 120), 7-amino-4- trifluoromethylcouluarin (Coumaran 15 1); cyanine dyes; cyanosine; 4',6-diaminidino-2- phenylindole (DAPI); 5',5"-dibromopyrogallol-sulfonaphthalein (Bromopyrogallol Red); 7- diethylamino-3-(4'-isothiocyanatophenyl)-4-methylcoumarin; diethylenetriamine pentaacetate; 4,4'-diisothiocyanatodihydro-stilbene-2,2'-disulfonic acid; 4,4'-diisothiocyanatostilbene-2,2'- disulfonic acid; 5-[dimethylaminolnaphthalene-l-sulfonyl chloride (DNS, dansylchloride); 4- dimethylarninophenylazophenyl-4'-isothiocyanate (DABITC); eosin and derivatives; eosin, eosin isothiocyanate, erythrosin and derivatives; erythrosin B, erythrosin, isothiocyanate; ethidium; fluorescein and derivatives; 5 -carboxy fluorescein (FAM), 5-(4,6-dichlorotriazin-2- yl)aminofluorescein (DTAF), 2',7'-dimethoxy-4'5'-dichloro-6-carboxyfluorescein, fluorescein, fluorescein isothiocyanate, QFITC, (XRITC); fluorescamine; IR144; IR1446; Malachite Green isothiocyanate; 4-methylumbelliferoneortho cresolphthalein; nitrotyrosine; pararosaniline; Phenol Red; B-phycoerythrin; o-phthaldialdehyde; pyrene and derivatives: pyrene, pyrene butyrate, succinimidyl 1 -pyrene; butyrate quantum dots; Reactive Red 4 (Cibacron™ Brilliant Red 3B-A) rhodamine and derivatives: 6-carboxy-X-rhodamine (ROX), 6-carboxyrhodamine (R6G), lissamine rhodamine B sulfonyl chloride rhodarnine (Rhod), rhodamine B, rhodamine 123, rhodamine X isothiocyanate, sulforhodamine B, sulforhodamine 101, sulfonyl chloride derivatives of sulforhodamine 101 (Texas Red); N,N,N',N'-tetramethyl-6-carboxyrhodamine (TAMRA); tetramethyl rhodamine; tetramethyl rhodamine isothiocyanate (TRITC); riboflavin; rosolic acid; terbium chelate derivatives; Cy3; Cy5; Cy5.5; Cy7; IRD 700; IRD 800; La Jolta Blue; phthalocyanine; naphthalocyanine; any of the fluorescent labels available from Atto-Tec, such as Atto 390, Atto 425, Atto 465, Atto 488, Atto 495, Atto 520, Atto 532, Atto 550, Atto 565, Atto 590, Atto 594, Atto 610, Atto 61 IX, Atto 620, Atto 633, Atto 635, Atto 637, Atto 647, Atto 647N, Atto 655, Atto 680, Atto 700, Atto 725, Atto 740, etc.; any of the fluorescent labels available from Dyomics such as DY-630, DY-631, DY-632, DY-633, DY-634, DY-635, DY- 636, Dy-647, Dy-648, DY-649, Dy-650, Dy-651, DY-652, etc.; any of the fluorescent labels available from Pierce such as DyLight 405, DyLight 488, DyLight 549, DyLight 633, DyLight 649, DyLight 680, DyLight 800, etc.; any of the fluorescent labels available from AnaSpec such as HiLyte FluoriM 4. S. S dyes, HiLytc F!uor1M 555 Jycs. HiLyte Fluor™ 647 dyes, HiLytc HuorIM 680 dyes. HiLytc Fluor1* 750 dyes, HiLytePlus™ 555 dyes, HiLytePlus fM (47 dyes, HiLytcPlu-,3 M 750 dyes, etc ; any of the fluorescent labels available from Denovo Biolables such as Oyster 500, Oyster 550 P, Oyster 550 D, Oyster 556, Oyster 645, Oyster 650 P, Oyster 650 D, Oyster 656, etc.; IRDye® 680, IRDye® 700, IRDye® 700DX, IRDye® 800, IRDye® 800 RS, IRDye® 800 CW, etc.; any of the fluorescent labels available from SETA Biomedicals such as Seta Kl -204, Seta K5-3212, Seta K8- 1342, Seta K8- 1352, Seta K8- 1357, Seta K8- 1407, Seta K8-1642, Seta K8- 1644, Seta K8- 1663, Seta K8- 1664, Seta K8- 1669, Seta K8-3002, Seta K4- 1082, SetaK8-1669, SetaK7-545, SetaK7-547, SetaK7-549, SetaK8-1252, SetaK8-1261, Seta K8-1262, Seta K8- 1320, Seta K8- 1344, Seta K8- 1367, Seta K8- 1377, Seta K8- 1382, SetaK8- 1446, SetaK8-1667, SetaK8-1752, SetaK8-1762, SetaK8-1767, SetaK8-1777, SetaK8-1782, etc.; Q Dots; and dyes having the following structures:
Figure imgf000021_0001
Figure imgf000022_0001
wherein each Rx is independently selected from the group consisting of H, alkyl, and substituted alkyl.
[0047] The above exemplary label moieties include any derivatives containing the chromophore of any of the labeling moieties exemplified or described herein, attached to the nucleotide analog by means of any suitable chemical linking group. For example, the chromophore can be attached to the nucleotide analog via an alkyl chain bonded to the nucleotide analog by a functional group such as an amide, ester, ether, amine, thiol, disulfide, urea, urethane, carbonate, etc. In one embodiment, the label is a fluorescent label such as cyanine-3 and cyanine-5.
[0048] Labels other than fluorescent labels are contemplated as part of the invention, including other optically-detectable labels. Any appropriate detectable label can be used according to the invention, and numerous other labels are known to those skilled in the art.
[0049] The invention also relates to methods for nucleic acid sequence determination using the nucleotide analogs described herein. The nucleotide analogs of the invention are particularly suitable for use in single molecule sequencing techniques. Such techniques are described for example in U.S. Patent Application Serial Nos. 10/831,214 filed April 2004; 10/852,028 filed May 24,2004; 10/866,388 filed June 10,2005; 10/099,459 filed March 12,2002; and U.S. Published Application 2003/013880 published July 24, 2003, each of which is herein incorporated in its entirety for all purposes. In general, methods for nucleic acid sequence determination include exposing a target nucleic acid (also referred to herein as template nucleic acid or template) to a primer that is complementary to at least a portion of the target nucleic acid, under conditions suitable for hybridizing the primer to the target nucleic acid, forming a template/primer duplex.
[0050] The invention also relates to methods for nucleic acid sequence determination using the nucleotide analogs described herein. The nucleotide analogs of the invention are particularly suitable for use in single molecule sequencing techniques. Such techniques are described for example in U.S. Patent Application Serial Nos. 10/831,214 filed April 2004; 10/852,028 filed May 24,2004; 10/866,388 filed June 10,2005; 10/099,459 filed March 12,2002; and U.S. Published Application 2003/013880 published July 24, 2003, each of which is herein incorporated in its entirety for all purposes. In general, methods for nucleic acid sequence determination include exposing a target nucleic acid (also referred to herein as template nucleic acid or template) to a primer that is complementary to at least a portion of the target nucleic acid, under conditions suitable for hybridizing the primer to the target nucleic acid, forming a template/primer duplex.
[0051] In another aspect, the invention relates to a method for sequencing a nucleic acid. The method includes: (a) anchoring a nucleic acid duplex to a surface, the duplex comprising a template portion and a primer portion hybridized thereto; (b) exposing the duplex to nucleotide analog of Formula I or Formula II in the presence of a polymerase capable of catalyzing the addition of the nucleotide analog to the primer portion in a template- dependent manner; (c) removing unincorporated nucleotide analog and polymerase; (d) detecting incorporation of the nucleotide analog into the primer portion; and (e) repeating said exposing, removing, and detecting steps at least once. The method may further include cleaving L from the nucleotide analog after the detecting step.
[0052] In another aspect, the invention relates to a method for inhibiting the catalytic function of a polymerase enzyme in a sequencing-by-synthesis reaction comprising introducing a nucleotide attached to an inhibitory group. In one aspect, the invention comprises attaching one or both members of a template/primer duplex to a surface, introducing a polymerase and a nucleotide analog comprising a charged inhibitor under conditions sufficient for template- dependent incorporation of the nucleotide and inhibition of subsequent incorporation. Such methods further comprise removing or neutralizing the inhibitor in order to facilitate further nucleotide incorporation. Finally, nucleotides of the invention can be detectably labeled to monitor incorporation.
[0053] Target nucleic acids include deoxyribonucleic acid (DNA) and/or ribonucleic acid (RNA). Target nucleic acid molecules can be obtained from any cellular material obtained from an animal, plant, bacterium, virus, fungus, or any other cellular organism, or may be synthetic DNA. Target nucleic acids may be obtained directly from an organism or from a biological sample obtained from an organism, e.g., from blood, urine, cerebrospinal fluid, seminal fluid, saliva, sputum, stool and tissue. Any tissue or body fluid specimen may be used as a source for nucleic acid for use in the invention. Nucleic acid molecules may also be isolated from cultured cells, such as a primary cell culture or a cell line. The cells from which target nucleic acids are obtained can be infected with a virus or other intracellular pathogen. Nucleic acid molecules may also include those of animal (including human), wild type or engineered prokaryotic or eukaryotic cells, viruses or completely or partially synthetic RNAs or DNAs. A sample can also be total RNA extracted from a biological specimen, a cDNA library, or genomic DNA.
[0054] Nucleic acid typically is fragmented to produce suitable fragments for analysis. In one embodiment, nucleic acid from a biological sample is fragmented by sonication. Test samples can be obtained as described in U.S. Patent Application 2002/0190663 Al, published October 9, 2003, herein incorporated by reference in its entirety for all purposes. Generally, nucleic acid can be extracted from a biological sample by a variety of techniques such as those described by Maniatis, et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, N.Y., pp. 280-281 (1982). Generally, target nucleic acid molecules can be from about 5 bases to about 20 kb, about 30 kb, or even about 40 kb or more. Nucleic acid molecules may be single- stranded, double-stranded, or double-stranded with single-stranded regions (for example, stem- and loop-structures)
[0055] Single molecule sequencing includes a template nucleic acid molecule/primer duplex that is immobilized on a surface such that the duplex and/or the nucleotides (or nucleotide analogs) added to the immobilized primer are individually optically resolvable. The primer, template and/or nucleotide analogs are detectably labeled such that the position of an individual duplex molecule is individually optically resolvable. Either the primer or the template is immobilized to a solid support. The primer and template can be hybridized to each other and optionally covalently cross-linked prior to or after attachment of either the template or the primer to the solid support.
[0056] In general, methods for facilitating the incorporation of a nucleotide analog as an extension of a primer include exposing a target nucleic acid/primer duplex to one or more nucleotide analogs disclosed herein and a polymerase under conditions suitable to extend the primer in a template dependent manner. Generally, the primer is sufficiently complementary to at least a portion of the target nucleic acid to hybridize to the target nucleic acid and allow template-dependent nucleotide polymerization. The primer extension process can be repeated to identify additional nucleotide analogs in the template. The sequence of the template is determined by compiling the detected nucleotides, thereby determining the complementary sequence of the target nucleic acid molecule.
[0057] Any polymerase and/or polymerizing enzyme may be employed. A preferred polymerase is Klenow with reduced exonuclease activity. Nucleic acid polymerases generally useful in the invention include DNA polymerases, RNA polymerases, reverse transcriptases, and mutant or altered forms of any of the foregoing. DNA polymerases and their properties are described in detail in, among other places, DNA Replication 2nd edition, Komberg and Baker, W. H. Freeman, New York, N.Y. (1991). Known conventional DNA polymerases useful in the invention include, but are not limited to, Pyrococcus furiosus (Pfu) DNA polymerase (Lundberg et al., 1991, Gene, 108: 1, Stratagene), Pyrococcus woesei (Pwo) DNA polymerase (Hinnisdaels et al., 1996, Biotechniques, 20: 186-8, Boehringer Mannheim), Thermus thermophilus (Tth) DNA polymerase (Myers and Gelfand 1991, Biochemistry 30:7661), Bacillus stearothermophilus DNA polymerase (Stenesh and McGowan, 1977, Biochim Biophys Acta 475:32), Thermococcus litoralis (TIi) DNA polymerase (also referred to as Vent™ DNA polymerase, Cariello et al., 1991, Polynucleotides Res, 19: 4 193, New England Biolabs), 9"Nm DNA polymerase (New England Biolabs), Stoffel fragment, Thermosequenase (Amersham Pharmacia Biotech UK), Therminator (New England Biolabs), Thermotoga maritima (Tma) DNA polymerase (Diaz and Sabino, 1998 Braz J Med. Res, 3 1 : 1239), Thermus aquaticus (T aq) DNA polymerase (Chien et al., 1976, J. Bacteoriol, 127: 1550), DNA polymerase, Pyrococcus kodakaraensis KOD DNA polymerase (Takagi et al., 1997, Appl. Environ. Microbiol. 63:4504), JDF-3 DNA polymerase (from thermococcus sp. JDF-3, Patent application WO 0132887), Pyrococcus GB-D (PGB-D) DNA polymerase (also referred as Deep VentTMD NA polymerase, Juncosa-Ginesta et al., 1994, Biotechniques, 16:820, New England Biolabs), UITma DNA polymerase (from thermophile Thermotoga maritima; Diaz and Sabino, 1998 Braz J. Med. Res, 3 1 : 1239; PE Applied Biosystems), Tgo DNA polymerase (from thermococcus gorgonarius, Roche Molecular Biochemicals), E. coli DNA polymerase I (Lecomte and Doubleday, 1983, Polynucleotides Res. 11 :7505), T7 DNA polymerase (Nordstrom et al., 198 1, J Biol. Chem. 256:3 1 12), and archaeal DP1I/DP2 DNA polymerase II (Cann et al., 1998, Proc Natl Acad. Sci. USA 95: 14250-5).
[0058] Other DNA polymerases include, but are not limited to, ThermoSequenase®, 9°Nm™, Therminator™, Taq, Tne, Tma, Pfu, TfI, Tth, TIi, Stoffel fragment, Vent™ and Deep Vent™ DNA polymerase, KOD DNA polymerase, Tgo, JDF-3, and mutants, variants and derivatives thereof. Reverse transcriptases useful in the invention include, but are not limited to, reverse transcriptases from HIV, HTLV-I, HTLV-11, FeLV, FIV, SIV, AMV, MMTV, MoMuLV and other retroviruses (see Levin, Cell 88:5-8 (1997); Verma, Biochim Biophys Acta. 473:1-38 (1977); Wu et al., CRC Crit Rev Biochem. 3:289-347(1975)).
[0059] Unincorporated nucleotide analog molecules may be removed prior to or after detecting. Unincorporated nucleotide analog molecules may be removed by washing.
[0060] A template/primer duplex is treated to remove the label and/or to cleave the molecular chain attaching the label to the nucleotide. One may repeat the steps of exposing template/primer duplex to one or more nucleotide analogs and polymerase, detecting incorporated nucleotides, and then treating to (1) remove the label, (2) remove the label and at least a portion of the molecular chain associating the label to the nucleotide or (3) cleave the molecular chain thereby identifying additional bases in the template nucleic acid, The identified bases can be compiled to determine the sequence of the target nucleic acid. In some embodiments, at least some portions of the remaining molecular chain and/or label are not removed, for example, in the last round of primer extension. [0061] In some embodiments, a nucleotide analog, after removal of the label and portions of the molecular chain connecting the label to the nucleotide can be represented by:
Figure imgf000027_0001
wherein B1, R', R", are as described herein, here R is a N-containing group such as a primary amino group, a secondary amino group, a tertiary amino group, an amide group,
η
Figure imgf000027_0002
; and z is an integer from about 1 to about 12. R is a phosphodiester linkage connecting the nucleotide analog to a sugar of an adjacent nucleotide in the nucleic acid, or a phosphoryl group. In some embodiments, z is an integer from about 1 to about 5. In some other embodiments, z is an integer from about 1 to about 3.
[0062] The invention also provides for a method of removing a label from a labeled base, comprising(a) exposing a base of Formula I or Formula II:
Figure imgf000027_0003
as described herein, to a reducing agent for a time sufficient to produce an unlabelled base of Formula III:
Figure imgf000028_0001
where B1 is a part of the NTP of a nucleotide analog in Formula I or Formula II, and n is an integer from about 1 to about 12. In some embodiments, the reducing agent is tris (2-carboxyl ethyl) phosphine. In other embodiments, the base is linked to a sugar selected from the group consisting of ribose, deoxyribose, and analogs thereof, where the base and sugar together may be present in a nucleotide in a nucleic acid.
[0063] One embodiment of a method for sequencing a nucleic acid template includes exposing a nucleic acid template to a primer capable of hybridizing to the template , a polymerase capable of catalyzing nucleotide addition to the primer, and a labeled nucleotide analog disclosed herein under conditions to permit the polymerase to add the nucleotide analog to the primer. A method for sequencing may further include identifying or detecting the incorporated labeled nucleotide. A cleavable bond may then be cleaved, removing at least the label from the nucleotide analog. The exposing, detecting, and removing steps are repeated at least once. In certain embodiments, the exposing, detecting, and removing steps are repeated at least three, five, ten or even more times. The sequence of the template can be determined based upon the order of incorporation of the labeled nucleotides.
[0064] In another embodiment, a method for sequencing a nucleic acid template includes exposing a nucleic acid template to a primer capable of hybridizing to the template and a polymerase capable of catalyzing nucleotide addition to the primer. The polymerase is, for example, Klenow with reduced exonuclease activity. The polymerase adds a labeled nucleotide analog disclosed herein. The method may include identifying the incorporated labeled nucleotide. Once the labeled nucleotide is identified, the label and at least a portion of a molecular chain connecting the label to the nucleotide analog are removed and the remaining portion of the molecular chain includes a free hydroxyl group. The exposing, incorporating, identifying, and removing steps are repeated at least once, preferably multiple times depending on the application. The sequence of the template is determined based upon the order of incorporation of the labeled nucleotides. [0065] Removal of a label from a labeled nucleotide analog and/or cleavage of the molecular chain linking a nucleotide analog to a label may include contacting or exposing the labeled nucleotide with a reducing agent. Such reducing agents include, for example, dithiothreitol (DTT), tris(2-carboxyethyl)phosphine (TCEP), tris(3-hydroxy -propyl) phosphine, tris(2-chloropropyl) phosphate (TCPP), 2-mercaptoethanol, 2-mercaptoethylarnine, cystein and ethylmaleimide. Such contacting or exposing the reducing agent to a labeled nucleotide analog may occur at a range of pH values, for example at a pH of about 5 to about 10, or about 7 to about 9.
[0066] The above-described methods for sequencing a nucleic acid template can further include a step of capping a molecular chain, for example, after the label has been removed. After addition of the nucleotide analog to the primer, any optional 3' phosphate moiety can be removed enzymatically. In one embodiment, an optional phosphate can be removed using alkaline phosphatase or T4 polynucleotide kinase. Suitable enzymes for removing optional phosphate include, any phosphatase, for example, alkaline phosphatase such as shrimp alkaline phosphatase, bacterial alkaline phosphatase, or calf intestinal alkaline phosphatase.
[0067] Any suitable detection method may be used to identify an incorporated nucleotide analog. Thus, exemplary detection methods include radioactive detection, optical absorbance detection, e.g., UV-visible absorbance detection, optical emission detection, e.g., fluorescence or chemiluminescence. Single-molecule fluorescence can be carried out using a conventional microscope equipped with total internal reflection (TIR) objective. The detectable moiety associated with the extended primers can be detected on a substrate by scanning all or portions of each substrate simultaneously or serially, depending on the scanning method used. For fluorescence labeling, selected regions on a substrate may be serially scanned one-by-one or row-by-row using a fluorescence microscope apparatus, such as described in Fodor (U.S. Patent No. 5,445,934) and Mathies et al. (U.S. Patent No. 5,09 1,652). Devices capable of sensing fluorescence from a single molecule include scanning tunneling microscope (STM) and the atomic force microscope (AFM). Hybridization patterns may also be scanned using a CCD camera (e.g., Model TE/CCD512SF, Princeton Instruments, Trenton, NJ.) with suitable optics (Ploem, CCD (Chase-Completed-Device) in Fluorescent and Luminescent Probes for Biological Activity Mason, T.G. Ed., Academic Press, Landon, pp. 1-11 (1993), such as described in Yershov et al., Proc. Natl. Aca. Sci. 93:4913 (1996), or may be imaged by TV monitoring. For radioactive signals, a phosphorimager device can be used (Johnston et al., Electrophoresis, 13566, 1990; Drmanac et al., Electrophoresis, 13:566, 1992; 1993). Other commercial suppliers of imaging instruments include General Scanning Inc., (Watertown, Mass. on the World Wide Web at genscan.com), Genix Technologies (Waterloo, Ontario, Canada; on the World Wide Web at confocal.com), and Applied Precision Inc. Such detection methods are particularly useful to achieve simultaneous scanning of multiple attached target nucleic acids.
[0068] The present invention provides for detection of molecules ranging from a single nucleotide to a single target nucleic acid molecule. A number of methods are available for this purpose. Methods for visualizing single molecules within nucleic acids labeled with an intercalating dye include, for example, fluorescence microscopy. For example, the fluorescent spectrum and lifetime of a single molecule excited-state can be measured. Standard detectors such as a photomultiplier tube or avalanche photodiode can be used. Full field imaging with a two-stage image intensified CCD camera also can be used. Additionally, low noise cooled CCD can also be used to detect single fluorescent molecules.
[0069] The detection system for the signal may depend upon the labeling moiety used. For optical signals, a combination of an optical fiber or charge coupled device (CCD) can be used in the detection step. In those circumstances where the substrate is itself transparent to the radiation used, it is possible to have an incident light beam pass through the substrate with the detector located opposite the substrate from the target nucleic acid. For electromagnetic labeling moieties, various forms of spectroscopy systems can be used. Various physical orientations for the detection system are available and discussion of design parameters is provided in the art.
[0070] A number of approaches can be used to detect incorporation of fluorescently labeled nucleotides into a single nucleic acid molecule. Optical setups include near-field scanning microscopy, far-field confocal microscopy, wide-field epi-illumination, but are not limited to, light scattering, dark field microscopy, photoconversion, single and/or multiphoton excitation, spectral wavelength discrimination, fluorophore identification, evanescent wave illumination, and total internal reflection fluorescence (TIRF) microscopy. In general, certain methods involve detection of laser-activated fluorescence using a microscope equipped with a camera. Suitable photon detection systems include, but are not limited to, photodiodes and intensified CCD cameras. For example, an intensified charge couple device (ICCD) camera can be used. The use of an ICCD camera to image individual fluorescent dye molecules in a fluid near a surface provides numerous advantages. For example, with an ICCD optical setup, it is possible to acquire a sequence of images (movies) of fluorophores.
[0071] Some embodiments of the present invention use TIRF microscopy for two- dimensional imaging. TIRF microscopy uses totally internally reflected excitation light and is well known in the art. See, e g., the World Wide Web at nikoninstrurnents.jp/eng/page/products/tirf.aspx. In certain embodiments, detection is carried out using evanescent wave illumination and total internal reflection fluorescence microscopy. An evanescent light field can be set up at the surface, for example, to image fluorescently-labeled nucleic acid molecules. When a laser beam is totally reflected at the interface between a liquid and a solid substrate (e.g., a glass), the excitation light beam penetrates only a short distance into the liquid. The optical field does not end abruptly at the reflective interface, but its intensity falls off exponentially with distance. This surface electromagnetic field, called the "evanescent wave", can selectively excite fluorescent molecules in the liquid near the interface. The thin evanescent optical field at the interface provides low background and facilitates the detection of single molecules with high signal-to-noise ratio at visible wavelengths.
[0072] The evanescent field also can image fluorescently-labeled nucleotides upon their incorporation into the attached target nucleic acid target molecule/primer complex in the presence of a polymerase. Total internal reflectance fluorescence microscopy is then used to visualize the attached target nucleic acid target molecule/primer complex and/or the incorporated nucleotides with single molecule resolution.
[0073] Fluorescence resonance energy transfer (FRET) can be used as a detection scheme. FRET in the context of sequencing is described generally in Braslavasky, et al., Proc. Nat'l Acad. ScL, 100: 3960-3964 (2003), incorporated by reference herein. In an embodiment, a donor fluorophore is attached to the primer, polymerase, or template. Nucleotides added for incorporation into the primer comprise an acceptor fluorophore that is activated by the donor when the two are in proximity.
[0074] Measured signals can be analyzed manually or preferably by appropriate computer methods to tabulate results. Preferably, the signals of millions of analogs are read in parallel and then deconvoluted to ascertain a sequence. The substrates and reaction conditions can include appropriate controls for verifying the integrity of hybridization and extension conditions, and for providing standard curves for quantification, if desired. For example, a control nucleic acid can be added to the sample. The absence of the expected extension product is an indication that there is a defect with the sample or assay components requiring correction.
[0075] As another example, the described nucleotide analogs can be used to facilitate "four color" sequencing by synthesis if each base (A, C, G, T) is labeled with a dye emitting and/or absorbing at a different and resolvable wavelength. The sequencing procedure can be shortened from four separate addition cycles (i.e., one for each base) to the following: add A, C, G, T (each differently labeled) with polymerase and an appropriate reaction buffer, rinse, image the four resolvable dyes and record which base (if any) was incorporated, cleave and cap the nucleotides, and repeat. The described nucleotide analogs facilitate this kind of sequencing because of their ability to incorporate one and only one base at a time. Without that ability, if all four bases are added to the incorporation reaction at once multiple bases would be added to a given strand and the interactions between the proximate dyes would hinder the ability to resolve the sequence information correctly.
[0076] For example, the nucleotide analogs described herein can facilitate sequencing nucleic acids containing homopolymer sequences, using sequencing by synthesis methodology (e.g., using the methods of US 2007/0190546, herein incorporated by reference in its entirety for all purpose. When the template sequence contains a homopolymer, using a polymerase, nucleotide analog, and reaction buffer combination that allows for only a single nucleotide analog incorporation allows for each base in the homopolymer to be sequenced sequentially. After one base is incorporated into the homopolymer and detected, the portion of the analog that inhibits subsequent base incorporation and that contains the fluorescent label is removed, making incorporation of the next base in the homopolymer possible during the next addition cycle of the correct base.
[0077] Reference to the following figures or schemes illustrating an exemplary reaction scheme and nucleotide analogs is intended in no way to limit the scope of this invention but is provided to illustrate how to prepare and use the compounds of the present invention.
Examples Example 1. Caproic-Glu and Caproic-Glu
Figure imgf000033_0001
Synthetic Scheme I
Figure imgf000033_0002
Figure imgf000034_0001
Figure imgf000035_0001
3-tert-Butyldisulfanyl-2-(9H-fluoren-9-ylmethoxycarbonylamino)-propionic acid 2, 5-dioxo- pyrrolidin-1-yl ester (2)
Figure imgf000036_0001
[0078] To a solution of Fmoc-Cys(S StBu)-OH (1, 2.15 g, 5.0 mmole,) dissolved in anhydrous CH2Cl2(SO mL) was added N-Ethyl-N'-(3-dimethylaminopropyl)carbodiimide hydrochloride (EDAC, 1.146g, 6 mmole), the reaction mixture was stirred for 10 min. at room temperature (RT) and then added N-hydroxysuccinimide (NHS) (0.690 g, 6.0 mmole). To this reaction mixture was added catalytic amount of N,N'-dimethlyaminopyridine and stirred at RT until completion of reaction tested with TLC. The solvent was evaporated and the residue obtained was extracted with ethyl acetate (50 mLx2), washed with IM NaHCO3 (10 mL), followed by brine solution (20 mL) and dried over anhydrous Na2SO^ Evaporation of the solvent afforded 2 as a white crystalline solid. Yield. 2.5 g (95%).
6-[S-tert-Butyldisulfanyl-2-(9H-fluoren-9-ylmethoxycarbonylamino)-propionylamino]-hexanoic acid (3)
Figure imgf000036_0002
[0079] To a solution of 6-Aminohexanoic acid (0.158.g, 1.2 mmole) dissolved in 0.1 M NaHCO3 (2.0 mL) was added the NHS ester 2 (0.68g, 1.3 mmole) in 4 mL of anhydrous THF. The reaction mixture was stirred at RT for 2 hr. The solvent was completely evaporated and the dried solid residue obtained was dissolved in CH3OHZCH2Cl2 mixture and purified by silica gel column chromatography using 10% CH3OH/CH2C12 and obtained 3 as a white solid on evaporation the solvent. Yield: 0.5 g (77%).
Figure imgf000037_0001
[0080] To a solution of (3, 500 mg, 0.92 mmole,) dissolved in anhydrous CH2C12/THF (1: 1)(5 mL) was added N-Ethyl-N'-(3-dimethylaminopropyl)carbodiimide hydrochloride (EDAC, 191 mg, 1.0 mmole), followed by NHS (115 mg, 1.0 mmole). To this reaction mixture was added catalytic amount of N,N'-dimethlyaminopyridine and stirred at RT until completion of reaction tested with TLC. The solvent was evaporated and the residue obtained was extracted with ethyl acetate (50 mLx2), washed with IM NaHCO3 (10 mL), followed by brine solution (10 mL) and dried over anhydrous Na2SCv Evaporation of the solvent afforded 4 as a white crystalline solid. Yield. 0.52g (88%).
2-{6-[3-tert-Butyldisulfanyl-2-(9H-fluoren-9-ylmethoxycarbonylamino)-propionylamino]- hexanoylaminoj-pentanedioic acid (5)
Figure imgf000037_0002
[0081] To a stirred solution of Glutamic acid (20 mg, 0.14 mmole) in 0.2M NaHCO3 (0.5 mL) was added 6-[3-tert-Butyldisulfanyl-2-(9H-fluoren-9-ylmethoxycarbonylamino)- propionylamino]-hexanoic acid 2,5-dioxo-pyrrolidin-l-yl ester (4, 96 mg, 0.15 mmole) dissolved in (THF-DMF(1 : 1), 0.5 mL). The reaction mixture was stirred at RT for 10 min. and analyzed with LCMS which showed the product (5) peak with mass m/z: 671.95 [M-H]. The reaction was stirred at RT for overnight and purified by HPLC using Phenomenex Cl 8 preparative column, (250 x 21.00 mm, gradient: 2% CH3CN/50mM TEAB (triethylammonium bicarbonate), pH 8.4, 10 mL/min flow). Fractions containing the compound 5 were collected together and evaporated the solvent using rotary evaporator and dried. Yielded 5 as a white solid: 50 mg. 2-{6-[2-(9H-Fluoren-9-ylmethoxycarbonylamino)-S-mercapto-propionylamino]- hexanoylaminoj-pentanedioic acid (6)
Figure imgf000038_0001
[0082] A solution of (5) (10 mg, 0.015 mmole) in H2O-THF (1:1, 1.0 ml) was treated with tris(2-carboxyethyl)phosphine (TCEP, 0.10 mL, 0.5M in H2O). The reaction was stirred at RT for 4h until complete cleavage of disulphide bond (monitored by LCMS) and purified by HPLC using Phenomenex Cl 8 preparative column, (250 x 21.00 mm, gradient: 2% CH3CN/50mM TEAB, pH 8.4, 10 mL/min flow). Fractions containing the compound 6 were pooled and used immediately for the subsequent displacement reaction with dATP-SPDP (SPDP: N-succinimidyl 3-(2-pyridyl dithio) propionate) and dCTP-SPDP as described below. LCMS: m/s: 583.95[M-H].
Compound 7
Figure imgf000038_0002
[0083] The fractions containing compound 6 in 60% CH3CN/50 mM TEAB buffer (4.0 mg, in 4 mL) collected from HPLC were mixed with dATP-SPDP (3.6 μmole, ref. previous patent) in 4 ml of 30% CH3N/50 mM TEAB buffer, pH 8.4 in a round bottom flask and stirred for 2h. The reaction solution was concentrated under reduced pressure, diluted with water and purified with HPLC (Phenomenex Cl 8 column, 250 x 21.0 mm, gradient: 1.5% CH3CN/50 mM TEAB buffer, lOmL/min flow rate). Fractions containing the desired were pooled together and evaporated and dried. Yielded 7 (3.0 mg) as a white solid. LCMS: m/z: 2121.80 [M-2H], 606.05 [M/2-2H].
Compound 8
Figure imgf000039_0001
[0084] The compound 7 (2.0 mg) obtained was dissolved in anhydrous DMF (0.6 mL) added 60μl of piperidine. The reaction mixture was then stirred at RT for an hour. The complete cleavage of FMOC group was monitored by LCMS and the reaction mixture was purified by HPLC (Phenomenex Cl 8 column, 250 x 21.0 mm, gradient: 1.5% CH3CN/50 mM TEAB buffer, 10 mL/min flow rate). Fractions containing the desired were pooled together and evaporated and obtained 8 (1.0 μmole) as a colorless solid. LCMS: 990.95 [M/2-2H]. Compound 9
Figure imgf000040_0001
[0085] To a solution of 8 (0.5 μmole) in 0.5 mL of 50 mM K2HPO4 was added Cy5- NHS (1 mg, 1.2 μmole) dissolved in 20μL of anhydrous DMF and stirred at RT until the complete disappearance of starting material 8 which was monitored by LCMS. Then the blue color reaction mixture was purified HPLC (Phenomenex Cl 8 column, 250 x 21.0 mm, gradient: 1.5% CH3CN/5O mM TEAB buffer, 10 mL/min flow rate). Fractions containing the desired were pooled together and lyophilized. Yielded 9a (0.36 μmole) as a blue solid. LCMS: 814.40 [M/2-2H].
[0086] Similarly a solution of 8 (0.5 μmole) in 0.5 mL of 50 mM K2HPO4 was added Atto 647N-NHS (2 mg, 2.5 μmole) dissolved in 40μL of anhydrous DMF and stirred at RT until the complete disappearance of starting material 8 which was monitored by LCMS. Then the blue color reaction mixture was purified HPLC (Phenomenex Cl 8 column, 250 x 21.0 mm, gradient: 2.0% CH3CN/5O mM TEAB buffer, 10 mL/min flow rate). Fractions containing the desired were pooled together and lyophilized. Yielded 9b (0.3 μmole) as a blue solid. LCMS: 1595.2 [M-2H], 797.0 [M/2-2H]. Compound 10
Figure imgf000041_0001
Figure imgf000041_0002
[0087] The fractions containing compound 6 in 60% CH3CN/50 mM TEAB buffer (3.0 mg, in 3 mL) collected from HPLC were mixed with dCTP-SPDP (3.0 μmole, ref. previous patent) in 3 ml of 30% CH3N/50 mM TEAB buffer, pH 8.4 in a round bottom flask and stirred for 2 hr. The reaction solution was concentrated under reduced pressure, diluted with water and purified with HPLC (Phenomenex Cl 8 column, 250 x 21.0 mm, gradient: 1.5% CH3CN/50 mM TEAB buffer, 10 mL/min flow rate). Fractions containing the desired were pooled together and evaporated and dried. Yielded 10 (3.0 mg) as a white solid. LCMS: m/z: 1189.85 [M-2H], 594.8 [M/2-2H].
Compound 11
Figure imgf000042_0001
Figure imgf000042_0002
[0088] The compound 10 (2.0 mg) obtained was dissolved in anhydrous DMF (0.6 mL) added 60μl of piperidine. The reaction mixture was then stirred at RT for an hour. The complete cleavage of FMOC group was monitored by LCMS and the reaction mixture was purified by HPLC (Phenomenex Cl 8 column, 250 x 21.0 mm, gradient: 1.5% CH3CN/50 mM TEAB buffer, 10 mL/min flow rate). Fractions containing the desired were pooled together and evaporated and obtained 11 (1.2 μmole) as a colorless solid. LCMS: 967.90 [M/2-2H].
Compound 12
Figure imgf000043_0001
[0089] To a solution of 11 (0.6 μmole) in 0.5 mL of 50 mM K2HPO4 was added Cy5- NHS (1.5 mg, 1.6 μmole) dissolved in 30 μL of anhydrous DMF and stirred at RT until the complete disappearance of starting material 11 which was monitored by LCMS. Then the blue color reaction mixture was purified HPLC (Phenomenex Cl 8 column, 250 x 21.0 mm, gradient: 1.5% CH3CN/50 mM TEAB buffer, 10 mL/min flow rate). Fractions containing the desired were pooled together and lyophilized. Yielded 12a (0.5 μmole) as a blue solid. LCMS: 814.40 [M/2-2H].
[0090] Similarly a solution of 11 (0.4 μmole) in 0.5 mL of 5OmM K2HPO4 was added Atto 647N-NHS (2 mg, 2.5 μmole) dissolved in 40μL of anhydrous DMF and stirred at RT until the complete disappearance of starting material 11 which was monitored by LCMS. Then the blue color reaction mixture was purified HPLC (Phenomenex Cl 8 column, 250 x 21.0 mm, gradient: 2.0% CH3CN/5O mM TEAB buffer, 10 mL/min flow rate). Fractions containing the desired were pooled together and lyophilized. Yielded 12b (0.35 μmole) as a blue solid. LCMS: 1595.2 [M-2H], 797.0 [M/2-2H]. Example 2 Caproic-Asp-Asp Synthetic Scheme II
Figure imgf000044_0001
C* cap-Asp-Asp: H
Figure imgf000044_0002
Figure imgf000045_0001
Figure imgf000046_0001
Figure imgf000047_0001
[0091] α- N-Fmoc-S-tert-butylthio-L-cysteine (1 g, 2.32 mmol) was dissolved in anhydrous acetonitrile and solution of dicyclohexylcarbodiimide (DCC) (573 mg, 2.78 mmol in CH3CN) was added followed by solution of NHS (345 mg, 3.01 mmol in CH3CN). After 1 hr. dicyclohexylurea was spun down and active ester used without purification in coupling with ε- amino-hexanoic acid (304 mg, 2.32 mmol) dissolved in 50% aq. DMF. N,N'- Diisopropylethylamine (DIPEA) was added to correct pH to 8.0. Upon completion reaction mixture was acidified to pH 3 and partitioned between water and dichloro methane (DCM). Organic layer was dried over anhydrous Na2SC^ and evaporated to give 1.33g of crude material. Purification using flash chromatography in DCM/methanol gave 745 mg of pure material (MW=544.75).
Figure imgf000047_0002
[0092] α- N-Fmoc-S-tert-butylthio-L-cyst-caproic acid (3, 77 mg, 141 μmols, CH3CN) was converted to NHS active ester using DCC (35 mg, 169 μmols, CH3CN) and NHS (21 mg, 183 μmols, ACN). After 1 hr. precipitate of dicyclohexylurea was removed by centrifugation and ester used without further purification in coupling with H-Asp-Asp-OH peptide (12 mg, 48 nmols) dissolved in 0.5M K2HPO4, pH of reaction mixture corrected to 7.5 with DIPEA. Progress of reaction was monitored by TLC (disappearance of ester) and by LC-MS (formation of product). Upon completion product was isolated by direct injection on preparative HPLC (Cl 8 column, 3 % CH3CN gradient in 50 mM TEAB, pH 8.6). Isolated product was lyophilized to give white powder (MW=774.9)
Figure imgf000048_0001
[0093] To free the thiol α- N-Fmoc-S-tert-butylthio-L-cyst-caproic-Asp-Asp-OH (15) was treated with 100 mM DTT in 0.1M K2HPO4 during 1 hr. at RT. Reaction was monitored by LC-MS and upon completion injected directly on preparative HPLC (Cl 8 column). Purification using 2% CH3CN gradient in 50 mM TEAB, pH 8.6 yielded product (MW=686.7) which was used immediately without evaporation in displacement reaction with SPDP modified nucleotide triphosphates.
Figure imgf000048_0002
[0094] dATP-AP3 and dCTP-AP3 were prepared by a modified procedure of Hobbs and Cocuzza: a) Pyrophosphate and tributylamine were added to the reaction mixture rather than vice versa; b) After pyrophosphate addition the reaction was quenched with 50 mM TEAB within 15 min.; c) DEAE-Sephadex chromatography was replaced by preparative HPLC.
Figure imgf000048_0003
Figure imgf000049_0001
[0095] SPDP modification of dATP-AP3 and dCTP-AP3 was accomplished using standard protocol: 2 μmols of dNTP-AP3 were dissolved in 250 μl of 0.1N NaHCO3 and 1.2 equivalent (eqv.) of freshly prepared 50 mM stock of SPDP in anhydrous DMF was added. Progress of modification was monitored using LC-MS. Product was isolated using preparative HPLC (C18 column ) with 1% CH3CN gradient in 50 mM TEAB, pH 8.6 gradient and used in displacement reaction with thiol without evaporation of HPLC solvents (MW=717.01 for dCTP- AP3-SPDP, MW=740.03 for dATP-AP3-SPDP).
Figure imgf000049_0002
[0096] Small aliquots of isolated thiol were added to freshly isolated dNTP-AP3-SPDP to obtain displacement product. Progress of reaction was monitored by LC-MS after every addition of thiol. Reaction was completed when all dNTP-AP3-SPDP was consumed at which point reaction mixture was concentrated and purified on preparative HPLC (C 18 column) using 1% gradient Of CH3CN in 50 mM TEAB, pH 8.6. Isolated product was lyophilized to give white powder (MW=1293.06 for cytidine-analog and MW= 1316.09 for adenosine-analog).
Figure imgf000050_0001
[0097] Removal of Fmoc-protecting group was accomplished using 20% piperidine in CH3CN (20 min., RT). Subsequently solvents were removed and crude reaction mixture purified on preparative HPLC (C 18 column) using 2% CH3CN gradient. Product was dried down and OD measured in water at 290 nm for cytidine analog (800 nmols, MW=1070.8) and 280 nm for adenosine analog (640 nmols, MW=1093.8).
Figure imgf000050_0002
Figure imgf000051_0001
[0098] Dye modified final products were prepared using following standard conditions: peptide modified dNTPs were re-dissolved in 20 mM K2HPO4 and dye -NHS dissolved in anhydrous DMF (5 mg in lOOμl) was added using initially 1.2 eqv. up to 4 eqv. to reach complete consumption of starting material. Progress of modification was monitored using LC- MS. Product was isolated using preparative HPLC (C 18 column) with 1% CH3CN gradient and 50 mM TEAB, pH 8.6. Desired fractions were combined, organic solvent removed under reduced pressure and products subjected to CH3OH repurification on C18 HPLC column (1% CH3OH gradient). Final fractions were quantitated at 650 nm using εβso = 250000 M" cm" for
Cy5 dye and 150000 M -11Cm -"l1 for Atto 647N dye.
Example 3 Caproic-Arg-Arg-Arg
Synthetic Scheme III
Figure imgf000052_0001
Compound 32
[0099] Compound 31 (100 mg, 0.18 mmol) was dissolved in 0.8 ml DMF and added 0.2 mL piperidine and then kept at RT for 30 min. DMF was removed and the residue was purified with flash column using CH2Cl2: CH3OH (2:1). The purified amine (35 mg) was dissolved in 1 mL DMF and used directly for the next step without characterization. 3.5 mg of the purified amine in 0.1 mL DMF (10.8 μmol) was added 60 μL DMF and 40 μL DIPEA and then Cy5 Mono NHS Ester (6.63 μmol) in 100 μL anhydrous DMF was added into the solution. After 30 minutes, the reaction mixture was purified with HPLC (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.00 mm 10 micron, gradient: 100% A for 5 min, then 1% B/min, buffer A 0.05 M TEAB, buffer B CH3CN, 10 mL/min flow). Fractions containing the desired compound 32 were pooled and quantified; (3.0 μmol, 45 %, ε649 = 250000); ESI-MS (negative ion mode): m/z = 959.20 (M-H).
Compound 33
[00100] The NHS ester of the acid 32 was prepared by dissolving the acid 32 (3.0 μmol) in DMF (500.0 μL) and N,N,N',N'-Tetramethyl-O-(N-succinimidyl)uronium hexafluorophosphate (SbTMU) (4.3 mg, 12 μmol) in 100 μL DMF was added to the acid solution followed by the addition of DIPEA (80 μL). After stirring at RT for 1 hr., the reaction mixture was used immediately for peptide coupling without any purification. The peptide Arg- Arg-Arg-OH (14.5 mg, 30 μmol) was dissolved in 160 μL 0.5M phosphate buffer, and added to the freshly prepared NHS ester of the acid 32. The reaction mixture was stirred for 30 minutes and then the crude reaction mixture was purified with HPLC (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.00 mm 10 micron, gradient: 100% A for 5 min, then 1% B/min, buffer A 0.05 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired compound 33 were pooled and quantified; (0.6 μmol, 20 %, ε649 = 250000); ESI-MS (negative ion mode): m/z = 713.45 [(M-2H)/2].
Compound 34
[00101] A solution of compound 33 (0.6 μmol) in 3 ml H2O was treated with TCEP (300 μL, IM solution) in an aluminum foil covered flask. After 30 minutes, the reaction mixture was purified with HPLC (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.00 mm 10 micron, gradient: 100% A for 5 min, then 1% B/min, buffer A 0.05 M TEAB, buffer B CH3CN, 10 mL/min flow). Fractions containing the desired thiol, analyzed with ESI-MS (negative ion mode): m/z = 669.90 [(M- 2H)/2], were pooled and immediately added dATP-SPDP (1 μmol in 1 mL H2O). After 15 minutes, LCMS analysis indicated that the completion of the reaction and the reaction mixture was then partially concentrated under reduced pressure to remove CH3CN, then purified with HPLC (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 10.00 mm 10 micron, gradient: 100% A for 5 min, then 1% B/min, buffer A 0.05M TEAB, buffer B CH3CN, 5 mL/min flow). Fractions containing the desired compound were pooled and concentrated and then purified again with HPLC using CH3OH and TEAB buffer. The fractions containing the desired compound 34 were pooled and lyophilized to yield compound 34 as a bright blue solid (0.37 μmol, 62 %, ε649 = 250000). ESI-MS (negative ion mode): m/z = 983.75 [(M-2H)/2].
Example 4 Cap- Asp-Asp- Asp-Asp
Figure imgf000054_0001
Compound 45 [00102] Cy5 Mono NHS Ester (100.0 μL, 6.63 μmol) in anhydrous DMF was added to a solution of amine 44 (13.26 μmol, 2 equiv) in DMF (100 μL) and DIPEA (20.0 μL) in an aluminum foil covered flask. After 30 minutes, the disappearance of the starting amine was determined by LCMS or HPLC. The reaction was HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex C18 preparative column, 250 x 21.00 mm 10 micron, gradient: 100% A for 5 min, then 1% B/min, buffer A 0.05 M TEAB, buffer B CH3CN, 10 mL/min flow). Fractions containing the desired product were pooled and quantified; (4.0 μmol, 60.3 %, ε649 = 250000); ESI-MS (negative ion mode): m/z = 959.20 (M-H).
Figure imgf000055_0001
Compound 46
[00103] The NHS ester of the acid 45 was prepared by dissolving the acid 45 (4.0 μmol, 1 eqv.) in DMF (700.0 μL) and the SbTMU 5.93 mg, 16.5 μmol, in 200 μL DMF, 4.0 eqv.) was added, to the acid solution followed by the addition of DIPEA (103.0 μL). After stirring at RT for 1 hour, the reaction mixture was used immediately for peptide coupling without any purification. The peptide (Asp- Asp-Asp- Asp) was dissolved in DMFB2O (400.0 μL, 1 :1), basifϊed using DIPEA (50.0 μL). To this peptide solution was added freshly prepared NHS ester of the acid 45. The reaction mixture was stirred for 30 minutes and it was then analyzed by LCMS. The crude reaction mixture was HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.00 mm 10 micron, gradient: 100% A for 5 min., then 1% B/min, buffer A 0.05 M TEAB, buffer B CH3CN, 10 mL/min flow). Fractions containing the desired were pooled and quantified; (3.0 μmol, 75.0 %, ε649 = 250000); ESI-MS (negative ion mode): m/z = 709.20 (1/2M-H).
Figure imgf000056_0001
Compound 47
[00104] A solution of compound 46 (1.0 μmol) in H2O was treated with TCEP (40.0 μL, 19.92 μmol, 0.5 M in H2O, 19.92 equiv) in an aluminum foil covered flask. After 30 minutes, the reaction mixture was analyzed by LCMS and was then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex C18 preparative column, 250 x 21.00 mm 10 micron, gradient: 100% A for 5 min., then 1% B/min, buffer A 0.05 M TEAB, buffer B CH3CN, 10 mL/min flow). Fractions containing the desired were pooled and used immediately for the subsequent displacement reaction without removing the solvent. ESI-MS (negative ion mode): m/z = 665.45 (1/2M-H).
Figure imgf000056_0002
Compound 48a [00105] HPLC fractions containing the thiol 7 (0.34 μmol, 1 eqv.) were mixed with HPLC fractions containing dCTP-SPDP (0.41 μmol, 1.25 eqv.) in an aluminum foil covered flask. After 15 min. LCMS analysis indicated that the completion of the reaction and it was then partially concentrated under reduced pressure to remove CH3CN, then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 10.00 mm 10 micron, gradient: 100% A for 5 min, then 1% B/min, buffer A 0.05 M TEAB, buffer B CH3CN, 5 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield compound 1 as a bright blue solid (0.17 μmol, 50 %, ε649 = 250000). The desired product was HPLC purified a second time under the same conditions, using CH3OH instead Of CH3CN for buffer B. Fractions containing the desired were pooled and stored at -80 0C without removing the solvent. ESI-MS (negative ion mode): m/z = 968.35 (1/2M-H).
Figure imgf000057_0001
Compound 49a
[00106] HPLC fractions containing thiol 47 (0.5 μmol, 1 eqv.) were mixed with HPLC fractions containing dATP-SPDP (0.6 μmol, 1.2 eqv.) in an aluminum foil covered flask. After 15 min. LCMS analysis indicated that the completion of the reaction and it was then partially concentrated under reduced pressure to remove CH3CN, then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 10.00 mm 10 micron, gradient: 100% A for 5 min., then 1% B/min, buffer A 0.05M TEAB, buffer B CH3CN, 5 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield compound 49a as a bright blue solid (0.35 μmol, 70 %, ε649 = 250000). The desired was HPLC purified a second time under the same conditions, using CH3OH instead Of CH3CN for buffer B. Fractions containing the desired were pooled and stored at -80 0C without removing the solvent. ESI-MS (negative ion mode): m/z = 980.10 (1/2M-H).
Example 5 Caproic-Asp Synthetic Scheme IV
Figure imgf000058_0001
Figure imgf000059_0001
Figure imgf000059_0002
Figure imgf000060_0001
NHS ester
[00107] Fmoc-Cys(SϊBu)-OH (2.0 g, 4.63 mmol, 1 eqv.) was dissolved in CH3CN (10 mL). DCC (1.2 g, 5.81 mmol, 1.26 eqv.) was added, followed by NHS (0.70 g, 6.08 mmol, 1.31 eqv.) and the reaction was stirred at RT for 1 hr. White precipitate (DCU) began forming within five min. The reaction mixture was transferred to Eppendorf tubes and centrifuged to remove the white precipitate. The supernatant was then used in subsequent reactions without further purification.
Figure imgf000061_0001
Acid
[00108] 6-Aminohexanoic acid (0.60 g, 4.57 mmol, 1 eqv.) was dissolved in 1 :1 H2O:DMF (6 mL total). DIPEA (0.016 mL) was added to keep the pH about 8. NHS ester (4.63 mmol in 10 mL CH3CN, 1.01 eqv.) was added to the reaction mixture in 1 mL aliquots over aboutlO min. DIPEA (0.02 mL) was added after each aliquot to keep the reaction basic. After the first aliquot of NHS ester was added, the reaction became cloudy, and addition of extra H2O (0.2 mL) was needed to clear up the solution. The reaction was stirred at RT for two hours, then quenched with 20 mL 10% HCl (aq.). The aqueous phase was extracted with CH2Cl2 (2 x 5OmL). The organic phase was dried over Na24, filtered, and concentrated under reduced pressure to yield a brown oil. Purification by flash column chromatography (100% CH2Cl2 to 5% CH3OH/CH2C12) afforded the desired acid as a white foam (2.14 g, 86%).
Figure imgf000061_0002
NHS ester
[00109] The starting acid (0.99 g, 1.82 mmol, 1 eqv.) was dissolved in CH3CN (10 mL). DCC (0.46 g, 2.23 mmol, 1.23 eqv.) was added, followed by NHS (0.28 g, 2.43 mmol, 1.34 eqv.) and the reaction was stirred at RT for an hour. White precipitate (DCU) began forming within 5 min. The reaction mixture was transferred to Eppendorf tubes and centrifuged to remove the white precipitate. The supernatant was then used in subsequent reactions without further purification.
Figure imgf000062_0001
Dimethyl ester
[00110] L-Aspartic acid dimethyl ester hydrochloride (0.2 g, 1.01 mmol, 2 eqv.) was dissolved in CH3CN (1 mL) and DIPEA (0.32 mL, 1.84 mmol, 4 eqv.). A solution of NHS ester (0.48 mmol, 1 eqv.) in CH3CN (2 mL) was added, and the reaction was stirred at RT for 12 hr. The reaction was diluted with EtOAc (25 mL), then washed with brine (1 x 30 mL) and sat. NH4CI (aq.) (1 x 30 mL). The organic phase was dried over Na2SC^, filtered, and concentrated under reduced pressure. Purification by flash column chromatography (100% CH2Cl2 to 2% CH3OH/CH2CI2) afforded the desired ester as a white foam (0.12 g, 36%).
Figure imgf000062_0002
Diacid
[00111] IM LiOH(aq) (0.18 mL, ~6 equiv) was added to a solution of dimethyl ester (0.02 g, 0.029 mmol, 1 eqv.) in THF (0.30 mL). The reaction was stirred at RT until the starting dimethyl ester was consumed based on LCMS analysis (about 15 min). The crude reaction was then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 90% A for 3 min., then 5% B/min., buffer A 0.05M TEAB, buffer B CH3CN, 10 mL/min. flow). Fractions containing the desired were pooled and concentrated to yield the desired diacid, which was used for subsequent reactions without quantifying.
Figure imgf000062_0003
Thiol
[00112] Diacid (-29 μmol, 1 eqv.) was treated with TCEP (1.7 mL, 0.85 mmol, 0.5M in H2O, 29 eqv.). The reaction was stirred at RT until the starting material was consumed based on LCMS analysis (about 30 min.). The crude reaction was then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min, then 5% B/min., buffer A 0.05M TEAB, buffer B CH3CN, 10 mL/min. flow). Fractions containing the desired were pooled and used for subsequent reactions without concentrating or quantifying.
Figure imgf000063_0001
Disulfide
[00113] HPLC fractions containing the thiol (about 10 μmol, 2 eqv.) were mixed with HPLC fractions containing SPDP-dATP (5 μmol, 1 equiv). After the SPDP-dATP was consumed based on LCMS analysis (about 10 min), the reaction was partially concentrated under reduced pressure to remove CH3CN and then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 15.0 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min., buffer A 0.05M TEAB, buffer B CH3CN, 10 mL/min. flow). Fractions containing the desired were pooled and lyophilized, then used for subsequent reactions without quantifying.
Figure imgf000064_0001
Amine
[00114] The starting carbamate (~5 μmol, 1 eqv.) was treated with 20% piperidine in 1 : 1 DMF: CH3CN (2 mL), and stirred at RT until the starting material was consumed based on LCMS analysis (-15 min). After removing the solvent under reduced pressure, the reaction was HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min., buffer A 0.05M TEAB, buffer B CH3OH, 10 mL/min. flow). Fractions containing the desired were pooled and lyophilized to yield the product as a white foam (1 μmol, 20%, £280 = 12700).
Figure imgf000064_0002
A * Caproic-Asp
[00115] Atto647N-NHS ester (0.030 mL, 1.8 μmol, 0.06M in anhydrous DMF, 3.6 eqv.) was added to a solution of amine (0.5 μmol, 1 eqv.) in H2O (0.25 mL) in 10 μL aliquots. The reaction was monitored by LCMS to determine how much dye was needed to consume the starting amine. After disappearance of amine, the crude reaction was HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 10.0 mm 10 micron, gradient: 100% A for 3 min, then 2% B/min., buffer A 0.05M TEAB, buffer B CH3CN, 5 mL/min. flow). Fractions containing the desired were pooled and concentrated, then HPLC purified a second time under the same conditions, using CH3OH instead of CH3CN for buffer B. Fractions containing the desired were pooled and stored at -80 0C without removing the solvent (0.086 μmol, 17%, ε64s = 150000).
Figure imgf000065_0001
Disulfide
[00116] HPLC fractions containing the thiol (-10 μmol, 6 eqv.) were mixed with HPLC fractions containing SPDP-dGTP (1.5 μmol, 1 eqv.). After the SPDP-dGTP was consumed based on LCMS analysis (about 10 min), the reaction was partially concentrated under reduced pressure to remove CH3CN and then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 10.0 mm 10 micron, gradient: 100% A for 3 min., then 1% B/min, buffer A 0.05M TEAB, buffer B CH3CN, 5 mL/min. flow). Fractions containing the desired were pooled and lyophilized, then used for subsequent reactions without quantifying.
Figure imgf000065_0002
Amine
[00117] The starting carbamate (~1.5 μmol, 1 eqv.) was treated with 20% piperidine in DMF (0.5 mL), and stirred at RT until the starting material was consumed based on LCMS analysis (about 15 min). After removing the solvent under reduced pressure, the reaction was HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 10.0 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min., buffer A 0.05M TEAB, buffer B CH3CN, 5 mL/min. flow). Fractions containing the desired were pooled and lyophilized to yield the product as a white foam (0.26 μmol, 17%, £272 = 11900).
Figure imgf000066_0001
G* Caproic-Asp
[00118] Atto647N-NHS ester (0.011 mL, 0.66 μmol, 0.06 M in anhydrous DMF, 2.5 eqv.) was added to a solution of amine (0.26 μmol, 1 equiv) in H2O (0.50 mL) in small aliquots. The reaction was monitored by LCMS to determine how much dye was needed to consume the starting amine. After disappearance of amine, the crude reaction was HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 10.0 mm 10 micron, gradient: 100% A for 3 min, then 2% B/min., buffer A 0.05M TEAB, buffer B CH3 CN, 5 mL/min. flow). Fractions containing the desired were pooled and concentrated, then HPLC purified a second time under the same conditions, using CH3OH instead Of CH3CN for buffer B. Fractions containing the desired were pooled and stored at -80 0C without removing the solvent (0.076 μmol, 29%, ε64s = 150000).
Figure imgf000067_0001
Disulfide
[00119] HPLC fractions containing the thiol (about 5 μmol, 5 eqv.) were mixed with SPDP-dCTP (1 μmol, 1 eqv.) in H2O (0.20 mL). After the SPDP-dCTP was consumed based on LCMS analysis (about 10 min.), the reaction was partially concentrated under reduced pressure to remove CH3CN and then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min, then 3% B/min., buffer A 0.05M TEAB, buffer B CH3CN, 5 mL/min. flow). Fractions containing the desired were pooled and lyophilized, then used for subsequent reactions without quantifying.
Figure imgf000067_0002
Amine
[00120] The starting carbamate (about 1 μmol, 1 eqv.) was treated with 20% piperidine in CH3CN (0.5 mL), and stirred at RT until the starting material was consumed based on LCMS analysis (~15 min). After removing the solvent under reduced pressure, the reaction was HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 10.0 mm 10 micron, gradient: 100% A for 3 min., then 1% B/min, buffer A 0.05M TEAB, buffer B CH3CN, 5 mL/min. flow). Fractions containing the desired were pooled and lyophilized to yield the product as a white foam (0.15 μmol, 15%, £294 = 9300).
Figure imgf000068_0001
C* Caproic-Asp
[00121] Atto647N-NHS ester (0.012 mL, 0.72 μmol, 0.06M in anhydrous DMF, 3.6 eqv.) was added to a solution of amine (0.15 μmol, 1 eqv.) in H2O (0.20 mL) in 5 μL aliquots. The reaction was monitored by LCMS to determine how much dye was needed to consume the starting amine. After disappearance of amine, the crude reaction was HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 10.0 mm 10 micron, gradient: 100% A for 3 min., then 2% B/min., buffer A 0.05M TEAB, buffer B CH3CN, 5 mL/min. flow). Fractions containing the desired were pooled and concentrated, then HPLC purified a second time under the same conditions, using CH3OH instead Of CH3CN for buffer B. Fractions containing the desired were pooled and stored at -80 0C without removing the solvent (0.030 μmol, 20%, ε645 = 150000).
Figure imgf000069_0001
Disulfide
[00122] HPLC fractions containing the thiol (~5 μmol, 2.5 equiv) were mixed with SPDP-dUTP (2 μmol, 1 eqv.) in H2O (0.13 mL). After the SPDP-dUTP was consumed based on LCMS analysis (-10 min), the reaction was partially concentrated under reduced pressure to remove CH3CN and then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 10.0 mm 10 micron, gradient: 100% A for 3 min., then 1% B/min., buffer A 0.05M TEAB, buffer B CH3CN, 5 mL/min. flow). Fractions containing the desired were pooled and lyophilized, then used for subsequent reactions without quantifying.
Figure imgf000069_0002
Amine
[00123] The starting carbamate (~1 μmol, 1 equiv) was treated with 20% piperidine in DMF (2 mL), and stirred at RT until the starting material was consumed based on LCMS analysis (about 15 min). After removing the solvent under reduced pressure, the reaction was HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 10.0 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min., buffer A 0.05M TEAB, buffer B CH3CN, 5 mL/min. flow). Fractions containing the desired were pooled and lyophilized to yield the product as a white foam (0.19 μmol, 19%, ε289 13000).
Figure imgf000070_0001
T* Caproic-Asp
[00124] Atto647N-NHS ester (0.010 mL, 0.68 μmol, 0.06M in anhydrous DMF, 3.6 eqv.) was added to a solution of amine (0.19 μmol, 1 eqv.) in H2O (0.40 mL) in small aliquots. IM K2HPO4 (0.40 mL) was also added to accelerate the reaction after there was little product formed within an hour. The reaction was monitored by LCMS to determine how much dye was needed to consume the starting amine. After disappearance of amine, the crude reaction was HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 10.0 mm 10 micron, gradient: 100% A for 3 min., then 2% B/min., buffer A 0.05M TEAB, buffer B CH3CN, 5 mL/min. flow). Fractions containing the desired were pooled and concentrated, then HPLC purified a second time under the same conditions, using CH3OH instead Of CH3CN for buffer B. Fractions containing the desired were pooled and stored at -80 0C without removing the solvent (0.059 μmol, 31%, ε645 = 150000).
Example 6 Caproic-Asp- Asp- Asp- Asp (Alternative Routes)
Synthetic Scheme V
Figure imgf000071_0001
Figure imgf000072_0001
Figure imgf000073_0001
Figure imgf000073_0002
Figure imgf000074_0001
Example 7 G* Pro-Pro-Lys-Pro-Asp
Synthetic Scheme VI
Figure imgf000075_0001
Example 8 Synthesis of compounds in Table 1 Synthesis ofdCTP and dATP analogs:
Figure imgf000075_0002
[00125] A solution of N-α-Fmoc-L-glutamic acid α-t-butyl ester (1.02 g, 2.35 mmol) in anhydrous THF (12 mL) was cooled to 00C. Anhydrous NEt3 (0.4 mL, 2.87 mmol) was added, followed by ethyl chloroformate (0.3 mL, 3.1 mmol). After stirring under Ar for ~10 min, sodium borohydride (0.27 g, 7.1 mmol) was added in one portion. Methanol (23 mL) was then added slowly over ~10 min, causing vigorous gas evolution. The reaction was warmed to RT, then acidified with 10% HCl (10 mL). The organics were removed in vacuo. The residue was diluted with EtOAc (40 mL), then washed with brine (2 x 50 mL). The organic phase was dried over Na2SO4, filtered, and concentrated under reduced pressure to yield alcohol 52 as a white foam (0.97 g, 99%).
Figure imgf000076_0001
[00126] A solution of alcohol 52 (0.97 g, 2.35 mmol) in anhydrous CH2Cl2 (9 mL) and anhydrous NEt3 (0.7 mL, 5.0 mmol) was cooled to 00C. Methanesulfonyl chloride (0.28 mL, 3.6 mmol) was added dropwise over -15 min. After disappearance of starting material by TLC (-10 min), the reaction was washed with ice cold H2O (2 x 25 mL), dried over Na2SO4, filtered, and concentrated under reduced pressure. Potassium thioacetate (0.54 g, 4.7 mmol) was added to a solution of the resultant white foam in acetone (6 mL), and the dark brown reaction was stirred at RT for 12 hrs. The crude reaction was then purified by flash column chromatography (5% to 20% EtOAc/hexanes) to afford thioacetate 53 as a brown oil (0.57 g, 52%).
Figure imgf000076_0002
[00127] Trifluoroacetic acid (2 mL) was added to a solution of thioacetate 53 (0.29 g, 0.61 mmol) in CH2Cl2 (2 mL) and the reaction was stirred at RT for -30 min. The reaction was diluted with CH2Cl2 (20 mL), then washed with brine (2 x 20 mL). The organic phase was dried over Na2SO4, filtered, and concentrated under reduced pressure to yield acid 54 as a brown oil (0.22 g, 88%).
Figure imgf000077_0001
[00128] Acid 54 (0.22 g, 0.53 mmol) was dissolved in MeCN (3 mL). DCC (0.12 g, 0.58 mmol) was added, followed by NHS (0.07 g, 0.63 mmol) and the reaction was stirred at RT for an hour. White precipitate (DCU) began forming within five minutes. The reaction mixture was transferred to eppendorf tubes and centrifuged to remove the white precipitate. The dark brown supernatant containing NHS ester 55 was then added to a solution of H-Asp-Asp-OH (0.13 g, 0.52 mmol) in 0.25 M K2HPO4 (2.4 mL) and MeCN (1 mL). DIPEA (0.15 mL) was added to keep the pH ~ 8. The reaction was stirred at RT for two hours, then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min, then 3% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired thioacetate 56 were pooled and lyophilized to yield the product as a white foam (0.25 g, 75%).
Figure imgf000077_0002
[00129] A solution of thioacetate 56 (0.023 g, mmol) in 50% MeCN/H2O (0.5 mL) was treated with 1 M NH2OH (0.5 mL, pH ~7). The reaction was stirred at RT for -10 min, then immediately HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min, then 2% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired thiol 57 were used immediately for subsequent reactions without quantifying.
Synthesis ofdATP analog:
Figure imgf000078_0001
[00130] HPLC fractions containing thiol 57 (unqualified, -25 μmol) were mixed with HPLC fractions containing SPDP-dATP (20 μmol). After -15 minutes the reaction was lyophilized, then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield 58, which was used for the subsequent reaction without quantifying.
Figure imgf000078_0002
[00131] Carbamate 58 (unqualified, -16 μmol) was treated with 20% piperidine/MeCN (2 mL) and 20% piperidine/DMF (1 mL) for 15 minutes to remove the Fmoc protecting group. The solvent was removed under reduced pressure, and the residue was then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield the product as a white foam (7.5 μmol, 47%, ε289 = 12700).
Figure imgf000079_0001
[00132] Atto647N NHS ester (0.36 mL, 36 μmol, 0.1 M in anhydrous DMF) was added to a solution of amine 59 (17.6 μmol) in H2O (3 mL) and DMF (0.9 mL). After disappearance of amine by LCMS the crude reaction was HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 15.00 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield amide 60 as a bright blue solid (13 μmol, 74%, E649 = 150000).
Synthesis ofdCTP analog:
Figure imgf000079_0002
[00133] HPLC fractions containing thiol 57 (unqualified, -55 μmol) were mixed with HPLC fractions containing SPDP-dCTP (45 μmol). After -30 minutes the reaction was lyophilized, then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield 61, which was used for the subsequent reaction without quantifying.
Figure imgf000080_0001
[00134] Carbamate 61 (unqualified, -36 μmol) was treated with 20% piperidine/MeCN (3 mL) for 15 minutes to remove the Fmoc protecting group. The solvent was removed under reduced pressure, and the residue was then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield amine 62 as a white foam (10.6 μmol, 29%, ε289 = 9300).
Figure imgf000080_0002
[00135] Atto647N NHS ester (0.475 mL, 47.5 μmol, 0.1 M in anhydrous DMF) was added to a solution of amine 62 (21 μmol) in H2O (3 mL) and DMF (0.7 mL). After disappearance of amine by LCMS the crude reaction was HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 15.00 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield amide 63 as a bright blue solid (17.7 μmol, 84%, ε649 = 150000).
Synthesis ofdUTP analog:
Figure imgf000081_0001
[00136] Acid 54 (0.14 g, 0.34 mmol) was dissolved in MeCN (0.6 mL). DCC (0.085 g, 0.41 mmol) was added, followed by NHS (0.051 g, 0.44 mmol) and the reaction was stirred at RT for an hour. White precipitate (DCU) began forming within five minutes. The reaction mixture was transferred to eppendorf tubes and centrifuged to remove the white precipitate. The supernatant containing NHS ester 55 was then added to a solution of 6-aminocaproic acid (0.049 g, 0.37 mmol) in H2O (0.4 mL) and DMF (0.4 mL). DIPEA (0.05 mL) was added to keep the pH ~ 8. The reaction was stirred at RT for 12 hours, then adjusted to pH 4 with 0.1 M HCl and extracted with CH2Cl2 (2 x 25 mL). The organic phase was dried over Na2SO^ filtered, and concentrated under reduced pressure. Purification by flash column chromatography (100% CH2Cl2 to 5% MeOH/CH2Cl2) afforded acid 64 (0.12 g, 69%).
Figure imgf000082_0001
[00137] Acid 64 (0.12 g, 0.23 mmol) was dissolved in MeCN (0.6 mL). DCC (0.058 g, 0.28 mmol) was added, followed by NHS (0.035 g, 0.3 mmol) and the reaction was stirred at RT for an hour. White precipitate (DCU) began forming within twenty minutes. The reaction mixture was transferred to eppendorf tubes and centrifuged to remove the white precipitate. The supernatant containing NHS ester 65 was then added to a solution of H-Asp-Asp-OH (0.075 g, 0.30 mmol) in 0.1 M K2HPO4 (0.5 mL) and MeCN (0.5 mL). DIPEA (0.095 mL) was added to keep the pH ~ 8. The reaction was stirred at RT for 12 hours, then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 5 min, then 2% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired thioacetate 66 were pooled and lyophilized to yield the product as a white foam (0.1 g, 55%).
Figure imgf000082_0002
[00138] A solution of thioacetate 66 (0.1 g, 0.13 mmol) in 50% MeCN/H2O (2 mL) was treated with 1 M NH2OH (2 mL, pH ~7). The reaction was stirred at RT for -10 min, then immediately HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 5 min, then 2% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired thiol 67 were used immediately for subsequent reactions without quantifying.
Synthesis ofdUTP analog:
Figure imgf000083_0001
[00139] HPLC fractions containing thiol 67 (unqualified, -55 μmol) were mixed with HPLC fractions containing SPDP-dUTP (50 μmol). After ~1 hr the reaction was concentrated to remove MeCN, then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex C18 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 5 min, then 2% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield 68, which was used for the subsequent reaction without quantifying.
Figure imgf000084_0001
[00140] Carbamate 68 (unqualified, -50 μmol) was treated with 20% piperidine/MeCN (2 mL) for 30 minutes to remove the Fmoc protecting group. The solvent was removed under reduced pressure, and the residue was dissolved in 50 mM TEAB buffer (~3mL), causing formation of copious white precipitate (dibenzylfulvene). The mixture was transferred to eppendorf tubes and centrifuged to remove the precipitate. The supernatant was then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 5 min, then 2% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield amine 69 as a white foam (39 μmol, 78%, ε288 = 13000).
Figure imgf000085_0001
[00141] Atto647N NHS ester (0.24 mL, 24 μmol, 0.1 M in anhydrous DMF) was added to a solution of amine 69 (20 μmol) in H2O (0.5 mL) and DMF (0.1 mL). DIPEA (4 μL, 20 μmol) was added to basify the reaction. After disappearance of amine by LCMS the crude reaction was HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 15.00 mm 10 micron, gradient: 100% A for 5 min, then 1% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield amide 70 as a bright blue solid (20 μmol, 99%, ε649 = 150000).
Synthesis of dGTP analog:
HH PPrroo--LLyyss((FFmn oc)-Pro-
Figure imgf000086_0001
[00142] SPDP (2.1 mL, 0.10 mmol, 0.05 M in DMF) was added to a solution of H-Pro- Lys(Fmoc)-Pro-Asp-Asp-OH (0.054 g, 0.068 mmol) in 0.1 M K2HPO4 (3 mL) and the reaction was stirred at RT until disappearance of the peptide by LCMS. The crude reaction was HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min, then 2% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield peptide 72, which was used without quantifying.
Figure imgf000086_0002
[00143] A solution of disulfide 72 (-60 μmol) in H2O (5 mL) was treated with TCEP (1.44 mL, 0.72 mmol, 0.5 M in H2O). After -15 minutes the crude reaction was HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min, then 2% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired thiol 73 were used immediately for the subsequent reaction without concentrating or quantifying.
Synthesis ofdGTP analog:
Figure imgf000087_0001
[00144] HPLC fractions containing thiol 73 (unqualified, -50 μmol) were mixed with HPLC fractions containing SPDP-dGTP (58 μmol). After -30 minutes the reaction was concentrated to remove MeCN, then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield 74, which was used for the subsequent reaction without quantifying.
Figure imgf000087_0002
[00145] Carbamate 74 (unqualified, -40 μmol) was treated with 20% piperidine/MeCN (4 mL) for 15 minutes to remove the Fmoc protecting group. The solvent was removed under reduced pressure, and the residue was then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield the product as a white foam (19.1 μmol, 48%, ε289 = 11900).
Figure imgf000088_0001
[00146] Atto647N NHS ester (0.30 mL, 30 μmol, 0.1 M in anhydrous DMF) was added to a solution of amine 75 (19.1 μmol) in H2O (3 mL) and DMF (0.6 mL). After disappearance of amine by LCMS the crude reaction was HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 15.00 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield amide 76 as a bright blue solid (19 μmol, 99%, ε649 = 150000).
[00147] The schemes above and variations thereof may be utilized for syntheses of derivatives and analogs of the exemplary nucleotide analogs shown above, for example, those having additional amino groups at the Inhibitor end and/or compounds of different linking groups.
[00148] While specific embodiments of the subject invention have been discussed, the above specification is illustrative and not restrictive. Many variations of the invention will become apparent to those skilled in the art upon review of this specification. Contemplated equivalents of the nucleotide analogs disclosed here include compounds which otherwise correspond thereto, and which have the same general properties thereof, wherein one or more simple variations of substituents or components are made which do not adversely affect the characteristics of the nucleotide analogs of interest. In general, the components of the nucleotide analogs disclosed herein may be prepared by the methods illustrated in the general reaction schema as described herein or by modifications thereof, using readily available starting materials, reagents, and conventional synthesis procedures.
Equivalents
[00149] The invention may be embodied in other specific forms without departing from the spirit or essential characteristics thereof. The foregoing embodiments are therefore to be considered in all respects illustrative rather than limiting on the invention described herein. Scope of the invention is thus indicated by the appended claims rather than by the foregoing description, and all changes which come within the meaning and range of equivalency of the claims are therefore intended to be embraced therein.
Incorporation by Reference
[00150] The entire disclosure of each of the publications and patent documents referred to herein is incorporated by reference in its entirety for all purposes to the same extent as if each individual publication or patent document were so individually denoted.

Claims

CLAIMSWe claim:
1. A nucleotide analog of the following Formula II:
Figure imgf000090_0002
wherein
NTP is a nucleoside or nucleotide triphosphate, or an analog thereof, capable of incorporating onto the 3' end of a polynucleotide strand hybridized to a template presenting the complement of the NTP;
L is a detectable label that facilitates the identification of the nucleotide analog;
Inhibitor comprises (a) one or more multiply charged groups or groups capable of becoming multiply charged, or (b) two or more singly charged groups or two or more groups capable of becoming singly charged;
Ri comprises a cleavable bond, which upon cleavage results in de-association of NTP from both L and Inhibitor;
R2 is a tri-valent radical having the formula:
Figure imgf000090_0001
wherein each of R2' and R2" is a bi-valent or tri-valent radical selected from:
-(CH2) X-, -(CH2-O) x-, -(CH2-O) Z-(CH2) y-, -(CH2) Z-(CH2-O) y-, and the same substituted with one or more groups selected from hydroxyl, halogen, amino, thiol, (C1-C6) alkyl, wherein x, y and z are each integers with x and y+z are each from 2 to 10; R3 is a bond or group linking R2 to the Inhibitor moiety; and R4 is a bond or group linking R2 to a L.
2. The nucleotide analog of Claim 1, wherein the Inhibitor comprises a charged group selected from the group consisting of -COOH and -PO4.
3. The nucleotide analog of Claim 1, wherein the Inhibitor comprises at least two -COOH groups.
4. The nucleotide analog of Claim 1 , wherein the Inhibitor does not comprise a -PO4 group.
5. The nucleotide analog of Claim 1, wherein the Inhibitor does not comprise an aryl group.
6. The nucleotide analog of Claim 1, wherein the Inhibitor comprises a group selected from the group consisting of GIu, Asp, Arg, His, Thr, Trp, GIn, Tyr, Pro and Lys.
7. The nucleotide analog of Claim 1 , wherein Ri comprises a C-C triple bond or a trans C-C double bond.
8. The nucleotide analog of Claim 1, wherein Ri comprises a S-S bond.
9. The nucleotide analog of Claim 1, wherein Ri comprises a C-C triple bond and a S-S bond.
10. The nucleotide analog of Claim 1, wherein Ri comprises
Figure imgf000091_0001
wherein R^ is a H or an alkyl group; q and r independently is an integer from about 1 to about 10.
11. The nucleotide analog of Claim 10, wherein q is 1 or 2 and r is 1 , 2 or 3.
12. The nucleotide analog of Claim 1, wherein NTP is selected from dATP, dGTP, dCTP, dTTP, dUTP, ATP, GTP, CTP, TTP, UTP or an analog thereof.
13. The nucleotide analog of Claim 1, wherein the L is an optically-detectable moiety.
14. The nucleotide analog of Claim 13, wherein the optically-detectable moiety comprises a fluorophore.
15. The nucleotide analog of Claim 14, wherein the fluorophore is Cy5 or ATTO 647N.
16. The nucleotide analog of Claim 1, wherein R3 comprises
Figure imgf000092_0001
wherein R is H or alkyl groups, and may together form 3, 4, 5, or 6-member rings.
17. The nucleotide analog of Claim 16, wherein R3 comprises
Figure imgf000092_0002
wherein R is GIu, Asp, Arg, His, Thr, Trp, GIn, Tyr, Pro or Lys, or a peptide of two or more amino acids comprising an amino acid selected from the group consisting of GIu, Asp, Arg, His, Thr, Trp, GIn, Tyr, Pro and Lys.
18. The nucleotide analog of Claim 1, selected from:
Figure imgf000093_0001
19. A method for sequencing a nucleic acid, the method comprising the steps of: exposing a nucleic acid duplex comprising a template portion and a primer portion to a nucleotide analog of the following Formula II:
Figure imgf000094_0002
detecting incorporation of the analog; removing or neutralizing the inhibitor; and repeating the exposing, detecting, and removing steps at least once, thereby to determine the sequence of the template,
wherein
NTP is a nucleoside or nucleotide triphosphate or an analog of either capable of incorporating onto the 3' end of a polynucleotide strand hybridized to a template presenting the complement of the NTP;
L is a detectable label that facilitates the identification of the nucleotide analog;
Inhibitor comprises (a) one or more multiply charged groups or groups capable of becoming multiply charged, or (b) two or more singly charged groups or two or more groups capable of becoming singly charged;
Ri comprises a cleavable bond, which upon cleavage results in de-association of NTP from both L and Inhibitor;
R2 is a tri-valent radical having the formula:
Figure imgf000094_0001
wherein each ofR2' and R2" is a bi-valent or tri-valent radical selected from:
-(CH2) X-, -(CH2-O) x-, -(CH2-O) Z-(CH2) y-, -(CH2) Z-(CH2-O) y-, and the same substituted with one or more groups selected from hydroxyl, halogen, amino, thiol,
(Ci-C6) alkyl, wherein x, y and z are each integers with x and y+z are each from 2 to 10;
R3 is a bond or group linking R2 to the Inhibitor moiety; and
R4 is a bond or group linking R2 to a L.
20. The method of Claim 19, wherein the inhibitor is selected from the group consisting of one or more carboxylic acid, one or more phosphate, one or more amino acid, one or more peptide, one or more sulfate, one or more caproic acid, and any combination thereof.
21. A method for sequencing a nucleic acid, the method comprising the steps of: exposing a nucleic acid duplex comprising a template portion and a primer portion to a nucleotide analog comprising an inhibitor that is charged or capable of becoming charged, and a polymerase, under conditions that permit template-dependent incorporation of the analog into the primer; detecting incorporation of the analog; removing or neutralizing the inhibitor; and repeating the exposing, detecting, and removing steps at least once, thereby to determine the sequence of the template.
22. The method of Claim 21 , wherein the template portion and/or the primer portion is directly or indirectly anchored to a support.
23. The method of Claim 21 , wherein the detecting step comprises detecting individual analogs.
24. The method of Claim 21 further comprising the step of removing unincorporated analog.
25. The method of Claim 21 , wherein the inhibitor is selected from the group consisting of one or more carboxylic acid, one or more phosphate, one or more amino acid, one or more peptide, one or more sulfate, one or more caproic acid, and any combination thereof.
26. The method of Claim 25, wherein the amino acid is a negatively charged amino acid.
27. The method of Claim 26, wherein the amino acid is selected from the group consisting of aspartic acid, glutamic acid, histidine, lysine, and arginine.
28. The method of Claim 25, wherein the peptide is from about 2 to about 10 amino acids in length.
29. The method of Claim 21, wherein the inhibitor comprises multiple charged groups.
30. The method of Claim 21 , wherein the inhibitor is negatively charged.
31. The method of Claim 21 , wherein the inhibitor is positively charged.
32. The method of Claim 21, wherein the inhibitor does not cause steric inhibition of the polymerase.
33. A nucleotide analog, comprising a nucleoside triphosphate; an inhibitor comprising (a) one or more multiply charged groups or groups capable of becoming multiply charged, or (b) two or more singly charged groups or two or more groups capable of becoming singly charged; a detectable label; and a linker connecting the inhibitor and the label to the nucleoside triphosphate.
34. The analog of Claim 33, wherein the linker is cleavable.
35. The analog of Claim 34, wherein after the linker is cleaved, the residual analog has the structure of:
Figure imgf000096_0001
wherein
B1 is selected from the group consisting of purine bases, pyrimidine bases, and derivatives of purine and pyrimidine bases;
R is a N-containing group;
R' is independently selected from the group consisting of -OH, -0-P(O)(OH)2, -0-C(O)- Rx, -NHRy, and an -O-blocking agent, wherein Rx and Ry are alkyl groups;
R" is independently selected from the group consisting of H and -OH; R7 is a phosphodiester or a phosphoryl group; and z is an integer from about 1 to about 5.
36. The analog of Claim 33, wherein the charged groups consist of between about 2 to about 10 charged groups.
37. The analog of Claim 33, wherein the charged groups are selected from any combination of one or more carboxylic acid, one or more phosphate, one or more amino acid, one or more peptide, one or more sulfate, and one or more caproic acid.
38. The analog of Claim 33, wherein the label is optically detectable.
39. The analog of Claim 38, wherein the label is a fluorescent label.
40. The analog of Claim 33, wherein the inhibitor is not a steric inhibitor of a polymerase enzyme.
41. The analog of Claim 33, wherein the nucleoside triphosphate is selected from ATP, GTP, CTP, TTP, UTP, dATP, dGTP, dCTP, dTTP, dUTP, or an analog of any of the foregoing.
42. A nucleotide analog of the following Formula II:
Figure imgf000098_0001
wherein
NTP is a nucleoside or nucleotide triphosphate or an analog of either capable of incorporating onto the 3' end of a polynucleotide strand hybridized to a template presenting the complement of the NTP;
L is a detectable label that facilitates the identification of the nucleotide analog;
Inhibitor comprises (a) one or more multiply charged groups or groups capable of becoming multiply charged, or (b) two or more singly charged groups or two or more groups capable of becoming singly charged;
Ri and R2 are independently a bond or a group, wherein at least one of Ri and R2 comprises a cleavable bond, which upon cleavage results in de-association of NTP from both L and Inhibitor;
R3 is a bond or group linking R2 to the Inhibitor moiety; and
R4 is a bond or group linking R2 to a L.
43. The nucleotide analog of Claim 42, wherein the Inhibitor does not comprise a nucleotide or nucleoside or analogs thereof.
44. The nucleotide analog of Claim 42, wherein the Inhibitor comprises a negatively charged group or a group capable of becoming negatively charged.
45. The nucleotide analog of Claim 42, wherein the Inhibitor comprises a positively charged group or a group capable of becoming positively charged.
46. The nucleotide analog of Claim 42, wherein the Inhibitor comprises two or more charged groups.
47. The nucleotide analog of Claim 42, wherein the Inhibitor comprises a charged group selected from the group consisting of -COOH, -PO4, -SO4, -SO3, -SO2, -NRWRV, where Rw and Rv independently is H, an alkyl or aryl group.
48. The nucleotide analog of Claim 42, wherein the Inhibitor comprises
Figure imgf000099_0001
wherein each Ai and each A2 is independently an amino acid moiety;.
49. The nucleotide analog of Claim 48, wherein Rs and R9 are H atoms and x = 1 and y = 2.
50. The nucleotide analog of Claim 42, wherein the Inhibitor does not comprise a -PO4 group.
51. The nucleotide analog of Claim 42, wherein the Inhibitor does not comprise an aryl group.
52. The nucleotide analog of Claim 42, wherein the Inhibitor comprises an amino acid group or an amino acid analog group.
53. The nucleotide analog of Claim 52, wherein the Inhibitor comprises a peptide of 2 to 20 units of amino acids or analogs.
54. The nucleotide analog of Claim 52, wherein the Inhibitor comprises a group selected from the group consisting of GIu, Asp, Arg, His, Thr, Trp, GIn, Tyr and Lys.
55. The nucleotide analog of Claim 42, wherein R3 comprises
Figure imgf000099_0002
wherein R5 is a H or an alkyl group; p is an integer from 0 to about 10.
56. The nucleotide analog of Claim 55, wherein p is 5 or 6.
57. The nucleotide analog of Claim 42, wherein Ri comprises a C-C triple bond.
58. The nucleotide analog of Claim 42, wherein Ri comprises a S-S bond.
59. The nucleotide analog of Claim 42, wherein Ri comprises a C-C triple bond and a S-S bond.
60. The nucleotide analog of Claim 42, wherein Ri comprises
Figure imgf000100_0001
wherein R6 is a H or an alkyl group; q and r independently is an integer from about 1 to about 10.
61. The nucleotide analog of Claim 60, wherein q is 1 or 2 and r is 1, 2 or 3.
62. The nucleotide analog of Claim 42, wherein NTP is selected from dATP, dGTP, dCTP, dTTP, dUTP, ATP, GTP, CTP, TTP, UTP or an analog thereof.
63. The nucleotide analog of Claim 42, wherein the L is an optically-detectable moiety.
64. The nucleotide analog of Claim 63, wherein the optically-detectable moiety comprises a fluorophore.
65. The nucleotide analog of Claim 64, wherein the fluorophore is Cy5 or ATTO 647N.
66. The nucleotide analog of Claim 42, wherein the group of the Inhibitor that is charged or capable of becoming charged is from about 5 to about 60 bonds away from the NTP.
67. The nucleotide analog of Claim 56, wherein p is 5 and the detectable label comprises ATTO 647N.
68. The nucleotide analog of Claim 42, wherein R3 comprises
Figure imgf000101_0001
wherein k is an integer from about 1 to about 5.
69. The nucleotide analog of Claim 68, wherein the Inhibitor comprises a -COOH group.
70. The nucleotide analog of Claim 69, wherein the Inhibitor comprises two or more -COOH groups.
71. The nucleotide analog of Claim 42, wherein R3 comprises
Figure imgf000101_0002
wherein R . 1 , n R2 are independently H or alkyl groups, and may together form 3, 4, 5, or 6-member rings, and j is an integer from about 1 to about 5.
72. The nucleotide analog of Claim 42, wherein R3 comprises
Figure imgf000101_0003
wherein R , R , R , and R are independently H or alkyl groups, and two or more of which may together form one or more 3, 4, 5, or 6-member rings, and j is an integer from about 1 to about 3.
PCT/US2009/039475 2008-04-04 2009-04-03 Nucleotide analogs WO2009124254A1 (en)

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
PCT/US2008/059446 WO2009123642A1 (en) 2008-04-04 2008-04-04 Nucleotide analogs
USPCT/US2008/059446 2008-04-04
US12/098,196 2008-04-04
US12/098,196 US8071755B2 (en) 2004-05-25 2008-04-04 Nucleotide analogs
US12/244,698 US8114973B2 (en) 2004-05-25 2008-10-02 Nucleotide analogs
US12/244,698 2008-10-02

Publications (1)

Publication Number Publication Date
WO2009124254A1 true WO2009124254A1 (en) 2009-10-08

Family

ID=40886121

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2009/039475 WO2009124254A1 (en) 2008-04-04 2009-04-03 Nucleotide analogs

Country Status (1)

Country Link
WO (1) WO2009124254A1 (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102268054A (en) * 2011-04-12 2011-12-07 宁辉 Beta-cytidine-5 '-triphosphoarginine derivative ester and its preparation method and use
EP2796552A3 (en) * 2013-04-02 2015-03-04 Molecular Assembly, LLC Methods and apparatus for synthesizing nucleic acids
US9279149B2 (en) 2013-04-02 2016-03-08 Molecular Assemblies, Inc. Methods and apparatus for synthesizing nucleic acids
WO2017015538A1 (en) 2015-07-22 2017-01-26 Purdue Research Foundation Modified glucagon molecules
US9771613B2 (en) 2013-04-02 2017-09-26 Molecular Assemblies, Inc. Methods and apparatus for synthesizing nucleic acid
WO2019105421A1 (en) * 2017-11-30 2019-06-06 深圳市瀚海基因生物科技有限公司 Nucleoside analogue, preparation method and application
WO2019231617A1 (en) * 2018-05-29 2019-12-05 Elitechgroup, Inc. Carborhodamine compounds and methods of preparation thereof
US10683536B2 (en) 2013-04-02 2020-06-16 Molecular Assemblies, Inc. Reusable initiators for synthesizing nucleic acids
US11331643B2 (en) 2013-04-02 2022-05-17 Molecular Assemblies, Inc. Reusable initiators for synthesizing nucleic acids
US11384377B2 (en) 2013-04-02 2022-07-12 Molecular Assemblies, Inc. Reusable initiators for synthesizing nucleic acids

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2005044836A2 (en) * 2003-11-05 2005-05-19 Genovoxx Gmbh Macromolecular nucleotide compounds and methods for using the same
WO2007062160A2 (en) * 2005-11-22 2007-05-31 Helicos Biosciences Corporation Methods and compositions for sequencing a nucleic acid
US20070128614A1 (en) * 2005-12-06 2007-06-07 Liu David R Nucleotide analogs
WO2008137661A1 (en) * 2007-05-03 2008-11-13 Helicos Biosciences Corporation Methods and compositions for sequencing a nucleic acid
WO2008144315A1 (en) * 2007-05-14 2008-11-27 Helicos Biosciences Corporation Methods and compositions for sequencing a nucleic acid
US20090061437A1 (en) * 2004-05-25 2009-03-05 Helicos Biosciences Corporation Nucleotide Analogs

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2005044836A2 (en) * 2003-11-05 2005-05-19 Genovoxx Gmbh Macromolecular nucleotide compounds and methods for using the same
US20090061437A1 (en) * 2004-05-25 2009-03-05 Helicos Biosciences Corporation Nucleotide Analogs
WO2007062160A2 (en) * 2005-11-22 2007-05-31 Helicos Biosciences Corporation Methods and compositions for sequencing a nucleic acid
US20070128614A1 (en) * 2005-12-06 2007-06-07 Liu David R Nucleotide analogs
WO2008137661A1 (en) * 2007-05-03 2008-11-13 Helicos Biosciences Corporation Methods and compositions for sequencing a nucleic acid
WO2008144315A1 (en) * 2007-05-14 2008-11-27 Helicos Biosciences Corporation Methods and compositions for sequencing a nucleic acid

Cited By (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102268054A (en) * 2011-04-12 2011-12-07 宁辉 Beta-cytidine-5 '-triphosphoarginine derivative ester and its preparation method and use
CN102268054B (en) * 2011-04-12 2012-11-21 宁辉 Beta-cytidine-5 '-triphosphoarginine derivative ester and its preparation method and use
US9695470B2 (en) 2013-04-02 2017-07-04 Molecular Assemblies, Inc. Methods and apparatus for synthesizing nucleic acids
US9279149B2 (en) 2013-04-02 2016-03-08 Molecular Assemblies, Inc. Methods and apparatus for synthesizing nucleic acids
EP3115462A1 (en) * 2013-04-02 2017-01-11 Molecular Assemblies, Inc. Methods and apparatus for synthesizing nucleic acids
US11331643B2 (en) 2013-04-02 2022-05-17 Molecular Assemblies, Inc. Reusable initiators for synthesizing nucleic acids
US9771613B2 (en) 2013-04-02 2017-09-26 Molecular Assemblies, Inc. Methods and apparatus for synthesizing nucleic acid
US10041110B2 (en) 2013-04-02 2018-08-07 Molecular Assemblies, Inc. Methods and apparatus for synthesizing nucleic acids
EP3425053A1 (en) * 2013-04-02 2019-01-09 Molecular Assemblies, Inc. Methods and apparatus for synthesizing nucleic acid
EP2796552A3 (en) * 2013-04-02 2015-03-04 Molecular Assembly, LLC Methods and apparatus for synthesizing nucleic acids
US10683536B2 (en) 2013-04-02 2020-06-16 Molecular Assemblies, Inc. Reusable initiators for synthesizing nucleic acids
US11384377B2 (en) 2013-04-02 2022-07-12 Molecular Assemblies, Inc. Reusable initiators for synthesizing nucleic acids
WO2017015538A1 (en) 2015-07-22 2017-01-26 Purdue Research Foundation Modified glucagon molecules
US11472857B2 (en) 2015-07-22 2022-10-18 Purdue Research Foundation Modified glucagon molecules
US10954283B2 (en) 2015-07-22 2021-03-23 Purdue Research Foundation Modified glucagon molecules
WO2019105421A1 (en) * 2017-11-30 2019-06-06 深圳市瀚海基因生物科技有限公司 Nucleoside analogue, preparation method and application
CN111741967A (en) * 2017-11-30 2020-10-02 深圳市真迈生物科技有限公司 Nucleoside analogue, preparation method and application
US11512106B2 (en) 2017-11-30 2022-11-29 Genemind Biosciences Company Limited Nucleoside analogue, preparation method and application
US11155713B2 (en) 2018-05-29 2021-10-26 Elitechgroup, Inc. Carborhodamine compounds and methods of preparation thereof
WO2019231617A1 (en) * 2018-05-29 2019-12-05 Elitechgroup, Inc. Carborhodamine compounds and methods of preparation thereof

Similar Documents

Publication Publication Date Title
US8071755B2 (en) Nucleotide analogs
US8114973B2 (en) Nucleotide analogs
WO2009124254A1 (en) Nucleotide analogs
EP1497304B1 (en) Dual-labeled nucleotides
EP2057175A2 (en) Nucleotide analogs
US7994304B2 (en) Methods and compositions for sequencing a nucleic acid
EP1141409B2 (en) A kit and methods for nucleic acid sequencing of single molecules by polymerase synthesis
EP3091026B1 (en) Disulfide-linked reversible terminators
US7476734B2 (en) Nucleotide analogs
WO2007062160A2 (en) Methods and compositions for sequencing a nucleic acid
WO2008016906A2 (en) 3'-phosphate-labeled nucleotide analogs and their use for sequencing a nucleic acid
WO2008137661A1 (en) Methods and compositions for sequencing a nucleic acid
AU2018298847B2 (en) Short pendant arm linkers for nucleotides in sequencing applications
AU2006297104A1 (en) Labeled nucleotide analogs and uses therefor
WO2009123642A1 (en) Nucleotide analogs
WO2008016907A1 (en) Nucleotide analogs
NZ759350B2 (en) Short pendant arm linkers for nucleotides in sequencing applications

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 09727687

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 09727687

Country of ref document: EP

Kind code of ref document: A1