WO2021023252A1 - Acridinium ester-containing compounds and methods of using the same for chemiluminescence-based one-color sequencing - Google Patents

Acridinium ester-containing compounds and methods of using the same for chemiluminescence-based one-color sequencing Download PDF

Info

Publication number
WO2021023252A1
WO2021023252A1 PCT/CN2020/107298 CN2020107298W WO2021023252A1 WO 2021023252 A1 WO2021023252 A1 WO 2021023252A1 CN 2020107298 W CN2020107298 W CN 2020107298W WO 2021023252 A1 WO2021023252 A1 WO 2021023252A1
Authority
WO
WIPO (PCT)
Prior art keywords
substituted
compound
group
unsubstituted
hydrogen
Prior art date
Application number
PCT/CN2020/107298
Other languages
French (fr)
Inventor
Yan Chen
Handong Li
Original Assignee
Bgi Shenzhen Co., Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Bgi Shenzhen Co., Ltd. filed Critical Bgi Shenzhen Co., Ltd.
Priority to CN202080056098.6A priority Critical patent/CN114245830A/en
Priority to EP20849079.7A priority patent/EP4010484A4/en
Publication of WO2021023252A1 publication Critical patent/WO2021023252A1/en

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07DHETEROCYCLIC COMPOUNDS
    • C07D401/00Heterocyclic compounds containing two or more hetero rings, having nitrogen atoms as the only ring hetero atoms, at least one ring being a six-membered ring with only one nitrogen atom
    • C07D401/02Heterocyclic compounds containing two or more hetero rings, having nitrogen atoms as the only ring hetero atoms, at least one ring being a six-membered ring with only one nitrogen atom containing two hetero rings
    • C07D401/12Heterocyclic compounds containing two or more hetero rings, having nitrogen atoms as the only ring hetero atoms, at least one ring being a six-membered ring with only one nitrogen atom containing two hetero rings linked by a chain containing hetero atoms as chain links
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N21/00Investigating or analysing materials by the use of optical means, i.e. using sub-millimetre waves, infrared, visible or ultraviolet light
    • G01N21/75Systems in which material is subjected to a chemical reaction, the progress or the result of the reaction being investigated
    • G01N21/76Chemiluminescence; Bioluminescence
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869Methods for sequencing
    • C12Q1/6874Methods for sequencing involving nucleic acid arrays, e.g. sequencing by hybridisation

Definitions

  • acridinium ester-containing compounds Disclosed herein are acridinium ester-containing compounds, and methods of using the acridinium ester-containing compounds in chemiluminescence-based one-color nucleic acid sequencing.
  • Chemiluminescence refers to a chemical reaction resulting in the production of light. When applied to a sequencing platform, chemiluminescence has limited value due to the structure design of currently available chemiluminescence molecules. Currently available chemiluminescence molecules, such as luminol, hydrogen peroxide, fluorescein, dioxetanes, and oxalate derivatives, suffer from signal diffusion due to the signaling molecule being separated from the tri-phosphate reversible terminator after excitation. Therefore, chemiluminescence is not typically used in sequencing methods, which require signal localization.
  • AE compounds novel acridinium-ester-containing compounds
  • R 1 , R 2 , and R 3 are each independently absent or a linking group.
  • the linking group is selected from the group consisting of substituted or unsubstituted alkoxy, substituted or unsubstituted alkyl, substituted or unsubstituted alkenyl, substituted or unsubstituted alkynyl, and substituted or unsubstituted aryl.
  • R 4 , R 5 , and R 6 are each independently selected from the group consisting of hydrogen, substituted or unsubstituted C 1- 8 alkyl, substituted or unsubstituted C 2-8 alkenyl, and substituted or unsubstituted C 2-8 alkynyl.
  • Each R 7 is independently absent, hydrogen, a protein (e.g., an antibody) or a nucleotide moiety of the following formula:
  • R 8 is a nitrogenous base and R 9 is hydrogen or a blocking group; and X - is a counteranion.
  • R 7 present in the compound is a nucleotide moiety.
  • R 4 , R 5 , and R 6 are not substituted with the nucleotide moiety.
  • R 4 , R 5 , and R 6 are each independently selected from the group consisting of hydrogen and substituted or unsubstituted C 1-6 alkyl.
  • R 4 , R 5 , and R 6 are each independently selected from hydrogen and methyl.
  • R 4 is methyl
  • R 5 is methyl
  • R 6 is hydrogen.
  • each of R 4 , R 5 , and R 6 is hydrogen.
  • R 7 is a nucleotide moiety as shown above.
  • R 8 can be a nitrogenous base selected from the group consisting of:
  • R 9 may be a blocking group selected from the group consisting of:
  • compositions comprising a mixture of deoxyribonucleotide triphosphates (dNTPs) , the composition comprising: (i) first dNTPs conjugated to a first member of an AE pair; (ii) second dNTPs conjugated to a second member of the AE pair; (iii) third dNTPs conjugated to both the first member and the second member; and (iv) fourth dNTPs conjugated to neither the first member or the second member, wherein each of the first, second, third, and fourth dNTPs is selected from the group consisting of dATP, dTTP, dCTP, and dGTP, and are different from each other; and wherein the first member and second member have distinguishable properties.
  • the first member and the second member have minimal cross talk.
  • the dNTPs are reversible terminator dNTPs comprising cleavable blocking groups.
  • the methods can comprise i) providing an array of immobilized template DNA strands annealed to a primer or primer extension product; ii) contacting the array of (i) with the composition described above in the presence of a DNA polymerase under conditions in which the primers or primer extension products are extended by incorporation of a dNTP; iii) contacting the array with a first solution and capturing a first image; iv) contacting the array with a second solution and capturing a second image; and v) comparing the first and second images to identify bases of the plurality of template DNA strands.
  • Figure 1 shows surfactant affects initial signal intensity for two AE compounds.
  • AE means “AE-H”
  • AE (D) means “AE-D” .
  • alkyl, alkenyl, and alkynyl include straight-and branched-chain monovalent substituents. Examples include methyl, ethyl, isobutyl, 3-butynyl, and the like. Ranges of these groups useful with the compounds and methods described herein include C 1 -C 20 alkyl, C 2 -C 20 alkenyl, and C 2 -C 20 alkynyl.
  • Additional ranges of these groups useful with the compounds and methods described herein include C 1 -C 12 alkyl, C 2 -C 12 alkenyl, C 2 -C 12 alkynyl, C 1 -C 6 alkyl, C 2 -C 6 alkenyl, C 2 -C 6 alkynyl, C 1 -C 4 alkyl, C 2 -C 4 alkenyl, and C 2 -C 4 alkynyl.
  • Aryl molecules include, for example, cyclic hydrocarbons that incorporate one or more planar sets of, typically, six carbon atoms that are connected by delocalized electrons numbering the same as if they consisted of alternating single and double covalent bonds.
  • An example of an aryl molecule is benzene.
  • Heteroaryl molecules include substitutions along their main cyclic chain of atoms such as O, N, or S. When heteroatoms are introduced, a set of five atoms, e.g., four carbon and a heteroatom, can create an aromatic system. Examples of heteroaryl molecules include furan, pyrrole, thiophene, imadazole, oxazole, pyridine, and pyrazine.
  • Aryl and heteroaryl molecules can also include additional fused rings, for example, benzofuran, indole, benzothiophene, naphthalene, anthracene, and quinoline.
  • the aryl and heteroaryl molecules can be attached at any position on the ring, unless otherwise noted.
  • alkoxy as used herein is an alkyl group bound through a single, terminal ether linkage.
  • aryloxy as used herein is an aryl group bound through a single, terminal ether linkage.
  • amine or amino as used herein are represented by the formula -NZ 1 Z 2 , where Z 1 and Z 2 can each be a substitution group as described herein, such as hydrogen, an alkyl, halogenated alkyl, alkenyl, alkynyl, aryl, heteroaryl, cycloalkyl, cycloalkenyl, heterocycloalkyl, or heterocycloalkenyl group described above.
  • alkoxy, aryloxy, amino, alkyl, alkenyl, alkynyl, and aryl molecules used herein can be substituted or unsubstituted.
  • substituted includes the addition of a substitution group to a position attached to the main chain of the alkoxy, aryloxy, amino, alkyl, alkenyl, alkynyl, and aryl, e.g., the replacement of a hydrogen by a substitution group.
  • substitution groups include, but are not limited to, hydroxyl, halogen (e.g., F, Br, Cl, or I) , and carboxyl groups.
  • the term unsubstituted indicates the alkoxy, aryloxy, amino, alkyl, alkenyl, alkynyl, or aryl has a full complement of hydrogens, i.e., commensurate with its saturation level, with no substitutions, e.g., linear decane (– (CH 2 ) 9 –CH 3 ) .
  • nucleotide moiety refers to a group that includes a nitrogenous base, a sugar component (e.g., a 5-carbon sugar) or a derivative thereof, and at least one phosphate group.
  • acridinium-ester containing compounds described herein include compounds represented by Formula I:
  • R 1 , R 2 , and R 3 are each independently absent or a linking group.
  • linking group refers to a moiety positioned between a location on the acridinium moiety and the R 7 group, which is further defined below.
  • the linking group is selected from the group consisting of substituted or unsubstituted alkoxy, substituted or unsubstituted alkyl (e.g., C 1-8 substituted or unsubstituted alkyl) , substituted or unsubstituted alkenyl (e.g., C 2-8 substituted or unsubstituted alkenyl) , substituted or unsubstituted alkynyl (e.g., C 3-8 substituted or unsubstituted alkynyl) , and substituted or unsubstituted aryl.
  • R 1 , R 2 , and/or R 3 is absent, meaning a linking group is not present as R 1 , R 2 , and/or R 3 .
  • R 7 is attached directly to the acridinium moiety.
  • R 4 , R 5 , and R 6 are each independently selected from the group consisting of hydrogen, substituted or unsubstituted C 1-8 alkyl, substituted or unsubstituted C 2- 8 alkenyl, and substituted or unsubstituted C 2-8 alkynyl.
  • R 4 , R 5 , and R 6 are not substituted with the nucleotide moiety.
  • R 4 , R 5 , and R 6 are each independently selected from the group consisting of hydrogen and substituted or unsubstituted C 1-6 alkyl.
  • R 4 , R 5 , and R 6 are each independently selected from hydrogen and methyl.
  • R 4 and R 5 are the same (e.g., both R 4 and R 5 are hydrogen or both R 4 and R 5 are substituted or unsubstituted C 1-8 alkyl, such as methyl) .
  • R 4 is methyl
  • R 5 is methyl
  • R 6 is hydrogen.
  • each of R 4 , R 5 , and R 6 is hydrogen.
  • each R 7 is independently absent, hydrogen, a protein (e.g., an antibody) , or a nucleotide moiety represented by Structure A:
  • R 8 is a nitrogenous base, as further defined below, and R 9 is hydrogen or a blocking group, as further defined below.
  • only one of the three R 7 groups present in the compounds according to Formula I is a nucleotide moiety represented by Structure A.
  • the R 7 group linked to R 1 is a nucleotide moiety
  • the R 7 groups linked to R 2 and R 3 are independently absent or hydrogen.
  • the R 7 groups linked to R 1 and R 3 are independently absent or hydrogen.
  • the R 7 groups linked to R 1 and R 2 are independently absent or hydrogen.
  • R 7 is a nucleotide moiety.
  • R 8 can be a nitrogenous base.
  • Exemplary nitrogenous bases include adenine (A) , cytosine (C) , guanine (G) , thymine (T) , uracil (U) , inosine (I) , and derivatives of these. Exemplary nitrogenous bases are shown below:
  • the nitrogenous base is a 7-deaza derivative of adenine, inosine, or guanine.
  • the 7-deaza adenine, inosine, and/or guanine derivatives are linked to the alkyne group in R 7 through the 7-position (i.e., the 7-deaza adenine, inosine, and/or guanine derivatives are 7-substituted with R 7 through R 7 ’s alkyne group) .
  • uracil, cytosine, thymine, and/or derivatives thereof are 5-substituted.
  • the 5-substituted uracil, cytosine, thymine, and/or derivatives thereof can be attached to the alkyne group in R 7 through the 5-position (i.e., the 5-substituted uracil, cytosine, thymine, and/or derivatives thereof are substituted with R 7 through R 7 ’s alkyne group) .
  • R 9 can be hydrogen or a blocking group.
  • blocking group refers to any group that can be cleaved to provide a hydroxyl group at the 3’-position of the nucleotide analogue.
  • the blocking group can be cleavable by physical means, chemical means, heat, and/or light.
  • the blocking group is cleavable by enzymatic means.
  • the blocking group is an azido-containing blocking group (e.g., –CH 2 N 3 ) .
  • the blocking group is an amino-containing blocking group (e.g., –NH 2 ) .
  • the blocking group is an alkoxy-containing blocking group (e.g., –CH 2 OCH 3 ) or an aryloxy-containing blocking group (e.g., –CH 2 OPh) .
  • the blocking group is polyethylene glycol (PEG) .
  • the blocking group is a substituted or unsubstituted alkyl (i.e., a substituted or unsubstituted hydrocarbon) .
  • R 9 can be a group as shown below:
  • one or more R 7 groups can be a protein.
  • one or more R 7 groups can be an antibody or other protein capable of recognizing the nucleotide.
  • X - is a counteranion for the quaternary nitrogen of the acridinium moiety.
  • exemplary counteranions include, for example, halides (e.g., Cl - , Br - , I - , F - ) , CH 3 SO 4 - , FSO 3 - , CF 3 SO 3 - , C 4 F 9 SO 3 - , CH 3 C 6 H 4 SO 3 - , CF 3 COO - , CH 3 COO - , and NO 3 - .
  • the counteranion can be directly bonded to a substituent in Formula I.
  • acridinium-ester containing compounds according to Formula I include the following:
  • Formula I-A and Formula I-B X - , R 1 , R 2 , R 3 , and each R 7 are as defined above for Formula I.
  • Formula I-A may be referred to as “AE-D” (AE-dimethyl) and Formula I-B may be referred to as “AE-H” (AE-hydrogen) .
  • the compounds according to Formula I-B are represented by one of the following:
  • the compounds according to Formula I-A are represented by one of the following:
  • the compounds described herein can be prepared in a variety of ways known in the art of organic synthesis or variations thereon as appreciated by those skilled in the art.
  • the compounds described herein can be prepared from readily available starting materials. Optimum reaction conditions may vary with the particular reactants or solvents used, but such conditions can be determined by one skilled in the art.
  • Variations on Formula I and the compounds described herein include the addition, subtraction, or movement of the various constituents as described for each compound. Similarly, when one or more chiral centers are present in a molecule, the chirality of the molecule can be changed. Additionally, compound synthesis can involve the protection and deprotection of various chemical groups. The use of protection and deprotection and the selection of appropriate protecting groups can be determined by one skilled in the art. The chemistry of protecting groups can be found, for example, in Wuts and Greene, Protective Groups in Organic Synthesis, 4th Ed., Wiley & Sons, 2006, which is incorporated herein by reference in its entirety. The synthesis and subsequent testing of various compounds as described herein to determine efficacy is contemplated.
  • Reactions to produce the compounds described herein can be carried out in solvents, which can be selected by one of skill in the art of organic synthesis. Solvents can be substantially nonreactive with the starting materials (reactants) , the intermediates, or products under the conditions at which the reactions are carried out, i.e., temperature and pressure. Reactions can be carried out in one solvent or a mixture of more than one solvent. Product or intermediate formation can be monitored according to any suitable method known in the art.
  • product formation can be monitored by spectroscopic means, such as nuclear magnetic resonance spectroscopy (e.g., 1 H or 13 C) infrared spectroscopy, spectrophotometry (e.g., UV-visible) , or mass spectrometry (MS) , or by chromatography such as high performance liquid chromatography (HPLC) or thin layer chromatography (TLC) .
  • spectroscopic means such as nuclear magnetic resonance spectroscopy (e.g., 1 H or 13 C) infrared spectroscopy, spectrophotometry (e.g., UV-visible) , or mass spectrometry (MS)
  • chromatography such as high performance liquid chromatography (HPLC) or thin layer chromatography (TLC) .
  • the compounds described herein can be synthesized by using commercially available dNTPs.
  • the commercially available dTNPs can be conjugated to a linker or a binding molecule through a phosphate conjugation reaction.
  • the 3’-position of the dNTPs can be blocked by reacting the compounds with a protecting group.
  • the conjugating and blocking reactions can be performed in any order.
  • an acridinium ester compound as described herein (Structure 1) is exposed to a peroxide source.
  • the hydrogen peroxide anion then attacks the acridinium moiety to form Structure 2, which triggers an intramolecular nucleophilic attack on the carbonyl group to form the dioxetane intermediate Structure 3.
  • Structure 3 is highly strained and decomposes to form Structure 4 and emit light.
  • Structure 4, which is the acridinium moiety after excitation (as indicated by the asterisk) remains linked to the tri-phosphate reversible terminator (i.e., R 7 ) .
  • AE pairs with different signal producing properties find use in nucleic acid sequencing methods, such as sequencing-by-synthesis (SBS) methods.
  • the AE pair comprises a first compound of Formula I-A (AE-D) and a second compound of Formula I-B (AE-H) , such as Compound 2B and Compound 1B or Compound 2A and Compound 1A.
  • Suitable AE pairs can be determined using methods known to those of ordinary skill in the art.
  • AEs include light emission at different initial signal intensities when in the presence of peroxide (e.g., hydrogen peroxide or a hydrogen peroxide-urea complex) and surfactants (e.g., Triton X-100 (4- (1, 1, 3, 3-tetramethylbutyl) phenyl-polyethylene glycol) or Triton X-114 ( (1, 1, 3, 3-tetramethylbutyl) phenyl-polyethylene glycol) ) .
  • peroxide e.g., hydrogen peroxide or a hydrogen peroxide-urea complex
  • surfactants e.g., Triton X-100 (4- (1, 1, 3, 3-tetramethylbutyl) phenyl-polyethylene glycol) or Triton X-114 ( (1, 1, 3, 3-tetramethylbutyl) phenyl-polyethylene glycol)
  • AE-H and AE-D may emit light at high or very low (e.g., zero) initial signal
  • Table 1 shows exemplary conditions in which an AE pair (AE-D and AE-H) emit light at different intensities. Desirably, and as shown in Table 1 and Figure 1, the two AE members of an AE pair have minimal cross-talk under a predetermined conditions.
  • the predetermined conditions comprise a peroxide concentration and a surfactant type and concentration.
  • “minimal cross talk” means that under a first predetermined condition the initial signal intensity from a first member of an AE pair is at least 10, at least 50, at least 100, at least 500, or at least 1000-fold greater than the initial signal intensity of a second member of the AE pair under the first predetermined condition, and that under a second predetermined condition the initial signal intensity from the second first member of the AE pair is at least 10, at least 50, at least 100, at least 500, or at least 1000-fold greater than the initial signal intensity of the first member of the AE pair under the second predetermined condition.
  • minimal cross talk means that under a first predetermined condition the initial signal intensity from a first member of an AE pair is at least 50-fold greater than the initial signal intensity of a second member of the AE pair under the first predetermined condition, and that under a second predetermined condition the initial signal intensity from the second first member of the AE pair is at least 50-fold greater than the initial signal intensity of the first member of the AE pair under the second predetermined condition.
  • Table 1 and Figure 1 illustrate conditions under which the AE pair comprising AE-D (identified as “AE (D) ” in Figure 1) and AE-H (identified as “AE” in Figure 1) have minimal cross-talk.
  • the AE pair used in this illustration are Compound 2B and Compound 1B.
  • first and second predetermined conditions may include peroxide concentration and surfactant type and concentration.
  • Exemplary peroxide concentrations may range from, for example and not limitation, 0.0005 %to 0.010 % (e.g., 0.005 %) , sometimes referred to as “low concentration, ” and from 0.011 %to 1.0 % (e.g., 0.5 %) , sometimes referred to as “high concentration. ”
  • Exemplary surfactants include cationic, anionic, and nonionic surfactants.
  • the surfactants can include Triton X-100 (4- (1, 1, 3, 3-tetramethylbutyl) phenyl-polyethylene glycol) , cetyltrimethylammonium tosylate (CTAT) , other surfactants shown in Table 2, and combinations of surfactants.
  • Optimal concentrations of each surfactant for use with a given AE pair may be determined empirically.
  • Useful concentrations of CTAT include the range 1.0 mM to 10.0 mM (e.g., 7 mM)
  • useful concentrations for Triton X-100 include 0.5 %to 5 % (e.g., 2 %) .
  • Other exemplary concentrations for surfactants are shown in Table 2.
  • Acridinium ester-tri-phosphate reversible terminator conjugates may be used in nucleic acid sequencing methods.
  • the sequencing method is massively parallel sequencing-by-synthesis (SBS) .
  • SBS nucleic acid sequencing-by-synthesis
  • NGS next generation sequencing
  • SBS methods use 3’ blocked dNTPs with reversible terminator dyes, including but not limited to dyes as described in US Published Patent Application No. 2010/00317531 (incorporated herein by reference) .
  • SBS methods are carried out using an ordered or patterned array in a flow-cell.
  • Acridinium ester-tri-phosphate reversible terminator conjugates (AE pairs) disclosed herein find particular use in one-color (also called one-channel) sequencing.
  • SBS may be carried out using clonal populations of template sequences prepared by bridge PCR, emulsion PCR, production of concatemers (DNA nanoballs) and other methods.
  • one-color sequencing refers to massively parallel sequencing methods in which four bases can be distinguished based on differential emission of light under two different conditions.
  • the two conditions are exposure to two chemical environments.
  • the first condition and second condition are not different based on disassociation (e.g., by cleavage) or association (e.g., through a ligand antiligand interaction) of labels with nucleotides incorporated into a primer extension product or growing strand such as produced in a nucleic acid sequencing method.
  • the light emitted under the two conditions may be the same wavelength, different wavelengths, or approximately the same wavelength, e.g., differing less than 500 nm, 200 nm or 100 nM. See U.S. Pat.
  • the sequencing methods described herein make use of acridinium ester-tri-phosphate reversible terminator conjugates described herein.
  • a mixture of four nucleotide analogs (dATP, dTTP, dCTP, dGTP) is used.
  • dATP dATP
  • dTTP dTTP
  • dCTP dCTP
  • dGTP dGTP
  • one dNTP analog is labeled with (conjugated to) AE-D
  • a second dNTP analog is labeled with AE-H
  • a third dNTP analog is dual labeled with AE-D and AE-H
  • a fourth dNTP is not labeled with an AE.
  • the dual-labeled dNTP may be labeled in a variety of ways.
  • some or all of the dual-labeled dNTP molecules are physically labeled with two different labels (AE-D and AE-H) .
  • a mixture of the double labeled dNTP molecules e.g., dTTP
  • some dNTPs are labeled with AE-D and some are labeled with AE-H.
  • the mixture includes about equimolar amounts of AE-D labeled and AE-H labeled molecules.
  • the reversible terminator conjugates described herein are used in sequencing-by-synthesis (SBS) methods.
  • SBS sequencing-by-synthesis
  • generally reversible terminator dNTPs comprise blocking groups that are removed ( “deblocking” ) at the end of each sequencing cycle to regenerate a 3’-OH on the deoxyribose sugar and permit incorporation of the nucleotide complementary to the template nucleotide at the next sequencing position.
  • one nucleotide is incorporated in each sequencing cycle.
  • a reversible terminator dNTP is incorporated into a growing strand, the incorporation is detected, and the terminator is removed ( “deblocking” ) .
  • the incorporated dNTP is exposed to two predetermined conditions in each cycle (or one predetermined condition in each half-cycle) .
  • an array of clonal clusters (e.g., produced by bridge PCR) , single molecules, DNA nanoballs, or the like, is produced in a sequencing flow cell, comprising (1) a substrate on which target molecules (e.g., target amplicons) are immobilized and (2) a fluidic system that allows reagents, enzymes, buffers, washes, and the like to be delivered to the substrate.
  • target molecules e.g., target amplicons
  • it is a patterned flow cell.
  • a first solution is flowed through the flow cell (e.g., low H 2 O 2 plus CTAT) and an image is collected, and then a second solution is flowed through the flow cell (e.g., high H 2 O 2 plus Triton) and a second image is collected.
  • the pair of images is compared and the identity of the incorporated dNTP is determined as discussed below and described in Tables 3 and 4.
  • an “image” or “imaging” refers to collection of data (light emission intensity) and is not limited to a particular system or format of collection.
  • images may be collected using CCD cameras, complementary metal-oxide semiconductor (CMOS) chips, and other detectors.
  • CMOS complementary metal-oxide semiconductor
  • “Comparing” images generally comprises analysis of signal date using a computer.
  • Each flow step may be continuous or discontinuous and optionally buffer washes may be carried out before, between, or after exposure to each solution.
  • the first and second images are compared and changes in signal emissions of one or more AE-rtNTPs are determined. By comparing the signals at each position on an array, the incorporated nucleotide may be identified.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Wood Science & Technology (AREA)
  • Analytical Chemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Immunology (AREA)
  • Biochemistry (AREA)
  • Zoology (AREA)
  • Microbiology (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • Biotechnology (AREA)
  • Genetics & Genomics (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Plasma & Fusion (AREA)
  • General Physics & Mathematics (AREA)
  • Pathology (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

Acridinium ester-containing compounds are provided herein. Also provided herein are chemiluminescence-based one-color sequencing methods and methods of using the acridinium ester-containing compounds in chemiluminescence-based one-color sequencing.

Description

ACRIDINIUM ESTER-CONTAINING COMPOUNDS AND METHODS OF USING THE SAME FOR CHEMILUMINESCENCE-BASED ONE-COLOR SEQUENCING
CROSS-REFERENCE TO RELATED APPLICATION
This application claims priority to U.S. Provisional Patent Application No. 62/883,760, filed August 7, 2019. This application is incorporated herein by reference in its entirety for all purposes.
FIELD
Disclosed herein are acridinium ester-containing compounds, and methods of using the acridinium ester-containing compounds in chemiluminescence-based one-color nucleic acid sequencing.
BACKGROUND
Chemiluminescence refers to a chemical reaction resulting in the production of light. When applied to a sequencing platform, chemiluminescence has limited value due to the structure design of currently available chemiluminescence molecules. Currently available chemiluminescence molecules, such as luminol, hydrogen peroxide, fluorescein, dioxetanes, and oxalate derivatives, suffer from signal diffusion due to the signaling molecule being separated from the tri-phosphate reversible terminator after excitation. Therefore, chemiluminescence is not typically used in sequencing methods, which require signal localization.
SUMMARY
Described herein are novel acridinium-ester-containing compounds (AE compounds) of the following formula:
Figure PCTCN2020107298-appb-000001
wherein R 1, R 2, and R 3 are each independently absent or a linking group. In some embodiments, the linking group is selected from the group consisting of substituted or unsubstituted alkoxy, substituted or unsubstituted alkyl, substituted or unsubstituted alkenyl, substituted or unsubstituted alkynyl, and substituted or unsubstituted aryl. R 4, R 5, and R 6 are each independently selected from the group consisting of hydrogen, substituted or unsubstituted C 1- 8 alkyl, substituted or unsubstituted C 2-8 alkenyl, and substituted or unsubstituted C 2-8 alkynyl. Each R 7 is independently absent, hydrogen, a protein (e.g., an antibody) or a nucleotide moiety of the following formula:
Figure PCTCN2020107298-appb-000002
wherein R 8 is a nitrogenous base and R 9 is hydrogen or a blocking group; and X -is a counteranion. In the compound, only one R 7 present in the compound is a nucleotide moiety.
Optionally, R 4, R 5, and R 6 are not substituted with the nucleotide moiety. In some cases, R 4, R 5, and R 6 are each independently selected from the group consisting of hydrogen and substituted or unsubstituted C 1-6 alkyl. In some cases, R 4, R 5, and R 6 are each independently selected from hydrogen and methyl. Optionally, R 4 is methyl, R 5 is methyl, and R 6 is hydrogen. Optionally, each of R 4, R 5, and R 6 is hydrogen.
In some cases, R 7 is a nucleotide moiety as shown above. In these cases, R 8 can be a nitrogenous base selected from the group consisting of:
Figure PCTCN2020107298-appb-000003
Optionally, R 9 can be a blocking group, wherein the blocking group is selected from the group consisting of –CH 2N 3, –NH 2, –CH 2CH=CH 2, –CH 2OCH 3, polyethylene glycol, and a substituted or unsubstituted alkyl. Optionally R 9 may be a blocking group selected from the group consisting of:
Figure PCTCN2020107298-appb-000004
Figure PCTCN2020107298-appb-000005
Also described herein are compositions comprising a mixture of deoxyribonucleotide triphosphates (dNTPs) , the composition comprising: (i) first dNTPs conjugated to a first member of an AE pair; (ii) second dNTPs conjugated to a second member of the AE pair; (iii) third dNTPs conjugated to both the first member and the second member; and (iv) fourth dNTPs conjugated to neither the first member or the second member, wherein each of the first, second, third, and fourth dNTPs is selected from the group consisting of dATP, dTTP, dCTP, and dGTP, and are different from each other; and wherein the first member and second member have distinguishable properties. In some embodiments,  the first member and the second member have minimal cross talk. Optionally, the dNTPs are reversible terminator dNTPs comprising cleavable blocking groups.
Further described herein are methods for identifying bases of a plurality of template DNA strands having different sequences. The methods can comprise i) providing an array of immobilized template DNA strands annealed to a primer or primer extension product; ii) contacting the array of (i) with the composition described above in the presence of a DNA polymerase under conditions in which the primers or primer extension products are extended by incorporation of a dNTP; iii) contacting the array with a first solution and capturing a first image; iv) contacting the array with a second solution and capturing a second image; and v) comparing the first and second images to identify bases of the plurality of template DNA strands.
The details of one or more embodiments are set forth in the drawings and the description below. Other features, objects, and advantages will be apparent from the description and drawings, and from the claims.
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1 shows surfactant affects initial signal intensity for two AE compounds. In Figure 1, “AE” means “AE-H” and “AE (D) ” means “AE-D” .
DETAILED DESCRIPTION
I. DEFINITIONS
As used herein, the terms alkyl, alkenyl, and alkynyl include straight-and branched-chain monovalent substituents. Examples include methyl, ethyl, isobutyl, 3-butynyl, and the like. Ranges of these groups useful with the compounds and methods described herein include C 1-C 20 alkyl, C 2-C 20 alkenyl, and C 2-C 20 alkynyl. Additional ranges of these groups useful with the compounds and methods described herein include C 1-C 12 alkyl, C 2-C 12 alkenyl, C 2-C 12 alkynyl, C 1-C 6 alkyl, C 2-C 6 alkenyl, C 2-C 6 alkynyl, C 1-C 4 alkyl, C 2-C 4 alkenyl, and C 2-C 4 alkynyl.
Aryl molecules include, for example, cyclic hydrocarbons that incorporate one or more planar sets of, typically, six carbon atoms that are connected by delocalized electrons numbering the same as if they consisted of alternating single and double covalent bonds. An example of an aryl molecule is benzene. Heteroaryl molecules include substitutions along their  main cyclic chain of atoms such as O, N, or S. When heteroatoms are introduced, a set of five atoms, e.g., four carbon and a heteroatom, can create an aromatic system. Examples of heteroaryl molecules include furan, pyrrole, thiophene, imadazole, oxazole, pyridine, and pyrazine. Aryl and heteroaryl molecules can also include additional fused rings, for example, benzofuran, indole, benzothiophene, naphthalene, anthracene, and quinoline. The aryl and heteroaryl molecules can be attached at any position on the ring, unless otherwise noted.
The term alkoxy as used herein is an alkyl group bound through a single, terminal ether linkage. Likewise, the term aryloxy as used herein is an aryl group bound through a single, terminal ether linkage.
The terms amine or amino as used herein are represented by the formula -NZ 1Z 2, where Z 1 and Z 2 can each be a substitution group as described herein, such as hydrogen, an alkyl, halogenated alkyl, alkenyl, alkynyl, aryl, heteroaryl, cycloalkyl, cycloalkenyl, heterocycloalkyl, or heterocycloalkenyl group described above.
The alkoxy, aryloxy, amino, alkyl, alkenyl, alkynyl, and aryl molecules used herein can be substituted or unsubstituted. As used herein, the term substituted includes the addition of a substitution group to a position attached to the main chain of the alkoxy, aryloxy, amino, alkyl, alkenyl, alkynyl, and aryl, e.g., the replacement of a hydrogen by a substitution group. Examples of substitution groups include, but are not limited to, hydroxyl, halogen (e.g., F, Br, Cl, or I) , and carboxyl groups. Conversely, as used herein, the term unsubstituted indicates the alkoxy, aryloxy, amino, alkyl, alkenyl, alkynyl, or aryl has a full complement of hydrogens, i.e., commensurate with its saturation level, with no substitutions, e.g., linear decane (– (CH 29–CH 3) .
II. COMPOSITIONS
Described herein are acridinium-ester containing compounds including a nucleotide moiety. As used herein, the term “nucleotide moiety” refers to a group that includes a nitrogenous base, a sugar component (e.g., a 5-carbon sugar) or a derivative thereof, and at least one phosphate group.
The acridinium-ester containing compounds described herein include compounds represented by Formula I:
Figure PCTCN2020107298-appb-000006
In Formula I, R 1, R 2, and R 3 are each independently absent or a linking group. As used herein, “linking group” refers to a moiety positioned between a location on the acridinium moiety and the R 7 group, which is further defined below. In some embodiments, the linking group is selected from the group consisting of substituted or unsubstituted alkoxy, substituted or unsubstituted alkyl (e.g., C 1-8 substituted or unsubstituted alkyl) , substituted or unsubstituted alkenyl (e.g., C 2-8 substituted or unsubstituted alkenyl) , substituted or unsubstituted alkynyl (e.g., C 3-8 substituted or unsubstituted alkynyl) , and substituted or unsubstituted aryl. Optionally, R 1, R 2, and/or R 3 is absent, meaning a linking group is not present as R 1, R 2, and/or R 3. In these embodiments, R 7 is attached directly to the acridinium moiety.
Also in Formula I, R 4, R 5, and R 6 are each independently selected from the group consisting of hydrogen, substituted or unsubstituted C 1-8 alkyl, substituted or unsubstituted C 2- 8 alkenyl, and substituted or unsubstituted C 2-8 alkynyl. Optionally, R 4, R 5, and R 6 are not substituted with the nucleotide moiety. In some cases, R 4, R 5, and R 6 are each independently selected from the group consisting of hydrogen and substituted or unsubstituted C 1-6 alkyl. In some cases, R 4, R 5, and R 6 are each independently selected from hydrogen and methyl. In some embodiments, R 4 and R 5 are the same (e.g., both R 4 and R 5 are hydrogen or both R 4 and R 5 are substituted or unsubstituted C 1-8 alkyl, such as methyl) . Optionally, R 4 is methyl, R 5 is methyl, and R 6 is hydrogen. Optionally, each of R 4, R 5, and R 6 is hydrogen.
Additionally in Formula I, each R 7 is independently absent, hydrogen, a protein (e.g., an antibody) , or a nucleotide moiety represented by Structure A:
Figure PCTCN2020107298-appb-000007
In Structure A, R 8 is a nitrogenous base, as further defined below, and R 9 is hydrogen or a blocking group, as further defined below.
In some cases, only one of the three R 7 groups present in the compounds according to Formula I is a nucleotide moiety represented by Structure A. For example, if the R 7 group linked to R 1 is a nucleotide moiety, then the R 7 groups linked to R 2 and R 3 are independently absent or hydrogen. Similarly, if the R 7 group linked to R 2 is a nucleotide moiety, then the R 7 groups linked to R 1 and R 3 are independently absent or hydrogen. Likewise, if the R 7 group linked to R 3 is a nucleotide moiety, then the R 7 groups linked to R 1 and R 2 are independently absent or hydrogen.
In some cases, R 7 is a nucleotide moiety. In these cases, R 8 can be a nitrogenous base. Exemplary nitrogenous bases include adenine (A) , cytosine (C) , guanine (G) , thymine (T) , uracil (U) , inosine (I) , and derivatives of these. Exemplary nitrogenous bases are shown below:
Figure PCTCN2020107298-appb-000008
In one aspect, the nitrogenous base is a 7-deaza derivative of adenine, inosine, or guanine. In some cases, the 7-deaza adenine, inosine, and/or guanine derivatives are linked to the alkyne group in R 7 through the 7-position (i.e., the 7-deaza adenine, inosine, and/or guanine derivatives are 7-substituted with R 7 through R 7’s alkyne group) . In some cases, uracil, cytosine, thymine, and/or derivatives thereof are 5-substituted. For example, the 5-substituted uracil, cytosine, thymine, and/or derivatives thereof can be attached to the alkyne group in R 7 through the 5-position (i.e., the 5-substituted uracil, cytosine, thymine, and/or derivatives thereof are substituted with R 7 through R 7’s alkyne group) .
Optionally, R 9 can be hydrogen or a blocking group. As used herein, the term “blocking group” refers to any group that can be cleaved to provide a hydroxyl group at the 3’-position of the nucleotide analogue. The blocking group can be cleavable by physical means, chemical means, heat, and/or light. Optionally, the blocking group is cleavable by enzymatic means. In some embodiments, the blocking group is an azido-containing blocking group (e.g., –CH 2N 3) . In some embodiments, the blocking group is an amino-containing blocking group (e.g., –NH 2) . In some embodiments, the blocking group is an allyl-containing blocking group  (e.g., –CH 2CH=CH 2) . In some embodiments, the blocking group is an alkoxy-containing blocking group (e.g., –CH 2OCH 3) or an aryloxy-containing blocking group (e.g., –CH 2OPh) . In some embodiments, the blocking group is polyethylene glycol (PEG) . In some embodiments, the blocking group is a substituted or unsubstituted alkyl (i.e., a substituted or unsubstituted hydrocarbon) . Optionally, R 9 can be a group as shown below:
Figure PCTCN2020107298-appb-000009
In some cases, one or more R 7 groups can be a protein. For example, one or more R 7 groups can be an antibody or other protein capable of recognizing the nucleotide.
Further in Formula I, X -is a counteranion for the quaternary nitrogen of the acridinium moiety. Exemplary counteranions include, for example, halides (e.g., Cl -, Br -, I -, F -) , CH 3SO 4 -, FSO 3 -, CF 3SO 3 -, C 4F 9SO 3 -, CH 3C 6H 4SO 3 -, CF 3COO -, CH 3COO -, and NO 3 -. Optionally, the counteranion can be directly bonded to a substituent in Formula I.
Examples of acridinium-ester containing compounds according to Formula I include the following:
Figure PCTCN2020107298-appb-000010
In Formula I-A and Formula I-B, X -, R 1, R 2, R 3, and each R 7 are as defined above for Formula I. Formula I-A may be referred to as “AE-D” (AE-dimethyl) and Formula I-B may be referred to as “AE-H” (AE-hydrogen) .
In some examples, the compounds according to Formula I-B are represented by one of the following:
Figure PCTCN2020107298-appb-000011
In some examples, the compounds according to Formula I-A are represented by one of the following:
Figure PCTCN2020107298-appb-000012
The compounds described herein can be prepared in a variety of ways known in the art of organic synthesis or variations thereon as appreciated by those skilled in the art. The compounds described herein can be prepared from readily available starting materials. Optimum reaction conditions may vary with the particular reactants or solvents used, but such conditions can be determined by one skilled in the art.
Variations on Formula I and the compounds described herein include the addition, subtraction, or movement of the various constituents as described for each compound. Similarly, when one or more chiral centers are present in a molecule, the chirality of the molecule can be changed. Additionally, compound synthesis can involve the protection and deprotection of various chemical groups. The use of protection and deprotection and the selection of appropriate protecting groups can be determined by one skilled in the art. The chemistry of protecting groups can be found, for example, in Wuts and Greene, Protective Groups in Organic Synthesis, 4th Ed., Wiley & Sons, 2006, which is incorporated herein by reference in its entirety. The synthesis and subsequent testing of various compounds as described herein to determine efficacy is contemplated.
Reactions to produce the compounds described herein can be carried out in solvents, which can be selected by one of skill in the art of organic synthesis. Solvents can be substantially nonreactive with the starting materials (reactants) , the intermediates, or products under the conditions at which the reactions are carried out, i.e., temperature and pressure. Reactions can be carried out in one solvent or a mixture of more than one solvent. Product or intermediate formation can be monitored according to any suitable method known in the art. For example, product formation can be monitored by spectroscopic means, such as nuclear magnetic resonance spectroscopy (e.g.,  1H or  13C) infrared spectroscopy, spectrophotometry (e.g., UV-visible) , or mass spectrometry (MS) , or by chromatography such as high performance liquid chromatography (HPLC) or thin layer chromatography (TLC) .
Optionally, the compounds described herein can be synthesized by using commercially available dNTPs. The commercially available dTNPs can be conjugated to a linker or a binding molecule through a phosphate conjugation reaction. The 3’-position of the dNTPs can be blocked by reacting the compounds with a protecting group. The conjugating and blocking reactions can be performed in any order.
Signal localization achieved by the acridinium ester-tri-phosphate reversible terminator conjugates as described herein is depicted through the reaction mechanism shown below in Scheme 1.
Scheme 1: Signal Localization Reaction Mechanism
Figure PCTCN2020107298-appb-000013
As shown above in Scheme 1, an acridinium ester compound as described herein (Structure 1) is exposed to a peroxide source. The hydrogen peroxide anion then attacks the acridinium moiety to form Structure 2, which triggers an intramolecular nucleophilic attack on the carbonyl group to form the dioxetane intermediate Structure 3. Structure 3 is highly strained and decomposes to form Structure 4 and emit light. Structure 4, which is the acridinium moiety after excitation (as indicated by the asterisk) , remains linked to the tri-phosphate reversible terminator (i.e., R 7) .
III. AE PAIRS WITH DISTINGUISHABLE PROPERTIES OR MINIMAL CROSS TALK
Advantageously, AE pairs with different signal producing properties find use in nucleic acid sequencing methods, such as sequencing-by-synthesis (SBS) methods. In one approach, the AE pair comprises a first compound of Formula I-A (AE-D) and a second compound of Formula I-B (AE-H) , such as Compound 2B and Compound 1B or Compound  2A and Compound 1A. Suitable AE pairs can be determined using methods known to those of ordinary skill in the art.
Different signal producing properties of AEs include light emission at different initial signal intensities when in the presence of peroxide (e.g., hydrogen peroxide or a hydrogen peroxide-urea complex) and surfactants (e.g., Triton X-100 (4- (1, 1, 3, 3-tetramethylbutyl) phenyl-polyethylene glycol) or Triton X-114 ( (1, 1, 3, 3-tetramethylbutyl) phenyl-polyethylene glycol) ) . For example, AE-H and AE-D may emit light at high or very low (e.g., zero) initial signal intensities depending on peroxide concentration and the presence, absence, or level of certain surfactants. For illustration, Table 1 shows exemplary conditions in which an AE pair (AE-D and AE-H) emit light at different intensities. Desirably, and as shown in Table 1 and Figure 1, the two AE members of an AE pair have minimal cross-talk under a predetermined conditions.
In one approach, the predetermined conditions comprise a peroxide concentration and a surfactant type and concentration.
In one approach, “minimal cross talk” means that under a first predetermined condition the initial signal intensity from a first member of an AE pair is at least 10, at least 50, at least 100, at least 500, or at least 1000-fold greater than the initial signal intensity of a second member of the AE pair under the first predetermined condition, and that under a second predetermined condition the initial signal intensity from the second first member of the AE pair is at least 10, at least 50, at least 100, at least 500, or at least 1000-fold greater than the initial signal intensity of the first member of the AE pair under the second predetermined condition. In one embodiment, “minimal cross talk” means that under a first predetermined condition the initial signal intensity from a first member of an AE pair is at least 50-fold greater than the initial signal intensity of a second member of the AE pair under the first predetermined condition, and that under a second predetermined condition the initial signal intensity from the second first member of the AE pair is at least 50-fold greater than the initial signal intensity of the first member of the AE pair under the second predetermined condition.
Table 1 and Figure 1 illustrate conditions under which the AE pair comprising AE-D (identified as “AE (D) ” in Figure 1) and AE-H (identified as “AE” in Figure 1) have minimal cross-talk. The AE pair used in this illustration are Compound 2B and Compound 1B.
TABLE 1
Figure PCTCN2020107298-appb-000014
As noted, first and second predetermined conditions may include peroxide concentration and surfactant type and concentration. Exemplary peroxide concentrations may range from, for example and not limitation, 0.0005 %to 0.010 % (e.g., 0.005 %) , sometimes referred to as “low concentration, ” and from 0.011 %to 1.0 % (e.g., 0.5 %) , sometimes referred to as “high concentration. ”
Exemplary surfactants include cationic, anionic, and nonionic surfactants. In some examples, the surfactants can include Triton X-100 (4- (1, 1, 3, 3-tetramethylbutyl) phenyl-polyethylene glycol) , cetyltrimethylammonium tosylate (CTAT) , other surfactants shown in Table 2, and combinations of surfactants. Optimal concentrations of each surfactant for use with a given AE pair may be determined empirically. Useful concentrations of CTAT include the range 1.0 mM to 10.0 mM (e.g., 7 mM) , and useful concentrations for Triton X-100 include 0.5 %to 5 % (e.g., 2 %) . Other exemplary concentrations for surfactants are shown in Table 2.
TABLE 2
Surfactant Abbreviation Exemplary Concentration
Cetyltrimethylammonium Chloride CTAC 7.8 mM
Dihexadecyldimethylammonium Bromide DHDAB 10 mM
Didodecyldimethylammonium Bromide DDDAB 10 mM
Trimethyloctadecylammonium Chloride TMDAC 10 mM
Dimethyldioctadecylammonium Bromide DMDAB 1 mM
Tetrahexylammonium Chloride THAC 10 mM
Tridodecylmethylammonium Chloride TDMAC 1 mM
Tetradodecylammonium Chloride TDAC 1 mM
IV. ONE COLOR SEQUENCING
Acridinium ester-tri-phosphate reversible terminator conjugates (AE pairs) may be used in nucleic acid sequencing methods. In some approaches, the sequencing method is massively parallel sequencing-by-synthesis (SBS) . Methods for nucleic acid sequencing-by-synthesis (SBS) and other next generation sequencing (NGS) methods are well known. See Blackburn et al., Curr. Genomics 16 (3) : 159-174 (2015) ; Stranneheim and Lundeberg, Biotechnol. J. 7: 1063-73 (2012) ; Guo et al., Acc. Chem. Res. 43: 551-563 (2010) , Drmanac et al US20180223358A1; Mardis, E.R., 2013, Annu. Rev. Anal. Chem. 6: 287–303; Metzker, M.L., 2010, Nat. Rev. Genet. 11: 31–46; Goodwin et al., 2016, Nat. Rev. Genet. 17: 333–351, each of which is incorporated herein by reference. In some cases, SBS methods use 3’ blocked dNTPs with reversible terminator dyes, including but not limited to dyes as described in US Published Patent Application No. 2010/00317531 (incorporated herein by reference) . In some cases, SBS methods are carried out using an ordered or patterned array in a flow-cell. Acridinium ester-tri-phosphate reversible terminator conjugates (AE pairs) disclosed herein find particular use in one-color (also called one-channel) sequencing. SBS may be carried out using clonal populations of template sequences prepared by bridge PCR, emulsion PCR, production of concatemers (DNA nanoballs) and other methods.
As used herein, “one-color sequencing” refers to massively parallel sequencing methods in which four bases can be distinguished based on differential emission of light under two different conditions. In one approach, the two conditions are exposure to two chemical environments. In one approach, the first condition and second condition are not different based on disassociation (e.g., by cleavage) or association (e.g., through a ligand antiligand interaction) of labels with nucleotides incorporated into a primer extension product or growing strand such as produced in a nucleic acid sequencing method. The light emitted under the two conditions may be the same wavelength, different wavelengths, or approximately the same wavelength, e.g., differing less than 500 nm, 200 nm or 100 nM. See U.S. Pat. No. 9,222,132, incorporated herein by reference, for a non-limiting description of certain one-color methods. Using one-color sequencing methods, different bases (e.g., A, T, C, G) are distinguished based on detection of signal at different intensity (or ‘brightness’ ) , including zero intensity (no signal) , under different conditions (e.g., under a first predetermined condition and a second predetermined condition) .
The sequencing methods described herein make use of acridinium ester-tri-phosphate reversible terminator conjugates described herein. Generally, a mixture of four nucleotide analogs (dATP, dTTP, dCTP, dGTP) is used. For example, in one approach one dNTP analog is labeled with (conjugated to) AE-D, a second dNTP analog is labeled with AE-H, a third dNTP analog is dual labeled with AE-D and AE-H, and a fourth dNTP is not labeled with an AE. The dual-labeled dNTP may be labeled in a variety of ways. In one approach, some or all of the dual-labeled dNTP molecules (e.g., dTTP) are physically labeled with two different labels (AE-D and AE-H) . In another approach, a mixture of the double labeled dNTP molecules (e.g., dTTP) is used in which some dNTPs are labeled with AE-D and some are labeled with AE-H. In one approach, the mixture includes about equimolar amounts of AE-D labeled and AE-H labeled molecules.
In one approach, the reversible terminator conjugates described herein are used in sequencing-by-synthesis (SBS) methods. It will be understood by those of skill in the art that generally reversible terminator dNTPs comprise blocking groups that are removed ( “deblocking” ) at the end of each sequencing cycle to regenerate a 3’-OH on the deoxyribose sugar and permit incorporation of the nucleotide complementary to the template nucleotide at the next sequencing position.
In one common SBS approach, one nucleotide is incorporated in each sequencing cycle. Typically, in each sequencing cycle, a reversible terminator dNTP is incorporated into a growing strand, the incorporation is detected, and the terminator is removed ( “deblocking” ) . According to the present invention, the incorporated dNTP is exposed to two predetermined conditions in each cycle (or one predetermined condition in each half-cycle) . In one approach, an array of clonal clusters (e.g., produced by bridge PCR) , single molecules, DNA nanoballs, or the like, is produced in a sequencing flow cell, comprising (1) a substrate on which target molecules (e.g., target amplicons) are immobilized and (2) a fluidic system that allows reagents, enzymes, buffers, washes, and the like to be delivered to the substrate. In one approach, it is a patterned flow cell. After each dNTP incorporation step, a first solution is flowed through the flow cell (e.g., low H 2O 2 plus CTAT) and an image is collected, and then a second solution is flowed through the flow cell (e.g., high H 2O 2 plus Triton) and a second image is collected. The pair of images is compared and the identity of the incorporated dNTP is determined as discussed below and described in Tables 3 and 4.
As used herein, an “image” or “imaging” refers to collection of data (light emission intensity) and is not limited to a particular system or format of collection. For example, “images” may be collected using CCD cameras, complementary metal-oxide semiconductor (CMOS) chips, and other detectors. “Comparing” images generally comprises analysis of signal date using a computer.
Each flow step may be continuous or discontinuous and optionally buffer washes may be carried out before, between, or after exposure to each solution. The first and second images are compared and changes in signal emissions of one or more AE-rtNTPs are determined. By comparing the signals at each position on an array, the incorporated nucleotide may be identified.
TABLE 3
Figure PCTCN2020107298-appb-000015
TABLE 4
dNTP analog Signal captured in 1 st image Signal captured in 2 nd image
dATP
0 1
dCTP 1 0
dTTP 1 1
dGTP 0 0
EXAMPLES
The present invention may be embodied in other specific forms without departing from its structures, methods, or other essential characteristics as broadly described herein and claimed hereinafter. The described embodiments are to be considered in all respects only as illustrative, and not restrictive. The scope of the invention is, therefore, indicated by the  appended claims, rather than by the foregoing description. All changes that come within the meaning and range of equivalency of the claims are to be embraced within their scope.
Example 1: Synthesis of Acridinium Ester-Containing Compounds
General Procedure for the Preparation of Compound 3:
Figure PCTCN2020107298-appb-000016
To a 250 mL flask, 3-bromoanisole (5.0 g) , 3-methoxyaniline (4.0 g) , palladium acetate (0.21 g) , 2, 2′-bis (diphenylphosphino) -1, 1′-binaphthyl (BINAP; 0.8 g) , cesium carbonate (9.6 g) , and toluene (100 mL) were added. The mixture was degassed for 5 minutes and then refluxed for 3 days under Argon. Water was added to the cooled reaction mixture, followed by extraction with ethyl acetate (EtOAc) . The extract was washed with water and dried over anhydrous sodium sulfate (Na 2SO 4) . The crude product was purified by silica gel column chromatography to give Compound 3.
General Procedure for the Preparation of Compound 4:
Figure PCTCN2020107298-appb-000017
Compound 3 (2.5 g) in toluene (60 mL) was slowly added to a solution of oxalyl chloride (3.0 mL) in toluene (100 mL) using an additional funnel. After the addition was over, the mixture was heated at 60 ℃ for 1 hour. The solvent was removed and the residue was heated at 120 ℃ overnight. The resulting solid (Compound 4) was used in the reaction of the next step without purification.
General Procedure for the Preparation of Compound 5:
Figure PCTCN2020107298-appb-000018
To the crude Compound 4 was added 2N NaOH/water (100 mL) . The mixture was refluxed for 4 hours and then cooled to room temperature. An HCl/water mixture (2N) was added to adjust the pH to 1-3. The resulting yellow precipitate was collected and washed with water. The solid was dried to give Compound 5.
General Procedure for the Preparation of Compound 6:
Figure PCTCN2020107298-appb-000019
Compound 5 was suspended in thionyl chloride (12 mL) and two drops of dimethylformamide (DMF) was added. The mixture was refluxed for 2 hours, and then thionyl chloride was removed. Phenol (1.3 g) , anhydrous dichloromethane (CH 2Cl 2; 20 mL) , and pyridine (5 mL) were added to the residue. The mixture was refluxed for 2 hours, and then CH 2Cl 2 and pyridine were removed. The residue was dissolved in chloroform (CHCl 3) and washed with sodium bicarbonate (NaHCO 3) /water. The organic layer was dried over anhydrous Na 2SO 4. The crude product was purified by silica gel column chromatography to give Compound 6.
General Procedure for the Preparation of Compound 7:
Figure PCTCN2020107298-appb-000020
Compound 6 (0.5 g) and 1, 4-butane sultone (10 g) were mixed and heated at 130 ℃for 10 hours, and then cooled to room temperature. A solution of 1N HCl in water (100 mL) was added to the reaction mixture and the resulting mixture was heated at 60 ℃ for 30 minutes. The mixture was cooled to room temperature and then adjusted pH to 1-2 with NaHCO 3/water.  The mixture was purified by C 18 column chromatography using 0.1%trifluoroacetic acid (TFA) buffer and acetonitrile (CH 3CN) as solvents to give Compound 7.
General Procedure for the Preparation of Compound 8:
Figure PCTCN2020107298-appb-000021
Compound 7 (0.1 g) was suspended in anhydrous CH 2Cl 2 (5 mL) . Oxalyl chloride (1 mL) was added to the mixture and the resulting mixture was stirred at room temperature for 2 hours. The solvent and oxalyl chloride were removed, the residue was dissolved in anhydrous CH 2Cl 2 (5 mL) , and then ethyl isonipecotate (0.2 ml) was added. The mixture was stirred at room temperature for 10 minutes, and then 1N HCl/water was added. The mixture was extracted with CHCl 3, the solvent was removed to give Compound 8 without purification.
General Procedure for the Preparation of Compound 1:
Figure PCTCN2020107298-appb-000022
The above crude Compound 8 was suspended in dioxane (20 mL) and 1N HCl/water (40 mL) , and the mixture was stirred at 70 ℃ for 4 hours. The dioxane was removed and the pH of the aqueous solution was adjusted to 1-2 with NaHCO 3/water. Acetonitrile was then added, resulting in a clear solution. The solution was purified by preparative HPLC using 0.1%TFA buffer and CH 3CN as solvents to give pure Compound 1.
General Procedure for the Preparation of Compound 2:
Compound 2 was synthesized using a similar procedure to that used to prepare Compound 1.
Figure PCTCN2020107298-appb-000023
Although the foregoing invention has been described in some detail by way of illustration and example for purposes of clarity of understanding, one of skill in the art will appreciate that certain changes and modifications may be practiced within the scope of the appended claims. In addition, each reference provided herein is incorporated by reference in its entirety to the same extent as if each reference was individually incorporated by reference. Where a conflict exists between the instant application and a reference provided herein, the instant application shall dominate.

Claims (13)

  1. A compound of the following formula:
    Figure PCTCN2020107298-appb-100001
    wherein
    R 1, R 2, and R 3 are each independently absent or a linking group;
    R 4, R 5, and R 6 are each independently selected from the group consisting of hydrogen, substituted or unsubstituted C 1-8 alkyl, substituted or unsubstituted C 2-8 alkenyl, and substituted or unsubstituted C 2-8 alkynyl;
    each R 7 is independently absent, hydrogen, a protein, or a nucleotide moiety of the following formula:
    Figure PCTCN2020107298-appb-100002
    wherein
    R 8 is a nitrogenous base; and
    R 9 is hydrogen or a blocking group; and
    X -is a counteranion,
    wherein only one R 7 present in the compound is the nucleotide moiety.
  2. The compound of claim 1, wherein R 4, R 5, and R 6 are not substituted with the nucleotide moiety.
  3. The compound of claim 1, wherein R 4, R 5, and R 6 are each independently selected from the group consisting of hydrogen and substituted or unsubstituted C 1-6 alkyl.
  4. The compound of claim 1, wherein R 4, R 5, and R 6 are each independently selected from hydrogen and methyl.
  5. The compound of claim 1, wherein R 4 is methyl, R 5 is methyl, and R 6 is hydrogen.
  6. The compound of claim 1, wherein each of R 4, R 5, and R 6 is hydrogen.
  7. The compound of claim 1, wherein R 7 is the nucleotide moiety and R 8 is the nitrogenous base selected from the group consisting of:
    Figure PCTCN2020107298-appb-100003
  8. The compound of claim 1, wherein R 7 is the nucleotide moiety and R 9 is the blocking group, wherein the blocking group is selected from the group consisting of –CH 2N 3, –NH 2, –CH 2CH=CH 2, –CH 2OCH 3, polyethylene glycol, and a substituted or unsubstituted alkyl.
  9. The compound of claim 1, wherein R 1, R 2, or R 3 is the linking group and the linking group is selected from the group consisting of substituted or unsubstituted alkoxy, substituted or unsubstituted alkyl, substituted or unsubstituted alkenyl, substituted or unsubstituted alkynyl, and substituted or unsubstituted aryl.
  10. A composition comprising a mixture of deoxyribonucleotide triphosphates (dNTPs) comprising:
    (i) first dNTPs conjugated to a first member of an AE pair;
    (ii) second dNTPs conjugated to a second member of the AE pair;
    (iii) third dNTPs conjugated to both the first member and the second member; and
    (iv) fourth dNTPs conjugated to neither the first member or the second member,
    wherein each of the first, second, third, and fourth dNTPs is selected from the group consisting of dATP, dTTP, dCTP, and dGTP, and are different from each other; and
    wherein the first member and second member have distinguishable properties.
  11. The composition of claim 10, wherein the first member and the second member have minimal cross talk.
  12. The composition of claim 11, wherein the dNTPs are reversible terminator dNTPs comprising cleavable blocking groups.
  13. A method for identifying bases of a plurality of template DNA strands having different sequences, comprising:
    i) providing an array of immobilized template DNA strands annealed to a primer or primer extension product;
    ii) contacting the array of (i) with the composition of claim 10 in the presence of a DNA polymerase under conditions in which the primers or primer extension products are extended by incorporation of a dNTP;
    iii) contacting the array with a first solution and capturing a first image;
    iv) contacting the array with a second solution and capturing a second image; and
    v) comparing the first and second images to identify bases of the plurality of template DNA strands.
PCT/CN2020/107298 2019-08-07 2020-08-06 Acridinium ester-containing compounds and methods of using the same for chemiluminescence-based one-color sequencing WO2021023252A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN202080056098.6A CN114245830A (en) 2019-08-07 2020-08-06 Acridinium ester-containing compound and method for using same for chemiluminescence-based monochromatic sequencing
EP20849079.7A EP4010484A4 (en) 2019-08-07 2020-08-06 Acridinium ester-containing compounds and methods of using the same for chemiluminescence-based one-color sequencing

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201962883760P 2019-08-07 2019-08-07
US62/883,760 2019-08-07

Publications (1)

Publication Number Publication Date
WO2021023252A1 true WO2021023252A1 (en) 2021-02-11

Family

ID=74499206

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/107298 WO2021023252A1 (en) 2019-08-07 2020-08-06 Acridinium ester-containing compounds and methods of using the same for chemiluminescence-based one-color sequencing

Country Status (4)

Country Link
US (1) US20210040556A1 (en)
EP (1) EP4010484A4 (en)
CN (1) CN114245830A (en)
WO (1) WO2021023252A1 (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2005065350A2 (en) * 2003-12-31 2005-07-21 Quest Diagnostics Investments Incorporated Compositions and methods for detecting reverse transciptase in a sample
WO2009097368A2 (en) * 2008-01-28 2009-08-06 Complete Genomics, Inc. Methods and compositions for efficient base calling in sequencing reactions
WO2018165207A1 (en) * 2017-03-06 2018-09-13 Singular Genomic Systems, Inc. Nucleic acid sequencing-by-synthesis (sbs) methods that combine sbs cycle steps

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
NL8703075A (en) * 1987-12-18 1989-07-17 Nederlanden Staat ACRIDINUM COMPOUNDS AS A CHEMILUMINESCENT MARKING.
JP3584379B2 (en) * 1993-02-04 2004-11-04 持田製薬株式会社 Acridinium compound and acridinium compound complex
AU6381900A (en) * 1999-07-30 2001-02-19 Bayer Corporation Chemiluminescent substrates of hydrolytic enzymes such as phosphatases
EP1539702B1 (en) * 2002-08-20 2012-04-11 Quest Diagnostics Investments Incorporated Hydrophilic chemiluminescent acridinium labeling reagents
GB0320236D0 (en) * 2003-08-29 2003-10-01 Molecular Light Tech Res Ltd Chemiluminescent compounds
JP2012020958A (en) * 2010-07-14 2012-02-02 Musashino Joshi Gakuin Compound and labeled antibody

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2005065350A2 (en) * 2003-12-31 2005-07-21 Quest Diagnostics Investments Incorporated Compositions and methods for detecting reverse transciptase in a sample
WO2009097368A2 (en) * 2008-01-28 2009-08-06 Complete Genomics, Inc. Methods and compositions for efficient base calling in sequencing reactions
WO2018165207A1 (en) * 2017-03-06 2018-09-13 Singular Genomic Systems, Inc. Nucleic acid sequencing-by-synthesis (sbs) methods that combine sbs cycle steps

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP4010484A4 *

Also Published As

Publication number Publication date
CN114245830A (en) 2022-03-25
EP4010484A4 (en) 2024-02-07
EP4010484A1 (en) 2022-06-15
US20210040556A1 (en) 2021-02-11

Similar Documents

Publication Publication Date Title
US6713622B1 (en) 4,7-dichlororhodamine dye labeled polynucleotides
JP4761086B2 (en) Nucleic acid, labeling substance, nucleic acid detection method and kit
US20160208100A1 (en) Water soluble fluorescent or colored dyes and methods for their use
US11618826B2 (en) Carborhodamine compounds and methods of preparation thereof
AU2015257477B2 (en) Polymethine compounds and their use as fluorescent labels
CN104910229A (en) Poly phosphoric acid end fluorescent labeled nucleotide and application thereof
CA2902992A1 (en) Rhodamine compounds and their use as fluorescent labels
JP5419278B2 (en) Fluorogenic molecule
WO2014013954A1 (en) Nucleic acid probe, method for designing nucleic acid probe, and method for detecting target sequence
CN106883637A (en) The methine cyanine dyes of indoles seven and preparation method and application of uracil
US20090093623A1 (en) 4,7-dichlororhodamine dyes
WO2021023252A1 (en) Acridinium ester-containing compounds and methods of using the same for chemiluminescence-based one-color sequencing
Skoblov et al. Solid-and solution-phase synthesis and application of R6G dual-labeled oligonucleotide probes
JP2020176108A (en) Guanine quadruplex-binding ligand
CN113979981B (en) Sulfhydryl response type dealkalized site capture reagent and application
Cooke et al. Solid-phase synthesis of terminal oligonucleotide–phosphoramidate conjugates
Zhang et al. New fluorescent conjugates of uridine nucleoside and substituted 1, 8-naphthalimide: synthesis, weak interactions and solvent effects on spectra
US10738346B2 (en) Nitrodiarylethenes as fluorescence quenchers for nucleic acid probes
CN110498796B (en) Tadalafil analogue containing sulfonyl fluoride group and synthesis method thereof
Faroughi et al. Halogenation of Tröger’s base analogues
US7385051B2 (en) Fluorescent N2,N3-etheno-purine (2′-deoxy) riboside derivatives and uses thereof
US20060281100A1 (en) Thiotriphosphate nucleotide dye terminators
CN115003762A (en) Fluorescent beta-lactamase substrates and related detection methods
CN114621306A (en) Compound, preparation method and application thereof
WO2023047141A1 (en) S- and se- adenosyl-l-methionine analogues with activated groups for transfer by methyltransferases on target biomolecules

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20849079

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 2020849079

Country of ref document: EP

Effective date: 20220307