WO2020234605A1 - Method of quality control of oligonucleotide synthesis - Google Patents

Method of quality control of oligonucleotide synthesis Download PDF

Info

Publication number
WO2020234605A1
WO2020234605A1 PCT/GB2020/051250 GB2020051250W WO2020234605A1 WO 2020234605 A1 WO2020234605 A1 WO 2020234605A1 GB 2020051250 W GB2020051250 W GB 2020051250W WO 2020234605 A1 WO2020234605 A1 WO 2020234605A1
Authority
WO
WIPO (PCT)
Prior art keywords
phosphate
nucleic acid
pyrophosphate
nucleotide
sequence
Prior art date
Application number
PCT/GB2020/051250
Other languages
French (fr)
Inventor
Michael Chun Hao CHEN
Gordon Ross MCINROY
Original Assignee
Nuclera Nucleics Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nuclera Nucleics Ltd filed Critical Nuclera Nucleics Ltd
Priority to CN202080038030.5A priority Critical patent/CN114008215A/en
Priority to EP20728173.4A priority patent/EP3973075A1/en
Publication of WO2020234605A1 publication Critical patent/WO2020234605A1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6844Nucleic acid amplification reactions
    • C12Q1/6851Quantitative amplification
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6813Hybridisation assays
    • C12Q1/6834Enzymatic or biochemical coupling of nucleic acids to a solid phase

Definitions

  • the invention relates to methods and compositions for the quality control of the synthesis of oligonucleotides during cycles of template independent enzymatic extension.
  • Nucleic acid synthesis is vital to modern biotechnology. The rapid pace of development in the biotechnology arena has been made possible by the scientific community's ability to artificially synthesise DNA, RNA and proteins.
  • DNA synthesis technology does not meet the demands of the biotechnology industry. Despite being a mature technology, it is highly challenging to synthesise a DNA strand greater than 200 nucleotides in length in viable yield, and most DNA synthesis companies only offer up to 120 nucleotides routinely.
  • an average protein-coding gene is of the order of 2000- 3000 contiguous nucleotides
  • a chromosome is at least a million contiguous nucleotides in length and an average eukaryotic genome numbers in the billions of nucleotides.
  • Known methods of DNA sequencing use template-dependent DNA polymerases to add 3'-reversibly terminated nucleotides to a growing double-stranded substrate.
  • each added nucleotide contains a dye, allowing the user to identify the exact sequence of the template strand.
  • this technology is able to produce strands of between 500-1000 bps long.
  • this technology is not suitable for de novo nucleic acid synthesis because of the requirement for an existing nucleic acid strand to act as a template.
  • TdT terminal deoxynucleotidyl transferase
  • dNTP deoxynucleotide triphosphate
  • TdT has not been shown to efficiently add nucleoside triphosphates containing 3'-0- reversibly terminating moieties for building up a nascent single-stranded DNA chain necessary for a de novo synthesis cycle, and thus the synthesis of long strands is inefficient.
  • a primary source of error in solid-phase nucleic acid synthesis - both enzymatic and phosphoramidite-based - comes from the failure to add a nucleotide to the desired sequence. Such failure results in a deletion, which results in frameshift mutations in biological sequences.
  • the ability to assess the nucleotide coupling efficiency and thus the quality with which a particular nucleic acid was synthesized is critical regardless of the method used to synthesize said nucleic acid.
  • Coupling efficiency in phosphoramidite nucleic acid synthesis is measured in-situ by virtue of the colour of deprotected 5'-dimethoxytrityl (DMT) groups.
  • DMT deprotected 5'-dimethoxytrityl
  • the 5'-DMT groups act as reversible terminators in phosphoramidite synthesis, controlling the addition of one and only one amidite per synthesis cycle.
  • the orange colour of the DMT cation released from the 5'-DMT group is eluted; such colour can be quantitated via a spectrophotometer to determine coupling efficiency per phosphoramidite synthesis cycle.
  • non-templated enzymatic nucleic acid synthesis does not result in the production of any groups readily detectable via spectrophotometric means.
  • enzymatic nucleic acid synthesis results in the production of inorganic pyrophosphate and a proton.
  • inorganic pyrophosphate is detectable through a variety of means, such as enzyme-coupled reactions.
  • the preference to rapidly remove inorganic pyrophosphate due to the capability of TdT to perform strand dismutation reactions complicates the detection of inorganic pyrophosphate as a means to quantitate coupling efficiency.
  • reactions often resynthesise nucleoside triphosphate molecules, which is problematic in reactions where nucleotide triphosphates are the TdT substrates.
  • the monitoring is performed on one or more of the nucleotide monomer reaction cycles.
  • the monitoring can be performed on each cycle.
  • the term each cycle refers to monitoring directly after a nucleotide has been added.
  • the quality measurements are performed during or after extension cycles before the completed strand is assembled.
  • the term refers to monitoring after nucleotide extension, rather than just at the end of the synthesis, and does not require that every cycle is monitored. Thus if a 100 cycles of extension are performed, monitoring every other cycle is within the scope of the invention. Similarly if 50 cycles are performed only 49 can be monitored.
  • the claim is merely requiring that the quality of the synthesis is determined as the synthesis progresses not just at the end, not that every cycle must be monitored. Monitoring each and every cycle is within the scope of the claimed invention, but is not essential.
  • Described is a technique for detecting the incorporation of one or more nucleotides into a growing chain by template-independent synthesis.
  • the described method relies on detection of changes in heat, pH, phosphate concentration and/or pyrophosphate concentration as a result of template- independent DNA synthesis.
  • dNTP nucleotide
  • G Guanine
  • A Adenine
  • T Thymine
  • C Cytosine as incorporated into a growing DNA strand.
  • the nucleotide need not be limited to dNTPs (i.e., 2'-deoxy NTPs). They include any polyphosphate species that can be incorporated by an enzyme, including ribonucleoside 5'-triphosphates.
  • the nucleotides can be reversibly blocked such that only one monomer per strand is incorporated. The block may be anywhere on the monomer, including optionally at the 3'- position.
  • the nucleotides can also be reversibly blocked at the nitrogenous base by molecular entities such as a small molecule, peptide, oligosaccharide, polymer, or protein.
  • the nucleotide may also be amine-masked to mask the amino groups on the nitrogenous base and prevent hydrogen bonding. The amino-groups may subsequently be unmasked to reveal a free amino (NH 2 ) group.
  • the nucleotides are generally unlabelled in order to synthesise a non-modified strand, but the nucleotides can optionally be labelled. In order to control the desired sequence, a single nucleotide species is generally added per cycle, but more than one nucleotide can be added if the desired strand sequence is degenerate.
  • the DNA polymerase can be a terminal deoxynucleotidyl transferase.
  • the enzyme can be modified to increase the incorporation of 3'- blocked nucleotides.
  • the incorporation of the nucleotide in the above reaction is monitored, by monitoring changes in heat, pH, levels of pyrophosphate or levels of phosphate, to provide quality control information.
  • the T in the above reaction is about 22 kT or ⁇ 570 meV per nucleotide incorporation, and may be measured in accordance with the present invention, as well as DrH.
  • FIG. 1 One embodiment of phosphate detection.
  • Phosphate generated from non-templated enzymatic nucleic acid synthesis is quantified through methods such as enzyme binding, enzyme- coupled reaction, and/or inorganic molecule binding.
  • the aforementioned methodologies generate a detectable signal to quantitate the level of nucleotide incorporation.
  • Figure 2 Coupling of pyrophosphatase and phosphate-binding protein labelled to a fluorophore to monitor and quantitate reversibly terminated nucleotide incorporation by engineered TdTs.
  • Figure 3 Pyrophosphate detection as a method to monitor and quantitate reversibly terminated nucleotide incorporation by engineered TdTs.
  • nucleotide incorporation results in an increased temperature in solution generated by released pyrophosphate.
  • the splitting of pyrophosphate into two inorganic phosphate molecules, accomplished for example with the addition of pyrophosphatases, will produce further temperature change.
  • changes in temperature are monitored by a suitable sensitive instrument to detect if successful nucleotide incorporation has occurred. pH:
  • a method for monitoring the incorporation of one or more nucleotides into a growing chain by template-independent synthesis by monitoring pH change is disclosed herein.
  • Nucleotide incorporation results in an increased negative charge in solution generated by released pyrophosphate and inorganic phosphate.
  • the splitting of pyrophosphate into two inorganic phosphate molecules, accomplished for example with the addition of pyrophosphatases, will further lower the charge in solution.
  • changes in pH are monitored by a suitable sensitive instrument such as an ion-sensitive field-effect transistor (ISFET) to detect if successful nucleotide incorporation has occurred.
  • ISFET ion-sensitive field-effect transistor
  • pyrophosphate is detected by interaction, chelation, binding, or association with a small molecule.
  • pyrophosphate is detected by interaction, chelation, binding, or association with a biomolecule.
  • the biomolecule may be formed from protein, DNA, RNA, or a combination thereof. These interactions result in a detectable signal.
  • the small molecule is or contains a chelated Zn(ll) complex.
  • the small molecule is a terpyridine-Zn(ll) complex.
  • the terpyridine-Zn(ll) complex is carbazole or amino modified.
  • the molecule containing a terpyridine-Zn(ll) complex interacts with pyrophosphate and generates a detectable signal, such as a change in absorbance, chemiluminescence, and/or fluorescence.
  • real-time bioluminometric detection of released pyrophosphate is detected as a result of successful nucleotide incorporation.
  • the released pyrophosphate is converted to ATP by ATP sulfurylase from adenosine 5'-phosphosulfate and the level of ATP is sensed by a luciferase producing a proportional light signal, which is detected by photo sensing devices.
  • the reacted addition solution is removed from the immobilized oligonucleotide and the released pyrophosphate in the removed addition solution is then detected by the bioluminescence regenerative cycle involving ATP sulfurylase and luciferase.
  • the advantage of detecting pyrophosphate in removed addition solution is that there is no potential for the reagents of the bioluminescence regenerative cycle (e.g., addition of the ATP sulfurylase product, ATP, by TdT to the oligonucleotide initiator) to interfere with the enzymatic nucleic acid synthesis process.
  • the reagents of the bioluminescence regenerative cycle e.g., addition of the ATP sulfurylase product, ATP, by TdT to the oligonucleotide initiator
  • incorporation of one or more nucleotides into a growing chain by template-independent synthesis is monitored using a bioluminescence regenerative cycle (BRC).
  • BRC bioluminescence regenerative cycle
  • steady state levels of bioluminescence result from processes that produce pyrophosphate.
  • Pyrophosphate reacts in the presence of ATP sulfurylase and adenosine 5'-phosphosulfate to produce ATP.
  • the ATP reacts with luciferin in a luciferase-catalysed reaction, producing light and regenerating pyrophosphate.
  • the pyrophosphate is recycled to produce ATP and the regenerative cycle continues.
  • kits for use in monitoring changes in pyrophosphate concentration there is provided a kit for use in monitoring changes in pyrophosphate concentration.
  • the nucleotide may carry a reversible block.
  • the nucleotide may carry a 3'-block.
  • the 3'-block is a protecting group on the 3' hydroxyl which can be removed to release the 3'-OH, in which case the block acts as a reversible terminator.
  • the nucleotide may carry a reversible terminator attached to the nitrogenous base.
  • the base-blocking group is a protecting group which can be removed to enable addition of the nucleotide by a template independent enzyme such as a TdT.
  • the nucleotide may be covalently or non-covalently bound to a template independent enzyme such as a TdT.
  • Each mention of the term block includes optionally a 3'-block.
  • a method of enzymatic nucleic acid synthesis which includes:
  • nucleic acid transferase or template independent polymerase such as an engineered terminal deoxynucleotidyl transferase (TdT);
  • pyrophosphate-sensing reagents such as a terpyridine-Zn(ll) complex or the simultaneous presence of all of the following: adenosine 5'-phosphosulfate, ATP sulfuryase, luciferin, and luciferase;
  • Steps (b) - (f) can be repeated to add greater than one nucleotide to the above DNA initiator sequence.
  • a further method of enzymatic nucleic acid synthesis could include:
  • nucleic acid transferase or template independent polymerase such as an engineered terminal deoxynucleotidyl transferase (TdT);
  • reagents for the quantification of pyrophosphate can include a terpyridine-Zn(ll) complex, adenosine 5'-phosphosulfate, ATP sulfuryase, luciferin, luciferase, or a combination thereof;
  • Steps (b) - (h) can be repeated to add greater than one nucleotide to the above nucleic acid initiator sequence.
  • the further method of enzymatic nucleic acid synthesis directly mentioned above has the advantage of segregating the incorporation of a blocked nucleoside triphosphate from the detection of a produced pyrophosphate molecule.
  • This segregation is advantageous as the pyrophosphate detection reagents may interfere with the incorporation of a blocked nucleoside triphosphate.
  • ATP is generated when using adenosine 5'-phosphosulfate, ATP sulfuryase, luciferin, luciferase, or a combination thereof to detect pyrophosphate through bioluminescence.
  • ATP can be incorporated by a TdT to the 3'-end of a nucleic acid initiator in a method of enzymatic nucleic acid synthesis. This incorporation thus represents a mutation, which is detrimental to the accuracy of nucleic acid synthesis.
  • Disclosed herein is a method for monitoring the incorporation of one or more nucleotides into a growing chain by template-independent synthesis by monitoring the phosphate concentration over time.
  • the method involves the use of an inorganic pyrophophatase enzyme to catalyse the conversion of pyrophosphate, one molecule of which is released during the successful addition of one nucleotide, into two phosphate ions.
  • the presence of the phosphate ions, and hence the presence of the original pyrophosphate molecule can be detected and information on the success or failure of nucleotide addition gained.
  • inorganic pyrophosphatase which converts one molecule of inorganic pyrophosphate to two molecules of inorganic phosphate.
  • a phosphate-sensing assay we couple the reaction of inorganic pyrophosphatase, which converts one molecule of inorganic pyrophosphate to two molecules of inorganic phosphate.
  • the presence of phosphate ions is detected using a phosphate binding protein (PBP).
  • PBP phosphate binding protein
  • the phosphate binding protein is natural, in another embodiment of the invention the phosphate binding proteins sequence is modified. In one embodiment of the invention, the phosphate binding protein is E. coli phosphate binding protein.
  • the phosphate binding protein is conjugated to a fluorescent tag.
  • a kit for use in monitoring changes in phosphate concentration there is provided a kit for use in monitoring changes in phosphate concentration.
  • the method of nucleic acid synthesis involves the use of TdT in non-templated enzymatic DNA synthesis with 3'-0-reversibly terminated 2'-deoxynucleoside 5'-triphosphates (dNTPs).
  • dNTPs 3'-0-reversibly terminated 2'-deoxynucleoside 5'-triphosphates
  • dNTPs 3'-0-reversibly terminated 2'-deoxynucleoside 5'-triphosphates
  • dNTPs 3'-0-reversibly terminated 2'-deoxynucleoside 5'-triphosphates
  • N is known a priori as the quantity of free 3'-OH ends supplied by
  • 5'-immobilized oligonucleotides and N+l is quantified through a phosphate-sensing assay.
  • the phosphate sensing assay above utilizes a phosphate binding protein (PBP), such as E. coli PBP containing the mutation A197C, covalently coupled to fluorescent dye, such as N- [2-(l-maleimidyl)ethyl]-7-(diethylamino)coumarin-3-carboxamide (MDCC) or tetramethylrhodamine (rho).
  • PBP phosphate binding protein
  • MDCC N- [2-(l-maleimidyl)ethyl]-7-(diethylamino)coumarin-3-carboxamide
  • rho tetramethylrhodamine
  • E. coli PBP the MDCC or rho is covalently coupled to PBP.
  • MDCC is covalently coupled to PBP through a thioester bond at A197C.
  • the phosphate-sensing assay is based on reaction of inorganic phosphate with maltose, in the presence of an enzyme, to produce glucose. The glucose is then specifically oxidized to generate a product that reacts with a probe to generate fluorescence.
  • the enzyme is maltose phosphorylase which converts maltose, in the presence of inorganic phosphate, to glucose-l-phosphate and glucose. Then, glucose oxidase converts the glucose to gluconolactone and H 2 O 2 . Finally, with horseradish peroxidase (HRP) as a catalyst, the H 2 O 2 reacts with Amplex Red reagent to generate resorufin. The resulting increase in fluorescence or absorption is proportional to the amount of inorganic phosphate in the sample.
  • HRP horseradish peroxidase
  • the phosphate-sensing assay consists of complex formation of inorganic phosphate with ammonium molybdate, which is detectable as a colour change
  • the phosphate-sensing assay is based on quantification of the green complex formed between Malachite Green, ammonium molybdate and free phosphate.
  • the phosphate-sensing assay consists of a phosphorylase and a substrate that becomes detectable upon phosphorylation. Examples include utilizing a phosphorylase in the presence of 7-methylguanosine or 2-amino-6-mercapto-7-methylpurine.
  • a method of enzymatic nucleic acid synthesis could include:
  • nucleic acid transferase or template independent polymerase such as an engineered terminal deoxynucleotidyl transferase (TdT);
  • Steps (b) - (f) can be repeated to add greater than one nucleotide to the above nucleic acid initiator sequence.
  • a further method of enzymatic nucleic acid synthesis could include:
  • nucleic acid transferase or template independent polymerase such as an engineered terminal deoxynucleotidyl transferase (TdT);
  • Steps (b) - (h) can be repeated to add greater than one nucleotide to the above nucleic acid initiator sequence.
  • the further method of enzymatic nucleic acid synthesis directly mentioned above has the advantage of segregating the incorporation of a blocked nucleoside triphosphate from the detection of produced phosphate molecules. This segregation may be advantageous if the phosphate detection reagents interfere with the incorporation of a blocked nucleoside triphosphate.
  • references herein to an 'initiator sequence' refer to a short oligonucleotide with a free 3'-end which the nucleotide monomers can attach.
  • the initiator sequence is a DNA initiator sequence.
  • the initiator sequence is an RNA initiator sequence.
  • the strands are synthesised using enzymatic means.
  • the cycles of extension can be performed using a nucleic acid transferase or template independent polymerase, such as an engineered terminal deoxynucleotidyl transferase (TdT), where the nucleotide monomers are nucleoside triphosphates.
  • TdT engineered terminal deoxynucleotidyl transferase
  • the reversibly blocked nucleoside triphosphates can be 3'-reversibly blocked nucleoside triphosphates.
  • the 3'-reversible block can be selected from 3'-0-CH 2 N 3 , 3'-0-CH 2 CHCH 2 , 3'-0- CH 2 CH 2 CN or 3'-0-NH 2 .
  • references herein to a 'DNA initiator sequence' include a small sequence of DNA which a blocked nucleotide triphosphate can be attached to, i.e., DNA will be synthesised from the end of the DNA initiator sequence.
  • the initiator sequence is single-stranded.
  • the initiator sequence is double-stranded. It will be understood by persons skilled in the art that a 3'-overhang (l.e., a free 3'-end) allows for efficient addition.
  • the initiator sequence is immobilised on a solid support.
  • the initiator sequence may be attached to a solid support stable under aqueous conditions so that the method can be easily performed via a flow setup.
  • the initiator sequence is immobilised on a solid support via a reversible interacting moiety, such as a chemically-cleavable linker, an antibody/immunogenic epitope, a biotin/biotin binding protein (such as avidin or streptavidin), or glutathione-GST tag. Therefore, in a further embodiment, the method additionally comprises extracting the resultant nucleic acid by removing the reversible interacting moiety in the initiator sequence, such as by incubating with proteinase K.
  • a reversible interacting moiety such as a chemically-cleavable linker, an antibody/immunogenic epitope, a biotin/biotin binding protein (such as avidin or streptavidin), or glutathione-GST tag. Therefore, in a further embodiment, the method additionally comprises extracting the resultant nucleic acid by removing the reversible interacting moiety in the initiator sequence, such as by incubating with proteinase K
  • the initiator sequence contains a base or base sequence recognisable by an enzyme.
  • a base recognised by an enzyme such as a glycosylase, may be removed to generate an abasic site which may be cleaved by chemical or enzymatic means.
  • a base sequence may be recognised and cleaved by a restriction enzyme.
  • the resultant contiguous oligonucleotide sequence is released from being immobilised. In one embodiment, this release occurs by removing a non-canonical base from one or more of the immobilised oligonucleotides and cleaving the strands at the resultant abasic site.
  • the non-canonical base is uracil, which is removed by uracil DNA glycosylase. In an alternate embodiment, the non-canonical base is 8-oxoquanine, which is removed by formamidopyrimidine DNA glycosylase.
  • the initiator sequence is immobilised on a solid support via a chemically-cleavable linker, such as a disulfide, allyl, or azide-masked hemiaminal ether linker. Therefore, in one embodiment, the method additionally comprises extracting the resultant contiguous oligonucleotide sequence by cleaving the chemical linker through the addition of tris(2- carboxyethyljphosphine (TCEP) or dithiothreitol (DTT) for a disulfide linker; palladium complexes or an allyl linker; or TCEP for an azide-masked hemiaminal ether linker.
  • TCEP tris(2- carboxyethyljphosphine
  • DTT dithiothreitol
  • the cycles of extension are performed using a polymerase enzyme and the nucleotide monomers are nucleoside triphosphates. In one embodiment, the cycles of extension are performed using a template independent polymerase and the nucleotide monomers are nucleoside triphosphates. In one embodiment, the cycles of extension are performed using a template independent polymerase and the nucleotide monomers are reversibly blocked nucleoside triphosphates.
  • the cycles of extension are performed using a template independent polymerase and the nucleotide monomers are 3'-reversibly blocked nucleoside triphosphates.
  • the 3'-blocked nucleoside 5'-triphosphate can be blocked by any chemical group that can be unmasked to reveal a 3'-OH.
  • the 3'-blocked nucleoside triphosphates can be blocked by a 3'-0-azidomethyl, 3'- aminooxy, 3'-0-allyl group, 3'-0-cyanoethyl, 3'-0-acetyl, 3'-0-nitrate, 3'-0-phosphate, 3'-0-acetyl levulinic ester, 3'-0-tert butyl dimethyl silane, 3'-0-trimethyl(silyl)ethoxymethyl, 3'-0-ortho- nitrobenzyl, and 3'-0-para-nitrobenzyl.
  • the 3'-blocked nucleoside 5'-triphosphates can also be blocked by any chemical group that can be directly utilized in chemical ligations, such as copper-catalyzed or copper-free azide-alkyne click reactions and tetrazine-alkene click reactions.
  • the 3'-blocked nucleoside triphosphates can include chemical moieties containing an azide, alkyne, alkene, and tetrazine.
  • the 3'-reversible block is selected from 3'-0-CH 2 N 3 , 3'-0-CH 2 CHCH 2 , 3'- 0-CH 2 CH 2 CN or 3'-0-NH 2 .
  • the template independent polymerase is a terminal deoxynucleotidyl transferase (TdT).
  • the template independent polymerase is a modified TdT.
  • the TdT is added in the presence of an extension solution comprising one or more buffers (e.g., Tris or cacodylate), one or more salts (e.g., Na + , K + , Mg 2+ , Mn z+ , Cu 2+ , Zn 2+ , Co 2+ , etc. all with appropriate counterions, such as Cl) and inorganic pyrophosphatase (e.g., the Saccharomyces cerevisiae homolog).
  • buffers e.g., Tris or cacodylate
  • salts e.g., Na + , K + , Mg 2+ , Mn z+ , Cu 2+ , Zn 2+ , Co 2+ , etc. all with appropriate counterions, such as Cl
  • buffers and salts depends on the optimal enzyme activity and stability.
  • the use of an inorganic pyrophosphatase helps to reduce the build-up of pyrophosphate due to nucleoside triphosphate hydrolysis by TdT. Therefore, the use of an inorganic pyrophosphatase has the advantage of reducing the rate of (1) backwards reaction and (2) TdT strand dismutation.
  • References herein to 'nucleoside triphosphates' refer to a molecule containing a nucleoside (i.e. a base attached to a deoxyribose or ribose sugar molecule) bound to three phosphate groups.
  • nucleoside triphosphates that contain deoxyribose are: deoxyadenosine triphosphate (dATP), deoxyguanosine triphosphate (dGTP), deoxycytidine triphosphate (dCTP) or deoxythymidine triphosphate (dTTP).
  • nucleoside triphosphates that contain ribose are: adenosine triphosphate (ATP), guanosine triphosphate (GTP), cytidine triphosphate (CTP) or uridine triphosphate (UTP).
  • Other types of nucleosides may be bound to three phosphates to form nucleoside triphosphates, such as naturally occurring modified nucleosides and artificial nucleosides.
  • nucleosides may include any functional equivalents of nucleosides including amine-masked nucleoside 5'-triphosphates, such as 6-azido-adenosine, 4-azido-cytidine, or 2-azido-guanosine.
  • references herein to '3'-blocked nucleoside triphosphates' refer to nucleoside triphosphates (e.g., dATP, dGTP, dCTP or dTTP) which have an additional group on the 3' end which prevents further addition of nucleotides, i.e., by replacing the 3'-OH group with a protecting group.
  • nucleoside triphosphates e.g., dATP, dGTP, dCTP or dTTP
  • references herein to '3'-block', '3'-blocking group' or '3'-protecting group' refer to the group attached to the 3' end of the nucleoside triphosphate which prevents further nucleotide addition.
  • This method uses reversible 3'-blocking groups which can be removed by cleavage to allow the addition of further nucleotides.
  • irreversible 3'-blocking groups refer to dNTPs where the 3'-OH group can neither be exposed nor uncovered by cleavage.
  • the 3'-blocked nucleoside 5'-triphosphate can be blocked by any chemical group that can be unmasked to reveal a 3'-OH.
  • the 3'-blocked nucleoside triphosphate can be blocked by a 3'-0- azidomethyl, 3'-aminooxy, 3'-0-allyl group, 3'-0-cyanoethyl, 3'-0-acetyl, 3'-0-nitrate, 3'-0- phosphate, 3'-0-acetyl levulinic ester, 3'-0-tert butyl dimethyl silane, 3'-0- trimethyl(silyl)ethoxymethyl, 3'-0-ortho-nitrobenzyl, and 3'-0-para-nitrobenzyl.
  • the 3'-blocked nucleoside 5'-triphosphate can also be blocked by any chemical group that can be directly utilized in chemical ligations, such as copper-catalyzed or copper-free azide-alkyne click reactions and tetrazine-alkene click reactions.
  • the 3'-blocked nucleoside triphosphate can include chemical moieties containing an azide, alkyne, alkene, and tetrazine.
  • references herein to 'cleaving agent' refer either to a substance which is able to cleave the 3'- blocking group from the 3'-blocked nucleoside triphosphate or to a substance which is able to cleave immobilised oligonucleotides from the solid support.
  • the cleaving agent is a chemical cleaving agent.
  • the cleaving agent is an enzymatic cleaving agent.
  • cleaving agent is dependent on the type of 3'-nucleoside blocking group used.
  • tris(2- carboxyethyl)phosphine (TCEP) or tris(hydroxypropyl)phosphine (THPP) can be used to cleave a 3'-0- azidomethyl group
  • palladium complexes can be used to cleave a 3'-0-allyl group
  • sodium nitrite can be used to cleave a 3'-aminoxy group. Therefore, in one embodiment, the cleaving agent is selected from: tris(2- carboxyethyl)phosphine (TCEP), a palladium complex or sodium nitrite.
  • the cleaving agent is added in the presence of a cleavage solution comprising a denaturant, such as urea, guanidinium chloride, formamide or betaine.
  • a denaturant such as urea, guanidinium chloride, formamide or betaine.
  • the cleavage solution comprises one or more buffers. It will be understood by the person skilled in the art that the choice of buffer is dependent on the exact cleavage chemistry and cleaving agent required.
  • the inventors have previously developed a selection of engineered terminal transferase enzymes, any of which may be used in the current process.
  • Terminal transferase enzymes are ubiquitous in nature and are present in many species. Many known TdT sequences have been reported in the NCBI database http://www.ncbi.nlm.nih.gov/. The sequences of the various described terminal transferases show some regions of highly conserved sequence, and some regions which are highly diverse between different species.
  • the inventors have modified the terminal transferase from Lepisosteus oculatus TdT (spotted gar) (shown below). However the corresponding modifications can be introduced into the analagous terminal transferase sequences from any other species, including the sequences listed above in the various NCBI entries.
  • the amino acid sequence of the spotted gar ( Lepisosteus oculatus) is shown below
  • SEQ ID 1 wild type spotted Gar TdT
  • the inventors have identified various regions in the amino acid sequence having improved properties. Certain regions improve the solubility and handling of the enzyme. Certain other regions improve the ability to incorporate nucleotides with modifications at the 3'-position.
  • modified terminal deoxynucleotidyl transferase (TdT) enzymes comprising amino acid modifications when compared to a wild type sequence SEQ ID NO 1 or a truncated version thereof or the homologous amino acid sequence of a terminal deoxynucleotidyl transferase (TdT) enzyme in other species or the homologous amino acid sequence of RoIm, RoIb, RoIl, and RoIq of any species or the homologous amino acid sequence of X family polymerases of any species, wherein the amino acid is modified at one or more of the amino acids:
  • K392, F394, 1397, K398, K400 K401, E402, L403, A404, A405, G406, R407, D411, A421, P422, P423,
  • Modifications which improve the incorporation of modified nucleotides can be at one or more of selected regions shown below. Regions were selected according to mutation data, sequence alignment, and structural data obtained from spotted gar TdT co-crystal I ized with DNA and a 3'- modified dNTP.
  • the second modification can be selected from one or more of the amino acid regions VAIF, MG A, MENHNQI, SEGPCLAFMRA, HAISSS, DQTKA, KGFHS, QADNA, HFTKMQK, SAAVCK, EAQA, TVRLI, GKEC, TPEMGK, YYDIV, DHFQK, LAAG, APPVDNF, FARHERKMLLDNHALYDKTKK, and
  • references to particular sequences include truncations thereof. Included herein are modified terminal deoxynucleotidyl transferase (TdT) enzyme comprising at least one amino acid modification when compared to a wild type sequence SEQ ID NO 1 or a truncated version thereof, or the homologous amino acid sequence of a terminal deoxynucleotidyl transferase (TdT) enzyme in other species, wherein the modification is selected from one or more of the amino acid regions WLLNRLINRLQNQGILLYYDIV, VAIF, MG A, MENHNQI, SEGPCLAFMRA, HAISSS, DQTKA, KGFHS,
  • TdT modified terminal deoxynucleotidyl transferase
  • Truncated proteins may include at least the region shown below (SEQ ID NO 2)
  • TdT modified terminal deoxynucleotidyl transferase
  • SEGPCLAFMRA HAISSS, DQTKA, KGFHS, QADNA, HFTKMQK, SAAVCK, EAQA, TVRLI, GKEC, TPEMGK, DHFQK, LAAG, APPVDNF, FARHERKMLLDNHALYDKTKK, and DYIDP of the sequence:
  • Sequence homology extends to all modified or wild-type members of family X polymerases, such as DNA RoIm (also known as DNA polymerase mu or POLM), DNA RoIb (also known as DNA polymerase beta or POLB), and DNA RoIl (also known as DNA polymerase lambda or POLL).
  • DNA RoIm also known as DNA polymerase mu or POLM
  • DNA RoIb also known as DNA polymerase beta or POLB
  • DNA RoIl also known known as DNA polymerase lambda or POLL.
  • TdT DNA polymerase mu
  • POLB DNA polymerase beta
  • DNA RoIl also known known as DNA polymerase lambda or POLL
  • family X polymerases can be trivially modified to either display template-dependent or template-independent nucleotidyl transferase activities. Therefore, all motifs, regions, and mutations demonstrated in this patent can be trivially extended to modified X family polymerases to enable modified X family polymerases to incorporate 3'-modified nucleotides, reversibly terminated nucleotides, and modified nucleotides in general to effect methods of nucleic acid synthesis.
  • Modifications which improve the solubility include a modification within the amino acid region WLLNRLINRLQNQGILLYYDIV shown highlighted in the sequence below.
  • Modifications which improve the incorporation of modified nucleotides can be at one or more of selected regions shown below.
  • the second modification can be selected from one or more of the amino acid regions VAIF, EDN, MG A, ENHNQ, FMRA, HAI, TKA, FHS, QADNA, MQK, SAAVCK, EAQA, TVR, KEC, TPEMGK, DHFQ, LAAG, APPVDN, FARHERKMLLDNHA, and YIDP shown highlighted in the sequence below.
  • a modified terminal deoxynucleotidyl transferase (TdT) enzyme comprising at least one amino acid modification when compared to a wild type sequence or the homologous amino acid sequence of a terminal deoxynucleotidyl transferase (TdT) enzyme in other species, wherein the modification is selected from one or more of the amino acid regions WLLNRLINRLQNQGILLYYDI, VAIF, EDN, MG A, ENHNQ, FMRA, HAI, TKA, FHS, QADNA, MQK, SAAVCK, EAQA, TVR, KEC, TPEMGK, DHFQ LAAG, APPVDN, FARHERKMLLDNHA, and YIDP of the sequence of a wild type sequence or the homologous regions in other species.
  • Homologous refers to protein sequences between two or more proteins that possess a common evolutionary origin, including proteins from superfamilies in the same species of organism as well as homologous proteins from different species. Such proteins (and their encoding nucleic acids) have sequence homology, as reflected by their sequence similarity, whether in terms of percent identity or by the presence of specific residues or motifs and conserved positions.
  • a variety of protein (and their encoding nucleic acid) sequence alignment tools may be used to determine sequence homology. For example, the Clustal Omega multiple sequence alignment program provided by the European Molecular Biology Laboratory (EMBL) can be used to determine sequence homology or homologous regions.
  • EMBL European Molecular Biology Laboratory
  • a first modification is within the amino acid region WLLNRLINRLQNQGILLYYDI of the sequence of a wild type sequence or the homologous region in other species;
  • a second modification is selected from one or more of the amino acid regions VAIF, EDN,
  • MG A ENHNQ FMRA, HAI, TKA, FHS, QADNA, MQK, SAAVCK, EAQA, TVR, KEC, TPEMGK, DHFQ LAAG,
  • APPVDN, FARHERKMLLDNHA, and YIDP of the sequence of a wild type sequence or the homologous regions in other species The modification within the region WLLNRLINRLQNQGILLYYDIV or the corresponding region from other species help improve the solubility of the enzyme.
  • the modification within the amino acid region WLLNRLINRLQNQGILLYYDIV can be at one or more of the underlined amino acids.
  • W-Q N-P R-K, L-V, R-L, L-W, Q-E, N-K, Q-K or l-L.
  • the sequence WLLNRLINRLQNQGILLYYDIV can be altered to QLLPKVINLWEKKGLLLYYDLV.
  • the second modification improves incorporation of nucleotides having a modification at the 3' position in comparison to the wild type sequence.
  • the second modification can be selected from one or more of the amino acid regions VAIF, EDN, MG A, ENHNQ, FMRA, HAI, TKA, FHS, QADNA, MQK, SAAVCK, EAQA, TVR, KEC, TPEMGK, DHFQ, LAAG, APPVDN, FARHERKMLLDNHA, and YIDP of the sequence of a wild type sequence or the homologous regions in other species.
  • the second modification can be selected from two or more of the amino acid regions VAIF, EDN, MGA, ENHNQ, FMRA, HAI, TKA, FHS, QADNA, MQK, SAAVCK, EAQA, TVR, KEC, TPEMGK, DHFQ, LAAG, APPVDN, FARHERKMLLDNHA, and YIDP of the sequence of a wild type sequence or the homologous regions in other species shown highlighted in the sequence below.
  • the identified positions commence at positions V32, E74, M108, F182, T212, D271, M279, E298, A421, L456, Y486.
  • Modifications disclosed herein contain at least one modification at the defined positions.
  • the modified amino acid can be in the region FMRA.
  • the modified amino acid can be in the region QADNA.
  • the modified amino acid can be in the region EAQA.
  • the modified amino acid can be in the region APP.
  • the modified amino acid can be in the region LDNHA.
  • the modified amino acid can be in the region YIDP.
  • the region FARHERKMLLDNHA is advantageous for removing substrate biases in modifications.
  • the FARHERKMLLDNHA region appears highly conserved across species.
  • the modification selected from one or more of the amino acid regions FMRA, QADNA, EAQA, APP, FARHERKMLLDNHA, and YIDP can be at the underlined amino acid(s).
  • the positions for modification can include A53, V68, V71, D75, E97, 1101, G109, Q115, V116, S125, T137, Q143, N154, H155, Q157, 1158, 1165, G177, L180, A181, M183, A195, K200, T212, K213, A214,
  • Amino acid changes include any one of A53G, V68I, V71I, D75N, D75Q, E97A, 1101V, G109E, G109R, Q115E, V116I, V116S, S125R, T137A, Q143P, N154H, H155C, Q157K, Q157R, I158M, 1165V, G177D,
  • N352Q N360K, Q361K, I363L, Y367C, H389A, L403R, G406R, D411N, A421L, A421M, A421V, P422A, P422C, V424Y, N426R, R438K, F447W, R452K, L455I, and/or D488P.
  • Amino acid changes include any two or more of A53G, V68I, V71I, D75N, D75Q, E97A, 1101V, G109E, G109R, Q115E, V116I, V116S, S125R, T137A, Q143P, N154H, H155C, Q157K, Q157R, I158M, 1165V,
  • the modification of QADNA to KADKA, QADKA, KADNA, QADNS, KADNT, or QADNT is advantageous for the incorporation of 3'-0-modified nucleoside triphosphates to the 3'-end of nucleic acids and removing substrate biases during the incorporation of modified nucleoside triphosphates.
  • the modification of APPVDN to MCPVDN, MPPVDN, ACPVDR, VPPVDN, LPPVDR, ACPYDN, LCPVDN, or MAPVDN is advantageous for the incorporation of 3'-0-modified nucleoside triphosphates to the 3'- end of nucleic acids and removing substrate biases during the incorporation of modified nucleoside triphosphates.
  • FARHERKMLLDRHA WARHERKMILDNHA, FARHERKMILDNHA, WARHERKMLLDNHA, FARHERKMLLDRHA, or FARHEKKMLLDNHA is also advantageous for the incorporation of 3'-0-modified nucleoside triphosphates to the 3'-end of nucleic acids and removing substrate biases during the incorporation of modified nucleoside triphosphates.
  • the modification can be selected from one or more of the following sequences FRRA, QADKA, EADA, MPP, FARHERKMLLDRHA, and YIPP. Included is a modified terminal deoxynucleotidyl transferase (TdT) enzyme wherein the second modification is selected from two or more of the following sequences FRRA, QADKA, EADA, MPP, FARHERKMLLDRHA, and YIPP. Included is a modified terminal deoxynucleotidyl transferase (TdT) enzyme wherein the second modification contains each of the following sequences FRRA, QADKA, EADA, MPP, FARHERKMLLDRHA, and YIPP.
  • composition for monitoring the quality of template-independent nucleic acid synthesis comprising measuring the amount of incorporated nucleotide monomer on each reaction cycle comprising inorganic pyrophosphatase, a terminal deoxynucleotidyl transferase (TdT), a 3'-0- reversibly blocked dNTP and a fluorescently labelled phosphate binding protein.
  • composition for monitoring the quality of template-independent nucleic acid synthesis comprising measuring the amount of incorporated nucleotide monomer on each reaction cycle comprising; a terminal deoxynucleotidyl transferase (TdT), a 3'-0-reversibly blocked dNTP and a terpyridine-Zn(ll) complex.
  • TdT terminal deoxynucleotidyl transferase
  • a 3'-0-reversibly blocked dNTP and a terpyridine-Zn(ll) complex.
  • Example 1 Coupling of pyrophosphatase and phosphate-binding protein labelled to a fluorophore to monitor and quantitate reversibly terminated nucleotide incorporation by engineered TdTs.
  • PBP-MDCC (Thermo Fisher Scientific) was tested as a means to monitor non-templated enzymatic DNA synthesis.
  • PBP-MDCC (0.5 mM), engineered TdT (0.04 mg/ml), pyrophosphatase (0.01 mg/m!), dATP-ONH 2 (0.25 mM), fluorescently labelled oligonucleotide initiator (purchased from IDT; SEQ IDS 662 and 663 in Table 1; 5 mM) (Seq IDs 3 and 4), and required buffer components were mixed together and incubated at 37 °C for 10 minutes. Reactions were monitored at an excitation and emission wavelength of 430 and 460 nm, respectively, and measured 60 s after reaction initiation.
  • Phosphate mopping refers to incubation of solutions with purine nucleoside phosphorylase (Sigma-Aldrich; from microbial sources). PNPase was used at 1 unit/ml and 7-methylguanosine (MEG) was used at 200 mM final concentration (37 °C for 10 minutes).
  • “Spike Mop” in the figure above only contains buffer components and 20 mM phosphate, which were subsequently phosphate mopped.
  • "GA(ddC)” 1 and 2 contain oligonucleotide SEQ NO 662 from Table 1 whereas “GAC” 1 and 2 contains oligonucleotide SEQ NO 663 from Table 1.
  • GA(ddC) and GAC are fully constituted mixtures with all components necessary for reversibly terminated nucleotide addition.
  • No mop contains GA(ddC) reaction components, but no phosphate mop was performed previous to reaction initiation.
  • "Spike” contains only buffer components and 20 mM phosphate.
  • Example 2 Pyrophosphate detection as a method to monitor and quantitate reversibly terminated nucleotide incorporation by engineered TdTs.
  • Engineered TdT (only present if indicated; 0.04 mg/ml), pyrophosphatase (only present if indicated; 0.01 mg/ml), dTTP-ONH 2 (only present if indicated; 0.25 mM), immobilized oligonucleotide initiator (only present if indicated; 1 pmol), and required buffer components were mixed together and incubated at 37 °C for the indicated amount of time.
  • Reaction conditions were (1) buffer + pyrophosphate + immobilized oligonucleotide (0.1 mM); (2) buffer + pyrophosphate + immobilized oligonucleotide (0.5 mM); (3) buffer + pyrophosphate + immobilized oligonucleotide (1.0 mM); (4) buffer + engineered TdT + dTTP-ONH2 + immobilized oligonucleotide (0 min); (5) buffer + engineered TdT + dTTP-ONH 2 + immobilized oligonucleotide (10 min); (6) buffer + dTTP-ONH 2 + immobilized oligonucleotide (10 min); and (7) buffer + dTTP-ONH 2 + immobilized oligonucleotide + pyrophosphatase (10 min).
  • nucleotide incorporation over time can be monitored by the ATP sulfurylase - !uciferase bioluminescence regenerative cycle. If pyrophosphatase is included in the reaction mixture, then no luminescence is detected, confirming that the assay is specific for pyrophosphate.

Abstract

The invention relates to methods and compositions for the ensuring the quality of oligonucleotide synthesis during cycles of template independent terminal transferase extension. The amount of incorporated nucleotide can be determined for each cycle of extension, thereby confirming that the supplied nucleotide has efficiently incorporated to extend the strand.

Description

Method of Quality Control of Oligonucleotide Synthesis
FIELD OF THE INVENTION
The invention relates to methods and compositions for the quality control of the synthesis of oligonucleotides during cycles of template independent enzymatic extension.
BACKGROUND OF THE INVENTION
Nucleic acid synthesis is vital to modern biotechnology. The rapid pace of development in the biotechnology arena has been made possible by the scientific community's ability to artificially synthesise DNA, RNA and proteins.
Artificial DNA synthesis allows biotechnology and pharmaceutical companies to develop a range of peptide therapeutics, such as insulin for the treatment of diabetes. It allows researchers to characterise cellular proteins to develop new small molecule therapies for the treatment of diseases our aging population faces today, such as heart disease and cancer. It even paves the way forward to creating life, as the Venter Institute demonstrated in 2010 when they placed an artificially synthesised genome into a bacterial cell.
However, current DNA synthesis technology does not meet the demands of the biotechnology industry. Despite being a mature technology, it is highly challenging to synthesise a DNA strand greater than 200 nucleotides in length in viable yield, and most DNA synthesis companies only offer up to 120 nucleotides routinely. In comparison, an average protein-coding gene is of the order of 2000- 3000 contiguous nucleotides, a chromosome is at least a million contiguous nucleotides in length and an average eukaryotic genome numbers in the billions of nucleotides. In order to prepare nucleic acid strands thousands of base pairs in length, all major gene synthesis companies today rely on variations of a 'synthesise and stitch' technique, where overlapping 40-60-mer fragments are synthesised and stitched together by enzymatic copying and extension. Current methods generally allow up to 3 kb in length for routine production. The reason DNA cannot be chemically synthesised beyond 120-200 nucleotides at a time is due to the current methodology for generating DNA, which uses synthetic chemistry (i.e., phosphoramidite technology) to couple a nucleotide one at a time to make DNA. Even if the efficiency of each nucleotide-coupling step is 99% efficient, it is mathematically impossible to synthesise DNA longer than 200 nucleotides in acceptable yields. The Venter Institute illustrated this laborious process by spending 4 years and 20 million USD to synthesise the relatively small genome of a bacterium.
Known methods of DNA sequencing use template-dependent DNA polymerases to add 3'-reversibly terminated nucleotides to a growing double-stranded substrate. In the 'sequencing-by-synthesis' process, each added nucleotide contains a dye, allowing the user to identify the exact sequence of the template strand. Albeit on double-stranded DNA, this technology is able to produce strands of between 500-1000 bps long. However, this technology is not suitable for de novo nucleic acid synthesis because of the requirement for an existing nucleic acid strand to act as a template.
Various attempts have been made to use a terminal deoxynucleotidyl transferase (TdT) for de novo single-stranded DNA synthesis. Uncontrolled de novo single stranded DNA synthesis, as opposed to controlled, takes advantage of TdT's deoxynucleotide triphosphate (dNTP) 3' tailing properties on single-stranded DNA to create, for example, homopolymeric adaptor sequences for next-generation sequencing library preparation. In controlled extensions, a reversible deoxynucleotide triphosphate termination technology needs to be employed to prevent uncontrolled addition of dNTPs to the 3'- end of a growing DNA strand. The development of a controlled single-stranded DNA synthesis process through TdT would be invaluable to in situ DNA synthesis for gene assembly or hybridization microarrays as it removes the need for an anhydrous environment and allows the use of various polymers incompatible with organic solvents. However, TdT has not been shown to efficiently add nucleoside triphosphates containing 3'-0- reversibly terminating moieties for building up a nascent single-stranded DNA chain necessary for a de novo synthesis cycle, and thus the synthesis of long strands is inefficient. There is therefore a need for a new method to efficiently prepare long strands of oligonucleotides in order to provide an improved method of nucleic acid synthesis that is able to overcome the problems associated with currently available methods. The inventors of the current subject matter have developed such a method, which is disclosed in patent application WO2016128731. With this new method to efficiently prepare long strands of oligonucleotides comes the need for methods and compositions to evaluate the quality of the product/strands resulting from the template- independent synthesis. SUMMARY OF THE INVENTION
A primary source of error in solid-phase nucleic acid synthesis - both enzymatic and phosphoramidite-based - comes from the failure to add a nucleotide to the desired sequence. Such failure results in a deletion, which results in frameshift mutations in biological sequences. The ability to assess the nucleotide coupling efficiency and thus the quality with which a particular nucleic acid was synthesized is critical regardless of the method used to synthesize said nucleic acid.
When synthesizing nucleic acids using phosphoramidite chemistry, coupling efficiency is one of the primary metrics used to assess synthesis quality. Coupling efficiency in phosphoramidite nucleic acid synthesis is measured in-situ by virtue of the colour of deprotected 5'-dimethoxytrityl (DMT) groups. The 5'-DMT groups act as reversible terminators in phosphoramidite synthesis, controlling the addition of one and only one amidite per synthesis cycle. When deprotected, the orange colour of the DMT cation released from the 5'-DMT group is eluted; such colour can be quantitated via a spectrophotometer to determine coupling efficiency per phosphoramidite synthesis cycle.
However, unlike phosphoramidite chemistry, non-templated enzymatic nucleic acid synthesis does not result in the production of any groups readily detectable via spectrophotometric means. In addition to the incorporation of a nucleotide, enzymatic nucleic acid synthesis results in the production of inorganic pyrophosphate and a proton. Whilst not in itself readily detectable via spectrophotometric means, inorganic pyrophosphate is detectable through a variety of means, such as enzyme-coupled reactions. The preference to rapidly remove inorganic pyrophosphate due to the capability of TdT to perform strand dismutation reactions, complicates the detection of inorganic pyrophosphate as a means to quantitate coupling efficiency. Furthermore such reactions often resynthesise nucleoside triphosphate molecules, which is problematic in reactions where nucleotide triphosphates are the TdT substrates.
Described herein are methods and compositions for quality control of the synthesis of oligonucleotides. For each cycle where a nucleotide monomer is supplied, the amount of nucleotide monomer used to extend the strand can be detected. The quality of synthesis is determined base- by-base as synthesis progresses, rather than simply by measuring the purity of the full-length strands at the end of the synthesis.
The monitoring is performed on one or more of the nucleotide monomer reaction cycles. The monitoring can be performed on each cycle. The term each cycle refers to monitoring directly after a nucleotide has been added. The quality measurements are performed during or after extension cycles before the completed strand is assembled. The term refers to monitoring after nucleotide extension, rather than just at the end of the synthesis, and does not require that every cycle is monitored. Thus if a 100 cycles of extension are performed, monitoring every other cycle is within the scope of the invention. Similarly if 50 cycles are performed only 49 can be monitored. The claim is merely requiring that the quality of the synthesis is determined as the synthesis progresses not just at the end, not that every cycle must be monitored. Monitoring each and every cycle is within the scope of the claimed invention, but is not essential.
Described is a technique for detecting the incorporation of one or more nucleotides into a growing chain by template-independent synthesis. The described method relies on detection of changes in heat, pH, phosphate concentration and/or pyrophosphate concentration as a result of template- independent DNA synthesis.
Figure imgf000005_0001
The formula above represents the incorporation of a nucleotide, dNTP, which could be any modified or non-modified nucleotide, including, G: Guanine, A: Adenine; T: Thymine, or C: Cytosine as incorporated into a growing DNA strand. The nucleotide need not be limited to dNTPs (i.e., 2'-deoxy NTPs). They include any polyphosphate species that can be incorporated by an enzyme, including ribonucleoside 5'-triphosphates. The nucleotides can be reversibly blocked such that only one monomer per strand is incorporated. The block may be anywhere on the monomer, including optionally at the 3'- position. The nucleotides can also be reversibly blocked at the nitrogenous base by molecular entities such as a small molecule, peptide, oligosaccharide, polymer, or protein. The nucleotide may also be amine-masked to mask the amino groups on the nitrogenous base and prevent hydrogen bonding. The amino-groups may subsequently be unmasked to reveal a free amino (NH2) group. The nucleotides are generally unlabelled in order to synthesise a non-modified strand, but the nucleotides can optionally be labelled. In order to control the desired sequence, a single nucleotide species is generally added per cycle, but more than one nucleotide can be added if the desired strand sequence is degenerate.
The DNA polymerase can be a terminal deoxynucleotidyl transferase. The enzyme can be modified to increase the incorporation of 3'- blocked nucleotides. The incorporation of the nucleotide in the above reaction is monitored, by monitoring changes in heat, pH, levels of pyrophosphate or levels of phosphate, to provide quality control information.
The T in the above reaction is about 22 kT or ~570 meV per nucleotide incorporation, and may be measured in accordance with the present invention, as well as DrH.
BRIEF DESCRIPTION OF THE FIGURES
Figure 1: One embodiment of phosphate detection. Phosphate generated from non-templated enzymatic nucleic acid synthesis is quantified through methods such as enzyme binding, enzyme- coupled reaction, and/or inorganic molecule binding. The aforementioned methodologies generate a detectable signal to quantitate the level of nucleotide incorporation.
Figure 2: Coupling of pyrophosphatase and phosphate-binding protein labelled to a fluorophore to monitor and quantitate reversibly terminated nucleotide incorporation by engineered TdTs.
Figure 3: Pyrophosphate detection as a method to monitor and quantitate reversibly terminated nucleotide incorporation by engineered TdTs.
DETAILED DESCRIPTION OF THE INVENTION
Heat:
Disclosed herein is a method for monitoring the incorporation of one or more nucleotides into a growing chain by template-independent synthesis by monitoring temperature change. Nucleotide incorporation results in an increased temperature in solution generated by released pyrophosphate. The splitting of pyrophosphate into two inorganic phosphate molecules, accomplished for example with the addition of pyrophosphatases, will produce further temperature change. Thus, in one embodiment of the invention, changes in temperature are monitored by a suitable sensitive instrument to detect if successful nucleotide incorporation has occurred. pH:
Disclosed herein is a method for monitoring the incorporation of one or more nucleotides into a growing chain by template-independent synthesis by monitoring pH change. Nucleotide incorporation results in an increased negative charge in solution generated by released pyrophosphate and inorganic phosphate. The splitting of pyrophosphate into two inorganic phosphate molecules, accomplished for example with the addition of pyrophosphatases, will further lower the charge in solution. Thus, in one embodiment of the invention, changes in pH are monitored by a suitable sensitive instrument such as an ion-sensitive field-effect transistor (ISFET) to detect if successful nucleotide incorporation has occurred. Pyrophosphate:
Disclosed herein are methods for monitoring the incorporation of one or more nucleotides into a growing chain by template-independent synthesis by monitoring the pyrophosphate concentration. In one method, pyrophosphate is detected by interaction, chelation, binding, or association with a small molecule. In another method, pyrophosphate is detected by interaction, chelation, binding, or association with a biomolecule. The biomolecule may be formed from protein, DNA, RNA, or a combination thereof. These interactions result in a detectable signal.
In one embodiment, the small molecule is or contains a chelated Zn(ll) complex. In another embodiment, the small molecule is a terpyridine-Zn(ll) complex. In a further embodiment, the terpyridine-Zn(ll) complex is carbazole or amino modified. In a further embodiment, the molecule containing a terpyridine-Zn(ll) complex interacts with pyrophosphate and generates a detectable signal, such as a change in absorbance, chemiluminescence, and/or fluorescence.
In another method, real-time bioluminometric detection of released pyrophosphate is detected as a result of successful nucleotide incorporation. In one embodiment of the invention, the released pyrophosphate is converted to ATP by ATP sulfurylase from adenosine 5'-phosphosulfate and the level of ATP is sensed by a luciferase producing a proportional light signal, which is detected by photo sensing devices. In another embodiment of the invention, the reacted addition solution is removed from the immobilized oligonucleotide and the released pyrophosphate in the removed addition solution is then detected by the bioluminescence regenerative cycle involving ATP sulfurylase and luciferase. The advantage of detecting pyrophosphate in removed addition solution is that there is no potential for the reagents of the bioluminescence regenerative cycle (e.g., addition of the ATP sulfurylase product, ATP, by TdT to the oligonucleotide initiator) to interfere with the enzymatic nucleic acid synthesis process.
In an embodiment of the invention, incorporation of one or more nucleotides into a growing chain by template-independent synthesis is monitored using a bioluminescence regenerative cycle (BRC). In BRC, steady state levels of bioluminescence result from processes that produce pyrophosphate. Pyrophosphate reacts in the presence of ATP sulfurylase and adenosine 5'-phosphosulfate to produce ATP. The ATP reacts with luciferin in a luciferase-catalysed reaction, producing light and regenerating pyrophosphate. The pyrophosphate is recycled to produce ATP and the regenerative cycle continues. Because the kinetic properties of ATP sulfurylase are much faster than luciferase, a steady state results wherein concentrations of ATP and pyrophosphate and the rate of light production remain relatively constant. Photons are counted over a time interval to determine the number of successful nucleotide incorporations.
In one embodiment of the invention there is provided a kit for use in monitoring changes in pyrophosphate concentration.
The nucleotide may carry a reversible block. The nucleotide may carry a 3'-block. The 3'-block is a protecting group on the 3' hydroxyl which can be removed to release the 3'-OH, in which case the block acts as a reversible terminator. The nucleotide may carry a reversible terminator attached to the nitrogenous base. The base-blocking group is a protecting group which can be removed to enable addition of the nucleotide by a template independent enzyme such as a TdT. The nucleotide may be covalently or non-covalently bound to a template independent enzyme such as a TdT. Each mention of the term block includes optionally a 3'-block.
Disclosed is a method of enzymatic nucleic acid synthesis which includes:
a) providing an immobilized nucleic acid initiator sequence;
b) adding a blocked nucleoside triphosphate to said initiator sequence through the exposure of the immobilized nucleic acid initiator sequence to a reaction solution comprising:
• a blocked nucleoside triphosphate(s);
• a nucleic acid transferase or template independent polymerase, such as an engineered terminal deoxynucleotidyl transferase (TdT);
• required buffer components; and
• pyrophosphate-sensing reagents, such as a terpyridine-Zn(ll) complex or the simultaneous presence of all of the following: adenosine 5'-phosphosulfate, ATP sulfuryase, luciferin, and luciferase;
c) quantification of quantity of pyrophosphate generated during step (b);
d) removing all reagents from the initiator sequence; e) cleaving the blocking group from the blocked nucleoside in the presence of a cleaving agent; and
f) removing the cleaving agent by washing the immobilized initiator nucleic acid with wash solution.
Steps (b) - (f) can be repeated to add greater than one nucleotide to the above DNA initiator sequence.
A further method of enzymatic nucleic acid synthesis could include:
a) providing an immobilized nucleic acid initiator sequence;
b) adding a blocked nucleoside triphosphate to said initiator sequence through the exposure of the immobilized nucleic acid initiator sequence to a reaction solution comprising:
• a blocked nucleoside triphosphate(s);
• a nucleic acid transferase or template independent polymerase, such as an engineered terminal deoxynucleotidyl transferase (TdT);
• required buffer components;
c) removing the reaction solution of (b) from the immobilized nucleic acid initiator;
d) introducing reagents for the quantification of pyrophosphate generated during (b) to the removed solution of (c), reagents for the quantification of pyrophosphate can include a terpyridine-Zn(ll) complex, adenosine 5'-phosphosulfate, ATP sulfuryase, luciferin, luciferase, or a combination thereof;
e) quantification of pyrophosphate in the removed reaction solution of (c);
f) washing of the immobilized initiator nucleic acid with wash solution;
g) cleaving the blocking group from the blocked nucleoside in the presence of a cleaving agent; and
h) removing the cleaving agent by washing the immobilized initiator nucleic acid with wash solution.
Steps (b) - (h) can be repeated to add greater than one nucleotide to the above nucleic acid initiator sequence.
The further method of enzymatic nucleic acid synthesis directly mentioned above has the advantage of segregating the incorporation of a blocked nucleoside triphosphate from the detection of a produced pyrophosphate molecule. This segregation is advantageous as the pyrophosphate detection reagents may interfere with the incorporation of a blocked nucleoside triphosphate. For example, when using adenosine 5'-phosphosulfate, ATP sulfuryase, luciferin, luciferase, or a combination thereof to detect pyrophosphate through bioluminescence, ATP is generated. ATP can be incorporated by a TdT to the 3'-end of a nucleic acid initiator in a method of enzymatic nucleic acid synthesis. This incorporation thus represents a mutation, which is detrimental to the accuracy of nucleic acid synthesis.
Phosphate:
Disclosed herein is a method for monitoring the incorporation of one or more nucleotides into a growing chain by template-independent synthesis by monitoring the phosphate concentration over time. The method involves the use of an inorganic pyrophophatase enzyme to catalyse the conversion of pyrophosphate, one molecule of which is released during the successful addition of one nucleotide, into two phosphate ions. The presence of the phosphate ions, and hence the presence of the original pyrophosphate molecule, can be detected and information on the success or failure of nucleotide addition gained.
Here, we present a method to quantitate coupling efficiencies in a method or methods of enzymatic nucleic acid synthesis. Instead of quantitating inorganic pyrophosphate, we couple the reaction of inorganic pyrophosphatase, which converts one molecule of inorganic pyrophosphate to two molecules of inorganic phosphate, to a phosphate-sensing assay. Through this method, we rapidly remove inorganic pyrophosphate, which may be detrimental to enzymatic nucleic acid synthesis, and detect the quantity of phosphate generated as a means of determining coupling efficiency.
In one embodiment of the invention the presence of phosphate ions is detected using a phosphate binding protein (PBP).
In one embodiment of the invention the phosphate binding protein is natural, in another embodiment of the invention the phosphate binding proteins sequence is modified. In one embodiment of the invention, the phosphate binding protein is E. coli phosphate binding protein.
In one embodiment of the invention the phosphate binding protein is conjugated to a fluorescent tag. In one embodiment of the invention there is provided a kit for use in monitoring changes in phosphate concentration.
In one embodiment, the method of nucleic acid synthesis involves the use of TdT in non-templated enzymatic DNA synthesis with 3'-0-reversibly terminated 2'-deoxynucleoside 5'-triphosphates (dNTPs). When TdT incorporates a dNTP to the 3'-end of an oligonucleotide, inorganic pyrophosphate and a proton is generated. When coupled to inorganic pyrophosphatase (PPiase), inorganic pyrophosphate is converted into inorganic phosphate. Thus, for every 1 molecule of dNTP incorporated by TdT, 2 molecules of inorganic phosphate are generated. Through a phosphate sensing assay, the coupling efficiency as defined by can be determined. In the coupling
Figure imgf000011_0002
efficiency equation, N is known a priori as the quantity of free 3'-OH ends supplied by
Figure imgf000011_0001
5'-immobilized oligonucleotides and N+l is quantified through a phosphate-sensing assay.
In one embodiment, the phosphate sensing assay above utilizes a phosphate binding protein (PBP), such as E. coli PBP containing the mutation A197C, covalently coupled to fluorescent dye, such as N- [2-(l-maleimidyl)ethyl]-7-(diethylamino)coumarin-3-carboxamide (MDCC) or tetramethylrhodamine (rho). In E. coli PBP, the MDCC or rho is covalently coupled to PBP. For example, MDCC is covalently coupled to PBP through a thioester bond at A197C. When E. coli PBP-MDCC or PBP-rho binds to a molecule of inorganic phosphate, a 7-fold or 18-fold increase in fluorescent signal results, respectively.
In another embodiment, the phosphate-sensing assay is based on reaction of inorganic phosphate with maltose, in the presence of an enzyme, to produce glucose. The glucose is then specifically oxidized to generate a product that reacts with a probe to generate fluorescence. In one embodiment, the enzyme is maltose phosphorylase which converts maltose, in the presence of inorganic phosphate, to glucose-l-phosphate and glucose. Then, glucose oxidase converts the glucose to gluconolactone and H2O2. Finally, with horseradish peroxidase (HRP) as a catalyst, the H2O2 reacts with Amplex Red reagent to generate resorufin. The resulting increase in fluorescence or absorption is proportional to the amount of inorganic phosphate in the sample.
In another embodiment, the phosphate-sensing assay consists of complex formation of inorganic phosphate with ammonium molybdate, which is detectable as a colour change
In another embodiment, the phosphate-sensing assay is based on quantification of the green complex formed between Malachite Green, ammonium molybdate and free phosphate.
In another embodiment, the phosphate-sensing assay consists of a phosphorylase and a substrate that becomes detectable upon phosphorylation. Examples include utilizing a phosphorylase in the presence of 7-methylguanosine or 2-amino-6-mercapto-7-methylpurine.
A method of enzymatic nucleic acid synthesis could include:
a) providing an immobilized nucleic acid initiator sequence;
b) adding a blocked nucleoside triphosphate to said initiator sequence through the exposure of the immobilized nucleic acid initiator sequence to a reaction solution comprising:
• a blocked nucleoside triphosphate(s);
• a nucleic acid transferase or template independent polymerase, such as an engineered terminal deoxynucleotidyl transferase (TdT);
• required buffer components;
• pyrophosphatase; and
• phosphate-sensing reagents;
c) quantification of quantity of phosphate generated during step (b);
d) removal of all reagents from the initiator sequence;
e) cleaving the blocking group from the blocked nucleoside in the presence of a cleaving agent; and
f) removal of the cleaving agent by washing of the immobilized initiator nucleic acid with wash solution.
Steps (b) - (f) can be repeated to add greater than one nucleotide to the above nucleic acid initiator sequence.
A further method of enzymatic nucleic acid synthesis could include:
a) providing an immobilized nucleic acid initiator sequence;
b) adding a blocked nucleoside triphosphate to said initiator sequence through the exposure of the immobilized nucleic acid initiator sequence to a reaction solution comprising:
• a blocked nucleoside triphosphate(s);
• a nucleic acid transferase or template independent polymerase, such as an engineered terminal deoxynucleotidyl transferase (TdT);
• required buffer components; c) removing the reaction solution of (b) from the immobilized nucleic acid initiator;
d) introduction of reagents for the hydrolysis of pyrophosphate and quantification of released phosphate generated during (b) to the removed reaction solution of (c);
e) quantification of released phosphate in the removed solution of (c);
f) washing the immobilized initiator nucleic acid with wash solution;
g) cleaving the blocking group from the blocked nucleoside in the presence of a cleaving agent; and
h) removal of the cleaving agent by washing the immobilized initiator nucleic acid with wash solution.
Steps (b) - (h) can be repeated to add greater than one nucleotide to the above nucleic acid initiator sequence.
The further method of enzymatic nucleic acid synthesis directly mentioned above has the advantage of segregating the incorporation of a blocked nucleoside triphosphate from the detection of produced phosphate molecules. This segregation may be advantageous if the phosphate detection reagents interfere with the incorporation of a blocked nucleoside triphosphate.
References herein to an 'initiator sequence' refer to a short oligonucleotide with a free 3'-end which the nucleotide monomers can attach. In one embodiment, the initiator sequence is a DNA initiator sequence. In an alternative embodiment, the initiator sequence is an RNA initiator sequence.
The strands are synthesised using enzymatic means. The cycles of extension can be performed using a nucleic acid transferase or template independent polymerase, such as an engineered terminal deoxynucleotidyl transferase (TdT), where the nucleotide monomers are nucleoside triphosphates.
The length of the synthesized strands can be for example at least 25 bases (n = 25).
The reversibly blocked nucleoside triphosphates can be 3'-reversibly blocked nucleoside triphosphates. The 3'-reversible block can be selected from 3'-0-CH2N3, 3'-0-CH2CHCH2, 3'-0- CH2CH2CN or 3'-0-NH2.
References herein to a 'DNA initiator sequence' include a small sequence of DNA which a blocked nucleotide triphosphate can be attached to, i.e., DNA will be synthesised from the end of the DNA initiator sequence. In one embodiment, the initiator sequence is single-stranded. In an alternative embodiment, the initiator sequence is double-stranded. It will be understood by persons skilled in the art that a 3'-overhang (l.e., a free 3'-end) allows for efficient addition.
In one embodiment, the initiator sequence is immobilised on a solid support. The initiator sequence may be attached to a solid support stable under aqueous conditions so that the method can be easily performed via a flow setup.
In one embodiment, the initiator sequence is immobilised on a solid support via a reversible interacting moiety, such as a chemically-cleavable linker, an antibody/immunogenic epitope, a biotin/biotin binding protein (such as avidin or streptavidin), or glutathione-GST tag. Therefore, in a further embodiment, the method additionally comprises extracting the resultant nucleic acid by removing the reversible interacting moiety in the initiator sequence, such as by incubating with proteinase K.
In one embodiment, the initiator sequence contains a base or base sequence recognisable by an enzyme. A base recognised by an enzyme, such as a glycosylase, may be removed to generate an abasic site which may be cleaved by chemical or enzymatic means. A base sequence may be recognised and cleaved by a restriction enzyme. Thus in one embodiment, the resultant contiguous oligonucleotide sequence is released from being immobilised. In one embodiment, this release occurs by removing a non-canonical base from one or more of the immobilised oligonucleotides and cleaving the strands at the resultant abasic site. In one embodiment, the non-canonical base is uracil, which is removed by uracil DNA glycosylase. In an alternate embodiment, the non-canonical base is 8-oxoquanine, which is removed by formamidopyrimidine DNA glycosylase.
In an alternate embodiment, the initiator sequence is immobilised on a solid support via a chemically-cleavable linker, such as a disulfide, allyl, or azide-masked hemiaminal ether linker. Therefore, in one embodiment, the method additionally comprises extracting the resultant contiguous oligonucleotide sequence by cleaving the chemical linker through the addition of tris(2- carboxyethyljphosphine (TCEP) or dithiothreitol (DTT) for a disulfide linker; palladium complexes or an allyl linker; or TCEP for an azide-masked hemiaminal ether linker. In one embodiment, the cycles of extension are performed using a polymerase enzyme and the nucleotide monomers are nucleoside triphosphates. In one embodiment, the cycles of extension are performed using a template independent polymerase and the nucleotide monomers are nucleoside triphosphates. In one embodiment, the cycles of extension are performed using a template independent polymerase and the nucleotide monomers are reversibly blocked nucleoside triphosphates.
In one embodiment, the cycles of extension are performed using a template independent polymerase and the nucleotide monomers are 3'-reversibly blocked nucleoside triphosphates. The 3'-blocked nucleoside 5'-triphosphate can be blocked by any chemical group that can be unmasked to reveal a 3'-OH. The 3'-blocked nucleoside triphosphates can be blocked by a 3'-0-azidomethyl, 3'- aminooxy, 3'-0-allyl group, 3'-0-cyanoethyl, 3'-0-acetyl, 3'-0-nitrate, 3'-0-phosphate, 3'-0-acetyl levulinic ester, 3'-0-tert butyl dimethyl silane, 3'-0-trimethyl(silyl)ethoxymethyl, 3'-0-ortho- nitrobenzyl, and 3'-0-para-nitrobenzyl.
The 3'-blocked nucleoside 5'-triphosphates can also be blocked by any chemical group that can be directly utilized in chemical ligations, such as copper-catalyzed or copper-free azide-alkyne click reactions and tetrazine-alkene click reactions. The 3'-blocked nucleoside triphosphates can include chemical moieties containing an azide, alkyne, alkene, and tetrazine.
In a particular embodiment, the 3'-reversible block is selected from 3'-0-CH2N3, 3'-0-CH2CHCH2, 3'- 0-CH2CH2CN or 3'-0-NH2.
In one embodiment, the template independent polymerase is a terminal deoxynucleotidyl transferase (TdT). In one embodiment, the template independent polymerase is a modified TdT. In one embodiment, the TdT is added in the presence of an extension solution comprising one or more buffers (e.g., Tris or cacodylate), one or more salts (e.g., Na+, K+, Mg2+, Mnz+, Cu2+, Zn2+, Co2+, etc. all with appropriate counterions, such as Cl) and inorganic pyrophosphatase (e.g., the Saccharomyces cerevisiae homolog). It will be understood that the choice of buffers and salts depends on the optimal enzyme activity and stability. The use of an inorganic pyrophosphatase helps to reduce the build-up of pyrophosphate due to nucleoside triphosphate hydrolysis by TdT. Therefore, the use of an inorganic pyrophosphatase has the advantage of reducing the rate of (1) backwards reaction and (2) TdT strand dismutation. References herein to 'nucleoside triphosphates' refer to a molecule containing a nucleoside (i.e. a base attached to a deoxyribose or ribose sugar molecule) bound to three phosphate groups. Examples of nucleoside triphosphates that contain deoxyribose are: deoxyadenosine triphosphate (dATP), deoxyguanosine triphosphate (dGTP), deoxycytidine triphosphate (dCTP) or deoxythymidine triphosphate (dTTP). Examples of nucleoside triphosphates that contain ribose are: adenosine triphosphate (ATP), guanosine triphosphate (GTP), cytidine triphosphate (CTP) or uridine triphosphate (UTP). Other types of nucleosides may be bound to three phosphates to form nucleoside triphosphates, such as naturally occurring modified nucleosides and artificial nucleosides. Such nucleosides may include any functional equivalents of nucleosides including amine-masked nucleoside 5'-triphosphates, such as 6-azido-adenosine, 4-azido-cytidine, or 2-azido-guanosine.
Therefore, references herein to '3'-blocked nucleoside triphosphates' refer to nucleoside triphosphates (e.g., dATP, dGTP, dCTP or dTTP) which have an additional group on the 3' end which prevents further addition of nucleotides, i.e., by replacing the 3'-OH group with a protecting group.
It will be understood that references herein to '3'-block', '3'-blocking group' or '3'-protecting group' refer to the group attached to the 3' end of the nucleoside triphosphate which prevents further nucleotide addition. This method uses reversible 3'-blocking groups which can be removed by cleavage to allow the addition of further nucleotides. By contrast, irreversible 3'-blocking groups refer to dNTPs where the 3'-OH group can neither be exposed nor uncovered by cleavage.
The 3'-blocked nucleoside 5'-triphosphate can be blocked by any chemical group that can be unmasked to reveal a 3'-OH. The 3'-blocked nucleoside triphosphate can be blocked by a 3'-0- azidomethyl, 3'-aminooxy, 3'-0-allyl group, 3'-0-cyanoethyl, 3'-0-acetyl, 3'-0-nitrate, 3'-0- phosphate, 3'-0-acetyl levulinic ester, 3'-0-tert butyl dimethyl silane, 3'-0- trimethyl(silyl)ethoxymethyl, 3'-0-ortho-nitrobenzyl, and 3'-0-para-nitrobenzyl.
The 3'-blocked nucleoside 5'-triphosphate can also be blocked by any chemical group that can be directly utilized in chemical ligations, such as copper-catalyzed or copper-free azide-alkyne click reactions and tetrazine-alkene click reactions. The 3'-blocked nucleoside triphosphate can include chemical moieties containing an azide, alkyne, alkene, and tetrazine.
References herein to 'cleaving agent' refer either to a substance which is able to cleave the 3'- blocking group from the 3'-blocked nucleoside triphosphate or to a substance which is able to cleave immobilised oligonucleotides from the solid support. In one embodiment, the cleaving agent is a chemical cleaving agent. In an alternative embodiment, the cleaving agent is an enzymatic cleaving agent.
It will be understood by the person skilled in the art that the selection of cleaving agent is dependent on the type of 3'-nucleoside blocking group used. For instance, tris(2- carboxyethyl)phosphine (TCEP) or tris(hydroxypropyl)phosphine (THPP) can be used to cleave a 3'-0- azidomethyl group, palladium complexes can be used to cleave a 3'-0-allyl group, or sodium nitrite can be used to cleave a 3'-aminoxy group. Therefore, in one embodiment, the cleaving agent is selected from: tris(2- carboxyethyl)phosphine (TCEP), a palladium complex or sodium nitrite.
In one embodiment, the cleaving agent is added in the presence of a cleavage solution comprising a denaturant, such as urea, guanidinium chloride, formamide or betaine. The addition of a denaturant has the advantage of being able to disrupt any undesirable secondary structures in the DNA. In a further embodiment, the cleavage solution comprises one or more buffers. It will be understood by the person skilled in the art that the choice of buffer is dependent on the exact cleavage chemistry and cleaving agent required.
The inventors have previously developed a selection of engineered terminal transferase enzymes, any of which may be used in the current process.
Terminal transferase enzymes are ubiquitous in nature and are present in many species. Many known TdT sequences have been reported in the NCBI database http://www.ncbi.nlm.nih.gov/. The sequences of the various described terminal transferases show some regions of highly conserved sequence, and some regions which are highly diverse between different species.
The inventors have modified the terminal transferase from Lepisosteus oculatus TdT (spotted gar) (shown below). However the corresponding modifications can be introduced into the analagous terminal transferase sequences from any other species, including the sequences listed above in the various NCBI entries.
The amino acid sequence of the spotted gar ( Lepisosteus oculatus) is shown below
SEQ ID 1: wild type spotted Gar TdT
Figure imgf000017_0001
Figure imgf000018_0001
The inventors have identified various regions in the amino acid sequence having improved properties. Certain regions improve the solubility and handling of the enzyme. Certain other regions improve the ability to incorporate nucleotides with modifications at the 3'-position.
Described herein are modified terminal deoxynucleotidyl transferase (TdT) enzymes comprising amino acid modifications when compared to a wild type sequence SEQ ID NO 1 or a truncated version thereof or the homologous amino acid sequence of a terminal deoxynucleotidyl transferase (TdT) enzyme in other species or the homologous amino acid sequence of RoIm, RoIb, RoIl, and RoIq of any species or the homologous amino acid sequence of X family polymerases of any species, wherein the amino acid is modified at one or more of the amino acids:
V32, A33, 134, F35, A53, V68, V71, E97, 1101, M108, G109, A110, Q115, V116, S125, T137, Q143,
M152, E153, N154, H155, N156, Q157, 1158, 1165, N169, N173, S175, E176, G177, P178, C179, L180, A181, F182, M183, R184, A185, L188, H194, A195, 1196, S197, S198, S199, K200, E203, G204, D210,
Q211, T212, K213, A214, 1216, E217, D218, L220, Y222, V228, D230, Q238, T239, L242, L251, K260,
G261, F262, H263, S264, L265, E267, Q269, A270, D271, N272, A273, H275, F276, T277, K278, M279,
Q280, K281, S291, A292, A293, V294, C295, K296, E298, A299, Q300, A301, Q304, 1305, T309, V310,
R311, L312, 1313, A314, 1318, V319, T320, G328, K329, E330, C331, L338, T341, P342, E343, M344, G345, K346, W349, L350, L351, N352, R353, L354, 1355, N356, R357, L358, Q359, N360, Q361, G362,
1363, L364, L365, Y366, Y367, D368, 1369, V370, K376, T377, C381, K383, D388, H389, F390, Q391,
K392, F394, 1397, K398, K400, K401, E402, L403, A404, A405, G406, R407, D411, A421, P422, P423,
V424, D425, N426, F427, A430, R438, F447, A448, R449, H450, E451, R452, K453, M454, L455, L456,
D457, N458, H459, A460, L461, Y462, D463, K464, T465, K466, K467, T474, D477, D485, Y486, 1487, D488, P489.
Modifications which improve the incorporation of modified nucleotides can be at one or more of selected regions shown below. Regions were selected according to mutation data, sequence alignment, and structural data obtained from spotted gar TdT co-crystal I ized with DNA and a 3'- modified dNTP. The second modification can be selected from one or more of the amino acid regions VAIF, MG A, MENHNQI, SEGPCLAFMRA, HAISSS, DQTKA, KGFHS, QADNA, HFTKMQK, SAAVCK, EAQA, TVRLI, GKEC, TPEMGK, YYDIV, DHFQK, LAAG, APPVDNF, FARHERKMLLDNHALYDKTKK, and
DYIDP shown highlighted in the sequence below.
Figure imgf000019_0001
References to particular sequences include truncations thereof. Included herein are modified terminal deoxynucleotidyl transferase (TdT) enzyme comprising at least one amino acid modification when compared to a wild type sequence SEQ ID NO 1 or a truncated version thereof, or the homologous amino acid sequence of a terminal deoxynucleotidyl transferase (TdT) enzyme in other species, wherein the modification is selected from one or more of the amino acid regions WLLNRLINRLQNQGILLYYDIV, VAIF, MG A, MENHNQI, SEGPCLAFMRA, HAISSS, DQTKA, KGFHS,
QADNA, HFTKMQK, SAAVCK, EAQA, TVRLI, GKEC, TPEMGK, DHFQK, LAAG, APPVDNF, FARHERKMLLDNHALYDKTKK, and DYIDP of the sequence of SEQ ID NO 1 or the homologous regions in other species. Truncated proteins may include at least the region shown below (SEQ ID NO 2)
Figure imgf000019_0002
Described herein is a modified terminal deoxynucleotidyl transferase (TdT) enzyme comprising at least the sequence:
Figure imgf000020_0002
or the homologous regions in other species, wherein the sequence has one or more amino acid modifications in one or more of the amino acid regions WLLNRLINRLQNQGILLYYDI, MENHNQI,
SEGPCLAFMRA, HAISSS, DQTKA, KGFHS, QADNA, HFTKMQK, SAAVCK, EAQA, TVRLI, GKEC, TPEMGK, DHFQK, LAAG, APPVDNF, FARHERKMLLDNHALYDKTKK, and DYIDP of the sequence:
Figure imgf000020_0001
Sequence homology extends to all modified or wild-type members of family X polymerases, such as DNA RoIm (also known as DNA polymerase mu or POLM), DNA RoIb (also known as DNA polymerase beta or POLB), and DNA RoIl (also known known as DNA polymerase lambda or POLL). It is well known in the art that all family X member polymerases, of which TdT is a member, either have terminal transferase activity or can be engineered to gain terminal transferase activity akin to terminal deoxynucleotidyl transferase (Biochim Biophys Acta. 2010 May; 1804(5): 1136-1150). For example, when the following human TdT loopl amino acid sequence
...ESTFEKLRLPSRKVDALDHF... was engineered to replace the following human RoIm amino acid residues
...HSCCESPTRLAQQSHMDAF..., the chimeric human RoIm containing human TdT loopl gained robust terminal transferase activity (Nucleic Acids Res. 2006 Sep; 34(16): 4572-4582). Furthermore, it was generally demonstrated in US patent application no. 2019/0078065 that family X polymerases when engineered to contain TdT loopl chimeras could gain robust terminal transferase activity. Additionally, it was demonstrated that TdT could be converted into a template- dependent polymerase through specific mutations in the loopl motif (Nucleic Acids Research, Jun 2009, 37(14):4642-4656). As it has been shown in the art, family X polymerases can be trivially modified to either display template-dependent or template-independent nucleotidyl transferase activities. Therefore, all motifs, regions, and mutations demonstrated in this patent can be trivially extended to modified X family polymerases to enable modified X family polymerases to incorporate 3'-modified nucleotides, reversibly terminated nucleotides, and modified nucleotides in general to effect methods of nucleic acid synthesis.
Modifications which improve the solubility include a modification within the amino acid region WLLNRLINRLQNQGILLYYDIV shown highlighted in the sequence below.
Figure imgf000021_0001
Modifications which improve the incorporation of modified nucleotides can be at one or more of selected regions shown below. The second modification can be selected from one or more of the amino acid regions VAIF, EDN, MG A, ENHNQ, FMRA, HAI, TKA, FHS, QADNA, MQK, SAAVCK, EAQA, TVR, KEC, TPEMGK, DHFQ, LAAG, APPVDN, FARHERKMLLDNHA, and YIDP shown highlighted in the sequence below.
Figure imgf000021_0002
Figure imgf000022_0001
Described herein is a modified terminal deoxynucleotidyl transferase (TdT) enzyme comprising at least one amino acid modification when compared to a wild type sequence or the homologous amino acid sequence of a terminal deoxynucleotidyl transferase (TdT) enzyme in other species, wherein the modification is selected from one or more of the amino acid regions WLLNRLINRLQNQGILLYYDI, VAIF, EDN, MG A, ENHNQ, FMRA, HAI, TKA, FHS, QADNA, MQK, SAAVCK, EAQA, TVR, KEC, TPEMGK, DHFQ LAAG, APPVDN, FARHERKMLLDNHA, and YIDP of the sequence of a wild type sequence or the homologous regions in other species.
Homologous refers to protein sequences between two or more proteins that possess a common evolutionary origin, including proteins from superfamilies in the same species of organism as well as homologous proteins from different species. Such proteins (and their encoding nucleic acids) have sequence homology, as reflected by their sequence similarity, whether in terms of percent identity or by the presence of specific residues or motifs and conserved positions. A variety of protein (and their encoding nucleic acid) sequence alignment tools may be used to determine sequence homology. For example, the Clustal Omega multiple sequence alignment program provided by the European Molecular Biology Laboratory (EMBL) can be used to determine sequence homology or homologous regions.
Improved sequences as described herein can contain both modifications, namely
a. a first modification is within the amino acid region WLLNRLINRLQNQGILLYYDI of the sequence of a wild type sequence or the homologous region in other species; and
b. a second modification is selected from one or more of the amino acid regions VAIF, EDN,
MG A, ENHNQ FMRA, HAI, TKA, FHS, QADNA, MQK, SAAVCK, EAQA, TVR, KEC, TPEMGK, DHFQ LAAG,
APPVDN, FARHERKMLLDNHA, and YIDP of the sequence of a wild type sequence or the homologous regions in other species. The modification within the region WLLNRLINRLQNQGILLYYDIV or the corresponding region from other species help improve the solubility of the enzyme. The modification within the amino acid region WLLNRLINRLQNQGILLYYDIV can be at one or more of the underlined amino acids.
Particular changes can be selected from W-Q N-P, R-K, L-V, R-L, L-W, Q-E, N-K, Q-K or l-L. The sequence WLLNRLINRLQNQGILLYYDIV can be altered to QLLPKVINLWEKKGLLLYYDLV.
The second modification improves incorporation of nucleotides having a modification at the 3' position in comparison to the wild type sequence. The second modification can be selected from one or more of the amino acid regions VAIF, EDN, MG A, ENHNQ, FMRA, HAI, TKA, FHS, QADNA, MQK, SAAVCK, EAQA, TVR, KEC, TPEMGK, DHFQ, LAAG, APPVDN, FARHERKMLLDNHA, and YIDP of the sequence of a wild type sequence or the homologous regions in other species. The second modification can be selected from two or more of the amino acid regions VAIF, EDN, MGA, ENHNQ, FMRA, HAI, TKA, FHS, QADNA, MQK, SAAVCK, EAQA, TVR, KEC, TPEMGK, DHFQ, LAAG, APPVDN, FARHERKMLLDNHA, and YIDP of the sequence of a wild type sequence or the homologous regions in other species shown highlighted in the sequence below.
Figure imgf000023_0001
The identified positions commence at positions V32, E74, M108, F182, T212, D271, M279, E298, A421, L456, Y486. Modifications disclosed herein contain at least one modification at the defined positions. The modified amino acid can be in the region FMRA. The modified amino acid can be in the region QADNA. The modified amino acid can be in the region EAQA. The modified amino acid can be in the region APP. The modified amino acid can be in the region LDNHA. The modified amino acid can be in the region YIDP. The region FARHERKMLLDNHA is advantageous for removing substrate biases in modifications. The FARHERKMLLDNHA region appears highly conserved across species.
The modification selected from one or more of the amino acid regions FMRA, QADNA, EAQA, APP, FARHERKMLLDNHA, and YIDP can be at the underlined amino acid(s). The positions for modification can include A53, V68, V71, D75, E97, 1101, G109, Q115, V116, S125, T137, Q143, N154, H155, Q157, 1158, 1165, G177, L180, A181, M183, A195, K200, T212, K213, A214,
E217, T239, F262, S264, Q269, N272, A273, K281, S291, K296, Q300, T309, R311, E330, T341, E343,
G345, N352, N360, Q361, 1363, Y367, H389, L403, G406, D411, A421, P422, V424, N426, R438, F447, R452, L455, and/or D488.
Amino acid changes include any one of A53G, V68I, V71I, D75N, D75Q, E97A, 1101V, G109E, G109R, Q115E, V116I, V116S, S125R, T137A, Q143P, N154H, H155C, Q157K, Q157R, I158M, 1165V, G177D,
L180V, A181E, M183R, A195P, K200R, T212S, K213S, A214R, E217Q, T239S, F262L, S264T, Q269K, N272K, A273S, A273T, K281R, S291N, K296R, Q300D, T309A, R311W, E330N, T341S, E343Q, G345R,
N352Q, N360K, Q361K, I363L, Y367C, H389A, L403R, G406R, D411N, A421L, A421M, A421V, P422A, P422C, V424Y, N426R, R438K, F447W, R452K, L455I, and/or D488P.
Amino acid changes include any two or more of A53G, V68I, V71I, D75N, D75Q, E97A, 1101V, G109E, G109R, Q115E, V116I, V116S, S125R, T137A, Q143P, N154H, H155C, Q157K, Q157R, I158M, 1165V,
G177D, L180V, A181E, M183R, A195P, K200R, T212S, K213S, A214R, E217Q, T239S, F262L, S264T,
Q269K, N272K, A273S, A273T, K281R, S291N, K296R, Q300D, T309A, R311W, E330N, T341S, E343Q,
G345R, N352Q, N360K, Q361K, I363L, Y367C, H389A, L403R, G406R, D411N, A421L, A421M, A421V, P422A, P422C, V424Y, N426R, R438K, F447W, R452K, L455I, and/or D488P.
The modification of QADNA to KADKA, QADKA, KADNA, QADNS, KADNT, or QADNT is advantageous for the incorporation of 3'-0-modified nucleoside triphosphates to the 3'-end of nucleic acids and removing substrate biases during the incorporation of modified nucleoside triphosphates. The modification of APPVDN to MCPVDN, MPPVDN, ACPVDR, VPPVDN, LPPVDR, ACPYDN, LCPVDN, or MAPVDN is advantageous for the incorporation of 3'-0-modified nucleoside triphosphates to the 3'- end of nucleic acids and removing substrate biases during the incorporation of modified nucleoside triphosphates. The modification of FARHERKMLLDRHA to WARHERKMILDNHA, FARHERKMILDNHA, WARHERKMLLDNHA, FARHERKMLLDRHA, or FARHEKKMLLDNHA is also advantageous for the incorporation of 3'-0-modified nucleoside triphosphates to the 3'-end of nucleic acids and removing substrate biases during the incorporation of modified nucleoside triphosphates.
The modification can be selected from one or more of the following sequences FRRA, QADKA, EADA, MPP, FARHERKMLLDRHA, and YIPP. Included is a modified terminal deoxynucleotidyl transferase (TdT) enzyme wherein the second modification is selected from two or more of the following sequences FRRA, QADKA, EADA, MPP, FARHERKMLLDRHA, and YIPP. Included is a modified terminal deoxynucleotidyl transferase (TdT) enzyme wherein the second modification contains each of the following sequences FRRA, QADKA, EADA, MPP, FARHERKMLLDRHA, and YIPP. Disclosed is a composition for monitoring the quality of template-independent nucleic acid synthesis comprising measuring the amount of incorporated nucleotide monomer on each reaction cycle comprising inorganic pyrophosphatase, a terminal deoxynucleotidyl transferase (TdT), a 3'-0- reversibly blocked dNTP and a fluorescently labelled phosphate binding protein. Disclosed is a composition for monitoring the quality of template-independent nucleic acid synthesis comprising measuring the amount of incorporated nucleotide monomer on each reaction cycle comprising; a terminal deoxynucleotidyl transferase (TdT), a 3'-0-reversibly blocked dNTP and a terpyridine-Zn(ll) complex. EXAMPLES
Example 1: Coupling of pyrophosphatase and phosphate-binding protein labelled to a fluorophore to monitor and quantitate reversibly terminated nucleotide incorporation by engineered TdTs.
PBP-MDCC (Thermo Fisher Scientific) was tested as a means to monitor non-templated enzymatic DNA synthesis. PBP-MDCC (0.5 mM), engineered TdT (0.04 mg/ml), pyrophosphatase (0.01 mg/m!), dATP-ONH2 (0.25 mM), fluorescently labelled oligonucleotide initiator (purchased from IDT; SEQ IDS 662 and 663 in Table 1; 5 mM) (Seq IDs 3 and 4), and required buffer components were mixed together and incubated at 37 °C for 10 minutes. Reactions were monitored at an excitation and emission wavelength of 430 and 460 nm, respectively, and measured 60 s after reaction initiation.
Table 1:
Figure imgf000025_0001
All reactions were phosphate mopped to remove any pre-existing phosphate in the solutions prior to initiation unless otherwise stated. Phosphate mopping refers to incubation of solutions with purine nucleoside phosphorylase (Sigma-Aldrich; from microbial sources). PNPase was used at 1 unit/ml and 7-methylguanosine (MEG) was used at 200 mM final concentration (37 °C for 10 minutes).
Briefly, "Spike Mop" in the figure above only contains buffer components and 20 mM phosphate, which were subsequently phosphate mopped. "GA(ddC)" 1 and 2 contain oligonucleotide SEQ NO 662 from Table 1 whereas "GAC" 1 and 2 contains oligonucleotide SEQ NO 663 from Table 1. GA(ddC) and GAC are fully constituted mixtures with all components necessary for reversibly terminated nucleotide addition. "No mop" contains GA(ddC) reaction components, but no phosphate mop was performed previous to reaction initiation. "Spike" contains only buffer components and 20 mM phosphate.
As can be seen in Figure 2, by comparing "Spike Mop" and "Spike" samples, the phosphate mop successfully consumed 20 mM phosphate. If samples are not mopped up ("No mop"), then the background fluorescence from the reaction sample is very high. As compared with "GA(ddC)" (3'- ddC; negative control as 3'-dideoxy terminus precludes addition), the "GAC" reaction is nearly 2.5- fold higher in fluorescence at 60s. This experiment plainly shows that it is feasible to use PBP-MDCC to detect successful addition of a reversibly terminated nucleotide to the 3‘-end of an oligonucleotide.
Example 2: Pyrophosphate detection as a method to monitor and quantitate reversibly terminated nucleotide incorporation by engineered TdTs.
Engineered TdT (only present if indicated; 0.04 mg/ml), pyrophosphatase (only present if indicated; 0.01 mg/ml), dTTP-ONH2 (only present if indicated; 0.25 mM), immobilized oligonucleotide initiator (only present if indicated; 1 pmol), and required buffer components were mixed together and incubated at 37 °C for the indicated amount of time.
Samples were first removed from immobilized oligonucleotides. Pyrophosphate was then detected by incubation of samples with ATP sulfurylase and adenosine phosphosulfate at room temperature for 30 min (Lonza Bioscience). Luciferase and luciferin were then incubated with samples at room temperature for 10 min (Lonza Bioscience). Luminescence was detected by imaging on a Typhoon Trio (Amersham Biosciences). Reaction conditions were (1) buffer + pyrophosphate + immobilized oligonucleotide (0.1 mM); (2) buffer + pyrophosphate + immobilized oligonucleotide (0.5 mM); (3) buffer + pyrophosphate + immobilized oligonucleotide (1.0 mM); (4) buffer + engineered TdT + dTTP-ONH2 + immobilized oligonucleotide (0 min); (5) buffer + engineered TdT + dTTP-ONH2 + immobilized oligonucleotide (10 min); (6) buffer + dTTP-ONH2 + immobilized oligonucleotide (10 min); and (7) buffer + dTTP-ONH2 + immobilized oligonucleotide + pyrophosphatase (10 min).
As can be seen in Figure 3, by comparing reaction conditions 4 and 5, nucleotide incorporation over time can be monitored by the ATP sulfurylase - !uciferase bioluminescence regenerative cycle. If pyrophosphatase is included in the reaction mixture, then no luminescence is detected, confirming that the assay is specific for pyrophosphate.

Claims

Claims
1. A method for monitoring the quality of template-independent enzymatic nucleic acid synthesis comprising measuring the amount of incorporated nucleotide monomer after one or more of the reaction cycles.
2. The method according to claim 1, wherein said measuring detects changes resulting from nucleotide incorporation, wherein the changes are selected from: pH changes, temperature changes, phosphate concentration changes and/or pyrophosphate concentration changes.
3. The method according to claim 2, wherein the nucleotide monomers are reversibly blocked nucleoside triphosphates.
4. The method according to claim 3, wherein the nucleotide monomers are reversibly 3'- blocked nucleoside triphosphates.
5. The method according to claim 4, wherein the 3'-reversible block is selected from 3'-0- CH2N3, 3'-0-CH2CHCH2, 3'-0-CH2CH2CN or 3'-0-NH2.
6. The method according to claim 2, wherein pH changes are detected using an ion- sensitive field-effect transistor (ISFET).
7. The method according to any one of claims 1 to 5, wherein the released pyrophosphate is detected directly in-situ.
8. The method according to claim 7, wherein the pyrophosphate is detected by association with a chelated Zn(ll) complex.
9. The method according to claim 7, wherein the pyrophosphate is detected by bioluminescence regenerative cycle comprising catalysis with ATP sulfuryase, luciferase, and apyrase or a combination thereof.
10. The method according to any of claims 1 to 5, wherein the pyrophosphate is hydrolysed to phosphate and the formation of phosphate is detected.
11. The method according to claim 10, wherein the complex formation of phosphate with ammonium molybdate is detected.
12. The method according to claim 11, wherein the complex formation of phosphate with Malachite Green and ammonium molybdate is detected.
13. The method according to claim 10, wherein the presence of phosphate is detected based on reaction of phosphate with maltose, in the presence of an enzyme, to produce glucose, wherein the glucose is then specifically oxidized to generate a product that reacts with a probe to generate fluorescence.
14. The method of claim 13, wherein the enzyme is maltose phosphorylase which converts maltose, in the presence of phosphate, to glucose-l-phosphate and glucose, wherein glucose oxidase converts the glucose to gluconolactone and H2O2 which, in presence of horseradish peroxidase (HRP), reacts with Amplex Red reagent to generate resorufin.
15. The method according to claim 10, wherein the phosphate is detected using a fluorescently labelled phosphate binding protein.
16. The method of claim 10, wherein the phosphate output each time a new species of nucleoside 5'-triphosphate is added is monitored to assess the quality of nucleic acid synthesis.
17. A method for monitoring the quality of template-independent enzymatic nucleic acid synthesis comprising:
a) providing an immobilized nucleic acid initiator sequence;
b) adding a blocked nucleoside triphosphate to said initiator sequence through the exposure of the immobilized nucleic acid initiator sequence to a reaction solution comprising:
• a blocked nucleoside triphosphate(s);
• a nucleic acid transferase or template independent polymerase, such as an engineered terminal deoxynucleotidyl transferase (TdT);
• required buffer components;
• and phosphate or pyrophosphate-sensing reagents;
c) quantification of quantity of phosphate or pyrophosphate generated therefrom during step (b);
d) removing all reagents from the initiator sequence;
e) cleaving the blocking group from the blocked nucleoside in the presence of a cleaving agent; and
f) removing the cleaving agent by washing the immobilize initiator nucleic acid with wash solution. wherein steps (b) - (f) can be repeated to add greater than one nucleotide to the above nucleic acid initiator sequence.
18. A method for monitoring the quality of template-independent enzymatic nucleic acid synthesis comprising:
a) providing an immobilized nucleic acid initiator sequence; b) adding a blocked nucleoside triphosphate to said initiator sequence through the exposure of the immobilized nucleic acid initiator sequence to a reaction solution comprising:
• a blocked nucleoside triphosphate;
• a nucleic acid transferase or template independent polymerase, such as an engineered terminal deoxynucleotidyl transferase (TdT);
• required buffer components;
c) removing the reaction solution in (b) from the immobilized nucleic acid initiator;
d) introducing reagents for the quantification of pyrophosphate or phosphate generated therefrom during (b) to the removed reaction solution of (c);
e) quantification of pyrophosphate or phosphate in the removed reaction solution of (c): f) washing of the immobilized initiator nucleic acid with wash solution;
g) cleaving the blocking group from the blocked nucleoside in the presence of a cleaving agent; and
h) removing the cleaving agent by washing the immobilize initiator nucleic acid with wash solution. wherein steps (b) - (h) can be repeated to add greater than one nucleotide to the above nucleic acid initiator sequence.
19. The method of claim 17 or claim 18, wherein the reagents for the quantification of pyrophosphate are selected from:
(a) a chelated Zn(ll) complex; or
(b) ATP sulfuryase, luciferase, and apyrase, or a combination thereof.
20. The method of claim 17 or claim 18, wherein the reagents wherein the reaction contains a pyrophosphatase and reagents for the quantification of phosphate selected from:
(a) ammonium molybdate;
(b) ammonium molybdate and Malachite Green;
(c) maltose phosphorylase, glucose oxidase, horseradish peroxidase and Amplex Red; or
(d) a fluorescently labelled phosphate binding protein.
21. The method of claim 20, wherein the nucleoside block is selected from 3'-0-CH2N3, 3'-0-
CH2CHCH2, 3'-0-CH2CH2CN or 3'-0-NH2.
22. The method of any one preceding claim wherein the monitoring is performed on each of the reaction cycles.
23. A composition for monitoring the quality of template-independent nucleic acid synthesis comprising measuring the amount of incorporated nucleotide monomer on each reaction cycle comprising a terminal deoxynucleotidyl transferase (TdT), a 3'-0-reversibly blocked dNTP and reagents for the quantification of pyrophosphate or phosphate released as a result of nucleotide monomer incorporation.
24. The composition of claim 23, wherein the reagents for the quantification of pyrophosphate comprise a chelated Zn(ll) complex.
25. The composition of claim 23, wherein the reagents for the quantification of pyrophosphate comprise ATP sulfuryase, luciferase, and apyrase or a combination thereof.
26. The composition of claim 23, wherein the composition contains pyrophosphatase and reagents for the quantification of phosphate selected from; a. ammonium molybdate, with or without Malachite Green;
b. maltose phosphorylase, glucose oxidase, horseradish peroxidase and
Amplex Red; or
c. a fluorescently labelled phosphate binding protein.
PCT/GB2020/051250 2019-05-22 2020-05-22 Method of quality control of oligonucleotide synthesis WO2020234605A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN202080038030.5A CN114008215A (en) 2019-05-22 2020-05-22 Quality control method for oligonucleotide synthesis
EP20728173.4A EP3973075A1 (en) 2019-05-22 2020-05-22 Method of quality control of oligonucleotide synthesis

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
GBGB1907209.9A GB201907209D0 (en) 2019-05-22 2019-05-22 Method of quality control of oligonucleotide synthesis
GB1907209.9 2019-05-22

Publications (1)

Publication Number Publication Date
WO2020234605A1 true WO2020234605A1 (en) 2020-11-26

Family

ID=67002053

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/GB2020/051250 WO2020234605A1 (en) 2019-05-22 2020-05-22 Method of quality control of oligonucleotide synthesis

Country Status (4)

Country Link
EP (1) EP3973075A1 (en)
CN (1) CN114008215A (en)
GB (1) GB201907209D0 (en)
WO (1) WO2020234605A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2023152354A1 (en) * 2022-02-14 2023-08-17 Miltenyi Biotec B.V. & Co. KG Direct synthesis of oligonucleotides on microtomed tissue slices

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014179596A1 (en) * 2013-05-01 2014-11-06 Advanced Liquid Logic, Inc. Analysis of dna
WO2015175832A1 (en) * 2014-05-16 2015-11-19 Illumina, Inc. Nucleic acid synthesis techniques
WO2016128731A1 (en) 2015-02-10 2016-08-18 Nuclera Nucleics Ltd Novel use
US20190078065A1 (en) 2017-09-08 2019-03-14 Sigma-Aldrich Co. Llc Modified dna polymerases

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017176541A1 (en) * 2016-04-04 2017-10-12 President And Fellows Of Harvard College Enzymatic nucleic acid synthesis

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014179596A1 (en) * 2013-05-01 2014-11-06 Advanced Liquid Logic, Inc. Analysis of dna
WO2015175832A1 (en) * 2014-05-16 2015-11-19 Illumina, Inc. Nucleic acid synthesis techniques
WO2016128731A1 (en) 2015-02-10 2016-08-18 Nuclera Nucleics Ltd Novel use
US20190078065A1 (en) 2017-09-08 2019-03-14 Sigma-Aldrich Co. Llc Modified dna polymerases

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
BIOCHIM BIOPHYS ACTA, vol. 1804, no. 5, May 2010 (2010-05-01), pages 1136 - 1150
NUCLEIC ACIDS RES., vol. 34, no. 16, September 2006 (2006-09-01), pages 4572 - 4582
NUCLEIC ACIDS RESEARCH, vol. 37, no. 14, June 2009 (2009-06-01), pages 4642 - 4656
SEBASTIAN PALLUK ET AL: "De novo DNA synthesis using polymerase-nucleotide conjugates", NATURE BIOTECHNOLOGY, vol. 36, no. 7, 18 June 2018 (2018-06-18), New York, pages 645 - 650, XP055529953, ISSN: 1087-0156, DOI: 10.1038/nbt.4173 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2023152354A1 (en) * 2022-02-14 2023-08-17 Miltenyi Biotec B.V. & Co. KG Direct synthesis of oligonucleotides on microtomed tissue slices

Also Published As

Publication number Publication date
CN114008215A (en) 2022-02-01
EP3973075A1 (en) 2022-03-30
GB201907209D0 (en) 2019-07-03

Similar Documents

Publication Publication Date Title
US10808244B2 (en) Method of normalizing biological samples
US20210261998A1 (en) Compositions and methods related to nucleic acid preparation
JP2022122950A (en) Novel use
US7777013B2 (en) Labeled nucleotide analogs and uses therefor
US20180201968A1 (en) Azidomethyl Ether Deprotection Method
AU2020217977A1 (en) Modified terminal deoxynucleotidyl transferase (TdT) enzymes
JP2020521508A (en) Use of terminal transferase enzymes in nucleic acid synthesis
EP3935187B1 (en) Method of oligonucleotide synthesis
CA3151858A1 (en) Method for sequencing polynucleotides based on optical signal kynetics of luminescent labels and secondary luminescent signals
EP3973075A1 (en) Method of quality control of oligonucleotide synthesis
Danhel et al. Voltammetric analysis of 5-(4-Azidophenyl)-2′-deoxycytidine nucleoside and azidophenyl-labelled single-and double-stranded DNAs
US20230175030A1 (en) Nucleic acid polymer with amine-masked bases
CN116249783A (en) Controlled and template independent nucleic acid synthesis using thermostable enzymes
CN115427559A (en) Terminal deoxynucleotidyl transferase variants and uses thereof
WO2024042217A1 (en) Methods of modifying methylcytosine or derivative thereof using a nucleophilic molecule, and methods of using the same to detect the methylcytosine or derivative thereof in a polynucleotide
Medžiūnė The synthesis and applications of oligonucleotide-modified nucleotides
WO2022197942A9 (en) Phase protective reagent flow ordering
WO2022029427A1 (en) MODIFIED TERMINAL DEOXYNUCLEOTIDYL TRANSFERASE (TdT) ENZYMES

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20728173

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 2020728173

Country of ref document: EP

Effective date: 20211222