WO2020072715A1 - Compositions et procédés comprenant des mutants de désoxynucléotidyle transférase terminale - Google Patents

Compositions et procédés comprenant des mutants de désoxynucléotidyle transférase terminale

Info

Publication number
WO2020072715A1
WO2020072715A1 PCT/US2019/054398 US2019054398W WO2020072715A1 WO 2020072715 A1 WO2020072715 A1 WO 2020072715A1 US 2019054398 W US2019054398 W US 2019054398W WO 2020072715 A1 WO2020072715 A1 WO 2020072715A1
Authority
WO
WIPO (PCT)
Prior art keywords
tdt
modified
polypeptide
enzyme
amino acid
Prior art date
Application number
PCT/US2019/054398
Other languages
English (en)
Inventor
George M. Church
Nicholas J. CONWAY
Richard E. KOHMAN
Erkin KURU
Jonathan RITTICHIER
Daniel Jordan WIEGAND
Original Assignee
President And Fellows Of Harvard College
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by President And Fellows Of Harvard College filed Critical President And Fellows Of Harvard College
Publication of WO2020072715A1 publication Critical patent/WO2020072715A1/fr

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/12Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
    • C12N9/1241Nucleotidyltransferases (2.7.7)
    • C12N9/1264DNA nucleotidylexotransferase (2.7.7.31), i.e. terminal nucleotidyl transferase
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P19/00Preparation of compounds containing saccharide radicals
    • C12P19/26Preparation of nitrogen-containing carbohydrates
    • C12P19/28N-glycosides
    • C12P19/30Nucleotides
    • C12P19/34Polynucleotides, e.g. nucleic acids, oligoribonucleotides
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y207/00Transferases transferring phosphorus-containing groups (2.7)
    • C12Y207/07Nucleotidyltransferases (2.7.7)
    • C12Y207/07031DNA nucleotidylexotransferase (2.7.7.31), i.e. terminal deoxynucleotidyl transferase

Definitions

  • TdT modified terminal deoxynucleotidyl transferase
  • Terminal deoxynucleotidyl transferase is a very useful template-independent DNA polymerase for major biotechnological applications such as the storage of digital information and c/e novo oligonucleotide synthesis.
  • TdT has the unique ability to rapidly catalyze the synthesis of long DNA oligonucleotides in the presence of only a small initiator sequence, cofactors and nucleoside triphosphate monomers.
  • a modified TdT polypeptide sequence can modify the function, reaction catalysis and substrate binding of the TdT polypeptide.
  • certain amino acid mutations can alter the cofactor preference for the TdT polypeptide, such that a modified TdT may be more efficient in the presence of Mn2+ or Mg2+ than in the presence of the endogenously preferred Co2+ cofactor.
  • a modified TdT polypeptide can comprise different temperature or pH sensitivities, altered rates of DNA synthesis, and the ability to incorporate non-natural nucleotides.
  • a modified TdT enzyme comprises a reduced substrate bias towards a preferred initiator sequence or nucleoside triphosphate base (A, G, C, or T) compared to an unmodified TdT enzyme from which it is derived.
  • TdT terminal deoxynucleotidyl transferase
  • enzyme i.e., enzyme
  • the modified TdT polypeptide comprises a sequence having at least one amino acid mutation and retains at least 10% of the template-independent DNA polymerase activity of the TdT polypeptide from which the modified TdT is derived.
  • the template-independent DNA polymerase activity of the modified TdT polypeptide and the wild-type TdT polypeptide from which it is derived is measured using the same enzymatic reaction conditions (e.g., co-factor and co-factor concentration, temperature, time, pH, nucleotide(s) and nucleotide(s) concentration etc.).
  • the template-independent polymerase activity of a given modified TdT polypeptide is assessed under reaction conditions that are different from the reaction conditions that are used to assess the activity of the unmodified TdT polypeptide.
  • modifications to a TdT polypeptide can alter the co-factor preference or degree of nucleotide bias, thus it may be desirable to compare the activity of the modified TdT polypeptide under conditions preferred by the modified enzyme to the activity of the wild-type enzyme under conditions preferred by the wild-type enzyme. This is because under the preferred conditions of the modified TdT polypeptide, the wild-type TdT may have substantially reduced activity.
  • the modified TdT comprises at least 50% of the template-independent DNA polymerase activity of the TdT polypeptide from which it is derived.
  • the protein sequence of the modified TdT comprises at least two amino acid mutations compared to the TdT polypeptide from which it is derived.
  • the TdT polypeptide from which the modified TdT is derived is a human TdT polypeptide.
  • the human TdT polypeptide comprises SEQ ID NO: 1.
  • the modified TdT comprises a cofactor preference that is different from the cofactor preference of the TdT polypeptide from which it is derived.
  • the modified TdT comprises a reduced degree of bias compared to the degree of bias of the TdT polypeptide from which it is derived under substantially similar enzyme assay conditions.
  • the modified TdT comprises at least one amino acid mutation at an amino acid residue selected from the group consisting of: S279, G341, H342, D343, V344, D345, A396, A429, 1430, R431, V432, D433, R442, F444, R453, Q454, and L459.
  • the modified TdT comprises a mutation at R453 and at least one additional mutation at an amino acid residue selected from the group consisting of: S279, G341, H342, D343, V344, D345, A396, A429, 1430, R431, V432, D433, R442, F444, R453, Q454, and L459.
  • the mutation at R453 is R453A.
  • the at least one additional mutation occurs at amino acid residue V432.
  • the mutation at amino acid residue V432 is V432G.
  • the modified TdT polypeptide comprises a sequence selected from those listed in any one of Tables 3-8.
  • the modified TdT polypeptide is a variant comprising the mutations R453A-V432G.
  • Another aspect provided herein relates to a method for generating a polynucleotide sequence c/e novo or in vitro, the method comprising; incubating a modified TdT enzyme as described herein in the presence of an initiator sequence, a cofactor and at least one nucleoside triphosphate under conditions and for a time sufficient to add at least one nucleotide to the 3’ end of a polynucleotide strand.
  • the modified TdT enzyme or the initiator sequence are conjugated to a solid support.
  • the solid support comprises a bead, a membrane, or a column.
  • the cofactor is a divalent cation.
  • the divalent cation is Co2+, Mn2+, Mg2+ or Zn2+.
  • the modified TdT enzyme of claim 1 is incubated in the presence of 2, 3, or 4 nucleoside triphosphates.
  • the method further comprises a second step of incubating a modified TdT enzyme of claim 1 in the presence of an initiator sequence, a cofactor and at least one different nucleoside triphosphate under conditions and for a time sufficient to add at least one nucleotide to the 3’ end of a polynucleotide strand.
  • the step is repeated once or twice each in the presence of at least one different nucleotide.
  • Another aspect provided herein relates to a nucleic acid molecule encoding any one of the modified TdT enzymes described herein (e.g., encoding any one of the polypeptide sequences listed in Tables 3-8).
  • the nucleic acid encodes a modified TdT polypeptide variant comprising the mutations R453A-V432G.
  • Another aspect provided herein relates to a vector comprising a nucleic acid molecule encoding any one of the modified TdT enzymes described herein (e.g., encoding any one of the polypeptide sequences listed in Tables 3-8).
  • the vector comprises a nucleic acid that encodes a modified TdT polypeptide variant comprising the mutations R453A-V432G.
  • Another aspect provided herein relates to a cell comprising the modified TdT polypeptide, a nucleic acid molecule encoding a modified TdT polypeptide, and/or the vector comprising such a nucleic acid molecule as described herein.
  • the cell is a bacterial cell.
  • a solid support comprising a modified TdT enzyme as described herein (e.g., any one of the protein sequences listed in Tables 3-8).
  • the solid support comprises a modified TdT polypeptide variant comprising the mutations R453A-V432G.
  • FIG. 1 View of catalytic pocket of murine TdT with large arginine at residue position 453 protruding near where nucleotide binds.
  • FIGs. 2A-2B Heat-map of single-codon mutant variants at amino acid residue R453.
  • FIG. 2B 6% TBE-Urea denaturing gel electrophoresis analysis of human TdT R453H DNA oligonucleotide synthesis reactions compared to control, wild-type human TdT. 200-nt ssDNA ladder was used to determine the size of the produced DNA oligonucleotide. Gels were stained with lx GelStar Nucleic Acid Stain.
  • FIG. 3 Heat-map of double-codon mutants variants carrying the constant amino acid change R453A.
  • RFU values for each cofactor evaluated were normalized by the total protein concentration of the human TdT mutant variants as determined by a reducing agent microBCA assay. The final concentration of cofactor was 0.25 mM for all reactions.
  • FIGs. 4A-4B Heat-maps of double-codon mutants variants carrying the constant amino acid change R453A.
  • RFU values for each natural nucleotide evaluated were normalized by the total protein concentration of the human TdT mutant variants as determined by a reducing agent microBCA assay. The final concentration for each nucleotide was 1 mM and the cofactor was 0.25 mM Mn2+.
  • FIG. 5 6% TBE-Urea denaturing gel electrophoresis analysis of natural nucleotide incorporate by wild-type human TdT (WT-hTDT) compared to double-codon mutant variant human TdT R453A- V432G. Nucleotide concentration was 1 mM and the initiator oligonucleotide sequence was a Poly-T-l5- mer at 10 pmol per reaction. Wild-type human TdT was supplemented with 0.25 mM Co2+ cofactor and the double mutant variant was supplemented with 0.25 mM Mn2+ cofactor.
  • FIGs. 6A-6B 15% TBE-Urea denaturing gel electrophoresis analysis of natural nucleotide incorporation by single-codon mutant R453A human TdT compared to double codon-mutant R453A- V432G with varying DNA oligonucleotide initiator sequences.
  • FIG. 6A indicates reactions supplemented with 10 pmol of Poly-T-l5-mer
  • FIG. 6B indicates reactions supplemented with 10 pmol of Poly-A- l5-mer. Both the mutant variants were supplemented with 0.25 mM Mn2+ cofactor.
  • FIG. 7. 15% TBE-Urea denaturing gel electrophoresis analysis of natural ribonucleotide incorporation by single-codon mutant R453A human TdT compared to double codon-mutant R453A- V432G. Reactions were supplemented with 1 mM of each ribonucleotide, 0.25 mM Mn2+, and 10 pmol of DNA oligonucleotide initiator sequence poly-T-l5-mer
  • TdT polypeptides having at least one amino acid mutation at a desired residue but retaining at least 25% of the template -independent DNA polymerase activity of the unmodified TdT polypeptide.
  • TdT variants are contemplated for use in the generation of nucleic acid sequences for the storage of digital information, c/e novo oligonucleotide synthesis, or the like.
  • template-independent DNA polymerase activity refers to the ability of a TdT polypeptide, variant or mutant to add at least one nucleotide to a growing polynucleotide strand in the absence of a template DNA strand.
  • the term "substantially retains TdT activity” means that a variant or modified TdT polypeptide will retain at least 10% of the template-independent DNA polymerase activity (as assessed by measuring in vitro TdT enzyme activity) of the polypeptide or peptide from which it is derived (e.g., wildtype human TdT).
  • the activity of the derivative and the activity of wild-type TdT are assessed under substantially similar conditions, for example, in the presence of the same cofactor (e.g., Co 2+ ).
  • the activity of the derivative can be determined under different conditions (e.g., in the presence of an alternative co-factor, such as Mn2 + , Zn 2+ or Mg 2+ , and compared to the activity of the wild-type TdT enzyme determined under conditions preferred under native conditions (e.g., in the presence of Co 2+ ).
  • the derivative will retain at least 25%, at least 30% at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 99% or even 100% of the template -independent DNA polymerase activity of the peptide/polypeptide from which it is derived.
  • the term“cofactor preference” refers to the cofactor that permits the highest enzymatic activity of a given TdT variant in the same assay conditions and using the same concentration of cofactor (e.g., 0.25mM).
  • the cofactor preference is expressed in descending order, such as the cofactor preference for endogenous wild-type TdT which is expressed as Co2+ > Mg2+, Mn2+.
  • the term "increased activity" refers to an increase in template-independent DNA polymerase activity of a derivative compared to that of the parent peptide/polypeptide, for example, the derivative can have at least a 2-fold increase, at least 5 -fold, at least 10-fold, at least 20-fold, at least 50-fold, at least 100- fold, at least 1000-fold or more increase in template-independent DNA polymerase activity compared to the parent peptide/polypeptide from which it is derived.
  • the terms“derivative,” “variant,” or“mutant” as used herein refers to a polypeptide or nucleic acid that comprises at least one mutation but is "substantially similar” to a wild-type human TdT polypeptide.
  • a molecule is said to be “substantially similar” to another molecule if both molecules have substantially similar structures (i.e., they are at least 50% similar in amino acid sequence as determined by BLASTp alignment set at default parameters) and are substantially similar in at least one relevant function (e.g., template-independent DNA polymerase activity).
  • a variant differs from the naturally occurring polypeptide or nucleic acid by one or more amino acid or nucleic acid deletions, additions, substitutions or side-chain modifications, yet retains one or more specific functions or biological activities of the naturally occurring molecule.
  • Amino acid substitutions include alterations in which an amino acid is replaced with a different naturally-occurring or a non-conventional amino acid residue. Some substitutions can be classified as“conservative,” in which case an amino acid residue contained in a polypeptide is replaced with another naturally occurring amino acid of similar character either in relation to polarity, side chain functionality or size.
  • substitutions encompassed by variants as described herein can also be“non-conservative,” in which an amino acid residue which is present in a peptide is substituted with an amino acid having different properties (e.g., substituting a charged or hydrophobic amino acid with an uncharged or hydrophilic amino acid), or alternatively, in which a naturally-occurring amino acid is substituted with a non-conventional amino acid.
  • variants when used with reference to a polynucleotide or polypeptide, are variations in primary, secondary, or tertiary structure, as compared to a reference polynucleotide or polypeptide, respectively (e.g., as compared to a wild- type polynucleotide or polypeptide). Polynucleotide changes can result in amino acid substitutions, additions, deletions, fusions and truncations in the polypeptide encoded by the reference sequence.
  • Variants can also include insertions, deletions or substitutions of amino acids, including insertions and substitutions of amino acids and other molecules) that do not normally occur in the peptide sequence that is the basis of the variant, including but not limited to insertion of ornithine which does not normally occur in human proteins.
  • statically significant or“significantly” refer to statistical significance and generally mean a two standard deviation (2SD) or greater difference relative to a reference value.
  • “decrease”,“reduced”,“reduction”, or“inhibit” are all used herein to mean a decrease by a statistically significant amount.
  • “reduce,”“reduction” or“decrease” or“inhibit” typically means a decrease by at least 10% as compared to a reference level (e.g.
  • a decrease can be preferably down to a level accepted as within the range of normal for an individual without a given disorder.
  • the terms“increased”,“increase”,“enhance”, or“activate” are all used herein to mean an increase by a statically significant amount.
  • the terms“increased”,“increase”, “enhance”, or“activate” can mean an increase of at least 10% as compared to a reference level, for example an increase of at least about 20%, or at least about 30%, or at least about 40%, or at least about 50%, or at least about 60%, or at least about 70%, or at least about 80%, or at least about 90% or up to and including a 100% increase or any increase between 10-100% as compared to a reference level, or at least about a 2- fold, or at least about a 3 -fold, or at least about a 4-fold, or at least about a 5 -fold or at least about a 10-fold increase, or any increase between 2-fold and 10-fold or greater as compared to a reference level.
  • an“increase” is a statistically significant increase in such level.
  • oligonucleotide encompass double- or triple-stranded nucleic acids, as well as single-stranded molecules.
  • nucleic acid strands need not be coextensive (i.e., a double-stranded nucleic acid need not be double-stranded along the entire length of both strands).
  • Nucleic acid sequences, when provided, are listed in the 5' to 3' direction, unless stated otherwise. Methods described herein provide for the generation of isolated nucleic acids. Methods described herein additionally provide for the generation of isolated and purified nucleic acids.
  • An“oligonucleotide,”“polynucleotide,” and“nucleic acid” as referred to herein can comprise at least 5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, 475, 500, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700, 1800, 1900, 2000, or more bases in length.
  • compositions, methods, and respective component(s) thereof are used in reference to compositions, methods, and respective component(s) thereof, that are essential to the method or composition, yet open to the inclusion of unspecified elements, whether essential or not.
  • compositions, methods, and respective components thereof as described herein, which are exclusive of any element not recited in that description of the embodiment.
  • the term "consisting essentially of” refers to those elements required for a given embodiment. The term permits the presence of elements that do not materially affect the basic and novel or functional characteristic(s) of that embodiment.
  • TdT Terminal deoxynucleotidyl Transferase
  • Terminal deoxynucleotidyl transferase is a template-independent DNA polymerase that catalyzes the addition of nucleotides to the 3’ terminus of a DNA molecule (e.g., a single stranded DNA strand).
  • TdT plays a role in introducing minor changes into the genetic material by randomly adding nucleotides to single-stranded DNA during recombination.
  • TdT activity is important in adaptation of the vertebrate immune system by increasing antigen receptor diversity.
  • There are two known isoforms of TdT (i) a short form having 509 amino acids (TdTS), and (ii) a long form having 529 amino acids (TdTL).
  • TdTS and TdTL comprise the domains necessary to bind nucleotides, DNA, and metal ion cofactors.
  • the derivatives or mutants of TdT described herein can be derived from either TdT isoform.
  • Two functionally independent human TdT regions have been identified: breast cancer susceptibility protein BRCA1 C-terminal (BRCT) domain at the N-terminus and the polymerase-like domain at the C-terminus.
  • BRCT domain of TdT is involved in protein-protein and protein-DNA interactions during DNA repair and cell cycle checkpoint pathways.
  • the pol— like domain is the catalytic core of the enzyme and contains the active site of the phosphoryl transfer reaction.
  • NLS nuclear localization signal
  • the protein domain structure and crystal structure of TdT is known to those of skill in the art and is not described in further detail herein.
  • TdT is unique in its ability to use a variety of other divalent cations such as Mn2+, Zn2+ and Mg2+.
  • the extension rate in vitro with dATP in the presence of divalent metal ions is ranked in the following order: Mg2+ > Zn2+ > Co2+ > Mn2+.
  • each metal ion has different effects on the kinetics of nucleotide incorporation.
  • Mg2+ facilitates the preferential utilization of dGTP and dATP whereas Co2+ increases the catalytic polymerization efficiency of the pyrimidines, dCTP and dTTP.
  • Zn2+ behaves as a unique positive effector for TdT since reaction rates with Mg2+ are stimulated by the addition of micromolar quantities of Zn2+. This enhancement may reflect the ability of Zn2+ to induce conformational changes in TdT that yields higher catalytic efficiencies. Polymerization rates are lower in the presence of Mn2+ compared to Mg2+, suggesting that Mn2+ does not support the reaction as efficiently as Mg2+. Further description of TdT is provided in Biochim Biophys Acta., May 2010; 1804(5): 1151— 1166 hereby incorporated by reference in its entirety.
  • Table 1 Exemplary unmodified polypeptide sequences of TdT in different species
  • the modified TdT polypeptides can be derived from any one of SEQ ID NOs: 1-9.
  • the modified TdT polypeptide is derived from a human polypeptide sequence, for example, SEQ ID NO: 1.
  • TdT e.g., human TdT
  • variants of TdT that comprise at least one amino acid mutation compared to the TdT protein from which they are derived and retain at least 10% of the functional template -independent DNA polymerase activity of the unmodified TdT (e.g., using an enzymatic TdT test as described herein).
  • the variants of TdT retain at least 15%, at least 20%, at least 25%, at least 30%, at least 35%, at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 92%, at least 95%, at least 98%, at least 99% of the template-independent DNA polymerase activity of the unmodified TdT from which the variant is derived.
  • the TdT variant comprises a template-independent DNA polymerase activity that is substantially similar to the activity of the TdT from which it is derived.
  • the term“substantially similar” refers to an activity of a TdT variant that comprises an activity that is not statistically significant (i.e., p ⁇ 0.05) as compared to the activity from unmodified TdT protein from which the variant is derived (e.g., as assessed using a TdT enzyme assay as described herein).
  • TdT variant has more than 100% of the activity of a wild-type or native polypeptide, e.g., 110%, 125%, 150%, 175%, 200%, 500%, 1000% or more.
  • Variant TdTs can comprise at least 2, at least 3, at least 4, at least 5 amino acid mutations or more (e.g., 6, 7, 8, 9, 10 etc.) as compared to the TdT enzyme from which they are derived.
  • the variant of TdT comprises a“single -codon” mutation (i.e., a single amino acid mutation).
  • the TdT variant comprises a“double-codon” (i.e., two amino acid substitutions) or “triple-codon” mutation (i.e., three amino acid substitutions).
  • the amino acid substitutions can comprise a conservative amino acid substitution.
  • conservative amino acid substitutions is well known in the art, which relates to substitution of a particular amino acid by one having a similar characteristic (e.g., similar charge or hydrophobicity, similar bulkiness). Examples include aspartic acid for glutamic acid, or isoleucine for leucine. A list of exemplary conservative amino acid substitutions is given in the table below.
  • a conservative substitution mutant or variant will 1) have only conservative amino acid substitutions relative to the parent sequence, 2) will have at least 90% sequence identity with respect to the parent sequence, preferably at least 95% identity, 96% identity, 97% identity, 98% identity or 99% or greater identity; and 3) will retain TdT template-independent DNA polymerase activity (e.g., enzyme activity) as that term is defined herein.
  • TdT template-independent DNA polymerase activity e.g., enzyme activity
  • non-conservative amino acid substitution may be preferred, for example, when a TdT variant with differing cofactor binding, enzyme activity, or reduced substrate bias is desired.
  • “Non conservative substitution” refers to the substitution of an amino acid in one class with an amino acid from another class; for example, substitution of an Ala, a class II residue, with a class III residue such as Asp, Asn, Glu, or Gln.
  • non-conservative substitutions include the substitution of a non-polar (hydrophobic) amino acid residue such as isoleucine, valine, leucine, alanine, methionine for a polar (hydrophilic) residue such as cysteine, glutamine, glutamic acid or lysine and/or a polar residue for a non-polar residue.
  • a non-polar (hydrophobic) amino acid residue such as isoleucine, valine, leucine, alanine, methionine
  • a polar (hydrophilic) residue such as cysteine, glutamine, glutamic acid or lysine and/or a polar residue for a non-polar residue.
  • a TdT variant as described herein can have a mixture of conservative and non-conservative amino acid substitutions in any desired configuration.
  • the TdT variant can be tested for activity, co-factor preference and nucleotide bias using methods known in the art or described in the Examples.
  • the amino acid residue to be mutated is an amino acid that plays a role in maintaining protein structural integrity, reaction catalysis and substrate binding (cofactor, initiator sequence & nucleoside triphosphate).
  • the one or more amino acids targeted for mutation to a different amino acid include, but are not limited to, S279, G341, H342, D343, V344, D345, A396, A429, 1430, R431, V432, D433, R442, F444, R453, Q454, or L459 (numbering based on the wild- type human TdT sequence Uniprot #P04053.
  • the TdT is a non-human mammalian TdT or TdT from other non-mammalian species.
  • the TdT is a member of the archaeo-eukaryotic primase (AEP) superfamily.
  • the TdT is a PolpTN2 or a C- terminal truncated PolpTN2, a PriS, a nonhomologous end joining archaeo-eukaryotic primase, a mammalian Ro ⁇ q, or a eukaryotic PrimPol.
  • the variant does not comprise a mutation(s) that would require TdT to use a template for synthesis of a polynucleotide strand.
  • Amino acid sequence alignment of a polypeptide of interest with a reference can provide guidance regarding not only residues likely to be necessary for function but also, conversely, those residues likely to tolerate change. Where, for example, an alignment shows two identical or similar amino acids at corresponding positions, it is more likely that that site is important functionally. Where, conversely, alignment shows residues in corresponding positions to differ significantly in size, charge, hydrophobicity, etc., it is more likely that that site can tolerate variation in a functional polypeptide.
  • Such alignments are readily created by one of ordinary skill in the art, e.g., using the default settings of the alignment tool of the BLASTP program.
  • homologs of any given polypeptide or nucleic acid sequence can be found using BLAST programs, e.g., by searching freely available databases of sequence for homologous sequences, or by querying those databases for annotations indicating a homolog (e.g., search strings that comprise a gene name or describe the activity of a gene).
  • the variant amino acid sequence (or corresponding DNA sequence) can be at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or more, identical to a native or reference sequence.
  • the degree of homology (percent identity) between a native and a mutant sequence can be determined, for example, by comparing the two sequences using freely available computer programs commonly employed for this purpose on the world wide web.
  • the variant amino acid or DNA sequence can be at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or more, similar to the sequence from which it is derived (referred to herein as an“original" sequence).
  • the degree of similarity (percent similarity) between an original and a mutant sequence can be determined, for example, by using a similarity matrix.
  • Similarity matrices are well known in the art and a number of tools for comparing two sequences using similarity matrices are freely available online, e.g., BLASTp (available on the world wide web), with default parameters set.
  • an amino acid mutation is introduced using any method known in the art, for example, site-directed mutagenesis where targeted mutations are introduced into one or more desired positions of a template TdT polynucleotide.
  • site-directed mutagenesis where targeted mutations are introduced into one or more desired positions of a template TdT polynucleotide.
  • This may be achieved by classic primer extension mutagenesis using a mutagenesis primer containing one or more desired mutations relative to the template polynucleotide.
  • the mutagenesis primer can be a synthetic oligonucleotide or a PCR product, and it may include one or more desired substitutions, deletions, additions or any desired combination thereof. Means and methods for producing such primers are readily available in the art.
  • the oligonucleotide or PCR product used as primer must be 5'-phosphorylated for ligation. This can be achieved by enzymatic phosphorylation reaction, by enzymatic digestion of the 5' end of the DNA or by conjugation in a chemical reaction.
  • Commercial kits for site-directed mutagenesis can be obtained commercially from e.g., New England Biolabs (Ipswich, MA), Thermo Fisher Scientific (Waltham, MA), Agilent (Santa Clara, CA), TransgenBiotech (Beijing, China), Biogene (Cambridge, MA), etc.
  • an insertion comprises at least one additional residue but does not exceed 20 additional residues, for example, 1-18 residues, 1-16 residues, 1-15 residues, 1-14 residues, 1-12 residues, 1-10 residues, 1-9 residues, 1-8 residues, 1-7 residues, 1-6 residues, 1-5 residues, 1-4 residues, 1-3 residues or 1-2 residues are inserted.
  • a deletion comprises removal of at least one residue but does not exceed 10 residues, for example, less than 9, less than 8, less than 7, less than 6, less than 5, less than 4, less than 3, or 2 residues are deleted.
  • a TdT polypeptide can be modified, e.g., by addition of a moiety to one or more of the amino acids that together comprise the peptide.
  • a polypeptide as described herein can comprise one or more moiety molecules, e.g., 1 or more moiety molecules per polypeptide, 2 or more moiety molecules per polypeptide, 5 or more moiety molecules per polypeptide, 10 or more moiety molecules per polypeptide or more.
  • a polypeptide as described herein can comprise one or more types of modifications and/or moieties, e.g. 1 type of modification, 2 types of modifications, 3 types of modifications or more types of modifications.
  • Non-limiting examples of modifications and/or moieties include PEGylation; glycosylation; HESylation; ELPylation; lipidation; acetylation; amidation; end-capping modifications; cyano groups; phosphorylation; albumin, and cyclization.
  • an end-capping modification can comprise acetylation at the N- terminus, N-terminal acylation, and N-terminal formylation.
  • an end-capping modification can comprise amidation at the C-terminus, introduction of C-terminal alcohol, aldehyde, ester, and thioester moieties.
  • the TdT polypeptide variant comprises a single-codon mutation, for example, a single-codon mutation at amino acid residue R453.
  • Exemplary single-codon mutants with confirmed activity include the variants listed in Table 3.
  • the single-codon mutant is selected from the variants in Table 3, Table 4 or Table 5.
  • a single codon-mutant at R453 can exhibit an altered preference for divalent cation in comparison to wildtype. Examples of such single codon-mutants and their preferred substrate include are found in Table 5. Table 5: Single-codon mutants at R453 with altered preference for divalent cation in comparison to wildtype.
  • the TdT polypeptide variant comprises a double-codon mutation.
  • one of the two codon mutations in a double-codon mutation comprises a mutation at residue R453 (e.g., R453A).
  • Exemplary double-codon mutants having activity confirmed as described in the working Examples include those listed in Table 6, 7 or 8.
  • Table 8 Top performing double-codon mutant variants (with R453A constant)
  • the modified TdT polypeptide variant comprises the mutations R453A- V432G.
  • the technology described herein relates to a nucleic acid encoding a modified or variant TdT polypeptide as described herein.
  • the term“nucleic acid” or “nucleic acid sequence” refers to any molecule, preferably a polymeric molecule, incorporating units of ribonucleic acid, deoxyribonucleic acid or an analog thereof.
  • the nucleic acid can be either single -stranded or double -stranded.
  • a single-stranded nucleic acid can be one strand nucleic acid of a denatured double- stranded DNA.
  • the nucleic acid is DNA.
  • the nucleic acid is RNA.
  • Suitable nucleic acid molecules include DNA, including genomic DNA or cDNA. Other suitable nucleic acid molecules include RNA, including mRNA.
  • a nucleic acid encoding a modified or variant TdT polypeptide as described herein is comprised by a vector.
  • a nucleic acid sequence encoding a modified TdT polypeptide as described herein is operably linked to a vector.
  • the term "vector”, as used herein, refers to a nucleic acid construct designed for delivery to a host cell or for transfer between different host cells.
  • a vector can be viral or non-viral.
  • the term“vector” encompasses any genetic element that is capable of replication when associated with the proper control elements and that can transfer gene sequences to cells.
  • a vector can include, but is not limited to, a cloning vector, an expression vector, a plasmid, phage, transposon, cosmid, chromosome, virus, virion, etc.
  • expression vector refers to a vector that directs expression of an RNA or polypeptide from sequences linked to transcriptional regulatory sequences on the vector.
  • sequences expressed will often, but not necessarily, be heterologous to the cell.
  • An expression vector may comprise additional elements, for example, the expression vector may have two replication systems, thus allowing it to be maintained in two organisms, for example in human cells for expression and in a prokaryotic host for cloning and amplification.
  • the term“viral vector” refers to a nucleic acid vector construct that includes at least one element of viral origin and has the capacity to be packaged into a viral vector particle.
  • the viral vector can contain a nucleic acid encoding a mutant TdT polypeptide as described herein in place of non- essential viral genes.
  • the vector and/or particle may be utilized for the purpose of transferring nucleic acids into cells either in vitro or in vivo. Numerous forms of viral vectors are known in the art. Production of TdT polypeptides or variants
  • a TdT variant as that term is used herein, can be produced chemically by e.g., solution or solid-phase peptide synthesis, or semi-synthesis in solution beginning with protein fragments coupled through conventional solution methods, as described by Dugas et al (1981). However, given the size and complexity of an enzyme, it is generally preferred to synthesize e.g., a TdT polypeptide or variant using e.g., recombinant methods.
  • the TdT polypeptide or variant is produced recombinantly.
  • Systems for cloning and expressing polypeptides useful with the methods and compositions described herein include various microorganisms and cells that are well known in recombinant technology and thus are not described in detail herein. These include, for example, various strains of E. coli, Bacillus, Streptomyces, and Saccharomyces, as well as mammalian, yeast and insect cells.
  • a TdT peptide or variant can be produced as a peptide or fusion protein, if so desired.
  • Suitable vectors for producing peptides and polypeptides are known and available from private and public laboratories and depositories and from commercial vendors.
  • Recipient cells capable of expressing the gene product are then transfected.
  • the transfected recipient cells are cultured under conditions that permit expression of the recombinant gene products, which are recovered from the culture.
  • Host mammalian cells such as Chinese Hamster ovary cells (CHO) or COS-l cells, can be used. These hosts can be used in connection with poxvirus vectors, such as vaccinia or swinepox. Suitable non-pathogenic viruses that can be engineered to carry the synthetic gene into the cells of the host include poxviruses, such as vaccinia, adenovirus, retroviruses and the like.
  • non- pathogenic viruses are commonly used for human gene therapy, and as carrier for other vaccine agents, and are known and selectable by one of skill in the art.
  • the selection of other suitable host cells and methods for transformation, culture, amplification, screening and product production and purification can be performed by one of skill in the art by reference to known techniques.
  • TdT polypeptide or variant it may be desirable to isolate and/or purify a synthesized TdT polypeptide or variant.
  • Protein purification techniques are well known to those of skill in the art and as such are not described in detail herein. These techniques can involve, at one level, the homogenization and crude fractionation of the cells, tissue or organ to polypeptide and non-polypeptide fractions.
  • the TdT peptide or variant can be further purified using chromatographic and electrophoretic techniques to achieve partial or complete purification (or purification to homogeneity).
  • Analytical methods particularly suited to the preparation of a pure peptide or polypeptide are ion-exchange chromatography, gel exclusion chromatography, polyacrylamide gel electrophoresis, affinity chromatography, immunoaffinity chromatography and isoelectric focusing.
  • a particularly efficient method of purifying peptides/polypeptides is fast performance liquid chromatography (FPLC) or even high performance liquid chromatography (HPLC).
  • A“purified TdT peptide/polypeptide or variant” is intended to refer to a composition, isolatable from other components, wherein the TdT peptide or variant is purified to any degree relative to the organism producing recombinant protein or in its naturally-obtainable state.
  • An isolated or purified polypeptide therefore, also refers to a /polypeptide free from the environment in which it may naturally occur.
  • “purified” will refer to a TdT polypeptide composition that has been subjected to fractionation to remove various other components, and which composition substantially retains its expressed biological activity (i.e., TdT DNA polymerase activity).
  • substantially purified this designation will refer to a composition in which the TdT polypeptide forms the major component of the composition, such as constituting about 50%, about 60%, about 70%, about 80%, about 90%, about 95%, or more of the proteins in the composition.
  • TdT polypeptide or variant there is no general requirement that the TdT polypeptide or variant be provided in the most purified state. Indeed, it is contemplated that less purified products will have utility in certain embodiments. Partial purification can be accomplished by using fewer purification steps in combination, or by utilizing different forms of the same general purification scheme. For example, it is appreciated that a cation- exchange column chromatography performed utilizing an HPLC apparatus will generally result in a greater “-fold” purification than the same technique utilizing a low pressure chromatography system. Methods exhibiting a lower degree of relative purification may have advantages in total recovery of protein product, or in maintaining the activity of an expressed protein.
  • TdT polypeptide or variant Various methods for quantifying the degree of purification of a given TdT polypeptide or variant are known to those of skill in the art and include, for example, determining the specific activity of an active fraction, or assessing the amount of polypeptides within a fraction by SDS/PAGE analysis
  • the enzymatic activity of TdT or a variant can be determined using any assay known to those of skill in the art and are not described in detail herein.
  • the activity of TdT or a variant thereof is described by the amount of protein that is needed to catalyze the incorporation of a certain concentration of natural or non-natural nucleotides into a single-stranded polynucleotide sequence using an initiator strand that exists in-solution or bound to a surface (e.g., de novo or in vitro).
  • multiple enzyme assays can be run in parallel under a variety of different conditions, such as in the presence of different metal ion cofactors.
  • the activity of a given TdT variant is compared to the activity of the protein from which it was derived.
  • a human TdT variant can be compared to the activity of wild-type human TdT (e.g., as a positive control).
  • the enzyme assay is performed using the preferred co-factor of endogenous TdT, Co2+.
  • the enzyme assay is performed in the presence of an alternative cofactor (e.g., Mn2+, Mg2+, Zn2+ etc.).
  • the activity of the TdT variant and the activity of the e.g., wild-type TdT are measured using the same set of reaction conditions, which can directly show the effect of a given mutation(s) on the functional activity of TdT.
  • one can compare the activity of the variant TdT in the presence of Mg2+ to the activity of wild-type TdT in the presence of Co2+.
  • Such an assay can be useful to determine the effects of a given mutation on function of the variant TdT as compared to the wild-type TdT under preferred, endogenous conditions (i.e., in the presence of Co2+ as a co-factor).
  • multiplex and high- throughput are used to describe the parallelizable nature of a kinetic assay by having the ability to determine the individual enzymatic activity of less than, equal to or greater than 96 protein variants and/or reaction conditions in a single experiment.
  • the activity of multiple purified TdT variants can be determined by the rate at which long single-stranded polynucleotide sequences are produced by measuring the fluorescent response in Relative Fluorescence Units (RFU) of a nucleic acid stain that is highly specific for single-stranded DNA.
  • REU Relative Fluorescence Unit
  • the accuracy of the kinetic assay is characterized by a minimal observable fluorescent response if double stranded DNA contaminants are present in the reaction vessel or if single-stranded polynucleotide sequences form unintended secondary structures such as hairpins, stem-loop structures or G-quadraplexes and the like. Terminal deoxynucleotidyl transferase activity is only present if an observable increase in fluorescent signal occurs in comparison to a negative control consisting of only initiator strand, free nucleotides, cofactor and appropriate buffers.
  • a positive control consisting of commercially available terminal transferase, such as bovine terminal deoxynucleotidyl transferase (New England Biolabs, Inc.) may also be used to relatively gauge the activity of purified template independent DNA polymerase variants or complexes.
  • Terminal transferase such as bovine terminal deoxynucleotidyl transferase (New England Biolabs, Inc.)
  • bovine terminal deoxynucleotidyl transferase New England Biolabs, Inc.
  • Single stranded nucleic acid fluorescent stains suitable for kinetic assays are known to those of skill in the art and are described in (ThermoFisher Scientific Inc., The Molecular Probes Handbook, Nucleic Acid Detection and Analysis— Chapter 8, Nucleic Acid Stains— Section 8.1, hereby incorporated by reference in its entirety).
  • a concentration curve consisting of a single polynucleotide sequence greater than 10 nucleotides can be generated to yield a set of standardized fluorescent signals. Because the fluorescent response in the presence of TdT activity is directly correlated to the amount of single stranded polynucleotide present at a given reaction time interval, the exact amount of polynucleotide in terms of mass can be interpolated from the concentration versus RFU curve and tracked throughout the progression of the reaction. This produces a rate unit for a particular amount of protein in terms of "mass increase in single -stranded polynucleotide per minute".
  • the rate unit for this kinetic assay can be further quantitated given additional reaction parameters such as free nucleotide composition, cofactors, and initiator sequence composition as well as each component' s respective concentration.
  • This kinetic assay provides a highly accurate and standardized method to specifically determine the best- candidate TdT variants in a cost- efficient and high-throughput activity screen.
  • compositions comprising a TdT polypeptide or variant
  • the TdT polypeptide or variant (e.g., isolated, synthetic, or recombinant peptide) is attached to, or enclosed or enveloped by, a macromolecular complex.
  • the macromolecular complex can be, without limitation, a virus, a bacteriophage, a bacterium, a liposome, a microparticle, a targeting sequence, a nanoparticle (e.g., a gold nanoparticle), a magnetic bead, a yeast cell, a mammalian cell, a cell or a microdevice.
  • macromolecular complexes within the scope of the methods and compositions described herein can include virtually any complex that can attach or enclose a peptide/polypeptide and be used in methods for c/e novo synthesis of oligonucleotides or other nucleic acids.
  • the isolated TdT polypeptide or variant can be attached to a solid support e.g., for purification of the TdT polypeptide or variant and/or ease of removing nucleic acid products generated in an enzymatic reaction mix from the TdT polypeptide when desired, such as, for example, magnetic beads, Sepharose beads, agarose beads, nanoparticles, a nitrocellulose membrane, a nylon membrane, a column chromatography matrix, a high performance liquid chromatography (HPLC) matrix or a fast performance liquid chromatography (FPLC) matrix for purification.
  • a solid support e.g., for purification of the TdT polypeptide or variant and/or ease of removing nucleic acid products generated in an enzymatic reaction mix from the TdT polypeptide when desired, such as, for example, magnetic beads, Sepharose beads, agarose beads, nanoparticles, a nitrocellulose membrane, a nylon membrane, a column chromatography matrix, a high performance liquid chromatography (
  • a solid support may be biological, nonbiological, organic, inorganic, or any combination thereof.
  • Supports for use with TdT polypeptides or variants can be any shape, size, or geometry as desired.
  • the support may be square, rectangular, round, flat, planar, circular, tubular, spherical, and the like.
  • the support may be physically separated into regions, for example, with trenches, grooves, wells, or chemical barriers (e.g., hydrophobic coatings, etc.).
  • Supports may be made from glass (silicon dioxide), metal, ceramic, polymer or other materials known to those of skill in the art.
  • Supports may be a solid, semi-solid, elastomer or gel.
  • TdT polypeptide sequences can be bound to such supports or substrates using methods, linkers (cleavable or non-cleavable) and chemistry known to those of skill in the art.
  • the TdT polypeptide or variant comprises a fusion protein.
  • These molecules generally have all or a substantial portion of the TdT peptide/variant, linked at the N- or C- terminus, to all or a portion of a second polypeptide or protein.
  • fusions may employ leader sequences from other species to permit the recombinant expression of a protein in a heterologous host.
  • Another useful fusion includes the addition of an immunologically active domain, such as an antibody epitope, to facilitate purification of the fusion protein. Inclusion of a cleavage site at or near the fusion junction will facilitate removal of the extraneous polypeptide after purification.
  • Other useful fusions include linking of functional domains, such as, for example, active sites from enzymes, glycosylation domains, cellular targeting signals or transmembrane regions.
  • TdT variants described herein can be used in the synthesis of nucleic acids for the purpose of storing digital information in nucleic acids, such as DNA.
  • DNA has the capacity to hold vast amounts of information, readily stored for long periods in a compact form.
  • the high capacity of DNA to store information stably under easily achieved conditions has made DNA an attractive target for information storage since the mid-90’s.
  • DNA molecules have a longevity that permits long-term storage with little to no deterioration of the encoded information.
  • Data storage systems based on both living vector DNA (in vivo DNA molecules) and synthesized DNA (in vitro DNA) have been proposed. Given that in vivo DNA storage systems have constraints on the quantity, genomic elements and locations that can be manipulated without affecting viability of the DNA molecules in the living vector organisms (e.g., bacteria), in vivo DNA storage is not the preferred method for high capacity data storage.
  • the methods and compositions provided herein relate to an enzymatic method of making a polynucleotide or nucleic acid sequence.
  • provided herein are method for c/e novo synthesis of nucleic acid sequences using a TdT variant as described herein for the purpose of storing digital information in DNA.
  • the method includes combining at least one selected nucleotide triphosphate, one or more cations, and a TdT variant in an aqueous reaction medium including a target substrate comprising an initiator sequence and having a 3' terminal nucleotide attached to a single stranded portion, such that the template -independent polymerase interact with the target substrate under conditions which covalently add one or more of the selected nucleotide triphosphate to the 3' terminal nucleotide.
  • the method can further includes repeatedly introducing an additional subsequent selected nucleotide triphosphate to the aqueous reaction medium under conditions which enzymatically add one or more of the subsequent selected nucleotide triphosphate to the target substrate until the polynucleotide is formed.
  • a TdT variant as described herein to a macromolecule or solid support in a method of generating a polynucleotide.
  • a TdT variant can be contacted with the components necessary for an enzymatic reaction that produces polynucleotide products in a flow-through manner, while the TdT variant is conjugated to a solid.
  • the solid support can comprise a growing polynucleotide strand and an untethered TdT variant can be removed from the solid support to stop the enzymatic reaction.
  • Other methods where one or more of the reaction components are attached to a solid support can be readily envisioned by one of skill in the art.
  • conditions sufficient to synthesize one or more nucleic acid molecules using the TdT variants described herein can include one or more nucleotides, one or more buffers or buffering salts, and one or more cofactors (e.g., divalent metal ions).
  • conditions sufficient to synthesize one or more nucleic acid molecules according to the invention may include incubating at an elevated temperature (e.g., greater than about 37° C., 40° C., 45° C., 50° C., 55° C., 60° C., 65° C., 70° C., 75° C., 80° C., 85° C., 90° C., or 95° C.) and/or in the presence of one or more deoxy- or dideoxyribonucleoside triphosphates.
  • an elevated temperature e.g., greater than about 37° C., 40° C., 45° C., 50° C., 55° C., 60° C., 65° C., 70° C., 75° C., 80° C., 85° C., 90° C., or 95° C.
  • Suitable deoxy- and dideoxyribonucleoside triphosphates include, but are not limited to, dATP, dCTP, dGTP, dTTP, dITP, 7-deaza-dGTP, 7-deaza-dATP, ddUTP, ddATP, ddCTP, ddGTP, ddITP, ddTTP, [a-S]dATP, [a-S]dTTP, [a-S]dGTP, and [a-S]dCTP.
  • the conditions may comprise a suitable concentration of at least one divalent metal cofactor. In some embodiments, the conditions may comprise more than one divalent metal cofactor.
  • Nucleic acids synthesized using the methods and compositions described herein can be applied to the storage of digital information in DNA as known to those of skill in the art.
  • TdT variants as described herein can be used with any method known in the art for the purpose of generating oligonucleotides de novo.
  • Oligonucleotides synthesized using the methods and/or TdT variants described herein comprise, in various embodiments, at least about 5, 10, 15, 20, 30, 40, 50, 60, 70, 75, 80, 90, 100, 120, 150 or more bases.
  • oligonucleotide synthesis is performed on a surface to allow for synthesis at a fast rate.
  • a fast rate As an example, at least 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 50, 55, 60, 70, 80, 90, 100, 125, 150, 175, 200 nucleotides per hour, or more are synthesized.
  • libraries of oligonucleotides are synthesized in parallel on a substrate.
  • kits comprising wild type TdT or variant TdT polypeptides can be configured for use in any procedure known to those skilled in the art.
  • Suitable kits can be prepared for, for example, for cDNA synthesis and/or amplification, detectably labeling DNA molecules, and DNA sequencing.
  • kits can comprise a carrier that can be compartmentalized to receive in close confinement one or more containers such as vials, test tubes, wells, solid supports, chips and the like.
  • at least one of such containers contains components or a mixture of components needed to perform c/e novo oligonucleotide or DNA synthesis.
  • a kit as described herein comprises a container having a substantially purified sample of a TdT variant of the invention.
  • the kit comprises a container(s) having one or more nucleotides needed to synthesize a DNA molecule.
  • a kit comprises a container having one or a number of different types of dideoxynucleoside triphosphates, optionally labeled with one or more detectable groups.
  • a kit as described herein can comprise pyrophosphatase.
  • Kits for DNA synthesis can comprise a first container containing a TdT variant polymerase as described herein, and one or more containers each having one, two, three or four dNTPs. Of course, it is also possible to combine one or more of these reagents in a single tube or other containers.
  • the kit of the present invention may include one or more containers that contain detectably labeled nucleotides that may be used during the synthesis or sequencing of a DNA molecule.
  • labels include, but are not limited to, radioactive isotopes, fluorescent labels, chemiluminescent labels, nuclear tags biolumine scent labels and enzyme labels.
  • the invention may be as claimed in any one of the following numbered paragraphs.
  • TdT terminal deoxynucleotidyl transferase
  • modified TdT enzyme of any one of paragraphs 1-8 wherein the modified TdT comprises a mutation at R453 and at least one additional mutation at an amino acid residue selected from the group consisting of: S279, G341, H342, D343, V344, D345, A396, A429, 1430, R431, V432, D433, R442, F444, R453, Q454, and F459.
  • a method for generating a polynucleotide sequence c/e novo or in vitro comprising; incubating a modified TdT enzyme of paragraph 1 in the presence of an initiator sequence, a cofactor and at least one nucleoside triphosphate under conditions and for a time sufficient to add at least one nucleotide to the 3’ end of a polynucleotide strand.
  • a modified TdT enzyme or the initiator sequence are conjugated to a solid support.
  • [00135] 23 A vector comprising the nucleic acid molecule of paragraph 22.
  • a cell comprising the modified TdT polypeptide of paragraphs 1-13, the nucleic acid molecule of paragraph 22, and/or the vector of paragraph 23.
  • a solid support comprising a modified TdT enzyme of paragraphs 1-13.
  • Terminal deoxynucleotidyl transferase is an exceedingly useful template -independent DNA polymerase for major biotechnological applications such as the storage of digital information and c/e novo oligonucleotide synthesis.
  • TdT has the ability to rapidly catalyze the synthesis of long DNA oligonucleotides in the presence of only a small initiator sequence, cofactors, and nucleoside triphosphate monomers.
  • TdT exhibits substrate bias towards both the preferred initiator sequence composition and nucleoside triphosphate base (A, G, C, or T). This bias greatly inhibits the ability for TdT to be utilized as a universal template -independent DNA polymerase and severely limits the capacity for the precise control of TdT in any oligonucleotide synthesis scheme.
  • functional mutant variants of TdT that display improved enzymatic phenotypes, such as reduced substrate bias, are highly desired to enable a multitude of commercially viable biotechnological applications. Because the protein functional landscape of TdT remains largely unexplored, it was sought to begin mapping it by generating libraries of TdT mutant variants via site-directed mutagenesis with the overall intention of improving the wild-type enzyme.
  • EXAMPLE 1 ENZYME ENGINEERING APPROACH
  • Rational site-directed mutagenesis was employed as a primary method for generating TdT mutant variant libraries in that the technique is low-cost and well-practiced in the protein engineering field. Based on a combination of protein structural analysis and previous reports of single-codon TdT mutant variant functionality analysis, several amino acid residues were identified that are important for maintaining protein structural integrity, reaction catalysis, and substrate binding (cofactor, initiator sequence, & nucleoside triphosphate).
  • Mutant variant generation proceeded hierarchically: single codon-mutants were first generated and evaluated for initial activity and/or the desired enzymatic phenotype followed by double-codon mutants and then, if necessary, triple-codon mutants.
  • mutant variant libraries were designed and built from TdT sequences originating from species including but not limited to wild-type Mus musculus, Bos taurus, Monodelphis domestica, Eulemur macaco, Xenopus laevis, Ambystoma mexicanum, Oncorhynchus mykiss, and Gallus. Additional mutant variation may arise from truncations or removal of protein domains associated with in vivo enzymatic activities DNA repair mechanisms unnecessary for in vitro DNA oligonucleotide synthesis.
  • TdT requires the presence of a divalent cation cofactor for optimal enzymatic activity.
  • Synthesis reactions are typically supplemented with Co2+, however alternative divalent cations with varying properties such as Mg2+, Mn2+, Zn2+, and combinations thereof have been reported to be compatible with TdT.
  • divalent cations are directly involved in the binding and catalysis of the nucleoside triphosphates onto the growing oligonucleotide, it was hypothesized that any human TdT mutant variants generated may have altered cofactor requirements due to changes in structure, polarity, or hydrophobicity of the catalytic pocket.
  • enzymatic activity was screened in the presence of 0.25 mM Co2+, Mn2+ or Mg2+. Reaction supplementation with Zn2+ is least tolerated by TdT and deviations from a concentration of 0.25 mM cofactor generally result in decreased enzymatic activity.
  • the single-codon mutant R453H displayed the highest activity when the reaction was supplemented with Mn2+; however, supplementation with Co2+ and Mg2+ still resulted in appreciable activity.
  • Denaturing gel electrophoresis analysis of these reactions indicate that long oligonucleotides >400-nt were synthesized in the presence of each cofactor as compared to the wild-type, no-change human TdT where long oligonucleotide were only synthesized in the presence of Co2+ (FIG. 2B).
  • a large array of different cofactor preferences were observed when screening double-codon mutants variants carrying the constant amino acid change R453A (FIG. 3).
  • mutant variants of human TdT indicate that improvements over the wild-type enzyme may be observed as increased flexibility in the presence of variable reaction conditions or substrates. Therefore, novel mutant variants may be, for example, less temperature or pH sensitive, highly processive, able to synthesize DNA oligonucleotide at faster rates, able to incorporate non-natural nucleotides or may only display their enzymatic phenotypes when reactions are supplemented with the preferred cofactor. However, mutant variants such as R453H are highly desired as they may function optimally regardless of reaction and substrate type or composition.
  • each double-codon mutant variants’ ability was evaluated with respect to its ability to incorporate each of the four natural nucleoside triphosphate bases with reactions substituted with Mn 2+ .
  • the 100 mutant variants produced it was found that several displayed an enhanced ability to incorporate all four bases given a particular oligonucleotide initiator sequence (FIG. 4).
  • TdT is very active when adding dATP to an initiator oligonucleotide consisting of a homopolymer dT initiator oligonucleotide but not active when in the presence of dTTP under similar conditions; however, the mutant variant R453A-V432G allowed all four natural nucleoside triphosphate bases to be incorporated at similar rates producing very long fragments of ssDNA (>l400-nt) using the same homopolymer dT oligonucleotide (FIG. 5).
  • this mutant variant of hTdT retained this decreased substrate bias and specificity in the presence of a homopolymer dA oligonucleotide, which could be active when adding dTTPs but not active in the presence of dATP (FIG. 6). While R453A-V432G was the best performing mutant variant identified to date in terms of most significantly decreased substrate bias and specificity, other mutant variants of human TdT that display similar enzymatic phenotypes are also of particular interest.
  • TdT can incorporate ribonucleotides in addition to deoxyribonucleotides; however, generally only 1-2 ribonucleotides can be added as growing DNA oligonucleotide becomes a less preferred substrate for TdT.
  • double-codon mutants display decreased substrate bias for deoxynucleotides
  • top performing variants can efficiently incorporate all four natural ribonucleoside triphosphates.
  • R453A-V432G produced long fragments of ssRNA in comparison to single-codon mutant R453A (FIG. 7).
  • the primary sequences of wild-type or mutant enzymes of interest were codon optimized for E. coli expression using a custom optimization algorithm and ordered as gBlocks® (IDT) with 20-nt overlap sequences for Gibson Assembly into the pET-28-c-(+) His-tag expression vector (EMD Millipore 69866- 3).
  • IDT gBlocks®
  • the gBlocks® were PCR amplified with Phusion High Fidelity (HF) Polymerase (NEB M05030).
  • PCR thermocycling was performed as follows: initial denature for 98°C for 30 seconds, denature at 98°C for 10 seconds, anneal at 68°C for 10 seconds, and extend at 72°C for 60 seconds for 18 cycles before a final extension of 5 minutes at 72°C.
  • PCR reactions were purified and concentrated using a QIAquick PCR Purification Kit (Qiagen 28106).
  • the pET-28-c-(+) expression vector was prepared for gBlocks® insertion by digesting the circular DNA with 40U of NDel (NEB R0111) per 500 ng vector at 37°C for 90 minutes.
  • the linear DNA was separated from undigested material with 2% agarose gel electrophoresis and extracted by incubating agarose containing the bands corresponding to the linear DNA in Buffer QG (Qiagen 19063) at 55°C rotating at 1000 RPM for 2 hours. The resultant mixture was cleaned and concentrated with the QIAquick PCR Purification Kit. The PCR amplified insert and vector sequences were combined at a ratio of 1:3 with 0.1 pmol of total material and assembled with Gibson Assembly Master Mix (NEB E5510S) at 50°C for 1 hour. T7 Express chemically competent E.
  • Cultures were then pelleted at 3500 x g for 10 minutes and then His-Tag purified using a HisTalon Resin Kit as per manufacturer’s instructions (Clontech 635654).
  • the eluted enzyme samples were then buffer exchanged into an optimal 2X protein storage buffer using l5-mL filter columns (Millipore) at the appropriate MWCO by centrifugation at 5000 x G for 15 minutes at 4C. This process was repeated twice. On the third spin, samples were spun for 30 minutes in order to concentrate the protein into a smaller volume.
  • Single or multiple amino acids can be mutagenized for improvement by rational design or by high-throughput methods such as error-prone PCR.
  • Plasmids carrying the target protein were harvested and purified from a sequence verified liquid bacterial cultures grown overnight in LB-kanamycin media at 37°C using a MiniPrep Kit (Qiagen 27104).
  • Oligonucleotide primers were ordered from IDT and were designed to PCR amplify the protein expression plasmid while simultaneously mutagenizing the plasmid at the predetermined location, yielding linearized DNA.
  • the protein expression plasmid was PCR amplified using the Q5 Hot Start High-Fidelity 2x Master Mix with the following thermocycling conditions: initial denature for 98°C for 30 seconds, denature at 98°C for 10 seconds, anneal at 68°C for 10 seconds, and extend at 72°C for 120 seconds for 25 cycles before a final extension of 2 minutes at 72°C. 1 pL of the resulting PCR amplification reaction was then treated with the kit’s enzyme reaction cocktail to re -circularize the protein expression plasmid while digesting away the unsubstituted plasmid sequences remaining in the reaction mixture .
  • the length of the ssDNA produced in these reactions was determined by comparing products to a 100-h ⁇ ssDNA ladder (Simplex Biosciences) using a 15% TBE-Urea denaturing gel (Thermo EC6885) following the manufacturer’s protocol. Approximately 8 pL of the initial activity screen reaction volume was loaded onto the gels and run at 185V for 60 minutes unless otherwise specified. Gels were then stained with a solution of lx GelStar Nucleic Acid stain gel stain for 15 minutes with gentle agitation. The resultant gel was then imaged on a Typhoon FLA 9500 system (GE Healthcare Life Sciences) using imaging parameters for SYBR Gold. For extension reactions using initiator oligonucleotides labeled with a 5’-fluorophore such as FAM, Cy5, Cy3, etc, gels were not stained and imaged directly using the appropriate parameters.
  • initiator oligonucleotides labeled with a 5’-fluorophore such as FAM, Cy5,

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Genetics & Genomics (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Biochemistry (AREA)
  • Molecular Biology (AREA)
  • Biotechnology (AREA)
  • Microbiology (AREA)
  • Biomedical Technology (AREA)
  • Medicinal Chemistry (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • General Chemical & Material Sciences (AREA)
  • Enzymes And Modification Thereof (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)

Abstract

La présente invention concerne des polypeptides TdT modifiés et les utilisations associées.
PCT/US2019/054398 2018-10-04 2019-10-03 Compositions et procédés comprenant des mutants de désoxynucléotidyle transférase terminale WO2020072715A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201862741143P 2018-10-04 2018-10-04
US62/741,143 2018-10-04

Publications (1)

Publication Number Publication Date
WO2020072715A1 true WO2020072715A1 (fr) 2020-04-09

Family

ID=70055838

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2019/054398 WO2020072715A1 (fr) 2018-10-04 2019-10-03 Compositions et procédés comprenant des mutants de désoxynucléotidyle transférase terminale

Country Status (1)

Country Link
WO (1) WO2020072715A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116836955A (zh) * 2023-05-17 2023-10-03 中国科学院深圳先进技术研究院 末端脱氧核苷酸转移酶及其制备方法

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160108382A1 (en) * 2014-10-20 2016-04-21 Molecular Assemblies, Inc. Modified template-independent enzymes for polydeoxynucleotide synthesis
US20180023108A1 (en) * 2015-02-10 2018-01-25 Nuclera Nucleics Ltd. Novel use
WO2018102818A1 (fr) * 2016-12-02 2018-06-07 President And Fellows Of Harvard College Variants de polymérase d'adn indépendantes de la matrice processives
US20190211315A1 (en) * 2018-01-08 2019-07-11 Dna Script Variants of Terminal Deoxynucleotidyl Transferase and Uses Thereof

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160108382A1 (en) * 2014-10-20 2016-04-21 Molecular Assemblies, Inc. Modified template-independent enzymes for polydeoxynucleotide synthesis
US20180023108A1 (en) * 2015-02-10 2018-01-25 Nuclera Nucleics Ltd. Novel use
WO2018102818A1 (fr) * 2016-12-02 2018-06-07 President And Fellows Of Harvard College Variants de polymérase d'adn indépendantes de la matrice processives
US20190211315A1 (en) * 2018-01-08 2019-07-11 Dna Script Variants of Terminal Deoxynucleotidyl Transferase and Uses Thereof

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116836955A (zh) * 2023-05-17 2023-10-03 中国科学院深圳先进技术研究院 末端脱氧核苷酸转移酶及其制备方法
CN116836955B (zh) * 2023-05-17 2024-05-10 中国科学院深圳先进技术研究院 末端脱氧核苷酸转移酶及其制备方法

Similar Documents

Publication Publication Date Title
CN102796728B (zh) 用于通过转座酶的dna片段化和标记的方法和组合物
Roy et al. Post‐transfer editing in vitro and in vivo by the β subunit of phenylalanyl‐tRNA synthetase
Robichon et al. Engineering Escherichia coli BL21 (DE3) derivative strains to minimize E. coli protein contamination after purification by immobilized metal affinity chromatography
US20190360013A1 (en) Processive Template Independent DNA Polymerase Variants
Frechin et al. Yeast mitochondrial Gln-tRNAGln is generated by a GatFAB-mediated transamidation pathway involving Arc1p-controlled subcellular sorting of cytosolic GluRS
US8859237B2 (en) Diguanylate cyclase method of producing the same and its use in the manufacture of cyclic-di-GMP and analogues thereof
JP2022543569A (ja) ポリ(a)およびポリ(u)ポリメラーゼを使用するポリヌクレオチドの鋳型なしの酵素による合成
CN113061591B (zh) 一种新型萤火虫萤光素酶突变体、其制备方法和应用
CN103562410A (zh) 具有降低的非特异性活性的Sso7-聚合酶缀合物
JPWO2016148044A1 (ja) 改変アミノアシルtRNA合成酵素およびその用途
JP4263598B2 (ja) チロシルtRNA合成酵素変異体
CN111073871B (zh) 热稳定性提高的dna聚合酶突变体及其构建方法和应用
CN111172129A (zh) 一种提高热稳定、扩增均一性和扩增效率的Phi29 DNA聚合酶突变体及其应用
EP3959312A1 (fr) Domaines de liaison d'acides nucléiques isolés
WO2020072715A1 (fr) Compositions et procédés comprenant des mutants de désoxynucléotidyle transférase terminale
CN116096872A (zh) 热稳定的末端脱氧核苷酸转移酶
CN112175980B (zh) 通过定点突变提高聚合酶大片段活性的方法及应用
Fang et al. Synonymous rare arginine codons and tRNA abundance affect protein production and quality of TEV protease variant
JP5305440B2 (ja) 変異体SepRS及びこれを用いるタンパク質への部位特異的ホスホセリン導入法
CN116240188A (zh) 一种热稳定性提高的二氢蝶酸合成酶突变体的制备方法
TW202115249A (zh) 用於定序反應之聚合酶
KR20210151928A (ko) 호열성 단백질을 이용한 재조합 시험관내 전사 및 해독을 위한 시스템, 방법 및 조성물
Sissler et al. Handling mammalian mitochondrial tRNAs and aminoacyl-tRNA synthetases for functional and structural characterization
WO2023098036A1 (fr) Mutant d'enzyme taq, son procédé de préparation et son application
Yanagihara et al. The dnaE173 mutator mutation confers on the α subunit of Escherichia coli DNA polymerase III a capacity for highly processive DNA synthesis and stable binding to primer/template DNA

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19869865

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19869865

Country of ref document: EP

Kind code of ref document: A1