WO2024059703A1 - Covalently modified template-independent dna polymerase and methods of use thereof - Google Patents
Covalently modified template-independent dna polymerase and methods of use thereof Download PDFInfo
- Publication number
- WO2024059703A1 WO2024059703A1 PCT/US2023/074179 US2023074179W WO2024059703A1 WO 2024059703 A1 WO2024059703 A1 WO 2024059703A1 US 2023074179 W US2023074179 W US 2023074179W WO 2024059703 A1 WO2024059703 A1 WO 2024059703A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- tdt
- deletion
- engineered
- seq
- redox
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 92
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 title description 5
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 title description 5
- 102100033215 DNA nucleotidylexotransferase Human genes 0.000 claims abstract description 529
- 108010008286 DNA nucleotidylexotransferase Proteins 0.000 claims abstract description 524
- 239000002773 nucleotide Substances 0.000 claims abstract description 244
- 125000003729 nucleotide group Chemical group 0.000 claims abstract description 217
- 230000035772 mutation Effects 0.000 claims abstract description 166
- 230000015572 biosynthetic process Effects 0.000 claims abstract description 86
- 238000003786 synthesis reaction Methods 0.000 claims abstract description 79
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 48
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 42
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 42
- 230000006820 DNA synthesis Effects 0.000 claims abstract description 15
- 230000002255 enzymatic effect Effects 0.000 claims abstract description 11
- 238000012217 deletion Methods 0.000 claims description 272
- 230000037430 deletion Effects 0.000 claims description 272
- 230000021615 conjugation Effects 0.000 claims description 141
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 claims description 122
- DFPAKSUCGFBDDF-UHFFFAOYSA-N Nicotinamide Chemical compound NC(=O)C1=CC=CN=C1 DFPAKSUCGFBDDF-UHFFFAOYSA-N 0.000 claims description 108
- AZQWKYJCGOJGHM-UHFFFAOYSA-N 1,4-benzoquinone Chemical compound O=C1C=CC(=O)C=C1 AZQWKYJCGOJGHM-UHFFFAOYSA-N 0.000 claims description 68
- 229960000907 methylthioninium chloride Drugs 0.000 claims description 64
- 239000011780 sodium chloride Substances 0.000 claims description 61
- 239000011570 nicotinamide Substances 0.000 claims description 54
- 229960003966 nicotinamide Drugs 0.000 claims description 54
- 235000005152 nicotinamide Nutrition 0.000 claims description 54
- 239000004971 Cross linker Substances 0.000 claims description 53
- GHCZTIFQWKKGSB-UHFFFAOYSA-N 2-hydroxypropane-1,2,3-tricarboxylic acid;phosphoric acid Chemical compound OP(O)(O)=O.OC(=O)CC(O)(C(O)=O)CC(O)=O GHCZTIFQWKKGSB-UHFFFAOYSA-N 0.000 claims description 51
- OUCMTIKCFRCBHK-UHFFFAOYSA-N 3,3-dibenzylcyclooctyne Chemical compound C1CCCCC#CC1(CC=1C=CC=CC=1)CC1=CC=CC=C1 OUCMTIKCFRCBHK-UHFFFAOYSA-N 0.000 claims description 51
- 230000001588 bifunctional effect Effects 0.000 claims description 49
- -1 bis- (trimethylammonio) propyl Chemical group 0.000 claims description 49
- 239000002253 acid Substances 0.000 claims description 47
- 239000008363 phosphate buffer Substances 0.000 claims description 47
- 102220581622 Heat shock factor-binding protein 1_L19K_mutation Human genes 0.000 claims description 42
- 102220570178 Hydroxymethylglutaryl-CoA lyase, mitochondrial_S75R_mutation Human genes 0.000 claims description 42
- 102220518048 NAD-dependent protein deacetylase sirtuin-1_S47A_mutation Human genes 0.000 claims description 42
- 102220600219 Neutrophil elastase_M44R_mutation Human genes 0.000 claims description 42
- 102220532368 Serpin B10_K99Q_mutation Human genes 0.000 claims description 42
- 102220030871 rs151103906 Human genes 0.000 claims description 42
- 102200081049 rs74315305 Human genes 0.000 claims description 42
- 102220244437 rs752960312 Human genes 0.000 claims description 42
- 239000001226 triphosphate Substances 0.000 claims description 39
- 235000011178 triphosphate Nutrition 0.000 claims description 39
- 239000000872 buffer Substances 0.000 claims description 35
- 238000003776 cleavage reaction Methods 0.000 claims description 35
- 230000007017 scission Effects 0.000 claims description 35
- 125000006850 spacer group Chemical group 0.000 claims description 35
- 125000000524 functional group Chemical group 0.000 claims description 33
- 102220042837 rs12944167 Human genes 0.000 claims description 32
- 102220075934 rs773979248 Human genes 0.000 claims description 32
- 238000007792 addition Methods 0.000 claims description 31
- RTZKZFJDLAIYFH-UHFFFAOYSA-N ether Substances CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 claims description 31
- 102220580177 Non-receptor tyrosine-protein kinase TYK2_D31A_mutation Human genes 0.000 claims description 30
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 claims description 30
- 150000001413 amino acids Chemical group 0.000 claims description 29
- UNXRWKVEANCORM-UHFFFAOYSA-N triphosphoric acid Chemical compound OP(O)(=O)OP(O)(=O)OP(O)(O)=O UNXRWKVEANCORM-UHFFFAOYSA-N 0.000 claims description 28
- JSTCPNFNKICNNO-UHFFFAOYSA-N 4-nitrosophenol Chemical group OC1=CC=C(N=O)C=C1 JSTCPNFNKICNNO-UHFFFAOYSA-N 0.000 claims description 26
- 108010043121 Green Fluorescent Proteins Proteins 0.000 claims description 26
- 102000004144 Green Fluorescent Proteins Human genes 0.000 claims description 25
- AUNGANRZJHBGPY-SCRDCRAPSA-N Riboflavin Chemical compound OC[C@@H](O)[C@@H](O)[C@@H](O)CN1C=2C=C(C)C(C)=CC=2N=C2C1=NC(=O)NC2=O AUNGANRZJHBGPY-SCRDCRAPSA-N 0.000 claims description 25
- FVTCRASFADXXNN-SCRDCRAPSA-N flavin mononucleotide Chemical compound OP(=O)(O)OC[C@@H](O)[C@@H](O)[C@@H](O)CN1C=2C=C(C)C(C)=CC=2N=C2C1=NC(=O)NC2=O FVTCRASFADXXNN-SCRDCRAPSA-N 0.000 claims description 25
- 229940013640 flavin mononucleotide Drugs 0.000 claims description 25
- 239000011768 flavin mononucleotide Substances 0.000 claims description 25
- FVTCRASFADXXNN-UHFFFAOYSA-N flavin mononucleotide Natural products OP(=O)(O)OCC(O)C(O)C(O)CN1C=2C=C(C)C(C)=CC=2N=C2C1=NC(=O)NC2=O FVTCRASFADXXNN-UHFFFAOYSA-N 0.000 claims description 25
- 239000005090 green fluorescent protein Substances 0.000 claims description 25
- 235000019231 riboflavin-5'-phosphate Nutrition 0.000 claims description 25
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 24
- 230000002829 reductive effect Effects 0.000 claims description 21
- 230000009467 reduction Effects 0.000 claims description 20
- 239000003115 supporting electrolyte Substances 0.000 claims description 20
- 125000001314 canonical amino-acid group Chemical group 0.000 claims description 19
- 239000004310 lactic acid Substances 0.000 claims description 19
- 230000003647 oxidation Effects 0.000 claims description 19
- 238000007254 oxidation reaction Methods 0.000 claims description 19
- JMSVCTWVEWCHDZ-UHFFFAOYSA-N syringic acid Chemical compound COC1=CC(C(O)=O)=CC(OC)=C1O JMSVCTWVEWCHDZ-UHFFFAOYSA-N 0.000 claims description 19
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 claims description 18
- MWOOGOJBHIARFG-UHFFFAOYSA-N vanillin Chemical compound COC1=CC(C=O)=CC=C1O MWOOGOJBHIARFG-UHFFFAOYSA-N 0.000 claims description 17
- FGQOOHJZONJGDT-UHFFFAOYSA-N vanillin Natural products COC1=CC(O)=CC(C=O)=C1 FGQOOHJZONJGDT-UHFFFAOYSA-N 0.000 claims description 16
- 235000012141 vanillin Nutrition 0.000 claims description 16
- 239000003795 chemical substances by application Substances 0.000 claims description 15
- RYYVLZVUVIJVGH-UHFFFAOYSA-N caffeine Chemical compound CN1C(=O)N(C)C(=O)C2=C1N=CN2C RYYVLZVUVIJVGH-UHFFFAOYSA-N 0.000 claims description 14
- 230000003165 hydrotropic effect Effects 0.000 claims description 14
- 102000034287 fluorescent proteins Human genes 0.000 claims description 13
- 108091006047 fluorescent proteins Proteins 0.000 claims description 13
- 150000004696 coordination complex Chemical class 0.000 claims description 12
- 229920001223 polyethylene glycol Polymers 0.000 claims description 12
- 239000004202 carbamide Substances 0.000 claims description 10
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 claims description 10
- AGIJRRREJXSQJR-UHFFFAOYSA-N 2h-thiazine Chemical compound N1SC=CC=C1 AGIJRRREJXSQJR-UHFFFAOYSA-N 0.000 claims description 9
- YIBXWXOYFGZLRU-UHFFFAOYSA-N syringic aldehyde Natural products CC12CCC(C3(CCC(=O)C(C)(C)C3CC=3)C)C=3C1(C)CCC2C1COC(C)(C)C(O)C(O)C1 YIBXWXOYFGZLRU-UHFFFAOYSA-N 0.000 claims description 9
- CPELXLSAUQHCOX-UHFFFAOYSA-M Bromide Chemical compound [Br-] CPELXLSAUQHCOX-UHFFFAOYSA-M 0.000 claims description 8
- AUNGANRZJHBGPY-UHFFFAOYSA-N D-Lyxoflavin Natural products OCC(O)C(O)C(O)CN1C=2C=C(C)C(C)=CC=2N=C2C1=NC(=O)NC2=O AUNGANRZJHBGPY-UHFFFAOYSA-N 0.000 claims description 8
- 229950006238 nadide Drugs 0.000 claims description 8
- 229930027945 nicotinamide-adenine dinucleotide Natural products 0.000 claims description 8
- 235000019192 riboflavin Nutrition 0.000 claims description 8
- 239000002151 riboflavin Substances 0.000 claims description 8
- 229960002477 riboflavin Drugs 0.000 claims description 8
- LPHGQDQBBGAPDZ-UHFFFAOYSA-N Isocaffeine Natural products CN1C(=O)N(C)C(=O)C2=C1N(C)C=N2 LPHGQDQBBGAPDZ-UHFFFAOYSA-N 0.000 claims description 7
- 229960001948 caffeine Drugs 0.000 claims description 7
- VJEONQKOZGKCAK-UHFFFAOYSA-N caffeine Natural products CN1C(=O)N(C)C(=O)C2=C1C=CN2C VJEONQKOZGKCAK-UHFFFAOYSA-N 0.000 claims description 7
- 239000003752 hydrotrope Substances 0.000 claims description 7
- BOPGDPNILDQYTO-NNYOXOHSSA-N nicotinamide-adenine dinucleotide Chemical compound C1=CCC(C(=O)N)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]2[C@H]([C@@H](O)[C@@H](O2)N2C3=NC=NC(N)=C3N=C2)O)O1 BOPGDPNILDQYTO-NNYOXOHSSA-N 0.000 claims description 7
- 230000008569 process Effects 0.000 claims description 7
- 159000000000 sodium salts Chemical class 0.000 claims description 7
- QUCDWLYKDRVKMI-UHFFFAOYSA-M sodium;3,4-dimethylbenzenesulfonate Chemical compound [Na+].CC1=CC=C(S([O-])(=O)=O)C=C1C QUCDWLYKDRVKMI-UHFFFAOYSA-M 0.000 claims description 7
- NEMHIKRLROONTL-QMMMGPOBSA-N (2s)-2-azaniumyl-3-(4-azidophenyl)propanoate Chemical compound OC(=O)[C@@H](N)CC1=CC=C(N=[N+]=[N-])C=C1 NEMHIKRLROONTL-QMMMGPOBSA-N 0.000 claims description 6
- 102220628555 Hepatocyte cell adhesion molecule_E29C_mutation Human genes 0.000 claims description 6
- 239000002202 Polyethylene glycol Substances 0.000 claims description 6
- ZPWVASYFFYYZEW-UHFFFAOYSA-L dipotassium hydrogen phosphate Chemical compound [K+].[K+].OP([O-])([O-])=O ZPWVASYFFYYZEW-UHFFFAOYSA-L 0.000 claims description 6
- 229910000396 dipotassium phosphate Inorganic materials 0.000 claims description 6
- 235000019797 dipotassium phosphate Nutrition 0.000 claims description 6
- VWWQXMAJTJZDQX-UYBVJOGSSA-N flavin adenine dinucleotide Chemical compound C1=NC2=C(N)N=CN=C2N1[C@@H]([C@H](O)[C@@H]1O)O[C@@H]1CO[P@](O)(=O)O[P@@](O)(=O)OC[C@@H](O)[C@@H](O)[C@@H](O)CN1C2=NC(=O)NC(=O)C2=NC2=C1C=C(C)C(C)=C2 VWWQXMAJTJZDQX-UYBVJOGSSA-N 0.000 claims description 6
- 235000019162 flavin adenine dinucleotide Nutrition 0.000 claims description 6
- 239000011714 flavin adenine dinucleotide Substances 0.000 claims description 6
- 229940093632 flavin-adenine dinucleotide Drugs 0.000 claims description 6
- 150000002430 hydrocarbons Chemical group 0.000 claims description 6
- 229910003002 lithium salt Inorganic materials 0.000 claims description 6
- 159000000002 lithium salts Chemical class 0.000 claims description 6
- 159000000003 magnesium salts Chemical class 0.000 claims description 6
- 229910000402 monopotassium phosphate Inorganic materials 0.000 claims description 6
- 235000019796 monopotassium phosphate Nutrition 0.000 claims description 6
- FIKAKWIAUPDISJ-UHFFFAOYSA-L paraquat dichloride Chemical compound [Cl-].[Cl-].C1=C[N+](C)=CC=C1C1=CC=[N+](C)C=C1 FIKAKWIAUPDISJ-UHFFFAOYSA-L 0.000 claims description 6
- 150000002978 peroxides Chemical class 0.000 claims description 6
- PJNZPQUBCPKICU-UHFFFAOYSA-N phosphoric acid;potassium Chemical compound [K].OP(O)(O)=O PJNZPQUBCPKICU-UHFFFAOYSA-N 0.000 claims description 6
- 159000000001 potassium salts Chemical class 0.000 claims description 6
- 102200000769 rs193922748 Human genes 0.000 claims description 6
- 239000001509 sodium citrate Substances 0.000 claims description 6
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 claims description 6
- 125000001273 sulfonato group Chemical group [O-]S(*)(=O)=O 0.000 claims description 6
- 229910052720 vanadium Inorganic materials 0.000 claims description 6
- BAWFJGJZGIEFAR-NNYOXOHSSA-N NAD zwitterion Chemical compound NC(=O)C1=CC=C[N+]([C@H]2[C@@H]([C@H](O)[C@@H](COP([O-])(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 BAWFJGJZGIEFAR-NNYOXOHSSA-N 0.000 claims description 5
- 230000002194 synthesizing effect Effects 0.000 claims description 5
- GDOPTJXRTPNYNR-UHFFFAOYSA-N methyl-cyclopentane Natural products CC1CCCC1 GDOPTJXRTPNYNR-UHFFFAOYSA-N 0.000 claims description 4
- 125000004169 (C1-C6) alkyl group Chemical group 0.000 claims description 3
- OKONMFPEKSWGEU-UHFFFAOYSA-N 9,10-dioxoanthracene-2,7-disulfonic acid Chemical compound C1=C(S(O)(=O)=O)C=C2C(=O)C3=CC(S(=O)(=O)O)=CC=C3C(=O)C2=C1 OKONMFPEKSWGEU-UHFFFAOYSA-N 0.000 claims description 3
- QIRNURVBNXNPFN-UHFFFAOYSA-N CN(C)CC1=C(O)C(=C(C(=C1CN(C)C)O)CN(C)C)CN(C)C Chemical compound CN(C)CC1=C(O)C(=C(C(=C1CN(C)C)O)CN(C)C)CN(C)C QIRNURVBNXNPFN-UHFFFAOYSA-N 0.000 claims description 3
- VEXZGXHMUGYJMC-UHFFFAOYSA-M Chloride anion Chemical compound [Cl-] VEXZGXHMUGYJMC-UHFFFAOYSA-M 0.000 claims description 3
- 230000000903 blocking effect Effects 0.000 claims description 3
- SOCTUWSJJQCPFX-UHFFFAOYSA-N dichromate(2-) Chemical compound [O-][Cr](=O)(=O)O[Cr]([O-])(=O)=O SOCTUWSJJQCPFX-UHFFFAOYSA-N 0.000 claims description 3
- KTWOOEGAPBSYNW-UHFFFAOYSA-N ferrocene Chemical group [Fe+2].C=1C=C[CH-]C=1.C=1C=C[CH-]C=1 KTWOOEGAPBSYNW-UHFFFAOYSA-N 0.000 claims description 3
- 235000011083 sodium citrates Nutrition 0.000 claims description 3
- DGQOCLATAPFASR-UHFFFAOYSA-N tetrahydroxy-1,4-benzoquinone Chemical compound OC1=C(O)C(=O)C(O)=C(O)C1=O DGQOCLATAPFASR-UHFFFAOYSA-N 0.000 claims description 3
- YARHBRUWMYJLHY-UHFFFAOYSA-Q triazanium;iron(3+);hexacyanide Chemical compound [NH4+].[NH4+].[NH4+].[Fe+3].N#[C-].N#[C-].N#[C-].N#[C-].N#[C-].N#[C-] YARHBRUWMYJLHY-UHFFFAOYSA-Q 0.000 claims description 3
- RKEZHXGBECEYGM-UHFFFAOYSA-N cyclopenta-1,3-diene iron(2+) 5-propylcyclopenta-1,3-diene Chemical compound [Fe++].c1cc[cH-]c1.CCC[c-]1cccc1 RKEZHXGBECEYGM-UHFFFAOYSA-N 0.000 claims description 2
- 102200115452 rs137852659 Human genes 0.000 claims 13
- RBTBFTRPCNLSDE-UHFFFAOYSA-N 3,7-bis(dimethylamino)phenothiazin-5-ium Chemical group C1=CC(N(C)C)=CC2=[S+]C3=CC(N(C)C)=CC=C3N=C21 RBTBFTRPCNLSDE-UHFFFAOYSA-N 0.000 claims 11
- LEONUFNNVUYDNQ-UHFFFAOYSA-N vanadium atom Chemical compound [V] LEONUFNNVUYDNQ-UHFFFAOYSA-N 0.000 claims 2
- 229920000180 alkyd Polymers 0.000 claims 1
- MSSUFHMGCXOVBZ-UHFFFAOYSA-N anthraquinone-2,6-disulfonic acid Chemical compound OS(=O)(=O)C1=CC=C2C(=O)C3=CC(S(=O)(=O)O)=CC=C3C(=O)C2=C1 MSSUFHMGCXOVBZ-UHFFFAOYSA-N 0.000 claims 1
- KISFEBPWFCGRGN-UHFFFAOYSA-M sodium;2-(2,4-dichlorophenoxy)ethyl sulfate Chemical compound [Na+].[O-]S(=O)(=O)OCCOC1=CC=C(Cl)C=C1Cl KISFEBPWFCGRGN-UHFFFAOYSA-M 0.000 claims 1
- 108090000623 proteins and genes Proteins 0.000 abstract description 37
- 102000004169 proteins and genes Human genes 0.000 abstract description 32
- 150000003384 small molecules Chemical class 0.000 abstract description 16
- 125000000539 amino acid group Chemical group 0.000 abstract description 7
- 101150007302 dntt gene Proteins 0.000 abstract description 6
- 230000004048 modification Effects 0.000 abstract description 4
- 238000012986 modification Methods 0.000 abstract description 4
- 239000000243 solution Substances 0.000 description 132
- 125000005647 linker group Chemical group 0.000 description 99
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 91
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 71
- 125000003275 alpha amino acid group Chemical group 0.000 description 68
- XEKOWRVHYACXOJ-UHFFFAOYSA-N Ethyl acetate Chemical compound CCOC(C)=O XEKOWRVHYACXOJ-UHFFFAOYSA-N 0.000 description 57
- CXKWCBBOMKCUKX-UHFFFAOYSA-M methylene blue Chemical group [Cl-].C1=CC(N(C)C)=CC2=[S+]C3=CC(N(C)C)=CC=C3N=C21 CXKWCBBOMKCUKX-UHFFFAOYSA-M 0.000 description 52
- 239000000203 mixture Substances 0.000 description 50
- 102000004190 Enzymes Human genes 0.000 description 48
- 108090000790 Enzymes Proteins 0.000 description 48
- 238000006243 chemical reaction Methods 0.000 description 39
- 239000000047 product Substances 0.000 description 39
- SECXISVLQFMRJM-UHFFFAOYSA-N N-Methylpyrrolidone Chemical compound CN1CCCC1=O SECXISVLQFMRJM-UHFFFAOYSA-N 0.000 description 36
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 30
- 235000018102 proteins Nutrition 0.000 description 30
- 102220363249 c.40C>A Human genes 0.000 description 28
- 150000001875 compounds Chemical class 0.000 description 26
- 239000011541 reaction mixture Substances 0.000 description 25
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 24
- 150000001540 azides Chemical class 0.000 description 22
- 125000000956 methoxy group Chemical group [H]C([H])([H])O* 0.000 description 22
- 108020004414 DNA Proteins 0.000 description 21
- BVKZGUZCCUSVTD-UHFFFAOYSA-L Carbonate Chemical compound [O-]C([O-])=O BVKZGUZCCUSVTD-UHFFFAOYSA-L 0.000 description 17
- 238000009472 formulation Methods 0.000 description 17
- 239000000463 material Substances 0.000 description 17
- 239000011550 stock solution Substances 0.000 description 17
- PEEHTFAAVSWFBL-UHFFFAOYSA-N Maleimide Chemical compound O=C1NC(=O)C=C1 PEEHTFAAVSWFBL-UHFFFAOYSA-N 0.000 description 16
- VKIGAWAEXPTIOL-UHFFFAOYSA-N 2-hydroxyhexanenitrile Chemical compound CCCCC(O)C#N VKIGAWAEXPTIOL-UHFFFAOYSA-N 0.000 description 15
- QTWZICCBKBYHDM-UHFFFAOYSA-N leucomethylene blue Chemical compound C1=C(N(C)C)C=C2SC3=CC(N(C)C)=CC=C3NC2=C1 QTWZICCBKBYHDM-UHFFFAOYSA-N 0.000 description 14
- 238000004895 liquid chromatography mass spectrometry Methods 0.000 description 14
- 239000012267 brine Substances 0.000 description 13
- VLKZOEOYAKHREP-UHFFFAOYSA-N n-Hexane Chemical class CCCCCC VLKZOEOYAKHREP-UHFFFAOYSA-N 0.000 description 13
- HPALAKNZSZLMCH-UHFFFAOYSA-M sodium;chloride;hydrate Chemical compound O.[Na+].[Cl-] HPALAKNZSZLMCH-UHFFFAOYSA-M 0.000 description 13
- 239000000562 conjugate Substances 0.000 description 12
- 238000004108 freeze drying Methods 0.000 description 12
- 239000000543 intermediate Substances 0.000 description 12
- 239000002904 solvent Substances 0.000 description 12
- 235000001014 amino acid Nutrition 0.000 description 11
- 229940024606 amino acid Drugs 0.000 description 11
- 230000008033 biological extinction Effects 0.000 description 11
- ZMXDDKWLCZADIW-UHFFFAOYSA-N dimethylformamide Substances CN(C)C=O ZMXDDKWLCZADIW-UHFFFAOYSA-N 0.000 description 11
- 150000002148 esters Chemical class 0.000 description 11
- 238000000746 purification Methods 0.000 description 11
- PXIPVTKHYLBLMZ-UHFFFAOYSA-N Sodium azide Chemical compound [Na+].[N-]=[N+]=[N-] PXIPVTKHYLBLMZ-UHFFFAOYSA-N 0.000 description 10
- WYURNTSHIVDZCO-UHFFFAOYSA-N Tetrahydrofuran Chemical compound C1CCOC1 WYURNTSHIVDZCO-UHFFFAOYSA-N 0.000 description 10
- 150000001345 alkine derivatives Chemical class 0.000 description 10
- 230000000694 effects Effects 0.000 description 10
- 239000000499 gel Substances 0.000 description 10
- 238000004128 high performance liquid chromatography Methods 0.000 description 10
- 125000001570 methylene group Chemical group [H]C([H])([*:1])[*:2] 0.000 description 10
- LWIHDJKSTIGBAC-UHFFFAOYSA-K tripotassium phosphate Chemical compound [K+].[K+].[K+].[O-]P([O-])([O-])=O LWIHDJKSTIGBAC-UHFFFAOYSA-K 0.000 description 10
- VAKXPQHQQNOUEZ-UHFFFAOYSA-N 3-[4-[[bis[[1-(3-hydroxypropyl)triazol-4-yl]methyl]amino]methyl]triazol-1-yl]propan-1-ol Chemical compound N1=NN(CCCO)C=C1CN(CC=1N=NN(CCCO)C=1)CC1=CN(CCCO)N=N1 VAKXPQHQQNOUEZ-UHFFFAOYSA-N 0.000 description 9
- QIGBRXMKCJKVMJ-UHFFFAOYSA-N Hydroquinone Chemical compound OC1=CC=C(O)C=C1 QIGBRXMKCJKVMJ-UHFFFAOYSA-N 0.000 description 9
- 108091034117 Oligonucleotide Proteins 0.000 description 9
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 9
- DFPAKSUCGFBDDF-ZQBYOMGUSA-N [14c]-nicotinamide Chemical compound N[14C](=O)C1=CC=CN=C1 DFPAKSUCGFBDDF-ZQBYOMGUSA-N 0.000 description 9
- 239000003638 chemical reducing agent Substances 0.000 description 9
- 230000005257 nucleotidylation Effects 0.000 description 9
- 239000007858 starting material Substances 0.000 description 9
- 125000002264 triphosphate group Chemical class [H]OP(=O)(O[H])OP(=O)(O[H])OP(=O)(O[H])O* 0.000 description 9
- VHYFNPMBLIVWCW-UHFFFAOYSA-N 4-Dimethylaminopyridine Chemical compound CN(C)C1=CC=NC=C1 VHYFNPMBLIVWCW-UHFFFAOYSA-N 0.000 description 8
- KXDHJXZQYSOELW-UHFFFAOYSA-M Carbamate Chemical compound NC([O-])=O KXDHJXZQYSOELW-UHFFFAOYSA-M 0.000 description 8
- 125000002344 aminooxy group Chemical group [H]N([H])O[*] 0.000 description 8
- 238000002330 electrospray ionisation mass spectrometry Methods 0.000 description 8
- 230000003595 spectral effect Effects 0.000 description 8
- BTBUEUYNUDRHOZ-UHFFFAOYSA-N Borate Chemical compound [O-]B([O-])[O-] BTBUEUYNUDRHOZ-UHFFFAOYSA-N 0.000 description 7
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 7
- 238000004587 chromatography analysis Methods 0.000 description 7
- RGWHQCVHVJXOKC-SHYZEUOFSA-N dCTP Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](CO[P@](O)(=O)O[P@](O)(=O)OP(O)(O)=O)[C@@H](O)C1 RGWHQCVHVJXOKC-SHYZEUOFSA-N 0.000 description 7
- 238000011068 loading method Methods 0.000 description 7
- 239000012044 organic layer Substances 0.000 description 7
- JQWHASGSAFIOCM-UHFFFAOYSA-M sodium periodate Chemical compound [Na+].[O-]I(=O)(=O)=O JQWHASGSAFIOCM-UHFFFAOYSA-M 0.000 description 7
- 239000000126 substance Substances 0.000 description 7
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 7
- 108091028043 Nucleic acid sequence Proteins 0.000 description 6
- 238000013459 approach Methods 0.000 description 6
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 6
- HAAZLUGHYHWQIW-KVQBGUIXSA-N dGTP Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 HAAZLUGHYHWQIW-KVQBGUIXSA-N 0.000 description 6
- 238000013461 design Methods 0.000 description 6
- 238000010790 dilution Methods 0.000 description 6
- 239000012895 dilution Substances 0.000 description 6
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 6
- GETQZCLCWQTVFV-UHFFFAOYSA-N trimethylamine Chemical compound CN(C)C GETQZCLCWQTVFV-UHFFFAOYSA-N 0.000 description 6
- 102000053602 DNA Human genes 0.000 description 5
- 238000005481 NMR spectroscopy Methods 0.000 description 5
- 125000000637 arginyl group Chemical group N[C@@H](CCCNC(N)=N)C(=O)* 0.000 description 5
- LKFRZUAJFOSMJA-UHFFFAOYSA-N cyclohexa-2,5-diene-1,4-dione;propanoic acid Chemical class CCC(O)=O.O=C1C=CC(=O)C=C1 LKFRZUAJFOSMJA-UHFFFAOYSA-N 0.000 description 5
- NHVNXKFIZYSCEB-XLPZGREQSA-N dTTP Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)C1 NHVNXKFIZYSCEB-XLPZGREQSA-N 0.000 description 5
- 238000011534 incubation Methods 0.000 description 5
- 229910000160 potassium phosphate Inorganic materials 0.000 description 5
- 235000011009 potassium phosphates Nutrition 0.000 description 5
- 238000000425 proton nuclear magnetic resonance spectrum Methods 0.000 description 5
- 238000007363 ring formation reaction Methods 0.000 description 5
- 238000012163 sequencing technique Methods 0.000 description 5
- 239000012536 storage buffer Substances 0.000 description 5
- 238000006467 substitution reaction Methods 0.000 description 5
- YLQBMQCUIZJEEH-UHFFFAOYSA-N tetrahydrofuran Natural products C=1C=COC=1 YLQBMQCUIZJEEH-UHFFFAOYSA-N 0.000 description 5
- 150000003573 thiols Chemical class 0.000 description 5
- IUSARDYWEPUTPN-OZBXUNDUSA-N (2r)-n-[(2s,3r)-4-[[(4s)-6-(2,2-dimethylpropyl)spiro[3,4-dihydropyrano[2,3-b]pyridine-2,1'-cyclobutane]-4-yl]amino]-3-hydroxy-1-[3-(1,3-thiazol-2-yl)phenyl]butan-2-yl]-2-methoxypropanamide Chemical compound C([C@H](NC(=O)[C@@H](C)OC)[C@H](O)CN[C@@H]1C2=CC(CC(C)(C)C)=CN=C2OC2(CCC2)C1)C(C=1)=CC=CC=1C1=NC=CS1 IUSARDYWEPUTPN-OZBXUNDUSA-N 0.000 description 4
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 4
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical group [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 4
- BWGNESOTFCXPMA-UHFFFAOYSA-N Dihydrogen disulfide Chemical compound SS BWGNESOTFCXPMA-UHFFFAOYSA-N 0.000 description 4
- JGFZNNIVVJXRND-UHFFFAOYSA-N N,N-Diisopropylethylamine (DIPEA) Chemical compound CCN(C(C)C)C(C)C JGFZNNIVVJXRND-UHFFFAOYSA-N 0.000 description 4
- 229910021607 Silver chloride Inorganic materials 0.000 description 4
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 4
- 150000001412 amines Chemical class 0.000 description 4
- 230000008901 benefit Effects 0.000 description 4
- 150000004657 carbamic acid derivatives Chemical class 0.000 description 4
- 210000004027 cell Anatomy 0.000 description 4
- 239000013592 cell lysate Substances 0.000 description 4
- 239000003153 chemical reaction reagent Substances 0.000 description 4
- 229940125807 compound 37 Drugs 0.000 description 4
- 239000000470 constituent Substances 0.000 description 4
- 238000005859 coupling reaction Methods 0.000 description 4
- 238000002484 cyclic voltammetry Methods 0.000 description 4
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 4
- 235000018417 cysteine Nutrition 0.000 description 4
- 238000007865 diluting Methods 0.000 description 4
- XBDQKXXYIPTUBI-UHFFFAOYSA-N dimethylselenoniopropionate Natural products CCC(O)=O XBDQKXXYIPTUBI-UHFFFAOYSA-N 0.000 description 4
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 4
- 238000003487 electrochemical reaction Methods 0.000 description 4
- 238000001819 mass spectrum Methods 0.000 description 4
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 4
- 239000000178 monomer Substances 0.000 description 4
- 239000002777 nucleoside Substances 0.000 description 4
- 150000002923 oximes Chemical class 0.000 description 4
- 239000002953 phosphate buffered saline Substances 0.000 description 4
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 4
- BASFCYQUMIYNBI-UHFFFAOYSA-N platinum Chemical compound [Pt] BASFCYQUMIYNBI-UHFFFAOYSA-N 0.000 description 4
- 238000002360 preparation method Methods 0.000 description 4
- JKANAVGODYYCQF-UHFFFAOYSA-N prop-2-yn-1-amine Chemical group NCC#C JKANAVGODYYCQF-UHFFFAOYSA-N 0.000 description 4
- UBQKCCHYAOITMY-UHFFFAOYSA-N pyridin-2-ol Chemical compound OC1=CC=CC=N1 UBQKCCHYAOITMY-UHFFFAOYSA-N 0.000 description 4
- LEEJQTWXRMXYIB-UHFFFAOYSA-N quinone 7 Chemical compound C12=CC=CC(C3=O)=C2C2=C4C3=CC=CC4=C3C4=C2C2=C1C=CC=C2C(=O)C4=CC=C3 LEEJQTWXRMXYIB-UHFFFAOYSA-N 0.000 description 4
- 238000006894 reductive elimination reaction Methods 0.000 description 4
- 238000004366 reverse phase liquid chromatography Methods 0.000 description 4
- 150000003839 salts Chemical group 0.000 description 4
- HKZLPVFGJNLROG-UHFFFAOYSA-M silver monochloride Chemical compound [Cl-].[Ag+] HKZLPVFGJNLROG-UHFFFAOYSA-M 0.000 description 4
- GPPXJZIENCGNKB-UHFFFAOYSA-N vanadium Chemical compound [V]#[V] GPPXJZIENCGNKB-UHFFFAOYSA-N 0.000 description 4
- ABJSOROVZZKJGI-OCYUSGCXSA-N (1r,2r,4r)-2-(4-bromophenyl)-n-[(4-chlorophenyl)-(2-fluoropyridin-4-yl)methyl]-4-morpholin-4-ylcyclohexane-1-carboxamide Chemical compound C1=NC(F)=CC(C(NC(=O)[C@H]2[C@@H](C[C@@H](CC2)N2CCOCC2)C=2C=CC(Br)=CC=2)C=2C=CC(Cl)=CC=2)=C1 ABJSOROVZZKJGI-OCYUSGCXSA-N 0.000 description 3
- SXGZJKUKBWWHRA-UHFFFAOYSA-N 2-(N-morpholiniumyl)ethanesulfonate Chemical compound [O-]S(=O)(=O)CC[NH+]1CCOCC1 SXGZJKUKBWWHRA-UHFFFAOYSA-N 0.000 description 3
- 125000001494 2-propynyl group Chemical group [H]C#CC([H])([H])* 0.000 description 3
- OYBOVXXFJYJYPC-UHFFFAOYSA-N 3-azidopropan-1-amine Chemical compound NCCCN=[N+]=[N-] OYBOVXXFJYJYPC-UHFFFAOYSA-N 0.000 description 3
- VFSWCDPTQADMSM-UHFFFAOYSA-N 4-(2-bromoethoxy)-3-methoxybenzaldehyde Chemical compound COC1=CC(C=O)=CC=C1OCCBr VFSWCDPTQADMSM-UHFFFAOYSA-N 0.000 description 3
- YMWUJEATGCHHMB-UHFFFAOYSA-N Dichloromethane Chemical compound ClCCl YMWUJEATGCHHMB-UHFFFAOYSA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- PCLIMKBDDGJMGD-UHFFFAOYSA-N N-bromosuccinimide Chemical compound BrN1C(=O)CCC1=O PCLIMKBDDGJMGD-UHFFFAOYSA-N 0.000 description 3
- WUGQZFFCHPXWKQ-UHFFFAOYSA-N Propanolamine Chemical compound NCCCO WUGQZFFCHPXWKQ-UHFFFAOYSA-N 0.000 description 3
- 108020004682 Single-Stranded DNA Proteins 0.000 description 3
- PMZURENOXWZQFD-UHFFFAOYSA-L Sodium Sulfate Chemical compound [Na+].[Na+].[O-]S([O-])(=O)=O PMZURENOXWZQFD-UHFFFAOYSA-L 0.000 description 3
- 230000004913 activation Effects 0.000 description 3
- 150000001299 aldehydes Chemical class 0.000 description 3
- 238000003556 assay Methods 0.000 description 3
- KGNDCEVUMONOKF-UGPLYTSKSA-N benzyl n-[(2r)-1-[(2s,4r)-2-[[(2s)-6-amino-1-(1,3-benzoxazol-2-yl)-1,1-dihydroxyhexan-2-yl]carbamoyl]-4-[(4-methylphenyl)methoxy]pyrrolidin-1-yl]-1-oxo-4-phenylbutan-2-yl]carbamate Chemical compound C1=CC(C)=CC=C1CO[C@H]1CN(C(=O)[C@@H](CCC=2C=CC=CC=2)NC(=O)OCC=2C=CC=CC=2)[C@H](C(=O)N[C@@H](CCCCN)C(O)(O)C=2OC3=CC=CC=C3N=2)C1 KGNDCEVUMONOKF-UGPLYTSKSA-N 0.000 description 3
- PFYXSUNOLOJMDX-UHFFFAOYSA-N bis(2,5-dioxopyrrolidin-1-yl) carbonate Chemical compound O=C1CCC(=O)N1OC(=O)ON1C(=O)CCC1=O PFYXSUNOLOJMDX-UHFFFAOYSA-N 0.000 description 3
- 229910052799 carbon Inorganic materials 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 238000012650 click reaction Methods 0.000 description 3
- 229940125833 compound 23 Drugs 0.000 description 3
- 238000010168 coupling process Methods 0.000 description 3
- 229940104302 cytosine Drugs 0.000 description 3
- SUYVUBYJARFZHO-RRKCRQDMSA-N dATP Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-RRKCRQDMSA-N 0.000 description 3
- SUYVUBYJARFZHO-UHFFFAOYSA-N dATP Natural products C1=NC=2C(N)=NC=NC=2N1C1CC(O)C(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-UHFFFAOYSA-N 0.000 description 3
- 238000011033 desalting Methods 0.000 description 3
- 238000006073 displacement reaction Methods 0.000 description 3
- 238000004090 dissolution Methods 0.000 description 3
- 238000003818 flash chromatography Methods 0.000 description 3
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 3
- 125000004435 hydrogen atom Chemical group [H]* 0.000 description 3
- 238000000338 in vitro Methods 0.000 description 3
- 239000010410 layer Substances 0.000 description 3
- 238000004949 mass spectrometry Methods 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 238000001668 nucleic acid synthesis Methods 0.000 description 3
- 238000000926 separation method Methods 0.000 description 3
- 238000001542 size-exclusion chromatography Methods 0.000 description 3
- 239000011734 sodium Substances 0.000 description 3
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 3
- 150000003871 sulfonates Chemical class 0.000 description 3
- 229940113082 thymine Drugs 0.000 description 3
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- AXKGIPZJYUNAIW-UHFFFAOYSA-N (4-aminophenyl)methanol Chemical compound NC1=CC=C(CO)C=C1 AXKGIPZJYUNAIW-UHFFFAOYSA-N 0.000 description 2
- 125000001399 1,2,3-triazolyl group Chemical group N1N=NC(=C1)* 0.000 description 2
- PAAZPARNPHGIKF-UHFFFAOYSA-N 1,2-dibromoethane Chemical compound BrCCBr PAAZPARNPHGIKF-UHFFFAOYSA-N 0.000 description 2
- HTSGKJQDMSTCGS-UHFFFAOYSA-N 1,4-bis(4-chlorophenyl)-2-(4-methylphenyl)sulfonylbutane-1,4-dione Chemical compound C1=CC(C)=CC=C1S(=O)(=O)C(C(=O)C=1C=CC(Cl)=CC=1)CC(=O)C1=CC=C(Cl)C=C1 HTSGKJQDMSTCGS-UHFFFAOYSA-N 0.000 description 2
- NCMVOABPESMRCP-SHYZEUOFSA-N 2'-deoxycytosine 5'-monophosphate Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)C1 NCMVOABPESMRCP-SHYZEUOFSA-N 0.000 description 2
- LTFMZDNNPPEQNG-KVQBGUIXSA-N 2'-deoxyguanosine 5'-monophosphate Chemical compound C1=2NC(N)=NC(=O)C=2N=CN1[C@H]1C[C@H](O)[C@@H](COP(O)(O)=O)O1 LTFMZDNNPPEQNG-KVQBGUIXSA-N 0.000 description 2
- AXAVXPMQTGXXJZ-UHFFFAOYSA-N 2-aminoacetic acid;2-amino-2-(hydroxymethyl)propane-1,3-diol Chemical compound NCC(O)=O.OCC(N)(CO)CO AXAVXPMQTGXXJZ-UHFFFAOYSA-N 0.000 description 2
- AJDDQLJFWIIBJF-UHFFFAOYSA-N 4-(2-azidoethoxy)-3-methoxybenzaldehyde Chemical compound N(=[N+]=[N-])CCOC1=C(C=C(C=O)C=C1)OC AJDDQLJFWIIBJF-UHFFFAOYSA-N 0.000 description 2
- MSSUFHMGCXOVBZ-UHFFFAOYSA-L 9,10-dioxoanthracene-2,6-disulfonate Chemical compound [O-]S(=O)(=O)C1=CC=C2C(=O)C3=CC(S(=O)(=O)[O-])=CC=C3C(=O)C2=C1 MSSUFHMGCXOVBZ-UHFFFAOYSA-L 0.000 description 2
- 229930024421 Adenine Natural products 0.000 description 2
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 2
- ZKHQWZAMYRWXGA-UHFFFAOYSA-N Adenosine triphosphate Natural products C1=NC=2C(N)=NC=NC=2N1C1OC(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)C(O)C1O ZKHQWZAMYRWXGA-UHFFFAOYSA-N 0.000 description 2
- 241000242764 Aequorea victoria Species 0.000 description 2
- 108091093088 Amplicon Proteins 0.000 description 2
- IAZDPXIOMUYVGZ-WFGJKAKNSA-N Dimethyl sulfoxide Chemical compound [2H]C([2H])([2H])S(=O)C([2H])([2H])[2H] IAZDPXIOMUYVGZ-WFGJKAKNSA-N 0.000 description 2
- QUSNBJAOOMFDIB-UHFFFAOYSA-N Ethylamine Chemical compound CCN QUSNBJAOOMFDIB-UHFFFAOYSA-N 0.000 description 2
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 2
- WHXSMMKQMYFTQS-UHFFFAOYSA-N Lithium Chemical compound [Li] WHXSMMKQMYFTQS-UHFFFAOYSA-N 0.000 description 2
- FYYHWMGAXLPEAU-UHFFFAOYSA-N Magnesium Chemical compound [Mg] FYYHWMGAXLPEAU-UHFFFAOYSA-N 0.000 description 2
- CSNNHWWHGAXBCP-UHFFFAOYSA-L Magnesium sulfate Chemical compound [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 2
- 241001529936 Murinae Species 0.000 description 2
- 229910019142 PO4 Inorganic materials 0.000 description 2
- ZLMJMSJWJFRBEC-UHFFFAOYSA-N Potassium Chemical compound [K] ZLMJMSJWJFRBEC-UHFFFAOYSA-N 0.000 description 2
- 108091028664 Ribonucleotide Proteins 0.000 description 2
- PZBFGYYEXUXCOF-UHFFFAOYSA-N TCEP Chemical compound OC(=O)CCP(CCC(O)=O)CCC(O)=O PZBFGYYEXUXCOF-UHFFFAOYSA-N 0.000 description 2
- QYTDEUPAUMOIOP-UHFFFAOYSA-N TEMPO Chemical group CC1(C)CCCC(C)(C)N1[O] QYTDEUPAUMOIOP-UHFFFAOYSA-N 0.000 description 2
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical group O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 2
- PSLUFJFHTBIXMW-WYEYVKMPSA-N [(3r,4ar,5s,6s,6as,10s,10ar,10bs)-3-ethenyl-10,10b-dihydroxy-3,4a,7,7,10a-pentamethyl-1-oxo-6-(2-pyridin-2-ylethylcarbamoyloxy)-5,6,6a,8,9,10-hexahydro-2h-benzo[f]chromen-5-yl] acetate Chemical compound O([C@@H]1[C@@H]([C@]2(O[C@](C)(CC(=O)[C@]2(O)[C@@]2(C)[C@@H](O)CCC(C)(C)[C@@H]21)C=C)C)OC(=O)C)C(=O)NCCC1=CC=CC=N1 PSLUFJFHTBIXMW-WYEYVKMPSA-N 0.000 description 2
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 2
- 238000010521 absorption reaction Methods 0.000 description 2
- 238000000862 absorption spectrum Methods 0.000 description 2
- 230000002378 acidificating effect Effects 0.000 description 2
- 229960000643 adenine Drugs 0.000 description 2
- 150000001408 amides Chemical class 0.000 description 2
- 238000007098 aminolysis reaction Methods 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 230000000844 anti-bacterial effect Effects 0.000 description 2
- 238000000429 assembly Methods 0.000 description 2
- 230000000712 assembly Effects 0.000 description 2
- 125000004429 atom Chemical group 0.000 description 2
- 238000010504 bond cleavage reaction Methods 0.000 description 2
- 230000003139 buffering effect Effects 0.000 description 2
- 230000003197 catalytic effect Effects 0.000 description 2
- 238000012512 characterization method Methods 0.000 description 2
- 230000000295 complement effect Effects 0.000 description 2
- 229940125904 compound 1 Drugs 0.000 description 2
- 229940125898 compound 5 Drugs 0.000 description 2
- 230000008878 coupling Effects 0.000 description 2
- DAEAPNUQQAICNR-RRKCRQDMSA-K dADP(3-) Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP([O-])(=O)OP([O-])([O-])=O)O1 DAEAPNUQQAICNR-RRKCRQDMSA-K 0.000 description 2
- FTDHDKPUHBLBTL-SHYZEUOFSA-K dCDP(3-) Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)C1 FTDHDKPUHBLBTL-SHYZEUOFSA-K 0.000 description 2
- CIKGWCTVFSRMJU-KVQBGUIXSA-N dGDP Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(O)=O)O1 CIKGWCTVFSRMJU-KVQBGUIXSA-N 0.000 description 2
- UJLXYODCHAELLY-XLPZGREQSA-N dTDP Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(O)=O)[C@@H](O)C1 UJLXYODCHAELLY-XLPZGREQSA-N 0.000 description 2
- GYOZYWVXFNDGLU-XLPZGREQSA-N dTMP Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)C1 GYOZYWVXFNDGLU-XLPZGREQSA-N 0.000 description 2
- 230000002939 deleterious effect Effects 0.000 description 2
- 238000004925 denaturation Methods 0.000 description 2
- 230000036425 denaturation Effects 0.000 description 2
- KHWCHTKSEGGWEX-UHFFFAOYSA-N deoxyadenylic acid Natural products C1=NC=2C(N)=NC=NC=2N1C1CC(O)C(COP(O)(O)=O)O1 KHWCHTKSEGGWEX-UHFFFAOYSA-N 0.000 description 2
- LTFMZDNNPPEQNG-UHFFFAOYSA-N deoxyguanylic acid Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1CC(O)C(COP(O)(O)=O)O1 LTFMZDNNPPEQNG-UHFFFAOYSA-N 0.000 description 2
- 230000005595 deprotonation Effects 0.000 description 2
- 238000010537 deprotonation reaction Methods 0.000 description 2
- 238000009826 distribution Methods 0.000 description 2
- 150000002019 disulfides Chemical class 0.000 description 2
- VHJLVAABSRFDPM-QWWZWVQMSA-N dithiothreitol Chemical compound SC[C@@H](O)[C@H](O)CS VHJLVAABSRFDPM-QWWZWVQMSA-N 0.000 description 2
- 238000001035 drying Methods 0.000 description 2
- 239000003480 eluent Substances 0.000 description 2
- 238000010828 elution Methods 0.000 description 2
- 238000006911 enzymatic reaction Methods 0.000 description 2
- QRMKTNANRJCRCY-UHFFFAOYSA-N ethylammonium acetate Chemical compound CC[NH3+].CC([O-])=O QRMKTNANRJCRCY-UHFFFAOYSA-N 0.000 description 2
- 238000001914 filtration Methods 0.000 description 2
- 239000011888 foil Substances 0.000 description 2
- 238000001502 gel electrophoresis Methods 0.000 description 2
- RWSXRVCMGQZWBV-WDSKDSINSA-N glutathione Chemical compound OC(=O)[C@@H](N)CCC(=O)N[C@@H](CS)C(=O)NCC(O)=O RWSXRVCMGQZWBV-WDSKDSINSA-N 0.000 description 2
- 125000003827 glycol group Chemical group 0.000 description 2
- 150000007857 hydrazones Chemical group 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 239000012535 impurity Substances 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 230000006698 induction Effects 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 238000004255 ion exchange chromatography Methods 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 239000003446 ligand Substances 0.000 description 2
- 229910052744 lithium Inorganic materials 0.000 description 2
- 239000011777 magnesium Substances 0.000 description 2
- 229910052749 magnesium Inorganic materials 0.000 description 2
- 229910021645 metal ion Inorganic materials 0.000 description 2
- 229930182817 methionine Natural products 0.000 description 2
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 2
- 230000005012 migration Effects 0.000 description 2
- 238000013508 migration Methods 0.000 description 2
- 150000003833 nucleoside derivatives Chemical class 0.000 description 2
- 235000021317 phosphate Nutrition 0.000 description 2
- 150000008300 phosphoramidites Chemical class 0.000 description 2
- 229910052697 platinum Inorganic materials 0.000 description 2
- 239000011591 potassium Substances 0.000 description 2
- 229910052700 potassium Inorganic materials 0.000 description 2
- BWHMMNNQKKPAPP-UHFFFAOYSA-L potassium carbonate Chemical compound [K+].[K+].[O-]C([O-])=O BWHMMNNQKKPAPP-UHFFFAOYSA-L 0.000 description 2
- 239000002243 precursor Substances 0.000 description 2
- 239000013615 primer Substances 0.000 description 2
- 235000019260 propionic acid Nutrition 0.000 description 2
- 125000006239 protecting group Chemical group 0.000 description 2
- 230000005588 protonation Effects 0.000 description 2
- 150000004053 quinones Chemical class 0.000 description 2
- XKMLYUALXHKNFT-UHFFFAOYSA-N rGTP Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1OC(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)C(O)C1O XKMLYUALXHKNFT-UHFFFAOYSA-N 0.000 description 2
- 230000035484 reaction time Effects 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 239000002336 ribonucleotide Substances 0.000 description 2
- 125000002652 ribonucleotide group Chemical group 0.000 description 2
- 229910052709 silver Inorganic materials 0.000 description 2
- 239000004332 silver Substances 0.000 description 2
- 229910052708 sodium Inorganic materials 0.000 description 2
- 235000010378 sodium ascorbate Nutrition 0.000 description 2
- PPASLZSBLFJQEF-RKJRWTFHSA-M sodium ascorbate Substances [Na+].OC[C@@H](O)[C@H]1OC(=O)C(O)=C1[O-] PPASLZSBLFJQEF-RKJRWTFHSA-M 0.000 description 2
- 229960005055 sodium ascorbate Drugs 0.000 description 2
- PPASLZSBLFJQEF-RXSVEWSESA-M sodium-L-ascorbate Chemical compound [Na+].OC[C@H](O)[C@H]1OC(=O)C(O)=C1[O-] PPASLZSBLFJQEF-RXSVEWSESA-M 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 238000004611 spectroscopical analysis Methods 0.000 description 2
- KZNICNPSHKQLFF-UHFFFAOYSA-N succinimide Chemical compound O=C1CCC(=O)N1 KZNICNPSHKQLFF-UHFFFAOYSA-N 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 238000004809 thin layer chromatography Methods 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 150000003852 triazoles Chemical group 0.000 description 2
- 238000007039 two-step reaction Methods 0.000 description 2
- 229940035893 uracil Drugs 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- 238000012800 visualization Methods 0.000 description 2
- 239000003643 water by type Substances 0.000 description 2
- ASGMFNBUXDJWJJ-JLCFBVMHSA-N (1R,3R)-3-[[3-bromo-1-[4-(5-methyl-1,3,4-thiadiazol-2-yl)phenyl]pyrazolo[3,4-d]pyrimidin-6-yl]amino]-N,1-dimethylcyclopentane-1-carboxamide Chemical compound BrC1=NN(C2=NC(=NC=C21)N[C@H]1C[C@@](CC1)(C(=O)NC)C)C1=CC=C(C=C1)C=1SC(=NN=1)C ASGMFNBUXDJWJJ-JLCFBVMHSA-N 0.000 description 1
- ZZBOTLREHORFCK-UHFFFAOYSA-N (2,5-dioxopyrrolidin-1-yl) 2-[(2-methylpropan-2-yl)oxycarbonylamino]oxyacetate Chemical compound CC(C)(C)OC(=O)NOCC(=O)ON1C(=O)CCC1=O ZZBOTLREHORFCK-UHFFFAOYSA-N 0.000 description 1
- GHYOCDFICYLMRF-UTIIJYGPSA-N (2S,3R)-N-[(2S)-3-(cyclopenten-1-yl)-1-[(2R)-2-methyloxiran-2-yl]-1-oxopropan-2-yl]-3-hydroxy-3-(4-methoxyphenyl)-2-[[(2S)-2-[(2-morpholin-4-ylacetyl)amino]propanoyl]amino]propanamide Chemical compound C1(=CCCC1)C[C@@H](C(=O)[C@@]1(OC1)C)NC([C@H]([C@@H](C1=CC=C(C=C1)OC)O)NC([C@H](C)NC(CN1CCOCC1)=O)=O)=O GHYOCDFICYLMRF-UTIIJYGPSA-N 0.000 description 1
- QFLWZFQWSBQYPS-AWRAUJHKSA-N (3S)-3-[[(2S)-2-[[(2S)-2-[5-[(3aS,6aR)-2-oxo-1,3,3a,4,6,6a-hexahydrothieno[3,4-d]imidazol-4-yl]pentanoylamino]-3-methylbutanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-4-[1-bis(4-chlorophenoxy)phosphorylbutylamino]-4-oxobutanoic acid Chemical compound CCCC(NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H](NC(=O)CCCCC1SC[C@@H]2NC(=O)N[C@H]12)C(C)C)P(=O)(Oc1ccc(Cl)cc1)Oc1ccc(Cl)cc1 QFLWZFQWSBQYPS-AWRAUJHKSA-N 0.000 description 1
- KQZLRWGGWXJPOS-NLFPWZOASA-N 1-[(1R)-1-(2,4-dichlorophenyl)ethyl]-6-[(4S,5R)-4-[(2S)-2-(hydroxymethyl)pyrrolidin-1-yl]-5-methylcyclohexen-1-yl]pyrazolo[3,4-b]pyrazine-3-carbonitrile Chemical compound ClC1=C(C=CC(=C1)Cl)[C@@H](C)N1N=C(C=2C1=NC(=CN=2)C1=CC[C@@H]([C@@H](C1)C)N1[C@@H](CCC1)CO)C#N KQZLRWGGWXJPOS-NLFPWZOASA-N 0.000 description 1
- UNILWMWFPHPYOR-KXEYIPSPSA-M 1-[6-[2-[3-[3-[3-[2-[2-[3-[[2-[2-[[(2r)-1-[[2-[[(2r)-1-[3-[2-[2-[3-[[2-(2-amino-2-oxoethoxy)acetyl]amino]propoxy]ethoxy]ethoxy]propylamino]-3-hydroxy-1-oxopropan-2-yl]amino]-2-oxoethyl]amino]-3-[(2r)-2,3-di(hexadecanoyloxy)propyl]sulfanyl-1-oxopropan-2-yl Chemical compound O=C1C(SCCC(=O)NCCCOCCOCCOCCCNC(=O)COCC(=O)N[C@@H](CSC[C@@H](COC(=O)CCCCCCCCCCCCCCC)OC(=O)CCCCCCCCCCCCCCC)C(=O)NCC(=O)N[C@H](CO)C(=O)NCCCOCCOCCOCCCNC(=O)COCC(N)=O)CC(=O)N1CCNC(=O)CCCCCN\1C2=CC=C(S([O-])(=O)=O)C=C2CC/1=C/C=C/C=C/C1=[N+](CC)C2=CC=C(S([O-])(=O)=O)C=C2C1 UNILWMWFPHPYOR-KXEYIPSPSA-M 0.000 description 1
- ZPAMJYSSYPEWNA-UHFFFAOYSA-L 1-propyl-4-(1-propylpyridin-1-ium-4-yl)pyridin-1-ium;dichloride Chemical compound [Cl-].[Cl-].C1=C[N+](CCC)=CC=C1C1=CC=[N+](CCC)C=C1 ZPAMJYSSYPEWNA-UHFFFAOYSA-L 0.000 description 1
- 238000005160 1H NMR spectroscopy Methods 0.000 description 1
- QMNUDYFKZYBWQX-UHFFFAOYSA-N 1H-quinazolin-4-one Chemical class C1=CC=C2C(=O)N=CNC2=C1 QMNUDYFKZYBWQX-UHFFFAOYSA-N 0.000 description 1
- PYRKKGOKRMZEIT-UHFFFAOYSA-N 2-[6-(2-cyclopropylethoxy)-9-(2-hydroxy-2-methylpropyl)-1h-phenanthro[9,10-d]imidazol-2-yl]-5-fluorobenzene-1,3-dicarbonitrile Chemical compound C1=C2C3=CC(CC(C)(O)C)=CC=C3C=3NC(C=4C(=CC(F)=CC=4C#N)C#N)=NC=3C2=CC=C1OCCC1CC1 PYRKKGOKRMZEIT-UHFFFAOYSA-N 0.000 description 1
- FMKGJQHNYMWDFJ-CVEARBPZSA-N 2-[[4-(2,2-difluoropropoxy)pyrimidin-5-yl]methylamino]-4-[[(1R,4S)-4-hydroxy-3,3-dimethylcyclohexyl]amino]pyrimidine-5-carbonitrile Chemical compound FC(COC1=NC=NC=C1CNC1=NC=C(C(=N1)N[C@H]1CC([C@H](CC1)O)(C)C)C#N)(C)F FMKGJQHNYMWDFJ-CVEARBPZSA-N 0.000 description 1
- YSUIQYOGTINQIN-UZFYAQMZSA-N 2-amino-9-[(1S,6R,8R,9S,10R,15R,17R,18R)-8-(6-aminopurin-9-yl)-9,18-difluoro-3,12-dihydroxy-3,12-bis(sulfanylidene)-2,4,7,11,13,16-hexaoxa-3lambda5,12lambda5-diphosphatricyclo[13.2.1.06,10]octadecan-17-yl]-1H-purin-6-one Chemical compound NC1=NC2=C(N=CN2[C@@H]2O[C@@H]3COP(S)(=O)O[C@@H]4[C@@H](COP(S)(=O)O[C@@H]2[C@@H]3F)O[C@H]([C@H]4F)N2C=NC3=C2N=CN=C3N)C(=O)N1 YSUIQYOGTINQIN-UZFYAQMZSA-N 0.000 description 1
- TVTJUIAKQFIXCE-HUKYDQBMSA-N 2-amino-9-[(2R,3S,4S,5R)-4-fluoro-3-hydroxy-5-(hydroxymethyl)oxolan-2-yl]-7-prop-2-ynyl-1H-purine-6,8-dione Chemical compound NC=1NC(C=2N(C(N(C=2N=1)[C@@H]1O[C@@H]([C@H]([C@H]1O)F)CO)=O)CC#C)=O TVTJUIAKQFIXCE-HUKYDQBMSA-N 0.000 description 1
- QBWKPGNFQQJGFY-QLFBSQMISA-N 3-[(1r)-1-[(2r,6s)-2,6-dimethylmorpholin-4-yl]ethyl]-n-[6-methyl-3-(1h-pyrazol-4-yl)imidazo[1,2-a]pyrazin-8-yl]-1,2-thiazol-5-amine Chemical compound N1([C@H](C)C2=NSC(NC=3C4=NC=C(N4C=C(C)N=3)C3=CNN=C3)=C2)C[C@H](C)O[C@H](C)C1 QBWKPGNFQQJGFY-QLFBSQMISA-N 0.000 description 1
- ABTSMZASCBGYSH-UHFFFAOYSA-N 3-[4-(aminomethyl)-2,3-bis(3-hydroxypropyl)triazol-1-yl]propan-1-ol Chemical compound OCCCN1N(N(C=C1CN)CCCO)CCCO ABTSMZASCBGYSH-UHFFFAOYSA-N 0.000 description 1
- STIQWBOVFQRGRF-UHFFFAOYSA-N 3-prop-2-ynylpyrrole-2,5-dione Chemical class C(C#C)C=1C(=O)NC(C1)=O STIQWBOVFQRGRF-UHFFFAOYSA-N 0.000 description 1
- RJJVVYVLHWMYAA-UITAMQMPSA-N 4-[(Z)-hydroxyiminomethyl]-2-methoxyphenol Chemical compound COC1=CC(\C=N/O)=CC=C1O RJJVVYVLHWMYAA-UITAMQMPSA-N 0.000 description 1
- WYFCZWSWFGJODV-MIANJLSGSA-N 4-[[(1s)-2-[(e)-3-[3-chloro-2-fluoro-6-(tetrazol-1-yl)phenyl]prop-2-enoyl]-5-(4-methyl-2-oxopiperazin-1-yl)-3,4-dihydro-1h-isoquinoline-1-carbonyl]amino]benzoic acid Chemical compound O=C1CN(C)CCN1C1=CC=CC2=C1CCN(C(=O)\C=C\C=1C(=CC=C(Cl)C=1F)N1N=NN=C1)[C@@H]2C(=O)NC1=CC=C(C(O)=O)C=C1 WYFCZWSWFGJODV-MIANJLSGSA-N 0.000 description 1
- XFJBGINZIMNZBW-CRAIPNDOSA-N 5-chloro-2-[4-[(1r,2s)-2-[2-(5-methylsulfonylpyridin-2-yl)oxyethyl]cyclopropyl]piperidin-1-yl]pyrimidine Chemical compound N1=CC(S(=O)(=O)C)=CC=C1OCC[C@H]1[C@@H](C2CCN(CC2)C=2N=CC(Cl)=CN=2)C1 XFJBGINZIMNZBW-CRAIPNDOSA-N 0.000 description 1
- ZCNZNVIRVHJXKV-UHFFFAOYSA-N 6-hydroxy-3,4-dihydrochromen-2-one Chemical compound O1C(=O)CCC2=CC(O)=CC=C21 ZCNZNVIRVHJXKV-UHFFFAOYSA-N 0.000 description 1
- QTBSBXVTEAMEQO-UHFFFAOYSA-M Acetate Chemical compound CC([O-])=O QTBSBXVTEAMEQO-UHFFFAOYSA-M 0.000 description 1
- 102000052866 Amino Acyl-tRNA Synthetases Human genes 0.000 description 1
- 108700028939 Amino Acyl-tRNA Synthetases Proteins 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- LSNNMFCWUKXFEE-UHFFFAOYSA-M Bisulfite Chemical compound OS([O-])=O LSNNMFCWUKXFEE-UHFFFAOYSA-M 0.000 description 1
- 206010006187 Breast cancer Diseases 0.000 description 1
- 208000026310 Breast neoplasm Diseases 0.000 description 1
- 102100029895 Bromodomain-containing protein 4 Human genes 0.000 description 1
- 101710126815 Bromodomain-containing protein 4 Proteins 0.000 description 1
- KXDHJXZQYSOELW-UHFFFAOYSA-N Carbamic acid Chemical group NC(O)=O KXDHJXZQYSOELW-UHFFFAOYSA-N 0.000 description 1
- 229940127007 Compound 39 Drugs 0.000 description 1
- HMFHBZSHGGEWLO-SOOFDHNKSA-N D-ribofuranose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H]1O HMFHBZSHGGEWLO-SOOFDHNKSA-N 0.000 description 1
- 108020001019 DNA Primers Proteins 0.000 description 1
- 239000003155 DNA primer Substances 0.000 description 1
- 230000033616 DNA repair Effects 0.000 description 1
- 108010024636 Glutathione Proteins 0.000 description 1
- 101001113483 Homo sapiens Poly [ADP-ribose] polymerase 1 Proteins 0.000 description 1
- UFHFLCQGNIYNRP-UHFFFAOYSA-N Hydrogen Chemical compound [H][H] UFHFLCQGNIYNRP-UHFFFAOYSA-N 0.000 description 1
- AVXURJPOCDRRFD-UHFFFAOYSA-N Hydroxylamine Chemical compound ON AVXURJPOCDRRFD-UHFFFAOYSA-N 0.000 description 1
- 241000257303 Hymenoptera Species 0.000 description 1
- 239000007987 MES buffer Substances 0.000 description 1
- NQTADLQHYWFPDB-UHFFFAOYSA-N N-Hydroxysuccinimide Chemical compound ON1C(=O)CCC1=O NQTADLQHYWFPDB-UHFFFAOYSA-N 0.000 description 1
- 101150015939 Parva gene Proteins 0.000 description 1
- 229920003171 Poly (ethylene oxide) Polymers 0.000 description 1
- 102100023712 Poly [ADP-ribose] polymerase 1 Human genes 0.000 description 1
- 238000011529 RT qPCR Methods 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 1
- UIIMBOGNXHQVGW-DEQYMQKBSA-M Sodium bicarbonate-14C Chemical compound [Na+].O[14C]([O-])=O UIIMBOGNXHQVGW-DEQYMQKBSA-M 0.000 description 1
- DWAQJAXMDSEUJJ-UHFFFAOYSA-M Sodium bisulfite Chemical compound [Na+].OS([O-])=O DWAQJAXMDSEUJJ-UHFFFAOYSA-M 0.000 description 1
- 108091008874 T cell receptors Proteins 0.000 description 1
- 102000016266 T-Cell Antigen Receptors Human genes 0.000 description 1
- 108020005038 Terminator Codon Proteins 0.000 description 1
- LJOOWESTVASNOG-UFJKPHDISA-N [(1s,3r,4ar,7s,8s,8as)-3-hydroxy-8-[2-[(4r)-4-hydroxy-6-oxooxan-2-yl]ethyl]-7-methyl-1,2,3,4,4a,7,8,8a-octahydronaphthalen-1-yl] (2s)-2-methylbutanoate Chemical compound C([C@H]1[C@@H](C)C=C[C@H]2C[C@@H](O)C[C@@H]([C@H]12)OC(=O)[C@@H](C)CC)CC1C[C@@H](O)CC(=O)O1 LJOOWESTVASNOG-UFJKPHDISA-N 0.000 description 1
- LNUFLCYMSVYYNW-ZPJMAFJPSA-N [(2r,3r,4s,5r,6r)-2-[(2r,3r,4s,5r,6r)-6-[(2r,3r,4s,5r,6r)-6-[(2r,3r,4s,5r,6r)-6-[[(3s,5s,8r,9s,10s,13r,14s,17r)-10,13-dimethyl-17-[(2r)-6-methylheptan-2-yl]-2,3,4,5,6,7,8,9,11,12,14,15,16,17-tetradecahydro-1h-cyclopenta[a]phenanthren-3-yl]oxy]-4,5-disulfo Chemical compound O([C@@H]1[C@@H](COS(O)(=O)=O)O[C@@H]([C@@H]([C@H]1OS(O)(=O)=O)OS(O)(=O)=O)O[C@@H]1[C@@H](COS(O)(=O)=O)O[C@@H]([C@@H]([C@H]1OS(O)(=O)=O)OS(O)(=O)=O)O[C@@H]1[C@@H](COS(O)(=O)=O)O[C@H]([C@@H]([C@H]1OS(O)(=O)=O)OS(O)(=O)=O)O[C@@H]1C[C@@H]2CC[C@H]3[C@@H]4CC[C@@H]([C@]4(CC[C@@H]3[C@@]2(C)CC1)C)[C@H](C)CCCC(C)C)[C@H]1O[C@H](COS(O)(=O)=O)[C@@H](OS(O)(=O)=O)[C@H](OS(O)(=O)=O)[C@H]1OS(O)(=O)=O LNUFLCYMSVYYNW-ZPJMAFJPSA-N 0.000 description 1
- UCKNRFXCSJEJTQ-UHFFFAOYSA-N [4-(2-bromoethoxy)-3-methoxyphenyl]methanol Chemical compound COC1=CC(CO)=CC=C1OCCBr UCKNRFXCSJEJTQ-UHFFFAOYSA-N 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 239000008351 acetate buffer Substances 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 101150063416 add gene Proteins 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 150000001298 alcohols Chemical class 0.000 description 1
- 125000006852 aliphatic spacer Chemical group 0.000 description 1
- 125000002355 alkine group Chemical group 0.000 description 1
- 125000000217 alkyl group Chemical group 0.000 description 1
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- PYMYPHUHKUWMLA-WDCZJNDASA-N arabinose Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)C=O PYMYPHUHKUWMLA-WDCZJNDASA-N 0.000 description 1
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 1
- XRWSZZJLZRKHHD-WVWIJVSJSA-N asunaprevir Chemical compound O=C([C@@H]1C[C@H](CN1C(=O)[C@@H](NC(=O)OC(C)(C)C)C(C)(C)C)OC1=NC=C(C2=CC=C(Cl)C=C21)OC)N[C@]1(C(=O)NS(=O)(=O)C2CC2)C[C@H]1C=C XRWSZZJLZRKHHD-WVWIJVSJSA-N 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- XMQFTWRPUQYINF-UHFFFAOYSA-N bensulfuron-methyl Chemical compound COC(=O)C1=CC=CC=C1CS(=O)(=O)NC(=O)NC1=NC(OC)=CC(OC)=N1 XMQFTWRPUQYINF-UHFFFAOYSA-N 0.000 description 1
- 125000001797 benzyl group Chemical group [H]C1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])* 0.000 description 1
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 1
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 1
- 239000006177 biological buffer Substances 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 238000012925 biological evaluation Methods 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 235000010290 biphenyl Nutrition 0.000 description 1
- 239000004305 biphenyl Substances 0.000 description 1
- 150000001649 bromium compounds Chemical group 0.000 description 1
- 239000008366 buffered solution Substances 0.000 description 1
- 102220389202 c.83A>T Human genes 0.000 description 1
- 238000011088 calibration curve Methods 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 125000002915 carbonyl group Chemical group [*:2]C([*:1])=O 0.000 description 1
- 150000001732 carboxylic acid derivatives Chemical class 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 238000007385 chemical modification Methods 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 229940125797 compound 12 Drugs 0.000 description 1
- 229940125810 compound 20 Drugs 0.000 description 1
- 229940125961 compound 24 Drugs 0.000 description 1
- 229940125846 compound 25 Drugs 0.000 description 1
- 229940125851 compound 27 Drugs 0.000 description 1
- 229940127204 compound 29 Drugs 0.000 description 1
- 229940125877 compound 31 Drugs 0.000 description 1
- 229940127113 compound 57 Drugs 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 238000009833 condensation Methods 0.000 description 1
- 230000005494 condensation Effects 0.000 description 1
- 238000006482 condensation reaction Methods 0.000 description 1
- 239000013058 crude material Substances 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000010511 deprotection reaction Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 230000001627 detrimental effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- IJKVHSBPTUYDLN-UHFFFAOYSA-N dihydroxy(oxo)silane Chemical compound O[Si](O)=O IJKVHSBPTUYDLN-UHFFFAOYSA-N 0.000 description 1
- 239000000539 dimer Substances 0.000 description 1
- 238000012377 drug delivery Methods 0.000 description 1
- 239000000975 dye Substances 0.000 description 1
- 239000003792 electrolyte Substances 0.000 description 1
- 238000006056 electrooxidation reaction Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- LYCAIKOWRPUZTN-UHFFFAOYSA-N ethylene glycol Natural products OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 239000012467 final product Substances 0.000 description 1
- 150000002211 flavins Chemical class 0.000 description 1
- 239000007850 fluorescent dye Substances 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 229960003180 glutathione Drugs 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- JAXFJECJQZDFJS-XHEPKHHKSA-N gtpl8555 Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)N[C@H](B1O[C@@]2(C)[C@H]3C[C@H](C3(C)C)C[C@H]2O1)CCC1=CC=C(F)C=C1 JAXFJECJQZDFJS-XHEPKHHKSA-N 0.000 description 1
- 238000003306 harvesting Methods 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 229910052739 hydrogen Inorganic materials 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- WGCNASOHLSPBMP-UHFFFAOYSA-N hydroxyacetaldehyde Natural products OCC=O WGCNASOHLSPBMP-UHFFFAOYSA-N 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 210000002865 immune cell Anatomy 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 150000002576 ketones Chemical group 0.000 description 1
- 150000002596 lactones Chemical class 0.000 description 1
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 1
- 235000019341 magnesium sulphate Nutrition 0.000 description 1
- 125000005439 maleimidyl group Chemical group C1(C=CC(N1*)=O)=O 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- WCYWZMWISLQXQU-UHFFFAOYSA-N methyl Chemical group [CH3] WCYWZMWISLQXQU-UHFFFAOYSA-N 0.000 description 1
- KQNYTTDHCMFOME-UHFFFAOYSA-N methyl n-[[3-[(4-tert-butylpiperazin-1-yl)methyl]-8-fluoro-2-phenylquinoline-4-carbonyl]amino]-n-phenylcarbamate Chemical compound C=1C=CC=CC=1N(C(=O)OC)NC(=O)C(C1=CC=CC(F)=C1N=C1C=2C=CC=CC=2)=C1CN1CCN(C(C)(C)C)CC1 KQNYTTDHCMFOME-UHFFFAOYSA-N 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 230000004001 molecular interaction Effects 0.000 description 1
- PSHKMPUSSFXUIA-UHFFFAOYSA-N n,n-dimethylpyridin-2-amine Chemical compound CN(C)C1=CC=CC=N1 PSHKMPUSSFXUIA-UHFFFAOYSA-N 0.000 description 1
- 238000006386 neutralization reaction Methods 0.000 description 1
- 229910000510 noble metal Inorganic materials 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 238000011275 oncology therapy Methods 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 125000002524 organometallic group Chemical group 0.000 description 1
- 230000001590 oxidative effect Effects 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 150000002989 phenols Chemical class 0.000 description 1
- ZUOUZKKEUPVFJK-UHFFFAOYSA-N phenylbenzene Natural products C1=CC=CC=C1C1=CC=CC=C1 ZUOUZKKEUPVFJK-UHFFFAOYSA-N 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 239000007981 phosphate-citrate buffer Substances 0.000 description 1
- 150000003013 phosphoric acid derivatives Chemical class 0.000 description 1
- 230000004962 physiological condition Effects 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 229920002704 polyhistidine Polymers 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 229910000027 potassium carbonate Inorganic materials 0.000 description 1
- 239000012286 potassium permanganate Substances 0.000 description 1
- 239000008057 potassium phosphate buffer Substances 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 230000002028 premature Effects 0.000 description 1
- 230000037452 priming Effects 0.000 description 1
- QLNJFJADRCOGBJ-UHFFFAOYSA-N propionamide Chemical compound CCC(N)=O QLNJFJADRCOGBJ-UHFFFAOYSA-N 0.000 description 1
- AOHJOMMDDJHIJH-UHFFFAOYSA-N propylenediamine Chemical compound CC(N)CN AOHJOMMDDJHIJH-UHFFFAOYSA-N 0.000 description 1
- JUJWROOIHBZHMG-UHFFFAOYSA-O pyridinium Chemical compound C1=CC=[NH+]C=C1 JUJWROOIHBZHMG-UHFFFAOYSA-O 0.000 description 1
- ZDYVRSLAEXCVBX-UHFFFAOYSA-N pyridinium p-toluenesulfonate Chemical compound C1=CC=[NH+]C=C1.CC1=CC=C(S([O-])(=O)=O)C=C1 ZDYVRSLAEXCVBX-UHFFFAOYSA-N 0.000 description 1
- 239000002096 quantum dot Substances 0.000 description 1
- IUVKMZGDUIUOCP-BTNSXGMBSA-N quinbolone Chemical compound O([C@H]1CC[C@H]2[C@H]3[C@@H]([C@]4(C=CC(=O)C=C4CC3)C)CC[C@@]21C)C1=CCCC1 IUVKMZGDUIUOCP-BTNSXGMBSA-N 0.000 description 1
- 230000009257 reactivity Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 239000002265 redox agent Substances 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 238000002390 rotary evaporation Methods 0.000 description 1
- 238000007086 side reaction Methods 0.000 description 1
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N silicon dioxide Inorganic materials O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 1
- 229910052814 silicon oxide Inorganic materials 0.000 description 1
- JVBXVOWTABLYPX-UHFFFAOYSA-L sodium dithionite Chemical compound [Na+].[Na+].[O-]S(=O)S([O-])=O JVBXVOWTABLYPX-UHFFFAOYSA-L 0.000 description 1
- 235000010267 sodium hydrogen sulphite Nutrition 0.000 description 1
- AKHNMLFCWUSKQB-UHFFFAOYSA-L sodium thiosulfate Chemical compound [Na+].[Na+].[O-]S([O-])(=O)=S AKHNMLFCWUSKQB-UHFFFAOYSA-L 0.000 description 1
- 235000019345 sodium thiosulphate Nutrition 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 230000006641 stabilisation Effects 0.000 description 1
- 238000011105 stabilization Methods 0.000 description 1
- 230000000087 stabilizing effect Effects 0.000 description 1
- 125000001424 substituent group Chemical group 0.000 description 1
- 229960002317 succinimide Drugs 0.000 description 1
- 238000001308 synthesis method Methods 0.000 description 1
- KCDXJAYRVLXPFO-UHFFFAOYSA-N syringaldehyde Chemical compound COC1=CC(C=O)=CC(OC)=C1O KCDXJAYRVLXPFO-UHFFFAOYSA-N 0.000 description 1
- COBXDAOIDYGHGK-UHFFFAOYSA-N syringaldehyde Natural products COC1=CC=C(C=O)C(OC)=C1O COBXDAOIDYGHGK-UHFFFAOYSA-N 0.000 description 1
- 125000001412 tetrahydropyranyl group Chemical group 0.000 description 1
- WROMPOXWARCANT-UHFFFAOYSA-N tfa trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F.OC(=O)C(F)(F)F WROMPOXWARCANT-UHFFFAOYSA-N 0.000 description 1
- 150000003568 thioethers Chemical class 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 238000004704 ultra performance liquid chromatography Methods 0.000 description 1
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 1
- 229940045136 urea Drugs 0.000 description 1
- 229940045145 uridine Drugs 0.000 description 1
- ZENOXNGFMSCLLL-UHFFFAOYSA-N vanillyl alcohol Chemical compound COC1=CC(CO)=CC=C1O ZENOXNGFMSCLLL-UHFFFAOYSA-N 0.000 description 1
- 238000011179 visual inspection Methods 0.000 description 1
- 239000011800 void material Substances 0.000 description 1
- 238000010792 warming Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/26—Preparation of nitrogen-containing carbohydrates
- C12P19/28—N-glycosides
- C12P19/30—Nucleotides
- C12P19/34—Polynucleotides, e.g. nucleic acids, oligoribonucleotides
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/12—Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
- C12N9/1241—Nucleotidyltransferases (2.7.7)
- C12N9/1264—DNA nucleotidylexotransferase (2.7.7.31), i.e. terminal nucleotidyl transferase
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y207/00—Transferases transferring phosphorus-containing groups (2.7)
- C12Y207/07—Nucleotidyltransferases (2.7.7)
- C12Y207/07031—DNA nucleotidylexotransferase (2.7.7.31), i.e. terminal deoxynucleotidyl transferase
-
- C—CHEMISTRY; METALLURGY
- C25—ELECTROLYTIC OR ELECTROPHORETIC PROCESSES; APPARATUS THEREFOR
- C25B—ELECTROLYTIC OR ELECTROPHORETIC PROCESSES FOR THE PRODUCTION OF COMPOUNDS OR NON-METALS; APPARATUS THEREFOR
- C25B3/00—Electrolytic production of organic compounds
- C25B3/01—Products
- C25B3/05—Heterocyclic compounds
-
- C—CHEMISTRY; METALLURGY
- C25—ELECTROLYTIC OR ELECTROPHORETIC PROCESSES; APPARATUS THEREFOR
- C25B—ELECTROLYTIC OR ELECTROPHORETIC PROCESSES FOR THE PRODUCTION OF COMPOUNDS OR NON-METALS; APPARATUS THEREFOR
- C25B3/00—Electrolytic production of organic compounds
- C25B3/20—Processes
-
- C—CHEMISTRY; METALLURGY
- C40—COMBINATORIAL TECHNOLOGY
- C40B—COMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
- C40B80/00—Linkers or spacers specially adapted for combinatorial chemistry or libraries, e.g. traceless linkers or safety-catch linkers
Definitions
- TdT engineered terminal deoxynucleotidyl transferase
- engineered TdTs covalently attached to a nucleotide molecule via a tether and also nucleotide molecules comprising a redox-cleavable linker.
- systems for enzymatic DNA synthesis comprising an engineered TdT, a redox-cleavable linker a redox shuttle solution, and two or more electrodes.
- DNA polymerases are enzymes responsible for the replication of genetic material in vivo and in vitro. Specifically, these enzymes are responsible for catalyzing the addition of nucleotide triphosphates (e.g,, dNTPs and analogs thereof) to the three-prime end of a primer or seed strand of DNA.
- nucleotide triphosphates e.g,, dNTPs and analogs thereof
- the majority of DNA polymerases replicate DNA in a largely template-dependent manner. That is: synthesizing the reverse complement strand of a DNA strand.
- a few polymerases have template-independent activity, wherein they can synthesize random sequences of DNA without the influence or need of a template strand.
- Terminal deoxynucleotidyl transferase (hereinto referred to as TdT) is a DNA polymerase capable of catalyzing the random addition of nucleotides.
- TdT acts in conjunction with DNA repair pathways to generate highly diverse sequences at VDJ junction sites.
- TdT also displays template-independent activity, enabling its widespread use for applications such as poly A tailing of DNA. Because TdT does not require a DNA primer strand for DNA synthesis it is an ideal enzyme for in vitro DNA synthesis (see, e.g., Gouge, Jerome, et al. “Structures of intermediates along the catalytic cycle of terminal deoxy nucleotidyl transferase: dynamical aspects of the two-metal ion mechanism.” Journal of Molecular Biology 425.22 (2013): 4334-4352).
- TdT adds nucleotides in an uncontrolled manner. While this template independence provides a means to synthesize entirely novel sequences of DNA from scratch, to do so in a highly controlled and sequence specific mariner requires the control of the enzyme.
- TdT enzyme capable of controllably adding or inserting a single nucleotide (mononucleotide) at a time would enable new DNA synthesis strategies not previously possible, with benefits over existing strategies, and in particular would enable control of the enzyme for the synthesis of sequence-specified DNA, RNA, or other nucleic acid molecules.
- the present disclosure addresses, among other things, the problem of synthesizing any polymer wherein an enzyme (e.g., terminal deoxynucleotidyl transferase, referred to herein as TdT or Tdt) can mediate addition of monomers to a growing oligomer chain.
- an enzyme e.g., terminal deoxynucleotidyl transferase, referred to herein as TdT or Tdt
- TdT or Tdt terminal deoxynucleotidyl transferase
- compositions and methods wherein a small molecule is linked to an enzyme via a linker that can be electrochemically cleaved, e.g., by a redox electrochemical reaction.
- the small molecule is a nucleotide, nucleotide triphosphate or a nucleotide analog tethered to TdT via a redox- cleavable linker such as a quinone oxime ether linker; in some embodiments the conjugation to the enzyme is affected by click chemistry (after introducing a click chemistry' moiety' via reacting a site-specific cysteine residue with an appropriately functionalized maleimide).
- the TdT enzyme is incubated with a single-stranded DNA molecule; upon incubation, the enzy me-tethered nucleotide is covalently attached to the ssDNA.
- the ssDNA with a newly incorporated base is cleaved from the enzyme.
- Embodimen t 1 is a method of nucl eic acid molecule synthesis comprising the steps of:
- TdT engineered terminal deoxynucleotidyl transferase
- the engineered TdT comprises a nucleotide molecule covalently attached to the TdT via a redox-cleavable linker
- Embodiment 2 is the method of embodiment 1, wherein the redox-cleavable linker is a quinone oxime ether linker.
- Embodiment 3 is the method of embodiment 1 or embodiment 2, wherein the electrochemical conditions of step (d) comprise one or more suitable soluble redox shuttles in the presence of two or more electrodes, wherein the two or more electrodes control the oxidation state of the soluble redox shuttles.
- Embodiment 4 is the method of any one of embodiments 1-3, wherein the soluble redox shuttles comprise a thiazine, a viologen, dinucleotides, a flavin, a quinone, peroxide, bromide, vanadium and/or a metal complex.
- Embodiment 5 is the method of embodiment 4, wherein the soluble redox shuttles comprise a thiazine.
- Embodiment 6 is the method of embodiment 4, wherein the soluble redox shuttles comprise a viologen.
- Embodiment 7 is the method of embodiment any one of embodiments 1-3, wherein the soluble redox shuttles comprise dinucleotides.
- Embodiment 8 is the method of embodiment 7, wherein the dinucleotides comprise NAD/NADH, nicotinamide adenine dinucleotide, and/or reduced form.
- Embodiment 9 is the method of embodiment 4, wherein the soluble redox shuttles comprise a flavin.
- Embodiment 10 is the method of embodiment 4, wherein the soluble redox shuttles comprise a quinone.
- Embodiment 11 is the method of embodiment 4, wherein the soluble redox shuttles comprise peroxide.
- Embodiment 12 is the method of embodiment 4 wherein the soluble redox shuttles comprise bromide.
- Embodiment 13 is the method of embodiment 4, wherein the soluble redox shuttles comprise vanadium.
- Embodiment 14 is the method of embodiment 4. wherein the soluble redox shuttles comprise a metal complex.
- Embodiment 15 is the method of embodiment 5, wherein the thiazine is methylene blue.
- Embodiment 16 is the method of embodiment 4, wherein the quinone is 2,5- dihydroxy-l,4-benzoquinone (DHBQ), 9,10-anthraquinone-2,7-disulfonic acid (AQDS), 4,5- dihydroxybenzene-l,3-disulfonic acid (BQDS), or 2,3, 5,6- tetrakis((dimethylamino)methyl)hydroquinone (FQHr).
- the quinone is 2,5- dihydroxy-l,4-benzoquinone (DHBQ), 9,10-anthraquinone-2,7-disulfonic acid (AQDS), 4,5- dihydroxybenzene-l,3-disulfonic acid (BQDS), or 2,3, 5,6- tetrakis((dimethylamino)methyl)hydroquinone (FQHr).
- the quinone is 2,5- dihydroxy-l,4-benzoquinone (DHB
- Embodiment 17 is the method of embodiment 9, wherein the flavin is riboflavin (RF), flavin mononucleotide (FMN), or flavin adenine dinucleotide (FAD).
- RF riboflavin
- FMN flavin mononucleotide
- FAD flavin adenine dinucleotide
- Embodiment 18 is the method of embodiment 6, wherein the viologen is methyl viologen (MV) or bis-(trimethylammonio) propyl viologen (BTMAP-Vi).
- MV methyl viologen
- BTMAP-Vi bis-(trimethylammonio) propyl viologen
- Embodiment 19 is the method of embodiment 14, wherein the metal complex is TEMPO (2,2,6,6-tetramethylpiperidin-l-yl)oxyl, (2,2,6,6-tetramethylpiperidin-l- yl)oxidanyl).
- the metal complex is TEMPO (2,2,6,6-tetramethylpiperidin-l-yl)oxyl, (2,2,6,6-tetramethylpiperidin-l- yl)oxidanyl).
- Embodiment 20 is the method of embodiment 14, wherein the metal complex is an organometallic coordination complex.
- Embodiment 21 is the method of embodiment 20, wherein the organometallic coordination complex is a ferrocene, bis-(trimethylammomo) propyl ferrocene (BTMAP-Fc), ammonium ferricyanide ((NH4)4Fe(CN)6), potassium ferricyanide (K4Fe(CN)&), hexaamineruthemum (III) chloride (RuHex), permanganate, or dichromate.
- the organometallic coordination complex is a ferrocene, bis-(trimethylammomo) propyl ferrocene (BTMAP-Fc), ammonium ferricyanide ((NH4)4Fe(CN)6), potassium ferricyanide (K4Fe(CN)&), hexaamineruthemum (III) chloride (RuHex), permanganate, or dichromate.
- BTMAP-Fc bis-(trimethylammomo) propyl ferrocene
- Embodiment 22 is the method of any one of embodiments 1 -21, wherein the electrochemical conditions of step (d) further comprise a buffer, a supporting electrolyte, and/or a hydrotropic agent.
- Embodiment 23 is the method of embodiment 22, wherein the buffer comprises sodium citrate, potassium hydrogen phosphate, and/or potassium dihydrogen phosphate.
- Embodiment 24 is the method of embodiment 22, wherein the supporting electrolyte comprises sodium chloride, sodium salts, lithium salts, potassium salts, and/or magnesium salts.
- Embodiment 25 is the method of embodiment 22, wherein the hydrotropic agent comprises caffeine, urea, and/or nicotinamide (NA).
- the hydrotropic agent comprises caffeine, urea, and/or nicotinamide (NA).
- Embodiment 26 is the method of any one of embodiments 1-3, wherein the electrochemical conditions of step (d) comprise a redox shuttle solution comprising methylene blue, citrate-phosphate buffer, and sodium chloride.
- Embodiment 27 is the method of any one of embodiments 1-3, wherein the electrochemical conditions of step (d) comprise a redox shuttle solution comprising methylene blue, citrate-phosphate buffer, sodium chloride, and nicotinamide,
- Embodiment 28 is the method of any one of embodiments 1-3, wherein the electrochemical conditions of step (d) comprise a redox shuttle solution comprising flavin mononucleotide, citrate-phosphate buffer, sodium chloride, and nicotinamide.
- the electrochemical conditions of step (d) comprise a redox shuttle solution comprising flavin mononucleotide, citrate-phosphate buffer, sodium chloride, and nicotinamide.
- Embodiment 29 is the method of any one of embodiments 1-3, wherein the electrochemical conditions of step (d) comprise a redox shuttle solution comprising 10 mM methylene blue, 25 mM citrate-phosphate buffer at pH 3.4, and 100 mM sodium chloride.
- Embodiment 30 is the method of any one of embodiments 1 -3, wherein the electrochemical conditions of step (d) comprise a redox shuttle solution comprising 10 mM methylene blue, 25 mM citrate-phosphate buffer at pH 3.4, 100 mM sodium chloride, and 1 M nicotinamide,
- Embodiment 31 is the method of any one of embodiments 1-3, wherein the electrochemical conditions of step (d) comprise a redox shuttle solution comprising 50 mM flavin mononucleotide, 25 mM citrate-phosphate buffer at pH 3.4, 1 M sodium chloride, and 1 M nicotinamide.
- Embodiment 32 is the method of any one of embodiments 1-31, wherein the potential for cleavage is about +1.05 vs. SHE to -0.18V vs. SHE at 25 °C and pH 3; +0.84 V vs. SHE to -0.38 V vs. SHE at 25°C and pH 6.5; or +0.64V vs. SHE to -0.59V vs. SHE at 25°C and pH 10.
- Embodiment 33 is the method of any one of embodiments 1 --32. wherein the engineered TdT comprises one or more mutations to a wild-type TdT of SEQ ID NO: 1, wherein: a.
- the one or more mutations comprise one or more of the following mutations: C7A, QI8K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, Fl 19Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S 164E, M165Q, V172W, T173Q, D177G, L179T, M 191 K, D208E, Q21 IK, F220W, Q223K, C230E, L233Q, Q242L, C256A, D263H, G265Q, H268D, E270G, C290A, D2.93E, T317R, M323L, R335N, V336T, E339R, D353E, wherein position numbers are relative to SEQ
- the one or more mutations comprise one or more of the following mutations: C7A, QI8K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, LI 12P, Fl 19Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M 191 K, D208E, Q21 IK, F220W, Q223K, C230E, L233Q, Q242L, C256A, a deletion of D263, a deletion ofH264, a deletion of G265, a deletion of R266, a deletion of V267, a deletion of H268, a deletion of S269, a deletion ot'E270, a deletion ofK271,
- Embodiment 34 is an engineered terminal deoxynucleotidyl transferase (TdT) comprising one or more mutations to a wild-type TdT of SEQ ID NO: 1, wherein: a. the one or more mutations comprise one or more of the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, F1I9Y, S127E, KI28D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q21 IK, F220W, Q223K, C230E, 1 ,233Q, Q242L, C256A, D263
- the one or more mutations comprise one or more of the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, LH2P, F119Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N 156T, VI 631, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, a deletion of D263, a deletion of H264, a deletion of G265, a deletion of R266, a deletion of V267, a deletion of H268, a deletion of S269, a deletion of E270, a deletion of K271, a deletion of S
- Embodiment 35 is the engineered TdT of embodiment 34, wherein the one or more mutations comprise one or more of the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, Fl 19Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q21 IK, F220W, Q223K, C230E, L233Q, Q242L, C256A, D263H, G265Q, H268D, E270G, C290A, D293E, T317R, M323L, R335N, V336T,
- Embodiment 36 is the engineered TdT of embodiment 34 or embodiment 35, wherein the one or more mutations comprise 10, 15, 20, 25, 30, 35, 40, 45, 50, or more of the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, SI 00 A, L112P, Fl 19Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, VI 631, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, D263H, G265Q, H268D, E270G, C290A, D293E, T317R,
- Embodiment 37 is the engineered TdT of any one of embodiments 34-36, wherein the one or more mutations comprise 50 or more of the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, Fl 19Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M 191 K, D208E, Q21 IK, F220W, Q223K, C230E, L233Q, Q242L, C256A, D263H, G265Q, H268D, E270G, C290A, D293E, T317R, M323L, R335N
- Embodiment 38 is the engineered TdT of any one of embodiments 34-37, wherein the one or more mutations comprise all of the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A, M44R, S47A.
- Embodiment 39 is the engineered TdT of any one of embodiments 34-38, wherein the one or more mutations consist of the following mutations: C7A, QI 8K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, F119Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, D263H, G265Q, H268D, E270G, C290A, D293E, T317R, M323L, R335N, V3
- Embodiment 40 is the engineered TdT of any one of embodiments 34-39, wherein the engineered TdT comprises an amino acid sequence with 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100% identity to SEQ ID NO: 2.
- Embodiment 41 is the engineered TdT of any one of embodiments 34-40, wherein the engineered TdT comprises the ammo acid sequence of SEQ ID NO: 2,
- Embodiment 42 is the engineered TdT of any one of embodiments 34-41, wherein the amino acid sequence of the engineered TdT consists of the ammo acid sequence of SEQ ID NO: 2.
- Embodiment 43 is the engineered TdT of any one of embodiments 34-41, wherein the ammo acid sequence of the engineered TdT consists of the amino acid sequence of SEQ ID NO: 2 and a fluorescent protein label, optionally a green fluorescent protein (GFP) label ,
- GFP green fluorescent protein
- Embodiment 44 is the engineered TdT of any one of embodiments 34-41, wherein the amino acid sequence of the engineered TdT consists of the ammo acid sequence of SEQ ID NO: 2 and an affinity tag, optionally a His-tag.
- Embodiment 45 is the engineered TdT of any one of embodiments 34-41, wherein the engineered TdT further comprises a fluorescent protein label, optionally a GFP label.
- Embodiment 46 is the engineered TdT of any one of embodiments 34-41, wherein the engineered TdT further comprises an affinity tag, optionally a His-tag.
- Embodiment 47 is the engineered TdT of any one of embodiments 34-46, wherein the TdT further comprises the conjugation mutation E32C.
- Embodiment 48 is the engineered TdT of any one of embodiments 34-46, wherein the TdT further comprises the conjugation mutation E29C.
- Embodiment 49 is the engineered TdT of any one of embodiments 34-46, wherein the TdT further comprises the conjugation mutation E37C.
- Embodiment 50 is the engineered TdT of any one of embodiments 34-46, wherein the TdT further comprises the conjugation mutation V152C.
- Embodiment 51 is the engineered TdT of any one of embodiments 34-46, wherein the TdT further comprises the conjugation mutation K102C.
- Embodiment 52 is the engineered TdT of any one of embodiments 34-46, wherein the TdT further comprises the conjugation mutation D22C.
- Embodiment 53 is the engineered TdT of any one of embodiments 34-46, wherein the TdT further comprises the conjugation mutation R44C.
- Embodiment 54 is the engineered TdT of any one of embodiments 34-46, wherein the TdT further comprises the conjugation mutation L93C.
- Embodiment 55 is the engineered TdT of any one of embodiments 34-46, wherein the TdT further comprises the conjugation mutation P154C.
- Embodiment 56 is the engineered TdT of any one of embodiments 34-46, wherein the TdT further comprises the conjugation mutation N361C.
- Embodiment 57 is the engineered TdT of any one of embodiments 34-38, wherein the TdT comprises the amino acid sequence of any one of SEQ ID NOs: 11, 12, 13, and/or 15.
- Embodiment 58 is the engineered TdT of any one of embodiments 34-38, wherein the TdT comprises the ammo acid sequence of SEQ ID NO: 15.
- Embodiment 59 is the engineered TdT of any one of embodiments 34-38, wherein the TdT comprises the amino acid sequence of any one of SEQ ID NOs: 25, 26, 27, 28, and 29.
- Embodiment 60 is an engineered TdT comprising the amino acid sequence of any one of SEQ ID NOs: 20, 21, 22, 23, and 24.
- Embodiment 61 is an engineered terminal deoxynucleotidyl transferase (TdT) consisting of the ammo acid sequence of SEQ ID NO: 15.
- Embodiment 62. is the engineered TdT of embodiment 34, wherein the one or more mutations comprise one or more of the following mutations: C7A, QI 8K, L19K, D31 A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, 18 IL, K99Q, SI 00 A, Li 12P, F119Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V 1631, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q21IK, F220W, Q223K, C230E, L233Q, Q242L
- Embodiment 63 is the engineered TdT of embodiment 34 or 62, wherein the one or more mutations comprise 10, 15, 20, 25, 30, 35, 40, 45, 50, or more of the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, 1811.,, K99Q, SI 00A, LI 12P, Fl 19Y, S127E, K128D, Q130R, S 134 T, Q139R, C154P, N156T, ⁇ J 631.
- Embodiment 64 is the engineered TdT of embodiments 34, 62 or 63, wherein the one or more mutations comprise 50 or more of the following mutations: C7A, Q18K, L19K, D3IA, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, Fl 19Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V 1631, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, a deletion of D263, a deletion of H264, a deletion of G265, a deletion of R266, a deletion of V267, a deletion of H26
- Embodimen t 65 is the engineered TdT of any one of embodiments 34 or 62- 64, wherein the one or more mutations comprise the following mutations: C7A, Q18K, L19K, D3IA, E35G, C40A, M44R, S47A.
- Embodiment 66 is the engineered TdT of any one of embodiments 34 or 62-
- the one or more mutations consist of the following mutations: C7A, QI8K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, F119Y, S127E, K128D, QI30R, S134T, Q139R, C154P, N156T, V163I, SI64E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q21 IK, F220W, Q223K, C230E, L233Q, Q242L, C256A, a deletion of D263, a deletion of H264, a deletion of G265, a deletion of R266, a deletion of V267, a deletion of H268, a deletion of S269, a deletion of E270, a deletion of K271, a
- Embodiment 67 is the engineered TdT of any one of embodiments 34 or 62-
- the engineered TdT comprises an amino acid sequence with 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100% identity to SEQ ID NO: 3.
- Embodiment 68 is the engineered TdT of any one of embodiments 34 or 62-
- engineered TdT comprises the amino acid sequence of SEQ ID NO: 3.
- Embodimen t 69 is the engineered TdT of any one of embodiments 34 or 62-
- amino acid sequence of the engineered TdT consists of the ammo acid sequence of SEQ ID NO: 3.
- Embodiment 70 is the engineered TdT of any one of embodiments 34 or 62- 68, wherein the ammo acid sequence of the engineered TdT consists of the amino acid sequence of SEQ ID NO: 3 and a fluorescent protein label, optionally a GFP label.
- Embodiment 71 is the engineered TdT of any one of embodiments 34 or 62- 68, wherein the amino acid sequence of the engineered TdT consists of the ammo acid sequence of SEQ ID NO: 3 and an affinity' tag, optionally aHis-tag.
- Embodiment 72. is the engineered TdT of any one of embodiments 34 or 62- 68, wherein the engineered TdT further comprises a fluorescent protein label, optionally a green fluorescent protein (GFP) label.
- GFP green fluorescent protein
- Embodiment 73 is the engineered TdT of any one of embodiments 34 or 62- 68, wherein the engineered TdT further comprises an affinity tag, optionally aHis-tag.
- Embodiment 74 is the engineered TdT of any one of embodiments 34 or 62- 73, wherein the TdT further comprises the conjugation mutation E32nc, wherein nc is anon- canonical ammo acid.
- Embodiment 75 is the engineered TdT of any one of embodiments 34 or 62- 73, wherein the TdT further comprises the conjugation mutation V152nc, wherein nc is a non-canonical amino acid.
- Embodiment 76 is the engineered TdT of any one of embodiments 34 or 62- 73, wherein the TdT further comprises the conjugation mutation P154nc, wherein nc is a non- canonical amino acid,
- Embodiment 77 is the engineered TdT of any one of embodiments 34 or 62- 73, wherein the TdT further comprises the conjugation mutation E32C.
- Embodiment 78 is the engineered TdT of any one of embodiments 34 or 62- 73, wherein the TdT further comprises the conjugation mutation V152C.
- Embodiment 79 is the engineered TdT of any one of embodiments 34 or 62- 73, wherein the TdT further comprises the conjugation mutation P154C.
- Embodiment 80 is the engineered TdT of any one of embodiments 34 or 62- 73, wherein the TdT further comprises the conjugation mutation E29C.
- Embodiment 81 is the engineered TdT of any one of embodiments 34 or 62- 73, wherein the TdT further comprises the conjugation mutation E37C.
- Embodiment 82 is the engineered TdT of any one of embodiments 34 or 62- 73, wherein the TdT further comprises the conjugation mutation R44C.
- Embodiment 83 is the engineered TdT of any one of embodiments 34 or 62- 73, wherein the TdT further comprises the conjugation mutation K102C.
- Embodiment 84 is the engineered TdT of any one of embodiments 34 or 62- 64, wherein the TdT comprises the amino acid sequence of SEQ ID NO: 4.
- Embodiment 85 is the engineered TdT of any one of embodiments 34 or 62- 64, wherein the TdT comprises the amino acid sequence of SEQ ID NO: 5.
- Embodiment 86 is the engineered TdT of any one of embodiments 34 or 62- 64, wherein the TdT comprises the amino acid sequence of SEQ ID NO: 6.
- Embodiment 87 is the engineered TdT of any one of embodiments 34 or 62- 64, wherein the TdT comprises the ammo acid sequence of SEQ ID NO: 7.
- Embodiment 88 is the engineered TdT of any one of embodiments 34 or 62- 64, wherein the TdT comprises the amino acid sequence of SEQ ID NO: 9.
- Embodiment 89 is the engineered TdT of any one of embodiments 34 or 62-64, wherein the TdT comprises the amino acid sequence of SEQ ID NO: 10.
- Embodiment 90 is the engineered TdT of any one of embodiments 34 or 62-64, wherein the TdT comprises the amino acid sequence of SEQ ID NO: 16.
- Embodiment 91 is the engineered TdT of any one of embodiments 34 or 62-64, wherein the TdT comprises the amino acid sequence of SEQ ID NO: 17.
- Embodiment 92 is the engineered TdT of any one of embodiments 34 or 62-64, wherein the TdT comprises the amino acid sequence of SEQ ID NO: 18
- Embodiment 93 is the engineered TdT of any one of embodiments 34 or 62-64, wherein the TdT comprises the amino acid sequence of SEQ ID NO: 19.
- Embodiment 94 is the engineered TdT of any of embodiments 34-93, wherein the TdT is capable of covalently binding to a nucleotide molecule at a conjugation residue of the engineered TdT via a tether, wherein:
- the conjugation residue comprises a cysteine residue, anon-canonical amino acid residue, or a lysine residue capable of reacting with a bifunctional crosslinker or a redox- cleavable linker;
- nucleotide molecule comprises a nucleotide, nucleotide triphosphate, or a nucleotide analog.
- Embodiment 95 is the engineered TdT of embodiment 94, wherein the TdT, tether, and nucleotide molecule comprise any one of the following structures:
- Embodiment 96 is the engineered TdT of embodiment 94, wherein the
- Embodiment 97 is the engineered TdT of embodiment 94, wherein the
- TdT TdT
- tether TdT
- nucleotide molecule comprise any one of the following structures:
- Embodiment 98 is the engineered TdT of embodiment 94, wherein the
- TdT, tether, and nucleotide molecule comprise any one of the following structures: wherein Z is any one of:
- Embodiment 99 is the engineered TdT of embodiment 94, wherein the
- TdT, tether, and nucleotide molecule comprise any one of the following structures: wherein Z is any one of:
- Embodiment 100 is the engineered TdT of embodiment 94, wherein the TdT, tether, and nucleotide molecule comprise any one of the following structures: wherein X is O or Nil; wherein ⁇ is H or OMe; and wherein Z is any one of:
- Embodiment 101 is the engineered TdT of embodiment 94, wherein the TdT, tether, and nucleotide molecule comprise any one of the following structures: wherein X is O or NH; wherein Y is H or OMe; and wherein Z is any one of:
- Embodiment 102 is the engineered TdT of embodiment 94, wherein the TdT, tether, and nucleotide molecule comprise any one of the following structures: wherein Y is H or OMe; and wherein Z is any one of:
- Embodiment 103 is the engineered TdT of embodiment 94, wherein the TdT, tether, and nucleotide molecule comprise any one of the following structures: wherein Y is H or OMe; and wherein Z is any one of:
- Embodiment 104 is a kit comprising the engineered TdT of any one of embodiments 34-93, wherein the TdT is capable of covalently binding to a nucleotide molecule at a conjugation residue of the engineered TdT via a tether, wherein the kit further comprises:
- Embodiment 105 is the kit of embodiment 104, wherein the kit further comprises a spacer.
- Embodiment 106 is the kit of any one of embodiments 104-106, wherein the functional group for covalent conjugation comprises a maleimide, N- hydroxysuccinimidyl group, or a dibenzylcyclooctyne (DBCO).
- the functional group for covalent conjugation comprises a maleimide, N- hydroxysuccinimidyl group, or a dibenzylcyclooctyne (DBCO).
- Embodiment 107 is the kit of any one of embodiments 104-106, wherein the functional group for covalent conjugation comprises a maleimide.
- Embodiment 108 is the kit of embodiment 107, wherein the maleimide is capable of ataching to a conjugation residue of the engineered TdT, wherein the conjugation residue is a cysteine residue in the TdT.
- Embodiment 109 is the kit of any one of embodiments 104-106, wherein the functional group for covalent conjugation comprises a DBCO.
- Embodiment 110 is the kit of embodiment 109, wherein the DBCO is capable of attaching to a conjugation residue of the engineered TdT, and wherein the conjugation residue is anon-canonical amino acid in the TdT.
- Embodiment 111 is the kit of embodiment 110, wherein the non- canonical amino acid is 4-azidophenylalanine.
- Embodiment 112 is the kit of any one of embodiments 104-111, wherein the redox-cleavable linker comprises a quinone oxime ether linker.
- Embodiment 113 is the kit of any one of embodiments 104- 111, wherein the redox-cleavable linker comprises a disulfide linker.
- Embodiment 114 is the kit of any one of embodiments 104-111, wherein the redox-cleavable linker comprises a syringic acid-based linker or van ill in-based linker.
- Embodiment 115 is the kit of any one of embodiments 105-114, wherein the spacer comprises one or more polyethylene glycol molecules (PEG) «, optionally wherein “n” is 1-12.
- PEG polyethylene glycol molecules
- Embodiment 116 is the kit of any one of embodiments 105-114, wherein the spacer comprises a Cl -CIO alkyl.
- Embodiment 117 is the kit of any one of embodiments 105- 114, wherein the spacer comprises a hydrocarbon chain wherein one or more of the carbons comprises a substituted sulfonate group.
- Embodiment 118 is the kit of any one of embodiments 105-114, wherein the spacer comprises a hydrocarbon chain comprising 1-6 carbons wherein one or more of the carbons comprises a substituted sulfonate group.
- Embodiment 119 is the kit of any one of embodiments 105-114, wherein the spacer comprises a C1-C6 alkyl.
- Embodiment 120 is the kit of embodiment 104, wherein the kit comprises a bifunctional crosslinker capable of being covalently linked to the engineered TdT and comprising a DBCO, wherein the DBCO of the bifunctional crosslinker is capable of reacting with any one of the following molecules:
- Embodiment 121 is the kit of embodiment 104. wherein the kit comprises a bifunctional crosslinker capable of being covalently linked to the engineered TdT and comprising a DBCO, wherein the DBCO of the bifunctional crosslinker is capable of reacting with any one of the following molecules:
- Embodiment 122 is the kit of embodiment 104. wherein the kit comprises a bifunctional crosslinker capable of being covalently linked to the engineered TdT and comprising a DBCO, wherein the DBCO of the bifunctional crosslinker is capable of reacting with any one of the following molecules:
- Embodiment 123 is the kit of embodiment 104, wherein the kit comprises a bifunctional crosslinker capable of being covalently linked to the engineered TdT and comprising a DBCO, wherein the DBCO of the bifunctional crosslinker is capable of reacting with any one of the following molecules:
- Embodiment 124 is the kit of embodiment 104, wherein the kit comprises one or more of the following molecules capable of being reacted with the engineered TdT, wherein the conjugation residue is a cysteine residue in the TdT:
- Embodiment 125 is the kit of embodiment 104, wherein the kit comprises one or more of the following molecules capable of being reacted with the engineered TdT, wherein the conjugation residue is a cysteine residue in the TdT:
- Embodiment 126 is the kit of embodiment 104, wherein the kit comprises one or more of the following molecules capable of being reacted with the engineered TdT, wherein the conjugation residue is a non-canonical ammo acid residue in the
- Embodiment 127 is the kit of embodiment 104, wherein the kit comprises one or more of the following molecules capable of being reacted with the engineered TdT, wherein the conjugation residue is a lysine residue in the TdT:
- Embodiment 128 is a system for enzymatic DNA synthesis comprising: a. the engineered TdT of any one of embodiments 34-93; b. a redox-cleavable linker; c. a redox shuttle solution; and d. two or more electrodes on a surface.
- Embodiment 129 is the system of embodiment 128, wherein the electrodes enable an applied potential or galvanic field to be supplied locally to the device or system thus enabling generation of active redox shuttle via reduction or oxidation at one or more electrodes,
- Embodiment 130 is the system of embodiment 128 or embodiment 129, wherein the electrode where the soluble redox shuttle is generated is comprised of a smaller surface area than the other electrode to which it is electrically coupled.
- Embodiment 131 is the system of any one of embodiments 128-130, wherein if the soluble redox shuttle is activated via cathodic process, then the cathode may be comprised of less active surface area than the anode by (a) having less electroactive area, or (b) being coupled to multiple anodes which in combination provide substantially more area than the cathode.
- Embodiment 132 is the system of any one of embodiments 128-131, wherein the surface, or electroactive, or area aspect ratios for the counter electrode relative to the working electrode, or C/E aspect ratio is at least: 3: 1, 6:1, 10: 1, 100: 1, 1000:1, or higher.
- Embodiment 133 is the system of any one of embodiments 128-132, wherein all the electrodes lie in a single plane.
- Embodiment 134 is the system of any one of embodiments 128-132, wherein a combination of electrodes may lie in multiple planes.
- Embodiment 135 is the system of any one of embodiments 128-132, wherein electrodes are oriented parallel to perpendicular with respect to one another during the current path,
- Embodiment 136 is the system of any one of embodiments 128-135, wherein the system has an inter-electrode gap (i.e., distance between electrodes included in the current path) of ⁇ 100 pm, ⁇ 10 pm, ⁇ 1 pm, ⁇ 100 pm, ⁇ 10 nm, ⁇ 1 nm, ⁇ 100 pm, ⁇ 10 pm, or ⁇ 1 pm.
- Embodiment 137 is the system of any one of embodiments 128-136, wherein the redox shuttle solution comprises at least one soluble redox shuttle in combination with a buffer, a supporting electrolyte, and/or a hydrotropic agent.
- Embodiment 138 is the system of embodiment 137, wherein the buffer is sodium citrate, potassium hydrogen phosphate, or potassium dihydrogen phosphate.
- Embodiment 139 is the system of embodiment 137 or 138, wherein the supporting electrolyte is sodium chloride, sodium salts, lithium salts, potassium salts, or magnesium salts.
- Embodiment 140 is the system of any one of embodiments 137-139, wherein the hydrotropic agent is caffeine, urea, and/or nicotinamide (NA).
- the hydrotropic agent is caffeine, urea, and/or nicotinamide (NA).
- Embodiment 141 is the system of any one of embodiments 128-136, wherein the redox shuttle solution comprises methylene blue, citrate-phosphate buffer, and sodium chloride.
- Embodiment 142 is the system of any one of embodiments 128-136, wherein the redox shuttle solution comprises methylene blue, citrate-phosphate buffer, sodium chloride, and nicotinamide.
- Embodiment 143 is the system of any one of embodiments 128-136, wherein the redox shuttle solution comprises flavin mononucleotide, citrate-phosphate buffer, sodium chloride, and nicotinamide.
- Embodiment 144 is the system of any one of embodiments 128-136, wherein the redox shuttle solution comprises 10 mM methylene blue, 25 mM citrate- phosphate buffer at pH 3.4, and 100 mM sodium chloride.
- Embodiment 145 is the system of any one of embodiments 128-136, wherein the redox shuttle solution comprises 10 mM methylene blue, 25 mM citrate- phosphate buffer at pH 3.4, 100 mM sodium chloride, and 1 M nicotinamide.
- Embodiment 146 is the system of any one of embodiments 128-136, wherein the redox shuttle solution comprises 50 mM flavin mononucleotide, 25 mM citrate- phosphate buffer at pH 3.4, 1 M sodium chloride, and 1 M nicotinamide.
- Embodiment 147 is a redox shuttle solution comprising at least one soluble redox shuttle in combination with one or more of the following: supporting electrolyte, buffer, and hydrotrope.
- Embodiment 148 is the redox shuttle solution of embodiment 147, comprising: 10 mM methylene blue, 25 mM citrate-phosphate buffer at pH 3.4, and 100 mM sodium chloride.
- Embodiment 149 is the redox shuttle solution of embodiment 147, comprising: 10 mM methylene blue, 25 mM citrate-phosphate buffer at pH 3.4, 100 mM sodium chloride, and 1 M nicotinamide.
- Embodiment 150 is the redox shuttle solution of embodiment 147, comprising: 50 mM flavin mononucleotide, 25 mM citrate-phosphate buffer at pH 3.4, 1 M sodium chloride, and 1 M nicotinamide.
- Embodiment 151 is a molecule comprising any one of the following structures:
- Embodiment 152 is a molecule comprising any one of the following structures:
- Embodiment 153 is a molecule comprising the following structure:
- Embodiment 154 is a molecule comprising the foilowing structure: wherein Z is any one of the following:
- Embodiment 155 is a molecule comprising the following structure: wherein Z is any one of the following:
- Embodiment 156 is a molecule comprising the following structure: wherein X is O or NH: wherein Y is H or OMe; and wherein Z is any one of:
- Embodiment 157 is a molecule comprising the following structure: wherein X is O or NH: wherein Y is H or OMe; and wherein Z is any one of:
- Embodiment 158 is a molecule comprising the following structure: wherein Y is H or OMe; and wherein Z is any one of:
- Embodiment 159 is a molecule covalently linked to a tether. comprising the following structure: wherein Y is H or OMe; and wherein Z is any one of:
- Embodiment 160 is a molecule comprising any one of the following structures:
- Embodiment 161 is a molecule comprising any one of the following structures:
- Embodiment 162 is a molecule comprising any one of the following structures:
- Embodiment 163 is a molecule comprising any one of the following structures:
- Embodiment 164 is a molecule comprising any one of the following structures:
- Embodiment 165 is a molecule comprising any one of the following structures:
- Embodiment 166 is a molecule comprising any one of the following structures:
- Figures 1A-C show a schematic of covalently attaching nucleotides to TdT via a redox cleavable linker (a quinone).
- Figure 1 A, Figure IB, and Figure I C refer to different time points.
- the enzyme-tethered nucleotide is added to the ssDNA strand, creating a covalent enzyme-ssDNA complex.
- Reductive cleavage of the quinone cleaves the ssDNA from the enzyme.
- Figure 2 shows conJ ugation sites in a PROSS-stabilized enzyme. Structures of the wild type mTdT shown in white (PDB 4127) aligned with the PROSS- stabilized enzyme (Prossl or Pl) shown in black. Conjugation sites D2.2, E29, E32, E37, R44, L93, KI 02, V152, P154, and N361 are labeled on the structure as well as the position on the bound nucleotide that will be tethered to the enzyme conjugation sites via a linker and attachment moiety on the nucleotide molecule.
- Figures 3A-B show expression and purification of construct Cl.
- Figure 3B shows an SEC chromatogram from Cl on Superdex200 10/300 GL column. The peak at 6.83 ml is at the void volume of the column and contains aggregate; the peak eluting at 15.78 ml is the expected elution volume for a 42 kDa monomer.
- Figures 4A-B show stabilized TdT variants have increased activity at elevated temperatures.
- Figure 4A shows a schematic of a free extension assay to assess TdT activity'. Enzyme is incubated with fluorophore-conjugated seed oligo and dNTPs at 37 °C for 20 minutes. Reactions are then analyzed via SDS-PAGE to assess the extension rate of the enzyme as seen by the migration of the extended oligo on the gel, larger fragments running slower at the top of the gel.
- Figure 4B show s results from a free extension assay run with wild type murine TdT (gray), the Pl stabilized variant (blue), and the P2 stabilized variant (red) at the indicated temperatures.
- the left-most lane show's that the starting fluorescent seed oligo runs at the bottom of the gel. Loss of activity for the WT enzyme is observed at 50 °C and above as seen by the shift in signal towards the bottom of the gel in those lanes. Contrarily, significant loss of activity is only observed for Pl above 55 °C and for P2 above 50 °C.
- Figure 5 show's an example of using a bifunctional linking molecule to install a reactive functional group on an enzyme.
- a maleimide-alkyne crosslinker is reacted with a cysteine residue.
- the enzyme is now modified with an alkyne.
- Figure 6 shows an example of using an enzyme that has been modified with an alkyne through a bifunctional crosslinker.
- a molecule with a nucleotide and cleavable linker can be attached via click chemistry.
- the conjugation occurs without additional reagents.
- Figures 7A-F show' deconvolved mass spectra for the Pl C7 variant before and after conjugation with sulfo-DBCO-maleimide and the azide-quinone-nucleotides.
- Figure 7 A show's a Pl C7 variant, unconjugated;
- Figure 7B shows a C7 variant reacted with sulfo-DBCO-maleimide;
- Figure 7C shows a C7 variant reacted with sulfo-DBCO-maleimide followed by azide-quinone-dATP;
- Figure 7D show's a C7 variant reacted with sulfo-DBCO- maleimide followed by azide-quinone-dGTP;
- Figure 7E show's a C7 variant reacted with sulfo-DBCO-maleimide followed by azide-quinone-dTTP;
- Figure 7F show's a sulfo- DBCO-maleimide followed by
- Figure 8 shows a graphic showing the potential and pH window' over which an activated mediator could be expected to cleave the redox-cleavable linker while avoiding the oxidation and reduction of w'ater.
- Figure 9 shows cyclic voltammograms for a selection of soluble redox shuttles (1 mM methylene blue or riboflavin) at 37°C in a buffered solution of IX TdT Storage Buffer (200 mM potassium phosphate pH 6.5, 100 mM sodium chloride).
- the working and counter electrodes are noble metals (platinum or gold) while the reference electrode is a silver/silver chloride (Ag/AgCl) couple.
- a 100 mV/sec scan rate is employed in the above example.
- Figure 10 shows cleavage of small molecule azide-quinone-nucleotide with leucomethylene blue.
- the reaction was monitored by LC-MS on an Agilent 1260 Infinity instrument with an Agilent 6120 Quadropole MS.
- Figures 11 A-B show ESI-MS (ESI-neg) data for the starting azide- quinone-nucleotide (azide-quinone-dCTP) ( Figure 11 A) and the resulting product (“cleaved pdt”) after treatment with leucomethylene biue ( Figure 11 B).
- Figures 12A-B show' ESI-MS (ESI-neg) data for the starting azide- quinone-nucleotide (azide-quinone-dCTP) ( Figure 12A) and the resulting product (“cleaved pdt”) after treatment with leucomethylene blue ( Figure 12B).
- Figure 13 show's cyclic voltammograms for three example formulations of redox shuttle (Formulation A: 10 mM methylene blue, 100 mM sodium chloride, 25 mM citrate-phosphate buffer, pH 3.4; Formulation B: 10 mM methylene blue, 100 mM sodium chloride, 1 M nicotinamide, 25 mM citrate-phosphate buffer, pH 3.4; Formulation C: 50 mM flavin mononucleotide, 1 M sodium chloride, 1 M nicotinamide, 25 mM citrate-phosphate buffer, pH 3.4) at room temperature.
- the working and counter electrodes are platinum while the reference electrode is a silver/silver chloride (Ag/AgCl) couple.
- a 100 mV/sec scan rate is employed in the above example.
- Figure 14 show's UV-Vis absorbance spectra for two variations of Formulation B (Dilution 1: 0,25 mM methylene blue, 100 mM sodium chloride, 2.5 mM nicotinamide, 25 mM citrate-phosphate buffer, pH 3.4; Dilution 2: 0.25 mM methylene blue, 100 mM sodium chloride, 100 mM nicotinamide, 25 mM citrate-phosphate buffer, pH 3.4).
- Figure 15 shows 1H-NMR spectra for 1 M nicotinamide in D2O with peak assignments.
- Figure 16 show's 1H-NMR spectra for 1 M nicotinamide with 100 mM methylene blue in D2O with peak assignments. Peaks B and E show' broadening in the presence of methylene blue. Note, the doublet at 7. 17 ppm is from methylene blue and is shown in greater detail in Figure 18.
- Figure 17 shows 1H-NMR spectra for 100 mM methylene blue in D2O with peak assignments.
- Figure 18 shows 1H-NMR spectra for 100 mM methylene blue with 1 M nicotinamide in D?.O with peak assignments. Peaks B’, C’, and D' broadening in the presence of nicotinamide. Note, the peaks at 7.32 ppm are from nicotinamide and are shown in greater detail in Figure 16.
- Figures 19A-C shows deconvoluted mass spectra for batch C7 protein conjugated to sulfo-DBCO-maleimide and azide-quinone-dTTP. The protein was split into batches and treated with either leucomethylene blue or methylene blue.
- Figure 19A shows deconvoluted mass spectra of the conjugated C7 variant.
- Figure 19B shows a C7-protein conjugated to sulfo-DBCO-maleimide/azide-quinone-dTTP and treated with leucomethylene blue.
- Figure 19C shows C7-protein conjugated to sulfo-DBCO-maleimide/azide-quinone- dTTP and treated with methylene blue.
- Figure 20 show's use of soluble redox shuttle to cleave the linker from the complexed DNA-enzyme complex.
- the redox mediator soluble redox shuttle
- the redox mediator is introduced into a system in one particular oxidation state, in this case, at a higher oxidation state. Applying a potential at the electrodes reduces the mediator.
- the reduced mediator interacts with the redox-cleavable linker (a quinone-oxime-ether is shown), cleaving the enzyme from the ssDNA.
- the newly attached nucleotide is left at the end of the ssDNA.
- Figure 21 shows a tris-glycine gel indicating binding of ssDNA to protein conjugates.
- the fluorescent oligonucleotide shifts upw-ard on gel upon covalent protein binding due to increase in molecular weight.
- Figure 22 show's a representation of experiment to demonstrate addition of nucleotides to an oligonucleotide using the TdT-nucleotide conjugates.
- a surface is functionalized with a capture oligo (Step 1), followed by hybridization of a seed oligo with an accessible 3’ end (Step 2), Conjugated TdT-dCTP is introduced and covalently binds the seed oligo via addition of its tethered nucleotide (Step 3). Unbound enzyme is then washed away (Step 4) and a soluble reducing agent (leucomethylene blue) is introduced to cleave the enzyme from the extended DNA (Step 5).
- a soluble reducing agent leucomethylene blue
- TdT conjugated to the next nucleotide to be added (here, TdT-dATP) is then introduced, resulting in a total addition of “CA” to the seed oligonucleotide.
- Figures 23A-B show' a distribution of synthesized sequences from a. 3- cycle, electrochemically controlled synthesis run.
- Figure 23A shows a schematic of the 3- cycle synthesis workflow adding “GCT.”
- Figure 23B shows average fraction target sequences, sequences with insertions, and sequences with deletions is shown with error bars representing the standard deviation across three replicates. 37% target sequence of “'GCT” was achieved.
- Figure 24 shows a 1H NMR spectrum of S (400 MHz
- Figures 25A-C show an HPLC chromatogram of the following compound I: .
- Figure 25B shows an HPLC chromatogram of compound II (a propargyl-amine substituted nucleotide analog) (expected cleavage product).
- Figure 25C shows an HPLC chromatogram of a crude reaction mixture when compound I was treated with leucomethylene blue (reducing agent).
- Figure 26 shows ESI-MS (negative mode) data for Boc-NH2O-PA- dGTP analog 5:
- Figure 27 shows ’‘ESI-MS (negative mode) for NH2O-P A-dGTP analog 6: [00208]
- Figure 28 shows ESI-MS (negative mode) data for the azide-quinone- dATP analog 21: i!
- Figure 29 shows ESI-MS (negative mode) data for the azide-quinone- dGTP analog
- Figure 30 shows ESI-MS (negative mode) data for the azide-quinone- dCTP compound 20:
- Figure 31 shows ESI-MS (negative mode) data for the azide-quinone- dTTP analog 22:
- Table 1 provides a listing of certain sequences referenced herein.
- ranges and amounts can be expressed as “about” a particular value or range. About also includes the exact amount. Hence “about 5 pL” means “about 5 pL” and also “5 pL.” Generally, the term “about” includes an amount that would be expected to be within experimental error, such as for example, within 15%, 10%, or 5%.
- nucleotide molecule refers to components of nucleic acids comprising a base, sugar, and phosphate moieties, both natural and unnatural, including nucleotides, ribonucleotides, and nucleotide analogs.
- a nucleotide refers to a molecule comprising a nucleoside and one or more phosphate groups.
- a nucleoside refers to a molecule comprising a nucleobase (e.g., adenine, thymine, cytosine, guanine, or uracil) and a five-carbon sugar (e.g., ribose or 2’ -deoxyribose).
- a nucleobase e.g., adenine, thymine, cytosine, guanine, or uracil
- a five-carbon sugar e.g., ribose or 2’ -deoxyribose
- Exemplary' natural nucleotides include, without limitation, ATP, UTP, CTP, GTP, ADP, UDP, CDP, GDP, AMP, UMP, CMP, GMP, dATP, dTTP, dCTP, dGTP, dADP, dTDP, dCDP, dGDP, dAMP, dTMP, dCMP, and dGMP.
- Exemplary natural deoxyribonucleotides include dATP, dTTP, dCTP, dGTP, dADP, dTDP, dCDP, dGDP, dAMP, dTMP, dCMP, and dGMP.
- Exemplary natural ribonucleotides include ATP, UTP, CTP, GTP, ADP, UDP, CDP, GDP, AMP, UMP, CMP, and GMP.
- the uracil base is uridine.
- a nucleotide analog, or unnatural nucleotide comprises a nucleotide which contains some type of modification to either the base, sugar, or phosphate moieties, such as, for example, a chemical modification. All chemical structures showing protonated triphosphates and/or sulfonates should be interpreted to include the protonated and the ionized salt forms in different buffers.
- affinity tag refers to a protein tag that can aid in purifying or detecting the protein it is attached to. Affinity tags are typically inserted into a target gene at the point of coding for expression at either the N or C terminus of the protein to be expressed.
- His-tag or “poly histidine tag” refers to a string of generally four, five, six, seven, eight, nine, ten, eleven, or twelve histidine residues, and in some cases, more.
- His-tag may refer to either the DNA sequence encoding the string of histidine residues or the string of histidine residues itself. His-tags may be useful for facilitating easy purification and detection of recombinant proteins. His-tags are typically inserted into a target gene at the point of coding for expression at either the N or C terminus of the protein to be expressed.
- GFP label or “GFP tag” or green fluorescent protein label” or “green fluorescent protein tag” refers to a fluorescent protein from Aequorea victoria (water jellyfish). GFP labels can be N-terminally and/or C-terminally fused to a wide variety' of proteins and are frequently' used as a fluorescent marker. A non-limiting example of a GFP amino acid sequence can be found, for example, in UniProt Entry P42212 • GFP AEQVI. It emits a green fluorescent signal if exposed to light.
- redox-cleavable linker refers to a molecule that is cleaved in response to either oxidation or reduction conditions.
- Redox-cleavable linkers may also include additional groups that increase the solubility of the redox-cleavable linker, such as polar functional groups (e.g., alcohol, amine, amide, carboxylic acid, sulfonic acid, and phosphate groups).
- polar functional groups e.g., alcohol, amine, amide, carboxylic acid, sulfonic acid, and phosphate groups.
- conjugation residue refers to an amino acid residue with a functional group capable of attaching or covalently linking to another molecule, such as a crosslinker or tether.
- conjugation residues include cysteine residues, lysine residues, arginine residues, or non-canonical amino acids residues.
- spacer refers to a molecule that may be used to link two other molecules, although spacers may be present by themselves or attached to only one other molecule.
- a spacer may be an organic spacer (e.g., an aliphatic spacer, an alkyl spacer, an aromatic spacer, an alkylene glycol, a polyethylene glycol, a carbohydrate such as a sugar, and the like).
- the spacer is a PEG spacer, a well-known inert spacer used in many biotechnological applications.
- a “bifunctional crosslinker” or “crosslinker” or “bifunctional linker” refers to a molecule that has at least two functional groups and is capable of reacting with one or more other molecules to covalently link them.
- the crosslinker has a functional group at each end. Examples of functional groups include, for example, maleimide groups and alkyne groups.
- tether or “tether molecule” refers to the molecule covalently linking/connecting./attaching a TdT to a nucleotide molecule, comprising a redox- cleavable linker, optionally comprising a spacer, and optionally comprising a bifunctional crosslinker.
- a “soluble redox shuttle” refers to a molecule that is soluble, and when in a particular oxidation state, is capable of reacting with a redox-cleavable linker, facilitating cleavage of the redox-cleavable linker.
- a “redox shuttle solution” is a solution that comprises a soluble redox shuttle and other components/formulants/additives, such as a supporting electrolyte, a buffer, and/or a hydrotrope/hydrotropic agent.
- sequence identity between first and second nucleic acid sequences indicates the percentage of nucleotides that are identical between the sequences; for example, if a first nucleic acid sequence is 95% identical to a second nucleic acid sequence, then the first nucleic acid sequence contains matches to 95% of the nucleotides in the second nucleic acid sequence.
- Sequence identity between two amino acid sequences indicates the percentage of amino acids that are identical between the sequences; for example, if a first amino acid sequence is 95% identical to a second amino acid sequence, then the first amino acid sequence contains matches to 95% of the nucleotides in the second amino acid sequence.
- the terms “% identical”, “% identity” or similar terms are intended to refer, in particular, to the percentage of nucleotides or amino acids which are determined to be identical using an optimal alignment between the sequences to be compared. Said percentage is purely statistical, and the differences between the two sequences may be but are not necessarily randomly distributed over the entire length of the sequences to be compared. Comparisons of two sequences are usually carried out by comparing said sequences after optimal alignment.
- the optimal alignment for a comparison may be carried out manually or with the aid of an appropriate algorithm such as the alignment algorithm by Needleman and Wunsch, 1970, J. Mol. Biol. 48, 443, or with the aid of computer programs using said algorithms (e.g., GAP, BESTFIT, and FASTA in Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Drive, Madison, Wis.).
- an appropriate algorithm such as the alignment algorithm by Needleman and Wunsch, 1970, J. Mol. Biol. 48, 443, or with the aid of computer programs using said algorithms (e.g., GAP, BESTFIT, and FASTA in Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Drive, Madison, Wis.).
- non-canonical ammo acid or “ncAA” or “non-canonical amino acid residue” or “nc” refer to an amino acid other than one of the 20 naturally occurring amino acids.
- exemplary' non-canonical ammo acids are described in Young et al., “Beyond the canonical 20 amino acids: expanding the genetic lexicon,” J. of Biological Chemistry 285(15): 11039-11044 (2010), the disclosure of which is herein incorporated by reference.
- n and m as they appear in chemical structures throughout the application refer to an integer from 1-12, unless defined otherwise.
- TdT genetically engineered terminal deoxynucleotidyl transferases
- the one or more mutations are introduced into the TdT and wherein, in some embodiments, some of the mutation(s) confer increased thermal stability to the TdT relative to the stability of the wild-type TdT (SEQ ID NO: 1).
- SEQ ID NO: 1 the sequence of the wild-type TdT
- ncAA non-canonical amino acid
- the genetically engineered TdT retains its enzymatic biological activity as a DM A polymerase to catalyze the addition of a nucleotide to a strand of nucleotides, thus extending a nucleotide strand in a controlled manner.
- TdTs with increased thermal stability are provided herein and are shown, for example, as sequences Pl and P2 in Table 1.
- the genetically engineered TdT enzymes are also referred to herein as TdT variants with increased thermal stability.
- the engineered TdT is capable of adding a single nucleotide molecule to a single stranded DNA molecule in an electrochemically controlled manner/reaction.
- mutations to the wild-type TdT are notated as, e.g., C7A, wherein the number represents the position/residue in the wildtype TdT amino acid sequence, the first letter (in this example, 'C(' ") represents the amino acid residue at that position in the wild-type TdT amino acid sequence, and the second letter (in this example, ‘"A”) represents the amino acid residue at that position in the mutant/engineered TdT.
- the engineered TdT comprises one or more mutations to a wild-type TdT of SEQ ID NO: 1, wherein: a) the one or more mutations comprise one or more of the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, Fl 19Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, D263H, G265Q, H268D, E270G, C290A, D293E
- the one or more mutations comprise one or more of the following mutations: C7A, QI 8K, L19K, D3IA, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, SI 00 A, L112P, Fl 19Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, D263H, G265Q, II268D, E270G, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E.
- the one or more mutations comprise 10, 15, 20, 25, 30, 35, 40, 45, 50, or more of the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, LI 12P, F119Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, D263H, G265Q, H268D, E270G, C290A, D293E, T317R, M323L, R335N, V336T, E3
- the one or more mutations comprise 50 or more of the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, F119Y, S127E, K128D, QI30R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, D263H, G265Q, H268D, E270G, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E.
- the one or more mutations comprise all of the following mutations: C7A, Q18K, L19K, D31 A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, F119Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q21 IK, F220W, Q223K, C230E, L233Q, Q242L, C256A, D263H, G265Q, H268D, E270G, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E.
- the one or more mutations consist of the following mutations: C7A, Q18K, L19K, D3IA, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, LI 12P, F119Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q2.23K, C230E, L233Q, Q242L, C256A, D263H, G265Q, H268D, E270G, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E.
- the engineered TdT comprises an amino acid sequence with 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100% identity to SEQ ID NO: 2.
- the engineered TdT comprises the ammo acid sequence of SEQ ID NO: 2.
- the amino acid sequence of the engineered TdT consists of the amino acid sequence of SEQ ID NO: 2.
- the amino acid sequence of the engineered TdT consists of the amino acid sequence of SEQ ID NO: 2 and a fluorescent protein label, optionally a green fluorescent protein (GFP) label.
- the amino acid sequence of the engineered TdT consists of the amino acid sequence of SEQ ID NO: 2 and an affinity tag, optionally a His-tag.
- the engineered TdT further comprises a fluorescent protein label, optionally a GFP label.
- the engineered TdT further comprises an affinity tag, optionally a His-tag.
- the engineered TdT comprises a conjugation mutation.
- the conjugation mutation allows a cysteine residue, a non- canonical amino acid residue, a lysine residue, or an arginine residue, all comprising a reactive functional group, to be installed at the recited position to allow one skilled in the art to conjugate a bifunctional crosslinker or a redox-cleavable linker to the amino acid at that position.
- the conjugation mutation replaces the ammo acid present, at the recited engineered TdT sequence.
- the amino acid at position 32 in the Pl sequence is E
- the TdT further comprises the conjugation mutation comprises a “C” at position 32.
- the recited engineered TdT sequence is the amino acid sequence of P l (SEQ ID NO: 2).
- the TdT further comprises the conjugation mutation E32C.
- the TdT further comprises the conjugation mutation E29C.
- the TdT further comprises the conjugation mutation E37C.
- the TdT further comprises the conjugation mutation V152C
- the TdT further comprises the conjugation mutation K102C.
- the TdT further comprises the conjugation mutation D22C. In some embodiments, the TdT further comprises the conjugation mutation R44C. In some embodiments, the TdT further comprises the conjugation mutation L93C. In some embodiments, the TdT further comprises the conjugation mutation P154C. In some embodiments, the TdT further comprises the conjugation mutation N361C.
- the engineered TdT comprises the amino acid sequence of any one of SEQ ID NOs: 11, 12, 13, and/or 15. In some embodiments, the engineered TdT consists of the amino acid sequence of any one of SEQ ID NOs: 11, 12, 13, and/or 15. In some embodiments, the engineered TdT comprises the amino acid sequence of SEQ ID NO: 15. In some embodiments, the engineered TdT consists of the amino acid sequence of SEQ ID NO: 15. In some embodiments, the TdT comprises the amino acid sequence of any one of SEQ ID NOs: 25, 26, 27, 28, and 29. In some embodiments, the TdT consists of the amino acid sequence of any one of SEQ ID NOs: 25, 26, 27, 28, and 29,
- an engineered TdT comprising the amino acid sequence of any one of SEQ ID Nos: 20, 21, 22, 23, and 24.
- the engineered TdT comprises one or more more mutations to a wild-type TdT of SEQ ID NO: I , wherein the one or more mutations comprise one or more of the following mutations: C7A, Q 18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, F119Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, a deletion of D263, a deletion of H264, a deletion of G265, a deletion of R266,
- the one or more mutations comprise 10, 15, 20, 25, 30, 35, 40, 45, 50, or more of the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, LI 12P, F119Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, MI91K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, a deletion of D263, a deletion of H264, a deletion of G265, a deletion of R266, a deletion of V267, a deletion of H268, a deletion of S269,
- the one or more mutations comprise 50 or more of the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, F1 I9Y, SI27E, K128D, QI30R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, a deletion of D263, a deletion of H264, a deletion of G265, a deletion of R266, a deletion of V267, a deletion of H268, a deletion of S269, a deletion ofE270,
- the one or more mutations comprise the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, F119Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, VI63I, SI64E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, a deletion of D263, a deletion of H264, a deletion of G265, a deletion of R266, a deletion of V267, a deletion of 11268, a deletion of S269, a deletion of E270, a deletion of K271,
- the one or more mutations consist of the following mutations: C7A, Q18K, L19K, D31 A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, F119Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q21 IK, F220W, Q223K, C230E, L233Q, Q242L, C256A, a deletion of D263, a deletion of H264, a deletion of G265, a deletion of R266, a deletion of V267, a deletion of H268, a deletion of S269, a deletion of E270, a deletion of K
- the engineered TdT comprises an amino acid sequence with 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100% identity to SEQ ID NO: 3.
- the engineered TdT comprises the ammo acid sequence of SEQ ID NO: 3.
- the amino acid sequence of the engineered TdT consists of the amino acid sequence of SEQ ID NO: 3.
- the amino acid sequence of the engineered TdT consists of the amino acid sequence of SEQ ID NO: 3 and a fluorescent protein label, optionally a GFP label. In some embodiments, the amino acid sequence of the engineered TdT consists of the amino acid sequence of SEQ ID NO: 3 and an affinity tag, optionally a His-tag. In some embodiments, the engineered TdT further comprises a fluorescent protein label, optionally a green fluorescent protein (GFP) label. In some embodiments, the engineered TdT further comprises an affinity tag, optionally a His-tag.
- GFP green fluorescent protein
- the engineered TdT comprises a conjugation mutation.
- the conjugation mutation allows a cysteine residue, anon- canonical ammo acid residue, a lysine residue, or an arginine residue, all comprising a reactive functional group, to be installed at the recited position to allow one skilled in the art to conjugate a bifunctional crosslinker or a redox-cleavable linker to the amino acid at that position.
- the conjugation mutation replaces the ammo acid present at the recited engineered TdT sequence.
- the amino acid at position 152 in the P2 sequence is V
- the TdT further comprises the conjugation mutation comprises a “C” at position 32
- the recited engineered TdT sequence is the ammo acid sequence of P2 (SEQ ID NO: 3).
- the TdT further comprises the conjugation mutation E32nc, wherein “nc” is a non-canonical ammo acid.
- the TdT further comprises the conjugation mutation V152nc, wherein nc is a non-canonical ammo acid.
- the TdT further comprises the conjugation mutation P154nc, wherein nc is a non-canonical ammo acid.
- the TdT further comprises the conjugation mutation E32C.
- the TdT further comprises the conjugation mutation V152C.
- the TdT further comprises the conjugation mutation P154C.
- the TdT further comprises the conjugation mutation E29C.
- the TdT further comprises the conjugation mutation E37C.
- the TdT further comprises the conjugation mutation R44C.
- the TdT further comprises the conjugation mutation K102C.
- the TdT comprises the amino acid sequence of SEQ ID NO: 4. In some embodiments, the TdT comprises the amino acid sequence of SEQ ID NO: 5. In some embodiments, the TdT comprises the amino acid sequence of SEQ ID NO: 6. In some embodiments, the TdT comprises the amino acid sequence of SEQ ID NO: 7. In some embodiments, the TdT comprises the amino acid sequence of SEQ ID NO: 9. In some embodiments, the TdT comprises the amino acid sequence of SEQ ID NO: 10. In some embodiments, the TdT comprises the amino acid sequence of SEQ ID NO: 16. In some embodiments, the TdT comprises the amino acid sequence of SEQ ID NO: 17, In some embodiments, the TdT comprises the amino acid sequence of SEQ ID NO: 18. In some embodiments, the TdT comprises the amino acid sequence of SEQ ID NO: 19.
- an engineered TdT is capable of covalently binding to a nucleotide molecule at a conjugation residue of the engineered TdT via a tether.
- the engineered TdT is covalently bound to a nucleotide molecule via a tether using a two-step conjugation.
- a bifunctional crosslinker e.g., amaleimide-DBCO bifunctional crosslinker
- the maleinnde reacts with the cysteine residue, leaving a free DBCO.
- an azide-redox-cleavable linker covalently bound to a nucleotide molecule is reacted with the free DBCO on the TdT-bifunctional crosslinker molecule, forming a triazole moiety and completing the tether binding the engineered TdT to the nucleotide molecule.
- An example of a two-step reaction described in these embodiments is shown in Figures 5 and 6.
- molecules that can be directly conjugated to a cysteine residue on the TdT enzy me via a tether using a one-step conjugation without a bifunctional crosslinker are provided.
- an engineered TdT is provided, wherein the TdT is capable of covalently binding to a nucleotide molecule at a conjugation residue of the engineered TdT via a tether, wherein: (i) the conjugation residue comprises a cysteine residue, a non-canonical amino acid residue, a lysine residue, or an arginine residue capable of reacting with a bifunctional crosslinker or a redox-cleavable linker; and; (ii) wherein the nucleotide molecule comprises a nucleotide, nucleotide triphosphate, or a nucleotide analog.
- the TdT, tether, tether, and nucleotide molecule comprise any one of the following structures:
- the TdT, tether, and nucleotide molecule comprise any one of the following structures:
- the TdT, tether, and nucleotide molecule comprise any one of the following structures: wherein X is any one of:
- the TdT, tether, and nucleotide molecule comprise any one of the following structures: [00262] In some embodiments, the TdT, tether, and nucleotide molecule comprise any one of the following structures: wherein Z is any one of:
- the TdT, tether, and nucleotide molecule comprise any one of the following structures: wherein X is O or NH; wherein Y is H or OMe; and wherein Z is any one of:
- the TdT, tether, and nucleotide molecule compnse any one of the following structures: wherein X is O or NH; wherein Y is H or OMe; and wherein Z is any one of:
- the TdT, tether, and nucleotide molecule compnse any one of the following structures: wherein Y is H or OMe; and wherein Z is any one of: [00266] In some embodiments, the TdT, tether, and nucleotide molecule comprise any one of the following structures: wherein Y is H or OMe; and wherein Z is any one of:
- a kit comprising an engineered TdT is provided that is capable of covalently binding to a nucleotide molecule at a conjugation residue of the engineered TdT via a tether.
- the engineered TdT is covalently bound to a nucleotide molecule via a tether using a two-step conjugation.
- a bifunctional crosslinker e.g., a maleimide-DBCO bifunctional crosslinker
- the maleimide reacts with the cysteine residue, leaving a free DBCO.
- an azide-redox- cleavable linker covalently bound to a nucleotide molecule is reacted with the free DBCO on the TdT-bifunctional crosslinker molecule, forming a triazole moiety and completing the tether binding the engineered TdT to the nucleotide molecule.
- An example of a two-step reaction described in these embodiments is shown in Figures 5 and 6.
- kits comprising molecules that can be directly conjugated to a cysteine residue on the TdT enzyme via a tether using a one-step conjugation without a bifunctional crosslinker are provided.
- kits comprising an engineered TdT, wherein the TdT is capable of covalently binding to a nucleotide molecule at a conjugation residue of the engineered TdT via a tether, wherein the kit further comprises: (i) a bifunctional crosslinker and/or a redox-cleavable linker comprising a functional group for covalent conjugation to a conjugation residue of the TdT; and (ii) a nucleotide molecule.
- the kit further comprises a spacer.
- the functional group for covalent conjugation comprises a mal eimide, N-hydroxysuccinimidyl group, or a dibenzylcyclooctyne (DBCO).
- the functional group for covalent conjugation comprises a maleinnde.
- the maleimide is capable of attaching to a conjugation residue of the engineered TdT, wherein the conjugation residue is a cysteine residue in the TdT.
- the functional group for covalent conjugation comprises a DBCO.
- the DBCO is capable of attaching to a conjugation residue of the engineered TdT, wherein the conjugation residue is a non-canonical amino acid in the TdT.
- the non-canonical amino acid is 4-azidophenylalanine.
- the redox-cleavable linker comprises a quinone oxime ether linker. In some embodiments, the redox-cleavable linker comprises a disulfide linker. In some embodiments, the redox-cleavable linker comprises a syringic acid-based linker or vanillin-based linker.
- the spacer comprises one or more polyethylene glycol molecules (PEG)n, optionally wherein "‘n” is 1-12.
- the spacer comprises a Cl-Cl 0 alkyl.
- the spacer comprises a hydrocarbon chain wherein one or more of the carbons comprises a substituted sulfonate group.
- the spacer comprises a 4-aminobenzyl alcohol or an aminopropanol spacer.
- the spacer comprises an aminopropyl carbamate spacer.
- the spacer comprises a hydrocarbon chain comprising 1-6 carbons wherein one or more of the carbons comprises a substituted sulfonate group.
- the spacer comprises a C1-C6 alkyl.
- the kit comprises a bifunctional crosslinker capable of being covalently linked to the engineered TdT and comprising a DBCO, wherein the DBCO of the bifunctional crosslinker is capable of reacting with any one of the following molecules:
- the kit comprises a bifunctional crosslinker capable of being covalently linked to the engineered TdT and comprising a DBCO, wherein the DBCO of the bifunctional crosslinker is capable of reacting with any one of the foliowing
- the kit comprises a bifunctional crosslinker capable of being covalently linked to the engineered TdT and comprising a DBCO, wherein the DBCO of the bifunctional crosslinker is capable of reacting with any one of the following molecules:
- the kit comprises one or more of the following molecules capable of being reacted with the engineered TdT, wherein the conjugation residue is a cysteine residue in the TdT:
- the kit comprises one or more of the following molecules capable of being reacted with the engineered TdT, wherein the conjugation residue is a cysteine residue in the TdT:
- the kit comprises one or more of the following molecules capable of being reacted with the engineered TdT, wherein the conjugation residue is anon-canonical amino acid residue in the TdT:
- the kit comprises one or more of the following molecules capable of being reacted with the engineered TdT, wherein the conjugation residue is a lysine residue or an arginine residue in the TdT:
- the redox-cleavable linker is a quinone oxime ether linker.
- the electrochemical conditions of step (d) comprise one or more suitable soluble redox shuttles in the presence of two or more electrodes, wherein the two or more electrodes control the oxidation state of the soluble redox shuttles.
- the soluble redox shuttles comprise a thiazine, a viologen, dinucleotides, a flavin, a quinone, peroxide, bromide, vanadium and/or a metal complex.
- the soluble redox shuttles comprise a thiazine.
- the thiazine is methylene blue.
- the soluble redox shuttles comprise a viologen.
- the viologen is methyl viologen (MV) or bis- (tnmethylammonio) propyl viologen (BTMAP-Vi).
- the soluble redox shuttles comprise dinucleotides.
- the dinucleotides comprise NAD/NADH, nicotinamide adenine dinucleotide, and/or reduced form.
- the soluble redox shuttles comprise a flavin.
- the flavin is riboflavin (RF), flavin mononucleotide (FMN), or flavin adenine dinucleotide (FAD).
- the soluble redox shuttles comprise a quinone.
- the quinone is 2,5-dihydroxy-l,4-benzoquinone (DHBQ), 9,10-anthraquinone-2,7-disulfonic acid (AQDS), 4,5-dihydroxybenzene-l,3-disulfonic acid (BQDS), or 2,3, 5,6- tetrakis((dimethylamino)methyl)hydroquinone (FQH2).
- the soluble redox shuttles comprise peroxide. In some embodiments, the soluble redox shuttles comprise bromide. In some embodiments, the soluble redox shuttles comprise vanadium. In some embodiments, the soluble redox shuttles comprise a metal complex. In some embodiments, the metal complex is TEMPO (2,2,6,6-tetra.methylpiperidin-l-yl)oxyl, (2,2,6, 6- tetramethylpiperidin-l-yl)oxidanyl). In some embodiments, the metal complex is an organometallic coordination complex.
- the organometallic coordination complex is a ferrocene, bis-(trimethylammonio) propyl ferrocene (BTMAP-Fc), ammonium ferricyanide ((NH4>iFe(CN)6), potassium ferricyanide (K4Fe(CN)6), hexaamineruthenium (HI) chloride (RuHex), permanganate, or dichromate.
- BTMAP-Fc bis-(trimethylammonio) propyl ferrocene
- ammonium ferricyanide (NH4>iFe(CN)6)
- potassium ferricyanide K4Fe(CN)6)
- HI hexaamineruthenium chloride
- permanganate or dichromate.
- the electrochemical conditions of step (d) further comprise a buffer, a supporting electrolyte, and/or a hydrotropic agent.
- the buffer comprises sodium citrate, potassium hydrogen phosphate, and/or potassium dihydrogen phosphate.
- the supporting electrolyte comprises sodium chloride, sodium salts, lithium salts, potassium salts, and/or magnesium salts.
- the hydrotropic agent comprises caffeine, urea, and/or nicotinamide (NA).
- the electrochemical conditions of step (d) comprise a redox shuttle solution comprising methylene blue, citrate-phosphate buffer, and sodium chloride.
- the electrochemical conditions of step (d) comprise a redox shuttle solution comprising methylene blue, citrate-phosphate buffer, sodium chloride, and nicotinamide. In some embodiments, the electrochemical conditions of step (d) comprise a redox shuttle solution comprising flavin mononucleotide, citrate-phosphate buffer, sodium chloride, and nicotinamide. In some embodiments, the electrochemical conditions of step (d) comprise a redox shuttle solution comprising 10 mM methylene blue, 25 mM citrate-phosphate buffer at pH 3.4, and 100 mM sodium chloride.
- the electrochemical conditions of step (d) comprise a redox shuttle solution comprising 10 mM methylene blue, 25 mM citrate-phosphate buffer at pH 3.4, 100 mM sodium chloride, and 1 M nicotinamide. In some embodiments, the electrochemical conditions of step (d) comprise a redox shuttle solution comprising 50 mM flavin mononucleotide, 25 mM citrate-phosphate buffer at pH 3.4, 1 M sodium chloride, and 1 M nicotinamide.
- the potential for cleavage is about +1.05 vs. SHE to -0.18V vs. SHE at 25°C and pH 3; +0.84 V vs. SHE to -0.38 V vs. SHE at 25°C and pH 6.5; or +0.64V vs. SHE to -0.59V vs. SHE at 25°C and pH 10.
- the engineered TdT comprises one or more mutations to a wiid-type TdT of SEQ ID NO: 1, wherein: (a) the one or more mutations comprise one or more of the following mutations: C7A, QI8K, L19K, D31 A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L1 12P, F119Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M19IK, D208E, Q21 IK, F220W, Q223K, C230E, L233Q, Q242L, C256A, D263H, G265Q, H268D, E270G, C290A,
- the system for enzymatic DNA synthesis comprises: a) an engineered TdT described herein; b) a redox-cleavable linker; c) a redox shuttle solution; and d) two or more electrodes on a surface.
- the electrodes enable an applied potential or galvanic field to be supplied locally to the device or system thus enabling generation of active redox shuttle via reduction or oxidation at one or more electrodes.
- the electrode where the soluble redox shuttle is generated is comprised of a smaller surface area than the other electrode to which it is electrically coupled.
- the cathode may be comprised of less active surface area than the anode by (a) having less electroactive area, or (b) being coupled to multiple anodes which in combination provide substantially more area than the cathode.
- the surface, or electroactive, or area aspect ratios for the counter electrode relative to the working electrode, or C/E aspect ratio is at least: 3:1 , 6: 1, 10:1, 100: 1, 1000: 1, or higher.
- all the electrodes he in a single plane.
- a combination of electrodes may lie in multiple planes.
- the electrodes are oriented parallel to perpendicular with respect to one another during the current path.
- the system has an inter-electrode gap (i.e., distance between electrodes included in the current path) of ⁇ 100 pm, ⁇ 10 pm, ⁇ 1 pm, ⁇ 100 pm, ⁇ 10 nm, ⁇ 1 nm, ⁇ 100 pm, ⁇ 10 pm, or ⁇ 1 pm.
- the redox shuttle solution comprises at least one soluble redox shuttle in combination with a buffer, a supporting electrolyte, and/or a hydrotropic agent.
- the buffer is sodium citrate, potassium hydrogen phosphate, or potassium dihydrogen phosphate.
- the supporting electrolyte is sodium chloride, sodium salts, lithium salts, potassium salts, or magnesium salts.
- the hydrotropic agent is caffeine, urea, and/or nicotinamide (NA).
- the redox shutle solution comprises methylene blue, citratephosphate buffer, and sodium chloride.
- the redox shuttle solution comprises methylene blue, citrate-phosphate buffer, sodium chloride, and nicotinamide. In some embodiments, the redox shuttle solution comprises flavin mononucleotide, citratephosphate buffer, sodium chloride, and nicotinamide. In some embodiments, the redox shuttle solution comprises 10 niM methylene blue, 25 mM citrate-phosphate buffer at pH 3.4, and 100 inM sodium chloride. In some embodiments, the redox shuttle solution comprises 10 mM methylene blue, 25 rnM citrate-phosphate buffer at pH 3.4, 100 rnM sodium chloride, and 1 M nicotinamide. In some embodiments, the redox shutle solution comprises 50 mM flavin mononucleotide, 25 mM citrate-phosphate buffer at pH 3.4, 1 M sodium chloride, and 1 M nicotinamide.
- Table shows non-limiting examples of viable soluble redox shutles.
- SHE stands for standard hydrogen electrode.
- Table 3 shows non-limiting examples of shuttle solution fonnul ants/components/addi ti v es .
- a redox shuttle that is activated via its reduction at an electrode is employed, so that it may migrate to the cleavage site and transfer its electron(s) to the cleavage site linker bond thus being oxidized in the process of breaking that bond, so as to enable subsequent nucleotide addition.
- a redox shuttle that is activated via its oxidation at an electrode is employed, so that it may migrate to the cleavage site and harvest electron(s) from the cleavage site linker bond thus being reduced in the process of breaking that bond, so as to enable subsequent nucleotide addition.
- the redox-cleavable linker requires the same number of electrons per molecule, or bonds to break, as the redox shuttle requires to be activated at an electrode. In other embodiments, the redox-cleavable linker requires a different number of electrons per molecule, or bonds to break, as the redox shuttle requires to be activated at an electrode. In some embodiments, the redox shuttle requires 1, 2, 3, or 4 electrons per molecule in order to be activated (i.e., reduced or oxidized to sufficient degree as to act upon the redox-cleavable linker).
- shuttle solutions relying upon protonation or deprotonation are sufficiently buffered to prevent local pH changes and sluggish electrochemical kinetics.
- Some examples of biological buffers disclosed herein include potassium phosphate buffer, citrate buffer, or citrate-phosphate buffer.
- the electrochemical reaction at the working electrode may depend upon the use of an ionically conductive electrolyte between the working and counter electrodes.
- Supporting electrolytes, such as NaCI may be included to enable efficient kinetics.
- other salts based upon sodium, lithium, potassium, magnesium may be employed as supporting electrolytes.
- Other formulants may be used to increase the solubility’ of the redox shuttle so as to increase the rate of generation.
- some embodiments include amphiphilic hydrotropic agents, such as nicotinamide, caffeine, and urea, which are known to form weak molecular interactions with poorly soluble redox molecules, thereby increasing their stability' and solubility'-.
- amphiphilic hydrotropic agents such as nicotinamide, caffeine, and urea
- nicotinamide a biometric redox flow battery' based on flavin mononucleotide
- specific formulants can play multiple roles.
- the pyridinic N group in nicotinamide for example, has a pK a of 3.3, thereby enabling it to act as both a hydrotrope and a buffer when the solution pH is formulated close to its pKa value.
- the redox shuttle solution comprises at least one soluble redox shuttle in combination with one or more of the following: supporting electrolyte, buffer, and hydrotrope.
- the redox shuttle solution comprises: 10 mM methylene blue, 25 mM citrate-phosphate buffer at pH 3.4, and 100 mM sodium chloride.
- the redox shuttle solution comprises 10 mM methylene blue, 25 mM citratephosphate buffer at pH 3.4, 100 mM sodium chloride, and 1 M nicotinamide.
- the redox shuttle solution comprises: 50 mM flavin mononucleotide, 25 mM citrate-phosphate buffer at pH 3.4, 1 M sodium chloride, and 1 M nicotinamide.
- compositions, or formulations, of redox shuttles solutions comprising, in some embodiments, at least one redox shuttle in combination with one or more of the following constituent formulants --supporting electroly te, buffer, hydrotrope — are particularly useful for enabling enzymatic DNA synthesis in a device or system including two or more electrodes.
- Said electrodes enable an applied potential or galvanic field to be supplied locally to the device or system thus enabling generation or active redox shuttle via reduction or oxidation at one or more electrodes.
- the electrode wherein the redox shuttle is generated is comprised of a substantially smaller surface area than the other electrode to which it is electrically coupled, so as to mitigate deleterious, unintended, or undesired electrochemical reactions from occurring.
- Other embodiments of the device or system may employ electrode(s) wherein the redox shuttle is generated to be comprised of a substantially- smaller surface area by electrically coupling said electrode to more than one other electrode.
- the cathode may be comprised of less active surface area than the anode by (a) having less electroactive area, or (b) being coupled to multiple anodes which in combination provide substantially more area than the cathode so as to mitigate opportunity for deleterious, unintended, or undesired electrochemical reactions from occurring.
- Nonlimiting examples of surface, or electroactive, area aspect ratios for the counter electrode (i.e., electrode wherein no redox chemistry should occur) relative to the working electrode (i.e., electrode wherein the redox shuttle is to be generated), or C/E aspect ratio, are at least: 3: 1, 6: 1 , 10: 1 , 100: I, 1000: 1 , or higher.
- one or more reference electrodes may be employed to establish a potential difference between electrodes, however the aforementioned ratio relates to electroactive surface areas wherein the current path is involved.
- all the electrodes may lie in a single plane while in other embodiments a combination of electrodes may He in multiple planes, or be oriented parallel to perpendicular with respect to one another during the current path.
- Specific, non-limiting examples include an inter-electrode gap of any of the following distances (i.e., distance between electrodes included in the current path): . ⁇ 100 pm, ⁇ 10 pm, ⁇ 1 pm, ⁇ 100 nm, ⁇ 10 nm, ⁇ 1 nm, ⁇ 100 pm, ⁇ 10 pm, ⁇ 1 pm.
- Designing a device or system within these parameter ranges will likely enable a high degree of redox reversibility as exhibited by high Coulombic efficiency (i.e., charge transfer to the redox shuttle vs. other constituents) and high bond cleavage efficiency (i.e., migration of the active redox shuttle to the cleavage site followed by successful cleavage and subsequent nucleotide addition) over many cycles of redox shuttle activation, cleavage, and nucleotide addition.
- Coulombic efficiency i.e., charge transfer to the redox shuttle vs. other constituents
- high bond cleavage efficiency i.e., migration of the active redox shuttle to the cleavage site followed by successful cleavage and subsequent nucleotide addition
- Exemplary values of high Coulombic efficiency include >75%, >85%, >95%, >99%, >99.9%, and >99.99% while exemplary values of high bond cleavage efficiency include >50%, >60%, >75%, >90%, >99%, >99.9%. Designing a device or system within these parameter ranges will likely enable the device or system to W'ork over many cycles of redox shuttle activation, cleavage, and nucleotide addition on the order of >10X, >100X, >l,000X, >10,000X, >100,000X, >100,000,000X.
- the electrodes are activated by applying intermittent, pulse, or continuous voltage or current at one or more values to enable many cycles of redox shuttle activation, cleavage, and nucleotide addition. VIII. Nucleotide Molecutes
- Some embodiments comprise a nucleotide molecule.
- the molecule comprises any one of the foliowing structures:
- the molecule comprises any one of the following structures:
- the molecule comprises the following structure: wherein Z is any one of the following:
- the molecule comprises the following structure:
- the molecule comprises the following structure: wherein Z is any one of the following:
- the molecule comprises the following structure: wherein X is O or NH: wherein Y is H or OMe; and wherein Z is any one of:
- the molecule comprises the following structure: wherein X is O or NH; wherein Y is II or OMe; and wherein Z is any one of:
- the molecule comprises the following structure: wherein Y is H or OMe; and wherein Z is any one of:
- the molecule is covalently linked to a tether.
- a tether comprising the following structure: wherein Y is H or OMe; and wherein Z is any one of:
- the molecule comprises any one of the following structures:
- the molecule comprises any one of the following structures: structures:
- the molecule comprises any one of the foliowing structures:
- the molecule comprises any one of the foil owing structures :
- the molecule comprises any one of the following structures:
- the molecule comprises any one of the foil owing structures :
- the molecule comprises any one of the following structures:
- the molecule comprises any one of the following structures:
- TdT terminal deoxynucleotidyl tranferase
- C7 was engineered to site-specifically install/insert a cysteine (i.e., substitute another amino acid in a specified position of the wildtype protein) for conjugation of molecules to the TdT.
- cysteine i.e., substitute another amino acid in a specified position of the wildtype protein
- Other modifications were explored, but the C7 variant was the enzyme used in most studies.
- genes for all of the single conjugation site variants were cloned into the E. coll expression vector pET28b+.
- the cysteine variants yielded high soluble expression from BL21(DE3) cells via IPTG induction at 30°C for 2 hours or overnight (16-20 hrs) at 15°C.
- the p-AzF variants were co-transformed into BL21(DE3) cells with pEvol-pAzFRS. l.tl (Addgene 73547) containing genes for expression of the tRNA synthetase and tRNA necessary' to incorporate p-AzF at the amber codon TAG.
- TdT variants The ability' of TdT variants to add nucleotides to a ssDNA chain, called a seed oligo, was tested via incubation of the TdT variant, a seed oligo with a fluorescent dye, and a mixture of 4 2 ’-deoxy nucleotide triphosphates.
- the enzyme activity was tested at 37 °C, 50 °C, 55 °C, and 60 "C ( Figure 4B). Both Pl and P2 exhibited greater activity' than WT mTdT at temperatures >50 °C, with Pl maintaining activity even at 55 °C.
- Nucleotides can be conjugated to the enzyme via redox-cleavable linkers. Structures of four azide-redox-linker-nucleotide molecules are depicted below; labeled A-D.
- the azide can be tethered to a nucleotide via redox-cleavable linker, such as a quinone oxime ether (A and B) or a quinone propionic acid linker with appropriate methyl group substitution, known as the trimethyl lock linker (C and D). Examples of molecules that contain a combination of an azide, a redox cleavable linker, and a nucleotide are shown below.
- A-D are structures of molecules containing an azide, redox cleavable linker (quinone) and nucleotide triphosphate.
- T (2‘- deoxythymidme triphosphate) is shown as a representative example but non-limiting nucleotide triphosphate.
- the quinone propionic acid linker with appropriate methyl group substitution known as the trimethyl lock linker (A and B) can be attached to a nucleotide using suitable intermediate spacers, such as the 4-aminobenzyl alcohol or an aminopropanol spacer. Upon reduction of the quinone, both the redox-cleavable and the intermediate spacer could be cleaved from the nucleotide.
- A-B are structures of molecules containing an azide, redox cleavable linker (quinone) and nucleotide triphosphate.
- T (2’- deoxythymidme triphosphate) is sho wn as a representative nucleotide triphosphate.
- Figure 25C shows an HPLC chromatogram of a crude reaction mixture when compound I was treated wdth leucomethylene blue (reducing agent).
- Reducing agents including dithiothreitoi (DTT), sodium thiosulfate, sodium dithionite, and sodium bisulfite w'ere also tested. Treatment with the reducing agents did not lead to observation of expected cleavage products.
- the redox-cleavable linker could also consist of a disulfide linkage.
- Disulfides have reduction potentials well within the range that will not split water or interfere with nucleic acid chemistry. Disulfides may be used in bioconjugation chemistries and are reduced under physiological conditions (for example, using glutathione). Common reducing agents such as dithiothreitol and (tris(2-carboxyethyl)phosphine) (TCEP) are used to cleave disulfide bonds. Below are examples of azide-disulfide-nucleotide constructs. These molecules could also provide a means to tether a nucleotide to the enzyme through a redox cleavable linker.
- TCEP tris(2-carboxyethyl)phosphine
- ‘T’ (2’- deoxythymidine triphosphate) is shown as a representative nucleotide triphosphate.
- Another candidate for the redox-cleavable linker are substituted benzyl compounds, such as the synngic acid and vanillin family as depicted below. These linkers are cleaved through oxidative conditions that still fall within an acceptable potential. WO 2021/158412 Al; US 2022/0023820 Al. The proclivity toward cleavage can be altered via methoxy substitution on the ring (e.g., two meto-methoxy groups present in synngic acid analogs versus one /weto-methoxy group for vanillin analogs). These linkers can be tethered to nucleotides via carbamate/ urea linkages (A, B) or via an oxime linkage (C, D).
- A, B carbamate/ urea linkages
- C, D oxime linkage
- C) azide- vanillin/syringic-nucleotides consisting of an azide for enzyme attachment, attached via a polyethylene glycol chain where n 1 or more units, and a vanillin or syringic acid-based linker tethered to the nucleotide via an oxime linkage
- ‘"T” (2’- deoxythymidine triphosphate ) is shown as a representative but non-limiting nucleotide triphosphate.
- Nucleotides tethered to a quinone oxime ether and azide linkage can be synthesized according to Schemes 5-7 below.
- Commercially available modified nucleotides with a propargyl anime substituent e.g., 3
- Mass spectral data for compound 5 is shown in Figure 26. Removal of the Boc group using established protocols provides the aminooxy -substituted nucleotide 6.
- Mass spectral data for compound 6 is shown in Figure 2.7. Hutter, D. et al.
- a molecule containing both a quinone and an azide (deemed an azidequinone, such as 7, Schemes 6 and 7) was condensed with the aminooxygroup at acidic pH. Purification using reversed-phase chromatography yielded molecules of type 8. These molecules contained an azide, a quinone oxime ether, and a nucleotide and are deemed “azide-quinone-nucleotides.” An example synthesis is demonstrated for an analog of 2- deoxy guanosine-5 ’-triphosphate, but all four DNA nucleotide analogs can be synthesized using the representative protocol. Starting material structures and mass spectral data are provided for the four nucleotide analogs generated (Table 4).
- a quinone with a hexamethylene linker (compound 23, shown above) can be synthesized using methods disclosed in Park, S.; Westcot, N. P.; Luo, W.; Duto, D.; Yousaf, M. N. Bioconjugate Chemistry “General chemoselective and redox-responsive ligation and release strategy,” 25: 543-551 (2014).
- Compound 23 was synthesized as shown in Scheme 8.
- Table 4 shows aminooxy -substituted nucleotide starting materials and tabulated mass spectral data for azide-qumone nucleotide products. The mass spectral data is shown in Figures 28, 29, 30, and 31. 2. Protocols for Synthesis of Azide-Quinone Nucleotides
- a stock solution of NH2O-PA-dGTP 6 (800 uL of a 10 mM stock solution, 8 umol) was aliquoted into a separate 2-mL tube. The solution was placed on ice, and the pH was adjusted by adding MES buffer (256 uL), Keeping the solution on ice, the DMSO solution of 7 (400 uL at 20 mM, 8 umol) was slowly added. The solution was removed from ice and incubated at room temperature for 2.5 hours (protected from light).
- reaction mixture was quenched with 1 M TEAA and purified by direct loading onto a column (Biotage, 6 g C 18 column, 2% MeCN/HrO for 2 CV, gradient from 2% MeCN to 5% MecN for 3 CV; 5% to 15% MeCN for 3 CV; 15 to 50% MeCN for 5 CV; 50-95% MeCN for 4 CV.
- Fractions containing product were partially concentrated using a ThermoFisher Savant Speedvac, followed by lyophilization. Lyophilized materials were resuspended in water, and concentrations are determined using UV/ VIS. Concentrations were estimated based on extinction coefficients of the corresponding starting aminooxy-substituted triphosphates. Products were analyzed by LC- MS. In this case: MS (ESI) m/z calc for C26H33N9O17P3 836.13 [M-H]’, found 836.0.
- Acid 31 was converted to either the .V-hydroxysuccinimidyl ester (NHS ester) or the water-soluble sw/ ⁇ -TV-hydroxysuccinimidyl ester (sulfo-NHS ester) 32 through reaction with l-Ethyl-3-(3-dimethylaminopropyl)carbodiimide (EDCI) and the corresponding succinimide.
- the sw//o-NHS ester was then reacted with a propargylamino-substituted nucleotide in borate buffer (pH 8.5) to afford a compound containing an azide-qumone propionic acid (trimethyl lock) linker-nucleotide triphosphate construct.
- reaction mixture was protected from light and incubated at room temperature for 2 hours and purified by direct loading onto a column (Biotage, 6 g Cl 8 column, 2% MeCNZFhO for 2 CV, gradient from 2% MeCN to 5% MecN for 3 CV; 5% to 15% MeCN for 3 CV; 15 to 50% MeCN for 5 CV; 50-95% MeCN for 4 CV.
- Fractions containing product were concentrated via lyophilization. Lyophilized materials were resuspended in water, and concentrations were determined using UV/ VIS. Concentrations were estimated based on extinction coefficients of the corresponding starting propargylamine-substituted triphosphates.
- the products were analyzed by LCMS. MS (ESI) m/z calc for C28H37N7O16P3 820. 15 [M-H]", found 820.0.
- Compound 37 was synthesized by' addition of 3-aminopropan-l-ol 36 to 35 in the presence of diisopropylethylamine using A(A-dimethylformamide as a solvent. Resulting compound 37 was treated with A.A-disuccinimidyl carbonate (DSC) 38 in the presence of 4- dimethylaminopyridine (DMAP) to generate mixed carbonate 39, which was used crude.
- DSC A.A-disuccinimidyl carbonate
- DMAP 4- dimethylaminopyridine
- Crude mixed carbonate 39 was reacted with propargylamino-substituted nucleotides (in this case propargyl amine-dCTP 33) in a mixture of A r -methyl-2-pyrrolidone (NMP) and borate buffer (pH 8,5) to afford the resulting carbamate analog 40.
- propargylamino-substituted nucleotides in this case propargyl amine-dCTP 33
- NMP A r -methyl-2-pyrrolidone
- borate buffer pH 8,5
- Nucleotide analogs with a disulfide linker were synthesized by addition of either an AMiydroxysuccinimidyl ester 41 to an amine-substituted nucleotide (shown here as an analog of 2 ’-deoxy thy mi dine triphosphate (Scheme 14) or by preparation of mixed A’-hydroxysuccinimidyl carbonate 44 followed by addition to the corresponding propargylamine-substituted nucleotides 33 (Scheme 15) to generate carbamates such as compound 45.
- an AMiydroxysuccinimidyl ester 41 shown here as an analog of 2 ’-deoxy thy mi dine triphosphate (Scheme 14) or by preparation of mixed A’-hydroxysuccinimidyl carbonate 44 followed by addition to the corresponding propargylamine-substituted nucleotides 33 (Scheme 15) to generate carbamates such as compound 45.
- Scheme 15 Formation of and addition of a mixed carbonate of N- hydroxysuccinimide and an alcohol containing both a disulfide and an azide.
- the mixed carbonate is added to a propargylamine-substituted nucleotide to link the disulfide via a carbamate moiety.
- a 10 mM stock solution of azidoethyl-SS-propionic NHS ester 41 was prepared by dissolving 41 (3 mg, 0.01 mmol) into NMP (1 mL). In a separate tube, an aliquot of 10 mM of propargyl -amine-substituted dCTP (PA-dCTP) 33 (0.40 mL, 0.004 mmol) was diluted with 200 mM borate buffer, pH 8.5 (0.20 mL) and cooled in an ice bath for 10 minutes. The azidoethyl-SS-propionic NHS ester 41 stock solution (0.48 mL, 0.0048 mmol, 1.2 equivalents) was added slowly.
- PA-dCTP propargyl -amine-substituted dCTP
- a stock solution of mixed carbonate 44 was prepared at 100 mM concentration by dissolving 44 63 nig (0.2.0 mmol) into N-methyl-2-pyrrolidinone (NMP) (2.0 mL).
- NMP N-methyl-2-pyrrolidinone
- a solution of propargylamine-substituted dCTP supplied at 10 mM in water, 0.50 mL, 0.005 mmol was diluted with 200 mM borate buffer, pH 8.5 (500 uL).
- NMP N-methyl-2-pyrrolidinone
- reaction mixture was protected from light and incubated at room temperature for 2 hours and purified by direct loading onto a column (Biotage, 6 g Cl 8 column, 2% MeCN/H2O for 2 CV, gradient from 2% MeCN to 5% MecN for 3 CV; 5% to 15% MeCN for 3 CV; 15 to 50% MeCN for 5 CV; 50-95% MeCN for 4 CV.
- Fractions containing product were concentrated via lyophilization. Lyophilized materials were resuspended in water, and concentrations are determined using UV/VIS. Concentrations are estimated based on extinction coefficients of the corresponding starting propargylamino-substituted triphosphates.
- the products were analyzed by LCMS. MS (ESI) m/z calc for C17H25N7O15P3S2 724.01 [M-H]; found 724.0.
- the reaction mixture was cooled to room temperature before diluting the mixture with water (20 mL).
- the solution as then extracted twice with ethyl acetate (30 mL), combining the organic layers.
- the combined organic layers were washed with brine solution (25 ml) before drying the resulting mixture over anhydrous sodium sulfate and filtering.
- the crude mixture was concentrated under reduced pressure to give a light yellow oil.
- the residue was purified by flash column chromatography using ethyl acetate: hexane (v/v 1 : 9) to obtain 4-(2-azidoethoxy)-3-methoxybenzaldehyde 47.
- Protocol for reaction to generate vanillin oxime 48 A 50 mM stock solution of 4-(2 ⁇ azidoethoxy)-3-methoxybenzaldehyde 47 in NMP was prepared by dissolving aldehyde 47 (10 mg, 0.045 mmol into 900 pL of NMP). An aliquot of 55 mM aminoxy-substituted nucleotide (NH2O-PA-dGTP) 6 (91 pL, 0.005 mmol) was diluted with NMP (80 pL), and the pH was adjusted to between 2 and 3 using 1 M 4- Morpholineethanesulfonic acid (MES) (300 pL).
- MES 4- Morpholineethanesulfonic acid
- the solution was chilled on an ice bath for 15 minutes before slowly adding the 4-(2-azidoethoxy)-3-methoxybenzaldehyde solution (100 pL of 50 mM stock, 0.005 mmol). Once fully added and mixed, the solution was incubated in the dark on a mixer for 3 hours. After 3 hours and confirming conversion by HPLC, the crude mixture was quenched with 1 M triethylammonium acetate until the pH was approximately 7. The reaction mixture was purified by direct loading onto a column ( Biotage, 6 g Cl 8 column and eluted using an acetonitrile/water gradient. Fractions containing product (eluted at -10% acetonitrile/water) were concentrated via lyophilization.
- Lyophilized materials were resuspended in water, and concentrations were determined using UV/VIS. Concentrations w'ere estimated based on extinction coefficients of the corresponding starting aminooxy-substituted triphosphates. The products were analyzed by LCMS. MS (ESI) m/z calc for C26H31N9O17P3 834.11
- Protocol for reaction to generate syringic oxime Protocol for reaction to generate syringic oxime.
- the mixed carbonate of type 57 is reacted with propargyiamine-substituted nucleotide 3 to generate the corresponding carbamates of type 58.
- analogs of 2-deoxy guanosine-5’ -triphosphate are shown, but the synthesis would translate to other nucleotide analogs.
- model system 59 (shown below) (wherein the azidoethyl substituent is replaced with a methyl group).
- the model system was purified using reversed-phase chromatography (Cl 8 column, acetonitrile/ water gradient) followed by lyophilization. These conditions would be readily- translated to the system with the azidoethyl substituent described herein.
- MS (ESI) m'z calc for C23H30N4O18P3 743.08 [M-H]-, found 743.0.
- Model compound wherein a carbamate linkage connects a syringic acid analog to a nucleotide.
- the tethered small molecule can be a nucleotide triphosphate, and the small molecule can be tethered via a cleavable linker.
- the linker can be cleaved via electrochemical (redox) means. Synthesis and preparation of the small molecules (modified nucleotide triphosphates) that are covalently attached to the protein are described herein.
- DBCO dibenzylcyclooctyne
- azides such as a 4- azidophenylalanine (non-canonical amino acid) residue.
- a common intermediate hydroquinone (reduced form of quinone) was synthesized by aminolysis of commercially available lactone 9 with 3-azidopropylamine 10, in the presence of catalytic 2-hydroxy pyridine.
- the 1,4-hydroqumone 11 is oxidized with sodium periodate to form quinone 7.
- Compound 1 ⁇ can be purified using reverse-phase chromatography (Cl 8 column with an acetonitrile/water gradient); purified compound 18 can be condensed with modified nucleotide triphosphate 14 as described above.
- Scheme 2.0 Synthesis of the trifunctional system for covalently attaching nucleotides to 4-azidophenylaIanine (non-canomcal amino acid) residues through a redox-cleavable linker.
- MeCN acetonitrile
- THPTA /Fishy droxypropyltnazolylmethylamine (a Cu-ligand for accelerating click chemistry reactions).
- Trifunctional molecules are synthesized by copper-catalyzed click reactions between formed azide-quinone-nucleotides (tabulated in Table 4) and alkynes. Propargyl maleimides of different lengths were reacted with the azide-quinone-nucleotides (Scheme 21 and Scheme 22).
- Scheme 2.1 Synthesis of a trifunctional molecule containing a mal eimide, cleavable linker (quinone oxime ether) and a nucleotide triphosphate. Synthesis is accomplished by copper-catalyzed click chemistry of pre-formed azide-quinone- nucleotides and an alkyne.
- Scheme 22 Synthesis of a trifunctional molecule containing a maleimide, cleavable linker (quinone oxime ether) and a nucleotide triphosphate. Synthesis is accomplished by copper-catalyzed click chemistry of pre-formed azi de-quinone- nucleotides and an alkyne.
- Protocol for click reaction to generate compound 27 Note that the protocol is written for the 2-deoxyguanosine-5’-triphosphate analog but would translate to other nucleotide tn phosphates.
- a solution of azide-quinone-nucleotide 8 at 10 mM in H2O (100 pL, 0.001 mmol) was aliquoted into a tube, followed by a solution of Maleimide-PEG4- alkyne 12 (10 mM stock in water, 100 pL, 0.001 mmol).
- a 200 mM THPTA solution in water was mixed with a 100 mM CuSCh solution in water (10 pL each).
- the THPTA/CuSOr solution (8 pL, 0.0004 mmol CuSO-r and 0.0008 mmol THPTA) was added to the tube containing azide-quinone-nucleotide 8 and the maleimide-PEG4-alkyne 12.
- the reaction mixture was purified by chromatography (Biotage, 6 g C18 column, 2% MeCNZH2O for 2 CV, gradient from 2% MeCN to 5% MeCN for 3 CV ; 5% to 15% MeCN for 3 CV; 15 to 50% MeCN for 5 CV; 50-95% MeCN for 4 CV. Fractions were concentrated via lyophilization. Lyophilized materials are resuspended in water, and concentrations were determined using UV/VIS. Concentrations are estimated based on extinction coefficients of the corresponding starting aminooxy-substituted triphosphates. Products are analyzed by LC-MS.
- Protocol for click reaction to generate compound 29 Note that the protocol is written for the 2-deoxyguanosine-5 ’-triphosphate analog but would translate to other nucleotide triphosphates.
- a solution of azide-quinone-nucleotide 8 at 10 mM in HzO (100 uL, 0.001 mmol) was aliquoted into a tube, followed by a solution of A-propargyl maleimide 28 (50 mM stock in DMSO, 20 pL, 0.001 mmol).
- a 200 mM THPTA solution in water was mixed with a 100 mM CuSOr solution in water (5 pL each).
- the THPTA/CuSOr solution (5 uL, 0.00025 mmol CuSC>4 and 0.0005 mmol THPTA) was added to the tube containing azide-quinone-nucleotide 8 and A’-propargyl maleimide 28.
- a solution 100 mM sodium ascorbate in water (5 pL, 0.0005 mmol) was added to the tube, and the mixture was incubated at room temperature for 30 min.
- reaction mixture was quenched with 100 mM tn ethylammonium acetate buffer, then purified by chromatography (Biotage, 6 g C 18 column, 2% MeCN/H2O for 2 CV, gradient from 2% MeCN to 5% MeCN for 3 CV; 5% to 15% MeCN for 3 CV; 15 to 50% MeCN for 5 CV; 50-95% MeCN for 4 CV. Fractions were concentrated via lyophilization. Lyophilized materials was resuspended in water, and concentrations were determined using UV/VIS. Concentrations were estimated based on extinction coefficients of the corresponding starting aminooxy-substituted triphosphates. Products were analyzed by LC-MS. In this case: MS (ESI) m/z calc for C35H39N11O19P3 1010.16 [M-H] ⁇ , found 1010.0.
- a bifunctional crosslinker is first introduced to a naturally occurring ammo acid with specific reactivity, for example, a cysteine residue.
- Bifunctional molecules containing a maleimide, a spacer section, and a functional group for click chemistry can be used, for example, maleimide-PEG(n)-DBCO compounds (where DBCO stands for dibenzylcycooctyne).
- DBCO stands for dibenzylcycooctyne
- the maleimide reacts selectively with the cysteine residue, leaving a reactive DBCO moiety (Figure 5).
- Other bifunctional maleimide-alkynes include maleimide-PEG(n)-BCN compounds, where BCN stands for bicyclononyne.
- linking molecules can also be used to react with either naturally occurring or non-natural amino acids; for example, a crosslinker containing a A-hydroxysuccinimidyl group on one end and a clickable moiety (such as an azide or alkyne) at another site.
- a crosslinker containing a A-hydroxysuccinimidyl group on one end and a clickable moiety (such as an azide or alkyne) at another site The N- hydroxysuccinimidyl group could react with primary' amine functionalities, including lysine residues.
- crosslinker chemistries converts an ammo acid residue to a reactive functional group.
- this reactive functional group is a bioorthogonal functional group, meaning that the introduced functional group can be reacted exclusively without side reactions from other portions of the biomolecule.
- the method is demonstrated using maleimide-alkyne bifunctional crosslinkers. Incubation of the enzyme with a maleimide-alkyne crosslinker installs an alkyne - -the alkyne can be a DBCO group. The enzyme is then purified using a desalting or size-exclusion column to remove unconjugated small molecule. The enzyme can then be incubated with a second small molecule containing an azide; the azide reacts with DBCO functional groups without use of additional reagents ( Figure 6).
- the azide-qumone- nucleotide was added in 3-fold molar excess of the TdT-DBCO (e.g., 140 pM TdT-DBCO and 420 uM azide-quinone-nucleotide) in TdT Storage Buffer. The mixture was incubated at room temperature overnight (or incubated at room temperature for 4 hours, then moved to 4 °C if used >24 hours after the initial addition).
- the protein-linker-small molecule complex can be controlled via soluble electron (redox) shutles when placed in an environment with electrodes; the electrodes can be used to control the oxidation state of the soluble redox shuttle.
- soluble redox shuttles include methylene blue, quinones, NAD/NADH (nicotinamide adenine dinucleotide, and reduced form), flavins (e.g., riboflavin), viologens, TEMPO (2,2, 6,6- tetramethylpiperidin- 1 -yl)oxyl or (2,2,6,6-tetramethylpiperidin- 1 -yl)oxidanyl)/related compounds and certain organometallic coordination complexes, including ferrocenes, (NH4)4Fe(CN)6, and KrFetCNX When the soluble redox shuttle is in a particular oxidation state, it will react with the redox-cleavable linker, facilitating cleavage
- a soluble redox shuttle can be selected based on knowledge of the redox-cleavable linker used and desired operating pH. For example, an operating window' for cleavage of quinone oxime ethers at pH 6.5 is estimated to be between +0.85 V and -0.38 V at 25 °C (Figure 8). This estimate is based on the fact that DTT (dithiothreitol) (with potential of -0.27 V at pH 6.5) has been shown to cleave quinone oxime ethers. Park, S.; Westcott, N. P.; Luo, W.; Duto, D.; Yousaf, M. N. Bioconjugate Chemistry “General chemosei ective and redox-responsive ligation and release strategy,” 25: 543-551 (2014). Comparative cyclic voltammetry' data for a series of soluble redox shuttles is presented in Figure 9.
- the quinone oxime ether bond (redox-cleavable linker) was cleaved by soluble reducing agents, including leucomethylene blue (the reduced form of methylene blue). Cleavage was demonstrated both using the precursor small molecule azide-quinone nucleotide and on the full conjugate with TdT.
- An azide-quinone-nucleotide (azide-quinone- dCTP) in 200 mM phosphate, pH 6.5 with 100 mMNaCl was treated with leucomethylene blue (at 10-fold excess) at room temperature.
- exemplary' redox shuttle solution compositions are presented in Table 6 and comprise a redox shuttle (methylene blue or flavin mononucleotide) with a buffer, a supporting electrolyte, and a hydrotrope constituents).
- a redox shuttle methylene blue or flavin mononucleotide
- Analogous cyclic voltammograms for Formulations A, B, and C are shown in Figure 13. These examples demonstrate the range of reduction potentials and current densities accessible in the generation of active soluble redox shuttles, based on their identity and solution formulation.
- Methylene blue shows a distinct improvement in electrochemical kinetics between Formulations A and B, due to the addition of 1 M nicotinamide.
- AEp peak separation
- ko standard rate constant
- redox shuttle flavin mononucleotide
- NaCl electrolyte
- Figure 14 shows ultraviolet-visible light (UV-Vis) absorbance spectra of Formulation B dilutions (Dilution 1: 0.25 mM methylene blue and 2.5 mM nicotinamide; Dilution 2: 0.25 mM methylene blue and 100 mM nicotinamide).
- the peak at 605 nm is associated with the dimeric or tetrameric form, and the peak at 670 nm is attributed to the monomer.
- Fernandez- Perez, A. & Marban, G. “Visible Light Spectroscopic Analysis of Methylene Blue in Water; What Comes after Dimer?’' ACS Omega 5: 29801-2.9815 (2020).
- Suitable formulation design such as inclusion of buffering hydrotropic agents like nicotinamide, improves the stability of the soluble redox shuttle, supports rapid generation kinetics (i.e. electrochemical reversibility), and maximizes the efficiency of breaking the redox-cleavable linker.
- the conjugated protein was split into batches. One batch was treated with leucomethylene blue (soluble reducing agent); reduction was observed. One batch was treated with methylene blue (not in the active, reduced form); no change in protein mass was observed.
- TdT is covalently attached to a nucleotide triphosphate via a redox cleavable linker.
- the attachment and linker chemistry is designed so that the nucleotide triphosphate is accessible to the active site of the enzyme.
- the TdT will bind the ssDNA and incorporate the covalently attached linker to the 3’ end of the ssDNA, effectively tethering the ssDNA to the TdT protein by means of the newly incorporated nucleotide. This mechanism is also depicted in Figure 1.
- the TdT protein attached to the ssDNA would effectively block subsequent additions of nucleotide to the ssDNA strand.
- the ssDNA-TdT complex when in the presence of electrodes and a suitable redox shuttle ( Figure 20) would be electrochemically /redox- cleaved to generate ssDNA with the newly incorporated nucleotide, now' separated from the TdT/hnker.
- a fresh TdT covalently bound to nucleotide triphosphate would be introduced, allowing the next nucleotide in the sequence to be added to the growing chain.
- TdT proteins covalently conjugated to nucleotides can be used to incorporate multiple nucleotides onto the end of single-stranded DNA, using leucomethylene blue as a soluble reducing agent to cleave the protein from the ssDNA complex after nucleotide incorporation at each step (see Figure 1 for an illustration of this process).
- an oligonucleotide was anchored to a surface at the 3’ end; a second oligonucleotide was hybridized to that strand to generate a 3’ end available for extension via addition of nucleotides (Figure 22).
- the surface was outfitted with a gasket to allow for introduction of reagents.
- a second C7-nucleotide conjugate was added (with a different nucleotide).
- the starting C7- nucieotide conjugate was a result of reaction with azide-quinone-CTP in some cases or azide- quinone-dATP in other cases.
- the expected addition of the first nucleotide (either A or C) followed by the second nucleotide (either A or C) v/as observed Figure 22, Table 8). Addition of the appropriate nucleotides was determined via sequencing.
- Electrochemically controlled DNA synthesis using a nucleotide- conjugated enzyme was also tested in a 3-cycle synthesis run. As depicted in Figure 22, in each cycle of synthesis the enzyme conjugated to the nucleotide to be added was incubated with seed DNA attached to a surface surrounded by electrodes. Unbound enzyme was washed away and then a methylene blue solution was introduced. Upon applying a voltage to the electrodes, the methylene blue was reduced and cleaved the enzyme from the extended DNA. The cleaved enzyme was washed away and then the next enzyme conjugated to the next nucleotide to be added was introduced.
- nucleotides to the oligonucleotide was determined using sequencing techniques described here.
- the hybridized seed oligonucleotide was eluted from the surface via denaturation from the capture oligo using 0.1M NaOH and then subjected to the following protocol.
- Sequencing sample preparation began with polyadenylation (for sequences expected to end in thymine, cytosine, or guanine) or the addition of a poly(T) tail (for sequences expected to end in adenine) to the 3’ end of each DNA sample using commercial TdT.
- the samples were then amplified using primers that bind to a conserved 5’ sequence and the complementary 3’ tail while incorporating Illumina sequencing priming sites into the synthesized amplicons.
- This step w’as performed using qPCR in order to achieve optimal amplification of all samples regardless of input DNA concentration. A portion of each PCR product was analyzed by gel electrophoresis to confirm the size and relative abundance of the amplicons.
- the products of all samples were then normalized based on relative abundance and used as template for a second PCR reaction, which incorporated Illumina flow cell adaptors and a variable index into each sample.
- the products of the final PCR reaction were characterized by gel electrophoresis and all indexed samples were pooled together proportionally based on relative abundance. Size selection and purification of the pooled library w ? as performed by DNA gel extraction and the final library' was analyzed by Qubit dsDNA HS assay and Tapestation DI 000 screentape to determine the final molar concentration.
- the library was then diluted, denatured and sequenced following standard Illumina protocols.
- the term about refers to a numeric value, including, for example, whole numbers, fractions, and percentages, whether or not explicitly indicated.
- the term about generally refers to a range of numerical values (e.g., 4-7-5-10% of the recited range) that one of ordinary skill in the art would consider equivalent to the recited value (e.g., having the same function or result).
- the terms modify all of the values or ranges provided in the list.
- the term about may include numerical values that are rounded to the nearest significant figure.
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Biochemistry (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Molecular Biology (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- General Chemical & Material Sciences (AREA)
- Microbiology (AREA)
- Medicinal Chemistry (AREA)
- Electrochemistry (AREA)
- Materials Engineering (AREA)
- Metallurgy (AREA)
- Biomedical Technology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
Abstract
Provided herein are engineered terminal deoxynucleotidyl transferase (TdT) proteins with certain modifications, including mutations to confer thermal stability and to install an exposed amino acid residue to which a small molecule can be covalently tethered via bioconjugate chemistries such as click chemistry. Also provided herein are methods of nucleic acid molecule synthesis using engineered TdTs and nucleotide molecules attached to redox-cleavable linkers, wherein the engineered TdT incorporates the nucleotide molecule into a nucleic acid strand and is separated from the nucleotide molecule when the redoxcleavable linker is cleaved upon exposure to suitable electrochemical conditions. Also provided herein are engineered TdTs covalently attached to a nucleotide molecule via a tether and also nucleotide molecules comprising a redox-cleavable linker. Further provided herein are systems for enzymatic DNA synthesis comprising an engineered TdT, a redox-cleavable linker a redox shuttle solution, and two or more electrodes.
Description
COVALENTLY MODIFIED TEMPLATE-INDEFENDENT DNA POLYMERASE
AND METHODS OF USE THEREOF
CROSS- REFERENCE TO RELATED APPLICATIONS
[001] This application claims the benefit of priority of US Provisional Application No. 63/375,944, filed September 16, 2022, which is incorporated by reference herein in its entirety for any purpose,
STATEMENT OF GOVERNMENT RIGHTS
[002] This invention was made with Government support. The Government has certain rights in the invention.
SEQUENCE LISTING
[003] The instant application contains a Sequence Listing which has been submitted electronically in XML format and is hereby incorporated by reference in its entirety’. Said XML copy, created on August 21, 2023, is named “2023-08-21-01343-0001-00PCT- ST26”and is 37,935 bytes in size.
D E S C RI P T I O N
FIELD
[004] Provided herein are engineered terminal deoxynucleotidyl transferase (TdT) proteins with certain modifications, including mutations to confer thermal stability and to install an exposed amino acid residue to which a small molecule can be covalently tethered via bioconjugate chemistries such as click chemistry. Also provided herein are methods of nucleic acid molecule synthesis using engineered TdTs and nucleotide molecules attached to redox-cleavable linkers, wherein the engineered TdT incorporates the nucleotide molecule into a nucleic acid strand and is separated from the nucleotide molecule when the redox- cleavable linker is cleaved upon exposure to suitable electrochemical conditions. Also provided herein are engineered TdTs covalently attached to a nucleotide molecule via a tether and also nucleotide molecules comprising a redox-cleavable linker. Further provided herein are systems for enzymatic DNA synthesis comprising an engineered TdT, a redox-cleavable linker a redox shuttle solution, and two or more electrodes.
BACKGROUND
[005] DNA polymerases are enzymes responsible for the replication of genetic material in vivo and in vitro. Specifically, these enzymes are responsible for catalyzing the addition of nucleotide triphosphates (e.g,, dNTPs and analogs thereof) to the three-prime end of a primer or seed strand of DNA. The majority of DNA polymerases replicate DNA in a largely template-dependent manner. That is: synthesizing the reverse complement strand of a DNA strand. However, a few polymerases have template-independent activity, wherein they can synthesize random sequences of DNA without the influence or need of a template strand.
[006] Terminal deoxynucleotidyl transferase (hereinto referred to as TdT) is a DNA polymerase capable of catalyzing the random addition of nucleotides. In vivo, specifically in premature immune cells undergoing antibody and T cell receptor recombination, TdT acts in conjunction with DNA repair pathways to generate highly diverse sequences at VDJ junction sites. In vitro, TdT also displays template-independent activity, enabling its widespread use for applications such as poly A tailing of DNA. Because TdT does not require a DNA primer strand for DNA synthesis it is an ideal enzyme for in vitro DNA synthesis (see, e.g., Gouge, Jerome, et al. “Structures of intermediates along the catalytic cycle of terminal deoxy nucleotidyl transferase: dynamical aspects of the two-metal ion mechanism.” Journal of Molecular Biology 425.22 (2013): 4334-4352).
[007] However, TdT adds nucleotides in an uncontrolled manner. While this template independence provides a means to synthesize entirely novel sequences of DNA from scratch, to do so in a highly controlled and sequence specific mariner requires the control of the enzyme.
[008] The availability’ of a TdT enzyme capable of controllably adding or inserting a single nucleotide (mononucleotide) at a time would enable new DNA synthesis strategies not previously possible, with benefits over existing strategies, and in particular would enable control of the enzyme for the synthesis of sequence-specified DNA, RNA, or other nucleic acid molecules.
SUMMARY
[009] The present disclosure addresses, among other things, the problem of synthesizing any polymer wherein an enzyme (e.g., terminal deoxynucleotidyl transferase, referred to herein as TdT or Tdt) can mediate addition of monomers to a growing oligomer
chain. Provided herein, for example, are methods for controlling the sequence in which the monomer units are added. The present disclosure addresses, among other things, the problem of de novo DNA synthesis with sequence control, using enzymatic methods. Current methods for DN A'RN A synthesis rely on classical phosphoramidite chemistry, but this chemistry' is typically useful for sequences less than 200 base pairs in length and is prone to errors. Moreover, phosphoramidite chemishy requires use of toxic solvents and reagents.
[0010] Alternatively, use of enzymes to generate long sequences of DNA or RNA in a sequence-controlled manner could enable synthesis of long DN A'RNA sequences with limited error rates. De novo enzymatic DNA synthesis is enabling for many applications, including gene synthesis and biodefense (rapid bio-threat identification and countermeasure development). Enzy matic synthesis methods are performed in aqueous solutions, also providing an environmentally conscious solution for nucleic acid synthesis.
[0011] Provided herein, among other things, are compositions and methods wherein a small molecule is linked to an enzyme via a linker that can be electrochemically cleaved, e.g., by a redox electrochemical reaction. In some embodiments, the small molecule is a nucleotide, nucleotide triphosphate or a nucleotide analog tethered to TdT via a redox- cleavable linker such as a quinone oxime ether linker; in some embodiments the conjugation to the enzyme is affected by click chemistry (after introducing a click chemistry' moiety' via reacting a site-specific cysteine residue with an appropriately functionalized maleimide). In some embodiments, the TdT enzyme is incubated with a single-stranded DNA molecule; upon incubation, the enzy me-tethered nucleotide is covalently attached to the ssDNA. Upon cleavage of the quinone oxime ether residue via reduction chemistry’, the ssDNA with a newly incorporated base is cleaved from the enzyme.
[0012] Embodimen t 1 is a method of nucl eic acid molecule synthesis comprising the steps of:
(a) providing an engineered terminal deoxynucleotidyl transferase (TdT), wherein the engineered TdT comprises a nucleotide molecule covalently attached to the TdT via a redox-cleavable linker;
(b) providing a single-stranded nucleic acid molecule comprising a 5’ and a 3’ end;
(c) contacting the engineered TdT and the single-stranded nucleic acid molecule under conditions suitable for the TdT to bind to the 3’ end of the nucleic acid molecule and
form a TdT-nucleic acid strand complex, thereby incorporating the nucleotide molecule covalently linked to the TdT into the nucleic acid strand, resulting in the nucleic acid molecule tethered to the TdT and thereby blocking subsequent additions of a nucleotide molecule to the nucleic acid strand; and
(d) exposing the TdT-nucleic acid strand complex to suitable electrochemical conditions, wherein the redox-cleavable linker is cleaved thereby separating the TdT-redox- cleavable linker molecule from the nucleic acid strand with the newly incorporated nucleotide molecule, resulting in a nucleic acid strand with the newly incorporated nucleotide molecule; and
(e) repeating steps (a) through (d) thereby synthesizing a nucleic acid molecule.
[0013] Embodiment 2 is the method of embodiment 1, wherein the redox-cleavable linker is a quinone oxime ether linker.
[0014] Embodiment 3 is the method of embodiment 1 or embodiment 2, wherein the electrochemical conditions of step (d) comprise one or more suitable soluble redox shuttles in the presence of two or more electrodes, wherein the two or more electrodes control the oxidation state of the soluble redox shuttles.
[0015] Embodiment 4 is the method of any one of embodiments 1-3, wherein the soluble redox shuttles comprise a thiazine, a viologen, dinucleotides, a flavin, a quinone, peroxide, bromide, vanadium and/or a metal complex.
[0016] Embodiment 5 is the method of embodiment 4, wherein the soluble redox shuttles comprise a thiazine.
[0017] Embodiment 6 is the method of embodiment 4, wherein the soluble redox shuttles comprise a viologen.
[0018] Embodiment 7 is the method of embodiment any one of embodiments 1-3, wherein the soluble redox shuttles comprise dinucleotides.
[0019] Embodiment 8 is the method of embodiment 7, wherein the dinucleotides comprise NAD/NADH, nicotinamide adenine dinucleotide, and/or reduced form.
[0020] Embodiment 9 is the method of embodiment 4, wherein the soluble redox shuttles comprise a flavin.
[0021] Embodiment 10 is the method of embodiment 4, wherein the soluble redox shuttles comprise a quinone.
[0022] Embodiment 11 is the method of embodiment 4, wherein the soluble redox shuttles comprise peroxide.
[0023] Embodiment 12 is the method of embodiment 4 wherein the soluble redox shuttles comprise bromide.
[0024] Embodiment 13 is the method of embodiment 4, wherein the soluble redox shuttles comprise vanadium.
[0025] Embodiment 14 is the method of embodiment 4. wherein the soluble redox shuttles comprise a metal complex.
[0026] Embodiment 15 is the method of embodiment 5, wherein the thiazine is methylene blue.
[0027] Embodiment 16 is the method of embodiment 4, wherein the quinone is 2,5- dihydroxy-l,4-benzoquinone (DHBQ), 9,10-anthraquinone-2,7-disulfonic acid (AQDS), 4,5- dihydroxybenzene-l,3-disulfonic acid (BQDS), or 2,3, 5,6- tetrakis((dimethylamino)methyl)hydroquinone (FQHr).
[0028] Embodiment 17 is the method of embodiment 9, wherein the flavin is riboflavin (RF), flavin mononucleotide (FMN), or flavin adenine dinucleotide (FAD).
[0029] Embodiment 18 is the method of embodiment 6, wherein the viologen is methyl viologen (MV) or bis-(trimethylammonio) propyl viologen (BTMAP-Vi).
[0030] Embodiment 19 is the method of embodiment 14, wherein the metal complex is TEMPO (2,2,6,6-tetramethylpiperidin-l-yl)oxyl, (2,2,6,6-tetramethylpiperidin-l- yl)oxidanyl).
[0031] Embodiment 20 is the method of embodiment 14, wherein the metal complex is an organometallic coordination complex.
[0032] Embodiment 21 is the method of embodiment 20, wherein the organometallic coordination complex is a ferrocene, bis-(trimethylammomo) propyl ferrocene (BTMAP-Fc), ammonium ferricyanide ((NH4)4Fe(CN)6), potassium ferricyanide (K4Fe(CN)&), hexaamineruthemum (III) chloride (RuHex), permanganate, or dichromate.
[0033] Embodiment 22 is the method of any one of embodiments 1 -21, wherein the electrochemical conditions of step (d) further comprise a buffer, a supporting electrolyte, and/or a hydrotropic agent.
[0034] Embodiment 23 is the method of embodiment 22, wherein the buffer comprises sodium citrate, potassium hydrogen phosphate, and/or potassium dihydrogen phosphate.
[0035] Embodiment 24 is the method of embodiment 22, wherein the supporting electrolyte comprises sodium chloride, sodium salts, lithium salts, potassium salts, and/or magnesium salts.
[0036] Embodiment 25 is the method of embodiment 22, wherein the hydrotropic agent comprises caffeine, urea, and/or nicotinamide (NA).
[0037] Embodiment 26 is the method of any one of embodiments 1-3, wherein the electrochemical conditions of step (d) comprise a redox shuttle solution comprising methylene blue, citrate-phosphate buffer, and sodium chloride.
[0038] Embodiment 27 is the method of any one of embodiments 1-3, wherein the electrochemical conditions of step (d) comprise a redox shuttle solution comprising methylene blue, citrate-phosphate buffer, sodium chloride, and nicotinamide,
[0039] Embodiment 28 is the method of any one of embodiments 1-3, wherein the electrochemical conditions of step (d) comprise a redox shuttle solution comprising flavin mononucleotide, citrate-phosphate buffer, sodium chloride, and nicotinamide.
[0040] Embodiment 29 is the method of any one of embodiments 1-3, wherein the electrochemical conditions of step (d) comprise a redox shuttle solution comprising 10 mM methylene blue, 25 mM citrate-phosphate buffer at pH 3.4, and 100 mM sodium chloride.
[0041] Embodiment 30 is the method of any one of embodiments 1 -3, wherein the electrochemical conditions of step (d) comprise a redox shuttle solution comprising 10 mM methylene blue, 25 mM citrate-phosphate buffer at pH 3.4, 100 mM sodium chloride, and 1 M nicotinamide,
[0042] Embodiment 31 is the method of any one of embodiments 1-3, wherein the electrochemical conditions of step (d) comprise a redox shuttle solution comprising 50 mM flavin mononucleotide, 25 mM citrate-phosphate buffer at pH 3.4, 1 M sodium chloride, and 1 M nicotinamide.
[0043] Embodiment 32. is the method of any one of embodiments 1-31, wherein the potential for cleavage is about +1.05 vs. SHE to -0.18V vs. SHE at 25 °C and pH 3; +0.84 V vs. SHE to -0.38 V vs. SHE at 25°C and pH 6.5; or +0.64V vs. SHE to -0.59V vs. SHE at 25°C and pH 10.
[0044] Embodiment 33 is the method of any one of embodiments 1 --32. wherein the engineered TdT comprises one or more mutations to a wild-type TdT of SEQ ID NO: 1, wherein: a. the one or more mutations comprise one or more of the following mutations: C7A, QI8K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, Fl 19Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S 164E, M165Q, V172W, T173Q, D177G, L179T, M 191 K, D208E, Q21 IK, F220W, Q223K, C230E, L233Q, Q242L, C256A, D263H, G265Q, H268D, E270G, C290A, D2.93E, T317R, M323L, R335N, V336T, E339R, D353E, wherein position numbers are relative to SEQ ID NO: 1 ; or b. the one or more mutations comprise one or more of the following mutations: C7A, QI8K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, LI 12P, Fl 19Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M 191 K, D208E, Q21 IK, F220W, Q223K, C230E, L233Q, Q242L, C256A, a deletion of D263, a deletion ofH264, a deletion of G265, a deletion of R266, a deletion of V267, a deletion of H268, a deletion of S269, a deletion ot'E270, a deletion ofK271, a deletion of S272, Q274P, Q275S, E276P, a deletion of G279, a deletion of W280, a deletion of K281, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E, wherein position numbers are relative to SEQ ID NO: 1; or c. the engineered TdT has at least 85, 90, 91 , 92, 93, 94, 95, 96, 97, 98, 99, or 100% identity to SEQ ID NO: 2 or 3.
[0045] Embodiment 34 is an engineered terminal deoxynucleotidyl transferase (TdT) comprising one or more mutations to a wild-type TdT of SEQ ID NO: 1, wherein: a. the one or more mutations comprise one or more of the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, F1I9Y, S127E, KI28D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q21 IK, F220W, Q223K, C230E, 1 ,233Q, Q242L, C256A, D263H, G265Q, H268D, E270G, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E, wherein position numbers are relative to SEQ ID NO: 1; or b. the one or more mutations comprise one or more of the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L,
K99Q, S100A, LH2P, F119Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N 156T, VI 631, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, a deletion of D263, a deletion of H264, a deletion of G265, a deletion of R266, a deletion of V267, a deletion of H268, a deletion of S269, a deletion of E270, a deletion of K271, a deletion of S272, Q274P, Q275S, E276P, a deletion ot'G279, a deletion of W280, a deletion ot'K281, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E, wherein position numbers are relative to SEQ ID NO: 1; or c. the engineered TdT has at least 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100% identity to SEQ ID NO: 2 or 3.
[0046] Embodiment 35 is the engineered TdT of embodiment 34, wherein the one or more mutations comprise one or more of the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, Fl 19Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q21 IK, F220W, Q223K, C230E, L233Q, Q242L, C256A, D263H, G265Q, H268D, E270G, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E.
[0047 ] Embodiment 36 is the engineered TdT of embodiment 34 or embodiment 35, wherein the one or more mutations comprise 10, 15, 20, 25, 30, 35, 40, 45, 50, or more of the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, SI 00 A, L112P, Fl 19Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, VI 631, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, D263H, G265Q, H268D, E270G, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E.
[0048] Embodiment 37 is the engineered TdT of any one of embodiments 34-36, wherein the one or more mutations comprise 50 or more of the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, Fl 19Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M 191 K, D208E, Q21 IK, F220W, Q223K, C230E, L233Q, Q242L, C256A, D263H, G265Q, H268D, E270G, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E.
[0049] Embodiment 38 is the engineered TdT of any one of embodiments 34-37, wherein the one or more mutations comprise all of the following mutations: C7A, Q18K,
L19K, D31A, E35G, C40A, M44R, S47A. C68N, S75R, E78Q, G79E, 181 L, K99Q, S 100 A, LI 12P, Fl 19Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, D263H, G265Q, H268D, E270G, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E.
[0050] Embodiment 39 is the engineered TdT of any one of embodiments 34-38, wherein the one or more mutations consist of the following mutations: C7A, QI 8K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, F119Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, D263H, G265Q, H268D, E270G, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E.
[0051] Embodiment 40 is the engineered TdT of any one of embodiments 34-39, wherein the engineered TdT comprises an amino acid sequence with 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100% identity to SEQ ID NO: 2.
[0052] Embodiment 41 is the engineered TdT of any one of embodiments 34-40, wherein the engineered TdT comprises the ammo acid sequence of SEQ ID NO: 2,
[0053] Embodiment 42 is the engineered TdT of any one of embodiments 34-41, wherein the amino acid sequence of the engineered TdT consists of the ammo acid sequence of SEQ ID NO: 2.
[0054] Embodiment 43 is the engineered TdT of any one of embodiments 34-41, wherein the ammo acid sequence of the engineered TdT consists of the amino acid sequence of SEQ ID NO: 2 and a fluorescent protein label, optionally a green fluorescent protein (GFP) label ,
[0055] Embodiment 44 is the engineered TdT of any one of embodiments 34-41, wherein the amino acid sequence of the engineered TdT consists of the ammo acid sequence of SEQ ID NO: 2 and an affinity tag, optionally a His-tag.
[0056] Embodiment 45 is the engineered TdT of any one of embodiments 34-41, wherein the engineered TdT further comprises a fluorescent protein label, optionally a GFP label.
[0057] Embodiment 46 is the engineered TdT of any one of embodiments 34-41, wherein the engineered TdT further comprises an affinity tag, optionally a His-tag.
[0058] Embodiment 47 is the engineered TdT of any one of embodiments 34-46, wherein the TdT further comprises the conjugation mutation E32C.
[0059] Embodiment 48 is the engineered TdT of any one of embodiments 34-46, wherein the TdT further comprises the conjugation mutation E29C.
[0060] Embodiment 49 is the engineered TdT of any one of embodiments 34-46, wherein the TdT further comprises the conjugation mutation E37C.
[0061] Embodiment 50 is the engineered TdT of any one of embodiments 34-46, wherein the TdT further comprises the conjugation mutation V152C.
[0062] Embodiment 51 is the engineered TdT of any one of embodiments 34-46, wherein the TdT further comprises the conjugation mutation K102C.
[0063] Embodiment 52 is the engineered TdT of any one of embodiments 34-46, wherein the TdT further comprises the conjugation mutation D22C.
[0064] Embodiment 53 is the engineered TdT of any one of embodiments 34-46, wherein the TdT further comprises the conjugation mutation R44C.
[0065 ] Embodiment 54 is the engineered TdT of any one of embodiments 34-46, wherein the TdT further comprises the conjugation mutation L93C.
[0066] Embodiment 55 is the engineered TdT of any one of embodiments 34-46, wherein the TdT further comprises the conjugation mutation P154C.
[0067] Embodiment 56 is the engineered TdT of any one of embodiments 34-46, wherein the TdT further comprises the conjugation mutation N361C.
[0068] Embodiment 57 is the engineered TdT of any one of embodiments 34-38, wherein the TdT comprises the amino acid sequence of any one of SEQ ID NOs: 11, 12, 13, and/or 15.
[0069] Embodiment 58 is the engineered TdT of any one of embodiments 34-38, wherein the TdT comprises the ammo acid sequence of SEQ ID NO: 15.
[0070] Embodiment 59 is the engineered TdT of any one of embodiments 34-38, wherein the TdT comprises the amino acid sequence of any one of SEQ ID NOs: 25, 26, 27, 28, and 29.
[0071] Embodiment 60 is an engineered TdT comprising the amino acid sequence of any one of SEQ ID NOs: 20, 21, 22, 23, and 24.
[0072] Embodiment 61 is an engineered terminal deoxynucleotidyl transferase (TdT) consisting of the ammo acid sequence of SEQ ID NO: 15.
[0073] Embodiment 62. is the engineered TdT of embodiment 34, wherein the one or more mutations comprise one or more of the following mutations: C7A, QI 8K, L19K, D31 A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, 18 IL, K99Q, SI 00 A, Li 12P, F119Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V 1631, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q21IK, F220W, Q223K, C230E, L233Q, Q242L, C256A, a deletion of D263, a deletion of H264, a deletion of G265, a deletion of R266, a deletion of V267, a deletion of H268, a deletion of S269, a deletion of E270, a deietion of K271, a deletion of S272, Q274P, Q275S, E276P, a deietion of G279, a deletion ofW280, a deietion ofK281, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E, wherein position numbers are relative to SEQ ID NO: 1.
[0074] Embodiment 63 is the engineered TdT of embodiment 34 or 62, wherein the one or more mutations comprise 10, 15, 20, 25, 30, 35, 40, 45, 50, or more of the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, 1811.,, K99Q, SI 00A, LI 12P, Fl 19Y, S127E, K128D, Q130R, S 134 T, Q139R, C154P, N156T, \ J 631. S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, a deletion of D263, a deletion of'H264, a deletion of G265, a deletion of R266, a deletion of V267. a deletion of H268, a deletion of S269, a deletion of E270, a deletion of K271, a deletion of S272, Q274P, Q2.75S, E276P, a deletion of G279, a deletion of W280, a deletion of K281, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E, wherein position numbers are relative to SEQ ID NO: 1.
[0075] Embodiment 64 is the engineered TdT of embodiments 34, 62 or 63, wherein the one or more mutations comprise 50 or more of the following mutations: C7A, Q18K, L19K, D3IA, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, Fl 19Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V 1631, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, a deletion of D263, a deletion of H264, a deletion of G265, a deletion of R266, a deletion of V267, a deletion of H268, a deletion of S269, a deletion of E270, a deletion of K271, a deletion of S272, Q274P, Q275S, E276P, a deletion of G279, a deletion of W280, a deletion of K281, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E, wherein position numbers are relative to SEQ ID NO: 1.
[0076] Embodimen t 65 is the engineered TdT of any one of embodiments 34 or 62- 64, wherein the one or more mutations comprise the following mutations: C7A, Q18K,
L19K, D3IA, E35G, C40A, M44R, S47A. C68N, S75R, E78Q, G79E, 181 L, K99Q, S 100 A, LI 12P, Fl 19Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, MI65Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, a deletion of D2.63, a deletion ofH264, a deletion of G265, a deletion of R266, a deletion of V267, a deletion of H268, a deletion of S269, a deletion of E270, a deletion of K271, a deletion of S272, Q274P, Q275S, E276P, a deletion of G279, a deletion of W280, a deletion of K281, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E, wherein position numbers are relative to SEQ ID NO: 1.
[0077 ] Embodiment 66 is the engineered TdT of any one of embodiments 34 or 62-
65, wherein the one or more mutations consist of the following mutations: C7A, QI8K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, F119Y, S127E, K128D, QI30R, S134T, Q139R, C154P, N156T, V163I, SI64E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q21 IK, F220W, Q223K, C230E, L233Q, Q242L, C256A, a deletion of D263, a deletion of H264, a deletion of G265, a deletion of R266, a deletion of V267, a deletion of H268, a deletion of S269, a deletion of E270, a deletion of K271, a deletion of S272, Q274P, Q275S, E276P, a deletion of G279, a deletion of W280, a deletion of K281, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E, wherein position numbers are relative to SEQ ID NO: 1.
[0078 ] Embodiment 67 is the engineered TdT of any one of embodiments 34 or 62-
66, wherein the engineered TdT comprises an amino acid sequence with 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100% identity to SEQ ID NO: 3.
[0079] Embodiment 68 is the engineered TdT of any one of embodiments 34 or 62-
67, wherein the engineered TdT comprises the amino acid sequence of SEQ ID NO: 3.
[0080] Embodimen t 69 is the engineered TdT of any one of embodiments 34 or 62-
68, wherein the amino acid sequence of the engineered TdT consists of the ammo acid sequence of SEQ ID NO: 3.
[0081] Embodiment 70 is the engineered TdT of any one of embodiments 34 or 62- 68, wherein the ammo acid sequence of the engineered TdT consists of the amino acid sequence of SEQ ID NO: 3 and a fluorescent protein label, optionally a GFP label.
[0082] Embodiment 71 is the engineered TdT of any one of embodiments 34 or 62- 68, wherein the amino acid sequence of the engineered TdT consists of the ammo acid sequence of SEQ ID NO: 3 and an affinity' tag, optionally aHis-tag.
[0083] Embodiment 72. is the engineered TdT of any one of embodiments 34 or 62- 68, wherein the engineered TdT further comprises a fluorescent protein label, optionally a green fluorescent protein (GFP) label.
[0084] Embodiment 73 is the engineered TdT of any one of embodiments 34 or 62- 68, wherein the engineered TdT further comprises an affinity tag, optionally aHis-tag.
[0085] Embodiment 74 is the engineered TdT of any one of embodiments 34 or 62- 73, wherein the TdT further comprises the conjugation mutation E32nc, wherein nc is anon- canonical ammo acid.
[0086] Embodiment 75 is the engineered TdT of any one of embodiments 34 or 62- 73, wherein the TdT further comprises the conjugation mutation V152nc, wherein nc is a non-canonical amino acid.
[0087] Embodiment 76 is the engineered TdT of any one of embodiments 34 or 62- 73, wherein the TdT further comprises the conjugation mutation P154nc, wherein nc is a non- canonical amino acid,
[0088] Embodiment 77 is the engineered TdT of any one of embodiments 34 or 62- 73, wherein the TdT further comprises the conjugation mutation E32C.
[0089] Embodiment 78 is the engineered TdT of any one of embodiments 34 or 62- 73, wherein the TdT further comprises the conjugation mutation V152C.
[0090] Embodiment 79 is the engineered TdT of any one of embodiments 34 or 62- 73, wherein the TdT further comprises the conjugation mutation P154C.
[0091 ] Embodiment 80 is the engineered TdT of any one of embodiments 34 or 62- 73, wherein the TdT further comprises the conjugation mutation E29C.
[ 0092] Embodiment 81 is the engineered TdT of any one of embodiments 34 or 62- 73, wherein the TdT further comprises the conjugation mutation E37C.
[0093] Embodiment 82 is the engineered TdT of any one of embodiments 34 or 62- 73, wherein the TdT further comprises the conjugation mutation R44C.
[0094] Embodiment 83 is the engineered TdT of any one of embodiments 34 or 62- 73, wherein the TdT further comprises the conjugation mutation K102C.
[0095] Embodiment 84 is the engineered TdT of any one of embodiments 34 or 62- 64, wherein the TdT comprises the amino acid sequence of SEQ ID NO: 4.
[0096] Embodiment 85 is the engineered TdT of any one of embodiments 34 or 62- 64, wherein the TdT comprises the amino acid sequence of SEQ ID NO: 5.
[0097] Embodiment 86 is the engineered TdT of any one of embodiments 34 or 62- 64, wherein the TdT comprises the amino acid sequence of SEQ ID NO: 6.
[0098] Embodiment 87 is the engineered TdT of any one of embodiments 34 or 62- 64, wherein the TdT comprises the ammo acid sequence of SEQ ID NO: 7.
[0099] Embodiment 88 is the engineered TdT of any one of embodiments 34 or 62- 64, wherein the TdT comprises the amino acid sequence of SEQ ID NO: 9.
[00100] Embodiment 89 is the engineered TdT of any one of embodiments 34 or 62-64, wherein the TdT comprises the amino acid sequence of SEQ ID NO: 10.
[00101] Embodiment 90 is the engineered TdT of any one of embodiments 34 or 62-64, wherein the TdT comprises the amino acid sequence of SEQ ID NO: 16.
[00102] Embodiment 91 is the engineered TdT of any one of embodiments 34 or 62-64, wherein the TdT comprises the amino acid sequence of SEQ ID NO: 17.
[00103] Embodiment 92 is the engineered TdT of any one of embodiments 34 or 62-64, wherein the TdT comprises the amino acid sequence of SEQ ID NO: 18
[00104] Embodiment 93 is the engineered TdT of any one of embodiments 34 or 62-64, wherein the TdT comprises the amino acid sequence of SEQ ID NO: 19.
[00105] Embodiment 94 is the engineered TdT of any of embodiments 34-93, wherein the TdT is capable of covalently binding to a nucleotide molecule at a conjugation residue of the engineered TdT via a tether, wherein:
(i) the conjugation residue comprises a cysteine residue, anon-canonical amino acid residue, or a lysine residue capable of reacting with a bifunctional crosslinker or a redox- cleavable linker; and
(ii) wherein the nucleotide molecule comprises a nucleotide, nucleotide triphosphate, or a nucleotide analog.
[00106] Embodiment 95 is the engineered TdT of embodiment 94, wherein the TdT, tether, and nucleotide molecule comprise any one of the following structures:
[00107] Embodiment 96 is the engineered TdT of embodiment 94, wherein the
TdT, tether, and nucleotide molecule comprise any one of the following structures:
ZJ
[00108] Embodiment 97 is the engineered TdT of embodiment 94, wherein the
[00109] Embodiment 98 is the engineered TdT of embodiment 94, wherein the
TdT, tether, and nucleotide molecule comprise any one of the following structures:
wherein Z is any one of:
[00110] Embodiment 99 is the engineered TdT of embodiment 94, wherein the
TdT, tether, and nucleotide molecule comprise any one of the following structures:
wherein Z is any one of:
[00111] Embodiment 100 is the engineered TdT of embodiment 94, wherein the TdT, tether, and nucleotide molecule comprise any one of the following structures:
wherein X is O or Nil; wherein ¥ is H or OMe; and wherein Z is any one of:
[00112] Embodiment 101 is the engineered TdT of embodiment 94, wherein the TdT, tether, and nucleotide molecule comprise any one of the following structures:
wherein X is O or NH; wherein Y is H or OMe; and wherein Z is any one of:
[00113] Embodiment 102 is the engineered TdT of embodiment 94, wherein the TdT, tether, and nucleotide molecule comprise any one of the following structures:
wherein Y is H or OMe; and wherein Z is any one of:
[00114] Embodiment 103 is the engineered TdT of embodiment 94, wherein the TdT, tether, and nucleotide molecule comprise any one of the following structures:
wherein Y is H or OMe; and wherein Z is any one of:
[00115] Embodiment 104 is a kit comprising the engineered TdT of any one of embodiments 34-93, wherein the TdT is capable of covalently binding to a nucleotide molecule at a conjugation residue of the engineered TdT via a tether, wherein the kit further comprises:
(i) a bifunctional crosslinker and/or a redox-cleavable linker comprising a functional group for covalent conjugation to a conjugation residue of the TdT; and
(li) a nucleotide molecule.
[00116] Embodiment 105 is the kit of embodiment 104, wherein the kit further comprises a spacer.
[00117] Embodiment 106 is the kit of any one of embodiments 104-106, wherein the functional group for covalent conjugation comprises a maleimide, N- hydroxysuccinimidyl group, or a dibenzylcyclooctyne (DBCO).
[00118] Embodiment 107 is the kit of any one of embodiments 104-106, wherein the functional group for covalent conjugation comprises a maleimide.
[00119] Embodiment 108 is the kit of embodiment 107, wherein the maleimide is capable of ataching to a conjugation residue of the engineered TdT, wherein the conjugation residue is a cysteine residue in the TdT.
[00120] Embodiment 109 is the kit of any one of embodiments 104-106, wherein the functional group for covalent conjugation comprises a DBCO.
[00121] Embodiment 110 is the kit of embodiment 109, wherein the DBCO is capable of attaching to a conjugation residue of the engineered TdT, and wherein the conjugation residue is anon-canonical amino acid in the TdT.
[00122] Embodiment 111 is the kit of embodiment 110, wherein the non- canonical amino acid is 4-azidophenylalanine.
[00123] Embodiment 112 is the kit of any one of embodiments 104-111, wherein the redox-cleavable linker comprises a quinone oxime ether linker.
[00124] Embodiment 113 is the kit of any one of embodiments 104- 111, wherein the redox-cleavable linker comprises a disulfide linker.
[00125] Embodiment 114 is the kit of any one of embodiments 104-111, wherein the redox-cleavable linker comprises a syringic acid-based linker or van ill in-based linker.
[00126] Embodiment 115 is the kit of any one of embodiments 105-114, wherein the spacer comprises one or more polyethylene glycol molecules (PEG)«, optionally wherein “n” is 1-12.
[00127] Embodiment 116 is the kit of any one of embodiments 105-114, wherein the spacer comprises a Cl -CIO alkyl.
[00128] Embodiment 117 is the kit of any one of embodiments 105- 114, wherein the spacer comprises a hydrocarbon chain wherein one or more of the carbons comprises a substituted sulfonate group.
[00129] Embodiment 118 is the kit of any one of embodiments 105-114, wherein the spacer comprises a hydrocarbon chain comprising 1-6 carbons wherein one or more of the carbons comprises a substituted sulfonate group.
[00130] Embodiment 119 is the kit of any one of embodiments 105-114, wherein the spacer comprises a C1-C6 alkyl.
[00131] Embodiment 120 is the kit of embodiment 104, wherein the kit comprises a bifunctional crosslinker capable of being covalently linked to the engineered TdT and comprising a DBCO, wherein the DBCO of the bifunctional crosslinker is capable of reacting with any one of the following molecules:
[00132] Embodiment 121 is the kit of embodiment 104. wherein the kit comprises a bifunctional crosslinker capable of being covalently linked to the engineered TdT and comprising a DBCO, wherein the DBCO of the bifunctional crosslinker is capable of reacting with any one of the following molecules:
wherein X is C(O), C(O)NH, CH2, or O; and wherein n = 1 for X = C(O), n =1 for C(O)NH, n =1 for CH2, and n = 1-4 for X = O, wherein n is an integer.
[00133] Embodiment 122 is the kit of embodiment 104. wherein the kit comprises a bifunctional crosslinker capable of being covalently linked to the engineered TdT and comprising a DBCO, wherein the DBCO of the bifunctional crosslinker is capable of reacting with any one of the following molecules:
[00134] Embodiment 123 is the kit of embodiment 104, wherein the kit comprises a bifunctional crosslinker capable of being covalently linked to the engineered TdT and comprising a DBCO, wherein the DBCO of the bifunctional crosslinker is capable of reacting with any one of the following molecules:
[00135] Embodiment 124 is the kit of embodiment 104, wherein the kit comprises one or more of the following molecules capable of being reacted with the engineered TdT, wherein the conjugation residue is a cysteine residue in the TdT:
[00136] Embodiment 125 is the kit of embodiment 104, wherein the kit comprises one or more of the following molecules capable of being reacted with the engineered TdT, wherein the conjugation residue is a cysteine residue in the TdT:
[00137] Embodiment 126 is the kit of embodiment 104, wherein the kit comprises one or more of the following molecules capable of being reacted with the engineered TdT, wherein the conjugation residue is a non-canonical ammo acid residue in the
[00138] Embodiment 127 is the kit of embodiment 104, wherein the kit comprises one or more of the following molecules capable of being reacted with the engineered TdT, wherein the conjugation residue is a lysine residue in the TdT:
[00139] Embodiment 128 is a system for enzymatic DNA synthesis comprising: a. the engineered TdT of any one of embodiments 34-93; b. a redox-cleavable linker; c. a redox shuttle solution; and d. two or more electrodes on a surface.
[00140] Embodiment 129 is the system of embodiment 128, wherein the electrodes enable an applied potential or galvanic field to be supplied locally to the device or system thus enabling generation of active redox shuttle via reduction or oxidation at one or more electrodes,
[00141] Embodiment 130 is the system of embodiment 128 or embodiment 129, wherein the electrode where the soluble redox shuttle is generated is comprised of a smaller surface area than the other electrode to which it is electrically coupled.
[00142] Embodiment 131 is the system of any one of embodiments 128-130, wherein if the soluble redox shuttle is activated via cathodic process, then the cathode may be comprised of less active surface area than the anode by (a) having less electroactive area, or (b) being coupled to multiple anodes which in combination provide substantially more area than the cathode.
[00143] Embodiment 132 is the system of any one of embodiments 128-131, wherein the surface, or electroactive, or area aspect ratios for the counter electrode relative to the working electrode, or C/E aspect ratio is at least: 3: 1, 6:1, 10: 1, 100: 1, 1000:1, or higher.
[00144] Embodiment 133 is the system of any one of embodiments 128-132, wherein all the electrodes lie in a single plane.
[00145] Embodiment 134 is the system of any one of embodiments 128-132, wherein a combination of electrodes may lie in multiple planes.
[00146] Embodiment 135 is the system of any one of embodiments 128-132, wherein electrodes are oriented parallel to perpendicular with respect to one another during the current path,
[00147] Embodiment 136 is the system of any one of embodiments 128-135, wherein the system has an inter-electrode gap (i.e., distance between electrodes included in the current path) of <100 pm, <10 pm, <1 pm, <100 pm, <10 nm, <1 nm, <100 pm, <10 pm, or <1 pm.
[00148] Embodiment 137 is the system of any one of embodiments 128-136, wherein the redox shuttle solution comprises at least one soluble redox shuttle in combination with a buffer, a supporting electrolyte, and/or a hydrotropic agent.
[00149] Embodiment 138 is the system of embodiment 137, wherein the buffer is sodium citrate, potassium hydrogen phosphate, or potassium dihydrogen phosphate.
[00150] Embodiment 139 is the system of embodiment 137 or 138, wherein the supporting electrolyte is sodium chloride, sodium salts, lithium salts, potassium salts, or magnesium salts.
[00151] Embodiment 140 is the system of any one of embodiments 137-139, wherein the hydrotropic agent is caffeine, urea, and/or nicotinamide (NA).
[00152] Embodiment 141 is the system of any one of embodiments 128-136, wherein the redox shuttle solution comprises methylene blue, citrate-phosphate buffer, and sodium chloride.
[00153] Embodiment 142 is the system of any one of embodiments 128-136, wherein the redox shuttle solution comprises methylene blue, citrate-phosphate buffer, sodium chloride, and nicotinamide.
[00154] Embodiment 143 is the system of any one of embodiments 128-136, wherein the redox shuttle solution comprises flavin mononucleotide, citrate-phosphate buffer, sodium chloride, and nicotinamide.
[00155] Embodiment 144 is the system of any one of embodiments 128-136, wherein the redox shuttle solution comprises 10 mM methylene blue, 25 mM citrate- phosphate buffer at pH 3.4, and 100 mM sodium chloride.
[00156] Embodiment 145 is the system of any one of embodiments 128-136, wherein the redox shuttle solution comprises 10 mM methylene blue, 25 mM citrate- phosphate buffer at pH 3.4, 100 mM sodium chloride, and 1 M nicotinamide.
[00157] Embodiment 146 is the system of any one of embodiments 128-136, wherein the redox shuttle solution comprises 50 mM flavin mononucleotide, 25 mM citrate- phosphate buffer at pH 3.4, 1 M sodium chloride, and 1 M nicotinamide.
[00158] Embodiment 147 is a redox shuttle solution comprising at least one soluble redox shuttle in combination with one or more of the following: supporting electrolyte, buffer, and hydrotrope.
[00159] Embodiment 148 is the redox shuttle solution of embodiment 147, comprising: 10 mM methylene blue, 25 mM citrate-phosphate buffer at pH 3.4, and 100 mM sodium chloride.
[00160] Embodiment 149 is the redox shuttle solution of embodiment 147, comprising: 10 mM methylene blue, 25 mM citrate-phosphate buffer at pH 3.4, 100 mM sodium chloride, and 1 M nicotinamide.
[00161] Embodiment 150 is the redox shuttle solution of embodiment 147, comprising: 50 mM flavin mononucleotide, 25 mM citrate-phosphate buffer at pH 3.4, 1 M sodium chloride, and 1 M nicotinamide.
[00164] Embodiment 153 is a molecule comprising the following structure:
[00165] Embodiment 154 is a molecule comprising the foilowing structure:
wherein Z is any one of the following:
[00166] Embodiment 155 is a molecule comprising the following structure:
wherein Z is any one of the following:
[00167] Embodiment 156 is a molecule comprising the following structure:
wherein X is O or NH: wherein Y is H or OMe; and wherein Z is any one of:
[00168] Embodiment 157 is a molecule comprising the following structure:
wherein X is O or NH: wherein Y is H or OMe; and wherein Z is any one of:
[00169] Embodiment 158 is a molecule comprising the following structure:
wherein Y is H or OMe; and wherein Z is any one of:
[00170] Embodiment 159 is a molecule covalently linked to a tether. comprising the following structure:
wherein Y is H or OMe; and wherein Z is any one of:
wherein X is C(0), C(O)NH, CH2, or O; and wherein n :::: 1 for X :::: C(O), n :::1 for C(O)NH, n =1 for CH2, and n = 1-4 for X = O, wherein n is an integer.
[00176] Embodiment 165 is a molecule comprising any one of the following structures:
[00177] Embodiment 166 is a molecule comprising any one of the following structures:
[00178] Additional objects and advantages will be set forth in part in the description which follows, and in part will be understood from the description, or may be learned by practice. The objects and advantages will be realized and attained by means of the elements and combinations particularly pointed out in the appended claims.
[00179] It is to be understood that both the foregoing general description and the following detailed description are exemplars' and explanatory only and are not restrictive of the claims.
[00180] The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate one (several) embodiment^ ) and together with the description, serve to explain the principles described herein.
BRIEF DESCRIPTION OF THE DRAWINGS
[00181] Figures 1A-C show a schematic of covalently attaching nucleotides to TdT via a redox cleavable linker (a quinone). Figure 1 A, Figure IB, and Figure I C refer to different time points. Upon incubation with ssDNA, the enzyme-tethered nucleotide is added to the ssDNA strand, creating a covalent enzyme-ssDNA complex. Reductive cleavage of the quinone cleaves the ssDNA from the enzyme.
[00182] Figure 2 shows conJ ugation sites in a PROSS-stabilized enzyme. Structures of the wild type mTdT shown in white (PDB 4127) aligned with the PROSS- stabilized enzyme (Prossl or Pl) shown in black. Conjugation sites D2.2, E29, E32, E37, R44, L93, KI 02, V152, P154, and N361 are labeled on the structure as well as the position on the bound nucleotide that will be tethered to the enzyme conjugation sites via a linker and attachment moiety on the nucleotide molecule.
[00183] Figures 3A-B show expression and purification of construct Cl. Figure 3 A shows an SDS-PAGE gel of expression and purification in-process samples. Expected molecular weight of C l is 41 .7 kDa. Lanes: l=Protein Ladder, 2=Total cell lysate from 15°C expression, 3=Soluble cell lysate from 15°C expression, 4=Total cell lysate from 30°C expression, 5=Soluble cell lysate from 30°C expression, 6:=final purified protein. Figure 3B shows an SEC chromatogram from Cl on Superdex200 10/300 GL column. The peak at 6.83 ml is at the void volume of the column and contains aggregate; the peak eluting at 15.78 ml is the expected elution volume for a 42 kDa monomer.
[00184] Figures 4A-B show stabilized TdT variants have increased activity at elevated temperatures. Figure 4A: shows a schematic of a free extension assay to assess TdT activity'. Enzyme is incubated with fluorophore-conjugated seed oligo and dNTPs at 37 °C for 20 minutes. Reactions are then analyzed via SDS-PAGE to assess the extension rate of the enzyme as seen by the migration of the extended oligo on the gel, larger fragments running slower at the top of the gel. Figure 4B: show s results from a free extension assay run with wild type murine TdT (gray), the Pl stabilized variant (blue), and the P2 stabilized variant (red) at the indicated temperatures. The left-most lane show's that the starting fluorescent seed oligo runs at the bottom of the gel. Loss of activity for the WT enzyme is observed at 50 °C
and above as seen by the shift in signal towards the bottom of the gel in those lanes. Contrarily, significant loss of activity is only observed for Pl above 55 °C and for P2 above 50 °C.
[00185] Figure 5 show's an example of using a bifunctional linking molecule to install a reactive functional group on an enzyme. In this case a maleimide-alkyne crosslinker is reacted with a cysteine residue. The enzyme is now modified with an alkyne.
[00186] Figure 6 shows an example of using an enzyme that has been modified with an alkyne through a bifunctional crosslinker. A molecule with a nucleotide and cleavable linker can be attached via click chemistry. In the case of adding a DBCO moiety (dibenzylcyclooctyne), the conjugation occurs without additional reagents.
[00187] Figures 7A-F show' deconvolved mass spectra for the Pl C7 variant before and after conjugation with sulfo-DBCO-maleimide and the azide-quinone-nucleotides. Figure 7 A show's a Pl C7 variant, unconjugated; Figure 7B shows a C7 variant reacted with sulfo-DBCO-maleimide; Figure 7C shows a C7 variant reacted with sulfo-DBCO-maleimide followed by azide-quinone-dATP; Figure 7D show's a C7 variant reacted with sulfo-DBCO- maleimide followed by azide-quinone-dGTP; Figure 7E show's a C7 variant reacted with sulfo-DBCO-maleimide followed by azide-quinone-dTTP; and Figure 7F show's a sulfo- DBCO-maleimide followed by azide-quinone-dCTP.
[00188] Figure 8 shows a graphic showing the potential and pH window' over which an activated mediator could be expected to cleave the redox-cleavable linker while avoiding the oxidation and reduction of w'ater.
[00189] Figure 9 shows cyclic voltammograms for a selection of soluble redox shuttles (1 mM methylene blue or riboflavin) at 37°C in a buffered solution of IX TdT Storage Buffer (200 mM potassium phosphate pH 6.5, 100 mM sodium chloride). The working and counter electrodes are noble metals (platinum or gold) while the reference electrode is a silver/silver chloride (Ag/AgCl) couple. A 100 mV/sec scan rate is employed in the above example.
[00190] Figure 10 shows cleavage of small molecule azide-quinone-nucleotide with leucomethylene blue. The reaction was monitored by LC-MS on an Agilent 1260 Infinity instrument with an Agilent 6120 Quadropole MS. Separations w'ere performed using an Agilent Infinity Lab Poroshell EC-C18 column (4.6 x 100 mm, 2.7 um) using the following solvent system at a flow' rate of 0.5 mL/min: solvent A :::: 0.05 M
triethylammonium acetate (TEAA); solvent B = 20% MeCN/0.05M TEAA; gradient method: 90% A/10% B for 3 min; linear gradient form 90%A/10% B to 20%A/80% B from 3-5 min; linear gradient from 20%A/80% B from 5 min to 8 min; maintained at 100% B from 8 min to 20 min. Compounds were detected by UV absorption at 2.10 nm, 254 nm, 280 nm, or 320 nm. Molecular weight range 400-2000; capillary voltage 3750 (pos) and 3500 (neg).
[00191] Figures 11 A-B show ESI-MS (ESI-neg) data for the starting azide- quinone-nucleotide (azide-quinone-dCTP) (Figure 11 A) and the resulting product (“cleaved pdt”) after treatment with leucomethylene biue (Figure 11 B).
[00192] Figures 12A-B show' ESI-MS (ESI-neg) data for the starting azide- quinone-nucleotide (azide-quinone-dCTP) (Figure 12A) and the resulting product (“cleaved pdt”) after treatment with leucomethylene blue (Figure 12B).
[00193] Figure 13 show's cyclic voltammograms for three example formulations of redox shuttle (Formulation A: 10 mM methylene blue, 100 mM sodium chloride, 25 mM citrate-phosphate buffer, pH 3.4; Formulation B: 10 mM methylene blue, 100 mM sodium chloride, 1 M nicotinamide, 25 mM citrate-phosphate buffer, pH 3.4; Formulation C: 50 mM flavin mononucleotide, 1 M sodium chloride, 1 M nicotinamide, 25 mM citrate-phosphate buffer, pH 3.4) at room temperature. The working and counter electrodes are platinum while the reference electrode is a silver/silver chloride (Ag/AgCl) couple. A 100 mV/sec scan rate is employed in the above example.
[00194] Figure 14 show's UV-Vis absorbance spectra for two variations of Formulation B (Dilution 1: 0,25 mM methylene blue, 100 mM sodium chloride, 2.5 mM nicotinamide, 25 mM citrate-phosphate buffer, pH 3.4; Dilution 2: 0.25 mM methylene blue, 100 mM sodium chloride, 100 mM nicotinamide, 25 mM citrate-phosphate buffer, pH 3.4).
[00195] Figure 15 shows 1H-NMR spectra for 1 M nicotinamide in D2O with peak assignments.
[00196] Figure 16 show's 1H-NMR spectra for 1 M nicotinamide with 100 mM methylene blue in D2O with peak assignments. Peaks B and E show' broadening in the presence of methylene blue. Note, the doublet at 7. 17 ppm is from methylene blue and is shown in greater detail in Figure 18.
[00197] Figure 17 shows 1H-NMR spectra for 100 mM methylene blue in D2O with peak assignments.
[00198] Figure 18 shows 1H-NMR spectra for 100 mM methylene blue with 1 M nicotinamide in D?.O with peak assignments. Peaks B’, C’, and D' broadening in the presence of nicotinamide. Note, the peaks at 7.32 ppm are from nicotinamide and are shown in greater detail in Figure 16.
[00199] Figures 19A-C shows deconvoluted mass spectra for batch C7 protein conjugated to sulfo-DBCO-maleimide and azide-quinone-dTTP. The protein was split into batches and treated with either leucomethylene blue or methylene blue. Figure 19A shows deconvoluted mass spectra of the conjugated C7 variant. Figure 19B shows a C7-protein conjugated to sulfo-DBCO-maleimide/azide-quinone-dTTP and treated with leucomethylene blue. Figure 19C shows C7-protein conjugated to sulfo-DBCO-maleimide/azide-quinone- dTTP and treated with methylene blue.
[00200] Figure 20 show's use of soluble redox shuttle to cleave the linker from the complexed DNA-enzyme complex. The redox mediator (soluble redox shuttle) is introduced into a system in one particular oxidation state, in this case, at a higher oxidation state. Applying a potential at the electrodes reduces the mediator. The reduced mediator interacts with the redox-cleavable linker (a quinone-oxime-ether is shown), cleaving the enzyme from the ssDNA. The newly attached nucleotide is left at the end of the ssDNA.
[00201] Figure 21 shows a tris-glycine gel indicating binding of ssDNA to protein conjugates. The fluorescent oligonucleotide shifts upw-ard on gel upon covalent protein binding due to increase in molecular weight.
[00202] Figure 22 show's a representation of experiment to demonstrate addition of nucleotides to an oligonucleotide using the TdT-nucleotide conjugates. A surface is functionalized with a capture oligo (Step 1), followed by hybridization of a seed oligo with an accessible 3’ end (Step 2), Conjugated TdT-dCTP is introduced and covalently binds the seed oligo via addition of its tethered nucleotide (Step 3). Unbound enzyme is then washed away (Step 4) and a soluble reducing agent (leucomethylene blue) is introduced to cleave the enzyme from the extended DNA (Step 5). The cleaved enzyme is then washed away, leaving the seed on the surface with a single +C extension on its 3’ end (Step 6). TdT conjugated to the next nucleotide to be added (here, TdT-dATP) is then introduced, resulting in a total addition of “CA” to the seed oligonucleotide.
[00203] Figures 23A-B show' a distribution of synthesized sequences from a. 3- cycle, electrochemically controlled synthesis run. Figure 23A shows a schematic of the 3-
cycle synthesis workflow adding “GCT.” Figure 23B shows average fraction target sequences, sequences with insertions, and sequences with deletions is shown with error bars representing the standard deviation across three replicates. 37% target sequence of “'GCT” was achieved.
[00204] Figure 24 shows a 1H NMR spectrum of S (400 MHz,
DMSO-d6).
[00205] Figures 25A-C. Figure 25 A shows an HPLC chromatogram of the
following compound I: . Figure 25B shows an HPLC chromatogram of compound II (a propargyl-amine substituted nucleotide analog) (expected cleavage product). Figure 25C shows an HPLC chromatogram of a crude reaction mixture when compound I was treated with leucomethylene blue (reducing agent).
[00207] Figure 27 shows ’‘ESI-MS (negative mode) for NH2O-P A-dGTP analog 6:
[00208] Figure 28 shows ESI-MS (negative mode) data for the azide-quinone-
dATP analog 21: i!
DESCRIPTION OF THE EMBODIMENTS
I. Definitions
[00213] Unless defined otherwise, all technical and scientific terms used herein have the same meaning as is commonly understood by one of skill in the art to which the claimed subject matter belongs. It is to be understood that the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of any subject matter claimed. To the extent any material incorporated herein byreference is inconsistent with the express content of this disclosure, the express content controls. In tins application, the use of the singular includes the plural unless specifically stated otherwise. It must be noted that, as used in the specification and the appended claims, the singular forms “a,” “an” and “the” include plural referents unless the context clearly dictates otherwise. In this application, the use of “or” means “and/or” unless stated otherwise. Furthermore, use of the term “including” as well as other forms, such as “’include”, “includes,” and “included,” is not limiting.
[00214] Although various features of the invention may be described in the context of a single embodiment, the features may7 also be provided separately- or in any7 suitable combination. Conversely, although the invention may be described herein in the context of separate embodiments for clarity, the invention may also be implemented in a single embodiment.
[00215] Reference in the specification to “some embodiments”, “an embodiment”, “one embodiment” or “other embodiments” means that a particular feature, structure, or characteristic described in connection with the embodiments is included in at least some embodiments, but not necessarily7 all embodiments, of the inventions.
[00216] As used herein, ranges and amounts can be expressed as “about” a particular value or range. About also includes the exact amount. Hence “about 5 pL” means “about 5 pL” and also “5 pL.” Generally, the term “about” includes an amount that would be expected to be within experimental error, such as for example, within 15%, 10%, or 5%.
[00217] The section headings used herein are for organizational purposes only and are not to be construed as limiting the subject matter described.
[00218] As used herein, “nucleotide molecule” refers to components of nucleic acids comprising a base, sugar, and phosphate moieties, both natural and unnatural, including nucleotides, ribonucleotides, and nucleotide analogs. As used herein, a nucleotide refers to a
molecule comprising a nucleoside and one or more phosphate groups. As used herein, a nucleoside refers to a molecule comprising a nucleobase (e.g., adenine, thymine, cytosine, guanine, or uracil) and a five-carbon sugar (e.g., ribose or 2’ -deoxyribose). Exemplary' natural nucleotides include, without limitation, ATP, UTP, CTP, GTP, ADP, UDP, CDP, GDP, AMP, UMP, CMP, GMP, dATP, dTTP, dCTP, dGTP, dADP, dTDP, dCDP, dGDP, dAMP, dTMP, dCMP, and dGMP. Exemplary natural deoxyribonucleotides include dATP, dTTP, dCTP, dGTP, dADP, dTDP, dCDP, dGDP, dAMP, dTMP, dCMP, and dGMP. Exemplary natural ribonucleotides include ATP, UTP, CTP, GTP, ADP, UDP, CDP, GDP, AMP, UMP, CMP, and GMP. For natural RNA, the uracil base is uridine. A nucleotide analog, or unnatural nucleotide, comprises a nucleotide which contains some type of modification to either the base, sugar, or phosphate moieties, such as, for example, a chemical modification. All chemical structures showing protonated triphosphates and/or sulfonates should be interpreted to include the protonated and the ionized salt forms in different buffers.
[00219] As used herein, “affinity tag" refers to a protein tag that can aid in purifying or detecting the protein it is attached to. Affinity tags are typically inserted into a target gene at the point of coding for expression at either the N or C terminus of the protein to be expressed.
[00220] As used herein, “His-tag” or “poly histidine tag” refers to a string of generally four, five, six, seven, eight, nine, ten, eleven, or twelve histidine residues, and in some cases, more. As used herein, “His-tag” may refer to either the DNA sequence encoding the string of histidine residues or the string of histidine residues itself. His-tags may be useful for facilitating easy purification and detection of recombinant proteins. His-tags are typically inserted into a target gene at the point of coding for expression at either the N or C terminus of the protein to be expressed.
[00221 ] As used herein, “GFP label” or “GFP tag” or green fluorescent protein label” or “green fluorescent protein tag” refers to a fluorescent protein from Aequorea victoria (water jellyfish). GFP labels can be N-terminally and/or C-terminally fused to a wide variety' of proteins and are frequently' used as a fluorescent marker. A non-limiting example of a GFP amino acid sequence can be found, for example, in UniProt Entry P42212 • GFP AEQVI. It emits a green fluorescent signal if exposed to light.
[00222] As used herein, “redox-cleavable linker” refers to a molecule that is cleaved in response to either oxidation or reduction conditions. Redox-cleavable linkers may also include additional groups that increase the solubility of the redox-cleavable linker, such as polar functional groups (e.g., alcohol, amine, amide, carboxylic acid, sulfonic acid, and phosphate groups).
[00223] As used herein, “conjugation residue” refers to an amino acid residue with a functional group capable of attaching or covalently linking to another molecule, such as a crosslinker or tether. Examples of conjugation residues include cysteine residues, lysine residues, arginine residues, or non-canonical amino acids residues.
[00224] As used herein, “spacer” refers to a molecule that may be used to link two other molecules, although spacers may be present by themselves or attached to only one other molecule. A spacer may be an organic spacer (e.g., an aliphatic spacer, an alkyl spacer, an aromatic spacer, an alkylene glycol, a polyethylene glycol, a carbohydrate such as a sugar, and the like). In some embodiments, the spacer is a PEG spacer, a well-known inert spacer used in many biotechnological applications.
[00225] As used herein, a “bifunctional crosslinker” or “crosslinker” or “bifunctional linker” refers to a molecule that has at least two functional groups and is capable of reacting with one or more other molecules to covalently link them. In some embodiments, the crosslinker has a functional group at each end. Examples of functional groups include, for example, maleimide groups and alkyne groups.
[00226] As used herein, a “tether” or “tether molecule” refers to the molecule covalently linking/connecting./attaching a TdT to a nucleotide molecule, comprising a redox- cleavable linker, optionally comprising a spacer, and optionally comprising a bifunctional crosslinker.
[00227] As used herein, a “soluble redox shuttle” refers to a molecule that is soluble, and when in a particular oxidation state, is capable of reacting with a redox-cleavable linker, facilitating cleavage of the redox-cleavable linker.
[00228] As used herein, a “redox shuttle solution” is a solution that comprises a soluble redox shuttle and other components/formulants/additives, such as a supporting electrolyte, a buffer, and/or a hydrotrope/hydrotropic agent.
[00229] The disclosure provides nucleic acid sequences and amino acid sequences having a certain degree of identity to a given nucleic acid sequence or amino acid
sequence, respectively (a reference sequence). “Sequence identity” between first and second nucleic acid sequences indicates the percentage of nucleotides that are identical between the sequences; for example, if a first nucleic acid sequence is 95% identical to a second nucleic acid sequence, then the first nucleic acid sequence contains matches to 95% of the nucleotides in the second nucleic acid sequence. “Sequence identity” between two amino acid sequences indicates the percentage of amino acids that are identical between the sequences; for example, if a first amino acid sequence is 95% identical to a second amino acid sequence, then the first amino acid sequence contains matches to 95% of the nucleotides in the second amino acid sequence. The terms “% identical”, “% identity” or similar terms are intended to refer, in particular, to the percentage of nucleotides or amino acids which are determined to be identical using an optimal alignment between the sequences to be compared. Said percentage is purely statistical, and the differences between the two sequences may be but are not necessarily randomly distributed over the entire length of the sequences to be compared. Comparisons of two sequences are usually carried out by comparing said sequences after optimal alignment. The optimal alignment for a comparison may be carried out manually or with the aid of an appropriate algorithm such as the alignment algorithm by Needleman and Wunsch, 1970, J. Mol. Biol. 48, 443, or with the aid of computer programs using said algorithms (e.g., GAP, BESTFIT, and FASTA in Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Drive, Madison, Wis.).
[00230] As used herein, the terms “non-canonical ammo acid” or “ncAA” or “non-canonical amino acid residue” or “nc” refer to an amino acid other than one of the 20 naturally occurring amino acids. Exemplary' non-canonical ammo acids are described in Young et al., “Beyond the canonical 20 amino acids: expanding the genetic lexicon,” J. of Biological Chemistry 285(15): 11039-11044 (2010), the disclosure of which is herein incorporated by reference.
[00231 ] As used herein, the terms “n” and “m” as they appear in chemical structures throughout the application refer to an integer from 1-12, unless defined otherwise. II. TdT Enzymes
[00232] Described herein are various biologically active, genetically engineered terminal deoxynucleotidyl transferases (TdT) comprising one or more mutations, wherein the one or more mutations are introduced into the TdT and wherein, in some embodiments, some of the mutation(s) confer increased thermal stability to the TdT relative
to the stability of the wild-type TdT (SEQ ID NO: 1). In some embodiments, a different naturally occurring amino acid residue, or a non-canonical amino acid (ncAA) residue, is substituted for an amino acid residue in the wild-type TdT. As described herein, the genetically engineered TdT retains its enzymatic biological activity as a DM A polymerase to catalyze the addition of a nucleotide to a strand of nucleotides, thus extending a nucleotide strand in a controlled manner. Examples of TdTs with increased thermal stability are provided herein and are shown, for example, as sequences Pl and P2 in Table 1. The genetically engineered TdT enzymes are also referred to herein as TdT variants with increased thermal stability. In some embodiments, the engineered TdT is capable of adding a single nucleotide molecule to a single stranded DNA molecule in an electrochemically controlled manner/reaction.
[00233] In some embodiments, mutations to the wild-type TdT (SEQ ID NO: 1) are notated as, e.g., C7A, wherein the number represents the position/residue in the wildtype TdT amino acid sequence, the first letter (in this example, 'C(' ") represents the amino acid residue at that position in the wild-type TdT amino acid sequence, and the second letter (in this example, ‘"A”) represents the amino acid residue at that position in the mutant/engineered TdT.
[00234] In some embodiments, the engineered TdT comprises one or more mutations to a wild-type TdT of SEQ ID NO: 1, wherein: a) the one or more mutations comprise one or more of the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, Fl 19Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, D263H, G265Q, H268D, E270G, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E, wherein position numbers are relative to SEQ ID NO: 1; or b) the one or more mutations comprise one or more of the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, SI 00 A, L112P, Fl 19Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, VI 631, S164E, M165Q, V172.W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, a deletion of D263, a deletion of H264, a deletion of G265, a deletion of R266, a deletion of V267, a deletion of H268, a deletion of S269, a deletion of E270, a deletion of K271, a deletion of S272, Q274P, Q275S, E276P, a deletion of G279, a deletion
of W280, a deletion of K281, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E, wherein position numbers are relative to SEQ ID NO: 1 ; or c) the engineered TdT has at least 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100% identity to SEQ ID NO: 2 or 3.
[00235] In some embodiments, the one or more mutations comprise one or more of the following mutations: C7A, QI 8K, L19K, D3IA, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, SI 00 A, L112P, Fl 19Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, D263H, G265Q, II268D, E270G, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E.
[00236] In some embodiments, the one or more mutations comprise 10, 15, 20, 25, 30, 35, 40, 45, 50, or more of the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, LI 12P, F119Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, D263H, G265Q, H268D, E270G, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E.
[00237] In some embodiments, the one or more mutations comprise 50 or more of the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, F119Y, S127E, K128D, QI30R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, D263H, G265Q, H268D, E270G, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E.
[ 00238] In some embodiments, the one or more mutations comprise all of the following mutations: C7A, Q18K, L19K, D31 A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, F119Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q21 IK, F220W, Q223K, C230E, L233Q, Q242L, C256A, D263H, G265Q, H268D, E270G, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E.
[00239] In some embodiments, the one or more mutations consist of the following mutations: C7A, Q18K, L19K, D3IA, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, LI 12P, F119Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E,
Q211K, F220W, Q2.23K, C230E, L233Q, Q242L, C256A, D263H, G265Q, H268D, E270G, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E.
[00240] In some embodiments, the engineered TdT comprises an amino acid sequence with 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100% identity to SEQ ID NO: 2. In some embodiments, the engineered TdT comprises the ammo acid sequence of SEQ ID NO: 2. In some embodiments, the amino acid sequence of the engineered TdT consists of the amino acid sequence of SEQ ID NO: 2.
[00241] In some embodiments, the amino acid sequence of the engineered TdT consists of the amino acid sequence of SEQ ID NO: 2 and a fluorescent protein label, optionally a green fluorescent protein (GFP) label. In some embodiments, the amino acid sequence of the engineered TdT consists of the amino acid sequence of SEQ ID NO: 2 and an affinity tag, optionally a His-tag. In some embodiments, the engineered TdT further comprises a fluorescent protein label, optionally a GFP label. In some embodiments, the engineered TdT further comprises an affinity tag, optionally a His-tag.
[00242] In some embodiments, the engineered TdT comprises a conjugation mutation. In some embodiments, the conjugation mutation allows a cysteine residue, a non- canonical amino acid residue, a lysine residue, or an arginine residue, all comprising a reactive functional group, to be installed at the recited position to allow one skilled in the art to conjugate a bifunctional crosslinker or a redox-cleavable linker to the amino acid at that position. In some embodiments, the conjugation mutation replaces the ammo acid present, at the recited engineered TdT sequence. For example, in “E32C,” the amino acid at position 32 in the Pl sequence (SEQ ID NO: 2) is E, and the TdT further comprises the conjugation mutation comprises a “C” at position 32. In some embodiments, the recited engineered TdT sequence is the amino acid sequence of P l (SEQ ID NO: 2). In some embodiments, the TdT further comprises the conjugation mutation E32C. In some embodiments, the TdT further comprises the conjugation mutation E29C. In some embodiments, the TdT further comprises the conjugation mutation E37C. In some embodiments, the TdT further comprises the conjugation mutation V152C, In some embodiments, the TdT further comprises the conjugation mutation K102C. In some embodiments, the TdT further comprises the conjugation mutation D22C. In some embodiments, the TdT further comprises the conjugation mutation R44C. In some embodiments, the TdT further comprises the conjugation mutation L93C. In some embodiments, the TdT further comprises the
conjugation mutation P154C. In some embodiments, the TdT further comprises the conjugation mutation N361C.
[00243] In some embodiments, the engineered TdT comprises the amino acid sequence of any one of SEQ ID NOs: 11, 12, 13, and/or 15. In some embodiments, the engineered TdT consists of the amino acid sequence of any one of SEQ ID NOs: 11, 12, 13, and/or 15. In some embodiments, the engineered TdT comprises the amino acid sequence of SEQ ID NO: 15. In some embodiments, the engineered TdT consists of the amino acid sequence of SEQ ID NO: 15. In some embodiments, the TdT comprises the amino acid sequence of any one of SEQ ID NOs: 25, 26, 27, 28, and 29. In some embodiments, the TdT consists of the amino acid sequence of any one of SEQ ID NOs: 25, 26, 27, 28, and 29,
[00244] In some embodiments, an engineered TdT is provided comprising the amino acid sequence of any one of SEQ ID NOs: 20, 21, 22, 23, and 24.
[00245] In some embodiments, the engineered TdT comprises one or more more mutations to a wild-type TdT of SEQ ID NO: I , wherein the one or more mutations comprise one or more of the following mutations: C7A, Q 18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, F119Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, a deletion of D263, a deletion of H264, a deletion of G265, a deletion of R266, a deletion of V267, a deletion of H268, a deletion of S269, a deletion of E270, a deletion of K271, a deletion of S272, Q274P, Q275S, E276P, a deletion of G279, a deletion of W280, a deletion of K281, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E, wherein position numbers are relative to SEQ ID NO: 1.
[00246] In some embodiments, the one or more mutations comprise 10, 15, 20, 25, 30, 35, 40, 45, 50, or more of the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, LI 12P, F119Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, MI91K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, a deletion of D263, a deletion of H264, a deletion of G265, a deletion of R266, a deletion of V267, a deletion of H268, a deletion of S269, a deletion of E270, a deletion of K271, a deletion of S272, Q274P, Q275S, E276P, a deletion of G279, a deletion ofW280, a
deletion of K281, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E, wherein position numbers are relative to SEQ ID NO: 1.
[00247] In some embodiments, the one or more mutations comprise 50 or more of the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, F1 I9Y, SI27E, K128D, QI30R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, a deletion of D263, a deletion of H264, a deletion of G265, a deletion of R266, a deletion of V267, a deletion of H268, a deletion of S269, a deletion ofE270, a deletion of K271, a deletion of S272, Q274P, Q275S, E276P, a deletion of G279, a deletion of W280, a deletion of K281, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E, wherein position numbers are relative to SEQ ID NO: 1.
[00248] In some embodiments, the one or more mutations comprise the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, F119Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, VI63I, SI64E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, a deletion of D263, a deletion of H264, a deletion of G265, a deletion of R266, a deletion of V267, a deletion of 11268, a deletion of S269, a deletion of E270, a deletion of K271, a deletion of S272, Q274P, Q275S, E276P, a deletion of G279, a deletion ofW280, a deletion ofK281, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E, wherein position numbers are relative to SEQ ID NO: 1.
[00249] In some embodiments, the one or more mutations consist of the following mutations: C7A, Q18K, L19K, D31 A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, F119Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q21 IK, F220W, Q223K, C230E, L233Q, Q242L, C256A, a deletion of D263, a deletion of H264, a deletion of G265, a deletion of R266, a deletion of V267, a deletion of H268, a deletion of S269, a deletion of E270, a deletion of K2.71, a deletion of S272, Q274P, Q275S, E276P, a deletion of G279, a deletion of W280, a deletion of K28I, C290A, D293E, T3I7R, M323L, R335N, V336T, E339R, D353E, wherein position numbers are relative to SEQ ID NO: 1.
[00250] In some embodiments, the engineered TdT comprises an amino acid sequence with 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100% identity to SEQ ID NO: 3. In some embodiments, the engineered TdT comprises the ammo acid sequence of SEQ ID NO: 3. In some embodiments, the amino acid sequence of the engineered TdT consists of the amino acid sequence of SEQ ID NO: 3.
[00251 ] In some embodiments, the amino acid sequence of the engineered TdT consists of the amino acid sequence of SEQ ID NO: 3 and a fluorescent protein label, optionally a GFP label. In some embodiments, the amino acid sequence of the engineered TdT consists of the amino acid sequence of SEQ ID NO: 3 and an affinity tag, optionally a His-tag. In some embodiments, the engineered TdT further comprises a fluorescent protein label, optionally a green fluorescent protein (GFP) label. In some embodiments, the engineered TdT further comprises an affinity tag, optionally a His-tag.
[00252] In some embodiments, the engineered TdT comprises a conjugation mutation. In some embodiments, the conjugation mutation allows a cysteine residue, anon- canonical ammo acid residue, a lysine residue, or an arginine residue, all comprising a reactive functional group, to be installed at the recited position to allow one skilled in the art to conjugate a bifunctional crosslinker or a redox-cleavable linker to the amino acid at that position. In some embodiments, the conjugation mutation replaces the ammo acid present at the recited engineered TdT sequence. For example, in “V152C,” the amino acid at position 152 in the P2 sequence (SEQ ID NO: 3) is V, and the TdT further comprises the conjugation mutation comprises a “C” at position 32, In some embodiments, the recited engineered TdT sequence is the ammo acid sequence of P2 (SEQ ID NO: 3). In some embodiments, the TdT further comprises the conjugation mutation E32nc, wherein “nc” is a non-canonical ammo acid. In some embodiments, the TdT further comprises the conjugation mutation V152nc, wherein nc is a non-canonical ammo acid. In some embodiments, the TdT further comprises the conjugation mutation P154nc, wherein nc is a non-canonical ammo acid. In some embodiments, the TdT further comprises the conjugation mutation E32C. In some embodiments, the TdT further comprises the conjugation mutation V152C. In some embodiments, the TdT further comprises the conjugation mutation P154C. In some embodiments, the TdT further comprises the conjugation mutation E29C. In some embodiments, the TdT further comprises the conjugation mutation E37C. In some
embodiments, the TdT further comprises the conjugation mutation R44C. In some embodiments, the TdT further comprises the conjugation mutation K102C.
[00253] In some embodiments, the TdT comprises the amino acid sequence of SEQ ID NO: 4. In some embodiments, the TdT comprises the amino acid sequence of SEQ ID NO: 5. In some embodiments, the TdT comprises the amino acid sequence of SEQ ID NO: 6. In some embodiments, the TdT comprises the amino acid sequence of SEQ ID NO: 7. In some embodiments, the TdT comprises the amino acid sequence of SEQ ID NO: 9. In some embodiments, the TdT comprises the amino acid sequence of SEQ ID NO: 10. In some embodiments, the TdT comprises the amino acid sequence of SEQ ID NO: 16. In some embodiments, the TdT comprises the amino acid sequence of SEQ ID NO: 17, In some embodiments, the TdT comprises the amino acid sequence of SEQ ID NO: 18. In some embodiments, the TdT comprises the amino acid sequence of SEQ ID NO: 19.
III. Tether Molecules
[00254] In some embodiments, an engineered TdT is capable of covalently binding to a nucleotide molecule at a conjugation residue of the engineered TdT via a tether. In some embodiments, the engineered TdT is covalently bound to a nucleotide molecule via a tether using a two-step conjugation. In some embodiments, in the first step, a bifunctional crosslinker (e.g., amaleimide-DBCO bifunctional crosslinker) is reacted with a cysteine residue on the engineered TdT; the maleinnde reacts with the cysteine residue, leaving a free DBCO. In some embodiments, in the second step, an azide-redox-cleavable linker covalently bound to a nucleotide molecule is reacted with the free DBCO on the TdT-bifunctional crosslinker molecule, forming a triazole moiety and completing the tether binding the engineered TdT to the nucleotide molecule. An example of a two-step reaction described in these embodiments is shown in Figures 5 and 6.
[00255] In some embodiments, molecules that can be directly conjugated to a cysteine residue on the TdT enzy me via a tether using a one-step conjugation without a bifunctional crosslinker are provided.
[00256] In some embodiments, an engineered TdT is provided, wherein the TdT is capable of covalently binding to a nucleotide molecule at a conjugation residue of the engineered TdT via a tether, wherein: (i) the conjugation residue comprises a cysteine residue, a non-canonical amino acid residue, a lysine residue, or an arginine residue capable
of reacting with a bifunctional crosslinker or a redox-cleavable linker; and; (ii) wherein the nucleotide molecule comprises a nucleotide, nucleotide triphosphate, or a nucleotide analog.
[00257] All chemical structures showing protonated triphosphates and/or sulfonates should be interpreted to include the protonated and the ionized salt forms in different buffers.
[00258] In some embodiments, the TdT, tether, tether, and nucleotide molecule comprise any one of the following structures:
[00259] In some embodiments, the TdT, tether, and nucleotide molecule comprise any one of the following structures:
[00260] In some embodiments, the TdT, tether, and nucleotide molecule comprise any one of the following structures:
wherein X is any one of:
[00261] In some embodiments, the TdT, tether, and nucleotide molecule comprise any one of the following structures:
[00262] In some embodiments, the TdT, tether, and nucleotide molecule comprise any one of the following structures:
wherein Z is any one of:
[00263] In some embodiments, the TdT, tether, and nucleotide molecule comprise any one of the following structures:
wherein X is O or NH;
wherein Y is H or OMe; and wherein Z is any one of:
[00264] In some embodiments, the TdT, tether, and nucleotide molecule compnse any one of the following structures:
wherein X is O or NH; wherein Y is H or OMe; and wherein Z is any one of:
[00265] In some embodiments, the TdT, tether, and nucleotide molecule compnse any one of the following structures:
wherein Y is H or OMe; and wherein Z is any one of:
[00266] In some embodiments, the TdT, tether, and nucleotide molecule comprise any one of the following structures:
wherein Y is H or OMe; and wherein Z is any one of:
[00267] In some embodiments, a kit comprising an engineered TdT is provided that is capable of covalently binding to a nucleotide molecule at a conjugation residue of the engineered TdT via a tether. In some embodiments, the engineered TdT is covalently bound to a nucleotide molecule via a tether using a two-step conjugation. In some embodiments, in the first step, a bifunctional crosslinker (e.g., a maleimide-DBCO bifunctional crosslinker) is reacted with a cysteine residue on the engineered TdT; the maleimide reacts with the cysteine residue, leaving a free DBCO. In some embodiments, in the second step, an azide-redox- cleavable linker covalently bound to a nucleotide molecule is reacted with the free DBCO on
the TdT-bifunctional crosslinker molecule, forming a triazole moiety and completing the tether binding the engineered TdT to the nucleotide molecule. An example of a two-step reaction described in these embodiments is shown in Figures 5 and 6.
[00268] In some embodiments, kits comprising molecules that can be directly conjugated to a cysteine residue on the TdT enzyme via a tether using a one-step conjugation without a bifunctional crosslinker are provided.
[00269] All chemical structures showing protonated triphosphates and/or sulfonates should be interpreted to include the protonated and the ionized salt forms in different buffers.
[00270] In some embodiments, kits are provided comprising an engineered TdT, wherein the TdT is capable of covalently binding to a nucleotide molecule at a conjugation residue of the engineered TdT via a tether, wherein the kit further comprises: (i) a bifunctional crosslinker and/or a redox-cleavable linker comprising a functional group for covalent conjugation to a conjugation residue of the TdT; and (ii) a nucleotide molecule. In some embodiments, the kit further comprises a spacer.
[00271] In some embodiments, the functional group for covalent conjugation comprises a mal eimide, N-hydroxysuccinimidyl group, or a dibenzylcyclooctyne (DBCO). In some embodiments, the functional group for covalent conjugation comprises a maleinnde. In some embodiments, the maleimide is capable of attaching to a conjugation residue of the engineered TdT, wherein the conjugation residue is a cysteine residue in the TdT. In some embodiments, the functional group for covalent conjugation comprises a DBCO. In some embodiments, the DBCO is capable of attaching to a conjugation residue of the engineered TdT, wherein the conjugation residue is a non-canonical amino acid in the TdT. In some embodiments, the non-canonical amino acid is 4-azidophenylalanine.
[00272] In some embodiments, the redox-cleavable linker comprises a quinone oxime ether linker. In some embodiments, the redox-cleavable linker comprises a disulfide linker. In some embodiments, the redox-cleavable linker comprises a syringic acid-based linker or vanillin-based linker.
[00273] In some embodiments, the spacer comprises one or more polyethylene glycol molecules (PEG)n, optionally wherein "‘n” is 1-12. In some embodiments, the spacer comprises a Cl-Cl 0 alkyl. In some embodiments, the spacer comprises a hydrocarbon chain wherein one or more of the carbons comprises a substituted sulfonate group. In some
embodiments, the spacer comprises a 4-aminobenzyl alcohol or an aminopropanol spacer. In some embodiments, the spacer comprises an aminopropyl carbamate spacer. In some embodiments, the spacer comprises a hydrocarbon chain comprising 1-6 carbons wherein one or more of the carbons comprises a substituted sulfonate group. In some embodiments, the spacer comprises a C1-C6 alkyl.
[00274] In some embodiments, the kit comprises a bifunctional crosslinker capable of being covalently linked to the engineered TdT and comprising a DBCO, wherein the DBCO of the bifunctional crosslinker is capable of reacting with any one of the following molecules:
[00275] In some embodiments, the kit comprises a bifunctional crosslinker capable of being covalently linked to the engineered TdT and comprising a DBCO, wherein the DBCO of the bifunctional crosslinker is capable of reacting with any one of the following molecules:
wherein X is C(0), C(O)NH, CH2, or O: and wherein n = 1 for X = C(O), n =1 for
C(O)NH, n =1 for CH2, and n :=: 1-4 for X = O, wherein n is an integer.
[00276] In some embodiments, the kit comprises a bifunctional crosslinker capable of being covalently linked to the engineered TdT and comprising a DBCO, wherein the DBCO of the bifunctional crosslinker is capable of reacting with any one of the foliowing
[00277] In some embodiments, the kit comprises a bifunctional crosslinker capable of being covalently linked to the engineered TdT and comprising a DBCO, wherein the DBCO of the bifunctional crosslinker is capable of reacting with any one of the following molecules:
[00278] wherein X is O or NH, and wherein Y is H or OMe.
[00279] In some embodiments, the kit comprises one or more of the following molecules capable of being reacted with the engineered TdT, wherein the conjugation residue is a cysteine residue in the TdT:
[00280] In some embodiments, the kit comprises one or more of the following molecules capable of being reacted with the engineered TdT, wherein the conjugation residue is a cysteine residue in the TdT:
[00281] In some embodiments, the kit comprises one or more of the following molecules capable of being reacted with the engineered TdT, wherein the conjugation residue is anon-canonical amino acid residue in the TdT:
[00282] In some embodiments, the kit comprises one or more of the following molecules capable of being reacted with the engineered TdT, wherein the conjugation residue is a lysine residue or an arginine residue in the TdT:
V. Methods of Nucleic Acid Synthesis
[00283] Methods for nucleic acid molecule synthesis are described herein. In some embodiments, the method of nucleic acid molecule synthesis comprises the steps of: (a) providing an engineered terminal deoxynucleotidyl transferase (TdT), wherein the engineered TdT comprises a nucleotide molecule covalently attached to the TdT via a redox-cleavable linker; (b) providing a single-stranded nucleic acid molecule comprising a 5'' and a 3’ end; (c) contacting the engineered TdT and the single-stranded nucleic acid molecule under conditions suitable for the TdT to bind to the 3’ end of the nucleic acid molecule and form a TdT-nucleic acid strand complex, thereby incorporating the nucleotide molecule covalently linked to the TdT into the nucleic acid strand, resulting in the nucleic acid molecule tethered to the TdT and thereby blocking subsequent additions of a nucleotide molecule to the nucleic acid strand; and (d) exposing the TdT-nucleic acid strand complex to suitable electrochemical conditions, wherein the redox-cleavable linker is cleaved thereby separating the TdT-redox- cleavable linker molecule from the nucleic acid strand with the newly incorporated nucleotide molecule, resulting in a nucleic acid strand with the newly incorporated nucleotide molecule; and (e) repeating steps (a) through (d) thereby synthesizing a nucleic acid molecule.
[00284] In some embodiments, the redox-cleavable linker is a quinone oxime ether linker. In some embodiments, the electrochemical conditions of step (d) comprise one or more suitable soluble redox shuttles in the presence of two or more electrodes, wherein the two or more electrodes control the oxidation state of the soluble redox shuttles. In some embodiments, the soluble redox shuttles comprise a thiazine, a viologen, dinucleotides, a
flavin, a quinone, peroxide, bromide, vanadium and/or a metal complex. In some embodiments, the soluble redox shuttles comprise a thiazine. In some embodiments, the thiazine is methylene blue. In some embodiments, the soluble redox shuttles comprise a viologen. In some embodiments, the viologen is methyl viologen (MV) or bis- (tnmethylammonio) propyl viologen (BTMAP-Vi). In some embodiments, the soluble redox shuttles comprise dinucleotides. In some embodiments, the dinucleotides comprise NAD/NADH, nicotinamide adenine dinucleotide, and/or reduced form. In some embodiments, the soluble redox shuttles comprise a flavin. In some embodiments, the flavin is riboflavin (RF), flavin mononucleotide (FMN), or flavin adenine dinucleotide (FAD). In some embodiments, the soluble redox shuttles comprise a quinone. In some embodiments, the quinone is 2,5-dihydroxy-l,4-benzoquinone (DHBQ), 9,10-anthraquinone-2,7-disulfonic acid (AQDS), 4,5-dihydroxybenzene-l,3-disulfonic acid (BQDS), or 2,3, 5,6- tetrakis((dimethylamino)methyl)hydroquinone (FQH2). In some embodiments, the soluble redox shuttles comprise peroxide. In some embodiments, the soluble redox shuttles comprise bromide. In some embodiments, the soluble redox shuttles comprise vanadium. In some embodiments, the soluble redox shuttles comprise a metal complex. In some embodiments, the metal complex is TEMPO (2,2,6,6-tetra.methylpiperidin-l-yl)oxyl, (2,2,6, 6- tetramethylpiperidin-l-yl)oxidanyl). In some embodiments, the metal complex is an organometallic coordination complex. In some embodiments, the organometallic coordination complex is a ferrocene, bis-(trimethylammonio) propyl ferrocene (BTMAP-Fc), ammonium ferricyanide ((NH4>iFe(CN)6), potassium ferricyanide (K4Fe(CN)6), hexaamineruthenium (HI) chloride (RuHex), permanganate, or dichromate.
[00285] In some embodiments, the electrochemical conditions of step (d) further comprise a buffer, a supporting electrolyte, and/or a hydrotropic agent. In some embodiments, the buffer comprises sodium citrate, potassium hydrogen phosphate, and/or potassium dihydrogen phosphate. In some embodiments, the supporting electrolyte comprises sodium chloride, sodium salts, lithium salts, potassium salts, and/or magnesium salts. In some embodiments, the hydrotropic agent comprises caffeine, urea, and/or nicotinamide (NA). In some embodiments, the electrochemical conditions of step (d) comprise a redox shuttle solution comprising methylene blue, citrate-phosphate buffer, and sodium chloride. In some embodiments, the electrochemical conditions of step (d) comprise a redox shuttle solution comprising methylene blue, citrate-phosphate buffer, sodium chloride, and nicotinamide. In
some embodiments, the electrochemical conditions of step (d) comprise a redox shuttle solution comprising flavin mononucleotide, citrate-phosphate buffer, sodium chloride, and nicotinamide. In some embodiments, the electrochemical conditions of step (d) comprise a redox shuttle solution comprising 10 mM methylene blue, 25 mM citrate-phosphate buffer at pH 3.4, and 100 mM sodium chloride. In some embodiments, the electrochemical conditions of step (d) comprise a redox shuttle solution comprising 10 mM methylene blue, 25 mM citrate-phosphate buffer at pH 3.4, 100 mM sodium chloride, and 1 M nicotinamide. In some embodiments, the electrochemical conditions of step (d) comprise a redox shuttle solution comprising 50 mM flavin mononucleotide, 25 mM citrate-phosphate buffer at pH 3.4, 1 M sodium chloride, and 1 M nicotinamide.
[00286] In some embodiments, the potential for cleavage is about +1.05 vs. SHE to -0.18V vs. SHE at 25°C and pH 3; +0.84 V vs. SHE to -0.38 V vs. SHE at 25°C and pH 6.5; or +0.64V vs. SHE to -0.59V vs. SHE at 25°C and pH 10.
[00287] In some embodiments, the engineered TdT comprises one or more mutations to a wiid-type TdT of SEQ ID NO: 1, wherein: (a) the one or more mutations comprise one or more of the following mutations: C7A, QI8K, L19K, D31 A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L1 12P, F119Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M19IK, D208E, Q21 IK, F220W, Q223K, C230E, L233Q, Q242L, C256A, D263H, G265Q, H268D, E270G, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E, wherein position numbers are relative to SEQ ID NO: 1; or b) the one or more mutations comprise one or more of the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, LI 12P, F119Y, S127E, K128D, Q130R, S134T, Q139R, C 154P, N 156T , V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, a deletion of D263, a deletion of H264, a deletion of G265, a deletion of R266, a deletion of V267, a deletion of EI268, a deletion of S269, a deletion of E270, a deletion of K271, a deletion of S272, Q274P, Q275S, E276P, a deletion of G279, a deletion of W280, a deletion of K281, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E, wherein position numbers are relative to SEQ ID NO: I ; or c) the engineered TdT has at least 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100% identity to SEQ ID NO: 2 or 3.
VI. Systems for Enzymatic DNA Synthesis
[00288] Systems for enzymatic DNA systems are described herein. In some embodiments, the system for enzymatic DNA synthesis comprises: a) an engineered TdT described herein; b) a redox-cleavable linker; c) a redox shuttle solution; and d) two or more electrodes on a surface. In some embodiments, the electrodes enable an applied potential or galvanic field to be supplied locally to the device or system thus enabling generation of active redox shuttle via reduction or oxidation at one or more electrodes. In some embodiments, the electrode where the soluble redox shuttle is generated is comprised of a smaller surface area than the other electrode to which it is electrically coupled. In some embodiments, if the soluble redox shuttle is activated via cathodic process, then the cathode may be comprised of less active surface area than the anode by (a) having less electroactive area, or (b) being coupled to multiple anodes which in combination provide substantially more area than the cathode. In some embodiments, the surface, or electroactive, or area aspect ratios for the counter electrode relative to the working electrode, or C/E aspect ratio is at least: 3:1 , 6: 1, 10:1, 100: 1, 1000: 1, or higher. In some embodiments, all the electrodes he in a single plane. In some embodiments, a combination of electrodes may lie in multiple planes. In some embodiments, the electrodes are oriented parallel to perpendicular with respect to one another during the current path. In some embodiments, the system has an inter-electrode gap (i.e., distance between electrodes included in the current path) of <100 pm, <10 pm, <1 pm, <100 pm, <10 nm, <1 nm, <100 pm, <10 pm, or <1 pm.
[00289] In some embodiments, the redox shuttle solution comprises at least one soluble redox shuttle in combination with a buffer, a supporting electrolyte, and/or a hydrotropic agent. In some embodiments, the buffer is sodium citrate, potassium hydrogen phosphate, or potassium dihydrogen phosphate. In some embodiments, the supporting electrolyte is sodium chloride, sodium salts, lithium salts, potassium salts, or magnesium salts. In some embodiments, the hydrotropic agent is caffeine, urea, and/or nicotinamide (NA). In some embodiments, the redox shutle solution comprises methylene blue, citratephosphate buffer, and sodium chloride. In some embodiments, the redox shuttle solution comprises methylene blue, citrate-phosphate buffer, sodium chloride, and nicotinamide. In some embodiments, the redox shuttle solution comprises flavin mononucleotide, citratephosphate buffer, sodium chloride, and nicotinamide. In some embodiments, the redox shuttle solution comprises 10 niM methylene blue, 25 mM citrate-phosphate buffer at pH 3.4,
and 100 inM sodium chloride. In some embodiments, the redox shuttle solution comprises 10 mM methylene blue, 25 rnM citrate-phosphate buffer at pH 3.4, 100 rnM sodium chloride, and 1 M nicotinamide. In some embodiments, the redox shutle solution comprises 50 mM flavin mononucleotide, 25 mM citrate-phosphate buffer at pH 3.4, 1 M sodium chloride, and 1 M nicotinamide.
VII. Redox Shuttles and Redox Shuttle Solutions
Reduction potentials were experimentally determined or converted from literature values.
[00291] Table 3 shows non-limiting examples of shuttle solution fonnul ants/components/addi ti v es .
[00292] In some embodiments, a redox shuttle that is activated via its reduction at an electrode is employed, so that it may migrate to the cleavage site and transfer its electron(s) to the cleavage site linker bond thus being oxidized in the process of breaking that bond, so as to enable subsequent nucleotide addition. In other embodiments, a redox shuttle that is activated via its oxidation at an electrode is employed, so that it may migrate to the cleavage site and harvest electron(s) from the cleavage site linker bond thus being reduced in the process of breaking that bond, so as to enable subsequent nucleotide addition. In some embodiments, the redox-cleavable linker requires the same number of electrons per molecule, or bonds to break, as the redox shuttle requires to be activated at an electrode. In other embodiments, the redox-cleavable linker requires a different number of electrons per molecule, or bonds to break, as the redox shuttle requires to be activated at an electrode. In some embodiments, the redox shuttle requires 1, 2, 3, or 4 electrons per molecule in order to be activated (i.e., reduced or oxidized to sufficient degree as to act upon the redox-cleavable linker).
[00293] Local generation of sufficient quantities of active redox shuttle molecules depends on the electrochemical kinetics at the electrode surface and reaction environment (e.g., concentration of the inactive shuttle, temperature, etc.). The composition, or formulation, of the redox shuttle solution is relevant to reproducible cleavage of the linker and highly efficient DNA synthesis (both within a given nucleotide addition step and within the overall sequence). Non-limiting example fomiulants are presented in Table 3, along with their role in the formulation. In some embodiments disclosed herein, electrochemical oxidation or reduction of the shuttle will involve protonation or deprotonation, however metal ions such as lithium, sodium, and magnesium and potassium may be utilized in addition to or instead of protons. In some embodiments, shuttle solutions relying upon protonation or deprotonation are sufficiently buffered to prevent local pH changes and sluggish electrochemical kinetics. Some examples of biological buffers disclosed herein include potassium phosphate buffer, citrate buffer, or citrate-phosphate buffer. Similarly, the electrochemical reaction at the working electrode may depend upon the use of an ionically
conductive electrolyte between the working and counter electrodes. Supporting electrolytes, such as NaCI, may be included to enable efficient kinetics. Alternatively, other salts based upon sodium, lithium, potassium, magnesium may be employed as supporting electrolytes. Other formulants may be used to increase the solubility’ of the redox shuttle so as to increase the rate of generation. For example, some embodiments include amphiphilic hydrotropic agents, such as nicotinamide, caffeine, and urea, which are known to form weak molecular interactions with poorly soluble redox molecules, thereby increasing their stability' and solubility'-. Orita, A. et al., C’A biometric redox flow battery' based on flavin mononucleotide,” 7 Nat. Commun. 1, 13230 (2016). Finally, specific formulants can play multiple roles. The pyridinic N group in nicotinamide, for example, has a pKaof 3.3, thereby enabling it to act as both a hydrotrope and a buffer when the solution pH is formulated close to its pKa value.
[00294] Some embodiments comprise a redox shuttle solution. In some embodiments, the redox shuttle solution comprises at least one soluble redox shuttle in combination with one or more of the following: supporting electrolyte, buffer, and hydrotrope. In some embodiments, the redox shuttle solution comprises: 10 mM methylene blue, 25 mM citrate-phosphate buffer at pH 3.4, and 100 mM sodium chloride. In some embodiments, the redox shuttle solution comprises 10 mM methylene blue, 25 mM citratephosphate buffer at pH 3.4, 100 mM sodium chloride, and 1 M nicotinamide. In some embodiments, the redox shuttle solution comprises: 50 mM flavin mononucleotide, 25 mM citrate-phosphate buffer at pH 3.4, 1 M sodium chloride, and 1 M nicotinamide.
[00295] The aforementioned compositions, or formulations, of redox shuttles solutions comprising, in some embodiments, at least one redox shuttle in combination with one or more of the following constituent formulants --supporting electroly te, buffer, hydrotrope — are particularly useful for enabling enzymatic DNA synthesis in a device or system including two or more electrodes. Said electrodes enable an applied potential or galvanic field to be supplied locally to the device or system thus enabling generation or active redox shuttle via reduction or oxidation at one or more electrodes. In some embodiments, the electrode wherein the redox shuttle is generated is comprised of a substantially smaller surface area than the other electrode to which it is electrically coupled, so as to mitigate deleterious, unintended, or undesired electrochemical reactions from occurring. Other embodiments of the device or system may employ electrode(s) wherein the redox shuttle is generated to be comprised of a substantially- smaller surface area by electrically coupling said
electrode to more than one other electrode. For example, if the redox shuttle is activated via cathodic process, then the cathode may be comprised of less active surface area than the anode by (a) having less electroactive area, or (b) being coupled to multiple anodes which in combination provide substantially more area than the cathode so as to mitigate opportunity for deleterious, unintended, or undesired electrochemical reactions from occurring. Nonlimiting examples of surface, or electroactive, area aspect ratios for the counter electrode (i.e., electrode wherein no redox chemistry should occur) relative to the working electrode (i.e., electrode wherein the redox shuttle is to be generated), or C/E aspect ratio, are at least: 3: 1, 6: 1 , 10: 1 , 100: I, 1000: 1 , or higher. In some devices or systems, one or more reference electrodes may be employed to establish a potential difference between electrodes, however the aforementioned ratio relates to electroactive surface areas wherein the current path is involved. In some embodiments all the electrodes may lie in a single plane while in other embodiments a combination of electrodes may He in multiple planes, or be oriented parallel to perpendicular with respect to one another during the current path. In some embodiments, it may be desirable to minimize the inter-electrode distance between the active electrodes as much as possible, so as to mitigate excess voltage from being required. Specific, non-limiting examples include an inter-electrode gap of any of the following distances (i.e., distance between electrodes included in the current path): .<100 pm, <10 pm, <1 pm, <100 nm, <10 nm, <1 nm, <100 pm, <10 pm, <1 pm. Designing a device or system within these parameter ranges will likely enable a high degree of redox reversibility as exhibited by high Coulombic efficiency (i.e., charge transfer to the redox shuttle vs. other constituents) and high bond cleavage efficiency (i.e., migration of the active redox shuttle to the cleavage site followed by successful cleavage and subsequent nucleotide addition) over many cycles of redox shuttle activation, cleavage, and nucleotide addition. Exemplary values of high Coulombic efficiency include >75%, >85%, >95%, >99%, >99.9%, and >99.99% while exemplary values of high bond cleavage efficiency include >50%, >60%, >75%, >90%, >99%, >99.9%. Designing a device or system within these parameter ranges will likely enable the device or system to W'ork over many cycles of redox shuttle activation, cleavage, and nucleotide addition on the order of >10X, >100X, >l,000X, >10,000X, >100,000X, >100,000,000X. In some embodiments, the electrodes are activated by applying intermittent, pulse, or continuous voltage or current at one or more values to enable many cycles of redox shuttle activation, cleavage, and nucleotide addition.
VIII. Nucleotide Molecutes
[00296] Some embodiments comprise a nucleotide molecule. In some embodiments, the molecule comprises any one of the foliowing structures:
[00298] In some embodiments, the molecule comprises the following structure:
wherein Z is any one of the following:
; wherein Z is any one of the following:
[00300] In some embodiments, the molecule comprises the following structure:
wherein Z is any one of the following:
[00301] In some embodiments the molecule comprises the following structure:
wherein X is O or NH: wherein Y is H or OMe; and wherein Z is any one of:
[00302] In some embodiments, the molecule comprises the following structure:
wherein X is O or NH; wherein Y is II or OMe; and wherein Z is any one of:
[00303] In some embodiments, the molecule comprises the following structure:
wherein Y is H or OMe; and wherein Z is any one of:
[00304] In some embodiments, the molecule is covalently linked to a tether. comprising the following structure:
wherein Y is H or OMe; and wherein Z is any one of:
[00306] In some embodiments, the molecule comprises any one of the following structures: structures:
wherein X is C(0), C(O)NH, CH2, or O; and wherein n ::: 1 for X ::: C(O), n 1 for C(O)NH, n =1 for CH2, and n = 1 -4 for X = O. wherein n is an integer.
[00309] In some embodiments, the molecule comprises any one of the following structures:
EXAMPLES
Example 1. Design of TdT Variants
[00313] Variants of terminal deoxynucleotidyl tranferase (TdT) were generated starting from the amino acid sequence of wild-type murine TdT, and the software PROSS was used to design variants predicted to have increased stability (see for example amino acid sequences Pl and P2 of Table 1). Goldenzweig, A. et al. Molecular Cell “'Automated Structure- and Sequence-Based Design of Proteins for High Bacterial Expression and
Stability’,” 63(2): 337-346 (2016). Sites for nucleotide conjugation were then identified, including E32, V152, and P154 (Figure 2) by visualization of structural models of the stabilized variants in Pymol. In order to site-specifically attach a linker-nucleotide molecule via covalent chemistries, mutations were be made to introduce reactive ammo acid residues at the selected attachment sites. In some cases, a cysteine residue (C) was installed. In other cases, non-canonical amino acids, such as p- azido-L-phenylalanine (AzF) were installed. A list of all variants studied/used is shown in Table 1 .
[00314] Of note, a stabilized mTdT (called Pl) was developed with 85.64% identity with wild-type mTdT.
[00315] Pl variant of Table 1:
[00316] KISQYAAQRRTTLNNYNKKFTDALD1LAENAELRGNEGSALAF
RRAASVLKSLPFPITSMKDTEGIPNLGDKVKRIIQEILEDGESSEAKAVLNDERYQAFK LFTSVFGVGPKTAEKWYRMGFRTLEDIRSDKTLRFTRMQKAGFLYYEDLVSPVTRPE AEAIEQLVKEAVWQFLPGATVTMTGGFRRGKKTGHDVDFLITSPEATEEEEKQLLHK VTDWWKKQGLLLYEDIQESTFEKF'KLPSRKVDALDHFQKAFLILKLHHQRVDSGKS GQQEGKGWKAIRVDLVMAPYERRAFALLGWTGSRQFERDLRRYARHERKMLLDN IIALYDRTKNTFLRAESEEEIFAIILGLEYIEPWERNA (SEQ ID NO: 2)
[00317] Of interest, one variant (C7) was engineered to site-specifically install/insert a cysteine (i.e., substitute another amino acid in a specified position of the wildtype protein) for conjugation of molecules to the TdT. Other modifications were explored, but the C7 variant was the enzyme used in most studies.
[00318] C 7 variant of P 1 of Table 1 :
[00319] KISQYAAQRRTTLNNYNKKFTDALDILAENAELRGNEGSALAF RRAASVLKSLPFPITSMKDTEGIPNLGDKVKRIIQEILEDGESSEAKAVLNDERYQAF{ * } C { * } LFTS VFGV GPKTAEKWYRMGFRTLEDIRSDKTLRFTRMQKAGFLYYEDLVSP VTRPEAEAIEQLVKEAVWQFLPGATVTMTGGFRRGKKTGHDVDFLITSPEATEEEEK QLLHKVTDWWKKQGLLLYEDIQESTFEKFKLPSRKVDALDIIFQKAFLILKLiniQRV DSGKSGQQEGKGWKAIRVDLVMAPYERRAFALLGWTGSRQFERDLRRYARHERK MLLDNHALYDRTKNTFLRAESEEEIFAHLGLEYIEPWERN (SEQ ID NO: 8) Example 2. Generation of TdT Variants
[00320] To generate the TdT variants, genes for all of the single conjugation site variants were cloned into the E. coll expression vector pET28b+. The cysteine variants
yielded high soluble expression from BL21(DE3) cells via IPTG induction at 30°C for 2 hours or overnight (16-20 hrs) at 15°C. The p-AzF variants were co-transformed into BL21(DE3) cells with pEvol-pAzFRS. l.tl (Addgene 73547) containing genes for expression of the tRNA synthetase and tRNA necessary' to incorporate p-AzF at the amber codon TAG. These variants also y ielded a high level of soluble expression via IPTG and arabinose induction at 30°C tor 2 hours or overnight at 15°C with the addition of 2mM p AzF. All proteins were first purified over Ni-NTA via a His-tag that was incorporated into the expression construct. Size exclusion chromatography (SEC) using a Superdex200 column was then used to isolate the pure, monomeric proteins in TdT Storage Buffer (200mM Potassium Phosphate pH 6.5. lOOmM NaCi). 7.4-18 mg of final, pure protein was obtained from IL culture for each protein and an example of one of the purifications (Cl) is shown in Figure 3A and Figure 3B. Proteins were either stored at -80C with 5% glycerol or lyophilized and stored at room temperature for conjugation and functional testing.
Example 3, Test of Ability of TdT Variants to Add Nucleotides to ssDNA Chain
[00321] The ability' of TdT variants to add nucleotides to a ssDNA chain, called a seed oligo, was tested via incubation of the TdT variant, a seed oligo with a fluorescent dye, and a mixture of 4 2 ’-deoxy nucleotide triphosphates. The enzyme activity was tested at 37 °C, 50 °C, 55 °C, and 60 "C (Figure 4B). Both Pl and P2 exhibited greater activity' than WT mTdT at temperatures >50 °C, with Pl maintaining activity even at 55 °C.
Example 4. Nucleotides with Redox-Cleavable Linkers
[00322] Nucleotides can be conjugated to the enzyme via redox-cleavable linkers. Structures of four azide-redox-linker-nucleotide molecules are depicted below; labeled A-D. The azide can be tethered to a nucleotide via redox-cleavable linker, such as a quinone oxime ether (A and B) or a quinone propionic acid linker with appropriate methyl group substitution, known as the trimethyl lock linker (C and D). Examples of molecules that contain a combination of an azide, a redox cleavable linker, and a nucleotide are shown below.
[00323] As shown above. A-D are structures of molecules containing an azide, redox cleavable linker (quinone) and nucleotide triphosphate. In this case, “T” (2‘- deoxythymidme triphosphate) is shown as a representative example but non-limiting nucleotide triphosphate. A) Example of an azide-quinone-nucleotide wherein the quinone portion is installed as a quinone oxime ether. B) Example an azide-quinone-nucleotide wherein the quinone portion is installed as a quinone oxime ether, and the linker between the quinone and the azide consists of a methylene chain, a chain containing polyethylene oxide (PEG) groups, or a chain containing a ketone or amide functionality', wherein n = 1 for X = C(O), n =1 for C(O)NH, n =1 for CH2, and n = 1-4 for X = (), wherein n is an integer. C) Example of an azide-quinone-nucleotide wherein the quinone is tethered through a methyl substituted propionic acid linker, and D) example of an azide-quinone-nucleotide wherein the quinone is tethered through a methyl-substituted propionic acid linker wherein ‘’A” can be any where from 0 additional atoms to 6 additional atoms.
[00324] Additional examples of molecules containing the azide-redox-linker- nucleotide are depicted below'. The quinone propionic acid linker with appropriate methyl group substitution, known as the trimethyl lock linker (A and B) can be attached to a
nucleotide using suitable intermediate spacers, such as the 4-aminobenzyl alcohol or an aminopropanol spacer. Upon reduction of the quinone, both the redox-cleavable and the intermediate spacer could be cleaved from the nucleotide.
[00325] As show n above, A-B are structures of molecules containing an azide, redox cleavable linker (quinone) and nucleotide triphosphate. In this case, “T” (2’- deoxythymidme triphosphate) is sho wn as a representative nucleotide triphosphate. A) Example of an azide-quinone-nucleotide wherein the quinone is tethered through a methylsubstituted propionic acid linker, via an intermediate 4-aminobenzyl alcohol spacer and B) example of an azide-quinone-nucleotide wherein the quinone is tethered through a methylsubstituted propionic acid linker using a propylene diamine or aminopropanol spacer.
[00326] Schemes 1 -3 below show the expected reductive cleavage mechanism for the quinone tethered through a methyl -substituted propionic acid linker. In the first case (Scheme 1, labeled A), there is no intermediate spacer; reduction of the quinone leads to cyclization and displacement of amine. In cases where an intermediate spacer is present, reduction of the quinone leads to cyclization, followed by a secondary cleavage of the intermediate spacer (Scheme 2, labeled B, and Scheme 3, labeled C). Use of methylsubstituted propionic acid linkers (also known as trimethyl lock linkers) are well described. Okoh et. al. ChemBioChem “Trimethyl Lock: A Multifunctional Molecular Tool for Drug
Delivery', Cellular Imaging, and Stimuli-Responsive Materials,” 19( 7): 1668-1694 (2018).
[00327] Shown abo ve are expected products after the azide-quinone-nucleotide is tethered to the enzyme and the nucleotide is tethered to a growing strand of ssDNA. Scheme 1, A) Reduction of the quinone leads to cyclization and displacement of the amine. Scheme 2, B), and Scheme 3, C) Reduction of the quinone leads to cyclization, followed by a secondary cleavage o f an intermediate spacer molecule.
[00328] However, when the compound shown as structure I below' (with a type of trimethyl lock linker) was synthesized and cleavage was tested with reduced methylene blue (leucoinethylene blue), it did not cleave as expected. The expected cleavage products are the propargyl-amine substituted nucleotide analog II and the cyclized hydroquinone III. As demonstrated by the HPLC traces shown in Figures 25A-25C, expected cleavage products were not observed. Figure 25A shows an HPLC chromatogram of compound I. Figure 25B shows an HPLC chromatogram of compound II (expected cleavage product). Figure 25C shows an HPLC chromatogram of a crude reaction mixture when compound I was treated wdth leucomethylene blue (reducing agent). Reducing agents including dithiothreitoi (DTT), sodium thiosulfate, sodium dithionite, and sodium bisulfite w'ere also tested. Treatment with
the reducing agents did not lead to observation of expected cleavage products.
[00329] The redox-cleavable linker could also consist of a disulfide linkage.
Disulfides have reduction potentials well within the range that will not split water or interfere with nucleic acid chemistry. Disulfides may be used in bioconjugation chemistries and are reduced under physiological conditions (for example, using glutathione). Common reducing agents such as dithiothreitol and (tris(2-carboxyethyl)phosphine) (TCEP) are used to cleave disulfide bonds. Below are examples of azide-disulfide-nucleotide constructs. These molecules could also provide a means to tether a nucleotide to the enzyme through a redox cleavable linker. Depending on the design, cleavage of the disulfide bond could result in an additional cyclization of the reduced thiol (shown in Scheme 4). It is likely desirable to generate nucleic acids free of thiol substitutions due to detrimental secondary structure effects that could arise from maintaining thiol substituents on the growing nucleic acid chains. Examples of nucleotide-disulfide conjugates have been reported. WO 2022/212408 Al.
[00330] Show n above are representative structures consisting of A) azide- disulfide-nucleotides where the carbon chain length is variable (n::: an integer from 1-3) B) disulfide-nucleotides wherein the R group is a functional group or linker-functional group capable of undergoing a bioconjugation reaction (and the carbon chain length is variable, n ==: an integer from 1-3) C) azide-disulfide nucleotides linked via a carbamate and D) disulfidenucleotides tethered to a functional group, R, where R is a functional group or linkerfunctional group capable of undergoing a bioconjugation reaction. In this case, ‘T’ (2’- deoxythymidine triphosphate) is shown as a representative nucleotide triphosphate.
[00331] Scheme 4. Example of a nucleotide tethered to an enzyme via a disulfide linkage. Reductive cleavage of the disulfide bond leads to formation of free thiols. When configured appropriately, the thiol will cyclize into a carbonyl group to form a fivemembered ring (l,3-oxathiolan-2-one), leaving a propargyl -amine as the pendant functional group on the incorporated nucleotide.
[00332] Another candidate for the redox-cleavable linker are substituted benzyl compounds, such as the synngic acid and vanillin family as depicted below. These linkers are cleaved through oxidative conditions that still fall within an acceptable potential. WO 2021/158412 Al; US 2022/0023820 Al. The proclivity toward cleavage can be altered via methoxy substitution on the ring (e.g., two meto-methoxy groups present in synngic acid analogs versus one /weto-methoxy group for vanillin analogs). These linkers can be tethered to nucleotides via carbamate/ urea linkages (A, B) or via an oxime linkage (C, D).
[00333] Show n above are representative structures consisting of A) azide- vanillin/syringic-nucleotides consisting of an azide for enzyme attachment, attached via a polyethylene glycol chain where n = 1 or more units, and a vanillin or syringic acid-based linker tethered to the nucleotide via a urea or carbamate linkage B) analogs consisting of vanillin or syringic acid-based linkers tethered to the nucleotide via a urea or carbamate linkage wherein the R group is a functional group or linker-functional group capable of undergoing a bioconj ugation reaction (and the carbon chain length is variable). C) azide- vanillin/syringic-nucleotides consisting of an azide for enzyme attachment, attached via a polyethylene glycol chain where n = 1 or more units, and a vanillin or syringic acid-based
linker tethered to the nucleotide via an oxime linkage D) analogs consisting of vanillin or syringic acid-based linker tethered to the nucleotide via an oxime linkage wherein the R group is a functional group or linker-functional group capable of undergoing a bioconjugation reaction (and the carbon chain length is vanable). In the depicted cases, ‘"T” (2’- deoxythymidine triphosphate ) is shown as a representative but non-limiting nucleotide triphosphate.
A. Synthesis of Nucleotides Tethered to Quinone Oxime Ether Linkers
1. Synthesis of Trifunctional Structures
[00334] Nucleotides tethered to a quinone oxime ether and azide linkage can be synthesized according to Schemes 5-7 below. Commercially available modified nucleotides with a propargyl anime substituent (e.g., 3) can be modified to contain an aminooxy substituent via reaction of the propargyl amine 3 with a Boc-protected aminooxyacetic acid 4 to provide compound 5. Mass spectral data for compound 5 is shown in Figure 26. Removal of the Boc group using established protocols provides the aminooxy -substituted nucleotide 6. Mass spectral data for compound 6 is shown in Figure 2.7. Hutter, D. et al. ‘'Nucleosides Nucleotides Nucleic Acids,” Labeled nucleoside triphosphates with reversibly terminating aminoalkoxyl groups,” 29(11): (2010). Note that this synthesis to obtain 6 is an exemplary- synthesis only. Compounds disclosed in Table 4 were prepared using ammooxy-substituted nucleotides obtained from suppliers. This example synthesis is demonstrated for 2- deoxy guanosine-5 ’-triphosphate, but the synthetic protocol can be adapted for preparing analogs of the other deoxynucleotide triphosphate analogs (analogs of 2’-deoxy adenosine triphosphate, 2’ -deoxy cytidine triphosphate, and 2’-deoxythymidine triphosphate).
[00335] A molecule containing both a quinone and an azide (deemed an azidequinone, such as 7, Schemes 6 and 7) was condensed with the aminooxygroup at acidic pH. Purification using reversed-phase chromatography yielded molecules of type 8. These molecules contained an azide, a quinone oxime ether, and a nucleotide and are deemed “azide-quinone-nucleotides.” An example synthesis is demonstrated for an analog of 2- deoxy guanosine-5 ’-triphosphate, but all four DNA nucleotide analogs can be synthesized using the representative protocol. Starting material structures and mass spectral data are provided for the four nucleotide analogs generated (Table 4).
Scheme 6. Synthesis to generate compounds containing the azide, a quinone oxime ether, and a nucleotide. These structures are deemed azide-quinone-nucleotides.
Alternative example azide-quinone structure.
[00336] The synthesis of quinone 7 is shown in Scheme 7. The synthesis proceeded through aminolysis of a 6-hydroxychroman-2-one 9 with 3-azidopropylamine 10 followed by oxidation of the corresponding 1,4-hydroxy phenol with sodium periodate.
Although one quinone oxime ether analog synthesis was demonstrated, other quinones can be condensed with the aminooxy -substituted deoxynucleotidetriphosphate analogs to generate
azide-quinone-nucleotides with vary ing linker lengths. For example, a quinone with a hexamethylene linker (compound 23, shown above) can be synthesized using methods disclosed in Park, S.; Westcot, N. P.; Luo, W.; Duto, D.; Yousaf, M. N. Bioconjugate Chemistry “General chemoselective and redox-responsive ligation and release strategy,” 25: 543-551 (2014). Compound 23 was synthesized as shown in Scheme 8. from compounds reported in Park, et al., 25: 543-551 (2014). Deprotection of the tetrahydropyranyl protecting groups of known compound 24 with pyridinium ji-toluenesufonate in ethanol followed by oxidation with sodium periodate afforded compound 23. Quinone 23 was condensed with an aminooxy -substituted deoxynucleotidetriphosphate (in this case for an analog of 2- deoxyguanosine-5 ’-triphosphate) to generate compound 26.
[00337] Scheme 8, Synthesis of azide-quinone-nucleotides with a hexamethylene spacer between the azide and the quinone. A) Synthesis of quinone 23 from known starting material 24. B) Synthesis of an azide-qumone-nucleotide using quinone 23.
[00338] Table 4 shows aminooxy -substituted nucleotide starting materials and tabulated mass spectral data for azide-qumone nucleotide products. The mass spectral data is shown in Figures 28, 29, 30, and 31.
2. Protocols for Synthesis of Azide-Quinone Nucleotides
[00339] The compounds and/or intermediates were characterized by nuclear magnetic resonance spectroscopy (NMR) on 400 MHz NMR Spectrometer, Chemical shifts ( 8) are reported in ppm using the following convention: chemical shift, multiplicity (s = singlet, d :=: doublet, t = triplet, q = quartet, m = multiplet, br =broad), coupling constants, and integration. Spectra are referenced to residual dimethyl sulfoxide (2.50 ppm). Mass spectrometric analysis was performed on an Agilent 1260 Infinity instrument with an Agilent 6120 Quadropole MS. Separations were performed using an Agilent Infinity Lab Poroshell EC-C18 column (4.6 x 100 mm, 2,7 pm) using the following solvent system at a flow rate of 0.5 mL/min: solvent A = 0.05 M triethyl ammonium acetate (TEAA); solvent B = 20% MeCN/0.05M TEAA; gradient method: 90% A/10% B for 3 min; linear gradient form 90%A/10% B to 20%A/80% B from 3-5 min; linear gradient from 20%A/80% B from 5 mm to 8 min; maintained at 100% B from 8 min to 20 min. For less polar compounds, an alternative gradient was used where solvent A = 0.05 M TEAA and solvent B = 80% MeCN/0.05 M TEAA; gradient method: 97.5% A/2.5% B for 3 min; linear gradient from 97.5%A/2.5%B to 90% A/10% B from 3-5 min; linear gradient from 90%A/10% B to 40%A/60%B from 5-10 min; linear gradient from 40%A/60%B to 100%B from 10-15 min; maintained at 100% B from 15 mm to 25 mm. Compounds were detected by UV absorption at 210 nm, 254 nm, 280 nm, or 320 nm. Molecular weight range 400-2000; capillary7 voltage 3750 (pos) and 3500 (neg). Analytical thin-layer chromatography (TLC) was performed on pre-coated silica gel 60 F-254 plates (particle size 0.040-0.050 mm, 230-400 mesh) and visualization was accomplished with UV, visual inspection, or potassium permanganate solutions.
[00340] Scheme 7 (previously shown above). Protocol for synthesis of N-(3- azidopropyl)-3-(2,5-dihy droxyphenyl) propanamide 11.
[00341] To a solution of 6-hydroxy-3,4-dihydro-2H-l-benzopyran-2~one 9 (180 mg, 1.1 mmol) in CH2CI2 (5.5 mL) was added 2-hydroxypyridine (21 mg, 0.22 mmol) and 3 -azidopropyl amine 10 (120 mg, 1.21 mmol). The reaction was stirred for 19 hours at room temperature. The solvent was removed in vacuo (rotary evaporation) and the residue
was taken up in ethyl acetate and washed with 1 M aqueous HCL water, and brine. The organic layer was dried using MgSOy filtered, and concentrated in vacuo. The crude residue was taken up in methanol (5,5 mL) and used directly in the next reaction.
3. Protocol for synthesis of N-(3-azidopropyl)-3-(3,6-dioxocydohexa- 1,4-dien-l-yOpropanamide 7
[00342] To a solution of 1,4-hydroquinone 11 (1.1 mmol) in methanol (5.5 mL) was added a solution of NaIO4 (235 mg, 1.1 mmol) in water (2.25 ml,). The reaction mixture was stirred at room temperature for 5 mm, then additional water (2 mL) was added, and the reaction mixture was stirred for an additional 20 min. The mixture was diluted with ethyl acetate (40 mL) and washed with water (2 x 50 mL.) and brine (30 mL), and dried using MgSOi. The organic layer was concentrated in vacuo to afford quinone 7.
4. Alternative protocol starting from neat 1,4-hydroquinone 11 [00343] A solution of NalOr (58 mg, 0.27 mmol) m I X PBS (phosphate buffered saline) (1.0 ml.) was generated and stirred for 15 minutes at room temperature. In a separate vial, compound 11 (57.6 mg, 0.22 mmol) was dissolved in methanol (2.0 mL). The solution of NalOr was added to the solution of 11, and the reaction mixture was stirred at room temperature for 25 min. Additional methanol (1.0 mL) was added to help with dissolution. The mixture was diluted with water (30 mL) and extracted with ethyl acetate. The ethyl acetate layers were washed with water and brine, dried (NaaSOr) and concentrated in vacuo to afford 7 (49 mg, 85% yield) as a tan oil that solidifies upon standing. Tl NMR data for compound 7 is shown in Figure 24.
5. Synthesis of modified nucleotides
[00345] Scheme 5 (previously shown above). Representative protocol for synthesis of aminooxysubstituted nucleotides
[00346] A stock solution of PA-dGTP 3 at 100 mM in H2O was diluted to 25 mM using dimethylsulfoxide (DMSO) — 5 uL of the 100 mM stock was diluted with 15 uL of DMSO. A separate solution of 2,5-dioxopyrrolidin-l-yl 2-(((tert- butoxycarbonyl)amino)oxy)acetate 4 was prepared by dissolving 13 mg 4 in tetrahydrofuran (THF) (1 mL). Another stock solution of trimethylamine was prepared by dissolving 27 mg of trimethyl amine in THF (1 mL), The solution of 4 (12,2 uL) was added to the 25 mM PA- dGTP solution. Then, the solution of trimethylamine (5.5 uL) was added. The reaction mixture was incubated for no longer than 1 hour at room temperature. The material was purified by chromatography (Biotage, 6 g C18 column; linear gradient from 1 -95% MeCN over 10 column volumes) to generate 5: MS (ESI-neg) m/z calc for C21H30N6O17P3 [M-H]" 732.10, found 731.0. Concentrations of 5 for subsequent reactions can be determined by UV/VIS by generating calibration curves using starting materials at known stock concentrations (in this case, E ::: 11,000 cm^’M'1 at 274 nm).
[00347] R emoval of the fert-butoxycarbonyl (Boc) protecting group was performed following a protocol published in Hutter, D. et al. “Nucleosides Nucleotides Nucleic Acids” Labeled nucleoside triphosphates with reversibly terminating aminoalkoxyl groups,” 29(11) (2010) (see, in particular, the protocol for generation of compound 8m in the Supporting Information). The protocol uses trifluoroacetic acid (TFA) in methanol for a short time period (3 min), followed by precipitation with diethyl ether and dissolution in sodium bicarbonate, followed by neutralization with HC1 and purification by ion-exchange chromatography. The material was prepared as proof-of-concept following the protocol, but without doing the final ion exchange chromatography: MS (ESI-neg) m/z calc for C16H22N6O15P3 632.04, found 631.0.
[00348] Note that this synthesis to obtain 6 is an exemplary' synthesis only. Compounds disclosed in Table 4 were prepared using aminooxy -substituted nucleotides obtained from contracted suppliers.
6. Representative protocol for generating azide-qninnne-hnked nucleotide triphosphate analogs
[00349] Synthesis of azide-quinone-oxime ether dGTP analog 8
[00350] Scheme 6 (previously shown above). A 50 mM stock solution of azidequinone 7 in DMSO was prepared (example: dissolving 7 (22.7 mg) in DMSO (1.73 mL). A 20 mM stock solution of 7 in DMSO was generated by diluting 200 uL of the 50 mM stock with 300 uL. of DMSO.
[00351] A stock solution of NH2O-PA-dGTP 6 (800 uL of a 10 mM stock solution, 8 umol) was aliquoted into a separate 2-mL tube. The solution was placed on ice, and the pH was adjusted by adding MES buffer (256 uL), Keeping the solution on ice, the DMSO solution of 7 (400 uL at 20 mM, 8 umol) was slowly added. The solution was removed from ice and incubated at room temperature for 2.5 hours (protected from light). The solution was quenched with 1 M triethylammonium acetate (TEAA) and purified by chromatography (Biotage, 6 g Cl 8 column, 2% MeCN/EbO for 2 CV, gradient from 2% MeCN to 5% MeCN for 3 CV; 5% to 15% MeCN for 3 CV; 15 to 50% MeCN for 5 CV; 50- 95% MeCN for 4 CV. Fractions containing product were partially concentrated using a ThermoFisher Savant Speedvac, followed by lyophilization. Lyophilized materials were resuspended in water, and concentrations are determined using UV/VIS. Concentrations were estimated based on extinction coefficients of the corresponding starting aminooxysubstituted triphosphates. Products were analyzed by LC-MS. In this case: MS (ESI) m/z calc for C28H34N10O17P3 875.14 [M-H]', found 875.0.
[00352] In other cases, HC1 is used for low ering the pH of the solution. A representative protocol is provided below in Scheme 9:
7. Synthesis of azide-quinone-oxime ether dCTP analog 20
[00353] An aliquot ofNH2O-PA-dCTP 19 at 10 mM (800 uL, 8 umol) was aliquoted into a 2-mL tube and placed on ice. The solution was acidified to pH ~2-3 with 1 M HCI (16 uL). To this solution (on ice), was added a solution of 7 in DMSO (400 uL of a 20 mM stock solution, 8 umol). The solution was protected from light and incubated at room temperature for 2.5 hours. The reaction mixture was quenched with 1 M TEAA and purified by direct loading onto a column (Biotage, 6 g C 18 column, 2% MeCN/HrO for 2 CV, gradient from 2% MeCN to 5% MecN for 3 CV; 5% to 15% MeCN for 3 CV; 15 to 50% MeCN for 5 CV; 50-95% MeCN for 4 CV. Fractions containing product were partially concentrated using a ThermoFisher Savant Speedvac, followed by lyophilization. Lyophilized materials were resuspended in water, and concentrations are determined using UV/ VIS. Concentrations were estimated based on extinction coefficients of the corresponding starting aminooxy-substituted triphosphates. Products were analyzed by LC- MS. In this case: MS (ESI) m/z calc for C26H33N9O17P3 836.13 [M-H]’, found 836.0.
[00354] A protocol for synthesis of quinone oxime ether compound having the methylene linker is shown below- in Scheme 10:
[00355] Scheme 10. A solution of 24 (298 mg, 0.74 mmol) in absolute ethanol (5 mL) was treated with pyridinium p-toluenesulfonate (PPTS) (296 mg, 1.18 mmol). Tire reaction mixture was stirred at room temperature for 22 hours. The mixture was diluted with brine (15 mL) and extracted into ethyl acetate (2.X). lire combined ethyl acetate layers were washed with water and brine, dried (Na?.SO4), decanted, and concentrated under reduced pressure. The crude residue w-as purified on SiCh, eluting with hexanes:ethyl acetate (3: 1,
v/v), providing 25 as a tan oil (116 mg, -87% purity by SH NMR analysis — the impurity being residual ethyl acetate).
[00356] A solution of NalOr (62 mg, 0.29 mmol) in IX PBS (phosphate buffered saline) (0.5 mL) was generated and stirred for 5 minutes at room temperature. In a separate vial, compound 25 (54 mg, which translates to 47 mg based on 87% purity metric, 0.20 mmol) was dissolved in methanol (1.0 mL). The solution of NalOi was added to the solution of 25, and the reaction mixture was stirred at room temperature for 15 min. Additional methanol and water (1.0 mL each) were added to help with dissolution. The mixture was diluted with water and extracted with ethyl acetate. The ethyl acetate layers were washed with water and brine, dried (NhuSCh) and concentrated in vacuo to afford 23 (33.5 mg, 72% yield) as a brown oil that reconstitutes bright yellow in solution.
[00357] A protocol for condensation reaction to form the quinone oxime ether is shown below' in Scheme 11 :
[00358] Scheme 11 A 50 mM stock solution of azide-quinone 23 in DMSO was prepared and diluted further to a final concentrated of 20 mM using NMP. A stock solution of NH2O-PA~dGTP 6 (100 uL of a 55 mM stock solution, 5.5 nmol) was aliquoted into a separate 2-mL tube. The solution was diluted with NMP 400 uL), placed on ice, and the pH w'as adjusted to ca. 3 by adding IM HCI (aqueous). Keeping the solution on ice, the DMSO solution of 23 (100 ul ) was slowly added. The solution was removed from ice and incubated at room temperature for 2.5 hours (protected from light). The solution was quenched with 1 M tn ethylammonium acetate (TEA A) and purified by chromatography (Biotage, 6 g CT 8 column, 2% MeCN/H2O for 2 CV, gradient from 2% MeCN to 5% MeCN for 3 CV; 5% to 15% MeCN for 3 CV; 15 to 50% MeCN for 5 CV; 50-95% MeCN for 4 CV. Fractions were concentrated via lyophilization. Lyophilized materials were resuspended in water, and
concentrations were determined using UV/VIS. Concentrations are estimated based on extinction coefficients of the corresponding starting aminooxy-substituted triphosphates. Products were analyzed by LC-MS. In this case: MS (ESI) m/z calc for C28H35N9O16P3 846.14 [M-H]", found 846.0.
B. Synthesis of Nucleotides Tethered to Trimethyl Lock Linkers
1. Representative Synthesis of Compounds containing the methyL substituted quinone propionic acid (trhnethyi lock) structure.
[00359] Analogs wherein there is a quinone propionic acid (trimethyl lock) structure were generated using a starting material with reported synthesis and structure. Ciampi et al. J. Am. Chem. Soc. “Electrochemical “Switching” of Si(100) Molecular Assemblies,” 134:844-847 (2012). A representative synthesis that was earned out is depicted in Scheme 12. Oxidation of hydroquinone 30 with A-bromosuccmimide (NBS) affords acid 31. Acid 31 was converted to either the .V-hydroxysuccinimidyl ester (NHS ester) or the water-soluble sw/^-TV-hydroxysuccinimidyl ester (sulfo-NHS ester) 32 through reaction with l-Ethyl-3-(3-dimethylaminopropyl)carbodiimide (EDCI) and the corresponding succinimide. The sw//o-NHS ester was then reacted with a propargylamino-substituted nucleotide in borate buffer (pH 8.5) to afford a compound containing an azide-qumone propionic acid (trimethyl lock) linker-nucleotide triphosphate construct.
[00360] Scheme 12. Representative synthesis that was carried out of a compound containing an azide, trimethyl lock functionality', and a nucleotide triphosphate.
2. Protocols for synthesis of azide-trimetoyl tock-dCTP (34).
[00361] Synthesis of compound 31. A solution of hydroquinone 30 (50 mg, 0.17 mmol) in acetonitrile (0.50 mL) and water (0.050 mL) was treated with N- bromosuccinimide (NBS) (32 mg, 0.18 mmol) and stirred at room temperature for 1 hour. The solvents were removed under reduced pressure, and the residue was either purified by
chromatography on SiOz (elution with hexanes/ethyl acetate, 3: 1) to afford 31, or the material carried on crude to the next reaction.
[00362] Formation of .si/Z^-NHS ester 32. A solution of acid 31 (20 mg, 0.060 mmol) in dichloromethane (0.6 mL) and AA-dimethylformamide (0.6 mL) was treated with tn ethylamine (50 uL) followed by 5w//b-7V-hydroxysuccinimide, sodium salt (20 mg, 0.092 mmol) and EDCI-HC1 (18 mg, 0.094 mmol). The reaction mixture was stirred at room temperature for 1 hour (overnight reaction times also yield product with no loss in yield/purity). Aliquots were quenched with water and used directly in the next reaction.
[00363] Addition of sulfo-NHS ester 32 to propar gylamine-substituted nucleotide 33. An aliquot corresponding to approximately 0.63 mg sulfo-NHS ester 32 (0.001 mmol) was added to a solution of propargylamine-dCTP 33 (100 uL, 0.001 mmol, c = 10 mM in water) and 200 mM borate buffer, pH 8.5 (50 uL). The reaction mixture was protected from light and incubated at room temperature for 2 hours and purified by direct loading onto a column (Biotage, 6 g Cl 8 column, 2% MeCNZFhO for 2 CV, gradient from 2% MeCN to 5% MecN for 3 CV; 5% to 15% MeCN for 3 CV; 15 to 50% MeCN for 5 CV; 50-95% MeCN for 4 CV. Fractions containing product were concentrated via lyophilization. Lyophilized materials were resuspended in water, and concentrations were determined using UV/ VIS. Concentrations were estimated based on extinction coefficients of the corresponding starting propargylamine-substituted triphosphates. The products were analyzed by LCMS. MS (ESI) m/z calc for C28H37N7O16P3 820. 15 [M-H]", found 820.0.
3. Synthesis of ammopropytearbamate of the quinone propionic acid (trimethyl lock)
[00364] Chemistry' for attachment of the intermediate aminopropyl spacer was demonstrated using a model quinone propionic acid system (compound 35) that does not contain the azidopropyl substituent for enzy me conjugation. The synthesis of the model compound containing an aminopropyl carbamate spacer is shown in Scheme 13. Synthesis of compound 35 was achieved following established protocols. Ciampi et al. J. Am. Chem. Soc. “Electrochemical “Switching” of Si(lOO) Molecular Assemblies,” 134:844-847 (2012). Compound 37 was synthesized by' addition of 3-aminopropan-l-ol 36 to 35 in the presence of diisopropylethylamine using A(A-dimethylformamide as a solvent. Resulting compound 37 was treated with A.A-disuccinimidyl carbonate (DSC) 38 in the presence of 4- dimethylaminopyridine (DMAP) to generate mixed carbonate 39, which was used crude.
Crude mixed carbonate 39 was reacted with propargylamino-substituted nucleotides (in this case propargyl amine-dCTP 33) in a mixture of Ar-methyl-2-pyrrolidone (NMP) and borate buffer (pH 8,5) to afford the resulting carbamate analog 40.
[00365] Scheme 13. Synthesis of a compound wherein a methyl-substituted quinone propionic acid is tethered to a nucleotide triphosphate via an intermediate aminopropanol spacer.
4. Protocol for addition of 3 -amino- 1 -propanol to toe trimetoyl lock NHS ester compound to generate compound 37:
[00366] See Scheme 13 (above). To a solution of compound 35 (25 mg, 0.072 mmol) in Ar,Ar-dimethylformamide (DMF) (0.72 mL) was added a solution of 3-amino-l- propanol 36 (9 mg, 0.12 mol) and diisopropylethylamine (zPnNEt) (0.026 mL, 0. 15 mmol). This reaction mixture was maintained at room temperature overnight and diluted with ethyl acetate. The diluted solution w-as washed with water and brine, dried (Na2.SO4), and concentrated under reduced pressure. The crude material was purified by chromatography on S1O2 (ethyl acetate as eluent) to afford compound 37 as a yellow solid (15.4 mg, 69% yield). This material was carried forward to the next reaction.
[00367] Formation of mixed NHS carbonate 39. A solution of 37 (12.8 mg, 0,042 mmol) and 4-dimethyIaminopyridine (2.6 mg, 0.021 mmol) in acetonitrile/CH2C12 (0.4 mL, 1 : 1 volume/ volume) was cooled to 0 °C and treated with A jV-disuccinimidyl carbonate (DSC) 38 (10.8 mg, 0.042 mmol). The reaction mixture was stirred overnight, gradually warming to room temperature. The mixture was diluted with CH2CI2 and washed with water and brine, dried (MgSCh), and concentrated in vacuo. Compound 39 was used without further purification.
[00368] Reaction of propargylamine-substltuied NT (aCTP) with mixed NHS carbonate 39.
[00369] A stock solution of mixed carbonate 39 was prepared at 100 mM concentration by dissolving 11.1 mg (0.0247 mmol) 39 into .V-methyl-2-pyrrolidinone (NMP) (0.247 mL). In a separate tube, a solution of propargylamine-substituted-dCTP (supplied at 10 mM in water, 0.30 mL, 0.003 mmol) was diluted with 200 mM borate buffer, pH 8.5 (300 uL). To this solution was added NMP (0.45 mL) and the 100 mM solution of 39 (0.15 mL, 0.015 mmol, 5 equiv). The reaction mixture was protected from light and incubated at room temperature for 2 hours and purified by direct loading onto a column (Biotage, 6 g C 18 column, 2% MeCN/HbO for 2 CV, gradient from 2% MeCN to 5% MecN for 3 CV; 5% to 15% MeCN for 3 CV; 15 to 50% MeCN for 5 CV; 50-95% MeCN for 4 CV. Fractions containing product 40 were concentrated via lyophilization. Lyophilized materials are resuspended in water, and concentrations were determined using UV/VIS. Concentrations were estimated based on extinction coefficients of the corresponding starting propargylamine-substituted triphosphates. The products were analyzed by LCMS. MS (ESI) m/z calc for C30H41 N5O18P3 852.17 [M-H]~, found 852.0.
C. Synthesis of Nucleotides Tethered to Disulfide Linkers
[00370] Nucleotide analogs with a disulfide linker were synthesized by addition of either an AMiydroxysuccinimidyl ester 41 to an amine-substituted nucleotide (shown here as an analog of 2 ’-deoxy thy mi dine triphosphate (Scheme 14) or by preparation of mixed A’-hydroxysuccinimidyl carbonate 44 followed by addition to the corresponding propargylamine-substituted nucleotides 33 (Scheme 15) to generate carbamates such as compound 45.
[00371] Scheme 14. Addition of the AMiyroxysuccinimidyl ester of azidoethyldisulfidepropionic acid to a propargyl-amine-substituted nucleotide.
[00372] Scheme 15. Formation of and addition of a mixed carbonate of N- hydroxysuccinimide and an alcohol containing both a disulfide and an azide. The mixed carbonate is added to a propargylamine-substituted nucleotide to link the disulfide via a carbamate moiety.
[00373] Protocols:
[00374] Az-disuIfide~NHS ester dCTP-PA coupling reaction (CSTR 390)
[00375] A 10 mM stock solution of azidoethyl-SS-propionic NHS ester 41 was prepared by dissolving 41 (3 mg, 0.01 mmol) into NMP (1 mL). In a separate tube, an aliquot of 10 mM of propargyl -amine-substituted dCTP (PA-dCTP) 33 (0.40 mL, 0.004 mmol) was diluted with 200 mM borate buffer, pH 8.5 (0.20 mL) and cooled in an ice bath for 10 minutes. The azidoethyl-SS-propionic NHS ester 41 stock solution (0.48 mL, 0.0048 mmol, 1.2 equivalents) was added slowly. Once fully added and mixed, the solution was incubated in the dark on a mixer for 3 hours. After 3 hours and confirming conversion by HPLC, the crude mixture was quenched with IM triethylammonium acetate (aqueous) until the pH was approximately 7 and purified by direct loading onto a column (Biotage, 6 g Cl 8 column) and eluted using an acetonitrile/water gradient. Fractions (at -10% acetonitrile/water) containing product 42 were concentrated via lyophilization. Lyophilized materials were resuspended in water, and concentrations are determined using UV/VIS. Concentrations are estimated based on extinction coefficients of the corresponding starting propargylamino-substituted triphosphates. The products are analyzed by LCMS. MS (ESI) m/z calc for C17H25N7O14P3S2708.01 [M-H]-, found 707.9.
[00376] Mixed carbonate of the azidoethyldisulfidealcohol 44. A solution of azidoethyldisulfide-ethyl alcohol (53,8 mg, 0.30 mmol) and 4-dimethyl aminopyridine (19 mg, 0.16 mmol) in acetonitrile/CHzCh (1.6 mL, 1: 1 volume/volume) was cooled to 0 °C and treated with Ar,A'-disuccinimidyl carbonate (DSC) 38 (72 mg, 0.28 mmol). The reaction mixture was stirred overnight, gradually wanning to room temperature. The mixture was diluted with CH2CI2 and washed with water and brine, dried (MgSOr), and concentrated in vacuo. Compound 44 (68 mg) was used in the following reaction without further purification.
[00377] Addition of mixed NHS carbonate to propargylamine-substituted nucleotide (dCTP) for preparation of compound 45.
[00378] A stock solution of mixed carbonate 44 was prepared at 100 mM concentration by dissolving 44 63 nig (0.2.0 mmol) into N-methyl-2-pyrrolidinone (NMP) (2.0 mL). In a separate tube, a solution of propargylamine-substituted dCTP (supplied at 10 mM in water, 0.50 mL, 0.005 mmol) was diluted with 200 mM borate buffer, pH 8.5 (500 uL). To this solution was added NMP (0.75 mL), cooled to 0 °C on ice, and the 100 mM solution of 44 (0.25 mL, 0.025 mmol, 5 equiv) is added. The reaction mixture was protected from light and incubated at room temperature for 2 hours and purified by direct loading onto a column (Biotage, 6 g Cl 8 column, 2% MeCN/H2O for 2 CV, gradient from 2% MeCN to 5% MecN for 3 CV; 5% to 15% MeCN for 3 CV; 15 to 50% MeCN for 5 CV; 50-95% MeCN for 4 CV. Fractions containing product were concentrated via lyophilization. Lyophilized materials were resuspended in water, and concentrations are determined using UV/VIS. Concentrations are estimated based on extinction coefficients of the corresponding starting propargylamino-substituted triphosphates. The products were analyzed by LCMS. MS (ESI) m/z calc for C17H25N7O15P3S2 724.01 [M-H]; found 724.0.
D. Synthesis of Nucleotides Tethered to Syringic Arid-Based Linkers or
Vanillin-Based Linkers
1. Synthesis and Protocols for vanHIm/syringic analogs
[00379] Scheme 16. Synthesis of nucleotide analogs modified via a azidevanillin-oxime linkage.
[00380] An azidoethyl substituted vanillin analog was prepared by reaction with 4-(2-bromoethoxy)-3-methoxybenzaldehyde 46 and sodium azide. The resulting aldehyde 47 was condensed with aminooxy-substituted nucleotide 6 to afford analog 48. (Scheme 16). In this case, analogs of 2-deoxy guanosine-5’ -triphosphate are shown, but the synthesis would translate to other nucleotide analogs.
[00381 ] Analogs containing an additional meto-methoxy substituent were synthesized using similar protocols (Scheme 17). The aldehyde starting material was prepared in two steps; 4-hydroxy-3,5-dimethoxybenzaldehyde 49 was alkylated with 1,2- bromoethane 50 to generate precursor 51. The synthesis was then analogous to that in Scheme 9— bromide displacement with sodium azide yields compound 52 followed by condensation with aminooxy-substituted nucleotide 6 to generate analog 53.
[00382] Scheme 17. Synthesis of nucleotide analogs modified via a azide- syringic aldehyde-oxime linkage
[00383] Protocols:
[00384] Protocol far reaction to convert 4-(2-bromoethoxy)-3- methoxybenzaldehyde to the corresponding azide 47. Adapted from Kiran, K. et al., Russian Journal of General Chemistry “Synthesis, characterization, and antibacterial activity of some novel vanillin related hydrazone derivatives bearing 1,2,3-triazole ring,” 87: 12.88- 1294 (2017). To a solution of 4-(2-bromoethoxy)-3-methoxybenzaldehyde 46 (0.00387 mol) in DMF (10 mL) was added sodium azide (0.00465 mol). The mixture was heated at 100 °C for 1 hour. The reaction mixture was cooled to room temperature before diluting the mixture with water (20 mL). The solution as then extracted twice with ethyl acetate (30 mL),
combining the organic layers. The combined organic layers were washed with brine solution (25 ml) before drying the resulting mixture over anhydrous sodium sulfate and filtering. The crude mixture was concentrated under reduced pressure to give a light yellow oil. The residue was purified by flash column chromatography using ethyl acetate: hexane (v/v 1 : 9) to obtain 4-(2-azidoethoxy)-3-methoxybenzaldehyde 47.
[00385] Protocol for reaction to generate vanillin oxime 48. A 50 mM stock solution of 4-(2~azidoethoxy)-3-methoxybenzaldehyde 47 in NMP was prepared by dissolving aldehyde 47 (10 mg, 0.045 mmol into 900 pL of NMP). An aliquot of 55 mM aminoxy-substituted nucleotide (NH2O-PA-dGTP) 6 (91 pL, 0.005 mmol) was diluted with NMP (80 pL), and the pH was adjusted to between 2 and 3 using 1 M 4- Morpholineethanesulfonic acid (MES) (300 pL). The solution was chilled on an ice bath for 15 minutes before slowly adding the 4-(2-azidoethoxy)-3-methoxybenzaldehyde solution (100 pL of 50 mM stock, 0.005 mmol). Once fully added and mixed, the solution was incubated in the dark on a mixer for 3 hours. After 3 hours and confirming conversion by HPLC, the crude mixture was quenched with 1 M triethylammonium acetate until the pH was approximately 7. The reaction mixture was purified by direct loading onto a column ( Biotage, 6 g Cl 8 column and eluted using an acetonitrile/water gradient. Fractions containing product (eluted at -10% acetonitrile/water) were concentrated via lyophilization. Lyophilized materials were resuspended in water, and concentrations were determined using UV/VIS. Concentrations w'ere estimated based on extinction coefficients of the corresponding starting aminooxy-substituted triphosphates. The products were analyzed by LCMS. MS (ESI) m/z calc for C26H31N9O17P3 834.11 | M-H f. found 834.0.
[00386] Protocol for reaction to generate syringic oxime.
[00387] The starting materials (corresponding bromide and azide) were synthesized according to Chang, X. et al. Acta Pharm Sin. B ’‘Design, synthesis, and biological evaluation of quinazolin-4(3 H)-one derivatives co-targeting poly(ADP-ribose) polymerase- 1 and bromodomain containing protein 4 for breast cancer therapy,” 11 (1 ): 156- 180 (2021) and Kiran, K. et al. Russian Journal of General Chemistr}' “Synthesis, characterization, and antibacterial activity of some novel vanillin related hydrazone derivatives bearing 1,2,3-triazole ring,” 87: 1288-1294 (2017), respectively.
[00388] Synthesis of 4~(2~azidoethoxy)-3,5-dimethoxybenzaldehyde-NH2O~ PA-dGTP oxime 53. A 50 mM stock solution of 4-(2-azidoethoxy)-3,5-
dimethoxybenzaldehyde 52 in NMP was prepared by dissolving 12. mg (0.048 mmol) of 52 into 900 pl. of NMP. An aliquot of 55 mM NII2O-PA-dGTP (91 pL, 0.005 mmol) 6 was diluted with NMP (80 pL), and the pH was adjusted to between 2 and 3 using 1 M 4- Morpholmeethanesulfonic acid (MES) (300 pL). The solution was chilled on an ice bath for 15 minutes before slowly adding the 4-(2-azidoethoxy)-3,5-dimethoxybenzaldehyde solution (100 pL, 0.005 mmol). Once fully added and mixed, the solution was incubated in the dark on a mixer for 3 hours. After 3 hours and confirming conversion by HPLC, the crude mixture was quenched with IM tri ethylammonium acetate until the pH was approximately 7. The reaction mixture was purified by direct loading onto a column (Biotage, 6 g C18 column and eluted using an acetonitrile/water gradient. Fractions containing product (eluted at -40% acetonitnle/water) were concentrated via lyophilization. Lyophilized materials were resuspended in water, and concentrations were determined using UV/VIS. Concentrations are estimated based on extinction coefficients of the corresponding starting aminooxy- substituted triphosphates. The products were analyzed by LCMS. MS (ESI) m/z calc for C27H33N9O18P3 864.12 [M-HJ-, found 864.0.
[00389] Sy nthesis of vanillin and syringic acid nucleotide analogs via carbamate linkages
[00390] Scheme 18. Synthesis of nucleotide analogs with vanillin or syringic acid linkers joined via a carbamate linkage.
[00391] Analogs containing a vanillin or syringic acid linker joined to the nucleotide via a carbamate linkage can be synthesized as depicted in Scheme 18. Phenols of type 54 are alkylated with 1 ,2-dibromoethane 50. followed by substitution of the bromide with sodium azide to afford alcohols of type 55. The alcohol 55 is converted to the corresponding mixed N-hydroxysuccinimidyl carbonate via reaction with N,N-disuccinimidyl carbonate in the presence of 4-dimethylaminopyridine. The mixed carbonate of type 57 is reacted with propargyiamine-substituted nucleotide 3 to generate the corresponding carbamates of type 58. In this case, analogs of 2-deoxy guanosine-5’ -triphosphate are shown, but the synthesis would translate to other nucleotide analogs.
[00392] Protocols for synthesis of vanillin/syringic carbamates
[00393] Synthesis of bromide and azide versions of the vanillin analog — note, protocols are written for the vanillin analog but can be translated to the syringic acid analog.
[00394] Synthesis of compound 55 (Y = H). To a mixture of potassium carbonate (0.44 g, 3.2 mmol), and 4-(hydroxymethyl)-2-methoxyphenol 54 (Y = H) (0.1 g, 0.65 mmol) in dry DMF (1.2 mL) was added 1 ,2-dibromoethane 50 (1.21 g. 6.5 mmol). The mixture was sealed and heated to 55°C overnight. The reaction mixture was cooled to room temperature before diluting the mixture with water (5 mL). The solution was extracted twice
with ethyl acetate (5 mL). The organic layers were combined and washed with 1 M NaOH, water, and brine solution (3 mL), The resulting organic solution was dried over anhydrous sodium sulfate and filtered. The crude mixture was concentrated under reduced pressure. The crude residue was purified by flash column chromatography using ethyl acetate:hexane (v/v 1 : 9) to obtain (4-(2-bromoethoxy)-3-methoxyphenyl)methanol 55 (Y = H).
[00395] Synthesis of compound 56 (Y ===H). To a solution of brontide 55 (Y===H) (100 mg, 0.385 mmol) in DMF (1.2 mL) was added sodium azide (30 mg, 0.46 mmol). The mixture was heated at 100°C for 1 hour. The reaction mixture was cooled to room temperature before diluting the mixture with water (20 mL). The solution was then extracted twice with ethyl acetate (30 mL), combining the organic layers. The combined organic layers were washed with brine solution (25 niL) before drying the resulting mixture over anhydrous sodium sulfate and filtering. The crude mixture was concentrated raider reduced pressure to give a light yellow oil. The residue was purified by flash column chromatography using ethyl acetate: hexane (v/v 1 : 9) to obtain (4-(2-azidoethoxy)-3-methoxyphenyl)methanol 56 (Y = H).
[00396] Reaction to form mixed carbonate of type 57 (Y = H). A solution of 4- (2-azidoethoxy)-3-methoxyphenyl)methanol 56 (Y ::: H) (23.6 mg, 0.11 mmol) and 4- dimethylaminopyridine (6.7 mg, 0.055 mmol) in acetonitrile/CH2C12. (0.30 mL, 1:1 volume/volume) was cooled to 0 °C and treated with N,N-disuccinimidyl carbonate (DSC) 38 (28 mg, 0. 1 1 mmol). The reaction mixture was stirred overnight, gradually wanning to room temperature. The mixture was diluted with CH2C12 and washed with water and brine, dried (MgSO4), and concentrated under reduced pressure. Compound 57 (Y=H) (68 mg) was used in the following reaction without further purification.
[00397] Addition of mixed NHS carbonate 57 (Y=H) to propargylaminesubstituted nucleotide (dGTP). A stock solution of mixed carbonate 57 (Y = H) was prepared at 100 mM concentration by dissolving crude 57 {V I I} 12 mg (0.033 mmol) into N-methyl- 2-pyrrolidinone (NMP) (0.33 mL). In a separate tube, a solution of propargylaminesubstituted dGTP (supplied at 10 mM in water, 0.30 mL, 0,003 mmol) was diluted with 200 mM borate buffer, pH 8.5 (300 uL). To this solution was added NMP (0.45 mL) and the 100 mM solution of 57 (Y=H) (0.15 mL, 0.015 mmol, 5 equiv). The reaction mixture wns protected from light and incubated at room temperature for 2 hours. Analysis by LCMS indicated product formation of compound 58 (Y=H). MS (ESI) m/z calc for
C25H30N8O17P3 807.09 [M-H]-, found 807.0. Note, in this case, the product was a minor constituent, but optimization of reaction conditions and mixed carbonate formation would lead to larger amounts of product. Additionally, the reaction was accomplished on a model system 59 (shown below) (wherein the azidoethyl substituent is replaced with a methyl group). The model system was purified using reversed-phase chromatography (Cl 8 column, acetonitrile/ water gradient) followed by lyophilization. These conditions would be readily- translated to the system with the azidoethyl substituent described herein. (For model system 59, MS (ESI) m'z calc for C23H30N4O18P3 743.08 [M-H]-, found 743.0.
[00398] Model compound wherein a carbamate linkage connects a syringic acid analog to a nucleotide.
[00399] Approaches for Small Molecules Attachment to TdT
[00400] The tethered small molecule can be a nucleotide triphosphate, and the small molecule can be tethered via a cleavable linker. The linker can be cleaved via electrochemical (redox) means. Synthesis and preparation of the small molecules (modified nucleotide triphosphates) that are covalently attached to the protein are described herein.
[00401] Two approaches were taken to install a redox-cl eav able linker that is tethered to both TdT and a pendant nucleotide. In approach 1 , trifunctional systems were
synthesized with three key elements: 1) a functional group handle for covalent conjugation to the TdT enzyme 2) a redox-cleavable linker (e.g., a quinone oxime ether) and 3) the nucleotide to be incorporated (see approaches 1 and 2 below). The synthetic strategy' facilitates changes to the enzyme attachment chemistry. In one case, a maleinnde was introduced for attachment to cysteine residues. In another case, a DBCO (dibenzylcyclooctyne) was introduced for attachment to azides, such as a 4- azidophenylalanine (non-canonical amino acid) residue. A synthetic scheme for synthesis of these molecules is described herein.
[00402] Approaches 1 and 2, The trifunctional structures below each contain a functional group for enzyme attachment; structure 1 has a maleimide for attachment to cysteine; structure 2 has a dibenzylcyclooctyne (DBCO) for attachment to a non-canonical amino acid. The structures have a redox cleavable linker (quinone oxime ether) and a modified nucleotide.
[00403] The synthetic routes for obtaining structures 1 and 2 (above) are depicted below (Scheme 19 and Scheme 20).
[00404] A common intermediate hydroquinone (reduced form of quinone) was synthesized by aminolysis of commercially available lactone 9 with 3-azidopropylamine 10, in the presence of catalytic 2-hydroxy pyridine. The 1,4-hydroqumone 11 is oxidized with sodium periodate to form quinone 7.
[00405] The synthesis of the trifunctional system for conjugation to cysteine is shown in Scheme 19.
[00406] Copper-catalyzed click chemistr}' to react the azide of compound 7 with the alkyne of compound 12 provided the quinone 13. Compound 13 was purified using reversed phase chromatography (C l 8 column with an acetonitrile/water gradient). The purified azide-quinone 13 was condensed with the hydroxylamine of the modified nucleotide triphosphate 14 by mixing 13 and 14 in a 1 : 1 ratio in acidic media. The final product 1 was used for enzyme conjugations without, further purification.
[00407] Scheme 19, Synthesis of the trifunctional system for covalently attaching nucleotides to cysteine residues through a redox-cleavable linker. (THF = tetrahydrofuran, THPTA :::: tris-hydroxypropyltriazolylmethylamine (a Cu-ligand for accelerating click chemistr}' reactions; DMSO = dimethylsulfoxide).
[00408] The synthesis of the trifunctional system for conjugation to 4- azidophenylalanine (a non-canonical amino acid) is depicted in Scheme 20. The same azidequinone 7 described in Scheme 1 is used. Quinone 7 can be reacted with amine- functionalized alkyne 15 using copper-catalyzed click chemistry. Once this reaction proceeds for 1 hour, dibenzocyclooctyne-A-hydroxysuccinimidyl ester (DBCO-NHS) 17 can be added in the same pot to afford compound 1§. Compound 1§ can be purified using reverse-phase chromatography (Cl 8 column with an acetonitrile/water gradient); purified compound 18 can be condensed with modified nucleotide triphosphate 14 as described above.
[00409] Scheme 2.0. Synthesis of the trifunctional system for covalently attaching nucleotides to 4-azidophenylaIanine (non-canomcal amino acid) residues through a redox-cleavable linker. (MeCN = acetonitrile; THPTA = /Fishy droxypropyltnazolylmethylamine (a Cu-ligand for accelerating click chemistry reactions).
[00410] Trifunctional molecules are synthesized by copper-catalyzed click reactions between formed azide-quinone-nucleotides (tabulated in Table 4) and alkynes. Propargyl maleimides of different lengths were reacted with the azide-quinone-nucleotides (Scheme 21 and Scheme 22).
[00411] Scheme 2.1. Synthesis of a trifunctional molecule containing a mal eimide, cleavable linker (quinone oxime ether) and a nucleotide triphosphate. Synthesis is accomplished by copper-catalyzed click chemistry of pre-formed azide-quinone- nucleotides and an alkyne.
[00412] Scheme 22. Synthesis of a trifunctional molecule containing a maleimide, cleavable linker (quinone oxime ether) and a nucleotide triphosphate. Synthesis is accomplished by copper-catalyzed click chemistry of pre-formed azi de-quinone- nucleotides and an alkyne.
[00413] Protocol for click reaction to generate compound 27. Note that the protocol is written for the 2-deoxyguanosine-5’-triphosphate analog but would translate to other nucleotide tn phosphates. A solution of azide-quinone-nucleotide 8 at 10 mM in H2O (100 pL, 0.001 mmol) was aliquoted into a tube, followed by a solution of Maleimide-PEG4- alkyne 12 (10 mM stock in water, 100 pL, 0.001 mmol). In a separate tube, a 200 mM THPTA solution in water was mixed with a 100 mM CuSCh solution in water (10 pL each). The THPTA/CuSOr solution (8 pL, 0.0004 mmol CuSO-r and 0.0008 mmol THPTA) was
added to the tube containing azide-quinone-nucleotide 8 and the maleimide-PEG4-alkyne 12. A solution of 100 mM sodium ascorbate in water (16 pL, 0.0016 mmol) was added to the tube, and the mixture was incubated at room temperature for 1.5 hours. The reaction mixture was purified by chromatography (Biotage, 6 g C18 column, 2% MeCNZH2O for 2 CV, gradient from 2% MeCN to 5% MeCN for 3 CV ; 5% to 15% MeCN for 3 CV; 15 to 50% MeCN for 5 CV; 50-95% MeCN for 4 CV. Fractions were concentrated via lyophilization. Lyophilized materials are resuspended in water, and concentrations were determined using UV/VIS. Concentrations are estimated based on extinction coefficients of the corresponding starting aminooxy-substituted triphosphates. Products are analyzed by LC-MS. In this case: MS (ESI) m / calc for C46H60N 12O24P3 1257.31 [M-H ]" , found 1257. 1. Note that lower equivalents of Cu and THPTA and shorter reaction times (as reflected in the protocol for synthesis of 29) are recommended.
[00414] Protocol for click reaction to generate compound 29. Note that the protocol is written for the 2-deoxyguanosine-5 ’-triphosphate analog but would translate to other nucleotide triphosphates. A solution of azide-quinone-nucleotide 8 at 10 mM in HzO (100 uL, 0.001 mmol) was aliquoted into a tube, followed by a solution of A-propargyl maleimide 28 (50 mM stock in DMSO, 20 pL, 0.001 mmol). In a separate tube, a 200 mM THPTA solution in water was mixed with a 100 mM CuSOr solution in water (5 pL each). The THPTA/CuSOr solution (5 uL, 0.00025 mmol CuSC>4 and 0.0005 mmol THPTA) was added to the tube containing azide-quinone-nucleotide 8 and A’-propargyl maleimide 28. A solution 100 mM sodium ascorbate in water (5 pL, 0.0005 mmol) was added to the tube, and the mixture was incubated at room temperature for 30 min. The reaction mixture was quenched with 100 mM tn ethylammonium acetate buffer, then purified by chromatography (Biotage, 6 g C 18 column, 2% MeCN/H2O for 2 CV, gradient from 2% MeCN to 5% MeCN for 3 CV; 5% to 15% MeCN for 3 CV; 15 to 50% MeCN for 5 CV; 50-95% MeCN for 4 CV. Fractions were concentrated via lyophilization. Lyophilized materials was resuspended in water, and concentrations were determined using UV/VIS. Concentrations were estimated based on extinction coefficients of the corresponding starting aminooxy-substituted triphosphates. Products were analyzed by LC-MS. In this case: MS (ESI) m/z calc for C35H39N11O19P3 1010.16 [M-H]~, found 1010.0.
[00415] In another approach, a bifunctional crosslinker is first introduced to a naturally occurring ammo acid with specific reactivity, for example, a cysteine residue.
Bifunctional molecules containing a maleimide, a spacer section, and a functional group for click chemistry can be used, for example, maleimide-PEG(n)-DBCO compounds (where DBCO stands for dibenzylcycooctyne). The maleimide reacts selectively with the cysteine residue, leaving a reactive DBCO moiety (Figure 5). Other bifunctional maleimide-alkynes include maleimide-PEG(n)-BCN compounds, where BCN stands for bicyclononyne. Other linking molecules can also be used to react with either naturally occurring or non-natural amino acids; for example, a crosslinker containing a A-hydroxysuccinimidyl group on one end and a clickable moiety (such as an azide or alkyne) at another site. The N- hydroxysuccinimidyl group could react with primary' amine functionalities, including lysine residues.
[00416] Using the above-mentioned crosslinker chemistries converts an ammo acid residue to a reactive functional group. In some instances, this reactive functional group is a bioorthogonal functional group, meaning that the introduced functional group can be reacted exclusively without side reactions from other portions of the biomolecule. In Figures 5 and 6, the method is demonstrated using maleimide-alkyne bifunctional crosslinkers. Incubation of the enzyme with a maleimide-alkyne crosslinker installs an alkyne - -the alkyne can be a DBCO group. The enzyme is then purified using a desalting or size-exclusion column to remove unconjugated small molecule. The enzyme can then be incubated with a second small molecule containing an azide; the azide reacts with DBCO functional groups without use of additional reagents (Figure 6).
[00417] Conjugation of the azide-quinone-nucleotides to TdT variants was accomplished using the methods portrayed in Figures 5 and 6. A general protocol for addition of a bifunctional maleimide-DBCO crosslinker (sulfo-DBCO-maleimide): a solution of 200 pM TdT with a site-specific cysteine residue was incubated with a 10-fold excess of sulfo-DBCO maleimide (Click Chemistry Tools) in TdT Storage Buffer (200mM Potassium Phosphate pH 6.5, lOOmM NaCl) at room temperature overnight (or incubated at room temperature for 4 hours, then moved to 4 °C for -2.4 hours). Excess sulfo-DBCO- maleimide was removed using a desalting column (Zeba™spin) equilibrated with TdT Storage Buffer. The concentration of recovered protein was determined by measuring UV/'VIS (Nanodrop) and calculating using £=54890 M'fom"1.
[00418] Once the TdT w-as modified with a pendant DBCO, the azide-qumone- nucleotide was added in 3-fold molar excess of the TdT-DBCO (e.g., 140 pM TdT-DBCO
and 420 uM azide-quinone-nucleotide) in TdT Storage Buffer. The mixture was incubated at room temperature overnight (or incubated at room temperature for 4 hours, then moved to 4 °C if used >24 hours after the initial addition). Excess azide-quinone-nucleotide was removed using a desalting column (Zeba™spin) equilibrated with 20 mM potassium phosphate pH 6.5, 100 mM NaCl. The eluent was further purified using a Zeba™ Dye and Biotin Removal Column equilibrated with 20 mM potassium phosphate pH 6.5, 100 mM NaCl. Samples were lyophilized and stored at room temperature until reconstituted.
Example 5. Verification of Nucleotide Addition to TdT
[00419] The conjugation of small molecules to the modified TdT was verified by LC-mass spectrometry. LCMS analysis is done using a Waters Acquity UPLC coupled to a Waters Xevo QTof. The typical method used was as follows: Buffer A 0.05% TFA in water and Buffer B 0.05% acetonitrile. The gradient wzas 10-20%B in 1 min, 20-50%B in 9 min, 0.5 mL/min, post-column split, using the following column: 2. 1x50 mm Halo Diphenyl 2.7 pm.
[00420] Deconvoluted mass spectral data was tabulated in Table 5 and Figure 7A, Figure 7B, Figure 7C, Figure 7D, Figure 7E, and Figure 7F. The C7 variant (unconjugated) mass was detected without the N-terminal methionine; proteins typically lose the N-terminal methionine w'hen recombinantly expressed in bacteria. The structures and expected masses of the small molecule conj ugates added to the modified TdT are sho wn below.
[00421] Note that the maleimide is drawn in its unreacted form for simplicity. The calculated mass of this segment attached to protein would not change.
[00422] Deconvoluted mass spectral data for the unconjugated C7 variant, the C7 vanant with the sulfo-DBCO-maleimide bifunctional crosslinker added, and the C7 variant reacted first with sulfo-DBCO-maleimide, followed by the azide-quinone-nucleotides are provided (Figure 7 A, Figure 7B, Figure 7C, Figure 7D, Figure 7E, Figure 7F).
Example 6. Cleavage of Nucleotides and Synthetic Control Using Soluble Redox Agents
[00424] The protein-linker-small molecule complex can be controlled via soluble electron (redox) shutles when placed in an environment with electrodes; the electrodes can be used to control the oxidation state of the soluble redox shuttle. Examples of soluble redox shuttles include methylene blue, quinones, NAD/NADH (nicotinamide adenine dinucleotide, and reduced form), flavins (e.g., riboflavin), viologens, TEMPO (2,2, 6,6- tetramethylpiperidin- 1 -yl)oxyl or (2,2,6,6-tetramethylpiperidin- 1 -yl)oxidanyl)/related
compounds and certain organometallic coordination complexes, including ferrocenes, (NH4)4Fe(CN)6, and KrFetCNX When the soluble redox shuttle is in a particular oxidation state, it will react with the redox-cleavable linker, facilitating cleavage of the small molecule from the protein, lite user can thus control cleavage events via the applied voltage at the electrodes.
[00425] A soluble redox shuttle can be selected based on knowledge of the redox-cleavable linker used and desired operating pH. For example, an operating window' for cleavage of quinone oxime ethers at pH 6.5 is estimated to be between +0.85 V and -0.38 V at 25 °C (Figure 8). This estimate is based on the fact that DTT (dithiothreitol) (with potential of -0.27 V at pH 6.5) has been shown to cleave quinone oxime ethers. Park, S.; Westcott, N. P.; Luo, W.; Duto, D.; Yousaf, M. N. Bioconjugate Chemistry “General chemosei ective and redox-responsive ligation and release strategy,” 25: 543-551 (2014). Comparative cyclic voltammetry' data for a series of soluble redox shuttles is presented in Figure 9.
[00426] The quinone oxime ether bond (redox-cleavable linker) was cleaved by soluble reducing agents, including leucomethylene blue (the reduced form of methylene blue). Cleavage was demonstrated both using the precursor small molecule azide-quinone nucleotide and on the full conjugate with TdT. An azide-quinone-nucleotide (azide-quinone- dCTP) in 200 mM phosphate, pH 6.5 with 100 mMNaCl was treated with leucomethylene blue (at 10-fold excess) at room temperature. After dilution with 20% MeCN/0.01 M TEAA (triethylammomum acetate), the mixture was analyzed by LCMS (Figure 10, Figure 11A and Figure 1 IB). The azide-quinone nucleotide disappeared, and the resulting product (labeled “cleaved pdf’) was detected by mass spectrometry.
[00427] A second batch of leucomethylene blue reactions was performed on starting material that had fewer impurities in the mixture. MS data for the starting material and converted product are shown in Figure 12A and Figure 12B.
Example 7. Redox Shuttle Solutions
[00428] Three non-limiting examples of exemplary' redox shuttle solution compositions, or formulations, are presented in Table 6 and comprise a redox shuttle (methylene blue or flavin mononucleotide) with a buffer, a supporting electrolyte, and a hydrotrope constituents). Analogous cyclic voltammograms for Formulations A, B, and C are shown in Figure 13. These examples demonstrate the range of reduction potentials and
current densities accessible in the generation of active soluble redox shuttles, based on their identity and solution formulation. Methylene blue shows a distinct improvement in electrochemical kinetics between Formulations A and B, due to the addition of 1 M nicotinamide. The peak separation (AEp) decreases from 105 to 49 mV, and its corresponding standard rate constant (ko) increases from 4.5 x 10'4 to 2.6 x 10’3 cm/s. This improvement suggests both greater stability of methylene blue in solutions containing formulants such as nicotinamide due to its buffering effect. Nicotinamide also induces an anodic shift in the reduction potential of methylene blue despite a constant nominal pH, either from its increased buffer capacity or by acting as an additional proton source/sink during reduction/oxidation. Formulation C shows a higher current response than Formulation B, despite slower kinetics (AEP = 103 mV, ko = 1.9 x 10‘4 cm/s), due to the greater concentrations of both the redox shuttle (flavin mononucleotide) and supporting electrolyte (NaCl).
[00429] Further chemical analysis of Formulation B supports the role nicotinamide plays in stabilizing methylene blue during the generation of the active redox shuttle. Figure 14 shows ultraviolet-visible light (UV-Vis) absorbance spectra of Formulation B dilutions (Dilution 1: 0.25 mM methylene blue and 2.5 mM nicotinamide; Dilution 2: 0.25 mM methylene blue and 100 mM nicotinamide). The peak at 605 nm is associated with the dimeric or tetrameric form, and the peak at 670 nm is attributed to the monomer. Fernandez- Perez, A. & Marban, G., “Visible Light Spectroscopic Analysis of Methylene Blue in Water; What Comes after Dimer?’' ACS Omega 5: 29801-2.9815 (2020). Large excess of nicotinamide induces a clear absorbance shift from oligomeric to monomeric methylene blue species in solution, providing evidence for solution stabilization through the formation of nicotinamide-methylene blue associations. Coordination is further supported by 1H nuclear resonance spectroscopy (1H-NMR, Figures 15-18). IH peaks assigned to hydrogens B and E of nicotinamide undergo broadening in the presence of methylene blue, as do the IH peaks assigned to hydrogens B’, C’, and D’ of methylene blue in the presence of nicotinamide. Moreover, the j -coupling constants for hydrogens B’ and E are identical at 2.56 Hz, indicating close physical proximity between the two species. Suitable formulation design, such as inclusion of buffering hydrotropic agents like nicotinamide, improves the stability of the soluble redox shuttle, supports rapid generation kinetics (i.e. electrochemical reversibility), and maximizes the efficiency of breaking the redox-cleavable linker.
Example 8. Cleavage of Linker Ataching C7 V ariant to Nucleotide
[00430] The C7 variant conjugated to sulfo-DBCO-maleimide followed by the azide-qumone-dTTP was treated with leucomethyiene blue. Another sample was treated with methylene blue (not yet reduced to the active form). The expected mass loss from reductive cleavage of the quinone oxime ether bond was observed when the protein conjugate was treated with leucomethyiene blue; no change in mass was observed when the protein conjugated was treated with methylene blue (in the inactive form). Mass spectral data are tabulated in Table 7, and deconvoluted mass spectra are shown in Figure 19A, Figure 19B, and Figure 19C. ’The expected cleavage products and expected masses for the small molecules that were conjugated and/or cleaved from the enzyme are shown below. Die small molecule is drawn as the unconjugated maleimide for simplicity, but the maleimide would be bonded to the protein as a thioether (via a reaction of the cysteine residue with the maleimide.)
[00431] The conjugated protein was split into batches. One batch was treated with leucomethylene blue (soluble reducing agent); reduction was observed. One batch was treated with methylene blue (not in the active, reduced form); no change in protein mass was observed.
Example 9. TdT Atached To Nucleotide Via Quinone Oxime Ether Linker Bound To ssDNA
[00432] One application of the technology is nucleic acid synthesis. In this application, engineered TdT is covalently attached to a nucleotide triphosphate via a redox cleavable linker. The attachment and linker chemistry is designed so that the nucleotide
triphosphate is accessible to the active site of the enzyme. In the presence of a single stranded nucleic acid (ssDNA), the TdT will bind the ssDNA and incorporate the covalently attached linker to the 3’ end of the ssDNA, effectively tethering the ssDNA to the TdT protein by means of the newly incorporated nucleotide. This mechanism is also depicted in Figure 1. The TdT protein attached to the ssDNA would effectively block subsequent additions of nucleotide to the ssDNA strand. The ssDNA-TdT complex, when in the presence of electrodes and a suitable redox shuttle (Figure 20) would be electrochemically /redox- cleaved to generate ssDNA with the newly incorporated nucleotide, now' separated from the TdT/hnker. A fresh TdT covalently bound to nucleotide triphosphate would be introduced, allowing the next nucleotide in the sequence to be added to the growing chain.
[00433] It w-as demonstrated that TdT proteins conjugated to a nucleotide via a quinone oxime ether linker do indeed bind ssDNA. A Cy2-labeled seed oligo was incubated with C7 variants conjugated to an azide-quinone-dTTP molecule for 30 minutes at 37 °C. This reaction was then analyzed via SDS-PAGE on a denaturing Tris-Glycine gel that separated the unbound seed oligo from the seed oligo covalently bound to TdT based on size. Results in Figure 21 show the presence of additional higher molecular weight species shifted upwards on the gel following incubation with conjugated C7 compared to seed oligo alone confirming covalent binding of the conjugated enzyme to seed oligo.
Example 10. Verification that Oligonucleotides Extended by C7 Variant
[00434] TdT proteins covalently conjugated to nucleotides can be used to incorporate multiple nucleotides onto the end of single-stranded DNA, using leucomethylene blue as a soluble reducing agent to cleave the protein from the ssDNA complex after nucleotide incorporation at each step (see Figure 1 for an illustration of this process).
[00435] To verify that oligonucleotides could be extended by the covalently- tethered C7 conjugates, an oligonucleotide was anchored to a surface at the 3’ end; a second oligonucleotide was hybridized to that strand to generate a 3’ end available for extension via addition of nucleotides (Figure 22). The surface was outfitted with a gasket to allow for introduction of reagents. In this case, a C7-nucleotide conjugate w?as introduced, incubated on the surface, then washed from the surface. After cleavage with leucomethylene blue, a second C7-nucleotide conjugate was added (with a different nucleotide). The starting C7- nucieotide conjugate was a result of reaction with azide-quinone-CTP in some cases or azide-
quinone-dATP in other cases. In both cases, the expected addition of the first nucleotide (either A or C) followed by the second nucleotide (either A or C) v/as observed (Figure 22, Table 8). Addition of the appropriate nucleotides was determined via sequencing.
[00436] A single cycle of extension with either C7-C or C7-A resulted in 85% and 64% intended single addition respectively. Two combinations of sequences were tried (adding either C or A first, followed by either A or C), and the expected two-nucleotide addition was observed 32% of the time regardless of order of nucleotide addition.
Example 11. Electrochemically Controlled DNA Synthesis
[00438] Electrochemically controlled DNA synthesis using a nucleotide- conjugated enzyme was also tested in a 3-cycle synthesis run. As depicted in Figure 22, in each cycle of synthesis the enzyme conjugated to the nucleotide to be added was incubated with seed DNA attached to a surface surrounded by electrodes. Unbound enzyme was washed away and then a methylene blue solution was introduced. Upon applying a voltage to the electrodes, the methylene blue was reduced and cleaved the enzyme from the extended DNA. The cleaved enzyme was washed away and then the next enzyme conjugated to the next nucleotide to be added was introduced. Following this method, enzymes conjugated to guanine (G), cytosine (C), and then thymine (T) were added over three cycles with electrochemical cleavages in between. The DNA was then recovered from the surface using
sodium hydroxide denaturation and then the sample was sequenced. The distribution of synthesized sequences is shown in Figure 23B. 37% target was achieved with the remaining sequences containing largely deletions (48%) and a minor population containing insertions (25%).
Example 12. Protocol For Sequencing Products
[00439] The addition of nucleotides to the oligonucleotide was determined using sequencing techniques described here. The hybridized seed oligonucleotide was eluted from the surface via denaturation from the capture oligo using 0.1M NaOH and then subjected to the following protocol.
[00440] Sequencing sample preparation began with polyadenylation (for sequences expected to end in thymine, cytosine, or guanine) or the addition of a poly(T) tail (for sequences expected to end in adenine) to the 3’ end of each DNA sample using commercial TdT. The samples were then amplified using primers that bind to a conserved 5’ sequence and the complementary 3’ tail while incorporating Illumina sequencing priming sites into the synthesized amplicons. This step w’as performed using qPCR in order to achieve optimal amplification of all samples regardless of input DNA concentration. A portion of each PCR product was analyzed by gel electrophoresis to confirm the size and relative abundance of the amplicons. The products of all samples were then normalized based on relative abundance and used as template for a second PCR reaction, which incorporated Illumina flow cell adaptors and a variable index into each sample. The products of the final PCR reaction were characterized by gel electrophoresis and all indexed samples were pooled together proportionally based on relative abundance. Size selection and purification of the pooled library w?as performed by DNA gel extraction and the final library' was analyzed by Qubit dsDNA HS assay and Tapestation DI 000 screentape to determine the final molar concentration. The library was then diluted, denatured and sequenced following standard Illumina protocols.
EQUIVALENTS
[00441] The foregoing writen specification is considered to be sufficient to enable one skilled in the art to practice the embodiments. The foregoing description and Examples detail certain embodiments and describes the best mode contemplated by the inventors. It will be appreciated, however, that no matter how detailed the foregoing may
appear in text, the embodiment may be practiced in many ways and should be construed in accordance with the appended claims and any equivalents thereof.
[00442] As used herein, the term about refers to a numeric value, including, for example, whole numbers, fractions, and percentages, whether or not explicitly indicated. The term about generally refers to a range of numerical values (e.g., 4-7-5-10% of the recited range) that one of ordinary skill in the art would consider equivalent to the recited value (e.g., having the same function or result). When terms such as at least and about precede a list of numerical values or ranges, the terms modify all of the values or ranges provided in the list. In some instances, the term about may include numerical values that are rounded to the nearest significant figure.
Claims
What is Claimed is:
1. A method of nucleic acid molecule synthesis comprising the steps of:
(a) providing an engineered terminal deoxynucleotidyl transferase (TdT), wherein the engineered TdT comprises a nucleotide molecule covalently attached to the TdT via a redox-cleavable linker:
(b) providing a single-stranded nucleic acid molecule comprising a 5’ and a 3’ end;
(c) contacting the engineered TdT and the single-stranded nucleic acid molecule under conditions suitable for the TdT to bind to the 3’ end of the nucleic acid molecule and form a TdT-nucleic acid strand complex, thereby incorporating the nucleotide molecule covalently linked to the TdT into the nucleic acid strand, resulting in the nucleic acid molecule tethered to the TdT and thereby blocking subsequent additions of a nucleotide molecule to the nucleic acid strand; and
(d) exposing the TdT-nucleic acid strand complex to suitable electrochemical conditions, wherein the redox-cleavable linker is cleaved thereby separating the TdT-redox- cleavable linker molecule from the nucleic acid strand with the newly incorporated nucleotide molecule, resulting m a nucleic acid strand with the newly incorporated nucleotide molecule; and
(e) repeating steps (a) through (d) thereby synthesizing a nucleic acid molecule.
2. The method of claim 1 , wherein the redox-cleavable linker is a quinone oxime ether linker.
3. The method of claim 1 or claim 2, wherein the electrochemical conditions of step (d) comprise one or more suitable soluble redox shuttles in the presence of two or more electrodes, wherein the two or more electrodes control the oxidation state of the soluble redox shuttles,
4. The method of any one of claims 1-3, wherein the soluble redox shuttles comprise a thiazine, a viologen, dinucleotides, a flavin, a quinone, peroxide, bromide, vanadium and/or a metal complex.
5. The method of claim 4, wherein the soluble redox shutles comprise a thiazine,
6. The method of claim 4, wherein the soluble redox shuttles comprise a viologen.
7. The method of claim any one of claims 1-3, wherein the soluble redox shuttles comprise dinucleotides.
8. The method of claim 7, wherein the dinucleotides comprise NAD/NADH, nicotinamide adenine dinucleotide, and/or reduced form.
9. The method of claim 4, wherein the soluble redox shuttles comprise a flavin.
10. The method of claim 4, wherein the soluble redox shuttles comprise a quinone.
11. The method of claim 4, wherein the soluble redox shuttles comprise peroxide.
12. The method of claim 4 wherein the soluble redox shuttles comprise bromide.
13. The method of claim 4, wherein the soluble redox shutles comprise vanadium.
14. The method of claim 4, wherein the soluble redox shuttles comprise a metal complex.
15. The method of claim 5, wherein the thiazine is methylene blue.
16. The method of claim 4, wherein the quinone is 2,5-dihydroxy-l,4- benzoquinone (DHBQ), 9,10-anthraquinone-2,7-disulfonic acid (AQDS), 4,5- dihydroxybenzene-l,3-disulfonic acid (BQDS), or 2, 3,5,6- tetrakis((dimethylamino)methyl)hydroquinone (FQH2).
17. The method of claim 9, wherein the flavin is riboflavin (RF), flavin mononucleotide (FMN), or flavin adenine dinucleotide (FAD).
18. The method of claim 6, wherein the viologen is methyl viologen (MV) or bis- (trimethylammonio) propyl viologen (BTMAP-Vi).
19. The method of claim 14, wherein the metal complex is TEMPO (2, 2,6,6- tetramethylpiperidin-l-yl)oxyl, (2,2,6,6-tetramethylpiperidin-l-yl)oxidanyl).
20. The method of claim 14, wherein the metal complex is an organometallic coordination complex.
21. The method of claim 20, wherein the organometallic coordination complex is a ferrocene, bis-(trimethylarnmonio) propyl ferrocene (BTMAP-Fc), ammonium ferricyanide ((NH4)4Fe(CN)e), potassium ferricyanide (K4Fe(CN)e), hexaamineruthenium (III) chloride (RuHex), permanganate, or dichromate.
22. The method of any one of claims 1-21, wherein the electrochemical conditions of step (d) further comprise a buffer, a supporting electrolyte, and/or a hydrotropic agent.
23. The method of claim 22, wherein the buffer comprises sodium citrate, potassium hydrogen phosphate, and/or potassium dihydrogen phosphate.
24. The method of ciaim 22, wherein the supporting electrolyte comprises sodium chloride, sodium salts, lithium salts, potassium salts, and/or magnesium salts.
25. The method of claim 22, wherein the hy drotropic agent comprises caffeine, urea, and/or nicotinamide (NA).
26. The method of any one of claims 1-3, wherein the electrochemical conditions of step (d) comprise a redox shuttle solution comprising methylene blue, citrate-phosphate buffer, and sodium chloride.
27. The method of any one of claims 1-3, wherein the electrochemical conditions of step (d) comprise a redox shuttle solution comprising methylene blue, citrate-phosphate buffer, sodium chloride, and nicotinamide.
28. The method of any one of claims 1 -3, wherein the electrochemical conditions of step (d) comprise a redox shutle solution comprising flavin mononucleotide, citratephosphate buffer, sodium chloride, and nicotinamide.
29. The method of any one of claims 1-3, wherein the electrochemical conditions of step (d) comprise a redox shuttle solution comprising 10 mM methylene blue, 25 mM citrate-phosphate buffer at pH 3.4, and 100 mM sodium chloride.
30. The method of any one of claims 1-3, wherein the electrochemical conditions of step (d) comprise a redox shuttle solution comprising 10 mM methylene blue, 25 mM citrate-phosphate buffer at pH 3.4, 100 mM sodium chloride, and 1 M nicotinamide.
31. The method of any one of claims 1-3, wherein the electrochemical conditions of step (d) comprise a redox shuttle solution comprising 50 mM flavin mononucleotide, 25 mM citrate-phosphate buffer at pH 3.4, 1 M sodium chloride, and 1 M nicotinamide.
32. The method of any one of claims 1-31, wherein the potential for cleavage is about +1.05 vs. SHE to -0.18V vs. SHE at 25°C and pH 3; +0.84 V vs. SHE to -0.38 V vs. SHE at 25°C and pH 6.5; or +0.64V vs. SHE to -0.59V vs. SHE at 25°C and pH 10.
33. The method of any one of claims 1 -32, wherein the engineered TdT comprises one or more mutations to a wild-type TdT of SEQ ID NO: 1, wherein: a. the one or more mutations comprise one or more of the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100 A, L112P, F119Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T,
VI 631, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, D263H, G265Q, H268D, E270G, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E, wherein position numbers are relative to SEQ ID NO: 1; or b. the one or more mutations comprise one or more of the following mutations: C7A, Q18K, L19K, D31 A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, F119Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, VI 631, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, a deletion of D263, a deletion of H264, a deletion of G265, a deletion of R266, a deletion of V267, a deletion of H268, a deletion of S269, a deletion of E270, a deletion of K271, a deletion of S272, Q274P, Q275S, E276P, a deletion of G279, a deletion of W280, a deletion of K281, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E, wherein position numbers are relative to SEQ ID NO: 1; or c. the engineered TdT has at least 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100% identity to SEQ ID NO: 2 or 3.
34. An engineered terminal deoxynucleotidyl transferase (TdT) comprising one or more mutations to a wild-type TdT of SEQ ID NO: 1, wherein: a. the one or more mutations comprise one or more of the following mutations: C7A, QI8K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, Fl 19Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, D263H, G265Q, H268D, E270G, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E, wherein position numbers are relative to SEQ ID NO: 1 ; or b. the one or more mutations comprise one or more of the following mutations: C7A, QI8K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, Fl 19Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, VI 631, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, a deletion of D263, a deletion ofH264, a deletion of G265, a deletion of R266, a deletion of V267, a deletion of H268, a deletion of S269, a deletion ofE270, a deletion ofK271, a deletion of S272, Q274P, Q275S, E276P, a deletion
of G279, a deletion of W280, a deletion of K281, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E, wherein position numbers are relative to SEQ ID NO: 1; or c. the engineered TdT has at least 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100% identity to SEQ ID NO: 2. or 3.
35. The engineered TdT of claim 34, wherein the one or more mutations comprise one or more of the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, LI 12P, Fl 19Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V1631, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q21 IK, F220W, Q223K, C230E, L233Q, Q242L, C256A, D263H, G265Q, H268D, E270G, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E.
36. The engineered TdT of claim 34 or claim 35, wherein the one or more mutations comprise 10, 15, 20, 25, 30, 35, 40, 45, 50, or more of the following mutations: C7A, Q18K, L19K, D31 A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, Fl 19Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, VI 631, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, D263H, G265Q, H268D, E270G, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E.
37. The engineered TdT of any one of claims 34-36, wherein the one or more mutations comprise 50 or more of the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, 181 L, K99Q, S100A, LI 12P, Fl 19Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, VI 631, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, D263H, G265Q, H268D, E270G, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E.
38. The engineered TdT of any one of claims 34-37, wherein the one or more mutations comprise all of the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, LI 12P, Fl 19Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, VI 631, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, D263H, G265Q, H268D, E270G, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E.
39. The engineered TdT of any one of claims 34-38, wherein the one or more mutations consist of the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, SI 00 A, L112P, F119Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M19IK, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, D263H, G265Q, H268D, E270G, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E.
40. The engineered TdT of any one of claims 34-39, wherein the engineered TdT comprises an amino acid sequence with 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100% identity to SEQ ID NO: 2.
41. The engineered TdT of any one of ciaims 34-40, wherein the engineered TdT comprises the amino acid sequence of SEQ ID NO: 2.
42. The engineered TdT of any one of claims 34-41, wherein the amino acid sequence of the engineered TdT consists of the ammo acid sequence of SEQ ID NO: 2.
43. The engineered TdT of any one of claims 34-41, wherein the ammo acid sequence of the engineered TdT consists of the ammo acid sequence of SEQ ID NO: 2 and a fluorescent protein label, optionally a green fluorescent protein (GFP) label.
44. The engineered TdT of any one of ciaims 34-41, wherein the amino acid sequence of the engineered TdT consists of the amino acid sequence of SEQ ID NO: 2 and an affinity tag, optionally aHis-tag.
45. The engineered TdT of any one of claims 34-41, wherein the engineered TdT further comprises a fluorescent protein label, optionally a GFP label.
46. The engineered TdT of any one of claims 34-41, wherein the engineered TdT further comprises an affinity tag, optionally a His-tag.
47. The engineered TdT of any one of claims 34-46, wherein the TdT further comprises the conjugation mutation E32C.
48. The engineered TdT of any one of claims 34-46, wherein the TdT further comprises the conjugation mutation E29C,
49. The engineered TdT of any one of claims 34-46, wherein the TdT further comprises the conjugation mutation E37C.
50. The engineered TdT of any one of claims 34-46, wherein the TdT further comprises the conjugation mutation V152C.
51. The engineered TdT of any one of claims 34-46, wherein the TdT further comprises the conjugation mutation K102C.
52. The engineered TdT of any one of claims 34-46, wherein the TdT further comprises the conjugation mutation D22C.
53. The engineered TdT of any one of claims 34-46, wherein the TdT further comprises the conjugation mutation R44C.
54. The engineered TdT of any one of claims 34-46, wherein the TdT further comprises the conjugation mutation L93C.
55. The engineered TdT of any one of claims 34-46, wherein the TdT further comprises the conjugation mutation P154C.
56. The engineered TdT of any one of claims 34-46, wherein the TdT further comprises the conjugation mutation N361C.
57. The engineered TdT of any one of claims 34-38, wherein the TdT comprises the amino acid sequence of any one of SEQ ID NOs: 11, 12, 13, and/or 15.
58. The engineered TdT of any one of claims 34-38, wherein the TdT comprises the amino acid sequence of SEQ ID NO: 15.
59. The engineered TdT of any one of claims 34-38, wherein the TdT comprises the ammo acid sequence of any one of SEQ ID NOs: 25, 26, 2.7, 28, and 29.
60. An engineered TdT comprising the ammo acid sequence of any one of SEQ ID NOs: 20, 21, 22, 23, and 24.
61. An engineered terminal deoxynucleotidyl transferase (TdT) consisting of the ammo acid sequence of SEQ ID NO: 15.
62. The engineered TdT of claim 34, wherein the one or more mutations comprise one or more of the following mutations: C7A, Q18K, L19K, D31 A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, 181L, K99Q, S100A, L112P, F119Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, SI64E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, a deletion of D263, a deletion of H264, a deletion of G265, a deletion of R266, a deletion of V267, a deletion of H2.68, a deletion of S269, a deletion of E270, a deletion of K2.71, a deletion of S272, Q274P, Q275S, E276P, a deletion of G279, a deletion of W280, a deletion of K281, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E, wherein position numbers are relative to SEQ ID NO: 1.
63. The engineered TdT of claim 34 or 62, wherein the one or more mutations comprise 10, 15, 20, 25, 30, 35, 40, 45, 50, or more of the following mutations: C7A, Q18K, L19K, D31 A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, F119Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M 19 IK, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, a deletion of D263, a deletion of H264, a deletion of G265, a deletion of R266, a deletion of V267, a deletion of H268, a deletion of S269, a deletion of E270, a deletion of K271, a deletion of S272, Q274P, Q275S, E276P, a deletion of G279, a deletion of W280, a deletion of K281, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E, wherein position numbers are relative to SEQ ID NO: 1 .
64. The engineered TdT of claims 34, 62 or 63, wherein the one or more mutations comprise 50 or more of the following mutations: C7A, Q18K, L19K, D31A, E35G, €40 A, M44R, S47A, C68N, S75R, E78Q, G79E, 181 L, K99Q, S100A, LI 12P, Fl 19Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, a deletion of D263, a deletion of H264, a deletion of G265, a deletion of R266, a deletion of V267, a deletion of H268, a deletion of S269, a deletion of E270, a deletion of K271, a deletion of S272, Q274P, Q275S, E276P, a deletion of G279, a deletion of W280, a deletion of K281, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E, wherein position numbers are relative to SEQ ID NO: 1.
65. The engineered TdT of any one of claims 34 or 62-64, wherein the one or more mutations comprise the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A, M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, F119Y, S127E, K128D, Q130R, SI 34 T, Q139R, C154P, N156T, V 1631, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, 1.2330. Q242L, C2.56A, a deletion of D263, a deletion of H264, a deletion of G265, a deletion of R266, a deletion of V267, a deletion of H268, a deletion of S269, a deletion of E270, a deletion of K271, a deletion of S272, Q274P, Q275S, E276P, a deletion of G279, a deletion of W280, a deletion of K281, C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E, wherein position numbers are relative to SEQ ID NO: 1.
66. The engineered TdT of any one of claims 34 or 62-65, wherein the one or more mutations consist of the following mutations: C7A, Q18K, L19K, D31A, E35G, C40A,
M44R, S47A, C68N, S75R, E78Q, G79E, I81L, K99Q, S100A, L112P, F119Y, S127E, K128D, Q130R, S134T, Q139R, C154P, N156T, V163I, S164E, M165Q, V172W, T173Q, D177G, L179T, M191K, D208E, Q211K, F220W, Q223K, C230E, L233Q, Q242L, C256A, a deletion of D263, a deletion of H264, a deletion of G265, a deletion of R266, a deletion of V267, a deletion of H268, a deletion of S269, a deletion of E270, a deletion of K271, a deletion of S272, Q274P, Q275S, E276P, a deletion of G279, a deletion of W280, a deletion of K281 , C290A, D293E, T317R, M323L, R335N, V336T, E339R, D353E, wherein position numbers are relative to SEQ ID NO: 1.
67. The engineered TdT of any one of claims 34 or 62-66, wherein the engineered TdT comprises an amino acid sequence with 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100% identity to SEQ ID NO: 3.
68. The engineered TdT of any one of claims 34 or 62-67, wherein the engineered TdT comprises the amino acid sequence of SEQ ID NO: 3.
69. The engineered TdT of any one of claims 34 or 62-68, wherein the amino acid sequence of the engineered TdT consists of the ammo acid sequence of SEQ ID NO: 3.
70. The engineered TdT of any one of claims 34 or 62-68, wherein the amino acid sequence of the engineered TdT consists of the ammo acid sequence of SEQ ID NO: 3 and a fluorescent protein label, optionally a GFP label.
71. The engineered TdT of any one of claims 34 or 62-68, wherein the amino acid sequence of the engineered TdT consists of the amino acid sequence of SEQ ID NO: 3 and an affinity tag, optionally a His-tag.
72. The engineered TdT of any one of claims 34 or 62-68, wherein the engineered TdT further comprises a fluorescent protein label, optionally a green fluorescent protein (GFP) label.
73. The engineered TdT of any one of claims 34 or 62-68, wherein the engineered TdT further comprises an affinity tag, optionally aHis-tag.
74. The engineered TdT of any one of claims 34 or 62-73, wherein the TdT further comprises the conjugation mutation E32nc, wherein nc is a non-canonical amino acid.
75. The engineered TdT of any one of claims 34 or 62-73, wherein the TdT further comprises the conjugation mutation V152nc, wherein nc is a non-canonical amino acid.
76. The engineered TdT of any one of claims 34 or 62-73, wherein the TdT further comprises the conjugation mutation P154nc, wherein nc is a non-canonical amino acid.
77. The engineered TdT of any one of ciaims 34 or 62-73, wherein the TdT further comprises the conjugation mutation E32C.
78. The engineered TdT of any one of claims 34 or 62-73, wherein the TdT further comprises the conjugation mutation V152C.
79. The engineered TdT of any one of claims 34 or 62-73, wherein the TdT further comprises the conjugation mutation P154C.
80. The engineered TdT of any one of ciaims 34 or 62-73, wherein the TdT further comprises the conjugation mutation E29C.
81. The engineered TdT of any one of claims 34 or 62-73, wherein the TdT further comprises the conjugation mutation E37C.
82. The engineered TdT of any one of claims 34 or 62-73, wherein the TdT further comprises the conjugation mutation R44C.
83. The engineered TdT of any one of claims 34 or 62-73, wherein the TdT further comprises the conjugation mutation K102C.
84. The engineered TdT of any one of claims 34 or 62-64, wherein the TdT comprises the amino acid sequence of SEQ ID NO: 4.
85. The engineered TdT of any one of claims 34 or 62-64, wherein the TdT comprises the amino acid sequence of SEQ ID NO: 5.
86. The engineered TdT of any one of claims 34 or 62-64, wherein the TdT comprises the ammo acid sequence of SEQ ID NO: 6.
87. The engineered TdT of any one of claims 34 or 62-64, wherein the TdT comprises the amino acid sequence of SEQ ID NO: 7.
88. The engineered TdT of any one of claims 34 or 62-64, wherein the TdT comprises the amino acid sequence of SEQ ID NO: 9.
89. The engineered TdT of any one of claims 34 or 62-64, wherein the TdT comprises the ammo acid sequence of SEQ ID NO: 10.
90. The engineered TdT of any one of claims 34 or 62-64, wherein the TdT comprises the ammo acid sequence of SEQ ID NO: 16.
91. The engineered TdT of any one of claims 34 or 62-64, wherein the TdT comprises the amino acid sequence of SEQ ID NO: 17.
92. The engineered TdT of any one of claims 34 or 62-64, wherein the TdT comprises the amino acid sequence of SEQ ID NO: 18
93. The engineered TdT of any one of claims 34 or 62-64, wherein the TdT comprises the amino acid sequence of SEQ ID NO: 19.
94. The engineered TdT of any of claims 34-93, wherein the TdT is capable of covalently binding to a nucleotide molecule at a conjugation residue of the engineered TdT via a tether, wherein:
(i) the conjugation residue comprises a cysteine residue, a non-canonical amino acid residue, or a lysine residue capable of reacting with a bifunctional crosslinker or a redox- cleavable linker; and
(ii) wherein the nucleotide molecule comprises a nucleotide, nucleotide triphosphate, or a nucleotide analog.
95. The engineered TdT of claim 94, wherein the TdT, tether, and nucleotide molecule comprise any one of the following structures:
96. The engineered TdT of claim 94. wherein the TdT, tether, and nucleotide molecule comprise any one of the following structures:
97. The engineered TdT of claim 94. wherein the TdT, tether, and nucleotide molecule comprise any one of the following structures:
98. The engineered TdT of claim 94, wherein the TdT, tether, and nucleotide molecule comprise any one of the following structures:
99. The engineered TdT of claim 94, wherein the TdT, tether, and nucleotide molecule comprise any one of the following structures:
100. The engineered TdT of claim 94, wherein the TdT, tether, and nucleotide molecule comprise any one of the following structures:
101. The engineered TdT of claim 94, wherein the TdT, tether, and nucleotide molecule comprise any one of the following structures:
102. The engineered TdT of claim 94. wherein the TdT, tether, and nucleotide molecule comprise any one of the following structures:
wherein Y is H or OMe; and wherein Z is any one of:
103. The engineered TdT of claim 94, wherein the TdT, tether, and nucleotide molecule comprise any one of the following structures:
wherein Y is H or OMe; and wherein Z is any one of:
104. A kit comprising the engineered TdT of any one of claims 34-93, wherein the TdT is capable of covalently binding to a nucleotide molecule at a conjugation residue of the engineered TdT via a tether, wherein the kit further comprises:
(i) a bifunctional crosslinker and/or a redox-cleavable linker comprising a functional group for covalent conjugation to a conjugation residue of the TdT: and
(ii) a nucleotide molecule.
105. The kit of claim 104, wherein the kit further comprises a spacer.
106. The kit of any one of claims 104-106, wherein the functional group for covalent conjugation comprises a maleiinide, N-hydroxysuccinimidyl group, or a dibenzylcyclooctyne (DBCO).
107. The kit of any one of claims 104-106, wherein the functional group for covalent conjugation comprises a maleinnde.
108. The kit of claim 107, wherein the maleiinide is capable of attaching to a conjugation residue of the engineered TdT, wherein the conjugation residue is a cysteine residue in the TdT.
109. The kit of any one of claims 104-106, w-herein the functional group for covalent conjugation comprises a DBCO.
110. The kit of claim 109, wherein the DBCO is capable of attaching to a conjugation residue of the engineered TdT, and wherein the conjugation residue is anon- canonical amino acid in the TdT.
111. The kit of claim 110, wherein the non-canonical amino acid is 4- azidophenylalanine.
112. The kit of any one of claims 104-111, wherein the redox-cleavable linker comprises a quinone oxime ether linker,
113. The kit of any one of claims 104-111, wherein the redox-cleavable linker comprises a disulfide linker.
114. The kit of any one of claims 104-111 , wherein the redox-cleavable linker comprises a syringic acid-based linker or vanillin-based linker.
115. The kit of any one of claims 105-114, wherein the spacer comprises one or more polyethylene glycol molecules (PEGk, optionally wherein “n” is 1 -12,
116. The kit of any one of claims 105-114, wherein the spacer comprises a C1-C10 alkyd.
117. The kit of any one of claims 105-114, wherein the spacer comprises a hydrocarbon chain wherein one or more of the carbons comprises a substituted sulfonate group.
118. The kit of any one of claims 105-114, wherein the spacer comprises a hydrocarbon chain comprising 1-6 carbons wherein one or more of the carbons comprises a substituted sulfonate group.
119. The kit of any one of claims 105-114, wherein the spacer comprises a C1-C6 alkyl.
120. The kit of claim 104, wherein the kit comprises a bifunctional crosslinker capable of being covalently linked to the engineered TdT and comprising a DBCO, wherein the DBCO of the bifunctional crosslinker is capable of reacting with any one of the following molecules:
121 . The kit of claim 104, wherein the kit comprises a bifunctional crosslinker capable of being covalently linked to the engineered TdT and comprising a DBCO, wherein the DBCO of the bifunctional crosslinker is capable of reacting with any one of the following molecules:
wherein X is C(0), C(O)NH, CH2, or O; and wherein n = 1 for X = C(O), n =1 for C(O)NH, n ===1 for CH2, and n :::: 1-4 for X ::: O, wherein n is an integer.
122. The kit of claim 104, wherein the kit comprises a bifunctional crosslinker capable of being covalently linked to the engineered TdT and comprising a DBCO, wherein the DBCO of the bifunctional crosstinker is capable of reacting with any one of the following molecules:
123. The kit of claim 104, wherein the kit comprises a bifunctional crosslinker capable of being covalently linked to the engineered TdT and comprising a DBCO, wherein the DBCO of the bifunctional crosslinker is capable of reacting with any one of the following molecules:
124. The kit of claim 104, wherein the kit comprises one or more of the following molecules capable of being reacted with the engineered TdT, wherein the conjugation residue is a cysteine residue in the TdT:
125. The kit of claim 104, wherein the kit comprises one or more of the following molecules capable of being reacted with the engineered TdT, wherein the conjugation residue is a cysteine residue in the TdT:
126, The kit of claim 104. wherein the kit compri ses one or more of the following molecules capable of being reacted with the engineered TdT, wherein the conjugation residue is anon-canonical amino acid residue in the TdT:
127, The kit of claim 104, wherein the kit comprises one or more of the following molecules capable of being reacted with the engineered TdT, wherein the conjugation residue is a lysine residue in the TdT:
128. A system for enzymatic DNA synthesis comprising: a. the engineered TdT of any one of claims 34-93; b. a redox-cleavable linker; c. a redox shutle solution; and d. two or more electrodes on a surface.
129. The system of claim 128, wherein the electrodes enable an applied potential or galvanic field to be supplied locally to the device or system thus enabling generation of active redox shuttle via reduction or oxidation at one or more electrodes.
130. The system of claim 128 or claim 129, wherein the electrode where the soluble redox shuttle is generated is comprised of a smaller surface area than the other electrode to which it is electrically coupled.
131. The system of any one of claims 128-130, wherein if the soluble redox shuttle is activated via cathodic process, then the cathode may be comprised of less active surface area than the anode by (a) having less electroactive area, or (b) being coupled to multiple anodes which in combination provide substantially more area than the cathode.
132. The system of any one of claims 128-131, wherein the surface, or electroactive, or area aspect ratios for the counter electrode relative to the working electrode, or C/E aspect ratio is at least: 3:1 , 6: 1, 10: 1, 100:1 , 1000: 1, or higher.
133. The system of any one of claims 128-132, wherein all the electrodes lie in a single plane.
134. The system of any one of claims 128-132, wherein a combination of electrodes may lie in multiple planes.
135. The system of any one of claims 128-132, wherein electrodes are oriented parallel to perpendicular with respect to one another during the current path.
136. The system of any one of claims 128-135, wherein the system has an interelectrode gap (i.e., distance between electrodes included in the current path) of <100 pm, <10 pm, <1 pm, <100 pm, <10 nm, <1 nm, <100 pm, <10 pm, or <1 pm.
137. The system of any one of claims 12.8-136, wherein the redox shuttle solution comprises at least one soluble redox shuttle in combination with a buffer, a supporting electrolyte, and/or a hydrotropic agent.
138. The system of claim 137, wherein the buffer is sodium citrate, potassium hydrogen phosphate, or potassium dihy drogen phosphate.
139. The system of claim 137 or 138, wherein the supporting electrolyte is sodium chloride, sodium salts, lithium salts, potassium salts, or magnesium salts.
140. The system of any one of claims 137-139, wherein the hydrotropic agent is caffeine, urea, and/or nicotinamide (NA).
141. The system of any one of claims 128-136, wherein the redox shuttle solution comprises methylene blue, citrate-phosphate buffer, and sodium chloride.
142. The system of any one of claims 128-136, wherein the redox shuttle solution comprises methylene blue, citrate-phosphate buffer, sodium chloride, and nicotinamide.
143. The system of any one of claims 128-136, wherein the redox shuttle solution comprises flavin mononucleotide, citrate-phosphate buffer, sodium chloride, and nicotinamide.
144. The system of any one of claims 128-136, wherein the redox shuttle solution comprises 10 mM methylene blue, 25 mM citrate-phosphate buffer at pH 3.4, and 100 mM sodium chloride.
145. The system of any one of claims 128-136, wherein the redox shuttle solution comprises 10 mM methylene blue, 25 mM citrate-phosphate buffer at pH 3.4, 100 mM sodium chloride, and 1 M nicotinamide.
146. The system of any one of claims 128-136, wherein the redox shuttle solution comprises 50 mM flavin mononucleotide, 25 mM citrate-phosphate buffer at pH 3.4, 1 M sodium chloride, and 1 M nicotinamide.
147. A redox shuttle solution comprising at least one soluble redox shuttle in combmation with one or more of the following: supporting electrolyte, buffer, and hydrotrope.
148. The redox shuttle solution of claim 147, comprising: 10 mM methylene blue, 25 mM citrate-phosphate buffer at pH 3.4, and 100 mM sodium chloride.
149. The redox shuttle solution of claim 147, comprising: 10 mM methylene blue, 25 mM citrate-phosphate buffer at pH 3.4, 100 mM sodium chloride, and 1 M nicotinamide.
150. The redox shuttle solution of claim 147, comprising: 50 mM flavin mononucleotide, 25 mM citrate-phosphate buffer at pH 3.4, 1 M sodium chloride, and 1 M nicotinamide.
157, A molecule comprising the following structure:
wherein X is O or NH: wherein Y is H or OMe; and wherem Z is any one of:
158. A molecule comprising the following structure:
wherein Y is H or OMe; and wherein Z is any one of:
159. A molecule covalently linked to a tether, comprising the following structure:
wherein Y is H or OMe; and wherein Z is any one of:
160. A molecule comprising any one of the following structures:
161, A molecule comprising any one of the following structures:
wherein X is C(O), C(O)NH, CH2, or O; and wherein n :::: 1 for X ::: C(O), n for C(O)NH, n =1 for CH2, and n = 1-4 for X = O, wherein n is an integer.
165. A molecule comprising any one of the following structures:
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202263375944P | 2022-09-16 | 2022-09-16 | |
US63/375,944 | 2022-09-16 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2024059703A1 true WO2024059703A1 (en) | 2024-03-21 |
Family
ID=88412153
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2023/074179 WO2024059703A1 (en) | 2022-09-16 | 2023-09-14 | Covalently modified template-independent dna polymerase and methods of use thereof |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2024059703A1 (en) |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040043396A1 (en) * | 2002-04-05 | 2004-03-04 | Rainer Mueller | Recombinant terminal deoxynucleotidyl transferase with improved functionality |
WO2017223517A1 (en) * | 2016-06-24 | 2017-12-28 | The Regents Of The University Of California | Nucleic acid synthesis and sequencing using tethered nucleoside triphosphates |
US20210238577A1 (en) * | 2020-02-04 | 2021-08-05 | Microsoft Technology Licensing, Llc | Electrochemically-cleavable linkers |
US20220023820A1 (en) | 2018-12-21 | 2022-01-27 | Microsoft Technology Licensing, Llc | Regulation of dna synthesis by nucleotides linked to protecting groups |
US20220025421A1 (en) * | 2020-07-21 | 2022-01-27 | Illumina Singapore Pte. Ltd. | Base-modified nucleotides as substrates for tdt-based enzymatic nucleic acid synthesis |
US20220145289A1 (en) * | 2020-10-19 | 2022-05-12 | Twist Bioscience Corporation | Methods of synthesizing oligonucleotides using tethered nucleotides |
WO2022212408A1 (en) | 2021-03-30 | 2022-10-06 | Ultima Genomics, Inc. | Benign scar-forming cleavable linkers |
-
2023
- 2023-09-14 WO PCT/US2023/074179 patent/WO2024059703A1/en unknown
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040043396A1 (en) * | 2002-04-05 | 2004-03-04 | Rainer Mueller | Recombinant terminal deoxynucleotidyl transferase with improved functionality |
WO2017223517A1 (en) * | 2016-06-24 | 2017-12-28 | The Regents Of The University Of California | Nucleic acid synthesis and sequencing using tethered nucleoside triphosphates |
US20220023820A1 (en) | 2018-12-21 | 2022-01-27 | Microsoft Technology Licensing, Llc | Regulation of dna synthesis by nucleotides linked to protecting groups |
US20210238577A1 (en) * | 2020-02-04 | 2021-08-05 | Microsoft Technology Licensing, Llc | Electrochemically-cleavable linkers |
WO2021158412A1 (en) | 2020-02-04 | 2021-08-12 | Microsoft Technology Licensing, Llc | Electrochemically-cleavable linkers |
US20220025421A1 (en) * | 2020-07-21 | 2022-01-27 | Illumina Singapore Pte. Ltd. | Base-modified nucleotides as substrates for tdt-based enzymatic nucleic acid synthesis |
US20220145289A1 (en) * | 2020-10-19 | 2022-05-12 | Twist Bioscience Corporation | Methods of synthesizing oligonucleotides using tethered nucleotides |
WO2022212408A1 (en) | 2021-03-30 | 2022-10-06 | Ultima Genomics, Inc. | Benign scar-forming cleavable linkers |
Non-Patent Citations (15)
Title |
---|
CHANG, X. ET AL.: "Design, synthesis, and biological evaluation of quinazolin-4(3 H)-one derivatives co-targeting poly(ADP-ribose) polymerase-1 and bromodomain containing protein 4 for breast cancer therapy", ACTA PHARM SIN. B, vol. 11, no. 1, 2021, pages 156 - 180 |
CIAMPI ET AL.: "Electrochemical ''Switching'' of Si(1 00) Molecular Assemblies", J. AM. CHEM. SOC., vol. 134, 2012, pages 844 - 847 |
CIAMPI ET AL.: "Electrochemical ''Switching'' of Si(100) Molecular Assemblies", J. AM. CHEM. SOC., vol. 134, 2012, pages 844 - 847 |
EUGENE W L CHAN ET AL: "An Electroactive Catalytic Dynamic Substrate that Immobilizes and Releases Patterned Ligands, Proteins, and Cells", ANGEWANDTE CHEMIE, WILEY - V C H VERLAG GMBH & CO. KGAA, DE, vol. 120, no. 33, 20 June 2008 (2008-06-20), pages 6363 - 6367, XP071342380, ISSN: 0044-8249, DOI: 10.1002/ANGE.200800166 * |
FERNANDEZ-PEREZ, AMARBAN, G.: "Visible Light Spectroscopic Analysis of Methylene Blue in Water: What Comes after Dimer?", ACS OMEGA, vol. 5, 2020, pages 29801 - 29815 |
GOIDENXWEIG, A ET AL.: "Automated Structure- and Sequence-Based Design of Proteins for High Bacterial Expression and Stability", MOLECULAR CELL, vol. 63, no. 2, 2016, pages 337 - 346, XP029653539, DOI: 10.1016/j.molcel.2016.06.012 |
GOUGE, JEROME ET AL.: "Structures of intermediates along the catalytic cycle of terminal deoxynucleotidyl transferase: dynamical aspects of the two-metal ion mechanism", JOURNAL OF MOLECULAR BIOLOGY, vol. 425, no. 22, 2013, pages 4334 - 4352 |
HUTTER. D. ET AL.: "Labeled nucleoside triphosphates with reversibly terminating aminoalkoxyl groups", NITELEOSIDES NUCLEOTIDES NUCLEIC ACIDS, vol. 29, no. 11, 2010 |
HUTTER. D. ET AL.: "Labeled nucleoside triphosphates with reversibly terminating aminoalkoxyl groups", NUCLEOSIDES NUCLEOTIDES NUCLEIC ACIDS, vol. 29, no. 11, 2010 |
KIRAN, K ET AL.: "Synthesis, characterization. and antibacterial activity of some novel vanillin related hydrazone derivatives bearing 1.2,3-triazole ring", RUSSIAN JOURNAL OF GENERA! CHEMISTRY, vol. 87, 2017, pages 1288 - 1294, XP036289662, DOI: 10.1134/S1070363217060251 |
KIRAN, K. ET AL.: "Synthesis, characterization, and antibacterial activity of some novel vanillin related hydrazone derivatives bearing 1,2,3-triazole ring", RUSSIAN JOURNAL OF GENERAL CHEMISTRY, vol. 87, 2017, pages 1288 - 1294, XP036289662, DOI: 10.1134/S1070363217060251 |
OKOH ET AL.: "Trimethyl Lock: A Multifunctional Molecular Tool for Drug Delivery, Cellular Imaging, and Stimuli-Responsive Materials", CHEMBIOCHEM, vol. 19, no. 7, 2018, pages 1668 - 1694 |
ORITA, A ET AL.: "A biometric redox flow battery based on flavin mononucleotide", NAT. COMMUN, vol. 7, 2016, pages 13230 |
PARK, S.WESTCOTT, N. P.LUO, W.DUTTO, D.YOUSAF, M. N: "General chemoselective and redox-responsive ligation and release strategy", BIOCONJUGATE CHEMISTRY, vol. 25, 2014, pages 543 - 551 |
YOUNG ET AL.: "Beyond the canonical 20 amino acids: expanding the genetic lexicon", J. OF BIOLOGICAL CHEMISTRY, vol. 285, no. 15, 2010, pages 11039 - 1 1044, XP055157080, DOI: 10.1074/jbc.R109.091306 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20200231539A1 (en) | Multiple cycloaddition reactions for labeling of molecules | |
JP6831903B2 (en) | Peptide compounds and methods for producing them, screening compositions, and methods for selecting peptide compounds. | |
EP2804872B1 (en) | Norbornene modified peptides and their labelling with tetrazine compounds | |
US8889843B2 (en) | Nucleic acid synthesizing dimer amidite and nucleic acid synthesizing method | |
US9243274B2 (en) | Genetically encoded initiator for polymer growth from proteins | |
Bhandari et al. | Mechanistic studies on the radical SAM enzyme tryptophan lyase (NosL) | |
US20220010296A1 (en) | Methods and compositions | |
WO2024059703A1 (en) | Covalently modified template-independent dna polymerase and methods of use thereof | |
EP3879013A1 (en) | Peptide library production method | |
Kore et al. | Synthesis of 5-[3, 6-di (pyridin-2-yl) pyridazine-4-yl]-2′-deoxyuridine-5′-O-triphosphate—a potential probe for fluorescence detection and imaging DNA | |
Hu et al. | Method for enzyme design with genetically encoded unnatural amino acids | |
JP6618534B2 (en) | Amino acid modified nucleic acids and their use | |
EP3467107A1 (en) | Ribosome display complex and production method therefor | |
US20220170062A1 (en) | Rna capping method, production method for modified rna, and modified rna | |
KR101568336B1 (en) | Cell producing mutant of target protein, preparing method thereof, and producing method of mutant of target protein using the cell | |
EP3019194B1 (en) | Production of selenoproteins (selprot) | |
US20200055891A1 (en) | Novel phosphotriazole mrna 5'-end cap analogs, composition comprising the same, rna molecule incorporating the same, uses thereof and method of synthesizing rna molecule, protein or peptide | |
Kugler | Investigation of Peptidic Templates for Bioorthogonal Ligations | |
JP4163821B2 (en) | Oligonucleotide modification method | |
Morris | Biochemical Insights into Unnatural Nucleotides that Expand the Genetic Alphabet and Code of Living Organisms | |
Talukder | Small molecules as probes of biological systems | |
EP2016088A2 (en) | Process for preparing disulphides and thiosulphinates and compounds prepared | |
Mayer | Expanding the chemical biology toolbox: Site-specific incorporation of unnatural amino acids and bioorthogonal protein labeling to study structure and function of proteins | |
EP4347572A1 (en) | Amino acids bearing a tetrazine moiety | |
Middel | Photocleavable Templates for PNA Base Pairing Mediated Native Chemical Ligation |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 23789839 Country of ref document: EP Kind code of ref document: A1 |