US12054541B2 - Split inteins, conjugates and uses thereof - Google Patents
Split inteins, conjugates and uses thereof Download PDFInfo
- Publication number
- US12054541B2 US12054541B2 US17/491,729 US202117491729A US12054541B2 US 12054541 B2 US12054541 B2 US 12054541B2 US 202117491729 A US202117491729 A US 202117491729A US 12054541 B2 US12054541 B2 US 12054541B2
- Authority
- US
- United States
- Prior art keywords
- intein
- fragment
- seq
- split
- protein
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 230000017730 intein-mediated protein splicing Effects 0.000 title claims abstract description 257
- 238000000034 method Methods 0.000 claims abstract description 36
- 239000012634 fragment Substances 0.000 claims description 157
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 79
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 46
- 229920001184 polypeptide Polymers 0.000 claims description 45
- 150000007970 thio esters Chemical class 0.000 claims description 40
- 108010013829 alpha subunit DNA polymerase III Proteins 0.000 claims description 36
- 102000037865 fusion proteins Human genes 0.000 claims description 30
- 108020001507 fusion proteins Proteins 0.000 claims description 30
- 239000011347 resin Substances 0.000 claims description 28
- 229920005989 resin Polymers 0.000 claims description 28
- 239000012038 nucleophile Substances 0.000 claims description 23
- 150000003573 thiols Chemical class 0.000 claims description 17
- 229920000642 polymer Polymers 0.000 claims description 16
- 239000006228 supernatant Substances 0.000 claims description 11
- 239000011324 bead Substances 0.000 claims description 6
- 108091034117 Oligonucleotide Proteins 0.000 claims description 5
- 239000003814 drug Substances 0.000 claims description 5
- 229940079593 drug Drugs 0.000 claims description 4
- 239000002245 particle Substances 0.000 claims description 4
- 108091033319 polynucleotide Proteins 0.000 claims description 4
- 102000040430 polynucleotide Human genes 0.000 claims description 4
- 239000002157 polynucleotide Substances 0.000 claims description 4
- 238000005406 washing Methods 0.000 claims description 4
- 150000001412 amines Chemical class 0.000 claims description 3
- 239000012130 whole-cell lysate Substances 0.000 claims description 3
- 125000002344 aminooxy group Chemical group [H]N([H])O[*] 0.000 claims description 2
- OAKJQQAXSVQMHS-UHFFFAOYSA-N Hydrazine Chemical group NN OAKJQQAXSVQMHS-UHFFFAOYSA-N 0.000 claims 2
- 150000001356 alkyl thiols Chemical class 0.000 claims 1
- 150000001504 aryl thiols Chemical class 0.000 claims 1
- ZNEWHQLOPFWXOF-UHFFFAOYSA-N coenzyme M Chemical compound OS(=O)(=O)CCS ZNEWHQLOPFWXOF-UHFFFAOYSA-N 0.000 claims 1
- 239000002105 nanoparticle Substances 0.000 claims 1
- 108090000623 proteins and genes Proteins 0.000 abstract description 144
- 102000004169 proteins and genes Human genes 0.000 abstract description 129
- 235000018102 proteins Nutrition 0.000 description 121
- 238000006243 chemical reaction Methods 0.000 description 67
- 239000013612 plasmid Substances 0.000 description 52
- 238000000746 purification Methods 0.000 description 47
- 230000000694 effects Effects 0.000 description 46
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 36
- 210000004027 cell Anatomy 0.000 description 31
- 239000000543 intermediate Substances 0.000 description 30
- 238000004007 reversed phase HPLC Methods 0.000 description 28
- 238000007079 thiolysis reaction Methods 0.000 description 27
- 238000004458 analytical method Methods 0.000 description 26
- 238000003776 cleavage reaction Methods 0.000 description 26
- 230000004927 fusion Effects 0.000 description 25
- 230000035772 mutation Effects 0.000 description 25
- 230000007017 scission Effects 0.000 description 25
- 239000000499 gel Substances 0.000 description 24
- 239000000872 buffer Substances 0.000 description 23
- 239000000047 product Substances 0.000 description 22
- 230000015572 biosynthetic process Effects 0.000 description 21
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 20
- 150000001413 amino acids Chemical group 0.000 description 19
- DTQVDTLACAAQTR-UHFFFAOYSA-N Trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F DTQVDTLACAAQTR-UHFFFAOYSA-N 0.000 description 18
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 18
- 235000001014 amino acid Nutrition 0.000 description 18
- 229940024606 amino acid Drugs 0.000 description 18
- 210000004899 c-terminal region Anatomy 0.000 description 17
- 238000001727 in vivo Methods 0.000 description 17
- 230000014509 gene expression Effects 0.000 description 16
- YMWUJEATGCHHMB-UHFFFAOYSA-N Dichloromethane Chemical compound ClCCl YMWUJEATGCHHMB-UHFFFAOYSA-N 0.000 description 15
- ZMXDDKWLCZADIW-UHFFFAOYSA-N N,N-Dimethylformamide Chemical compound CN(C)C=O ZMXDDKWLCZADIW-UHFFFAOYSA-N 0.000 description 15
- 238000003556 assay Methods 0.000 description 15
- 239000012139 lysis buffer Substances 0.000 description 15
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 15
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 14
- 108090000848 Ubiquitin Proteins 0.000 description 14
- 102000044159 Ubiquitin Human genes 0.000 description 14
- 150000001408 amides Chemical class 0.000 description 14
- 230000003197 catalytic effect Effects 0.000 description 13
- 239000000243 solution Substances 0.000 description 13
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 12
- ZNEWHQLOPFWXOF-UHFFFAOYSA-M coenzyme M(1-) Chemical compound [O-]S(=O)(=O)CCS ZNEWHQLOPFWXOF-UHFFFAOYSA-M 0.000 description 12
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 12
- PBVAJRFEEOIAGW-UHFFFAOYSA-N 3-[bis(2-carboxyethyl)phosphanyl]propanoic acid;hydrochloride Chemical compound Cl.OC(=O)CCP(CCC(O)=O)CCC(O)=O PBVAJRFEEOIAGW-UHFFFAOYSA-N 0.000 description 11
- 238000010828 elution Methods 0.000 description 11
- 238000000338 in vitro Methods 0.000 description 11
- 229930027917 kanamycin Natural products 0.000 description 11
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 11
- 229960000318 kanamycin Drugs 0.000 description 11
- 229930182823 kanamycin A Natural products 0.000 description 11
- 239000000203 mixture Substances 0.000 description 11
- 239000007787 solid Substances 0.000 description 11
- 241000894007 species Species 0.000 description 11
- 241000588724 Escherichia coli Species 0.000 description 10
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 10
- 229910019142 PO4 Inorganic materials 0.000 description 10
- 239000013592 cell lysate Substances 0.000 description 10
- 238000002523 gelfiltration Methods 0.000 description 10
- 238000004128 high performance liquid chromatography Methods 0.000 description 10
- 235000021317 phosphate Nutrition 0.000 description 10
- 239000011780 sodium chloride Substances 0.000 description 10
- 239000004202 carbamide Substances 0.000 description 9
- 239000002773 nucleotide Substances 0.000 description 9
- 229920001223 polyethylene glycol Polymers 0.000 description 9
- 238000001262 western blot Methods 0.000 description 9
- 239000002202 Polyethylene glycol Substances 0.000 description 8
- 238000011068 loading method Methods 0.000 description 8
- 125000003729 nucleotide group Chemical group 0.000 description 8
- 239000011550 stock solution Substances 0.000 description 8
- 239000000126 substance Substances 0.000 description 8
- -1 thiol sodium 2-mercaptoethanesulfonate Chemical class 0.000 description 8
- 108091035707 Consensus sequence Proteins 0.000 description 7
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 7
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 7
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 7
- 230000007062 hydrolysis Effects 0.000 description 7
- 238000006460 hydrolysis reaction Methods 0.000 description 7
- 238000011534 incubation Methods 0.000 description 7
- 239000008188 pellet Substances 0.000 description 7
- 239000010452 phosphate Substances 0.000 description 7
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 7
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 6
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 6
- 239000011542 SDS running buffer Substances 0.000 description 6
- QWCKQJZIFLGMSD-UHFFFAOYSA-N alpha-aminobutyric acid Chemical compound CCC(N)C(O)=O QWCKQJZIFLGMSD-UHFFFAOYSA-N 0.000 description 6
- OWMVSZAMULFTJU-UHFFFAOYSA-N bis-tris Chemical compound OCCN(CCO)C(CO)(CO)CO OWMVSZAMULFTJU-UHFFFAOYSA-N 0.000 description 6
- 238000010367 cloning Methods 0.000 description 6
- 239000000562 conjugate Substances 0.000 description 6
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 6
- 229910052740 iodine Inorganic materials 0.000 description 6
- 238000004949 mass spectrometry Methods 0.000 description 6
- 230000004048 modification Effects 0.000 description 6
- 238000012986 modification Methods 0.000 description 6
- 150000007523 nucleic acids Chemical class 0.000 description 6
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 6
- 230000016434 protein splicing Effects 0.000 description 6
- AIDBEARHLBRLMO-UHFFFAOYSA-M sodium;dodecyl sulfate;2-morpholin-4-ylethanesulfonic acid Chemical compound [Na+].OS(=O)(=O)CCN1CCOCC1.CCCCCCCCCCCCOS([O-])(=O)=O AIDBEARHLBRLMO-UHFFFAOYSA-M 0.000 description 6
- 229910052720 vanadium Inorganic materials 0.000 description 6
- 239000011534 wash buffer Substances 0.000 description 6
- PJUPKRYGDFTMTM-UHFFFAOYSA-N 1-hydroxybenzotriazole;hydrate Chemical compound O.C1=CC=C2N(O)N=NC2=C1 PJUPKRYGDFTMTM-UHFFFAOYSA-N 0.000 description 5
- 102000004190 Enzymes Human genes 0.000 description 5
- 108090000790 Enzymes Proteins 0.000 description 5
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 5
- 239000004471 Glycine Substances 0.000 description 5
- SNDPXSYFESPGGJ-UHFFFAOYSA-N L-norVal-OH Natural products CCCC(N)C(O)=O SNDPXSYFESPGGJ-UHFFFAOYSA-N 0.000 description 5
- 108700005078 Synthetic Genes Proteins 0.000 description 5
- 235000018417 cysteine Nutrition 0.000 description 5
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 5
- 231100000673 dose–response relationship Toxicity 0.000 description 5
- 125000000524 functional group Chemical group 0.000 description 5
- NPZTUJOABDZTLV-UHFFFAOYSA-N hydroxybenzotriazole Substances O=C1C=CC=C2NNN=C12 NPZTUJOABDZTLV-UHFFFAOYSA-N 0.000 description 5
- 210000003000 inclusion body Anatomy 0.000 description 5
- 230000003993 interaction Effects 0.000 description 5
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 5
- 239000012528 membrane Substances 0.000 description 5
- 108020004707 nucleic acids Proteins 0.000 description 5
- 102000039446 nucleic acids Human genes 0.000 description 5
- 238000005580 one pot reaction Methods 0.000 description 5
- 239000012071 phase Substances 0.000 description 5
- 230000002441 reversible effect Effects 0.000 description 5
- 239000013598 vector Substances 0.000 description 5
- RTZKZFJDLAIYFH-UHFFFAOYSA-N Diethyl ether Chemical compound CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 4
- 101800002094 Mxe GyrA intein Proteins 0.000 description 4
- NQRYJNQNLNOLGT-UHFFFAOYSA-N Piperidine Chemical compound C1CCNCC1 NQRYJNQNLNOLGT-UHFFFAOYSA-N 0.000 description 4
- 238000009835 boiling Methods 0.000 description 4
- 235000014633 carbohydrates Nutrition 0.000 description 4
- 150000001720 carbohydrates Chemical class 0.000 description 4
- 238000005119 centrifugation Methods 0.000 description 4
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 4
- 150000001875 compounds Chemical class 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 125000001165 hydrophobic group Chemical group 0.000 description 4
- 238000012750 in vivo screening Methods 0.000 description 4
- 210000004962 mammalian cell Anatomy 0.000 description 4
- 238000007857 nested PCR Methods 0.000 description 4
- 230000002018 overexpression Effects 0.000 description 4
- 239000002243 precursor Substances 0.000 description 4
- 230000002028 premature Effects 0.000 description 4
- 230000009145 protein modification Effects 0.000 description 4
- 239000000376 reactant Substances 0.000 description 4
- 230000002829 reductive effect Effects 0.000 description 4
- 239000007858 starting material Substances 0.000 description 4
- 238000012360 testing method Methods 0.000 description 4
- 125000003088 (fluoren-9-ylmethoxy)carbonyl group Chemical group 0.000 description 3
- DHBXNPKRAUYBTH-UHFFFAOYSA-N 1,1-ethanedithiol Chemical compound CC(S)S DHBXNPKRAUYBTH-UHFFFAOYSA-N 0.000 description 3
- 102100020928 14 kDa phosphohistidine phosphatase Human genes 0.000 description 3
- 101710082470 14 kDa phosphohistidine phosphatase Proteins 0.000 description 3
- LRQKBLKVPFOOQJ-UHFFFAOYSA-N 2-aminohexanoic acid Chemical compound CCCCC(N)C(O)=O LRQKBLKVPFOOQJ-UHFFFAOYSA-N 0.000 description 3
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 3
- BZTDTCNHAFUJOG-UHFFFAOYSA-N 6-carboxyfluorescein Chemical compound C12=CC=C(O)C=C2OC2=CC(O)=CC=C2C11OC(=O)C2=CC=C(C(=O)O)C=C21 BZTDTCNHAFUJOG-UHFFFAOYSA-N 0.000 description 3
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 3
- 229920000936 Agarose Polymers 0.000 description 3
- 238000009010 Bradford assay Methods 0.000 description 3
- 125000001433 C-terminal amino-acid group Chemical group 0.000 description 3
- 229920002307 Dextran Polymers 0.000 description 3
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 3
- SNDPXSYFESPGGJ-BYPYZUCNSA-N L-2-aminopentanoic acid Chemical compound CCC[C@H](N)C(O)=O SNDPXSYFESPGGJ-BYPYZUCNSA-N 0.000 description 3
- LRQKBLKVPFOOQJ-YFKPBYRVSA-N L-norleucine Chemical compound CCCC[C@H]([NH3+])C([O-])=O LRQKBLKVPFOOQJ-YFKPBYRVSA-N 0.000 description 3
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 3
- JGFZNNIVVJXRND-UHFFFAOYSA-N N,N-Diisopropylethylamine (DIPEA) Chemical compound CCN(C(C)C)C(C)C JGFZNNIVVJXRND-UHFFFAOYSA-N 0.000 description 3
- 239000002033 PVDF binder Substances 0.000 description 3
- DNIAPMSPPWPWGF-UHFFFAOYSA-N Propylene glycol Chemical compound CC(O)CO DNIAPMSPPWPWGF-UHFFFAOYSA-N 0.000 description 3
- 229920002684 Sepharose Polymers 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- 239000006180 TBST buffer Substances 0.000 description 3
- 241000723792 Tobacco etch virus Species 0.000 description 3
- 238000009825 accumulation Methods 0.000 description 3
- 125000001931 aliphatic group Chemical group 0.000 description 3
- 125000000539 amino acid group Chemical group 0.000 description 3
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 3
- 229960000723 ampicillin Drugs 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 230000000903 blocking effect Effects 0.000 description 3
- 239000006227 byproduct Substances 0.000 description 3
- 239000001913 cellulose Substances 0.000 description 3
- 229920002678 cellulose Polymers 0.000 description 3
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 3
- 125000000151 cysteine group Chemical class N[C@@H](CS)C(=O)* 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- 238000010790 dilution Methods 0.000 description 3
- 239000012895 dilution Substances 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 229910052731 fluorine Inorganic materials 0.000 description 3
- 230000002068 genetic effect Effects 0.000 description 3
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical class O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 3
- 239000007924 injection Substances 0.000 description 3
- 238000002347 injection Methods 0.000 description 3
- 150000002632 lipids Chemical class 0.000 description 3
- 235000013336 milk Nutrition 0.000 description 3
- 239000008267 milk Substances 0.000 description 3
- 210000004080 milk Anatomy 0.000 description 3
- 230000007935 neutral effect Effects 0.000 description 3
- NFHFRUOZVGFOOS-UHFFFAOYSA-N palladium;triphenylphosphane Chemical compound [Pd].C1=CC=CC=C1P(C=1C=CC=CC=1)C1=CC=CC=C1.C1=CC=CC=C1P(C=1C=CC=CC=1)C1=CC=CC=C1.C1=CC=CC=C1P(C=1C=CC=CC=1)C1=CC=CC=C1.C1=CC=CC=C1P(C=1C=CC=CC=1)C1=CC=CC=C1 NFHFRUOZVGFOOS-UHFFFAOYSA-N 0.000 description 3
- 150000003013 phosphoric acid derivatives Chemical class 0.000 description 3
- 229920002981 polyvinylidene fluoride Polymers 0.000 description 3
- 238000002360 preparation method Methods 0.000 description 3
- 239000012460 protein solution Substances 0.000 description 3
- 238000007086 side reaction Methods 0.000 description 3
- 229910052717 sulfur Inorganic materials 0.000 description 3
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical class CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 3
- ZGYICYBLPGRURT-UHFFFAOYSA-N tri(propan-2-yl)silicon Chemical compound CC(C)[Si](C(C)C)C(C)C ZGYICYBLPGRURT-UHFFFAOYSA-N 0.000 description 3
- GPRLSGONYQIRFK-MNYXATJNSA-N triton Chemical compound [3H+] GPRLSGONYQIRFK-MNYXATJNSA-N 0.000 description 3
- 239000003643 water by type Substances 0.000 description 3
- 229910052727 yttrium Inorganic materials 0.000 description 3
- UHPQFNXOFFPHJW-UHFFFAOYSA-N (4-methylphenyl)-phenylmethanamine Chemical compound C1=CC(C)=CC=C1C(N)C1=CC=CC=C1 UHPQFNXOFFPHJW-UHFFFAOYSA-N 0.000 description 2
- MYRTYDVEIRVNKP-UHFFFAOYSA-N 1,2-Divinylbenzene Chemical class C=CC1=CC=CC=C1C=C MYRTYDVEIRVNKP-UHFFFAOYSA-N 0.000 description 2
- FZTIWOBQQYPTCJ-UHFFFAOYSA-N 4-[4-(4-carboxyphenyl)phenyl]benzoic acid Chemical compound C1=CC(C(=O)O)=CC=C1C1=CC=C(C=2C=CC(=CC=2)C(O)=O)C=C1 FZTIWOBQQYPTCJ-UHFFFAOYSA-N 0.000 description 2
- 125000004042 4-aminobutyl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])C([H])([H])N([H])[H] 0.000 description 2
- 241001247255 Aphanothece halophytica Species 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 208000034628 Celiac artery compression syndrome Diseases 0.000 description 2
- 229920002101 Chitin Polymers 0.000 description 2
- 241000159506 Cyanothece Species 0.000 description 2
- 108010030351 DEC-205 receptor Proteins 0.000 description 2
- 108020004414 DNA Proteins 0.000 description 2
- 108010054814 DNA Gyrase Proteins 0.000 description 2
- LYCAIKOWRPUZTN-UHFFFAOYSA-N Ethylene glycol Chemical compound OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 description 2
- 108010033040 Histones Proteins 0.000 description 2
- XEEYBQQBJWHFJM-UHFFFAOYSA-N Iron Chemical compound [Fe] XEEYBQQBJWHFJM-UHFFFAOYSA-N 0.000 description 2
- 108010025815 Kanamycin Kinase Proteins 0.000 description 2
- 239000004793 Polystyrene Substances 0.000 description 2
- 229940124158 Protease/peptidase inhibitor Drugs 0.000 description 2
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 2
- 101800001978 Ssp dnaB intein Proteins 0.000 description 2
- PPBRXRYQALVLMV-UHFFFAOYSA-N Styrene Chemical compound C=CC1=CC=CC=C1 PPBRXRYQALVLMV-UHFFFAOYSA-N 0.000 description 2
- 241000192584 Synechocystis Species 0.000 description 2
- 108010076818 TEV protease Proteins 0.000 description 2
- 239000007983 Tris buffer Substances 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- 230000003213 activating effect Effects 0.000 description 2
- 125000002252 acyl group Chemical group 0.000 description 2
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 2
- 238000001261 affinity purification Methods 0.000 description 2
- 230000002776 aggregation Effects 0.000 description 2
- 238000004220 aggregation Methods 0.000 description 2
- 150000001345 alkine derivatives Chemical class 0.000 description 2
- 102000006646 aminoglycoside phosphotransferase Human genes 0.000 description 2
- 125000003118 aryl group Chemical group 0.000 description 2
- 150000001540 azides Chemical class 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- 150000001732 carboxylic acid derivatives Chemical class 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 229920001577 copolymer Polymers 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 238000006297 dehydration reaction Methods 0.000 description 2
- 239000003398 denaturant Substances 0.000 description 2
- 238000010511 deprotection reaction Methods 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 238000001425 electrospray ionisation time-of-flight mass spectrometry Methods 0.000 description 2
- 239000012149 elution buffer Substances 0.000 description 2
- 125000002485 formyl group Chemical group [H]C(*)=O 0.000 description 2
- 238000001502 gel electrophoresis Methods 0.000 description 2
- 239000011521 glass Substances 0.000 description 2
- 229930195712 glutamate Natural products 0.000 description 2
- 235000013922 glutamic acid Nutrition 0.000 description 2
- 239000004220 glutamic acid Substances 0.000 description 2
- 125000003630 glycyl group Chemical group [H]N([H])C([H])([H])C(*)=O 0.000 description 2
- 238000003306 harvesting Methods 0.000 description 2
- 239000006166 lysate Substances 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 229930182817 methionine Natural products 0.000 description 2
- 238000002156 mixing Methods 0.000 description 2
- 238000002715 modification method Methods 0.000 description 2
- 102000035118 modified proteins Human genes 0.000 description 2
- 108091005573 modified proteins Proteins 0.000 description 2
- 239000000178 monomer Substances 0.000 description 2
- 238000000569 multi-angle light scattering Methods 0.000 description 2
- 238000002887 multiple sequence alignment Methods 0.000 description 2
- 238000002703 mutagenesis Methods 0.000 description 2
- 231100000350 mutagenesis Toxicity 0.000 description 2
- 230000000269 nucleophilic effect Effects 0.000 description 2
- 229920001542 oligosaccharide Polymers 0.000 description 2
- 150000002482 oligosaccharides Chemical class 0.000 description 2
- KDLHZDBZIXYQEI-UHFFFAOYSA-N palladium Substances [Pd] KDLHZDBZIXYQEI-UHFFFAOYSA-N 0.000 description 2
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 2
- PARWUHTVGZSQPD-UHFFFAOYSA-N phenylsilane Chemical compound [SiH3]C1=CC=CC=C1 PARWUHTVGZSQPD-UHFFFAOYSA-N 0.000 description 2
- 229920001451 polypropylene glycol Polymers 0.000 description 2
- 229920002223 polystyrene Polymers 0.000 description 2
- 229920002451 polyvinyl alcohol Polymers 0.000 description 2
- 235000019422 polyvinyl alcohol Nutrition 0.000 description 2
- 230000004481 post-translational protein modification Effects 0.000 description 2
- 230000004853 protein function Effects 0.000 description 2
- 238000001814 protein method Methods 0.000 description 2
- 239000011541 reaction mixture Substances 0.000 description 2
- 238000011084 recovery Methods 0.000 description 2
- 238000009877 rendering Methods 0.000 description 2
- 210000003705 ribosome Anatomy 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 238000002864 sequence alignment Methods 0.000 description 2
- 238000002741 site-directed mutagenesis Methods 0.000 description 2
- 239000002002 slurry Substances 0.000 description 2
- WWGXHTXOZKVJDN-UHFFFAOYSA-M sodium;n,n-diethylcarbamodithioate;trihydrate Chemical compound O.O.O.[Na+].CCN(CC)C([S-])=S WWGXHTXOZKVJDN-UHFFFAOYSA-M 0.000 description 2
- 239000002904 solvent Substances 0.000 description 2
- 238000000527 sonication Methods 0.000 description 2
- 238000003860 storage Methods 0.000 description 2
- KZNICNPSHKQLFF-UHFFFAOYSA-N succinimide Chemical compound O=C1CCC(=O)N1 KZNICNPSHKQLFF-UHFFFAOYSA-N 0.000 description 2
- 235000000346 sugar Nutrition 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 150000003536 tetrazoles Chemical class 0.000 description 2
- CBDKQYKMCICBOF-UHFFFAOYSA-N thiazoline Chemical compound C1CN=CS1 CBDKQYKMCICBOF-UHFFFAOYSA-N 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 2
- 229920003169 water-soluble polymer Polymers 0.000 description 2
- JDTOWOURWBDELG-QHCPKHFHSA-N (2r)-2-[(2-methylpropan-2-yl)oxycarbonylamino]-3-tritylsulfanylpropanoic acid Chemical compound C=1C=CC=CC=1C(C=1C=CC=CC=1)(SC[C@H](NC(=O)OC(C)(C)C)C(O)=O)C1=CC=CC=C1 JDTOWOURWBDELG-QHCPKHFHSA-N 0.000 description 1
- OJBNDXHENJDCBA-QFIPXVFZSA-N (2s)-2-(9h-fluoren-9-ylmethoxycarbonylamino)-6-(prop-2-enoxycarbonylamino)hexanoic acid Chemical compound C1=CC=C2C(COC(=O)N[C@@H](CCCCNC(=O)OCC=C)C(=O)O)C3=CC=CC=C3C2=C1 OJBNDXHENJDCBA-QFIPXVFZSA-N 0.000 description 1
- XSYUPRQVAHJETO-WPMUBMLPSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidaz Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 XSYUPRQVAHJETO-WPMUBMLPSA-N 0.000 description 1
- UYHQUNLVWOAJQW-UHFFFAOYSA-N 1,3-benzothiazole-2-carbonitrile Chemical compound C1=CC=C2SC(C#N)=NC2=C1 UYHQUNLVWOAJQW-UHFFFAOYSA-N 0.000 description 1
- KJUGUADJHNHALS-UHFFFAOYSA-N 1H-tetrazole Substances C=1N=NNN=1 KJUGUADJHNHALS-UHFFFAOYSA-N 0.000 description 1
- NDKDFTQNXLHCGO-UHFFFAOYSA-N 2-(9h-fluoren-9-ylmethoxycarbonylamino)acetic acid Chemical compound C1=CC=C2C(COC(=O)NCC(=O)O)C3=CC=CC=C3C2=C1 NDKDFTQNXLHCGO-UHFFFAOYSA-N 0.000 description 1
- FMYBFLOWKQRBST-UHFFFAOYSA-N 2-[bis(carboxymethyl)amino]acetic acid;nickel Chemical compound [Ni].OC(=O)CN(CC(O)=O)CC(O)=O FMYBFLOWKQRBST-UHFFFAOYSA-N 0.000 description 1
- WHNPOQXWAMXPTA-UHFFFAOYSA-N 3-methylbut-2-enamide Chemical compound CC(C)=CC(N)=O WHNPOQXWAMXPTA-UHFFFAOYSA-N 0.000 description 1
- NJYVEMPWNAYQQN-UHFFFAOYSA-N 5-carboxyfluorescein Chemical compound C12=CC=C(O)C=C2OC2=CC(O)=CC=C2C21OC(=O)C1=CC(C(=O)O)=CC=C21 NJYVEMPWNAYQQN-UHFFFAOYSA-N 0.000 description 1
- 208000035657 Abasia Diseases 0.000 description 1
- HRPVXLWXLXDGHG-UHFFFAOYSA-N Acrylamide Chemical compound NC(=O)C=C HRPVXLWXLXDGHG-UHFFFAOYSA-N 0.000 description 1
- 241000192542 Anabaena Species 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 241001277507 Chrysosporum ovalisporum Species 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 108700010070 Codon Usage Proteins 0.000 description 1
- 241000249091 Coleofasciculus chthonoplastes PCC 7420 Species 0.000 description 1
- 241000383377 Crocosphaera watsonii WH 8501 Species 0.000 description 1
- PJWWRFATQTVXHA-UHFFFAOYSA-N Cyclohexylaminopropanesulfonic acid Chemical compound OS(=O)(=O)CCCNC1CCCCC1 PJWWRFATQTVXHA-UHFFFAOYSA-N 0.000 description 1
- 241001418197 Cylindrospermopsis raciborskii CS-505 Species 0.000 description 1
- 101100015729 Drosophila melanogaster drk gene Proteins 0.000 description 1
- UNXHWFMMPAWVPI-UHFFFAOYSA-N Erythritol Natural products OCC(O)C(O)CO UNXHWFMMPAWVPI-UHFFFAOYSA-N 0.000 description 1
- 230000005526 G1 to G0 transition Effects 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- 102000003886 Glycoproteins Human genes 0.000 description 1
- 101710103773 Histone H2B Proteins 0.000 description 1
- 102100021639 Histone H2B type 1-K Human genes 0.000 description 1
- 102000006947 Histones Human genes 0.000 description 1
- 101000619708 Homo sapiens Peroxiredoxin-6 Proteins 0.000 description 1
- 101001113483 Homo sapiens Poly [ADP-ribose] polymerase 1 Proteins 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 108090001030 Lipoproteins Proteins 0.000 description 1
- 102000004895 Lipoproteins Human genes 0.000 description 1
- XOGTZOOQQBDUSI-UHFFFAOYSA-M Mesna Chemical compound [Na+].[O-]S(=O)(=O)CCS XOGTZOOQQBDUSI-UHFFFAOYSA-M 0.000 description 1
- 241000488294 Microcystis aeruginosa NIES-843 Species 0.000 description 1
- 101001018036 Mus musculus Lymphocyte antigen 75 Proteins 0.000 description 1
- 101710135898 Myc proto-oncogene protein Proteins 0.000 description 1
- 102100038895 Myc proto-oncogene protein Human genes 0.000 description 1
- HCLVBNCSXUWSDY-UHFFFAOYSA-N NOC1=CC=CC2=C1N=C(S2)C#N Chemical class NOC1=CC=CC2=C1N=C(S2)C#N HCLVBNCSXUWSDY-UHFFFAOYSA-N 0.000 description 1
- 241000894763 Nostoc punctiforme PCC 73102 Species 0.000 description 1
- 241000192673 Nostoc sp. Species 0.000 description 1
- 238000009004 PCR Kit Methods 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 102100022239 Peroxiredoxin-6 Human genes 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 229920003171 Poly (ethylene oxide) Polymers 0.000 description 1
- 239000004952 Polyamide Substances 0.000 description 1
- 239000004698 Polyethylene Substances 0.000 description 1
- 108010039918 Polylysine Proteins 0.000 description 1
- 239000004721 Polyphenylene oxide Substances 0.000 description 1
- 239000004372 Polyvinyl alcohol Substances 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 241000530613 Pseudanabaena limnetica Species 0.000 description 1
- 229920001218 Pullulan Polymers 0.000 description 1
- 239000004373 Pullulan Substances 0.000 description 1
- 238000012181 QIAquick gel extraction kit Methods 0.000 description 1
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 1
- 229920005654 Sephadex Polymers 0.000 description 1
- 239000012507 Sephadex™ Substances 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 239000012505 Superdex™ Substances 0.000 description 1
- 241000192707 Synechococcus Species 0.000 description 1
- 241001453296 Synechococcus elongatus Species 0.000 description 1
- 241001504076 Thermosynechococcus elongatus BP-1 Species 0.000 description 1
- 241001453191 Thermosynechococcus vulcanus Species 0.000 description 1
- 101710150448 Transcriptional regulator Myc Proteins 0.000 description 1
- 241001170687 Trichodesmium erythraeum IMS101 Species 0.000 description 1
- 241000970911 Trichormus variabilis ATCC 29413 Species 0.000 description 1
- 229920004890 Triton X-100 Polymers 0.000 description 1
- 239000013504 Triton X-100 Substances 0.000 description 1
- 102000012275 Ubiquitin domains Human genes 0.000 description 1
- 108050002897 Ubiquitin domains Proteins 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 238000005917 acylation reaction Methods 0.000 description 1
- 229960005305 adenosine Drugs 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N adenyl group Chemical group N1=CN=C2N=CNC2=C1N GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 150000001336 alkenes Chemical class 0.000 description 1
- 125000000217 alkyl group Chemical group 0.000 description 1
- 150000001350 alkyl halides Chemical class 0.000 description 1
- 150000004716 alpha keto acids Chemical class 0.000 description 1
- 238000007098 aminolysis reaction Methods 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 239000000611 antibody drug conjugate Substances 0.000 description 1
- 229940049595 antibody-drug conjugate Drugs 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 125000000613 asparagine group Chemical group N[C@@H](CC(N)=O)C(=O)* 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 239000012152 bradford reagent Substances 0.000 description 1
- 229910052794 bromium Inorganic materials 0.000 description 1
- UDSAIICHUKSCKT-UHFFFAOYSA-N bromophenol blue Chemical compound C1=C(Br)C(O)=C(Br)C=C1C1(C=2C=C(Br)C(O)=C(Br)C=2)C2=CC=CC=C2S(=O)(=O)O1 UDSAIICHUKSCKT-UHFFFAOYSA-N 0.000 description 1
- 235000019846 buffering salt Nutrition 0.000 description 1
- 125000002915 carbonyl group Chemical group [*:2]C([*:1])=O 0.000 description 1
- 239000003054 catalyst Substances 0.000 description 1
- 230000006037 cell lysis Effects 0.000 description 1
- 235000010980 cellulose Nutrition 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 238000007385 chemical modification Methods 0.000 description 1
- 239000003638 chemical reducing agent Substances 0.000 description 1
- 229910052801 chlorine Inorganic materials 0.000 description 1
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol group Chemical group [C@@H]1(CC[C@H]2[C@@H]3CC=C4C[C@@H](O)CC[C@]4(C)[C@H]3CC[C@]12C)[C@H](C)CCCC(C)C HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 1
- 238000011097 chromatography purification Methods 0.000 description 1
- 239000012141 concentrate Substances 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 239000000356 contaminant Substances 0.000 description 1
- 238000001816 cooling Methods 0.000 description 1
- NKLPQNGYXWVELD-UHFFFAOYSA-M coomassie brilliant blue Chemical compound [Na+].C1=CC(OCC)=CC=C1NC1=CC=C(C(=C2C=CC(C=C2)=[N+](CC)CC=2C=C(C=CC=2)S([O-])(=O)=O)C=2C=CC(=CC=2)N(CC)CC=2C=C(C=CC=2)S([O-])(=O)=O)C=C1 NKLPQNGYXWVELD-UHFFFAOYSA-M 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 230000018044 dehydration Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 239000003599 detergent Substances 0.000 description 1
- 238000000502 dialysis Methods 0.000 description 1
- VHJLVAABSRFDPM-QWWZWVQMSA-N dithiothreitol Chemical compound SC[C@@H](O)[C@H](O)CS VHJLVAABSRFDPM-QWWZWVQMSA-N 0.000 description 1
- 229940116441 divinylbenzene Drugs 0.000 description 1
- 238000009510 drug design Methods 0.000 description 1
- 238000000132 electrospray ionisation Methods 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 230000006718 epigenetic regulation Effects 0.000 description 1
- 238000011067 equilibration Methods 0.000 description 1
- 150000002170 ethers Chemical class 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 239000012467 final product Substances 0.000 description 1
- 238000000684 flow cytometry Methods 0.000 description 1
- 238000005194 fractionation Methods 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-L glutamate group Chemical group N[C@@H](CCC(=O)[O-])C(=O)[O-] WHUUTDBJXJRKMK-VKHMYHEASA-L 0.000 description 1
- 150000004676 glycans Chemical class 0.000 description 1
- 125000003827 glycol group Chemical group 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 101150098203 grb2 gene Proteins 0.000 description 1
- 229960000789 guanidine hydrochloride Drugs 0.000 description 1
- PJJJBBJSCAKJQF-UHFFFAOYSA-N guanidinium chloride Chemical compound [Cl-].NC(N)=[NH2+] PJJJBBJSCAKJQF-UHFFFAOYSA-N 0.000 description 1
- 150000008282 halocarbons Chemical class 0.000 description 1
- 229910052736 halogen Inorganic materials 0.000 description 1
- 150000002367 halogens Chemical class 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 231100000086 high toxicity Toxicity 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 229920001519 homopolymer Polymers 0.000 description 1
- 102000049595 human PARP1 Human genes 0.000 description 1
- 229940042795 hydrazides for tuberculosis treatment Drugs 0.000 description 1
- 150000002429 hydrazines Chemical class 0.000 description 1
- WGCNASOHLSPBMP-UHFFFAOYSA-N hydroxyacetaldehyde Natural products OCC=O WGCNASOHLSPBMP-UHFFFAOYSA-N 0.000 description 1
- JGPMMRGNQUBGND-UHFFFAOYSA-N idebenone Chemical compound COC1=C(OC)C(=O)C(CCCCCCCCCCO)=C(C)C1=O JGPMMRGNQUBGND-UHFFFAOYSA-N 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 239000010954 inorganic particle Substances 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- PGLTVOMIXTUURA-UHFFFAOYSA-N iodoacetamide Chemical compound NC(=O)CI PGLTVOMIXTUURA-UHFFFAOYSA-N 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 229910052742 iron Inorganic materials 0.000 description 1
- 230000002427 irreversible effect Effects 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- OOYGSFOGFJDDHP-KMCOLRRFSA-N kanamycin A sulfate Chemical compound OS(O)(=O)=O.O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N OOYGSFOGFJDDHP-KMCOLRRFSA-N 0.000 description 1
- 229960002064 kanamycin sulfate Drugs 0.000 description 1
- 150000002576 ketones Chemical class 0.000 description 1
- 238000003367 kinetic assay Methods 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002923 metal particle Substances 0.000 description 1
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 229940126619 mouse monoclonal antibody Drugs 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 239000002777 nucleoside Substances 0.000 description 1
- 125000003835 nucleoside group Chemical group 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 230000020477 pH reduction Effects 0.000 description 1
- 238000012856 packing Methods 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 238000010647 peptide synthesis reaction Methods 0.000 description 1
- 125000001151 peptidyl group Chemical group 0.000 description 1
- 230000003094 perturbing effect Effects 0.000 description 1
- 125000001997 phenyl group Chemical group [H]C1=C([H])C([H])=C(*)C([H])=C1[H] 0.000 description 1
- 239000008363 phosphate buffer Substances 0.000 description 1
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 229920000191 poly(N-vinyl pyrrolidone) Polymers 0.000 description 1
- 229920002454 poly(glycidyl methacrylate) polymer Polymers 0.000 description 1
- 229920001583 poly(oxyethylated polyols) Polymers 0.000 description 1
- 229920002647 polyamide Polymers 0.000 description 1
- 229920006122 polyamide resin Polymers 0.000 description 1
- 229920000768 polyamine Polymers 0.000 description 1
- 229920000570 polyether Polymers 0.000 description 1
- 229920000656 polylysine Polymers 0.000 description 1
- 229920000193 polymethacrylate Polymers 0.000 description 1
- 229920001282 polysaccharide Polymers 0.000 description 1
- 239000005017 polysaccharide Substances 0.000 description 1
- 229920005990 polystyrene resin Polymers 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 229940121649 protein inhibitor Drugs 0.000 description 1
- 239000012268 protein inhibitor Substances 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 238000001243 protein synthesis Methods 0.000 description 1
- 230000012743 protein tagging Effects 0.000 description 1
- 239000001990 protein-drug conjugate Substances 0.000 description 1
- 230000017854 proteolysis Effects 0.000 description 1
- 235000019423 pullulan Nutrition 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 238000010791 quenching Methods 0.000 description 1
- 230000000171 quenching effect Effects 0.000 description 1
- 230000035484 reaction time Effects 0.000 description 1
- 230000009257 reactivity Effects 0.000 description 1
- 238000000611 regression analysis Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 239000012146 running buffer Substances 0.000 description 1
- 239000012266 salt solution Substances 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 239000000377 silicon dioxide Substances 0.000 description 1
- 238000004088 simulation Methods 0.000 description 1
- 238000001542 size-exclusion chromatography Methods 0.000 description 1
- 239000007790 solid phase Substances 0.000 description 1
- 238000010532 solid phase synthesis reaction Methods 0.000 description 1
- 230000000087 stabilizing effect Effects 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 150000003431 steroids Chemical class 0.000 description 1
- 150000003440 styrenes Chemical class 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 229960002317 succinimide Drugs 0.000 description 1
- 229920001059 synthetic polymer Polymers 0.000 description 1
- WROMPOXWARCANT-UHFFFAOYSA-N tfa trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F.OC(=O)C(F)(F)F WROMPOXWARCANT-UHFFFAOYSA-N 0.000 description 1
- 238000010257 thawing Methods 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- 238000004448 titration Methods 0.000 description 1
- 230000014616 translation Effects 0.000 description 1
- 150000003852 triazoles Chemical class 0.000 description 1
- 230000001960 triggered effect Effects 0.000 description 1
- 230000007306 turnover Effects 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
- C07K16/18—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/62—DNA sequences coding for fusion proteins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/93—Ligases (6)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P21/00—Preparation of peptides or proteins
- C12P21/02—Preparation of peptides or proteins having a known sequence of two or more amino acids, e.g. glutathione
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P21/00—Preparation of peptides or proteins
- C12P21/06—Preparation of peptides or proteins produced by the hydrolysis of a peptide bond, e.g. hydrolysate products
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/50—Fusion polypeptide containing protease site
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/90—Fusion polypeptide containing a motif for post-translational modification
- C07K2319/92—Fusion polypeptide containing a motif for post-translational modification containing an intein ("protein splicing")domain
Definitions
- Protein splicing is a post-translational process catalyzed by a family of proteins known as inteins.(1) During this process, an intein domain catalyzes its own excision from a larger precursor protein and simultaneously ligates the two flanking polypeptide sequences (exteins) together. While most inteins catalyze splicing in cis, a small subset of these proteins exist as naturally fragmented domains that are separately expressed but rapidly associate and catalyze splicing in trans. Given their capacity to make and break polypeptide bonds (inteins can be considered protein ligases), both cis and trans-splicing inteins have found widespread use as chemical biological tools.(2)
- inteins Despite the growing use of inteins in chemical biology, their practical utility has been constrained by two common characteristics of the family, namely (i) slow kinetics and (ii) context dependent efficiency with respect to the immediate flanking extein sequences.(3,4) Recently, a split intein from the cyanobacterium Nostoc punctiforme (Npu) was shown to catalyze protein trans-splicing on the order of a minute, rather than hours like most cis- or trans-splicing inteins.(5) Furthermore, this intein was slightly more tolerant of sequence variation at the critical +2 C-extein residue than other inteins.(6)
- split intein N- and C-fragments Disclosed are split intein N- and C-fragments, variants thereof, and methods of using these split inteins in polypeptide purification and modification.
- fusion proteins of a polypeptide and a split intein N-fragment, or variant thereof as described below in greater detail.
- complexes of the fusion protein and a split intein C-fragment or variant thereof as described in detail below can be via a covalent interaction between the fusion protein and C-fragment or variant or via a non-covalent interaction (e.g., ionic, H-bonding, and/or van der Waals interaction).
- split intein C-fragments or variants thereof further comprises a linker, such as a peptide linker, or other linkers as described below in detail.
- a linker such as a peptide linker, or other linkers as described below in detail.
- a specific peptide linker contemplated is -SGGC (SEQ ID NO: 705) attached to any of the split intein C-fragments described below.
- the linker can be tailored so as to allow for attachment of a split intein C-fragment of interest to a support, e.g., a bead, a resin, a slide, a particle.
- the split intein C-fragment, or variant thereof is bound to a support.
- the support is a bead, a resin, a particle or a slide.
- selection of the N-fragment and C-fragment can be from the same wild type split intein (e.g., both from Npu, or a variant of either the N- or C-fragment as discussed in great detail below), or alternatively can be selected from different wild type split inteins or the consensus split intein sequences discussed below, as it has been discovered that the affinity of a N-fragment for a different C-fragment (e.g., Npu N-fragment or variant thereof with Ssp C-fragment or variant thereof) still maintains sufficient binding affinity for use in the disclosed methods.
- the fusion protein can be in a whole cell lysate or secreted from a cell (e.g., a mammalian cell) and in a cell supernatant.
- the polypeptide of the fusion protein is an antibody, e.g., an IgG antibody.
- the N-fragment is fused to one or both of the heavy chains of the antibody.
- the N-fragment is fused to one or both of the light chains of the antibody.
- the methods disclosed herein can further comprise washing the intein intermediate (prior to contact with the nucleophile) to remove the cell lysate or cell supernatant, for example.
- the methods disclosed herein can further comprise isolating the resulting conjugate of the polypeptide and nucleophile.
- the methods disclosed herein can be useful as an efficient purification for polypeptides prepared by recombinant protein methods.
- the nucleophile can be a thiol to form a conjugate that is an ⁇ -thioester of the polypeptide.
- the resulting ⁇ -thioester can be further modified by contacting with a second nucleophile, employing the well known ⁇ -thioester chemistry for protein modification.
- the methods disclosed herein can provide conjugates of the polypeptide, which in some cases is an antibody (e.g., an IgG antibody), and a nucleophile (e.g., a drug, a polymer, an oligonucleotide).
- FIG. 1 shows trans-splicing of split DnaE inteins.
- FIG. 2 shows in vitro half-lives of trans-splicing reactions.
- Indicated split intein pairs fused to model exteins Ub or SUMO Ub-Int N and Int C -SUMO
- Ub-Int N and Int C -SUMO were mixed at either 30° C. or 37° C., and the formation of products was monitored over time by gel electrophoresis.
- Representative coomassie-stained SDS-PAGE gels showing (b) fast Ava splicing at 37° C. and (c) inefficient Ssp splicing at 37° C.
- FIG. 3 shows sequence-activity relationships in split DnaE inteins.
- (c) In vivo analysis of the C120G mutation in the Aha intein ( ⁇ SD, n 3).
- (e) In vivo analysis of Ssp-to-Npu point mutations that improve Ssp activity ( ⁇ SD, n 4). Note that all residue numberings correspond to the relevant positions on Npu as defined by the NMR structure (PDB: 2KEQ).(21)
- FIG. 4 shows engineered versions of ultrafast DnaE inteins support efficient expressed protein ligation.
- FIG. 5 shows sequence alignments of split DnaE inteins. Numbering follows that of Npu as assigned for the NMR structure (PDB 2KEQ). Critical catalytic residues are marked with an asterisk. Sequence identifiers for the N-intein sequences depicted in the figure are as follows: Npu (SEQ ID NO:1), Ssp (SEQ ID NO:2), Aha (SEQ ID NO:3), Aov (SEQ ID NO:4), Asp (SEQ ID NO: 5), Ava (SEQ ID NO:6), Cra (CS505) (SEQ ID NO:7), Csp (CCY0110) (SEQ ID NO:8), Csp (PCC8801) (SEQ ID NO:9), Cwa (SEQ ID NO;10), Maer (NIES843) (SEQ ID NO: 11), Mcht (PCC7420) (SEQ ID NO:12), Oli (SEQ ID NO:13), Sel (PC7942) (SEQ ID NO:14), Ssp
- Sequence identifiers for the C-intein sequences depicted in the figure are as follows: Npu (SEQ ID NO:39), Ssp (SEQ ID NO:40), Aha (SEQ ID NO: 41), Aov (SEQ ID NO:42), Asp (SEQ ID NO: 43), Ava (SEQ ID NO:44), Cra (CS505) (SEQ ID NO:45), Csp (CCY0110) (SEQ ID NO:46), Csp (PCC8801) (SEQ ID NO:47), Cwa (SEQ ID NO:48), Maer (NIES843) (SEQ ID NO:49), Mcht (PCC7420) (SEQ ID NO:50), Oli (SEQ ID NO:51), Sel (PC7942) (SEQ ID NO:52), Ssp (PCC7002) (SEQ ID NO:53), Tel (SEQ ID NO:54), Ter (SEQ ID NO:55) and Tvu (SEQ ID NO:56).
- FIG. 6 shows sequence logos for high- and low-activity inteins.
- Inteins are ranked based on in vivo activity with a “CFN” C-extein sequence.
- the high and low activity inteins are distinguished based on a cut-off IC 50 value of 350 ⁇ g/mL of kanamycin, and the Aha intein is included in the high-activity set, given that the C 120 G mutation dramatically restores high activity.
- FIG. 7 shows purification of C-terminal ⁇ -thioesters using split-inteins.
- FIG. 8 shows Purification of soluble protein ⁇ -thioesters using the Npu C -AA affinity column.
- FIG. 9 shows RP-HPLC and MS analysis of Ub and MBP ⁇ -thioesters.
- FIG. 10 shows the effect of the ⁇ 1 residue on the efficiency of the on-resin thiolysis.
- 20 different Ub-Npu N proteins were expressed containing each of the 20 proteinogenic amino acids at the C-terminus of Ub ( ⁇ 1 residue) and purified over Npu C -AA columns.
- Cleavage yields from the Npu C -AA column were estimated by gel elctrophoresis and amounts of thioester versus side reactions (mainly hydrolysis) were determined by RP-HPLC and MS analysis.
- FIG. 11 shows purification of H2B(1-116)- ⁇ -thioester under denaturing conditions.
- FIG. 12 shows purification of ⁇ DEC thioesters expressed in 293T cells using the split-intein column.
- EPL Expressed Protein Ligation
- FIG. 13 shows EPL directly using Int C -column eluted thioesters.
- RP-HPLC (30-73% B gradient, 214 nm and 440 nm detection) and MS analysis of the reactions between the H-CGK(Fl)-NH 2 peptide and MBP (A) and PHPT1 (B) MES thioesters, purified from E. coli using the Int C -column.
- FIG. 14 shows a one-pot purification/ligation experiment of ubiquitin to the H-CGK(Fluorescein)-NH 2 peptide (CGK(Fl)).
- Ub-Npu N from E. coli cell lysates was bound to the Int C -column, and after removal of contaminants through extensive washes, intein cleavage and ligation were triggered by addition of 200 mM MES and 1 mM CGK(Fl) peptide.
- FIG. 15 shows the semi-synthesis of H2B-K120Ac under denaturing conditions.
- A) Coomassie stained SDS-PAGE analysis of H2B(1-116) ⁇ -thioester generation in the presence of 2 M urea (sup: cell lysate supernatant, trit: 1% triton wash of the inclusion bodies, inp: solubilized inclusion bodies used as input for the Int C -column).
- E1-E6 were pooled, concentrated to 150 ⁇ M and ligated to the peptide H-CVTK(Ac)YTSAK-OH at 1 mM for 3 h at r.t.
- FIG. 16 shows the characterization of ⁇ DEC205 ligated to the H-CGK(Fluorescein)-NH2 peptide (CGK(Fl)).
- Elution fractions from the Npu C -column containing ⁇ DEC205-MES thioester were concentrated to 20 ⁇ M and ligated to the CGK(Fl) fluorescent peptide at 1 mM for 48 h at r.t.
- Expected mass for ligation product 50221.2 Da.
- Free HC 49575.0 Da.
- FIG. 17 shows purification of ⁇ DEC thioesters expressed in CHO cells using a split-intein column. Top) Coomassie stained SDSPAGE gel of the purification of ⁇ DEC-MES thioester from CHO cells using a Npu C -column. Bottom) Western blot analysis of the same purification.
- FIG. 18 shows purification of ⁇ DEC thioesters using an Ava C split-intein column and Western blot analysis of the purification of ⁇ DEC thioesters from mammalian cell supernatants using an Ava C -column.
- FIG. 19 shows expression tests of ⁇ DEC205 antibody fused to Ava N split intein through the C-terminus of the antibody light chain and western blot analysis of CHO cell supernatants expressing the ⁇ DEC205-AvaN fusion at different timepoints.
- inteins Of the roughly 600 inteins currently catalogued, (7) less than 5% are split inteins, mostly from a family known as the cyanobacterial split DnaE inteins (8). Surprisingly, only six of these, including Npu, have been experimentally analyzed to any extent, (6,9,10) and only Npu and its widely-studied, low-efficiency ortholog from Synechocystis species PCC6803 (Ssp) have been rigorously characterized in vitro.(5,11)
- This assay can be carried out in the background of varying local C-extein sequences without significantly perturbing the dynamic range. Since all DnaE inteins splice the same local extein sequences in their endogenous context, this screen was originally carried out in a wild-type C-extein background (CFN) within the KanR enzyme. As expected, bacteria expressing the Npu intein had a high relative IC 50 , whereas clones expressing Ssp showed poor resistance to kanamycin. Remarkably, more than half of the DnaE inteins showed splicing efficiency comparable to Npu in vivo at 30° C. ( FIG. 1 b ).
- Npu, Cra(CS505), and Cwa inteins were analyzed in vitro in the presence of a +2 glycine. All three of these reactions were characterized by rapid accumulation of thioester intermediates, which slowly resolved over tens of minutes into the spliced product and the N-extein cleavage product.
- FIGS. 3 a , 6 Further analysis of the split intein sequence alignment indicated that several positions have strong amino acid conservation amongst the high-activity inteins but diverge for the low-activity inteins. These may be sites where the fast inteins have retained beneficial interactions that have been lost in slow ones. To test this idea, several positions were chosen where this sequence-activity correlation was apparent and replaced the residue in Ssp with the corresponding amino acid found in the fast inteins. Consistent with this hypothesis, several point mutations increased the activity of Ssp in vivo ( FIG. 3 e ). While the specific roles of these residues are not explicitly clear, especially given that they lie outside of the active site ( FIG.
- intein folds their locations on the intein fold (14) may provide some insights into their function.
- an aromatic residue is preferred in the high-activity inteins. This position is adjacent to the conserved catalytic TXXH motif (positions 69-72), and an aromatic residue may facilitate packing interactions to stabilize those residues.
- a glutamate is preferred at position 122, proximal to catalytic histidine 125. The glutamate at position 89 is involved in an intimate ion cluster that was previously shown to be important for stabilizing the split intein complex.(13)
- E23 is distant from the catalytic site and has no obvious structural role. This position is conceivably important for fold stability or dynamics as has previously been observed for activating point mutations in other inteins.(15,16)
- the fragments of the different fast-splicing split inteins can be mixed as non-cognate pairs and still retain highly efficient splicing activity, further expanding the options available for any trans-splicing application.
- the N-fragment split intein of Npu or variant thereof can bind to the C-fragment of Npu or variant thereof or any of the other split intein C-fragments or variants discussed below.
- the N-fragment of Ssp or variant thereof can bind to the C-fragment of Ssp or variant thereof or any of the other split intein C-fragments or variants thereof discussed below;
- the N-fragment of Aha or variant thereof can bind to the C-fragment of Aha or variant thereof or any of the other split intein C-fragments or variants thereof discussed below;
- the N-fragment of Aov or variant thereof can bind to the C-fragment of Aov or variant thereof or any of the other split intein C-fragments or variants thereof discussed below;
- the N-fragment of Asp or variant thereof can bind to the C-fragment of Asp or variant thereof or any of the other split intein C-fragments or variants thereof discussed below;
- the N-fragment of Ava or variant thereof can bind to the C-fragment of Ava or variant thereof or any of the other split intein C-fragments or variants thereof discussed below;
- any split intein can be artificially fused and then utilized as a cis-splicing intein in this application (1 in FIG. 4 a ).
- Ultrafast split inteins are especially attractive in this regard due to their speed and efficiency.
- artificially fused variants of Npu, Ava, and Mcht with an N-terminal ubiquitin domain were generated.
- premature C-terminal cleavage or undesired high levels of competing hydrolysis residues Asn137 and Cys+1 were mutated to Ala.
- This thioester intermediate is generally thought to be transiently populated in protein splicing, and to our knowledge, it has never been directly observed.(1) Surprisingly, when analyzing the ubiquitin-DnaE intein fusions by reverse phase HPLC, two major peaks and a third minor peak were often observed, all bearing the same mass. The relative abundance of these species could be modulated by unfolding the proteins or by changes in pH, and the two major species were almost equally populated from pH 4-6 ( FIG. 4 d ). The major peaks most likely correspond to the precursor amide, 1, and the linear thioester, 2, and the minor peak as the tetrahedral oxythiazolidine intermediate.
- polypeptide refers to any amino acid based polymer, interchangeable referred to as a “protein” throughout, and can include glycoproteins and lipoproteins.
- the polypeptide is a polypeptide excreted from a cell (e.g., a mammalian cell).
- the polypeptide is an antibody or a fragment thereof.
- the polypeptide can be any naturally occurring or synthetic polypeptide of interest, including polypeptides having one or more amino acid residues other than the 20 naturally occurring amino acids.
- the polypeptide has a molecular weight of 45 kDa or greater, 50 kDa or greater, 60 kDa or greater, 75 kDa or greater, 100 kDa or greater, 120 kDa or greater, or 150 kDa or greater.
- the polypeptide can be, e.g., an antibody or a fragment thereof.
- the split intein N-fragment can be fused to one or both of the heavy chains, and/or to one or both of the light chains.
- the polypeptide is a protein secreted from a cell, e.g., a mammalian cell.
- the split intein N-fragment comprises a sequence as shown in FIG. 5 , e.g., Npu (SEQ ID NO: 1), Ssp (SEQ ID NO: 2), Aha (SEQ ID NO: 3), Aov (SEQ ID NO: 4), Asp (SEQ ID NO: 5), Ava (SEQ ID NO: 6), Cra(CS505) (SEQ ID NO: 7), Csp(CCY0110) (SEQ ID NO: 8), Csp(PCC8801) (SEQ ID NO: 9), Cwa (SEQ ID NO: 10), Maer(NIES843) (SEQ ID NO: 11), Mcht(PCC7420) (SEQ ID NO: 12), Oli (SEQ ID NO: 13), Sel(PC7942) (SEQ ID NO: 14), Ssp(PCC7002) (SEQ ID NO: 15), Tel (SEQ ID NO: 16), Ter (SEQ ID NO: 17), Tvu (SEQ ID NO: 18), or a variant thereof.
- the spilt intein N-fragment sequence comprises a sequence other than Npu (SEQ ID NO: 1) or Ssp (SEQ ID NO: 2), and in other cases, comprises a sequence other than Npu (SEQ ID NO: 1), Ssp (SEQ ID NO: 2), or Aha (SEQ ID NO: 3).
- the split intein N-fragment sequence comprises a sequence of Ava (SEQ ID NO: 6), Cra (SEQ ID NO: 7), Csp(PCC8801) (SEQ ID NO: 9), Cwa (SEQ ID NO: 10), Mcht(PCC7420) (SEQ ID NO: 12), Oli (SEQ ID NO: 13), Ter (SEQ ID NO: 17) and Tvu (SEQ ID NO: 18).
- the split intein N-fragment has a sequence comprising a consensus sequence of SEQ ID NO: 19:
- a variant of a split intein N-fragment is a mutated split intein N-fragment as disclosed herein that maintains the activity of the split intein N-fragment (e.g., its ability to bind to a split intein C-fragment and/or catalyze nucleophilic attack of the polypeptide fused to it).
- Contemplated variants of a split intein N-fragments disclosed herein include mutation of one or more C residues, except for Cys1, to an aliphatic residue, such as an A, I, L, or F, or to a S residue.
- One such variant contemplated is a mutant Npu with Cys28 and Cys59 mutated to Ser, SEQ ID NO: 20
- the split intein C-fragments disclosed herein are mutated from the naturally occurring sequences to mutate the N137 and C+1 residues to a residue other than Asn or Gln for N137 and a residue other than Cys for C+1 (SEQ ID NOs: 129-146).
- the mutations at these two positions are to a hydrophobic residue, e.g., not containing a free SH thiol (Cys), a carboxylic acid (Asp, Glu), or a base (Arg, His, Lys) or other undesired group (e.g., Asn, Gln) on the side chain.
- the two mutated aliphatic residues can be the same or different and can be A, V, I, S, M, H, L, F, Y, G, or W or can be a unnatural (e.g., not encoded by genetic code) aliphatic amino acid residue such as norleucine, 2-aminobutyric acid, nor-valine, 2-aminopentoic acid, or 2-aminohexaanoic acid (SEQ ID NOs: 219-236).
- mutations where both residues are selected from A, I, V, L, Y, G and F (SEQ ID NOs: 309-326).
- at least one of the two mutated residues is A.
- a mutated split intein C-fragment comprising a mutation at N137 and Cys+1 of Npu (SEQ ID NOs: 129, 147, 165, 183, 201, 219, 237, 255, 273, 291, 309, 327, 345, 363, 381 and 399), Ssp (SEQ ID NOs: 130, 148, 166, 184, 202, 220, 238, 256, 274, 292, 310, 328, 346, 364, 382 and 400), Aha (SEQ ID NOs: 131, 149, 167, 185, 203, 221, 239, 257, 275, 293, 311, 329, 347, 365, 383 and 401), Aov (SEQ ID NOs: 132, 150, 168, 186, 204, 222, 240, 258, 276, 294, 312, 330, 348, 366, 384 and 402), Asp (SEQ ID NOs: 129, 147,
- At least one of the mutations is A (SEQ ID NOs: 183-200, 255-272, 327-344, and 345-416) and in more specific cases, both mutations are A (SEQ IS NOs: 399-416).
- the solid support is a polymer or substance that allows for linkage of the split intein C-fragment, optionally via a linker.
- the linker can be further amino acid residues engineered to the C-terminus of the split intein C-fragment or can be other known linkers for attachment of a peptide to a support.
- One contemplated linker is a small peptide -SGGC (SEQ ID NO: 705), where the thiol of the C-terminal Cys can be used to attach the split intein C-fragment to the support.
- SEQ ID NO: 705 small peptide -SGGC
- sequences noted above having a -SGGC peptide linker (SEQ ID NO: 705) (e.g., specifying the residues starting at the N137 position: AAFN-SGGC) (SEQ ID NO:706).
- the length of a peptide linker can be modified to provide varying lengths and flexibility in any individual situation (e.g., more than 2 Gly residues). It will also be apparent that the C-terminus residue of a peptide linker can be modified to introduce an appropriately reactive functional group to attach the split intein C-fragment to a surface of choice (e.g., Lys to react via an amine, Cys to react via a thiol, or Asp or Glu to react via a carboxylic acid).
- peptide linker to provide other functional group moieties to allow for different attachment chemistry of the C-fragment to a support of interest (e.g., azide, alkynes, carbonyls, amino-oxy, cyano-benzothiazoles, tetrazoles, alkenes, alkyl-halides).
- the linker can alternatively be a polymeric linker.
- a consensus sequence for the split intein C-fragment is derived: SEQ ID NO: 707 (VKIISRQSLGKQNVYDIGVEKDHNFLLANGLIASN), as well as a mutated version where the N137 is mutated to other than Asn or Gln (SEQ ID NO:708), or more specifically, N137 is mutant to a naturally occurring or unnaturally occurring hydrophobic residue, such as A, V, I, M, H, L, F, Y, G, S, H, or W or can be a unnatural (e.g., not encoded by genetic code) aliphatic amino acid residue such as norleucine, 2-aminobutyric acid, nor-valine, 2-aminopentanoic acid, or 2-aminohexanoic acid (SEQ ID NO: 709).
- a naturally occurring or unnaturally occurring hydrophobic residue such as A, V, I, M, H, L, F, Y, G, S, H, or W
- +1 position can be a naturally occurring or unnaturally occurring hydrophobic residue such as A, V, I, M, H, L, F, Y, G, S, H, or W or can be a unnatural (e.g., not encoded by genetic code) aliphatic amino acid residue such as norleucine, 2-aminobutyric acid, nor-valine, 2-aminopentanoic acid, or 2-aminohexanoic acid (SEQ ID NOs: 713, 717, 721, and 725).
- mutation at +1 position is selected from A, I, V, L, Y, G, and F (SEQ ID NOs: 714, 718, 722, and 726).
- at least one of the mutated residues of the consensus sequence is A (SEQ ID NOs: 715, 719, and 723).
- the consensus C-fragment sequence has both mutations as Ala (SEQ ID NO:727).
- a consensus sequence comprising FN at the +2 and +3 positions (SEQ ID NO:728-743.
- a consensus sequence comprising a peptide linker for attachment to a solid support, and one embodiment is -SGGC at positions +4-+7 (SEQ ID NO: 744-759).
- the split intein C-fragment or variant thereof as disclosed herein can be attached to a solid support via a linker.
- the linker is a polymer, including but not limited to a water-soluble polymer, a nucleic acid, a polypeptide, an oligosaccharide, a carbohydrate, a lipid, or combinations thereof. It is not critical what the linker's chemical structure is, since it serves primarily as a linker. The linker should be chosen so as not to interfere with the activity of the C-fragment.
- the linker can be made up of amino acids linked together by peptide bonds.
- the linker comprises Y n , wherein Y is a naturally occurring amino acid or a steroisomer thereof and “n” is any one of 1 through 20.
- the linker is therefore can be made up of from 1 to 20 amino acids linked by peptide bonds, wherein the amino acids are selected from the 20 naturally-occurring amino acids.
- the 1 to 20 amino acids are selected from Gly, Ala, Ser, Cys.
- the linker is made up of a majority of amino acids that are sterically un-hindered, such as Gly.
- Non-peptide linkers are also possible.
- These alkyl linkers may further be substituted by any non-sterically hindering group such as lower alkyl (e.g., C 1 -C 6 ), halogen (e.g., Cl, Br), CN, NH 2 , phenyl, etc.
- Non-peptide linker is a polyethylene glycol group, such as: —HN—(CH 2 ) 2 —(O—CH 2 —CH 2 ) n —O—CH 2 —CO, wherein n is such that the overall molecular weight of the linker ranges from approximately 101 to 5000, preferably 101 to 500.
- the linker has a length of about 0-14 sub-units (e.g., amino acids).
- the length of the linker in various embodiments is at least about 10 nucleotides, 10-30 nucleotides, or even greater than 30 nucleotides.
- the bases of the polynucleotide linker are all adenines, all thymines, all cytidines, all guanines, all uracils, or all some other modified base.
- a non-nucleotide linker of the invention comprises a basic nucleotide, polyether, polyamine, polyamide, peptide, carbohydrate, lipid, polyhydrocarbon, or other polymeric compounds.
- Specific examples include those described by Seela and Kaiser, Nucleic Acids Res. 1990, 18:6353 and Nucleic Acids Res. 1987, 15:3113; Cload and Schepartz, J. Am. Chem. Soc. 1991, 113:6324; Richardson and Schepartz, J. Am. Chem. Soc. 1991, 113:5109; Ma et al., Nucleic Acids Res.
- non-nucleotide further means any group or compound that can be incorporated into a nucleic acid chain in the place of one or more nucleotide units, including either sugar and/or phosphate substitutions, and allows the remaining bases to exhibit their enzymatic activity.
- the group or compound can be abasic in that it does not contain a commonly recognized nucleotide base, such as adenosine, guanine, cytosine, uracil or thymine, for example at the Cl position of the sugar.
- linkers contemplated include linear polymers (e.g., polyethylene glycol, polylysine, dextran, etc.), branched chain polymers (see, for example, U.S. Pat. No. 4,289,872 to Denkenwalter et al., issued Sep. 15, 1981; U.S. Pat. No. 5,229,490 to Tam, issued Jul. 20, 1993; WO 93/21259 by Frechet et al., published 28 Oct. 1993); lipids; cholesterol groups (such as a steroid); or carbohydrates or oligosaccharides.
- Other linkers include one or more water soluble polymer attachments such as polyoxyethylene glycol, or polypropylene glycol as described U.S. Pat. Nos.
- polystyrene resin 4,640,835, 4,496,689, 4,301,144, 4,670,417, 4,791,192 and 4,179,337.
- Other useful polymers as linkers known in the art include monomethoxy-polyethylene glycol, dextran, cellulose, or other carbohydrate based polymers, poly-(N-vinyl pyrrolidone) polyethylene glycol, propylene glycol homopolymers, a polypropylene oxide/ethylene oxide co-polymer, polyoxyethylated polyols (e.g., glycerol) and polyvinyl alcohol, as well as mixtures of these polymers.
- polystylated polyols e.g., glycerol
- oligonucleotide such as poly-A or hydrophilic or amphiphilic polymers are contemplated as linkers, including, for example, amphiphiles (including oligonucletoides).
- Contemplated solid supports include resins, particles, and beads. More specific solid supports include polyhydroxy polymers, e.g. based on polysaccharides, such as agarose, dextran, cellulose, starch, pullulan, or the like, and synthetic polymers, such as polyacrylic amide, polymethacrylic amide, poly(hydroxyalkylvinyl ethers), poly(hydroxyalkylacrylates) and polymethacrylates (e.g. polyglycidylmethacrylate), polyvinyl alcohols and polymers based on styrenes and divinylbenzenes, and copolymers in which two or more of the monomers corresponding to the above-mentioned polymers are included.
- polysaccharides such as agarose, dextran, cellulose, starch, pullulan, or the like
- synthetic polymers such as polyacrylic amide, polymethacrylic amide, poly(hydroxyalkylvinyl ethers), poly(hydroxyalkylacryl
- solid supports contemplated include agarose, sepharose, cellulose, polystyrene, polyethylene glycol, derivatized agarose, acrylamide, sephadex, sepharose, polyethyleneglycol (PEG)-acrylamide, and polystyrene-PEG based supports.
- the solid support can be a resin such as p-methylbenzhydrylamine (pMBHA) resin (Peptides International, Louisville, Ky.), polystyrenes (e.g., PAM-resin obtained from Bachem Inc., Peninsula Laboratories, etc.), including chloromethylpolystyrene, hydroxymethylpolystyrene and aminomethylpolystyrene, poly (dimethylacrylamide)-grafted styrene co-divinyl-benzene (e.g., POLYMPE resin, obtained from Aminotech, Canada), polyamide resin (obtained from Peninsula Laboratories), polystyrene resin grafted with polyethylene glycol (e.g., TENTAGEL or ARGOGEL, Bayer, Tubingen, Germany) polydimethylacrylamide resin (obtained from Milligen/Biosearch, California), or Sepharose (Pharmacia, Sweden).
- pMBHA p-methylbenzhydrylamine
- EPL Expressed Protein Ligation
- EPL was first described in 1998 (21), as an expansion to recombinant proteins of Native Chemical Ligation (NCL) (19,22), and it consists on the reaction between a C-terminal recombinant protein ⁇ -thioester with a synthetic peptide containing a Cys at its N-terminus through the formation of a new native peptide bond between the two fragments.
- NCL Native Chemical Ligation
- Inteins such as GyrA or VMA, have been successfully harnessed to prepare a wide variety of protein thioesters.
- inteins are usually fused to affinity tags such as the chitin binding domain or the hexa-His tag.
- affinity tags such as the chitin binding domain or the hexa-His tag.
- the reaction conditions required for efficient thiolysis affect the performance of such tags and subsequent additional purification steps are often required to obtain the desired pure product for ligation (24-26).
- significant levels of in vivo premature cleavage can occur, significantly reducing the final product yield.
- the Npu split-intein was chosen, which is one of the fastest DnaE split-inteins previously known [10]. Initially the ability of split Npu to generate protein thioesters in solution was tested by mixing the model protein ubiquitin fused to NpuN with a mutant NpuC (Asn137 and Cys+1 to Ala) in the presence (and absence) of the thiol MESNa. SDSPAGE, HPLC and MS analysis of the reactions showed the formation of the desired ubiqutin C-terminal ⁇ -thioester in a few hours.
- NpuC-AA SEQ. ID NO: 777
- NpuC-AA affinity resin protein C-terminal ⁇ -thioesters (Ubiquitin, MBP, PHPT1) could be easily produced and purified out of cell lysates ( FIG. 8 ).
- HPLC and MS analysis confirmed the formation of the desired protein thioesters with very low levels of undesired hydrolysis ( FIG. 9 ).
- split-inteins and also inteins
- the N-terminal junction is regarded as more tolerant towards deviations from the native N-extein residues it was important to evaluate the effect that the C-terminal amino acid of the protein of interest ( ⁇ 1 residue according to intein numbering conventions) would have on the yields of thioester formation.
- a complete library of Ub-X-NpuN fusion proteins was constructed where the C-terminal Ub residue (X) was varied from its native Gly to all other 19 proteinogenic amino acids. Proteins were expressed in E. coli and cell lysates, applied to the NpuC-AA affinity resin and purified.
- Protein yields were estimated from the SDSPAGE analysis for each purification and hydrolysis levels from RP-HPLC and MS analysis of the elution fractions ( FIG. 10 ). Results show similar trends to those known for non-split inteins, such as GyrA (29), and most amino acids display high yields of cleavage after overnight incubation with MESNa, the exceptions being Pro and Glu, for which recovery were 49 and 50%, respectively. As expected, the Asn ⁇ -thioester could not be isolated due to the well-known reaction of its side-chain with the adjacent ⁇ -thioester to form a succinimide.
- the system was next tested for the purification from inclusion bodies of a fragment of the histone H2B.
- Preparation of site-specifically modified histones using EPL is a topic of major interest due to their crucial role in the understanding of epigenetic regulation.
- histone fragments are remarkably poorly behaved and the preparation of their recombinant C-terminal ⁇ -thioesters particularly challenging.
- a H2B(1-116) fragment fused to NpuN was expressed in E. coli and the inclusion bodies extracted with 6 M urea.
- the H2B-NpuN fusion was subsequently diluted into a 3 M urea buffer and the corresponding C-terminal ⁇ -thioester generated concomitant with purification over the NpuC-AA affinity resin ( FIG. 11 ).
- H2B(1-116) C-terminal thioester was obtained in excellent purity (>90% by RP-HPLC) and isolated yield ( ⁇ 20 mg per L of culture). This represents a significant improvement over previous protocols which afford less protein (4 mg per L of culture) and require the use of multiple chromatographic purification steps including RP-HPLC.
- the H2B(1-116)-MES thioester obtained from the IntC-column can be directly used in EPL reactions without further purification. Accordingly, the protein was successfully ligated to a synthetic H2B(117-125) peptide containing an acetylated Lys at position 120 to yield semi-synthetic H2B-K120Ac ( FIG. 15 ).
- the modification of antibodies is a field of intense research, specially focused on the development of therapeutic antibody-drug-conjugates (30).
- the identity of the N-intein could have a significant effect on the expression levels of its fusion to a given protein of interest.
- the N-fragment of several of the fastest split DnaE inteins cross-reacted with NpuC, allowing one to use any of them with the same NpuC based affinity column. Accordingly, the expression levels of a model antibody ( ⁇ DEC, antibody against the DEC205 receptor) were tested and found the highest levels of expression were obtained when ⁇ DEC was fused to the AvaN intein ( FIG. 12 A ).
- ⁇ DEC-AvaN fusions were transfected into 293T cells and after 4 days of culture the supernatants were collected and purified over the NpuC-AA-column.
- the presence of a C-terminal thioester in the purified ⁇ DEC was confirmed by reacting it with a short fluorescent peptide with an N-terminal Cys residue ( FIGS. 12 B and C).
- MS of the deglycosylated and reduced ⁇ DEC-fluorophore conjugate was used to confirm its identity and SEC-MALS demonstrate the product was monodisperse and of the expected size for an IgG antibody.
- Split-inteins can be engineered for the preparation of protein ⁇ -thioesters and that the strong affinity between the two split-intein fragments provides a powerful handle for their purification.
- the generality of the approach is demonstrated by using it to generate highly pure thioesters of both soluble (ubiqutin, MBP, PHPT) and insoluble proteins (H2B fragment) as well as monoclonal antibodies ( ⁇ DEC).
- ubiqutin, MBP, PHPT soluble proteins
- H2B fragment insoluble proteins
- ⁇ DEC monoclonal antibodies
- several N-inteins can be tested for optimal expression levels of the protein of interest and used with one single NpuC-column.
- split inteins disclosed herein can be used to purify and modify a polypeptide of interest.
- a polypeptide of interest is provided in a fusion protein with a split intein N-fragment, e.g., via well-known recombinant protein methods.
- the fusion protein is then contacted with a corresponding split intein C-fragment under conditions that allow binding of the N-fragment and C-fragment to form an intein intermediate.
- the split intein C-fragment can be bound to a support (e.g., a solid support such as a resin) or can subsequently (e.g., after binding to the split intein N-fragment to form the intein intermediate) be bound to a support.
- Washes can include detergents, denaturing agents and salt solutions (e.g., NaCl).
- the intein intermediate can be reacted with a nucleophile to release the polypeptide of interest from the bound N- and C-fragment inteins wherein the C-terminus of the polypeptide is modified by the nucleophile added.
- the nucleophile can be a thiol to directed yield the polypeptide as an ⁇ -thioester, which in turn can be further modified, e.g., with a different nucleophile (e.g., a drug, a polymer, another polypeptide, a oligonucleotide), or any other moiety using the well-known ⁇ -thioester chemistry for protein modification at the C-terminus.
- a different nucleophile e.g., a drug, a polymer, another polypeptide, a oligonucleotide
- One advantage of this chemistry is that only the C-terminus is modified with a thioester for further modification, thus allowing for selective modification only at the C
- the nucleophile that is used in the methods disclosed herein either with the intein intermediate or as a subsequent nucleophile reacting with, e.g., a ⁇ -thioester can be any compound or material having a suitable nucleophilic moiety.
- a thiol moiety is contemplaed as the nucleophile.
- the thiol is a 1,2-aminothiol, or a 1,2-aminoselenol.
- An ⁇ -selenothioester can be formed by using a selenothiol (R-SeH).
- Alternative nucleophiles contemplated include amines (i.e.
- nucleophile can be a functional group within a compound of interest for conjugation to the polypeptide of interest (e.g., a drug to form a protein-drug conjugate) or could alternatively bear an additional functional group for subsequent known bioorthogonal reactions such as an azide or an alkyne (for a click chemistry reaction between the two function groups to form a triazole), a tetrazole, an ⁇ -ketoacid, an aldehyde or ketone, or a cyanobenzothiazole.
- IPTG isopropyl- ⁇ -D-thiogalactopyranoside
- DIPEA N,N-diisopropylethylamine
- Kanamycin sulfate Kan
- BME ⁇ -Mercaptoethanol
- DTT sodium 2-mercaptoethanesulfonate
- EDT ethanedithiol
- Coomassie brilliant blue N,N-dimethylformamide (DMF)
- Tetrakis(triphenylphosphine)palladium(0) Pd(PPh 3 ) 4
- phenylsilane triisopropylsilane (TIS) sodium diethyldithiocarbamate trihydrate
- 5(6)-carboxyfluorescein were purchased from Sigma-Aldrich (St. Louis, MO).
- Tris(2-carboxyethyl)phosphine hydrochloride was purchased from Thermo Scientific (Rockford, IL). Fmoc-Gly-OH, Fmoc-Lys(Alloc)-OH, and Boc-Cys(Trt)-OH were purchasd from Novabiochem (Laufelfingen, Switzerland). Piperidine was purchased from Alfa Aesar (Ward Hill, MA). Dichloromethane (DCM) and rink amide resin were purchased from EMD Chemicals (Billerica, MA). 1-Hydroxybenzotriazole hydrate (HOBt) was purchased from AnaSpec (Fremont, Ca).
- Trifluoroacetic acid was purchased from Halocarbon (North Augusta, SC). Complete protease inhibitor tablets were purchased from Roche Diagnostics (Mannheim, Germany). Nickel-nitrilotriacetic acid (Ni-NTA) resin was from Novagen (Gibbstown, NJ).
- the QuikChange XL II site directed mutagenesis kit was from Agilent (La Jolla, CA). DpnI and the Phusion High-Fidelity PCR kit were from New England Biolabs (Ipswich, MA). DNA purification kits (QIAprep spin minikit, QIAquick gel extraction kit, QIAquick PCR purification kit) were from Qiagen (Valencia, CA).
- DH5 ⁇ competent cells and One Shot BL21(DE3) chemically competent E. coli were purchased from Invitrogen (Carlsbad, CA) and used to generate “in-house” high-competency cell lines. Oligonucleotides were purchased from Integrated DNA Technologies (Coralville, IA). The new intein genes were generated synthetically and purchased from GENEWIZ (South Plainfield, NJ). All plasmids used in this study were sequenced by GENEWIZ.
- Criterion XT Bis-Tris gels (12%), Immun-blot PVDF membrane (0.2 ⁇ m), and Bradford reagent dye concentrate were purchased from Bio-Rad (Hercules, CA). 20 ⁇ MES-SDS running buffer was purchased from Boston Bioproducts (Ashland, MA). Mouse anti-myc monoclonal antibody ( ⁇ -myc) was purchased from Invitrogen (Carlsbad, CA). Anti-His Tag, clone HIS.H8 mouse monoclonal antibody ( ⁇ -His6) was purchased from Millipore (Billerica, MA). Mouse HA.11 monoclonal antibody ( ⁇ -HA) was purchased from Covance (Princeton, NJ). IRDye 800CW goat anti-Mouse IgG secondary antibody (Licor mouse 800) and Licor Blocking Buffer were purchased from LI-COR Biotechnology (Lincoln, NE).
- Size-exclusion chromatography was carried out on an ⁇ KTA FPLC system from GE Healthcare. Both preparative and analytical FPLC were carried out on a Superdex 75 10/300 or S200 10/300 column. For all runs, proteins were eluted over 1.35 column volumes of buffer (flow rate: 0.5 mL/min).
- Analytical RP-HPLC was performed on Hewlett-Packard 1100 and 1200 series instruments equipped with a C18 Vydac column (5 ⁇ m, 4.6 ⁇ 150 mm) at a flow rate of 1 mL/min. Preparative RP-HPLC was performed on a Waters prep LC system comprised of a Waters 2545 Binary Gradient Module and a Waters 2489 UV detector.
- Protein sequences of the split DnaE inteins were obtained from the NEB InBasel. This list consisted of 23 entries as of May 2011. Of these entries, two were discarded from the study as they did not have a C-intein sequence: Csp(PCC7822) and Nosp(CCY9414). Two pairs of inteins had identical sequences: Nsp(PCC7120) with Asp (these are most likely the same organism with two different names) and Sel(PCC6301) with Sel(PC7942). Thus, Nsp(PCC7120) and Sel(PCC6301) were removed from the library.
- the Mcht(PCC7420) and Oli C-intein sequences were identical, but both inteins were kept in the library as their N-intein sequences were different.
- the Aov intein had an “X” at position 87 in place of an absolutely conserved isoleucine (I), so 187 was utilized at this position.
- the plasmid for the kanamycin resistance assays bearing the Csp(PCC7424) intein proved to be unstable and yielded highly variable results; thus this intein was excluded from the analyses.
- the final library contained 18 inteins, Table 1.
- N- and C-inteins were manually aligned using the multiple alignment software Jalview2. All N-intein sequences were “left-justified” to align the first cysteine residue, and the variable N-intein tail region was not aligned. All C-intein sequences were “right-justified” to align the C-terminal asparagine.
- the residue numbering used in this study is based on the numbering for the NMR structure of a fused Npu intein (PDB code 2KEQ).
- PDB code 2KEQ fused Npu intein
- the C-intein numbering starts at 103, except for the Tel and Tvu inteins, which have a gap at this position and start at 104.
- sequence logos FIG. 6
- the high activity sequence logos were comprised of Cwa, Cra(CS505), Csp(PCC8801), Ava, Npu, Csp(CCY0110), Mcht(PCC7420), Maer(NIES843), Asp, Oli, and Aha (which was included based on the high activity of the C120G mutant).
- the low activity sequence logos were comprised of Aov, Ter, Ssp(PCC7002), Tvu, Tel, Ssp, and Sel(PC7942).
- the sequence logos were generated using WebLogo.
- Heat maps were generated using the statistical computing and graphics program “R”.
- KanR aminoglycoside phosphotransferase
- Npu gene fragments were cloned into a pBluescript KS (+) vector between KpnI and SacI restriction sites as previously described (36,37).
- This construct contained the following architecture: [KanR promoter]-[RBS]-[myc-KanRN]-[IntN]-[iRBS]-[IntC]-[CFN-KanRC] where the KanR promoter is the constitutive promoter found in most kanamycin-resistant plasmids, RBS is a common E.
- coli ribosomal binding site iRBS is an intervening ribosomal binding site preceded by a linker
- myc encodes for a c-myc epitope tag (EQKLISEEDL) (SEQ ID NO: 760)
- KanRN and KanRC are fragments of the KanR protein
- IntN and IntC are split intein fragments.
- An analogous Ssp plasmid was also constructed as previously described (36,37). These plasmids are referred to as myc-KanR-NpuDnaE-Split and myc-KanR-SspDnaE-Split.
- the entire synthetic gene was amplified with Phusion High-Fidelity Polymerase using primers annealing to the 5′ and 3′ overhangs.
- the resulting megaprimer was inserted into the myc-KanR plasmid in place of Npu by overlap-extension PCR with Phusion polymerase (39).
- the plasmids are named as: myc-KanR-XyzDnaE-Split (where Xyz indicates the intein name as given in Table 1). Specific point mutations were made to various inteins using a QuikChange Site-Directed Mutagenesis kit with the standard recommended protocol.
- Intein activity-coupled kanamycin resistance (KanR) assays were conducted in 96-well plate format as previously described (36,37). Typically, plasmids were transformed into 15 ⁇ L of sub-cloning efficiency DH5a cells by heat shock, and the transformed cells were grown for 18 hours at 37° C. in 3 mL of Luria-Bertani (LB) media with 100 ⁇ g/mL of ampicillin (LB/amp). The over-night cultures were diluted 250-fold into LB/amp solutions containing 8 different kanamycin concentrations (150 ⁇ L per culture). The cells were grown at 30° C.
- KanR Intein activity-coupled kanamycin resistance
- O ⁇ D O ⁇ b ⁇ s O ⁇ D Min + ( O ⁇ D Max - O ⁇ D Min ) 1 + 1 ⁇ 0 [ ( logIC 50 - log ⁇ [ Kan ] ) ⁇ HillSlope ]
- O ⁇ D O ⁇ b ⁇ s O ⁇ D Min + ( O ⁇ D Max - O ⁇ D Min ) 1 + 1 ⁇ 0 [ ( logIC 50 - log ⁇ [ Kan ] ) ⁇ HillSlope ]
- OD min was fixed to the background absorbance at 650 nm, and all other parameters were allowed to vary.
- the reported error bars for the IC 50 bar graphs ( FIG. 1 b ) represent the standard error in the best-fit IC 50 value from three or four collectively fit dose-response curves.
- DH5a cells were transformed with the assay plasmids identically as for the 96-well plate setup and grown for 18 hours at 37° C. while shaking. The overnight cultures were used to inoculate 3 mL of fresh LB/amp at a 1:300 dilution, and the cells were incubated at 30° C. for 24 hours. The ODs of the 30° C. cultures were measured at 650 nm to assess relative bacterial levels, then 150 ⁇ L of each culture was transferred to an Eppendorf tube and centrifuged at 17,000 rcf for 2 minutes.
- the supernatant was aspirated off, and the cell pellets were resuspended/lysed in ⁇ 200 ⁇ L of 2 ⁇ SDS gel loading dye containing 4% BME (the resuspension volumes were varied slightly to normalize for differences in OD).
- the samples were boiled for 10 minutes, then centrifuged at 17,000 rcf for 1 minute.
- Each sample (5 ⁇ L) was loaded onto a 12% Bis-Tris gel and run in MES-SDS running buffer.
- the proteins were transferred to PVDF membrane in Towbin transfer buffer (25 mM Tris, 192 mM glycine, 15% methanol) at 100V for 90 minutes.
- Membranes were blocked with 4% milk in TBST, then the primary antibody ( ⁇ -myc, 1:5000) and secondary antibody (Licor mouse 800, 1:15,000) were sequentially applied in 4% milk in TBST. The blots were imaged using the Licor Odyssey scanner.
- N-intein expression plasmids were derived from a previously described NpuN plasmid, pMR-Ub-NpuN(WT) (36,37). This plasmid encoded for the following protein sequence:
- IntN plasmids were cloned using overlap-extension PCR to generate Ub-IntN fusion genes in homologous plasmids in a traceless manner (39).
- N-intein genes were amplified by Phusion polymerase from the synthetic gene plasmids using primers with overhangs that anneal to the plasmid sequences surrounding NpuN in pMR-Ub-NpuN- ⁇ Gly 4 .
- the resulting megaprimer was then used to insert the new N-intein gene in place of NpuN to generate a new plasmid called pMR-Ub-IntN that was identical to the NpuN plasmid except for the N-intein gene.
- the C-intein plasmids were all derived from a previously described NpuC plasmid, pET-NpuC(WT)-SUMO (37). This plasmid encoded for the following protein sequence:
- IntC plasmids were cloned using overlap-extension PCR to generate IntC-SUMO fusion genes in homologous plasmids in a traceless manner (39).
- C-intein genes were amplified by Phusion polymerase from the synthetic gene plasmids using primers with overhangs that anneal to the plasmid sequences surrounding NpuC in pET-NpuC(WT)-SUMO.
- the resulting megaprimer was then used to insert the new C-intein gene in place of NpuC to generate a new plasmid called pET-IntC-SUMO that was identical to the NpuC plasmid except for the C-intein gene.
- lysis buffer 50 mM phosphate, 300 mM NaCl, 5 mM imidazole, 2 mM BME, pH 8.0
- the cell pellets were resuspended by adding an additional 15 mL of lysis buffer supplemented with Complete protease inhibitor cocktail. Cells were lysed by sonication (35% amplitude, 8 ⁇ 20 second pulses separated by 30 seconds on ice). The soluble fraction was recovered by centrifugation (35,000 rcf, 30 min). The soluble fraction was mixed with 2 mL of Ni-NTA resin and incubated at 4° C. for 30 minutes. After incubation, the slurry was loaded onto a fitted column.
- the column was washed with 5 column volumes (CV) of lysis buffer, 5 CV of wash buffer 1 (lysis buffer with 20 mM imidazole), and 3 CV of wash buffer 2 (lysis buffer with 50 mM imidazole).
- the protein was eluted with elution buffer (lysis buffer with 250 mM imidazole) in four 1.5 CV elution fractions.
- the wash and elution fractions were analyzed by SDS-PAGE.
- the proteins were purified by gel filtration.
- the wash and elution fractions were all treated with 50 mM DTT for 30 minutes on ice.
- the first elution fraction was then directly injected on an S75 10/300 gel filtration column (3 ⁇ 1 mL injections) and eluted over 1.35 CV in freshly prepared, degassed splicing buffer (100 mM phosphates, 150 mM NaCl, 1 mM DTT, 1 mM EDTA, pH 7.2).
- the concentration of pure proteins were determined by UV A280 nm and by the Bradford assay.
- the Ub-CwaN protein did not express well in the soluble fraction, and all of the enriched protein was aggregated, as observed by gel filtration analysis.
- the protein was extracted from the insoluble fraction of the lysate as follows. First, the lysate pellet was resuspended in 20 mL of Triton wash buffer (lysis buffer with 0.1% Triton X-100) and incubated at room temperature for 30 minutes. The Triton wash was centrifuged at 35,000 rcf for 30 minutes, and the supernatant was discarded.
- Triton wash buffer lysis buffer with 0.1% Triton X-100
- the pellet was resuspended in 20 mL of lysis buffer containing 6 M urea, and the mixture was incubated overnight at 4° C. The mixture was centrifuged at 35,000 rcf for 30 minutes, and then the supernatant was mixed with 2 mL of NiNTA resin.
- the Ni column was run identically as for the native purifications described above, except that every buffer had a background of 6 M urea. Following enrichment over a Ni-NTA column, the 50 mM imidazole wash and the first two elution fractions were pooled and diluted to 0.2 mg/mL.
- the diluted protein was refolded into lysis buffer (without urea) by step-wise dialysis removal of the urea at 4° C.
- the protein was concentrated four-fold to 3 mL and immediately purified by gel filtration as indicated for the native purifications above.
- the pure protein was analyzed by analytical gel filtration, analytical RP-HPLC, and mass spectrometry. Note that this construct was highly susceptible to aggregation. When re-folded at 2 mg/mL rather than 0.2 mg/mL, less than 10% of the obtained protein was monomeric, whereas more dilute refolding yielded roughly 50% monomeric protein.
- the obtained protein was 80% monomeric, and the monomer to aggregate ratio did not change after 24 hours of storage at 4° C.
- the concentration of pure protein was determined by the Bradford assay.
- the AvaC-SUMO and Csp(PCC8801)C-SUMO proteins did not express well at 37° C., so the proteins were re-expressed by induction at 18° C. for 16 hours.
- the 50 mM imidazole wash and the first two elution fractions were pooled and dialyzed into TEV cleavage buffer (50 mM phosphate, 300 mM NaCl, 5 mM imidazole, 0.5 mM EDTA, 0.5 mM DTT, pH 8.0) then treated with 40 ⁇ g of His-tagged TEV protease overnight at room temperature.
- TEV cleavage buffer 50 mM phosphate, 300 mM NaCl, 5 mM imidazole, 0.5 mM EDTA, 0.5 mM DTT, pH 8.0
- the cleavage was confirmed by RP-HPLC/MS, after which the reaction solution was incubated with Ni-NTA resin at room temperature for 30 min. The flow-through and two 1.5 CV washes with wash buffer 1 were collected and pooled.
- the protein was then concentrated to 3-4 mL, injected onto the S75 10/300 gel filtration column (3 ⁇ 1 mL injections), and eluted over 1.35 CV in freshly prepared, degassed splicing buffer (100 mM phosphates, 150 mM NaCl, 1 mM DTT, 1 mM EDTA, pH 7.2).
- degassed splicing buffer 100 mM phosphates, 150 mM NaCl, 1 mM DTT, 1 mM EDTA, pH 7.2.
- FPLC fractions were analyzed by SDS-PAGE, and the purest fractions were pooled and analyzed by analytical gel filtration, analytical RP-HPLC, and mass spectrometry.
- the concentration of pure protein was determined by UV A 280nm and by the Bradford assay.
- a typical reaction volume was 300 ⁇ L and was carried out in an Eppendorf tube on a heat block. During the reaction, 20 ⁇ L aliquots of the reaction solution were removed at the desired time points and quenched in 20 ⁇ L of 2 ⁇ concentrated SDS gel loading dye on ice to afford a final quenched solution with 40 mM Tris ( ⁇ pH 7.0), 10% (v/v) glycerol, 1% (w/v) SDS, 0.02% (w/v) bromophenol blue, and 2% (v/v) BME. For each reaction, an artificial zero time point was taken by mixing equivalent amounts of starting materials directly into the quencher solution. Samples were boiled for 10 minutes then centrifuged at 17,000 rcf for 1 minute. Aliquots of starting materials and time points (15 ⁇ L) were loaded onto Bis-Tris gels and run in MES-SDS running buffer. The gels were Coomassie-stained then imaged using the Licor Odyssey scanner.
- each lane of a gel was analyzed using the Licor Odyssey quantification function or ImageJ. Given the close proximity of the starting material bands, these bands were typically integrated together. To normalize for loading error, the integrated intensity of each band in a lane was expressed as a fraction intensity of the total band intensity in that lane (which remained relatively constant between lanes).
- reaction solution 90 ⁇ L aliquots of the reaction solution were removed at the desired time points and quenched in 30 ⁇ L of a quenching solution (6 M guanidine hydrochloride with 4% trifluoroacetic acid). 100 ⁇ L of each quenched time point were injected onto an analytical C18 RP-HPLC column and eluted over a 25-73% buffer B gradient in 30 minutes, preceded by a two minute isocratic phase in 25% buffer B (see Equipment section for column and running buffer specifications). At different time points, various HPLC peaks were collected and their identities were confirmed by mass spectrometry. The IntC-(Ub)SUMO species were identified by MS, verifying branched intermediate formation and depletion.
- the rate of branched intermediate resolution (k 5 ) is on the same order—of magnitude as the rates of the first two reversible steps, whereas the cleavage rates from L (k 6 ) and B (k 7 ) are relatively slow.
- branched intermediate resolution (k 5 ) is also slow, on the same order of magnitude as the cleavage.
- Fmoc-based solid phase peptide synthesis was used to produce a peptide with the sequence H-Cys-Gly-Lys(Fluorescein)-NH 2 .
- the peptide was synthesized on Rink amide resin at a 0.2 mmol scale as follows: 20% piperidine in DMF was used for Fmoc deprotection using a one minute equilibration of the resin followed by a 20 minute incubation. After Fmoc deprotection, amino acids were coupled using DIC/HOBt as activating agents.
- the amino acid (1.1 mmol) was dissolved in 50:50 DCM:DMF (2 mL) and was activated with DIC (1.0 mmol) and HOBt (1.2 mmol) at 0° C. for 15 minutes. The mixture was added to the N-terminally deprotected resin and coupled for 10 minutes at room temperature.
- the lysine side chain was deprotected by treatment with Pd(Ph 3 ) 4 (0.1 eq.) and phenylsilane (25 eq.) in dry DCM for 30 minutes.
- the peptidyl resin was washed with DCM (2 ⁇ 5 mL) and DMF (2 ⁇ 5 mL) followed by two washes with 0.5% DIPEA in DMF (v/v) and two washes with 0.5% sodium diethyldithiocarbamate trihydrate in DMF (w/v) to remove any remaining traces of the Pd catalyst.
- 5(6)-Carboxyfluorescein was then coupled to the lysine side chain using the DIC/HOBt activation method overnight at room temperature.
- the peptide was cleaved off the resin using 94% TFA, 1% TIS, 2.5% EDT, and 2.5% H 2 O (6.5 mL) for one hour. After cleavage, roughly half of the TFA was evaporated under a stream of nitrogen. The crude peptide was precipitated with cold ether and washed with cold ether twice. Finally, the peptide was purified by RP-HPLC on C18 prep column over a 15-80% buffer B gradient in 40 minutes. The purified peptide was analyzed by analytical RP-HPLC and ESI-MS to confirm its identity. Note that no attempt was made to separately isolate the 5-carboxyfluorescein and 6-carboxyfluorescein conjugates, thus the peptide is a mixture of these two isomers.
- This plasmid was modified to replace the MxeGyrA intein with a fused Npu intein.
- the myc-KanR-NpuDnaE-Split plasmid was modified by QuikChange to remove the iRBS sequence separating the NpuN and NpuC genes.
- the resulting plasmid, myc-KanR-NpuDnaE-Fused, was then used as a template to amplify megaprimers bearing the fused Npu intein with overhangs homologous to the sequences surrounding MxeGyrA in the modified pTXB1 vector.
- Npu gene with the N137A mutation was inserted in place of MxeGyrA using overlap-extension PCR with the Phusion polymerase. (39) Importantly, this construct was modified to include the native C-extein residues of Npu (CFN) instead of those for MxeGyrA (TEA).
- CFN C-extein residues of Npu
- TEA MxeGyrA
- proteins were fused to AvaDnaE or MchtDnaE to test the sequence dependence on thiolysis from these inteins.
- the proteins utilized were the N-terminal S113 domain of human Grb2 (AAs 1-55+/ ⁇ an exogenous C-terminal Gly), the S112 domain of human Abl kinase (AAs 122-217), eGFP, and the catalytic domain of human PARP1 (AAs 657-1015). All plasmids were cloned using the aforementioned methods to yield plasmids encoding the following proteins:
- the cell pellets were transferred to 50 mL conical tubes with 5 mL of lysis buffer (50 mM phosphate, 300 mM NaCl, 5 mM imidazole, No BME, pH 8.0) and stored at ⁇ 80° C. The cell pellets were resuspended by adding an additional 15 mL of lysis buffer supplemented with Complete protein inhibitor cocktail. Cells were lysed by sonication (35% amplitude, 8 ⁇ 20 second pulses separated by 30 seconds on ice). The soluble fraction was recovered by centrifugation (35,000 rcf, 30 min).
- the soluble fraction was mixed with 2 mL of Ni-NTA resin and incubated at 4° C. for 30 minutes. After incubation, the slurry was loaded onto a fitted column. After discarding the flow-through, the column was washed with 5 column volumes (CV) of lysis buffer, 5 CV of wash buffer 1 (lysis buffer with 20 mM imidazole), and 3 CV of wash buffer 2 (lysis buffer with 50 mM imidazole). The protein was eluted with elution buffer (lysis buffer with 250 mM imidazole) in four 1.5 CV elution fractions. The wash and elution fractions were analyzed by SDS-PAGE with loading dye containing no thiols.
- the cleanest fractions were pooled and treated with 10 mM TCEP for 20 minutes on ice. Then, the solution was injected on an S75 or S200 10/300 gel filtration column (2 ⁇ 1 mL injections), and eluted over 1.35 CV in thiolysis buffer (100 mM phosphates, 150 mM NaCl, 1 mM EDTA, 1 mM TCEP, pH 7.2).
- thiolysis buffer 100 mM phosphates, 150 mM NaCl, 1 mM EDTA, 1 mM TCEP, pH 7.2.
- the FPLC fractions were analyzed by SDS-PAGE with loading dye containing no thiols, and the purest fractions were pooled and analyzed by analytical RP-HPLC and mass spectrometry. The concentration of pure protein was determined by UV A 280nm .
- each Ub-Intein fusion protein four reactions were carried on a 100 ⁇ L scale at 30° C.
- the fusion protein 50 ⁇ M was incubated in thiolysis buffer (100 mM phosphate, 150 mM NaCl, 1 mM EDTA, 1 mM TCEP, pH 7.2) supplemented with freshly added TCEP (an additional 5 mM).
- the protein was incubated identically as for the first reaction, except that each reaction had either 100 mM MESNa or 1 mM CGK-Fluorescein.
- both MESNa and the peptide were added.
- reaction solution 5 ⁇ L were removed and quenched in 30 ⁇ L 2 ⁇ SDS loading dye containing no thiols. As time points were collected, they were stored at ⁇ 20° C. until the end of the reaction. After the reaction, the 35 ⁇ L quenched time points were thawed, treated with 1 ⁇ L of a 1 M TCEP stock solution, boiled for 10 minutes, and centrifuged at 17,000 rcf for 1 minute. Time points (5 ⁇ L) were loaded onto 12% Bis-Tris gels and run in MES-SDS running buffer. The gels were first imaged on a fluorescence imager to visualize the Ub-CGK-Fluorescein ligation product.
- the gels were coomassie-stained and imaged using the Licor Odyssey scanner.
- the reaction endpoints were quenched by 20-fold dilution in H 2 O with 0.1% TFA and injected on an analytical C18 RP-HPLC column. The mixture was separated over a 2 minute isocratic phase in 0% B followed by a 0-73% B linear gradient in 30 minutes. The major peaks were collected and analyzed by MS.
- the 45 ⁇ L quenched time points were thawed, treated with 1 ⁇ L of a 1 M TCEP stock solution, boiled for 10 minutes, and centrifuged at 17,000 rcf for 1 minute.
- Time points (15 ⁇ L) were loaded onto 12% Bis-Tris gels and run in MES-SDS running buffer. Then the gels were coomassie-stained and imaged using the Licor Odyssey scanner.
- the reaction endpoints were quenched by 4-fold dilution in H 2 O with 0.1% TFA and injected on an analytical C18 RP-HPLC column. The mixture was separated over a 2 minute isocratic phase in 0% B followed by a 0-73% B linear gradient in 30 minutes. The product peaks were collected and analyzed by MS.
- this species appears to accumulate during protein expression resulting in a mixture of “trapped” (dehydrated) and “free” (native, hydrated) fusion protein.
- the “free” protein Upon addition of MESNa at neutral pH, the “free” protein rapidly undergoes thiolysis to yield the desired product, and the “trapped” protein slowly rehydrates and is also thiolyzed to yield the same desired product.
- a “burst” phase was observed followed by a slower phase.
- the accumulation of dehydrated fusion protein could be reduced by expression at lower temperatures (18° C. instead of 37° C.), and these reactions could be driven faster and closer to completion by increasing the MESNa concentration from 100 mM to 200 mM MESNa.
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Molecular Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Microbiology (AREA)
- Medicinal Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Biophysics (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Gastroenterology & Hepatology (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Immunology (AREA)
- Peptides Or Proteins (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
Abstract
Description
(CLSYDTEILTVEYGAVPIGKIVEENIECTVYSVDENGFVYTQPIAQWHD |
RGEQEVFEYCLEDGSTIRATKDHKFMTEDGEMLPIDEIFEQGLDLKQVKG |
LPD). |
(CLSYETEILTVEYGLLPIGKIVEKRIESTVYSVDNNGNIYTQPVAQWHD |
RGEQEVFEYSLEDGSLIRATKDHKFMTVDGQMLPIDEIFERELDLMRVDN |
LPN). |
Mutated Split Intein C-Fragments and Binding to a Support
TABLE 1 | |||
DnaE Intein | |||
Name | Genus | Species | Strain |
Npu | Nostoc | punctiforme | PCC73102 |
Ssp | Synechocystis | species | PCC6803 |
Aha | Aphanothece | halophytica | |
Aov | Aphanizomenon | ovalisporum | |
Asp | Anabaena | species | PCC7120 |
Ava | Anabaena | variabilis | ATCC29413 |
Cra(CS505) | Cylindrospermopsis | raciborskii | CS-505 |
Csp(CCY0110) | Cyanothece | species | CCY0110 |
Csp(PCC8801) | Cyanothece | species | PCC8801 |
Cwa | Crocosphaera | watsonii | WH 8501 |
Maer(NIES843) | Microcystis | aeruginosa | NIES-843 |
Mcht | Microcoleus | chthonoplastes | PCC7420 |
(PCC7420)-2 | |||
Oli | Oscillatoria | limnetica | Solar Lake |
Sel(PC7942) | Synechococcus | elongatus | PC7942 |
Ssp(PCC7002) | Synechococcus | species | PCC7002 |
Tel | Thermosynechococcus | elongatus | BP-1 |
Ter-3 | Trichodesmium | erythraeum | IMS101 |
Tvu | Thermosynechococcus | vulcanus | |
[KanR promoter]-[RBS]-[myc-KanRN]-[IntN]-[iRBS]-[IntC]-[CFN-KanRC]
where the KanR promoter is the constitutive promoter found in most kanamycin-resistant plasmids, RBS is a common E. coli ribosomal binding site, iRBS is an intervening ribosomal binding site preceded by a linker, myc encodes for a c-myc epitope tag (EQKLISEEDL) (SEQ ID NO: 760), KanRN and KanRC are fragments of the KanR protein, and IntN and IntC are split intein fragments. An analogous Ssp plasmid was also constructed as previously described (36,37). These plasmids are referred to as myc-KanR-NpuDnaE-Split and myc-KanR-SspDnaE-Split. To generate the screening vectors for the remaining split inteins, synthetic genes were designed and purchased from GENEWIZ containing the following architecture:
[5′ overhang]-[IntN]-[iRBS]-[IntC]-[3′ overhang]
where the 5′ and 3′ overhangs were the exact 39 bp found upstream of NpuN and 25 bp found downstream of NpuC, respectively, in the myc-KanR-NpuDnaE-Split plasmid. For all inteins, the purchased gene sequences were codon-optimized with the default E. coli codon usage table generated based on all E. coli coding sequences in GenBank8. The synthetic genes were received in pUC57 vectors.
In each regression analysis, typically three or four independent dose response curves were collectively fit to the equation above using the GraphPad Prism software. In each fit, ODmin was fixed to the background absorbance at 650 nm, and all other parameters were allowed to vary. The reported error bars for the IC50 bar graphs (
(SEQ ID NO: 761) |
MHHHHHHGGMQIFVKTLTGKTITLEVESSDTIDNVKSKIQDKEGIPPDQQ |
ELIFAGKQLEDGRTLSDYNIQKESTLHLVLRLRGGGGGGKFAEY CLSYET |
EILTVEYGLLPIGKIVEKRIECTVYSVDNNGNIYTQPVAQWHDRGEQEVF |
EYCLEDGSLIRATKDHKFMTVDGQMLPIDEIFERELDLMRVDNLPN |
where the NpuN sequence is given in bold, the immediate native local extein residues are underlined, and these residues are preceded by His6-Ub with a Gly4 linker. Significant in vivo proteolysis was previously observed during expression of this construct, so this plasmid was modified using QuikChange to remove the Gly4 sequence. The resulting plasmid, pMR-Ub-NpuN-ΔGly4 was used as the template for all other Ub-IntN plasmids and encoded for the following protein sequence:
(SEQ ID NO: 762) |
MHHHHHHGGMQIFVKTLTGKTITLEVESSDTIDNVKSKIQDKEGIPPDQQ |
ELIFAGKQLEDGRTLSDYNIQKESTLHLVLRLRGGKFAEY CLSYETEILT |
VEYGLLPIGKIVEKRIECTVYSVDNNGNIYTQPVAQWHDRGEQEVFEYCL |
EDGSLIRATKDHKFMTVDGQMLPIDEIFERELDLMRVDNLPN |
(SEQ ID NO: 763) |
MGSSHHHHHHGENLYFQ|GIKIATRKYLGKQNVYDIGVERDHNFALKNGF |
IASN CFNSGLVPRGSASMSDSEVNQEAKPEVKPEVKPETHINLKVSDGSS |
EIFFKIKKTTPLRRLMEAFAKRQGKEMDSLRFLYDGIRIQADQTPEDLDM |
EDNDIIEAHREQIGGYPYDVPDYA |
where the NpuC sequence is given in bold, and the immediate native local extein residues are underlined, followed by a linker sequence and SUMO-HA. This construct is preceded by a His6-tag and a tobacco etch virus (TEV) protease recognition sequence. The TEV protease cleavage site is indicated by “|” and leaves behind a glycine residue in place of an N-terminal IntC methionine.
For reactant depletion: Y=S·(e −k
For product formation: Y=Y max·(1−e −k
max
Y is the fractional intensity of a species, t is time in minutes, S is a scaling factor for reactant depletion (allowed to vary), Z indicates the fraction of reactant remaining at the reaction endpoint (allowed to vary), Ymax is a scaling factor for product formation, and kobs is the observed first-order rate constant for the splicing reaction (allowed to vary). Half-lives were calculated from the best-fit value for the first-order rate constant:
For reactions with no detectable side product formation, the rate of product (Ub-SUMO) and IntN formation were consistent with the rate of starting material depletion.
(SEQ ID NO: 764) |
MQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGKQL |
EDGRTLSDYNIQKESTLHLVLRLRGGCITGDALVALPEGESVRIADIVPG |
ARPNSDNAIDLKVLDRHGNPVLADRLFHSGEHPVYTVRTVEGLRVTGTAN |
HPLLCLVDVAGVPTLLWKLIDEIKPGDYAVIQRSAFSVDCAGFARGKPEF |
APTTYTVGVPGLVRFLEAHHRDPDAQAIADELTDGRFYYAKVASVTDAGV |
QPVYSLRVDTADHAFITNGFVSHATEAHHHHHH |
in which the intein sequence for MxeGyrA (N198A) is shown in bold, preceded by ubiquitin and followed by the endogenous local C-extein sequence (underlined) and a His6-tag.
(SEQ ID NO: 765) |
MQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGKQL |
EDGRTLSDYNIQKESTLHLVLRLRGGCLSYETEILTVEYGLLPIGKIVEK |
RIECTVYSVDNNGNIYTQPVAQWHDRGEQEVFEYCLEDGSLIRATKDHKF |
MTVDGQMLPIDEIFERELDLMRVDNLPNIKIATRKYLGKQNVYDIGVERD |
HNFALKNGFIASACFNHHHHHH |
This fusion showed substantial in vivo hydrolysis of ubiquitin when expressed in E. coli. Thus, it was further modified using QuikChange mutagenesis by mutating the +1 cysteine to alanine, generating the plasmid pTXB1-Ub-NpuDnaE-AAFN-H6. This plasmid encoded for the following protein (Ub-NpuDnaE-AAFN-H6) that was used for in vitro thiolysis experiments:
(SEQ ID NO: 766) |
MQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGKQL |
EDGRTLSDYNIQKESTLHLVLRLRGGCLSYETEILTVEYGLLPIGKIVEK |
RIECTVYSVDNNGNIYTQPVAQWHDRGEQEVFEYCLEDGSLIRATKDHKF |
MTVDGQMLPIDEIFERELDLMRVDNLPNIKIATRKYLGKQNVYDIGVERD |
HNFALKNGFIASA AFNHHHHHH |
The pTXB1-Ub-AvaDnaE-AAFN-H6 and pTXB1-Ub-MchtDnaE-AAFN-H6 plasmids, encoding for the following protein sequences (Ub-AvaDnaE-AAFN-H6 and Ub-MchtDnaE-AAFN-H6, respectively), were cloned analogously by modifying the pTXB1-Ub-NpuDnaE-AAFN-H6 plasmid.
Ub-AvaDnaE-AAFN-H6 |
(SEQ ID NO: 767) |
MQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGKQL |
EDGRTLSDYNIQKESTLHLVLRLRGGCLSYDTEVLTVEYGFVPIGEIVDK |
GIECSVFSIDSNGIVYTQPIAQWHHRGKQEVFEYCLEDGSIIKATKDHKF |
MTQDGKMLPIDEIFEQELDLLQVKGLPEIKIASRKFLGVENVYDIGVGRD |
HNFFVKNGLIASA AFNHHHHHH |
Ub-MchtDnaE-AAFN-H6 |
(SEQ ID NO: 768) |
MQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGKQL |
EDGRTLSDYNIQKESTLHLVLRLRGGCLSYDTQILTVEYGAVAIGEIVEK |
QIECTVYSVDENGYVYTQPIAQWHNRGEQEVFEYLLEDGATIRATKDHKF |
MTDEDQMLPIDQIFEQGLELKQVEVLQFVFVKIVRRQSLGVQNVYDIGVE |
KDHNFCLASGEIASA AFNHHHHHH |
As a control for the removal of the +1 Cys residue in the DnaE intein constructs, the +1 Thr residue was mutated from the pTXB1-Ub-MxeGyrA-ATEA-H6 plasmid by QuikChange mutagenesis to yield the plasmid pTXB1-Ub-MxeGyrA-AAEA-H6, encoding for the protein Ub-MxeGyrA-AAEA-H6.
(SEQ ID NO: 769) |
MQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGKQL |
EDGRTLSDYNIQKESTLHLVLRLRGGCITGDALVALPEGESVRIADIVPG |
ARPNSDNAIDLKVLDRHGNPVLADRLFHSGEHPVYTVRTVEGLRVTGTAN |
HPLLCLVDVAGVPTLLWKLIDEIKPGDYAVIQRSAFSVDCAGFARGKPEF |
APTTYTVGVPGLVRFLEAHHRDPDAQAIADELTDGRFYYAKVASVTDAGV |
QPVYSLRVDTADHAFITNGFVSHA AEAHHHHHH |
SH3-AvaDnaE-AAFN-H6 |
(SEQ ID NO: 770) |
MEAIAKYDFKATADDELSFKRGDILKVLNEECDQNWYKAELNGKDGFIPK |
NYIEMCLSYDTEVLTVEYGFVPIGEIVDKGIECSVFSIDSNGIVYTQPIA |
QWHHRGKQEVFEYCLEDGSIIKATKDHKFMTQDGKMLPIDEIFEQELDLL |
QVKGLPEIKIASRKFLGVENVYDIGVGRDHNFFVKNGLIASAAFNHHHHH |
H |
SH3-Gly-AvaDnaE-AAFN-H6 |
(SEQ ID NO: 771) |
MEAIAKYDFKATADDELSFKRGDILKVLNEECDQNWYKAELNGKDGFIPK |
NYIEMGCLSYDTEVLTVEYGFVPIGEIVDKGIECSVFSIDSNGIVYTQPI |
AQWHHRGKQEVFEYCLEDGSIIKATKDHKFMTQDGKMLPIDEIFEQELDL |
LQVKGLPEIKIASRKFLGVENVYDIGVGRDHNFFVKNGLIASAAFNHHHH |
HH |
SH3-MchtDnaE-AAFN-H6 |
(SEQ ID NO: 772) |
MEAIAKYDFKATADDELSFKRGDILKVLNEECDQNWYKAELNGKDGFIPK |
NYIEMCLSYDTQILTVEYGAVAIGEIVEKQIECTVYSVDENGYVYTQPIA |
QWHNRGEQEVFEYLLEDGATIRATKDHKFMTDEDQMLPIDQIFEQGLELK |
QVEVLQPVFVKIVRRQSLGVQNVYDIGVEKDHNFCLASGEIASAAFNHHH |
HHH |
SH2-AvaDnaE-AAFN-H6 |
(SEQ ID NO: 773) |
MLEKHSWYHGPVSRNAAEYLLSSGINGSFLVRESESSPGQRSISLRYEGR |
VYHYRINTASDGKLYVSSESRFNTLAELVHHHSTVADGLITTLHYPACLS |
YDTEVLTVEYGFVPIGEIVDKGIECSVFSIDSNGIVYTQPIAQWHHRGKQ |
EVFEYCLEDGSIIKATKDHKFMTQDGKMLPIDEIFEQELDLLQVKGLPEI |
KIASRKFLGVENVYDIGVGRDHNFFVKNGLIASAAFNHHHHHH |
eGFP-AvaDnaE-AAFN-H6 |
(SEQ ID NO: 774) |
MVSKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFICT |
TGKLPVPWPTLVTTLTYGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIF |
FKDDGNYKTRAEVKFEGDTLVNRIELKGIDFKEDGNILGHKLEYNYNSHN |
VYIMADKQKNGIKVNFKIRHNIEDGSVQLADHYQQNTPIGDGPVLLPDNH |
YLSTQSALSKDPNEKRDHMVLLEFVTAAGITLGMDELYKCLSYDTEVLTV |
EYGFVPIGEIVDKGIECSVFSIDSNGIVYTQPIAQWHHRGKQEVFEYCLE |
DGSIIKATKDHKFMTQDGKMLPIDEIFEQELDLLQVKGLPEIKIASRKFL |
GVENVYDIGVGRDHNFFVKNGLIASAAFNHHHHHH |
PARPC-AvaDnaE-AAFN-H6 |
(SEQ ID NO: 775) |
MVNPGTKSKLPKPVQDLIKMIFDVESMKKAMVEYEIDLQKMPLGKLSKRQ |
IQAAYSILSEVQQAVSQGSSDSQILDLSNRFYTLIPHDFGMKKPPLLNNA |
DSVQAKAEMLDNLLDIEVAYSLLRGGSDDSSKDPIDVNYEKLKTDIKVVD |
RDSEEAEIIRKYVKNTHATTHNAYDLEVIDIFKIEREGECQRYKPFKQLH |
NRRLLWHGSRTTNFAGILSQGLRIAPPEAPVTGYMFGKGIYFADMVSKSA |
NYCHTSQGDPIGLILLGEVALGNMYELKHASHISKLPKGKHSVKGLGKTT |
PDPSANISLDGVDVPLGTGISSGVNDTSLLYNEYIVYDIAQVNLKYLLKL |
KFNFKTSLWCLSYDTEVLTVEYGFVPIGEIVDKGIECSVFSIDSNGIVYT |
QPIAQWHHRGKQEVFEYCLEDGSIIKATKDHKFMTQDGKMLPIDEIFEQE |
LDLLQVKGLPEIKIASRKFLGVENVYDIGVGRDHNFFVKNGLIASAAFNH |
HHHHH |
- (1) Mills, K. V.; Perler, F. B. Protein Pept. Lett. 2005, 12, 751-5.
- (2) Vila-Perelló, M.; Muir, T. W. Cell 2010, 143, 191-200.
- (3) Southworth, M.; Amaya, K.; Evans, T.; Xu, M.; Perler, F. Biotechniques 1999, 27, 110-120.
- (4) Amitai, G.; Callahan, B. P.; Stanger, M. J.; Belfort, G.; Belfort, M. Proc. Natl.
Acad. Sci. USA 2009, 106, 11005-10. - (5) Zettler, J.; Schütz, V.; Mootz, H. D. FEBS Lett. 2009, 583, 909-14.
- (6) Iwai, H.; Züger, S.; Jin, J.; Tam, P.-H. FEBS Lett. 2006, 580, 1853-8.
- (7) Perler, F. B. Nucleic Acids Res. 2002, 30, 383-4.
- (8) Caspi, J.; Amitai, G.; Belenkiy, O.; Pietrokovski, S. Mol. Microbiol. 2003, 50, 1569-77.
- (9) Dassa, B.; Amitai, G.; Caspi, J.; Schueler-Furman, O.; Pietrokovski, S. Biochemistry 2007, 46, 322-330.
- (10) Chen, L.; Zhang, Y.; Li, G.; Huang, H.; Zhou, N. Anal. Biochem. 2010, 407, 180-7.
- (11) Martin, D. D.; Xu, M. Q.; Evans, T. C.
Biochemistry 2001, 40, 1393-402. - (12) Lockless, S. W.; Muir, T. W. Proc. Natl.
Acad. Sci. USA 2009, 106, 10999-1004. - (13) Shah, N. H.; Vila-Perelló, M.; Muir, T. W. Angew. Chem. Int. Ed. Engl. 2011, 50, 6511-5.
- (14) Oeemig, J. S.; Aranko, A. S.; Djupsjöbacka, J.; Heinämäki, K.; Iwaï, H. FEBS Lett. 2009, 583, 1451-1456.
- (15) Du, Z.; Liu, Y.; Ban, D.; Lopez, M. M.; Belfort, M.; Wang, C. J. Mol. Biol. 2010, 400, 755-67.
- (16) Appleby-Tagoe, J. H.; Thiel, I. V.; Wang, Y.; Wang, Y.; Mootz, H. D.; Liu, X.-Q. J. Biol. Chem. 2011, 286, 34440-7.
- (17) Busche, A. E. L.; Aranko, A. S.; Talebzadeh-Farooji, M.; Bernhard, F.; Dötsch, V.; Iwai, H. Angew. Chem. Int. Ed. Engl. 2009, 48, 6128-31.
- (18) Dhar, T.; Mootz, H. D. Chem. Commun. 2011, 47, 3063-5.
- (19) Pellois J-P, Muir T W: Current Opinion in Chemical Biology 2006, 10:487-491.
- (20) Cheriyan M, Perler F B: Adv Drug Deliv Rev 2009.
- (21) Muir T W, Sondhi D, Cole P A: Proc. Natl. Acad. Sci. U.S.A 1998, 95:6705-6710.
- (22) Dawson P E, Muir T W, Clark-Lewis I, Kent S B: Science 1994, 266:776-779.
- (23) Evans T C J R T C, Xu M-Q: Chem Rev 2002, 102:4869-4884.
- (24) Wu Y-W, Oesterlin L K, Tan K-T, Waldmann H, Alexandrov K, Goody R S: Nat Chem Biol 2010, 6:534-540.
- (25) Frutos S, Goger M, Giovani B, Cowburn D, Muir T W: Nature chemical biology 2010, 6:527-533.
- (26) lsen S K, Capili A D, Lu X, Tan D S, Lima C D: Nature 2010, 463:906-912.
- (27) Lu W, Sun Z, Tang Y, Chen J, Tang F, Zhang J, Liu J-N: Journal of Chromatography A 2011, 1218:2553-2560.
- (28) Zettler J, Schatz V, Mootz H D: FEBS Letters 2009, 583:909-914.
- (29) Southworth M W, Amaya K, Evans T C, Xu M Q, Perler F B: BioTechniques 1999, 27:110-114, 116, 118-120.
- (30) Carter P J: Exp Cell Res 2011, 317:1261-1269.
- (31) Perler, F. B. Nucleic Acids Res. 2002, 30, 383-4.
- (32) Waterhouse, A. M.; Procter, J. B.; Martin, D. M. A.; Clamp, M.; Barton, G. J.
Bioinformatics 2009, 25, 1189-91. - (33) Oeemig, J. S.; Aranko, A. S.; Djupsjobacka, J.; Heinämäki, K.; Iwaï, H. FEBS Lett. 2009, 583, 1451-1456.
- (34) Crooks, G. E.; Hon, G.; Chandonia, J.-M.; Brenner, S. E. Genome Res. 2004, 14, 1188-90.
- (35) R Development Core Team; R Foundation for Statistical Computing: Vienna, Austria, 2011.
- (36) Lockless, S. W.; Muir, T. W. Proc. Natl.
Acad. Sci. USA 2009, 106, 10999-1004. - (37) Shah, N. H.; Vila-Perelló, M.; Muir, T. W. Angew. Chem. Int. Ed. 2011, 50, 6511-5.
- (38) Stothard,
P. BioTechniques 2000, 28, 1102, 1104. - (39) Bryksin, A. V.; Matsumura,
I. BioTechniques 2010, 48, 463-5. - (40) Reid, B. P.; BPReid: Hanover, N H, 2009.
- (41) Schwarzer, D.; Ludwig, C.; Thiel, I. V.; Mootz, H. D. Biochemistry 2012, 51, 233-42.
- (42) Sun, P.; Ye, S.; Ferrandon, S.; Evans, T. C.; Xu, M.-Q.; Rao, Z. J. Mol. Biol. 2005, 353, 1093-105
Claims (19)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US17/491,729 US12054541B2 (en) | 2012-06-27 | 2021-10-01 | Split inteins, conjugates and uses thereof |
Applications Claiming Priority (6)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201261665215P | 2012-06-27 | 2012-06-27 | |
PCT/US2013/047235 WO2014004336A2 (en) | 2012-06-27 | 2013-06-24 | Split inteins, conjugates and uses thereof |
US201414411702A | 2014-12-29 | 2014-12-29 | |
US15/809,440 US10526401B2 (en) | 2012-06-27 | 2017-11-10 | Method of using split inteins, conjugates to generate protein conjugates |
US16/690,213 US11161899B2 (en) | 2012-06-27 | 2019-11-21 | Method using split inteins to generate protein conjugates |
US17/491,729 US12054541B2 (en) | 2012-06-27 | 2021-10-01 | Split inteins, conjugates and uses thereof |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US16/690,213 Continuation US11161899B2 (en) | 2012-06-27 | 2019-11-21 | Method using split inteins to generate protein conjugates |
Publications (2)
Publication Number | Publication Date |
---|---|
US20220098293A1 US20220098293A1 (en) | 2022-03-31 |
US12054541B2 true US12054541B2 (en) | 2024-08-06 |
Family
ID=49783989
Family Applications (4)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US14/411,702 Abandoned US20150344549A1 (en) | 2012-06-27 | 2013-06-24 | Split inteins, conjugates and uses thereof |
US15/809,440 Active US10526401B2 (en) | 2012-06-27 | 2017-11-10 | Method of using split inteins, conjugates to generate protein conjugates |
US16/690,213 Active US11161899B2 (en) | 2012-06-27 | 2019-11-21 | Method using split inteins to generate protein conjugates |
US17/491,729 Active US12054541B2 (en) | 2012-06-27 | 2021-10-01 | Split inteins, conjugates and uses thereof |
Family Applications Before (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US14/411,702 Abandoned US20150344549A1 (en) | 2012-06-27 | 2013-06-24 | Split inteins, conjugates and uses thereof |
US15/809,440 Active US10526401B2 (en) | 2012-06-27 | 2017-11-10 | Method of using split inteins, conjugates to generate protein conjugates |
US16/690,213 Active US11161899B2 (en) | 2012-06-27 | 2019-11-21 | Method using split inteins to generate protein conjugates |
Country Status (10)
Country | Link |
---|---|
US (4) | US20150344549A1 (en) |
EP (3) | EP2877490B1 (en) |
JP (2) | JP2015522020A (en) |
AU (1) | AU2013280693B2 (en) |
CA (1) | CA2877882A1 (en) |
DK (2) | DK3431497T3 (en) |
ES (2) | ES2709333T3 (en) |
PL (1) | PL2877490T3 (en) |
PT (1) | PT2877490T (en) |
WO (1) | WO2014004336A2 (en) |
Families Citing this family (71)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU2012333134B2 (en) | 2011-07-22 | 2017-05-25 | John Paul Guilinger | Evaluation and improvement of nuclease cleavage specificity |
US9163284B2 (en) | 2013-08-09 | 2015-10-20 | President And Fellows Of Harvard College | Methods for identifying a target site of a Cas9 nuclease |
US9340800B2 (en) | 2013-09-06 | 2016-05-17 | President And Fellows Of Harvard College | Extended DNA-sensing GRNAS |
AU2015298571B2 (en) | 2014-07-30 | 2020-09-03 | President And Fellows Of Harvard College | Cas9 proteins including ligand-dependent inteins |
CN104263747B (en) * | 2014-09-22 | 2017-03-15 | 程永升 | A kind of double label carriers of long-acting protein medicament expression renaturation modification and method |
KR101996774B1 (en) * | 2014-11-03 | 2019-07-04 | 메르크 파텐트 게엠베하 | Soluble intein fusion proteins and methods for purifying biomolecules |
WO2017070632A2 (en) | 2015-10-23 | 2017-04-27 | President And Fellows Of Harvard College | Nucleobase editors and uses thereof |
HUE062276T2 (en) * | 2016-01-29 | 2023-10-28 | Univ Princeton | Split inteins with exceptional splicing activity |
US11293021B1 (en) | 2016-06-23 | 2022-04-05 | Inscripta, Inc. | Automated cell processing methods, modules, instruments, and systems |
CA3032699A1 (en) | 2016-08-03 | 2018-02-08 | President And Fellows Of Harvard College | Adenosine nucleobase editors and uses thereof |
WO2018031683A1 (en) | 2016-08-09 | 2018-02-15 | President And Fellows Of Harvard College | Programmable cas9-recombinase fusion proteins and uses thereof |
WO2018039438A1 (en) | 2016-08-24 | 2018-03-01 | President And Fellows Of Harvard College | Incorporation of unnatural amino acids into proteins using base editing |
EP3526320A1 (en) | 2016-10-14 | 2019-08-21 | President and Fellows of Harvard College | Aav delivery of nucleobase editors |
US10745677B2 (en) | 2016-12-23 | 2020-08-18 | President And Fellows Of Harvard College | Editing of CCR5 receptor gene to protect against HIV infection |
EP3592853A1 (en) | 2017-03-09 | 2020-01-15 | President and Fellows of Harvard College | Suppression of pain by gene editing |
JP2020510439A (en) | 2017-03-10 | 2020-04-09 | プレジデント アンド フェローズ オブ ハーバード カレッジ | Base-editing factor from cytosine to guanine |
KR102687373B1 (en) | 2017-03-23 | 2024-07-23 | 프레지던트 앤드 펠로우즈 오브 하바드 칼리지 | Nucleobase editing agent comprising a nucleic acid programmable DNA binding protein |
WO2018209320A1 (en) | 2017-05-12 | 2018-11-15 | President And Fellows Of Harvard College | Aptazyme-embedded guide rnas for use with crispr-cas9 in genome editing and transcriptional activation |
US11939385B2 (en) * | 2017-05-16 | 2024-03-26 | ALX Oncology Inc. | Activatable antibodies and methods of use thereof |
US10011849B1 (en) | 2017-06-23 | 2018-07-03 | Inscripta, Inc. | Nucleic acid-guided nucleases |
US9982279B1 (en) | 2017-06-23 | 2018-05-29 | Inscripta, Inc. | Nucleic acid-guided nucleases |
SI3645719T1 (en) | 2017-06-30 | 2022-07-29 | Inscripta, Inc., | Automated cell processing methods, modules, instruments, and systems |
EP3658573A1 (en) | 2017-07-28 | 2020-06-03 | President and Fellows of Harvard College | Methods and compositions for evolving base editors using phage-assisted continuous evolution (pace) |
WO2019139645A2 (en) | 2017-08-30 | 2019-07-18 | President And Fellows Of Harvard College | High efficiency base editors comprising gam |
CN111757937A (en) | 2017-10-16 | 2020-10-09 | 布罗德研究所股份有限公司 | Use of adenosine base editor |
US10557216B2 (en) | 2018-04-24 | 2020-02-11 | Inscripta, Inc. | Automated instrumentation for production of T-cell receptor peptide libraries |
US10858761B2 (en) | 2018-04-24 | 2020-12-08 | Inscripta, Inc. | Nucleic acid-guided editing of exogenous polynucleotides in heterologous cells |
US11142740B2 (en) | 2018-08-14 | 2021-10-12 | Inscripta, Inc. | Detection of nuclease edited sequences in automated modules and instruments |
IL292273B2 (en) | 2018-08-14 | 2023-10-01 | Inscripta Inc | Instruments, modules, and methods for improved detection of edited sequences in live cells |
US11214781B2 (en) | 2018-10-22 | 2022-01-04 | Inscripta, Inc. | Engineered enzyme |
WO2020191249A1 (en) | 2019-03-19 | 2020-09-24 | The Broad Institute, Inc. | Methods and compositions for editing nucleotide sequences |
WO2020198174A1 (en) | 2019-03-25 | 2020-10-01 | Inscripta, Inc. | Simultaneous multiplex genome editing in yeast |
US11001831B2 (en) | 2019-03-25 | 2021-05-11 | Inscripta, Inc. | Simultaneous multiplex genome editing in yeast |
WO2020210751A1 (en) | 2019-04-12 | 2020-10-15 | The Broad Institute, Inc. | System for genome editing |
US10837021B1 (en) | 2019-06-06 | 2020-11-17 | Inscripta, Inc. | Curing for recursive nucleic acid-guided cell editing |
CN114423869A (en) | 2019-07-19 | 2022-04-29 | 旗舰先锋创新Vi有限责任公司 | Recombinase compositions and methods of use |
US20220315906A1 (en) | 2019-08-08 | 2022-10-06 | The Broad Institute, Inc. | Base editors with diversified targeting scope |
CA3152679A1 (en) * | 2019-08-28 | 2021-03-04 | The Trustees Of Princeton University | Atypical split inteins and uses thereof |
US20220340677A1 (en) * | 2019-09-09 | 2022-10-27 | Wuhan Yzy Biopharma Co., Ltd. | Split intein and preparation method for recombinant polypeptide using the same |
WO2021072328A1 (en) | 2019-10-10 | 2021-04-15 | The Broad Institute, Inc. | Methods and compositions for prime editing rna |
WO2021102059A1 (en) | 2019-11-19 | 2021-05-27 | Inscripta, Inc. | Methods for increasing observed editing in bacteria |
GB201917046D0 (en) | 2019-11-22 | 2020-01-08 | Ge Healthcare Bioprocess R&D Ab | Improved protein production |
KR20220118498A (en) | 2019-12-18 | 2022-08-25 | 인스크립타 인코포레이티드 | Cascade/DCAS3 Complementary Assay for In Vivo Detection of Nucleic Acid-Guided Nuclease Edited Cells |
EP4096770A1 (en) | 2020-01-27 | 2022-12-07 | Inscripta, Inc. | Electroporation modules and instrumentation |
WO2021158999A1 (en) | 2020-02-05 | 2021-08-12 | The Broad Institute, Inc. | Gene editing methods for treating spinal muscular atrophy |
US20230127008A1 (en) | 2020-03-11 | 2023-04-27 | The Broad Institute, Inc. | Stat3-targeted base editor therapeutics for the treatment of melanoma and other cancers |
US20210332388A1 (en) | 2020-04-24 | 2021-10-28 | Inscripta, Inc. | Compositions, methods, modules and instruments for automated nucleic acid-guided nuclease editing in mammalian cells |
WO2021222318A1 (en) | 2020-04-28 | 2021-11-04 | The Broad Institute, Inc. | Targeted base editing of the ush2a gene |
MX2022014008A (en) | 2020-05-08 | 2023-02-09 | Broad Inst Inc | Methods and compositions for simultaneous editing of both strands of a target double-stranded nucleotide sequence. |
US11787841B2 (en) | 2020-05-19 | 2023-10-17 | Inscripta, Inc. | Rationally-designed mutations to the thrA gene for enhanced lysine production in E. coli |
US11299731B1 (en) | 2020-09-15 | 2022-04-12 | Inscripta, Inc. | CRISPR editing to embed nucleic acid landing pads into genomes of live cells |
JP2023543803A (en) | 2020-09-24 | 2023-10-18 | ザ ブロード インスティテュート,インコーポレーテッド | Prime Editing Guide RNA, its composition, and its uses |
US11512297B2 (en) | 2020-11-09 | 2022-11-29 | Inscripta, Inc. | Affinity tag for recombination protein recruitment |
AU2021415461A1 (en) | 2021-01-04 | 2023-08-17 | Inscripta, Inc. | Mad nucleases |
US11332742B1 (en) | 2021-01-07 | 2022-05-17 | Inscripta, Inc. | Mad nucleases |
CA3203876A1 (en) | 2021-01-11 | 2022-07-14 | David R. Liu | Prime editor variants, constructs, and methods for enhancing prime editing efficiency and precision |
US11884924B2 (en) | 2021-02-16 | 2024-01-30 | Inscripta, Inc. | Dual strand nucleic acid-guided nickase editing |
WO2022187546A1 (en) | 2021-03-03 | 2022-09-09 | Shape Therapeutics Inc. | Auxotrophic cells for virus production and compositions and methods of making |
CN117355536A (en) | 2021-05-12 | 2024-01-05 | 思拓凡生物工艺研发有限公司 | Improved protein purification |
JP2024530487A (en) | 2021-08-06 | 2024-08-21 | ザ ブロード インスティテュート,インコーポレーテッド | Improved Prime Editor and Usage |
WO2023027169A1 (en) * | 2021-08-27 | 2023-03-02 | 国立大学法人 東京医科歯科大学 | Sorting system for live cells |
EP4399292A2 (en) | 2021-09-08 | 2024-07-17 | Flagship Pioneering Innovations VI, LLC | Methods and compositions for modulating a genome |
WO2023076898A1 (en) | 2021-10-25 | 2023-05-04 | The Broad Institute, Inc. | Methods and compositions for editing a genome with prime editing and a recombinase |
GB202409668D0 (en) | 2021-12-03 | 2024-08-14 | Broad Inst Inc | Self-assembling virus-like particles for delivery of prime editors and methods of making and using same |
CN114606252A (en) * | 2022-04-20 | 2022-06-10 | 广州市乾相生物科技有限公司 | Oligopeptide synthesis and purification method based on filtration method |
WO2023205687A1 (en) | 2022-04-20 | 2023-10-26 | The Broad Institute, Inc. | Improved prime editing methods and compositions |
WO2023250174A1 (en) | 2022-06-23 | 2023-12-28 | Prime Medicine, Inc. | Split prime editors |
WO2024077267A1 (en) | 2022-10-07 | 2024-04-11 | The Broad Institute, Inc. | Prime editing methods and compositions for treating triplet repeat disorders |
WO2024097763A1 (en) * | 2022-11-01 | 2024-05-10 | Memorial Sloan-Kettering Cancer Center | Intein-based sorting system and modular chimeric polypeptides |
WO2024108092A1 (en) | 2022-11-17 | 2024-05-23 | The Broad Institute, Inc. | Prime editor delivery by aav |
WO2024163862A2 (en) | 2023-02-03 | 2024-08-08 | The Broad Institute, Inc. | Gene editing methods, systems, and compositions for treating spinal muscular atrophy |
Citations (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4179337A (en) | 1973-07-20 | 1979-12-18 | Davis Frank F | Non-immunogenic polypeptides |
US4289872A (en) | 1979-04-06 | 1981-09-15 | Allied Corporation | Macromolecular highly branched homogeneous compound based on lysine units |
US4301144A (en) | 1979-07-11 | 1981-11-17 | Ajinomoto Company, Incorporated | Blood substitute containing modified hemoglobin |
US4496689A (en) | 1983-12-27 | 1985-01-29 | Miles Laboratories, Inc. | Covalently attached complex of alpha-1-proteinase inhibitor with a water soluble polymer |
US4640835A (en) | 1981-10-30 | 1987-02-03 | Nippon Chemiphar Company, Ltd. | Plasminogen activator derivatives |
US4670417A (en) | 1985-06-19 | 1987-06-02 | Ajinomoto Co., Inc. | Hemoglobin combined with a poly(alkylene oxide) |
US4791192A (en) | 1986-06-26 | 1988-12-13 | Takeda Chemical Industries, Ltd. | Chemically modified protein with polyethyleneglycol |
WO1989002439A1 (en) | 1987-09-21 | 1989-03-23 | Ml Technology Ventures, L.P. | Non-nucleotide linking reagents for nucleotide probes |
US5229490A (en) | 1987-05-06 | 1993-07-20 | The Rockefeller University | Multiple antigen peptide system |
WO1993021259A1 (en) | 1992-04-14 | 1993-10-28 | Cornell Research Foundation Inc. | Dendritic based macromolecules and method of production |
WO1995006731A2 (en) | 1993-09-02 | 1995-03-09 | Ribozyme Pharmaceuticals, Inc. | Non-nucleotide containing enzymatic nucleic acid |
WO1995011910A1 (en) | 1993-10-27 | 1995-05-04 | Ribozyme Pharmaceuticals, Inc. | 2'-amido and 2'-peptido modified oligonucleotides |
US20040077842A1 (en) | 2000-11-01 | 2004-04-22 | Jeff Himawan | Method of producing biospecific molecules by protein trans-splicing |
US20070207502A1 (en) | 1998-12-18 | 2007-09-06 | The Penn State Research Foundation | Cyclic peptides |
CN101281201A (en) | 2008-05-07 | 2008-10-08 | 南京大学 | Method using implicit peptides trans splicing to manufacture protein molecular weight standard |
CN101333555A (en) | 2007-12-25 | 2008-12-31 | 浙江大学 | Novel horizontal screening system construct by recipient cell G protein coupling and applications |
WO2009132455A1 (en) | 2008-04-30 | 2009-11-05 | Paul Xiang-Qin Liu | Protein splicing using short terminal split inteins |
WO2011057163A2 (en) | 2009-11-06 | 2011-05-12 | Agrivida, Inc. | Intein-modified enzymes, their production and industrial applications |
WO2013045632A1 (en) | 2011-09-28 | 2013-04-04 | Era Biotech, S.A. | Split inteins and uses thereof |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8986953B2 (en) * | 2011-01-20 | 2015-03-24 | University Of Rochester | Macrocyclic compounds with a hybrid peptidic/non-peptidic backbone and methods for their preparation |
-
2013
- 2013-06-24 DK DK18192415.0T patent/DK3431497T3/en active
- 2013-06-24 PT PT13809861T patent/PT2877490T/en unknown
- 2013-06-24 PL PL13809861T patent/PL2877490T3/en unknown
- 2013-06-24 US US14/411,702 patent/US20150344549A1/en not_active Abandoned
- 2013-06-24 JP JP2015520347A patent/JP2015522020A/en active Pending
- 2013-06-24 EP EP13809861.1A patent/EP2877490B1/en active Active
- 2013-06-24 AU AU2013280693A patent/AU2013280693B2/en active Active
- 2013-06-24 ES ES13809861T patent/ES2709333T3/en active Active
- 2013-06-24 EP EP18192415.0A patent/EP3431497B1/en active Active
- 2013-06-24 CA CA2877882A patent/CA2877882A1/en active Pending
- 2013-06-24 EP EP22186326.9A patent/EP4219549A1/en active Pending
- 2013-06-24 WO PCT/US2013/047235 patent/WO2014004336A2/en active Application Filing
- 2013-06-24 DK DK13809861.1T patent/DK2877490T3/en active
- 2013-06-24 ES ES18192415T patent/ES2930185T3/en active Active
-
2017
- 2017-11-09 JP JP2017216730A patent/JP6591511B2/en active Active
- 2017-11-10 US US15/809,440 patent/US10526401B2/en active Active
-
2019
- 2019-11-21 US US16/690,213 patent/US11161899B2/en active Active
-
2021
- 2021-10-01 US US17/491,729 patent/US12054541B2/en active Active
Patent Citations (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4179337A (en) | 1973-07-20 | 1979-12-18 | Davis Frank F | Non-immunogenic polypeptides |
US4289872A (en) | 1979-04-06 | 1981-09-15 | Allied Corporation | Macromolecular highly branched homogeneous compound based on lysine units |
US4301144A (en) | 1979-07-11 | 1981-11-17 | Ajinomoto Company, Incorporated | Blood substitute containing modified hemoglobin |
US4640835A (en) | 1981-10-30 | 1987-02-03 | Nippon Chemiphar Company, Ltd. | Plasminogen activator derivatives |
US4496689A (en) | 1983-12-27 | 1985-01-29 | Miles Laboratories, Inc. | Covalently attached complex of alpha-1-proteinase inhibitor with a water soluble polymer |
US4670417A (en) | 1985-06-19 | 1987-06-02 | Ajinomoto Co., Inc. | Hemoglobin combined with a poly(alkylene oxide) |
US4791192A (en) | 1986-06-26 | 1988-12-13 | Takeda Chemical Industries, Ltd. | Chemically modified protein with polyethyleneglycol |
US5229490A (en) | 1987-05-06 | 1993-07-20 | The Rockefeller University | Multiple antigen peptide system |
WO1989002439A1 (en) | 1987-09-21 | 1989-03-23 | Ml Technology Ventures, L.P. | Non-nucleotide linking reagents for nucleotide probes |
WO1993021259A1 (en) | 1992-04-14 | 1993-10-28 | Cornell Research Foundation Inc. | Dendritic based macromolecules and method of production |
WO1995006731A2 (en) | 1993-09-02 | 1995-03-09 | Ribozyme Pharmaceuticals, Inc. | Non-nucleotide containing enzymatic nucleic acid |
WO1995011910A1 (en) | 1993-10-27 | 1995-05-04 | Ribozyme Pharmaceuticals, Inc. | 2'-amido and 2'-peptido modified oligonucleotides |
US20070207502A1 (en) | 1998-12-18 | 2007-09-06 | The Penn State Research Foundation | Cyclic peptides |
US20040077842A1 (en) | 2000-11-01 | 2004-04-22 | Jeff Himawan | Method of producing biospecific molecules by protein trans-splicing |
CN101333555A (en) | 2007-12-25 | 2008-12-31 | 浙江大学 | Novel horizontal screening system construct by recipient cell G protein coupling and applications |
WO2009132455A1 (en) | 2008-04-30 | 2009-11-05 | Paul Xiang-Qin Liu | Protein splicing using short terminal split inteins |
CN101281201A (en) | 2008-05-07 | 2008-10-08 | 南京大学 | Method using implicit peptides trans splicing to manufacture protein molecular weight standard |
WO2011057163A2 (en) | 2009-11-06 | 2011-05-12 | Agrivida, Inc. | Intein-modified enzymes, their production and industrial applications |
WO2013045632A1 (en) | 2011-09-28 | 2013-04-04 | Era Biotech, S.A. | Split inteins and uses thereof |
Non-Patent Citations (63)
Title |
---|
Amitai et al., Modulation of intein activity by its neighboring extein substrates. Proc. Natl. Acad. Sci. USA, 106: 11005-10 (2009). |
Appleby-Tagoe et al., Highly efficient and more general cis- and trans-splicing inteins through sequential directed evolution. J. Biol. Chem. 286: 34440-7 (2011). |
Bryksin et al., Overlap extension PCR cloning: A simple and reliable way to create recombinant plasmids. BioTechniques, 48: 463-5 (2010). |
Busche et al., Segmental isotopic labeling of a central domain in a multidomain protein by protein trans-splicing using only one robust DnaE intein. Angew. Chem. Int. Ed. Engl. 48: 6128-31 (2009). |
Carter, Introduction to current and future protein therapeutics: a protein engineering perspective. Exp. Cell Res. 317: 1261-9 (2011). |
Caspi et al., Distribution of split DnaE inteins in cyanobacteria. Mol. Microbiol. 50: 1569-77 (2003). |
Chen et al., Functional characterization of a naturally occurring trans-splicing intein from Synechococcus elongatus in a mammalian cell system. Anal. Biochem. 407: 180-7 (2010). |
Cheriyan et al., Protein splicing: A versatile tool for drug discovery. Adv. Drug Deliv. Rev. 61(11): 899-907 (2009). |
Chu et al., Journal of Peptide Science 16:582-588, published on line Apr. 6, 2010. * |
Cload et al., Polyether tethered oligonucleotide probes. J. Am. Chem. Soc. 113: 6324-6 (1991). |
Crooks et al., WebLogo: a sequence logo generator. Genome Res. 14: 1188-90 (2004). |
Dassa et al., Trans protein splicing of cyanobacterial split inteins in endogenous and exogenous combinations. Biochemistry, 46: 322-30 (2007). |
Database Geneseq (online), Accession No. ATS48105, Beta-glucuronidse (gusA)-DnaE fusion protein sequence No. 1, (2009). |
Database Geneseq (online), Accession No. AVB21461, Nostoc punctiforme DnaE fragment protein sequence SEQ 20, (2009). |
Database Geneseq (online), Accession No. AZI18530, Intein protein sequence, Seq ID No. 2347, Jul. 7, 2011. |
Database Geneseq (online), Accession No. AZY15692, Nostoc sp. PCC 7120 derived DnaE intein (INT), Seq ID No. 68, Sep. 13, 2012. |
Database Geneseq (online), Accession No. BAM89830, DnaE protein N-Terminal split intein, Seq ID No. 28, May 23, 2013. |
Dawson et al., Synthesis of proteins by native chemical ligation. Science, 266: 776-9 (1994). |
Dhar et al., Modification of transmembrane and GPI-anchored proteins on living cells by efficient protein trans-splicing using the Npu DnaE intein. Chem. Commun. (Camb), 47: 3063-5 (2011). |
Du et al., Backbone dynamics and global effects of an activating mutation in minimized Mtu RecA inteins. J. Mol. Biol. 400: 755-67 (2010). |
Durand et al., Circular dichroism studies of an oligodeoxyribonucleotide containing a hairpin loop made of a hexaethylene glycol chain: conformation and stability. Nucl. Acids Res. 18: 6353-9 (1990). |
Evans et al., Mechanistic and kinetic considerations of protein splicing. Chem. Rev. 102: 4869-84 (2002). |
Ferentz et al., Disulfide-crosslinked oligonucleotides. J. Am. Chem. Soc. 113: 4000-2 (1991). |
Frutos et al., Branched intermediate formation stimulates peptide bond cleavage in protein splicing. Nat. Chem. Biol. 6: 527-33 (2010). |
Hao et al., Computational Discovery of Picomolar Qo Site Inhibitors of Cytochrome bc1 Complex. J. Am. Chem. Soc. 134(27): 11168-76 (2012). |
Iwai et al., Highly efficient protein trans-splicing by a naturally split DnaE intein from Nostoc punctiforme. FEBS Lett. 580: 1853-8 (2006). |
Jschke et al., Automated incorporation of polyethylene glycol into synthetic oligonucleotides. Tetrahedron Lett. 34: 301-4 (1993). |
Kalia et al., "Reactivity of Intein Thioesters: Appending a Functional Group to a Protein", ChemBioChem, 7, 1375-1383 (2006). |
Lockless et al., Traceless protein splicing utilizing evolved split inteins. Proc. Natl. Acad. Sci. USA, 106: 10999-1004 (2009). |
Lu et al., Split intein facilitated tag affinity purification for recombinant proteins with controllable tag removal by inducible auto-cleavage. J. Chromatography, 1218: 2553-60 (2011). |
Ma et al., Design and synthesis of RNA miniduplexes via a synthetic linker approach. 2. Generation of covalently closed, double-stranded cyclic HIV-1 TAR RNA analogs with high Tat-binding affinity. Nucl. Acids Res.21: 2585-9 (1993). |
Ma et al., Design and synthesis of RNA miniduplexes via a synthetic linker approach. Biochemistry, 32: 1751-8 (1993). |
Martin et al., Characterization of a naturally occurring trans-splicing intein from Synechocystis sp. PCC6803. Biochemistry, 40: 1393-402 (2001). |
McCurdy et al., Deoxyoligonucleotides with inverted polarity: Synthesis and use in triple-helix formation. Nucleosides & Nucleotides, 10(1-3): 287-90 (1991). |
Mills et al., The mechanism of intein-mediated protein splicing: variations on a theme. Protein Pept. Lett. 12: 751-5 (2005). |
Mohlmann et al., Site-specific modification of ED-B-targeting antibody using intein-fusion technology. BMC Biotechnol. 11: 76-85 (2011). |
Mootz, Split inteins as versatile tools for protein semisynthesis, Chembiochem, 10(16):2579-89 (2009). |
Muir et al., Expressed protein ligation: a general method for protein engineering. Proc. Natl. Acad. Sci. USA, 95: 6705-10 (1998). |
Oeemig et al., Solution structure of DnaE intein from Nostoc punctiforme: structural basis for the design of a new split intein suitable for site-specific chemical modification. FEBS Lett. 583: 1451-6 (2009). |
Olsen et al., Active site remodelling accompanies thioester bond formation in the SUMO E1. Nature, 463: 906-12 (2010). |
Ono et al., DNA triplex formation of oligonucleotide analogues consisting of linker groups and octamer segments that have opposite sugar-phosphate backbone polarities. Biochemistry, 30: 9914-21 (1991). |
Pellois et al., Semisynthetic proteins in mechanistic studies: Using chemistry to go where nature can't. Curr. Opin. Chem. Biol. 10: 487-91 (2006). |
Perler et al., InBase: the Intein Database. Nucl. Acids Res. 30: 383-4 (2002). |
Richardson et al., Tethered oligonucleotide probes. A strategy for the recognition of structured RNA. J. Am. Chem. Soc. 113: 5109-11 (1991). |
Schwarzer et al., Probing intein-catalyzed thioester formation by unnatural amino acid substitutions in the active site. Biochemistry, 51: 233-42 (2012). |
Seela et al., Oligodeoxyribonucleotides containing 1,3-propanediol as nucleoside substitute. Nucl. Acids Res. 15: 3113-29 (1987). |
Shah et al., Extein Residues Play an Intimate Role in the Rate-Limiting Step of Protein Trans-Splicing, J. Amer. Chem. Soc., 135(15):5839-47 (2013). |
Shah et al., Inteins: nature's gift to protein chemists, Chem. Sci., 5(2):446-461 (2014). |
Shah et al., Kinetic control of one-pot trans-splicing reactions by using a wild-type and designed split intein. Angew. Chem. Int. Ed. 50: 6511-5 (2011). |
Shah et al., Ultrafast protein splicing is common among cyanobacterial split inteins: Implications for protein engineering. J. Am. Chem. Soc. 134(28): 11338-41 (2012). |
Shah et al., Ultrafast protein splicing is common among cyanobacterial split inteins: Implications for protein engineering. J. Am. Chem. Soc. A-D (2012). |
Southworth et al., Purification of proteins fused to either the amino or carboxy terminus of the Mycobacterium xenopi gyrase A intein. Biotechniques, 27: 110-4, 116, 118-20 (1999). |
Stothard et al., The sequence manipulation suite: JavaScript programs for analyzing and formatting protein and DNA sequences. BioTechniques, 28: 1102, 1104 (2000). |
Sun et al., Crystal structures of an intein from the split dnaE gene of Synechocystis sp. PCC6803 reveal the catalytic model without the penultimate histidine and the mechanism of zinc ion inhibition of protein splicing. J. Mol. Biol. 353: 1093-105 (2005). |
Thom et al., "Recobinant Protein Hydrazides: Application to Site-Specific Protein PEFylation", Bioconjugate Chemistry. 22, 1017-1020 (2011). |
Vila-Perelló et al., Biological applications of protein splicing. Cell, 143: 191-200 (2010). |
Vila-Perelló et al., Streamlined Expressed Protein Ligation Using Split Interins, J. Amer. Chem. Soc., 135(1):286-92 (2013). |
Volkmann et al., Recent Progress in intein research: from mechanism to directed evolution and applications, Cell. Molec. Life Sci., 70(7):1182-206 (2012). |
Waterhouse et al., Jalview Version 2—a multiple sequence alignment editor and analysis workbench. Bioinformatics, 25: 1189-91 (2009). |
Wu et al., Conserved residues that modulate protein trans-splicing of Npu DnaE split intein, Biochem. J. 461:247-55 (2014). |
Wu et al., Membrane targeting mechanism of Rab GTPases elucidated by semisynthetic protein probes. Nat. Chem. Biol. 6: 534-40 (2010). |
Yang et al., Intein-mediated assembly of a functional β-glucuronidase in transgenic plants, PNAS, 100(6):3513-8 (2002). |
Zettler et al., The naturally split Npu DnaE intein exhibits an extraordinarily high rate in the protein trans-splicing reaction. FEBS Lett. 583: 909-14 (2009). |
Also Published As
Publication number | Publication date |
---|---|
AU2013280693A1 (en) | 2015-02-12 |
US11161899B2 (en) | 2021-11-02 |
EP2877490B1 (en) | 2018-09-05 |
ES2709333T3 (en) | 2019-04-16 |
EP3431497A1 (en) | 2019-01-23 |
PT2877490T (en) | 2019-02-12 |
JP2015522020A (en) | 2015-08-03 |
US20220098293A1 (en) | 2022-03-31 |
EP2877490A2 (en) | 2015-06-03 |
US20150344549A1 (en) | 2015-12-03 |
AU2013280693B2 (en) | 2018-01-04 |
US20180057577A1 (en) | 2018-03-01 |
DK2877490T3 (en) | 2018-12-17 |
WO2014004336A3 (en) | 2014-03-27 |
EP4219549A1 (en) | 2023-08-02 |
JP2018111685A (en) | 2018-07-19 |
EP2877490A4 (en) | 2016-07-06 |
CA2877882A1 (en) | 2014-01-03 |
ES2930185T3 (en) | 2022-12-07 |
US10526401B2 (en) | 2020-01-07 |
JP6591511B2 (en) | 2019-10-16 |
WO2014004336A2 (en) | 2014-01-03 |
EP3431497B1 (en) | 2022-07-27 |
DK3431497T3 (en) | 2022-10-10 |
US20200087388A1 (en) | 2020-03-19 |
PL2877490T3 (en) | 2019-03-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US12054541B2 (en) | Split inteins, conjugates and uses thereof | |
US20210371467A1 (en) | Split inteins with exceptional splicing activity | |
JP6883529B2 (en) | Methods and products for synthesizing fusion proteins | |
Ayers et al. | Introduction of unnatural amino acids into proteins using expressed protein ligation | |
CN110582566B (en) | Peptide ligase and use thereof | |
KR20190141229A (en) | Protein and peptide tag with improved spontaneous isopeptide bond formation rate and use thereof | |
Steinhagen et al. | Large scale modification of biomolecules using immobilized sortase A from Staphylococcus aureus | |
WO2020249723A1 (en) | Cysteine-free inteins | |
WO2014187960A1 (en) | Removal of n-terminal extensions from fusion proteins | |
Bachmann et al. | Chemical-tag labeling of proteins using fully recombinant split inteins | |
Matern et al. | Ligation of synthetic peptides to proteins using semisynthetic protein trans-splicing | |
Palei et al. | Preparation of Semisynthetic Peptide Macrocycles Using Split Inteins | |
JP2024513126A (en) | Polypeptides that interact with peptide tags at their loops or ends and their uses | |
WO2005012902A1 (en) | Method of screening useful protein |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: THE TRUSTEES OF PRINCETON UNIVERSITY, NEW JERSEY Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:MUIR, TOM W.;VILA-PERELLO, MIQUEL;LIU, ZHIHUA;AND OTHERS;SIGNING DATES FROM 20120730 TO 20120816;REEL/FRAME:057669/0030 |
|
FEPP | Fee payment procedure |
Free format text: ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: BIG.); ENTITY STATUS OF PATENT OWNER: SMALL ENTITY |
|
FEPP | Fee payment procedure |
Free format text: ENTITY STATUS SET TO SMALL (ORIGINAL EVENT CODE: SMAL); ENTITY STATUS OF PATENT OWNER: SMALL ENTITY |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: NON FINAL ACTION MAILED |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: NON FINAL ACTION MAILED |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: FINAL REJECTION MAILED |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: RESPONSE AFTER FINAL ACTION FORWARDED TO EXAMINER |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS |
|
ZAAA | Notice of allowance and fees due |
Free format text: ORIGINAL CODE: NOA |
|
ZAAB | Notice of allowance mailed |
Free format text: ORIGINAL CODE: MN/=. |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT VERIFIED |
|
STCF | Information on status: patent grant |
Free format text: PATENTED CASE |