EP3818072A1 - Means and methods for site-specific protein modification using transpeptidases - Google Patents
Means and methods for site-specific protein modification using transpeptidasesInfo
- Publication number
- EP3818072A1 EP3818072A1 EP19745053.9A EP19745053A EP3818072A1 EP 3818072 A1 EP3818072 A1 EP 3818072A1 EP 19745053 A EP19745053 A EP 19745053A EP 3818072 A1 EP3818072 A1 EP 3818072A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- amino acid
- ubiquitin
- seq
- polypeptide
- sortase
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 149
- 108090000279 Peptidyltransferases Proteins 0.000 title claims abstract description 57
- 230000009145 protein modification Effects 0.000 title description 3
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 251
- 229920001184 polypeptide Polymers 0.000 claims abstract description 233
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 233
- 102000044159 Ubiquitin Human genes 0.000 claims description 353
- 108090000848 Ubiquitin Proteins 0.000 claims description 353
- 235000001014 amino acid Nutrition 0.000 claims description 207
- 229940024606 amino acid Drugs 0.000 claims description 202
- 150000001413 amino acids Chemical class 0.000 claims description 199
- 210000004027 cell Anatomy 0.000 claims description 192
- 108090000250 sortase A Proteins 0.000 claims description 124
- 125000000539 amino acid group Chemical group 0.000 claims description 100
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 94
- 238000006243 chemical reaction Methods 0.000 claims description 84
- 230000001404 mediated effect Effects 0.000 claims description 75
- 101710123256 Pyrrolysine-tRNA ligase Proteins 0.000 claims description 63
- 230000014509 gene expression Effects 0.000 claims description 53
- XYFCBTPGUUZFHI-UHFFFAOYSA-N Phosphine Chemical compound P XYFCBTPGUUZFHI-UHFFFAOYSA-N 0.000 claims description 48
- 239000004472 Lysine Substances 0.000 claims description 40
- 108010037022 subtiligase Proteins 0.000 claims description 37
- 235000018977 lysine Nutrition 0.000 claims description 33
- 150000007970 thio esters Chemical group 0.000 claims description 33
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 claims description 32
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 claims description 31
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 claims description 29
- 102000052866 Amino Acyl-tRNA Synthetases Human genes 0.000 claims description 24
- 108700028939 Amino Acyl-tRNA Synthetases Proteins 0.000 claims description 24
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 claims description 24
- 235000004279 alanine Nutrition 0.000 claims description 24
- 210000004962 mammalian cell Anatomy 0.000 claims description 24
- 229910000073 phosphorus hydride Inorganic materials 0.000 claims description 24
- 108091033319 polynucleotide Proteins 0.000 claims description 24
- 102000040430 polynucleotide Human genes 0.000 claims description 24
- 239000002157 polynucleotide Substances 0.000 claims description 24
- UYRPRYSDOVYCOU-UHFFFAOYSA-N 2-diphenylphosphanylbenzoic acid Chemical compound OC(=O)C1=CC=CC=C1P(C=1C=CC=CC=1)C1=CC=CC=C1 UYRPRYSDOVYCOU-UHFFFAOYSA-N 0.000 claims description 20
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 claims description 20
- 125000003630 glycyl group Chemical group [H]N([H])C([H])([H])C(*)=O 0.000 claims description 20
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 claims description 20
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 19
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 claims description 19
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 claims description 19
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 claims description 18
- 235000013922 glutamic acid Nutrition 0.000 claims description 18
- 239000004220 glutamic acid Substances 0.000 claims description 18
- 230000003711 photoprotective effect Effects 0.000 claims description 18
- -1 (2-azidoacetyl)glycyl Chemical group 0.000 claims description 17
- 125000000852 azido group Chemical group *N=[N+]=[N-] 0.000 claims description 17
- 102100037932 Ubiquitin D Human genes 0.000 claims description 16
- 108020004705 Codon Proteins 0.000 claims description 14
- 238000000338 in vitro Methods 0.000 claims description 14
- 230000001580 bacterial effect Effects 0.000 claims description 13
- 230000002068 genetic effect Effects 0.000 claims description 13
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 claims description 12
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 claims description 12
- 238000010511 deprotection reaction Methods 0.000 claims description 12
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 claims description 12
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 claims description 11
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 claims description 11
- 229960000310 isoleucine Drugs 0.000 claims description 11
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 claims description 11
- 239000004475 Arginine Substances 0.000 claims description 10
- 241000191967 Staphylococcus aureus Species 0.000 claims description 10
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 claims description 10
- 238000012258 culturing Methods 0.000 claims description 10
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 claims description 10
- 125000006239 protecting group Chemical group 0.000 claims description 10
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 claims description 9
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 claims description 9
- 235000009582 asparagine Nutrition 0.000 claims description 9
- 229960001230 asparagine Drugs 0.000 claims description 9
- 125000001433 C-terminal amino-acid group Chemical group 0.000 claims description 8
- 101000941158 Homo sapiens Ubiquitin-related modifier 1 Proteins 0.000 claims description 8
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 claims description 8
- 241000205276 Methanosarcina Species 0.000 claims description 8
- 241000205275 Methanosarcina barkeri Species 0.000 claims description 8
- 102100031319 Ubiquitin-related modifier 1 Human genes 0.000 claims description 8
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 claims description 8
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 claims description 8
- 235000018417 cysteine Nutrition 0.000 claims description 8
- 239000004474 valine Substances 0.000 claims description 8
- 241000238631 Hexapoda Species 0.000 claims description 7
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 claims description 6
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 claims description 6
- 102100031911 NEDD8 Human genes 0.000 claims description 6
- 108700004934 NEDD8 Proteins 0.000 claims description 6
- 101150107958 NEDD8 gene Proteins 0.000 claims description 6
- 101100532088 Oryza sativa subsp. japonica RUB2 gene Proteins 0.000 claims description 6
- 101100532090 Oryza sativa subsp. japonica RUB3 gene Proteins 0.000 claims description 6
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 claims description 6
- 125000003277 amino group Chemical group 0.000 claims description 6
- 230000001268 conjugating effect Effects 0.000 claims description 6
- 229930182817 methionine Natural products 0.000 claims description 6
- 101150024074 rub1 gene Proteins 0.000 claims description 6
- 101150014930 UFM1 gene Proteins 0.000 claims description 5
- 210000005260 human cell Anatomy 0.000 claims description 5
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 claims description 4
- 241000251539 Vertebrata <Metazoa> Species 0.000 claims description 4
- 210000004102 animal cell Anatomy 0.000 claims description 4
- 230000002538 fungal effect Effects 0.000 claims description 4
- 210000000130 stem cell Anatomy 0.000 claims description 4
- 210000005253 yeast cell Anatomy 0.000 claims description 4
- GOLORTLGFDVFDW-UHFFFAOYSA-N 3-(1h-benzimidazol-2-yl)-7-(diethylamino)chromen-2-one Chemical compound C1=CC=C2NC(C3=CC4=CC=C(C=C4OC3=O)N(CC)CC)=NC2=C1 GOLORTLGFDVFDW-UHFFFAOYSA-N 0.000 claims description 3
- 108010061408 Autophagy-Related Protein 12 Proteins 0.000 claims description 3
- 102000012035 Autophagy-Related Protein 12 Human genes 0.000 claims description 3
- 108010032769 Autophagy-Related Protein 8 Family Proteins 0.000 claims description 3
- 102000007353 Autophagy-Related Protein 8 Family Human genes 0.000 claims description 3
- 241000244203 Caenorhabditis elegans Species 0.000 claims description 3
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 claims description 3
- 101000662056 Homo sapiens Ubiquitin D Proteins 0.000 claims description 3
- 101001057508 Homo sapiens Ubiquitin-like protein ISG15 Proteins 0.000 claims description 3
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 claims description 3
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 claims description 3
- 241000124008 Mammalia Species 0.000 claims description 3
- 241000699666 Mus <mouse, genus> Species 0.000 claims description 3
- 241000244206 Nematoda Species 0.000 claims description 3
- 102100027266 Ubiquitin-like protein ISG15 Human genes 0.000 claims description 3
- 108090000623 proteins and genes Proteins 0.000 description 163
- 102000004169 proteins and genes Human genes 0.000 description 137
- 235000018102 proteins Nutrition 0.000 description 113
- 238000010798 ubiquitination Methods 0.000 description 85
- 238000011534 incubation Methods 0.000 description 60
- 230000015572 biosynthetic process Effects 0.000 description 58
- 230000010741 sumoylation Effects 0.000 description 53
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 48
- 239000000203 mixture Substances 0.000 description 45
- 239000011347 resin Substances 0.000 description 43
- 229920005989 resin Polymers 0.000 description 43
- 108020004566 Transfer RNA Proteins 0.000 description 41
- 108010043401 Small Ubiquitin-Related Modifier Proteins Proteins 0.000 description 38
- 102000002669 Small Ubiquitin-Related Modifier Proteins Human genes 0.000 description 38
- 238000013459 approach Methods 0.000 description 36
- 210000004899 c-terminal region Anatomy 0.000 description 36
- 239000013612 plasmid Substances 0.000 description 36
- 108050006400 Cyclin Proteins 0.000 description 34
- 102000009339 Proliferating Cell Nuclear Antigen Human genes 0.000 description 34
- 239000007983 Tris buffer Substances 0.000 description 34
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 34
- 230000035772 mutation Effects 0.000 description 33
- 238000004458 analytical method Methods 0.000 description 32
- 239000002609 medium Substances 0.000 description 32
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 32
- 239000000126 substance Substances 0.000 description 32
- 239000000047 product Substances 0.000 description 31
- 239000000872 buffer Substances 0.000 description 29
- 239000013598 vector Substances 0.000 description 29
- 241000588724 Escherichia coli Species 0.000 description 23
- 108010093668 Deubiquitinating Enzymes Proteins 0.000 description 20
- 102000001477 Deubiquitinating Enzymes Human genes 0.000 description 20
- JGFZNNIVVJXRND-UHFFFAOYSA-N N,N-Diisopropylethylamine (DIPEA) Chemical compound CCN(C(C)C)C(C)C JGFZNNIVVJXRND-UHFFFAOYSA-N 0.000 description 20
- 229940088598 enzyme Drugs 0.000 description 20
- 239000000243 solution Substances 0.000 description 20
- 108020004414 DNA Proteins 0.000 description 19
- 238000010348 incorporation Methods 0.000 description 19
- 238000003776 cleavage reaction Methods 0.000 description 18
- 230000001939 inductive effect Effects 0.000 description 18
- 238000001262 western blot Methods 0.000 description 18
- 229910001868 water Inorganic materials 0.000 description 17
- 238000005119 centrifugation Methods 0.000 description 16
- 238000001542 size-exclusion chromatography Methods 0.000 description 16
- 108091005946 superfolder green fluorescent proteins Proteins 0.000 description 16
- 238000005406 washing Methods 0.000 description 16
- 102000004190 Enzymes Human genes 0.000 description 15
- 108090000790 Enzymes Proteins 0.000 description 15
- 230000000694 effects Effects 0.000 description 15
- 239000011521 glass Substances 0.000 description 15
- 238000001727 in vivo Methods 0.000 description 15
- 230000007017 scission Effects 0.000 description 15
- 239000002904 solvent Substances 0.000 description 15
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 14
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 14
- 239000011534 wash buffer Substances 0.000 description 14
- 101710138219 Ubiquitin D Proteins 0.000 description 13
- 239000000499 gel Substances 0.000 description 13
- 238000006460 hydrolysis reaction Methods 0.000 description 13
- 229960000318 kanamycin Drugs 0.000 description 13
- 238000004895 liquid chromatography mass spectrometry Methods 0.000 description 13
- 239000006166 lysate Substances 0.000 description 13
- 238000000746 purification Methods 0.000 description 13
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 12
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 12
- 238000003556 assay Methods 0.000 description 12
- 230000007062 hydrolysis Effects 0.000 description 12
- 229930027917 kanamycin Natural products 0.000 description 12
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 12
- 229930182823 kanamycin A Natural products 0.000 description 12
- 230000009467 reduction Effects 0.000 description 12
- 238000006722 reduction reaction Methods 0.000 description 12
- 239000002002 slurry Substances 0.000 description 12
- 238000003786 synthesis reaction Methods 0.000 description 12
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 12
- 239000004098 Tetracycline Substances 0.000 description 11
- 230000001419 dependent effect Effects 0.000 description 11
- 238000005516 engineering process Methods 0.000 description 11
- 238000011068 loading method Methods 0.000 description 11
- 229930101283 tetracycline Natural products 0.000 description 11
- 229960002180 tetracycline Drugs 0.000 description 11
- 235000019364 tetracycline Nutrition 0.000 description 11
- 150000003522 tetracyclines Chemical class 0.000 description 11
- 108020005038 Terminator Codon Proteins 0.000 description 10
- 239000003242 anti bacterial agent Substances 0.000 description 10
- 239000011575 calcium Substances 0.000 description 10
- 229960005091 chloramphenicol Drugs 0.000 description 10
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 10
- 230000008569 process Effects 0.000 description 10
- 239000000758 substrate Substances 0.000 description 10
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 9
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 9
- 239000004471 Glycine Substances 0.000 description 9
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 9
- 230000003197 catalytic effect Effects 0.000 description 9
- 230000008878 coupling Effects 0.000 description 9
- 238000010168 coupling process Methods 0.000 description 9
- 238000005859 coupling reaction Methods 0.000 description 9
- 229920003023 plastic Polymers 0.000 description 9
- IAZDPXIOMUYVGZ-WFGJKAKNSA-N Dimethyl sulfoxide Chemical compound [2H]C([2H])([2H])S(=O)C([2H])([2H])[2H] IAZDPXIOMUYVGZ-WFGJKAKNSA-N 0.000 description 8
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 8
- NQRYJNQNLNOLGT-UHFFFAOYSA-N Piperidine Chemical compound C1CCNCC1 NQRYJNQNLNOLGT-UHFFFAOYSA-N 0.000 description 8
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 8
- 239000004473 Threonine Substances 0.000 description 8
- 229960000723 ampicillin Drugs 0.000 description 8
- 230000001588 bifunctional effect Effects 0.000 description 8
- 239000006285 cell suspension Substances 0.000 description 8
- 108091005763 multidomain proteins Proteins 0.000 description 8
- 239000011541 reaction mixture Substances 0.000 description 8
- 238000013518 transcription Methods 0.000 description 8
- 230000035897 transcription Effects 0.000 description 8
- 238000001890 transfection Methods 0.000 description 8
- 229940088710 antibiotic agent Drugs 0.000 description 7
- 238000010367 cloning Methods 0.000 description 7
- 238000002330 electrospray ionisation mass spectrometry Methods 0.000 description 7
- DEFVIWRASFVYLL-UHFFFAOYSA-N ethylene glycol bis(2-aminoethyl)tetraacetic acid Chemical compound OC(=O)CN(CC(O)=O)CCOCCOCCN(CC(O)=O)CC(O)=O DEFVIWRASFVYLL-UHFFFAOYSA-N 0.000 description 7
- 238000002474 experimental method Methods 0.000 description 7
- YMAWOPBAYDPSLA-UHFFFAOYSA-N glycylglycine Chemical group [NH3+]CC(=O)NCC([O-])=O YMAWOPBAYDPSLA-UHFFFAOYSA-N 0.000 description 7
- 239000012139 lysis buffer Substances 0.000 description 7
- 239000011550 stock solution Substances 0.000 description 7
- 230000002123 temporal effect Effects 0.000 description 7
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 6
- 238000005160 1H NMR spectroscopy Methods 0.000 description 6
- PPXUUPXQWDQNGO-UHFFFAOYSA-N 2-azidoacetic acid Chemical compound OC(=O)CN=[N+]=[N-] PPXUUPXQWDQNGO-UHFFFAOYSA-N 0.000 description 6
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 6
- 229940124158 Protease/peptidase inhibitor Drugs 0.000 description 6
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 6
- 230000003115 biocidal effect Effects 0.000 description 6
- 238000000502 dialysis Methods 0.000 description 6
- UJTPZISIAWDGFF-UHFFFAOYSA-N ethenylsulfonylbenzene Chemical compound C=CS(=O)(=O)C1=CC=CC=C1 UJTPZISIAWDGFF-UHFFFAOYSA-N 0.000 description 6
- 230000006870 function Effects 0.000 description 6
- 230000017730 intein-mediated protein splicing Effects 0.000 description 6
- 239000000463 material Substances 0.000 description 6
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 6
- 238000012986 modification Methods 0.000 description 6
- 150000007523 nucleic acids Chemical class 0.000 description 6
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 6
- 238000011084 recovery Methods 0.000 description 6
- 102220460416 rs1292380177 Human genes 0.000 description 6
- 239000006228 supernatant Substances 0.000 description 6
- 238000013519 translation Methods 0.000 description 6
- 102220467177 Epsin-2_R72A_mutation Human genes 0.000 description 5
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 5
- 102000003960 Ligases Human genes 0.000 description 5
- 108090000364 Ligases Proteins 0.000 description 5
- 125000000729 N-terminal amino-acid group Chemical group 0.000 description 5
- 238000003428 Staudinger Azide reduction reaction Methods 0.000 description 5
- 238000010521 absorption reaction Methods 0.000 description 5
- 238000013019 agitation Methods 0.000 description 5
- 230000008901 benefit Effects 0.000 description 5
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 5
- 230000008033 biological extinction Effects 0.000 description 5
- 238000009835 boiling Methods 0.000 description 5
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 5
- 239000000706 filtrate Substances 0.000 description 5
- 230000001965 increasing effect Effects 0.000 description 5
- 239000013067 intermediate product Substances 0.000 description 5
- 239000012160 loading buffer Substances 0.000 description 5
- 230000004048 modification Effects 0.000 description 5
- 102000039446 nucleic acids Human genes 0.000 description 5
- 108020004707 nucleic acids Proteins 0.000 description 5
- 239000002773 nucleotide Substances 0.000 description 5
- 125000003729 nucleotide group Chemical group 0.000 description 5
- 238000010647 peptide synthesis reaction Methods 0.000 description 5
- 239000002244 precipitate Substances 0.000 description 5
- 238000002360 preparation method Methods 0.000 description 5
- 230000001105 regulatory effect Effects 0.000 description 5
- 230000002441 reversible effect Effects 0.000 description 5
- 239000007790 solid phase Substances 0.000 description 5
- 238000006467 substitution reaction Methods 0.000 description 5
- 229920001817 Agar Polymers 0.000 description 4
- 241000894006 Bacteria Species 0.000 description 4
- 229920002101 Chitin Polymers 0.000 description 4
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 4
- 108010008488 Glycylglycine Proteins 0.000 description 4
- 241000205274 Methanosarcina mazei Species 0.000 description 4
- 239000012505 Superdex™ Substances 0.000 description 4
- 102100039414 WD repeat-containing protein 48 Human genes 0.000 description 4
- 238000001042 affinity chromatography Methods 0.000 description 4
- 239000008272 agar Substances 0.000 description 4
- 235000003704 aspartic acid Nutrition 0.000 description 4
- IVRMZWNICZWHMI-UHFFFAOYSA-N azide group Chemical group [N-]=[N+]=[N-] IVRMZWNICZWHMI-UHFFFAOYSA-N 0.000 description 4
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 4
- 239000013592 cell lysate Substances 0.000 description 4
- 239000003795 chemical substances by application Substances 0.000 description 4
- 239000013604 expression vector Substances 0.000 description 4
- 239000012091 fetal bovine serum Substances 0.000 description 4
- 239000012634 fragment Substances 0.000 description 4
- WHUUTDBJXJRKMK-VKHMYHEASA-L glutamate group Chemical group N[C@@H](CCC(=O)[O-])C(=O)[O-] WHUUTDBJXJRKMK-VKHMYHEASA-L 0.000 description 4
- 229910052739 hydrogen Inorganic materials 0.000 description 4
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 4
- 239000007788 liquid Substances 0.000 description 4
- 239000012528 membrane Substances 0.000 description 4
- 239000008188 pellet Substances 0.000 description 4
- 239000000843 powder Substances 0.000 description 4
- 150000003384 small molecules Chemical class 0.000 description 4
- 125000006850 spacer group Chemical group 0.000 description 4
- 238000010186 staining Methods 0.000 description 4
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 4
- 150000003573 thiols Chemical class 0.000 description 4
- 101150042041 wdr48 gene Proteins 0.000 description 4
- JYEVQYFWINBXJU-QFIPXVFZSA-N (2s)-6-(9h-fluoren-9-ylmethoxycarbonylamino)-2-[(2-methylpropan-2-yl)oxycarbonylamino]hexanoic acid Chemical compound C1=CC=C2C(COC(=O)NCCCC[C@H](NC(=O)OC(C)(C)C)C(O)=O)C3=CC=CC=C3C2=C1 JYEVQYFWINBXJU-QFIPXVFZSA-N 0.000 description 3
- 125000003088 (fluoren-9-ylmethoxy)carbonyl group Chemical group 0.000 description 3
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 3
- 108020005098 Anticodon Proteins 0.000 description 3
- 102220488155 Bone morphogenetic protein receptor type-2_Q92L_mutation Human genes 0.000 description 3
- 102000053602 DNA Human genes 0.000 description 3
- 230000033616 DNA repair Effects 0.000 description 3
- 241000196324 Embryophyta Species 0.000 description 3
- 241001646716 Escherichia coli K-12 Species 0.000 description 3
- 102220566738 FRAS1-related extracellular matrix protein 1_E93A_mutation Human genes 0.000 description 3
- 239000007821 HATU Substances 0.000 description 3
- SRBFZHDQGSBBOR-HWQSCIPKSA-N L-arabinopyranose Chemical compound O[C@H]1COC(O)[C@H](O)[C@H]1O SRBFZHDQGSBBOR-HWQSCIPKSA-N 0.000 description 3
- 108091028043 Nucleic acid sequence Proteins 0.000 description 3
- 241000193996 Streptococcus pyogenes Species 0.000 description 3
- YXFVVABEGXRONW-UHFFFAOYSA-N Toluene Chemical compound CC1=CC=CC=C1 YXFVVABEGXRONW-UHFFFAOYSA-N 0.000 description 3
- ZMANZCXQSJIPKH-UHFFFAOYSA-N Triethylamine Chemical compound CCN(CC)CC ZMANZCXQSJIPKH-UHFFFAOYSA-N 0.000 description 3
- 101150096719 USP1 gene Proteins 0.000 description 3
- 238000000246 agarose gel electrophoresis Methods 0.000 description 3
- 239000003153 chemical reaction reagent Substances 0.000 description 3
- 150000001875 compounds Chemical class 0.000 description 3
- 230000021615 conjugation Effects 0.000 description 3
- 238000010276 construction Methods 0.000 description 3
- 238000004520 electroporation Methods 0.000 description 3
- 239000012149 elution buffer Substances 0.000 description 3
- 210000003527 eukaryotic cell Anatomy 0.000 description 3
- 238000000605 extraction Methods 0.000 description 3
- 235000019253 formic acid Nutrition 0.000 description 3
- 238000003306 harvesting Methods 0.000 description 3
- 230000006698 induction Effects 0.000 description 3
- 150000002668 lysine derivatives Chemical class 0.000 description 3
- 108020004999 messenger RNA Proteins 0.000 description 3
- 238000010369 molecular cloning Methods 0.000 description 3
- 229910052757 nitrogen Inorganic materials 0.000 description 3
- 210000004940 nucleus Anatomy 0.000 description 3
- 230000037361 pathway Effects 0.000 description 3
- 230000004962 physiological condition Effects 0.000 description 3
- 230000004481 post-translational protein modification Effects 0.000 description 3
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 3
- 239000013615 primer Substances 0.000 description 3
- 238000001742 protein purification Methods 0.000 description 3
- 230000010076 replication Effects 0.000 description 3
- 239000011734 sodium Substances 0.000 description 3
- 239000007787 solid Substances 0.000 description 3
- 238000000527 sonication Methods 0.000 description 3
- 238000003756 stirring Methods 0.000 description 3
- 239000000725 suspension Substances 0.000 description 3
- 230000011637 translesion synthesis Effects 0.000 description 3
- 230000001960 triggered effect Effects 0.000 description 3
- 238000001644 13C nuclear magnetic resonance spectroscopy Methods 0.000 description 2
- NDKDFTQNXLHCGO-UHFFFAOYSA-N 2-(9h-fluoren-9-ylmethoxycarbonylamino)acetic acid Chemical compound C1=CC=C2C(COC(=O)NCC(=O)O)C3=CC=CC=C3C2=C1 NDKDFTQNXLHCGO-UHFFFAOYSA-N 0.000 description 2
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 2
- HWBAHOVOSOAFLE-UHFFFAOYSA-N 2-[[2-[(2-methylpropan-2-yl)oxycarbonylamino]acetyl]amino]acetic acid Chemical compound CC(C)(C)OC(=O)NCC(=O)NCC(O)=O HWBAHOVOSOAFLE-UHFFFAOYSA-N 0.000 description 2
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 2
- 102000014914 Carrier Proteins Human genes 0.000 description 2
- 108010035563 Chloramphenicol O-acetyltransferase Proteins 0.000 description 2
- 101000749287 Clitocybe nebularis Clitocypin Proteins 0.000 description 2
- 101000767029 Clitocybe nebularis Clitocypin-1 Proteins 0.000 description 2
- 229940094664 Cysteine protease inhibitor Drugs 0.000 description 2
- 230000004543 DNA replication Effects 0.000 description 2
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 2
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 2
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 2
- XEKOWRVHYACXOJ-UHFFFAOYSA-N Ethyl acetate Chemical compound CCOC(C)=O XEKOWRVHYACXOJ-UHFFFAOYSA-N 0.000 description 2
- 108010070675 Glutathione transferase Proteins 0.000 description 2
- 102000005720 Glutathione transferase Human genes 0.000 description 2
- 239000007995 HEPES buffer Substances 0.000 description 2
- 238000012404 In vitro experiment Methods 0.000 description 2
- LRQKBLKVPFOOQJ-YFKPBYRVSA-N L-norleucine Chemical group CCCC[C@H]([NH3+])C([O-])=O LRQKBLKVPFOOQJ-YFKPBYRVSA-N 0.000 description 2
- XOGTZOOQQBDUSI-UHFFFAOYSA-M Mesna Chemical compound [Na+].[O-]S(=O)(=O)CCS XOGTZOOQQBDUSI-UHFFFAOYSA-M 0.000 description 2
- 125000001429 N-terminal alpha-amino-acid group Chemical group 0.000 description 2
- DFPAKSUCGFBDDF-UHFFFAOYSA-N Nicotinamide Chemical compound NC(=O)C1=CC=CN=C1 DFPAKSUCGFBDDF-UHFFFAOYSA-N 0.000 description 2
- 229930182555 Penicillin Natural products 0.000 description 2
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical group N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 2
- 108091005616 SUMOylated proteins Proteins 0.000 description 2
- PXIPVTKHYLBLMZ-UHFFFAOYSA-N Sodium azide Chemical compound [Na+].[N-]=[N+]=[N-] PXIPVTKHYLBLMZ-UHFFFAOYSA-N 0.000 description 2
- 108090000251 Sortase B Proteins 0.000 description 2
- 238000003800 Staudinger reaction Methods 0.000 description 2
- 239000006180 TBST buffer Substances 0.000 description 2
- PZBFGYYEXUXCOF-UHFFFAOYSA-N TCEP Chemical compound OC(=O)CCP(CCC(O)=O)CCC(O)=O PZBFGYYEXUXCOF-UHFFFAOYSA-N 0.000 description 2
- 108010076818 TEV protease Proteins 0.000 description 2
- 101150063416 add gene Proteins 0.000 description 2
- 150000001412 amines Chemical class 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 239000002543 antimycotic Substances 0.000 description 2
- 239000012062 aqueous buffer Substances 0.000 description 2
- PYMYPHUHKUWMLA-WDCZJNDASA-N arabinose Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)C=O PYMYPHUHKUWMLA-WDCZJNDASA-N 0.000 description 2
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 2
- 210000004436 artificial bacterial chromosome Anatomy 0.000 description 2
- 210000001106 artificial yeast chromosome Anatomy 0.000 description 2
- 108091008324 binding proteins Proteins 0.000 description 2
- 239000012267 brine Substances 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 2
- 230000005754 cellular signaling Effects 0.000 description 2
- 239000002738 chelating agent Substances 0.000 description 2
- 238000004587 chromatography analysis Methods 0.000 description 2
- 230000008045 co-localization Effects 0.000 description 2
- 239000002299 complementary DNA Substances 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- ZYGHJZDHTFUPRJ-UHFFFAOYSA-N coumarin Chemical compound C1=CC=C2OC(=O)C=CC2=C1 ZYGHJZDHTFUPRJ-UHFFFAOYSA-N 0.000 description 2
- 238000004132 cross linking Methods 0.000 description 2
- 230000009089 cytolysis Effects 0.000 description 2
- 210000000172 cytosol Anatomy 0.000 description 2
- 238000006477 desulfuration reaction Methods 0.000 description 2
- 230000023556 desulfurization Effects 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 238000009826 distribution Methods 0.000 description 2
- 238000009510 drug design Methods 0.000 description 2
- 238000010828 elution Methods 0.000 description 2
- 230000002255 enzymatic effect Effects 0.000 description 2
- 238000001704 evaporation Methods 0.000 description 2
- 230000008020 evaporation Effects 0.000 description 2
- 238000000799 fluorescence microscopy Methods 0.000 description 2
- 230000004927 fusion Effects 0.000 description 2
- 102000037865 fusion proteins Human genes 0.000 description 2
- 108020001507 fusion proteins Proteins 0.000 description 2
- 150000002333 glycines Chemical class 0.000 description 2
- 230000002209 hydrophobic effect Effects 0.000 description 2
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 2
- 229940099472 immunoglobulin a Drugs 0.000 description 2
- 239000012535 impurity Substances 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 108010076401 isopeptidase Proteins 0.000 description 2
- 210000003734 kidney Anatomy 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 229960004635 mesna Drugs 0.000 description 2
- 102000035118 modified proteins Human genes 0.000 description 2
- 108091005573 modified proteins Proteins 0.000 description 2
- 238000006386 neutralization reaction Methods 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 239000012044 organic layer Substances 0.000 description 2
- 230000002018 overexpression Effects 0.000 description 2
- 229940049954 penicillin Drugs 0.000 description 2
- VLTRZXGMWDSKGL-UHFFFAOYSA-N perchloric acid Chemical compound OCl(=O)(=O)=O VLTRZXGMWDSKGL-UHFFFAOYSA-N 0.000 description 2
- 229920000729 poly(L-lysine) polymer Polymers 0.000 description 2
- 229920000642 polymer Polymers 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- 125000001500 prolyl group Chemical group [H]N1C([H])(C(=O)[*])C([H])([H])C([H])([H])C1([H])[H] 0.000 description 2
- 230000004224 protection Effects 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 102200159111 rs1035794099 Human genes 0.000 description 2
- 102220220587 rs1060503085 Human genes 0.000 description 2
- 150000003839 salts Chemical class 0.000 description 2
- 235000020183 skimmed milk Nutrition 0.000 description 2
- HPALAKNZSZLMCH-UHFFFAOYSA-M sodium;chloride;hydrate Chemical compound O.[Na+].[Cl-] HPALAKNZSZLMCH-UHFFFAOYSA-M 0.000 description 2
- 230000000087 stabilizing effect Effects 0.000 description 2
- 238000003860 storage Methods 0.000 description 2
- 229960005322 streptomycin Drugs 0.000 description 2
- DYHSDKLCOJIUFX-UHFFFAOYSA-N tert-butoxycarbonyl anhydride Chemical compound CC(C)(C)OC(=O)OC(=O)OC(C)(C)C DYHSDKLCOJIUFX-UHFFFAOYSA-N 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 125000000341 threoninyl group Chemical group [H]OC([H])(C([H])([H])[H])C([H])(N([H])[H])C(*)=O 0.000 description 2
- 230000005030 transcription termination Effects 0.000 description 2
- 230000002103 transcriptional effect Effects 0.000 description 2
- 102000019485 ubiquitin binding proteins Human genes 0.000 description 2
- 108091016210 ubiquitin binding proteins Proteins 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- UGNIYGNGCNXHTR-SFHVURJKSA-N (2s)-2-(9h-fluoren-9-ylmethoxycarbonylamino)-3-methylbutanoic acid Chemical compound C1=CC=C2C(COC(=O)N[C@@H](C(C)C)C(O)=O)C3=CC=CC=C3C2=C1 UGNIYGNGCNXHTR-SFHVURJKSA-N 0.000 description 1
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- UKAUYVFTDYCKQA-UHFFFAOYSA-N -2-Amino-4-hydroxybutanoic acid Natural products OC(=O)C(N)CCO UKAUYVFTDYCKQA-UHFFFAOYSA-N 0.000 description 1
- RYHBNJHYFVUHQT-UHFFFAOYSA-N 1,4-Dioxane Chemical compound C1COCCO1 RYHBNJHYFVUHQT-UHFFFAOYSA-N 0.000 description 1
- JFLSOKIMYBSASW-UHFFFAOYSA-N 1-chloro-2-[chloro(diphenyl)methyl]benzene Chemical compound ClC1=CC=CC=C1C(Cl)(C=1C=CC=CC=1)C1=CC=CC=C1 JFLSOKIMYBSASW-UHFFFAOYSA-N 0.000 description 1
- FPIRBHDGWMWJEP-UHFFFAOYSA-N 1-hydroxy-7-azabenzotriazole Chemical compound C1=CN=C2N(O)N=NC2=C1 FPIRBHDGWMWJEP-UHFFFAOYSA-N 0.000 description 1
- OXBLVCZKDOZZOJ-UHFFFAOYSA-N 2,3-Dihydrothiophene Chemical compound C1CC=CS1 OXBLVCZKDOZZOJ-UHFFFAOYSA-N 0.000 description 1
- 108010011170 Ala-Trp-Arg-His-Pro-Gln-Phe-Gly-Gly Proteins 0.000 description 1
- USFZMSVCRYTOJT-UHFFFAOYSA-N Ammonium acetate Chemical compound N.CC(O)=O USFZMSVCRYTOJT-UHFFFAOYSA-N 0.000 description 1
- 239000005695 Ammonium acetate Substances 0.000 description 1
- 108091093088 Amplicon Proteins 0.000 description 1
- 241000219195 Arabidopsis thaliana Species 0.000 description 1
- 108010016529 Bacillus amyloliquefaciens ribonuclease Proteins 0.000 description 1
- 101100257763 Bacillus anthracis srtA gene Proteins 0.000 description 1
- 101100244725 Caenorhabditis elegans pef-1 gene Proteins 0.000 description 1
- KXDHJXZQYSOELW-UHFFFAOYSA-M Carbamate Chemical compound NC([O-])=O KXDHJXZQYSOELW-UHFFFAOYSA-M 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- 108010005843 Cysteine Proteases Proteins 0.000 description 1
- 102000005927 Cysteine Proteases Human genes 0.000 description 1
- 102000007528 DNA Polymerase III Human genes 0.000 description 1
- 108010071146 DNA Polymerase III Proteins 0.000 description 1
- 230000005778 DNA damage Effects 0.000 description 1
- 231100000277 DNA damage Toxicity 0.000 description 1
- 230000005971 DNA damage repair Effects 0.000 description 1
- 239000003155 DNA primer Substances 0.000 description 1
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 241000192125 Firmicutes Species 0.000 description 1
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 1
- 101710154606 Hemagglutinin Proteins 0.000 description 1
- 101000996935 Homo sapiens Putative oxidoreductase GLYR1 Proteins 0.000 description 1
- 101000607909 Homo sapiens Ubiquitin carboxyl-terminal hydrolase 1 Proteins 0.000 description 1
- PMMYEEVYMWASQN-DMTCNVIQSA-N Hydroxyproline Chemical compound O[C@H]1CN[C@H](C(O)=O)C1 PMMYEEVYMWASQN-DMTCNVIQSA-N 0.000 description 1
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 1
- 229910021578 Iron(III) chloride Inorganic materials 0.000 description 1
- UKAUYVFTDYCKQA-VKHMYHEASA-N L-homoserine Chemical group OC(=O)[C@@H](N)CCO UKAUYVFTDYCKQA-VKHMYHEASA-N 0.000 description 1
- QEFRNWWLZKMPFJ-ZXPFJRLXSA-N L-methionine (R)-S-oxide Chemical group C[S@@](=O)CC[C@H]([NH3+])C([O-])=O QEFRNWWLZKMPFJ-ZXPFJRLXSA-N 0.000 description 1
- QEFRNWWLZKMPFJ-UHFFFAOYSA-N L-methionine sulphoxide Chemical group CS(=O)CCC(N)C(O)=O QEFRNWWLZKMPFJ-UHFFFAOYSA-N 0.000 description 1
- ZFOMKMMPBOQKMC-KXUCPTDWSA-N L-pyrrolysine Chemical compound C[C@@H]1CC=N[C@H]1C(=O)NCCCC[C@H]([NH3+])C([O-])=O ZFOMKMMPBOQKMC-KXUCPTDWSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 1
- 102000019298 Lipocalin Human genes 0.000 description 1
- 108050006654 Lipocalin Proteins 0.000 description 1
- 239000012097 Lipofectamine 2000 Substances 0.000 description 1
- 239000006142 Luria-Bertani Agar Substances 0.000 description 1
- 101710175625 Maltose/maltodextrin-binding periplasmic protein Proteins 0.000 description 1
- 102000003939 Membrane transport proteins Human genes 0.000 description 1
- 108090000301 Membrane transport proteins Proteins 0.000 description 1
- 102000016943 Muramidase Human genes 0.000 description 1
- 108010014251 Muramidase Proteins 0.000 description 1
- 102100038895 Myc proto-oncogene protein Human genes 0.000 description 1
- 101710135898 Myc proto-oncogene protein Proteins 0.000 description 1
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- 239000000020 Nitrocellulose Substances 0.000 description 1
- 101710093908 Outer capsid protein VP4 Proteins 0.000 description 1
- 101710135467 Outer capsid protein sigma-1 Proteins 0.000 description 1
- 101710176177 Protein A56 Proteins 0.000 description 1
- 102000001708 Protein Isoforms Human genes 0.000 description 1
- 108010029485 Protein Isoforms Proteins 0.000 description 1
- 102100034301 Putative oxidoreductase GLYR1 Human genes 0.000 description 1
- 238000012181 QIAquick gel extraction kit Methods 0.000 description 1
- 239000012083 RIPA buffer Substances 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- 102000051619 SUMO-1 Human genes 0.000 description 1
- 108700038981 SUMO-1 Proteins 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 1
- BQCADISMDOOEFD-UHFFFAOYSA-N Silver Chemical compound [Ag] BQCADISMDOOEFD-UHFFFAOYSA-N 0.000 description 1
- 241000191940 Staphylococcus Species 0.000 description 1
- 241000193998 Streptococcus pneumoniae Species 0.000 description 1
- 101100257768 Streptococcus pyogenes srtA gene Proteins 0.000 description 1
- 108090000787 Subtilisin Proteins 0.000 description 1
- 101710150448 Transcriptional regulator Myc Proteins 0.000 description 1
- HEDRZPFGACZZDS-MICDWDOJSA-N Trichloro(2H)methane Chemical compound [2H]C(Cl)(Cl)Cl HEDRZPFGACZZDS-MICDWDOJSA-N 0.000 description 1
- 229920004890 Triton X-100 Polymers 0.000 description 1
- 239000013504 Triton X-100 Substances 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- 102100039865 Ubiquitin carboxyl-terminal hydrolase 1 Human genes 0.000 description 1
- 102100029643 Ubiquitin carboxyl-terminal hydrolase 2 Human genes 0.000 description 1
- 101710167633 Ubiquitin carboxyl-terminal hydrolase 2 Proteins 0.000 description 1
- 102000006275 Ubiquitin-Protein Ligases Human genes 0.000 description 1
- 108010083111 Ubiquitin-Protein Ligases Proteins 0.000 description 1
- 241000607626 Vibrio cholerae Species 0.000 description 1
- 101001055983 Vibrio cholerae serotype O1 (strain ATCC 39315 / El Tor Inaba N16961) Multifunctional-autoprocessing repeats-in-toxin Proteins 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 239000012190 activator Substances 0.000 description 1
- 125000000217 alkyl group Chemical group 0.000 description 1
- 230000000689 aminoacylating effect Effects 0.000 description 1
- 235000019257 ammonium acetate Nutrition 0.000 description 1
- 229940043376 ammonium acetate Drugs 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 102000036639 antigens Human genes 0.000 description 1
- 108091007433 antigens Proteins 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-L aspartate group Chemical group N[C@@H](CC(=O)[O-])C(=O)[O-] CKLJMWTZIZZHCS-REOHCLBHSA-L 0.000 description 1
- 238000010461 azide-alkyne cycloaddition reaction Methods 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000012742 biochemical analysis Methods 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- GXJABQQUPOEUTA-RDJZCZTQSA-N bortezomib Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)B(O)O)NC(=O)C=1N=CC=NC=1)C1=CC=CC=C1 GXJABQQUPOEUTA-RDJZCZTQSA-N 0.000 description 1
- 125000004744 butyloxycarbonyl group Chemical group 0.000 description 1
- 239000003715 calcium chelating agent Substances 0.000 description 1
- 201000011510 cancer Diseases 0.000 description 1
- 229910052799 carbon Inorganic materials 0.000 description 1
- 238000001460 carbon-13 nuclear magnetic resonance spectrum Methods 0.000 description 1
- 150000007942 carboxylates Chemical group 0.000 description 1
- 238000005277 cation exchange chromatography Methods 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 239000006143 cell culture medium Substances 0.000 description 1
- 230000006037 cell lysis Effects 0.000 description 1
- 210000002421 cell wall Anatomy 0.000 description 1
- 230000033077 cellular process Effects 0.000 description 1
- PBAYDYUZOSNJGU-UHFFFAOYSA-N chelidonic acid Natural products OC(=O)C1=CC(=O)C=C(C(O)=O)O1 PBAYDYUZOSNJGU-UHFFFAOYSA-N 0.000 description 1
- 238000010382 chemical cross-linking Methods 0.000 description 1
- 238000002512 chemotherapy Methods 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 238000010549 co-Evaporation Methods 0.000 description 1
- 230000004186 co-expression Effects 0.000 description 1
- 238000012761 co-transfection Methods 0.000 description 1
- 238000009833 condensation Methods 0.000 description 1
- 230000005494 condensation Effects 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 238000001816 cooling Methods 0.000 description 1
- 229960000956 coumarin Drugs 0.000 description 1
- 235000001671 coumarin Nutrition 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 1
- 230000006378 damage Effects 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 238000007865 diluting Methods 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 239000013024 dilution buffer Substances 0.000 description 1
- 239000000539 dimer Substances 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- PMMYEEVYMWASQN-UHFFFAOYSA-N dl-hydroxyproline Natural products OC1C[NH2+]C(C([O-])=O)C1 PMMYEEVYMWASQN-UHFFFAOYSA-N 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- 230000009977 dual effect Effects 0.000 description 1
- 230000004064 dysfunction Effects 0.000 description 1
- 230000012202 endocytosis Effects 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 235000019439 ethyl acetate Nutrition 0.000 description 1
- WBJINCZRORDGAQ-UHFFFAOYSA-N ethyl formate Chemical compound CCOC=O WBJINCZRORDGAQ-UHFFFAOYSA-N 0.000 description 1
- 230000005284 excitation Effects 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 230000006650 fundamental cellular process Effects 0.000 description 1
- 238000007429 general method Methods 0.000 description 1
- 125000000404 glutamine group Chemical group N[C@@H](CCC(N)=O)C(=O)* 0.000 description 1
- 229940068388 glycine 5 mg/ml Drugs 0.000 description 1
- 229940043257 glycylglycine Drugs 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 239000000185 hemagglutinin Substances 0.000 description 1
- 238000000589 high-performance liquid chromatography-mass spectrometry Methods 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 150000002431 hydrogen Chemical group 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- NPZTUJOABDZTLV-UHFFFAOYSA-N hydroxybenzotriazole Substances O=C1C=CC=C2NNN=C12 NPZTUJOABDZTLV-UHFFFAOYSA-N 0.000 description 1
- 229960002591 hydroxyproline Drugs 0.000 description 1
- 239000005457 ice water Substances 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 230000002458 infectious effect Effects 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- RBTARNINKXHZNM-UHFFFAOYSA-K iron trichloride Chemical compound Cl[Fe](Cl)Cl RBTARNINKXHZNM-UHFFFAOYSA-K 0.000 description 1
- 230000003902 lesion Effects 0.000 description 1
- 125000001909 leucine group Chemical group [H]N(*)C(C(*)=O)C([H])([H])C(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 239000004325 lysozyme Substances 0.000 description 1
- 235000010335 lysozyme Nutrition 0.000 description 1
- 229960000274 lysozyme Drugs 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 230000035800 maturation Effects 0.000 description 1
- 238000002844 melting Methods 0.000 description 1
- 230000008018 melting Effects 0.000 description 1
- 230000009061 membrane transport Effects 0.000 description 1
- 125000000956 methoxy group Chemical group [H]C([H])([H])O* 0.000 description 1
- LSDPWZHWYPCBBB-UHFFFAOYSA-O methylsulfide anion Chemical compound [SH2+]C LSDPWZHWYPCBBB-UHFFFAOYSA-O 0.000 description 1
- 230000003278 mimic effect Effects 0.000 description 1
- 108091005601 modified peptides Proteins 0.000 description 1
- 230000009456 molecular mechanism Effects 0.000 description 1
- 230000008450 motivation Effects 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 230000004770 neurodegeneration Effects 0.000 description 1
- 208000015122 neurodegenerative disease Diseases 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 229960003966 nicotinamide Drugs 0.000 description 1
- 235000005152 nicotinamide Nutrition 0.000 description 1
- 239000011570 nicotinamide Substances 0.000 description 1
- 229920001220 nitrocellulos Polymers 0.000 description 1
- 231100000065 noncytotoxic Toxicity 0.000 description 1
- 230000002020 noncytotoxic effect Effects 0.000 description 1
- 238000000655 nuclear magnetic resonance spectrum Methods 0.000 description 1
- 230000025308 nuclear transport Effects 0.000 description 1
- 230000030648 nucleus localization Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 150000002923 oximes Chemical class 0.000 description 1
- 230000035699 permeability Effects 0.000 description 1
- 150000003003 phosphines Chemical class 0.000 description 1
- 229910052698 phosphorus Inorganic materials 0.000 description 1
- BZQFBWGGLXLEPQ-REOHCLBHSA-N phosphoserine Chemical compound OC(=O)[C@@H](N)COP(O)(O)=O BZQFBWGGLXLEPQ-REOHCLBHSA-N 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 230000029279 positive regulation of transcription, DNA-dependent Effects 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 239000012474 protein marker Substances 0.000 description 1
- 230000017854 proteolysis Effects 0.000 description 1
- 230000002797 proteolythic effect Effects 0.000 description 1
- 238000010791 quenching Methods 0.000 description 1
- 230000000171 quenching effect Effects 0.000 description 1
- 239000011535 reaction buffer Substances 0.000 description 1
- 230000035484 reaction time Effects 0.000 description 1
- 230000007115 recruitment Effects 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 230000007363 regulatory process Effects 0.000 description 1
- 230000008439 repair process Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- 102200107458 rs121965013 Human genes 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 230000011664 signaling Effects 0.000 description 1
- 229910052709 silver Inorganic materials 0.000 description 1
- 239000004332 silver Substances 0.000 description 1
- 229910052708 sodium Inorganic materials 0.000 description 1
- XCTFOFPZPRZOFV-UHFFFAOYSA-M sodium;1-sulfanylethanesulfonate Chemical compound [Na+].CC(S)S([O-])(=O)=O XCTFOFPZPRZOFV-UHFFFAOYSA-M 0.000 description 1
- 238000010532 solid phase synthesis reaction Methods 0.000 description 1
- 238000012421 spiking Methods 0.000 description 1
- 239000007921 spray Substances 0.000 description 1
- 230000006641 stabilisation Effects 0.000 description 1
- 238000011105 stabilization Methods 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 239000012089 stop solution Substances 0.000 description 1
- 229940031000 streptococcus pneumoniae Drugs 0.000 description 1
- 238000012916 structural analysis Methods 0.000 description 1
- BDHFUVZGWQCTTF-UHFFFAOYSA-M sulfonate Chemical compound [O-]S(=O)=O BDHFUVZGWQCTTF-UHFFFAOYSA-M 0.000 description 1
- 229910052717 sulfur Inorganic materials 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 125000005931 tert-butyloxycarbonyl group Chemical group [H]C([H])([H])C(OC(*)=O)(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 150000003568 thioethers Chemical class 0.000 description 1
- 239000003053 toxin Substances 0.000 description 1
- 231100000765 toxin Toxicity 0.000 description 1
- FGMPLJWBKKVCDB-UHFFFAOYSA-N trans-L-hydroxy-proline Natural products ON1CCCC1C(O)=O FGMPLJWBKKVCDB-UHFFFAOYSA-N 0.000 description 1
- 239000012096 transfection reagent Substances 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 150000003852 triazoles Chemical class 0.000 description 1
- 229910052721 tungsten Inorganic materials 0.000 description 1
- 229910052720 vanadium Inorganic materials 0.000 description 1
- 229940118696 vibrio cholerae Drugs 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 229910052727 yttrium Inorganic materials 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/62—DNA sequences coding for fusion proteins
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K1/00—General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length
- C07K1/006—General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length of peptides containing derivatised side chain amino acids
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K1/00—General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length
- C07K1/107—General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length by chemical modification of precursor peptides
- C07K1/1072—General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length by chemical modification of precursor peptides by covalent attachment of residues or functional groups
- C07K1/1075—General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length by chemical modification of precursor peptides by covalent attachment of residues or functional groups by covalent attachment of amino acids or peptide residues
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1025—Acyltransferases (2.3)
- C12N9/104—Aminoacyltransferases (2.3.2)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/48—Hydrolases (3) acting on peptide bonds (3.4)
- C12N9/50—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25)
- C12N9/52—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25) derived from bacteria or Archaea
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P21/00—Preparation of peptides or proteins
- C12P21/02—Preparation of peptides or proteins having a known sequence of two or more amino acids, e.g. glutathione
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y203/00—Acyltransferases (2.3)
- C12Y203/02—Aminoacyltransferases (2.3.2)
- C12Y203/02012—Peptidyltransferase (2.3.2.12)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y304/00—Hydrolases acting on peptide bonds, i.e. peptidases (3.4)
- C12Y304/22—Cysteine endopeptidases (3.4.22)
- C12Y304/2207—Sortase A (3.4.22.70)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/90—Fusion polypeptide containing a motif for post-translational modification
- C07K2319/92—Fusion polypeptide containing a motif for post-translational modification containing an intein ("protein splicing")domain
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/95—Fusion polypeptide containing a motif/fusion for degradation (ubiquitin fusions, PEST sequence)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/93—Ligases (6)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y304/00—Hydrolases acting on peptide bonds, i.e. peptidases (3.4)
- C12Y304/21—Serine endopeptidases (3.4.21)
- C12Y304/21062—Subtilisin (3.4.21.62)
Definitions
- Ub ubiquitin
- PTMs posttranslational modifications
- Ubiquitylation in which the C-terminal carboxylate of Ub is attached to the e-amino group of a lysine in a substrate protein to form an isopeptide-bond, is naturally mediated by E1/E2/E3- enzymes.
- ubiquitins can be added either to additional lysine residues within the substrate protein or to an already attached ubiquitin via one of the seven lysines of ubiquitin itself.
- substrate proteins Thereby, diverse ubiquitin topologies are formed on substrate proteins (Kulathu et al., Nat Rev Mol Cell Biol 13, 508-23 (2012)).
- ubiquitylation is a reversible process and tightly regulated by a family of enzymes called deubiquitinases (DUBs) ( Komander et al., Nat Rev Mol Cell Biol 10, 550-63 (2009)).
- target proteins can also be covalently modified by ubiquitin-like-proteins (Ubls) that share the common b-grasp fold, including SUMO and NEDD8 (van der Veen et al., Annu Rev Biochem 81 , 323-57 (2012)).
- Ubiquitylation and modification of target proteins with Ubls play crucial roles in a variety of cellular processes, such as protein degradation, DNA repair, nuclear transport, endocytosis, and chromosomal organization.
- many different human diseases including different types of cancer and neurodegenerative diseases, are being linked to dysfunction of ubiquitylation pathways (Flotho et al., Annu Rev Biochem 82, 357-85 (2013)).
- the present inventors developed a new approach to site-specifically modify target proteins - both in vitro and in cellulo.
- This new approach can be employed to ubiquitylate and SUMOylate a target protein or to conjugate site-specifically any other ubiquitin-like protein, polypeptide or dye-polypeptide to the target protein.
- This new approach which the present inventors term sortylation, overcomes current limitations for generating site-specifically modified complex, non-refoldable protein targets, e.g. via ubiquitylation or SUMOylation, and for studying such ubiquitylation/SUMOylation events under physiological conditions in living cells.
- A, B and/or C means A, B, C, A+B, A+C, B+C and A+B+C.
- the term “less than”, “more than” or“larger than” includes the concrete number. For example, less than 20 means ⁇ 20 and more than 20 means >20.
- sequence identity is a property of sequences that measures their similarity or relationship.
- sequence identity or“identity” as used in the present disclosure means the percentage of pair- wise identical residues— following (homologous) alignment of a sequence of a polypeptide of the disclosure with a sequence in question— with respect to the number of residues in the longer of these two sequences. Sequence identity is measured by dividing the number of identical amino acid residues by the total number of residues and multiplying the product by 100.
- the term“homology” is used herein in its usual meaning and includes identical amino acids as well as amino acids which are regarded to be conservative substitutions (for example, exchange of a glutamate residue by an aspartate residue) at equivalent positions in the linear amino acid sequence of a polypeptide of the disclosure (e.g., any lipocalin muteins of the disclosure).
- the percentage of sequence homology or sequence identity can, for example, be determined herein using the program BLASTP, version blastp 2.2.5 (November 16, 2002) (cf. Altschul et al., Nucleic Acids Res, 1997).
- the percentage of homology is based on the alignment of the entire polypeptide sequence (matrix: BLOSUM 62; gap costs: 11.1 ; cut-off value set to 10 3 ) including the propeptide sequences, preferably using the wild-type protein scaffold as reference in a pairwise comparison. It is calculated as the percentage of numbers of “positives” (homologous amino acids) indicated as result in the BLASTP program output divided by the total number of amino acids selected by the program for the alignment.
- isolated means a substance in a form or environment that does not occur in nature.
- isolated substances include (1 ) any non-naturally occurring substance, (2) any substance including, but not limited to, any enzyme, variant, nucleic acid, protein, peptide or cofactor, that is at least partially removed from one or more or all of the naturally occurring constituents with which it is associated in nature; (3) any substance modified by the hand of man relative to that substance found in nature, e.g.
- cDNA made from mRNA or (4) any substance modified by increasing the amount of the substance relative to other components with which it is naturally associated (e.g., recombinant production in a host cell; multiple copies of a gene encoding the substance; and use of a stronger promoter than the promoter naturally associated with the gene encoding the substance).
- nucleotide sequence or“nucleic acid sequence” used herein refers to either DNA or RNA.
- Nucleic acid sequence or “polynucleotide sequence” or simply“polynucleotide” refers to a single or double-stranded polymer of deoxyribonucleotide or ribonucleotide bases read from the 5' to the 3' end. It includes both self-replicating plasmids, infectious polymers of DNA or RNA, and non-functional DNA or RNA.
- expressing refers to the synthesis of a gene product encoded by a polynucleotide.
- a polypeptide “expressing” means when a polynucleotide is transcribed to mRNA and the mRNA is translated to a polypeptide.
- the term “expressing” in context of a RNA means when a DNA is transcribed to RNA, e.g. a tRNA.
- amino acid or“natural amino acid” are used interchangeably herein and refer to naturally occurring and synthetic amino acids, as well as amino acid analogs and amino acid mimetics that function in a manner similar to the naturally occurring amino acids.
- Naturally occurring amino acids are those encoded by the genetic code, as well as those amino acids that are later modified, e.g., hydroxyproline, y-carboxyglutamate, and O-phosphoserine.
- Amino acid analogs refers to compounds that have the same basic chemical structure as a naturally occurring amino acid, i.e., a carbon that is bound to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserine, norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs have modified R groups (e.g., norleucine) or modified peptide backbones, but retain the same basic chemical structure as a naturally occurring amino acid. However, such an analog is not to be confused with an unnatural amino acid, which comprises one or more amino acid residues fused to the R group of an amino acid.
- Amino acid mimetics refers to chemical compounds that have a structure that is different from the general chemical structure of an amino acid, but that function in a manner similar to a naturally occurring amino acid.
- polypeptide and “protein” are interchangeably used.
- polypeptide refers to a protein or peptide that contains two or more amino acids, typically at least 3, preferably at least 20, more preferred at least 30, such as at least 50 amino acids. Accordingly, a polypeptide comprises an amino acid sequence, and, thus, sometimes a polypeptide comprising an amino acid sequence is referred to herein as a“polypeptide comprising a polypeptide sequence”. Thus, herein the term“polypeptide sequence” is interchangeably used with the term“amino acid sequence”.
- the term "vector" as used herein refers to a nucleic acid sequence into which an expression cassette comprising a gene of the present invention or gene encoding the protein of interest may be inserted or cloned. Furthermore, the vector may encode an antibiotic resistance gene conferring selection of the host cell. Preferably, the vector is an expression vector.
- the vector may be capable of autonomous replication in a host cell (e. g., vectors having an origin of replication which functions in the host cell).
- the vector may have a linear, circular, or supercoiled configuration and may be complexed with other vectors or other material for certain purposes.
- Vectors used herein for expressing an expression cassette comprising a gene of the present invention or gene encoding the protein of interest usually contain transcriptional control elements suitable to drive transcription such as e.g. promoters, enhancers, polyadenylation signals, transcription pausing or termination signals as elements of an expression cassette.
- transcriptional control elements suitable to drive transcription such as e.g. promoters, enhancers, polyadenylation signals, transcription pausing or termination signals as elements of an expression cassette.
- suitable translational control elements are preferably included in the vector, such as e.g. 5' untranslated regions leading to 5' cap structures suitable for recruiting ribosomes and stop codons to terminate the translation process.
- the nucleotide sequence serving as the selectable marker genes as well as the nucleotide sequence encoding the protein of interest can be transcribed under the control of transcription elements present in appropriate promoters.
- the resultant transcripts of the selectable marker genes and that of the protein of interest harbour functional translation elements that facilitate substantial levels of protein expression (i.e. translation) and proper translation termination.
- the vector may comprise a polylinker (multiple cloning site), i.e. a short segment of DNA that contains many restriction sites, a standard feature on many plasmids used for molecular cloning. Multiple cloning sites typically contain more than 5, 10, 15, 20, 25, or more than 25 restrictions sites. Restriction sites within an MCS are typically unique (i.e., they occur only once within that particular plasmid). MCSs are commonly used during procedures involving molecular cloning or subcloning.
- vector refers to a circular double stranded DNA loop into which additional DNA segments may be introduced via ligation or by means of restriction-free cloning.
- Other vectors include cosmids, bacterial artificial chromosomes (BAC), yeast artificial chromosomes (YAC) or mini-chromosomes.
- BAC bacterial artificial chromosome
- YAC yeast artificial chromosome
- mini-chromosomes Another type of vector is a viral vector, wherein additional DNA segments may be ligated into the viral genome.
- the invention further relates to a vector that can be integrated into the host cells genome and thereby replicates along with the host cells genome.
- the expression vector may comprise a predefined restriction site, which can be used for linearization of the vector nucleic acid prior to transfection. The skilled person knows how to integrate into the genome. For example, it is important how to place the linearization restriction site, because said restriction site determines where the vector nucleic acid is opened/linearized and thus determines the order/arrangement
- An antibiotic resistance gene in accordance with the invention, means a gene which provides the transformed cells with a selection advantage (e.g. resistance against an antibiotic) by expressing the corresponding gene product.
- the gene product confers a characteristic to the cell expressing the antibiotic resistance gene that allows it to be distinguished from cells that do not express the antibiotic resistance gene (i.e. selection of cells) if the antibiotic, to which the gene product confers resistance to, is applied to the cell culture medium. Resistance by the gene product to the cell may be conferred via different molecular mechanisms (e.g. inactivation of the drug, increased efflux).
- the expression cassette comprising a gene of the present invention or gene encoding the protein of interest is inserted into the expression vector as a DNA construct.
- This DNA construct can be recombinantly made from a synthetic DNA molecule, a genomic DNA molecule, a cDNA molecule or a combination thereof.
- the DNA construct is preferably made by ligating the different fragments to one another according to standard techniques known in the art.
- the expression cassette or vector according to the invention which is present in the host may either be integrated into the genome of the host or it may be maintained in some form extrachromosomally.
- the expression cassettes may comprise an appropriate transcription termination site. This, as continued transcription from an upstream promoter through a second transcription unit may inhibit the function of the downstream promoter, a phenomenon known as promoter occlusion or transcriptional interference. This event has been described in both prokaryotes and eukaryotes. The proper placement of transcriptional termination signals between two transcription units can prevent promoter occlusion. Transcription termination sites are well characterized and their incorporation in expression vectors has been shown to have multiple beneficial effects on gene expression.
- FIG. 1 Site-specific incorporation of AzGGK into proteins in E. coli.
- MS-analysis confirming the integrity of sfGFP-N150AzGGK and sfGFP-N150GGK.
- Srt2A-mediated ubiquitylation of GGK-bearing proteins a) Srt2A-mediated formation of diUbs analysed by SDS-PAGE gels. * denotes an impurity from Srt2A.
- b) Incubation of diUbs with USPD CD shows that Srt2A-generated K6-DiUb(AT) and K6-DiUb(LAT) are stable towards DUB cleavage, while natively linked K6-DiUb is quantitatively cleaved within an hour at 37 °C. left: SDS-PAGE analysis; right: MS-analysis of DUB-cleavage.
- PCNA (PDB: 1 axc) is a homotrimeric DNA-repair protein that is ubiquitylated at position K164.
- FIG. 4 Site-specific SUMOylation of GGK-bearing proteins, a) Ubiqutin (PDB: 1 ubq) and SUMO (PDB: 1y8r) show a similar globular b-grasp fold with an unstructured C-terminus.
- the wild type C-terminal sequence of SUM01 is QEQTGG.
- For recognition by Srt2A two point mutations are introduced: Q92L and E93A.
- SUMOylation shows specificity for sfGFP-GGK.
- c) SUMOylation in living E. coli using mSrt2A is specific for sfGFP-GGK as analysed by anti-His6 Western-Blots. Time points refer to times after 2DPBA addition. Srt2A and mSrt2A-mediated SUMOylation on purified proteins and in living E. coli were carried out as described in Supplementary Methods.
- SUMOylation in live HEK293T cells shows specificity for Ub(LAT) and SUMO(LAT) as analysed by anti-His6 Western-Blots.
- Overexpression of SUMO(wt) and Ub(wt) did not lead to sfGFP-Ub or sfGFP-SUMO conjugates.
- 2DPBA i.e.
- FIG. 6 General scheme showing sortase-mediated ubiquitylation.
- the unnatural amino acid AzGGK is site-specifically incorporated into proteins via genetic code expansion.
- Staudinger reduction converts AzGGK-bearing proteins to GGK-bearing proteins, which in turn undergo transpeptidation with a modified ubiquitin bearing a sortase recognition motif (LPLTG or LALTG) via SrtA.
- LLTG or LALTG sortase recognition motif
- Generated ubiquitylated proteins display a native isopeptide-bond and two point mutations (R72P or R72A and R74T) in the linker region.
- Figure 7 Sortase-mediated transpeptidation between GGK and a peptide resembling the Srt5M-compatible C-terminus. a) Structural formula of unnatural amino acid GGK. b) Structural formula of the sortase-mediated transpeptidation product that is formed between peptide Fmoc-VLPLTGG and GGK. c) LC-MS analysis of Srt5M-mediated transpeptidation between Fmoc- VLPLTGG and GGK shows close to quantitative formation of product within 30 minutes (red box), while incubation of Fmoc-VLPLTGG with AzGGK in the presence of Srt5M did not yield the transpeptidation product, as expected (blue box).
- Figure 8 Structural analysis of wild type PylRS. Crystal structure of /WmPylRS (PDB: 2Q7H). The PylRS C-lobe is shown in cartoon model in red, blue and green. Key amino acid positions in the active site are displayed in stick model in orange. The blue and green part (723 bp) represent the part subjected to DNA-shuffling, the green part (495 bp) was subjected to error-prone PCR. The PylRS variants used for the shuffling approach contained amino acid mutations at positions highlighted in orange.
- Figure 9 Site-specific incorporation of AzGGK into sfGFP in E. coli.
- Expression of sGFP-150TAG-His6 in the presence of AzGGKRS/tRNAcu A and AzGGK (4 mM) under auto- induction conditions lead to AzGGK-dependent synthesis of full-length sfGFP-His6, as confirmed by LS-MS.
- Apart form the mass peak corresponding to sfGFP-AzGGK a further small peak (green) was observed in the ESI-MS analysis, which could be assigned to misincorporation of phenylalanine. Omitting phenylalanine from the amino acid mix lead to clean expression of sfGFP- AzGGK.
- FIG. 10 Srt5M-mediated ubiquitylation of sfGFP-GGK.
- sfGFP-GGK (20 mM) was incubated at 37 °C with 100 pM Ub(PT) or Ub(LPT) in the presence of Srt5M (20 pM) and reactions were quenched at indicated time points with 4x Laemmli buffer (and heated to 95 °C) and analysed via SDS-PAGE. Incubation of sfGFP-BocK under the same conditions did not lead to ubiquitylated product.
- FIG 11 Srt5M-catalysed formation of diubiquitins.
- FIG. 12 Srt2A-mediated ubiquitylation of GGK-bearing proteins a) Incubation of sfGFP-GGK with Ub(AT) and Ub(LAT) in the presence of Srt2A leads to specific formation of sfGFP-Ub(AT) and sfGFP-Ub(LAT) conjugates, while sfGFP-BocK is unreactive under the same reaction conditions b) MS-analysis of sfGFP-Ub(AT) and sfGFP-Ub(LAT) conjugates c) While incubation of Ub-K6GGK with Ub(AT) leads to efficient formation of the corresponding K6-linked diUb in the presence of Srt2A, addition of Ub(wt) instead of Ub(AT) did not lead to K6-diUb formation, showing that Srt2A cannot recognize Ub(wt).
- FIG. 13 Optimizing conditions for Srt2A-mediated K6-diUb-formation.
- the sortagging motif LALTG is re-installed in the generated diubiquitin molecule, making sortylation in principle a reversible approach.
- the formed diubiquitins were stable over a period of 16 hours in the presence of high concentrations of Srt2A.
- FIG. 14 Preparative formation of Srt2A-generated diubiquitins.
- FIG. 15 Deubiquitylation assays. Incubation of Srt2A-generated diubiquitins and native diubiquitins with the promiscuous DUB USP2 CD showed that Srt2A-generated diubiquitins are stable against isopeptidase activity of the DUB, while all the native diubiquitins were cleaved within an hour at 37 °C. ** denotes bands corresponding to C-terminal His6-tag cleavage of the donor ubiquitin. Deubiquitylation assays were carried out as described in Supplementary Methods.
- FIG. 16 Ubiquitylation of PCNA-K164GGK.
- PCNA-K164GGK was purified from PCNA-K164AzGGK-CPD-His6 and treated with Ub(LAT) in the presence of Srt2A at 37 °C or 25 °C.
- PCNA-Ub(LAT) conjugate formation was observed within 30 minutes. * denotes thioester formed between Srt2A and Ub(LAT). The formation of PCNA-Ub(LAT) conjugate is specific for PCNA-bearing GGK. No ubiquitylated PCNA is detected when using PCNA-K164BocK.
- FIG. 17 Srt2A-mediated SUMOylation of GGK-bearing proteins, a) Time course for formation of SUMO-sfGFP conjugates.
- sfGFP-GGK (20 mM) was incubated in the presence of Srt2A (20 mM) with an excess of SUMO(AT) or SUMO(LAT) (100 mM) at 37 °C and product formation was analysed by SDS-PAGE at different time points.
- Srt2A 20 mM
- Figure 18 Comparison of different sortase A enzymes, a) Comparison of S. aureus wt SrtA (PDB: 2kid) and S. pyogenes wt SrtA (PDB: 3fn5).
- S. aureus SrtA is strongly Ca 2+ -dependent. Binding of Ca 2+ to glutamate residues in the b3/b4 loop, distal to the active site enhances substrate binding by stabilizing a closed conformation of the active site b6/b7 loop.
- SrtA is Ca 2+ - independent and the b3/b4-Ioor and b6/b7-Ioor are kept in a closed conformation through hydrogen-bonding between K126 and D196.
- the present inventors introduced K47 and Q50 mutations into the b3/b4-Ioor of Srt2A, generating mSrt2A.
- FIG. 19 Comparison between Srt2A and mSrt2A.
- sfGFP-AzGGK was co-expressed together with SUMO(LAT) and Srt2A and mSrt2A for 24 hours. After washing of cells to remove AzGGK, cells were treated with 2DPBA for indicated time points, washed again and analysed by anti-His6 Western-Blots. Consistent with results for in vitro experiments conducted in the absence of Ca 2+ (a), in vivo SUMOylation was much more effective with mSrt2A. Srt2A and mSrt2A-mediated ubiquitylation/SUMOylation on purified proteins and in living £. coli was carried out as described in Supplementary Methods.
- FIG. 20 In vivo ubiquitylation and SUMOylation of PCNA.
- a) PCNA-K164AzGGK- CPD-His6 was co-expressed together with mSrt2A and Ub(LAT) for 24 hours. After washing of cells to remove AzGGK, cells were treated with 2DPBA for indicated time points, washed again and analysed by anti-His6 Western-Blots.
- Expression of PCNA-K164BocK-CPD-His6 shows that in vivo ubiquitylation is dependent on GGK-bearing proteins b) same as (a) for SUMOylation.
- In vivo ubiquitylation/SUMOylation on PCNA-CPD-His6 in living £. coli was carried out as described in Supplementary Methods.
- Figure 21 Sortase-mediated ubiquitylation and SUMOylation in mammalian cells, a)
- sfGFP-N150AzGGK-His6 was expressed for 48 hours in HEK293T cells.
- Cells were washed with AzGGK-free medium, incubated with 0.5 mM DPBA overnight, washed again and lysed by consecutive freeze-thaw cycles.
- Lysates were treated with 20 mM SrtA variant (either Srt5M, Srt7M, Srt2A or mSrt2A) and 100 pM Ub(LPT) or Ub(LAT) for one hour at 37 °C and analysed by anti-His6 Western Blotting.
- Figure 22 Examples of unnatural amino acids. Structural formulas of discussed unnatural amino acids: AzGGK, AzGK, GGK and GK. Furthermore structural formulas of unnatural amino acids, where the N-terminal glycine moiety in GGK is protected with a photocaging group (either 2-nitrobenzyl or coumarin) are described.
- Figure 23 Generation of ubiquitin chains conjugated to a polypeptide of interest.
- a bifunctional ubiquitin bearing a Srt2A-compatible C-terminus and a protected GGK moiety (AzGGK or photocaged GGK) at a specific position is reacted with a protein of interest (POI) bearing GGK in the presence of Srt2A to give a site-specifi cally ubiquitylated POI.
- POI protein of interest
- the protected GGK amino acid in the Ub molecule is deprotected using light or a phosphine to yield GGK and is reacted with a Ubiquitin bearing a C-terminal motif compatible with Srt5M and reacted in the presence of Srt5M to the shown product.
- subtiligase Ubiquitin (1-74) bearing a protected GGK amino acid (AzGGK or photocaged GGK) is expressed as a thioester via intein technology and reacted with a POI bearing GGK at a specific position using subtiligase.
- the protecting group is removed and GGK can react with a second Ub(1-74) thioester in the presence of subtiligase.
- Figure 24 Transpeptidase reaction employing a subtiligase.
- a recombinant protein is expressed as a intein fusion and eluted from a chitin column as a thioester using a thiol, (e.g. MESNA, Sodium Mercaptoethansulfonate).
- a thiol e.g. MESNA, Sodium Mercaptoethansulfonate.
- the thioester can be ligated to the N-terminus of a peptide using Subtiligase.
- Subtiligase is known to interact with 4 residues (P 4 - P-i) N-terminal to and 2 residues (R-i' - P 2 ') C-terminal to the ligation site.
- Figure 25 Hydrolysis assays of differently linked Dillbs reveal orthogonal Sortases.
- FIG. 26 Orthogonal Sortases allow assembly of isopeptide linked triubiquitin.
- S2A forms an isopeptide bond between Ub(LAT) and UbK48GGK-LPT yielding Ub-isoK48(LAT)-Ub-LPT.
- S5M and UbK6GGK results in a TriUb linked via two isopeptide bonds at K48 and K6.
- Figure 27 Orthogonal Sortases allow assembly of Dillb-SUM02 hybrid chains.
- the present invention is based on the surprising finding that unnatural amino acids can serve as a platform for a transpeptidase conjugation when integrated in a polypeptide of interest. More specifically, the present inventors developed a method for site-specifi cally conjugating two polypeptides via a transpeptidation reaction.
- the first polypeptide is modified such that a certain amino acid is substituted with an unnatural amino acid, thereby defining the position in the first polypeptide where the second polypeptide is to be conjugated.
- the unnatural amino acid comprises a first amino acid and at least one further amino acid conjugated to the side chain of the first amino acid.
- the second polypeptide comprises a recognition motif for the transpeptidase.
- the transpeptidase when the first polypeptide, the second polypeptide and the transpeptidase are brought into close proximity, the transpeptidase will recognize the recognition motif in the second polypeptide and subsequently catalyse a transpeptidation reaction.
- the second polypeptide In this transpeptidation reaction the second polypeptide is conjugated via its recognition motif site-specifically to the unnatural amino acid of the first polypeptide.
- the transpeptidase recognizes the recognition motif in the second polypeptide - usually a sequence of five amino acids (e.g. Sortase A recognizes the amino acid sequence LPXTG) - and forms a covalent thioester bond to the recognition motif of the second polypeptide (Sortase A cleaves between Threonine and Glycine in the recognition motif and forms a covalent bond between a cysteine residue in the Sortase A catalytic site and the carboxy group of the Threonine residue of the recognition motif).
- Sortase A recognizes the amino acid sequence LPXTG
- Sertase A cleaves between Threonine and Glycine in the recognition motif and forms a covalent bond between a cysteine residue in the Sortase A catalytic site and the carboxy group of the Threonine residue of the recognition motif.
- a thioester intermediate product is generated in which the transpeptidase is covalently linked to the second polypeptide.
- this intermediate product encounters the first polypeptide comprising a free N- terminal amino acid residue (e.g. Sortase A requires an N-terminal Glycine residue), wherein the transpeptidase catalyses the formation of a covalent peptide bond between said free N-terminal amino acid residue of the first polypeptide and the recognition motif of the second polypeptide (the Threonine residue), thereby generating a conjugate comprising the first and the second polypeptide.
- the transpeptidase catalyses the formation of a covalent peptide bond between said free N-terminal amino acid residue of the first polypeptide and the recognition motif of the second polypeptide (the Threonine residue), thereby generating a conjugate comprising the first and the second polypeptide.
- the present inventors surprisingly discovered that the described transpeptidation reaction cannot only be conducted between the recognition motif of the second polypeptide and a free amino acid residue at the N-terminus of the first polypeptide but also between the recognition motif of the second polypeptide and an unnatural amino acid integrated in the first polypeptide, if said unnatural amino acid comprises a free amino acid (comprising a free N-terminus as required by the transpeptidase, i.e. a N-terminus amenable to the formation of a covalent peptide bond) fused to its side chain (e.g.
- a Glycine residue conjugated to the side chain of a Lysine residue integrated in the first polypeptide can be used in the second step of the reaction by Sortase A).
- Figure 6 shows such a transpeptidation reaction.
- an Ubiquitin polypeptide (second polypeptide) was modified such that it comprises a Sortase A recognition motif at its C-terminus and a polypeptide of interest (first polypeptide) was modified to comprise an unnatural amino acid comprising a glycylglycine moiety fused via an isopeptide bond to a Lysine residue integrated in the first polypeptide (termed GGK-bearing POI in Figure 6).
- Sortase A recognizes the“LPLTGG” motif of the Ubiquitin polypeptide (second polypeptide) and forms a intermediate product - comprising the Sortase A and the modified Ubiquitin - by forming a thioester between its catalytic cysteine and Threonine, cleaving thereby the recognition motif between the Threonine and the first Glycine.
- the Sortase A conjugates the Threonine to the N-terminal Glycine residue of the unnatural amino acid (GGK) integrated in the first polypeptide, thereby generating a conjugate comprising an Ubiquitin fused to a predetermined specific site in the polypeptide of interest.
- the present inventors developed a new method allowing to modify a first polypeptide by conjugating a second polypeptide to a predetermined specific site in the first polypeptide via a transpeptidation reaction.
- the method of the present inventors is applicable to proteins under native conditions, allowing the modification of large multi-domain and non-refoldable proteins; an endeavour that is challenging with present chemical methods.
- This is the first approach where a site-specifically introduced unnatural amino acid serves as a platform for a chemoenzymatic reaction, namely the transpeptidation reaction of a first polypeptide comprising an unnatural amino acid and a second polypeptide comprising a recognition motif for the transpeptidase.
- the present invention relates to a method for modifying a polypeptide, comprising:
- transpeptidase refers to an enzyme catalyzing a transpeptidation reaction, i.e. the transfer of an amino or peptide group from one molecule to another.
- a transpeptidase as used herein recognizes a recognition motif in a polypeptide and forms a covalent bond with an amino acid residue of the recognition motif of the polypeptide, thereby generating an intermediate product in which the transpeptidase is covalently linked to the polypeptide.
- transpeptidase as used herein forms a peptide-bond between a first polypeptide comprising a free N-terminal amino acid and a second polypeptide comprising a recognition motif for the transpeptidase.
- transpeptidases known in the art are Sortase A from Staphylococcus aureus or Sortase B from Staphylococcus aureus or Staphylococcus pneumoniae.
- recognition motif refers to a motif of amino acids recognized by the transpeptidase in the first step of the transpeptidation reaction as described herein.
- a person skilled in the art knows the recognition motif of a certain transpeptidase.
- a recognition motif comprises a sequence of a few amino acids, e.g. five amino acids.
- recognition motifs and transpeptidases that can be used to put the present invention into practice are“LPXTG” for Sortase A of Staphylococcus aureus, “NPQTN” for SortaseB of Staphylococcus aureus, “YPRTG” for Sortase B of Streptococcus pneumoniae, “LPXTA” or“LPXTG” for Sortase A of streptococcus pyogenes“LAXTG” for Sortase2A as disclosed herein or“LPXSG” for Sortase4S as disclosed herein.
- Further transpeptidases with other recognition motifs that can be used to put the present invention into practice are known in the art.
- the recognition motif for the transpeptidase can be located at the N-terminus, at the C-terminus or be incorporated in the amino acid sequence of a polypeptide, e.g. the second polypeptide according to the method of the present invention.
- the locus of the amino acid sequence comprising the recognition motif preferably forms a loop.
- the recognition motif is preferably located at the N-terminus or at the C-terminus of the polypeptide and even more preferably at the C-terminus of the polypeptide.
- the recognition motif can be conjugated to the N-terminus or the C-terminus or integrated in any polypeptide. The person skilled in the art knows how to.
- a Leucine introduced at the N-terminus of the recognition motif of the second polypeptide may act as a spacer making the recognition motif more accessible and thus increasing the conjugation rate in the transpeptidase reaction. Accordingly, in a preferred embodiment the second polypeptide and the recognition motif are separated by a spacer, preferably a Leucine.
- the term“providing” as used herein refers in its broadest sense to providing a polypeptide such that the polypeptide can fulfill its function in the transpeptidation reaction of the present invention. The person skilled in the art knows how to provide a polypeptide. Examples are solid phased peptide synthesis, in vitro transcription and translation or expressing the polypeptide in a host cell.
- the polypeptide is purified or isolated.
- a tag may be used to for purification of the polypeptide.
- a nucleotide sequence encoding a polypeptide may also encode a tag which is advantageously genetically fused in frame to the nucleotide sequence encoding said polypeptide. Said tag may be at the C-or N-terminus of said polypeptide.
- tags examples include, but are not limited to, HAT, FLAG, c-myc, hemagglutinin antigen, His (e.g., 6xHis) tags, flag-tag, strep-tag, strepl l-tag, TAP-tag, chitin binding domain (CBD), maltose-binding protein, immunoglobulin A (IgA), His-6-tag, glutathione-S-transferase (GST) tag, intein and streptavidine binding protein (SBP) tag.
- the first polypeptide comprises a tag, which can be used for detection or purifying or isolating the first polypeptide. More preferably, the first polypeptide comprises a tag at the C-terminus, which can be used for detection or purifying or isolating the first polypeptide.
- modified polypeptide refers to the conjugate of the first polypeptide and the second polypeptide obtained by the method of the present invention. More precisely, the“modified polypeptide” comprises the second polypeptide that is conjugated via its recognition motif to the unnatural amino acid, i.e. to the N-terminus of the at least one further amino acid, which is conjugated to the side chain of the first amino acid, incorporated site- specifically into the first polypeptide.
- the term “obtaining” as used herein means in its broadest sense incubating the components used in the method of the invention under conditions allowing the transpeptidase reaction such that the modified polypeptide is generated.
- a person skilled in the art knows how to provide conditions suitable for an enzymatic reaction such as a transpeptidase reaction and will adjust the important parameters (e.g. temperature, pH, salt concentration, etc.) accordingly. Examples of conditions allowing to put the present invention into practice are shown in the examples.
- the term“obtaining” means culturing the host cell under conditions allowing expression of the polypeptides and/or RNAs required for the transpeptidation reaction of the invention and allowing the transpeptidation reaction in the host cell.
- the term“unnatural amino acid” as used herein refers to a first amino acid (X), wherein at least one further amino acid (Z) has been conjugated to the side chain (also known in the art as“R” of an amino acid) of the first amino acid (X), such that the first amino acid and the one or more further amino acids form a linear amino acid side chain (e.g. X-Z or X-Z-Z).
- the first natural amino acid (X) is integrated in the amino acid sequence of the polypeptide, wherein the one or more further amino acids (Z) protrude from the linear amino acid sequence of the polypeptide as a“side chain” such that the N- terminus of the“side chain” can form an isopeptide-bond in the transpeptidation reaction.
- the unnatural amino acid comprises at least one amino acid (Z) conjugated to the first amino acid (X), wherein the at least one amino acid (Z) is required for the second step of the transpeptidation reaction, i.e. conjugating an amino acid of the recognition motif via a peptide-bond to the amino acid (Z).
- the terminal amino acid (Z) is conjugated to the amino acid of the recognition motif.
- the amino acids (Z) in the unnatural amino acid have to be selected in dependence of the transpeptidase that is used in the method of the invention in order to allow the second step of the transpeptidation reaction.
- the terminal amino acid is a Glycine (e.g. X-G, X-Z-G, or X-Z-Z-G etc.).
- An unnatural amino acid of the present invention serves as a motif required for the transpeptidase reaction, more precisely the second step of the transpeptidase reaction as described herein.
- An unnatural amino acid can be integrated at any position of the first polypeptide according to the method of the invention, i.e. at the N-terminus, at the C-terminus or integrated in the amino acid sequence of the first polypeptide. However, in a preferred embodiment, the unnatural amino acid is integrated in the amino acid sequence of the first polypeptide.
- the unnatural amino acid comprises two or more amino acid residues, wherein the first amino acid residue is integrated in the amino acid chain of the first polypeptide, wherein one or more further amino acid residues are attached to the side chain of the first amino acid residue, and wherein the one or more further amino acid residues react with the transpeptidase in the transpeptidation reaction.
- the first amino acid preferably comprises an amino group, a thiol, a hydroxyl group or a carboxyl group in its side chain and wherein the two or more further amino acids are linked to the amino group, thiol, hydroxyl group or carboxyl group in the side chain of the first amino acid.
- the first amino acid of the unnatural amino acid is Lysine, wherein the one or more amino acids are conjugated via an isopeptide-bond to the s- amino group of the Lysine.
- the unnatural amino acid is N 6 -glycylglycyl-L-lysines (GGK) or N 6 -glycyl-L-lysine (GK), which are shown in Figure 22.
- the unnatural amino acid is /V 6 -((2-azidoacetyl)glycyl)-L- lysine (AzGGK) or /V 6 -((2-azidoacetyl)-L-lysine (AzGK) as shown in Figure 22, wherein transpeptidation is induced by providing a phosphine, preferably 2-(diphenylphosphino)benzoic acid (2DPBA).
- a phosphine preferably 2-(diphenylphosphino)benzoic acid (2DPBA).
- the terminal Glycine residue of the unnatural amino acid - which is required for the second step of the transpeptidation reaction - is masked with an azido group preventing the second step of the transpeptidation reaction.
- the azido group of AzGGK or AzGK can be reduced quantitatively via Staudinger reduction with a phosphine to restore GGK.
- the phosphine is preferably water-soluble. In case the method of the present invention is performed in a host cell, the phosphine has to be cell-permeable and non-cytotoxic. Examples of phosphines are described in Luo et al. (Nat. Chem. 2016, 8, 1027-1034; doi: 10.1038/nchem.2573. Epub 2016 Jul 25), which is incorporated herein by reference.
- a particularly preferred phosphine is 2DPBA (2- (diphenylphosphino)benzoic acid).
- the second step of the transpeptidation reaction is induced by providing a suitable phosphine, allowing a temporal control of the reaction.
- the unnatural amino acid is GK or GGK, whose N- terminal glycine residue is protected with a photoremovable protecting group.
- photoremovable protecting group or photoprotective group are used interchangeably herein and related to a chemical moiety that is attached to the N-terminus of the amino acid (Z) fused to the side chain of the first amino acid (X) in the unnatural amino acid, wherein said chemical moiety prevents the transpeptidation reaction and can be removed with a light pulse, thereby inducing the transpeptidation reaction.
- the primary amino group of the N-terminal glycine is masked with a photoremovable protecting group that prevents the second step of the transpeptidation reaction.
- the photoremovable protecting group has to be removed with a light pulse to restore GK or GGK.
- the wavelength of the light pulse has to be adjusted according to the photoremovable group such that the photoremovable group is removed from GK or GGK.
- the photo protective group is 2-nitrobenzyl or coumarin, wherein the light pulse for removing the photo protective group has preferably a wavelength of about 365 nm. Further photo protective groups are known in the art and can be used to put the present invention into practice.
- the second step of the transpeptidation reaction is induced by providing a suitable light pulse, allowing a temporal and spatial control of the reaction.
- transpeptidase is Sortase A, preferably Sortase A (SrtA) from Staphylococcus aureus , which is exemplarily shown in SEQ ID NO: 1.
- a Sortase A that can be used in accordance with the present invention has an amino acid sequence which has an identity of at least 60%, of at least 65%, of at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the amino acid sequence shown in SEQ ID NO: 1. Further preferred are mutants of Sortase A capable of conjugating the second polypeptide to the first polypeptide.
- a preferred mutant of the invention is Srt2A having an amino acid sequence which has an identity of at least 60%, of at least 65%, of at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the amino acid sequence shown in SEQ ID NO: 2, wherein the position corresponding to position 36 of SEQ ID NO: 2 is Arginine, the position corresponding to position 44 of SEQ ID NO: 2 is Cysteine, the position corresponding to position 46 of SEQ ID NO: 2 is Histidine, the position corresponding to position 47 of SEQ ID NO: 2 is Aspartic acid, the position corresponding to position 80 of SEQ ID NO: 2 is Proline, the position corresponding to position 94 of SEQ ID NO: 2 is Isoleucine, the position corresponding to position 102 of SEQ ID NO: 2 is Lysine, the position corresponding to position 104 of SEQ ID NO: 2 is Histidine
- a further preferred mutant of the invention is Srt5M having an amino acid sequence which has an identity of at least 60%, of at least 65%, of at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the amino acid sequence shown in SEQ ID NO: 3, wherein the position corresponding to position 36 of SEQ ID NO: 3 is Arginine, the position corresponding to position 102 of SEQ ID NO: 3 is Asparagine, the position corresponding to position 107 of SEQ ID NO: 3 is Alanine, the position corresponding to position 132 of SEQ ID NO: 3 is Glutamic acid and the position corresponding to position 138 of SEQ ID NO: 3 is Threonine, i.e.
- a further preferred mutant of the invention is Srt7M having an amino acid sequence which has an identity of at least 60%, of at least 65%, of at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the amino acid sequence shown in SEQ ID NO: 4, wherein the position corresponding to position 36 of SEQ ID NO: 4 is Arginine, the position corresponding to position 47 of SEQ ID NO: 4 is Lysine, the position corresponding to position 50 of SEQ ID NO: 4 is Glutamine, the position corresponding to position 102 of SEQ ID NO: 4 is Asparagine, the position corresponding to position 107 of SEQ ID NO: 4 is Alanine, the position corresponding to position 132 of SEQ ID NO: 4 is Glutamic acid and the position corresponding to position 138 of SEQ
- a further preferred mutant of the invention is mSrt2A having an amino acid sequence which has an identity of at least 60%, of at least 65%, of at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the amino acid sequence shown in SEQ ID NO: 5, wherein the position corresponding to position 36 of SEQ ID NO: 5 is Arginine, the position corresponding to position 44 of SEQ ID NO: 5 is Cysteine, the position corresponding to position 46 of SEQ ID NO: 5 is Histidine, the position corresponding to position 47 of SEQ ID NO: 5 is Lysine, the position corresponding to position 50 of SEQ ID NO: 5 is Glutamine, the position corresponding to position 80 of SEQ ID NO: 5 is Proline, the position corresponding to position 94 of S
- a further preferred mutant of the invention is Srt4S having an amino acid sequence which has an identity of at least 60%, of at least 65%, of at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the amino acid sequence shown in SEQ ID NO: 6, wherein the position corresponding to position 36 of SEQ ID NO: 6 is Arginine, the position corresponding to position 40 of SEQ ID NO: 6 is Aspartic acid, the position corresponding to position 44 of SEQ ID NO: 6 is Cysteine, the position corresponding to position 46 of SEQ ID NO: 6 is Valine, the position corresponding to position 60 of SEQ ID NO: 6 is Threonine, the position corresponding to position 64 of S
- a derivative of mSrt2A can have an amino acid sequence which has an identity of at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the amino acid sequence shown in SEQ ID NO: 5, wherein the derivative comprises the described mutations of mSrt2A.
- Srt5M contains five mutations that confer 140-fold increased activity compared to wt SrtA.
- Srt7M is based on Srt5M but works Ca 2+ -independently.
- Srt2A recognizes the motif LAXTG compared to LPXTG of SrtA, Srt5M and Srt7M.
- mSrt2A is based on Srt2A but works Ca 2+ -independently.
- Srt4S recognizes the motif LPXSG.
- SEQ ID NO: 1-6 relate to the catalytic active sites of the sortases, lacking amino acids 1-58 of the wild type sortase A from Staphylococcus aureus.
- the Sortase A is the full length Sortase A from Staphylococcus aureus as shown in SEQ ID NO: 42 or a mutant thereof which has an identity of at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the amino acid sequence shown in SEQ ID NO: 42.
- a mutant of the full length Sortase A as shown in SEQ ID NO: 42 comprises in the catalytic active site the mutations as described above for SEQ ID NO: 2-6, i.e.
- SEQ ID NO: 42 is mutated as described above for SEQ ID NO: 2-6 at the corresponding positions in SEQ ID NO: 42.
- the corresponding positions in SEQ ID NO: 42 can for example be determined by adding to the above numbering of the positions “58”.
- a full length mutant of Sortase A corresponding to the Srt2A mutant shown in SEQ ID NO: 2 comprises the following mutations: position 94 of SEQ ID NO: 42 is Arginine, the position corresponding to position 102 of SEQ ID NO: 42 is Cysteine, the position corresponding to position 104 of SEQ ID NO: 42 is Histidine, the position corresponding to position 105 of SEQ ID NO: 42 is Aspartic acid, the position corresponding to position 138 of SEQ ID NO: 42 is Proline, the position corresponding to position 152 of SEQ ID NO: 42 is Isoleucine, the position corresponding to position 160 of SEQ ID NO: 42 is Lysine, the position corresponding to position 162 of SEQ ID NO: 42 is
- the second polypeptide is an ubiquitin (e.g. as shown in SEQ ID NO: 30) or ubiquitin-like-protein, wherein the C-terminus of the ubiquitin or ubiquitin-like- protein has been modified such that it comprises a recognition motif for the sortase A enzyme as described herein.
- the C-terminus of the ubiquitin or ubiquitin-like-protein has been modified such that it comprises a recognition motif for the sortase A enzyme but remains as identical as possible to the native C-terminus of the ubiquitin or ubiquitin-like-protein, i.e. only the required amino acid substitutions for generating the recognition motif are introduced.
- a wild type ubiquitin as shown in SEQ ID NO: 30 comprises the C-terminus“LRLRGG” which can be modified to comprise a recognition motif, by introducing two amino acid substitutions, to“LPLTGG” (containing the recognition motif of Srt5M) or“LALTGG” (containing the recognition motif of Srt2A).
- to“LPLTGG” containing the recognition motif of Srt5M
- LALTGG containing the recognition motif of Srt2A
- ubiquitin-like proteins can be used as a second polypeptide in a similar fashion by modifying the C-terminus to comprise a sortase A recognition motif, since they all display a highly conserved C-terminal glycylglycine or glycine motif.
- Ubiquitin-like proteins are known in the art and share the common b-grasp fold of ubiquitin, including e.g. SUMO (e.g. SUM01 and SUM02), NEDD8, URM1 or Ufm1 , described by van der Veen et al. ( Annu Rev Biochem 81 , 323-57 (2012)).
- preferred ubiquitin-like-proteins of the present invention comprise SUM01 , SUM02, NEDD8, URM1 , ATG8, ATG12, URM1 , FAT10 or ISG15 or Ufm1.
- a Lysine residue is preferably substituted in the first polypeptide with the unnatural amino acid.
- the second polypeptide and the recognition motif are separated by a spacer, preferably a Leucine.
- the unnatural amino acid can be integrated in the first polypeptide using any method known in the art, e.g. solid phase polypeptide synthesis.
- the method is performed in a host cell, wherein the first polypeptide is expressed in said host cell using genetic code expansion comprising:
- Genetic code expansion allows the site-specific incorporation of an unnatural amino acid at virtually any chosen position into any polypeptide and is thus useful for the generation of the first polypeptide of the invention. Importantly, the approach is applicable to proteins under native conditions, allowing in context of the present invention the modification (e.g. ubiquitylation) of large multi-domain and non-refoldable proteins.
- a person skilled in the art knows how to employ genetic code expansion, which is by way of example disclosed in Neumann et al (Nat Chem Biol 2008, 4, 232) and Lang et al (Nat Chem 2012, 4, 298).
- unnatural amino acids are genetically encoded in response to an amber codon (or any codon not coding for an amino acid) introduced into a gene of interest by employing engineered pyrrolysyl tRNA synthetase (PylRS)/tRNA C u A pairs from Methanosarcina species, including Methanosarcina barkeri (Mb) and Methanosarcina mazei ⁇ Mm), wherein the pyrrolysyl tRNA synthetase has been mutated such that an unnatural amino acid is accepted by the pyrrolysyl tRNA synthetase.
- PylRS engineered pyrrolysyl tRNA synthetase
- the orthogonal heterologous tRNA synthetase/tRNA pair is provided to the host cell by providing one or more polynucleotides comprising one or more expression cassettes encoding the heterologous tRNA synthetase and the heterologous tRNA, wherein the one or more polynucleotides can be provided as a vector or stably integrated in the host cell’s genome.
- the transpeptidase is provided to the host cell by providing one or more polynucleotides comprising an expression cassette encoding the transpeptidase, wherein the one or more polynucleotides can be provided as a vector or stably integrated in the host cell’s genome.
- the orthogonal heterologous tRNA synthetase/tRNA pair and the transpeptidase are provided to the host cell by providing one or more polynucleotides comprising an expression cassette encoding, preferably as a vector.
- a host cell which comprises the expression cassettes coding for the orthogonal heterologous tRNA synthetase/tRNA pair and the transpeptidase integrated into the genome.
- a host cell which comprises expression cassettes coding for the orthogonal heterologous tRNA synthetase/tRNA pair, the transpeptidase and the modified ubiquitin or ubiquitin-like protein integrated into the genome.
- the term“culturing” as used herein relates to culturing the host cell under conditions allowing expression of the polypeptides required to put the method of the present invention into practice.
- the term“culturing” as used herein relates to culturing the host cell under conditions allowing expression of the first polypeptide comprising one or more unnatural amino acids using genetic code expansion, comprising the addition of the unnatural amino acid to the growth medium of the host cell in an amount sufficient to express the first polypeptide comprising one or more unnatural amino acids.
- a“host cell” refers to a cell which is capable of protein expression and optionally protein secretion, wherein the host cell can be comprised by a living organism. Such host cell is applied in the methods of the present invention.
- Host cells provided by the present invention can be eukaryotic or prokaryotic host cells.
- a prokaryotic cell lacks a membrane-bound nucleus, while a eukaryotic cell has a membrane- bound nucleus.
- eukaryotic cells include, but are not limited to, vertebrate cells, mammalian cells, human cells, animal cells, invertebrate cells, plant cells, nematodal cells, insect cells, stem cells, fungal cells or yeast cells.
- A“heterologous tRNA” that can be used according to the present invention is orthogonal to the tRNA synthetase used in the method of the invention and recognizes a codon (i.e. comprises the respective anticodon) that is not recognized by an endogenous tRNA of the host cell, e.g. UAG, UGA or UAA.
- the heterologous tRNA recognizes the codon UAG, i.e. comprises the anticodon CUA.
- the orthogonal heterologous tRNA synthetase/tRNA pair is a pyrrolysyl tRNA synthetase and a tRNA from a Methanosarcina species, preferably from Methanosarcina barkeri or Methanosarcina mazei.
- the orthogonal heterologous tRNA synthetase/tRNA pair is a pyrrolysyl tRNA synthetase having an amino acid sequence which has an identity of at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% to the amino acid sequence shown in SEQ ID NO: 7 or a pyrrolysyl tRNA synthetase as shown in SEQ ID NO: 7 and tRNA from a Methanosarcina species having an amino acid sequence which has an identity of at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% to the amino acid sequence shown in SEQ ID NO: 8 or SEQ ID NO: 9 or a tRNA from a Methano
- tRNA from a Methanosarcina species as shown in SEQ ID NO: 8 or SEQ ID NO: 9, comprising the anticodon CUA, as shown for the Methanosarcina barkeri tRNAcu A i n SEQ ID NO: 14.
- the pyrrolysyl tRNA synthetase is a mutant pyrrolysyl tRNA synthetase capable of aminoacylating the orthogonal tRNA with the unnatural amino acids AzGGK, AzGK, GK or GGK.
- a mutant pyrrolysyl tRNA synthetase for AzGGK has an amino acid sequence which has an identity of at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% to the amino acid sequence shown in SEQ ID NO: 10, wherein the position corresponding to position 274 of SEQ ID NO: 10 is Alanine, the position corresponding to position 311 of SEQ ID NO: 10 is Glutamine and the position corresponding to position 313 of SEQ ID NO: 10 is Serine.
- a pyrrolysyl tRNA synthetase for AzGGK has an amino acid sequence as shown in SEQ ID NO: 10.
- a mutant pyrrolysyl tRNA synthetase for AzGK has an amino acid sequence which has an identity of at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% to the amino acid sequence shown in SEQ ID NO: 1 1 , wherein the position corresponding to position 271 of SEQ ID NO: 11 is Leucine, the position corresponding to position 274 of SEQ ID NO: 1 1 is Alanine and the position corresponding to position 313 of SEQ ID NO: 1 1 is Phenylalanine.
- a pyrrolysyl tRNA synthetase for AzGK has an amino acid sequence as shown in SEQ ID NO: 1 1.
- a mutant pyrrolysyl tRNA synthetase for GK has an amino acid sequence which has an identity of at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% to the amino acid sequence shown in SEQ ID NO: 12, wherein the position corresponding to position 266 of SEQ ID NO: 12 is Methionine, the position corresponding to position 270 of SEQ ID NO: 12 is Isoleucine, the position corresponding to position 271 of SEQ ID NO: 12 is Phenylalanine, the position corresponding to position 274 of SEQ ID NO: 12 is Alanine and the position corresponding to position 313 of SEQ ID NO: 12 is Phenylalanine.
- a pyrrolysyl tRNA synthetase for GK has an amino acid sequence as shown in SEQ ID NO: 12.
- a mutant pyrrolysyl tRNA synthetase for GGK has an amino acid sequence which has an identity of at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% to the amino acid sequence shown in SEQ ID NO: 13, wherein the position corresponding to position 266 of SEQ ID NO: 12 is Methionine, the position corresponding to position 270 of SEQ ID NO: 12 is Isoleucine, the position corresponding to position 271 of SEQ ID NO: 12 is Phenylalanine, the position corresponding to position 274 of SEQ ID NO: 12 is Alanine and the position corresponding to position 313 of SEQ ID NO: 12 is Phenylalanine.
- a pyrrolysyl tRNA synthetase for GK has an amino acid sequence as shown in SEQ ID NO: 12.
- a preferred combination of the present invention of a orthogonal heterologous tRNA synthetase/tRNA pair is the tRNAcu A as shown in SEQ ID NO: 14 with a mutant pyrrolysyl tRNA synthetase having an amino acid sequence selected from the group consisting of SEQ ID NO: 10, SEQ ID NO: 11 , SEQ ID NO: 12, and SEQ ID NO: 13
- the host cell is a vertebrate cells, mammalian cells, human cells, animal cells, invertebrate cells, plant cells, nematodal cells, insect cells, stem cells, fungal cells, yeast cells, a bacterial cell, or a multicellular organism comprising a host cell, wherein the multicellular organism is preferably a mammal, an insect or a nematode and more preferably Caenorhabditis elegans, Drosophila or mouse.
- the method is performed in vitro and wherein the modified polypeptide is obtained by admixing and incubating the provided components under conditions allowing transpeptidase mediated transpeptidation reaction.
- the required polypeptides can be obtained by solid phase peptide synthesis, in vitro transcription and translation or by expression in a host cell as described herein, wherein the expressed polypeptides are optionally purified.
- an in vitro method according to the present invention is disclosed herein in the examples.
- the second polypeptide comprises a recognition motif selected from the group consisting of SEQ ID NO: 15 (LPXTG), SEQ ID NO: 16 (LLPXTG), SEQ ID NO: 17 (LAXTG), SEQ ID NO: 18 (LLAXTG), SEQ ID NO: 19 (LPXSG), SEQ ID NO: 20 (LLPXSG), wherein preferably a further Glycine residue is attached to the C-terminus of the recognition motif.
- the recognition motifs are particularly preferred if the second polypeptide is an ubiquitin or an ubiquitin-like protein.
- the recognition motifs shown in SEQ ID NO: 15 and 16 work with SrtA, Srt5M and Srt7M.
- the recognition motifs shown in SEQ ID NO: 17 and 18 work with Srt2A and mSrt2A.
- the recognition motifs shown in SEQ ID NO: 18 and 19 work with Srt4S.
- X can be any amino acid.
- the second polypeptide is ubiquitin or an ubiquitin-like protein
- X is preferably the amino acid comprised by the native C-terminus at the corresponding position of ubiquitin or the ubiquitin-like protein.
- Ubiquitin X is Leucine and in case of SUM01 X is Glutamine.
- the native Ubiquitin C-terminus (LRLRGG; the native Ubiquitin is shown in SEQ ID NO: 30) is modified to LPLTGG (Ub(PT) as shown in SEQ ID NO: 31 ) in case of SrtA or a mutant thereof is used or LALTGG (Ub(AT) as shown in SEQ ID NO: 33) in case Srt2A or mSrt2A is used.
- LPLTGG Ub(PT) as shown in SEQ ID NO: 31
- LALTGG Ub(AT) as shown in SEQ ID NO: 33
- a Leucine can be incorporated at the N-terminus of the recognition motif resulting in LLPLTGG (Ub(LPT) as shown in SEQ ID NO: 32) or LLALTGG (Ub(LAT) as shown in SEQ ID NO: 33).
- SUM01 comprises the native C-terminus QEQTGG, which can be modified to LAQTGG for use with Srt2a or mSrt2A.
- a further Leucine spacer can be introduced resulting in the C-terminus LLAQTGG of a modified SUM01 protein.
- Ubiquitin- and SUMO-conjugates obtained by the method of the invention thus preferably display a native isopeptide-bond between their C-terminal glycine and a chosen lysine in the first polypeptide protein, wherein the C-terminus comprises two amino acid substitution and optionally one additional Leucine incorporated as a spacer.
- ubiquitylation is a reversible process and tightly regulated by a family of enzymes called deubiquitinases (DUBs), as disclosed by Komander et al. ( Nat Rev Mol Cell Biol 10, 550-63 (2009)). Modifying the natural C-terminus of ubiquitin from“LRLRGG” to“LALTGG” (corresponding to R72A und R74T mutations in SEQ ID NO: 30) confers resistance to cleavage by various DUB families.
- DUBs deubiquitinases
- the present invention relates to a method for modifying a polypeptide, comprising: (i) expressing in a host cell a first polypeptide wherein one or more Lysine residues have been substituted with AzGGK, AzGK, GK, GGK, photoprotected GK, or photoprotected GGK and removing the azido group of AzGGK or AzGK by providing a phosphine, preferably 2- (diphenylphosphino)benzoic acid (2DPBA) or removing the photoprotective group by providing a light pulse, when required;
- a phosphine preferably 2- (diphenylphosphino)benzoic acid (2DPBA)
- the present invention relates to a method for modifying a polypeptide, comprising:
- expressing in a host cell a first polypeptide wherein one or more Lysine residues have been substituted with AzGGK, AzGK, GK, GGK, photoprotected GK, or photoprotected GGK and removing the azido group of AzGGK or AzGK by providing a phosphine, preferably 2- (diphenylphosphino)benzoic acid (2DPBA) or removing the photoprotective group by providing a light pulse, when required, wherein expressing the first polypeptide comprises: a) expressing in the host cell an heterologous tRNA synthetase for AzGGK, AzGK, GK, GGK, photoprotected GK, or photoprotected GGK disclosed herein and an orthogonal heterologous tRNA that recognizes a codon that is not recognized by an endogenous tRNA; and
- the present invention relates to an in vitro method for modifying a polypeptide of interest, comprising: (i) providing a first polypeptide in which one or more Lysine residues have been substituted with the AzGGK, AzGK, GK, GGK, photoprotected GK, or photoprotected GGK and removing the azido group of AzGGK or AzGK by providing a phosphine, preferably 2- (diphenylphosphino)benzoic acid (2DPBA) or removing the photoprotective group by providing a light pulse, when required;
- a phosphine preferably 2- (diphenylphosphino)benzoic acid (2DPBA)
- the method of the present invention can be put into practice by providing a subtiligase as a transpeptidase.
- the first polypeptide comprises one or more unnatural amino acids selected from the group consisting of AzGGK, AzGK, GK, GGK, photoprotected GK, and photoprotected GGK and the second polypeptide is an ubiquitin or ubiquitin-like-protein comprising a thioester at the C-terminus, wherein the subtiligase has preferably an amino acid sequence which has an identity of at least 70% to the amino acid sequence shown in SEQ ID NO: 41.
- the present invention relates to a method for modifying a polypeptide, comprising:
- a first polypeptide comprising one or more unnatural amino acids selected from the group consisting of AzGGK, AzGK, GK, GGK, photoprotected GK, and photoprotected GGK and removing the azido group of AzGGK or AzGK by providing a phosphine, preferably 2-(diphenylphosphino)benzoic acid (2DPBA) or removing the photoprotective group by providing a light pulse, when required;
- a phosphine preferably 2-(diphenylphosphino)benzoic acid (2DPBA)
- the term“subtiligase” as used herein relates to an engineered peptide ligase derived from the protease subtilisin that catalyzes the ligation of a polypeptide containing a donor C-terminal thioester (in this case ubiquitin or ubiquitin-like protein) to an acceptor peptide containing an o amine (in this case a first polypeptide comprising AzGGK, AzGK, GK, GGK, photoprotected GK, and photoprotected GGK).
- a donor C-terminal thioester in this case ubiquitin or ubiquitin-like protein
- acceptor peptide containing an o amine in this case a first polypeptide comprising AzGGK, AzGK, GK, GGK, photoprotected GK, and photoprotected GGK.
- subtiligase has been used for the efficient ligation of cysteine- free peptides to protein thioesters and was dubbed enzyme-catalyzed EPL.
- a subtiligase that can be used in the method of the present invention is exemplarily shown in SEQ ID NO: 41.
- a subtiligase that can be used in accordance with the present invention has an amino acid sequence which has an identity of at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the amino acid sequence shown in SEQ ID NO: 41.
- An ubiquitin or ubiquitin-like protein comprising a thioester can for example be generated using intein technology. Thereby ubiquitin or ubiquitin-like protein (missing the two C-terminal Glycine residues) is expressed as an intein fusion with a CBD (chitin binding domain) C-terminal tag. After expression, the cell homogenate is passed through a column containing chitin. This allows the CBD of the chimeric protein to bind to the column. Elution with a thiol, e.g.
- MESNA sodium 2-mercaptoehane sulfonate
- the present inventors performed a subtiligase mediated reaction using as the second polypeptide a wild type Ubiquitin, which lacks the C- terminal two Glycine residues, i.e. amino acids 1-74 of SEQ ID NO: 30, wherein a thioester has been conjugated to the C-terminal amino acid R (amino acid 74 of SEQ ID NO: 30).
- P1 and P4 are R (amino acid 74 and 72 of SEQ ID NO: 30, respectively).
- the present inventors used GGK (the oamine of lysine was protected with a tert.
- GGK butyloxycarbonyl group
- a subtiligase mediated transpeptidation reaction can be employed to conjugate Ubiquitin comprising a thioester at the C-terminus to a first polypeptide comprising one or more unnatural amino acids selected from the group consisting of GGK, AzGGK and photoprotected GGK.
- P4 e.g. L, A, E
- P1 e.g. Y, L, I, V, F, A, W, H, G
- subtiligase mediated transpeptidation reaction can be transferred to other ubiquitins (e.g. amino acid 1-75 of SEQ ID NO: 30) or ubiquitin-like proteins or first polypeptides comprising other unnatural amino acids (e.g. GK).
- ubiquitins e.g. amino acid 1-75 of SEQ ID NO: 30
- ubiquitin-like proteins or first polypeptides comprising other unnatural amino acids e.g. GK
- Ubiquitin and many ubiquitin-like proteins have a conserved C-terminus comprising two or one terminal Glycine residues.
- the method of the present invention can be put into practice using a subtiligase and ubiquitin or ubiquitin-like protein as a second polypeptide in the transpeptidation reaction.
- the first polypeptide comprises AzGGK, GGK or photoprotected GGK
- the ubiquitin or ubiquitin-like protein preferably lacks the two C-terminal Glycines, i.e. the ubiquitin preferably comprises amino acids 1-74 or the amino acid sequence shown in SEQ ID NO: 30.
- the ubiquitin or ubiquitin-like protein preferably lacks the C-terminal Glycine, i.e. the ubiquitin preferably comprises amino acids 1-75 or the amino acid sequence shown in SEQ ID NO: 30. This will generate a native conjugate comprising a wild type ubiquitin or ubiquitin-like protein and a first polypeptide without any mutations.
- the second polypeptide is an ubiquitin or ubiquitin-like-protein comprising a recognition motif for a first Sortase A enzyme in the C-terminus and one or more Lysine residues in the ubiquitin or ubiquitin-like- protein have been substituted with AzGGK, AzGK, photoprotected GK, or photoprotected GGK, the method further comprising
- an ubiquitin or ubiquitin-like protein comprising a C-terminus comprising a recognition motif for a second Sortase A enzyme that is different to the recognition motif of the first Sortase A enzyme, wherein one or more Lysine residues in the ubiquitin or ubiquitin-like-protein have been substituted with AzGGK, AzGK, photoprotected GK, or photoprotected GGK, and said second Sortase A;
- step (vii) optionally repeating step (v) and (vi), wherein the ubiquitin or ubiquitin-like protein comprises a C-terminus comprising a recognition motif for a sortase A enzyme that is different to the recognition motifs of the ubiquitin or ubiquitin-like protein of the preceding steps; and
- the present invention relates to a method for modifying a polypeptide, comprising:
- an ubiquitin or ubiquitin-like protein comprising a C-terminus comprising a recognition motif for a second Sortase A enzyme that is different to the recognition motif of the first Sortase A enzyme, wherein one or more Lysine residues in the ubiquitin or ubiquitin-like-protein have been substituted with AzGGK, AzGK, photoprotected GK, or photoprotected GGK, and said second Sortase A;
- step (vii) optionally repeating step (v) and (vi), wherein the ubiquitin or ubiquitin-like protein comprises a C-terminus comprising a recognition motif for a Sortase A enzyme that is different to the recognition motifs for the Sortase A enzymes of the ubiquitin or ubiquitin-like protein of the preceding steps; and
- the first and the second Sortase A recognized different recognition motifs, i.e. they are orthogonal.
- the first Sortase A is selected from Srt2A, Srt5M and Srt4S and the second Sortase A is selected from the remaining two Sortase A.
- Such a transpeptidation reaction employing two orthogonal Sortase A variants is exemplarily shown in Figure 23 A and Figure 26 A.
- two ubiquitins were coupled to a first ubiquitin (serving as first polypeptide within the context of this embodiment) leading to a triple ubiquitin (see Figure 26 B,C).
- a further (i.e.) i.e.
- the third Sortase A is the remaining Sortase A that has not been selected as the first or second Sortase A.
- the last ubiquitin or ubiquitin-like protein comprises no unnatural amino acids.
- one or more Lysine residues in the ubiquitin or ubiquitin-like-protein have been substituted with AzGGK, AzGK, photoprotected GK, or photoprotected GGK, the method further comprising
- an ubiquitin or ubiquitin-like protein comprising a thioester at the C-terminus, wherein one or more Lysine residues in the ubiquitin or ubiquitin-like-protein have been substituted with AzGGK, AzGK, photoprotected GK, or photoprotected GGK, and a subtiligase;
- step (vii) optionally repeating step (v) and (vi);
- the present invention relates to a method for modifying a polypeptide, comprising:
- an ubiquitin or ubiquitin-like protein comprising a thioester at the C-terminus, wherein one or more Lysine residues in the ubiquitin or ubiquitin-like-protein have been substituted with AzGGK, AzGK, photoprotected GK, or photoprotected GGK, and
- step (vii) optionally repeating step (v) and (vi);
- the present invention combines the sortase A based transpeptidation reaction as disclosed herein with the subtiligase based transpeptidation reaction as disclosed herein in order to generate a modified polypeptide comprising a chain of two or more ubiquitins or ubiquitin-like-proteins.
- the first ubiquitin is conjugated to the first polypeptide employing the sortase A based transpeptidation reaction and the second ubiquitin is conjugated to the first ubiquitin employing the subtiligase based transpeptidation reaction and vice versa.
- Further ubiquitins can be conjugated to the second ubiquitin employing the transpeptidation reaction that has been employed to conjugate the second ubiquitin to the first ubiquitin.
- the present invention relates to a polypeptide obtainable by the method of the present invention. Even more preferred is a polypeptide obtained by the method of the present invention.
- the present invention relates to a multidomain or non- refoldable polypeptide conjugated to one or more ubiquitins or ubiquitin-like-proteins, wherein the one or more ubiquitins or ubiquitin-like-proteins comprise a C-terminal amino acid sequence selected from the group consisting of SEQ ID NO: 15 (LPXTGG), SEQ ID NO: 16 (LLPXTGG), SEQ ID NO: 17 (LAXTGG), SEQ ID NO: 18 (LLAXTGG), SEQ ID NO: 19 (LPXSGG) and SEQ ID NO: 20 (LLPXSGG).
- the disclosed C-terminal amino acid sequences are the six or seven terminal amino acids in the ubiquitin or ubiquitin-like-protein, i.e. in case of wild type ubiquitin the disclosed C-terminal amino acid sequences substitute amino acids 71 to 76 of the wild type ubiquitin amino acid sequence as shown in SEQ ID NO: 30.
- the method of the present invention allows for the first time the site-specific attachment of ubiquitins or ubiquitin-like proteins to non-refoldable and/or multi-domain proteins.
- the present invention relates to a pyrrolysyl tRNA synthetase comprising an amino acid sequence having at least 70%, at least 75%, at least 80%, at least 85%, at least 90% , at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity to the amino acid sequence set forth in
- SEQ ID NO: 10 wherein amino acid residue 274 is Alanine, amino acid residue 311 is Glutamine and amino acid residue 313 is Serine;
- SEQ ID NO: 1 1 wherein amino acid residue 271 is Leucine, amino acid residue 274 is Alanine and amino acid residue 313 is Phenylalanine;
- the present invention further relates to a polynucleotide encoding the pyrrolysyl tRNA synthetase of the invention.
- the present invention further relates to a vector comprising the polynucleotide encoding the pyrrolysyl tRNA synthetase of the invention.
- the present invention further relates to the use of the pyrrolysyl tRNA synthetase of the invention in the method of the invention or for genetic code expansion
- the present invention relates to a sortase A mutant comprising an amino acid sequence having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity to the amino acid sequence set forth in SEQ ID NO: 5, wherein amino acid residue 36 is Arginine, amino acid residue 44 is Cysteine, amino acid residue 46 is Histidine, amino acid residue 47 is Lysine, amino acid residue 50 is Glutamine, amino acid residue 80 is Proline, amino acid residue 94 is Isoleucine, amino acid residue 102 is Lysine, amino acid residue 104 is Histidine, amino acid residue 106 is Asparagine, amino acid residue 107 is Alanine, amino acid residue 109 is Glutamic acid, amino acid residue 115 is Glutamic acid, amino acid residue 124 is Valine, amino acid residue 132 is Glutamic acid
- the sortase A mutant of the invention is based on Srt2A (SEQ ID NO: 4), which further comprises the D47K and E50Q amino acid mutations.
- the mSrt2A of the invention allows working in in conditions with low Ca 2+ concentrations, which was difficult with the Ca 2+ -dependent Srt2A.
- the present invention further relates to a polynucleotide encoding the sortase A mutant of the invention.
- the present invention further relates to a vector comprising a polynucleotide encoding the sortase A mutant of the invention.
- the present invention further relates to the use of the sortase A mutant of the invention in the method of the invention or for catalyzing a transpeptidation reaction.
- the present invention relates to a kit comprising one or more polynucleotides encoding
- an ubiquitin or ubiquitin-like-protein wherein the C-terminus of the ubiquitin or ubiquitin-like- protein comprises a recognition motif for the Sortase A or a thioester as disclosed herein;
- the present invention relates to a host cell comprising:
- an ubiquitin or ubiquitin-like-protein wherein the C-terminus of the ubiquitin or ubiquitin-like- protein comprises a recognition motif for the sortase A or a thioester as disclosed herein;
- polypeptide comprising one or more unnatural amino acids selected from the group consisting of AzGGK, AzGK, photoprotected GK, photoprotected GGK, GGK and GK.
- the method of the present inventors allows attachment of ubiquitin or any ubiquitin- like protein to multi-domain proteins, as exemplified by preparation of mono-ubiquitylated proliferating cell nuclear antigen (PCNA), a key DNA replication/repair protein as disclosed in the examples.
- PCNA mono-ubiquitylated proliferating cell nuclear antigen
- the present inventors show that sortase-mediated transpeptidation enables the site- specific, inducible and E1/E2/E3-enzyme-independent ubiquitylation and SUMOylation of proteins in living mammalian cells, opening up many new opportunities for the targeted analysis of ubiquitylation and SUMOylation regulatory processes.
- the present inventors combine genetic code expansion, bioorthogonal Staudinger reduction and sortase-mediated transpeptidation to develop a novel and generally applicable tool to ubiquitylate or attach ubiquitin- like proteins to target proteins in an inducible fashion.
- the generated ubiquitin-/ubiquitin-like protein-conjugates display a native isopeptide-bond connecting the C-terminal glycine of ubiquitin with a chosen lysine in a target protein.
- Introduction of two point mutations in the ubiquitin C- terminus makes the conjugates resistant to isopeptide-cleavage by deubiquitinases.
- the method of the present inventors allows the site-specific attachment of ubiquitins and ubiquitin-like proteins to non-refoldable, multi-domain proteins and enables for the first time the site-specific, inducible and ubiquitin-ligase-independent ubiquitylation of proteins in mammalian cells, providing a powerful tool to dissect the biological functions of ubiquitylation with temporal control.
- the method of the present inventors can be transferred from ubiquitin to any other polypeptide without further ado as described herein.
- the invention is further characterized by the following items.
- a method for modifying a polypeptide comprising:
- the first amino acid is preferably a Lysine residue and wherein the two or more further amino acids are linked via an isopeptide-bond to the amino group in the side chain of the first amino acid.
- the transpeptidase is sortase A comprising an amino acid sequence which has an identity of at least 60% to the amino acid sequence shown in SEQ ID NO: 1 , preferably sortase A (SrtA) from Staphylococcus aureus or a mutant thereof capable of conjugating the second polypeptide to the first polypeptide, such as Srt2A having an amino acid sequence as shown in SEQ ID NO: 2, Srt5M having an amino acid sequence as shown in SEQ ID NO: 3, Srt7M having an amino acid sequence as shown in SEQ ID NO: 4, mSrt2A having an amino acid sequence as shown in SEQ ID NO: 5, or Srt4S having an amino acid sequence as shown in SEQ ID NO: 6.
- the unnatural amino acid is N 6 - glycylglycyl-L-lysine (GGK) or N 6 -glycyl-L-lysine (GK).
- the unnatural amino acid is /V 6 -(( 2- azidoacetyl)glycyl)-L-lysine (AzGGK) or /V 6 -((2-azidoacetyl)-L-lysine (AzGK) and wherein transpeptidation is induced by providing a phosphine, preferably 2- (diphenylphosphino)benzoic acid (2DPBA); or
- the unnatural amino acid is GK or GGK, whose N-terminal glycine residue is protected with a photoremovable protecting group, prefereably 2-nitrobenzyl or coumarin, and wherein deprotection to GK or GGK is induced with a light pulse.
- a photoremovable protecting group prefereably 2-nitrobenzyl or coumarin
- deprotection to GK or GGK is induced with a light pulse.
- the ubiquitin-like-protein is preferably SUM01 , SUM02, NEDD8, URM1 , Ufm1 , ATG8, ATG12, URM1 , FAT10 or ISG15.
- heterologous tRNA recognizes a codon that is not recognized by an endogenous tRNA
- the orthogonal heterologous tRNA synthetase/tRNA pair is a pyrrolysyl tRNA synthetase and a tRNA from a Methanosarcina species, preferably the pyrrolysyl tRNA synthetase is a mutant pyrrolysyl tRNA synthetase having an amino acid sequence selected from the group consisting of SEQ ID NO: 10, SEQ ID NO: 1 1 , SEQ ID NO: 12, and SEQ ID NO: 13 and/or the tRNA is the Methanosarcina barkeri tRNAcu A as shown in SEQ ID NO: 14.
- the host cell is a vertebrate cell, mammalian cell, human cell, animal cell, invertebrate cell, plant cell, nematodal cell, insect cell, stem cell, fungal cell, yeast cell, a bacterial cell, or a multicellular organism comprising a host cell, wherein the multicellular organism is preferably a mammal, an insect or a nematode and more preferably Caenorhabditis elegans, Drosophila or mouse.
- the method is performed in vitro, wherein the modified polypeptide is obtained by admixing and incubating the provided components under conditions allowing transpeptidase mediated transpeptidation reaction.
- the second polypeptide comprises a recognition motif selected from the group consisting of SEQ ID NO: 15 (LPXTG), SEQ ID NO: 16 (LLPXTG), SEQ ID NO: 17 (LAXTG), SEQ ID NO: 18 (LLAXTG), SEQ ID NO: 19 (LPXSG) and SEQ ID NO: 20 (LLPXSG).
- SEQ ID NO: 15 LPXTG
- SEQ ID NO: 16 LLPXTG
- SEQ ID NO: 17 LAXTG
- SEQ ID NO: 18 LAAXTG
- SEQ ID NO: 19 LPXSG
- SEQ ID NO: 20 LLPXSG
- transpeptidase is a subtiligase
- the first polypeptide comprises an unnatural amino acid as defined in item 4 or 5;
- the second polypeptide is an ubiquitin or ubiquitin-like-protein comprising a thioester at the C-terminus
- subtiligase has preferably an amino acid sequence which has an identity of at least 70% to the amino acid sequence shown in SEQ ID NO: 41.
- the method according to any one of items 6 to 1 1 wherein the ubiquitin or ubiquitin-like- protein comprises a recognition motif for a first sortase A enzyme and one or more Lysine residues in the ubiquitin or ubiquitin-like-protein have been substituted with an unnatural amino acid as defined in item 5, the method further comprising
- an ubiquitin or ubiquitin-like protein comprising a C-terminus comprising a recognition motif for a second sortase A enzyme that is different to the recognition motif of the first sortase A enzyme, wherein one or more Lysine residues in the ubiquitin or ubiquitin- like-protein have been substituted with an unnatural amino acid as defined in item 5, and
- step (vii) optionally repeating step (v) and (vi), wherein the ubiquitin or ubiquitin-like protein comprises a C-terminus comprising a recognition motif for a sortase A enzyme that is different to the recognition motifs of the ubiquitin or ubiquitin-like protein of the preceding steps; and
- step (v) removing the azido group of AzGGK or AzGK of the unnatural amino acid in the ubiquitin or ubiquitin-like-protein by providing a phosphine, preferably 2- (diphenylphosphino)benzoic acid (2DPBA) or removing the photoprotective group by providing a light pulse; (vi) incubating the polypeptide obtained in step (v) with a phosphine, preferably 2- (diphenylphosphino)benzoic acid (2DPBA) or removing the photoprotective group by providing a light pulse; (vi) incubating the polypeptide obtained in step (v) with a phosphine, preferably 2- (diphenylphosphino)benzoic acid (2DPBA) or removing the photoprotective group by providing a light pulse; (vi) incubating the polypeptide obtained in step (v) with a phosphine, preferably 2- (diphenylphosphino)benzoic acid (2DPBA)
- an ubiquitin or ubiquitin-like protein comprising a thioester at the C-terminus, wherein one or more Lysine residues in the ubiquitin or ubiquitin-like-protein have been substituted with an unnatural amino acid as defined in item 5, and
- step (vii) optionally repeating step (v) and (vi);
- a modified polypeptide comprising a chain of two or more ubiquitins or ubiquitin-like-proteins.
- a modified polypeptide obtainable by the method according to any one of items 1-14.
- a pyrrolysyl tRNA synthetase comprising an amino acid sequence having at least 70% sequence identity to the amino acid sequence set forth in
- a sortase A mutant comprising an amino acid sequence having at least 60% sequence identity to the amino acid sequence set forth in SEQ ID NO: 5, wherein amino acid residue 36 is Arginine, amino acid residue 44 is Cysteine, amino acid residue 46 is Histidine, amino acid residue 47 is Lysine, amino acid residue 50 is Glutamine, amino acid residue 80 is Proline, amino acid residue 94 is Isoleucine, amino acid residue 102 is Lysine, amino acid residue 104 is Histidine, amino acid residue 106 is Asparagine, amino acid residue 107 is Alanine, amino acid residue 109 is Glutamic acid, amino acid residue 115 is Glutamic acid, amino acid residue 124 is Valine, amino acid residue 132 is Glutamic acid, and amino acid residue 138 is Serine.
- a polynucleotide encoding a sortase A mutant as defined in item 20 Use of the sortase A mutant according to item 20 in the method according to any one of item 3 to 1 1 and 13 or for catalyzing a transpeptidation reaction.
- a kit comprising one or more polynucleotides encoding
- a host cell comprising:
- an ubiquitin or ubiquitin-like-protein wherein the C-terminus of the ubiquitin or ubiquitin-like-protein comprises a recognition motif for the sortase A or a thioester;
- the overnight culture was diluted to an OD 6 oo of 0.05 in auto- induction medium (17 amino acid mix (no phenylalanine was added), see Supplementary Methods), supplemented with full-strength antibiotics (tetracycline and ampicillin) and 4 mM AzGGK.
- auto- induction medium 17 amino acid mix (no phenylalanine was added), see Supplementary Methods
- full-strength antibiotics tetracycline and ampicillin
- 4 mM AzGGK 4 mM AzGGK
- lysis buffer (20 mM Tris pH 8.0, 30 mM imidazole, 300 mM NaCI, 0.175 mg/ml_ PMSF, 0.1 mg/ml_ DNase I and one completeTM protease inhibitor tablet (Roche)).
- the cell suspension was incubated on ice for 30 minutes and sonicated with cooling in an ice-water bath.
- the lysed cells were centrifuged (15,000 xg, 40 min, 4 °C), the cleared lysate was added to Ni 2+ -NTA slurry (Jena Bioscience) (0.2 ml. of slurry per 100 ml. of culture) and the mixture was incubated with agitation for one hour at 4 °C.
- Reduction of the azide moiety of AzGGK to the amine moiety (GGK) on the POI was either performed directly in the cell lysate or on purified proteins.
- 1 mM 2DPBA 100 mM stock solution in EtOH
- the lysate was centrifuged (15,000 xg, 30 min, 4 °C) and the purification was performed as described in the Online or Supplementary Methods.
- 2 equivalents of 2DPBA were added to the purified re-buffered protein, followed by incubation for 2 h at room temperature and re-buffering to remove excess of 2DPBA.
- the overnight culture was diluted to an OD 6 oo of 0.05 in auto- induction medium supplemented with half-strength antibiotics and 4 mM AzGGK. After incubation for 24 hours at 37 °C the cells were gently washed two times with PBS (2000 xg, 15 min, 4 °C) and re-suspended in auto-induction media supplemented with half-strength antibiotics. After incubation at 37 °C (200 rpm) for one hour a 1 ml. sample was taken, centrifuged (2000 xg, 2 min, 4 °C), washed twice with PBS (2000 xg, 2 min, 4 °C), flash frozen and stored at -20 °C.
- HEK293T Human embryonic kidney 293T cells
- DMEM Dulbecco’s modified Eagle's medium
- FBS fetal bovine serum
- antibiotic-antimycotic solution 25 pg/mL amphotenicin B, 10 mg/mL streptomycin, and 10,000 units of penicillin, Sigma-Aldrich
- HEK293T cells were seeded in Poly-L-lysine coated 6-well plates (Greiner) at 6x10 5 cells/well to reach 60-80 % confluence for transfection.
- Fresh DMEM, supplemented with 2 mM AzGGK was added followed by transfection using 1 pg of the respective sortase plasmids (plRES_mSrt2A or plRES_Srt2A), 1.5 pg of the corresponding Ub(LAT) or SUMO(LAT) plasmids (pcDNAJJb(LAT) or pcDNA_SUMO(LAT), 0.56 pg pEF1- sf G F P N 150T AG/P C N AK 164T AG plasmid and 0.19 pg of the pEF1-AzGGK-PylRS plasmid.
- Cell lysis was performed with RIPA buffer followed by Western-Blotting using a-His antibody to detect ubiquitylation and SUMOylation of target proteins.
- Expression of Srt2A or mSrt2A was detected via D-Myc Western-Blotting.
- Ni 2+ -NTA slurry (Jena Bioscience) (0.1 mL/mg of bifunctional ubiquitin) was added to the reaction mixture and the mixture was incubated with agitating for 1 h at 4 °C. After incubation, the mixture was transferred to an empty plastic column and washed with 40 column volumes of wash buffer (50 mM Tris pH 7.5, 150 mM NaCI, 5 mM CaCI 2, 30 mM imidazole) to remove Srt2A and the excess of donor ubiquitin. The protein was eluated in 0.2 mL fractions with wash buffer supplemented with 300 mM imidazole pH 8.0.
- wash buffer 50 mM Tris pH 7.5, 150 mM NaCI, 5 mM CaCI 2, 30 mM imidazole
- the fractions containing the mixture of diubiquitin and unreacted bifunctional ubiquitin were pooled together and concentrated with Amicon® with the corresponding MWCO centrifugal filter units (Millipore).
- size-exclusion chromatography SEC was performed using a Superdex S75 16/600 (GE Healthcare) with sortase buffer.
- Fractions containing Ub-isoK48(LAT)- Ub-LPT were pooled together and concentrated with Amicon® with the corresponding MWCO centrifugal filter units (Millipore).
- Diubiquitin was stored at -80 °C until further use.
- Ni 2+ -NTA slurry (Jena Bioscience) (0.1 ml_/mg of diubiquitin) was added to the reaction mixture and the mixture was incubated agitating for 1 h at 4 °C. After incubation, the mixture was transferred to an empty plastic column and washed with 40 CV of wash buffer (50 mM Tris pH 7.5, 150 mM NaCI, 5 mM CaCI 2, 30 mM imidazole) to remove Srt5M, unreacted bifunctional ubiquitin as well as UbK6GGK. The protein was eluated in 0.2 ml. fractions with wash buffer supplemented with 300 mM imidazole pH 8.0.
- wash buffer 50 mM Tris pH 7.5, 150 mM NaCI, 5 mM CaCI 2, 30 mM imidazole
- DiUb-SUM02 hybrid chains were prepared analogously to TriUb but instead of UbK6GGK SUMOKXXGGK variants were used.
- the sortase-mediated formation of a peptide-bond between a protein of interest (POI) and a modified ubiquitin bearing a sortase recognition tag involves synthesis and site-specific incorporation of a UAA, in which two glycine residues are coupled via an isopeptide-bond to the s- amino group of lysine glycylglycyl-L-lysine GGK, Fig. 7a).
- ubiquitin is expressed with a modified C-terminal sequence bearing two amino acid mutations that serves as recognition tag for the enzyme sortase A (SrtA).
- SrtA is an enzyme from Staphylococcus aureus that catalyzes the covalent attachment of proteins to the bacterial cell wall. SrtA cleaves the peptide-bond between T and G in the recognition motif LPXTG, yielding an activated thioester-intermediate that subsequently undergoes specific transpeptidation with a peptide containing N-terminal glycine residues.
- ubiquitin should be conjugated to a target protein via a native isopeptide-bond between a chosen lysine in the POI and G76 of ubiquitin (Fig. 6).
- the sortase-generated Ub-POI conjugates display two point mutations in the Ub-C-terminus (R72P and R74T), still all surface patches (I36-, I44-, F4-patch and TEK box) that are essential for recognition by ubiquitin-binding domains (UBDs) remain untouched. Since genetic code expansion allows the site-specific incorporation of a UAA at virtually any chosen position into any POI, this method will be useful for the generation of essentially any ubiquitylated protein. Importantly, the approach is applicable to proteins under native conditions, allowing the ubiquitylation of large multi-domain and non-refoldable proteins; an endeavour that is challenging with present chemical methods. Furthermore, such an approach will be extendable to attachment of other Ubls via native isopeptide-linkages, since they all display the highly conserved C-terminal glycylglycine motif.
- GGK is a possible donor in a sortase-mediated transpeptidation reaction
- the present inventors synthesized a seven amino acid long peptide resembling the sortase-compatible C-terminus of ubiquitin (Fmoc-VLPLTGG) via solid phase peptide synthesis (SPPS).
- GGK was synthesized by coupling tert - butyloxycarbonyl protected diglycine to the ⁇ - amino group of lysine (Supplementary Methods).
- the present inventors expressed and purified a mutant version of wt SrtA, Srt5M.
- This engineered sortase variant contains five mutations that confer 140-fold increased activity compared to wt SrtA.
- Incubation of GGK with the Fmoc- VLPLTGG peptide in the presence of Srt5M led to nearly quantitative formation of transpeptidation product within 30 minutes as observed by LC-MS (Fig. 7b and 7c).
- ubiquitin bearing the sortase-compatible C-terminus (dubbed Ub(PT)) reacted to the expected conjugate, when incubated with GGK in the presence of Srt5M (Fig. 7d).
- the azido group of AzGGK can be reduced quantitatively via Staudinger reduction with the water- soluble and cell-permeable phosphine, 2DPBA (2-(diphenylphosphino)benzoic acid) to restore GGK.
- 2DPBA 2-(diphenylphosphino)benzoic acid
- incubation of AzGGK with the peptide Fmoc-VLPLTGG in the presence of Srt5M did not lead to conjugate formation (Fig. 7c).
- sortase- mediated transpeptidation proceeded smoothly.
- AzGGK blocks its activity as a donor-substrate in sortase-mediated transpeptidation
- site-specific incorporation of AzGGK and its in vivo reduction with 2DPBA will provide an approach for site-specific ubiquitylation and SUMOylation in living cells with temporal control.
- lysine-derivatives for which PylRS mutants have been engineered, the side chain is attached via an isopeptide-bond to the s- amino group of lysine, similar as in the wt substrate pyrrolysine. None of the so far incorporated lysine-derivatives carries however a planar and polar peptide-bond in its side chain as it is the case for the amino acid AzGGK (Fig. 1 a).
- the present inventors Guided by structural analyses of the C-terminal catalytic centre of wt PylRS and its mutants, the present inventors screened a panel of > 25 different MbPylRS mutants that accepted lysine-derivatives with long and bulky side chains for their ability to direct the selective and site-specific incorporation of AzGGK. As none of the tested PylRS showed the desired activity, the present inventors created a new PylRS library by DNA-shuffling of the PylRS C-terminal domain of 17 known synthetases and spiking in an unbiased error-prone PCR product of the wt catalytic PylRS C-lobe (Fig. 8). This PylRS library was subjected to alternating rounds of positive and negative selection in E. coli.
- the present inventors combined the positive selection step with a fluorescence readout by co-transforming surviving clones from the negative selection with a reporter plasmid bearing both a chloramphenicol-acetyltransferase gene interrupted by an amber codon, as well as a superfolder green fluorescent protein (sfGFP) gene interrupted by an amber codon.
- sfGFP-expressing colonies that grew in the presence of chloramphenicol were picked and the synthetase efficiency and selectivity evaluated by fluorescence-intensity in the presence and absence of AzGGK.
- the present inventors investigated if it was possible to site-specifically ubiquitylate such modified proteins using Srt5M.
- the present inventors incubated sfGFP-GGK with an excess of Ub(PT) in the presence of Srt5M in aqueous buffer at 37 °C.
- SDS-PAGE- and MS-analysis revealed formation of a sfGFP-Ub(PT) conjugate (Fig. 2a and 2b).
- Nonspecific ubiquitylation was not detected with control sfGFP that contained BocK in place of GGK at the same site, or with sfGFP-GGK and Ub(PT) in the absence of Srt5M (Fig.
- the present inventors introduced a leucine spacer within the LPLTGG sequence between L71 and P72, generating Ub(LPT) with an LLPLTGG C-terminal sequence to shift the sortagging-motif by one amino acid away from the compact ubiquitin b-grasp fold, making it more accessible.
- Ub(LPT) with sfGFP-GGK in the presence of Srt5M afforded > 60% of ubiquitylated sfGFP within five minutes (Fig. 2a).
- the identity of sfGFP-Ub(LPT) conjugate was confirmed by LC-MS (Fig.
- the present inventors were able to produce mono- ubiquitylated proteins and diUbs, the present inventors reasoned that especially the proline residue introduced at position 72 of ubiquitin might be a poor mimic for the native C-terminus, as it gives it an unusual conformational rigidity and the cis-isomer of the L-P peptide-bond might be elevated in respect to the native L-R peptide-bond.
- the present inventors therefore turned their attention to a recently evolved sortase (Srt2A) with reprogrammed substrate specificity (Dorr et al., Proc Natl Acad Sci U S A 1 1 1 , 13343-8 (2014)).
- Srt2A recognizes a LAXTG motif rather than a LPXTG motif, omitting the need to introduce a proline residue into the ubiquitin C-terminus.
- the present inventors expressed Srt2A and ubiquitin bearing an LALTGG (dubbed Ub(AT)) or an LLALTGG (dubbed Ub(LAT)) C-terminus.
- LALTGG deoxyribonucleic acid
- Ub(LAT) dubbed Ub(LAT)
- the sfGFP-Ub(LAT) conjugate formed with conversion rates > 50% within five minutes incubation at 37 °C.
- Control experiments with sfGFP-BocK demonstrated the specificity and selectivity of Srt2A-mediated ubiquitylation using both Ub(LAT) or Ub(AT).
- the present inventors incubated all seven Ub-GGK-His6 acceptor ubiquitins with donor ubiquitins Ub(AT) and Ub(LAT) in the presence of Srt2A.
- K6-, K1 1 -, K33-, K48- and K63-linked diubiquitins (diUbs) were formed in very good conversion rates (> 60%) employing either Ub(AT) or Ub(LAT) (Fig. 3a).
- incubation of Ub-K6GGK with Ub(wt) displaying the native C- terminal sequence LRLRGG did not lead to diUb formation (Fig. 12c).
- the present inventors next determined if their isopeptide-linked diUbs were resistant to DUBs. It has been shown previously that the mutation L73P in the ubiquitin C-terminus confers resistance to various DUB families. The present inventors examined if the introduced R72A and R74T mutations in sortase-generated diubiquitins would also be refractory to cleavage by DUBs.
- the present inventors incubated the purified diUbs with the catalytic domain of ubiquitin carboxyl terminal hydrolase 2 (USP2 CD ) ⁇ While all native diUbs were efficiently cleaved to the corresponding monoubiquitins within one hour, all of the sortase-generated diUbs were resistant to USP2 CD cleavage of the GG-isopeptide-bond (Fig. 3b). On incubating the sortase-generated diUbs with USP2 CD the present inventors observed a faint band that migrated slightly further than our diUbs in SDS-PAGE gels.
- PCNA is a ring-shaped, homotrimeric protein that functions as a sliding clamp during DNA replication and enhances the processivity of DNA polymerase delta (pol ⁇ ). DNA lesions in the template strand lead to stalling of the replication fork, which triggers mono-ubiquitylation of PCNA at K164 (PCNA-Ub) via specific E2/E3 enzymes (Fig. 3c).
- PCNA-Ub in turn leads to recruitment of specialized translesion synthesis (TLS) polymerases to the DNA damage site in order to traverse the damage.
- TLS polymerases contain conserved UBDs for recognition of PCNA-Ub.
- Ubiquitylated PCNA with non-native disulphide or triazole linkages has been obtained via different chemical approaches.
- the present inventors introduced an amber codon at position 164 into the gene coding for PCNA and expressed it in the presence of AzGGK and its specific tRNA/synthetase pair.
- PCNA-K164GGK was purified after reduction with 2DPBA at multi milligram scale.
- SUMO small-ubiquitin-like-modifier proteins display the common b-grasp fold with a flexible six- residue C-terminal tail and the characteristic GG motif that is exposed after proteolytic maturation and enzymatically attached via an isopeptide-bond to a lysine in the target protein.
- the native C-terminal sequence of mature, proteolytically processed SUM01 is QEQTGG.
- S. aureus SrtA derived sortase mutants including Srt5M and Srt2A
- the activity of S. aureus SrtA derived sortase mutants is strongly dependent on Ca 2+ ions. Binding of Ca 2+ to glutamate residues in the b3/b4 loop enhances substrate binding by stabilizing a closed conformation of the active site b6/b7 loop in S. aureus SrtA ( Figure 18). This strong Ca 2+ -dependency may make it difficult to use Srt2A in conditions with low Ca 2+ concentrations, including in the cytosol of living cells or in the presence of Ca 2+ -binding compounds.
- SrtA superfamily shows a conserved active site among different Gram-positive bacteria
- amino acids in the b3/b4 loop that bind Ca 2+ are however not conserved.
- Streptococcus pyogenes SrtA and Bacillus anthracis SrtA show Ca 2+ - independent catalytic activities.
- a Ca 2+ -independent Srt5M mutant (dubbed Srt7M) was recently obtained by substituting two glutamate residues within the b3/b4 loop with neutral or positively charged amino acids.
- Srt7M was used for efficient in vitro peptide ligation both in the presence and absence of CaCI 2 and was shown to be functional in living C. elegans.
- the Srt2A variant is derived from Srt5M and shows in total 1 1 mutations to its parental Srt5M enzyme.
- the negatively charged amino acids in the b3/b4 loop (D47, E50 and D54) that are not conserved in Ca 2+ -independent SrtA mutants are however present in Srt2A.
- the present inventors substituted D47 in Srt2A with a lysine residue, speculating that it might form a salt bridge with E113, thereby balancing the electrostatic repulsion, which might destabilize the closed conformation of the b6/b7 loop in the absence of Ca 2+ .
- the present inventors introduced an E108Q mutation to reduce the negative charge within this pocket (Fig. 18b).
- the present inventors expressed the Srt2A variant bearing two point mutations D47K and E50Q (dubbed mSrt2A) and tested its catalytic efficiency in forming diUbs from Ub-K6GGK and Ub(LAT) in the absence and presence of 5 mM CaCI2.
- the present inventors supplemented the samples lacking CaCI 2 with 5 mM of Ca 2+ -chelating agent ethylene glycol tetraacetic acid (EGTA).
- EGTA ethylene glycol tetraacetic acid
- the present inventors co-expressed mSrt2A, SUMO(AT) and sfGFP-AzGGK-His6 in E. coli. After 24 hours, cells were washed to remove residual AzGGK and treated with 2DPBA to induce reduction to sfGFP-GGK-His6 and thereby trigger SUMOylation. Formation of sfGFP-SUMO(AT) conjugate was analysed by anti-His6 Western Blotting and was visible already 30 minutes after 2DPBA addition. SUMOylation did not take place when the present inventors expressed sfGFP- BocK instead of sfGFP-AzGGK or when the present inventors omitted 2DPBA, proving the specificity and inducibility of their approach (Fig.
- the present inventors transferred the mutations of AzGGKRS into a mammalian optimized /WmPylRS.
- Western blots and fluorescence imaging demonstrated highly efficient incorporation of AzGGK into sfGFP-N150TAG-His6 in HEK293T cells using the AzGGRS/tRNAcu A pair (Fig. 5a).
- the incorporation was confirmed by LC-MS analysis of purified sfGFP-N150AzGGK-His6 (Fig. 5b).
- the present inventors next tested if AzGGK-bearing proteins could be reduced with 2DPBA in vivo.
- 2DPBA and other benign triarylphosphine reagents have been used extensively for bioorthogonal Staudinger ligation/reduction in mammalian cells and 2DPBA shows good cell permeability.
- the present inventors expressed sfGFP-N150AzGGK-His6 in HEK293T cells, washed cells to remove AzGGK and treated them with 500 mM 2DPBA. Cells were lysed and treated with sortase and Ub(LAT) for one hour.
- Western-Blot analysis revealed specific formation of sfGFP-Ub(LAT) conjugates in HEK293T-lysates that had been treated with 2DPBA, proving the inducibility of sortylation (Fig. 21a).
- the present inventors next turned to the question of whether the sortase-mediated ubiquitylation and SUMOylation approach could be used in the cytosol of living HEK293T cells.
- the present inventors co-expressed Ub(LAT) or SUMO(LAT) together with a codon optimized version of C-terminal Myc-tagged mSrt2A and sfGFP-N150AzGGK-His6 in HEK293T cells for 36 hours, washed cells with AzGGK-free medium and treated them with 400 mM 2DPBA overnight. After washing, cells were lysed and analysed by anti-His6 Western-Blotting.
- coli in HEK293T cells, mSrt2A and Srt2A lead to similar ubiquitylation and SUMOylation yields 16 hours after triggering the reaction through addition of 2DPBA (Fig. 21 b).
- the present inventors set out to ubiquitylate/SUMOylate PCNA in live mammalian cells (Fig. 5d). Sortylation is dependent on co- expression and co-localization of all three proteins (PCNA, Ub(LAT) or SUMO(LAT) and sortase) that have to come into proximity to yield ubiquitylated or SUMOylated PCNA.
- the present inventors envisioned that they might enhance co-localization by fusing a nuclear localization sequence to Ub/SUMO constructs and sortase.
- the present inventors co-expressed all components in HEK293T cells for 48 hours, and triggered ubiquitylation/SUMOylation via Staudinger reduction of PCNA-K164AzGGK with 2DPBA.
- Western-Blot analysis 16 hours after addition of 2DPBA showed specific ubiquitylation/SUMOylation of PCNA in the presence of Ub(LAT) or SUMO(LAT), but not when over-expressing Ub(wt) or SUMO(wt) (Fig. 5d).
- Sortase 2A hydrolyzes LPLSG-linked DiUb (but with low activity compared to the positive control (LALTG-linked DiUb) and is therefore not orthogonal to the LPLSG motif (Sortase 4S). Hydrolysis of the LPLTG linked DiUb was not observed. Thus, Sortase 2A is orthogonal to the LPLTG motif (see Fig. 25A).
- Sortase 5M hydrolyzes the LPLSG linked DiUb and is therefore not orthogonal to the LPLSG motif (Sortase 4S). Hydrolysis of the LALTG linked DiUb was not observed. Sortase 5M is thus orthogonal to the LALTG motif (see Fig. 25B).
- Sortase 4S hydrolyzes the LPLTG linked DiUb and is therefore not orthogonal to the LPLTG motif (Sortase 5M). But hydrolysis of the LALTG linked DiUb was not observed (see Fig. 25C). Thus, Sortase 4S is orthogonal to the LALTG motif (S2A) but not vice versa
- the inventors therefore identified the following orthogonal pair: S5M + S2A.
- the orthogonal Sortase pair S5M/S2A the inventors generated a triubiquitin (TriUb) linked via two different isopeptide bonds.
- TriUb triubiquitin
- a’’bifunctional” ubiquitin with K48GGK and the LALTG C-terminus gets converted into a diubiquitin using Ub-LPLTG and S5M (Fig. 26A).
- the purified diubiquitin gets converted into a triubiquitin using S2A and UbK6GGK to produce a triubiquitin linked via isopeptide bonds at positions K48 and K6.
- the assay shows successful formation of TriUb reaching maximum yield after 1 h reaction time (Fig. 26B).
- the band emerging below the DiUb corresponds to the cleavage of the His-Tag as can be seen in the negative control without UbK6GGK.
- Purification of TriUb from the reaction mixture was successful and yielded pure TriUb.
- the measured mass of TriUb generated by orthogonal sortases corresponds to the calculated mass (Fig. 26C).
- the inventors set out to expand orthogonal sortases to SUMO-Ub hybrid chains which play an important role in DNA damage repair.
- the inventors generated the different hybrid chains via two iterative Sortase reactions using the orthogonal Sortase pair S5M/S2A (Fig. 27A).
- the inventors needed to place GGK at different lysine positions in SUM02 (K1 1 , K21 , K33, K35, K42 and K45).
- the LALTG motif was introduced between the ubiquitins and the LPLTG motif between the DiUb and the SUM02.
- the hybrid chain formation assay showed excellent conversion of the DiUb to the hybrid chain within 1 h (Fig. 27B).
- the hybrid could be purified by SEC (Fig. 27C).
- the hybrid chain formation was also shown by the inventors for multiple SUM02 sites (Fig. 27D).
- the present inventors have shown that site-specific incorporation of AzGGK into proteins in bacteria and mammalian cells via genetic code expansion and its reduction to GGK through a bioorthogonal Staudinger reaction allows sortase-mediated ubiquitylation and SUMOylation.
- Sortase-generated Ub-conjugates display a native isopeptide-bond linking the C-terminal G76 to the D-amino group of a chosen lysine in a target protein.
- Sortase-mediated ubiquitylation works under native, aqueous conditions and allows modification of complex, non-refoldable, multi-domain protein targets; an endeavour that is challenging using present chemical ubiquitylation approaches or protein targets where the corresponding E2/E3-enzymes are not known or show reduced activity/specificity in vitro.
- the present inventors utilize sortylation to site-specifically ubiquitylate the homotrimeric DNA repair protein PCNA. As seen for sortase-generated diUbs, the PCNA-Ub(AT) conjugate is refractory to cleavage by its specific DUB complex.
- Sortase-mediated transpeptidation yields SUMO-protein conjugates linking a specific lysine in the target protein with G97 in SUM01 via a native isopeptide-bond, presenting a much sought-after tool to create site-specific, well- defined SUMO conjugates.
- sortylation will also be conferrable to covalent modification of proteins with other Ubls, such as NEDD8, URM1 or Ufm1 , processes that are much less understood than ubiquitylation and SUMOylation.
- the technology described herein both extends and complements existing methods for studying ubiquitylation and SUMOylation networks. Besides providing a general and easily applicable method for studying effects of stable mono-ubiquitylation on multi-domain and non- refoldable protein targets, the sortase-mediated approach can be developed into a tool for identifying ubiquitin binding proteins.
- the donor ubiquitin Ub(AT)
- UAA photo crosslinking
- chemical crosslinking moiety bearing either a photo crosslinking or a chemical crosslinking moiety.
- Sortase-mediated reaction with a POI containing GGK at a specific site will create a well-defined POI-Ub conjugate that can be used for the proteomic identification (using photo crosslinking UAAs) of Ub-binding proteins in mammalian cell lysates or for chemical stabilization of transient E3/DUB-substrate complexes (using e.g. bromoalkyl-bearing UAAs).
- sortase-mediated transpeptidation can be extended to site-specific mono-ubiquitylation and mono-SUMOylation under physiological conditions in living cells, creating a system that is orthogonal to endogenous E1/E2/E3-enzymes.
- the availability of a bacterial system to specifically produce ubiquitylated and SUMOylated eukaryotic proteins in the work horse E. coli may facilitate future crystallographic, biophysical and biochemical analyses of ubiquitylated and SUMOylated proteins.
- E. coli ubiquitylation and SUMOylation is at the moment carried out by co-transforming E. coli with three different plasmids; the present inventors envision that generation of E. coli strains stably expressing sortase and/or Ub/SUMO variants may lead to a more modular and efficient expression system.
- the present inventors’ in vivo mammalian cell ubiquitylation and SUMOylation system may benefit from engineered cell lines, stably expressing sortase and/or modified Ub/SUMO.
- Mono-ubiquitylation of target proteins in mammalian cells regulates processes ranging from membrane transport to transcriptional activation. Since the present inventors approach relies on site-specific incorporation of AzGGK rather than GGK, ubiquitylation becomes triggerable by a small molecule, in principle enabling the study of temporal aspects of mono-ubiquitylation in live cells.
- the present inventors describe a chemo enzymatic approach to ubiquitylate and SUMOylate proteins in vitro and in live cells.
- the approach creates DUB-resistant Ub/SUMO conjugates with native isopeptide-linkages and is easily implementable in typical biology research labs, as the required amino acid AzGGK can be synthesized at multi-gram scale within a day.
- the present inventors imagine this technology, which for the first time shows sortase-based ubiquitylation and SUMOylation in living cells, thereby creating an enzymatic approach that is orthogonal to highly specialized E1/E2/E3-enzymes, will have the potential to provide immediate impact to many ubiquitin researchers.
- Coupling constants are reported in Hertz (Hz) while peak multiplicities are descripted as follows: s (singlet), d (doublet), t (triplet), q (quartet), quint (quintet), dt (doublet of triplets), ddd (doublet of doublet of doublets), m (multiplet), br (broad signal).
- Small molecule LC-MS was carried out on an Agilent Technologies 1260 Infinity LC-MS system with a 6310 Quadrupole spectrometer. The solvent system consisted of 0.1 % formic acid in water as buffer A and 0.1 % formic acid in ACN as buffer B.
- Oligonucleotide primers were designed with NEBuilder and purchased from Sigma. 15 % SDS-PAGE gels (1 10 V for 15 min, then 200 V for 45 min) were run on a BoltTM Mini Gel Tank (Invitrogen) system. Gels were stained with Quick Coomassie Stain (Generon). Protein Color Prestained Protein Standard, Broad Range 11-245 kDa (NEB) was used as protein marker. Western blots were carried out on iBIot® 2 Dry Blotting System (Life Technologies) using Method P0 (20 V for 1 min, 23 V for 4 min, 25 V for 2 min).
- the nitrocellulose membrane was blocked with 5 % skim milk powder solution in 1x TBST buffer (1 h at room temperature) and stained with 1 :5000 Anti-His6-Peroxidase antibody (Roche) in 1 % skim milk powder solution in 1 x TBST (1 h at room temperature). After washing, the membrane was treated with WesternBrightTM ECL-spray (Advansta) and the proteins visualized using ImageQuantTM LAS 4000 (GE Life Sciences). Protein and DNA concentrations were measured on NanoPhotometer® NP60 (Implen).
- SPPS solid phase peptide synthesis
- SPPS was performed according to the Fmoc-strategy for solid phase synthesis using CTC- resin. Therefore, 1.2 eq. of Fmoc-protected amino acid and 2.5 eq. of DIPEA were dissolved in anhydrous DCM (10 mL/g resin) and then added to 1.0 eq. of CTC-resin (100-200 mesh, 1.0-1.6 mmol/g maximal loading capacity) in a syringe equipped with a frit. The mixture was allowed to shake for 1 h at RT. Capping of the remaining chlorotrityl-groups was performed by adding 3 eq.
- Transformants were grown overnight in 1 L of LB-Kanamycin medium at 37 °C while shaking at 200 rpm. Cells were harvested by centrifugation at 4,000 g and 20 °C for 10 minutes. Plasmids were isolated with a Plasmid Midi-prep kit (Qiagen) following manufacturers instructions and stored at -20 °C.
- L/toPylRS derivatives (Supplementary Table S2) of our laboratory known to incorporate large and bulky unnatural amino acids, were subjected to DNA-shuffling in combination with the L/toPylRS error-prone PCR product.
- the 723 bp long Mb PylRS C-lobe fragment was amplified with the Shuffle_C-lobe primer pair (Supplementary Table S1 ) and the high-fidelity Q5-Polymerase (New England Biolabs).
- DNA fragments were reassembled by self-priming PCR followed by amplification with the Shuffle_PylRS_Clobe primer- pair (Supplementary Table S1 ) and the high-fidelity Q5-Polymerase (NEB Technologies).
- the PCR product was cloned into the wt PylRS backbone by restriction digest with Pstl + Bstell (NEB Technologies) restriction enzymes for 2 hours at 37 °C, overnight ligation at 16 °C with T4 Ligase (NEB Technologies) and electroporation into electrocompetent E.coli ⁇ H10b cells at 2.0 kV, 200 Ohm, and 25 pF (BioRad MicroPulser).
- Cells were recovered in 1 mL SOB-medium for one hour at 37 °C and 200 rpm. Recovered cells were grown overnight in 1 L LB with full strength Kan medium at 37 °C and 200 rpm. Cells were harvested by centrifugation at 4,000 g and 20 °C and plasmids were isolated with a plasmid midi-prep kit (Qiagen), followed by storage at -20°C. Library depth was calculated by dilution experiments and verified by sequencing, using the PylRS_seq primer- pair (Supplementary Table S1 ).
- AzGGKRS evolution was performed by subsequent positive and negative selection, 11,12 followed by a second positive selection coupled to a reporter-sfGFP-150TAG-readout.
- the positive selection plasmid pRep_PylT encodes a tetracycline resistance cassette and a constitutively expressed chloramphenicol acetyltransferase gene, bearing an amber codon at position 1 11 and a constitutively expressed Mb-pyrrolysyl-tRNA C u A (PylT).
- the dual-reporter plasmid psfGFP_150TAG_CAT_1 11TAG_PylT additionally encodes a sfGFP-150TAG.
- the negative selection plasmid pYOBB_PylT encodes a chloramphenicol resistance cassette, a constitutively expressed PylT and a L-arabinose inducible barnase gene, interrupted by amber codons at positions 3 and 45.
- 3 pg of lib-shuffle library were transformed by electroporation into 100 pL freshly prepared electrocompetent E.
- the cell suspension was diluted in 500 mL 2xYT-medium with half- strength kanamycin and tetracycline to reach an OD 6 oo ⁇ 0.1 , followed by an incubation at 200 rpm and 37 °C until reaching an OD 6 oo ⁇ 0.3.
- 10 mL of the cell suspension were transferred into a 50 mL centrifugation tube, 2 mM AzGGK were added to the cells and incubated for 4 hours at 37 °C.
- 600 pL of AzGGK-containing culture were plated on 24 cm x 24 cm plates containing 200 mL of GMML-agar with 240 pL chloramphenicol (1 .2 x strength), 200 pL tetracycline (full strength) and 200 pL kanamycin (full strength), as well as a 2 mM concentration of AzGGK. Plates were incubated for 36 - 48 h aiming for a single-clone distribution on the plates.
- lib-shuffle-DNA obtained from the surviving positive selection clones were transformed by electroporation into freshly prepared electrocompetent E. coli ⁇ H10b cells containing the negative selection plasmid pYOBB_PylT.
- 2 x 500 pL of the SOC-rescued culture were plated on two square 24 cm x 24 cm plates containing 200 ml of LB-agar, full strength kanamycin and chloramphenicol, as well as 0.2 % L-arabinose. Plates were incubated for 36 - 48h at 37 °C, aiming for single-clone distribution on the plates.
- sfGFP-expressing colonies were picked into 96-deep-well plates containing 1 mL of non-inducing media with full strength kanamycin and tetracycline, followed by 48 hours incubation at 200 rpm and 37 °C.
- 50 pL of the cell suspension were transferred into two new 96- deep-well plates containing 1 mL of auto-inducing medium with full strength kanamycin and tetracycline, as well as one of them containing 2 mM AzGGK, followed by 48 hours incubation at 37 °C (200 rpm).
- the residual 96-deep-well plate with non-inducing medium was centrifuged at 4.000 g and 4 °C, followed by storage at -20 °C.
- IPTG was added to a final concentration of 0.4 mM and protein expression was induced for three hours at 30 °C.
- the cells were harvested by centrifugation (4,000xg, 15 min, 4 °C) and re- suspended in lysis buffer (50 mM Tris pH 8.0, 300 mM NaCI supplemented with 1 mM MgCI 2 , 0.1 mg/mL DNAsel, one completeTM protease inhibitor tablet (Roche) and 0.175 mg/ml PMSF).
- lysis buffer 50 mM Tris pH 8.0, 300 mM NaCI supplemented with 1 mM MgCI 2 , 0.1 mg/mL DNAsel, one completeTM protease inhibitor tablet (Roche) and 0.175 mg/ml PMSF.
- the fractions containing the protein were pooled together, concentrated and rebuffered (20 mM Tris pH 7.5, 150 mM NaCI, 5 mM CaCI 2 ) with Amicon® Ultra-4 10K MWCO centrifugal filter units (Millipore). Enzyme concentration was calculated from the measured A280 absorption (extinction coefficients were calculated with ProtParam (https://web.expasy.org/protparam/)). All Sortase variants were stored at 4 °C for further use.
- Sortase variants harbouring a C-terminal TEV-His-tag were expressed and purified identically.
- the fractions containing the protein were pooled together and 200 pL of TEV protease (1.8 mg/mL) was added.
- the mixture was transferred to a dialysis tubing (Roth) and the dialysis bag was immersed in 2 L of cold dialysis buffer (25 mM Tris pH 8.0, 150 mM NaCI, 2 mM DTT) and stirred at 4 °C overnight.
- the protein mixture was recovered from the dialysis tubing and centrifuged (15,000xg, 10 min, 4 °C) in order to precipitate the TEV protease.
- 2 mL of Ni 2+ -NTA slurry (Jena Bioscience) were added to the supernatant and the mixture was incubated with agitation for one hour at 4 °C.
- the mixture was then poured into an empty plastic column and the flow-through was collected.
- the Ni 2+ -NTA beads were washed twice with 15 mL of wash buffer (20 mM Tris pH 8.0, 150 mM NaCI and 5 mM CaCI 2 ).
- IPTG was added to a final concentration of 1 mM and protein expression was induced for 4 hours at 37 °C.
- the cells were harvested by centrifugation (4,000xg, 15 min, 4 °C) and resuspended in lysis buffer (50 mM Tris pH 7.6, supplemented with 10 mM MgCI2, 1 mM EDTA, 0.1 % NP-40, 0.1 mg/mL DNAsel, one completeTM protease inhibitor tablet and 0.175 mg/mL PMSF). Cells were lysed by sonication and centrifuged (15,000 xg, 40 min, 4 °C).
- the cleared lysate was transferred into a glass beaker in an ice-bath that was placed on a magnetic stirrer. Precipitation was performed with 35 % perchloric acid until pH 4.0 - 4.5 was reached. After 5 minutes incubation at 4 °C while stirring, the milky solution was centrifuged (15,000xg, 40 min, 4 °C) and the supernatant was transferred into a dialysis tubing with a MWCO of 2 kDa. Dialysis was performed over night at 4 °C with 50 mM Ammonium acetate buffer pH 4.5.
- the dialyzed solution was centrifuged (15,000xg, 40 min, 4 °C), filtered and purified via a HiTrap SP FF 5 mL cation exchange chromatography (GE, gradient 0 - 1 M NaCI). Fractions that were > 95 % purity, as judged by SDS-PAGE, were pooled and rebuffered (20 mM Tris pH 8.0, 150 mM NaCI, 5 mM CaCI 2 ) with Amicon® Ultra-15 3kDa MWCO centrifugal filter units (Millipore). Enzyme concentration was calculated from the measured A280 absorption (extinction coefficients were calculated with ProtParam (https://web.expasy.org/protparam/)). All purified Ubiquitin variants were stored at 4 °C for further use.
- CPD is a cysteine protease domain of the Vibrio cholerae MARTX toxin 13 and pBK_AzGGKRS (which encodes AzGGKRS) plasmids.
- the cells were cultured overnight in 50 mL of non-inducing medium containing full strength antibiotics (tetracycline and ampicilline), at 37 °C, 200 rpm. The overnight culture was diluted to an OD 6 oo of 0.05 in autoinduction medium supplemented with full strength antibiotics and 4 mM AzGGK. After incubation overnight at 37 °C the cells were harvested by centrifugation (4000 xg, 15 min, 4 °C), flash frozen in liquid nitrogen and stored at -80 °C.
- the composition of non- inducing and autoinducing media can be found in Supplementary Tables S3-S7.
- the pellet was thawed on ice and re-suspended in lysis buffer (20 mM Tris pH 7.5, 0.5 mM PMSF, 300 mM NaCI, 0.2 % NP-40, 0.2 mg/ml Lysozyme).
- the cell suspension was incubated on ice for 30 minutes and sonicated at 4 °C on ice.
- the lysed cells were centrifuged (15,000xg, 30 min, 4 °C), the cleared lysate added to Ni 2+ -NTA slurry (Jena Bioscience) (0.2 mL of slurry per 100 mL of culture) and the mixture was incubated with agitation for 1 h at 4 °C.
- the column bound CPD-His6 was eluted afterwards with elution buffer 2 (20 mM Tris pH 7.5, 150 mM NaCI, 5 mM CaCI 2 and 300 mM imidazole).
- the fractions containing PCNA-K164AzGGK (identified via 15% SDS-PAGE) were pooled together, concentrated and rebuffered (20 mM Tris pH 7.5, 150 mM NaCI, 5 mM CaCI 2 ) with Amicon® Ultra-4 10K MWCO centrifugal filter units (Millipore). Protein concentration was calculated from the measured A280 absorption (extinction coefficients were calculated with ProtParam (https://web.expasy.org/protparam/)). PCNA was stored at 4 °C for further use.
- IPTG was added to a final concentration of 1 mM and protein expression was induced for 4 h at 37 °C.
- the cells were harvested by centrifugation (4,000xg, 15 min, 4 °C) and resuspended in lysis buffer (50 mM Tris pH 7.6, supplemented with 10 mM MgCI 2 , 1 mM EDTA, 0.1 % NP-40, 0.1 mg/ml_ DNAsel, one completeTM protease inhibitor tablet and 0.175 mg/ml_ PMSF). Cells were lysed by sonication and centrifuged (15,000xg, 40 min, 4 °C).
- the obtained cell pellets were resuspended in 20 ml. of lysis buffer (20 mM Tris pH 8.0, 30 mM imidazole, 300 mM NaCI, 0.175 mg/ml_ PMSF, 0.1 mg/ml_ DNase I and one completeTM protease inhibitor tablet (Roche)).
- the cell suspension was incubated on ice for 30 minutes and sonicated at 4 °C on ice.
- the lysed cells were centrifuged (15,000xg, 40 min, 4 °C), the cleared lysate added to Ni 2+ -NTA slurry (Jena Bioscience) (0.2 ml. of slurry per 100 ml.
- Ub-GGK acceptor ubiquitins with GGK at positions K6, K1 1 , K33, K48 and K63 were diluted to 20 mM in sortase buffer (50 mM Tris pH 7.5, 150 mM NaCI, 5 mM CaCI 2 ). Afterwards 100 mM of the donor ubiquitin (either Ub(AT) or Ub(LAT)) was added followed by the addition of 20 pM Srt2A (without His-Tag). Incubation was performed at 37 °C, 600 rpm, for one hour when Ub(LAT) was used and for 18 hours when Ub(AT) was used. Sortase-mediated transpeptidation was stopped by the addition of 200 pM phenylvinylsulfon and further incubation for 10 minutes at 37 °C, 600 rpm.
- Ni 2+ -NTA slurry (Jena Bioscience) (0.1 ml_/mg of acceptor ubiquitin) was added to the reaction mixture and the mixture was incubated agitating for 1 h at 4 °C. After incubation, the mixture was transferred to an empty plastic column and washed with 40 CV of wash buffer (50 mM Tris pH 7.5, 150 mM NaCI, 5 mM CaCI 2, 30 mM imidazole) to remove Srt2A and the excess of donor ubiquitin. The protein was eluated in 0.2 ml. fractions with wash buffer supplemented with 300 mM imidazole pH 8.0.
- wash buffer 50 mM Tris pH 7.5, 150 mM NaCI, 5 mM CaCI 2, 30 mM imidazole
- the fractions containing the mixture of diubiquitin and unreacted acceptor ubiquitin were pooled together and concentrated with Amicon® with the corresponding MWCO centrifugal filter units (Millipore).
- size- exclusion chromatography SEC was performed using a Superdex S75 16/600 (GE Healthcare) with sortase buffer. Fractions containing the diubiquitn were pooled together and concentrated with Amicon® with the corresponding MWCO centrifugal filter units (Millipore). Diubiquitins were stored at 4 °C until further use.
- PCNA-K164GGK was diluted to 5 mM in sortase buffer (50 mM Tris pH 7.5, 150 mM NaCI, 5 mM CaCI 2 ). Afterwards 100 pM of His-TEV-Ub(AT) and 20 pM Srt2A (without His-Tag) was added and the mixture incubated at 25 °C (600 rpm) for 44 hours. Sortase mediated transpeptidation was stopped by the addition of 200 pM phenylvinylsulfon and further incubation for 10 minutes at 25 °C, 600 rpm.
- SEC size-exclusion chromatography
- SEC-buffer 20 mM HEPES pH 7.5, 150 mM KCI, 0.5 mM TCEP, 5 % Glycerol (w/v)
- Fractions containing pure ubiquitylated PCNA were pooled together and concentrated with Amicon® with the corresponding MWCO centrifugal filter units (Millipore). Protein concentration was calculated from the measured A280 absorption (extinction coefficients were calculated with ProtParam (https://web.expasy.org/protparam/)).
- Ubiquitylated PCNA was stored at 4 °C until further use.
- USP2 CD 100 ng, Boston Biochem
- DUB dilution buffer 25 mM Tris pH 7.5, 150 mM NaCI, 10 mM DTT
- 2 pg of native diubiquitin (UbiQBio) or sortase-generated diubiquitin were added to 3 pL 10x DUB buffer (500 mM Tris pH 7.5, 500 mM NaCI, 50 mM DTT) and constituted to 20 pL with H 2 0.
- 10 pL of the activated DUB was added to diubiquitin samples followed by incubation at 37 °C. 6 pL samples were taken at the denoted time points and quenched by the addition of 4x SDS loading buffer and boiling at 95 °C for 10 minutes. Samples were loaded on SDS-PAGE gels and visualized by Coomassie staining.
- PCNA-Ub(AT) conjugate Natively ubiquitylated PCNA (a kind gift from Christian Biertijmpfel, MPI Martinsried) or sortase-gen rated PCNA-Ub(AT) conjugate was diluted to 1 pM into DUB buffer (50 mM HEPES pH 7.5, 150 mM NaCI, 0.5 mM TCEP, 1 mM EDTA). UAF1 and USP1 (Boston Biochem) were added in equimolar ratio to a final concentration of 100 nM to the ubiquitylated PCNA conjugates in DUB buffer. The mixture was incubated at 37 °C.
- Fmoc-VLPLTGG (20 mM stock solution in DMSO) was diluted in sortase buffer (50 mM Tris pH 7.5, 150 mM NaCI, 5 mM CaCI 2 ) to a final concentration of 1 mM followed by the addition of 10 mM AzGGK (50 mM stock in H 2 0) or GGK (50 mM stock in H 2 0). Subsequently Srt5M (300-800 mM stock in sortase buffer) was added to a final concentration of 20 mM. Incubation was performed at 37 °C (600 rpm). Samples were taken at the denoted time points by quenching the reaction mixture with 10 volumes of 0.5 % formic acid prior to HPLC-MS analysis. Typical reaction volumes were 50 mI_.
- DMEM fetal bovine serum
- Biochrom Biochrom
- 1 % antibiotic-antimycotic solution 25 pg/mL amphotenicin B, 10 mg/ml_ streptomycin, and 10,000 units of penicillin, Sigma-Aldrich
- HEK293T cells were seeded in Poly-L-lysine coated 6-well plates (Greiner) at 6x10 5 cells/well for Western-Blotting and in a 100 mm dish at 3x10 6 cells/dish for protein purification followed by ESI-MS analysis. Fresh DMEM, supplemented with 2 mM AzGGK, was added to the cells prior to transfection. Transfection was performed using PEI Transfection Reagent (Sigma). A plasmid ratio of 3:1/pEF1-sfGFPN150TAG:pEF1-AzGGKRS was used for the co-transfection. Cells were incubated for 24 to 48 hours prior to optional in vivo reduction. For protein purification of sfGFP-N150AzGGK-His6, cells were lysed with 500 pl_ lysis buffer (50 mM Tris pH 8.0, 150 mM
- 2DPBA-treated cells (growing in 6-well plates) were scraped in 500 pL PBS, transferred into reaction tubes, and pelleted via centrifugation (500xg, 5 min, 4 °C). The supernatant was discarded and cells were re-suspended in 50-100 pL buffer of choice. After re-suspension, cells were flash frozen in liquid nitrogen and thawed on ice. This was repeated three times to break the membrane. Cell debris and nuclei were pelleted via centrifugation for 15 min at max. speed and 4 °C. The supernatant was removed and used for in lysate sortase-mediated ubiquitylation experiments.
- freeze thaw lysis was performed using calcium-depleting buffer (50 mM Tris pH 7.5, 150 mM NaCI, 5 mM EGTA) containing ethylene glycol tetraacetic acid (EGTA), which is a calcium chelating agent.
- calcium-depleting buffer 50 mM Tris pH 7.5, 150 mM NaCI, 5 mM EGTA
- EGTA ethylene glycol tetraacetic acid
- sortase buffer 50 mM Tris pH 7.5, 150 mM NaCI, 5 mM CaCI 2
- 25 pL of supernatant were supplemented with 20 pM of the corresponding sortase variant, followed by ten minutes incubation at 37 °C.
- human codon optimized ubiquitin, SUMO, PCNA and sortase (mSrt2A and Srt2A) and /WmAzGGKRS genes were purchased as DNA Strings (GeneArt, Thermo Fisher) and cloned into pcDNA3.1 , pIRES and pEF1 vectors via standard restriction cloning. Point mutations, insertions and deletions were introduced using Site-directed, Ligase- Independent Mutagenesis (SLIM). Srt5M and Srt2A in pET29b vectors were purchased (Addgene Plasmids #75144 and #75145) and mutations were introduced using SLIM cloning.
- SLIM Site-directed, Ligase- Independent Mutagenesis
- pBAD Duett vectors were created using Gibson Assembly (New England Biolabs).
- pET17 vectors containing modified ubiquitin and SUMO were created via restriction cloning. Point mutations and insertions were introduced using SLIM.
- An overview of bacterial and mammalian cell constructs can be found in Supplementary Tables S8 and S9.
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- Microbiology (AREA)
- Medicinal Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Biophysics (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Analytical Chemistry (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Enzymes And Modification Thereof (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
Description
Claims
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP18181497 | 2018-07-03 | ||
PCT/EP2019/067820 WO2020007899A1 (en) | 2018-07-03 | 2019-07-03 | Means and methods for site-specific protein modification using transpeptidases |
Publications (1)
Publication Number | Publication Date |
---|---|
EP3818072A1 true EP3818072A1 (en) | 2021-05-12 |
Family
ID=62873174
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP19745053.9A Pending EP3818072A1 (en) | 2018-07-03 | 2019-07-03 | Means and methods for site-specific protein modification using transpeptidases |
Country Status (2)
Country | Link |
---|---|
EP (1) | EP3818072A1 (en) |
WO (1) | WO2020007899A1 (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116949004B (en) * | 2023-07-13 | 2024-04-09 | 苏州源启生物科技有限公司 | Transpeptidase Sortase A and preparation method and application thereof |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB0721291D0 (en) * | 2007-10-30 | 2007-12-12 | Medical Res Council | Methods and compositions |
US10260038B2 (en) * | 2013-05-10 | 2019-04-16 | Whitehead Institute For Biomedical Research | Protein modification of living cells using sortase |
WO2015042393A2 (en) * | 2013-09-20 | 2015-03-26 | President And Fellows Of Harvard College | Evolved sortases and uses thereof |
GB201518076D0 (en) * | 2015-10-13 | 2015-11-25 | Nat Inst For Biotechnology In The Negev The Ltd | Methods and kits for determining deacylase activity |
-
2019
- 2019-07-03 WO PCT/EP2019/067820 patent/WO2020007899A1/en unknown
- 2019-07-03 EP EP19745053.9A patent/EP3818072A1/en active Pending
Also Published As
Publication number | Publication date |
---|---|
WO2020007899A1 (en) | 2020-01-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Fottner et al. | Site-specific ubiquitylation and SUMOylation using genetic-code expansion and sortase | |
CN108124453B (en) | Cas9 retroviral integrase and Cas9 recombinase system for targeted incorporation of DNA sequences into the genome of a cell or organism | |
AU2022200851B2 (en) | Using nucleosome interacting protein domains to enhance targeted genome modification | |
JP7109547B2 (en) | An engineered Cas9 system for eukaryotic genome modification | |
KR20190059966A (en) | S. The Piogenes CAS9 mutant gene and the polypeptide encoded thereby | |
WO2005051976A2 (en) | Protein and peptide ligation processes and one-step purification processes | |
US20160304571A1 (en) | Synthesis of Site Specifically-Linked Ubiquitin | |
US20210363206A1 (en) | Proteins that inhibit cas12a (cpf1), a cripr-cas nuclease | |
EP3818072A1 (en) | Means and methods for site-specific protein modification using transpeptidases | |
WO2011024887A1 (en) | Conjugate containing cyclic peptide and method for producing same | |
WO2020045281A1 (en) | Novel nuclease domain and uses thereof | |
US20200291444A1 (en) | Macrocyclisation Tags | |
Singh et al. | One-Step sortase-mediated chemoenzymatic semisynthesis of deubiquitinase-resistant ub-peptide conjugates | |
Exner | Incorporation of novel noncanonical amino acids in model proteins using rational and evolved variants of Methanosarcina mazei pyrrolysyl-tRNA synthetase | |
JP2011239686A (en) | Method of producing peptide and peptide derivative | |
KR101646728B1 (en) | A method of synthesizing unnatural protein using degenercy reprogramming | |
Fottner | Development of a novel chemoenzymatic approach to ubiquitylate target proteins | |
Tsai et al. | Genetically Incorporated Non-Canonical Amino Acids: Methods and Protocols | |
Liu | Genetic Code Expansion for Protein Activity Control and Protein Conjugation | |
Radtke | Unnatural Amino Acids for Protein Modification by Oxime Ligation | |
WO2023039434A1 (en) | Systems and methods for transposing cargo nucleotide sequences | |
Carpenter | Elucidating Ubiquitin Chain Formation Catalyzed by E2 Enzymes through use of a Di-Ubiquitin Probe | |
Tworak et al. | High throughput protein production | |
WO2019213528A1 (en) | Producing recombinant proteins with reduced levels of host cell proteins | |
US20100068766A1 (en) | Composition for cleaving and/or connecting single strand dna |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: UNKNOWN |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE |
|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE |
|
17P | Request for examination filed |
Effective date: 20210202 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |
|
DAV | Request for validation of the european patent (deleted) | ||
DAX | Request for extension of the european patent (deleted) | ||
RAP1 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: ETH ZURICH |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: EXAMINATION IS IN PROGRESS |
|
17Q | First examination report despatched |
Effective date: 20231207 |