CN115491361A - Application of expandase and mutant thereof in production of G-7-ADCA - Google Patents
Application of expandase and mutant thereof in production of G-7-ADCA Download PDFInfo
- Publication number
- CN115491361A CN115491361A CN202110676893.1A CN202110676893A CN115491361A CN 115491361 A CN115491361 A CN 115491361A CN 202110676893 A CN202110676893 A CN 202110676893A CN 115491361 A CN115491361 A CN 115491361A
- Authority
- CN
- China
- Prior art keywords
- amino acid
- ala
- thr
- leu
- asp
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 101710104123 Deacetoxycephalosporin C synthase Proteins 0.000 title claims abstract description 53
- 238000004519 manufacturing process Methods 0.000 title description 4
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 claims abstract description 81
- 229940056360 penicillin g Drugs 0.000 claims abstract description 33
- 230000000694 effects Effects 0.000 claims abstract description 27
- 238000006049 ring expansion reaction Methods 0.000 claims abstract description 4
- 108090000623 proteins and genes Proteins 0.000 claims description 91
- 102000004169 proteins and genes Human genes 0.000 claims description 75
- 235000018102 proteins Nutrition 0.000 claims description 73
- 150000001413 amino acids Chemical group 0.000 claims description 64
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 claims description 44
- 229960004441 tyrosine Drugs 0.000 claims description 44
- 235000002374 tyrosine Nutrition 0.000 claims description 44
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 claims description 44
- 125000000539 amino acid group Chemical group 0.000 claims description 42
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 claims description 40
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 claims description 36
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 claims description 36
- 229960002433 cysteine Drugs 0.000 claims description 36
- 235000018417 cysteine Nutrition 0.000 claims description 36
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 claims description 36
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 claims description 32
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 claims description 32
- 229960000310 isoleucine Drugs 0.000 claims description 32
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 claims description 30
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 claims description 29
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 claims description 28
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 claims description 28
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 claims description 28
- 235000014304 histidine Nutrition 0.000 claims description 28
- 229960002885 histidine Drugs 0.000 claims description 28
- 229960004295 valine Drugs 0.000 claims description 28
- 239000004474 valine Substances 0.000 claims description 28
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 claims description 24
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 claims description 24
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 claims description 24
- 238000006243 chemical reaction Methods 0.000 claims description 24
- 229960003136 leucine Drugs 0.000 claims description 24
- 229930182817 methionine Natural products 0.000 claims description 24
- 229960004452 methionine Drugs 0.000 claims description 24
- 238000000034 method Methods 0.000 claims description 24
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 claims description 20
- 239000004475 Arginine Substances 0.000 claims description 20
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 claims description 20
- 239000004471 Glycine Substances 0.000 claims description 20
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 claims description 20
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 claims description 20
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 claims description 20
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 claims description 20
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 claims description 20
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 claims description 20
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 claims description 20
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 claims description 20
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 claims description 20
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 claims description 20
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 claims description 20
- 239000004472 Lysine Substances 0.000 claims description 20
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 claims description 20
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 claims description 20
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 claims description 20
- 239000004473 Threonine Substances 0.000 claims description 20
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 claims description 20
- 229960003767 alanine Drugs 0.000 claims description 20
- 235000004279 alanine Nutrition 0.000 claims description 20
- 229960003121 arginine Drugs 0.000 claims description 20
- 235000009697 arginine Nutrition 0.000 claims description 20
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 claims description 20
- 229960001230 asparagine Drugs 0.000 claims description 20
- 235000009582 asparagine Nutrition 0.000 claims description 20
- 229960005261 aspartic acid Drugs 0.000 claims description 20
- 235000003704 aspartic acid Nutrition 0.000 claims description 20
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 claims description 20
- 235000013922 glutamic acid Nutrition 0.000 claims description 20
- 229960002989 glutamic acid Drugs 0.000 claims description 20
- 239000004220 glutamic acid Substances 0.000 claims description 20
- 235000004554 glutamine Nutrition 0.000 claims description 20
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 claims description 20
- 229960002743 glutamine Drugs 0.000 claims description 20
- 229960002449 glycine Drugs 0.000 claims description 20
- 235000018977 lysine Nutrition 0.000 claims description 20
- 229960003646 lysine Drugs 0.000 claims description 20
- 235000008729 phenylalanine Nutrition 0.000 claims description 20
- 229960005190 phenylalanine Drugs 0.000 claims description 20
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 claims description 20
- 229960002429 proline Drugs 0.000 claims description 20
- 229960001153 serine Drugs 0.000 claims description 20
- 235000004400 serine Nutrition 0.000 claims description 20
- 235000008521 threonine Nutrition 0.000 claims description 20
- 229960002898 threonine Drugs 0.000 claims description 20
- 229960004799 tryptophan Drugs 0.000 claims description 20
- 229930182555 Penicillin Natural products 0.000 claims description 15
- 229940049954 penicillin Drugs 0.000 claims description 15
- 230000014509 gene expression Effects 0.000 claims description 10
- 108020004707 nucleic acids Proteins 0.000 claims description 9
- 102000039446 nucleic acids Human genes 0.000 claims description 9
- 150000007523 nucleic acids Chemical class 0.000 claims description 9
- 238000000855 fermentation Methods 0.000 claims description 8
- 230000004151 fermentation Effects 0.000 claims description 8
- 239000013598 vector Substances 0.000 claims description 8
- 229940024606 amino acid Drugs 0.000 claims description 7
- 235000001014 amino acid Nutrition 0.000 claims description 7
- 230000008569 process Effects 0.000 claims description 7
- 241000894006 Bacteria Species 0.000 claims description 6
- 238000002360 preparation method Methods 0.000 claims description 4
- 230000009261 transgenic effect Effects 0.000 claims description 4
- 238000010367 cloning Methods 0.000 claims 1
- 244000005700 microbiome Species 0.000 claims 1
- 102000004190 Enzymes Human genes 0.000 abstract description 21
- 108090000790 Enzymes Proteins 0.000 abstract description 21
- NVIAYEIXYQCDAN-CLZZGJSISA-N 7beta-aminodeacetoxycephalosporanic acid Chemical compound S1CC(C)=C(C(O)=O)N2C(=O)[C@@H](N)[C@@H]12 NVIAYEIXYQCDAN-CLZZGJSISA-N 0.000 abstract description 15
- 238000005516 engineering process Methods 0.000 abstract description 9
- 108010073038 Penicillin Amidase Proteins 0.000 abstract description 7
- 230000009471 action Effects 0.000 abstract description 4
- 239000000758 substrate Substances 0.000 abstract description 4
- 238000009412 basement excavation Methods 0.000 abstract description 2
- 125000003275 alpha amino acid group Chemical group 0.000 abstract 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 24
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 19
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 19
- 108010000761 leucylarginine Proteins 0.000 description 19
- 108010061238 threonyl-glycine Proteins 0.000 description 19
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 18
- 108010012581 phenylalanylglutamate Proteins 0.000 description 17
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 14
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 14
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 14
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 14
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 14
- JQSWHKKUZMTOIH-QWRGUYRKSA-N Asn-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N JQSWHKKUZMTOIH-QWRGUYRKSA-N 0.000 description 14
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 description 14
- VRJZMZGGAKVSIQ-SRVKXCTJSA-N Cys-Tyr-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VRJZMZGGAKVSIQ-SRVKXCTJSA-N 0.000 description 14
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 14
- NPTGGVQJYRSMCM-GLLZPBPUSA-N Gln-Gln-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPTGGVQJYRSMCM-GLLZPBPUSA-N 0.000 description 14
- ZNTDJIMJKNNSLR-RWRJDSDZSA-N Gln-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZNTDJIMJKNNSLR-RWRJDSDZSA-N 0.000 description 14
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 14
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 14
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 14
- ZPVJJPAIUZLSNE-DCAQKATOSA-N His-Arg-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O ZPVJJPAIUZLSNE-DCAQKATOSA-N 0.000 description 14
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 14
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 14
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 14
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 14
- MCNGIXXCMJAURZ-VEVYYDQMSA-N Met-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCSC)N)O MCNGIXXCMJAURZ-VEVYYDQMSA-N 0.000 description 14
- SXWQMBGNFXAGAT-FJXKBIBVSA-N Met-Gly-Thr Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SXWQMBGNFXAGAT-FJXKBIBVSA-N 0.000 description 14
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 14
- IWRZUGHCHFZYQZ-UFYCRDLUSA-N Phe-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 IWRZUGHCHFZYQZ-UFYCRDLUSA-N 0.000 description 14
- ALHULIGNEXGFRM-QWRGUYRKSA-N Phe-Cys-Gly Chemical compound OC(=O)CNC(=O)[C@H](CS)NC(=O)[C@@H](N)CC1=CC=CC=C1 ALHULIGNEXGFRM-QWRGUYRKSA-N 0.000 description 14
- AXIOGMQCDYVTNY-ACRUOGEOSA-N Phe-Phe-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 AXIOGMQCDYVTNY-ACRUOGEOSA-N 0.000 description 14
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 14
- FKKHDBFNOLCYQM-FXQIFTODSA-N Pro-Cys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O FKKHDBFNOLCYQM-FXQIFTODSA-N 0.000 description 14
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 14
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 14
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 14
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 14
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 14
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 14
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 14
- LMKKMCGTDANZTR-BZSNNMDCSA-N Tyr-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LMKKMCGTDANZTR-BZSNNMDCSA-N 0.000 description 14
- XYBNMHRFAUKPAW-IHRRRGAJSA-N Tyr-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XYBNMHRFAUKPAW-IHRRRGAJSA-N 0.000 description 14
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 14
- 108010044940 alanylglutamine Proteins 0.000 description 14
- 108010013835 arginine glutamate Proteins 0.000 description 14
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 14
- 108010081551 glycylphenylalanine Proteins 0.000 description 14
- 108010071207 serylmethionine Proteins 0.000 description 14
- HNJNAMGZQZPSRE-GUBZILKMSA-N Arg-Pro-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O HNJNAMGZQZPSRE-GUBZILKMSA-N 0.000 description 13
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 13
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 13
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 13
- UCHSVZYJKJLPHF-BZSNNMDCSA-N Asp-Phe-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UCHSVZYJKJLPHF-BZSNNMDCSA-N 0.000 description 12
- XIKYNVKEUINBGL-IUCAKERBSA-N Glu-His-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O XIKYNVKEUINBGL-IUCAKERBSA-N 0.000 description 12
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 12
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 12
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 11
- 108010049041 glutamylalanine Proteins 0.000 description 11
- KPGXRSRHYNQIFN-UHFFFAOYSA-N 2-oxoglutaric acid Chemical compound OC(=O)CCC(=O)C(O)=O KPGXRSRHYNQIFN-UHFFFAOYSA-N 0.000 description 10
- QARPMYDMYVLFMW-KKUMJFAQSA-N Phe-Pro-Glu Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 QARPMYDMYVLFMW-KKUMJFAQSA-N 0.000 description 10
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 10
- 238000004321 preservation Methods 0.000 description 10
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 9
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 9
- KYDYGANDJHFBCW-DRZSPHRISA-N Ala-Phe-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KYDYGANDJHFBCW-DRZSPHRISA-N 0.000 description 9
- LMPKCSXZJSXBBL-NHCYSSNCSA-N Arg-Gln-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O LMPKCSXZJSXBBL-NHCYSSNCSA-N 0.000 description 9
- YLVGUOGAFAJMKP-JYJNAYRXSA-N Arg-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YLVGUOGAFAJMKP-JYJNAYRXSA-N 0.000 description 9
- KACWACLNYLSVCA-VHWLVUOQSA-N Asp-Trp-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KACWACLNYLSVCA-VHWLVUOQSA-N 0.000 description 9
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 9
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 9
- RPLLQZBOVIVGMX-QWRGUYRKSA-N Gly-Asp-Phe Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RPLLQZBOVIVGMX-QWRGUYRKSA-N 0.000 description 9
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 9
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 9
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 9
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 9
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 9
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 9
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 9
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 9
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 9
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 9
- MPYZGXUYLNPSNF-NAZCDGGXSA-N Trp-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)O MPYZGXUYLNPSNF-NAZCDGGXSA-N 0.000 description 9
- AEOFMCAKYIQQFY-YDHLFZDLSA-N Tyr-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AEOFMCAKYIQQFY-YDHLFZDLSA-N 0.000 description 9
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 9
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 9
- 108010031719 prolyl-serine Proteins 0.000 description 9
- CIWBSHSKHKDKBQ-JLAZNSOCSA-N Ascorbic acid Chemical compound OC[C@H](O)[C@H]1OC(=O)C(O)=C1O CIWBSHSKHKDKBQ-JLAZNSOCSA-N 0.000 description 8
- QQXOYLWJQUPXJU-WHFBIAKZSA-N Asp-Cys-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O QQXOYLWJQUPXJU-WHFBIAKZSA-N 0.000 description 8
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 8
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 8
- FMFNIDICDKEMOE-XUXIUFHCSA-N Leu-Val-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMFNIDICDKEMOE-XUXIUFHCSA-N 0.000 description 8
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 8
- 238000001514 detection method Methods 0.000 description 8
- 239000013604 expression vector Substances 0.000 description 8
- 108010078144 glutaminyl-glycine Proteins 0.000 description 8
- 108010034529 leucyl-lysine Proteins 0.000 description 8
- 230000035772 mutation Effects 0.000 description 8
- UWOPETAWXDZUJR-ACZMJKKPSA-N Asp-Cys-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O UWOPETAWXDZUJR-ACZMJKKPSA-N 0.000 description 7
- KNDCWFXCFKSEBM-AVGNSLFASA-N Asp-Tyr-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O KNDCWFXCFKSEBM-AVGNSLFASA-N 0.000 description 7
- JXQVYPWVGUOIDV-MXAVVETBSA-N Phe-Ser-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JXQVYPWVGUOIDV-MXAVVETBSA-N 0.000 description 7
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 7
- 108010016616 cysteinylglycine Proteins 0.000 description 7
- 108010054813 diprotin B Proteins 0.000 description 7
- OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 6
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 6
- BKTXKJMNTSMJDQ-AVGNSLFASA-N Leu-His-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BKTXKJMNTSMJDQ-AVGNSLFASA-N 0.000 description 6
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 6
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 6
- 239000013612 plasmid Substances 0.000 description 6
- MCKSLROAGSDNFC-ACZMJKKPSA-N Ala-Asp-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MCKSLROAGSDNFC-ACZMJKKPSA-N 0.000 description 5
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 5
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 5
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 5
- MTANSHNQTWPZKP-KKUMJFAQSA-N Arg-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O MTANSHNQTWPZKP-KKUMJFAQSA-N 0.000 description 5
- AUIJUTGLPVHIRT-FXQIFTODSA-N Arg-Ser-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N AUIJUTGLPVHIRT-FXQIFTODSA-N 0.000 description 5
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 5
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 5
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 5
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 5
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 5
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 5
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 5
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 5
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 5
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 5
- BJWKOATWNQJPSK-SRVKXCTJSA-N Leu-Met-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BJWKOATWNQJPSK-SRVKXCTJSA-N 0.000 description 5
- ZTPWXNOOKAXPPE-DCAQKATOSA-N Lys-Arg-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N ZTPWXNOOKAXPPE-DCAQKATOSA-N 0.000 description 5
- 108010066427 N-valyltryptophan Proteins 0.000 description 5
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 5
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 5
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 5
- GYUUYCIXELGTJS-MEYUZBJRSA-N Thr-Phe-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O GYUUYCIXELGTJS-MEYUZBJRSA-N 0.000 description 5
- JTMZSIRTZKLBOA-NWLDYVSISA-N Trp-Thr-Gln Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O JTMZSIRTZKLBOA-NWLDYVSISA-N 0.000 description 5
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 5
- HWXBTNAVRSUOJR-UHFFFAOYSA-N alpha-hydroxyglutaric acid Natural products OC(=O)C(O)CCC(O)=O HWXBTNAVRSUOJR-UHFFFAOYSA-N 0.000 description 5
- 229940009533 alpha-ketoglutaric acid Drugs 0.000 description 5
- 230000015572 biosynthetic process Effects 0.000 description 5
- 230000003197 catalytic effect Effects 0.000 description 5
- 108010010147 glycylglutamine Proteins 0.000 description 5
- 239000000203 mixture Substances 0.000 description 5
- 239000000047 product Substances 0.000 description 5
- 238000003786 synthesis reaction Methods 0.000 description 5
- DPSUVAPLRQDWAO-YDHLFZDLSA-N Asn-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)N)N DPSUVAPLRQDWAO-YDHLFZDLSA-N 0.000 description 4
- 108020004414 DNA Proteins 0.000 description 4
- DHNWZLGBTPUTQQ-QEJZJMRPSA-N Gln-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N DHNWZLGBTPUTQQ-QEJZJMRPSA-N 0.000 description 4
- GGAPIOORBXHMNY-ULQDDVLXSA-N Lys-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)O GGAPIOORBXHMNY-ULQDDVLXSA-N 0.000 description 4
- NKDSBBBPGIVWEI-RCWTZXSCSA-N Met-Arg-Thr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NKDSBBBPGIVWEI-RCWTZXSCSA-N 0.000 description 4
- CZQZSMJXFGGBHM-KKUMJFAQSA-N Phe-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O CZQZSMJXFGGBHM-KKUMJFAQSA-N 0.000 description 4
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 4
- 241000187433 Streptomyces clavuligerus Species 0.000 description 4
- WBPFYNYTYASCQP-CYDGBPFRSA-N Val-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N WBPFYNYTYASCQP-CYDGBPFRSA-N 0.000 description 4
- 229960005070 ascorbic acid Drugs 0.000 description 4
- 235000010323 ascorbic acid Nutrition 0.000 description 4
- 239000011668 ascorbic acid Substances 0.000 description 4
- 238000004128 high performance liquid chromatography Methods 0.000 description 4
- 239000008363 phosphate buffer Substances 0.000 description 4
- 238000003259 recombinant expression Methods 0.000 description 4
- JAYIQMNQDMOBFY-KKUMJFAQSA-N Arg-Glu-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JAYIQMNQDMOBFY-KKUMJFAQSA-N 0.000 description 3
- 229930186147 Cephalosporin Natural products 0.000 description 3
- 108010025076 Holoenzymes Proteins 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- 241000228417 Sarocladium strictum Species 0.000 description 3
- 239000003242 anti bacterial agent Substances 0.000 description 3
- 229940088710 antibiotic agent Drugs 0.000 description 3
- 229940124587 cephalosporin Drugs 0.000 description 3
- 150000001780 cephalosporins Chemical class 0.000 description 3
- 238000012258 culturing Methods 0.000 description 3
- 238000013461 design Methods 0.000 description 3
- 239000003623 enhancer Substances 0.000 description 3
- 230000007613 environmental effect Effects 0.000 description 3
- 229930027917 kanamycin Natural products 0.000 description 3
- 229960000318 kanamycin Drugs 0.000 description 3
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 3
- 229930182823 kanamycin A Natural products 0.000 description 3
- 239000002609 medium Substances 0.000 description 3
- 108020004999 messenger RNA Proteins 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 239000013642 negative control Substances 0.000 description 3
- 239000013641 positive control Substances 0.000 description 3
- NYZGVTGOMPHSJW-CIUDSAMLSA-N Arg-Glu-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N NYZGVTGOMPHSJW-CIUDSAMLSA-N 0.000 description 2
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 2
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 2
- BDFCIKANUNMFGB-PMVVWTBXSA-N His-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 BDFCIKANUNMFGB-PMVVWTBXSA-N 0.000 description 2
- RYQWALWYQWBUKN-FHWLQOOXSA-N Phe-Phe-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RYQWALWYQWBUKN-FHWLQOOXSA-N 0.000 description 2
- CHYAYDLYYIJCKY-OSUNSFLBSA-N Pro-Thr-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CHYAYDLYYIJCKY-OSUNSFLBSA-N 0.000 description 2
- 108091081024 Start codon Proteins 0.000 description 2
- 241001052560 Thallis Species 0.000 description 2
- WNLRTRBMVRJNCN-UHFFFAOYSA-N adipic acid Chemical compound OC(=O)CCCCC(O)=O WNLRTRBMVRJNCN-UHFFFAOYSA-N 0.000 description 2
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 2
- 239000004197 calcium inosinate Substances 0.000 description 2
- 239000004195 dipotassium inosinate Substances 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 239000012634 fragment Substances 0.000 description 2
- 108010050848 glycylleucine Proteins 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 238000002156 mixing Methods 0.000 description 2
- 238000009790 rate-determining step (RDS) Methods 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 230000005026 transcription initiation Effects 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 230000001131 transforming effect Effects 0.000 description 2
- CIPQGGYPCPIDBB-IUODEOHRSA-N (6r,7r)-3-methyl-8-oxo-7-[(2-phenylacetyl)amino]-5-thia-1-azabicyclo[4.2.0]oct-2-ene-2-carboxylic acid Chemical compound N([C@H]1[C@@H]2N(C1=O)C(=C(CS2)C)C(O)=O)C(=O)CC1=CC=CC=C1 CIPQGGYPCPIDBB-IUODEOHRSA-N 0.000 description 1
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 1
- LZRNYBIJOSKKRJ-XVYDVKMFSA-N Ala-Asp-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LZRNYBIJOSKKRJ-XVYDVKMFSA-N 0.000 description 1
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 1
- JBMKAUGHUNFTOL-UHFFFAOYSA-N Aldoclor Chemical group C1=C(Cl)C(S(=O)(=O)N)=CC2=C1NC=NS2(=O)=O JBMKAUGHUNFTOL-UHFFFAOYSA-N 0.000 description 1
- 241000187390 Amycolatopsis lactamdurans Species 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- KJGNDQCYBNBXDA-GUBZILKMSA-N Arg-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N KJGNDQCYBNBXDA-GUBZILKMSA-N 0.000 description 1
- DXQIQUIQYAGRCC-CIUDSAMLSA-N Arg-Asp-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)CN=C(N)N DXQIQUIQYAGRCC-CIUDSAMLSA-N 0.000 description 1
- FVBZXNSRIDVYJS-AVGNSLFASA-N Arg-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N FVBZXNSRIDVYJS-AVGNSLFASA-N 0.000 description 1
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 description 1
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- JPWIMMUNWUKOAD-STQMWFEESA-N Gly-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN JPWIMMUNWUKOAD-STQMWFEESA-N 0.000 description 1
- KFQDSSNYWKZFOO-LSJOCFKGSA-N His-Val-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KFQDSSNYWKZFOO-LSJOCFKGSA-N 0.000 description 1
- 108020005350 Initiator Codon Proteins 0.000 description 1
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 1
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 1
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 1
- 241000187654 Nocardia Species 0.000 description 1
- 241000228150 Penicillium chrysogenum Species 0.000 description 1
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 1
- QBFONMUYNSNKIX-AVGNSLFASA-N Pro-Arg-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QBFONMUYNSNKIX-AVGNSLFASA-N 0.000 description 1
- GXWRTSIVLSQACD-RCWTZXSCSA-N Pro-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1)O GXWRTSIVLSQACD-RCWTZXSCSA-N 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 1
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 description 1
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 1
- FZWLAAWBMGSTSO-UHFFFAOYSA-N Thiazole Chemical group C1=CSC=N1 FZWLAAWBMGSTSO-UHFFFAOYSA-N 0.000 description 1
- LVRFMARKDGGZMX-IZPVPAKOSA-N Thr-Tyr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=C(O)C=C1 LVRFMARKDGGZMX-IZPVPAKOSA-N 0.000 description 1
- 108091036066 Three prime untranslated region Proteins 0.000 description 1
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 1
- 125000002252 acyl group Chemical group 0.000 description 1
- 239000001361 adipic acid Substances 0.000 description 1
- 235000011037 adipic acid Nutrition 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 230000002924 anti-infective effect Effects 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- IYNDLOXRXUOGIU-LQDWTQKMSA-M benzylpenicillin potassium Chemical compound [K+].N([C@H]1[C@H]2SC([C@@H](N2C1=O)C([O-])=O)(C)C)C(=O)CC1=CC=CC=C1 IYNDLOXRXUOGIU-LQDWTQKMSA-M 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- 229960004841 cefadroxil Drugs 0.000 description 1
- NBFNMSULHIODTC-CYJZLJNKSA-N cefadroxil monohydrate Chemical compound O.C1([C@@H](N)C(=O)N[C@H]2[C@@H]3N(C2=O)C(=C(CS3)C)C(O)=O)=CC=C(O)C=C1 NBFNMSULHIODTC-CYJZLJNKSA-N 0.000 description 1
- 229960002588 cefradine Drugs 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 229940106164 cephalexin Drugs 0.000 description 1
- ZAIPMKNFIOOWCQ-UEKVPHQBSA-N cephalexin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@@H]3N(C2=O)C(=C(CS3)C)C(O)=O)=CC=CC=C1 ZAIPMKNFIOOWCQ-UEKVPHQBSA-N 0.000 description 1
- RDLPVSKMFDYCOR-UEKVPHQBSA-N cephradine Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@@H]3N(C2=O)C(=C(CS3)C)C(O)=O)=CCC=CC1 RDLPVSKMFDYCOR-UEKVPHQBSA-N 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- 238000002742 combinatorial mutagenesis Methods 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 238000005094 computer simulation Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 208000012839 conversion disease Diseases 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 108010093176 deacetoxycephalosporin C synthetase Proteins 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000004134 energy conservation Methods 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 238000001976 enzyme digestion Methods 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- -1 hnRNA Proteins 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 238000004811 liquid chromatography Methods 0.000 description 1
- 231100000053 low toxicity Toxicity 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000003278 mimic effect Effects 0.000 description 1
- 239000002773 nucleotide Substances 0.000 description 1
- 125000003729 nucleotide group Chemical group 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 230000001590 oxidative effect Effects 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 239000012064 sodium phosphate buffer Substances 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 238000010200 validation analysis Methods 0.000 description 1
- 239000002351 wastewater Substances 0.000 description 1
- 238000005303 weighing Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0071—Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P35/00—Preparation of compounds having a 5-thia-1-azabicyclo [4.2.0] octane ring system, e.g. cephalosporin
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y114/00—Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14)
- C12Y114/20—Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14) with 2-oxoglutarate as one donor, and the other dehydrogenated (1.14.20)
- C12Y114/20001—Deacetoxycephalosporin-C synthase (1.14.20.1)
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Microbiology (AREA)
- Biotechnology (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
The invention provides an expandase and application of a mutant thereof in producing G-7-ADCA. The amino acid sequence is shown in one of SEQ ID No. 5-13. The invention generates a series of enzymes with the ring expansion activity by NCBI sequence comparison, excavation, directed evolution and other technologies, and the enzymes can expand the substrate penicillin G into G-7-ADCA in one step; G-7-ADCA can be converted into 7-ADCA by removing side chain under the action of penicillin acylase. Therefore, these expandases have important application values in industry.
Description
Technical Field
The invention relates to an application of expandase and a mutant thereof in producing G-7-ADCA, belonging to the technical field of biology.
Background
The cephalosporin antibiotics are anti-infective drugs widely applied clinically at present, and have the advantages of low toxicity, broad spectrum and the like. 7-amino-3-desacetoxycephalosporanic acid (7-ADCA) is one of the important mother nuclei for synthesis of cephalosporin antibiotics, can be used for synthesis of cefalexin, cefradine, cefadroxil and the like, and has huge market consumption.
Currently, there are two main methods for industrially producing 7-ADCA: chemoenzymatic and fermentation processes. The 7-ADCA preparing process with chemical enzyme process includes the steps of oxidizing penicillin G chemically to produce phenylacetyl-7-aminodesacetoxycephalosporanic acid (G-7-ADCA), and eliminating side chain under the action of penicillin amidase to produce 7-ADCA. The process is complex, costly, produces large amounts of waste water and pollution, and is environmentally damaging (Wei CL et al, applied and Environmental microbiology, 2005). Under the national pressure of energy conservation and emission reduction and environmental protection, the process is gradually eliminated.
The 7-ADCA can also be produced industrially by fermentation. In 1998, cho et al have found that expandase (DAOCS) has some activity on penicillin G, making it possible to produce G-7-ADCA from penicillin G (Cho et al, PNAS, 1998). Therefore, it is now common practice in some enterprises to clone the expandase (DAOCS) gene into Penicillium chrysogenum, produce G-7-ADCA by direct fermentation with the addition of adipic acid during fermentation, and then prepare 7-ADCA by side chain removal with penicillin acylase. However, the fermentation method has the problems of complex process, long fermentation period and the like, and the core technology is mainly mastered in foreign companies, so that no industrial method for producing 7-ADCA by adopting the fermentation method is seen in China at present.
The preparation of 7-ADCA by the holoenzyme method has the advantages of environmental protection, low cost and the like. The holoenzyme method starts from cheap penicillin G potassium salt, generates G-7-ADCA through the catalytic ring expansion reaction of penicillin expandase, and then hydrolyzes to remove an acyl side chain through the action of penicillin acylase to generate 7-ADCA (shown in figure 1). The study of penicillin acylases has been well established and has been in industrial use for many years. Most currently used are immobilized penicillin acylases; whereas the activity of expandase is lower and is the rate-limiting step in the pathway for the synthesis of 7-ADCA from penicillin G. Therefore, there is an urgent need to improve the activity of expandase.
Penicillin expandase (DAOCS), also known as deacetoxycephalosporin C synthetase, is a Fe-dependent enzyme 2+ 、O 2 And alpha-ketoglutaric acid, which mainly catalyzes the conversion (ring expansion) of five-membered thiazole ring to six-membered thiazide ring of penicillin, and is a key step for catalyzing streptomyces clavuligerus and other prokaryotes to synthesize cephalosporin antibiotics. Therefore, DAOCS has important industrial application value.
However, in practical application, it is found that DAOCS has low biological activity on penicillin G and cannot meet industrial application, which becomes the rate-limiting step for synthesizing 7-ADCA by using a holoenzyme method. Therefore, the modification of the expandase is urgently needed so as to greatly improve the conversion efficiency of the expandase on penicillin G and lay a foundation for industrial application.
Currently, DAOCS derived from cephalosporium acremonium (c.acremonium), streptomyces clavuligerus (s.clavuligerus) and nocardia (n.lactamdurans) have been expressed and purified, and scdaos derived from streptomyces clavuligerus has obtained a crystal structure, which makes it possible to modify scdaos by protein engineering techniques, improve its catalytic activity, broaden its substrate spectrum, and have made some progress.
Disclosure of Invention
The technical problem to be solved by the invention is how to improve the catalytic activity of the expandase on the expansion of penicillin G.
In order to solve the above technical problems, the present invention firstly provides a protein.
The protein provided by the invention is (a) or (b) or (c) or (d):
(a) The amino acid sequence of the protein is shown as SEQ ID No.4 (named as H7), or the protein which is obtained by substituting, deleting or adding one or more amino acids in the amino acid sequence in (a) and has the activity of expandase and is derived from (a);
(b) The amino acid sequence of the protein is shown as SEQ ID No.5 (named as E727), or the protein which is obtained by substituting, deleting or adding one or more amino acids in the amino acid sequence in the (b) and has the activity of expandase and is derived from the (b);
(c) A protein which has an amino acid sequence shown as SEQ ID No.6 (named as E735), or is derived from (c) by substituting, deleting or adding one or more amino acids in the amino acid sequence in (c) and has an expandase activity;
(d) The amino acid sequence of the protein is shown as SEQ ID No.13 (named as scDAOCS-V303K), or the protein which is obtained by substituting, deleting or adding one or more amino acids in the amino acid sequence in (d) and has the activity of expandase and is derived from (d).
Further, the protein is any one of the following (1) - (6):
(1) A protein represented by the amino acid sequence shown in SEQ ID NO.5 wherein n1 is at position 155 and n2 is at position 184;
(2) A protein represented by the amino acid sequence shown in SEQ ID NO.5 wherein n3 is at position 275, n4 is at amino acid residue 281, and n5 is at position 305;
(3) A protein represented by the amino acid sequence shown in SEQ ID NO.5 wherein position 155 is n1, position 184 is n2, position 275 is n3, position 281 is n4, and position 305 is n 5;
(4) A protein represented by the amino acid sequence shown in SEQ ID NO.6 with m1 at position 155 and m2 at position 184;
(5) A protein represented by the amino acid sequence shown in SEQ ID NO.6, wherein m3 is at position 275, m4 is at position 281, and m5 is at position 305;
(6) A protein represented by the amino acid sequence shown in SEQ ID NO.6, wherein m1 is at position 155, m2 is at position 184, m3 is at position 275, m4 is at position 281, and m5 is at position 305;
specifically, SEQ ID NO: 5n 1 at position 155 may be: cysteine, tyrosine, glycine, alanine, valine, leucine, isoleucine, proline, tryptophan, serine, phenylalanine, asparagine, glutamine, threonine, aspartic acid, glutamic acid, lysine, arginine, or histidine, and n2 at position 184 can be: tyrosine, histidine, glycine, alanine, valine, leucine, isoleucine, methionine, proline, tryptophan, serine, cysteine, phenylalanine, asparagine, glutamine, threonine, aspartic acid, glutamic acid, lysine or arginine, and n3 at position 275 may be: valine, isoleucine, glycine, alanine, leucine, methionine, proline, tryptophan, serine, tyrosine, cysteine, phenylalanine, asparagine, glutamine, threonine, aspartic acid, glutamic acid, lysine, arginine or histidine, and n4 at position 281 may be: cysteine, tyrosine, glycine, alanine, valine, leucine, isoleucine, proline, tryptophan, serine, phenylalanine, asparagine, glutamine, threonine, aspartic acid, glutamic acid, lysine, arginine, or histidine, n5 at position 305 can be: leucine, methionine, glycine, alanine, valine, isoleucine, proline, tryptophan, serine, tyrosine, cysteine, phenylalanine, asparagine, glutamine, threonine, aspartic acid, glutamic acid, lysine, arginine, or histidine;
SEQ ID NO:6, m1 at position 155 can be phenylalanine, lysine, glycine, alanine, valine, leucine, isoleucine, methionine, proline, tryptophan, serine, tyrosine, cysteine, asparagine, glutamine, threonine, aspartic acid, glutamic acid, arginine, or histidine, m2 at position 184 can be aspartic acid, glutamic acid, glycine, alanine, valine, leucine, isoleucine, methionine, proline, tryptophan, serine, tyrosine, cysteine, phenylalanine, asparagine, glutamine, threonine, lysine, arginine, or histidine, m3 at position 275 can be glutamic acid, histidine, glycine, alanine, valine, leucine, isoleucine, methionine, proline, tryptophan, serine, tyrosine, cysteine, phenylalanine, asparagine, glutamine, threonine, aspartic acid, lysine, or arginine, m4 at position 281 can be: tyrosine, proline, glycine, alanine, valine, leucine, isoleucine, methionine, tryptophan, serine, cysteine, phenylalanine, asparagine, glutamine, threonine, aspartic acid, glutamic acid, lysine, arginine or histidine, and m5 at position 305 may be isoleucine, methionine, glycine, alanine, valine, leucine, proline, tryptophan, serine, tyrosine, cysteine, phenylalanine, asparagine, glutamine, threonine, aspartic acid, glutamic acid, lysine, arginine or histidine.
Further, the protein is any one of the following (1) - (6):
(1) The 155 th amino acid residue of the amino acid sequence shown in SEQ ID NO.5 is mutated from cysteine to tyrosine, and the 184 th amino acid residue is mutated from tyrosine to histidine to obtain a protein which is named as E727-M2 (C155Y/Y184H), and the amino acid sequence of the protein is shown in SEQ ID NO. 7;
(2) The protein obtained by mutating the 275 th amino acid residue of the amino acid sequence shown in SEQ ID NO.5 from valine to isoleucine, the 281 th amino acid residue from cysteine to tyrosine and the 305 th amino acid residue from leucine to methionine is named as E727-M3 (V275I/C281Y/L305M), and the amino acid sequence of the protein is shown in SEQ ID NO. 8;
(3) The protein obtained by mutating the 155 th amino acid residue of the amino acid sequence shown in SEQ ID NO.5 from cysteine to tyrosine, mutating the 184 th amino acid residue from tyrosine to histidine, mutating the 275 th amino acid residue from valine to isoleucine, mutating the 281 th amino acid residue from cysteine to tyrosine, and mutating the 305 th amino acid residue from leucine to methionine is named as E727-M5 (C155Y/Y184H/V275I/C281Y/L305M), and the amino acid sequence thereof is shown in SEQ ID NO. 9;
(4) The protein obtained by mutating the 155 th amino acid residue of the amino acid sequence shown in SEQ ID NO.6 from cysteine to tyrosine and mutating the 184 th amino acid residue from tyrosine to histidine is named as E735-M2 (C155Y/Y184H), and the amino acid sequence thereof is shown in SEQ ID NO. 10;
(5) The protein obtained by mutating the 275 th amino acid residue of the amino acid sequence shown in SEQ ID NO.6 from valine to isoleucine, the 281 th amino acid residue from cysteine to tyrosine and the 305 th amino acid residue from isoleucine to methionine is named as E735-M3 (V275I/C281Y/I305M), and the amino acid sequence of the protein is shown in SEQ ID NO. 11;
(6) The protein obtained by mutating the 155 th amino acid residue of the amino acid sequence shown in SEQ ID NO.6 from cysteine to tyrosine, mutating the 184 th amino acid residue from tyrosine to histidine, mutating the 275 th amino acid residue from valine to isoleucine, mutating the 281 th amino acid residue from cysteine to tyrosine, and mutating the 305 th amino acid residue from isoleucine to methionine is named as E735-M5 (C155Y/Y184H/V275I/C281Y/I305M), and the amino acid sequence thereof is shown in SEQ ID No. 12;
nucleic acid molecules which code for the above proteins also belong to the scope of protection of the present invention.
The nucleic acid molecule may be DNA, such as cDNA, genomic DNA or recombinant DNA; the nucleic acid molecule can also be an RNA, such as an mRNA, hnRNA, or tRNA, and the like.
The nucleic acid molecule is specifically a gene encoding the protein.
The recombinant vector, expression cassette, transgenic cell line or recombinant bacterium containing the nucleic acid molecule also belongs to the protection scope of the invention.
The recombinant vector can be a recombinant expression vector and can also be a recombinant cloning vector.
The recombinant expression vector can be constructed using an existing expression vector. The expression vector may also comprise the 3' untranslated region of the foreign gene, i.e., contain the polyadenylation signal and any other DNA segments involved in mRNA processing or gene expression. The poly A signal can direct the addition of poly A to the 3' end of the mRNA precursor. When the gene is used for constructing a recombinant expression vector, any enhanced, constitutive, tissue-specific or inducible promoter can be added in front of the transcription initiation nucleotide, and can be used alone or combined with other promoters; in addition, when the gene of the present invention is used to construct a recombinant expression vector, enhancers, including translational or transcriptional enhancers, may be used, and these enhancer regions may be ATG initiation codon or initiation codon of adjacent regions, etc., but must be in the same reading frame as the coding sequence to ensure proper translation of the entire sequence. The translational control signals and initiation codons are widely derived, either naturally or synthetically. The translation initiation region may be derived from a transcription initiation region or a structural gene.
The expression cassette consists of a promoter capable of driving expression of the gene, and a transcription termination sequence.
The application of the protein as penicillin expandase also belongs to the protection scope of the invention.
The application of the protein in catalyzing penicillin G to generate G-7-ADCA by expanding penicillin G as penicillin expandase to improve the conversion rate of penicillin G also belongs to the protection scope of the invention.
The application of the protein or the nucleic acid molecule or the recombinant vector, the expression cassette, the transgenic cell line or the recombinant bacterium in any one of the following is also within the protection scope of the invention:
(b1) Preparing a product having penicillin expandase activity;
(b2) Preparation of G-7-ADCA.
The invention also provides a method for preparing G-7-ADCA.
The method for preparing G-7-ADCA provided by the invention specifically comprises the following steps: preparing the protein; the protein is used as penicillin expandase to catalyze penicillin G to expand ring to generate G-7-ADCA.
The invention also provides a method for improving the conversion rate of penicillin G in the reaction of penicillin expandase enzyme catalyzing penicillin G to expand ring to generate G-7-ADCA.
The method for improving the conversion rate of penicillin G in the reaction of catalyzing penicillin G to expand ring to generate G-7-ADCA by using the penicillin expandase, which is provided by the invention, is characterized in that the protein is used as the penicillin expandase to catalyze penicillin G to expand ring to generate G-7-ADCA.
The invention generates a series of enzymes with ring expanding activity by NCBI sequence comparison, excavation, directed evolution and other technologies, and the enzymes can expand a substrate penicillin G into G-7-ADCA in one step; G-7-ADCA can be converted into 7-ADCA by removing side chain under the action of penicillin acylase. Therefore, these expandases have important application values in industry.
Drawings
FIG. 1 is a scheme showing the reaction scheme of the 7-ADCA catalyzed by the expandase enzyme.
FIG. 2 shows the sequence alignment information used in SCHEMA technology.
FIG. 3 is a schematic diagram of the position of the Loop-associated mutation site obtained according to the Rossetta Design method.
Detailed Description
The experimental procedures used in the following examples are all conventional procedures unless otherwise specified.
Materials, reagents and the like used in the following examples are commercially available unless otherwise specified.
Example 1: expandase mutant generated based on SCHEMA technology
1. Gene acquisition and vector construction
The mimic DNA shuffling was analyzed for recombination using SCHEMA technology. According to the result of computational simulation, 20 sequences were selected. After codon optimization, a full-length gene is obtained by using a full-gene synthesis mode and is constructed on a pET-24a expression vector. The three expandase sequences (E632, SEQ ID No.1,8MUT SEQ ID No.2, and E633 SEQ ID No. 3) used in SCHEMA technology and the alignment are shown in FIG. 2.
Comparing the sequences by SCHEMA technology to obtain the sequences shown in the table 1, carrying out gene synthesis according to the obtained sequences, and carrying out screening according to the catalytic efficiency after expression.
2. Protein expression and Activity detection
Transforming the expression vector into a host bacterium BL21 (DE 3), setting a negative control group (empty vector for transforming pET-24 a) and a positive control group (pET-24 a-scDAOCS) at the same time, and culturing at 37 ℃ overnight; selecting a single colony, transferring the single colony into an LB culture medium containing 100 mu g/mL kanamycin, and culturing for 8-10h at 37 ℃ and 220 rpm; then, the cells were inoculated into 50mL of LTB medium containing 100. Mu.g/mL in an amount of 1/100, and cultured at 37 ℃ until OD600 reached about 0.6, and then 0.1mM IPTG was added to induce protein expression for 16 hours, thereby obtaining recombinant cells expressing expandase.
3. Reaction system
And collecting the thallus. Pouring the thalli into a 50mL centrifuge tube, and centrifuging at 8000rpm for 10min to obtain thalli; the mycelia were washed once with 50mM phosphate buffer pH 7.4; centrifuging at 8000rpm for 10min to collect thallus; and (5) weighing.
The formula of the reaction solution is as follows: 50mM phosphate buffer pH7.4, 50. Mu.g/mL FeSO 4 0.4mM ascorbic acid, 5mM or 10mM alpha-ketoglutaric acid, 5mM or 10mM penicillin G.
And (4) reacting. The reaction solution was added to the cells at a ratio of 10mL to 1g of wet cells, and then resuspended. The mixture was placed on a shaker at 25 ℃ and 220rpm and reacted for 2 hours.
Detecting the amount of G-7-ADCA produced. After reacting for 2h, 12000rpm, centrifuging for 1min, and taking the supernatant for HPLC detection.
The detection conditions of the liquid chromatogram are as follows: a chromatographic column: agilent ZORBAX SB-C18 Stable bond analytical4.6 x 250mm; mobile phase: 20mM pH 3.0 sodium phosphate buffer: methanol = 55. Flow rate: 1mL/min, and the detection wavelength is 215nm.
4. Conversion data results for catalytic penicillin G conversion
TABLE 1 conversion of expandase mutants produced using SCHEMA technology
The reaction conditions are as follows:
50mM phosphate buffer pH7.4, 50. Mu.g/mL FeSO4,0.4mM ascorbic acid, 10mM alpha-ketoglutaric acid, 10mM penicillin G, 200. Mu.L crude enzyme solution. The mixture was placed on a shaker at 25 ℃ and 220rpm for 30min.
5. Modification of enzymes
E727 and E735 with higher activity are subjected to double mutation, triple mutation and five-mutant transformation, mutation sites in a high-activity mutant H7 are selected and transplanted into E727 and E735, and E727-M2 (C155Y/Y184H), E727-M3 (V275I/C281Y/L305M), E727-M5 (C155Y/Y184H/V275I/C281Y/L305M), E735-M2 (C155Y/Y184H), E735-M3 (V275I/C281Y/I305M) and E727-M5 (C155Y/Y H/V275I/C281Y/I305M) mutants are obtained. The activity is as follows:
TABLE 2 conversion of expandase mutants generated using SCHEMA program
As can be seen from Table 2, the conversion of penicillin G catalyzed by E727-M2 was the highest and better than that of H7.
6. Pure enzyme reaction and HPLC detection thereof
The pure enzyme activity was verified in 1mL system. PBS buffer pH7.4, 50. Mu.g/mL FeSO4,0.4mM ascorbic acid, 5mM alpha-ketoglutaric acid, 5mM penicillin G, 100. Mu.L enzyme solution. The mixture was placed on a shaker at 25 ℃ and 220rpm for 30min. The HPLC detection results are shown in Table 3.
TABLE 3 enzymatic Activity catalyzed by expandase mutants
Enzyme activity (U) is defined as the amount of enzyme that can convert 1 micromole of substrate in 1 minute at 25 ℃; the specific enzyme activity (U/mg) is the enzyme activity per mg of protein.
Experiments prove that the enzyme activity of the mutant E727-M2 is improved by 25 percent compared with the enzyme activity of scH7, and the mutant has better application prospect.
Example 2 mutant obtained by modification of expandase H7 based on Rosetta Design method
1. And (5) obtaining the gene.
All computational designs were performed using Rosetta design software developed by David Baker group, washington university, usa. The first 20 conformations that meet all the necessary conditions were selected for analysis and experimental validation. 4 combined mutation libraries are constructed by using the 7-mutant scDAOCS H7 of the expandase as a template and simulating loop-related mutation sites through dynamics, wherein the mutation libraries are shown in the following table 4. The position of the mutation site on the expandase is shown in FIG. 3.
TABLE 4 mutant library based on RosettaDesign method
2. And (4) establishing a mutant library.
Site combinatorial mutagenesis was performed with the corresponding primers to the selected residues as in table 5.
TABLE 5 primer sequence Listing
Taking the plasmid scDAOCS H7 as a template, the primers T72-F1, T72-F2, T72-F3, T72-F4, T73-F1, T73-F2, T73-F3 and T73-F4 in the table 5 are as follows: 1:1:1:3:1:1:1, and 2. Mu.L of the mixture was used as an upstream primer and 2. Mu.L of T72-T73-R74-R2 in Table 5 was used as a downstream primer to amplify the gene fragment L1. Taking the plasmid scDAOCS H7 as a template, mixing Y94-D95-Y97-F1, Y94-D95-Y97-F2 and Y94-D95-Y97-F3 in the table 5 according to a ratio of 2. The plasmid scDAOCS H7 is used as a template, L179-F1, L179-F2 and L179-F3 in the table 5 are mixed according to a ratio of 2. The gene fragment L4 was amplified by mixing the following 2 parts of N301-F1, N301-F2, Y302-F1, Y302-F2, Y302-F3, V303-F1, V303-F2 in Table 4 with the plasmid scDAOCS H7 as a template, and taking 2. Mu.L of N301-Y302-V303-R2. Mu.L in Table 5 as a downstream primer, in a ratio of 2. First round PCR procedure: heat preservation at 94 ℃ for 2min, (heat preservation at 98 ℃ for 15s, heat preservation at 55 ℃ for 30s, heat preservation at 72 ℃ for 30 s) x 28 cycles, and heat preservation at 72 ℃ for 5min.
And (3) respectively taking 2 mu L of first round PCR products L1, L2, L3 and L4 as primers, taking the plasmid scDAOCS H7 as a template, and performing second PCR to construct ML1, ML2, ML3 and ML4 mutant libraries. Second round PCR program: heat preservation at 94 ℃ for 2min, (heat preservation at 98 ℃ for 15s, heat preservation at 60 ℃ for 30s, heat preservation at 72 ℃ for 5 min) x 28 cycles, and heat preservation at 72 ℃ for 10min.
The second round of PCR products obtained were subjected to the following operations:
to the product of the second round of PCR, 1. Mu.L of Dpn I enzyme was added for digesting the plasmid template, and the mixture was treated at 37 ℃ for 3 hours. And (3) performing electric transformation on 2 mu L of the second round PCR product after enzyme digestion to Escherichia coli BL21 (DE 3), uniformly coating the bacterial liquid of the electric transformed Escherichia coli BL21 (DE 3) on an LB (Lankanamycine) -resistant (with the concentration of 50 mu g/mL) plate, and culturing at the constant temperature of 37 ℃ for 14h to grow a single colony, namely a mutant library. All single colonies on one plate were scraped off for sequencing to verify the diversity of the mutant pool.
3. Mutant screening and reaction system
Single colonies from the mutant pool were inoculated into 300. Mu.L LB medium containing 50. Mu.g/mL kanamycin at the final concentration in 96 deep-well plates, respectively, cultured at 37 ℃ for 8-12 hours while setting a negative control group (empty vector of pET-24 a) and a positive control group (pET-24 a-scDAOCSH 7), and then 200. Mu.L of seed solution was inoculated into 800. Mu.L LB medium containing IPTG at the final concentration of 0.1-0.2mM and 50. Mu.g/mL kanamycin. After 16 hours at 25 ℃, the cells were collected by centrifugation, and 400. Mu.L of 50mM phosphate buffer pH7.4 containing 50. Mu.g/mL FeSO4,0.4mM ascorbic acid, 5mM or 10mM alpha-ketoglutaric acid, 5mM or 10mM penicillin G was added thereto and reacted at 25 ℃ for 2 hours. After reacting for 2h, 12000rpm, centrifuging for 1min, and taking the supernatant for HPLC detection. The detection conditions of the liquid chromatography are shown in the first embodiment. The following enzyme-catalyzed reaction conversion data were obtained:
TABLE 6 mutant conversion
And taking pET24a as a negative control and scDAOCS-H7 as a positive control, carrying out shake flask rescreening on the scDAOCS-V303K mutant, and sequencing the mutant.
The results are shown in Table 7, where the mutant scdaos-H7-V303K has improved activity in whole cells and in crude enzyme compared to scdaos-H7.
TABLE 7 conversion rates of rescreened pools of high activity mutants
In conclusion, compared with the scDAOCS-H7 expandase, the expandase mutant is constructed, so that the enzyme activity is greatly improved, the conversion rate is improved by 25%, and the method has a wide application prospect.
Sequence listing
Wild type WT-scDAOCS
MDTTVPTFSLAELQQGLHQDEFRRCLRDKGLFYLTDCGLTDTELKSAKDLVIDFFEHGSEAEKRAVTSPVPTMRRGFTGLESESTAQITNTGSYSDYSMCYSMGTADNLFPSGDFERIWTQYFDRQYTASRAVAREVLRATGTEPDGGVEAFLDCEPLLRFRYFPQVPEHRSAEEQPLRMAPHYDLSMVTLIQQTPCANGFVSLQAEVGGAFTDLPYRPDAVLVFCGAIATLVTGGQVKAPRHHVAAPRRDQIAGSSRTSSVFFLRPNADFTFSVPLARECGFDVSLDGETATFQDWIGGNYVNIRRTSKA
SEQ IDNo.1(E632)
MDTTVPTFSLDELQEGLHQDEFRRCLTEKGVFYLTDSGLSEADHKSAKDVAVDFFEHGTEEEKRAVTSPIPTIRRGFSGLESESTAQITNTGTYTDYSMCYSMGTSDNLFPTADFERVWTHYFDRMYDASREVARQVLKATGTEPDGGVDAFLDCEPLLRFRYFPEVPEHRSAEEEPLRMAPHYDLSIVTLIQQTPCANGFVSLQAEVDGTFVDLPARPDAVLVFCGAVATLVTGGKVKAPRHHVAAPGRDQRAGSSRTSSVFFLRPKSDFSFSVPLARECGFDVSLDGETATFGDWIGGNYVNIRRTSKA
SEQ IDNo.2(8MUT)
MDTTVPTFSLAELQQGLHQDEFRRCLRDKGLFYLTDCGLTDTELKSAKDLVIDFFEHGSEAEKRAVTSPVPTTRRGFTGLESESTAQITNTGSYSDYSMCYSMGTADNLFPSGDFERIWTQYFDRQYTASRAVAREVLRATGTEPDGGVEAFLDYEPLLRFRYFPQVPEHRSAEEQPLLMAPHHDLSMVTLIQQTPCANGFVSLQAEVGGAFVDLPYRPDAVLVFCGAIATLVTGGQVKAPRHHVAAPRRDQIAGSSRTSSVFFLRPNADFTFSIPLAREYGFDVSLDGETATFQDWIGGNYVNMRRTSKA
SEQ IDNo.3(E633)
MDTTVPTFHLAELQEGLHQDEFRSCLMEKGVFYLTGSSLSEADQKSAKDVVVDFFEHGTEEEKRAVTSPVPTIRRGFTGLESESTAQITNTGSYSDYSMCYSMGTADNLFPSGDFERIWTQYFDRQYTASRAVAREVLRATGTEPDGGVEAFLDCEPLLRFRYFPEVPEHRSAEEEPLRMAPHYDLSMVTLIQQTPCANGFVSLQAEVGGAFTDLPYRPDAVLVFCGAIATLVTGGQVKAPKHHVVAPARDRIAGSSRTSSVFFLRPNADFTFSVPLAKRCGFDIGLDGDTAAFQDWIAGNYVNLRTKTKA
SEQ ID No.4(ScDAOCSH7):
MDTTVPTFSLAELQQGLHQDEFRRCLRDKGLFYLTDCGLTDTELKSAKDLVIDFFEHGSEAEKRAVTSPVPTTRRGFTGLESESTAQITNTGSYSDYSMCYSMGTADNLFPSGDFERIWTQYFDRQYTASRAVAREVLRATGTEPDGGVEAFLDYEPLLRFRYFPQVPEHRSAEEQPLRMAPHHDLSMVTLIQQTPCANGFVSLQAEVGGAFVDLPYRPDAVLVFCGAIATLVTGGQVKAPRHHVAAPRRDQIAGSSRTSSVFFLRPNADFTFSIPLAREYGFDVSLDGETATFQDWIGGNYVNMRRTSKA
SEQ ID No.5(E727):
MDTTVPTFHLAELQEGLHQDEFRSCLMEKGVFYLTGSSLSEADQKSAKDVVIDFFEHGSEAEKRAVTSPVPTTRRGFTGLESESTAQITNTGSYSDYSMCYSMGTADNLFPSGDFERIWTHYFDRMYDASREVARQVLKATGTEPDGGVDAFLDCEPLLRFRYFPEVPEHRSAEEEPLRMAPHYDLSIVTLIQQTPCANGFVSLQAEVDGTFVDLPARPDAVLVFCGAVATLVTGGKVKAPRHHVVAPARDRIAGSSRTSSVFFLRPNADFTFSVPLAKRCGFDIGLDGDTAAFQDWIAGNYVNLRTKTKA
SEQ ID No.6(E735):
MDTTVPTFSLAELQQGLHQDEFRRCLRDKGLFYLTDCGLTDTELKSAKDLVIDFFEHGSEAEKRAVTSPVPTTRRGFTGLESESTAQITNTGSYTDYSMCYSMGTSDNLFPTADFERVWTHYFDRMYDASREVARQVLKATGTEPDGGVDAFLDCEPLLRFRYFPEVPEHRSAEEEPLRMAPHYDLSMVTLIQQTPCANGFVSLQAEVDGTFVDLPARPDAVLVFCGAVATLVTGGKVKAPKHHVVAPARDRIAGSSRTSSVFFLRPNADFTFSVPLAKRCGFDIGLDGDTAAFQDWIAGNYVNLRTKTKA
SEQ ID No.7(E727-M2):
MDTTVPTFHLAELQEGLHQDEFRSCLMEKGVFYLTGSSLSEADQKSAKDVVIDFFEHGSEAEKRAVTSPVPTTRRGFTGLESESTAQITNTGSYSDYSMCYSMGTADNLFPSGDFERIWTHYFDRMYDASREVARQVLKATGTEPDGGVDAFLDYEPLLRFRYFPEVPEHRSAEEEPLRMAPHHDLSIVTLIQQTPCANGFVSLQAEVDGTFVDLPARPDAVLVFCGAVATLVTGGKVKAPRHHVVAPARDRIAGSSRTSSVFFLRPNADFTFSVPLAKRCGFDIGLDGDTAAFQDWIAGNYVNLRTKTKA
SEQ ID No.8(E727-M3):
MDTTVPTFHLAELQEGLHQDEFRSCLMEKGVFYLTGSSLSEADQKSAKDVVIDFFEHGSEAEKRAVTSPVPTTRRGFTGLESESTAQITNTGSYSDYSMCYSMGTADNLFPSGDFERIWTHYFDRMYDASREVARQVLKATGTEPDGGVDAFLDCEPLLRFRYFPEVPEHRSAEEEPLRMAPHYDLSIVTLIQQTPCANGFVSLQAEVDGTFVDLPARPDAVLVFCGAVATLVTGGKVKAPRHHVVAPARDRIAGSSRTSSVFFLRPNADFTFSIPLAKRYGFDIGLDGDTAAFQDWIAGNYVNMRTKTKA
SEQ ID No.9(E727-M5):
MDTTVPTFHLAELQEGLHQDEFRSCLMEKGVFYLTGSSLSEADQKSAKDVVIDFFEHGSEAEKRAVTSPVPTTRRGFTGLESESTAQITNTGSYSDYSMCYSMGTADNLFPSGDFERIWTHYFDRMYDASREVARQVLKATGTEPDGGVDAFLDYEPLLRFRYFPEVPEH
RSAEEEPLRMAPHHDLSIVTLIQQTPCANGFVSLQAEVDGTFVDLPARPDAVLVFCGAVATLVTGGKVKAPRHHVVAPARDRIAGSSRTSSVFFLRPNADFTFSIPLAKRYGFDIGLDGDTAAFQDWIAGNYVNMRTKTKA
SEQ ID No.10(E735-M2):
MDTTVPTFSLAELQQGLHQDEFRRCLRDKGLFYLTDCGLTDTELKSAKDLVIDFFEHGSEAEKRAVTSPVPTTRRGFTGLESESTAQITNTGSYTDYSMCYSMGTSDNLFPTADFERVWTHYFDRMYDASREVARQVLKATGTEPDGGVDAFLDYEPLLRFRYFPEVPEHRSAEEEPLRMAPHHDLSMVTLIQQTPCANGFVSLQAEVDGTFVDLPARPDAVLVFCGAVATLVTGGKVKAPKHHVVAPARDRIAGSSRTSSVFFLRPNADFTFSVPLAKRCGFDIGLDGDTAAFQDWIAGNYVNLRTKTKA
SEQ ID No.11(E735-M3):
MDTTVPTFSLAELQQGLHQDEFRRCLRDKGLFYLTDCGLTDTELKSAKDLVIDFFEHGSEAEKRAVTSPVPTTRRGFTGLESESTAQITNTGSYTDYSMCYSMGTSDNLFPTADFERVWTHYFDRMYDASREVARQVLKATGTEPDGGVDAFLDCEPLLRFRYFPEVPEHRSAEEEPLRMAPHYDLSMVTLIQQTPCANGFVSLQAEVDGTFVDLPARPDAVLVFCGAVATLVTGGKVKAPKHHVVAPARDRIAGSSRTSSVFFLRPNADFTFSIPLAKRYGFDIGLDGDTAAFQDWIAGNYVNMRTKTKA
SEQ ID No.12(E735-M5):
MDTTVPTFSLAELQQGLHQDEFRRCLRDKGLFYLTDCGLTDTELKSAKDLVIDFFEHGSEAEKRAVTSPVPTTRRGFTGLESESTAQITNTGSYTDYSMCYSMGTSDNLFPTADFERVWTHYFDRMYDASREVARQVLKATGTEPDGGVDAFLDYEPLLRFRYFPEVPEHRSAEEEPLRMAPHHDLSMVTLIQQTPCANGFVSLQAEVDGTFVDLPARPDAVLVFCGAVATLVTGGKVKAPKHHVVAPARDRIAGSSRTSSVFFLRPNADFTFSIPLAKRYGFDIGLDGDTAAFQDWIAGNYVNMRTKTKA
SEQ ID No.13(scDAOCS-V303K):
MDTTVPTFSLAELQQGLHQDEFRRCLRDKGLFYLTDCGLTDTELKSAKDLVIDFFEHGSEAEKRAVTSPVPTTRRGFTGLESESTAQITNTGSYSDYSMCYSMGTADNLFPSGDFERIWTQYFDRQYTASRAVAREVLRATGTEPDGGVEAFLDYEPLLRFRYFPQVPEHRSAEEQPLRMAPHHDLSMVTLIQQTPCANGFVSLQAEVGGAFVDLPYRPDAVLVFCGAIATLVTGGQVKAPRHHVAAPRRDQIAGSSRTSSVFFLRPNADFTFSIPLAREYGFDVSLDGETATFQDWIGGNYKNMRRTSKA。
SEQUENCE LISTING
<110> institute of biotechnology for Tianjin industry of Chinese academy of sciences
<120> expandase and application of mutant thereof in production of G-7-ADCA
<130> GNCAQ211831
<160> 1
<170> PatentIn version 3.5
<210> WT-scDAOCS
<211> 311
<212> PRT
<213> C. acremonium
<400> WT-scDAOCS
Met Asp Thr Thr Val Pro Thr Phe Ser Leu Ala Glu Leu Gln Gln Gly
1 5 10 15
Leu His Gln Asp Glu Phe Arg Arg Cys Leu Arg Asp Lys Gly Leu Phe
20 25 30
Tyr Leu Thr Asp Cys Gly Leu Thr Asp Thr Glu Leu Lys Ser Ala Lys
35 40 45
Asp Leu Val Ile Asp Phe Phe Glu His Gly Ser Glu Ala Glu Lys Arg
50 55 60
Ala Val Thr Ser Pro Val Pro Thr Met Arg Arg Gly Phe Thr Gly Leu
65 70 75 80
Glu Ser Glu Ser Thr Ala Gln Ile Thr Asn Thr Gly Ser Tyr Ser Asp
85 90 95
Tyr Ser Met Cys Tyr Ser Met Gly Thr Ala Asp Asn Leu Phe Pro Ser
100 105 110
Gly Asp Phe Glu Arg Ile Trp Thr Gln Tyr Phe Asp Arg Gln Tyr Thr
115 120 125
Ala Ser Arg Ala Val Ala Arg Glu Val Leu Arg Ala Thr Gly Thr Glu
130 135 140
Pro Asp Gly Gly Val Glu Ala Phe Leu Asp Cys Glu Pro Leu Leu Arg
145 150 155 160
Phe Arg Tyr Phe Pro Gln Val Pro Glu His Arg Ser Ala Glu Glu Gln
165 170 175
Pro Leu Arg Met Ala Pro His Tyr Asp Leu Ser Met Val Thr Leu Ile
180 185 190
Gln Gln Thr Pro Cys Ala Asn Gly Phe Val Ser Leu Gln Ala Glu Val
195 200 205
Gly Gly Ala Phe Thr Asp Leu Pro Tyr Arg Pro Asp Ala Val Leu Val
210 215 220
Phe Cys Gly Ala Ile Ala Thr Leu Val Thr Gly Gly Gln Val Lys Ala
225 230 235 240
Pro Arg His His Val Ala Ala Pro Arg Arg Asp Gln Ile Ala Gly Ser
245 250 255
Ser Arg Thr Ser Ser Val Phe Phe Leu Arg Pro Asn Ala Asp Phe Thr
260 265 270
Phe Ser Val Pro Leu Ala Arg Glu Cys Gly Phe Asp Val Ser Leu Asp
275 280 285
Gly Glu Thr Ala Thr Phe Gln Asp Trp Ile Gly Gly Asn Tyr Val Asn
290 295 300
Ile Arg Arg Thr Ser Lys Ala
305 310
<210> 1
<211> 311
<212> PRT
<213> Artificial sequence
<400> 1
Met Asp Thr Thr Val Pro Thr Phe Ser Leu Asp Glu Leu Gln Glu Gly
1 5 10 15
Leu His Gln Asp Glu Phe Arg Arg Cys Leu Thr Glu Lys Gly Val Phe
20 25 30
Tyr Leu Thr Asp Ser Gly Leu Ser Glu Ala Asp His Lys Ser Ala Lys
35 40 45
Asp Val Ala Val Asp Phe Phe Glu His Gly Thr Glu Glu Glu Lys Arg
50 55 60
Ala Val Thr Ser Pro Ile Pro Thr Ile Arg Arg Gly Phe Ser Gly Leu
65 70 75 80
Glu Ser Glu Ser Thr Ala Gln Ile Thr Asn Thr Gly Thr Tyr Thr Asp
85 90 95
Tyr Ser Met Cys Tyr Ser Met Gly Thr Ser Asp Asn Leu Phe Pro Thr
100 105 110
Ala Asp Phe Glu Arg Val Trp Thr His Tyr Phe Asp Arg Met Tyr Asp
115 120 125
Ala Ser Arg Glu Val Ala Arg Gln Val Leu Lys Ala Thr Gly Thr Glu
130 135 140
Pro Asp Gly Gly Val Asp Ala Phe Leu Asp Cys Glu Pro Leu Leu Arg
145 150 155 160
Phe Arg Tyr Phe Pro Glu Val Pro Glu His Arg Ser Ala Glu Glu Glu
165 170 175
Pro Leu Arg Met Ala Pro His Tyr Asp Leu Ser Ile Val Thr Leu Ile
180 185 190
Gln Gln Thr Pro Cys Ala Asn Gly Phe Val Ser Leu Gln Ala Glu Val
195 200 205
Asp Gly Thr Phe Val Asp Leu Pro Ala Arg Pro Asp Ala Val Leu Val
210 215 220
Phe Cys Gly Ala Val Ala Thr Leu Val Thr Gly Gly Lys Val Lys Ala
225 230 235 240
Pro Arg His His Val Ala Ala Pro Gly Arg Asp Gln Arg Ala Gly Ser
245 250 255
Ser Arg Thr Ser Ser Val Phe Phe Leu Arg Pro Lys Ser Asp Phe Ser
260 265 270
Phe Ser Val Pro Leu Ala Arg Glu Cys Gly Phe Asp Val Ser Leu Asp
275 280 285
Gly Glu Thr Ala Thr Phe Gly Asp Trp Ile Gly Gly Asn Tyr Val Asn
290 295 300
Ile Arg Arg Thr Ser Lys Ala
305 310
<210> 2
<211> 311
<212> PRT
<213> Artificial sequence
<400> 2
Met Asp Thr Thr Val Pro Thr Phe Ser Leu Ala Glu Leu Gln Gln Gly
1 5 10 15
Leu His Gln Asp Glu Phe Arg Arg Cys Leu Arg Asp Lys Gly Leu Phe
20 25 30
Tyr Leu Thr Asp Cys Gly Leu Thr Asp Thr Glu Leu Lys Ser Ala Lys
35 40 45
Asp Leu Val Ile Asp Phe Phe Glu His Gly Ser Glu Ala Glu Lys Arg
50 55 60
Ala Val Thr Ser Pro Val Pro Thr Thr Arg Arg Gly Phe Thr Gly Leu
65 70 75 80
Glu Ser Glu Ser Thr Ala Gln Ile Thr Asn Thr Gly Ser Tyr Ser Asp
85 90 95
Tyr Ser Met Cys Tyr Ser Met Gly Thr Ala Asp Asn Leu Phe Pro Ser
100 105 110
Gly Asp Phe Glu Arg Ile Trp Thr Gln Tyr Phe Asp Arg Gln Tyr Thr
115 120 125
Ala Ser Arg Ala Val Ala Arg Glu Val Leu Arg Ala Thr Gly Thr Glu
130 135 140
Pro Asp Gly Gly Val Glu Ala Phe Leu Asp Tyr Glu Pro Leu Leu Arg
145 150 155 160
Phe Arg Tyr Phe Pro Gln Val Pro Glu His Arg Ser Ala Glu Glu Gln
165 170 175
Pro Leu Leu Met Ala Pro His His Asp Leu Ser Met Val Thr Leu Ile
180 185 190
Gln Gln Thr Pro Cys Ala Asn Gly Phe Val Ser Leu Gln Ala Glu Val
195 200 205
Gly Gly Ala Phe Val Asp Leu Pro Tyr Arg Pro Asp Ala Val Leu Val
210 215 220
Phe Cys Gly Ala Ile Ala Thr Leu Val Thr Gly Gly Gln Val Lys Ala
225 230 235 240
Pro Arg His His Val Ala Ala Pro Arg Arg Asp Gln Ile Ala Gly Ser
245 250 255
Ser Arg Thr Ser Ser Val Phe Phe Leu Arg Pro Asn Ala Asp Phe Thr
260 265 270
Phe Ser Ile Pro Leu Ala Arg Glu Tyr Gly Phe Asp Val Ser Leu Asp
275 280 285
Gly Glu Thr Ala Thr Phe Gln Asp Trp Ile Gly Gly Asn Tyr Val Asn
290 295 300
Met Arg Arg Thr Ser Lys Ala
305 310
<210> 3
<211> 311
<212> PRT
<213> Artificial sequence
<400> 3
Met Asp Thr Thr Val Pro Thr Phe His Leu Ala Glu Leu Gln Glu Gly
1 5 10 15
Leu His Gln Asp Glu Phe Arg Ser Cys Leu Met Glu Lys Gly Val Phe
20 25 30
Tyr Leu Thr Gly Ser Ser Leu Ser Glu Ala Asp Gln Lys Ser Ala Lys
35 40 45
Asp Val Val Val Asp Phe Phe Glu His Gly Thr Glu Glu Glu Lys Arg
50 55 60
Ala Val Thr Ser Pro Val Pro Thr Ile Arg Arg Gly Phe Thr Gly Leu
65 70 75 80
Glu Ser Glu Ser Thr Ala Gln Ile Thr Asn Thr Gly Ser Tyr Ser Asp
85 90 95
Tyr Ser Met Cys Tyr Ser Met Gly Thr Ala Asp Asn Leu Phe Pro Ser
100 105 110
Gly Asp Phe Glu Arg Ile Trp Thr Gln Tyr Phe Asp Arg Gln Tyr Thr
115 120 125
Ala Ser Arg Ala Val Ala Arg Glu Val Leu Arg Ala Thr Gly Thr Glu
130 135 140
Pro Asp Gly Gly Val Glu Ala Phe Leu Asp Cys Glu Pro Leu Leu Arg
145 150 155 160
Phe Arg Tyr Phe Pro Glu Val Pro Glu His Arg Ser Ala Glu Glu Glu
165 170 175
Pro Leu Arg Met Ala Pro His Tyr Asp Leu Ser Met Val Thr Leu Ile
180 185 190
Gln Gln Thr Pro Cys Ala Asn Gly Phe Val Ser Leu Gln Ala Glu Val
195 200 205
Gly Gly Ala Phe Thr Asp Leu Pro Tyr Arg Pro Asp Ala Val Leu Val
210 215 220
Phe Cys Gly Ala Ile Ala Thr Leu Val Thr Gly Gly Gln Val Lys Ala
225 230 235 240
Pro Lys His His Val Val Ala Pro Ala Arg Asp Arg Ile Ala Gly Ser
245 250 255
Ser Arg Thr Ser Ser Val Phe Phe Leu Arg Pro Asn Ala Asp Phe Thr
260 265 270
Phe Ser Val Pro Leu Ala Lys Arg Cys Gly Phe Asp Ile Gly Leu Asp
275 280 285
Gly Asp Thr Ala Ala Phe Gln Asp Trp Ile Ala Gly Asn Tyr Val Asn
290 295 300
Leu Arg Thr Lys Thr Lys Ala
305 310
<210> 4
<211> 311
<212> PRT
<213> Artificial sequence
<400> 4
Met Asp Thr Thr Val Pro Thr Phe Ser Leu Ala Glu Leu Gln Gln Gly
1 5 10 15
Leu His Gln Asp Glu Phe Arg Arg Cys Leu Arg Asp Lys Gly Leu Phe
20 25 30
Tyr Leu Thr Asp Cys Gly Leu Thr Asp Thr Glu Leu Lys Ser Ala Lys
35 40 45
Asp Leu Val Ile Asp Phe Phe Glu His Gly Ser Glu Ala Glu Lys Arg
50 55 60
Ala Val Thr Ser Pro Val Pro Thr Thr Arg Arg Gly Phe Thr Gly Leu
65 70 75 80
Glu Ser Glu Ser Thr Ala Gln Ile Thr Asn Thr Gly Ser Tyr Ser Asp
85 90 95
Tyr Ser Met Cys Tyr Ser Met Gly Thr Ala Asp Asn Leu Phe Pro Ser
100 105 110
Gly Asp Phe Glu Arg Ile Trp Thr Gln Tyr Phe Asp Arg Gln Tyr Thr
115 120 125
Ala Ser Arg Ala Val Ala Arg Glu Val Leu Arg Ala Thr Gly Thr Glu
130 135 140
Pro Asp Gly Gly Val Glu Ala Phe Leu Asp Tyr Glu Pro Leu Leu Arg
145 150 155 160
Phe Arg Tyr Phe Pro Gln Val Pro Glu His Arg Ser Ala Glu Glu Gln
165 170 175
Pro Leu Arg Met Ala Pro His His Asp Leu Ser Met Val Thr Leu Ile
180 185 190
Gln Gln Thr Pro Cys Ala Asn Gly Phe Val Ser Leu Gln Ala Glu Val
195 200 205
Gly Gly Ala Phe Val Asp Leu Pro Tyr Arg Pro Asp Ala Val Leu Val
210 215 220
Phe Cys Gly Ala Ile Ala Thr Leu Val Thr Gly Gly Gln Val Lys Ala
225 230 235 240
Pro Arg His His Val Ala Ala Pro Arg Arg Asp Gln Ile Ala Gly Ser
245 250 255
Ser Arg Thr Ser Ser Val Phe Phe Leu Arg Pro Asn Ala Asp Phe Thr
260 265 270
Phe Ser Ile Pro Leu Ala Arg Glu Tyr Gly Phe Asp Val Ser Leu Asp
275 280 285
Gly Glu Thr Ala Thr Phe Gln Asp Trp Ile Gly Gly Asn Tyr Val Asn
290 295 300
Met Arg Arg Thr Ser Lys Ala
305 310
<210> 5
<211> 311
<212> PRT
<213> Artificial sequence
<400> 5
Met Asp Thr Thr Val Pro Thr Phe His Leu Ala Glu Leu Gln Glu Gly
1 5 10 15
Leu His Gln Asp Glu Phe Arg Ser Cys Leu Met Glu Lys Gly Val Phe
20 25 30
Tyr Leu Thr Gly Ser Ser Leu Ser Glu Ala Asp Gln Lys Ser Ala Lys
35 40 45
Asp Val Val Ile Asp Phe Phe Glu His Gly Ser Glu Ala Glu Lys Arg
50 55 60
Ala Val Thr Ser Pro Val Pro Thr Thr Arg Arg Gly Phe Thr Gly Leu
65 70 75 80
Glu Ser Glu Ser Thr Ala Gln Ile Thr Asn Thr Gly Ser Tyr Ser Asp
85 90 95
Tyr Ser Met Cys Tyr Ser Met Gly Thr Ala Asp Asn Leu Phe Pro Ser
100 105 110
Gly Asp Phe Glu Arg Ile Trp Thr His Tyr Phe Asp Arg Met Tyr Asp
115 120 125
Ala Ser Arg Glu Val Ala Arg Gln Val Leu Lys Ala Thr Gly Thr Glu
130 135 140
Pro Asp Gly Gly Val Asp Ala Phe Leu Asp Cys Glu Pro Leu Leu Arg
145 150 155 160
Phe Arg Tyr Phe Pro Glu Val Pro Glu His Arg Ser Ala Glu Glu Glu
165 170 175
Pro Leu Arg Met Ala Pro His Tyr Asp Leu Ser Ile Val Thr Leu Ile
180 185 190
Gln Gln Thr Pro Cys Ala Asn Gly Phe Val Ser Leu Gln Ala Glu Val
195 200 205
Asp Gly Thr Phe Val Asp Leu Pro Ala Arg Pro Asp Ala Val Leu Val
210 215 220
Phe Cys Gly Ala Val Ala Thr Leu Val Thr Gly Gly Lys Val Lys Ala
225 230 235 240
Pro Arg His His Val Val Ala Pro Ala Arg Asp Arg Ile Ala Gly Ser
245 250 255
Ser Arg Thr Ser Ser Val Phe Phe Leu Arg Pro Asn Ala Asp Phe Thr
260 265 270
Phe Ser Val Pro Leu Ala Lys Arg Cys Gly Phe Asp Ile Gly Leu Asp
275 280 285
Gly Asp Thr Ala Ala Phe Gln Asp Trp Ile Ala Gly Asn Tyr Val Asn
290 295 300
Leu Arg Thr Lys Thr Lys Ala
305 310
<210> 6
<211> 311
<212> PRT
<213> Artificial sequence
<400> 6
Met Asp Thr Thr Val Pro Thr Phe Ser Leu Ala Glu Leu Gln Gln Gly
1 5 10 15
Leu His Gln Asp Glu Phe Arg Arg Cys Leu Arg Asp Lys Gly Leu Phe
20 25 30
Tyr Leu Thr Asp Cys Gly Leu Thr Asp Thr Glu Leu Lys Ser Ala Lys
35 40 45
Asp Leu Val Ile Asp Phe Phe Glu His Gly Ser Glu Ala Glu Lys Arg
50 55 60
Ala Val Thr Ser Pro Val Pro Thr Thr Arg Arg Gly Phe Thr Gly Leu
65 70 75 80
Glu Ser Glu Ser Thr Ala Gln Ile Thr Asn Thr Gly Ser Tyr Thr Asp
85 90 95
Tyr Ser Met Cys Tyr Ser Met Gly Thr Ser Asp Asn Leu Phe Pro Thr
100 105 110
Ala Asp Phe Glu Arg Val Trp Thr His Tyr Phe Asp Arg Met Tyr Asp
115 120 125
Ala Ser Arg Glu Val Ala Arg Gln Val Leu Lys Ala Thr Gly Thr Glu
130 135 140
Pro Asp Gly Gly Val Asp Ala Phe Leu Asp Cys Glu Pro Leu Leu Arg
145 150 155 160
Phe Arg Tyr Phe Pro Glu Val Pro Glu His Arg Ser Ala Glu Glu Glu
165 170 175
Pro Leu Arg Met Ala Pro His Tyr Asp Leu Ser Met Val Thr Leu Ile
180 185 190
Gln Gln Thr Pro Cys Ala Asn Gly Phe Val Ser Leu Gln Ala Glu Val
195 200 205
Asp Gly Thr Phe Val Asp Leu Pro Ala Arg Pro Asp Ala Val Leu Val
210 215 220
Phe Cys Gly Ala Val Ala Thr Leu Val Thr Gly Gly Lys Val Lys Ala
225 230 235 240
Pro Lys His His Val Val Ala Pro Ala Arg Asp Arg Ile Ala Gly Ser
245 250 255
Ser Arg Thr Ser Ser Val Phe Phe Leu Arg Pro Asn Ala Asp Phe Thr
260 265 270
Phe Ser Val Pro Leu Ala Lys Arg Cys Gly Phe Asp Ile Gly Leu Asp
275 280 285
Gly Asp Thr Ala Ala Phe Gln Asp Trp Ile Ala Gly Asn Tyr Val Asn
290 295 300
Leu Arg Thr Lys Thr Lys Ala
305 310
<210> 7
<211> 311
<212> PRT
<213> Artificial sequence
<400> 7
Met Asp Thr Thr Val Pro Thr Phe His Leu Ala Glu Leu Gln Glu Gly
1 5 10 15
Leu His Gln Asp Glu Phe Arg Ser Cys Leu Met Glu Lys Gly Val Phe
20 25 30
Tyr Leu Thr Gly Ser Ser Leu Ser Glu Ala Asp Gln Lys Ser Ala Lys
35 40 45
Asp Val Val Ile Asp Phe Phe Glu His Gly Ser Glu Ala Glu Lys Arg
50 55 60
Ala Val Thr Ser Pro Val Pro Thr Thr Arg Arg Gly Phe Thr Gly Leu
65 70 75 80
Glu Ser Glu Ser Thr Ala Gln Ile Thr Asn Thr Gly Ser Tyr Ser Asp
85 90 95
Tyr Ser Met Cys Tyr Ser Met Gly Thr Ala Asp Asn Leu Phe Pro Ser
100 105 110
Gly Asp Phe Glu Arg Ile Trp Thr His Tyr Phe Asp Arg Met Tyr Asp
115 120 125
Ala Ser Arg Glu Val Ala Arg Gln Val Leu Lys Ala Thr Gly Thr Glu
130 135 140
Pro Asp Gly Gly Val Asp Ala Phe Leu Asp Tyr Glu Pro Leu Leu Arg
145 150 155 160
Phe Arg Tyr Phe Pro Glu Val Pro Glu His Arg Ser Ala Glu Glu Glu
165 170 175
Pro Leu Arg Met Ala Pro His His Asp Leu Ser Ile Val Thr Leu Ile
180 185 190
Gln Gln Thr Pro Cys Ala Asn Gly Phe Val Ser Leu Gln Ala Glu Val
195 200 205
Asp Gly Thr Phe Val Asp Leu Pro Ala Arg Pro Asp Ala Val Leu Val
210 215 220
Phe Cys Gly Ala Val Ala Thr Leu Val Thr Gly Gly Lys Val Lys Ala
225 230 235 240
Pro Arg His His Val Val Ala Pro Ala Arg Asp Arg Ile Ala Gly Ser
245 250 255
Ser Arg Thr Ser Ser Val Phe Phe Leu Arg Pro Asn Ala Asp Phe Thr
260 265 270
Phe Ser Val Pro Leu Ala Lys Arg Cys Gly Phe Asp Ile Gly Leu Asp
275 280 285
Gly Asp Thr Ala Ala Phe Gln Asp Trp Ile Ala Gly Asn Tyr Val Asn
290 295 300
Leu Arg Thr Lys Thr Lys Ala
305 310
<210> 8
<211> 311
<212> PRT
<213> Artificial sequence
<400> 8
Met Asp Thr Thr Val Pro Thr Phe His Leu Ala Glu Leu Gln Glu Gly
1 5 10 15
Leu His Gln Asp Glu Phe Arg Ser Cys Leu Met Glu Lys Gly Val Phe
20 25 30
Tyr Leu Thr Gly Ser Ser Leu Ser Glu Ala Asp Gln Lys Ser Ala Lys
35 40 45
Asp Val Val Ile Asp Phe Phe Glu His Gly Ser Glu Ala Glu Lys Arg
50 55 60
Ala Val Thr Ser Pro Val Pro Thr Thr Arg Arg Gly Phe Thr Gly Leu
65 70 75 80
Glu Ser Glu Ser Thr Ala Gln Ile Thr Asn Thr Gly Ser Tyr Ser Asp
85 90 95
Tyr Ser Met Cys Tyr Ser Met Gly Thr Ala Asp Asn Leu Phe Pro Ser
100 105 110
Gly Asp Phe Glu Arg Ile Trp Thr His Tyr Phe Asp Arg Met Tyr Asp
115 120 125
Ala Ser Arg Glu Val Ala Arg Gln Val Leu Lys Ala Thr Gly Thr Glu
130 135 140
Pro Asp Gly Gly Val Asp Ala Phe Leu Asp Cys Glu Pro Leu Leu Arg
145 150 155 160
Phe Arg Tyr Phe Pro Glu Val Pro Glu His Arg Ser Ala Glu Glu Glu
165 170 175
Pro Leu Arg Met Ala Pro His Tyr Asp Leu Ser Ile Val Thr Leu Ile
180 185 190
Gln Gln Thr Pro Cys Ala Asn Gly Phe Val Ser Leu Gln Ala Glu Val
195 200 205
Asp Gly Thr Phe Val Asp Leu Pro Ala Arg Pro Asp Ala Val Leu Val
210 215 220
Phe Cys Gly Ala Val Ala Thr Leu Val Thr Gly Gly Lys Val Lys Ala
225 230 235 240
Pro Arg His His Val Val Ala Pro Ala Arg Asp Arg Ile Ala Gly Ser
245 250 255
Ser Arg Thr Ser Ser Val Phe Phe Leu Arg Pro Asn Ala Asp Phe Thr
260 265 270
Phe Ser Ile Pro Leu Ala Lys Arg Tyr Gly Phe Asp Ile Gly Leu Asp
275 280 285
Gly Asp Thr Ala Ala Phe Gln Asp Trp Ile Ala Gly Asn Tyr Val Asn
290 295 300
Met Arg Thr Lys Thr Lys Ala
305 310
<210> 9
<211> 311
<212> PRT
<213> Artificial sequence
<400> 9
Met Asp Thr Thr Val Pro Thr Phe His Leu Ala Glu Leu Gln Glu Gly
1 5 10 15
Leu His Gln Asp Glu Phe Arg Ser Cys Leu Met Glu Lys Gly Val Phe
20 25 30
Tyr Leu Thr Gly Ser Ser Leu Ser Glu Ala Asp Gln Lys Ser Ala Lys
35 40 45
Asp Val Val Ile Asp Phe Phe Glu His Gly Ser Glu Ala Glu Lys Arg
50 55 60
Ala Val Thr Ser Pro Val Pro Thr Thr Arg Arg Gly Phe Thr Gly Leu
65 70 75 80
Glu Ser Glu Ser Thr Ala Gln Ile Thr Asn Thr Gly Ser Tyr Ser Asp
85 90 95
Tyr Ser Met Cys Tyr Ser Met Gly Thr Ala Asp Asn Leu Phe Pro Ser
100 105 110
Gly Asp Phe Glu Arg Ile Trp Thr His Tyr Phe Asp Arg Met Tyr Asp
115 120 125
Ala Ser Arg Glu Val Ala Arg Gln Val Leu Lys Ala Thr Gly Thr Glu
130 135 140
Pro Asp Gly Gly Val Asp Ala Phe Leu Asp Tyr Glu Pro Leu Leu Arg
145 150 155 160
Phe Arg Tyr Phe Pro Glu Val Pro Glu His Arg Ser Ala Glu Glu Glu
165 170 175
Pro Leu Arg Met Ala Pro His His Asp Leu Ser Ile Val Thr Leu Ile
180 185 190
Gln Gln Thr Pro Cys Ala Asn Gly Phe Val Ser Leu Gln Ala Glu Val
195 200 205
Asp Gly Thr Phe Val Asp Leu Pro Ala Arg Pro Asp Ala Val Leu Val
210 215 220
Phe Cys Gly Ala Val Ala Thr Leu Val Thr Gly Gly Lys Val Lys Ala
225 230 235 240
Pro Arg His His Val Val Ala Pro Ala Arg Asp Arg Ile Ala Gly Ser
245 250 255
Ser Arg Thr Ser Ser Val Phe Phe Leu Arg Pro Asn Ala Asp Phe Thr
260 265 270
Phe Ser Ile Pro Leu Ala Lys Arg Tyr Gly Phe Asp Ile Gly Leu Asp
275 280 285
Gly Asp Thr Ala Ala Phe Gln Asp Trp Ile Ala Gly Asn Tyr Val Asn
290 295 300
Met Arg Thr Lys Thr Lys Ala
305 310
<210> 10
<211> 311
<212> PRT
<213> Artificial sequence
<400> 10
Met Asp Thr Thr Val Pro Thr Phe Ser Leu Ala Glu Leu Gln Gln Gly
1 5 10 15
Leu His Gln Asp Glu Phe Arg Arg Cys Leu Arg Asp Lys Gly Leu Phe
20 25 30
Tyr Leu Thr Asp Cys Gly Leu Thr Asp Thr Glu Leu Lys Ser Ala Lys
35 40 45
Asp Leu Val Ile Asp Phe Phe Glu His Gly Ser Glu Ala Glu Lys Arg
50 55 60
Ala Val Thr Ser Pro Val Pro Thr Thr Arg Arg Gly Phe Thr Gly Leu
65 70 75 80
Glu Ser Glu Ser Thr Ala Gln Ile Thr Asn Thr Gly Ser Tyr Thr Asp
85 90 95
Tyr Ser Met Cys Tyr Ser Met Gly Thr Ser Asp Asn Leu Phe Pro Thr
100 105 110
Ala Asp Phe Glu Arg Val Trp Thr His Tyr Phe Asp Arg Met Tyr Asp
115 120 125
Ala Ser Arg Glu Val Ala Arg Gln Val Leu Lys Ala Thr Gly Thr Glu
130 135 140
Pro Asp Gly Gly Val Asp Ala Phe Leu Asp Tyr Glu Pro Leu Leu Arg
145 150 155 160
Phe Arg Tyr Phe Pro Glu Val Pro Glu His Arg Ser Ala Glu Glu Glu
165 170 175
Pro Leu Arg Met Ala Pro His His Asp Leu Ser Met Val Thr Leu Ile
180 185 190
Gln Gln Thr Pro Cys Ala Asn Gly Phe Val Ser Leu Gln Ala Glu Val
195 200 205
Asp Gly Thr Phe Val Asp Leu Pro Ala Arg Pro Asp Ala Val Leu Val
210 215 220
Phe Cys Gly Ala Val Ala Thr Leu Val Thr Gly Gly Lys Val Lys Ala
225 230 235 240
Pro Lys His His Val Val Ala Pro Ala Arg Asp Arg Ile Ala Gly Ser
245 250 255
Ser Arg Thr Ser Ser Val Phe Phe Leu Arg Pro Asn Ala Asp Phe Thr
260 265 270
Phe Ser Val Pro Leu Ala Lys Arg Cys Gly Phe Asp Ile Gly Leu Asp
275 280 285
Gly Asp Thr Ala Ala Phe Gln Asp Trp Ile Ala Gly Asn Tyr Val Asn
290 295 300
Leu Arg Thr Lys Thr Lys Ala
305 310
<210> 11
<211> 311
<212> PRT
<213> Artificial sequence
<400> 11
Met Asp Thr Thr Val Pro Thr Phe Ser Leu Ala Glu Leu Gln Gln Gly
1 5 10 15
Leu His Gln Asp Glu Phe Arg Arg Cys Leu Arg Asp Lys Gly Leu Phe
20 25 30
Tyr Leu Thr Asp Cys Gly Leu Thr Asp Thr Glu Leu Lys Ser Ala Lys
35 40 45
Asp Leu Val Ile Asp Phe Phe Glu His Gly Ser Glu Ala Glu Lys Arg
50 55 60
Ala Val Thr Ser Pro Val Pro Thr Thr Arg Arg Gly Phe Thr Gly Leu
65 70 75 80
Glu Ser Glu Ser Thr Ala Gln Ile Thr Asn Thr Gly Ser Tyr Thr Asp
85 90 95
Tyr Ser Met Cys Tyr Ser Met Gly Thr Ser Asp Asn Leu Phe Pro Thr
100 105 110
Ala Asp Phe Glu Arg Val Trp Thr His Tyr Phe Asp Arg Met Tyr Asp
115 120 125
Ala Ser Arg Glu Val Ala Arg Gln Val Leu Lys Ala Thr Gly Thr Glu
130 135 140
Pro Asp Gly Gly Val Asp Ala Phe Leu Asp Cys Glu Pro Leu Leu Arg
145 150 155 160
Phe Arg Tyr Phe Pro Glu Val Pro Glu His Arg Ser Ala Glu Glu Glu
165 170 175
Pro Leu Arg Met Ala Pro His Tyr Asp Leu Ser Met Val Thr Leu Ile
180 185 190
Gln Gln Thr Pro Cys Ala Asn Gly Phe Val Ser Leu Gln Ala Glu Val
195 200 205
Asp Gly Thr Phe Val Asp Leu Pro Ala Arg Pro Asp Ala Val Leu Val
210 215 220
Phe Cys Gly Ala Val Ala Thr Leu Val Thr Gly Gly Lys Val Lys Ala
225 230 235 240
Pro Lys His His Val Val Ala Pro Ala Arg Asp Arg Ile Ala Gly Ser
245 250 255
Ser Arg Thr Ser Ser Val Phe Phe Leu Arg Pro Asn Ala Asp Phe Thr
260 265 270
Phe Ser Ile Pro Leu Ala Lys Arg Tyr Gly Phe Asp Ile Gly Leu Asp
275 280 285
Gly Asp Thr Ala Ala Phe Gln Asp Trp Ile Ala Gly Asn Tyr Val Asn
290 295 300
Met Arg Thr Lys Thr Lys Ala
305 310
<210> 12
<211> 311
<212> PRT
<213> Artificial sequence
<400> 12
Met Asp Thr Thr Val Pro Thr Phe Ser Leu Ala Glu Leu Gln Gln Gly
1 5 10 15
Leu His Gln Asp Glu Phe Arg Arg Cys Leu Arg Asp Lys Gly Leu Phe
20 25 30
Tyr Leu Thr Asp Cys Gly Leu Thr Asp Thr Glu Leu Lys Ser Ala Lys
35 40 45
Asp Leu Val Ile Asp Phe Phe Glu His Gly Ser Glu Ala Glu Lys Arg
50 55 60
Ala Val Thr Ser Pro Val Pro Thr Thr Arg Arg Gly Phe Thr Gly Leu
65 70 75 80
Glu Ser Glu Ser Thr Ala Gln Ile Thr Asn Thr Gly Ser Tyr Thr Asp
85 90 95
Tyr Ser Met Cys Tyr Ser Met Gly Thr Ser Asp Asn Leu Phe Pro Thr
100 105 110
Ala Asp Phe Glu Arg Val Trp Thr His Tyr Phe Asp Arg Met Tyr Asp
115 120 125
Ala Ser Arg Glu Val Ala Arg Gln Val Leu Lys Ala Thr Gly Thr Glu
130 135 140
Pro Asp Gly Gly Val Asp Ala Phe Leu Asp Tyr Glu Pro Leu Leu Arg
145 150 155 160
Phe Arg Tyr Phe Pro Glu Val Pro Glu His Arg Ser Ala Glu Glu Glu
165 170 175
Pro Leu Arg Met Ala Pro His His Asp Leu Ser Met Val Thr Leu Ile
180 185 190
Gln Gln Thr Pro Cys Ala Asn Gly Phe Val Ser Leu Gln Ala Glu Val
195 200 205
Asp Gly Thr Phe Val Asp Leu Pro Ala Arg Pro Asp Ala Val Leu Val
210 215 220
Phe Cys Gly Ala Val Ala Thr Leu Val Thr Gly Gly Lys Val Lys Ala
225 230 235 240
Pro Lys His His Val Val Ala Pro Ala Arg Asp Arg Ile Ala Gly Ser
245 250 255
Ser Arg Thr Ser Ser Val Phe Phe Leu Arg Pro Asn Ala Asp Phe Thr
260 265 270
Phe Ser Ile Pro Leu Ala Lys Arg Tyr Gly Phe Asp Ile Gly Leu Asp
275 280 285
Gly Asp Thr Ala Ala Phe Gln Asp Trp Ile Ala Gly Asn Tyr Val Asn
290 295 300
Met Arg Thr Lys Thr Lys Ala
305 310
<210> 13
<211> 311
<212> PRT
<213> Artificial sequence
<400> 13
Met Asp Thr Thr Val Pro Thr Phe Ser Leu Ala Glu Leu Gln Gln Gly
1 5 10 15
Leu His Gln Asp Glu Phe Arg Arg Cys Leu Arg Asp Lys Gly Leu Phe
20 25 30
Tyr Leu Thr Asp Cys Gly Leu Thr Asp Thr Glu Leu Lys Ser Ala Lys
35 40 45
Asp Leu Val Ile Asp Phe Phe Glu His Gly Ser Glu Ala Glu Lys Arg
50 55 60
Ala Val Thr Ser Pro Val Pro Thr Thr Arg Arg Gly Phe Thr Gly Leu
65 70 75 80
Glu Ser Glu Ser Thr Ala Gln Ile Thr Asn Thr Gly Ser Tyr Ser Asp
85 90 95
Tyr Ser Met Cys Tyr Ser Met Gly Thr Ala Asp Asn Leu Phe Pro Ser
100 105 110
Gly Asp Phe Glu Arg Ile Trp Thr Gln Tyr Phe Asp Arg Gln Tyr Thr
115 120 125
Ala Ser Arg Ala Val Ala Arg Glu Val Leu Arg Ala Thr Gly Thr Glu
130 135 140
Pro Asp Gly Gly Val Glu Ala Phe Leu Asp Tyr Glu Pro Leu Leu Arg
145 150 155 160
Phe Arg Tyr Phe Pro Gln Val Pro Glu His Arg Ser Ala Glu Glu Gln
165 170 175
Pro Leu Arg Met Ala Pro His His Asp Leu Ser Met Val Thr Leu Ile
180 185 190
Gln Gln Thr Pro Cys Ala Asn Gly Phe Val Ser Leu Gln Ala Glu Val
195 200 205
Gly Gly Ala Phe Val Asp Leu Pro Tyr Arg Pro Asp Ala Val Leu Val
210 215 220
Phe Cys Gly Ala Ile Ala Thr Leu Val Thr Gly Gly Gln Val Lys Ala
225 230 235 240
Pro Arg His His Val Ala Ala Pro Arg Arg Asp Gln Ile Ala Gly Ser
245 250 255
Ser Arg Thr Ser Ser Val Phe Phe Leu Arg Pro Asn Ala Asp Phe Thr
260 265 270
Phe Ser Ile Pro Leu Ala Arg Glu Tyr Gly Phe Asp Val Ser Leu Asp
275 280 285
Gly Glu Thr Ala Thr Phe Gln Asp Trp Ile Gly Gly Asn Tyr Lys Asn
290 295 300
Met Arg Arg Thr Ser Lys Ala
305 310
Claims (10)
1. A protein which is (b), (c) or (d) below:
(b) The amino acid sequence of the protein is shown as SEQ ID No.5, or the protein which is derived from the protein (b) and has the activity of expandase, wherein the amino acid sequence in the protein (b) is substituted, deleted or added with one or more amino acids;
(c) The amino acid sequence of the protein is shown as SEQ ID No.6, or the protein which is obtained by substituting, deleting or adding one or more amino acids in the amino acid sequence in (c) and has the activity of expandase and is derived from (c);
(d) The amino acid sequence of the protein is shown as SEQ ID No.13, or the protein which is derived from (d) and has the activity of expandase, wherein one or more amino acids are substituted, deleted or added in the amino acid sequence in (d).
2. The protein of claim 1, wherein: the protein is any one of the following (1) to (6):
(1) A protein represented by the amino acid sequence shown in SEQ ID NO.5 wherein n1 is at position 155 and n2 is at position 184;
(2) A protein represented by the amino acid sequence shown by SEQ ID NO.5 wherein the 275 th position is n3, the 281 th amino acid residue is n4, and the 305 th position is n 5;
(3) A protein represented by the amino acid sequence shown in SEQ ID NO.5 wherein the amino acid sequence at position 155 is n1, the amino acid sequence at position 184 is n2, the amino acid sequence at position 275 is n3, the amino acid sequence at position 281 is n4, and the amino acid sequence at position 305 is n 5;
(4) A protein represented by the amino acid sequence shown in SEQ ID NO.6 with m1 at position 155 and m2 at position 184;
(5) A protein represented by the amino acid sequence shown in SEQ ID NO.6, wherein m3 is at position 275, m4 is at position 281, and m5 is at position 305;
(6) M1 at position 155, m2 at position 184, m3 at position 275, m4 at position 281, and m5 at position 305, and a protein represented by the amino acid sequence of SEQ ID NO. 6.
3. The protein of claim 2, wherein: SEQ ID NO:5 wherein n1 at position 155 is: cysteine, tyrosine, glycine, alanine, valine, leucine, isoleucine, proline, tryptophan, serine, phenylalanine, asparagine, glutamine, threonine, aspartic acid, glutamic acid, lysine, arginine, or histidine, n2 at position 184 is: tyrosine, histidine, glycine, alanine, valine, leucine, isoleucine, methionine, proline, tryptophan, serine, cysteine, phenylalanine, asparagine, glutamine, threonine, aspartic acid, glutamic acid, lysine or arginine, n3 at position 275 is: valine, isoleucine, glycine, alanine, leucine, methionine, proline, tryptophan, serine, tyrosine, cysteine, phenylalanine, asparagine, glutamine, threonine, aspartic acid, glutamic acid, lysine, arginine or histidine, n4 at position 281 is: cysteine, tyrosine, glycine, alanine, valine, leucine, isoleucine, proline, tryptophan, serine, phenylalanine, asparagine, glutamine, threonine, aspartic acid, glutamic acid, lysine, arginine, or histidine, n5 at position 305 is: leucine, methionine, glycine, alanine, valine, isoleucine, proline, tryptophan, serine, tyrosine, cysteine, phenylalanine, asparagine, glutamine, threonine, aspartic acid, glutamic acid, lysine, arginine, or histidine;
SEQ ID NO:6, wherein m1 at position 155 in the amino acid sequence is phenylalanine, lysine, glycine, alanine, valine, leucine, isoleucine, methionine, proline, tryptophan, serine, tyrosine, cysteine, asparagine, glutamine, threonine, aspartic acid, glutamic acid, arginine or histidine, m2 at position 184 is aspartic acid, glutamic acid, glycine, alanine, valine, leucine, isoleucine, methionine, proline, tryptophan, serine, tyrosine, cysteine, phenylalanine, asparagine, glutamine, threonine, lysine, arginine or histidine, m3 at position 275 is glutamic acid, histidine, glycine, alanine, valine, leucine, isoleucine, methionine, proline, tryptophan, serine, tyrosine, cysteine, phenylalanine, asparagine, glutamine, threonine, aspartic acid, lysine or arginine, m4 at position 281 is: tyrosine, proline, glycine, alanine, valine, leucine, isoleucine, methionine, tryptophan, serine, cysteine, phenylalanine, asparagine, glutamine, threonine, aspartic acid, glutamic acid, lysine, arginine or histidine, and m5 at position 305 is isoleucine, methionine, glycine, alanine, valine, leucine, proline, tryptophan, serine, tyrosine, cysteine, phenylalanine, asparagine, glutamine, threonine, aspartic acid, glutamic acid, lysine, arginine or histidine.
4. The protein of any one of claims 1-3, wherein: the protein is any one of the following (1) to (6):
(1) The amino acid sequence of the protein is shown in SEQ ID NO. 7, wherein the 155 th amino acid residue of the amino acid sequence shown in SEQ ID NO.5 is mutated from cysteine to tyrosine, and the 184 th amino acid residue is mutated from tyrosine to histidine;
(2) The amino acid sequence of the protein is shown in SEQ ID NO. 8, wherein the 275 th amino acid residue of the amino acid sequence shown in SEQ ID NO.5 is mutated from valine to isoleucine, the 281 th amino acid residue is mutated from cysteine to tyrosine, and the 305 th amino acid residue is mutated from leucine to methionine;
(3) The amino acid sequence of the protein is shown in SEQ ID NO.5, wherein the 155 th amino acid residue of the amino acid sequence is mutated from cysteine to tyrosine, the 184 th amino acid residue is mutated from tyrosine to histidine, the 275 th amino acid residue is mutated from valine to isoleucine, the 281 th amino acid residue is mutated from cysteine to tyrosine, and the 305 th amino acid residue is mutated from leucine to methionine, and the amino acid sequence is shown in SEQ ID NO. 9;
(4) The amino acid sequence of the protein is shown in SEQ ID NO. 10, wherein the 155 th amino acid residue of the amino acid sequence shown in SEQ ID NO.6 is mutated from cysteine to tyrosine, and the 184 th amino acid residue is mutated from tyrosine to histidine;
(5) The protein is obtained by mutating the 275 th amino acid residue of the amino acid sequence shown in SEQ ID NO.6 from valine to isoleucine, the 281 th amino acid residue from cysteine to tyrosine, and the 305 th amino acid residue from isoleucine to methionine, and the amino acid sequence of the protein is shown in SEQ ID NO. 11;
(6) The protein is obtained by mutating the 155 th amino acid residue of the amino acid sequence shown in SEQ ID NO.6 from cysteine to tyrosine, mutating the 184 th amino acid residue from tyrosine to histidine, mutating the 275 th amino acid residue from valine to isoleucine, mutating the 281 th amino acid residue from cysteine to tyrosine, and mutating the 305 th amino acid residue from isoleucine to methionine, and the amino acid sequence of the protein is shown in SEQ ID No. 12.
5. A nucleic acid molecule encoding the protein according to any one of claims 1 to 4, which is a gene encoding the protein according to any one of claims 1 to 4.
6. A recombinant vector, expression cassette, transgenic cell line or recombinant bacterium comprising the nucleic acid molecule of claim 5.
7. Use of a protein according to any one of claims 1 to 4 as penicillin expandase.
8. Use of a protein as defined in any one of claims 1-4 as penicillin expandase for catalyzing the expansion of penicillin G to G-7-ADCA for increasing the conversion of penicillin G.
9. Use of the protein of any one of claims 1 to 4 or the nucleic acid molecule of claim 5 or the recombinant vector, expression cassette, transgenic cell line or recombinant bacterium of claim 6 in any one of:
(b1) Preparing a product having penicillin expandase activity;
(b2) Preparation of G-7-ADCA.
10. A process for the preparation of G-7-ADCA or a process for increasing the conversion of penicillin G in a penicillin expandase catalyzed penicillin G expansion to G-7-ADCA, wherein the penicillin G conversion is one of the following:
1) Preparing a protein according to any one of claims 1 to 4; catalyzing penicillin G to expand ring by using the protein as penicillin expandase to generate G-7-ADCA;
2) Fermenting a microorganism containing a gene encoding a protein according to any of claims 1-4, so that the fermentation product obtained catalyzes the expansion of penicillin G to G-7-ADCA;
3) Cloning the gene encoding the protein of any one of claims 1-4 into a strain producing penicillin G to obtain a recombinant strain, and fermenting the recombinant strain to catalyze the ring expansion of penicillin G to obtain G-7-ADCA.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110676893.1A CN115491361A (en) | 2021-06-18 | 2021-06-18 | Application of expandase and mutant thereof in production of G-7-ADCA |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110676893.1A CN115491361A (en) | 2021-06-18 | 2021-06-18 | Application of expandase and mutant thereof in production of G-7-ADCA |
Publications (1)
Publication Number | Publication Date |
---|---|
CN115491361A true CN115491361A (en) | 2022-12-20 |
Family
ID=84465268
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110676893.1A Pending CN115491361A (en) | 2021-06-18 | 2021-06-18 | Application of expandase and mutant thereof in production of G-7-ADCA |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115491361A (en) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
IN188444B (en) * | 1996-03-29 | 2002-09-28 | Werner Grabher | |
CN1448506A (en) * | 2002-04-01 | 2003-10-15 | 骏翰生化股份有限公司 | Mutation penicillin ring enlargement enzyme and process preparing 7-ADCA using same |
US20090087893A1 (en) * | 2004-04-22 | 2009-04-02 | Orchid Chemicals & Pharmaceutical Ltd. | Modified Expandase Enzyme and its use |
CN107304418A (en) * | 2016-04-18 | 2017-10-31 | 百瑞全球有限公司 | Penicillin ring enlargement enzyme mutant, the DNA for encoding the mutant, the kit containing the mutant and its application |
CN112852913A (en) * | 2020-04-17 | 2021-05-28 | 中国科学院天津工业生物技术研究所 | Deacetoxycephalosporin C synthetase mutant and application thereof in synthesis of beta-lactam antibiotic parent nucleus |
-
2021
- 2021-06-18 CN CN202110676893.1A patent/CN115491361A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
IN188444B (en) * | 1996-03-29 | 2002-09-28 | Werner Grabher | |
CN1448506A (en) * | 2002-04-01 | 2003-10-15 | 骏翰生化股份有限公司 | Mutation penicillin ring enlargement enzyme and process preparing 7-ADCA using same |
US20090087893A1 (en) * | 2004-04-22 | 2009-04-02 | Orchid Chemicals & Pharmaceutical Ltd. | Modified Expandase Enzyme and its use |
CN107304418A (en) * | 2016-04-18 | 2017-10-31 | 百瑞全球有限公司 | Penicillin ring enlargement enzyme mutant, the DNA for encoding the mutant, the kit containing the mutant and its application |
CN112852913A (en) * | 2020-04-17 | 2021-05-28 | 中国科学院天津工业生物技术研究所 | Deacetoxycephalosporin C synthetase mutant and application thereof in synthesis of beta-lactam antibiotic parent nucleus |
Non-Patent Citations (2)
Title |
---|
LIN等: "Reconstitution of TCA cycle with DAOCS to engineer Escherichia coli into an efficient whole cell catalyst of penicillin G", PNAS, vol. 112, no. 32, 11 August 2015 (2015-08-11), pages 9855 * |
季俊杰;张红梅;: "定点突变改造棒状链霉菌青霉素扩环酶", 江苏农业科学, no. 08, 25 August 2013 (2013-08-25), pages 53 - 54 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Altenbuchner et al. | Hydantoinases and related enzymes as biocatalysts for the synthesis of unnatural chiral amino acids | |
CN112877307B (en) | Amino acid dehydrogenase mutant and application thereof | |
WO2017143944A1 (en) | Penicillin g acylase mutant | |
Yang et al. | Expression and purification of extracellular penicillin G acylase in Bacillus subtilis | |
CN111778229B (en) | Cyclohexene carboxylate hydrolase, mutant thereof, encoding gene, expression vector, recombinant bacterium and application | |
CN113355299B (en) | Ketoacid reductase, gene, engineering bacterium and application in synthesis of chiral aromatic 2-hydroxy acid | |
CN112852913B (en) | Deacetoxycephalosporin C synthetase mutant and application thereof in synthesis of beta-lactam antibiotic parent nucleus | |
Forney et al. | Alteration of the catalytic efficiency of penicillin amidase from Escherichia coli | |
CN115491361A (en) | Application of expandase and mutant thereof in production of G-7-ADCA | |
Luo et al. | Cloning and co-expression of D-amino acid oxidase and glutaryl-7-aminocephalosporanic acid acylase genes in Escherichia coli | |
CN110699345A (en) | Halogen alcohol dehalogenase mutant and application thereof | |
CN114196659B (en) | Amidase mutant, coding gene, engineering bacteria and application thereof | |
CN117603930B (en) | Recombinant bacterium for expressing mutant sirohem synthase | |
CN113755539B (en) | Dihydropyrimidine amino hydrolase and application thereof | |
CN115505581B (en) | Aromatic amine-N acetyl transferase mutant and application thereof | |
CN112011523B (en) | Acetylacetone lyase mutant capable of improving acetylacetone synthesis efficiency, gene, expression vector, cell and application thereof | |
CN112852912B (en) | Method for synthesizing 7-aminodesacetoxycephalosporanic acid | |
CN109913431A (en) | A kind of monoamine oxidase from aspergillus albicans is used for the preparation of chiral amine intermediates | |
CN115029329B (en) | Carbonyl reductase mutant and application thereof in preparation of R-mandelic acid | |
CN117947137A (en) | Penicillin amidase for synthesizing cefaclor | |
CN117247928A (en) | Cephalosporin C acylase catalyzed acylation reaction and application thereof | |
CN117247925A (en) | Cephalosporin C acylase mutant and application thereof in cephalosporin mother nucleus synthesis | |
CN117247927A (en) | Artificial synthesis of cephalosporin C acylase and mutant application thereof | |
CN118185913A (en) | Paracoccus-derived amidase and application of mutant thereof | |
CN118272331A (en) | Alkene reductase mutant and application thereof in (R) -citronellal synthesis |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |