CN113493780A - 一种利用sumo融合表达系统制备重组肝素酶ii的方法及其所制备的sumo_肝素酶ii融合蛋白 - Google Patents
一种利用sumo融合表达系统制备重组肝素酶ii的方法及其所制备的sumo_肝素酶ii融合蛋白 Download PDFInfo
- Publication number
- CN113493780A CN113493780A CN202110097134.XA CN202110097134A CN113493780A CN 113493780 A CN113493780 A CN 113493780A CN 202110097134 A CN202110097134 A CN 202110097134A CN 113493780 A CN113493780 A CN 113493780A
- Authority
- CN
- China
- Prior art keywords
- heparinase
- sumo
- fusion protein
- protein
- lys
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 108010006406 heparinase II Proteins 0.000 title claims abstract description 86
- 102000037865 fusion proteins Human genes 0.000 title claims abstract description 64
- 108020001507 fusion proteins Proteins 0.000 title claims abstract description 64
- 238000000034 method Methods 0.000 title claims abstract description 32
- 230000004927 fusion Effects 0.000 title claims abstract description 12
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 49
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 47
- 102000051619 SUMO-1 Human genes 0.000 claims abstract description 24
- 238000000746 purification Methods 0.000 claims abstract description 16
- 241000605114 Pedobacter heparinus Species 0.000 claims abstract description 15
- 108700038981 SUMO-1 Proteins 0.000 claims abstract description 15
- 101000708016 Caenorhabditis elegans Sentrin-specific protease Proteins 0.000 claims abstract description 13
- 239000013612 plasmid Substances 0.000 claims abstract description 13
- 108091028043 Nucleic acid sequence Proteins 0.000 claims abstract description 9
- 238000005520 cutting process Methods 0.000 claims abstract description 9
- 241000588724 Escherichia coli Species 0.000 claims abstract description 8
- 108010076504 Protein Sorting Signals Proteins 0.000 claims abstract description 6
- 230000001131 transforming effect Effects 0.000 claims abstract description 4
- 101000879203 Caenorhabditis elegans Small ubiquitin-related modifier Proteins 0.000 claims abstract 9
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract 3
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 claims description 24
- 238000001042 affinity chromatography Methods 0.000 claims description 9
- 239000007853 buffer solution Substances 0.000 claims description 8
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 claims description 5
- 238000002156 mixing Methods 0.000 claims description 4
- 102100024025 Heparanase Human genes 0.000 claims 1
- 108010037536 heparanase Proteins 0.000 claims 1
- 210000004027 cell Anatomy 0.000 abstract description 8
- 210000003000 inclusion body Anatomy 0.000 abstract description 3
- 238000002360 preparation method Methods 0.000 abstract description 3
- 230000015572 biosynthetic process Effects 0.000 abstract description 2
- 230000002255 enzymatic effect Effects 0.000 abstract 1
- 235000018102 proteins Nutrition 0.000 description 30
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 16
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 15
- 239000006228 supernatant Substances 0.000 description 15
- 230000006698 induction Effects 0.000 description 14
- 150000001413 amino acids Chemical group 0.000 description 12
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 12
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 11
- 108010022901 Heparin Lyase Proteins 0.000 description 10
- 238000004458 analytical method Methods 0.000 description 9
- 239000007983 Tris buffer Substances 0.000 description 8
- 238000001962 electrophoresis Methods 0.000 description 8
- 239000011780 sodium chloride Substances 0.000 description 8
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 8
- 239000000872 buffer Substances 0.000 description 7
- 241000894006 Bacteria Species 0.000 description 6
- 230000000694 effects Effects 0.000 description 6
- 239000000243 solution Substances 0.000 description 6
- 108010047495 alanylglycine Proteins 0.000 description 5
- 108010047857 aspartylglycine Proteins 0.000 description 5
- 230000001580 bacterial effect Effects 0.000 description 5
- 108010089804 glycyl-threonine Proteins 0.000 description 5
- 230000001939 inductive effect Effects 0.000 description 5
- 239000012528 membrane Substances 0.000 description 5
- 238000005457 optimization Methods 0.000 description 5
- 230000001376 precipitating effect Effects 0.000 description 5
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 4
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 4
- 238000001976 enzyme digestion Methods 0.000 description 4
- 239000007788 liquid Substances 0.000 description 4
- 108010064235 lysylglycine Proteins 0.000 description 4
- 108010070643 prolylglutamic acid Proteins 0.000 description 4
- 239000012474 protein marker Substances 0.000 description 4
- 108010020532 tyrosyl-proline Proteins 0.000 description 4
- 108010073969 valyllysine Proteins 0.000 description 4
- 238000005406 washing Methods 0.000 description 4
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 3
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 3
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 3
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 3
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 3
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 3
- 101710175625 Maltose/maltodextrin-binding periplasmic protein Proteins 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 3
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 3
- 239000002033 PVDF binder Substances 0.000 description 3
- 238000001261 affinity purification Methods 0.000 description 3
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 3
- 238000005119 centrifugation Methods 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- UQLDLKMNUJERMK-UHFFFAOYSA-L di(octadecanoyloxy)lead Chemical compound [Pb+2].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O UQLDLKMNUJERMK-UHFFFAOYSA-L 0.000 description 3
- 238000000855 fermentation Methods 0.000 description 3
- 230000004151 fermentation Effects 0.000 description 3
- 238000011534 incubation Methods 0.000 description 3
- 108010034529 leucyl-lysine Proteins 0.000 description 3
- 239000003055 low molecular weight heparin Substances 0.000 description 3
- 108010009298 lysylglutamic acid Proteins 0.000 description 3
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 3
- 229920002981 polyvinylidene fluoride Polymers 0.000 description 3
- 239000000047 product Substances 0.000 description 3
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 2
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 2
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 2
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 2
- QQACQIHVWCVBBR-GVARAGBVSA-N Ala-Ile-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QQACQIHVWCVBBR-GVARAGBVSA-N 0.000 description 2
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 2
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 2
- KYDYGANDJHFBCW-DRZSPHRISA-N Ala-Phe-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KYDYGANDJHFBCW-DRZSPHRISA-N 0.000 description 2
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 2
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 2
- ZDOQDYFZNGASEY-BIIVOSGPSA-N Asn-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZDOQDYFZNGASEY-BIIVOSGPSA-N 0.000 description 2
- UPALZCBCKAMGIY-PEFMBERDSA-N Asn-Gln-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UPALZCBCKAMGIY-PEFMBERDSA-N 0.000 description 2
- REQUGIWGOGSOEZ-ZLUOBGJFSA-N Asn-Ser-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N REQUGIWGOGSOEZ-ZLUOBGJFSA-N 0.000 description 2
- JNCRAQVYJZGIOW-QSFUFRPTSA-N Asn-Val-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNCRAQVYJZGIOW-QSFUFRPTSA-N 0.000 description 2
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 2
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 2
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 2
- WAEDSQFVZJUHLI-BYULHYEWSA-N Asp-Val-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WAEDSQFVZJUHLI-BYULHYEWSA-N 0.000 description 2
- YKKHFPGOZXQAGK-QWRGUYRKSA-N Cys-Gly-Tyr Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YKKHFPGOZXQAGK-QWRGUYRKSA-N 0.000 description 2
- 108020004414 DNA Proteins 0.000 description 2
- 102000004190 Enzymes Human genes 0.000 description 2
- 108090000790 Enzymes Proteins 0.000 description 2
- LVNILKSSFHCSJZ-IHRRRGAJSA-N Gln-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LVNILKSSFHCSJZ-IHRRRGAJSA-N 0.000 description 2
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 2
- VLOLPWWCNKWRNB-LOKLDPHHSA-N Gln-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VLOLPWWCNKWRNB-LOKLDPHHSA-N 0.000 description 2
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 2
- DVLZZEPUNFEUBW-AVGNSLFASA-N Glu-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N DVLZZEPUNFEUBW-AVGNSLFASA-N 0.000 description 2
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 2
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 2
- ZSIDREAPEPAPKL-XIRDDKMYSA-N Glu-Trp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N ZSIDREAPEPAPKL-XIRDDKMYSA-N 0.000 description 2
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 2
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 2
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 2
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 2
- HTTJABKRGRZYRN-UHFFFAOYSA-N Heparin Chemical compound OC1C(NC(=O)C)C(O)OC(COS(O)(=O)=O)C1OC1C(OS(O)(=O)=O)C(O)C(OC2C(C(OS(O)(=O)=O)C(OC3C(C(O)C(O)C(O3)C(O)=O)OS(O)(=O)=O)C(CO)O2)NS(O)(=O)=O)C(C(O)=O)O1 HTTJABKRGRZYRN-UHFFFAOYSA-N 0.000 description 2
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 2
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 2
- PWUMCBLVWPCKNO-MGHWNKPDSA-N Ile-Leu-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PWUMCBLVWPCKNO-MGHWNKPDSA-N 0.000 description 2
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 2
- 108010065920 Insulin Lispro Proteins 0.000 description 2
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 2
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 2
- VHNOAIFVYUQOOY-XUXIUFHCSA-N Lys-Arg-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VHNOAIFVYUQOOY-XUXIUFHCSA-N 0.000 description 2
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 2
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 2
- YEIYAQQKADPIBJ-GARJFASQSA-N Lys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O YEIYAQQKADPIBJ-GARJFASQSA-N 0.000 description 2
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 2
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 2
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 2
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 2
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 2
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 2
- FZUNSVYYPYJYAP-NAKRPEOUSA-N Met-Ile-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O FZUNSVYYPYJYAP-NAKRPEOUSA-N 0.000 description 2
- RRIHXWPHQSXHAQ-XUXIUFHCSA-N Met-Ile-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O RRIHXWPHQSXHAQ-XUXIUFHCSA-N 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- KBVJZCVLQWCJQN-KKUMJFAQSA-N Phe-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KBVJZCVLQWCJQN-KKUMJFAQSA-N 0.000 description 2
- DMEYUTSDVRCWRS-ULQDDVLXSA-N Phe-Lys-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DMEYUTSDVRCWRS-ULQDDVLXSA-N 0.000 description 2
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 2
- SSSFPISOZOLQNP-GUBZILKMSA-N Pro-Arg-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSFPISOZOLQNP-GUBZILKMSA-N 0.000 description 2
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 2
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 2
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 2
- AJBQTGZIZQXBLT-STQMWFEESA-N Pro-Phe-Gly Chemical compound C([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 AJBQTGZIZQXBLT-STQMWFEESA-N 0.000 description 2
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 2
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 2
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 2
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 2
- OSFZCEQJLWCIBG-BZSNNMDCSA-N Ser-Tyr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OSFZCEQJLWCIBG-BZSNNMDCSA-N 0.000 description 2
- 241001052560 Thallis Species 0.000 description 2
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 2
- PRONOHBTMLNXCZ-BZSNNMDCSA-N Tyr-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PRONOHBTMLNXCZ-BZSNNMDCSA-N 0.000 description 2
- ZZDYJFVIKVSUFA-WLTAIBSBSA-N Tyr-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ZZDYJFVIKVSUFA-WLTAIBSBSA-N 0.000 description 2
- KSGKJSFPWSMJHK-JNPHEJMOSA-N Tyr-Tyr-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KSGKJSFPWSMJHK-JNPHEJMOSA-N 0.000 description 2
- PVPAOIGJYHVWBT-KKHAAJSZSA-N Val-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N)O PVPAOIGJYHVWBT-KKHAAJSZSA-N 0.000 description 2
- 108010081404 acein-2 Proteins 0.000 description 2
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 108010044940 alanylglutamine Proteins 0.000 description 2
- 235000001014 amino acid Nutrition 0.000 description 2
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- 238000010828 elution Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000010353 genetic engineering Methods 0.000 description 2
- 108010079547 glutamylmethionine Proteins 0.000 description 2
- RWSXRVCMGQZWBV-WDSKDSINSA-N glutathione Chemical compound OC(=O)[C@@H](N)CCC(=O)N[C@@H](CS)C(=O)NCC(O)=O RWSXRVCMGQZWBV-WDSKDSINSA-N 0.000 description 2
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 2
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 2
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 2
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 2
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 2
- 108010010147 glycylglutamine Proteins 0.000 description 2
- 108010015792 glycyllysine Proteins 0.000 description 2
- 108010084389 glycyltryptophan Proteins 0.000 description 2
- 229960002897 heparin Drugs 0.000 description 2
- 229920000669 heparin Polymers 0.000 description 2
- 108010083213 heparitinsulfate lyase Proteins 0.000 description 2
- 238000003119 immunoblot Methods 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 239000012160 loading buffer Substances 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 244000005700 microbiome Species 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 239000008188 pellet Substances 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 2
- 108010051242 phenylalanylserine Proteins 0.000 description 2
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 2
- 239000000843 powder Substances 0.000 description 2
- 239000002244 precipitate Substances 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- 238000005215 recombination Methods 0.000 description 2
- 238000005070 sampling Methods 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 108010026333 seryl-proline Proteins 0.000 description 2
- 235000020183 skimmed milk Nutrition 0.000 description 2
- 238000002791 soaking Methods 0.000 description 2
- 239000000725 suspension Substances 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 108700004896 tripeptide FEG Proteins 0.000 description 2
- 238000001262 western blot Methods 0.000 description 2
- PKOHVHWNGUHYRE-ZFWWWQNUSA-N (2s)-1-[2-[[(2s)-2-amino-3-(1h-indol-3-yl)propanoyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound O=C([C@H](CC=1C2=CC=CC=C2NC=1)N)NCC(=O)N1CCC[C@H]1C(O)=O PKOHVHWNGUHYRE-ZFWWWQNUSA-N 0.000 description 1
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 1
- VBDMWOKJZDCFJM-FXQIFTODSA-N Ala-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N VBDMWOKJZDCFJM-FXQIFTODSA-N 0.000 description 1
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 1
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 1
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 1
- UHMQKOBNPRAZGB-CIUDSAMLSA-N Ala-Glu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N UHMQKOBNPRAZGB-CIUDSAMLSA-N 0.000 description 1
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 1
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 1
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 1
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 1
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 1
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 1
- RGQCNKIDEQJEBT-CQDKDKBSSA-N Ala-Leu-Tyr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 RGQCNKIDEQJEBT-CQDKDKBSSA-N 0.000 description 1
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 1
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 1
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 1
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 1
- YSUVMPICYVWRBX-VEVYYDQMSA-N Arg-Asp-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YSUVMPICYVWRBX-VEVYYDQMSA-N 0.000 description 1
- MZRBYBIQTIKERR-GUBZILKMSA-N Arg-Glu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MZRBYBIQTIKERR-GUBZILKMSA-N 0.000 description 1
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 1
- RFXXUWGNVRJTNQ-QXEWZRGKSA-N Arg-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N RFXXUWGNVRJTNQ-QXEWZRGKSA-N 0.000 description 1
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 1
- PYZPXCZNQSEHDT-GUBZILKMSA-N Arg-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N PYZPXCZNQSEHDT-GUBZILKMSA-N 0.000 description 1
- MNBHKGYCLBUIBC-UFYCRDLUSA-N Arg-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCNC(N)=N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MNBHKGYCLBUIBC-UFYCRDLUSA-N 0.000 description 1
- UIUXXFIKWQVMEX-UFYCRDLUSA-N Arg-Phe-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UIUXXFIKWQVMEX-UFYCRDLUSA-N 0.000 description 1
- OVQJAKFLFTZDNC-GUBZILKMSA-N Arg-Pro-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O OVQJAKFLFTZDNC-GUBZILKMSA-N 0.000 description 1
- YFHATWYGAAXQCF-JYJNAYRXSA-N Arg-Pro-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YFHATWYGAAXQCF-JYJNAYRXSA-N 0.000 description 1
- DDBMKOCQWNFDBH-RHYQMDGZSA-N Arg-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O DDBMKOCQWNFDBH-RHYQMDGZSA-N 0.000 description 1
- QHUOOCKNNURZSL-IHRRRGAJSA-N Arg-Tyr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O QHUOOCKNNURZSL-IHRRRGAJSA-N 0.000 description 1
- QNJIRRVTOXNGMH-GUBZILKMSA-N Asn-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(N)=O QNJIRRVTOXNGMH-GUBZILKMSA-N 0.000 description 1
- OKZOABJQOMAYEC-NUMRIWBASA-N Asn-Gln-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OKZOABJQOMAYEC-NUMRIWBASA-N 0.000 description 1
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 1
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 1
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 1
- KHCNTVRVAYCPQE-CIUDSAMLSA-N Asn-Lys-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O KHCNTVRVAYCPQE-CIUDSAMLSA-N 0.000 description 1
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 1
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 1
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 1
- QUMKPKWYDVMGNT-NUMRIWBASA-N Asn-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QUMKPKWYDVMGNT-NUMRIWBASA-N 0.000 description 1
- CPYHLXSGDBDULY-IHPCNDPISA-N Asn-Trp-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CPYHLXSGDBDULY-IHPCNDPISA-N 0.000 description 1
- KTDWFWNZLLFEFU-KKUMJFAQSA-N Asn-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O KTDWFWNZLLFEFU-KKUMJFAQSA-N 0.000 description 1
- XEGZSHSPQNDNRH-JRQIVUDYSA-N Asn-Tyr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XEGZSHSPQNDNRH-JRQIVUDYSA-N 0.000 description 1
- MYRLSKYSMXNLLA-LAEOZQHASA-N Asn-Val-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MYRLSKYSMXNLLA-LAEOZQHASA-N 0.000 description 1
- KBQOUDLMWYWXNP-YDHLFZDLSA-N Asn-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KBQOUDLMWYWXNP-YDHLFZDLSA-N 0.000 description 1
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 1
- CNKAZIGBGQIHLL-GUBZILKMSA-N Asp-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N CNKAZIGBGQIHLL-GUBZILKMSA-N 0.000 description 1
- QRULNKJGYQQZMW-ZLUOBGJFSA-N Asp-Asn-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QRULNKJGYQQZMW-ZLUOBGJFSA-N 0.000 description 1
- XACXDSRQIXRMNS-OLHMAJIHSA-N Asp-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)O XACXDSRQIXRMNS-OLHMAJIHSA-N 0.000 description 1
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 1
- HRGGPWBIMIQANI-GUBZILKMSA-N Asp-Gln-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HRGGPWBIMIQANI-GUBZILKMSA-N 0.000 description 1
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 1
- OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 1
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 1
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 1
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 1
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 1
- TZBJAXGYGSIUHQ-XUXIUFHCSA-N Asp-Leu-Leu-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O TZBJAXGYGSIUHQ-XUXIUFHCSA-N 0.000 description 1
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 1
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 1
- JXGJJQJHXHXJQF-CIUDSAMLSA-N Asp-Met-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O JXGJJQJHXHXJQF-CIUDSAMLSA-N 0.000 description 1
- LIJXJYGRSRWLCJ-IHRRRGAJSA-N Asp-Phe-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LIJXJYGRSRWLCJ-IHRRRGAJSA-N 0.000 description 1
- KOWYNSKRPUWSFG-IHPCNDPISA-N Asp-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)NC(=O)[C@H](CC(=O)O)N KOWYNSKRPUWSFG-IHPCNDPISA-N 0.000 description 1
- FIRWLDUOFOULCA-XIRDDKMYSA-N Asp-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N FIRWLDUOFOULCA-XIRDDKMYSA-N 0.000 description 1
- BPAUXFVCSYQDQX-JRQIVUDYSA-N Asp-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)O)N)O BPAUXFVCSYQDQX-JRQIVUDYSA-N 0.000 description 1
- ALMIMUZAWTUNIO-BZSNNMDCSA-N Asp-Tyr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ALMIMUZAWTUNIO-BZSNNMDCSA-N 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- LHRCZIRWNFRIRG-SRVKXCTJSA-N Cys-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N)O LHRCZIRWNFRIRG-SRVKXCTJSA-N 0.000 description 1
- 102000005927 Cysteine Proteases Human genes 0.000 description 1
- 108010005843 Cysteine Proteases Proteins 0.000 description 1
- REJJNXODKSHOKA-ACZMJKKPSA-N Gln-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N REJJNXODKSHOKA-ACZMJKKPSA-N 0.000 description 1
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 1
- CYTSBCIIEHUPDU-ACZMJKKPSA-N Gln-Asp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O CYTSBCIIEHUPDU-ACZMJKKPSA-N 0.000 description 1
- MADFVRSKEIEZHZ-DCAQKATOSA-N Gln-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N MADFVRSKEIEZHZ-DCAQKATOSA-N 0.000 description 1
- UFNSPPFJOHNXRE-AUTRQRHGSA-N Gln-Gln-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UFNSPPFJOHNXRE-AUTRQRHGSA-N 0.000 description 1
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 1
- QQAPDATZKKTBIY-YUMQZZPRSA-N Gln-Gly-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O QQAPDATZKKTBIY-YUMQZZPRSA-N 0.000 description 1
- VGTDBGYFVWOQTI-RYUDHWBXSA-N Gln-Gly-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VGTDBGYFVWOQTI-RYUDHWBXSA-N 0.000 description 1
- PODFFOWWLUPNMN-DCAQKATOSA-N Gln-His-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O PODFFOWWLUPNMN-DCAQKATOSA-N 0.000 description 1
- HDUDGCZEOZEFOA-KBIXCLLPSA-N Gln-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HDUDGCZEOZEFOA-KBIXCLLPSA-N 0.000 description 1
- YRWWJCDWLVXTHN-LAEOZQHASA-N Gln-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N YRWWJCDWLVXTHN-LAEOZQHASA-N 0.000 description 1
- CAXXTYYGFYTBPV-IUCAKERBSA-N Gln-Leu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CAXXTYYGFYTBPV-IUCAKERBSA-N 0.000 description 1
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 1
- HSHCEAUPUPJPTE-JYJNAYRXSA-N Gln-Leu-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HSHCEAUPUPJPTE-JYJNAYRXSA-N 0.000 description 1
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 1
- HMIXCETWRYDVMO-GUBZILKMSA-N Gln-Pro-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O HMIXCETWRYDVMO-GUBZILKMSA-N 0.000 description 1
- OUBUHIODTNUUTC-WDCWCFNPSA-N Gln-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O OUBUHIODTNUUTC-WDCWCFNPSA-N 0.000 description 1
- VYOILACOFPPNQH-UMNHJUIQSA-N Gln-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N VYOILACOFPPNQH-UMNHJUIQSA-N 0.000 description 1
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 1
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 1
- WLIPTFCZLHCNFD-LPEHRKFASA-N Glu-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O WLIPTFCZLHCNFD-LPEHRKFASA-N 0.000 description 1
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 1
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 1
- DWBBKNPKDHXIAC-SRVKXCTJSA-N Glu-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCC(O)=O DWBBKNPKDHXIAC-SRVKXCTJSA-N 0.000 description 1
- ZTVGZOIBLRPQNR-KKUMJFAQSA-N Glu-Met-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZTVGZOIBLRPQNR-KKUMJFAQSA-N 0.000 description 1
- CBWKURKPYSLMJV-SOUVJXGZSA-N Glu-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CBWKURKPYSLMJV-SOUVJXGZSA-N 0.000 description 1
- PAZQYODKOZHXGA-SRVKXCTJSA-N Glu-Pro-His Chemical compound N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O PAZQYODKOZHXGA-SRVKXCTJSA-N 0.000 description 1
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 1
- DDXZHOHEABQXSE-NKIYYHGXSA-N Glu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O DDXZHOHEABQXSE-NKIYYHGXSA-N 0.000 description 1
- VXEFAWJTFAUDJK-AVGNSLFASA-N Glu-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O VXEFAWJTFAUDJK-AVGNSLFASA-N 0.000 description 1
- 108010024636 Glutathione Proteins 0.000 description 1
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 1
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 1
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 1
- RPLLQZBOVIVGMX-QWRGUYRKSA-N Gly-Asp-Phe Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RPLLQZBOVIVGMX-QWRGUYRKSA-N 0.000 description 1
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 1
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 1
- JNGJGFMFXREJNF-KBPBESRZSA-N Gly-Glu-Trp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JNGJGFMFXREJNF-KBPBESRZSA-N 0.000 description 1
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 1
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 1
- FQKKPCWTZZEDIC-XPUUQOCRSA-N Gly-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 FQKKPCWTZZEDIC-XPUUQOCRSA-N 0.000 description 1
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 1
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 1
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 1
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 1
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 1
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 1
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 1
- YHYDTTUSJXGTQK-UWVGGRQHSA-N Gly-Met-Leu Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(C)C)C(O)=O YHYDTTUSJXGTQK-UWVGGRQHSA-N 0.000 description 1
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 1
- WDXLKVQATNEAJQ-BQBZGAKWSA-N Gly-Pro-Asp Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WDXLKVQATNEAJQ-BQBZGAKWSA-N 0.000 description 1
- JJGBXTYGTKWGAT-YUMQZZPRSA-N Gly-Pro-Glu Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O JJGBXTYGTKWGAT-YUMQZZPRSA-N 0.000 description 1
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 1
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 1
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 1
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 1
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 1
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 1
- WTUSRDZLLWGYAT-KCTSRDHCSA-N Gly-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN WTUSRDZLLWGYAT-KCTSRDHCSA-N 0.000 description 1
- YJDALMUYJIENAG-QWRGUYRKSA-N Gly-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN)O YJDALMUYJIENAG-QWRGUYRKSA-N 0.000 description 1
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 1
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 1
- NGRPGJGKJMUGDM-XVKPBYJWSA-N Gly-Val-Gln Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NGRPGJGKJMUGDM-XVKPBYJWSA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- DZMVESFTHXSSPZ-XVYDVKMFSA-N His-Ala-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DZMVESFTHXSSPZ-XVYDVKMFSA-N 0.000 description 1
- YPLYIXGKCRQZGW-SRVKXCTJSA-N His-Arg-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YPLYIXGKCRQZGW-SRVKXCTJSA-N 0.000 description 1
- HRGGKHFHRSFSDE-CIUDSAMLSA-N His-Asn-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N HRGGKHFHRSFSDE-CIUDSAMLSA-N 0.000 description 1
- VOKCBYNCZVSILJ-KKUMJFAQSA-N His-Asn-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)O VOKCBYNCZVSILJ-KKUMJFAQSA-N 0.000 description 1
- LCNNHVQNFNJLGK-AVGNSLFASA-N His-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N LCNNHVQNFNJLGK-AVGNSLFASA-N 0.000 description 1
- STWGDDDFLUFCCA-GVXVVHGQSA-N His-Glu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O STWGDDDFLUFCCA-GVXVVHGQSA-N 0.000 description 1
- 108010093488 His-His-His-His-His-His Proteins 0.000 description 1
- SOYCWSKCUVDLMC-AVGNSLFASA-N His-Pro-Arg Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CCCNC(=N)N)C(=O)O SOYCWSKCUVDLMC-AVGNSLFASA-N 0.000 description 1
- CUEQQFOGARVNHU-VGDYDELISA-N His-Ser-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUEQQFOGARVNHU-VGDYDELISA-N 0.000 description 1
- XHQYFGPIRUHQIB-PBCZWWQYSA-N His-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CN=CN1 XHQYFGPIRUHQIB-PBCZWWQYSA-N 0.000 description 1
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 1
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 1
- YPWHUFAAMNHMGS-QSFUFRPTSA-N Ile-Ala-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YPWHUFAAMNHMGS-QSFUFRPTSA-N 0.000 description 1
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 1
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 1
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 1
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 1
- NZGTYCMLUGYMCV-XUXIUFHCSA-N Ile-Lys-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N NZGTYCMLUGYMCV-XUXIUFHCSA-N 0.000 description 1
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 1
- UFRXVQGGPNSJRY-CYDGBPFRSA-N Ile-Met-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N UFRXVQGGPNSJRY-CYDGBPFRSA-N 0.000 description 1
- XQLGNKLSPYCRMZ-HJWJTTGWSA-N Ile-Phe-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)O)N XQLGNKLSPYCRMZ-HJWJTTGWSA-N 0.000 description 1
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 1
- OMDWJWGZGMCQND-CFMVVWHZSA-N Ile-Tyr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMDWJWGZGMCQND-CFMVVWHZSA-N 0.000 description 1
- NXRNRBOKDBIVKQ-CXTHYWKRSA-N Ile-Tyr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N NXRNRBOKDBIVKQ-CXTHYWKRSA-N 0.000 description 1
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 1
- WIYDLTIBHZSPKY-HJWJTTGWSA-N Ile-Val-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WIYDLTIBHZSPKY-HJWJTTGWSA-N 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- 241000880493 Leptailurus serval Species 0.000 description 1
- QPRQGENIBFLVEB-BJDJZHNGSA-N Leu-Ala-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QPRQGENIBFLVEB-BJDJZHNGSA-N 0.000 description 1
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 1
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 1
- VKOAHIRLIUESLU-ULQDDVLXSA-N Leu-Arg-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VKOAHIRLIUESLU-ULQDDVLXSA-N 0.000 description 1
- VIWUBXKCYJGNCL-SRVKXCTJSA-N Leu-Asn-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 VIWUBXKCYJGNCL-SRVKXCTJSA-N 0.000 description 1
- WXHFZJFZWNCDNB-KKUMJFAQSA-N Leu-Asn-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXHFZJFZWNCDNB-KKUMJFAQSA-N 0.000 description 1
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 1
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 1
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 1
- KXODZBLFVFSLAI-AVGNSLFASA-N Leu-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KXODZBLFVFSLAI-AVGNSLFASA-N 0.000 description 1
- NRFGTHFONZYFNY-MGHWNKPDSA-N Leu-Ile-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NRFGTHFONZYFNY-MGHWNKPDSA-N 0.000 description 1
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 description 1
- FAELBUXXFQLUAX-AJNGGQMLSA-N Leu-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C FAELBUXXFQLUAX-AJNGGQMLSA-N 0.000 description 1
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 1
- CPONGMJGVIAWEH-DCAQKATOSA-N Leu-Met-Ala Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O CPONGMJGVIAWEH-DCAQKATOSA-N 0.000 description 1
- JVTYXRRFZCEPPK-RHYQMDGZSA-N Leu-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)N)O JVTYXRRFZCEPPK-RHYQMDGZSA-N 0.000 description 1
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 1
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 1
- YLMIDMSLKLRNHX-HSCHXYMDSA-N Leu-Trp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YLMIDMSLKLRNHX-HSCHXYMDSA-N 0.000 description 1
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 1
- RDFIVFHPOSOXMW-ACRUOGEOSA-N Leu-Tyr-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RDFIVFHPOSOXMW-ACRUOGEOSA-N 0.000 description 1
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 description 1
- VQHUBNVKFFLWRP-ULQDDVLXSA-N Leu-Tyr-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 VQHUBNVKFFLWRP-ULQDDVLXSA-N 0.000 description 1
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 1
- CLBGMWIYPYAZPR-AVGNSLFASA-N Lys-Arg-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O CLBGMWIYPYAZPR-AVGNSLFASA-N 0.000 description 1
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 1
- DNEJSAIMVANNPA-DCAQKATOSA-N Lys-Asn-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DNEJSAIMVANNPA-DCAQKATOSA-N 0.000 description 1
- YVSHZSUKQHNDHD-KKUMJFAQSA-N Lys-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N YVSHZSUKQHNDHD-KKUMJFAQSA-N 0.000 description 1
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 1
- VQXAVLQBQJMENB-SRVKXCTJSA-N Lys-Glu-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O VQXAVLQBQJMENB-SRVKXCTJSA-N 0.000 description 1
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 1
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 1
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 1
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 1
- RFQATBGBLDAKGI-VHSXEESVSA-N Lys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCCN)N)C(=O)O RFQATBGBLDAKGI-VHSXEESVSA-N 0.000 description 1
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 1
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 1
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 1
- WLXGMVVHTIUPHE-ULQDDVLXSA-N Lys-Phe-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O WLXGMVVHTIUPHE-ULQDDVLXSA-N 0.000 description 1
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 1
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 1
- TVHCDSBMFQYPNA-RHYQMDGZSA-N Lys-Thr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TVHCDSBMFQYPNA-RHYQMDGZSA-N 0.000 description 1
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 1
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 1
- CWFYZYQMUDWGTI-GUBZILKMSA-N Met-Arg-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O CWFYZYQMUDWGTI-GUBZILKMSA-N 0.000 description 1
- IVCPHARVJUYDPA-FXQIFTODSA-N Met-Asn-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IVCPHARVJUYDPA-FXQIFTODSA-N 0.000 description 1
- TZLYIHDABYBOCJ-FXQIFTODSA-N Met-Asp-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O TZLYIHDABYBOCJ-FXQIFTODSA-N 0.000 description 1
- FWTBMGAKKPSTBT-GUBZILKMSA-N Met-Gln-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FWTBMGAKKPSTBT-GUBZILKMSA-N 0.000 description 1
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 1
- XKJUFUPCHARJKX-UWVGGRQHSA-N Met-Gly-His Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 XKJUFUPCHARJKX-UWVGGRQHSA-N 0.000 description 1
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 1
- HGAJNEWOUHDUMZ-SRVKXCTJSA-N Met-Leu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O HGAJNEWOUHDUMZ-SRVKXCTJSA-N 0.000 description 1
- BEZJTLKUMFMITF-AVGNSLFASA-N Met-Lys-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCNC(N)=N BEZJTLKUMFMITF-AVGNSLFASA-N 0.000 description 1
- VQILILSLEFDECU-GUBZILKMSA-N Met-Pro-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O VQILILSLEFDECU-GUBZILKMSA-N 0.000 description 1
- QLESZRANMSYLCZ-CYDGBPFRSA-N Met-Pro-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QLESZRANMSYLCZ-CYDGBPFRSA-N 0.000 description 1
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 1
- KYXDADPHSNFWQX-VEVYYDQMSA-N Met-Thr-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O KYXDADPHSNFWQX-VEVYYDQMSA-N 0.000 description 1
- JACMWNXOOUYXCD-JYJNAYRXSA-N Met-Val-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JACMWNXOOUYXCD-JYJNAYRXSA-N 0.000 description 1
- IIHMNTBFPMRJCN-RCWTZXSCSA-N Met-Val-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IIHMNTBFPMRJCN-RCWTZXSCSA-N 0.000 description 1
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- MRNRMSDVVSKPGM-AVGNSLFASA-N Phe-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRNRMSDVVSKPGM-AVGNSLFASA-N 0.000 description 1
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 1
- FIRWJEJVFFGXSH-RYUDHWBXSA-N Phe-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 FIRWJEJVFFGXSH-RYUDHWBXSA-N 0.000 description 1
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 1
- OVJMCXAPGFDGMG-HKUYNNGSSA-N Phe-Gly-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OVJMCXAPGFDGMG-HKUYNNGSSA-N 0.000 description 1
- HQPWNHXERZCIHP-PMVMPFDFSA-N Phe-Leu-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 HQPWNHXERZCIHP-PMVMPFDFSA-N 0.000 description 1
- ZUQACJLOHYRVPJ-DKIMLUQUSA-N Phe-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZUQACJLOHYRVPJ-DKIMLUQUSA-N 0.000 description 1
- OKQQWSNUSQURLI-JYJNAYRXSA-N Phe-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=CC=C1)N OKQQWSNUSQURLI-JYJNAYRXSA-N 0.000 description 1
- RBRNEFJTEHPDSL-ACRUOGEOSA-N Phe-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 RBRNEFJTEHPDSL-ACRUOGEOSA-N 0.000 description 1
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 1
- BTAIJUBAGLVFKQ-BVSLBCMMSA-N Phe-Trp-Val Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](C(C)C)C(O)=O)C1=CC=CC=C1 BTAIJUBAGLVFKQ-BVSLBCMMSA-N 0.000 description 1
- FRMKIPSIZSFTTE-HJOGWXRNSA-N Phe-Tyr-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FRMKIPSIZSFTTE-HJOGWXRNSA-N 0.000 description 1
- GLUYKHMBGKQBHE-JYJNAYRXSA-N Phe-Val-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 GLUYKHMBGKQBHE-JYJNAYRXSA-N 0.000 description 1
- GNZCMRRSXOBHLC-JYJNAYRXSA-N Phe-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N GNZCMRRSXOBHLC-JYJNAYRXSA-N 0.000 description 1
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 1
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 1
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 1
- AHXPYZRZRMQOAU-QXEWZRGKSA-N Pro-Asn-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1)C(O)=O AHXPYZRZRMQOAU-QXEWZRGKSA-N 0.000 description 1
- CJZTUKSFZUSNCC-FXQIFTODSA-N Pro-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 CJZTUKSFZUSNCC-FXQIFTODSA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 1
- HXOLCSYHGRNXJJ-IHRRRGAJSA-N Pro-Asp-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HXOLCSYHGRNXJJ-IHRRRGAJSA-N 0.000 description 1
- DEDANIDYQAPTFI-IHRRRGAJSA-N Pro-Asp-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DEDANIDYQAPTFI-IHRRRGAJSA-N 0.000 description 1
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 1
- PTLOFJZJADCNCD-DCAQKATOSA-N Pro-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 PTLOFJZJADCNCD-DCAQKATOSA-N 0.000 description 1
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 1
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 1
- FDINZVJXLPILKV-DCAQKATOSA-N Pro-His-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O FDINZVJXLPILKV-DCAQKATOSA-N 0.000 description 1
- DTQIXTOJHKVEOH-DCAQKATOSA-N Pro-His-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CS)C(=O)O DTQIXTOJHKVEOH-DCAQKATOSA-N 0.000 description 1
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 1
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 1
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 1
- WOIFYRZPIORBRY-AVGNSLFASA-N Pro-Lys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WOIFYRZPIORBRY-AVGNSLFASA-N 0.000 description 1
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 1
- RNEFESSBTOQSAC-DCAQKATOSA-N Pro-Ser-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O RNEFESSBTOQSAC-DCAQKATOSA-N 0.000 description 1
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 1
- WXUBSIDKNMFAGS-IHRRRGAJSA-N Ser-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXUBSIDKNMFAGS-IHRRRGAJSA-N 0.000 description 1
- SFZKGGOGCNQPJY-CIUDSAMLSA-N Ser-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N SFZKGGOGCNQPJY-CIUDSAMLSA-N 0.000 description 1
- KJMOINFQVCCSDX-XKBZYTNZSA-N Ser-Gln-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KJMOINFQVCCSDX-XKBZYTNZSA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- UAJAYRMZGNQILN-BQBZGAKWSA-N Ser-Gly-Met Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UAJAYRMZGNQILN-BQBZGAKWSA-N 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- DJACUBDEDBZKLQ-KBIXCLLPSA-N Ser-Ile-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O DJACUBDEDBZKLQ-KBIXCLLPSA-N 0.000 description 1
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 1
- QPPYAWVLAVXISR-DCAQKATOSA-N Ser-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QPPYAWVLAVXISR-DCAQKATOSA-N 0.000 description 1
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 1
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- JVTHIXKSVYEWNI-JRQIVUDYSA-N Thr-Asn-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JVTHIXKSVYEWNI-JRQIVUDYSA-N 0.000 description 1
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 1
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 1
- KRPKYGOFYUNIGM-XVSYOHENSA-N Thr-Asp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O KRPKYGOFYUNIGM-XVSYOHENSA-N 0.000 description 1
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 1
- YSXYEJWDHBCTDJ-DVJZZOLTSA-N Thr-Gly-Trp Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O YSXYEJWDHBCTDJ-DVJZZOLTSA-N 0.000 description 1
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 1
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 1
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 1
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 description 1
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 1
- VGYVVSQFSSKZRJ-OEAJRASXSA-N Thr-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=CC=C1 VGYVVSQFSSKZRJ-OEAJRASXSA-N 0.000 description 1
- GFRIEEKFXOVPIR-RHYQMDGZSA-N Thr-Pro-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O GFRIEEKFXOVPIR-RHYQMDGZSA-N 0.000 description 1
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 1
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 1
- CSNBWOJOEOPYIJ-UVOCVTCTSA-N Thr-Thr-Lys Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O CSNBWOJOEOPYIJ-UVOCVTCTSA-N 0.000 description 1
- SOUPNXUJAJENFU-SWRJLBSHSA-N Thr-Trp-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O SOUPNXUJAJENFU-SWRJLBSHSA-N 0.000 description 1
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 1
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 1
- OETOOJXFNSEYHQ-WFBYXXMGSA-N Trp-Ala-Asp Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O)=CNC2=C1 OETOOJXFNSEYHQ-WFBYXXMGSA-N 0.000 description 1
- TZNNEYFZZAHLBL-BPUTZDHNSA-N Trp-Arg-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O TZNNEYFZZAHLBL-BPUTZDHNSA-N 0.000 description 1
- HXMJXDNSFVNSEH-IHPCNDPISA-N Trp-Cys-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N HXMJXDNSFVNSEH-IHPCNDPISA-N 0.000 description 1
- JLTQXEOXIJMCLZ-ZVZYQTTQSA-N Trp-Gln-Val Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O)=CNC2=C1 JLTQXEOXIJMCLZ-ZVZYQTTQSA-N 0.000 description 1
- YYXIWHBHTARPOG-HJXMPXNTSA-N Trp-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N YYXIWHBHTARPOG-HJXMPXNTSA-N 0.000 description 1
- CCZXBOFIBYQLEV-IHPCNDPISA-N Trp-Leu-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O CCZXBOFIBYQLEV-IHPCNDPISA-N 0.000 description 1
- KRCPXGSWDOGHAM-XIRDDKMYSA-N Trp-Lys-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O KRCPXGSWDOGHAM-XIRDDKMYSA-N 0.000 description 1
- VDUJEEQMRQCLHB-YTQUADARSA-N Trp-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O VDUJEEQMRQCLHB-YTQUADARSA-N 0.000 description 1
- NECCMBOBBANRIT-RNXOBYDBSA-N Trp-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NECCMBOBBANRIT-RNXOBYDBSA-N 0.000 description 1
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 1
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 1
- NRFTYDWKWGJLAR-MELADBBJSA-N Tyr-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O NRFTYDWKWGJLAR-MELADBBJSA-N 0.000 description 1
- FGJWNBBFAUHBEP-IHPCNDPISA-N Tyr-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N FGJWNBBFAUHBEP-IHPCNDPISA-N 0.000 description 1
- HDSKHCBAVVWPCQ-FHWLQOOXSA-N Tyr-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HDSKHCBAVVWPCQ-FHWLQOOXSA-N 0.000 description 1
- HIINQLBHPIQYHN-JTQLQIEISA-N Tyr-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HIINQLBHPIQYHN-JTQLQIEISA-N 0.000 description 1
- DAOREBHZAKCOEN-ULQDDVLXSA-N Tyr-Leu-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O DAOREBHZAKCOEN-ULQDDVLXSA-N 0.000 description 1
- WOAQYWUEUYMVGK-ULQDDVLXSA-N Tyr-Lys-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOAQYWUEUYMVGK-ULQDDVLXSA-N 0.000 description 1
- YYLHVUCSTXXKBS-IHRRRGAJSA-N Tyr-Pro-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YYLHVUCSTXXKBS-IHRRRGAJSA-N 0.000 description 1
- VYQQQIRHIFALGE-UWJYBYFXSA-N Tyr-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VYQQQIRHIFALGE-UWJYBYFXSA-N 0.000 description 1
- RWOKVQUCENPXGE-IHRRRGAJSA-N Tyr-Ser-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RWOKVQUCENPXGE-IHRRRGAJSA-N 0.000 description 1
- HRHYJNLMIJWGLF-BZSNNMDCSA-N Tyr-Ser-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 HRHYJNLMIJWGLF-BZSNNMDCSA-N 0.000 description 1
- HZDQUVQEVVYDDA-ACRUOGEOSA-N Tyr-Tyr-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HZDQUVQEVVYDDA-ACRUOGEOSA-N 0.000 description 1
- QVYFTFIBKCDHIE-ACRUOGEOSA-N Tyr-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O QVYFTFIBKCDHIE-ACRUOGEOSA-N 0.000 description 1
- 102000044159 Ubiquitin Human genes 0.000 description 1
- 108090000848 Ubiquitin Proteins 0.000 description 1
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 1
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 1
- XQVRMLRMTAGSFJ-QXEWZRGKSA-N Val-Asp-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XQVRMLRMTAGSFJ-QXEWZRGKSA-N 0.000 description 1
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 1
- COSLEEOIYRPTHD-YDHLFZDLSA-N Val-Asp-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 COSLEEOIYRPTHD-YDHLFZDLSA-N 0.000 description 1
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 1
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 1
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 1
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 1
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 1
- GVJUTBOZZBTBIG-AVGNSLFASA-N Val-Lys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N GVJUTBOZZBTBIG-AVGNSLFASA-N 0.000 description 1
- CXWJFWAZIVWBOS-XQQFMLRXSA-N Val-Lys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CXWJFWAZIVWBOS-XQQFMLRXSA-N 0.000 description 1
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 description 1
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 1
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 1
- DLLRRUDLMSJTMB-GUBZILKMSA-N Val-Ser-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N DLLRRUDLMSJTMB-GUBZILKMSA-N 0.000 description 1
- IRAUYEAFPFPVND-UVBJJODRSA-N Val-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 IRAUYEAFPFPVND-UVBJJODRSA-N 0.000 description 1
- QTXGUIMEHKCPBH-FHWLQOOXSA-N Val-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 QTXGUIMEHKCPBH-FHWLQOOXSA-N 0.000 description 1
- VTIAEOKFUJJBTC-YDHLFZDLSA-N Val-Tyr-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VTIAEOKFUJJBTC-YDHLFZDLSA-N 0.000 description 1
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 1
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 1
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 238000009395 breeding Methods 0.000 description 1
- 230000001488 breeding effect Effects 0.000 description 1
- 229910052799 carbon Inorganic materials 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 238000004140 cleaning Methods 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 239000012141 concentrate Substances 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000000593 degrading effect Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000000502 dialysis Methods 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 239000003292 glue Substances 0.000 description 1
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 1
- 229960003180 glutathione Drugs 0.000 description 1
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 1
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 1
- 108010008671 glycyl-tryptophyl-methionine Proteins 0.000 description 1
- 108010020688 glycylhistidine Proteins 0.000 description 1
- 108010050848 glycylleucine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 108010012058 leucyltyrosine Proteins 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 229940127215 low-molecular weight heparin Drugs 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 239000003607 modifier Substances 0.000 description 1
- 238000012856 packing Methods 0.000 description 1
- 108010072637 phenylalanyl-arginyl-phenylalanine Proteins 0.000 description 1
- 108010077112 prolyl-proline Proteins 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 238000004153 renaturation Methods 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 108010071207 serylmethionine Proteins 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 238000007921 solubility assay Methods 0.000 description 1
- 238000003892 spreading Methods 0.000 description 1
- 230000007480 spreading Effects 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 230000013715 transcription antitermination Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 238000010023 transfer printing Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/88—Lyases (4.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y402/00—Carbon-oxygen lyases (4.2)
- C12Y402/02—Carbon-oxygen lyases (4.2) acting on polysaccharides (4.2.2)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/20—Fusion polypeptide containing a tag with affinity for a non-protein ligand
- C07K2319/21—Fusion polypeptide containing a tag with affinity for a non-protein ligand containing a His-tag
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/35—Fusion polypeptide containing a fusion for enhanced stability/folding during expression, e.g. fusions with chaperones or thioredoxin
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Zoology (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- Biomedical Technology (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- Medicinal Chemistry (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Peptides Or Proteins (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
本发明公开了利用SUMO融合表达系统制备重组肝素酶II的方法及其所制备的肝素酶II,该制备方法包括下列步骤:选择来自肝素黄杆菌(Pedobacter heparinus)的肝素酶II序列,其氨基酸序列如SEQIDNo.1所示,共772aa;去除信号肽序列,获得肝素酶II的DNA序列,如SEQIDNo.2;将所述肝素酶II的DNA序列插入带有N端SUMO蛋白标签的pSMART载体质粒;将正确的质粒转化至BL21(DE3)大肠杆菌感受态细胞中,挑取单克隆,获得融合蛋白;用SUMO蛋白酶将SUMO标签蛋白从融合蛋白上切割下来,即获得肝素酶II。本发明的优点是:提高目标蛋白质的可溶性、促进目标蛋白质的正确折叠,避免形成包涵体;降低纯化成本、提高纯化效率;SUMO蛋白标签分子量较小,对肝素酶II的酶活性影响较小,获得纯化的肝素酶II。
Description
本发明涉及生物技术领域,尤其涉及涉及一种利用SUMO融合表达系统制备重组肝素酶II(Heparinase II)的方法。
背景技术
肝素酶主要从一些利用肝素为碳源的细菌中分离得到,最初来源于肝素黄杆菌(Pedobacter heparinus),从肝素黄杆菌中分离纯化出的肝素酶有三种,分别为肝素酶I、肝素酶II、肝素酶III。肝素酶的主要作用是降解肝素,在酶降解法生产低分子肝素和超低分子肝素中有广泛应用。
目前用于低分子肝素生产的肝素酶产品主要有两种来源:一是直接从产肝素酶的微生物中发酵提取,如专利号CN102286448A记载,一种肝素黄杆菌肝素酶I、II、III的制备方法(深圳市海普瑞药业股份有限公司);二是通过基因工程技术构建重组肝素酶工程菌,发酵提取重组肝素酶。
直接从产肝素酶的微生物(如肝素黄杆菌等)中提取肝素酶II,发酵条件和纯化工艺相对复杂,通过菌种选育提高产量的潜力有限。
基因工程重组技术已广泛应用于蛋白质制备,在构建工程菌时,选择合适的蛋白纯化标签,与目标蛋白质形成融合蛋白,可以提高蛋白质的可溶性、屏蔽毒性蛋白、促进蛋白质的正确折叠、增强对高温和蛋白酶的耐受性。目前已有将麦芽糖结合蛋白(MBP)标签与肝素酶II形成融合蛋白等应用,如专利号CN101942024A记载,一种肝素酶Ⅱ融合蛋白及其编码基因与表达方法(清华大学、北京思清源生物科技有限公司)。
采用基因工程重组技术在大肠杆菌系统中生产肝素酶II等外源蛋白质时面临两个挑战:一是所用蛋白质表达系统的表达水平较低;二是所表达的蛋白质被错误折叠进不溶性的聚集体——包涵体中,提取和复性困难。采用可溶性蛋白标签与肝素酶II形成融合蛋白,可以显著提高重组产物的溶解性。
类似谷胱甘肽巯基转移酶标签(GST-Tag)、麦芽糖结合蛋白标签(MBP- Tag)、转录抗终止因子A标签(NusA-Tag)等融合蛋白标签,它们的分子量较大(26-55kDa),可能会干扰目标蛋白质的正常功能,并且在表达和后续纯化过程中去除标签等环节相对降低了目标蛋白质的产率。
发明内容
为解决现有技术的缺陷和不足,本发明的目的是提供一种利用SUMO融合表达系统制备重组肝素酶II(Heparinase II)的方法,以极大地提高目标蛋白质的表达水平、稳定性和可溶性,提高目标蛋白质的活性及产率。
为实现上述目的,本发明的技术方案如下:
根据本发明的一个方面,提供了一种利用SUMO融合表达系统制备重组肝素酶II(Heparinase II)的方法,包括下列步骤:
S01:选择来自肝素黄杆菌(Pedobacter heparinus)的肝素酶II序列,其氨基酸序列如SEQIDNo.1所示,共772aa;
S02:去除信号肽序列,获得肝素酶II的DNA序列,如SEQIDNo.2;
S03:将所述肝素酶II的DNA序列插入带有N端SUMO蛋白标签的 pSMART载体质粒;
S04:将正确的质粒转化至BL21(DE3)大肠杆菌感受态细胞中,挑取单克隆,经发酵和纯化获得SUMO_肝素酶II融合蛋白;
S05:用SUMO蛋白酶将SUMO标签蛋白从步骤S04所述的SUMO_肝素酶 II融合蛋白上切割下来,即获得肝素酶II。
进一步地,还包括融合蛋白表达条件的优化步骤。
进一步地,所述融合蛋白的优化条件是融合蛋白在15℃以0.2mmol/l 异丙基硫代半乳糖苷(IPTG)诱导表达。
进一步地,还包括融合蛋白的纯化步骤。
进一步地,所述融合蛋白的纯化步骤采用Ni柱亲和层析方法进行分离纯化。
进一步地,所述的SUMO标签蛋白从步骤S04所述的SUMO_肝素酶II 融合蛋白上切割下来是包括下列步骤:将所述融合蛋白透析更换缓冲液,加入适量的SUMO蛋白酶,混匀后置于4℃反应过夜,水解后的融合蛋白样本用Ni柱亲和层析方法将带有6个组氨酸片段的SUMO蛋白标签(能结合到Ni柱上)与肝素酶II(不能结合到Ni柱上)分离。
根据本发明的另一个方面,还提供了一种按照上述方法制备的SUMO_ 肝素酶II溶合蛋白,所述融合蛋白的氨基酸序列如SEQIDNo.3。
根据本发明的再一个方面,还提供了一种按照上述方法制备的肝素酶 II。
通过本发明的技术方案,利用SUMO融合表达系统制备的重组肝素酶II,融合在目标蛋白质N端的较小的类泛素蛋白修饰物(small ubiquitin-like modifier,SUMO)标签蛋白,分子量大约11kDa,能够极大地提高目标蛋白质的稳定性和可溶性,对目标蛋白质的活性影响较小。并且,SUMO蛋白酶(SUMO Protease)是一种具有较高活性的半胱氨酸蛋白酶,它能识别SUMO标签蛋白的三级结构,而不是氨基酸序列,因此可以高效而且特异性地将SUMO标签蛋白从重组融合蛋白上切割下来,从而获得肝素酶II。
采用SUMO蛋白标签融合表达系统制备重组肝素酶II的优点:
1)提高目标蛋白质的可溶性、促进目标蛋白质的正确折叠,避免形成包涵体;
2)SUMO蛋白标签近N端有6个组氨酸片段,使融合蛋白可以用Ni柱亲和层析方法一步纯化,降低纯化成本、提高纯化效率;
3)SUMO蛋白标签分子量较小,大约11kDa,对肝素酶II的酶活性影响较小,即使不切除融合蛋白中的蛋白标签,也能得到较高活性的肝素酶 II;
4)SUMO蛋白酶能识别SUMO标签蛋白的三级结构,可以高效而且特异性地将SUMO标签蛋白从重组融合蛋白上切割下来,从而获得纯化的肝素酶II。
下面结合附图,对本发明的具体实施方式作进一步的详细说明。对于所属技术领域的技术人员而言,从对本发明的详细说明中,本发明的上述和其他的目的、特征和优点将显而易见。
附图说明
图1为本发明一实施例中肝素酶II表达鉴定的SDS-PAGE电泳分析图;
图2为本发明一实施例中肝素酶II表达优化的SDS-PAGE电泳分析图;
图3为本发明一实施例中肝素酶II可溶性分析的SDS-PAGE电泳分析图;
图4为本发明一实施例中肝素酶II亲和纯化结果的SDS-PAGE电泳分析图;
图5为本发明一实施例中肝素酶II免疫印迹Western Blot的SDS- PAGE电泳分析图;
图6为本发明一实施例中肝素酶II去除标签的SDS-PAGE电泳分析图。
具体实施方式
下面通过具体实施方式及实施例并结合附图对本发明作进一步的详细说明。
根据本发明一种典型的实施方式,提供了一种利用SUMO融合表达系统制备重组肝素酶II(Heparinase II)的方法,包括下列步骤:
S01:选择来自肝素黄杆菌(Pedobacter heparinus)的肝素酶II序列,其氨基酸序列如SEQIDNo.1所示,共772aa;
S02:去除信号肽序列,获得肝素酶II的DNA序列,如SEQIDNo.2;
S03:将所述肝素酶II的DNA序列插入带有N端SUMO蛋白标签的 pSMART载体质粒;
S04:将正确的质粒转化至BL21(DE3)大肠杆菌感受态细胞中,挑取单克隆,经发酵和纯化获得融合蛋白;
S05:用SUMO蛋白酶将SUMO标签蛋白从步骤S04所述的融合蛋白上切割下来,即获得肝素酶II。
优选的,还包括融合蛋白表达条件的优化步骤。
优选的,所述融合蛋白的优化条件是融合蛋白在15℃以0.2mmol/l异丙基硫代半乳糖苷(IPTG)诱导表达。
优选的,还包括融合蛋白的纯化步骤。
优选的,所述融合蛋白的纯化步骤采用Ni柱亲和层析方法进行分离纯化。
优选的,所述的SUMO标签蛋白从步骤S04所述的SUMO_肝素酶II融合蛋白上切割下来是包括下列步骤:将所述融合蛋白透析更换缓冲液,加入适量的SUMO蛋白酶,混匀后置于4℃反应过夜,水解后的融合蛋白样本用Ni柱亲和层析方法将带有6个组氨酸片段的SUMO蛋白标签(能结合到 Ni柱上)与肝素酶II(不能结合到Ni柱上)分离。
根据本发明的一典型实施方式,还提供了按照上述方法制备的SUMO_ 肝素酶II融合蛋白,所述融合蛋白的氨基酸序列如SEQIDNo.3。
根据本发明的一典型实施方式,还提供了一种按照上述方法制备的肝素酶II。
选择来自肝素黄杆菌(Pedobacter heparinus)的肝素酶II序列,共 772aa,其氨基酸序列如SEQIDNo.1所示,发明人经蛋白质序列分析发现,肝素酶II没有跨膜结构,第1-25个氨基酸为信号肽 (MKKQILYLIVLQQLFLCSAYA),后747个氨基酸为肝素酶II序列。采取大肠杆菌胞内表达方式制备肝素酶II,转入工程菌的序列应预先去除信号肽序列。将肝素酶II的DNA序列如SEQIDNo.2所示,插入带有N端SUMO蛋白标签的pSMART载体质粒。将正确的质粒转化至BL21(DE3)大肠杆菌感受态细胞中,挑取单克隆,经SDS-PAGE验证,融合蛋白表达正常。融合蛋白由109aa的SUMO蛋白标签和747aa的肝素酶II组成,SUMO-肝素酶II 融合蛋白的氨基酸序列如SEQIDNo.3所示。
经优化,确定融合蛋白在15℃以0.2mmol/l异丙基硫代半乳糖苷(IPTG) 诱导表达最佳。
在融合蛋白靠近N端处设计了6个组氨酸(histidine,H)序列,工程菌发酵液去除上清液,并将菌体破碎后,采用Ni柱亲和层析方法将融合蛋白分离纯化。
将纯化后的融合蛋白样品透析更换缓冲液,加入适量的SUMO蛋白酶,混匀后置于4℃反应过夜。水解后的融合蛋白样本用Ni柱亲和层析方法将带有6个组氨酸片段的SUMO蛋白标签(能结合到Ni柱上)与肝素酶II (不能结合到Ni柱上)分离。
下面将结合实施例进一步说明本发明的有益效果。
实施例一
1.质粒转化至BL21(DE3)感受态细胞中
1.1将质粒2μL加入100μL感受态细菌中,置冰上30min;
1.2 42℃热激90s,迅速置冰中5min;加入500μL LB培养液;1.3 37℃,220rpm振摇1h,离心后全部涂布于含抗性的LB 平板,37℃倒置培养过夜。
2.表达鉴定
2.1挑取5个平板上单克隆接种于含适量抗性的4mL LB培养液的试管中;
2.2 37℃、220rpm振摇至菌体OD600为0.6-0.8;
2.3取出1mL培养物,12000g室温离心5min,弃上清,用80μL 1×PBS缓冲液重悬菌体沉淀并加入20μL的5×Loading Buffer;
2.4向剩余的培养物中加入IPTG至终浓度为0.5mM,在37℃220 rpm振摇4h,诱导融合蛋白表达;
2.5取出0.5mL培养物,12000g室温离心5min,弃上清,用80μL 1×PBS缓冲液重悬菌体沉淀并加入20μL的5×Loading Buffer;
2.6进行12%SDS-PAGE分析,表达鉴定结果如图1所示,其中M:蛋白质分子量标记(Protein Marker),1:诱导前样品,2:诱导后样品Ⅰ,3:诱导后样品Ⅱ,4:诱导后样品Ⅲ,5:诱导后样品Ⅳ,6:诱导后样品Ⅴ。
3.表达优化
3.1挑取划线平板上单克隆接种于13支含适量抗性的4mL LB培养液的试管中;
3.2 37℃、220rpm振摇至菌体OD600为0.6-0.8;
3.3取出0.4mL培养物,12000g室温离心5min,弃上清,用80μL 1×PBS缓冲液重悬菌体沉淀加入20μL的5×Loading Buffer;
3.4向剩余的培养物中加入IPTG至终浓度分别为0.2mM与1mM,分别在37℃和15℃220rpm振摇4h和16h,诱导融合蛋白表达;
3.5取出0.2mL培养物,12000g室温离心5min,弃上清,用80μL 1×PBS缓冲液重悬菌体沉淀加入20μL的5×Loading Buffer。
3.6进行SDS-PAGE分析,表达优化结果如图2所示,其中
M:蛋白质分子量标记(Protein Marker);
1:未诱导样品;
2-4:37℃、1.0mM IPTG诱导后样品;
5-7:37℃、0.2mM IPTG诱导后样品;
8-10:15℃、1.0mM IPTG诱导后样品;
11-13:15℃、0.2mM IPTG诱导后样品。
4.可溶性分析
取37℃诱导后菌液和15℃诱导后菌液2mL,12000g室温离心5min,弃上清,用1mL的Buffer A(20mM Tris,300mM NaCl,10%Glycerol, pH8.0)重悬离心后沉淀,超声波破碎(Φ3,15%,2s/8s,10min),上清沉淀分别制样,进行12%SDS-PAGE分析,15℃诱导下,融合蛋白均可溶,可溶性分析结果如图3所示,其中
M:蛋白质分子量标记(Protein Marker);
1:未诱导样品;
2:诱导后样品;
3:37℃、1.0mM IPTG诱后沉淀样;
4:37℃、1.0mM IPTG诱后上清样;
5:37℃、0.2mM IPTG诱后沉淀样;
6:37℃、0.2mM IPTG诱后上清样;
7:15℃、1.0mM IPTG诱后沉淀样;
8:15℃、1.0mM IPTG诱后上清样;
9:15℃、0.2mM IPTG诱后沉淀样;
10:15℃、0.2mM IPTG诱后上清样。
5.放大表达与亲和纯化
5.1取最优克隆保种菌接种至含合适抗生素的1L LB培养基中, 37℃220rpm振摇至OD600=0.6~0.8,加入终浓度为0.2mM IPTG后, 15℃220rpm振摇16h。
5.2利用8000rpm离心10min,弃去上清,收集所有菌体。
5.3使用Buffer A(20mM Tris,300mM NaCl,10%Glycerol, pH8.0)重悬菌体,超声破碎(Φ10,15%,2s/8s,30min)。16000rpm 离心10min,收集上清。
5.4在上清中加入2mL Ni-NTA,混匀后4℃孵育1h。
5.5将孵育物加入空柱管,收集流出液。
5.6分别以Buffer B(20mM Tris,300mM NaCl,10%Glycerol, 20mM Imidazole,pH8.0)与Buffer C(20mM Tris,300mM NaCl, 10%Glycerol,40mM Imidazole,pH8.0)清洗填料,收集清洗液。
5.7利用Buffer D(20mM Tris,300mM NaCl,10%Glycerol, 500mM Imidazole,pH8.0)洗脱,收集洗脱液。
5.8SDS-PAGE电泳检测,亲和纯化结果如图4所示,其中
M:蛋白质标记Protein Marker;
1:破碎后沉淀;
2:破碎后上清;
3:Ni-NTA孵育后流出液;
4:Buffer B清洗样;
5:Buffer C清洗样;
6:Buffer D洗脱样。
6.洗脱样免疫印迹(Western Blot)分析
6.1SDS-PAGE
电泳样品:预染色的蛋白质分子量标记(Protein Marker), Buffer D洗脱样
电泳条件:200V,60min
6.2半干式电转印
a.在电泳的正在进行的时候,将PVDF膜切出与凝胶一样大小,并剪去一个小角,帮助判断方向,放置在甲醇中浸泡30s,后转移到电转液中;
b.薄滤纸和厚滤纸切出比凝胶略大一些,各两张,置转移缓冲液浸泡;
c.电泳完成后,将胶的浓缩胶切除,放置在电转液中。按照厚滤纸,薄滤纸,NC膜,胶,薄滤纸,厚滤纸顺序放置在电转仪平面上。
d.每层之间的气泡要全部去除。可以用手轻轻按住一端,再用50mL 离心管轻轻在上层往一边移动去除气泡,再将手换一端,离心管往另外一端去除气泡;
e.将电转仪链接到仪器上设置,30V,蛋白小于50KDa,用 30mins;蛋白大于50KDa,用45mins。
6.3膜的封闭和抗体孵育及最终处理
a.洗转印膜:室温用1×PBST漂洗5min共3次,在加入任何溶液的时候都不得在膜上加入,沿盒子的角缓慢加入,以免将膜上结合的物质冲掉,摇床转速40rpm。
b.将PBST倒掉,放入5%脱脂奶粉的封闭液内,摇床调到最低转速,室温封闭2h。
c.室温用1×PBST漂洗5min,共3次.
d.加入抗体(1:2000到5%脱脂奶粉中),孵育1h,使用后回收到-20℃。
e.室温用1×PBST漂洗5min,共3次。f.缓慢加入少量ECL 液,能将膜全部覆盖,静置2min后拍照,如图5所示,其中,M:蛋 白质标记Protein Marker样本Sample:BufferD洗脱样。
7.去除标签蛋白
7.1将洗脱样全部透析至Buffer A(20mM Tris,300mM NaCl,pH 8.0)中。
7.2透析完成后,在融合蛋白中加入适量的SUMO蛋白酶(SUMO Protease),混匀后,放置于4℃反应过夜。
7.3将反应完成后的产物,加入Ni-NTA纯化柱中,缓慢上样,收集流出液。
7.4利用Buffer A(20mM Tris,300mM NaCl,pH 8.0)清洗纯化柱。
7.5利用Buffer D(20mM Tris,300mM NaCl,500mM Imidazole, pH 8.0)洗脱。
7.6SDS-PAGE电泳,如图6所示,其中
M:蛋白质分子量标记(Protein Marker)
1:酶切反应前样品
2:酶切反应后样品
3:酶切后过柱流出样(肝素酶II)
4:酶切后过柱清洗样
5:酶切后过柱洗脱样(SUMO蛋白标签)。
综上,将正确的质粒转化至BL21(DE3)大肠杆菌感受态细胞中,挑取单克隆,经SDS-PAGE验证,融合蛋白表达正常。各优化条件下,融合蛋白均表达明显,且融合蛋白在15℃诱导温度下可溶性更好,选择15℃下0.2mM IPTG诱导,Ni-NTA纯化后去除标签得到无融合标签的目的蛋白肝素酶II。
当然,本发明还可有其他实施例,在不背离本发明精神及其实质的情况下,所属技术领域的技术人员当可根据本发明作出各种相应的改变和变形,但这些相应的改变和变形都应属于本发明的权利要求的保护范围。
SEQ ID No.1:
肝素酶II(Heparinase II)的氨基酸序列:
MKRQLYLYVIFVVVELMVFTTKGYSQTKADVVWKDVDGVSMPIPPKTHPRLYLREQQVPD LKNRMNDPKLKKVWADMIKMQEDWKPADIPEVKDFRFYFNQKGLTVRVELMALNYLMTKD PKVGREAITSIIDTLETATFKPAGDISRGIGLFMVTGAIVYDWCYDQLKPEEKTRFVKAF VRLAKMLECGYPPVKDKSIVGHASEWMIMRDLLSVGIAIYDEFPEMYNLAAGRFFKEHLV ARNWFYPSHNYHQGMSYLNVRFTNDLFALWILDRMGAGNVFNPGQQFILYDAIYKRRPDG QILAGGDVDYSRKKPKYYTMPALLAGSYYKDEYLNYEFLKDPNVEPHCKLFEFLWRDTQL GSRKPDDLPLSRYSGSPFGWMIARTGWGPESVIAEMKVNEYSFLNHQHQDAGAFQIYYKG PLAIDAGSYTGSSGGYNSPHNKNFFKRTIAHNSLLIYDPKETFSSSGYGGSDHTDFAAND GGQRLPGKGWIAPRDLKEMLAGDFRTGKILAQGFGPDNQTPDYTYLKGDITAAYSAKVKE VKRSFLFLNLKDAKVPAAMIVFDKVVASNPDFKKFWLLHSIEQPEIKGNQITIKRTKNGD SGMLVNTALLPDAANSNITSIGGKGKDFWVFGTNYTNDPKPGTDEALERGEWRVEITPKK AAAEDYYLNVIQIADNTQQKLHEVKRIDGDKVVGVQLADRIVTFSKTSETVDRPFGFSVV GKGTFKFVMTDLLPGTWQVLKDGKILYPALSAKGDDGALYFEGTEGTYRFLR SEQ ID No.2:
肝素酶II(Heparinase II)DNA:
CAAACAAAAGCAGACGTAGTATGGAAAGACGTAGACGGAGTAAGCATGCCAATACCACCA AAAACACACCCAAGACTATACCTAAGAGAACAACAAGTACCAGACCTAAAAAACAGAATG AACGACCCAAAACTAAAAAAAGTATGGGCAGACATGATAAAAATGCAAGAAGACTGGAAA CCAGCAGACATACCAGAAGTAAAAGACTTCAGATTCTACTTCAACCAAAAAGGACTAACA GTAAGAGTAGAACTAATGGCACTAAACTACCTAATGACAAAAGACCCAAAAGTAGGAAGA GAAGCAATAACAAGCATAATAGACACACTAGAAACAGCAACATTCAAACCAGCAGGAGAC ATAAGCAGAGGAATAGGACTATTCATGGTAACAGGAGCAATAGTATACGACTGGTGCTAC GACCAACTAAAACCAGAAGAAAAAACAAGATTCGTAAAAGCATTCGTAAGACTAGCAAAA ATGCTAGAATGCGGATACCCACCAGTAAAAGACAAAAGCATAGTAGGACACGCAAGCGAA TGGATGATAATGAGAGACCTACTAAGCGTAGGAATAGCAATATACGACGAATTCCCAGAA ATGTACAACCTAGCAGCAGGAAGATTCTTCAAAGAACACCTAGTAGCAAGAAACTGGTTC TACCCAAGCCACAACTACCACCAAGGAATGAGCTACCTAAACGTAAGATTCACAAACGAC CTATTCGCACTATGGATACTAGACAGAATGGGAGCAGGAAACGTATTCAACCCAGGACAA CAATTCATACTATACGACGCAATATACAAAAGAAGACCAGACGGACAAATACTAGCAGGA GGAGACGTAGACTACAGCAGAAAAAAACCAAAATACTACACAATGCCAGCACTACTAGCA GGAAGCTACTACAAAGACGAATACCTAAACTACGAATTCCTAAAAGACCCAAACGTAGAA CCACACTGCAAACTATTCGAATTCCTATGGAGAGACACACAACTAGGAAGCAGAAAACCA GACGACCTACCACTAAGCAGATACAGCGGAAGCCCATTCGGATGGATGATAGCAAGAACA GGATGGGGACCAGAAAGCGTAATAGCAGAAATGAAAGTAAACGAATACAGCTTCCTAAAC CACCAACACCAAGACGCAGGAGCATTCCAAATATACTACAAAGGACCACTAGCAATAGAC GCAGGAAGCTACACAGGAAGCAGCGGAGGATACAACAGCCCACACAACAAAAACTTCTTC AAAAGAACAATAGCACACAACAGCCTACTAATATACGACCCAAAAGAAACATTCAGCAGC AGCGGATACGGAGGAAGCGACCACACAGACTTCGCAGCAAACGACGGAGGACAAAGACTA CCAGGAAAAGGATGGATAGCACCAAGAGACCTAAAAGAAATGCTAGCAGGAGACTTCAGA ACAGGAAAAATACTAGCACAAGGATTCGGACCAGACAACCAAACACCAGACTACACATAC CTAAAAGGAGACATAACAGCAGCATACAGCGCAAAAGTAAAAGAAGTAAAAAGAAGCTTC CTATTCCTAAACCTAAAAGACGCAAAAGTACCAGCAGCAATGATAGTATTCGACAAAGTA GTAGCAAGCAACCCAGACTTCAAAAAATTCTGGCTACTACACAGCATAGAACAACCAGAA ATAAAAGGAAACCAAATAACAATAAAAAGAACAAAAAACGGAGACAGCGGAATGCTAGTA AACACAGCACTACTACCAGACGCAGCAAACAGCAACATAACAAGCATAGGAGGAAAAGGA AAAGACTTCTGGGTATTCGGAACAAACTACACAAACGACCCAAAACCAGGAACAGACGAA GCACTAGAAAGAGGAGAATGGAGAGTAGAAATAACACCAAAAAAAGCAGCAGCAGAAGAC TACTACCTAAACGTAATACAAATAGCAGACAACACACAACAAAAACTACACGAAGTAAAA AGAATAGACGGAGACAAAGTAGTAGGAGTACAACTAGCAGACAGAATAGTAACATTCAGC AAAACAAGCGAAACAGTAGACAGACCATTCGGATTCAGCGTAGTAGGAAAAGGAACATTC AAATTCGTAATGACAGACCTACTACCAGGAACATGGCAAGTACTAAAAGACGGAAAAATACTATACCCAGCACTAAGCGCAAAAGGAGACGACGGAGCACTATACTTCGAAGGAACAGAAGGAACATACAGATTCCTAAGATAG
SEQ ID No.3:
SUMO-肝素酶II融合蛋白的氨基酸序列(SUMO-Heparinase II): MGHHHHHHMSDSEVNQEAKPEVKPEVKPETHINLKVSDGSSEIFFKIKKTTPLRRLMEAFAKR QGKEMDSLRFLYDGIRIQADQTPEDLDMEDNDIIEAHREQIGGQGSQTKADVVWKDVDGVSMP IPPKTHPRLYLREQQVPDLKNRMNDPKLKKVWADMIKMQEDWKPADIPEVKDFRFYFNQKGLT VRVELMALNYLMTKDPKVGREAITSIIDTLETATFKPAGDISRGIGLFMVTGAIVYDWCYDQL KPEEKTRFVKAFVRLAKMLECGYPPVKDKSIVGHASEWMIMRDLLSVGIAIYDEFPEMYNLAA GRFFKEHLVARNWFYPSHNYHQGMSYLNVRFTNDLFALWILDRMGAGNVFNPGQQFILYDAIY KRRPDGQILAGGDVDYSRKKPKYYTMPALLAGSYYKDEYLNYEFLKDPNVEPHCKLFEFLWRD TQLGSRKPDDLPLSRYSGSPFGWMIARTGWGPESVIAEMKVNEYSFLNHQHQDAGAFQIYYKG PLAIDAGSYTGSSGGYNSPHNKNFFKRTIAHNSLLIYDPKETFSSSGYGGSDHTDFAANDGGQ RLPGKGWIAPRDLKEMLAGDFRTGKILAQGFGPDNQTPDYTYLKGDITAAYSAKVKEVKRSFL FLNLKDAKVPAAMIVFDKVVASNPDFKKFWLLHSIEQPEIKGNQITIKRTKNGDSGMLVNTAL LPDAANSNITSIGGKGKDFWVFGTNYTNDPKPGTDEALERGEWRVEITPKKAAAEDYYLNVIQ IADNTQQKLHEVKRIDGDKVVGVQLADRIVTFSKTSETVDRPFGFSVVGKGTFKFVMTDLLPG TWQVLKDGKILYPALSAKGDDGALYFEGTEGTYRFLR。
序列表
<110> 上海宝维医药技术有限公司
<120> 一种利用SUMO融合表达系统制备重组肝素酶II的方法及其所制备的SUMO_肝素酶II融合蛋白
<130> 2020
<160> 3
<170> SIPOSequenceListing 1.0
<210> 1
<211> 772
<212> PRT
<213> 人工序列()
<400> 1
Met Lys Arg Gln Leu Tyr Leu Tyr Val Ile Phe Val Val Val Glu Leu
1 5 10 15
Met Val Phe Thr Thr Lys Gly Tyr Ser Gln Thr Lys Ala Asp Val Val
20 25 30
Trp Lys Asp Val Asp Gly Val Ser Met Pro Ile Pro Pro Lys Thr His
35 40 45
Pro Arg Leu Tyr Leu Arg Glu Gln Gln Val Pro Asp Leu Lys Asn Arg
50 55 60
Met Asn Asp Pro Lys Leu Lys Lys Val Trp Ala Asp Met Ile Lys Met
65 70 75 80
Gln Glu Asp Trp Lys Pro Ala Asp Ile Pro Glu Val Lys Asp Phe Arg
85 90 95
Phe Tyr Phe Asn Gln Lys Gly Leu Thr Val Arg Val Glu Leu Met Ala
100 105 110
Leu Asn Tyr Leu Met Thr Lys Asp Pro Lys Val Gly Arg Glu Ala Ile
115 120 125
Thr Ser Ile Ile Asp Thr Leu Glu Thr Ala Thr Phe Lys Pro Ala Gly
130 135 140
Asp Ile Ser Arg Gly Ile Gly Leu Phe Met Val Thr Gly Ala Ile Val
145 150 155 160
Tyr Asp Trp Cys Tyr Asp Gln Leu Lys Pro Glu Glu Lys Thr Arg Phe
165 170 175
Val Lys Ala Phe Val Arg Leu Ala Lys Met Leu Glu Cys Gly Tyr Pro
180 185 190
Pro Val Lys Asp Lys Ser Ile Val Gly His Ala Ser Glu Trp Met Ile
195 200 205
Met Arg Asp Leu Leu Ser Val Gly Ile Ala Ile Tyr Asp Glu Phe Pro
210 215 220
Glu Met Tyr Asn Leu Ala Ala Gly Arg Phe Phe Lys Glu His Leu Val
225 230 235 240
Ala Arg Asn Trp Phe Tyr Pro Ser His Asn Tyr His Gln Gly Met Ser
245 250 255
Tyr Leu Asn Val Arg Phe Thr Asn Asp Leu Phe Ala Leu Trp Ile Leu
260 265 270
Asp Arg Met Gly Ala Gly Asn Val Phe Asn Pro Gly Gln Gln Phe Ile
275 280 285
Leu Tyr Asp Ala Ile Tyr Lys Arg Arg Pro Asp Gly Gln Ile Leu Ala
290 295 300
Gly Gly Asp Val Asp Tyr Ser Arg Lys Lys Pro Lys Tyr Tyr Thr Met
305 310 315 320
Pro Ala Leu Leu Ala Gly Ser Tyr Tyr Lys Asp Glu Tyr Leu Asn Tyr
325 330 335
Glu Phe Leu Lys Asp Pro Asn Val Glu Pro His Cys Lys Leu Phe Glu
340 345 350
Phe Leu Trp Arg Asp Thr Gln Leu Gly Ser Arg Lys Pro Asp Asp Leu
355 360 365
Pro Leu Ser Arg Tyr Ser Gly Ser Pro Phe Gly Trp Met Ile Ala Arg
370 375 380
Thr Gly Trp Gly Pro Glu Ser Val Ile Ala Glu Met Lys Val Asn Glu
385 390 395 400
Tyr Ser Phe Leu Asn His Gln His Gln Asp Ala Gly Ala Phe Gln Ile
405 410 415
Tyr Tyr Lys Gly Pro Leu Ala Ile Asp Ala Gly Ser Tyr Thr Gly Ser
420 425 430
Ser Gly Gly Tyr Asn Ser Pro His Asn Lys Asn Phe Phe Lys Arg Thr
435 440 445
Ile Ala His Asn Ser Leu Leu Ile Tyr Asp Pro Lys Glu Thr Phe Ser
450 455 460
Ser Ser Gly Tyr Gly Gly Ser Asp His Thr Asp Phe Ala Ala Asn Asp
465 470 475 480
Gly Gly Gln Arg Leu Pro Gly Lys Gly Trp Ile Ala Pro Arg Asp Leu
485 490 495
Lys Glu Met Leu Ala Gly Asp Phe Arg Thr Gly Lys Ile Leu Ala Gln
500 505 510
Gly Phe Gly Pro Asp Asn Gln Thr Pro Asp Tyr Thr Tyr Leu Lys Gly
515 520 525
Asp Ile Thr Ala Ala Tyr Ser Ala Lys Val Lys Glu Val Lys Arg Ser
530 535 540
Phe Leu Phe Leu Asn Leu Lys Asp Ala Lys Val Pro Ala Ala Met Ile
545 550 555 560
Val Phe Asp Lys Val Val Ala Ser Asn Pro Asp Phe Lys Lys Phe Trp
565 570 575
Leu Leu His Ser Ile Glu Gln Pro Glu Ile Lys Gly Asn Gln Ile Thr
580 585 590
Ile Lys Arg Thr Lys Asn Gly Asp Ser Gly Met Leu Val Asn Thr Ala
595 600 605
Leu Leu Pro Asp Ala Ala Asn Ser Asn Ile Thr Ser Ile Gly Gly Lys
610 615 620
Gly Lys Asp Phe Trp Val Phe Gly Thr Asn Tyr Thr Asn Asp Pro Lys
625 630 635 640
Pro Gly Thr Asp Glu Ala Leu Glu Arg Gly Glu Trp Arg Val Glu Ile
645 650 655
Thr Pro Lys Lys Ala Ala Ala Glu Asp Tyr Tyr Leu Asn Val Ile Gln
660 665 670
Ile Ala Asp Asn Thr Gln Gln Lys Leu His Glu Val Lys Arg Ile Asp
675 680 685
Gly Asp Lys Val Val Gly Val Gln Leu Ala Asp Arg Ile Val Thr Phe
690 695 700
Ser Lys Thr Ser Glu Thr Val Asp Arg Pro Phe Gly Phe Ser Val Val
705 710 715 720
Gly Lys Gly Thr Phe Lys Phe Val Met Thr Asp Leu Leu Pro Gly Thr
725 730 735
Trp Gln Val Leu Lys Asp Gly Lys Ile Leu Tyr Pro Ala Leu Ser Ala
740 745 750
Lys Gly Asp Asp Gly Ala Leu Tyr Phe Glu Gly Thr Glu Gly Thr Tyr
755 760 765
Arg Phe Leu Arg
770
<210> 2
<211> 2244
<212> DNA
<213> 人工序列()
<400> 2
caaacaaaag cagacgtagt atggaaagac gtagacggag taagcatgcc aataccacca 60
aaaacacacc caagactata cctaagagaa caacaagtac cagacctaaa aaacagaatg 120
aacgacccaa aactaaaaaa agtatgggca gacatgataa aaatgcaaga agactggaaa 180
ccagcagaca taccagaagt aaaagacttc agattctact tcaaccaaaa aggactaaca 240
gtaagagtag aactaatggc actaaactac ctaatgacaa aagacccaaa agtaggaaga 300
gaagcaataa caagcataat agacacacta gaaacagcaa cattcaaacc agcaggagac 360
ataagcagag gaataggact attcatggta acaggagcaa tagtatacga ctggtgctac 420
gaccaactaa aaccagaaga aaaaacaaga ttcgtaaaag cattcgtaag actagcaaaa 480
atgctagaat gcggataccc accagtaaaa gacaaaagca tagtaggaca cgcaagcgaa 540
tggatgataa tgagagacct actaagcgta ggaatagcaa tatacgacga attcccagaa 600
atgtacaacc tagcagcagg aagattcttc aaagaacacc tagtagcaag aaactggttc 660
tacccaagcc acaactacca ccaaggaatg agctacctaa acgtaagatt cacaaacgac 720
ctattcgcac tatggatact agacagaatg ggagcaggaa acgtattcaa cccaggacaa 780
caattcatac tatacgacgc aatatacaaa agaagaccag acggacaaat actagcagga 840
ggagacgtag actacagcag aaaaaaacca aaatactaca caatgccagc actactagca 900
ggaagctact acaaagacga atacctaaac tacgaattcc taaaagaccc aaacgtagaa 960
ccacactgca aactattcga attcctatgg agagacacac aactaggaag cagaaaacca 1020
gacgacctac cactaagcag atacagcgga agcccattcg gatggatgat agcaagaaca 1080
ggatggggac cagaaagcgt aatagcagaa atgaaagtaa acgaatacag cttcctaaac 1140
caccaacacc aagacgcagg agcattccaa atatactaca aaggaccact agcaatagac 1200
gcaggaagct acacaggaag cagcggagga tacaacagcc cacacaacaa aaacttcttc 1260
aaaagaacaa tagcacacaa cagcctacta atatacgacc caaaagaaac attcagcagc 1320
agcggatacg gaggaagcga ccacacagac ttcgcagcaa acgacggagg acaaagacta 1380
ccaggaaaag gatggatagc accaagagac ctaaaagaaa tgctagcagg agacttcaga 1440
acaggaaaaa tactagcaca aggattcgga ccagacaacc aaacaccaga ctacacatac 1500
ctaaaaggag acataacagc agcatacagc gcaaaagtaa aagaagtaaa aagaagcttc 1560
ctattcctaa acctaaaaga cgcaaaagta ccagcagcaa tgatagtatt cgacaaagta 1620
gtagcaagca acccagactt caaaaaattc tggctactac acagcataga acaaccagaa 1680
ataaaaggaa accaaataac aataaaaaga acaaaaaacg gagacagcgg aatgctagta 1740
aacacagcac tactaccaga cgcagcaaac agcaacataa caagcatagg aggaaaagga 1800
aaagacttct gggtattcgg aacaaactac acaaacgacc caaaaccagg aacagacgaa 1860
gcactagaaa gaggagaatg gagagtagaa ataacaccaa aaaaagcagc agcagaagac 1920
tactacctaa acgtaataca aatagcagac aacacacaac aaaaactaca cgaagtaaaa 1980
agaatagacg gagacaaagt agtaggagta caactagcag acagaatagt aacattcagc 2040
aaaacaagcg aaacagtaga cagaccattc ggattcagcg tagtaggaaa aggaacattc 2100
aaattcgtaa tgacagacct actaccagga acatggcaag tactaaaaga cggaaaaata 2160
ctatacccag cactaagcgc aaaaggagac gacggagcac tatacttcga aggaacagaa 2220
ggaacataca gattcctaag atag 2244
<210> 3
<211> 856
<212> PRT
<213> 人工序列()
<400> 3
Met Gly His His His His His His Met Ser Asp Ser Glu Val Asn Gln
1 5 10 15
Glu Ala Lys Pro Glu Val Lys Pro Glu Val Lys Pro Glu Thr His Ile
20 25 30
Asn Leu Lys Val Ser Asp Gly Ser Ser Glu Ile Phe Phe Lys Ile Lys
35 40 45
Lys Thr Thr Pro Leu Arg Arg Leu Met Glu Ala Phe Ala Lys Arg Gln
50 55 60
Gly Lys Glu Met Asp Ser Leu Arg Phe Leu Tyr Asp Gly Ile Arg Ile
65 70 75 80
Gln Ala Asp Gln Thr Pro Glu Asp Leu Asp Met Glu Asp Asn Asp Ile
85 90 95
Ile Glu Ala His Arg Glu Gln Ile Gly Gly Gln Gly Ser Gln Thr Lys
100 105 110
Ala Asp Val Val Trp Lys Asp Val Asp Gly Val Ser Met Pro Ile Pro
115 120 125
Pro Lys Thr His Pro Arg Leu Tyr Leu Arg Glu Gln Gln Val Pro Asp
130 135 140
Leu Lys Asn Arg Met Asn Asp Pro Lys Leu Lys Lys Val Trp Ala Asp
145 150 155 160
Met Ile Lys Met Gln Glu Asp Trp Lys Pro Ala Asp Ile Pro Glu Val
165 170 175
Lys Asp Phe Arg Phe Tyr Phe Asn Gln Lys Gly Leu Thr Val Arg Val
180 185 190
Glu Leu Met Ala Leu Asn Tyr Leu Met Thr Lys Asp Pro Lys Val Gly
195 200 205
Arg Glu Ala Ile Thr Ser Ile Ile Asp Thr Leu Glu Thr Ala Thr Phe
210 215 220
Lys Pro Ala Gly Asp Ile Ser Arg Gly Ile Gly Leu Phe Met Val Thr
225 230 235 240
Gly Ala Ile Val Tyr Asp Trp Cys Tyr Asp Gln Leu Lys Pro Glu Glu
245 250 255
Lys Thr Arg Phe Val Lys Ala Phe Val Arg Leu Ala Lys Met Leu Glu
260 265 270
Cys Gly Tyr Pro Pro Val Lys Asp Lys Ser Ile Val Gly His Ala Ser
275 280 285
Glu Trp Met Ile Met Arg Asp Leu Leu Ser Val Gly Ile Ala Ile Tyr
290 295 300
Asp Glu Phe Pro Glu Met Tyr Asn Leu Ala Ala Gly Arg Phe Phe Lys
305 310 315 320
Glu His Leu Val Ala Arg Asn Trp Phe Tyr Pro Ser His Asn Tyr His
325 330 335
Gln Gly Met Ser Tyr Leu Asn Val Arg Phe Thr Asn Asp Leu Phe Ala
340 345 350
Leu Trp Ile Leu Asp Arg Met Gly Ala Gly Asn Val Phe Asn Pro Gly
355 360 365
Gln Gln Phe Ile Leu Tyr Asp Ala Ile Tyr Lys Arg Arg Pro Asp Gly
370 375 380
Gln Ile Leu Ala Gly Gly Asp Val Asp Tyr Ser Arg Lys Lys Pro Lys
385 390 395 400
Tyr Tyr Thr Met Pro Ala Leu Leu Ala Gly Ser Tyr Tyr Lys Asp Glu
405 410 415
Tyr Leu Asn Tyr Glu Phe Leu Lys Asp Pro Asn Val Glu Pro His Cys
420 425 430
Lys Leu Phe Glu Phe Leu Trp Arg Asp Thr Gln Leu Gly Ser Arg Lys
435 440 445
Pro Asp Asp Leu Pro Leu Ser Arg Tyr Ser Gly Ser Pro Phe Gly Trp
450 455 460
Met Ile Ala Arg Thr Gly Trp Gly Pro Glu Ser Val Ile Ala Glu Met
465 470 475 480
Lys Val Asn Glu Tyr Ser Phe Leu Asn His Gln His Gln Asp Ala Gly
485 490 495
Ala Phe Gln Ile Tyr Tyr Lys Gly Pro Leu Ala Ile Asp Ala Gly Ser
500 505 510
Tyr Thr Gly Ser Ser Gly Gly Tyr Asn Ser Pro His Asn Lys Asn Phe
515 520 525
Phe Lys Arg Thr Ile Ala His Asn Ser Leu Leu Ile Tyr Asp Pro Lys
530 535 540
Glu Thr Phe Ser Ser Ser Gly Tyr Gly Gly Ser Asp His Thr Asp Phe
545 550 555 560
Ala Ala Asn Asp Gly Gly Gln Arg Leu Pro Gly Lys Gly Trp Ile Ala
565 570 575
Pro Arg Asp Leu Lys Glu Met Leu Ala Gly Asp Phe Arg Thr Gly Lys
580 585 590
Ile Leu Ala Gln Gly Phe Gly Pro Asp Asn Gln Thr Pro Asp Tyr Thr
595 600 605
Tyr Leu Lys Gly Asp Ile Thr Ala Ala Tyr Ser Ala Lys Val Lys Glu
610 615 620
Val Lys Arg Ser Phe Leu Phe Leu Asn Leu Lys Asp Ala Lys Val Pro
625 630 635 640
Ala Ala Met Ile Val Phe Asp Lys Val Val Ala Ser Asn Pro Asp Phe
645 650 655
Lys Lys Phe Trp Leu Leu His Ser Ile Glu Gln Pro Glu Ile Lys Gly
660 665 670
Asn Gln Ile Thr Ile Lys Arg Thr Lys Asn Gly Asp Ser Gly Met Leu
675 680 685
Val Asn Thr Ala Leu Leu Pro Asp Ala Ala Asn Ser Asn Ile Thr Ser
690 695 700
Ile Gly Gly Lys Gly Lys Asp Phe Trp Val Phe Gly Thr Asn Tyr Thr
705 710 715 720
Asn Asp Pro Lys Pro Gly Thr Asp Glu Ala Leu Glu Arg Gly Glu Trp
725 730 735
Arg Val Glu Ile Thr Pro Lys Lys Ala Ala Ala Glu Asp Tyr Tyr Leu
740 745 750
Asn Val Ile Gln Ile Ala Asp Asn Thr Gln Gln Lys Leu His Glu Val
755 760 765
Lys Arg Ile Asp Gly Asp Lys Val Val Gly Val Gln Leu Ala Asp Arg
770 775 780
Ile Val Thr Phe Ser Lys Thr Ser Glu Thr Val Asp Arg Pro Phe Gly
785 790 795 800
Phe Ser Val Val Gly Lys Gly Thr Phe Lys Phe Val Met Thr Asp Leu
805 810 815
Leu Pro Gly Thr Trp Gln Val Leu Lys Asp Gly Lys Ile Leu Tyr Pro
820 825 830
Ala Leu Ser Ala Lys Gly Asp Asp Gly Ala Leu Tyr Phe Glu Gly Thr
835 840 845
Glu Gly Thr Tyr Arg Phe Leu Arg
850 855
Claims (8)
1.一种利用SUMO融合表达系统制备重组肝素酶II的方法,其特征在于,包括下列步骤:
S01:选择来自肝素黄杆菌(Pedobacter heparinus)的肝素酶II序列,其氨基酸序列如SEQ ID No.1所示,共772aa;
S02:去除信号肽序列,获得肝素酶II的DNA序列,如SEQ ID No.2;
S03:将所述肝素酶II的DNA序列插入带有N端SUMO蛋白标签的pSMART载体质粒;
S04:将正确的质粒转化至BL21(DE3)大肠杆菌感受态细胞中,挑取单克隆,经发酵和纯化获得SUMO_肝素酶II融合蛋白;
S05:用SUMO蛋白酶将SUMO标签蛋白从步骤S04所述的SUMO_肝素酶II融合蛋白上切割下来,即获得肝素酶II。
2.根据权利要求1所述的方法,其特征在于,还包括融合蛋白表达条件的优化步骤。
3.根据权利要求2所述的方法,其特征在于,所述融合蛋白的优化条件是融合蛋白在15℃以0.2mmol/l异丙基硫代半乳糖苷(IPTG)诱导表达。
4.根据权利要求1-3所述的方法,其特征在于,还包括融合蛋白的纯化步骤。
5.根据权利要求4所述的方法,其特征在于,所述融合蛋白的纯化步骤采用Ni柱亲和层析方法进行分离纯化。
6.根据权利要求1所述的方法,其特征在于,所述的SUMO标签蛋白从步骤S04所述的SUMO_肝素酶II融合蛋白上切割下来是包括下列步骤:将所述融合蛋白透析更换缓冲液,加入适量的SUMO蛋白酶,混匀后置于4℃反应过夜,水解后的融合蛋白样本用Ni柱亲和层析方法将带有6个组氨酸片段的SUMO蛋白标签(能结合到Ni柱上)与肝素酶II(不能结合到Ni柱上)分离。
7.权利要求1-5任一项所述的方法制备的SUMO_肝素酶II融合蛋白,其特征在于,所述融合蛋白的氨基酸序列如SEQ ID No.3。
8.权利要求1-5任一项所述的方法制备的肝素酶II。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110097134.XA CN113493780A (zh) | 2021-01-25 | 2021-01-25 | 一种利用sumo融合表达系统制备重组肝素酶ii的方法及其所制备的sumo_肝素酶ii融合蛋白 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110097134.XA CN113493780A (zh) | 2021-01-25 | 2021-01-25 | 一种利用sumo融合表达系统制备重组肝素酶ii的方法及其所制备的sumo_肝素酶ii融合蛋白 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN113493780A true CN113493780A (zh) | 2021-10-12 |
Family
ID=77995013
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110097134.XA Withdrawn CN113493780A (zh) | 2021-01-25 | 2021-01-25 | 一种利用sumo融合表达系统制备重组肝素酶ii的方法及其所制备的sumo_肝素酶ii融合蛋白 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113493780A (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114438060A (zh) * | 2022-02-18 | 2022-05-06 | 嘉兴维亚生物科技有限公司 | 一种重组改性ulp1蛋白酶及其制备方法 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101942024A (zh) * | 2010-08-20 | 2011-01-12 | 清华大学 | 一种肝素酶ⅱ融合蛋白及其编码基因与表达方法 |
CN102234640A (zh) * | 2010-04-22 | 2011-11-09 | 哈尔滨市哈科隆生物制药研究所 | 重组小分子泛素样修饰物蛋白酶及其制备方法和应用 |
CN103160487A (zh) * | 2011-12-15 | 2013-06-19 | 曹林 | 肝素酶i融合蛋白 |
-
2021
- 2021-01-25 CN CN202110097134.XA patent/CN113493780A/zh not_active Withdrawn
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102234640A (zh) * | 2010-04-22 | 2011-11-09 | 哈尔滨市哈科隆生物制药研究所 | 重组小分子泛素样修饰物蛋白酶及其制备方法和应用 |
CN101942024A (zh) * | 2010-08-20 | 2011-01-12 | 清华大学 | 一种肝素酶ⅱ融合蛋白及其编码基因与表达方法 |
CN103160487A (zh) * | 2011-12-15 | 2013-06-19 | 曹林 | 肝素酶i融合蛋白 |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114438060A (zh) * | 2022-02-18 | 2022-05-06 | 嘉兴维亚生物科技有限公司 | 一种重组改性ulp1蛋白酶及其制备方法 |
CN114438060B (zh) * | 2022-02-18 | 2023-07-21 | 嘉兴维亚生物科技有限公司 | 一种重组改性ulp1蛋白酶及其制备方法 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107245494B (zh) | Aβ42在大肠杆菌中的高效可溶性表达及纯化方法 | |
CN113025598A (zh) | 一种利用sumo融合表达系统制备重组肝素酶iii的方法及其所制备的sumo_肝素酶iii融合蛋白 | |
CN111117977B (zh) | 一种重组多肽连接酶原及其制备、激活方法与应用 | |
CN113025675B (zh) | 多肽的制备方法 | |
US20230203109A1 (en) | Fusion polypeptides for target peptide production | |
CN114107353B (zh) | 一种高效表达多肽毒素的质粒及其制备方法与应用 | |
CN113004375B (zh) | 一种高效介导重组多肽形成包涵体的小分子蛋白 | |
CN113493780A (zh) | 一种利用sumo融合表达系统制备重组肝素酶ii的方法及其所制备的sumo_肝素酶ii融合蛋白 | |
US20190016754A1 (en) | Methods of producing and purifying matrix-binding fusion proteins by ion-exchange chromatography | |
CN109055339B (zh) | Tev蛋白酶突变体、基因、生物材料、制备方法、试剂或试剂盒和应用 | |
CN108003243B (zh) | 一种纯化蛋白的方法 | |
CN107629129B (zh) | 生产和纯化多肽的方法 | |
WO2014177021A1 (zh) | 强分泌性信号肽增强小肽模序及其应用 | |
CN107746432B (zh) | 一种Aβ42修饰蛋白及其表达与纯化方法 | |
CN113249353B (zh) | 一种n糖基转移酶突变体f13及其应用 | |
CN112980820A (zh) | 一种利用sumo融合表达系统制备重组肝素酶i的方法及其所制备的sumo_肝素酶i融合蛋白 | |
CN113249352B (zh) | 一种n糖基转移酶突变体p1及其应用 | |
CN112458106A (zh) | 一种多拷贝金鲳鲜味肽的制备方法及表达载体和重组菌 | |
CN113151227B (zh) | 一种蛋白酶基因及其异源表达 | |
CN117362451B (zh) | 一种包含tev蛋白酶的融合蛋白、制备方法及其应用 | |
CN117801123B (zh) | 沃索利肽可溶性中间体、中间体制备方法及沃索利肽的制备方法 | |
CN104945488B (zh) | 一种具有免疫球蛋白结合能力的多肽 | |
CN114854782B (zh) | 一种高效表达具有高活性的重组多肽连接酶原的方法 | |
CN100363497C (zh) | 基因重组技术制备pgrp(31-98)片段的方法 | |
CN110511951B (zh) | Plb蛋白在构建具有类伴侣样蛋白作用的融合蛋白表达载体中的应用 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WW01 | Invention patent application withdrawn after publication | ||
WW01 | Invention patent application withdrawn after publication |
Application publication date: 20211012 |