CN105838694A - 一种融合标签蛋白 - Google Patents
一种融合标签蛋白 Download PDFInfo
- Publication number
- CN105838694A CN105838694A CN201610331686.1A CN201610331686A CN105838694A CN 105838694 A CN105838694 A CN 105838694A CN 201610331686 A CN201610331686 A CN 201610331686A CN 105838694 A CN105838694 A CN 105838694A
- Authority
- CN
- China
- Prior art keywords
- ala
- gly
- thr
- asp
- glu
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 83
- 230000004927 fusion Effects 0.000 title claims abstract description 61
- 102000004169 proteins and genes Human genes 0.000 title abstract description 64
- 230000014509 gene expression Effects 0.000 claims abstract description 68
- 150000001413 amino acids Chemical class 0.000 claims abstract description 27
- 102000005744 Glycoside Hydrolases Human genes 0.000 claims abstract description 25
- 108010031186 Glycoside Hydrolases Proteins 0.000 claims abstract description 25
- 102000037865 fusion proteins Human genes 0.000 claims abstract description 24
- 108020001507 fusion proteins Proteins 0.000 claims abstract description 24
- 241000894006 Bacteria Species 0.000 claims description 10
- 230000000903 blocking effect Effects 0.000 claims description 6
- 125000003275 alpha amino acid group Chemical group 0.000 claims 5
- QGZKDVFQNNGYKY-UHFFFAOYSA-N Ammonia Chemical compound N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 claims 2
- 239000002253 acid Substances 0.000 claims 1
- 229910021529 ammonia Inorganic materials 0.000 claims 1
- 230000009261 transgenic effect Effects 0.000 claims 1
- 230000000694 effects Effects 0.000 abstract description 12
- 108010038633 aspartylglutamate Proteins 0.000 description 32
- 108010093581 aspartyl-proline Proteins 0.000 description 18
- 102000005789 Vascular Endothelial Growth Factors Human genes 0.000 description 17
- 108010019530 Vascular Endothelial Growth Factors Proteins 0.000 description 17
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 14
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 14
- 238000000034 method Methods 0.000 description 14
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 12
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 12
- PIXQDIGKDNNOOV-GUBZILKMSA-N Ala-Lys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O PIXQDIGKDNNOOV-GUBZILKMSA-N 0.000 description 12
- MCYJBCKCAPERSE-FXQIFTODSA-N Arg-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N MCYJBCKCAPERSE-FXQIFTODSA-N 0.000 description 12
- OCDJOVKIUJVUMO-SRVKXCTJSA-N Arg-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N OCDJOVKIUJVUMO-SRVKXCTJSA-N 0.000 description 12
- AZHXYLJRGVMQKW-UMPQAUOISA-N Arg-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCCN=C(N)N)N)O AZHXYLJRGVMQKW-UMPQAUOISA-N 0.000 description 12
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 12
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 12
- ZNYKKCADEQAZKA-FXQIFTODSA-N Asn-Ser-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O ZNYKKCADEQAZKA-FXQIFTODSA-N 0.000 description 12
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 12
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 12
- DTMLKCYOQKZXKZ-HJGDQZAQSA-N Gln-Arg-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DTMLKCYOQKZXKZ-HJGDQZAQSA-N 0.000 description 12
- JKPGHIQCHIIRMS-AVGNSLFASA-N Gln-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N JKPGHIQCHIIRMS-AVGNSLFASA-N 0.000 description 12
- UEILCTONAMOGBR-RWRJDSDZSA-N Gln-Thr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UEILCTONAMOGBR-RWRJDSDZSA-N 0.000 description 12
- UBRQJXFDVZNYJP-AVGNSLFASA-N Gln-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UBRQJXFDVZNYJP-AVGNSLFASA-N 0.000 description 12
- VPKBCVUDBNINAH-GARJFASQSA-N Glu-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VPKBCVUDBNINAH-GARJFASQSA-N 0.000 description 12
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 12
- GFLQTABMFBXRIY-GUBZILKMSA-N Glu-Gln-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GFLQTABMFBXRIY-GUBZILKMSA-N 0.000 description 12
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 12
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 12
- XIJOPMSILDNVNJ-ZVZYQTTQSA-N Glu-Val-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XIJOPMSILDNVNJ-ZVZYQTTQSA-N 0.000 description 12
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 12
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 12
- UMBDRSMLCUYIRI-DVJZZOLTSA-N Gly-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN)O UMBDRSMLCUYIRI-DVJZZOLTSA-N 0.000 description 12
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 12
- MJNWEIMBXKKCSF-XVYDVKMFSA-N His-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N MJNWEIMBXKKCSF-XVYDVKMFSA-N 0.000 description 12
- UAQSZXGJGLHMNV-XEGUGMAKSA-N Ile-Gly-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N UAQSZXGJGLHMNV-XEGUGMAKSA-N 0.000 description 12
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 12
- SAVXZJYTTQQQDD-QEWYBTABSA-N Ile-Phe-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SAVXZJYTTQQQDD-QEWYBTABSA-N 0.000 description 12
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 12
- 241000880493 Leptailurus serval Species 0.000 description 12
- GAOJCVKPIGHTGO-UWVGGRQHSA-N Lys-Arg-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O GAOJCVKPIGHTGO-UWVGGRQHSA-N 0.000 description 12
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 12
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 12
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 description 12
- CZQZSMJXFGGBHM-KKUMJFAQSA-N Phe-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O CZQZSMJXFGGBHM-KKUMJFAQSA-N 0.000 description 12
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 12
- VFDRDMOMHBJGKD-UFYCRDLUSA-N Phe-Tyr-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N VFDRDMOMHBJGKD-UFYCRDLUSA-N 0.000 description 12
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 12
- QVIZLAUEAMQKGS-GUBZILKMSA-N Pro-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 QVIZLAUEAMQKGS-GUBZILKMSA-N 0.000 description 12
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 12
- QUBVFEANYYWBTM-VEVYYDQMSA-N Pro-Thr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUBVFEANYYWBTM-VEVYYDQMSA-N 0.000 description 12
- YUSRGTQIPCJNHQ-CIUDSAMLSA-N Ser-Arg-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YUSRGTQIPCJNHQ-CIUDSAMLSA-N 0.000 description 12
- ZXLUWXWISXIFIX-ACZMJKKPSA-N Ser-Asn-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZXLUWXWISXIFIX-ACZMJKKPSA-N 0.000 description 12
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 12
- LAFLAXHTDVNVEL-WDCWCFNPSA-N Thr-Gln-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O LAFLAXHTDVNVEL-WDCWCFNPSA-N 0.000 description 12
- WDFPMSHYMRBLKM-NKIYYHGXSA-N Thr-Glu-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O WDFPMSHYMRBLKM-NKIYYHGXSA-N 0.000 description 12
- KBLYJPQSNGTDIU-LOKLDPHHSA-N Thr-Glu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O KBLYJPQSNGTDIU-LOKLDPHHSA-N 0.000 description 12
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 12
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 12
- NKUIXQOJUAEIET-AQZXSJQPSA-N Trp-Asp-Thr Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@H](O)C)C(O)=O)=CNC2=C1 NKUIXQOJUAEIET-AQZXSJQPSA-N 0.000 description 12
- XOLLWQIBBLBAHQ-WDSOQIARSA-N Trp-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O XOLLWQIBBLBAHQ-WDSOQIARSA-N 0.000 description 12
- NIHNMOSRSAYZIT-BPNCWPANSA-N Tyr-Ala-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NIHNMOSRSAYZIT-BPNCWPANSA-N 0.000 description 12
- XFEMMSGONWQACR-KJEVXHAQSA-N Tyr-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XFEMMSGONWQACR-KJEVXHAQSA-N 0.000 description 12
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 12
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 description 12
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 12
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 12
- 108010008355 arginyl-glutamine Proteins 0.000 description 12
- 108010060035 arginylproline Proteins 0.000 description 12
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 12
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 12
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 12
- 108010053725 prolylvaline Proteins 0.000 description 12
- 108010026333 seryl-proline Proteins 0.000 description 12
- 241000588724 Escherichia coli Species 0.000 description 11
- 230000009182 swimming Effects 0.000 description 11
- ZGHMRONFHDVXEF-AVGNSLFASA-N Gln-Ser-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZGHMRONFHDVXEF-AVGNSLFASA-N 0.000 description 10
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 10
- ZFVFHHZBCVNLGD-GUBZILKMSA-N Ser-His-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFVFHHZBCVNLGD-GUBZILKMSA-N 0.000 description 10
- DXNUZQGVOMCGNS-SWRJLBSHSA-N Thr-Gln-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O DXNUZQGVOMCGNS-SWRJLBSHSA-N 0.000 description 10
- 108091008605 VEGF receptors Proteins 0.000 description 10
- 239000012634 fragment Substances 0.000 description 10
- 108010051242 phenylalanylserine Proteins 0.000 description 10
- 241000282326 Felis catus Species 0.000 description 9
- 102100033177 Vascular endothelial growth factor receptor 2 Human genes 0.000 description 9
- 210000004027 cell Anatomy 0.000 description 9
- 210000003000 inclusion body Anatomy 0.000 description 9
- 238000003199 nucleic acid amplification method Methods 0.000 description 9
- 239000006228 supernatant Substances 0.000 description 9
- DXTYEWAQOXYRHZ-KKXDTOCCSA-N Ala-Phe-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N DXTYEWAQOXYRHZ-KKXDTOCCSA-N 0.000 description 8
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 8
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 8
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 8
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 8
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 8
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 8
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 8
- ZUUDNCOCILSYAM-KKHAAJSZSA-N Thr-Asp-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZUUDNCOCILSYAM-KKHAAJSZSA-N 0.000 description 8
- 230000003321 amplification Effects 0.000 description 8
- 108010047857 aspartylglycine Proteins 0.000 description 8
- 238000010586 diagram Methods 0.000 description 8
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 8
- 239000001963 growth medium Substances 0.000 description 8
- WQPDUTSPKFMPDP-OUMQNGNKSA-N hirudin Chemical compound C([C@@H](C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C=CC(OS(O)(=O)=O)=CC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CCCCN)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H]1NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(O)=O)NC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]2CSSC[C@@H](C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(=O)N[C@H](C(NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N2)=O)CSSC1)C(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H]1NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@H](CO)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CC=2C=CC(O)=CC=2)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)C(C)C)[C@@H](C)O)CSSC1)C(C)C)[C@@H](C)O)[C@@H](C)O)C1=CC=CC=C1 WQPDUTSPKFMPDP-OUMQNGNKSA-N 0.000 description 8
- 108091008146 restriction endonucleases Proteins 0.000 description 8
- 102000004190 Enzymes Human genes 0.000 description 7
- 108090000790 Enzymes Proteins 0.000 description 7
- 108010007267 Hirudins Proteins 0.000 description 7
- 102000007625 Hirudins Human genes 0.000 description 7
- 241000710799 Rubella virus Species 0.000 description 7
- 239000000427 antigen Substances 0.000 description 7
- 102000036639 antigens Human genes 0.000 description 7
- 108091007433 antigens Proteins 0.000 description 7
- 238000006243 chemical reaction Methods 0.000 description 7
- 230000002068 genetic effect Effects 0.000 description 7
- 229940006607 hirudin Drugs 0.000 description 7
- 230000003248 secreting effect Effects 0.000 description 7
- HUUOZYZWNCXTFK-INTQDDNPSA-N Ala-His-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N HUUOZYZWNCXTFK-INTQDDNPSA-N 0.000 description 6
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 6
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 6
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 6
- 108010013369 Enteropeptidase Proteins 0.000 description 6
- RMOCFPBLHAOTDU-ACZMJKKPSA-N Gln-Asn-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RMOCFPBLHAOTDU-ACZMJKKPSA-N 0.000 description 6
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 6
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 6
- CKONPJHGMIDMJP-IHRRRGAJSA-N His-Val-His Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 CKONPJHGMIDMJP-IHRRRGAJSA-N 0.000 description 6
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 description 6
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 6
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 6
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 6
- VHWOBXIWBDWZHK-IHRRRGAJSA-N Phe-Arg-Asp Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 VHWOBXIWBDWZHK-IHRRRGAJSA-N 0.000 description 6
- WIVCOAKLPICYGY-KKUMJFAQSA-N Phe-Asp-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N WIVCOAKLPICYGY-KKUMJFAQSA-N 0.000 description 6
- SPLBRAKYXGOFSO-UNQGMJICSA-N Pro-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@@H]2CCCN2)O SPLBRAKYXGOFSO-UNQGMJICSA-N 0.000 description 6
- 108010076504 Protein Sorting Signals Proteins 0.000 description 6
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 6
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 6
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 6
- NRFTYDWKWGJLAR-MELADBBJSA-N Tyr-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O NRFTYDWKWGJLAR-MELADBBJSA-N 0.000 description 6
- BGFCXQXETBDEHP-BZSNNMDCSA-N Tyr-Phe-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O BGFCXQXETBDEHP-BZSNNMDCSA-N 0.000 description 6
- DJSYPCWZPNHQQE-FHWLQOOXSA-N Tyr-Tyr-Gln Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CC=C(O)C=C1 DJSYPCWZPNHQQE-FHWLQOOXSA-N 0.000 description 6
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 6
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 6
- 108010020688 glycylhistidine Proteins 0.000 description 6
- 108010018006 histidylserine Proteins 0.000 description 6
- 108010029020 prolylglycine Proteins 0.000 description 6
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 6
- 108010061238 threonyl-glycine Proteins 0.000 description 6
- 102100029727 Enteropeptidase Human genes 0.000 description 5
- 102000005720 Glutathione transferase Human genes 0.000 description 5
- 108010070675 Glutathione transferase Proteins 0.000 description 5
- 101710194906 Hirudin-3 Proteins 0.000 description 5
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 5
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 5
- 230000000890 antigenic effect Effects 0.000 description 5
- 230000029087 digestion Effects 0.000 description 5
- 230000028023 exocytosis Effects 0.000 description 5
- 238000002474 experimental method Methods 0.000 description 5
- 239000013612 plasmid Substances 0.000 description 5
- 239000003053 toxin Substances 0.000 description 5
- 231100000765 toxin Toxicity 0.000 description 5
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 4
- WTXCNOPZMQRTNN-BWBBJGPYSA-N Cys-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)O WTXCNOPZMQRTNN-BWBBJGPYSA-N 0.000 description 4
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 4
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 4
- GQUDMNDPQTXZRV-DCAQKATOSA-N Lys-Arg-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GQUDMNDPQTXZRV-DCAQKATOSA-N 0.000 description 4
- 101710175625 Maltose/maltodextrin-binding periplasmic protein Proteins 0.000 description 4
- FIRWJEJVFFGXSH-RYUDHWBXSA-N Phe-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 FIRWJEJVFFGXSH-RYUDHWBXSA-N 0.000 description 4
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 4
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 4
- GFZQWWDXJVGEMW-ULQDDVLXSA-N Tyr-Arg-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GFZQWWDXJVGEMW-ULQDDVLXSA-N 0.000 description 4
- KHUVIWRRFMPVHD-JYJNAYRXSA-N Tyr-Met-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O KHUVIWRRFMPVHD-JYJNAYRXSA-N 0.000 description 4
- 108010087302 Viral Structural Proteins Proteins 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 4
- 108010050848 glycylleucine Proteins 0.000 description 4
- 239000003550 marker Substances 0.000 description 4
- 239000002504 physiological saline solution Substances 0.000 description 4
- 239000000047 product Substances 0.000 description 4
- 230000028327 secretion Effects 0.000 description 4
- 108010048818 seryl-histidine Proteins 0.000 description 4
- 108700004896 tripeptide FEG Proteins 0.000 description 4
- 206010001052 Acute respiratory distress syndrome Diseases 0.000 description 3
- 208000013616 Respiratory Distress Syndrome Diseases 0.000 description 3
- 102000016549 Vascular Endothelial Growth Factor Receptor-2 Human genes 0.000 description 3
- 108010053099 Vascular Endothelial Growth Factor Receptor-2 Proteins 0.000 description 3
- 238000000246 agarose gel electrophoresis Methods 0.000 description 3
- 238000001962 electrophoresis Methods 0.000 description 3
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 3
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical class O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 3
- 229930027917 kanamycin Natural products 0.000 description 3
- 239000007788 liquid Substances 0.000 description 3
- 239000002773 nucleotide Substances 0.000 description 3
- 125000003729 nucleotide group Chemical group 0.000 description 3
- 235000015097 nutrients Nutrition 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 108090000765 processed proteins & peptides Proteins 0.000 description 3
- 239000000243 solution Substances 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 239000013598 vector Substances 0.000 description 3
- ODWSTKXGQGYHSH-FXQIFTODSA-N Ala-Arg-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O ODWSTKXGQGYHSH-FXQIFTODSA-N 0.000 description 2
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 2
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 2
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 2
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 2
- ADPACBMPYWJJCE-FXQIFTODSA-N Arg-Ser-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O ADPACBMPYWJJCE-FXQIFTODSA-N 0.000 description 2
- SQZIAWGBBUSSPJ-ZKWXMUAHSA-N Asn-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N SQZIAWGBBUSSPJ-ZKWXMUAHSA-N 0.000 description 2
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 2
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 2
- HCZQKHSRYHCPSD-IUKAMOBKSA-N Asn-Thr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HCZQKHSRYHCPSD-IUKAMOBKSA-N 0.000 description 2
- YSYTWUMRHSFODC-QWRGUYRKSA-N Asn-Tyr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O YSYTWUMRHSFODC-QWRGUYRKSA-N 0.000 description 2
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 2
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 2
- 108050001049 Extracellular proteins Proteins 0.000 description 2
- HHRAEXBUNGTOGZ-IHRRRGAJSA-N Gln-Phe-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O HHRAEXBUNGTOGZ-IHRRRGAJSA-N 0.000 description 2
- UZWUBBRJWFTHTD-LAEOZQHASA-N Glu-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O UZWUBBRJWFTHTD-LAEOZQHASA-N 0.000 description 2
- 241001387508 Glutamicibacter arilaitensis Species 0.000 description 2
- LERGJIVJIIODPZ-ZANVPECISA-N Gly-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)CN)C)C(O)=O)=CNC2=C1 LERGJIVJIIODPZ-ZANVPECISA-N 0.000 description 2
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 2
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 2
- CVFOYJJOZYYEPE-KBPBESRZSA-N Gly-Lys-Tyr Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CVFOYJJOZYYEPE-KBPBESRZSA-N 0.000 description 2
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 2
- JJGBXTYGTKWGAT-YUMQZZPRSA-N Gly-Pro-Glu Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O JJGBXTYGTKWGAT-YUMQZZPRSA-N 0.000 description 2
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 2
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 2
- XHVONGZZVUUORG-WEDXCCLWSA-N Gly-Thr-Lys Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN XHVONGZZVUUORG-WEDXCCLWSA-N 0.000 description 2
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 2
- ZPVJJPAIUZLSNE-DCAQKATOSA-N His-Arg-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O ZPVJJPAIUZLSNE-DCAQKATOSA-N 0.000 description 2
- QCBYAHHNOHBXIH-UWVGGRQHSA-N His-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CN=CN1 QCBYAHHNOHBXIH-UWVGGRQHSA-N 0.000 description 2
- BCSGDNGNHKBRRJ-ULQDDVLXSA-N His-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N BCSGDNGNHKBRRJ-ULQDDVLXSA-N 0.000 description 2
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 2
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 2
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 2
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 2
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 2
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 2
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 2
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 2
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 2
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 2
- WGAQWMRJUFQXMF-ZPFDUUQYSA-N Pro-Gln-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WGAQWMRJUFQXMF-ZPFDUUQYSA-N 0.000 description 2
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 2
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 2
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 2
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 2
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 2
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 2
- DCCGCVLVVSAJFK-NUMRIWBASA-N Thr-Asp-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O DCCGCVLVVSAJFK-NUMRIWBASA-N 0.000 description 2
- FHDLKMFZKRUQCE-HJGDQZAQSA-N Thr-Glu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHDLKMFZKRUQCE-HJGDQZAQSA-N 0.000 description 2
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 2
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 2
- WVVOFCVMHAXGLE-LFSVMHDDSA-N Thr-Phe-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O WVVOFCVMHAXGLE-LFSVMHDDSA-N 0.000 description 2
- KAJRRNHOVMZYBL-IRIUXVKKSA-N Thr-Tyr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAJRRNHOVMZYBL-IRIUXVKKSA-N 0.000 description 2
- OKAMOYTUQMIFJO-JBACZVJFSA-N Trp-Glu-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 OKAMOYTUQMIFJO-JBACZVJFSA-N 0.000 description 2
- MBFJIHUHHCJBSN-AVGNSLFASA-N Tyr-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MBFJIHUHHCJBSN-AVGNSLFASA-N 0.000 description 2
- CKHQKYHIZCRTAP-SOUVJXGZSA-N Tyr-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O CKHQKYHIZCRTAP-SOUVJXGZSA-N 0.000 description 2
- WDGDKHLSDIOXQC-ACRUOGEOSA-N Tyr-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WDGDKHLSDIOXQC-ACRUOGEOSA-N 0.000 description 2
- QPBJXNYYQTUTDD-KKUMJFAQSA-N Tyr-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QPBJXNYYQTUTDD-KKUMJFAQSA-N 0.000 description 2
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 2
- COSLEEOIYRPTHD-YDHLFZDLSA-N Val-Asp-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 COSLEEOIYRPTHD-YDHLFZDLSA-N 0.000 description 2
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 2
- 238000010521 absorption reaction Methods 0.000 description 2
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 2
- 238000005267 amalgamation Methods 0.000 description 2
- 230000033115 angiogenesis Effects 0.000 description 2
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 2
- 108010068265 aspartyltyrosine Proteins 0.000 description 2
- 230000033228 biological regulation Effects 0.000 description 2
- 238000003776 cleavage reaction Methods 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 239000012530 fluid Substances 0.000 description 2
- 239000003292 glue Substances 0.000 description 2
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 2
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 2
- 108010089804 glycyl-threonine Proteins 0.000 description 2
- 108010010147 glycylglutamine Proteins 0.000 description 2
- 108010015792 glycyllysine Proteins 0.000 description 2
- 206010020718 hyperplasia Diseases 0.000 description 2
- 230000006698 induction Effects 0.000 description 2
- 239000003112 inhibitor Substances 0.000 description 2
- 239000002054 inoculum Substances 0.000 description 2
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 2
- 108010009298 lysylglutamic acid Proteins 0.000 description 2
- AEDDIBAIWPIIBD-ZJKJAXBQSA-N mangiferin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1C1=C(O)C=C(OC=2C(=CC(O)=C(O)C=2)C2=O)C2=C1O AEDDIBAIWPIIBD-ZJKJAXBQSA-N 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 108020004999 messenger RNA Proteins 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- -1 mpe Proteins 0.000 description 2
- 230000001575 pathological effect Effects 0.000 description 2
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- 238000004153 renaturation Methods 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- 230000019491 signal transduction Effects 0.000 description 2
- 208000011580 syndromic disease Diseases 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 108010003137 tyrosyltyrosine Proteins 0.000 description 2
- 238000002525 ultrasonication Methods 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- 108010073969 valyllysine Proteins 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 1
- 241000606161 Chlamydia Species 0.000 description 1
- 229930091371 Fructose Natural products 0.000 description 1
- RFSUNEUAIZKAJO-ARQDHWQXSA-N Fructose Chemical compound OC[C@H]1O[C@](O)(CO)[C@@H](O)[C@@H]1O RFSUNEUAIZKAJO-ARQDHWQXSA-N 0.000 description 1
- 239000005715 Fructose Substances 0.000 description 1
- 238000012404 In vitro experiment Methods 0.000 description 1
- 102000004195 Isomerases Human genes 0.000 description 1
- 108090000769 Isomerases Proteins 0.000 description 1
- 101710164702 Major outer membrane protein Proteins 0.000 description 1
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 1
- YWQSXCGKJDUYTL-UHFFFAOYSA-N Mangiferin Natural products CC(CCC=C(C)C)C1CC(C)C2C3CCC4C(C)(C)CCCC45CC35CCC12C YWQSXCGKJDUYTL-UHFFFAOYSA-N 0.000 description 1
- 241001195348 Nusa Species 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 101710116435 Outer membrane protein Proteins 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 206010035664 Pneumonia Diseases 0.000 description 1
- 101710089165 Protein white Proteins 0.000 description 1
- RXUWDKBZZLIASQ-UHFFFAOYSA-N Puerarin Natural products OCC1OC(Oc2c(O)cc(O)c3C(=O)C(=COc23)c4ccc(O)cc4)C(O)C(O)C1O RXUWDKBZZLIASQ-UHFFFAOYSA-N 0.000 description 1
- 229920002684 Sepharose Polymers 0.000 description 1
- 102100025292 Stress-induced-phosphoprotein 1 Human genes 0.000 description 1
- 102100036407 Thioredoxin Human genes 0.000 description 1
- 102000009484 Vascular Endothelial Growth Factor Receptors Human genes 0.000 description 1
- 239000003513 alkali Substances 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 239000002585 base Substances 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- 230000030833 cell death Effects 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 230000009429 distress Effects 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 229930003935 flavonoid Natural products 0.000 description 1
- 150000002215 flavonoids Chemical class 0.000 description 1
- 235000017173 flavonoids Nutrition 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 108010010961 hirudin HV3 Proteins 0.000 description 1
- 230000036039 immunity Effects 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 239000010977 jade Substances 0.000 description 1
- 238000011031 large-scale manufacturing process Methods 0.000 description 1
- 229940043357 mangiferin Drugs 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 239000000155 melt Substances 0.000 description 1
- 238000006384 oligomerization reaction Methods 0.000 description 1
- 230000004962 physiological condition Effects 0.000 description 1
- 210000002706 plastid Anatomy 0.000 description 1
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 230000012846 protein folding Effects 0.000 description 1
- HKEAFJYKMMKDOR-VPRICQMDSA-N puerarin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1C1=C(O)C=CC(C2=O)=C1OC=C2C1=CC=C(O)C=C1 HKEAFJYKMMKDOR-VPRICQMDSA-N 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 238000000197 pyrolysis Methods 0.000 description 1
- 238000012797 qualification Methods 0.000 description 1
- 230000000630 rising effect Effects 0.000 description 1
- 239000013049 sediment Substances 0.000 description 1
- 230000007928 solubilization Effects 0.000 description 1
- 238000005063 solubilization Methods 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 108060008226 thioredoxin Proteins 0.000 description 1
- 229940094937 thioredoxin Drugs 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 231100000033 toxigenic Toxicity 0.000 description 1
- 230000001551 toxigenic effect Effects 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- 230000002477 vacuolizing effect Effects 0.000 description 1
- 229940124676 vascular endothelial growth factor receptor Drugs 0.000 description 1
- 239000000304 virulence factor Substances 0.000 description 1
- 230000007923 virulence factor Effects 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/24—Hydrolases (3) acting on glycosyl compounds (3.2)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/35—Fusion polypeptide containing a fusion for enhanced stability/folding during expression, e.g. fusions with chaperones or thioredoxin
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/10—Plasmid DNA
- C12N2800/101—Plasmid DNA for bacteria
Landscapes
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Biomedical Technology (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Medicinal Chemistry (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Peptides Or Proteins (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
本发明公开了一种融合标签蛋白,本发明的融合标签蛋白bff基于耐有机糖苷酶Fru6构建,为截断至耐有机糖苷酶Fru6N端前n个氨基酸截短体,所述n<452。本发明还提供了基于本发明的融合标签蛋白构建的融合蛋白表达系统,并选择bff312,bff209,bff217,bff243,bff350作为本发明优选的融合标签蛋白,构建融合蛋白表达系统,验证本发明融合标签蛋白的效果。本发明通过融合标签的连接能有效提高后接蛋白的表达及蛋白折叠,有效提高可溶性蛋白量。
Description
技术领域
本发明属于分子生物学技术领域,具体涉及基于耐有机糖苷酶Fru6的新型的融合标签蛋白,以及基于该融合标签的新型融合蛋白表达系统。
背景技术
大肠杆菌因其背景清晰、生长快速、表达量大,易于操作等特点仍为最常用的宿主,但是它也具有很多局限性,例如:一些异源蛋白在大肠杆菌体内表达时表达量很低,一些蛋白表达后易于降解,或者表达的外源蛋白对宿主细胞具有毒性,导致细菌生长停滞或死亡。或者因未能正确折叠而聚合在一起产生大量包涵体。事实上,大规模生产性状确切的活性蛋白仍是重组蛋白技术中的瓶颈问题。重组蛋白质异源表达中有超过半数以上是以包涵体形式存在的。对包涵体进行体外复性实验使其重新折叠的方法不仅复杂,而且难以保证变复性后蛋白结构与正常生理状态下一致。
融合标签的兴起为重组蛋白的生产带来了福音,融合标签技术在提高蛋白质的可溶性表达中得到广泛应用。
融合标签能增加重组蛋白表达量或在蛋白质折叠过程起作用,增强重组蛋白质的可溶性表达,常被用作标签蛋白与目的蛋白融合在一起表达。此类标签蛋白被称为SETs(Solubility-Enhancing Tags),这类蛋白主要有谷胱甘肽S-转移酶(GST),麦芽糖结合蛋白(MBP),硫氧还蛋白A(TrxA),转录终止抗终止因子(NusA),蛋白二硫键折叠异构酶(DsbA)等。但此类标签蛋白究竟如何帮助目的蛋白的表达和折叠,机理尚不明确,且这些SETs不能帮助所有的蛋白质表达和折叠。不同蛋白和融合标签的匹配性不同,对于某种难以表达的蛋白,我们可能需要尝试使用不同的SETs以找到合适的表达条件。
发明内容
本发明目的在于提供一种融合标签蛋白bff以及基于此标签蛋白所构建的融合蛋白表达系统。
本发明所提供的融合签蛋白bff以耐有机糖苷酶Fru6为基础构建,耐有机糖苷酶Fru6分子量约为55kDa,来源于耐有机糖苷酶Fru6产生菌Arthrobacter arilaitensisNJEM01(已公开于专利CN102732456B,保藏编号为CCTCC NO:M2012155),其氨基酸序列如序列表中SEQ ID No:1所示,其中包括长度为53个氨基酸的信号肽。信号肽具有强分泌性介导该酶分泌表达。本发明所述融合标签bff为截断至耐有机糖苷酶Fru6N端前n个氨基酸截短体。
本发明还涉及bff的编码基因,为耐有机糖苷酶Fru6N端n个氨基酸的编码基因,耐有机糖苷酶Fru6编码基因的核苷酸序列如序列表中SEQ ID No.1所示。
截短体bff在大肠杆菌中表达时可溶表达量很高而且性质稳定,故可作为一种潜在的具有增溶作用的标签蛋白。
本发明根据修正后Wilkinson和Harrison法(http://www.biotech.ou.edu/)预测不同长度的bff截短体可溶性,预测结果如图1所示,预测时要将信号肽去除后再预测,信号肽去除前长度在452以上个的氨基酸的bff截短体均为不溶,因此上述n<452。
根据预测结果,本发明选择截短体bff209,bff217,bff243,bff312,bff350作为本发明的优选融合标签蛋白,验证本发明的技术效果。所述bff209为n=209的情形,即为截断至耐有机糖苷酶Fru6N端前209个氨基酸截短体,其氨基酸序列和核苷酸序列如序列表中SEQ ID No:3-4所示。同理,bff217,bff243,bff312,bff350分别指n=217,n=243,n=312,n=350的情形,其氨基酸序列分别对应序列表中SEQ ID No:5-6、SEQ ID No:7-8、SEQ IDNo:9-10、SEQ ID No:11-12。
为进一步方便目的蛋白的表达和纯化,本发明还提供了基于所述融合标签蛋白构建的融合标签蛋白表达系统。改造表达载体pET-28a,构建含有融合标签的克隆及蛋白表达质粒载体pET-28a-bff,为后续获得天然N端的目的蛋白,在基因后面加入肠激酶酶切位点(DDDDK),在表达融合蛋白后能够切除融合标签。
本发明中应用四种药用蛋白来测试新型融合表达系统的效果,所用四种药用蛋白为衣原体外膜蛋白抗原决定簇rabOmp、血管内皮生长因子(VEGF)与其相应的受体(VEGFR家族)、社区获得性呼吸窘迫综合征毒素(MPE)蛋白和风疹病毒结构蛋白(Rub)蛋白。
衣原体外膜蛋白抗原决定簇对于研究衣原体主要外膜蛋白的抗原表位对于免疫防御和免疫诊断有重要意义。衣原体外膜蛋白抗原决定簇在大肠杆菌中通常以包涵体的形式表达,要通过复杂的复性和纯化才能获得。衣原体外膜蛋白抗原决定簇的蛋白序列长为216个氨基酸,分子量23.8kDa。
血管内皮生长因子(VEGF)与其相应的受体(VEGFR家族)是调节血管生成的关键蛋白,而VEGF/VEGFR信号通路抑制剂(如VEGF/VEGFR的抗体)对治疗病理性血管增生有较好疗效。本研究中的血管内皮生长因子受体2(VEGFR-2)是VEGFR家族中一员,蛋白序列长为275个氨基酸,分子量30.9kDa。
社区获得性呼吸窘迫综合征毒素是肺炎衣原体的致病因子,参与肺炎等疾病过程,能造成宿主细胞空泡化从而导致细胞死亡。社区获得性呼吸窘迫综合征毒素(MPE)蛋白序列长248个氨基酸,分子量为28.0kDa。
风疹病毒抗原的酶联吸附法是目前用来诊断诊断风疹病毒常用方法之一,但是由于酶联吸附法需要敏感性高、杭原性好的抗原。目前市场上生产的抗原多来源于细胞培养抗原,但是细胞培养成本高,因此重组风疹病毒抗原在原核生物中的表达有重要的现实意义。风疹病毒结构蛋白(Rub)蛋白序列长211个氨基酸,分子量为22.2kDa。
本发明提供的融合标签蛋白bff312应用于在大肠杆菌中难以表达的水蛭素时,使其获得了高效分泌表达,总表达量大幅提高,其中99%的融合蛋白可溶性表达,60%以上融合蛋白分泌至培养基中,其中胞外蛋白经肠激酶切除前后,测得水蛭素生物活性分别为13ATU/mL和21ATU/mL,可见融合截短体bff312后水蛭素3得以正确折叠而具有生物活性;本发明提供的融合标签bff312应用于单独以包涵体形式表达的血管内皮生长因子受体VEGFR-2,实现了高效可溶性表达,可溶性蛋白比例高达93%,可溶性蛋白量是0.86g/L为单独表达VEGFR-2时(0.08g/L)的11倍。
本发明提供的bff209,bff217,bff243,bff350应用于衣原体外膜蛋白抗原决定簇rabOmp,可溶性rabOmp蛋白总量均增加,优选的,融合标签bff209后可溶性蛋白比例达81%,可溶性蛋白量为0.72g/L,比单独表达提高了6倍。相比较而言,融合常见的商用融合标签麦芽糖结合蛋白(MBP)和谷胱甘肽S转移酶(GST),可溶性融合蛋白比例和可溶性蛋白量分别是25%,0.20g/L和24%,0.12g/L,融合本发明的中融合标签bff209的可溶性表达量比其融合MBP和GST的可溶性表达量提高了3.6倍和6倍。
本发明的融合表达系统在表达数种自身难以可溶表达的蛋白如水蛭素3(HV3),衣原体外膜蛋白抗原决定簇(rabOmp),血管内皮生长因子受体2(VEGFR-2),社区获得性呼吸窘迫综合征毒素(MPE),风疹病毒结构蛋白(Rub)等抗原时均获得了很高的表达量,获得了很好的表达效果。其中实现水蛭素胞外表达。
本发明提供的融合标签蛋白bff扩展了融合标签库,为融合标签的选择提供了更多的可能。
附图说明
图1是不同长度的bff截短体可溶性预测结果图。
图2是BL21/pET-ff53bff诱导表达的SDS-PAGE图谱。
图3是截短体质粒的构建过程示意图。
图4是基因扩增结果图。
图5是BL21/pET-bff513,BL21/pET-bff430诱导表达的SDS-PAGE图谱。
图6是BL21/pET-bff401,BL21/pET-bff312,BL21/pET-bff200诱导表达的SDS-PAGE图谱。
图7是重叠PCR法扩增目的基因步骤图。
图8是实施例2构建pET-bff312hv示意图。
图9a、图9b、图9c是实施例2电泳结果图;图中箭头方向为bff截短体融合蛋白的位置,图注为重组子所包含基因。
图9d是实施例2中水蛭素3(HV3),血管内皮生长因子受体2(VEGFR-2)单独表达以及融合截短体bff312融合标签时表达情况,和可溶性蛋白质浓度比例图。
图10是本发明所述新型融合标签蛋白重组载体pET-28a-bff209,pET-28a-bff217,pET-28a-bff243,pET-28a-bff350构建示意图。
图11是目的基因扩增结果图,M为2000bp长的Marker;泳道1-4分别为PCR产物bff350,bff243,bff217,和bff219。
图12是重组子菌落PCR鉴定结果图,M为2000bp长的Marker;泳道1-12为含pET-bff350,pET-bff243,pET-bff217和pET-bff219的三个重组子的菌落。
图13是目的基因扩增结果图,M为2000bp长的Marker;泳道1-3为PCR产物vegfr(NdeI,XhoI);泳道4-5为PCR产物mpe(NdeI,XhoI);泳道6-8分别为PCR产物rabomp(NdeI,XhoI),rabomp(NdeI,BamHI)和rabomp(BamHI,XhoI);泳道9为PCR产物rub(NdeI,XhoI)。
图14a是rabOmp单独表达及融合蛋白标签bff209,bff217,bff243,bff350表达结果对比图。
图14b是rabOmp单独表达及融合蛋白标签bff209,bff217,bff243,bff350表达蛋白含量可溶性蛋白浓度比例图。
图14c是vegfr单独表达及融合蛋白标签bff209,bff217,bff243,bff350表达结果对比图。
图14d是vegfr单独表达及融合蛋白标签bff209,bff217,bff243,bff350表达蛋白含量可溶性蛋白浓度比例图。
图14e是mpe单独表达及融合蛋白标签bff209,bff217,bff243,bff350表达结果对比图。
图14f是mpe单独表达及融合蛋白标签bff209,bff217,bff243,bff350表达蛋白含量可溶性蛋白浓度比例图。
图14g是rub单独表达及融合蛋白标签bff209,bff217,bff243,bff350表达结果对比图。
图14h是rub单独表达及融合蛋白标签bff209,bff217,bff243,bff350表达蛋白含量可溶性蛋白浓度比例图。
上述图中,M为蛋白Marker;C为发酵液上清,S为细胞裂解液上清,ib为细胞裂解液沉淀。
具体实施方式
下面结合附图,通过实施例具体说明本发明提供的新型融合标签蛋白的构建过程以及新型融合标签表达系统的效果,但不以任何方式限制本发明的范围。
实施例中,如实验具体条件未注明,请参照《分子克隆实验指南》(J.萨姆布鲁克D.W拉塞尔著)所述常规实验,或者仪器、试剂供应商所提供的条件。常规质粒构建所涉及的PCR、酶切、连接等实验,以及蛋白质表达所涉及的转化,细菌培养、诱导等实验为本领域研究人员所熟悉,具体实验细节在本发明中不再做详细描述,亦可参照《分子克隆实验指南》。
实施例1:融合标签bff的偶然发现
果糖苷酶Fru6分子量约为55kDa,其野生酶是一种胞外酶,能够高效合成黄酮类化合物葛根素和芒果苷,并能催化低聚果糖的生成。在生物催化、医药中间体的合成中有着广泛的应用价值。在研究糖苷酶Fru6表达模序时偶然发现,该酶有高度可溶性以及分泌性并且性质稳定。将该酶的开放式阅读框(包括自身信号肽)克隆至大肠杆菌中,可实现高效的胞外分泌表达,信号肽具有强分泌性介导该酶分泌表达,但不影响技术效果。
如图2所示为耐有机糖苷酶Fru6重组子BL21/pET-ff53bff诱导表达的SDS-PAGE图谱,显示出果糖苷酶Fru6的高效折叠能力和胞外分泌能力。
本发明所提供的新型融合签蛋白bff以耐有机糖苷酶Fru6为基础构建,来源于耐有机糖苷酶Fru6产生菌Arthrobacter arilaitensis NJEM01(已公开于专利CN102732456B,保藏编号为CCTCC NO:M2012155)。
本实施例考察了bff及其截短体自身折叠及胞外分泌能力,基于原子经济性原则以及期望获得高效有用的融合标签,随机选取了耐有机糖苷酶Fru6N端截短至513,430,401,312,200位氨基酸的截短体。以耐有机糖苷酶Fru6基因pET-ff53bff为模板,以PF,P200R,P312R,P401R,P513R,P430R为引物扩增,引物序列见表1-1,按照表1-2的PCR反应体系和表1-3的PCR反应条件扩增片段,得到的PCR产物为bff截短体基因片段bff513,bff430,bff401,bff312和bff200,如图3,得到两端分别带有Nco I和Xho I酶切位点的bff截短体基因片段。经纯化后的基因片段bff513,bff430,bff401,bff312,bff200与载体pET-28a用限制性内切酶Xho I和Nco I进行双酶切,37℃酶切8h。将这些片段通过酶切位点插入pET-28a载体中,得到表达载体pET-bff513,pET-bff430,pET-bff401,pET-bff312,pET-bff200。产物使用琼脂糖凝胶电泳验证,结果如图4,分别在约1539bp,1290bp,1203bp,936bp和600bp处有明显条带。
表1-1 引物序列
引物序列如表1-1所示,下划线处为Nco I和Xho I限制性酶切位点。
表1-2 PCR反应体系
表1-3 PCR反应条件
经测序验证正确后,提取质粒转化至E.coli BL21(DE3)感受态细胞,得到重组子BL21/pET-bff513,BL21/pET-bff430,BL21/pET-bff401,BL21/pET-bff312,BL21/pET-bff200。重组子经诱导表达6小时后进行SDS-PAGE,见图2,BL21/pET-bff513,BL21/pET-bff430,BL21/pET-bff401,BL21/pET-bff312,BL21/pET-bff200所表达bff截短体的分子量分别为51.3,42.5,39.4,29.5,16.7kDa。
图5显示,截短至513个和430个氨基酸时,分泌至培养基中的蛋白很少并且出现部分包涵体,图6显示,截短至401个和312个氨基酸时胞外表达量增加,而截短至200个氨基酸时总表达量很少,但仍有分泌。就目前几个截短体而言,截至312个氨基酸(截短体bff312)分泌至培养基中量较多,表达量也较高,能够正确折叠,并所表达的蛋白均已可溶形式表达,甚至实现胞外分泌表达,并且没有不可溶的包涵体存在。
鉴于该蛋白及其截短体的高分泌表达量和高度可溶,推测该蛋白截短体能够实现一些小分子量蛋白的分泌表达和一些通常以包涵体形式表达的药物蛋白可溶性表达。并以高效胞外分泌性的截短体bff312为例,作为融合标签进行后续实验研究。
实施例2:融合标签bff312融合水蛭素(HV3)和血管内皮生长因子受体2(VEGFR-2)质粒构建、表达和活性测定
截短体基因bff312可利用重叠PCR法与HV3/VEGFR-2(下文用hv/vegfr表述)融合。设计重叠PCR引物PF,HVR,HVF,P312HV3R,vegfrR,vegfrF,P312vegfrR。融合bff312和hv:使用引物PF与HVR扩出bff312,使用引物HVF与P312HV3R扩出hv,使用引物PF与P312HV3R扩出bff312hv。融合bff312和vegfr:使用引物PF与vegfrR扩出bff312,使用引物vegfrF与P312vegfrR扩出vegfr,使用引物PF与P312vegfrR扩出bff312vegfr,bff312hv与hv,bff312vegfr与vegfr之间有30个碱基序列重叠,反应步骤如图7所示,图中F1为扩增bff312的上游引物PF,R1为扩增bff312的下游引物HVR/vegfrR,F2为扩增hv/vegfr的上游引物HVF/vegfrF,R2为扩增hv/vegfr的下游引物P312HV3R/P312vegfrR。获得bff312hv/bff312vegfr。
图8是构建pET-bff312hv示意图,pET-bff312vegfr构建方式与pET-bff312hv相同。其中得到的基因片段bff312hv上,bff312和hv之间设有能特异性识别氨基酸序列的蛋白酶肠激酶酶切位点(DDDDK)编码基因(bff312vegfr基因片段在bff312和vegfr之间同样设有DDDDK编码基因)。bff312hv和bff312vegfr与载体pET-28a用限制性内切酶Xho I和NcoI进行双酶切,37℃酶切8h。
表2-1 重叠PCR引物
下划线处为Nco I和Xho I限制性酶切位点。
表2-2 Over lap-PCR反应体系
表2-3 Over-lap PCR扩增条件
酶切产物切胶回收后连接,16℃连接过夜。转化至E.coli BL21(DE3)感受态细胞,得到重组子BL21/pET-bff312hv和BL21/pET-bff312vegfr。
将重组子接种至LBK培养基中(含50g/mL卡那霉素),37℃,200rpm过夜培养,然后以2%的接种量接种至30mL的LBK培养基中,37℃培养2~3h(OD6000.6~0.9)加IPTG至终浓度为1mM,30℃诱导表达6h。取培养液1mL经过4℃,12000rpm离心10min获得上清为发酵液上清(C),去除上清后的菌泥使用生理盐水重悬后进行超声破碎5min,离心得到的上清为可溶性组分(S),离心后沉淀使用生理盐水重悬为不溶性组分(ib)。
经上述处理,电泳结果如图9a、图9b和图9c所示,根据图9a、图9b的SDS-PAGE图谱灰度扫描获得可溶性组分比例如图9d所示。
水蛭素3(HV3)蛋白序列长为69个氨基酸,分子量为7.4kDa;融合bff截短体bff312后分子量为37.2kDa。结果显示,在大肠杆菌中难以表达的水蛭素HV3融合含信号肽的bff截短体bff312获得了高效的分泌表达,蛋白总表达量大幅增加,并且99%的融合蛋白以可溶形式表达,60%以上的融合蛋白分泌在培养基中,不仅获得了高效的可溶性表达(0.8g/L的可溶性蛋白),而且融合蛋白分泌至培养基,大大简化了蛋白的纯化。
本发明中将上诉融合标签bff312融合于水蛭素N端,通过肠激酶切除标签后获得真实的水蛭素N端,并且融合表达后的大部分蛋白被分泌至胞外不易被水解。其中胞外蛋白经肠激酶切除前后,测得水蛭素生物活性分别为13ATU/mL和21ATU/mL,可见融合截短体bff312后水蛭素3得以正确折叠而具有生物活性。
血管内皮生长因子(VEGF)与其相应的受体(VEGFR家族)是调节血管生成的关键蛋白,而VEGF/VEGFR信号通路抑制剂(如VEGF/VEGFR的抗体)对治疗病理性血管增生有较好疗效。血管内皮生长因子受体2(VEGFR-2)是VEGFR家族中一员,蛋白序列长275个氨基酸,分子量为30.9kDa;融合bff截短bff312后分子量为60.4kDa。单独表达时通常以包涵体形式表达,而融合bff截短体bff312后可溶性蛋白比例高达93%,可溶性蛋白量0.86g/L是单独表达VEGFR-2可溶性蛋白量(0.08g/L)的11倍。
实施例3:新型融合蛋白表达系统的构建以及测试
在本实施例中,四种不同的蛋白被用来测试新型融合蛋白表达系统的效果。
(1)新型融合蛋白表达系统pET-bff350,pET-bff243,pET-bff217和pET-bff209的构建。
图10是本发明所述新型融合标签蛋白重组载体pET-28a-bff209,pET-28a-bff217,pET-28a-bff243,pET-28a-bff350构建示意图。pET-bff350,pET-bff243,pET-bff217和pET-bff209的构建同实施例1中截短体载体构建方法。其中选择的酶切位点为NdeI和Nco I,并且bff350,bff243,bff217和bff209的反向引物上设有编码DDDDK的核苷酸序列,以便以后融合目的蛋白后使用特异性识别DDDDK的肠激酶切除融合标签。以Fru6基因pET-ff53bff为模板为模板,用表3-1中引物PF、P350R、P243R、P217R、P209R分别扩增bff截短体基因片段bff350,bff243,bff 217,bff 209,琼脂糖凝胶电泳验证结果如图11所示,分别在约1065bp,744bp,666bp,和642bp处有明显条带,与各截短体基因长度相符。将基因片段和载体pET-28a经过双酶切后,连接后获得重组质粒pET-bff350,pET-bff243,pET-bff217和pET-bff209。
表3-1 引物序列表
将重组质粒pET-bff350,pET-bff243,pET-bff217和pET-bff209转化至感受态细胞,挑选得到的重组子使用相应的引物进行菌落PCR,PCR产物使用琼脂糖凝胶电泳验证,结果如图12所示,泳道1-12分别为含pET-bff350,pET-bff243,pET-bff217和pET-bff209的三个重组子的菌落PCR验证,与各截短体基因长度相符。菌落PCR验证的重组子经测序后表明目的基因己成功连接于载体。
(2)目的蛋白衣原体外膜蛋白rabOmp,风疹病毒结构蛋白Rub,社区获得性呼吸窘迫综合征毒素Mpe,血管内皮生长因子受体2vegfr-2的可溶性表达。
分别以pET-rabomp,pET-vegfr,pET-mpe,pET-rub为模板,用引物分别扩增基因片段vegfr(Nde I,Xho I),mpe(Nde I,Xho I),rub(Nde I,Xho I),rabomp(Nde I,Xho I),rabomp(Nde I,BamH I)和rabomp(BamH I,Xho I),对应引物序列见表3-1,产物使用琼脂糖凝胶电泳验证,结果见图13,图中泳道1-3为PCR产物vegfr(NdeI,XhoI);泳道4-5为PCR产物mpe(NdeI,XhoI);泳道6-8分别为PCR产物rabomp(NdeI,XhoI),rabomp(NdeI,BamHI)和rabomp(BamHI,XhoI);泳道9为PCR产物rub(NdeI,XhoI)。分别在约825bp,744bp,633bp,648bp,648bp,和648bp处有明显条带,与各目的基因长度相符。
按照上述方法扩增得到的基因片段rabomp(Nde I,Xho I),vegfr(Nde I,Xho I),rub(Nde I,Xho I),mpe(Nde I,Xho I)。将经纯化后的PCR产物及载体pET-bff350,pET-bff243,pET-bff217和pET-bff209用限制性内切酶Xho I和Nde I,37℃酶切8h。
酶切产物切胶回收后连接,16℃连接过夜。转化至E.coli BL21(DE3)感受态细胞,得到重组子BL21/pET-bff209rabomp,BL21/pET-bff217rabomp,BL21/pET-bff243rabomp,BL21/pET-bff350rabomp,BL21/pET-bff209vegfr,BL21/pET-bff217vegfr,BL21/pET-bff243vegfr,BL21/pET-bff350vegfr,BL21/pET-bff209mpe,BL21/pET-bff217mpe,BL21/pET-bff243mpe,BL21/pET-bff350mpe,BL21/pET-bff209rub,BL21/pET-bff217rub,BL21/pET-bff243rub,BL21/pET-bff350rub。
经测序验证正确后,将重组子接种于含50g/mL卡那霉素的LB液体培养基中,37℃,200rpm过夜培养,然后接种至新鲜的含50g/mL卡那霉素的LB培养基中,接种量为2%,37℃培养2~3h(OD6000.6~0.9),加入异丙基硫代半乳糖苷(IPTG)至终浓度为1mM,30℃诱导表达6h。取培养液1mL经过4℃,12000rpm离心10min获得上清为发酵液上清(C),去除上清后的菌泥使用生理盐水重悬后进行超声破碎5min,离心得到的上清为可溶性组分(S),离心后沉淀使用生理盐水重悬为不溶性组分(ib)。
利用SDS-PAGE蛋白凝胶电泳分析各融合蛋白的表达情况,结果如图14a-h所示。融合bff截短体bff209,bff217,bff243,bff350后,融合蛋白成熟肽分子量分别为41.8,42.6,45.5,57.5kDa。为探究最佳长度的bff截短体融合标签,将不同长度的bff截短体标签bff209,bff217,bff243,bff350分别融合rabOmp,vegfr-2,mpe,rub进行表达,其可溶性表达量显著增加,不同截短体融合标签应用于rabOmp,vegfr-2,mpe,rub的总结见表3-2。
表3-2 截短体标签用于蛋白表达总结
来源于耐有机糖苷酶Fru6的截短体标签对一些通常以包涵体形式表达的蛋白有显著的增溶左右,并对一些短肽类蛋白可能具有帮助其分泌至培养基的作用。标签的长度可能会带来效果的差异,对于不同的目的蛋白,融合标签可能会不同。本发明所述的新型融合标签蛋白bff对在大肠杆菌中表达异源蛋白及后续纯化有着重要意义。
序列表
<110> 南京工业大学
<120> 一种融合标签蛋白
<130> xb16051801
<160> 12
<170> PatentIn version 3.5
<210> 1
<211> 1647
<212> DNA
<213> Artificial
<220>
<223> Fru6
<220>
<221> CDS
<222> (1)..(1647)
<400> 1
atg gag aga gcg tgt gtc gcg gtc cga gag att gtc cga ttc cat atc 48
Met Glu Arg Ala Cys Val Ala Val Arg Glu Ile Val Arg Phe His Ile
1 5 10 15
gaa caa agg caa aca atc gtg aac aag caa cgc acc aaa cgc ggc att 96
Glu Gln Arg Gln Thr Ile Val Asn Lys Gln Arg Thr Lys Arg Gly Ile
20 25 30
ctt gcc gca gca ttg agc atc gga gcg ctc gga gca act ttg att tcg 144
Leu Ala Ala Ala Leu Ser Ile Gly Ala Leu Gly Ala Thr Leu Ile Ser
35 40 45
ggg cca gcc atg gct gcc acc gaa cca gtg cct ggc ttc ccc cag cca 192
Gly Pro Ala Met Ala Ala Thr Glu Pro Val Pro Gly Phe Pro Gln Pro
50 55 60
acc gag cac acc cag aag gca tac agc ccc acc gac gat ttc acc tcg 240
Thr Glu His Thr Gln Lys Ala Tyr Ser Pro Thr Asp Asp Phe Thr Ser
65 70 75 80
cgt tgg acc cgt gct gac gca aag cag atc aag gcc atg tcc gac ccg 288
Arg Trp Thr Arg Ala Asp Ala Lys Gln Ile Lys Ala Met Ser Asp Pro
85 90 95
aac gcg gga tcg cgt gaa aac tcc atg ccc aag gag tac acc atg cct 336
Asn Ala Gly Ser Arg Glu Asn Ser Met Pro Lys Glu Tyr Thr Met Pro
100 105 110
acc gtg ccc cag gat ttc cct gac atg agc aac gag gaa gtc tgg gtt 384
Thr Val Pro Gln Asp Phe Pro Asp Met Ser Asn Glu Glu Val Trp Val
115 120 125
tgg gat act tgg cca ctg act gac gag cac gcc aac cag tac agc gtt 432
Trp Asp Thr Trp Pro Leu Thr Asp Glu His Ala Asn Gln Tyr Ser Val
130 135 140
aat ggc cag gag atc atc ttc tcg ctg gtt gcc gat cgc gac ctt ggc 480
Asn Gly Gln Glu Ile Ile Phe Ser Leu Val Ala Asp Arg Asp Leu Gly
145 150 155 160
ttc gat gag cgc cac cag tac gca cgc att gga tac ttc tac cgt cca 528
Phe Asp Glu Arg His Gln Tyr Ala Arg Ile Gly Tyr Phe Tyr Arg Pro
165 170 175
gca ggc gtt cct gcc gat gag cgt cct gaa gat ggt ggc tgg acc tat 576
Ala Gly Val Pro Ala Asp Glu Arg Pro Glu Asp Gly Gly Trp Thr Tyr
180 185 190
ggc ggc cag gtc ttc gat gag ggc gtc acc ggc aag atc ttc gag gac 624
Gly Gly Gln Val Phe Asp Glu Gly Val Thr Gly Lys Ile Phe Glu Asp
195 200 205
cag tcc ttc agc cac cag act cag tgg tcc ggt tcg gcg cga gtg acc 672
Gln Ser Phe Ser His Gln Thr Gln Trp Ser Gly Ser Ala Arg Val Thr
210 215 220
aaa aat ggc gaa atc aag ctg ttc ttc acc gac gtt gcg ttc tac cgc 720
Lys Asn Gly Glu Ile Lys Leu Phe Phe Thr Asp Val Ala Phe Tyr Arg
225 230 235 240
gac aag gac ggc aag gac atc aag cca tac gac ccg cgc atc gca ttg 768
Asp Lys Asp Gly Lys Asp Ile Lys Pro Tyr Asp Pro Arg Ile Ala Leu
245 250 255
agc gtt ggc cac gtc cac tcg aac aag aat ggc gtg aag ctc acc ggc 816
Ser Val Gly His Val His Ser Asn Lys Asn Gly Val Lys Leu Thr Gly
260 265 270
ttt gac aag gtc acc gac ctg ctg cag gca gac ggc aag tac tac cag 864
Phe Asp Lys Val Thr Asp Leu Leu Gln Ala Asp Gly Lys Tyr Tyr Gln
275 280 285
acc gct gag cag aac tcg tac ttc aac ttc cgt gat cca ttc acc ttt 912
Thr Ala Glu Gln Asn Ser Tyr Phe Asn Phe Arg Asp Pro Phe Thr Phe
290 295 300
gaa gat cca gcg cat cca ggc gaa acc tac atg gtt ttc gag ggc aac 960
Glu Asp Pro Ala His Pro Gly Glu Thr Tyr Met Val Phe Glu Gly Asn
305 310 315 320
acc gct cag aag cgc gat gaa gcc aag tgc acc tcc gag gac ctt ggc 1008
Thr Ala Gln Lys Arg Asp Glu Ala Lys Cys Thr Ser Glu Asp Leu Gly
325 330 335
tac cgc aag ggt gaa acc aac ggc gaa acc gtg gat gag gtc aat aaa 1056
Tyr Arg Lys Gly Glu Thr Asn Gly Glu Thr Val Asp Glu Val Asn Lys
340 345 350
tcc ggc gca acc tac cag atc ggc aac gtc ggc ctg gcc cgt gcc aag 1104
Ser Gly Ala Thr Tyr Gln Ile Gly Asn Val Gly Leu Ala Arg Ala Lys
355 360 365
aac aag gag ctg acc gag tgg gag ttc ctg ccg ccg atc ctg tct gct 1152
Asn Lys Glu Leu Thr Glu Trp Glu Phe Leu Pro Pro Ile Leu Ser Ala
370 375 380
aac tgc gtc acc gac cag act gaa cgt ccg cag atc tac atg cag gat 1200
Asn Cys Val Thr Asp Gln Thr Glu Arg Pro Gln Ile Tyr Met Gln Asp
385 390 395 400
ggc aag tac tac ctg ttc acc att agc cac cgc tcc acc ttc gcc gct 1248
Gly Lys Tyr Tyr Leu Phe Thr Ile Ser His Arg Ser Thr Phe Ala Ala
405 410 415
ggc atc gac ggc cca gaa ggc gtc tat ggc ttc gtc ggc aac ggc atc 1296
Gly Ile Asp Gly Pro Glu Gly Val Tyr Gly Phe Val Gly Asn Gly Ile
420 425 430
cgc agc gat tac cag ccg ctg aac cgt ggc tcg ggc ctg gca ttg ggc 1344
Arg Ser Asp Tyr Gln Pro Leu Asn Arg Gly Ser Gly Leu Ala Leu Gly
435 440 445
agc cca tcg aac ctg aac ttt gcc gca ggc agc cca ttc gct cct gac 1392
Ser Pro Ser Asn Leu Asn Phe Ala Ala Gly Ser Pro Phe Ala Pro Asp
450 455 460
tac aat cag cac ccg gga caa ttc cag gca tac tcg cac tac gtc atg 1440
Tyr Asn Gln His Pro Gly Gln Phe Gln Ala Tyr Ser His Tyr Val Met
465 470 475 480
cct ggt ggt ctg gtg cag tcc ttc att aac acc atc ggc acc aag gag 1488
Pro Gly Gly Leu Val Gln Ser Phe Ile Asn Thr Ile Gly Thr Lys Glu
485 490 495
gac ttc gtt cgc ggt ggc aca ctg ggc cca acc gtg aag ttg gac atc 1536
Asp Phe Val Arg Gly Gly Thr Leu Gly Pro Thr Val Lys Leu Asp Ile
500 505 510
aag ggc gac tcg gca acg gtc gac tac aac tat ggc gat aac ggt ctc 1584
Lys Gly Asp Ser Ala Thr Val Asp Tyr Asn Tyr Gly Asp Asn Gly Leu
515 520 525
ggc gca tgg gca gat att cct gcc aac cgc gag ctg aag aat ggc aaa 1632
Gly Ala Trp Ala Asp Ile Pro Ala Asn Arg Glu Leu Lys Asn Gly Lys
530 535 540
gca gta gca aag taa 1647
Ala Val Ala Lys
545
<210> 2
<211> 548
<212> PRT
<213> Artificial
<220>
<223> Synthetic Construct
<400> 2
Met Glu Arg Ala Cys Val Ala Val Arg Glu Ile Val Arg Phe His Ile
1 5 10 15
Glu Gln Arg Gln Thr Ile Val Asn Lys Gln Arg Thr Lys Arg Gly Ile
20 25 30
Leu Ala Ala Ala Leu Ser Ile Gly Ala Leu Gly Ala Thr Leu Ile Ser
35 40 45
Gly Pro Ala Met Ala Ala Thr Glu Pro Val Pro Gly Phe Pro Gln Pro
50 55 60
Thr Glu His Thr Gln Lys Ala Tyr Ser Pro Thr Asp Asp Phe Thr Ser
65 70 75 80
Arg Trp Thr Arg Ala Asp Ala Lys Gln Ile Lys Ala Met Ser Asp Pro
85 90 95
Asn Ala Gly Ser Arg Glu Asn Ser Met Pro Lys Glu Tyr Thr Met Pro
100 105 110
Thr Val Pro Gln Asp Phe Pro Asp Met Ser Asn Glu Glu Val Trp Val
115 120 125
Trp Asp Thr Trp Pro Leu Thr Asp Glu His Ala Asn Gln Tyr Ser Val
130 135 140
Asn Gly Gln Glu Ile Ile Phe Ser Leu Val Ala Asp Arg Asp Leu Gly
145 150 155 160
Phe Asp Glu Arg His Gln Tyr Ala Arg Ile Gly Tyr Phe Tyr Arg Pro
165 170 175
Ala Gly Val Pro Ala Asp Glu Arg Pro Glu Asp Gly Gly Trp Thr Tyr
180 185 190
Gly Gly Gln Val Phe Asp Glu Gly Val Thr Gly Lys Ile Phe Glu Asp
195 200 205
Gln Ser Phe Ser His Gln Thr Gln Trp Ser Gly Ser Ala Arg Val Thr
210 215 220
Lys Asn Gly Glu Ile Lys Leu Phe Phe Thr Asp Val Ala Phe Tyr Arg
225 230 235 240
Asp Lys Asp Gly Lys Asp Ile Lys Pro Tyr Asp Pro Arg Ile Ala Leu
245 250 255
Ser Val Gly His Val His Ser Asn Lys Asn Gly Val Lys Leu Thr Gly
260 265 270
Phe Asp Lys Val Thr Asp Leu Leu Gln Ala Asp Gly Lys Tyr Tyr Gln
275 280 285
Thr Ala Glu Gln Asn Ser Tyr Phe Asn Phe Arg Asp Pro Phe Thr Phe
290 295 300
Glu Asp Pro Ala His Pro Gly Glu Thr Tyr Met Val Phe Glu Gly Asn
305 310 315 320
Thr Ala Gln Lys Arg Asp Glu Ala Lys Cys Thr Ser Glu Asp Leu Gly
325 330 335
Tyr Arg Lys Gly Glu Thr Asn Gly Glu Thr Val Asp Glu Val Asn Lys
340 345 350
Ser Gly Ala Thr Tyr Gln Ile Gly Asn Val Gly Leu Ala Arg Ala Lys
355 360 365
Asn Lys Glu Leu Thr Glu Trp Glu Phe Leu Pro Pro Ile Leu Ser Ala
370 375 380
Asn Cys Val Thr Asp Gln Thr Glu Arg Pro Gln Ile Tyr Met Gln Asp
385 390 395 400
Gly Lys Tyr Tyr Leu Phe Thr Ile Ser His Arg Ser Thr Phe Ala Ala
405 410 415
Gly Ile Asp Gly Pro Glu Gly Val Tyr Gly Phe Val Gly Asn Gly Ile
420 425 430
Arg Ser Asp Tyr Gln Pro Leu Asn Arg Gly Ser Gly Leu Ala Leu Gly
435 440 445
Ser Pro Ser Asn Leu Asn Phe Ala Ala Gly Ser Pro Phe Ala Pro Asp
450 455 460
Tyr Asn Gln His Pro Gly Gln Phe Gln Ala Tyr Ser His Tyr Val Met
465 470 475 480
Pro Gly Gly Leu Val Gln Ser Phe Ile Asn Thr Ile Gly Thr Lys Glu
485 490 495
Asp Phe Val Arg Gly Gly Thr Leu Gly Pro Thr Val Lys Leu Asp Ile
500 505 510
Lys Gly Asp Ser Ala Thr Val Asp Tyr Asn Tyr Gly Asp Asn Gly Leu
515 520 525
Gly Ala Trp Ala Asp Ile Pro Ala Asn Arg Glu Leu Lys Asn Gly Lys
530 535 540
Ala Val Ala Lys
545
<210> 3
<211> 627
<212> DNA
<213> Artificial
<220>
<223> bff209
<220>
<221> CDS
<222> (1)..(627)
<400> 3
atg gag aga gcg tgt gtc gcg gtc cga gag att gtc cga ttc cat atc 48
Met Glu Arg Ala Cys Val Ala Val Arg Glu Ile Val Arg Phe His Ile
1 5 10 15
gaa caa agg caa aca atc gtg aac aag caa cgc acc aaa cgc ggc att 96
Glu Gln Arg Gln Thr Ile Val Asn Lys Gln Arg Thr Lys Arg Gly Ile
20 25 30
ctt gcc gca gca ttg agc atc gga gcg ctc gga gca act ttg att tcg 144
Leu Ala Ala Ala Leu Ser Ile Gly Ala Leu Gly Ala Thr Leu Ile Ser
35 40 45
ggg cca gcc atg gct gcc acc gaa cca gtg cct ggc ttc ccc cag cca 192
Gly Pro Ala Met Ala Ala Thr Glu Pro Val Pro Gly Phe Pro Gln Pro
50 55 60
acc gag cac acc cag aag gca tac agc ccc acc gac gat ttc acc tcg 240
Thr Glu His Thr Gln Lys Ala Tyr Ser Pro Thr Asp Asp Phe Thr Ser
65 70 75 80
cgt tgg acc cgt gct gac gca aag cag atc aag gcc atg tcc gac ccg 288
Arg Trp Thr Arg Ala Asp Ala Lys Gln Ile Lys Ala Met Ser Asp Pro
85 90 95
aac gcg gga tcg cgt gaa aac tcc atg ccc aag gag tac acc atg cct 336
Asn Ala Gly Ser Arg Glu Asn Ser Met Pro Lys Glu Tyr Thr Met Pro
100 105 110
acc gtg ccc cag gat ttc cct gac atg agc aac gag gaa gtc tgg gtt 384
Thr Val Pro Gln Asp Phe Pro Asp Met Ser Asn Glu Glu Val Trp Val
115 120 125
tgg gat act tgg cca ctg act gac gag cac gcc aac cag tac agc gtt 432
Trp Asp Thr Trp Pro Leu Thr Asp Glu His Ala Asn Gln Tyr Ser Val
130 135 140
aat ggc cag gag atc atc ttc tcg ctg gtt gcc gat cgc gac ctt ggc 480
Asn Gly Gln Glu Ile Ile Phe Ser Leu Val Ala Asp Arg Asp Leu Gly
145 150 155 160
ttc gat gag cgc cac cag tac gca cgc att gga tac ttc tac cgt cca 528
Phe Asp Glu Arg His Gln Tyr Ala Arg Ile Gly Tyr Phe Tyr Arg Pro
165 170 175
gca ggc gtt cct gcc gat gag cgt cct gaa gat ggt ggc tgg acc tat 576
Ala Gly Val Pro Ala Asp Glu Arg Pro Glu Asp Gly Gly Trp Thr Tyr
180 185 190
ggc ggc cag gtc ttc gat gag ggc gtc acc ggc aag atc ttc gag gac 624
Gly Gly Gln Val Phe Asp Glu Gly Val Thr Gly Lys Ile Phe Glu Asp
195 200 205
cag 627
Gln
<210> 4
<211> 209
<212> PRT
<213> Artificial
<220>
<223> Synthetic Construct
<400> 4
Met Glu Arg Ala Cys Val Ala Val Arg Glu Ile Val Arg Phe His Ile
1 5 10 15
Glu Gln Arg Gln Thr Ile Val Asn Lys Gln Arg Thr Lys Arg Gly Ile
20 25 30
Leu Ala Ala Ala Leu Ser Ile Gly Ala Leu Gly Ala Thr Leu Ile Ser
35 40 45
Gly Pro Ala Met Ala Ala Thr Glu Pro Val Pro Gly Phe Pro Gln Pro
50 55 60
Thr Glu His Thr Gln Lys Ala Tyr Ser Pro Thr Asp Asp Phe Thr Ser
65 70 75 80
Arg Trp Thr Arg Ala Asp Ala Lys Gln Ile Lys Ala Met Ser Asp Pro
85 90 95
Asn Ala Gly Ser Arg Glu Asn Ser Met Pro Lys Glu Tyr Thr Met Pro
100 105 110
Thr Val Pro Gln Asp Phe Pro Asp Met Ser Asn Glu Glu Val Trp Val
115 120 125
Trp Asp Thr Trp Pro Leu Thr Asp Glu His Ala Asn Gln Tyr Ser Val
130 135 140
Asn Gly Gln Glu Ile Ile Phe Ser Leu Val Ala Asp Arg Asp Leu Gly
145 150 155 160
Phe Asp Glu Arg His Gln Tyr Ala Arg Ile Gly Tyr Phe Tyr Arg Pro
165 170 175
Ala Gly Val Pro Ala Asp Glu Arg Pro Glu Asp Gly Gly Trp Thr Tyr
180 185 190
Gly Gly Gln Val Phe Asp Glu Gly Val Thr Gly Lys Ile Phe Glu Asp
195 200 205
Gln
<210> 5
<211> 651
<212> DNA
<213> Artificial
<220>
<223> bff217
<220>
<221> CDS
<222> (1)..(651)
<400> 5
atg gag aga gcg tgt gtc gcg gtc cga gag att gtc cga ttc cat atc 48
Met Glu Arg Ala Cys Val Ala Val Arg Glu Ile Val Arg Phe His Ile
1 5 10 15
gaa caa agg caa aca atc gtg aac aag caa cgc acc aaa cgc ggc att 96
Glu Gln Arg Gln Thr Ile Val Asn Lys Gln Arg Thr Lys Arg Gly Ile
20 25 30
ctt gcc gca gca ttg agc atc gga gcg ctc gga gca act ttg att tcg 144
Leu Ala Ala Ala Leu Ser Ile Gly Ala Leu Gly Ala Thr Leu Ile Ser
35 40 45
ggg cca gcc atg gct gcc acc gaa cca gtg cct ggc ttc ccc cag cca 192
Gly Pro Ala Met Ala Ala Thr Glu Pro Val Pro Gly Phe Pro Gln Pro
50 55 60
acc gag cac acc cag aag gca tac agc ccc acc gac gat ttc acc tcg 240
Thr Glu His Thr Gln Lys Ala Tyr Ser Pro Thr Asp Asp Phe Thr Ser
65 70 75 80
cgt tgg acc cgt gct gac gca aag cag atc aag gcc atg tcc gac ccg 288
Arg Trp Thr Arg Ala Asp Ala Lys Gln Ile Lys Ala Met Ser Asp Pro
85 90 95
aac gcg gga tcg cgt gaa aac tcc atg ccc aag gag tac acc atg cct 336
Asn Ala Gly Ser Arg Glu Asn Ser Met Pro Lys Glu Tyr Thr Met Pro
100 105 110
acc gtg ccc cag gat ttc cct gac atg agc aac gag gaa gtc tgg gtt 384
Thr Val Pro Gln Asp Phe Pro Asp Met Ser Asn Glu Glu Val Trp Val
115 120 125
tgg gat act tgg cca ctg act gac gag cac gcc aac cag tac agc gtt 432
Trp Asp Thr Trp Pro Leu Thr Asp Glu His Ala Asn Gln Tyr Ser Val
130 135 140
aat ggc cag gag atc atc ttc tcg ctg gtt gcc gat cgc gac ctt ggc 480
Asn Gly Gln Glu Ile Ile Phe Ser Leu Val Ala Asp Arg Asp Leu Gly
145 150 155 160
ttc gat gag cgc cac cag tac gca cgc att gga tac ttc tac cgt cca 528
Phe Asp Glu Arg His Gln Tyr Ala Arg Ile Gly Tyr Phe Tyr Arg Pro
165 170 175
gca ggc gtt cct gcc gat gag cgt cct gaa gat ggt ggc tgg acc tat 576
Ala Gly Val Pro Ala Asp Glu Arg Pro Glu Asp Gly Gly Trp Thr Tyr
180 185 190
ggc ggc cag gtc ttc gat gag ggc gtc acc ggc aag atc ttc gag gac 624
Gly Gly Gln Val Phe Asp Glu Gly Val Thr Gly Lys Ile Phe Glu Asp
195 200 205
cag tcc ttc agc cac cag act cag tgg 651
Gln Ser Phe Ser His Gln Thr Gln Trp
210 215
<210> 6
<211> 217
<212> PRT
<213> Artificial
<220>
<223> Synthetic Construct
<400> 6
Met Glu Arg Ala Cys Val Ala Val Arg Glu Ile Val Arg Phe His Ile
1 5 10 15
Glu Gln Arg Gln Thr Ile Val Asn Lys Gln Arg Thr Lys Arg Gly Ile
20 25 30
Leu Ala Ala Ala Leu Ser Ile Gly Ala Leu Gly Ala Thr Leu Ile Ser
35 40 45
Gly Pro Ala Met Ala Ala Thr Glu Pro Val Pro Gly Phe Pro Gln Pro
50 55 60
Thr Glu His Thr Gln Lys Ala Tyr Ser Pro Thr Asp Asp Phe Thr Ser
65 70 75 80
Arg Trp Thr Arg Ala Asp Ala Lys Gln Ile Lys Ala Met Ser Asp Pro
85 90 95
Asn Ala Gly Ser Arg Glu Asn Ser Met Pro Lys Glu Tyr Thr Met Pro
100 105 110
Thr Val Pro Gln Asp Phe Pro Asp Met Ser Asn Glu Glu Val Trp Val
115 120 125
Trp Asp Thr Trp Pro Leu Thr Asp Glu His Ala Asn Gln Tyr Ser Val
130 135 140
Asn Gly Gln Glu Ile Ile Phe Ser Leu Val Ala Asp Arg Asp Leu Gly
145 150 155 160
Phe Asp Glu Arg His Gln Tyr Ala Arg Ile Gly Tyr Phe Tyr Arg Pro
165 170 175
Ala Gly Val Pro Ala Asp Glu Arg Pro Glu Asp Gly Gly Trp Thr Tyr
180 185 190
Gly Gly Gln Val Phe Asp Glu Gly Val Thr Gly Lys Ile Phe Glu Asp
195 200 205
Gln Ser Phe Ser His Gln Thr Gln Trp
210 215
<210> 7
<211> 729
<212> DNA
<213> Artificial
<220>
<223> bff243
<220>
<221> CDS
<222> (1)..(729)
<400> 7
atg gag aga gcg tgt gtc gcg gtc cga gag att gtc cga ttc cat atc 48
Met Glu Arg Ala Cys Val Ala Val Arg Glu Ile Val Arg Phe His Ile
1 5 10 15
gaa caa agg caa aca atc gtg aac aag caa cgc acc aaa cgc ggc att 96
Glu Gln Arg Gln Thr Ile Val Asn Lys Gln Arg Thr Lys Arg Gly Ile
20 25 30
ctt gcc gca gca ttg agc atc gga gcg ctc gga gca act ttg att tcg 144
Leu Ala Ala Ala Leu Ser Ile Gly Ala Leu Gly Ala Thr Leu Ile Ser
35 40 45
ggg cca gcc atg gct gcc acc gaa cca gtg cct ggc ttc ccc cag cca 192
Gly Pro Ala Met Ala Ala Thr Glu Pro Val Pro Gly Phe Pro Gln Pro
50 55 60
acc gag cac acc cag aag gca tac agc ccc acc gac gat ttc acc tcg 240
Thr Glu His Thr Gln Lys Ala Tyr Ser Pro Thr Asp Asp Phe Thr Ser
65 70 75 80
cgt tgg acc cgt gct gac gca aag cag atc aag gcc atg tcc gac ccg 288
Arg Trp Thr Arg Ala Asp Ala Lys Gln Ile Lys Ala Met Ser Asp Pro
85 90 95
aac gcg gga tcg cgt gaa aac tcc atg ccc aag gag tac acc atg cct 336
Asn Ala Gly Ser Arg Glu Asn Ser Met Pro Lys Glu Tyr Thr Met Pro
100 105 110
acc gtg ccc cag gat ttc cct gac atg agc aac gag gaa gtc tgg gtt 384
Thr Val Pro Gln Asp Phe Pro Asp Met Ser Asn Glu Glu Val Trp Val
115 120 125
tgg gat act tgg cca ctg act gac gag cac gcc aac cag tac agc gtt 432
Trp Asp Thr Trp Pro Leu Thr Asp Glu His Ala Asn Gln Tyr Ser Val
130 135 140
aat ggc cag gag atc atc ttc tcg ctg gtt gcc gat cgc gac ctt ggc 480
Asn Gly Gln Glu Ile Ile Phe Ser Leu Val Ala Asp Arg Asp Leu Gly
145 150 155 160
ttc gat gag cgc cac cag tac gca cgc att gga tac ttc tac cgt cca 528
Phe Asp Glu Arg His Gln Tyr Ala Arg Ile Gly Tyr Phe Tyr Arg Pro
165 170 175
gca ggc gtt cct gcc gat gag cgt cct gaa gat ggt ggc tgg acc tat 576
Ala Gly Val Pro Ala Asp Glu Arg Pro Glu Asp Gly Gly Trp Thr Tyr
180 185 190
ggc ggc cag gtc ttc gat gag ggc gtc acc ggc aag atc ttc gag gac 624
Gly Gly Gln Val Phe Asp Glu Gly Val Thr Gly Lys Ile Phe Glu Asp
195 200 205
cag tcc ttc agc cac cag act cag tgg tcc ggt tcg gcg cga gtg acc 672
Gln Ser Phe Ser His Gln Thr Gln Trp Ser Gly Ser Ala Arg Val Thr
210 215 220
aaa aat ggc gaa atc aag ctg ttc ttc acc gac gtt gcg ttc tac cgc 720
Lys Asn Gly Glu Ile Lys Leu Phe Phe Thr Asp Val Ala Phe Tyr Arg
225 230 235 240
gac aag gac 729
Asp Lys Asp
<210> 8
<211> 243
<212> PRT
<213> Artificial
<220>
<223> Synthetic Construct
<400> 8
Met Glu Arg Ala Cys Val Ala Val Arg Glu Ile Val Arg Phe His Ile
1 5 10 15
Glu Gln Arg Gln Thr Ile Val Asn Lys Gln Arg Thr Lys Arg Gly Ile
20 25 30
Leu Ala Ala Ala Leu Ser Ile Gly Ala Leu Gly Ala Thr Leu Ile Ser
35 40 45
Gly Pro Ala Met Ala Ala Thr Glu Pro Val Pro Gly Phe Pro Gln Pro
50 55 60
Thr Glu His Thr Gln Lys Ala Tyr Ser Pro Thr Asp Asp Phe Thr Ser
65 70 75 80
Arg Trp Thr Arg Ala Asp Ala Lys Gln Ile Lys Ala Met Ser Asp Pro
85 90 95
Asn Ala Gly Ser Arg Glu Asn Ser Met Pro Lys Glu Tyr Thr Met Pro
100 105 110
Thr Val Pro Gln Asp Phe Pro Asp Met Ser Asn Glu Glu Val Trp Val
115 120 125
Trp Asp Thr Trp Pro Leu Thr Asp Glu His Ala Asn Gln Tyr Ser Val
130 135 140
Asn Gly Gln Glu Ile Ile Phe Ser Leu Val Ala Asp Arg Asp Leu Gly
145 150 155 160
Phe Asp Glu Arg His Gln Tyr Ala Arg Ile Gly Tyr Phe Tyr Arg Pro
165 170 175
Ala Gly Val Pro Ala Asp Glu Arg Pro Glu Asp Gly Gly Trp Thr Tyr
180 185 190
Gly Gly Gln Val Phe Asp Glu Gly Val Thr Gly Lys Ile Phe Glu Asp
195 200 205
Gln Ser Phe Ser His Gln Thr Gln Trp Ser Gly Ser Ala Arg Val Thr
210 215 220
Lys Asn Gly Glu Ile Lys Leu Phe Phe Thr Asp Val Ala Phe Tyr Arg
225 230 235 240
Asp Lys Asp
<210> 9
<211> 936
<212> DNA
<213> Artificial
<220>
<223> bff312
<220>
<221> CDS
<222> (1)..(936)
<400> 9
atg gag aga gcg tgt gtc gcg gtc cga gag att gtc cga ttc cat atc 48
Met Glu Arg Ala Cys Val Ala Val Arg Glu Ile Val Arg Phe His Ile
1 5 10 15
gaa caa agg caa aca atc gtg aac aag caa cgc acc aaa cgc ggc att 96
Glu Gln Arg Gln Thr Ile Val Asn Lys Gln Arg Thr Lys Arg Gly Ile
20 25 30
ctt gcc gca gca ttg agc atc gga gcg ctc gga gca act ttg att tcg 144
Leu Ala Ala Ala Leu Ser Ile Gly Ala Leu Gly Ala Thr Leu Ile Ser
35 40 45
ggg cca gcc atg gct gcc acc gaa cca gtg cct ggc ttc ccc cag cca 192
Gly Pro Ala Met Ala Ala Thr Glu Pro Val Pro Gly Phe Pro Gln Pro
50 55 60
acc gag cac acc cag aag gca tac agc ccc acc gac gat ttc acc tcg 240
Thr Glu His Thr Gln Lys Ala Tyr Ser Pro Thr Asp Asp Phe Thr Ser
65 70 75 80
cgt tgg acc cgt gct gac gca aag cag atc aag gcc atg tcc gac ccg 288
Arg Trp Thr Arg Ala Asp Ala Lys Gln Ile Lys Ala Met Ser Asp Pro
85 90 95
aac gcg gga tcg cgt gaa aac tcc atg ccc aag gag tac acc atg cct 336
Asn Ala Gly Ser Arg Glu Asn Ser Met Pro Lys Glu Tyr Thr Met Pro
100 105 110
acc gtg ccc cag gat ttc cct gac atg agc aac gag gaa gtc tgg gtt 384
Thr Val Pro Gln Asp Phe Pro Asp Met Ser Asn Glu Glu Val Trp Val
115 120 125
tgg gat act tgg cca ctg act gac gag cac gcc aac cag tac agc gtt 432
Trp Asp Thr Trp Pro Leu Thr Asp Glu His Ala Asn Gln Tyr Ser Val
130 135 140
aat ggc cag gag atc atc ttc tcg ctg gtt gcc gat cgc gac ctt ggc 480
Asn Gly Gln Glu Ile Ile Phe Ser Leu Val Ala Asp Arg Asp Leu Gly
145 150 155 160
ttc gat gag cgc cac cag tac gca cgc att gga tac ttc tac cgt cca 528
Phe Asp Glu Arg His Gln Tyr Ala Arg Ile Gly Tyr Phe Tyr Arg Pro
165 170 175
gca ggc gtt cct gcc gat gag cgt cct gaa gat ggt ggc tgg acc tat 576
Ala Gly Val Pro Ala Asp Glu Arg Pro Glu Asp Gly Gly Trp Thr Tyr
180 185 190
ggc ggc cag gtc ttc gat gag ggc gtc acc ggc aag atc ttc gag gac 624
Gly Gly Gln Val Phe Asp Glu Gly Val Thr Gly Lys Ile Phe Glu Asp
195 200 205
cag tcc ttc agc cac cag act cag tgg tcc ggt tcg gcg cga gtg acc 672
Gln Ser Phe Ser His Gln Thr Gln Trp Ser Gly Ser Ala Arg Val Thr
210 215 220
aaa aat ggc gaa atc aag ctg ttc ttc acc gac gtt gcg ttc tac cgc 720
Lys Asn Gly Glu Ile Lys Leu Phe Phe Thr Asp Val Ala Phe Tyr Arg
225 230 235 240
gac aag gac ggc aag gac atc aag cca tac gac ccg cgc atc gca ttg 768
Asp Lys Asp Gly Lys Asp Ile Lys Pro Tyr Asp Pro Arg Ile Ala Leu
245 250 255
agc gtt ggc cac gtc cac tcg aac aag aat ggc gtg aag ctc acc ggc 816
Ser Val Gly His Val His Ser Asn Lys Asn Gly Val Lys Leu Thr Gly
260 265 270
ttt gac aag gtc acc gac ctg ctg cag gca gac ggc aag tac tac cag 864
Phe Asp Lys Val Thr Asp Leu Leu Gln Ala Asp Gly Lys Tyr Tyr Gln
275 280 285
acc gct gag cag aac tcg tac ttc aac ttc cgt gat cca ttc acc ttt 912
Thr Ala Glu Gln Asn Ser Tyr Phe Asn Phe Arg Asp Pro Phe Thr Phe
290 295 300
gaa gat cca gcg cat cca ggc gaa 936
Glu Asp Pro Ala His Pro Gly Glu
305 310
<210> 10
<211> 312
<212> PRT
<213> Artificial
<220>
<223> Synthetic Construct
<400> 10
Met Glu Arg Ala Cys Val Ala Val Arg Glu Ile Val Arg Phe His Ile
1 5 10 15
Glu Gln Arg Gln Thr Ile Val Asn Lys Gln Arg Thr Lys Arg Gly Ile
20 25 30
Leu Ala Ala Ala Leu Ser Ile Gly Ala Leu Gly Ala Thr Leu Ile Ser
35 40 45
Gly Pro Ala Met Ala Ala Thr Glu Pro Val Pro Gly Phe Pro Gln Pro
50 55 60
Thr Glu His Thr Gln Lys Ala Tyr Ser Pro Thr Asp Asp Phe Thr Ser
65 70 75 80
Arg Trp Thr Arg Ala Asp Ala Lys Gln Ile Lys Ala Met Ser Asp Pro
85 90 95
Asn Ala Gly Ser Arg Glu Asn Ser Met Pro Lys Glu Tyr Thr Met Pro
100 105 110
Thr Val Pro Gln Asp Phe Pro Asp Met Ser Asn Glu Glu Val Trp Val
115 120 125
Trp Asp Thr Trp Pro Leu Thr Asp Glu His Ala Asn Gln Tyr Ser Val
130 135 140
Asn Gly Gln Glu Ile Ile Phe Ser Leu Val Ala Asp Arg Asp Leu Gly
145 150 155 160
Phe Asp Glu Arg His Gln Tyr Ala Arg Ile Gly Tyr Phe Tyr Arg Pro
165 170 175
Ala Gly Val Pro Ala Asp Glu Arg Pro Glu Asp Gly Gly Trp Thr Tyr
180 185 190
Gly Gly Gln Val Phe Asp Glu Gly Val Thr Gly Lys Ile Phe Glu Asp
195 200 205
Gln Ser Phe Ser His Gln Thr Gln Trp Ser Gly Ser Ala Arg Val Thr
210 215 220
Lys Asn Gly Glu Ile Lys Leu Phe Phe Thr Asp Val Ala Phe Tyr Arg
225 230 235 240
Asp Lys Asp Gly Lys Asp Ile Lys Pro Tyr Asp Pro Arg Ile Ala Leu
245 250 255
Ser Val Gly His Val His Ser Asn Lys Asn Gly Val Lys Leu Thr Gly
260 265 270
Phe Asp Lys Val Thr Asp Leu Leu Gln Ala Asp Gly Lys Tyr Tyr Gln
275 280 285
Thr Ala Glu Gln Asn Ser Tyr Phe Asn Phe Arg Asp Pro Phe Thr Phe
290 295 300
Glu Asp Pro Ala His Pro Gly Glu
305 310
<210> 11
<211> 1050
<212> DNA
<213> Artificial
<220>
<223> bff350
<220>
<221> CDS
<222> (1)..(1050)
<400> 11
atg gag aga gcg tgt gtc gcg gtc cga gag att gtc cga ttc cat atc 48
Met Glu Arg Ala Cys Val Ala Val Arg Glu Ile Val Arg Phe His Ile
1 5 10 15
gaa caa agg caa aca atc gtg aac aag caa cgc acc aaa cgc ggc att 96
Glu Gln Arg Gln Thr Ile Val Asn Lys Gln Arg Thr Lys Arg Gly Ile
20 25 30
ctt gcc gca gca ttg agc atc gga gcg ctc gga gca act ttg att tcg 144
Leu Ala Ala Ala Leu Ser Ile Gly Ala Leu Gly Ala Thr Leu Ile Ser
35 40 45
ggg cca gcc atg gct gcc acc gaa cca gtg cct ggc ttc ccc cag cca 192
Gly Pro Ala Met Ala Ala Thr Glu Pro Val Pro Gly Phe Pro Gln Pro
50 55 60
acc gag cac acc cag aag gca tac agc ccc acc gac gat ttc acc tcg 240
Thr Glu His Thr Gln Lys Ala Tyr Ser Pro Thr Asp Asp Phe Thr Ser
65 70 75 80
cgt tgg acc cgt gct gac gca aag cag atc aag gcc atg tcc gac ccg 288
Arg Trp Thr Arg Ala Asp Ala Lys Gln Ile Lys Ala Met Ser Asp Pro
85 90 95
aac gcg gga tcg cgt gaa aac tcc atg ccc aag gag tac acc atg cct 336
Asn Ala Gly Ser Arg Glu Asn Ser Met Pro Lys Glu Tyr Thr Met Pro
100 105 110
acc gtg ccc cag gat ttc cct gac atg agc aac gag gaa gtc tgg gtt 384
Thr Val Pro Gln Asp Phe Pro Asp Met Ser Asn Glu Glu Val Trp Val
115 120 125
tgg gat act tgg cca ctg act gac gag cac gcc aac cag tac agc gtt 432
Trp Asp Thr Trp Pro Leu Thr Asp Glu His Ala Asn Gln Tyr Ser Val
130 135 140
aat ggc cag gag atc atc ttc tcg ctg gtt gcc gat cgc gac ctt ggc 480
Asn Gly Gln Glu Ile Ile Phe Ser Leu Val Ala Asp Arg Asp Leu Gly
145 150 155 160
ttc gat gag cgc cac cag tac gca cgc att gga tac ttc tac cgt cca 528
Phe Asp Glu Arg His Gln Tyr Ala Arg Ile Gly Tyr Phe Tyr Arg Pro
165 170 175
gca ggc gtt cct gcc gat gag cgt cct gaa gat ggt ggc tgg acc tat 576
Ala Gly Val Pro Ala Asp Glu Arg Pro Glu Asp Gly Gly Trp Thr Tyr
180 185 190
ggc ggc cag gtc ttc gat gag ggc gtc acc ggc aag atc ttc gag gac 624
Gly Gly Gln Val Phe Asp Glu Gly Val Thr Gly Lys Ile Phe Glu Asp
195 200 205
cag tcc ttc agc cac cag act cag tgg tcc ggt tcg gcg cga gtg acc 672
Gln Ser Phe Ser His Gln Thr Gln Trp Ser Gly Ser Ala Arg Val Thr
210 215 220
aaa aat ggc gaa atc aag ctg ttc ttc acc gac gtt gcg ttc tac cgc 720
Lys Asn Gly Glu Ile Lys Leu Phe Phe Thr Asp Val Ala Phe Tyr Arg
225 230 235 240
gac aag gac ggc aag gac atc aag cca tac gac ccg cgc atc gca ttg 768
Asp Lys Asp Gly Lys Asp Ile Lys Pro Tyr Asp Pro Arg Ile Ala Leu
245 250 255
agc gtt ggc cac gtc cac tcg aac aag aat ggc gtg aag ctc acc ggc 816
Ser Val Gly His Val His Ser Asn Lys Asn Gly Val Lys Leu Thr Gly
260 265 270
ttt gac aag gtc acc gac ctg ctg cag gca gac ggc aag tac tac cag 864
Phe Asp Lys Val Thr Asp Leu Leu Gln Ala Asp Gly Lys Tyr Tyr Gln
275 280 285
acc gct gag cag aac tcg tac ttc aac ttc cgt gat cca ttc acc ttt 912
Thr Ala Glu Gln Asn Ser Tyr Phe Asn Phe Arg Asp Pro Phe Thr Phe
290 295 300
gaa gat cca gcg cat cca ggc gaa acc tac atg gtt ttc gag ggc aac 960
Glu Asp Pro Ala His Pro Gly Glu Thr Tyr Met Val Phe Glu Gly Asn
305 310 315 320
acc gct cag aag cgc gat gaa gcc aag tgc acc tcc gag gac ctt ggc 1008
Thr Ala Gln Lys Arg Asp Glu Ala Lys Cys Thr Ser Glu Asp Leu Gly
325 330 335
tac cgc aag ggt gaa acc aac ggc gaa acc gtg gat gag gtc 1050
Tyr Arg Lys Gly Glu Thr Asn Gly Glu Thr Val Asp Glu Val
340 345 350
<210> 12
<211> 350
<212> PRT
<213> Artificial
<220>
<223> Synthetic Construct
<400> 12
Met Glu Arg Ala Cys Val Ala Val Arg Glu Ile Val Arg Phe His Ile
1 5 10 15
Glu Gln Arg Gln Thr Ile Val Asn Lys Gln Arg Thr Lys Arg Gly Ile
20 25 30
Leu Ala Ala Ala Leu Ser Ile Gly Ala Leu Gly Ala Thr Leu Ile Ser
35 40 45
Gly Pro Ala Met Ala Ala Thr Glu Pro Val Pro Gly Phe Pro Gln Pro
50 55 60
Thr Glu His Thr Gln Lys Ala Tyr Ser Pro Thr Asp Asp Phe Thr Ser
65 70 75 80
Arg Trp Thr Arg Ala Asp Ala Lys Gln Ile Lys Ala Met Ser Asp Pro
85 90 95
Asn Ala Gly Ser Arg Glu Asn Ser Met Pro Lys Glu Tyr Thr Met Pro
100 105 110
Thr Val Pro Gln Asp Phe Pro Asp Met Ser Asn Glu Glu Val Trp Val
115 120 125
Trp Asp Thr Trp Pro Leu Thr Asp Glu His Ala Asn Gln Tyr Ser Val
130 135 140
Asn Gly Gln Glu Ile Ile Phe Ser Leu Val Ala Asp Arg Asp Leu Gly
145 150 155 160
Phe Asp Glu Arg His Gln Tyr Ala Arg Ile Gly Tyr Phe Tyr Arg Pro
165 170 175
Ala Gly Val Pro Ala Asp Glu Arg Pro Glu Asp Gly Gly Trp Thr Tyr
180 185 190
Gly Gly Gln Val Phe Asp Glu Gly Val Thr Gly Lys Ile Phe Glu Asp
195 200 205
Gln Ser Phe Ser His Gln Thr Gln Trp Ser Gly Ser Ala Arg Val Thr
210 215 220
Lys Asn Gly Glu Ile Lys Leu Phe Phe Thr Asp Val Ala Phe Tyr Arg
225 230 235 240
Asp Lys Asp Gly Lys Asp Ile Lys Pro Tyr Asp Pro Arg Ile Ala Leu
245 250 255
Ser Val Gly His Val His Ser Asn Lys Asn Gly Val Lys Leu Thr Gly
260 265 270
Phe Asp Lys Val Thr Asp Leu Leu Gln Ala Asp Gly Lys Tyr Tyr Gln
275 280 285
Thr Ala Glu Gln Asn Ser Tyr Phe Asn Phe Arg Asp Pro Phe Thr Phe
290 295 300
Glu Asp Pro Ala His Pro Gly Glu Thr Tyr Met Val Phe Glu Gly Asn
305 310 315 320
Thr Ala Gln Lys Arg Asp Glu Ala Lys Cys Thr Ser Glu Asp Leu Gly
325 330 335
Tyr Arg Lys Gly Glu Thr Asn Gly Glu Thr Val Asp Glu Val
340 345 350
Claims (8)
1.一种融合标签蛋白,其特征在于,所述融合标签蛋白基于耐有机糖苷酶Fru6构建,为截断至耐有机糖苷酶Fru6N端前n个氨基酸截短体,所述n<452,所述耐有机糖苷酶Fru6氨基酸序列如序列表中SEQ ID No:1所示。
2.根据权利要求1所述融合标签蛋白,其特征在于,所述融合标签蛋白为截断至耐有机糖苷酶Fru6N端前209个氨基酸截短体,其氨基酸序列如序列表中SEQ ID No:4所示。
3.根据权利要求1所述融合标签蛋白,其特征在于,所述融合标签蛋白为截断至耐有机糖苷酶Fru6N端前217个氨基酸截短体,其氨基酸序列如序列表中SEQ ID No:6所示。
4.根据权利要求1所述融合标签蛋白,其特征在于,所述融合标签蛋白为截断至耐有机糖苷酶Fru6N端前243个氨基酸截短体,其氨基酸序列如序列表中SEQ ID No:8所示。
5.根据权利要求1所述融合标签蛋白,其特征在于,所述融合标签蛋白为截断至耐有机糖苷酶Fru6N端前312个氨基酸截短体,其氨基酸序列如序列表中SEQ ID No:10所示。
6.根据权利要求1所述融合标签蛋白,其特征在于,所述融合标签蛋白为截断至耐有机糖苷酶Fru6N端前350个氨基酸截短体,其氨基酸序列如序列表中SEQ ID No:12所示。
7.如权利要求1-6任一项所述融合标签蛋白的编码基因,以及含有该编码基因的重组载体,表达盒,转基因细胞系或重组菌。
8.如权利要求1-6任一项所述融合标签蛋白构建的融合蛋白表达系统。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610331686.1A CN105838694B (zh) | 2016-05-18 | 2016-05-18 | 一种融合标签蛋白 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610331686.1A CN105838694B (zh) | 2016-05-18 | 2016-05-18 | 一种融合标签蛋白 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN105838694A true CN105838694A (zh) | 2016-08-10 |
CN105838694B CN105838694B (zh) | 2019-07-26 |
Family
ID=56593636
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610331686.1A Active CN105838694B (zh) | 2016-05-18 | 2016-05-18 | 一种融合标签蛋白 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105838694B (zh) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108588058A (zh) * | 2018-04-28 | 2018-09-28 | 南京工业大学 | β-呋喃果糖苷酶突变体及其应用 |
CN113388009A (zh) * | 2021-05-18 | 2021-09-14 | 中国科学院福建物质结构研究所 | 一种标签蛋白及其编码基因、重组载体与应用 |
CN117801124A (zh) * | 2024-02-29 | 2024-04-02 | 天津凯莱英生物科技有限公司 | 利西那肽前体的融合蛋白及其应用 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20120107876A1 (en) * | 2009-05-01 | 2012-05-03 | Lupin Limited | Novel fusion tag offering solubility to insoluble recombinant protein |
CN103254277A (zh) * | 2013-05-03 | 2013-08-21 | 南京工业大学 | 强分泌性信号肽增强小肽模序及其应用 |
CN103554245A (zh) * | 2013-10-15 | 2014-02-05 | 北京大学 | 一种融合标签蛋白 |
-
2016
- 2016-05-18 CN CN201610331686.1A patent/CN105838694B/zh active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20120107876A1 (en) * | 2009-05-01 | 2012-05-03 | Lupin Limited | Novel fusion tag offering solubility to insoluble recombinant protein |
CN103254277A (zh) * | 2013-05-03 | 2013-08-21 | 南京工业大学 | 强分泌性信号肽增强小肽模序及其应用 |
CN103554245A (zh) * | 2013-10-15 | 2014-02-05 | 北京大学 | 一种融合标签蛋白 |
Non-Patent Citations (3)
Title |
---|
LAVALLIE E R ET AL.: "A thioredoxin gene fusion expression system that circumvents inclusion body formation in the E. coli cytoplasm", 《BIOTECHNOLOGY》 * |
吴珊珊等: "融合标签在蛋白质可溶性表达中的应用进展", 《化工进展》 * |
朱芸等: "大肠杆菌BL21( DE3) 膜组分相关基因的敲除对重组蛋白胞外分泌的影响", 《微生物学报》 * |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108588058A (zh) * | 2018-04-28 | 2018-09-28 | 南京工业大学 | β-呋喃果糖苷酶突变体及其应用 |
CN108588058B (zh) * | 2018-04-28 | 2020-04-21 | 南京工业大学 | β-呋喃果糖苷酶突变体及其应用 |
CN113388009A (zh) * | 2021-05-18 | 2021-09-14 | 中国科学院福建物质结构研究所 | 一种标签蛋白及其编码基因、重组载体与应用 |
CN113388009B (zh) * | 2021-05-18 | 2022-07-19 | 中国科学院福建物质结构研究所 | 一种标签蛋白及其编码基因、重组载体与应用 |
CN117801124A (zh) * | 2024-02-29 | 2024-04-02 | 天津凯莱英生物科技有限公司 | 利西那肽前体的融合蛋白及其应用 |
Also Published As
Publication number | Publication date |
---|---|
CN105838694B (zh) | 2019-07-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2017249424A1 (en) | Recombinant arterivirus replicon systems and uses thereof | |
JP4680237B2 (ja) | 穀類の非貯蔵タンパクを融合キャリアとして使用する胚乳におけるポリペプチド発現方法及びその使用 | |
CN110317278B (zh) | Svv和fmdv的融合蛋白及其编码基因、表达载体、细胞系、工程菌和疫苗与应用 | |
CN104725517B (zh) | 一种提高酵母细胞表面目的蛋白展示量的方法 | |
CN105838694A (zh) | 一种融合标签蛋白 | |
CN103193887B (zh) | 一种重组猪白细胞介素2-Fc融合蛋白及其编码基因和表达方法 | |
CN107353346A (zh) | 一种由猪白蛋白与猪干扰素γ组成的融合蛋白及其制备方法和一种重组猪长效干扰素γ | |
CN108314718A (zh) | 一种家蝇抗菌肽maf-1a肽聚体、其编码基因及其表达和应用 | |
CN103725697B (zh) | 化学合成的金黄色葡萄球菌的表面蛋白FnBPA基因片段及其表达、应用 | |
CN108840952A (zh) | 一种由鸡白蛋白、鸡干扰素γ和鸡干扰素α组成的融合蛋白及其制备方法 | |
CN107266587A (zh) | 一种重组牛长效干扰素α及制备此长效干扰素的融合蛋白及其制备方法 | |
CN109071611A (zh) | 产生重组蛋白的方法 | |
CN110408583A (zh) | 一种表达三肽酶的重组枯草芽孢杆菌及其构建方法 | |
CN102485890B (zh) | 用毕赤酵母分泌表达重组人蛋白质二硫键异构酶 | |
CN107177539A (zh) | 一种日本血吸虫和破伤风双价口服或滴鼻疫苗 | |
TW201236693A (en) | Method of inducing antibody in birds by the OmpA of Riemerella anatipestifer and the Riemerella anatipestifer vaccine | |
CN108840935A (zh) | 一种由羊白蛋白与羊干扰素γ组成的融合蛋白及其制备方法和一种重组羊长效干扰素γ | |
CN111100833A (zh) | 表达鲇鱼爱德华氏菌外膜蛋白的重组菌、制备方法及应用 | |
CN109022471A (zh) | 生产草酸氧化酶的大肠杆菌表达系统、草酸氧化酶的生产方法及其应用 | |
CN108864293A (zh) | 一种由犬白蛋白与犬干扰素γ组成的融合蛋白及其制备方法和一种重组犬长效干扰素γ | |
CN108794644A (zh) | 一种由牛白细胞介素2、牛干扰素γ和牛干扰素α组成的融合蛋白及其制备方法 | |
CN108840934A (zh) | 一种重组羊长效干扰素τ及制备此长效干扰素的融合蛋白及其制备方法 | |
CN116024247B (zh) | 一种鳜蛙虹彩病毒疫苗及其制备方法和应用 | |
CN110564750B (zh) | 一种鲫造血器官坏死症酵母口服疫苗及应用 | |
CN108794645A (zh) | 一种由牛白蛋白、牛干扰素γ和牛干扰素α组成的融合蛋白及其制备方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |