CN109293783B - 多肽、其生产方法和用途 - Google Patents
多肽、其生产方法和用途 Download PDFInfo
- Publication number
- CN109293783B CN109293783B CN201811254050.7A CN201811254050A CN109293783B CN 109293783 B CN109293783 B CN 109293783B CN 201811254050 A CN201811254050 A CN 201811254050A CN 109293783 B CN109293783 B CN 109293783B
- Authority
- CN
- China
- Prior art keywords
- gly
- pro
- ala
- glu
- ser
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 108090000765 processed proteins & peptides Proteins 0.000 title claims abstract description 52
- 102000004196 processed proteins & peptides Human genes 0.000 title claims abstract description 42
- 229920001184 polypeptide Polymers 0.000 title claims abstract description 41
- 238000004519 manufacturing process Methods 0.000 title claims abstract description 12
- 230000021164 cell adhesion Effects 0.000 claims abstract description 10
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 29
- 238000000034 method Methods 0.000 claims description 13
- 239000000203 mixture Substances 0.000 claims description 13
- 241000588724 Escherichia coli Species 0.000 claims description 11
- 239000002537 cosmetic Substances 0.000 claims description 8
- 239000013604 expression vector Substances 0.000 claims description 8
- 230000029087 digestion Effects 0.000 claims description 7
- 108091033319 polynucleotide Proteins 0.000 claims description 6
- 102000040430 polynucleotide Human genes 0.000 claims description 6
- 239000002157 polynucleotide Substances 0.000 claims description 6
- 230000036541 health Effects 0.000 claims description 5
- 239000003814 drug Substances 0.000 claims description 3
- 238000003306 harvesting Methods 0.000 claims description 2
- 239000013587 production medium Substances 0.000 claims description 2
- 229940079593 drug Drugs 0.000 claims 1
- 230000001737 promoting effect Effects 0.000 claims 1
- 230000000694 effects Effects 0.000 abstract description 18
- 210000004897 n-terminal region Anatomy 0.000 abstract description 8
- 210000004899 c-terminal region Anatomy 0.000 abstract description 5
- 108010035532 Collagen Proteins 0.000 description 68
- 102000008186 Collagen Human genes 0.000 description 68
- 229920001436 collagen Polymers 0.000 description 67
- 108090000623 proteins and genes Proteins 0.000 description 39
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 36
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 29
- 102000004169 proteins and genes Human genes 0.000 description 29
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 25
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 24
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 23
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 22
- CAVKXZMMDNOZJU-UHFFFAOYSA-N Gly-Pro-Ala-Gly-Pro Natural products C1CCC(C(O)=O)N1C(=O)CNC(=O)C(C)NC(=O)C1CCCN1C(=O)CN CAVKXZMMDNOZJU-UHFFFAOYSA-N 0.000 description 18
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 18
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 18
- 108010047495 alanylglycine Proteins 0.000 description 18
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 17
- 241000894006 Bacteria Species 0.000 description 16
- 210000004027 cell Anatomy 0.000 description 15
- 239000000047 product Substances 0.000 description 15
- 108010029020 prolylglycine Proteins 0.000 description 15
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 description 13
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 13
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 13
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 12
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 12
- 108010079364 N-glycylalanine Proteins 0.000 description 12
- 150000001413 amino acids Chemical class 0.000 description 12
- 238000005215 recombination Methods 0.000 description 12
- 230000006798 recombination Effects 0.000 description 12
- 125000000539 amino acid group Chemical group 0.000 description 11
- 108010078144 glutaminyl-glycine Proteins 0.000 description 11
- 239000011780 sodium chloride Substances 0.000 description 11
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 10
- HAVKMRGWNXMCDR-STQMWFEESA-N Arg-Gly-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HAVKMRGWNXMCDR-STQMWFEESA-N 0.000 description 9
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 9
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 9
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 9
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 8
- 108020004705 Codon Proteins 0.000 description 8
- NSORZJXKUQFEKL-JGVFFNPUSA-N Gln-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)N)N)C(=O)O NSORZJXKUQFEKL-JGVFFNPUSA-N 0.000 description 8
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 8
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 8
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 8
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 8
- 239000007788 liquid Substances 0.000 description 8
- 210000001519 tissue Anatomy 0.000 description 8
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 7
- 108010050848 glycylleucine Proteins 0.000 description 7
- 108010057821 leucylproline Proteins 0.000 description 7
- 238000005457 optimization Methods 0.000 description 7
- SCAKQYSGEIHPLV-IUCAKERBSA-N (4S)-4-[(2-aminoacetyl)amino]-5-[(2S)-2-(carboxymethylcarbamoyl)pyrrolidin-1-yl]-5-oxopentanoic acid Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SCAKQYSGEIHPLV-IUCAKERBSA-N 0.000 description 6
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 6
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 6
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 6
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 6
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 6
- 108010033276 Peptide Fragments Proteins 0.000 description 6
- 102000007079 Peptide Fragments Human genes 0.000 description 6
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 6
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 6
- QGZKDVFQNNGYKY-UHFFFAOYSA-N ammonia Natural products N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 description 6
- 230000015572 biosynthetic process Effects 0.000 description 6
- 238000001514 detection method Methods 0.000 description 6
- 108010026333 seryl-proline Proteins 0.000 description 6
- 239000000243 solution Substances 0.000 description 6
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 5
- 108010072062 GEKG peptide Proteins 0.000 description 5
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 5
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 5
- 108091028043 Nucleic acid sequence Proteins 0.000 description 5
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 5
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 5
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 5
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 5
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 5
- 229910021529 ammonia Inorganic materials 0.000 description 5
- NKLPQNGYXWVELD-UHFFFAOYSA-M coomassie brilliant blue Chemical compound [Na+].C1=CC(OCC)=CC=C1NC1=CC=C(C(=C2C=CC(C=C2)=[N+](CC)CC=2C=C(C=CC=2)S([O-])(=O)=O)C=2C=CC(=CC=2)N(CC)CC=2C=C(C=CC=2)S([O-])(=O)=O)C=C1 NKLPQNGYXWVELD-UHFFFAOYSA-M 0.000 description 5
- 238000001962 electrophoresis Methods 0.000 description 5
- 239000012634 fragment Substances 0.000 description 5
- 230000002068 genetic effect Effects 0.000 description 5
- 108010077515 glycylproline Proteins 0.000 description 5
- 108010037850 glycylvaline Proteins 0.000 description 5
- 239000001963 growth medium Substances 0.000 description 5
- 150000002460 imidazoles Chemical class 0.000 description 5
- 230000008569 process Effects 0.000 description 5
- 108010031719 prolyl-serine Proteins 0.000 description 5
- 230000003252 repetitive effect Effects 0.000 description 5
- 239000000523 sample Substances 0.000 description 5
- AJPJDKMHJJGVTQ-UHFFFAOYSA-M sodium dihydrogen phosphate Chemical compound [Na+].OP(O)([O-])=O AJPJDKMHJJGVTQ-UHFFFAOYSA-M 0.000 description 5
- 238000006467 substitution reaction Methods 0.000 description 5
- 238000003786 synthesis reaction Methods 0.000 description 5
- 108010061238 threonyl-glycine Proteins 0.000 description 5
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 4
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 4
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 4
- 108020004414 DNA Proteins 0.000 description 4
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 4
- PGPJSRSLQNXBDT-YUMQZZPRSA-N Gln-Arg-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O PGPJSRSLQNXBDT-YUMQZZPRSA-N 0.000 description 4
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 4
- YYXJFBMCOUSYSF-RYUDHWBXSA-N Gly-Phe-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYXJFBMCOUSYSF-RYUDHWBXSA-N 0.000 description 4
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 4
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 4
- VSRXPEHZMHSFKU-IUCAKERBSA-N Lys-Gln-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VSRXPEHZMHSFKU-IUCAKERBSA-N 0.000 description 4
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 4
- 108091005804 Peptidases Proteins 0.000 description 4
- 239000001888 Peptone Substances 0.000 description 4
- 108010080698 Peptones Proteins 0.000 description 4
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 4
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 4
- 239000004365 Protease Substances 0.000 description 4
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 4
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 4
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 4
- 239000002585 base Substances 0.000 description 4
- 230000003115 biocidal effect Effects 0.000 description 4
- 239000000872 buffer Substances 0.000 description 4
- 238000005119 centrifugation Methods 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 239000013613 expression plasmid Substances 0.000 description 4
- 239000003292 glue Substances 0.000 description 4
- 108010015792 glycyllysine Proteins 0.000 description 4
- 108010081551 glycylphenylalanine Proteins 0.000 description 4
- 238000009396 hybridization Methods 0.000 description 4
- 230000001939 inductive effect Effects 0.000 description 4
- 238000003780 insertion Methods 0.000 description 4
- 230000037431 insertion Effects 0.000 description 4
- 239000000463 material Substances 0.000 description 4
- 235000019319 peptone Nutrition 0.000 description 4
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 4
- 238000010186 staining Methods 0.000 description 4
- 238000012360 testing method Methods 0.000 description 4
- CUVSTAMIHSSVKL-UWVGGRQHSA-N (4s)-4-[(2-aminoacetyl)amino]-5-[[(2s)-6-amino-1-(carboxymethylamino)-1-oxohexan-2-yl]amino]-5-oxopentanoic acid Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN CUVSTAMIHSSVKL-UWVGGRQHSA-N 0.000 description 3
- FRXSZNDVFUDTIR-UHFFFAOYSA-N 6-methoxy-1,2,3,4-tetrahydroquinoline Chemical compound N1CCCC2=CC(OC)=CC=C21 FRXSZNDVFUDTIR-UHFFFAOYSA-N 0.000 description 3
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 3
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 3
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 3
- 125000001433 C-terminal amino-acid group Chemical group 0.000 description 3
- DZLQXIFVQFTFJY-BYPYZUCNSA-N Cys-Gly-Gly Chemical compound SC[C@H](N)C(=O)NCC(=O)NCC(O)=O DZLQXIFVQFTFJY-BYPYZUCNSA-N 0.000 description 3
- PXHABOCPJVTGEK-BQBZGAKWSA-N Glu-Gln-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O PXHABOCPJVTGEK-BQBZGAKWSA-N 0.000 description 3
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 3
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 3
- DTRUBYPMMVPQPD-YUMQZZPRSA-N Gly-Gln-Arg Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DTRUBYPMMVPQPD-YUMQZZPRSA-N 0.000 description 3
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 3
- IEGFSKKANYKBDU-QWHCGFSZSA-N Gly-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)CN)C(=O)O IEGFSKKANYKBDU-QWHCGFSZSA-N 0.000 description 3
- QSQXZZCGPXQBPP-BQBZGAKWSA-N Gly-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)CN)C(=O)N[C@@H](CS)C(=O)O QSQXZZCGPXQBPP-BQBZGAKWSA-N 0.000 description 3
- HJARVELKOSZUEW-YUMQZZPRSA-N Gly-Pro-Gln Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJARVELKOSZUEW-YUMQZZPRSA-N 0.000 description 3
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 3
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 3
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 3
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 3
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 3
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 3
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 3
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 3
- WIPAMEKBSHNFQE-IUCAKERBSA-N Pro-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@@H]1CCCN1 WIPAMEKBSHNFQE-IUCAKERBSA-N 0.000 description 3
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 3
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 3
- XEYUMGGWQCIWAR-XVKPBYJWSA-N Val-Gln-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N XEYUMGGWQCIWAR-XVKPBYJWSA-N 0.000 description 3
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 3
- 238000010521 absorption reaction Methods 0.000 description 3
- 230000001464 adherent effect Effects 0.000 description 3
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 3
- 108010047857 aspartylglycine Proteins 0.000 description 3
- 230000004071 biological effect Effects 0.000 description 3
- 108010004073 cysteinylcysteine Proteins 0.000 description 3
- 239000000284 extract Substances 0.000 description 3
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 3
- 238000001727 in vivo Methods 0.000 description 3
- 230000006698 induction Effects 0.000 description 3
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 3
- 108010064235 lysylglycine Proteins 0.000 description 3
- 239000002773 nucleotide Substances 0.000 description 3
- 125000003729 nucleotide group Chemical group 0.000 description 3
- 239000008363 phosphate buffer Substances 0.000 description 3
- 239000013612 plasmid Substances 0.000 description 3
- 238000002360 preparation method Methods 0.000 description 3
- 238000012216 screening Methods 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- 239000006228 supernatant Substances 0.000 description 3
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 2
- 241000251468 Actinopterygii Species 0.000 description 2
- 229920001817 Agar Polymers 0.000 description 2
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 2
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 2
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 2
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 2
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 2
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 2
- HAFCJCDJGIOYPW-WDSKDSINSA-N Asp-Gly-Gln Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O HAFCJCDJGIOYPW-WDSKDSINSA-N 0.000 description 2
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 2
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 2
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 2
- 102000014914 Carrier Proteins Human genes 0.000 description 2
- 108010078791 Carrier Proteins Proteins 0.000 description 2
- 108010022452 Collagen Type I Proteins 0.000 description 2
- 102000012422 Collagen Type I Human genes 0.000 description 2
- 102100033601 Collagen alpha-1(I) chain Human genes 0.000 description 2
- VSXBYIJUAXPAAL-WDSKDSINSA-N Gln-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O VSXBYIJUAXPAAL-WDSKDSINSA-N 0.000 description 2
- GNMQDOGFWYWPNM-LAEOZQHASA-N Gln-Gly-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)CNC(=O)[C@@H](N)CCC(N)=O)C(O)=O GNMQDOGFWYWPNM-LAEOZQHASA-N 0.000 description 2
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 2
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 2
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 2
- IWAXHBCACVWNHT-BQBZGAKWSA-N Gly-Asp-Arg Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IWAXHBCACVWNHT-BQBZGAKWSA-N 0.000 description 2
- HFXJIZNEXNIZIJ-BQBZGAKWSA-N Gly-Glu-Gln Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFXJIZNEXNIZIJ-BQBZGAKWSA-N 0.000 description 2
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 2
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 2
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 2
- GAAHQHNCMIAYEX-UWVGGRQHSA-N Gly-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GAAHQHNCMIAYEX-UWVGGRQHSA-N 0.000 description 2
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 2
- NGRPGJGKJMUGDM-XVKPBYJWSA-N Gly-Val-Gln Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NGRPGJGKJMUGDM-XVKPBYJWSA-N 0.000 description 2
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 2
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 2
- 108010065920 Insulin Lispro Proteins 0.000 description 2
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 2
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 2
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 2
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 2
- 108700011066 PreScission Protease Proteins 0.000 description 2
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 2
- XQSREVQDGCPFRJ-STQMWFEESA-N Pro-Gly-Phe Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XQSREVQDGCPFRJ-STQMWFEESA-N 0.000 description 2
- 108010050808 Procollagen Proteins 0.000 description 2
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 2
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 2
- 244000000231 Sesamum indicum Species 0.000 description 2
- 235000003434 Sesamum indicum Nutrition 0.000 description 2
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 2
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 2
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 2
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- 239000008272 agar Substances 0.000 description 2
- 108010029483 alpha 1 Chain Collagen Type I Proteins 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 108010077245 asparaginyl-proline Proteins 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- 239000012148 binding buffer Substances 0.000 description 2
- 238000009835 boiling Methods 0.000 description 2
- 210000000988 bone and bone Anatomy 0.000 description 2
- UDSAIICHUKSCKT-UHFFFAOYSA-N bromophenol blue Chemical compound C1=C(Br)C(O)=C(Br)C=C1C1(C=2C=C(Br)C(O)=C(Br)C=2)C2=CC=CC=C2S(=O)(=O)O1 UDSAIICHUKSCKT-UHFFFAOYSA-N 0.000 description 2
- 229940041514 candida albicans extract Drugs 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 235000019441 ethanol Nutrition 0.000 description 2
- 235000019688 fish Nutrition 0.000 description 2
- 238000004108 freeze drying Methods 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 108010049041 glutamylalanine Proteins 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 210000003000 inclusion body Anatomy 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 239000012160 loading buffer Substances 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 229910000403 monosodium phosphate Inorganic materials 0.000 description 2
- 235000019799 monosodium phosphate Nutrition 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 239000002853 nucleic acid probe Substances 0.000 description 2
- 238000005325 percolation Methods 0.000 description 2
- 108010005636 polypeptide C Proteins 0.000 description 2
- 230000004481 post-translational protein modification Effects 0.000 description 2
- 239000000843 powder Substances 0.000 description 2
- 230000001376 precipitating effect Effects 0.000 description 2
- 108010053725 prolylvaline Proteins 0.000 description 2
- 238000001742 protein purification Methods 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 230000035939 shock Effects 0.000 description 2
- 238000012549 training Methods 0.000 description 2
- 239000011534 wash buffer Substances 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- 239000012138 yeast extract Substances 0.000 description 2
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 2
- CEHZCZCQHUNAJF-AVGNSLFASA-N (2s)-1-[2-[[(2s)-1-[(2s)-2-amino-5-(diaminomethylideneamino)pentanoyl]pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N1[C@H](C(O)=O)CCC1 CEHZCZCQHUNAJF-AVGNSLFASA-N 0.000 description 1
- AUXMWYRZQPIXCC-KNIFDHDWSA-N (2s)-2-amino-4-methylpentanoic acid;(2s)-2-aminopropanoic acid Chemical compound C[C@H](N)C(O)=O.CC(C)C[C@H](N)C(O)=O AUXMWYRZQPIXCC-KNIFDHDWSA-N 0.000 description 1
- OTEWWRBKGONZBW-UHFFFAOYSA-N 2-[[2-[[2-[(2-azaniumylacetyl)amino]-4-methylpentanoyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NC(CC(C)C)C(=O)NCC(=O)NCC(O)=O OTEWWRBKGONZBW-UHFFFAOYSA-N 0.000 description 1
- GOJUJUVQIVIZAV-UHFFFAOYSA-N 2-amino-4,6-dichloropyrimidine-5-carbaldehyde Chemical group NC1=NC(Cl)=C(C=O)C(Cl)=N1 GOJUJUVQIVIZAV-UHFFFAOYSA-N 0.000 description 1
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 1
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 1
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 1
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 1
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 1
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 1
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- QHASENCZLDHBGX-ONGXEEELSA-N Ala-Gly-Phe Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QHASENCZLDHBGX-ONGXEEELSA-N 0.000 description 1
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 1
- GSHKMNKPMLXSQW-KBIXCLLPSA-N Ala-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C)N GSHKMNKPMLXSQW-KBIXCLLPSA-N 0.000 description 1
- QPBSRMDNJOTFAL-AICCOOGYSA-N Ala-Leu-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QPBSRMDNJOTFAL-AICCOOGYSA-N 0.000 description 1
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 1
- XKXAZPSREVUCRT-BPNCWPANSA-N Ala-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=C(O)C=C1 XKXAZPSREVUCRT-BPNCWPANSA-N 0.000 description 1
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 1
- PTVGLOCPAVYPFG-CIUDSAMLSA-N Arg-Gln-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PTVGLOCPAVYPFG-CIUDSAMLSA-N 0.000 description 1
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 1
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 1
- IYMAXBFPHPZYIK-BQBZGAKWSA-N Arg-Gly-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IYMAXBFPHPZYIK-BQBZGAKWSA-N 0.000 description 1
- YNSGXDWWPCGGQS-YUMQZZPRSA-N Arg-Gly-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O YNSGXDWWPCGGQS-YUMQZZPRSA-N 0.000 description 1
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 1
- RFXXUWGNVRJTNQ-QXEWZRGKSA-N Arg-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N RFXXUWGNVRJTNQ-QXEWZRGKSA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 1
- SSZGOKWBHLOCHK-DCAQKATOSA-N Arg-Lys-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N SSZGOKWBHLOCHK-DCAQKATOSA-N 0.000 description 1
- PRLPSDIHSRITSF-UNQGMJICSA-N Arg-Phe-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PRLPSDIHSRITSF-UNQGMJICSA-N 0.000 description 1
- 108010051330 Arg-Pro-Gly-Pro Proteins 0.000 description 1
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 1
- XRLOBFSLPCHYLQ-ULQDDVLXSA-N Arg-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O XRLOBFSLPCHYLQ-ULQDDVLXSA-N 0.000 description 1
- PTNFNTOBUDWHNZ-GUBZILKMSA-N Asn-Arg-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O PTNFNTOBUDWHNZ-GUBZILKMSA-N 0.000 description 1
- LJUOLNXOWSWGKF-ACZMJKKPSA-N Asn-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N LJUOLNXOWSWGKF-ACZMJKKPSA-N 0.000 description 1
- VJTWLBMESLDOMK-WDSKDSINSA-N Asn-Gln-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VJTWLBMESLDOMK-WDSKDSINSA-N 0.000 description 1
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 1
- KHCNTVRVAYCPQE-CIUDSAMLSA-N Asn-Lys-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O KHCNTVRVAYCPQE-CIUDSAMLSA-N 0.000 description 1
- CDGHMJJJHYKMPA-DLOVCJGASA-N Asn-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC(=O)N)N CDGHMJJJHYKMPA-DLOVCJGASA-N 0.000 description 1
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 description 1
- ZAESWDKAMDVHLL-RCOVLWMOSA-N Asn-Val-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O ZAESWDKAMDVHLL-RCOVLWMOSA-N 0.000 description 1
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 1
- ZLGKHJHFYSRUBH-FXQIFTODSA-N Asp-Arg-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLGKHJHFYSRUBH-FXQIFTODSA-N 0.000 description 1
- AXXCUABIFZPKPM-BQBZGAKWSA-N Asp-Arg-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O AXXCUABIFZPKPM-BQBZGAKWSA-N 0.000 description 1
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 1
- LJRPYAZQQWHEEV-FXQIFTODSA-N Asp-Gln-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O LJRPYAZQQWHEEV-FXQIFTODSA-N 0.000 description 1
- PSLSTUMPZILTAH-BYULHYEWSA-N Asp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PSLSTUMPZILTAH-BYULHYEWSA-N 0.000 description 1
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 1
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 1
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 1
- 241000972773 Aulopiformes Species 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 101100494262 Caenorhabditis elegans best-12 gene Proteins 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- LHLSSZYQFUNWRZ-NAKRPEOUSA-N Cys-Arg-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LHLSSZYQFUNWRZ-NAKRPEOUSA-N 0.000 description 1
- UPJGYXRAPJWIHD-CIUDSAMLSA-N Cys-Asn-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UPJGYXRAPJWIHD-CIUDSAMLSA-N 0.000 description 1
- SQJSYLDKQBZQTG-FXQIFTODSA-N Cys-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N SQJSYLDKQBZQTG-FXQIFTODSA-N 0.000 description 1
- RESAHOSBQHMOKH-KKUMJFAQSA-N Cys-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N RESAHOSBQHMOKH-KKUMJFAQSA-N 0.000 description 1
- NITLUESFANGEIW-BQBZGAKWSA-N Cys-Pro-Gly Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O NITLUESFANGEIW-BQBZGAKWSA-N 0.000 description 1
- MQQLYEHXSBJTRK-FXQIFTODSA-N Cys-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N MQQLYEHXSBJTRK-FXQIFTODSA-N 0.000 description 1
- 241000305071 Enterobacterales Species 0.000 description 1
- 241000588921 Enterobacteriaceae Species 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 241000588722 Escherichia Species 0.000 description 1
- 108010049003 Fibrinogen Proteins 0.000 description 1
- 102000008946 Fibrinogen Human genes 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- ZPDVKYLJTOFQJV-WDSKDSINSA-N Gln-Asn-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ZPDVKYLJTOFQJV-WDSKDSINSA-N 0.000 description 1
- WQWMZOIPXWSZNE-WDSKDSINSA-N Gln-Asp-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O WQWMZOIPXWSZNE-WDSKDSINSA-N 0.000 description 1
- AJDMYLOISOCHHC-YVNDNENWSA-N Gln-Gln-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AJDMYLOISOCHHC-YVNDNENWSA-N 0.000 description 1
- NSNUZSPSADIMJQ-WDSKDSINSA-N Gln-Gly-Asp Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NSNUZSPSADIMJQ-WDSKDSINSA-N 0.000 description 1
- QQAPDATZKKTBIY-YUMQZZPRSA-N Gln-Gly-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O QQAPDATZKKTBIY-YUMQZZPRSA-N 0.000 description 1
- VGTDBGYFVWOQTI-RYUDHWBXSA-N Gln-Gly-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VGTDBGYFVWOQTI-RYUDHWBXSA-N 0.000 description 1
- ORYMMTRPKVTGSJ-XVKPBYJWSA-N Gln-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O ORYMMTRPKVTGSJ-XVKPBYJWSA-N 0.000 description 1
- JNENSVNAUWONEZ-GUBZILKMSA-N Gln-Lys-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JNENSVNAUWONEZ-GUBZILKMSA-N 0.000 description 1
- BBFCMGBMYIAGRS-AUTRQRHGSA-N Gln-Val-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BBFCMGBMYIAGRS-AUTRQRHGSA-N 0.000 description 1
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 1
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 1
- MXPBQDFWIMBACQ-ACZMJKKPSA-N Glu-Cys-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O MXPBQDFWIMBACQ-ACZMJKKPSA-N 0.000 description 1
- NUSWUSKZRCGFEX-FXQIFTODSA-N Glu-Glu-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O NUSWUSKZRCGFEX-FXQIFTODSA-N 0.000 description 1
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 1
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 1
- YRMZCZIRHYCNHX-RYUDHWBXSA-N Glu-Phe-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O YRMZCZIRHYCNHX-RYUDHWBXSA-N 0.000 description 1
- HMJULNMJWOZNFI-XHNCKOQMSA-N Glu-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N)C(=O)O HMJULNMJWOZNFI-XHNCKOQMSA-N 0.000 description 1
- LWYUQLZOIORFFJ-XKBZYTNZSA-N Glu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O LWYUQLZOIORFFJ-XKBZYTNZSA-N 0.000 description 1
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 1
- DTLLNDVORUEOTM-WDCWCFNPSA-N Glu-Thr-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DTLLNDVORUEOTM-WDCWCFNPSA-N 0.000 description 1
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 1
- QLNKFGTZOBVMCS-JBACZVJFSA-N Glu-Tyr-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QLNKFGTZOBVMCS-JBACZVJFSA-N 0.000 description 1
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 1
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 1
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 1
- LERGJIVJIIODPZ-ZANVPECISA-N Gly-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)CN)C)C(O)=O)=CNC2=C1 LERGJIVJIIODPZ-ZANVPECISA-N 0.000 description 1
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 1
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 1
- XUORRGAFUQIMLC-STQMWFEESA-N Gly-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN)O XUORRGAFUQIMLC-STQMWFEESA-N 0.000 description 1
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 1
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 1
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 1
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- VNBNZUAPOYGRDB-ZDLURKLDSA-N Gly-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)O VNBNZUAPOYGRDB-ZDLURKLDSA-N 0.000 description 1
- JMQFHZWESBGPFC-WDSKDSINSA-N Gly-Gln-Asp Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JMQFHZWESBGPFC-WDSKDSINSA-N 0.000 description 1
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 1
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 1
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 1
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 1
- TVDHVLGFJSHPAX-UWVGGRQHSA-N Gly-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 TVDHVLGFJSHPAX-UWVGGRQHSA-N 0.000 description 1
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 1
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 1
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- ZWRDOVYMQAAISL-UWVGGRQHSA-N Gly-Met-Lys Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCCN ZWRDOVYMQAAISL-UWVGGRQHSA-N 0.000 description 1
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 1
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 1
- WDXLKVQATNEAJQ-BQBZGAKWSA-N Gly-Pro-Asp Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WDXLKVQATNEAJQ-BQBZGAKWSA-N 0.000 description 1
- GLACUWHUYFBSPJ-FJXKBIBVSA-N Gly-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GLACUWHUYFBSPJ-FJXKBIBVSA-N 0.000 description 1
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 1
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 1
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 1
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 1
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 1
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 1
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- QSLKWWDKIXMWJV-SRVKXCTJSA-N His-Cys-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N QSLKWWDKIXMWJV-SRVKXCTJSA-N 0.000 description 1
- CHZRWFUGWRTUOD-IUCAKERBSA-N His-Gly-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N CHZRWFUGWRTUOD-IUCAKERBSA-N 0.000 description 1
- 206010020751 Hypersensitivity Diseases 0.000 description 1
- NPROWIBAWYMPAZ-GUDRVLHUSA-N Ile-Asp-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N NPROWIBAWYMPAZ-GUDRVLHUSA-N 0.000 description 1
- CTHAJJYOHOBUDY-GHCJXIJMSA-N Ile-Cys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N CTHAJJYOHOBUDY-GHCJXIJMSA-N 0.000 description 1
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 1
- DGTOKVBDZXJHNZ-WZLNRYEVSA-N Ile-Thr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N DGTOKVBDZXJHNZ-WZLNRYEVSA-N 0.000 description 1
- 150000008575 L-amino acids Chemical class 0.000 description 1
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 1
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 1
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 1
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 1
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 description 1
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 1
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 1
- GAOJCVKPIGHTGO-UWVGGRQHSA-N Lys-Arg-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O GAOJCVKPIGHTGO-UWVGGRQHSA-N 0.000 description 1
- BRSGXFITDXFMFF-IHRRRGAJSA-N Lys-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N BRSGXFITDXFMFF-IHRRRGAJSA-N 0.000 description 1
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 1
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 1
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 1
- UETQMSASAVBGJY-QWRGUYRKSA-N Lys-Gly-His Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 UETQMSASAVBGJY-QWRGUYRKSA-N 0.000 description 1
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 1
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 1
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 1
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 1
- OFNCSQNBSWGGNV-DCAQKATOSA-N Met-Cys-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 OFNCSQNBSWGGNV-DCAQKATOSA-N 0.000 description 1
- LQMHZERGCQJKAH-STQMWFEESA-N Met-Gly-Phe Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LQMHZERGCQJKAH-STQMWFEESA-N 0.000 description 1
- MVBZBRKNZVJEKK-DTWKUNHWSA-N Met-Gly-Pro Chemical compound CSCC[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N MVBZBRKNZVJEKK-DTWKUNHWSA-N 0.000 description 1
- KYXDADPHSNFWQX-VEVYYDQMSA-N Met-Thr-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O KYXDADPHSNFWQX-VEVYYDQMSA-N 0.000 description 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 1
- 240000007594 Oryza sativa Species 0.000 description 1
- 235000007164 Oryza sativa Nutrition 0.000 description 1
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 description 1
- LLGTYVHITPVGKR-RYUDHWBXSA-N Phe-Gln-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O LLGTYVHITPVGKR-RYUDHWBXSA-N 0.000 description 1
- UAMFZRNCIFFMLE-FHWLQOOXSA-N Phe-Glu-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N UAMFZRNCIFFMLE-FHWLQOOXSA-N 0.000 description 1
- WEMYTDDMDBLPMI-DKIMLUQUSA-N Phe-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N WEMYTDDMDBLPMI-DKIMLUQUSA-N 0.000 description 1
- XMQSOOJRRVEHRO-ULQDDVLXSA-N Phe-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMQSOOJRRVEHRO-ULQDDVLXSA-N 0.000 description 1
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 1
- GLJZDMZJHFXJQG-BZSNNMDCSA-N Phe-Ser-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLJZDMZJHFXJQG-BZSNNMDCSA-N 0.000 description 1
- KIQUCMUULDXTAZ-HJOGWXRNSA-N Phe-Tyr-Tyr Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O KIQUCMUULDXTAZ-HJOGWXRNSA-N 0.000 description 1
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 1
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 1
- WPQKSRHDTMRSJM-CIUDSAMLSA-N Pro-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 WPQKSRHDTMRSJM-CIUDSAMLSA-N 0.000 description 1
- HJSCRFZVGXAGNG-SRVKXCTJSA-N Pro-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 HJSCRFZVGXAGNG-SRVKXCTJSA-N 0.000 description 1
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 1
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 1
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 1
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 1
- LNOWDSPAYBWJOR-PEDHHIEDSA-N Pro-Ile-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LNOWDSPAYBWJOR-PEDHHIEDSA-N 0.000 description 1
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 1
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 1
- FYKUEXMZYFIZKA-DCAQKATOSA-N Pro-Pro-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FYKUEXMZYFIZKA-DCAQKATOSA-N 0.000 description 1
- DWPXHLIBFQLKLK-CYDGBPFRSA-N Pro-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 DWPXHLIBFQLKLK-CYDGBPFRSA-N 0.000 description 1
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 1
- STGVYUTZKGPRCI-GUBZILKMSA-N Pro-Val-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 STGVYUTZKGPRCI-GUBZILKMSA-N 0.000 description 1
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 1
- 108010079005 RDV peptide Proteins 0.000 description 1
- 206010070834 Sensitisation Diseases 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- NJSPTZXVPZDRCU-UBHSHLNASA-N Ser-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N NJSPTZXVPZDRCU-UBHSHLNASA-N 0.000 description 1
- CDVFZMOFNJPUDD-ACZMJKKPSA-N Ser-Gln-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CDVFZMOFNJPUDD-ACZMJKKPSA-N 0.000 description 1
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 1
- JEHPKECJCALLRW-CUJWVEQBSA-N Ser-His-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEHPKECJCALLRW-CUJWVEQBSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- HDBOEVPDIDDEPC-CIUDSAMLSA-N Ser-Lys-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O HDBOEVPDIDDEPC-CIUDSAMLSA-N 0.000 description 1
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 1
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 1
- DKGRNFUXVTYRAS-UBHSHLNASA-N Ser-Ser-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DKGRNFUXVTYRAS-UBHSHLNASA-N 0.000 description 1
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- 238000002105 Southern blotting Methods 0.000 description 1
- 208000037065 Subacute sclerosing leukoencephalitis Diseases 0.000 description 1
- 206010042297 Subacute sclerosing panencephalitis Diseases 0.000 description 1
- DCCGCVLVVSAJFK-NUMRIWBASA-N Thr-Asp-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O DCCGCVLVVSAJFK-NUMRIWBASA-N 0.000 description 1
- NRUPKQSXTJNQGD-XGEHTFHBSA-N Thr-Cys-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NRUPKQSXTJNQGD-XGEHTFHBSA-N 0.000 description 1
- DSLHSTIUAPKERR-XGEHTFHBSA-N Thr-Cys-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O DSLHSTIUAPKERR-XGEHTFHBSA-N 0.000 description 1
- DIPIPFHFLPTCLK-LOKLDPHHSA-N Thr-Gln-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O DIPIPFHFLPTCLK-LOKLDPHHSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 1
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 1
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 1
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 1
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 1
- ZTPXSEUVYNNZRB-CDMKHQONSA-N Thr-Gly-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZTPXSEUVYNNZRB-CDMKHQONSA-N 0.000 description 1
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 1
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 1
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 1
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 1
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 1
- ZPZNQAZHMCLTOA-PXDAIIFMSA-N Trp-Tyr-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=C(O)C=C1 ZPZNQAZHMCLTOA-PXDAIIFMSA-N 0.000 description 1
- HSVPZJLMPLMPOX-BPNCWPANSA-N Tyr-Arg-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O HSVPZJLMPLMPOX-BPNCWPANSA-N 0.000 description 1
- YGKVNUAKYPGORG-AVGNSLFASA-N Tyr-Asp-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YGKVNUAKYPGORG-AVGNSLFASA-N 0.000 description 1
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 1
- SINRIKQYQJRGDQ-MEYUZBJRSA-N Tyr-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SINRIKQYQJRGDQ-MEYUZBJRSA-N 0.000 description 1
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 1
- LABUITCFCAABSV-UHFFFAOYSA-N Val-Ala-Tyr Natural products CC(C)C(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LABUITCFCAABSV-UHFFFAOYSA-N 0.000 description 1
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 description 1
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 1
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 1
- QTXGUIMEHKCPBH-FHWLQOOXSA-N Val-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 QTXGUIMEHKCPBH-FHWLQOOXSA-N 0.000 description 1
- LZRWTJSPTJSWDN-FKBYEOEOSA-N Val-Trp-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O)N LZRWTJSPTJSWDN-FKBYEOEOSA-N 0.000 description 1
- PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- 108010005233 alanylglutamic acid Proteins 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- 239000003513 alkali Substances 0.000 description 1
- 208000030961 allergic reaction Diseases 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 108010086780 arginyl-glycyl-aspartyl-alanine Proteins 0.000 description 1
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 125000001797 benzyl group Chemical group [H]C1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])* 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 210000004204 blood vessel Anatomy 0.000 description 1
- 238000004422 calculation algorithm Methods 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 239000012876 carrier material Substances 0.000 description 1
- 210000000845 cartilage Anatomy 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000004956 cell adhesive effect Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 238000005253 cladding Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000004132 cross linking Methods 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 238000000855 fermentation Methods 0.000 description 1
- 230000004151 fermentation Effects 0.000 description 1
- 229940012952 fibrinogen Drugs 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 238000013467 fragmentation Methods 0.000 description 1
- 238000006062 fragmentation reaction Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 108010045624 glutamyl-lysyl-alanyl-histidyl-aspartyl-glycyl-glycyl-arginine Proteins 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- 238000009499 grossing Methods 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 230000000640 hydroxylating effect Effects 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 1
- 108010027338 isoleucylcysteine Proteins 0.000 description 1
- 230000006651 lactation Effects 0.000 description 1
- 210000002429 large intestine Anatomy 0.000 description 1
- 238000011031 large-scale manufacturing process Methods 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 108010003700 lysyl aspartic acid Proteins 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 239000008267 milk Substances 0.000 description 1
- 235000013336 milk Nutrition 0.000 description 1
- 210000004080 milk Anatomy 0.000 description 1
- 239000003471 mutagenic agent Substances 0.000 description 1
- 231100000707 mutagenic chemical Toxicity 0.000 description 1
- 210000002569 neuron Anatomy 0.000 description 1
- 235000016709 nutrition Nutrition 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 230000035790 physiological processes and functions Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 239000012460 protein solution Substances 0.000 description 1
- 238000002708 random mutagenesis Methods 0.000 description 1
- 239000002994 raw material Substances 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 238000012827 research and development Methods 0.000 description 1
- 235000009566 rice Nutrition 0.000 description 1
- 235000019515 salmon Nutrition 0.000 description 1
- 230000008313 sensitization Effects 0.000 description 1
- 108010071207 serylmethionine Proteins 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 210000003491 skin Anatomy 0.000 description 1
- 230000002269 spontaneous effect Effects 0.000 description 1
- 210000002435 tendon Anatomy 0.000 description 1
- 238000010998 test method Methods 0.000 description 1
- 210000000515 tooth Anatomy 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- 108010003137 tyrosyltyrosine Proteins 0.000 description 1
- 238000002525 ultrasonication Methods 0.000 description 1
- 108010011876 valyl-glycyl-valyl-alanyl-prolyl-glycine Proteins 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 230000010148 water-pollination Effects 0.000 description 1
- 108010000998 wheylin-2 peptide Proteins 0.000 description 1
- 230000037303 wrinkles Effects 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/78—Connective tissue peptides, e.g. collagen, elastin, laminin, fibronectin, vitronectin or cold insoluble globulin [CIG]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/62—DNA sequences coding for fusion proteins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/22—Vectors comprising a coding region that has been codon optimised for expression in a respective host
Landscapes
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Biomedical Technology (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biophysics (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Microbiology (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Toxicology (AREA)
- Gastroenterology & Hepatology (AREA)
- Medicinal Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Peptides Or Proteins (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
本发明涉及多肽、其生产方法和用途。所述多肽包含N端区域和C端区域。本发明的多肽具有显著的细胞黏附效果。
Description
技术领域
本发明属于基因工程技术领域,涉及多肽、其生产方法和用途。
背景技术:
胶原蛋白一般为白色、透明、无分支的原纤维,是皮肤和骨骼的基础支 撑物,可以占到蛋白质总量的25%~35%,主要分布于人体的皮肤、血管、骨 骼、筋腱、牙齿和软骨等处,是这些组织的主要基质和支架,保护并连结各 种组织,在体内发挥着重要的生理功能。因此,胶原蛋白可以广泛的应用在 医药和化妆品等行业中。
当前市场上销售的胶原蛋白产品都是取自猪、牛、鱼等动物组织中。以 胶原蛋白的氨基酸组成而言:哺乳动物猪、牛与人的相似度为95%,鱼与人 的相似度65%。虽然哺乳动物猪、牛等与人的胶原蛋白相似度较高,其仍然 难以避免病毒感染以及致敏性的危险。而食用鱼类所萃取的胶原蛋白人体的 利用率低于65%,无法完全被人体吸收与利用。所以其常用于食品运用,少 部分运用于化妆品,但无法被用于医疗器材或较精密的组织工程产品。所以, 目前的胶原蛋白只能在化妆品和保健品中使用,根本无法发挥胶原蛋白的原 本生物学功能。
从结构上来说,人体天然的胶原蛋白的结构非常的复杂,所以才导致人 源胶原蛋白极难通过常规手段表达和大量制备。胶原蛋白最普遍的结构特征 是由3条肽链形成的三螺旋结构,即由3条A肽链以右手超螺旋方式形成蛋白 质,这样的三股螺旋区域被称为胶原区域。每个A肽链在分子结构上都是由 重复出现的Gly-X-Y(X、Y代表Gly之外的任何氨基酸残基,X往往是Pro,Y 往往是Hyp)肽段构成左手螺旋,3条链在氨基酸残基的相互作用下,以同一 轴为中心,以右手超螺旋方式形成稳定的三股螺旋结构。在生物体中,胶原 蛋白的合成和修饰从原胶原开始,经历了羟基化、糖基化、相互交联等诸多 化学变化,受到了多种生物酶的复杂调控。原胶原除了含有胶原链之外,还 含有球状的头部和尾部。没有这些头部和尾部,胶原链就不会折叠成为正确 的三螺旋,从而缺乏胶原蛋白的生物学活性。因此,按照原始基因序列制备 的胶原蛋白不可能在体外自发的组织形成正确的空间结构。这样的困难严重 阻碍了人胶原蛋白的研发和生产。
生产胶原蛋白的传统方法是利用酸、碱、酶解法处理动物来源的组织, 提取胶原蛋白衍生物。这些方法提取的胶原蛋白本身已经丧失了原本的生物 学活性,无法应用于生物医学领域发挥真正的功能。大多数经过制备后的胶 原蛋白产品,拉伸强度较弱;纯胶原蛋白在体内降解较快,以及可能存在潜 在的抗原性等,同时由于胶原蛋白的来源、加工工艺和原料配比的差异,产 品的营养成分及饲用价值也不相同,且皮料在加工过程中不仅要接触许多化 学物质,且易受到细菌的感染,严重限制了胶原蛋白的应用。虽然国外研究 机构通过培育含人胶原蛋白基因的小鼠,得到了含有人胶原蛋白的乳汁,但 是这样生产的成本过高,生产周期过长,无法投入大规模生产。
发明内容
针对上述现有技术的缺陷,本发明提供了:
1.多肽,所述多肽包含SEQ ID No.1(人I型胶原蛋白COL1A1的全长序 列)中的至少60个连续的氨基酸残基的N端区域和包含以SEQ ID No.2所示 的序列GAPGPCCGG的C端区域,
其中优选地,所述多肽是胶原蛋白多肽,优选是人源胶原蛋白多肽;
其中优选地,所述N端区域为重复n次的基本重复单元,n为大于等于1 的整数,优选地为1、2、3、4、5、6、7、8、9或10,优选地n为4;
其中优选地,所述N端区域与C端区域是连续的或者间隔1个或多个氨基 酸残基。
2.根据1所述的多肽,其中所述N端区域包含选自SEQ ID No.3和SEQ ID No.4的序列。
3.1或2所述的多肽,其包含
a)SEQ ID No.5或SEQ ID No.6的氨基酸序列
b)与SEQ ID No.5或SEQ ID No.6的氨基酸序列具有90%、92%、95%、 96%、97%、98%或99%同一性的氨基酸序列,其保留SEQ ID No.5或SEQ ID No.6的氨基酸序列的细胞黏附效果;
c)SEQ ID No.5或SEQ ID No.6的氨基酸序列中添加、取代、缺失或插 入1个或多个氨基酸残基的氨基酸序列,优选为保守氨基酸取代,所述氨基 酸序列保留SEQ ID No.5或SEQ ID No.6的氨基酸序列的细胞黏附效果;或
d)由核苷酸序列编码的氨基酸序列,所述核苷酸序列与编码SEQ ID No. 5或SEQID No.6的氨基酸序列的多核苷酸序列在严格条件下杂交,其保留 SEQ ID No.5或SEQ IDNo.6的氨基酸序列的细胞黏附效果,所述严格条件 是中等严格条件,中-高严格条件,高严格条件或非常高严格条件。
4.多核苷酸,其编码根据1-3中任一项所述的多肽。
5.表达载体,其包含根据4所述的多核苷酸。
6.宿主细胞,其包含根据5所述的表达载体,其中所述宿主细胞优选是 大肠杆菌。
7.根据权利要求1所述的多肽的生产方法,其包括:
(1)在生产培养基中培养根据6所述的宿主细胞并生产多肽;
(2)收获并纯化多肽;和
(3)任选地对多肽进行酶切。
8.组合物,优选医疗器材、组织工程产品、化妆品或保健品,其包含根 据1所述的多肽。
9.根据1所述的多肽在制备组合物,优选医疗器材、组织工程产品、化 妆品、保健品中的用途。
10.根据1所述的多肽促进细胞黏附的用途。
与现有技术相比,本发明具有以下特点:
(1)本发明首次选择的I型胶原蛋白序列为长期筛选优化的序列;相比于 I型胶原蛋白的其它片段具有显著更好的细胞黏附效果。本发明的重组I型胶 原蛋白片段以较短的序列长度便更好地实现了I型胶原蛋白片段的细胞黏附 活性。
(2)采用大肠杆菌表达系统,适于大规模放大,20小时即可完成一轮发 酵,生产成本非常低,由于对基因序列进行了大肠杆菌的密码子优化以及选 用2×YT培养基,使得产量非常大;
(3)生产的重组人源胶原蛋白具有非常好的亲水性和稳定性,其氨基酸 组成与天然胶原蛋白氨基酸序列相应部分100%相同,应用于人体不会产生 免疫排斥和过敏反应,可以广泛应用于生物医药和化妆品行业;
(4)本发明产品经过活性检测,具有达到甚至超过人体天然蛋白的生物 学活性,是目前报道过的最优化的胶原蛋白设计方案,可以在人体中行使天 然蛋白的功能,达到真正的产品应用的目的。
附图说明
图1为用于本发明载体pET32a-C1S4T(SEQ ID NO.5)、pET32a-C1S5T (SEQ IDNO.6)构建的质粒pET32a的图谱;
图2为本发明C1S4T蛋白纯化酶切后得到的目的蛋白电泳图;C1S4T蛋白 的电泳检测分子量约为36kDa,对应于包含SEQ ID NO.5的氨基酸序列的蛋 白质。
图3为本发明C1S5T蛋白纯化酶切后得到的目的蛋白电泳图;C1S5T蛋白 的电泳检测分子量约为36kDa,对应于包含SEQ ID NO.6的氨基酸序列的蛋 白质。
图4为本发明C1S4T蛋白与人胶原蛋白相比较的生物活性检测结果。
图5为本发明C1S5T蛋白与人胶原蛋白相比较的生物活性检测结果。
具体实施方式
下文提供进一步的描述以便于理解本发明。
如本文中使用,“医疗器械”是指直接或者间接用于人体的仪器、设备、 器具、体外诊断试剂及校准物、材料以及其他类似或者相关的物品。
如本文中使用,“组织工程产品”是指用于组织工程的产品。组织工程是 一门以细胞生物学和材料科学相结合,进行体外或体内构建组织或器官的新 兴学科。
在本发明中,选择I型胶原蛋白序列为筛选优化的序列。所述人胶原I型 COL1A1的序列是NCBI参照序列:NP_000079(SEQ ID No.1),参见 https://www.ncbi.nlm.nih.gov/protein/NP_000079.2
MFSFVDLRLLLLLAATALLTHGQEEGQVEGQDEDIPPITCVQNGLRYHDR DVWKPEPCRICVCDNGKVLCDDVICDETKNCPGAEVPEGECCPVCPDGS ESPTDQETTGVEGPKGDTGPRGPRGPAGPPGRDGIPGQPGLPGPPGPPGPP GPPGLGGNFAPQLSYGYDEKSTGGISVPGPMGPSGPRGLPGPPGAPGPQG FQGPPGEPGEPGASGPMGPRGPPGPPGKNGDDGEAGKPGRPGERGPPGP QGARGLPGTAGLPGMKGHRGFSGLDGAKGDAGPAGPKGEPGSPGENGA PGQMGPRGLPGERGRPGAPGPAGARGNDGATGAAGPPGPTGPAGPPGFP GAVGAKGEAGPQGPRGSEGPQGVRGEPGPPGPAGAAGPAGNPGADGQP GAKGANGAPGIAGAPGFPGARGPSGPQGPGGPPGPKGNSGEPGAPGSKG DTGAKGEPGPVGVQGPPGPAGEEGKRGARGEPGPTGLPGPPGERGGPGS RGFPGADGVAGPKGPAGERGSPGPAGPKGSPGEAGRPGEAGLPGAKGLT GSPGSPGPDGKTGPPGPAGQDGRPGPPGPPGARGQAGVMGFPGPKGAAGEPGKAGERGVPGPPGAVGPAGKDGEAGAQGPPGPAGPAGERGEQGPAGSPGFQGLPGPAGPPGEAGKPGEQGVPGD LGAPGPSGARGERGFPGERGVQGPPGPAGPRGANGAPGNDGAKGDAGAPGAPGSQGAPGLQGMPGE RGAAGLPGPKGDRGDAGPKGADGSPGKDGVRGLTGPIGPPGPAGAPGDK GESGPSGPAGPTGARGAPGDRGEPGPPGPAGFAGPPGADGQPGAKGEPG DAGAKGDAGPPGPAGPAGPPGPIGNVGAPGAKGARGSAGPPGATGFPGA AGRVGPPGPSGNAGPPGPPGPAGKEGGKGPRGETGPAGRPGEVGPPGPPG PAGEKGSPGADGPAGAPGTPGPQGIAGQRGVVGLPG QRGERGFPGLPGPSGEPGKQGPSGASGERGPPGPMGPPGLAGPPGESGREGAPGAEGSP GRDGSPGAKGDRGETGPAGPPGAPGAPGAPGPVGPAGKSGDRGETGPAG PAGPVGPVGARGPAGPQGPRGDKGETGEQGDRGIKGHRGFSGLQGPPGP PGSPGEQGPSGASGPAGPRGPPGSAGAPGKDGLNGLPGPIGPPGPRGRTG DAGPVGPPGPPGPPGPPGPPSAGFDFSFLPQPPQEKAHDGGRYYRADDAN VVRDRDLEVDTTLKSLSQQIENIRSPEGSRKNPARTCRDLKMCHSDWKSG EYWIDPNQGCNLDAIKVFCNMETGETCVYPTQPSVAQKNWYISKNPKDK RHVWFGESMTDGFQFEYGGQGSDPADVAIQLTFLRLMSTEASQNITYHC KNSVAYMDQQTGNLKKALLLQGSNEIEIRAEGNSRFTYSVTVDGCTSHTG AWGKTVIEYKTTKTSRLPIIDVAPLDVGAPDQEFGFDVGPVCFL (SEQ ID No.1)
上述序列中粗体下划线部分即为本发明选择的氨基酸序列。申请人经过 大量的研究发现,选择的上述序列比商品化的人胶原蛋白或SEQ ID No.1中 的其他序列实现更好的黏附效果。在本发明中,多肽不是SEQ ID No.1的全 长序列。
本发明部分基于以下发现:包含SEQ ID No.1中的至少60个连续的氨基酸 残基的N端区域和以SEQ ID No.2所示的序列GAPGPCCGG的C端区域的多 肽能够比商品化的人胶原蛋白实现更好的黏附效果,如实施例证明。本领域 技术人员可以适当选择构成N端区域的连续的氨基酸残基。例如,连续的氨 基酸残基的长度可以是60-100、72-84等等。
在本发明中,对几种具体的N端区域进行了测试:
(1)GPAGSPGFQGLPGPAGPPGEAGKPGEQGVPGDLGAPGPSGARGER GFPGERGVQGPPGPA(SEQID No.3);
(2)GEKGSPGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPS GEPGKQGPSGAS(SEQID No.4);
在本发明中,C端氨基酸序列可以为GAPGPCCGG(SEQ ID No.2),该序 列增强胶原活性的端序列肽段。
多肽在本文中可以是重组人源胶原蛋白C1S4T,为单链结构,包括氨基酸 249个,基本重复单元为 GPAGSPGFQGLPGPAGPPGEAGKPGEQGVPGDLGAPGPSGARGERGFPGE RGVQGPPGPA(SEQ ID No.3),为人胶原蛋白I型肽段,C端氨基酸序列为 GAPGPCCGG(SEQ ID No.2),为增强胶原活性的端序列肽段。C1S4T的氨 基酸序列如下: GPAGSPGFQGLPGPAGPPGEAGKPGEQGVPGDLGAPGPSGARGERGFPGE RGVQGPPGPAGPAGSPGFQGLPGPAGPPGEAGKPGEQGVPGDLGAPGPSG ARGERGFPGERGVQGPPGPAGPAGSPGFQGLPGPAGPPGEAGKPGEQGVP GDLGAPGPSGARGERGFPGERGVQGPPGPAGPAGSPGFQGLPGPAGPPGE AGKPGEQGVPGDLGAPGPSGARGERGFPGERGVQGPPGPAGAPGPCCGG (SEQID No.5)。C1S4T的DNA序列如下: GGTCCGGCCGGTAGCCCGGGTTTTCAAGGTCTGCCGGGTCCCGCTGGTCCTCCGGGTGAGGCTGGTAAACCCGGTGAGCAAGGTGTTCCCGGTGAT CTGGGTGCACCGGGTCCGAGTGGTGCACGTGGTGAGCGTGGCTTTCCG GGTGAGCGTGGCGTTCAAGGTCCCCCCGGTCCGGCTGGTCCGGCTGGT AGTCCCGGTTTCCAAGGTCTGCCCGGTCCCGCTGGTCCTCCGGGTGAA GCCGGTAAACCGGGCGAGCAAGGTGTTCCGGGTGATTTAGGTGCCCCC GGTCCGAGCGGTGCACGTGGTGAGCGCGGCTTCCCGGGTGAACGCGG TGTTCAAGGTCCCCCCGGTCCGGCTGGTCCCGCTGGTAGTCCGGGTTT CCAAGGTTTACCCGGTCCGGCTGGTCCCCCCGGTGAAGCTGGTAAACC GGGTGAACAAGGTGTTCCGGGTGATTTAGGCGCACCCGGTCCTAGCGG TGCACGTGGTGAGCGTGGCTTCCCGGGTGAACGTGGTGTTCAAGGTCC GCCCGGTCCCGCTGGTCCGGCTGGTAGCCCCGGTTTCCAAGGTCTGCC GGGTCCGGCTGGTCCTCCGGGCGAAGCTGGTAAGCCGGGTGAGCAAG GTGTGCCGGGTGACTTAGGTGCACCGGGTCCGAGTGGTGCACGTGGC GAGCGTGGTTTTCCGGGCGAACGTGGTGTTCAAGGTCCGCCGGGTCCGGCCGGTGCACCGGGTCCGTGTTGTGGTGGT(SEQ ID No.7)。
多肽在本文中可以是人源胶原蛋白C1S5T,为单链结构,包括249个氨基 酸,基本重复单元为 GEKGSPGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSG EPGKQGPSGAS(SEQID No.4),为人胶原蛋白I型肽段,C端氨基酸序列 为GAPGPCCGG(SEQ ID No.2),为增强胶原活性的端序列肽段。C1S5T的 氨基酸序列如下: GEKGSPGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSG EPGKQGPSGASGEKGSPGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGE RGFPGLPGPSGEPGKQGPSGASGEKGSPGADGPAGAPGTPGPQGIAGQRG VVGLPGQRGERGFPGLPGPSGEPGKQGPSGASGEKGSPGADGPAGAPGTP GPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEPGKQGPSGASGAPGPCC GG(SEQ IDNo.6)。C1S5T的DNA序列如下: GGTGAAAAAGGCAGCCCGGGTGCCGATGGTCCCGCTGGTGCACCGGG TACACCGGGTCCTCAAGGTATTGCCGGTCAACGTGGTGTTGTGGGTCT GCCGGGTCAGCGTGGTGAACGCGGTTTTCCGGGTCTGCCGGGTCCGA GTGGTGAACCGGGTAAACAAGGTCCGAGCGGTGCCAGTGGTGAAAAA GGTAGCCCGGGTGCAGACGGTCCCGCTGGTGCCCCCGGTACACCGGGT CCTCAAGGCATTGCTGGTCAGCGTGGCGTTGTGGGTCTGCCCGGTCAG CGTGGCGAGCGTGGTTTTCCCGGTTTACCCGGTCCGAGTGGCGAGCCC GGTAAGCAAGGTCCGAGTGGTGCCAGCGGTGAGAAGGGTAGTCCGGG TGCAGACGGTCCCGCTGGTGCCCCCGGTACCCCGGGTCCGCAAGGTAT TGCTGGTCAACGTGGTGTTGTTGGTTTACCGGGTCAGCGCGGCGAACG TGGTTTCCCGGGTCTGCCCGGTCCGAGTGGCGAGCCGGGTAAGCAAG GTCCGAGCGGCGCAAGCGGCGAAAAAGGTAGTCCGGGTGCAGATGGTCCCGCTGGTGCACCGGGTACACCGGGTCCTCAAGGTATCGCTGGTCAG CGCGGTGTTGTTGGTCTGCCGGGTCAACGCGGTGAACGTGGTTTCCCG GGTCTGCCCGGTCCGAGTGGCGAACCGGGTAAACAAGGTCCGAGCGG TGCCAGCGGTGCACCGGGTCCGTGTTGTGGTGGT(SEQ ID No.8)。
在本发明中,重组人源胶原蛋白可以通过本领域中常规的方法进行。 例如,可以如下步骤生产:(1)大肠杆菌基因工程菌的构建;(2)大肠杆菌基 因工程菌的发酵培养;(3)重组人源胶原蛋白的诱导和表达;以及(4)重组人 源胶原蛋白的纯化和任选的酶切。
在步骤(1)中,大肠杆菌基因工程菌的构建可以如下进行:(1)利用PCR 方法对人源性I型胶原蛋白的基因螺旋区的DNA片段进行密码子优化和拼接 重组,最终得到目的基因片段;(2)将得到的目的基因片段插入PET-32a表达 载体中得到重组表达质粒;(3)将重组表达质粒转入大肠杆菌感受态细胞 BL21(DE3)中,筛选得到阳性大肠杆菌基因工程菌。
在步骤(2)与(3)中,大肠杆菌基因工程菌的发酵培养和重组人源胶原蛋 白的诱导和表达可以如下进行:(1)从LAB平板中挑取优选后的大肠杆菌基因 工程菌单菌落,置于10ml的LB培养基中37℃,220rpm培养12-16小时;(2)将 菌液按照1:100接种到2×YT培养基中放大培养,37℃培养约3小时,待OD600在0.4-0.6时,加入终浓度为0.5mM IPTG进行诱导,16℃继续培养20小时, 离心收集菌体。
在步骤(4)中,重组人源胶原蛋白多肽的纯化和酶切可以如下进行:(1) 用磷酸盐缓冲液(40mM NaH2PO3,500mM NaCl,pH 7.8)重悬细菌,超声破 碎,离心收集上清液;(2)利用NI-NTA亲和柱结合重组人源胶原蛋白,10mM 咪唑漂洗杂蛋白后,加入PrescissionProtease(PPase)蛋白酶4℃,16h柱上酶 切,最后获得目的胶原蛋白多肽。
大肠杆菌仅仅是例示性的。宿主细胞可以是真核细胞,例如真菌和酵母, 原核细胞,例如肠杆菌科细菌。应当理解,本领域技术人员可以通过用其它 表达菌株替换上述大肠杆菌菌株作为宿主细胞。
本发明的肽包含以SEQ ID No.5或SEQ ID No.6所示的序列或或以 SEQ ID No.5或SEQ ID No.6所示的序列中取代、缺失、插入和/或添加一 个、两个或多个氨基酸的序列,所述肽显示黏附活性。“多个”可以是2、3、 4、5、6、7、或8个。
氨基酸取代意味着在相同位置,某个氨基酸残基被其他氨基酸残基替 代。插入的氨基酸残基可以在任何位置插入,插入的氨基酸残基也可以全部 或部分彼此相邻,或插入的氨基酸之间都不彼此相邻。可以从SEQ ID NO:1 的序列中删除1、2或3个氨基酸,只要所述肽显示抑制神经细胞的活性或 除皱活性。
本领域技术人员已知,本发明所述的肽可以在氨基酸序列之间的一个或 多个位置进行翻译后修饰。翻译后修饰的例子可以包括磷酸化作用、乙酰化 作用和脱酰氨基作用。
本发明还提供SEQ ID No.5或6所示的肽的类似物,只要类似物显示细 胞黏附活性。这些类似物与天然的肽差别可以是氨基酸序列上的差异,也可 以是不影响序列的修饰形式上的差异,或者兼而有之。这些肽包括天然或诱 导的遗传变异体。诱导变异体可以通过各种技术得到,如通过辐射或暴露于 诱变剂而产生随机诱变,还可通过定点诱变法或其他已知分子生物学的技 术。类似物还包括具有不同于天然L-氨基酸的残基(如D-氨基酸)的类似物, 以及具有非天然存在的或合成的氨基酸(如β、γ-氨基酸)的类似物。
在本发明中,取代可以是保守氨基酸取代,指与SEQ ID NO:5或6的 氨基酸序列相比,有至多3个,更佳地至多2个氨基酸或1个氨基酸被性质 相似或相近的氨基酸所替换而形成肽。这些保守性变异肽可以根据表1进行 氨基酸替换而产生。
最初的残基 | 代表性的取代 | 优选的取代 |
Ala(A) | Val;Leu;Ile | Val |
Arg(R) | Lys;Gln;Asn | Lys |
Asn(N) | Gln;His;Lys;Arg | Gln |
Asp(D) | Glu | Glu |
Cys(C) | Ser | Ser |
Gln(Q) | Asn | Asn |
Glu(E) | Asp | Asp |
Gly(G) | Pro;Ala | Ala |
His(H) | Asn;Gln;Lys;Arg | Arg |
Ile(I) | Leu;Val;Met;Ala;Phe | Leu |
Leu(L) | Ile;Val;Met;Ala;Phe | Ile |
Lys(K) | Arg;Gln;Asn | Arg |
Met(M) | Leu;Phe;Ile | Leu |
Phe(F) | Leu;Val;Ile;Ala;Tyr | Leu |
Pro(P) | Ala | Ala |
Ser(S) | Thr | Thr |
Thr(T) | Ser | Ser |
Trp(W) | Tyr;Phe | Tyr |
Tyr(Y) | Trp;Phe;Thr;Ser | Phe |
Val(V) | Ile;Leu;Met;Phe;Ala | Leu |
序列同一性:参数“序列同一性”描述两个氨基酸序列之间或两个核苷酸 序列之间的相关性。
就本发明而言,两个氨基酸序列之间的序列同一性程度使用如EMBOSS 软件包(EMBOSS:The European Molecular Biology Open Software Suite,Rice 等,2000,Trends Genet.16:276-277),优选3.0.0版或更高版本的Needle程序中 所执行的Needleman-Wunsch算法(Needleman和Wunsch,1970,J.Mol.Biol. 48:443-453)来测定。使用的可选参数为缺口罚分(gap penalty)10,缺口延伸罚 分(gap extension penalty)0.5和EBLOSUM62(BLOSUM62的EMBOSS版)取代 矩阵。使用Needle标记为“最高同一性(longestidentity)”的输出结果(使用 -nobrief选项获得)作为同一性百分比,并计算如下:
(同样的残基×100)/(比对长度-比对中缺口的总数)
就本发明而言,杂交表示多核苷酸在非常低至非常高的严格条件下与标 记的核酸探针杂交,所述核酸探针对应于SEQ ID NO:5或6的多肽编码序列。 可使用例如X射线片(X-ray film)检测在这些条件下与核酸探针杂交的分子。
对于长度至少100个核苷酸的长探针,将非常低至非常高的严格条件定 义为在42℃,在5X SSPE、0.3%SDS、200微克/ml已剪切并且变性的鲑精 DNA中,并且对于非常低和低严格性为25%的甲酰胺、对于中和中-高严格 性为35%的甲酰胺、或对于高和非常高严格性为50%的甲酰胺,根据标准的 Southern印迹法进行预杂交和杂交最佳12至24小时。使用2X SSC、0.2%SDS 至少在45℃(非常低严格性),至少在50℃(低严格性),至少在55℃(中严格性), 至少在60℃(中-高严格性),至少在65℃(高严格性),至少在70℃(非常高严格 性)将载体材料最终洗涤三次,每次15分钟。
实施例
提供以下实施例来阐述本发明。本领域技术人员应当理解实施例仅仅是 例示性的而非限制性的。本发明仅仅由所附权利要求书的范围限定。
实施例1:重组人源胶原蛋白多肽的构建及表达
C1S4T基因表达载体的构建及表达
1.实施例1中使用的人源胶原蛋白C1S4T全长基因序列以SEQ ID No.7 显示。该序列已经针对大肠杆菌的密码子进行了密码子优化。
2.C1S4T基因全长747bp,根据优化后的C1S4T密码子基因序列SEQ ID No.7,委托上海华津生物科技有限公司进行基因片段的合成,并将合成后的 C1S4T基因片段通过BamHI(NEB公司货号:R0136L)和Xho I(NEB公司, 货号:R0146L)的酶切位点插入PET32a表达载体。将该构建成功的表达质粒 转化大肠杆菌感受态细胞BL21(DE3)(Merck公司)。具体过程为:1:取1μl的 该质粒于100μl的大肠杆菌感受态细胞BL21(DE3)中,冰上静置30min。2:将该混合物于42℃水浴锅中热激90s,然后迅速置于冰上静置2min。3:向该混 合物中加入600μl无抗性的LB,37℃,220rpm条件下培养1h。4:取200μl该 菌液均匀的涂布在含有氨苄青霉素抗性的LB平板上(10g/L蛋白胨,5g/L酵母 提取物,10g/L氯化钠,15g/L琼脂,100μg/ml氨苄抗生素)。5:将平板倒置 培养于37℃温箱中,培养约20h待长出清晰可见的菌落。
3.从转化好的LB平板中挑取单克隆菌落于10ml LB(含100μg/ml氨苄抗 生素)培养基中培养12h-16h后,再按照1:100的比例转接到2×YT培养基(16g/L 蛋白胨,10g/L酵母提取物,5g/L氯化钠)中进行扩大培养,37℃,220rpm培 养至菌液OD600在0.4-0.6时,加入终浓度为0.5mM IPTG(Sigma公司,货号: I5502-1G)进行诱导表达,诱导条件为18℃、180rpm培养20h。最后离心收集 菌体,保存于-20℃或者立即进入下步纯化。
4.用磷酸盐缓冲液(pH 7.8)(40mM磷酸二氢钠,500mM氯化钠)约50ml 重悬(1L)菌体沉淀,利用高压破菌仪器(新芝生物)进行破菌后,13000rpm离 心30min,使可溶性蛋白与包涵体充分分离。
5.用5倍柱体积的结合缓冲液(Binding buffer)(40mM NaH2PO3,500mM NaCl,pH7.8)平衡Ni-NTA(Qiagen公司,货号:30210)亲和柱。然后加入 蛋白上清于4℃条件下孵育0.5-1h,使目的重组蛋白充分结合到柱材上。再用 200ml含有10mM咪唑(Sigma公司)的洗涤缓冲液(washing buffer)(10mM咪 唑,40mM NaH2PO3,500mM NaCl,pH 7.8)漂洗杂蛋白。最后加入适量具 有His标签的Prescission Protease(简称PPase)(Sigma,SAE0045)蛋白酶,于4℃ 孵育16h后,收集穿流液,即为去除载体蛋白的目的胶原蛋白。所得产物透 析过夜,冻干为干粉待用。
6.所得C1S4T蛋白利用SDS-PAGE检测纯度。具体过程为:取纯化后的蛋 白液40μl,加入10μl 5×的蛋白上样缓冲液(250mM的Tris-HCl(pH:6.8), 10%SDS,0.5%溴酚蓝,50%甘油,5%β-巯基乙醇),置于100℃沸水中煮 10min,然后每孔10μl加入SDS-PAGE蛋白胶中,电压80V跑2h后,用考马斯 亮蓝染色液(0.1%考马斯亮蓝R-250,25%异丙醇,10%冰醋酸)进行蛋白染 色20min,再利用蛋白脱色液(10%醋酸,5%乙醇)进行脱色。最后与人天然胶原蛋白相对照测量蛋白活性。
C1S5T基因表达载体的构建及表达
1.人源胶原蛋白C1S5T全长基因序列以SEQ ID No.8显示。该序列已经 针对大肠杆菌的密码子进行了密码子优化。
2.C1S5T基因全长747bp,根据优化后的C1S5T密码子基因序列,委托上 海华津生物科技有限公司进行基因片段的合成,并将合成后的C1S5T基因片 段通过BamH I(NEB公司货号:R0136L)和Xho I(NEB公司,货号:R0146L) 的酶切位点插入PET32a表达载体。将该构建成功的表达质粒转化大肠杆菌感 受态细胞BL21(DE3)(Merck公司)。具体过程为:1:取1μl的该质粒于100μl 的大肠杆菌感受态细胞BL21(DE3)中,冰上静置30min。2:将该混合物于42℃ 水浴锅中热激90s,然后迅速置于冰上静置2min。3:向该混合物中加入600μl 无抗性的LB,37℃,220rpm条件下培养1h。4:取200μl该菌液均匀的涂布在 含有氨苄青霉素抗性的LAB平板上(10g/L蛋白胨,5g/L酵母提取物,10g/L氯 化钠,15g/L琼脂,100μg/ml氨苄抗生素)。5:将平板倒置培养于37℃温箱中, 培养约20h待长出清晰可见的菌落。
3.从转化好的LB平板中挑取单克隆菌落于10ml LB(含100μg/ml氨苄抗 生素)培养基中培养12h-16h后,再按照1:100的比例转接到2×YT培养基(16g/L 蛋白胨,10g/L酵母提取物,5g/L氯化钠)中进行扩大培养,37℃,220rpm培 养至菌液OD600在0.4-0.6时,加入终浓度为0.5mM IPTG(Sigma公司,货号: I5502-1G)进行诱导表达,诱导条件为18℃、180rpm培养20h。最后离心收集 菌体,保存于-20℃或者立即进入下步纯化。
4.用磷酸盐缓冲液(pH 7.8)(40mM磷酸二氢钠,500mM氯化钠)约50ml 重悬(1L)菌体沉淀,利用高压破菌仪器(新芝生物)进行破菌后,13000rpm离 心30min,使可溶性蛋白与包涵体充分分离。
5.用5倍柱体积的结合缓冲液(Binding buffer)(40mM NaH2PO3,500mM NaCl,pH7.8)平衡Ni-NTA亲和柱(Qiagen公司,货号:30210)。然后加入蛋白 上清于4℃条件下孵育0.5-1h,使目的重组蛋白充分结合到柱材上。再用200ml 含有10mM咪唑(Sigma公司)的洗涤缓冲液(washing buffer)(10mM咪唑, 40mM NaH2PO3,500mM NaCl,pH 7.8)漂洗杂蛋白。最后加入适量具有His 标签的Prescission Protease(简称PPase)蛋白酶(Sigma,SAE0045),于4℃孵育 16h后,收集穿流液,即为去除载体蛋白的目的胶原蛋白。所得产物透析过 夜,冻干为干粉待用。
6.所得C1S5T蛋白利用SDS-PAGE检测纯度。具体过程为:取纯化后的 蛋白液40μl,加入10μl 5×的蛋白上样缓冲液(250mM的Tris-HCl(pH:6.8), 10%SDS,0.5%溴酚蓝,50%甘油,5%β-巯基乙醇),置于100℃沸水中煮 10min,然后每孔10μl加入SDS-PAGE蛋白胶中,电压80V跑2h后,用考马斯 亮蓝染色液(0.1%考马斯亮蓝R-250,25%异丙醇,10%冰醋酸)进行蛋白染 色20min,再利用蛋白脱色液(10%醋酸,5%乙醇)进行脱色。最后与人天然胶原蛋白相对照测量蛋白活性。
结果
图2和图3的电泳图表明得到表观分子量为36kDa的C1S4T和C1S5T,分 子量分别对应于SEQ ID NO:5和6的氨基酸序列的多肽,表明多肽C1S4T和 C1S5T得到正确表达。
实施例2C1S4T和C1S5T蛋白的活性检测
胶原蛋白的活性检测方法可以参考文献Juming Yao,Satoshi Yanagisawa,Tetsuo Asakura,Design,Expression and Characterization of Collagen-LikeProteins Based on the Cell Adhesive and Crosslinking Sequences Derived fromNative Collagens,J Biochem.136,643-649(2004)。具体实施方法如下:
1、利用紫外吸收法检测待测蛋白样品的浓度,包括对照人胶原蛋白 (Sigma,C7774)、C1S4T和C1S5T蛋白样品。具体为分别测定样品在215nm 和225nm下的紫外光吸收,利用经验公式C(μg/mL)=144X(A215-A225)计算 蛋白质浓度,注意需在A215<1.5的情况下检测。该方法的原理是测定肽键在 远紫外光下的特征吸收,不受生色团含量的影响,干扰物质少,操作简便, 适合检测考马斯亮蓝不显色的人胶原蛋白及其类似物。(参考文献为Walker JM.The Protein Protocols Handbook,second edition.Humana Press.43-45.)。检 测完蛋白浓度后,用PBS将所有待测蛋白浓度调整到0.5mg/ml。
2、向96孔板中加入100μl各种蛋白溶液和空白PBS溶液对照,室温静置 60min。
3、每孔中加入105个培养状态良好的3T3细胞(来自清华大学童佩老师), 37℃孵育60min。
4、每孔用PBS清洗4次。
5、用LDH检测试剂盒(Roche,04744926001)检测OD492nm的吸光度。 根据空白对照的数值,可以计算出细胞的贴壁率。计算公式如下:细胞贴壁 率=(测试孔-空白孔)×100%/(阳性孔-空白孔)。细胞的贴壁率即可以反 应胶原蛋白的活性。蛋白的活性越高,越能在短时间给细胞提供优质的外环 境,帮助细胞贴壁。
结果参见图4和图5。
图4和图5的结果表明,两种人重组胶原蛋白(即C1S4T和C1S5T)与商品 化的人胶原蛋白相比,皆具有相当或更好的黏附活性。本发明的重组I型胶 原蛋白C1S4T和C1S5T实现了I型胶原蛋白的细胞黏附活性。
序列表
<110> 山西锦波生物医药股份有限公司
<120> 多肽、其生产方法和用途
<130> C18P2983
<160> 8
<170> PatentIn version 3.5
<210> 1
<211> 1464
<212> PRT
<213> 人
<400> 1
Met Phe Ser Phe Val Asp Leu Arg Leu Leu Leu Leu Leu Ala Ala Thr
1 5 10 15
Ala Leu Leu Thr His Gly Gln Glu Glu Gly Gln Val Glu Gly Gln Asp
20 25 30
Glu Asp Ile Pro Pro Ile Thr Cys Val Gln Asn Gly Leu Arg Tyr His
35 40 45
Asp Arg Asp Val Trp Lys Pro Glu Pro Cys Arg Ile Cys Val Cys Asp
50 55 60
Asn Gly Lys Val Leu Cys Asp Asp Val Ile Cys Asp Glu Thr Lys Asn
65 70 75 80
Cys Pro Gly Ala Glu Val Pro Glu Gly Glu Cys Cys Pro Val Cys Pro
85 90 95
Asp Gly Ser Glu Ser Pro Thr Asp Gln Glu Thr Thr Gly Val Glu Gly
100 105 110
Pro Lys Gly Asp Thr Gly Pro Arg Gly Pro Arg Gly Pro Ala Gly Pro
115 120 125
Pro Gly Arg Asp Gly Ile Pro Gly Gln Pro Gly Leu Pro Gly Pro Pro
130 135 140
Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Leu Gly Gly Asn Phe Ala
145 150 155 160
Pro Gln Leu Ser Tyr Gly Tyr Asp Glu Lys Ser Thr Gly Gly Ile Ser
165 170 175
Val Pro Gly Pro Met Gly Pro Ser Gly Pro Arg Gly Leu Pro Gly Pro
180 185 190
Pro Gly Ala Pro Gly Pro Gln Gly Phe Gln Gly Pro Pro Gly Glu Pro
195 200 205
Gly Glu Pro Gly Ala Ser Gly Pro Met Gly Pro Arg Gly Pro Pro Gly
210 215 220
Pro Pro Gly Lys Asn Gly Asp Asp Gly Glu Ala Gly Lys Pro Gly Arg
225 230 235 240
Pro Gly Glu Arg Gly Pro Pro Gly Pro Gln Gly Ala Arg Gly Leu Pro
245 250 255
Gly Thr Ala Gly Leu Pro Gly Met Lys Gly His Arg Gly Phe Ser Gly
260 265 270
Leu Asp Gly Ala Lys Gly Asp Ala Gly Pro Ala Gly Pro Lys Gly Glu
275 280 285
Pro Gly Ser Pro Gly Glu Asn Gly Ala Pro Gly Gln Met Gly Pro Arg
290 295 300
Gly Leu Pro Gly Glu Arg Gly Arg Pro Gly Ala Pro Gly Pro Ala Gly
305 310 315 320
Ala Arg Gly Asn Asp Gly Ala Thr Gly Ala Ala Gly Pro Pro Gly Pro
325 330 335
Thr Gly Pro Ala Gly Pro Pro Gly Phe Pro Gly Ala Val Gly Ala Lys
340 345 350
Gly Glu Ala Gly Pro Gln Gly Pro Arg Gly Ser Glu Gly Pro Gln Gly
355 360 365
Val Arg Gly Glu Pro Gly Pro Pro Gly Pro Ala Gly Ala Ala Gly Pro
370 375 380
Ala Gly Asn Pro Gly Ala Asp Gly Gln Pro Gly Ala Lys Gly Ala Asn
385 390 395 400
Gly Ala Pro Gly Ile Ala Gly Ala Pro Gly Phe Pro Gly Ala Arg Gly
405 410 415
Pro Ser Gly Pro Gln Gly Pro Gly Gly Pro Pro Gly Pro Lys Gly Asn
420 425 430
Ser Gly Glu Pro Gly Ala Pro Gly Ser Lys Gly Asp Thr Gly Ala Lys
435 440 445
Gly Glu Pro Gly Pro Val Gly Val Gln Gly Pro Pro Gly Pro Ala Gly
450 455 460
Glu Glu Gly Lys Arg Gly Ala Arg Gly Glu Pro Gly Pro Thr Gly Leu
465 470 475 480
Pro Gly Pro Pro Gly Glu Arg Gly Gly Pro Gly Ser Arg Gly Phe Pro
485 490 495
Gly Ala Asp Gly Val Ala Gly Pro Lys Gly Pro Ala Gly Glu Arg Gly
500 505 510
Ser Pro Gly Pro Ala Gly Pro Lys Gly Ser Pro Gly Glu Ala Gly Arg
515 520 525
Pro Gly Glu Ala Gly Leu Pro Gly Ala Lys Gly Leu Thr Gly Ser Pro
530 535 540
Gly Ser Pro Gly Pro Asp Gly Lys Thr Gly Pro Pro Gly Pro Ala Gly
545 550 555 560
Gln Asp Gly Arg Pro Gly Pro Pro Gly Pro Pro Gly Ala Arg Gly Gln
565 570 575
Ala Gly Val Met Gly Phe Pro Gly Pro Lys Gly Ala Ala Gly Glu Pro
580 585 590
Gly Lys Ala Gly Glu Arg Gly Val Pro Gly Pro Pro Gly Ala Val Gly
595 600 605
Pro Ala Gly Lys Asp Gly Glu Ala Gly Ala Gln Gly Pro Pro Gly Pro
610 615 620
Ala Gly Pro Ala Gly Glu Arg Gly Glu Gln Gly Pro Ala Gly Ser Pro
625 630 635 640
Gly Phe Gln Gly Leu Pro Gly Pro Ala Gly Pro Pro Gly Glu Ala Gly
645 650 655
Lys Pro Gly Glu Gln Gly Val Pro Gly Asp Leu Gly Ala Pro Gly Pro
660 665 670
Ser Gly Ala Arg Gly Glu Arg Gly Phe Pro Gly Glu Arg Gly Val Gln
675 680 685
Gly Pro Pro Gly Pro Ala Gly Pro Arg Gly Ala Asn Gly Ala Pro Gly
690 695 700
Asn Asp Gly Ala Lys Gly Asp Ala Gly Ala Pro Gly Ala Pro Gly Ser
705 710 715 720
Gln Gly Ala Pro Gly Leu Gln Gly Met Pro Gly Glu Arg Gly Ala Ala
725 730 735
Gly Leu Pro Gly Pro Lys Gly Asp Arg Gly Asp Ala Gly Pro Lys Gly
740 745 750
Ala Asp Gly Ser Pro Gly Lys Asp Gly Val Arg Gly Leu Thr Gly Pro
755 760 765
Ile Gly Pro Pro Gly Pro Ala Gly Ala Pro Gly Asp Lys Gly Glu Ser
770 775 780
Gly Pro Ser Gly Pro Ala Gly Pro Thr Gly Ala Arg Gly Ala Pro Gly
785 790 795 800
Asp Arg Gly Glu Pro Gly Pro Pro Gly Pro Ala Gly Phe Ala Gly Pro
805 810 815
Pro Gly Ala Asp Gly Gln Pro Gly Ala Lys Gly Glu Pro Gly Asp Ala
820 825 830
Gly Ala Lys Gly Asp Ala Gly Pro Pro Gly Pro Ala Gly Pro Ala Gly
835 840 845
Pro Pro Gly Pro Ile Gly Asn Val Gly Ala Pro Gly Ala Lys Gly Ala
850 855 860
Arg Gly Ser Ala Gly Pro Pro Gly Ala Thr Gly Phe Pro Gly Ala Ala
865 870 875 880
Gly Arg Val Gly Pro Pro Gly Pro Ser Gly Asn Ala Gly Pro Pro Gly
885 890 895
Pro Pro Gly Pro Ala Gly Lys Glu Gly Gly Lys Gly Pro Arg Gly Glu
900 905 910
Thr Gly Pro Ala Gly Arg Pro Gly Glu Val Gly Pro Pro Gly Pro Pro
915 920 925
Gly Pro Ala Gly Glu Lys Gly Ser Pro Gly Ala Asp Gly Pro Ala Gly
930 935 940
Ala Pro Gly Thr Pro Gly Pro Gln Gly Ile Ala Gly Gln Arg Gly Val
945 950 955 960
Val Gly Leu Pro Gly Gln Arg Gly Glu Arg Gly Phe Pro Gly Leu Pro
965 970 975
Gly Pro Ser Gly Glu Pro Gly Lys Gln Gly Pro Ser Gly Ala Ser Gly
980 985 990
Glu Arg Gly Pro Pro Gly Pro Met Gly Pro Pro Gly Leu Ala Gly Pro
995 1000 1005
Pro Gly Glu Ser Gly Arg Glu Gly Ala Pro Gly Ala Glu Gly Ser
1010 1015 1020
Pro Gly Arg Asp Gly Ser Pro Gly Ala Lys Gly Asp Arg Gly Glu
1025 1030 1035
Thr Gly Pro Ala Gly Pro Pro Gly Ala Pro Gly Ala Pro Gly Ala
1040 1045 1050
Pro Gly Pro Val Gly Pro Ala Gly Lys Ser Gly Asp Arg Gly Glu
1055 1060 1065
Thr Gly Pro Ala Gly Pro Ala Gly Pro Val Gly Pro Val Gly Ala
1070 1075 1080
Arg Gly Pro Ala Gly Pro Gln Gly Pro Arg Gly Asp Lys Gly Glu
1085 1090 1095
Thr Gly Glu Gln Gly Asp Arg Gly Ile Lys Gly His Arg Gly Phe
1100 1105 1110
Ser Gly Leu Gln Gly Pro Pro Gly Pro Pro Gly Ser Pro Gly Glu
1115 1120 1125
Gln Gly Pro Ser Gly Ala Ser Gly Pro Ala Gly Pro Arg Gly Pro
1130 1135 1140
Pro Gly Ser Ala Gly Ala Pro Gly Lys Asp Gly Leu Asn Gly Leu
1145 1150 1155
Pro Gly Pro Ile Gly Pro Pro Gly Pro Arg Gly Arg Thr Gly Asp
1160 1165 1170
Ala Gly Pro Val Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Pro
1175 1180 1185
Pro Gly Pro Pro Ser Ala Gly Phe Asp Phe Ser Phe Leu Pro Gln
1190 1195 1200
Pro Pro Gln Glu Lys Ala His Asp Gly Gly Arg Tyr Tyr Arg Ala
1205 1210 1215
Asp Asp Ala Asn Val Val Arg Asp Arg Asp Leu Glu Val Asp Thr
1220 1225 1230
Thr Leu Lys Ser Leu Ser Gln Gln Ile Glu Asn Ile Arg Ser Pro
1235 1240 1245
Glu Gly Ser Arg Lys Asn Pro Ala Arg Thr Cys Arg Asp Leu Lys
1250 1255 1260
Met Cys His Ser Asp Trp Lys Ser Gly Glu Tyr Trp Ile Asp Pro
1265 1270 1275
Asn Gln Gly Cys Asn Leu Asp Ala Ile Lys Val Phe Cys Asn Met
1280 1285 1290
Glu Thr Gly Glu Thr Cys Val Tyr Pro Thr Gln Pro Ser Val Ala
1295 1300 1305
Gln Lys Asn Trp Tyr Ile Ser Lys Asn Pro Lys Asp Lys Arg His
1310 1315 1320
Val Trp Phe Gly Glu Ser Met Thr Asp Gly Phe Gln Phe Glu Tyr
1325 1330 1335
Gly Gly Gln Gly Ser Asp Pro Ala Asp Val Ala Ile Gln Leu Thr
1340 1345 1350
Phe Leu Arg Leu Met Ser Thr Glu Ala Ser Gln Asn Ile Thr Tyr
1355 1360 1365
His Cys Lys Asn Ser Val Ala Tyr Met Asp Gln Gln Thr Gly Asn
1370 1375 1380
Leu Lys Lys Ala Leu Leu Leu Gln Gly Ser Asn Glu Ile Glu Ile
1385 1390 1395
Arg Ala Glu Gly Asn Ser Arg Phe Thr Tyr Ser Val Thr Val Asp
1400 1405 1410
Gly Cys Thr Ser His Thr Gly Ala Trp Gly Lys Thr Val Ile Glu
1415 1420 1425
Tyr Lys Thr Thr Lys Thr Ser Arg Leu Pro Ile Ile Asp Val Ala
1430 1435 1440
Pro Leu Asp Val Gly Ala Pro Asp Gln Glu Phe Gly Phe Asp Val
1445 1450 1455
Gly Pro Val Cys Phe Leu
1460
<210> 2
<211> 9
<212> PRT
<213> 人工序列
<220>
<223> 多肽C端序列
<400> 2
Gly Ala Pro Gly Pro Cys Cys Gly Gly
1 5
<210> 3
<211> 60
<212> PRT
<213> 人工序列
<220>
<223> N端重复单元1
<400> 3
Gly Pro Ala Gly Ser Pro Gly Phe Gln Gly Leu Pro Gly Pro Ala Gly
1 5 10 15
Pro Pro Gly Glu Ala Gly Lys Pro Gly Glu Gln Gly Val Pro Gly Asp
20 25 30
Leu Gly Ala Pro Gly Pro Ser Gly Ala Arg Gly Glu Arg Gly Phe Pro
35 40 45
Gly Glu Arg Gly Val Gln Gly Pro Pro Gly Pro Ala
50 55 60
<210> 4
<211> 60
<212> PRT
<213> 人工序列
<220>
<223> N端重复单元2
<400> 4
Gly Glu Lys Gly Ser Pro Gly Ala Asp Gly Pro Ala Gly Ala Pro Gly
1 5 10 15
Thr Pro Gly Pro Gln Gly Ile Ala Gly Gln Arg Gly Val Val Gly Leu
20 25 30
Pro Gly Gln Arg Gly Glu Arg Gly Phe Pro Gly Leu Pro Gly Pro Ser
35 40 45
Gly Glu Pro Gly Lys Gln Gly Pro Ser Gly Ala Ser
50 55 60
<210> 5
<211> 249
<212> PRT
<213> 人工序列
<220>
<223> C1S4T的氨基酸序列
<400> 5
Gly Pro Ala Gly Ser Pro Gly Phe Gln Gly Leu Pro Gly Pro Ala Gly
1 5 10 15
Pro Pro Gly Glu Ala Gly Lys Pro Gly Glu Gln Gly Val Pro Gly Asp
20 25 30
Leu Gly Ala Pro Gly Pro Ser Gly Ala Arg Gly Glu Arg Gly Phe Pro
35 40 45
Gly Glu Arg Gly Val Gln Gly Pro Pro Gly Pro Ala Gly Pro Ala Gly
50 55 60
Ser Pro Gly Phe Gln Gly Leu Pro Gly Pro Ala Gly Pro Pro Gly Glu
65 70 75 80
Ala Gly Lys Pro Gly Glu Gln Gly Val Pro Gly Asp Leu Gly Ala Pro
85 90 95
Gly Pro Ser Gly Ala Arg Gly Glu Arg Gly Phe Pro Gly Glu Arg Gly
100 105 110
Val Gln Gly Pro Pro Gly Pro Ala Gly Pro Ala Gly Ser Pro Gly Phe
115 120 125
Gln Gly Leu Pro Gly Pro Ala Gly Pro Pro Gly Glu Ala Gly Lys Pro
130 135 140
Gly Glu Gln Gly Val Pro Gly Asp Leu Gly Ala Pro Gly Pro Ser Gly
145 150 155 160
Ala Arg Gly Glu Arg Gly Phe Pro Gly Glu Arg Gly Val Gln Gly Pro
165 170 175
Pro Gly Pro Ala Gly Pro Ala Gly Ser Pro Gly Phe Gln Gly Leu Pro
180 185 190
Gly Pro Ala Gly Pro Pro Gly Glu Ala Gly Lys Pro Gly Glu Gln Gly
195 200 205
Val Pro Gly Asp Leu Gly Ala Pro Gly Pro Ser Gly Ala Arg Gly Glu
210 215 220
Arg Gly Phe Pro Gly Glu Arg Gly Val Gln Gly Pro Pro Gly Pro Ala
225 230 235 240
Gly Ala Pro Gly Pro Cys Cys Gly Gly
245
<210> 6
<211> 249
<212> PRT
<213> 人工序列
<220>
<223> C1S5T的氨基酸序列
<400> 6
Gly Glu Lys Gly Ser Pro Gly Ala Asp Gly Pro Ala Gly Ala Pro Gly
1 5 10 15
Thr Pro Gly Pro Gln Gly Ile Ala Gly Gln Arg Gly Val Val Gly Leu
20 25 30
Pro Gly Gln Arg Gly Glu Arg Gly Phe Pro Gly Leu Pro Gly Pro Ser
35 40 45
Gly Glu Pro Gly Lys Gln Gly Pro Ser Gly Ala Ser Gly Glu Lys Gly
50 55 60
Ser Pro Gly Ala Asp Gly Pro Ala Gly Ala Pro Gly Thr Pro Gly Pro
65 70 75 80
Gln Gly Ile Ala Gly Gln Arg Gly Val Val Gly Leu Pro Gly Gln Arg
85 90 95
Gly Glu Arg Gly Phe Pro Gly Leu Pro Gly Pro Ser Gly Glu Pro Gly
100 105 110
Lys Gln Gly Pro Ser Gly Ala Ser Gly Glu Lys Gly Ser Pro Gly Ala
115 120 125
Asp Gly Pro Ala Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Ile Ala
130 135 140
Gly Gln Arg Gly Val Val Gly Leu Pro Gly Gln Arg Gly Glu Arg Gly
145 150 155 160
Phe Pro Gly Leu Pro Gly Pro Ser Gly Glu Pro Gly Lys Gln Gly Pro
165 170 175
Ser Gly Ala Ser Gly Glu Lys Gly Ser Pro Gly Ala Asp Gly Pro Ala
180 185 190
Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Ile Ala Gly Gln Arg Gly
195 200 205
Val Val Gly Leu Pro Gly Gln Arg Gly Glu Arg Gly Phe Pro Gly Leu
210 215 220
Pro Gly Pro Ser Gly Glu Pro Gly Lys Gln Gly Pro Ser Gly Ala Ser
225 230 235 240
Gly Ala Pro Gly Pro Cys Cys Gly Gly
245
<210> 7
<211> 747
<212> DNA
<213> 人工序列
<220>
<223> C1S4T的DNA序列
<400> 7
ggtccggccg gtagcccggg ttttcaaggt ctgccgggtc ccgctggtcc tccgggtgag 60
gctggtaaac ccggtgagca aggtgttccc ggtgatctgg gtgcaccggg tccgagtggt 120
gcacgtggtg agcgtggctt tccgggtgag cgtggcgttc aaggtccccc cggtccggct 180
ggtccggctg gtagtcccgg tttccaaggt ctgcccggtc ccgctggtcc tccgggtgaa 240
gccggtaaac cgggcgagca aggtgttccg ggtgatttag gtgcccccgg tccgagcggt 300
gcacgtggtg agcgcggctt cccgggtgaa cgcggtgttc aaggtccccc cggtccggct 360
ggtcccgctg gtagtccggg tttccaaggt ttacccggtc cggctggtcc ccccggtgaa 420
gctggtaaac cgggtgaaca aggtgttccg ggtgatttag gcgcacccgg tcctagcggt 480
gcacgtggtg agcgtggctt cccgggtgaa cgtggtgttc aaggtccgcc cggtcccgct 540
ggtccggctg gtagccccgg tttccaaggt ctgccgggtc cggctggtcc tccgggcgaa 600
gctggtaagc cgggtgagca aggtgtgccg ggtgacttag gtgcaccggg tccgagtggt 660
gcacgtggcg agcgtggttt tccgggcgaa cgtggtgttc aaggtccgcc gggtccggcc 720
ggtgcaccgg gtccgtgttg tggtggt 747
<210> 8
<211> 747
<212> DNA
<213> 人工序列
<220>
<223> C1S5T的DNA序列
<400> 8
ggtgaaaaag gcagcccggg tgccgatggt cccgctggtg caccgggtac accgggtcct 60
caaggtattg ccggtcaacg tggtgttgtg ggtctgccgg gtcagcgtgg tgaacgcggt 120
tttccgggtc tgccgggtcc gagtggtgaa ccgggtaaac aaggtccgag cggtgccagt 180
ggtgaaaaag gtagcccggg tgcagacggt cccgctggtg cccccggtac accgggtcct 240
caaggcattg ctggtcagcg tggcgttgtg ggtctgcccg gtcagcgtgg cgagcgtggt 300
tttcccggtt tacccggtcc gagtggcgag cccggtaagc aaggtccgag tggtgccagc 360
ggtgagaagg gtagtccggg tgcagacggt cccgctggtg cccccggtac cccgggtccg 420
caaggtattg ctggtcaacg tggtgttgtt ggtttaccgg gtcagcgcgg cgaacgtggt 480
ttcccgggtc tgcccggtcc gagtggcgag ccgggtaagc aaggtccgag cggcgcaagc 540
ggcgaaaaag gtagtccggg tgcagatggt cccgctggtg caccgggtac accgggtcct 600
caaggtatcg ctggtcagcg cggtgttgtt ggtctgccgg gtcaacgcgg tgaacgtggt 660
ttcccgggtc tgcccggtcc gagtggcgaa ccgggtaaac aaggtccgag cggtgccagc 720
ggtgcaccgg gtccgtgttg tggtggt 747
Claims (12)
1.多肽,所述多肽由以SEQ ID No. 5或SEQ ID No. 6所示的氨基酸序列组成。
2.多核苷酸,其编码根据权利要求1所述的多肽。
3.表达载体,其包含根据权利要求2所述的多核苷酸。
4.宿主细胞,其包含根据权利要求3所述的表达载体。
5.根据权利要求4所述的宿主细胞,其中所述宿主细胞是大肠杆菌。
6.根据权利要求1所述的多肽的生产方法,其包括:
(1) 在生产培养基中培养根据权利要求4或5所述的宿主细胞并生产多肽;和
(2) 收获并纯化多肽。
7.根据权利要求6所述的生产方法,所述方法包括对多肽进行酶切的步骤。
8.组合物,其包含根据权利要求1所述的多肽。
9.根据权利要求8所述的组合物,其中所述组合物是医疗器材、组织工程产品、化妆品或保健品。
10.根据权利要求1所述的多肽在制备组合物中的用途。
11.根据权利要求10所述的用途,其中所述组合物是医疗器材、组织工程产品、化妆品或保健品。
12.根据权利要求1所述的多肽在制备用于促进细胞黏附的药物中的用途。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811254050.7A CN109293783B (zh) | 2018-10-25 | 2018-10-25 | 多肽、其生产方法和用途 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811254050.7A CN109293783B (zh) | 2018-10-25 | 2018-10-25 | 多肽、其生产方法和用途 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109293783A CN109293783A (zh) | 2019-02-01 |
CN109293783B true CN109293783B (zh) | 2019-10-11 |
Family
ID=65157920
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811254050.7A Active CN109293783B (zh) | 2018-10-25 | 2018-10-25 | 多肽、其生产方法和用途 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109293783B (zh) |
Families Citing this family (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109988234A (zh) * | 2019-02-20 | 2019-07-09 | 江苏悦智生物医药有限公司 | 酵母重组人源I型胶原α1链蛋白、合成方法及其应用 |
CN110845603B (zh) * | 2019-10-31 | 2021-07-02 | 山西锦波生物医药股份有限公司 | 人胶原蛋白17型多肽、其生产方法和用途 |
CN113621052B (zh) * | 2021-08-23 | 2022-06-07 | 山西锦波生物医药股份有限公司 | 一种重组i型人源化胶原蛋白多肽及其制备方法和用途 |
CN113730557B (zh) * | 2021-09-03 | 2023-12-22 | 山西锦波生物医药股份有限公司 | 重组i型人源化胶原蛋白在盆底修复中的用途 |
CN113683681B (zh) * | 2021-09-15 | 2023-10-03 | 山西锦波生物医药股份有限公司 | 一种重组i型人源化胶原蛋白c1l3t及其制备方法和用途 |
CN113788891B (zh) * | 2021-09-15 | 2023-10-31 | 山西锦波生物医药股份有限公司 | 一种重组i型人源化胶原蛋白c1l4t及其制备方法和用途 |
CN113683679B (zh) * | 2021-09-15 | 2023-10-31 | 山西锦波生物医药股份有限公司 | 一种重组i型人源化胶原蛋白c1l6t及其制备方法和用途 |
CN113683680B (zh) * | 2021-09-15 | 2023-11-03 | 山西锦波生物医药股份有限公司 | 一种重组ⅰ型人源化胶原蛋白c1l1t及其制备方法和用途 |
CN113683678B (zh) * | 2021-09-15 | 2023-11-24 | 山西锦波生物医药股份有限公司 | 一种重组i型人源化胶原蛋白c1l2t及其制备方法和用途 |
CN114316030B (zh) * | 2022-01-27 | 2023-06-30 | 西安巨子生物基因技术股份有限公司 | 一种透皮吸收性的i型重组胶原蛋白及其用途 |
CN116813793A (zh) * | 2022-05-13 | 2023-09-29 | 山西锦波生物医药股份有限公司 | 融合蛋白以及其应用 |
CN114940712B (zh) * | 2022-06-01 | 2023-12-26 | 山西锦波生物医药股份有限公司 | 一种生物合成人体结构性材料的制备方法 |
CN114920827B (zh) * | 2022-06-29 | 2023-06-09 | 山西锦波生物医药股份有限公司 | 多肽及其用途 |
CN116715754B (zh) * | 2023-05-12 | 2024-03-12 | 山西锦波生物医药股份有限公司 | 多肽及其用途 |
CN118146301A (zh) * | 2024-01-04 | 2024-06-07 | 广州白云山花城药业有限公司 | 一种促成骨细胞分化的活性肽及其应用 |
-
2018
- 2018-10-25 CN CN201811254050.7A patent/CN109293783B/zh active Active
Non-Patent Citations (3)
Title |
---|
类人胶原蛋白真核表达载体的构建及在毕赤酵母中的分泌表达;徐立群;《中国优秀硕士学位论文全文数据库 基础科学辑》;20131215(第S2期);A006-189 * |
重组生产胶原蛋白的研究进展;杨立霞 等;《河北化工》;20070720;第30卷(第7期);43-46 * |
重组胶原蛋白多肽在大肠杆菌中的表达优化;吴铭 等;《湖北农业科学》;20140520;第53卷(第10期);2443-2447 * |
Also Published As
Publication number | Publication date |
---|---|
CN109293783A (zh) | 2019-02-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109293783B (zh) | 多肽、其生产方法和用途 | |
CN109593126B (zh) | 多肽、其生产方法和用途 | |
CN108822209A (zh) | 多肽、其生产方法和用途 | |
CN110845603B (zh) | 人胶原蛋白17型多肽、其生产方法和用途 | |
EP4317181A1 (en) | Recombinant type-i humanized collagen polypeptide, and preparation method therefor and use thereof | |
CN103122027B (zh) | 一种重组人源胶原蛋白及其生产方法 | |
Liu et al. | Extraction and characterization of acid-and pepsin-soluble collagens from the scales, skins and swim-bladders of grass carp (Ctenopharyngodon idella) | |
US5607849A (en) | Gene encoding transglutaminase derived from fish | |
Peng et al. | A Streptococcus pyogenes derived collagen-like protein as a non-cytotoxic and non-immunogenic cross-linkable biomaterial | |
CN113683679B (zh) | 一种重组i型人源化胶原蛋白c1l6t及其制备方法和用途 | |
CN113683680B (zh) | 一种重组ⅰ型人源化胶原蛋白c1l1t及其制备方法和用途 | |
CN114106150B (zh) | 重组胶原蛋白、制备方法及其应用 | |
WO2024199081A1 (zh) | 一种生物合成人体结构性材料的制备方法 | |
Zhao et al. | Green biomanufacturing in recombinant collagen biosynthesis: trends and selection in various expression systems | |
CN115521371B (zh) | 一种重组人源化iii型胶原蛋白、制备方法及应用 | |
CN112876569B (zh) | 一种rhTSG6-FNⅢ1-C融合蛋白及其在皮肤护理组合物中的用途和其制备方法 | |
CN109021071A (zh) | 肽及其制备方法和用途 | |
CN116874590B (zh) | 一种重组iii型胶原蛋白及其制备方法 | |
CN116355076A (zh) | 一种重组多肽及其制备方法和应用 | |
KR101626758B1 (ko) | 피부 투과성을 갖는 인간 염기성 섬유아세포 성장인자의 개발, 생산 및 화장품 조성물 | |
CN110498859B (zh) | 重组PLB-hbFGF融合蛋白及其应用 | |
Wu et al. | Molecular modification, expression and purification of new subtype antioxidant peptide from Pinctada fucata by recombinant Escherichia coli to improve antioxidant-activity | |
KR101784288B1 (ko) | 피부 세포 증식 효과가 증가한 성장 분화 인자 11과 열 충격 단백질 10의 융합단백질 및 이를 유효성분으로 함유하는 피부 주름 개선용 화장료 조성물 | |
CN117843763A (zh) | 生物合成人体结构性材料xvii型胶原蛋白的方法 | |
CN117024569A (zh) | 生物合成人体结构性材料viii型胶原蛋白的方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CP02 | Change in the address of a patent holder |
Address after: 030032 No.18, Jinbo street, Tanghuai Park, Taiyuan comprehensive reform demonstration zone, Taiyuan City, Shanxi Province Patentee after: SHANXI JINBO BIOMEDICAL Co.,Ltd. Address before: 030032 Shanxi city of Taiyuan province by the economic and Technological Development Zone No. 18 North Street Patentee before: SHANXI JINBO BIOMEDICAL Co.,Ltd. |
|
CP02 | Change in the address of a patent holder |