KR20080074134A - 재조합 콜라겐 유사 단백질의 생산 - Google Patents
재조합 콜라겐 유사 단백질의 생산 Download PDFInfo
- Publication number
- KR20080074134A KR20080074134A KR1020087012272A KR20087012272A KR20080074134A KR 20080074134 A KR20080074134 A KR 20080074134A KR 1020087012272 A KR1020087012272 A KR 1020087012272A KR 20087012272 A KR20087012272 A KR 20087012272A KR 20080074134 A KR20080074134 A KR 20080074134A
- Authority
- KR
- South Korea
- Prior art keywords
- gly
- ala
- pro
- ser
- protein
- Prior art date
Links
- 238000004519 manufacturing process Methods 0.000 title claims abstract description 20
- 101710137510 Saimiri transformation-associated protein Proteins 0.000 title claims abstract description 19
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 87
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 79
- 238000000034 method Methods 0.000 claims abstract description 29
- 210000005253 yeast cell Anatomy 0.000 claims abstract description 23
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 claims abstract description 16
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 claims abstract description 16
- 230000004186 co-expression Effects 0.000 claims abstract description 9
- 239000003814 drug Substances 0.000 claims abstract description 3
- 241000237536 Mytilus edulis Species 0.000 claims description 84
- 108010035532 Collagen Proteins 0.000 claims description 79
- 102000008186 Collagen Human genes 0.000 claims description 79
- 229920001436 collagen Polymers 0.000 claims description 78
- 235000020638 mussel Nutrition 0.000 claims description 70
- 150000007523 nucleic acids Chemical class 0.000 claims description 39
- 108010043005 Prolyl Hydroxylases Proteins 0.000 claims description 38
- 102000004079 Prolyl Hydroxylases Human genes 0.000 claims description 38
- 108020004707 nucleic acids Proteins 0.000 claims description 31
- 102000039446 nucleic acids Human genes 0.000 claims description 31
- 108010076504 Protein Sorting Signals Proteins 0.000 claims description 28
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims description 28
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 claims description 28
- 239000012634 fragment Substances 0.000 claims description 23
- 239000000463 material Substances 0.000 claims description 22
- 102000004190 Enzymes Human genes 0.000 claims description 17
- 108090000790 Enzymes Proteins 0.000 claims description 17
- 210000004027 cell Anatomy 0.000 claims description 17
- 239000013604 expression vector Substances 0.000 claims description 17
- 102000016942 Elastin Human genes 0.000 claims description 8
- 108010014258 Elastin Proteins 0.000 claims description 8
- 108010022355 Fibroins Proteins 0.000 claims description 8
- 229920002549 elastin Polymers 0.000 claims description 8
- SBKVPJHMSUXZTA-MEJXFZFPSA-N (2S)-2-[[(2S)-2-[[(2S)-1-[(2S)-5-amino-2-[[2-[[(2S)-1-[(2S)-6-amino-2-[[(2S)-2-[[(2S)-5-amino-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-amino-3-(1H-indol-3-yl)propanoyl]amino]-3-(1H-imidazol-4-yl)propanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-4-methylpentanoyl]amino]-5-oxopentanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]pyrrolidine-2-carbonyl]amino]acetyl]amino]-5-oxopentanoyl]pyrrolidine-2-carbonyl]amino]-4-methylsulfanylbutanoyl]amino]-3-(4-hydroxyphenyl)propanoic acid Chemical compound C([C@@H](C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CNC=N1 SBKVPJHMSUXZTA-MEJXFZFPSA-N 0.000 claims description 7
- 101710140452 Mating factor alpha-1 Proteins 0.000 claims description 7
- 239000013612 plasmid Substances 0.000 claims description 7
- 238000009987 spinning Methods 0.000 claims description 7
- 239000004744 fabric Substances 0.000 claims description 6
- 241000235058 Komagataella pastoris Species 0.000 claims description 5
- 239000010985 leather Substances 0.000 claims description 5
- 210000002435 tendon Anatomy 0.000 claims description 5
- 239000004753 textile Substances 0.000 claims description 5
- 241000222122 Candida albicans Species 0.000 claims description 4
- 241001147138 Mytilus galloprovincialis Species 0.000 claims description 4
- 241000235648 Pichia Species 0.000 claims description 4
- 241000235070 Saccharomyces Species 0.000 claims description 4
- 229940095731 candida albicans Drugs 0.000 claims description 4
- 210000000845 cartilage Anatomy 0.000 claims description 4
- 239000003356 suture material Substances 0.000 claims description 4
- 102100039556 Galectin-4 Human genes 0.000 claims description 3
- 101000608765 Homo sapiens Galectin-4 Proteins 0.000 claims description 3
- 239000011248 coating agent Substances 0.000 claims description 3
- 238000000576 coating method Methods 0.000 claims description 3
- 239000002537 cosmetic Substances 0.000 claims description 3
- 230000001939 inductive effect Effects 0.000 claims description 3
- 238000000465 moulding Methods 0.000 claims description 3
- 239000000123 paper Substances 0.000 claims description 3
- 101100327917 Caenorhabditis elegans chup-1 gene Proteins 0.000 claims description 2
- 101710111073 External scaffolding protein D Proteins 0.000 claims description 2
- 101150094690 GAL1 gene Proteins 0.000 claims description 2
- 102100028501 Galanin peptides Human genes 0.000 claims description 2
- 101100121078 Homo sapiens GAL gene Proteins 0.000 claims description 2
- 241000882864 Mytilus sp. Species 0.000 claims description 2
- 101100386089 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) MET17 gene Proteins 0.000 claims description 2
- 241000235347 Schizosaccharomyces pombe Species 0.000 claims description 2
- 230000001131 transforming effect Effects 0.000 claims description 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims 3
- 239000000969 carrier Substances 0.000 claims 1
- 229940079593 drug Drugs 0.000 claims 1
- 210000002472 endoplasmic reticulum Anatomy 0.000 claims 1
- 238000002360 preparation method Methods 0.000 claims 1
- 230000001105 regulatory effect Effects 0.000 claims 1
- 238000001356 surgical procedure Methods 0.000 claims 1
- 238000005516 engineering process Methods 0.000 abstract description 3
- 235000018102 proteins Nutrition 0.000 description 64
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 51
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 41
- 238000012163 sequencing technique Methods 0.000 description 40
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 33
- 108010029020 prolylglycine Proteins 0.000 description 32
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 27
- 150000001413 amino acids Chemical group 0.000 description 25
- 108010078144 glutaminyl-glycine Proteins 0.000 description 25
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 24
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 20
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 20
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 20
- 230000015572 biosynthetic process Effects 0.000 description 19
- 108010079364 N-glycylalanine Proteins 0.000 description 18
- 108010047495 alanylglycine Proteins 0.000 description 18
- 235000001014 amino acid Nutrition 0.000 description 18
- 239000000835 fiber Substances 0.000 description 17
- 230000014509 gene expression Effects 0.000 description 17
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 17
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 16
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 16
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 16
- 108010043293 glycyl-prolyl-glycyl-glycine Proteins 0.000 description 16
- 229940024606 amino acid Drugs 0.000 description 15
- 238000004132 cross linking Methods 0.000 description 15
- NSORZJXKUQFEKL-JGVFFNPUSA-N Gln-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)N)N)C(=O)O NSORZJXKUQFEKL-JGVFFNPUSA-N 0.000 description 14
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 14
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 14
- 108010077515 glycylproline Proteins 0.000 description 14
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 13
- PEZMQPADLFXCJJ-ZETCQYMHSA-N 2-[[2-[[(2s)-1-(2-aminoacetyl)pyrrolidine-2-carbonyl]amino]acetyl]amino]acetic acid Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(=O)NCC(O)=O PEZMQPADLFXCJJ-ZETCQYMHSA-N 0.000 description 12
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 12
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 12
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 12
- 108010050848 glycylleucine Proteins 0.000 description 12
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 11
- 108091028043 Nucleic acid sequence Proteins 0.000 description 11
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 11
- 229940088598 enzyme Drugs 0.000 description 11
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 11
- CAVKXZMMDNOZJU-UHFFFAOYSA-N Gly-Pro-Ala-Gly-Pro Natural products C1CCC(C(O)=O)N1C(=O)CNC(=O)C(C)NC(=O)C1CCCN1C(=O)CN CAVKXZMMDNOZJU-UHFFFAOYSA-N 0.000 description 10
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 10
- 229960004502 levodopa Drugs 0.000 description 10
- 238000003786 synthesis reaction Methods 0.000 description 10
- WOJJIRYPFAZEPF-YFKPBYRVSA-N 2-[[(2s)-2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]propanoyl]amino]acetate Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)CNC(=O)CN WOJJIRYPFAZEPF-YFKPBYRVSA-N 0.000 description 9
- JDIQCVUDDFENPU-ZKWXMUAHSA-N Ala-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CNC=N1 JDIQCVUDDFENPU-ZKWXMUAHSA-N 0.000 description 9
- 108020004414 DNA Proteins 0.000 description 9
- WTDRDQBEARUVNC-LURJTMIESA-N L-DOPA Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C(O)=C1 WTDRDQBEARUVNC-LURJTMIESA-N 0.000 description 9
- PXHVJJICTQNCMI-UHFFFAOYSA-N Nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 description 9
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 9
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 9
- 108010010147 glycylglutamine Proteins 0.000 description 9
- 108010064235 lysylglycine Proteins 0.000 description 9
- 238000006467 substitution reaction Methods 0.000 description 9
- OTEWWRBKGONZBW-UHFFFAOYSA-N 2-[[2-[[2-[(2-azaniumylacetyl)amino]-4-methylpentanoyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NC(CC(C)C)C(=O)NCC(=O)NCC(O)=O OTEWWRBKGONZBW-UHFFFAOYSA-N 0.000 description 8
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 8
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 8
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 8
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 8
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 8
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 8
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 8
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 8
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 8
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 8
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 8
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 8
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 8
- 239000000126 substance Substances 0.000 description 8
- 239000013598 vector Substances 0.000 description 8
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 7
- VSXBYIJUAXPAAL-WDSKDSINSA-N Gln-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O VSXBYIJUAXPAAL-WDSKDSINSA-N 0.000 description 7
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 7
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 7
- HJARVELKOSZUEW-YUMQZZPRSA-N Gly-Pro-Gln Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJARVELKOSZUEW-YUMQZZPRSA-N 0.000 description 7
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 7
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 7
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 7
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 7
- 108020004511 Recombinant DNA Proteins 0.000 description 7
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 7
- 108010070944 alanylhistidine Proteins 0.000 description 7
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 7
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 7
- 108010073093 leucyl-glycyl-glycyl-glycine Proteins 0.000 description 7
- 229910021645 metal ion Inorganic materials 0.000 description 7
- 230000008569 process Effects 0.000 description 7
- 108010061238 threonyl-glycine Proteins 0.000 description 7
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 6
- VWEWCZSUWOEEFM-WDSKDSINSA-N Ala-Gly-Ala-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(=O)NCC(O)=O VWEWCZSUWOEEFM-WDSKDSINSA-N 0.000 description 6
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 6
- FQCILXROGNOZON-YUMQZZPRSA-N Gln-Pro-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O FQCILXROGNOZON-YUMQZZPRSA-N 0.000 description 6
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 6
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 6
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 6
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 6
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 6
- XEJTYSCIXKYSHR-WDSKDSINSA-N Gly-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN XEJTYSCIXKYSHR-WDSKDSINSA-N 0.000 description 6
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 6
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 6
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 6
- GAAHQHNCMIAYEX-UWVGGRQHSA-N Gly-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GAAHQHNCMIAYEX-UWVGGRQHSA-N 0.000 description 6
- IPIVXQQRZXEUGW-UWJYBYFXSA-N His-Ala-His Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IPIVXQQRZXEUGW-UWJYBYFXSA-N 0.000 description 6
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 6
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 6
- 239000000853 adhesive Substances 0.000 description 6
- 230000001070 adhesive effect Effects 0.000 description 6
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 6
- ROOXNKNUYICQNP-UHFFFAOYSA-N ammonium persulfate Chemical compound [NH4+].[NH4+].[O-]S(=O)(=O)OOS([O-])(=O)=O ROOXNKNUYICQNP-UHFFFAOYSA-N 0.000 description 6
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 6
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 6
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 6
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 6
- 108010054666 glycyl-leucyl-glycyl-glycine Proteins 0.000 description 6
- 108010015792 glycyllysine Proteins 0.000 description 6
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 6
- 210000003660 reticulum Anatomy 0.000 description 6
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 5
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 5
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 5
- YNSGXDWWPCGGQS-YUMQZZPRSA-N Arg-Gly-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O YNSGXDWWPCGGQS-YUMQZZPRSA-N 0.000 description 5
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 5
- 238000005033 Fourier transform infrared spectroscopy Methods 0.000 description 5
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 5
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 5
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 5
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 5
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 5
- PMMYEEVYMWASQN-DMTCNVIQSA-N Hydroxyproline Chemical compound O[C@H]1CN[C@H](C(O)=O)C1 PMMYEEVYMWASQN-DMTCNVIQSA-N 0.000 description 5
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 5
- 241000880493 Leptailurus serval Species 0.000 description 5
- 241000237524 Mytilus Species 0.000 description 5
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 5
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 5
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 5
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 5
- 238000004458 analytical method Methods 0.000 description 5
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 5
- 108010047857 aspartylglycine Proteins 0.000 description 5
- 238000004630 atomic force microscopy Methods 0.000 description 5
- 238000010367 cloning Methods 0.000 description 5
- 238000012217 deletion Methods 0.000 description 5
- 230000037430 deletion Effects 0.000 description 5
- 108010009297 diglycyl-histidine Proteins 0.000 description 5
- 108010089804 glycyl-threonine Proteins 0.000 description 5
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 5
- 238000009396 hybridization Methods 0.000 description 5
- 238000003780 insertion Methods 0.000 description 5
- 230000037431 insertion Effects 0.000 description 5
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 5
- 239000000203 mixture Substances 0.000 description 5
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 4
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 4
- HGKHPCFTRQDHCU-IUCAKERBSA-N Arg-Pro-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HGKHPCFTRQDHCU-IUCAKERBSA-N 0.000 description 4
- 108010051330 Arg-Pro-Gly-Pro Proteins 0.000 description 4
- 108091028026 C-DNA Proteins 0.000 description 4
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 4
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 4
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 4
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 4
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 4
- RFQATBGBLDAKGI-VHSXEESVSA-N Lys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCCN)N)C(=O)O RFQATBGBLDAKGI-VHSXEESVSA-N 0.000 description 4
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 4
- FYUIFUJFNCLUIX-XVYDVKMFSA-N Ser-His-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O FYUIFUJFNCLUIX-XVYDVKMFSA-N 0.000 description 4
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 4
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 4
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 4
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 4
- 238000000149 argon plasma sintering Methods 0.000 description 4
- 108010077245 asparaginyl-proline Proteins 0.000 description 4
- 108010092854 aspartyllysine Proteins 0.000 description 4
- PMMYEEVYMWASQN-UHFFFAOYSA-N dl-hydroxyproline Natural products OC1C[NH2+]C(C([O-])=O)C1 PMMYEEVYMWASQN-UHFFFAOYSA-N 0.000 description 4
- 238000001493 electron microscopy Methods 0.000 description 4
- 238000001825 field-flow fractionation Methods 0.000 description 4
- 108010087823 glycyltyrosine Proteins 0.000 description 4
- 229960002591 hydroxyproline Drugs 0.000 description 4
- 108010054155 lysyllysine Proteins 0.000 description 4
- 239000011159 matrix material Substances 0.000 description 4
- 229910052751 metal Inorganic materials 0.000 description 4
- 239000002184 metal Substances 0.000 description 4
- 230000003647 oxidation Effects 0.000 description 4
- 238000007254 oxidation reaction Methods 0.000 description 4
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 4
- 108010053725 prolylvaline Proteins 0.000 description 4
- 239000011435 rock Substances 0.000 description 4
- 108010048818 seryl-histidine Proteins 0.000 description 4
- -1 threads Substances 0.000 description 4
- UFTFJSFQGQCHQW-UHFFFAOYSA-N triformin Chemical compound O=COCC(OC=O)COC=O UFTFJSFQGQCHQW-UHFFFAOYSA-N 0.000 description 4
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 3
- NWFLONJLUJYCNS-UHFFFAOYSA-N 2-[[2-[[2-[(2-amino-3-phenylpropanoyl)amino]acetyl]amino]acetyl]amino]-3-phenylpropanoic acid Chemical compound C=1C=CC=CC=1CC(C(O)=O)NC(=O)CNC(=O)CNC(=O)C(N)CC1=CC=CC=C1 NWFLONJLUJYCNS-UHFFFAOYSA-N 0.000 description 3
- XWTNPSHCJMZAHQ-QMMMGPOBSA-N 2-[[2-[[2-[[(2s)-2-amino-4-methylpentanoyl]amino]acetyl]amino]acetyl]amino]acetic acid Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(=O)NCC(O)=O XWTNPSHCJMZAHQ-QMMMGPOBSA-N 0.000 description 3
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 3
- ODWSTKXGQGYHSH-FXQIFTODSA-N Ala-Arg-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O ODWSTKXGQGYHSH-FXQIFTODSA-N 0.000 description 3
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 3
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 3
- YFWTXMRJJDNTLM-LSJOCFKGSA-N Arg-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFWTXMRJJDNTLM-LSJOCFKGSA-N 0.000 description 3
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 3
- PNIGSVZJNVUVJA-BQBZGAKWSA-N Arg-Gly-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O PNIGSVZJNVUVJA-BQBZGAKWSA-N 0.000 description 3
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 3
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 3
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 3
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 3
- 102000001187 Collagen Type III Human genes 0.000 description 3
- 108010069502 Collagen Type III Proteins 0.000 description 3
- 102000053602 DNA Human genes 0.000 description 3
- PXHABOCPJVTGEK-BQBZGAKWSA-N Glu-Gln-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O PXHABOCPJVTGEK-BQBZGAKWSA-N 0.000 description 3
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 3
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 3
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 3
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 3
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 3
- WDXLKVQATNEAJQ-BQBZGAKWSA-N Gly-Pro-Asp Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WDXLKVQATNEAJQ-BQBZGAKWSA-N 0.000 description 3
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 description 3
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 3
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 3
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 3
- FDQYIRHBVVUTJF-ZETCQYMHSA-N His-Gly-Gly Chemical compound [O-]C(=O)CNC(=O)CNC(=O)[C@@H]([NH3+])CC1=CN=CN1 FDQYIRHBVVUTJF-ZETCQYMHSA-N 0.000 description 3
- QCBYAHHNOHBXIH-UWVGGRQHSA-N His-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CN=CN1 QCBYAHHNOHBXIH-UWVGGRQHSA-N 0.000 description 3
- 101001072202 Homo sapiens Protein disulfide-isomerase Proteins 0.000 description 3
- 108010065920 Insulin Lispro Proteins 0.000 description 3
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 3
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 3
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 3
- OBVHKUFUDCPZDW-JYJNAYRXSA-N Met-Arg-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OBVHKUFUDCPZDW-JYJNAYRXSA-N 0.000 description 3
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 3
- 108091034117 Oligonucleotide Proteins 0.000 description 3
- GMWNQSGWWGKTSF-LFSVMHDDSA-N Phe-Thr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMWNQSGWWGKTSF-LFSVMHDDSA-N 0.000 description 3
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 3
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 3
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 3
- XQSREVQDGCPFRJ-STQMWFEESA-N Pro-Gly-Phe Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XQSREVQDGCPFRJ-STQMWFEESA-N 0.000 description 3
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 3
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 3
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 3
- 102100036352 Protein disulfide-isomerase Human genes 0.000 description 3
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 3
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 3
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 3
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 3
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 3
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 3
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 3
- 238000009825 accumulation Methods 0.000 description 3
- 108010005233 alanylglutamic acid Proteins 0.000 description 3
- 229910001870 ammonium persulfate Inorganic materials 0.000 description 3
- 108010068265 aspartyltyrosine Proteins 0.000 description 3
- 238000004587 chromatography analysis Methods 0.000 description 3
- 238000002983 circular dichroism Methods 0.000 description 3
- 238000003776 cleavage reaction Methods 0.000 description 3
- 239000002299 complementary DNA Substances 0.000 description 3
- 230000001086 cytosolic effect Effects 0.000 description 3
- 230000004927 fusion Effects 0.000 description 3
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 3
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 3
- 108010036413 histidylglycine Proteins 0.000 description 3
- 230000003834 intracellular effect Effects 0.000 description 3
- USSBDBZGEDUBHE-UHFFFAOYSA-L magnesium;2-oxidooxycarbonylbenzoate Chemical compound [Mg+2].[O-]OC(=O)C1=CC=CC=C1C([O-])=O USSBDBZGEDUBHE-UHFFFAOYSA-L 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 230000001404 mediated effect Effects 0.000 description 3
- 229910052759 nickel Inorganic materials 0.000 description 3
- 229920002791 poly-4-hydroxybutyrate Polymers 0.000 description 3
- 229920000642 polymer Polymers 0.000 description 3
- 108090000765 processed proteins & peptides Proteins 0.000 description 3
- 230000007017 scission Effects 0.000 description 3
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- FGMPLJWBKKVCDB-UHFFFAOYSA-N trans-L-hydroxy-proline Natural products ON1CCCC1C(O)=O FGMPLJWBKKVCDB-UHFFFAOYSA-N 0.000 description 3
- 238000005406 washing Methods 0.000 description 3
- SCAKQYSGEIHPLV-IUCAKERBSA-N (4S)-4-[(2-aminoacetyl)amino]-5-[(2S)-2-(carboxymethylcarbamoyl)pyrrolidin-1-yl]-5-oxopentanoic acid Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SCAKQYSGEIHPLV-IUCAKERBSA-N 0.000 description 2
- HXUVTXPOZRFMOY-NSHDSACASA-N 2-[[(2s)-2-[[2-[(2-aminoacetyl)amino]acetyl]amino]-3-phenylpropanoyl]amino]acetic acid Chemical compound NCC(=O)NCC(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 HXUVTXPOZRFMOY-NSHDSACASA-N 0.000 description 2
- QMOQBVOBWVNSNO-UHFFFAOYSA-N 2-[[2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(O)=O QMOQBVOBWVNSNO-UHFFFAOYSA-N 0.000 description 2
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 2
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 2
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 2
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 2
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 2
- QHASENCZLDHBGX-ONGXEEELSA-N Ala-Gly-Phe Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QHASENCZLDHBGX-ONGXEEELSA-N 0.000 description 2
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 2
- OKEWAFFWMHBGPT-XPUUQOCRSA-N Ala-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 OKEWAFFWMHBGPT-XPUUQOCRSA-N 0.000 description 2
- CBCCCLMNOBLBSC-XVYDVKMFSA-N Ala-His-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CBCCCLMNOBLBSC-XVYDVKMFSA-N 0.000 description 2
- HJGZVLLLBJLXFC-LSJOCFKGSA-N Ala-His-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O HJGZVLLLBJLXFC-LSJOCFKGSA-N 0.000 description 2
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 2
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 2
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 2
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 2
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 2
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 2
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 2
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 2
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 2
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 2
- SWLOHUMCUDRTCL-ZLUOBGJFSA-N Asn-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N SWLOHUMCUDRTCL-ZLUOBGJFSA-N 0.000 description 2
- NXVGBGZQQFDUTM-XVYDVKMFSA-N Asn-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N NXVGBGZQQFDUTM-XVYDVKMFSA-N 0.000 description 2
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 2
- FRSGNOZCTWDVFZ-ACZMJKKPSA-N Asp-Asp-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRSGNOZCTWDVFZ-ACZMJKKPSA-N 0.000 description 2
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 2
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 2
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 2
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 2
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- 108010001498 Galectin 1 Proteins 0.000 description 2
- 102100021736 Galectin-1 Human genes 0.000 description 2
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 2
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 2
- QGWXAMDECCKGRU-XVKPBYJWSA-N Gln-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(N)=O)C(=O)NCC(O)=O QGWXAMDECCKGRU-XVKPBYJWSA-N 0.000 description 2
- HMJULNMJWOZNFI-XHNCKOQMSA-N Glu-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N)C(=O)O HMJULNMJWOZNFI-XHNCKOQMSA-N 0.000 description 2
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 2
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 2
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 2
- IWAXHBCACVWNHT-BQBZGAKWSA-N Gly-Asp-Arg Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IWAXHBCACVWNHT-BQBZGAKWSA-N 0.000 description 2
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 2
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 2
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 2
- PDAWDNVHMUKWJR-ZETCQYMHSA-N Gly-Gly-His Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 PDAWDNVHMUKWJR-ZETCQYMHSA-N 0.000 description 2
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 2
- YFGONBOFGGWKKY-VHSXEESVSA-N Gly-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)CN)C(=O)O YFGONBOFGGWKKY-VHSXEESVSA-N 0.000 description 2
- VIIBEIQMLJEUJG-LAEOZQHASA-N Gly-Ile-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O VIIBEIQMLJEUJG-LAEOZQHASA-N 0.000 description 2
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 2
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 2
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 2
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 2
- LXTRSHQLGYINON-DTWKUNHWSA-N Gly-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN LXTRSHQLGYINON-DTWKUNHWSA-N 0.000 description 2
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 2
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 2
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 2
- LYZYGGWCBLBDMC-QWHCGFSZSA-N Gly-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)CN)C(=O)O LYZYGGWCBLBDMC-QWHCGFSZSA-N 0.000 description 2
- 239000004471 Glycine Substances 0.000 description 2
- ZUPVLBAXUUGKKN-VHSXEESVSA-N His-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CN=CN2)N)C(=O)O ZUPVLBAXUUGKKN-VHSXEESVSA-N 0.000 description 2
- OZBDSFBWIDPVDA-BZSNNMDCSA-N His-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CN=CN3)N OZBDSFBWIDPVDA-BZSNNMDCSA-N 0.000 description 2
- VIJMRAIWYWRXSR-CIUDSAMLSA-N His-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 VIJMRAIWYWRXSR-CIUDSAMLSA-N 0.000 description 2
- PPSQSIDMOVPKPI-BJDJZHNGSA-N Ile-Cys-Leu Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O PPSQSIDMOVPKPI-BJDJZHNGSA-N 0.000 description 2
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 2
- XEEYBQQBJWHFJM-UHFFFAOYSA-N Iron Chemical compound [Fe] XEEYBQQBJWHFJM-UHFFFAOYSA-N 0.000 description 2
- 108010076876 Keratins Proteins 0.000 description 2
- 102000011782 Keratins Human genes 0.000 description 2
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 2
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 2
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 2
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 2
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 2
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 2
- IRRZDAIFYHNIIN-JYJNAYRXSA-N Lys-Gln-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IRRZDAIFYHNIIN-JYJNAYRXSA-N 0.000 description 2
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 2
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 2
- 239000004472 Lysine Substances 0.000 description 2
- PWHULOQIROXLJO-UHFFFAOYSA-N Manganese Chemical compound [Mn] PWHULOQIROXLJO-UHFFFAOYSA-N 0.000 description 2
- 108010038049 Mating Factor Proteins 0.000 description 2
- ALTHVGNGGZZSAC-SRVKXCTJSA-N Met-Val-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCNC(N)=N ALTHVGNGGZZSAC-SRVKXCTJSA-N 0.000 description 2
- PVSPJQWHEIQTEH-JYJNAYRXSA-N Met-Val-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PVSPJQWHEIQTEH-JYJNAYRXSA-N 0.000 description 2
- 241000234295 Musa Species 0.000 description 2
- 102000057297 Pepsin A Human genes 0.000 description 2
- 108090000284 Pepsin A Proteins 0.000 description 2
- FPTXMUIBLMGTQH-ONGXEEELSA-N Phe-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 FPTXMUIBLMGTQH-ONGXEEELSA-N 0.000 description 2
- NPLGQVKZFGJWAI-QWHCGFSZSA-N Phe-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O NPLGQVKZFGJWAI-QWHCGFSZSA-N 0.000 description 2
- CKJACGQPCPMWIT-UFYCRDLUSA-N Phe-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CKJACGQPCPMWIT-UFYCRDLUSA-N 0.000 description 2
- ZLAKUZDMKVKFAI-JYJNAYRXSA-N Phe-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O ZLAKUZDMKVKFAI-JYJNAYRXSA-N 0.000 description 2
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 2
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 2
- WSRWHZRUOCACLJ-UWVGGRQHSA-N Pro-Gly-His Chemical compound C([C@@H](C(=O)O)NC(=O)CNC(=O)[C@H]1NCCC1)C1=CN=CN1 WSRWHZRUOCACLJ-UWVGGRQHSA-N 0.000 description 2
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 2
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 2
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 2
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 2
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 2
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 2
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 2
- NVNPWELENFJOHH-CIUDSAMLSA-N Ser-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)N NVNPWELENFJOHH-CIUDSAMLSA-N 0.000 description 2
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 2
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 2
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 101710172711 Structural protein Proteins 0.000 description 2
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 2
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 2
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 2
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 2
- AXEJRUGTOJPZKG-XGEHTFHBSA-N Thr-Val-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N)O AXEJRUGTOJPZKG-XGEHTFHBSA-N 0.000 description 2
- CDHQEOXPWBDFPL-QWRGUYRKSA-N Tyr-Gly-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDHQEOXPWBDFPL-QWRGUYRKSA-N 0.000 description 2
- HIINQLBHPIQYHN-JTQLQIEISA-N Tyr-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HIINQLBHPIQYHN-JTQLQIEISA-N 0.000 description 2
- MVYRJYISVJWKSX-KBPBESRZSA-N Tyr-His-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)NCC(=O)O)N)O MVYRJYISVJWKSX-KBPBESRZSA-N 0.000 description 2
- 108010064997 VPY tripeptide Proteins 0.000 description 2
- REJBPZVUHYNMEN-LSJOCFKGSA-N Val-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N REJBPZVUHYNMEN-LSJOCFKGSA-N 0.000 description 2
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 2
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 2
- ZZGPVSZDZQRJQY-ULQDDVLXSA-N Val-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZZGPVSZDZQRJQY-ULQDDVLXSA-N 0.000 description 2
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 2
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- 230000009471 action Effects 0.000 description 2
- 108010066875 alanyl-prolyl-tryptophyl-cysteine Proteins 0.000 description 2
- 108010044940 alanylglutamine Proteins 0.000 description 2
- 108010087924 alanylproline Proteins 0.000 description 2
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 2
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 2
- 239000003054 catalyst Substances 0.000 description 2
- 239000001913 cellulose Substances 0.000 description 2
- 229920002678 cellulose Polymers 0.000 description 2
- 239000013599 cloning vector Substances 0.000 description 2
- 230000036570 collagen biosynthesis Effects 0.000 description 2
- 230000000052 comparative effect Effects 0.000 description 2
- 230000009260 cross reactivity Effects 0.000 description 2
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 2
- 235000018417 cysteine Nutrition 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 238000012377 drug delivery Methods 0.000 description 2
- 239000000975 dye Substances 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 229920001971 elastomer Polymers 0.000 description 2
- 230000007613 environmental effect Effects 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- 239000013613 expression plasmid Substances 0.000 description 2
- 239000006260 foam Substances 0.000 description 2
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- 108010049041 glutamylalanine Proteins 0.000 description 2
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 2
- 108010020688 glycylhistidine Proteins 0.000 description 2
- 108010037850 glycylvaline Proteins 0.000 description 2
- 108010040030 histidinoalanine Proteins 0.000 description 2
- 108010092114 histidylphenylalanine Proteins 0.000 description 2
- 239000000017 hydrogel Substances 0.000 description 2
- 230000001965 increasing effect Effects 0.000 description 2
- 239000000543 intermediate Substances 0.000 description 2
- 108010057821 leucylproline Proteins 0.000 description 2
- 229910052748 manganese Inorganic materials 0.000 description 2
- 239000011572 manganese Substances 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- SKEFKEOTNIPLCQ-LWIQTABASA-N mating hormone Chemical compound C([C@@H](C(=O)NC(CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N1[C@@H](CCC1)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCS(C)=O)C(=O)NC(CC=1C=CC(O)=CC=1)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CN=CN1 SKEFKEOTNIPLCQ-LWIQTABASA-N 0.000 description 2
- 108010005942 methionylglycine Proteins 0.000 description 2
- 238000013508 migration Methods 0.000 description 2
- 230000005012 migration Effects 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 238000012544 monitoring process Methods 0.000 description 2
- 239000002121 nanofiber Substances 0.000 description 2
- 229940111202 pepsin Drugs 0.000 description 2
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 2
- 230000021715 photosynthesis, light harvesting Effects 0.000 description 2
- 230000035479 physiological effects, processes and functions Effects 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 108010031719 prolyl-serine Proteins 0.000 description 2
- 230000006798 recombination Effects 0.000 description 2
- 238000011084 recovery Methods 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 238000001338 self-assembly Methods 0.000 description 2
- 239000002904 solvent Substances 0.000 description 2
- 230000032258 transport Effects 0.000 description 2
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 2
- 108010051110 tyrosyl-lysine Proteins 0.000 description 2
- 241000701161 unidentified adenovirus Species 0.000 description 2
- 241001529453 unidentified herpesvirus Species 0.000 description 2
- 241001430294 unidentified retrovirus Species 0.000 description 2
- 108010011876 valyl-glycyl-valyl-alanyl-prolyl-glycine Proteins 0.000 description 2
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- CEHZCZCQHUNAJF-AVGNSLFASA-N (2s)-1-[2-[[(2s)-1-[(2s)-2-amino-5-(diaminomethylideneamino)pentanoyl]pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N1[C@H](C(O)=O)CCC1 CEHZCZCQHUNAJF-AVGNSLFASA-N 0.000 description 1
- SUQWGICKJIJKNO-IHRRRGAJSA-N (2s)-2-[[2-[[(2s)-6-amino-2-[[(2s)-2,6-diaminohexanoyl]amino]hexanoyl]amino]acetyl]amino]pentanedioic acid Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O SUQWGICKJIJKNO-IHRRRGAJSA-N 0.000 description 1
- HKZAAJSTFUZYTO-LURJTMIESA-N (2s)-2-[[2-[[2-[[2-[(2-aminoacetyl)amino]acetyl]amino]acetyl]amino]acetyl]amino]-3-hydroxypropanoic acid Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O HKZAAJSTFUZYTO-LURJTMIESA-N 0.000 description 1
- RQPKNXVVIBYOBX-KDBLBPRBSA-N (2s)-2-amino-3-(3,4-dihydroxyphenyl)propanoic acid;(2s)-2-(dihydroxyamino)-3-phenylpropanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C(O)=C1.ON(O)[C@H](C(O)=O)CC1=CC=CC=C1 RQPKNXVVIBYOBX-KDBLBPRBSA-N 0.000 description 1
- RVQXICDGMITIBF-KDBLBPRBSA-N (2s)-2-amino-3-(3,4-dihydroxyphenyl)propanoic acid;(2s)-2-(hydroxyamino)-3-phenylpropanoic acid Chemical compound ON[C@H](C(O)=O)CC1=CC=CC=C1.OC(=O)[C@@H](N)CC1=CC=C(O)C(O)=C1 RVQXICDGMITIBF-KDBLBPRBSA-N 0.000 description 1
- RLCSROTYKMPBDL-USJZOSNVSA-N 2-[[(2s)-1-[(2s)-2-[[(2s)-2-[[2-[[(2s)-2-amino-3-methylbutanoyl]amino]acetyl]amino]-3-methylbutanoyl]amino]propanoyl]pyrrolidine-2-carbonyl]amino]acetic acid Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RLCSROTYKMPBDL-USJZOSNVSA-N 0.000 description 1
- GWIAAIUASRVOIA-UHFFFAOYSA-N 2-aminonaphthalene-1-sulfonic acid Chemical compound C1=CC=CC2=C(S(O)(=O)=O)C(N)=CC=C21 GWIAAIUASRVOIA-UHFFFAOYSA-N 0.000 description 1
- ITZMJCSORYKOSI-AJNGGQMLSA-N APGPR Enterostatin Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N1[C@H](C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)CCC1 ITZMJCSORYKOSI-AJNGGQMLSA-N 0.000 description 1
- 102000054930 Agouti-Related Human genes 0.000 description 1
- SBGXWWCLHIOABR-UHFFFAOYSA-N Ala Ala Gly Ala Chemical compound CC(N)C(=O)NC(C)C(=O)NCC(=O)NC(C)C(O)=O SBGXWWCLHIOABR-UHFFFAOYSA-N 0.000 description 1
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 1
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 1
- XQJAFSDFQZPYCU-UWJYBYFXSA-N Ala-Asn-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N XQJAFSDFQZPYCU-UWJYBYFXSA-N 0.000 description 1
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 1
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 1
- RXTBLQVXNIECFP-FXQIFTODSA-N Ala-Gln-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RXTBLQVXNIECFP-FXQIFTODSA-N 0.000 description 1
- ZDYNWWQXFRUOEO-XDTLVQLUSA-N Ala-Gln-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZDYNWWQXFRUOEO-XDTLVQLUSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 1
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 1
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 1
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 1
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 1
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 1
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- OMDNCNKNEGFOMM-BQBZGAKWSA-N Ala-Met-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O OMDNCNKNEGFOMM-BQBZGAKWSA-N 0.000 description 1
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 1
- WEZNQZHACPSMEF-QEJZJMRPSA-N Ala-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 WEZNQZHACPSMEF-QEJZJMRPSA-N 0.000 description 1
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 1
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 1
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- NHWYNIZWLJYZAG-XVYDVKMFSA-N Ala-Ser-His Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N NHWYNIZWLJYZAG-XVYDVKMFSA-N 0.000 description 1
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 1
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 1
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 1
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 1
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 1
- OOBVTWHLKYJFJH-FXQIFTODSA-N Arg-Ala-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O OOBVTWHLKYJFJH-FXQIFTODSA-N 0.000 description 1
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 1
- MUXONAMCEUBVGA-DCAQKATOSA-N Arg-Arg-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O MUXONAMCEUBVGA-DCAQKATOSA-N 0.000 description 1
- USNSOPDIZILSJP-FXQIFTODSA-N Arg-Asn-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O USNSOPDIZILSJP-FXQIFTODSA-N 0.000 description 1
- NONSEUUPKITYQT-BQBZGAKWSA-N Arg-Asn-Gly Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N)CN=C(N)N NONSEUUPKITYQT-BQBZGAKWSA-N 0.000 description 1
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 1
- IYMAXBFPHPZYIK-BQBZGAKWSA-N Arg-Gly-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IYMAXBFPHPZYIK-BQBZGAKWSA-N 0.000 description 1
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- PHHRSPBBQUFULD-UWVGGRQHSA-N Arg-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N PHHRSPBBQUFULD-UWVGGRQHSA-N 0.000 description 1
- SYAUZLVLXCDRSH-IUCAKERBSA-N Arg-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N SYAUZLVLXCDRSH-IUCAKERBSA-N 0.000 description 1
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 1
- UAOSDDXCTBIPCA-QXEWZRGKSA-N Arg-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UAOSDDXCTBIPCA-QXEWZRGKSA-N 0.000 description 1
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 1
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 1
- MJINRRBEMOLJAK-DCAQKATOSA-N Arg-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N MJINRRBEMOLJAK-DCAQKATOSA-N 0.000 description 1
- IGFJVXOATGZTHD-UHFFFAOYSA-N Arg-Phe-His Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccccc1)C(=O)NC(Cc2c[nH]cn2)C(=O)O IGFJVXOATGZTHD-UHFFFAOYSA-N 0.000 description 1
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 1
- QQEWINYJRFBLNN-DLOVCJGASA-N Asn-Ala-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QQEWINYJRFBLNN-DLOVCJGASA-N 0.000 description 1
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 1
- KXEGPPNPXOKKHK-ZLUOBGJFSA-N Asn-Asp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KXEGPPNPXOKKHK-ZLUOBGJFSA-N 0.000 description 1
- BHQQRVARKXWXPP-ACZMJKKPSA-N Asn-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BHQQRVARKXWXPP-ACZMJKKPSA-N 0.000 description 1
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 1
- KLKHFFMNGWULBN-VKHMYHEASA-N Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)NCC(O)=O KLKHFFMNGWULBN-VKHMYHEASA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 1
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 1
- XFJKRRCWLTZIQA-XIRDDKMYSA-N Asn-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N XFJKRRCWLTZIQA-XIRDDKMYSA-N 0.000 description 1
- RAUPFUCUDBQYHE-AVGNSLFASA-N Asn-Phe-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RAUPFUCUDBQYHE-AVGNSLFASA-N 0.000 description 1
- RVHGJNGNKGDCPX-KKUMJFAQSA-N Asn-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N RVHGJNGNKGDCPX-KKUMJFAQSA-N 0.000 description 1
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 1
- RDLYUKRPEJERMM-XIRDDKMYSA-N Asn-Trp-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O RDLYUKRPEJERMM-XIRDDKMYSA-N 0.000 description 1
- ZAESWDKAMDVHLL-RCOVLWMOSA-N Asn-Val-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O ZAESWDKAMDVHLL-RCOVLWMOSA-N 0.000 description 1
- AXXCUABIFZPKPM-BQBZGAKWSA-N Asp-Arg-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O AXXCUABIFZPKPM-BQBZGAKWSA-N 0.000 description 1
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 1
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 1
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 1
- VHQOCWWKXIOAQI-WDSKDSINSA-N Asp-Gln-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VHQOCWWKXIOAQI-WDSKDSINSA-N 0.000 description 1
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 1
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 1
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 1
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 1
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 1
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 1
- IOXWDLNHXZOXQP-FXQIFTODSA-N Asp-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N IOXWDLNHXZOXQP-FXQIFTODSA-N 0.000 description 1
- GYWQGGUCMDCUJE-DLOVCJGASA-N Asp-Phe-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O GYWQGGUCMDCUJE-DLOVCJGASA-N 0.000 description 1
- KBJVTFWQWXCYCQ-IUKAMOBKSA-N Asp-Thr-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KBJVTFWQWXCYCQ-IUKAMOBKSA-N 0.000 description 1
- DKQCWCQRAMAFLN-UBHSHLNASA-N Asp-Trp-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O DKQCWCQRAMAFLN-UBHSHLNASA-N 0.000 description 1
- NJLLRXWFPQQPHV-SRVKXCTJSA-N Asp-Tyr-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJLLRXWFPQQPHV-SRVKXCTJSA-N 0.000 description 1
- USENATHVGFXRNO-SRVKXCTJSA-N Asp-Tyr-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 USENATHVGFXRNO-SRVKXCTJSA-N 0.000 description 1
- ZQFZEBRNAMXXJV-KKUMJFAQSA-N Asp-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O ZQFZEBRNAMXXJV-KKUMJFAQSA-N 0.000 description 1
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 1
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 1
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 1
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 241000237519 Bivalvia Species 0.000 description 1
- 108010022452 Collagen Type I Proteins 0.000 description 1
- 102000012422 Collagen Type I Human genes 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 description 1
- 241001275954 Cortinarius caperatus Species 0.000 description 1
- CEZSLNCYQUFOSL-BQBZGAKWSA-N Cys-Arg-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O CEZSLNCYQUFOSL-BQBZGAKWSA-N 0.000 description 1
- RWAZRMXTVSIVJR-YUMQZZPRSA-N Cys-Gly-His Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC1=CNC=N1)C(O)=O RWAZRMXTVSIVJR-YUMQZZPRSA-N 0.000 description 1
- OZHXXYOHPLLLMI-CIUDSAMLSA-N Cys-Lys-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OZHXXYOHPLLLMI-CIUDSAMLSA-N 0.000 description 1
- SMEYEQDCCBHTEF-FXQIFTODSA-N Cys-Pro-Ala Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O SMEYEQDCCBHTEF-FXQIFTODSA-N 0.000 description 1
- 101150097493 D gene Proteins 0.000 description 1
- 230000007023 DNA restriction-modification system Effects 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 241000402754 Erythranthe moschata Species 0.000 description 1
- 102000010834 Extracellular Matrix Proteins Human genes 0.000 description 1
- 108010037362 Extracellular Matrix Proteins Proteins 0.000 description 1
- VTLYFUHAOXGGBS-UHFFFAOYSA-N Fe3+ Chemical class [Fe+3] VTLYFUHAOXGGBS-UHFFFAOYSA-N 0.000 description 1
- RGXXLQWXBFNXTG-CIUDSAMLSA-N Gln-Arg-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O RGXXLQWXBFNXTG-CIUDSAMLSA-N 0.000 description 1
- PGPJSRSLQNXBDT-YUMQZZPRSA-N Gln-Arg-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O PGPJSRSLQNXBDT-YUMQZZPRSA-N 0.000 description 1
- MWLYSLMKFXWZPW-ZPFDUUQYSA-N Gln-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCC(N)=O MWLYSLMKFXWZPW-ZPFDUUQYSA-N 0.000 description 1
- XOKGKOQWADCLFQ-GARJFASQSA-N Gln-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XOKGKOQWADCLFQ-GARJFASQSA-N 0.000 description 1
- WQWMZOIPXWSZNE-WDSKDSINSA-N Gln-Asp-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O WQWMZOIPXWSZNE-WDSKDSINSA-N 0.000 description 1
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 1
- NVEASDQHBRZPSU-BQBZGAKWSA-N Gln-Gln-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O NVEASDQHBRZPSU-BQBZGAKWSA-N 0.000 description 1
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 1
- NSNUZSPSADIMJQ-WDSKDSINSA-N Gln-Gly-Asp Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NSNUZSPSADIMJQ-WDSKDSINSA-N 0.000 description 1
- CLPQUWHBWXFJOX-BQBZGAKWSA-N Gln-Gly-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O CLPQUWHBWXFJOX-BQBZGAKWSA-N 0.000 description 1
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 1
- QBLMTCRYYTVUQY-GUBZILKMSA-N Gln-Leu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QBLMTCRYYTVUQY-GUBZILKMSA-N 0.000 description 1
- IHSGESFHTMFHRB-GUBZILKMSA-N Gln-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O IHSGESFHTMFHRB-GUBZILKMSA-N 0.000 description 1
- DQLVHRFFBQOWFL-JYJNAYRXSA-N Gln-Lys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)O DQLVHRFFBQOWFL-JYJNAYRXSA-N 0.000 description 1
- XZUUUKNKNWVPHQ-JYJNAYRXSA-N Gln-Phe-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O XZUUUKNKNWVPHQ-JYJNAYRXSA-N 0.000 description 1
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 1
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 1
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 1
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 1
- NADWTMLCUDMDQI-ACZMJKKPSA-N Glu-Asp-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N NADWTMLCUDMDQI-ACZMJKKPSA-N 0.000 description 1
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 1
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 description 1
- VFZIDQZAEBORGY-GLLZPBPUSA-N Glu-Gln-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VFZIDQZAEBORGY-GLLZPBPUSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 1
- KASDBWKLWJKTLJ-GUBZILKMSA-N Glu-Glu-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O KASDBWKLWJKTLJ-GUBZILKMSA-N 0.000 description 1
- KUTPGXNAAOQSPD-LPEHRKFASA-N Glu-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KUTPGXNAAOQSPD-LPEHRKFASA-N 0.000 description 1
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 1
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 1
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- JJSVALISDCNFCU-SZMVWBNQSA-N Glu-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O JJSVALISDCNFCU-SZMVWBNQSA-N 0.000 description 1
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 1
- CBEUFCJRFNZMCU-SRVKXCTJSA-N Glu-Met-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O CBEUFCJRFNZMCU-SRVKXCTJSA-N 0.000 description 1
- KJBGAZSLZAQDPV-KKUMJFAQSA-N Glu-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KJBGAZSLZAQDPV-KKUMJFAQSA-N 0.000 description 1
- WVWZIPOJECFDAG-AVGNSLFASA-N Glu-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N WVWZIPOJECFDAG-AVGNSLFASA-N 0.000 description 1
- YTRBQAQSUDSIQE-FHWLQOOXSA-N Glu-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 YTRBQAQSUDSIQE-FHWLQOOXSA-N 0.000 description 1
- CBWKURKPYSLMJV-SOUVJXGZSA-N Glu-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CBWKURKPYSLMJV-SOUVJXGZSA-N 0.000 description 1
- ITVBKCZZLJUUHI-HTUGSXCWSA-N Glu-Phe-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ITVBKCZZLJUUHI-HTUGSXCWSA-N 0.000 description 1
- MIIGESVJEBDJMP-FHWLQOOXSA-N Glu-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 MIIGESVJEBDJMP-FHWLQOOXSA-N 0.000 description 1
- UDEPRBFQTWGLCW-CIUDSAMLSA-N Glu-Pro-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O UDEPRBFQTWGLCW-CIUDSAMLSA-N 0.000 description 1
- CQAHWYDHKUWYIX-YUMQZZPRSA-N Glu-Pro-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O CQAHWYDHKUWYIX-YUMQZZPRSA-N 0.000 description 1
- PAZQYODKOZHXGA-SRVKXCTJSA-N Glu-Pro-His Chemical compound N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O PAZQYODKOZHXGA-SRVKXCTJSA-N 0.000 description 1
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 1
- BXSZPACYCMNKLS-AVGNSLFASA-N Glu-Ser-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BXSZPACYCMNKLS-AVGNSLFASA-N 0.000 description 1
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 1
- VJVAQZYGLMJPTK-QEJZJMRPSA-N Glu-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VJVAQZYGLMJPTK-QEJZJMRPSA-N 0.000 description 1
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 1
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 1
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 1
- GZUKEVBTYNNUQF-WDSKDSINSA-N Gly-Ala-Gln Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GZUKEVBTYNNUQF-WDSKDSINSA-N 0.000 description 1
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 1
- MZZSCEANQDPJER-ONGXEEELSA-N Gly-Ala-Phe Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MZZSCEANQDPJER-ONGXEEELSA-N 0.000 description 1
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 1
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 1
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 1
- XUDLUKYPXQDCRX-BQBZGAKWSA-N Gly-Arg-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O XUDLUKYPXQDCRX-BQBZGAKWSA-N 0.000 description 1
- JPXNYFOHTHSREU-UWVGGRQHSA-N Gly-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN JPXNYFOHTHSREU-UWVGGRQHSA-N 0.000 description 1
- WJZLEENECIOOSA-WDSKDSINSA-N Gly-Asn-Gln Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)O WJZLEENECIOOSA-WDSKDSINSA-N 0.000 description 1
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 1
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 1
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 1
- QGZSAHIZRQHCEQ-QWRGUYRKSA-N Gly-Asp-Tyr Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QGZSAHIZRQHCEQ-QWRGUYRKSA-N 0.000 description 1
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 1
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 1
- VOCMRCVMAPSSAL-IUCAKERBSA-N Gly-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN VOCMRCVMAPSSAL-IUCAKERBSA-N 0.000 description 1
- AQLHORCVPGXDJW-IUCAKERBSA-N Gly-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN AQLHORCVPGXDJW-IUCAKERBSA-N 0.000 description 1
- JLJLBWDKDRYOPA-RYUDHWBXSA-N Gly-Gln-Tyr Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JLJLBWDKDRYOPA-RYUDHWBXSA-N 0.000 description 1
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 1
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 1
- QPCVIQJVRGXUSA-LURJTMIESA-N Gly-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QPCVIQJVRGXUSA-LURJTMIESA-N 0.000 description 1
- ADZGCWWDPFDHCY-ZETCQYMHSA-N Gly-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 ADZGCWWDPFDHCY-ZETCQYMHSA-N 0.000 description 1
- ALOBJFDJTMQQPW-ONGXEEELSA-N Gly-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN ALOBJFDJTMQQPW-ONGXEEELSA-N 0.000 description 1
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 1
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 1
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 1
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 1
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 1
- BXICSAQLIHFDDL-YUMQZZPRSA-N Gly-Lys-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BXICSAQLIHFDDL-YUMQZZPRSA-N 0.000 description 1
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 1
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 1
- FEUPVVCGQLNXNP-IRXDYDNUSA-N Gly-Phe-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FEUPVVCGQLNXNP-IRXDYDNUSA-N 0.000 description 1
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 1
- QSQXZZCGPXQBPP-BQBZGAKWSA-N Gly-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)CN)C(=O)N[C@@H](CS)C(=O)O QSQXZZCGPXQBPP-BQBZGAKWSA-N 0.000 description 1
- JJGBXTYGTKWGAT-YUMQZZPRSA-N Gly-Pro-Glu Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O JJGBXTYGTKWGAT-YUMQZZPRSA-N 0.000 description 1
- OCPPBNKYGYSLOE-IUCAKERBSA-N Gly-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN OCPPBNKYGYSLOE-IUCAKERBSA-N 0.000 description 1
- GLACUWHUYFBSPJ-FJXKBIBVSA-N Gly-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GLACUWHUYFBSPJ-FJXKBIBVSA-N 0.000 description 1
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 1
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 1
- MKIAPEZXQDILRR-YUMQZZPRSA-N Gly-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN MKIAPEZXQDILRR-YUMQZZPRSA-N 0.000 description 1
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 1
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 1
- XBGGUPMXALFZOT-VIFPVBQESA-N Gly-Tyr Chemical compound NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-VIFPVBQESA-N 0.000 description 1
- KBBFOULZCHWGJX-KBPBESRZSA-N Gly-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN)O KBBFOULZCHWGJX-KBPBESRZSA-N 0.000 description 1
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 1
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 1
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 1
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 1
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 1
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 1
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 1
- BIAKMWKJMQLZOJ-ZKWXMUAHSA-N His-Ala-Ala Chemical compound C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O BIAKMWKJMQLZOJ-ZKWXMUAHSA-N 0.000 description 1
- DCRODRAURLJOFY-XPUUQOCRSA-N His-Ala-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)NCC(O)=O DCRODRAURLJOFY-XPUUQOCRSA-N 0.000 description 1
- CJGDTAHEMXLRMB-ULQDDVLXSA-N His-Arg-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CJGDTAHEMXLRMB-ULQDDVLXSA-N 0.000 description 1
- NOQPTNXSGNPJNS-YUMQZZPRSA-N His-Asn-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O NOQPTNXSGNPJNS-YUMQZZPRSA-N 0.000 description 1
- ZJSMFRTVYSLKQU-DJFWLOJKSA-N His-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ZJSMFRTVYSLKQU-DJFWLOJKSA-N 0.000 description 1
- QSLKWWDKIXMWJV-SRVKXCTJSA-N His-Cys-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N QSLKWWDKIXMWJV-SRVKXCTJSA-N 0.000 description 1
- VBOFRJNDIOPNDO-YUMQZZPRSA-N His-Gly-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N VBOFRJNDIOPNDO-YUMQZZPRSA-N 0.000 description 1
- FYTCLUIYTYFGPT-YUMQZZPRSA-N His-Gly-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FYTCLUIYTYFGPT-YUMQZZPRSA-N 0.000 description 1
- BDFCIKANUNMFGB-PMVVWTBXSA-N His-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 BDFCIKANUNMFGB-PMVVWTBXSA-N 0.000 description 1
- MJUUWJJEUOBDGW-IHRRRGAJSA-N His-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 MJUUWJJEUOBDGW-IHRRRGAJSA-N 0.000 description 1
- BSVLMPMIXPQNKC-KBPBESRZSA-N His-Phe-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O BSVLMPMIXPQNKC-KBPBESRZSA-N 0.000 description 1
- JGFWUKYIQAEYAH-DCAQKATOSA-N His-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JGFWUKYIQAEYAH-DCAQKATOSA-N 0.000 description 1
- FFYYUUWROYYKFY-IHRRRGAJSA-N His-Val-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O FFYYUUWROYYKFY-IHRRRGAJSA-N 0.000 description 1
- GGXUJBKENKVYNV-ULQDDVLXSA-N His-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N GGXUJBKENKVYNV-ULQDDVLXSA-N 0.000 description 1
- 238000004566 IR spectroscopy Methods 0.000 description 1
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 1
- VZIFYHYNQDIPLI-HJWJTTGWSA-N Ile-Arg-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N VZIFYHYNQDIPLI-HJWJTTGWSA-N 0.000 description 1
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 1
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 1
- IGJWJGIHUFQANP-LAEOZQHASA-N Ile-Gly-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N IGJWJGIHUFQANP-LAEOZQHASA-N 0.000 description 1
- UASTVUQJMLZWGG-PEXQALLHSA-N Ile-His-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N UASTVUQJMLZWGG-PEXQALLHSA-N 0.000 description 1
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 1
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 1
- XDUVMJCBYUKNFJ-MXAVVETBSA-N Ile-Lys-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N XDUVMJCBYUKNFJ-MXAVVETBSA-N 0.000 description 1
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 1
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 description 1
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 1
- MASWXTFJVNRZPT-NAKRPEOUSA-N Ile-Met-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)O)N MASWXTFJVNRZPT-NAKRPEOUSA-N 0.000 description 1
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 1
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 1
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 1
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 1
- DLEBSGAVWRPTIX-PEDHHIEDSA-N Ile-Val-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)[C@@H](C)CC DLEBSGAVWRPTIX-PEDHHIEDSA-N 0.000 description 1
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 1
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 1
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 1
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 1
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 1
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 1
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 1
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 1
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 1
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 1
- JQSXWJXBASFONF-KKUMJFAQSA-N Leu-Asp-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JQSXWJXBASFONF-KKUMJFAQSA-N 0.000 description 1
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 1
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 1
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 1
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 1
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 1
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 1
- KXODZBLFVFSLAI-AVGNSLFASA-N Leu-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KXODZBLFVFSLAI-AVGNSLFASA-N 0.000 description 1
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 1
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 1
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 1
- UBZGNBKMIJHOHL-BZSNNMDCSA-N Leu-Leu-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 UBZGNBKMIJHOHL-BZSNNMDCSA-N 0.000 description 1
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 1
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 1
- ONPJGOIVICHWBW-BZSNNMDCSA-N Leu-Lys-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 ONPJGOIVICHWBW-BZSNNMDCSA-N 0.000 description 1
- SYRTUBLKWNDSDK-DKIMLUQUSA-N Leu-Phe-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYRTUBLKWNDSDK-DKIMLUQUSA-N 0.000 description 1
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 1
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 101000680845 Luffa aegyptiaca Ribosome-inactivating protein luffin P1 Proteins 0.000 description 1
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 1
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 1
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 1
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 1
- IBQMEXQYZMVIFU-SRVKXCTJSA-N Lys-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N IBQMEXQYZMVIFU-SRVKXCTJSA-N 0.000 description 1
- YEIYAQQKADPIBJ-GARJFASQSA-N Lys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O YEIYAQQKADPIBJ-GARJFASQSA-N 0.000 description 1
- NTBFKPBULZGXQL-KKUMJFAQSA-N Lys-Asp-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTBFKPBULZGXQL-KKUMJFAQSA-N 0.000 description 1
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 1
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 1
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 1
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 1
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 1
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 1
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 1
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 1
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 1
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 1
- VLMNBMFYRMGEMB-QWRGUYRKSA-N Lys-His-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 VLMNBMFYRMGEMB-QWRGUYRKSA-N 0.000 description 1
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 1
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 1
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 1
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 1
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 1
- GZGWILAQHOVXTD-DCAQKATOSA-N Lys-Met-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O GZGWILAQHOVXTD-DCAQKATOSA-N 0.000 description 1
- IPSDPDAOSAEWCN-RHYQMDGZSA-N Lys-Met-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IPSDPDAOSAEWCN-RHYQMDGZSA-N 0.000 description 1
- AFLBTVGQCQLOFJ-AVGNSLFASA-N Lys-Pro-Arg Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AFLBTVGQCQLOFJ-AVGNSLFASA-N 0.000 description 1
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 1
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 1
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 1
- MIMXMVDLMDMOJD-BZSNNMDCSA-N Lys-Tyr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O MIMXMVDLMDMOJD-BZSNNMDCSA-N 0.000 description 1
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 1
- ZEDVFJPQNNBMST-CYDGBPFRSA-N Met-Arg-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZEDVFJPQNNBMST-CYDGBPFRSA-N 0.000 description 1
- DJDFBVNNDAUPRW-GUBZILKMSA-N Met-Glu-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O DJDFBVNNDAUPRW-GUBZILKMSA-N 0.000 description 1
- GPAHWYRSHCKICP-GUBZILKMSA-N Met-Glu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GPAHWYRSHCKICP-GUBZILKMSA-N 0.000 description 1
- FYRUJIJAUPHUNB-IUCAKERBSA-N Met-Gly-Arg Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N FYRUJIJAUPHUNB-IUCAKERBSA-N 0.000 description 1
- BEZJTLKUMFMITF-AVGNSLFASA-N Met-Lys-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCNC(N)=N BEZJTLKUMFMITF-AVGNSLFASA-N 0.000 description 1
- KYXDADPHSNFWQX-VEVYYDQMSA-N Met-Thr-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O KYXDADPHSNFWQX-VEVYYDQMSA-N 0.000 description 1
- CIIJWIAORKTXAH-FJXKBIBVSA-N Met-Thr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O CIIJWIAORKTXAH-FJXKBIBVSA-N 0.000 description 1
- 108010006519 Molecular Chaperones Proteins 0.000 description 1
- 241000237852 Mollusca Species 0.000 description 1
- 241001529936 Murinae Species 0.000 description 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- 108010047562 NGR peptide Proteins 0.000 description 1
- 239000004677 Nylon Substances 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- CYZBFPYMSJGBRL-DRZSPHRISA-N Phe-Ala-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CYZBFPYMSJGBRL-DRZSPHRISA-N 0.000 description 1
- UHRNIXJAGGLKHP-DLOVCJGASA-N Phe-Ala-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O UHRNIXJAGGLKHP-DLOVCJGASA-N 0.000 description 1
- AYPMIIKUMNADSU-IHRRRGAJSA-N Phe-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AYPMIIKUMNADSU-IHRRRGAJSA-N 0.000 description 1
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 1
- PDUVELWDJZOUEI-IHRRRGAJSA-N Phe-Cys-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PDUVELWDJZOUEI-IHRRRGAJSA-N 0.000 description 1
- FIRWJEJVFFGXSH-RYUDHWBXSA-N Phe-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 FIRWJEJVFFGXSH-RYUDHWBXSA-N 0.000 description 1
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 1
- UAMFZRNCIFFMLE-FHWLQOOXSA-N Phe-Glu-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N UAMFZRNCIFFMLE-FHWLQOOXSA-N 0.000 description 1
- HGNGAMWHGGANAU-WHOFXGATSA-N Phe-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HGNGAMWHGGANAU-WHOFXGATSA-N 0.000 description 1
- HNFUGJUZJRYUHN-JSGCOSHPSA-N Phe-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HNFUGJUZJRYUHN-JSGCOSHPSA-N 0.000 description 1
- MYQCCQSMKNCNKY-KKUMJFAQSA-N Phe-His-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CO)C(=O)O)N MYQCCQSMKNCNKY-KKUMJFAQSA-N 0.000 description 1
- GYEPCBNTTRORKW-PCBIJLKTSA-N Phe-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O GYEPCBNTTRORKW-PCBIJLKTSA-N 0.000 description 1
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 1
- DOXQMJCSSYZSNM-BZSNNMDCSA-N Phe-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O DOXQMJCSSYZSNM-BZSNNMDCSA-N 0.000 description 1
- PEFJUUYFEGBXFA-BZSNNMDCSA-N Phe-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 PEFJUUYFEGBXFA-BZSNNMDCSA-N 0.000 description 1
- OXKJSGGTHFMGDT-UFYCRDLUSA-N Phe-Phe-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)C1=CC=CC=C1 OXKJSGGTHFMGDT-UFYCRDLUSA-N 0.000 description 1
- XOHJOMKCRLHGCY-UNQGMJICSA-N Phe-Pro-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOHJOMKCRLHGCY-UNQGMJICSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- QTDBZORPVYTRJU-KKXDTOCCSA-N Phe-Tyr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O QTDBZORPVYTRJU-KKXDTOCCSA-N 0.000 description 1
- YUPRIZTWANWWHK-DZKIICNBSA-N Phe-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N YUPRIZTWANWWHK-DZKIICNBSA-N 0.000 description 1
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 1
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 1
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 1
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- SZZBUDVXWZZPDH-BQBZGAKWSA-N Pro-Cys-Gly Chemical compound OC(=O)CNC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 SZZBUDVXWZZPDH-BQBZGAKWSA-N 0.000 description 1
- XJROSHJRQTXWAE-XGEHTFHBSA-N Pro-Cys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XJROSHJRQTXWAE-XGEHTFHBSA-N 0.000 description 1
- PULPZRAHVFBVTO-DCAQKATOSA-N Pro-Glu-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PULPZRAHVFBVTO-DCAQKATOSA-N 0.000 description 1
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 1
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 1
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 1
- ZTMLZUNPFDGPKY-VKOGCVSHSA-N Pro-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@@H]3CCCN3 ZTMLZUNPFDGPKY-VKOGCVSHSA-N 0.000 description 1
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 1
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 1
- SMFQZMGHCODUPQ-ULQDDVLXSA-N Pro-Lys-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SMFQZMGHCODUPQ-ULQDDVLXSA-N 0.000 description 1
- WIPAMEKBSHNFQE-IUCAKERBSA-N Pro-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@@H]1CCCN1 WIPAMEKBSHNFQE-IUCAKERBSA-N 0.000 description 1
- AJBQTGZIZQXBLT-STQMWFEESA-N Pro-Phe-Gly Chemical compound C([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 AJBQTGZIZQXBLT-STQMWFEESA-N 0.000 description 1
- RNEFESSBTOQSAC-DCAQKATOSA-N Pro-Ser-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O RNEFESSBTOQSAC-DCAQKATOSA-N 0.000 description 1
- UCTIUWKCVNGEFH-OBJOEFQTSA-N Pro-Val-Gly-Pro Chemical compound N([C@@H](C(C)C)C(=O)NCC(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 UCTIUWKCVNGEFH-OBJOEFQTSA-N 0.000 description 1
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- 108010003201 RGH 0205 Proteins 0.000 description 1
- 101100209986 Rattus norvegicus Slc18a1 gene Proteins 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 1
- DWUIECHTAMYEFL-XVYDVKMFSA-N Ser-Ala-His Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DWUIECHTAMYEFL-XVYDVKMFSA-N 0.000 description 1
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 1
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 1
- IDCKUIWEIZYVSO-WFBYXXMGSA-N Ser-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C)C(O)=O)=CNC2=C1 IDCKUIWEIZYVSO-WFBYXXMGSA-N 0.000 description 1
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 1
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 1
- VAIZFHMTBFYJIA-ACZMJKKPSA-N Ser-Asp-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O VAIZFHMTBFYJIA-ACZMJKKPSA-N 0.000 description 1
- SFZKGGOGCNQPJY-CIUDSAMLSA-N Ser-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N SFZKGGOGCNQPJY-CIUDSAMLSA-N 0.000 description 1
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 1
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 1
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 1
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 1
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 1
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- ZUDXUJSYCCNZQJ-DCAQKATOSA-N Ser-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N ZUDXUJSYCCNZQJ-DCAQKATOSA-N 0.000 description 1
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 1
- QJKPECIAWNNKIT-KKUMJFAQSA-N Ser-Lys-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QJKPECIAWNNKIT-KKUMJFAQSA-N 0.000 description 1
- ZSLFCBHEINFXRS-LPEHRKFASA-N Ser-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ZSLFCBHEINFXRS-LPEHRKFASA-N 0.000 description 1
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 1
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 1
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 1
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 1
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 101150018337 Serpinh1 gene Proteins 0.000 description 1
- FKNQFGJONOIPTF-UHFFFAOYSA-N Sodium cation Chemical compound [Na+] FKNQFGJONOIPTF-UHFFFAOYSA-N 0.000 description 1
- 229920001872 Spider silk Polymers 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- PKXHGEXFMIZSER-QTKMDUPCSA-N Thr-Arg-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O PKXHGEXFMIZSER-QTKMDUPCSA-N 0.000 description 1
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 1
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 1
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 1
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 1
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- YZUWGFXVVZQJEI-PMVVWTBXSA-N Thr-Gly-His Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O YZUWGFXVVZQJEI-PMVVWTBXSA-N 0.000 description 1
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 1
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 1
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 1
- WBCCCPZIJIJTSD-TUBUOCAGSA-N Thr-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H]([C@@H](C)O)N WBCCCPZIJIJTSD-TUBUOCAGSA-N 0.000 description 1
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 1
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 1
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 1
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 1
- KKPOGALELPLJTL-MEYUZBJRSA-N Thr-Lys-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KKPOGALELPLJTL-MEYUZBJRSA-N 0.000 description 1
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 1
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 1
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 1
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 1
- NJGMALCNYAMYCB-JRQIVUDYSA-N Thr-Tyr-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJGMALCNYAMYCB-JRQIVUDYSA-N 0.000 description 1
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 1
- SPIFGZFZMVLPHN-UNQGMJICSA-N Thr-Val-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SPIFGZFZMVLPHN-UNQGMJICSA-N 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- MQVGIFJSFFVGFW-XEGUGMAKSA-N Trp-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MQVGIFJSFFVGFW-XEGUGMAKSA-N 0.000 description 1
- WLQRIHCMPFHGKP-PMVMPFDFSA-N Trp-Leu-Phe Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)CC(C)C)C(O)=O)C1=CC=CC=C1 WLQRIHCMPFHGKP-PMVMPFDFSA-N 0.000 description 1
- UIRPULWLRODAEQ-QEJZJMRPSA-N Trp-Ser-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 UIRPULWLRODAEQ-QEJZJMRPSA-N 0.000 description 1
- BABINGWMZBWXIX-BPUTZDHNSA-N Trp-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N BABINGWMZBWXIX-BPUTZDHNSA-N 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- ZWZOCUWOXSDYFZ-CQDKDKBSSA-N Tyr-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZWZOCUWOXSDYFZ-CQDKDKBSSA-N 0.000 description 1
- DXYWRYQRKPIGGU-BPNCWPANSA-N Tyr-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DXYWRYQRKPIGGU-BPNCWPANSA-N 0.000 description 1
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 1
- QUILOGWWLXMSAT-IHRRRGAJSA-N Tyr-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QUILOGWWLXMSAT-IHRRRGAJSA-N 0.000 description 1
- LOOCQRRBKZTPKO-AVGNSLFASA-N Tyr-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LOOCQRRBKZTPKO-AVGNSLFASA-N 0.000 description 1
- FIRUOPRJKCBLST-KKUMJFAQSA-N Tyr-His-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O FIRUOPRJKCBLST-KKUMJFAQSA-N 0.000 description 1
- WVGKPKDWYQXWLU-BZSNNMDCSA-N Tyr-His-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCCCN)C(=O)O)N)O WVGKPKDWYQXWLU-BZSNNMDCSA-N 0.000 description 1
- CVXURBLRELTJKO-BWAGICSOSA-N Tyr-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O CVXURBLRELTJKO-BWAGICSOSA-N 0.000 description 1
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 1
- KGSDLCMCDFETHU-YESZJQIVSA-N Tyr-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O KGSDLCMCDFETHU-YESZJQIVSA-N 0.000 description 1
- OGPKMBOPMDTEDM-IHRRRGAJSA-N Tyr-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N OGPKMBOPMDTEDM-IHRRRGAJSA-N 0.000 description 1
- LVFZXRQQQDTBQH-IRIUXVKKSA-N Tyr-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LVFZXRQQQDTBQH-IRIUXVKKSA-N 0.000 description 1
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 1
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 1
- JYVKKBDANPZIAW-AVGNSLFASA-N Val-Arg-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N JYVKKBDANPZIAW-AVGNSLFASA-N 0.000 description 1
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 1
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 1
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 1
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 1
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 1
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 1
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 1
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 description 1
- HQYVQDRYODWONX-DCAQKATOSA-N Val-His-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N HQYVQDRYODWONX-DCAQKATOSA-N 0.000 description 1
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 1
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 1
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 1
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 1
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 1
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 1
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 1
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 1
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 1
- JMCOXFSCTGKLLB-FKBYEOEOSA-N Val-Phe-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N JMCOXFSCTGKLLB-FKBYEOEOSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- TVGWMCTYUFBXAP-QTKMDUPCSA-N Val-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N)O TVGWMCTYUFBXAP-QTKMDUPCSA-N 0.000 description 1
- AYHNXCJKBLYVOA-KSZLIROESA-N Val-Trp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N AYHNXCJKBLYVOA-KSZLIROESA-N 0.000 description 1
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 1
- 239000000370 acceptor Substances 0.000 description 1
- MQRWBMAEBQOWAF-UHFFFAOYSA-N acetic acid;nickel Chemical compound [Ni].CC(O)=O.CC(O)=O MQRWBMAEBQOWAF-UHFFFAOYSA-N 0.000 description 1
- 239000012190 activator Substances 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 238000005273 aeration Methods 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 229960001441 aminoacridine Drugs 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 125000003118 aryl group Chemical group 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 230000000712 assembly Effects 0.000 description 1
- 238000000429 assembly Methods 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 230000002457 bidirectional effect Effects 0.000 description 1
- 230000003851 biochemical process Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 229920001400 block copolymer Polymers 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000000978 circular dichroism spectroscopy Methods 0.000 description 1
- 230000009918 complex formation Effects 0.000 description 1
- 238000010668 complexation reaction Methods 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 150000004696 coordination complex Chemical class 0.000 description 1
- 229910052802 copper Inorganic materials 0.000 description 1
- 239000010949 copper Substances 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- 238000013016 damping Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 239000010432 diamond Substances 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 239000000806 elastomer Substances 0.000 description 1
- 238000000635 electron micrograph Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 210000002744 extracellular matrix Anatomy 0.000 description 1
- 239000010408 film Substances 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 239000007850 fluorescent dye Substances 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 229930182830 galactose Natural products 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- KZNQNBZMBZJQJO-YFKPBYRVSA-N glyclproline Chemical compound NCC(=O)N1CCC[C@H]1C(O)=O KZNQNBZMBZJQJO-YFKPBYRVSA-N 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 1
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 1
- 210000003494 hepatocyte Anatomy 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 230000003284 homeostatic effect Effects 0.000 description 1
- 229910052739 hydrogen Inorganic materials 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 238000001746 injection moulding Methods 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 229910052742 iron Inorganic materials 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 108010034529 leucyl-lysine Proteins 0.000 description 1
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 108010087810 leucyl-seryl-glutamyl-leucine Proteins 0.000 description 1
- 210000003041 ligament Anatomy 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 108010003700 lysyl aspartic acid Proteins 0.000 description 1
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 1
- 108010059573 lysyl-lysyl-glycyl-glutamic acid Proteins 0.000 description 1
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 229920002521 macromolecule Polymers 0.000 description 1
- 238000001840 matrix-assisted laser desorption--ionisation time-of-flight mass spectrometry Methods 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 150000002739 metals Chemical class 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 230000003562 morphometric effect Effects 0.000 description 1
- 238000013425 morphometry Methods 0.000 description 1
- 239000005445 natural material Substances 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 229940078494 nickel acetate Drugs 0.000 description 1
- JDRCAGKFDGHRNQ-UHFFFAOYSA-N nickel(3+) Chemical compound [Ni+3] JDRCAGKFDGHRNQ-UHFFFAOYSA-N 0.000 description 1
- 239000002773 nucleotide Substances 0.000 description 1
- 125000003729 nucleotide group Chemical group 0.000 description 1
- 229920001778 nylon Polymers 0.000 description 1
- 239000007800 oxidant agent Substances 0.000 description 1
- 238000005691 oxidative coupling reaction Methods 0.000 description 1
- 230000036542 oxidative stress Effects 0.000 description 1
- 229910052763 palladium Inorganic materials 0.000 description 1
- KDLHZDBZIXYQEI-UHFFFAOYSA-N palladium Substances [Pd] KDLHZDBZIXYQEI-UHFFFAOYSA-N 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 238000005192 partition Methods 0.000 description 1
- MXHCPCSDRGLRER-UHFFFAOYSA-N pentaglycine Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(O)=O MXHCPCSDRGLRER-UHFFFAOYSA-N 0.000 description 1
- 230000000737 periodic effect Effects 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- 239000003016 pheromone Substances 0.000 description 1
- 230000000704 physical effect Effects 0.000 description 1
- 238000000053 physical method Methods 0.000 description 1
- 230000035790 physiological processes and functions Effects 0.000 description 1
- 108091033319 polynucleotide Proteins 0.000 description 1
- 102000040430 polynucleotide Human genes 0.000 description 1
- 239000002157 polynucleotide Substances 0.000 description 1
- 229920001184 polypeptide Polymers 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 230000001124 posttranscriptional effect Effects 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 102000004196 processed proteins & peptides Human genes 0.000 description 1
- 108010016009 procolipase activation peptide Proteins 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
- 230000009465 prokaryotic expression Effects 0.000 description 1
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 235000004252 protein component Nutrition 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 238000004366 reverse phase liquid chromatography Methods 0.000 description 1
- 210000000582 semen Anatomy 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- 239000013605 shuttle vector Substances 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 1
- 229910001415 sodium ion Inorganic materials 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 238000001179 sorption measurement Methods 0.000 description 1
- 238000002798 spectrophotometry method Methods 0.000 description 1
- 108010028203 spidroin 2 Proteins 0.000 description 1
- 238000005507 spraying Methods 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 230000035882 stress Effects 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 239000013638 trimer Substances 0.000 description 1
- 108700004896 tripeptide FEG Proteins 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- 229910052725 zinc Inorganic materials 0.000 description 1
- 239000011701 zinc Substances 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/52—Genes encoding for enzymes or proenzymes
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/43504—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from invertebrates
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/64—General methods for preparing the vector, for introducing it into the cell or for selecting the vector-containing host
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Biophysics (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Wood Science & Technology (AREA)
- Physics & Mathematics (AREA)
- Medicinal Chemistry (AREA)
- Plant Pathology (AREA)
- Gastroenterology & Hepatology (AREA)
- Microbiology (AREA)
- Toxicology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Tropical Medicine & Parasitology (AREA)
- Cell Biology (AREA)
- Peptides Or Proteins (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Medicinal Preparation (AREA)
- Materials For Medical Uses (AREA)
- Cosmetics (AREA)
- Prostheses (AREA)
Abstract
본 발명은 재조합 콜라겐 유사 단백질을 생산하는 효모 세포에 관한 것이다.
더 나아가 본 발명은 상기 재조합 단백질 생산의 용도를 위한 공동 발현 시스템 또는 그 부분들의 키트와 상기 재조합 단백질 및 그로부터 생산된 실을 생산하는 방법에 관한 것이다. 또한 본 발명은 본 발명의 방법에 따른 단백질 또는 실뿐만 아니라 의료 및 기술의 다양한 분야에 있어 이들의 사용에 관한 것이다.
Description
본 발명은 재조합 콜라겐 유사 단백질을 생산하는 효모 세포에 관한 것이다.
더 나아가 본 발명은 상기 재조합 단백질 생산의 용도를 위한 공동 발현 시스템 또는 그 부분들의 키트와 상기 재조합 단백질 및 그로부터 생산된 실을 생산하는 방법에 관한 것이다. 또한 본 발명은 본 발명의 방법에 따른 단백질 또는 실뿐만 아니라 의료 및 기술의 다양한 분야에 있어 이들의 사용에 관한 것이다.
바다 홍합(Marine mussels)은 조간대(inter-tidal zone)의 거친 서식지에서 발견된다. 그리고 여기서 바다 홍합은 파도와 바람에 노출된 이식 암석(colonizing rocks)에서 매우 성공적으로 자란다. 이러한 성공은 암석의 단단한 표면에 그들을 고정하는 고유한 고정수단에 부분적으로 기인한다. 이러한 고정수단의 일부는 족사(byssus) 또는 홍합 실크(mussel silk)라고 알려진 섬유 구조이다. 상기 족사는 파도의 거친 타격에서 살아남게 하기 위해 홍합을 단단한 표면 또는 암석에 부착시키는 충분한 접착력을 제공한다.
홍합 족사(mussel byssus)는 완벽하게 조그마한 힘줄(건)을 닮은 작은 실들의 다발을 구성하는 세포외 기질(extra-cellular matrix)로 이루어졌다(Qin, Coyne, K.J., and Waite, J.H., 1997, Tough tendons., Mussel byssus has collagen with silk-like domains., Journal of Biological Chemistry 272, 32623-32627). 족사 실(byssus threads)의 한쪽 끝은 연한 고무를 다른 쪽 끝은 딱딱한 나일론을 닮았다. 이러한 속성은 솔기가 없고 점진적인 전이(transition)로 나타나는 독특한 기계적인 속성을 보여준다(Qin, Waite, J.H., 1995, Exotic Collagen Gradients in the Byssus of ht eMussel Mytilus Edulia., The Journal of Experimental Biology 198, 633-644). 또한 족사 실은 상당한 변형에도 붕괴 없이 견딜 수 있고, 압력이 제거되면 원 상태로 되돌아오는 엘라스토머적(elastomeric)인 속성을 갖는다(Vaccaro E., Waite J.H., 2001, Yield and Post-Yield Behavior of Mussel Byssal Thread: A Self-Healing Biomolecular Material, Biomacromolecules 2, 906-911). 족사 실은 말단(distal)의 끝에서 바위에 점착 플라크(adhesive plaque)에 의해 고정된다. 인접하는 끝에서 족사 실은 홍합 다리의 기저에 단단히 붙은 소위 족사 줄기(byssus stem)에 결합한다(도 3 참조).
바다 홍합의 족사 실은 에너지를 흡수하고 방산(dissipation)하는 대용량의 엘라스토머적 섬유이다. 총 흡수 에너지의 70% 이상이 족사에서 방산될 수 있다. 미틸러스(Mytilus) 속(M. edulis 및 M. galoprovincialis)에서 각각 새로운 실은 반경 0.1cm 보다 작고 수 센티미터 길이의 넓이를 갖으며, 사출 성형 반응과 유사한 과정에 의해서 발의 배 쪽(ventral) 홈(groove)에서 5분 안에 생산된다(Waite J.H., 1992, Results Probl., Cell Differ, 19, 27p).
형태학적으로 족사는 인접부에서 말단부로 뿌리(root), 줄기(stem), 실(thread), 및 플라크(plaque) (또는 패드)의 4부분으로 나뉜다. 더 나아가 실은 외관에 따라서, 예를 들면 인접부와 말단부로 나누어지는데 말단부는 매끄럽고 경직되며, 인접부는 연약하고 더 약하다.
족사 실은 엘라스토머적(elastomeric)이다. 영 계수(Young's modulus)는 10 내지 500 MPa의 범위로 낮은데 신장성(extensibility)은 200% 이상일 수 있고, 회복력이 있다. 엘라스틴, 레실린(resiline), 및 압덕틴(abductine)과 같은 다른 단백질 엘라스토머와 같이 족사 실은 매우 질기다. 질김과 에너지 방산(dissipation )은 흡착기관의 두 가지 중대한 속성이다. 주기적 압력(stress)-변형(strain)―분석에 관한 섬유의 에너지 방산은 흡수된 총에너지에 관하여 종종 표준화되며 이력현상(hysteresis) 또는 퍼센트 이력현상으로 보고된다.
하나의 실에 대한 압력-변형 순환은 실의 분리된 인접부 및 말단부의 기계적 기여로 나누어진다. 상기에서 언급했듯이 말단부는 좀더 단단하고 경직되고, 감폭(damping)에 우수한 반면, 인접부는 좀더 연하고 약하지만 더욱 상당한 이력현상 을 갖는다.
족사 실의 기계적 속성은 시간 및 변형 의존적 움직임(behavior)에 의해 좀더 복잡해진다. 항복점(yield point) 위의 변형을 받을 때, 말단부는 도식적인 압력 연화(softening)를 나타내는데, 예를 들면 두 번째 사이클의 시작 계수가 첫 번째 사이클(500-80 MPa)의 계수의 20% 정도 감소한다. 첫 번째 사이클의 계수의 완벽한 회복은 예를 들면 24 시간 보다 길지만 상당한 부분 회복(원래 값의 30%)이 1시간 내에 발생한 수 있다. 이 경우 35MPa의 시작 계수부터 50MPa의 점근(asymptotic) 수준으로 대략 40%의 증가된 변형-경직(strain-stiffening)이 있다.
마스콜로와 와이테(MASCOLO, WAITE, 1986)는 미틸러스(Mytilus)의 족사 실에서 화학적 경사도를 처음 동정했다. 펩신으로 실을 처리한 후 각각 50 kDa 및 60 kDa의 ColP 및 ColD라고 불리는 두개의 펩신 저항성 콜라겐 단편을 동정했다. ColP는 인접부에 우세하게 관찰되고, 말단부에서는 거의 발견되지 않는다. 반대로, ColD의 양은 말단부에서 대략 100%로 증가한다(LUCAS et al., 2002; QIN & WAITE, 1995). 홍합 발에서와 마찬가지로 족사 실에도 실 구조의 형성에 참여하는 콜라겐 유사 단백질이 있다. 이러한 부가적 단백질은 ColNG(NG, no gradient)라고 불리는데 ColD 및 ColP와는 달리 전체 실에 걸쳐 고르게 분포한다. 그 생리적 기능은 다른 두개의 실 콜라겐 사이에서 어댑터 역할을 하는 것이다(QIN & WAITE, 1998).
펩신-절단 단편인 ColD 및 ColP는 소위 프리콜라겐(precollagen) P 및 D에서 유래한다. 미틸러스 에듈리스(M. edulis)에서 유래한 두개의 프리콜라겐들은 히스티딘과 다이 히드록시 페닐알라닌(DOPA, dihydroxyphenylalanine)이 풍부한 말단으로 각각 끝나는 다른 측면 부위들로 측면 배열된 중앙 콜라겐 헬릭스(helix)의 공통 기본 구조를 특징으로 한다(도 1 참조).
족사 콜라겐을 섬유(fiber)로 조립하는 메커니즘은 족사 생화학의 어려운 측면이었다. 콜라겐이 가교 결합(cross-linking)을 통해 안정화된다는 것은 잘 알려져 있으나 그 화학적 성질은 여전히 잘 알려지지 않았다. 콜라겐의 가교 결합에는 두 가지 구별되는 가교 결합(cross-linking)의 가능성이 있는데, 이는 콜라겐 유닛 간의 금속 복합체 형성(complexation) 및 공유 결합 형성이다(Coombs T.L., Keller P.J., 1981, Mytilus byssal threads as an environmental marker for metal ions., Aquat., Toxicol., p291-300; Swann C.P., Adewole T., Waite J.H. 1998, Preferential manganese accumulation in dreissenid byssal threads, Comparative Biochemistry and Physiology, Part B, Biochemistry & Molecular Biology 119B, p755-759). 금속 복합체 형성은 족사에서 발견된 높은 수준의 철, 구리, 니켈 및 아연과 족사 단백질의 양쪽 말단에 금속 결합한 과-히스티딘(histidine-rich) 서열에 의해 제시된다. 더욱이 다이히드록시페닐알라닌(DOPA, dihydroxyphenylalanine)이 모든 프리콜라겐의 양쪽 말단에 있고, 펩타이드(peptidyl)-DOPA는 우수한 금속 결합 부위를 제공하며, 펩타이드-디 히드록시 페닐알라닌-Fe(III) 킬레이트는 해양 접합 플라크(marine adhesive plaque) Mefp-1 단백질에서 보고 된 바 있다(Taylor S.W., Chase D.B., Emptage M.H., Nelson M.J., 및 Waite J.H., 1996, Ferric Ion Complexes of a DOPA-Containing Adhesive Protein from Mytilus edulis., Inorganic Chemistry 35, p7572-7577). 더 나아가 족사 섬유(byssal fiber)에서 EDTA에 의한 금속 이온의 제거는 섬유의 항복 강도(yield strength)를 감소시키는 것으로 알려졌다. 공유 가교 결합은 티로신(tyrosine), DOPA, 및 시스테인(cysteine) 사이에서 산화적 커플링에 의해 일반적으로 형성된다. 높은 유속 및 폭기 조건에서 압력이 가해진 족사에 대한 실험에서 산화의 주요 산물은 5,5?-diDOPA으로 밝혀졌다(Sun C., Vaccaro E., 및 Waite J.H., 2001, Oxidative stress and the mechanical properties of naturally occurring chimeric collagen-containing fibers, Biophysical Journal 81, p3590-3595). 산화 DOPA에 라이신의 미챌-타입(Michael-type) 부가와 같은 다른 가능한 커플링 산물은 발견되지 않았다(Waite J.H., Qin X.-X., 및 Coyne K.J., 1998, The peculiar collagens of mussel byssus, Matrix Biology 17, p93-106).
보통 콜라겐과 같이 각각의 홍합 콜라겐은 세포질내 세망(endoplasmatic reticulum)으로 알파 체인의 수송을 보증하는 20개 아미노산 신호 서열(signal sequence)을 갖는다. 세포질내 세망에서 세 개의 동일한 알파체인은 호모 트라이머(homotrimer)로 조립된다. 펩신-절단된 프리콜라겐D(preColD)를 의미하는 콜라겐D 알파-체인은 SDS-PAGE 분석 시 60kDa, 말디토프 질량 분석기(MALDI-TOF mass spectometry)에 의한 분석 시 47 kDa의 분자량을 갖는다(QIN et al., 1997). 펩신-절단된 프리 콜라겐P(preColP)를 의미하는 콜라겐P(ColP) 알파-체인은 SDS-PAGE 분석 시 55kDa, 말디토프 질량 분석기에 의한 분석기 40 kDa의 분자량을 갖는다(COYNE et al., 1997). 알파-체인의 전구체는 프리 콜라겐 D 및 프리 콜라겐 P로 불리는데 각각 SDS-PAGE 분석에 의해 95 및 97 kDa의 분자량을 갖고, 말디토프 질량 분석기에 의한 분석 시 각각 75 및 80 kDa의 분자량을 갖는다(COYNE et al., 1997; QIN et al., 1997). 두 콜라겐 모두는 콜라겐 타입 I-III에서 전형적인 특징들을 갖는다. 두개 모두 34% 이상의 글리신의 양을 보이며, 콜라겐 도메인 안에 20%의 결합된 프롤린 및 히드록시 프롤린 함량을 함유한다.
플랭킹 부위(flanking region)는 엘라스틴(preColP) 및 실크-피브로인(preColD)과 같은 다른 구조 단백질에 전적으로 해당한다. 이러한 구조적 구성은 홍합 족사의 기계적 작용에 대해 설명한다. 이러한 이유에서 새로운 기술적 응용에 있어서 구성성분(building block)으로서 이렇게 훌륭한 자연 소재인 홍합 족사 콜라겐을 유전자 재조합으로 생산하는 것은 매우 적절한 것이다.
한정된 특징을 갖는 소재, 그 중에서도 과부하 또는 압력후에도 스스로 재생할 수 있는 소재의 개발은 오랫동안 소재과학 분야에서 높은 관심을 받아왔다. 혼성 구조는 특히 전기적 구성 성분 및 장치, 에너지 컨버터 그리고 다른 소재 기술분야에서 점증하는 관심을 받고 있다. 다른 기계적 특성을 갖는 소재의 조합에 의 해서 구조의 접점(structural interface)은 새로운 기술적 문제를 초래할 것이다.
따라서 많은 응용분야에서 소재의 총체적인 부하를 경감하기 위한 누진적 구조(graduated structure)를 제공하는 것은 매우 바람직할 것이다.
더 나아가, 의료분야에서 홍합 콜라겐의 응용과 사용은 높은 잠재적 생적합성(biocompatibility) 때문에 많은 관심을 일으킨다. 이에 근거하여 양질의 면역적합성을 갖는 의료적 이식 및 조직이 생성될 수 있다. 재조합 홍합 콜라겐의 생성은 매우 흥미로우며, 종래 기술적 응용에서 해결해야하는 중요한 과제가 고려될 수 있다.
따라서 본 발명의 주된 목적은 향상된 특성 특히 우량한 강도와 유연성 및 대량으로 발현되는 향상된 특성을 갖는 재조합 홍합 족사 단백질을 제공하는데 있다. 본 발명의 다른 목적은 누진적 구조를 제공하기 위해 근거한 빌딩 블록(building block)의 특이적 배열에 의해 필요한 용도에 특별히 적합한 재조합 홍합 족사 단백질을 제공하는데 있다. 더 나아가 본 발명의 다른 목적은 기존에 알려진 진핵 발현 시스템에서 간이하게 발현될 수 있는 재조합 홍합 족사 단백질을 암호화하는 발현 벡터를 제공하는데 있다. 추가적으로 본 발명의 다른 목적은 향상된 종이, 직물, 및 가죽 제품을 제공하는데 있다. 본 발명의 다른 목적은 새로운 단백질 과 바이오테크놀로지, 의료, 약학과 응용 식품, 화장품 분야 그리고 전자 장비 및 다른 상업적 응용 분야에서 사용을 위한 재조합 홍합 족사 단백질에서 유래한 구(sphere), 나노 섬유(fibril), 하이드로겔(hydrogel), 실(thread), 발포제(foam)와 같은 다른 소재를 제공하는데 있다. 본 발명의 다른 목적은 대량 및 양질의 콜라겐 유사 단백질, 특히 홍합 족사 단백질을 발현할 수 있는 숙주세포를 제공하는데 있다.
상기 목적들은 독립항의 주제어(subject-matter)에 의해 해결될 수 있다. 바람직한 구현예는 종속항에서 상세히 설명된다.
지금까지 재조합 홍합 족사 단백질의 발현은 보고 된 바 없다. 이것은 적어도 부분적으로 상기 단백질 및 그로부터 생산된 실의 발현에 있어 복잡한 과정에 기인하는 것 같다. 콜라겐 생합성의 복잡성은 재조합 콜라겐을 발현하려는 어떠한 시도에 대한 성과에 낮은 예측을 인도한다. 따라서 이러한 시도들은 아마도 부적절하게 접힌(folded) 단백질, 낮은 수율, 최악의 경우에는 콜라겐의 발현이 전혀 없도록 인도될 수 있다.
본 발명은 적절하게 접힌 콜라겐 유사 단백질, 특히 홍합 족사 단백질을 대량 생산하는 숙주 세포 시스템을 제공한다.
본 발명은 특히 하기의 양태 및 구현예에 따른다.
본 발명의 한 양태에 따르면, 본 발명은 재조합 콜라겐 유사 단백질 특히 재조합 홍합(mussel) 족사(byssus) 단백질을 생산하기 위한 하기 요소로 형질 전환된 효모 세포를 제공한다:
a) 상기 재조합 콜라겐 유사 단백질을 암호화하는 일차 발현 벡터; 및
b) 프롤릴-4-히드록실라제(P4H, prolyl-4-hydroxylase)의 유전자를 암호화하는 핵산을 포함하는 2차 벡터.
본 발명자들은 콜라겐 생합성의 복잡성 때문에 콜라겐 유사 단백질의 재조합 생산에 있어서 특정 인자들이 고려되어야 한다는 것을 발견했다. 가장 중요한 세포질내 세망에서의 알파-PH(P4HA) 및 PDI(P4HB) 두개의 서브 유닛으로 구성된 사단위체(tetrameric) 효소인 프롤릴-4 히드록실라제(prolyl-4 hydroxylase)에 의한 프롤린에서 히드록시프롤린(hydroxyproline)으로 전사 후 조절(post-translational modification)이 고려되어야 한다는 것을 발견하였다. 이러한 이유로 원핵 발현 시스템, 예를 들면 박테리아 발현 시스템은 본 발명에서 사용될 수 없을 수 있다.
반면에 효모는 콜라겐의 합성에 필요한 세포 분획(cell compartmentation)을 제공하나, 콜라겐의 합성에 필요한 프롤릴-4 히드록실라제 효소가 결함되어 있다. 이와 별개로 효모는 그 배양이 대량 배양 시스템에 있어서 상대적으로 달성하기 쉽고, 그로 인해 다른 발현 시스템에 비해 재조합 단백질의 수율이 우수하기 때문에 재조합 콜라겐에 대한 바람직한 발현 시스템이 될 수 있다. 따라서 효모에서의 발현은 재조합 콜라겐 유사 단백질, 특히 홍합 족사 단백질의 효과적인(또한 가격적인 측면에서도) 생산에 이를 수 있을 것이다. 그러나 상기 효소 세포의 결함의 결과로 효모세포에서의 상기 단백질의 발현은 현재까지 달성되지 않고 있다.
본 발명자들은 P4H를 갖지 않는 효모 세포에서 인간 P4H 서브유닛을 유전자 재조합 방법으로 생산할 수 있었고, 정확하게 접을(folding) 수 있음을 밝혀낼 수 있었다. 이와 별개로 이들 효모 품종에서 두개의 인간 P4H 서브유닛의 공동 발현에 의해서 홍합 족사 콜라겐의 합성이 가능하고, 안정한 콜라겐이 형성되는 것이 밝혀낼 수 있었다.
흥미롭게도, 두개의 P4H(알파-PH(P4HA) 및 PDI(P4HB)) 유전자의 공동발현은 효모에서 안정한 트리플 헬릭스(triple helix)의 형성으로 충분하고, 콜라겐 특이적 차페론(chaperone), 폴딩 촉진체들(folding promoters), 또는 다른 효소들 예를 들면 Hsp47과 같은 다른 효소들이 필요하지 않고, 다시 말하면 효모에 내재하는 차페론이 충분히 활성을 갖는다. 효모에서 유전자 조작으로 생산된 인간 콜라겐은 천연(native) 콜라겐과 같은 히드록시프롤린의 함량을 갖고, 또한 많은 다른 특성에서 살펴볼 때 동일하다.
효모의 세포질내 세망(ER)으로 효과적인 수송을 위하여 공동으로 발현되는 P4H 서브유닛의 신호 서열(signal sequence)은 중요한 역할을 한다. 국소화(localization)의 최대 충족(sufficiency)은 본 발명의 바람직한 구현예에서 치환에 의해 달성될 수 있는데, 효모의 신호서열, 예를 들면 사카로마이세스 세레비지에(S. cerevisae)의 페로몬 교배 인자 알파-1(mating factor alpha 1, MFa-1)로 사람의 유전자를 치환하여 달성될 수 있다. MFa-1 신호 서열에 의해 조작된 P4H 서브유닛들은 세포질내 세망의 루멘(lumen)으로 효과적으로 수송될 수 있다.
보다 바람직하게는 신호 서열은 서열번호 10에 따른 사카로마이세스 세레비지에의 교배 인자 알파-1(mating factor alpha 1, MFa-1)이다.
상기 효모 세포에 있어서 바람직하게는 사카로마이세스 세레비지에(Saccharomyces cerevisiae), 스키조사카로마이세스 폼베(Schizosaccharomyces pombe), 피치아 파스토리스(Pichia pastoris), 칸디다 알비칸스(Candida albicans), 또는 한세눌라 폴리모프(Hansenula polymorph) 세포가 사용될 수 있다.
일차 발현 벡터는 바람직하게는 하나 또는 하나 이상의 조절인자를 더 포함할 수 있다. 상기 발현 벡터는 효모 세포에서 발현에 적합하여야 한다.
바람직하게 상기 조절인자는 유도성(inducible) 또는 항상성(constitutive) 프로모터에서 선택될 수 있으며, 더욱 상세하게 GPD, GAL4, CUP1, MET25, GAL1 또는 GAL1-10 유전자에서 선택될 수 있다.
본 발명의 다른 구현예에서 상기 발현벡터는 플라스미드이다.
바람직하게 상기 재조합 콜라겐 유사 단백질은 실크 피브로인 또는 엘라스틴이 플랭킹(flanking)된 콜라겐 도메인의 하나 이상의 단편을 포함하거나 그로 구성된 재조합 홍합 족사 단백질이다.
상기 재조합 홍합 족사 단백질은 형성된 단백질의 다른 특성을 제공하는 하나 이상의 빌딩 블록 타입으로 구성된다. 상기 언급한 것처럼, 엘라스틴과 실크 피브로인은 홍합 족사와 유전자 재조합으로 생산된 홍합 족사 단백질의 기계적 작용에 대해 설명해줄 수 있는 특정한 기계적 특성을 갖는다.
따라서 이들 단편들은 한 가지 유형의 단편으로만 사용되거나 또는 변형하여 재조합 단백질은 둘 이상의 다른 단편을 포함할 수도 있다. 예를 들면, 현저한 신축성(elasticity)이 필요하다면, 단백질은 엘라스틴이 플랭킹 된 콜라겐 단편만을 또는 단편들을 주된 구성요소로 포함할 수 있다. 현저한 강도와 단단함이 필요하다면 단백질은 실크 피브로인이 플랭킹 된 콜라겐 단편을 포함할 수 있다. 더 나아가 바람직한 변형으로서 상기 단백질은 단편들의 두 가지 타입의 혼합을 포함할 수 있는데 예를 들면, 한쪽 부위에서 다른 부위로 경사를 형성하여 두 가지 타입의 혼합을 포함할 수 있다. 따라서 단백질/실은 예를 들면 한쪽 부위에 높은 신축성을 갖고 다른 쪽 부위에 경직성을 가질 수 있는 등의 특별하게 적합한 구조를 갖고 형성될 수 있다.
용어 “플랭킹”은 엘라스틴(또는 실크-피브로인)이 콜라겐 도메인의 양쪽 측면에 위치하는 것을 의미한다.
상기 단편들은 자연적으로 얻을 수 있는데 예를 들면 상기 단편들은 미틸러스 속에 속한 종(Mytilus sp.)에서 얻을 수 있고, 바람직하게는 미틸러스 에듈리스(M. edulis), 미틸러스 갈로프로빈시알리스(M. galloprovincialis), 미틸러스 캘리포니안스(M. californians), 또는 제우케리아 데미싸(Geukeria demissa)에서 얻을 수 있다.
본 발명의 바람직한 구현예에 따르면, 본 발명의 상기 재조합 홍합 족사 단백질은 하나 이상의 프리콜라겐P(preColP) 및/또는 프리콜라겐D(preColD) 단편, 또는 그 유사체를 포함하거나 그들로 이루어진다. 상기 단편들은 상기에서 개설되었다. 두개의 프리콜라겐(예를 들면, D 또는 P)들은 미틸러스 에듈리스(M. edulis)에서 얻을 수 있고, 히스티딘과 다이 히드록시 페닐알라닌(DOPA, di hydroxy phenylalanine)이 풍부한 말단이 각각의 끝에 위치한 다른 플랭킹 부위에 의해 플랭킹 된 중앙 콜라겐 헬릭스(helix)의 공통된 기본 구조를 특징으로 한다(도 1 참조). 플랭킹 부위는 엘라스틴(preColP) 및 실크-피브로인(preColD) 같이 알려진 구조 단백질에 완전하게 부합한다.
프리콜라겐P(preColP)와 프리콜라겐D(preColD)의 서열은 각각의 핵산으로부터 번역(translation)된다. 따라서 본 발명은 하기에서 아미노산이 언급될 때마다 프리콜라겐P(preColP)와 프리콜라겐D(preColD)를 지시하며, 이들 서열은 상기 언급된 다양한 기술적 응용에서 사용될 수 있다. 본 발명의 처음에 언급된 핵산 서열은 프리콜라겐P(preColP)와 프리콜라겐D(preColD)를 암호화하는 서열을 지시한다.
본 발명의 다른 구현예에 따르면, 본 발명의 재조합 단백질은 서열번호 3 및/또는 4의 하나 이상의 단편 또는 그 변이체를 포함하거나, 서열번호 3 및/또는 4의 하나 이상의 단편으로 이루어진다. 상기 서열번호들은 프리콜라겐P(preColP)와 프리콜라겐D(preColD)의 서열을 나타낸다.
또한 상기 본 발명은 그 아미노산 서열의 변이체를 포함한다. 예를 들면 상기 변이체들은 상기 언급된 아미노산 서열과 비교할 때 하나 이상의 치환, 삽입, 및/또는 결실을 함유할 수 있다.
단백질의 특별한 변이체에서, 치환, 삽입, 및/또는 결실로 기인한 소위 사일런트 변이체(silent change)도 본 발명의 일부로 간주된다.
상기 아미노산 치환은 바람직하게는 보존적(conservative) 아미노산 치환과 같이 하나의 아미노산을 유사한 구조 및/또는 화학적 속성을 갖는 유사한 아미노산으로 치환한 결과이다.
아미노산 치환은 관련 잔기(involved residue)의 극성(polarity), 전하(charge), 가용성(solubility), 소수성, 친수성, 및/또는 양친매성(amphipathic, 또는 amphiphilic)의 성질의 유사성에 기초하여 수행될 수 있다. 소수성 아미노산의 예로 알라닌, 루신, 이소루신, 발린, 프롤린, 페닐알라닌, 트립토판, 및 메티오닌이 있다. 극성 및 중성 아미노산은 글리신, 세린, 트레오닌, 시스테인, 티로신, 아스파라진, 및 글루타민을 포함한다. 양성(positive) 전하(염기성)의 아미노산은 아르기닌, 라이신, 및 히스티딘을 포함하고, 음성(negative) 전하의 아미노산은 아스파르트 산(aspartic acid)과 글루탐 산(glutamic acid)을 포함한다.
삽입(Insertions) 또는 결실(deletion)은 통상 하나에서 다섯 개의 아미노산의 범위일 수 있다. 변이의 허용되는 정도는 재조합 DNA 기술을 이용하여 폴리펩티드 분자에서 조직적으로 적용된 삽입, 결실, 또는 치환으로 실험적으로 결정될 수 있다. 그 결과 생기는 변이체는 그들의 특성, 특히 그들의 물리적 특성에 대해 실험될 수 있다.
또한 본 발명에서 사용되는 용어 ‘변이체’는 처음의 오리지널 홍합 신호 서열에 포함된 19개 아미노산 서열이 다른 신호 서열로 치환된 프리콜라겐P(preColP)와 프리콜라겐D(preColD)의 상기 아미노산 서열을 포함할 수 있음을 유의해야 한다.
이것의 바람직한 예는 교배 인자 알파의 신호 서열(SEQ ID NO. 10: MRFPSIFTAV LFAASSALA)로 홍합 신호서열을 치환한 것이다. 상기 신호 서열은 특히 효소에서 핵산의 발현에 적합하다.
또한 본 발명은 상기 정의된 재조합 단백질을 암호화 하는 단리된 핵산을 제공한다. 본 발명에서 핵산과 관련하여 용어 “단리된(isolated)”은 그것이 유래된 개체의 자연 발생적인 유전체에서 즉시 인접하는(한쪽은 5’에 다른 한쪽은 3’에) 두개의 서열에 즉시 인접하지 않는 자연 발생적인 핵산을 가리킨다.
예를 들면, 이에 한정되는 것은 아니나, 단리된 핵산은 자연 발생적인 유전체에서 상기 재조합 DNA 분자를 즉시 플랭킹하는 것으로 보통 발견되는 핵산 서열의 하나가 제거되거나 없다면 어떠한 길이의 재조합 DNA 분자도 될 수 있다.
따라서, 단리된(isolated) 핵산은 이에 한정되는 것은 아니나, 분리된(separate) 분자(예를 들면 cDNA 또는 제한효소로 처리되거나 PCR에 의해 생산된 게놈 DNA의 단편)로 존재하는 재조합 DNA를 포함할 뿐만 아니라, 다른 서열들과 별개이고, 또한 자발적으로 복제 가능한 플라스미드, 바이러스(예를 들면, 레트로바이러스, 아데노바이러스, 또는 헤르페스 바이러스)등의 벡터에 통합된 재조합 DNA, 또는 원핵?진핵 세포의 게놈 DNA로 통합된 재조합 DNA를 포함한다. 게다가 단리된(isolated) 핵산은 하이브리드(hybrid) 또는 융합(fusion) 핵산 서열의 일부인 재조합 DNA 분자를 포함할 수 있다.
또한 비-자연발생적인 핵산 서열은 자연에서 발견되지 않고, 자연발생적인 유전체에서는 즉시 인접하는(contiguous) 서열을 갖기 않기 때문에 용어 “단리된”은 어떠한 비-자연발생적(non-naturally-occurring)인 핵산을 포함한다.
예를 들면, 조작된 핵산(engineered nucleic acid)과 같은 비-자연발생적인 핵산은 단리된 핵산으로 간주된다. 조작된 핵산은 통상의 분자 클로닝 또는 화학적 핵산 생산 테크닉에 의해 생산될 수 있다. 단리된 비-자연발생적 핵산은 다른 서열과 독립적이거나, 자발적으로 복제 가능한 플라스미드, 바이러스(예를 들면, 레트로바이러스, 아데노바이러스, 또는 헤르페스 바이러스)등의 벡터 또는 원핵 또는 진핵 세포의 게놈 DNA로 통합될 수 있다.
게다가, 비-자연발생적 핵산은 하이브리드 또는 융합 핵산 서열의 일부인 핵산 분자를 포함할 수 있다.
당업자들에게 있어서, cDNA, 유전체 라이브러리, 또는 유전체 DNA 제한 절단을 포함하는 겔 조각에 있는 다른 수없이 많은 핵산 분자들 가운데 존재하는 핵산은 단리된(isolated) 핵산으로 간주되지 않음은 명확하다.
상기 아미노산을 암호화 하는 핵산은 재조합 홍합 족사 단백질의 미성숙 또는 성숙 아미노산 서열을 암호화하는 핵산 서열일 수 있다.
본 발명의 바람직한 구현예에서, 단리된(isolated) 핵산은 서열번호 1 또는/ 및 서열번호 2, 또는 그 변이체의 핵산을 포함하거나 서열번호 1 또는/ 및 서열번호 2, 또는 그 변이체로 이루어진다. 상기 변이체가 보통 엄격한(moderately stringent) 또는 엄격한 조건에서 서열번호 1 또는 서열번호 2의 핵산 서열과 혼성화(hybridize)하거나, 서열번호 1 또는 서열번호 2의 핵산 서열과 같은 또는 기능적으로 동등의 아미노산을 암호화 하는 유전자 코드의 축퇴(degeneracy)로 인한 핵산 변화를 포함한다면, 상기 변이체는 서열번호 1 또는 서열번호 2의 서열과 비교할 때 각각 하나 이상의 치환, 삽입, 및/또는 결실을 갖는 것으로 정의된다.
또한 상기 언급했듯이, 본 발명은 상기 핵산의 변이체를 포함하는데 처음 19개의 아미노산(신호 서열)을 코딩하는 핵산이 바람직하게는 효모 신호 서열(교배 인자 알파, 서열번호 10)로 치환된 핵산의 변이체를 포함한다.
본 발명에 있어서, 혼성화의 엄격함(Stringency)은 폴리뉴클레오티드 중합체가 안정한 조건인 것을 말한다. 당업자에게 잘 알려졌듯이, 중합체의 안정성은 나트륨 이온 농도와 온도의 함수이다(Sambrook et al., Molecular Cloning, A Laboratory Manual 2nd Ed., Cold Spring Harbor Laboratory, 1989). 혼성화에 사용되는 엄격함(Stringency) 레벨은 당업계에서 쉽게 변화될 수 있다.
엄격한 세척(stringent washing) 조건은 65°C에서 0.2 x SSC (0.03 M NaCl, 0.003 M sodium citrate, pH 7) 및 0.1% SDS의 용매를 사용하여 세척하는 것을 의미한다. 30 뉴클레오티드까지의 올리고뉴클레오티드 같은 작은 단편에 대한 혼성화 온도는 예를 들면 65℃ 이하에서 50℃, 바람직하게는 55℃ 이상 65℃ 이하이다. 엄격한 혼성화 온도는 각각의 핵산 및 핵산 조성의 크기 및 길이에 의존하는데, 당업계의 숙련된 기술자에 의해 실험적으로 결정될 수 있다. 보통 엄격한(moderate stringent) 혼성화 온도는 예를 들면 42℃이고, 세척조건은 42℃에서 0.2 x SSC 및 0.1% SDS의 용매를 사용하여 세척한다.
본 발명에서 사용되는 프롤릴-4-히드록실라제(P4H)는 바람직하게는 사람 또는 홍합의 프롤릴-4-히드록실라제(P4H)이다.
본 발명의 다른 양태에 따르면, 본 발명은 하기 요소를 포함하는 공동 발현 시스템 또는 그 부분의 키트를 제공 한다:
a) 본 발명에서 정의된 일차 발현 벡터; 및
b) 상기 정의된 이차 발현 벡터.
상기 공동 발현 시스템 또는 그 부분의 키트는 효모 세포에서 재조합 홍합 단백질을 발현시키는데 있어서 유용하게 사용될 수 있다.
본 발명의 다른 양태에 따르면, 본 발명은 하기 단계를 포함하는 재조합 콜라겐 유사 단백질 특히 홍합 족사단백질을 생산하는 방법을 제공 한다:
a) 상기 효모 세포를 준비하는 단계;
b) 상기 효모세포를 발현 벡터 또는 상기 공동 발현 시스템으로 형질 전환시키는 단계;
c) 적절한 조건에서 상기 효모 숙주세포로부터 재조합 단백질을 발현시키는 단계; 및
d) 상기 재조합 단백질을 회수(recovering)하는 단계.
더 나아가 본 발명은 하기 단계로 구성된 재조합 홍합 족사 단백질로부터 실을 생산하는 방법을 제공 한다:
a) 상기 방법에 따라 생산된 재조합 단백질을 준비하는 단계; 및
b) 상기 단백질을 적절한 방법에 의해 실(thread)로 방적(spinning)하거나 몰딩(molding)하는 단계.
상기 스피닝은 바람직하게는 전자기적 스피닝(electrospinning)이 될 수 있다. 전자기적 스피닝은 나노미터 지름의 섬유를 지속적으로 형성시키는데 전자기력을 사용하는 섬유 형성 기술이다. 천연 및 인공 폴리머의 다양한 종류는 용해 상(melt phase) 및 용액에서 전자기적 스피닝에 의해 제조되고, 높은 표면적 물질(여과 막 및 바이오 의료 장비)을 요구하는 응용 분야에서 중요하다.
본 발명의 다른 양태에 따르면, 본 발명은 상기 방법 중의 하나에 의해 얻을 수 있는 단백질 또는 실을 제공한다.
상기 본 발명의 단백질 및 실은 바람직하게는 바이오테크놀로지 및/또는 의료분야에 응용될 수 있다.
예를 들면, 상기 단백질 및 실은 봉합 물질(suture material), 피복 시스템(coverage system), 또는 상처 봉합(wound closure)에 사용될 수 있다. 더 나아가 상기 단백질 및 실은 바람직하게 대체 물질(replacement material), 바람직하게는 인공 연골 또는 건(tendon)의 제조에 사용될 수 있다.
또한 본 발명의 상기 단백질 및 실은 접착용 스트립(adhesive strip), 피부 이식조직(skin graft), 대체 인대(replacement ligament), 및 수술용 망(surgical mesh)과 같은 의료용 장비를 생산하는데 사용될 수 있다. 그리고, 의류 직물(clothing fabric), 방탄조끼 안감(bullet-proof vest lining), 컨테이너 직물(container fabric), 가방 또는 지갑 끈, 케이블, 로프, 접착 묶음 재료, 비-접착 묶음 재료, 끈 재료, 자동차 덮개 및 부품, 비행기 구조 재료, 레인코트(weatherproofing) 재료, 탄력적 칸막이 재료, 스포츠 장비등과 같은 상업적 또는 공업적으로 넓은 범위에서 사용될 수 있다. 그리고 사실상 높은 신장 강도 및 신축성이 필요한 어떠한 섬유나 옷감에 대해서도 사용할 수 있다. 또한 본 발명에서는 융통성 있게 다른 조성물과의 혼합사용으로 또는 건조 스프레이 코팅, 구슬 유사 입자(bead-like particle)와 같은 다른 형태의 안정한 섬유 산물로서 사용될 수 있다.
종이의 처리와 제조뿐 아니라 비행기 구조 재료, 자동차 덮개 및 부품, 또는 의류의 직물(fabric 또는 textile) 및 가죽의 처리와 제조에도 본 발명의 홍합 족사 콜라겐의 바람직한 응용이 있음은 명백하게 언급되었다.
본 발명의 재조합 홍합 족사 단백질은 셀룰로오스, 케라틴, 및 콜라겐 산물에 첨가될 수 있으며, 또한 본 발명은 셀룰로오스, 케라틴, 및/또는 콜라겐과 본 발명의 단백질을 포함하는 종이, 스킨케어(skin care) 제품, 또는 헤어케어 제품과 관련되어 있다. 본 발명의 단백질이 함유된 종이, 스킨케어(skin care) 제품, 또는 헤어케어 제품은 특히 향상된 신장강도(tensile strength 및 tear strength)의 향상된 특성을 보인다.
더 나아가 본 발명의 재조합 홍합 족사 단백질은 직물 및 가죽 제품에 대한 코팅용으로 사용될 수 있는데, 코팅된 제품의 내구성 및 안정성을 부여한다. 특히 단백질은 가죽 제품의 코팅에 적용가능성을 보이는데, 이런 경우에 무두질 및 환경에 대한 악영향은 피할 수 있거나 적어도 줄일 수 있다.
또한 본 발명은 상기 홍합 족사 단백질을 포함하는 제품에 관한 것이다. 예를 들면 상처 봉합(wound closure), 피복 시스템(coverage system), 봉합 재료(suture material), 대체 재료(replacement material)(바람직하게는 인공 연골 또는 건), 화장품(cosmetics), 약물 전달 물질(drug delivery vehicle), 직물, 옷감, 종이 제품, 가죽 제품, 자동차 부품 또는 비행기 부품에 관한 것이다. 일반적으로 본 발명은 구(sphere), 나노 섬유(fibril), 하이드로겔(hydrogel), 발포제(foam), 필름과 같은 재조합 홍합 족사 단백질에 근거한 재료에 관한 것이다.
다른 정의가 없다면, 본 발명에서 사용된 모든 기술적인, 과학적인 용어들은 본 발명이 속한 기술분야에서 통상의 기술을 갖는 자에 의해 보통 이해되는 의미와 같은 의미를 갖는다. 본 발명에서 언급된 모든 간행물, 특허 출원, 특허, 및 다른 참고문헌은 온전히 참고문헌으로 포함되었다. 다툼이 있는 경우에는, 본 발명의 정의를 포함하여 명세서가 우선한다. 또한 본 발명의 실시예는 단지 본 발명을 설명하고자 하는 것으로 이에 한정되는 것은 아니다.
지금부터 본 발명의 실시예 및 수반하는 다음을 나타내는 도면을 통하여 더 상술하기로 한다
도 1은 홍합 족사 콜라겐의 일반적인 구조를 설명하는 것이고;
도 2는 말단부에서 인접부 방향으로 족사 실의 전자현미경 이미지의 시리즈를 나타내는 것이고(a 말단부; b 중앙부; c 인접부로 표시된 부위는 각각 확대되어 나타낸다);
도 3은 홍합 족사의 구조를 나타내고;
도 4는 족사 실에 의해 고체 표면에 붙은 홍합을 나타내고;
도 5의 (a)는 실에 있어 프리콜라겐의 분포; (b)는 플랭킹 도메인과 콜라겐의 서브유닛의 개략도(Schematic)(다이아몬드로 표시된 말단부는 히스티딘이 풍부하며, DOPA는 Y로 표시되었다); (c)는 축(axial) 및 측면(lateral)의 프리콜라겐 사이의 가교 연결된 상호작용의 모델을 나타내고;
도 6은 프롤릴-4-히드록실라제(P4H)의 구조를 도시한 것이고;
도 7은 각각의 발현 플라스미드에 클로닝 될 교배 인자-α(MF-α) 신호 서열을 발생하기 위한 올리고뉴클레오티드를 도시한 것이고;
도 8은 α-PH에 대한 클로닝 전략을 나타낸 것이고;
도 9는 벡터의 개열지도를 도시한 것이다.
효모에서
홍합
족사의
콜라겐 단백질의 발현
일반적인 콜라겐의 합성은 복잡한 생화학 과정을 반영한다. 상기 과정은 세포질내 세망(ER)에서 예를 들면 각각 콜라겐의 특정 프롤린에서 4-히드록시프롤린(4-hydroxyproline)으로의 프롤릴 4-히드록실라제(P4H)에 의한 전사 후 조절(post-translational modification)을 요구한다. 척추동물에서 α2β2의 4단위체인 P4H는 콜라겐의 합성에 중요한 역할을 한다. P4H에 의해 발생되는 4-히드록시프롤린(4-hydroxyproline) 잔기(residue)는 새로 합성되는 콜라겐 펩타이드 체인의 트리플-헬릭스(triple-helical) 콜라겐으로 폴딩(folding)에 필수적이다(Prockop D.J., Kivirikko K.I., 1995, Annu. Rev., Biochem., 64, 403-434).
인간 P4H(Prolyl-4-Hydroxylase) 발현 구성(expression construct)
P4H의 구성에는 α-PH와 PDI의 P4H의 두 서브유닛에 대한 유전자 부근에 효소 벡터로 신호 서열의 클로닝이 필요하다. 두개의 유전자는 양방향(bi-directional) 프로모터의 조절 아래 위치하는데, 상기 프로모터는 갈락토오스(Gal1/10) 존재 시에 유도된다(도 6 참조). 신호 서열은 P4H의 서브유닛의 세포 질내 세망으로 이동에 필요하다. 이동의 최대 효율은 인간 신호 서열이 효모 자신의 신호서열인 교배 인자 α-MF로 치환될 때 달성되었다(Myllyharju J., Nokelainen M., Vuorela A., 및 Kivirikko K.I., 2000, Expression of recombinant human I-III collagens in the yeast Pichia pastoris, Biochem, Soc. Trans. 28, 353-357)(도 6 참조).
신호 서열 없는 α-PH에 대한 유전자는 HepG2 간세포(아담스키교수에 의해 제공됨, GSF Munich, Germany)에서 c-DNA 라이브러리에서 PCR에 의해 증폭될 수 있고, 신호 서열이 없는 β-서브유닛(PDI)의 c-DNA는 대장균 클로닝 벡터(네일 뷸레이드(Neil Bulleid) 교수에 의해 제공됨, University of Manchester, UK)에서 증폭 될 수 있다. 각각의 유전자에 대해 상응하는 교배 인자 α 신호 서열은 두개의 단일 가닥 올리고뉴클레오티드에 근거하여 설계될 것이다. 올리고 A와 올리고 B는 어닐링 후에 이중 가닥 DNA가 바로 각각의 벡터로 클로닝 될 수 있는 방법으로 설계되었다(도 7 참조).
α-PH의 클로닝 전략은 도 8의 예에 나타난다. PDI의 cDNA 클로닝은 동일한 방법으로 수행될 수 있다. 두개의 다른 효소 벡터를 사용할 수 있는데, 양방향 Gal1/10 프로모터를 함유하고, 하나의 플라스미드에서 두개 서브유닛의 동시 발현을 허용하는 pRS315(CEN, 싱글 카피 넘버 플라스미드를 반영하는)와 pRS425(2μ, 멀티 카피 넘버 플라스미드를 반영하는)의 효소 벡터이다.
프리콜라겐D(preColD)와 프리콜라겐P(preColP)의 재조합 생산
홍합 족사의 두 주요 단백질 구성요소인 프리콜리겐D(preColD)와 프리콜라겐P(preColP)의 재조합 합성은 본 발명의 실시예에 따른다. 대장균 클로닝 벡터에 있는 프리콜리겐D(preColD)와 프리콜라겐P(preColP)의 c-DNA를 와이테 교수(Prof. Waite, UCSB, USA)에게서 얻었다. 상기 c-DNA를 PCR에 의해 증폭한 후 다른 효소 발현 벡터로 클로닝 했다. 효소 발현 벡터는 세포 당 카피 넘버가 다를 뿐 아니라 활성제(activator)(유도성 프로모터 GAL4 또는 항상성 프로모터 GPD)의 선택에 있어서도 다르다. 또한 원래 신호 서열은 최대 이동 효율을 위해 효소의 교배인자 알파의 신호서열로 대체될 것이다.
재조합 합성과정 중에 콜라겐P(ColP)와 콜라겐D(ColD)의 탐지
홍합 콜라겐의 효율적인 재조합 합성에 대한 테스트는 홍합 콜라겐에 대한 다클론성(polyclonal) 항체의 효용성을 필요로 한다. 인간 콜라겐 유형 I-III에 대한 다클론성 항체의 예비 테스트는 홍합 족사로부터 화학적으로 변형된 콜라겐에 대한 매우 약한 교차-반응성(cross-reactivity)을 보여준다. 이 교차-반응성은 재조합 합성 동안에 존재하는 콜라겐의 레벨을 탐지하는데 충분하지 않다. 따라서 홍합 콜라겐에 대하여 항체를 증가할 필요가 있다. 항체를 증가시키기 위해서, 정제된 천연 홍합 콜라겐이 필요하다. 족사는 신선한 홍합에서 추출될 수 있고, 여러 크로마토그래피 방법(그 중에서도 특히 역 위상(reverse phase) 크로마토그래피)을 사용하여 정제될 수 있다.
프리콜라겐D(preColD)와 프리콜라겐P(preColP)를 포함한 정제된 단백질 샘플로 토끼를 면역화 시키는데 사용하고, 항체를 생산하였다.
재조합 콜라겐에 대한 생체물리학적 연구
각각의 프리콜라겐P 및 프리콜라겐D 단백질을 특성화하고 자기 조립(self-assembly)의 섬유 형성 효율을 측정하는데 다양한 물리적 방법을 사용할 수 있다. 이런 방법들은 원/근-자외선 원형 광이색성(far- and near-UV circular dichroism), 정지 및 이동 광산란 분석법(light scattering), 푸리에 변환 적외선 분광광도법(FTIR, fourier transformed infrared spectroscopy), 전자 현미경(electron microscopy, EM), 원자력 현미경(AFM, atomic force microscopy), 및 장 흐름 분획법(field flow fractionation)을 포함한다.
프리콜라겐
P 및
프리콜라겐
D의 특성
프리콜라겐P 및 프리콜라겐D의 이차 및 삼차 구조를 결정하기 위하여 원형 광이색성(circular dichroism) 및 푸리에 변환 적외선 분광광도법(FTIR, fourier transformed infrared spectroscopy)을 사용하였다. 또한 그들의 화학적 열역학적 안정성을 여러 가지 조건하에서 테스트하였다. 콜라겐의 형성에 관여하는 단백질의 형태에 대한 데이터는 광산란 분석법(light scattering), 원자력 현미경(atomic force microscopy), 및 장 흐름 분획법(field flow fractionation)에 의해 제공된다.
섬유 형성의 효율 및 속도의 평가
홍합 족사 콜라겐의 이차 및 삼차 구조를 원형 광이색성 및 푸리에 변환 적외선 분광광도법으로 분석할 수 있다. 원자력 현미경(atomic force microscopy)과 전자 현미경은 조립된 집합체와 섬유의 형태와 사차 구조에 대한 정보를 제공한다. 단일 상 매트릭스-프리 크로마토그래피(a one-phase matrix-free chromatography)의 장 흐름 분획법(FFF, field flow fractionation)은 콜라겐의 조립 동안에 형성된 다른 종류의 형태를 평가하는데 사용될 수 있다. 장 흐름 분획법은 매트릭스 프리 크로마토그래피 기법이므로, 다른 용해된 마크로 분자들, 특히 다른 전통적 크로마토그래피 기법에 의해 분리될 수 없는 특히 섬유(fiber)를 분리할 수 있다.
조립 과정의 반응속도(kinetics)는 섬유 형성 동안 이차 및 삼차 구조에 변화를 모니터링 하여 시간의 함수로서 수행될 수 있는 원형 광이색성 및 푸리에 변환 적외선 분광광도법으로 측정할 수 있다. 더 나아가 정지 광산란 분석법(light scattering), 이동 광산란 분석법, 및 시간 경과 원자력 현미경(time-lapse AFM)은 실시간으로 단백질의 조립을 모니터할 수 있게 한다.
형광 염료는 단백질 조립과 관련한 구조적 변화를 연구하는데 사용될 수 있 다. 3-클로로 6-메톡시 9 아미노아크리딘(3-chloro-6-methoxy-9 aminoacridine)과 아미노 나프탈렌 술폰 산(amino naphthalene sulfonic acid)의 N-벤질 유도체(N-benzyl derivative)와 같은 염료의 형광 특성은 단백질 환경의 극성에 따라 변화한다. 따라서 이들 염료로 콜라겐을 표지화(labeling)하는 것은 그 조립 과정을 연구하는데 사용될 수 있다.
콜라겐의 가교 결합과 조립에 금속의 역할에 대한 연구
다이
히드록시 페닐알라닌(
DOPA
,
di
hydroxy
phenylalanine
)과 티로신(
tyrosine
) 가교 결합에 있어서
GGH
의 역할
아미노산 서열 GGH는 프리콜라겐P 와 프리콜라겐D 두개 단백질 모두의 카르복실 말단에서 관찰되었다. 트리펩티드인 NH2-Gly-Gly-His-COOH(GGH)는 니켈 아세테이트[Ni(OAc)2]와 산화제인 마그네슘 모노퍼옥시프탈레이트(MMPP, monoperoxyphthalate)의 존재 하에 수용액에서 관련 단백질의 가교결합을 매개한다(Brown K.C., Kodadek T. 2001, Protein cross-linking mediated by metal ion complexes, Metal Ions in Biological Systems 38, 351-384; Fancy D.A., Denison C., Kim K., Xie Y., Holdeman T., Amini F., Kodadek T., 2000, Scope, limitations and mechanistic aspects of the photo-induced cross-linking of proteins by water-soluble metal complexes, Chemistry & Biology 7, 697-708). 상기 펩티드는 니켈 센터에 바람직한 배위(coordination) 환경을 제공하고, 추정의 니켈(III) 중간체는 접근 가능한 티로신의 방향족(aromatic) 링에서 전자를 추출하여 수소이온의 상실 후에 티로신 라디칼(tyrosyl radical)에 이르는 것으로 생각된다(도 5 참조). 매우 활성화된 라디칼 중간체는 근처의 티로신과 짝을 이루어 가교 결합한 부가물을 생성한다.
[티로신 가교결합 메커니즘]
홍합 콜라겐의 카르복시 말단에서 GGH의 가능한 역할은 활성 촉매 Ni-GGH를 형성하기 위하여 Ni(II)과 결합하는 것일 수 있다. 이 복합체는 가교 결합을 형성하기 위하여 티로신과 다이 히드록시 페닐알라닌의 공기 산화를 서서히 촉진시킨다. 티로신 및/또는 다이 히드록시 페닐알라닌에 대한 촉매의 접근성은 산화 속도를 상당히 증가시킨다. 이러한 가설을 테스트하기 위해 GGH 서열을 유전적으로 제거하거나 변형하여 니켈과 결합할 수 없게 할 수 있다. 가교 결합 및 조립의 속도는 전술한 것과 같이 모니터 할 수 있다.
콜라겐에서 가교 결합을 형성하기 위한 티로신의 화학적 산화
루테니움(II)(ruthenium(II)) 트리스(바이피리딜)(tris(bipyridyl)) 디캐타이온(dication)[Ru(bpy)3 2+] 및 암모늄 퍼설페이트(APS, ammonium persulfate)와 같은 전자 수용체의 존재 하에서 가시광선의 조사(irradiation)는 근접하는 단백질들 간에 매우 효과적인 가교 결합을 유도한다(Brown K.C., Kodadek T., 2001, Protein cross-linking mediated by metal ion complexes, Metal Ions in Biological Systems 38, 351-384; Burdine L., Gillette T.G., Lin H.-J., Kodadek T., 2004, Periodate-Triggered Cross-Linking of DOPA-Containing Peptide-Protein Complexes, Journal of the American Chemical Society 126, 11442-11443; Kim K., Fancy D.A., Carney D., Kodadek T., 1999, Photoinduced Protein Cross-Linking Mediated by Palladium Porphyrins, Journal of the American Chemical Society 121, 11896-11897). 이 과정은 매우 효과적이며, 이 과정의 메커니즘은 Ni/GGH/MMPP의 메커니즘과 유사하다고 추정되었다. 프리콜라겐P 및/또는 프리콜라겐D의 자기 조립(self-assembly)에 의해 형성되는 섬유는 암모늄 퍼설페이트의 존재 하에 [Ru(bpy)3 2+]와 조사(irradiation)를 받을 것이다. 이 과정은 족사 콜라겐에서 티로신과 DOPA의 향상된 가교 결합을 이끌어 낼 것이고, 상기 물리 화학적 성질에 의해 평가될 수 있는 변형된 기계적 속성을 갖는 섬유로 이끌 것이다.
다른 예와 서열:
홍합 콜라겐 preColP 및 proColD의 DNA 서열은 다음과 같이 제공된다. 두개 모두의 프리 콜라겐 단백질(P 및 D)을 pGEM-T 클로닝 벡터에 합병하였다. 시작물질을 검증하기 위하여, 두개의 cDNA를 완전히 서열분석하고, 표준화 프라이머로서 T7과 SP6를 사용하였다. 프리콜라겐(P 또는 D)의 내부(internal) 프라이머로서 T7/1 및 SP6/1을 각각 사용하였다. 얻어지는 DNA 서열은 두개의 프리콜라겐의 간행된 버전에 따라 차이점을 보이고, 적절히 비교된다.
preColP
(
SEQ
ID
NO
:1)
atggttcg gttctcccta gcatcggtac tattactggc agtcaccagc acagctttcg ctggaccagt tagtgattat ggtggtggtg gaatcaaagt agtaccctac cacggaggcg gaggtggaag cggcggcggt ggcggtggag gccatggcgg aagcggtatt ggtggtatcg gaggaggatc atcacatgca catgcccact cttcagcatc tgcccatgtg caccattttg gaccaggtgg atcttcacat gcatcagctg gttcatcatc ccatgcatcc gcatcccata acggtttagg aggtggcagt gctcatgcac atagcagttc cagcgccaac gctcattccg gtggattcgg tggattcggc ggtattggtg gtattggcgg tattggccca ggaggaagtg tcggaggcgg tattggccca ggaggaagtg tcggaggcgg cattggcggt attggcggta ttggcggcgg tggtggacca ggcggtaatg gcggtatcgg attcggacca ggattcggag gaggattcgg accaggttca tctgctagtg gatccggaag tggcagcgca ttcggtggtc caggaggttc aagcgcaagc gcaaacgcag ctgcacgtgc aaatgcaaat ggtggtggag gattcggtgg accaggtacc ccaggaaact caggaccacc aggccaaccc ggactaccag gagcaccagg ccaaccagga cgtccaggaa gtaccccacc aggtcgacca ggaaaccccg gaccaccagg tcaaccaggt aacccaggacgtccaggctc ttcaggaaga ccaggaggat ccggccaacc aggaggtcca ggacgtccag gaacccccgg caaaccagga aaccgaggac aaccaggaca gccaggcggc ccaggacaac caggtcaccc aggagcagga ggacaaccag gacgaaacgg aaatccagga aaccccggta aaccaggaac accaggtcac ccaggaacag caggatcacg aggaatgcca ggaaccccag gaaccccagg acaaccagga attccaggca ccgtcggagg acgaggacca agaggaccag ctggaatcat cggattaatt ggaccaaaag gaaatccagg agagccagga aatccaggtg caccaggagg cccaggatct acaggaccac aaggaccaca aggaccagcc ggaggaccag gagcatcagg cggaccagga gacaaaggcg caccaggtac accaggagga actggaccaa gaggaccaat cggaccatca ggaccatcag gagcaccagg ggaccaagga ccacaaggag gtagaggaac accaggactc gcaggcaaac caggacctaa aggactacaa ggatcaaatg gagaagttgg accccaagga ccatctggac ccgcaggacc acaaggccca caaggaaaga acggtgtcaa aggagcagca ggagatcaag gagctagggg accagaagga aaagccggac cagctggacc acaaggagaa acaggaccaa aaggaccaac aggagcacaa ggaccagccg gtccagccgg accatcagga gaacaaggac caggagggga aagaggaggc cagggaccac aaggagctga aggaccaagt ggaccagcag gaccaagagg accagcagga tcacaaggac caagtggtga acgcggagaa ccaggagcac caggtaaaaa aggaccaaat ggagaccgag gaaaccaagg atcaccagga gcaccaggca aaaacggagc acgaggaaat agaggatcaa gaggaagcaa cggatcaccc ggcagatcag gatcaccagg aagccgagga aaaccaggac cacaaggacc acatggacca agaggagcaa gaggatcacc aggacaaaaa ggaccacgtg gagaccaagg agcaccaggt gttattcgta ttgttatcga tgaccagaga acaggaccag aagttgcaga attcccagga tttggtggat tcggaggagc ttcagctaac gcagcaagtt cagcaaatgc atttgctggt ggacccggtg gttccgctgg agcaggttca tcatcaggag ctaacgcaaa cgcaggtgga ttcccattcg gaggaggacc attcggagga gcaggaggtg gtcccggagc agcaggaggc ccaggaggag caggaggccc aggaggagta ggaggaggag ttggaggtgg accaggagga gtaggaggtg gagtaggagg tggaccagga ggagtaggag gtggaccagg aggagcagga ccaggaggag caggaggatt tggaccagga ggagcaggag gatttggtgg atttggagga ggatctagcg ctggagcatc atcatcagga tcagcatctg catctaacgg tggaccattc ggagtactca atgtaggacc cggaggtaga atcggtggtg gaagcgcatc agcatctgca gcatctagag cacatgcaca cgcttttggt ggtctcggag ggggaagtgc ctcagctggt agtcattcct catctagctc acactcattt ggcggacacg tattccacag tgtgacccat catggaggtc catcacatgt ttcaagcgga ggtcacggag gtcatggagg aggtccatac aaacctggat attaa
유전자 코드의 축퇴현상 때문에 모든 염기의 교환이 아미노산의 교환에 이르는 것은 아니다. 따라서 DNA 서열을 아미노산 서열로 번역하고 비교하였다. 여기서 간행된 서열(COYNE et al., 1997)[gi:2386675](variant P38, (COYNE & WAITE, 2000))을 서열분석에 의해 얻어진 프리콜라겐P의 서열과 정렬한 것을 나타내었다. 데이터베이스 서열은 서열번호 9에 부합하며 서열 분석된 프리콜라겐P의 서열은 서열번호 3이다.
Database 1 MVRFSLASVLLLAVTSTAFAGPVSDYGGGGIKVVPYHGGGGGGGGGGGGG 50
||||||||||||||||||||||||||||||||||||||||||.|||||||
Sequencing 1 MVRFSLASVLLLAVTSTAFAGPVSDYGGGGIKVVPYHGGGGGSGGGGGGG 50
Database 51 HGGSFRNGRHGGIGGIGGGSSHAHAHSSASAHVHHFGPGGSSHASAGSSS 100
|||| |||||||||||||||||||||||||||||||||||||||
Sequencing 51 HGGS-------GIGGIGGGSSHAHAHSSASAHVHHFGPGGSSHASAGSSS 93
Database 101 HASASHNGLGGGSAHAHSSSSANAHSGGFGGFGGIGGIGGIGPGGSVGGG 150
||||||||||||||||||||||||||||||||||||||||||||||||||
Sequencing 94 HASASHNGLGGGSAHAHSSSSANAHSGGFGGFGGIGGIGGIGPGGSVGGG 143
Database 151 IGPGGSVGGGIGGIGGIGGGGGPGGNGGIGFGPGFGGGFGPGSSASGSGS 200
||||||||||||||||||||||||||||||||||||||||||||||||||
Sequencing 144 IGPGGSVGGGIGGIGGIGGGGGPGGNGGIGFGPGFGGGFGPGSSASGSGS 193
Database 201 GSAFGGPGGSSASANAAARANANGGGGFGGPGTPGNSGPPGQPGLPGAPG 250
||||||||||||||||||||||||||||||||||||||||||||||||||
Sequencing 194 GSAFGGPGGSSASANAAARANANGGGGFGGPGTPGNSGPPGQPGLPGAPG 243
Database 251 QPGRPGSTPPGRLGNPGPPGQPGNPGRPGSSGRPGGSGQPGGPGRPGTPG 300
||||||||||||.|||||||||||||||||||||||||||||||||||||
Sequencing 244 QPGRPGSTPPGRPGNPGPPGQPGNPGRPGSSGRPGGSGQPGGPGRPGTPG 293
Database 301 KPGNRGQPGQPGGPGQPGHPGAGGQPGRNGNPGNPGKPGTPGHPGTAGSR 350
||||||||||||||||||||||||||||||||||||||||||||||||||
Sequencing 294 KPGNRGQPGQPGGPGQPGHPGAGGQPGRNGNPGNPGKPGTPGHPGTAGSR 343
Database 351 GMPGTPGTPGQPGIPGTVGGRGPRGPAGIIGLIGPKGNPGEPGNPGAPGG 400
||||||||||||||||||||||||||||||||||||||||||||||||||
Sequencing 344 GMPGTPGTPGQPGIPGTVGGRGPRGPAGIIGLIGPKGNPGEPGNPGAPGG 393
Database 401 PGSTGPQGPQGPAGGPGASGGPGDKGAPGTPGGTGPRGPIGPSGPSGAPG 450
||||||||||||||||||||||||||||||||||||||||||||||||||
Sequencing 394 PGSTGPQGPQGPAGGPGASGGPGDKGAPGTPGGTGPRGPIGPSGPSGAPG 443
Database 451 DQGPQGGRGTPGLAGKPGPKGLQGSNGEVGPQGPSGPAGPQGPQGKNGVK 500
||||||||||||||||||||||||||||||||||||||||||||||||||
Sequencing 444 DQGPQGGRGTPGLAGKPGPKGLQGSNGEVGPQGPSGPAGPQGPQGKNGVK 493
Database 501 GAAGDQGARGPEGKAGPAGPQGETGPKGPTGAQGPAGPAGPSGEQGPGGE 550
||||||||||||||||||||||||||||||||||||||||||||||||||
Sequencing 494 GAAGDQGARGPEGKAGPAGPQGETGPKGPTGAQGPAGPAGPSGEQGPGGE 543
Database 551 RGGQGPQGAEGPSGPAGPRGPAGSQGPSGERGEPGAPGKKGPNGDRGNQG 600
||||||||||||||||||||||||||||||||||||||||||||||||||
Sequencing 544 RGGQGPQGAEGPSGPAGPRGPAGSQGPSGERGEPGAPGKKGPNGDRGNQG 593
Database 601 SPGAPGKNGARGNRGSRGSNGSPGRSGSPGSRGKPGPQGPHGPRGLRGSP 650
|||||||||||||||||||||||||||||||||||||||||||||.||||
Sequencing 594 SPGAPGKNGARGNRGSRGSNGSPGRSGSPGSRGKPGPQGPHGPRGARGSP 643
Database 651 GQKGPRGDQGAPGVIRIVIDDQRTGPEVAEFPGFGGFGGASANAASSANA 700
||||||||||||||||||||||||||||||||||||||||||||||||||
Sequencing 644 GQKGPRGDQGAPGVIRIVIDDQRTGPEVAEFPGFGGFGGASANAASSANA 693
Database 701 FAGGPGGSAGAGSSSGANANAGGF-----PFGGAGGGPGAAGGPGGAGGP 745
|||||||||||||||||||||||| |||||||||||||||||||||
Sequencing 694 FAGGPGGSAGAGSSSGANANAGGFPFGGGPFGGAGGGPGAAGGPGGAGGP 743
Database 746 GGVGGGVGGGPGGVGGGVGGGPGGVGGGPGGAGPGGAGGFGPGGAGGFGG 795
||||||||||||||||||||||||||||||||||||||||||||||||||
Sequencing 744 GGVGGGVGGGPGGVGGGVGGGPGGVGGGPGGAGPGGAGGFGPGGAGGFGG 793
Database 796 FGGGSSAGASSSGSASASNGGPFGVLNVGPGGRIGGGSASASAASRAHAH 845
||||||||||||||||||||||||||||||||||||||||||||||||||
Sequencing 794 FGGGSSAGASSSGSASASNGGPFGVLNVGPGGRIGGGSASASAASRAHAH 843
Database 846 AFGGLGGGSASAGSHSSSSSHSFGGHVFHSVTHHGGPSHVSSGGHGGHGG 895
||||||||||||||||||||||||||||||||||||||||||||||||||
Sequencing 844 AFGGLGGGSASAGSHSSSSSHSFGGHVFHSVTHHGGPSHVSSGGHGGHGG 893
Database 896 GPYKPGY 902
|||||||
Sequencing 894 GPYKPGY 900
코인과 와이테는 이미 그 cDNA 서열의 특정 부위에서 다른 프리콜라겐P 변이체(P22, P33 및 P38)의 존재를 밝혀내었다(COYNE & WAITE, 2000). 이들 짧고, 변이체 P22의 알려진 서열 부위를 프리콜라겐P의 DNA 서열과 비교하면, 100% 짝을 맞출 수 있다.
preColD
(
SEQ
ID
NO
:2)
atggtcta caaactcctg accgtgtgtc ttgtagcatc tcttctagag atttgcttag ctgactataa cggcaacaaa cagtatggcg gcagatacgg caacagatac ggaaacggtt taggaggcgg taatggtggt gcaggagccg tagcccatgc ccatgcccat gcccatgcca gtgccggagc aaacggaaga gcaagagcac atgcacgagc cttggcccat gcacatgccg gtggtggcgc tgcacatgga cacccaggat tcccagttgg tggtagcgca agcgcagccg cacgagcagc agcacgagca tcagcaggag gattaggtgg attcggatca gcagcagcca atgcagcagc agcagcaaga gcaggagcag gatttggtgg attcggtgga ttaggaggat tcggaggact cggaggagtt ggcggtccag gtcaaccagg acatgccggt aaacacggaa ccgcaggagc agcaggcaaa gcaggacgtc caggaccatg tggagataga ggggcaccag gagtaccagg caaacaagga ccagtaggag gacaaggacc agcaggacca cgaggaccac gaggagatga aggaccagtt ggaccaaagg gcgaaccagg agcaagagga gctgatggta aaccaggaga caaaggacct gatggagaaa ccggaccacaaggaccagct ggaccaaagg gacaagtagg agaccaaggc aaaccaggag caaagggaga aaccggagat caaggagcac gaggtgaagc aggaaaggcc ggcgaacaag gaccaggagg catccaagga ccaaagggac cagtaggagg acaaggacca gcaggaccag ccggaccact cggaccacaa ggaccaatgg gtgaacgagg accacaagga ccaacaggat cagaaggacc agttggagca ccaggaccaa agggatcagt cggagaccaa ggagcacaag gagaccaagg agcaactggc gctgatggca aaaagggaga accaggagag agaggacaac aaggagcagc aggaccagtc ggccgaccag gaccaagagg agatagagga gcaaagggaa ttcaaggaag ccgaggacga ccaggtggta tgggtagacg aggaaaccgt ggatcccaag gagcagtagg accacgagga gaaactggcc cagacggtaa ccaaggacaa cgtggagaac aaggagcacc aggagttatc acccttgtca ttgaagacct cagaacagcc ggagtagaaa gccccgtaga aacctttgac gcaggagcag gaaccggtgg accagcacca ggagtaggag cagcagcaac agcaggagca tttgcaggag caggaccagg aggagctaat gcaggaggaa acgcagccgc aggagcagga ccaggagtag gaccaggagg actcggagga ctaggaggac ttggtgcagg tggactcgga ggtggactcg gcggtggact cggaggatta ggaggagcag gaggtttagg tggtggactc ggaggattag gaggaggttt aggtggtgga ctcggaggtt taggaggtgg agcaggagga gcaggaggcg caggagcagg aggaaacggt ggagcaggag caggaggagc aggaggaaac ggtggaggat cagccgcagc acgagcagca gcacaagcag cagcagcagc aggaggaaac ggtggagcag cacaagcagc agcacaagca gcagcatcag cagcagcaaa ttcaggactt ggagcaggag cagcaagagc agcagcatca gcagccgcta gagcaaccgt agcaggacat ggaagtggaa ccgccgcagc agcagccaac gcagccgcac aagcacatgc agcaacacga ggacaaggag gatcacacgc acacgctgcc gccgcagctc acgcagccgc aagtagcgta atccatggtg gtgactatca cggaaacgat gccggctatc acaaaccagg atattaa
다음에서 간행된 서열(QIN et al., 1997) [gi:2772914]을 프리콜라겐D의 서열분석에 의해 얻어진 전체 서열과 정렬한 것을 나타내었다. DNA 서열을 단백질 서열로 번역하였다. 데이터베이스 서열은 서열번호 8이고, 서열 분석에 의해 얻어진 서열은 서열번호 4이다.
Database 1 MVYKLLTVCLVASLLEICLADYNGNKQYGGRYGNRYGNGLGGGNGGAGAV 50
||||||||||||||||||||||||||||||||||||||||||||||||||
Sequencing 1 MVYKLLTVCLVASLLEICLADYNGNKQYGGRYGNRYGNGLGGGNGGAGAV 50
Database 51 AHAHAHAHASAGANGRARAHARALAHAHAGGGAAHGHPGFPVGGSASAAA 100
||||||||||||||||||||||||||||||||||||||||||||||||||
Sequencing 51 AHAHAHAHASAGANGRARAHARALAHAHAGGGAAHGHPGFPVGGSASAAA 100
Database 101 RAAARASAGGLGGFGSAAANAAAAARAGAGFGGFGGLGGFGGLGGVGGPG 150
||||||||||||||||||||||||||||||||||||||||||||||||||
Sequencing 101 RAAARASAGGLGGFGSAAANAAAAARAGAGFGGFGGLGGFGGLGGVGGPG 150
Database 151 QPGGPGGPGGPGGPGGPGMPGGPGGPSGPGTGGPGQPGGPGGPGGPGGPG 200
|
Sequencing 151 Q------------------------------------------------- 151
Database 201 GPSMPGGPGGPGGPGMPGGPGGPGGPGGAGGIPGMTGPAGPPGPAGPQGP 250
Sequencing 151 -------------------------------------------------- 151
Database 251 EGEQGPRGRTPAGTPGPPGNPGEPGQGGAPGAPGAPGHAGKHGTAGAAGK 300
|||||||||||||||
Sequencing 152 -----------------------------------PGHAGKHGTAGAAGK 166
Database 301 AGRPGPXGQAGASGSSGQHGASGAPGRPGNPGSTGRPGATGDPGRPGATG 350
||||
Sequencing 167 AGRP---------------------------------------------- 170
Database 351 TTGRPGPSGAPGNPGAPGALGAPGPRGSPGFVGLPGPRGSPGEPGNQGPI 400
Sequencing 170 -------------------------------------------------- 170
Database 401 GGPGYPGPRGPQGPDGAMGPQGPCGDRGAPGVPGKQGPVGGQGPAGPRGP 450
|||||||||||||||||||||||||||||
Sequencing 171 ---------------------GPCGDRGAPGVPGKQGPVGGQGPAGPRGP 199
Database 451 RGDEGPVGPKGEPGARGADGKPGDKGPDGETGPQGPAGPKGQVGDQGKPG 500
||||||||||||||||||||||||||||||||||||||||||||||||||
Sequencing 200 RGDEGPVGPKGEPGARGADGKPGDKGPDGETGPQGPAGPKGQVGDQGKPG 249
Database 501 AKGETGDQGARGEAGKAGEQGPGGIQGPKGPVGGQGPAGPAGPLGPQGPM 550
||||||||||||||||||||||||||||||||||||||||||||||||||
Sequencing 250 AKGETGDQGARGEAGKAGEQGPGGIQGPKGPVGGQGPAGPAGPLGPQGPM 299
Database 551 GERGPQGPTGSEGPVGAPGPKGSVGDQGAQGDQGATGADGKKGEPGERGQ 600
||||||||||||||||||||||||||||||||||||||||||||||||||
Sequencing 300 GERGPQGPTGSEGPVGAPGPKGSVGDQGAQGDQGATGADGKKGEPGERGQ 349
Database 601 QGAAGPVGRPGPRGDRGAKGIQGSRGRPGGMGRRGNRGSQGAVGPRGETG 650
||||||||||||||||||||||||||||||||||||||||||||||||||
Sequencing 350 QGAAGPVGRPGPRGDRGAKGIQGSRGRPGGMGRRGNRGSQGAVGPRGETG 399
Database 651 PDGNQGQRGEQGAPGVITLVIEDLRTAGVESPVETFDAGAGTGGPAPGVG 700
||||||||||||||||||||||||||||||||||||||||||||||||||
Sequencing 400 PDGNQGQRGEQGAPGVITLVIEDLRTAGVESPVETFDAGAGTGGPAPGVG 449
Database 701 AAATAGAFAGAGPGGANAGGNAAAGAGPGVGPGGLGGLGGLGAGGLGGGL 750
||||||||||||||||||||||||||||||||||||||||||||||||||
Sequencing 450 AAATAGAFAGAGPGGANAGGNAAAGAGPGVGPGGLGGLGGLGAGGLGGGL 499
Database 751 GGGLGGLGGAGGLGGGLGGLGGGLGGGLGGLG---GGAGGAGAGGNGGAG 797
|||||||||||||||||||||||||||||||| |||||||||||||||
Sequencing 500 GGGLGGLGGAGGLGGGLGGLGGGLGGGLGGLGGGAGGAGGAGAGGNGGAG 549
Database 798 AGGAGGNGGGSAAARAAAQAAAAAGGNGGAAQAAAQAAASAAANSGLGAG 847
||||||||||||||||||||||||||||||||||||||||||||||||||
Sequencing 550 AGGAGGNGGGSAAARAAAQAAAAAGGNGGAAQAAAQAAASAAANSGLGAG 599
Database 848 AARAAASAAARATVTGHGSGTAAAAANAAAQAHAATRGQGGSHAHAAAAA 897
||||||||||||||.|||||||||||||||||||||||||||||||||||
Sequencing 600 AARAAASAAARATVAGHGSGTAAAAANAAAQAHAATRGQGGSHAHAAAAA 649
Database 898 HAAASSVIHGGDYHGNDAGYHKPGY 922
|||||||||||||||||||||||||
Sequencing 650 HAAASSVIHGGDYHGNDAGYHKPGY 674
두개의 서열 간에 상당한 차이가 있다. 프리콜라겐D 서열은 간행된 서열보다 250 아미노산만큼 짧게 사용되었다. 콜라겐 도메인에서 아미노산의 중요 부분이 소실되었다. 따라서 본 발명에서 공개된 프리콜라겐D는 지금까지 간행되지 않았고, 프리콜라겐D 유전자의 알려지지 않은 버전이다. 콜라겐 도메인의 절단은 총 단백질에서 실크 피브로인 도메인의 양을 증가시키고, 총 단백질의 습성이 달라지는 것을 주목해야 한다.
P4H
의 발현 구성(
EXPRESSION
CONSTRUCT
)
다음으로 SacII에서 ApaI까지의 부위에서 P4H에 대한 발현 플라스미드에 따른 DNA 서열은 이중가닥으로 나타내었다. MFa/P4H 융합 구성의 시작과 끝은 두개 모두 프린트체로 표시되었다. 사용된 제한 효소 자리는 밑줄로 표시되었다.
SEQ
ID
NO
: 7
1 ccgcggtcat tacagttcat ctttcacagc tttctgatca tcgtcttcct ccatgtctgg
1 ggcgcc agta atgtcaagta gaaagtgtcg aaagactagt agcagaagga ggtacagacc
SacII 2xStop
61 ctcctctgct tcttccaggt cctcgagatc gtcatcatcc cctgccccat cctggccacc
61 gaggagacga agaaggtcca ggagctctag cagtagtagg ggacggggta ggaccggtgg
121 cgagaggtcc ttaaagaatt ttggtaggtc gcacgcaagg ggcaacatta gttactggca
121 gctctccagg aatttcttaa aaccatccag cgtgcgttcc ccgttgtaat caatgaccgt
181 cctgtcggca ctggcaggaa agaacttgag tgtggggaag ctgtgcactt tgacggcctc
181 ggacagccgt gaccgtcctt tcttgaactc acaccccttc gacacgtgaa actgccggag
241 cacctcgttg gcagtcgagt ccatcttggc gatgacgatg ttctcatggt ccttgtacgt
241 gtggagcaac cgtcagctca ggtagaaccg ctactgctac aagagtacca ggaacatgca
301 ctctcccagt ttatcccaaa tgggagccaa ctgtttgcag tgaccacacc atggggcata
301 gagagggtca aatagggttt accctcggtt gacaaacgtc actggtgtgg taccccgtat
361 gaactccaca aagacgtttt ttttctcatc aaaagccacg tcttcaaagt tcttcccaac
361 cttgaggtgt ttctgcaaaa aaaagagtag ttttcggtgc agaagtttca agaagggttg
421 aagcaccttg acaggctgct tgtcccagtc ctccggcagc tcctggctca tcaggtgggg
421 ttcgtggaac tgtccgacga acagggtcag gaggccgtcg aggaccgagt agtccacccc
481 cttgattttg ccctccagga agcggtggca gaactctgtg atcctctctg ccgtcagctc
481 gaactaaaac gggaggtcct tcgccaccgt cttgagacac taggagagac ggcagtcgag
541 ctccgattcg ggcttgtact tggtcatctc ctcctccagg gtgatgaggc gcacggccgg
541 gaggctaagc ccgaacatga accagtagag gaggaggtcc cactactccg cgtgccggcc
601 gcactcttcc ttcttcaggc caaagaactc gaggatgcgc tggttgtcgg tgtggtcgct
601 cgtgagaagg aagaagtccg gtttcttgag ctcctacgcg accaacagcc acaccagcga
661 gtcgatgaag atgaacagga tcttgccctt gaagctctcg gctgctgttt tgaagttgct
661 cagctacttc tacttgtcct agaacgggaa cttcgagagc cgacgacaaa acttcaacga
721 cagtttgccg tcatagtcag acacactctt gggcaagaac agcaggatgt gagtcttgat
721 gtcaaacggc agtatcagtc tgtgtgagaa cccgttcttg tcgtcctaca ctcagaacta
781 ttcacctcca aaaatcttcg gggctgtctg ctcggtgaac tcgatgacaa ggggcagctg
781 aagtggaggt ttttagaagc cccgacagac gagccacttg agctactgtt ccccgtcgac
841 gttgtgtttg ataaagtcca gcaggttctc cttggtgacc tccccttcaa agttgttccg
841 caacacaaac tatttcaggt cgtccaagag gaaccactgg aggggaagtt tcaacaaggc
901 gccttcatca aacttcttaa agaggacaac cccatctttg tcgagctggt atttggagaa
901 cggaagtagt ttgaagaatt tctcctgttg gggtagaaac agctcgacca taaacctctt
961 cacgtcactg ttggaagtga tcccaaatgg tatgtcatcg atggcctctg ctgcctgcaa
961 gtgcagtgac aaccttcact agggtttacc atacagtagc taccggagac gacggacgtt
1021 aaactgcttg gcagagtccg actccacgtc cttgaagaag ccgatgacag ccacctcgct
1021 tttgacgaac cgtctcaggc tgaggtgcag gaacttcttc ggctactgtc ggtggagcga
1081 ggactccacc aaggactctg cagctgcgcc gtcaggcagg gtggtggcag ccgggcccgt
1081 cctgaggtgg ttcctgagac gtcgacgcgg cagtccgtcc caccaccgtc ggcccgggca
ApaI
1141 gcgcttcttc agccagttca cgatgtcatc agcctctctg ccagctgtat attccttggg
1141 cgcgaagaag tcggtcaagt gctacagtag tcggagagac ggtcgacata taaggaaccc
1201 ggaagccgtg tctccattcc tgaagaactt gatggtggga tagccgcgca cgccgtactg
1201 ccttcggcac agaggtaagg acttcttgaa ctaccaccct atcggcgcgt gcggcatgac
1261 ctgggccagg tcagactcct ccgtggcgtc caccttggcc aacctgatct cggaaccttc
1261 gacccggtcc agtctgagga ggcaccgcag gtggaaccgg ttggactaga gccttggaag
1321 tgccttcagc ttcccagcgg ctttggcata ctcaggggcc agagccttgc agtggccaca
1321 acggaagtcg aagggtcgcc gaaaccgtat gagtccccgg tctcggaacg tcaccggtgt
1381 ggttccccgt atcttgaggt ggtcgtccat gaacacccgg cggtcgcgga ggcgcttcaa
1381 ccaaggggca tagaactcca ccagcaggta cttgtgggcc gccagcgcct ccgcgaagtt
1441 gcttttccgc agcaccagga cgtggtcctc ctcctccgga gcgtcagcta atgcggagga
1441 cgaaaaggcg tcgtggtcct gcaccaggag gaggaggcct cgcagtcgat tacgcctcct
BspEI
1501 tgctgcgaat aaaactgcag taaaaattga aggaaatctc atggatccgg ggttttttct
1501 acgacgctta ttttgacgtc atttttaact tcctttagag ta cctaggcc ccaaaaaaga
Start BamHI
1561 ccttgacgtt aaagtataga ggtatattaa caattttttg ttgatacttt tattacattt
1561 ggaactgcaa tttcatatct ccatataatt gttaaaaaac aactatgaaa ataatgtaaa
1621 gaataagaag taatacaaac cgaaaatgtt gaaagtatta gttaaagtgg ttatgcagtt
1621 cttattcttc attatgtttg gcttttacaa ctttcataat caatttcacc aatacgtcaa
1681 tttgcattta tatatctgtt aatagatcaa aaatcatcgc ttcgctgatt aattacccca
1681 aaacgtaaat atatagacaa ttatctagtt tttagtagcg aagcgactaa ttaatggggt
1741 gaaataaggc taaaaaacta atcgcattat catcctatgg ttgttaattt gattcgttca
1741 ctttattccg attttttgat tagcgtaata gtaggatacc aacaattaaa ctaagcaagt
1801 tttgaaggtt tgtggggcca ggttactgcc aatttttcct cttcataacc ataaaagcta
1801 aaacttccaa acaccccggt ccaatgacgg ttaaaaagga gaagtattgg tattttcgat
1861 gtattgtaga atctttattg ttcggagcag tgcggcgcga ggcacatctg cgtttcagga
1861 cataacatct tagaaataac aagcctcgtc acgccgcgct ccgtgtagac gcaaagtcct
1921 acgcgaccgg tgaagacgag gacgcacgga ggagagtctt ccttcggagg gctgtcaccc
1921 tgcgctggcc acttctgctc ctgcgtgcct cctctcagaa ggaagcctcc cgacagtggg
1981 gctcggcggc ttctaatccg tacttcaata tagcaatgag cagttaagcg tattactgaa
1981 cgagccgccg aagattaggc atgaagttat atcgttactc gtcaattcgc ataatgactt
2041 agttccaaag agaaggtttt tttaggctaa gataatgggg ctctttacat ttccacaaca
2041 tcaaggtttc tcttccaaaa aaatccgatt ctattacccc gagaaatgta aaggtgttgt
2101 tataagtaag attagatatg gatatgtata tggatatgta tatggtggta atgccatgta
2101 atattcattc taatctatac ctatacatat acctatacat ataccaccat tacggtacat
2161 atatgattat taaacttctt tgcgtccatc caaaaaaaaa gtaagaattt ttgaaaattc
2161 tatactaata atttgaagaa acgcaggtag gttttttttt cattcttaaa aacttttaag
2221 aaggaattcg atatcaagct tatcgatacc gtcgacatga gatttccttc aatttttact
2221 ttccttaagc tatagttcga atagctatgg cagctg tact ctaaaggaag ttaaaaatga
EcoRI SalI Start
2281 gcagttttat tcgcagcatc ctccgcgcta gctcatccag gcttttttac ttcaattggt
2281 cgtcaaaata agcgtcgtag gaggcgcgat cgagtaggtc cgaaaaaatg aagttaacca
NheI
2341 cagatgactg atttgatcca tactgagaaa gatctggtga cttctctgaa agattatatt
2341 gtctactgac taaactaggt atgactcttt ctagaccact gaagagactt tctaatataa
2401 aaggcagaag aggacaagtt agaacaaata aaaaaatggg cagagaagtt agatcggcta
2401 ttccgtcttc tcctgttcaa tcttgtttat ttttttaccc gtctcttcaa tctagccgat
2461 actagtacag cgacaaaaga tccagaagga tttgttgggc atccagtaaa tgcattcaaa
2461 tgatcatgtc gctgttttct aggtcttcct aaacaacccg taggtcattt acgtaagttt
2521 ttaatgaaac gtctgaatac tgagtggagt gagttggaga atctggtcct taaggatatg
2521 aattactttg cagacttatg actcacctca ctcaacctct tagaccagga attcctatac
2581 tcagatggct ttatctctaa cctaaccatt cagagaccag tactttctaa tgatgaagat
2581 agtctaccga aatagagatt ggattggtaa gtctctggtc atgaaagatt actacttcta
2641 caggttgggg cagccaaagc tctgttacgt ctccaggata cctacaattt ggatacagat
2641 gtccaacccc gtcggtttcg agacaatgca gaggtcctat ggatgttaaa cctatgtcta
2701 accatctcaa agggtaatct tccaggagtg aaacacaaat cttttctaac ggctgaggac
2701 tggtagagtt tcccattaga aggtcctcac tttgtgttta gaaaagattg ccgactcctg
2761 tgctttgagt tgggcaaagt ggcctataca gaagcagatt attaccatac ggaactgtgg
2761 acgaaactca acccgtttca ccggatatgt cttcgtctaa taatggtatg ccttgacacc
2821 atggaacaag ccctaaggca actggatgaa ggcgagattt ctaccataga taaagtctct
2821 taccttgttc gggattccgt tgacctactt ccgctctaaa gatggtatct atttcagaga
2881 gttctagatt atttgagcta tgcggtatat cagcagggag acctggataa ggcacttttg
2881 caagatctaa taaactcgat acgccatata gtcgtccctc tggacctatt ccgtgaaaac
2941 ctcacaaaga agcttcttga actagatcct gaacatcaga gagctaatgg taacttaaaa
2941 gagtgtttct tcgaagaact tgatctagga cttgtagtct ctcgattacc attgaatttt
3001 tattttgagt atataatggc taaagaaaaa gatgtcaata agtctgcttc agatgaccaa
3001 ataaaactca tatattaccg atttcttttt ctacagttat tcagacgaag tctactggtt
3061 tctgatcaga aaactacacc aaagaaaaaa ggggttgctg tggattacct gccagagaga
3061 agactagtct tttgatgtgg tttctttttt ccccaacgac acctaatgga cggtctctct
3121 cagaagtacg aaatgctgtg ccgtggggag ggtatcaaaa tgacccctcg gagacagaaa
3121 gtcttcatgc tttacgacac ggcacccctc ccatagtttt actggggagc ctctgtcttt
3181 aaactctttt gccgctacca tgatggaaac cgtaatccta aatttattct ggctccagct
3181 tttgagaaaa cggcgatggt actacctttg gcattaggat ttaaataaga ccgaggtcga
3241 aaacaggagg atgaatggga caagcctcgt attattcgct tccatgatat tatttctgat
3241 tttgtcctcc tacttaccct gttcggagca taataagcga aggtactata ataaagacta
3301 gcagaaattg aaatcgtcaa agacctagca aaaccaaggc tgagccgagc tacagtacat
3301 cgtctttaac tttagcagtt tctggatcgt tttggttccg actcggctcg atgtcatgta
3361 gaccctgaga ctggaaaatt gaccacagca cagtacagag tatctaagag tgcctggctc
3361 ctgggactct gaccttttaa ctggtgtcgt gtcatgtctc atagattctc acggaccgag
3421 tctggctatg aaaatcctgt ggtgtctcga attaatatga gaatacaaga tctaacagga
3421 agaccgatac ttttaggaca ccacagagct taattatact cttatgttct agattgtcct
3481 ctagatgttt ccacagcaga ggaattacag gtagcaaatt atggagttgg aggacagtat
3481 gatctacaaa ggtgtcgtct ccttaatgtc catcgtttaa tacctcaacc tcctgtcata
3541 gaaccccatt ttgactttgc acggaaagat gagccagatg ctttcaaaga gctggggaca
3541 cttggggtaa aactgaaacg tgcctttcta ctcggtctac gaaagtttct cgacccctgt
3601 ggaaatagaa ttgctacatg gctgttttat atgagtgatg tgtctgcagg aggagccact
3601 cctttatctt aacgatgtac cgacaaaata tactcactac acagacgtcc tcctcggtga
3661 gtttttcctg aagttggagc tagtgtttgg cccaaaaaag gaactgctgt tttctggtat
3661 caaaaaggac ttcaacctcg atcacaaacc gggttttttc cttgacgaca aaagaccata
3721 aatctgtttg ccagtggaga aggagattat agtacacggc atgcagcctg tccagtgcta
3721 ttagacaaac ggtcacctct tcctctaata tcatgtgccg tacgtcggac aggtcacgat
3781 gttggcaaca aatgggtatc caataaatgg ctccatgaac gtggacaaga atttcgaaga
3781 caaccgttgt ttacccatag gttatttacc gaggtacttg cacctgttct taaagcttct
3841 ccttgtacgt tgtcagaatt ggaatgatga gggccc
3841 ggaacatgca acagtcttaa ccttactact cccggg
2xStop ApaI
단백질 서열
MFa
P4HA
서열(
SEQ
ID
NO
:5)
MRFPSIFTAV LFAASSALAH PGFFTSIGQM TDLIHTEKDL VTSLKDYIKA EEDKLEQIKK WAEKLDRLTS TATKDPEGFV GHPVNAFKLM KRLNTEWSEL ENLVLKDMSD GFISNLTIQR PVLSNDEDQV GAAKALLRLQ DTYNLDTDTI SKGNLPGVKH KSFLTAEDCF ELGKVAYTEA DYYHTELWME QALRQLDEGE ISTIDKVSVL DYLSYAVYQQ GDLDKALLLT KKLLELDPEH QRANGNLKYF EYIMAKEKDV NKSASDDQSD QKTTPKKKGV AVDYLPERQK YEMLCRGEGI KMTPRRQKKL FCRYHDGNRN PKFILAPAKQ EDEWDKPRII RFHDIISDAE IEIVKDLAKP RLSRATVHDP ETGKLTTAQY RVSKSAWLSG YENPVVSRIN MRIQDLTGLD VSTAEELQVA NYGVGGQYEP HFDFARKDEP DAFKELGTGN RIATWLFYMS DVSAGGATVF PEVGASVWPK KGTAVFWYNL FASGEGDYST RHAACPVLVG NKWVSNKWLH ERGQEFRRPC TLSELE
MFa
P4HB
서열(
SEQ
ID
NO
:6)
MRFPSIFTAV LFAASSALAD APEEEDHVLV LRKSNFAEAL AAHKYLLVEF YAPWCGHCKA LAPEYAKAAG KLKAEGSEIR LAKVDATEES DLAQQYGVRG YPTIKFFRNG DTASPKEYTA GREADDIVNW LKKRTGPAAT TLPDGAAAES LVESSEVAVI GFFKDVESDS AKQFLQAAEA IDDIPFGITS NSDVFSKYQL DKDGVVLFKK FDEGRNNFEG EVTKENLLDF IKHNQLPLVI EFTEQTAPKI FGGEIKTHIL LFLPKSVSDY DGKLSNFKTA AESFKGKILF IFIDSDHTDN QRILEFFGLK KEECPAVRLI TLEEEMTKYK PESEELTAER ITEFCHRFLE GKIKPHLMSQ ELPEDWDKQP VKVLVGKNFE DVAFDEKKNV FVEFYAPWCG HCKQLAPIWD KLGETYKDHE NIVIAKMDST ANEVEAVKVH SFPTLKFFPA SADRTVIDYN GERTLDGFKK FLESGGQDGA GDDDDLEDLE EAEEPDMEED DDQKAVKDEL
MFa
서열(
SEQ
ID
NO
: 10)
MRFPSIFTAV LFAASSALA
참고문헌
1. Yonge M., 1962, On the significance of the byssus in the bivalvia and its effects in evolution, J. Mar. Biol. Ass., U. K. 42, p113-125
2. Qin X.-X., Coyne K.J., & Waite J.H., 1997, Tough tendons, Mussel byssus has collagen with silk-like domains, Journal of Biological Chemistry 272, p32623-32627
3. Waite J.H., 1992, Results Probl. Cell Differ 19, p27
4. Qin X.-X., & Waite J.H., 1995, Exotic Collagen Gradients in the Byssus of ht eMussel Mytilus Edulia, The Journal of Experimental Biology 198, p633-644
5. Vaccaro E. & Waite J.H., 2001, Yield and Post-Yield Behavior of Mussel Byssal Thread: A Self-Healing Biomolecular Material, Biomacromolecules 2, p906-911
6. Coyne K.J., Qin X.-X., & Waite J.H. 1997, Extensible collagen in mussel byssus: a natural block copolymer, Science 277, p1830-1832
7. Waite J.H., Qin X.-X., & Coyne K.J., 1998, The peculiar collagens of mussel byssus, Matrix Biology 17, p93-106
8. Coombs T.L., & Keller P.J., 1981, Mytilus byssal threads as an environmental marker for metal ions, Aquat. Toxicol., 1981, p291-300
9. Swann C.P., Adewole T., & Waite J.H., 1998, Preferential manganese accumulation in dreissenid byssal threads, Comparative Biochemistry and Physiology Part B: Biochemistry & Molecular Biology 119B, p755-759
10. Taylor S.W., Chase D.B., Emptage M.H., Nelson M.J., & Waite J.H., 1996, Ferric Ion Complexes of a DOPA-Containing Adhesive Protein from Mytilus edulis, Inorganic Chemistry 35, p7572-7577
11. Sun C., Vaccaro E., & Waite J.H., 2001, Oxidative stress and the mechanical properties of naturally occurring chimeric collagen-containing fibers, Biophysical Journal 81, p3590-3595
12. Myllyharju J., Nokelainen M., Vuorela A., & Kivirikko K.I., 2000, Expression of recombinant human I-III collagens in the yeast Pichia pastoris, Biochem. Soc. Trans. 28, p353-357
13. Prockop D.J., & Kivirikko K.I., 1995, Annu. Rev. Biochem. 64, p403-434
14. Olsen D.R., Leigh S.D., Chang R., McMullin H., Ong W., Ernest T., Chisholm G., Birk D.E., Berg R.A., Hitzeman R.A., & Toman P.D., 2001, Production of Human Type 1 Collagen in Yeast Reveals Unexpected New Insights into Molecular assembly of Collagen Trimers., J. Biol. Chem. 276, p24038-24043.
15. Scheibel T., 2004, Spider silks: recombinant synthesis, assembly, spinning, and engineering of synthetic proteins, Microbial Cell Factories 3, No pp. given.
16. Huemmerich D., Scheibel T., Vollrath F., Cohen S., Gat U., & Ittah S., 2004, Novel Assembly Properties of Recombinant Spider Dragline Silk Proteins, Current Biology 14, p2070-2074.
17. Huemmerich, D., Helsen, C.W., Quedzuweit, S., Oschmann, J., Rudolph, R., & Scheibel, T., 2004, Primary Structure Elements of Spider Dragline Silks and Their Contribution to Protein Solubility, Biochemistry 43, p13604-13612.
18. Brown, K.C., & Kodadek, T., 2001, Protein cross-linking mediated by metal ion complexes, Metal Ions in Biological Systems 38, p351-384
19. Fancy, D.A., Denison, C., Kim, K., Xie, Y., Holdeman, T., Amini, F., & Kodadek, T., 2000, Scope, limitations and mechanistic aspects of the photo-induced cross-linking of proteins by water-soluble metal complexes, Chemistry & Biology 7, p697-708
20. Burdine, L., Gillette, T.G., Lin, H.-J., & Kodadek, T., 2004, Periodate-Triggered Cross-Linking of DOPA-Containing Peptide-Protein Complexes., Journal of the American Chemical Society 126, p11442-11443
21. Kim, K., Fancy, D.A., Carney, D., & Kodadek, T., 1999, Photoinduced Protein Cross-Linking Mediated by Palladium Porphyrins, Journal of the American Chemical Society 121, p11896-11897
기타 참고문헌:
Brake, A.J. (1990) Alpha-factor leader-directed secretion of heterologous proteins from yeast. Methods Enzymol. 185: 408-21
Bulleid, N.J., John, D.C. & Kadler, K.E. (2000) Recombinant expression systems for the production of collagen. Biochem. Soc. Trans. 28: 350-3
Coyne, K.J. & Waite, J.H. (2000) In search of molecular dovetails in mussel byssus: from the threads to the stem. J. Exp. Biol. 203: 1425-31
Keizer-Gunnink, I., Vuorela, A., Myllyharju, J., Pihlajaniemi, T., Kivirikko, K.I. & Veenhuis, M. (2000) Accumulation of properly folded human type III procollagen molecules in specific intracellular membranous compartments in the yeast Pichia pastoris. Matrix Biol. 19: 29-36
Lucas, J.M., Vaccaro, E. & Waite, J.H. (2002) A molecular, morphometric and mechanical comparison of the structural elements of byssus from Mytilus edulis and Mytilus galloprovincialis. J. Exp. Biol. 205: 1807-1
Mascolo, J.M. & Waite, J.H. (1986) Protein gradients in byssal threads of some marine bivalve molluscs. J. Exp. Zool. 240: 1-7
Qin, X.X. & Waite, J.H. (1998) A potential mediator of collagenous block copolymer gradients in mussel byssal threads. Proc. Natl. Acad. Sci. USA 95: 10517-22
Sikorski R.S. & Hieter P. (1989) A system of shuttle vectors and yeast host strains designed for efficient manipulation of DNA in Saccharomyces cerevisiae. Genetics 122, 19-27
Toman, P.D., Chisholm, G., McMullin, H., Giere, L.M., Olsen, D.R., Kovach, R.J., Leigh, S.D., Fong, B.E., Chang, R., Daniels, G.A., Berg, R.A. & Hitzeman, R.A. (2000) Production of recombinant human type I procollagen trimers using a four-gene expression system in the yeast Saccharomyces cerevisiae. J. Biol. Chem. 275: 23303-9
Vaughn, P.R., Galanis, M., Richards, K.M., Tebb, T.A., Ramshaw, J.A. & Werkmeister, J.A. (1998) Production of recombinant hydroxylated human type III collagen fragment in Saccharomyces cerevisiae. DNA Cell Biol. 17: 511-8
Vuorela, A., Myllyharju, J., Nissi, R., Pihlajaniemi, T. & Kivirikko, K.I. (1997) Assembly of human prolyl 4-hydroxylase and type III collagen in the yeast pichia pastoris: formation of a stable enzyme tetramer requires coexpression with collagen and assembly of a stable collagen requires coexpression with prolyl 4-hydroxylase. EMBO J. 16: 6702-12
Waite, J.H., Vaccaro, E., Sun, C. & Lucas, J.M. (2002) Elastomeric gradients: a hedge against stress concentration in marine holdfasts Philos. Trans. R. Soc. Lond B Biol. Sci. 357: 143-53
<110> Technische Universitaet Muenchen
<120> Production of recombinant collagen like proteins
<160> 10
<170> KopatentIn 1.71
<210> 1
<211> 2703
<212> DNA
<213> Mytilus edulis
<400> 1
atggttcggt tctccctagc atcggtacta ttactggcag tcaccagcac agctttcgct 60
ggaccagtta gtgattatgg tggtggtgga atcaaagtag taccctacca cggaggcgga 120
ggtggaagcg gcggcggtgg cggtggaggc catggcggaa gcggtattgg tggtatcgga 180
ggaggatcat cacatgcaca tgcccactct tcagcatctg cccatgtgca ccattttgga 240
ccaggtggat cttcacatgc atcagctggt tcatcatccc atgcatccgc atcccataac 300
ggtttaggag gtggcagtgc tcatgcacat agcagttcca gcgccaacgc tcattccggt 360
ggattcggtg gattcggcgg tattggtggt attggcggta ttggcccagg aggaagtgtc 420
ggaggcggta ttggcccagg aggaagtgtc ggaggcggca ttggcggtat tggcggtatt 480
ggcggcggtg gtggaccagg cggtaatggc ggtatcggat tcggaccagg attcggagga 540
ggattcggac caggttcatc tgctagtgga tccggaagtg gcagcgcatt cggtggtcca 600
ggaggttcaa gcgcaagcgc aaacgcagct gcacgtgcaa atgcaaatgg tggtggagga 660
ttcggtggac caggtacccc aggaaactca ggaccaccag gccaacccgg actaccagga 720
gcaccaggcc aaccaggacg tccaggaagt accccaccag gtcgaccagg aaaccccgga 780
ccaccaggtc aaccaggtaa cccaggacgt ccaggctctt caggaagacc aggaggatcc 840
ggccaaccag gaggtccagg acgtccagga acccccggca aaccaggaaa ccgaggacaa 900
ccaggacagc caggcggccc aggacaacca ggtcacccag gagcaggagg acaaccagga 960
cgaaacggaa atccaggaaa ccccggtaaa ccaggaacac caggtcaccc aggaacagca 1020
ggatcacgag gaatgccagg aaccccagga accccaggac aaccaggaat tccaggcacc 1080
gtcggaggac gaggaccaag aggaccagct ggaatcatcg gattaattgg accaaaagga 1140
aatccaggag agccaggaaa tccaggtgca ccaggaggcc caggatctac aggaccacaa 1200
ggaccacaag gaccagccgg aggaccagga gcatcaggcg gaccaggaga caaaggcgca 1260
ccaggtacac caggaggaac tggaccaaga ggaccaatcg gaccatcagg accatcagga 1320
gcaccagggg accaaggacc acaaggaggt agaggaacac caggactcgc aggcaaacca 1380
ggacctaaag gactacaagg atcaaatgga gaagttggac cccaaggacc atctggaccc 1440
gcaggaccac aaggcccaca aggaaagaac ggtgtcaaag gagcagcagg agatcaagga 1500
gctaggggac cagaaggaaa agccggacca gctggaccac aaggagaaac aggaccaaaa 1560
ggaccaacag gagcacaagg accagccggt ccagccggac catcaggaga acaaggacca 1620
ggaggggaaa gaggaggcca gggaccacaa ggagctgaag gaccaagtgg accagcagga 1680
ccaagaggac cagcaggatc acaaggacca agtggtgaac gcggagaacc aggagcacca 1740
ggtaaaaaag gaccaaatgg agaccgagga aaccaaggat caccaggagc accaggcaaa 1800
aacggagcac gaggaaatag aggatcaaga ggaagcaacg gatcacccgg cagatcagga 1860
tcaccaggaa gccgaggaaa accaggacca caaggaccac atggaccaag aggagcaaga 1920
ggatcaccag gacaaaaagg accacgtgga gaccaaggag caccaggtgt tattcgtatt 1980
gttatcgatg accagagaac aggaccagaa gttgcagaat tcccaggatt tggtggattc 2040
ggaggagctt cagctaacgc agcaagttca gcaaatgcat ttgctggtgg acccggtggt 2100
tccgctggag caggttcatc atcaggagct aacgcaaacg caggtggatt cccattcgga 2160
ggaggaccat tcggaggagc aggaggtggt cccggagcag caggaggccc aggaggagca 2220
ggaggcccag gaggagtagg aggaggagtt ggaggtggac caggaggagt aggaggtgga 2280
gtaggaggtg gaccaggagg agtaggaggt ggaccaggag gagcaggacc aggaggagca 2340
ggaggatttg gaccaggagg agcaggagga tttggtggat ttggaggagg atctagcgct 2400
ggagcatcat catcaggatc agcatctgca tctaacggtg gaccattcgg agtactcaat 2460
gtaggacccg gaggtagaat cggtggtgga agcgcatcag catctgcagc atctagagca 2520
catgcacacg cttttggtgg tctcggaggg ggaagtgcct cagctggtag tcattcctca 2580
tctagctcac actcatttgg cggacacgta ttccacagtg tgacccatca tggaggtcca 2640
tcacatgttt caagcggagg tcacggaggt catggaggag gtccatacaa acctggatat 2700
taa 2703
<210> 2
<211> 2025
<212> DNA
<213> Mytilus edulis
<400> 2
atggtctaca aactcctgac cgtgtgtctt gtagcatctc ttctagagat ttgcttagct 60
gactataacg gcaacaaaca gtatggcggc agatacggca acagatacgg aaacggttta 120
ggaggcggta atggtggtgc aggagccgta gcccatgccc atgcccatgc ccatgccagt 180
gccggagcaa acggaagagc aagagcacat gcacgagcct tggcccatgc acatgccggt 240
ggtggcgctg cacatggaca cccaggattc ccagttggtg gtagcgcaag cgcagccgca 300
cgagcagcag cacgagcatc agcaggagga ttaggtggat tcggatcagc agcagccaat 360
gcagcagcag cagcaagagc aggagcagga tttggtggat tcggtggatt aggaggattc 420
ggaggactcg gaggagttgg cggtccaggt caaccaggac atgccggtaa acacggaacc 480
gcaggagcag caggcaaagc aggacgtcca ggaccatgtg gagatagagg ggcaccagga 540
gtaccaggca aacaaggacc agtaggagga caaggaccag caggaccacg aggaccacga 600
ggagatgaag gaccagttgg accaaagggc gaaccaggag caagaggagc tgatggtaaa 660
ccaggagaca aaggacctga tggagaaacc ggaccacaag gaccagctgg accaaaggga 720
caagtaggag accaaggcaa accaggagca aagggagaaa ccggagatca aggagcacga 780
ggtgaagcag gaaaggccgg cgaacaagga ccaggaggca tccaaggacc aaagggacca 840
gtaggaggac aaggaccagc aggaccagcc ggaccactcg gaccacaagg accaatgggt 900
gaacgaggac cacaaggacc aacaggatca gaaggaccag ttggagcacc aggaccaaag 960
ggatcagtcg gagaccaagg agcacaagga gaccaaggag caactggcgc tgatggcaaa 1020
aagggagaac caggagagag aggacaacaa ggagcagcag gaccagtcgg ccgaccagga 1080
ccaagaggag atagaggagc aaagggaatt caaggaagcc gaggacgacc aggtggtatg 1140
ggtagacgag gaaaccgtgg atcccaagga gcagtaggac cacgaggaga aactggccca 1200
gacggtaacc aaggacaacg tggagaacaa ggagcaccag gagttatcac ccttgtcatt 1260
gaagacctca gaacagccgg agtagaaagc cccgtagaaa cctttgacgc aggagcagga 1320
accggtggac cagcaccagg agtaggagca gcagcaacag caggagcatt tgcaggagca 1380
ggaccaggag gagctaatgc aggaggaaac gcagccgcag gagcaggacc aggagtagga 1440
ccaggaggac tcggaggact aggaggactt ggtgcaggtg gactcggagg tggactcggc 1500
ggtggactcg gaggattagg aggagcagga ggtttaggtg gtggactcgg aggattagga 1560
ggaggtttag gtggtggact cggaggttta ggaggtggag caggaggagc aggaggcgca 1620
ggagcaggag gaaacggtgg agcaggagca ggaggagcag gaggaaacgg tggaggatca 1680
gccgcagcac gagcagcagc acaagcagca gcagcagcag gaggaaacgg tggagcagca 1740
caagcagcag cacaagcagc agcatcagca gcagcaaatt caggacttgg agcaggagca 1800
gcaagagcag cagcatcagc agccgctaga gcaaccgtag caggacatgg aagtggaacc 1860
gccgcagcag cagccaacgc agccgcacaa gcacatgcag caacacgagg acaaggagga 1920
tcacacgcac acgctgccgc cgcagctcac gcagccgcaa gtagcgtaat ccatggtggt 1980
gactatcacg gaaacgatgc cggctatcac aaaccaggat attaa 2025
<210> 3
<211> 900
<212> PRT
<213> Mytilus edulis
<400> 3
Met Val Arg Phe Ser Leu Ala Ser Val Leu Leu Leu Ala Val Thr Ser
1 5 10 15
Thr Ala Phe Ala Gly Pro Val Ser Asp Tyr Gly Gly Gly Gly Ile Lys
20 25 30
Val Val Pro Tyr His Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly
35 40 45
Gly Gly His Gly Gly Ser Gly Ile Gly Gly Ile Gly Gly Gly Ser Ser
50 55 60
His Ala His Ala His Ser Ser Ala Ser Ala His Val His His Phe Gly
65 70 75 80
Pro Gly Gly Ser Ser His Ala Ser Ala Gly Ser Ser Ser His Ala Ser
85 90 95
Ala Ser His Asn Gly Leu Gly Gly Gly Ser Ala His Ala His Ser Ser
100 105 110
Ser Ser Ala Asn Ala His Ser Gly Gly Phe Gly Gly Phe Gly Gly Ile
115 120 125
Gly Gly Ile Gly Gly Ile Gly Pro Gly Gly Ser Val Gly Gly Gly Ile
130 135 140
Gly Pro Gly Gly Ser Val Gly Gly Gly Ile Gly Gly Ile Gly Gly Ile
145 150 155 160
Gly Gly Gly Gly Gly Pro Gly Gly Asn Gly Gly Ile Gly Phe Gly Pro
165 170 175
Gly Phe Gly Gly Gly Phe Gly Pro Gly Ser Ser Ala Ser Gly Ser Gly
180 185 190
Ser Gly Ser Ala Phe Gly Gly Pro Gly Gly Ser Ser Ala Ser Ala Asn
195 200 205
Ala Ala Ala Arg Ala Asn Ala Asn Gly Gly Gly Gly Phe Gly Gly Pro
210 215 220
Gly Thr Pro Gly Asn Ser Gly Pro Pro Gly Gln Pro Gly Leu Pro Gly
225 230 235 240
Ala Pro Gly Gln Pro Gly Arg Pro Gly Ser Thr Pro Pro Gly Arg Pro
245 250 255
Gly Asn Pro Gly Pro Pro Gly Gln Pro Gly Asn Pro Gly Arg Pro Gly
260 265 270
Ser Ser Gly Arg Pro Gly Gly Ser Gly Gln Pro Gly Gly Pro Gly Arg
275 280 285
Pro Gly Thr Pro Gly Lys Pro Gly Asn Arg Gly Gln Pro Gly Gln Pro
290 295 300
Gly Gly Pro Gly Gln Pro Gly His Pro Gly Ala Gly Gly Gln Pro Gly
305 310 315 320
Arg Asn Gly Asn Pro Gly Asn Pro Gly Lys Pro Gly Thr Pro Gly His
325 330 335
Pro Gly Thr Ala Gly Ser Arg Gly Met Pro Gly Thr Pro Gly Thr Pro
340 345 350
Gly Gln Pro Gly Ile Pro Gly Thr Val Gly Gly Arg Gly Pro Arg Gly
355 360 365
Pro Ala Gly Ile Ile Gly Leu Ile Gly Pro Lys Gly Asn Pro Gly Glu
370 375 380
Pro Gly Asn Pro Gly Ala Pro Gly Gly Pro Gly Ser Thr Gly Pro Gln
385 390 395 400
Gly Pro Gln Gly Pro Ala Gly Gly Pro Gly Ala Ser Gly Gly Pro Gly
405 410 415
Asp Lys Gly Ala Pro Gly Thr Pro Gly Gly Thr Gly Pro Arg Gly Pro
420 425 430
Ile Gly Pro Ser Gly Pro Ser Gly Ala Pro Gly Asp Gln Gly Pro Gln
435 440 445
Gly Gly Arg Gly Thr Pro Gly Leu Ala Gly Lys Pro Gly Pro Lys Gly
450 455 460
Leu Gln Gly Ser Asn Gly Glu Val Gly Pro Gln Gly Pro Ser Gly Pro
465 470 475 480
Ala Gly Pro Gln Gly Pro Gln Gly Lys Asn Gly Val Lys Gly Ala Ala
485 490 495
Gly Asp Gln Gly Ala Arg Gly Pro Glu Gly Lys Ala Gly Pro Ala Gly
500 505 510
Pro Gln Gly Glu Thr Gly Pro Lys Gly Pro Thr Gly Ala Gln Gly Pro
515 520 525
Ala Gly Pro Ala Gly Pro Ser Gly Glu Gln Gly Pro Gly Gly Glu Arg
530 535 540
Gly Gly Gln Gly Pro Gln Gly Ala Glu Gly Pro Ser Gly Pro Ala Gly
545 550 555 560
Pro Arg Gly Pro Ala Gly Ser Gln Gly Pro Ser Gly Glu Arg Gly Glu
565 570 575
Pro Gly Ala Pro Gly Lys Lys Gly Pro Asn Gly Asp Arg Gly Asn Gln
580 585 590
Gly Ser Pro Gly Ala Pro Gly Lys Asn Gly Ala Arg Gly Asn Arg Gly
595 600 605
Ser Arg Gly Ser Asn Gly Ser Pro Gly Arg Ser Gly Ser Pro Gly Ser
610 615 620
Arg Gly Lys Pro Gly Pro Gln Gly Pro His Gly Pro Arg Gly Ala Arg
625 630 635 640
Gly Ser Pro Gly Gln Lys Gly Pro Arg Gly Asp Gln Gly Ala Pro Gly
645 650 655
Val Ile Arg Ile Val Ile Asp Asp Gln Arg Thr Gly Pro Glu Val Ala
660 665 670
Glu Phe Pro Gly Phe Gly Gly Phe Gly Gly Ala Ser Ala Asn Ala Ala
675 680 685
Ser Ser Ala Asn Ala Phe Ala Gly Gly Pro Gly Gly Ser Ala Gly Ala
690 695 700
Gly Ser Ser Ser Gly Ala Asn Ala Asn Ala Gly Gly Phe Pro Phe Gly
705 710 715 720
Gly Gly Pro Phe Gly Gly Ala Gly Gly Gly Pro Gly Ala Ala Gly Gly
725 730 735
Pro Gly Gly Ala Gly Gly Pro Gly Gly Val Gly Gly Gly Val Gly Gly
740 745 750
Gly Pro Gly Gly Val Gly Gly Gly Val Gly Gly Gly Pro Gly Gly Val
755 760 765
Gly Gly Gly Pro Gly Gly Ala Gly Pro Gly Gly Ala Gly Gly Phe Gly
770 775 780
Pro Gly Gly Ala Gly Gly Phe Gly Gly Phe Gly Gly Gly Ser Ser Ala
785 790 795 800
Gly Ala Ser Ser Ser Gly Ser Ala Ser Ala Ser Asn Gly Gly Pro Phe
805 810 815
Gly Val Leu Asn Val Gly Pro Gly Gly Arg Ile Gly Gly Gly Ser Ala
820 825 830
Ser Ala Ser Ala Ala Ser Arg Ala His Ala His Ala Phe Gly Gly Leu
835 840 845
Gly Gly Gly Ser Ala Ser Ala Gly Ser His Ser Ser Ser Ser Ser His
850 855 860
Ser Phe Gly Gly His Val Phe His Ser Val Thr His His Gly Gly Pro
865 870 875 880
Ser His Val Ser Ser Gly Gly His Gly Gly His Gly Gly Gly Pro Tyr
885 890 895
Lys Pro Gly Tyr
900
<210> 4
<211> 674
<212> PRT
<213> Mytilus edulis
<400> 4
Met Val Tyr Lys Leu Leu Thr Val Cys Leu Val Ala Ser Leu Leu Glu
1 5 10 15
Ile Cys Leu Ala Asp Tyr Asn Gly Asn Lys Gln Tyr Gly Gly Arg Tyr
20 25 30
Gly Asn Arg Tyr Gly Asn Gly Leu Gly Gly Gly Asn Gly Gly Ala Gly
35 40 45
Ala Val Ala His Ala His Ala His Ala His Ala Ser Ala Gly Ala Asn
50 55 60
Gly Arg Ala Arg Ala His Ala Arg Ala Leu Ala His Ala His Ala Gly
65 70 75 80
Gly Gly Ala Ala His Gly His Pro Gly Phe Pro Val Gly Gly Ser Ala
85 90 95
Ser Ala Ala Ala Arg Ala Ala Ala Arg Ala Ser Ala Gly Gly Leu Gly
100 105 110
Gly Phe Gly Ser Ala Ala Ala Asn Ala Ala Ala Ala Ala Arg Ala Gly
115 120 125
Ala Gly Phe Gly Gly Phe Gly Gly Leu Gly Gly Phe Gly Gly Leu Gly
130 135 140
Gly Val Gly Gly Pro Gly Gln Pro Gly His Ala Gly Lys His Gly Thr
145 150 155 160
Ala Gly Ala Ala Gly Lys Ala Gly Arg Pro Gly Pro Cys Gly Asp Arg
165 170 175
Gly Ala Pro Gly Val Pro Gly Lys Gln Gly Pro Val Gly Gly Gln Gly
180 185 190
Pro Ala Gly Pro Arg Gly Pro Arg Gly Asp Glu Gly Pro Val Gly Pro
195 200 205
Lys Gly Glu Pro Gly Ala Arg Gly Ala Asp Gly Lys Pro Gly Asp Lys
210 215 220
Gly Pro Asp Gly Glu Thr Gly Pro Gln Gly Pro Ala Gly Pro Lys Gly
225 230 235 240
Gln Val Gly Asp Gln Gly Lys Pro Gly Ala Lys Gly Glu Thr Gly Asp
245 250 255
Gln Gly Ala Arg Gly Glu Ala Gly Lys Ala Gly Glu Gln Gly Pro Gly
260 265 270
Gly Ile Gln Gly Pro Lys Gly Pro Val Gly Gly Gln Gly Pro Ala Gly
275 280 285
Pro Ala Gly Pro Leu Gly Pro Gln Gly Pro Met Gly Glu Arg Gly Pro
290 295 300
Gln Gly Pro Thr Gly Ser Glu Gly Pro Val Gly Ala Pro Gly Pro Lys
305 310 315 320
Gly Ser Val Gly Asp Gln Gly Ala Gln Gly Asp Gln Gly Ala Thr Gly
325 330 335
Ala Asp Gly Lys Lys Gly Glu Pro Gly Glu Arg Gly Gln Gln Gly Ala
340 345 350
Ala Gly Pro Val Gly Arg Pro Gly Pro Arg Gly Asp Arg Gly Ala Lys
355 360 365
Gly Ile Gln Gly Ser Arg Gly Arg Pro Gly Gly Met Gly Arg Arg Gly
370 375 380
Asn Arg Gly Ser Gln Gly Ala Val Gly Pro Arg Gly Glu Thr Gly Pro
385 390 395 400
Asp Gly Asn Gln Gly Gln Arg Gly Glu Gln Gly Ala Pro Gly Val Ile
405 410 415
Thr Leu Val Ile Glu Asp Leu Arg Thr Ala Gly Val Glu Ser Pro Val
420 425 430
Glu Thr Phe Asp Ala Gly Ala Gly Thr Gly Gly Pro Ala Pro Gly Val
435 440 445
Gly Ala Ala Ala Thr Ala Gly Ala Phe Ala Gly Ala Gly Pro Gly Gly
450 455 460
Ala Asn Ala Gly Gly Asn Ala Ala Ala Gly Ala Gly Pro Gly Val Gly
465 470 475 480
Pro Gly Gly Leu Gly Gly Leu Gly Gly Leu Gly Ala Gly Gly Leu Gly
485 490 495
Gly Gly Leu Gly Gly Gly Leu Gly Gly Leu Gly Gly Ala Gly Gly Leu
500 505 510
Gly Gly Gly Leu Gly Gly Leu Gly Gly Gly Leu Gly Gly Gly Leu Gly
515 520 525
Gly Leu Gly Gly Gly Ala Gly Gly Ala Gly Gly Ala Gly Ala Gly Gly
530 535 540
Asn Gly Gly Ala Gly Ala Gly Gly Ala Gly Gly Asn Gly Gly Gly Ser
545 550 555 560
Ala Ala Ala Arg Ala Ala Ala Gln Ala Ala Ala Ala Ala Gly Gly Asn
565 570 575
Gly Gly Ala Ala Gln Ala Ala Ala Gln Ala Ala Ala Ser Ala Ala Ala
580 585 590
Asn Ser Gly Leu Gly Ala Gly Ala Ala Arg Ala Ala Ala Ser Ala Ala
595 600 605
Ala Arg Ala Thr Val Ala Gly His Gly Ser Gly Thr Ala Ala Ala Ala
610 615 620
Ala Asn Ala Ala Ala Gln Ala His Ala Ala Thr Arg Gly Gln Gly Gly
625 630 635 640
Ser His Ala His Ala Ala Ala Ala Ala His Ala Ala Ala Ser Ser Val
645 650 655
Ile His Gly Gly Asp Tyr His Gly Asn Asp Ala Gly Tyr His Lys Pro
660 665 670
Gly Tyr
<210> 5
<211> 536
<212> PRT
<213> Saccharomyces cerevisiae
<400> 5
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala His Pro Gly Phe Phe Thr Ser Ile Gly Gln Met Thr Asp
20 25 30
Leu Ile His Thr Glu Lys Asp Leu Val Thr Ser Leu Lys Asp Tyr Ile
35 40 45
Lys Ala Glu Glu Asp Lys Leu Glu Gln Ile Lys Lys Trp Ala Glu Lys
50 55 60
Leu Asp Arg Leu Thr Ser Thr Ala Thr Lys Asp Pro Glu Gly Phe Val
65 70 75 80
Gly His Pro Val Asn Ala Phe Lys Leu Met Lys Arg Leu Asn Thr Glu
85 90 95
Trp Ser Glu Leu Glu Asn Leu Val Leu Lys Asp Met Ser Asp Gly Phe
100 105 110
Ile Ser Asn Leu Thr Ile Gln Arg Pro Val Leu Ser Asn Asp Glu Asp
115 120 125
Gln Val Gly Ala Ala Lys Ala Leu Leu Arg Leu Gln Asp Thr Tyr Asn
130 135 140
Leu Asp Thr Asp Thr Ile Ser Lys Gly Asn Leu Pro Gly Val Lys His
145 150 155 160
Lys Ser Phe Leu Thr Ala Glu Asp Cys Phe Glu Leu Gly Lys Val Ala
165 170 175
Tyr Thr Glu Ala Asp Tyr Tyr His Thr Glu Leu Trp Met Glu Gln Ala
180 185 190
Leu Arg Gln Leu Asp Glu Gly Glu Ile Ser Thr Ile Asp Lys Val Ser
195 200 205
Val Leu Asp Tyr Leu Ser Tyr Ala Val Tyr Gln Gln Gly Asp Leu Asp
210 215 220
Lys Ala Leu Leu Leu Thr Lys Lys Leu Leu Glu Leu Asp Pro Glu His
225 230 235 240
Gln Arg Ala Asn Gly Asn Leu Lys Tyr Phe Glu Tyr Ile Met Ala Lys
245 250 255
Glu Lys Asp Val Asn Lys Ser Ala Ser Asp Asp Gln Ser Asp Gln Lys
260 265 270
Thr Thr Pro Lys Lys Lys Gly Val Ala Val Asp Tyr Leu Pro Glu Arg
275 280 285
Gln Lys Tyr Glu Met Leu Cys Arg Gly Glu Gly Ile Lys Met Thr Pro
290 295 300
Arg Arg Gln Lys Lys Leu Phe Cys Arg Tyr His Asp Gly Asn Arg Asn
305 310 315 320
Pro Lys Phe Ile Leu Ala Pro Ala Lys Gln Glu Asp Glu Trp Asp Lys
325 330 335
Pro Arg Ile Ile Arg Phe His Asp Ile Ile Ser Asp Ala Glu Ile Glu
340 345 350
Ile Val Lys Asp Leu Ala Lys Pro Arg Leu Ser Arg Ala Thr Val His
355 360 365
Asp Pro Glu Thr Gly Lys Leu Thr Thr Ala Gln Tyr Arg Val Ser Lys
370 375 380
Ser Ala Trp Leu Ser Gly Tyr Glu Asn Pro Val Val Ser Arg Ile Asn
385 390 395 400
Met Arg Ile Gln Asp Leu Thr Gly Leu Asp Val Ser Thr Ala Glu Glu
405 410 415
Leu Gln Val Ala Asn Tyr Gly Val Gly Gly Gln Tyr Glu Pro His Phe
420 425 430
Asp Phe Ala Arg Lys Asp Glu Pro Asp Ala Phe Lys Glu Leu Gly Thr
435 440 445
Gly Asn Arg Ile Ala Thr Trp Leu Phe Tyr Met Ser Asp Val Ser Ala
450 455 460
Gly Gly Ala Thr Val Phe Pro Glu Val Gly Ala Ser Val Trp Pro Lys
465 470 475 480
Lys Gly Thr Ala Val Phe Trp Tyr Asn Leu Phe Ala Ser Gly Glu Gly
485 490 495
Asp Tyr Ser Thr Arg His Ala Ala Cys Pro Val Leu Val Gly Asn Lys
500 505 510
Trp Val Ser Asn Lys Trp Leu His Glu Arg Gly Gln Glu Phe Arg Arg
515 520 525
Pro Cys Thr Leu Ser Glu Leu Glu
530 535
<210> 6
<211> 510
<212> PRT
<213> Saccharomyces cerevisiae
<400> 6
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Asp Ala Pro Glu Glu Glu Asp His Val Leu Val Leu Arg
20 25 30
Lys Ser Asn Phe Ala Glu Ala Leu Ala Ala His Lys Tyr Leu Leu Val
35 40 45
Glu Phe Tyr Ala Pro Trp Cys Gly His Cys Lys Ala Leu Ala Pro Glu
50 55 60
Tyr Ala Lys Ala Ala Gly Lys Leu Lys Ala Glu Gly Ser Glu Ile Arg
65 70 75 80
Leu Ala Lys Val Asp Ala Thr Glu Glu Ser Asp Leu Ala Gln Gln Tyr
85 90 95
Gly Val Arg Gly Tyr Pro Thr Ile Lys Phe Phe Arg Asn Gly Asp Thr
100 105 110
Ala Ser Pro Lys Glu Tyr Thr Ala Gly Arg Glu Ala Asp Asp Ile Val
115 120 125
Asn Trp Leu Lys Lys Arg Thr Gly Pro Ala Ala Thr Thr Leu Pro Asp
130 135 140
Gly Ala Ala Ala Glu Ser Leu Val Glu Ser Ser Glu Val Ala Val Ile
145 150 155 160
Gly Phe Phe Lys Asp Val Glu Ser Asp Ser Ala Lys Gln Phe Leu Gln
165 170 175
Ala Ala Glu Ala Ile Asp Asp Ile Pro Phe Gly Ile Thr Ser Asn Ser
180 185 190
Asp Val Phe Ser Lys Tyr Gln Leu Asp Lys Asp Gly Val Val Leu Phe
195 200 205
Lys Lys Phe Asp Glu Gly Arg Asn Asn Phe Glu Gly Glu Val Thr Lys
210 215 220
Glu Asn Leu Leu Asp Phe Ile Lys His Asn Gln Leu Pro Leu Val Ile
225 230 235 240
Glu Phe Thr Glu Gln Thr Ala Pro Lys Ile Phe Gly Gly Glu Ile Lys
245 250 255
Thr His Ile Leu Leu Phe Leu Pro Lys Ser Val Ser Asp Tyr Asp Gly
260 265 270
Lys Leu Ser Asn Phe Lys Thr Ala Ala Glu Ser Phe Lys Gly Lys Ile
275 280 285
Leu Phe Ile Phe Ile Asp Ser Asp His Thr Asp Asn Gln Arg Ile Leu
290 295 300
Glu Phe Phe Gly Leu Lys Lys Glu Glu Cys Pro Ala Val Arg Leu Ile
305 310 315 320
Thr Leu Glu Glu Glu Met Thr Lys Tyr Lys Pro Glu Ser Glu Glu Leu
325 330 335
Thr Ala Glu Arg Ile Thr Glu Phe Cys His Arg Phe Leu Glu Gly Lys
340 345 350
Ile Lys Pro His Leu Met Ser Gln Glu Leu Pro Glu Asp Trp Asp Lys
355 360 365
Gln Pro Val Lys Val Leu Val Gly Lys Asn Phe Glu Asp Val Ala Phe
370 375 380
Asp Glu Lys Lys Asn Val Phe Val Glu Phe Tyr Ala Pro Trp Cys Gly
385 390 395 400
His Cys Lys Gln Leu Ala Pro Ile Trp Asp Lys Leu Gly Glu Thr Tyr
405 410 415
Lys Asp His Glu Asn Ile Val Ile Ala Lys Met Asp Ser Thr Ala Asn
420 425 430
Glu Val Glu Ala Val Lys Val His Ser Phe Pro Thr Leu Lys Phe Phe
435 440 445
Pro Ala Ser Ala Asp Arg Thr Val Ile Asp Tyr Asn Gly Glu Arg Thr
450 455 460
Leu Asp Gly Phe Lys Lys Phe Leu Glu Ser Gly Gly Gln Asp Gly Ala
465 470 475 480
Gly Asp Asp Asp Asp Leu Glu Asp Leu Glu Glu Ala Glu Glu Pro Asp
485 490 495
Met Glu Glu Asp Asp Asp Gln Lys Ala Val Lys Asp Glu Leu
500 505 510
<210> 7
<211> 3876
<212> DNA
<213> Mytilus edulis
<400> 7
ccgcggtcat tacagttcat ctttcacagc tttctgatca tcgtcttcct ccatgtctgg 60
ctcctctgct tcttccaggt cctcgagatc gtcatcatcc cctgccccat cctggccacc 120
cgagaggtcc ttaaagaatt ttggtaggtc gcacgcaagg ggcaacatta gttactggca 180
cctgtcggca ctggcaggaa agaacttgag tgtggggaag ctgtgcactt tgacggcctc 240
cacctcgttg gcagtcgagt ccatcttggc gatgacgatg ttctcatggt ccttgtacgt 300
ctctcccagt ttatcccaaa tgggagccaa ctgtttgcag tgaccacacc atggggcata 360
gaactccaca aagacgtttt ttttctcatc aaaagccacg tcttcaaagt tcttcccaac 420
aagcaccttg acaggctgct tgtcccagtc ctccggcagc tcctggctca tcaggtgggg 480
cttgattttg ccctccagga agcggtggca gaactctgtg atcctctctg ccgtcagctc 540
ctccgattcg ggcttgtact tggtcatctc ctcctccagg gtgatgaggc gcacggccgg 600
gcactcttcc ttcttcaggc caaagaactc gaggatgcgc tggttgtcgg tgtggtcgct 660
gtcgatgaag atgaacagga tcttgccctt gaagctctcg gctgctgttt tgaagttgct 720
cagtttgccg tcatagtcag acacactctt gggcaagaac agcaggatgt gagtcttgat 780
ttcacctcca aaaatcttcg gggctgtctg ctcggtgaac tcgatgacaa ggggcagctg 840
gttgtgtttg ataaagtcca gcaggttctc cttggtgacc tccccttcaa agttgttccg 900
gccttcatca aacttcttaa agaggacaac cccatctttg tcgagctggt atttggagaa 960
cacgtcactg ttggaagtga tcccaaatgg tatgtcatcg atggcctctg ctgcctgcaa 1020
aaactgcttg gcagagtccg actccacgtc cttgaagaag ccgatgacag ccacctcgct 1080
ggactccacc aaggactctg cagctgcgcc gtcaggcagg gtggtggcag ccgggcccgt 1140
gcgcttcttc agccagttca cgatgtcatc agcctctctg ccagctgtat attccttggg 1200
ggaagccgtg tctccattcc tgaagaactt gatggtggga tagccgcgca cgccgtactg 1260
ctgggccagg tcagactcct ccgtggcgtc caccttggcc aacctgatct cggaaccttc 1320
tgccttcagc ttcccagcgg ctttggcata ctcaggggcc agagccttgc agtggccaca 1380
ggttccccgt atcttgaggt ggtcgtccat gaacacccgg cggtcgcgga ggcgcttcaa 1440
gcttttccgc agcaccagga cgtggtcctc ctcctccgga gcgtcagcta atgcggagga 1500
tgctgcgaat aaaactgcag taaaaattga aggaaatctc atggatccgg ggttttttct 1560
ccttgacgtt aaagtataga ggtatattaa caattttttg ttgatacttt tattacattt 1620
gaataagaag taatacaaac cgaaaatgtt gaaagtatta gttaaagtgg ttatgcagtt 1680
tttgcattta tatatctgtt aatagatcaa aaatcatcgc ttcgctgatt aattacccca 1740
gaaataaggc taaaaaacta atcgcattat catcctatgg ttgttaattt gattcgttca 1800
tttgaaggtt tgtggggcca ggttactgcc aatttttcct cttcataacc ataaaagcta 1860
gtattgtaga atctttattg ttcggagcag tgcggcgcga ggcacatctg cgtttcagga 1920
acgcgaccgg tgaagacgag gacgcacgga ggagagtctt ccttcggagg gctgtcaccc 1980
gctcggcggc ttctaatccg tacttcaata tagcaatgag cagttaagcg tattactgaa 2040
agttccaaag agaaggtttt tttaggctaa gataatgggg ctctttacat ttccacaaca 2100
tataagtaag attagatatg gatatgtata tggatatgta tatggtggta atgccatgta 2160
atatgattat taaacttctt tgcgtccatc caaaaaaaaa gtaagaattt ttgaaaattc 2220
aaggaattcg atatcaagct tatcgatacc gtcgacatga gatttccttc aatttttact 2280
gcagttttat tcgcagcatc ctccgcgcta gctcatccag gcttttttac ttcaattggt 2340
cagatgactg atttgatcca tactgagaaa gatctggtga cttctctgaa agattatatt 2400
aaggcagaag aggacaagtt agaacaaata aaaaaatggg cagagaagtt agatcggcta 2460
actagtacag cgacaaaaga tccagaagga tttgttgggc atccagtaaa tgcattcaaa 2520
ttaatgaaac gtctgaatac tgagtggagt gagttggaga atctggtcct taaggatatg 2580
tcagatggct ttatctctaa cctaaccatt cagagaccag tactttctaa tgatgaagat 2640
caggttgggg cagccaaagc tctgttacgt ctccaggata cctacaattt ggatacagat 2700
accatctcaa agggtaatct tccaggagtg aaacacaaat cttttctaac ggctgaggac 2760
tgctttgagt tgggcaaagt ggcctataca gaagcagatt attaccatac ggaactgtgg 2820
atggaacaag ccctaaggca actggatgaa ggcgagattt ctaccataga taaagtctct 2880
gttctagatt atttgagcta tgcggtatat cagcagggag acctggataa ggcacttttg 2940
ctcacaaaga agcttcttga actagatcct gaacatcaga gagctaatgg taacttaaaa 3000
tattttgagt atataatggc taaagaaaaa gatgtcaata agtctgcttc agatgaccaa 3060
tctgatcaga aaactacacc aaagaaaaaa ggggttgctg tggattacct gccagagaga 3120
cagaagtacg aaatgctgtg ccgtggggag ggtatcaaaa tgacccctcg gagacagaaa 3180
aaactctttt gccgctacca tgatggaaac cgtaatccta aatttattct ggctccagct 3240
aaacaggagg atgaatggga caagcctcgt attattcgct tccatgatat tatttctgat 3300
gcagaaattg aaatcgtcaa agacctagca aaaccaaggc tgagccgagc tacagtacat 3360
gaccctgaga ctggaaaatt gaccacagca cagtacagag tatctaagag tgcctggctc 3420
tctggctatg aaaatcctgt ggtgtctcga attaatatga gaatacaaga tctaacagga 3480
ctagatgttt ccacagcaga ggaattacag gtagcaaatt atggagttgg aggacagtat 3540
gaaccccatt ttgactttgc acggaaagat gagccagatg ctttcaaaga gctggggaca 3600
ggaaatagaa ttgctacatg gctgttttat atgagtgatg tgtctgcagg aggagccact 3660
gtttttcctg aagttggagc tagtgtttgg cccaaaaaag gaactgctgt tttctggtat 3720
aatctgtttg ccagtggaga aggagattat agtacacggc atgcagcctg tccagtgcta 3780
gttggcaaca aatgggtatc caataaatgg ctccatgaac gtggacaaga atttcgaaga 3840
ccttgtacgt tgtcagaatt ggaatgatga gggccc 3876
<210> 8
<211> 922
<212> PRT
<213> Mytilus edulis
<400> 8
Met Val Tyr Lys Leu Leu Thr Val Cys Leu Val Ala Ser Leu Leu Glu
1 5 10 15
Ile Cys Leu Ala Asp Tyr Asn Gly Asn Lys Gln Tyr Gly Gly Arg Tyr
20 25 30
Gly Asn Arg Tyr Gly Asn Gly Leu Gly Gly Gly Asn Gly Gly Ala Gly
35 40 45
Ala Val Ala His Ala His Ala His Ala His Ala Ser Ala Gly Ala Asn
50 55 60
Gly Arg Ala Arg Ala His Ala Arg Ala Leu Ala His Ala His Ala Gly
65 70 75 80
Gly Gly Ala Ala His Gly His Pro Gly Phe Pro Val Gly Gly Ser Ala
85 90 95
Ser Ala Ala Ala Arg Ala Ala Ala Arg Ala Ser Ala Gly Gly Leu Gly
100 105 110
Gly Phe Gly Ser Ala Ala Ala Asn Ala Ala Ala Ala Ala Arg Ala Gly
115 120 125
Ala Gly Phe Gly Gly Phe Gly Gly Leu Gly Gly Phe Gly Gly Leu Gly
130 135 140
Gly Val Gly Gly Pro Gly Gln Pro Gly Gly Pro Gly Gly Pro Gly Gly
145 150 155 160
Pro Gly Gly Pro Gly Gly Pro Gly Met Pro Gly Gly Pro Gly Gly Pro
165 170 175
Ser Gly Pro Gly Thr Gly Gly Pro Gly Gln Pro Gly Gly Pro Gly Gly
180 185 190
Pro Gly Gly Pro Gly Gly Pro Gly Gly Pro Ser Met Pro Gly Gly Pro
195 200 205
Gly Gly Pro Gly Gly Pro Gly Met Pro Gly Gly Pro Gly Gly Pro Gly
210 215 220
Gly Pro Gly Gly Ala Gly Gly Ile Pro Gly Met Thr Gly Pro Ala Gly
225 230 235 240
Pro Pro Gly Pro Ala Gly Pro Gln Gly Pro Glu Gly Glu Gln Gly Pro
245 250 255
Arg Gly Arg Thr Pro Ala Gly Thr Pro Gly Pro Pro Gly Asn Pro Gly
260 265 270
Glu Pro Gly Gln Gly Gly Ala Pro Gly Ala Pro Gly Ala Pro Gly His
275 280 285
Ala Gly Lys His Gly Thr Ala Gly Ala Ala Gly Lys Ala Gly Arg Pro
290 295 300
Gly Pro Xaa Gly Gln Ala Gly Ala Ser Gly Ser Ser Gly Gln His Gly
305 310 315 320
Ala Ser Gly Ala Pro Gly Arg Pro Gly Asn Pro Gly Ser Thr Gly Arg
325 330 335
Pro Gly Ala Thr Gly Asp Pro Gly Arg Pro Gly Ala Thr Gly Thr Thr
340 345 350
Gly Arg Pro Gly Pro Ser Gly Ala Pro Gly Asn Pro Gly Ala Pro Gly
355 360 365
Ala Leu Gly Ala Pro Gly Pro Arg Gly Ser Pro Gly Phe Val Gly Leu
370 375 380
Pro Gly Pro Arg Gly Ser Pro Gly Glu Pro Gly Asn Gln Gly Pro Ile
385 390 395 400
Gly Gly Pro Gly Tyr Pro Gly Pro Arg Gly Pro Gln Gly Pro Asp Gly
405 410 415
Ala Met Gly Pro Gln Gly Pro Cys Gly Asp Arg Gly Ala Pro Gly Val
420 425 430
Pro Gly Lys Gln Gly Pro Val Gly Gly Gln Gly Pro Ala Gly Pro Arg
435 440 445
Gly Pro Arg Gly Asp Glu Gly Pro Val Gly Pro Lys Gly Glu Pro Gly
450 455 460
Ala Arg Gly Ala Asp Gly Lys Pro Gly Asp Lys Gly Pro Asp Gly Glu
465 470 475 480
Thr Gly Pro Gln Gly Pro Ala Gly Pro Lys Gly Gln Val Gly Asp Gln
485 490 495
Gly Lys Pro Gly Ala Lys Gly Glu Thr Gly Asp Gln Gly Ala Arg Gly
500 505 510
Glu Ala Gly Lys Ala Gly Glu Gln Gly Pro Gly Gly Ile Gln Gly Pro
515 520 525
Lys Gly Pro Val Gly Gly Gln Gly Pro Ala Gly Pro Ala Gly Pro Leu
530 535 540
Gly Pro Gln Gly Pro Met Gly Glu Arg Gly Pro Gln Gly Pro Thr Gly
545 550 555 560
Ser Glu Gly Pro Val Gly Ala Pro Gly Pro Lys Gly Ser Val Gly Asp
565 570 575
Gln Gly Ala Gln Gly Asp Gln Gly Ala Thr Gly Ala Asp Gly Lys Lys
580 585 590
Gly Glu Pro Gly Glu Arg Gly Gln Gln Gly Ala Ala Gly Pro Val Gly
595 600 605
Arg Pro Gly Pro Arg Gly Asp Arg Gly Ala Lys Gly Ile Gln Gly Ser
610 615 620
Arg Gly Arg Pro Gly Gly Met Gly Arg Arg Gly Asn Arg Gly Ser Gln
625 630 635 640
Gly Ala Val Gly Pro Arg Gly Glu Thr Gly Pro Asp Gly Asn Gln Gly
645 650 655
Gln Arg Gly Glu Gln Gly Ala Pro Gly Val Ile Thr Leu Val Ile Glu
660 665 670
Asp Leu Arg Thr Ala Gly Val Glu Ser Pro Val Glu Thr Phe Asp Ala
675 680 685
Gly Ala Gly Thr Gly Gly Pro Ala Pro Gly Val Gly Ala Ala Ala Thr
690 695 700
Ala Gly Ala Phe Ala Gly Ala Gly Pro Gly Gly Ala Asn Ala Gly Gly
705 710 715 720
Asn Ala Ala Ala Gly Ala Gly Pro Gly Val Gly Pro Gly Gly Leu Gly
725 730 735
Gly Leu Gly Gly Leu Gly Ala Gly Gly Leu Gly Gly Gly Leu Gly Gly
740 745 750
Gly Leu Gly Gly Leu Gly Gly Ala Gly Gly Leu Gly Gly Gly Leu Gly
755 760 765
Gly Leu Gly Gly Gly Leu Gly Gly Gly Leu Gly Gly Leu Gly Gly Gly
770 775 780
Ala Gly Gly Ala Gly Ala Gly Gly Asn Gly Gly Ala Gly Ala Gly Gly
785 790 795 800
Ala Gly Gly Asn Gly Gly Gly Ser Ala Ala Ala Arg Ala Ala Ala Gln
805 810 815
Ala Ala Ala Ala Ala Gly Gly Asn Gly Gly Ala Ala Gln Ala Ala Ala
820 825 830
Gln Ala Ala Ala Ser Ala Ala Ala Asn Ser Gly Leu Gly Ala Gly Ala
835 840 845
Ala Arg Ala Ala Ala Ser Ala Ala Ala Arg Ala Thr Val Thr Gly His
850 855 860
Gly Ser Gly Thr Ala Ala Ala Ala Ala Asn Ala Ala Ala Gln Ala His
865 870 875 880
Ala Ala Thr Arg Gly Gln Gly Gly Ser His Ala His Ala Ala Ala Ala
885 890 895
Ala His Ala Ala Ala Ser Ser Val Ile His Gly Gly Asp Tyr His Gly
900 905 910
Asn Asp Ala Gly Tyr His Lys Pro Gly Tyr
915 920
<210> 9
<211> 19
<212> PRT
<213> Saccharomyces cerevisiae
<400> 9
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala
<210> 10
<211> 952
<212> PRT
<213> Mytilus edulis
<400> 10
Met Val Arg Phe Ser Leu Ala Ser Val Leu Leu Leu Ala Val Thr Ser
1 5 10 15
Thr Ala Phe Ala Gly Pro Val Ser Asp Tyr Gly Gly Gly Gly Ile Lys
20 25 30
Val Val Pro Tyr His Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly
35 40 45
Gly Gly His Gly Gly Ser Phe Arg Asn Gly Arg His Gly Gly Ile Gly
50 55 60
Gly Ile Gly Gly Gly Ser Ser His Ala His Ala His Ser Ser Ala Ser
65 70 75 80
Ala His Val His His Phe Gly Pro Gly Gly Ser Ser His Ala Ser Ala
85 90 95
Gly Ser Ser Ser His Ala Ser Ala Ser His Asn Gly Leu Gly Gly Gly
100 105 110
Ser Ala His Ala His Ser Ser Ser Ser Ala Asn Ala His Ser Gly Gly
115 120 125
Phe Gly Gly Phe Gly Gly Ile Gly Gly Ile Gly Gly Ile Gly Pro Gly
130 135 140
Gly Ser Val Gly Gly Gly Ile Gly Pro Gly Gly Ser Val Gly Gly Gly
145 150 155 160
Ile Gly Gly Ile Gly Gly Ile Gly Gly Gly Gly Gly Pro Gly Gly Asn
165 170 175
Gly Gly Ile Gly Phe Gly Pro Gly Phe Gly Gly Gly Phe Gly Pro Gly
180 185 190
Ser Ser Ala Ser Gly Ser Gly Ser Gly Ser Ala Phe Gly Gly Pro Gly
195 200 205
Gly Ser Ser Ala Ser Ala Asn Ala Ala Ala Arg Ala Asn Ala Asn Gly
210 215 220
Gly Gly Gly Phe Gly Gly Pro Gly Thr Pro Gly Asn Ser Gly Pro Pro
225 230 235 240
Gly Gln Pro Gly Leu Pro Gly Ala Pro Gly Gln Pro Gly Arg Pro Gly
245 250 255
Ser Thr Pro Pro Gly Arg Leu Gly Asn Pro Gly Pro Pro Gly Gln Pro
260 265 270
Gly Asn Pro Gly Arg Pro Gly Ser Ser Gly Arg Pro Gly Gly Ser Gly
275 280 285
Gln Pro Gly Gly Pro Gly Arg Pro Gly Thr Pro Gly Lys Pro Gly Asn
290 295 300
Arg Gly Gln Pro Gly Gln Pro Gly Gly Pro Gly Gln Pro Gly His Pro
305 310 315 320
Gly Ala Gly Gly Gln Pro Gly Arg Asn Gly Asn Pro Gly Asn Pro Gly
325 330 335
Lys Pro Gly Thr Pro Gly His Pro Gly Thr Ala Gly Ser Arg Gly Met
340 345 350
Pro Gly Thr Pro Gly Thr Pro Gly Gln Pro Gly Ile Pro Gly Thr Val
355 360 365
Gly Gly Arg Gly Pro Arg Gly Pro Ala Gly Ile Ile Gly Leu Ile Gly
370 375 380
Pro Lys Gly Asn Pro Gly Glu Pro Gly Asn Pro Gly Ala Pro Gly Gly
385 390 395 400
Pro Gly Ser Thr Gly Pro Gln Gly Pro Gln Gly Pro Ala Gly Gly Pro
405 410 415
Gly Ala Ser Gly Gly Pro Gly Asp Lys Gly Ala Pro Gly Thr Pro Gly
420 425 430
Gly Thr Gly Pro Arg Gly Pro Ile Gly Pro Ser Gly Pro Ser Gly Ala
435 440 445
Pro Gly Asp Gln Gly Pro Gln Gly Gly Arg Gly Thr Pro Gly Leu Ala
450 455 460
Gly Lys Pro Gly Pro Lys Gly Leu Gln Gly Ser Asn Gly Glu Val Gly
465 470 475 480
Pro Gln Gly Pro Ser Gly Pro Ala Gly Pro Gln Gly Pro Gln Gly Lys
485 490 495
Asn Gly Val Lys Gly Ala Ala Gly Asp Gln Gly Ala Arg Gly Pro Glu
500 505 510
Gly Lys Ala Gly Pro Ala Gly Pro Gln Gly Glu Thr Gly Pro Lys Gly
515 520 525
Pro Thr Gly Ala Gln Gly Pro Ala Gly Pro Ala Gly Pro Ser Gly Glu
530 535 540
Gln Gly Pro Gly Gly Glu Arg Gly Gly Gln Gly Pro Gln Gly Ala Glu
545 550 555 560
Gly Pro Ser Gly Pro Ala Gly Pro Arg Gly Pro Ala Gly Ser Gln Gly
565 570 575
Pro Ser Gly Glu Arg Gly Glu Pro Gly Ala Pro Gly Lys Lys Gly Pro
580 585 590
Asn Gly Asp Arg Gly Asn Gln Gly Ser Pro Gly Ala Pro Gly Lys Asn
595 600 605
Gly Ala Arg Gly Asn Arg Gly Ser Arg Gly Ser Asn Gly Ser Pro Gly
610 615 620
Arg Ser Gly Ser Pro Gly Ser Arg Gly Lys Pro Gly Pro Gln Gly Pro
625 630 635 640
His Gly Pro Arg Gly Leu Arg Gly Ser Pro Gly Gln Lys Gly Pro Arg
645 650 655
Gly Asp Gln Gly Ala Pro Gly Val Ile Arg Ile Val Ile Asp Asp Gln
660 665 670
Arg Thr Gly Pro Glu Val Ala Glu Phe Pro Gly Phe Gly Gly Phe Gly
675 680 685
Gly Ala Ser Ala Asn Ala Ala Ser Ser Ala Asn Ala Phe Ala Gly Gly
690 695 700
Pro Gly Gly Ser Ala Gly Ala Gly Ser Ser Ser Gly Ala Asn Ala Asn
705 710 715 720
Ala Gly Gly Phe Pro Phe Gly Gly Ala Gly Gly Gly Pro Gly Ala Ala
725 730 735
Gly Gly Pro Gly Gly Ala Gly Gly Pro Gly Gly Val Gly Gly Gly Val
740 745 750
Gly Gly Gly Pro Gly Gly Val Gly Gly Gly Val Gly Gly Gly Pro Gly
755 760 765
Gly Val Gly Gly Gly Pro Gly Gly Ala Gly Pro Gly Gly Ala Gly Gly
770 775 780
Phe Gly Pro Gly Gly Ala Gly Gly Phe Gly Gly Phe Gly Gly Gly Ser
785 790 795 800
Ser Ala Gly Ala Ser Ser Ser Gly Ser Ala Ser Ala Ser Asn Gly Gly
805 810 815
Pro Phe Gly Val Leu Asn Val Gly Pro Gly Gly Arg Ile Gly Gly Gly
820 825 830
Ser Ala Ser Ala Ser Ala Ala Ser Arg Ala His Ala His Phe Gly Gly
835 840 845
Gly Ser Ser Ala Gly Ala Ser Ser Ser Gly Ser Ala Ser Ala Ser Asn
850 855 860
Gly Gly Pro Phe Gly Val Leu Asn Val Gly Pro Gly Gly Arg Ile Gly
865 870 875 880
Gly Gly Ser Ala Ser Ala Ser Ala Ala Ser Arg Ala His Ala His Ala
885 890 895
Phe Gly Gly Leu Gly Gly Gly Ser Ala Ser Ala Gly Ser His Ser Ser
900 905 910
Ser Ser Ser His Ser Phe Gly Gly His Val Phe His Ser Val Thr His
915 920 925
His Gly Gly Pro Ser His Val Ser Ser Gly Gly His Gly Gly His Gly
930 935 940
Gly Gly Pro Tyr Lys Pro Gly Tyr
945 950
Claims (25)
- 재조합 콜라겐 유사 단백질 바람직하게는 재조합 홍합(mussel) 족사(byssus) 단백질을 생산하기 위한 하기 요소로 형질 전환된 효모 세포:a) 상기 재조합 콜라겐 유사 단백질을 암호화하는 일차 발현 벡터; 및b) 프롤릴-4-히드록실라제(P4H, prolyl-4-hydroxylase)를 암호화하는 핵산을 포함하는 2차 발현 벡터.
- 제1항에 있어서, 상기 프롤릴-4-히드록실라제의 서열은 상기 효모 세포의 세포질내 세망(endoplasmic reticulum)으로 상기 서열의 효과적인 수송을 위한 신호 서열이 연결된 것을 특징으로 하는 효모 세포.
- 제2항에 있어서, 상기 신호 서열은 서열번호 10의 사카로마이세스 세레비지에(Saccharomyces cerevisiae) 교배 인자 알파-1(MFa)의 신호서열인 것을 특징으로 하는 효모 세포.
- 제1항 내지 제3항 중 하나 이상의 항에 있어서, 상기 효모 세포는 사카로마이세스 세레비지에(Saccharomyces cerevisiae), 스키조사카로마이세스 폼베(Schizosaccharomyces pombe), 피치아 파스토리스(Pichia pastoris), 칸디다 알비칸스(Candida albicans), 및 한세눌라 폴리모프(Hansenula polymorph) 세포로 이 루어진 군에서 선택된 어느 하나인 것을 특징으로 하는 효모 세포.
- 제1항 내지 제4항 중 하나 이상의 항에 있어서, 상기 1차 발현 벡터는 하나 이상의 조절인자(regulatory element)를 포함하는 것을 특징으로 하는 효모 세포.
- 제5항에 있어서, 상기 조절인자는 유도성(inducible) 또는 항상성(constitutive) 프로모터에서 선택된 프로모터를 함유하는 것을 특징으로 하는 효모 세포.
- 제6항에 있어서, 상기 프로모터는 GPD, GAL4, CUP1, MET25, GAL1 또는 GAL1-10에서 선택된 프로모터인 것을 특징으로 하는 효소 세포.
- 전술한 항 중 하나 이상의 항에 있어서, 상기 발현 벡터는 플라스미드인 것을 특징으로 하는 효소 세포.
- 전술한 항 중 하나 이상의 항에 있어서, 상기 재조합 콜라겐 유사 단백질은 홍합 엘라스틴(elastin) 또는 홍합 실크 피브로인(silk fibroin) 도메인이 측면 배치(flanking)된 하나 이상의 홍합 콜라겐 도메인 단편(fragment)으로 이루어지거나 포함하는 재조합 홍합 족사 단백질인 것을 특징으로 하는 효소 세포.
- 전술한 항 중 하나 이상의 항에 있어서, 상기 단편(fragment)은 미틸러스 속(Mytilus sp.)의 미틸러스 에듈리스(M. edulis), 미틸러스 갈로프로빈시알리스(M. galloprovincialis), 미틸러스 캘리포니안스(M. californians), 또는 제우케리아 데미싸(Geukeria demissa)에서 유래된 것을 특징으로 하는 효소 세포.
- 전술한 항 중 하나 이상의 항에 있어서, 상기 재조합 홍합(mussel) 족사(byssus) 단백질은 preColP 및/또는 preColD 또는 그 변이체의 하나 또는 하나 이상의 단편으로 이루어지거나 포함하는 것을 특징으로 하는 효소 세포.
- 전술한 항 중 하나 이상의 항에 있어서, 상기 재조합 단백질은 서열번호 3 및/또는 서열번호4, 또는 그 변이체의 아미노산 서열을 포함하거나 그 서열로 이루어진 것을 특징으로 하는 재조합 단백질인 것을 특징으로 하는 효소 세포.
- 전술한 항 중 하나 이상의 항에 있어서, 상기 재조합 단백질의 각 아미노산 서열의 신호 서열은 효모 특이적 신호 서열 바람직하게는 사카로마이세스 세레비지에(Saccharomyces cerevisiae)의 교배 인자 알파 1으로 치환된 것을 특징으로 하는 효모 세포.
- 전술한 항 중 하나 이상의 항에 있어서, 상기 프롤릴-4-히드록실라제(P4H, prolyl-4-hydroxylase)는 사람 또는 홍합의 프롤릴-4-히드록실라제(P4H, prolyl-4- hydroxylase)인 것을 특징으로 하는 효모 세포.
- 하기 구성요소를 포함하는 재조합 콜라겐 유사 단백질의 생산 용도를 위한 공동 발현 시스템 또는 그 부분 키트:(a) 제1항 내지 제14항의 하나 이상의 항에서 정의된 일차 발현 벡터; 및(b) 제1항 내지 제14항의 하나 이상의 항에서 정의된 이차 발현 벡터
- 하기 단계를 포함하는 재조합 콜라겐 유사 단백질, 바람직하게는 재조합 홍합 족사 단백질의 생산 방법:a) 효모 세포를 준비하는 단계;b) 상기 효모세포에 제15항의 공동 발현 시스템 또는 제1항 내지 제14항 중 하나 이상의 항에서 정의된 일차 및 이차 발현 벡터로 상기 효모세포를 형질 전환시키는 단계;c) 적절한 조건에서 상기 효모 세포로부터 재조합 콜라겐 유사 단백질 바람직하게는 재조합 홍합 족사 단백질을 발현시키는 단계; 및d) 상기 재조합 단백질을 회수하는 단계.
- 하기 단계를 포함하는 재조합 홍합 족사 단백질에서 실(thread)을 생산하는 방법:a) 제16항에 따른 재조합 단백질을 생산하는 단계; 및b) 적절한 방법에 의해 상기 단백질을 실(thread)로 (전자)방적(spinning)하거나 몰딩(molding)하는 단계.
- 제17항의 방법에 따라 생산되는 실 또는 제16항의 방법에 따라 생산되는 단백질.
- 제18항의 단백질/실의 바이오테크놀로지 또는 의약 분야에서의 용도.
- 제18항의 단백질/실의 상처 봉합 또는 피복 시스템(coverage system)의 제조를 위한 용도.
- 제20항에 있어서, 봉합 물질(suture material)의 제조를 위한 용도.
- 제21항에 있어서, 상기 봉합 물질은 신경외과 또는 안외과(ophthalmic surgery)에서 사용하기 위한 용도.
- 제18항에 따른 단백질/실의 대체물질 바람직하게는 인공 연골 또는 건(tendon) 물질의 제조를 위한 용도.
- 제18항의 단백질/실을 사용하여 얻을 수 있는 상처 봉합 또는 피복 시스템, 봉합 물질, 대체 물질, 바람직하게는 인공 연골 또는 건 물질.
- 제18항의 단백질/실을 포함하는 화장품, 약물 전단 운반체, 직물, 옷감, 종이 제품, 가죽 제품, 자동차 부품, 또는 항공기 부품.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP05025131A EP1787995A1 (en) | 2005-11-17 | 2005-11-17 | Recombinant mussel byssus protein |
EP05025131.3 | 2005-11-17 |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20080074134A true KR20080074134A (ko) | 2008-08-12 |
Family
ID=36581654
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020087012272A KR20080074134A (ko) | 2005-11-17 | 2006-11-17 | 재조합 콜라겐 유사 단백질의 생산 |
Country Status (9)
Country | Link |
---|---|
US (1) | US20090162896A1 (ko) |
EP (2) | EP1787995A1 (ko) |
JP (1) | JP2009515540A (ko) |
KR (1) | KR20080074134A (ko) |
CN (1) | CN101316862A (ko) |
AU (1) | AU2006314708B2 (ko) |
CA (1) | CA2629821C (ko) |
RU (1) | RU2008123706A (ko) |
WO (1) | WO2007057207A1 (ko) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20180113181A (ko) * | 2017-04-05 | 2018-10-15 | 한양대학교 에리카산학협력단 | 자극 반응성 및 표면 부착성을 지닌 엘라스틴 기반 펩타이드 및 홍합 족사 단백질로 이루어진 다중 블럭 코폴리펩타이드 및 이의 제조 방법 및 그 용도 |
Families Citing this family (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
ES2530190T3 (es) * | 2007-06-20 | 2015-02-27 | Basf Se | Proteínas repetitivas sintéticas, su producción y su uso |
EP2178949B1 (en) | 2007-08-14 | 2017-12-13 | Cook Medical Technologies LLC | Photoactivated crosslinking of a protein or peptide |
CN102146426B (zh) * | 2010-12-23 | 2013-04-24 | 陕西九州生物医药科技园发展有限公司 | 毕赤酵母表达重组类人胶原蛋白的生产方法 |
WO2013071356A1 (en) * | 2011-11-16 | 2013-05-23 | Commonwealth Scientific And Industrial Research Organisation | Collagen-like silk genes |
ITGE20130040A1 (it) * | 2013-04-18 | 2014-10-19 | Univ Degli Studi Genova | Metodo per la produzione di collagene marino ricombinante e organismo capace di produrre detto collagene marino |
CN104323927A (zh) * | 2014-11-06 | 2015-02-04 | 济南星河康泰贸易有限公司 | 贻贝粘蛋白在制备皮肤美容化妆品上的应用 |
US11913166B2 (en) | 2015-09-21 | 2024-02-27 | Modern Meadow, Inc. | Fiber reinforced tissue composites |
ES2806990T3 (es) * | 2016-02-15 | 2021-02-19 | Modern Meadow Inc | Procedimiento para fabricar un material biofabricado que contiene fibrillas de colágeno |
US10273549B2 (en) * | 2016-04-21 | 2019-04-30 | Vitrolabs Inc. | Engineered skin equivalent, method of manufacture thereof and products derived therefrom |
EP3333179A1 (en) * | 2016-12-07 | 2018-06-13 | Merz Pharma GmbH & Co. KGaA | Novel recombinant botulinum toxin with accelarated onset of effect |
RU2658428C9 (ru) * | 2017-10-03 | 2018-10-03 | Общество с ограниченной ответственностью "Медсервис" | Средство для лечения состояний человеческого организма, связанных с уменьшением уровня экспрессии гена Р4НА1 и/или уменьшением количества белка пролил 4-гидроксилазы альфа 1 на основе генно-терапевтических субстанций с геном Р4НА1, способ получения и использования |
AU2018253595A1 (en) | 2017-11-13 | 2019-05-30 | Modern Meadow, Inc. | Biofabricated leather articles having zonal properties |
CN107699978A (zh) * | 2017-11-28 | 2018-02-16 | 苏州大学 | 人造足丝及其制备方法 |
RU2700649C2 (ru) * | 2017-12-25 | 2019-09-18 | Селл энд Джин Терапи Лтд | Генетическая конструкция на основе невирусной векторной плазмиды, включающей кДНК гена Р4НА2, для купирования проявлений состояний человеческого организма, связанных с уменьшением экспрессии гена Р4НА2 и/или уменьшением количества белка пролил 4-гидрокислаза альфа 2, способ получения и использования |
US11178934B2 (en) * | 2018-07-18 | 2021-11-23 | Bolt Threads Inc. | Resilin material footwear and fabrication methods |
AU2020209847B2 (en) | 2019-01-17 | 2024-10-17 | Modern Meadow, Inc. | Layered collagen materials and methods of making the same |
CN116650342A (zh) * | 2019-10-30 | 2023-08-29 | 诺斯贝尔化妆品股份有限公司 | 酵母重组胶原蛋白的胶原纳米速溶面膜及制备方法 |
BR112022010434A2 (pt) | 2019-11-29 | 2022-10-11 | Lallemand Hungary Liquidity Man Llc | Levedura que expressa glucoamilase heteróloga |
CN112301496A (zh) * | 2020-11-03 | 2021-02-02 | 上海普平生物科技有限公司 | 可控降解手术缝合线的制备方法 |
CN114106150B (zh) * | 2021-12-03 | 2022-08-16 | 江苏创健医疗科技有限公司 | 重组胶原蛋白、制备方法及其应用 |
CN114774460A (zh) * | 2021-12-27 | 2022-07-22 | 江苏创健医疗科技有限公司 | 酵母重组人源i型三螺旋胶原蛋白及其制备方法 |
CN115109795B (zh) * | 2022-06-14 | 2023-04-21 | 济南磐升生物技术有限公司 | 重组人源iii型胶原蛋白注射剂及其在皮肤胶原再生中的应用 |
CN117701618B (zh) * | 2023-12-15 | 2024-10-18 | 肽源(广州)生物科技有限公司 | 一种在毕赤酵母中高效分泌表达全链长胶原蛋白的方法及其应用 |
CN118440975A (zh) * | 2024-05-30 | 2024-08-06 | 江苏创健医疗科技股份有限公司 | 一种酵母表达三螺旋结构的重组人源化胶原蛋白的制备方法 |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5593859A (en) * | 1991-10-23 | 1997-01-14 | Thomas Jefferson University | Synthesis of human procollagens and collagens in recombinant DNA systems |
-
2005
- 2005-11-17 EP EP05025131A patent/EP1787995A1/en not_active Withdrawn
-
2006
- 2006-11-17 KR KR1020087012272A patent/KR20080074134A/ko not_active Application Discontinuation
- 2006-11-17 AU AU2006314708A patent/AU2006314708B2/en active Active
- 2006-11-17 RU RU2008123706/13A patent/RU2008123706A/ru not_active Application Discontinuation
- 2006-11-17 CN CNA2006800428380A patent/CN101316862A/zh active Pending
- 2006-11-17 US US12/085,266 patent/US20090162896A1/en not_active Abandoned
- 2006-11-17 WO PCT/EP2006/011061 patent/WO2007057207A1/en active Application Filing
- 2006-11-17 JP JP2008540524A patent/JP2009515540A/ja active Pending
- 2006-11-17 EP EP06818633.7A patent/EP1948684B1/en active Active
- 2006-11-17 CA CA2629821A patent/CA2629821C/en active Active
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20180113181A (ko) * | 2017-04-05 | 2018-10-15 | 한양대학교 에리카산학협력단 | 자극 반응성 및 표면 부착성을 지닌 엘라스틴 기반 펩타이드 및 홍합 족사 단백질로 이루어진 다중 블럭 코폴리펩타이드 및 이의 제조 방법 및 그 용도 |
Also Published As
Publication number | Publication date |
---|---|
EP1948684A1 (en) | 2008-07-30 |
CA2629821C (en) | 2013-05-28 |
WO2007057207A1 (en) | 2007-05-24 |
EP1787995A1 (en) | 2007-05-23 |
EP1948684B1 (en) | 2016-01-27 |
CA2629821A1 (en) | 2007-05-24 |
RU2008123706A (ru) | 2009-12-27 |
AU2006314708A1 (en) | 2007-05-24 |
JP2009515540A (ja) | 2009-04-16 |
AU2006314708B2 (en) | 2012-09-13 |
CN101316862A (zh) | 2008-12-03 |
US20090162896A1 (en) | 2009-06-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2006314708B2 (en) | Production of recombinant collagen like proteins | |
US7521228B2 (en) | Spider silk protein encoding nucleic acids, polypeptides, antibodies and methods of use thereof | |
US8674077B2 (en) | Processes for producing silk dope | |
AU2012339619B2 (en) | Collagen-like silk genes | |
US11827674B2 (en) | Wet adhesive peptides | |
WO2013120143A1 (en) | Method of promoting the formation of cross-links between coiled coil silk proteins | |
AU2013239320B2 (en) | Silk polypeptides | |
US20110177997A1 (en) | Cross-Beta Silk Genes |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WITN | Application deemed withdrawn, e.g. because no request for examination was filed or no examination fee was paid |