KR20070083870A - Collagen producing plants and methods of generating and using same - Google Patents
Collagen producing plants and methods of generating and using same
- Publication number
- KR20070083870A (application number KR1020077009817A)
- Authority
- KR
- South Korea
- Prior art keywords
- gly
- pro
- ala
- plant
- glu
Links
- 102000008186 Collagen Human genes 0.000 title claims abstract description 248
- 108010035532 Collagen Proteins 0.000 title claims abstract description 248
- 229920001436 collagen Polymers 0.000 title claims abstract description 246
- 238000000034 method Methods 0.000 claims abstract description 68
- 101710096389 Collagen alpha chain Proteins 0.000 claims abstract description 40
- 230000000694 effects Effects 0.000 claims abstract description 21
- 102200024044 rs1555523872 Human genes 0.000 claims abstract 33
- 241000196324 Embryophyta Species 0.000 claims description 298
- 230000014509 gene expression Effects 0.000 claims description 48
- 150000007523 nucleic acids Chemical class 0.000 claims description 45
- 210000004027 cell Anatomy 0.000 claims description 40
- 108010076504 Protein Sorting Signals Proteins 0.000 claims description 35
- 108020004707 nucleic acids Proteins 0.000 claims description 35
- 102000039446 nucleic acids Human genes 0.000 claims description 35
- 244000061176 Nicotiana tabacum Species 0.000 claims description 31
- 235000002637 Nicotiana tabacum Nutrition 0.000 claims description 31
- 230000008685 targeting Effects 0.000 claims description 31
- 108010022452 Collagen Type I Proteins 0.000 claims description 25
- 102000012422 Collagen Type I Human genes 0.000 claims description 25
- 230000003834 intracellular effect Effects 0.000 claims description 22
- 210000003934 vacuole Anatomy 0.000 claims description 22
- 235000007340 Hordeum vulgare Nutrition 0.000 claims description 21
- 240000005979 Hordeum vulgare Species 0.000 claims description 21
- 102100035199 Procollagen glycosyltransferase Human genes 0.000 claims description 21
- 230000033444 hydroxylation Effects 0.000 claims description 17
- 238000005805 hydroxylation reaction Methods 0.000 claims description 17
- 210000003463 organelle Anatomy 0.000 claims description 17
- 230000035882 stress Effects 0.000 claims description 17
- 230000014759 maintenance of location Effects 0.000 claims description 13
- 108010042388 protease C Proteins 0.000 claims description 10
- 108010043393 protease N Proteins 0.000 claims description 10
- 210000004899 c-terminal region Anatomy 0.000 claims description 7
- 230000001939 inductive effect Effects 0.000 claims description 7
- 108091033319 polynucleotide Proteins 0.000 claims description 7
- 102000040430 polynucleotide Human genes 0.000 claims description 7
- 239000002157 polynucleotide Substances 0.000 claims description 7
- 240000007594 Oryza sativa Species 0.000 claims description 6
- 235000007164 Oryza sativa Nutrition 0.000 claims description 6
- 235000002595 Solanum tuberosum Nutrition 0.000 claims description 6
- 244000061456 Solanum tuberosum Species 0.000 claims description 6
- 235000009566 rice Nutrition 0.000 claims description 6
- 235000010469 Glycine max Nutrition 0.000 claims description 5
- 244000068988 Glycine max Species 0.000 claims description 5
- 235000017587 Medicago sativa ssp. sativa Nutrition 0.000 claims description 5
- 235000021307 Triticum Nutrition 0.000 claims description 5
- 240000008042 Zea mays Species 0.000 claims description 5
- 235000002017 Zea mays subsp mays Nutrition 0.000 claims description 5
- 230000006378 damage Effects 0.000 claims description 5
- 235000014698 Brassica juncea var multisecta Nutrition 0.000 claims description 4
- 235000006008 Brassica napus var napus Nutrition 0.000 claims description 4
- 240000000385 Brassica napus var. napus Species 0.000 claims description 4
- 235000006618 Brassica rapa subsp oleifera Nutrition 0.000 claims description 4
- 235000004977 Brassica sinapistrum Nutrition 0.000 claims description 4
- 229920000742 Cotton Polymers 0.000 claims description 4
- 235000007688 Lycopersicon esculentum Nutrition 0.000 claims description 4
- 240000003768 Solanum lycopersicum Species 0.000 claims description 4
- 241000218632 Strawberry vein banding virus Species 0.000 claims description 4
- 150000001875 compounds Chemical class 0.000 claims description 4
- 238000001035 drying Methods 0.000 claims description 4
- 108090000848 Ubiquitin Proteins 0.000 claims description 3
- 102000044159 Ubiquitin Human genes 0.000 claims description 3
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 claims description 3
- 235000005822 corn Nutrition 0.000 claims description 3
- 229910001385 heavy metal Inorganic materials 0.000 claims description 3
- 230000000640 hydroxylating effect Effects 0.000 claims description 3
- 231100000783 metal toxicity Toxicity 0.000 claims description 3
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 claims description 3
- 244000000626 Daucus carota Species 0.000 claims description 2
- 235000002767 Daucus carota Nutrition 0.000 claims description 2
- 240000004658 Medicago sativa Species 0.000 claims description 2
- 208000027418 Wounds and injury Diseases 0.000 claims description 2
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 claims description 2
- 230000008645 cold stress Effects 0.000 claims description 2
- 210000005260 human cell Anatomy 0.000 claims description 2
- 208000014674 injury Diseases 0.000 claims description 2
- 235000009973 maize Nutrition 0.000 claims description 2
- 238000005507 spraying Methods 0.000 claims description 2
- 241000209140 Triticum Species 0.000 claims 2
- 241000219146 Gossypium Species 0.000 claims 1
- 238000009825 accumulation Methods 0.000 abstract description 5
- 108010043005 Prolyl Hydroxylases Proteins 0.000 description 104
- 102000004079 Prolyl Hydroxylases Human genes 0.000 description 103
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 87
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 63
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 61
- 108010047495 alanylglycine Proteins 0.000 description 58
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 54
- 108090000623 proteins and genes Proteins 0.000 description 51
- 108010029020 prolylglycine Proteins 0.000 description 50
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 47
- 108020004414 DNA Proteins 0.000 description 46
- CAVKXZMMDNOZJU-UHFFFAOYSA-N Gly-Pro-Ala-Gly-Pro Natural products C1CCC(C(O)=O)N1C(=O)CNC(=O)C(C)NC(=O)C1CCCN1C(=O)CN CAVKXZMMDNOZJU-UHFFFAOYSA-N 0.000 description 38
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 description 37
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 34
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 33
- 108010077515 glycylproline Proteins 0.000 description 31
- 108010061238 threonyl-glycine Proteins 0.000 description 30
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 29
- 108010050848 glycylleucine Proteins 0.000 description 29
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 28
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 28
- 108010078144 glutaminyl-glycine Proteins 0.000 description 25
- 230000009261 transgenic effect Effects 0.000 description 25
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 24
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 24
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 23
- 102000004169 proteins and genes Human genes 0.000 description 23
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 22
- 108010064235 lysylglycine Proteins 0.000 description 22
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 21
- 108010079364 N-glycylalanine Proteins 0.000 description 21
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 21
- 235000018102 proteins Nutrition 0.000 description 21
- 102000004190 Enzymes Human genes 0.000 description 20
- 108090000790 Enzymes Proteins 0.000 description 20
- 229940088598 enzyme Drugs 0.000 description 20
- 108090000765 processed proteins & peptides Proteins 0.000 description 20
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 19
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 19
- 108010050808 Procollagen Proteins 0.000 description 19
- 239000002243 precursor Substances 0.000 description 19
- 210000001519 tissue Anatomy 0.000 description 19
- 230000003612 virological effect Effects 0.000 description 19
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 18
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 18
- 108010047857 aspartylglycine Proteins 0.000 description 18
- 229940096422 collagen type i Drugs 0.000 description 18
- 102000004196 processed proteins & peptides Human genes 0.000 description 18
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 17
- 108091026890 Coding region Proteins 0.000 description 17
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 17
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 17
- 101710102040 Procollagen glycosyltransferase Proteins 0.000 description 17
- 239000000523 sample Substances 0.000 description 17
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 16
- 101710094171 Thiol protease aleurain Proteins 0.000 description 16
- SCAKQYSGEIHPLV-IUCAKERBSA-N (4S)-4-[(2-aminoacetyl)amino]-5-[(2S)-2-(carboxymethylcarbamoyl)pyrrolidin-1-yl]-5-oxopentanoic acid Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SCAKQYSGEIHPLV-IUCAKERBSA-N 0.000 description 15
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 15
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 15
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 15
- 241000700605 Viruses Species 0.000 description 15
- 108010044940 alanylglutamine Proteins 0.000 description 15
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 14
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 14
- 108091027544 Subgenomic mRNA Proteins 0.000 description 14
- 210000003763 chloroplast Anatomy 0.000 description 14
- 239000000835 fiber Substances 0.000 description 14
- 230000002792 vascular Effects 0.000 description 14
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 13
- 102000057297 Pepsin A Human genes 0.000 description 13
- 108090000284 Pepsin A Proteins 0.000 description 13
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 13
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 13
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 13
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 13
- 238000013459 approach Methods 0.000 description 13
- 229940111202 pepsin Drugs 0.000 description 13
- 230000009466 transformation Effects 0.000 description 13
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 12
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 12
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 12
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 11
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 11
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 11
- 108010077245 asparaginyl-proline Proteins 0.000 description 11
- 108010031719 prolyl-serine Proteins 0.000 description 11
- 108010026333 seryl-proline Proteins 0.000 description 11
- 239000013598 vector Substances 0.000 description 11
- 241000589158 Agrobacterium Species 0.000 description 10
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 10
- NSORZJXKUQFEKL-JGVFFNPUSA-N Gln-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)N)N)C(=O)O NSORZJXKUQFEKL-JGVFFNPUSA-N 0.000 description 10
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 10
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 10
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 10
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 10
- 239000004365 Protease Substances 0.000 description 10
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 10
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 10
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 description 10
- 230000029087 digestion Effects 0.000 description 10
- 108010081551 glycylphenylalanine Proteins 0.000 description 10
- 108010057821 leucylproline Proteins 0.000 description 10
- 108010009298 lysylglutamic acid Proteins 0.000 description 10
- 108010024607 phenylalanylalanine Proteins 0.000 description 10
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 9
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 9
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 9
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 9
- 241000282326 Felis catus Species 0.000 description 9
- IEGFSKKANYKBDU-QWHCGFSZSA-N Gly-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)CN)C(=O)O IEGFSKKANYKBDU-QWHCGFSZSA-N 0.000 description 9
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 9
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 9
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 9
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 9
- 230000003367 anti-collagen effect Effects 0.000 description 9
- 238000010276 construction Methods 0.000 description 9
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 9
- 229920001184 polypeptide Polymers 0.000 description 9
- 238000001262 western blot Methods 0.000 description 9
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 8
- 241000219194 Arabidopsis Species 0.000 description 8
- 101100328884 Caenorhabditis elegans sqt-3 gene Proteins 0.000 description 8
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 8
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 8
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 8
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 8
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 8
- 241000880493 Leptailurus serval Species 0.000 description 8
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 8
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 8
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 8
- 230000002776 aggregation Effects 0.000 description 8
- 238000004220 aggregation Methods 0.000 description 8
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 8
- 210000002826 placenta Anatomy 0.000 description 8
- 239000006228 supernatant Substances 0.000 description 8
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 7
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 7
- 102100038132 Endogenous retrovirus group K member 6 Pro protein Human genes 0.000 description 7
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 7
- HJARVELKOSZUEW-YUMQZZPRSA-N Gly-Pro-Gln Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJARVELKOSZUEW-YUMQZZPRSA-N 0.000 description 7
- GAAHQHNCMIAYEX-UWVGGRQHSA-N Gly-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GAAHQHNCMIAYEX-UWVGGRQHSA-N 0.000 description 7
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 7
- 101000595913 Homo sapiens Procollagen glycosyltransferase Proteins 0.000 description 7
- DMHGKBGOUAJRHU-RVMXOQNASA-N Ile-Arg-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N DMHGKBGOUAJRHU-RVMXOQNASA-N 0.000 description 7
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 7
- MUYQDMBLDFEVRJ-LSJOCFKGSA-N Met-Ala-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 MUYQDMBLDFEVRJ-LSJOCFKGSA-N 0.000 description 7
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 7
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 7
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 7
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 7
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 7
- 108010038633 aspartylglutamate Proteins 0.000 description 7
- 108010015792 glycyllysine Proteins 0.000 description 7
- 108010037850 glycylvaline Proteins 0.000 description 7
- 108010036413 histidylglycine Proteins 0.000 description 7
- 108010034529 leucyl-lysine Proteins 0.000 description 7
- 108010003700 lysyl aspartic acid Proteins 0.000 description 7
- 108010053725 prolylvaline Proteins 0.000 description 7
- 238000001338 self-assembly Methods 0.000 description 7
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 6
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 6
- 101710132601 Capsid protein Proteins 0.000 description 6
- 101710094648 Coat protein Proteins 0.000 description 6
- 101710091045 Envelope protein Proteins 0.000 description 6
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 6
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 6
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 6
- RFQATBGBLDAKGI-VHSXEESVSA-N Lys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCCN)N)C(=O)O RFQATBGBLDAKGI-VHSXEESVSA-N 0.000 description 6
- 101710125418 Major capsid protein Proteins 0.000 description 6
- MVBZBRKNZVJEKK-DTWKUNHWSA-N Met-Gly-Pro Chemical compound CSCC[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N MVBZBRKNZVJEKK-DTWKUNHWSA-N 0.000 description 6
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 6
- 101710141454 Nucleoprotein Proteins 0.000 description 6
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 6
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 6
- 101710083689 Probable capsid protein Proteins 0.000 description 6
- 101710188315 Protein X Proteins 0.000 description 6
- 108010005233 alanylglutamic acid Proteins 0.000 description 6
- 108010092854 aspartyllysine Proteins 0.000 description 6
- 230000015572 biosynthetic process Effects 0.000 description 6
- 108010016616 cysteinylglycine Proteins 0.000 description 6
- 239000012634 fragment Substances 0.000 description 6
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 6
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 6
- 108010054155 lysyllysine Proteins 0.000 description 6
- 239000013612 plasmid Substances 0.000 description 6
- 239000013641 positive control Substances 0.000 description 6
- 108010077112 prolyl-proline Proteins 0.000 description 6
- 108010015796 prolylisoleucine Proteins 0.000 description 6
- 238000013519 translation Methods 0.000 description 6
- 239000013638 trimer Substances 0.000 description 6
- 108010073969 valyllysine Proteins 0.000 description 6
- 101000717362 Acetabularia peniculus Ribulose bisphosphate carboxylase small subunit, chloroplastic 7 Proteins 0.000 description 5
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 5
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 5
- AXXCUABIFZPKPM-BQBZGAKWSA-N Asp-Arg-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O AXXCUABIFZPKPM-BQBZGAKWSA-N 0.000 description 5
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 5
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 5
- 101100328886 Caenorhabditis elegans col-2 gene Proteins 0.000 description 5
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 5
- PXHABOCPJVTGEK-BQBZGAKWSA-N Glu-Gln-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O PXHABOCPJVTGEK-BQBZGAKWSA-N 0.000 description 5
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 5
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 5
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 5
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 5
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 5
- SSFWXSNOKDZNHY-QXEWZRGKSA-N Gly-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN SSFWXSNOKDZNHY-QXEWZRGKSA-N 0.000 description 5
- 102100021181 Golgi phosphoprotein 3 Human genes 0.000 description 5
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 5
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 5
- UETQMSASAVBGJY-QWRGUYRKSA-N Lys-Gly-His Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 UETQMSASAVBGJY-QWRGUYRKSA-N 0.000 description 5
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 5
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 5
- 108090000631 Trypsin Proteins 0.000 description 5
- 102000004142 Trypsin Human genes 0.000 description 5
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 5
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 5
- 108010062796 arginyllysine Proteins 0.000 description 5
- 108010060035 arginylproline Proteins 0.000 description 5
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 5
- 108010068265 aspartyltyrosine Proteins 0.000 description 5
- 238000003776 cleavage reaction Methods 0.000 description 5
- 238000010367 cloning Methods 0.000 description 5
- 239000000499 gel Substances 0.000 description 5
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 5
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 5
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 5
- 108010089804 glycyl-threonine Proteins 0.000 description 5
- 239000002245 particle Substances 0.000 description 5
- 108010012581 phenylalanylglutamate Proteins 0.000 description 5
- 239000000047 product Substances 0.000 description 5
- 230000007017 scission Effects 0.000 description 5
- 239000011780 sodium chloride Substances 0.000 description 5
- 239000012588 trypsin Substances 0.000 description 5
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 4
- 102000007469 Actins Human genes 0.000 description 4
- 108010085238 Actins Proteins 0.000 description 4
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 4
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 4
- 241000219195 Arabidopsis thaliana Species 0.000 description 4
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 4
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 4
- 108010051330 Arg-Pro-Gly-Pro Proteins 0.000 description 4
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 4
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 4
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 4
- 108010090461 DFG peptide Proteins 0.000 description 4
- FQCILXROGNOZON-YUMQZZPRSA-N Gln-Pro-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O FQCILXROGNOZON-YUMQZZPRSA-N 0.000 description 4
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 4
- CQAHWYDHKUWYIX-YUMQZZPRSA-N Glu-Pro-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O CQAHWYDHKUWYIX-YUMQZZPRSA-N 0.000 description 4
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 4
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 4
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 4
- CLODWIOAKCSBAN-BQBZGAKWSA-N Gly-Arg-Asp Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O CLODWIOAKCSBAN-BQBZGAKWSA-N 0.000 description 4
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 4
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 4
- YYXJFBMCOUSYSF-RYUDHWBXSA-N Gly-Phe-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYXJFBMCOUSYSF-RYUDHWBXSA-N 0.000 description 4
- SCJJPCQUJYPHRZ-BQBZGAKWSA-N Gly-Pro-Asn Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O SCJJPCQUJYPHRZ-BQBZGAKWSA-N 0.000 description 4
- LCWXJXMHJVIJFK-UHFFFAOYSA-N Hydroxylysine Natural products NCC(O)CC(N)CC(O)=O LCWXJXMHJVIJFK-UHFFFAOYSA-N 0.000 description 4
- PMMYEEVYMWASQN-DMTCNVIQSA-N Hydroxyproline Chemical compound O[C@H]1CN[C@H](C(O)=O)C1 PMMYEEVYMWASQN-DMTCNVIQSA-N 0.000 description 4
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 4
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 4
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 4
- 108010065920 Insulin Lispro Proteins 0.000 description 4
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 4
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 4
- 108091028043 Nucleic acid sequence Proteins 0.000 description 4
- 108091034117 Oligonucleotide Proteins 0.000 description 4
- 108091005804 Peptidases Proteins 0.000 description 4
- LLGTYVHITPVGKR-RYUDHWBXSA-N Phe-Gln-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O LLGTYVHITPVGKR-RYUDHWBXSA-N 0.000 description 4
- XQSREVQDGCPFRJ-STQMWFEESA-N Pro-Gly-Phe Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XQSREVQDGCPFRJ-STQMWFEESA-N 0.000 description 4
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 4
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 4
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 4
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 4
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 4
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 4
- 230000035508 accumulation Effects 0.000 description 4
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 4
- 108010041407 alanylaspartic acid Proteins 0.000 description 4
- 235000001014 amino acid Nutrition 0.000 description 4
- 150000001413 amino acids Chemical class 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 4
- 230000001580 bacterial effect Effects 0.000 description 4
- 239000000872 buffer Substances 0.000 description 4
- 108010060199 cysteinylproline Proteins 0.000 description 4
- YSMODUONRAFBET-UHFFFAOYSA-N delta-DL-hydroxylysine Natural products NCC(O)CCC(N)C(O)=O YSMODUONRAFBET-UHFFFAOYSA-N 0.000 description 4
- PMMYEEVYMWASQN-UHFFFAOYSA-N dl-hydroxyproline Natural products OC1C[NH2+]C(C([O-])=O)C1 PMMYEEVYMWASQN-UHFFFAOYSA-N 0.000 description 4
- 241001493065 dsRNA viruses Species 0.000 description 4
- YSMODUONRAFBET-UHNVWZDZSA-N erythro-5-hydroxy-L-lysine Chemical compound NC[C@H](O)CC[C@H](N)C(O)=O YSMODUONRAFBET-UHNVWZDZSA-N 0.000 description 4
- 238000002474 experimental method Methods 0.000 description 4
- 239000013604 expression vector Substances 0.000 description 4
- 108010049041 glutamylalanine Proteins 0.000 description 4
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 4
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 4
- 108010020688 glycylhistidine Proteins 0.000 description 4
- QJHBJHUKURJDLG-UHFFFAOYSA-N hydroxy-L-lysine Natural products NCCCCC(NO)C(O)=O QJHBJHUKURJDLG-UHFFFAOYSA-N 0.000 description 4
- -1 hydroxylysyl Chemical group 0.000 description 4
- 229960002591 hydroxyproline Drugs 0.000 description 4
- 230000006872 improvement Effects 0.000 description 4
- 238000000338 in vitro Methods 0.000 description 4
- 238000003780 insertion Methods 0.000 description 4
- 230000037431 insertion Effects 0.000 description 4
- 235000018977 lysine Nutrition 0.000 description 4
- 108010017391 lysylvaline Proteins 0.000 description 4
- 238000004519 manufacturing process Methods 0.000 description 4
- 108010005942 methionylglycine Proteins 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 229910052757 nitrogen Inorganic materials 0.000 description 4
- 229920002401 polyacrylamide Polymers 0.000 description 4
- 108010004914 prolylarginine Proteins 0.000 description 4
- 108010070643 prolylglutamic acid Proteins 0.000 description 4
- 108010048818 seryl-histidine Proteins 0.000 description 4
- FGMPLJWBKKVCDB-UHFFFAOYSA-N trans-L-hydroxy-proline Natural products ON1CCCC1C(O)=O FGMPLJWBKKVCDB-UHFFFAOYSA-N 0.000 description 4
- 238000013518 transcription Methods 0.000 description 4
- 230000035897 transcription Effects 0.000 description 4
- 238000011282 treatment Methods 0.000 description 4
- 108010080629 tryptophan-leucine Proteins 0.000 description 4
- 239000013603 viral vector Substances 0.000 description 4
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 4
- 108010020504 2-Oxoglutarate 5-Dioxygenase Procollagen-Lysine Proteins 0.000 description 3
- 102000008490 2-Oxoglutarate 5-Dioxygenase Procollagen-Lysine Human genes 0.000 description 3
- 108020005345 3' Untranslated Regions Proteins 0.000 description 3
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 3
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 3
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 3
- IYMAXBFPHPZYIK-BQBZGAKWSA-N Arg-Gly-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IYMAXBFPHPZYIK-BQBZGAKWSA-N 0.000 description 3
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 3
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 3
- HRCIIMCTUIAKQB-XGEHTFHBSA-N Arg-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O HRCIIMCTUIAKQB-XGEHTFHBSA-N 0.000 description 3
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 3
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 3
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 3
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 3
- HAFCJCDJGIOYPW-WDSKDSINSA-N Asp-Gly-Gln Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O HAFCJCDJGIOYPW-WDSKDSINSA-N 0.000 description 3
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 3
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 3
- 108010059892 Cellulase Proteins 0.000 description 3
- 235000007516 Chrysanthemum Nutrition 0.000 description 3
- 240000005250 Chrysanthemum indicum Species 0.000 description 3
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 3
- LGIKBBLQVSWUGK-DCAQKATOSA-N Gln-Leu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGIKBBLQVSWUGK-DCAQKATOSA-N 0.000 description 3
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 3
- ININBLZFFVOQIO-JHEQGTHGSA-N Gln-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O ININBLZFFVOQIO-JHEQGTHGSA-N 0.000 description 3
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 3
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 3
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 3
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 3
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 3
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 3
- XUORRGAFUQIMLC-STQMWFEESA-N Gly-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN)O XUORRGAFUQIMLC-STQMWFEESA-N 0.000 description 3
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 3
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 3
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 3
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 3
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 3
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 3
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 3
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 3
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 3
- WMGHDYWNHNLGBV-ONGXEEELSA-N Gly-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WMGHDYWNHNLGBV-ONGXEEELSA-N 0.000 description 3
- WDXLKVQATNEAJQ-BQBZGAKWSA-N Gly-Pro-Asp Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WDXLKVQATNEAJQ-BQBZGAKWSA-N 0.000 description 3
- GLACUWHUYFBSPJ-FJXKBIBVSA-N Gly-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GLACUWHUYFBSPJ-FJXKBIBVSA-N 0.000 description 3
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 3
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 3
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 3
- UVTSZKIATYSKIR-RYUDHWBXSA-N Gly-Tyr-Glu Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O UVTSZKIATYSKIR-RYUDHWBXSA-N 0.000 description 3
- 244000299507 Gossypium hirsutum Species 0.000 description 3
- LSQHWKPPOFDHHZ-YUMQZZPRSA-N His-Asp-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N LSQHWKPPOFDHHZ-YUMQZZPRSA-N 0.000 description 3
- PYNPBMCLAKTHJL-SRVKXCTJSA-N His-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O PYNPBMCLAKTHJL-SRVKXCTJSA-N 0.000 description 3
- 101000945357 Homo sapiens Collagen alpha-1(I) chain Proteins 0.000 description 3
- 235000003332 Ilex aquifolium Nutrition 0.000 description 3
- 241000209027 Ilex aquifolium Species 0.000 description 3
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 3
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 3
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 3
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 3
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 3
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 3
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 3
- CUHGAUZONORRIC-HJGDQZAQSA-N Lys-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O CUHGAUZONORRIC-HJGDQZAQSA-N 0.000 description 3
- 239000004472 Lysine Substances 0.000 description 3
- 241000219823 Medicago Species 0.000 description 3
- 108010066427 N-valyltryptophan Proteins 0.000 description 3
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 3
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 3
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 3
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 3
- WIPAMEKBSHNFQE-IUCAKERBSA-N Pro-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@@H]1CCCN1 WIPAMEKBSHNFQE-IUCAKERBSA-N 0.000 description 3
- 108010079005 RDV peptide Proteins 0.000 description 3
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 3
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 3
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 3
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 3
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 3
- 244000098338 Triticum aestivum Species 0.000 description 3
- ARKBYVBCEOWRNR-UBHSHLNASA-N Trp-Ser-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O ARKBYVBCEOWRNR-UBHSHLNASA-N 0.000 description 3
- JWHOIHCOHMZSAR-QWRGUYRKSA-N Tyr-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JWHOIHCOHMZSAR-QWRGUYRKSA-N 0.000 description 3
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 3
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 3
- XEYUMGGWQCIWAR-XVKPBYJWSA-N Val-Gln-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N XEYUMGGWQCIWAR-XVKPBYJWSA-N 0.000 description 3
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 3
- FOADDSDHGRFUOC-DZKIICNBSA-N Val-Glu-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FOADDSDHGRFUOC-DZKIICNBSA-N 0.000 description 3
- 108010070944 alanylhistidine Proteins 0.000 description 3
- 108010087924 alanylproline Proteins 0.000 description 3
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 3
- 230000037319 collagen production Effects 0.000 description 3
- 239000002299 complementary DNA Substances 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 3
- 108010010147 glycylglutamine Proteins 0.000 description 3
- 108010087823 glycyltyrosine Proteins 0.000 description 3
- 230000012010 growth Effects 0.000 description 3
- 108010025306 histidylleucine Proteins 0.000 description 3
- 108010085325 histidylproline Proteins 0.000 description 3
- 108010018006 histidylserine Proteins 0.000 description 3
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 3
- 108010038320 lysylphenylalanine Proteins 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 108010056582 methionylglutamic acid Proteins 0.000 description 3
- 238000010369 molecular cloning Methods 0.000 description 3
- 230000001717 pathogenic effect Effects 0.000 description 3
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 3
- 108010051242 phenylalanylserine Proteins 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 3
- 230000002797 proteolythic effect Effects 0.000 description 3
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 3
- 239000000758 substrate Substances 0.000 description 3
- 230000005945 translocation Effects 0.000 description 3
- 108010051110 tyrosyl-lysine Proteins 0.000 description 3
- 108010020532 tyrosyl-proline Proteins 0.000 description 3
- 108010011876 valyl-glycyl-valyl-alanyl-prolyl-glycine Proteins 0.000 description 3
- RLCSROTYKMPBDL-USJZOSNVSA-N 2-[[(2s)-1-[(2s)-2-[[(2s)-2-[[2-[[(2s)-2-amino-3-methylbutanoyl]amino]acetyl]amino]-3-methylbutanoyl]amino]propanoyl]pyrrolidine-2-carbonyl]amino]acetic acid Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RLCSROTYKMPBDL-USJZOSNVSA-N 0.000 description 2
- 108020003589 5' Untranslated Regions Proteins 0.000 description 2
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 2
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 2
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 2
- KRHRBKYBJXMYBB-WHFBIAKZSA-N Ala-Cys-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O KRHRBKYBJXMYBB-WHFBIAKZSA-N 0.000 description 2
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 2
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 2
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 2
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 2
- BTBUEVAGZCKULD-XPUUQOCRSA-N Ala-Gly-His Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CN=CN1 BTBUEVAGZCKULD-XPUUQOCRSA-N 0.000 description 2
- QHASENCZLDHBGX-ONGXEEELSA-N Ala-Gly-Phe Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QHASENCZLDHBGX-ONGXEEELSA-N 0.000 description 2
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 2
- JDIQCVUDDFENPU-ZKWXMUAHSA-N Ala-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CNC=N1 JDIQCVUDDFENPU-ZKWXMUAHSA-N 0.000 description 2
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 2
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 2
- PIXQDIGKDNNOOV-GUBZILKMSA-N Ala-Lys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O PIXQDIGKDNNOOV-GUBZILKMSA-N 0.000 description 2
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 2
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 2
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 2
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 2
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 2
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 2
- XKXAZPSREVUCRT-BPNCWPANSA-N Ala-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=C(O)C=C1 XKXAZPSREVUCRT-BPNCWPANSA-N 0.000 description 2
- MCYJBCKCAPERSE-FXQIFTODSA-N Arg-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N MCYJBCKCAPERSE-FXQIFTODSA-N 0.000 description 2
- XVLLUZMFSAYKJV-GUBZILKMSA-N Arg-Asp-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XVLLUZMFSAYKJV-GUBZILKMSA-N 0.000 description 2
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 2
- NXDXECQFKHXHAM-HJGDQZAQSA-N Arg-Glu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NXDXECQFKHXHAM-HJGDQZAQSA-N 0.000 description 2
- PNIGSVZJNVUVJA-BQBZGAKWSA-N Arg-Gly-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O PNIGSVZJNVUVJA-BQBZGAKWSA-N 0.000 description 2
- RFXXUWGNVRJTNQ-QXEWZRGKSA-N Arg-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N RFXXUWGNVRJTNQ-QXEWZRGKSA-N 0.000 description 2
- HAVKMRGWNXMCDR-STQMWFEESA-N Arg-Gly-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HAVKMRGWNXMCDR-STQMWFEESA-N 0.000 description 2
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 2
- YQGZIRIYGHNSQO-ZPFDUUQYSA-N Arg-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YQGZIRIYGHNSQO-ZPFDUUQYSA-N 0.000 description 2
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 2
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 2
- NGTYEHIRESTSRX-UWVGGRQHSA-N Arg-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NGTYEHIRESTSRX-UWVGGRQHSA-N 0.000 description 2
- FIQKRDXFTANIEJ-ULQDDVLXSA-N Arg-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FIQKRDXFTANIEJ-ULQDDVLXSA-N 0.000 description 2
- IGFJVXOATGZTHD-UHFFFAOYSA-N Arg-Phe-His Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccccc1)C(=O)NC(Cc2c[nH]cn2)C(=O)O IGFJVXOATGZTHD-UHFFFAOYSA-N 0.000 description 2
- PRLPSDIHSRITSF-UNQGMJICSA-N Arg-Phe-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PRLPSDIHSRITSF-UNQGMJICSA-N 0.000 description 2
- ATABBWFGOHKROJ-GUBZILKMSA-N Arg-Pro-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O ATABBWFGOHKROJ-GUBZILKMSA-N 0.000 description 2
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 2
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 2
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 2
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 2
- IYVSIZAXNLOKFQ-BYULHYEWSA-N Asn-Asp-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IYVSIZAXNLOKFQ-BYULHYEWSA-N 0.000 description 2
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 2
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 2
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 2
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 description 2
- ICDDSTLEMLGSTB-GUBZILKMSA-N Asn-Met-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ICDDSTLEMLGSTB-GUBZILKMSA-N 0.000 description 2
- HZZIFFOVHLWGCS-KKUMJFAQSA-N Asn-Phe-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O HZZIFFOVHLWGCS-KKUMJFAQSA-N 0.000 description 2
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 2
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 2
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 2
- NJSNXIOKBHPFMB-GMOBBJLQSA-N Asn-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)N)N NJSNXIOKBHPFMB-GMOBBJLQSA-N 0.000 description 2
- LGCVSPFCFXWUEY-IHPCNDPISA-N Asn-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N LGCVSPFCFXWUEY-IHPCNDPISA-N 0.000 description 2
- MYRLSKYSMXNLLA-LAEOZQHASA-N Asn-Val-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MYRLSKYSMXNLLA-LAEOZQHASA-N 0.000 description 2
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 2
- ZLGKHJHFYSRUBH-FXQIFTODSA-N Asp-Arg-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLGKHJHFYSRUBH-FXQIFTODSA-N 0.000 description 2
- UQBGYPFHWFZMCD-ZLUOBGJFSA-N Asp-Asn-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UQBGYPFHWFZMCD-ZLUOBGJFSA-N 0.000 description 2
- UGIBTKGQVWFTGX-BIIVOSGPSA-N Asp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O UGIBTKGQVWFTGX-BIIVOSGPSA-N 0.000 description 2
- FRSGNOZCTWDVFZ-ACZMJKKPSA-N Asp-Asp-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRSGNOZCTWDVFZ-ACZMJKKPSA-N 0.000 description 2
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 2
- RATOMFTUDRYMKX-ACZMJKKPSA-N Asp-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N RATOMFTUDRYMKX-ACZMJKKPSA-N 0.000 description 2
- ZEDBMCPXPIYJLW-XHNCKOQMSA-N Asp-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O ZEDBMCPXPIYJLW-XHNCKOQMSA-N 0.000 description 2
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 2
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 2
- ZSVJVIOVABDTTL-YUMQZZPRSA-N Asp-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N ZSVJVIOVABDTTL-YUMQZZPRSA-N 0.000 description 2
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 2
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 2
- CRNKLABLTICXDV-GUBZILKMSA-N Asp-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N CRNKLABLTICXDV-GUBZILKMSA-N 0.000 description 2
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 2
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 2
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 2
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 2
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 2
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 2
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 2
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 2
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 102100021277 Beta-secretase 2 Human genes 0.000 description 2
- 102100037084 C4b-binding protein alpha chain Human genes 0.000 description 2
- YKKHFPGOZXQAGK-QWRGUYRKSA-N Cys-Gly-Tyr Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YKKHFPGOZXQAGK-QWRGUYRKSA-N 0.000 description 2
- GDNWBSFSHJVXKL-GUBZILKMSA-N Cys-Lys-Gln Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O GDNWBSFSHJVXKL-GUBZILKMSA-N 0.000 description 2
- BCFXQBXXDSEHRS-FXQIFTODSA-N Cys-Ser-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BCFXQBXXDSEHRS-FXQIFTODSA-N 0.000 description 2
- 102000010834 Extracellular Matrix Proteins Human genes 0.000 description 2
- 108010037362 Extracellular Matrix Proteins Proteins 0.000 description 2
- 108060003306 Galactosyltransferase Proteins 0.000 description 2
- 102000030902 Galactosyltransferase Human genes 0.000 description 2
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 2
- VSXBYIJUAXPAAL-WDSKDSINSA-N Gln-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O VSXBYIJUAXPAAL-WDSKDSINSA-N 0.000 description 2
- NSNUZSPSADIMJQ-WDSKDSINSA-N Gln-Gly-Asp Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NSNUZSPSADIMJQ-WDSKDSINSA-N 0.000 description 2
- GNMQDOGFWYWPNM-LAEOZQHASA-N Gln-Gly-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)CNC(=O)[C@@H](N)CCC(N)=O)C(O)=O GNMQDOGFWYWPNM-LAEOZQHASA-N 0.000 description 2
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 2
- ORYMMTRPKVTGSJ-XVKPBYJWSA-N Gln-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O ORYMMTRPKVTGSJ-XVKPBYJWSA-N 0.000 description 2
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 2
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 2
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 2
- SRZLHYPAOXBBSB-HJGDQZAQSA-N Glu-Arg-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SRZLHYPAOXBBSB-HJGDQZAQSA-N 0.000 description 2
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 2
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 2
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 2
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 2
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 2
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 2
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 2
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 2
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 2
- JHSRJMUJOGLIHK-GUBZILKMSA-N Glu-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N JHSRJMUJOGLIHK-GUBZILKMSA-N 0.000 description 2
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 2
- BPCLDCNZBUYGOD-BPUTZDHNSA-N Glu-Trp-Glu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 BPCLDCNZBUYGOD-BPUTZDHNSA-N 0.000 description 2
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 2
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 2
- GZUKEVBTYNNUQF-WDSKDSINSA-N Gly-Ala-Gln Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GZUKEVBTYNNUQF-WDSKDSINSA-N 0.000 description 2
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 2
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 2
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 2
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 2
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 2
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 2
- IWAXHBCACVWNHT-BQBZGAKWSA-N Gly-Asp-Arg Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IWAXHBCACVWNHT-BQBZGAKWSA-N 0.000 description 2
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 2
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 2
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 2
- VNBNZUAPOYGRDB-ZDLURKLDSA-N Gly-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)O VNBNZUAPOYGRDB-ZDLURKLDSA-N 0.000 description 2
- DTRUBYPMMVPQPD-YUMQZZPRSA-N Gly-Gln-Arg Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DTRUBYPMMVPQPD-YUMQZZPRSA-N 0.000 description 2
- AQLHORCVPGXDJW-IUCAKERBSA-N Gly-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN AQLHORCVPGXDJW-IUCAKERBSA-N 0.000 description 2
- FIQQRCFQXGLOSZ-WDSKDSINSA-N Gly-Glu-Asp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FIQQRCFQXGLOSZ-WDSKDSINSA-N 0.000 description 2
- HFXJIZNEXNIZIJ-BQBZGAKWSA-N Gly-Glu-Gln Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFXJIZNEXNIZIJ-BQBZGAKWSA-N 0.000 description 2
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 2
- TVDHVLGFJSHPAX-UWVGGRQHSA-N Gly-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 TVDHVLGFJSHPAX-UWVGGRQHSA-N 0.000 description 2
- VAXIVIPMCTYSHI-YUMQZZPRSA-N Gly-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN VAXIVIPMCTYSHI-YUMQZZPRSA-N 0.000 description 2
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 2
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 2
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 2
- LIXWIUAORXJNBH-QWRGUYRKSA-N Gly-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN LIXWIUAORXJNBH-QWRGUYRKSA-N 0.000 description 2
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 2
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 2
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 2
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 2
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 2
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 2
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 2
- ZZJVYSAQQMDIRD-UWVGGRQHSA-N Gly-Pro-His Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O ZZJVYSAQQMDIRD-UWVGGRQHSA-N 0.000 description 2
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 2
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 2
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 2
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 2
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 2
- NGBGZCUWFVVJKC-IRXDYDNUSA-N Gly-Tyr-Tyr Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NGBGZCUWFVVJKC-IRXDYDNUSA-N 0.000 description 2
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 2
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 2
- NGRPGJGKJMUGDM-XVKPBYJWSA-N Gly-Val-Gln Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NGRPGJGKJMUGDM-XVKPBYJWSA-N 0.000 description 2
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 2
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 2
- NOQPTNXSGNPJNS-YUMQZZPRSA-N His-Asn-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O NOQPTNXSGNPJNS-YUMQZZPRSA-N 0.000 description 2
- QSLKWWDKIXMWJV-SRVKXCTJSA-N His-Cys-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N QSLKWWDKIXMWJV-SRVKXCTJSA-N 0.000 description 2
- QCBYAHHNOHBXIH-UWVGGRQHSA-N His-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CN=CN1 QCBYAHHNOHBXIH-UWVGGRQHSA-N 0.000 description 2
- FLXCRBXJRJSDHX-AVGNSLFASA-N His-Pro-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O FLXCRBXJRJSDHX-AVGNSLFASA-N 0.000 description 2
- 101000936403 Homo sapiens A disintegrin and metalloproteinase with thrombospondin motifs 2 Proteins 0.000 description 2
- 101000695352 Homo sapiens Bone morphogenetic protein 1 Proteins 0.000 description 2
- 101000875067 Homo sapiens Collagen alpha-2(I) chain Proteins 0.000 description 2
- 101001072202 Homo sapiens Protein disulfide-isomerase Proteins 0.000 description 2
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 2
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 2
- UBHUJPVCJHPSEU-GRLWGSQLSA-N Ile-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N UBHUJPVCJHPSEU-GRLWGSQLSA-N 0.000 description 2
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 2
- IGJWJGIHUFQANP-LAEOZQHASA-N Ile-Gly-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N IGJWJGIHUFQANP-LAEOZQHASA-N 0.000 description 2
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 2
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 2
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 2
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 2
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 2
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 2
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 2
- DGTOKVBDZXJHNZ-WZLNRYEVSA-N Ile-Thr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N DGTOKVBDZXJHNZ-WZLNRYEVSA-N 0.000 description 2
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysine Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 2
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 2
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 2
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 2
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 2
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 2
- VIWUBXKCYJGNCL-SRVKXCTJSA-N Leu-Asn-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 VIWUBXKCYJGNCL-SRVKXCTJSA-N 0.000 description 2
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 2
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 2
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 2
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 2
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 2
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 2
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 2
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 description 2
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 2
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 2
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 description 2
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 2
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 2
- MVVSHHJKJRZVNY-ACRUOGEOSA-N Leu-Phe-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MVVSHHJKJRZVNY-ACRUOGEOSA-N 0.000 description 2
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 2
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 2
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 2
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 2
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 2
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 2
- IXHKPDJKKCUKHS-GARJFASQSA-N Lys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IXHKPDJKKCUKHS-GARJFASQSA-N 0.000 description 2
- WLCYCADOWRMSAJ-CIUDSAMLSA-N Lys-Asn-Cys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O WLCYCADOWRMSAJ-CIUDSAMLSA-N 0.000 description 2
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 2
- NTBFKPBULZGXQL-KKUMJFAQSA-N Lys-Asp-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTBFKPBULZGXQL-KKUMJFAQSA-N 0.000 description 2
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 2
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 2
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 2
- JQSIGLHQNSZZRL-KKUMJFAQSA-N Lys-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N JQSIGLHQNSZZRL-KKUMJFAQSA-N 0.000 description 2
- BEGQVWUZFXLNHZ-IHPCNDPISA-N Lys-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 BEGQVWUZFXLNHZ-IHPCNDPISA-N 0.000 description 2
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 2
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 2
- XYLSGAWRCZECIQ-JYJNAYRXSA-N Lys-Tyr-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 XYLSGAWRCZECIQ-JYJNAYRXSA-N 0.000 description 2
- CSNNHWWHGAXBCP-UHFFFAOYSA-L Magnesium sulfate Chemical compound [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 2
- KUQWVNFMZLHAPA-CIUDSAMLSA-N Met-Ala-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O KUQWVNFMZLHAPA-CIUDSAMLSA-N 0.000 description 2
- KYXDADPHSNFWQX-VEVYYDQMSA-N Met-Thr-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O KYXDADPHSNFWQX-VEVYYDQMSA-N 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 2
- 108010006519 Molecular Chaperones Proteins 0.000 description 2
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 2
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 2
- 238000000636 Northern blotting Methods 0.000 description 2
- 102000004316 Oxidoreductases Human genes 0.000 description 2
- 108090000854 Oxidoreductases Proteins 0.000 description 2
- 102000035195 Peptidases Human genes 0.000 description 2
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 2
- FGXIJNMDRCZVDE-KKUMJFAQSA-N Phe-Cys-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N FGXIJNMDRCZVDE-KKUMJFAQSA-N 0.000 description 2
- UAMFZRNCIFFMLE-FHWLQOOXSA-N Phe-Glu-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N UAMFZRNCIFFMLE-FHWLQOOXSA-N 0.000 description 2
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 2
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 2
- RSPUIENXSJYZQO-JYJNAYRXSA-N Phe-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RSPUIENXSJYZQO-JYJNAYRXSA-N 0.000 description 2
- AUJWXNGCAQWLEI-KBPBESRZSA-N Phe-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AUJWXNGCAQWLEI-KBPBESRZSA-N 0.000 description 2
- 108020005089 Plant RNA Proteins 0.000 description 2
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 2
- INXAPZFIOVGHSV-CIUDSAMLSA-N Pro-Asn-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 INXAPZFIOVGHSV-CIUDSAMLSA-N 0.000 description 2
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 2
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 2
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 2
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 2
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 description 2
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 2
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 2
- DWPXHLIBFQLKLK-CYDGBPFRSA-N Pro-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 DWPXHLIBFQLKLK-CYDGBPFRSA-N 0.000 description 2
- UCTIUWKCVNGEFH-OBJOEFQTSA-N Pro-Val-Gly-Pro Chemical compound N([C@@H](C(C)C)C(=O)NCC(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 UCTIUWKCVNGEFH-OBJOEFQTSA-N 0.000 description 2
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 description 2
- 101710136733 Proline-rich protein Proteins 0.000 description 2
- 102000006010 Protein Disulfide-Isomerase Human genes 0.000 description 2
- 101150041925 RBCS gene Proteins 0.000 description 2
- 108020004511 Recombinant DNA Proteins 0.000 description 2
- 240000000111 Saccharum officinarum Species 0.000 description 2
- 235000007201 Saccharum officinarum Nutrition 0.000 description 2
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 2
- GWMXFEMMBHOKDX-AVGNSLFASA-N Ser-Gln-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GWMXFEMMBHOKDX-AVGNSLFASA-N 0.000 description 2
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 2
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 2
- CRJZZXMAADSBBQ-SRVKXCTJSA-N Ser-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO CRJZZXMAADSBBQ-SRVKXCTJSA-N 0.000 description 2
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 2
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 2
- 108700005078 Synthetic Genes Proteins 0.000 description 2
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 2
- SWIKDOUVROTZCW-GCJQMDKQSA-N Thr-Asn-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O SWIKDOUVROTZCW-GCJQMDKQSA-N 0.000 description 2
- VXMHQKHDKCATDV-VEVYYDQMSA-N Thr-Asp-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VXMHQKHDKCATDV-VEVYYDQMSA-N 0.000 description 2
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 2
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 2
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 2
- YZUWGFXVVZQJEI-PMVVWTBXSA-N Thr-Gly-His Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O YZUWGFXVVZQJEI-PMVVWTBXSA-N 0.000 description 2
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 2
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 2
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 2
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 description 2
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 2
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 description 2
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 2
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 2
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 2
- NMCBVGFGWSIGSB-NUTKFTJISA-N Trp-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NMCBVGFGWSIGSB-NUTKFTJISA-N 0.000 description 2
- HJTYJQVRIQXMHM-XIRDDKMYSA-N Trp-Asp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N HJTYJQVRIQXMHM-XIRDDKMYSA-N 0.000 description 2
- HLDFBNPSURDYEN-VHWLVUOQSA-N Trp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N HLDFBNPSURDYEN-VHWLVUOQSA-N 0.000 description 2
- RRVUOLRWIZXBRQ-IHPCNDPISA-N Trp-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RRVUOLRWIZXBRQ-IHPCNDPISA-N 0.000 description 2
- GQNCRIFNDVFRNF-BPUTZDHNSA-N Trp-Pro-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O GQNCRIFNDVFRNF-BPUTZDHNSA-N 0.000 description 2
- LGEYOIQBBIPHQN-UWJYBYFXSA-N Tyr-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LGEYOIQBBIPHQN-UWJYBYFXSA-N 0.000 description 2
- MICSYKFECRFCTJ-IHRRRGAJSA-N Tyr-Arg-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O MICSYKFECRFCTJ-IHRRRGAJSA-N 0.000 description 2
- WPVGRKLNHJJCEN-BZSNNMDCSA-N Tyr-Asp-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WPVGRKLNHJJCEN-BZSNNMDCSA-N 0.000 description 2
- NMKJPMCEKQHRPD-IRXDYDNUSA-N Tyr-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NMKJPMCEKQHRPD-IRXDYDNUSA-N 0.000 description 2
- SINRIKQYQJRGDQ-MEYUZBJRSA-N Tyr-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SINRIKQYQJRGDQ-MEYUZBJRSA-N 0.000 description 2
- LABUITCFCAABSV-BPNCWPANSA-N Val-Ala-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LABUITCFCAABSV-BPNCWPANSA-N 0.000 description 2
- LABUITCFCAABSV-UHFFFAOYSA-N Val-Ala-Tyr Natural products CC(C)C(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LABUITCFCAABSV-UHFFFAOYSA-N 0.000 description 2
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 2
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 2
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 2
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 2
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 2
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 2
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 2
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 2
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 2
- 108020005202 Viral DNA Proteins 0.000 description 2
- 108010066875 alanyl-prolyl-tryptophyl-cysteine Proteins 0.000 description 2
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 2
- 210000004102 animal cell Anatomy 0.000 description 2
- 108010013835 arginine glutamate Proteins 0.000 description 2
- 108010008355 arginyl-glutamine Proteins 0.000 description 2
- 108010086780 arginyl-glycyl-aspartyl-alanine Proteins 0.000 description 2
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 2
- 108010084758 arginyl-tyrosyl-aspartic acid Proteins 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 230000036760 body temperature Effects 0.000 description 2
- 235000014633 carbohydrates Nutrition 0.000 description 2
- 150000001720 carbohydrates Chemical class 0.000 description 2
- 238000004113 cell culture Methods 0.000 description 2
- 230000004186 co-expression Effects 0.000 description 2
- 210000000805 cytoplasm Anatomy 0.000 description 2
- 230000006743 cytoplasmic accumulation Effects 0.000 description 2
- 238000004925 denaturation Methods 0.000 description 2
- 230000036425 denaturation Effects 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 2
- 210000002744 extracellular matrix Anatomy 0.000 description 2
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 2
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 2
- 108010079547 glutamylmethionine Proteins 0.000 description 2
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 2
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 2
- 108010084264 glycyl-glycyl-cysteine Proteins 0.000 description 2
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 2
- 238000010438 heat treatment Methods 0.000 description 2
- 108010045383 histidyl-glycyl-glutamic acid Proteins 0.000 description 2
- 102000045875 human BMP1 Human genes 0.000 description 2
- 102000053643 human P4HB Human genes 0.000 description 2
- 102000015090 human lysyl hydroxylase 3 Human genes 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 230000000977 initiatory effect Effects 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 2
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 2
- 108010000761 leucylarginine Proteins 0.000 description 2
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 2
- 238000007403 mPCR Methods 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 238000000520 microinjection Methods 0.000 description 2
- 238000013508 migration Methods 0.000 description 2
- 230000005012 migration Effects 0.000 description 2
- 210000003470 mitochondria Anatomy 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 230000031787 nutrient reservoir activity Effects 0.000 description 2
- 230000001590 oxidative effect Effects 0.000 description 2
- 230000001323 posttranslational effect Effects 0.000 description 2
- 235000012015 potatoes Nutrition 0.000 description 2
- 108010007513 prolyl-glycyl-prolyl-leucine Proteins 0.000 description 2
- 108020003519 protein disulfide isomerase Proteins 0.000 description 2
- 230000017854 proteolysis Effects 0.000 description 2
- 210000001938 protoplast Anatomy 0.000 description 2
- 230000010076 replication Effects 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- YGSDEFSMJLZEOE-UHFFFAOYSA-N salicylic acid Chemical compound OC(=O)C1=CC=CC=C1O YGSDEFSMJLZEOE-UHFFFAOYSA-N 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 230000028327 secretion Effects 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 2
- 108010003137 tyrosyltyrosine Proteins 0.000 description 2
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 2
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 1
- CEHZCZCQHUNAJF-AVGNSLFASA-N (2s)-1-[2-[[(2s)-1-[(2s)-2-amino-5-(diaminomethylideneamino)pentanoyl]pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N1[C@H](C(O)=O)CCC1 CEHZCZCQHUNAJF-AVGNSLFASA-N 0.000 description 1
- IYLGMFKRTLBESI-ATIWLJMLSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-5-(diaminomethylideneamino)pentanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]propanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O IYLGMFKRTLBESI-ATIWLJMLSA-N 0.000 description 1
- JIDDDPVQQUHACU-YFKPBYRVSA-N (2s)-pyrrolidine-2-carbaldehyde Chemical group O=C[C@@H]1CCCN1 JIDDDPVQQUHACU-YFKPBYRVSA-N 0.000 description 1
- AEGSIYIIMVBZQU-CIUDSAMLSA-N (3s)-3-[[2-[[(2s)-2-amino-5-(diaminomethylideneamino)pentanoyl]amino]acetyl]amino]-4-[[(1r)-1-carboxy-2-sulfanylethyl]amino]-4-oxobutanoic acid Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O AEGSIYIIMVBZQU-CIUDSAMLSA-N 0.000 description 1
- HZKLCOYAVAAQRD-VGMNWLOBSA-N (3s)-3-[[2-[[(2s)-2-amino-5-(diaminomethylideneamino)pentanoyl]amino]acetyl]amino]-4-[[(1r)-1-carboxyethyl]amino]-4-oxobutanoic acid Chemical compound OC(=O)[C@@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N HZKLCOYAVAAQRD-VGMNWLOBSA-N 0.000 description 1
- CUVSTAMIHSSVKL-UWVGGRQHSA-N (4s)-4-[(2-aminoacetyl)amino]-5-[[(2s)-6-amino-1-(carboxymethylamino)-1-oxohexan-2-yl]amino]-5-oxopentanoic acid Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN CUVSTAMIHSSVKL-UWVGGRQHSA-N 0.000 description 1
- OZRFYUJEXYKQDV-UHFFFAOYSA-N 2-[[2-[[2-[(2-amino-3-carboxypropanoyl)amino]-3-carboxypropanoyl]amino]-3-carboxypropanoyl]amino]butanedioic acid Chemical compound OC(=O)CC(N)C(=O)NC(CC(O)=O)C(=O)NC(CC(O)=O)C(=O)NC(CC(O)=O)C(O)=O OZRFYUJEXYKQDV-UHFFFAOYSA-N 0.000 description 1
- GOJUJUVQIVIZAV-UHFFFAOYSA-N 2-amino-4,6-dichloropyrimidine-5-carbaldehyde Chemical group NC1=NC(Cl)=C(C=O)C(Cl)=N1 GOJUJUVQIVIZAV-UHFFFAOYSA-N 0.000 description 1
- 101150074148 AT2S1 gene Proteins 0.000 description 1
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 1
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 1
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 1
- SSSROGPPPVTHLX-FXQIFTODSA-N Ala-Arg-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSROGPPPVTHLX-FXQIFTODSA-N 0.000 description 1
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 1
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 1
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 1
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 description 1
- MCKSLROAGSDNFC-ACZMJKKPSA-N Ala-Asp-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MCKSLROAGSDNFC-ACZMJKKPSA-N 0.000 description 1
- LZRNYBIJOSKKRJ-XVYDVKMFSA-N Ala-Asp-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LZRNYBIJOSKKRJ-XVYDVKMFSA-N 0.000 description 1
- VIGKUFXFTPWYER-BIIVOSGPSA-N Ala-Cys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N VIGKUFXFTPWYER-BIIVOSGPSA-N 0.000 description 1
- AWAXZRDKUHOPBO-GUBZILKMSA-N Ala-Gln-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O AWAXZRDKUHOPBO-GUBZILKMSA-N 0.000 description 1
- CZPAHAKGPDUIPJ-CIUDSAMLSA-N Ala-Gln-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CZPAHAKGPDUIPJ-CIUDSAMLSA-N 0.000 description 1
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 1
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 1
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 1
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 1
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 1
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 1
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 1
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 1
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- FDAZDMAFZYTHGS-XVYDVKMFSA-N Ala-His-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O FDAZDMAFZYTHGS-XVYDVKMFSA-N 0.000 description 1
- AAXVGJXZKHQQHD-LSJOCFKGSA-N Ala-His-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCSC)C(=O)O)N AAXVGJXZKHQQHD-LSJOCFKGSA-N 0.000 description 1
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 1
- FOHXUHGZZKETFI-JBDRJPRFSA-N Ala-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N FOHXUHGZZKETFI-JBDRJPRFSA-N 0.000 description 1
- GSHKMNKPMLXSQW-KBIXCLLPSA-N Ala-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C)N GSHKMNKPMLXSQW-KBIXCLLPSA-N 0.000 description 1
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 1
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 1
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 1
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 1
- CHFFHQUVXHEGBY-GARJFASQSA-N Ala-Lys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CHFFHQUVXHEGBY-GARJFASQSA-N 0.000 description 1
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- VHEVVUZDDUCAKU-FXQIFTODSA-N Ala-Met-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O VHEVVUZDDUCAKU-FXQIFTODSA-N 0.000 description 1
- OMDNCNKNEGFOMM-BQBZGAKWSA-N Ala-Met-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O OMDNCNKNEGFOMM-BQBZGAKWSA-N 0.000 description 1
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 1
- HYIDEIQUCBKIPL-CQDKDKBSSA-N Ala-Phe-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N HYIDEIQUCBKIPL-CQDKDKBSSA-N 0.000 description 1
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 1
- IHMCQESUJVZTKW-UBHSHLNASA-N Ala-Phe-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 IHMCQESUJVZTKW-UBHSHLNASA-N 0.000 description 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 1
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 1
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 1
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 1
- HCBKAOZYACJUEF-XQXXSGGOSA-N Ala-Thr-Gln Chemical compound N[C@@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(N)=O)C(=O)O HCBKAOZYACJUEF-XQXXSGGOSA-N 0.000 description 1
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 1
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- LTTLSZVJTDSACD-OWLDWWDNSA-N Ala-Thr-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O LTTLSZVJTDSACD-OWLDWWDNSA-N 0.000 description 1
- YXXPVUOMPSZURS-ZLIFDBKOSA-N Ala-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 YXXPVUOMPSZURS-ZLIFDBKOSA-N 0.000 description 1
- TVUFMYKTYXTRPY-HERUPUMHSA-N Ala-Trp-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O TVUFMYKTYXTRPY-HERUPUMHSA-N 0.000 description 1
- YCTIYBUTCKNOTI-UWJYBYFXSA-N Ala-Tyr-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCTIYBUTCKNOTI-UWJYBYFXSA-N 0.000 description 1
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 1
- YEBZNKPPOHFZJM-BPNCWPANSA-N Ala-Tyr-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O YEBZNKPPOHFZJM-BPNCWPANSA-N 0.000 description 1
- OIRCZHKOHJUHAC-SIUGBPQLSA-N Ala-Val-Asp-Tyr Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OIRCZHKOHJUHAC-SIUGBPQLSA-N 0.000 description 1
- LYILPUNCKACNGF-NAKRPEOUSA-N Ala-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)N LYILPUNCKACNGF-NAKRPEOUSA-N 0.000 description 1
- DHONNEYAZPNGSG-UBHSHLNASA-N Ala-Val-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DHONNEYAZPNGSG-UBHSHLNASA-N 0.000 description 1
- 101100215339 Arabidopsis thaliana ACT11 gene Proteins 0.000 description 1
- 101100434207 Arabidopsis thaliana ACT8 gene Proteins 0.000 description 1
- 101000717417 Arabidopsis thaliana Cysteine proteinase RD21A Proteins 0.000 description 1
- 101100036901 Arabidopsis thaliana RPL40B gene Proteins 0.000 description 1
- DFCIPNHFKOQAME-FXQIFTODSA-N Arg-Ala-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFCIPNHFKOQAME-FXQIFTODSA-N 0.000 description 1
- HULHGJZIZXCPLD-FXQIFTODSA-N Arg-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N HULHGJZIZXCPLD-FXQIFTODSA-N 0.000 description 1
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 1
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- QEKBCDODJBBWHV-GUBZILKMSA-N Arg-Arg-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O QEKBCDODJBBWHV-GUBZILKMSA-N 0.000 description 1
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 1
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 1
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 description 1
- QPOARHANPULOTM-GMOBBJLQSA-N Arg-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N QPOARHANPULOTM-GMOBBJLQSA-N 0.000 description 1
- IIABBYGHLYWVOS-FXQIFTODSA-N Arg-Asn-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O IIABBYGHLYWVOS-FXQIFTODSA-N 0.000 description 1
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 1
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 1
- XTGGTAWGUFXJSV-NAKRPEOUSA-N Arg-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N XTGGTAWGUFXJSV-NAKRPEOUSA-N 0.000 description 1
- PTVGLOCPAVYPFG-CIUDSAMLSA-N Arg-Gln-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PTVGLOCPAVYPFG-CIUDSAMLSA-N 0.000 description 1
- BGDILZXXDJCKPF-CIUDSAMLSA-N Arg-Gln-Cys Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(O)=O BGDILZXXDJCKPF-CIUDSAMLSA-N 0.000 description 1
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 1
- OBFTYSPXDRROQO-SRVKXCTJSA-N Arg-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCN=C(N)N OBFTYSPXDRROQO-SRVKXCTJSA-N 0.000 description 1
- MTANSHNQTWPZKP-KKUMJFAQSA-N Arg-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O MTANSHNQTWPZKP-KKUMJFAQSA-N 0.000 description 1
- RKRSYHCNPFGMTA-CIUDSAMLSA-N Arg-Glu-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O RKRSYHCNPFGMTA-CIUDSAMLSA-N 0.000 description 1
- XLWSGICNBZGYTA-CIUDSAMLSA-N Arg-Glu-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XLWSGICNBZGYTA-CIUDSAMLSA-N 0.000 description 1
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 1
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 1
- YNSGXDWWPCGGQS-YUMQZZPRSA-N Arg-Gly-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O YNSGXDWWPCGGQS-YUMQZZPRSA-N 0.000 description 1
- QKSAZKCRVQYYGS-UWVGGRQHSA-N Arg-Gly-His Chemical compound N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O QKSAZKCRVQYYGS-UWVGGRQHSA-N 0.000 description 1
- ZZZWQALDSQQBEW-STQMWFEESA-N Arg-Gly-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZZZWQALDSQQBEW-STQMWFEESA-N 0.000 description 1
- SLNCSSWAIDUUGF-LSJOCFKGSA-N Arg-His-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O SLNCSSWAIDUUGF-LSJOCFKGSA-N 0.000 description 1
- NVCIXQYNWYTLDO-IHRRRGAJSA-N Arg-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N NVCIXQYNWYTLDO-IHRRRGAJSA-N 0.000 description 1
- PCQXGEUALSFGIA-WDSOQIARSA-N Arg-His-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O PCQXGEUALSFGIA-WDSOQIARSA-N 0.000 description 1
- DGFXIWKPTDKBLF-AVGNSLFASA-N Arg-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N DGFXIWKPTDKBLF-AVGNSLFASA-N 0.000 description 1
- FRMQITGHXMUNDF-GMOBBJLQSA-N Arg-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FRMQITGHXMUNDF-GMOBBJLQSA-N 0.000 description 1
- HCIUUZGFTDTEGM-NAKRPEOUSA-N Arg-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N HCIUUZGFTDTEGM-NAKRPEOUSA-N 0.000 description 1
- FFEUXEAKYRCACT-PEDHHIEDSA-N Arg-Ile-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)CC)C(O)=O FFEUXEAKYRCACT-PEDHHIEDSA-N 0.000 description 1
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 1
- NIUDXSFNLBIWOB-DCAQKATOSA-N Arg-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NIUDXSFNLBIWOB-DCAQKATOSA-N 0.000 description 1
- DNUKXVMPARLPFN-XUXIUFHCSA-N Arg-Leu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DNUKXVMPARLPFN-XUXIUFHCSA-N 0.000 description 1
- JEXPNDORFYHJTM-IHRRRGAJSA-N Arg-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCN=C(N)N JEXPNDORFYHJTM-IHRRRGAJSA-N 0.000 description 1
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 1
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 1
- OGSQONVYSTZIJB-WDSOQIARSA-N Arg-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O OGSQONVYSTZIJB-WDSOQIARSA-N 0.000 description 1
- SSZGOKWBHLOCHK-DCAQKATOSA-N Arg-Lys-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N SSZGOKWBHLOCHK-DCAQKATOSA-N 0.000 description 1
- BTJVOUQWFXABOI-IHRRRGAJSA-N Arg-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCNC(N)=N BTJVOUQWFXABOI-IHRRRGAJSA-N 0.000 description 1
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 1
- VIINVRPKMUZYOI-DCAQKATOSA-N Arg-Met-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VIINVRPKMUZYOI-DCAQKATOSA-N 0.000 description 1
- ZEBDYGZVMMKZNB-SRVKXCTJSA-N Arg-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N ZEBDYGZVMMKZNB-SRVKXCTJSA-N 0.000 description 1
- FKQITMVNILRUCQ-IHRRRGAJSA-N Arg-Phe-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O FKQITMVNILRUCQ-IHRRRGAJSA-N 0.000 description 1
- BSGSDLYGGHGMND-IHRRRGAJSA-N Arg-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N BSGSDLYGGHGMND-IHRRRGAJSA-N 0.000 description 1
- UGZUVYDKAYNCII-ULQDDVLXSA-N Arg-Phe-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UGZUVYDKAYNCII-ULQDDVLXSA-N 0.000 description 1
- UIUXXFIKWQVMEX-UFYCRDLUSA-N Arg-Phe-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UIUXXFIKWQVMEX-UFYCRDLUSA-N 0.000 description 1
- FOQFHANLUJDQEE-GUBZILKMSA-N Arg-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CS)C(=O)O FOQFHANLUJDQEE-GUBZILKMSA-N 0.000 description 1
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 1
- HGKHPCFTRQDHCU-IUCAKERBSA-N Arg-Pro-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HGKHPCFTRQDHCU-IUCAKERBSA-N 0.000 description 1
- AUIJUTGLPVHIRT-FXQIFTODSA-N Arg-Ser-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N AUIJUTGLPVHIRT-FXQIFTODSA-N 0.000 description 1
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 1
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 1
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 description 1
- QMQZYILAWUOLPV-JYJNAYRXSA-N Arg-Tyr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)CC1=CC=C(O)C=C1 QMQZYILAWUOLPV-JYJNAYRXSA-N 0.000 description 1
- AOJYORNRFWWEIV-IHRRRGAJSA-N Arg-Tyr-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 AOJYORNRFWWEIV-IHRRRGAJSA-N 0.000 description 1
- BFDDUDQCPJWQRQ-IHRRRGAJSA-N Arg-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O BFDDUDQCPJWQRQ-IHRRRGAJSA-N 0.000 description 1
- XRLOBFSLPCHYLQ-ULQDDVLXSA-N Arg-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O XRLOBFSLPCHYLQ-ULQDDVLXSA-N 0.000 description 1
- CGWVCWFQGXOUSJ-ULQDDVLXSA-N Arg-Tyr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O CGWVCWFQGXOUSJ-ULQDDVLXSA-N 0.000 description 1
- IZSMEUDYADKZTJ-KJEVXHAQSA-N Arg-Tyr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IZSMEUDYADKZTJ-KJEVXHAQSA-N 0.000 description 1
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 1
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 1
- SWLOHUMCUDRTCL-ZLUOBGJFSA-N Asn-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N SWLOHUMCUDRTCL-ZLUOBGJFSA-N 0.000 description 1
- QQEWINYJRFBLNN-DLOVCJGASA-N Asn-Ala-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QQEWINYJRFBLNN-DLOVCJGASA-N 0.000 description 1
- XHFXZQHTLJVZBN-FXQIFTODSA-N Asn-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N XHFXZQHTLJVZBN-FXQIFTODSA-N 0.000 description 1
- MEFGKQUUYZOLHM-GMOBBJLQSA-N Asn-Arg-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MEFGKQUUYZOLHM-GMOBBJLQSA-N 0.000 description 1
- DQTIWTULBGLJBL-DCAQKATOSA-N Asn-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N DQTIWTULBGLJBL-DCAQKATOSA-N 0.000 description 1
- RCENDENBBJFJHZ-ACZMJKKPSA-N Asn-Asn-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCENDENBBJFJHZ-ACZMJKKPSA-N 0.000 description 1
- NVGWESORMHFISY-SRVKXCTJSA-N Asn-Asn-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NVGWESORMHFISY-SRVKXCTJSA-N 0.000 description 1
- KXEGPPNPXOKKHK-ZLUOBGJFSA-N Asn-Asp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KXEGPPNPXOKKHK-ZLUOBGJFSA-N 0.000 description 1
- ZWASIOHRQWRWAS-UGYAYLCHSA-N Asn-Asp-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZWASIOHRQWRWAS-UGYAYLCHSA-N 0.000 description 1
- ZMWDUIIACVLIHK-GHCJXIJMSA-N Asn-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N ZMWDUIIACVLIHK-GHCJXIJMSA-N 0.000 description 1
- VJTWLBMESLDOMK-WDSKDSINSA-N Asn-Gln-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VJTWLBMESLDOMK-WDSKDSINSA-N 0.000 description 1
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 1
- GYOHQKJEQQJBOY-QEJZJMRPSA-N Asn-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N GYOHQKJEQQJBOY-QEJZJMRPSA-N 0.000 description 1
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 1
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 1
- JQSWHKKUZMTOIH-QWRGUYRKSA-N Asn-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N JQSWHKKUZMTOIH-QWRGUYRKSA-N 0.000 description 1
- GJFYPBDMUGGLFR-NKWVEPMBSA-N Asn-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC(=O)N)N)C(=O)O GJFYPBDMUGGLFR-NKWVEPMBSA-N 0.000 description 1
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 1
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 1
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 1
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 1
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 1
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 1
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 1
- TZFQICWZWFNIKU-KKUMJFAQSA-N Asn-Leu-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 TZFQICWZWFNIKU-KKUMJFAQSA-N 0.000 description 1
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 1
- GIQCDTKOIPUDSG-GARJFASQSA-N Asn-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N)C(=O)O GIQCDTKOIPUDSG-GARJFASQSA-N 0.000 description 1
- COWITDLVHMZSIW-CIUDSAMLSA-N Asn-Lys-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O COWITDLVHMZSIW-CIUDSAMLSA-N 0.000 description 1
- XFJKRRCWLTZIQA-XIRDDKMYSA-N Asn-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N XFJKRRCWLTZIQA-XIRDDKMYSA-N 0.000 description 1
- XMHFCUKJRCQXGI-CIUDSAMLSA-N Asn-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O XMHFCUKJRCQXGI-CIUDSAMLSA-N 0.000 description 1
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 1
- AWXDRZJQCVHCIT-DCAQKATOSA-N Asn-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O AWXDRZJQCVHCIT-DCAQKATOSA-N 0.000 description 1
- SUIJFTJDTJKSRK-IHRRRGAJSA-N Asn-Pro-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUIJFTJDTJKSRK-IHRRRGAJSA-N 0.000 description 1
- OOXUBGLNDRGOKT-FXQIFTODSA-N Asn-Ser-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OOXUBGLNDRGOKT-FXQIFTODSA-N 0.000 description 1
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 1
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 description 1
- ZUFPUBYQYWCMDB-NUMRIWBASA-N Asn-Thr-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZUFPUBYQYWCMDB-NUMRIWBASA-N 0.000 description 1
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 1
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 1
- YQPSDMUGFKJZHR-QRTARXTBSA-N Asn-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)N)N YQPSDMUGFKJZHR-QRTARXTBSA-N 0.000 description 1
- YSYTWUMRHSFODC-QWRGUYRKSA-N Asn-Tyr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O YSYTWUMRHSFODC-QWRGUYRKSA-N 0.000 description 1
- DATSKXOXPUAOLK-KKUMJFAQSA-N Asn-Tyr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DATSKXOXPUAOLK-KKUMJFAQSA-N 0.000 description 1
- JZLFYAAGGYMRIK-BYULHYEWSA-N Asn-Val-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O JZLFYAAGGYMRIK-BYULHYEWSA-N 0.000 description 1
- KBQOUDLMWYWXNP-YDHLFZDLSA-N Asn-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KBQOUDLMWYWXNP-YDHLFZDLSA-N 0.000 description 1
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 1
- KDFQZBWWPYQBEN-ZLUOBGJFSA-N Asp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N KDFQZBWWPYQBEN-ZLUOBGJFSA-N 0.000 description 1
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 1
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 1
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 1
- NECWUSYTYSIFNC-DLOVCJGASA-N Asp-Ala-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 NECWUSYTYSIFNC-DLOVCJGASA-N 0.000 description 1
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 1
- QHAJMRDEWNAIBQ-FXQIFTODSA-N Asp-Arg-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O QHAJMRDEWNAIBQ-FXQIFTODSA-N 0.000 description 1
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 1
- DBWYWXNMZZYIRY-LPEHRKFASA-N Asp-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O DBWYWXNMZZYIRY-LPEHRKFASA-N 0.000 description 1
- NYLBGYLHBDFRHL-VEVYYDQMSA-N Asp-Arg-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NYLBGYLHBDFRHL-VEVYYDQMSA-N 0.000 description 1
- ATYWBXGNXZYZGI-ACZMJKKPSA-N Asp-Asn-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ATYWBXGNXZYZGI-ACZMJKKPSA-N 0.000 description 1
- JDHOJQJMWBKHDB-CIUDSAMLSA-N Asp-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N JDHOJQJMWBKHDB-CIUDSAMLSA-N 0.000 description 1
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 1
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 1
- BFOYULZBKYOKAN-OLHMAJIHSA-N Asp-Asp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFOYULZBKYOKAN-OLHMAJIHSA-N 0.000 description 1
- VZNOVQKGJQJOCS-SRVKXCTJSA-N Asp-Asp-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VZNOVQKGJQJOCS-SRVKXCTJSA-N 0.000 description 1
- FTNVLGCFIJEMQT-CIUDSAMLSA-N Asp-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N FTNVLGCFIJEMQT-CIUDSAMLSA-N 0.000 description 1
- ACEDJCOOPZFUBU-CIUDSAMLSA-N Asp-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N ACEDJCOOPZFUBU-CIUDSAMLSA-N 0.000 description 1
- WEDGJJRCJNHYSF-SRVKXCTJSA-N Asp-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N WEDGJJRCJNHYSF-SRVKXCTJSA-N 0.000 description 1
- KVPHTGVUMJGMCX-BIIVOSGPSA-N Asp-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N)C(=O)O KVPHTGVUMJGMCX-BIIVOSGPSA-N 0.000 description 1
- LJRPYAZQQWHEEV-FXQIFTODSA-N Asp-Gln-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O LJRPYAZQQWHEEV-FXQIFTODSA-N 0.000 description 1
- RYKWOUUZJFSJOH-FXQIFTODSA-N Asp-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N RYKWOUUZJFSJOH-FXQIFTODSA-N 0.000 description 1
- VHQOCWWKXIOAQI-WDSKDSINSA-N Asp-Gln-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VHQOCWWKXIOAQI-WDSKDSINSA-N 0.000 description 1
- SNAWMGHSCHKSDK-GUBZILKMSA-N Asp-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N SNAWMGHSCHKSDK-GUBZILKMSA-N 0.000 description 1
- DXQOQMCLWWADMU-ACZMJKKPSA-N Asp-Gln-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DXQOQMCLWWADMU-ACZMJKKPSA-N 0.000 description 1
- JRBVWZLHBGYZNY-QEJZJMRPSA-N Asp-Gln-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRBVWZLHBGYZNY-QEJZJMRPSA-N 0.000 description 1
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 1
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 1
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 1
- LTXGDRFJRZSZAV-CIUDSAMLSA-N Asp-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N LTXGDRFJRZSZAV-CIUDSAMLSA-N 0.000 description 1
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 1
- XDGBFDYXZCMYEX-NUMRIWBASA-N Asp-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)O XDGBFDYXZCMYEX-NUMRIWBASA-N 0.000 description 1
- RRKCPMGSRIDLNC-AVGNSLFASA-N Asp-Glu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RRKCPMGSRIDLNC-AVGNSLFASA-N 0.000 description 1
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 1
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 1
- BIVYLQMZPHDUIH-WHFBIAKZSA-N Asp-Gly-Cys Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)C(=O)O BIVYLQMZPHDUIH-WHFBIAKZSA-N 0.000 description 1
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 1
- PSLSTUMPZILTAH-BYULHYEWSA-N Asp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PSLSTUMPZILTAH-BYULHYEWSA-N 0.000 description 1
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 1
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 1
- RWHHSFSWKFBTCF-KKUMJFAQSA-N Asp-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N RWHHSFSWKFBTCF-KKUMJFAQSA-N 0.000 description 1
- RKNIUWSZIAUEPK-PBCZWWQYSA-N Asp-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N)O RKNIUWSZIAUEPK-PBCZWWQYSA-N 0.000 description 1
- SEMWSADZTMJELF-BYULHYEWSA-N Asp-Ile-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O SEMWSADZTMJELF-BYULHYEWSA-N 0.000 description 1
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 1
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 1
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 1
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 1
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 1
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 1
- UZFHNLYQWMGUHU-DCAQKATOSA-N Asp-Lys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZFHNLYQWMGUHU-DCAQKATOSA-N 0.000 description 1
- YVHGKXAOSVBGJV-CIUDSAMLSA-N Asp-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N YVHGKXAOSVBGJV-CIUDSAMLSA-N 0.000 description 1
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 1
- YWLDTBBUHZJQHW-KKUMJFAQSA-N Asp-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N YWLDTBBUHZJQHW-KKUMJFAQSA-N 0.000 description 1
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 1
- SARSTIZOZFBDOM-FXQIFTODSA-N Asp-Met-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O SARSTIZOZFBDOM-FXQIFTODSA-N 0.000 description 1
- WWOYXVBGHAHQBG-FXQIFTODSA-N Asp-Met-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O WWOYXVBGHAHQBG-FXQIFTODSA-N 0.000 description 1
- XFQOQUWGVCVYON-DCAQKATOSA-N Asp-Met-His Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 XFQOQUWGVCVYON-DCAQKATOSA-N 0.000 description 1
- HXVILZUZXFLVEN-DCAQKATOSA-N Asp-Met-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O HXVILZUZXFLVEN-DCAQKATOSA-N 0.000 description 1
- IOXWDLNHXZOXQP-FXQIFTODSA-N Asp-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N IOXWDLNHXZOXQP-FXQIFTODSA-N 0.000 description 1
- LIJXJYGRSRWLCJ-IHRRRGAJSA-N Asp-Phe-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LIJXJYGRSRWLCJ-IHRRRGAJSA-N 0.000 description 1
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 1
- PCJOFZYFFMBZKC-PCBIJLKTSA-N Asp-Phe-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PCJOFZYFFMBZKC-PCBIJLKTSA-N 0.000 description 1
- PWAIZUBWHRHYKS-MELADBBJSA-N Asp-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)O)N)C(=O)O PWAIZUBWHRHYKS-MELADBBJSA-N 0.000 description 1
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 1
- NONWUQAWAANERO-BZSNNMDCSA-N Asp-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 NONWUQAWAANERO-BZSNNMDCSA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 1
- HICVMZCGVFKTPM-BQBZGAKWSA-N Asp-Pro-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HICVMZCGVFKTPM-BQBZGAKWSA-N 0.000 description 1
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 1
- FOXXZZGDIAQPQI-XKNYDFJKSA-N Asp-Pro-Ser-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FOXXZZGDIAQPQI-XKNYDFJKSA-N 0.000 description 1
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 1
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 1
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 1
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 1
- XYPJXLLXNSAWHZ-SRVKXCTJSA-N Asp-Ser-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XYPJXLLXNSAWHZ-SRVKXCTJSA-N 0.000 description 1
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 1
- IWLZBRTUIVXZJD-OLHMAJIHSA-N Asp-Thr-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O IWLZBRTUIVXZJD-OLHMAJIHSA-N 0.000 description 1
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 1
- ZVYYMCXVPZEAPU-CWRNSKLLSA-N Asp-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC(=O)O)N)C(=O)O ZVYYMCXVPZEAPU-CWRNSKLLSA-N 0.000 description 1
- HCOQNGIHSXICCB-IHRRRGAJSA-N Asp-Tyr-Arg Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)O HCOQNGIHSXICCB-IHRRRGAJSA-N 0.000 description 1
- KNDCWFXCFKSEBM-AVGNSLFASA-N Asp-Tyr-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O KNDCWFXCFKSEBM-AVGNSLFASA-N 0.000 description 1
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 1
- WOKXEQLPBLLWHC-IHRRRGAJSA-N Asp-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 WOKXEQLPBLLWHC-IHRRRGAJSA-N 0.000 description 1
- ALMIMUZAWTUNIO-BZSNNMDCSA-N Asp-Tyr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ALMIMUZAWTUNIO-BZSNNMDCSA-N 0.000 description 1
- WAEDSQFVZJUHLI-BYULHYEWSA-N Asp-Val-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WAEDSQFVZJUHLI-BYULHYEWSA-N 0.000 description 1
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 1
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 241000701513 Badnavirus Species 0.000 description 1
- 241000702286 Bean golden mosaic virus Species 0.000 description 1
- 101710150190 Beta-secretase 2 Proteins 0.000 description 1
- 239000002028 Biomass Substances 0.000 description 1
- 108090000654 Bone morphogenetic protein 1 Proteins 0.000 description 1
- 102100028728 Bone morphogenetic protein 1 Human genes 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 240000002791 Brassica napus Species 0.000 description 1
- 235000011293 Brassica napus Nutrition 0.000 description 1
- 102100031102 C-C motif chemokine 4 Human genes 0.000 description 1
- 102100028668 C-type lectin domain family 4 member C Human genes 0.000 description 1
- 101100290380 Caenorhabditis elegans cel-1 gene Proteins 0.000 description 1
- 241001515826 Cassava vein mosaic virus Species 0.000 description 1
- 108010049994 Chloroplast Proteins Proteins 0.000 description 1
- 102100033601 Collagen alpha-1(I) chain Human genes 0.000 description 1
- 101710126238 Collagen alpha-2(I) chain Proteins 0.000 description 1
- 102100036213 Collagen alpha-2(I) chain Human genes 0.000 description 1
- 235000019750 Crude protein Nutrition 0.000 description 1
- KKZHXOOZHFABQQ-UWJYBYFXSA-N Cys-Ala-Tyr Chemical compound SC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KKZHXOOZHFABQQ-UWJYBYFXSA-N 0.000 description 1
- GMXSSZUVDNPRMA-FXQIFTODSA-N Cys-Arg-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GMXSSZUVDNPRMA-FXQIFTODSA-N 0.000 description 1
- BUIYOWKUSCTBRE-CIUDSAMLSA-N Cys-Arg-Gln Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O BUIYOWKUSCTBRE-CIUDSAMLSA-N 0.000 description 1
- JTNKVWLMDHIUOG-IHRRRGAJSA-N Cys-Arg-Phe Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JTNKVWLMDHIUOG-IHRRRGAJSA-N 0.000 description 1
- BYALSSDCQYHKMY-XGEHTFHBSA-N Cys-Arg-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N)O BYALSSDCQYHKMY-XGEHTFHBSA-N 0.000 description 1
- XXDLUZLKHOVPNW-IHRRRGAJSA-N Cys-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N)O XXDLUZLKHOVPNW-IHRRRGAJSA-N 0.000 description 1
- UISYPAHPLXGLNH-ACZMJKKPSA-N Cys-Asn-Gln Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O UISYPAHPLXGLNH-ACZMJKKPSA-N 0.000 description 1
- UPJGYXRAPJWIHD-CIUDSAMLSA-N Cys-Asn-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UPJGYXRAPJWIHD-CIUDSAMLSA-N 0.000 description 1
- SQJSYLDKQBZQTG-FXQIFTODSA-N Cys-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N SQJSYLDKQBZQTG-FXQIFTODSA-N 0.000 description 1
- UWXFFVQPAMBETM-ZLUOBGJFSA-N Cys-Asp-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UWXFFVQPAMBETM-ZLUOBGJFSA-N 0.000 description 1
- GSNRZJNHMVMOFV-ACZMJKKPSA-N Cys-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N GSNRZJNHMVMOFV-ACZMJKKPSA-N 0.000 description 1
- NIPJKKSXHSBEMX-CIUDSAMLSA-N Cys-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N NIPJKKSXHSBEMX-CIUDSAMLSA-N 0.000 description 1
- BIVLWXQGXJLGKG-BIIVOSGPSA-N Cys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N)C(=O)O BIVLWXQGXJLGKG-BIIVOSGPSA-N 0.000 description 1
- BMHBJCVEXUBGFI-BIIVOSGPSA-N Cys-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CS)N)C(=O)O BMHBJCVEXUBGFI-BIIVOSGPSA-N 0.000 description 1
- YUZPQIQWXLRFBW-ACZMJKKPSA-N Cys-Glu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O YUZPQIQWXLRFBW-ACZMJKKPSA-N 0.000 description 1
- ZEXHDOQQYZKOIB-ACZMJKKPSA-N Cys-Glu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZEXHDOQQYZKOIB-ACZMJKKPSA-N 0.000 description 1
- DZLQXIFVQFTFJY-BYPYZUCNSA-N Cys-Gly-Gly Chemical compound SC[C@H](N)C(=O)NCC(=O)NCC(O)=O DZLQXIFVQFTFJY-BYPYZUCNSA-N 0.000 description 1
- PQHYZJPCYRDYNE-QWRGUYRKSA-N Cys-Gly-Phe Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PQHYZJPCYRDYNE-QWRGUYRKSA-N 0.000 description 1
- XTHUKRLJRUVVBF-WHFBIAKZSA-N Cys-Gly-Ser Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O XTHUKRLJRUVVBF-WHFBIAKZSA-N 0.000 description 1
- UVZFZTWNHOQWNK-NAKRPEOUSA-N Cys-Ile-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UVZFZTWNHOQWNK-NAKRPEOUSA-N 0.000 description 1
- MTNJRNQDDSWQQA-GQGQLFGLSA-N Cys-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CS)N MTNJRNQDDSWQQA-GQGQLFGLSA-N 0.000 description 1
- LHMSYHSAAJOEBL-CIUDSAMLSA-N Cys-Lys-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O LHMSYHSAAJOEBL-CIUDSAMLSA-N 0.000 description 1
- RESAHOSBQHMOKH-KKUMJFAQSA-N Cys-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N RESAHOSBQHMOKH-KKUMJFAQSA-N 0.000 description 1
- IDZDFWJNPOOOHE-KKUMJFAQSA-N Cys-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N IDZDFWJNPOOOHE-KKUMJFAQSA-N 0.000 description 1
- KJJASVYBTKRYSN-FXQIFTODSA-N Cys-Pro-Asp Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CC(=O)O)C(=O)O KJJASVYBTKRYSN-FXQIFTODSA-N 0.000 description 1
- NITLUESFANGEIW-BQBZGAKWSA-N Cys-Pro-Gly Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O NITLUESFANGEIW-BQBZGAKWSA-N 0.000 description 1
- XBELMDARIGXDKY-GUBZILKMSA-N Cys-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CS)N XBELMDARIGXDKY-GUBZILKMSA-N 0.000 description 1
- ZGERHCJBLPQPGV-ACZMJKKPSA-N Cys-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N ZGERHCJBLPQPGV-ACZMJKKPSA-N 0.000 description 1
- WZJLBUPPZRZNTO-CIUDSAMLSA-N Cys-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N WZJLBUPPZRZNTO-CIUDSAMLSA-N 0.000 description 1
- SAEVTQWAYDPXMU-KATARQTJSA-N Cys-Thr-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O SAEVTQWAYDPXMU-KATARQTJSA-N 0.000 description 1
- FANFRJOFTYCNRG-JYBASQMISA-N Cys-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CS)N)O FANFRJOFTYCNRG-JYBASQMISA-N 0.000 description 1
- JRZMCSIUYGSJKP-ZKWXMUAHSA-N Cys-Val-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JRZMCSIUYGSJKP-ZKWXMUAHSA-N 0.000 description 1
- IOLWXFWVYYCVTJ-NRPADANISA-N Cys-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N IOLWXFWVYYCVTJ-NRPADANISA-N 0.000 description 1
- NGOIQDYZMIKCOK-NAKRPEOUSA-N Cys-Val-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NGOIQDYZMIKCOK-NAKRPEOUSA-N 0.000 description 1
- WVWRADGCZPIJJR-IHRRRGAJSA-N Cys-Val-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CS)N WVWRADGCZPIJJR-IHRRRGAJSA-N 0.000 description 1
- 208000035874 Excoriation Diseases 0.000 description 1
- 101710155000 Gamma conglutin 1 Proteins 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 1
- LKUWAWGNJYJODH-KBIXCLLPSA-N Gln-Ala-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKUWAWGNJYJODH-KBIXCLLPSA-N 0.000 description 1
- IGNGBUVODQLMRJ-CIUDSAMLSA-N Gln-Ala-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IGNGBUVODQLMRJ-CIUDSAMLSA-N 0.000 description 1
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 1
- LTLXPHKSQQILNF-CIUDSAMLSA-N Gln-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N LTLXPHKSQQILNF-CIUDSAMLSA-N 0.000 description 1
- WMOMPXKOKASNBK-PEFMBERDSA-N Gln-Asn-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WMOMPXKOKASNBK-PEFMBERDSA-N 0.000 description 1
- IKDOHQHEFPPGJG-FXQIFTODSA-N Gln-Asp-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IKDOHQHEFPPGJG-FXQIFTODSA-N 0.000 description 1
- WQWMZOIPXWSZNE-WDSKDSINSA-N Gln-Asp-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O WQWMZOIPXWSZNE-WDSKDSINSA-N 0.000 description 1
- JFSNBQJNDMXMQF-XHNCKOQMSA-N Gln-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JFSNBQJNDMXMQF-XHNCKOQMSA-N 0.000 description 1
- PCKOTDPDHIBGRW-CIUDSAMLSA-N Gln-Cys-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N PCKOTDPDHIBGRW-CIUDSAMLSA-N 0.000 description 1
- MFLMFRZBAJSGHK-ACZMJKKPSA-N Gln-Cys-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N MFLMFRZBAJSGHK-ACZMJKKPSA-N 0.000 description 1
- QYKBTDOAMKORGL-FXQIFTODSA-N Gln-Gln-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N QYKBTDOAMKORGL-FXQIFTODSA-N 0.000 description 1
- AJDMYLOISOCHHC-YVNDNENWSA-N Gln-Gln-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AJDMYLOISOCHHC-YVNDNENWSA-N 0.000 description 1
- MADFVRSKEIEZHZ-DCAQKATOSA-N Gln-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N MADFVRSKEIEZHZ-DCAQKATOSA-N 0.000 description 1
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 1
- MAGNEQBFSBREJL-DCAQKATOSA-N Gln-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N MAGNEQBFSBREJL-DCAQKATOSA-N 0.000 description 1
- LFIVHGMKWFGUGK-IHRRRGAJSA-N Gln-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N LFIVHGMKWFGUGK-IHRRRGAJSA-N 0.000 description 1
- DRDSQGHKTLSNEA-GLLZPBPUSA-N Gln-Glu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DRDSQGHKTLSNEA-GLLZPBPUSA-N 0.000 description 1
- JHPFPROFOAJRFN-IHRRRGAJSA-N Gln-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O JHPFPROFOAJRFN-IHRRRGAJSA-N 0.000 description 1
- HVQCEQTUSWWFOS-WDSKDSINSA-N Gln-Gly-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N HVQCEQTUSWWFOS-WDSKDSINSA-N 0.000 description 1
- LVSYIKGMLRHKME-IUCAKERBSA-N Gln-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N LVSYIKGMLRHKME-IUCAKERBSA-N 0.000 description 1
- GLEGHWQNGPMKHO-DCAQKATOSA-N Gln-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GLEGHWQNGPMKHO-DCAQKATOSA-N 0.000 description 1
- GFLNKSQHOBOMNM-AVGNSLFASA-N Gln-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GFLNKSQHOBOMNM-AVGNSLFASA-N 0.000 description 1
- HXOLDXKNWKLDMM-YVNDNENWSA-N Gln-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HXOLDXKNWKLDMM-YVNDNENWSA-N 0.000 description 1
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 1
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 1
- HHQCBFGKQDMWSP-GUBZILKMSA-N Gln-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HHQCBFGKQDMWSP-GUBZILKMSA-N 0.000 description 1
- JNENSVNAUWONEZ-GUBZILKMSA-N Gln-Lys-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JNENSVNAUWONEZ-GUBZILKMSA-N 0.000 description 1
- DQLVHRFFBQOWFL-JYJNAYRXSA-N Gln-Lys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)O DQLVHRFFBQOWFL-JYJNAYRXSA-N 0.000 description 1
- SWDSRANUCKNBLA-AVGNSLFASA-N Gln-Phe-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SWDSRANUCKNBLA-AVGNSLFASA-N 0.000 description 1
- BZULIEARJFRINC-IHRRRGAJSA-N Gln-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N BZULIEARJFRINC-IHRRRGAJSA-N 0.000 description 1
- QBEWLBKBGXVVPD-RYUDHWBXSA-N Gln-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N QBEWLBKBGXVVPD-RYUDHWBXSA-N 0.000 description 1
- XUMFMAVDHQDATI-DCAQKATOSA-N Gln-Pro-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XUMFMAVDHQDATI-DCAQKATOSA-N 0.000 description 1
- WBYHRQBKJGEBQJ-CIUDSAMLSA-N Gln-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CS)C(=O)O WBYHRQBKJGEBQJ-CIUDSAMLSA-N 0.000 description 1
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 1
- WLRYGVYQFXRJDA-DCAQKATOSA-N Gln-Pro-Pro Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 WLRYGVYQFXRJDA-DCAQKATOSA-N 0.000 description 1
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 1
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 1
- ZGHMRONFHDVXEF-AVGNSLFASA-N Gln-Ser-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZGHMRONFHDVXEF-AVGNSLFASA-N 0.000 description 1
- PAOHIZNRJNIXQY-XQXXSGGOSA-N Gln-Thr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PAOHIZNRJNIXQY-XQXXSGGOSA-N 0.000 description 1
- XIYWAJQIWLXXAF-XKBZYTNZSA-N Gln-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O XIYWAJQIWLXXAF-XKBZYTNZSA-N 0.000 description 1
- IIMZHVKZBGSEKZ-SZMVWBNQSA-N Gln-Trp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O IIMZHVKZBGSEKZ-SZMVWBNQSA-N 0.000 description 1
- CVRUVYDNRPSKBM-QEJZJMRPSA-N Gln-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N CVRUVYDNRPSKBM-QEJZJMRPSA-N 0.000 description 1
- WPJDPEOQUIXXOY-AVGNSLFASA-N Gln-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O WPJDPEOQUIXXOY-AVGNSLFASA-N 0.000 description 1
- OACQOWPRWGNKTP-AVGNSLFASA-N Gln-Tyr-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O OACQOWPRWGNKTP-AVGNSLFASA-N 0.000 description 1
- WIMVKDYAKRAUCG-IHRRRGAJSA-N Gln-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O WIMVKDYAKRAUCG-IHRRRGAJSA-N 0.000 description 1
- SGVGIVDZLSHSEN-RYUDHWBXSA-N Gln-Tyr-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O SGVGIVDZLSHSEN-RYUDHWBXSA-N 0.000 description 1
- ZZLDMBMFKZFQMU-NRPADANISA-N Gln-Val-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O ZZLDMBMFKZFQMU-NRPADANISA-N 0.000 description 1
- BBFCMGBMYIAGRS-AUTRQRHGSA-N Gln-Val-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BBFCMGBMYIAGRS-AUTRQRHGSA-N 0.000 description 1
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 1
- IYAUFWMUCGBFMQ-CIUDSAMLSA-N Glu-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N IYAUFWMUCGBFMQ-CIUDSAMLSA-N 0.000 description 1
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 1
- YKLNMGJYMNPBCP-ACZMJKKPSA-N Glu-Asn-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YKLNMGJYMNPBCP-ACZMJKKPSA-N 0.000 description 1
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 1
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 description 1
- ZJICFHQSPWFBKP-AVGNSLFASA-N Glu-Asn-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZJICFHQSPWFBKP-AVGNSLFASA-N 0.000 description 1
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 1
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 1
- CYHBMLHCQXXCCT-AVGNSLFASA-N Glu-Asp-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CYHBMLHCQXXCCT-AVGNSLFASA-N 0.000 description 1
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 1
- PKYAVRMYTBBRLS-FXQIFTODSA-N Glu-Cys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O PKYAVRMYTBBRLS-FXQIFTODSA-N 0.000 description 1
- ZZIFPJZQHRJERU-WDSKDSINSA-N Glu-Cys-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O ZZIFPJZQHRJERU-WDSKDSINSA-N 0.000 description 1
- KVBPDJIFRQUQFY-ACZMJKKPSA-N Glu-Cys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O KVBPDJIFRQUQFY-ACZMJKKPSA-N 0.000 description 1
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 1
- XMVLTPMCUJTJQP-FXQIFTODSA-N Glu-Gln-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N XMVLTPMCUJTJQP-FXQIFTODSA-N 0.000 description 1
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 1
- NUSWUSKZRCGFEX-FXQIFTODSA-N Glu-Glu-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O NUSWUSKZRCGFEX-FXQIFTODSA-N 0.000 description 1
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- KUTPGXNAAOQSPD-LPEHRKFASA-N Glu-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KUTPGXNAAOQSPD-LPEHRKFASA-N 0.000 description 1
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 1
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 1
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 1
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 1
- WRNAXCVRSBBKGS-BQBZGAKWSA-N Glu-Gly-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O WRNAXCVRSBBKGS-BQBZGAKWSA-N 0.000 description 1
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 1
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 1
- XMPAXPSENRSOSV-RYUDHWBXSA-N Glu-Gly-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XMPAXPSENRSOSV-RYUDHWBXSA-N 0.000 description 1
- DVLZZEPUNFEUBW-AVGNSLFASA-N Glu-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N DVLZZEPUNFEUBW-AVGNSLFASA-N 0.000 description 1
- ZPASCJBSSCRWMC-GVXVVHGQSA-N Glu-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N ZPASCJBSSCRWMC-GVXVVHGQSA-N 0.000 description 1
- QIQABBIDHGQXGA-ZPFDUUQYSA-N Glu-Ile-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QIQABBIDHGQXGA-ZPFDUUQYSA-N 0.000 description 1
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 1
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 1
- GXMXPCXXKVWOSM-KQXIARHKSA-N Glu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N GXMXPCXXKVWOSM-KQXIARHKSA-N 0.000 description 1
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 1
- LZMQSTPFYJLVJB-GUBZILKMSA-N Glu-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N LZMQSTPFYJLVJB-GUBZILKMSA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 1
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 1
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 1
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 1
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 1
- YGLCLCMAYUYZSG-AVGNSLFASA-N Glu-Lys-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 YGLCLCMAYUYZSG-AVGNSLFASA-N 0.000 description 1
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 1
- FMBWLLMUPXTXFC-SDDRHHMPSA-N Glu-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N)C(=O)O FMBWLLMUPXTXFC-SDDRHHMPSA-N 0.000 description 1
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 1
- CBEUFCJRFNZMCU-SRVKXCTJSA-N Glu-Met-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O CBEUFCJRFNZMCU-SRVKXCTJSA-N 0.000 description 1
- YHOJJFFTSMWVGR-HJGDQZAQSA-N Glu-Met-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YHOJJFFTSMWVGR-HJGDQZAQSA-N 0.000 description 1
- UERORLSAFUHDGU-AVGNSLFASA-N Glu-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UERORLSAFUHDGU-AVGNSLFASA-N 0.000 description 1
- YRMZCZIRHYCNHX-RYUDHWBXSA-N Glu-Phe-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O YRMZCZIRHYCNHX-RYUDHWBXSA-N 0.000 description 1
- YTRBQAQSUDSIQE-FHWLQOOXSA-N Glu-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 YTRBQAQSUDSIQE-FHWLQOOXSA-N 0.000 description 1
- TWYFJOHWGCCRIR-DCAQKATOSA-N Glu-Pro-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYFJOHWGCCRIR-DCAQKATOSA-N 0.000 description 1
- HLYCMRDRWGSTPZ-CIUDSAMLSA-N Glu-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CS)C(=O)O HLYCMRDRWGSTPZ-CIUDSAMLSA-N 0.000 description 1
- DCBSZJJHOTXMHY-DCAQKATOSA-N Glu-Pro-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DCBSZJJHOTXMHY-DCAQKATOSA-N 0.000 description 1
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 1
- HMJULNMJWOZNFI-XHNCKOQMSA-N Glu-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N)C(=O)O HMJULNMJWOZNFI-XHNCKOQMSA-N 0.000 description 1
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 1
- BDISFWMLMNBTGP-NUMRIWBASA-N Glu-Thr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O BDISFWMLMNBTGP-NUMRIWBASA-N 0.000 description 1
- LWYUQLZOIORFFJ-XKBZYTNZSA-N Glu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O LWYUQLZOIORFFJ-XKBZYTNZSA-N 0.000 description 1
- RGJKYNUINKGPJN-RWRJDSDZSA-N Glu-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(=O)O)N RGJKYNUINKGPJN-RWRJDSDZSA-N 0.000 description 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 1
- VHPVBPCCWVDGJL-IRIUXVKKSA-N Glu-Thr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VHPVBPCCWVDGJL-IRIUXVKKSA-N 0.000 description 1
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 1
- JDAYMLXPUJRSDJ-XIRDDKMYSA-N Glu-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 JDAYMLXPUJRSDJ-XIRDDKMYSA-N 0.000 description 1
- ZTNHPMZHAILHRB-JSGCOSHPSA-N Glu-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)NCC(O)=O)=CNC2=C1 ZTNHPMZHAILHRB-JSGCOSHPSA-N 0.000 description 1
- MIWJDJAMMKHUAR-ZVZYQTTQSA-N Glu-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N MIWJDJAMMKHUAR-ZVZYQTTQSA-N 0.000 description 1
- UCZXXMREFIETQW-AVGNSLFASA-N Glu-Tyr-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O UCZXXMREFIETQW-AVGNSLFASA-N 0.000 description 1
- XOEKMEAOMXMURD-JYJNAYRXSA-N Glu-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O XOEKMEAOMXMURD-JYJNAYRXSA-N 0.000 description 1
- HJTSRYLPAYGEEC-SIUGBPQLSA-N Glu-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N HJTSRYLPAYGEEC-SIUGBPQLSA-N 0.000 description 1
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 1
- KXRORHJIRAOQPG-SOUVJXGZSA-N Glu-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KXRORHJIRAOQPG-SOUVJXGZSA-N 0.000 description 1
- BKMOHWJHXQLFEX-IRIUXVKKSA-N Glu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N)O BKMOHWJHXQLFEX-IRIUXVKKSA-N 0.000 description 1
- QLNKFGTZOBVMCS-JBACZVJFSA-N Glu-Tyr-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QLNKFGTZOBVMCS-JBACZVJFSA-N 0.000 description 1
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 1
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 1
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 1
- FVGOGEGGQLNZGH-DZKIICNBSA-N Glu-Val-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FVGOGEGGQLNZGH-DZKIICNBSA-N 0.000 description 1
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 1
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 1
- 108010055629 Glucosyltransferases Proteins 0.000 description 1
- 102000000340 Glucosyltransferases Human genes 0.000 description 1
- FKJQNJCQTKUBCD-XPUUQOCRSA-N Gly-Ala-His Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O FKJQNJCQTKUBCD-XPUUQOCRSA-N 0.000 description 1
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 1
- LERGJIVJIIODPZ-ZANVPECISA-N Gly-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)CN)C)C(O)=O)=CNC2=C1 LERGJIVJIIODPZ-ZANVPECISA-N 0.000 description 1
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 1
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 1
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 1
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 1
- AIJAPFVDBFYNKN-WHFBIAKZSA-N Gly-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN)C(=O)N AIJAPFVDBFYNKN-WHFBIAKZSA-N 0.000 description 1
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 1
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 1
- LURCIJSJAKFCRO-QWRGUYRKSA-N Gly-Asn-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LURCIJSJAKFCRO-QWRGUYRKSA-N 0.000 description 1
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 1
- XEJTYSCIXKYSHR-WDSKDSINSA-N Gly-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN XEJTYSCIXKYSHR-WDSKDSINSA-N 0.000 description 1
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 1
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 1
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 1
- YZACQYVWLCQWBT-BQBZGAKWSA-N Gly-Cys-Arg Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YZACQYVWLCQWBT-BQBZGAKWSA-N 0.000 description 1
- YDWZGVCXMVLDQH-WHFBIAKZSA-N Gly-Cys-Asn Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(N)=O YDWZGVCXMVLDQH-WHFBIAKZSA-N 0.000 description 1
- GZBZACMXFIPIDX-WHFBIAKZSA-N Gly-Cys-Asp Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)C(=O)O GZBZACMXFIPIDX-WHFBIAKZSA-N 0.000 description 1
- NMROINAYXCACKF-WHFBIAKZSA-N Gly-Cys-Cys Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O NMROINAYXCACKF-WHFBIAKZSA-N 0.000 description 1
- XXGQRGQPGFYECI-WDSKDSINSA-N Gly-Cys-Glu Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCC(O)=O XXGQRGQPGFYECI-WDSKDSINSA-N 0.000 description 1
- UEGIPZAXNBYCCP-NKWVEPMBSA-N Gly-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)CN)C(=O)O UEGIPZAXNBYCCP-NKWVEPMBSA-N 0.000 description 1
- QCTLGOYODITHPQ-WHFBIAKZSA-N Gly-Cys-Ser Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O QCTLGOYODITHPQ-WHFBIAKZSA-N 0.000 description 1
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 1
- JMQFHZWESBGPFC-WDSKDSINSA-N Gly-Gln-Asp Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JMQFHZWESBGPFC-WDSKDSINSA-N 0.000 description 1
- KTSZUNRRYXPZTK-BQBZGAKWSA-N Gly-Gln-Glu Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KTSZUNRRYXPZTK-BQBZGAKWSA-N 0.000 description 1
- YZPVGIVFMZLQMM-YUMQZZPRSA-N Gly-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN YZPVGIVFMZLQMM-YUMQZZPRSA-N 0.000 description 1
- NPSWCZIRBAYNSB-JHEQGTHGSA-N Gly-Gln-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPSWCZIRBAYNSB-JHEQGTHGSA-N 0.000 description 1
- QPDUVFSVVAOUHE-XVKPBYJWSA-N Gly-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CN)C(O)=O QPDUVFSVVAOUHE-XVKPBYJWSA-N 0.000 description 1
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 1
- BIRKKBCSAIHDDF-WDSKDSINSA-N Gly-Glu-Cys Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BIRKKBCSAIHDDF-WDSKDSINSA-N 0.000 description 1
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 1
- LHRXAHLCRMQBGJ-RYUDHWBXSA-N Gly-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN LHRXAHLCRMQBGJ-RYUDHWBXSA-N 0.000 description 1
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 1
- CUYLIWAAAYJKJH-RYUDHWBXSA-N Gly-Glu-Tyr Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUYLIWAAAYJKJH-RYUDHWBXSA-N 0.000 description 1
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 1
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 1
- IDOGEHIWMJMAHT-BYPYZUCNSA-N Gly-Gly-Cys Chemical compound NCC(=O)NCC(=O)N[C@@H](CS)C(O)=O IDOGEHIWMJMAHT-BYPYZUCNSA-N 0.000 description 1
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 1
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 1
- IVSWQHKONQIOHA-YUMQZZPRSA-N Gly-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN IVSWQHKONQIOHA-YUMQZZPRSA-N 0.000 description 1
- YFGONBOFGGWKKY-VHSXEESVSA-N Gly-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)CN)C(=O)O YFGONBOFGGWKKY-VHSXEESVSA-N 0.000 description 1
- ALOBJFDJTMQQPW-ONGXEEELSA-N Gly-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN ALOBJFDJTMQQPW-ONGXEEELSA-N 0.000 description 1
- FCKPEGOCSVZPNC-WHOFXGATSA-N Gly-Ile-Phe Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FCKPEGOCSVZPNC-WHOFXGATSA-N 0.000 description 1
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 1
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 1
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 1
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 1
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 1
- YSDLIYZLOTZZNP-UWVGGRQHSA-N Gly-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN YSDLIYZLOTZZNP-UWVGGRQHSA-N 0.000 description 1
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 1
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 1
- IUKIDFVOUHZRAK-QWRGUYRKSA-N Gly-Lys-His Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IUKIDFVOUHZRAK-QWRGUYRKSA-N 0.000 description 1
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 1
- MHZXESQPPXOING-KBPBESRZSA-N Gly-Lys-Phe Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MHZXESQPPXOING-KBPBESRZSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 1
- QLQDIJBYJZKQPR-BQBZGAKWSA-N Gly-Met-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN QLQDIJBYJZKQPR-BQBZGAKWSA-N 0.000 description 1
- IFHJOBKVXBESRE-YUMQZZPRSA-N Gly-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)CN IFHJOBKVXBESRE-YUMQZZPRSA-N 0.000 description 1
- LXTRSHQLGYINON-DTWKUNHWSA-N Gly-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN LXTRSHQLGYINON-DTWKUNHWSA-N 0.000 description 1
- QVDGHDFFYHKJPN-QWRGUYRKSA-N Gly-Phe-Cys Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CS)C(O)=O QVDGHDFFYHKJPN-QWRGUYRKSA-N 0.000 description 1
- DHNXGWVNLFPOMQ-KBPBESRZSA-N Gly-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN DHNXGWVNLFPOMQ-KBPBESRZSA-N 0.000 description 1
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 1
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 1
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 1
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 1
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 1
- OCPPBNKYGYSLOE-IUCAKERBSA-N Gly-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN OCPPBNKYGYSLOE-IUCAKERBSA-N 0.000 description 1
- ISSDODCYBOWWIP-GJZGRUSLSA-N Gly-Pro-Trp Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ISSDODCYBOWWIP-GJZGRUSLSA-N 0.000 description 1
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 1
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 1
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- YXTFLTJYLIAZQG-FJXKBIBVSA-N Gly-Thr-Arg Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YXTFLTJYLIAZQG-FJXKBIBVSA-N 0.000 description 1
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 1
- PASHZZBXZYEXFE-LSDHHAIUSA-N Gly-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)CN)C(=O)O PASHZZBXZYEXFE-LSDHHAIUSA-N 0.000 description 1
- UMBDRSMLCUYIRI-DVJZZOLTSA-N Gly-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN)O UMBDRSMLCUYIRI-DVJZZOLTSA-N 0.000 description 1
- UIQGJYUEQDOODF-KWQFWETISA-N Gly-Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 UIQGJYUEQDOODF-KWQFWETISA-N 0.000 description 1
- GNNJKUYDWFIBTK-QWRGUYRKSA-N Gly-Tyr-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O GNNJKUYDWFIBTK-QWRGUYRKSA-N 0.000 description 1
- KOYUSMBPJOVSOO-XEGUGMAKSA-N Gly-Tyr-Ile Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KOYUSMBPJOVSOO-XEGUGMAKSA-N 0.000 description 1
- IHDKKJVBLGXLEL-STQMWFEESA-N Gly-Tyr-Met Chemical compound CSCC[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)CN)C(O)=O IHDKKJVBLGXLEL-STQMWFEESA-N 0.000 description 1
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 1
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 1
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 1
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 1
- MUGLKCQHTUFLGF-WPRPVWTQSA-N Gly-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)CN MUGLKCQHTUFLGF-WPRPVWTQSA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- 108700023372 Glycosyltransferases Proteins 0.000 description 1
- 102000051366 Glycosyltransferases Human genes 0.000 description 1
- BIAKMWKJMQLZOJ-ZKWXMUAHSA-N His-Ala-Ala Chemical compound C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O BIAKMWKJMQLZOJ-ZKWXMUAHSA-N 0.000 description 1
- DCRODRAURLJOFY-XPUUQOCRSA-N His-Ala-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)NCC(O)=O DCRODRAURLJOFY-XPUUQOCRSA-N 0.000 description 1
- XINDHUAGVGCNSF-QSFUFRPTSA-N His-Ala-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XINDHUAGVGCNSF-QSFUFRPTSA-N 0.000 description 1
- AWASVTXPTOLPPP-MBLNEYKQSA-N His-Ala-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWASVTXPTOLPPP-MBLNEYKQSA-N 0.000 description 1
- ZIMTWPHIKZEHSE-UWVGGRQHSA-N His-Arg-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O ZIMTWPHIKZEHSE-UWVGGRQHSA-N 0.000 description 1
- AVQOSMRPITVTRB-CIUDSAMLSA-N His-Asn-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AVQOSMRPITVTRB-CIUDSAMLSA-N 0.000 description 1
- WZOGEMJIZBNFBK-CIUDSAMLSA-N His-Asp-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O WZOGEMJIZBNFBK-CIUDSAMLSA-N 0.000 description 1
- RXVOMIADLXPJGW-GUBZILKMSA-N His-Asp-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RXVOMIADLXPJGW-GUBZILKMSA-N 0.000 description 1
- ZZLWLWSUIBSMNP-CIUDSAMLSA-N His-Asp-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZZLWLWSUIBSMNP-CIUDSAMLSA-N 0.000 description 1
- IDQKGZWUPVOGPZ-GUBZILKMSA-N His-Cys-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N IDQKGZWUPVOGPZ-GUBZILKMSA-N 0.000 description 1
- TVTIDSMADMIHEU-KKUMJFAQSA-N His-Cys-Phe Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CS)C(=O)N[C@@H](Cc1ccccc1)C(O)=O TVTIDSMADMIHEU-KKUMJFAQSA-N 0.000 description 1
- MWXBCJKQRQFVOO-DCAQKATOSA-N His-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CN=CN1)N MWXBCJKQRQFVOO-DCAQKATOSA-N 0.000 description 1
- HVCRQRQPIIRNLY-IUCAKERBSA-N His-Gln-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N HVCRQRQPIIRNLY-IUCAKERBSA-N 0.000 description 1
- IMCHNUANCIGUKS-SRVKXCTJSA-N His-Glu-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IMCHNUANCIGUKS-SRVKXCTJSA-N 0.000 description 1
- SDTPKSOWFXBACN-GUBZILKMSA-N His-Glu-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O SDTPKSOWFXBACN-GUBZILKMSA-N 0.000 description 1
- XMENRVZYPBKBIL-AVGNSLFASA-N His-Glu-His Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O XMENRVZYPBKBIL-AVGNSLFASA-N 0.000 description 1
- FIMNVXRZGUAGBI-AVGNSLFASA-N His-Glu-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FIMNVXRZGUAGBI-AVGNSLFASA-N 0.000 description 1
- WEIYKCOEVBUJQC-JYJNAYRXSA-N His-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WEIYKCOEVBUJQC-JYJNAYRXSA-N 0.000 description 1
- OSZUPUINVNPCOE-SDDRHHMPSA-N His-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O OSZUPUINVNPCOE-SDDRHHMPSA-N 0.000 description 1
- YADRBUZBKHHDAO-XPUUQOCRSA-N His-Gly-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C)C(O)=O YADRBUZBKHHDAO-XPUUQOCRSA-N 0.000 description 1
- VBOFRJNDIOPNDO-YUMQZZPRSA-N His-Gly-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N VBOFRJNDIOPNDO-YUMQZZPRSA-N 0.000 description 1
- OEROYDLRVAYIMQ-YUMQZZPRSA-N His-Gly-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O OEROYDLRVAYIMQ-YUMQZZPRSA-N 0.000 description 1
- HAPWZEVRQYGLSG-IUCAKERBSA-N His-Gly-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O HAPWZEVRQYGLSG-IUCAKERBSA-N 0.000 description 1
- RGPWUJOMKFYFSR-QWRGUYRKSA-N His-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RGPWUJOMKFYFSR-QWRGUYRKSA-N 0.000 description 1
- NQKRILCJYCASDV-QWRGUYRKSA-N His-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 NQKRILCJYCASDV-QWRGUYRKSA-N 0.000 description 1
- BDFCIKANUNMFGB-PMVVWTBXSA-N His-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 BDFCIKANUNMFGB-PMVVWTBXSA-N 0.000 description 1
- JIUYRPFQJJRSJB-QWRGUYRKSA-N His-His-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)NCC(O)=O)C1=CN=CN1 JIUYRPFQJJRSJB-QWRGUYRKSA-N 0.000 description 1
- VJJSDSNFXCWCEJ-DJFWLOJKSA-N His-Ile-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O VJJSDSNFXCWCEJ-DJFWLOJKSA-N 0.000 description 1
- MPXGJGBXCRQQJE-MXAVVETBSA-N His-Ile-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O MPXGJGBXCRQQJE-MXAVVETBSA-N 0.000 description 1
- VYUXYMRNGALHEA-DLOVCJGASA-N His-Leu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O VYUXYMRNGALHEA-DLOVCJGASA-N 0.000 description 1
- BPOHQCZZSFBSON-KKUMJFAQSA-N His-Leu-His Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BPOHQCZZSFBSON-KKUMJFAQSA-N 0.000 description 1
- KHUFDBQXGLEIHC-BZSNNMDCSA-N His-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 KHUFDBQXGLEIHC-BZSNNMDCSA-N 0.000 description 1
- XKIYNCLILDLGRS-QWRGUYRKSA-N His-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 XKIYNCLILDLGRS-QWRGUYRKSA-N 0.000 description 1
- SGLXGEDPYJPGIQ-ACRUOGEOSA-N His-Phe-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N SGLXGEDPYJPGIQ-ACRUOGEOSA-N 0.000 description 1
- ZVKDCQVQTGYBQT-LSJOCFKGSA-N His-Pro-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O ZVKDCQVQTGYBQT-LSJOCFKGSA-N 0.000 description 1
- BZAQOPHNBFOOJS-DCAQKATOSA-N His-Pro-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O BZAQOPHNBFOOJS-DCAQKATOSA-N 0.000 description 1
- STGQSBKUYSPPIG-CIUDSAMLSA-N His-Ser-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 STGQSBKUYSPPIG-CIUDSAMLSA-N 0.000 description 1
- IAYPZSHNZQHQNO-KKUMJFAQSA-N His-Ser-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N IAYPZSHNZQHQNO-KKUMJFAQSA-N 0.000 description 1
- FFKJUTZARGRVTH-KKUMJFAQSA-N His-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FFKJUTZARGRVTH-KKUMJFAQSA-N 0.000 description 1
- FCPSGEVYIVXPPO-QTKMDUPCSA-N His-Thr-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FCPSGEVYIVXPPO-QTKMDUPCSA-N 0.000 description 1
- DQZCEKQPSOBNMJ-NKIYYHGXSA-N His-Thr-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DQZCEKQPSOBNMJ-NKIYYHGXSA-N 0.000 description 1
- JUCZDDVZBMPKRT-IXOXFDKPSA-N His-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O JUCZDDVZBMPKRT-IXOXFDKPSA-N 0.000 description 1
- FRDFAWHTPDKRHG-ULQDDVLXSA-N His-Tyr-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CN=CN1 FRDFAWHTPDKRHG-ULQDDVLXSA-N 0.000 description 1
- PZUZIHRPOVVHOT-KBPBESRZSA-N His-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CN=CN1 PZUZIHRPOVVHOT-KBPBESRZSA-N 0.000 description 1
- QTMKFZAYZKBFRC-BZSNNMDCSA-N His-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N)O QTMKFZAYZKBFRC-BZSNNMDCSA-N 0.000 description 1
- CGAMSLMBYJHMDY-ONGXEEELSA-N His-Val-Gly Chemical compound CC(C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N CGAMSLMBYJHMDY-ONGXEEELSA-N 0.000 description 1
- FFYYUUWROYYKFY-IHRRRGAJSA-N His-Val-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O FFYYUUWROYYKFY-IHRRRGAJSA-N 0.000 description 1
- CMPHFUWXKBPNRS-WDSOQIARSA-N His-Val-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CNC=N1 CMPHFUWXKBPNRS-WDSOQIARSA-N 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 101000766907 Homo sapiens C-type lectin domain family 4 member C Proteins 0.000 description 1
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 1
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 1
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 1
- TZCGZYWNIDZZMR-NAKRPEOUSA-N Ile-Arg-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)O)N TZCGZYWNIDZZMR-NAKRPEOUSA-N 0.000 description 1
- BOTVMTSMOUSDRW-GMOBBJLQSA-N Ile-Arg-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O BOTVMTSMOUSDRW-GMOBBJLQSA-N 0.000 description 1
- NULSANWBUWLTKN-NAKRPEOUSA-N Ile-Arg-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N NULSANWBUWLTKN-NAKRPEOUSA-N 0.000 description 1
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 1
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 1
- NPROWIBAWYMPAZ-GUDRVLHUSA-N Ile-Asp-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N NPROWIBAWYMPAZ-GUDRVLHUSA-N 0.000 description 1
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 1
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 1
- FHCNLXMTQJNJNH-KBIXCLLPSA-N Ile-Cys-Gln Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)O FHCNLXMTQJNJNH-KBIXCLLPSA-N 0.000 description 1
- JHCVYQKVKOLAIU-NAKRPEOUSA-N Ile-Cys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)O)N JHCVYQKVKOLAIU-NAKRPEOUSA-N 0.000 description 1
- KUHFPGIVBOCRMV-MNXVOIDGSA-N Ile-Gln-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N KUHFPGIVBOCRMV-MNXVOIDGSA-N 0.000 description 1
- OVPYIUNCVSOVNF-KQXIARHKSA-N Ile-Gln-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N OVPYIUNCVSOVNF-KQXIARHKSA-N 0.000 description 1
- YBJWJQQBWRARLT-KBIXCLLPSA-N Ile-Gln-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O YBJWJQQBWRARLT-KBIXCLLPSA-N 0.000 description 1
- DVRDRICMWUSCBN-UKJIMTQDSA-N Ile-Gln-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DVRDRICMWUSCBN-UKJIMTQDSA-N 0.000 description 1
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 1
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 1
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 1
- JXMSHKFPDIUYGS-SIUGBPQLSA-N Ile-Glu-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N JXMSHKFPDIUYGS-SIUGBPQLSA-N 0.000 description 1
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 1
- ODPKZZLRDNXTJZ-WHOFXGATSA-N Ile-Gly-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ODPKZZLRDNXTJZ-WHOFXGATSA-N 0.000 description 1
- HYLIOBDWPQNLKI-HVTMNAMFSA-N Ile-His-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HYLIOBDWPQNLKI-HVTMNAMFSA-N 0.000 description 1
- VUEXLJFLDONGKQ-PYJNHQTQSA-N Ile-His-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCSC)C(=O)O)N VUEXLJFLDONGKQ-PYJNHQTQSA-N 0.000 description 1
- URWXDJAEEGBADB-TUBUOCAGSA-N Ile-His-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N URWXDJAEEGBADB-TUBUOCAGSA-N 0.000 description 1
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 1
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 1
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 1
- IDMNOFVUXYYZPF-DKIMLUQUSA-N Ile-Lys-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N IDMNOFVUXYYZPF-DKIMLUQUSA-N 0.000 description 1
- KTTMFLSBTNBAHL-MXAVVETBSA-N Ile-Phe-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N KTTMFLSBTNBAHL-MXAVVETBSA-N 0.000 description 1
- VOCZPDONPURUHV-QEWYBTABSA-N Ile-Phe-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VOCZPDONPURUHV-QEWYBTABSA-N 0.000 description 1
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 1
- CIJLNXXMDUOFPH-HJWJTTGWSA-N Ile-Pro-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CIJLNXXMDUOFPH-HJWJTTGWSA-N 0.000 description 1
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 1
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 1
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 1
- XMYURPUVJSKTMC-KBIXCLLPSA-N Ile-Ser-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XMYURPUVJSKTMC-KBIXCLLPSA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 1
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 1
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 1
- BLFXHAFTNYZEQE-VKOGCVSHSA-N Ile-Trp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N BLFXHAFTNYZEQE-VKOGCVSHSA-N 0.000 description 1
- HZVRQFKRALAMQS-SLBDDTMCSA-N Ile-Trp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZVRQFKRALAMQS-SLBDDTMCSA-N 0.000 description 1
- JSLIXOUMAOUGBN-JUKXBJQTSA-N Ile-Tyr-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N JSLIXOUMAOUGBN-JUKXBJQTSA-N 0.000 description 1
- ZGKVPOSSTGHJAF-HJPIBITLSA-N Ile-Tyr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CO)C(=O)O)N ZGKVPOSSTGHJAF-HJPIBITLSA-N 0.000 description 1
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 1
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 1
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 1
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 1
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 1
- 244000211187 Lepidium sativum Species 0.000 description 1
- 235000007849 Lepidium sativum Nutrition 0.000 description 1
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 1
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 1
- UILIPCLTHRPCRB-XUXIUFHCSA-N Leu-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(C)C)N UILIPCLTHRPCRB-XUXIUFHCSA-N 0.000 description 1
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 1
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 1
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 1
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 1
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 1
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 1
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- ZDSNOSQHMJBRQN-SRVKXCTJSA-N Leu-Asp-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZDSNOSQHMJBRQN-SRVKXCTJSA-N 0.000 description 1
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 1
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 1
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 1
- NFHJQETXTSDZSI-DCAQKATOSA-N Leu-Cys-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NFHJQETXTSDZSI-DCAQKATOSA-N 0.000 description 1
- RRSLQOLASISYTB-CIUDSAMLSA-N Leu-Cys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O RRSLQOLASISYTB-CIUDSAMLSA-N 0.000 description 1
- WCTCIIAGNMFYAO-DCAQKATOSA-N Leu-Cys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O WCTCIIAGNMFYAO-DCAQKATOSA-N 0.000 description 1
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 1
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 1
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 1
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 1
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 1
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 1
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 1
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- ARRIJPQRBWRNLT-DCAQKATOSA-N Leu-Met-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ARRIJPQRBWRNLT-DCAQKATOSA-N 0.000 description 1
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 1
- MAXILRZVORNXBE-PMVMPFDFSA-N Leu-Phe-Trp Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 MAXILRZVORNXBE-PMVMPFDFSA-N 0.000 description 1
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 1
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 1
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 1
- YUTNOGOMBNYPFH-XUXIUFHCSA-N Leu-Pro-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YUTNOGOMBNYPFH-XUXIUFHCSA-N 0.000 description 1
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 1
- KIZIOFNVSOSKJI-CIUDSAMLSA-N Leu-Ser-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N KIZIOFNVSOSKJI-CIUDSAMLSA-N 0.000 description 1
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 1
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 1
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 1
- BCUVPZLLSRMPJL-XIRDDKMYSA-N Leu-Trp-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CS)C(=O)O)N BCUVPZLLSRMPJL-XIRDDKMYSA-N 0.000 description 1
- UIIMIKFNIYPDJF-WDSOQIARSA-N Leu-Trp-Met Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CCSC)C(O)=O)NC(=O)[C@@H](N)CC(C)C)=CNC2=C1 UIIMIKFNIYPDJF-WDSOQIARSA-N 0.000 description 1
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 description 1
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 1
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 1
- CLBGMWIYPYAZPR-AVGNSLFASA-N Lys-Arg-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O CLBGMWIYPYAZPR-AVGNSLFASA-N 0.000 description 1
- GQUDMNDPQTXZRV-DCAQKATOSA-N Lys-Arg-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GQUDMNDPQTXZRV-DCAQKATOSA-N 0.000 description 1
- GAOJCVKPIGHTGO-UWVGGRQHSA-N Lys-Arg-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O GAOJCVKPIGHTGO-UWVGGRQHSA-N 0.000 description 1
- BRSGXFITDXFMFF-IHRRRGAJSA-N Lys-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N BRSGXFITDXFMFF-IHRRRGAJSA-N 0.000 description 1
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 1
- SWWCDAGDQHTKIE-RHYQMDGZSA-N Lys-Arg-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWWCDAGDQHTKIE-RHYQMDGZSA-N 0.000 description 1
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 1
- YVSHZSUKQHNDHD-KKUMJFAQSA-N Lys-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N YVSHZSUKQHNDHD-KKUMJFAQSA-N 0.000 description 1
- NCTDKZKNBDZDOL-GARJFASQSA-N Lys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O NCTDKZKNBDZDOL-GARJFASQSA-N 0.000 description 1
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 1
- JBRWKVANRYPCAF-XIRDDKMYSA-N Lys-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N JBRWKVANRYPCAF-XIRDDKMYSA-N 0.000 description 1
- KPJJOZUXFOLGMQ-CIUDSAMLSA-N Lys-Asp-Asn Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N KPJJOZUXFOLGMQ-CIUDSAMLSA-N 0.000 description 1
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 description 1
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 1
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 1
- KSFQPRLZAUXXPT-GARJFASQSA-N Lys-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N)C(=O)O KSFQPRLZAUXXPT-GARJFASQSA-N 0.000 description 1
- BYEBKXRNDLTGFW-CIUDSAMLSA-N Lys-Cys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O BYEBKXRNDLTGFW-CIUDSAMLSA-N 0.000 description 1
- HWMZUBUEOYAQSC-DCAQKATOSA-N Lys-Gln-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O HWMZUBUEOYAQSC-DCAQKATOSA-N 0.000 description 1
- VSRXPEHZMHSFKU-IUCAKERBSA-N Lys-Gln-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VSRXPEHZMHSFKU-IUCAKERBSA-N 0.000 description 1
- NDORZBUHCOJQDO-GVXVVHGQSA-N Lys-Gln-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O NDORZBUHCOJQDO-GVXVVHGQSA-N 0.000 description 1
- GRADYHMSAUIKPS-DCAQKATOSA-N Lys-Glu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRADYHMSAUIKPS-DCAQKATOSA-N 0.000 description 1
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- VQXAVLQBQJMENB-SRVKXCTJSA-N Lys-Glu-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O VQXAVLQBQJMENB-SRVKXCTJSA-N 0.000 description 1
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 1
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 1
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 1
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 1
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 1
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 1
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 1
- SQJSXOQXJYAVRV-SRVKXCTJSA-N Lys-His-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N SQJSXOQXJYAVRV-SRVKXCTJSA-N 0.000 description 1
- CAVGLNOOIFHJOF-SRVKXCTJSA-N Lys-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N CAVGLNOOIFHJOF-SRVKXCTJSA-N 0.000 description 1
- VLMNBMFYRMGEMB-QWRGUYRKSA-N Lys-His-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 VLMNBMFYRMGEMB-QWRGUYRKSA-N 0.000 description 1
- SPCHLZUWJTYZFC-IHRRRGAJSA-N Lys-His-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O SPCHLZUWJTYZFC-IHRRRGAJSA-N 0.000 description 1
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 1
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 1
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 1
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 1
- ORVFEGYUJITPGI-IHRRRGAJSA-N Lys-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN ORVFEGYUJITPGI-IHRRRGAJSA-N 0.000 description 1
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 1
- PFZWARWVRNTPBR-IHPCNDPISA-N Lys-Leu-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N PFZWARWVRNTPBR-IHPCNDPISA-N 0.000 description 1
- VUTWYNQUSJWBHO-BZSNNMDCSA-N Lys-Leu-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VUTWYNQUSJWBHO-BZSNNMDCSA-N 0.000 description 1
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 1
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 1
- YDDDRTIPNTWGIG-SRVKXCTJSA-N Lys-Lys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O YDDDRTIPNTWGIG-SRVKXCTJSA-N 0.000 description 1
- PLDJDCJLRCYPJB-VOAKCMCISA-N Lys-Lys-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PLDJDCJLRCYPJB-VOAKCMCISA-N 0.000 description 1
- TYEJPFJNAHIKRT-DCAQKATOSA-N Lys-Met-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N TYEJPFJNAHIKRT-DCAQKATOSA-N 0.000 description 1
- ZCWWVXAXWUAEPZ-SRVKXCTJSA-N Lys-Met-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZCWWVXAXWUAEPZ-SRVKXCTJSA-N 0.000 description 1
- SKUOQDYMJFUMOE-ULQDDVLXSA-N Lys-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N SKUOQDYMJFUMOE-ULQDDVLXSA-N 0.000 description 1
- QCZYYEFXOBKCNQ-STQMWFEESA-N Lys-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QCZYYEFXOBKCNQ-STQMWFEESA-N 0.000 description 1
- XFOAWKDQMRMCDN-ULQDDVLXSA-N Lys-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)CC1=CC=CC=C1 XFOAWKDQMRMCDN-ULQDDVLXSA-N 0.000 description 1
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 description 1
- JPYPRVHMKRFTAT-KKUMJFAQSA-N Lys-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N JPYPRVHMKRFTAT-KKUMJFAQSA-N 0.000 description 1
- LMGNWHDWJDIOPK-DKIMLUQUSA-N Lys-Phe-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LMGNWHDWJDIOPK-DKIMLUQUSA-N 0.000 description 1
- IPTUBUUIFRZMJK-ACRUOGEOSA-N Lys-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 IPTUBUUIFRZMJK-ACRUOGEOSA-N 0.000 description 1
- WLXGMVVHTIUPHE-ULQDDVLXSA-N Lys-Phe-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O WLXGMVVHTIUPHE-ULQDDVLXSA-N 0.000 description 1
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 1
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 1
- MSSABBQOBUZFKZ-IHRRRGAJSA-N Lys-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCCN)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O MSSABBQOBUZFKZ-IHRRRGAJSA-N 0.000 description 1
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 1
- UQJOKDAYFULYIX-AVGNSLFASA-N Lys-Pro-Pro Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 UQJOKDAYFULYIX-AVGNSLFASA-N 0.000 description 1
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 1
- WQDKIVRHTQYJSN-DCAQKATOSA-N Lys-Ser-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WQDKIVRHTQYJSN-DCAQKATOSA-N 0.000 description 1
- MGKFCQFVPKOWOL-CIUDSAMLSA-N Lys-Ser-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N MGKFCQFVPKOWOL-CIUDSAMLSA-N 0.000 description 1
- CTJUSALVKAWFFU-CIUDSAMLSA-N Lys-Ser-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N CTJUSALVKAWFFU-CIUDSAMLSA-N 0.000 description 1
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 1
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 1
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 1
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 1
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 1
- KXYLFJIQDIMURW-IHPCNDPISA-N Lys-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CCCCN)=CNC2=C1 KXYLFJIQDIMURW-IHPCNDPISA-N 0.000 description 1
- SUZVLFWOCKHWET-CQDKDKBSSA-N Lys-Tyr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O SUZVLFWOCKHWET-CQDKDKBSSA-N 0.000 description 1
- RMKJOQSYLQQRFN-KKUMJFAQSA-N Lys-Tyr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O RMKJOQSYLQQRFN-KKUMJFAQSA-N 0.000 description 1
- XATKLFSXFINPSB-JYJNAYRXSA-N Lys-Tyr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O XATKLFSXFINPSB-JYJNAYRXSA-N 0.000 description 1
- MIMXMVDLMDMOJD-BZSNNMDCSA-N Lys-Tyr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O MIMXMVDLMDMOJD-BZSNNMDCSA-N 0.000 description 1
- WINFHLHJTRGLCV-BZSNNMDCSA-N Lys-Tyr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 WINFHLHJTRGLCV-BZSNNMDCSA-N 0.000 description 1
- LMMBAXJRYSXCOQ-ACRUOGEOSA-N Lys-Tyr-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O LMMBAXJRYSXCOQ-ACRUOGEOSA-N 0.000 description 1
- IEIHKHYMBIYQTH-YESZJQIVSA-N Lys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCCN)N)C(=O)O IEIHKHYMBIYQTH-YESZJQIVSA-N 0.000 description 1
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 1
- QFSYGUMEANRNJE-DCAQKATOSA-N Lys-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N QFSYGUMEANRNJE-DCAQKATOSA-N 0.000 description 1
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 1
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 1
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 1
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 1
- 208000002720 Malnutrition Diseases 0.000 description 1
- QAHFGYLFLVGBNW-DCAQKATOSA-N Met-Ala-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN QAHFGYLFLVGBNW-DCAQKATOSA-N 0.000 description 1
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 1
- WDTLNWHPIPCMMP-AVGNSLFASA-N Met-Arg-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O WDTLNWHPIPCMMP-AVGNSLFASA-N 0.000 description 1
- JQECLVNLAZGHRQ-CIUDSAMLSA-N Met-Asp-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O JQECLVNLAZGHRQ-CIUDSAMLSA-N 0.000 description 1
- XMMWDTUFTZMQFD-GMOBBJLQSA-N Met-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCSC XMMWDTUFTZMQFD-GMOBBJLQSA-N 0.000 description 1
- TZLYIHDABYBOCJ-FXQIFTODSA-N Met-Asp-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O TZLYIHDABYBOCJ-FXQIFTODSA-N 0.000 description 1
- RPEPZINUYHUBKG-FXQIFTODSA-N Met-Cys-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O RPEPZINUYHUBKG-FXQIFTODSA-N 0.000 description 1
- HGKJFNCLOHKEHS-FXQIFTODSA-N Met-Cys-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(O)=O HGKJFNCLOHKEHS-FXQIFTODSA-N 0.000 description 1
- OFNCSQNBSWGGNV-DCAQKATOSA-N Met-Cys-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 OFNCSQNBSWGGNV-DCAQKATOSA-N 0.000 description 1
- CEGVMWAVGBRVFS-XGEHTFHBSA-N Met-Cys-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CEGVMWAVGBRVFS-XGEHTFHBSA-N 0.000 description 1
- UYAKZHGIPRCGPF-CIUDSAMLSA-N Met-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N UYAKZHGIPRCGPF-CIUDSAMLSA-N 0.000 description 1
- PQPMMGQTRQFSDA-SRVKXCTJSA-N Met-Glu-His Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O PQPMMGQTRQFSDA-SRVKXCTJSA-N 0.000 description 1
- SJDQOYTYNGZZJX-SRVKXCTJSA-N Met-Glu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SJDQOYTYNGZZJX-SRVKXCTJSA-N 0.000 description 1
- JPCHYAUKOUGOIB-HJGDQZAQSA-N Met-Glu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPCHYAUKOUGOIB-HJGDQZAQSA-N 0.000 description 1
- OOSPRDCGTLQLBP-NHCYSSNCSA-N Met-Glu-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OOSPRDCGTLQLBP-NHCYSSNCSA-N 0.000 description 1
- MYAPQOBHGWJZOM-UWVGGRQHSA-N Met-Gly-Leu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C MYAPQOBHGWJZOM-UWVGGRQHSA-N 0.000 description 1
- LQMHZERGCQJKAH-STQMWFEESA-N Met-Gly-Phe Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LQMHZERGCQJKAH-STQMWFEESA-N 0.000 description 1
- MXEASDMFHUKOGE-ULQDDVLXSA-N Met-His-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N MXEASDMFHUKOGE-ULQDDVLXSA-N 0.000 description 1
- NLHSFJQUHGCWSD-PYJNHQTQSA-N Met-Ile-His Chemical compound N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O NLHSFJQUHGCWSD-PYJNHQTQSA-N 0.000 description 1
- MVMNUCOHQGYYKB-PEDHHIEDSA-N Met-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCSC)N MVMNUCOHQGYYKB-PEDHHIEDSA-N 0.000 description 1
- RBGLBUDVQVPTEG-DCAQKATOSA-N Met-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCSC)N RBGLBUDVQVPTEG-DCAQKATOSA-N 0.000 description 1
- AXHNAGAYRGCDLG-UWVGGRQHSA-N Met-Lys-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AXHNAGAYRGCDLG-UWVGGRQHSA-N 0.000 description 1
- OIFHHODAXVWKJN-ULQDDVLXSA-N Met-Phe-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 OIFHHODAXVWKJN-ULQDDVLXSA-N 0.000 description 1
- HUURTRNKPBHHKZ-JYJNAYRXSA-N Met-Phe-Val Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 HUURTRNKPBHHKZ-JYJNAYRXSA-N 0.000 description 1
- VSJAPSMRFYUOKS-IUCAKERBSA-N Met-Pro-Gly Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O VSJAPSMRFYUOKS-IUCAKERBSA-N 0.000 description 1
- LUYURUYVNYGKGM-RCWTZXSCSA-N Met-Pro-Thr Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUYURUYVNYGKGM-RCWTZXSCSA-N 0.000 description 1
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 1
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 1
- FXBKQTOGURNXSL-HJGDQZAQSA-N Met-Thr-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O FXBKQTOGURNXSL-HJGDQZAQSA-N 0.000 description 1
- CIIJWIAORKTXAH-FJXKBIBVSA-N Met-Thr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O CIIJWIAORKTXAH-FJXKBIBVSA-N 0.000 description 1
- NDJSSFWDYDUQID-YTWAJWBKSA-N Met-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N)O NDJSSFWDYDUQID-YTWAJWBKSA-N 0.000 description 1
- 102000005431 Molecular Chaperones Human genes 0.000 description 1
- 101100217138 Mus musculus Actr10 gene Proteins 0.000 description 1
- 101000777470 Mus musculus C-C motif chemokine 4 Proteins 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 1
- 108091005461 Nucleic proteins Proteins 0.000 description 1
- 101100036899 Oryza sativa subsp. japonica Ub-CEP52-1 gene Proteins 0.000 description 1
- 101710091688 Patatin Proteins 0.000 description 1
- 108700011203 Phaseolus vulgaris phaseolin Proteins 0.000 description 1
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 1
- FPTXMUIBLMGTQH-ONGXEEELSA-N Phe-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 FPTXMUIBLMGTQH-ONGXEEELSA-N 0.000 description 1
- UHRNIXJAGGLKHP-DLOVCJGASA-N Phe-Ala-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O UHRNIXJAGGLKHP-DLOVCJGASA-N 0.000 description 1
- AYPMIIKUMNADSU-IHRRRGAJSA-N Phe-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AYPMIIKUMNADSU-IHRRRGAJSA-N 0.000 description 1
- ZWJKVFAYPLPCQB-UNQGMJICSA-N Phe-Arg-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O ZWJKVFAYPLPCQB-UNQGMJICSA-N 0.000 description 1
- LXVFHIBXOWJTKZ-BZSNNMDCSA-N Phe-Asn-Tyr Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O LXVFHIBXOWJTKZ-BZSNNMDCSA-N 0.000 description 1
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 1
- MQVFHOPCKNTHGT-MELADBBJSA-N Phe-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O MQVFHOPCKNTHGT-MELADBBJSA-N 0.000 description 1
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 description 1
- OMHMIXFFRPMYHB-SRVKXCTJSA-N Phe-Cys-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OMHMIXFFRPMYHB-SRVKXCTJSA-N 0.000 description 1
- ALHULIGNEXGFRM-QWRGUYRKSA-N Phe-Cys-Gly Chemical compound OC(=O)CNC(=O)[C@H](CS)NC(=O)[C@@H](N)CC1=CC=CC=C1 ALHULIGNEXGFRM-QWRGUYRKSA-N 0.000 description 1
- QEPZQAPZKIPVDV-KKUMJFAQSA-N Phe-Cys-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N QEPZQAPZKIPVDV-KKUMJFAQSA-N 0.000 description 1
- HNURHHFOINNTPL-IHPCNDPISA-N Phe-Cys-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N HNURHHFOINNTPL-IHPCNDPISA-N 0.000 description 1
- SXJGROGVINAYSH-AVGNSLFASA-N Phe-Gln-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SXJGROGVINAYSH-AVGNSLFASA-N 0.000 description 1
- AKJAKCBHLJGRBU-JYJNAYRXSA-N Phe-Glu-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N AKJAKCBHLJGRBU-JYJNAYRXSA-N 0.000 description 1
- RFEXGCASCQGGHZ-STQMWFEESA-N Phe-Gly-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O RFEXGCASCQGGHZ-STQMWFEESA-N 0.000 description 1
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 1
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 1
- NPLGQVKZFGJWAI-QWHCGFSZSA-N Phe-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O NPLGQVKZFGJWAI-QWHCGFSZSA-N 0.000 description 1
- BIYWZVCPZIFGPY-QWRGUYRKSA-N Phe-Gly-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BIYWZVCPZIFGPY-QWRGUYRKSA-N 0.000 description 1
- WFHRXJOZEXUKLV-IRXDYDNUSA-N Phe-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 WFHRXJOZEXUKLV-IRXDYDNUSA-N 0.000 description 1
- MYQCCQSMKNCNKY-KKUMJFAQSA-N Phe-His-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CO)C(=O)O)N MYQCCQSMKNCNKY-KKUMJFAQSA-N 0.000 description 1
- SPXWRYVHOZVYBU-ULQDDVLXSA-N Phe-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N SPXWRYVHOZVYBU-ULQDDVLXSA-N 0.000 description 1
- BWTKUQPNOMMKMA-FIRPJDEBSA-N Phe-Ile-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BWTKUQPNOMMKMA-FIRPJDEBSA-N 0.000 description 1
- KBVJZCVLQWCJQN-KKUMJFAQSA-N Phe-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KBVJZCVLQWCJQN-KKUMJFAQSA-N 0.000 description 1
- METZZBCMDXHFMK-BZSNNMDCSA-N Phe-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N METZZBCMDXHFMK-BZSNNMDCSA-N 0.000 description 1
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 1
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 1
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 description 1
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 1
- XZQYIJALMGEUJD-OEAJRASXSA-N Phe-Lys-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZQYIJALMGEUJD-OEAJRASXSA-N 0.000 description 1
- IEOHQGFKHXUALJ-JYJNAYRXSA-N Phe-Met-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IEOHQGFKHXUALJ-JYJNAYRXSA-N 0.000 description 1
- RTUWVJVJSMOGPL-KKUMJFAQSA-N Phe-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RTUWVJVJSMOGPL-KKUMJFAQSA-N 0.000 description 1
- JKJSIYKSGIDHPM-WBAXXEDZSA-N Phe-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O JKJSIYKSGIDHPM-WBAXXEDZSA-N 0.000 description 1
- RYQWALWYQWBUKN-FHWLQOOXSA-N Phe-Phe-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RYQWALWYQWBUKN-FHWLQOOXSA-N 0.000 description 1
- IWZRODDWOSIXPZ-IRXDYDNUSA-N Phe-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 IWZRODDWOSIXPZ-IRXDYDNUSA-N 0.000 description 1
- RBRNEFJTEHPDSL-ACRUOGEOSA-N Phe-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 RBRNEFJTEHPDSL-ACRUOGEOSA-N 0.000 description 1
- GRVMHFCZUIYNKQ-UFYCRDLUSA-N Phe-Phe-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GRVMHFCZUIYNKQ-UFYCRDLUSA-N 0.000 description 1
- RVEVENLSADZUMS-IHRRRGAJSA-N Phe-Pro-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RVEVENLSADZUMS-IHRRRGAJSA-N 0.000 description 1
- XOHJOMKCRLHGCY-UNQGMJICSA-N Phe-Pro-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOHJOMKCRLHGCY-UNQGMJICSA-N 0.000 description 1
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 1
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 1
- GLJZDMZJHFXJQG-BZSNNMDCSA-N Phe-Ser-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLJZDMZJHFXJQG-BZSNNMDCSA-N 0.000 description 1
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 1
- XNQMZHLAYFWSGJ-HTUGSXCWSA-N Phe-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XNQMZHLAYFWSGJ-HTUGSXCWSA-N 0.000 description 1
- BPIMVBKDLSBKIJ-FCLVOEFKSA-N Phe-Thr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BPIMVBKDLSBKIJ-FCLVOEFKSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- VGTJSEYTVMAASM-RPTUDFQQSA-N Phe-Thr-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VGTJSEYTVMAASM-RPTUDFQQSA-N 0.000 description 1
- KCIKTPHTEYBXMG-BVSLBCMMSA-N Phe-Trp-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCIKTPHTEYBXMG-BVSLBCMMSA-N 0.000 description 1
- QTDBZORPVYTRJU-KKXDTOCCSA-N Phe-Tyr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O QTDBZORPVYTRJU-KKXDTOCCSA-N 0.000 description 1
- VFDRDMOMHBJGKD-UFYCRDLUSA-N Phe-Tyr-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N VFDRDMOMHBJGKD-UFYCRDLUSA-N 0.000 description 1
- FRMKIPSIZSFTTE-HJOGWXRNSA-N Phe-Tyr-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FRMKIPSIZSFTTE-HJOGWXRNSA-N 0.000 description 1
- YUPRIZTWANWWHK-DZKIICNBSA-N Phe-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N YUPRIZTWANWWHK-DZKIICNBSA-N 0.000 description 1
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 1
- APZNYJFGVAGFCF-JYJNAYRXSA-N Phe-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccccc1)C(C)C)C(O)=O APZNYJFGVAGFCF-JYJNAYRXSA-N 0.000 description 1
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 1
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 1
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 1
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 1
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 1
- LNLNHXIQPGKRJQ-SRVKXCTJSA-N Pro-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 LNLNHXIQPGKRJQ-SRVKXCTJSA-N 0.000 description 1
- CYQQWUPHIZVCNY-GUBZILKMSA-N Pro-Arg-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CYQQWUPHIZVCNY-GUBZILKMSA-N 0.000 description 1
- MTHRMUXESFIAMS-DCAQKATOSA-N Pro-Asn-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O MTHRMUXESFIAMS-DCAQKATOSA-N 0.000 description 1
- MLQVJYMFASXBGZ-IHRRRGAJSA-N Pro-Asn-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O MLQVJYMFASXBGZ-IHRRRGAJSA-N 0.000 description 1
- UTAUEDINXUMHLG-FXQIFTODSA-N Pro-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 UTAUEDINXUMHLG-FXQIFTODSA-N 0.000 description 1
- WPQKSRHDTMRSJM-CIUDSAMLSA-N Pro-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 WPQKSRHDTMRSJM-CIUDSAMLSA-N 0.000 description 1
- VJLJGKQAOQJXJG-CIUDSAMLSA-N Pro-Asp-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJLJGKQAOQJXJG-CIUDSAMLSA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 1
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 1
- SFECXGVELZFBFJ-VEVYYDQMSA-N Pro-Asp-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFECXGVELZFBFJ-VEVYYDQMSA-N 0.000 description 1
- XUSDDSLCRPUKLP-QXEWZRGKSA-N Pro-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 XUSDDSLCRPUKLP-QXEWZRGKSA-N 0.000 description 1
- AIZVVCMAFRREQS-GUBZILKMSA-N Pro-Cys-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AIZVVCMAFRREQS-GUBZILKMSA-N 0.000 description 1
- TUYWCHPXKQTISF-LPEHRKFASA-N Pro-Cys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N2CCC[C@@H]2C(=O)O TUYWCHPXKQTISF-LPEHRKFASA-N 0.000 description 1
- HQVPQXMCQKXARZ-FXQIFTODSA-N Pro-Cys-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O HQVPQXMCQKXARZ-FXQIFTODSA-N 0.000 description 1
- LSIWVWRUTKPXDS-DCAQKATOSA-N Pro-Gln-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LSIWVWRUTKPXDS-DCAQKATOSA-N 0.000 description 1
- UPJGUQPLYWTISV-GUBZILKMSA-N Pro-Gln-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UPJGUQPLYWTISV-GUBZILKMSA-N 0.000 description 1
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- QCARZLHECSFOGG-CIUDSAMLSA-N Pro-Glu-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O QCARZLHECSFOGG-CIUDSAMLSA-N 0.000 description 1
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 1
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 1
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 1
- ZTVCLZLGHZXLOT-ULQDDVLXSA-N Pro-Glu-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O ZTVCLZLGHZXLOT-ULQDDVLXSA-N 0.000 description 1
- WSRWHZRUOCACLJ-UWVGGRQHSA-N Pro-Gly-His Chemical compound C([C@@H](C(=O)O)NC(=O)CNC(=O)[C@H]1NCCC1)C1=CN=CN1 WSRWHZRUOCACLJ-UWVGGRQHSA-N 0.000 description 1
- FEVDNIBDCRKMER-IUCAKERBSA-N Pro-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEVDNIBDCRKMER-IUCAKERBSA-N 0.000 description 1
- QEWBZBLXDKIQPS-STQMWFEESA-N Pro-Gly-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QEWBZBLXDKIQPS-STQMWFEESA-N 0.000 description 1
- JUJGNDZIKKQMDJ-IHRRRGAJSA-N Pro-His-His Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O JUJGNDZIKKQMDJ-IHRRRGAJSA-N 0.000 description 1
- XQHGISDMVBTGAL-ULQDDVLXSA-N Pro-His-Phe Chemical compound C([C@@H](C(=O)[O-])NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H]1[NH2+]CCC1)C1=CC=CC=C1 XQHGISDMVBTGAL-ULQDDVLXSA-N 0.000 description 1
- SOACYAXADBWDDT-CYDGBPFRSA-N Pro-Ile-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SOACYAXADBWDDT-CYDGBPFRSA-N 0.000 description 1
- LNOWDSPAYBWJOR-PEDHHIEDSA-N Pro-Ile-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LNOWDSPAYBWJOR-PEDHHIEDSA-N 0.000 description 1
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 1
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 1
- VWHJZETTZDAGOM-XUXIUFHCSA-N Pro-Lys-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VWHJZETTZDAGOM-XUXIUFHCSA-N 0.000 description 1
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 1
- RPLMFKUKFZOTER-AVGNSLFASA-N Pro-Met-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 RPLMFKUKFZOTER-AVGNSLFASA-N 0.000 description 1
- ZUZINZIJHJFJRN-UBHSHLNASA-N Pro-Phe-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 ZUZINZIJHJFJRN-UBHSHLNASA-N 0.000 description 1
- WHNJMTHJGCEKGA-ULQDDVLXSA-N Pro-Phe-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WHNJMTHJGCEKGA-ULQDDVLXSA-N 0.000 description 1
- XYAFCOJKICBRDU-JYJNAYRXSA-N Pro-Phe-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O XYAFCOJKICBRDU-JYJNAYRXSA-N 0.000 description 1
- JLMZKEQFMVORMA-SRVKXCTJSA-N Pro-Pro-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 JLMZKEQFMVORMA-SRVKXCTJSA-N 0.000 description 1
- FYKUEXMZYFIZKA-DCAQKATOSA-N Pro-Pro-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FYKUEXMZYFIZKA-DCAQKATOSA-N 0.000 description 1
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 1
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 1
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- HRIXMVRZRGFKNQ-HJGDQZAQSA-N Pro-Thr-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HRIXMVRZRGFKNQ-HJGDQZAQSA-N 0.000 description 1
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 1
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 1
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 1
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 1
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 1
- STGVYUTZKGPRCI-GUBZILKMSA-N Pro-Val-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 STGVYUTZKGPRCI-GUBZILKMSA-N 0.000 description 1
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 1
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 1
- 108030004630 Procollagen galactosyltransferases Proteins 0.000 description 1
- 108030004602 Procollagen glucosyltransferases Proteins 0.000 description 1
- 229940124158 Protease/peptidase inhibitor Drugs 0.000 description 1
- 108010029485 Protein Isoforms Proteins 0.000 description 1
- 102000001708 Protein Isoforms Human genes 0.000 description 1
- 101000933967 Pseudomonas phage KPP25 Major capsid protein Proteins 0.000 description 1
- 241001112090 Pseudovirus Species 0.000 description 1
- 108010003201 RGH 0205 Proteins 0.000 description 1
- 108010025216 RVF peptide Proteins 0.000 description 1
- 108020005091 Replication Origin Proteins 0.000 description 1
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 1
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 1
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 1
- QWZIOCFPXMAXET-CIUDSAMLSA-N Ser-Arg-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QWZIOCFPXMAXET-CIUDSAMLSA-N 0.000 description 1
- VQBLHWSPVYYZTB-DCAQKATOSA-N Ser-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N VQBLHWSPVYYZTB-DCAQKATOSA-N 0.000 description 1
- QGMLKFGTGXWAHF-IHRRRGAJSA-N Ser-Arg-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QGMLKFGTGXWAHF-IHRRRGAJSA-N 0.000 description 1
- WXUBSIDKNMFAGS-IHRRRGAJSA-N Ser-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXUBSIDKNMFAGS-IHRRRGAJSA-N 0.000 description 1
- UBRXAVQWXOWRSJ-ZLUOBGJFSA-N Ser-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)C(=O)N UBRXAVQWXOWRSJ-ZLUOBGJFSA-N 0.000 description 1
- ZXLUWXWISXIFIX-ACZMJKKPSA-N Ser-Asn-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZXLUWXWISXIFIX-ACZMJKKPSA-N 0.000 description 1
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 1
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 1
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 1
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 1
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 1
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 1
- NJSPTZXVPZDRCU-UBHSHLNASA-N Ser-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N NJSPTZXVPZDRCU-UBHSHLNASA-N 0.000 description 1
- CDVFZMOFNJPUDD-ACZMJKKPSA-N Ser-Gln-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CDVFZMOFNJPUDD-ACZMJKKPSA-N 0.000 description 1
- IXUGADGDCQDLSA-FXQIFTODSA-N Ser-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N IXUGADGDCQDLSA-FXQIFTODSA-N 0.000 description 1
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 1
- YQQKYAZABFEYAF-FXQIFTODSA-N Ser-Glu-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQQKYAZABFEYAF-FXQIFTODSA-N 0.000 description 1
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 1
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 1
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 1
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 1
- UAJAYRMZGNQILN-BQBZGAKWSA-N Ser-Gly-Met Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UAJAYRMZGNQILN-BQBZGAKWSA-N 0.000 description 1
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 1
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 1
- FYUIFUJFNCLUIX-XVYDVKMFSA-N Ser-His-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O FYUIFUJFNCLUIX-XVYDVKMFSA-N 0.000 description 1
- LOKXAXAESFYFAX-CIUDSAMLSA-N Ser-His-Cys Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CS)C(O)=O)CC1=CN=CN1 LOKXAXAESFYFAX-CIUDSAMLSA-N 0.000 description 1
- JEHPKECJCALLRW-CUJWVEQBSA-N Ser-His-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEHPKECJCALLRW-CUJWVEQBSA-N 0.000 description 1
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 1
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 1
- YMDNFPNTIPQMJP-NAKRPEOUSA-N Ser-Ile-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O YMDNFPNTIPQMJP-NAKRPEOUSA-N 0.000 description 1
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 1
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 1
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 1
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 1
- HDBOEVPDIDDEPC-CIUDSAMLSA-N Ser-Lys-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O HDBOEVPDIDDEPC-CIUDSAMLSA-N 0.000 description 1
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 1
- JLPMFVAIQHCBDC-CIUDSAMLSA-N Ser-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N JLPMFVAIQHCBDC-CIUDSAMLSA-N 0.000 description 1
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 1
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 1
- IFLVBVIYADZIQO-DCAQKATOSA-N Ser-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N IFLVBVIYADZIQO-DCAQKATOSA-N 0.000 description 1
- VIIJCAQMJBHSJH-FXQIFTODSA-N Ser-Met-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O VIIJCAQMJBHSJH-FXQIFTODSA-N 0.000 description 1
- RXSWQCATLWVDLI-XGEHTFHBSA-N Ser-Met-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RXSWQCATLWVDLI-XGEHTFHBSA-N 0.000 description 1
- ASGYVPAVFNDZMA-GUBZILKMSA-N Ser-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)N ASGYVPAVFNDZMA-GUBZILKMSA-N 0.000 description 1
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 1
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 1
- XVWDJUROVRQKAE-KKUMJFAQSA-N Ser-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=CC=C1 XVWDJUROVRQKAE-KKUMJFAQSA-N 0.000 description 1
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 description 1
- XQAPEISNMXNKGE-FXQIFTODSA-N Ser-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CS)C(=O)O XQAPEISNMXNKGE-FXQIFTODSA-N 0.000 description 1
- NMZXJDSKEGFDLJ-DCAQKATOSA-N Ser-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CCCCN)C(=O)O NMZXJDSKEGFDLJ-DCAQKATOSA-N 0.000 description 1
- DINQYZRMXGWWTG-GUBZILKMSA-N Ser-Pro-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DINQYZRMXGWWTG-GUBZILKMSA-N 0.000 description 1
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 1
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 1
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 1
- AABIBDJHSKIMJK-FXQIFTODSA-N Ser-Ser-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O AABIBDJHSKIMJK-FXQIFTODSA-N 0.000 description 1
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 1
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 1
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 1
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 1
- BCAVNDNYOGTQMQ-AAEUAGOBSA-N Ser-Trp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O BCAVNDNYOGTQMQ-AAEUAGOBSA-N 0.000 description 1
- YXEYTHXDRDAIOJ-CWRNSKLLSA-N Ser-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CO)N)C(=O)O YXEYTHXDRDAIOJ-CWRNSKLLSA-N 0.000 description 1
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 1
- HAYADTTXNZFUDM-IHRRRGAJSA-N Ser-Tyr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HAYADTTXNZFUDM-IHRRRGAJSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- LLSLRQOEAFCZLW-NRPADANISA-N Ser-Val-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LLSLRQOEAFCZLW-NRPADANISA-N 0.000 description 1
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 1
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 1
- DFTCYYILCSQGIZ-GCJQMDKQSA-N Thr-Ala-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFTCYYILCSQGIZ-GCJQMDKQSA-N 0.000 description 1
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 1
- JMZKMSTYXHFYAK-VEVYYDQMSA-N Thr-Arg-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O JMZKMSTYXHFYAK-VEVYYDQMSA-N 0.000 description 1
- JMQUAZXYFAEOIH-XGEHTFHBSA-N Thr-Arg-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)O JMQUAZXYFAEOIH-XGEHTFHBSA-N 0.000 description 1
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 1
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 1
- JNQZPAWOPBZGIX-RCWTZXSCSA-N Thr-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N JNQZPAWOPBZGIX-RCWTZXSCSA-N 0.000 description 1
- QGXCWPNQVCYJEL-NUMRIWBASA-N Thr-Asn-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGXCWPNQVCYJEL-NUMRIWBASA-N 0.000 description 1
- JXKMXEBNZCKSDY-JIOCBJNQSA-N Thr-Asp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O JXKMXEBNZCKSDY-JIOCBJNQSA-N 0.000 description 1
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 1
- NRUPKQSXTJNQGD-XGEHTFHBSA-N Thr-Cys-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NRUPKQSXTJNQGD-XGEHTFHBSA-N 0.000 description 1
- LYGKYFKSZTUXGZ-ZDLURKLDSA-N Thr-Cys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)NCC(O)=O LYGKYFKSZTUXGZ-ZDLURKLDSA-N 0.000 description 1
- ZLNWJMRLHLGKFX-SVSWQMSJSA-N Thr-Cys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZLNWJMRLHLGKFX-SVSWQMSJSA-N 0.000 description 1
- MMTOHPRBJKEZHT-BWBBJGPYSA-N Thr-Cys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O MMTOHPRBJKEZHT-BWBBJGPYSA-N 0.000 description 1
- DSLHSTIUAPKERR-XGEHTFHBSA-N Thr-Cys-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O DSLHSTIUAPKERR-XGEHTFHBSA-N 0.000 description 1
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 1
- DIPIPFHFLPTCLK-LOKLDPHHSA-N Thr-Gln-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O DIPIPFHFLPTCLK-LOKLDPHHSA-N 0.000 description 1
- RCEHMXVEMNXRIW-IRIUXVKKSA-N Thr-Gln-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N)O RCEHMXVEMNXRIW-IRIUXVKKSA-N 0.000 description 1
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 1
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 1
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 1
- ZTPXSEUVYNNZRB-CDMKHQONSA-N Thr-Gly-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZTPXSEUVYNNZRB-CDMKHQONSA-N 0.000 description 1
- CRZNCABIJLRFKZ-IUKAMOBKSA-N Thr-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N CRZNCABIJLRFKZ-IUKAMOBKSA-N 0.000 description 1
- ZBKDBZUTTXINIX-RWRJDSDZSA-N Thr-Ile-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZBKDBZUTTXINIX-RWRJDSDZSA-N 0.000 description 1
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 1
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 1
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- XIULAFZYEKSGAJ-IXOXFDKPSA-N Thr-Leu-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 XIULAFZYEKSGAJ-IXOXFDKPSA-N 0.000 description 1
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 1
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 1
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 1
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 1
- QFCQNHITJPRQTB-IEGACIPQSA-N Thr-Lys-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O QFCQNHITJPRQTB-IEGACIPQSA-N 0.000 description 1
- KKPOGALELPLJTL-MEYUZBJRSA-N Thr-Lys-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KKPOGALELPLJTL-MEYUZBJRSA-N 0.000 description 1
- MCDVZTRGHNXTGK-HJGDQZAQSA-N Thr-Met-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O MCDVZTRGHNXTGK-HJGDQZAQSA-N 0.000 description 1
- NZRUWPIYECBYRK-HTUGSXCWSA-N Thr-Phe-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O NZRUWPIYECBYRK-HTUGSXCWSA-N 0.000 description 1
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 1
- HSQXHRIRJSFDOH-URLPEUOOSA-N Thr-Phe-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HSQXHRIRJSFDOH-URLPEUOOSA-N 0.000 description 1
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 1
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 1
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 1
- GFRIEEKFXOVPIR-RHYQMDGZSA-N Thr-Pro-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O GFRIEEKFXOVPIR-RHYQMDGZSA-N 0.000 description 1
- PRTHQBSMXILLPC-XGEHTFHBSA-N Thr-Ser-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PRTHQBSMXILLPC-XGEHTFHBSA-N 0.000 description 1
- BCYUHPXBHCUYBA-CUJWVEQBSA-N Thr-Ser-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BCYUHPXBHCUYBA-CUJWVEQBSA-N 0.000 description 1
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 1
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 1
- QYDKSNXSBXZPFK-ZJDVBMNYSA-N Thr-Thr-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYDKSNXSBXZPFK-ZJDVBMNYSA-N 0.000 description 1
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 1
- CSNBWOJOEOPYIJ-UVOCVTCTSA-N Thr-Thr-Lys Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O CSNBWOJOEOPYIJ-UVOCVTCTSA-N 0.000 description 1
- NJGMALCNYAMYCB-JRQIVUDYSA-N Thr-Tyr-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJGMALCNYAMYCB-JRQIVUDYSA-N 0.000 description 1
- ABCLYRRGTZNIFU-BWAGICSOSA-N Thr-Tyr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O ABCLYRRGTZNIFU-BWAGICSOSA-N 0.000 description 1
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 1
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 1
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 1
- SPIFGZFZMVLPHN-UNQGMJICSA-N Thr-Val-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SPIFGZFZMVLPHN-UNQGMJICSA-N 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- HOJPPPKZWFRTHJ-PJODQICGSA-N Trp-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N HOJPPPKZWFRTHJ-PJODQICGSA-N 0.000 description 1
- NXAPHBHZCMQORW-FDARSICLSA-N Trp-Arg-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NXAPHBHZCMQORW-FDARSICLSA-N 0.000 description 1
- GTNCSPKYWCJZAC-XIRDDKMYSA-N Trp-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N GTNCSPKYWCJZAC-XIRDDKMYSA-N 0.000 description 1
- WQYPAGQDXAJNED-AAEUAGOBSA-N Trp-Cys-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N WQYPAGQDXAJNED-AAEUAGOBSA-N 0.000 description 1
- LGEPIBQBGZTBHL-SXNHZJKMSA-N Trp-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N LGEPIBQBGZTBHL-SXNHZJKMSA-N 0.000 description 1
- MDDYTWOFHZFABW-SZMVWBNQSA-N Trp-Gln-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 MDDYTWOFHZFABW-SZMVWBNQSA-N 0.000 description 1
- FEZASNVQLJQBHW-CABZTGNLSA-N Trp-Gly-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O)=CNC2=C1 FEZASNVQLJQBHW-CABZTGNLSA-N 0.000 description 1
- WLBZWXXGSOLJBA-HOCLYGCPSA-N Trp-Gly-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 WLBZWXXGSOLJBA-HOCLYGCPSA-N 0.000 description 1
- PGPCENKYTLDIFM-SZMVWBNQSA-N Trp-His-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O PGPCENKYTLDIFM-SZMVWBNQSA-N 0.000 description 1
- UJRIVCPPPMYCNA-HOCLYGCPSA-N Trp-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N UJRIVCPPPMYCNA-HOCLYGCPSA-N 0.000 description 1
- GWBWCGITOYODER-YTQUADARSA-N Trp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GWBWCGITOYODER-YTQUADARSA-N 0.000 description 1
- VDUJEEQMRQCLHB-YTQUADARSA-N Trp-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O VDUJEEQMRQCLHB-YTQUADARSA-N 0.000 description 1
- ULHASJWZGUEUNN-XIRDDKMYSA-N Trp-Lys-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O ULHASJWZGUEUNN-XIRDDKMYSA-N 0.000 description 1
- KWTRGSQOQHZKIA-PMVMPFDFSA-N Trp-Lys-Tyr Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)CCCCN)C(O)=O)C1=CC=C(O)C=C1 KWTRGSQOQHZKIA-PMVMPFDFSA-N 0.000 description 1
- XDQGKIMTRSVSBC-WDSOQIARSA-N Trp-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CNC2=CC=CC=C12 XDQGKIMTRSVSBC-WDSOQIARSA-N 0.000 description 1
- ADMHZNPMMVKGJW-BPUTZDHNSA-N Trp-Ser-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N ADMHZNPMMVKGJW-BPUTZDHNSA-N 0.000 description 1
- FHHYVSCGOMPLLO-IHPCNDPISA-N Trp-Tyr-Asp Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 FHHYVSCGOMPLLO-IHPCNDPISA-N 0.000 description 1
- ZPZNQAZHMCLTOA-PXDAIIFMSA-N Trp-Tyr-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=C(O)C=C1 ZPZNQAZHMCLTOA-PXDAIIFMSA-N 0.000 description 1
- LNGFWVPNKLWATF-ZVZYQTTQSA-N Trp-Val-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LNGFWVPNKLWATF-ZVZYQTTQSA-N 0.000 description 1
- RWTFCAMQLFNPTK-UMPQAUOISA-N Trp-Val-Thr Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)=CNC2=C1 RWTFCAMQLFNPTK-UMPQAUOISA-N 0.000 description 1
- OOEUVMFKKZYSRX-LEWSCRJBSA-N Tyr-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OOEUVMFKKZYSRX-LEWSCRJBSA-N 0.000 description 1
- IIJWXEUNETVJPV-IHRRRGAJSA-N Tyr-Arg-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N)O IIJWXEUNETVJPV-IHRRRGAJSA-N 0.000 description 1
- GFHYISDTIWZUSU-QWRGUYRKSA-N Tyr-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GFHYISDTIWZUSU-QWRGUYRKSA-N 0.000 description 1
- MTEQZJFSEMXXRK-CFMVVWHZSA-N Tyr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N MTEQZJFSEMXXRK-CFMVVWHZSA-N 0.000 description 1
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 1
- AYPAIRCDLARHLM-KKUMJFAQSA-N Tyr-Asn-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O AYPAIRCDLARHLM-KKUMJFAQSA-N 0.000 description 1
- TZXFLDNBYYGLKA-BZSNNMDCSA-N Tyr-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 TZXFLDNBYYGLKA-BZSNNMDCSA-N 0.000 description 1
- XKDOQXAXKFQWQJ-SRVKXCTJSA-N Tyr-Cys-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O XKDOQXAXKFQWQJ-SRVKXCTJSA-N 0.000 description 1
- KEHKBBUYZWAMHL-DZKIICNBSA-N Tyr-Gln-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O KEHKBBUYZWAMHL-DZKIICNBSA-N 0.000 description 1
- IWRMTNJCCMEBEX-AVGNSLFASA-N Tyr-Glu-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O IWRMTNJCCMEBEX-AVGNSLFASA-N 0.000 description 1
- SLCSPPCQWUHPPO-JYJNAYRXSA-N Tyr-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SLCSPPCQWUHPPO-JYJNAYRXSA-N 0.000 description 1
- LHTGRUZSZOIAKM-SOUVJXGZSA-N Tyr-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O LHTGRUZSZOIAKM-SOUVJXGZSA-N 0.000 description 1
- UNUZEBFXGWVAOP-DZKIICNBSA-N Tyr-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UNUZEBFXGWVAOP-DZKIICNBSA-N 0.000 description 1
- HIINQLBHPIQYHN-JTQLQIEISA-N Tyr-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HIINQLBHPIQYHN-JTQLQIEISA-N 0.000 description 1
- IJUTXXAXQODRMW-KBPBESRZSA-N Tyr-Gly-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O IJUTXXAXQODRMW-KBPBESRZSA-N 0.000 description 1
- FIRUOPRJKCBLST-KKUMJFAQSA-N Tyr-His-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O FIRUOPRJKCBLST-KKUMJFAQSA-N 0.000 description 1
- OHNXAUCZVWGTLL-KKUMJFAQSA-N Tyr-His-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CS)C(=O)O)N)O OHNXAUCZVWGTLL-KKUMJFAQSA-N 0.000 description 1
- DZKFGCNKEVMXFA-JUKXBJQTSA-N Tyr-Ile-His Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O DZKFGCNKEVMXFA-JUKXBJQTSA-N 0.000 description 1
- HFJJDMOFTCQGEI-STECZYCISA-N Tyr-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N HFJJDMOFTCQGEI-STECZYCISA-N 0.000 description 1
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 1
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 1
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 1
- GYKDRHDMGQUZPU-MGHWNKPDSA-N Tyr-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GYKDRHDMGQUZPU-MGHWNKPDSA-N 0.000 description 1
- CNNVVEPJTFOGHI-ACRUOGEOSA-N Tyr-Lys-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNNVVEPJTFOGHI-ACRUOGEOSA-N 0.000 description 1
- YSGAPESOXHFTQY-IHRRRGAJSA-N Tyr-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N YSGAPESOXHFTQY-IHRRRGAJSA-N 0.000 description 1
- QKXAEWMHAAVVGS-KKUMJFAQSA-N Tyr-Pro-Glu Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O QKXAEWMHAAVVGS-KKUMJFAQSA-N 0.000 description 1
- SZEIFUXUTBBQFQ-STQMWFEESA-N Tyr-Pro-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SZEIFUXUTBBQFQ-STQMWFEESA-N 0.000 description 1
- VXFXIBCCVLJCJT-JYJNAYRXSA-N Tyr-Pro-Pro Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N1CCC[C@H]1C(O)=O VXFXIBCCVLJCJT-JYJNAYRXSA-N 0.000 description 1
- XGZBEGGGAUQBMB-KJEVXHAQSA-N Tyr-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CC=C(C=C2)O)N)O XGZBEGGGAUQBMB-KJEVXHAQSA-N 0.000 description 1
- VYQQQIRHIFALGE-UWJYBYFXSA-N Tyr-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VYQQQIRHIFALGE-UWJYBYFXSA-N 0.000 description 1
- XYBNMHRFAUKPAW-IHRRRGAJSA-N Tyr-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XYBNMHRFAUKPAW-IHRRRGAJSA-N 0.000 description 1
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 1
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 1
- JHDZONWZTCKTJR-KJEVXHAQSA-N Tyr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JHDZONWZTCKTJR-KJEVXHAQSA-N 0.000 description 1
- ABZWHLRQBSBPTO-RNXOBYDBSA-N Tyr-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC4=CC=C(C=C4)O)N ABZWHLRQBSBPTO-RNXOBYDBSA-N 0.000 description 1
- GPLTZEMVOCZVAV-UFYCRDLUSA-N Tyr-Tyr-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 GPLTZEMVOCZVAV-UFYCRDLUSA-N 0.000 description 1
- PQPWEALFTLKSEB-DZKIICNBSA-N Tyr-Val-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PQPWEALFTLKSEB-DZKIICNBSA-N 0.000 description 1
- DJIJBQYBDKGDIS-JYJNAYRXSA-N Tyr-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O DJIJBQYBDKGDIS-JYJNAYRXSA-N 0.000 description 1
- 108010064997 VPY tripeptide Proteins 0.000 description 1
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 1
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 1
- REJBPZVUHYNMEN-LSJOCFKGSA-N Val-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N REJBPZVUHYNMEN-LSJOCFKGSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 1
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 1
- JIODCDXKCJRMEH-NHCYSSNCSA-N Val-Arg-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N JIODCDXKCJRMEH-NHCYSSNCSA-N 0.000 description 1
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 1
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 1
- DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 1
- CWOSXNKDOACNJN-BZSNNMDCSA-N Val-Arg-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N CWOSXNKDOACNJN-BZSNNMDCSA-N 0.000 description 1
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 description 1
- QGFPYRPIUXBYGR-YDHLFZDLSA-N Val-Asn-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N QGFPYRPIUXBYGR-YDHLFZDLSA-N 0.000 description 1
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 1
- KXUKIBHIVRYOIP-ZKWXMUAHSA-N Val-Asp-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N KXUKIBHIVRYOIP-ZKWXMUAHSA-N 0.000 description 1
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 1
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 1
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 1
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 1
- XKVXSCHXGJOQND-ZOBUZTSGSA-N Val-Asp-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N XKVXSCHXGJOQND-ZOBUZTSGSA-N 0.000 description 1
- FRUYSSRPJXNRRB-GUBZILKMSA-N Val-Cys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N FRUYSSRPJXNRRB-GUBZILKMSA-N 0.000 description 1
- BWVHQINTNLVWGZ-ZKWXMUAHSA-N Val-Cys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BWVHQINTNLVWGZ-ZKWXMUAHSA-N 0.000 description 1
- FPCIBLUVDNXPJO-XPUUQOCRSA-N Val-Cys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FPCIBLUVDNXPJO-XPUUQOCRSA-N 0.000 description 1
- DLYOEFGPYTZVSP-AEJSXWLSSA-N Val-Cys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N DLYOEFGPYTZVSP-AEJSXWLSSA-N 0.000 description 1
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 description 1
- XTAUQCGQFJQGEJ-NHCYSSNCSA-N Val-Gln-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XTAUQCGQFJQGEJ-NHCYSSNCSA-N 0.000 description 1
- OUUBKKIJQIAPRI-LAEOZQHASA-N Val-Gln-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OUUBKKIJQIAPRI-LAEOZQHASA-N 0.000 description 1
- AGKDVLSDNSTLFA-UMNHJUIQSA-N Val-Gln-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N AGKDVLSDNSTLFA-UMNHJUIQSA-N 0.000 description 1
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 1
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 1
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 1
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 1
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 1
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 1
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 1
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 1
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 1
- MDYSKHBSPXUOPV-JSGCOSHPSA-N Val-Gly-Phe Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MDYSKHBSPXUOPV-JSGCOSHPSA-N 0.000 description 1
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 1
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 1
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 1
- FEFZWCSXEMVSPO-LSJOCFKGSA-N Val-His-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](C)C(O)=O FEFZWCSXEMVSPO-LSJOCFKGSA-N 0.000 description 1
- OPGWZDIYEYJVRX-AVGNSLFASA-N Val-His-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N OPGWZDIYEYJVRX-AVGNSLFASA-N 0.000 description 1
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 1
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 1
- PYPZMFDMCCWNST-NAKRPEOUSA-N Val-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N PYPZMFDMCCWNST-NAKRPEOUSA-N 0.000 description 1
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 1
- FTKXYXACXYOHND-XUXIUFHCSA-N Val-Ile-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O FTKXYXACXYOHND-XUXIUFHCSA-N 0.000 description 1
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 1
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 1
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 1
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 1
- BZOSBRIDWSSTFN-AVGNSLFASA-N Val-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N BZOSBRIDWSSTFN-AVGNSLFASA-N 0.000 description 1
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 1
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 1
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 1
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 1
- MLADEWAIYAPAAU-IHRRRGAJSA-N Val-Lys-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N MLADEWAIYAPAAU-IHRRRGAJSA-N 0.000 description 1
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 1
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 1
- SVFRYKBZHUGKLP-QXEWZRGKSA-N Val-Met-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVFRYKBZHUGKLP-QXEWZRGKSA-N 0.000 description 1
- JVGHIFMSFBZDHH-WPRPVWTQSA-N Val-Met-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N JVGHIFMSFBZDHH-WPRPVWTQSA-N 0.000 description 1
- VNGKMNPAENRGDC-JYJNAYRXSA-N Val-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 VNGKMNPAENRGDC-JYJNAYRXSA-N 0.000 description 1
- UZFNHAXYMICTBU-DZKIICNBSA-N Val-Phe-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UZFNHAXYMICTBU-DZKIICNBSA-N 0.000 description 1
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 1
- VCIYTVOBLZHFSC-XHSDSOJGSA-N Val-Phe-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N VCIYTVOBLZHFSC-XHSDSOJGSA-N 0.000 description 1
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 1
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 1
- JMCOXFSCTGKLLB-FKBYEOEOSA-N Val-Phe-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N JMCOXFSCTGKLLB-FKBYEOEOSA-N 0.000 description 1
- LGXUZJIQCGXKGZ-QXEWZRGKSA-N Val-Pro-Asn Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N LGXUZJIQCGXKGZ-QXEWZRGKSA-N 0.000 description 1
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 1
- NSUUANXHLKKHQB-BZSNNMDCSA-N Val-Pro-Trp Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CNC2=CC=CC=C12 NSUUANXHLKKHQB-BZSNNMDCSA-N 0.000 description 1
- QWCZXKIFPWPQHR-JYJNAYRXSA-N Val-Pro-Tyr Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QWCZXKIFPWPQHR-JYJNAYRXSA-N 0.000 description 1
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 1
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 1
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 1
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 1
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 1
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 1
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 1
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 1
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 1
- LNWSJGJCLFUNTN-ZOBUZTSGSA-N Val-Trp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LNWSJGJCLFUNTN-ZOBUZTSGSA-N 0.000 description 1
- ZLMFVXMJFIWIRE-FHWLQOOXSA-N Val-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](C(C)C)N ZLMFVXMJFIWIRE-FHWLQOOXSA-N 0.000 description 1
- QTXGUIMEHKCPBH-FHWLQOOXSA-N Val-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 QTXGUIMEHKCPBH-FHWLQOOXSA-N 0.000 description 1
- LZRWTJSPTJSWDN-FKBYEOEOSA-N Val-Trp-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O)N LZRWTJSPTJSWDN-FKBYEOEOSA-N 0.000 description 1
- CFIBZQOLUDURST-IHRRRGAJSA-N Val-Tyr-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CS)C(=O)O)N CFIBZQOLUDURST-IHRRRGAJSA-N 0.000 description 1
- DOBHJKVVACOQTN-DZKIICNBSA-N Val-Tyr-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 DOBHJKVVACOQTN-DZKIICNBSA-N 0.000 description 1
- JPBGMZDTPVGGMQ-ULQDDVLXSA-N Val-Tyr-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N JPBGMZDTPVGGMQ-ULQDDVLXSA-N 0.000 description 1
- PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 description 1
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 1
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- 108010003533 Viral Envelope Proteins Proteins 0.000 description 1
- 108700005077 Viral Genes Proteins 0.000 description 1
- 108020000999 Viral RNA Proteins 0.000 description 1
- 229920002494 Zein Polymers 0.000 description 1
- 238000005299 abrasion Methods 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 238000003915 air pollution Methods 0.000 description 1
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- OFHCOWSQAMBJIW-AVJTYSNKSA-N alfacalcidol Chemical compound C1(/[C@@H]2CC[C@@H]([C@]2(CCC1)C)[C@H](C)CCCC(C)C)=C\C=C1\C[C@@H](O)C[C@H](O)C1=C OFHCOWSQAMBJIW-AVJTYSNKSA-N 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 210000000709 aorta Anatomy 0.000 description 1
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 1
- 108010006195 arginyl-glycyl-aspartyl-cysteine Proteins 0.000 description 1
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 1
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 1
- 108010094001 arginyl-tryptophyl-arginine Proteins 0.000 description 1
- 108010027234 aspartyl-glycyl-glutamyl-alanine Proteins 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 238000009835 boiling Methods 0.000 description 1
- 210000000988 bone and bone Anatomy 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 230000030570 cellular localization Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 108010049937 collagen type I trimeric cross-linked peptide Proteins 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 150000001879 copper Chemical class 0.000 description 1
- 239000002537 cosmetic Substances 0.000 description 1
- 239000000287 crude extract Substances 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 108010018866 cysteinyl-seryl-valyl-threonyl-cysteinyl-glycine Proteins 0.000 description 1
- 108010004073 cysteinylcysteine Proteins 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 239000000539 dimer Substances 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 230000005684 electric field Effects 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 230000006353 environmental stress Effects 0.000 description 1
- DNJIEGIFACGWOD-UHFFFAOYSA-N ethyl mercaptane Natural products CCS DNJIEGIFACGWOD-UHFFFAOYSA-N 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- 108010056686 glycosylated collagen Proteins 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 1
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 1
- 108010081985 glycyl-cystinyl-aspartic acid Proteins 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 1
- 108010079413 glycyl-prolyl-glutamic acid Proteins 0.000 description 1
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 1
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 238000000227 grinding Methods 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 210000001822 immobilized cell Anatomy 0.000 description 1
- 238000003018 immunoassay Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 230000008595 infiltration Effects 0.000 description 1
- 238000001764 infiltration Methods 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 238000011081 inoculation Methods 0.000 description 1
- 108010027338 isoleucylcysteine Proteins 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 108010087810 leucyl-seryl-glutamyl-leucine Proteins 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 210000004072 lung Anatomy 0.000 description 1
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 1
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 1
- 235000019341 magnesium sulphate Nutrition 0.000 description 1
- 230000001071 malnutrition Effects 0.000 description 1
- 235000000824 malnutrition Nutrition 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 230000013011 mating Effects 0.000 description 1
- 239000000155 melt Substances 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 108010085203 methionylmethionine Proteins 0.000 description 1
- 230000002906 microbiologic effect Effects 0.000 description 1
- 239000011859 microparticle Substances 0.000 description 1
- 230000002438 mitochondrial effect Effects 0.000 description 1
- 230000026326 mitochondrial transport Effects 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 230000006911 nucleation Effects 0.000 description 1
- 238000010899 nucleation Methods 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 238000007899 nucleic acid hybridization Methods 0.000 description 1
- 208000015380 nutritional deficiency disease Diseases 0.000 description 1
- 238000002515 oligonucleotide synthesis Methods 0.000 description 1
- 230000003204 osmotic effect Effects 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- FJKROLUGYXJWQN-UHFFFAOYSA-N papa-hydroxy-benzoic acid Natural products OC(=O)C1=CC=C(O)C=C1 FJKROLUGYXJWQN-UHFFFAOYSA-N 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 1
- 230000000737 periodic effect Effects 0.000 description 1
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 1
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010084572 phenylalanyl-valine Proteins 0.000 description 1
- 108010073101 phenylalanylleucine Proteins 0.000 description 1
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 1
- 239000000419 plant extract Substances 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 230000002062 proliferating effect Effects 0.000 description 1
- 108010020432 prolyl-prolylisoleucine Proteins 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 238000012514 protein characterization Methods 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 230000001172 regenerating effect Effects 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 210000003935 rough endoplasmic reticulum Anatomy 0.000 description 1
- 229960004889 salicylic acid Drugs 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 238000010187 selection method Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 108010015840 seryl-prolyl-lysyl-lysine Proteins 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 210000003491 skin Anatomy 0.000 description 1
- 239000002689 soil Substances 0.000 description 1
- 210000001082 somatic cell Anatomy 0.000 description 1
- 230000002269 spontaneous effect Effects 0.000 description 1
- 230000000638 stimulation Effects 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 230000009885 systemic effect Effects 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
- 210000002435 tendon Anatomy 0.000 description 1
- KUUVQVSHGLHAKZ-UHFFFAOYSA-N thionine Chemical compound C=1C=CC=CSC=CC=1 KUUVQVSHGLHAKZ-UHFFFAOYSA-N 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- 125000003508 trans-4-hydroxy-L-proline group Chemical group 0.000 description 1
- 238000000844 transformation Methods 0.000 description 1
- 230000010474 transient expression Effects 0.000 description 1
- 108091005703 transmembrane proteins Proteins 0.000 description 1
- 102000035160 transmembrane proteins Human genes 0.000 description 1
- 230000032258 transport Effects 0.000 description 1
- 108700004896 tripeptide FEG Proteins 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- 108010014563 tryptophyl-cysteinyl-serine Proteins 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- 108010045269 tryptophyltryptophan Proteins 0.000 description 1
- 108010044292 tryptophyltyrosine Proteins 0.000 description 1
- WFKWXMTUELFFGS-UHFFFAOYSA-N tungsten Chemical compound [W] WFKWXMTUELFFGS-UHFFFAOYSA-N 0.000 description 1
- 229910052721 tungsten Inorganic materials 0.000 description 1
- 239000010937 tungsten Substances 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 238000009281 ultraviolet germicidal irradiation Methods 0.000 description 1
- 230000035899 viability Effects 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 108010027345 wheylin-1 peptide Proteins 0.000 description 1
- 108010000998 wheylin-2 peptide Proteins 0.000 description 1
- 239000005019 zein Substances 0.000 description 1
- 229940093612 zein Drugs 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8242—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
- C12N15/8257—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits for the production of primary gene products, e.g. pharmaceutical products, interferon
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/415—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/78—Connective tissue peptides, e.g. collagen, elastin, laminin, fibronectin, vitronectin or cold insoluble globulin [CIG]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
- C12N15/8271—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance
- C12N15/8273—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for drought, cold, salt resistance
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A40/00—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production
- Y02A40/10—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production in agriculture
- Y02A40/146—Genetically Modified [GMO] plants, e.g. transgenic plants
Landscapes
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Molecular Biology (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- General Health & Medical Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Biophysics (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- Biomedical Technology (AREA)
- Wood Science & Technology (AREA)
- Medicinal Chemistry (AREA)
- Cell Biology (AREA)
- Microbiology (AREA)
- Physics & Mathematics (AREA)
- Gastroenterology & Hepatology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Plant Pathology (AREA)
- Toxicology (AREA)
- Pharmacology & Pharmacy (AREA)
- Botany (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
- Peptides Or Proteins (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Fertilizers (AREA)
- Cosmetics (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
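A method of producing collagen in a plant and plants that produce collagen are provided. The method is effected by expressing at least one type of collagen alpha chain in the plant in a manner that enables the collagen alpha chain to accumulate in a subcellular compartment devoid of endogenous P4H activity, thereby producing collagen in the plant.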
Description
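The present invention relates to collagen-producing plants and to methods of generating and using same. More particularly, the present invention relates to a novel approach for generating plants capable of producing high levels of hydroxylated collagen chains that can form native triple-helical type I collagen fibers.
Collagens are the main structural proteins responsible for the structural integrity of vertebrates and many other multicellular organisms. Type I collagen represents the prototypical fibrillar collagen and is the major collagen type in most tissues.
Type I collagen is the predominant collagen component of bone and tendon and is found in large amounts in skin, aorta and lung. Type I collagen fibers provide great tensile strength and limited extensibility. The most abundant molecular form of type I collagen is a heterotrimer composed of two different alpha chains, [alpha 1(I)]2 and alpha 2(I) (Inkinen, 2003). All fibrillar collagen molecules contain three polypeptide chains constructed from a repeating Gly-X-Y triplet, where X and Y can be any amino acid but are frequently the imino acids proline and hydroxyproline.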
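The Gly-X-Y repeat described above can be illustrated with a minimal Python sketch. It assumes a one-letter amino-acid string whose reading frame starts on a glycine; the function names and the toy sequence are hypothetical illustrations and are not taken from any sequence listing in this document.

```python
from typing import List


def split_triplets(seq: str) -> List[str]:
    """Split a one-letter amino-acid string into consecutive complete triplets."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def is_gly_x_y_repeat(seq: str) -> bool:
    """Return True if the whole sequence is an uninterrupted Gly-X-Y repeat.

    The sequence must be non-empty, have a length divisible by three,
    and carry glycine (G) at the first position of every triplet.
    """
    seq = seq.upper()
    if not seq or len(seq) % 3 != 0:
        return False
    return all(triplet[0] == "G" for triplet in split_triplets(seq))


# Toy sequence for illustration only (not a real collagen sequence).
toy_region = "GPPGAPGEP"
print(split_triplets(toy_region))
print(is_gly_x_y_repeat(toy_region))
```

A check of this kind only confirms the triplet periodicity; it says nothing about hydroxylation or helix stability.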
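Fibril-forming collagens are synthesized as precursor procollagens containing globular N- and C-terminal extension propeptides. The biosynthesis of procollagen is a complex process involving a number of different post-translational modifications, including proline and lysine hydroxylation, N-linked and O-linked glycosylation, and both intra- and inter-chain disulfide-bond formation. The enzymes carrying out these modifications act in a coordinated fashion to ensure the folding and assembly of a correctly aligned and thermally stable triple-helical molecule.
Each procollagen molecule assembles within the rough endoplasmic reticulum from three constituent polypeptide chains. As the polypeptide chain is co-translationally translocated across the membrane of the endoplasmic reticulum, hydroxylation of proline and lysine residues occurs within the Gly-X-Y repeat region. Once the polypeptide chain is fully translocated into the lumen of the endoplasmic reticulum, the C-propeptide folds. Three pro-alpha chains then associate via their C-propeptides to form a trimeric molecule, allowing the Gly-X-Y repeat region to form a nucleation point at its C-terminal end and ensuring correct alignment of the chains. The Gly-X-Y region then folds in a C-to-N direction to form a triple helix.
The temporal relationship between polypeptide chain modification and triple-helix formation is crucial, because hydroxylation of proline residues is required to ensure stability of the triple helix at body temperature and, once formed, the triple helix no longer serves as a substrate for the hydroxylating enzyme. The C-propeptides (and to a lesser extent the N-propeptides) keep the procollagen soluble during its passage through the cell (Bulleid et al., 2000). During or following secretion of procollagen molecules into the extracellular matrix, the propeptides are removed by procollagen N- and C-proteinases, thereby triggering spontaneous self-assembly of collagen molecules into fibrils (Hulmes, 2002). Removal of the propeptides by procollagen N- and C-proteinases lowers the solubility of procollagen by more than 10,000-fold and is necessary and sufficient to initiate the self-assembly of collagen into fibers. Crucial to this assembly process are short non-triple-helical peptides called telopeptides at the ends of the triple-helical domain, which ensure correct registration of the collagen molecules within the fibril structure and lower the critical concentration for self-assembly (Bulleid et al., 2000). In nature, the stability of the triple-helical structure of collagen requires hydroxylation of prolines by the enzyme prolyl-4-hydroxylase (P4H) to form hydroxyproline residues within the collagen chain.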
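Plants expressing collagen chains are known in the art [see, for example, U.S. Pat. No. 6,617,431 and (Merle et al., 2002; Ruggiero et al., 2000)]. Although plants are capable of synthesizing hydroxyproline-containing proteins, the prolyl hydroxylase responsible for the synthesis of hydroxyproline in plant cells exhibits relatively loose substrate sequence specificity compared with mammalian P4H. Thus, production of collagen containing hydroxyproline only in the Y position of Gly-X-Y triplets requires plant co-expression of collagen and P4H genes (Olsen et al., 2003).
An attempt to produce human collagen by relying on the hydroxylation machinery naturally present in plants resulted in collagen that is poorly hydroxylated at proline residues (Merle et al., 2002). Such collagen melts or loses its triple-helical structure at temperatures below 30 °C. Co-expression of collagen and prolyl hydroxylase results in stable hydroxylated collagen that is biologically relevant for applications at body temperature (Merle et al., 2002).
Lysyl hydroxylase (LH, EC 1.14.11.4), galactosyltransferase (EC 2.4.1.50) and glucosyltransferase (EC 2.4.1.66) are enzymes involved in post-translational modification of collagens. They sequentially modify lysyl residues in specific positions to hydroxylysyl, galactosylhydroxylysyl and glucosylgalactosylhydroxylysyl residues. These structures are unique to collagens and essential for their functional activity (Wang et al., 2002). A single human enzyme, lysyl hydroxylase 3 (LH3), can catalyze all three consecutive steps in hydroxylysine-linked carbohydrate formation (Wang et al., 2002).
Hydroxylysines of a human collagen expressed in tobacco amount to only about 2% of the hydroxylysines found in bovine collagen (0.04% of residues versus 1.88% of residues). This suggests that plant endogenous lysyl hydroxylase is unable to sufficiently hydroxylate lysines in collagen.
While reducing the present invention to practice, the present inventors uncovered that efficient hydroxylation of collagen chains relies upon sequestering of the collagen chain along with an enzyme capable of correctly modifying this polypeptide.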
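Summary of the Invention
According to one aspect of the present invention there is provided a method of producing collagen in a plant or an isolated plant cell, comprising expressing in the plant or the isolated plant cell at least one type of a collagen alpha chain and exogenous P4H in a manner enabling accumulation of the at least one type of the collagen alpha chain and the exogenous P4H in a subcellular compartment devoid of endogenous P4H activity, thereby producing the collagen in the plant.
According to further features in preferred embodiments of the invention described below, the method further comprises expressing exogenous LH3 in the subcellular compartment devoid of endogenous P4H activity.
According to still further features in the described preferred embodiments, the at least one type of the collagen alpha chain includes a signal peptide for targeting to an apoplast or a vacuole.
According to still further features in the described preferred embodiments, the at least one type of the collagen alpha chain is devoid of an ER targeting or retention sequence.
According to still further features in the described preferred embodiments, the at least one type of the collagen alpha chain is expressed in a DNA-containing organelle of the plant.
According to still further features in the described preferred embodiments, the exogenous P4H includes a signal peptide for targeting to an apoplast or a vacuole.
According to still further features in the described preferred embodiments, the exogenous P4H is devoid of an ER targeting or retention sequence.
According to still further features in the described preferred embodiments, the exogenous P4H is expressed in a DNA-containing organelle of the plant.
According to still further features in the described preferred embodiments, the at least one type of the collagen alpha chain is an alpha 1 chain.
According to still further features in the described preferred embodiments, the at least one type of the collagen alpha chain is an alpha 2 chain.
According to still further features in the described preferred embodiments, the at least one type of the collagen alpha chain includes a C-terminal and/or an N-terminal propeptide.
According to still further features in the described preferred embodiments, the plant is selected from the group consisting of tobacco, maize, alfalfa, rice, potato, soybean, tomato, wheat, barley, canola and cotton.
According to still further features in the described preferred embodiments, the at least one type of the collagen alpha chain or the exogenous P4H is expressed in only a portion of the plant.
According to still further features in the described preferred embodiments, the portion of the plant is a leaf, a seed, a root, a tuber or a stem.
According to still further features in the described preferred embodiments, the exogenous P4H is capable of specifically hydroxylating the Y position of Gly-X-Y triplets of the at least one type of the collagen alpha chain.
According to still further features in the described preferred embodiments, the exogenous P4H is human P4H.
According to still further features in the described preferred embodiments, the plant is subjected to a stress condition.
According to still further features in the described preferred embodiments, the stress condition is selected from the group consisting of drought, salinity, injury, cold and spraying with stress-inducing compounds.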
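According to another aspect of the present invention there is provided a genetically modified plant or isolated plant cell capable of accumulating a collagen alpha chain having a hydroxylation pattern identical to that produced when the collagen alpha chain is expressed in human cells.
According to yet another aspect of the present invention there is provided a genetically modified plant or isolated plant cell capable of accumulating a collagen alpha chain in a subcellular compartment devoid of endogenous P4H activity.
According to still further features in the described preferred embodiments, the genetically modified plant further comprises exogenous P4H.
According to still further features in the described preferred embodiments, the at least one type of the collagen alpha chain includes a signal peptide for targeting to an apoplast or a vacuole.
According to still further features in the described preferred embodiments, the at least one type of the collagen alpha chain is devoid of an ER targeting or retention sequence.
According to still further features in the described preferred embodiments, the at least one type of the collagen alpha chain is expressed in a DNA-containing organelle of the plant.
According to still further features in the described preferred embodiments, the exogenous P4H includes a signal peptide for targeting to an apoplast or a vacuole.
According to still further features in the described preferred embodiments, the exogenous P4H is devoid of an ER targeting or retention sequence.
According to still further features in the described preferred embodiments, the exogenous P4H is expressed in a DNA-containing organelle of the plant.
According to still further features in the described preferred embodiments, the collagen alpha chain is an alpha 1 chain.
According to still further features in the described preferred embodiments, the collagen alpha chain is an alpha 2 chain.
According to still further features in the described preferred embodiments, the collagen alpha chain includes a C-terminal and/or an N-terminal propeptide.
According to still another aspect of the present invention there is provided a plant system comprising a first genetically modified plant capable of accumulating a collagen alpha 1 chain and a second genetically modified plant capable of accumulating a collagen alpha 2 chain.
According to still another aspect of the present invention there is provided a plant system comprising a first genetically modified plant capable of accumulating a collagen alpha 1 chain and a collagen alpha 2 chain and a second genetically modified plant capable of accumulating P4H.
According to still further features in the described preferred embodiments, at least one of the first genetically modified plant and the second genetically modified plant further comprises exogenous P4H.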
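According to still another aspect of the present invention there is provided a method of producing fibrillar collagen, comprising: (a) expressing in a first plant a collagen alpha 1 chain; (b) expressing in a second plant a collagen alpha 2 chain, wherein expression in the first plant and the second plant is configured such that the collagen alpha 1 chain and the collagen alpha 2 chain are each capable of accumulating in a subcellular compartment devoid of endogenous P4H activity; and (c) crossing the first plant and the second plant and selecting progeny expressing the collagen alpha 1 chain and the collagen alpha 2 chain, thereby producing fibrillar collagen.
According to still further features in the described preferred embodiments, the method further comprises expressing exogenous P4H in each of the first plant and the second plant.
According to still further features in the described preferred embodiments, each of the collagen alpha 1 chain and the collagen alpha 2 chain includes a signal peptide for targeting to an apoplast or a vacuole.
According to still further features in the described preferred embodiments, each of the collagen alpha 1 chain and the collagen alpha 2 chain is devoid of an ER targeting or retention sequence.
According to still further features in the described preferred embodiments, steps (a) and (b) are effected via expression in a DNA-containing organelle of the plant.
According to still further features in the described preferred embodiments, the exogenous P4H includes a signal peptide for targeting to an apoplast or a vacuole.
According to still further features in the described preferred embodiments, the exogenous P4H is devoid of an ER targeting or retention sequence.
According to still further features in the described preferred embodiments, the exogenous P4H is expressed in a DNA-containing organelle of the plant.
According to still further features in the described preferred embodiments, each of the collagen alpha 1 chain and the collagen alpha 2 chain includes a C-terminal and/or an N-terminal propeptide.
According to still further features in the described preferred embodiments, the exogenous P4H is capable of specifically hydroxylating the Y position of Gly-X-Y triplets of the at least one type of the collagen alpha chain.
According to still further features in the described preferred embodiments, the exogenous P4H is human P4H.
According to still further features in the described preferred embodiments, the first plant and the second plant are subjected to a stress condition.
According to still further features in the described preferred embodiments, the stress condition is selected from the group consisting of drought, salinity, injury, heavy metal toxicity and cold stress.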
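According to still another aspect of the present invention there is provided a method of producing fibrillar collagen, comprising: (a) expressing in a first plant a collagen alpha 1 chain and a collagen alpha 2 chain, wherein expression in the first plant is configured such that the collagen alpha 1 chain and the collagen alpha 2 chain are each capable of accumulating in a subcellular compartment devoid of endogenous P4H activity; (b) expressing in a second plant an exogenous P4H capable of accumulating in the subcellular compartment devoid of endogenous P4H activity; and (c) crossing the first plant and the second plant and selecting progeny expressing the collagen alpha 1 chain, the collagen alpha 2 chain and the P4H, thereby producing fibrillar collagen.
According to still another aspect of the present invention there is provided a nucleic acid construct comprising a polynucleotide encoding a human P4H positioned under the transcriptional control of a promoter functional in plants.
According to still further features in the described preferred embodiments, the promoter is selected from the group consisting of the CaMV 35S promoter, the ubiquitin promoter, the rbcS promoter and the SVBV promoter.
According to still another aspect of the present invention there is provided a genetically modified plant or isolated plant cell capable of expressing collagen alpha 1 chain, collagen alpha 2 chain, P4H, LH3 and protease C and/or protease N.
According to still further features in the described preferred embodiments, the collagen alpha 1 chain and the collagen alpha 2 chain are each capable of accumulating in a subcellular compartment devoid of endogenous plant P4H activity.
According to still another aspect of the present invention there is provided a genetically modified plant or isolated plant cell capable of accumulating collagen having a temperature stability identical to that of mammalian collagen.
According to still further features in the described preferred embodiments, the collagen is type I collagen.
According to still further features in the described preferred embodiments, the mammalian collagen is human collagen.
According to still another aspect of the present invention there is provided a collagen-encoding sequence optimized for expression in a plant.
According to still further features in the described preferred embodiments, the collagen-encoding sequence is as set forth by SEQ ID NO: 1.
The present invention successfully addresses the shortcomings of the presently known configurations by providing a plant capable of expressing correctly hydroxylated collagen chains that can assemble into collagen having properties similar to those of human collagen.
Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Although methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, suitable methods and materials are described below. In case of conflict, the patent specification, including definitions, will control. In addition, the materials, methods and examples are illustrative only and are not intended to be limiting.
The invention is herein described, by way of example only, with reference to the accompanying drawings. With specific reference now to the drawings in detail, it is stressed that the particulars shown are by way of example and for purposes of illustrative discussion of the preferred embodiments of the present invention only, and are presented in order to provide what is believed to be the most useful and readily understood description of the principles and conceptual aspects of the invention. In this regard, no attempt is made to show structural details of the invention in more detail than is necessary for a fundamental understanding of the invention; the description taken with the drawings makes apparent to those skilled in the art how the several forms of the invention may be embodied in practice.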
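In the drawings:
FIGs. 1a-d illustrate construction of the various expression cassettes and vectors used to transform test plants. All of the coding sequences synthesized as part of the present study were optimized for expression in tobacco.
FIG. 2 illustrates the various co-transformation approaches. Each expression cassette is represented by the short name of the coding sequence. The coding sequences are specified in Table 1. Each co-transformation was performed with two pBINPLUS binary vectors. Each rectangle represents a single pBINPLUS vector carrying one, two or three expression cassettes. Promoters and terminators are specified in Example 1.
FIG. 3 is a multiplex PCR screening of transformants showing plants that are positive for collagen alpha 1 (324 bp fragment), collagen alpha 2 (537 bp fragment) or both.
FIG. 4 is a Western blot analysis of transgenic plants generated by co-transformations 2, 3 and 4. Total soluble proteins were extracted from tobacco co-transformants 2, 3 and 4 and probed with anti-collagen I antibody (#AB745 from Chemicon Inc.). The size marker was #SM0671 from Fermentas Inc. W.T. is wild-type tobacco. Positive collagen bands are visible in plants that are PCR-positive for collagen type I alpha 1, alpha 2 or both. The positive control band of 500 ng collagen type I from human placenta (#CC050 from Chemicon Inc., extracted from human placenta by pepsin digestion) represents about 0.3% of the total soluble protein (about 150 µg) in the samples from the transgenic plants. The larger band at about 140 kDa in the human collagen sample is procollagen with its C-propeptide, as detected by an antibody against the carboxy-terminal propeptide of collagen type I (#MAB1913 from Chemicon Inc.). The smaller band at about 120 kDa in the human collagen sample is collagen without propeptides. Owing to their unusual composition, proline-rich proteins (including collagens) consistently migrate on polyacrylamide gels as bands with a molecular mass higher than expected. Therefore, the collagen chains without propeptides, which have a molecular weight of about 95 kDa, migrate as a band of about 120 kDa.
FIG. 5 is a Western blot analysis of a transgenic plant generated by co-transformation #8 (carrying an apoplast signal translationally fused to the collagen chains). Total soluble protein was extracted from transgenic tobacco leaves and probed with anti-collagen I antibody (#AB745 from Chemicon Inc.). A positive collagen alpha 2 band is visible in plant 8-141. Collagen type I from human placenta (#CC050 from Chemicon Inc.) served as a control.
FIGs. 6a-b illustrate collagen triple-helix assembly and thermal stability as assessed by heat treatment and trypsin or pepsin digestion. In FIG. 6a, total soluble protein from tobacco 2-9 (expressing only collagen alpha 1 and no P4H) and 3-5 (expressing collagen alpha 1 and 2 and both the human P4H alpha and beta subunits) was subjected to heat treatment (15 minutes at 38 °C or 43 °C) followed by trypsin digestion (20 minutes at room temperature) and probed with anti-collagen I antibody in a Western blot procedure. The positive control was a sample of 500 ng human collagen I plus total soluble protein of wild-type tobacco. In FIG. 6b, total soluble protein was extracted from transgenic tobacco 13-6 (expressing collagen I alpha 1 and alpha 2 chains, indicated by arrows, the human P4H alpha and beta subunits and human LH3), subjected to heat treatment (20 minutes at 33 °C, 38 °C or 42 °C), immediately cooled on ice to prevent reassembly of the triple helix, incubated with pepsin for 30 minutes at room temperature (about 22 °C) and then probed with anti-collagen I antibody (#AB745 from Chemicon Inc.) in a standard Western blot procedure. The positive control was a sample of about 50 ng human collagen I (#CC050 from Chemicon Inc., extracted from human placenta by pepsin digestion) added to total soluble protein extracted from wild-type tobacco.
FIG. 7 illustrates Northern blot analysis conducted on wild-type tobacco. Blots were probed with tobacco P4H cDNA.
FIG. 8 is a Western blot analysis of transgenic plants generated by co-transformations 2, 3 and 13. Total soluble protein was extracted from tobacco co-transformants and probed with anti-human P4H alpha and beta antibodies and anti-collagen I antibody.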
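The loading comparison quoted in the FIG. 4 description (a 500 ng control band against about 150 µg of total soluble protein) can be checked with a short calculation. The sketch below is illustrative only; the function name is hypothetical, and the two input values are the approximate figures stated in the FIG. 4 description.

```python
def percent_of_total_protein(target_ng: float, total_ug: float) -> float:
    """Express a protein amount (ng) as a percentage of total protein (µg)."""
    total_ng = total_ug * 1000.0  # 1 µg = 1000 ng
    return 100.0 * target_ng / total_ng


# Values from the FIG. 4 description: 500 ng control, about 150 µg total protein.
share = percent_of_total_protein(500.0, 150.0)
print(f"{share:.2f}% of total soluble protein")
```

The result, roughly one third of a percent, is consistent with the "about 0.3%" stated for the positive control band.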
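Description of the Preferred Embodiments
The present invention is of plants expressing and accumulating collagen, which can be used to produce collagen and collagen fibers that display the characteristics of mammalian collagen.
The principles and operation of the present invention may be better understood with reference to the drawings and accompanying descriptions.
Before explaining at least one embodiment of the invention in detail, it is to be understood that the invention is not limited in its application to the details of construction and the arrangement of the components set forth in the following description or illustrated in the drawings. The invention is capable of other embodiments or of being practiced or carried out in various ways. Also, it is to be understood that the phraseology and terminology employed herein are for the purpose of description and should not be regarded as limiting.
Collagen-producing plants are known in the art. Although such plants can be used to produce collagen chains as well as collagen, such chains are incorrectly hydroxylated, and thus their self-assembly, whether in planta or not, leads to collagen that is inherently unstable.
While reducing the present invention to practice, the present inventors devised a plant expression approach that ensures correct hydroxylation of collagen chains and thus enables in-planta production of collagen that closely mimics the characteristics (e.g., temperature stability) of human type I collagen.
Thus, according to one aspect of the present invention there is provided a genetically modified plant which is capable of expressing at least one type of a collagen alpha chain and accumulating it in a subcellular compartment devoid of endogenous P4H activity.
As used herein, the phrase "genetically modified plant" refers to any lower (e.g., moss) or higher (vascular) plant, or a tissue or an isolated cell thereof, that is stably or transiently transformed with an exogenous polynucleotide sequence. Examples of plants include tobacco, maize, alfalfa, rice, potato, soybean, tomato, wheat, barley, canola, cotton and carrot, as well as lower plants such as moss.
As used herein, the phrase "collagen chain" refers to a collagen subunit such as the alpha 1 or alpha 2 chain of collagen fibers, preferably type I collagen fibers. As used herein, the phrase "collagen" refers to an assembled collagen trimer, which in the case of type I collagen includes two alpha 1 chains and one alpha 2 chain. A collagen fiber is collagen that is devoid of terminal propeptides C and N.
As used herein, the phrase "subcellular compartment devoid of endogenous P4H" refers to any compartmentalized region of the cell that does not include plant P4H or an enzyme having plant-like P4H activity. Examples of such subcellular compartments include the vacuole, the apoplast and the cytoplasm, as well as organelles such as the chloroplast, mitochondria and the like.
Any type of collagen chain can be expressed by the genetically modified plant of the present invention. Examples include fibril-forming collagens (types I, II, III, V and XI), network-forming collagens (types IV, VIII and X), collagens associated with fibril surfaces (types IX, XII and XIV), collagens that occur as transmembrane proteins (types XIII and XVII) and collagens that form 11-nm periodic beaded filaments (type VI). For further description, see Hulmes, 2002.
Preferably, the collagen chain expressed is an alpha 1 and/or alpha 2 chain of type I collagen. The expressed collagen alpha chain can be encoded by any polynucleotide sequence derived from any animal. Preferably, the sequences encoding collagen alpha chains are human and are set forth by SEQ ID NOs: 1 and 4.
Typically, alpha collagen chains expressed in plants may or may not include their terminal propeptides (i.e., propeptide C and propeptide N).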
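Ruggiero et al. (2000) note that processing of procollagen by plant proteolytic activity differs from normal processing in humans and that propeptide C is removed by plant proteolytic activity, although the cleavage site is unknown. Cleavage of the C-propeptide may take place on a procollagen peptide before assembly of the trimer (association of three C-propeptides is essential for initiating trimer assembly).
N-propeptide cleavage by plant proteolytic activity takes place in mature plants but not in plantlets. Such cleavage removes 2 amino acids from the N-telopeptide (2 out of 17).
The C-propeptides (and to a lesser extent the N-propeptides) keep the procollagen soluble during its passage through the animal cell (Bulleid et al., 2000) and are expected to have a similar effect in the plant cell. During or following secretion of procollagen molecules into the extracellular matrix, the propeptides are removed by procollagen N- and C-proteinases, thereby triggering spontaneous self-assembly of collagen molecules into fibrils (Hulmes, 2002). Removal of the propeptides by procollagen N- and C-proteinases lowers the solubility of procollagen by more than 10,000-fold and is necessary and sufficient to initiate the self-assembly of collagen into fibers. Crucial to this assembly process are short non-triple-helical peptides called telopeptides at the ends of the triple-helical domain, which ensure correct registration of the collagen molecules within the fibril structure and lower the critical concentration for self-assembly (Bulleid et al., 2000). The prior art describes the use of pepsin to cleave the propeptides during production of collagen (Bulleid et al., 2000). However, pepsin damages the telopeptides and, as a result, pepsin-extracted collagen is unable to form ordered fibrillar structures (Bulleid et al., 2000).
Protein disulfide isomerase (PDI), which forms the beta subunit of human P4H, was shown to bind to the C-propeptide prior to trimer assembly, thereby also acting as a molecular chaperone during chain assembly (Ruggiero et al., 2000). The use of human procollagen type I N-proteinase and procollagen C-proteinase expressed in a different plant may generate collagen that is more similar to native human collagen and can form ordered fibrillar structures.
In a case where N or C propeptides, or both, are included in the expressed collagen chain, the genetically modified plant of the present invention can also express the respective protease (i.e., C or N or both). Polynucleotide sequences encoding such proteases are exemplified by SEQ ID NOs: 18 (protease C) and 20 (protease N). Such proteases can be expressed such that they accumulate in the same subcellular compartment as the collagen chain.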
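Accumulation of the expressed collagen chain in a subcellular compartment devoid of endogenous P4H activity can be effected via any one of several approaches.
For example, the expressed collagen chain can include a signal sequence for targeting the expressed protein to a subcellular compartment such as the apoplast or an organelle (e.g., chloroplast). Examples of suitable signal sequences include the chloroplast transit peptide (included in Swiss-Prot entry P07689, amino acids 1-57) and the mitochondrion transit peptide (included in Swiss-Prot entry P46643, amino acids 1-28). The Examples section which follows provides additional examples of suitable signal sequences as well as guidelines for employing such signal sequences in expression of collagen chains in plant cells.
Alternatively, the sequence of the collagen chain can be modified in a way that alters the cellular localization of collagen when expressed in plants.
As mentioned hereinabove, the ER of plants includes a P4H that is incapable of correctly hydroxylating collagen chains. Collagen alpha chains natively include an ER targeting sequence that directs expressed collagen into the ER, where it is post-translationally modified (including incorrect hydroxylation). Thus, removal of the ER targeting sequence will lead to cytoplasmic accumulation of collagen chains that are devoid of post-translational modification, including any hydroxylation.
Example 1 of the Examples section which follows describes generation of collagen sequences that are devoid of ER sequences.
Still alternatively, collagen chains can be expressed and accumulated in a DNA-containing organelle such as the chloroplast or mitochondria. Further description of chloroplast expression is provided hereinbelow.
As mentioned hereinabove, hydroxylation of alpha chains is required for assembly of stable type I collagen. Since alpha chains expressed by the genetically modified plant of the present invention accumulate in a compartment devoid of endogenous P4H activity, such chains must be isolated from the plant, plant tissue or cell and hydroxylated in vitro. Such hydroxylation can be achieved by the method described by Turpeenniemi-Hujanen and Myllyla (Concomitant hydroxylation of proline and lysine residues in collagen using purified enzymes in vitro. Biochim Biophys Acta. 1984 Jul 16;800(1):59-65).
Although such in-vitro hydroxylation can lead to correctly hydroxylated collagen chains, it can be difficult and costly to achieve.
To overcome the limitations of in-vitro hydroxylation, the genetically modified plant of the present invention preferably also co-expresses P4H capable of correctly hydroxylating the collagen alpha chain(s) [i.e., hydroxylating only the proline (Y) position of Gly-X-Y triplets]. P4H is an enzyme composed of two subunits, alpha and beta. Both are needed to form an active enzyme, and the beta subunit also possesses a chaperone function.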
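The notion of "correct" hydroxylation, that is, modification of prolines in the Y position only, can be made concrete with a small Python sketch. It is a toy model under stated assumptions: the input is a one-letter amino-acid string whose Gly-X-Y frame starts at index 0, hydroxyproline is written as "O" (a common one-letter convention), and the function names and test sequence are hypothetical. It does not model enzyme kinetics or real substrate preferences.

```python
from typing import Dict, List


def proline_positions_by_slot(seq: str) -> Dict[str, List[int]]:
    """Return 0-based indices of prolines found in the X and Y slots.

    Index i is in the Gly slot when i % 3 == 0, the X slot when
    i % 3 == 1 and the Y slot when i % 3 == 2.
    """
    slots: Dict[str, List[int]] = {"X": [], "Y": []}
    for i, residue in enumerate(seq.upper()):
        if residue != "P":
            continue
        if i % 3 == 1:
            slots["X"].append(i)
        elif i % 3 == 2:
            slots["Y"].append(i)
    return slots


def hydroxylate_y_only(seq: str) -> str:
    """Replace only Y-slot prolines with 'O' (hydroxyproline)."""
    residues = list(seq.upper())
    for i in proline_positions_by_slot(seq)["Y"]:
        residues[i] = "O"
    return "".join(residues)


# Toy sequence for illustration only.
toy = "GPPGAPGPA"
print(proline_positions_by_slot(toy))
print(hydroxylate_y_only(toy))
```

In this model, X-slot prolines are left untouched, which mirrors the Y-position specificity that the text attributes to the co-expressed P4H.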
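The P4H expressed by the genetically modified plant of the present invention is preferably a human P4H, encoded by, for example, SEQ ID NOs: 12 and 14. In addition, P4H mutants that exhibit enhanced substrate specificity, or P4H homologues, can also be used.
A suitable P4H homologue is exemplified by an Arabidopsis oxidoreductase identified by NCBI accession NP_179363. Pairwise alignment of this protein sequence and a human P4H alpha subunit conducted by the present inventors revealed the highest homology between functional domains of any known P4H homologue of plants.
Since P4H needs to co-accumulate with the expressed collagen chain, its coding sequence is preferably modified accordingly (addition of signal sequences, deletions which may prevent ER targeting, etc.).
In mammalian cells, collagen is also modified by lysyl hydroxylase, galactosyltransferase and glucosyltransferase. These enzymes sequentially modify lysyl residues in specific positions to hydroxylysyl, galactosylhydroxylysyl and glucosylgalactosylhydroxylysyl residues. A single human enzyme, lysyl hydroxylase 3 (LH3), can catalyze all three consecutive steps in hydroxylysine-linked carbohydrate formation.
Thus, the genetically modified plant of the present invention preferably also expresses mammalian LH3. An LH3-encoding sequence such as that set forth by SEQ ID NO: 22 can be used for this purpose.
The collagen chain(s) and modifying enzymes described above can be expressed from a stably integrated or a transiently expressed nucleic acid construct that includes polynucleotide sequences encoding the alpha chains and/or modifying enzymes (e.g., P4H and LH3) positioned under the transcriptional control of plant-functional promoters. Such a nucleic acid construct (also termed herein an expression construct) can be configured for expression throughout the whole plant, in defined plant tissues or defined plant cells, or at defined developmental stages of the plant. Such a construct may also include selection markers (e.g., antibiotic resistance), enhancer elements and an origin of replication for bacterial replication.
Constructs that include two expressible inserts (e.g., two alpha chain types, or an alpha chain and P4H) preferably include an individual promoter for each insert; alternatively, such constructs can express a single transcript chimera including both insert sequences from a single promoter. In such a case, the chimeric transcript includes an IRES sequence between the two insert sequences such that the downstream insert can be translated therefrom.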
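The two construct layouts described above (one promoter per insert, or a single promoter driving a chimeric transcript with an IRES between the inserts) can be sketched as a small data structure. This is only an illustration: the class, the helper functions and the insert labels such as "alpha1" and "P4H" are hypothetical names chosen for the example, not elements of the constructs described in the Examples.

```python
from dataclasses import dataclass
from typing import List, Tuple

IRES = "IRES"


@dataclass(frozen=True)
class Cassette:
    """One promoter followed by an ordered tuple of transcript elements."""
    promoter: str
    elements: Tuple[str, ...]


def separate_promoters(promoters: List[str], inserts: List[str]) -> List[Cassette]:
    """Layout 1: each insert gets its own promoter (one cassette per insert)."""
    return [Cassette(p, (ins,)) for p, ins in zip(promoters, inserts)]


def bicistronic(promoter: str, first: str, second: str) -> Cassette:
    """Layout 2: one promoter, two inserts separated by an IRES."""
    return Cassette(promoter, (first, IRES, second))


def downstream_inserts_translatable(cassette: Cassette) -> bool:
    """Check that every insert after the first is directly preceded by an IRES."""
    coding = [i for i, e in enumerate(cassette.elements) if e != IRES]
    return all(cassette.elements[i - 1] == IRES for i in coding[1:])


layout1 = separate_promoters(["CaMV 35S", "CaMV 35S"], ["alpha1", "P4H"])
layout2 = bicistronic("CaMV 35S", "alpha1", "P4H")
print(layout1)
print(layout2, downstream_inserts_translatable(layout2))
```

The validity check encodes the single point made in the text: without an IRES upstream of it, the second insert of a single transcript would not be expected to be translated.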
조직 특이적, 성장 특이적, 구성적 또는 유도가능적일 수 있는 다수의 식물 작용성 발현 프로모터 및 인핸서가 본 발명의 작제물에 의해 사용될 수 있고, 일부의 예가 아래에 제공된다.
본 명세서 및 다음에 오는 청구범위 부분에서 사용된 어구 "식물 프로모터" 또는 "프로모터"는 식물 세포(DNA 함유 세포기관 포함)에서 유전자를 발현하게 할 수 있는 프로모터를 포함한다. 이러한 프로모터는 식물, 박테리아, 바이러스, 진균 또는 동물 기원으로부터 유래될 수 있다. 이러한 프로모터는 구성적(즉, 복수의 식물 조직에서 고수준으로 유전자를 발현가능), 조직 특이적(즉, 복수의 식물 조직 또는 조직들에서 유전자를 발현가능), 유도가능적(즉, 자극하에 유전자를 발현가능) 또는 키메라적(즉, 적어도 두 개의 상이한 프로모터의 부분의 형성)일 수 있다.
따라서, 사용된 식물 프로모터는 구성적 프로모터, 조직 특이적 프로모터, 유도가능적 프로모터 또는 키메라 프로모터일 수 있다.
Examples of constitutive plant promoters include, without being limited to, the CaMV35S and CaMV19S promoters, the FMV34S promoter, the sugarcane bacilliform badnavirus promoter, the CsVMV promoter, the Arabidopsis ACT2/ACT8 actin promoter, the Arabidopsis ubiquitin UBQ1 promoter, the barley leaf thionin BTH6 promoter and the rice actin promoter.
Examples of tissue specific promoters include, without being limited to, the bean phaseolin storage protein promoter, the DLEC promoter, the PHS promoter, the zein storage protein promoter, the conglutin gamma promoter from soybean, the AT2S1 gene promoter, the ACT11 actin promoter from Arabidopsis, the napA promoter from Brassica napus and the potato patatin gene promoter.
An inducible promoter is a promoter induced by a specific stimulus, such as stress conditions including light, temperature, chemicals, drought, high salinity, osmotic shock, oxidant conditions or pathogenicity. Examples include, without being limited to, the light-inducible promoter derived from the pea rbcS gene; the promoter from the alfalfa rbcS gene; the promoters DRE, MYC and MYB, active in drought; the promoters INT, INPS, prxEa, Ha hsp17.7G4 and RD21, active in high salinity and osmotic stress; and the promoters hsr203J and str246C, active in pathogenic stress.
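Preferably, the promoter used by the present invention is a strong constitutive promoter, so that the construct inserts are overexpressed following plant transformation.
It will be appreciated that any of the construct types used in the present invention can be co-transformed into the same plant using the same or a different selection marker in each construct type. Alternatively, the first construct type can be introduced into a first plant and the second construct type into a second, isogenic plant, after which the resulting transgenic plants are crossed and the progeny selected for double transformants. Further self-crosses of such progeny can be used to generate lines homozygous for both constructs.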
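The selfing step can be illustrated with a small Mendelian calculation. The sketch below is not part of the disclosed method; it assumes each construct sits at a single locus, that the two loci are unlinked, and that the selfed double transformant is hemizygous at both. The function name is hypothetical.

```python
from fractions import Fraction
from itertools import product

def selfing_genotype_frequencies(n_loci=2):
    """Genotype frequencies after selfing a plant hemizygous at n unlinked loci.

    A genotype is a tuple of transgene copy numbers (0, 1 or 2), one per locus.
    Assumes single-copy insertions and independent Mendelian segregation.
    """
    per_locus = {0: Fraction(1, 4), 1: Fraction(1, 2), 2: Fraction(1, 4)}
    freqs = {}
    for genotype in product(per_locus, repeat=n_loci):
        p = Fraction(1)
        for copies in genotype:
            p *= per_locus[copies]
        freqs[genotype] = p
    return freqs

freqs = selfing_genotype_frequencies(2)
double_homozygous = freqs[(2, 2)]
print(double_homozygous)  # 1/16
```

Under these assumptions about one selfed progeny in sixteen carries both constructs in the homozygous state, which gives a rough idea of how many progeny need to be screened.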
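There are various methods of introducing nucleic acid constructs into both monocotyledonous and dicotyledonous plants [Potrykus, I., Annu. Rev. Plant. Physiol., Plant. Mol. Biol. (1991) 42:205-225; Shimamoto et al., Nature (1989) 338:274-276]. Such methods rely either on stable integration of the nucleic acid construct, or a portion of it, into the plant genome, or on transient expression of the nucleic acid construct, in which case these sequences are not inherited by the progeny of the plant.
In addition, several methods exist in which a nucleic acid construct can be introduced directly into the DNA of a DNA-containing organelle such as a chloroplast.
There are two principal methods of achieving stable genomic integration of exogenous sequences, such as those included within the nucleic acid constructs of the present invention, into plant genomes: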
(i) Agrobacterium-mediated gene transfer: see, for example, Klee et al. (1987) Annu. Rev. Plant Physiol. 38:467-486; Klee and Rogers in Cell Culture and Somatic Cell Genetics of Plants, Vol. 6, Molecular Biology of Plant Nuclear Genes, eds. Schell, J., and Vasil, L. K., Academic Publishers, San Diego, Calif. (1989) p. 2-25; Gatenby, in Plant Biotechnology, eds. Kung, S. and Arntzen, C. J., Butterworth Publishers, Boston, Mass. (1989) p. 93-112.
(ii) Direct DNA uptake: see, for example, Paszkowski et al., in Cell Culture and Somatic Cell Genetics of Plants, Vol. 6, Molecular Biology of Plant Nuclear Genes, eds. Schell, J., and Vasil, L. K., Academic Publishers, San Diego, Calif. (1989) p. 52-68; including methods for direct uptake of DNA into protoplasts, Toriyama, K. et al. (1988) Bio/Technology 6:1072-1074; DNA uptake induced by brief electric shock of plant cells: Zhang et al. Plant Cell Rep. (1988) 7:379-384; Fromm et al. Nature (1986) 319:791-793; DNA injection into plant cells or tissues by particle bombardment: Klein et al. Bio/Technology (1988) 6:559-563; McCabe et al. Bio/Technology (1988) 6:923-926; Sanford, Physiol. Plant. (1990) 79:206-209; by the use of micropipette systems: Neuhaus et al., Theor. Appl. Genet. (1987) 75:30-36; Neuhaus and Spangenberg, Physiol. Plant. (1990) 79:213-217; or by the direct incubation of DNA with germinating pollen: DeWet et al. in Experimental Manipulation of Ovule Tissue, eds. Chapman, G. P. and Mantell, S. H. and Daniels, W., Longman, London (1985) p. 197-209; and Ohta, Proc. Natl. Acad. Sci. USA (1986) 83:715-719.
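The Agrobacterium system uses plasmid vectors that contain defined DNA segments which integrate into the plant genomic DNA. Methods of inoculating the plant tissue vary depending on the plant species and the Agrobacterium delivery system. A widely used approach is the leaf-disc procedure, which can be performed with any tissue explant that provides a good source for initiation of whole-plant differentiation [Horsch et al. in Plant Molecular Biology Manual A5, Kluwer Academic Publishers, Dordrecht (1988) p. 1-9]. A supplementary approach employs the Agrobacterium delivery system in combination with vacuum infiltration. The Agrobacterium system is especially useful for generating transgenic dicotyledonous plants.
There are various methods of direct DNA transfer into plant cells. In electroporation, protoplasts are briefly exposed to a strong electric field. In microinjection, the DNA is mechanically injected directly into the cells using very small micropipettes. In microparticle bombardment, the DNA is adsorbed on microprojectiles such as magnesium sulfate crystals, tungsten particles or gold particles, and the microprojectiles are physically accelerated into cells or plant tissues.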
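Following transformation, the plant is propagated. The most common method of plant propagation is by seed. Regeneration by seed propagation, however, has the drawback that heterozygosity leads to a lack of uniformity in the crop, since seeds are produced by plants according to the genetic variance governed by Mendelian rules. In essence, each seed is genetically different and each will grow with its own specific traits. It is therefore preferred that the transformed plant be produced such that the regenerated plant has the same traits and characteristics as the parent transgenic plant. Accordingly, the transformed plant is preferably regenerated by micropropagation, which provides rapid, consistent reproduction of the transformed plants.
Transient expression methods that can be used to transiently express the isolated nucleic acid included within the nucleic acid construct of the present invention include, but are not limited to, microinjection and bombardment as described above, but under conditions that favor transient expression, and virus-mediated expression, in which a packaged or unpackaged recombinant virus vector including the nucleic acid construct is used to infect plant tissues or cells such that a propagating recombinant virus established therein expresses the non-viral nucleic acid sequence.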
Viruses that have been shown to be useful for the transformation of plant hosts include CaMV, TMV and BV. Transformation of plants using plant viruses is described, for example, in U.S. Pat. No. 4,855,237 (BGMV); EPA 67,553 (TMV); Japanese Published Application No. 63-14693 (TMV); EPA 194,809 (BV); EPA 278,667 (BV); and Gluzman, Y. et al., Communications in Molecular Biology: Viral Vectors, Cold Spring Harbor Laboratory, New York, pp. 172-189 (1988). Pseudovirus particles for use in expressing foreign DNA in many hosts, including plants, are described in WO 87/06261.
Construction of plant RNA viruses for the introduction and expression of non-viral exogenous nucleic acid sequences in plants is demonstrated by the above references as well as by Dawson, W. O. et al. (1989) 172:285-292; Takamatsu et al. EMBO J. (1987) 6:307-311; French, R. et al. Science (1986) 231:1294-1297; and Takamatsu et al. FEBS Letters (1990) 269:73-76.
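When the virus is a DNA virus, the constructions can be made to the virus itself. Alternatively, the virus can first be cloned into a bacterial plasmid for ease of constructing the desired viral vector with the foreign DNA. The virus can then be excised from the plasmid. If the virus is a DNA virus, a bacterial origin of replication can be attached to the viral DNA, which is then replicated by the bacteria. Transcription and translation of this DNA will produce the coat protein that encapsidates the viral DNA. If the virus is an RNA virus, the virus is generally cloned as a cDNA and inserted into a plasmid. The plasmid is then used to make all of the constructions. The RNA virus is then produced by transcribing the viral sequence of the plasmid and translating the viral genes to produce the coat protein(s) that encapsidate the viral RNA.
Construction of plant RNA viruses for the introduction and expression in plants of non-viral exogenous nucleic acid sequences, such as those included in the construct of the present invention, is demonstrated by the above references as well as in U.S. Pat. No. 5,316,931.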
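In a first embodiment, a plant viral nucleic acid is provided in which the native coat protein coding sequence has been deleted from the viral nucleic acid, and a non-native plant viral coat protein coding sequence and a non-native promoter, preferably the subgenomic promoter of the non-native coat protein coding sequence, have been inserted. The inserted sequences are capable of expression in the plant host, of packaging the recombinant plant viral nucleic acid, and of ensuring systemic infection of the host by the recombinant plant viral nucleic acid. Alternatively, the coat protein gene may be inactivated by insertion of the non-native nucleic acid sequence within it, such that a protein is produced. The recombinant plant viral nucleic acid may contain one or more additional non-native subgenomic promoters. Each non-native subgenomic promoter is capable of transcribing or expressing adjacent genes or nucleic acid sequences in the plant host and is incapable of recombination with the other subgenomic promoters, native or non-native. If more than one nucleic acid sequence is included, non-native (foreign) nucleic acid sequences may be inserted adjacent to the native plant viral subgenomic promoter, or to the native and non-native plant viral subgenomic promoters. The non-native nucleic acid sequences are transcribed or expressed in the host plant under control of the subgenomic promoter to produce the desired products.
In a second embodiment, a recombinant plant viral nucleic acid is provided as in the first embodiment, except that the native coat protein coding sequence is placed adjacent to one of the non-native coat protein subgenomic promoters instead of a non-native coat protein coding sequence.
In a third embodiment, a recombinant plant viral nucleic acid is provided in which the native coat protein gene is adjacent to its subgenomic promoter and one or more non-native subgenomic promoters have been inserted into the viral nucleic acid. The inserted non-native subgenomic promoters are capable of transcribing or expressing adjacent genes in a plant host and are incapable of recombination with each other and with native subgenomic promoters. Non-native nucleic acid sequences may be inserted adjacent to the non-native subgenomic plant viral promoters such that the sequences are transcribed or expressed in the host plant under control of the subgenomic promoters to produce the desired product.
In a fourth embodiment, a recombinant plant viral nucleic acid is provided as in the third embodiment, except that the native coat protein coding sequence is replaced by a non-native coat protein coding sequence.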
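The viral vectors are encapsidated by the coat proteins encoded by the recombinant plant viral nucleic acid to produce a recombinant plant virus. The recombinant plant viral nucleic acid or recombinant plant virus is used to infect appropriate host plants. The recombinant plant viral nucleic acid is capable of replication in the host, systemic spread in the host, and transcription or expression of the foreign gene(s) (isolated nucleic acid) in the host to produce the desired protein.
A technique for introducing exogenous nucleic acid sequences into the chloroplast genome is known. It involves the following procedures. First, plant cells are chemically treated so as to reduce the number of chloroplasts per cell to about one. Then, the exogenous nucleic acid is introduced into the cells via particle bombardment, with the aim of introducing at least one exogenous nucleic acid molecule into the chloroplasts. The exogenous nucleic acid is selected such that it can integrate into the chloroplast genome via homologous recombination, which is readily effected by enzymes inherent to the chloroplast. To this end, the exogenous nucleic acid includes, in addition to a gene of interest, at least one nucleic acid stretch derived from the chloroplast genome. In addition, the exogenous nucleic acid includes a selectable marker, which serves, through sequential selection procedures, to ascertain that all or substantially all copies of the chloroplast genome following such selection include the exogenous nucleic acid. Further details relating to this technique are found in U.S. Pat. Nos. 4,945,050 and 5,693,507, which are incorporated herein by reference. A polypeptide can thus be produced by the protein expression system of the chloroplast and become integrated into the chloroplast inner membrane.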
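The transformation approaches described above can be used to produce collagen chains and/or modifying enzymes, as well as assembled collagen (with or without propeptides), in any species of plant, or in plant tissue or isolated plant cells derived therefrom.
Preferred plants are those capable of accumulating large amounts of collagen chains, collagen and/or the processing enzymes described herein. Such plants may also be selected according to their resistance to stress conditions and the ease with which expressed components or assembled collagen can be extracted. Examples of preferred plants include tobacco, maize, alfalfa, rice, potato, soybean, tomato, wheat, barley, canola and cotton.
Collagen fibers are used extensively in the food and cosmetics industries. Thus, although collagen fiber components (alpha chains) and modifying enzymes expressed by plants are useful in the industrial synthesis of collagen, complete collagen production in plants is preferred for its simplicity and cost effectiveness.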
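Several approaches can be used to generate type I collagen in plants. For example, a collagen alpha 1 chain can be isolated from a plant expressing collagen alpha 1 and P4H (and optionally LH3) and mixed with a collagen alpha 2 chain isolated from a plant expressing collagen alpha 2 and P4H (and optionally LH3 and protease C and/or N). Since the collagen alpha 1 chain self-assembles into a triple helix by itself, it may be necessary to denature such a homotrimer prior to mixing and renaturation with the collagen alpha 2 chain.
Preferably, a first plant expressing collagen alpha 1 and P4H (and optionally LH3 and protease C and/or N) is crossed with a second (and preferably isogenic) plant that expresses collagen alpha 2. Alternatively, a first plant expressing both alpha chains is crossed with a second plant expressing P4H and optionally LH3 and protease C and/or N.
It should be noted that although the plant breeding approaches described above use two individually transformed plants, approaches that use three or more individually transformed plants, each expressing one or two components, can also be used.
One of ordinary skill in the art is well aware of various plant breeding techniques, and no further description of such techniques is provided herein.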
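Although plant breeding approaches are preferred, it should be noted that a single plant expressing collagen alpha 1 and 2, P4H and LH3 (and optionally protease C and/or N) can be generated via several transformation events, each designed to introduce one or more expressible components into the cell. In such cases, the stability of each transformation event can be verified using specific selection markers.
In any case, transformation and plant breeding approaches can be used to generate any plant expressing any number of components. Presently preferred are plants that express collagen alpha 1 and 2 chains, P4H, LH3 and at least one protease (e.g., protease C and/or N). As further described in the Examples section that follows, such plants accumulate collagen that is stable at temperatures of up to 42 °C.
Progeny resulting from breeding, or alternatively from multiply transformed plants, can be selected by verifying the presence of exogenous mRNA and/or polypeptides using nucleic acid or protein probes (e.g., antibodies). The latter approach is preferred since it enables localization of the expressed polypeptide components (for example, by probing fractionated plant extracts) and thus also verifies the potential for correct processing and assembly. Examples of suitable probes are provided in the Examples section that follows.
Once collagen-expressing progeny are identified, such plants are further cultivated under conditions that maximize expression of the collagen chains and the modifying enzymes.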
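Since free proline accumulation may facilitate overproduction of different proline-rich proteins, including the collagen chains expressed by the genetically modified plants of the present invention, preferred cultivation conditions are those that increase free proline accumulation in the cultivated plant.
Free proline accumulates in a variety of plants in response to a wide range of environmental stresses, including water deprivation, salinization, low temperature, high temperature, pathogen infection, heavy metal toxicity, anaerobiosis, nutrient deficiency, atmospheric pollution and UV irradiation (Hare and Cress, 1997).
Free proline may also accumulate in response to treatment of the plant or soil with compounds such as ABA, or with stress-inducing compounds such as copper salts, paraquat, salicylic acid and the like.
Thus, collagen-expressing progeny can be grown under different stress conditions (e.g., different NaCl concentrations ranging from 50 mM to 250 mM). To further enhance collagen production, the effect of various stress conditions on collagen expression will be examined and optimized with respect to plant viability, biomass and collagen accumulation.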
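As an illustration of how such a salt-stress series might be prepared, the following sketch applies the dilution relation C1·V1 = C2·V2. Only the 50 mM to 250 mM range comes from the text; the 50 mM step, the 1 M stock and the 100 ml final volume are assumptions chosen for the example, and the function name is hypothetical.

```python
def dilution_plan(stock_mM, targets_mM, final_volume_ml):
    """Return (target mM, ml of stock, ml of diluent) for each target concentration.

    Uses C1*V1 = C2*V2; assumes volumes are additive.
    """
    plan = []
    for target in targets_mM:
        if target > stock_mM:
            raise ValueError("target concentration exceeds the stock concentration")
        stock_ml = final_volume_ml * target / stock_mM
        plan.append((target, stock_ml, final_volume_ml - stock_ml))
    return plan

# Assumed values: 1 M NaCl stock, 100 ml per treatment, 50 mM steps.
targets = list(range(50, 251, 50))
plan = dilution_plan(1000.0, targets, 100.0)
for target, stock_ml, diluent_ml in plan:
    print(f"{target} mM NaCl: {stock_ml:.1f} ml stock + {diluent_ml:.1f} ml diluent")
```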
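Plant tissues/cells are preferably harvested at maturity, and the collagen fibers are isolated using well-known prior art extraction approaches; one such approach is detailed below.
Leaves of transgenic plants are ground to a powder under liquid nitrogen, and the homogenate is extracted in 0.5 M acetic acid containing 0.2 M NaCl for 60 hours at 4 °C. Insoluble material is removed by centrifugation. The supernatant containing the recombinant collagen is salt-fractionated at 0.4 M and 0.7 M NaCl. The 0.7 M NaCl precipitate, containing the recombinant heterotrimeric collagen, is dissolved in and dialyzed against 0.1 M acetic acid and stored at -20 °C (Ruggiero et al., 2000).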
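The salt-fractionation steps can be turned into a rough weighing calculation. The sketch below is an illustration only: it assumes the salt is added as solid NaCl, that the extract already contains 0.2 M NaCl from the extraction buffer, and that the volume change on adding salt is negligible. The 10 ml extract volume and the function name are hypothetical.

```python
NACL_G_PER_MOL = 58.44  # molar mass of NaCl

def nacl_to_add_g(current_M, target_M, volume_ml):
    """Grams of solid NaCl needed to raise a solution from current_M to target_M.

    Assumes the volume does not change when the salt dissolves.
    """
    if target_M < current_M:
        raise ValueError("target concentration is below the current concentration")
    return (target_M - current_M) * (volume_ml / 1000.0) * NACL_G_PER_MOL

volume_ml = 10.0  # hypothetical extract volume
step_to_0_4 = nacl_to_add_g(0.2, 0.4, volume_ml)  # first cut, 0.2 M -> 0.4 M
step_to_0_7 = nacl_to_add_g(0.4, 0.7, volume_ml)  # second cut, 0.4 M -> 0.7 M
print(f"{step_to_0_4:.3f} g, then {step_to_0_7:.3f} g")
```

In practice the supernatant volume after the first cut would be measured again before the second addition; the sketch keeps the volume fixed for simplicity.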
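Additional objects, advantages and novel features of the present invention will become apparent to one ordinarily skilled in the art upon examination of the following examples, which are not intended to be limiting. Additionally, each of the various embodiments and aspects of the present invention as delineated above and as claimed in the claims section below finds experimental support in the following examples.
EXAMPLES
Reference is now made to the following examples, which together with the above descriptions illustrate the invention in a non-limiting fashion.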
Generally, the nomenclature used herein and the laboratory procedures used in the present invention include molecular, biochemical, microbiological and recombinant DNA techniques. Such techniques are thoroughly explained in the literature. See, for example, "Molecular Cloning: A laboratory Manual" Sambrook et al., (1989); "Current Protocols in Molecular Biology" Volumes I-III Ausubel, R.M., ed. (1994); Ausubel et al., "Current Protocols in Molecular Biology", John Wiley and Sons, Baltimore, Maryland (1989); Perbal, "A Practical Guide to Molecular Cloning", John Wiley & Sons, New York (1988); Watson et al., "Recombinant DNA", Scientific American Books, New York; Birren et al., (eds) "Genome Analysis: A laboratory Manual Series", Vols. 1-4, Cold Spring Harbor laboratory Press, New York (1998); methodologies as set forth in U.S. Pat. Nos. 4,666,828; 4,683,202; 4,801,531; 5,192,659 and 5,272,057; "Cell Biology: A laboratory Handbook", Volumes I-III Cellis, J.E., ed. (1994); "Current Protocols in Immunology" Volumes I-III Coligan J.E., ed. (1994); Stites et al., (eds), "Basic and Clinical Immunology" (8th Edition), Appleton & Lange, Norwalk, CT (1994); Mishell and Shiigi (eds), "Selected Methods in Cellular Immunology", W.H. Freeman and Co., New York (1980). Available immunoassays are extensively described in the patent and scientific literature; see, for example, U.S. Pat. Nos. 3,791,932; 3,839,153; 3,850,752; 3,850,578; 3,853,987; 3,867,517; 3,879,262; 3,901,654; 3,935,074; 3,984,533; 3,996,345; 4,034,074; 4,098,876; 4,879,219; 5,011,771 and 5,281,521; "Oligonucleotide Synthesis" Gait, M.J., ed. (1984); "Nucleic Acid Hybridization" Hames, B.D., and Higgins S.J., eds. (1985); "Transcription and Translation" Hames, B.D., and Higgins S.J., Eds. (1984); "Animal Cell Culture" Freshney, R.I., ed. (1986); "Immobilized Cells and Enzymes" IRL Press, (1986); "A Practical Guide to Molecular Cloning" Perbal, B., (1984) and "Methods in Enzymology" Vol. 1-317, Academic Press; "PCR Protocols: A Guide To Methods And Applications", Academic Press, San Diego, CA (1990); Marshak et al., "Strategies for Protein Purification and Characterization - A laboratory Course Manual" CSHL Press (1996); all of which are incorporated by reference as if fully set forth herein. Other general references are provided throughout this document. The procedures therein are believed to be well known in the art and are provided for the convenience of the reader. All the information contained therein is incorporated herein by reference.
EXAMPLE 1
Constructs and Transformation Schemes
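Construction of the expression cassettes and vectors used in this work is illustrated in FIGs. 1a-d. All of the coding sequences in this work were optimized for expression in tobacco and chemically synthesized with the desired flanking regions (SEQ ID NOs: 1, 4, 7, 12, 14, 16, 18, 20, 22). FIG. 1a - the synthetic genes coding for Col1 and Col2 (SEQ ID NOs: 1, 4), fused either to the vacuolar signal or to the apoplast signal (encoded by SEQ ID NO: 7) or without signals, were cloned in expression cassettes composed of a Chrysanthemum rbcS1 promoter and 5' UTR (SEQ ID NO: 10) and a Chrysanthemum rbcS1 3' UTR and terminator (SEQ ID NO: 11). The complete expression cassettes were cloned in the multiple cloning site of the pBINPLUS plant transformation vector (van Engelen et al., 1995, Transgenic Res 4: 288-290). FIG. 1b - the synthetic genes coding for P4H beta-human, P4H alpha-human and P4H-plant (SEQ ID NOs: 12, 14 and 16), fused either to the vacuolar signal or to the apoplast signal (encoded by SEQ ID NO: 7) or without signals, were cloned in expression cassettes composed of the CaMV 35S promoter and TMV omega sequence and the Agrobacterium nopaline synthetase (NOS) terminator carried by the vector pJD330 (Galili et al., 1987, Nucleic Acids Res 15: 3257-3273). The complete expression cassettes were cloned in the multiple cloning site of the pBINPLUS vectors carrying the expression cassettes of Col1 or Col2. FIG. 1c - the synthetic genes coding for Proteinase C and Proteinase N (SEQ ID NOs: 18, 20), fused either to the vacuolar signal or to the apoplast signal (encoded by SEQ ID NO: 7), were cloned in expression cassettes composed of a Chrysanthemum rbcS1 promoter and 5' UTR (SEQ ID NO: 10) and a Chrysanthemum rbcS1 3' UTR and terminator (SEQ ID NO: 11). The complete expression cassettes were cloned in the multiple cloning site of the pBINPLUS plant transformation vector. FIG. 1d - the synthetic gene coding for LH3 (SEQ ID NO: 22), flanked by the Strawberry vein banding virus (SVBV) promoter (NCBI accession AF331666 REGION: 623..950 version AF331666.1 GI:13345788) and terminated by the Agrobacterium octopin synthase (OCS) terminator (NCBI accession Z37515 REGION: 1344..1538 version Z37515.1 GI:886843), fused either to the vacuolar signal or to the apoplast signal (encoded by SEQ ID NO: 7) or without signals, was cloned in the multiple cloning site of the pBINPLUS vector carrying the expression cassettes of Col1 and P4H beta.
Co-transformation schemes that introduce the expression cassettes described in FIG. 1 into a host plant are illustrated in FIG. 2. Each expression cassette insert is represented by a short name of the coding sequence. The coding sequences and related SEQ ID NOs are described in Table 1. Each co-transformation is performed with two pBINPLUS binary vectors. Each rectangle represents a single pBINPLUS vector carrying one, two or three expression cassettes. Promoters and terminators are specified in FIG. 1.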
EXAMPLE 2
Plant Collagen Expression
Synthetic polynucleotide sequences encoding the proteins listed in Table 1 below were designed and optimized for expression in tobacco plants.
Table 1 - List of expressed proteins
Name | SwissProt accession | Amino acids | Splicing isoform | Deletions | Short name | Included in SEQ ID NO. | Encoded by SEQ ID NO. |
Collagen alpha 1(I) chain [Precursor] | p02452 | 1442 | One version | ER signal | Col1 | 3 | 1 |
Collagen alpha 2(I) chain [Precursor] | p08123 (two changes made to p08123: D549A and N249I) | 1342 | One version | ER signal | Col2 | 6 | 4 |
Prolyl 4-hydroxylase beta subunit | p07237 | 487 | One version | ER signal, KDEL | P4H beta human | 13 | 12 |
Prolyl 4-hydroxylase alpha-1 subunit | p13674 | 517 | P13674-1 | ER signal | P4H alpha human | 15 | 14 |
Prolyl 4-hydroxylase, plant | No entry in SwissProt; NCBI accession gi:15227885 | 252 | One version | Mitochondrial signal predicted as aa 1-39 | P4H plant | 17 | 16 |
Procollagen C-proteinase | p13497 | 866 | P13497-1 BMP1-3 | ER signal, propeptide | Proteinase C | 19 | 18 |
Procollagen I N-proteinase | o95450 | 958 | O95450-1 LpNPI | ER signal, propeptide | Proteinase N | 21 | 20 |
Lysyl hydroxylase 3 | o60568 | 714 | One version | ER signal | LH3 | 23 | 22 |
Signal Peptides
(i) Vacuole signal sequence of the barley gene for the thiol protease aleurain precursor (NCBI accession P05167 GI:113603):
MAHARVLLLALAVLATAAVAVASSSSFADSNPIRPVTDRAASTLA (SEQ ID NO: 24).
(ii) Apoplast signal of Arabidopsis thaliana endo-1,4-beta-glucanase (Cel1, NCBI accession CAA67156.1 GI:2440033); SEQ ID NO: 9, encoded by SEQ ID NO: 7.
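The vacuolar signal peptide in (i) can be cross-checked against the sequence listing. The sketch below translates the first 45 codons of the coding region annotated in SEQ ID NO: 2 (CDS starting at position 175) with the standard genetic code and compares the result with SEQ ID NO: 24. The codons are copied from the listing; the helper names are hypothetical and the check is offered only as an illustration.

```python
# Standard genetic code, built in the conventional t, c, a, g ordering.
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(dna):
    """Translate a DNA string codon by codon; trailing partial codons are ignored."""
    dna = dna.lower()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 45 codons of the CDS in SEQ ID NO: 2 (positions 175-309), copied from the listing.
cds_start = "".join("""
atg gct cac gct cgt gtt ctc ctc ctc gct ctc gct gtt ttg gca aca gct
gct gtg gct gtg gct tct agt tct tct ttt gct gat tca aac cct att
aga cct gtt act gat aga gca gct tcc act ttg gct
""".split())

SEQ_ID_24 = "MAHARVLLLALAVLATAAVAVASSSSFADSNPIRPVTDRAASTLA"
peptide = translate(cds_start)
print(peptide == SEQ_ID_24)
```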
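Construction of Plasmids
Plant expression vectors were constructed as taught in Example 1, and the composition of each constructed expression vector was confirmed via restriction analysis and sequencing.
Expression vectors including the following expression cassettes were constructed: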
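1. Collagen alpha 1
2. Collagen alpha 1 + human P4H beta subunit
3. Collagen alpha 1 + human P4H beta subunit + human LH3
4. Collagen alpha 2
5. Collagen alpha 2 + human P4H alpha subunit
6. Collagen alpha 2 + Arabidopsis P4H
7. Human P4H beta subunit + human LH3
8. Human P4H alpha subunit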
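Each of the coding sequences described above was either translationally fused to a vacuole transit peptide or to an apoplast transit peptide, or was devoid of any transit peptide sequence, in which case cytoplasmic accumulation is expected.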
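Because each co-transformation uses two binary vectors, it can help to enumerate which pairs of the eight vectors listed above would together supply a given set of components. The following sketch does this with plain sets. The component labels are shorthand introduced for the example, and the function is hypothetical; it says nothing about which combinations were actually used.

```python
from itertools import combinations

# Expression vectors 1-8 as listed above; labels are shorthand for this example.
VECTORS = {
    1: {"Col1"},
    2: {"Col1", "P4H_beta"},
    3: {"Col1", "P4H_beta", "LH3"},
    4: {"Col2"},
    5: {"Col2", "P4H_alpha"},
    6: {"Col2", "P4H_plant"},
    7: {"P4H_beta", "LH3"},
    8: {"P4H_alpha"},
}

def vector_pairs_covering(required):
    """Return vector-number pairs whose combined cassettes include all of `required`."""
    required = set(required)
    return [
        (a, b)
        for a, b in combinations(sorted(VECTORS), 2)
        if required <= (VECTORS[a] | VECTORS[b])
    ]

core = {"Col1", "Col2", "P4H_alpha", "P4H_beta"}
print(vector_pairs_covering(core))            # [(2, 5), (3, 5)]
print(vector_pairs_covering(core | {"LH3"}))  # [(3, 5)]
```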
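Plant Transformation and PCR Screening
Tobacco plants (Nicotiana tabacum, Samsun NN) were transformed with the expression vectors described above according to the transformation scheme taught in FIG. 2.
The resulting transgenic plants were screened via multiplex PCR using four primers designed to amplify a 324 bp fragment of collagen alpha 1 and a 537 bp fragment of collagen alpha 2 (Table 2). FIG. 3 illustrates the results of one multiplex PCR screen.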
Table 2 - Primers for multiplex PCR amplifying a 324 bp fragment of collagen alpha 1 and a 537 bp fragment of collagen alpha 2
Col1 forward primer (24-mer) | 5' ATCACCAGGAGAACAGGGACCATC 3' | SEQ ID NO: 25 |
Col1 reverse primer (29-mer) | 5' TCCACTTCCAAATCTCTATCCCTAACAAC 3' | SEQ ID NO: 26 |
Col2 forward primer (23-mer) | 5' AGGCATTAGAGGCGATAAGGGAG 3' | SEQ ID NO: 27 |
Col2 reverse primer (27-mer) | 5' TCAATCCAATAATAGCCACTTGACCAC 3' | SEQ ID NO: 28 |
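A simple in-silico check shows how primer pairs and amplicon sizes relate. In the SEQ ID NO: 1 listing, the Col1 forward primer appears to match positions 3621-3644 and the reverse complement of the Col1 reverse primer positions 3916-3944, which is consistent with the stated 324 bp product. The sketch below implements the same search on a synthetic toy template (not the real sequence); the function names and the template are assumptions made for illustration.

```python
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")

def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]

def amplicon_length(template, forward, reverse):
    """Length of the product spanned by a primer pair, or None if either site is missing.

    Only exact matches on the given strand are considered.
    """
    t = template.upper()
    start = t.find(forward.upper())
    if start < 0:
        return None
    site = reverse_complement(reverse).upper()
    end = t.find(site, start + len(forward))
    if end < 0:
        return None
    return end + len(site) - start

PRIMERS = {
    "Col1_fwd": "ATCACCAGGAGAACAGGGACCATC",
    "Col1_rev": "TCCACTTCCAAATCTCTATCCCTAACAAC",
    "Col2_fwd": "AGGCATTAGAGGCGATAAGGGAG",
    "Col2_rev": "TCAATCCAATAATAGCCACTTGACCAC",
}
primer_lengths = {name: len(seq) for name, seq in PRIMERS.items()}

# Toy template: the two Col1 primer sites separated by filler so the product is 324 bp.
filler = "A" * (324 - len(PRIMERS["Col1_fwd"]) - len(PRIMERS["Col1_rev"]))
toy_template = "GG" + PRIMERS["Col1_fwd"] + filler + reverse_complement(PRIMERS["Col1_rev"]) + "CC"
toy_product = amplicon_length(toy_template, PRIMERS["Col1_fwd"], PRIMERS["Col1_rev"])
print(primer_lengths, toy_product)
```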
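EXAMPLE 3
Detection of Human Collagen in Transgenic Tobacco Plants
Total soluble proteins were extracted from tobacco transformants 2, 3 and 4 by grinding 500 mg of leaves in 0.5 ml of 50 mM Tris-HCl pH=7.5 containing a "Complete" protease inhibitor cocktail (product #1836145 from Roche Diagnostics GmbH, 1 tablet per 50 ml buffer). The crude extract was mixed with 250 μl of 4X sample application buffer containing 10% beta-mercaptoethanol and 8% SDS; the samples were boiled for 7 minutes and centrifuged for 8 minutes at 13000 rpm. 20 μl of the supernatant were loaded onto a 10% polyacrylamide gel and tested with anti-collagen I (denatured) antibody (#AB745 from Chemicon Inc.) in a standard Western blot procedure (FIG. 4). W.T. is wild type tobacco. Positive collagen bands are visible in plants that are PCR positive for collagen type I alpha 1, alpha 2 or both. The positive control band of 500 ng collagen type I from human placenta (#CC050 from Chemicon Inc.) represents about 0.3% of the total soluble proteins (about 150 μg) in the samples from the transgenic plants.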
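The 0.3% figure follows directly from the two quantities given above, as the short calculation below shows. It is a worked example only; the function name is hypothetical.

```python
def percent_of_total_protein(band_ng, total_protein_ug):
    """Express a band mass (ng) as a percentage of total soluble protein (micrograms)."""
    return 100.0 * band_ng / (total_protein_ug * 1000.0)

# Values from the text: 500 ng control collagen, about 150 micrograms total soluble protein.
control_percent = percent_of_total_protein(500.0, 150.0)
print(f"{control_percent:.2f}%")  # 0.33%
```

By the same relation, a band at 1% of the same total would correspond to roughly 1500 ng of collagen, three times the control.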
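When collagen was targeted to the vacuole, plants expressing collagen at the expected molecular weight, at up to about 1% of the total soluble proteins, were detected (FIG. 4). Subcellular targeting of full-length collagen to the apoplast was also achieved successfully (FIG. 5). Plants expressing collagen in the cytoplasm (i.e., with no targeting peptide) did not accumulate collagen to detectable levels, showing that subcellular targeting of collagen in plants is critical for success.
In addition, in contrast to the studies of Ruggiero et al. (2000) and Merle et al. (2002), which showed that collagen lacking the N-propeptide was subject to significant proteolysis, with the present approach full-length collagen proteins with C-propeptide and N-propeptide accumulated in subcellular compartments at high levels.
The present data also clearly show that crossing two plants, each expressing a different collagen chain type, is advantageous in that it enables selection of plants expressing optimal levels of each chain type and subsequent crossing of those plants to achieve the desired collagen-producing plant.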
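Collagen produced by the plants of the present invention includes the native propeptides and is therefore expected to form a larger protein than the human control, which was purified by proteolysis. The calculated molecular weights of the collagen alpha 1 and alpha 2 chains without hydroxylation or glycosylation are as follows: Col1 with propeptides - 136 kDa, Col1 without propeptides - 95 kDa, Col2 with propeptides - 127 kDa, Col2 without propeptides - 92 kDa.
As can be seen in FIG. 4, the Col1 bands in transformants 3-5 and 3-49 appear larger than the Col1 bands in other plants. This indicates proline hydroxylation of the collagen chains by the human proline-4-hydroxylase holoenzyme, composed of alpha and beta subunits, that was coexpressed in these plants and targeted to the same subcellular compartment as the human collagen chains (e.g., the vacuole).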
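EXAMPLE 4
Collagen Triple Helix Assembly and Thermal Stability in Transgenic Plants
Assembly of the collagen triple helix and the thermal stability of the helix in transgenic plants were tested by thermal denaturation followed by trypsin or pepsin digestion of the total crude protein extract of transgenic plants (FIGs. 6a-b).
In a first experiment, total soluble proteins were extracted from tobacco 2-9 (expressing collagen alpha only and no P4H) and 3-5 (expressing both collagen alpha 1+2 and P4H) by grinding 500 mg of leaves in 0.5 ml of 50 mM Tris-HCl pH=7.5, centrifuging for 10 minutes at 13000 rpm and collecting the supernatant. 50 μl of the supernatant were heat treated (15 minutes at 33 °C or 43 °C) and then immediately placed on ice. Trypsin digestion was initiated by adding to each sample 6 μl of 1 mg/ml trypsin in 50 mM Tris-HCl pH=7.5. The samples were incubated for 20 minutes at room temperature (about 22 °C). The digestion was terminated by adding 20 μl of 4X sample application buffer containing 10% beta-mercaptoethanol and 8% SDS; the samples were boiled for 7 minutes and centrifuged for 7 minutes at 13000 rpm. 50 μl of the supernatant were loaded onto a 10% polyacrylamide gel and tested with anti-collagen I (denatured) antibody (#AB745 from Chemicon Inc.) in a standard Western blot procedure. Positive controls were samples of about 500 ng human collagen I (#CC050 from Chemicon Inc., extracted from human placenta by pepsin digestion) added to 50 μl of total soluble proteins extracted from W.T. tobacco.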
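As illustrated in FIG. 6a, the collagen triple helix formed in plant #3-5, as well as the control human collagen, was resistant to denaturation at 33 °C. In contrast, the collagen formed by plant #2-9 denatured at 33 °C. This difference in thermal stability indicates successful triple helix assembly and post-translational proline hydroxylation in transformant #3-5, which expresses both collagen alpha 1 and collagen alpha 2 as well as the P4H beta and alpha subunits.
The two bands in transformant #2-9 may represent dimers or trimers that remain stable after 7 minutes of boiling with SDS and mercaptoethanol. Similar bands are visible in human collagen (upper panel) and in transformant #3-5. A possible explanation is a covalent bond (cross-link) between two peptides in different triple helices, formed following oxidative deamination of two lysines by lysine oxidase.
In a second experiment, total soluble proteins were extracted from transgenic tobacco 13-6 (expressing collagen I alpha 1 and alpha 2 chains (indicated by arrows), human P4H alpha and beta subunits, and human LH3) by grinding 500 mg of leaves in 0.5 ml of 100 mM Tris-HCl pH=7.5 and 300 mM NaCl, centrifuging for 7 minutes at 10000 rpm and collecting the supernatant. 50 μl of the supernatant were heat treated (20 minutes at 33 °C, 38 °C or 42 °C) and then immediately placed on ice. Pepsin digestion was initiated by adding to each sample 4.5 μl of 0.1 M HCl and 4 μl of 2.5 mg/ml pepsin in 10 mM acetic acid. The samples were incubated for 30 minutes at room temperature (about 22 °C). The digestion was terminated by adding 5 μl of unbuffered 1 M Tris. Each sample was mixed with 22 μl of 4X sample application buffer containing 10% beta-mercaptoethanol and 8% SDS, boiled for 7 minutes and centrifuged for 7 minutes at 13000 rpm. 40 μl of the supernatant were loaded onto a 10% polyacrylamide gel and tested with anti-collagen I (denatured) antibody (#AB745 from Chemicon Inc.) in a standard Western blot procedure. The positive control was a sample of about 50 ng human collagen I (#CC050 from Chemicon Inc., extracted from human placenta by pepsin digestion) added to total soluble proteins extracted from W.T. tobacco.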
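The enzyme additions in the two digestion experiments can be converted into final working concentrations. The sketch below assumes volumes are simply additive; the function name is hypothetical and the numbers are taken from the protocol text (6 μl of 1 mg/ml trypsin into 50 μl; 4 μl of 2.5 mg/ml pepsin plus 4.5 μl HCl into 50 μl).

```python
def final_enzyme_conc(stock_mg_per_ml, enzyme_ul, other_ul):
    """Final enzyme concentration (mg/ml) after mixing; all volumes in microlitres.

    `other_ul` is everything in the tube except the enzyme stock itself.
    Assumes volumes are additive.
    """
    total_ul = enzyme_ul + other_ul
    return stock_mg_per_ml * enzyme_ul / total_ul

# First experiment: 6 ul of 1 mg/ml trypsin added to 50 ul of extract.
trypsin_mg_per_ml = final_enzyme_conc(1.0, 6.0, 50.0)
# Second experiment: 4 ul of 2.5 mg/ml pepsin added to 50 ul of extract plus 4.5 ul of HCl.
pepsin_mg_per_ml = final_enzyme_conc(2.5, 4.0, 50.0 + 4.5)

print(f"trypsin: {trypsin_mg_per_ml:.3f} mg/ml, pepsin: {pepsin_mg_per_ml:.3f} mg/ml")
```

This corresponds to 6 μg of trypsin and 10 μg of pepsin per reaction, at roughly 0.11 mg/ml and 0.17 mg/ml respectively.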
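As illustrated in FIG. 6b, the collagen triple helix formed in plant #13-6 was resistant to denaturation at 42 °C. Cleavage of the propeptides is first visible at 33 °C and gradually increases in efficiency as the temperature is raised to 38 °C and again to 42 °C. The cleaved collagen triple helix domain migrates on the gel similarly to the pepsin-treated human collagen. The human collagen used in this experiment was extracted from human placenta by pepsin proteolysis and therefore lacks the propeptides and some of the telopeptides.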
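EXAMPLE 5
Plant P4H Expression
Induction of Native Plant P4H
Tobacco P4H cDNA was cloned and used as a probe to determine the conditions and treatments that induce endogenous P4H expression. Northern blot analysis (FIG. 7) clearly shows that P4H is expressed at relatively high levels in the shoot apex and at low levels in leaves. The P4H level was induced significantly in leaves 4 hours after abrasion treatment ("wounded" in the lower panel). Similar results were achieved using other stress conditions (not shown).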
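Detection of Human P4H Alpha and Beta Subunits and Collagen Alpha 1 and Alpha 2 Chains in Transgenic Tobacco Plants
Human P4H alpha and beta subunits and collagen type I alpha 1 and alpha 2 chains were detected in transgenic tobacco plants using anti-human P4H alpha subunit antibody (#63-163 from ICN Biomedicals Inc.), anti-human P4H beta subunit antibody (#MAB2701 from Chemicon Inc.) and anti-collagen I antibody (#AB745 from Chemicon Inc.). The results of a Western blot probed with these antibodies are shown in FIG. 8.
Expression of P4H alpha, P4H beta and the collagen I alpha 1 and alpha 2 bands was confirmed in plant 13-6 (which was also transformed with human LH3). The calculated molecular weights of P4H alpha and beta, including the vacuolar signal peptide, are 65.5 kDa and 53.4 kDa, respectively. The calculated molecular weights of the collagen alpha 1 and alpha 2 chains with propeptides, without hydroxylation or glycosylation, are 136 kDa and 127 kDa, respectively.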
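It is appreciated that certain features of the invention which are, for clarity, described in the context of separate embodiments may also be provided in combination in a single embodiment. Conversely, various features of the invention which are, for brevity, described in the context of a single embodiment may also be provided separately or in any suitable subcombination.
Although the invention has been described in conjunction with specific embodiments thereof, it is evident that many alternatives, modifications and variations will be apparent to those skilled in the art. Accordingly, it is intended to embrace all such alternatives, modifications and variations that fall within the spirit and broad scope of the appended claims. All publications, patents, patent applications and GenBank accession numbers mentioned in this specification are incorporated herein by reference in their entirety, to the same extent as if each individual publication, patent, patent application or GenBank accession number was specifically and individually indicated to be incorporated herein by reference. In addition, citation or identification of any reference in this application shall not be construed as an admission that such reference is available as prior art to the present invention.
References Cited by Numerals
(Other references are cited in the document)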
SEQUENCE LISTING
<110> CollPlant Ltd.
<120> COLLAGEN PRODUCING PLANTS AND METHODS OF GENERATING AND USING
SAME
<130>
<160> 28
<170> PatentIn version 3.3
<210> 1
<211> 4662
<212> DNA
<213> Artificial sequence
<220>
<223> Synthetic sequence containing the coding regions of the vacuolar
signal sequence of barley gene for Thiol protease aleurain
precursor fused to the human Collagen alpha 1(I) chain and
flanking regions
<400> 1
gcgatgcatg taatgtcatg agccacatga tccaatggcc acaggaacgt aagaatgtag 60
atagatttga ttttgtccgt tagatagcaa acaacattat aaaaggtgtg tatcaatacg 120
aactaattca ctcattggat tcatagaagt ccattcctcc taagtatcta aaccatggct 180
cacgctcgtg ttctcctcct cgctctcgct gttttggcaa cagctgctgt ggctgtggct 240
tctagttctt cttttgctga ttcaaaccct attagacctg ttactgatag agcagcttcc 300
actttggctc aattgcaaga ggagggccag gttgagggcc aagatgagga tatccctcca 360
attacatgcg tgcaaaatgg cttgcgttac cacgataggg atgtgtggaa acctgaacct 420
tgtcgtatct gtgtgtgtga taacggcaag gtgctctgcg atgatgttat ctgcgatgag 480
acaaaaaatt gccctggcgc tgaagttcct gagggcgagt gttgccctgt gtgccctgat 540
ggttccgagt ccccaactga tcaggaaact actggcgtgg agggcccaaa aggagatact 600
ggtccacgtg gtcctagggg tccagcaggt cctccaggta gagatggtat tccaggccag 660
cctggattgc caggaccacc aggcccacct ggcccaccag gacctcctgg tcttggtgga 720
aatttcgctc cacaactctc ttatggctat gatgagaagt caacaggtgg tatttccgtt 780
ccaggtccta tgggaccatc cggaccaaga ggtctcccag gtcctccagg tgctcctgga 840
cctcaaggct ttcaaggacc tccaggcgaa ccaggagaac caggcgcttc tggaccaatg 900
ggcccaaggg gaccacctgg cccaccagga aaaaatggcg atgatggcga agctggaaag 960
cctggtcgtc ctggagagag aggtcctcct ggcccacagg gtgcaagagg cttgccagga 1020
actgctggct tgcctggaat gaagggacat aggggcttct ccggcctcga tggcgctaag 1080
ggtgatgctg gccctgctgg accaaagggc gagccaggtt cccctggaga aaacggtgct 1140
cctggacaaa tgggtcctcg tggacttcca ggagaaaggg gtcgtccagg cgctccagga 1200
ccagcaggtg ctaggggaaa cgatggtgca acaggcgctg ctggccctcc tggcccaact 1260
ggtcctgctg gccctccagg attcccaggc gcagttggag ctaaaggaga agcaggacca 1320
cagggcccta ggggttctga aggacctcag ggtgttagag gtgaaccagg tcctccaggc 1380
ccagctggag cagctggtcc agcaggaaat ccaggtgctg atggtcaacc tggagctaag 1440
ggcgctaatg gcgcaccagg tatcgcaggc gcaccaggtt ttcctggcgc tagaggccca 1500
agtggtcctc aaggaccagg tggaccacca ggtccaaaag gcaattctgg cgaacctggc 1560
gctccaggtt ctaaaggaga tactggtgct aaaggcgaac caggacctgt tggtgttcag 1620
ggtcctcctg gtcctgctgg agaagaagga aaaagaggtg ctcgtggaga accaggacca 1680
actggacttc ctggacctcc tggtgaacgt ggcggacctg gctcaagggg tttccctgga 1740
gctgatggag tggcaggtcc aaaaggccct gctggagaga gaggttcacc aggtccagct 1800
ggtcctaagg gctcccctgg tgaagcaggt agaccaggcg aagcaggatt gccaggcgca 1860
aagggattga caggctctcc tggtagtcct ggcccagatg gaaaaacagg cccaccaggt 1920
ccagcaggac aagatggacg tccaggccca ccaggtcctc ctggagcaag gggacaagct 1980
ggcgttatgg gttttccagg acctaaaggt gctgctggag agccaggaaa ggcaggtgaa 2040
agaggagttc ctggtccacc aggagcagtg ggtcctgctg gcaaagatgg tgaagctgga 2100
gcacagggcc ctccaggccc tgctggccca gctggcgaac gtggagaaca aggcccagct 2160
ggtagtccag gatttcaagg attgcctggc cctgctggcc ctccaggaga agcaggaaaa 2220
cctggagaac aaggagttcc tggtgatttg ggagcacctg gaccttcagg agcacgtggt 2280
gaaagaggct tccctggcga gaggggtgtt caaggtccac caggtccagc aggacctaga 2340
ggtgctaatg gcgctcctgg caacgatgga gcaaaaggtg atgctggtgc tcctggcgca 2400
cctggaagtc agggtgctcc tggattgcaa ggaatgcctg gagagagggg tgctgctggc 2460
ttgccaggcc caaagggcga taggggtgat gctggaccaa aaggtgctga tggatcccca 2520
ggaaaagatg gagttcgtgg tcttactggc ccaatcggac ctccaggccc tgctggcgct 2580
ccaggtgata agggcgaaag tggcccaagt ggacctgctg gacctactgg tgctagaggt 2640
gcacctggtg ataggggtga acctggacca cctggtccag ctggttttgc tggtcctcct 2700
ggagctgatg gacaacctgg cgcaaagggt gaaccaggtg atgctggcgc aaagggagat 2760
gctggtccac ctggacctgc tggtccagca ggcccccctg ggccaatcgg taatgttgga 2820
gcaccaggtg ctaagggagc taggggttcc gctggtccac ctggagcaac aggatttcca 2880
ggcgctgctg gtagagttgg cccaccaggc ccatccggaa acgcaggccc tcctggtcct 2940
ccaggtcctg ctggcaagga gggtggcaaa ggaccaaggg gcgaaactgg ccctgctggt 3000
agacctggcg aagttggccc tcctggacca ccaggtccag caggagaaaa aggttcccca 3060
ggagctgatg gcccagctgg tgctccagga actccaggcc ctcaaggtat tgctggacag 3120
agaggcgttg tgggactccc tggtcaaagg ggagagagag gatttccagg cttgccagga 3180
cctagtggag aacctggaaa acaaggccca tcaggcgcta gtggagagcg tggacctcct 3240
ggccctatgg gacctcctgg attggctggc ccacctggcg aatcaggtcg tgaaggcgca 3300
ccaggcgcag aaggatcacc tggaagagat ggatcccctg gtgctaaagg cgatcgtgga 3360
gaaactggtc cagcaggccc accaggcgca ccaggtgcac ctggcgctcc aggacctgtg 3420
ggaccagctg gaaaatccgg agataggggc gagacaggcc cagcaggacc agctggacct 3480
gttggccctg ctggcgctcg tggaccagca ggacctcaag gaccaagggg agataaggga 3540
gaaacaggcg aacaaggcga taggggcatt aagggtcata ggggttttag tggcctccag 3600
ggtcctcctg gcccacctgg atcaccagga gaacagggac catctggtgc ttccggccca 3660
gctggtccaa gaggacctcc aggatcagct ggtgcacctg gaaaagatgg tcttaacggt 3720
ctcccaggac caatcggccc tccaggacct agaggaagaa caggagatgc tggccctgtt 3780
ggccctccag gacctcctgg tccaccaggt ccacctggtc ctccatcagc tggattcgat 3840
ttttcatttc ttccacagcc accacaagag aaagctcacg atggcggcag atattaccgt 3900
gctgatgatg ctaacgttgt tagggataga gatttggaag tggatacaac tttgaaatcc 3960
ctctcccagc aaattgaaaa cattagatct ccagaaggtt cacgtaaaaa cccagctaga 4020
acatgtcgtg atttgaaaat gtgtcactcc gattggaaaa gtggtgaata ctggattgat 4080
ccaaatcagg gctgtaatct cgatgctatc aaagttttct gtaacatgga aacaggcgaa 4140
acatgcgttt atcctactca accttccgtg gctcagaaaa attggtacat ctcaaaaaat 4200
cctaaagata agaggcacgt ttggttcggt gaaagtatga ctgatggatt tcaatttgag 4260
tacggcggtc aaggtagtga tccagctgat gtggctattc aactcacatt tttgcgtctt 4320
atgtccacag aggcatcaca aaacatcact taccactgca aaaacagtgt ggcttatatg 4380
gatcaacaaa caggaaacct taagaaggct cttcttttga agggctcaaa cgagattgag 4440
attagagcag agggcaactc aaggtttact tattcagtta ctgttgatgg ctgcacttca 4500
catactggcg cttggggtaa aacagttatc gagtataaga ctacaaaaac atcaagactc 4560
ccaatcattg atgttgctcc tctcgatgtt ggcgctcctg atcaagagtt cggttttgat 4620
gtgggcccag tttgtttcct ctaatgagct cgcggccgca tc 4662
<210> 2
<211> 4662
<212> DNA
<213> Artificial sequence
<220>
<223> Synthetic sequence of the vacuolar signal sequence of barley gene
for Thiol protease aleurain precursor fused to the human Collagen
alpha 1(I) chain and flanking regions
<220>
<221> CDS
<222> (175)..(4644)
<400> 2
gcgatgcatg taatgtcatg agccacatga tccaatggcc acaggaacgt aagaatgtag 60
atagatttga ttttgtccgt tagatagcaa acaacattat aaaaggtgtg tatcaatacg 120
aactaattca ctcattggat tcatagaagt ccattcctcc taagtatcta aacc atg 177
Met
1
gct cac gct cgt gtt ctc ctc ctc gct ctc gct gtt ttg gca aca gct 225
Ala His Ala Arg Val Leu Leu Leu Ala Leu Ala Val Leu Ala Thr Ala
5 10 15
gct gtg gct gtg gct tct agt tct tct ttt gct gat tca aac cct att 273
Ala Val Ala Val Ala Ser Ser Ser Ser Phe Ala Asp Ser Asn Pro Ile
20 25 30
aga cct gtt act gat aga gca gct tcc act ttg gct caa ttg caa gag 321
Arg Pro Val Thr Asp Arg Ala Ala Ser Thr Leu Ala Gln Leu Gln Glu
35 40 45
gag ggc cag gtt gag ggc caa gat gag gat atc cct cca att aca tgc 369
Glu Gly Gln Val Glu Gly Gln Asp Glu Asp Ile Pro Pro Ile Thr Cys
50 55 60 65
gtg caa aat ggc ttg cgt tac cac gat agg gat gtg tgg aaa cct gaa 417
Val Gln Asn Gly Leu Arg Tyr His Asp Arg Asp Val Trp Lys Pro Glu
70 75 80
cct tgt cgt atc tgt gtg tgt gat aac ggc aag gtg ctc tgc gat gat 465
Pro Cys Arg Ile Cys Val Cys Asp Asn Gly Lys Val Leu Cys Asp Asp
85 90 95
gtt atc tgc gat gag aca aaa aat tgc cct ggc gct gaa gtt cct gag 513
Val Ile Cys Asp Glu Thr Lys Asn Cys Pro Gly Ala Glu Val Pro Glu
100 105 110
ggc gag tgt tgc cct gtg tgc cct gat ggt tcc gag tcc cca act gat 561
Gly Glu Cys Cys Pro Val Cys Pro Asp Gly Ser Glu Ser Pro Thr Asp
115 120 125
cag gaa act act ggc gtg gag ggc cca aaa gga gat act ggt cca cgt 609
Gln Glu Thr Thr Gly Val Glu Gly Pro Lys Gly Asp Thr Gly Pro Arg
130 135 140 145
ggt cct agg ggt cca gca ggt cct cca ggt aga gat ggt att cca ggc 657
Gly Pro Arg Gly Pro Ala Gly Pro Pro Gly Arg Asp Gly Ile Pro Gly
150 155 160
cag cct gga ttg cca gga cca cca ggc cca cct ggc cca cca gga cct 705
Gln Pro Gly Leu Pro Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Pro
165 170 175
cct ggt ctt ggt gga aat ttc gct cca caa ctc tct tat ggc tat gat 753
Pro Gly Leu Gly Gly Asn Phe Ala Pro Gln Leu Ser Tyr Gly Tyr Asp
180 185 190
gag aag tca aca ggt ggt att tcc gtt cca ggt cct atg gga cca tcc 801
Glu Lys Ser Thr Gly Gly Ile Ser Val Pro Gly Pro Met Gly Pro Ser
195 200 205
gga cca aga ggt ctc cca ggt cct cca ggt gct cct gga cct caa ggc 849
Gly Pro Arg Gly Leu Pro Gly Pro Pro Gly Ala Pro Gly Pro Gln Gly
210 215 220 225
ttt caa gga cct cca ggc gaa cca gga gaa cca ggc gct tct gga cca 897
Phe Gln Gly Pro Pro Gly Glu Pro Gly Glu Pro Gly Ala Ser Gly Pro
230 235 240
atg ggc cca agg gga cca cct ggc cca cca gga aaa aat ggc gat gat 945
Met Gly Pro Arg Gly Pro Pro Gly Pro Pro Gly Lys Asn Gly Asp Asp
245 250 255
ggc gaa gct gga aag cct ggt cgt cct gga gag aga ggt cct cct ggc 993
Gly Glu Ala Gly Lys Pro Gly Arg Pro Gly Glu Arg Gly Pro Pro Gly
260 265 270
cca cag ggt gca aga ggc ttg cca gga act gct ggc ttg cct gga atg 1041
Pro Gln Gly Ala Arg Gly Leu Pro Gly Thr Ala Gly Leu Pro Gly Met
275 280 285
aag gga cat agg ggc ttc tcc ggc ctc gat ggc gct aag ggt gat gct 1089
Lys Gly His Arg Gly Phe Ser Gly Leu Asp Gly Ala Lys Gly Asp Ala
290 295 300 305
ggc cct gct gga cca aag ggc gag cca ggt tcc cct gga gaa aac ggt 1137
Gly Pro Ala Gly Pro Lys Gly Glu Pro Gly Ser Pro Gly Glu Asn Gly
310 315 320
gct cct gga caa atg ggt cct cgt gga ctt cca gga gaa agg ggt cgt 1185
Ala Pro Gly Gln Met Gly Pro Arg Gly Leu Pro Gly Glu Arg Gly Arg
325 330 335
cca ggc gct cca gga cca gca ggt gct agg gga aac gat ggt gca aca 1233
Pro Gly Ala Pro Gly Pro Ala Gly Ala Arg Gly Asn Asp Gly Ala Thr
340 345 350
ggc gct gct ggc cct cct ggc cca act ggt cct gct ggc cct cca gga 1281
Gly Ala Ala Gly Pro Pro Gly Pro Thr Gly Pro Ala Gly Pro Pro Gly
355 360 365
ttc cca ggc gca gtt gga gct aaa gga gaa gca gga cca cag ggc cct 1329
Phe Pro Gly Ala Val Gly Ala Lys Gly Glu Ala Gly Pro Gln Gly Pro
370 375 380 385
agg ggt tct gaa gga cct cag ggt gtt aga ggt gaa cca ggt cct cca 1377
Arg Gly Ser Glu Gly Pro Gln Gly Val Arg Gly Glu Pro Gly Pro Pro
390 395 400
ggc cca gct gga gca gct ggt cca gca gga aat cca ggt gct gat ggt 1425
Gly Pro Ala Gly Ala Ala Gly Pro Ala Gly Asn Pro Gly Ala Asp Gly
405 410 415
caa cct gga gct aag ggc gct aat ggc gca cca ggt atc gca ggc gca 1473
Gln Pro Gly Ala Lys Gly Ala Asn Gly Ala Pro Gly Ile Ala Gly Ala
420 425 430
cca ggt ttt cct ggc gct aga ggc cca agt ggt cct caa gga cca ggt 1521
Pro Gly Phe Pro Gly Ala Arg Gly Pro Ser Gly Pro Gln Gly Pro Gly
435 440 445
gga cca cca ggt cca aaa ggc aat tct ggc gaa cct ggc gct cca ggt 1569
Gly Pro Pro Gly Pro Lys Gly Asn Ser Gly Glu Pro Gly Ala Pro Gly
450 455 460 465
tct aaa gga gat act ggt gct aaa ggc gaa cca gga cct gtt ggt gtt 1617
Ser Lys Gly Asp Thr Gly Ala Lys Gly Glu Pro Gly Pro Val Gly Val
470 475 480
cag ggt cct cct ggt cct gct gga gaa gaa gga aaa aga ggt gct cgt 1665
Gln Gly Pro Pro Gly Pro Ala Gly Glu Glu Gly Lys Arg Gly Ala Arg
485 490 495
gga gaa cca gga cca act gga ctt cct gga cct cct ggt gaa cgt ggc 1713
Gly Glu Pro Gly Pro Thr Gly Leu Pro Gly Pro Pro Gly Glu Arg Gly
500 505 510
gga cct ggc tca agg ggt ttc cct gga gct gat gga gtg gca ggt cca 1761
Gly Pro Gly Ser Arg Gly Phe Pro Gly Ala Asp Gly Val Ala Gly Pro
515 520 525
aaa ggc cct gct gga gag aga ggt tca cca ggt cca gct ggt cct aag 1809
Lys Gly Pro Ala Gly Glu Arg Gly Ser Pro Gly Pro Ala Gly Pro Lys
530 535 540 545
ggc tcc cct ggt gaa gca ggt aga cca ggc gaa gca gga ttg cca ggc 1857
Gly Ser Pro Gly Glu Ala Gly Arg Pro Gly Glu Ala Gly Leu Pro Gly
550 555 560
gca aag gga ttg aca ggc tct cct ggt agt cct ggc cca gat gga aaa 1905
Ala Lys Gly Leu Thr Gly Ser Pro Gly Ser Pro Gly Pro Asp Gly Lys
565 570 575
aca ggc cca cca ggt cca gca gga caa gat gga cgt cca ggc cca cca 1953
Thr Gly Pro Pro Gly Pro Ala Gly Gln Asp Gly Arg Pro Gly Pro Pro
580 585 590
ggt cct cct gga gca agg gga caa gct ggc gtt atg ggt ttt cca gga 2001
Gly Pro Pro Gly Ala Arg Gly Gln Ala Gly Val Met Gly Phe Pro Gly
595 600 605
cct aaa ggt gct gct gga gag cca gga aag gca ggt gaa aga gga gtt 2049
Pro Lys Gly Ala Ala Gly Glu Pro Gly Lys Ala Gly Glu Arg Gly Val
610 615 620 625
cct ggt cca cca gga gca gtg ggt cct gct ggc aaa gat ggt gaa gct 2097
Pro Gly Pro Pro Gly Ala Val Gly Pro Ala Gly Lys Asp Gly Glu Ala
630 635 640
gga gca cag ggc cct cca ggc cct gct ggc cca gct ggc gaa cgt gga 2145
Gly Ala Gln Gly Pro Pro Gly Pro Ala Gly Pro Ala Gly Glu Arg Gly
645 650 655
gaa caa ggc cca gct ggt agt cca gga ttt caa gga ttg cct ggc cct 2193
Glu Gln Gly Pro Ala Gly Ser Pro Gly Phe Gln Gly Leu Pro Gly Pro
660 665 670
gct ggc cct cca gga gaa gca gga aaa cct gga gaa caa gga gtt cct 2241
Ala Gly Pro Pro Gly Glu Ala Gly Lys Pro Gly Glu Gln Gly Val Pro
675 680 685
ggt gat ttg gga gca cct gga cct tca gga gca cgt ggt gaa aga ggc 2289
Gly Asp Leu Gly Ala Pro Gly Pro Ser Gly Ala Arg Gly Glu Arg Gly
690 695 700 705
ttc cct ggc gag agg ggt gtt caa ggt cca cca ggt cca gca gga cct 2337
Phe Pro Gly Glu Arg Gly Val Gln Gly Pro Pro Gly Pro Ala Gly Pro
710 715 720
aga ggt gct aat ggc gct cct ggc aac gat gga gca aaa ggt gat gct 2385
Arg Gly Ala Asn Gly Ala Pro Gly Asn Asp Gly Ala Lys Gly Asp Ala
725 730 735
ggt gct cct ggc gca cct gga agt cag ggt gct cct gga ttg caa gga 2433
Gly Ala Pro Gly Ala Pro Gly Ser Gln Gly Ala Pro Gly Leu Gln Gly
740 745 750
atg cct gga gag agg ggt gct gct ggc ttg cca ggc cca aag ggc gat 2481
Met Pro Gly Glu Arg Gly Ala Ala Gly Leu Pro Gly Pro Lys Gly Asp
755 760 765
agg ggt gat gct gga cca aaa ggt gct gat gga tcc cca gga aaa gat 2529
Arg Gly Asp Ala Gly Pro Lys Gly Ala Asp Gly Ser Pro Gly Lys Asp
770 775 780 785
gga gtt cgt ggt ctt act ggc cca atc gga cct cca ggc cct gct ggc 2577
Gly Val Arg Gly Leu Thr Gly Pro Ile Gly Pro Pro Gly Pro Ala Gly
790 795 800
gct cca ggt gat aag ggc gaa agt ggc cca agt gga cct gct gga cct 2625
Ala Pro Gly Asp Lys Gly Glu Ser Gly Pro Ser Gly Pro Ala Gly Pro
805 810 815
act ggt gct aga ggt gca cct ggt gat agg ggt gaa cct gga cca cct 2673
Thr Gly Ala Arg Gly Ala Pro Gly Asp Arg Gly Glu Pro Gly Pro Pro
820 825 830
ggt cca gct ggt ttt gct ggt cct cct gga gct gat gga caa cct ggc 2721
Gly Pro Ala Gly Phe Ala Gly Pro Pro Gly Ala Asp Gly Gln Pro Gly
835 840 845
gca aag ggt gaa cca ggt gat gct ggc gca aag gga gat gct ggt cca 2769
Ala Lys Gly Glu Pro Gly Asp Ala Gly Ala Lys Gly Asp Ala Gly Pro
850 855 860 865
cct gga cct gct ggt cca gca ggc ccc cct ggg cca atc ggt aat gtt 2817
Pro Gly Pro Ala Gly Pro Ala Gly Pro Pro Gly Pro Ile Gly Asn Val
870 875 880
gga gca cca ggt gct aag gga gct agg ggt tcc gct ggt cca cct gga 2865
Gly Ala Pro Gly Ala Lys Gly Ala Arg Gly Ser Ala Gly Pro Pro Gly
885 890 895
gca aca gga ttt cca ggc gct gct ggt aga gtt ggc cca cca ggc cca 2913
Ala Thr Gly Phe Pro Gly Ala Ala Gly Arg Val Gly Pro Pro Gly Pro
900 905 910
tcc gga aac gca ggc cct cct ggt cct cca ggt cct gct ggc aag gag 2961
Ser Gly Asn Ala Gly Pro Pro Gly Pro Pro Gly Pro Ala Gly Lys Glu
915 920 925
ggt ggc aaa gga cca agg ggc gaa act ggc cct gct ggt aga cct ggc 3009
Gly Gly Lys Gly Pro Arg Gly Glu Thr Gly Pro Ala Gly Arg Pro Gly
930 935 940 945
gaa gtt ggc cct cct gga cca cca ggt cca gca gga gaa aaa ggt tcc 3057
Glu Val Gly Pro Pro Gly Pro Pro Gly Pro Ala Gly Glu Lys Gly Ser
950 955 960
cca gga gct gat ggc cca gct ggt gct cca gga act cca ggc cct caa 3105
Pro Gly Ala Asp Gly Pro Ala Gly Ala Pro Gly Thr Pro Gly Pro Gln
965 970 975
ggt att gct gga cag aga ggc gtt gtg gga ctc cct ggt caa agg gga 3153
Gly Ile Ala Gly Gln Arg Gly Val Val Gly Leu Pro Gly Gln Arg Gly
980 985 990
gag aga gga ttt cca ggc ttg cca gga cct agt gga gaa cct gga aaa 3201
Glu Arg Gly Phe Pro Gly Leu Pro Gly Pro Ser Gly Glu Pro Gly Lys
995 1000 1005
caa ggc cca tca ggc gct agt gga gag cgt gga cct cct ggc cct 3246
Gln Gly Pro Ser Gly Ala Ser Gly Glu Arg Gly Pro Pro Gly Pro
1010 1015 1020
atg gga cct cct gga ttg gct ggc cca cct ggc gaa tca ggt cgt 3291
Met Gly Pro Pro Gly Leu Ala Gly Pro Pro Gly Glu Ser Gly Arg
1025 1030 1035
gaa ggc gca cca ggc gca gaa gga tca cct gga aga gat gga tcc 3336
Glu Gly Ala Pro Gly Ala Glu Gly Ser Pro Gly Arg Asp Gly Ser
1040 1045 1050
cct ggt gct aaa ggc gat cgt gga gaa act ggt cca gca ggc cca 3381
Pro Gly Ala Lys Gly Asp Arg Gly Glu Thr Gly Pro Ala Gly Pro
1055 1060 1065
cca ggc gca cca ggt gca cct ggc gct cca gga cct gtg gga cca 3426
Pro Gly Ala Pro Gly Ala Pro Gly Ala Pro Gly Pro Val Gly Pro
1070 1075 1080
gct gga aaa tcc gga gat agg ggc gag aca ggc cca gca gga cca 3471
Ala Gly Lys Ser Gly Asp Arg Gly Glu Thr Gly Pro Ala Gly Pro
1085 1090 1095
gct gga cct gtt ggc cct gct ggc gct cgt gga cca gca gga cct 3516
Ala Gly Pro Val Gly Pro Ala Gly Ala Arg Gly Pro Ala Gly Pro
1100 1105 1110
caa gga cca agg gga gat aag gga gaa aca ggc gaa caa ggc gat 3561
Gln Gly Pro Arg Gly Asp Lys Gly Glu Thr Gly Glu Gln Gly Asp
1115 1120 1125
agg ggc att aag ggt cat agg ggt ttt agt ggc ctc cag ggt cct 3606
Arg Gly Ile Lys Gly His Arg Gly Phe Ser Gly Leu Gln Gly Pro
1130 1135 1140
cct ggc cca cct gga tca cca gga gaa cag gga cca tct ggt gct 3651
Pro Gly Pro Pro Gly Ser Pro Gly Glu Gln Gly Pro Ser Gly Ala
1145 1150 1155
tcc ggc cca gct ggt cca aga gga cct cca gga tca gct ggt gca 3696
Ser Gly Pro Ala Gly Pro Arg Gly Pro Pro Gly Ser Ala Gly Ala
1160 1165 1170
cct gga aaa gat ggt ctt aac ggt ctc cca gga cca atc ggc cct 3741
Pro Gly Lys Asp Gly Leu Asn Gly Leu Pro Gly Pro Ile Gly Pro
1175 1180 1185
cca gga cct aga gga aga aca gga gat gct ggc cct gtt ggc cct 3786
Pro Gly Pro Arg Gly Arg Thr Gly Asp Ala Gly Pro Val Gly Pro
1190 1195 1200
cca gga cct cct ggt cca cca ggt cca cct ggt cct cca tca gct 3831
Pro Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Pro Pro Ser Ala
1205 1210 1215
gga ttc gat ttt tca ttt ctt cca cag cca cca caa gag aaa gct 3876
Gly Phe Asp Phe Ser Phe Leu Pro Gln Pro Pro Gln Glu Lys Ala
1220 1225 1230
cac gat ggc ggc aga tat tac cgt gct gat gat gct aac gtt gtt 3921
His Asp Gly Gly Arg Tyr Tyr Arg Ala Asp Asp Ala Asn Val Val
1235 1240 1245
agg gat aga gat ttg gaa gtg gat aca act ttg aaa tcc ctc tcc 3966
Arg Asp Arg Asp Leu Glu Val Asp Thr Thr Leu Lys Ser Leu Ser
1250 1255 1260
cag caa att gaa aac att aga tct cca gaa ggt tca cgt aaa aac 4011
Gln Gln Ile Glu Asn Ile Arg Ser Pro Glu Gly Ser Arg Lys Asn
1265 1270 1275
cca gct aga aca tgt cgt gat ttg aaa atg tgt cac tcc gat tgg 4056
Pro Ala Arg Thr Cys Arg Asp Leu Lys Met Cys His Ser Asp Trp
1280 1285 1290
aaa agt ggt gaa tac tgg att gat cca aat cag ggc tgt aat ctc 4101
Lys Ser Gly Glu Tyr Trp Ile Asp Pro Asn Gln Gly Cys Asn Leu
1295 1300 1305
gat gct atc aaa gtt ttc tgt aac atg gaa aca ggc gaa aca tgc 4146
Asp Ala Ile Lys Val Phe Cys Asn Met Glu Thr Gly Glu Thr Cys
1310 1315 1320
gtt tat cct act caa cct tcc gtg gct cag aaa aat tgg tac atc 4191
Val Tyr Pro Thr Gln Pro Ser Val Ala Gln Lys Asn Trp Tyr Ile
1325 1330 1335
tca aaa aat cct aaa gat aag agg cac gtt tgg ttc ggt gaa agt 4236
Ser Lys Asn Pro Lys Asp Lys Arg His Val Trp Phe Gly Glu Ser
1340 1345 1350
atg act gat gga ttt caa ttt gag tac ggc ggt caa ggt agt gat 4281
Met Thr Asp Gly Phe Gln Phe Glu Tyr Gly Gly Gln Gly Ser Asp
1355 1360 1365
cca gct gat gtg gct att caa ctc aca ttt ttg cgt ctt atg tcc 4326
Pro Ala Asp Val Ala Ile Gln Leu Thr Phe Leu Arg Leu Met Ser
1370 1375 1380
aca gag gca tca caa aac atc act tac cac tgc aaa aac agt gtg 4371
Thr Glu Ala Ser Gln Asn Ile Thr Tyr His Cys Lys Asn Ser Val
1385 1390 1395
gct tat atg gat caa caa aca gga aac ctt aag aag gct ctt ctt 4416
Ala Tyr Met Asp Gln Gln Thr Gly Asn Leu Lys Lys Ala Leu Leu
1400 1405 1410
ttg aag ggc tca aac gag att gag att aga gca gag ggc aac tca 4461
Leu Lys Gly Ser Asn Glu Ile Glu Ile Arg Ala Glu Gly Asn Ser
1415 1420 1425
agg ttt act tat tca gtt act gtt gat ggc tgc act tca cat act 4506
Arg Phe Thr Tyr Ser Val Thr Val Asp Gly Cys Thr Ser His Thr
1430 1435 1440
ggc gct tgg ggt aaa aca gtt atc gag tat aag act aca aaa aca 4551
Gly Ala Trp Gly Lys Thr Val Ile Glu Tyr Lys Thr Thr Lys Thr
1445 1450 1455
tca aga ctc cca atc att gat gtt gct cct ctc gat gtt ggc gct 4596
Ser Arg Leu Pro Ile Ile Asp Val Ala Pro Leu Asp Val Gly Ala
1460 1465 1470
cct gat caa gag ttc ggt ttt gat gtg ggc cca gtt tgt ttc ctc 4641
Pro Asp Gln Glu Phe Gly Phe Asp Val Gly Pro Val Cys Phe Leu
1475 1480 1485
taa tgagctcgcg gccgcatc 4662
<210> 3
<211> 1489
<212> PRT
<213> Artificial sequence
<220>
<223> Synthetic sequence of the vacuolar signal sequence of barley gene
for Thiol protease aleurain precursor fused to the human Collagen
alpha 1(I) chain and flanking regions
<400> 3
Met Ala His Ala Arg Val Leu Leu Leu Ala Leu Ala Val Leu Ala Thr
1 5 10 15
Ala Ala Val Ala Val Ala Ser Ser Ser Ser Phe Ala Asp Ser Asn Pro
20 25 30
Ile Arg Pro Val Thr Asp Arg Ala Ala Ser Thr Leu Ala Gln Leu Gln
35 40 45
Glu Glu Gly Gln Val Glu Gly Gln Asp Glu Asp Ile Pro Pro Ile Thr
50 55 60
Cys Val Gln Asn Gly Leu Arg Tyr His Asp Arg Asp Val Trp Lys Pro
65 70 75 80
Glu Pro Cys Arg Ile Cys Val Cys Asp Asn Gly Lys Val Leu Cys Asp
85 90 95
Asp Val Ile Cys Asp Glu Thr Lys Asn Cys Pro Gly Ala Glu Val Pro
100 105 110
Glu Gly Glu Cys Cys Pro Val Cys Pro Asp Gly Ser Glu Ser Pro Thr
115 120 125
Asp Gln Glu Thr Thr Gly Val Glu Gly Pro Lys Gly Asp Thr Gly Pro
130 135 140
Arg Gly Pro Arg Gly Pro Ala Gly Pro Pro Gly Arg Asp Gly Ile Pro
145 150 155 160
Gly Gln Pro Gly Leu Pro Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly
165 170 175
Pro Pro Gly Leu Gly Gly Asn Phe Ala Pro Gln Leu Ser Tyr Gly Tyr
180 185 190
Asp Glu Lys Ser Thr Gly Gly Ile Ser Val Pro Gly Pro Met Gly Pro
195 200 205
Ser Gly Pro Arg Gly Leu Pro Gly Pro Pro Gly Ala Pro Gly Pro Gln
210 215 220
Gly Phe Gln Gly Pro Pro Gly Glu Pro Gly Glu Pro Gly Ala Ser Gly
225 230 235 240
Pro Met Gly Pro Arg Gly Pro Pro Gly Pro Pro Gly Lys Asn Gly Asp
245 250 255
Asp Gly Glu Ala Gly Lys Pro Gly Arg Pro Gly Glu Arg Gly Pro Pro
260 265 270
Gly Pro Gln Gly Ala Arg Gly Leu Pro Gly Thr Ala Gly Leu Pro Gly
275 280 285
Met Lys Gly His Arg Gly Phe Ser Gly Leu Asp Gly Ala Lys Gly Asp
290 295 300
Ala Gly Pro Ala Gly Pro Lys Gly Glu Pro Gly Ser Pro Gly Glu Asn
305 310 315 320
Gly Ala Pro Gly Gln Met Gly Pro Arg Gly Leu Pro Gly Glu Arg Gly
325 330 335
Arg Pro Gly Ala Pro Gly Pro Ala Gly Ala Arg Gly Asn Asp Gly Ala
340 345 350
Thr Gly Ala Ala Gly Pro Pro Gly Pro Thr Gly Pro Ala Gly Pro Pro
355 360 365
Gly Phe Pro Gly Ala Val Gly Ala Lys Gly Glu Ala Gly Pro Gln Gly
370 375 380
Pro Arg Gly Ser Glu Gly Pro Gln Gly Val Arg Gly Glu Pro Gly Pro
385 390 395 400
Pro Gly Pro Ala Gly Ala Ala Gly Pro Ala Gly Asn Pro Gly Ala Asp
405 410 415
Gly Gln Pro Gly Ala Lys Gly Ala Asn Gly Ala Pro Gly Ile Ala Gly
420 425 430
Ala Pro Gly Phe Pro Gly Ala Arg Gly Pro Ser Gly Pro Gln Gly Pro
435 440 445
Gly Gly Pro Pro Gly Pro Lys Gly Asn Ser Gly Glu Pro Gly Ala Pro
450 455 460
Gly Ser Lys Gly Asp Thr Gly Ala Lys Gly Glu Pro Gly Pro Val Gly
465 470 475 480
Val Gln Gly Pro Pro Gly Pro Ala Gly Glu Glu Gly Lys Arg Gly Ala
485 490 495
Arg Gly Glu Pro Gly Pro Thr Gly Leu Pro Gly Pro Pro Gly Glu Arg
500 505 510
Gly Gly Pro Gly Ser Arg Gly Phe Pro Gly Ala Asp Gly Val Ala Gly
515 520 525
Pro Lys Gly Pro Ala Gly Glu Arg Gly Ser Pro Gly Pro Ala Gly Pro
530 535 540
Lys Gly Ser Pro Gly Glu Ala Gly Arg Pro Gly Glu Ala Gly Leu Pro
545 550 555 560
Gly Ala Lys Gly Leu Thr Gly Ser Pro Gly Ser Pro Gly Pro Asp Gly
565 570 575
Lys Thr Gly Pro Pro Gly Pro Ala Gly Gln Asp Gly Arg Pro Gly Pro
580 585 590
Pro Gly Pro Pro Gly Ala Arg Gly Gln Ala Gly Val Met Gly Phe Pro
595 600 605
Gly Pro Lys Gly Ala Ala Gly Glu Pro Gly Lys Ala Gly Glu Arg Gly
610 615 620
Val Pro Gly Pro Pro Gly Ala Val Gly Pro Ala Gly Lys Asp Gly Glu
625 630 635 640
Ala Gly Ala Gln Gly Pro Pro Gly Pro Ala Gly Pro Ala Gly Glu Arg
645 650 655
Gly Glu Gln Gly Pro Ala Gly Ser Pro Gly Phe Gln Gly Leu Pro Gly
660 665 670
Pro Ala Gly Pro Pro Gly Glu Ala Gly Lys Pro Gly Glu Gln Gly Val
675 680 685
Pro Gly Asp Leu Gly Ala Pro Gly Pro Ser Gly Ala Arg Gly Glu Arg
690 695 700
Gly Phe Pro Gly Glu Arg Gly Val Gln Gly Pro Pro Gly Pro Ala Gly
705 710 715 720
Pro Arg Gly Ala Asn Gly Ala Pro Gly Asn Asp Gly Ala Lys Gly Asp
725 730 735
Ala Gly Ala Pro Gly Ala Pro Gly Ser Gln Gly Ala Pro Gly Leu Gln
740 745 750
Gly Met Pro Gly Glu Arg Gly Ala Ala Gly Leu Pro Gly Pro Lys Gly
755 760 765
Asp Arg Gly Asp Ala Gly Pro Lys Gly Ala Asp Gly Ser Pro Gly Lys
770 775 780
Asp Gly Val Arg Gly Leu Thr Gly Pro Ile Gly Pro Pro Gly Pro Ala
785 790 795 800
Gly Ala Pro Gly Asp Lys Gly Glu Ser Gly Pro Ser Gly Pro Ala Gly
805 810 815
Pro Thr Gly Ala Arg Gly Ala Pro Gly Asp Arg Gly Glu Pro Gly Pro
820 825 830
Pro Gly Pro Ala Gly Phe Ala Gly Pro Pro Gly Ala Asp Gly Gln Pro
835 840 845
Gly Ala Lys Gly Glu Pro Gly Asp Ala Gly Ala Lys Gly Asp Ala Gly
850 855 860
Pro Pro Gly Pro Ala Gly Pro Ala Gly Pro Pro Gly Pro Ile Gly Asn
865 870 875 880
Val Gly Ala Pro Gly Ala Lys Gly Ala Arg Gly Ser Ala Gly Pro Pro
885 890 895
Gly Ala Thr Gly Phe Pro Gly Ala Ala Gly Arg Val Gly Pro Pro Gly
900 905 910
Pro Ser Gly Asn Ala Gly Pro Pro Gly Pro Pro Gly Pro Ala Gly Lys
915 920 925
Glu Gly Gly Lys Gly Pro Arg Gly Glu Thr Gly Pro Ala Gly Arg Pro
930 935 940
Gly Glu Val Gly Pro Pro Gly Pro Pro Gly Pro Ala Gly Glu Lys Gly
945 950 955 960
Ser Pro Gly Ala Asp Gly Pro Ala Gly Ala Pro Gly Thr Pro Gly Pro
965 970 975
Gln Gly Ile Ala Gly Gln Arg Gly Val Val Gly Leu Pro Gly Gln Arg
980 985 990
Gly Glu Arg Gly Phe Pro Gly Leu Pro Gly Pro Ser Gly Glu Pro Gly
995 1000 1005
Lys Gln Gly Pro Ser Gly Ala Ser Gly Glu Arg Gly Pro Pro Gly
1010 1015 1020
Pro Met Gly Pro Pro Gly Leu Ala Gly Pro Pro Gly Glu Ser Gly
1025 1030 1035
Arg Glu Gly Ala Pro Gly Ala Glu Gly Ser Pro Gly Arg Asp Gly
1040 1045 1050
Ser Pro Gly Ala Lys Gly Asp Arg Gly Glu Thr Gly Pro Ala Gly
1055 1060 1065
Pro Pro Gly Ala Pro Gly Ala Pro Gly Ala Pro Gly Pro Val Gly
1070 1075 1080
Pro Ala Gly Lys Ser Gly Asp Arg Gly Glu Thr Gly Pro Ala Gly
1085 1090 1095
Pro Ala Gly Pro Val Gly Pro Ala Gly Ala Arg Gly Pro Ala Gly
1100 1105 1110
Pro Gln Gly Pro Arg Gly Asp Lys Gly Glu Thr Gly Glu Gln Gly
1115 1120 1125
Asp Arg Gly Ile Lys Gly His Arg Gly Phe Ser Gly Leu Gln Gly
1130 1135 1140
Pro Pro Gly Pro Pro Gly Ser Pro Gly Glu Gln Gly Pro Ser Gly
1145 1150 1155
Ala Ser Gly Pro Ala Gly Pro Arg Gly Pro Pro Gly Ser Ala Gly
1160 1165 1170
Ala Pro Gly Lys Asp Gly Leu Asn Gly Leu Pro Gly Pro Ile Gly
1175 1180 1185
Pro Pro Gly Pro Arg Gly Arg Thr Gly Asp Ala Gly Pro Val Gly
1190 1195 1200
Pro Pro Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Pro Pro Ser
1205 1210 1215
Ala Gly Phe Asp Phe Ser Phe Leu Pro Gln Pro Pro Gln Glu Lys
1220 1225 1230
Ala His Asp Gly Gly Arg Tyr Tyr Arg Ala Asp Asp Ala Asn Val
1235 1240 1245
Val Arg Asp Arg Asp Leu Glu Val Asp Thr Thr Leu Lys Ser Leu
1250 1255 1260
Ser Gln Gln Ile Glu Asn Ile Arg Ser Pro Glu Gly Ser Arg Lys
1265 1270 1275
Asn Pro Ala Arg Thr Cys Arg Asp Leu Lys Met Cys His Ser Asp
1280 1285 1290
Trp Lys Ser Gly Glu Tyr Trp Ile Asp Pro Asn Gln Gly Cys Asn
1295 1300 1305
Leu Asp Ala Ile Lys Val Phe Cys Asn Met Glu Thr Gly Glu Thr
1310 1315 1320
Cys Val Tyr Pro Thr Gln Pro Ser Val Ala Gln Lys Asn Trp Tyr
1325 1330 1335
Ile Ser Lys Asn Pro Lys Asp Lys Arg His Val Trp Phe Gly Glu
1340 1345 1350
Ser Met Thr Asp Gly Phe Gln Phe Glu Tyr Gly Gly Gln Gly Ser
1355 1360 1365
Asp Pro Ala Asp Val Ala Ile Gln Leu Thr Phe Leu Arg Leu Met
1370 1375 1380
Ser Thr Glu Ala Ser Gln Asn Ile Thr Tyr His Cys Lys Asn Ser
1385 1390 1395
Val Ala Tyr Met Asp Gln Gln Thr Gly Asn Leu Lys Lys Ala Leu
1400 1405 1410
Leu Leu Lys Gly Ser Asn Glu Ile Glu Ile Arg Ala Glu Gly Asn
1415 1420 1425
Ser Arg Phe Thr Tyr Ser Val Thr Val Asp Gly Cys Thr Ser His
1430 1435 1440
Thr Gly Ala Trp Gly Lys Thr Val Ile Glu Tyr Lys Thr Thr Lys
1445 1450 1455
Thr Ser Arg Leu Pro Ile Ile Asp Val Ala Pro Leu Asp Val Gly
1460 1465 1470
Ala Pro Asp Gln Glu Phe Gly Phe Asp Val Gly Pro Val Cys Phe
1475 1480 1485
Leu
<210> 4
<211> 4362
<212> DNA
<213> Artificial sequence
<220>
<223> Synthetic sequence containing the coding regions of the vacuolar
signal sequence of barley gene for Thiol protease aleurain
precursor fused to the human Collagen alpha 2(I) chain and
flanking regions
<400> 4
gcgatgcatg taatgtcatg agccacatga tccaatggcc acaggaacgt aagaatgtag 60
atagatttga ttttgtccgt tagatagcaa acaacattat aaaaggtgtg tatcaatacg 120
aactaattca ctcattggat tcatagaagt ccattcctcc taagtatcta aaccatggct 180
cacgctcgtg ttctcctcct cgctctcgct gttttggcaa cagctgctgt ggctgtggct 240
tcaagttcta gttttgctga ttccaaccca attcgtccag ttactgatag agcagcttcc 300
actttggctc aattgcttca agaagaaact gtgaggaagg gccctgctgg cgataggggc 360
cctaggggcg aaaggggtcc accaggacct ccaggcaggg atggcgaaga tggtccaact 420
ggccctcctg gacctcctgg ccctccaggg ccacccggct tgggcggaaa cttcgcagct 480
caatacgatg gcaagggtgt tggtcttggt cctggtccta tgggcttgat gggacctaga 540
ggcccacctg gtgctgctgg tgctcctgga ccacagggtt ttcagggacc agctggcgag 600
ccaggagagc caggccaaac aggaccagct ggtgcaaggg gacctgctgg acctcctgga 660
aaagctggtg aagatggtca cccaggcaaa ccaggacgtc ctggcgaaag aggtgttgtt 720
ggaccacaag gcgctagggg atttccaggt acacctggat tgccaggttt taagggcatt 780
cgtggtcata acggcctcga tggattgaag ggacagcctg gcgcacctgg cgttaagggt 840
gaacctggag caccaggtga aaacggtact cctggccaga ctggtgcaag aggactccca 900
ggtgaaaggg gtagagttgg tgctcctgga cctgctggag ctaggggtag tgatggtagt 960
gttggtcctg tgggccctgc tggtccaatc ggttccgctg gcccacctgg attcccaggc 1020
gctccaggac ctaaaggaga aatcggtgct gtgggtaacg caggtcctac tggtccagca 1080
ggtcctcgtg gagaagtggg attgccagga ctttctggtc cagtgggccc tccaggcaac 1140
cctggagcta acggcttgac aggagctaaa ggcgcagcag gactccctgg agtggctggc 1200
gcaccaggat tgcctggtcc aaggggtatc ccaggccctg ttggcgcagc tggagctact 1260
ggtgcacgtg gacttgttgg cgaaccaggc cctgctggat caaaaggcga gtctggaaat 1320
aagggagaac ctggttctgc tggacctcaa ggtcctcctg gaccttctgg agaagaagga 1380
aaaaggggac caaatggcga ggctggatca gcaggtccac caggaccacc tggacttcgt 1440
ggatcccctg gtagtagagg acttccaggc gctgatggta gagcaggcgt tatgggacca 1500
ccaggaagta gaggagcatc cggtccagca ggagttaggg gtcctaacgg agatgctggt 1560
agaccaggtg aaccaggtct tatgggccca aggggcctcc caggtagtcc aggaaatatc 1620
ggccctgctg gaaaagaagg ccctgttgga cttccaggta ttgatggacg tcctggccct 1680
attggcccag caggtgcaag aggagaacct ggcaatattg gatttccagg accaaagggt 1740
ccaacaggcg atcctggaaa aaatggagat aagggtcatg ctggattggc aggcgcaagg 1800
ggcgctcctg gtccagatgg aaacaacggc gcacagggtc cacctggccc tcagggtgtt 1860
caaggcggaa aaggcgaaca aggcccagct ggaccaccag gctttcaagg cttgccagga 1920
ccaagtggtc cagcaggtga agttggcaag ccaggcgagc gtggacttca tggcgagttt 1980
ggactccctg gaccagcagg accaaggggt gaaagaggcc ctcctggaga gagtggcgct 2040
gctggaccaa caggcccaat cggtagtaga ggtcctagtg gacctccagg cccagatgga 2100
aataagggtg aaccaggagt tgtgggcgct gttggaacag ctggtccttc aggaccatca 2160
ggactcccag gcgagagagg cgctgctggc attcctggag gaaaaggtga aaaaggcgaa 2220
cctggcctcc gtggcgaaat cggaaatcct ggacgtgatg gtgctcgtgg tgcacacggc 2280
gctgtgggcg ctccaggccc tgctggtgct actggtgata gaggagaggc tggcgcagct 2340
ggcccagcag gtcctgctgg cccaaggggt agtcctggtg aaagaggcga agttggacct 2400
gctggcccta acggctttgc tggccctgct ggagcagcag gtcaacctgg cgctaaaggt 2460
gaaaggggcg gaaagggccc aaaaggtgaa aatggcgttg tgggaccaac tggtccagtg 2520
ggcgcagctg gacctgctgg tccaaatgga ccaccaggac cagcaggtag tagaggagat 2580
ggtggacctc caggaatgac aggttttcca ggtgctgctg gtagaacagg acctcctggt 2640
cctagtggta tttctggtcc accaggacca ccaggtcctg ctggaaaaga aggattgagg 2700
ggtccacgtg gtgatcaagg accagtgggc agaactggtg aagttggcgc agtgggacca 2760
cctggttttg ctggagaaaa gggcccttct ggagaggcag gaacagctgg tcctcctggt 2820
acacctggac ctcaaggact tttgggtgca cctggtattc tcggattgcc aggaagtagg 2880
ggcgaacgtg gacttcctgg cgtggcagga gcagttggag aacctggccc tctcggaatc 2940
gcaggcccac caggcgcaag aggaccacca ggagctgttg gatcaccagg cgtgaatggt 3000
gcacctggcg aggctggtcg tgatggaaac ccaggaaatg atggcccacc aggaagagat 3060
ggtcaacctg gacacaaagg cgagaggggc tacccaggaa atattggccc agttggtgct 3120
gctggcgcac caggcccaca cggtccagtt ggaccagcag gaaaacacgg taatcgtggc 3180
gaaacaggcc cttcaggccc agtgggacct gctggtgctg ttggcccaag aggaccatct 3240
ggacctcaag gcattagagg cgataaggga gagcctggcg aaaaaggacc tagaggcttg 3300
cctggtttta aaggacacaa cggtctccaa ggacttccag gtatcgctgg tcatcatgga 3360
gatcagggtg ctcctggatc agtgggtcca gcaggtccta gaggcccagc aggcccttcc 3420
ggtccagcag gaaaggatgg acgtactggc caccctggaa ctgtgggccc tgctggaatt 3480
agaggtcctc aaggtcatca gggccctgct ggccctccag gtccaccagg tcctccaggc 3540
ccaccaggag tttcaggtgg tggttacgat tttggttacg atggtgattt ttaccgtgct 3600
gatcaaccta gaagtgctcc ttctctccgt cctaaagatt atgaagttga tgctactttg 3660
aaatcactta acaaccagat tgagactctt ctcacacctg agggatcaag aaagaatcca 3720
gcacgtacat gccgtgatct cagacttagt cacccagagt ggtcaagtgg ctattattgg 3780
attgatccta atcagggttg tacaatggag gctatcaaag tttactgtga ttttccaact 3840
ggagagacat gtattagggc acaacctgag aacattccag ctaaaaattg gtatcgttcc 3900
tctaaagata agaaacatgt ttggctcgga gagactatta acgctggttc tcagttcgag 3960
tataatgttg agggcgttac ttctaaagag atggcaactc agctcgcttt tatgagattg 4020
ctcgctaact acgcatccca aaacatcact tatcactgca aaaattccat tgcatatatg 4080
gatgaggaga caggaaattt gaagaaagca gttattctcc aaggtagtaa cgatgttgag 4140
cttgtggctg agggaaatag tagattcact tacacagttt tggtggatgg atgctcaaag 4200
aaaactaatg agtggggcaa gacaatcatt gagtacaaga caaataagcc ttctaggctc 4260
ccatttctcg atattgcacc tcttgatatc ggaggagctg atcacgagtt ttttgttgat 4320
atcggacctg tttgttttaa gtaatgagct cgcggccgca tc 4362
<210> 5
<211> 4362
<212> DNA
<213> Artificial sequence
<220>
<223> Synthetic sequence of the vacuolar signal sequence of barley gene
for Thiol protease aleurain precursor fused to the human Collagen
alpha 2(I) chain and flanking regions
<220>
<221> CDS
<222> (175)..(4344)
<400> 5
gcgatgcatg taatgtcatg agccacatga tccaatggcc acaggaacgt aagaatgtag 60
atagatttga ttttgtccgt tagatagcaa acaacattat aaaaggtgtg tatcaatacg 120
aactaattca ctcattggat tcatagaagt ccattcctcc taagtatcta aacc atg 177
Met
1
gct cac gct cgt gtt ctc ctc ctc gct ctc gct gtt ttg gca aca gct 225
Ala His Ala Arg Val Leu Leu Leu Ala Leu Ala Val Leu Ala Thr Ala
5 10 15
gct gtg gct gtg gct tca agt tct agt ttt gct gat tcc aac cca att 273
Ala Val Ala Val Ala Ser Ser Ser Ser Phe Ala Asp Ser Asn Pro Ile
20 25 30
cgt cca gtt act gat aga gca gct tcc act ttg gct caa ttg ctt caa 321
Arg Pro Val Thr Asp Arg Ala Ala Ser Thr Leu Ala Gln Leu Leu Gln
35 40 45
gaa gaa act gtg agg aag ggc cct gct ggc gat agg ggc cct agg ggc 369
Glu Glu Thr Val Arg Lys Gly Pro Ala Gly Asp Arg Gly Pro Arg Gly
50 55 60 65
gaa agg ggt cca cca gga cct cca ggc agg gat ggc gaa gat ggt cca 417
Glu Arg Gly Pro Pro Gly Pro Pro Gly Arg Asp Gly Glu Asp Gly Pro
70 75 80
act ggc cct cct gga cct cct ggc cct cca ggg cca ccc ggc ttg ggc 465
Thr Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Leu Gly
85 90 95
gga aac ttc gca gct caa tac gat ggc aag ggt gtt ggt ctt ggt cct 513
Gly Asn Phe Ala Ala Gln Tyr Asp Gly Lys Gly Val Gly Leu Gly Pro
100 105 110
ggt cct atg ggc ttg atg gga cct aga ggc cca cct ggt gct gct ggt 561
Gly Pro Met Gly Leu Met Gly Pro Arg Gly Pro Pro Gly Ala Ala Gly
115 120 125
gct cct gga cca cag ggt ttt cag gga cca gct ggc gag cca gga gag 609
Ala Pro Gly Pro Gln Gly Phe Gln Gly Pro Ala Gly Glu Pro Gly Glu
130 135 140 145
cca ggc caa aca gga cca gct ggt gca agg gga cct gct gga cct cct 657
Pro Gly Gln Thr Gly Pro Ala Gly Ala Arg Gly Pro Ala Gly Pro Pro
150 155 160
gga aaa gct ggt gaa gat ggt cac cca ggc aaa cca gga cgt cct ggc 705
Gly Lys Ala Gly Glu Asp Gly His Pro Gly Lys Pro Gly Arg Pro Gly
165 170 175
gaa aga ggt gtt gtt gga cca caa ggc gct agg gga ttt cca ggt aca 753
Glu Arg Gly Val Val Gly Pro Gln Gly Ala Arg Gly Phe Pro Gly Thr
180 185 190
cct gga ttg cca ggt ttt aag ggc att cgt ggt cat aac ggc ctc gat 801
Pro Gly Leu Pro Gly Phe Lys Gly Ile Arg Gly His Asn Gly Leu Asp
195 200 205
gga ttg aag gga cag cct ggc gca cct ggc gtt aag ggt gaa cct gga 849
Gly Leu Lys Gly Gln Pro Gly Ala Pro Gly Val Lys Gly Glu Pro Gly
210 215 220 225
gca cca ggt gaa aac ggt act cct ggc cag act ggt gca aga gga ctc 897
Ala Pro Gly Glu Asn Gly Thr Pro Gly Gln Thr Gly Ala Arg Gly Leu
230 235 240
cca ggt gaa agg ggt aga gtt ggt gct cct gga cct gct gga gct agg 945
Pro Gly Glu Arg Gly Arg Val Gly Ala Pro Gly Pro Ala Gly Ala Arg
245 250 255
ggt agt gat ggt agt gtt ggt cct gtg ggc cct gct ggt cca atc ggt 993
Gly Ser Asp Gly Ser Val Gly Pro Val Gly Pro Ala Gly Pro Ile Gly
260 265 270
tcc gct ggc cca cct gga ttc cca ggc gct cca gga cct aaa gga gaa 1041
Ser Ala Gly Pro Pro Gly Phe Pro Gly Ala Pro Gly Pro Lys Gly Glu
275 280 285
atc ggt gct gtg ggt aac gca ggt cct act ggt cca gca ggt cct cgt 1089
Ile Gly Ala Val Gly Asn Ala Gly Pro Thr Gly Pro Ala Gly Pro Arg
290 295 300 305
gga gaa gtg gga ttg cca gga ctt tct ggt cca gtg ggc cct cca ggc 1137
Gly Glu Val Gly Leu Pro Gly Leu Ser Gly Pro Val Gly Pro Pro Gly
310 315 320
aac cct gga gct aac ggc ttg aca gga gct aaa ggc gca gca gga ctc 1185
Asn Pro Gly Ala Asn Gly Leu Thr Gly Ala Lys Gly Ala Ala Gly Leu
325 330 335
cct gga gtg gct ggc gca cca gga ttg cct ggt cca agg ggt atc cca 1233
Pro Gly Val Ala Gly Ala Pro Gly Leu Pro Gly Pro Arg Gly Ile Pro
340 345 350
ggc cct gtt ggc gca gct gga gct act ggt gca cgt gga ctt gtt ggc 1281
Gly Pro Val Gly Ala Ala Gly Ala Thr Gly Ala Arg Gly Leu Val Gly
355 360 365
gaa cca ggc cct gct gga tca aaa ggc gag tct gga aat aag gga gaa 1329
Glu Pro Gly Pro Ala Gly Ser Lys Gly Glu Ser Gly Asn Lys Gly Glu
370 375 380 385
cct ggt tct gct gga cct caa ggt cct cct gga cct tct gga gaa gaa 1377
Pro Gly Ser Ala Gly Pro Gln Gly Pro Pro Gly Pro Ser Gly Glu Glu
390 395 400
gga aaa agg gga cca aat ggc gag gct gga tca gca ggt cca cca gga 1425
Gly Lys Arg Gly Pro Asn Gly Glu Ala Gly Ser Ala Gly Pro Pro Gly
405 410 415
cca cct gga ctt cgt gga tcc cct ggt agt aga gga ctt cca ggc gct 1473
Pro Pro Gly Leu Arg Gly Ser Pro Gly Ser Arg Gly Leu Pro Gly Ala
420 425 430
gat ggt aga gca ggc gtt atg gga cca cca gga agt aga gga gca tcc 1521
Asp Gly Arg Ala Gly Val Met Gly Pro Pro Gly Ser Arg Gly Ala Ser
435 440 445
ggt cca gca gga gtt agg ggt cct aac gga gat gct ggt aga cca ggt 1569
Gly Pro Ala Gly Val Arg Gly Pro Asn Gly Asp Ala Gly Arg Pro Gly
450 455 460 465
gaa cca ggt ctt atg ggc cca agg ggc ctc cca ggt agt cca gga aat 1617
Glu Pro Gly Leu Met Gly Pro Arg Gly Leu Pro Gly Ser Pro Gly Asn
470 475 480
atc ggc cct gct gga aaa gaa ggc cct gtt gga ctt cca ggt att gat 1665
Ile Gly Pro Ala Gly Lys Glu Gly Pro Val Gly Leu Pro Gly Ile Asp
485 490 495
gga cgt cct ggc cct att ggc cca gca ggt gca aga gga gaa cct ggc 1713
Gly Arg Pro Gly Pro Ile Gly Pro Ala Gly Ala Arg Gly Glu Pro Gly
500 505 510
aat att gga ttt cca gga cca aag ggt cca aca ggc gat cct gga aaa 1761
Asn Ile Gly Phe Pro Gly Pro Lys Gly Pro Thr Gly Asp Pro Gly Lys
515 520 525
aat gga gat aag ggt cat gct gga ttg gca ggc gca agg ggc gct cct 1809
Asn Gly Asp Lys Gly His Ala Gly Leu Ala Gly Ala Arg Gly Ala Pro
530 535 540 545
ggt cca gat gga aac aac ggc gca cag ggt cca cct ggc cct cag ggt 1857
Gly Pro Asp Gly Asn Asn Gly Ala Gln Gly Pro Pro Gly Pro Gln Gly
550 555 560
gtt caa ggc gga aaa ggc gaa caa ggc cca gct gga cca cca ggc ttt 1905
Val Gln Gly Gly Lys Gly Glu Gln Gly Pro Ala Gly Pro Pro Gly Phe
565 570 575
caa ggc ttg cca gga cca agt ggt cca gca ggt gaa gtt ggc aag cca 1953
Gln Gly Leu Pro Gly Pro Ser Gly Pro Ala Gly Glu Val Gly Lys Pro
580 585 590
ggc gag cgt gga ctt cat ggc gag ttt gga ctc cct gga cca gca gga 2001
Gly Glu Arg Gly Leu His Gly Glu Phe Gly Leu Pro Gly Pro Ala Gly
595 600 605
cca agg ggt gaa aga ggc cct cct gga gag agt ggc gct gct gga cca 2049
Pro Arg Gly Glu Arg Gly Pro Pro Gly Glu Ser Gly Ala Ala Gly Pro
610 615 620 625
aca ggc cca atc ggt agt aga ggt cct agt gga cct cca ggc cca gat 2097
Thr Gly Pro Ile Gly Ser Arg Gly Pro Ser Gly Pro Pro Gly Pro Asp
630 635 640
gga aat aag ggt gaa cca gga gtt gtg ggc gct gtt gga aca gct ggt 2145
Gly Asn Lys Gly Glu Pro Gly Val Val Gly Ala Val Gly Thr Ala Gly
645 650 655
cct tca gga cca tca gga ctc cca ggc gag aga ggc gct gct ggc att 2193
Pro Ser Gly Pro Ser Gly Leu Pro Gly Glu Arg Gly Ala Ala Gly Ile
660 665 670
cct gga gga aaa ggt gaa aaa ggc gaa cct ggc ctc cgt ggc gaa atc 2241
Pro Gly Gly Lys Gly Glu Lys Gly Glu Pro Gly Leu Arg Gly Glu Ile
675 680 685
gga aat cct gga cgt gat ggt gct cgt ggt gca cac ggc gct gtg ggc 2289
Gly Asn Pro Gly Arg Asp Gly Ala Arg Gly Ala His Gly Ala Val Gly
690 695 700 705
gct cca ggc cct gct ggt gct act ggt gat aga gga gag gct ggc gca 2337
Ala Pro Gly Pro Ala Gly Ala Thr Gly Asp Arg Gly Glu Ala Gly Ala
710 715 720
gct ggc cca gca ggt cct gct ggc cca agg ggt agt cct ggt gaa aga 2385
Ala Gly Pro Ala Gly Pro Ala Gly Pro Arg Gly Ser Pro Gly Glu Arg
725 730 735
ggc gaa gtt gga cct gct ggc cct aac ggc ttt gct ggc cct gct gga 2433
Gly Glu Val Gly Pro Ala Gly Pro Asn Gly Phe Ala Gly Pro Ala Gly
740 745 750
gca gca ggt caa cct ggc gct aaa ggt gaa agg ggc gga aag ggc cca 2481
Ala Ala Gly Gln Pro Gly Ala Lys Gly Glu Arg Gly Gly Lys Gly Pro
755 760 765
aaa ggt gaa aat ggc gtt gtg gga cca act ggt cca gtg ggc gca gct 2529
Lys Gly Glu Asn Gly Val Val Gly Pro Thr Gly Pro Val Gly Ala Ala
770 775 780 785
gga cct gct ggt cca aat gga cca cca gga cca gca ggt agt aga gga 2577
Gly Pro Ala Gly Pro Asn Gly Pro Pro Gly Pro Ala Gly Ser Arg Gly
790 795 800
gat ggt gga cct cca gga atg aca ggt ttt cca ggt gct gct ggt aga 2625
Asp Gly Gly Pro Pro Gly Met Thr Gly Phe Pro Gly Ala Ala Gly Arg
805 810 815
aca gga cct cct ggt cct agt ggt att tct ggt cca cca gga cca cca 2673
Thr Gly Pro Pro Gly Pro Ser Gly Ile Ser Gly Pro Pro Gly Pro Pro
820 825 830
ggt cct gct gga aaa gaa gga ttg agg ggt cca cgt ggt gat caa gga 2721
Gly Pro Ala Gly Lys Glu Gly Leu Arg Gly Pro Arg Gly Asp Gln Gly
835 840 845
cca gtg ggc aga act ggt gaa gtt ggc gca gtg gga cca cct ggt ttt 2769
Pro Val Gly Arg Thr Gly Glu Val Gly Ala Val Gly Pro Pro Gly Phe
850 855 860 865
gct gga gaa aag ggc cct tct gga gag gca gga aca gct ggt cct cct 2817
Ala Gly Glu Lys Gly Pro Ser Gly Glu Ala Gly Thr Ala Gly Pro Pro
870 875 880
ggt aca cct gga cct caa gga ctt ttg ggt gca cct ggt att ctc gga 2865
Gly Thr Pro Gly Pro Gln Gly Leu Leu Gly Ala Pro Gly Ile Leu Gly
885 890 895
ttg cca gga agt agg ggc gaa cgt gga ctt cct ggc gtg gca gga gca 2913
Leu Pro Gly Ser Arg Gly Glu Arg Gly Leu Pro Gly Val Ala Gly Ala
900 905 910
gtt gga gaa cct ggc cct ctc gga atc gca ggc cca cca ggc gca aga 2961
Val Gly Glu Pro Gly Pro Leu Gly Ile Ala Gly Pro Pro Gly Ala Arg
915 920 925
gga cca cca gga gct gtt gga tca cca ggc gtg aat ggt gca cct ggc 3009
Gly Pro Pro Gly Ala Val Gly Ser Pro Gly Val Asn Gly Ala Pro Gly
930 935 940 945
gag gct ggt cgt gat gga aac cca gga aat gat ggc cca cca gga aga 3057
Glu Ala Gly Arg Asp Gly Asn Pro Gly Asn Asp Gly Pro Pro Gly Arg
950 955 960
gat ggt caa cct gga cac aaa ggc gag agg ggc tac cca gga aat att 3105
Asp Gly Gln Pro Gly His Lys Gly Glu Arg Gly Tyr Pro Gly Asn Ile
965 970 975
ggc cca gtt ggt gct gct ggc gca cca ggc cca cac ggt cca gtt gga 3153
Gly Pro Val Gly Ala Ala Gly Ala Pro Gly Pro His Gly Pro Val Gly
980 985 990
cca gca gga aaa cac ggt aat cgt ggc gaa aca ggc cct tca ggc cca 3201
Pro Ala Gly Lys His Gly Asn Arg Gly Glu Thr Gly Pro Ser Gly Pro
995 1000 1005
gtg gga cct gct ggt gct gtt ggc cca aga gga cca tct gga cct 3246
Val Gly Pro Ala Gly Ala Val Gly Pro Arg Gly Pro Ser Gly Pro
1010 1015 1020
caa ggc att aga ggc gat aag gga gag cct ggc gaa aaa gga cct 3291
Gln Gly Ile Arg Gly Asp Lys Gly Glu Pro Gly Glu Lys Gly Pro
1025 1030 1035
aga ggc ttg cct ggt ttt aaa gga cac aac ggt ctc caa gga ctt 3336
Arg Gly Leu Pro Gly Phe Lys Gly His Asn Gly Leu Gln Gly Leu
1040 1045 1050
cca ggt atc gct ggt cat cat gga gat cag ggt gct cct gga tca 3381
Pro Gly Ile Ala Gly His His Gly Asp Gln Gly Ala Pro Gly Ser
1055 1060 1065
gtg ggt cca gca ggt cct aga ggc cca gca ggc cct tcc ggt cca 3426
Val Gly Pro Ala Gly Pro Arg Gly Pro Ala Gly Pro Ser Gly Pro
1070 1075 1080
gca gga aag gat gga cgt act ggc cac cct gga act gtg ggc cct 3471
Ala Gly Lys Asp Gly Arg Thr Gly His Pro Gly Thr Val Gly Pro
1085 1090 1095
gct gga att aga ggt cct caa ggt cat cag ggc cct gct ggc cct 3516
Ala Gly Ile Arg Gly Pro Gln Gly His Gln Gly Pro Ala Gly Pro
1100 1105 1110
cca ggt cca cca ggt cct cca ggc cca cca gga gtt tca ggt ggt 3561
Pro Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Val Ser Gly Gly
1115 1120 1125
ggt tac gat ttt ggt tac gat ggt gat ttt tac cgt gct gat caa 3606
Gly Tyr Asp Phe Gly Tyr Asp Gly Asp Phe Tyr Arg Ala Asp Gln
1130 1135 1140
cct aga agt gct cct tct ctc cgt cct aaa gat tat gaa gtt gat 3651
Pro Arg Ser Ala Pro Ser Leu Arg Pro Lys Asp Tyr Glu Val Asp
1145 1150 1155
gct act ttg aaa tca ctt aac aac cag att gag act ctt ctc aca 3696
Ala Thr Leu Lys Ser Leu Asn Asn Gln Ile Glu Thr Leu Leu Thr
1160 1165 1170
cct gag gga tca aga aag aat cca gca cgt aca tgc cgt gat ctc 3741
Pro Glu Gly Ser Arg Lys Asn Pro Ala Arg Thr Cys Arg Asp Leu
1175 1180 1185
aga ctt agt cac cca gag tgg tca agt ggc tat tat tgg att gat 3786
Arg Leu Ser His Pro Glu Trp Ser Ser Gly Tyr Tyr Trp Ile Asp
1190 1195 1200
cct aat cag ggt tgt aca atg gag gct atc aaa gtt tac tgt gat 3831
Pro Asn Gln Gly Cys Thr Met Glu Ala Ile Lys Val Tyr Cys Asp
1205 1210 1215
ttt cca act gga gag aca tgt att agg gca caa cct gag aac att 3876
Phe Pro Thr Gly Glu Thr Cys Ile Arg Ala Gln Pro Glu Asn Ile
1220 1225 1230
cca gct aaa aat tgg tat cgt tcc tct aaa gat aag aaa cat gtt 3921
Pro Ala Lys Asn Trp Tyr Arg Ser Ser Lys Asp Lys Lys His Val
1235 1240 1245
tgg ctc gga gag act att aac gct ggt tct cag ttc gag tat aat 3966
Trp Leu Gly Glu Thr Ile Asn Ala Gly Ser Gln Phe Glu Tyr Asn
1250 1255 1260
gtt gag ggc gtt act tct aaa gag atg gca act cag ctc gct ttt 4011
Val Glu Gly Val Thr Ser Lys Glu Met Ala Thr Gln Leu Ala Phe
1265 1270 1275
atg aga ttg ctc gct aac tac gca tcc caa aac atc act tat cac 4056
Met Arg Leu Leu Ala Asn Tyr Ala Ser Gln Asn Ile Thr Tyr His
1280 1285 1290
tgc aaa aat tcc att gca tat atg gat gag gag aca gga aat ttg 4101
Cys Lys Asn Ser Ile Ala Tyr Met Asp Glu Glu Thr Gly Asn Leu
1295 1300 1305
aag aaa gca gtt att ctc caa ggt agt aac gat gtt gag ctt gtg 4146
Lys Lys Ala Val Ile Leu Gln Gly Ser Asn Asp Val Glu Leu Val
1310 1315 1320
gct gag gga aat agt aga ttc act tac aca gtt ttg gtg gat gga 4191
Ala Glu Gly Asn Ser Arg Phe Thr Tyr Thr Val Leu Val Asp Gly
1325 1330 1335
tgc tca aag aaa act aat gag tgg ggc aag aca atc att gag tac 4236
Cys Ser Lys Lys Thr Asn Glu Trp Gly Lys Thr Ile Ile Glu Tyr
1340 1345 1350
aag aca aat aag cct tct agg ctc cca ttt ctc gat att gca cct 4281
Lys Thr Asn Lys Pro Ser Arg Leu Pro Phe Leu Asp Ile Ala Pro
1355 1360 1365
ctt gat atc gga gga gct gat cac gag ttt ttt gtt gat atc gga 4326
Leu Asp Ile Gly Gly Ala Asp His Glu Phe Phe Val Asp Ile Gly
1370 1375 1380
cct gtt tgt ttt aag taa tgagctcgcg gccgcatc 4362
Pro Val Cys Phe Lys
1385
<210> 6
<211> 1389
<212> PRT
<213> Artificial sequence
<220>
<223> Synthetic sequence containing the coding regions of the vacuolar
signal sequence of barley gene for Thiol protease aleurain
precursor fused to the human Collagen alpha 2(I) chain and
flanking regions
<400> 6
Met Ala His Ala Arg Val Leu Leu Leu Ala Leu Ala Val Leu Ala Thr
1 5 10 15
Ala Ala Val Ala Val Ala Ser Ser Ser Ser Phe Ala Asp Ser Asn Pro
20 25 30
Ile Arg Pro Val Thr Asp Arg Ala Ala Ser Thr Leu Ala Gln Leu Leu
35 40 45
Gln Glu Glu Thr Val Arg Lys Gly Pro Ala Gly Asp Arg Gly Pro Arg
50 55 60
Gly Glu Arg Gly Pro Pro Gly Pro Pro Gly Arg Asp Gly Glu Asp Gly
65 70 75 80
Pro Thr Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Leu
85 90 95
Gly Gly Asn Phe Ala Ala Gln Tyr Asp Gly Lys Gly Val Gly Leu Gly
100 105 110
Pro Gly Pro Met Gly Leu Met Gly Pro Arg Gly Pro Pro Gly Ala Ala
115 120 125
Gly Ala Pro Gly Pro Gln Gly Phe Gln Gly Pro Ala Gly Glu Pro Gly
130 135 140
Glu Pro Gly Gln Thr Gly Pro Ala Gly Ala Arg Gly Pro Ala Gly Pro
145 150 155 160
Pro Gly Lys Ala Gly Glu Asp Gly His Pro Gly Lys Pro Gly Arg Pro
165 170 175
Gly Glu Arg Gly Val Val Gly Pro Gln Gly Ala Arg Gly Phe Pro Gly
180 185 190
Thr Pro Gly Leu Pro Gly Phe Lys Gly Ile Arg Gly His Asn Gly Leu
195 200 205
Asp Gly Leu Lys Gly Gln Pro Gly Ala Pro Gly Val Lys Gly Glu Pro
210 215 220
Gly Ala Pro Gly Glu Asn Gly Thr Pro Gly Gln Thr Gly Ala Arg Gly
225 230 235 240
Leu Pro Gly Glu Arg Gly Arg Val Gly Ala Pro Gly Pro Ala Gly Ala
245 250 255
Arg Gly Ser Asp Gly Ser Val Gly Pro Val Gly Pro Ala Gly Pro Ile
260 265 270
Gly Ser Ala Gly Pro Pro Gly Phe Pro Gly Ala Pro Gly Pro Lys Gly
275 280 285
Glu Ile Gly Ala Val Gly Asn Ala Gly Pro Thr Gly Pro Ala Gly Pro
290 295 300
Arg Gly Glu Val Gly Leu Pro Gly Leu Ser Gly Pro Val Gly Pro Pro
305 310 315 320
Gly Asn Pro Gly Ala Asn Gly Leu Thr Gly Ala Lys Gly Ala Ala Gly
325 330 335
Leu Pro Gly Val Ala Gly Ala Pro Gly Leu Pro Gly Pro Arg Gly Ile
340 345 350
Pro Gly Pro Val Gly Ala Ala Gly Ala Thr Gly Ala Arg Gly Leu Val
355 360 365
Gly Glu Pro Gly Pro Ala Gly Ser Lys Gly Glu Ser Gly Asn Lys Gly
370 375 380
Glu Pro Gly Ser Ala Gly Pro Gln Gly Pro Pro Gly Pro Ser Gly Glu
385 390 395 400
Glu Gly Lys Arg Gly Pro Asn Gly Glu Ala Gly Ser Ala Gly Pro Pro
405 410 415
Gly Pro Pro Gly Leu Arg Gly Ser Pro Gly Ser Arg Gly Leu Pro Gly
420 425 430
Ala Asp Gly Arg Ala Gly Val Met Gly Pro Pro Gly Ser Arg Gly Ala
435 440 445
Ser Gly Pro Ala Gly Val Arg Gly Pro Asn Gly Asp Ala Gly Arg Pro
450 455 460
Gly Glu Pro Gly Leu Met Gly Pro Arg Gly Leu Pro Gly Ser Pro Gly
465 470 475 480
Asn Ile Gly Pro Ala Gly Lys Glu Gly Pro Val Gly Leu Pro Gly Ile
485 490 495
Asp Gly Arg Pro Gly Pro Ile Gly Pro Ala Gly Ala Arg Gly Glu Pro
500 505 510
Gly Asn Ile Gly Phe Pro Gly Pro Lys Gly Pro Thr Gly Asp Pro Gly
515 520 525
Lys Asn Gly Asp Lys Gly His Ala Gly Leu Ala Gly Ala Arg Gly Ala
530 535 540
Pro Gly Pro Asp Gly Asn Asn Gly Ala Gln Gly Pro Pro Gly Pro Gln
545 550 555 560
Gly Val Gln Gly Gly Lys Gly Glu Gln Gly Pro Ala Gly Pro Pro Gly
565 570 575
Phe Gln Gly Leu Pro Gly Pro Ser Gly Pro Ala Gly Glu Val Gly Lys
580 585 590
Pro Gly Glu Arg Gly Leu His Gly Glu Phe Gly Leu Pro Gly Pro Ala
595 600 605
Gly Pro Arg Gly Glu Arg Gly Pro Pro Gly Glu Ser Gly Ala Ala Gly
610 615 620
Pro Thr Gly Pro Ile Gly Ser Arg Gly Pro Ser Gly Pro Pro Gly Pro
625 630 635 640
Asp Gly Asn Lys Gly Glu Pro Gly Val Val Gly Ala Val Gly Thr Ala
645 650 655
Gly Pro Ser Gly Pro Ser Gly Leu Pro Gly Glu Arg Gly Ala Ala Gly
660 665 670
Ile Pro Gly Gly Lys Gly Glu Lys Gly Glu Pro Gly Leu Arg Gly Glu
675 680 685
Ile Gly Asn Pro Gly Arg Asp Gly Ala Arg Gly Ala His Gly Ala Val
690 695 700
Gly Ala Pro Gly Pro Ala Gly Ala Thr Gly Asp Arg Gly Glu Ala Gly
705 710 715 720
Ala Ala Gly Pro Ala Gly Pro Ala Gly Pro Arg Gly Ser Pro Gly Glu
725 730 735
Arg Gly Glu Val Gly Pro Ala Gly Pro Asn Gly Phe Ala Gly Pro Ala
740 745 750
Gly Ala Ala Gly Gln Pro Gly Ala Lys Gly Glu Arg Gly Gly Lys Gly
755 760 765
Pro Lys Gly Glu Asn Gly Val Val Gly Pro Thr Gly Pro Val Gly Ala
770 775 780
Ala Gly Pro Ala Gly Pro Asn Gly Pro Pro Gly Pro Ala Gly Ser Arg
785 790 795 800
Gly Asp Gly Gly Pro Pro Gly Met Thr Gly Phe Pro Gly Ala Ala Gly
805 810 815
Arg Thr Gly Pro Pro Gly Pro Ser Gly Ile Ser Gly Pro Pro Gly Pro
820 825 830
Pro Gly Pro Ala Gly Lys Glu Gly Leu Arg Gly Pro Arg Gly Asp Gln
835 840 845
Gly Pro Val Gly Arg Thr Gly Glu Val Gly Ala Val Gly Pro Pro Gly
850 855 860
Phe Ala Gly Glu Lys Gly Pro Ser Gly Glu Ala Gly Thr Ala Gly Pro
865 870 875 880
Pro Gly Thr Pro Gly Pro Gln Gly Leu Leu Gly Ala Pro Gly Ile Leu
885 890 895
Gly Leu Pro Gly Ser Arg Gly Glu Arg Gly Leu Pro Gly Val Ala Gly
900 905 910
Ala Val Gly Glu Pro Gly Pro Leu Gly Ile Ala Gly Pro Pro Gly Ala
915 920 925
Arg Gly Pro Pro Gly Ala Val Gly Ser Pro Gly Val Asn Gly Ala Pro
930 935 940
Gly Glu Ala Gly Arg Asp Gly Asn Pro Gly Asn Asp Gly Pro Pro Gly
945 950 955 960
Arg Asp Gly Gln Pro Gly His Lys Gly Glu Arg Gly Tyr Pro Gly Asn
965 970 975
Ile Gly Pro Val Gly Ala Ala Gly Ala Pro Gly Pro His Gly Pro Val
980 985 990
Gly Pro Ala Gly Lys His Gly Asn Arg Gly Glu Thr Gly Pro Ser Gly
995 1000 1005
Pro Val Gly Pro Ala Gly Ala Val Gly Pro Arg Gly Pro Ser Gly
1010 1015 1020
Pro Gln Gly Ile Arg Gly Asp Lys Gly Glu Pro Gly Glu Lys Gly
1025 1030 1035
Pro Arg Gly Leu Pro Gly Phe Lys Gly His Asn Gly Leu Gln Gly
1040 1045 1050
Leu Pro Gly Ile Ala Gly His His Gly Asp Gln Gly Ala Pro Gly
1055 1060 1065
Ser Val Gly Pro Ala Gly Pro Arg Gly Pro Ala Gly Pro Ser Gly
1070 1075 1080
Pro Ala Gly Lys Asp Gly Arg Thr Gly His Pro Gly Thr Val Gly
1085 1090 1095
Pro Ala Gly Ile Arg Gly Pro Gln Gly His Gln Gly Pro Ala Gly
1100 1105 1110
Pro Pro Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Val Ser Gly
1115 1120 1125
Gly Gly Tyr Asp Phe Gly Tyr Asp Gly Asp Phe Tyr Arg Ala Asp
1130 1135 1140
Gln Pro Arg Ser Ala Pro Ser Leu Arg Pro Lys Asp Tyr Glu Val
1145 1150 1155
Asp Ala Thr Leu Lys Ser Leu Asn Asn Gln Ile Glu Thr Leu Leu
1160 1165 1170
Thr Pro Glu Gly Ser Arg Lys Asn Pro Ala Arg Thr Cys Arg Asp
1175 1180 1185
Leu Arg Leu Ser His Pro Glu Trp Ser Ser Gly Tyr Tyr Trp Ile
1190 1195 1200
Asp Pro Asn Gln Gly Cys Thr Met Glu Ala Ile Lys Val Tyr Cys
1205 1210 1215
Asp Phe Pro Thr Gly Glu Thr Cys Ile Arg Ala Gln Pro Glu Asn
1220 1225 1230
Ile Pro Ala Lys Asn Trp Tyr Arg Ser Ser Lys Asp Lys Lys His
1235 1240 1245
Val Trp Leu Gly Glu Thr Ile Asn Ala Gly Ser Gln Phe Glu Tyr
1250 1255 1260
Asn Val Glu Gly Val Thr Ser Lys Glu Met Ala Thr Gln Leu Ala
1265 1270 1275
Phe Met Arg Leu Leu Ala Asn Tyr Ala Ser Gln Asn Ile Thr Tyr
1280 1285 1290
His Cys Lys Asn Ser Ile Ala Tyr Met Asp Glu Glu Thr Gly Asn
1295 1300 1305
Leu Lys Lys Ala Val Ile Leu Gln Gly Ser Asn Asp Val Glu Leu
1310 1315 1320
Val Ala Glu Gly Asn Ser Arg Phe Thr Tyr Thr Val Leu Val Asp
1325 1330 1335
Gly Cys Ser Lys Lys Thr Asn Glu Trp Gly Lys Thr Ile Ile Glu
1340 1345 1350
Tyr Lys Thr Asn Lys Pro Ser Arg Leu Pro Phe Leu Asp Ile Ala
1355 1360 1365
Pro Leu Asp Ile Gly Gly Ala Asp His Glu Phe Phe Val Asp Ile
1370 1375 1380
Gly Pro Val Cys Phe Lys
1385
<210> 7
<211> 127
<212> DNA
<213> Artificial sequence
<220>
<223> Synthetic sequence containing the coding region of the apoplast
signal of Arabidopsis thaliana endo-1,4-beta-glucanase and
flanking regions
<400> 7
gccatggcta ggaagtcttt gattttccca gtgattcttc ttgctgtgct tcttttctct 60
ccacctattt actctgctgg acacgattat agggatgctc ttaggaagtc atctatggct 120
caattgc 127
<210> 8
<211> 127
<212> DNA
<213> Artificial sequence
<220>
<223> Synthetic sequence of the apoplast signal of Arabidopsis
thaliana endo-1,4-beta-glucanase and flanking regions
<220>
<221> CDS
<222> (10)..(120)
<400> 8
gccatggct agg aag tct ttg att ttc cca gtg att ctt ctt gct gtg ctt 51
Arg Lys Ser Leu Ile Phe Pro Val Ile Leu Leu Ala Val Leu
1 5 10
ctt ttc tct cca cct att tac tct gct gga cac gat tat agg gat gct 99
Leu Phe Ser Pro Pro Ile Tyr Ser Ala Gly His Asp Tyr Arg Asp Ala
15 20 25 30
ctt agg aag tca tct atg gct caattgc 127
Leu Arg Lys Ser Ser Met Ala
35
<210> 9
<211> 37
<212> PRT
<213> Artificial sequence
<220>
<223> Synthetic sequence of the apoplast signal of Arabidopsis
thaliana endo-1,4-beta-glucanase and flanking regions
<400> 9
Arg Lys Ser Leu Ile Phe Pro Val Ile Leu Leu Ala Val Leu Leu Phe
1 5 10 15
Ser Pro Pro Ile Tyr Ser Ala Gly His Asp Tyr Arg Asp Ala Leu Arg
20 25 30
Lys Ser Ser Met Ala
35
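The relationship between SEQ ID NO: 8 and SEQ ID NO: 9 above can be checked mechanically. The following is a minimal sketch, not part of the original listing: it translates the CDS feature at positions (10)..(120) of the 127-nucleotide sequence with the standard genetic code and compares the result with the 37-residue peptide. The names `CODON_TABLE`, `translate_cds`, `SEQ_ID_8` and `SEQ_ID_9` are illustrative assumptions, and the one-letter peptide string was transcribed by hand from the three-letter listing.

```python
from itertools import product

# Standard genetic code, codons enumerated in t, c, a, g order
# (third base varies fastest); '*' marks a stop codon.
BASES = "tcag"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str, start: int, end: int) -> str:
    """Translate the 1-based, inclusive range start..end of a DNA string."""
    cds = dna[start - 1:end].lower()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Nucleotide sequence of SEQ ID NO: 8, copied from the listing with
# the grouping spaces and position counters removed.
SEQ_ID_8 = (
    "gccatggctaggaagtctttgattttcccagtgattcttcttgctgtgcttcttttctct"
    "ccacctatttactctgctggacacgattatagggatgctcttaggaagtcatctatggct"
    "caattgc"
)

# SEQ ID NO: 9 in one-letter code (hand-transcribed, hypothetical name).
SEQ_ID_9 = "RKSLIFPVILLAVLLFSPPIYSAGHDYRDALRKSSMA"

peptide = translate_cds(SEQ_ID_8, 10, 120)
print(peptide == SEQ_ID_9)
```

The same helper could be pointed at the other fused constructs in the listing, provided the CDS boundaries are taken from each entry's own feature table rather than guessed.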
<210> 10
<211> 1037
<212> DNA
<213> Artificial sequence
<220>
<223> Chrysanthemum rbcS1 promoter and 5' UTR
<400> 10
aaatggcgcg ccaagcttag acaaacaccc cttgttatac aaagaatttc gctttacaaa 60
atcaaattcg agaaaataat atatgcacta aataagatca ttcggatcca atctaaccaa 120
ttacgatacg ctttgggtac acttgatttt tgtttcagta gttacatata tcttgtttta 180
tatgctatct ttaaggatct tcactcaaag actatttgtt gatgttcttg atggggctcg 240
gaagatttga tatgatacac tctaatcttt aggagatacc agccaggatt atattcagta 300
agacaatcaa attttacgtg ttcaaactcg ttatcttttc atttaatgga tgagccagaa 360
tctctataga atgattgcaa tcgagaatat gttcggccga tatccctttg ttggcttcaa 420
tattctacat atcacacaag aatcgaccgt attgtaccct ctttccataa aggaacacac 480
agtatgcaga tgcttttttc ccacatgcag taacataggt attcaaaaat ggctaaaaga 540
agttggataa caaattgaca actatttcca tttctgttat ataaatttca caacacacaa 600
aagcccgtaa tcaagagtct gcccatgtac gaaataactt ctattatttg gtattgggcc 660
taagcccagc tcagagtacg tgggggtacc acatatagga aggtaacaaa atactgcaag 720
atagccccat aacgtaccag cctctcctta ccacgaagag ataagatata agacccaccc 780
tgccacgtgt cacatcgtca tggtggttaa tgataaggga ttacatcctt ctatgtttgt 840
ggacatgatg catgtaatgt catgagccac atgatccaat ggccacagga acgtaagaat 900
gtagatagat ttgattttgt ccgttagata gcaaacaaca ttataaaagg tgtgtatcaa 960
tacgaactaa ttcactcatt ggattcatag aagtccattc ctcctaagta tctaaacata 1020
tgcaattgtc gactaaa 1037
<210> 11
<211> 975
<212> DNA
<213> Artificial sequence
<220>
<223> Chrysanthemum rbcS1 3'UTR and terminator
<400> 11
aaaaggatcc gcggccgcat aagttttact atttaccaag acttttgaat attaaccttc 60
ttgtaacgag tcggttaaat ttgattgttt agggttttgt attatttttt tttggtcttt 120
taattcatca ctttaattcc ctaattgtct gttcatttcg ttgtttgttt ccggatcgat 180
aatgaaatgt aagagatatc atatataaat aataaattgt cgtttcatat ttgcaatctt 240
tttttacaaa cctttaatta attgtatgta tgacattttc ttcttgttat attaggggga 300
aataatgtta aataaaagta caaaataaac tacagtacat cgtactgaat aaattaccta 360
gccaaaaagt acacctttcc atatacttcc tacatgaagg cattttcaac attttcaaat 420
aaggaatgct acaaccgcat aataacatcc acaaattttt ttataaaata acatgtcaga 480
cagtgattga aagattttat tatagtttcg ttatcttctt ttctcattaa gcgaatcact 540
acctaacacg tcattttgtg aaatattttt tgaatgtttt tatatagttg tagcattcct 600
cttttcaaat tagggtttgt ttgagatagc atttcagccg gttcatacaa cttaaaagca 660
tactctaatg ctggaaaaaa gactaaaaaa tcttgtaagt tagcgcagaa tattgaccca 720
aattatatac acacatgacc ccatatagag actaattaca cttttaacca ctaataatta 780
ttactgtatt ataacatcta ctaattaaac ttgtgagttt ttgctagaat tattatcata 840
tatactaaaa ggcaggaacg caaacattgc cccggtactg tagcaactac ggtagacgca 900
ttaattgtct atagtggacg cattaattaa ccaaaaccgc ctctttcccc ttcttcttga 960
agcttgagct ctttt 975
<210> 12
<211> 1633
<212> DNA
<213> Artificial sequence
<220>
<223> Synthetic sequence containing the coding regions of the vacuolar
signal sequence of barley gene for Thiol protease aleurain
precursor fused to the human Prolyl 4-hydroxylase beta subunit
and flanking regions
<400> 12
ctcgagtaaa ccatggctca tgctagggtt ttgcttttgg ctcttgctgt tcttgctact 60
gctgctgttg ctgtggcttc ttcttcatct ttcgctgatt ctaacccaat taggccagtg 120
actgatagag ctgcttctac tcttgctcaa ttggtcgaca tggatgctcc agaagaggag 180
gatcacgttc ttgtgcttag gaagtctaac ttcgctgaag ctcttgctgc tcacaagtac 240
cttcttgtgg agttttatgc tccttggtgc ggacattgca aagctcttgc tccagagtat 300
gctaaggctg ctggaaagtt gaaggctgag ggatctgaaa ttaggcttgc taaagtggat 360
gctactgagg agtctgatct tgctcaacag tacggagtta ggggataccc aactattaag 420
ttcttcagga acggagatac tgcttctcca aaggagtata ctgctggaag ggaggctgat 480
gatattgtga actggcttaa gaagagaact ggaccagctg ctactactct tccagatgga 540
gctgctgctg aatctcttgt ggagtcatct gaggtggcag tgattggatt cttcaaggat 600
gtggagtctg attctgctaa gcagttcctt caagctgctg aggctattga tgatattcca 660
ttcggaatta cttctaactc tgatgtgttc tctaagtacc agcttgataa ggatggagtg 720
gtgcttttca agaaattcga tgagggaagg aacaatttcg agggagaggt gacaaaggag 780
aaccttcttg atttcattaa gcacaaccag cttccacttg tgattgagtt cactgagcag 840
actgctccaa agattttcgg aggagagatt aagactcaca ttcttctttt ccttccaaag 900
tctgtgtctg attacgatgg aaagttgtct aacttcaaga ctgctgctga gtctttcaag 960
ggaaagattc ttttcatttt cattgattct gatcacactg ataaccagag gattcttgag 1020
ttcttcggac ttaagaagga agagtgccca gctgttaggc ttattactct tgaggaggag 1080
atgactaagt acaagccaga gtctgaagaa cttactgctg agaggattac tgagttctgc 1140
cacagattcc ttgagggaaa gattaagcca caccttatgt ctcaagagct tccagaggat 1200
tgggataagc agccagttaa ggtgttggtg ggtaaaaact tcgaggatgt ggctttcgat 1260
gagaagaaga acgtgttcgt ggagttctac gcaccttggt gtggtcactg taagcagctt 1320
gctccaattt gggataagtt gggagagact tacaaggatc acgagaacat tgtgattgct 1380
aagatggatt ctactgctaa cgaggtggag gctgttaagg ttcactcttt cccaactttg 1440
aagttcttcc cagcttctgc tgataggact gtgattgatt acaacggaga aaggactctt 1500
gatggattca agaagttcct tgagtctgga ggacaagatg gagctggaga tgatgatgat 1560
cttgaggatt tggaagaagc tgaggagcca gatatggagg aggatgatga tcagaaggct 1620
gtgtgatgag ctc 1633
<210> 13
<211> 537
<212> PRT
<213> Artificial sequence
<220>
<223> Synthetic sequence containing the vacuolar signal sequence of
barley gene for Thiol protease aleurain precursor fused to the
human Prolyl 4-hydroxylase beta subunit and flanking regions
<400> 13
Met Ala His Ala Arg Val Leu Leu Leu Ala Leu Ala Val Leu Ala Thr
1 5 10 15
Ala Ala Val Ala Val Ala Ser Ser Ser Ser Phe Ala Asp Ser Asn Pro
20 25 30
Ile Arg Pro Val Thr Asp Arg Ala Ala Ser Thr Leu Ala Gln Leu Val
35 40 45
Asp Met Asp Ala Pro Glu Glu Glu Asp His Val Leu Val Leu Arg Lys
50 55 60
Ser Asn Phe Ala Glu Ala Leu Ala Ala His Lys Tyr Leu Leu Val Glu
65 70 75 80
Phe Tyr Ala Pro Trp Cys Gly His Cys Lys Ala Leu Ala Pro Glu Tyr
85 90 95
Ala Lys Ala Ala Gly Lys Leu Lys Ala Glu Gly Ser Glu Ile Arg Leu
100 105 110
Ala Lys Val Asp Ala Thr Glu Glu Ser Asp Leu Ala Gln Gln Tyr Gly
115 120 125
Val Arg Gly Tyr Pro Thr Ile Lys Phe Phe Arg Asn Gly Asp Thr Ala
130 135 140
Ser Pro Lys Glu Tyr Thr Ala Gly Arg Glu Ala Asp Asp Ile Val Asn
145 150 155 160
Trp Leu Lys Lys Arg Thr Gly Pro Ala Ala Thr Thr Leu Pro Asp Gly
165 170 175
Ala Ala Ala Glu Ser Leu Val Glu Ser Ser Glu Val Ala Val Ile Gly
180 185 190
Phe Phe Lys Asp Val Glu Ser Asp Ser Ala Lys Gln Phe Leu Gln Ala
195 200 205
Ala Glu Ala Ile Asp Asp Ile Pro Phe Gly Ile Thr Ser Asn Ser Asp
210 215 220
Val Phe Ser Lys Tyr Gln Leu Asp Lys Asp Gly Val Val Leu Phe Lys
225 230 235 240
Lys Phe Asp Glu Gly Arg Asn Asn Phe Glu Gly Glu Val Thr Lys Glu
245 250 255
Asn Leu Leu Asp Phe Ile Lys His Asn Gln Leu Pro Leu Val Ile Glu
260 265 270
Phe Thr Glu Gln Thr Ala Pro Lys Ile Phe Gly Gly Glu Ile Lys Thr
275 280 285
His Ile Leu Leu Phe Leu Pro Lys Ser Val Ser Asp Tyr Asp Gly Lys
290 295 300
Leu Ser Asn Phe Lys Thr Ala Ala Glu Ser Phe Lys Gly Lys Ile Leu
305 310 315 320
Phe Ile Phe Ile Asp Ser Asp His Thr Asp Asn Gln Arg Ile Leu Glu
325 330 335
Phe Phe Gly Leu Lys Lys Glu Glu Cys Pro Ala Val Arg Leu Ile Thr
340 345 350
Leu Glu Glu Glu Met Thr Lys Tyr Lys Pro Glu Ser Glu Glu Leu Thr
355 360 365
Ala Glu Arg Ile Thr Glu Phe Cys His Arg Phe Leu Glu Gly Lys Ile
370 375 380
Lys Pro His Leu Met Ser Gln Glu Leu Pro Glu Asp Trp Asp Lys Gln
385 390 395 400
Pro Val Lys Val Leu Val Gly Lys Asn Phe Glu Asp Val Ala Phe Asp
405 410 415
Glu Lys Lys Asn Val Phe Val Glu Phe Tyr Ala Pro Trp Cys Gly His
420 425 430
Cys Lys Gln Leu Ala Pro Ile Trp Asp Lys Leu Gly Glu Thr Tyr Lys
435 440 445
Asp His Glu Asn Ile Val Ile Ala Lys Met Asp Ser Thr Ala Asn Glu
450 455 460
Val Glu Ala Val Lys Val His Ser Phe Pro Thr Leu Lys Phe Phe Pro
465 470 475 480
Ala Ser Ala Asp Arg Thr Val Ile Asp Tyr Asn Gly Glu Arg Thr Leu
485 490 495
Asp Gly Phe Lys Lys Phe Leu Glu Ser Gly Gly Gln Asp Gly Ala Gly
500 505 510
Asp Asp Asp Asp Leu Glu Asp Leu Glu Glu Ala Glu Glu Pro Asp Met
515 520 525
Glu Glu Asp Asp Asp Gln Lys Ala Val
530 535
<210> 14
<211> 1723
<212> DNA
<213> Artificial sequence
<220>
<223> Synthetic sequence containing the coding regions of the vacuolar
signal sequence of barley gene for Thiol protease aleurain
precursor fused to the human Prolyl 4-hydroxylase alpha-1 subunit
and flanking regions
<400> 14
ctcgagtaaa ccatggctca tgctagggtt ttgcttttgg ctcttgctgt tcttgctact 60
gctgctgttg ctgtggcttc ttcttcatct ttcgctgatt ctaacccaat taggccagtg 120
actgatagag ctgcttctac tcttgctcaa ttggtcgaca tgcacccagg attcttcact 180
tctattggac agatgactga tcttattcac actgagaagg atcttgtgac ttctcttaag 240
gattacatta aggctgagga ggataagttg gagcagatta agaagtgggc tgagaagttg 300
gataggctta cttctactgc tacaaaagat ccagagggat tcgttggtca tccagtgaac 360
gctttcaagt tgatgaagag gcttaacact gagtggagtg agcttgagaa ccttgtgctt 420
aaggatatgt ctgatggatt catttctaac cttactattc agaggcagta cttcccaaat 480
gatgaggatc aagtgggagc tgctaaggct cttcttaggc ttcaggatac ttacaacctt 540
gatactgata caatttctaa gggaaacctt ccaggagtta agcacaagtc tttccttact 600
gctgaggatt gcttcgagct tggaaaggtt gcatacactg aggctgatta ctaccacact 660
gagctttgga tggaacaagc tcttaggcaa cttgatgagg gagagatttc tactattgat 720
aaggtgtcag tgcttgatta cctttcttac gctgtgtacc agcagggtga tcttgataag 780
gctcttttgc ttactaagaa gttgcttgag cttgatccag aacatcagag ggctaacgga 840
aaccttaagt acttcgagta cattatggct aaggaaaagg atgtgaacaa gtctgcttct 900
gatgatcagt ctgatcaaaa gactactcca aagaagaagg gagtggctgt tgattatctt 960
cctgagaggc agaagtatga gatgttgtgt aggggagagg gtattaagat gactccaagg 1020
aggcagaaga agttgttctg caggtatcac gatggaaaca ggaacccaaa gttcattctt 1080
gctccagcta agcaagaaga tgagtgggat aagccaagga ttattaggtt ccacgatatt 1140
atttctgatg ctgagattga gattgtgaag gatcttgcta agccaagact taggagggct 1200
actatttcta accctattac tggtgatctt gagactgtgc actacaggat ttctaagtct 1260
gcttggcttt ctggatacga gaacccagtg gtgtctagga ttaacatgag gattcaggat 1320
cttactggac ttgatgtgtc tactgctgag gagcttcaag ttgctaacta cggagttgga 1380
ggacaatatg agccacactt cgatttcgct aggaaggatg agccagatgc ttttaaggag 1440
cttggaactg gaaacaggat tgctacttgg cttttctaca tgtctgatgt ttctgctgga 1500
ggagctactg ttttcccaga agtgggagct tctgtttggc caaagaaggg aactgctgtg 1560
ttctggtaca accttttcgc ttctggagag ggagattact ctactaggca tgctgcttgc 1620
ccagttcttg ttggaaacaa gtgggtgtca aacaagtggc ttcatgagag gggacaagag 1680
tttagaaggc catgcactct ttctgagctt gagtgatgag ctc 1723
<210> 15
<211> 567
<212> PRT
<213> Artificial sequence
<220>
<223> Synthetic sequence containing the vacuolar signal sequence of
barley gene for Thiol protease aleurain precursor fused to the
human Prolyl 4-hydroxylase alpha-1 subunit and flanking regions
<400> 15
Met Ala His Ala Arg Val Leu Leu Leu Ala Leu Ala Val Leu Ala Thr
1 5 10 15
Ala Ala Val Ala Val Ala Ser Ser Ser Ser Phe Ala Asp Ser Asn Pro
20 25 30
Ile Arg Pro Val Thr Asp Arg Ala Ala Ser Thr Leu Ala Gln Leu Val
35 40 45
Asp Met His Pro Gly Phe Phe Thr Ser Ile Gly Gln Met Thr Asp Leu
50 55 60
Ile His Thr Glu Lys Asp Leu Val Thr Ser Leu Lys Asp Tyr Ile Lys
65 70 75 80
Ala Glu Glu Asp Lys Leu Glu Gln Ile Lys Lys Trp Ala Glu Lys Leu
85 90 95
Asp Arg Leu Thr Ser Thr Ala Thr Lys Asp Pro Glu Gly Phe Val Gly
100 105 110
His Pro Val Asn Ala Phe Lys Leu Met Lys Arg Leu Asn Thr Glu Trp
115 120 125
Ser Glu Leu Glu Asn Leu Val Leu Lys Asp Met Ser Asp Gly Phe Ile
130 135 140
Ser Asn Leu Thr Ile Gln Arg Gln Tyr Phe Pro Asn Asp Glu Asp Gln
145 150 155 160
Val Gly Ala Ala Lys Ala Leu Leu Arg Leu Gln Asp Thr Tyr Asn Leu
165 170 175
Asp Thr Asp Thr Ile Ser Lys Gly Asn Leu Pro Gly Val Lys His Lys
180 185 190
Ser Phe Leu Thr Ala Glu Asp Cys Phe Glu Leu Gly Lys Val Ala Tyr
195 200 205
Thr Glu Ala Asp Tyr Tyr His Thr Glu Leu Trp Met Glu Gln Ala Leu
210 215 220
Arg Gln Leu Asp Glu Gly Glu Ile Ser Thr Ile Asp Lys Val Ser Val
225 230 235 240
Leu Asp Tyr Leu Ser Tyr Ala Val Tyr Gln Gln Gly Asp Leu Asp Lys
245 250 255
Ala Leu Leu Leu Thr Lys Lys Leu Leu Glu Leu Asp Pro Glu His Gln
260 265 270
Arg Ala Asn Gly Asn Leu Lys Tyr Phe Glu Tyr Ile Met Ala Lys Glu
275 280 285
Lys Asp Val Asn Lys Ser Ala Ser Asp Asp Gln Ser Asp Gln Lys Thr
290 295 300
Thr Pro Lys Lys Lys Gly Val Ala Val Asp Tyr Leu Pro Glu Arg Gln
305 310 315 320
Lys Tyr Glu Met Leu Cys Arg Gly Glu Gly Ile Lys Met Thr Pro Arg
325 330 335
Arg Gln Lys Lys Leu Phe Cys Arg Tyr His Asp Gly Asn Arg Asn Pro
340 345 350
Lys Phe Ile Leu Ala Pro Ala Lys Gln Glu Asp Glu Trp Asp Lys Pro
355 360 365
Arg Ile Ile Arg Phe His Asp Ile Ile Ser Asp Ala Glu Ile Glu Ile
370 375 380
Val Lys Asp Leu Ala Lys Pro Arg Leu Arg Arg Ala Thr Ile Ser Asn
385 390 395 400
Pro Ile Thr Gly Asp Leu Glu Thr Val His Tyr Arg Ile Ser Lys Ser
405 410 415
Ala Trp Leu Ser Gly Tyr Glu Asn Pro Val Val Ser Arg Ile Asn Met
420 425 430
Arg Ile Gln Asp Leu Thr Gly Leu Asp Val Ser Thr Ala Glu Glu Leu
435 440 445
Gln Val Ala Asn Tyr Gly Val Gly Gly Gln Tyr Glu Pro His Phe Asp
450 455 460
Phe Ala Arg Lys Asp Glu Pro Asp Ala Phe Lys Glu Leu Gly Thr Gly
465 470 475 480
Asn Arg Ile Ala Thr Trp Leu Phe Tyr Met Ser Asp Val Ser Ala Gly
485 490 495
Gly Ala Thr Val Phe Pro Glu Val Gly Ala Ser Val Trp Pro Lys Lys
500 505 510
Gly Thr Ala Val Phe Trp Tyr Asn Leu Phe Ala Ser Gly Glu Gly Asp
515 520 525
Tyr Ser Thr Arg His Ala Ala Cys Pro Val Leu Val Gly Asn Lys Trp
530 535 540
Val Ser Asn Lys Trp Leu His Glu Arg Gly Gln Glu Phe Arg Arg Pro
545 550 555 560
Cys Thr Leu Ser Glu Leu Glu
565
<210> 16
<211> 928
<212> DNA
<213> Artificial sequence
<220>
<223> Synthetic sequence containing the coding regions of the vacuolar
signal sequence of barley gene for Thiol protease aleurain
precursor fused to the plant Prolyl 4-hydroxylase and
flanking regions
<400> 16
ctcgagtaaa ccatggctca tgctagggtt ttgcttttgg ctcttgctgt tcttgctact 60
gctgctgttg ctgtggcttc ttcttcatct ttcgctgatt ctaacccaat taggccagtg 120
actgatagag ctgcttctac tcttgctcaa ttggtcgaca tgcttggtat tctttctctt 180
ccaaacgcta acaggaactc ttctaagact aacgatctta ctaacattgt gaggaagtct 240
gagacttctt ctggagatga ggagggaaat ggagaaagat gggtggaagt gatttcttgg 300
gagccaaggg ctgttgttta ccacaacttc cttactaatg aggagtgcga gcaccttatt 360
tctcttgcta agccatctat ggtgaagtct actgtggtgg atgagaaaac tggaggatct 420
aaggattcaa gagtgaggac ttcatctggt actttcctta ggaggggaca tgatgaagtt 480
gtggaagtta ttgagaagag gatttctgat ttcactttca ttccagtgga gaacggagaa 540
ggacttcaag ttcttcacta ccaagtggga caaaagtacg agccacacta cgattacttc 600
cttgatgagt tcaacactaa gaacggagga cagaggattg ctactgtgct tatgtacctt 660
tctgatgtgg atgatggagg agagactgtt tttccagctg ctaggggaaa catttctgct 720
gttccttggt ggaacgagct ttctaagtgt ggaaaggagg gactttctgt gcttccaaag 780
aaaagggatg ctcttctttt ctggaacatg aggccagatg cttctcttga tccatcttct 840
cttcatggag gatgcccagt tgttaaggga aacaagtggt catctactaa gtggttccac 900
gtgcacgagt tcaaggtgta atgagctc 928
<210> 17
<211> 302
<212> PRT
<213> Artificial sequence
<220>
<223> Synthetic sequence containing the vacuolar signal sequence of
barley gene for Thiol protease aleurain precursor fused to the
plant Prolyl 4-hydroxylase and flanking regions
<400> 17
Met Ala His Ala Arg Val Leu Leu Leu Ala Leu Ala Val Leu Ala Thr
1 5 10 15
Ala Ala Val Ala Val Ala Ser Ser Ser Ser Phe Ala Asp Ser Asn Pro
20 25 30
Ile Arg Pro Val Thr Asp Arg Ala Ala Ser Thr Leu Ala Gln Leu Val
35 40 45
Asp Met Leu Gly Ile Leu Ser Leu Pro Asn Ala Asn Arg Asn Ser Ser
50 55 60
Lys Thr Asn Asp Leu Thr Asn Ile Val Arg Lys Ser Glu Thr Ser Ser
65 70 75 80
Gly Asp Glu Glu Gly Asn Gly Glu Arg Trp Val Glu Val Ile Ser Trp
85 90 95
Glu Pro Arg Ala Val Val Tyr His Asn Phe Leu Thr Asn Glu Glu Cys
100 105 110
Glu His Leu Ile Ser Leu Ala Lys Pro Ser Met Val Lys Ser Thr Val
115 120 125
Val Asp Glu Lys Thr Gly Gly Ser Lys Asp Ser Arg Val Arg Thr Ser
130 135 140
Ser Gly Thr Phe Leu Arg Arg Gly His Asp Glu Val Val Glu Val Ile
145 150 155 160
Glu Lys Arg Ile Ser Asp Phe Thr Phe Ile Pro Val Glu Asn Gly Glu
165 170 175
Gly Leu Gln Val Leu His Tyr Gln Val Gly Gln Lys Tyr Glu Pro His
180 185 190
Tyr Asp Tyr Phe Leu Asp Glu Phe Asn Thr Lys Asn Gly Gly Gln Arg
195 200 205
Ile Ala Thr Val Leu Met Tyr Leu Ser Asp Val Asp Asp Gly Gly Glu
210 215 220
Thr Val Phe Pro Ala Ala Arg Gly Asn Ile Ser Ala Val Pro Trp Trp
225 230 235 240
Asn Glu Leu Ser Lys Cys Gly Lys Glu Gly Leu Ser Val Leu Pro Lys
245 250 255
Lys Arg Asp Ala Leu Leu Phe Trp Asn Met Arg Pro Asp Ala Ser Leu
260 265 270
Asp Pro Ser Ser Leu His Gly Gly Cys Pro Val Val Lys Gly Asn Lys
275 280 285
Trp Ser Ser Thr Lys Trp Phe His Val His Glu Phe Lys Val
290 295 300
<210> 18
<211> 2689
<212> DNA
<213> Artificial sequence
<220>
<223> Synthetic sequence containing the coding regions of the human
Procollagen C-proteinase and flanking regions
<400> 18
agatctatcg atgcatgcca tggtaccgcg ccatggctca attggctgca acatcaaggc 60
ctgaaagagt ttggccagat ggtgttattc ctttcgttat tggtggaaac tttactggat 120
ctcagagagc agtttttaga caagctatga gacattggga aaagcacact tgtgtgacat 180
tccttgaaag gactgatgaa gattcttata ttgtgttcac ataccgtcca tgtggatgct 240
gctcatatgt tggtagaagg ggaggaggtc cacaagcaat ttctattgga aaaaactgcg 300
ataagttcgg aattgtggtg catgaattgg gacatgttgt tggtttctgg cacgaacaca 360
caaggccaga tagggatagg cacgtgtcta ttgtgaggga aaacattcag ccaggtcaag 420
agtacaattt tcttaagatg gaacctcaag aggtggaatc tctcggagag acttacgact 480
tcgactccat catgcactac gcaaggaata ctttcagcag gggcatcttc ttggatacca 540
ttgtgcctaa gtacgaggtg aacggcgtta agccacctat tggtcaaagg actaggctct 600
ctaagggtga tattgcacag gctaggaagc tctacaaatg tccagcatgc ggagaaactc 660
ttcaggattc cactggcaac ttctcatctc cagagtaccc aaacggatac tctgctcata 720
tgcactgtgt ttggaggatc tcagtgactc ctggagagaa gatcatcctc aacttcactt 780
ccctcgatct ctatcgttct aggctctgtt ggtacgacta tgtggaagtg agagatggct 840
tctggagaaa ggctccactt agaggaaggt tctgcggatc taaacttcct gagccaatcg 900
tgtctactga ttccagattg tgggtggagt tcaggtcctc ttctaattgg gttggcaagg 960
gcttttttgc tgtgtacgag gctatttgtg gcggcgacgt gaaaaaggac tacggacata 1020
ttcaaagtcc aaattaccca gatgattacc gtccttcaaa agtgtgtatt tggaggattc 1080
aagtgagtga gggtttccat gttggattga cattccaatc tttcgaaatt gagagacacg 1140
attcatgcgc atacgattat ttggaagtga gagatggaca ctctgaatct tctacactta 1200
ttggaaggta ctgcggttat gagaaacctg atgatattaa gtctacttct agtaggttgt 1260
ggcttaaatt tgtgtcagat ggttctatta acaaggctgg tttcgcagtg aacttcttca 1320
aggaagtgga tgaatgctca agacctaaca gaggaggatg tgagcaaaga tgccttaaca 1380
ctttgggaag ttacaagtgt tcttgcgatc ctggatacga gttggctcct gataagagaa 1440
gatgcgaagc tgcttgcggt ggttttttga caaaattgaa cggatctatt acttctcctg 1500
gatggccaaa agagtaccca cctaataaga attgcatttg gcagcttgtt gcacctactc 1560
agtaccgtat ttcattgcaa ttcgattttt tcgagactga gggtaatgat gtgtgcaagt 1620
acgatttcgt ggaagtgaga tcaggtctta ctgctgatag taaattgcac ggaaagttct 1680
gcggatctga aaaaccagaa gtgattacat cacagtacaa caatatgagg gtggagttca 1740
aatctgataa tactgtttct aaaaaaggtt ttaaggcaca tttcttttct gataaggacg 1800
agtgctctaa agataatggt ggttgccagc aggattgcgt gaacacattc ggttcatatg 1860
agtgccaatg ccgtagtgga tttgttcttc acgataacaa acatgattgc aaagaggcag 1920
gttgcgatca caaggtgaca tctacttcag gtactatcac atctccaaac tggcctgata 1980
agtatccttc aaaaaaagaa tgtacatggg caatttcttc tacaccaggt catagggtta 2040
agttgacatt catggagatg gatattgaga gtcaaccaga gtgcgcttat gatcatcttg 2100
aggtgttcga tggaagggat gctaaggctc ctgttcttgg tagattctgt ggtagtaaaa 2160
agccagaacc agtgcttgca acaggatcta ggatgttcct tagattctac tctgataact 2220
cagttcagag gaaaggattc caagctagtc acgcaactga atgcggtgga caagttagag 2280
cagatgttaa gactaaggat ctttactcac acgcacagtt cggagataac aactaccctg 2340
gaggagttga ttgcgagtgg gttattgtgg ctgaagaggg atacggagtt gagcttgttt 2400
tccagacatt cgaggtggag gaggaaactg attgcggtta cgattatatg gaactttttg 2460
atggatacga tagtactgct ccaagacttg gaaggtattg tggtagtggt ccaccagaag 2520
aggtgtactc agctggagat agtgttcttg ttaagttcca cagtgatgat acaattacta 2580
agaagggatt ccatcttaga tatacttcaa ctaagtttca ggatactctt cattctagga 2640
agtaatgagc tcgcggccgc atccaagctt ctgcagacgc gtcgacgtc 2689
<210> 19
<211> 870
<212> PRT
<213> Artificial sequence
<220>
<223> Synthetic sequence containing the human Procollagen C-proteinase
and flanking regions
<400> 19
Met Ala Gln Leu Ala Ala Thr Ser Arg Pro Glu Arg Val Trp Pro Asp
1 5 10 15
Gly Val Ile Pro Phe Val Ile Gly Gly Asn Phe Thr Gly Ser Gln Arg
20 25 30
Ala Val Phe Arg Gln Ala Met Arg His Trp Glu Lys His Thr Cys Val
35 40 45
Thr Phe Leu Glu Arg Thr Asp Glu Asp Ser Tyr Ile Val Phe Thr Tyr
50 55 60
Arg Pro Cys Gly Cys Cys Ser Tyr Val Gly Arg Arg Gly Gly Gly Pro
65 70 75 80
Gln Ala Ile Ser Ile Gly Lys Asn Cys Asp Lys Phe Gly Ile Val Val
85 90 95
His Glu Leu Gly His Val Val Gly Phe Trp His Glu His Thr Arg Pro
100 105 110
Asp Arg Asp Arg His Val Ser Ile Val Arg Glu Asn Ile Gln Pro Gly
115 120 125
Gln Glu Tyr Asn Phe Leu Lys Met Glu Pro Gln Glu Val Glu Ser Leu
130 135 140
Gly Glu Thr Tyr Asp Phe Asp Ser Ile Met His Tyr Ala Arg Asn Thr
145 150 155 160
Phe Ser Arg Gly Ile Phe Leu Asp Thr Ile Val Pro Lys Tyr Glu Val
165 170 175
Asn Gly Val Lys Pro Pro Ile Gly Gln Arg Thr Arg Leu Ser Lys Gly
180 185 190
Asp Ile Ala Gln Ala Arg Lys Leu Tyr Lys Cys Pro Ala Cys Gly Glu
195 200 205
Thr Leu Gln Asp Ser Thr Gly Asn Phe Ser Ser Pro Glu Tyr Pro Asn
210 215 220
Gly Tyr Ser Ala His Met His Cys Val Trp Arg Ile Ser Val Thr Pro
225 230 235 240
Gly Glu Lys Ile Ile Leu Asn Phe Thr Ser Leu Asp Leu Tyr Arg Ser
245 250 255
Arg Leu Cys Trp Tyr Asp Tyr Val Glu Val Arg Asp Gly Phe Trp Arg
260 265 270
Lys Ala Pro Leu Arg Gly Arg Phe Cys Gly Ser Lys Leu Pro Glu Pro
275 280 285
Ile Val Ser Thr Asp Ser Arg Leu Trp Val Glu Phe Arg Ser Ser Ser
290 295 300
Asn Trp Val Gly Lys Gly Phe Phe Ala Val Tyr Glu Ala Ile Cys Gly
305 310 315 320
Gly Asp Val Lys Lys Asp Tyr Gly His Ile Gln Ser Pro Asn Tyr Pro
325 330 335
Asp Asp Tyr Arg Pro Ser Lys Val Cys Ile Trp Arg Ile Gln Val Ser
340 345 350
Glu Gly Phe His Val Gly Leu Thr Phe Gln Ser Phe Glu Ile Glu Arg
355 360 365
His Asp Ser Cys Ala Tyr Asp Tyr Leu Glu Val Arg Asp Gly His Ser
370 375 380
Glu Ser Ser Thr Leu Ile Gly Arg Tyr Cys Gly Tyr Glu Lys Pro Asp
385 390 395 400
Asp Ile Lys Ser Thr Ser Ser Arg Leu Trp Leu Lys Phe Val Ser Asp
405 410 415
Gly Ser Ile Asn Lys Ala Gly Phe Ala Val Asn Phe Phe Lys Glu Val
420 425 430
Asp Glu Cys Ser Arg Pro Asn Arg Gly Gly Cys Glu Gln Arg Cys Leu
435 440 445
Asn Thr Leu Gly Ser Tyr Lys Cys Ser Cys Asp Pro Gly Tyr Glu Leu
450 455 460
Ala Pro Asp Lys Arg Arg Cys Glu Ala Ala Cys Gly Gly Phe Leu Thr
465 470 475 480
Lys Leu Asn Gly Ser Ile Thr Ser Pro Gly Trp Pro Lys Glu Tyr Pro
485 490 495
Pro Asn Lys Asn Cys Ile Trp Gln Leu Val Ala Pro Thr Gln Tyr Arg
500 505 510
Ile Ser Leu Gln Phe Asp Phe Phe Glu Thr Glu Gly Asn Asp Val Cys
515 520 525
Lys Tyr Asp Phe Val Glu Val Arg Ser Gly Leu Thr Ala Asp Ser Lys
530 535 540
Leu His Gly Lys Phe Cys Gly Ser Glu Lys Pro Glu Val Ile Thr Ser
545 550 555 560
Gln Tyr Asn Asn Met Arg Val Glu Phe Lys Ser Asp Asn Thr Val Ser
565 570 575
Lys Lys Gly Phe Lys Ala His Phe Phe Ser Asp Lys Asp Glu Cys Ser
580 585 590
Lys Asp Asn Gly Gly Cys Gln Gln Asp Cys Val Asn Thr Phe Gly Ser
595 600 605
Tyr Glu Cys Gln Cys Arg Ser Gly Phe Val Leu His Asp Asn Lys His
610 615 620
Asp Cys Lys Glu Ala Gly Cys Asp His Lys Val Thr Ser Thr Ser Gly
625 630 635 640
Thr Ile Thr Ser Pro Asn Trp Pro Asp Lys Tyr Pro Ser Lys Lys Glu
645 650 655
Cys Thr Trp Ala Ile Ser Ser Thr Pro Gly His Arg Val Lys Leu Thr
660 665 670
Phe Met Glu Met Asp Ile Glu Ser Gln Pro Glu Cys Ala Tyr Asp His
675 680 685
Leu Glu Val Phe Asp Gly Arg Asp Ala Lys Ala Pro Val Leu Gly Arg
690 695 700
Phe Cys Gly Ser Lys Lys Pro Glu Pro Val Leu Ala Thr Gly Ser Arg
705 710 715 720
Met Phe Leu Arg Phe Tyr Ser Asp Asn Ser Val Gln Arg Lys Gly Phe
725 730 735
Gln Ala Ser His Ala Thr Glu Cys Gly Gly Gln Val Arg Ala Asp Val
740 745 750
Lys Thr Lys Asp Leu Tyr Ser His Ala Gln Phe Gly Asp Asn Asn Tyr
755 760 765
Pro Gly Gly Val Asp Cys Glu Trp Val Ile Val Ala Glu Glu Gly Tyr
770 775 780
Gly Val Glu Leu Val Phe Gln Thr Phe Glu Val Glu Glu Glu Thr Asp
785 790 795 800
Cys Gly Tyr Asp Tyr Met Glu Leu Phe Asp Gly Tyr Asp Ser Thr Ala
805 810 815
Pro Arg Leu Gly Arg Tyr Cys Gly Ser Gly Pro Pro Glu Glu Val Tyr
820 825 830
Ser Ala Gly Asp Ser Val Leu Val Lys Phe His Ser Asp Asp Thr Ile
835 840 845
Thr Lys Lys Gly Phe His Leu Arg Tyr Thr Ser Thr Lys Phe Gln Asp
850 855 860
Thr Leu His Ser Arg Lys
865 870
<210> 20
<211> 2912
<212> DNA
<213> Artificial sequence
<220>
<223> Synthetic sequence containing the coding regions of the human
Procollagen I N-proteinase and flanking regions
<400> 20
gcgccatggc tcaattgagg agaagggcta ggagacacgc agctgatgat gattacaaca 60
ttgaagtttt gcttggtgtt gatgatagtg tggtgcaatt ccacggaaaa gagcatgttc 120
agaaatatct tttgacactt atgaatattg tgaacgaaat ctaccatgat gagtctttgg 180
gagcacacat taacgtggtt cttgtgagga ttattcttct ttcatacggt aaatctatgt 240
cacttattga gattggaaac ccttctcagt ctcttgagaa tgtgtgcaga tgggcatacc 300
ttcaacagaa gcctgatact ggacacgatg agtatcacga tcacgctatt ttccttacaa 360
ggcaggattt cggtccaagt ggaatgcaag gatatgctcc tgttactggt atgtgccacc 420
ctgttaggtc ttgtacactt aaccacgagg atggtttttc atctgctttc gtggtggctc 480
atgagacagg tcatgttttg ggaatggaac atgatggaca gggtaataga tgtggagatg 540
aagtgagact tggttcaatt atggctcctc ttgttcaagc tgcttttcat aggttccact 600
ggagtaggtg ttcacagcaa gagttgagta gataccttca ttcttacgat tgcttgcttg 660
atgatccatt tgctcatgat tggccagctt tgcctcaact tcctggattg cactactcta 720
tgaacgagca gtgcagattt gatttcggtc ttggttacat gatgtgcaca gctttcagga 780
ctttcgatcc atgcaaacag ttgtggtgtt cacacccaga taacccatat ttctgtaaaa 840
caaaaaaagg tccaccactt gatggtacta tgtgcgcacc tggaaagcac tgcttcaagg 900
gacactgcat ttggcttact cctgatattc ttaaaaggga tggatcatgg ggagcttggt 960
ctccattcgg aagttgctca agaacttgcg gaacaggtgt taagtttaga actaggcagt 1020
gcgataatcc acaccctgct aatggtggta gaacttgctc tggacttgct tacgattttc 1080
agttgtgttc taggcaagat tgccctgata gtcttgctga ttttagagaa gagcaatgta 1140
gacagtggga tctttacttt gagcacggcg acgctcagca ccactggctt ccacacgagc 1200
atagagatgc aaaagaaagg tgtcaccttt attgcgagag tagagagact ggagaggtgg 1260
tgtcaatgaa gagaatggtg cacgatggta caaggtgttc ttataaggat gcattctctt 1320
tgtgtgtgag gggagattgc aggaaagtgg gttgtgatgg agtgattgga tctagtaagc 1380
aagaagataa gtgcggagtg tgcggaggag ataactctca ttgcaaggtt gtgaaaggaa 1440
cttttacaag atcaccaaaa aaacacggtt acattaagat gttcgaaatt cctgctggag 1500
caaggcattt gcttattcag gaagtggatg caacatctca ccacttggca gtgaaaaacc 1560
ttgagactgg aaaattcatt ttgaacgagg agaacgatgt tgatgcatct agtaagactt 1620
tcattgcaat gggtgttgaa tgggagtata gggatgagga tggaagggaa acacttcaaa 1680
caatgggtcc tcttcatgga acaattactg tgttggtgat tccagtggga gatacaaggg 1740
tgtcattgac atacaagtat atgattcacg aggatagtct taacgttgat gataacaacg 1800
ttttggaaga agattctgtg gtttacgagt gggctcttaa gaaatggtca ccttgctcta 1860
agccatgtgg tggaggaagt cagttcacta agtatggttg taggaggagg cttgatcata 1920
agatggttca taggggattt tgcgcagcac ttagtaagcc aaaggcaatt aggagggctt 1980
gtaaccctca agaatgctca caaccagttt gggtgacagg agagtgggag ccatgttcac 2040
aaacatgcgg aagaactgga atgcaagtta gatcagttag atgcattcaa cctcttcatg 2100
ataacactac aagaagtgtg cacgcaaaac actgtaacga tgctaggcca gagagtagaa 2160
gagcttgctc tagggaactt tgccctggta gatggagggc aggaccttgg agtcagtgct 2220
ctgtgacatg tggaaacggt actcaggaaa gacctgttcc atgtagaact gctgatgata 2280
gtttcggaat ttgtcaggag gaaaggccag aaacagctag gacttgtaga cttggacctt 2340
gtcctaggaa tatttctgat cctagtaaaa aatcatacgt ggtgcaatgg ttgagtaggc 2400
cagatccaga ttcaccaatt aggaagattt cttcaaaagg acactgccag ggtgataaga 2460
gtattttctg cagaatggaa gttcttagta ggtactgttc tattccaggt tataacaaac 2520
tttcttgtaa gagttgcaac ttgtataaca atcttactaa cgtggagggt agaattgaac 2580
ctccaccagg aaagcacaac gatattgatg tgtttatgcc tactcttcct gtgccaacag 2640
ttgcaatgga agttagacct tctccatcta ctccacttga ggtgccactt aatgcatcaa 2700
gtactaacgc tactgaggat cacccagaga ctaacgcagt tgatgagcct tataagattc 2760
acggacttga ggatgaggtt cagccaccaa accttattcc taggaggcca agtccttacg 2820
aaaaaactag aaatcagagg attcaggagc ttattgatga gatgaggaaa aaggagatgc 2880
ttggaaagtt ctaatgagct cgcggccgca tc 2912
<210> 21
<211> 962
<212> PRT
<213> Artificial sequence
<220>
<223> Synthetic sequence containing the human Procollagen I
N-proteinase and flanking regions
<400> 21
Met Ala Gln Leu Arg Arg Arg Ala Arg Arg His Ala Ala Asp Asp Asp
1 5 10 15
Tyr Asn Ile Glu Val Leu Leu Gly Val Asp Asp Ser Val Val Gln Phe
20 25 30
His Gly Lys Glu His Val Gln Lys Tyr Leu Leu Thr Leu Met Asn Ile
35 40 45
Val Asn Glu Ile Tyr His Asp Glu Ser Leu Gly Ala His Ile Asn Val
50 55 60
Val Leu Val Arg Ile Ile Leu Leu Ser Tyr Gly Lys Ser Met Ser Leu
65 70 75 80
Ile Glu Ile Gly Asn Pro Ser Gln Ser Leu Glu Asn Val Cys Arg Trp
85 90 95
Ala Tyr Leu Gln Gln Lys Pro Asp Thr Gly His Asp Glu Tyr His Asp
100 105 110
His Ala Ile Phe Leu Thr Arg Gln Asp Phe Gly Pro Ser Gly Met Gln
115 120 125
Gly Tyr Ala Pro Val Thr Gly Met Cys His Pro Val Arg Ser Cys Thr
130 135 140
Leu Asn His Glu Asp Gly Phe Ser Ser Ala Phe Val Val Ala His Glu
145 150 155 160
Thr Gly His Val Leu Gly Met Glu His Asp Gly Gln Gly Asn Arg Cys
165 170 175
Gly Asp Glu Val Arg Leu Gly Ser Ile Met Ala Pro Leu Val Gln Ala
180 185 190
Ala Phe His Arg Phe His Trp Ser Arg Cys Ser Gln Gln Glu Leu Ser
195 200 205
Arg Tyr Leu His Ser Tyr Asp Cys Leu Leu Asp Asp Pro Phe Ala His
210 215 220
Asp Trp Pro Ala Leu Pro Gln Leu Pro Gly Leu His Tyr Ser Met Asn
225 230 235 240
Glu Gln Cys Arg Phe Asp Phe Gly Leu Gly Tyr Met Met Cys Thr Ala
245 250 255
Phe Arg Thr Phe Asp Pro Cys Lys Gln Leu Trp Cys Ser His Pro Asp
260 265 270
Asn Pro Tyr Phe Cys Lys Thr Lys Lys Gly Pro Pro Leu Asp Gly Thr
275 280 285
Met Cys Ala Pro Gly Lys His Cys Phe Lys Gly His Cys Ile Trp Leu
290 295 300
Thr Pro Asp Ile Leu Lys Arg Asp Gly Ser Trp Gly Ala Trp Ser Pro
305 310 315 320
Phe Gly Ser Cys Ser Arg Thr Cys Gly Thr Gly Val Lys Phe Arg Thr
325 330 335
Arg Gln Cys Asp Asn Pro His Pro Ala Asn Gly Gly Arg Thr Cys Ser
340 345 350
Gly Leu Ala Tyr Asp Phe Gln Leu Cys Ser Arg Gln Asp Cys Pro Asp
355 360 365
Ser Leu Ala Asp Phe Arg Glu Glu Gln Cys Arg Gln Trp Asp Leu Tyr
370 375 380
Phe Glu His Gly Asp Ala Gln His His Trp Leu Pro His Glu His Arg
385 390 395 400
Asp Ala Lys Glu Arg Cys His Leu Tyr Cys Glu Ser Arg Glu Thr Gly
405 410 415
Glu Val Val Ser Met Lys Arg Met Val His Asp Gly Thr Arg Cys Ser
420 425 430
Tyr Lys Asp Ala Phe Ser Leu Cys Val Arg Gly Asp Cys Arg Lys Val
435 440 445
Gly Cys Asp Gly Val Ile Gly Ser Ser Lys Gln Glu Asp Lys Cys Gly
450 455 460
Val Cys Gly Gly Asp Asn Ser His Cys Lys Val Val Lys Gly Thr Phe
465 470 475 480
Thr Arg Ser Pro Lys Lys His Gly Tyr Ile Lys Met Phe Glu Ile Pro
485 490 495
Ala Gly Ala Arg His Leu Leu Ile Gln Glu Val Asp Ala Thr Ser His
500 505 510
His Leu Ala Val Lys Asn Leu Glu Thr Gly Lys Phe Ile Leu Asn Glu
515 520 525
Glu Asn Asp Val Asp Ala Ser Ser Lys Thr Phe Ile Ala Met Gly Val
530 535 540
Glu Trp Glu Tyr Arg Asp Glu Asp Gly Arg Glu Thr Leu Gln Thr Met
545 550 555 560
Gly Pro Leu His Gly Thr Ile Thr Val Leu Val Ile Pro Val Gly Asp
565 570 575
Thr Arg Val Ser Leu Thr Tyr Lys Tyr Met Ile His Glu Asp Ser Leu
580 585 590
Asn Val Asp Asp Asn Asn Val Leu Glu Glu Asp Ser Val Val Tyr Glu
595 600 605
Trp Ala Leu Lys Lys Trp Ser Pro Cys Ser Lys Pro Cys Gly Gly Gly
610 615 620
Ser Gln Phe Thr Lys Tyr Gly Cys Arg Arg Arg Leu Asp His Lys Met
625 630 635 640
Val His Arg Gly Phe Cys Ala Ala Leu Ser Lys Pro Lys Ala Ile Arg
645 650 655
Arg Ala Cys Asn Pro Gln Glu Cys Ser Gln Pro Val Trp Val Thr Gly
660 665 670
Glu Trp Glu Pro Cys Ser Gln Thr Cys Gly Arg Thr Gly Met Gln Val
675 680 685
Arg Ser Val Arg Cys Ile Gln Pro Leu His Asp Asn Thr Thr Arg Ser
690 695 700
Val His Ala Lys His Cys Asn Asp Ala Arg Pro Glu Ser Arg Arg Ala
705 710 715 720
Cys Ser Arg Glu Leu Cys Pro Gly Arg Trp Arg Ala Gly Pro Trp Ser
725 730 735
Gln Cys Ser Val Thr Cys Gly Asn Gly Thr Gln Glu Arg Pro Val Pro
740 745 750
Cys Arg Thr Ala Asp Asp Ser Phe Gly Ile Cys Gln Glu Glu Arg Pro
755 760 765
Glu Thr Ala Arg Thr Cys Arg Leu Gly Pro Cys Pro Arg Asn Ile Ser
770 775 780
Asp Pro Ser Lys Lys Ser Tyr Val Val Gln Trp Leu Ser Arg Pro Asp
785 790 795 800
Pro Asp Ser Pro Ile Arg Lys Ile Ser Ser Lys Gly His Cys Gln Gly
805 810 815
Asp Lys Ser Ile Phe Cys Arg Met Glu Val Leu Ser Arg Tyr Cys Ser
820 825 830
Ile Pro Gly Tyr Asn Lys Leu Ser Cys Lys Ser Cys Asn Leu Tyr Asn
835 840 845
Asn Leu Thr Asn Val Glu Gly Arg Ile Glu Pro Pro Pro Gly Lys His
850 855 860
Asn Asp Ile Asp Val Phe Met Pro Thr Leu Pro Val Pro Thr Val Ala
865 870 875 880
Met Glu Val Arg Pro Ser Pro Ser Thr Pro Leu Glu Val Pro Leu Asn
885 890 895
Ala Ser Ser Thr Asn Ala Thr Glu Asp His Pro Glu Thr Asn Ala Val
900 905 910
Asp Glu Pro Tyr Lys Ile His Gly Leu Glu Asp Glu Val Gln Pro Pro
915 920 925
Asn Leu Ile Pro Arg Arg Pro Ser Pro Tyr Glu Lys Thr Arg Asn Gln
930 935 940
Arg Ile Gln Glu Leu Ile Asp Glu Met Arg Lys Lys Glu Met Leu Gly
945 950 955 960
Lys Phe
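As a consistency check on the listing, the open reading frame at the start of SEQ ID NO: 20 can be translated and compared with the start of SEQ ID NO: 21. The sketch below is illustrative and not part of the filing: it assumes the standard genetic code, uses only the first 60 nucleotides of SEQ ID NO: 20 as printed above, and the helper name `translate_from_first_atg` is hypothetical.

```python
from itertools import product

# Standard genetic code, with codons enumerated in t, c, a, g order.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_from_first_atg(dna: str) -> str:
    """Translate from the first 'atg' until a stop codon or the end of the input."""
    dna = dna.lower().replace(" ", "")
    start = dna.find("atg")
    if start < 0:
        return ""
    protein = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nucleotides of SEQ ID NO: 20, copied from the listing.
seq20_start = "gcgccatggc tcaattgagg agaagggcta ggagacacgc agctgatgat gattacaaca"
# First 18 residues of SEQ ID NO: 21 (Met Ala Gln Leu Arg Arg ...) in one-letter code.
seq21_start = "MAQLRRRARRHAADDDYN"

print(translate_from_first_atg(seq20_start) == seq21_start)
```

The leading `gcgcc` before the first `atg` belongs to the flanking region named in the `<223>` description, which is why translation starts at the first start codon rather than at position 1.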
<210> 22
<211> 2888
<212> DNA
<213> Artificial sequence
<220>
<223> Synthetic sequence containing the coding regions of the vacuolar
signal sequence of barley gene for Thiol protease aleurain
precursor fused to the human Lysyl hydroxylase 3 and flanking
regions
<400> 22
gcgaattcgc tagctatcac tgaaaagaca gcaagacaat ggtgtctcga tgcaccagaa 60
ccacatcttt gcagcagatg tgaagcagcc agagtggtcc acaagacgca ctcagaaaag 120
gcatcttcta ccgacacaga aaaagacaac cacagctcat catccaacat gtagactgtc 180
gttatgcgtc ggctgaagat aagactgacc ccaggccagc actaaagaag aaataatgca 240
agtggtccta gctccacttt agctttaata attatgtttc attattattc tctgcttttg 300
ctctctatat aaagagcttg tattttcatt tgaaggcaga ggcgaacaca cacacagaac 360
ctccctgctt acaaaccaga tcttaaacca tggctcacgc tagggttttg cttcttgctc 420
ttgctgttct tgctactgct gctgttgctg tggcttcttc aagttctttc gctgattcta 480
acccaattag gccagtgact gatagagctg cttctactct tgctcaattg agatctatgt 540
ctgatagacc aaggggaagg gatccagtta atccagagaa gttgcttgtg attactgtgg 600
ctactgctga gactgaagga taccttagat tccttaggag tgctgagttc ttcaactaca 660
ctgtgaggac tcttggactt ggagaagaat ggaggggagg agatgttgct agaactgttg 720
gaggaggaca gaaagtgaga tggcttaaga aagagatgga gaagtacgct gatagggagg 780
atatgattat tatgttcgtg gattcttacg atgtgattct tgctggatct ccaactgagc 840
ttttgaagaa attcgttcag tctggatcta ggcttctttt ctctgctgag tctttttgtt 900
ggccagaatg gggacttgct gagcaatatc cagaagtggg aactggaaag agattcctta 960
actctggagg attcattgga ttcgctacta ctattcacca gattgtgagg cagtggaagt 1020
acaaggatga cgatgatgat cagcttttct acactaggct ttaccttgat ccaggactta 1080
gggagaagtt gtctcttaac cttgatcaca agtctaggat tttccagaac cttaacggtg 1140
ctcttgatga ggttgtgctt aagttcgata ggaacagagt gaggattagg aacgtggctt 1200
acgatactct tcctattgtg gtgcatggaa acggaccaac aaaactccag cttaactacc 1260
ttggaaacta cgttccaaac ggatggactc cagaaggagg atgtggattc tgcaatcagg 1320
ataggagaac tcttccagga ggacaaccac caccaagagt tttccttgct gtgttcgttg 1380
aacagccaac tccattcctt ccaagattcc ttcagaggct tcttcttttg gattacccac 1440
cagatagggt gacacttttc cttcacaaca acgaggtttt ccacgagcca cacattgctg 1500
attcttggcc acagcttcag gatcatttct ctgctgtgaa gttggttggt ccagaagaag 1560
ctctttctcc aggagaagct agggatatgg ctatggattt gtgcaggcag gatccagagt 1620
gcgagttcta cttctctctt gatgctgatg ctgtgcttac taaccttcag actcttagga 1680
ttcttattga ggagaacagg aaagtgattg ctccaatgct ttctaggcac ggaaagttgt 1740
ggtctaattt ctggggtgct ctttctcctg atgagtacta cgctagatca gaggactacg 1800
tggagcttgt tcagagaaag agagtgggag tttggaacgt tccttatatt tctcaggctt 1860
acgtgattag gggagatact cttaggatgg agcttccaca gagggatgtt ttctctggat 1920
ctgatactga tccagatatg gctttctgca agtctttcag ggataaggga attttccttc 1980
acctttctaa ccagcatgag ttcggaagat tgcttgctac ttcaagatac gatactgagc 2040
accttcatcc tgatctttgg cagattttcg ataacccagt ggattggaag gagcagtaca 2100
ttcacgagaa ctactctagg gctcttgaag gagaaggaat tgtggagcaa ccatgcccag 2160
atgtttactg gttcccactt ctttctgagc aaatgtgcga tgagcttgtt gctgagatgg 2220
agcattacgg acaatggagt ggaggtagac atgaggattc taggcttgct ggaggatacg 2280
agaacgttcc aactgtggat attcacatga agcaagtggg atacgaggat caatggcttc 2340
agcttcttag gacttatgtg ggaccaatga ctgagtctct tttcccagga taccacacta 2400
aggctagggc tgttatgaac ttcgttgtga ggtatcgtcc agatgagcaa ccatctctta 2460
ggccacacca cgattcttct actttcactc ttaacgtggc tcttaaccac aagggacttg 2520
attatgaggg aggaggatgc cgtttcctta gatacgattg cgtgatttct tcaccaagaa 2580
agggatgggc tcttcttcat ccaggaaggc ttactcatta ccacgaggga cttccaacta 2640
cttggggaac tagatatatt atggtgtctt tcgtggatcc atgactgctt taatgagata 2700
tgcgagacgc ctatgatcgc atgatatttg ctttcaattc tgttgtgcac gttgtaaaaa 2760
acctgagcat gtgtagctca gatccttacc gccggtttcg gttcattcta atgaatatat 2820
cacccgttac tatcgtattt ttatgaataa tattctccgt tcaatttact gattgtccag 2880
aattcgcg 2888
<210> 23
<211> 764
<212> PRT
<213> Artificial sequence
<220>
<223> Synthetic sequence containing the vacuolar signal sequence of
barley gene for Thiol protease aleurain precursor fused to the
human Lysyl hydroxylase 3 and flanking regions
<400> 23
Met Ala His Ala Arg Val Leu Leu Leu Ala Leu Ala Val Leu Ala Thr
1 5 10 15
Ala Ala Val Ala Val Ala Ser Ser Ser Ser Phe Ala Asp Ser Asn Pro
20 25 30
Ile Arg Pro Val Thr Asp Arg Ala Ala Ser Thr Leu Ala Gln Leu Arg
35 40 45
Ser Met Ser Asp Arg Pro Arg Gly Arg Asp Pro Val Asn Pro Glu Lys
50 55 60
Leu Leu Val Ile Thr Val Ala Thr Ala Glu Thr Glu Gly Tyr Leu Arg
65 70 75 80
Phe Leu Arg Ser Ala Glu Phe Phe Asn Tyr Thr Val Arg Thr Leu Gly
85 90 95
Leu Gly Glu Glu Trp Arg Gly Gly Asp Val Ala Arg Thr Val Gly Gly
100 105 110
Gly Gln Lys Val Arg Trp Leu Lys Lys Glu Met Glu Lys Tyr Ala Asp
115 120 125
Arg Glu Asp Met Ile Ile Met Phe Val Asp Ser Tyr Asp Val Ile Leu
130 135 140
Ala Gly Ser Pro Thr Glu Leu Leu Lys Lys Phe Val Gln Ser Gly Ser
145 150 155 160
Arg Leu Leu Phe Ser Ala Glu Ser Phe Cys Trp Pro Glu Trp Gly Leu
165 170 175
Ala Glu Gln Tyr Pro Glu Val Gly Thr Gly Lys Arg Phe Leu Asn Ser
180 185 190
Gly Gly Phe Ile Gly Phe Ala Thr Thr Ile His Gln Ile Val Arg Gln
195 200 205
Trp Lys Tyr Lys Asp Asp Asp Asp Asp Gln Leu Phe Tyr Thr Arg Leu
210 215 220
Tyr Leu Asp Pro Gly Leu Arg Glu Lys Leu Ser Leu Asn Leu Asp His
225 230 235 240
Lys Ser Arg Ile Phe Gln Asn Leu Asn Gly Ala Leu Asp Glu Val Val
245 250 255
Leu Lys Phe Asp Arg Asn Arg Val Arg Ile Arg Asn Val Ala Tyr Asp
260 265 270
Thr Leu Pro Ile Val Val His Gly Asn Gly Pro Thr Lys Leu Gln Leu
275 280 285
Asn Tyr Leu Gly Asn Tyr Val Pro Asn Gly Trp Thr Pro Glu Gly Gly
290 295 300
Cys Gly Phe Cys Asn Gln Asp Arg Arg Thr Leu Pro Gly Gly Gln Pro
305 310 315 320
Pro Pro Arg Val Phe Leu Ala Val Phe Val Glu Gln Pro Thr Pro Phe
325 330 335
Leu Pro Arg Phe Leu Gln Arg Leu Leu Leu Leu Asp Tyr Pro Pro Asp
340 345 350
Arg Val Thr Leu Phe Leu His Asn Asn Glu Val Phe His Glu Pro His
355 360 365
Ile Ala Asp Ser Trp Pro Gln Leu Gln Asp His Phe Ser Ala Val Lys
370 375 380
Leu Val Gly Pro Glu Glu Ala Leu Ser Pro Gly Glu Ala Arg Asp Met
385 390 395 400
Ala Met Asp Leu Cys Arg Gln Asp Pro Glu Cys Glu Phe Tyr Phe Ser
405 410 415
Leu Asp Ala Asp Ala Val Leu Thr Asn Leu Gln Thr Leu Arg Ile Leu
420 425 430
Ile Glu Glu Asn Arg Lys Val Ile Ala Pro Met Leu Ser Arg His Gly
435 440 445
Lys Leu Trp Ser Asn Phe Trp Gly Ala Leu Ser Pro Asp Glu Tyr Tyr
450 455 460
Ala Arg Ser Glu Asp Tyr Val Glu Leu Val Gln Arg Lys Arg Val Gly
465 470 475 480
Val Trp Asn Val Pro Tyr Ile Ser Gln Ala Tyr Val Ile Arg Gly Asp
485 490 495
Thr Leu Arg Met Glu Leu Pro Gln Arg Asp Val Phe Ser Gly Ser Asp
500 505 510
Thr Asp Pro Asp Met Ala Phe Cys Lys Ser Phe Arg Asp Lys Gly Ile
515 520 525
Phe Leu His Leu Ser Asn Gln His Glu Phe Gly Arg Leu Leu Ala Thr
530 535 540
Ser Arg Tyr Asp Thr Glu His Leu His Pro Asp Leu Trp Gln Ile Phe
545 550 555 560
Asp Asn Pro Val Asp Trp Lys Glu Gln Tyr Ile His Glu Asn Tyr Ser
565 570 575
Arg Ala Leu Glu Gly Glu Gly Ile Val Glu Gln Pro Cys Pro Asp Val
580 585 590
Tyr Trp Phe Pro Leu Leu Ser Glu Gln Met Cys Asp Glu Leu Val Ala
595 600 605
Glu Met Glu His Tyr Gly Gln Trp Ser Gly Gly Arg His Glu Asp Ser
610 615 620
Arg Leu Ala Gly Gly Tyr Glu Asn Val Pro Thr Val Asp Ile His Met
625 630 635 640
Lys Gln Val Gly Tyr Glu Asp Gln Trp Leu Gln Leu Leu Arg Thr Tyr
645 650 655
Val Gly Pro Met Thr Glu Ser Leu Phe Pro Gly Tyr His Thr Lys Ala
660 665 670
Arg Ala Val Met Asn Phe Val Val Arg Tyr Arg Pro Asp Glu Gln Pro
675 680 685
Ser Leu Arg Pro His His Asp Ser Ser Thr Phe Thr Leu Asn Val Ala
690 695 700
Leu Asn His Lys Gly Leu Asp Tyr Glu Gly Gly Gly Cys Arg Phe Leu
705 710 715 720
Arg Tyr Asp Cys Val Ile Ser Ser Pro Arg Lys Gly Trp Ala Leu Leu
725 730 735
His Pro Gly Arg Leu Thr His Tyr His Glu Gly Leu Pro Thr Thr Trp
740 745 750
Gly Thr Arg Tyr Ile Met Val Ser Phe Val Asp Pro
755 760
<210> 24
<211> 45
<212> PRT
<213> Artificial sequence
<220>
<223> Vacuole signal sequence of barley gene for Thiol protease
aleurain precursor
<400> 24
Met Ala His Ala Arg Val Leu Leu Leu Ala Leu Ala Val Leu Ala Thr
1 5 10 15
Ala Ala Val Ala Val Ala Ser Ser Ser Ser Phe Ala Asp Ser Asn Pro
20 25 30
Ile Arg Pro Val Thr Asp Arg Ala Ala Ser Thr Leu Ala
35 40 45
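SEQ ID NO: 24 is described as the signal sequence that is fused to LH3 in SEQ ID NO: 23, so it should appear as the N-terminal prefix of that fusion. The following sketch checks this using the residues printed in the listing. It is a minimal illustration: the helper `to_one_letter` is a hypothetical name, and only the first 48 residues of SEQ ID NO: 23 are copied in.

```python
# Three-letter to one-letter amino acid codes (the 20 standard residues).
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(three_letter: str) -> str:
    """Convert whitespace-separated three-letter residues to a one-letter string."""
    return "".join(THREE_TO_ONE[res] for res in three_letter.split())


# SEQ ID NO: 24 in full (45 residues), copied from the listing.
seq24 = to_one_letter("""
Met Ala His Ala Arg Val Leu Leu Leu Ala Leu Ala Val Leu Ala Thr
Ala Ala Val Ala Val Ala Ser Ser Ser Ser Phe Ala Asp Ser Asn Pro
Ile Arg Pro Val Thr Asp Arg Ala Ala Ser Thr Leu Ala
""")

# First 48 residues of SEQ ID NO: 23, copied from the listing.
seq23_start = to_one_letter("""
Met Ala His Ala Arg Val Leu Leu Leu Ala Leu Ala Val Leu Ala Thr
Ala Ala Val Ala Val Ala Ser Ser Ser Ser Phe Ala Asp Ser Asn Pro
Ile Arg Pro Val Thr Asp Arg Ala Ala Ser Thr Leu Ala Gln Leu Arg
""")

print(len(seq24))                      # compare with the declared <211> value
print(seq23_start.startswith(seq24))   # is the signal sequence the N-terminal prefix?
```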
<210> 25
<211> 24
<212> DNA
<213> Artificial sequence
<220>
<223> Single strand DNA oligonucleotide
<400> 25
atcaccagga gaacagggac catc 24
<210> 26
<211> 29
<212> DNA
<213> Artificial sequence
<220>
<223> Single strand DNA oligonucleotide
<400> 26
tccacttcca aatctctatc cctaacaac 29
<210> 27
<211> 23
<212> DNA
<213> Artificial sequence
<220>
<223> Single strand DNA oligonucleotide
<400> 27
aggcattaga ggcgataagg gag 23
<210> 28
<211> 27
<212> DNA
<213> Artificial sequence
<220>
<223> Single strand DNA oligonucleotide
<400> 28
tcaatccaat aatagccact tgaccac 27
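The four oligonucleotides in SEQ ID NOs: 25 to 28 can be sanity-checked against their declared `<211>` lengths. The sketch below also derives each reverse complement and GC fraction. It is an illustrative helper only: the function names are hypothetical, and the listing does not state how the oligonucleotides pair up or which templates they target.

```python
# Complement map for lowercase DNA, as used in the listing.
COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(oligo: str) -> str:
    """Return the reverse complement of a lowercase DNA string."""
    return oligo.translate(COMPLEMENT)[::-1]


def gc_fraction(oligo: str) -> float:
    """Fraction of g and c bases in the oligonucleotide."""
    return (oligo.count("g") + oligo.count("c")) / len(oligo)


# (SEQ ID NO, declared <211> length, sequence as listed with spaces removed)
oligos = [
    (25, 24, "atcaccaggagaacagggaccatc"),
    (26, 29, "tccacttccaaatctctatccctaacaac"),
    (27, 23, "aggcattagaggcgataagggag"),
    (28, 27, "tcaatccaataatagccacttgaccac"),
]

for seq_id, declared, seq in oligos:
    length_ok = len(seq) == declared
    print(seq_id, length_ok, f"{gc_fraction(seq):.2f}", reverse_complement(seq))
```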
Claims (53)
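- A method of producing collagen in a plant or an isolated plant cell, comprising expressing in the plant or the isolated plant cell at least one type of collagen alpha chain and exogenous P4H in a manner enabling accumulation of said at least one type of collagen alpha chain and said exogenous P4H in a subcellular compartment devoid of endogenous P4H activity, thereby producing the collagen in the plant.
- The method of claim 1, further comprising expressing exogenous LH3 in said subcellular compartment devoid of endogenous P4H activity.
- The method of claim 1, wherein said at least one type of collagen alpha chain includes a signal peptide for targeting to an apoplast or a vacuole.
- The method of claim 1, wherein said at least one type of collagen alpha chain is devoid of an ER targeting or retention sequence.
- The method of claim 1, wherein said at least one type of collagen alpha chain is expressed in a DNA-containing organelle of the plant.
- The method of claim 1, wherein said exogenous P4H includes a signal peptide for targeting to an apoplast or a vacuole.
- The method of claim 1, wherein said exogenous P4H is devoid of an ER targeting or retention sequence.
- The method of claim 1, wherein said exogenous P4H is expressed in a DNA-containing organelle of the plant.
- The method of claim 1, wherein said at least one type of collagen alpha chain is an alpha 1 chain.
- The method of claim 1, wherein said at least one type of collagen alpha chain is an alpha 2 chain.
- The method of claim 1, wherein said at least one type of collagen alpha chain includes a C-terminal and/or an N-terminal propeptide.
- The method of claim 1, wherein the plant is selected from the group consisting of tobacco, maize, alfalfa, rice, potato, soybean, tomato, wheat, barley, canola, carrot and cotton.
- The method of claim 1, wherein said at least one type of collagen alpha chain or said exogenous P4H is expressed in only a portion of the plant.
- The method of claim 13, wherein said portion of the plant is leaves, seeds, roots, tubers or stems.
- The method of claim 1, wherein said exogenous P4H is capable of specifically hydroxylating the Y position of Gly-X-Y triplets of said at least one type of collagen alpha chain.
- The method of claim 14, wherein said exogenous P4H is human P4H.
- The method of claim 1, wherein the plant is subjected to a stress condition.
- The method of claim 17, wherein said stress condition is selected from the group consisting of drought, salinity, injury, cold and spraying with stress-inducing compounds.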
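- A genetically modified plant or isolated plant cell capable of accumulating a collagen alpha chain having a hydroxylation pattern identical to that produced when the collagen alpha chain is expressed in human cells.
- A genetically modified plant or isolated plant cell capable of accumulating a collagen alpha chain in a subcellular compartment devoid of endogenous P4H activity.
- The genetically modified plant of claim 19 or 20, further comprising an exogenous P4H.
- The genetically modified plant of claim 20, wherein said at least one type of collagen alpha chain includes a signal peptide for targeting to an apoplast or a vacuole.
- The genetically modified plant of claim 19 or 20, wherein said at least one type of collagen alpha chain is devoid of an ER targeting or retention sequence.
- The genetically modified plant of claim 19 or 20, wherein said at least one type of collagen alpha chain is expressed in a DNA-containing organelle of the plant.
- The genetically modified plant of claim 21, wherein said exogenous P4H includes a signal peptide for targeting to an apoplast or a vacuole.
- The genetically modified plant of claim 21, wherein said exogenous P4H is devoid of an ER targeting or retention sequence.
- The genetically modified plant of claim 21, wherein said exogenous P4H is expressed in a DNA-containing organelle of the plant.
- The genetically modified plant of claim 19 or 20, wherein said collagen alpha chain is an alpha 1 chain.
- The genetically modified plant of claim 19 or 20, wherein said collagen alpha chain is an alpha 2 chain.
- The genetically modified plant of claim 19 or 20, wherein said collagen alpha chain includes a C-terminal and/or an N-terminal propeptide.
- A plant system comprising a first genetically modified plant capable of accumulating a collagen alpha 1 chain and a second genetically modified plant capable of accumulating a collagen alpha 2 chain.
- The plant system of claim 31, wherein at least one of said first genetically modified plant and said second genetically modified plant further comprises exogenous P4H.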
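- A method of producing fibrillar collagen, comprising: (a) expressing in a first plant a collagen alpha 1 chain; (b) expressing in a second plant a collagen alpha 2 chain, wherein expression in said first plant and said second plant is configured such that said collagen alpha 1 chain and said collagen alpha 2 chain are each capable of accumulating in a subcellular compartment devoid of endogenous P4H activity; and (c) crossing said first plant and said second plant and selecting progeny expressing said collagen alpha 1 chain and said collagen alpha 2 chain, thereby producing fibrillar collagen.
- The method of claim 30, further comprising expressing an exogenous P4H in each of said first plant and said second plant.
- The method of claim 33, wherein each of said collagen alpha 1 chain and said collagen alpha 2 chain includes a signal peptide for targeting to an apoplast or a vacuole.
- The method of claim 1, wherein each of said collagen alpha 1 chain and said collagen alpha 2 chain is devoid of an ER targeting or retention sequence.
- The method of claim 1, wherein steps (a) and (b) are effected via expression in a DNA-containing organelle of the plant.
- The method of claim 34, wherein said exogenous P4H includes a signal peptide for targeting to an apoplast or a vacuole.
- The method of claim 34, wherein said exogenous P4H is devoid of an ER targeting or retention sequence.
- The method of claim 34, wherein said exogenous P4H is expressed in a DNA-containing organelle of the plant.
- The method of claim 1, wherein each of said collagen alpha 1 chain and said collagen alpha 2 chain includes a C-terminal and/or an N-terminal propeptide.
- The method of claim 34, wherein said exogenous P4H is capable of specifically hydroxylating the Y position of Gly-X-Y triplets of at least one type of said collagen alpha chain.
- The method of claim 34, wherein said exogenous P4H is human P4H.
- The method of claim 1, wherein said first plant and said second plant are subjected to a stress condition.
- The method of claim 44, wherein said stress condition is selected from the group consisting of drought, salinity, injury, heavy metal toxicity and cold stress.
- A method of producing fibrillar collagen, comprising: (a) expressing in a first plant a collagen alpha 1 chain and a collagen alpha 2 chain, wherein expression in said first plant is configured such that said collagen alpha 1 chain and said collagen alpha 2 chain are each capable of accumulating in a subcellular compartment devoid of endogenous P4H activity; (b) expressing in a second plant an exogenous P4H capable of accumulating in said subcellular compartment devoid of endogenous P4H activity; and (c) crossing said first plant and said second plant and selecting progeny expressing said collagen alpha 1 chain, said collagen alpha 2 chain and said P4H, thereby producing fibrillar collagen.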
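- A nucleic acid construct comprising a polynucleotide encoding a human P4H positioned under the transcriptional control of a promoter functional in plant cells.
- The nucleic acid construct of claim 47, wherein said promoter is selected from the group consisting of the CaMV 35S promoter, the ubiquitin promoter, the rbcS promoter and the SVBV promoter.
- A genetically modified plant or isolated plant cell capable of expressing a collagen alpha 1 chain, a collagen alpha 2 chain, P4H, LH3 and protease C and/or protease N.
- The genetically modified plant or isolated plant cell of claim 49, wherein said collagen alpha 1 chain and said collagen alpha 2 chain are each capable of accumulating in a subcellular compartment devoid of endogenous plant P4H activity.
- A genetically modified plant or isolated plant cell capable of accumulating collagen having a temperature stability characteristic identical to that of mammalian collagen.
- The genetically modified plant or isolated plant cell of claim 51, wherein said collagen is type I collagen.
- The genetically modified plant or isolated plant cell of claim 51, wherein said mammalian collagen is human collagen.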
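Several of the claims above refer to hydroxylation at the Y position of Gly-X-Y triplets. As a rough illustration of what that position means, the sketch below walks a one-letter peptide in triplet register and reports prolines found at the Y position. The toy peptide and the function name `y_position_prolines` are hypothetical and not taken from the patent. The sketch assumes the first residue starts a triplet, and a real collagen chain would need its register established first.

```python
def y_position_prolines(peptide: str) -> list:
    """Return 1-based positions of Pro residues at the Y position of in-register Gly-X-Y triplets."""
    peptide = peptide.upper()
    hits = []
    # Step through the sequence three residues at a time: index i is Gly, i+1 is X, i+2 is Y.
    for i in range(0, len(peptide) - 2, 3):
        if peptide[i] == "G" and peptide[i + 2] == "P":
            hits.append(i + 3)  # convert the 0-based Y index (i + 2) to a 1-based position
    return hits


# Hypothetical toy peptide: triplets GPP, GAP, GEK, GPA.
toy = "GPPGAPGEKGPA"
print(y_position_prolines(toy))
```

In the toy peptide the proline in the last triplet sits at the X position, so it is not reported. Only the Y-position residues are candidate sites under the specificity the claims describe.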
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US61371904P | 2004-09-29 | 2004-09-29 | |
US60/613,719 | 2004-09-29 |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20070083870A (ko) | 2007-08-24 |
Family
ID=35788568
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020077009817A (KR20070083870A, ko) | Collagen producing plants and methods of generating and using same | 2004-09-29 | 2005-09-28 |
Country Status (14)
Country | Link |
---|---|
EP (6) | EP3088528B1 (ko) |
JP (2) | JP5100386B2 (ko) |
KR (1) | KR20070083870A (ko) |
CN (1) | CN101065491B (ko) |
AT (1) | ATE479762T1 (ko) |
BR (1) | BRPI0516303B1 (ko) |
CA (1) | CA2582051C (ko) |
DE (1) | DE602005023332D1 (ko) |
HK (3) | HK1114405A1 (ko) |
IL (1) | IL182320A (ko) |
MX (1) | MX2007003767A (ko) |
NZ (1) | NZ554787A (ko) |
WO (1) | WO2006035442A2 (ko) |
ZA (1) | ZA200703369B (ko) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20130079220A (ko) * | 2012-01-02 | 2013-07-10 | Amorepacific Corporation | Composition for external skin application containing soybean root extract |
US10174334B2 (en) | 2013-05-06 | 2019-01-08 | Albert-Ludwigs-Universitaet Freiburg | Modified expression of prolyl-4-hydroxylase in physcomitrella patens |
Families Citing this family (25)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7378506B2 (en) | 1997-07-21 | 2008-05-27 | Ohio University | Synthetic genes for plant gums and other hydroxyproline-rich glycoproteins |
US6639050B1 (en) | 1997-07-21 | 2003-10-28 | Ohio University | Synthetic genes for plant gums and other hydroxyproline-rich glycoproteins |
EP1711533B1 (en) | 2004-01-14 | 2013-12-11 | Ohio University | Methods of producing peptides/proteins in plants and peptides/proteins produced thereby |
CA2573918A1 (en) | 2004-04-19 | 2005-11-24 | Ohio University | Cross-linkable glycoproteins and methods of making the same |
EP3088528B1 (en) | 2004-09-29 | 2019-07-03 | Collplant Ltd. | Collagen producing plants and methods of generating and using same |
US8455717B2 (en) | 2004-09-29 | 2013-06-04 | Collplant Ltd. | Collagen producing plants and methods of generating and using same |
EP2084285A4 (en) * | 2006-07-10 | 2010-01-13 | Univ Ohio | CO-EXPRESSION OF HYDROXYLASE PROLINES TO FACILITATE THE HYP-GLYCOSYLATION OF PROTEINS EXPRESSED AND SECRETED IN PLANT CELLS |
EP2220246B1 (en) * | 2007-10-26 | 2015-05-20 | Collplant Ltd. | Methods of processing recombinant procollagen |
AU2008331099B2 (en) | 2007-11-26 | 2013-10-24 | Collplant Ltd. | Compositions comprising fibrous polypeptides and polysaccharides |
WO2011064773A1 (en) | 2009-11-24 | 2011-06-03 | Collplant Ltd. | Method of generating collagen fibers |
US20130230573A1 (en) | 2010-11-16 | 2013-09-05 | Oded Shoseyov | Collagen structures and method of fabricating the same |
WO2013030840A2 (en) | 2011-09-01 | 2013-03-07 | Yissum Research Development Company Of The Hebrew University Of Jerusalem Ltd. | Adhesive biopolymers and uses thereof |
WO2013093921A1 (en) | 2011-12-20 | 2013-06-27 | Collplant Ltd. | Collagen coated synthetic polymer fibers |
BR112015023447A2 (pt) * | 2013-03-21 | 2017-12-05 | Commw Scient Ind Res Org | Purification of triple helical proteins |
DK2976097T3 (da) | 2013-03-21 | 2021-08-16 | Collplant Ltd | Compositions comprising collagen and PRP for tissue regeneration, and method for producing same |
WO2015031950A1 (en) | 2013-09-09 | 2015-03-12 | Commonwealth Scientific And Industrial Research Organisation | Modified bacterial collagen-like proteins |
EA038931B9 (ru) | 2014-11-20 | 2022-02-18 | Yissum Research Development Company Of The Hebrew University Of Jerusalem Ltd. | Compositions and methods for producing polypeptides with a modified glycosylation profile in plant cells |
WO2019056377A1 (zh) * | 2017-09-25 | 2019-03-28 | 吴侑峻 | Non-human transgenic mammal, crude protein extract isolated from its connective tissue, and methods and uses thereof |
US11801329B2 (en) * | 2018-05-03 | 2023-10-31 | Collplant Ltd. | Dermal fillers and applications thereof |
WO2020026241A1 (en) | 2018-07-31 | 2020-02-06 | Collplant Ltd. | Tobacco transgenic event and methods for detection and use thereof |
CN109988234A (zh) * | 2019-02-20 | 2019-07-09 | 江苏悦智生物医药有限公司 | Yeast recombinant human type I collagen α1 chain protein, synthesis method and application thereof |
FR3105792B1 (fr) | 2019-12-26 | 2022-08-12 | Michel Assor | Collagen/porous polymeric matrix biocomposite material and its use as an implant for repairing meniscal lesions of the knee and/or for preventing or treating knee osteoarthritis |
WO2021229577A1 (en) | 2020-05-12 | 2021-11-18 | Collplant Ltd. | Collagen as a delivery tool for metal-based anti-viral agents |
WO2023194333A1 (en) | 2022-04-04 | 2023-10-12 | Swiftpharma Bv | Recombinant spider silk-reinforced collagen proteins produced in plants and the use thereof |
WO2024029529A1 (ja) * | 2022-08-02 | 2024-02-08 | 株式会社 UniBio | Method for producing collagen |
Family Cites Families (41)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
NL154600B (nl) | 1971-02-10 | 1977-09-15 | Organon Nv | Method for detecting and determining specifically binding proteins and their corresponding bindable substances. |
NL154598B (nl) | 1970-11-10 | 1977-09-15 | Organon Nv | Method for detecting and determining low-molecular-weight compounds and proteins that can specifically bind these compounds, and test kit. |
NL154599B (nl) | 1970-12-28 | 1977-09-15 | Organon Nv | Method for detecting and determining specifically binding proteins and their corresponding bindable substances, and test kit. |
US3901654A (en) | 1971-06-21 | 1975-08-26 | Biological Developments | Receptor assays of biologically active compounds employing biologically specific receptors |
US3853987A (en) | 1971-09-01 | 1974-12-10 | W Dreyer | Immunological reagent and radioimmuno assay |
US3867517A (en) | 1971-12-21 | 1975-02-18 | Abbott Lab | Direct radioimmunoassay for antigens and their antibodies |
NL171930C (nl) | 1972-05-11 | 1983-06-01 | Akzo Nv | Method for detecting and determining haptens, and test kits. |
US3850578A (en) | 1973-03-12 | 1974-11-26 | H Mcconnell | Process for assaying for biologically active molecules |
US3935074A (en) | 1973-12-17 | 1976-01-27 | Syva Company | Antibody steric hindrance immunoassay with two antibodies |
US3996345A (en) | 1974-08-12 | 1976-12-07 | Syva Company | Fluorescence quenching with immunological pairs in immunoassays |
US4034074A (en) | 1974-09-19 | 1977-07-05 | The Board Of Trustees Of Leland Stanford Junior University | Universal reagent 2-site immunoradiometric assay using labelled anti (IgG) |
US3984533A (en) | 1975-11-13 | 1976-10-05 | General Electric Company | Electrophoretic method of detecting antigen-antibody reaction |
US4098876A (en) | 1976-10-26 | 1978-07-04 | Corning Glass Works | Reverse sandwich immunoassay |
US4879219A (en) | 1980-09-19 | 1989-11-07 | General Hospital Corporation | Immunoassay utilizing monoclonal high affinity IgM antibodies |
CA1192510A (en) | 1981-05-27 | 1985-08-27 | Lawrence E. Pelcher | RNA plant virus vector or portion thereof, a method of construction thereof, and a method of producing a gene derived product therefrom |
JPS6054684A (ja) | 1983-09-05 | 1985-03-29 | Teijin Ltd | Novel DNA and hybrid DNA |
US5011771A (en) | 1984-04-12 | 1991-04-30 | The General Hospital Corporation | Multiepitopic immunometric assay |
US4666828A (en) | 1984-08-15 | 1987-05-19 | The General Hospital Corporation | Test for Huntington's disease |
US4945050A (en) | 1984-11-13 | 1990-07-31 | Cornell Research Foundation, Inc. | Method for transporting substances into living cells and tissues and apparatus therefor |
CA1288073C (en) | 1985-03-07 | 1991-08-27 | Paul G. Ahlquist | RNA transformation vector |
US4683202A (en) | 1985-03-28 | 1987-07-28 | Cetus Corporation | Process for amplifying nucleic acid sequences |
US4801531A (en) | 1985-04-17 | 1989-01-31 | Biotechnology Research Partners, Ltd. | Apo AI/CIII genomic polymorphisms predictive of atherosclerosis |
GB8608850D0 (en) | 1986-04-11 | 1986-05-14 | Diatech Ltd | Packaging system |
JPS6314693A (ja) | 1986-07-04 | 1988-01-21 | Sumitomo Chem Co Ltd | Plant virus RNA vector |
ES2060646T3 (es) | 1987-02-09 | 1994-12-01 | Lubrizol Genetics Inc | Hybrid RNA virus. |
US5316931A (en) | 1988-02-26 | 1994-05-31 | Biosource Genetics Corp. | Plant viral vectors having heterologous subgenomic promoters for systemic expression of foreign genes |
US5693507A (en) | 1988-09-26 | 1997-12-02 | Auburn University | Genetic engineering of plant chloroplasts |
US5272057A (en) | 1988-10-14 | 1993-12-21 | Georgetown University | Method of detecting a predisposition to cancer by the use of restriction fragment length polymorphism of the gene for human poly (ADP-ribose) polymerase |
US5192659A (en) | 1989-08-25 | 1993-03-09 | Genetype Ag | Intron sequence analysis method for detection of adjacent and remote locus alleles as haplotypes |
US5593859A (en) * | 1991-10-23 | 1997-01-14 | Thomas Jefferson University | Synthesis of human procollagens and collagens in recombinant DNA systems |
US5281521A (en) | 1992-07-20 | 1994-01-25 | The Trustees Of The University Of Pennsylvania | Modified avidin-biotin technique |
FR2757874B1 (fr) * | 1996-12-17 | 2003-04-25 | Biocem | Recombinant collagens and derived proteins produced by plants, methods for obtaining them and their uses |
ES2276475T5 (es) | 1997-09-30 | 2014-07-11 | The Regents Of The University Of California | Production of proteins in plant seeds |
BR0014945A (pt) * | 1999-10-21 | 2004-08-31 | Monsanto Co | Post-translational modification of recombinant proteins produced in plants |
JP2003513659A (ja) * | 1999-11-12 | 2003-04-15 | ファイブローゲン、インコーポレーテッド | Animal collagens and gelatins |
CN1285612C (zh) * | 1999-11-12 | 2006-11-22 | 法布罗根股份有限公司 | Animal collagens and gelatins |
WO2002099067A2 (en) * | 2001-06-05 | 2002-12-12 | Oishi Karen K | Gene expression and production of TGF-β proteins including bioactive Mullerian inhibiting substance from plants |
CN101177675B (zh) * | 2002-02-08 | 2014-05-21 | 诺维信公司 | Phytase variants |
WO2004057001A2 (en) * | 2002-12-19 | 2004-07-08 | University Of Bristol | Method for the production of polyunsaturated fatty acids |
AU2003293906A1 (en) * | 2002-12-20 | 2004-07-22 | Basf Aktiengesellschaft | Malate dehydrogenase as a target for herbicides |
EP3088528B1 (en) | 2004-09-29 | 2019-07-03 | Collplant Ltd. | Collagen producing plants and methods of generating and using same |
-
2005
- 2005-09-28 EP EP16171177.5A patent/EP3088528B1/en active Active
- 2005-09-28 KR KR1020077009817A patent/KR20070083870A/ko not_active Application Discontinuation
- 2005-09-28 EP EP05789469A patent/EP1809751B1/en not_active Revoked
- 2005-09-28 CA CA2582051A patent/CA2582051C/en active Active
- 2005-09-28 AT AT05789469T patent/ATE479762T1/de not_active IP Right Cessation
- 2005-09-28 BR BRPI0516303-0A patent/BRPI0516303B1/pt active IP Right Grant
- 2005-09-28 WO PCT/IL2005/001045 patent/WO2006035442A2/en active Application Filing
- 2005-09-28 NZ NZ554787A patent/NZ554787A/en unknown
- 2005-09-28 MX MX2007003767A patent/MX2007003767A/es active IP Right Grant
- 2005-09-28 EP EP14185576.7A patent/EP2816117B1/en not_active Revoked
- 2005-09-28 EP EP10181115A patent/EP2360261A1/en not_active Ceased
- 2005-09-28 CN CN2005800408217A patent/CN101065491B/zh active Active
- 2005-09-28 EP EP17205280.5A patent/EP3312282B1/en active Active
- 2005-09-28 DE DE602005023332T patent/DE602005023332D1/de active Active
- 2005-09-28 EP EP10168971.9A patent/EP2357241B1/en not_active Revoked
- 2005-09-28 JP JP2007534176A patent/JP5100386B2/ja active Active
-
2007
- 2007-03-29 IL IL182320A patent/IL182320A/en active IP Right Grant
- 2007-04-25 ZA ZA200703369A patent/ZA200703369B/xx unknown
-
2008
- 2008-04-15 HK HK08104257.2A patent/HK1114405A1/xx not_active IP Right Cessation
-
2011
- 2011-11-29 JP JP2011259678A patent/JP5517309B2/ja active Active
-
2012
- 2012-01-19 HK HK12100629.5A patent/HK1160176A1/xx not_active IP Right Cessation
-
2018
- 2018-08-23 HK HK18110838.5A patent/HK1251611B/zh unknown
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20130079220A (ko) * | 2012-01-02 | 2013-07-10 | (주)아모레퍼시픽 | Skin external preparation composition containing soybean root extract |
US10174334B2 (en) | 2013-05-06 | 2019-01-08 | Albert-Ludwigs-Universitaet Freiburg | Modified expression of prolyl-4-hydroxylase in Physcomitrella patens |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR20070083870A (ko) | Collagen producing plants and methods of generating and using same | |
US10626408B2 (en) | Collagen producing plants and methods of generating and using same | |
US8802825B2 (en) | Production of peptides and proteins by accumulation in plant endoplasmic reticulum-derived protein bodies | |
JPH02501802A (ja) | Method for producing biologically active peptides by expression of modified storage seed protein genes in transgenic plants | |
AU2007201384B2 (en) | Collagen Producing Plants and Methods of Generating and Using Same | |
AU2011211341B2 (en) | Collagen Producing Plants and Methods of Generating and Using Same |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WITN | Application deemed withdrawn, e.g. because no request for examination was filed or no examination fee was paid |