KR20140001198A - 신규 시알로올리고당 유도체들의 합성 - Google Patents
신규 시알로올리고당 유도체들의 합성 Download PDFInfo
- Publication number
- KR20140001198A KR20140001198A KR20137003908A KR20137003908A KR20140001198A KR 20140001198 A KR20140001198 A KR 20140001198A KR 20137003908 A KR20137003908 A KR 20137003908A KR 20137003908 A KR20137003908 A KR 20137003908A KR 20140001198 A KR20140001198 A KR 20140001198A
- Authority
- KR
- South Korea
- Prior art keywords
- ala
- gly
- thr
- val
- ser
- Prior art date
Links
- 230000015572 biosynthetic process Effects 0.000 title description 26
- 238000003786 synthesis reaction Methods 0.000 title description 17
- 150000003839 salts Chemical class 0.000 claims abstract description 119
- 150000001875 compounds Chemical class 0.000 claims abstract description 118
- 125000002446 fucosyl group Chemical group C1([C@@H](O)[C@H](O)[C@H](O)[C@@H](O1)C)* 0.000 claims abstract description 88
- 238000000034 method Methods 0.000 claims abstract description 52
- 125000003535 D-glucopyranosyl group Chemical group [H]OC([H])([H])[C@@]1([H])OC([H])(*)[C@]([H])(O[H])[C@@]([H])(O[H])[C@]1([H])O[H] 0.000 claims abstract description 36
- 238000007327 hydrogenolysis reaction Methods 0.000 claims abstract description 22
- 125000006239 protecting group Chemical group 0.000 claims abstract description 18
- -1 phosphorus oligosaccharide Chemical class 0.000 claims description 162
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 claims description 101
- MSWZFWKMSRAUBD-UHFFFAOYSA-N beta-D-galactosamine Natural products NC1C(O)OC(CO)C(O)C1O MSWZFWKMSRAUBD-UHFFFAOYSA-N 0.000 claims description 77
- 229920001542 oligosaccharide Polymers 0.000 claims description 77
- 150000002482 oligosaccharides Chemical class 0.000 claims description 64
- 125000005630 sialyl group Chemical group 0.000 claims description 63
- WQZGKKKJIJFFOK-PHYPRBDBSA-N alpha-D-galactose Chemical compound OC[C@H]1O[C@H](O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-PHYPRBDBSA-N 0.000 claims description 52
- 229930182830 galactose Natural products 0.000 claims description 52
- SQVRNKJHWKZAKO-UHFFFAOYSA-N beta-N-Acetyl-D-neuraminic acid Natural products CC(=O)NC1C(O)CC(O)(C(O)=O)OC1C(O)C(O)CO SQVRNKJHWKZAKO-UHFFFAOYSA-N 0.000 claims description 41
- 108010004486 trans-sialidase Proteins 0.000 claims description 41
- SQVRNKJHWKZAKO-OQPLDHBCSA-N sialic acid Chemical compound CC(=O)N[C@@H]1[C@@H](O)C[C@@](O)(C(O)=O)OC1[C@H](O)[C@H](O)CO SQVRNKJHWKZAKO-OQPLDHBCSA-N 0.000 claims description 39
- 125000001797 benzyl group Chemical group [H]C1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])* 0.000 claims description 32
- SHZGCJCMOBCMKK-UHFFFAOYSA-N D-mannomethylose Natural products CC1OC(O)C(O)C(O)C1O SHZGCJCMOBCMKK-UHFFFAOYSA-N 0.000 claims description 31
- PNNNRSAQSRJVSB-SLPGGIOYSA-N Fucose Natural products C[C@H](O)[C@@H](O)[C@H](O)[C@H](O)C=O PNNNRSAQSRJVSB-SLPGGIOYSA-N 0.000 claims description 31
- SHZGCJCMOBCMKK-DHVFOXMCSA-N L-fucopyranose Chemical compound C[C@@H]1OC(O)[C@@H](O)[C@H](O)[C@@H]1O SHZGCJCMOBCMKK-DHVFOXMCSA-N 0.000 claims description 31
- OVRNDRQMDRJTHS-UHFFFAOYSA-N N-acelyl-D-glucosamine Natural products CC(=O)NC1C(O)OC(CO)C(O)C1O OVRNDRQMDRJTHS-UHFFFAOYSA-N 0.000 claims description 31
- OVRNDRQMDRJTHS-FMDGEEDCSA-N N-acetyl-beta-D-glucosamine Chemical compound CC(=O)N[C@H]1[C@H](O)O[C@H](CO)[C@@H](O)[C@@H]1O OVRNDRQMDRJTHS-FMDGEEDCSA-N 0.000 claims description 31
- MBLBDJOUHNCFQT-LXGUWJNJSA-N N-acetylglucosamine Natural products CC(=O)N[C@@H](C=O)[C@@H](O)[C@H](O)[C@H](O)CO MBLBDJOUHNCFQT-LXGUWJNJSA-N 0.000 claims description 31
- 229950006780 n-acetylglucosamine Drugs 0.000 claims description 31
- 102000004190 Enzymes Human genes 0.000 claims description 29
- 108090000790 Enzymes Proteins 0.000 claims description 29
- 108010006232 Neuraminidase Proteins 0.000 claims description 29
- 102000005348 Neuraminidase Human genes 0.000 claims description 28
- 239000008103 glucose Substances 0.000 claims description 23
- 229930182470 glycoside Natural products 0.000 claims description 23
- 239000008101 lactose Substances 0.000 claims description 23
- 150000002772 monosaccharides Chemical class 0.000 claims description 23
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 claims description 22
- 230000000694 effects Effects 0.000 claims description 20
- 125000000217 alkyl group Chemical group 0.000 claims description 19
- IEQCXFNWPAHHQR-UHFFFAOYSA-N lacto-N-neotetraose Natural products OCC1OC(OC2C(C(OC3C(OC(O)C(O)C3O)CO)OC(CO)C2O)O)C(NC(=O)C)C(O)C1OC1OC(CO)C(O)C(O)C1O IEQCXFNWPAHHQR-UHFFFAOYSA-N 0.000 claims description 16
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 claims description 15
- 229940062780 lacto-n-neotetraose Drugs 0.000 claims description 15
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 claims description 14
- NPPRJALWPIXIHO-PNCMPRLYSA-N beta-D-Gal-(1->4)-beta-D-GlcNAc-(1->3)-[beta-D-Gal-(1->4)-beta-D-GlcNAc-(1->6)]-beta-D-Gal-(1->4)-D-Glc Chemical compound O([C@H]1[C@H](O)[C@H]([C@@H](O[C@@H]1CO)OC[C@@H]1[C@@H]([C@H](O[C@H]2[C@@H]([C@@H](O)[C@H](O[C@H]3[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O3)O)[C@@H](CO)O2)NC(C)=O)[C@@H](O)[C@H](O[C@@H]2[C@H](OC(O)[C@H](O)[C@H]2O)CO)O1)O)NC(=O)C)[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O NPPRJALWPIXIHO-PNCMPRLYSA-N 0.000 claims description 14
- 229910052736 halogen Inorganic materials 0.000 claims description 14
- 125000003118 aryl group Chemical group 0.000 claims description 13
- IEQCXFNWPAHHQR-YKLSGRGUSA-N beta-D-Gal-(1->4)-beta-D-GlcNAc-(1->3)-beta-D-Gal-(1->4)-D-Glc Chemical compound O([C@H]1[C@H](O)[C@H]([C@@H](O[C@@H]1CO)O[C@@H]1[C@H]([C@H](O[C@@H]2[C@H](OC(O)[C@H](O)[C@H]2O)CO)O[C@H](CO)[C@@H]1O)O)NC(=O)C)[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O IEQCXFNWPAHHQR-YKLSGRGUSA-N 0.000 claims description 13
- 150000002367 halogens Chemical class 0.000 claims description 13
- UTVHXMGRNOOVTB-IXBJWXGWSA-N beta-D-Galp-(1->4)-beta-D-GlcpNAc-(1->3)-beta-D-Galp-(1->4)-beta-D-GlcpNAc-(1->3)-beta-D-Galp-(1->4)-D-Glcp Chemical compound O([C@H]1[C@H](O)[C@H]([C@@H](O[C@@H]1CO)O[C@@H]1[C@H]([C@H](O[C@@H]2[C@H](O[C@@H](O[C@@H]3[C@H]([C@H](O[C@@H]4[C@H](OC(O)[C@H](O)[C@H]4O)CO)O[C@H](CO)[C@@H]3O)O)[C@H](NC(C)=O)[C@H]2O)CO)O[C@H](CO)[C@@H]1O)O)NC(=O)C)[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O UTVHXMGRNOOVTB-IXBJWXGWSA-N 0.000 claims description 12
- 150000002338 glycosides Chemical class 0.000 claims description 12
- 125000001997 phenyl group Chemical group [H]C1=C([H])C([H])=C(*)C([H])=C1[H] 0.000 claims description 12
- 229910052698 phosphorus Inorganic materials 0.000 claims description 12
- 239000011574 phosphorus Substances 0.000 claims description 12
- 241000223109 Trypanosoma cruzi Species 0.000 claims description 11
- AXQLFFDZXPOFPO-UHFFFAOYSA-N UNPD216 Natural products O1C(CO)C(O)C(OC2C(C(O)C(O)C(CO)O2)O)C(NC(=O)C)C1OC(C1O)C(O)C(CO)OC1OC1C(O)C(O)C(O)OC1CO AXQLFFDZXPOFPO-UHFFFAOYSA-N 0.000 claims description 11
- 125000002519 galactosyl group Chemical group C1([C@H](O)[C@@H](O)[C@@H](O)[C@H](O1)CO)* 0.000 claims description 11
- ZDZMLVPSYYRJNI-CYQYEHMMSA-N p-lacto-n-hexaose Chemical compound O([C@H]1[C@H](O)[C@@H](CO)O[C@H]([C@@H]1N=C(C)O)O[C@@H]1[C@@H](O)[C@H](O[C@H]([C@H](O)CO)[C@H](O)[C@@H](O)C=O)OC([C@@H]1O)CO[C@H]1[C@@H]([C@H](C(O[C@H]2[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O2)O)[C@@H](CO)O1)O)N=C(O)C)[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O ZDZMLVPSYYRJNI-CYQYEHMMSA-N 0.000 claims description 8
- 241000186660 Lactobacillus Species 0.000 claims description 7
- PDWGIAAFQACISG-QZBWVFMZSA-N beta-D-Gal-(1->3)-beta-D-GlcNAc-(1->3)-[beta-D-Gal-(1->4)-beta-D-GlcNAc-(1->6)]-beta-D-Gal-(1->4)-D-Glc Chemical compound O([C@H]1[C@H](O)[C@H]([C@@H](O[C@@H]1CO)OC[C@@H]1[C@@H]([C@H](O[C@H]2[C@@H]([C@@H](O[C@H]3[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O3)O)[C@H](O)[C@@H](CO)O2)NC(C)=O)[C@@H](O)[C@H](O[C@@H]2[C@H](OC(O)[C@H](O)[C@H]2O)CO)O1)O)NC(=O)C)[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O PDWGIAAFQACISG-QZBWVFMZSA-N 0.000 claims description 7
- AXQLFFDZXPOFPO-UNTPKZLMSA-N beta-D-Galp-(1->3)-beta-D-GlcpNAc-(1->3)-beta-D-Galp-(1->4)-beta-D-Glcp Chemical compound O([C@@H]1O[C@H](CO)[C@H](O)[C@@H]([C@H]1O)O[C@H]1[C@@H]([C@H]([C@H](O)[C@@H](CO)O1)O[C@H]1[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O1)O)NC(=O)C)[C@H]1[C@H](O)[C@@H](O)[C@H](O)O[C@@H]1CO AXQLFFDZXPOFPO-UNTPKZLMSA-N 0.000 claims description 7
- USIPEGYTBGEPJN-UHFFFAOYSA-N lacto-N-tetraose Natural products O1C(CO)C(O)C(OC2C(C(O)C(O)C(CO)O2)O)C(NC(=O)C)C1OC1C(O)C(CO)OC(OC(C(O)CO)C(O)C(O)C=O)C1O USIPEGYTBGEPJN-UHFFFAOYSA-N 0.000 claims description 7
- 229940039696 lactobacillus Drugs 0.000 claims description 7
- CILYIEBUXJIHCO-BNUYLMRTSA-N C(C)(=O)N[C@@H]1[C@H](CC(C(O)=O)(O[C@H]1[C@H](O)[C@H](O)CO)O[C@@H]1[C@H]([C@H](O[C@H]2[C@@H]([C@H](C(O)O[C@@H]2CO)O)O)O[C@@H]([C@@H]1O)CO)O)O Chemical compound C(C)(=O)N[C@@H]1[C@H](CC(C(O)=O)(O[C@H]1[C@H](O)[C@H](O)CO)O[C@@H]1[C@H]([C@H](O[C@H]2[C@@H]([C@H](C(O)O[C@@H]2CO)O)O)O[C@@H]([C@@H]1O)CO)O)O CILYIEBUXJIHCO-BNUYLMRTSA-N 0.000 claims description 6
- 108090000288 Glycoproteins Proteins 0.000 claims description 6
- 102000003886 Glycoproteins Human genes 0.000 claims description 6
- 229910052739 hydrogen Inorganic materials 0.000 claims description 6
- SJQVFWDWZSSQJI-RIIJGTCGSA-N lacto-n-decaose Chemical compound C[C@@H]1[C@@H](C)[C@@H](C)[C@@H](CC)OC1OC1[C@@H](CC)OC(OC[C@@H]2[C@@H](C(OC3[C@@H](C(OC4[C@@H]([C@@H](C)[C@@H](C)[C@@H](CC)O4)C)[C@@H](C)[C@@H](CC)O3)N=C(C)O)[C@@H](C)C(OC3[C@H](OC(OCC4[C@@H](C(OC5[C@@H](C(OC6[C@@H]([C@@H](C)[C@@H](C)[C@@H](CC)O6)C)[C@@H](C)[C@@H](CC)O5)N=C(C)O)[C@@H](C)C(OC5[C@@H]([C@@H](C)C(C)O[C@@H]5C)C)O4)C)[C@H](N=C(C)O)[C@H]3C)CC)O2)C)[C@H](N=C(C)O)[C@H]1C SJQVFWDWZSSQJI-RIIJGTCGSA-N 0.000 claims description 6
- 125000000636 p-nitrophenyl group Chemical group [H]C1=C([H])C(=C([H])C([H])=C1*)[N+]([O-])=O 0.000 claims description 6
- 241000186000 Bifidobacterium Species 0.000 claims description 5
- 229930186217 Glycolipid Natural products 0.000 claims description 5
- WJPIUUDKRHCAEL-YVEAQFMBSA-N 3-fucosyllactose Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@H]1O[C@H]1[C@H](O[C@H]2[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O2)O)[C@@H](CO)OC(O)[C@@H]1O WJPIUUDKRHCAEL-YVEAQFMBSA-N 0.000 claims description 4
- WJPIUUDKRHCAEL-UHFFFAOYSA-N 3FL Natural products OC1C(O)C(O)C(C)OC1OC1C(OC2C(C(O)C(O)C(CO)O2)O)C(CO)OC(O)C1O WJPIUUDKRHCAEL-UHFFFAOYSA-N 0.000 claims description 4
- LTIQZTOACJJKGQ-CTXMSNPHSA-N C1([C@@H](O)[C@H](O)[C@H](O)[C@@H](O1)C)O[C@@H]1[C@H](C(O)O[C@@H]([C@H]1O[C@H]1[C@H](O)[C@@H](OC2(C(O)=O)C[C@H](O)[C@@H](NC(C)=O)[C@@H](O2)[C@H](O)[C@H](O)CO)[C@@H](O)[C@H](O1)CO)CO)O Chemical compound C1([C@@H](O)[C@H](O)[C@H](O)[C@@H](O1)C)O[C@@H]1[C@H](C(O)O[C@@H]([C@H]1O[C@H]1[C@H](O)[C@@H](OC2(C(O)=O)C[C@H](O)[C@@H](NC(C)=O)[C@@H](O2)[C@H](O)[C@H](O)CO)[C@@H](O)[C@H](O1)CO)CO)O LTIQZTOACJJKGQ-CTXMSNPHSA-N 0.000 claims description 4
- AXQLFFDZXPOFPO-FSGZUBPKSA-N beta-D-Gal-(1->3)-beta-D-GlcNAc-(1->3)-beta-D-Gal-(1->4)-D-Glc Chemical compound O([C@@H]1O[C@H](CO)[C@H](O)[C@@H]([C@H]1O)O[C@H]1[C@@H]([C@H]([C@H](O)[C@@H](CO)O1)O[C@H]1[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O1)O)NC(=O)C)[C@H]1[C@H](O)[C@@H](O)C(O)O[C@@H]1CO AXQLFFDZXPOFPO-FSGZUBPKSA-N 0.000 claims description 4
- 230000000739 chaotic effect Effects 0.000 claims description 4
- 230000001747 exhibiting effect Effects 0.000 claims description 4
- 108010015899 Glycopeptides Proteins 0.000 claims description 3
- 102000002068 Glycopeptides Human genes 0.000 claims description 3
- TVVLIFCVJJSLBL-SEHWTJTBSA-N Lacto-N-fucopentaose V Chemical compound O[C@H]1C(O)C(O)[C@H](C)O[C@H]1OC([C@@H](O)C=O)[C@@H](C(O)CO)O[C@H]1[C@H](O)[C@@H](OC2[C@@H](C(OC3[C@@H](C(O)C(O)[C@@H](CO)O3)O)[C@H](O)[C@@H](CO)O2)NC(C)=O)[C@@H](O)[C@@H](CO)O1 TVVLIFCVJJSLBL-SEHWTJTBSA-N 0.000 claims description 3
- 125000003047 N-acetyl group Chemical group 0.000 claims description 3
- SFMRPVLZMVJKGZ-JRZQLMJNSA-N Sialyllacto-N-tetraose b Chemical compound O1[C@@H]([C@H](O)[C@H](O)CO)[C@H](NC(=O)C)[C@@H](O)C[C@@]1(C(O)=O)OC[C@@H]1[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O2)O)[C@@H](NC(C)=O)[C@H](O[C@@H]2[C@H]([C@H](O[C@H]([C@H](O)CO)[C@H](O)[C@@H](O)C=O)O[C@H](CO)[C@@H]2O)O)O1 SFMRPVLZMVJKGZ-JRZQLMJNSA-N 0.000 claims description 3
- 125000002015 acyclic group Chemical group 0.000 claims description 3
- 125000001931 aliphatic group Chemical group 0.000 claims description 3
- DMYPRRDPOMGEAK-XWDFSUOISA-N beta-D-Galp-(1->3)-[alpha-L-Fucp-(1->4)]-beta-D-GlcpNAc-(1->3)-beta-D-Galp-(1->4)-[alpha-L-Fucp-(1->3)]-D-Glcp Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@H]1O[C@H]1[C@H](O[C@H]2[C@@H]([C@@H](O[C@H]3[C@@H]([C@@H](O[C@H]4[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O4)O)[C@H](O[C@H]4[C@H]([C@H](O)[C@H](O)[C@H](C)O4)O)[C@@H](CO)O3)NC(C)=O)[C@@H](O)[C@@H](CO)O2)O)[C@@H](CO)OC(O)[C@@H]1O DMYPRRDPOMGEAK-XWDFSUOISA-N 0.000 claims description 3
- 238000006555 catalytic reaction Methods 0.000 claims description 3
- 125000004122 cyclic group Chemical group 0.000 claims description 3
- DMYPRRDPOMGEAK-UHFFFAOYSA-N lacto-N-difucohexaose II Natural products OC1C(O)C(O)C(C)OC1OC1C(OC2C(C(OC3C(C(OC4C(C(O)C(O)C(CO)O4)O)C(OC4C(C(O)C(O)C(C)O4)O)C(CO)O3)NC(C)=O)C(O)C(CO)O2)O)C(CO)OC(O)C1O DMYPRRDPOMGEAK-UHFFFAOYSA-N 0.000 claims description 3
- WMYQZGAEYLPOSX-JOEMMLBASA-N lex-lactose Chemical compound OC1[C@@H](O)[C@@H](O)[C@@H](C)O[C@@H]1O[C@H]1C(O[C@H]2[C@@H](C(O)C(O)C(CO)O2)O)[C@@H](CO)O[C@@H](O[C@@H]2[C@H]([C@H](OC(C(O)CO)[C@H](O)[C@@H](O)C=O)OC(CO)C2O)O)C1NC(C)=O WMYQZGAEYLPOSX-JOEMMLBASA-N 0.000 claims description 3
- 159000000000 sodium salts Chemical class 0.000 claims description 3
- DQJCDTNMLBYVAY-ZXXIYAEKSA-N (2S,5R,10R,13R)-16-{[(2R,3S,4R,5R)-3-{[(2S,3R,4R,5S,6R)-3-acetamido-4,5-dihydroxy-6-(hydroxymethyl)oxan-2-yl]oxy}-5-(ethylamino)-6-hydroxy-2-(hydroxymethyl)oxan-4-yl]oxy}-5-(4-aminobutyl)-10-carbamoyl-2,13-dimethyl-4,7,12,15-tetraoxo-3,6,11,14-tetraazaheptadecan-1-oic acid Chemical compound NCCCC[C@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)CC[C@H](C(N)=O)NC(=O)[C@@H](C)NC(=O)C(C)O[C@@H]1[C@@H](NCC)C(O)O[C@H](CO)[C@H]1O[C@H]1[C@H](NC(C)=O)[C@@H](O)[C@H](O)[C@@H](CO)O1 DQJCDTNMLBYVAY-ZXXIYAEKSA-N 0.000 claims description 2
- HVCOBJNICQPDBP-UHFFFAOYSA-N 3-[3-[3,5-dihydroxy-6-methyl-4-(3,4,5-trihydroxy-6-methyloxan-2-yl)oxyoxan-2-yl]oxydecanoyloxy]decanoic acid;hydrate Chemical compound O.OC1C(OC(CC(=O)OC(CCCCCCC)CC(O)=O)CCCCCCC)OC(C)C(O)C1OC1C(O)C(O)C(O)C(C)O1 HVCOBJNICQPDBP-UHFFFAOYSA-N 0.000 claims description 2
- 125000002353 D-glucosyl group Chemical group C1([C@H](O)[C@@H](O)[C@H](O)[C@H](O1)CO)* 0.000 claims description 2
- 102000013361 fetuin Human genes 0.000 claims description 2
- 108060002885 fetuin Proteins 0.000 claims description 2
- 125000003132 pyranosyl group Chemical group 0.000 claims description 2
- RBMYDHMFFAVMMM-PLQWBNBWSA-N neolactotetraose Chemical compound O([C@H]1[C@H](O)[C@H]([C@@H](O[C@@H]1CO)O[C@@H]1[C@H]([C@H](O[C@H]([C@H](O)CO)[C@H](O)[C@@H](O)C=O)O[C@H](CO)[C@@H]1O)O)NC(=O)C)[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O RBMYDHMFFAVMMM-PLQWBNBWSA-N 0.000 claims 3
- ATYSZLKTHMZHJA-UXLSSDPBSA-N (3R,4R,5S,6R)-6-(hydroxymethyl)-2-[(3R,4S,5R,6R)-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl]-5-[(2S,3R,4S,5R,6R)-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl]oxyoxane-2,3,4-triol Chemical group C1([C@H](O)[C@@H](O)[C@@H](O)[C@H](O1)CO)C1(O)[C@H](O)[C@@H](O)[C@H](O[C@H]2[C@H](O)[C@@H](O)[C@@H](O)[C@H](O2)CO)[C@H](O1)CO ATYSZLKTHMZHJA-UXLSSDPBSA-N 0.000 claims 1
- MSWZFWKMSRAUBD-IVMDWMLBSA-N 2-amino-2-deoxy-D-glucopyranose Chemical compound N[C@H]1C(O)O[C@H](CO)[C@@H](O)[C@@H]1O MSWZFWKMSRAUBD-IVMDWMLBSA-N 0.000 claims 1
- PSGQCCSGKGJLRL-UHFFFAOYSA-N 4-methyl-2h-chromen-2-one Chemical group C1=CC=CC2=C1OC(=O)C=C2C PSGQCCSGKGJLRL-UHFFFAOYSA-N 0.000 claims 1
- 229960002442 glucosamine Drugs 0.000 claims 1
- 229930182478 glucoside Natural products 0.000 claims 1
- 238000001308 synthesis method Methods 0.000 claims 1
- 230000002194 synthesizing effect Effects 0.000 abstract description 6
- 235000013350 formula milk Nutrition 0.000 description 188
- 229960003082 galactose Drugs 0.000 description 50
- 108010061238 threonyl-glycine Proteins 0.000 description 39
- 235000014633 carbohydrates Nutrition 0.000 description 37
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 37
- 238000006243 chemical reaction Methods 0.000 description 29
- 229910052799 carbon Inorganic materials 0.000 description 28
- 241000880493 Leptailurus serval Species 0.000 description 27
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 25
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 24
- 108010087924 alanylproline Proteins 0.000 description 23
- 108010047857 aspartylglycine Proteins 0.000 description 22
- 125000005647 linker group Chemical group 0.000 description 21
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 20
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 19
- 229960001031 glucose Drugs 0.000 description 19
- 108010050848 glycylleucine Proteins 0.000 description 19
- 108010037850 glycylvaline Proteins 0.000 description 19
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 18
- 108010047495 alanylglycine Proteins 0.000 description 17
- 125000004429 atom Chemical group 0.000 description 17
- 108010089804 glycyl-threonine Proteins 0.000 description 17
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 16
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 16
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 16
- 108010079364 N-glycylalanine Proteins 0.000 description 16
- 238000004519 manufacturing process Methods 0.000 description 16
- 239000000047 product Substances 0.000 description 16
- CBCCCLMNOBLBSC-XVYDVKMFSA-N Ala-His-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CBCCCLMNOBLBSC-XVYDVKMFSA-N 0.000 description 15
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 15
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 15
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 15
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 14
- 239000000470 constituent Substances 0.000 description 14
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 14
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 13
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 13
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 13
- 108010053725 prolylvaline Proteins 0.000 description 13
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 12
- ZMANZCXQSJIPKH-UHFFFAOYSA-N Triethylamine Chemical compound CCN(CC)CC ZMANZCXQSJIPKH-UHFFFAOYSA-N 0.000 description 12
- 108010093581 aspartyl-proline Proteins 0.000 description 12
- 239000000203 mixture Substances 0.000 description 12
- 125000006283 4-chlorobenzyl group Chemical group [H]C1=C([H])C(=C([H])C([H])=C1Cl)C([H])([H])* 0.000 description 11
- 125000006181 4-methyl benzyl group Chemical group [H]C1=C([H])C(=C([H])C([H])=C1C([H])([H])[H])C([H])([H])* 0.000 description 11
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 11
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 11
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 11
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 11
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 11
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 11
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 11
- 238000005755 formation reaction Methods 0.000 description 11
- 108010024607 phenylalanylalanine Proteins 0.000 description 11
- 108010031719 prolyl-serine Proteins 0.000 description 11
- 239000011541 reaction mixture Substances 0.000 description 11
- 238000001644 13C nuclear magnetic resonance spectroscopy Methods 0.000 description 10
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 10
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 10
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 10
- 108010065920 Insulin Lispro Proteins 0.000 description 10
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 10
- YFNOUBWUIIJQHF-LPEHRKFASA-N Pro-Asp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O YFNOUBWUIIJQHF-LPEHRKFASA-N 0.000 description 10
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 10
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 10
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 10
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 10
- 108010015792 glycyllysine Proteins 0.000 description 10
- 108010057821 leucylproline Proteins 0.000 description 10
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 10
- KFEUJDWYNGMDBV-UHFFFAOYSA-N (N-Acetyl)-glucosamin-4-beta-galaktosid Natural products OC1C(NC(=O)C)C(O)OC(CO)C1OC1C(O)C(O)C(O)C(CO)O1 KFEUJDWYNGMDBV-UHFFFAOYSA-N 0.000 description 9
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 9
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 9
- VYMJAWXRWHJIMS-LKTVYLICSA-N Ala-Tyr-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VYMJAWXRWHJIMS-LKTVYLICSA-N 0.000 description 9
- 241000186016 Bifidobacterium bifidum Species 0.000 description 9
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 9
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 9
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 9
- KFEUJDWYNGMDBV-LODBTCKLSA-N N-acetyllactosamine Chemical class O[C@@H]1[C@@H](NC(=O)C)[C@H](O)O[C@H](CO)[C@H]1O[C@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 KFEUJDWYNGMDBV-LODBTCKLSA-N 0.000 description 9
- HESSGHHCXGBPAJ-UHFFFAOYSA-N N-acetyllactosamine Natural products CC(=O)NC(C=O)C(O)C(C(O)CO)OC1OC(CO)C(O)C(O)C1O HESSGHHCXGBPAJ-UHFFFAOYSA-N 0.000 description 9
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 9
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 9
- 108010005233 alanylglutamic acid Proteins 0.000 description 9
- 229940002008 bifidobacterium bifidum Drugs 0.000 description 9
- 108010034529 leucyl-lysine Proteins 0.000 description 9
- 238000000746 purification Methods 0.000 description 9
- 239000000243 solution Substances 0.000 description 9
- 108010080629 tryptophan-leucine Proteins 0.000 description 9
- 125000006506 3-phenyl benzyl group Chemical group [H]C1=C([H])C([H])=C(C([H])=C1[H])C1=C([H])C([H])=C([H])C(=C1[H])C([H])([H])* 0.000 description 8
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 8
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 8
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 8
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 8
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 8
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 8
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 8
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 8
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 8
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 8
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 8
- 238000005481 NMR spectroscopy Methods 0.000 description 8
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 8
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 8
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 8
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 8
- 239000002253 acid Substances 0.000 description 8
- 108010044940 alanylglutamine Proteins 0.000 description 8
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 8
- 108010077245 asparaginyl-proline Proteins 0.000 description 8
- 125000002791 glucosyl group Chemical group C1([C@H](O)[C@@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 8
- 108010048818 seryl-histidine Proteins 0.000 description 8
- 108010071207 serylmethionine Proteins 0.000 description 8
- 125000005629 sialic acid group Chemical group 0.000 description 8
- 108010073969 valyllysine Proteins 0.000 description 8
- 238000005160 1H NMR spectroscopy Methods 0.000 description 7
- KQFRUSHJPKXBMB-BHDSKKPTSA-N Ala-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 KQFRUSHJPKXBMB-BHDSKKPTSA-N 0.000 description 7
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 7
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 7
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 7
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 7
- WCZXPVPHUMYLMS-VEVYYDQMSA-N Arg-Thr-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O WCZXPVPHUMYLMS-VEVYYDQMSA-N 0.000 description 7
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 7
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 7
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 7
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 7
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 7
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 7
- CQMFNTVQVLQRLT-JHEQGTHGSA-N Gly-Thr-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CQMFNTVQVLQRLT-JHEQGTHGSA-N 0.000 description 7
- ILUVWFTXAUYOBW-CUJWVEQBSA-N His-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CN=CN1)N)O ILUVWFTXAUYOBW-CUJWVEQBSA-N 0.000 description 7
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 7
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 7
- AUIYHFRUOOKTGX-UKJIMTQDSA-N Ile-Val-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N AUIYHFRUOOKTGX-UKJIMTQDSA-N 0.000 description 7
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 7
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 7
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 7
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 7
- BVXXDMUMHMXFER-BPNCWPANSA-N Met-Ala-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVXXDMUMHMXFER-BPNCWPANSA-N 0.000 description 7
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 7
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 7
- OCSACVPBMIYNJE-GUBZILKMSA-N Pro-Arg-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O OCSACVPBMIYNJE-GUBZILKMSA-N 0.000 description 7
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 7
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 7
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 7
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 7
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 7
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 7
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 7
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 7
- 108010070944 alanylhistidine Proteins 0.000 description 7
- 239000002585 base Substances 0.000 description 7
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 7
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 7
- 108010084389 glycyltryptophan Proteins 0.000 description 7
- 108010087823 glycyltyrosine Proteins 0.000 description 7
- 108010036413 histidylglycine Proteins 0.000 description 7
- 108010053037 kyotorphin Proteins 0.000 description 7
- 229910052757 nitrogen Inorganic materials 0.000 description 7
- 108010070643 prolylglutamic acid Proteins 0.000 description 7
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 7
- WFDIJRYMOXRFFG-UHFFFAOYSA-N Acetic anhydride Chemical compound CC(=O)OC(C)=O WFDIJRYMOXRFFG-UHFFFAOYSA-N 0.000 description 6
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 description 6
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 6
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 6
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 6
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 6
- ZAESWDKAMDVHLL-RCOVLWMOSA-N Asn-Val-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O ZAESWDKAMDVHLL-RCOVLWMOSA-N 0.000 description 6
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 6
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 6
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 6
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 6
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 6
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 6
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 6
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 6
- DXVOKNVIKORTHQ-GUBZILKMSA-N Glu-Pro-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O DXVOKNVIKORTHQ-GUBZILKMSA-N 0.000 description 6
- MWTGQXBHVRTCOR-GLLZPBPUSA-N Glu-Thr-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MWTGQXBHVRTCOR-GLLZPBPUSA-N 0.000 description 6
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 6
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 6
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 6
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 6
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 6
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 6
- 238000004977 Hueckel calculation Methods 0.000 description 6
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 6
- KUHFPGIVBOCRMV-MNXVOIDGSA-N Ile-Gln-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N KUHFPGIVBOCRMV-MNXVOIDGSA-N 0.000 description 6
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 6
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 6
- HGLKOTPFWOMPOB-MEYUZBJRSA-N Leu-Thr-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HGLKOTPFWOMPOB-MEYUZBJRSA-N 0.000 description 6
- SEZADXQOJJTXPG-VFAJRCTISA-N Lys-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N)O SEZADXQOJJTXPG-VFAJRCTISA-N 0.000 description 6
- LMKSBGIUPVRHEH-FXQIFTODSA-N Met-Ala-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(N)=O LMKSBGIUPVRHEH-FXQIFTODSA-N 0.000 description 6
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 6
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 6
- RVEVENLSADZUMS-IHRRRGAJSA-N Phe-Pro-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RVEVENLSADZUMS-IHRRRGAJSA-N 0.000 description 6
- DIFXZGPHVCIVSQ-CIUDSAMLSA-N Pro-Gln-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DIFXZGPHVCIVSQ-CIUDSAMLSA-N 0.000 description 6
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 6
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 6
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 6
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 6
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 6
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 6
- CAGTXGDOIFXLPC-KZVJFYERSA-N Thr-Arg-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N CAGTXGDOIFXLPC-KZVJFYERSA-N 0.000 description 6
- IJKNKFJZOJCKRR-GBALPHGKSA-N Thr-Trp-Ser Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 IJKNKFJZOJCKRR-GBALPHGKSA-N 0.000 description 6
- DLZKEQQWXODGGZ-KWQFWETISA-N Tyr-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DLZKEQQWXODGGZ-KWQFWETISA-N 0.000 description 6
- WTXQBCCKXIKKHB-JYJNAYRXSA-N Tyr-Arg-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WTXQBCCKXIKKHB-JYJNAYRXSA-N 0.000 description 6
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 6
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 6
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 6
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 6
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 6
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 6
- 108010041407 alanylaspartic acid Proteins 0.000 description 6
- 125000003545 alkoxy group Chemical group 0.000 description 6
- 108010038633 aspartylglutamate Proteins 0.000 description 6
- CLRLHXKNIYJWAW-QBTAGHCHSA-N deaminoneuraminic acid Chemical compound OC[C@@H](O)[C@@H](O)[C@@H]1OC(O)(C(O)=O)C[C@H](O)[C@H]1O CLRLHXKNIYJWAW-QBTAGHCHSA-N 0.000 description 6
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 6
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 6
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 6
- 108010064235 lysylglycine Proteins 0.000 description 6
- 239000003921 oil Substances 0.000 description 6
- 108010029020 prolylglycine Proteins 0.000 description 6
- 150000004043 trisaccharides Chemical group 0.000 description 6
- 108010005834 tyrosyl-alanyl-glycine Proteins 0.000 description 6
- NLOMBWNGESDVJU-GUBZILKMSA-N Ala-Met-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLOMBWNGESDVJU-GUBZILKMSA-N 0.000 description 5
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 5
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 5
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 5
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 5
- WTUZDHWWGUQEKN-SRVKXCTJSA-N Arg-Val-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O WTUZDHWWGUQEKN-SRVKXCTJSA-N 0.000 description 5
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 5
- NLDNNZKUSLAYFW-NHCYSSNCSA-N Asn-Lys-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLDNNZKUSLAYFW-NHCYSSNCSA-N 0.000 description 5
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 5
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 5
- XPGVTUBABLRGHY-BIIVOSGPSA-N Asp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N XPGVTUBABLRGHY-BIIVOSGPSA-N 0.000 description 5
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 5
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 5
- ZCKYZTGLXIEOKS-CIUDSAMLSA-N Asp-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N ZCKYZTGLXIEOKS-CIUDSAMLSA-N 0.000 description 5
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 5
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 5
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 5
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 5
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 5
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 5
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 5
- XXDLUZLKHOVPNW-IHRRRGAJSA-N Cys-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N)O XXDLUZLKHOVPNW-IHRRRGAJSA-N 0.000 description 5
- PIUPHASDUFSHTF-CIUDSAMLSA-N Gln-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O PIUPHASDUFSHTF-CIUDSAMLSA-N 0.000 description 5
- WTJIWXMJESRHMM-XDTLVQLUSA-N Gln-Tyr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O WTJIWXMJESRHMM-XDTLVQLUSA-N 0.000 description 5
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 5
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 5
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 5
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 5
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 5
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 5
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 5
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 5
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 5
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 5
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 5
- ZMXDDKWLCZADIW-UHFFFAOYSA-N N,N-Dimethylformamide Chemical compound CN(C)C=O ZMXDDKWLCZADIW-UHFFFAOYSA-N 0.000 description 5
- GMWNQSGWWGKTSF-LFSVMHDDSA-N Phe-Thr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMWNQSGWWGKTSF-LFSVMHDDSA-N 0.000 description 5
- CDHURCQGUDNBMA-UBHSHLNASA-N Phe-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDHURCQGUDNBMA-UBHSHLNASA-N 0.000 description 5
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 5
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 5
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 5
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 5
- VXMHQKHDKCATDV-VEVYYDQMSA-N Thr-Asp-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VXMHQKHDKCATDV-VEVYYDQMSA-N 0.000 description 5
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 5
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 5
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 5
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 5
- YXFVVABEGXRONW-UHFFFAOYSA-N Toluene Chemical compound CC1=CC=CC=C1 YXFVVABEGXRONW-UHFFFAOYSA-N 0.000 description 5
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 5
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 5
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 5
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 5
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 5
- DVLWZWNAQUBZBC-ZNSHCXBVSA-N Val-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N)O DVLWZWNAQUBZBC-ZNSHCXBVSA-N 0.000 description 5
- 239000000370 acceptor Substances 0.000 description 5
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 5
- 108010008355 arginyl-glutamine Proteins 0.000 description 5
- 108010036533 arginylvaline Proteins 0.000 description 5
- 239000006227 byproduct Substances 0.000 description 5
- 150000001720 carbohydrates Chemical class 0.000 description 5
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 5
- 108010078144 glutaminyl-glycine Proteins 0.000 description 5
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 5
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 5
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 5
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 5
- 108010003700 lysyl aspartic acid Proteins 0.000 description 5
- 108010017391 lysylvaline Proteins 0.000 description 5
- 230000035772 mutation Effects 0.000 description 5
- 108010015796 prolylisoleucine Proteins 0.000 description 5
- 238000000926 separation method Methods 0.000 description 5
- 108010026333 seryl-proline Proteins 0.000 description 5
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 5
- COEXAQSTZUWMRI-STQMWFEESA-N (2s)-1-[2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound C([C@H](N)C(=O)NCC(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=C(O)C=C1 COEXAQSTZUWMRI-STQMWFEESA-N 0.000 description 4
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 4
- VBDMWOKJZDCFJM-FXQIFTODSA-N Ala-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N VBDMWOKJZDCFJM-FXQIFTODSA-N 0.000 description 4
- PIPTUBPKYFRLCP-NHCYSSNCSA-N Ala-Ala-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PIPTUBPKYFRLCP-NHCYSSNCSA-N 0.000 description 4
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 4
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 description 4
- FVSOUJZKYWEFOB-KBIXCLLPSA-N Ala-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)N FVSOUJZKYWEFOB-KBIXCLLPSA-N 0.000 description 4
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 4
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 4
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 4
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 4
- ZPXCNXMJEZKRLU-LSJOCFKGSA-N Ala-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 ZPXCNXMJEZKRLU-LSJOCFKGSA-N 0.000 description 4
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 4
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 4
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 4
- CYBJZLQSUJEMAS-LFSVMHDDSA-N Ala-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C)N)O CYBJZLQSUJEMAS-LFSVMHDDSA-N 0.000 description 4
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 4
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 4
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 4
- NHWYNIZWLJYZAG-XVYDVKMFSA-N Ala-Ser-His Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N NHWYNIZWLJYZAG-XVYDVKMFSA-N 0.000 description 4
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 4
- SYIFFFHSXBNPMC-UWJYBYFXSA-N Ala-Ser-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N SYIFFFHSXBNPMC-UWJYBYFXSA-N 0.000 description 4
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 4
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 4
- 206010001935 American trypanosomiasis Diseases 0.000 description 4
- YFWTXMRJJDNTLM-LSJOCFKGSA-N Arg-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFWTXMRJJDNTLM-LSJOCFKGSA-N 0.000 description 4
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 4
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 4
- UFBURHXMKFQVLM-CIUDSAMLSA-N Arg-Glu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UFBURHXMKFQVLM-CIUDSAMLSA-N 0.000 description 4
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 4
- GNYUVVJYGJFKHN-RVMXOQNASA-N Arg-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GNYUVVJYGJFKHN-RVMXOQNASA-N 0.000 description 4
- ZDBWKBCKYJGKGP-DCAQKATOSA-N Arg-Leu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O ZDBWKBCKYJGKGP-DCAQKATOSA-N 0.000 description 4
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 4
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 4
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 4
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 4
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 4
- VHQSGALUSWIYOD-QXEWZRGKSA-N Asn-Pro-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O VHQSGALUSWIYOD-QXEWZRGKSA-N 0.000 description 4
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 4
- AXXCUABIFZPKPM-BQBZGAKWSA-N Asp-Arg-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O AXXCUABIFZPKPM-BQBZGAKWSA-N 0.000 description 4
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 4
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 4
- RVMXMLSYBTXCAV-VEVYYDQMSA-N Asp-Pro-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMXMLSYBTXCAV-VEVYYDQMSA-N 0.000 description 4
- 241000848305 Bifidobacterium longum subsp. infantis ATCC 15697 = JCM 1222 = DSM 20088 Species 0.000 description 4
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 4
- MWLYSLMKFXWZPW-ZPFDUUQYSA-N Gln-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCC(N)=O MWLYSLMKFXWZPW-ZPFDUUQYSA-N 0.000 description 4
- XOKGKOQWADCLFQ-GARJFASQSA-N Gln-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XOKGKOQWADCLFQ-GARJFASQSA-N 0.000 description 4
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 4
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 4
- CAXXTYYGFYTBPV-IUCAKERBSA-N Gln-Leu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CAXXTYYGFYTBPV-IUCAKERBSA-N 0.000 description 4
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 4
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 4
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 4
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 4
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 4
- YOTHMZZSJKKEHZ-SZMVWBNQSA-N Glu-Trp-Lys Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CCC(O)=O)=CNC2=C1 YOTHMZZSJKKEHZ-SZMVWBNQSA-N 0.000 description 4
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 4
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 4
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 4
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 4
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 4
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 4
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 4
- WTUSRDZLLWGYAT-KCTSRDHCSA-N Gly-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN WTUSRDZLLWGYAT-KCTSRDHCSA-N 0.000 description 4
- UVTSZKIATYSKIR-RYUDHWBXSA-N Gly-Tyr-Glu Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O UVTSZKIATYSKIR-RYUDHWBXSA-N 0.000 description 4
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 4
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 4
- IAYPZSHNZQHQNO-KKUMJFAQSA-N His-Ser-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N IAYPZSHNZQHQNO-KKUMJFAQSA-N 0.000 description 4
- FFKJUTZARGRVTH-KKUMJFAQSA-N His-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FFKJUTZARGRVTH-KKUMJFAQSA-N 0.000 description 4
- HTDRTKMNJRRYOJ-SIUGBPQLSA-N Ile-Gln-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HTDRTKMNJRRYOJ-SIUGBPQLSA-N 0.000 description 4
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 4
- XLXPYSDGMXTTNQ-DKIMLUQUSA-N Ile-Phe-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(O)=O XLXPYSDGMXTTNQ-DKIMLUQUSA-N 0.000 description 4
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 4
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 4
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 4
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 4
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 4
- WXZOHBVPVKABQN-DCAQKATOSA-N Leu-Met-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WXZOHBVPVKABQN-DCAQKATOSA-N 0.000 description 4
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 4
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 4
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 4
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 4
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 4
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 4
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 4
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 4
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 4
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 4
- VSJXPNCQYGOLFM-XIRDDKMYSA-N Lys-Cys-Trp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O VSJXPNCQYGOLFM-XIRDDKMYSA-N 0.000 description 4
- DUTMKEAPLLUGNO-JYJNAYRXSA-N Lys-Glu-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DUTMKEAPLLUGNO-JYJNAYRXSA-N 0.000 description 4
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 4
- GODBLDDYHFTUAH-CIUDSAMLSA-N Met-Asp-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O GODBLDDYHFTUAH-CIUDSAMLSA-N 0.000 description 4
- XDGFFEZAZHRZFR-RHYQMDGZSA-N Met-Leu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDGFFEZAZHRZFR-RHYQMDGZSA-N 0.000 description 4
- JHVNNUIQXOGAHI-KJEVXHAQSA-N Met-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N)O JHVNNUIQXOGAHI-KJEVXHAQSA-N 0.000 description 4
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 4
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 4
- WSXKXSBOJXEZDV-DLOVCJGASA-N Phe-Ala-Asn Chemical compound NC(=O)C[C@@H](C([O-])=O)NC(=O)[C@H](C)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 WSXKXSBOJXEZDV-DLOVCJGASA-N 0.000 description 4
- UHRNIXJAGGLKHP-DLOVCJGASA-N Phe-Ala-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O UHRNIXJAGGLKHP-DLOVCJGASA-N 0.000 description 4
- DPUOLKQSMYLRDR-UBHSHLNASA-N Phe-Arg-Ala Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 DPUOLKQSMYLRDR-UBHSHLNASA-N 0.000 description 4
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 4
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 4
- GRVMHFCZUIYNKQ-UFYCRDLUSA-N Phe-Phe-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GRVMHFCZUIYNKQ-UFYCRDLUSA-N 0.000 description 4
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 4
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 4
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 4
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 4
- GDXZRWYXJSGWIV-GMOBBJLQSA-N Pro-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 GDXZRWYXJSGWIV-GMOBBJLQSA-N 0.000 description 4
- HQVPQXMCQKXARZ-FXQIFTODSA-N Pro-Cys-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O HQVPQXMCQKXARZ-FXQIFTODSA-N 0.000 description 4
- PULPZRAHVFBVTO-DCAQKATOSA-N Pro-Glu-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PULPZRAHVFBVTO-DCAQKATOSA-N 0.000 description 4
- QUBVFEANYYWBTM-VEVYYDQMSA-N Pro-Thr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUBVFEANYYWBTM-VEVYYDQMSA-N 0.000 description 4
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 4
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 4
- DWUIECHTAMYEFL-XVYDVKMFSA-N Ser-Ala-His Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DWUIECHTAMYEFL-XVYDVKMFSA-N 0.000 description 4
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 4
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 4
- QGMLKFGTGXWAHF-IHRRRGAJSA-N Ser-Arg-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QGMLKFGTGXWAHF-IHRRRGAJSA-N 0.000 description 4
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 4
- OBXVZEAMXFSGPU-FXQIFTODSA-N Ser-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)CN=C(N)N OBXVZEAMXFSGPU-FXQIFTODSA-N 0.000 description 4
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 4
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 4
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 4
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 4
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 4
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 4
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 4
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 4
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 4
- HAYADTTXNZFUDM-IHRRRGAJSA-N Ser-Tyr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HAYADTTXNZFUDM-IHRRRGAJSA-N 0.000 description 4
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 4
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 4
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 description 4
- 108090000141 Sialyltransferases Proteins 0.000 description 4
- 102000003838 Sialyltransferases Human genes 0.000 description 4
- WQDUMFSSJAZKTM-UHFFFAOYSA-N Sodium methoxide Chemical compound [Na+].[O-]C WQDUMFSSJAZKTM-UHFFFAOYSA-N 0.000 description 4
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 4
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 4
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 4
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 4
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 4
- UBDDORVPVLEECX-FJXKBIBVSA-N Thr-Gly-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UBDDORVPVLEECX-FJXKBIBVSA-N 0.000 description 4
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 4
- WFAUDCSNCWJJAA-KXNHARMFSA-N Thr-Lys-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(O)=O WFAUDCSNCWJJAA-KXNHARMFSA-N 0.000 description 4
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 4
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 4
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 4
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 4
- KULBQAVOXHQLIY-HSCHXYMDSA-N Trp-Ile-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 KULBQAVOXHQLIY-HSCHXYMDSA-N 0.000 description 4
- ZZDFLJFVSNQINX-HWHUXHBOSA-N Trp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)O ZZDFLJFVSNQINX-HWHUXHBOSA-N 0.000 description 4
- DXYWRYQRKPIGGU-BPNCWPANSA-N Tyr-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DXYWRYQRKPIGGU-BPNCWPANSA-N 0.000 description 4
- KDGFPPHLXCEQRN-STECZYCISA-N Tyr-Arg-Ile Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDGFPPHLXCEQRN-STECZYCISA-N 0.000 description 4
- HVHJYXDXRIWELT-RYUDHWBXSA-N Tyr-Glu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O HVHJYXDXRIWELT-RYUDHWBXSA-N 0.000 description 4
- JKUZFODWJGEQAP-KBPBESRZSA-N Tyr-Gly-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O JKUZFODWJGEQAP-KBPBESRZSA-N 0.000 description 4
- NMKJPMCEKQHRPD-IRXDYDNUSA-N Tyr-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NMKJPMCEKQHRPD-IRXDYDNUSA-N 0.000 description 4
- VTCKHZJKWQENKX-KBPBESRZSA-N Tyr-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O VTCKHZJKWQENKX-KBPBESRZSA-N 0.000 description 4
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 4
- HRHYJNLMIJWGLF-BZSNNMDCSA-N Tyr-Ser-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 HRHYJNLMIJWGLF-BZSNNMDCSA-N 0.000 description 4
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 4
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 4
- LTFLDDDGWOVIHY-NAKRPEOUSA-N Val-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N LTFLDDDGWOVIHY-NAKRPEOUSA-N 0.000 description 4
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 4
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 4
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 4
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 4
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 4
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 4
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 4
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 4
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 4
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 4
- USXYVSTVPHELAF-RCWTZXSCSA-N Val-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N)O USXYVSTVPHELAF-RCWTZXSCSA-N 0.000 description 4
- QTXGUIMEHKCPBH-FHWLQOOXSA-N Val-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 QTXGUIMEHKCPBH-FHWLQOOXSA-N 0.000 description 4
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 4
- 150000007513 acids Chemical class 0.000 description 4
- 108010013835 arginine glutamate Proteins 0.000 description 4
- 108010068380 arginylarginine Proteins 0.000 description 4
- 108010060035 arginylproline Proteins 0.000 description 4
- 238000010531 catalytic reduction reaction Methods 0.000 description 4
- 238000011161 development Methods 0.000 description 4
- 230000018109 developmental process Effects 0.000 description 4
- 230000002255 enzymatic effect Effects 0.000 description 4
- 108010049041 glutamylalanine Proteins 0.000 description 4
- 108010040030 histidinoalanine Proteins 0.000 description 4
- 230000003301 hydrolyzing effect Effects 0.000 description 4
- 239000000543 intermediate Substances 0.000 description 4
- 229930191176 lacto-N-biose Natural products 0.000 description 4
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 4
- 108010000761 leucylarginine Proteins 0.000 description 4
- 108010038320 lysylphenylalanine Proteins 0.000 description 4
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 4
- 244000005700 microbiome Species 0.000 description 4
- 235000013336 milk Nutrition 0.000 description 4
- 239000008267 milk Substances 0.000 description 4
- 210000004080 milk Anatomy 0.000 description 4
- 125000004923 naphthylmethyl group Chemical group C1(=CC=CC2=CC=CC=C12)C* 0.000 description 4
- 108010051242 phenylalanylserine Proteins 0.000 description 4
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 4
- 230000009450 sialylation Effects 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 4
- 229940086542 triethylamine Drugs 0.000 description 4
- 108010020532 tyrosyl-proline Proteins 0.000 description 4
- 108010071635 tyrosyl-prolyl-arginine Proteins 0.000 description 4
- 108010009962 valyltyrosine Proteins 0.000 description 4
- WOJJIRYPFAZEPF-YFKPBYRVSA-N 2-[[(2s)-2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]propanoyl]amino]acetate Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)CNC(=O)CN WOJJIRYPFAZEPF-YFKPBYRVSA-N 0.000 description 3
- IMIZPWSVYADSCN-UHFFFAOYSA-N 4-methyl-2-[[4-methyl-2-[[4-methyl-2-(pyrrolidine-2-carbonylamino)pentanoyl]amino]pentanoyl]amino]pentanoic acid Chemical compound CC(C)CC(C(O)=O)NC(=O)C(CC(C)C)NC(=O)C(CC(C)C)NC(=O)C1CCCN1 IMIZPWSVYADSCN-UHFFFAOYSA-N 0.000 description 3
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 3
- SBGXWWCLHIOABR-UHFFFAOYSA-N Ala Ala Gly Ala Chemical compound CC(N)C(=O)NC(C)C(=O)NCC(=O)NC(C)C(O)=O SBGXWWCLHIOABR-UHFFFAOYSA-N 0.000 description 3
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 3
- GFBLJMHGHAXGNY-ZLUOBGJFSA-N Ala-Asn-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GFBLJMHGHAXGNY-ZLUOBGJFSA-N 0.000 description 3
- ZEXDYVGDZJBRMO-ACZMJKKPSA-N Ala-Asn-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZEXDYVGDZJBRMO-ACZMJKKPSA-N 0.000 description 3
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 3
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 3
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 3
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 3
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 3
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 3
- ZDYNWWQXFRUOEO-XDTLVQLUSA-N Ala-Gln-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZDYNWWQXFRUOEO-XDTLVQLUSA-N 0.000 description 3
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 3
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 3
- BTBUEVAGZCKULD-XPUUQOCRSA-N Ala-Gly-His Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CN=CN1 BTBUEVAGZCKULD-XPUUQOCRSA-N 0.000 description 3
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 3
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 3
- JEPNLGMEZMCFEX-QSFUFRPTSA-N Ala-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C)N JEPNLGMEZMCFEX-QSFUFRPTSA-N 0.000 description 3
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 3
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 3
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 3
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 3
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 3
- PIXQDIGKDNNOOV-GUBZILKMSA-N Ala-Lys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O PIXQDIGKDNNOOV-GUBZILKMSA-N 0.000 description 3
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 3
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 3
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 3
- BLTRAARCJYVJKV-QEJZJMRPSA-N Ala-Lys-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(O)=O BLTRAARCJYVJKV-QEJZJMRPSA-N 0.000 description 3
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 3
- PVQLRJRPUTXFFX-CIUDSAMLSA-N Ala-Met-Gln Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O PVQLRJRPUTXFFX-CIUDSAMLSA-N 0.000 description 3
- XSTZMVAYYCJTNR-DCAQKATOSA-N Ala-Met-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XSTZMVAYYCJTNR-DCAQKATOSA-N 0.000 description 3
- DRARURMRLANNLS-GUBZILKMSA-N Ala-Met-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O DRARURMRLANNLS-GUBZILKMSA-N 0.000 description 3
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 3
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 3
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 3
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 3
- ISCYZXFOCXWUJU-KZVJFYERSA-N Ala-Thr-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O ISCYZXFOCXWUJU-KZVJFYERSA-N 0.000 description 3
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 3
- XPBVBZPVNFIHOA-UVBJJODRSA-N Ala-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 XPBVBZPVNFIHOA-UVBJJODRSA-N 0.000 description 3
- ZXKNLCPUNZPFGY-LEWSCRJBSA-N Ala-Tyr-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N ZXKNLCPUNZPFGY-LEWSCRJBSA-N 0.000 description 3
- YEBZNKPPOHFZJM-BPNCWPANSA-N Ala-Tyr-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O YEBZNKPPOHFZJM-BPNCWPANSA-N 0.000 description 3
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 3
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 3
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 3
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 3
- XMGVWQWEWWULNS-BPUTZDHNSA-N Arg-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XMGVWQWEWWULNS-BPUTZDHNSA-N 0.000 description 3
- IZSMEUDYADKZTJ-KJEVXHAQSA-N Arg-Tyr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IZSMEUDYADKZTJ-KJEVXHAQSA-N 0.000 description 3
- SWLOHUMCUDRTCL-ZLUOBGJFSA-N Asn-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N SWLOHUMCUDRTCL-ZLUOBGJFSA-N 0.000 description 3
- IYVSIZAXNLOKFQ-BYULHYEWSA-N Asn-Asp-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IYVSIZAXNLOKFQ-BYULHYEWSA-N 0.000 description 3
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 3
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 3
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 3
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 3
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 3
- NTWOPSIUJBMNRI-KKUMJFAQSA-N Asn-Lys-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTWOPSIUJBMNRI-KKUMJFAQSA-N 0.000 description 3
- KSGAFDTYQPKUAP-GMOBBJLQSA-N Asn-Met-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KSGAFDTYQPKUAP-GMOBBJLQSA-N 0.000 description 3
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 3
- QYRMBFWDSFGSFC-OLHMAJIHSA-N Asn-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QYRMBFWDSFGSFC-OLHMAJIHSA-N 0.000 description 3
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 3
- XJQRWGXKUSDEFI-ACZMJKKPSA-N Asp-Glu-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XJQRWGXKUSDEFI-ACZMJKKPSA-N 0.000 description 3
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 3
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 3
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 3
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 3
- PLNJUJGNLDSFOP-UWJYBYFXSA-N Asp-Tyr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PLNJUJGNLDSFOP-UWJYBYFXSA-N 0.000 description 3
- ALMIMUZAWTUNIO-BZSNNMDCSA-N Asp-Tyr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ALMIMUZAWTUNIO-BZSNNMDCSA-N 0.000 description 3
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 3
- 241000894006 Bacteria Species 0.000 description 3
- 241000933531 Bifidobacterium bifidum NCIMB 41171 Species 0.000 description 3
- 241001389732 Bifidobacterium bifidum PRL2010 Species 0.000 description 3
- 241000722293 Bifidobacterium bifidum S17 Species 0.000 description 3
- BNRHLRWCERLRTQ-BPUTZDHNSA-N Cys-Arg-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N BNRHLRWCERLRTQ-BPUTZDHNSA-N 0.000 description 3
- PQHYZJPCYRDYNE-QWRGUYRKSA-N Cys-Gly-Phe Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PQHYZJPCYRDYNE-QWRGUYRKSA-N 0.000 description 3
- FANFRJOFTYCNRG-JYBASQMISA-N Cys-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CS)N)O FANFRJOFTYCNRG-JYBASQMISA-N 0.000 description 3
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 3
- PBYFVIQRFLNQCO-GUBZILKMSA-N Gln-Pro-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O PBYFVIQRFLNQCO-GUBZILKMSA-N 0.000 description 3
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 description 3
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 3
- VCUNGPMMPNJSGS-JYJNAYRXSA-N Gln-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VCUNGPMMPNJSGS-JYJNAYRXSA-N 0.000 description 3
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 3
- WOMUDRVDJMHTCV-DCAQKATOSA-N Glu-Arg-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOMUDRVDJMHTCV-DCAQKATOSA-N 0.000 description 3
- OJGLIOXAKGFFDW-SRVKXCTJSA-N Glu-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N OJGLIOXAKGFFDW-SRVKXCTJSA-N 0.000 description 3
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 3
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 3
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 3
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 3
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 3
- BFEZQZKEPRKKHV-SRVKXCTJSA-N Glu-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O BFEZQZKEPRKKHV-SRVKXCTJSA-N 0.000 description 3
- LPHGXOWFAXFCPX-KKUMJFAQSA-N Glu-Pro-Phe Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O LPHGXOWFAXFCPX-KKUMJFAQSA-N 0.000 description 3
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 3
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 3
- UUTGYDAKPISJAO-JYJNAYRXSA-N Glu-Tyr-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 UUTGYDAKPISJAO-JYJNAYRXSA-N 0.000 description 3
- LERGJIVJIIODPZ-ZANVPECISA-N Gly-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)CN)C)C(O)=O)=CNC2=C1 LERGJIVJIIODPZ-ZANVPECISA-N 0.000 description 3
- AIJAPFVDBFYNKN-WHFBIAKZSA-N Gly-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN)C(=O)N AIJAPFVDBFYNKN-WHFBIAKZSA-N 0.000 description 3
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 3
- IWAXHBCACVWNHT-BQBZGAKWSA-N Gly-Asp-Arg Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IWAXHBCACVWNHT-BQBZGAKWSA-N 0.000 description 3
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 3
- RPLLQZBOVIVGMX-QWRGUYRKSA-N Gly-Asp-Phe Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RPLLQZBOVIVGMX-QWRGUYRKSA-N 0.000 description 3
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 3
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 3
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 3
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 3
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 3
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 3
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 3
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 3
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 3
- VIIBEIQMLJEUJG-LAEOZQHASA-N Gly-Ile-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O VIIBEIQMLJEUJG-LAEOZQHASA-N 0.000 description 3
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 3
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 3
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 3
- CVFOYJJOZYYEPE-KBPBESRZSA-N Gly-Lys-Tyr Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CVFOYJJOZYYEPE-KBPBESRZSA-N 0.000 description 3
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 3
- ICUTTWWCDIIIEE-BQBZGAKWSA-N Gly-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN ICUTTWWCDIIIEE-BQBZGAKWSA-N 0.000 description 3
- IBYOLNARKHMLBG-WHOFXGATSA-N Gly-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IBYOLNARKHMLBG-WHOFXGATSA-N 0.000 description 3
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 3
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 3
- FKESCSGWBPUTPN-FOHZUACHSA-N Gly-Thr-Asn Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O FKESCSGWBPUTPN-FOHZUACHSA-N 0.000 description 3
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 3
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 3
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 3
- ONSARSFSJHTMFJ-STQMWFEESA-N Gly-Trp-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ONSARSFSJHTMFJ-STQMWFEESA-N 0.000 description 3
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 3
- OCRQUYDOYKCOQG-IRXDYDNUSA-N Gly-Tyr-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 OCRQUYDOYKCOQG-IRXDYDNUSA-N 0.000 description 3
- ORERHHPZDDEMSC-VGDYDELISA-N His-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ORERHHPZDDEMSC-VGDYDELISA-N 0.000 description 3
- VFBZWZXKCVBTJR-SRVKXCTJSA-N His-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VFBZWZXKCVBTJR-SRVKXCTJSA-N 0.000 description 3
- AIPUZFXMXAHZKY-QWRGUYRKSA-N His-Leu-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AIPUZFXMXAHZKY-QWRGUYRKSA-N 0.000 description 3
- BXOLYFJYQQRQDJ-MXAVVETBSA-N His-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CN=CN1)N BXOLYFJYQQRQDJ-MXAVVETBSA-N 0.000 description 3
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 3
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 3
- QYOGJYIRKACXEP-SLBDDTMCSA-N Ile-Asn-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N QYOGJYIRKACXEP-SLBDDTMCSA-N 0.000 description 3
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 3
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 3
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 3
- LNJLOZYNZFGJMM-DEQVHRJGSA-N Ile-His-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N LNJLOZYNZFGJMM-DEQVHRJGSA-N 0.000 description 3
- AKOYRLRUFBZOSP-BJDJZHNGSA-N Ile-Lys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N AKOYRLRUFBZOSP-BJDJZHNGSA-N 0.000 description 3
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 3
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 3
- WRDTXMBPHMBGIB-STECZYCISA-N Ile-Tyr-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 WRDTXMBPHMBGIB-STECZYCISA-N 0.000 description 3
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 3
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 3
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 3
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 3
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 3
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 3
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 3
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 3
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 3
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 3
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 3
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 3
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 3
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 3
- UCXQIIIFOOGYEM-ULQDDVLXSA-N Leu-Pro-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCXQIIIFOOGYEM-ULQDDVLXSA-N 0.000 description 3
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 3
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 3
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 3
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 3
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 description 3
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 3
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 3
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 3
- JGAMUXDWYSXYLM-SRVKXCTJSA-N Lys-Arg-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGAMUXDWYSXYLM-SRVKXCTJSA-N 0.000 description 3
- YVMQJGWLHRWMDF-MNXVOIDGSA-N Lys-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N YVMQJGWLHRWMDF-MNXVOIDGSA-N 0.000 description 3
- QFGVDCBPDGLVTA-SZMVWBNQSA-N Lys-Gln-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 QFGVDCBPDGLVTA-SZMVWBNQSA-N 0.000 description 3
- UETQMSASAVBGJY-QWRGUYRKSA-N Lys-Gly-His Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 UETQMSASAVBGJY-QWRGUYRKSA-N 0.000 description 3
- MXMDJEJWERYPMO-XUXIUFHCSA-N Lys-Ile-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MXMDJEJWERYPMO-XUXIUFHCSA-N 0.000 description 3
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 3
- XREQQOATSMMAJP-MGHWNKPDSA-N Lys-Ile-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XREQQOATSMMAJP-MGHWNKPDSA-N 0.000 description 3
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 3
- WQDKIVRHTQYJSN-DCAQKATOSA-N Lys-Ser-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WQDKIVRHTQYJSN-DCAQKATOSA-N 0.000 description 3
- UIJVKVHLCQSPOJ-XIRDDKMYSA-N Lys-Ser-Trp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O UIJVKVHLCQSPOJ-XIRDDKMYSA-N 0.000 description 3
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 3
- TVHCDSBMFQYPNA-RHYQMDGZSA-N Lys-Thr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TVHCDSBMFQYPNA-RHYQMDGZSA-N 0.000 description 3
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 3
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 3
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 3
- 241000124008 Mammalia Species 0.000 description 3
- MNNKPHGAPRUKMW-BPUTZDHNSA-N Met-Asp-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 MNNKPHGAPRUKMW-BPUTZDHNSA-N 0.000 description 3
- JUXONJROIXKHEV-GUBZILKMSA-N Met-Cys-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCNC(N)=N JUXONJROIXKHEV-GUBZILKMSA-N 0.000 description 3
- AWOMRHGUWFBDNU-ZPFDUUQYSA-N Met-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N AWOMRHGUWFBDNU-ZPFDUUQYSA-N 0.000 description 3
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 3
- QZPXMHVKPHJNTR-DCAQKATOSA-N Met-Leu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O QZPXMHVKPHJNTR-DCAQKATOSA-N 0.000 description 3
- OXIWIYOJVNOKOV-SRVKXCTJSA-N Met-Met-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CCCNC(N)=N OXIWIYOJVNOKOV-SRVKXCTJSA-N 0.000 description 3
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 3
- 125000005118 N-alkylcarbamoyl group Chemical group 0.000 description 3
- 108010065395 Neuropep-1 Proteins 0.000 description 3
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 3
- MSHZERMPZKCODG-ACRUOGEOSA-N Phe-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MSHZERMPZKCODG-ACRUOGEOSA-N 0.000 description 3
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 3
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 3
- TXPUNZXZDVJUJQ-LPEHRKFASA-N Pro-Asn-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O TXPUNZXZDVJUJQ-LPEHRKFASA-N 0.000 description 3
- FUVBEZJCRMHWEM-FXQIFTODSA-N Pro-Asn-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FUVBEZJCRMHWEM-FXQIFTODSA-N 0.000 description 3
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 3
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 3
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 3
- LNOWDSPAYBWJOR-PEDHHIEDSA-N Pro-Ile-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LNOWDSPAYBWJOR-PEDHHIEDSA-N 0.000 description 3
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 3
- MRYUJHGPZQNOAD-IHRRRGAJSA-N Pro-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 MRYUJHGPZQNOAD-IHRRRGAJSA-N 0.000 description 3
- FYKUEXMZYFIZKA-DCAQKATOSA-N Pro-Pro-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FYKUEXMZYFIZKA-DCAQKATOSA-N 0.000 description 3
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 3
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 3
- RWRDLPDLKQPQOW-UHFFFAOYSA-N Pyrrolidine Chemical compound C1CCNC1 RWRDLPDLKQPQOW-UHFFFAOYSA-N 0.000 description 3
- 108010003201 RGH 0205 Proteins 0.000 description 3
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 3
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 3
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 3
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 3
- BLPYXIXXCFVIIF-FXQIFTODSA-N Ser-Cys-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N)CN=C(N)N BLPYXIXXCFVIIF-FXQIFTODSA-N 0.000 description 3
- BQWCDDAISCPDQV-XHNCKOQMSA-N Ser-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N)C(=O)O BQWCDDAISCPDQV-XHNCKOQMSA-N 0.000 description 3
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 3
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 3
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 3
- IXCHOHLPHNGFTJ-YUMQZZPRSA-N Ser-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N IXCHOHLPHNGFTJ-YUMQZZPRSA-N 0.000 description 3
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 3
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 3
- XERQKTRGJIKTRB-CIUDSAMLSA-N Ser-His-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CN=CN1 XERQKTRGJIKTRB-CIUDSAMLSA-N 0.000 description 3
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 3
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 3
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 3
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 3
- GSCVDSBEYVGMJQ-SRVKXCTJSA-N Ser-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)O GSCVDSBEYVGMJQ-SRVKXCTJSA-N 0.000 description 3
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 3
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 3
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 3
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 3
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 3
- DKDHTRVDOUZZTP-IFFSRLJSSA-N Thr-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DKDHTRVDOUZZTP-IFFSRLJSSA-N 0.000 description 3
- VYEHBMMAJFVTOI-JHEQGTHGSA-N Thr-Gly-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VYEHBMMAJFVTOI-JHEQGTHGSA-N 0.000 description 3
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 3
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 3
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 3
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 3
- FDALPRWYVKJCLL-PMVVWTBXSA-N Thr-His-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O FDALPRWYVKJCLL-PMVVWTBXSA-N 0.000 description 3
- WPAKPLPGQNUXGN-OSUNSFLBSA-N Thr-Ile-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WPAKPLPGQNUXGN-OSUNSFLBSA-N 0.000 description 3
- CRZNCABIJLRFKZ-IUKAMOBKSA-N Thr-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N CRZNCABIJLRFKZ-IUKAMOBKSA-N 0.000 description 3
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 3
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 3
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 3
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 3
- AAZOYLQUEQRUMZ-GSSVUCPTSA-N Thr-Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O AAZOYLQUEQRUMZ-GSSVUCPTSA-N 0.000 description 3
- MFMGPEKYBXFIRF-SUSMZKCASA-N Thr-Thr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFMGPEKYBXFIRF-SUSMZKCASA-N 0.000 description 3
- QJIODPFLAASXJC-JHYOHUSXSA-N Thr-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O QJIODPFLAASXJC-JHYOHUSXSA-N 0.000 description 3
- LVRFMARKDGGZMX-IZPVPAKOSA-N Thr-Tyr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=C(O)C=C1 LVRFMARKDGGZMX-IZPVPAKOSA-N 0.000 description 3
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 3
- HYVLNORXQGKONN-NUTKFTJISA-N Trp-Ala-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 HYVLNORXQGKONN-NUTKFTJISA-N 0.000 description 3
- HXNVJPQADLRHGR-JBACZVJFSA-N Trp-Glu-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N HXNVJPQADLRHGR-JBACZVJFSA-N 0.000 description 3
- NLLARHRWSFNEMH-NUTKFTJISA-N Trp-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NLLARHRWSFNEMH-NUTKFTJISA-N 0.000 description 3
- LFMLXCJYCFZBKE-IHPCNDPISA-N Trp-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N LFMLXCJYCFZBKE-IHPCNDPISA-N 0.000 description 3
- SUEGAFMNTXXNLR-WFBYXXMGSA-N Trp-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O SUEGAFMNTXXNLR-WFBYXXMGSA-N 0.000 description 3
- BABINGWMZBWXIX-BPUTZDHNSA-N Trp-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N BABINGWMZBWXIX-BPUTZDHNSA-N 0.000 description 3
- SGFIXFAHVWJKTD-KJEVXHAQSA-N Tyr-Arg-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SGFIXFAHVWJKTD-KJEVXHAQSA-N 0.000 description 3
- QAYSODICXVZUIA-WLTAIBSBSA-N Tyr-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QAYSODICXVZUIA-WLTAIBSBSA-N 0.000 description 3
- GITNQBVCEQBDQC-KKUMJFAQSA-N Tyr-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O GITNQBVCEQBDQC-KKUMJFAQSA-N 0.000 description 3
- QFXVAFIHVWXXBJ-AVGNSLFASA-N Tyr-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O QFXVAFIHVWXXBJ-AVGNSLFASA-N 0.000 description 3
- XUIOBCQESNDTDE-FQPOAREZSA-N Tyr-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XUIOBCQESNDTDE-FQPOAREZSA-N 0.000 description 3
- UUBKSZNKJUJQEJ-JRQIVUDYSA-N Tyr-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UUBKSZNKJUJQEJ-JRQIVUDYSA-N 0.000 description 3
- ZZDYJFVIKVSUFA-WLTAIBSBSA-N Tyr-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ZZDYJFVIKVSUFA-WLTAIBSBSA-N 0.000 description 3
- JHDZONWZTCKTJR-KJEVXHAQSA-N Tyr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JHDZONWZTCKTJR-KJEVXHAQSA-N 0.000 description 3
- DJIJBQYBDKGDIS-JYJNAYRXSA-N Tyr-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O DJIJBQYBDKGDIS-JYJNAYRXSA-N 0.000 description 3
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 3
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 3
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 3
- VUTHNLMCXKLLFI-LAEOZQHASA-N Val-Asp-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VUTHNLMCXKLLFI-LAEOZQHASA-N 0.000 description 3
- ZEVNVXYRZRIRCH-GVXVVHGQSA-N Val-Gln-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N ZEVNVXYRZRIRCH-GVXVVHGQSA-N 0.000 description 3
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 3
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 3
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 3
- YTUABZMPYKCWCQ-XQQFMLRXSA-N Val-His-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N YTUABZMPYKCWCQ-XQQFMLRXSA-N 0.000 description 3
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 3
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 3
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 3
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 3
- SVFRYKBZHUGKLP-QXEWZRGKSA-N Val-Met-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVFRYKBZHUGKLP-QXEWZRGKSA-N 0.000 description 3
- VENKIVFKIPGEJN-NHCYSSNCSA-N Val-Met-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VENKIVFKIPGEJN-NHCYSSNCSA-N 0.000 description 3
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 3
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 3
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 3
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 3
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 3
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 3
- PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 description 3
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 3
- XNLUVJPMPAZHCY-JYJNAYRXSA-N Val-Val-Phe Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 XNLUVJPMPAZHCY-JYJNAYRXSA-N 0.000 description 3
- 125000002252 acyl group Chemical group 0.000 description 3
- 108010028939 alanyl-alanyl-lysyl-alanine Proteins 0.000 description 3
- 229910052783 alkali metal Inorganic materials 0.000 description 3
- 150000001340 alkali metals Chemical class 0.000 description 3
- 125000003282 alkyl amino group Chemical group 0.000 description 3
- 125000000129 anionic group Chemical group 0.000 description 3
- 239000007864 aqueous solution Substances 0.000 description 3
- 108010068265 aspartyltyrosine Proteins 0.000 description 3
- 125000000852 azido group Chemical group *N=[N+]=[N-] 0.000 description 3
- AGEZXYOZHKGVCM-UHFFFAOYSA-N benzyl bromide Chemical class BrCC1=CC=CC=C1 AGEZXYOZHKGVCM-UHFFFAOYSA-N 0.000 description 3
- 125000003917 carbamoyl group Chemical group [H]N([H])C(*)=O 0.000 description 3
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 3
- 150000001768 cations Chemical class 0.000 description 3
- 238000001816 cooling Methods 0.000 description 3
- 125000004663 dialkyl amino group Chemical group 0.000 description 3
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 3
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 3
- 108010033719 glycyl-histidyl-glycine Proteins 0.000 description 3
- 108010010147 glycylglutamine Proteins 0.000 description 3
- 235000020256 human milk Nutrition 0.000 description 3
- 210000004251 human milk Anatomy 0.000 description 3
- 108010073093 leucyl-glycyl-glycyl-glycine Proteins 0.000 description 3
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 3
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 3
- VLKZOEOYAKHREP-UHFFFAOYSA-N n-Hexane Chemical compound CCCCCC VLKZOEOYAKHREP-UHFFFAOYSA-N 0.000 description 3
- 230000007935 neutral effect Effects 0.000 description 3
- 125000001736 nosyl group Chemical group S(=O)(=O)(C1=CC=C([N+](=O)[O-])C=C1)* 0.000 description 3
- 239000012074 organic phase Substances 0.000 description 3
- 239000012071 phase Substances 0.000 description 3
- 108010018625 phenylalanylarginine Proteins 0.000 description 3
- 108010004914 prolylarginine Proteins 0.000 description 3
- 229920005989 resin Polymers 0.000 description 3
- 239000011347 resin Substances 0.000 description 3
- 238000004366 reverse phase liquid chromatography Methods 0.000 description 3
- 239000011734 sodium Substances 0.000 description 3
- 229910000104 sodium hydride Inorganic materials 0.000 description 3
- 125000001424 substituent group Chemical group 0.000 description 3
- WYURNTSHIVDZCO-UHFFFAOYSA-N tetrahydrofuran Substances C1CCOC1 WYURNTSHIVDZCO-UHFFFAOYSA-N 0.000 description 3
- 238000012546 transfer Methods 0.000 description 3
- 108010015666 tryptophyl-leucyl-glutamic acid Proteins 0.000 description 3
- WQZGKKKJIJFFOK-SVZMEOIVSA-N (+)-Galactose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-SVZMEOIVSA-N 0.000 description 2
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 2
- HEVMDQBCAHEHDY-UHFFFAOYSA-N (Dimethoxymethyl)benzene Chemical compound COC(OC)C1=CC=CC=C1 HEVMDQBCAHEHDY-UHFFFAOYSA-N 0.000 description 2
- HZAXFHJVJLSVMW-UHFFFAOYSA-N 2-Aminoethan-1-ol Chemical compound NCCO HZAXFHJVJLSVMW-UHFFFAOYSA-N 0.000 description 2
- QIMMUPPBPVKWKM-UHFFFAOYSA-N 2-methylnaphthalene Chemical compound C1=CC=CC2=CC(C)=CC=C21 QIMMUPPBPVKWKM-UHFFFAOYSA-N 0.000 description 2
- 125000000242 4-chlorobenzoyl group Chemical group ClC1=CC=C(C(=O)*)C=C1 0.000 description 2
- XEXJJJRVTFGWIC-FXQIFTODSA-N Ala-Asn-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XEXJJJRVTFGWIC-FXQIFTODSA-N 0.000 description 2
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 2
- MIPWEZAIMPYQST-FXQIFTODSA-N Ala-Cys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O MIPWEZAIMPYQST-FXQIFTODSA-N 0.000 description 2
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 2
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 2
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 2
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 2
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 2
- XEPSCVXTCUUHDT-AVGNSLFASA-N Arg-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N XEPSCVXTCUUHDT-AVGNSLFASA-N 0.000 description 2
- PVSNBTCXCQIXSE-JYJNAYRXSA-N Arg-Arg-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PVSNBTCXCQIXSE-JYJNAYRXSA-N 0.000 description 2
- ITVINTQUZMQWJR-QXEWZRGKSA-N Arg-Asn-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ITVINTQUZMQWJR-QXEWZRGKSA-N 0.000 description 2
- XVLLUZMFSAYKJV-GUBZILKMSA-N Arg-Asp-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XVLLUZMFSAYKJV-GUBZILKMSA-N 0.000 description 2
- GIVWETPOBCRTND-DCAQKATOSA-N Arg-Gln-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GIVWETPOBCRTND-DCAQKATOSA-N 0.000 description 2
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 2
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 2
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 2
- ZPWMEWYQBWSGAO-ZJDVBMNYSA-N Arg-Thr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZPWMEWYQBWSGAO-ZJDVBMNYSA-N 0.000 description 2
- XRLOBFSLPCHYLQ-ULQDDVLXSA-N Arg-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O XRLOBFSLPCHYLQ-ULQDDVLXSA-N 0.000 description 2
- XKRFYHLGVUSROY-UHFFFAOYSA-N Argon Chemical compound [Ar] XKRFYHLGVUSROY-UHFFFAOYSA-N 0.000 description 2
- NXVGBGZQQFDUTM-XVYDVKMFSA-N Asn-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N NXVGBGZQQFDUTM-XVYDVKMFSA-N 0.000 description 2
- JQSWHKKUZMTOIH-QWRGUYRKSA-N Asn-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N JQSWHKKUZMTOIH-QWRGUYRKSA-N 0.000 description 2
- XVBDDUPJVQXDSI-PEFMBERDSA-N Asn-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVBDDUPJVQXDSI-PEFMBERDSA-N 0.000 description 2
- WSNSZZGIMVHDHF-TUUVXOQKSA-N Asn-Trp-Asp-Ser Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(N)=O)N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 WSNSZZGIMVHDHF-TUUVXOQKSA-N 0.000 description 2
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 2
- UWMIZBCTVWVMFI-FXQIFTODSA-N Asp-Ala-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UWMIZBCTVWVMFI-FXQIFTODSA-N 0.000 description 2
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 2
- KGAJCJXBEWLQDZ-UBHSHLNASA-N Asp-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N KGAJCJXBEWLQDZ-UBHSHLNASA-N 0.000 description 2
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 2
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 2
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 2
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 2
- AKKUDRZKFZWPBH-SRVKXCTJSA-N Asp-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N AKKUDRZKFZWPBH-SRVKXCTJSA-N 0.000 description 2
- NZWDWXSWUQCNMG-GARJFASQSA-N Asp-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)C(=O)O NZWDWXSWUQCNMG-GARJFASQSA-N 0.000 description 2
- LKVKODXGSAFOFY-VEVYYDQMSA-N Asp-Met-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LKVKODXGSAFOFY-VEVYYDQMSA-N 0.000 description 2
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 2
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 2
- YUELDQUPTAYEGM-XIRDDKMYSA-N Asp-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)O)N YUELDQUPTAYEGM-XIRDDKMYSA-N 0.000 description 2
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 2
- 241000711404 Avian avulavirus 1 Species 0.000 description 2
- 241000933529 Bifidobacterium bifidum JCM 1254 Species 0.000 description 2
- 241001608472 Bifidobacterium longum Species 0.000 description 2
- 241000193468 Clostridium perfringens Species 0.000 description 2
- 241000186227 Corynebacterium diphtheriae Species 0.000 description 2
- YQEHNIKPAOPBNH-DCAQKATOSA-N Cys-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N YQEHNIKPAOPBNH-DCAQKATOSA-N 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- 238000004252 FT/ICR mass spectrometry Methods 0.000 description 2
- OVQXQLWWJSNYFV-XEGUGMAKSA-N Gln-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCC(N)=O)C)C(O)=O)=CNC2=C1 OVQXQLWWJSNYFV-XEGUGMAKSA-N 0.000 description 2
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 2
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 2
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 2
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 2
- SRZLHYPAOXBBSB-HJGDQZAQSA-N Glu-Arg-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SRZLHYPAOXBBSB-HJGDQZAQSA-N 0.000 description 2
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 2
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 2
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 2
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 2
- UDEPRBFQTWGLCW-CIUDSAMLSA-N Glu-Pro-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O UDEPRBFQTWGLCW-CIUDSAMLSA-N 0.000 description 2
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 2
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 2
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 2
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 2
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 2
- FIQQRCFQXGLOSZ-WDSKDSINSA-N Gly-Glu-Asp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FIQQRCFQXGLOSZ-WDSKDSINSA-N 0.000 description 2
- CUYLIWAAAYJKJH-RYUDHWBXSA-N Gly-Glu-Tyr Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUYLIWAAAYJKJH-RYUDHWBXSA-N 0.000 description 2
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 2
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 2
- HHSOPSCKAZKQHQ-PEXQALLHSA-N Gly-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN HHSOPSCKAZKQHQ-PEXQALLHSA-N 0.000 description 2
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 2
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 2
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 2
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 2
- YABRDIBSPZONIY-BQBZGAKWSA-N Gly-Ser-Met Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O YABRDIBSPZONIY-BQBZGAKWSA-N 0.000 description 2
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 2
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 2
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 2
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 2
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 2
- LSQHWKPPOFDHHZ-YUMQZZPRSA-N His-Asp-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N LSQHWKPPOFDHHZ-YUMQZZPRSA-N 0.000 description 2
- NQKRILCJYCASDV-QWRGUYRKSA-N His-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 NQKRILCJYCASDV-QWRGUYRKSA-N 0.000 description 2
- LVXFNTIIGOQBMD-SRVKXCTJSA-N His-Leu-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O LVXFNTIIGOQBMD-SRVKXCTJSA-N 0.000 description 2
- VXZZUXWAOMWWJH-QTKMDUPCSA-N His-Thr-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VXZZUXWAOMWWJH-QTKMDUPCSA-N 0.000 description 2
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 2
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 2
- NULSANWBUWLTKN-NAKRPEOUSA-N Ile-Arg-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N NULSANWBUWLTKN-NAKRPEOUSA-N 0.000 description 2
- LJKDGRWXYUTRSH-YVNDNENWSA-N Ile-Gln-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LJKDGRWXYUTRSH-YVNDNENWSA-N 0.000 description 2
- CSQNHSGHAPRGPQ-YTFOTSKYSA-N Ile-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)O)N CSQNHSGHAPRGPQ-YTFOTSKYSA-N 0.000 description 2
- UYNXBNHVWFNVIN-HJWJTTGWSA-N Ile-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 UYNXBNHVWFNVIN-HJWJTTGWSA-N 0.000 description 2
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 2
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 2
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 2
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 2
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 2
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 2
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 2
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 2
- ONPJGOIVICHWBW-BZSNNMDCSA-N Leu-Lys-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 ONPJGOIVICHWBW-BZSNNMDCSA-N 0.000 description 2
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 2
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 2
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 2
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 2
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 2
- IVFUVMSKSFSFBT-NHCYSSNCSA-N Lys-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN IVFUVMSKSFSFBT-NHCYSSNCSA-N 0.000 description 2
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 2
- OLWAOWXIADGIJG-AVGNSLFASA-N Met-Arg-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(O)=O OLWAOWXIADGIJG-AVGNSLFASA-N 0.000 description 2
- YAWKHFKCNSXYDS-XIRDDKMYSA-N Met-Glu-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N YAWKHFKCNSXYDS-XIRDDKMYSA-N 0.000 description 2
- JACAKCWAOHKQBV-UWVGGRQHSA-N Met-Gly-Lys Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN JACAKCWAOHKQBV-UWVGGRQHSA-N 0.000 description 2
- QYIGOFGUOVTAHK-ZJDVBMNYSA-N Met-Thr-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QYIGOFGUOVTAHK-ZJDVBMNYSA-N 0.000 description 2
- BZLVMXJERCGZMT-UHFFFAOYSA-N Methyl tert-butyl ether Chemical compound COC(C)(C)C BZLVMXJERCGZMT-UHFFFAOYSA-N 0.000 description 2
- JGFZNNIVVJXRND-UHFFFAOYSA-N N,N-Diisopropylethylamine (DIPEA) Chemical compound CCN(C(C)C)C(C)C JGFZNNIVVJXRND-UHFFFAOYSA-N 0.000 description 2
- LRHPLDYGYMQRHN-UHFFFAOYSA-N N-Butanol Chemical compound CCCCO LRHPLDYGYMQRHN-UHFFFAOYSA-N 0.000 description 2
- SECXISVLQFMRJM-UHFFFAOYSA-N N-Methylpyrrolidone Chemical compound CN1CCCC1=O SECXISVLQFMRJM-UHFFFAOYSA-N 0.000 description 2
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 2
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 2
- KDLHZDBZIXYQEI-UHFFFAOYSA-N Palladium Chemical compound [Pd] KDLHZDBZIXYQEI-UHFFFAOYSA-N 0.000 description 2
- FPTXMUIBLMGTQH-ONGXEEELSA-N Phe-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 FPTXMUIBLMGTQH-ONGXEEELSA-N 0.000 description 2
- KBVJZCVLQWCJQN-KKUMJFAQSA-N Phe-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KBVJZCVLQWCJQN-KKUMJFAQSA-N 0.000 description 2
- 108091000080 Phosphotransferase Proteins 0.000 description 2
- GLUUGHFHXGJENI-UHFFFAOYSA-N Piperazine Chemical compound C1CNCCN1 GLUUGHFHXGJENI-UHFFFAOYSA-N 0.000 description 2
- NQRYJNQNLNOLGT-UHFFFAOYSA-N Piperidine Chemical compound C1CCNCC1 NQRYJNQNLNOLGT-UHFFFAOYSA-N 0.000 description 2
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 2
- WWAQEUOYCYMGHB-FXQIFTODSA-N Pro-Asn-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 WWAQEUOYCYMGHB-FXQIFTODSA-N 0.000 description 2
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 2
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 2
- ULWBBFKQBDNGOY-RWMBFGLXSA-N Pro-Lys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N2CCC[C@@H]2C(=O)O ULWBBFKQBDNGOY-RWMBFGLXSA-N 0.000 description 2
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 2
- CHYAYDLYYIJCKY-OSUNSFLBSA-N Pro-Thr-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CHYAYDLYYIJCKY-OSUNSFLBSA-N 0.000 description 2
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 2
- JUJWROOIHBZHMG-UHFFFAOYSA-N Pyridine Chemical compound C1=CC=NC=C1 JUJWROOIHBZHMG-UHFFFAOYSA-N 0.000 description 2
- 241000293869 Salmonella enterica subsp. enterica serovar Typhimurium Species 0.000 description 2
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 2
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 2
- HBOABDXGTMMDSE-GUBZILKMSA-N Ser-Arg-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O HBOABDXGTMMDSE-GUBZILKMSA-N 0.000 description 2
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 2
- DBIDZNUXSLXVRG-FXQIFTODSA-N Ser-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N DBIDZNUXSLXVRG-FXQIFTODSA-N 0.000 description 2
- DLPXTCTVNDTYGJ-JBDRJPRFSA-N Ser-Ile-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(O)=O DLPXTCTVNDTYGJ-JBDRJPRFSA-N 0.000 description 2
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 2
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 2
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 2
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 2
- PCMZJFMUYWIERL-ZKWXMUAHSA-N Ser-Val-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMZJFMUYWIERL-ZKWXMUAHSA-N 0.000 description 2
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 2
- FQHUAUMYHAJTDH-GRCPKETISA-N Sialosonic acid Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)[C@@H](O)CC(=O)C(O)=O FQHUAUMYHAJTDH-GRCPKETISA-N 0.000 description 2
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 2
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 2
- YOSLMIPKOUAHKI-OLHMAJIHSA-N Thr-Asp-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YOSLMIPKOUAHKI-OLHMAJIHSA-N 0.000 description 2
- DCLBXIWHLVEPMQ-JRQIVUDYSA-N Thr-Asp-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DCLBXIWHLVEPMQ-JRQIVUDYSA-N 0.000 description 2
- GCXFWAZRHBRYEM-NUMRIWBASA-N Thr-Gln-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O GCXFWAZRHBRYEM-NUMRIWBASA-N 0.000 description 2
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 2
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 2
- WRQLCVIALDUQEQ-UNQGMJICSA-N Thr-Phe-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WRQLCVIALDUQEQ-UNQGMJICSA-N 0.000 description 2
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 2
- CSZFFQBUTMGHAH-UAXMHLISSA-N Thr-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O CSZFFQBUTMGHAH-UAXMHLISSA-N 0.000 description 2
- FNOQJVHFVLVMOS-AAEUAGOBSA-N Trp-Gly-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N FNOQJVHFVLVMOS-AAEUAGOBSA-N 0.000 description 2
- NIHNMOSRSAYZIT-BPNCWPANSA-N Tyr-Ala-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NIHNMOSRSAYZIT-BPNCWPANSA-N 0.000 description 2
- IMXAAEFAIBRCQF-SIUGBPQLSA-N Tyr-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N IMXAAEFAIBRCQF-SIUGBPQLSA-N 0.000 description 2
- UNUZEBFXGWVAOP-DZKIICNBSA-N Tyr-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UNUZEBFXGWVAOP-DZKIICNBSA-N 0.000 description 2
- LMKKMCGTDANZTR-BZSNNMDCSA-N Tyr-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LMKKMCGTDANZTR-BZSNNMDCSA-N 0.000 description 2
- BIVIUZRBCAUNPW-JRQIVUDYSA-N Tyr-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O BIVIUZRBCAUNPW-JRQIVUDYSA-N 0.000 description 2
- NWDOPHYLSORNEX-QXEWZRGKSA-N Val-Asn-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N NWDOPHYLSORNEX-QXEWZRGKSA-N 0.000 description 2
- COSLEEOIYRPTHD-YDHLFZDLSA-N Val-Asp-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 COSLEEOIYRPTHD-YDHLFZDLSA-N 0.000 description 2
- JXGWQYWDUOWQHA-DZKIICNBSA-N Val-Gln-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N JXGWQYWDUOWQHA-DZKIICNBSA-N 0.000 description 2
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 2
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 2
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 2
- MJFSRZZJQWZHFQ-SRVKXCTJSA-N Val-Met-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N MJFSRZZJQWZHFQ-SRVKXCTJSA-N 0.000 description 2
- ZXYPHBKIZLAQTL-QXEWZRGKSA-N Val-Pro-Asp Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZXYPHBKIZLAQTL-QXEWZRGKSA-N 0.000 description 2
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 2
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 2
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 2
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 2
- 241000607626 Vibrio cholerae Species 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- 125000002339 acetoacetyl group Chemical group O=C([*])C([H])([H])C(=O)C([H])([H])[H] 0.000 description 2
- 125000002777 acetyl group Chemical group [H]C([H])([H])C(*)=O 0.000 description 2
- 125000004442 acylamino group Chemical group 0.000 description 2
- 150000001298 alcohols Chemical class 0.000 description 2
- 229910052784 alkaline earth metal Inorganic materials 0.000 description 2
- 150000001342 alkaline earth metals Chemical class 0.000 description 2
- 125000004453 alkoxycarbonyl group Chemical group 0.000 description 2
- 239000002168 alkylating agent Substances 0.000 description 2
- 229940100198 alkylating agent Drugs 0.000 description 2
- 150000001413 amino acids Chemical group 0.000 description 2
- 108010038850 arginyl-isoleucyl-tyrosine Proteins 0.000 description 2
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 2
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 2
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- 244000052616 bacterial pathogen Species 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 229940009291 bifidobacterium longum Drugs 0.000 description 2
- 230000033228 biological regulation Effects 0.000 description 2
- 239000011575 calcium Substances 0.000 description 2
- 125000006297 carbonyl amino group Chemical group [H]N([*:2])C([*:1])=O 0.000 description 2
- 239000003054 catalyst Substances 0.000 description 2
- 230000003197 catalytic effect Effects 0.000 description 2
- 238000004517 catalytic hydrocracking Methods 0.000 description 2
- 150000001767 cationic compounds Chemical class 0.000 description 2
- 229920001429 chelating resin Polymers 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 238000004587 chromatography analysis Methods 0.000 description 2
- 239000013078 crystal Substances 0.000 description 2
- 238000002425 crystallisation Methods 0.000 description 2
- 230000008025 crystallization Effects 0.000 description 2
- 108010060199 cysteinylproline Proteins 0.000 description 2
- 238000010511 deprotection reaction Methods 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 150000002016 disaccharides Chemical group 0.000 description 2
- GUVUOGQBMYCBQP-UHFFFAOYSA-N dmpu Chemical compound CN1CCCN(C)C1=O GUVUOGQBMYCBQP-UHFFFAOYSA-N 0.000 description 2
- 239000000706 filtrate Substances 0.000 description 2
- 125000000350 glycoloyl group Chemical group O=C([*])C([H])([H])O[H] 0.000 description 2
- 125000003147 glycosyl group Chemical group 0.000 description 2
- 230000013595 glycosylation Effects 0.000 description 2
- 238000006206 glycosylation reaction Methods 0.000 description 2
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 2
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 2
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 2
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 2
- 125000001188 haloalkyl group Chemical group 0.000 description 2
- GNOIPBMMFNIUFM-UHFFFAOYSA-N hexamethylphosphoric triamide Chemical compound CN(C)P(=O)(N(C)C)N(C)C GNOIPBMMFNIUFM-UHFFFAOYSA-N 0.000 description 2
- 238000004128 high performance liquid chromatography Methods 0.000 description 2
- 108010085325 histidylproline Proteins 0.000 description 2
- 239000001257 hydrogen Substances 0.000 description 2
- 238000005984 hydrogenation reaction Methods 0.000 description 2
- 230000007062 hydrolysis Effects 0.000 description 2
- 238000006460 hydrolysis reaction Methods 0.000 description 2
- 230000002209 hydrophobic effect Effects 0.000 description 2
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 2
- 229910001411 inorganic cation Inorganic materials 0.000 description 2
- 230000000968 intestinal effect Effects 0.000 description 2
- 150000002597 lactoses Chemical class 0.000 description 2
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 2
- 108010009298 lysylglutamic acid Proteins 0.000 description 2
- 108700023046 methionyl-leucyl-phenylalanine Proteins 0.000 description 2
- 108010005942 methionylglycine Proteins 0.000 description 2
- 235000016709 nutrition Nutrition 0.000 description 2
- 150000002892 organic cations Chemical class 0.000 description 2
- 239000003960 organic solvent Substances 0.000 description 2
- 108010089198 phenylalanyl-prolyl-arginine Proteins 0.000 description 2
- 108010084572 phenylalanyl-valine Proteins 0.000 description 2
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 2
- 102000020233 phosphotransferase Human genes 0.000 description 2
- BWHMMNNQKKPAPP-UHFFFAOYSA-L potassium carbonate Chemical compound [K+].[K+].[O-]C([O-])=O BWHMMNNQKKPAPP-UHFFFAOYSA-L 0.000 description 2
- 239000000843 powder Substances 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- 108090000623 proteins and genes Proteins 0.000 description 2
- 239000003586 protic polar solvent Substances 0.000 description 2
- 230000006798 recombination Effects 0.000 description 2
- 238000005215 recombination Methods 0.000 description 2
- 238000001542 size-exclusion chromatography Methods 0.000 description 2
- MFRIHAYPQRLWNB-UHFFFAOYSA-N sodium tert-butoxide Chemical compound [Na+].CC(C)(C)[O-] MFRIHAYPQRLWNB-UHFFFAOYSA-N 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 239000000758 substrate Substances 0.000 description 2
- 239000006188 syrup Substances 0.000 description 2
- 235000020357 syrup Nutrition 0.000 description 2
- 150000004044 tetrasaccharides Chemical class 0.000 description 2
- 230000006098 transglycosylation Effects 0.000 description 2
- 238000005918 transglycosylation reaction Methods 0.000 description 2
- 230000007704 transition Effects 0.000 description 2
- 230000032258 transport Effects 0.000 description 2
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 2
- 125000002221 trityl group Chemical group [H]C1=C([H])C([H])=C([H])C([H])=C1C([*])(C1=C(C(=C(C(=C1[H])[H])[H])[H])[H])C1=C([H])C([H])=C([H])C([H])=C1[H] 0.000 description 2
- 108010029384 tryptophyl-histidine Proteins 0.000 description 2
- HDTRYLNUVZCQOY-UHFFFAOYSA-N α-D-glucopyranosyl-α-D-glucopyranoside Natural products OC1C(O)C(O)C(CO)OC1OC1C(O)C(O)C(O)C(CO)O1 HDTRYLNUVZCQOY-UHFFFAOYSA-N 0.000 description 1
- TYALNJQZQRNQNQ-SKHIEIRWSA-N (4S,5R,6R)-5-Acetamido-4-hydroxy-6-[(1R,2R)-1,2,3-trihydroxypropyl]-2-[[(2R,3R,4S,5R,6S)-3,4,5-trihydroxy-6-[(2R,3S,4R,5R)-4,5,6-trihydroxy-2-(hydroxymethyl)oxan-3-yl]oxyoxan-2-yl]methoxy]oxane-2-carboxylic acid Chemical compound C(C)(=O)N[C@@H]1[C@H](CC(C(O)=O)(O[C@H]1[C@H](O)[C@H](O)CO)OC[C@@H]1[C@@H]([C@@H]([C@H]([C@H](O[C@H]2[C@@H]([C@H](C(O)O[C@@H]2CO)O)O)O1)O)O)O)O TYALNJQZQRNQNQ-SKHIEIRWSA-N 0.000 description 1
- ZXSQEZNORDWBGZ-UHFFFAOYSA-N 1,3-dihydropyrrolo[2,3-b]pyridin-2-one Chemical compound C1=CN=C2NC(=O)CC2=C1 ZXSQEZNORDWBGZ-UHFFFAOYSA-N 0.000 description 1
- RYHBNJHYFVUHQT-UHFFFAOYSA-N 1,4-Dioxane Chemical compound C1COCCO1 RYHBNJHYFVUHQT-UHFFFAOYSA-N 0.000 description 1
- UHDGCWIWMRVCDJ-UHFFFAOYSA-N 1-beta-D-Xylofuranosyl-NH-Cytosine Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(CO)O1 UHDGCWIWMRVCDJ-UHFFFAOYSA-N 0.000 description 1
- RAXXELZNTBOGNW-UHFFFAOYSA-N 1H-imidazole Chemical compound C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 1
- UTQNKKSJPHTPBS-UHFFFAOYSA-N 2,2,2-trichloroethanone Chemical group ClC(Cl)(Cl)[C]=O UTQNKKSJPHTPBS-UHFFFAOYSA-N 0.000 description 1
- YQTCQNIPQMJNTI-UHFFFAOYSA-N 2,2-dimethylpropan-1-one Chemical group CC(C)(C)[C]=O YQTCQNIPQMJNTI-UHFFFAOYSA-N 0.000 description 1
- KZTWONRVIPPDKH-UHFFFAOYSA-N 2-(piperidin-1-yl)ethanol Chemical compound OCCN1CCCCC1 KZTWONRVIPPDKH-UHFFFAOYSA-N 0.000 description 1
- QIMMUPPBPVKWKM-RHRFEJLCSA-N 2-methylnaphthalene Chemical class C1=CC=[14CH]C2=CC(C)=CC=C21 QIMMUPPBPVKWKM-RHRFEJLCSA-N 0.000 description 1
- WSYFJVWSMYIIQS-UHFFFAOYSA-N 2-piperazin-1-ylethanol Chemical compound OCCN1CCNCC1.OCCN1CCNCC1 WSYFJVWSMYIIQS-UHFFFAOYSA-N 0.000 description 1
- CSDQQAQKBAQLLE-UHFFFAOYSA-N 4-(4-chlorophenyl)-4,5,6,7-tetrahydrothieno[3,2-c]pyridine Chemical compound C1=CC(Cl)=CC=C1C1C(C=CS2)=C2CCN1 CSDQQAQKBAQLLE-UHFFFAOYSA-N 0.000 description 1
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 1
- CERZMXAJYMMUDR-QBTAGHCHSA-N 5-amino-3,5-dideoxy-D-glycero-D-galacto-non-2-ulopyranosonic acid Chemical compound N[C@@H]1[C@@H](O)CC(O)(C(O)=O)O[C@H]1[C@H](O)[C@H](O)CO CERZMXAJYMMUDR-QBTAGHCHSA-N 0.000 description 1
- 241000186044 Actinomyces viscosus Species 0.000 description 1
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 1
- YWWATNIVMOCSAV-UBHSHLNASA-N Ala-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YWWATNIVMOCSAV-UBHSHLNASA-N 0.000 description 1
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 1
- JYEBJTDTPNKQJG-FXQIFTODSA-N Ala-Asn-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N JYEBJTDTPNKQJG-FXQIFTODSA-N 0.000 description 1
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 1
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 1
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 1
- NFDVJAKFMXHJEQ-HERUPUMHSA-N Ala-Asp-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N NFDVJAKFMXHJEQ-HERUPUMHSA-N 0.000 description 1
- YEELWQSXYBJVSV-UWJYBYFXSA-N Ala-Cys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YEELWQSXYBJVSV-UWJYBYFXSA-N 0.000 description 1
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 1
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 1
- CWEAKSWWKHGTRJ-BQBZGAKWSA-N Ala-Gly-Met Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O CWEAKSWWKHGTRJ-BQBZGAKWSA-N 0.000 description 1
- QHASENCZLDHBGX-ONGXEEELSA-N Ala-Gly-Phe Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QHASENCZLDHBGX-ONGXEEELSA-N 0.000 description 1
- NIZKGBJVCMRDKO-KWQFWETISA-N Ala-Gly-Tyr Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NIZKGBJVCMRDKO-KWQFWETISA-N 0.000 description 1
- XZWXFWBHYRFLEF-FSPLSTOPSA-N Ala-His Chemical compound C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 XZWXFWBHYRFLEF-FSPLSTOPSA-N 0.000 description 1
- AAXVGJXZKHQQHD-LSJOCFKGSA-N Ala-His-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCSC)C(=O)O)N AAXVGJXZKHQQHD-LSJOCFKGSA-N 0.000 description 1
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 1
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 1
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 1
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 1
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- FCXAUASCMJOFEY-NDKCEZKHSA-N Ala-Leu-Thr-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O FCXAUASCMJOFEY-NDKCEZKHSA-N 0.000 description 1
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 1
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 1
- KQESEZXHYOUIIM-CQDKDKBSSA-N Ala-Lys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KQESEZXHYOUIIM-CQDKDKBSSA-N 0.000 description 1
- MAEQBGQTDWDSJQ-LSJOCFKGSA-N Ala-Met-His Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N MAEQBGQTDWDSJQ-LSJOCFKGSA-N 0.000 description 1
- AWNAEZICPNGAJK-FXQIFTODSA-N Ala-Met-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O AWNAEZICPNGAJK-FXQIFTODSA-N 0.000 description 1
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 1
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 1
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- UCDOXFBTMLKASE-HERUPUMHSA-N Ala-Ser-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N UCDOXFBTMLKASE-HERUPUMHSA-N 0.000 description 1
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 1
- UBTKNYUAMYRMKE-GOPGUHFVSA-N Ala-Trp-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N UBTKNYUAMYRMKE-GOPGUHFVSA-N 0.000 description 1
- WZGZDOXCDLLTHE-SYWGBEHUSA-N Ala-Trp-Ile Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 WZGZDOXCDLLTHE-SYWGBEHUSA-N 0.000 description 1
- MTDDMSUUXNQMKK-BPNCWPANSA-N Ala-Tyr-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N MTDDMSUUXNQMKK-BPNCWPANSA-N 0.000 description 1
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 1
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 1
- DHONNEYAZPNGSG-UBHSHLNASA-N Ala-Val-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DHONNEYAZPNGSG-UBHSHLNASA-N 0.000 description 1
- DFCIPNHFKOQAME-FXQIFTODSA-N Arg-Ala-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFCIPNHFKOQAME-FXQIFTODSA-N 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 1
- CPSHGRGUPZBMOK-CIUDSAMLSA-N Arg-Asn-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CPSHGRGUPZBMOK-CIUDSAMLSA-N 0.000 description 1
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 1
- BEXGZLUHRXTZCC-CIUDSAMLSA-N Arg-Gln-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N BEXGZLUHRXTZCC-CIUDSAMLSA-N 0.000 description 1
- OCDJOVKIUJVUMO-SRVKXCTJSA-N Arg-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N OCDJOVKIUJVUMO-SRVKXCTJSA-N 0.000 description 1
- YKBHOXLMMPZPHQ-GMOBBJLQSA-N Arg-Ile-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O YKBHOXLMMPZPHQ-GMOBBJLQSA-N 0.000 description 1
- UAOSDDXCTBIPCA-QXEWZRGKSA-N Arg-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UAOSDDXCTBIPCA-QXEWZRGKSA-N 0.000 description 1
- FFEUXEAKYRCACT-PEDHHIEDSA-N Arg-Ile-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)CC)C(O)=O FFEUXEAKYRCACT-PEDHHIEDSA-N 0.000 description 1
- FNXCAFKDGBROCU-STECZYCISA-N Arg-Ile-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FNXCAFKDGBROCU-STECZYCISA-N 0.000 description 1
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 1
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 1
- DTBPLQNKYCYUOM-JYJNAYRXSA-N Arg-Met-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DTBPLQNKYCYUOM-JYJNAYRXSA-N 0.000 description 1
- CZUHPNLXLWMYMG-UBHSHLNASA-N Arg-Phe-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 CZUHPNLXLWMYMG-UBHSHLNASA-N 0.000 description 1
- PRLPSDIHSRITSF-UNQGMJICSA-N Arg-Phe-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PRLPSDIHSRITSF-UNQGMJICSA-N 0.000 description 1
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 1
- HGKHPCFTRQDHCU-IUCAKERBSA-N Arg-Pro-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HGKHPCFTRQDHCU-IUCAKERBSA-N 0.000 description 1
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 1
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 1
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 1
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 1
- INOIAEUXVVNJKA-XGEHTFHBSA-N Arg-Thr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O INOIAEUXVVNJKA-XGEHTFHBSA-N 0.000 description 1
- WTFIFQWLQXZLIZ-UMPQAUOISA-N Arg-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O WTFIFQWLQXZLIZ-UMPQAUOISA-N 0.000 description 1
- YHZQOSXDTFRZKU-WDSOQIARSA-N Arg-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N)=CNC2=C1 YHZQOSXDTFRZKU-WDSOQIARSA-N 0.000 description 1
- PJOPLXOCKACMLK-KKUMJFAQSA-N Arg-Tyr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O PJOPLXOCKACMLK-KKUMJFAQSA-N 0.000 description 1
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 1
- SUMJNGAMIQSNGX-TUAOUCFPSA-N Arg-Val-Pro Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N1CCC[C@@H]1C(O)=O SUMJNGAMIQSNGX-TUAOUCFPSA-N 0.000 description 1
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 1
- UTSMXMABBPFVJP-SZMVWBNQSA-N Arg-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UTSMXMABBPFVJP-SZMVWBNQSA-N 0.000 description 1
- 241000186063 Arthrobacter Species 0.000 description 1
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 1
- NUHQMYUWLUSRJX-BIIVOSGPSA-N Asn-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N NUHQMYUWLUSRJX-BIIVOSGPSA-N 0.000 description 1
- CIBWFJFMOBIFTE-CIUDSAMLSA-N Asn-Arg-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N CIBWFJFMOBIFTE-CIUDSAMLSA-N 0.000 description 1
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 1
- IOTKDTZEEBZNCM-UGYAYLCHSA-N Asn-Asn-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOTKDTZEEBZNCM-UGYAYLCHSA-N 0.000 description 1
- KXEGPPNPXOKKHK-ZLUOBGJFSA-N Asn-Asp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KXEGPPNPXOKKHK-ZLUOBGJFSA-N 0.000 description 1
- XVVOVPFMILMHPX-ZLUOBGJFSA-N Asn-Asp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XVVOVPFMILMHPX-ZLUOBGJFSA-N 0.000 description 1
- XWFPGQVLOVGSLU-CIUDSAMLSA-N Asn-Gln-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XWFPGQVLOVGSLU-CIUDSAMLSA-N 0.000 description 1
- QPTAGIPWARILES-AVGNSLFASA-N Asn-Gln-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QPTAGIPWARILES-AVGNSLFASA-N 0.000 description 1
- FUHFYEKSGWOWGZ-XHNCKOQMSA-N Asn-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O FUHFYEKSGWOWGZ-XHNCKOQMSA-N 0.000 description 1
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 1
- DDPXDCKYWDGZAL-BQBZGAKWSA-N Asn-Gly-Arg Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N DDPXDCKYWDGZAL-BQBZGAKWSA-N 0.000 description 1
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 1
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 1
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- OWUCNXMFJRFOFI-BQBZGAKWSA-N Asn-Gly-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O OWUCNXMFJRFOFI-BQBZGAKWSA-N 0.000 description 1
- RAKKBBHMTJSXOY-XVYDVKMFSA-N Asn-His-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O RAKKBBHMTJSXOY-XVYDVKMFSA-N 0.000 description 1
- XLHLPYFMXGOASD-CIUDSAMLSA-N Asn-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XLHLPYFMXGOASD-CIUDSAMLSA-N 0.000 description 1
- SXNJBDYEBOUYOJ-DCAQKATOSA-N Asn-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)N)N SXNJBDYEBOUYOJ-DCAQKATOSA-N 0.000 description 1
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 1
- UHGUKCOQUNPSKK-CIUDSAMLSA-N Asn-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N UHGUKCOQUNPSKK-CIUDSAMLSA-N 0.000 description 1
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 1
- RVHGJNGNKGDCPX-KKUMJFAQSA-N Asn-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N RVHGJNGNKGDCPX-KKUMJFAQSA-N 0.000 description 1
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 1
- OOXUBGLNDRGOKT-FXQIFTODSA-N Asn-Ser-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OOXUBGLNDRGOKT-FXQIFTODSA-N 0.000 description 1
- REQUGIWGOGSOEZ-ZLUOBGJFSA-N Asn-Ser-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N REQUGIWGOGSOEZ-ZLUOBGJFSA-N 0.000 description 1
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 1
- JXMREEPBRANWBY-VEVYYDQMSA-N Asn-Thr-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JXMREEPBRANWBY-VEVYYDQMSA-N 0.000 description 1
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 description 1
- SOYOSFXLXYZNRG-CIUDSAMLSA-N Asp-Arg-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O SOYOSFXLXYZNRG-CIUDSAMLSA-N 0.000 description 1
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 description 1
- MFMJRYHVLLEMQM-DCAQKATOSA-N Asp-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N MFMJRYHVLLEMQM-DCAQKATOSA-N 0.000 description 1
- ATYWBXGNXZYZGI-ACZMJKKPSA-N Asp-Asn-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ATYWBXGNXZYZGI-ACZMJKKPSA-N 0.000 description 1
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 1
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 1
- QOVWVLLHMMCFFY-ZLUOBGJFSA-N Asp-Asp-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QOVWVLLHMMCFFY-ZLUOBGJFSA-N 0.000 description 1
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 1
- RSMIHCFQDCVVBR-CIUDSAMLSA-N Asp-Gln-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N RSMIHCFQDCVVBR-CIUDSAMLSA-N 0.000 description 1
- IJHUZMGJRGNXIW-CIUDSAMLSA-N Asp-Glu-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IJHUZMGJRGNXIW-CIUDSAMLSA-N 0.000 description 1
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 1
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 1
- GISFCCXBVJKGEO-QEJZJMRPSA-N Asp-Glu-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O GISFCCXBVJKGEO-QEJZJMRPSA-N 0.000 description 1
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 1
- PSLSTUMPZILTAH-BYULHYEWSA-N Asp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PSLSTUMPZILTAH-BYULHYEWSA-N 0.000 description 1
- SEMWSADZTMJELF-BYULHYEWSA-N Asp-Ile-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O SEMWSADZTMJELF-BYULHYEWSA-N 0.000 description 1
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 1
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 1
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 1
- HSGOFISJLFDMBJ-CIUDSAMLSA-N Asp-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N HSGOFISJLFDMBJ-CIUDSAMLSA-N 0.000 description 1
- RRUWMFBLFLUZSI-LPEHRKFASA-N Asp-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N RRUWMFBLFLUZSI-LPEHRKFASA-N 0.000 description 1
- WQSXAPPYLGNMQL-IHRRRGAJSA-N Asp-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N WQSXAPPYLGNMQL-IHRRRGAJSA-N 0.000 description 1
- IDDMGSKZQDEDGA-SRVKXCTJSA-N Asp-Phe-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 IDDMGSKZQDEDGA-SRVKXCTJSA-N 0.000 description 1
- WOPJVEMFXYHZEE-SRVKXCTJSA-N Asp-Phe-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WOPJVEMFXYHZEE-SRVKXCTJSA-N 0.000 description 1
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 1
- HICVMZCGVFKTPM-BQBZGAKWSA-N Asp-Pro-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HICVMZCGVFKTPM-BQBZGAKWSA-N 0.000 description 1
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 1
- FIAKNCXQFFKSSI-ZLUOBGJFSA-N Asp-Ser-Cys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O FIAKNCXQFFKSSI-ZLUOBGJFSA-N 0.000 description 1
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 1
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 1
- DKQCWCQRAMAFLN-UBHSHLNASA-N Asp-Trp-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O DKQCWCQRAMAFLN-UBHSHLNASA-N 0.000 description 1
- GWOVSEVNXNVMMY-BPUTZDHNSA-N Asp-Trp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)O)N GWOVSEVNXNVMMY-BPUTZDHNSA-N 0.000 description 1
- ZVYYMCXVPZEAPU-CWRNSKLLSA-N Asp-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC(=O)O)N)C(=O)O ZVYYMCXVPZEAPU-CWRNSKLLSA-N 0.000 description 1
- HCOQNGIHSXICCB-IHRRRGAJSA-N Asp-Tyr-Arg Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)O HCOQNGIHSXICCB-IHRRRGAJSA-N 0.000 description 1
- AWPWHMVCSISSQK-QWRGUYRKSA-N Asp-Tyr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O AWPWHMVCSISSQK-QWRGUYRKSA-N 0.000 description 1
- BPAUXFVCSYQDQX-JRQIVUDYSA-N Asp-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)O)N)O BPAUXFVCSYQDQX-JRQIVUDYSA-N 0.000 description 1
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 description 1
- 241000606124 Bacteroides fragilis Species 0.000 description 1
- 241000186015 Bifidobacterium longum subsp. infantis Species 0.000 description 1
- TXCIAUNLDRJGJZ-UHFFFAOYSA-N CMP-N-acetyl neuraminic acid Natural products O1C(C(O)C(O)CO)C(NC(=O)C)C(O)CC1(C(O)=O)OP(O)(=O)OCC1C(O)C(O)C(N2C(N=C(N)C=C2)=O)O1 TXCIAUNLDRJGJZ-UHFFFAOYSA-N 0.000 description 1
- TXCIAUNLDRJGJZ-BILDWYJOSA-N CMP-N-acetyl-beta-neuraminic acid Chemical compound O1[C@@H]([C@H](O)[C@H](O)CO)[C@H](NC(=O)C)[C@@H](O)C[C@]1(C(O)=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(N=C(N)C=C2)=O)O1 TXCIAUNLDRJGJZ-BILDWYJOSA-N 0.000 description 1
- 244000025254 Cannabis sativa Species 0.000 description 1
- 208000024699 Chagas disease Diseases 0.000 description 1
- 241000193403 Clostridium Species 0.000 description 1
- FMDCYTBSPZMPQE-JBDRJPRFSA-N Cys-Ala-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMDCYTBSPZMPQE-JBDRJPRFSA-N 0.000 description 1
- JTNKVWLMDHIUOG-IHRRRGAJSA-N Cys-Arg-Phe Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JTNKVWLMDHIUOG-IHRRRGAJSA-N 0.000 description 1
- SKSJPIBFNFPTJB-NKWVEPMBSA-N Cys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CS)N)C(=O)O SKSJPIBFNFPTJB-NKWVEPMBSA-N 0.000 description 1
- LKUCSUGWHYVYLP-GHCJXIJMSA-N Cys-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N LKUCSUGWHYVYLP-GHCJXIJMSA-N 0.000 description 1
- JXVFJOMFOLFPMP-KKUMJFAQSA-N Cys-Leu-Tyr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JXVFJOMFOLFPMP-KKUMJFAQSA-N 0.000 description 1
- SNHRIJBANHPWMO-XGEHTFHBSA-N Cys-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CS)N)O SNHRIJBANHPWMO-XGEHTFHBSA-N 0.000 description 1
- NMWZMKLDGZXRKP-BZSNNMDCSA-N Cys-Phe-Phe Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NMWZMKLDGZXRKP-BZSNNMDCSA-N 0.000 description 1
- CHRCKSPMGYDLIA-SRVKXCTJSA-N Cys-Phe-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O CHRCKSPMGYDLIA-SRVKXCTJSA-N 0.000 description 1
- LKHMGNHQULEPFY-ACZMJKKPSA-N Cys-Ser-Glu Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O LKHMGNHQULEPFY-ACZMJKKPSA-N 0.000 description 1
- JAHCWGSVNZXHRR-SVSWQMSJSA-N Cys-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CS)N JAHCWGSVNZXHRR-SVSWQMSJSA-N 0.000 description 1
- JRZMCSIUYGSJKP-ZKWXMUAHSA-N Cys-Val-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JRZMCSIUYGSJKP-ZKWXMUAHSA-N 0.000 description 1
- LPBUBIHAVKXUOT-FXQIFTODSA-N Cys-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N LPBUBIHAVKXUOT-FXQIFTODSA-N 0.000 description 1
- UHDGCWIWMRVCDJ-PSQAKQOGSA-N Cytidine Natural products O=C1N=C(N)C=CN1[C@@H]1[C@@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-PSQAKQOGSA-N 0.000 description 1
- 108010090461 DFG peptide Proteins 0.000 description 1
- ZAFNJMIOTHYJRJ-UHFFFAOYSA-N Diisopropyl ether Chemical compound CC(C)OC(C)C ZAFNJMIOTHYJRJ-UHFFFAOYSA-N 0.000 description 1
- 241000224559 Endotrypanum Species 0.000 description 1
- PIICEJLVQHRZGT-UHFFFAOYSA-N Ethylenediamine Chemical compound NCCN PIICEJLVQHRZGT-UHFFFAOYSA-N 0.000 description 1
- 241000555682 Forsythia x intermedia Species 0.000 description 1
- 241000606807 Glaesserella parasuis Species 0.000 description 1
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 1
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 1
- MQANCSUBSBJNLU-KKUMJFAQSA-N Gln-Arg-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQANCSUBSBJNLU-KKUMJFAQSA-N 0.000 description 1
- TWHDOEYLXXQYOZ-FXQIFTODSA-N Gln-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N TWHDOEYLXXQYOZ-FXQIFTODSA-N 0.000 description 1
- MGJMFSBEMSNYJL-AVGNSLFASA-N Gln-Asn-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MGJMFSBEMSNYJL-AVGNSLFASA-N 0.000 description 1
- CYTSBCIIEHUPDU-ACZMJKKPSA-N Gln-Asp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O CYTSBCIIEHUPDU-ACZMJKKPSA-N 0.000 description 1
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 1
- LOJYQMFIIJVETK-WDSKDSINSA-N Gln-Gln Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(O)=O LOJYQMFIIJVETK-WDSKDSINSA-N 0.000 description 1
- NVEASDQHBRZPSU-BQBZGAKWSA-N Gln-Gln-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O NVEASDQHBRZPSU-BQBZGAKWSA-N 0.000 description 1
- IKFZXRLDMYWNBU-YUMQZZPRSA-N Gln-Gly-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N IKFZXRLDMYWNBU-YUMQZZPRSA-N 0.000 description 1
- XWIBVSAEUCAAKF-GVXVVHGQSA-N Gln-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)N)N XWIBVSAEUCAAKF-GVXVVHGQSA-N 0.000 description 1
- HDUDGCZEOZEFOA-KBIXCLLPSA-N Gln-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HDUDGCZEOZEFOA-KBIXCLLPSA-N 0.000 description 1
- QBLMTCRYYTVUQY-GUBZILKMSA-N Gln-Leu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QBLMTCRYYTVUQY-GUBZILKMSA-N 0.000 description 1
- FALJZCPMTGJOHX-SRVKXCTJSA-N Gln-Met-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O FALJZCPMTGJOHX-SRVKXCTJSA-N 0.000 description 1
- FQCILXROGNOZON-YUMQZZPRSA-N Gln-Pro-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O FQCILXROGNOZON-YUMQZZPRSA-N 0.000 description 1
- OREPWMPAUWIIAM-ZPFDUUQYSA-N Gln-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N OREPWMPAUWIIAM-ZPFDUUQYSA-N 0.000 description 1
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 1
- SGVGIVDZLSHSEN-RYUDHWBXSA-N Gln-Tyr-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O SGVGIVDZLSHSEN-RYUDHWBXSA-N 0.000 description 1
- ZMXZGYLINVNTKH-DZKIICNBSA-N Gln-Val-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZMXZGYLINVNTKH-DZKIICNBSA-N 0.000 description 1
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 1
- HNAUFGBKJLTWQE-IFFSRLJSSA-N Gln-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N)O HNAUFGBKJLTWQE-IFFSRLJSSA-N 0.000 description 1
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 1
- BPDVTFBJZNBHEU-HGNGGELXSA-N Glu-Ala-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 BPDVTFBJZNBHEU-HGNGGELXSA-N 0.000 description 1
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 1
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 1
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 description 1
- WLIPTFCZLHCNFD-LPEHRKFASA-N Glu-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O WLIPTFCZLHCNFD-LPEHRKFASA-N 0.000 description 1
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 1
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 1
- VOORMNJKNBGYGK-YUMQZZPRSA-N Glu-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N VOORMNJKNBGYGK-YUMQZZPRSA-N 0.000 description 1
- VXQOONWNIWFOCS-HGNGGELXSA-N Glu-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N VXQOONWNIWFOCS-HGNGGELXSA-N 0.000 description 1
- BRKUZSLQMPNVFN-SRVKXCTJSA-N Glu-His-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BRKUZSLQMPNVFN-SRVKXCTJSA-N 0.000 description 1
- ZJFNRQHUIHKZJF-GUBZILKMSA-N Glu-His-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O ZJFNRQHUIHKZJF-GUBZILKMSA-N 0.000 description 1
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 1
- BBBXWRGITSUJPB-YUMQZZPRSA-N Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O BBBXWRGITSUJPB-YUMQZZPRSA-N 0.000 description 1
- FMBWLLMUPXTXFC-SDDRHHMPSA-N Glu-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N)C(=O)O FMBWLLMUPXTXFC-SDDRHHMPSA-N 0.000 description 1
- JHSRJMUJOGLIHK-GUBZILKMSA-N Glu-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N JHSRJMUJOGLIHK-GUBZILKMSA-N 0.000 description 1
- QMOSCLNJVKSHHU-YUMQZZPRSA-N Glu-Met-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QMOSCLNJVKSHHU-YUMQZZPRSA-N 0.000 description 1
- GMAGZGCAYLQBKF-NHCYSSNCSA-N Glu-Met-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O GMAGZGCAYLQBKF-NHCYSSNCSA-N 0.000 description 1
- YTRBQAQSUDSIQE-FHWLQOOXSA-N Glu-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 YTRBQAQSUDSIQE-FHWLQOOXSA-N 0.000 description 1
- AAJHGGDRKHYSDH-GUBZILKMSA-N Glu-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O AAJHGGDRKHYSDH-GUBZILKMSA-N 0.000 description 1
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 1
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 1
- ALMBZBOCGSVSAI-ACZMJKKPSA-N Glu-Ser-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ALMBZBOCGSVSAI-ACZMJKKPSA-N 0.000 description 1
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 1
- TZXOPHFCAATANZ-QEJZJMRPSA-N Glu-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N TZXOPHFCAATANZ-QEJZJMRPSA-N 0.000 description 1
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 1
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 1
- HGJREIGJLUQBTJ-SZMVWBNQSA-N Glu-Trp-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O HGJREIGJLUQBTJ-SZMVWBNQSA-N 0.000 description 1
- VXEFAWJTFAUDJK-AVGNSLFASA-N Glu-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O VXEFAWJTFAUDJK-AVGNSLFASA-N 0.000 description 1
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 1
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 1
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 1
- QRWPTXLWHHTOCO-DZKIICNBSA-N Glu-Val-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QRWPTXLWHHTOCO-DZKIICNBSA-N 0.000 description 1
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 1
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 1
- FKJQNJCQTKUBCD-XPUUQOCRSA-N Gly-Ala-His Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O FKJQNJCQTKUBCD-XPUUQOCRSA-N 0.000 description 1
- PHONXOACARQMPM-BQBZGAKWSA-N Gly-Ala-Met Chemical compound [H]NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O PHONXOACARQMPM-BQBZGAKWSA-N 0.000 description 1
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 1
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 1
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 1
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 1
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 1
- XZRZILPOZBVTDB-GJZGRUSLSA-N Gly-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)CN)C(O)=O)=CNC2=C1 XZRZILPOZBVTDB-GJZGRUSLSA-N 0.000 description 1
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 1
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 1
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 1
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 1
- LLXVQPKEQQCISF-YUMQZZPRSA-N Gly-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN LLXVQPKEQQCISF-YUMQZZPRSA-N 0.000 description 1
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 1
- QCTLGOYODITHPQ-WHFBIAKZSA-N Gly-Cys-Ser Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O QCTLGOYODITHPQ-WHFBIAKZSA-N 0.000 description 1
- BULIVUZUDBHKKZ-WDSKDSINSA-N Gly-Gln-Asn Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BULIVUZUDBHKKZ-WDSKDSINSA-N 0.000 description 1
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 1
- NPSWCZIRBAYNSB-JHEQGTHGSA-N Gly-Gln-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPSWCZIRBAYNSB-JHEQGTHGSA-N 0.000 description 1
- LHRXAHLCRMQBGJ-RYUDHWBXSA-N Gly-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN LHRXAHLCRMQBGJ-RYUDHWBXSA-N 0.000 description 1
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 1
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 1
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 1
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 1
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 1
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 1
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 1
- RVGMVLVBDRQVKB-UWVGGRQHSA-N Gly-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN RVGMVLVBDRQVKB-UWVGGRQHSA-N 0.000 description 1
- OMOZPGCHVWOXHN-BQBZGAKWSA-N Gly-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)CN OMOZPGCHVWOXHN-BQBZGAKWSA-N 0.000 description 1
- IEGFSKKANYKBDU-QWHCGFSZSA-N Gly-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)CN)C(=O)O IEGFSKKANYKBDU-QWHCGFSZSA-N 0.000 description 1
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 1
- GAAHQHNCMIAYEX-UWVGGRQHSA-N Gly-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GAAHQHNCMIAYEX-UWVGGRQHSA-N 0.000 description 1
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 1
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 1
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 1
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 1
- IMRNSEPSPFQNHF-STQMWFEESA-N Gly-Ser-Trp Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C12)C(=O)O IMRNSEPSPFQNHF-STQMWFEESA-N 0.000 description 1
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 1
- YXTFLTJYLIAZQG-FJXKBIBVSA-N Gly-Thr-Arg Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YXTFLTJYLIAZQG-FJXKBIBVSA-N 0.000 description 1
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- GWNIGUKSRJBIHX-STQMWFEESA-N Gly-Tyr-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)CN)O GWNIGUKSRJBIHX-STQMWFEESA-N 0.000 description 1
- NWOSHVVPKDQKKT-RYUDHWBXSA-N Gly-Tyr-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O NWOSHVVPKDQKKT-RYUDHWBXSA-N 0.000 description 1
- PNUFMLXHOLFRLD-KBPBESRZSA-N Gly-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 PNUFMLXHOLFRLD-KBPBESRZSA-N 0.000 description 1
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 1
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 1
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- 241000606790 Haemophilus Species 0.000 description 1
- AWHJQEYGWRKPHE-LSJOCFKGSA-N His-Ala-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AWHJQEYGWRKPHE-LSJOCFKGSA-N 0.000 description 1
- XINDHUAGVGCNSF-QSFUFRPTSA-N His-Ala-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XINDHUAGVGCNSF-QSFUFRPTSA-N 0.000 description 1
- FLUVGKKRRMLNPU-CQDKDKBSSA-N His-Ala-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FLUVGKKRRMLNPU-CQDKDKBSSA-N 0.000 description 1
- WJUYPBBCSSLVJE-CIUDSAMLSA-N His-Asn-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N WJUYPBBCSSLVJE-CIUDSAMLSA-N 0.000 description 1
- ZJSMFRTVYSLKQU-DJFWLOJKSA-N His-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ZJSMFRTVYSLKQU-DJFWLOJKSA-N 0.000 description 1
- IMPKSPYRPUXYAP-SZMVWBNQSA-N His-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC3=CN=CN3)N IMPKSPYRPUXYAP-SZMVWBNQSA-N 0.000 description 1
- HQKADFMLECZIQJ-HVTMNAMFSA-N His-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N HQKADFMLECZIQJ-HVTMNAMFSA-N 0.000 description 1
- STWGDDDFLUFCCA-GVXVVHGQSA-N His-Glu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O STWGDDDFLUFCCA-GVXVVHGQSA-N 0.000 description 1
- VBOFRJNDIOPNDO-YUMQZZPRSA-N His-Gly-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N VBOFRJNDIOPNDO-YUMQZZPRSA-N 0.000 description 1
- FYTCLUIYTYFGPT-YUMQZZPRSA-N His-Gly-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FYTCLUIYTYFGPT-YUMQZZPRSA-N 0.000 description 1
- LBQAHBIVXQSBIR-HVTMNAMFSA-N His-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N LBQAHBIVXQSBIR-HVTMNAMFSA-N 0.000 description 1
- ZRSJXIKQXUGKRB-TUBUOCAGSA-N His-Ile-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZRSJXIKQXUGKRB-TUBUOCAGSA-N 0.000 description 1
- PGRPSOUCWRBWKZ-DLOVCJGASA-N His-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 PGRPSOUCWRBWKZ-DLOVCJGASA-N 0.000 description 1
- SVVULKPWDBIPCO-BZSNNMDCSA-N His-Phe-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SVVULKPWDBIPCO-BZSNNMDCSA-N 0.000 description 1
- WHKLDLQHSYAVGU-ACRUOGEOSA-N His-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WHKLDLQHSYAVGU-ACRUOGEOSA-N 0.000 description 1
- VCBWXASUBZIFLQ-IHRRRGAJSA-N His-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O VCBWXASUBZIFLQ-IHRRRGAJSA-N 0.000 description 1
- KAXZXLSXFWSNNZ-XVYDVKMFSA-N His-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KAXZXLSXFWSNNZ-XVYDVKMFSA-N 0.000 description 1
- MCGOGXFMKHPMSQ-AVGNSLFASA-N His-Val-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 MCGOGXFMKHPMSQ-AVGNSLFASA-N 0.000 description 1
- GBMSSORHVHAYLU-QTKMDUPCSA-N His-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N)O GBMSSORHVHAYLU-QTKMDUPCSA-N 0.000 description 1
- DRKZDEFADVYTLU-AVGNSLFASA-N His-Val-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DRKZDEFADVYTLU-AVGNSLFASA-N 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- UFHFLCQGNIYNRP-UHFFFAOYSA-N Hydrogen Chemical compound [H][H] UFHFLCQGNIYNRP-UHFFFAOYSA-N 0.000 description 1
- YKRYHWJRQUSTKG-KBIXCLLPSA-N Ile-Ala-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKRYHWJRQUSTKG-KBIXCLLPSA-N 0.000 description 1
- QICVAHODWHIWIS-HTFCKZLJSA-N Ile-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N QICVAHODWHIWIS-HTFCKZLJSA-N 0.000 description 1
- SACHLUOUHCVIKI-GMOBBJLQSA-N Ile-Arg-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SACHLUOUHCVIKI-GMOBBJLQSA-N 0.000 description 1
- QTUSJASXLGLJSR-OSUNSFLBSA-N Ile-Arg-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N QTUSJASXLGLJSR-OSUNSFLBSA-N 0.000 description 1
- HZMLFETXHFHGBB-UGYAYLCHSA-N Ile-Asn-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZMLFETXHFHGBB-UGYAYLCHSA-N 0.000 description 1
- ZZHGKECPZXPXJF-PCBIJLKTSA-N Ile-Asn-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZZHGKECPZXPXJF-PCBIJLKTSA-N 0.000 description 1
- HDODQNPMSHDXJT-GHCJXIJMSA-N Ile-Asn-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O HDODQNPMSHDXJT-GHCJXIJMSA-N 0.000 description 1
- UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 description 1
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 1
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 1
- HGNUKGZQASSBKQ-PCBIJLKTSA-N Ile-Asp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HGNUKGZQASSBKQ-PCBIJLKTSA-N 0.000 description 1
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 1
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 1
- IGJWJGIHUFQANP-LAEOZQHASA-N Ile-Gly-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N IGJWJGIHUFQANP-LAEOZQHASA-N 0.000 description 1
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 1
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 1
- KYLIZSDYWQQTFM-PEDHHIEDSA-N Ile-Ile-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N KYLIZSDYWQQTFM-PEDHHIEDSA-N 0.000 description 1
- SVBAHOMTJRFSIC-SXTJYALSSA-N Ile-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVBAHOMTJRFSIC-SXTJYALSSA-N 0.000 description 1
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 1
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 description 1
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 1
- CEPIAEUVRKGPGP-DSYPUSFNSA-N Ile-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 CEPIAEUVRKGPGP-DSYPUSFNSA-N 0.000 description 1
- IIWQTXMUALXGOV-PCBIJLKTSA-N Ile-Phe-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IIWQTXMUALXGOV-PCBIJLKTSA-N 0.000 description 1
- KTTMFLSBTNBAHL-MXAVVETBSA-N Ile-Phe-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N KTTMFLSBTNBAHL-MXAVVETBSA-N 0.000 description 1
- CAHCWMVNBZJVAW-NAKRPEOUSA-N Ile-Pro-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)O)N CAHCWMVNBZJVAW-NAKRPEOUSA-N 0.000 description 1
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 1
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 1
- QQVXERGIFIRCGW-NAKRPEOUSA-N Ile-Ser-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N QQVXERGIFIRCGW-NAKRPEOUSA-N 0.000 description 1
- JDCQDJVYUXNCGF-SPOWBLRKSA-N Ile-Ser-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N JDCQDJVYUXNCGF-SPOWBLRKSA-N 0.000 description 1
- DGTOKVBDZXJHNZ-WZLNRYEVSA-N Ile-Thr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N DGTOKVBDZXJHNZ-WZLNRYEVSA-N 0.000 description 1
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 1
- FXJLRZFMKGHYJP-CFMVVWHZSA-N Ile-Tyr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FXJLRZFMKGHYJP-CFMVVWHZSA-N 0.000 description 1
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 1
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 1
- WIYDLTIBHZSPKY-HJWJTTGWSA-N Ile-Val-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WIYDLTIBHZSPKY-HJWJTTGWSA-N 0.000 description 1
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-N L-arginine Chemical compound OC(=O)[C@@H](N)CCCN=C(N)N ODKSFYDXXFIFQN-BYPYZUCNSA-N 0.000 description 1
- 229930064664 L-arginine Natural products 0.000 description 1
- 235000014852 L-arginine Nutrition 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical group NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 1
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 1
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 1
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 1
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 1
- IIKJNQWOQIWWMR-CIUDSAMLSA-N Leu-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)N IIKJNQWOQIWWMR-CIUDSAMLSA-N 0.000 description 1
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 1
- FEHQLKKBVJHSEC-SZMVWBNQSA-N Leu-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FEHQLKKBVJHSEC-SZMVWBNQSA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- YWYQSLOTVIRCFE-SRVKXCTJSA-N Leu-His-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O YWYQSLOTVIRCFE-SRVKXCTJSA-N 0.000 description 1
- BKTXKJMNTSMJDQ-AVGNSLFASA-N Leu-His-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BKTXKJMNTSMJDQ-AVGNSLFASA-N 0.000 description 1
- KXODZBLFVFSLAI-AVGNSLFASA-N Leu-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KXODZBLFVFSLAI-AVGNSLFASA-N 0.000 description 1
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 1
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 1
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 description 1
- REPBGZHJKYWFMJ-KKUMJFAQSA-N Leu-Lys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N REPBGZHJKYWFMJ-KKUMJFAQSA-N 0.000 description 1
- PKKMDPNFGULLNQ-AVGNSLFASA-N Leu-Met-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O PKKMDPNFGULLNQ-AVGNSLFASA-N 0.000 description 1
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 1
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 1
- YESNGRDJQWDYLH-KKUMJFAQSA-N Leu-Phe-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N YESNGRDJQWDYLH-KKUMJFAQSA-N 0.000 description 1
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 1
- ADJWHHZETYAAAX-SRVKXCTJSA-N Leu-Ser-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ADJWHHZETYAAAX-SRVKXCTJSA-N 0.000 description 1
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- SXOFUVGLPHCPRQ-KKUMJFAQSA-N Leu-Tyr-Cys Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(O)=O SXOFUVGLPHCPRQ-KKUMJFAQSA-N 0.000 description 1
- RDFIVFHPOSOXMW-ACRUOGEOSA-N Leu-Tyr-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RDFIVFHPOSOXMW-ACRUOGEOSA-N 0.000 description 1
- VQHUBNVKFFLWRP-ULQDDVLXSA-N Leu-Tyr-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 VQHUBNVKFFLWRP-ULQDDVLXSA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- TUIOUEWKFFVNLH-DCAQKATOSA-N Leu-Val-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O TUIOUEWKFFVNLH-DCAQKATOSA-N 0.000 description 1
- 239000002841 Lewis acid Substances 0.000 description 1
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 1
- NTEVEUCLFMWSND-SRVKXCTJSA-N Lys-Arg-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O NTEVEUCLFMWSND-SRVKXCTJSA-N 0.000 description 1
- QYOXSYXPHUHOJR-GUBZILKMSA-N Lys-Asn-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYOXSYXPHUHOJR-GUBZILKMSA-N 0.000 description 1
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 1
- NDORZBUHCOJQDO-GVXVVHGQSA-N Lys-Gln-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O NDORZBUHCOJQDO-GVXVVHGQSA-N 0.000 description 1
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 1
- NNKLKUUGESXCBS-KBPBESRZSA-N Lys-Gly-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NNKLKUUGESXCBS-KBPBESRZSA-N 0.000 description 1
- GNLJXWBNLAIPEP-MELADBBJSA-N Lys-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCCCN)N)C(=O)O GNLJXWBNLAIPEP-MELADBBJSA-N 0.000 description 1
- KEPWSUPUFAPBRF-DKIMLUQUSA-N Lys-Ile-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KEPWSUPUFAPBRF-DKIMLUQUSA-N 0.000 description 1
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- VUTWYNQUSJWBHO-BZSNNMDCSA-N Lys-Leu-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VUTWYNQUSJWBHO-BZSNNMDCSA-N 0.000 description 1
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 1
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 1
- ALEVUGKHINJNIF-QEJZJMRPSA-N Lys-Phe-Ala Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ALEVUGKHINJNIF-QEJZJMRPSA-N 0.000 description 1
- MSSJJDVQTFTLIF-KBPBESRZSA-N Lys-Phe-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O MSSJJDVQTFTLIF-KBPBESRZSA-N 0.000 description 1
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 1
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 1
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 1
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 1
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 1
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 1
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 1
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 1
- RQILLQOQXLZTCK-KBPBESRZSA-N Lys-Tyr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O RQILLQOQXLZTCK-KBPBESRZSA-N 0.000 description 1
- SQRLLZAQNOQCEG-KKUMJFAQSA-N Lys-Tyr-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 SQRLLZAQNOQCEG-KKUMJFAQSA-N 0.000 description 1
- QLFAPXUXEBAWEK-NHCYSSNCSA-N Lys-Val-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QLFAPXUXEBAWEK-NHCYSSNCSA-N 0.000 description 1
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 1
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 1
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 1
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 1
- WDTLNWHPIPCMMP-AVGNSLFASA-N Met-Arg-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O WDTLNWHPIPCMMP-AVGNSLFASA-N 0.000 description 1
- NKDSBBBPGIVWEI-RCWTZXSCSA-N Met-Arg-Thr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NKDSBBBPGIVWEI-RCWTZXSCSA-N 0.000 description 1
- HDNOQCZWJGGHSS-VEVYYDQMSA-N Met-Asn-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HDNOQCZWJGGHSS-VEVYYDQMSA-N 0.000 description 1
- TZLYIHDABYBOCJ-FXQIFTODSA-N Met-Asp-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O TZLYIHDABYBOCJ-FXQIFTODSA-N 0.000 description 1
- YORIKIDJCPKBON-YUMQZZPRSA-N Met-Glu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YORIKIDJCPKBON-YUMQZZPRSA-N 0.000 description 1
- WWWGMQHQSAUXBU-BQBZGAKWSA-N Met-Gly-Asn Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O WWWGMQHQSAUXBU-BQBZGAKWSA-N 0.000 description 1
- XPCLRYNQMZOOFB-ULQDDVLXSA-N Met-His-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N XPCLRYNQMZOOFB-ULQDDVLXSA-N 0.000 description 1
- SCKPOOMCTFEVTN-QTKMDUPCSA-N Met-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCSC)N)O SCKPOOMCTFEVTN-QTKMDUPCSA-N 0.000 description 1
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 1
- AWGBEIYZPAXXSX-RWMBFGLXSA-N Met-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N AWGBEIYZPAXXSX-RWMBFGLXSA-N 0.000 description 1
- SMVTWPOATVIXTN-NAKRPEOUSA-N Met-Ser-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SMVTWPOATVIXTN-NAKRPEOUSA-N 0.000 description 1
- FXBKQTOGURNXSL-HJGDQZAQSA-N Met-Thr-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O FXBKQTOGURNXSL-HJGDQZAQSA-N 0.000 description 1
- YIGCDRZMZNDENK-UNQGMJICSA-N Met-Thr-Phe Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YIGCDRZMZNDENK-UNQGMJICSA-N 0.000 description 1
- JACMWNXOOUYXCD-JYJNAYRXSA-N Met-Val-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JACMWNXOOUYXCD-JYJNAYRXSA-N 0.000 description 1
- 206010027476 Metastases Diseases 0.000 description 1
- 241000187750 Micromonospora viridifaciens Species 0.000 description 1
- YNAVUWVOSKDBBP-UHFFFAOYSA-N Morpholine Chemical compound C1COCCN1 YNAVUWVOSKDBBP-UHFFFAOYSA-N 0.000 description 1
- OVRNDRQMDRJTHS-CBQIKETKSA-N N-Acetyl-D-Galactosamine Chemical compound CC(=O)N[C@H]1[C@@H](O)O[C@H](CO)[C@H](O)[C@@H]1O OVRNDRQMDRJTHS-CBQIKETKSA-N 0.000 description 1
- MBLBDJOUHNCFQT-UHFFFAOYSA-N N-acetyl-D-galactosamine Natural products CC(=O)NC(C=O)C(O)C(O)C(O)CO MBLBDJOUHNCFQT-UHFFFAOYSA-N 0.000 description 1
- SQVRNKJHWKZAKO-PFQGKNLYSA-N N-acetyl-beta-neuraminic acid Chemical compound CC(=O)N[C@@H]1[C@@H](O)C[C@@](O)(C(O)=O)O[C@H]1[C@H](O)[C@H](O)CO SQVRNKJHWKZAKO-PFQGKNLYSA-N 0.000 description 1
- MBBZMMPHUWSWHV-BDVNFPICSA-N N-methylglucamine Chemical compound CNC[C@H](O)[C@@H](O)[C@H](O)[C@H](O)CO MBBZMMPHUWSWHV-BDVNFPICSA-N 0.000 description 1
- 101800000597 N-terminal peptide Proteins 0.000 description 1
- 102400000108 N-terminal peptide Human genes 0.000 description 1
- 108010047562 NGR peptide Proteins 0.000 description 1
- PXHVJJICTQNCMI-UHFFFAOYSA-N Nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 description 1
- MNDLNRWDFVTCQF-UHFFFAOYSA-N O1CCN(CC1)CCO.OCCN1CCOCC1 Chemical compound O1CCN(CC1)CCO.OCCN1CCOCC1 MNDLNRWDFVTCQF-UHFFFAOYSA-N 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 241001524178 Paenarthrobacter ureafaciens Species 0.000 description 1
- 241000606856 Pasteurella multocida Species 0.000 description 1
- JVTMTFMMMHAPCR-UBHSHLNASA-N Phe-Ala-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JVTMTFMMMHAPCR-UBHSHLNASA-N 0.000 description 1
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 1
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 1
- AUJWXNGCAQWLEI-KBPBESRZSA-N Phe-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AUJWXNGCAQWLEI-KBPBESRZSA-N 0.000 description 1
- IWZRODDWOSIXPZ-IRXDYDNUSA-N Phe-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 IWZRODDWOSIXPZ-IRXDYDNUSA-N 0.000 description 1
- JLLJTMHNXQTMCK-UBHSHLNASA-N Phe-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 JLLJTMHNXQTMCK-UBHSHLNASA-N 0.000 description 1
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 1
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 1
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 1
- FGWUALWGCZJQDJ-URLPEUOOSA-N Phe-Thr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGWUALWGCZJQDJ-URLPEUOOSA-N 0.000 description 1
- SHUFSZDAIPLZLF-BEAPCOKYSA-N Phe-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O SHUFSZDAIPLZLF-BEAPCOKYSA-N 0.000 description 1
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 1
- APMXLWHMIVWLLR-BZSNNMDCSA-N Phe-Tyr-Ser Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(O)=O)C1=CC=CC=C1 APMXLWHMIVWLLR-BZSNNMDCSA-N 0.000 description 1
- YUPRIZTWANWWHK-DZKIICNBSA-N Phe-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N YUPRIZTWANWWHK-DZKIICNBSA-N 0.000 description 1
- JTKGCYOOJLUETJ-ULQDDVLXSA-N Phe-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JTKGCYOOJLUETJ-ULQDDVLXSA-N 0.000 description 1
- RGMLUHANLDVMPB-ULQDDVLXSA-N Phe-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RGMLUHANLDVMPB-ULQDDVLXSA-N 0.000 description 1
- VIIRRNQMMIHYHQ-XHSDSOJGSA-N Phe-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N VIIRRNQMMIHYHQ-XHSDSOJGSA-N 0.000 description 1
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 1
- FCCBQBZXIAZNIG-LSJOCFKGSA-N Pro-Ala-His Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O FCCBQBZXIAZNIG-LSJOCFKGSA-N 0.000 description 1
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 1
- HMNSRTLZAJHSIK-YUMQZZPRSA-N Pro-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 HMNSRTLZAJHSIK-YUMQZZPRSA-N 0.000 description 1
- UTAUEDINXUMHLG-FXQIFTODSA-N Pro-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 UTAUEDINXUMHLG-FXQIFTODSA-N 0.000 description 1
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 1
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 1
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 1
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 1
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 1
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 1
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 1
- WOIFYRZPIORBRY-AVGNSLFASA-N Pro-Lys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WOIFYRZPIORBRY-AVGNSLFASA-N 0.000 description 1
- ZUZINZIJHJFJRN-UBHSHLNASA-N Pro-Phe-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 ZUZINZIJHJFJRN-UBHSHLNASA-N 0.000 description 1
- JIWJRKNYLSHONY-KKUMJFAQSA-N Pro-Phe-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JIWJRKNYLSHONY-KKUMJFAQSA-N 0.000 description 1
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 1
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 1
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 1
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 1
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 1
- ZYJMLBCDFPIGNL-JYJNAYRXSA-N Pro-Tyr-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H]1CCCN1)C(O)=O ZYJMLBCDFPIGNL-JYJNAYRXSA-N 0.000 description 1
- SHTKRJHDMNSKRM-ULQDDVLXSA-N Pro-Tyr-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O SHTKRJHDMNSKRM-ULQDDVLXSA-N 0.000 description 1
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 1
- JXVXYRZQIUPYSA-NHCYSSNCSA-N Pro-Val-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JXVXYRZQIUPYSA-NHCYSSNCSA-N 0.000 description 1
- 241000589517 Pseudomonas aeruginosa Species 0.000 description 1
- 108010079005 RDV peptide Proteins 0.000 description 1
- 239000007868 Raney catalyst Substances 0.000 description 1
- 229910000564 Raney nickel Inorganic materials 0.000 description 1
- 229930182475 S-glycoside Natural products 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- SSJMZMUVNKEENT-IMJSIDKUSA-N Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CO SSJMZMUVNKEENT-IMJSIDKUSA-N 0.000 description 1
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 1
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 1
- PZZJMBYSYAKYPK-UWJYBYFXSA-N Ser-Ala-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PZZJMBYSYAKYPK-UWJYBYFXSA-N 0.000 description 1
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 1
- UBRXAVQWXOWRSJ-ZLUOBGJFSA-N Ser-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)C(=O)N UBRXAVQWXOWRSJ-ZLUOBGJFSA-N 0.000 description 1
- DKKGAAJTDKHWOD-BIIVOSGPSA-N Ser-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)C(=O)O DKKGAAJTDKHWOD-BIIVOSGPSA-N 0.000 description 1
- RNMRYWZYFHHOEV-CIUDSAMLSA-N Ser-Gln-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RNMRYWZYFHHOEV-CIUDSAMLSA-N 0.000 description 1
- IXUGADGDCQDLSA-FXQIFTODSA-N Ser-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N IXUGADGDCQDLSA-FXQIFTODSA-N 0.000 description 1
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 1
- PVDTYLHUWAEYGY-CIUDSAMLSA-N Ser-Glu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PVDTYLHUWAEYGY-CIUDSAMLSA-N 0.000 description 1
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 1
- WBINSDOPZHQPPM-AVGNSLFASA-N Ser-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)O WBINSDOPZHQPPM-AVGNSLFASA-N 0.000 description 1
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- QGAHMVHBORDHDC-YUMQZZPRSA-N Ser-His-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 QGAHMVHBORDHDC-YUMQZZPRSA-N 0.000 description 1
- MOQDPPUMFSMYOM-KKUMJFAQSA-N Ser-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CO)N MOQDPPUMFSMYOM-KKUMJFAQSA-N 0.000 description 1
- ZUDXUJSYCCNZQJ-DCAQKATOSA-N Ser-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N ZUDXUJSYCCNZQJ-DCAQKATOSA-N 0.000 description 1
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 1
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 1
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 1
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 1
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 1
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 1
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 1
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 1
- SDFUZKIAHWRUCS-QEJZJMRPSA-N Ser-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N SDFUZKIAHWRUCS-QEJZJMRPSA-N 0.000 description 1
- FVFUOQIYDPAIJR-XIRDDKMYSA-N Ser-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N FVFUOQIYDPAIJR-XIRDDKMYSA-N 0.000 description 1
- XPVIVVLLLOFBRH-XIRDDKMYSA-N Ser-Trp-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H](N)CO)C(O)=O XPVIVVLLLOFBRH-XIRDDKMYSA-N 0.000 description 1
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 1
- FGBLCMLXHRPVOF-IHRRRGAJSA-N Ser-Tyr-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FGBLCMLXHRPVOF-IHRRRGAJSA-N 0.000 description 1
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 description 1
- BIWBTRRBHIEVAH-IHPCNDPISA-N Ser-Tyr-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O BIWBTRRBHIEVAH-IHPCNDPISA-N 0.000 description 1
- SYCFMSYTIFXWAJ-DCAQKATOSA-N Ser-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N SYCFMSYTIFXWAJ-DCAQKATOSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 1
- 102100028755 Sialidase-2 Human genes 0.000 description 1
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 1
- KEAYESYHFKHZAL-UHFFFAOYSA-N Sodium Chemical compound [Na] KEAYESYHFKHZAL-UHFFFAOYSA-N 0.000 description 1
- VMHLLURERBWHNL-UHFFFAOYSA-M Sodium acetate Chemical compound [Na+].CC([O-])=O VMHLLURERBWHNL-UHFFFAOYSA-M 0.000 description 1
- 239000004133 Sodium thiosulphate Substances 0.000 description 1
- 241000194017 Streptococcus Species 0.000 description 1
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 1
- UNURFMVMXLENAZ-KJEVXHAQSA-N Thr-Arg-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UNURFMVMXLENAZ-KJEVXHAQSA-N 0.000 description 1
- JNQZPAWOPBZGIX-RCWTZXSCSA-N Thr-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N JNQZPAWOPBZGIX-RCWTZXSCSA-N 0.000 description 1
- JHBHMCMKSPXRHV-NUMRIWBASA-N Thr-Asn-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JHBHMCMKSPXRHV-NUMRIWBASA-N 0.000 description 1
- JTEICXDKGWKRRV-HJGDQZAQSA-N Thr-Asn-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JTEICXDKGWKRRV-HJGDQZAQSA-N 0.000 description 1
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 1
- JXKMXEBNZCKSDY-JIOCBJNQSA-N Thr-Asp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O JXKMXEBNZCKSDY-JIOCBJNQSA-N 0.000 description 1
- GKWNLDNXMMLRMC-GLLZPBPUSA-N Thr-Glu-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O GKWNLDNXMMLRMC-GLLZPBPUSA-N 0.000 description 1
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 1
- KBLYJPQSNGTDIU-LOKLDPHHSA-N Thr-Glu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O KBLYJPQSNGTDIU-LOKLDPHHSA-N 0.000 description 1
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 1
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 1
- WPSDXXQRIVKBAY-NKIYYHGXSA-N Thr-His-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O WPSDXXQRIVKBAY-NKIYYHGXSA-N 0.000 description 1
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 1
- JRAUIKJSEAKTGD-TUBUOCAGSA-N Thr-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N JRAUIKJSEAKTGD-TUBUOCAGSA-N 0.000 description 1
- UYTYTDMCDBPDSC-URLPEUOOSA-N Thr-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N UYTYTDMCDBPDSC-URLPEUOOSA-N 0.000 description 1
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 1
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 1
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 1
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 1
- KPNSNVTUVKSBFL-ZJDVBMNYSA-N Thr-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KPNSNVTUVKSBFL-ZJDVBMNYSA-N 0.000 description 1
- WVVOFCVMHAXGLE-LFSVMHDDSA-N Thr-Phe-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O WVVOFCVMHAXGLE-LFSVMHDDSA-N 0.000 description 1
- JMBRNXUOLJFURW-BEAPCOKYSA-N Thr-Phe-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N)O JMBRNXUOLJFURW-BEAPCOKYSA-N 0.000 description 1
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 1
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 1
- JAJOFWABAUKAEJ-QTKMDUPCSA-N Thr-Pro-His Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O JAJOFWABAUKAEJ-QTKMDUPCSA-N 0.000 description 1
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 1
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 1
- DOBIBIXIHJKVJF-XKBZYTNZSA-N Thr-Ser-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DOBIBIXIHJKVJF-XKBZYTNZSA-N 0.000 description 1
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 1
- BCYUHPXBHCUYBA-CUJWVEQBSA-N Thr-Ser-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BCYUHPXBHCUYBA-CUJWVEQBSA-N 0.000 description 1
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 1
- VGNLMPBYWWNQFS-ZEILLAHLSA-N Thr-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O VGNLMPBYWWNQFS-ZEILLAHLSA-N 0.000 description 1
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- FBQHKSPOIAFUEI-OWLDWWDNSA-N Thr-Trp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O FBQHKSPOIAFUEI-OWLDWWDNSA-N 0.000 description 1
- XGUAUKUYQHBUNY-SWRJLBSHSA-N Thr-Trp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O XGUAUKUYQHBUNY-SWRJLBSHSA-N 0.000 description 1
- PELIQFPESHBTMA-WLTAIBSBSA-N Thr-Tyr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 PELIQFPESHBTMA-WLTAIBSBSA-N 0.000 description 1
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 1
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 1
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 1
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 1
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 1
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 1
- HDTRYLNUVZCQOY-WSWWMNSNSA-N Trehalose Natural products O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-WSWWMNSNSA-N 0.000 description 1
- GSEJCLTVZPLZKY-UHFFFAOYSA-N Triethanolamine Chemical compound OCCN(CCO)CCO GSEJCLTVZPLZKY-UHFFFAOYSA-N 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- QNMIVTOQXUSGLN-SZMVWBNQSA-N Trp-Arg-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QNMIVTOQXUSGLN-SZMVWBNQSA-N 0.000 description 1
- VEYXZZGMIBKXCN-UBHSHLNASA-N Trp-Asp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VEYXZZGMIBKXCN-UBHSHLNASA-N 0.000 description 1
- DVAAUUVLDFKTAQ-VHWLVUOQSA-N Trp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N DVAAUUVLDFKTAQ-VHWLVUOQSA-N 0.000 description 1
- DXHHCIYKHRKBOC-BHYGNILZSA-N Trp-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O DXHHCIYKHRKBOC-BHYGNILZSA-N 0.000 description 1
- PKUJMYZNJMRHEZ-XIRDDKMYSA-N Trp-Glu-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PKUJMYZNJMRHEZ-XIRDDKMYSA-N 0.000 description 1
- DVIIYMVCSUQOJG-QEJZJMRPSA-N Trp-Glu-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DVIIYMVCSUQOJG-QEJZJMRPSA-N 0.000 description 1
- VMBBTANKMSRJSS-JSGCOSHPSA-N Trp-Glu-Gly Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VMBBTANKMSRJSS-JSGCOSHPSA-N 0.000 description 1
- SNJAPSVIPKUMCK-NWLDYVSISA-N Trp-Glu-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SNJAPSVIPKUMCK-NWLDYVSISA-N 0.000 description 1
- NXQAOORHSYJRGH-AAEUAGOBSA-N Trp-Gly-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 NXQAOORHSYJRGH-AAEUAGOBSA-N 0.000 description 1
- ORQGVWIUHICVKE-KCTSRDHCSA-N Trp-His-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O ORQGVWIUHICVKE-KCTSRDHCSA-N 0.000 description 1
- OTWIOROMZLNAQC-XIRDDKMYSA-N Trp-His-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O OTWIOROMZLNAQC-XIRDDKMYSA-N 0.000 description 1
- PGPCENKYTLDIFM-SZMVWBNQSA-N Trp-His-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O PGPCENKYTLDIFM-SZMVWBNQSA-N 0.000 description 1
- YLGQHMHKAASRGJ-WDSOQIARSA-N Trp-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N YLGQHMHKAASRGJ-WDSOQIARSA-N 0.000 description 1
- UKWSFUSPGPBJGU-VFAJRCTISA-N Trp-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O UKWSFUSPGPBJGU-VFAJRCTISA-N 0.000 description 1
- DZHDVYLBNKMLMB-ZFWWWQNUSA-N Trp-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 DZHDVYLBNKMLMB-ZFWWWQNUSA-N 0.000 description 1
- GRSCONMARGNYHA-PMVMPFDFSA-N Trp-Lys-Phe Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GRSCONMARGNYHA-PMVMPFDFSA-N 0.000 description 1
- JEYRCNVVYHTZMY-SZMVWBNQSA-N Trp-Pro-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JEYRCNVVYHTZMY-SZMVWBNQSA-N 0.000 description 1
- UIRPULWLRODAEQ-QEJZJMRPSA-N Trp-Ser-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 UIRPULWLRODAEQ-QEJZJMRPSA-N 0.000 description 1
- ARKBYVBCEOWRNR-UBHSHLNASA-N Trp-Ser-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O ARKBYVBCEOWRNR-UBHSHLNASA-N 0.000 description 1
- GSCPHMSPGQSZJT-JYBASQMISA-N Trp-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O GSCPHMSPGQSZJT-JYBASQMISA-N 0.000 description 1
- LNGFWVPNKLWATF-ZVZYQTTQSA-N Trp-Val-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LNGFWVPNKLWATF-ZVZYQTTQSA-N 0.000 description 1
- RWTFCAMQLFNPTK-UMPQAUOISA-N Trp-Val-Thr Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)=CNC2=C1 RWTFCAMQLFNPTK-UMPQAUOISA-N 0.000 description 1
- 241000224557 Trypanosoma brucei brucei Species 0.000 description 1
- 241001442399 Trypanosoma brucei gambiense Species 0.000 description 1
- 241001442397 Trypanosoma brucei rhodesiense Species 0.000 description 1
- 241000223107 Trypanosoma congolense Species 0.000 description 1
- 241000223097 Trypanosoma rangeli Species 0.000 description 1
- HSVPZJLMPLMPOX-BPNCWPANSA-N Tyr-Arg-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O HSVPZJLMPLMPOX-BPNCWPANSA-N 0.000 description 1
- SEFNTZYRPGBDCY-IHRRRGAJSA-N Tyr-Arg-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)O SEFNTZYRPGBDCY-IHRRRGAJSA-N 0.000 description 1
- CKKFTIQYURNSEI-IHRRRGAJSA-N Tyr-Asn-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CKKFTIQYURNSEI-IHRRRGAJSA-N 0.000 description 1
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 1
- IXTQGBGHWQEEDE-AVGNSLFASA-N Tyr-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IXTQGBGHWQEEDE-AVGNSLFASA-N 0.000 description 1
- YGKVNUAKYPGORG-AVGNSLFASA-N Tyr-Asp-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YGKVNUAKYPGORG-AVGNSLFASA-N 0.000 description 1
- YLRLHDFMMWDYTK-KKUMJFAQSA-N Tyr-Cys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 YLRLHDFMMWDYTK-KKUMJFAQSA-N 0.000 description 1
- UMXSDHPSMROQRB-YJRXYDGGSA-N Tyr-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UMXSDHPSMROQRB-YJRXYDGGSA-N 0.000 description 1
- LHTGRUZSZOIAKM-SOUVJXGZSA-N Tyr-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O LHTGRUZSZOIAKM-SOUVJXGZSA-N 0.000 description 1
- KOVXHANYYYMBRF-IRIUXVKKSA-N Tyr-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KOVXHANYYYMBRF-IRIUXVKKSA-N 0.000 description 1
- PMDWYLVWHRTJIW-STQMWFEESA-N Tyr-Gly-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PMDWYLVWHRTJIW-STQMWFEESA-N 0.000 description 1
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 1
- RIFVTNDKUMSSMN-ULQDDVLXSA-N Tyr-His-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(O)=O RIFVTNDKUMSSMN-ULQDDVLXSA-N 0.000 description 1
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 1
- JLKVWTICWVWGSK-JYJNAYRXSA-N Tyr-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JLKVWTICWVWGSK-JYJNAYRXSA-N 0.000 description 1
- RGYCVIZZTUBSSG-JYJNAYRXSA-N Tyr-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O RGYCVIZZTUBSSG-JYJNAYRXSA-N 0.000 description 1
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 1
- UMSZZGTXGKHTFJ-SRVKXCTJSA-N Tyr-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UMSZZGTXGKHTFJ-SRVKXCTJSA-N 0.000 description 1
- VSYROIRKNBCULO-BWAGICSOSA-N Tyr-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O VSYROIRKNBCULO-BWAGICSOSA-N 0.000 description 1
- NZBSVMQZQMEUHI-WZLNRYEVSA-N Tyr-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NZBSVMQZQMEUHI-WZLNRYEVSA-N 0.000 description 1
- MQUYPYFPHIPVHJ-MNSWYVGCSA-N Tyr-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)O MQUYPYFPHIPVHJ-MNSWYVGCSA-N 0.000 description 1
- 241000182606 Urashima Species 0.000 description 1
- WOCYUGQDXPTQPY-FXQIFTODSA-N Val-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N WOCYUGQDXPTQPY-FXQIFTODSA-N 0.000 description 1
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 1
- PFNZJEPSCBAVGX-CYDGBPFRSA-N Val-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N PFNZJEPSCBAVGX-CYDGBPFRSA-N 0.000 description 1
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 1
- BYOHPUZJVXWHAE-BYULHYEWSA-N Val-Asn-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BYOHPUZJVXWHAE-BYULHYEWSA-N 0.000 description 1
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 1
- QGFPYRPIUXBYGR-YDHLFZDLSA-N Val-Asn-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N QGFPYRPIUXBYGR-YDHLFZDLSA-N 0.000 description 1
- IQQYYFPCWKWUHW-YDHLFZDLSA-N Val-Asn-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N IQQYYFPCWKWUHW-YDHLFZDLSA-N 0.000 description 1
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 1
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 1
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 1
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 1
- FPCIBLUVDNXPJO-XPUUQOCRSA-N Val-Cys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FPCIBLUVDNXPJO-XPUUQOCRSA-N 0.000 description 1
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 1
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 1
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 1
- BEGDZYNDCNEGJZ-XVKPBYJWSA-N Val-Gly-Gln Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O BEGDZYNDCNEGJZ-XVKPBYJWSA-N 0.000 description 1
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 1
- MDYSKHBSPXUOPV-JSGCOSHPSA-N Val-Gly-Phe Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MDYSKHBSPXUOPV-JSGCOSHPSA-N 0.000 description 1
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 1
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 1
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 1
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 1
- DAVNYIUELQBTAP-XUXIUFHCSA-N Val-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N DAVNYIUELQBTAP-XUXIUFHCSA-N 0.000 description 1
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 1
- OJPRSVJGNCAKQX-SRVKXCTJSA-N Val-Met-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N OJPRSVJGNCAKQX-SRVKXCTJSA-N 0.000 description 1
- RQOMPQGUGBILAG-AVGNSLFASA-N Val-Met-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RQOMPQGUGBILAG-AVGNSLFASA-N 0.000 description 1
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 description 1
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 1
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 1
- MJOUSKQHAIARKI-JYJNAYRXSA-N Val-Phe-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 MJOUSKQHAIARKI-JYJNAYRXSA-N 0.000 description 1
- LGXUZJIQCGXKGZ-QXEWZRGKSA-N Val-Pro-Asn Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N LGXUZJIQCGXKGZ-QXEWZRGKSA-N 0.000 description 1
- QIVPZSWBBHRNBA-JYJNAYRXSA-N Val-Pro-Phe Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O QIVPZSWBBHRNBA-JYJNAYRXSA-N 0.000 description 1
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 1
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 1
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 1
- DLLRRUDLMSJTMB-GUBZILKMSA-N Val-Ser-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N DLLRRUDLMSJTMB-GUBZILKMSA-N 0.000 description 1
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 1
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 1
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 1
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 1
- UEXPMFIAZZHEAD-HSHDSVGOSA-N Val-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](C(C)C)N)O UEXPMFIAZZHEAD-HSHDSVGOSA-N 0.000 description 1
- YLBNZCJFSVJDRJ-KJEVXHAQSA-N Val-Thr-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O YLBNZCJFSVJDRJ-KJEVXHAQSA-N 0.000 description 1
- QHSSPPHOHJSTML-HOCLYGCPSA-N Val-Trp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N QHSSPPHOHJSTML-HOCLYGCPSA-N 0.000 description 1
- SSKKGOWRPNIVDW-AVGNSLFASA-N Val-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SSKKGOWRPNIVDW-AVGNSLFASA-N 0.000 description 1
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 1
- ADNBOFQFSCPWBA-RJMJUYIDSA-N [Na].O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O Chemical compound [Na].O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O ADNBOFQFSCPWBA-RJMJUYIDSA-N 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 238000001994 activation Methods 0.000 description 1
- 125000004423 acyloxy group Chemical group 0.000 description 1
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010039538 alanyl-glycyl-aspartyl-valine Proteins 0.000 description 1
- 150000004703 alkoxides Chemical class 0.000 description 1
- 125000005278 alkyl sulfonyloxy group Chemical group 0.000 description 1
- 150000001412 amines Chemical class 0.000 description 1
- 235000001014 amino acid Nutrition 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- WGQKYBSKWIADBV-UHFFFAOYSA-N aminomethyl benzene Natural products NCC1=CC=CC=C1 WGQKYBSKWIADBV-UHFFFAOYSA-N 0.000 description 1
- 230000000844 anti-bacterial effect Effects 0.000 description 1
- 230000003110 anti-inflammatory effect Effects 0.000 description 1
- 230000000840 anti-viral effect Effects 0.000 description 1
- 239000000010 aprotic solvent Substances 0.000 description 1
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 1
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 1
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 229910052786 argon Inorganic materials 0.000 description 1
- 125000004391 aryl sulfonyl group Chemical group 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 125000003236 benzoyl group Chemical group [H]C1=C([H])C([H])=C(C([H])=C1[H])C(*)=O 0.000 description 1
- 235000019445 benzyl alcohol Nutrition 0.000 description 1
- 150000003938 benzyl alcohols Chemical class 0.000 description 1
- WGQKYBSKWIADBV-UHFFFAOYSA-O benzylaminium Chemical compound [NH3+]CC1=CC=CC=C1 WGQKYBSKWIADBV-UHFFFAOYSA-O 0.000 description 1
- HMQPEDMEOBLSQB-RCBHQUQDSA-N beta-D-Galp-(1->3)-alpha-D-GlcpNAc Chemical compound CC(=O)N[C@H]1[C@@H](O)O[C@H](CO)[C@@H](O)[C@@H]1O[C@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 HMQPEDMEOBLSQB-RCBHQUQDSA-N 0.000 description 1
- 239000011230 binding agent Substances 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 125000001246 bromo group Chemical group Br* 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 125000004063 butyryl group Chemical group O=C([*])C([H])([H])C([H])([H])C([H])([H])[H] 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 235000013339 cereals Nutrition 0.000 description 1
- 125000001309 chloro group Chemical group Cl* 0.000 description 1
- 125000002668 chloroacetyl group Chemical group ClCC(=O)* 0.000 description 1
- OEYIOHPDSNJKLS-UHFFFAOYSA-N choline Chemical compound C[N+](C)(C)CCO OEYIOHPDSNJKLS-UHFFFAOYSA-N 0.000 description 1
- 229960001231 choline Drugs 0.000 description 1
- 230000008133 cognitive development Effects 0.000 description 1
- 238000004440 column chromatography Methods 0.000 description 1
- 235000020247 cow milk Nutrition 0.000 description 1
- UHDGCWIWMRVCDJ-ZAKLUEHWSA-N cytidine Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-ZAKLUEHWSA-N 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 238000003795 desorption Methods 0.000 description 1
- MWPIIMNHWGOFBL-UHFFFAOYSA-N dichloromethane;toluene Chemical compound ClCCl.CC1=CC=CC=C1 MWPIIMNHWGOFBL-UHFFFAOYSA-N 0.000 description 1
- 235000015872 dietary supplement Nutrition 0.000 description 1
- ZBCBWPMODOFKDW-UHFFFAOYSA-N diethanolamine Chemical compound OCCNCCO ZBCBWPMODOFKDW-UHFFFAOYSA-N 0.000 description 1
- LXCYSACZTOKNNS-UHFFFAOYSA-N diethoxy(oxo)phosphanium Chemical compound CCO[P+](=O)OCC LXCYSACZTOKNNS-UHFFFAOYSA-N 0.000 description 1
- HPNMFZURTQLUMO-UHFFFAOYSA-N diethylamine Chemical compound CCNCC HPNMFZURTQLUMO-UHFFFAOYSA-N 0.000 description 1
- ZZVUWRFHKOJYTH-UHFFFAOYSA-N diphenhydramine Chemical group C=1C=CC=CC=1C(OCCN(C)C)C1=CC=CC=C1 ZZVUWRFHKOJYTH-UHFFFAOYSA-N 0.000 description 1
- 125000005982 diphenylmethyl group Chemical group [H]C1=C([H])C([H])=C(C([H])=C1[H])C([H])(*)C1=C([H])C([H])=C([H])C([H])=C1[H] 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 239000003480 eluent Substances 0.000 description 1
- 230000013020 embryo development Effects 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 125000001495 ethyl group Chemical group [H]C([H])([H])C([H])([H])* 0.000 description 1
- 238000001704 evaporation Methods 0.000 description 1
- 230000008020 evaporation Effects 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 125000001153 fluoro group Chemical group F* 0.000 description 1
- 239000006260 foam Substances 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 125000002485 formyl group Chemical group [H]C(*)=O 0.000 description 1
- 235000013376 functional food Nutrition 0.000 description 1
- 125000000524 functional group Chemical group 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 150000002270 gangliosides Chemical class 0.000 description 1
- 239000007789 gas Substances 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 1
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- 235000020251 goat milk Nutrition 0.000 description 1
- 125000005843 halogen group Chemical group 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 150000002430 hydrocarbons Chemical group 0.000 description 1
- 125000004435 hydrogen atom Chemical class [H]* 0.000 description 1
- 210000000987 immune system Anatomy 0.000 description 1
- 230000034435 immune system development Effects 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 229910001853 inorganic hydroxide Inorganic materials 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 125000002346 iodo group Chemical group I* 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 125000000959 isobutyl group Chemical group [H]C([H])([H])C([H])(C([H])([H])[H])C([H])([H])* 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 125000001449 isopropyl group Chemical group [H]C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 1
- 230000006651 lactation Effects 0.000 description 1
- 150000007517 lewis acids Chemical class 0.000 description 1
- 238000004895 liquid chromatography mass spectrometry Methods 0.000 description 1
- 108010073210 lysyl-glutamyl-aspartyl-tryptophan Proteins 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 229960003194 meglumine Drugs 0.000 description 1
- NGYIMTKLQULBOO-UHFFFAOYSA-L mercury dibromide Chemical compound Br[Hg]Br NGYIMTKLQULBOO-UHFFFAOYSA-L 0.000 description 1
- FQGYCXFLEQVDJQ-UHFFFAOYSA-N mercury dicyanide Chemical compound N#C[Hg]C#N FQGYCXFLEQVDJQ-UHFFFAOYSA-N 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 230000009401 metastasis Effects 0.000 description 1
- 108010085203 methionylmethionine Proteins 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 125000001160 methoxycarbonyl group Chemical group [H]C([H])([H])OC(*)=O 0.000 description 1
- 125000004170 methylsulfonyl group Chemical group [H]C([H])([H])S(*)(=O)=O 0.000 description 1
- 239000002480 mineral oil Substances 0.000 description 1
- 235000010446 mineral oil Nutrition 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000008450 motivation Effects 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 125000004108 n-butyl group Chemical group [H]C([H])([H])C([H])([H])C([H])([H])C([H])([H])* 0.000 description 1
- 125000001280 n-hexyl group Chemical group C(CCCCC)* 0.000 description 1
- 125000004123 n-propyl group Chemical group [H]C([H])([H])C([H])([H])C([H])([H])* 0.000 description 1
- 125000001624 naphthyl group Chemical group 0.000 description 1
- CERZMXAJYMMUDR-UHFFFAOYSA-N neuraminic acid Natural products NC1C(O)CC(O)(C(O)=O)OC1C(O)C(O)CO CERZMXAJYMMUDR-UHFFFAOYSA-N 0.000 description 1
- 125000001416 neuraminosyl group Chemical group 0.000 description 1
- 230000007996 neuronal plasticity Effects 0.000 description 1
- 125000000449 nitro group Chemical group [O-][N+](*)=O 0.000 description 1
- 125000004433 nitrogen atom Chemical group N* 0.000 description 1
- 239000012038 nucleophile Substances 0.000 description 1
- 230000000269 nucleophilic effect Effects 0.000 description 1
- 239000002773 nucleotide Substances 0.000 description 1
- 125000003729 nucleotide group Chemical group 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 230000035764 nutrition Effects 0.000 description 1
- 150000002894 organic compounds Chemical class 0.000 description 1
- 125000004430 oxygen atom Chemical group O* 0.000 description 1
- 125000003232 p-nitrobenzoyl group Chemical group [N+](=O)([O-])C1=CC=C(C(=O)*)C=C1 0.000 description 1
- 229910052763 palladium Inorganic materials 0.000 description 1
- 229940051027 pasteurella multocida Drugs 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 230000001717 pathogenic effect Effects 0.000 description 1
- 239000000546 pharmaceutical excipient Substances 0.000 description 1
- 108010082795 phenylalanyl-arginyl-arginine Proteins 0.000 description 1
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 1
- 125000003170 phenylsulfonyl group Chemical group C1(=CC=CC=C1)S(=O)(=O)* 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 229910000027 potassium carbonate Inorganic materials 0.000 description 1
- NTTOTNSKUYCDAV-UHFFFAOYSA-N potassium hydride Chemical compound [KH] NTTOTNSKUYCDAV-UHFFFAOYSA-N 0.000 description 1
- 229910000105 potassium hydride Inorganic materials 0.000 description 1
- LPNYRYFBWFDTMA-UHFFFAOYSA-N potassium tert-butoxide Chemical compound [K+].CC(C)(C)[O-] LPNYRYFBWFDTMA-UHFFFAOYSA-N 0.000 description 1
- 235000013406 prebiotics Nutrition 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 108090000765 processed proteins & peptides Proteins 0.000 description 1
- 125000001501 propionyl group Chemical group O=C([*])C([H])([H])C([H])([H])[H] 0.000 description 1
- 102000004169 proteins and genes Human genes 0.000 description 1
- 238000000425 proton nuclear magnetic resonance spectrum Methods 0.000 description 1
- UMJSCPRVCHMLSP-UHFFFAOYSA-N pyridine Natural products COC1=CC=CN=C1 UMJSCPRVCHMLSP-UHFFFAOYSA-N 0.000 description 1
- 238000002708 random mutagenesis Methods 0.000 description 1
- 239000000376 reactant Substances 0.000 description 1
- 238000001953 recrystallisation Methods 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 230000007363 regulatory process Effects 0.000 description 1
- 229920006395 saturated elastomer Polymers 0.000 description 1
- 238000013341 scale-up Methods 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 125000002914 sec-butyl group Chemical group [H]C([H])([H])C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 1
- 239000000741 silica gel Substances 0.000 description 1
- 229910002027 silica gel Inorganic materials 0.000 description 1
- 238000010898 silica gel chromatography Methods 0.000 description 1
- 229960001866 silicon dioxide Drugs 0.000 description 1
- LKZMBDSASOBTPN-UHFFFAOYSA-L silver carbonate Substances [Ag].[O-]C([O-])=O LKZMBDSASOBTPN-UHFFFAOYSA-L 0.000 description 1
- 229910001958 silver carbonate Inorganic materials 0.000 description 1
- QRUBYZBWAOOHSV-UHFFFAOYSA-M silver trifluoromethanesulfonate Chemical compound [Ag+].[O-]S(=O)(=O)C(F)(F)F QRUBYZBWAOOHSV-UHFFFAOYSA-M 0.000 description 1
- 239000002002 slurry Substances 0.000 description 1
- 229910052708 sodium Inorganic materials 0.000 description 1
- 239000001632 sodium acetate Substances 0.000 description 1
- 235000017281 sodium acetate Nutrition 0.000 description 1
- 239000012312 sodium hydride Substances 0.000 description 1
- AKHNMLFCWUSKQB-UHFFFAOYSA-L sodium thiosulfate Chemical compound [Na+].[Na+].[O-]S([O-])(=O)=S AKHNMLFCWUSKQB-UHFFFAOYSA-L 0.000 description 1
- 235000019345 sodium thiosulphate Nutrition 0.000 description 1
- 230000000707 stereoselective effect Effects 0.000 description 1
- 238000003756 stirring Methods 0.000 description 1
- WPLOVIFNBMNBPD-ATHMIXSHSA-N subtilin Chemical compound CC1SCC(NC2=O)C(=O)NC(CC(N)=O)C(=O)NC(C(=O)NC(CCCCN)C(=O)NC(C(C)CC)C(=O)NC(=C)C(=O)NC(CCCCN)C(O)=O)CSC(C)C2NC(=O)C(CC(C)C)NC(=O)C1NC(=O)C(CCC(N)=O)NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C1NC(=O)C(=C/C)/NC(=O)C(CCC(N)=O)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)CNC(=O)C(NC(=O)C(NC(=O)C2NC(=O)CNC(=O)C3CCCN3C(=O)C(NC(=O)C3NC(=O)C(CC(C)C)NC(=O)C(=C)NC(=O)C(CCC(O)=O)NC(=O)C(NC(=O)C(CCCCN)NC(=O)C(N)CC=4C5=CC=CC=C5NC=4)CSC3)C(C)SC2)C(C)C)C(C)SC1)CC1=CC=CC=C1 WPLOVIFNBMNBPD-ATHMIXSHSA-N 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- DKVBOUDTNWVDEP-NJCHZNEYSA-N teicoplanin aglycone Chemical compound N([C@H](C(N[C@@H](C1=CC(O)=CC(O)=C1C=1C(O)=CC=C2C=1)C(O)=O)=O)[C@H](O)C1=CC=C(C(=C1)Cl)OC=1C=C3C=C(C=1O)OC1=CC=C(C=C1Cl)C[C@H](C(=O)N1)NC([C@H](N)C=4C=C(O5)C(O)=CC=4)=O)C(=O)[C@@H]2NC(=O)[C@@H]3NC(=O)[C@@H]1C1=CC5=CC(O)=C1 DKVBOUDTNWVDEP-NJCHZNEYSA-N 0.000 description 1
- 125000000999 tert-butyl group Chemical group [H]C([H])([H])C(*)(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 229940124597 therapeutic agent Drugs 0.000 description 1
- 150000003569 thioglycosides Chemical class 0.000 description 1
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 1
- 150000003613 toluenes Chemical class 0.000 description 1
- 125000002088 tosyl group Chemical group [H]C1=C([H])C(=C([H])C([H])=C1C([H])([H])[H])S(*)(=O)=O 0.000 description 1
- 239000003440 toxic substance Substances 0.000 description 1
- 229910001428 transition metal ion Inorganic materials 0.000 description 1
- 125000001889 triflyl group Chemical group FC(F)(F)S(*)(=O)=O 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- 231100000925 very toxic Toxicity 0.000 description 1
- 229940118696 vibrio cholerae Drugs 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 108010027345 wheylin-1 peptide Proteins 0.000 description 1
- 238000010626 work up procedure Methods 0.000 description 1
Images
Classifications
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P1/00—Drugs for disorders of the alimentary tract or the digestive system
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07H—SUGARS; DERIVATIVES THEREOF; NUCLEOSIDES; NUCLEOTIDES; NUCLEIC ACIDS
- C07H5/00—Compounds containing saccharide radicals in which the hetero bonds to oxygen have been replaced by the same number of hetero bonds to halogen, nitrogen, sulfur, selenium, or tellurium
- C07H5/04—Compounds containing saccharide radicals in which the hetero bonds to oxygen have been replaced by the same number of hetero bonds to halogen, nitrogen, sulfur, selenium, or tellurium to nitrogen
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P3/00—Drugs for disorders of the metabolism
- A61P3/02—Nutrients, e.g. vitamins, minerals
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07H—SUGARS; DERIVATIVES THEREOF; NUCLEOSIDES; NUCLEOTIDES; NUCLEIC ACIDS
- C07H1/00—Processes for the preparation of sugar derivatives
- C07H1/06—Separation; Purification
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07H—SUGARS; DERIVATIVES THEREOF; NUCLEOSIDES; NUCLEOTIDES; NUCLEIC ACIDS
- C07H3/00—Compounds containing only hydrogen atoms and saccharide radicals having only carbon, hydrogen, and oxygen atoms
- C07H3/06—Oligosaccharides, i.e. having three to five saccharide radicals attached to each other by glycosidic linkages
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/14—Preparation of compounds containing saccharide radicals produced by the action of a carbohydrase (EC 3.2.x), e.g. by alpha-amylase, e.g. by cellulase, hemicellulase
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/18—Preparation of compounds containing saccharide radicals produced by the action of a glycosyl transferase, e.g. alpha-, beta- or gamma-cyclodextrins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/26—Preparation of nitrogen-containing carbohydrates
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/44—Preparation of O-glycosides, e.g. glucosides
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02P—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
- Y02P20/00—Technologies relating to chemical industry
- Y02P20/50—Improvements relating to the production of bulk chemicals
- Y02P20/55—Design of synthesis routes, e.g. reducing the use of auxiliary or protecting groups
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Biochemistry (AREA)
- Biotechnology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Molecular Biology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Medicinal Chemistry (AREA)
- Animal Behavior & Ethology (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Pharmacology & Pharmacy (AREA)
- Obesity (AREA)
- Hematology (AREA)
- Diabetes (AREA)
- Nutrition Science (AREA)
- Biomedical Technology (AREA)
- Saccharide Compounds (AREA)
- Pharmaceuticals Containing Other Organic And Inorganic Compounds (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Cosmetics (AREA)
Abstract
본 발명은 트랜스시알화 반응에 의하여 일반 화학식 1A의 화합물들 및 그의 염들의 합성 방법에 관한 것으로서, 일반 화학식 1A에서 R 기들 중의 하나는 α-시알릴 모이어티, 다른 하나는 H이고, X1은 탄수화물 링커를 나타내고, A는 선택적으로 푸코실로 치환된 D-글루코피라노실 단위이고, R1은 가수소분해에 의해 제거될 수 있는 보호기이고, m은 0 또는 1의 정수이다.
Description
본 발명은 시알로올리고당 배당체(glycoside)들의 효소적 합성, 및 상기 합성에 관여하는 신규한 전구체 및 결과물에 관한 것이다.
시알산(sialic acid)들은 9탄당 뉴라민산(neuraminic acid)의 유도체들이고, C-4, C-7, C-8 및 C-9이 다양한 모이어티(moiety)들로 치환될 수 있는 3 가지 모분자(parent molecule)들, N-아세틸-(Neu5Ac)(N-acetyl-(Neu5Ac)), N-글라이콜릴-(Neu5Gc)(N-glycolyl-(Neu5Gc)) 및 데아미노-뉴라민산(deamino-neuraminic acid)(3-deoxy-D-glycero-D-galacto-nonulosonic acid, KDN)을 망라한다. 그것들은 배발생(embryogenesis)에서부터 병원체 상호작용(pathogen interactions)에 대한 신경 가소성(neural plasticity)에 이르는 많은 주요한 생물학적 역할들을 가진다. 비록 그것들이 좀처럼 유리된 형태(free from)로 나타나지는 않지만, 그것들은 항상 당단백질(glycoprotein)들 및 당지질(glycolipid)들의 올리고당 측쇄의 비-환원 말단(non-reducing terminus) 또는 내부 위치에 있는 화학적 공유 결합(covalent linkage)들에서 발견된다. 갈락토스(galactose,), N-아세틸-갈락토사민(N-acetyl-galactosamine) 및 N-아세틸-글루코사민(N-acetyl-glucosamine)과 같은 끝에서 두 번째 당들에 결합된 시알산의 상기 결합들은 대부분 α-2,3- 및 α-2,6-케토사이드 결합(ketosidic bond)이다.
시알로글라이코 결합체(sialoglycoconjugate)들 중에서, 항박테리아성, 항바이러스성, 면역계 및 인지발달(cognitive development) 향상 활성과 같은 독특한 생물학적 활성과 직접적으로 연관되어 있는 시알릴화 된 인유 올리고당(human milk oligosaccharide, HMO)들은 매우 중요하다. 시알릴화 된 인유 올리고당들은 인간 내장계(intestinal system)에서 장내세균총(intestinal flora)의 발달과 유지를 돕는 프리바이오틱스(prebiotics)와 같이 작용하는 것으로 밝혀졌다. 더욱이, 그것들은 또한 항-염증성을 가지고, 따라서 이러한 화합물들은 합성적으로 구성되거나 자연적으로 발생한 화합물들 및 그의 염들 모두, 예를 들어, 영아용 제조분유(infant formulas), 영아용 씨리얼(infant cereals), 임상적 영아용 영양 제품(clinical infant nutritional product), 유아용 제조분유(toddler formulas), 또는 어린이, 성인, 노령 또는 수유 중의 여성을 위한 식이보충제(dietary supplements)나 건강 기능성 식품(health functional food)의 생산을 위한 영양 산업에서 매력적인 성분이다. 마찬가지로, 상기 화합물들은 또한 치료제의 생산을 위한 제약 산업에서도 관심거리이다. 상기 인유 올리고당에서, 상기 시알산 잔기는 항상 D-갈락토스의 말단 3-O- 및/또는 6-O- 위치(들)에 α-글라이코사이드 결합(α-glycosidic linkage)을 통해 연결되어 있다.
자연적으로 발생하는 시알릴화 된 인유 올리고당의 입수가능성은 제한적이다. 성숙 인유(mature human milk)는 가장 높은 농도(12-14 g/l)의 유 올리고당(milk oligosaccharide)들을 함유하는 자연적인 유원(milk source)이고, 다른 유원들은 소의 젖(0.01 g/l), 염소의 젖 및 다른 포유동물들에서 유래한 젖이 있다. 이러한 낮은 자연적 입수가능성과 어려운 분리 방법들은 이러한 매력적인 화합물들의 생산을 위한 생물공학적 및 화학적 방법론들의 발전에 중요한 동기가 된다.
HPLC Chip/MS(microchip liquid chromatography mass spectrometry) 및 MALDI-FT ICR MS(matrix-assisted laser desorption/ionization Fourier transform ion cyclotron resonance mass spectrometry)를 포함하는 기술들의 조합에 의하여 인유로부터 약 200개의 HMO들이 검출되었고(Ninonuevo et al. J. Agric. Food Chem. 54, 7471 (2006)), 지금까지 적어도 115개의 올리고당들이 구조적으로 결정되었다(Urashima et al.: Milk Oligosaccharides, Nova Medical Books, NY, 2011). 상기와 같은 인유 올리고당들은 13개의 코어 단위들로 그룹화될 수 있다(표 1). 올리고당들의 약 1/4이 시알산을 포함한다.
인유 올리고당( HMO )들의 13개의 다른 코어 구조들 | ||
No | Core name | Core structure |
1 | 락토스 (lactose, Lac) | Galβ1-4Glc |
2 | 락토-N-테트라오스 (lacto-N-tetraose, LNT) |
Galβ1-3GlcNAcβ1-3Galβ1-4Glc |
3 | 락토-N-네오테트라오스 (lacto-N-neotetraose, LNnT) | Galβ1-4GlcNAcβ1-3Galβ1-4Glc |
4 | 락토-N-헥사오스 (lacto-N-hexaose, LNH) |
Galβ1-3GlcNAcβ1-3(Galβ1-4GlcNAcβ1-6)Galβ1-4Glc |
5 | 락토-N-네오헥사오스 (lacto-N-neohexaose, LNnH) |
Galβ1-4GlcNAcβ1-3(Galβ1-4GlcNAcβ1-6)Galβ1-4Glc |
6 | 파라-락토-N-헥사오스 (para-lacto-N-hexaose, para-LNH) |
Galβ1-3GlcNAcβ1-3Galβ1-4GlcNAcβ1-3Galβ1-4Glc |
7 | 파라-락토-N-네오헥사오스 (para-lacto-N-neohexaose, para-LNnH) |
Galβ1-4GlcNAcβ1-3Galβ1-4GlcNAcβ1-3Galβ1-4Glc |
8 | 락토-N-옥타오스 (lacto-N-octaose, LNO) |
Galβ1-3GlcNAcβ1-3(Galβ1-4GlcNAcβ1-3Galβ1-4GlcNAcβ1-6)Galβ1-4Glc |
9 | 락토-N-네오옥타오스 (lacto-N-neooctaose, LNnO) |
Galβ1-4GlcNAcβ1-3(Galβ1-3GlcNAcβ1-3Galβ1-4GlcNAcβ1-6)Galβ1-4Glc |
10 | 아이소-락토-N-옥타오스 (Iso-lacto-N-octaose, iso-LNO) |
Galβ1-3GlcNAcβ1-3(Galβ1-3GlcNAcβ1-3Galβ1-4GlcNAcβ1-6)Galβ1-4Glc |
11 | 파라-락토-N-옥타오스 (para-lacto-N-octaose, para-LNO) |
Galβ1-3GlcNAcβ1-3Galβ1-4GlcNAcβ1-3Galβ1-4GlcNAcβ1-3Galβ1-4Glc |
12 | 락토-N-네오데카오스 (Lacto-N-neodecaose, LNnD) |
Galβ1-3GlcNAcβ1-3[Galβ1-4GlcNAcβ1-3(Galβ1-4GlcNAcβ1-6)Galβ1-4GlcNAcβ1-6]Galβ1-4Glc |
13 | 락토-N-데카오스 (Lacto-N-decaose, LND) |
Galβ1-3GlcNAcβ1-3[Galβ1-3GlcNAcβ1-3(Galβ1-4GlcNAcβ1-6)Galβ1-4GlcNAcβ1-6]Galβ1-4Glc |
많은 수의 유사 올리고당들이 존재하기 때문에 인간 및 다른 포유동물들의 젖으로부터 심지어는 밀리그람 양의 시알로올리고당들을 분리하는 것조차 상당히 어렵다. 지금까지, 오직 분석용 HPLC 방법론들만이 자연적인 원천으로부터 일부 시알로올리고당들의 분리를 위해 개발되어 왔다.
복잡한 시알로올리고당들의 합성은 보호(protection) 및 탈보호(deprotection) 전략들을 활용하는 다단계 합성 경로를 따른다. 입체선택성(stereoselective) 화학적 합성 과정들은 보호기(protecting group)들의 광범위한 사용으로 인하여 매우 복잡해질 수 있다. 이러한 전략들은 글라이코할라이드(glycosylhalide), 싸이오글라이코사이드(thioglycoside) 또는 아인산 다이에틸(diethyl phosphite) 공여체(donor) 활성화(activations)를 이용한 적절한 보호된 글라이코실(glycosyl) 수용체(acceptor)들의 입체선택성 O-시알릴화(O-sialylation)를 통하여 시알릴화 된 올리고당들을 생성한다. 사이안화 수은(mercury cyanide), 브롬화 수은(mercury bromide) 및 탄산 은(silver carbonate)과 같은, 시알릴화를 위한 매우 비싸거나 또는 매우 유독한 화학약품들은 이러한 방법론들이 매력적이지 않게 하는 이유들 중의 하나이다. 마찬가지로, 비효율적인 입체조절(stereocontrol) 및/또는 낮은 수율도 상기 전략들을 추가로 개발하는 것이 적합하지 않게 한다. 추가적으로, 이러한 전략들은 심각한 정제(purification) 상의 곤란성으로 특징지어진다.
시알로올리고당들의 효소적 생산의 경우에, 시알릴트랜스퍼라아제(sialyltransferase)들과 시알리다아제(sialidase)들이 선호되는 효소들로서 이용되어 왔다. 이러한 복잡한 효소적 체계들은 대량 생산(scale-up production)에 매우 돈이 많이 드는 방법론들 중 대표적인 것이고, 어려운 정제 프로토콜들 또한 마찬가지로 추가적인 기술 발전에 장애가 된다. 시알리다아제들은 낮은 수율과 위치- 및 입체선택성의 결여로 인해, 양산 방법론으로서 성공적으로 이용될 수 없다. 비록, 일부의 경우들에서, 시알릴트랜스퍼라아제 효소들이 복합한 시알로올리고당들(예, 3'-O-(N-아세틸-뉴라미노실)-락토스 나트륨(3'-O-(N-acetyl-neuraminosyl)-lactose sodium) 염의 1-O-β-벤질 배당체(1-O-β-benzyl glycoside)의 합성: WO 96/32492; 3'-O-(N-아세틸-뉴라미노실)-락토스 나트륨(3'-O-(N-acetyl-neuraminosyl)-lactose sodium) 염의 1-O-β-(4,5-다이메톡시-2-나이트로)-벤질 배당체(1-O-β-(4,5-dimethoxy-2-nitro)-benzyl glycoside)의 합성: Cohen et al. J. Org. Chem. 65, 6145 (2000))의 합성에 효율적인 것으로 밝혀졌지만, 시알산 부분을 수용체 올리고당으로 전이시키기 위한 시알릴 공여체로서 CMP-활성화된 시알산(사이티딘 5'-모노포스포시알산(cytidine 5'-monophosphosialic acid))-사실, 입수가능성이 상당히 제한적이지만-의 필요가 그것들의 유용성을 제한한다.
트랜스시알리다아제 반응에서 기질로서 작용하는 T. 크루지(T. cruzi) 유래 뮤신(mucin) 올리고당들의 N-아세틸-락토사민 벤질 배당체(N-acetyl-lactosamine benzyl glycoside) 및 벤질 배당체들이 연구되어 왔다(Lubineau et al. Carbohydr. Res. 300, 161 (1997); Agustㅽ et al. Bioorg. Med. Chem. 15, 2611 (2007)).
시알리다아제에 의하여 보호되지 않았거나 또는 아노머성으로 치환된 갈락토스, 락토스 또는 N-아세틸-락토사민(N-acetyl-lactosamine) 유도체들이 낮은 수율로 위치선택적 시알화됨이 보고되어 왔다(Thiem et al. Angew. Chem. Int. Ed. Eng. 30, 1503 (1991); Schmidt et al. Chem. Comm. 1919 (2000); Schmidt et al. J. Org. Chem. 65, 8518 (2000)).
유전학적으로 조작된 박테리아, 효모 또는 다른 미생물들을 이용하는 일부 생물공학적 방법론들 또한 보고되고 있다. 그러한 방법들은 제한된 상품화 기회로 인해 조절 과정(regulatory processes)에서 심각한 문제점을 가진다.
시알로글라이코 결합체들은 산 및 염기와 같은 특정 반응 조건 하에서 불안정한 것으로 알려져 있다. 실제로, 그것들은 자가-가수분해될 수 있다. 따라서, 이러한 화합물의 제조와 정제를 위한 조건들은 신중하게 선택되어야 한다.
요컨대, 자연적인 원천의 풀, 예, 인유에 존재하는 많은 수의 올리고당들로 인하여, 분리 기술들은 대량의 시알로올리고당들을 제공할 수 없었다. 또한, 극히 유사한 구조로 특징화되는 구조이성질체(regioisomer)들의 존재는 분리 기술들이 성공하기 더욱 힘들게 한다. 효소적 방법론들은 유전학적으로 조작된 미생물들에서 생산된 효소들을 사용하기 때문에, 효소의 낮은 입수가능성, 매우 높은 당 뉴클레오타이드 공여체 가격 및 조절 상의 어려움들이 있다. 생물공학을 통한 올리고당들의 제조는 몇몇의 비자연적인 글라이코실화(glycosylation) 산물의 잠재적인 형성으로 인하여 조절 상의 큰 장애가 있다. 일반적으로, 시알로올리고당들을 합성하기 위해 개발된 모든 화학적 방법들은 심지어는 수 그람(multigram) 양의 표적 화합물들을 제조하는 것조차 방해하는 몇몇의 문제점들이 있다(예, 대응하는 벤질 배당체를 통한 3'-O- 및 6'-O-(N-아세틸-뉴라미노실)-락토스(6'-O-(N-acetyl-neuraminosyl)-lactose)의 합성: Rencurosi et al. Carbohydr. Res. 337, 473 (2002) 참조).
과거 수십년 동안, 시알릴화 된 인유 올리고당의 제조 및 상업화에 대한 관심은 점점 증대되고 있다. 제조를 단순화하고, 선행 기술의 방법들에서 맞닥뜨렸던 정제 문제들을 극복 또는 회피할 수 있는 신규한 방법론들에 대한 필요는 여전히 존재한다.
일 측면에서, 본 발명은 트랜스시알리다아제(transsialidase) 활성을 가지는 효소의 촉매 작용 하에서, 화학식 SA-OR2 및 그의 염들의 시알릴 공여체(sialyl donor)가 일반 화학식 2A 및 그의 염들의 시알릴 수용체(sialyl acceptor)와 반응되는 것을 특징으로 하는 일반 화학식 1A의 화합물 및 그의 염들의 합성 방법에 관한 것이고,
화학식 SA-OR2에서, R2는 단-(mono-), 이-(di-) 또는 올리고당(oligosaccharide), 당지질(glycolipid), 당단백질(glycoprotein) 또는 당펩티드(glycopeptide), 고리형(cyclic) 또는 비고리형(acyclic) 지방족 기(aliphatic group), 또는 아릴 잔기(residue)이고, SA는 α-시알릴 모이어티(α-sialyl moiety)이고,
일반 화학식 1A에서, R 기들 중의 하나는 α-시알릴 모이어티, 다른 하나는 H이고, X1은 탄수화물 링커를 나타내고, A는 선택적으로 푸코실(fucosyl)로 치환된 D-글루코피라노실 단위(D-glucopyranosyl unit)이고, R1은 가수소분해(hydrogenolysis)에 의해 제거될 수 있는 보호기(protecting group)이고, m은 0 또는 1의 정수이며,
일반 화학식 2A에서, X1, A, m 및 R1은 상기에서 정의한 바와 같다.
다른 측면에서, 본 발명은 일반 화학식 1A'의 화합물 및 그의 염들을 제공하고,
일반 화학식 1A'에서, R 기들 중의 하나는 α-시알릴 모이어티, 다른 하나는 H이고, R1은 가수소분해에 의해 제거될 수 있는 보호기이고, A는 선택적으로 푸코실로 치환된 D-글루코피라노실 단위이고, m은 0 또는 1의 정수이고, X1은 탄수화물 링커를 나타내며,
단, 3'-O-(N-아세틸-뉴라미노실)-락토스((3'-O-(N-acetyl-neuraminosyl)-lactose)) 나트륨 염의 1-O-β-벤질(1-O-β-benzyl) 및 1-O-β-(4,5-다이메톡시-2-나이트로)-벤질(1-O-β-(4,5-dimethoxy-2-nitro)-benzyl) 배당체들, 및 6'-O-(N-아세틸-뉴라미노실)-락토스(6'-O-(N-acetyl-neuraminosyl)-lactose) 나트륨 염의 1-O-β-벤질(1-O-β-benzyl) 배당체는 제외된다.
본 발명은 아노머성 위치에서 보호된 신규 시알로올리고당들 및 그의 생산에 적합한 방법론을 제공한다.
본 발명은 트랜스시알화 반응에서 수용성 1-O-보호된 올리고당 중간체(1-O-protected oligosaccharide intermediates)의 활용에 기반하고, 상기 1-O-보호기는 가수소분해에 의하여 제거될 수 있다. 바람직하게, 상기 1-O-보호기는 또한 상기 올리고당 중간체에 강력한 정제 과정을 돕는 물리적 및 화학적 특성들을 제공한다. 예를 들어, 소수성 모이어티로서 벤질 또는 치환된 벤질 기와 같은 상기 방향족 기(aromatic group)는 유도체들이 그들의 수용성도(water solubility)는 그대로 두면서, 알콜과 같은 유기 양성자성 용매에 용해될 수 있도록 한다. 이는 광범위한 물/알콜 비율을 갖는 이동상(mobile phases)의 이용 가능성을 연다. 또한, 상기 방향족 기에 치환기들의 신중한 설계함으로써, 일부 경우들에서 생성물 정제에 결정화만을 이용하는 강력한 생산 절차의 개발을 가능하게 하는 결정성 화합물(crystalline compound)들을 수득할 수 있다. 더욱이, 상기 벤질 기를 포함하는 1-O-보호기는 부산물의 생성을 막는 온화하고 정교한 조건 하에서의 최종 단계에서 촉매적 환원(가수소분해)에 의해 제거될 수 있고, 이는 표적 올리고당에 적어도 하나의 시알릴 기(sialyl group)가 존재하는 경우에 확실히 유리하다. 상기 촉매적 환원은 수용액 상에서 일어날 수 있다.
본 발명은 하기에서, 본 발명에 따른 3'''-O-시알릴-β-LNnT(3'''-O-sialyl-β-LNnT)의 400 MHz 1H-NMR 스펙트럼을 나타내는 도 1을 참조하여 보다 상세히 설명될 것이다.
올리고당을 합성하기 위해 취해지는 경로에 관계없이, 최종 표적인 비보호된 올리고당은 오직 물에만 용해되는데, 이는 상기 합성의 후속 단계들을 위한 과제(challenge)들을 제공한다. 합성적 생산 과정들에서 흔히 이용되는 유기 용매들은 상기 올리고당 합성의 최종 단계의 반응들에 적합하지 않다.
본 발명은 아노머성 위치에서 보호된 신규 시알로올리고당들 및 그의 생산에 적합한 방법론을 제공한다. 본 발명은 트랜스시알화 반응에서 수용성 1-O-보호된 올리고당 중간체의 활용에 기반하고, 상기 1-O-보호기는 가수소분해에 의하여 제거될 수 있다. 바람직하게, 상기 1-O-보호기는 또한 상기 올리고당 중간체에 강력한 정제 과정을 돕는 물리적 및 화학적 특성들을 제공한다. 예를 들어, 소수성 모이어티로서 벤질 또는 치환된 벤질 기와 같은 상기 방향족 기는 유도체들이 그들의 수용성도는 그대로 두면서, 알콜과 같은 유기 양성자성 용매에 용해될 수 있도록 한다. 이는 광범위한 물/알콜 비율을 갖는 이동상의 이용 가능성을 연다. 또한, 상기 방향족 기에 치환기들의 신중한 설계함으로써, 일부 경우들에서 생성물 정제에 결정화만을 이용하는 강력한 생산 절차의 개발을 가능하게 하는 결정성 화합물들을 수득할 수 있다. 더욱이, 상기 벤질 기를 포함하는 1-O-보호기는 부산물의 생성을 막는 온화하고 정교한 조건 하에서의 최종 단계에서 촉매적 환원(가수소분해)에 의해 제거될 수 있고, 이는 표적 올리고당에 적어도 하나의 시알릴 기가 존재하는 경우에 확실히 유리하다. 상기 촉매적 환원은 수용액 상에서 일어날 수 있다.
일반적인 용어들
본 출원의 전체에 걸쳐, 시알릴 공여체들과 일반 화학식 1의 화합물에 존재하는 "α-시알릴 모이어티" 또는 "시알릴 모이어티(sialyl moiety)" 용어는 식(scheme) 1의 N-아세틸 뉴라민산(N-acetyl neuraminic acid)의 예로 도시되는 바와 같이, α-글라이코사이드 결합(α-glycosidic linkage)을 갖는, 모든 자연적으로 발생하거나 또는 변형된 뉴라민산 또는 시알산 유도체들의 글라이코실 모이어티(glycosyl moiety) 및 그의 유사체(analogue)들을 일컫는다. 바람직한 뉴라민산들은 N-아세틸- (N-acetyl-, Neu5Ac), N-글라이콜릴- (N-glycolyl-, Neu5Gc) 및 데아미노-뉴라민산(deamino-neuraminic acid) (3-deoxy-D-glycero-D-galacto-nonulosonic acid, KDN)이다. 또한, 링커들, 반응성 기능기들, 검출가능한 라벨들 또는 표적 모이어티들과 함께 유도되거나/유도되고, C-4, C-7-, C-8 및/또는 C-9가 아실옥시(acyloxy), 알콕시(alkoxy), 할로젠(halogen) 또는 아지도(azido)로 치환된 Neu5Ac, Neu5Gc 및 KDN 유도체들도 포함된다. 더욱 바람직한 O-치환기(O-substituent)들은 아세틸(C-4, C-7-, C-8 및/또는 C-9에), 락틸(C-9에), 메틸(C-8에), 황산(sulphate)(C-8에) 또는 인산(phosphate)(C-8에)이다. 아미노 기(amino group) 상의 상기 바람직한 치환기들은 글라이콜릴(glycolyl) 및 아세토아세틸(acetoacetyl)을 포함하는 아실들이다.
식(scheme) 1
상기 "가수소분해에 의해 제거되는 보호기"는 1-산소의 C-O 결합이 촉매량의 팔라듐(palladium), 라니 니켈(Raney nickel) 또는 가수소분해에 이용되는 것으로 알려진 다른 적절한 금속 촉매들의 존재 하에서 수소의 첨가에 의하여 갈라져 OH 기의 재생성을 야기하는 기들을 일컫는다. 상기와 같은 보호기들은 당업자에게 잘 알려져 있고, Protective Groups in Organic Synthesis, PGM Wuts and TW Greene, John Wiley & Sons 2007에 논의되어 있다. 적절한 보호기들은 벤질(benzyl), 다이페닐메틸(벤즈하이드릴)(diphenylmethyl (benzhydryl)), 1-나프틸메틸(1-naphthylmethyl), 2-나프틸메틸(2-naphthylmethyl) 또는 트라이페닐메틸(triphenylmethyl, trityl) 기들을 포함하고, 이들 각각은 알킬, 알콕시, 페닐, 아미노, 아실아미노, 알킬아미노, 다이알킬아미노, 나이트로, 카복실, 알콕시카보닐, 카바모일(carbamoyl), N-알킬카바모일(N-alkylcarbamoyl), N,N-다이알킬카바모일(N,N-dialkylcarbamoyl), 아지도, 할로젠알킬(halogenalkyl) 또는 할로젠으로부터 선택된 하나 이상의 기들에 의하여 선택적으로 치환될 수 있다. 바람직하게, 상기 치환이 존재한다면, 그러한 치환은 방향족 고리(들)상에 있다. 특히 바람직한 보호기들은 페닐, 알킬 또는 할로젠으로부터 선택된 하나 이상의 기들로 선택적으로 치환된 벤질(benzyl) 또는 2-나프틸메틸(2-naphthylmethyl) 기들이다. 더욱 바람직하게, 상기 보호기는 비치환된 벤질, 비치환된 2-나프틸메틸, 4-클로로벤질(4-chlorobenzyl), 3-페닐벤질(3-phenylbenzyl) 및 4-메틸벤질(4-methylbenzyl)이다. 이러한 특히 바람직하고 더욱 바람직한 보호기들은 상기 가수소분해의 부산물들이 각각 배타적으로 톨루엔(toluene), 2-메틸나프탈렌(2-methylnaphthalene), 또는 치환된 톨루엔 또는 2-메틸나프탈렌 유도체들이라는 이점을 갖는다. 그러한 부산물들은 수용성 올리고당 생성물들로부터 심지어는 수 톤 규모라 할지라도 증발(evaporation) 및/또는 추출 과정들을 통해 용이하게 제거될 수 있다.
본 명세서의 전체에 걸쳐, 용어 "알킬(alkyl)"은 메틸(methyl), 에틸(ethyl), n-프로필(n-propyl), i-프로필(i-propyl), n-뷰틸(n-butyl), i-뷰틸(i-butyl), s-뷰틸(s-butyl), t-뷰틸(t-butyl), n-헥실(n-hexyl) 등과 같이 탄화수소(hydrocarbon) 기가 1-6 탄소 원자들로 포화된 선형 또는 분지형의 사슬을 의미한다.
용어 "아릴(aryl)"은 페닐 또는 나프틸과 같은 유사방향족(homoaromatic) 기를 일컫는다.
본 명세서에서, 용어 "아실(acyl)"은 포밀(formyl), 아세틸(acetyl), 프로피오닐(propionyl), 뷰티릴(butyryl), 피발로일(pivaloyl), 벤조일(benzoyl) 등과 같은 R'-C(=O)-기, 여기서 R'은 H, 알킬(상기 참조) 또는 아릴(상기 참조)을 나타낸다. 상기 알킬 또는 아릴 잔기는 비치환되거나, 또는 알킬(오직 아릴 잔기들의 경우에만), 할로젠, 나이트로, 아릴, 알콕시, 아미노, 알킬아미노, 다이알킬아미노, 카복실, 알콕시카보닐, 카바모일, N-알킬카바모일, N,N-다이알킬카바모일, 아지도, 할로젠알킬 또는 하이드록시알킬(hydroxyalkyl)로부터 선택되는 하나 이상의 기로 치환되어 클로로아세틸(chloroacetyl), 트라이클로로아세틸(trichloroacetyl), 4-클로로벤조일(4-chlorobenzoyl), 4-나이트로벤조일(4-nitrobenzoyl), 4-페닐벤조일(4-phenylbenzoyl), 4-벤자미도벤조일(4-benzamidobenzoyl), 4-(페닐카바모일)-벤조일(4-(phenylcarbamoyl)-benzoyl), 글라이콜릴(glycolyl), 아세토아세틸(acetoacetyl) 등과 같은 아세틸 기들을 생성할 수 있다.
용어 "알킬옥시(alkyloxy)" 또는 "알콕시(alkoxy)"는 메톡시(methoxy), 에톡시(ethoxy), t-뷰톡시(t-butoxy) 등과 같이 산소 원자를 통해 모 분자 모이어티에 부착된 알킬 기(상기 참조)를 의미한다.
"할로젠(halogen)"은 플루오로(fluoro), 클로로(chloro), 브로모(bromo) 또는 아이오도(iodo))를 의미한다.
"아미노(amino)"는 -NH2 기를 일컫는다.
"알킬아미노(alkylamino)"는 메틸아미노(methylamino), 에틸아미노(ethylamino) 등과 같이 -NH-기를 통해 모 분자 모이어티에 부착된 알킬 기(상기 참조)를 의미한다.
"다이알킬아미노(dialkylamino)"는 다이메틸아미노(dimethylamino), 다이에틸아미노(diethylamino) 등과 같이 질소 원자를 통해 모 분자 모이어티에 부착된 동일 또는 상이한 두 개의 알킬 기(상기 참조)들을 의미한다.
"아실아미노(acylamino)"는 아세틸아미노(acetylamino, acetamido), 벤조일아미노(benzoylamino, benzamido) 등과 같이 -NH-기를 통해 모 분자 모이어티에 부착된 아실 기(상기 참조)를 일컫는다.
"카복실(carboxyl)"은 -COOH 기를 나타낸다.
"알킬옥시카보닐(alkyloxycarbonyl)"은 메톡시카보닐(methoxycarbonyl), t-뷰톡시카보닐(t-butoxycarbonyl) 등과 같이 -C(=O)-기를 통해 모 분자 모이어티에 부착된 알킬옥시 기(상기 참조)를 의미한다.
"카바모일(carbamoyl)"은 H2N-C(=O)-기이다.
"N-알킬카바모일(N-Alkylcarbamoyl)"은 N-메틸카바모일(N-methylcarbamoyl) 등과 같이 -HN-C(=O)-기를 통해 모 분자 모이어티에 부착된 알킬 기(상기 참조)를 의미한다.
"N,N-다이알킬카바모일(N,N-dialkylcarbamoyl)"은 N,N-메틸카바모일(N,N-methylcarbamoyl) 등과 같이 >N-C(=O)-기를 통해 모 분자 모이어티에 부착된 동일 또는 상이한 두 개의 알킬 기(상기 참조)들을 의미한다.
본 명세서에서, 적어도 하나의 시알릴 잔기를 포함하는 일반 화학식 1 및 2와 화학식 SA-OR2와 관련된 용어 "염(salt)"은 음전하를 띠는 산 잔기(acid residue)와 비화학량적 비율에 맞는 하나 이상의 양이온들로 구성되는 조합된 이온 쌍(associated ion pair)을 의미한다. 본 명세서에서 의미하는 양이온들은 양 전하를 띠는 원자들 또는 분자들이다. 상기 양이온은 유기 또는 무기 양이온일 수 있다. 바람직한 무기 양이온들은 암모늄(ammonium) 이온, 알칼리 금속(alkali metal), 알칼리 토금속(alkali earth metal) 및 전이 금속 이온들(transition metal ions)이고, 더욱 바람직하게 Na+, K+, Ca2+, Mg2+, Ba2+, Fe2+, Zn2+, Mn2+ 및 Cu2+, 가장 바람직하게 K+, Ca2+, Mg2+, Ba2+, Fe2+ 및 Zn2+이다. 양전하를 띠는 형태의 염기성 유기 화합물들은 유기 양이온과 관련이 있을 수 있다. 바람직한 양전하를 띠는 대응물들은, 모두 양성자화된 형태인, 다이에틸 아민(diethyl amine), 트라이에틸 아민(triethyl amine), 다이아이소프로필 에틸 아민(diisopropyl ethyl amine), 에탄올아민(ethanolamine), 다이에탄올아민(diethanolamine), 트라이에탄올아민(triethanolamine), 이미다졸(imidazol), 피페리딘(piperidine), 피페라진(piperazine), 모폴린(morpholin), 벤질 아민(benzyl amine), 에틸렌 디아민(ethylene diamine), 메글루민(meglumin), 피롤리딘(pyrrolidine), 콜린(choline), 트리스-(하이드록시메틸)-메일 아민(tris-(hydroxymethyl)-methyl amine), N-(2-하이드록시에틸)-피롤리딘(N-(2-hydroxyethyl)-pyrrolidine), N-(2-하이드록시에틸)-피페리딘(N-(2-hydroxyethyl)-piperidine), N-(2-하이드록시에틸)-피페라진(N-(2-hydroxyethyl)-piperazine), N-(2-하이드록시에틸)-모폴린(N-(2-hydroxyethyl)-morpholine), L-아르지닌(L-arginine), L-라이신(L-lysine), L-아르지닌 또는 L-라이신 단위를 갖는 올리고펩타이드들 또는 N-말단에 유리 아미노 기를 갖는 올리고펩타이드들 등이다. 상기와 같은 염 형성물(salt formation)들은 복잡한 분자의 안정성, 부형제(excipient)들에 대한 호환성(compatibility), 용해도 및 결정 형성능 등과 같은 특성을 전체적으로 변형하는데 이용될 수 있다.
트랜스시알화
반응들(
Transsialidation
reactions
)
본 발명에 따르면, 트랜스시알리다아제(transsialidase) 활성을 가지는 효소의 촉매 작용 하에서, 화학식 SA-OR2 및 그의 염들의 시알릴 공여체(sialyl donor)가 일반 화학식 2 및 그의 염들의 시알릴 수용체(sialyl acceptor)와 반응되는 것을 특징으로 하는 일반 화학식 1의 화합물 및 그의 염들의 합성 방법이 제공되고, 화학식 SA-OR2에서, R2는 단-(mono-), 이-(di-) 또는 올리고당, 당지질, 당단백질 또는 당펩티드, 고리형(cyclic) 또는 비고리형(acyclic) 지방족 기(aliphatic group), 또는 아릴 잔기이고, SA는 α-시알릴 모이어티이고, 일반 화학식 1에서, R 기들 중의 하나는 α-시알릴 모이어티), 다른 하나는 H이고, X는 탄수화물 링커를 나타내고, R1은 가수소분해에 의해 제거될 수 있는 보호기이고, n은 0 또는 1의 정수이다. 상기 과정은 식(scheme) 2에 도시된다.
식(scheme) 2
일반 화학식 1의 화합물을 제공하는 것의 이점은 비글라이코실화 된 시알로올리고당들에 비하여 시알릴화된 올리고당 1-O-보호된 배당체들의 보다 간단한 정제이다. 부산물로서 유리 시알산의 형성이 없고, 반응 화합물들의 다른 극성 때문에, 역상 도는 크기 배제 크로마토그래피에 의한 생성물의 분리가 가능하다. 역상 크로마토그래피의 경우, 물이 이용될 때, 일반 화학식 1의 화합물은 반응 혼합물 내에 존재하는 극성 화합물들보다 훨씬 더 느리게 이동하고, 따라서 상기 극성 화합물들은 원활하게 용출될 수 있다. 그런 다음, 일반 화학식 1의 화합물들은 예를 들어 알콜로 칼럼으로부터 세척될 수 있다.
트랜스시알리다아제
활성을 갖는 효소(
Enzymes
having
transsialidase
activity
)
트랜스시알리다아제 활성을 갖고 본 출원에서 청구되는 시알로올리고당들의 제조 방법의 목적에 적합한 효소들은 시알리다아제 및 트랜스시알리다아제 효소들로부터 선택될 수 있다.
GH33 족으로 분류되는 시알리다아제들(EC 3.2.1.18)은 다양한 시알로글라이코 결합체들의 말단 시알산의 α-결합, 주로 갈락토스에 α-2-3 또는 α-2-6 결합으로 결합된 것들을 가수분해하는 능력을 갖는 유지 효소(retaining enzyme)이다. 그것들은 특히 다양한 바이러스 족들과 박테리아에서 발견되고, 또한, 원생동물(protozoa), 일부 무척추동물(invertebrates) 및 포유동물(mammal)에서도 발견된다. 일부 박테리아의 시알리다아제는 박테리아 세포 성장을 위한 영양분으로서 시알릴화된 당단백질, 당지질 또는 다른 당-결합체(glycoconjugate)들로부터 시알산들을 제거하는데 이용될 수 있다.
비록 시알리다아제들이 그것들의 가수분해 활성에 의해 특징화되지만, 적절한 반응 조건들 하에서 그것들은 시알로글라이코-결합체들의 형성을 야기시키는 트랜스시알화 반응에 의하여 아시알로 수용체(asialo acceptor)로 시알산 단위의 전이를 촉매할 수 있다. 박테로이데스 프라질리스(Bacteroides fragilis), 클로스트리듐(Clostridium) 종들(예, C. 퍼프린젠스(C. perfringens)), 코리네박테리움 디프테라아(Corynebacterium diphtheriae), 해모필러스 파라수이스(Haemophilus parasuis), 파스튜렐라 물토시다(Pasteurella multocida), 슈도모나스 애루지노사(Pseudomonas aeruginosa), 살모넬라 티피뮤리움(Salmonella typhimurium), 스트렙토코커스 뉴모니아(Streptococcus pneumoniae), 타네렐라 포시티아(Tannerella forsythia), 비브리오 콜레라(Vibrio cholerae) 또는 뉴캐슬병 바이러스(Newcastle disease virus)와 같은 병원성 박테리아 또는 바이러스들, 및
악티노마이세스 비스코서스(Actinomyces viscosus), 아스로박터(Arthrobacter) 종들 또는 마이크로모노스포라 비기디파시엔스(Micromonospora viridifaciens)와 같은 비병원성 박테리아 또는 바이러스들로부터 유래한 시알리다아제들은 그것들의 α-2-3 및/또는 α-2-6 선택성을 갖는 트랜스시알리다아제 활성으로 인하여 시알릴화 반응(sialylation reaction)을 위한 촉매로서 작용할 수 있다. 상기 위치선택성(regioselectivity)에 관하여, α-2-3- 및 α-2-6-결합된 생성물들 간의 비율은 효소들 및/또는 수용체들에 따라 달라진다. 예를 들어, A. 우레아파시엔스(A. ureafaciens), C. 퍼프린젠스(C. perfringens) 및 V. 콜레라(V. cholerae)로부터 유래된 시알리다아제들은 좋은 α-2-6 선택성을 갖는 반면, S. 티피뷰리움(S. typhimurium) 및 뉴캐슬병 바이러스(Newcastle disease virus)로부터 유래한 것들은 α-2-3 결합의 형성에 높은 선호도(preference)를 가진다.
최근, 비피도박테리움 비피둠(Bifidobacterium bifidum) 및 비피도박테리움 롱검 아종. 인판티스(Bifidobacterium longum subsp. infantis)과 같은 비피도박테리움(Bifidobacterium) 종들에서 유래한 시알리다아제들이 동정, 클로닝되고 특성이 밝혀졌다. 상기 시알리다아제들은 α-2,3- 및 α-2,6-결합된 시알로사이드(sialoside)들을 모두 분해하고 인식할 수 있다. 비피도박테리움 롱검 아종. 인판티스(Bifidobacterium longum subsp. infantis)에서 유래한 시알리다아제들은 α-2,6-결합에 일관된 선호도를 가지는 반면, 비피도박테리움 비피둠(Bifidobacterium bifidum)은 α-2,3-결합에 일관된 선호도를 가진다.
위치선택성 및/또는 트랜스시알화 반응을 향상시키기 위해서, 상기 시알리다아제들은 다양한 조작 기술들에 의하여 변형될 수 있다.
적절한 조작으로, 점 돌연변이(point mutatiion)에 의하여 신규 변형 효소들(돌연변이체들)이 생성되었다. 상기 돌연변이는 유전학적으로 상기 효소의 활성 부위에 영향을 미친다. 촉매적 친핵체(catalytic nucleophile)의 비-친핵성 잔기로의 대체는 비활성 돌연변이의 형성 또는 트랜스글라이코실화(transglycosylation)를 위한 반응성 주-객 복합체(reactive host-guest complex)의 형성을 위한 적절한 환경의 결핍으로 인하여 감소된 트랜스글라이코실화 활성을 갖는 변형된 효소를 야기한다. 그러나, 천연형보다 더욱 높은 활성을 갖는 시알릴 공여체의 존재하에서, 상기 돌연변이된 효소는 상기 시알릴 잔기를 적합한 수용체에 효율적으로 전이시킬 수 있다. 효소의 적절한 조작은 일반적으로 정지된 상태의 3D 단백질 구조에 의존한다. 상기의 변형된 효소들은 생성물 가수분해 활성이 전혀 없을 수도 있다.
유도 진화(directed evolution) 전략라고 불리는 두 번째 기술은 선별된 천연형 시알리다아제 효소의 무작위적인 돌연변이 유발(mutagenesis)을 포함하여, 각각의 변이체들이 단일 또는 복수의 위치들에서 변형이 일어난 효소 변이체들의 라이브러리를 생성한다. 그것들은 약간 변형된 특성들을 갖는 재조합 변이체들을 생산하기 위하여 E. 콜라이(E. coli) 또는 S. 세레비시아(S. cerevisiae)와 같은 적합한 미생물들에 삽입될 수 있다. 그런 다음, 빠르고 신뢰성 있는 스크리닝 방법으로 향상된 효소를 발현하는 클론들이 동정되고, 선별되어, 돌연변이 과정의 다음 차례로 진행된다. 원하는 활성 및/또는 특이성을 가진 돌연변이(들)이 진화될 때까지 돌연변이, 재조합 및 선별의 반복되는 순환들이 계속된다.
트랜스시알리다아제들에 관하여, 가장 먼저 개시된 트랜스시알리다아제 효소는 샤가스 병(Chagas disease)을 일으키는 원생동물인 트리파노소마 크루지(Trypanosoma cruzi)에서 발견되었다. 그 이후로, 트랜스시알리다아제는 트리파노소마 브루세이 감비언스(Trypanosoma brucei gambiense), 트리파노소마 브루세이 로데시언스(Trypanosoma brucei rhodesiense), 트리파노소마 브루세이 브루세이(Trypanosoma brucei brucei) 및 프리파노소마 콩골렌스(Trypanosoma congolense)와 같은 몇몇의 다른 트리파노솜 유형들에서 발견되어 왔다. 더욱이, 트랜스시알리다아제들의 존재는 엔도트리파눔(Endotrypanum) 유형, 코리네박테리움 디프테리아(Corynebacterium diphtheriae) 및 심지어는 인간 혈장(plasma)에서도 밝혀졌다.
트랜스시알리다아제들은 시알리다아제의 시알산에 대한 가수분해 활성에 더하여 더욱 중요한 시알산 전이 활성(sialic acid transfer activity)을 가진다는 점에서 시알리다아제들과는 다르다. 트랜스시알리다아제들은 공여체 분자로부터 시알산들, 바람직하게는 α-2,3-결합된 시알산들을 수용체 유도체, 바람직하게는 β-인터글라이코사이드 결합(β-interglycosidic linkage)을 갖는 말단 갈락토스 모이어티로 전이시킬 수 있다. 상기의 전이의 결과로서, 상기 시알산과 상기 수용체 간에 α-글라이코사이드 결합이 형성된다. 그러나, 적합한 수용체가 없다면, 상기 트랜스시알리다아제는 상기 시알산을 가수분해한다.
트랜스시알리다아제 작용에 대하여, 예를 들어 아미노산 서열을 변경시킴으로써 상기 가수분해 활성이 제거된 유도된 트랜스시알리다아제 효소 돌연변이체들이 생산될 수 있다. 돌연변이 유발 및/또는 재조합에 의하여 변형된 유전자들의 라이브러리를 생성한 다음, 그것들은 약간 변형된 특성들을 갖는 재조합 변이체들을 생산하기 위해 E. 콜라이(E. coli) 또는 S. 세레비시아(S. cerevisiae)와 같은 적합한 미생물들에 삽입될 수 있다. 그런 다음, 효소를 발현하는 클론들이 동정되고, 분리되어 원하는 목적을 위해 이용될 수 있다. 예를 들어, 서열과 구조 비교에 기반하여, 트리파노소마 랑겔리(Trypanosoma rangeli)로부터 유래된 시알리다아제는 6개의 위치들에서 돌연변이될 수 있고, 상기 변형된 돌연변이체는 현저한 수준의 트랜스시알리다아제 활성을 나타낼 수 있다(Paris et al. J. Mol. Biol. 345, 923 (2005)).
바람직하게, 상기 트랜스시알리다아제 활성을 갖는 효소는 비도박테리움 롱검 아종. 인판티스(Bifidobacterium longum subsp. infantis) ATCC 15697, 비도박테리움 비피둠(Bifidobacterium bifidum) JCM1254, 비도박테리움 비피둠(Bifidobacterium bifidum) S17, 비도박테리움 비피둠(Bifidobacterium bifidum) PRL2010, 비도박테리움 비피둠(Bifidobacterium bifidum) NCIMB 41171, 트리바노소마 크루지(Trypanosoma cruzi) 등에서 유래된 시알리다아제들 또는 트랜스시알리다아제들로부터 선택될 수 있다.
보다 바람직하게, 상기 트랜스시알리다아제 활성을 갖는 효소는 하기의 기탁 번호에 따라 정의되는 시알리다아제들 또는 트랜스시알리다아제들로부터 선택될 수 있다: gi|213524659 (비도박테리움 롱검 아종. 인판티스 ATCC 15697, SEQ ID NO: 1), gi|213523006 비도박테리움 롱검 아종. 인판티스 ATCC 15697, SEQ ID NO: 2), gi|309252191 (비도박테리움 비피둠 S17, SEQ ID NO: 3), gi|309252190 (비도박테리움 비피둠 S17, SEQ ID NO: 4), gi|310867437 (비도박테리움 비피둠 PRL2010, SEQ ID NO: 5), gi|310867438 (비도박테리움 비피둠 PRL2010, SEQ ID NO: 6), gi|224283484 (비도박테리움 비피둠 NCIMB 41171, SEQ ID NO: 7), gi|224283485 (비도박테리움 비피둠 NCIMB 41171, SEQ ID NO: 8), gi|334283443 gi|47252690 (비도박테리움 비피둠 JCM1254, SEQ ID NO: 9), gi|47252690 (T. cruzi, SEQ ID NO: 10), gi|432485 (T. 크루지, SEQ ID NO: 11). 트랜스시알리다아제 활성을 갖는 특히 바람직한 시알리다아제들/트랜스시알리다아제들은 하기 표 2에 열거된다:
바람직한 시알리다아제들 / 트랜스시알리다아제들 | ||
GenBank
데이터베이스의 GI 번호 |
생물체(Organism) | 서열번호: |
gi|213524659 | Bifidobacterium longum subsp. infantis ATCC 15697 | 1 |
gi|213523006 | Bifidobacterium longum subsp. infantis ATCC 15697 | 2 |
gi|309252191 | Bifidobacterium bifidum S17 | 3 |
gi|309252190 | Bifidobacterium bifidum S17 | 4 |
gi|310867437 | Bifidobacterium bifidum PRL2010 | 5 |
gi|310867438 | Bifidobacterium bifidum PRL2010 | 6 |
gi|224283484 | Bifidobacterium bifidum NCIMB 41171 | 7 |
gi|224283485 | Bifidobacterium bifidum NCIMB 41171 | 8 |
gi|334283443 | Bifidobacterium bifidum JCM1254 | 9 |
gi|47252690 | Trypanosoma cruzi | 10 |
gi|432485 | Trypanosoma cruzi | 11 |
트랜스시알리다아제 활성을 유지하고 상기 언급된 트랜스시알리다아제 활성을 갖는 효소 서열들의 서열에 대하여, 아미노산 수준 상의 전체 야생형 서열에 비해 적어도 70%, 보다 바람직하게는 적어도 80%, 보다 더욱 바람직하게는 적어도 85%, 보다 더욱 바람직하게는 적어도 90%, 그리고 가장 바람직하게는 적어도 95%, 또는 심지어 97%, 98% 또는 99%의 서열 유사성/상동성을 갖는 시알리다아제들/트랜스시알리다아제들 효소 돌연변이체들이 설계된다.
바람직하게는, 상기 서열 유사성이 적어도 90%, 보다 바람직하게는 95%, 97%, 98% 또는 가장 바람직하게 99%이다. 바람직하게는, 상기 트랜스시알리다아제 활성이 상기 효소의 천연형의 활성의 적어도 75%, 보다 바람직하게는 적어도 90% 및 보다 더욱 바람직하게는 적어도 100%이다.
시알리다아제들 및 트랜스시알리다아제들은 선행 기술에서 이용된 시알릴트랜스퍼라아제보다 광범위한 공여체와 수용체 특이성을 가지며, 특히 다양한 반응들에 이용될 수 있다. 따라서, 시알리다아제들/트랜스시알리다아제들은 종래 이용되던 시알릴트랜스퍼라아제보다 산업적 활용에 더욱 유리하다.
시알리다아제들 / 트랜스시알리다아제들의 공여체들(Donors for sialidases / transsialidases )
T. 크루지로 생물체를 감염시키면, T. 크루지 내의 트랜스시알리다아제가 숙주 생물체의 시알로올리고-결합체들로부터 시알산들을 제거하고, 자신의 에피토프(epitope)를 감추기 위하여 자신의 표면 뮤신을 효율적으로 시알릴화시킨다.
수많은 집중적인 연구 결과, 시알화 반응들에서 시알릴 공여체들로서 작용할 수 있는 유도체들을 포함하는 다수의 천연 및 합성의 시알산이 발견되었다. 따라서, 상기에서 정의된 일반 화학식 SA-OR2의 시알릴 공여체 화합물들은 트랜스시알리다아제들에 의하여 상기 수용체로 전이될 기질을 제공할 수 있다. 트랜스시알리다아제들은 상기 수용체로 순수한 시알산이나 CMP-시알산(실제로, β-시알라이드(β-sialide))를 전이시키는 것이 아니라, α-아노머성 아글리콘(α-anomeric aglycon) 또는 α-아노머성 치환기를 갖는 시알산의 존재가 트랜스-시알라다아제 반응들을 위한 필요조건이다. 전형적인 천연 시알릴 공여체들은 말단 β-갈락토사이드(β-galactoside) 잔기에 α-2,3-결합된 시알산 또는 α-2,8-결합을 갖는 폴리시알산을 포함하는 3'-O-시알릴-락토스(3'-O-sialyl-lactose), 페투인(fetuin), 강글리오사이드(ganglioside)들, O- 또는 N-결합된 당단백질들로부터 선택될 수 있으나, 이에 한정되지 아니한다. 합성 시알로사이드(sialoside)들 중에 2-O-(4-메틸움벨리페릴)- (2-O-(4-methylumbelliferyl)-) 또는 2-O-(선택적으로 치환도니 페닐)-α-D-시알로사이드(sialoside)들, 더욱 흔하게는 2-O-(p-나이트로페닐)-α-D-시알로사이드(2-O-(p-nitrophenyl)-α-D-sialoside)가 선호된다.
시알리다아제들
/
트랜스시알리다아제들의
수용체들(
Acceptors
for
sialidases
/
transsialidases
)
상기 개시된 트랜스시알화 반응에 이용되는 트랜스시알리다아제 수용체들은 R1-갈락토피라노사이드(R1-galactopyranosides)들(n이 0일 때) 또는 비-환원 말단 상의 말단 당 모이어티가 갈락토피라노스(galactopyranose)인 올리고당 R1-배당체들(n이 1일 때)인 일반 화학식 2 및 그의 염들에 의하여 특징화된다. 상기 말단 갈락토피라노실 단위는 바람직하게 β-글라이코사이드 결합에 의하여 결합된다.
일반 화학식 2의 화합물의 상기 아노머성 하이드록실 기는 가수소화반응에 의하여 제거될 수 있는 R1 기들로 보호된다. 상기에서 설명한 바와 같이, 그러한 기들은 선택적으로 치환된 벤질 및 나프틸메틸 기들을 포함한다. 페닐, 알킬 또는 할로젠으로 선택적으로 치환된 벤질 또는 2-나프틸메틸 기들은 바람직한 R1-기들이고, 그 중에서도 비치환된 벤질, 비치환된 2-나프틸메틸, 4-클로로벤질, 3-페닐벤질 또는 4-메틸벤질 기들이 특히 바람직하다.
n이 1일 때, 상기 구조적 성분 X는 탄수화물 링커로서, 선형 또는 분지형 구조를 나타내는 단-, 이-, 삼-(tri-), 사-(tetra-), 오-(penta-) 또는 올리고당을 의미한다. 상기 탄수화물 링커의 단당류(monosaccharide) 구성 단위(building unit)들은 가장 흔히 발생되는 단위들인 글루코스(glucose), N-아세틸 글루코사민(N-acetyl glucosamine), 갈락토스, 푸코스(fucose) 및 시알산를 포함하여, 모든 자연적으로 발생하는 5-, 6-. 또는 9-탄소 함유 당 유도체들일 수 있다.
바람직한 방법에서, n은 1이고, 링커 X는 일반 화학식 2A 또는 그의 염들의 수용체들을 형성하는 화학식 -(X1)m-A-에 대응되고,
일반 화학식 2A에서, A는 선택적으로 푸코실로 치환된 D-글루코피라노실 단위이고, X1은 탄수화물 링커를 나타내며, m은 0 또는 1의 정수이다. -OR1 기는 D-글루코피라노실 고리의 아노머성 탄소(C1) 원자에, 바람직하게는 β 배향(β orientation)으로 연결된다.
m이 0일 때, 상기 말단 D-글루코피라노실 단위는 인터글라이코사이드 결합(interglycosidic linkage)을 통해 A 기에 직접 결합된다. 바람직하게, 상기 인터글라이코사이드 결합은 1-4 결합이고, 따라서 일반 화학식 2B의 화합물을 형성하고,
화학식 2B에서 R3는 푸코실 또는 H이다. 보다 바람직하게는, 상기 갈락토스와 상기 글루코스 부분 사이의 상기 인터글라이코사이드 결합은 β이고, 따라서 락토스 유도체를 생성한다. 보다 더욱 바람직한 구현예에서는, 아글리콘 -OR1 또한 β 배향이다.
m이 1일 때, 상기 구조적 성분 X1은 탄수화물 링커로서, 선형 또는 분지형 구조를 나타내는 단-, 이-, 삼-, 사-, 오- 또는 올리고당을 의미한다. 상기 탄수화물 링커의 단당류 구성 단위들은 가장 흔히 발생되는 단위들인 글루코스, N-아세틸 글루코사민, 갈락토스, 푸코스 및 시알산을 포함하여, 모든 자연적으로 발생하는 5-, 6-. 또는 9-탄소 함유 당 유도체들일 수 있다.
바람직하게, 링커 X1은 일반 화학식 2C의 화합물들 또는 그의 염들을 형성하는 화학식 -B-X2-로 표현되고,
일반 화학식 2C에서, B 기는 선택적으로 푸코실 및/또는 시알릴로 치환된 N-아세틸-글루코사미노피라노실 단위(N-acetyl-glucosaminopyranosyl unit)이며, 링커 X2는 글루코스, N-아세틸-글루코사민, 갈락토스, 푸코스 및 시알산으로부터 선택되는 단당류 구성 단위들을 갖는 선형 또는 분지형 구조를 나타내는 단-, 이-, 삼-, 사-, 오- 또는 올리고당을 의미한다. -OR1 기는 상기 D-글루코피라노실 고리의 아노머성 탄소(C1) 원자에, 바람직하게는 β 배향으로 연결된다. 바람직하게는, 말단 갈락토실 기가 1-3 또는 1-4 인터글라이코사이드 결합을 통해 B 기에 부착되어, 각각 락토-N-바이오실(lacto-N-biosyl) 또는 N-아세틸-락토사미닐(N-acetyl-lactosaminyl) 말단 이당류(disaccharide) 모이어티들을 형성한다. 보다 더 바람직한 방법에서는, 푸코실 치환기가 단위 B의 3-OH 기 또는 4-OH 기, 및/또는 단위 A의 3-OH 기에 결합될 수 있고/있거나, 시알릴이 단위 B의 6-OH에 연결될 수 있다.
보다 바람직한 방법에서, X2 기는 시알릴 또는 N-아세틸-락토사민, 락토-N-바이오스(lacto-N-biose), 푸코스 및 시알산으로부터 선택되는 당류 구성 단위들을 갖는 선형 또는 분지형 구조를 나타내는 올리고당으로 선택적으로 치환된 갈락토스이므로, 일반 화학식 2D 또는 그의 염들로 표현되는 인유 올리고당 유도체들을 형성하고,
일반 화학식 2D에서, R1은 가수소분해에 의해 제거될 수 있는 기이고, R3는 H 또는 푸코실 단위이고, Y는 서로에 관계없이, 선택적으로 시알릴 및/또는 푸코실 잔기로 치환된 N-아세틸-락토사미닐 기이고, p는 서로에 관계없이 0, 1 또는 2의 정수이고, R8은 일반 화학식 5 및 6에 의하여 특징화되는 기들로부터 선택되고, R9는 H, α-시알릴 모이어티, 일반 화학식 5의 기 및 일반 화학식 6의 기로부터 선택되며,
일반 화학식 5 또는 6에서, R6은 H 또는 푸코실 잔기이고, R7은 H 또는 α-시알릴 모이어티이다.
보다 바람직한 방법에 따라, 상기에서 정의된 바와 같은 일반 화학식 2D의 화합물 및 그의 염들은 그것의 결합부 및 부착된 모이어티들로 특징지을 수 있으며, 이때에
- Y 기의 N-아세틸-락토사미닐 기는, 또 다른 N-아세틸-락토사미닐 기(p = 2)에 부착될 때, 1-3 인터글라이코사이드 결합으로 결합되고,
- 일반 화학식 5의 기는, Y(p = 1, 2)에 부착될 때, 1-3 인터글라이코사이드 결합으로 결합되고,
- 일반 화학식 6의 기는, Y(p = 1, 2)에 부착될 때, 1-3 인터글라이코사이드 결합으로 결합되고,
- 푸코실 잔기는, Y에 존재하는 N-아세틸-락토사미닐 기에 부착되는 경우, 1-3 인터글라이코사이드 결합으로 N-아세틸-락토사미닐 기의 N-아세틸-글루코사민에 연결되고,
- α-시알릴 잔기는, Y에 존재하는 N-아세틸-락토사미닐 기에 부착되는 경우, 2-6 인터글라이코사이드 결합으로 N-아세틸-락토사미닐 기의 갈락토스에 연결된다.
추가적인 측면에서, 상기에서 정의된 바와 같은 일반 화학식 2B, 2C 또는 2D의 화합물 및 그의 염들은 선택적으로 하나 이상의 시알릴 및/또는 푸코실 잔기로 치환되고, 비치환된 말단 갈락토실(galactosyl) 잔기를 갖는 락토스(lactose), 락토-N-네오테트라오스(lacto-N-neotetraose), 파라-락토-N-헥사오스(para-lacto-N-hexaose), 파라-락토-N-네오헥사오스(para-lacto-N-neohexaose), 락토-N-네오헥사오스(lacto-N-neohexaose), 파라-락토-N-옥타오스(para-lacto-N-octaose), 락토-N-네오옥타오스(lacto-N-neooctaose), 락토-N-테트라오스(lacto-N-tetraose), 락토-N-헥사오스(lacto-N-hexaose), 락토-N-옥타오스(lacto-N-octaose), 아이소-락토-N-옥타오스(iso-lacto-N-octaose), 락토-N-데카오스(lacto-N-decaose) 및 락토-N-네오데카오스(lacto-N-neodecaose)의 R1-배당체들을 나타낸다. 바람직하게, 상기 시알릴 치환기는 N-아세틸 뉴라미닐(N-acetyl neuraminy) 기이다.
특히 바람직하게, 상기에서 정의된 바와 같은 일반 화학식 2B, 2C 또는 2D의 화합물 및 그의 염들은 Galβ1-4Glc (락토스(lactose)), Galβ1-4(Fucα1-3)Glc (3-O-푸코실락토스(3-O-fucosyllactose)), Galβ1-3GlcNAcβ1-3Galβ1-4Glc (LNT), Galβ1-4GlcNAcβ1-3Galβ1-4Glc (LNnT), Galβ1-3(Fucα1-4)GlcNAcβ1-3Galβ1-4Glc (LNFP II), Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4Glc (LNFP III), Galβ1-3GlcNAcβ1-3Galβ1-4(Fucα1-4)Glc (LNFP V), Galβ1-3(Fucα1-4)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc (LNDFH II), Galβ1-3(Neu5Acα2-6)GlcNAcβ1-3Galβ1-4Glc (LSTb), Galβ1-3(Neu5Acα2-6)(Fucα1-4)GlcNAcβ1-3Galβ1-4Glc, Galβ1-3(Neu5Acα2-6)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Galβ1-4GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc (LNDFH III)의 R1-배당체들의 군, 또는 그들의 염들로부터 선택된다. 상기 R1-배당체들은 α- 또는 β-아노머들일 수 있다. 바람직하게, 상기 R1-배당체들은 β-아노머들이다.
일반 화학식 2의 화합물들의 전형적인 합성은 갈락토스, 또는 비환원 말단에 갈락토시파노실 단위(galactopyranosyl unit)를 갖는 올리고당들을 50-125 ℃에서 아세트산 무수물(acetic anhydride) 및 아세트산 나트륨(sodium acetate)으로 처리한 다음, DCM 톨루엔(toluene), THF 등과 같은 유기 용매 내에서 R1-OH, 바람직하게는 벤질/치환된 벤질 알콜들을 이용하여 루이스 산 촉매된 글라이코실화(Lewis acid catalyzed glycosylation)시키는 것을 포함한다. 뒤이어, 상기 글라이코실화된 생성물들의 최종 젬플렌 탈보호(Zemplen deprotection)를 통해 일반 화학식 2의 화합물들이 수득된다.
다른 전형적인 아노머성 O-보호 과정에 따르면, DMF, DMSO, N-메틸피롤리돈(N-methylpyrrolidone), 헥사메틸포스포라마이드(hexamethylphosphoramide, HMPA), N,N'-다이메틸헥사하이드로피리미딘-2-온(N,N'-dimethylhexahydropyrimidine-2-one, DMPU), THF, 다이옥세인(dioxane), 아세토나이트릴(acetonitrile) 등 또는 그들의 혼합물과 같은 쌍극성 비양성자성 용매(dipolar aprotic solvent) 내에서 유리 아노머성 OH로 전부 또는 일부가 보호된 갈락토스 또는 비환원 말단에 갈락토피라노실 단위(galactopyranosyl unit)을 갖는 올리고당들은 강 염기 및 R1-X의 존재하에서 O-알킬화되고, R1-X에서 X는 할로젠(halogen), 메실(mesyl), 트라이플릴(triflyl) 등과 같은 알킬설포닐옥시(alkylsulfonyloxy) 및 벤젠설포닐(benzenesulfonyl), 토실(tosyl) 등과 같은 아릴설포닐(arylsulfonyl)로부터 선택되는 이탈기(leaving group)이다. 바람직한 알킬화제(alkylating agent)들은 선택적으로 페닐, 알킬 또는 할로젠으로부터 선택된 하나 이상의 기들로 치환된 벤질 또는 1- 또는 2-나프틸메틸 할로제나이드(1- or 2-naphthylmethyl halogenide)들이다. 상기 강 염기는 동등량 또는 약간 초과량(1 내지 1.5 equiv)의 염기가 이용되었을 때, 아노머성 OH의 보다 강한 산성 특성으로 인하여 아노머성 OH를 화학선택적으로 탈양성자화할 수 있다. 상기 아노머성 OH를 활성화시키는데 적합한 강 염기는 일반적으로 NaH, KH, CaH2, NaOMe, NaOtBu, KOtBu, 무기 하이드록사이드(inorganic hydroxide)들, 탄산 칼륨(potassium carbonate) 등과 같은 알칼리 금속(alkali metal) 또는 알칼라인 토금속 하이드라이드(alkaline earth metal hydride)들 또는 알콕사이드(alkoxide)들의 군으로부터 취해진다. 상기 알킬화제는 동응량 또는 약간 초과량(1 내지 1.5 equiv)으로 첨가된다. 상기 반응은 -10 내지 80 ℃ 사이, 바람직하게는 반응의 전 과정 동안 낮은 온도에서, 또는 시약/반응물들의 첨가하는 동안에는 낮은 온도, 상기 반응 과정의 후속 단계들에서 상승된 온도에서 수행된다. 통상적인 마무리(work-up) 후에 일반 화학식 2의 벤질/치환된 벤질 배당체들이 수득된다.
추가적인 측면에서, 본 발명은 일반 화학식 2A'의 화합물 및 그의 염들을 제공하는 것과 관련되고,
일반 화학식 2A'에서, A는 선택적으로 푸코실로 치환된 D-글루코피라노실 단위이고, X1은 탄수화물 링커를 나타내고, m은 0 또는 1의 정수이고, 단, a) m이 0일 때, 상기 A는 푸코실로 치환되고, b) 상기 화합물은 1-O-β-벤질-LNT(1-O-β-benzyl-LNT), 1-O-β-(4-하이드록시메틸벤질)-LNnT(1-O-β-(4-hydroxymethylbenzyl)-LNnT) 및 1-O-β-벤질-LNnT(1-O-β-benzyl-LNnT)와는 상이하다. -OR1 기는 D-글루코피라노실 고리의 아노머성 탄소(C1) 원자에, 바람직하게는 β 배향으로 연결된다.
m이 0일 때, 상기 말단 D-글루코피라노실 단위는 인터글라이코사이드 결합을 통해 A 기에 직접 결합된다. 바람직하게, 상기 인터글라이코사이드 결합은 1-4 결합이고, 따라서 일반 화학식 2B'의 화합물들 및 그의 염들을 형성하고,
화학식 2B'에서 R3는 푸코실이다. 보다 바람직하게는, 상기 갈락토스와 상기 글루코스 부분 사이의 상기 인터글라이코사이드 결합은 β이고, 따라서 락토스 유도체를 생성한다. 보다 더욱 바람직한 구현예에서는, 아글리콘 -OR1 또한 β 배향이다.
m이 1일 때, 상기 구조적 성분 X1은 탄수화물 링커로서, 선형 또는 분지형 구조를 나타내는 단-, 이-, 삼-, 사-, 오- 또는 올리고당을 의미한다. 상기 탄수화물 링커의 단당류 구성 단위들은 가장 흔히 발생되는 단위들인 글루코스, N-아세틸 글루코사민, 갈락토스, 푸코스 및 시알산을 포함하여, 모든 자연적으로 발생하는 5-, 6-. 또는 9-탄소 함유 당 유도체들일 수 있다. 바람직하게, 링커 X1은 화학식 -B-X2-로 표현되어, 일반 화학식 2C'의 화합물들 및 그의 염들을 형성하고,
일반 화학식 2C에서, B 기는 선택적으로 푸코실 및/또는 시알릴로 치환된 N-아세틸-글루코사미노피라노실 단위이며, 링커 X2는 글루코스, N-아세틸-글루코사민, 갈락토스, 푸코스 및 시알산으로부터 선택되는 단당류 구성 단위들을 갖는 선형 또는 분지형 구조를 나타내는 단-, 이-, 삼-, 사-, 오- 또는 올리고당을 의미한다. -OR1 기는 상기 D-글루코피라노실 고리의 아노머성 탄소(C1) 원자에, 바람직하게는 β 배향으로 연결된다. 바람직하게는, 말단 갈락토실 기가 1-3 또는 1-4 인터글라이코사이드 결합을 통해 B 기에 부착되어, 각각 락토-N-바이오실 또는 N-아세틸-락토사미닐 말단 이당류 모이어티들을 형성한다. 보다 더 바람직한 방법에서는, 푸코실 치환기가 단위 B의 3-OH 기 또는 4-OH 기, 및/또는 단위 A의 3-OH 기에 결합될 수 있고/있거나, 시알릴이 단위 B의 6-OH에 연결될 수 있다.
보다 바람직한 방법에서, X2 기는 시알릴 또는 N-아세틸-락토사민, 락토-N-바이오스, 푸코스 및 시알산으로부터 선택되는 당류 구성 단위들을 갖는 선형 또는 분지형 구조를 나타내는 올리고당 잔기로 선택적으로 치환된 갈락토스이므로, 일반 화학식 2D' 또는 그의 염들로 표현되는 인유 올리고당 유도체들을 형성하고,
일반 화학식 2D'에서, R1은 가수소분해에 의해 제거될 수 있는 기이고, R3는 H 또는 푸코실 단위이고, Y는 서로에 관계없이, 선택적으로 시알릴 및/또는 푸코실 잔기로 치환된 N-아세틸-락토사미닐 기이고, p는 서로에 관계없이 0, 1 또는 2의 정수이고, R8은 일반 화학식 5 및 6에 의하여 특징화되는 기들로부터 선택되고, R9는 H, α-시알릴 모이어티, 일반 화학식 5의 기 및 일반 화학식 6의 기로부터 선택되며,
일반 화학식 5 또는 6에서, R6은 H 또는 푸코실 잔기이고, R7은 H 또는 α-시알릴 모이어티이다.
보다 바람직한 구현예에 따라, 상기에서 정의된 바와 같은 일반 화학식 2D'의 화합물 및 그의 염들은 그것의 결합부 및 부착된 모이어티들로 특징지을 수 있으며, 이때에
- Y 기의 N-아세틸-락토사미닐 기는, 또 다른 N-아세틸-락토사미닐 기에 부착될 때, 1-3 인터글라이코사이드 결합으로 결합되고,
- 일반 화학식 5의 기는, Y에 부착될 때, 1-3 인터글라이코사이드 결합으로 결합되고,
- 일반 화학식 6의 기는, Y에 부착될 때, 1-3 인터글라이코사이드 결합으로 결합되고,
- 푸코실 잔기는, Y에 존재하는 N-아세틸-락토사미닐 기에 부착되는 경우, 1-3 인터글라이코사이드 결합으로 N-아세틸-락토사미닐 기의 N-아세틸-글루코사민에 연결되고,
- α-시알릴 잔기는, Y에 존재하는 N-아세틸-락토사미닐 기에 부착되는 경우, 2-6 인터글라이코사이드 결합으로 N-아세틸-락토사미닐 기의 갈락토스에 연결된다.
추가적인 측면에서, 상기에서 정의된 바와 같은 일반 화학식 2 C' 또는 2 D'의 화합물 및 그의 염들은 선택적으로 하나 이상의 시알릴 및/또는 푸코실 잔기로 치환되고, 비치환된 말단 갈락토실 잔기를 갖는 락토-N-네오테트라오스(lacto-N-neotetraose), 파라-락토-N-헥사오스(para-lacto-N-hexaose), 파라-락토-N-네오헥사오스(para-lacto-N-neohexaose), 락토-N-네오헥사오스(lacto-N-neohexaose), 파라-락토-N-옥타오스(para-lacto-N-octaose), 락토-N-네오옥타오스(lacto-N-neooctaose), 락토-N-테트라오스(lacto-N-tetraose), 락토-N-헥사오스(lacto-N-hexaose), 락토-N-옥타오스(lacto-N-octaose), 아이소-락토-N-옥타오스(iso-lacto-N-octaose), 락토-N-데카오스(lacto-N-decaose) 및 락토-N-네오데카오스(lacto-N-neodecaose)의 R1-배당체들을 나타낸다. 바람직하게, 상기 시알릴 치환기는 N-아세틸 뉴라미닐 기이다.
특히 바람직하게, 상기에서 정의된 바와 같은 일반 화학식 2 A'의 화합물 및 그의 염들은 Galβ1-4(Fucα1-3)Glc (3-O-푸코실락토스(3-O-fucosyllactose)), Galβ1-3GlcNAcβ1-3Galβ1-4Glc (LNT), Galβ1-4GlcNAcβ1-3Galβ1-4Glc (LNnT), Galβ1-3(Fucα1-4)GlcNAcβ1-3Galβ1-4Glc (LNFP II), Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4Glc (LNFP III), Galβ1-3GlcNAcβ1-3Galβ1-4(Fucα1-4)Glc (LNFP V), Galβ1-3(Fucα1-4)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc (LNDFH II), Galβ1-3(Neu5Acα2-6)GlcNAcβ1-3Galβ1-4Glc (LSTb), Galβ1-3(Neu5Acα2-6)(Fucα1-4)GlcNAcβ1-3Galβ1-4Glc, Galβ1-3(Neu5Acα2-6)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Galβ1-4GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc (LNDFH III)의 R1-배당체들의 군, 또는 그들의 염들로부터 선택된다. 상기 R1-배당체들은 α- 또는 β-아노머들일 수 있다. 바람직하게, 상기 R1-배당체들은 β-아노머들이다.
트랜스시알화
반응의 생성물들(
Products
of
transsialidation
reaction
)
상기에서 설명한 바와 같이, 본 출원에서 청구된 트랜스시알화 반응은 일반 화학식 2의 화합물 및 그의 염들로부터 일반 화학식 1의 화합물들 및 그의 염들을 생성하고,
일반 화학식 1에서 R 기들 중의 하나는 α-시알릴 모이어티, 다른 하나는 H이고, X는 탄수화물 링커를 나타내고, R1은 가수소분해에 의해 제거될 수 있는 보호기이고, n은 0 또는 1의 정수이다.
R1 기는 선택적으로 치환된 벤질 및 나프틸메틸 기들을 포함하고, 그 중에서 선택적으로 페닐, 알킬 또는 할로젠으로 치환된 벤질 또는 2-나프틸메틸 기들은 바람직한 R1-기들이고, 그 중에서 비치환된 벤질, 비치환된 2-나프틸메틸, 4-클로로벤질, 3-페닐벤질 또는 4-메틸벤질 기들이 특히 바람직하다.
n이 1일 때, 상기 구조적 성분 X는 탄수화물 링커로서, 선형 또는 분지형 구조를 나타내는 단-, 이-, 삼-, 사-, 오- 또는 올리고당을 의미한다. 상기 탄수화물 링커의 단당류 구성 단위들은 가장 흔히 발생되는 단위들인 글루코스, N-아세틸 글루코사민, 갈락토스, 푸코스 및 시알산을 포함하여, 모든 자연적으로 발생하는 5-, 6-. 또는 9-탄소 함유 당 유도체들일 수 있다.
바람직한 방법에서, n은 1이고, 링커 X는 일반 화학식 1A 또는 그의 염들의 시알릴화된 생성물들을 형성하는 화학식 -(X1)m-A-에 대응되고,
일반 화학식 1A에서, A는 선택적으로 푸코실로 치환된 D-글루코피라노실 단위이고, X1은 탄수화물 링커를 나타내며, m은 0 또는 1의 정수이다. -OR1 기는 D-글루코피라노실 고리의 아노머성 탄소(C1) 원자에, 바람직하게는 β 배향으로 연결된다.
m이 0일 때, 상기 D-글루코피라노실 단위는 인터글라이코사이드 결합을 통해 A 기에 직접 결합된다. 바람직하게, 상기 인터글라이코사이드 결합은 1-4 결합이고, 따라서 일반 화학식 1B의 화합물들 및 그의 염들을 형성하고,
화학식 1B에서 R3는 푸코실 또는 H이다. 보다 바람직하게, 상기 갈락토스와 상기 글루코스 부분 사이의 상기 인터글라이코사이드 결합은 β이고, 따라서 3'-O-시알릴-락토스(3'-O-sialyl-lactose) 유도체를 생성한다. 보다 더욱 바람직한 구현예에서는, 아글리콘 -OR1 또한 β 배향이다.
m이 1일 때, 상기 구조적 성분 X1은 탄수화물 링커로서, 선형 또는 분지형 구조를 나타내는 단-, 이-, 삼-, 사-, 오- 또는 올리고당을 의미한다. 상기 탄수화물 링커의 단당류 구성 단위들은 가장 흔히 발생되는 단위들인 글루코스, N-아세틸 글루코사민, 갈락토스, 푸코스 및 시알산을 포함하여, 모든 자연적으로 발생하는 5-, 6-. 또는 9-탄소 함유 당 유도체들일 수 있다. 바람직하게, 링커 X1은 화학식 -B-X2-로 표현되어, 일반 화학식 1C의 화합물들 및 그의 염들을 형성하고,
일반 화학식 1C에서, B 기는 선택적으로 푸코실 및/또는 시알릴로 치환된 N-아세틸-글루코사미노피라노실 단위이며, 링커 X2는 글루코스, N-아세틸-글루코사민, 갈락토스, 푸코스 및 시알산으로부터 선택되는 단당류 구성 단위들을 갖는 선형 또는 분지형 구조를 나타내는 단-, 이-, 삼-, 사-, 오- 또는 올리고당을 나타낸다. -OR1 기는 상기 D-글루코피라노실 고리의 아노머성 탄소(C1) 원자에, 바람직하게는 β 배향으로 연결된다. 바람직하게는, 상기 시알릴-갈락토실 기가 1-3 또는 1-4 인터글라이코사이드 결합을 통해 B 기에 부착되어, 각각 시알릴-락토-N-바이오실(sialyl-lacto-N-biosyl) 또는 시알릴-N-아세틸-락토사미닐(sialyl-N-acetyl-lactosaminyl) 말단 삼당류(trisaccharide) 모이어티들을 형성한다. 보다 더 바람직한 방법에서는, 푸코실 치환기가 단위 B의 3-OH 기 또는 4-OH 기, 및/또는 단위 A의 3-OH 기에 결합될 수 있고/있거나, 시알릴이 단위 B의 6-OH에 연결될 수 있다.
시알릴 올리고당 유도체들을 합성하는 보다 바람직한 방법에서, 일반 화학식 1C 및 그의 염들의 X2 기는 시알릴 또는 N-아세틸-락토사민, 락토-N-바이오스, 푸코스 및 시알산으로부터 선택되는 당류 구성 단위들을 갖는 선형 또는 분지형 구조를 나타내는 올리고당으로 선택적으로 치환된 갈락토스이므로, 일반 화학식 1D 및 그의 염들로 표현되는 인유 올리고당 유도체들을 형성하고,
일반 화학식 1D에서, R1은 가수소분해에 의해 제거될 수 있는 기이고, R3는 H 또는 푸코실 단위이고, Y는 서로에 관계없이, 선택적으로 시알릴 및/또는 푸코실 잔기로 치환된 N-아세틸-락토사미닐 기이고, p는 서로에 관계없이 0, 1 또는 2의 정수이고, R4는 일반 화학식 3 및 4에 의하여 특징화되는 기들로부터 선택되고, R5는 H, α-시알릴 모이어티, 일반 화학식 3의 기 및 일반 화학식 4의 기로부터 선택되며,
일반 화학식 3 또는 4에서, R6은 H 또는 푸코실 잔기이고, R7은 H 또는 α-시알릴 모이어티이고, R 기들 중의 하나는 α-시알릴 모이어티, 다른 하나는 H이다.
보다 바람직한 방법에 따라, 상기에서 정의된 바와 같은 일반 화학식 1D의 화합물 및 그의 염들은 그것의 결합부 및 부착된 모이어티들로 특징지을 수 있으며, 이때에
- Y 기(p = 2)의 N-아세틸-락토사미닐 기는, 또 다른 N-아세틸-락토사미닐 기에 부착될 때, 1-3 인터글라이코사이드 결합으로 결합되고,
- 일반 화학식 3의 기는, Y(p = 1, 2)에 부착될 때, 1-3 인터글라이코사이드 결합으로 결합되고,
- 일반 화학식 4의 기는, Y(p = 1, 2)에 부착될 때, 1-3 인터글라이코사이드 결합으로 결합되고,
- 푸코실 잔기는, Y에 존재하는 N-아세틸-락토사미닐 기에 부착되는 경우, 1-3 인터글라이코사이드 결합으로 N-아세틸-락토사미닐 기의 N-아세틸-글루코사민에 연결되고,
- α-시알릴 잔기는, Y에 존재하는 N-아세틸-락토사미닐 기에 부착되는 경우, 2-6 인터글라이코사이드 결합으로 N-아세틸-락토사미닐 기의 갈락토스에 연결된다.
추가적인 측면에서, 상기에서 정의된 바와 같은 일반 화학식 1B, 1C 또는 1D의 화합물 및 그의 염들은 선택적으로 하나 이상의 시알릴 및/ 또는 푸코실 잔기로 치환되고, 말단 갈락토실 잔기의 3-OH 또는 6-OH에 시알릴 치환기를 갖는 락토스, 락토-N-네오테트라오스, 파라-락토-N-헥사오스, 파라-락토-N-네오헥사오스, 락토-N-네오헥사오스, 파라-락토-N-옥타오스, 락토-N-네오옥타오스, 락토-N-테트라오스, 락토-N-헥사오스, 락토-N-옥타오스, 아이소-락토-N-옥타오스, 락토-N-데카오스 및 락토-N-네오데카오스의 R1-배당체들을 나타낸다. 바람직하게, 상기 시알릴 치환기는 N-아세틸 뉴라미닐 기이다.
특히 바람직하게, 상기에서 정의된 바와 같은 일반 화학식 2B, 2C 또는 2D의 화합물 및 그의 염들은 Neu5Acα2-3Galβ1-4Glc (3'-O-(N-아세틸-뉴라미노실)-락토스(3'-O-(N-acetyl-neuraminosyl)-lactose)), Neu5Acα2-6Galβ1-4Glc (6'-O-(N-아세틸-뉴라미노실)-락토스(6'-O-(N-acetyl-neuraminosyl)-lactose)), Neu5Acα2-3Galβ1-4(Fucα1-3)Glc (3-O-푸코실-3'-O-(N-아세틸-뉴라미노실)-락토스(3-O-fucosyl-3'-O-(N-acetyl-neuraminosyl)-lactose)), Neu5Acα2-3Galβ1-3GlcNAcβ1-3Galβ1-4Glc (LST a), Neu5Acα2-6Galβ1-3GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-3Galβ1-4GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-6Galβ1-4GlcNAcβ1-3Galβ1-4Glc (LST c), Neu5Acα2-3Galβ1-3(Fucα1-4)GlcNAcβ1-3Galβ1-4Glc (FLST a), Neu5Acα2-6Galβ1-3(Fucα1-4)GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-3Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-3Galβ1-3GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-6Galβ1-3GlcNAcβ1-3Galβ1-4(Fucα1-4)Glc, Neu5Acα2-3Galβ1-3(Fucα1-4)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-6Galβ1-3(Fucα1-4)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-3Galβ1-3(Neu5Acα2-6)GlcNAcβ1-3Galβ1-4Glc (DSLNT), Neu5Acα2-6Galβ1-3(Neu5Acα2-6)GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-3Galβ1-3(Neu5Acα2-6)(Fucα1-4)GlcNAcβ1-3Galβ1-4Glc (FDSLNT I), Neu5Acα2-6Galβ1-3(Neu5Acα2-6)(Fucα1-4)GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-3Galβ1-3(Neu5Acα2-6)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc (FDSLNT II), Neu5Acα2-6Galβ1-3(Neu5Acα2-6)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-3Galβ1-4GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-6Galβ1-4GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc (FLST c), Neu5Acα2-3Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-6Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-3Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-6Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4Glc의 R1-배당체들의 군, 또는 그들의 염들로부터 선택된다. 상기 R1-배당체들은 α- 또는 β-아노머들일 수 있다. 바람직하게, 상기 R1-배당체들은 β-아노머들이다.
선택적 α-2-3 시알화의 경우에, 본 출원에서 청구된 상기 트랜스시알화 반응은 일반 화학식 2의 화합물 및 그의 염들로부터 일반 화학식 1-3의 화합물들 및 그의 염들을 생성하고,
일반 화학식 1-3에서, SA는 α-시알릴 모이어티이고, R1은 가수소분해에 의해 제거될 수 있는 보호기이고, n은 0 또는 1의 정수이다. R1 기는 선택적으로 치환된 벤질 및 나프틸메틸 기들을 포함하고, 그 중에서 선택적으로 페닐, 알킬 또는 할로젠으로 치환된 벤질 또는 2-나프틸메틸 기들은 바람직한 R1-기들이고, 그 중에서 비치환된 벤질, 비치환된 2-나프틸메틸, 4-클로로벤질, 3-페닐벤질 또는 4-메틸벤질 기들이 특히 바람직하다.
n이 1일 때, 상기 구조적 성분 X는 탄수화물 링커로서, 선형 또는 분지형 구조를 나타내는 단-, 이-, 삼-, 사-, 오- 또는 올리고당을 의미한다. 상기 탄수화물 링커의 단당류 구성 단위들은 가장 흔히 발생되는 단위들인 글루코스, N-아세틸 글루코사민, 갈락토스, 푸코스 및 시알산을 포함하여, 모든 자연적으로 발생하는 5-, 6-. 또는 9-탄소 함유 당 유도체들일 수 있다.
바람직한 방법에서, n은 1이고, 링커 X는 일반 화학식 1-3A 또는 그의 염들의 시알릴화된 생성물들을 형성하는 화학식 -(X1)m-A-에 대응되고,
일반 화학식 1-3A에서, A는 선택적으로 푸코실로 치환된 D-글루코피라노실 단위이고, X1은 탄수화물 링커를 나타내며, m은 0 또는 1의 정수이다. -OR1 기는 D-글루코피라노실 고리의 아노머성 탄소(C1) 원자에, 바람직하게는 β 배향으로 연결된다.
m이 0일 때, 상기 D-글루코피라노실 단위는 인터글라이코사이드 결합을 통해 A 기에 직접 결합된다. 바람직하게, 상기 인터글라이코사이드 결합은 1-4 결합이고, 따라서 일반 화학식 1-3B의 화합물 및 그의 염들을 형성하고,
화학식 1B에서 R3는 푸코실 또는 H이다. 보다 바람직하게, 상기 갈락토스와 상기 글루코스 부분 사이의 상기 인터글라이코사이드 결합은 β이고, 따라서 3'-O-시알릴-락토스(3'-O-sialyl-lactose) 유도체를 생성한다. 보다 더욱 바람직한 구현예에서는, 아글리콘 -OR1 또한 β 배향이다.
m이 1일 때, 상기 구조적 성분 X1은 탄수화물 링커로서, 선형 또는 분지형 구조를 나타내는 단-, 이-, 삼-, 사-, 오- 또는 올리고당을 의미한다. 상기 탄수화물 링커의 단당류 구성 단위들은 가장 흔히 발생되는 단위들인 글루코스, N-아세틸 글루코사민, 갈락토스, 푸코스 및 시알산을 포함하여, 모든 자연적으로 발생하는 5-, 6-. 또는 9-탄소 함유 당 유도체들일 수 있다. 바람직하게, 링커 X1은 화학식 -B-X2-로 표현되어, 일반 화학식 1-3C의 화합물들 및 그의 염들을 형성하고,
일반 화학식 1-3C에서, B 기는 선택적으로 푸코실 및/또는 시알릴로 치환된 N-아세틸-글루코사미노피라노실 단위이며, 링커 X2는 글루코스, N-아세틸-글루코사민, 갈락토스, 푸코스 및 시알산으로부터 선택되는 단당류 구성 단위들을 갖는 선형 또는 분지형 구조를 나타내는 단-, 이-, 삼-, 사-, 오- 또는 올리고당을 의미한다. -OR1 기는 상기 D-글루코피라노실 고리의 아노머성 탄소(C1) 원자에, 바람직하게는 β 배향으로 연결된다. 바람직하게는, 상기 시알릴-갈락토실 기가 1-3 또는 1-4 인터글라이코사이드 결합을 통해 B 기에 부착되어, 각각 시알릴-락토-N-바이오실 또는 시알릴-N-아세틸-락토사미닐) 말단 삼당류 모이어티들을 형성한다. 보다 더 바람직한 방법에서는, 푸코실 치환기가 단위 B의 3-OH 기 또는 4-OH 기, 및/또는 단위 A의 3-OH 기에 결합될 수 있고/있거나, 시알릴이 단위 B의 6-OH에 연결될 수 있다.
시알릴 올리고당 유도체들을 합성하는 보다 바람직한 방법에서, 일반 화학식 1-3C의 X2 기는 시알릴 또는 N-아세틸-락토사민, 락토-N-바이오스, 푸코스 및 시알산으로부터 선택되는 당류 구성 단위들을 갖는 선형 또는 분지형 구조를 나타내는 올리고당으로 선택적으로 치환된 갈락토스이므로, 일반 화학식 1-3D 및 그의 염들로 표현되는 인유 올리고당 유도체들을 형성하고,
일반 화학식 1-3D에서, R1은 가수소분해에 의해 제거될 수 있는 기이고, R3는 H 또는 푸코실 단위이고, Y는 서로에 관계없이, 선택적으로 시알릴 및/또는 푸코실 잔기로 치환된 N-아세틸-락토사미닐 기이고, p는 서로에 관계없이 0, 1 또는 2의 정수이고, R4'은 일반 화학식 3-3 및 4-3에 의하여 특징화되는 기들로부터 선택되고, R5'은 H, α-시알릴 모이어티, 일반 화학식 3-3의 기 및 일반 화학식 4-3의 기로부터 선택되며,
일반 화학식 3-3 또는 4-3에서, R6은 H 또는 푸코실 잔기이고, R7은 H 또는 α-시알릴 모이어티이고, SA는 α-시알릴 모이어티이다.
보다 바람직한 구현예에 따라, 상기에서 정의된 바와 같은 일반 화학식 1-3D의 화합물 및 그의 염들은 그것의 결합부 및 부착된 모이어티들로 특징지을 수 있으며, 이때에
- Y 기(p = 2)의 N-아세틸-락토사미닐 기는, 또 다른 N-아세틸-락토사미닐 기에 부착될 때, 1-3 인터글라이코사이드 결합으로 결합되고,
- 일반 화학식 3-3의 기는, Y(p = 1, 2)에 부착될 때, 1-3 인터글라이코사이드 결합으로 결합되고,
- 일반 화학식 4-3의 기는, Y(p = 1, 2)에 부착될 때, 1-3 인터글라이코사이드 결합으로 결합되고,
- 푸코실 잔기는, Y에 존재하는 N-아세틸-락토사미닐 기에 부착되는 경우, 1-3 인터글라이코사이드 결합으로 N-아세틸-락토사미닐 기의 N-아세틸-글루코사민에 연결되고,
- α-시알릴 잔기는, Y에 존재하는 N-아세틸-락토사미닐 기에 부착되는 경우, 2-6 인터글라이코사이드 결합으로 N-아세틸-락토사미닐 기의 갈락토스에 연결된다.
추가적인 측면에서, 상기에서 정의된 바와 같은 일반 화학식 1-3B, 1-3C 또는 1-3D의 화합물 및 그의 염들은 선택적으로 하나 이상의 시알릴 및/ 또는 푸코실 잔기로 치환되고, 말단 갈락토실 잔기의 3-OH에 시알릴 치환기를 갖는 락토스, 락토-N-네오테트라오스, 파라-락토-N-헥사오스, 파라-락토-N-네오헥사오스, 락토-N-네오헥사오스, 파라-락토-N-옥타오스, 락토-N-네오옥타오스, 락토-N-테트라오스, 락토-N-헥사오스, 락토-N-옥타오스, 아이소-락토-N-옥타오스, 락토-N-데카오스 및 락토-N-네오데카오스의 R1-배당체들을 나타낸다. 바람직하게, 상기 시알릴 치환기(들)는 N-아세틸 뉴라미닐 기(들)이다.
특히 바람직하게, 상기에서 정의된 바와 같은 일반 화학식 1-3B, 1-3C 또는 1-3D의 화합물 또는 그의 염들은 Neu5Acα2-3Galβ1-4Glc (3'-O-(N-아세틸-뉴라미노실)-락토스(3'-O-(N-acetyl-neuraminosyl)-lactose)), Neu5Acα2-3Galβ1-4(Fucα1-3)Glc (3-O-푸코실-3'-O-(N-아세틸-뉴라미노실)-락토스(3-O-fucosyl-3'-O-(N-acetyl-neuraminosyl)-lactose)), Neu5Acα2-3Galβ1-3GlcNAcβ1-3Galβ1-4Glc (LST a), Neu5Acα2-3Galβ1-4GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-3Galβ1-3(Fucα1-4)GlcNAcβ1-3Galβ1-4Glc (FLST a), Neu5Acα2-3Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-3Galβ1-3GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-3Galβ1-3(Fucα1-4)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-3Galβ1-3(Neu5Acα2-6)GlcNAcβ1-3Galβ1-4Glc (DSLNT), Neu5Acα2-3Galβ1-3(Neu5Acα2-6)(Fucα1-4)GlcNAcβ1-3Galβ1-4Glc (FDSLNT I), Neu5Acα2-3Galβ1-3(Neu5Acα2-6)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc (FDSLNT II), Neu5Acα2-3Galβ1-4GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-3Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-3Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4Glc의 R1-배당체들의 군, 또는 그들의 염들로부터 선택된다. 상기 R1-배당체들은 α- 또는 β-아노머들일 수 있다. 바람직하게, 상기 R1-배당체들은 β-아노머들이다.
선택적 α-2-6 시알화의 경우에, 본 출원에서 청구된 상기 트랜스시알화 반응은 일반 화학식 2의 화합물 및 그의 염들로부터 일반 화학식 1-6의 화합물들 및 그의 염들을 생성하고,
일반 화학식 1-6에서, SA는 α-시알릴 모이어티이고, X는 탄수화물 링커를 나타내고, R1은 가수소분해에 의해 제거될 수 있는 보호기이고, n은 0 또는 1의 정수이다. R1 기는 선택적으로 치환된 벤질 및 나프틸메틸 기들을 포함하고, 그 중에서 선택적으로 페닐, 알킬 또는 할로젠으로 치환된 벤질 또는 2-나프틸메틸 기들은 바람직한 R1-기들이고, 그 중에서 비치환된 벤질, 비치환된 2-나프틸메틸, 4-클로로벤질, 3-페닐벤질 또는 4-메틸벤질 기들이 특히 바람직하다.
n이 1일 때, 상기 구조적 성분 X는 탄수화물 링커로서, 선형 또는 분지형 구조를 나타내는 단-, 이-, 삼-, 사-, 오- 또는 올리고당을 의미한다. 상기 탄수화물 링커의 단당류 구성 단위들은 가장 흔히 발생되는 단위들인 글루코스, N-아세틸 글루코사민, 갈락토스, 푸코스 및 시알산을 포함하여, 모든 자연적으로 발생하는 5-, 6-. 또는 9-탄소 함유 당 유도체들일 수 있다.
바람직한 방법에서, n은 1이고, 링커 X는 일반 화학식 1-6A 또는 그의 염들의 시알릴화된 생성물들을 형성하는 화학식 -(X1)m-A-에 대응되고,
일반 화학식 1-6A에서, A는 선택적으로 푸코실로 치환된 D-글루코피라노실 단위이고, X1은 탄수화물 링커를 나타내며, m은 0 또는 1의 정수이다. -OR1 기는 D-글루코피라노실 고리의 아노머성 탄소(C1) 원자에, 바람직하게는 β 배향으로 연결된다.
m이 0일 때, 상기 D-글루코피라노실 단위는 인터글라이코사이드 결합을 통해 A 기에 직접 결합된다. 바람직하게, 상기 인터글라이코사이드 결합은 1-4 결합이고, 따라서 일반 화학식 1-6B의 화합물들 및 그의 염들을 형성하고,
화학식 1-6B에서 R3는 푸코실 또는 H이다. 보다 바람직하게, 상기 갈락토스와 상기 글루코스 부분 사이의 상기 인터글라이코사이드 결합은 β이고, 따라서 3'-O-시알릴-락토스(3'-O-sialyl-lactose) 유도체를 생성한다. 보다 더욱 바람직한 구현예에서는, 아글리콘 -OR1 또한 β 배향이다.
m이 1일 때, 상기 구조적 성분 X1은 탄수화물 링커로서, 선형 또는 분지형 구조를 나타내는 단-, 이-, 삼-, 사-, 오- 또는 올리고당을 의미한다. 상기 탄수화물 링커의 단당류 구성 단위들은 가장 흔히 발생되는 단위들인 글루코스, N-아세틸 글루코사민, 갈락토스, 푸코스 및 시알산을 포함하여, 모든 자연적으로 발생하는 5-, 6-. 또는 9-탄소 함유 당 유도체들일 수 있다. 바람직하게, 링커 X1은 화학식 -B-X2-로 표현되어, 일반 화학식 1-6C의 화합물들 및 그의 염들을 형성하고,
일반 화학식 1-6C에서, B 기는 선택적으로 푸코실 및/또는 시알릴로 치환된 N-아세틸-글루코사미노피라노실 단위이며, 링커 X2는 글루코스, N-아세틸-글루코사민, 갈락토스, 푸코스 및 시알산으로부터 선택되는 단당류 구성 단위들을 갖는 선형 또는 분지형 구조를 나타내는 단-, 이-, 삼-, 사-, 오- 또는 올리고당을 의미한다. -OR1 기는 상기 D-글루코피라노실 고리의 아노머성 탄소(C1) 원자에, 바람직하게는 β 배향으로 연결된다. 바람직하게는, 상기 시알릴-갈락토실 기가 1-3 또는 1-4 인터글라이코사이드 결합을 통해 B 기에 부착되어, 각각 시알릴-락토-N-바이오실 또는 시알릴-N-아세틸-락토사미닐) 말단 삼당류 모이어티들을 형성한다. 보다 더 바람직한 방법에서는, 푸코실 치환기가 단위 B의 3-OH 기 또는 4-OH 기, 및/또는 단위 A의 3-OH 기에 결합될 수 있고/있거나, 시알릴이 단위 B의 6-OH에 연결될 수 있다.
시알릴 올리고당 유도체들을 합성하는 보다 바람직한 방법에서, 일반 화학식 1-6C의 X2 기는 시알릴 또는 N-아세틸-락토사민, 락토-N-바이오스, 푸코스 및 시알산으로부터 선택되는 당류 구성 단위들을 갖는 선형 또는 분지형 구조를 나타내는 올리고당으로 선택적으로 치환된 갈락토스이므로, 일반 화학식 1-3D 및 그의 염들로 표현되는 인유 올리고당 유도체들을 형성하고,
일반 화학식 1-6D에서, R1은 가수소분해에 의해 제거될 수 있는 기이고, R3는 H 또는 푸코실 단위이고, Y는 서로에 관계없이, 선택적으로 시알릴 및/또는 푸코실 잔기로 치환된 N-아세틸-락토사미닐 기이고, p는 서로에 관계없이 0, 1 또는 2의 정수이고, R4''은 일반 화학식 3-6 및 4-6에 의하여 특징화되는 기들로부터 선택되고, R5''은 H, α-시알릴 모이어티, 일반 화학식 3-6의 기 및 일반 화학식 4-6의 기로부터 선택되며,
일반 화학식 3-6 또는 4-6에서, R6은 H 또는 푸코실 잔기이고, R7은 H 또는 α-시알릴 모이어티이고, SA는 α-시알릴 모이어티이다.
보다 바람직한 구현예에 따라, 상기에서 정의된 바와 같은 일반 화학식 1-6D의 화합물 및 그의 염들은 그것의 결합부 및 부착된 모이어티들로 특징지을 수 있으며, 이때에
- Y 기(p = 2)의 N-아세틸-락토사미닐 기는, 또 다른 N-아세틸-락토사미닐 기에 부착될 때, 1-3 인터글라이코사이드 결합으로 결합되고,
- 일반 화학식 3-6의 기는, Y(p = 1, 2)에 부착될 때, 1-3 인터글라이코사이드 결합으로 결합되고,
- 일반 화학식 4-6의 기는, Y(p = 1, 2)에 부착될 때, 1-3 인터글라이코사이드 결합으로 결합되고,
- 푸코실 잔기는, Y에 존재하는 N-아세틸-락토사미닐 기에 부착되는 경우, 1-3 인터글라이코사이드 결합으로 N-아세틸-락토사미닐 기의 N-아세틸-글루코사민에 연결되고,
- α-시알릴 잔기는, Y에 존재하는 N-아세틸-락토사미닐 기에 부착되는 경우, 2-6 인터글라이코사이드 결합으로 N-아세틸-락토사미닐 기의 갈락토스에 연결된다.
추가적인 측면에서, 상기에서 정의된 바와 같은 일반 화학식 1-6B, 1-6C 또는 1-6D의 화합물 및 그의 염들은 선택적으로 하나 이상의 시알릴 및/ 또는 푸코실 잔기로 치환되고, 말단 갈락토실 잔기의 6-OH에 시알릴 치환기를 갖는 락토스, 락토-N-네오테트라오스, 파라-락토-N-헥사오스, 파라-락토-N-네오헥사오스, 락토-N-네오헥사오스, 파라-락토-N-옥타오스, 락토-N-네오옥타오스, 락토-N-테트라오스, 락토-N-헥사오스, 락토-N-옥타오스, 아이소-락토-N-옥타오스, 락토-N-데카오스 및 락토-N-네오데카오스의 R1-배당체들을 나타낸다. 바람직하게, 상기 시알릴 치환기(들)는 N-아세틸 뉴라미닐 기(들)이다.
특히 바람직하게, 상기에서 정의된 바와 같은 일반 화학식 1-6B, 1-6C 또는 1-6D의 화합물 또는 그의 염들은 Neu5Acα2-6Galβ1-4Glc (6'-O-(N-아세틸-뉴라미노실)-락토스(6'-O-(N-acetyl-neuraminosyl)-lactose)), Neu5Acα2-6Galβ1-4(Fucα1-3)Glc, Neu5Acα2-6Galβ1-3GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-6Galβ1-4GlcNAcβ1-3Galβ1-4Glc (LST c), Neu5Acα2-6Galβ1-3(Fucα1-4)GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-6Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-6Galβ1-3GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-6Galβ1-3(Fucα1-4)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-6Galβ1-3(Neu5Acα2-6)GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-6Galβ1-3(Neu5Acα2-6)(Fucα1-4)GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-6Galβ1-3(Neu5Acα2-6)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-6Galβ1-4GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc (FLST c), Neu5Acα2-6Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-6Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4Glc의 R1-배당체들의 군, 또는 그들의 염들로부터 선택된다. 상기 R1-배당체들은 α- 또는 β-아노머들일 수 있다. 바람직하게, 상기 R1-배당체들은 β-아노머들이다.
상기와 같이 정의되고 본 출원에 개시된 트랜스시알화 반응에서 수득될 수 있는 일반 화학식 1에 의해 표현되는 일부 화합물들 및 그의 염들은 신규하다. 따라서, 본 발명은 일반 화학식 1A'의 신규한 화합물들 및 그의 염들을 제공하는 것에 관한 것이고,
일반 화학식 1A'에서 R 기들 중의 하나는 α-시알릴 모이어티, 다른 하나는 H이고, R1은 가수소분해에 의해 제거될 수 있는 보호기이고, A는 선택적으로 푸코실로 치환된 D-글루코피라노실 단위이고, m은 0 또는 1의 정수이고, X1은 탄수화물 링커를 나타내며, 단, 3'-O-(N-아세틸-뉴라미노실)-락토스((3'-O-(N-acetyl-neuraminosyl)-lactose)) 나트륨 염의 1-O-β-벤질(1-O-β-benzyl) 및 1-O-β-(4,5-다이메톡시-2-나이트로)-벤질(1-O-β-(4,5-dimethoxy-2-nitro)-benzyl) 배당체들, 및 6'-O-(N-아세틸-뉴라미노실)-락토스(6'-O-(N-acetyl-neuraminosyl)-lactose) 나트륨 염의 1-O-β-벤질(1-O-β-benzyl) 배당체는 제외된다. -OR1 기는 상기 D-글루코피라노실 고리의 아노머성 탄소(C1) 원자에, 바람직하게는 β 배향으로 연결된다.
m이 0일 때, 상기 D-글루코피라노실 단위는 인터글라이코사이드 결합을 통해 A 기에 직접 결합된다. 바람직하게, 상기 인터글라이코사이드 결합은 1-4 결합이고, 따라서 일반 화학식 1B'의 화합물 및 그의 염들을 형성하고,
화학식 1B'에서 R3는 푸코실 또는 H이다. 보다 바람직하게, 상기 갈락토스와 상기 글루코스 부분 사이의 상기 인터글라이코사이드 결합은 β이고, 따라서 3'-O-시알릴-락토스(3'-O-sialyl-lactose) 유도체를 생성한다. 보다 더욱 바람직한 구현예에서는, 아글리콘 -OR1 또한 β 배향이다.
m이 1일 때, 상기 구조적 성분 X1은 탄수화물 링커로서, 선형 또는 분지형 구조를 나타내는 단-, 이-, 삼-, 사-, 오- 또는 올리고당을 의미한다. 상기 탄수화물 링커의 단당류 구성 단위들은 가장 흔히 발생되는 단위들인 글루코스, N-아세틸 글루코사민, 갈락토스, 푸코스 및 시알산을 포함하여, 모든 자연적으로 발생하는 5-, 6-. 또는 9-탄소 함유 당 유도체들일 수 있다. 바람직하게, 링커 X1은 화학식 -B-X2-로 표현되어, 일반 화학식 1C'의 화합물들 및 그의 염들을 형성하고,
일반 화학식 1C'에서, B 기는 선택적으로 푸코실 및/또는 시알릴로 치환된 N-아세틸-글루코사미노피라노실 단위이며, 링커 X2는 글루코스, N-아세틸-글루코사민, 갈락토스, 푸코스 및 시알산으로부터 선택되는 단당류 구성 단위들을 갖는 선형 또는 분지형 구조를 나타내는 단-, 이-, 삼-, 사-, 오- 또는 올리고당을 의미한다. -OR1 기는 상기 D-글루코피라노실 고리의 아노머성 탄소(C1) 원자에, 바람직하게는 β 배향으로 연결된다. 바람직하게는, 상기 시알릴-갈락토실 기가 1-3 또는 1-4 인터글라이코사이드 결합을 통해 B 기에 부착되어, 각각 시알릴-락토-N-바이오실 또는 시알릴-N-아세틸-락토사미닐) 말단 삼당류 모이어티들을 형성한다. 보다 더 바람직한 방법에서는, 푸코실 치환기가 단위 B의 3-OH 기 또는 4-OH 기, 및/또는 단위 A의 3-OH 기에 결합될 수 있고/있거나, 시알릴이 단위 B의 6-OH에 연결될 수 있다.
보다 바람직한 구현예에서, 일반 화학식 1C'의 화합물들 및 그의 염들의 X2 기는 시알릴 또는 N-아세틸-락토사민, 락토-N-바이오스, 푸코스 및 시알산으로부터 선택되는 당류 구성 단위들을 갖는 선형 또는 분지형 구조를 나타내는 올리고당으로 선택적으로 치환된 갈락토스이므로, 일반 화학식 1D' 및 그의 염들로 표현되는 인유 올리고당 유도체들을 형성하고,
일반 화학식 1D'에서, R1은 가수소분해에 의해 제거될 수 있는 기이고, R3는 H 또는 푸코실 단위이고, Y는 서로에 관계없이, 선택적으로 시알릴 및/또는 푸코실 잔기로 치환된 N-아세틸-락토사미닐 기이고, p는 서로에 관계없이 0, 1 또는 2의 정수이고, R4는 일반 화학식 3 및 4에 의하여 특징화되는 기들로부터 선택되고, R5는 H, α-시알릴 모이어티, 일반 화학식 3의 기 및 일반 화학식 4의 기로부터 선택되며,
일반 화학식 3 또는 4에서, R6은 H 또는 푸코실 잔기이고, R7은 H 또는 α-시알릴 모이어티이고, R 기들 중의 하나는 α-시알릴 모이어티, 다른 하나는 H이다.
보다 바람직한 구현예에 따라, 상기에서 정의된 바와 같은 일반 화학식 1D'의 화합물 및 그의 염들은 그것의 결합부 및 부착된 모이어티들로 특징지을 수 있으며, 이때에
- Y 기(p = 2)의 N-아세틸-락토사미닐 기는, 또 다른 N-아세틸-락토사미닐 기에 부착될 때, 1-3 인터글라이코사이드 결합으로 결합되고,
- 일반 화학식 3의 기는, Y(p = 1, 2)에 부착될 때, 1-3 인터글라이코사이드 결합으로 결합되고,
- 일반 화학식 4의 기는, Y(p = 1, 2)에 부착될 때, 1-3 인터글라이코사이드 결합으로 결합되고,
- 푸코실 잔기는, Y에 존재하는 N-아세틸-락토사미닐 기에 부착되는 경우, 1-3 인터글라이코사이드 결합으로 N-아세틸-락토사미닐 기의 N-아세틸-글루코사민에 연결되고,
- α-시알릴 잔기는, Y에 존재하는 N-아세틸-락토사미닐 기에 부착되는 경우, 2-6 인터글라이코사이드 결합으로 N-아세틸-락토사미닐 기의 갈락토스에 연결된다.
추가적인 측면에서, 상기에서 정의된 바와 같은 일반 화학식 1B', 1C' 또는 1D'의 화합물 및 그의 염들은 선택적으로 하나 이상의 시알릴 및/ 또는 푸코실 잔기로 치환되고, 말단 갈락토실 잔기의 3-OH 또는 6-OH에 시알릴 치환기를 갖는 락토스, 락토-N-네오테트라오스, 파라-락토-N-헥사오스, 파라-락토-N-네오헥사오스, 락토-N-네오헥사오스, 파라-락토-N-옥타오스, 락토-N-네오옥타오스, 락토-N-테트라오스, 락토-N-헥사오스, 락토-N-옥타오스, 아이소-락토-N-옥타오스, 락토-N-데카오스 및 락토-N-네오데카오스의 R1-배당체들, 및 그의 염들로 구성된 군으로부터 선택된다. 바람직하게, 상기 시알릴 치환기(들)는 N-아세틸 뉴라미닐 기(들)이다.
특히 바람직하게, 상기에서 정의된 바와 같은 일반 화학식 1A'의 화합물 및 그의 염들은 Neu5Acα2-3Galβ1-4Glc (3'-O-(N-아세틸-뉴라미노실)-락토스(3'-O-(N-acetyl-neuraminosyl)-lactose)), Neu5Acα2-6Galβ1-4Glc (6'-O-(N-아세틸-뉴라미노실)-락토스(6'-O-(N-acetyl-neuraminosyl)-lactose)), Neu5Acα2-3Galβ1-4(Fucα1-3)Glc (3-O-푸코실-3'-O-(N-아세틸-뉴라미노실)-락토스(3-O-fucosyl-3'-O-(N-acetyl-neuraminosyl)-lactose)), Neu5Acα2-3Galβ1-3GlcNAcβ1-3Galβ1-4Glc (LST a), Neu5Acα2-6Galβ1-3GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-3Galβ1-4GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-6Galβ1-4GlcNAcβ1-3Galβ1-4Glc (LST c), Neu5Acα2-3Galβ1-3(Fucα1-4)GlcNAcβ1-3Galβ1-4Glc (FLST a), Neu5Acα2-6Galβ1-3(Fucα1-4)GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-3Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-3Galβ1-3GlcNAcβ1-3Galβ1-4(Fucα1-4)Glc, Neu5Acα2-6Galβ1-3GlcNAcβ1-3Galβ1-4(Fucα1-4)Glc, Neu5Acα2-3Galβ1-3(Fucα1-4)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-6Galβ1-3(Fucα1-4)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-3Galβ1-3(Neu5Acα2-6)GlcNAcβ1-3Galβ1-4Glc (DSLNT), Neu5Acα2-6Galβ1-3(Neu5Acα2-6)GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-3Galβ1-3(Neu5Acα2-6)(Fucα1-4)GlcNAcβ1-3Galβ1-4Glc (FDSLNT I), Neu5Acα2-6Galβ1-3(Neu5Acα2-6)(Fucα1-4)GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-3Galβ1-3(Neu5Acα2-6)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc (FDSLNT II), Neu5Acα2-6Galβ1-3(Neu5Acα2-6)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-3Galβ1-4GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-6Galβ1-4GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc (FLST c), Neu5Acα2-3Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-6Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-3Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-6Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4Glc의 R1-배당체들의 군, 또는 그들의 염들로부터 선택된다. 상기 R1-배당체들은 α- 또는 β-아노머들일 수 있다. 바람직하게, 상기 R1-배당체들은 β-아노머들이다.
일반 화학식 1A'의 화합물들 및 그의 염들의 군 내의 일 구현예에서, 본 발명은 일반 화학식 1'-3A의 α-2-3 시알릴화된 화합물들 및 그의 염들에 관한 것이고,
일반 화학식 1'-3A에서, A는 선택적으로 푸코실로 치환된 D-글루코피라노실 단위이고, X1은 탄수화물 링커를 나타내며, m은 0 또는 1의 정수이다. -OR1 기는 D-글루코피라노실 고리의 아노머성 탄소(C1) 원자에, 바람직하게는 β 배향으로 연결된다.
m이 0일 때, 상기 D-글루코피라노실 단위는 인터글라이코사이드 결합을 통해 A 기에 직접 결합된다. 바람직하게, 상기 인터글라이코사이드 결합은 1-4 결합이고, 따라서 일반 화학식 1'-3B의 화합물 및 그의 염들을 형성하고,
화학식 1'-3B에서 R3는 푸코실 또는 H이다. 보다 바람직하게, 상기 갈락토스와 상기 글루코스 부분 사이의 상기 인터글라이코사이드 결합은 β이고, 따라서 3'-O-시알릴-락토스(3'-O-sialyl-lactose) 유도체를 생성한다. 보다 더욱 바람직한 구현예에서는, 아글리콘 -OR1 또한 β 배향이다.
m이 1일 때, 상기 구조적 성분 X1은 탄수화물 링커로서, 선형 또는 분지형 구조를 나타내는 단-, 이-, 삼-, 사-, 오- 또는 올리고당을 의미한다. 상기 탄수화물 링커의 단당류 구성 단위들은 가장 흔히 발생되는 단위들인 글루코스, N-아세틸 글루코사민, 갈락토스, 푸코스 및 시알산을 포함하여, 모든 자연적으로 발생하는 5-, 6-. 또는 9-탄소 함유 당 유도체들일 수 있다. 바람직하게, 링커 X1은 화학식 -B-X2-로 표현되어, 일반 화학식 1'-3C의 화합물들 및 그의 염들을 형성하고,
일반 화학식 1'-3C에서, B 기는 선택적으로 푸코실 및/또는 시알릴로 치환된 N-아세틸-글루코사미노피라노실 단위이며, 링커 X2는 글루코스, N-아세틸-글루코사민, 갈락토스, 푸코스 및 시알산으로부터 선택되는 단당류 구성 단위들을 갖는 선형 또는 분지형 구조를 나타내는 단-, 이-, 삼-, 사-, 오- 또는 올리고당을 의미한다. -OR1 기는 상기 D-글루코피라노실 고리의 아노머성 탄소(C1) 원자에, 바람직하게는 β 배향으로 연결된다. 바람직하게는, 상기 시알릴-갈락토실 기가 1-3 또는 1-4 인터글라이코사이드 결합을 통해 B 기에 부착되어, 각각 시알릴-락토-N-바이오실 또는 시알릴-N-아세틸-락토사미닐) 말단 삼당류 모이어티들을 형성한다. 보다 더 바람직한 방법에서는, 푸코실 치환기가 단위 B의 3-OH 기 또는 4-OH 기, 및/또는 단위 A의 3-OH 기에 결합될 수 있고/있거나, 시알릴이 단위 B의 6-OH에 연결될 수 있다.
보다 바람직한 구현예에서, 일반 화학식 1'-3C의 X2 기는 시알릴 또는 N-아세틸-락토사민, 락토-N-바이오스, 푸코스 및 시알산으로부터 선택되는 당류 구성 단위들을 갖는 선형 또는 분지형 구조를 나타내는 올리고당으로 선택적으로 치환된 갈락토스이므로, 일반 화학식 1'-3D 및 그의 염들로 표현되는 인유 올리고당 유도체들을 형성하고,
일반 화학식 1'-3D에서, R1은 가수소분해에 의해 제거될 수 있는 기이고, R3는 H 또는 푸코실 단위이고, Y는 서로에 관계없이, 선택적으로 시알릴 및/또는 푸코실 잔기로 치환된 N-아세틸-락토사미닐 기이고, p는 서로에 관계없이 0, 1 또는 2의 정수이고, R4'은 일반 화학식 3-3 및 4-3에 의하여 특징화되는 기들로부터 선택되고, R5'은 H, α-시알릴 모이어티, 일반 화학식 3-3의 기 및 일반 화학식 4-3의 기로부터 선택되며,
일반 화학식 3-3 또는 4-3에서, R6은 H 또는 푸코실 잔기이고, R7은 H 또는 α-시알릴 모이어티이고, SA는 α-시알릴 모이어티이다.
보다 바람직한 구현예에 따라, 상기에서 정의된 바와 같은 일반 화학식 1'-3D의 화합물 및 그의 염들은 그것의 결합부 및 부착된 모이어티들로 특징지을 수 있으며, 이때에
- Y 기(p = 2)의 N-아세틸-락토사미닐 기는, 또 다른 N-아세틸-락토사미닐 기에 부착될 때, 1-3 인터글라이코사이드 결합으로 결합되고,
- 일반 화학식 3-3의 기는, Y(p = 1, 2)에 부착될 때, 1-3 인터글라이코사이드 결합으로 결합되고,
- 일반 화학식 4-3의 기는, Y(p = 1, 2)에 부착될 때, 1-3 인터글라이코사이드 결합으로 결합되고,
- 푸코실 잔기는, Y에 존재하는 N-아세틸-락토사미닐 기에 부착되는 경우, 1-3 인터글라이코사이드 결합으로 N-아세틸-락토사미닐 기의 N-아세틸-글루코사민에 연결되고,
- α-시알릴 잔기는, Y에 존재하는 N-아세틸-락토사미닐 기에 부착되는 경우, 2-6 인터글라이코사이드 결합으로 N-아세틸-락토사미닐 기의 갈락토스에 연결된다.
추가적인 측면에서, 상기에서 정의된 바와 같은 일반 화학식 1'-3B, 1'-3C 또는 1'-3D의 화합물 및 그의 염들은 선택적으로 하나 이상의 시알릴 및/ 또는 푸코실 잔기로 치환되고, 말단 갈락토실 잔기의 3-OH에 시알릴 치환기를 갖는 락토스, 락토-N-네오테트라오스, 파라-락토-N-헥사오스, 파라-락토-N-네오헥사오스, 락토-N-네오헥사오스, 파라-락토-N-옥타오스, 락토-N-네오옥타오스, 락토-N-테트라오스, 락토-N-헥사오스, 락토-N-옥타오스, 아이소-락토-N-옥타오스, 락토-N-데카오스 및 락토-N-네오데카오스의 R1-배당체들, 및 그의 염들로 구성된 군으로부터 선택된다. 바람직하게, 상기 시알릴 치환기(들)는 N-아세틸 뉴라미닐 기(들)이다.
특히 바람직하게, 상기에서 정의된 바와 같은 일반 화학식 1'-3B, 1'-3C 또는 1'-3D의 화합물 또는 그의 염들은 Neu5Acα2-3Galβ1-4Glc (3'-O-(N-아세틸-뉴라미노실)-락토스(3'-O-(N-acetyl-neuraminosyl)-lactose)), Neu5Acα2-3Galβ1-4(Fucα1-3)Glc (3-O-푸코실-3'-O-(N-아세틸-뉴라미노실)-락토스(3-O-fucosyl-3'-O-(N-acetyl-neuraminosyl)-lactose)), Neu5Acα2-3Galβ1-3GlcNAcβ1-3Galβ1-4Glc (LST a), Neu5Acα2-3Galβ1-4GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-3Galβ1-3(Fucα1-4)GlcNAcβ1-3Galβ1-4Glc (FLST a), Neu5Acα2-3Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-3Galβ1-3GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-3Galβ1-3(Fucα1-4)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-3Galβ1-3(Neu5Acα2-6)GlcNAcβ1-3Galβ1-4Glc (DSLNT), Neu5Acα2-3Galβ1-3(Neu5Acα2-6)(Fucα1-4)GlcNAcβ1-3Galβ1-4Glc (FDSLNT I), Neu5Acα2-3Galβ1-3(Neu5Acα2-6)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc (FDSLNT II), Neu5Acα2-3Galβ1-4GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-3Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-3Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4Glc의 R1-배당체들의 군, 또는 그들의 염들로부터 선택된다. 상기 R1-배당체들은 α- 또는 β-아노머들일 수 있다. 바람직하게, 상기 R1-배당체들은 β-아노머들이다.
일반 화학식 1A'의 화합물들 및 그의 염들의 군 내의 또 다른 구현예에서, 본 발명은 일반 화학식 1'-6A의 α-2-6 시알릴화된 화합물들 및 그의 염들에 관한 것이고,
일반 화학식 1'-6A에서, A는 선택적으로 푸코실로 치환된 D-글루코피라노실 단위이고, X1은 탄수화물 링커를 나타내며, m은 0 또는 1의 정수이다. -OR1 기는 D-글루코피라노실 고리의 아노머성 탄소(C1) 원자에, 바람직하게는 β 배향으로 연결된다.
m이 0일 때, 상기 D-글루코피라노실 단위는 인터글라이코사이드 결합을 통해 A 기에 직접 결합된다. 바람직하게, 상기 인터글라이코사이드 결합은 1-4 결합이고, 따라서 일반 화학식 1'-6B의 화합물 및 그의 염들을 형성하고,
일반 화학식 1'-6B에서 R3는 푸코실 또는 H이다. 보다 바람직하게, 상기 갈락토스와 상기 글루코스 부분 사이의 상기 인터글라이코사이드 결합은 β이고, 따라서 3'-O-시알릴-락토스(3'-O-sialyl-lactose) 유도체를 생성한다. 보다 더욱 바람직한 구현예에서는, 아글리콘 -OR1 또한 β 배향이다.
m이 1일 때, 상기 구조적 성분 X1은 탄수화물 링커로서, 선형 또는 분지형 구조를 나타내는 단-, 이-, 삼-, 사-, 오- 또는 올리고당을 의미한다. 상기 탄수화물 링커의 단당류 구성 단위들은 가장 흔히 발생되는 단위들인 글루코스, N-아세틸 글루코사민, 갈락토스, 푸코스 및 시알산을 포함하여, 모든 자연적으로 발생하는 5-, 6-. 또는 9-탄소 함유 당 유도체들일 수 있다. 바람직하게, 링커 X1은 화학식 -B-X2-로 표현되어, 일반 화학식 1'-6C의 화합물들 및 그의 염들을 형성하고,
일반 화학식 1'-6C에서, B 기는 선택적으로 푸코실 및/또는 시알릴로 치환된 N-아세틸-글루코사미노피라노실 단위이며, 링커 X2는 글루코스, N-아세틸-글루코사민, 갈락토스, 푸코스 및 시알산으로부터 선택되는 단당류 구성 단위들을 갖는 선형 또는 분지형 구조를 나타내는 단-, 이-, 삼-, 사-, 오- 또는 올리고당을 의미한다. -OR1 기는 상기 D-글루코피라노실 고리의 아노머성 탄소(C1) 원자에, 바람직하게는 β 배향으로 연결된다. 바람직하게는, 상기 시알릴-갈락토실 기가 1-3 또는 1-4 인터글라이코사이드 결합을 통해 B 기에 부착되어, 각각 시알릴-락토-N-바이오실 또는 시알릴-N-아세틸-락토사미닐) 말단 삼당류 모이어티들을 형성한다. 보다 더 바람직한 방법에서는, 푸코실 치환기가 단위 B의 3-OH 기 또는 4-OH 기, 및/또는 단위 A의 3-OH 기에 결합될 수 있고/있거나, 시알릴이 단위 B의 6-OH에 연결될 수 있다.
보다 바람직한 구현예에서, 일반 화학식 1'-6C의 화합물 및 그의 염의 X2 기는 시알릴 또는 N-아세틸-락토사민, 락토-N-바이오스, 푸코스 및 시알산으로부터 선택되는 당류 구성 단위들을 갖는 선형 또는 분지형 구조를 나타내는 올리고당으로 선택적으로 치환된 갈락토스이므로, 일반 화학식 1'-6D 및 그의 염들로 표현되는 인유 올리고당 유도체들을 형성하고,
일반 화학식 1'-6D에서, R1은 가수소분해에 의해 제거될 수 있는 기이고, R3는 H 또는 푸코실 단위이고, Y는 서로에 관계없이, 선택적으로 시알릴 및/또는 푸코실 잔기로 치환된 N-아세틸-락토사미닐 기이고, p는 서로에 관계없이 0, 1 또는 2의 정수이고, R4''은 일반 화학식 3-6 및 4-6에 의하여 특징화되는 기들로부터 선택되고, R5''은 H, α-시알릴 모이어티, 일반 화학식 3-6의 기 및 일반 화학식 4-6의 기로부터 선택되며,
일반 화학식 3-6 또는 4-6에서, R6은 H 또는 푸코실 잔기이고, R7은 H 또는 α-시알릴 모이어티이고, SA는 α-시알릴 모이어티이다.
보다 바람직한 구현예에 따라, 상기에서 정의된 바와 같은 일반 화학식 1'-6D의 화합물 및 그의 염들은 그것의 결합부 및 부착된 모이어티들로 특징지을 수 있으며, 이때에
- Y 기(p = 2)의 N-아세틸-락토사미닐 기는, 또 다른 N-아세틸-락토사미닐 기에 부착될 때, 1-3 인터글라이코사이드 결합으로 결합되고,
- 일반 화학식 3-6의 기는, Y(p = 1, 2)에 부착될 때, 1-3 인터글라이코사이드 결합으로 결합되고,
- 일반 화학식 4-6의 기는, Y(p = 1, 2)에 부착될 때, 1-3 인터글라이코사이드 결합으로 결합되고,
- 푸코실 잔기는, Y에 존재하는 N-아세틸-락토사미닐 기에 부착되는 경우, 1-3 인터글라이코사이드 결합으로 N-아세틸-락토사미닐 기의 N-아세틸-글루코사민에 연결되고,
- α-시알릴 잔기는, Y에 존재하는 N-아세틸-락토사미닐 기에 부착되는 경우, 2-6 인터글라이코사이드 결합으로 N-아세틸-락토사미닐 기의 갈락토스에 연결된다.
추가적인 측면에서, 상기에서 정의된 바와 같은 일반 화학식 1'-6B, 1'-6C 또는 1'-6D의 화합물 및 그의 염들은 선택적으로 하나 이상의 시알릴 및/ 또는 푸코실 잔기로 치환되고, 말단 갈락토실 잔기의 6-OH에 시알릴 치환기를 갖는 락토스, 락토-N-네오테트라오스, 파라-락토-N-헥사오스, 파라-락토-N-네오헥사오스, 락토-N-네오헥사오스, 파라-락토-N-옥타오스, 락토-N-네오옥타오스, 락토-N-테트라오스, 락토-N-헥사오스, 락토-N-옥타오스, 아이소-락토-N-옥타오스, 락토-N-데카오스 및 락토-N-네오데카오스의 R1-배당체들, 및 그의 염들을 나타낸다. 바람직하게, 상기 시알릴 치환기(들)는 N-아세틸 뉴라미닐 기(들)이다.
특히 바람직하게, 상기에서 정의된 바와 같은 일반 화학식 1'-6B, 1'-6C 또는 1'-6D의 화합물 또는 그의 염들은 Neu5Acα2-6Galβ1-4Glc (6'-O-(N-아세틸-뉴라미노실)-락토스(6'-O-(N-acetyl-neuraminosyl)-lactose)), Neu5Acα2-6Galβ1-4(Fucα1-3)Glc, Neu5Acα2-6Galβ1-3GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-6Galβ1-4GlcNAcβ1-3Galβ1-4Glc (LST c), Neu5Acα2-6Galβ1-3(Fucα1-4)GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-6Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-6Galβ1-3GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-6Galβ1-3(Fucα1-4)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-6Galβ1-3(Neu5Acα2-6)GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-6Galβ1-3(Neu5Acα2-6)(Fucα1-4)GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-6Galβ1-3(Neu5Acα2-6)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-6Galβ1-4GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc (FLST c), Neu5Acα2-6Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-6Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4Glc의 R1-배당체들의 군, 또는 그들의 염들로부터 선택된다. 상기 R1-배당체들은 α- 또는 β-아노머들일 수 있다. 바람직하게, 상기 R1-배당체들은 β-아노머들이다.
본 발명의 다른 특징들은 본 발명의 구체화를 위해 제공되는 하기의 실험예들의 개시에 의하여 명확해질 것이고, 이에 한정되지 아니한다.
실험예(
EXPERIMENTAL
)
1.
시알릴
수용체들의 합성(
Synthesis
of
sialyl
acceptors
)
A) 일반 과정: 실온에서 락토스(lactose)(5 g, 14.6 mmol) 및 TsOHㅇH2O(0.2 g, 1.05 mmol)가 DMF(20 ml)와 벤즈알데하이드 다이메틸 아세탈(benzaldehyde dimethyl acetal)(5.5 ml, 35.4 mmol, 2.4 eq.)의 혼합물에 일부 첨가되었다. 상기 반응 혼합물은 제습(exclusion of humidity) 하의 70 ℃에서 1시간 동안 강하게 교반되었다. 냉각된 트라이에틸 아민(triethyl amine)(0.15 ml)을 첨가한 다음, 휘발성 구성들(MeOH, 트라이에틸 아민(triethyl amine), 잔존하는 벤즈알데하이드 다이메틸 아세탈(benzaldehyde dimethyl acetal))을 진공에서 제거하였다. 상기 반응 혼합물에, 상기 브로민화 벤질 유도체(benzyl bromide derivative)(1.5 eq.)-만약 시약이 고체라면, 5-10 ml의 DMF에 선용해된-를 첨가하고, 상기 혼합물을 20분 동안 0℃로 냉각시켰다. 여전히 냉각시키면서, NaH(미네랄오일 상에서 55 % 분산된 것의 0.8 g, 1.3 eq.)가 일부로 첨가되었고, 상기 혼합물이 냉각 하에서 수소 형성이 멈출 때까지 실온에서 2-3 시간동안 교반되었다. 메탄올(methanol (2 ml))이 조심스럽게 첨가되었고, 상기 반응 혼합물이 추가 5분 동안 교반되었다. 상기 반응 혼합물은 100 ml의 DCM과 100 ml의 물 사이에서 나뉘어져, 추출되었다. 상기 물층은 100 ml의 DCM으로 2회 역추출(back-extraction)되었다. 상기 혼합된 유기 상들은 증발되었고, 상기 잔사들은 100 ml의 아세토나이트릴(acetonitrile)용해되어 100 ml의 헥산(hexane)으로 추출되었다. 상기 아세토나이트릴은 증류되어 제거되고, 상기 잔사는 50 ℃에서 아이소프로판올(isopropanol)(10 ml)과 아이소프로필 에테르(isopropyl ether)(50 ml)에 첨가되었다. 상기 투명 용액은 하룻밤(overnight) 동안 20 ℃로 냉각되었다. 상기 수득된 결정들은 TBME로 2회 여과 및 세척되고 건조되었다. 재결정화는 TBME(~50 ml)과 에탄올(~20 ml)의 혼합물로부터 수행될 수 있다.
4-클로로벤질 4',6'-O-벤질리덴-β-락토사이드(4-Chlorobenzyl 4',6'-O-benzylidene-β-lactoside)
수율: 1.71 g
4-메틸벤질 4',6'-O-벤질리덴-β-락토사이드(4-Methylbenzyl 4',6'-O-benzylidene-β-lactoside)
수율: 3.20 g
3-페닐벤질 4',6'-O-벤질리덴-β-락토사이드(3-Phenylbenzyl 4',6'-O-benzylidene-β-lactoside)
수율: 2.70 g
2-나프틸메틸 4',6'-O-벤질리덴-β-락토사이드(2-Naphthylmethyl 4',6'-O-benzylidene-β-lactoside)
수율: 1.77 g
B) 메탄올(10 ml)과 물(0.5 ml)에 혼합된 상기 벤질리덴 아세탈(benzylidene acetal)들(500 mg) 중 하나의 혼합물에, TFA가 실온에서 첨가되었고, 상기 반응 혼합물은 제습 하에서 2-4시간 동안 교반된 다음, 증발되었다. 상기 잔류 물질은 에탄올(ethanol)로 3-4회 동시증발되어, 건조된 후 메탄올(methanol)(~10-35 ml)과 물(~0-2 ml)의 혼합물로부터 재결정화될 수 있는 미가공의 고체를 생성하였다.
4-클로로벤질 β-락토사이드(4-Chlorobenzyl β-lactoside)
수율: 333 mg
13C-NMR (75.1 MHz, D2O): δ = 135.25, 133.67, 130.30, 128.70, 103.00, 101.13, 78.39, 75.44, 74.89, 74.49, 72.88, 72.58, 71.03, 70.83, 68.62, 61.11, 60.13.
4-메틸벤질 β-락토사이드(4-Methylbenzyl β-lactoside)
수율: 439 mg
13C-NMR (75.1 MHz, D2O): δ = 138.91, 133.50, 129.37, 129.07, 103.01, 100.96, 78.43, 75.44, 74.87, 74.52, 72.90, 72.59, 71.47, 71.03, 68.63, 61.11, 60.17, 20.34.
3-페닐벤질 β-락토사이드(3-Phenylbenzyl β-lactoside)
수율: 438 mg
13C-NMR (75.1 MHz, d6-DMSO/d4-MeOH/D2O 8:1:1): δ = 140.29, 140.24, 138.88, 129.13, 129.02, 127.66, 126.88, 126.83, 126.03, 125.90, 103.95, 102.03, 80.76, 75.65, 75.07, 75.00, 73.34, 73.28, 70.66, 69.81, 68.27, 60.56.
2-나프틸메틸 β-락토사이드(2-Naphthylmethyl β-lactoside)
수율: 378 mg
13C-NMR (75.1 MHz, D2O/d6-DMSO): δ = 134.96, 133.24, 133.12, 128.59, 128.31, 128.08, 127.46, 126.98, 126.90, 126.79, 103.26, 101.59, 78.89, 75.62, 75.09, 74.81, 73.14, 72.81, 71.33, 71.14, 68.75, 61.22, 60.39.
C)
10 g(8.13 mmol)의 벤질 2,3,6,2',6'-펜타-O-(4-클로로벤조일)-4'-O-벤조일-β-락토사이드(benzyl 2,3,6,2',6'-penta-O-(4-chlorobenzoyl)-4'-O-benzoyl-β-lactoside)와 10 g(1.6 equiv.)의 메틸 N-트라이클로로아세틸-3,6,2',3',4'6'-헥사-O-아세틸-1-싸이오-락토사미나이드(methyl N-trichloroacetyl-3,6,2',3',4'6'-hexa-O-acetyl-1-thio-lactosaminide)가 아르곤 하에서 35 ml의 건조 CHCl3에 용해되었다. 상기 용액에, 3.7 g의 NIS 및 4990 mg의 AgOTf가 실온에서 첨가되었고, 교반이 약 20분 동안 계속되었다. 트라이에틸 아민(triethyl amine)(5 ml)이 상기 슬러리에 첨가되고, CH2Cl2(500 ml)로 희석된 다음, 2X 싸이오황산 나트륨(sodium thiosulphate) 수용액(10 %)으로 추출되었고, 상기 유기 상은 분리되고, MgSO4로 건조되고, 농축되었고, 상기 시럽(syrup)은 CH2Cl2 : 아세톤(acetone) 98:2 → 95:5의 구배를 이용하여 실리카-겔의 컬럼(column)에서 크로마토그래피되었다. 수율: 12.7 g, 80 %. MS (ESP): 1972.1 [M+Na]+, 1988.1 [M+K]+, 1948.2 [M-H]-, 1984.0 [M+Cl]-. 13C NMR (CDCl3) δ: 101.2, 100.7, 100.0, 98.8 (아노머성 탄소들(anomeric carbons)).
상기와 같이 제조된 사당류 10 g(5.1 mmol)이 MeOH(110 ml)에 용해되었고, pH10이 될 때가지 NaOMe의 용액(MeOH 내 1 M)이 첨가되었다. 상기 용액은 40 ℃에서 5시간 동안 교반된 다음, 앰버라이트 IR 120 H+ 레진(Amberlite IR 120 H+ resin)을 첨가하여 중화되었고, 상기 레진은 여과되었고, 상기 여과물은 건조한 상태로 증발되었다. 상기 잔사는 따듯한 DMF(10 ml)에 용해되었고, iPr2O(150 ml)에 한방울씩 첨가되었고, 상기 현탁액은 추가적인 3시간 동안 교반되었다. 상기 침전물은 여과되었고, iPr2O (2x20 ml)로 세척되었고, 건조되어 생성물로서 4.2g의 황백색(off-white)의 분말을 생성하였다(91%). MS (ESP): 900.1 [M-H]-. 13C NMR (D2O) δ: 105.6, 105.5, 104.2, 103.7 (아노머성 탄소들(anomeric carbons)).
상기와 같이 제조된 4당류의 화합물 25 g이 110 ml의 MeOH 및 110 ml의 KOH(7.5 g) 수용액에 용해되었고, 상기 혼합물은 상온에서 1일 동안 교반되었다. 그런 다음, 상기 혼합물은 아이스-배스(ice-bath)로 냉각되었고, HCl-가스로 중화되었고, 건조한 상태로 농축되었다. 상기 생성되는 미가공의 갈색 유리는 피리딘(pyridine)(150 ml) 및 아세트산 무수물(acetic anhydride)(150 ml)로 실온에서 1일 동안 아세틸화된다. 상기 용액은 농축되고, 상기 시럽은 CH2Cl2에 용해되고, 상기 유기 상은 1M HCl-용액 및 sat. NaHCO3-용액으로 순차 추출되고, MgSO4로 건조되고, 여과된 후, 농축되어, CH2Cl2:아세톤(acetone) 8:2를 용리제로 이용하여 컬럼 크로마토그래피에 주입될 갈색 포말(foam) 4.3 g을 생성하였다. 13C NMR (CDCl3) δ: 101.2, 100.8, 100.4, 99.2 (아노머성 탄소들(anomeric carbons)).
상기와 같에 제조된 과아세틸화된 사당류(peracetylated tetrasaccharide) 140 g(107.5 mmol)이 1.5 L의 MeOH에 용해되었고, NaOMe-용액(1 M)이 pH 10이 될 때까지 첨가되고, 상기 혼합물은 50 ℃에서 하룻밤동안 교반되었다. 상기 생성물은 상기 반응 혼합물로부터 결정화되었다. 상기 혼합물은 실온으로 냉각될 수 있어서, 냉각 및 여과되었고, 상기 여과물은 차가운 EtOH로 세척된 다음, 건조되어 백색 분말의 벤질 β-LNnT(benzyl β-LNnT) 69 g를 생성하였다(86.5 mmol, 80 %). 13C NMR (D2O) δ: 105.6, 105.5, 105.4, 103.6 (아노머성 탄소들(anomeric carbons)). Mp. 284-286 ℃
D) 중성 HMO 벤질/치환된 벤질 배당체의 제조를 위한 일반 과정
선별된 중성 HMO(1 equiv.)이 1-10 부피(g/ml)의 DMF, DMSO 또는 그들의 혼합물에 용해/현탁되었다. 상기 반응 혼합물은 0 ℃로 냉각되고, 브로민화 벤질/치환된 브로민화 벤질(benzyl bromide/substituted benzyl bromide)(1.2-1.4 equiv.)이 첨가되었다. 수소화 나트륨(sodium hydride), 수소화 칼륨(potassium hydride), 수소화 칼슘(calcium hydride), 삼차-부톡시화 칼륨(potassium t-butoxide), 삼차-부톡시화 나트륨(sodium t-butoxide)과 같은 강 염기(1.2-1.4 equiv)가 0-40 ℃에서 첨가되었고, 상기 반응 혼합물이 0-60 ℃에서 6-24시간 동안 교반되었다. 그런 다음, 물이 첨가되어 상기 과량의 염기를 소거하고, 상기 반응 혼합물은 실온에서 30분 동안 교반되었다. 상기 생성되는 반응 혼합물은 농축되고, 역상 크로마토그래피(reverse phase chromatography), 실리카 겔 크로마토그래피(silica gel chromatography), 이온-교환 크로마토그래피(ion-exchange chromatography), 크기-배제 크로마토그래피(size-exclusion chromatography) 등으로 정제되거나 결정화되어 원하는 벤질화된/치환된 벤질화된 중성 HMO 화합물을 70-80 % 수율로 생성하였다.
1-O-β-(4-메틸벤질)-LNnT( 1-O-β-(4-methylbenzyl)-LNnT)
1H NMR (D2O): 7.3 (dd, 4H), 4.88 (d, 1H), 4.7 (m), 4.54 (d, 1H), 4.48 (d, 1H), 4.42 (d, 1H), 4.34 (d), 4.0-3.5 (m), 3.34 (dd, 1H).
13C NMR (D2O): 184.2, 177.6, 173.7, 141.5, 136.1, 131.9, 131.4, 105.6, 105.5, 105.4, 103.6, 93.2, 84.7, 81.5, 81.0, 80.8, 78.0, 77.6, 77.4, 77.1, 75.5, 75.2, 74.8, 74.0, 73.6, 72.9, 63.7, 62.8, 58.9, 56.4, 25.9, 22.9.
1-O-β-(4-클로로벤질)-LNnT(1-O-β-(4-chlorobenzyl)-LNnT)
1H NMR (D2O): 7.4 (s, 4H), 4.9 (d, 1H), 4.72 (m), 4.52 (d, 1H), 4.8 (d, 1H), 4.42 (d, 1H), 4.16 (d, 1H), 4.0-3.52 (m).
13C NMR (D2O): 138.9, 177.6, 138.3, 137.9, 136.2, 131.3, 105.6, 105.5, 105.4, 103.7, 93.2, 86.1, 84.7, 81.5, 81.0, 80.8, 78.0, 77.5, 77.4, 77.2, 77.1, 75.5, 75.2, 74.9, 63.7, 58.9, 57.8, 56.4, 24.8.
1-O-β-벤질-LNT(1-O-β-benzyl-LNT)
1H-NMR (D2O, 400 MHz) δ 2.03 (s, 3H, CH 3CONH), 3.35 (dd, 1H, J = 8.1 8.5 Hz, H-2), 3.49 (m, 1H, H-5``), 3.53 (m, H-2```), 3.65 (m, 1H, H-3```), 3.57 (dd, 1H, J = 8.1 9.0 Hz, H-4``), 3.58 (m, 1H, H-5), 3.59 (dd, 1H, J = 7.7 10.0 Hz, H-2`), 3.62 (m, 1H, H-3), 3.63 (m, 1H, H-4), 3.71 (m, 1H, H-5`), 3.71 (m, 1H, H-5```), 3.73 (dd, 1H, J = 3.3 10.0 Hz, H-3`), 3.76 (m, 2H, H-6ab```), 3.76(m, 2H, H-6ab`), 3.80 (m, 1H, H-6a``), 3.80 (dd, 1H, J = 5.0 12.2 Hz, H-6a), 3.82 (dd, 1H, J = 8.1 10.5 Hz, H-3``), 3.90 (m, 1H, H-6b``), 3.90 (dd, 1H, J = 8.4 10.5 Hz, H-2``), 3.92 (d, 1H, J = 3.3 Hz, H-4```), 3.98 (dd, 1H, J = 1.6 12.2 Hz, H-6b), 4.15 (d, 1H, J = 3.3 Hz, H-4`), 4.44 (d, 1H, J = 7.7 Hz, H-1`), 4.45 (d, 1H, J = 7.7 Hz, H-1```), 4.56 (d, 1H, J = 8.1 Hz, H-1), 4.73 (d, 1H, J = 8.4 Hz, H-1``), 4.76 (d, 1H, J = 11.7 Hz, CH 2Ph), 4.94 (d, 1H, J = 11.7 Hz, CH 2Ph), 7.40-7.50 (m, 5H, Ph).
13C-NMR (D2O, 100 MHz) δ 24.9 ( CH3CONH), 57.4 (C-2``), 62.8 (C-6), 63.2 (C-6``), 63.7 (C-6```), 63.7 (C-6`), 71.0 (C-4`), 71.2 (C-4```), 71.3 (C-4``), 72.7 (C-2`), 73.4 (C-2```), 74.2 (CH2Ph), 75.2 (C-3```), 75.5 (C-2), 77.1 (C-3), 77.5 (C-5`), 77.6 (C-5```), 77.9 (C-5), 78.0 (C-5``), 81.1 (C-4), 84.7 (C-3`), 84.8 (C-3``), 103.7 (C-1), 105.3 (C-1``), 105.6 (C-1`), 106.2 (C-1```), 131.1 (Ph), 131.4 (2C, Ph), 131.5 (2C, Ph), 139.2 (Ph), 177.7 (CH3 CONH).
M.p. 245 ℃(dec.). [α]D 22 = -10.3 (c = 1, H2O).
1-O-β-(4-메틸벤질)-LNT(1-O-β-(4-methylbenzyl)-LNT)
1H-NMR (D2O, 300 MHz) δ 1.97 (s, 3H), 2.29 (s, 3H), 3.27 (dd, 1H, J = 8.1 8.5 Hz), 3.39-3.87 (m, 21H), 3.92 (dd, 1H, J = 1.8 12.3 Hz), 4.09 (d, 1H, J = 3.3 Hz), 4.37 (d, 1H, J = 8.1Hz), 4.38 (d, 1H, J = 7.8 Hz), 4.47 (d, 1H, J = 8.1 Hz), 4.65 (d, 1H, J = 11.7 Hz), 4.67 (d, 1H, J = 8.1 Hz), 4.83 (d, 1H, J= 11.7 Hz), 7.22 (d, 2H, J= 8.1 Hz), 7.30 (d, 2H, J = 8.1 Hz).
3C-NMR (D2O, 75.4 MHz) δ 23.1, 25.0, 57.7, 62.8, 63.2, 63.7, 63.8, 71.0, 71.1, 71.3, 72.7, 73.4, 74.1, 75.2, 75.5, 77.1, 77.5, 77.6, 77.9, 78.0, 81.1, 84.7, 84.8, 103.6, 105.3, 105.7, 106.2, 131.7 (2C), 132.0 (2C), 136.2, 141.5, 177.7.
2. 트랜스-
시알릴화
반응들(
Trans
-
sialylation
reactions
)
일반 과정: 가스가 제거된 배양 완충용액(1.0 ml, 100 mM Tris/HCl, pH 7.5, 50 mg BSA, 0.02 % NaN3) 내의 2-O-(p-나이트로페닐)-α-D-시아로사이드(2-O-(p-nitrophenyl)-α-D-sialoside)(25 μmol)와 적절한 시알릴 수용체(35 μmol)가 23 ℃에서 T. 크루지(T. cruzi)로부터 유래된 재조합 트랜스시알리다아제((80 μl, 1.3 mg/ml)와 함께 24시간 동안 배양(incubatioin)되었다. 상기 반응은 TLC((뷰탄올(butanol)/아세트산(acetic acid)/물 5:2:2))에 의하여 모니터링되었다. 완료 후, 상기 효소들은 변성되었고, 원심분리된 후, 상층액을 동결건조하였다. 상기 건조 잔사는 물에 용해되었고, 물을 이용한 바이오겔 크로마토그래피(biogel chromatography)(P-2 Biogel, 16x900 mm) 또는 역상 크로마토그래피(reverse phase chromatography)로 정제되었다. 상기 수율은 45-85 % 사이에서 변한다.
4-클로로벤질 3'-O-(N-아세틸-뉴라미노실)-β-락토사이드(4-Chlorobenzyl 3'-O-(N-acetyl-neuraminosyl)-β-lactoside)
1H-NMR (500 MHz, D2O): δ [ppm] = 7.46-7.42 (m, 4H, H(a/b)arom); 4.91 (d, 1H, CH2a-Bn); 4.74 (d, 1H, CH2b-Bn); 4.55-4.52 (m, 2H, H-1/H-1'); 4.11 (dd, 1H, H-3'); 2.76 (dd, 1H, H-3''eq); 2.04 (s, 3H, COCH3); 1.81 (dd, 1H, H-3''ax).
J(a,b)- Bn = 11.8; J2' ,3' = 9.9; J3' ,4' = 2.9; J3''ax ,3'' eq = 12.4; J3''ax ,4'' = 12.1; J3''eq,4'' = 4.5 Hz.
13C-NMR (126 MHz, D2O): δ [ppm] = 135.2, 133.5 (quart. Carom); 130.2 (CHa-arom); 128.6 (CHb - arom); 102.6 (C-1'); 101.1 (C-1); 51.7 (C-5''); 39.6 (C-3''); 22.0 (COCH3).
4-메틸벤질 3'-O-(N-아세틸-뉴라미노실)-β-락토사이드(4-Methylbenzyl 3'-O-(N-acetyl-neuraminosyl)-β-lactoside)
1H-NMR (500 MHz, D2O): δ [ppm] = 7.36 (d, 2H, H-aarom); 7.28 (d, 2H, H-barom); 4.89 (d, 1H, CH2a-Bn); 4.72 (d, 1H, CH2b-Bn); 4.54-4.52 (m, 2H, H-1/H-1'); 4.12 (dd, 1H, H-3'); 2.76 (dd, 1H, H-3''eq); 2.35 (s, 3H, CH3-Tol); 2.04 (s, 3H, COCH3), 1.81 (dd, 1H, H-3''ax).
J(a,b)arom = 7.9; J(a,b)- Bn = 11.5; J2' ,3' = 9.9; J3' ,4' = 3.0; J3''ax ,3'' eq = 12.5; J3'',4'' = 12.2; J3''eq ,4'' = 4.6 Hz.
13C-NMR (126 MHz, D2O): δ [ppm] = 138.8, 133.4 (quart. Carom); 129.3 (CHa-arom); 129.0 (CHb - arom); 102.6 (C-1'); 100.9 (C-1); 99.8 (C-2''), 51.7 (C-5''); 39.6 (C-3''); 22.0 (COCH3); 20.2 (CH3-Tol).
2-나프틸메틸 3'-O-(N-아세틸-뉴라미노실)-β-락토사이드(2-Naphthylmethyl 3'-O-(N-acetyl-neuraminosyl)-β-lactoside)
H-NMR (500 MHz, D2O): δ [ppm] = 7.99-7.97 (m, 4H, H-arom); 7.62-7.59 (m, 3H, H-arom); 5.10 (d, 1H, CH2a-Bn); 4.95 (d, 1H, CH2b-Bn); 4.60 (d, 1H, H-1); 4.53 (d, 1H, H-1'); 4.12 (dd, 1H, H-3'); 2.77 (dd, 1H, H-3''eq); 2.04 (s, 3H, COCH3); 1.81 (dd, 1H, H-3''ax).
J(a,b)- Bn = 11.9; J1 ,2 = 8.0; J1' ,2' = 7.9; J2' ,3' = 9.9; J3' ,4' = 3.0; J3''ax ,3'' eq = 12.5; J3''ax , 4'' = 12.1; J3''eq , 4'' = 4.6 Hz.
13C-NMR (126 MHz, D2O): δ [ppm] = 128.3, 127.9, 127.7, 127.6, 126.6, 126.5 (CHarom); 102.6 (C-1'); 101.1 (C-1); 51.7 (C-5''); 22.0 (COCH3).
3-페닐벤질 3'-O-(N-아세틸-뉴라미노실)-β-락토사이드(3-Phenylbenzyl 3'-O-(N-acetyl-neuraminosyl)-β-lactoside)
1H-NMR (400 MHz, D2O): δ [ppm] = 7.75-7.44 (m, 9H, H-arom); 4.99 (d, 1H, CH2a-Bn); 4.82 (d, 1H, CH2b-Bn); 4.57 (d, 1H, H-1); 4.53 (d, 1H, H-1'); 4.13 (dd, 1H, H-3'); 2.78 (dd, 1H, H-3''eq); 2.05 (s, 3H, COCH3); 1.82 (dd, 1H, H-3''ax).
J(a,b)- Bn = 11.8; J1 ,2 = 8.0; J1' ,2' = 7.9; J2' ,3' = 9.9; J3' ,4' = 3.1; J3''ax ,3'' eq = 12.5; J3''ax ,4'' = 12.0; J3''eq ,4'' = 4.6 Hz.
13C-NMR (100 MHz, D2O): δ [ppm] = 140.9, 140.3, 137.5 (quart. Carom); 129.4, 129.2, 127.9, 127.8, 127.1, 127.0, 126.9, (CHarom); 102.7 (C-1'); 101.2 (C-1); 99.6 (C-2''); 51.8 (C-5''); 39.7 (C-3''); 22.1 (COCH3).
벤질 3'''-O-(N-아세틸-뉴라미노실)-β-LNnT(Benzyl 3'''-O-(N-acetyl-neuraminosyl)-β-LNnT)
1H-NMR (400 MHz, D2O): 도 1 참조.
<110> Glycom A/S
<120> Synthesis of New Sialooligosaccharide Derivatives
<130> 2013-FPA-5729
<150> GB1012036.8
<151> 2010-07-16
<150> EP11166036.1
<151> 2011-05-13
<160> 11
<170> BiSSAP 1.0
<210> 1
<211> 393
<212> PRT
<213> Bifidobacterium longum subsp. infantis ATCC 15697
<400> 1
Met Thr Glu Asn Gly Met Met Asn Thr Asn Asn Thr Val Cys Gly Ala
1 5 10 15
Asn His Asp Gly Ala Met Ser Leu Ala Ala Pro Gly Asp Tyr Gly Val
20 25 30
Ala Cys Tyr Arg Ile Pro Ala Leu Ala Glu Ala Pro Asn Gly Trp Ile
35 40 45
Leu Ala Ala Phe Asp Ala Arg Pro His Asn Cys Gln Asp Ala Pro Gln
50 55 60
Ala Asn Ser Ile Val Gln Arg Ile Ser Lys Asp Gly Gly Arg Ser Phe
65 70 75 80
Glu Pro Gln His Val Val Ala Ala Gly His Asp Gly Val Asp Lys Tyr
85 90 95
Gly Tyr Ser Asp Pro Ser Tyr Val Val Asp Arg Gln Thr Gly Glu Val
100 105 110
Phe Leu Phe Phe Val Lys Ser Tyr Asp Ala Gly Phe Gly Thr Ser Gln
115 120 125
Ala Gly Val Asp Pro Ser Ala Arg Glu Val Leu Gln Ala Ala Val Thr
130 135 140
Ser Ser Ile Asp Asn Gly Val Thr Trp Ser Glu Pro Arg Ile Ile Thr
145 150 155 160
Ala Asp Ile Thr Asn Ser Glu Ser Trp Ile Ser Arg Phe Ala Ser Ser
165 170 175
Gly Ala Gly Ile Gln Leu Thr Tyr Gly Glu His Ala Gly Arg Leu Ile
180 185 190
Gln Gln Tyr Thr Ile Lys Glu Leu Asp Gly Arg Tyr Arg Ala Val Ser
195 200 205
Val Phe Ser Asp Asp His Gly Ala Thr Trp His Ala Gly Thr Pro Val
210 215 220
Gly Asp His Met Asp Glu Asn Lys Val Val Glu Leu Ser Asp Gly Arg
225 230 235 240
Val Met Leu Asn Ser Arg Ser Ser Asp Gly Asn Gly Cys Arg Tyr Val
245 250 255
Ala Ile Ser Arg Asp Gly Gly Ala Thr Tyr Gly Pro Val Ile Arg Glu
260 265 270
Thr Gln Leu Pro Asp Pro Glu Asn Asn Ala Gln Ile Ala Arg Ala Phe
275 280 285
Pro Asp Ala Pro Glu Gly Ser Ala Gln Ala Lys Val Leu Leu Tyr Ser
290 295 300
Ser Ser Ser Pro Ser Asp Arg Ile Asp Gly Leu Val Arg Val Ser Ile
305 310 315 320
Asp Asp Gly Lys Thr Trp Ser Ala Gly Arg Arg Phe Thr Thr Gly Pro
325 330 335
Met Ala Tyr Ser Val Ile Ala Ala Leu Ser His Lys Ala Gly Gly Gly
340 345 350
Tyr Gly Leu Leu Tyr Glu Gly Asp Asn Asn Asn Ile Met Tyr Thr Arg
355 360 365
Ile Ser Leu Asp Trp Leu Asn Gly Gln Leu Asn Val Asp Gly Ile Gly
370 375 380
Gly Phe Pro Leu Ser Gly Glu Gly Gly
385 390
<210> 2
<211> 760
<212> PRT
<213> Bifidobacterium longum subsp. infantis ATCC 15697
<400> 2
Met Ala Ala Ser Asn Pro Ile Ser Trp Ser Gln Arg Thr Phe Pro Ser
1 5 10 15
Pro Glu Gly Thr Ile Ala Cys Arg Phe Arg Ala His Ala Asp Gly Arg
20 25 30
Ile Phe Asp Ala Val Asn Gly Ser Ala Asn Asp Ala Pro Leu Leu Ile
35 40 45
Cys Ala Ile Glu His Asp Ala Leu Arg Val Arg Ala Thr Thr Pro Arg
50 55 60
Gln His Val Asp Phe Asp Ile Glu Asp Thr Thr Gly Ile Ala Asp Gly
65 70 75 80
Ala Met His Thr Phe Ala Leu Thr Phe Gly Glu Phe Gly Thr Arg Val
85 90 95
Tyr Leu Asp Gly Ser Gln Cys Phe Ser Gly Thr Ala Asn Leu Cys Pro
100 105 110
Thr Thr Leu Thr Gly Thr Glu Gly Ser Gly Gln Gly Ala Ile Arg Leu
115 120 125
Ala Gly Pro Ser Ile Asp Val Thr Asp Met Arg Leu His Ala Ile Pro
130 135 140
Leu Thr Ser Glu Ser Ile Ala Ala Leu Thr Pro Arg Pro Ala Pro Asp
145 150 155 160
Ile Asp Phe Ala Ala Ala Gln Leu Ala Pro Arg Asp Val Arg Arg Val
165 170 175
Arg Thr Leu Arg Ser Gly Thr Ile Phe Met His Phe Arg Val Arg Gly
180 185 190
Pro Arg Gln Tyr Gly Thr Leu Leu Ala Ala Gly Glu Arg Gly Glu Glu
195 200 205
Arg Leu Ala Val Ser Ile Asp Asp Asn Gly Ile Thr Met Thr Ala Ala
210 215 220
Asp Gly Leu Tyr Glu Pro Ser Thr Tyr His Ala Arg Gly Ala Trp Asp
225 230 235 240
Asp Gly Arg Trp His Asp Leu Ser Ile Arg Ser Ala Arg Gly Ala Ile
245 250 255
Asp Met Tyr Val Asp Gly Trp His Glu Leu His Gln Ala Gly Gln Val
260 265 270
Phe Phe Gly Asp Trp Pro Gln Leu Asp Glu Val Ala Ile Gly Gln Asn
275 280 285
Thr Glu Gly Val Arg Leu Met Gly Glu Val Arg Asn Gly Gly Val Phe
290 295 300
Thr Thr Pro Leu Thr Asp Gly Ala Ile Arg Arg Leu Ser Asp Ala Pro
305 310 315 320
Ala Leu Thr Thr Thr Ala Leu Phe Asp Lys Gly Tyr His Gly Ser Val
325 330 335
Ser Tyr Arg Ile Pro Ser Ile Ile Arg Thr Pro His Gly Val Val Val
340 345 350
Ala Gly Ala Asp Gln Arg Thr Ala Ile Ala Asn Asp Ala Pro Asn His
355 360 365
Ile Asn Phe Val Met Arg Arg Ser Leu Asp Gly Gly Arg Thr Trp Leu
370 375 380
Asp Met Gln Thr Val Ile Ala Asn Pro Gly Glu Gly Val Asp Gly Ala
385 390 395 400
Cys Thr Ile Asp Ser Cys Leu Val Cys Asp Glu Arg Asn Gly Arg Leu
405 410 415
Thr Val Leu Ile Asp Arg Phe Ala Gly Gly Val Gly Leu Pro Asn Asn
420 425 430
Thr Pro Gly Thr Gly Val Asp Arg His Gly Arg Pro Cys Leu Tyr Asp
435 440 445
Arg Ala Gly Thr Arg Tyr Val Leu Ala Asp Asp Gly Thr Val Leu Asp
450 455 460
Gly Gly Gly Glu Arg Thr Gly Tyr Arg Val Asp Ala His Gly Asn Val
465 470 475 480
Thr His Glu Gly Arg Ala Ser Gly Asn Ile Tyr Leu Lys Glu Gly Ala
485 490 495
Asp Pro Asp Glu Ser Leu Leu Ile Glu Arg Thr Ser Phe Ile Ile Glu
500 505 510
Leu His Ser Asp Asp Asp Gly Glu Thr Trp Ser Thr Pro Arg Asn Ile
515 520 525
Asn His Met Ile Lys Glu Asp Trp Met His Phe Leu Gly Val Ser Pro
530 535 540
Gly Asn Gly Ile Gln Leu Gln Ala Ser Glu His Arg Gly Arg Leu Leu
545 550 555 560
Val Pro Phe Tyr Cys Thr Gly Ala Ser Leu Lys His Tyr Ser Gly Gly
565 570 575
Ala Leu Ile Ser Asp Asp Gly Gly Asp Thr Trp Arg Arg Gly Ser Met
580 585 590
Ile Asn Asp Gly Arg Ile Val Asn Gly Thr Ala Val Asp Pro Lys Asn
595 600 605
Ile Arg Asp Asp Asp Ala Thr Thr His Glu Ser Val Phe Val Glu Arg
610 615 620
Ala Asp Gly Thr Val Val Cys Phe Phe Arg Asn Gln Asn His Ala Gly
625 630 635 640
Arg Ile Gly Val Ala Leu Ser His Asp Gly Gly Glu Thr Trp Asp Asp
645 650 655
Leu Tyr Phe Asp Lys Asp Val Pro Asp Ile Phe Cys Gln Pro Asn Ala
660 665 670
Val Ala Cys Ala Pro Arg Ser Asp Thr Met Val Phe Ala Asn Ala Ser
675 680 685
Gln Met Leu Pro Tyr Arg Gly Asn Gly Val Leu Arg Leu Ser Leu Asp
690 695 700
Gly Ala Arg Thr Trp Ala Ala His Arg Cys Ile Asn Pro Tyr His Tyr
705 710 715 720
Gly Tyr Gln Cys Met Thr Met Leu Pro Asp Gly Glu Leu Gly Leu Leu
725 730 735
Trp Glu Arg Glu Thr Ala Gly Leu Tyr Phe Thr Thr Leu Pro Leu Ser
740 745 750
Val Phe Gly Ala Ala Glu Thr His
755 760
<210> 3
<211> 836
<212> PRT
<213> Bifidobacterium bifidum
<400> 3
Met Val Arg Ser Thr Lys Pro Ser Leu Leu Arg Arg Leu Gly Ala Leu
1 5 10 15
Val Ala Ala Ala Ala Met Leu Val Val Leu Pro Ala Gly Val Ser Thr
20 25 30
Ala Ser Ala Ala Ser Asp Asp Ala Asp Met Leu Thr Val Thr Met Thr
35 40 45
Arg Thr Asp Thr Leu Gly Asp Glu Val Tyr Val Gly Asp Thr Leu Thr
50 55 60
Tyr Ser Phe Thr Asn Thr Asn Asn Thr Ser Ser Ala Phe Thr Ala Phe
65 70 75 80
Pro Ala Glu Ser Asn Leu Ser Gly Val Leu Thr Thr Gly Thr Pro Asn
85 90 95
Cys Arg Tyr Glu Asn Leu Ala Gly Gly Ala Ser Tyr Pro Cys Ser Thr
100 105 110
Ala Ser His Thr Ile Thr Ala Asp Asp Leu Thr Ala Gly Ser Phe Thr
115 120 125
Pro Arg Thr Val Trp Lys Ala Thr Ser Asp Arg Gly Gly Thr Gln Val
130 135 140
Leu Gln Asp Asn Ile Val Ser Thr Gly Asp Thr Val Thr Val Lys Glu
145 150 155 160
Gly Lys Arg Pro Asp Pro Ala Thr Ile Pro Thr Asp Arg Ala Asp Gly
165 170 175
Glu Ala Val Arg Leu Ala Thr Ala Arg Gln Asn Leu Gly Thr Glu Cys
180 185 190
Tyr Arg Ile Pro Ala Leu Ala Glu Ala Pro Asn Gly Trp Ile Leu Ala
195 200 205
Ala Phe Asp Gln Arg Pro Asn Thr Ala Met Ala Asn Gly Ser Gly Val
210 215 220
Lys Cys Trp Asp Ala Pro Gln Pro Asn Ser Ile Val Gln Arg Ile Ser
225 230 235 240
Lys Asp Gly Gly Lys Ser Trp Thr Pro Ile Gln Tyr Val Ala Gln Gly
245 250 255
Lys Asn Ala Pro Glu Arg Tyr Gly Tyr Ser Asp Pro Ser Tyr Val Val
260 265 270
Asp Glu Glu Thr Gly Glu Ile Phe Leu Phe Phe Val His Ser Tyr Asn
275 280 285
Lys Gly Phe Ala Asp Ser Gln Leu Gly Val Asp Glu Ser Asn Arg Asn
290 295 300
Val Leu His Ala Val Val Val Ser Ser Lys Asp Asn Gly Glu Thr Trp
305 310 315 320
Ser Lys Pro Arg Asp Ile Thr Ala Asp Ile Thr Lys Gly Tyr Glu Asn
325 330 335
Glu Trp Lys Ser Arg Phe Ala Thr Ser Gly Ala Gly Ile Gln Leu Lys
340 345 350
Tyr Gly Lys Tyr Lys Gly Arg Leu Ile Gln Gln Tyr Ala Val Gly Arg
355 360 365
Thr Thr Gly Ser Asn Ala Ala Val Ser Val Tyr Ser Asp Asp His Gly
370 375 380
Lys Thr Trp Gln Ala Gly Asn Pro Val Thr Gly Met Leu Met Asp Glu
385 390 395 400
Asn Lys Val Val Glu Leu Ser Asp Gly His Val Met Leu Asn Ser Arg
405 410 415
Pro Gly Asn Gly Ser Gly Tyr Arg Arg Val Ala Ile Ser Glu Asp Gly
420 425 430
Gly Val Asn Tyr Gly Thr Val Lys Asn Glu Thr Gln Leu Pro Asp Pro
435 440 445
Asn Asn Asn Ala His Ile Thr Arg Ala Phe Pro Asn Ala Pro Glu Gly
450 455 460
Ser Ala Lys Ala Lys Val Leu Leu Tyr Ser Ser Pro Arg Ala Asn Asn
465 470 475 480
Glu Gly Arg Ala Asn Gly Val Val Arg Ile Ser Leu Asp Asp Gly Thr
485 490 495
Thr Trp Ser Ser Gly Lys Leu Tyr Lys Ala Gly Ser Met Ala Tyr Ser
500 505 510
Val Ile Thr Ala Leu Ser Gly Ala Ala Gly Gly Gly Tyr Gly Leu Leu
515 520 525
Tyr Glu Gly Ala Trp Val Thr Gly Gly Gly Ile Asp Ser His Asp Ile
530 535 540
Met Tyr Thr His Ile Ser Met Asp Trp Leu Gly Tyr Leu Ser Ala Thr
545 550 555 560
Ala Asp Asp Val Thr Ala Ser Val Glu Glu Gly Ala Ser Thr Val Asp
565 570 575
Val Thr Val Pro Val Ser Asn Val Gly Ser Val Asp Tyr Thr Gly Val
580 585 590
Thr Val Thr Pro Ala Asp Leu Pro Thr Gly Trp Ser Ala Ser Pro Val
595 600 605
Asn Val Gly Ala Leu Ala Ser Gly Ala Ser Lys Asp Val Thr Val Thr
610 615 620
Val Asn Val Pro Ala Thr Ala Lys Lys Asp Asp Val Ala Lys Ile Val
625 630 635 640
Leu Arg Val Thr Gly Thr Ser Ala Ala Asn Ala Asp Ala Thr Thr Gly
645 650 655
Phe Asp Gly Ser Ile Thr Val Asn Val Thr Glu Lys Ser Glu Pro Asp
660 665 670
Pro Glu Pro Glu Pro Thr Ile Thr Gly Val Ser Ala Val Thr Ser Gln
675 680 685
Ala Gly Val Lys Val Gly Asp Val Phe Asp Ala Ser Lys Val Ser Val
690 695 700
Thr Ala Ala Met Ser Asp Gly Ser Ser Lys Ala Leu Ala Ala Gly Glu
705 710 715 720
Tyr Ser Leu Ser Ala Val Asp Ala Asp Gly Lys Ala Val Asp Leu Ala
725 730 735
Glu Pro Phe Ala Ala Ala Gly Val Val Thr Val Thr Val Ser Val Pro
740 745 750
Val Glu Gly Ala Asp Pro Leu Thr Ala Ser Phe Thr Ile Asp Val Ala
755 760 765
Glu Lys Ser Ala Asp Pro Glu Pro Lys Pro Glu Pro Glu Pro Lys Pro
770 775 780
Glu Pro Glu Lys Pro Ala Gly Pro Lys Val Asp Val Pro Thr Glu Lys
785 790 795 800
Pro Gly Leu Ser Lys Thr Gly Ala Ser Thr Ala Gly Met Ser Ile Val
805 810 815
Phe Val Leu Leu Ala Leu Ser Gly Val Ala Ala Leu Ser Leu Arg Arg
820 825 830
Arg Ser Val His
835
<210> 4
<211> 1795
<212> PRT
<213> Bifidobacterium bifidum
<400> 4
Met Thr Thr Ile Phe Arg Arg Ala Thr Ala Lys Thr Leu Met Arg Lys
1 5 10 15
Leu Ser Gly Leu Leu Val Ala Ile Ala Met Leu Ala Val Leu Pro Ala
20 25 30
Gly Thr Ile Ser Ala Asn Ala Ala Asp Glu Pro Pro Gln Glu Tyr Leu
35 40 45
Gln Leu Thr Leu Thr Arg Thr Asp Ser Asn Gly Thr Pro Ala Glu Val
50 55 60
Gly Asp Lys Leu Thr Tyr Ser Leu Gly Tyr Lys Asn Val Ser Asp Thr
65 70 75 80
Gly Phe Ile Val His Pro Thr Ala Ser Asn Leu Asn Asn Val Ala Thr
85 90 95
Pro Gln Ser Ala Ser Asn Pro Asn Pro Met Cys Arg Trp Gly Asn Leu
100 105 110
Ala Ala Gly Ala Ser Ala Ala Cys Thr Trp Ser Ala Ser Lys Glu Phe
115 120 125
Ala Tyr His Val Val Thr Glu Asp Asp Val Ala Asn Gly Phe Thr Pro
130 135 140
Thr Ala Thr Val Ser Ala Thr Thr Gln Asp Gly Thr Asn Gly Val Leu
145 150 155 160
Gln Ser Val Asp Ile Thr Gly Glu Thr Val Pro Ala Val Pro Ala Thr
165 170 175
Ser Thr Leu Arg Val Ala Met Gln Arg Thr Asp Thr Leu Gly Asp Asn
180 185 190
Val Lys Ile Gly Asp Arg Leu Thr Phe Asn Phe Thr Tyr Thr Asn Lys
195 200 205
Thr Ala Gln Lys Ile Tyr Ala Tyr Pro Ser Glu Ser Asn Ile Glu Arg
210 215 220
Val Asp Val Val Ser Phe Pro Arg Asn Ser Cys Arg Ser Gly Val Glu
225 230 235 240
Ala Asn Gln Thr Ala Ser Cys Gly Phe Ala Tyr His Val Ile Thr Ala
245 250 255
Glu Asp Val Val Ala Arg Arg Tyr Thr Pro Thr Ala Thr Phe Arg Ala
260 265 270
Thr Ser Asp Arg Asp Gly Thr Gln Val Leu Gln Asp Asp Met Thr Phe
275 280 285
Thr Thr Gly Thr Val Thr Val Ala Gly Pro Ala Asp Asp Ala Ala Ser
290 295 300
Thr Pro Thr Glu Arg Lys Asp Gly Glu Pro Leu Leu Leu Ala Thr Asn
305 310 315 320
Lys Gln Ile Gly Asn Thr Asp Tyr Tyr Arg Ile Pro Ala Ile Ala Gln
325 330 335
Ala Pro Asn Gly Trp Ile Leu Ala Ala Trp Asp Leu Arg Pro Lys Leu
340 345 350
Ala Ala Asp Ala Pro Asn Pro Asn Ser Ile Val Gln Arg Ile Ser Lys
355 360 365
Asp Gly Gly Lys Ser Trp Glu Thr Leu Ala Tyr Val Ala Gln Gly Arg
370 375 380
Ser Ala Thr Asn Lys Tyr Gly Tyr Ser Asp Pro Ser Tyr Val Val Asp
385 390 395 400
Glu Glu Ala Gly Lys Ile Phe Leu Phe Cys Val Lys Ser Tyr Asp Gln
405 410 415
Gly Tyr Phe Gly Ser Val Leu Gly Val Glu Asp Ala Arg Asn Val Leu
420 425 430
Gln Ala Val Val Met Glu Ser Asp Asp Asn Gly Ala Thr Trp Ser Glu
435 440 445
Pro Arg Asn Ile Thr Lys Asp Ile Thr Lys Gly His Glu Asp Glu Trp
450 455 460
Lys Ser Arg Phe Ala Ser Ser Gly His Gly Ile Gln Leu Lys Tyr Gly
465 470 475 480
Gln Tyr Lys Gly Arg Leu Ile Gln Gln Tyr Ala Val Arg Thr Thr Ser
485 490 495
Asn Thr Asn Ile Ala Val Ser Val Tyr Ser Asp Asp His Gly Lys Thr
500 505 510
Trp Lys Ala Gly Asn Pro Val Thr Glu Val Asn Met Asp Glu Asn Lys
515 520 525
Val Val Glu Leu Ser Asp Gly Arg Val Met Leu Asn Ser Arg Pro Gly
530 535 540
Ala Ala Gly Tyr Arg Arg Val Ala Ile Ser Glu Asp Gly Gly Val Asn
545 550 555 560
Tyr Gly Pro Ile Lys Ser Glu Thr Gln Leu Pro Asp Pro Asn Asn Asn
565 570 575
Ala Gln Ile Thr Arg Ala Phe Pro Asn Ala Pro Glu Gly Ser Ala Lys
580 585 590
Ala Lys Val Leu Leu Tyr Ser Ala Pro Arg Ala Ser Asn Glu Gly Arg
595 600 605
Ala Asn Gly Val Val Arg Val Ser Phe Asp Asp Gly Thr Thr Trp Ser
610 615 620
Ala Gly Lys Leu Phe Lys Ala Gly Ser Met Ala Tyr Ser Val Ile Thr
625 630 635 640
Ala Leu Asn Asp Ala Ala Gly Gly Gly Tyr Gly Leu Leu Tyr Glu Gly
645 650 655
Glu Ser Ile Thr Tyr Thr Arg Ala Ser Met Glu Trp Leu Gly Tyr Leu
660 665 670
Thr Ala Thr Ala Ser Gly Thr Ala Ala Val Lys Glu Gly Glu Gly Thr
675 680 685
Leu Thr Ala Pro Val Thr Val Thr Asn Asp Gly Leu Thr Asp Tyr Thr
690 695 700
Asn Val Thr Val Thr Pro Thr Gly Leu Pro Ser Gly Trp Ser Ala Glu
705 710 715 720
Ala Val Asn Val Gly Asn Leu Ala Ala Gly Gln Ser Ala Thr Val Asn
725 730 735
Val Pro Val Thr Val Pro Ala Ala Ala Val Ser Gly Thr Val Ala Lys
740 745 750
Ala Thr Met Lys Ile Thr Gly Lys Tyr Ala Gln Ser Glu Asp Thr Leu
755 760 765
His Ser Phe Ala Glu Gly Glu Leu Ala Val Thr Val Thr Asp Pro Asp
770 775 780
Pro Ala Ala Lys Arg Leu Lys Leu Thr Ile Glu Arg Thr Asp Asp Asn
785 790 795 800
Gly Asp Pro Val Lys Val Gly Asp Thr Leu Thr Tyr Arg Ile Thr Tyr
805 810 815
Glu Asn Val Gly Thr Gln Ser Phe Ala Val Tyr Pro Arg Glu Ser Asn
820 825 830
Leu Asp Gly Val Thr Thr Pro Gln Ser Ala Ser Asn Pro Ala Pro Val
835 840 845
Cys Arg Trp Ser Arg Leu Ala Pro Gly Ala Thr Gly Ala Cys Val Ser
850 855 860
Gly Asn Gly Lys Gln Leu Ala Tyr His Thr Val Thr Glu Ala Asp Ala
865 870 875 880
Thr Ala Gly Ser Phe Thr Pro Ser Ala Thr Ile Asp Ala Thr Ala Asp
885 890 895
Ala Ser Gly Glu Thr Val Leu Glu Ser Val Ser Ile Thr Gly Asp Pro
900 905 910
Val Thr Val Ser Gln Pro Val Glu Leu Pro Ala Asp Ile Ala Ala Trp
915 920 925
Lys Thr Arg Asn Glu Ala Leu Ala Asp Trp Gln Thr Leu Ser Glu Lys
930 935 940
Leu Ala Lys Thr Asp Arg Ile Asn Trp Leu Phe Thr Gly Asp Ser Ile
945 950 955 960
Thr His Gly Val Gln Phe Thr Arg Gly Tyr Arg Thr Tyr Ser Glu Leu
965 970 975
Phe Ala Asn His Leu Asp Thr Ala Ser Val Arg Gly Val Ser Arg Ala
980 985 990
Asn Asp Val Val Met Asn Thr Gly Ile Ser Ser Ala Asp Ala Ser Trp
995 1000 1005
Pro Leu Lys Asp Gly Ala Phe Glu Lys Trp Val Ser Asp Lys His Pro
1010 1015 1020
Asp Val Val Phe Leu Thr Phe Gly Met Asn Asp Gly Arg Thr Gly Gln
1025 1030 1035 1040
Ala Phe Thr Val Asp Gln Tyr Thr Ala Asn Leu Ser Thr Leu Ile Asp
1045 1050 1055
Lys Ile Arg Asp Leu Gly Ala Ile Pro Val Leu Gln Thr Gln Asn Tyr
1060 1065 1070
Thr Thr Asn Thr Thr Phe Asn Ala Asn Leu Asp Thr Tyr Phe Asp Ala
1075 1080 1085
Glu Arg Arg Leu Ala Leu Asp Lys Asn Val Leu Leu Val Asp Phe Asn
1090 1095 1100
Lys Gln Trp Leu Glu Leu Gly Gly Gly Asn Arg Glu Ser Gly Thr Tyr
1105 1110 1115 1120
Met Gly Ala Gly Asn Asp Ile His Pro Gly Glu Asn Gly His Ile Glu
1125 1130 1135
Trp Ala Lys Phe Thr Leu Gly Ala Leu Asn Met Ile Ala Asn Asp Asp
1140 1145 1150
Pro Leu Ala Arg Trp Ser Ser Ser Asp Thr Thr Leu Asp Lys Pro Thr
1155 1160 1165
Val Thr Val Asp Ala Asp Gly Asn Gly Leu Lys Gly Ser Asp Gly Leu
1170 1175 1180
Glu Pro Ala Pro Ala Ala Ala Lys Ser Val Gly Lys Phe Leu Ser Gly
1185 1190 1195 1200
Ala Gln Tyr Val Asp Leu Gly Gly Asp Val Val Ser Ala Val Ala Gly
1205 1210 1215
Lys Arg Glu Ser Asn Val Thr Ile Arg Phe Arg Ala Ser Ala Thr Gly
1220 1225 1230
Gln Pro Gln Thr Leu Phe Ser Leu Gly Asp Ser Asp Ser Ala Thr Arg
1235 1240 1245
Ala Thr Val Arg Leu Ser Ala Thr Gly Leu Val Gln Phe Leu Asn Ser
1250 1255 1260
Gly Asn Thr Gly Asp Phe Tyr Thr Val Gly Thr Asn Asp Leu Ala Asp
1265 1270 1275 1280
Gly Ala Trp His Thr Val Ser Val Asn Phe Val Ala Asn Gly Phe Thr
1285 1290 1295
Ile Tyr Val Asp Gly Ala Ala Met Arg Ala Ile Ser Gly Gly Ala Gly
1300 1305 1310
Thr Gln Leu Asn Val Pro Gly Ala Ile Thr Val Asn Thr Ala Thr Ala
1315 1320 1325
Gly Ala Ile Arg Gly Ala Asp Ser Ala Gly Gly Ala Gln Gln Leu Thr
1330 1335 1340
Gly Ile Val Asp Tyr Val Ala Ala Trp Ser Arg Thr Leu Thr Asp Ala
1345 1350 1355 1360
Glu Ala Lys Arg Ile Ser Ala Glu Thr Ser Ala Val Ala Val Thr Lys
1365 1370 1375
Val Asp Ala Ala Val Asn Ala Leu Gln Pro Ile Ile Ser Asp Thr Gly
1380 1385 1390
Ala Arg Lys Asn Ile Val Phe Val Gly Gly Glu Thr Ile Glu Gly Gly
1395 1400 1405
Tyr Thr Asp His Leu Ile Ala Lys Asn Ile Val Gln Leu Leu Asp Glu
1410 1415 1420
Arg Val Arg Trp Glu Tyr Val Thr Gly Leu Ser Ala Thr Asp Arg Glu
1425 1430 1435 1440
Arg Gln Arg Ala Lys Phe Phe Val Ala Ala Gly Gln Gly Gly Leu Thr
1445 1450 1455
Ala Lys Gln Met Asp Glu Asp Tyr Ala Ala Met Val Gly Glu Tyr Ser
1460 1465 1470
Pro Asp Ile Leu Phe Leu Ala Pro Asp Leu Tyr Asp Ala Asp Gly Asn
1475 1480 1485
Leu Ala Glu Ser Ala Ala Ala Ala Phe Ala Gly His Ile Arg Ser Val
1490 1495 1500
Ala Ala Lys Ala Lys Glu Ala Gly Ala Lys Val Val Leu Val Thr Pro
1505 1510 1515 1520
Val Thr Val Arg Gly Gly Glu Asp Glu Tyr Ala Gly Ala Met Arg Thr
1525 1530 1535
Val Ala Lys Glu Asp Asp Leu Pro Leu Ile Asp Ala Gln Ala Trp Ile
1540 1545 1550
Gly Lys Val Val Ala Ala Asp Ala Ser Val Lys Thr Ala Trp Phe Asn
1555 1560 1565
Lys Ala Gly Gln Leu Asn Tyr Ala Gly His Leu Gly Tyr Ala Arg Phe
1570 1575 1580
Met Met Arg Ser Leu Asp Leu Tyr Pro Ser Asn Val Ser Gly Ser Arg
1585 1590 1595 1600
Ile Ala Ser Leu Pro Tyr Asp Thr Ala Asn Val Thr Leu Val Gly Ala
1605 1610 1615
Ser Glu Asn Gly Gly Glu Leu Pro Val Gly Arg Val Glu Gly Thr Asp
1620 1625 1630
Arg Ala His Ile Asp Thr Met Gln Ile Gly Ala Ala Ala Ser Leu Val
1635 1640 1645
Val Val Asp Ser Tyr Ala Val Tyr Glu Ile Gly Glu Asp Gly Gly Arg
1650 1655 1660
Thr Leu Val Ala Asp Gly Leu Lys Pro Ala Asp Val Leu Ala Asp Gly
1665 1670 1675 1680
Ile Asp Val Thr Val Asn Asp Thr Ala Ala His Arg Tyr Glu Val Val
1685 1690 1695
Gly Ser Ala Asn Val Pro Glu Gly Ala Asp Ala Val Thr Val Thr Tyr
1700 1705 1710
Thr Ala Thr Leu Ala Ala Val Glu Glu Pro Glu Pro Gly Pro Asp Pro
1715 1720 1725
Asp Pro Thr Pro Asp Pro Ser Glu Lys Pro Asp Gly Asp Gly Thr Gly
1730 1735 1740
Asp Gly Thr Gly Ala Gly Thr Gly Asp Val Gln Lys Pro Thr Pro Asp
1745 1750 1755 1760
Ala Val Ala Lys Thr Gly Ala Asp Val Phe Gly Leu Leu Thr Ala Val
1765 1770 1775
Ala Ala Leu Leu Ala Ala Gly Gly Val Thr Leu Ser Leu Arg Arg Arg
1780 1785 1790
Ala Asn Arg
1795
<210> 5
<211> 1795
<212> PRT
<213> Bifidobacterium bifidum
<400> 5
Met Thr Thr Ile Phe Arg Arg Ala Thr Ala Lys Thr Leu Met Arg Lys
1 5 10 15
Leu Ser Gly Leu Leu Val Ala Ile Ala Met Leu Ala Val Leu Pro Ala
20 25 30
Gly Thr Ile Ser Ala Asn Ala Ala Asp Glu Pro Pro Gln Glu Tyr Leu
35 40 45
Gln Leu Thr Leu Thr Arg Thr Asp Ser Asn Gly Thr Pro Ala Glu Val
50 55 60
Gly Asp Lys Leu Thr Tyr Ser Leu Gly Tyr Lys Asn Val Ser Asp Thr
65 70 75 80
Gly Phe Ile Val His Pro Thr Ala Ser Asn Leu Asn Asn Val Ala Thr
85 90 95
Pro Gln Ser Ala Ser Asn Pro Asn Pro Met Cys Arg Trp Gly Asn Leu
100 105 110
Ala Ala Gly Ala Ser Ala Ala Cys Thr Trp Ser Ala Ser Lys Glu Phe
115 120 125
Ala Tyr His Val Val Thr Glu Asp Asp Val Ala Asn Gly Phe Thr Pro
130 135 140
Thr Ala Thr Val Ser Ala Thr Thr Gln Asp Gly Thr Asn Gly Val Leu
145 150 155 160
Gln Ser Val Asp Ile Thr Gly Glu Thr Val Pro Ala Val Pro Ala Thr
165 170 175
Ser Thr Leu Arg Val Ala Met Gln Arg Thr Asp Thr Leu Gly Asp Asn
180 185 190
Val Lys Ile Gly Asp Arg Leu Thr Phe Asn Phe Thr Tyr Thr Asn Lys
195 200 205
Thr Ala Gln Lys Ile Tyr Ala Tyr Pro Ser Glu Ser Asn Ile Glu Arg
210 215 220
Val Asp Val Val Ser Phe Pro Arg Asn Ser Cys Arg Ser Gly Val Glu
225 230 235 240
Ala Asn Gln Thr Ala Ser Cys Gly Phe Ala Tyr His Val Ile Thr Ala
245 250 255
Glu Asp Val Val Ala Arg Arg Tyr Thr Pro Thr Ala Thr Phe Arg Ala
260 265 270
Thr Ser Asp Arg Asp Gly Thr Gln Val Leu Gln Asp Asp Met Thr Phe
275 280 285
Thr Thr Gly Thr Val Thr Val Ala Gly Pro Ala Asp Asp Ala Ala Ser
290 295 300
Thr Pro Thr Glu Arg Lys Asp Gly Glu Pro Leu Leu Leu Ala Thr Asn
305 310 315 320
Lys Gln Ile Gly Asn Thr Asp Tyr Tyr Arg Ile Pro Ala Ile Ala Gln
325 330 335
Ala Pro Asn Gly Trp Ile Leu Ala Ala Trp Asp Leu Arg Pro Lys Leu
340 345 350
Ala Ala Asp Ala Pro Asn Pro Asn Ser Ile Val Gln Arg Ile Ser Lys
355 360 365
Asp Gly Gly Lys Ser Trp Glu Thr Leu Ala Tyr Val Ala Gln Gly Arg
370 375 380
Ser Ala Thr Asn Lys Tyr Gly Tyr Ser Asp Pro Ser Tyr Val Val Asp
385 390 395 400
Glu Glu Ala Gly Lys Ile Phe Leu Phe Cys Val Lys Ser Tyr Asp Gln
405 410 415
Gly Tyr Phe Gly Ser Val Leu Gly Val Glu Asp Ala Arg Asn Val Leu
420 425 430
Gln Ala Val Val Met Glu Ser Asp Asp Asn Gly Ala Thr Trp Ser Glu
435 440 445
Pro Arg Asn Ile Thr Lys Asp Ile Thr Lys Gly His Glu Asp Glu Trp
450 455 460
Lys Ser Arg Phe Ala Ser Ser Gly His Gly Ile Gln Leu Lys Tyr Gly
465 470 475 480
Gln Tyr Lys Gly Arg Leu Ile Gln Gln Tyr Ala Val Arg Thr Thr Ser
485 490 495
Asn Thr Asn Ile Ala Val Ser Val Tyr Ser Asp Asp His Gly Lys Thr
500 505 510
Trp Lys Ala Gly Asn Pro Val Thr Glu Ala Asn Met Asp Glu Asn Lys
515 520 525
Val Val Glu Leu Ser Asp Gly Arg Val Met Leu Asn Ser Arg Pro Gly
530 535 540
Ala Ala Gly Tyr Arg Arg Val Ala Ile Ser Glu Asp Gly Gly Val Asn
545 550 555 560
Tyr Gly Pro Ile Lys Ser Glu Thr Gln Leu Pro Asp Pro Asn Asn Asn
565 570 575
Ala Gln Ile Thr Arg Ala Phe Pro Asn Ala Pro Glu Gly Ser Ala Lys
580 585 590
Ala Lys Val Leu Leu Tyr Ser Ala Pro Arg Ala Ser Asn Glu Gly Arg
595 600 605
Ala Asn Gly Val Val Arg Val Ser Phe Asp Asp Gly Thr Thr Trp Ser
610 615 620
Ala Gly Lys Leu Phe Lys Glu Gly Ser Met Ala Tyr Ser Val Ile Thr
625 630 635 640
Ala Leu Asn Asp Ala Ala Gly Gly Gly Tyr Gly Leu Leu Tyr Glu Gly
645 650 655
Glu Ser Ile Thr Tyr Thr Arg Val Ser Met Glu Trp Leu Gly Tyr Leu
660 665 670
Thr Ala Thr Ala Ser Gly Thr Ala Thr Val Lys Glu Gly Glu Gly Thr
675 680 685
Leu Thr Ala Pro Val Thr Val Thr Asn Asp Gly Leu Thr Asp Tyr Thr
690 695 700
Asn Val Thr Val Thr Pro Thr Gly Leu Pro Ser Gly Trp Ser Ala Glu
705 710 715 720
Ala Val Asn Val Gly Asn Leu Ala Ala Gly Gln Ser Ala Thr Val Asn
725 730 735
Val Pro Val Thr Val Pro Ala Ala Ala Val Ser Gly Thr Val Ala Lys
740 745 750
Ala Thr Met Lys Ile Thr Gly Lys Tyr Ala Gln Ser Glu Asp Thr Leu
755 760 765
His Ser Phe Ala Glu Gly Glu Leu Ala Val Thr Val Thr Glu Pro Asp
770 775 780
Pro Ala Ala Lys Arg Leu Lys Leu Thr Ile Glu Arg Thr Asp Asp Asn
785 790 795 800
Gly Ala Pro Val Lys Val Gly Asp Thr Leu Thr Tyr Arg Ile Thr Tyr
805 810 815
Glu Asn Val Gly Thr Gln Ser Phe Ala Val Tyr Pro Arg Lys Ser Asn
820 825 830
Leu Asp Gly Val Thr Thr Pro Gln Ser Ala Ser Asn Pro Ala Pro Val
835 840 845
Cys Arg Trp Ser Arg Leu Asp Pro Gly Thr Thr Gly Ala Cys Val Ser
850 855 860
Gly Asn Gly Lys Arg Leu Ala Tyr His Thr Val Thr Glu Ala Asp Ala
865 870 875 880
Thr Ala Gly Ser Phe Thr Pro Ser Ala Thr Ile Asp Ala Thr Ala Asp
885 890 895
Thr Ser Gly Glu Thr Val Leu Glu Ser Val Ser Ile Thr Gly Asp Pro
900 905 910
Val Thr Val Ser Gln Pro Val Glu Leu Pro Ala Asp Ile Ala Ala Trp
915 920 925
Lys Thr Arg Asn Glu Ala Leu Asp Asp Trp Gln Thr Leu Ser Glu Lys
930 935 940
Leu Ala Lys Thr Asp Arg Ile Asn Trp Leu Phe Thr Gly Asp Ser Ile
945 950 955 960
Thr His Gly Val Gln Leu Thr Arg Gly Tyr Arg Thr Tyr Ser Glu Leu
965 970 975
Phe Ala Asn His Leu Asp Thr Ala Ser Val Arg Gly Val Ser Arg Ala
980 985 990
Asn Asp Val Val Met Asn Thr Gly Ile Ser Ser Ala Asp Ala Ser Trp
995 1000 1005
Pro Leu Lys Asp Gly Ala Phe Glu Lys Trp Val Ser Asp Lys His Pro
1010 1015 1020
Asp Val Val Phe Leu Thr Phe Gly Met Asn Asp Gly Arg Thr Gly Gln
1025 1030 1035 1040
Ala Phe Thr Val Asp Gln Tyr Thr Ala Asn Leu Ser Thr Leu Ile Asp
1045 1050 1055
Lys Ile Arg Asp Leu Gly Ala Ile Pro Val Leu Gln Thr Gln Asn Tyr
1060 1065 1070
Thr Thr Asn Thr Thr Phe Asn Ala Asn Leu Asp Thr Tyr Phe Asp Ala
1075 1080 1085
Glu Arg Arg Leu Ala Leu Asp Lys Asn Val Leu Leu Val Asp Phe Asn
1090 1095 1100
Lys Gln Trp Leu Glu Leu Gly Gly Gly Asn Arg Glu Ser Gly Thr Tyr
1105 1110 1115 1120
Met Gly Ala Gly Asn Asp Ile His Pro Gly Glu Asn Gly His Ile Glu
1125 1130 1135
Trp Ala Lys Phe Thr Leu Gly Ala Leu Asn Met Ile Ala Asn Asp Asp
1140 1145 1150
Pro Leu Ala Arg Trp Ser Ser Ser Asp Thr Thr Leu Asp Lys Pro Thr
1155 1160 1165
Val Thr Val Asp Ala Asp Gly Asn Gly Leu Lys Gly Ser Asp Gly Leu
1170 1175 1180
Glu Pro Ala Pro Ala Ala Ala Lys Ser Val Gly Lys Phe Leu Ser Gly
1185 1190 1195 1200
Ala Gln Tyr Val Asp Leu Gly Gly Asp Val Val Ser Ala Val Ala Gly
1205 1210 1215
Lys Arg Glu Ser Asn Val Thr Ile Arg Phe Arg Ala Ser Ala Thr Gly
1220 1225 1230
Gln Pro Gln Thr Leu Phe Ser Leu Gly Asp Ser Asp Ser Ala Thr Arg
1235 1240 1245
Ala Thr Val Arg Leu Ser Ala Thr Gly Leu Val Gln Phe Leu Asn Ser
1250 1255 1260
Gly Asn Thr Gly Asp Phe Tyr Thr Val Gly Thr Asn Asp Leu Ala Asp
1265 1270 1275 1280
Gly Ala Trp His Thr Val Ser Val Asn Phe Val Ala Asn Gly Phe Thr
1285 1290 1295
Ile Tyr Val Asp Gly Ala Ala Met Arg Ala Ile Ser Gly Gly Ala Gly
1300 1305 1310
Thr Gln Leu Asn Val Pro Gly Ala Ile Thr Val Asn Thr Ala Thr Ala
1315 1320 1325
Gly Ala Ile Arg Gly Ala Asp Ser Ala Gly Gly Ala Gln Gln Leu Thr
1330 1335 1340
Gly Ile Val Asp Tyr Val Ala Ala Trp Ser Arg Thr Leu Thr Asp Ala
1345 1350 1355 1360
Glu Ala Lys Arg Ile Ser Ala Glu Thr Ser Ala Val Ala Val Thr Lys
1365 1370 1375
Val Asp Ala Ala Val Asn Ala Leu Gln Pro Ile Ile Ser Asp Thr Gly
1380 1385 1390
Ala Arg Lys Asn Ile Val Phe Val Gly Gly Glu Thr Ile Glu Gly Gly
1395 1400 1405
Tyr Thr Asp His Leu Ile Ala Lys Asn Ile Val Gln Leu Leu Asp Glu
1410 1415 1420
Arg Val Arg Trp Glu Tyr Val Thr Gly Leu Ser Ala Thr Asp Arg Glu
1425 1430 1435 1440
Leu Gln Arg Ala Lys Phe Phe Val Ala Ala Gly Gln Gly Gly Leu Thr
1445 1450 1455
Ala Lys Gln Met Asp Glu Asp Tyr Ala Ala Met Val Gly Glu Tyr Ser
1460 1465 1470
Pro Asp Ile Leu Phe Leu Ala Pro Asp Leu Tyr Asp Ala Asp Gly Ile
1475 1480 1485
Leu Ala Glu Ser Asp Ala Ala Ala Phe Ala Gly His Ile Arg Ser Val
1490 1495 1500
Ala Ala Lys Ala Lys Glu Ala Gly Ala Lys Val Val Leu Val Thr Pro
1505 1510 1515 1520
Val Thr Val Arg Gly Gly Glu Asp Glu Tyr Ala Gly Ala Met Arg Thr
1525 1530 1535
Val Ala Lys Glu Asp Asp Leu Pro Leu Ile Asp Ala Gln Ala Trp Ile
1540 1545 1550
Gly Lys Val Val Ala Ala Asp Ala Ser Val Lys Thr Ala Trp Phe Asn
1555 1560 1565
Lys Ala Gly Gln Leu Asn Tyr Ala Gly His Leu Gly Tyr Ala Arg Phe
1570 1575 1580
Met Met Arg Ser Leu Asp Leu Tyr Pro Ser Asn Val Ser Gly Ser Arg
1585 1590 1595 1600
Ile Ala Ser Leu Pro Tyr Asp Thr Ala Asn Val Thr Leu Val Gly Ala
1605 1610 1615
Ser Glu Asn Gly Gly Glu Leu Pro Val Gly Arg Val Glu Gly Thr Asp
1620 1625 1630
Arg Ala His Ile Asp Thr Met Gln Ile Gly Ala Ala Ala Ser Leu Val
1635 1640 1645
Val Thr Asp Ser Tyr Ala Val Tyr Glu Ile Gly Glu Asp Gly Gly Arg
1650 1655 1660
Thr Leu Val Ala Asp Gly Leu Lys Pro Ala Asp Val Leu Ala Asp Gly
1665 1670 1675 1680
Ile Asp Val Thr Val Asn Asp Thr Ala Ala His Arg Tyr Glu Val Val
1685 1690 1695
Gly Ser Ala Asn Val Pro Glu Gly Ala Asp Ala Val Thr Val Thr Tyr
1700 1705 1710
Thr Ala Thr Leu Ala Ala Val Glu Glu Pro Glu Pro Gly Pro Asp Pro
1715 1720 1725
Asp Pro Thr Pro Asp Pro Ser Glu Lys Pro Asp Gly Asp Gly Thr Gly
1730 1735 1740
Asp Gly Thr Gly Ala Gly Ala Gly Asp Val Gln Lys Pro Thr Pro Asp
1745 1750 1755 1760
Ala Val Ala Lys Thr Gly Ala Asp Val Phe Gly Leu Leu Ala Ala Val
1765 1770 1775
Ala Val Leu Leu Ala Ala Gly Gly Val Thr Leu Ser Leu Arg Arg Arg
1780 1785 1790
Ala Asn Arg
1795
<210> 6
<211> 770
<212> PRT
<213> Bifidobacterium bifidum
<400> 6
Met Val Arg Ser Thr Lys Pro Ser Leu Leu Arg Arg Phe Gly Ala Leu
1 5 10 15
Val Ala Ala Ala Ala Met Leu Val Val Leu Pro Ala Gly Val Ser Thr
20 25 30
Ala Ser Ala Ala Ser Asp Asp Ala Asp Met Leu Thr Val Thr Met Thr
35 40 45
Arg Thr Asp Ala Leu Gly Asp Glu Val Tyr Val Gly Asp Thr Leu Thr
50 55 60
Tyr Ser Phe Thr Asn Thr Asn Asn Thr Ser Ser Ala Phe Thr Ala Phe
65 70 75 80
Pro Ala Glu Ser Asn Leu Ser Gly Val Leu Thr Thr Gly Thr Pro Asn
85 90 95
Cys Arg Tyr Glu Asn Leu Ala Gly Gly Ala Ser Tyr Pro Cys Ser Thr
100 105 110
Ala Ser His Thr Ile Thr Ala Asp Asp Leu Thr Ala Gly Ser Phe Thr
115 120 125
Pro Arg Thr Val Trp Lys Ala Thr Ser Asp Arg Gly Gly Thr Gln Val
130 135 140
Leu Gln Asp Asn Ile Val Ser Thr Gly Asp Thr Val Thr Val Lys Glu
145 150 155 160
Gly Lys Arg Pro Asp Pro Ala Thr Ile Pro Thr Asp Arg Ala Asp Gly
165 170 175
Glu Ala Val Arg Leu Ala Thr Ala Arg Gln Asn Leu Gly Thr Glu Cys
180 185 190
Tyr Arg Ile Pro Ala Leu Ala Glu Ala Pro Asn Gly Trp Ile Leu Ala
195 200 205
Ala Phe Asp Gln Arg Pro Asn Thr Ala Met Ala Asn Gly Ser Gly Val
210 215 220
Lys Cys Trp Asp Ala Pro Gln Pro Asn Ser Ile Val Gln Arg Ile Ser
225 230 235 240
Lys Asp Gly Gly Lys Ser Trp Thr Pro Ile Gln Tyr Val Ala Gln Gly
245 250 255
Lys Asn Ala Pro Glu Arg Tyr Gly Tyr Ser Asp Pro Ser Tyr Val Val
260 265 270
Asp Lys Glu Thr Gly Glu Ile Phe Leu Phe Phe Val His Ser Tyr Asn
275 280 285
Lys Gly Phe Ala Asp Ser Gln Leu Gly Val Asp Glu Ser Asn Arg Asn
290 295 300
Val Leu His Ala Val Val Val Ser Ser Lys Asp Asn Gly Glu Thr Trp
305 310 315 320
Ser Lys Pro Arg Asp Ile Thr Ala Asp Ile Thr Lys Gly Tyr Glu Asn
325 330 335
Glu Trp Lys Ser Arg Phe Ala Thr Ser Gly Ala Gly Ile Gln Leu Lys
340 345 350
Tyr Gly Lys Tyr Lys Gly Arg Leu Ile Gln Gln Tyr Ala Val Gly Arg
355 360 365
Thr Thr Gly Ser Asn Ala Ala Val Ser Val Tyr Ser Asp Asp His Gly
370 375 380
Lys Thr Trp Gln Ala Gly Asn Pro Val Thr Gly Met Leu Met Asp Glu
385 390 395 400
Asn Lys Val Val Glu Leu Ser Asp Gly Arg Val Met Leu Asn Ser Arg
405 410 415
Pro Gly Asn Gly Ser Gly Tyr Arg Arg Val Ala Ile Ser Glu Asp Gly
420 425 430
Gly Val Asn Tyr Gly Thr Val Lys Asn Glu Thr Gln Leu Pro Asp Pro
435 440 445
Asn Asn Asn Ala His Ile Thr Arg Ala Phe Pro Asn Ala Pro Glu Gly
450 455 460
Ser Ala Lys Ala Lys Val Leu Leu Tyr Ser Ser Pro Arg Ala Asn Asn
465 470 475 480
Glu Gly Arg Ala Asn Gly Val Val Arg Ile Ser Leu Asp Asp Gly Thr
485 490 495
Thr Trp Ser Ser Gly Lys Leu Tyr Lys Glu Gly Ser Met Ala Tyr Ser
500 505 510
Val Ile Thr Ala Leu Ser Gly Ala Ala Gly Gly Gly Tyr Gly Leu Leu
515 520 525
Tyr Glu Gly Ala Trp Val Thr Gly Gly Gly Ile Asp Ser His Asp Ile
530 535 540
Met Tyr Thr His Ile Ser Met Asp Trp Leu Gly Tyr Leu Ser Ala Thr
545 550 555 560
Ala Asp Asp Val Thr Ala Ser Val Glu Glu Gly Ala Ser Thr Val Asp
565 570 575
Val Thr Val Pro Val Ser Asn Val Gly Ser Val Asp Tyr Thr Gly Val
580 585 590
Thr Val Thr Pro Ala Asp Leu Pro Thr Gly Trp Ser Ala Ser Pro Val
595 600 605
Asn Val Gly Ala Leu Ala Ser Gly Ala Ser Lys Asp Val Thr Val Thr
610 615 620
Val Asn Val Pro Ala Thr Ala Lys Lys Asp Asp Val Ala Lys Ile Val
625 630 635 640
Leu Arg Val Thr Gly Thr Ser Ala Ala Asn Ala Asn Ala Thr Thr Gly
645 650 655
Phe Asp Gly Ser Ile Thr Val Asn Val Thr Glu Lys Ser Glu Pro Asp
660 665 670
Pro Glu Pro Glu Pro Thr Ile Thr Gly Val Ser Ala Val Thr Ser Gln
675 680 685
Ala Gly Val Lys Val Gly Asp Val Phe Asp Ala Ser Lys Val Ser Val
690 695 700
Thr Ala Ala Met Ser Asp Gly Ser Ser Lys Ala Leu Ala Ala Gly Glu
705 710 715 720
Tyr Ser Leu Ser Ala Val Asp Ala Asp Gly Lys Ala Val Asp Leu Ala
725 730 735
Glu Pro Phe Ala Ala Ala Gly Val Val Thr Val Thr Val Ser Val Pro
740 745 750
Val Glu Gly Ala Asp Pro Leu Thr Ala Ser Phe Thr Ile Asp Val Ala
755 760 765
Glu Lys
770
<210> 7
<211> 834
<212> PRT
<213> Bifidobacterium bifidum
<400> 7
Met Val Arg Ser Thr Lys Pro Ser Leu Leu Arg Arg Leu Gly Ala Leu
1 5 10 15
Val Ala Ala Ala Ala Met Leu Val Val Leu Pro Ala Gly Val Ser Thr
20 25 30
Ala Ser Ala Ala Ser Asp Asp Ala Asp Met Leu Thr Val Thr Met Thr
35 40 45
Arg Thr Asp Ala Leu Gly Asp Glu Val Tyr Val Gly Asp Thr Leu Thr
50 55 60
Tyr Ser Phe Thr Asn Thr Asn Asn Thr Ser Ser Ala Phe Thr Ala Phe
65 70 75 80
Pro Ala Glu Ser Asn Leu Ser Gly Val Leu Thr Thr Gly Thr Pro Asn
85 90 95
Cys Arg Tyr Glu Asn Leu Ala Gly Gly Ala Ser Tyr Pro Cys Ser Thr
100 105 110
Ala Ser His Thr Ile Thr Ala Asp Asp Leu Thr Ala Gly Ser Phe Thr
115 120 125
Pro Arg Thr Val Trp Lys Ala Thr Ser Asp Arg Gly Gly Thr Gln Val
130 135 140
Leu Gln Asp Asn Ile Val Ser Thr Gly Asp Thr Val Thr Val Lys Glu
145 150 155 160
Gly Lys Arg Pro Asp Pro Ala Thr Ile Pro Thr Asp Arg Ala Asp Gly
165 170 175
Glu Ala Val Arg Leu Ala Thr Ala Arg Gln Asn Leu Gly Thr Glu Cys
180 185 190
Tyr Arg Ile Pro Ala Leu Ala Glu Ala Pro Asn Gly Trp Ile Leu Ala
195 200 205
Ala Phe Asp Gln Arg Pro Asn Thr Ala Met Ala Asn Gly Ser Gly Val
210 215 220
Lys Cys Trp Asp Ala Pro Gln Pro Asn Ser Ile Val Gln Arg Ile Ser
225 230 235 240
Lys Asp Gly Gly Lys Ser Trp Thr Pro Ile Gln Tyr Val Ala Gln Gly
245 250 255
Lys Asn Ala Pro Glu Arg Tyr Gly Tyr Ser Asp Pro Ser Tyr Val Val
260 265 270
Asp Glu Glu Thr Gly Glu Ile Phe Leu Phe Phe Val His Ser Tyr Asn
275 280 285
Lys Gly Phe Ala Asp Ser Gln Leu Gly Val Asp Glu Ser Asn Arg Asn
290 295 300
Val Leu His Ala Val Val Val Ser Ser Lys Asp Asn Gly Glu Thr Trp
305 310 315 320
Ser Lys Pro Arg Asp Ile Thr Ala Asp Ile Thr Lys Gly Tyr Glu Asn
325 330 335
Glu Trp Lys Ser Arg Phe Ala Thr Ser Gly Ala Gly Ile Gln Leu Lys
340 345 350
Tyr Gly Lys Tyr Lys Gly Arg Leu Ile Gln Gln Tyr Ala Val Gly Arg
355 360 365
Thr Thr Gly Ser Asn Ala Ala Val Ser Val Tyr Ser Asp Asp His Gly
370 375 380
Lys Thr Trp Gln Ala Gly Asn Pro Val Thr Gly Met Leu Met Asp Glu
385 390 395 400
Asn Lys Val Val Glu Leu Ser Asp Gly Arg Val Met Leu Asn Ser Arg
405 410 415
Pro Gly Asn Gly Ser Gly Tyr Arg Arg Val Ala Ile Ser Lys Asp Gly
420 425 430
Gly Val Asn Tyr Gly Thr Val Lys Asn Glu Thr Gln Leu Pro Asp Pro
435 440 445
Asn Asn Asn Ala His Ile Thr Arg Ala Phe Pro Asn Ala Pro Glu Gly
450 455 460
Ser Ala Lys Ala Lys Val Leu Leu Tyr Ser Ser Pro Arg Ala Asn Asn
465 470 475 480
Glu Gly Arg Ala Asn Gly Val Val Arg Ile Ser Leu Asp Asp Gly Thr
485 490 495
Thr Trp Ser Ser Gly Lys Leu Tyr Lys Ala Gly Ser Met Ala Tyr Ser
500 505 510
Val Ile Thr Ala Leu Ser Gly Ala Ala Gly Gly Gly Tyr Gly Leu Leu
515 520 525
Tyr Glu Gly Ala Trp Val Thr Gly Gly Gly Ile Asp Ser His Asp Ile
530 535 540
Met Tyr Thr His Ile Ser Met Asp Trp Leu Gly Tyr Leu Ser Ala Thr
545 550 555 560
Ala Asp Asp Val Thr Ala Ser Val Glu Glu Gly Ala Ser Thr Val Asp
565 570 575
Val Thr Val Pro Val Ser Asn Ile Gly Ser Val Asp Tyr Thr Gly Val
580 585 590
Thr Val Thr Pro Ala Asp Leu Pro Thr Gly Trp Ser Ala Ser Pro Val
595 600 605
Asn Val Gly Ala Leu Ala Ser Gly Ala Ser Lys Asp Val Thr Val Thr
610 615 620
Val Asn Val Pro Ala Thr Ala Lys Lys Asp Asp Val Ala Lys Ile Val
625 630 635 640
Leu Arg Val Thr Gly Thr Ser Ala Ala Asn Ala Asp Ala Thr Thr Gly
645 650 655
Phe Asp Gly Ser Ile Thr Val Asn Val Thr Glu Lys Ser Glu Pro Asp
660 665 670
Pro Glu Pro Glu Pro Thr Ile Thr Gly Val Ser Ala Val Thr Ser Gln
675 680 685
Ala Gly Val Lys Val Gly Asp Val Phe Asp Ala Ser Lys Val Ser Val
690 695 700
Thr Ala Ala Met Ser Asp Gly Ser Ser Lys Ala Leu Ala Ala Gly Glu
705 710 715 720
Tyr Ser Leu Ser Ala Val Asp Ala Asp Gly Lys Ala Val Asp Leu Ala
725 730 735
Glu Pro Phe Ala Ala Ala Gly Val Val Thr Val Thr Val Ser Val Pro
740 745 750
Val Glu Gly Ala Asn Pro Leu Thr Ala Ser Phe Thr Ile Asp Val Ala
755 760 765
Glu Lys Ser Val Asp Pro Glu Pro Lys Pro Glu Pro Lys Pro Glu Pro
770 775 780
Glu Lys Pro Ala Gly Pro Lys Val Asp Val Pro Thr Glu Gln Pro Gly
785 790 795 800
Leu Ser Lys Thr Gly Ala Ser Thr Ala Gly Met Ser Ile Val Phe Val
805 810 815
Leu Leu Ala Leu Ser Gly Val Ala Ala Leu Ser Leu Arg Arg Arg Ser
820 825 830
Ala His
<210> 8
<211> 1782
<212> PRT
<213> Bifidobacterium bifidum
<400> 8
Met Arg Lys Leu Ser Gly Leu Leu Val Ala Ile Ala Met Leu Ala Val
1 5 10 15
Leu Pro Ala Gly Thr Ile Ser Ala Asn Ala Ala Asp Glu Pro Pro Gln
20 25 30
Glu Tyr Leu Gln Leu Thr Leu Thr Arg Thr Asp Ser Asn Gly Thr Pro
35 40 45
Ala Glu Val Gly Asp Lys Leu Thr Tyr Ser Leu Gly Tyr Lys Asn Val
50 55 60
Ser Asp Thr Gly Phe Ile Val His Pro Thr Ala Ser Asn Leu Asn Asn
65 70 75 80
Val Ala Thr Pro Gln Ser Ala Ser Asn Pro Asn Pro Met Cys Arg Trp
85 90 95
Gly Asn Leu Ala Ala Gly Ala Ser Ala Ala Cys Thr Trp Ser Ala Ser
100 105 110
Lys Glu Phe Ala Tyr His Val Val Thr Glu Asp Asp Val Ala Asn Gly
115 120 125
Phe Thr Pro Thr Ala Thr Val Ser Ala Thr Thr Gln Asp Gly Thr Asn
130 135 140
Gly Val Leu Gln Ser Val Asp Ile Thr Gly Glu Thr Val Pro Ala Val
145 150 155 160
Pro Ala Thr Ser Thr Leu Arg Val Ala Met Gln Arg Thr Asp Thr Leu
165 170 175
Gly Asp Asn Val Lys Ile Gly Asp Arg Leu Thr Phe Asn Phe Thr Tyr
180 185 190
Thr Asn Lys Thr Ala Gln Lys Ile Tyr Ala Tyr Pro Ser Glu Ser Asn
195 200 205
Ile Glu Arg Val Asp Val Val Ser Tyr Pro Arg Asn Ser Cys Arg Ser
210 215 220
Gly Val Glu Ala Asn Gln Thr Ala Ser Cys Gly Phe Ala Tyr His Val
225 230 235 240
Ile Thr Ala Glu Asp Val Val Ala Arg Arg Tyr Thr Pro Thr Ala Thr
245 250 255
Phe Arg Ala Thr Ser Asp Arg Asp Gly Thr Gln Val Leu Gln Asp Asp
260 265 270
Met Thr Phe Thr Thr Gly Thr Val Thr Val Ala Gly Pro Ala Asp Asp
275 280 285
Ala Ala Ser Thr Pro Thr Glu Arg Lys Asp Gly Glu Pro Leu Leu Leu
290 295 300
Ala Thr Asn Lys Gln Ile Gly Asn Thr Asp Tyr Tyr Arg Ile Pro Ala
305 310 315 320
Ile Ala Gln Ala Pro Asn Gly Trp Ile Leu Ala Ala Trp Asp Leu Arg
325 330 335
Pro Ser Ser Ala Ala Asp Ala Pro Asn Pro Asn Ser Ile Val Gln Arg
340 345 350
Ile Ser Lys Asp Gly Gly Lys Ser Trp Glu Thr Leu Ala Tyr Val Ala
355 360 365
Gln Gly Arg Ser Ala Thr Asn Lys Tyr Gly Tyr Ser Asp Pro Ser Tyr
370 375 380
Val Val Asp Glu Glu Ala Gly Lys Ile Phe Leu Phe Cys Val Lys Ser
385 390 395 400
Tyr Asp Gln Gly Tyr Phe Gly Ser Val Leu Gly Val Glu Asp Ala Arg
405 410 415
Asn Val Leu Gln Ala Val Val Met Glu Ser Asp Asp Asn Gly Ala Thr
420 425 430
Trp Ser Glu Pro Arg Asn Ile Thr Lys Asp Ile Thr Lys Gly His Glu
435 440 445
Asp Glu Trp Lys Ser Arg Phe Ala Ser Ser Gly His Gly Ile Gln Leu
450 455 460
Lys Tyr Gly Gln Tyr Lys Gly Arg Leu Ile Gln Gln Tyr Ala Val Arg
465 470 475 480
Thr Thr Ser Asn Thr Asn Ile Ala Val Ser Val Tyr Ser Asp Asp His
485 490 495
Gly Lys Thr Trp Lys Ala Gly Asn Pro Val Thr Glu Val Asn Met Asp
500 505 510
Glu Asn Lys Val Val Glu Leu Ser Asp Gly Arg Val Met Leu Asn Ser
515 520 525
Arg Pro Gly Ala Ala Gly Tyr Arg Arg Val Ala Ile Ser Glu Asp Gly
530 535 540
Gly Val Asn Tyr Gly Pro Ile Lys Ser Glu Thr Gln Leu Pro Asp Pro
545 550 555 560
Asn Asn Asn Ala Gln Ile Thr Arg Ala Phe Pro Asn Ala Pro Glu Gly
565 570 575
Ser Ala Lys Ala Lys Val Leu Leu Tyr Ser Ala Pro Arg Ala Ser Asn
580 585 590
Glu Gly Arg Ala Asn Gly Val Val Arg Val Ser Phe Asp Asp Gly Thr
595 600 605
Thr Trp Ser Ala Gly Lys Leu Phe Lys Glu Gly Ser Met Ala Tyr Ser
610 615 620
Val Ile Thr Ala Leu Asn Asp Ala Ala Gly Gly Gly Tyr Gly Leu Leu
625 630 635 640
Tyr Glu Gly Glu Ser Ile Thr Tyr Thr Arg Ala Ser Met Glu Trp Leu
645 650 655
Gly Tyr Leu Thr Ala Thr Ala Ser Gly Thr Ala Thr Val Lys Glu Gly
660 665 670
Glu Gly Thr Leu Thr Ala Pro Val Thr Val Thr Asn Asp Gly Leu Thr
675 680 685
Asp Tyr Thr Asn Val Asn Val Thr Pro Thr Gly Leu Pro Ser Gly Trp
690 695 700
Ser Ala Glu Ala Val Asn Val Gly Asn Leu Ala Ala Gly Gln Ser Ala
705 710 715 720
Thr Val Asn Val Pro Val Thr Val Pro Ala Ala Ala Val Ser Gly Thr
725 730 735
Val Ala Lys Ala Thr Met Lys Ile Thr Gly Lys Tyr Ala Gln Ser Glu
740 745 750
Asp Thr Leu His Ser Phe Ala Glu Gly Glu Leu Ala Val Thr Val Thr
755 760 765
Asp Pro Asp Pro Ala Ala Lys Arg Leu Lys Leu Thr Ile Glu Arg Thr
770 775 780
Asp Asp Asn Gly Asp Pro Val Lys Val Gly Asp Thr Leu Thr Tyr Arg
785 790 795 800
Ile Thr Tyr Glu Asn Val Gly Thr Gln Ser Phe Ala Val Tyr Pro Arg
805 810 815
Glu Ser Asn Leu Asp Gly Val Thr Thr Pro Gln Ser Ala Ser Asn Pro
820 825 830
Ala Pro Val Cys Arg Trp Ser Arg Leu Ala Pro Gly Thr Thr Gly Ala
835 840 845
Cys Val Ser Gly Asn Gly Lys Gln Leu Ala Tyr His Thr Val Thr Glu
850 855 860
Ala Asp Ala Thr Ala Gly Ser Phe Thr Pro Ser Ala Thr Ile Asp Ala
865 870 875 880
Thr Ala Asp Ala Ser Gly Glu Met Val Leu Glu Ser Val Ser Ile Thr
885 890 895
Gly Asp Pro Val Thr Val Ser Gln Pro Val Glu Leu Pro Ala Asp Ile
900 905 910
Ala Ala Trp Lys Thr Arg Asn Glu Ala Leu Asp Asp Trp Gln Ala Leu
915 920 925
Ser Glu Lys Leu Ala Lys Thr Asp Arg Ile Asn Trp Leu Phe Thr Gly
930 935 940
Asp Ser Ile Thr His Gly Val Gln Phe Thr Arg Gly Tyr Arg Thr Tyr
945 950 955 960
Ser Glu Leu Phe Ala Asn His Leu Asp Thr Ala Ser Val Arg Gly Val
965 970 975
Ser Arg Ala Asn Asp Val Val Met Asn Thr Gly Ile Ser Ser Ala Asp
980 985 990
Ala Ser Trp Pro Leu Lys Asp Gly Ala Phe Glu Lys Trp Val Ser Asp
995 1000 1005
Lys His Pro Asp Val Val Phe Leu Thr Phe Gly Met Asn Asp Gly Arg
1010 1015 1020
Thr Gly Gln Ala Phe Thr Val Asp Gln Tyr Thr Ala Asn Leu Ser Thr
1025 1030 1035 1040
Leu Ile Asp Lys Ile Arg Asp Ile Gly Ala Ile Pro Val Leu Gln Thr
1045 1050 1055
Gln Asn Tyr Thr Thr Asn Thr Thr Phe Asn Ala Asn Leu Asp Thr Tyr
1060 1065 1070
Phe Asp Ala Glu Arg Arg Leu Ala Leu Asp Lys Asn Val Leu Leu Val
1075 1080 1085
Asp Phe Asn Lys Gln Trp Leu Ala Leu Gly Gly Gly Asn Arg Glu Ser
1090 1095 1100
Gly Thr Tyr Met Gly Ala Gly Asn Asp Ile His Pro Gly Glu Asn Gly
1105 1110 1115 1120
His Ile Glu Trp Ala Lys Phe Thr Leu Gly Ala Leu Asn Met Ile Ala
1125 1130 1135
Asn Asp Asp Pro Leu Ala Arg Trp Ser Ser Ser Asp Thr Thr Leu Asp
1140 1145 1150
Lys Pro Thr Val Thr Val Asp Ala Asp Gly Asn Gly Leu Lys Gly Ser
1155 1160 1165
Asp Gly Leu Glu Pro Ala Pro Ala Ala Ala Lys Ser Val Gly Lys Phe
1170 1175 1180
Leu Ser Gly Ala Gln Tyr Val Asp Leu Gly Gly Asp Val Val Ser Ala
1185 1190 1195 1200
Val Ala Gly Lys Arg Glu Ser Asn Val Thr Ile Arg Phe Arg Ala Ser
1205 1210 1215
Ala Thr Gly Gln Pro Gln Thr Leu Phe Ser Leu Gly Asp Ser Asp Ser
1220 1225 1230
Ala Thr Arg Ala Thr Val Arg Leu Ser Ala Thr Gly Leu Val Gln Phe
1235 1240 1245
Leu Asn Ser Gly Asn Thr Gly Asp Phe Tyr Thr Val Gly Thr Asn Asp
1250 1255 1260
Leu Ala Asp Gly Ala Trp His Thr Val Ser Val Asn Phe Val Ala Asn
1265 1270 1275 1280
Gly Phe Thr Ile Tyr Val Asp Gly Ala Ala Met Arg Ala Ile Ser Gly
1285 1290 1295
Gly Thr Gly Thr Gln Leu Asn Val Pro Gly Ala Ile Thr Val Asn Thr
1300 1305 1310
Ala Thr Ala Gly Ala Ile Arg Gly Ala Asp Ser Ala Gly Gly Ala Gln
1315 1320 1325
Gln Leu Thr Gly Ile Val Asp Tyr Val Ala Ala Trp Ser Arg Thr Leu
1330 1335 1340
Thr Gly Ala Glu Ala Lys Arg Ile Ser Ala Glu Thr Ser Ala Val Ala
1345 1350 1355 1360
Val Thr Lys Val Asp Ala Ala Val Asn Ala Leu Gln Pro Ile Ile Ser
1365 1370 1375
Asp Thr Gly Ala Arg Lys Asn Ile Val Phe Val Gly Gly Glu Thr Ile
1380 1385 1390
Glu Gly Gly Tyr Thr Asp His Leu Ile Ala Lys Asn Ile Val Gln Leu
1395 1400 1405
Leu Asp Glu Arg Val Arg Trp Glu Tyr Val Thr Gly Leu Ser Ala Thr
1410 1415 1420
Asp Arg Glu Arg Gln Arg Ala Lys Phe Phe Val Ala Ala Gly Gln Gly
1425 1430 1435 1440
Gly Leu Thr Ala Lys Gln Met Asp Glu Asp Tyr Ala Ala Met Val Gly
1445 1450 1455
Glu Tyr Ser Pro Asp Ile Leu Phe Leu Ala Pro Asp Leu Tyr Asp Ala
1460 1465 1470
Asp Gly Asn Leu Ala Glu Ser Asp Ala Ala Ala Phe Ala Gly His Ile
1475 1480 1485
Arg Ser Val Ala Ala Lys Ala Lys Glu Ala Gly Ala Lys Val Val Leu
1490 1495 1500
Val Thr Pro Val Thr Val Arg Gly Gly Glu Asp Glu Tyr Ala Gly Ala
1505 1510 1515 1520
Met Arg Thr Val Ala Lys Glu Asp Asp Leu Pro Leu Ile Asp Ala Gln
1525 1530 1535
Ala Trp Ile Gly Lys Val Val Ala Ala Asp Ala Ser Val Lys Thr Ala
1540 1545 1550
Trp Phe Asn Lys Ala Gly Gln Leu Asn Tyr Ala Gly His Leu Gly Tyr
1555 1560 1565
Ala Arg Phe Met Met Arg Ser Leu Asp Leu Tyr Pro Ser Asn Val Ser
1570 1575 1580
Gly Ser Arg Ile Ala Ser Leu Pro Tyr Asp Thr Ala Asn Val Thr Leu
1585 1590 1595 1600
Val Gly Ala Ser Glu Asn Gly Gly Glu Leu Pro Val Gly Arg Val Glu
1605 1610 1615
Gly Thr Asp Arg Ala His Ile Asp Thr Met Gln Ile Gly Ala Ala Ala
1620 1625 1630
Ser Leu Val Val Ala Asp Ser Tyr Ala Val Tyr Glu Ile Gly Glu Asp
1635 1640 1645
Gly Gly Arg Thr Leu Val Ala Asp Gly Leu Lys Pro Ala Asp Val Leu
1650 1655 1660
Ala Asp Gly Ile Asp Val Thr Val Asn Asp Thr Ala Ala His Arg Tyr
1665 1670 1675 1680
Glu Val Val Gly Ser Ala Asn Val Pro Glu Gly Ala Asp Ala Val Thr
1685 1690 1695
Val Thr Tyr Thr Ala Thr Leu Ala Ala Val Glu Glu Pro Glu Pro Gly
1700 1705 1710
Pro Asp Pro Asp Pro Thr Pro Asp Pro Ser Glu Lys Pro Asp Gly Asp
1715 1720 1725
Gly Thr Gly Asp Gly Thr Gly Ala Gly Thr Gly Asp Val Gln Lys Pro
1730 1735 1740
Thr Pro Asp Ala Val Ala Lys Thr Gly Ala Asp Val Phe Gly Leu Leu
1745 1750 1755 1760
Ala Ala Val Ala Val Leu Leu Ala Ala Gly Ser Val Thr Leu Ser Leu
1765 1770 1775
Arg Arg Arg Ala Asn Arg
1780
<210> 9
<211> 835
<212> PRT
<213> Bifidobacterium bifidum
<400> 9
Met Val Arg Ser Thr Lys Pro Ser Leu Leu Arg Arg Phe Gly Ala Leu
1 5 10 15
Val Ala Ala Ala Ala Met Leu Val Val Leu Pro Ala Gly Val Ser Thr
20 25 30
Ala Ser Ala Ala Ser Asp Asp Ala Asp Met Leu Thr Val Thr Met Thr
35 40 45
Arg Thr Asp Ala Leu Gly Asp Glu Val Tyr Val Gly Asp Thr Leu Thr
50 55 60
Tyr Ser Phe Thr Asn Thr Asn Asn Thr Ser Ser Ala Phe Thr Ala Phe
65 70 75 80
Pro Ala Glu Ser Asn Leu Ser Gly Val Leu Thr Thr Gly Thr Pro Asn
85 90 95
Cys Arg Tyr Glu Asn Leu Ala Gly Gly Ala Ser Tyr Pro Cys Ser Thr
100 105 110
Ala Ser His Thr Ile Thr Ala Asp Asp Leu Thr Ala Gly Ser Phe Thr
115 120 125
Pro Arg Thr Val Trp Lys Ala Thr Ser Asp Arg Gly Gly Thr Gln Val
130 135 140
Leu Gln Asp Asn Ile Val Ser Thr Gly Asp Thr Val Thr Val Lys Glu
145 150 155 160
Gly Lys Arg Pro Asp Pro Ala Thr Ile Pro Thr Asp Arg Ala Asp Gly
165 170 175
Glu Ala Val Arg Leu Ala Thr Ala Arg Gln Asn Leu Gly Thr Glu Cys
180 185 190
Tyr Arg Ile Pro Ala Leu Ala Glu Ala Pro Asn Gly Trp Ile Leu Ala
195 200 205
Ala Phe Asp Gln Arg Pro Asn Thr Ala Met Ala Asn Gly Ser Gly Val
210 215 220
Lys Cys Trp Asp Ala Pro Gln Pro Asn Ser Ile Val Gln Arg Ile Ser
225 230 235 240
Lys Asp Gly Gly Lys Ser Trp Thr Pro Ile Gln Tyr Val Ala Gln Gly
245 250 255
Lys Asn Ala Pro Glu Arg Tyr Gly Tyr Ser Asp Pro Ser Tyr Val Val
260 265 270
Asp Lys Glu Thr Gly Glu Ile Phe Leu Phe Phe Val His Ser Tyr Asn
275 280 285
Lys Gly Phe Ala Asp Ser Gln Leu Gly Val Asp Glu Ser Asn Arg Asn
290 295 300
Val Leu His Ala Val Val Val Ser Ser Lys Asp Asn Gly Glu Thr Trp
305 310 315 320
Ser Lys Pro Arg Asp Ile Thr Ala Asp Ile Thr Lys Gly Tyr Glu Asn
325 330 335
Glu Trp Lys Ser Arg Phe Ala Thr Ser Gly Ala Gly Ile Gln Leu Lys
340 345 350
Tyr Gly Lys Tyr Lys Gly Arg Leu Ile Gln Gln Tyr Ala Val Gly Arg
355 360 365
Thr Thr Gly Ser Asn Ala Ala Val Ser Val Tyr Ser Asp Asp His Gly
370 375 380
Lys Thr Trp Gln Ala Gly Asn Pro Val Thr Gly Met Leu Met Asp Glu
385 390 395 400
Asn Lys Val Val Glu Leu Ser Asp Gly Arg Val Met Leu Asn Ser Arg
405 410 415
Pro Gly Ala Ala Gly Tyr Arg Arg Val Ala Ile Ser Glu Asp Gly Gly
420 425 430
Val Asn Tyr Gly Thr Val Lys Asn Glu Thr Gln Leu Pro Asp Pro Asn
435 440 445
Asn Asn Ala His Ile Thr Arg Ala Phe Pro Asn Ala Pro Glu Gly Ser
450 455 460
Ala Lys Ala Lys Val Leu Leu Tyr Ser Ser Pro Arg Ala Asn Asn Glu
465 470 475 480
Gly Arg Ala Asn Gly Val Val Arg Ile Ser Leu Asp Asp Gly Thr Thr
485 490 495
Trp Ser Ser Gly Lys Leu Tyr Lys Glu Gly Ser Met Ala Tyr Ser Val
500 505 510
Ile Thr Ala Leu Ser Asp Ala Ala Gly Gly Gly Tyr Gly Leu Leu Tyr
515 520 525
Glu Gly Ala Trp Val Thr Gly Gly Gly Ile Asp Ser His Asp Ile Met
530 535 540
Tyr Thr His Ile Ser Met Asp Trp Leu Gly Tyr Leu Ser Ala Thr Ala
545 550 555 560
Asp Asp Val Thr Ala Ser Val Glu Glu Gly Ala Ser Thr Val Asp Val
565 570 575
Thr Val Pro Val Ser Asn Val Gly Ser Val Asp Tyr Thr Gly Val Thr
580 585 590
Val Thr Pro Ala Asp Leu Pro Thr Gly Trp Ser Ala Ser Pro Val Asn
595 600 605
Val Gly Ala Leu Ala Ser Gly Thr Ser Lys Asp Val Thr Val Thr Val
610 615 620
Asn Val Pro Ala Thr Ala Lys Lys Asp Asp Val Ala Lys Ile Val Leu
625 630 635 640
Arg Val Thr Gly Thr Ser Ala Ala Asn Ala Asn Ala Thr Thr Gly Phe
645 650 655
Asp Gly Ser Ile Thr Val Asn Val Thr Glu Lys Ser Glu Pro Asp Pro
660 665 670
Glu Pro Glu Pro Thr Ile Thr Gly Val Ser Ala Val Thr Ser Gln Ala
675 680 685
Gly Val Lys Val Gly Asp Val Phe Asp Ala Ser Lys Val Ser Val Thr
690 695 700
Ala Ala Met Ser Asp Gly Ser Ser Lys Ala Leu Ala Ala Gly Glu Tyr
705 710 715 720
Ser Leu Ser Ala Val Asp Ala Asp Gly Lys Ala Val Asp Leu Ala Glu
725 730 735
Pro Phe Ala Ala Ala Gly Val Val Thr Val Thr Val Ser Val Pro Val
740 745 750
Glu Gly Ala Gly Pro Leu Thr Ala Ser Phe Thr Ile Asp Val Ala Glu
755 760 765
Lys Ser Val Asp Pro Glu Pro Lys Pro Glu Pro Glu Pro Lys Pro Glu
770 775 780
Pro Glu Lys Pro Ala Gly Pro Lys Val Asp Val Pro Thr Glu Gln Pro
785 790 795 800
Gly Leu Ser Lys Thr Gly Ala Ser Thr Ala Gly Met Ser Ile Val Phe
805 810 815
Val Leu Leu Ala Leu Ser Gly Ile Ala Ala Leu Ser Leu Arg Arg Arg
820 825 830
Ser Val His
835
<210> 10
<211> 1060
<212> PRT
<213> Trypanosoma cruzi
<400> 10
Met Gly Lys Thr Val Val Gly Ala Ser Arg Met Phe Trp Leu Met Phe
1 5 10 15
Phe Val Pro Leu Leu Leu Ala Leu Cys Pro Ser Glu Pro Ala His Ala
20 25 30
Leu Ala Pro Gly Ser Ser Arg Val Glu Leu Phe Lys Arg Gln Ser Ser
35 40 45
Lys Val Pro Phe Glu Lys Gly Gly Lys Val Thr Glu Arg Val Val His
50 55 60
Ser Phe Arg Leu Pro Ala Leu Val Asn Val Asp Gly Val Met Val Ala
65 70 75 80
Ile Ala Asp Ala Arg Tyr Glu Thr Ser Asn Asp Asn Ser Leu Ile Asp
85 90 95
Thr Val Ala Lys Tyr Ser Val Asp Asp Gly Glu Thr Trp Glu Thr Gln
100 105 110
Ile Ala Ile Lys Asn Ser Arg Ala Ser Ser Val Ser Arg Val Val Asp
115 120 125
Pro Thr Val Ile Val Lys Gly Asn Lys Leu Tyr Val Leu Val Gly Ser
130 135 140
Tyr Asn Ser Ser Arg Ser Tyr Trp Thr Ser His Gly Asp Ala Arg Asp
145 150 155 160
Trp Asp Ile Leu Leu Ala Val Gly Glu Val Thr Lys Ser Thr Ala Gly
165 170 175
Gly Lys Ile Thr Ala Ser Ile Lys Trp Gly Ser Pro Val Ser Leu Lys
180 185 190
Glu Phe Phe Pro Ala Glu Met Glu Gly Met His Thr Asn Gln Phe Leu
195 200 205
Gly Gly Ala Gly Val Ala Ile Val Ala Ser Asn Gly Asn Leu Val Tyr
210 215 220
Pro Val Gln Val Thr Asn Lys Lys Lys Gln Val Phe Ser Lys Ile Phe
225 230 235 240
Tyr Ser Glu Asp Glu Gly Lys Thr Trp Lys Phe Gly Glu Gly Arg Ser
245 250 255
Asp Phe Gly Cys Ser Glu Pro Val Ala Leu Glu Trp Glu Gly Lys Leu
260 265 270
Ile Ile Asn Thr Arg Val Asp Tyr Arg Arg Arg Leu Val Tyr Glu Ser
275 280 285
Ser Asp Met Gly Asn Ser Trp Val Glu Ala Val Gly Thr Leu Ser Arg
290 295 300
Val Trp Gly Pro Ser Pro Lys Ser Asn Gln Pro Gly Ser Gln Ser Ser
305 310 315 320
Phe Thr Ala Val Thr Ile Glu Gly Met Arg Val Met Leu Phe Thr His
325 330 335
Pro Leu Asn Phe Lys Gly Arg Trp Leu Arg Asp Arg Leu Asn Leu Trp
340 345 350
Leu Thr Asp Asn Gln Arg Ile Tyr Asn Val Gly Gln Val Ser Ile Gly
355 360 365
Asp Glu Asn Ser Ala Tyr Ser Ser Val Leu Tyr Lys Asp Asp Lys Leu
370 375 380
Tyr Cys Leu His Glu Ile Asn Ser Asn Glu Val Tyr Ser Leu Val Phe
385 390 395 400
Ala Arg Leu Val Gly Glu Leu Arg Ile Ile Lys Ser Val Leu Gln Ser
405 410 415
Trp Lys Asn Trp Asp Ser His Leu Ser Ser Ile Cys Thr Pro Ala Asp
420 425 430
Pro Ala Ala Ser Ser Ser Glu Arg Gly Cys Gly Pro Ala Val Thr Thr
435 440 445
Val Gly Leu Val Gly Phe Leu Ser His Ser Ala Thr Lys Thr Glu Trp
450 455 460
Glu Asp Ala Tyr Arg Cys Val Asn Ala Ser Thr Ala Asn Ala Glu Arg
465 470 475 480
Val Pro Asn Gly Leu Lys Phe Ala Gly Val Gly Gly Gly Ala Leu Trp
485 490 495
Pro Val Ser Gln Gln Gly Gln Asn Gln Arg Tyr His Phe Ala Asn His
500 505 510
Ala Phe Thr Leu Val Ala Ser Val Thr Ile His Glu Val Pro Ser Val
515 520 525
Ala Ser Pro Leu Leu Gly Ala Ser Leu Asp Ser Ser Gly Gly Lys Lys
530 535 540
Leu Leu Gly Leu Ser Tyr Asp Glu Lys His Gln Trp Gln Pro Ile Tyr
545 550 555 560
Gly Ser Thr Pro Val Thr Pro Thr Gly Ser Trp Glu Met Gly Lys Arg
565 570 575
Tyr His Val Val Leu Thr Met Ala Asn Lys Ile Gly Ser Val Tyr Ile
580 585 590
Asp Gly Glu Pro Leu Glu Gly Ser Gly Gln Thr Val Val Pro Asp Gly
595 600 605
Arg Thr Pro Asp Ile Ser His Phe Tyr Val Gly Gly Tyr Gly Arg Ser
610 615 620
Asp Met Pro Thr Ile Ser His Val Thr Val Asn Asn Val Leu Leu Tyr
625 630 635 640
Asn Arg Gln Leu Asn Ala Glu Glu Ile Arg Thr Leu Phe Leu Ser Gln
645 650 655
Asp Leu Ile Gly Thr Glu Ala His Met Gly Ser Ser Ser Gly Ser Ser
660 665 670
Ala His Ser Thr Pro Ser Thr Pro Ala Asp Asn Gly Ala His Ser Thr
675 680 685
Pro Ser Thr Pro Ala Asp Ser Ser Ala His Ser Thr Pro Ser Thr Pro
690 695 700
Ala Asp Ser Ser Ala His Ser Thr Pro Ser Ala Pro Gly Asp Asn Gly
705 710 715 720
Ala His Ser Thr Pro Ser Thr Pro Gly Asp Ser Ser Ala His Ser Thr
725 730 735
Pro Ser Thr Pro Ala Asp Asn Gly Ala His Ser Thr Pro Ser Ala Pro
740 745 750
Ala Asp Ser Asn Ala His Ser Thr Pro Ser Thr Pro Ala Asp Asn Gly
755 760 765
Ala His Ser Thr Pro Ser Thr Pro Ala Asp Asn Gly Ala His Ser Thr
770 775 780
Pro Ser Thr Pro Gly Asp Asn Gly Ala His Ser Thr Pro Ser Thr Pro
785 790 795 800
Gly Asp Ser Ser Ala His Ser Thr Pro Ser Thr Pro Ala Asp Asn Gly
805 810 815
Ala His Ser Thr Pro Ser Ala Pro Ala Asp Ser Asn Ala His Ser Thr
820 825 830
Pro Ser Thr Pro Gly Asp Asn Gly Ala His Ser Thr Pro Ser Ala Pro
835 840 845
Ala Asp Ser Asn Ala His Ser Thr Pro Ser Thr Pro Ala Asp Ser Ser
850 855 860
Ala His Ser Thr Pro Ser Ala Pro Gly Asp Asn Gly Ala His Ser Thr
865 870 875 880
Pro Ser Ala Pro Ala Asp Ser Ser Ala His Ser Thr Pro Ser Ala Pro
885 890 895
Gly Asp Asn Gly Ala His Ser Thr Pro Ser Ala Pro Ala Asp Asn Gly
900 905 910
Ala His Ser Thr Pro Ser Ala Pro Gly Asp Ser Asn Ala His Ser Thr
915 920 925
Pro Ser Thr Pro Ala Asp Ser Ser Ala His Ser Thr Pro Ser Thr Pro
930 935 940
Ala Asp Ser Ser Ala His Ser Thr Pro Ser Ala Pro Gly Asp Asn Gly
945 950 955 960
Ala His Ser Thr Pro Ser Ala Pro Ala Asp Ser Ser Ala His Ser Thr
965 970 975
Pro Ser Ile Pro Gly Asp Ser Ser Ala His Ser Thr Pro Ser Ala Pro
980 985 990
Ala Asp Ser Ser Ala His Ser Thr Pro Ser Ala Pro Gly Asp Asn Gly
995 1000 1005
Ala His Ser Thr Pro Ser Thr Pro Ala Asp Asn Gly Ala Asn Gly Thr
1010 1015 1020
Val Leu Ile Leu His Asp Gly Ala Ala Phe Ser Ala Phe Ser Gly Gly
1025 1030 1035 1040
Gly Leu Leu Leu Cys Ala Gly Ala Leu Leu Leu His Val Phe Val Met
1045 1050 1055
Ala Val Phe Phe
1060
<210> 11
<211> 642
<212> PRT
<213> Trypanosoma cruzi
<400> 11
Met Leu Ala Pro Gly Ser Ser Arg Val Glu Leu Phe Lys Arg Gln Ser
1 5 10 15
Ser Lys Val Pro Phe Glu Lys Asp Gly Lys Val Thr Glu Arg Val Val
20 25 30
His Ser Phe Arg Leu Pro Ala Leu Val Asn Val Asp Gly Val Met Val
35 40 45
Ala Ile Ala Asp Ala Arg Tyr Glu Thr Ser Asn Asp Asn Ser Leu Ile
50 55 60
Asp Thr Val Ala Lys Tyr Ser Val Asp Asp Gly Glu Thr Trp Glu Thr
65 70 75 80
Gln Ile Ala Ile Lys Asn Ser Arg Ala Ser Ser Val Ser Arg Val Val
85 90 95
Asp Pro Thr Val Ile Val Lys Gly Asn Lys Leu Tyr Val Leu Val Gly
100 105 110
Ser Tyr Asn Ser Ser Arg Ser Tyr Trp Thr Ser His Gly Asp Ala Arg
115 120 125
Asp Trp Asp Ile Leu Leu Ala Val Gly Glu Val Thr Lys Ser Thr Ala
130 135 140
Gly Gly Lys Ile Thr Ala Ser Ile Lys Trp Gly Ser Pro Val Ser Leu
145 150 155 160
Lys Glu Phe Phe Pro Ala Glu Met Glu Gly Met His Thr Asn Gln Phe
165 170 175
Leu Gly Gly Ala Gly Val Ala Ile Val Ala Ser Asn Gly Asn Leu Val
180 185 190
Tyr Pro Val Gln Val Thr Asn Lys Lys Lys Gln Val Phe Ser Lys Ile
195 200 205
Phe Tyr Ser Glu Asp Glu Gly Lys Thr Trp Lys Phe Gly Lys Gly Arg
210 215 220
Ser Ala Phe Gly Cys Ser Glu Pro Val Ala Leu Glu Trp Glu Gly Lys
225 230 235 240
Leu Ile Ile Asn Thr Arg Val Asp Tyr Arg Arg Arg Leu Val Tyr Glu
245 250 255
Ser Ser Asp Met Gly Asn Ser Trp Leu Glu Ala Val Gly Thr Leu Ser
260 265 270
Arg Val Trp Gly Pro Ser Pro Lys Ser Asn Gln Pro Gly Ser Gln Ser
275 280 285
Ser Phe Thr Ala Val Thr Ile Glu Gly Met Arg Val Met Leu Phe Thr
290 295 300
His Pro Leu Asn Phe Lys Gly Arg Trp Leu Arg Asp Arg Leu Asn Leu
305 310 315 320
Trp Leu Thr Asp Asn Gln Arg Ile Tyr Asn Val Gly Gln Val Ser Ile
325 330 335
Gly Asp Glu Asn Ser Ala Tyr Ser Ser Val Leu Tyr Lys Asp Asp Lys
340 345 350
Leu Tyr Cys Leu His Glu Ile Asn Ser Asn Glu Val Tyr Ser Leu Val
355 360 365
Phe Ala Arg Leu Val Gly Glu Leu Arg Ile Ile Lys Ser Val Leu Gln
370 375 380
Ser Trp Lys Asn Trp Asp Ser His Leu Ser Ser Ile Cys Thr Pro Ala
385 390 395 400
Asp Pro Ala Ala Ser Ser Ser Glu Arg Gly Cys Gly Pro Ala Val Thr
405 410 415
Thr Val Gly Leu Val Gly Phe Leu Ser His Ser Ala Thr Lys Thr Glu
420 425 430
Trp Glu Asp Ala Tyr Arg Cys Val Asn Ala Ser Thr Ala Asn Ala Glu
435 440 445
Arg Val Pro Asn Gly Leu Lys Phe Ala Gly Val Gly Gly Gly Ala Leu
450 455 460
Trp Pro Val Ser Gln Gln Gly Gln Asn Gln Arg Tyr Arg Phe Ala Asn
465 470 475 480
His Ala Phe Thr Val Val Ala Ser Val Thr Ile His Glu Val Pro Ser
485 490 495
Val Ala Ser Pro Leu Leu Gly Ala Ser Leu Asp Ser Ser Gly Gly Lys
500 505 510
Lys Leu Leu Gly Leu Ser Tyr Asp Glu Arg His Gln Trp Gln Pro Ile
515 520 525
Tyr Gly Ser Thr Pro Val Thr Pro Thr Gly Ser Trp Glu Met Gly Lys
530 535 540
Arg Tyr His Val Val Leu Thr Met Ala Asn Lys Ile Gly Ser Glu Tyr
545 550 555 560
Ile Asp Gly Glu Pro Leu Glu Gly Ser Gly Gln Thr Val Val Pro Asp
565 570 575
Glu Arg Thr Pro Asp Ile Ser His Phe Tyr Val Gly Gly Tyr Lys Arg
580 585 590
Ser Asp Met Pro Thr Ile Ser His Val Thr Val Asn Asn Val Leu Leu
595 600 605
Tyr Asn Arg Gln Leu Asn Ala Glu Glu Ile Arg Thr Leu Phe Leu Ser
610 615 620
Gln Asp Leu Ile Gly Thr Glu Ala His Met Asp Ser Ser Ser Asp Thr
625 630 635 640
Ser Ala
Claims (23)
- 트랜스시알리다아제 활성을 가지는 효소의 촉매 작용 하에서, 화학식 SA-OR2 및 그의 염들의 시알릴 공여체가 일반 화학식 2A 및 그의 염들의 시알릴 수용체와 반응되는 것을 특징으로 하는 일반 화학식 1A의 화합물 및 그의 염들의 합성 방법:
화학식 SA-OR2에서, R2는 단-, 이- 또는 올리고당, 당지질(glycolipid), 당단백질 또는 당펩티드, 고리형 또는 비고리형 지방족 기, 또는 아릴 잔기이고, SA는 α-시알릴 모이어티이고,
일반 화학식 1A에서, R 기들 중의 하나는 α-시알릴 모이어티, 다른 하나는 H이고, X1은 탄수화물 링커를 나타내고, A는 선택적으로 푸코실로 치환된 D-글루코피라노실 단위이고, R1은 가수소분해에 의해 제거될 수 있는 보호기이고, m은 0 또는 1의 정수이며,
일반 화학식 2A에서, X1, A, m 및 R1은 상기에서 정의한 바와 같다.
- 청구항 1에 있어서, 상기 트랜스-시알리다아제 활성을 가지는 효소는 비피도박테리움속(Bifidobacterium) 종들로부터 유래된 시알리다아제 및 트리파노소마 크루지(Trypanosoma cruzi)로부터 유래된 트랜스-시알리다아제로부터 선택되는 방법.
- 청구항 1 또는 청구항 2에 있어서, 상기 트랜스-시알리다아제 활성을 가지는 효소는 조작된 효소인 방법.
- 청구항 1 내지 청구항 3 중 어느 한 항에 있어서, 상기 시알릴 공여체는 2-O-(p-나이트로페닐)-α-D-시알로사이드(2-O-(p-nitrophenyl)-α-D-sialoside), 2-O-(4-메틸움벨리페릴)-α-D-시알로사이드(2-O-(4-methylumbelliferyl)-α-D-sialoside), 페투인(fetuin) 및 3'-O-시알릴-락토스(3'-O-sialyl-lactose)로 구성되는 군에서 선택되는 방법.
- 청구항 5에 있어서, 상기 일반 화학식 1-3A의 화합물 및 그의 염들은 일반 화학식 1-3B 또는 1-3C의 화합물 및 그의 염들인 것을 특징으로 하는 방법:
일반 화학식 1-3B 또는 1-3C에서, R3는 푸코실 또는 H이고, B는 선택적으로 푸코실 및/또는 시알릴로 치환된 N-아세틸-글루코사미노피라노실 단위이며, 링커 X2는 글루코스, N-아세틸-글루코사민, 갈락토스, 푸코스 및 시알산으로부터 선택되는 단당류 구성 단위들을 갖는 선형 또는 분지형 구조를 나타내는 단-, 이-, 삼-, 사-, 오- 또는 올리고당을 의미하고, R1 및 A는 청구항 1에서 정의한 바와 같다.
- 청구항 6에 있어서, 상기 일반 화학식 1-3B 또는 1-3C의 화합물 및 그의 염들은 일반 화학식 1-3D의 인유 올리고당 및 그의 염인 방법:
일반 화학식 1-3D에서, Y는 독립적으로, 선택적으로 시알릴 및/또는 푸코실 잔기로 치환된 N-아세틸-락토사미닐 기이고, p는 독립적으로 0, 1 또는 2의 정수이고, R4'는 일반 화학식 3-3 및 4-3로 특징화되는 기들에서 선택되고, R5'는 H, α-시알릴 모이어티, 일반 화학식 3-3의 기 및 일반 화학식 4-3의 기에서 선택되고,
일반 화학식 3-3 또는 4-3에서, R6은 H 또는 푸코실 잔기이고, R7은 H 또는 α-시알릴 모이어티이고, SA는 α-시알릴 모이어티이다.
- 청구항 6 또는 청구항 7에 있어서, 일반 화학식 1-3B, 1-3C 또는 1-3D의 화합물들 및 그의 염들은 선택적으로 하나 이상의 시알릴 및/ 또는 푸코실 잔기로 치환되고, 말단 갈락토실(galactosyl) 잔기의 3-OH에 시알릴 치환기를 갖는 락토스(lactose), 락토-N-네오테트라오스(lacto-N-neotetraose), 파라-락토-N-헥사오스(para-lacto-N-hexaose), 파라-락토-N-네오헥사오스(para-lacto-N-neohexaose), 락토-N-네오헥사오스(lacto-N-neohexaose), 파라-락토-N-옥타오스(para-lacto-N-octaose), 락토-N-네오옥타오스(lacto-N-neooctaose), 락토-N-테트라오스(lacto-N-tetraose), 락토-N-헥사오스(lacto-N-hexaose), 락토-N-옥타오스(lacto-N-octaose), 아이소-락토-N-옥타오스(iso-lacto-N-octaose), 락토-N-데카오스(lacto-N-decaose) 및 락토-N-네오데카오스(lacto-N-neodecaose)의 R1-배당체(glycoside)들, 및 그의 염들로 구성된 군으로부터 선택되는 방법.
- 청구항 8에 있어서, 상기 화합물들은 Neu5Acα2-3Galβ1-4Glc (3'-O-(N-아세틸-뉴라미노실)-락토스), Neu5Acα2-3Galβ1-4(Fucα1-3)Glc (3-O-푸코실-3'-O-(N-아세틸-뉴라미노실)-락토스), Neu5Acα2-3Galβ1-3GlcNAcβ1-3Galβ1-4Glc (LST a), Neu5Acα2-3Galβ1-4GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-3Galβ1-3(Fucα1-4)GlcNAcβ1-3Galβ1-4Glc (FLST a), Neu5Acα2-3Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-3Galβ1-3GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-3Galβ1-3(Fucα1-4)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-3Galβ1-3(Neu5Acα2-6)GlcNAcβ1-3Galβ1-4Glc (DSLNT), Neu5Acα2-3Galβ1-3(Neu5Acα2-6)(Fucα1-4)GlcNAcβ1-3Galβ1-4Glc (FDSLNT I), Neu5Acα2-3Galβ1-3(Neu5Acα2-6)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc (FDSLNT II), Neu5Acα2-3Galβ1-4GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-3Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-3Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4Glc의 R1-배당체(glycoside)들, 및 그의 염들로 구성된 군으로부터 선택되는 방법.
- 청구항 1 내지 청구항 9 중 어느 한 항에 있어서, 상기 R1은 페닐, 알킬 또는 할로젠으로 구성되는 군에서 선택되는 적어도 하나의 기로 선택적으로 치환된 벤질 또는 2-나프틸메틸 기들인 방법.
- 일반 화학식 1A'의 화합물 및 그의 염들:
일반 화학식 1A'에서, R 기들 중의 하나는 α-시알릴 모이어티, 다른 하나는 H이고, R1은 가수소분해에 의해 제거될 수 있는 보호기이고, A는 선택적으로 푸코실로 치환된 D-글루코피라노실 단위이고, m은 0 또는 1의 정수이고, X1은 탄수화물 링커를 나타내며,
단, 3'-O-(N-아세틸-뉴라미노실)-락토스((3'-O-(N-acetyl-neuraminosyl)-lactose)) 나트륨 염의 1-O-β-벤질(1-O-β-benzyl) 및 1-O-β-(4,5-다이메톡시-2-나이트로)-벤질(1-O-β-(4,5-dimethoxy-2-nitro)-benzyl) 배당체들, 및 6'-O-(N-아세틸-뉴라미노실)-락토스(6'-O-(N-acetyl-neuraminosyl)-lactose) 나트륨 염의 1-O-β-벤질(1-O-β-benzyl) 배당체는 제외된다.
- 청구항 12에 있어서, 상기 일반 화학식 1'-3A의 화합물 및 그의 염들은 일반 화학식 1'-3B 또는 1'-3C의 화합물 및 그의 염들인 것을 특징으로 하는 화합물:
일반 화학식 1'-3B 또는 1'-3C에서, R3는 푸코실 또는 H이고, B는 선택적으로 푸코실 및/또는 시알릴로 치환된 N-아세틸-글루코사미노피라노실 단위이며, 링커 X2는 글루코스, N-아세틸-글루코사민, 갈락토스, 푸코스 및 시알산으로부터 선택되는 단당류 구성 단위들을 갖는 선형 또는 분지형 구조를 나타내는 단-, 이-, 삼-, 사-, 오- 또는 올리고당을 의미하고, R1 및 A는 청구항 11에서 정의한 바와 같다.
- 청구항 13에 있어서, 상기 일반 화학식 1'-3B 또는 1'-3C의 화합물들 및 그의 염들은 일반 화학식 1'-3D의 인유 올리고당 및 그의 염인 화합물:
일반 화학식 1'-3D에서, Y는 독립적으로, 선택적으로 시알릴 및/또는 푸코실 잔기로 치환된 N-아세틸-락토사미닐 기이고, p는 독립적으로 0, 1 또는 2의 정수이고, R4'는 일반 화학식 3-3 및 4-3에 의하여 특징화되는 기들로부터 선택되고, R5'는 H, α-시알릴 모이어티, 일반 화학식 3-3의 기 및 일반 화학식 4-3의 기로부터 선택되고,
일반 화학식 3-3 또는 4-3에서, R6은 H 또는 푸코실 잔기이고, R7은 H 또는 α-시알릴 모이어티이고, SA는 α-시알릴 모이어티이다.
- 청구항 13 또는 청구항 14에 있어서, 상기 일반 화학식 1'-3B, 1'-3C 또는 1'-3D의 화합물들 및 그의 염들은 선택적으로 하나 이상의 시알릴 및/ 또는 푸코실 잔기로 치환되고, 말단 갈락토실 잔기의 3-OH에 시알릴 치환기를 갖는 락토스(lactose), 락토-N-네오테트라오스(lacto-N-neotetraose), 파라-락토-N-헥사오스(para-lacto-N-hexaose), 파라-락토-N-네오헥사오스(para-lacto-N-neohexaose), 락토-N-네오헥사오스(lacto-N-neohexaose), 파라-락토-N-옥타오스(para-lacto-N-octaose), 락토-N-네오옥타오스(lacto-N-neooctaose), 락토-N-테트라오스(lacto-N-tetraose), 락토-N-헥사오스(lacto-N-hexaose), 락토-N-옥타오스(lacto-N-octaose), 아이소-락토-N-옥타오스(iso-lacto-N-octaose), 락토-N-데카오스(lacto-N-decaose) 및 락토-N-네오데카오스(lacto-N-neodecaose)의 R1-배당체(glycoside)들, 및 그의 염들로 구성된 군으로부터 선택되는 화합물.
- 청구항 15에 있어서, 상기 화합물들은 Neu5Acα2-3Galβ1-4Glc (3'-O-(N-아세틸-뉴라미노실)-락토스), Neu5Acα2-3Galβ1-4(Fucα1-3)Glc (3-O-푸코실-3'-O-(N-아세틸-뉴라미노실)-락토스), Neu5Acα2-3Galβ1-3GlcNAcβ1-3Galβ1-4Glc (LST a), Neu5Acα2-3Galβ1-4GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-3Galβ1-3(Fucα1-4)GlcNAcβ1-3Galβ1-4Glc (FLST a), Neu5Acα2-3Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4Glc, Neu5Acα2-3Galβ1-3GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-3Galβ1-3(Fucα1-4)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-3Galβ1-3(Neu5Acα2-6)GlcNAcβ1-3Galβ1-4Glc (DSLNT), Neu5Acα2-3Galβ1-3(Neu5Acα2-6)(Fucα1-4)GlcNAcβ1-3Galβ1-4Glc (FDSLNT I), Neu5Acα2-3Galβ1-3(Neu5Acα2-6)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc (FDSLNT II), Neu5Acα2-3Galβ1-4GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-3Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Neu5Acα2-3Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4Glc의 R1-배당체(glycoside)들, 및 그의 염들로 구성된 군으로부터 선택되는 화합물.
- 청구항 11 내지 청구항 16 중 어느 한 항에 있어서, 상기 R1은 페닐, 알킬 또는 할로젠으로 구성되는 군으로부터 선택되는 적어도 하나의 기로 선택적으로 치환된 벤질 또는 2-나프틸메틸 기들인 화합물.
- 청구항 18에 있어서, 상기 일반 화학식 2A'의 화합물 및 그의 염들은 일반 화학식 2B' 또는 2C'의 화합물 및 그의 염들인 것을 특징으로 하는 화합물:
일반 화학식 2B' 또는 2C'에서, R3는 푸코실이고, B는 선택적으로 푸코실 및/또는 시알릴로 치환된 N-아세틸-글루코사미노피라노실 단위이며, 링커 X2는 글루코스, N-아세틸-글루코사민, 갈락토스, 푸코스 및 시알산으로부터 선택되는 단당류 구성 단위들을 갖는 선형 또는 분지형 구조를 나타내는 단-, 이-, 삼-, 사-, 오- 또는 올리고당을 의미하고, R1 및 A는 청구항 18에서 정의한 바와 같다.
- 청구항 19에 있어서, 상기 일반 화학식 2B' 또는 2C'의 화합물들 및 그의 염들은 일반 화학식 2D'의 인유 올리고당 유도체들 및 그의 염인 화합물:
일반 화학식 2D'에서, R1은 가수소분해에 의해 제거될 수 있는 기이고, R3는 H 또는 푸코실이고, Y는 독립적으로, 선택적으로 시알릴 및/또는 푸코실 잔기로 치환된 N-아세틸-락토사미닐 기이고, p는 독립적으로 0, 1 또는 2의 정수이고, R8은 일반 화학식 5 및 6에 의하여 특징화되는 기들로부터 선택되고, R9는 H, α-시알릴 모이어티, 일반 화학식 5의 기 및 일반 화학식 6의 기로부터 선택되고,
일반 화학식 5 또는 6에서, R6은 H 또는 푸코실 잔기이고, R7은 H 또는 α-시알릴 모이어티이다.
- 청구항 19 또는 청구항 20에 있어서, 상기 일반 화학식 2B', 2C' 또는 2D'의 화합물들 및 그의 염들은 선택적으로 하나 이상의 시알릴 및/ 또는 푸코실 잔기로 치환되고, 비치환된 말단 갈락토실(galactosyl) 잔기를 갖는 락토-N-네오테트라오스(lacto-N-neotetraose), 파라-락토-N-헥사오스(para-lacto-N-hexaose), 파라-락토-N-네오헥사오스(para-lacto-N-neohexaose), 락토-N-네오헥사오스(lacto-N-neohexaose), 파라-락토-N-옥타오스(para-lacto-N-octaose), 락토-N-네오옥타오스(lacto-N-neooctaose), 락토-N-테트라오스(lacto-N-tetraose), 락토-N-헥사오스(lacto-N-hexaose), 락토-N-옥타오스(lacto-N-octaose), 아이소-락토-N-옥타오스(iso-lacto-N-octaose), 락토-N-데카오스(lacto-N-decaose) 및 락토-N-네오데카오스(lacto-N-neodecaose)의 R1-배당체(glycoside)들, 및 그의 염들로 구성된 군으로부터 선택되는 화합물.
- 청구항 18 내지 청구항 21 중 어느 한 항에 있어서, 상기 화합물들은 Galβ1-4(Fucα1-3)Glc (3-O-푸코실락토스(3-O-fucosyllactose)), Galβ1-3GlcNAcβ1-3Galβ1-4Glc (LNT), Galβ1-4GlcNAcβ1-3Galβ1-4Glc (LNnT), Galβ1-3(Fucα1-4)GlcNAcβ1-3Galβ1-4Glc (LNFP II), Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4Glc (LNFP III), Galβ1-3GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc (LNFP V), Galβ1-3(Fucα1-4)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc (LNDFH II), Galβ1-3(Neu5Acα2-6)GlcNAcβ1-3Galβ1-4Glc (LSTb), Galβ1-3(Neu5Acα2-6)(Fucα1-4)GlcNAcβ1-3Galβ1-4Glc, Galβ1-3(Neu5Acα2-6)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Galβ1-4GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc, Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4(Fucα1-3)Glc (LNDFH III)의 R1-배당체(glycoside)들, 및 그의 염들로 구성된 군으로부터 선택되는 화합물.
- 청구항 18 내지 청구항 22 중 어느 한 항에 있어서, 상기 R1은 페닐, 알킬 또는 할로젠으로 구성되는 군으로부터 선택되는 적어도 하나의 기로 선택적으로 치환된 벤질 또는 2-나프틸메틸 기들인 화합물.
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GBGB1012036.8A GB201012036D0 (en) | 2010-07-16 | 2010-07-16 | Synthesis of new carbohydrate derivatives |
GB1012036.8 | 2010-07-16 | ||
EP11166036.1 | 2011-05-13 | ||
EP11166036 | 2011-05-13 | ||
PCT/EP2011/062184 WO2012007588A1 (en) | 2010-07-16 | 2011-07-15 | Synthesis of new sialooligosaccharide derivatives |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20140001198A true KR20140001198A (ko) | 2014-01-06 |
Family
ID=44533375
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR20137003909A KR20140011296A (ko) | 2010-07-16 | 2011-07-15 | 올리고당의 유도체화 |
KR20137003908A KR20140001198A (ko) | 2010-07-16 | 2011-07-15 | 신규 시알로올리고당 유도체들의 합성 |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR20137003909A KR20140011296A (ko) | 2010-07-16 | 2011-07-15 | 올리고당의 유도체화 |
Country Status (9)
Country | Link |
---|---|
US (3) | US20130172548A1 (ko) |
EP (2) | EP2593559A1 (ko) |
JP (2) | JP2013531049A (ko) |
KR (2) | KR20140011296A (ko) |
CN (2) | CN103108956A (ko) |
AU (2) | AU2011278239B2 (ko) |
CA (2) | CA2805501A1 (ko) |
RU (2) | RU2013106897A (ko) |
WO (2) | WO2012007585A1 (ko) |
Families Citing this family (63)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU2012232727A1 (en) | 2011-03-18 | 2013-09-26 | Glycom A/S | Synthesis of new fucose-containing carbohydrate derivatives |
US9234225B2 (en) | 2011-05-13 | 2016-01-12 | Glycom A/S | Method for generating human milk oligosaccharides (HMOs) or precursors thereof |
US9382564B2 (en) | 2011-05-13 | 2016-07-05 | Glycom A/S | Diversification of human milk oligosaccharides (HMOs) or precursors thereof |
US20140235850A1 (en) * | 2011-09-30 | 2014-08-21 | Glycom A/S | Synthesis of hmo core structures |
WO2013091660A1 (en) * | 2011-12-23 | 2013-06-27 | Glycom A/S | A method for obtaining crystalline lacto-n-tetraose and lacto-n-neotetraose precursors and mixtures thereof |
US20150065702A1 (en) * | 2012-03-20 | 2015-03-05 | Glycom A/S | Synthesis of the Trisaccharide 3-O-Fucosyllactose and Intermediates Thereof |
EP2859112A4 (en) | 2012-06-08 | 2015-10-28 | Glycom As | PROCESS FOR THE PREPARATION OF OLIGOSACCHARIDES AND OLIGOSACCHARIDE GLYCOSIDES BY FERMENTATION |
CN104428307A (zh) * | 2012-06-14 | 2015-03-18 | 格礼卡姆股份公司 | 增强人乳寡糖或其前体或掺合物的稳定性和纯度以及增加其生物利用度 |
ITFI20120143A1 (it) * | 2012-07-12 | 2014-01-13 | Inalco S A S Di Giovanni Cipollett I & C | Polimorfi idrati e anidri del 2'-o-fucosillattosio e loro metodi di produzione. |
EP2943500B1 (en) * | 2012-11-13 | 2017-11-08 | Glycom A/S | Crystalline 3-o-fucosyllactose |
WO2014135167A1 (en) * | 2013-03-08 | 2014-09-12 | Glycom A/S | Purification of oligosaccaharides by reversible derivatization |
GB201306689D0 (en) | 2013-04-12 | 2013-05-29 | Glycom As | Synthesis of sialylated/fucosylated human milk oligosaccharides |
US20160113952A1 (en) * | 2013-05-22 | 2016-04-28 | Glycom As | Synthetic Mixture of Oligosaccharides for Treating a Microbiota of a Mammal |
KR101525230B1 (ko) * | 2013-05-31 | 2015-06-01 | 주식회사 진켐 | 시알산 유도체의 제조방법 |
WO2015032413A1 (en) | 2013-09-06 | 2015-03-12 | Glycom A/S | Fermentative production of oligosaccharides |
CN103551562B (zh) * | 2013-10-21 | 2015-07-15 | 中国科学院微生物研究所 | 唾液酸寡糖-金纳米粒子及其制备方法与应用 |
CA3228950A1 (en) | 2014-07-09 | 2016-01-14 | Dsm Nutritional Products, Llc | Oligosaccharide compositions and methods for producing thereof |
CN105372214B (zh) * | 2014-08-18 | 2019-06-28 | 中国科学院上海有机化学研究所 | 一种鉴定新红细胞生成刺激蛋白的n-连接寡糖结构的方法 |
EP3212198B1 (en) | 2014-10-29 | 2020-12-23 | Glycom A/S | Synthetic composition and method for promoting mucosal healing |
US11040049B2 (en) | 2014-10-29 | 2021-06-22 | Glycom A/S | Composition comprising HMSs/HMOs and use thereof |
JP7182872B2 (ja) | 2014-10-29 | 2022-12-05 | グリコム・アクティーゼルスカブ | 過敏性腸症候群の治療のための合成組成物および方法 |
US11040050B2 (en) | 2014-10-29 | 2021-06-22 | Glycom A/S | Composition comprising HMSs/HMOs and use thereof |
WO2016086947A1 (en) * | 2014-12-05 | 2016-06-09 | Glycom A/S | Crystalline difucosyllactose |
US10881674B2 (en) | 2014-12-08 | 2021-01-05 | Glycom A/S | Synthetic composition for treating metabolic disorders |
WO2016091265A1 (en) | 2014-12-08 | 2016-06-16 | Glycom A/S | Synthetic composition for treating metabolic disorders |
US10987368B2 (en) | 2014-12-08 | 2021-04-27 | Glycom A/S | Synthetic composition for preventing or treating CVD |
BR112017015944B1 (pt) | 2015-01-26 | 2022-03-22 | Cadena Bio, Inc | Composição de ração animal, pré-mistura de ração animal, uso e método para produzir uma composição de ração animal |
ES2971093T3 (es) * | 2015-03-31 | 2024-06-03 | Glycom As | Mezclas de oligosacáridos de la leche humana que comprenden 3'-O-sialilactosa |
WO2016199069A1 (en) * | 2015-06-09 | 2016-12-15 | Glycom A/S | Mutated sialidases |
WO2017046711A1 (en) | 2015-09-14 | 2017-03-23 | Glycom A/S | Synthetic composition for microbiota modulation |
WO2017071715A1 (en) | 2015-10-28 | 2017-05-04 | Glycom A/S | Synthetic composition and method for modulating brain function and behaviour |
WO2017071716A1 (en) | 2015-10-28 | 2017-05-04 | Glycom A/S | Synthetic composition and method for modulating emotion and mood disorders |
US10857168B2 (en) | 2016-02-24 | 2020-12-08 | Glycom A/S | Synthetic composition for microbiota modulation |
US10899782B2 (en) | 2016-03-07 | 2021-01-26 | Glycom A/S | Separation of oligosaccharides from fermentation broth |
PL3452051T3 (pl) | 2016-05-05 | 2022-11-14 | Glycom A/S | Kompozycja zawierająca hmos do zastosowania w leczeniu nadwrażliwości trzewnej i/lub bólu trzewnego mediowanych komórkami tucznymi |
PL3452050T3 (pl) | 2016-05-05 | 2023-06-12 | Glycom A/S | Kompozycja zawierająca hmo do leczenia biegunki niezakaźnej |
WO2017198276A1 (en) | 2016-05-19 | 2017-11-23 | Glycom A/S | Synthetic composition |
US10922621B2 (en) | 2016-11-11 | 2021-02-16 | International Business Machines Corporation | Facilitating mapping of control policies to regulatory documents |
EP3589139A4 (en) | 2017-03-01 | 2020-12-23 | Glycom A/S | SYNTHETIC COMPOSITION FOR MICROBIOTE MODULATION |
EP3634429A4 (en) | 2017-05-09 | 2020-11-04 | Glycom A/S | SYNTHETIC COMPOSITION FOR MICROBIOTE MODULATION |
US11541067B2 (en) | 2017-05-24 | 2023-01-03 | Glycom A/S | HMO compositions and methods for reducing detrimental proteolytic metabolites |
WO2018215961A1 (en) | 2017-05-24 | 2018-11-29 | Glycom A/S | Synthetic composition comprising oligosaccharides and its use in medical treatment. |
US20200222474A1 (en) * | 2017-07-24 | 2020-07-16 | North Carolina State University | Compositions and methods for increasing phytochemical bioavailablity and bioactivity |
WO2019038668A1 (en) | 2017-08-21 | 2019-02-28 | Glycom A/S | SYNTHETIC COMPOSITION TO REDUCE ALLERGY SYMPTOMS |
EP3456836A1 (en) | 2017-09-13 | 2019-03-20 | Glycom A/S | Separation of sialylated oligosaccharides from fermentation broth |
CN107602634A (zh) * | 2017-09-19 | 2018-01-19 | 佛山科学技术学院 | 一种三糖的合成方法 |
EP3691658A4 (en) | 2017-10-04 | 2021-06-23 | The Regents of The University of California | IMMUNOMODULATOR OLIGOSACCHARIDES |
US11541069B2 (en) | 2017-11-02 | 2023-01-03 | Glycom A/S | One or more HMOs for reducing or preventing fatigue and/or improving focus or concentration |
JP7327724B2 (ja) | 2017-11-30 | 2023-08-16 | グリコム・アクティーゼルスカブ | 小麦過敏症を処置するためのhmoの混合物 |
US11602545B2 (en) | 2017-12-05 | 2023-03-14 | Glycom A/S | Human milk oligosaccharides for treating migraine |
EP3743078A4 (en) | 2017-12-22 | 2022-01-19 | Glycom A/S | COMPOSITION WITH HMOs TO PREVENT OR REDUCE NOCICEPTION |
CN108129531A (zh) * | 2018-01-24 | 2018-06-08 | 邯郸市赵都精细化工有限公司 | 一种甲基橙皮苷的制备方法 |
CN108220310B (zh) * | 2018-02-27 | 2021-06-29 | 山东大学 | 一种转糖基exo-α-唾液酸苷酶在制备6’唾液酸乳糖中的应用 |
US11554131B2 (en) | 2018-05-31 | 2023-01-17 | Glycom A/S | Mixture of HMOs for treating autoimmune diseases |
TWI667252B (zh) * | 2018-07-13 | 2019-08-01 | 中原大學 | 阿伐-選擇性唾液酸供體及其於製備唾液酸苷的用途 |
CN113194960A (zh) | 2018-12-19 | 2021-07-30 | 格礼卡姆股份公司 | 用于治疗使用低fodmap饮食的人的组合物和方法 |
EP3924999A4 (en) | 2019-02-15 | 2022-11-02 | Kulicke & Soffa Netherlands B.V. | DYNAMIC RELEASE STRIPS FOR SEPARATE COMPONENT ASSEMBLY |
KR20220072866A (ko) * | 2019-10-01 | 2022-06-02 | 글리콤 에이/에스 | 발효액으로부터 중성 올리고사카라이드의 분리 |
WO2021122103A1 (de) * | 2019-12-18 | 2021-06-24 | Basf Se | Verfahren zur reduktion des säuregehaltes in humanen milcholigosacchariden |
WO2022223430A1 (en) | 2021-04-19 | 2022-10-27 | Dsm Ip Assets B.V. | A composition of enzymes and human milk oligosaccharides |
DK202200588A1 (en) | 2022-06-20 | 2024-02-23 | Dsm Ip Assets Bv | Mixture of fucosylated HMOs |
WO2024042235A1 (en) | 2022-08-25 | 2024-02-29 | Dsm Ip Assets B.V. | Hybrid method for producing complex hmos |
WO2024110667A1 (en) | 2022-11-25 | 2024-05-30 | Dsm Ip Assets B.V. | Two-strain system for producing oligosaccharides |
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE69623005T2 (de) * | 1995-04-11 | 2003-04-03 | Neose Technologies, Inc. | Verbesserte verfahren zur enzymatischen synthese von oligosacchariden |
US8008500B2 (en) * | 2007-06-08 | 2011-08-30 | General Electric Company | Intermediates useful for making tetrabenazine compounds |
WO2010115934A1 (en) * | 2009-04-07 | 2010-10-14 | Glycom A/S | Synthesis of 2'-o-fucosyllactose |
MX2012009306A (es) * | 2010-02-19 | 2012-09-12 | Glycom As | Produccion de 6'-o-sialil-lactosa y compuestos intermedios. |
CN105330705A (zh) | 2010-02-19 | 2016-02-17 | 格礼卡姆股份公司 | 用于制备含n-乙酰乳糖胺的四糖乳糖-n-新四糖(lnnt)的方法 |
US20140057868A1 (en) | 2011-02-21 | 2014-02-27 | Glycom A/S | Catalytic hydrogenolysis of a composition of a mixture of oligosaccharide precursors and uses thereof |
AU2012232727A1 (en) | 2011-03-18 | 2013-09-26 | Glycom A/S | Synthesis of new fucose-containing carbohydrate derivatives |
US9382564B2 (en) | 2011-05-13 | 2016-07-05 | Glycom A/S | Diversification of human milk oligosaccharides (HMOs) or precursors thereof |
US20140323705A1 (en) | 2011-05-13 | 2014-10-30 | Markus Hederos | Manufacture of lacto-n-tetraose |
US9234225B2 (en) | 2011-05-13 | 2016-01-12 | Glycom A/S | Method for generating human milk oligosaccharides (HMOs) or precursors thereof |
-
2011
- 2011-07-15 CN CN2011800429257A patent/CN103108956A/zh active Pending
- 2011-07-15 JP JP2013520083A patent/JP2013531049A/ja not_active Withdrawn
- 2011-07-15 RU RU2013106897/04A patent/RU2013106897A/ru not_active Application Discontinuation
- 2011-07-15 AU AU2011278239A patent/AU2011278239B2/en not_active Ceased
- 2011-07-15 RU RU2013106896/04A patent/RU2013106896A/ru not_active Application Discontinuation
- 2011-07-15 EP EP11736329.1A patent/EP2593559A1/en not_active Withdrawn
- 2011-07-15 WO PCT/EP2011/062171 patent/WO2012007585A1/en active Application Filing
- 2011-07-15 JP JP2013520090A patent/JP2013532470A/ja not_active Withdrawn
- 2011-07-15 CA CA 2805501 patent/CA2805501A1/en not_active Abandoned
- 2011-07-15 AU AU2011278236A patent/AU2011278236A1/en not_active Abandoned
- 2011-07-15 CA CA 2805499 patent/CA2805499A1/en not_active Abandoned
- 2011-07-15 CN CN2011800429149A patent/CN103108881A/zh active Pending
- 2011-07-15 KR KR20137003909A patent/KR20140011296A/ko not_active Application Discontinuation
- 2011-07-15 EP EP11731401.3A patent/EP2593465A1/en not_active Withdrawn
- 2011-07-15 US US13/809,829 patent/US20130172548A1/en not_active Abandoned
- 2011-07-15 US US13/809,794 patent/US9102966B2/en not_active Expired - Fee Related
- 2011-07-15 WO PCT/EP2011/062184 patent/WO2012007588A1/en active Application Filing
- 2011-07-15 KR KR20137003908A patent/KR20140001198A/ko not_active Application Discontinuation
-
2015
- 2015-07-06 US US14/792,278 patent/US20160024129A1/en not_active Abandoned
Also Published As
Publication number | Publication date |
---|---|
KR20140011296A (ko) | 2014-01-28 |
US9102966B2 (en) | 2015-08-11 |
CA2805499A1 (en) | 2012-01-19 |
JP2013531049A (ja) | 2013-08-01 |
WO2012007585A1 (en) | 2012-01-19 |
WO2012007585A9 (en) | 2012-07-26 |
EP2593465A1 (en) | 2013-05-22 |
RU2013106897A (ru) | 2014-08-27 |
JP2013532470A (ja) | 2013-08-19 |
CN103108881A (zh) | 2013-05-15 |
AU2011278239B2 (en) | 2014-03-27 |
US20130171696A1 (en) | 2013-07-04 |
CN103108956A (zh) | 2013-05-15 |
CA2805501A1 (en) | 2012-01-19 |
AU2011278239A1 (en) | 2013-02-07 |
RU2013106896A (ru) | 2014-08-27 |
AU2011278236A1 (en) | 2013-02-07 |
US20160024129A1 (en) | 2016-01-28 |
US20130172548A1 (en) | 2013-07-04 |
WO2012007588A9 (en) | 2012-07-05 |
WO2012007588A1 (en) | 2012-01-19 |
EP2593559A1 (en) | 2013-05-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR20140001198A (ko) | 신규 시알로올리고당 유도체들의 합성 | |
US9963729B2 (en) | Diversification of human milk oligosaccharides (HMOs) or precursors thereof | |
US9234225B2 (en) | Method for generating human milk oligosaccharides (HMOs) or precursors thereof | |
JP2014510098A (ja) | 新規のフコース含有炭水化物誘導体の合成 | |
KR20140006026A (ko) | 올리고사카라이드 전구체의 혼합물의 조성물의 촉매적 가수소분해 및 이의 용도 | |
EP2984096B1 (en) | Synthesis of sialylated and optionally fucosylated human milk oligosaccharides | |
WO2014167537A1 (en) | Synthesis of sialylated/fucosylated oligosaccharides | |
AU2012257396A1 (en) | Diversification of human milk oligosaccharides (HMOs) or precursors thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WITN | Application deemed withdrawn, e.g. because no request for examination was filed or no examination fee was paid |