CN115710594A - A method for synthesizing uridine diphosphate-6-azido-D-galactose - Google Patents
A method for synthesizing uridine diphosphate-6-azido-D-galactose
- Publication number
- CN115710594A (application number CN202110968981.9A)
- Authority
- CN
- China
- Prior art keywords
- ala
- gly
- leu
- asp
- val
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 title claims abstract description 74
- 238000000034 method Methods 0.000 title claims abstract description 51
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 title claims abstract description 37
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 title claims abstract description 37
- 229940045145 uridine Drugs 0.000 title claims abstract description 37
- -1 diphosphate-6-azido-D-galactose Chemical compound 0.000 title claims abstract description 34
- 230000002194 synthesizing effect Effects 0.000 title claims abstract description 15
- 102000048120 Galactokinases Human genes 0.000 claims abstract description 48
- 108700023157 Galactokinases Proteins 0.000 claims abstract description 48
- 241000186000 Bifidobacterium Species 0.000 claims abstract description 17
- 235000000346 sugar Nutrition 0.000 claims abstract description 13
- 241000219195 Arabidopsis thaliana Species 0.000 claims abstract description 7
- 150000001413 amino acids Chemical group 0.000 claims description 27
- 238000006243 chemical reaction Methods 0.000 claims description 25
- 108090000623 proteins and genes Proteins 0.000 claims description 20
- 230000000694 effects Effects 0.000 claims description 15
- 102000004169 proteins and genes Human genes 0.000 claims description 15
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 claims description 13
- 239000008367 deionised water Substances 0.000 claims description 10
- 229910021641 deionized water Inorganic materials 0.000 claims description 10
- 239000002773 nucleotide Substances 0.000 claims description 10
- 125000003729 nucleotide group Chemical group 0.000 claims description 10
- 239000002904 solvent Substances 0.000 claims description 10
- 238000000926 separation method Methods 0.000 claims description 8
- 241000186011 Bifidobacterium catenulatum Species 0.000 claims description 7
- 241001134772 Bifidobacterium pseudocatenulatum Species 0.000 claims description 7
- GHYOCDFICYLMRF-UTIIJYGPSA-N (2S,3R)-N-[(2S)-3-(cyclopenten-1-yl)-1-[(2R)-2-methyloxiran-2-yl]-1-oxopropan-2-yl]-3-hydroxy-3-(4-methoxyphenyl)-2-[[(2S)-2-[(2-morpholin-4-ylacetyl)amino]propanoyl]amino]propanamide Chemical compound C1(=CCCC1)C[C@@H](C(=O)[C@@]1(OC1)C)NC([C@H]([C@@H](C1=CC=C(C=C1)OC)O)NC([C@H](C)NC(CN1CCOCC1)=O)=O)=O GHYOCDFICYLMRF-UTIIJYGPSA-N 0.000 claims description 6
- ZKHQWZAMYRWXGA-KQYNXXCUSA-J ATP(4-) Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)[C@H]1O ZKHQWZAMYRWXGA-KQYNXXCUSA-J 0.000 claims description 6
- ZKHQWZAMYRWXGA-UHFFFAOYSA-N Adenosine triphosphate Natural products C1=NC=2C(N)=NC=NC=2N1C1OC(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)C(O)C1O ZKHQWZAMYRWXGA-UHFFFAOYSA-N 0.000 claims description 6
- 241001134770 Bifidobacterium animalis Species 0.000 claims description 6
- 241000186013 Bifidobacterium asteroides Species 0.000 claims description 6
- 241000186016 Bifidobacterium bifidum Species 0.000 claims description 6
- 241000186012 Bifidobacterium breve Species 0.000 claims description 6
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 claims description 6
- 229940118852 bifidobacterium animalis Drugs 0.000 claims description 6
- 229940002008 bifidobacterium bifidum Drugs 0.000 claims description 6
- 229940125797 compound 12 Drugs 0.000 claims description 6
- 150000001875 compounds Chemical class 0.000 claims description 6
- 239000007788 liquid Substances 0.000 claims description 6
- 241000186148 Bifidobacterium pseudolongum Species 0.000 claims description 5
- 238000007792 addition Methods 0.000 claims description 5
- 238000012217 deletion Methods 0.000 claims description 5
- 230000037430 deletion Effects 0.000 claims description 5
- 238000006467 substitution reaction Methods 0.000 claims description 5
- 239000000758 substrate Substances 0.000 claims description 5
- 238000004809 thin layer chromatography Methods 0.000 claims description 5
- PGAVKCOVUIYSFO-UHFFFAOYSA-N uridine-triphosphate Natural products OC1C(O)C(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)OC1N1C(=O)NC(=O)C=C1 PGAVKCOVUIYSFO-UHFFFAOYSA-N 0.000 claims description 5
- 241001312342 Bifidobacterium gallinarum Species 0.000 claims description 4
- 241001608472 Bifidobacterium longum Species 0.000 claims description 4
- PGAVKCOVUIYSFO-XVFCMESISA-N UTP Chemical compound O[C@@H]1[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O[C@H]1N1C(=O)NC(=O)C=C1 PGAVKCOVUIYSFO-XVFCMESISA-N 0.000 claims description 4
- 229940009291 bifidobacterium longum Drugs 0.000 claims description 4
- 239000003054 catalyst Substances 0.000 claims description 4
- 238000000746 purification Methods 0.000 claims description 4
- 229950010342 uridine triphosphate Drugs 0.000 claims description 4
- 239000012535 impurity Substances 0.000 claims description 3
- 238000005342 ion exchange Methods 0.000 claims description 3
- 239000002184 metal Substances 0.000 claims description 3
- 229910052751 metal Inorganic materials 0.000 claims description 3
- 238000007036 catalytic synthesis reaction Methods 0.000 claims description 2
- 238000002156 mixing Methods 0.000 claims description 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims 17
- 239000000499 gel Substances 0.000 claims 2
- 241000219194 Arabidopsis Species 0.000 claims 1
- 239000011543 agarose gel Substances 0.000 claims 1
- 150000001720 carbohydrates Chemical class 0.000 claims 1
- 229920002401 polyacrylamide Polymers 0.000 claims 1
- YKVDKWKUEJQXNB-DHVFOXMCSA-N O=C[C@H](O)[C@@H](O)[C@@H](O)[C@H](O)C(O)N=[N+]=[N-] Chemical compound O=C[C@H](O)[C@@H](O)[C@@H](O)[C@H](O)C(O)N=[N+]=[N-] YKVDKWKUEJQXNB-DHVFOXMCSA-N 0.000 abstract description 14
- 239000000126 substance Substances 0.000 abstract description 13
- 102000004190 Enzymes Human genes 0.000 abstract description 9
- 108090000790 Enzymes Proteins 0.000 abstract description 9
- 101000638915 Arabidopsis thaliana UDP-sugar pyrophosphorylase Proteins 0.000 abstract 1
- 229910000147 aluminium phosphate Inorganic materials 0.000 abstract 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-N phosphoric acid Substances OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 abstract 1
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 33
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 19
- 108010079364 N-glycylalanine Proteins 0.000 description 17
- 239000000047 product Substances 0.000 description 17
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 16
- 239000000243 solution Substances 0.000 description 16
- 210000004027 cell Anatomy 0.000 description 15
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 15
- 108010057821 leucylproline Proteins 0.000 description 14
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 13
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 10
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 10
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 10
- 108010081551 glycylphenylalanine Proteins 0.000 description 10
- 108010034529 leucyl-lysine Proteins 0.000 description 10
- 108010090894 prolylleucine Proteins 0.000 description 10
- 238000003786 synthesis reaction Methods 0.000 description 10
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 9
- HBHMVBGGHDMPBF-GARJFASQSA-N Cys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N HBHMVBGGHDMPBF-GARJFASQSA-N 0.000 description 9
- QQAYIVHVRFJICE-AEJSXWLSSA-N Cys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N QQAYIVHVRFJICE-AEJSXWLSSA-N 0.000 description 9
- JUBDONGMHASUCN-IUCAKERBSA-N Gly-Glu-His Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O JUBDONGMHASUCN-IUCAKERBSA-N 0.000 description 9
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 9
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 9
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 9
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 9
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 9
- LCUOTSLIVGSGAU-AVGNSLFASA-N Pro-His-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LCUOTSLIVGSGAU-AVGNSLFASA-N 0.000 description 9
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 9
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 9
- 108010044940 alanylglutamine Proteins 0.000 description 9
- 230000015572 biosynthetic process Effects 0.000 description 9
- 108010054813 diprotin B Proteins 0.000 description 9
- 108010050848 glycylleucine Proteins 0.000 description 9
- 239000013612 plasmid Substances 0.000 description 9
- 108010061238 threonyl-glycine Proteins 0.000 description 9
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 9
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 8
- GHBSKQGCIYSCNS-NAKRPEOUSA-N Ala-Leu-Asp-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GHBSKQGCIYSCNS-NAKRPEOUSA-N 0.000 description 8
- DGFXIWKPTDKBLF-AVGNSLFASA-N Arg-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N DGFXIWKPTDKBLF-AVGNSLFASA-N 0.000 description 8
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 8
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 8
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 8
- NJLLRXWFPQQPHV-SRVKXCTJSA-N Asp-Tyr-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJLLRXWFPQQPHV-SRVKXCTJSA-N 0.000 description 8
- KWUSGAIFNHQCBY-DCAQKATOSA-N Gln-Arg-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O KWUSGAIFNHQCBY-DCAQKATOSA-N 0.000 description 8
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 8
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 8
- COZMNNJEGNPDED-HOCLYGCPSA-N Gly-Val-Trp Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O COZMNNJEGNPDED-HOCLYGCPSA-N 0.000 description 8
- VHHYJBSXXMPQGZ-AVGNSLFASA-N His-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N VHHYJBSXXMPQGZ-AVGNSLFASA-N 0.000 description 8
- XLCZWMJPVGRWHJ-KQXIARHKSA-N Ile-Glu-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N XLCZWMJPVGRWHJ-KQXIARHKSA-N 0.000 description 8
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 8
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 8
- WSPQHZOMTFFWGH-XGEHTFHBSA-N Met-Thr-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(O)=O WSPQHZOMTFFWGH-XGEHTFHBSA-N 0.000 description 8
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 8
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 8
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 8
- UNUZEBFXGWVAOP-DZKIICNBSA-N Tyr-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UNUZEBFXGWVAOP-DZKIICNBSA-N 0.000 description 8
- FOADDSDHGRFUOC-DZKIICNBSA-N Val-Glu-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FOADDSDHGRFUOC-DZKIICNBSA-N 0.000 description 8
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 8
- 108010012581 phenylalanylglutamate Proteins 0.000 description 8
- 108010070643 prolylglutamic acid Proteins 0.000 description 8
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 7
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 7
- XLWSGICNBZGYTA-CIUDSAMLSA-N Arg-Glu-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XLWSGICNBZGYTA-CIUDSAMLSA-N 0.000 description 7
- KSUALAGYYLQSHJ-RCWTZXSCSA-N Arg-Met-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KSUALAGYYLQSHJ-RCWTZXSCSA-N 0.000 description 7
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 7
- WTJIWXMJESRHMM-XDTLVQLUSA-N Gln-Tyr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O WTJIWXMJESRHMM-XDTLVQLUSA-N 0.000 description 7
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 7
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 7
- 241000880493 Leptailurus serval Species 0.000 description 7
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 7
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 7
- KDGBLMDAPJTQIW-RHYQMDGZSA-N Thr-Met-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N)O KDGBLMDAPJTQIW-RHYQMDGZSA-N 0.000 description 7
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 7
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 7
- GUIYPEKUEMQBIK-JSGCOSHPSA-N Val-Tyr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(O)=O GUIYPEKUEMQBIK-JSGCOSHPSA-N 0.000 description 7
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 7
- 108010047857 aspartylglycine Proteins 0.000 description 7
- 238000006206 glycosylation reaction Methods 0.000 description 7
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 7
- 108010054155 lysyllysine Proteins 0.000 description 7
- 239000002609 medium Substances 0.000 description 7
- 238000002360 preparation method Methods 0.000 description 7
- NHWYNIZWLJYZAG-XVYDVKMFSA-N Ala-Ser-His Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N NHWYNIZWLJYZAG-XVYDVKMFSA-N 0.000 description 6
- QKHWNPQNOHEFST-VZFHVOOUSA-N Ala-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N)O QKHWNPQNOHEFST-VZFHVOOUSA-N 0.000 description 6
- PGNNQOJOEGFAOR-KWQFWETISA-N Ala-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 PGNNQOJOEGFAOR-KWQFWETISA-N 0.000 description 6
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 6
- BGINHSZTXRJIPP-FXQIFTODSA-N Asn-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BGINHSZTXRJIPP-FXQIFTODSA-N 0.000 description 6
- OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 6
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 6
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 6
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 6
- PXHVJJICTQNCMI-UHFFFAOYSA-N Nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 description 6
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 6
- JUJWROOIHBZHMG-UHFFFAOYSA-N Pyridine Chemical compound C1=CC=NC=C1 JUJWROOIHBZHMG-UHFFFAOYSA-N 0.000 description 6
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 6
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 6
- 230000013595 glycosylation Effects 0.000 description 6
- 108010020688 glycylhistidine Proteins 0.000 description 6
- 108010037850 glycylvaline Proteins 0.000 description 6
- 108010071207 serylmethionine Proteins 0.000 description 6
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 5
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 5
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 5
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 5
- YKRYHWJRQUSTKG-KBIXCLLPSA-N Ile-Ala-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKRYHWJRQUSTKG-KBIXCLLPSA-N 0.000 description 5
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 5
- WGLAORUKDGRINI-WDCWCFNPSA-N Lys-Glu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGLAORUKDGRINI-WDCWCFNPSA-N 0.000 description 5
- QDMUMFDBUVOZOY-GUBZILKMSA-N Met-Arg-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N QDMUMFDBUVOZOY-GUBZILKMSA-N 0.000 description 5
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 5
- NHDVNAKDACFHPX-GUBZILKMSA-N Pro-Arg-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O NHDVNAKDACFHPX-GUBZILKMSA-N 0.000 description 5
- BRBCKMMXKONBAA-KWBADKCTSA-N Trp-Ala-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 BRBCKMMXKONBAA-KWBADKCTSA-N 0.000 description 5
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 5
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 5
- 108010013835 arginine glutamate Proteins 0.000 description 5
- 238000001976 enzyme digestion Methods 0.000 description 5
- 239000001963 growth medium Substances 0.000 description 5
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 4
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 4
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 4
- NTAZNGWBXRVEDJ-FXQIFTODSA-N Arg-Asp-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NTAZNGWBXRVEDJ-FXQIFTODSA-N 0.000 description 4
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 4
- LFIVHGMKWFGUGK-IHRRRGAJSA-N Gln-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N LFIVHGMKWFGUGK-IHRRRGAJSA-N 0.000 description 4
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 4
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 4
- MTBIKIMYHUWBRX-QWRGUYRKSA-N Gly-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN MTBIKIMYHUWBRX-QWRGUYRKSA-N 0.000 description 4
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 4
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 4
- FGNQZXKVAZIMCI-CIUDSAMLSA-N Leu-Asp-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N FGNQZXKVAZIMCI-CIUDSAMLSA-N 0.000 description 4
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 4
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 4
- GMMLGMFBYCFCCX-KZVJFYERSA-N Met-Thr-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMMLGMFBYCFCCX-KZVJFYERSA-N 0.000 description 4
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 4
- OVRNDRQMDRJTHS-FMDGEEDCSA-N N-acetyl-beta-D-glucosamine Chemical group CC(=O)N[C@H]1[C@H](O)O[C@H](CO)[C@@H](O)[C@@H]1O OVRNDRQMDRJTHS-FMDGEEDCSA-N 0.000 description 4
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 4
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 4
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 4
- YUSRGTQIPCJNHQ-CIUDSAMLSA-N Ser-Arg-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YUSRGTQIPCJNHQ-CIUDSAMLSA-N 0.000 description 4
- 241000193998 Streptococcus pneumoniae Species 0.000 description 4
- PELIQFPESHBTMA-WLTAIBSBSA-N Thr-Tyr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 PELIQFPESHBTMA-WLTAIBSBSA-N 0.000 description 4
- RGYCVIZZTUBSSG-JYJNAYRXSA-N Tyr-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O RGYCVIZZTUBSSG-JYJNAYRXSA-N 0.000 description 4
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 4
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 4
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 4
- 108010047495 alanylglycine Proteins 0.000 description 4
- 229940049595 antibody-drug conjugate Drugs 0.000 description 4
- 108010038633 aspartylglutamate Proteins 0.000 description 4
- 238000005119 centrifugation Methods 0.000 description 4
- 108010049041 glutamylalanine Proteins 0.000 description 4
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 4
- 238000002372 labelling Methods 0.000 description 4
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 4
- 108010000761 leucylarginine Proteins 0.000 description 4
- ZRSNZINYAWTAHE-UHFFFAOYSA-N p-methoxybenzaldehyde Chemical compound COC1=CC=C(C=O)C=C1 ZRSNZINYAWTAHE-UHFFFAOYSA-N 0.000 description 4
- 229940031000 streptococcus pneumoniae Drugs 0.000 description 4
- 239000006228 supernatant Substances 0.000 description 4
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 3
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 3
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 3
- MRQQMVZUHXUPEV-IHRRRGAJSA-N Asp-Arg-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MRQQMVZUHXUPEV-IHRRRGAJSA-N 0.000 description 3
- OEUQMKNNOWJREN-AVGNSLFASA-N Asp-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N OEUQMKNNOWJREN-AVGNSLFASA-N 0.000 description 3
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 3
- 241000588724 Escherichia coli Species 0.000 description 3
- XEKOWRVHYACXOJ-UHFFFAOYSA-N Ethyl acetate Chemical compound CCOC(C)=O XEKOWRVHYACXOJ-UHFFFAOYSA-N 0.000 description 3
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 3
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 3
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 3
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 3
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 3
- DHNXGWVNLFPOMQ-KBPBESRZSA-N Gly-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN DHNXGWVNLFPOMQ-KBPBESRZSA-N 0.000 description 3
- XINDHUAGVGCNSF-QSFUFRPTSA-N His-Ala-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XINDHUAGVGCNSF-QSFUFRPTSA-N 0.000 description 3
- ZZLWLWSUIBSMNP-CIUDSAMLSA-N His-Asp-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZZLWLWSUIBSMNP-CIUDSAMLSA-N 0.000 description 3
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 3
- ADJWHHZETYAAAX-SRVKXCTJSA-N Leu-Ser-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ADJWHHZETYAAAX-SRVKXCTJSA-N 0.000 description 3
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 3
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 3
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 3
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 3
- 108010066427 N-valyltryptophan Proteins 0.000 description 3
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 3
- JSGWNFKWZNPDAV-YDHLFZDLSA-N Phe-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JSGWNFKWZNPDAV-YDHLFZDLSA-N 0.000 description 3
- XKHCJJPNXFBADI-DCAQKATOSA-N Pro-Asp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O XKHCJJPNXFBADI-DCAQKATOSA-N 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 3
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 3
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 3
- HSCJRCZFDFQWRP-UHFFFAOYSA-N Uridindiphosphoglukose Natural products OC1C(O)C(O)C(CO)OC1OP(O)(=O)OP(O)(=O)OCC1C(O)C(O)C(N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-UHFFFAOYSA-N 0.000 description 3
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 3
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 3
- XEYUMGGWQCIWAR-XVKPBYJWSA-N Val-Gln-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N XEYUMGGWQCIWAR-XVKPBYJWSA-N 0.000 description 3
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 3
- 108010005233 alanylglutamic acid Proteins 0.000 description 3
- 239000000611 antibody drug conjugate Substances 0.000 description 3
- 108010062796 arginyllysine Proteins 0.000 description 3
- 108010077245 asparaginyl-proline Proteins 0.000 description 3
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 3
- 239000003153 chemical reaction reagent Substances 0.000 description 3
- 238000001816 cooling Methods 0.000 description 3
- 238000012258 culturing Methods 0.000 description 3
- XPPKVPWEQAFLFU-UHFFFAOYSA-J diphosphate(4-) Chemical group [O-]P([O-])(=O)OP([O-])([O-])=O XPPKVPWEQAFLFU-UHFFFAOYSA-J 0.000 description 3
- 230000002255 enzymatic effect Effects 0.000 description 3
- 238000006911 enzymatic reaction Methods 0.000 description 3
- 108010078144 glutaminyl-glycine Proteins 0.000 description 3
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 3
- 108010084389 glycyltryptophan Proteins 0.000 description 3
- 229930027917 kanamycin Natural products 0.000 description 3
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 3
- 229960000318 kanamycin Drugs 0.000 description 3
- 229930182823 kanamycin A Natural products 0.000 description 3
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 3
- 229910052759 nickel Inorganic materials 0.000 description 3
- 238000005580 one pot reaction Methods 0.000 description 3
- UMJSCPRVCHMLSP-UHFFFAOYSA-N pyridine Natural products COC1=CC=CN=C1 UMJSCPRVCHMLSP-UHFFFAOYSA-N 0.000 description 3
- 108091008146 restriction endonucleases Proteins 0.000 description 3
- 238000012163 sequencing technique Methods 0.000 description 3
- 239000011780 sodium chloride Substances 0.000 description 3
- 238000001308 synthesis method Methods 0.000 description 3
- RYHBNJHYFVUHQT-UHFFFAOYSA-N 1,4-Dioxane Chemical compound C1COCCO1 RYHBNJHYFVUHQT-UHFFFAOYSA-N 0.000 description 2
- WMPDAIZRQDCGFH-UHFFFAOYSA-N 3-methoxybenzaldehyde Chemical compound COC1=CC=CC(C=O)=C1 WMPDAIZRQDCGFH-UHFFFAOYSA-N 0.000 description 2
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 description 2
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 2
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 2
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 2
- NLOMBWNGESDVJU-GUBZILKMSA-N Ala-Met-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLOMBWNGESDVJU-GUBZILKMSA-N 0.000 description 2
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 2
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 2
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 2
- YKZJPIPFKGYHKY-DCAQKATOSA-N Arg-Leu-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKZJPIPFKGYHKY-DCAQKATOSA-N 0.000 description 2
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 2
- ASCGFDYEKSRNPL-CIUDSAMLSA-N Asn-Glu-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O ASCGFDYEKSRNPL-CIUDSAMLSA-N 0.000 description 2
- PMEHKVHZQKJACS-PEFMBERDSA-N Asp-Gln-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PMEHKVHZQKJACS-PEFMBERDSA-N 0.000 description 2
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 2
- XUVTWGPERWIERB-IHRRRGAJSA-N Asp-Pro-Phe Chemical compound N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O XUVTWGPERWIERB-IHRRRGAJSA-N 0.000 description 2
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 2
- KNDCWFXCFKSEBM-AVGNSLFASA-N Asp-Tyr-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O KNDCWFXCFKSEBM-AVGNSLFASA-N 0.000 description 2
- WAEDSQFVZJUHLI-BYULHYEWSA-N Asp-Val-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WAEDSQFVZJUHLI-BYULHYEWSA-N 0.000 description 2
- KZMGYPLQYOPHEL-UHFFFAOYSA-N Boron trifluoride etherate Chemical compound FB(F)F.CCOCC KZMGYPLQYOPHEL-UHFFFAOYSA-N 0.000 description 2
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 2
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 2
- OACPJRQRAHMQEQ-NHCYSSNCSA-N Gln-Val-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OACPJRQRAHMQEQ-NHCYSSNCSA-N 0.000 description 2
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 2
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 2
- KUTPGXNAAOQSPD-LPEHRKFASA-N Glu-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KUTPGXNAAOQSPD-LPEHRKFASA-N 0.000 description 2
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 2
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 2
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 2
- JLJLBWDKDRYOPA-RYUDHWBXSA-N Gly-Gln-Tyr Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JLJLBWDKDRYOPA-RYUDHWBXSA-N 0.000 description 2
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 2
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 2
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 2
- VSLXGYMEHVAJBH-DLOVCJGASA-N His-Ala-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O VSLXGYMEHVAJBH-DLOVCJGASA-N 0.000 description 2
- NHJKZMDIMMTVCK-QXEWZRGKSA-N Ile-Gly-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N NHJKZMDIMMTVCK-QXEWZRGKSA-N 0.000 description 2
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical group CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 2
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 2
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 2
- REPPKAMYTOJTFC-DCAQKATOSA-N Leu-Arg-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O REPPKAMYTOJTFC-DCAQKATOSA-N 0.000 description 2
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 2
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 2
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 2
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 2
- FGZVGOAAROXFAB-IXOXFDKPSA-N Leu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N)O FGZVGOAAROXFAB-IXOXFDKPSA-N 0.000 description 2
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 2
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 2
- DUTMKEAPLLUGNO-JYJNAYRXSA-N Lys-Glu-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DUTMKEAPLLUGNO-JYJNAYRXSA-N 0.000 description 2
- IUWMQCZOTYRXPL-ZPFDUUQYSA-N Lys-Ile-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O IUWMQCZOTYRXPL-ZPFDUUQYSA-N 0.000 description 2
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 2
- GAELMDJMQDUDLJ-BQBZGAKWSA-N Met-Ala-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O GAELMDJMQDUDLJ-BQBZGAKWSA-N 0.000 description 2
- GWADARYJIJDYRC-XGEHTFHBSA-N Met-Thr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GWADARYJIJDYRC-XGEHTFHBSA-N 0.000 description 2
- 108010046068 N-Acetyllactosamine Synthase Proteins 0.000 description 2
- OVRNDRQMDRJTHS-UHFFFAOYSA-N N-acetyl-D-glucosamine Natural products CC(=O)NC1C(O)OC(CO)C(O)C1O OVRNDRQMDRJTHS-UHFFFAOYSA-N 0.000 description 2
- MBLBDJOUHNCFQT-LXGUWJNJSA-N N-acetylglucosamine Natural products CC(=O)N[C@@H](C=O)[C@@H](O)[C@H](O)[C@H](O)CO MBLBDJOUHNCFQT-LXGUWJNJSA-N 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 2
- 241000588650 Neisseria meningitidis Species 0.000 description 2
- DPUOLKQSMYLRDR-UBHSHLNASA-N Phe-Arg-Ala Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 DPUOLKQSMYLRDR-UBHSHLNASA-N 0.000 description 2
- WIVCOAKLPICYGY-KKUMJFAQSA-N Phe-Asp-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N WIVCOAKLPICYGY-KKUMJFAQSA-N 0.000 description 2
- KRYSMKKRRRWOCZ-QEWYBTABSA-N Phe-Ile-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KRYSMKKRRRWOCZ-QEWYBTABSA-N 0.000 description 2
- VJLJGKQAOQJXJG-CIUDSAMLSA-N Pro-Asp-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJLJGKQAOQJXJG-CIUDSAMLSA-N 0.000 description 2
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 2
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 2
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 2
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 2
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 2
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 2
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 2
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 2
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 2
- DCLBXIWHLVEPMQ-JRQIVUDYSA-N Thr-Asp-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DCLBXIWHLVEPMQ-JRQIVUDYSA-N 0.000 description 2
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 2
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 2
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 2
- NMCBVGFGWSIGSB-NUTKFTJISA-N Trp-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NMCBVGFGWSIGSB-NUTKFTJISA-N 0.000 description 2
- HSCJRCZFDFQWRP-JZMIEXBBSA-N UDP-alpha-D-glucose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-JZMIEXBBSA-N 0.000 description 2
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 2
- XQVRMLRMTAGSFJ-QXEWZRGKSA-N Val-Asp-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XQVRMLRMTAGSFJ-QXEWZRGKSA-N 0.000 description 2
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 2
- UDMBCSSLTHHNCD-KQYNXXCUSA-N adenosine 5'-monophosphate Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)[C@H]1O UDMBCSSLTHHNCD-KQYNXXCUSA-N 0.000 description 2
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 2
- 108010008355 arginyl-glutamine Proteins 0.000 description 2
- 108010036533 arginylvaline Proteins 0.000 description 2
- 229960002685 biotin Drugs 0.000 description 2
- 235000020958 biotin Nutrition 0.000 description 2
- 239000011616 biotin Substances 0.000 description 2
- 239000012043 crude product Substances 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 229940079593 drug Drugs 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 2
- 238000004128 high performance liquid chromatography Methods 0.000 description 2
- 108010040030 histidinoalanine Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 230000006698 induction Effects 0.000 description 2
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 2
- 108010009298 lysylglutamic acid Proteins 0.000 description 2
- 229950006780 n-acetylglucosamine Drugs 0.000 description 2
- 108010029020 prolylglycine Proteins 0.000 description 2
- 108010048818 seryl-histidine Proteins 0.000 description 2
- 108010026333 seryl-proline Proteins 0.000 description 2
- 229940126586 small molecule drug Drugs 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- RUUNLLXEBIVUAQ-YTORKDELSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-aminopropanoyl]amino]propanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]propanoyl]amino]propanoic acid Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 RUUNLLXEBIVUAQ-YTORKDELSA-N 0.000 description 1
- OTEWWRBKGONZBW-UHFFFAOYSA-N 2-[[2-[[2-[(2-azaniumylacetyl)amino]-4-methylpentanoyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NC(CC(C)C)C(=O)NCC(=O)NCC(O)=O OTEWWRBKGONZBW-UHFFFAOYSA-N 0.000 description 1
- MSWZFWKMSRAUBD-IVMDWMLBSA-N 2-amino-2-deoxy-D-glucopyranose Chemical compound N[C@H]1C(O)O[C@H](CO)[C@@H](O)[C@@H]1O MSWZFWKMSRAUBD-IVMDWMLBSA-N 0.000 description 1
- GMAATFGIWIVOSD-UHFFFAOYSA-N 2h-1,2,3,4-tetrazole Chemical compound C=1N=NNN=1.C=1N=NNN=1 GMAATFGIWIVOSD-UHFFFAOYSA-N 0.000 description 1
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 1
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 1
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- UGLPMYSCWHTZQU-AUTRQRHGSA-N Ala-Ala-Tyr Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UGLPMYSCWHTZQU-AUTRQRHGSA-N 0.000 description 1
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 1
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 1
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 1
- FVSOUJZKYWEFOB-KBIXCLLPSA-N Ala-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)N FVSOUJZKYWEFOB-KBIXCLLPSA-N 0.000 description 1
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 1
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 1
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 1
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 1
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 1
- FOHXUHGZZKETFI-JBDRJPRFSA-N Ala-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N FOHXUHGZZKETFI-JBDRJPRFSA-N 0.000 description 1
- QQACQIHVWCVBBR-GVARAGBVSA-N Ala-Ile-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QQACQIHVWCVBBR-GVARAGBVSA-N 0.000 description 1
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 1
- VHVVPYOJIIQCKS-QEJZJMRPSA-N Ala-Leu-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VHVVPYOJIIQCKS-QEJZJMRPSA-N 0.000 description 1
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- DGLQWAFPIXDKRL-UBHSHLNASA-N Ala-Met-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N DGLQWAFPIXDKRL-UBHSHLNASA-N 0.000 description 1
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 1
- JAQNUEWEJWBVAY-WBAXXEDZSA-N Ala-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 JAQNUEWEJWBVAY-WBAXXEDZSA-N 0.000 description 1
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 1
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 1
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 1
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 1
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 1
- ZXKNLCPUNZPFGY-LEWSCRJBSA-N Ala-Tyr-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N ZXKNLCPUNZPFGY-LEWSCRJBSA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 1
- 102000009027 Albumins Human genes 0.000 description 1
- 108010088751 Albumins Proteins 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- VHUUQVKOLVNVRT-UHFFFAOYSA-N Ammonium hydroxide Chemical compound [NH4+].[OH-] VHUUQVKOLVNVRT-UHFFFAOYSA-N 0.000 description 1
- DFCIPNHFKOQAME-FXQIFTODSA-N Arg-Ala-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFCIPNHFKOQAME-FXQIFTODSA-N 0.000 description 1
- SBVJJNJLFWSJOV-UBHSHLNASA-N Arg-Ala-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SBVJJNJLFWSJOV-UBHSHLNASA-N 0.000 description 1
- BIOCIVSVEDFKDJ-GUBZILKMSA-N Arg-Arg-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O BIOCIVSVEDFKDJ-GUBZILKMSA-N 0.000 description 1
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 1
- HJAICMSAKODKRF-GUBZILKMSA-N Arg-Cys-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O HJAICMSAKODKRF-GUBZILKMSA-N 0.000 description 1
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 1
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 1
- DNUKXVMPARLPFN-XUXIUFHCSA-N Arg-Leu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DNUKXVMPARLPFN-XUXIUFHCSA-N 0.000 description 1
- RTDZQOFEGPWSJD-AVGNSLFASA-N Arg-Leu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O RTDZQOFEGPWSJD-AVGNSLFASA-N 0.000 description 1
- YTMKMRSYXHBGER-IHRRRGAJSA-N Arg-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YTMKMRSYXHBGER-IHRRRGAJSA-N 0.000 description 1
- HGKHPCFTRQDHCU-IUCAKERBSA-N Arg-Pro-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HGKHPCFTRQDHCU-IUCAKERBSA-N 0.000 description 1
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 1
- HRCIIMCTUIAKQB-XGEHTFHBSA-N Arg-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O HRCIIMCTUIAKQB-XGEHTFHBSA-N 0.000 description 1
- JKRPBTQDPJSQIT-RCWTZXSCSA-N Arg-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O JKRPBTQDPJSQIT-RCWTZXSCSA-N 0.000 description 1
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 1
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 1
- NLCDVZJDEXIDDL-BIIVOSGPSA-N Asn-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O NLCDVZJDEXIDDL-BIIVOSGPSA-N 0.000 description 1
- HZYFHQOWCFUSOV-IMJSIDKUSA-N Asn-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O HZYFHQOWCFUSOV-IMJSIDKUSA-N 0.000 description 1
- BHQQRVARKXWXPP-ACZMJKKPSA-N Asn-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BHQQRVARKXWXPP-ACZMJKKPSA-N 0.000 description 1
- ZPMNECSEJXXNBE-CIUDSAMLSA-N Asn-Cys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ZPMNECSEJXXNBE-CIUDSAMLSA-N 0.000 description 1
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 1
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 1
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 1
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- JWKDQOORUCYUIW-ZPFDUUQYSA-N Asn-Lys-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JWKDQOORUCYUIW-ZPFDUUQYSA-N 0.000 description 1
- MYTHOBCLNIOFBL-SRVKXCTJSA-N Asn-Ser-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYTHOBCLNIOFBL-SRVKXCTJSA-N 0.000 description 1
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 1
- YSYTWUMRHSFODC-QWRGUYRKSA-N Asn-Tyr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O YSYTWUMRHSFODC-QWRGUYRKSA-N 0.000 description 1
- KBQOUDLMWYWXNP-YDHLFZDLSA-N Asn-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KBQOUDLMWYWXNP-YDHLFZDLSA-N 0.000 description 1
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 1
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 1
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 1
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 1
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 1
- HOQGTAIGQSDCHR-SRVKXCTJSA-N Asp-Asn-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HOQGTAIGQSDCHR-SRVKXCTJSA-N 0.000 description 1
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 1
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 1
- BFOYULZBKYOKAN-OLHMAJIHSA-N Asp-Asp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFOYULZBKYOKAN-OLHMAJIHSA-N 0.000 description 1
- RSMIHCFQDCVVBR-CIUDSAMLSA-N Asp-Gln-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N RSMIHCFQDCVVBR-CIUDSAMLSA-N 0.000 description 1
- ZSJFGGSPCCHMNE-LAEOZQHASA-N Asp-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N ZSJFGGSPCCHMNE-LAEOZQHASA-N 0.000 description 1
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 1
- NRIFEOUAFLTMFJ-AAEUAGOBSA-N Asp-Gly-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NRIFEOUAFLTMFJ-AAEUAGOBSA-N 0.000 description 1
- TVIZQBFURPLQDV-DJFWLOJKSA-N Asp-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N TVIZQBFURPLQDV-DJFWLOJKSA-N 0.000 description 1
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 1
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 1
- HKEZZWQWXWGASX-KKUMJFAQSA-N Asp-Leu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HKEZZWQWXWGASX-KKUMJFAQSA-N 0.000 description 1
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 1
- HSGOFISJLFDMBJ-CIUDSAMLSA-N Asp-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N HSGOFISJLFDMBJ-CIUDSAMLSA-N 0.000 description 1
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 1
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 1
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 1
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 1
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 1
- BJDHEININLSZOT-KKUMJFAQSA-N Asp-Tyr-Lys Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(O)=O BJDHEININLSZOT-KKUMJFAQSA-N 0.000 description 1
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 1
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 1
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 1
- 241000186020 Bifidobacterium dentium Species 0.000 description 1
- 241001312954 Bifidobacterium pullorum Species 0.000 description 1
- ZOXJGFHDIHLPTG-UHFFFAOYSA-N Boron Chemical compound [B] ZOXJGFHDIHLPTG-UHFFFAOYSA-N 0.000 description 1
- 101000926224 Bos taurus N-acetyllactosaminide alpha-1,3-galactosyltransferase Proteins 0.000 description 1
- 101100074330 Caenorhabditis elegans lec-8 gene Proteins 0.000 description 1
- 241000699802 Cricetulus griseus Species 0.000 description 1
- GEEXORWTBTUOHC-FXQIFTODSA-N Cys-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N GEEXORWTBTUOHC-FXQIFTODSA-N 0.000 description 1
- MKVKKORBPTUSNX-LPEHRKFASA-N Cys-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N MKVKKORBPTUSNX-LPEHRKFASA-N 0.000 description 1
- 241000620209 Escherichia coli DH5α Species 0.000 description 1
- 241001333951 Escherichia coli O157 Species 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 102000006471 Fucosyltransferases Human genes 0.000 description 1
- 108010019236 Fucosyltransferases Proteins 0.000 description 1
- 108060003306 Galactosyltransferase Proteins 0.000 description 1
- 102000030902 Galactosyltransferase Human genes 0.000 description 1
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 1
- NUMFTVCBONFQIQ-DRZSPHRISA-N Gln-Ala-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NUMFTVCBONFQIQ-DRZSPHRISA-N 0.000 description 1
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 1
- UICOTGULOUGGLC-NUMRIWBASA-N Gln-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UICOTGULOUGGLC-NUMRIWBASA-N 0.000 description 1
- OFPWCBGRYAOLMU-AVGNSLFASA-N Gln-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O OFPWCBGRYAOLMU-AVGNSLFASA-N 0.000 description 1
- XFKUFUJECJUQTQ-CIUDSAMLSA-N Gln-Gln-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XFKUFUJECJUQTQ-CIUDSAMLSA-N 0.000 description 1
- KDXKFBSNIJYNNR-YVNDNENWSA-N Gln-Glu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDXKFBSNIJYNNR-YVNDNENWSA-N 0.000 description 1
- VOLVNCMGXWDDQY-LPEHRKFASA-N Gln-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O VOLVNCMGXWDDQY-LPEHRKFASA-N 0.000 description 1
- VSXBYIJUAXPAAL-WDSKDSINSA-N Gln-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O VSXBYIJUAXPAAL-WDSKDSINSA-N 0.000 description 1
- ORYMMTRPKVTGSJ-XVKPBYJWSA-N Gln-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O ORYMMTRPKVTGSJ-XVKPBYJWSA-N 0.000 description 1
- PODFFOWWLUPNMN-DCAQKATOSA-N Gln-His-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O PODFFOWWLUPNMN-DCAQKATOSA-N 0.000 description 1
- LKVCNGLNTAPMSZ-JYJNAYRXSA-N Gln-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N LKVCNGLNTAPMSZ-JYJNAYRXSA-N 0.000 description 1
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 1
- FQCILXROGNOZON-YUMQZZPRSA-N Gln-Pro-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O FQCILXROGNOZON-YUMQZZPRSA-N 0.000 description 1
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 description 1
- KVQOVQVGVKDZNW-GUBZILKMSA-N Gln-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N KVQOVQVGVKDZNW-GUBZILKMSA-N 0.000 description 1
- YJCZUTXLPXBNIO-BHYGNILZSA-N Gln-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CCC(=O)N)N)C(=O)O YJCZUTXLPXBNIO-BHYGNILZSA-N 0.000 description 1
- WIMVKDYAKRAUCG-IHRRRGAJSA-N Gln-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O WIMVKDYAKRAUCG-IHRRRGAJSA-N 0.000 description 1
- QZQYITIKPAUDGN-GVXVVHGQSA-N Gln-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N QZQYITIKPAUDGN-GVXVVHGQSA-N 0.000 description 1
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 1
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 1
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 1
- WOMUDRVDJMHTCV-DCAQKATOSA-N Glu-Arg-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOMUDRVDJMHTCV-DCAQKATOSA-N 0.000 description 1
- YKLNMGJYMNPBCP-ACZMJKKPSA-N Glu-Asn-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YKLNMGJYMNPBCP-ACZMJKKPSA-N 0.000 description 1
- YYOBUPFZLKQUAX-FXQIFTODSA-N Glu-Asn-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YYOBUPFZLKQUAX-FXQIFTODSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 1
- ZXLZWUQBRYGDNS-CIUDSAMLSA-N Glu-Cys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N ZXLZWUQBRYGDNS-CIUDSAMLSA-N 0.000 description 1
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 1
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 1
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 1
- QIQABBIDHGQXGA-ZPFDUUQYSA-N Glu-Ile-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QIQABBIDHGQXGA-ZPFDUUQYSA-N 0.000 description 1
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 1
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 1
- UJMNFCAHLYKWOZ-DCAQKATOSA-N Glu-Lys-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UJMNFCAHLYKWOZ-DCAQKATOSA-N 0.000 description 1
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 1
- YHOJJFFTSMWVGR-HJGDQZAQSA-N Glu-Met-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YHOJJFFTSMWVGR-HJGDQZAQSA-N 0.000 description 1
- LHIPZASLKPYDPI-AVGNSLFASA-N Glu-Phe-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LHIPZASLKPYDPI-AVGNSLFASA-N 0.000 description 1
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 1
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 1
- QOXDAWODGSIDDI-GUBZILKMSA-N Glu-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N QOXDAWODGSIDDI-GUBZILKMSA-N 0.000 description 1
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 1
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 1
- ZGXGVBYEJGVJMV-HJGDQZAQSA-N Glu-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O ZGXGVBYEJGVJMV-HJGDQZAQSA-N 0.000 description 1
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 1
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 1
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 1
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 1
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 1
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 1
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 1
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 1
- DJTXYXZNNDDEOU-WHFBIAKZSA-N Gly-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)C(=O)N DJTXYXZNNDDEOU-WHFBIAKZSA-N 0.000 description 1
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 1
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 1
- GVVKYKCOFMMTKZ-WHFBIAKZSA-N Gly-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)CN GVVKYKCOFMMTKZ-WHFBIAKZSA-N 0.000 description 1
- NTOWAXLMQFKJPT-YUMQZZPRSA-N Gly-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN NTOWAXLMQFKJPT-YUMQZZPRSA-N 0.000 description 1
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 1
- ORXZVPZCPMKHNR-IUCAKERBSA-N Gly-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 ORXZVPZCPMKHNR-IUCAKERBSA-N 0.000 description 1
- ADZGCWWDPFDHCY-ZETCQYMHSA-N Gly-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 ADZGCWWDPFDHCY-ZETCQYMHSA-N 0.000 description 1
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 1
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 1
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 1
- YKJUITHASJAGHO-HOTGVXAUSA-N Gly-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN YKJUITHASJAGHO-HOTGVXAUSA-N 0.000 description 1
- OJNZVYSGVYLQIN-BQBZGAKWSA-N Gly-Met-Asp Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O OJNZVYSGVYLQIN-BQBZGAKWSA-N 0.000 description 1
- YLEIWGJJBFBFHC-KBPBESRZSA-N Gly-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 YLEIWGJJBFBFHC-KBPBESRZSA-N 0.000 description 1
- IEGFSKKANYKBDU-QWHCGFSZSA-N Gly-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)CN)C(=O)O IEGFSKKANYKBDU-QWHCGFSZSA-N 0.000 description 1
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 1
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 1
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 1
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 1
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 1
- RCHFYMASWAZQQZ-ZANVPECISA-N Gly-Trp-Ala Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)CN)=CNC2=C1 RCHFYMASWAZQQZ-ZANVPECISA-N 0.000 description 1
- UMBDRSMLCUYIRI-DVJZZOLTSA-N Gly-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN)O UMBDRSMLCUYIRI-DVJZZOLTSA-N 0.000 description 1
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- 108700023372 Glycosyltransferases Proteins 0.000 description 1
- 102000051366 Glycosyltransferases Human genes 0.000 description 1
- TTZAWSKKNCEINZ-AVGNSLFASA-N His-Arg-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O TTZAWSKKNCEINZ-AVGNSLFASA-N 0.000 description 1
- FHKZHRMERJUXRJ-DCAQKATOSA-N His-Ser-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 FHKZHRMERJUXRJ-DCAQKATOSA-N 0.000 description 1
- CWSZWFILCNSNEX-CIUDSAMLSA-N His-Ser-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CWSZWFILCNSNEX-CIUDSAMLSA-N 0.000 description 1
- UWNUQPZUSRFIIN-JUKXBJQTSA-N His-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N UWNUQPZUSRFIIN-JUKXBJQTSA-N 0.000 description 1
- GYXDQXPCPASCNR-NHCYSSNCSA-N His-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N GYXDQXPCPASCNR-NHCYSSNCSA-N 0.000 description 1
- KDDKJKKQODQQBR-NHCYSSNCSA-N His-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N KDDKJKKQODQQBR-NHCYSSNCSA-N 0.000 description 1
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 1
- PJLLMGWWINYQPB-PEFMBERDSA-N Ile-Asn-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PJLLMGWWINYQPB-PEFMBERDSA-N 0.000 description 1
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 1
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 1
- JRYQSFOFUFXPTB-RWRJDSDZSA-N Ile-Gln-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N JRYQSFOFUFXPTB-RWRJDSDZSA-N 0.000 description 1
- UBHUJPVCJHPSEU-GRLWGSQLSA-N Ile-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N UBHUJPVCJHPSEU-GRLWGSQLSA-N 0.000 description 1
- TVSPLSZTKTUYLV-ZPFDUUQYSA-N Ile-Glu-Met Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O TVSPLSZTKTUYLV-ZPFDUUQYSA-N 0.000 description 1
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 1
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 1
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 1
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 1
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 1
- FFJQAEYLAQMGDL-MGHWNKPDSA-N Ile-Lys-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FFJQAEYLAQMGDL-MGHWNKPDSA-N 0.000 description 1
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 1
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 1
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 1
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 1
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 1
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 1
- KWHFUMYCSPJCFQ-NGTWOADLSA-N Ile-Thr-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N KWHFUMYCSPJCFQ-NGTWOADLSA-N 0.000 description 1
- RTSQPLLOYSGMKM-DSYPUSFNSA-N Ile-Trp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N RTSQPLLOYSGMKM-DSYPUSFNSA-N 0.000 description 1
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 1
- 102000018071 Immunoglobulin Fc Fragments Human genes 0.000 description 1
- 108010091135 Immunoglobulin Fc Fragments Proteins 0.000 description 1
- 102000009617 Inorganic Pyrophosphatase Human genes 0.000 description 1
- 108010009595 Inorganic Pyrophosphatase Proteins 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 1
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 1
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 1
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 1
- JUWJEAPUNARGCF-DCAQKATOSA-N Leu-Arg-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JUWJEAPUNARGCF-DCAQKATOSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 1
- PPBKJAQJAUHZKX-SRVKXCTJSA-N Leu-Cys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(C)C PPBKJAQJAUHZKX-SRVKXCTJSA-N 0.000 description 1
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 1
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 1
- KVOFSTUWVSQMDK-KKUMJFAQSA-N Leu-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KVOFSTUWVSQMDK-KKUMJFAQSA-N 0.000 description 1
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 1
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 1
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 1
- KQFZKDITNUEVFJ-JYJNAYRXSA-N Leu-Phe-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=CC=C1 KQFZKDITNUEVFJ-JYJNAYRXSA-N 0.000 description 1
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 1
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 1
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 1
- ABHIXYDMILIUKV-CIUDSAMLSA-N Lys-Asn-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ABHIXYDMILIUKV-CIUDSAMLSA-N 0.000 description 1
- NCTDKZKNBDZDOL-GARJFASQSA-N Lys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O NCTDKZKNBDZDOL-GARJFASQSA-N 0.000 description 1
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 1
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 1
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 1
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 1
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 1
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 1
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- LUAJJLPHUXPQLH-KKUMJFAQSA-N Lys-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N LUAJJLPHUXPQLH-KKUMJFAQSA-N 0.000 description 1
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 1
- MSSABBQOBUZFKZ-IHRRRGAJSA-N Lys-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCCN)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O MSSABBQOBUZFKZ-IHRRRGAJSA-N 0.000 description 1
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 1
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 1
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 1
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 1
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 1
- ZFNYWKHYUMEZDZ-WDSOQIARSA-N Lys-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCCCN)N ZFNYWKHYUMEZDZ-WDSOQIARSA-N 0.000 description 1
- WINFHLHJTRGLCV-BZSNNMDCSA-N Lys-Tyr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 WINFHLHJTRGLCV-BZSNNMDCSA-N 0.000 description 1
- SQRLLZAQNOQCEG-KKUMJFAQSA-N Lys-Tyr-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 SQRLLZAQNOQCEG-KKUMJFAQSA-N 0.000 description 1
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 1
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 1
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 1
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 1
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 1
- HMZPYMSEAALNAE-ULQDDVLXSA-N Lys-Val-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMZPYMSEAALNAE-ULQDDVLXSA-N 0.000 description 1
- QGQGAIBGTUJRBR-NAKRPEOUSA-N Met-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCSC QGQGAIBGTUJRBR-NAKRPEOUSA-N 0.000 description 1
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 1
- HLQWFLJOJRFXHO-CIUDSAMLSA-N Met-Glu-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O HLQWFLJOJRFXHO-CIUDSAMLSA-N 0.000 description 1
- MYAPQOBHGWJZOM-UWVGGRQHSA-N Met-Gly-Leu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C MYAPQOBHGWJZOM-UWVGGRQHSA-N 0.000 description 1
- VSJAPSMRFYUOKS-IUCAKERBSA-N Met-Pro-Gly Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O VSJAPSMRFYUOKS-IUCAKERBSA-N 0.000 description 1
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 1
- PCTFVQATEGYHJU-FXQIFTODSA-N Met-Ser-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O PCTFVQATEGYHJU-FXQIFTODSA-N 0.000 description 1
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 1
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 1
- KSIPKXNIQOWMIC-RCWTZXSCSA-N Met-Thr-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KSIPKXNIQOWMIC-RCWTZXSCSA-N 0.000 description 1
- KYXDADPHSNFWQX-VEVYYDQMSA-N Met-Thr-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O KYXDADPHSNFWQX-VEVYYDQMSA-N 0.000 description 1
- CIIJWIAORKTXAH-FJXKBIBVSA-N Met-Thr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O CIIJWIAORKTXAH-FJXKBIBVSA-N 0.000 description 1
- QYIGOFGUOVTAHK-ZJDVBMNYSA-N Met-Thr-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QYIGOFGUOVTAHK-ZJDVBMNYSA-N 0.000 description 1
- HOTNHEUETJELDL-BPNCWPANSA-N Met-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N HOTNHEUETJELDL-BPNCWPANSA-N 0.000 description 1
- OVTOTTGZBWXLFU-QXEWZRGKSA-N Met-Val-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O OVTOTTGZBWXLFU-QXEWZRGKSA-N 0.000 description 1
- KPVLLNDCBYXKNV-CYDGBPFRSA-N Met-Val-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KPVLLNDCBYXKNV-CYDGBPFRSA-N 0.000 description 1
- JGFZNNIVVJXRND-UHFFFAOYSA-N N,N-Diisopropylethylamine (DIPEA) Chemical compound CCN(C(C)C)C(C)C JGFZNNIVVJXRND-UHFFFAOYSA-N 0.000 description 1
- 125000003047 N-acetyl group Chemical group 0.000 description 1
- 230000004988 N-glycosylation Effects 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- UHRNIXJAGGLKHP-DLOVCJGASA-N Phe-Ala-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O UHRNIXJAGGLKHP-DLOVCJGASA-N 0.000 description 1
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 1
- RIYZXJVARWJLKS-KKUMJFAQSA-N Phe-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RIYZXJVARWJLKS-KKUMJFAQSA-N 0.000 description 1
- SWZKMTDPQXLQRD-XVSYOHENSA-N Phe-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWZKMTDPQXLQRD-XVSYOHENSA-N 0.000 description 1
- HOYQLNNGMHXZDW-KKUMJFAQSA-N Phe-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HOYQLNNGMHXZDW-KKUMJFAQSA-N 0.000 description 1
- JWQWPTLEOFNCGX-AVGNSLFASA-N Phe-Glu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JWQWPTLEOFNCGX-AVGNSLFASA-N 0.000 description 1
- RFEXGCASCQGGHZ-STQMWFEESA-N Phe-Gly-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O RFEXGCASCQGGHZ-STQMWFEESA-N 0.000 description 1
- JEBWZLWTRPZQRX-QWRGUYRKSA-N Phe-Gly-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O JEBWZLWTRPZQRX-QWRGUYRKSA-N 0.000 description 1
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 1
- VJLLEKDQJSMHRU-STQMWFEESA-N Phe-Gly-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O VJLLEKDQJSMHRU-STQMWFEESA-N 0.000 description 1
- MIICYIIBVYQNKE-QEWYBTABSA-N Phe-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N MIICYIIBVYQNKE-QEWYBTABSA-N 0.000 description 1
- MJQFZGOIVBDIMZ-WHOFXGATSA-N Phe-Ile-Gly Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O MJQFZGOIVBDIMZ-WHOFXGATSA-N 0.000 description 1
- KXUZHWXENMYOHC-QEJZJMRPSA-N Phe-Leu-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUZHWXENMYOHC-QEJZJMRPSA-N 0.000 description 1
- RSPUIENXSJYZQO-JYJNAYRXSA-N Phe-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RSPUIENXSJYZQO-JYJNAYRXSA-N 0.000 description 1
- SZYBZVANEAOIPE-UBHSHLNASA-N Phe-Met-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O SZYBZVANEAOIPE-UBHSHLNASA-N 0.000 description 1
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 1
- HBXAOEBRGLCLIW-AVGNSLFASA-N Phe-Ser-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HBXAOEBRGLCLIW-AVGNSLFASA-N 0.000 description 1
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 1
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 1
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 1
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 1
- UTAUEDINXUMHLG-FXQIFTODSA-N Pro-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 UTAUEDINXUMHLG-FXQIFTODSA-N 0.000 description 1
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 1
- XUSDDSLCRPUKLP-QXEWZRGKSA-N Pro-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 XUSDDSLCRPUKLP-QXEWZRGKSA-N 0.000 description 1
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 1
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 1
- FDINZVJXLPILKV-DCAQKATOSA-N Pro-His-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O FDINZVJXLPILKV-DCAQKATOSA-N 0.000 description 1
- SSWJYJHXQOYTSP-SRVKXCTJSA-N Pro-His-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O SSWJYJHXQOYTSP-SRVKXCTJSA-N 0.000 description 1
- IBGCFJDLCYTKPW-NAKRPEOUSA-N Pro-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 IBGCFJDLCYTKPW-NAKRPEOUSA-N 0.000 description 1
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 1
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 1
- WIPAMEKBSHNFQE-IUCAKERBSA-N Pro-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@@H]1CCCN1 WIPAMEKBSHNFQE-IUCAKERBSA-N 0.000 description 1
- MHBSUKYVBZVQRW-HJWJTTGWSA-N Pro-Phe-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MHBSUKYVBZVQRW-HJWJTTGWSA-N 0.000 description 1
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- PKHDJFHFMGQMPS-RCWTZXSCSA-N Pro-Thr-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PKHDJFHFMGQMPS-RCWTZXSCSA-N 0.000 description 1
- VBZXFFYOBDLLFE-HSHDSVGOSA-N Pro-Trp-Thr Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H]([C@H](O)C)C(O)=O)C(=O)[C@@H]1CCCN1 VBZXFFYOBDLLFE-HSHDSVGOSA-N 0.000 description 1
- SHTKRJHDMNSKRM-ULQDDVLXSA-N Pro-Tyr-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O SHTKRJHDMNSKRM-ULQDDVLXSA-N 0.000 description 1
- UIUWGMRJTWHIJZ-ULQDDVLXSA-N Pro-Tyr-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O UIUWGMRJTWHIJZ-ULQDDVLXSA-N 0.000 description 1
- IALSFJSONJZBKB-HRCADAONSA-N Pro-Tyr-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N3CCC[C@@H]3C(=O)O IALSFJSONJZBKB-HRCADAONSA-N 0.000 description 1
- JXVXYRZQIUPYSA-NHCYSSNCSA-N Pro-Val-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JXVXYRZQIUPYSA-NHCYSSNCSA-N 0.000 description 1
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 1
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 1
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 1
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 1
- DBIDZNUXSLXVRG-FXQIFTODSA-N Ser-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N DBIDZNUXSLXVRG-FXQIFTODSA-N 0.000 description 1
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 1
- SWIQQMYVHIXPEK-FXQIFTODSA-N Ser-Cys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O SWIQQMYVHIXPEK-FXQIFTODSA-N 0.000 description 1
- RNMRYWZYFHHOEV-CIUDSAMLSA-N Ser-Gln-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RNMRYWZYFHHOEV-CIUDSAMLSA-N 0.000 description 1
- IXUGADGDCQDLSA-FXQIFTODSA-N Ser-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N IXUGADGDCQDLSA-FXQIFTODSA-N 0.000 description 1
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 1
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 1
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 1
- XERQKTRGJIKTRB-CIUDSAMLSA-N Ser-His-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CN=CN1 XERQKTRGJIKTRB-CIUDSAMLSA-N 0.000 description 1
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 1
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 1
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 1
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 1
- SOACHCFYJMCMHC-BWBBJGPYSA-N Ser-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)O SOACHCFYJMCMHC-BWBBJGPYSA-N 0.000 description 1
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 1
- IAOHCSQDQDWRQU-GUBZILKMSA-N Ser-Val-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IAOHCSQDQDWRQU-GUBZILKMSA-N 0.000 description 1
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 1
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 1
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 1
- CAGTXGDOIFXLPC-KZVJFYERSA-N Thr-Arg-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N CAGTXGDOIFXLPC-KZVJFYERSA-N 0.000 description 1
- UKBSDLHIKIXJKH-HJGDQZAQSA-N Thr-Arg-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UKBSDLHIKIXJKH-HJGDQZAQSA-N 0.000 description 1
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 1
- DIPIPFHFLPTCLK-LOKLDPHHSA-N Thr-Gln-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O DIPIPFHFLPTCLK-LOKLDPHHSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- CJXURNZYNHCYFD-WDCWCFNPSA-N Thr-Lys-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O CJXURNZYNHCYFD-WDCWCFNPSA-N 0.000 description 1
- UJQVSMNQMQHVRY-KZVJFYERSA-N Thr-Met-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UJQVSMNQMQHVRY-KZVJFYERSA-N 0.000 description 1
- YJVJPJPHHFOVMG-VEVYYDQMSA-N Thr-Met-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YJVJPJPHHFOVMG-VEVYYDQMSA-N 0.000 description 1
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 1
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 1
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 1
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 1
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 1
- IJRXQJVGFBSKIV-ZFWWWQNUSA-N Trp-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N IJRXQJVGFBSKIV-ZFWWWQNUSA-N 0.000 description 1
- VPRHDRKAPYZMHL-SZMVWBNQSA-N Trp-Leu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 VPRHDRKAPYZMHL-SZMVWBNQSA-N 0.000 description 1
- SUEGAFMNTXXNLR-WFBYXXMGSA-N Trp-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O SUEGAFMNTXXNLR-WFBYXXMGSA-N 0.000 description 1
- ADMHZNPMMVKGJW-BPUTZDHNSA-N Trp-Ser-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N ADMHZNPMMVKGJW-BPUTZDHNSA-N 0.000 description 1
- IXTQGBGHWQEEDE-AVGNSLFASA-N Tyr-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IXTQGBGHWQEEDE-AVGNSLFASA-N 0.000 description 1
- CNLKDWSAORJEMW-KWQFWETISA-N Tyr-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O CNLKDWSAORJEMW-KWQFWETISA-N 0.000 description 1
- GIOBXJSONRQHKQ-RYUDHWBXSA-N Tyr-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GIOBXJSONRQHKQ-RYUDHWBXSA-N 0.000 description 1
- USYGMBIIUDLYHJ-GVARAGBVSA-N Tyr-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 USYGMBIIUDLYHJ-GVARAGBVSA-N 0.000 description 1
- JAGGEZACYAAMIL-CQDKDKBSSA-N Tyr-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JAGGEZACYAAMIL-CQDKDKBSSA-N 0.000 description 1
- CDBXVDXSLPLFMD-BPNCWPANSA-N Tyr-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDBXVDXSLPLFMD-BPNCWPANSA-N 0.000 description 1
- AEOFMCAKYIQQFY-YDHLFZDLSA-N Tyr-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AEOFMCAKYIQQFY-YDHLFZDLSA-N 0.000 description 1
- DJIJBQYBDKGDIS-JYJNAYRXSA-N Tyr-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O DJIJBQYBDKGDIS-JYJNAYRXSA-N 0.000 description 1
- XCCTYIAWTASOJW-UHFFFAOYSA-N UDP-Glc Natural products OC1C(O)C(COP(O)(=O)OP(O)(O)=O)OC1N1C(=O)NC(=O)C=C1 XCCTYIAWTASOJW-UHFFFAOYSA-N 0.000 description 1
- HSCJRCZFDFQWRP-ABVWGUQPSA-N UDP-alpha-D-galactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-ABVWGUQPSA-N 0.000 description 1
- JIODCDXKCJRMEH-NHCYSSNCSA-N Val-Arg-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N JIODCDXKCJRMEH-NHCYSSNCSA-N 0.000 description 1
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 1
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 1
- DCOOGDCRFXXQNW-ZKWXMUAHSA-N Val-Asn-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N DCOOGDCRFXXQNW-ZKWXMUAHSA-N 0.000 description 1
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 1
- IDKGBVZGNTYYCC-QXEWZRGKSA-N Val-Asn-Pro Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(O)=O IDKGBVZGNTYYCC-QXEWZRGKSA-N 0.000 description 1
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 1
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 1
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 1
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 1
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 1
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 1
- PMXBARDFIAPBGK-DZKIICNBSA-N Val-Glu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PMXBARDFIAPBGK-DZKIICNBSA-N 0.000 description 1
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 1
- MDYSKHBSPXUOPV-JSGCOSHPSA-N Val-Gly-Phe Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MDYSKHBSPXUOPV-JSGCOSHPSA-N 0.000 description 1
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 1
- HQYVQDRYODWONX-DCAQKATOSA-N Val-His-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N HQYVQDRYODWONX-DCAQKATOSA-N 0.000 description 1
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 1
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 1
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 1
- MBGFDZDWMDLXHQ-GUBZILKMSA-N Val-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MBGFDZDWMDLXHQ-GUBZILKMSA-N 0.000 description 1
- OJOMXGVLFKYDKP-QXEWZRGKSA-N Val-Met-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OJOMXGVLFKYDKP-QXEWZRGKSA-N 0.000 description 1
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 description 1
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 1
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 1
- YTNGABPUXFEOGU-SRVKXCTJSA-N Val-Pro-Arg Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTNGABPUXFEOGU-SRVKXCTJSA-N 0.000 description 1
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 1
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 1
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 1
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 1
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 1
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- TTWYZDPBDWHJOR-IDIVVRGQSA-L adenosine triphosphate disodium Chemical compound [Na+].[Na+].C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP([O-])([O-])=O)[C@@H](O)[C@H]1O TTWYZDPBDWHJOR-IDIVVRGQSA-L 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- 108010078114 alanyl-tryptophyl-alanine Proteins 0.000 description 1
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- 125000002355 alkine group Chemical group 0.000 description 1
- 125000000304 alkynyl group Chemical group 0.000 description 1
- 235000011114 ammonium hydroxide Nutrition 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 238000010461 azide-alkyne cycloaddition reaction Methods 0.000 description 1
- 150000001540 azides Chemical class 0.000 description 1
- 125000001797 benzyl group Chemical group [H]C1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])* 0.000 description 1
- MSWZFWKMSRAUBD-UHFFFAOYSA-N beta-D-galactosamine Natural products NC1C(O)OC(CO)C(O)C1O MSWZFWKMSRAUBD-UHFFFAOYSA-N 0.000 description 1
- 229910052796 boron Inorganic materials 0.000 description 1
- 239000006227 byproduct Substances 0.000 description 1
- 108010089934 carbohydrase Proteins 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- IJOOHPMOJXWVHK-UHFFFAOYSA-N chlorotrimethylsilane Chemical compound C[Si](C)(C)Cl IJOOHPMOJXWVHK-UHFFFAOYSA-N 0.000 description 1
- 229940125904 compound 1 Drugs 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 229940127089 cytotoxic agent Drugs 0.000 description 1
- 239000002254 cytotoxic agent Substances 0.000 description 1
- 238000000354 decomposition reaction Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 108010060455 des-Tyr- beta-casomorphin Proteins 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 235000011180 diphosphates Nutrition 0.000 description 1
- YVBGRQLITPHVOP-UHFFFAOYSA-L disodium;[hydroxy-[hydroxy(oxido)phosphoryl]oxyphosphoryl] hydrogen phosphate Chemical compound [Na+].[Na+].OP([O-])(=O)OP(O)(=O)OP(O)([O-])=O YVBGRQLITPHVOP-UHFFFAOYSA-L 0.000 description 1
- 125000001495 ethyl group Chemical group [H]C([H])([H])C([H])([H])* 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000004108 freeze drying Methods 0.000 description 1
- 229960002442 glucosamine Drugs 0.000 description 1
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 1
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- 150000004676 glycans Chemical class 0.000 description 1
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 1
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010057753 glycyl-arginyl-glycyl-aspartyl-tyrosine Proteins 0.000 description 1
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 1
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 1
- 108010054666 glycyl-leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 108010025306 histidylleucine Proteins 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 1
- 108010003700 lysyl aspartic acid Proteins 0.000 description 1
- 108010089256 lysyl-aspartyl-glutamyl-leucine Proteins 0.000 description 1
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 108010022588 methionyl-lysyl-proline Proteins 0.000 description 1
- 108010085203 methionylmethionine Proteins 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000005311 nuclear magnetism Effects 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- 108010084572 phenylalanyl-valine Proteins 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- 239000011734 sodium Substances 0.000 description 1
- 159000000000 sodium salts Chemical class 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 239000012192 staining solution Substances 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- CIHOLLKRGTVIJN-UHFFFAOYSA-N tert‐butyl hydroperoxide Chemical compound CC(C)(C)OO CIHOLLKRGTVIJN-UHFFFAOYSA-N 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 1
- CSRZQMIRAZTJOY-UHFFFAOYSA-N trimethylsilyl iodide Substances C[Si](C)(C)I CSRZQMIRAZTJOY-UHFFFAOYSA-N 0.000 description 1
- 150000004043 trisaccharides Chemical class 0.000 description 1
- 108010037335 tyrosyl-prolyl-glycyl-glycine Proteins 0.000 description 1
Landscapes
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
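The present invention provides a method for synthesizing uridine diphosphate-6-azido-D-galactose. The method comprises: converting 6-azido-galactose into 6-azido-galactose-1-phosphate under the action of a galactokinase; and converting 6-azido-galactose-1-phosphate into uridine diphosphate-6-azido-D-galactose under the action of a sugar pyrophosphorylase. Using a galactokinase from Bifidobacterium (BiGalK) and a sugar pyrophosphorylase from Arabidopsis thaliana (AtUSP), the invention establishes a method for synthesizing uridine diphosphate-6-azido-D-galactose that is simpler and easier to operate than the reported chemical methods, is more efficient than the reported chemoenzymatic method, and can be used for large-scale synthesis of uridine diphosphate-6-azido-D-galactose.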
Description
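Technical Field
The present invention belongs to the technical field of the synthesis of uridine diphosphate-6-azido-D-galactose and, in particular, relates to a chemoenzymatic method for synthesizing uridine diphosphate-6-azido-D-galactose.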
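Background Art
Unnatural sugar donors are often used for the chemoenzymatic labeling of protein glycosylation. A glycosyltransferase transfers a nucleotide sugar bearing a bioorthogonal group (the sugar donor) to a specific glycosylation site; the transferred group then reacts with a complementary bioorthogonal group carrying a fluorophore or biotin, so that the glycosylation site can be detected and located. Azide and alkyne groups are stable, small (with very little effect on the native glycan structure), and do not take part in biological reactions, so the azide-alkyne cycloaddition has become one of the most common reactions for the chemoenzymatic labeling of glycosylation.
Uridine diphosphate-6-azido-D-galactose (UDP-6-azido-D-galactose, UDP-6-N3-D-galactose) is a reported unnatural sugar donor. The literature shows that bovine galactosyltransferase (GalT) can attach it to 4-methylumbelliferyl N-acetylglucosaminide (MU-GlcNAc). In addition, uridine diphosphate-6-azido-D-galactose can be transferred by the β1,4-galactosyltransferase from Neisseria meningitidis (NmLgtB) onto the terminal N-acetylglucosamine residues of N-glycans on the surface of Chinese hamster ovary Lec8 cells and then coupled to a biotin-bearing alkyne, thereby labeling N-acetylglucosamine glycosylation. This indicates that uridine diphosphate-6-azido-D-galactose has some application prospects in the labeling of N-acetylglucosamine glycosylation.
An antibody-drug conjugate (ADC) joins an antibody to a small-molecule cytotoxic drug through a specific linker; its main components are the antibody, the linker, and the small-molecule drug. The antibody provides targeted delivery, and the small-molecule drug exerts the effect. N-glycosylation of an antibody occurs at residue Asn297 of the Fc fragment, far from the site where the antibody binds its target, and so does not affect target recognition. The glycosylation site of an antibody is therefore an ideal position for coupling the antibody to a drug. Unnatural sugars containing bioorthogonal groups have the potential to serve as linkers for ADC drugs.
In addition, uridine diphosphate-6-azido-D-galactose is an unnatural sugar donor containing a bioorthogonal group and can be used to build libraries of unnatural sugar donors. Uridine diphosphate-6-azido-D-galactose has two configurations, α and β; the structural formula of the α configuration is as follows: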
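At present, uridine diphosphate-6-azido-D-galactose is synthesized either chemically or chemoenzymatically. Both routes prepare the precursor of UDP-6-azido-D-galactose, 6-azido-galactose (compound 6), by chemical synthesis, as shown in Scheme 1 below (Srivastava, G.; Kaur, K. J.; Hindsgaul, O.; Palcic, M. M., Enzymatic transfer of a preassembled trisaccharide antigen to cell surfaces using a fucosyltransferase. J Biol Chem 1992, 267(31), 22356-61.).
Scheme 1. Chemical synthesis of 6-azido-galactose (6-N3-galactose): (a) conc. H2SO4, CuSO4, acetone, 76%; (b) (CF3SO2)2O, pyridine, DCM; (c) NaN3, DMF; (d) 80% TFA.
Two chemical methods have been reported; their syntheses are shown in Scheme 2 below (Bosco, M.; Gall, S. L.; Rihouey, C.; Couve-Bonnaire, S.; Bardor, M.; Lerouge, P.; Pannecoucke, X., 6-Azido d-galactose transfer to N-acetyl-d-glucosamine derivative using commercially available β-1,4-galactosyltransferase. Tetrahedron Letters 2008, 49(14), 2294-2297.). Neither chemical method gives a high yield, and neither is suitable for large-scale synthesis. The product of Method 1 is a mixture of the α and β stereoisomers, which is difficult to separate.
Method 1:
Method 2:
Scheme 2. Chemical synthesis of uridine diphosphate-6-azido-D-galactose: (e) TMSCl, pyridine, 0 °C; (f) TMSI, CH2Cl2, 0 °C; (g) (Bu4N)2-UDP; (h) (i) Bu4NF, H2O; (ii) alkaline phosphatase; (iii) HPLC C18, 2.5% (4 steps); (i) AA, boron trifluoride etherate, 0 °C; (j) CH3COONH4, DIEA, DMF, 85%; (k) (i) 2-chloro-4H-benzodioxaphosphorin-4-one (8), Et3N, dioxane, THF, 0 °C; (ii) H2O, 74% (2 steps); (l) t-BuOOH (2 equiv), I2 cat., THF, 74%; (m) 10, 1H-tetrazole, pyridine, 46%; (n) (i) Et3N, NH4HCO3, CH3OH; (ii) HPLC C18, 53%.
The other route is chemoenzymatic: 6-azido-galactose (compound 6), prepared chemically, serves as the substrate and is converted into uridine diphosphate-6-azido-D-galactose by enzyme catalysis. One report (Zou, Y.; Xue, M.; Wang, W.; Cai, L.; Chen, L.; Liu, J.; Wang, P. G.; Shen, J.; Chen, M., One-pot three-enzyme synthesis of UDP-Glc, UDP-Gal, and their derivatives. Carbohydr Res 2013, 373, 76-81.) synthesized uridine diphosphate-6-azido-D-galactose by a one-pot three-enzyme method using the galactokinase from Streptococcus pneumoniae (SpGalK), the uridine diphosphate-glucose pyrophosphorylase from Streptococcus pneumoniae (SpGalU), and an inorganic pyrophosphatase (PPA) (Scheme 3 below); the yield was low.
Method 3:
Scheme 3. Reported chemoenzymatic synthesis of uridine diphosphate-6-azido-D-galactose. SpGalK: galactokinase from Streptococcus pneumoniae; SpGalU: uridine diphosphate-glucose pyrophosphorylase from Streptococcus pneumoniae; PPA: inorganic pyrophosphatase.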
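Summary of the Invention
To address the shortcomings of the prior art, the present invention provides a more efficient synthetic method for preparing uridine diphosphate-6-azido-D-galactose in large quantities. The method is simpler and easier to operate than the chemical methods, is more efficient than the reported chemoenzymatic method, and can be used for large-scale synthesis of uridine diphosphate-6-azido-D-galactose.
Accordingly, in one aspect, the present invention provides a method for synthesizing uridine diphosphate-6-azido-D-galactose, comprising the following steps:
S1: under the action of a galactokinase from Bifidobacterium, 6-azido-galactose (compound 6) is converted into 6-azido-galactose-1-phosphate (compound 12), according to the following reaction scheme:
Preferably, the galactokinase is selected from: the galactokinase from Bifidobacterium longum (BiGalK), or a BiGalK-derived protein having BiGalK activity in which one or several amino acids in the amino acid sequence of BiGalK have been substituted, deleted, or added; the galactokinase from Bifidobacterium dentium; the galactokinase from Bifidobacterium catenulatum; the galactokinase from Bifidobacterium pullorum; the galactokinase from Bifidobacterium animalis; the galactokinase from Bifidobacterium bifidum; the galactokinase from Bifidobacterium asteroides; the galactokinase from Bifidobacterium breve; the galactokinase from Bifidobacterium pseudolongum; and the galactokinase from Bifidobacterium pseudocatenulatum;
S2: under the action of the sugar pyrophosphorylase from Arabidopsis thaliana (AtUSP), or an AtUSP-derived protein having AtUSP activity in which one or several amino acids in the amino acid sequence of AtUSP have been substituted, deleted, or added, 6-azido-galactose-1-phosphate (compound 12) is converted into uridine diphosphate-6-azido-D-galactose, according to the following reaction scheme: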
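Further, in step S1 of the method of the invention, BiGalK can convert 6-azido-galactose into 6-azido-galactose-1-phosphate (compound 12), and its amino acid sequence may be as shown in SEQ ID NO: 1. BiGalK may be replaced by homologous sequences from the genus Bifidobacterium. The amino acid sequences of the galactokinases from the other species may be as shown in: Bifidobacterium dentium, SEQ ID NO: 4; Bifidobacterium catenulatum, SEQ ID NO: 5; Bifidobacterium pullorum, SEQ ID NO: 6; Bifidobacterium animalis, SEQ ID NO: 7; Bifidobacterium bifidum, SEQ ID NO: 8; Bifidobacterium asteroides, SEQ ID NO: 9; Bifidobacterium breve, SEQ ID NO: 10; Bifidobacterium pseudolongum, SEQ ID NO: 11; and Bifidobacterium pseudocatenulatum, SEQ ID NO: 12.
Further, preferably, the nucleotide sequence encoding BiGalK is as shown in SEQ ID NO: 13.
Further, in step S1 of the method of the invention, the reaction is carried out in the presence of adenosine triphosphate (ATP) and the metal ion Mg2+.
Further, in step S1 of the method of the invention, the pH of the reaction system is 6 to 9, for example 6.5, 7.0, 7.2, 7.5, 7.8, 8.0, or 8.5, and preferably about 7.5.
Further, in step S2 of the method of the invention, AtUSP can convert 6-azido-galactose-1-phosphate (compound 12) into uridine diphosphate-6-azido-D-galactose, and its amino acid sequence may be as shown in SEQ ID NO: 2.
Further, preferably, the nucleotide sequence encoding AtUSP is as shown in SEQ ID NO: 14.
Further, in step S2 of the method of the invention, the reaction is carried out in the presence of uridine triphosphate (UTP) and the metal ion Mg2+.
Further, in step S2 of the method of the invention, the pH of the reaction system is 6 to 9, for example 6.5, 7.0, 7.2, 7.5, 7.8, 8.0, or 8.5, and preferably about 7.5.
Further, in step S2 of the method of the invention, an inorganic pyrophosphatase (PPA), or a PPA-derived protein having PPA activity in which one or several amino acids in the amino acid sequence of PPA have been substituted, deleted, or added, may be used to catalyze the breakdown of the by-product pyrophosphate (PPi), so that accumulated PPi does not inhibit the enzymatic activity of AtUSP and the reaction is driven forward. The amino acid sequence of PPA is as shown in SEQ ID NO: 3.
Further, preferably, the inorganic pyrophosphatase (PPA) may be the inorganic pyrophosphatase (PPA) from Escherichia coli O157.
Further, preferably, the nucleotide sequence encoding PPA is as shown in SEQ ID NO: 15.
Further, in step S2 of the method of the invention, the product obtained, uridine diphosphate-6-azido-D-galactose (compound 1), has the α configuration.
Preferably, steps S1 and S2 of the method of the invention may be carried out separately or as a one-pot procedure in the same reaction system.
In some embodiments, the method of the invention is carried out in a single reaction system. In this case, 6-azido-galactose (compound 6), adenosine triphosphate disodium salt (ATP·2Na+), and uridine triphosphate disodium salt (UTP·2Na+) may be used as substrates and, in the presence of the metal ion Mg2+, uridine diphosphate-6-azido-D-galactose is synthesized catalytically under the action of the Bifidobacterium galactokinase described in step S1, the Arabidopsis thaliana sugar pyrophosphorylase (AtUSP) described in step S2 or an AtUSP-derived protein having AtUSP activity in which one or several amino acids in the amino acid sequence of AtUSP have been substituted, deleted, or added, and optionally an inorganic pyrophosphatase (PPA) or a PPA-derived protein having PPA activity in which one or several amino acids in the amino acid sequence of PPA have been substituted, deleted, or added. The reaction scheme is as follows: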
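The synthetic method of the invention may further include a separation and purification step. Specifically, the purification comprises: after the catalytic synthesis reaction above is complete, adding ethanol and separating solid from liquid (for example, by centrifugation at 10000 g for 10 minutes); removing the solvent from the liquid under reduced pressure (preferably below 35 °C); dissolving the residue in deionized water, separating on a P-2 gel column, and checking the separation by TLC; pooling the product-containing fractions, removing the solvent under reduced pressure, dissolving in deionized water, and separating on a QS-FF ion-exchange column to remove impurities (for example, adenosine monophosphate (AMP)); and pooling the product-containing fractions, removing the solvent under reduced pressure (preferably below 35 °C), dissolving in deionized water, and separating on a P-2 gel column to collect the product-containing fractions, which are preferably concentrated and lyophilized to give the target product.
The invention has been described in detail above; however, the above embodiments are merely illustrative in nature and are not intended to limit the invention. Moreover, this document is not bound by any theory described in the foregoing prior art, the summary of the invention, or the following examples.
Unless expressly stated otherwise, numerical ranges throughout this application include any subrange within them and any value incremented by the smallest subunit of the given values. Unless expressly stated otherwise, numerical values throughout this application represent approximate measures or limits of ranges that encompass minor deviations from the given values, embodiments having about the stated value, and embodiments having exactly the stated value. Other than in the working examples provided at the end of the detailed description, all numerical values of parameters (for example, quantities or conditions) in this application (including the appended claims) are to be understood as modified in all instances by the term "about", whether or not "about" actually appears before the value. "About" indicates that the stated value allows some slight imprecision (with some approach to exactness in the value; approximately or reasonably close to the value; nearly). If the imprecision provided by "about" is not otherwise understood in the art with this ordinary meaning, then "about" as used herein indicates at least the variations that may arise from ordinary methods of measuring and using such parameters. For example, "about" may include a variation of less than or equal to 10%, less than or equal to 5%, less than or equal to 4%, less than or equal to 3%, less than or equal to 2%, less than or equal to 1%, or less than or equal to 0.5%.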
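Brief Description of the Drawings
Figure 1 shows the 1H NMR spectrum of uridine diphosphate-6-azido-D-galactose.
Figure 2 shows the 13C NMR spectrum of uridine diphosphate-6-azido-D-galactose.
Detailed Description
The invention is described in detail below by way of examples. However, the examples provided here are for illustrative purposes only and are not intended to limit the invention.
Unless otherwise specified, the experimental methods used in the following examples are conventional methods.
Unless otherwise specified, the materials, reagents, and the like used in the following examples are commercially available.
Table 1. Main experimental instruments
Table 2. Main experimental reagents
Table 3. Formulation of the anisaldehyde staining solution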
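Preparation Examples
Preparation Example 1: Preparation of BiGalK
The gene sequence SEQ ID NO: 13 was optimized and synthesized on the basis of the protein sequence of BiGalK. The target gene and the vector pET-30a were double-digested with the restriction endonucleases NdeI and XhoI and then ligated. The ligation product was transformed into Escherichia coli DH5α competent cells, plated on kanamycin-resistance plates, and incubated overnight at 37 °C. A colony was picked from the plate into 5 mL of LB medium and cultured overnight on a shaker at 37 °C and 180 rpm, after which the plasmid was extracted. The plasmid was verified by restriction digestion and sequencing. The verified plasmid was transformed into BL21(DE3) competent cells, and a colony was picked from the plate into 50 mL of LB medium and cultured overnight on a shaker at 28 °C and 180 rpm. The 50 mL culture was transferred into 1.8 L of LB medium and cultured on a shaker at 37 °C and 200 rpm for 6 h; the temperature was then lowered to 16 °C, and isopropyl-β-D-thiogalactoside (IPTG, final concentration 0.5 mM) was added to induce expression overnight. The cells were collected by centrifugation and resuspended in 10 mM imidazole solution (10 mM imidazole, 50 mM Tris-HCl, 300 mM sodium chloride; pH 7.5). After cell disruption and centrifugation, the supernatant was loaded onto a nickel column; contaminating proteins were washed off with 10 mM imidazole solution, and the target protein was eluted with 300 mM imidazole solution (300 mM imidazole, 50 mM Tris-HCl, 300 mM sodium chloride; pH 7.5) to give an enzyme solution with a concentration of about 1 mg/mL.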
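The cloning design described above can be sanity-checked in a few lines. The following is a minimal sketch, not part of the original procedure: it takes the first 48 and last 12 nucleotides of SEQ ID NO: 13 as printed in this document, confirms that they carry the NdeI (CATATG) and XhoI (CTCGAG) recognition sequences, and translates the 5' fragment with the standard genetic code to compare it with the first 15 residues of SEQ ID NO: 1. The variable and function names are illustrative only.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

NDEI_SITE = "CATATG"  # NdeI recognition sequence
XHOI_SITE = "CTCGAG"  # XhoI recognition sequence


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring a trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Fragments copied from SEQ ID NO: 13 and SEQ ID NO: 1 in this document.
FIVE_PRIME = "CATATGACAGCTGTAGAATTTATAGAGCCCCTAACCCATGAGGAGGGT"
THREE_PRIME = "CTGGAGCTCGAG"
PROTEIN_START = "MTAVEFIEPLTHEEG"

# The start codon is the ATG embedded in the NdeI site.
orf_start = FIVE_PRIME.find("ATG")
has_ndei = FIVE_PRIME.startswith(NDEI_SITE)
has_xhoi = THREE_PRIME.endswith(XHOI_SITE)
translated = translate(FIVE_PRIME[orf_start:])

print("NdeI site at 5' end:", has_ndei)
print("XhoI site at 3' end:", has_xhoi)
print("Translated N-terminus:", translated)
print("Matches SEQ ID NO: 1:", translated == PROTEIN_START)
```

The same check can be applied to SEQ ID NO: 14, which also begins with CATATG and ends with CTCGAG in the listing.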
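Preparation Example 2: Preparation of AtUSP
The gene sequence SEQ ID NO: 14 was optimized and synthesized on the basis of the protein sequence of AtUSP. The target gene and the vector pET28a were double-digested with the restriction endonucleases NdeI and XhoI and then ligated. The ligation product was transformed into Escherichia coli DH5α competent cells, plated on kanamycin-resistance plates, and incubated overnight at 37 °C. A colony was picked from the plate into 5 mL of LB medium and cultured overnight on a shaker at 37 °C and 180 rpm, after which the plasmid was extracted. The plasmid was verified by restriction digestion and sequencing. The verified plasmid was transformed into BL21(DE3) competent cells, and a colony was picked from the plate into 50 mL of LB medium and cultured overnight on a shaker at 28 °C and 180 rpm. The 50 mL LB culture was transferred into 1.8 L of LB medium and cultured on a shaker at 37 °C and 200 rpm for 6 h; the temperature was then lowered to 16 °C, and IPTG was added to induce expression overnight. The cells were collected by centrifugation and resuspended in 10 mM imidazole solution. After cell disruption and centrifugation, the supernatant was loaded onto a nickel column; contaminating proteins were washed off with 10 mM imidazole solution, and the target protein was eluted with 300 mM imidazole solution to give an enzyme solution with a concentration of about 1 mg/mL.
Preparation Example 3: Preparation of PPA
Using E. coli O157 as the template, the target gene SEQ ID NO: 15 was obtained by PCR. The target gene and the vector pET28a were double-digested with the restriction endonucleases NdeI and HindIII and then ligated. The ligation product was transformed into Escherichia coli DH5α competent cells, plated on kanamycin-resistance plates, and incubated overnight at 37 °C. A colony was picked from the plate into 5 mL of LB medium and cultured overnight on a shaker at 37 °C and 180 rpm, after which the plasmid was extracted. The plasmid was verified by restriction digestion and sequencing. The verified plasmid was transformed into BL21(DE3) competent cells, plated on kanamycin-resistance plates, and incubated overnight at 37 °C. A colony was picked from the plate into 50 mL of LB medium and cultured overnight on a shaker at 28 °C and 180 rpm. The 50 mL culture was transferred into 1.8 L of LB medium and cultured on a shaker at 37 °C and 200 rpm for 6 h; the temperature was then lowered to 16 °C, and IPTG was added to induce expression overnight. The cells were collected by centrifugation and resuspended in 10 mM imidazole solution. After cell disruption and centrifugation, the supernatant was loaded onto a nickel column; contaminating proteins were washed off with 10 mM imidazole solution, and the target protein was eluted with 300 mM imidazole solution to give an enzyme solution with a concentration of about 1 mg/mL.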
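Examples
Example 1: Synthesis of uridine diphosphate-6-azido-D-galactose
According to the reaction system shown in Table 4 below, 6-azido-galactose (4.0 g, 12.2 mmol), ATP·2Na+ (7.0 g, 12.7 mmol), UTP·2Na+ (7.0 g, 13.3 mmol), 30 mL of 1 M Tris-HCl (pH 7.5), and 30 mL of 200 mM MgCl2 were added to a 2000 mL reagent bottle and dissolved in an appropriate amount of ddH2O. The pH of the solution was adjusted to about 7.5 with 1 M NaOH, and BiGalK (10 mL), AtUSP (50 mL), and PPA (100 μL) were added. Finally, ddH2O was added to a total volume of 600 mL, and the mixture was mixed well and allowed to react in a 37 °C water bath. The reaction was monitored by TLC. With ethyl acetate:methanol:water = 5:3:2 as the developing solvent and anisaldehyde stain for visualization, the Rf of uridine diphosphate-6-azido-D-galactose was 0.62. With isopropanol:aqueous ammonia:water = 7:3:2 as the developing solvent and anisaldehyde stain for visualization, the Rf of uridine diphosphate-6-azido-D-galactose was 0.40. After the reaction was complete, 600 mL of ethanol was added, and a white precipitate formed on shaking. The mixture was centrifuged (10000 g, 10 minutes), the supernatant was collected, and the solvent was removed under reduced pressure (below 35 °C) to give the crude target product.
Table 4. Enzymatic reaction system for the synthesis of UDP-6-N3-D-galactose (600 mL system as an example)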
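As a rough cross-check of the quantities given in Example 1, the final concentrations in the 600 mL system can be computed from the stated amounts. The sketch below uses only the figures quoted in the example text (12.2, 12.7, and 13.3 mmol; 30 mL of 1 M Tris-HCl; 30 mL of 200 mM MgCl2; 600 mL total); the helper names are illustrative, and the results are simple dilution arithmetic rather than values taken from Table 4.

```python
TOTAL_VOLUME_ML = 600.0

# Amounts quoted in Example 1 (mmol).
AMOUNTS_MMOL = {
    "6-azido-galactose": 12.2,
    "ATP disodium salt": 12.7,
    "UTP disodium salt": 13.3,
}

# Stock solutions quoted in Example 1: (stock concentration in mM, volume in mL).
STOCKS = {
    "Tris-HCl (pH 7.5)": (1000.0, 30.0),
    "MgCl2": (200.0, 30.0),
}


def final_mm_from_amount(mmol: float, total_ml: float) -> float:
    """Final concentration (mM) of a solute added as a weighed amount."""
    return mmol / total_ml * 1000.0


def final_mm_from_stock(stock_mm: float, stock_ml: float, total_ml: float) -> float:
    """Final concentration (mM) after diluting a stock into the total volume."""
    return stock_mm * stock_ml / total_ml


final_mm = {
    name: final_mm_from_amount(mmol, TOTAL_VOLUME_ML)
    for name, mmol in AMOUNTS_MMOL.items()
}
final_mm.update(
    {
        name: final_mm_from_stock(conc, vol, TOTAL_VOLUME_ML)
        for name, (conc, vol) in STOCKS.items()
    }
)

# Molar equivalents of each nucleotide relative to the sugar substrate.
substrate = AMOUNTS_MMOL["6-azido-galactose"]
equivalents = {
    name: mmol / substrate
    for name, mmol in AMOUNTS_MMOL.items()
    if name != "6-azido-galactose"
}

for name, conc in final_mm.items():
    print(f"{name}: {conc:.1f} mM")
for name, eq in equivalents.items():
    print(f"{name}: {eq:.2f} equiv")
```

On these figures the sugar substrate is at roughly 20 mM, with a small excess of each nucleotide triphosphate.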
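The crude target product was dissolved in 80 mL of deionized water and separated on a P-2 gel column, eluting the product with deionized water. The separation was checked by TLC. The product-containing fractions were pooled, the solvent was removed under reduced pressure (below 35 °C), and the residue was dissolved in 50 mL of deionized water and separated on a QS-FF ion-exchange column with a NaCl gradient (0-300 mM) to remove impurities such as adenosine monophosphate (AMP). The product-containing fractions were pooled, the solvent was removed under reduced pressure (below 35 °C), and the residue was dissolved in 70 mL of deionized water and desalted on a P-2 gel column. The product-containing fractions were collected, concentrated, and lyophilized to give the target product as a white solid (5.8 g, 75% yield). The yield was calculated as follows: 12.2 mmol of substrate (6-azido-galactose) was charged; complete conversion to the sodium salt of uridine diphosphate-6-azido-D-galactose (M = 635.28) would give 7.747 g; the example gave 5.8 g of white solid, so the yield is 5.8/7.747 = 75%. The structure was confirmed by NMR (see Figures 1 and 2).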
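The yield arithmetic above can be reproduced directly. This is a minimal sketch using only the numbers stated in the example (12.2 mmol of substrate, M = 635.28 for the sodium salt, 5.8 g isolated); the variable names are illustrative. Multiplying the stated values gives a theoretical mass of about 7.75 g, marginally different from the 7.747 g quoted, and the yield rounds to 75% either way.

```python
SUBSTRATE_MMOL = 12.2            # 6-azido-galactose charged
PRODUCT_MOLAR_MASS = 635.28      # g/mol, sodium salt, as quoted in the example
ISOLATED_MASS_G = 5.8            # white solid obtained

# Theoretical mass assuming complete 1:1 conversion of substrate to product.
theoretical_mass_g = SUBSTRATE_MMOL / 1000.0 * PRODUCT_MOLAR_MASS
yield_fraction = ISOLATED_MASS_G / theoretical_mass_g

print(f"Theoretical mass: {theoretical_mass_g:.3f} g")
print(f"Isolated yield:   {yield_fraction * 100:.1f} %")
```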
1H NMR (600 MHz, D2O) δ 7.95 (d, J = 8.2 Hz, 1H), 6.00-5.94 (m, 2H), 5.61 (dd, J = 7.2, 3.6 Hz, 1H), 4.39-4.32 (m, 2H), 4.30-4.16 (m, 4H), 3.98 (d, J = 2.4 Hz, 1H), 3.90 (dd, J = 10.3, 3.3 Hz, 1H), 3.78 (dt, J = 10.3, 3.2 Hz, 1H), 3.55 (dd, J = 12.7, 7.3 Hz, 1H), 3.45 (dd, J = 12.8, 6.0 Hz, 1H).
13C NMR (151 MHz, D2O) δ 166.31, 151.85, 141.60, 102.60, 95.71 (d, J = 6.6 Hz), 88.42, 83.16 (d, J = 9.2 Hz), 73.76, 70.11, 69.55, 69.14, 69.11, 68.23 (d, J = 8.3 Hz), 64.86 (d, J = 5.3 Hz), 50.24.
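A quick consistency check on the NMR data is to total the 1H integrations and count the 13C signals. The sketch below does this for the values listed above. The expected counts are an assumption made here, not a statement from the source: UDP-6-azido-D-galactose is taken to have 15 carbons and 15 non-exchangeable protons (2 on uracil, 6 on ribose, 7 on the galactose ring and C6), the OH and NH protons being exchanged in D2O.

```python
import re

H1_DATA = (
    "7.95 (d, J = 8.2 Hz, 1H), 6.00-5.94 (m, 2H), 5.61 (dd, J = 7.2, 3.6 Hz, 1H), "
    "4.39-4.32 (m, 2H), 4.30-4.16 (m, 4H), 3.98 (d, J = 2.4 Hz, 1H), "
    "3.90 (dd, J = 10.3, 3.3 Hz, 1H), 3.78 (dt, J = 10.3, 3.2 Hz, 1H), "
    "3.55 (dd, J = 12.7, 7.3 Hz, 1H), 3.45 (dd, J = 12.8, 6.0 Hz, 1H)"
)

C13_SHIFTS = [
    166.31, 151.85, 141.60, 102.60, 95.71, 88.42, 83.16, 73.76,
    70.11, 69.55, 69.14, 69.11, 68.23, 64.86, 50.24,
]

# Assumed counts for the target structure (see the note above).
EXPECTED_NONEXCHANGEABLE_H = 15
EXPECTED_CARBONS = 15

# Each signal ends with its integration, e.g. "1H)" or "4H)".
integrations = [int(n) for n in re.findall(r"(\d+)H\)", H1_DATA)]
total_protons = sum(integrations)
carbon_signals = len(C13_SHIFTS)

print("1H integrations:", integrations, "total", total_protons)
print("13C signals:", carbon_signals)
print("Proton count consistent:", total_protons == EXPECTED_NONEXCHANGEABLE_H)
print("Carbon count consistent:", carbon_signals == EXPECTED_CARBONS)
```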
Sequences:
SEQ ID NO:1
MTAVEFIEPLTHEEGVSQATKLFVDTYGAAPEGVWAAPGRVNLIGEHTD YNAGLCLPIALPHRTFIALKPREDTKVRVVSGVAPDKVAEADLDGLKAR GVDGWSAYPTGVAWALRQAGFDKVKGFDAAFVSCVPLGSGLSSSAAMT CSTALALDDVYGLGYGDSDAGRVTLINAAIKSENEMAGASTGGLDQNA SMRCTEGHALLLDCRPELTPLENVSQQEFDLDKYNLELLVVDTQAPHQL NDGQYAQRRATCEEAAKILGVANLRVTADGISKADDQFQALKETLDALPDETMKKRVRHVVTEIERVRSFVRAFAQGDIKAAGRLFNASHDSLAADYE VTVPELDIAVDVARKNGAYGARMTGGGFGGSIIALVDKGQGHEIAQKIA DRFEKEGFNAPRALPAFAAASASREA
SEQ ID NO:2
MASTVDSNFFSSVPALHSNLGLLSPDQIELAKILLENGQSHLFQQWPELG VDDKEKLAFFDQIARLNSSYPGGLAAYIKTAKELLADSKVGKNPYDGFS PSVPSGENLTFGTDNFIEMEKRGVVEARNAAFVLVAGGLGERLGYNGIK VALPRETTTGTCFLQHYIESILALQEASNKIDSDGSERDIPFIIMTSDDTHS RTLDLLELNSYFGMKPTQVHLLKQEKVACLDDNDARLALDPHNKYSIQ TKPHGHGDVHSLLYSSGLLHKWLEAGLKWVLFFQDTNGLLFNAIPASLG VSATKQYHVNSLAVPRKAKEAIGGISKLTHVDGRSMVINVEYNQLDPLL RASGFPDGDVNCETGFSPFPGNINQLILELGPYKDELQKTGGAIKEFVNP KYKDSTKTAFKSSTRLECMMQDYPKTLPPTARVGFTVMDIWLAYAPVK NNPEDAAKVPKGNPYHSATSGEMAIYRANSLILQKAGVKVEEPVKQVL NGQEVEVWSRITWKPKWGMIFSDIKKKVSGNCEVSQRSTMAIKGRNVFI KDLSLDGALIVDSIDDAEVKLGGLIKNNGWTMESVDYKDTSVPEEIRIRGFRFNKVEQLEKKLTQPGKFSVED
SEQ ID NO:3
MGLETVPAGKALPDDIYVVIEIPANSDPIKYEVDKESGALFVDRFMATAM FYPANYGYVNNTLSLDGDPVDVLVPTPYPLQPGSVIRCRPVGVLKMTDE AGSDAKVVAVPHSKLTKEYDHIKDVNDLPALLKAQIQHFFESYKALEAG KWVKVDGWEGVDAARQEILDSFERAKK
SEQ ID NO:4
MTDVEFIEPLSHDEGVKIAVDLFKAVYGEEPTGVWAAPGRVNLIGEHTD YNAGLCLPIALPHRTFIALKPREDTKVRVVSDVAPDAVAEADLDGLTAGG VEGWAAYPVGVAWALREAGFDTVKGFDAAFSSCVPLGSGLSSSAAMTC STALALDDVYGLGYGSTDAGRVTLINAAIKSENDMAGASTGGLDQNAS MRCSFGHAIRLDCKPGLSAVESVEPKEFDLDRYGLELLVLDTRAPHQLN DGQYAQRRSTCEQAAEILGVANLRVTAETVAASADPAAALADVLDRLED GTMKKRVRHVVTEIGRVDRFVDAFAAGDIKTAGDLFNASHDSLRDDYE VTVPELDTAVDVARANGAYGARMTGGGFGGSIIALVDKGQGHEIAQKIA DEFESKGFNAPRALPAFAAAAASREI
SEQ ID NO:5
MTAVEFIEPLSHDEGVKNATDLFRATYGEEPAGVWAAPGRVNLIGEHTD YNAGLCLPIALPHRTFIALKPREDTKVRVVSDVDSENVTEADLDGLQAG GVEGWAAYPVGVAWALREAGFDAVQGFDAAFSSCVPLGSGLSSSAAMT CSTALALDDVYGLGYGASDAGRVTLINAAIKSENDMAGASTGGLDQNA SMRCTFGHALRLDCRPELSPLENVSQQEFDLDKYGLELLVLDTQAPHQL NDGQYAQRRATCEKAAEILGVANLRVVADEIAKSEDPFQALKETLDKLEDDTMKKRVRHVITEIARVNSFVRAFANGKIDEAGRLFNASHDSLAADYE VTVPELDIAVDVARVNGAYGARMTGGGFGGSIIALVDKGQGHEIAQKIA DRFEKEGFNAPRALPAFAAASASREA
SEQ ID NO:6
MSSVEFIEPISREDGVARATELFRATYGEEPAGVWAAPGRVNLIGEHTDY NAGLCLPIALPHRTFLALKPREDTAVRLVSDVNPTAVAEAELDGLKARGV DGWAAYPTGVAWAMREAGYDQVRGFDAAFVSCVPLGSGLSSSAAMTC STALALDDVYGLGYGATDEGRVTLITMAIKSENDMAGASTGGLDQNAS MRCTPGHAIRLDCMPGLSAVDSVSQQEFDLDKYGLELLVVDTQAHHQL NDGQYEQRRRTCEQAAELLGYEHLRAAAEAVAYSTDSEGSLAALLNCLNDETMKKRVRHVITEIGRVDEFVKAFAAGDIAESGRLFNASHDSLRDDY EVTVPELDVAVDMARANGAYGARMTGGGFGGSIIALVDKGRGREVAQLI ADEFEVRGFHAPRALAAVASASASRED
SEQ ID NO:7
MTSVEFIEPMSDAEGAARAAELFKQAYGKEPAGVWAAPGRVNLIGEHT DYNAGLCLPIALPHRTYIALSPRDDTSVRVVSDLASDVIAEADLDGLEAG GVDGWAAYPVGVAWALRNAGFDGVQGFDAAFSSCVPLGSGLSSSAAMT CSTALALDDVYSQGFGDTDEGRVTLINAAIASENDMAGASTGGLDQNA SMRCTPDHAIRLDCRPGLSAVDSVQQEVFDLEGHGLELLVLDTRAPHQL NDGQYAQRRATCEEAARILGVANLREVADLVNAQADPAAALDGVLDRLDDETMRKRVRHVVTEIGRVDDFVRAFAEGDMQTAGELFNASHDSLRDD YEVTVPELDVAVDVARDEGALGARMTGGGFGGSIIALVNAGESQRIAQA ICDEFERRGFVLPRALPAQASASAHRVQ
SEQ ID NO:8
MTAVEFIEPLSREDGVSRATKLFVDMYATAPEGVWAAPGRVNLIGEHTD YNAGLCLPIALPHRTFIALKPREDTKVRVVSDVAPDKVAEADLDGLKAR GVDGWSAYPTGVAWALREAGFSQVKGFDAAFVSCVPLGSGLSSSAAMT CSTALALDDVYGLGYGSSDAGRVTLINAAIKSENDMAGASTGGLDQNA SMRCTEGHALLLDCRPELTPLENVSQQAFDLDKYGLELLVVDTQAPHQL NDGQYAQRRATCEEAARILGVANLRVAADGISKADDQFQALKETLDALPDVTMKKRVRHVVTEIERVRSFVRAFAQGDIEAAGRLFNASHDSLAADYE VTVPELDVAVDVARKNGAYGARMTGGGFGGSIIALVDKGRSQEVAQKIA DEFEARGFHAPRALPAVAAPSASREA
SEQ ID NO:9
MTRTVEFIQPWTQGADGQGATKARELFRKVYGGDPQGVWSAPGRVNLI GEHTDYNAGLCLPIALPTRTYVAASPRTDSRVRLVSTMDPENPVQADLD GLQARGVSGWAAYPVGVAWALRRDGFPQVRGFDLALASCVPVGSGLSS SAAMTCAMALALDDLFGLGLGGDEGGRVRLIQAAITAENDMAGASTGG MDQSAAMRCRSGCALRLDCRPELDAMSNVRQVPFDLRAAGLELLVVD TRAQHQLNDGQYDQRRATCEQAVHLLGVANLRQAADQVNGAADPPSA LAALLEQLPDETMRRRVRHVISEIGRVDRFIEAFGRGDYVLAGRLINASH DSLRDDYEVTCPELDEAVDAARQGGAYGARMTGGGFGGSIIALADAGK GSGLARDIAERFASKGFKAPRALIALPSSAATRES
SEQ ID NO:10
MSAVEFIEPLTHEEGVSQATKLFVDTYGAAPEGVWAAPGRVNLIGEHTD YNAGLCLPIALPHRTFIALKPREDTKVRVVSGVAPDKVAEADLDGLKAR GVDGWSAYPTGVAWALRQAGFDKVKGFDAAFVSCVPLGSGLSSSAAMT CSTALALDDVYGLGYGDSDAGRVTLINAAIKSENEMAGASTGGLDQNA SMRCTEGHALLLDCRPELTPLENVSQQEFGLDKYNLELLVVDTQAPHQL NDGQYAQRRATCEEAAKILGVANLRVTADGISKADDQFQALKETLDALPDETMKKRVRHVVTEIERVRSFVRAFAQGDIKAAGRLFNASHDSLAADYE VTVPELDIAVDVARKNGAYGARMTGGGFGGSIIALVDKGRSQEVAQKIA DEFEKQGFHAPRALAAYAAPSASREA
SEQ ID NO:11
MTTAVEFIEPMGDADGAARAAALFEARFGTAPAGVWAAPGRVNLIGEHT DYNGGLCLPIALPHRTYVALAPRDDTTVRVISDMTPDEMTMVDLDGLA AGGVDGWGAYPIGVAWALREAGFDQVRGFDAVFSSCVPLGSGLSSSAA MTCSTALALDDVYGLGFGGSDEGRITLIDAAVMAENEMAGASTGGLDQ NASMRCAADHAIRLDCMPGLTAAQSVRQEPFDLSAYGLELLVLDTQA PHQLNDGQYEARRTMCEEAAQILGVPNLRVVADQVNAAVDPAAALEDV LSQLDDETMRRRVRHVITEIGRVDDFIGAFGRGDIETAGALFNASHDSLR DDYEVTVPELDVAVDVARDEGAYGARMTGGGFGGSIIALVNAGESRRIA QAIADEFARRGFDAPRALPARASQSAHRVND
SEQ ID NO:12
MTAVEFIEPLSHDEGVKNATDLFRATYGEEPAGVWAAPGRVNLIGEHTD YNAGLCLPIALPHRTFIALKPREDTKVRVVSDVDSGNVTEADLDGLQAG GVEGWAAYPVGVAWALREAGFNAVQGFDAAFSSCVPLGSGLSSSAAMT CSTALALDDVYGLGYGASDAGRVTLINAAIKSENDMAGASTGGLDQNA SMRCTFGHALRLDCRPELSPLENVSQQEFDLDKYGLELLVLDTQAPHQL NDGQYAQRRATCEKAAEILGVANLRVVADSIAKSGDPFQALKETLDKLEDDTMKKRVRHVITEIARVNSFVRAFANGKIDEAGRLFNASHDSLAADYE VTVPELDIAVDVARANGAYGARMTGGGFGGSIIALVNKGQGHEIAQKIA DRFEKEGFNAPRALPAFAAASASREA
SEQ ID NO:13
CATATGACAGCTGTAGAATTTATAGAGCCCCTAACCCATGAGGAGGGT GTCTCCCAGGCAACCAAGCTGTTTGTCGACACCTATGGTGCTGCTCCG GAGGGCGTGTGGGCTGCGCCGGGTCGTGTAAATCTGATTGGTGAACATACCGATTATAACGCTGGCCTTTGCCTGCCCATCGCGTTGCCGCACAG AACCTTTATTGCGCTTAAGCCGCGCGAAGATACCAAAGTCCGCGTGGT TTCCGGTGTTGCTCCGGATAAGGTGGCTGAGGCTGATCTGGACGGCCT GAAGGCCCGCGGGGTGGACGGTTGGTCTGCGTACCCGACCGGTGTGG CGTGGGCACTGCGTCAGGCCGGCTTCGATAAGGTGAAAGGTTTCGAC GCGGCCTTCGTGAGCTGTGTTCCGTTGGGCAGCGGTCTTTCTTCCTCA GCCGCAATGACGTGCAGCACCGCTTTAGCGCTCGACGATGTTTACGGC CTGGGTTATGGCGATAGCGATGCGGGCCGCGTGACGCTGATTAACGCG GCGATTAAAAGCGAAAATGAAATGGCAGGTGCGTCGACCGGTGGTTT AGACCAAAACGCAAGCATGCGTTGCACCGAGGGCCACGCACTGCTGT TGGACTGCCGTCCGGAGCTGACCCCGCTGGAGAACGTGTCTCAGCAAGAGTTCGACCTGGACAAGTACAACCTGGAACTGCTGGTTGTCGATAC CCAGGCGCCACACCAGCTGAATGATGGCCAATATGCACAACGTCGTG CGACTTGTGAAGAGGCTGCCAAGATCCTGGGCGTGGCGAATTTGCGC GTCACGGCGGATGGCATCAGCAAAGCGGACGACCAGTTTCAGGCGTT GAAGGAAACTCTGGACGCCTTGCCAGATGAGACAATGAAAAAACGT GTTCGTCACGTGGTAACCGAAATCGAACGTGTTAGAAGCTTTGTTCGC GCGTTTGCACAAGGTGATATCAAGGCGGCTGGCCGTCTGTTCAACGC GAGCCATGATTCGCTGGCTGCCGACTACGAAGTTACGGTTCCGGAGCT CGACATCGCGGTTGACGTTGCGCGTAAAAACGGTGCGTACGGCGCGC GCATGACCGGTGGTGGTTTCGGCGGCTCCATTATCGCGCTTGTGGATA AGGGTCAGGGTCACGAGATCGCCCAAAAAATTGCGGATCGTTTTGAA AAAGAGGGGTTCAACGCTCCGCGTGCGCTTCCGGCATTCGCTGCGGC ATCTGCCAGCCGTGAAGCCAAATTGGCCGCCGCGCTGGAGCTCGAG
SEQ ID NO:14
CATATGGCTAGCACCGTTGATAGCAACTTCTTCTCTAGCGTGCCGGCA CTGCATAGCAACCTGGGTCTGCTGTCCCCGGATCAGATTGAACTGGCA AAAATCCTGCTGGAAAACGGCCAGTCCCACCTGTTCCAGCAGTGGCCGGAACTGGGCGTTGACGATAAAGAAAAACTGGCCTTCTTCGATCAGA TTGCTCGTCTGAACTCTTCCTATCCAGGCGGCCTGGCTGCGTACATCA AAACCGCGAAAGAGCTGCTGGCGGATAGCAAAGTTGGTAAAAACCC GTATGATGGTTTTTCTCCGTCTGTTCCGAGCGGCGAAAACCTGACTTT CGGCACCGATAATTTCATTGAAATGGAAAAACGTGGTGTTGTGGAAG CCCGTAACGCAGCGTTTGTGCTGGTTGCAGGTGGCCTGGGCGAACGT CTGGGTTACAACGGTATCAAAGTTGCGCTGCCGCGTGAAACCACCAC CGGCACCTGTTTCCTGCAGCACTATATCGAATCTATCCTGGCTCTGCAG GAAGCGTCTAACAAAATCGATAGCGATGGCTCTGAACGTGACATTCCG TTCATCATCATGACCTCCGATGATACTCACTCCCGTACCCTGGACCTGC TGGAGCTGAACAGCTACTTTGGCATGAAACCGACCCAGGTGCACCTCCTGAAACAGGAAAAAGTTGCTTGCCTGGATGATAACGATGCCCGTCT GGCGCTGGATCCGCACAACAAATATAGCATTCAGACCAAACCACACG GTCACGGTGATGTGCATAGCCTGCTGTACTCTTCTGGTCTGCTGCACA AATGGCTGGAAGCTGGTCTGAAATGGGTGCTGTTCTTCCAGGATACCA ACGGCCTGCTGTTTAACGCTATTCCGGCCTCTCTGGGCGTGAGCGCGA CTAAACAGTACCACGTTAACTCCCTGGCTGTTCCACGTAAAGCTAAAG AAGCGATCGGTGGTATCAGCAAACTGACCCACGTTGATGGTCGTTCTA TGGTGATTAACGTGGAATATAACCAACTCGACCCGCTGCTGCGCGCTT CCGGCTTCCCGGACGGCGACGTGAACTGTGAAACCGGTTTTAGCCCG TTTCCGGGTAACATCAACCAGCTGATCCTGGAACTTGGCCCGTATAAA GACGAACTGCAGAAAACCGGCGGTGCGATTAAAGAATTCGTTAACCC GAAATATAAAGACAGCACTAAAACCGCGTTCAAATCCAGCACCCGCC TGGAATGCATGATGCAGGACTACCCGAAAACTCTGCCGCCGACCGCG CGCGTTGGCTTCACCGTAATGGATATCTGGCTGGCTTACGCGCCGGTT AAAAACAACCCGGAAGATGCTGCTAAAGTTCCGAAAGGTAACCCGTA CCACAGCGCAACCTCTGGTGAAATGGCGATCTATCGTGCGAACTCTCT GATTCTGCAGAAAGCAGGCGTTAAAGTTGAAGAACCGGTTAAACAGG TGCTGAACGGCCAAGAAGTTGAAGTTTGGAGCCGTATCACCTGGAAA CCGAAATGGGGTATGATCTTTTCTGACATTAAAAAGAAAGTGTCTGGT AACTGTGAAGTTTCCCAGCGTTCCACTATGGCGATCAAAGGTCGCAAT GTGTTTATCAAAGATCTGAGCCTGGACGGTGCTCTGATCGTTGATAGC ATCGATGACGCGGAAGTTAAACTGGGCGGTCTGATTAAAAACAACGG CTGGACCATGGAATCTGTAGATTACAAAGATACCTCTGTTCCGGAAGA AATCCGTATCCGTGGCTTCCGTTTCAACAAAGTTGAACAGCTGGAAA AGAAACTGACCCAGCCGGGTAAATTCTCTGTTGAAGATTAACTCGAG
SEQ ID NO:15
ATGAGCTTACTCAACGTCCCTGCGGGTAAAGATCTGCCGGAAGACATC TACGTTGTTATTGAGATCCCGGCTAACGCAGATCCGATCAAATACGAA ATCGACAAAGAGAGCGGCGCACTGTTCGTTGACCGCTTCATGTCCACCGCGATGTTCTATCCGTGCAACTACGGTTACATCAACCACACCCTGTC TCTGGACGGTGACCCAGTTGACGTACTGGTCCCAACTCCGTACCCGCT GCAGCCGGGTTCTGTGATCCGTTGCCGTCCGGTTGGCGTTCTGAAAAT GACCGACGAAGCCGGTGAAGATGCGAAACTGGTTGCGGTTCCGCAC AGCAAGCTGAGCAAAGAATACGATCACATTAAAGACGTTAACGATCT GCCTGAACTGCTGAAAGCGCAAATCGCTCACTTCTTCGAGCACTACA AAGACCTCGAAAAAGGCAAGTGGGTGAAAGTTGAAGGTTGGGAAAA CGCAGAAGCCGCTAAAGCTGAAATCGTTGCCTCCTTCGAGCGCGCAA AGAATAAATAA
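The statement that BiGalK may be replaced by homologous Bifidobacterium sequences can be illustrated with a position-by-position comparison. The sketch below is an illustration only: it compares the first 49 residues of SEQ ID NO: 1 (B. longum) and SEQ ID NO: 10 (B. breve) as printed above, without gaps, and reports percent identity. A full-length comparison would normally use a proper alignment tool; the function name is hypothetical.

```python
# First 49 residues of each sequence, copied from the listing above.
SEQ1_NTERM = "MTAVEFIEPLTHEEGVSQATKLFVDTYGAAPEGVWAAPGRVNLIGEHTD"   # SEQ ID NO: 1
SEQ10_NTERM = "MSAVEFIEPLTHEEGVSQATKLFVDTYGAAPEGVWAAPGRVNLIGEHTD"  # SEQ ID NO: 10


def ungapped_identity(a: str, b: str):
    """Return (percent identity, mismatches) for two equal-length sequences.

    Mismatches are reported as (1-based position, residue in a, residue in b).
    """
    if len(a) != len(b):
        raise ValueError("sequences must have the same length for an ungapped comparison")
    mismatches = [
        (pos, x, y) for pos, (x, y) in enumerate(zip(a, b), start=1) if x != y
    ]
    identity = 100.0 * (len(a) - len(mismatches)) / len(a)
    return identity, mismatches


identity, mismatches = ungapped_identity(SEQ1_NTERM, SEQ10_NTERM)
print(f"Identity over {len(SEQ1_NTERM)} residues: {identity:.1f} %")
print("Mismatches:", mismatches)
```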
SEQUENCE LISTING
<110> Shanghai Institute of Materia Medica, Chinese Academy of Sciences
<120> Method for synthesizing uridine diphosphate-6-azido-D-galactose
<130> DI21-1174-XC37
<160> 15
<170> PatentIn version 3.5
<210> 1
<211> 416
<212> PRT
<213> Bifidobacterium longum
<400> 1
Met Thr Ala Val Glu Phe Ile Glu Pro Leu Thr His Glu Glu Gly Val
1 5 10 15
Ser Gln Ala Thr Lys Leu Phe Val Asp Thr Tyr Gly Ala Ala Pro Glu
20 25 30
Gly Val Trp Ala Ala Pro Gly Arg Val Asn Leu Ile Gly Glu His Thr
35 40 45
Asp Tyr Asn Ala Gly Leu Cys Leu Pro Ile Ala Leu Pro His Arg Thr
50 55 60
Phe Ile Ala Leu Lys Pro Arg Glu Asp Thr Lys Val Arg Val Val Ser
65 70 75 80
Gly Val Ala Pro Asp Lys Val Ala Glu Ala Asp Leu Asp Gly Leu Lys
85 90 95
Ala Arg Gly Val Asp Gly Trp Ser Ala Tyr Pro Thr Gly Val Ala Trp
100 105 110
Ala Leu Arg Gln Ala Gly Phe Asp Lys Val Lys Gly Phe Asp Ala Ala
115 120 125
Phe Val Ser Cys Val Pro Leu Gly Ser Gly Leu Ser Ser Ser Ala Ala
130 135 140
Met Thr Cys Ser Thr Ala Leu Ala Leu Asp Asp Val Tyr Gly Leu Gly
145 150 155 160
Tyr Gly Asp Ser Asp Ala Gly Arg Val Thr Leu Ile Asn Ala Ala Ile
165 170 175
Lys Ser Glu Asn Glu Met Ala Gly Ala Ser Thr Gly Gly Leu Asp Gln
180 185 190
Asn Ala Ser Met Arg Cys Thr Glu Gly His Ala Leu Leu Leu Asp Cys
195 200 205
Arg Pro Glu Leu Thr Pro Leu Glu Asn Val Ser Gln Gln Glu Phe Asp
210 215 220
Leu Asp Lys Tyr Asn Leu Glu Leu Leu Val Val Asp Thr Gln Ala Pro
225 230 235 240
His Gln Leu Asn Asp Gly Gln Tyr Ala Gln Arg Arg Ala Thr Cys Glu
245 250 255
Glu Ala Ala Lys Ile Leu Gly Val Ala Asn Leu Arg Val Thr Ala Asp
260 265 270
Gly Ile Ser Lys Ala Asp Asp Gln Phe Gln Ala Leu Lys Glu Thr Leu
275 280 285
Asp Ala Leu Pro Asp Glu Thr Met Lys Lys Arg Val Arg His Val Val
290 295 300
Thr Glu Ile Glu Arg Val Arg Ser Phe Val Arg Ala Phe Ala Gln Gly
305 310 315 320
Asp Ile Lys Ala Ala Gly Arg Leu Phe Asn Ala Ser His Asp Ser Leu
325 330 335
Ala Ala Asp Tyr Glu Val Thr Val Pro Glu Leu Asp Ile Ala Val Asp
340 345 350
Val Ala Arg Lys Asn Gly Ala Tyr Gly Ala Arg Met Thr Gly Gly Gly
355 360 365
Phe Gly Gly Ser Ile Ile Ala Leu Val Asp Lys Gly Gln Gly His Glu
370 375 380
Ile Ala Gln Lys Ile Ala Asp Arg Phe Glu Lys Glu Gly Phe Asn Ala
385 390 395 400
Pro Arg Ala Leu Pro Ala Phe Ala Ala Ala Ser Ala Ser Arg Glu Ala
405 410 415
<210> 2
<211> 614
<212> PRT
<213> Arabidopsis thaliana
<400> 2
Met Ala Ser Thr Val Asp Ser Asn Phe Phe Ser Ser Val Pro Ala Leu
1 5 10 15
His Ser Asn Leu Gly Leu Leu Ser Pro Asp Gln Ile Glu Leu Ala Lys
20 25 30
Ile Leu Leu Glu Asn Gly Gln Ser His Leu Phe Gln Gln Trp Pro Glu
35 40 45
Leu Gly Val Asp Asp Lys Glu Lys Leu Ala Phe Phe Asp Gln Ile Ala
50 55 60
Arg Leu Asn Ser Ser Tyr Pro Gly Gly Leu Ala Ala Tyr Ile Lys Thr
65 70 75 80
Ala Lys Glu Leu Leu Ala Asp Ser Lys Val Gly Lys Asn Pro Tyr Asp
85 90 95
Gly Phe Ser Pro Ser Val Pro Ser Gly Glu Asn Leu Thr Phe Gly Thr
100 105 110
Asp Asn Phe Ile Glu Met Glu Lys Arg Gly Val Val Glu Ala Arg Asn
115 120 125
Ala Ala Phe Val Leu Val Ala Gly Gly Leu Gly Glu Arg Leu Gly Tyr
130 135 140
Asn Gly Ile Lys Val Ala Leu Pro Arg Glu Thr Thr Thr Gly Thr Cys
145 150 155 160
Phe Leu Gln His Tyr Ile Glu Ser Ile Leu Ala Leu Gln Glu Ala Ser
165 170 175
Asn Lys Ile Asp Ser Asp Gly Ser Glu Arg Asp Ile Pro Phe Ile Ile
180 185 190
Met Thr Ser Asp Asp Thr His Ser Arg Thr Leu Asp Leu Leu Glu Leu
195 200 205
Asn Ser Tyr Phe Gly Met Lys Pro Thr Gln Val His Leu Leu Lys Gln
210 215 220
Glu Lys Val Ala Cys Leu Asp Asp Asn Asp Ala Arg Leu Ala Leu Asp
225 230 235 240
Pro His Asn Lys Tyr Ser Ile Gln Thr Lys Pro His Gly His Gly Asp
245 250 255
Val His Ser Leu Leu Tyr Ser Ser Gly Leu Leu His Lys Trp Leu Glu
260 265 270
Ala Gly Leu Lys Trp Val Leu Phe Phe Gln Asp Thr Asn Gly Leu Leu
275 280 285
Phe Asn Ala Ile Pro Ala Ser Leu Gly Val Ser Ala Thr Lys Gln Tyr
290 295 300
His Val Asn Ser Leu Ala Val Pro Arg Lys Ala Lys Glu Ala Ile Gly
305 310 315 320
Gly Ile Ser Lys Leu Thr His Val Asp Gly Arg Ser Met Val Ile Asn
325 330 335
Val Glu Tyr Asn Gln Leu Asp Pro Leu Leu Arg Ala Ser Gly Phe Pro
340 345 350
Asp Gly Asp Val Asn Cys Glu Thr Gly Phe Ser Pro Phe Pro Gly Asn
355 360 365
Ile Asn Gln Leu Ile Leu Glu Leu Gly Pro Tyr Lys Asp Glu Leu Gln
370 375 380
Lys Thr Gly Gly Ala Ile Lys Glu Phe Val Asn Pro Lys Tyr Lys Asp
385 390 395 400
Ser Thr Lys Thr Ala Phe Lys Ser Ser Thr Arg Leu Glu Cys Met Met
405 410 415
Gln Asp Tyr Pro Lys Thr Leu Pro Pro Thr Ala Arg Val Gly Phe Thr
420 425 430
Val Met Asp Ile Trp Leu Ala Tyr Ala Pro Val Lys Asn Asn Pro Glu
435 440 445
Asp Ala Ala Lys Val Pro Lys Gly Asn Pro Tyr His Ser Ala Thr Ser
450 455 460
Gly Glu Met Ala Ile Tyr Arg Ala Asn Ser Leu Ile Leu Gln Lys Ala
465 470 475 480
Gly Val Lys Val Glu Glu Pro Val Lys Gln Val Leu Asn Gly Gln Glu
485 490 495
Val Glu Val Trp Ser Arg Ile Thr Trp Lys Pro Lys Trp Gly Met Ile
500 505 510
Phe Ser Asp Ile Lys Lys Lys Val Ser Gly Asn Cys Glu Val Ser Gln
515 520 525
Arg Ser Thr Met Ala Ile Lys Gly Arg Asn Val Phe Ile Lys Asp Leu
530 535 540
Ser Leu Asp Gly Ala Leu Ile Val Asp Ser Ile Asp Asp Ala Glu Val
545 550 555 560
Lys Leu Gly Gly Leu Ile Lys Asn Asn Gly Trp Thr Met Glu Ser Val
565 570 575
Asp Tyr Lys Asp Thr Ser Val Pro Glu Glu Ile Arg Ile Arg Gly Phe
580 585 590
Arg Phe Asn Lys Val Glu Gln Leu Glu Lys Lys Leu Thr Gln Pro Gly
595 600 605
Lys Phe Ser Val Glu Asp
610
<210> 3
<211> 175
<212> PRT
<213> Escherichia coli
<400> 3
Met Gly Leu Glu Thr Val Pro Ala Gly Lys Ala Leu Pro Asp Asp Ile
1 5 10 15
Tyr Val Val Ile Glu Ile Pro Ala Asn Ser Asp Pro Ile Lys Tyr Glu
20 25 30
Val Asp Lys Glu Ser Gly Ala Leu Phe Val Asp Arg Phe Met Ala Thr
35 40 45
Ala Met Phe Tyr Pro Ala Asn Tyr Gly Tyr Val Asn Asn Thr Leu Ser
50 55 60
Leu Asp Gly Asp Pro Val Asp Val Leu Val Pro Thr Pro Tyr Pro Leu
65 70 75 80
Gln Pro Gly Ser Val Ile Arg Cys Arg Pro Val Gly Val Leu Lys Met
85 90 95
Thr Asp Glu Ala Gly Ser Asp Ala Lys Val Val Ala Val Pro His Ser
100 105 110
Lys Leu Thr Lys Glu Tyr Asp His Ile Lys Asp Val Asn Asp Leu Pro
115 120 125
Ala Leu Leu Lys Ala Gln Ile Gln His Phe Phe Glu Ser Tyr Lys Ala
130 135 140
Leu Glu Ala Gly Lys Trp Val Lys Val Asp Gly Trp Glu Gly Val Asp
145 150 155 160
Ala Ala Arg Gln Glu Ile Leu Asp Ser Phe Glu Arg Ala Lys Lys
165 170 175
<210> 4
<211> 416
<212> PRT
<213> Bifidobacterium dentium
<400> 4
Met Thr Asp Val Glu Phe Ile Glu Pro Leu Ser His Asp Glu Gly Val
1 5 10 15
Lys Ile Ala Val Asp Leu Phe Lys Ala Val Tyr Gly Glu Glu Pro Thr
20 25 30
Gly Val Trp Ala Ala Pro Gly Arg Val Asn Leu Ile Gly Glu His Thr
35 40 45
Asp Tyr Asn Ala Gly Leu Cys Leu Pro Ile Ala Leu Pro His Arg Thr
50 55 60
Phe Ile Ala Leu Lys Pro Arg Glu Asp Thr Lys Val Arg Val Val Ser
65 70 75 80
Asp Val Ala Pro Asp Ala Val Ala Glu Ala Asp Leu Asp Gly Leu Thr
85 90 95
Ala Gly Gly Val Glu Gly Trp Ala Ala Tyr Pro Val Gly Val Ala Trp
100 105 110
Ala Leu Arg Glu Ala Gly Phe Asp Thr Val Lys Gly Phe Asp Ala Ala
115 120 125
Phe Ser Ser Cys Val Pro Leu Gly Ser Gly Leu Ser Ser Ser Ala Ala
130 135 140
Met Thr Cys Ser Thr Ala Leu Ala Leu Asp Asp Val Tyr Gly Leu Gly
145 150 155 160
Tyr Gly Ser Thr Asp Ala Gly Arg Val Thr Leu Ile Asn Ala Ala Ile
165 170 175
Lys Ser Glu Asn Asp Met Ala Gly Ala Ser Thr Gly Gly Leu Asp Gln
180 185 190
Asn Ala Ser Met Arg Cys Ser Phe Gly His Ala Ile Arg Leu Asp Cys
195 200 205
Lys Pro Gly Leu Ser Ala Val Glu Ser Val Glu Pro Lys Glu Phe Asp
210 215 220
Leu Asp Arg Tyr Gly Leu Glu Leu Leu Val Leu Asp Thr Arg Ala Pro
225 230 235 240
His Gln Leu Asn Asp Gly Gln Tyr Ala Gln Arg Arg Ser Thr Cys Glu
245 250 255
Gln Ala Ala Glu Ile Leu Gly Val Ala Asn Leu Arg Val Thr Ala Glu
260 265 270
Thr Val Ala Ala Ser Ala Asp Pro Ala Ala Ala Leu Ala Asp Val Leu
275 280 285
Asp Arg Leu Glu Asp Gly Thr Met Lys Lys Arg Val Arg His Val Val
290 295 300
Thr Glu Ile Gly Arg Val Asp Arg Phe Val Asp Ala Phe Ala Ala Gly
305 310 315 320
Asp Ile Lys Thr Ala Gly Asp Leu Phe Asn Ala Ser His Asp Ser Leu
325 330 335
Arg Asp Asp Tyr Glu Val Thr Val Pro Glu Leu Asp Thr Ala Val Asp
340 345 350
Val Ala Arg Ala Asn Gly Ala Tyr Gly Ala Arg Met Thr Gly Gly Gly
355 360 365
Phe Gly Gly Ser Ile Ile Ala Leu Val Asp Lys Gly Gln Gly His Glu
370 375 380
Ile Ala Gln Lys Ile Ala Asp Glu Phe Glu Ser Lys Gly Phe Asn Ala
385 390 395 400
Pro Arg Ala Leu Pro Ala Phe Ala Ala Ala Ala Ala Ser Arg Glu Ile
405 410 415
<210> 5
<211> 416
<212> PRT
<213> Bifidobacterium catenulatum
<400> 5
Met Thr Ala Val Glu Phe Ile Glu Pro Leu Ser His Asp Glu Gly Val
1 5 10 15
Lys Asn Ala Thr Asp Leu Phe Arg Ala Thr Tyr Gly Glu Glu Pro Ala
20 25 30
Gly Val Trp Ala Ala Pro Gly Arg Val Asn Leu Ile Gly Glu His Thr
35 40 45
Asp Tyr Asn Ala Gly Leu Cys Leu Pro Ile Ala Leu Pro His Arg Thr
50 55 60
Phe Ile Ala Leu Lys Pro Arg Glu Asp Thr Lys Val Arg Val Val Ser
65 70 75 80
Asp Val Asp Ser Glu Asn Val Thr Glu Ala Asp Leu Asp Gly Leu Gln
85 90 95
Ala Gly Gly Val Glu Gly Trp Ala Ala Tyr Pro Val Gly Val Ala Trp
100 105 110
Ala Leu Arg Glu Ala Gly Phe Asp Ala Val Gln Gly Phe Asp Ala Ala
115 120 125
Phe Ser Ser Cys Val Pro Leu Gly Ser Gly Leu Ser Ser Ser Ala Ala
130 135 140
Met Thr Cys Ser Thr Ala Leu Ala Leu Asp Asp Val Tyr Gly Leu Gly
145 150 155 160
Tyr Gly Ala Ser Asp Ala Gly Arg Val Thr Leu Ile Asn Ala Ala Ile
165 170 175
Lys Ser Glu Asn Asp Met Ala Gly Ala Ser Thr Gly Gly Leu Asp Gln
180 185 190
Asn Ala Ser Met Arg Cys Thr Phe Gly His Ala Leu Arg Leu Asp Cys
195 200 205
Arg Pro Glu Leu Ser Pro Leu Glu Asn Val Ser Gln Gln Glu Phe Asp
210 215 220
Leu Asp Lys Tyr Gly Leu Glu Leu Leu Val Leu Asp Thr Gln Ala Pro
225 230 235 240
His Gln Leu Asn Asp Gly Gln Tyr Ala Gln Arg Arg Ala Thr Cys Glu
245 250 255
Lys Ala Ala Glu Ile Leu Gly Val Ala Asn Leu Arg Val Val Ala Asp
260 265 270
Glu Ile Ala Lys Ser Glu Asp Pro Phe Gln Ala Leu Lys Glu Thr Leu
275 280 285
Asp Lys Leu Glu Asp Asp Thr Met Lys Lys Arg Val Arg His Val Ile
290 295 300
Thr Glu Ile Ala Arg Val Asn Ser Phe Val Arg Ala Phe Ala Asn Gly
305 310 315 320
Lys Ile Asp Glu Ala Gly Arg Leu Phe Asn Ala Ser His Asp Ser Leu
325 330 335
Ala Ala Asp Tyr Glu Val Thr Val Pro Glu Leu Asp Ile Ala Val Asp
340 345 350
Val Ala Arg Val Asn Gly Ala Tyr Gly Ala Arg Met Thr Gly Gly Gly
355 360 365
Phe Gly Gly Ser Ile Ile Ala Leu Val Asp Lys Gly Gln Gly His Glu
370 375 380
Ile Ala Gln Lys Ile Ala Asp Arg Phe Glu Lys Glu Gly Phe Asn Ala
385 390 395 400
Pro Arg Ala Leu Pro Ala Phe Ala Ala Ala Ser Ala Ser Arg Glu Ala
405 410 415
<210> 6
<211> 416
<212> PRT
<213> Bifidobacterium pullorum
<400> 6
Met Ser Ser Val Glu Phe Ile Glu Pro Ile Ser Arg Glu Asp Gly Val
1 5 10 15
Ala Arg Ala Thr Glu Leu Phe Arg Ala Thr Tyr Gly Glu Glu Pro Ala
20 25 30
Gly Val Trp Ala Ala Pro Gly Arg Val Asn Leu Ile Gly Glu His Thr
35 40 45
Asp Tyr Asn Ala Gly Leu Cys Leu Pro Ile Ala Leu Pro His Arg Thr
50 55 60
Phe Leu Ala Leu Lys Pro Arg Glu Asp Thr Ala Val Arg Leu Val Ser
65 70 75 80
Asp Val Asn Pro Thr Ala Val Ala Glu Ala Glu Leu Asp Gly Leu Lys
85 90 95
Ala Arg Gly Val Asp Gly Trp Ala Ala Tyr Pro Thr Gly Val Ala Trp
100 105 110
Ala Met Arg Glu Ala Gly Tyr Asp Gln Val Arg Gly Phe Asp Ala Ala
115 120 125
Phe Val Ser Cys Val Pro Leu Gly Ser Gly Leu Ser Ser Ser Ala Ala
130 135 140
Met Thr Cys Ser Thr Ala Leu Ala Leu Asp Asp Val Tyr Gly Leu Gly
145 150 155 160
Tyr Gly Ala Thr Asp Glu Gly Arg Val Thr Leu Ile Thr Met Ala Ile
165 170 175
Lys Ser Glu Asn Asp Met Ala Gly Ala Ser Thr Gly Gly Leu Asp Gln
180 185 190
Asn Ala Ser Met Arg Cys Thr Pro Gly His Ala Ile Arg Leu Asp Cys
195 200 205
Met Pro Gly Leu Ser Ala Val Asp Ser Val Ser Gln Gln Glu Phe Asp
210 215 220
Leu Asp Lys Tyr Gly Leu Glu Leu Leu Val Val Asp Thr Gln Ala His
225 230 235 240
His Gln Leu Asn Asp Gly Gln Tyr Glu Gln Arg Arg Arg Thr Cys Glu
245 250 255
Gln Ala Ala Glu Leu Leu Gly Tyr Glu His Leu Arg Ala Ala Ala Glu
260 265 270
Ala Val Ala Tyr Ser Thr Asp Ser Glu Gly Ser Leu Ala Ala Leu Leu
275 280 285
Asn Cys Leu Asn Asp Glu Thr Met Lys Lys Arg Val Arg His Val Ile
290 295 300
Thr Glu Ile Gly Arg Val Asp Glu Phe Val Lys Ala Phe Ala Ala Gly
305 310 315 320
Asp Ile Ala Glu Ser Gly Arg Leu Phe Asn Ala Ser His Asp Ser Leu
325 330 335
Arg Asp Asp Tyr Glu Val Thr Val Pro Glu Leu Asp Val Ala Val Asp
340 345 350
Met Ala Arg Ala Asn Gly Ala Tyr Gly Ala Arg Met Thr Gly Gly Gly
355 360 365
Phe Gly Gly Ser Ile Ile Ala Leu Val Asp Lys Gly Arg Gly Arg Glu
370 375 380
Val Ala Gln Leu Ile Ala Asp Glu Phe Glu Val Arg Gly Phe His Ala
385 390 395 400
Pro Arg Ala Leu Ala Ala Val Ala Ser Ala Ser Ala Ser Arg Glu Asp
405 410 415
<210> 7
<211> 416
<212> PRT
<213> Bifidobacterium animalis
<400> 7
Met Thr Ser Val Glu Phe Ile Glu Pro Met Ser Asp Ala Glu Gly Ala
1 5 10 15
Ala Arg Ala Ala Glu Leu Phe Lys Gln Ala Tyr Gly Lys Glu Pro Ala
20 25 30
Gly Val Trp Ala Ala Pro Gly Arg Val Asn Leu Ile Gly Glu His Thr
35 40 45
Asp Tyr Asn Ala Gly Leu Cys Leu Pro Ile Ala Leu Pro His Arg Thr
50 55 60
Tyr Ile Ala Leu Ser Pro Arg Asp Asp Thr Ser Val Arg Val Val Ser
65 70 75 80
Asp Leu Ala Ser Asp Val Ile Ala Glu Ala Asp Leu Asp Gly Leu Glu
85 90 95
Ala Gly Gly Val Asp Gly Trp Ala Ala Tyr Pro Val Gly Val Ala Trp
100 105 110
Ala Leu Arg Asn Ala Gly Phe Asp Gly Val Gln Gly Phe Asp Ala Ala
115 120 125
Phe Ser Ser Cys Val Pro Leu Gly Ser Gly Leu Ser Ser Ser Ala Ala
130 135 140
Met Thr Cys Ser Thr Ala Leu Ala Leu Asp Asp Val Tyr Ser Gln Gly
145 150 155 160
Phe Gly Asp Thr Asp Glu Gly Arg Val Thr Leu Ile Asn Ala Ala Ile
165 170 175
Ala Ser Glu Asn Asp Met Ala Gly Ala Ser Thr Gly Gly Leu Asp Gln
180 185 190
Asn Ala Ser Met Arg Cys Thr Pro Asp His Ala Ile Arg Leu Asp Cys
195 200 205
Arg Pro Gly Leu Ser Ala Val Asp Ser Val Gln Gln Glu Val Phe Asp
210 215 220
Leu Glu Gly His Gly Leu Glu Leu Leu Val Leu Asp Thr Arg Ala Pro
225 230 235 240
His Gln Leu Asn Asp Gly Gln Tyr Ala Gln Arg Arg Ala Thr Cys Glu
245 250 255
Glu Ala Ala Arg Ile Leu Gly Val Ala Asn Leu Arg Glu Val Ala Asp
260 265 270
Leu Val Asn Ala Gln Ala Asp Pro Ala Ala Ala Leu Asp Gly Val Leu
275 280 285
Asp Arg Leu Asp Asp Glu Thr Met Arg Lys Arg Val Arg His Val Val
290 295 300
Thr Glu Ile Gly Arg Val Asp Asp Phe Val Arg Ala Phe Ala Glu Gly
305 310 315 320
Asp Met Gln Thr Ala Gly Glu Leu Phe Asn Ala Ser His Asp Ser Leu
325 330 335
Arg Asp Asp Tyr Glu Val Thr Val Pro Glu Leu Asp Val Ala Val Asp
340 345 350
Val Ala Arg Asp Glu Gly Ala Leu Gly Ala Arg Met Thr Gly Gly Gly
355 360 365
Phe Gly Gly Ser Ile Ile Ala Leu Val Asn Ala Gly Glu Ser Gln Arg
370 375 380
Ile Ala Gln Ala Ile Cys Asp Glu Phe Glu Arg Arg Gly Phe Val Leu
385 390 395 400
Pro Arg Ala Leu Pro Ala Gln Ala Ser Ala Ser Ala His Arg Val Gln
405 410 415
<210> 8
<211> 416
<212> PRT
<213> Bifidobacterium bifidum
<400> 8
Met Thr Ala Val Glu Phe Ile Glu Pro Leu Ser Arg Glu Asp Gly Val
1 5 10 15
Ser Arg Ala Thr Lys Leu Phe Val Asp Met Tyr Ala Thr Ala Pro Glu
20 25 30
Gly Val Trp Ala Ala Pro Gly Arg Val Asn Leu Ile Gly Glu His Thr
35 40 45
Asp Tyr Asn Ala Gly Leu Cys Leu Pro Ile Ala Leu Pro His Arg Thr
50 55 60
Phe Ile Ala Leu Lys Pro Arg Glu Asp Thr Lys Val Arg Val Val Ser
65 70 75 80
Asp Val Ala Pro Asp Lys Val Ala Glu Ala Asp Leu Asp Gly Leu Lys
85 90 95
Ala Arg Gly Val Asp Gly Trp Ser Ala Tyr Pro Thr Gly Val Ala Trp
100 105 110
Ala Leu Arg Glu Ala Gly Phe Ser Gln Val Lys Gly Phe Asp Ala Ala
115 120 125
Phe Val Ser Cys Val Pro Leu Gly Ser Gly Leu Ser Ser Ser Ala Ala
130 135 140
Met Thr Cys Ser Thr Ala Leu Ala Leu Asp Asp Val Tyr Gly Leu Gly
145 150 155 160
Tyr Gly Ser Ser Asp Ala Gly Arg Val Thr Leu Ile Asn Ala Ala Ile
165 170 175
Lys Ser Glu Asn Asp Met Ala Gly Ala Ser Thr Gly Gly Leu Asp Gln
180 185 190
Asn Ala Ser Met Arg Cys Thr Glu Gly His Ala Leu Leu Leu Asp Cys
195 200 205
Arg Pro Glu Leu Thr Pro Leu Glu Asn Val Ser Gln Gln Ala Phe Asp
210 215 220
Leu Asp Lys Tyr Gly Leu Glu Leu Leu Val Val Asp Thr Gln Ala Pro
225 230 235 240
His Gln Leu Asn Asp Gly Gln Tyr Ala Gln Arg Arg Ala Thr Cys Glu
245 250 255
Glu Ala Ala Arg Ile Leu Gly Val Ala Asn Leu Arg Val Ala Ala Asp
260 265 270
Gly Ile Ser Lys Ala Asp Asp Gln Phe Gln Ala Leu Lys Glu Thr Leu
275 280 285
Asp Ala Leu Pro Asp Val Thr Met Lys Lys Arg Val Arg His Val Val
290 295 300
Thr Glu Ile Glu Arg Val Arg Ser Phe Val Arg Ala Phe Ala Gln Gly
305 310 315 320
Asp Ile Glu Ala Ala Gly Arg Leu Phe Asn Ala Ser His Asp Ser Leu
325 330 335
Ala Ala Asp Tyr Glu Val Thr Val Pro Glu Leu Asp Val Ala Val Asp
340 345 350
Val Ala Arg Lys Asn Gly Ala Tyr Gly Ala Arg Met Thr Gly Gly Gly
355 360 365
Phe Gly Gly Ser Ile Ile Ala Leu Val Asp Lys Gly Arg Ser Gln Glu
370 375 380
Val Ala Gln Lys Ile Ala Asp Glu Phe Glu Ala Arg Gly Phe His Ala
385 390 395 400
Pro Arg Ala Leu Pro Ala Val Ala Ala Pro Ser Ala Ser Arg Glu Ala
405 410 415
<210> 9
<211> 420
<212> PRT
<213> Bifidobacterium asteroides
<400> 9
Met Thr Arg Thr Val Glu Phe Ile Gln Pro Trp Thr Gln Gly Ala Asp
1 5 10 15
Gly Gln Gly Ala Thr Lys Ala Arg Glu Leu Phe Arg Lys Val Tyr Gly
20 25 30
Gly Asp Pro Gln Gly Val Trp Ser Ala Pro Gly Arg Val Asn Leu Ile
35 40 45
Gly Glu His Thr Asp Tyr Asn Ala Gly Leu Cys Leu Pro Ile Ala Leu
50 55 60
Pro Thr Arg Thr Tyr Val Ala Ala Ser Pro Arg Thr Asp Ser Arg Val
65 70 75 80
Arg Leu Val Ser Thr Met Asp Pro Glu Asn Pro Val Gln Ala Asp Leu
85 90 95
Asp Gly Leu Gln Ala Arg Gly Val Ser Gly Trp Ala Ala Tyr Pro Val
100 105 110
Gly Val Ala Trp Ala Leu Arg Arg Asp Gly Phe Pro Gln Val Arg Gly
115 120 125
Phe Asp Leu Ala Leu Ala Ser Cys Val Pro Val Gly Ser Gly Leu Ser
130 135 140
Ser Ser Ala Ala Met Thr Cys Ala Met Ala Leu Ala Leu Asp Asp Leu
145 150 155 160
Phe Gly Leu Gly Leu Gly Gly Asp Glu Gly Gly Arg Val Arg Leu Ile
165 170 175
Gln Ala Ala Ile Thr Ala Glu Asn Asp Met Ala Gly Ala Ser Thr Gly
180 185 190
Gly Met Asp Gln Ser Ala Ala Met Arg Cys Arg Ser Gly Cys Ala Leu
195 200 205
Arg Leu Asp Cys Arg Pro Glu Leu Asp Ala Met Ser Asn Val Arg Gln
210 215 220
Val Pro Phe Asp Leu Arg Ala Ala Gly Leu Glu Leu Leu Val Val Asp
225 230 235 240
Thr Arg Ala Gln His Gln Leu Asn Asp Gly Gln Tyr Asp Gln Arg Arg
245 250 255
Ala Thr Cys Glu Gln Ala Val His Leu Leu Gly Val Ala Asn Leu Arg
260 265 270
Gln Ala Ala Asp Gln Val Asn Gly Ala Ala Asp Pro Pro Ser Ala Leu
275 280 285
Ala Ala Leu Leu Glu Gln Leu Pro Asp Glu Thr Met Arg Arg Arg Val
290 295 300
Arg His Val Ile Ser Glu Ile Gly Arg Val Asp Arg Phe Ile Glu Ala
305 310 315 320
Phe Gly Arg Gly Asp Tyr Val Leu Ala Gly Arg Leu Ile Asn Ala Ser
325 330 335
His Asp Ser Leu Arg Asp Asp Tyr Glu Val Thr Cys Pro Glu Leu Asp
340 345 350
Glu Ala Val Asp Ala Ala Arg Gln Gly Gly Ala Tyr Gly Ala Arg Met
355 360 365
Thr Gly Gly Gly Phe Gly Gly Ser Ile Ile Ala Leu Ala Asp Ala Gly
370 375 380
Lys Gly Ser Gly Leu Ala Arg Asp Ile Ala Glu Arg Phe Ala Ser Lys
385 390 395 400
Gly Phe Lys Ala Pro Arg Ala Leu Ile Ala Leu Pro Ser Ser Ala Ala
405 410 415
Thr Arg Glu Ser
420
<210> 10
<211> 416
<212> PRT
<213> Bifidobacterium breve
<400> 10
Met Ser Ala Val Glu Phe Ile Glu Pro Leu Thr His Glu Glu Gly Val
1 5 10 15
Ser Gln Ala Thr Lys Leu Phe Val Asp Thr Tyr Gly Ala Ala Pro Glu
20 25 30
Gly Val Trp Ala Ala Pro Gly Arg Val Asn Leu Ile Gly Glu His Thr
35 40 45
Asp Tyr Asn Ala Gly Leu Cys Leu Pro Ile Ala Leu Pro His Arg Thr
50 55 60
Phe Ile Ala Leu Lys Pro Arg Glu Asp Thr Lys Val Arg Val Val Ser
65 70 75 80
Gly Val Ala Pro Asp Lys Val Ala Glu Ala Asp Leu Asp Gly Leu Lys
85 90 95
Ala Arg Gly Val Asp Gly Trp Ser Ala Tyr Pro Thr Gly Val Ala Trp
100 105 110
Ala Leu Arg Gln Ala Gly Phe Asp Lys Val Lys Gly Phe Asp Ala Ala
115 120 125
Phe Val Ser Cys Val Pro Leu Gly Ser Gly Leu Ser Ser Ser Ala Ala
130 135 140
Met Thr Cys Ser Thr Ala Leu Ala Leu Asp Asp Val Tyr Gly Leu Gly
145 150 155 160
Tyr Gly Asp Ser Asp Ala Gly Arg Val Thr Leu Ile Asn Ala Ala Ile
165 170 175
Lys Ser Glu Asn Glu Met Ala Gly Ala Ser Thr Gly Gly Leu Asp Gln
180 185 190
Asn Ala Ser Met Arg Cys Thr Glu Gly His Ala Leu Leu Leu Asp Cys
195 200 205
Arg Pro Glu Leu Thr Pro Leu Glu Asn Val Ser Gln Gln Glu Phe Gly
210 215 220
Leu Asp Lys Tyr Asn Leu Glu Leu Leu Val Val Asp Thr Gln Ala Pro
225 230 235 240
His Gln Leu Asn Asp Gly Gln Tyr Ala Gln Arg Arg Ala Thr Cys Glu
245 250 255
Glu Ala Ala Lys Ile Leu Gly Val Ala Asn Leu Arg Val Thr Ala Asp
260 265 270
Gly Ile Ser Lys Ala Asp Asp Gln Phe Gln Ala Leu Lys Glu Thr Leu
275 280 285
Asp Ala Leu Pro Asp Glu Thr Met Lys Lys Arg Val Arg His Val Val
290 295 300
Thr Glu Ile Glu Arg Val Arg Ser Phe Val Arg Ala Phe Ala Gln Gly
305 310 315 320
Asp Ile Lys Ala Ala Gly Arg Leu Phe Asn Ala Ser His Asp Ser Leu
325 330 335
Ala Ala Asp Tyr Glu Val Thr Val Pro Glu Leu Asp Ile Ala Val Asp
340 345 350
Val Ala Arg Lys Asn Gly Ala Tyr Gly Ala Arg Met Thr Gly Gly Gly
355 360 365
Phe Gly Gly Ser Ile Ile Ala Leu Val Asp Lys Gly Arg Ser Gln Glu
370 375 380
Val Ala Gln Lys Ile Ala Asp Glu Phe Glu Lys Gln Gly Phe His Ala
385 390 395 400
Pro Arg Ala Leu Ala Ala Tyr Ala Ala Pro Ser Ala Ser Arg Glu Ala
405 410 415
<210> 11
<211> 418
<212> PRT
<213> Bifidobacterium pseudolongum
<400> 11
Met Thr Thr Ala Val Glu Phe Ile Glu Pro Met Gly Asp Ala Asp Gly
1 5 10 15
Ala Ala Arg Ala Ala Ala Leu Phe Glu Ala Arg Phe Gly Thr Ala Pro
20 25 30
Ala Gly Val Trp Ala Ala Pro Gly Arg Val Asn Leu Ile Gly Glu His
35 40 45
Thr Asp Tyr Asn Gly Gly Leu Cys Leu Pro Ile Ala Leu Pro His Arg
50 55 60
Thr Tyr Val Ala Leu Ala Pro Arg Asp Asp Thr Thr Val Arg Val Ile
65 70 75 80
Ser Asp Met Thr Pro Asp Glu Met Thr Met Val Asp Leu Asp Gly Leu
85 90 95
Ala Ala Gly Gly Val Asp Gly Trp Gly Ala Tyr Pro Ile Gly Val Ala
100 105 110
Trp Ala Leu Arg Glu Ala Gly Phe Asp Gln Val Arg Gly Phe Asp Ala
115 120 125
Val Phe Ser Ser Cys Val Pro Leu Gly Ser Gly Leu Ser Ser Ser Ala
130 135 140
Ala Met Thr Cys Ser Thr Ala Leu Ala Leu Asp Asp Val Tyr Gly Leu
145 150 155 160
Gly Phe Gly Gly Ser Asp Glu Gly Arg Ile Thr Leu Ile Asp Ala Ala
165 170 175
Val Met Ala Glu Asn Glu Met Ala Gly Ala Ser Thr Gly Gly Leu Asp
180 185 190
Gln Asn Ala Ser Met Arg Cys Ala Ala Asp His Ala Ile Arg Leu Asp
195 200 205
Cys Met Pro Gly Leu Thr Ala Ala Gln Ser Val Arg Gln Glu Pro Phe
210 215 220
Asp Leu Ser Ala Tyr Gly Leu Glu Leu Leu Val Leu Asp Thr Gln Ala
225 230 235 240
Pro His Gln Leu Asn Asp Gly Gln Tyr Glu Ala Arg Arg Thr Met Cys
245 250 255
Glu Glu Ala Ala Gln Ile Leu Gly Val Pro Asn Leu Arg Val Val Ala
260 265 270
Asp Gln Val Asn Ala Ala Val Asp Pro Ala Ala Ala Leu Glu Asp Val
275 280 285
Leu Ser Gln Leu Asp Asp Glu Thr Met Arg Arg Arg Val Arg His Val
290 295 300
Ile Thr Glu Ile Gly Arg Val Asp Asp Phe Ile Gly Ala Phe Gly Arg
305 310 315 320
Gly Asp Ile Glu Thr Ala Gly Ala Leu Phe Asn Ala Ser His Asp Ser
325 330 335
Leu Arg Asp Asp Tyr Glu Val Thr Val Pro Glu Leu Asp Val Ala Val
340 345 350
Asp Val Ala Arg Asp Glu Gly Ala Tyr Gly Ala Arg Met Thr Gly Gly
355 360 365
Gly Phe Gly Gly Ser Ile Ile Ala Leu Val Asn Ala Gly Glu Ser Arg
370 375 380
Arg Ile Ala Gln Ala Ile Ala Asp Glu Phe Ala Arg Arg Gly Phe Asp
385 390 395 400
Ala Pro Arg Ala Leu Pro Ala Arg Ala Ser Gln Ser Ala His Arg Val
405 410 415
Asn Asp
<210> 12
<211> 416
<212> PRT
<213> Bifidobacterium pseudocatenulatum
<400> 12
Met Thr Ala Val Glu Phe Ile Glu Pro Leu Ser His Asp Glu Gly Val
1 5 10 15
Lys Asn Ala Thr Asp Leu Phe Arg Ala Thr Tyr Gly Glu Glu Pro Ala
20 25 30
Gly Val Trp Ala Ala Pro Gly Arg Val Asn Leu Ile Gly Glu His Thr
35 40 45
Asp Tyr Asn Ala Gly Leu Cys Leu Pro Ile Ala Leu Pro His Arg Thr
50 55 60
Phe Ile Ala Leu Lys Pro Arg Glu Asp Thr Lys Val Arg Val Val Ser
65 70 75 80
Asp Val Asp Ser Gly Asn Val Thr Glu Ala Asp Leu Asp Gly Leu Gln
85 90 95
Ala Gly Gly Val Glu Gly Trp Ala Ala Tyr Pro Val Gly Val Ala Trp
100 105 110
Ala Leu Arg Glu Ala Gly Phe Asn Ala Val Gln Gly Phe Asp Ala Ala
115 120 125
Phe Ser Ser Cys Val Pro Leu Gly Ser Gly Leu Ser Ser Ser Ala Ala
130 135 140
Met Thr Cys Ser Thr Ala Leu Ala Leu Asp Asp Val Tyr Gly Leu Gly
145 150 155 160
Tyr Gly Ala Ser Asp Ala Gly Arg Val Thr Leu Ile Asn Ala Ala Ile
165 170 175
Lys Ser Glu Asn Asp Met Ala Gly Ala Ser Thr Gly Gly Leu Asp Gln
180 185 190
Asn Ala Ser Met Arg Cys Thr Phe Gly His Ala Leu Arg Leu Asp Cys
195 200 205
Arg Pro Glu Leu Ser Pro Leu Glu Asn Val Ser Gln Gln Glu Phe Asp
210 215 220
Leu Asp Lys Tyr Gly Leu Glu Leu Leu Val Leu Asp Thr Gln Ala Pro
225 230 235 240
His Gln Leu Asn Asp Gly Gln Tyr Ala Gln Arg Arg Ala Thr Cys Glu
245 250 255
Lys Ala Ala Glu Ile Leu Gly Val Ala Asn Leu Arg Val Val Ala Asp
260 265 270
Ser Ile Ala Lys Ser Gly Asp Pro Phe Gln Ala Leu Lys Glu Thr Leu
275 280 285
Asp Lys Leu Glu Asp Asp Thr Met Lys Lys Arg Val Arg His Val Ile
290 295 300
Thr Glu Ile Ala Arg Val Asn Ser Phe Val Arg Ala Phe Ala Asn Gly
305 310 315 320
Lys Ile Asp Glu Ala Gly Arg Leu Phe Asn Ala Ser His Asp Ser Leu
325 330 335
Ala Ala Asp Tyr Glu Val Thr Val Pro Glu Leu Asp Ile Ala Val Asp
340 345 350
Val Ala Arg Ala Asn Gly Ala Tyr Gly Ala Arg Met Thr Gly Gly Gly
355 360 365
Phe Gly Gly Ser Ile Ile Ala Leu Val Asn Lys Gly Gln Gly His Glu
370 375 380
Ile Ala Gln Lys Ile Ala Asp Arg Phe Glu Lys Glu Gly Phe Asn Ala
385 390 395 400
Pro Arg Ala Leu Pro Ala Phe Ala Ala Ala Ser Ala Ser Arg Glu Ala
405 410 415
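The protein entries above are written with three-letter residue codes, with position counts on the alternating lines. As a minimal sketch that is not part of the patent (the helper name `st25_protein_to_one_letter` is hypothetical), the following Python converts such a residue block to a one-letter string, so that its length can be compared against the `<211>` field of the entry:

```python
# Hypothetical helper, not taken from the patent: converts a block of
# three-letter residue codes (as printed in the listing) to one-letter codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def st25_protein_to_one_letter(block: str) -> str:
    """Return the one-letter sequence for a residue block.

    Pass only the lines that follow the <400> tag; header lines such as
    '<210> 7' are not handled and would raise KeyError.
    """
    residues = []
    for line in block.splitlines():
        tokens = line.split()
        # Skip blank lines and the position-count lines (digits only).
        if not tokens or all(tok.isdigit() for tok in tokens):
            continue
        for tok in tokens:
            residues.append(THREE_TO_ONE[tok])
    return "".join(residues)


# First residue line and count line of SEQ ID NO: 7, copied from the listing.
example = """Met Thr Ser Val Glu Phe Ile Glu Pro Met Ser Asp Ala Glu Gly Ala
1 5 10 15"""
print(st25_protein_to_one_letter(example))  # MTSVEFIEPMSDAEGA
```

Applied to a full entry, `len(st25_protein_to_one_letter(block))` should equal the value given in the `<211>` line if the entry was transcribed without loss.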
<210> 13
<211> 1278
<212> DNA
<213> Artificial Sequence
<220>
<223> Nucleotide sequence encoding BiGalK
<400> 13
catatgacag ctgtagaatt tatagagccc ctaacccatg aggagggtgt ctcccaggca 60
accaagctgt ttgtcgacac ctatggtgct gctccggagg gcgtgtgggc tgcgccgggt 120
cgtgtaaatc tgattggtga acataccgat tataacgctg gcctttgcct gcccatcgcg 180
ttgccgcaca gaacctttat tgcgcttaag ccgcgcgaag ataccaaagt ccgcgtggtt 240
tccggtgttg ctccggataa ggtggctgag gctgatctgg acggcctgaa ggcccgcggg 300
gtggacggtt ggtctgcgta cccgaccggt gtggcgtggg cactgcgtca ggccggcttc 360
gataaggtga aaggtttcga cgcggccttc gtgagctgtg ttccgttggg cagcggtctt 420
tcttcctcag ccgcaatgac gtgcagcacc gctttagcgc tcgacgatgt ttacggcctg 480
ggttatggcg atagcgatgc gggccgcgtg acgctgatta acgcggcgat taaaagcgaa 540
aatgaaatgg caggtgcgtc gaccggtggt ttagaccaaa acgcaagcat gcgttgcacc 600
gagggccacg cactgctgtt ggactgccgt ccggagctga ccccgctgga gaacgtgtct 660
cagcaagagt tcgacctgga caagtacaac ctggaactgc tggttgtcga tacccaggcg 720
ccacaccagc tgaatgatgg ccaatatgca caacgtcgtg cgacttgtga agaggctgcc 780
aagatcctgg gcgtggcgaa tttgcgcgtc acggcggatg gcatcagcaa agcggacgac 840
cagtttcagg cgttgaagga aactctggac gccttgccag atgagacaat gaaaaaacgt 900
gttcgtcacg tggtaaccga aatcgaacgt gttagaagct ttgttcgcgc gtttgcacaa 960
ggtgatatca aggcggctgg ccgtctgttc aacgcgagcc atgattcgct ggctgccgac 1020
tacgaagtta cggttccgga gctcgacatc gcggttgacg ttgcgcgtaa aaacggtgcg 1080
tacggcgcgc gcatgaccgg tggtggtttc ggcggctcca ttatcgcgct tgtggataag 1140
ggtcagggtc acgagatcgc ccaaaaaatt gcggatcgtt ttgaaaaaga ggggttcaac 1200
gctccgcgtg cgcttccggc attcgctgcg gcatctgcca gccgtgaagc caaattggcc 1260
gccgcgctgg agctcgag 1278
<210> 14
<211> 1854
<212> DNA
<213> Artificial Sequence
<220>
<223> Nucleotide sequence encoding AtUSP
<400> 14
catatggcta gcaccgttga tagcaacttc ttctctagcg tgccggcact gcatagcaac 60
ctgggtctgc tgtccccgga tcagattgaa ctggcaaaaa tcctgctgga aaacggccag 120
tcccacctgt tccagcagtg gccggaactg ggcgttgacg ataaagaaaa actggccttc 180
ttcgatcaga ttgctcgtct gaactcttcc tatccaggcg gcctggctgc gtacatcaaa 240
accgcgaaag agctgctggc ggatagcaaa gttggtaaaa acccgtatga tggtttttct 300
ccgtctgttc cgagcggcga aaacctgact ttcggcaccg ataatttcat tgaaatggaa 360
aaacgtggtg ttgtggaagc ccgtaacgca gcgtttgtgc tggttgcagg tggcctgggc 420
gaacgtctgg gttacaacgg tatcaaagtt gcgctgccgc gtgaaaccac caccggcacc 480
tgtttcctgc agcactatat cgaatctatc ctggctctgc aggaagcgtc taacaaaatc 540
gatagcgatg gctctgaacg tgacattccg ttcatcatca tgacctccga tgatactcac 600
tcccgtaccc tggacctgct ggagctgaac agctactttg gcatgaaacc gacccaggtg 660
cacctcctga aacaggaaaa agttgcttgc ctggatgata acgatgcccg tctggcgctg 720
gatccgcaca acaaatatag cattcagacc aaaccacacg gtcacggtga tgtgcatagc 780
ctgctgtact cttctggtct gctgcacaaa tggctggaag ctggtctgaa atgggtgctg 840
ttcttccagg ataccaacgg cctgctgttt aacgctattc cggcctctct gggcgtgagc 900
gcgactaaac agtaccacgt taactccctg gctgttccac gtaaagctaa agaagcgatc 960
ggtggtatca gcaaactgac ccacgttgat ggtcgttcta tggtgattaa cgtggaatat 1020
aaccaactcg acccgctgct gcgcgcttcc ggcttcccgg acggcgacgt gaactgtgaa 1080
accggtttta gcccgtttcc gggtaacatc aaccagctga tcctggaact tggcccgtat 1140
aaagacgaac tgcagaaaac cggcggtgcg attaaagaat tcgttaaccc gaaatataaa 1200
gacagcacta aaaccgcgtt caaatccagc acccgcctgg aatgcatgat gcaggactac 1260
ccgaaaactc tgccgccgac cgcgcgcgtt ggcttcaccg taatggatat ctggctggct 1320
tacgcgccgg ttaaaaacaa cccggaagat gctgctaaag ttccgaaagg taacccgtac 1380
cacagcgcaa cctctggtga aatggcgatc tatcgtgcga actctctgat tctgcagaaa 1440
gcaggcgtta aagttgaaga accggttaaa caggtgctga acggccaaga agttgaagtt 1500
tggagccgta tcacctggaa accgaaatgg ggtatgatct tttctgacat taaaaagaaa 1560
gtgtctggta actgtgaagt ttcccagcgt tccactatgg cgatcaaagg tcgcaatgtg 1620
tttatcaaag atctgagcct ggacggtgct ctgatcgttg atagcatcga tgacgcggaa 1680
gttaaactgg gcggtctgat taaaaacaac ggctggacca tggaatctgt agattacaaa 1740
gatacctctg ttccggaaga aatccgtatc cgtggcttcc gtttcaacaa agttgaacag 1800
ctggaaaaga aactgaccca gccgggtaaa ttctctgttg aagattaact cgag 1854
<210> 15
<211> 531
<212> DNA
<213> Artificial Sequence
<220>
<223> Nucleotide sequence encoding PPA
<400> 15
atgagcttac tcaacgtccc tgcgggtaaa gatctgccgg aagacatcta cgttgttatt 60
gagatcccgg ctaacgcaga tccgatcaaa tacgaaatcg acaaagagag cggcgcactg 120
ttcgttgacc gcttcatgtc caccgcgatg ttctatccgt gcaactacgg ttacatcaac 180
cacaccctgt ctctggacgg tgacccagtt gacgtactgg tcccaactcc gtacccgctg 240
cagccgggtt ctgtgatccg ttgccgtccg gttggcgttc tgaaaatgac cgacgaagcc 300
ggtgaagatg cgaaactggt tgcggttccg cacagcaagc tgagcaaaga atacgatcac 360
attaaagacg ttaacgatct gcctgaactg ctgaaagcgc aaatcgctca cttcttcgag 420
cactacaaag acctcgaaaa aggcaagtgg gtgaaagttg aaggttggga aaacgcagaa 480
gccgctaaag ctgaaatcgt tgcctccttc gagcgcgcaa agaataaata a 531
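The nucleotide entries above are printed in groups of ten bases with a running count at the end of each line. The following is a rough Python sketch, not part of the patent, for reading such lines and translating them with the standard genetic code; the function names `st25_dna_to_string` and `translate` are hypothetical, and the sketch assumes every line is well formed (base groups followed by one integer).

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def st25_dna_to_string(block: str) -> str:
    """Join the base groups of a listing block into one uppercase string.

    The trailing number on each line is treated as the running base count
    and checked against the bases read so far.
    """
    seq = ""
    for line in block.splitlines():
        tokens = line.split()
        if not tokens:
            continue
        *groups, count = tokens
        seq += "".join(groups).upper()
        if len(seq) != int(count):
            raise ValueError(f"running count mismatch at {count}")
    return seq


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of SEQ ID NO: 15, copied from the listing.
first_line = "atgagcttac tcaacgtccc tgcgggtaaa gatctgccgg aagacatcta cgttgttatt 60"
dna = st25_dna_to_string(first_line)
print(translate(dna))  # MSLLNVPAGKDLPEDIYVVI
```

SEQ ID NO: 13 and SEQ ID NO: 14 as listed begin with `catatg`, so for those entries the `atg` start codon sits at the fourth base and the translation would start from `dna[3:]`.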
Claims (10)
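1. A method for synthesizing uridine diphosphate-6-azido-D-galactose, comprising the following steps:
S1: compound 6 is converted into compound 12 by the action of a Bifidobacterium-derived galactokinase, according to the following reaction scheme:
preferably, the galactokinase is selected from: a galactokinase derived from Bifidobacterium longum (BiGalK), or a BiGalK-derived protein in which one or several amino acids of the BiGalK amino acid sequence have been substituted, deleted, or added and which retains BiGalK activity; a galactokinase derived from Bifidobacterium dentium; a galactokinase derived from Bifidobacterium catenulatum; a galactokinase derived from Bifidobacterium pullorum; a galactokinase derived from Bifidobacterium animalis; a galactokinase derived from Bifidobacterium bifidum; a galactokinase derived from Bifidobacterium asteroides; a galactokinase derived from Bifidobacterium breve; a galactokinase derived from Bifidobacterium pseudolongum; and a galactokinase derived from Bifidobacterium pseudocatenulatum;
S2: compound 12 is converted into uridine diphosphate-6-azido-D-galactose 1α by the action of an Arabidopsis thaliana-derived sugar pyrophosphorylase (AtUSP), or of an AtUSP-derived protein in which one or several amino acids of the AtUSP amino acid sequence have been substituted, deleted, or added and which retains AtUSP activity, according to the following reaction scheme: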
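2. The method according to claim 1, wherein the amino acid sequence of the BiGalK is as shown in SEQ ID NO: 1; the amino acid sequence of the galactokinase derived from Bifidobacterium dentium is as shown in SEQ ID NO: 4; the amino acid sequence of the galactokinase derived from Bifidobacterium catenulatum is as shown in SEQ ID NO: 5; the amino acid sequence of the galactokinase derived from Bifidobacterium pullorum is as shown in SEQ ID NO: 6; the amino acid sequence of the galactokinase derived from Bifidobacterium animalis is as shown in SEQ ID NO: 7; the amino acid sequence of the galactokinase derived from Bifidobacterium bifidum is as shown in SEQ ID NO: 8; the amino acid sequence of the galactokinase derived from Bifidobacterium asteroides is as shown in SEQ ID NO: 9; the amino acid sequence of the galactokinase derived from Bifidobacterium breve is as shown in SEQ ID NO: 10; the amino acid sequence of the galactokinase derived from Bifidobacterium pseudolongum is as shown in SEQ ID NO: 11; and the amino acid sequence of the galactokinase derived from Bifidobacterium pseudocatenulatum is as shown in SEQ ID NO: 12.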
3. The method according to claim 1, wherein the nucleotide sequence encoding the BiGalK is as shown in SEQ ID NO: 13.
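4. The method according to claim 1,
wherein, in step S1, the reaction is carried out in the presence of adenosine triphosphate and the metal ion Mg²⁺, and/or
wherein, in step S1, the pH of the reaction system is 6 to 9, preferably 7.5.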
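5. The method according to claim 1, wherein, in step S2, the amino acid sequence of the AtUSP is as shown in SEQ ID NO: 2; preferably, the nucleotide sequence encoding the AtUSP is as shown in SEQ ID NO: 14.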
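6. The method according to claim 1,
wherein, in step S2, the reaction is carried out in the presence of uridine triphosphate and the metal ion Mg²⁺, and/or
wherein, in step S2, the pH of the reaction system is 6 to 9, preferably about 7.5.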
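7. The method according to claim 1, wherein step S2 further uses an inorganic pyrophosphatase (PPA), or a PPA-derived protein in which one or several amino acids of the PPA amino acid sequence have been substituted, deleted, or added and which retains PPA activity.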
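8. The method according to claim 1, wherein, in step S2, the amino acid sequence of the PPA is as shown in SEQ ID NO: 3; preferably, the nucleotide sequence encoding the PPA is as shown in SEQ ID NO: 15.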
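10. The method according to any one of claims 1-9, further comprising a separation and purification step; preferably, the purification step comprises: after the above catalytic synthesis reaction is complete, adding ethanol, performing solid-liquid separation, removing the solvent from the liquid under reduced pressure, dissolving the residue in deionized water, separating on a polyacrylamide gel column (P-2), and checking the separation by thin-layer chromatography (TLC); pooling the fractions containing the product, removing the solvent under reduced pressure, dissolving in deionized water, and separating on an agarose gel (QS-FF) ion-exchange column to remove impurities; pooling the fractions containing the product, removing the solvent under reduced pressure, dissolving in deionized water, separating on a P-2 gel column, and collecting the fractions containing the product, which are preferably concentrated and freeze-dried to give the target product.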
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110968981.9A CN115710594B (zh) | 2021-08-23 | Method for synthesizing uridine diphosphate-6-azido-D-galactose |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110968981.9A CN115710594B (zh) | 2021-08-23 | Method for synthesizing uridine diphosphate-6-azido-D-galactose |
Publications (2)
Publication Number | Publication Date |
---|---|
CN115710594A (zh) | 2023-02-24 |
CN115710594B (zh) | 2024-06-28 |
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
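CN108504702A (zh) * | 2012-08-20 | 2018-09-07 | Academia Sinica | Method for large-scale enzymatic synthesis of oligosaccharides |
CN113265434A (zh) * | 2021-05-19 | 2021-08-17 | Jilin University | Method for synthesizing UDP-galactose and for synthesizing galactosyl compounds |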
Non-Patent Citations (2)
Title |
---|
JUN LIU et al.: "Biosynthesis of nucleotide sugars by a promiscuous UDP-sugar pyrophosphorylase from Arabidopsis thaliana (AtUSP)", Bioorganic & Medicinal Chemistry Letters, vol. 23, 9 May 2013 (2013-05-09), pages 3764 - 3768, XP028564913, DOI: 10.1016/j.bmcl.2013.04.090 * |
LEI LI et al.: "A highly efficient galactokinase from Bifidobacterium infantis with broad substrate specificity", Carbohydrate Research, vol. 355, 8 May 2012 (2012-05-08), pages 35 - 39 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
TWI567200B (zh) | Large-scale enzymatic synthesis of oligosaccharides | |
KR101525230B1 (ko) | Method for producing sialic acid derivatives | |
US11993627B2 (en) | Enzymatic synthesis of homogeneous chondroitin sulfate oligosaccharides | |
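CN110699373A (zh) | High-yield uridine diphosphate glucose strain and use thereof | |
CN1314913A (zh) | Targeted combinatorial compound libraries and efficient assay methods for screening them | |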
Zou et al. | One-pot three-enzyme synthesis of UDP-Glc, UDP-Gal, and their derivatives | |
EP3097200A1 (en) | Process for the attachment of a galnac moiety comprising a (hetero)aryl group to a glcnac moiety, and product obtained thereby | |
CN115710594B (zh) | Method for synthesizing uridine diphosphate-6-azido-D-galactose | |
CN115710594A (zh) | Method for synthesizing uridine diphosphate-6-azido-D-galactose | |
EP2094840B1 (en) | Novel n-acetylglucosamine-2-epimerase and method for producing cmp-neuraminic acid using the same | |
JPWO2004009830A1 (ja) | Method for producing CMP-N-acetylneuraminic acid | |
CN107541503B (zh) | Methyltransferase GenL, its encoding gene genL, and use thereof | |
US20110288286A1 (en) | Process for the enzymatic production of cyclic diguanosine monophosphate employing a diguanylate cyclase comprising a mutated rxxd motif | |
US20200325457A1 (en) | Sialyl transferase variants having neosialidase activity | |
US20240199675A1 (en) | Oligosaccharide analytical standards | |
CN113025549B (zh) | Biological glycosylation synthesis system for staurosporine-scaffold compounds and synthesis method thereof | |
Xue et al. | Mechanistic Investigations into the Catalytic Mode of a Dehydratase Complex Involved in the Biosynthesis of Lantibiotic Cacaoidin | |
Schwardt et al. | Minireview: bacterial sialyltransferases for carbohydrate synthesis | |
CN116355875B (zh) | Methionine adenosyltransferase mutant and its use in producing S-adenosylmethionine | |
Kajihara et al. | Studies in glycopeptide synthesis | |
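JP4509447B2 (ja) | High-purity guanosine 5′-diphosphate fucose and method for producing the same | |
JP2826636B2 (ja) | Method for producing phosphate-containing acidic sugar chains using the yeast mannose-1-phosphate transferase gene | |
JP4171805B2 (ja) | Glycosyltransferase | |
JP4901447B2 (ja) | Method for producing CMP-deaminoneuraminic acid | |
CN114736944A (zh) | Method for chemoenzymatic synthesis of α-dystroglycan-related glycopeptides |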
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant |