RU2810730C2 - Use of glycosidases in oligosaccharide production - Google Patents
- Publication number
- RU2810730C2 (RU2020138045A)
- Authority
- RU
- Russia
- Prior art keywords
- ala
- gly
- thr
- val
- ser
- Prior art date
Links
- 238000004519 manufacturing process Methods 0.000 title claims abstract description 44
- 102000005744 Glycoside Hydrolases Human genes 0.000 title description 40
- 108010031186 Glycoside Hydrolases Proteins 0.000 title description 40
- SNFSYLYCDAVZGP-UHFFFAOYSA-N UNPD26986 Natural products OC1C(O)C(O)C(C)OC1OC1C(OC2C(OC(O)C(O)C2O)CO)OC(CO)C(O)C1O SNFSYLYCDAVZGP-UHFFFAOYSA-N 0.000 claims abstract description 57
- 229940062827 2'-fucosyllactose Drugs 0.000 claims abstract description 46
- HWHQUWQCBPAQQH-UHFFFAOYSA-N 2-O-alpha-L-Fucosyl-lactose Natural products OC1C(O)C(O)C(C)OC1OC1C(O)C(O)C(CO)OC1OC(C(O)CO)C(O)C(O)C=O HWHQUWQCBPAQQH-UHFFFAOYSA-N 0.000 claims abstract description 46
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 claims abstract description 35
- 238000000034 method Methods 0.000 claims abstract description 35
- 239000008101 lactose Substances 0.000 claims abstract description 34
- 241000588724 Escherichia coli Species 0.000 claims abstract description 31
- SHZGCJCMOBCMKK-DHVFOXMCSA-N L-fucopyranose Chemical compound C[C@@H]1OC(O)[C@@H](O)[C@H](O)[C@@H]1O SHZGCJCMOBCMKK-DHVFOXMCSA-N 0.000 claims abstract description 28
- 230000015572 biosynthetic process Effects 0.000 claims abstract description 25
- SHZGCJCMOBCMKK-UHFFFAOYSA-N D-mannomethylose Natural products CC1OC(O)C(O)C(O)C1O SHZGCJCMOBCMKK-UHFFFAOYSA-N 0.000 claims abstract description 22
- WJPIUUDKRHCAEL-UHFFFAOYSA-N 3FL Natural products OC1C(O)C(O)C(C)OC1OC1C(OC2C(C(O)C(O)C(CO)O2)O)C(CO)OC(O)C1O WJPIUUDKRHCAEL-UHFFFAOYSA-N 0.000 claims abstract description 18
- AUNPEJDACLEKSC-ZAYDSPBTSA-N 3-fucosyllactose Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](O)[C@H](O[C@@H]2[C@H](OC(O)[C@H](O)[C@H]2O)CO)O[C@H](CO)[C@@H]1O AUNPEJDACLEKSC-ZAYDSPBTSA-N 0.000 claims abstract description 17
- PNNNRSAQSRJVSB-SLPGGIOYSA-N Fucose Natural products C[C@H](O)[C@@H](O)[C@H](O)[C@H](O)C=O PNNNRSAQSRJVSB-SLPGGIOYSA-N 0.000 claims abstract description 17
- 239000007857 degradation product Substances 0.000 claims abstract description 15
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 claims abstract description 13
- 230000002255 enzymatic effect Effects 0.000 claims abstract description 11
- 230000003834 intracellular effect Effects 0.000 claims abstract description 11
- 230000010039 intracellular degradation Effects 0.000 claims abstract description 8
- 101001027098 Arabidopsis thaliana Fucose-1-phosphate guanylyltransferase Proteins 0.000 claims abstract description 6
- PTVXQARCLQPGIR-SXUWKVJYSA-N beta-L-fucose 1-phosphate Chemical compound C[C@@H]1O[C@H](OP(O)(O)=O)[C@@H](O)[C@H](O)[C@@H]1O PTVXQARCLQPGIR-SXUWKVJYSA-N 0.000 claims abstract description 6
- 230000001588 bifunctional effect Effects 0.000 claims abstract description 6
- 108010064833 guanylyltransferase Proteins 0.000 claims abstract description 6
- HWHQUWQCBPAQQH-BWRPKUOHSA-N 2-fucosyllactose Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@H]([C@H](O)CO)[C@H](O)[C@@H](O)C=O HWHQUWQCBPAQQH-BWRPKUOHSA-N 0.000 claims abstract 25
- 108090000623 proteins and genes Proteins 0.000 claims description 46
- 108700023372 Glycosyltransferases Proteins 0.000 claims description 11
- 102000051366 Glycosyltransferases Human genes 0.000 claims description 9
- 101710098620 Alpha-1,2-fucosyltransferase Proteins 0.000 claims description 8
- 241000186016 Bifidobacterium bifidum Species 0.000 claims description 8
- 108010019236 Fucosyltransferases Proteins 0.000 claims description 8
- 102000006471 Fucosyltransferases Human genes 0.000 claims description 8
- 229940002008 bifidobacterium bifidum Drugs 0.000 claims description 8
- 108090000141 Sialyltransferases Proteins 0.000 claims description 6
- 102000003838 Sialyltransferases Human genes 0.000 claims description 6
- 108060003306 Galactosyltransferase Proteins 0.000 claims description 4
- 102000030902 Galactosyltransferase Human genes 0.000 claims description 4
- 238000012258 culturing Methods 0.000 claims description 3
- 241000606124 Bacteroides fragilis Species 0.000 claims description 2
- 102000000340 Glucosyltransferases Human genes 0.000 claims 1
- 108010055629 Glucosyltransferases Proteins 0.000 claims 1
- 102000002493 N-Acetylglucosaminyltransferases Human genes 0.000 claims 1
- 108010093077 N-Acetylglucosaminyltransferases Proteins 0.000 claims 1
- 238000002955 isolation Methods 0.000 claims 1
- 150000002482 oligosaccharides Chemical class 0.000 abstract description 120
- 229920001542 oligosaccharide Polymers 0.000 abstract description 118
- 239000006227 byproduct Substances 0.000 abstract description 43
- 230000000694 effects Effects 0.000 abstract description 7
- 239000000126 substance Substances 0.000 abstract 1
- 210000004027 cell Anatomy 0.000 description 151
- 230000000813 microbial effect Effects 0.000 description 124
- 239000002773 nucleotide Substances 0.000 description 89
- 125000003729 nucleotide group Chemical group 0.000 description 86
- 230000014509 gene expression Effects 0.000 description 44
- 108091028043 Nucleic acid sequence Proteins 0.000 description 33
- SNFSYLYCDAVZGP-OLAZETNGSA-N 2'-fucosyllactose Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@H]1O[C@H]1[C@H](O[C@@H]2[C@H](OC(O)[C@H](O)[C@H]2O)CO)O[C@H](CO)[C@H](O)[C@@H]1O SNFSYLYCDAVZGP-OLAZETNGSA-N 0.000 description 32
- WQZGKKKJIJFFOK-PHYPRBDBSA-N alpha-D-galactose Chemical group OC[C@H]1O[C@H](O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-PHYPRBDBSA-N 0.000 description 22
- 244000005700 microbiome Species 0.000 description 22
- 125000003275 alpha amino acid group Chemical group 0.000 description 21
- 108020004707 nucleic acids Proteins 0.000 description 20
- 102000039446 nucleic acids Human genes 0.000 description 20
- 150000007523 nucleic acids Chemical class 0.000 description 20
- 108090000765 processed proteins & peptides Proteins 0.000 description 20
- 229920001184 polypeptide Polymers 0.000 description 19
- 102000004196 processed proteins & peptides Human genes 0.000 description 19
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 18
- 238000012546 transfer Methods 0.000 description 16
- 108020004414 DNA Proteins 0.000 description 15
- 239000000758 substrate Substances 0.000 description 15
- 108010061238 threonyl-glycine Proteins 0.000 description 15
- AXQLFFDZXPOFPO-UHFFFAOYSA-N UNPD216 Natural products O1C(CO)C(O)C(OC2C(C(O)C(O)C(CO)O2)O)C(NC(=O)C)C1OC(C1O)C(O)C(CO)OC1OC1C(O)C(O)C(O)OC1CO AXQLFFDZXPOFPO-UHFFFAOYSA-N 0.000 description 14
- AXQLFFDZXPOFPO-UNTPKZLMSA-N beta-D-Galp-(1->3)-beta-D-GlcpNAc-(1->3)-beta-D-Galp-(1->4)-beta-D-Glcp Chemical compound O([C@@H]1O[C@H](CO)[C@H](O)[C@@H]([C@H]1O)O[C@H]1[C@@H]([C@H]([C@H](O)[C@@H](CO)O1)O[C@H]1[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O1)O)NC(=O)C)[C@H]1[C@H](O)[C@@H](O)[C@H](O)O[C@@H]1CO AXQLFFDZXPOFPO-UNTPKZLMSA-N 0.000 description 14
- 150000001720 carbohydrates Chemical class 0.000 description 14
- 230000010354 integration Effects 0.000 description 14
- IEQCXFNWPAHHQR-UHFFFAOYSA-N lacto-N-neotetraose Natural products OCC1OC(OC2C(C(OC3C(OC(O)C(O)C3O)CO)OC(CO)C2O)O)C(NC(=O)C)C(O)C1OC1OC(CO)C(O)C(O)C1O IEQCXFNWPAHHQR-UHFFFAOYSA-N 0.000 description 14
- USIPEGYTBGEPJN-UHFFFAOYSA-N lacto-N-tetraose Natural products O1C(CO)C(O)C(OC2C(C(O)C(O)C(CO)O2)O)C(NC(=O)C)C1OC1C(O)C(CO)OC(OC(C(O)CO)C(O)C(O)C=O)C1O USIPEGYTBGEPJN-UHFFFAOYSA-N 0.000 description 14
- 229940062780 lacto-n-neotetraose Drugs 0.000 description 14
- RBMYDHMFFAVMMM-PLQWBNBWSA-N neolactotetraose Chemical compound O([C@H]1[C@H](O)[C@H]([C@@H](O[C@@H]1CO)O[C@@H]1[C@H]([C@H](O[C@H]([C@H](O)CO)[C@H](O)[C@@H](O)C=O)O[C@H](CO)[C@@H]1O)O)NC(=O)C)[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O RBMYDHMFFAVMMM-PLQWBNBWSA-N 0.000 description 14
- 238000000855 fermentation Methods 0.000 description 13
- 230000004151 fermentation Effects 0.000 description 13
- 239000000203 mixture Substances 0.000 description 13
- 101000893749 Arabidopsis thaliana Alpha-L-fucosidase 3 Proteins 0.000 description 12
- RJTOFDPWCJDYFZ-SPVZFZGWSA-N Lacto-N-triaose Chemical compound CC(=O)N[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](O)[C@H](O[C@H]([C@H](O)CO)[C@H](O)[C@@H](O)C=O)O[C@H](CO)[C@@H]1O RJTOFDPWCJDYFZ-SPVZFZGWSA-N 0.000 description 12
- 108010047857 aspartylglycine Proteins 0.000 description 12
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 12
- 108010050848 glycylleucine Proteins 0.000 description 12
- 230000002194 synthesizing effect Effects 0.000 description 12
- 108010047495 alanylglycine Proteins 0.000 description 11
- 150000002772 monosaccharides Chemical group 0.000 description 11
- 241000880493 Leptailurus serval Species 0.000 description 10
- 239000002243 precursor Substances 0.000 description 10
- 239000000047 product Substances 0.000 description 10
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 9
- 108010020764 Transposases Proteins 0.000 description 9
- 102000008579 Transposases Human genes 0.000 description 9
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 9
- 230000007062 hydrolysis Effects 0.000 description 9
- 238000006460 hydrolysis reaction Methods 0.000 description 9
- 108010057821 leucylproline Proteins 0.000 description 9
- 239000002609 medium Substances 0.000 description 9
- 230000002503 metabolic effect Effects 0.000 description 9
- 235000018102 proteins Nutrition 0.000 description 9
- 102000004169 proteins and genes Human genes 0.000 description 9
- 230000001105 regulatory effect Effects 0.000 description 9
- OVRNDRQMDRJTHS-RTRLPJTCSA-N N-acetyl-D-glucosamine Chemical group CC(=O)N[C@H]1C(O)O[C@H](CO)[C@@H](O)[C@@H]1O OVRNDRQMDRJTHS-RTRLPJTCSA-N 0.000 description 8
- HSCJRCZFDFQWRP-UHFFFAOYSA-N Uridindiphosphoglukose Natural products OC1C(O)C(O)C(CO)OC1OP(O)(=O)OP(O)(=O)OCC1C(O)C(O)C(N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-UHFFFAOYSA-N 0.000 description 8
- FZIVHOUANIQOMU-YIHIYSSUSA-N alpha-L-Fucp-(1->2)-beta-D-Galp-(1->3)-beta-D-GlcpNAc-(1->3)-beta-D-Galp-(1->4)-D-Glcp Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@H]1O[C@H]1[C@H](O[C@@H]2[C@H]([C@H](O[C@@H]3[C@H]([C@H](O[C@@H]4[C@H](OC(O)[C@H](O)[C@H]4O)CO)O[C@H](CO)[C@@H]3O)O)O[C@H](CO)[C@H]2O)NC(C)=O)O[C@H](CO)[C@H](O)[C@@H]1O FZIVHOUANIQOMU-YIHIYSSUSA-N 0.000 description 8
- 108010077245 asparaginyl-proline Proteins 0.000 description 8
- 238000009396 hybridization Methods 0.000 description 8
- FZIVHOUANIQOMU-UHFFFAOYSA-N lacto-N-fucopentaose I Natural products OC1C(O)C(O)C(C)OC1OC1C(OC2C(C(OC3C(C(OC4C(OC(O)C(O)C4O)CO)OC(CO)C3O)O)OC(CO)C2O)NC(C)=O)OC(CO)C(O)C1O FZIVHOUANIQOMU-UHFFFAOYSA-N 0.000 description 8
- 230000004048 modification Effects 0.000 description 8
- 238000012986 modification Methods 0.000 description 8
- 108010046068 N-Acetyllactosamine Synthase Proteins 0.000 description 7
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 7
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 7
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 7
- 230000000295 complement effect Effects 0.000 description 7
- 229930182830 galactose Natural products 0.000 description 7
- 108010037850 glycylvaline Proteins 0.000 description 7
- 102000004190 Enzymes Human genes 0.000 description 6
- 108090000790 Enzymes Proteins 0.000 description 6
- 102100021700 Glycoprotein-N-acetylgalactosamine 3-beta-galactosyltransferase 1 Human genes 0.000 description 6
- 101000896564 Homo sapiens Glycoprotein-N-acetylgalactosamine 3-beta-galactosyltransferase 1 Proteins 0.000 description 6
- 108010065920 Insulin Lispro Proteins 0.000 description 6
- SQVRNKJHWKZAKO-PFQGKNLYSA-N N-acetyl-beta-neuraminic acid Chemical group CC(=O)N[C@@H]1[C@@H](O)C[C@@](O)(C(O)=O)O[C@H]1[C@H](O)[C@H](O)CO SQVRNKJHWKZAKO-PFQGKNLYSA-N 0.000 description 6
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 6
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 6
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 6
- USAZACJQJDHAJH-KDEXOMDGSA-N [[(2r,3s,4r,5s)-5-(2,4-dioxo-1h-pyrimidin-6-yl)-3,4-dihydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl] [(2r,3r,4s,5r,6r)-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl] hydrogen phosphate Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](C=2NC(=O)NC(=O)C=2)O1 USAZACJQJDHAJH-KDEXOMDGSA-N 0.000 description 6
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 6
- 108010005233 alanylglutamic acid Proteins 0.000 description 6
- TYALNJQZQRNQNQ-JLYOMPFMSA-N alpha-Neup5Ac-(2->6)-beta-D-Galp-(1->4)-beta-D-Glcp Chemical compound O1[C@@H]([C@H](O)[C@H](O)CO)[C@H](NC(=O)C)[C@@H](O)C[C@@]1(C(O)=O)OC[C@@H]1[C@H](O)[C@H](O)[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](O)[C@H](O)O[C@@H]2CO)O)O1 TYALNJQZQRNQNQ-JLYOMPFMSA-N 0.000 description 6
- 108010015792 glycyllysine Proteins 0.000 description 6
- 238000004128 high performance liquid chromatography Methods 0.000 description 6
- 108010054155 lysyllysine Proteins 0.000 description 6
- 108010051242 phenylalanylserine Proteins 0.000 description 6
- 108010029020 prolylglycine Proteins 0.000 description 6
- 238000013518 transcription Methods 0.000 description 6
- 230000035897 transcription Effects 0.000 description 6
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 5
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 5
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 5
- SHZGCJCMOBCMKK-PQMKYFCFSA-N L-Fucose Natural products C[C@H]1O[C@H](O)[C@@H](O)[C@@H](O)[C@@H]1O SHZGCJCMOBCMKK-PQMKYFCFSA-N 0.000 description 5
- PNNNRSAQSRJVSB-UHFFFAOYSA-N L-rhamnose Natural products CC(O)C(O)C(O)C(O)C=O PNNNRSAQSRJVSB-UHFFFAOYSA-N 0.000 description 5
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 5
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 5
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 5
- 108010041407 alanylaspartic acid Proteins 0.000 description 5
- 108010087924 alanylproline Proteins 0.000 description 5
- 102000012086 alpha-L-Fucosidase Human genes 0.000 description 5
- 108010061314 alpha-L-Fucosidase Proteins 0.000 description 5
- 108010092854 aspartyllysine Proteins 0.000 description 5
- 230000001580 bacterial effect Effects 0.000 description 5
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 5
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 5
- 108010089804 glycyl-threonine Proteins 0.000 description 5
- 108010010147 glycylglutamine Proteins 0.000 description 5
- 108010034529 leucyl-lysine Proteins 0.000 description 5
- 108010064235 lysylglycine Proteins 0.000 description 5
- 238000003786 synthesis reaction Methods 0.000 description 5
- 108010073969 valyllysine Proteins 0.000 description 5
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 4
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 4
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 4
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 4
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 4
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 4
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 4
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 4
- 241000894006 Bacteria Species 0.000 description 4
- 241001608472 Bifidobacterium longum Species 0.000 description 4
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 4
- 229930182566 Gentamicin Natural products 0.000 description 4
- CEAZRRDELHUEMR-URQXQFDESA-N Gentamicin Chemical compound O1[C@H](C(C)NC)CC[C@@H](N)[C@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](NC)[C@@](C)(O)CO2)O)[C@H](N)C[C@@H]1N CEAZRRDELHUEMR-URQXQFDESA-N 0.000 description 4
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 4
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 4
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 4
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 4
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 4
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 4
- 241000831652 Salinivibrio sharmensis Species 0.000 description 4
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 4
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 4
- GRIUMVXCJDKVPI-IZPVPAKOSA-N Thr-Thr-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GRIUMVXCJDKVPI-IZPVPAKOSA-N 0.000 description 4
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 4
- LFTYTUAZOPRMMI-CFRASDGPSA-N UDP-N-acetyl-alpha-D-glucosamine Chemical compound O1[C@H](CO)[C@@H](O)[C@H](O)[C@@H](NC(=O)C)[C@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(NC(=O)C=C2)=O)O1 LFTYTUAZOPRMMI-CFRASDGPSA-N 0.000 description 4
- LFTYTUAZOPRMMI-UHFFFAOYSA-N UNPD164450 Natural products O1C(CO)C(O)C(O)C(NC(=O)C)C1OP(O)(=O)OP(O)(=O)OCC1C(O)C(O)C(N2C(NC(=O)C=C2)=O)O1 LFTYTUAZOPRMMI-UHFFFAOYSA-N 0.000 description 4
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 4
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 4
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 4
- 108010068265 aspartyltyrosine Proteins 0.000 description 4
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 4
- SQVRNKJHWKZAKO-UHFFFAOYSA-N beta-N-Acetyl-D-neuraminic acid Natural products CC(=O)NC1C(O)CC(O)(C(O)=O)OC1C(O)C(O)CO SQVRNKJHWKZAKO-UHFFFAOYSA-N 0.000 description 4
- 229940009291 bifidobacterium longum Drugs 0.000 description 4
- 238000012217 deletion Methods 0.000 description 4
- 230000037430 deletion Effects 0.000 description 4
- 230000002068 genetic effect Effects 0.000 description 4
- 229960002518 gentamicin Drugs 0.000 description 4
- 108010049041 glutamylalanine Proteins 0.000 description 4
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 4
- guanosine diphosphate-activated L-fucose Chemical class 0.000 description 4
- 235000020256 human milk Nutrition 0.000 description 4
- 108010038320 lysylphenylalanine Proteins 0.000 description 4
- 235000015097 nutrients Nutrition 0.000 description 4
- 235000016709 nutrition Nutrition 0.000 description 4
- 239000001301 oxygen Substances 0.000 description 4
- 229910052760 oxygen Inorganic materials 0.000 description 4
- 238000002360 preparation method Methods 0.000 description 4
- 108010053725 prolylvaline Proteins 0.000 description 4
- 230000006798 recombination Effects 0.000 description 4
- 238000005215 recombination Methods 0.000 description 4
- 108010071207 serylmethionine Proteins 0.000 description 4
- 108010038745 tryptophylglycine Proteins 0.000 description 4
- DVGKRPYUFRZAQW-UHFFFAOYSA-N 3 prime Natural products CC(=O)NC1OC(CC(O)C1C(O)C(O)CO)(OC2C(O)C(CO)OC(OC3C(O)C(O)C(O)OC3CO)C2O)C(=O)O DVGKRPYUFRZAQW-UHFFFAOYSA-N 0.000 description 3
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 3
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 3
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 3
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 3
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 3
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 3
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 3
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 3
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 3
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 3
- XEGZSHSPQNDNRH-JRQIVUDYSA-N Asn-Tyr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XEGZSHSPQNDNRH-JRQIVUDYSA-N 0.000 description 3
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 3
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 3
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 3
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 3
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 3
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 3
- 108010090461 DFG peptide Proteins 0.000 description 3
- 241001646716 Escherichia coli K-12 Species 0.000 description 3
- 241000233866 Fungi Species 0.000 description 3
- LQEBEXMHBLQMDB-UHFFFAOYSA-N GDP-L-fucose Natural products OC1C(O)C(O)C(C)OC1OP(O)(=O)OP(O)(=O)OCC1C(O)C(O)C(N2C3=C(C(N=C(N)N3)=O)N=C2)O1 LQEBEXMHBLQMDB-UHFFFAOYSA-N 0.000 description 3
- 102000002464 Galactosidases Human genes 0.000 description 3
- 108010093031 Galactosidases Proteins 0.000 description 3
- NTHIHAUEXVTXQG-KKUMJFAQSA-N Glu-Tyr-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O NTHIHAUEXVTXQG-KKUMJFAQSA-N 0.000 description 3
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 3
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 3
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 3
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 3
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 3
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 3
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 3
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 3
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 3
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 3
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 3
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 3
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 3
- 102000005348 Neuraminidase Human genes 0.000 description 3
- 108010006232 Neuraminidase Proteins 0.000 description 3
- 241000592795 Paenibacillus sp. Species 0.000 description 3
- 240000004713 Pisum sativum Species 0.000 description 3
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 3
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 3
- 241000193448 Ruminiclostridium thermocellum Species 0.000 description 3
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 3
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 3
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 3
- 241000193998 Streptococcus pneumoniae Species 0.000 description 3
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 3
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 3
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 3
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 3
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 3
- 108091023040 Transcription factor Proteins 0.000 description 3
- 102000040945 Transcription factor Human genes 0.000 description 3
- ULHASJWZGUEUNN-XIRDDKMYSA-N Trp-Lys-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O ULHASJWZGUEUNN-XIRDDKMYSA-N 0.000 description 3
- QAYSODICXVZUIA-WLTAIBSBSA-N Tyr-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QAYSODICXVZUIA-WLTAIBSBSA-N 0.000 description 3
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 3
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 3
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 3
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 3
- 230000009471 action Effects 0.000 description 3
- 238000013459 approach Methods 0.000 description 3
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 3
- 108010013835 arginine glutamate Proteins 0.000 description 3
- 108010062796 arginyllysine Proteins 0.000 description 3
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 3
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 3
- 230000027455 binding Effects 0.000 description 3
- 230000003115 biocidal effect Effects 0.000 description 3
- 229910052799 carbon Inorganic materials 0.000 description 3
- 230000001413 cellular effect Effects 0.000 description 3
- 230000002759 chromosomal effect Effects 0.000 description 3
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 3
- 239000012228 culture supernatant Substances 0.000 description 3
- 239000003623 enhancer Substances 0.000 description 3
- 108010001671 galactoside 3-fucosyltransferase Proteins 0.000 description 3
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 3
- 125000002791 glucosyl group Chemical group C1([C@H](O)[C@@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 3
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 3
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 3
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 3
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 3
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 3
- 108010081551 glycylphenylalanine Proteins 0.000 description 3
- 239000001963 growth medium Substances 0.000 description 3
- 229910052500 inorganic mineral Inorganic materials 0.000 description 3
- 239000000543 intermediate Substances 0.000 description 3
- 108010017391 lysylvaline Proteins 0.000 description 3
- 108010056582 methionylglutamic acid Proteins 0.000 description 3
- 239000011707 mineral Substances 0.000 description 3
- 235000010755 mineral Nutrition 0.000 description 3
- 101150038284 pfkA gene Proteins 0.000 description 3
- 108010031719 prolyl-serine Proteins 0.000 description 3
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 3
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 3
- 235000019333 sodium laurylsulphate Nutrition 0.000 description 3
- 241000894007 species Species 0.000 description 3
- 229940031000 streptococcus pneumoniae Drugs 0.000 description 3
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 3
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 3
- 230000017105 transposition Effects 0.000 description 3
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 3
- 108010020532 tyrosyl-proline Proteins 0.000 description 3
- 108010003137 tyrosyltyrosine Proteins 0.000 description 3
- 210000005253 yeast cell Anatomy 0.000 description 3
- WQZGKKKJIJFFOK-SVZMEOIVSA-N (+)-Galactose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-SVZMEOIVSA-N 0.000 description 2
- 101710099475 3'-phosphoadenosine 5'-phosphate phosphatase Proteins 0.000 description 2
- OIZGSVFYNBZVIK-FHHHURIISA-N 3'-sialyllactose Chemical compound O1[C@@H]([C@H](O)[C@H](O)CO)[C@H](NC(=O)C)[C@@H](O)C[C@@]1(C(O)=O)O[C@@H]1[C@@H](O)[C@H](O[C@H]([C@H](O)CO)[C@H](O)[C@@H](O)C=O)O[C@H](CO)[C@@H]1O OIZGSVFYNBZVIK-FHHHURIISA-N 0.000 description 2
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 2
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 2
- PXKLCFFSVLKOJM-ACZMJKKPSA-N Ala-Asn-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXKLCFFSVLKOJM-ACZMJKKPSA-N 0.000 description 2
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 2
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 2
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 2
- AWAXZRDKUHOPBO-GUBZILKMSA-N Ala-Gln-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O AWAXZRDKUHOPBO-GUBZILKMSA-N 0.000 description 2
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 2
- UHMQKOBNPRAZGB-CIUDSAMLSA-N Ala-Glu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N UHMQKOBNPRAZGB-CIUDSAMLSA-N 0.000 description 2
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 2
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 2
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 2
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 2
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 2
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 2
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 2
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 2
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 2
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 2
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 2
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 2
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 2
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 2
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 2
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 2
- FRMQITGHXMUNDF-GMOBBJLQSA-N Arg-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FRMQITGHXMUNDF-GMOBBJLQSA-N 0.000 description 2
- YKBHOXLMMPZPHQ-GMOBBJLQSA-N Arg-Ile-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O YKBHOXLMMPZPHQ-GMOBBJLQSA-N 0.000 description 2
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 2
- OVQJAKFLFTZDNC-GUBZILKMSA-N Arg-Pro-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O OVQJAKFLFTZDNC-GUBZILKMSA-N 0.000 description 2
- AWMAZIIEFPFHCP-RCWTZXSCSA-N Arg-Pro-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWMAZIIEFPFHCP-RCWTZXSCSA-N 0.000 description 2
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 2
- CGWVCWFQGXOUSJ-ULQDDVLXSA-N Arg-Tyr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O CGWVCWFQGXOUSJ-ULQDDVLXSA-N 0.000 description 2
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 2
- GOVUDFOGXOONFT-VEVYYDQMSA-N Asn-Arg-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GOVUDFOGXOONFT-VEVYYDQMSA-N 0.000 description 2
- XQQVCUIBGYFKDC-OLHMAJIHSA-N Asn-Asp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XQQVCUIBGYFKDC-OLHMAJIHSA-N 0.000 description 2
- SJPZTWAYTJPPBI-GUBZILKMSA-N Asn-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SJPZTWAYTJPPBI-GUBZILKMSA-N 0.000 description 2
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 2
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 2
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 2
- JEEFEQCRXKPQHC-KKUMJFAQSA-N Asn-Leu-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JEEFEQCRXKPQHC-KKUMJFAQSA-N 0.000 description 2
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 2
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 2
- PPCORQFLAZWUNO-QWRGUYRKSA-N Asn-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N PPCORQFLAZWUNO-QWRGUYRKSA-N 0.000 description 2
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 2
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 2
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 2
- YSYTWUMRHSFODC-QWRGUYRKSA-N Asn-Tyr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O YSYTWUMRHSFODC-QWRGUYRKSA-N 0.000 description 2
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 2
- XPGVTUBABLRGHY-BIIVOSGPSA-N Asp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N XPGVTUBABLRGHY-BIIVOSGPSA-N 0.000 description 2
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 2
- ATYWBXGNXZYZGI-ACZMJKKPSA-N Asp-Asn-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ATYWBXGNXZYZGI-ACZMJKKPSA-N 0.000 description 2
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 2
- RQYMKRMRZWJGHC-BQBZGAKWSA-N Asp-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N RQYMKRMRZWJGHC-BQBZGAKWSA-N 0.000 description 2
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 2
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 2
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 2
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 2
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 2
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 2
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 2
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 2
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 2
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 2
- TXCIAUNLDRJGJZ-UHFFFAOYSA-N CMP-N-acetyl neuraminic acid Natural products O1C(C(O)C(O)CO)C(NC(=O)C)C(O)CC1(C(O)=O)OP(O)(=O)OCC1C(O)C(O)C(N2C(N=C(N)C=C2)=O)O1 TXCIAUNLDRJGJZ-UHFFFAOYSA-N 0.000 description 2
- TXCIAUNLDRJGJZ-BILDWYJOSA-N CMP-N-acetyl-beta-neuraminic acid Chemical compound O1[C@@H]([C@H](O)[C@H](O)CO)[C@H](NC(=O)C)[C@@H](O)C[C@]1(C(O)=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(N=C(N)C=C2)=O)O1 TXCIAUNLDRJGJZ-BILDWYJOSA-N 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- GSXOAOHZAIYLCY-UHFFFAOYSA-N D-F6P Natural products OCC(=O)C(O)C(O)C(O)COP(O)(O)=O GSXOAOHZAIYLCY-UHFFFAOYSA-N 0.000 description 2
- 102000052510 DNA-Binding Proteins Human genes 0.000 description 2
- 108700020911 DNA-Binding Proteins Proteins 0.000 description 2
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 2
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 2
- 241000196324 Embryophyta Species 0.000 description 2
- 101100280818 Escherichia coli (strain K12) fcl gene Proteins 0.000 description 2
- 101710196411 Fructose-1,6-bisphosphatase Proteins 0.000 description 2
- 101710186733 Fructose-1,6-bisphosphatase, chloroplastic Proteins 0.000 description 2
- 101710109119 Fructose-1,6-bisphosphatase, cytosolic Proteins 0.000 description 2
- 101710198902 Fructose-1,6-bisphosphate aldolase/phosphatase Proteins 0.000 description 2
- LQEBEXMHBLQMDB-JGQUBWHWSA-N GDP-beta-L-fucose Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C3=C(C(NC(N)=N3)=O)N=C2)O1 LQEBEXMHBLQMDB-JGQUBWHWSA-N 0.000 description 2
- NKCZYEDZTKOFBG-GUBZILKMSA-N Gln-Gln-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NKCZYEDZTKOFBG-GUBZILKMSA-N 0.000 description 2
- QFJPFPCSXOXMKI-BPUTZDHNSA-N Gln-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N QFJPFPCSXOXMKI-BPUTZDHNSA-N 0.000 description 2
- NSNUZSPSADIMJQ-WDSKDSINSA-N Gln-Gly-Asp Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NSNUZSPSADIMJQ-WDSKDSINSA-N 0.000 description 2
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 2
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 2
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 2
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 2
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 2
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 2
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 2
- UZWUBBRJWFTHTD-LAEOZQHASA-N Glu-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O UZWUBBRJWFTHTD-LAEOZQHASA-N 0.000 description 2
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 2
- 102000004366 Glucosidases Human genes 0.000 description 2
- 108010056771 Glucosidases Proteins 0.000 description 2
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 2
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 2
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 2
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 2
- LERGJIVJIIODPZ-ZANVPECISA-N Gly-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)CN)C)C(O)=O)=CNC2=C1 LERGJIVJIIODPZ-ZANVPECISA-N 0.000 description 2
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 2
- XUORRGAFUQIMLC-STQMWFEESA-N Gly-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN)O XUORRGAFUQIMLC-STQMWFEESA-N 0.000 description 2
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 2
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 2
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 2
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 2
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 2
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 2
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 2
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 2
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 2
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 2
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 2
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 2
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 2
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 2
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 2
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 2
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 2
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 2
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 2
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 2
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 2
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 2
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 2
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 2
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 2
- JFFAPRNXXLRINI-NHCYSSNCSA-N His-Asp-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JFFAPRNXXLRINI-NHCYSSNCSA-N 0.000 description 2
- KAXZXLSXFWSNNZ-XVYDVKMFSA-N His-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KAXZXLSXFWSNNZ-XVYDVKMFSA-N 0.000 description 2
- LPBWRHRHEIYAIP-KKUMJFAQSA-N His-Tyr-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LPBWRHRHEIYAIP-KKUMJFAQSA-N 0.000 description 2
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 2
- HZMLFETXHFHGBB-UGYAYLCHSA-N Ile-Asn-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZMLFETXHFHGBB-UGYAYLCHSA-N 0.000 description 2
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 2
- NBJAAWYRLGCJOF-UGYAYLCHSA-N Ile-Asp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NBJAAWYRLGCJOF-UGYAYLCHSA-N 0.000 description 2
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 2
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 2
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 2
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 2
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 2
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 2
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 2
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 2
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 2
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 2
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 2
- TVVLIFCVJJSLBL-SEHWTJTBSA-N Lacto-N-fucopentaose V Chemical compound O[C@H]1C(O)C(O)[C@H](C)O[C@H]1OC([C@@H](O)C=O)[C@@H](C(O)CO)O[C@H]1[C@H](O)[C@@H](OC2[C@@H](C(OC3[C@@H](C(O)C(O)[C@@H](CO)O3)O)[C@H](O)[C@@H](CO)O2)NC(C)=O)[C@@H](O)[C@@H](CO)O1 TVVLIFCVJJSLBL-SEHWTJTBSA-N 0.000 description 2
- LKOHREGGXUJGKC-UHFFFAOYSA-N Lactodifucotetraose Natural products OC1C(O)C(O)C(C)OC1OC1C(OC2C(C(O)C(O)OC2CO)OC2C(C(O)C(O)C(C)O2)O)OC(CO)C(O)C1O LKOHREGGXUJGKC-UHFFFAOYSA-N 0.000 description 2
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 2
- MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 description 2
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 2
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 2
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 2
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 2
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 2
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 2
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 2
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 2
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 2
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 2
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 2
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 2
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 2
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 2
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 2
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 2
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 2
- DFXQCCBKGUNYGG-GUBZILKMSA-N Lys-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN DFXQCCBKGUNYGG-GUBZILKMSA-N 0.000 description 2
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 2
- VLMNBMFYRMGEMB-QWRGUYRKSA-N Lys-His-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 VLMNBMFYRMGEMB-QWRGUYRKSA-N 0.000 description 2
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 2
- BXPHMHQHYHILBB-BZSNNMDCSA-N Lys-Lys-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BXPHMHQHYHILBB-BZSNNMDCSA-N 0.000 description 2
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 2
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 2
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 2
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 2
- OHMKUHXCDSCOMT-QXEWZRGKSA-N Met-Asn-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHMKUHXCDSCOMT-QXEWZRGKSA-N 0.000 description 2
- SODXFJOPSCXOHE-IHRRRGAJSA-N Met-Leu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O SODXFJOPSCXOHE-IHRRRGAJSA-N 0.000 description 2
- DOQXHOUYYSPISL-SZMVWBNQSA-N Met-Trp-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCSC)C(=O)O)N DOQXHOUYYSPISL-SZMVWBNQSA-N 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 2
- OVRNDRQMDRJTHS-UHFFFAOYSA-N N-acetyl-D-glucosamine Natural products CC(=O)NC1C(O)OC(CO)C(O)C1O 0.000 description 2
- MBLBDJOUHNCFQT-LXGUWJNJSA-N N-acetylglucosamine Natural products CC(=O)N[C@@H](C=O)[C@@H](O)[C@H](O)[C@H](O)CO MBLBDJOUHNCFQT-LXGUWJNJSA-N 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- 108010066427 N-valyltryptophan Proteins 0.000 description 2
- 108091034117 Oligonucleotide Proteins 0.000 description 2
- CUMXHKAOHNWRFQ-BZSNNMDCSA-N Phe-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CUMXHKAOHNWRFQ-BZSNNMDCSA-N 0.000 description 2
- RMKGXGPQIPLTFC-KKUMJFAQSA-N Phe-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RMKGXGPQIPLTFC-KKUMJFAQSA-N 0.000 description 2
- VGTJSEYTVMAASM-RPTUDFQQSA-N Phe-Thr-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VGTJSEYTVMAASM-RPTUDFQQSA-N 0.000 description 2
- DBNGDEAQXGFGRA-ACRUOGEOSA-N Phe-Tyr-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DBNGDEAQXGFGRA-ACRUOGEOSA-N 0.000 description 2
- CDHURCQGUDNBMA-UBHSHLNASA-N Phe-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDHURCQGUDNBMA-UBHSHLNASA-N 0.000 description 2
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 2
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 2
- BCNRNJWSRFDPTQ-HJWJTTGWSA-N Pro-Ile-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BCNRNJWSRFDPTQ-HJWJTTGWSA-N 0.000 description 2
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 2
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 2
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 2
- WVXQQUWOKUZIEG-VEVYYDQMSA-N Pro-Thr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O WVXQQUWOKUZIEG-VEVYYDQMSA-N 0.000 description 2
- HRIXMVRZRGFKNQ-HJGDQZAQSA-N Pro-Thr-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HRIXMVRZRGFKNQ-HJGDQZAQSA-N 0.000 description 2
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 2
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 2
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 2
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 2
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 2
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 2
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 2
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 2
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 2
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 2
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 2
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 2
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 2
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 2
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 2
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 2
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 2
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 2
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 2
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 2
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 2
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 2
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 2
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 2
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 2
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 2
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 2
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 2
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 2
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 2
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 2
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 2
- WFAUDCSNCWJJAA-KXNHARMFSA-N Thr-Lys-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(O)=O WFAUDCSNCWJJAA-KXNHARMFSA-N 0.000 description 2
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 2
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 2
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 2
- HUPLKEHTTQBXSC-YJRXYDGGSA-N Thr-Ser-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUPLKEHTTQBXSC-YJRXYDGGSA-N 0.000 description 2
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 2
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 2
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 2
- ZKVANNIVSDOQMG-HKUYNNGSSA-N Trp-Tyr-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)NCC(=O)O)N ZKVANNIVSDOQMG-HKUYNNGSSA-N 0.000 description 2
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 2
- ZRPLVTZTKPPSBT-AVGNSLFASA-N Tyr-Glu-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZRPLVTZTKPPSBT-AVGNSLFASA-N 0.000 description 2
- UNUZEBFXGWVAOP-DZKIICNBSA-N Tyr-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UNUZEBFXGWVAOP-DZKIICNBSA-N 0.000 description 2
- USYGMBIIUDLYHJ-GVARAGBVSA-N Tyr-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 USYGMBIIUDLYHJ-GVARAGBVSA-N 0.000 description 2
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 2
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 2
- OFHKXNKJXURPSY-ULQDDVLXSA-N Tyr-Met-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O OFHKXNKJXURPSY-ULQDDVLXSA-N 0.000 description 2
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 2
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 2
- RMRFSFXLFWWAJZ-HJOGWXRNSA-N Tyr-Tyr-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 RMRFSFXLFWWAJZ-HJOGWXRNSA-N 0.000 description 2
- HSCJRCZFDFQWRP-JZMIEXBBSA-N UDP-alpha-D-glucose Chemical group O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-JZMIEXBBSA-N 0.000 description 2
- XCCTYIAWTASOJW-XVFCMESISA-N Uridine-5'-Diphosphate Chemical compound O[C@@H]1[C@H](O)[C@@H](COP(O)(=O)OP(O)(O)=O)O[C@H]1N1C(=O)NC(=O)C=C1 XCCTYIAWTASOJW-XVFCMESISA-N 0.000 description 2
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 2
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 2
- LABUITCFCAABSV-UHFFFAOYSA-N Val-Ala-Tyr Natural products CC(C)C(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LABUITCFCAABSV-UHFFFAOYSA-N 0.000 description 2
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 2
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 2
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 2
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 2
- AAOPYWQQBXHINJ-DZKIICNBSA-N Val-Gln-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AAOPYWQQBXHINJ-DZKIICNBSA-N 0.000 description 2
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 2
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 2
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 2
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 2
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 2
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 2
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 2
- CXWJFWAZIVWBOS-XQQFMLRXSA-N Val-Lys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CXWJFWAZIVWBOS-XQQFMLRXSA-N 0.000 description 2
- UOUIMEGEPSBZIV-ULQDDVLXSA-N Val-Lys-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UOUIMEGEPSBZIV-ULQDDVLXSA-N 0.000 description 2
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 2
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 2
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 2
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 2
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 2
- DVLWZWNAQUBZBC-ZNSHCXBVSA-N Val-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N)O DVLWZWNAQUBZBC-ZNSHCXBVSA-N 0.000 description 2
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 2
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 2
- UEXPMFIAZZHEAD-HSHDSVGOSA-N Val-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](C(C)C)N)O UEXPMFIAZZHEAD-HSHDSVGOSA-N 0.000 description 2
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 description 2
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 2
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers 
Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 2
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 2
- 108010044940 alanylglutamine Proteins 0.000 description 2
- LKOHREGGXUJGKC-GXSKDVPZSA-N alpha-L-Fucp-(1->3)-[alpha-L-Fucp-(1->2)-beta-D-Galp-(1->4)]-beta-D-Glcp Chemical compound C[C@@H]1O[C@@H](O[C@@H]2[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]2O[C@@H]2[C@@H](CO)O[C@@H](O)[C@H](O)[C@H]2O[C@@H]2O[C@@H](C)[C@@H](O)[C@@H](O)[C@@H]2O)[C@@H](O)[C@H](O)[C@@H]1O LKOHREGGXUJGKC-GXSKDVPZSA-N 0.000 description 2
- CMQZRJBJDCVIEY-JEOLMMCMSA-N alpha-L-Fucp-(1->3)-[beta-D-Galp-(1->4)]-beta-D-GlcpNAc-(1->3)-beta-D-Galp-(1->4)-D-Glcp Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@H]1O[C@H]1[C@H](O[C@H]2[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O2)O)[C@@H](CO)O[C@@H](O[C@@H]2[C@H]([C@H](O[C@@H]3[C@H](OC(O)[C@H](O)[C@H]3O)CO)O[C@H](CO)[C@@H]2O)O)[C@@H]1NC(C)=O CMQZRJBJDCVIEY-JEOLMMCMSA-N 0.000 description 2
- DUKURNFHYQXCJG-JEOLMMCMSA-N alpha-L-Fucp-(1->4)-[beta-D-Galp-(1->3)]-beta-D-GlcpNAc-(1->3)-beta-D-Galp-(1->4)-D-Glcp Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@H]1O[C@H]1[C@H](O[C@H]2[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O2)O)[C@@H](NC(C)=O)[C@H](O[C@@H]2[C@H]([C@H](O[C@@H]3[C@H](OC(O)[C@H](O)[C@H]3O)CO)O[C@H](CO)[C@@H]2O)O)O[C@@H]1CO DUKURNFHYQXCJG-JEOLMMCMSA-N 0.000 description 2
- 108010008355 arginyl-glutamine Proteins 0.000 description 2
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 2
- XMQFTWRPUQYINF-UHFFFAOYSA-N bensulfuron-methyl Chemical compound COC(=O)C1=CC=CC=C1CS(=O)(=O)NC(=O)NC1=NC(OC)=CC(OC)=N1 XMQFTWRPUQYINF-UHFFFAOYSA-N 0.000 description 2
- BGWGXPAPYGQALX-ARQDHWQXSA-N beta-D-fructofuranose 6-phosphate Chemical compound OC[C@@]1(O)O[C@H](COP(O)(O)=O)[C@@H](O)[C@@H]1O BGWGXPAPYGQALX-ARQDHWQXSA-N 0.000 description 2
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 2
- 230000001851 biosynthetic effect Effects 0.000 description 2
- 229940098773 bovine serum albumin Drugs 0.000 description 2
- 239000013592 cell lysate Substances 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 2
- 210000003763 chloroplast Anatomy 0.000 description 2
- RPKLZQLYODPWTM-KBMWBBLPSA-N cholanoic acid Chemical compound C1CC2CCCC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@@H](CCC(O)=O)C)[C@@]1(C)CC2 RPKLZQLYODPWTM-KBMWBBLPSA-N 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 230000001276 controlling effect Effects 0.000 description 2
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 2
- 230000007071 enzymatic hydrolysis Effects 0.000 description 2
- 238000006047 enzymatic hydrolysis reaction Methods 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- 101150050376 fbaB gene Proteins 0.000 description 2
- 239000012634 fragment Substances 0.000 description 2
- 239000000446 fuel Substances 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 2
- 230000001890 gluconeogenic effect Effects 0.000 description 2
- 239000008103 glucose Substances 0.000 description 2
- 108010078144 glutaminyl-glycine Proteins 0.000 description 2
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 2
- 150000004676 glycans Chemical class 0.000 description 2
- 102000045442 glycosyltransferase activity proteins Human genes 0.000 description 2
- 108700014210 glycosyltransferase activity proteins Proteins 0.000 description 2
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 2
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 2
- 108010087823 glycyltyrosine Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 210000004251 human milk Anatomy 0.000 description 2
- 230000003301 hydrolyzing effect Effects 0.000 description 2
- 230000002779 inactivation Effects 0.000 description 2
- 230000000977 initiatory effect Effects 0.000 description 2
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 2
- FKADDOYBRRMBPP-UHFFFAOYSA-N lacto-N-fucopentaose II Natural products OC1C(O)C(O)C(C)OC1OC1C(OC2C(C(O)C(O)C(CO)O2)O)C(NC(C)=O)C(OC2C(C(OC(C(O)CO)C(O)C(O)C=O)OC(CO)C2O)O)OC1CO FKADDOYBRRMBPP-UHFFFAOYSA-N 0.000 description 2
- CMQZRJBJDCVIEY-UHFFFAOYSA-N lacto-N-fucopentaose III Natural products OC1C(O)C(O)C(C)OC1OC1C(OC2C(C(O)C(O)C(CO)O2)O)C(CO)OC(OC2C(C(OC3C(OC(O)C(O)C3O)CO)OC(CO)C2O)O)C1NC(C)=O CMQZRJBJDCVIEY-UHFFFAOYSA-N 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 108010012058 leucyltyrosine Proteins 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 2
- 108010009298 lysylglutamic acid Proteins 0.000 description 2
- 239000002184 metal Substances 0.000 description 2
- 229910052751 metal Inorganic materials 0.000 description 2
- 150000002739 metals Chemical class 0.000 description 2
- 238000001823 molecular biology technique Methods 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 238000002703 mutagenesis Methods 0.000 description 2
- 231100000350 mutagenesis Toxicity 0.000 description 2
- 108010084572 phenylalanyl-valine Proteins 0.000 description 2
- 108010024607 phenylalanylalanine Proteins 0.000 description 2
- 239000013612 plasmid Substances 0.000 description 2
- 229920001282 polysaccharide Polymers 0.000 description 2
- 239000005017 polysaccharide Substances 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 239000011541 reaction mixture Substances 0.000 description 2
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 2
- 108010026333 seryl-proline Proteins 0.000 description 2
- 239000011734 sodium Substances 0.000 description 2
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 2
- 235000000346 sugar Nutrition 0.000 description 2
- 150000008163 sugars Chemical class 0.000 description 2
- 108010044292 tryptophyltyrosine Proteins 0.000 description 2
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 2
- TYALNJQZQRNQNQ-UHFFFAOYSA-N α2,6-sialyllactose Natural products O1C(C(O)C(O)CO)C(NC(=O)C)C(O)CC1(C(O)=O)OCC1C(O)C(O)C(O)C(OC2C(C(O)C(O)OC2CO)O)O1 TYALNJQZQRNQNQ-UHFFFAOYSA-N 0.000 description 1
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 1
- CILYIEBUXJIHCO-UHFFFAOYSA-N 102778-91-6 Natural products O1C(C(O)C(O)CO)C(NC(=O)C)C(O)CC1(C(O)=O)OC1C(O)C(OC2C(C(O)C(O)OC2CO)O)OC(CO)C1O CILYIEBUXJIHCO-UHFFFAOYSA-N 0.000 description 1
- 101710157736 ATP-dependent 6-phosphofructokinase Proteins 0.000 description 1
- 208000004998 Abdominal Pain Diseases 0.000 description 1
- 102100033647 Activity-regulated cytoskeleton-associated protein Human genes 0.000 description 1
- 241000606828 Aggregatibacter aphrophilus Species 0.000 description 1
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- ODWSTKXGQGYHSH-FXQIFTODSA-N Ala-Arg-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O ODWSTKXGQGYHSH-FXQIFTODSA-N 0.000 description 1
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 1
- DWINFPQUSSHSFS-UVBJJODRSA-N Ala-Arg-Trp Chemical compound N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C12)C(=O)O DWINFPQUSSHSFS-UVBJJODRSA-N 0.000 description 1
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 1
- ZEXDYVGDZJBRMO-ACZMJKKPSA-N Ala-Asn-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZEXDYVGDZJBRMO-ACZMJKKPSA-N 0.000 description 1
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 1
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 1
- XQJAFSDFQZPYCU-UWJYBYFXSA-N Ala-Asn-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N XQJAFSDFQZPYCU-UWJYBYFXSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 1
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 1
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 1
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- BVSGPHDECMJBDE-HGNGGELXSA-N Ala-Glu-His Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BVSGPHDECMJBDE-HGNGGELXSA-N 0.000 description 1
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 1
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 1
- VBRDBGCROKWTPV-XHNCKOQMSA-N Ala-Glu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N VBRDBGCROKWTPV-XHNCKOQMSA-N 0.000 description 1
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 1
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 1
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 1
- JEPNLGMEZMCFEX-QSFUFRPTSA-N Ala-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C)N JEPNLGMEZMCFEX-QSFUFRPTSA-N 0.000 description 1
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 1
- GSHKMNKPMLXSQW-KBIXCLLPSA-N Ala-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C)N GSHKMNKPMLXSQW-KBIXCLLPSA-N 0.000 description 1
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 1
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- FUKFQILQFQKHLE-DCAQKATOSA-N Ala-Lys-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O FUKFQILQFQKHLE-DCAQKATOSA-N 0.000 description 1
- BLTRAARCJYVJKV-QEJZJMRPSA-N Ala-Lys-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(O)=O BLTRAARCJYVJKV-QEJZJMRPSA-N 0.000 description 1
- CHFFHQUVXHEGBY-GARJFASQSA-N Ala-Lys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CHFFHQUVXHEGBY-GARJFASQSA-N 0.000 description 1
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 1
- KQESEZXHYOUIIM-CQDKDKBSSA-N Ala-Lys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KQESEZXHYOUIIM-CQDKDKBSSA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- PVQLRJRPUTXFFX-CIUDSAMLSA-N Ala-Met-Gln Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O PVQLRJRPUTXFFX-CIUDSAMLSA-N 0.000 description 1
- OMDNCNKNEGFOMM-BQBZGAKWSA-N Ala-Met-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O OMDNCNKNEGFOMM-BQBZGAKWSA-N 0.000 description 1
- GKAZXNDATBWNBI-DCAQKATOSA-N Ala-Met-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N GKAZXNDATBWNBI-DCAQKATOSA-N 0.000 description 1
- GFEDXKNBZMPEDM-KZVJFYERSA-N Ala-Met-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFEDXKNBZMPEDM-KZVJFYERSA-N 0.000 description 1
- FVNAUOZKIPAYNA-BPNCWPANSA-N Ala-Met-Tyr Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FVNAUOZKIPAYNA-BPNCWPANSA-N 0.000 description 1
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 1
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 1
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 1
- WEZNQZHACPSMEF-QEJZJMRPSA-N Ala-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 WEZNQZHACPSMEF-QEJZJMRPSA-N 0.000 description 1
- DYJJJCHDHLEFDW-FXQIFTODSA-N Ala-Pro-Cys Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N DYJJJCHDHLEFDW-FXQIFTODSA-N 0.000 description 1
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 1
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 1
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 1
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 1
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 1
- HIIJOGIBQXHFKE-HHKYUTTNSA-N Ala-Thr-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O HIIJOGIBQXHFKE-HHKYUTTNSA-N 0.000 description 1
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 1
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 1
- ISCYZXFOCXWUJU-KZVJFYERSA-N Ala-Thr-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O ISCYZXFOCXWUJU-KZVJFYERSA-N 0.000 description 1
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- LTTLSZVJTDSACD-OWLDWWDNSA-N Ala-Thr-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O LTTLSZVJTDSACD-OWLDWWDNSA-N 0.000 description 1
- ZVWXMTTZJKBJCI-BHDSKKPTSA-N Ala-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 ZVWXMTTZJKBJCI-BHDSKKPTSA-N 0.000 description 1
- AOAKQKVICDWCLB-UWJYBYFXSA-N Ala-Tyr-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AOAKQKVICDWCLB-UWJYBYFXSA-N 0.000 description 1
- YCTIYBUTCKNOTI-UWJYBYFXSA-N Ala-Tyr-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCTIYBUTCKNOTI-UWJYBYFXSA-N 0.000 description 1
- BHFOJPDOQPWJRN-XDTLVQLUSA-N Ala-Tyr-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCC(N)=O)C(O)=O BHFOJPDOQPWJRN-XDTLVQLUSA-N 0.000 description 1
- BGGAIXWIZCIFSG-XDTLVQLUSA-N Ala-Tyr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O BGGAIXWIZCIFSG-XDTLVQLUSA-N 0.000 description 1
- XKXAZPSREVUCRT-BPNCWPANSA-N Ala-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=C(O)C=C1 XKXAZPSREVUCRT-BPNCWPANSA-N 0.000 description 1
- MUGAESARFRGOTQ-IGNZVWTISA-N Ala-Tyr-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N MUGAESARFRGOTQ-IGNZVWTISA-N 0.000 description 1
- YEBZNKPPOHFZJM-BPNCWPANSA-N Ala-Tyr-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O YEBZNKPPOHFZJM-BPNCWPANSA-N 0.000 description 1
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- QGZKDVFQNNGYKY-UHFFFAOYSA-O Ammonium Chemical compound [NH4+] QGZKDVFQNNGYKY-UHFFFAOYSA-O 0.000 description 1
- DFCIPNHFKOQAME-FXQIFTODSA-N Arg-Ala-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFCIPNHFKOQAME-FXQIFTODSA-N 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 1
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 1
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 1
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 1
- YUIGJDNAGKJLDO-JYJNAYRXSA-N Arg-Arg-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YUIGJDNAGKJLDO-JYJNAYRXSA-N 0.000 description 1
- RVDVDRUZWZIBJQ-CIUDSAMLSA-N Arg-Asn-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RVDVDRUZWZIBJQ-CIUDSAMLSA-N 0.000 description 1
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 1
- YFBGNGASPGRWEM-DCAQKATOSA-N Arg-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFBGNGASPGRWEM-DCAQKATOSA-N 0.000 description 1
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 1
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 1
- KBBKCNHWCDJPGN-GUBZILKMSA-N Arg-Gln-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KBBKCNHWCDJPGN-GUBZILKMSA-N 0.000 description 1
- BEXGZLUHRXTZCC-CIUDSAMLSA-N Arg-Gln-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N BEXGZLUHRXTZCC-CIUDSAMLSA-N 0.000 description 1
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 1
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 1
- PNIGSVZJNVUVJA-BQBZGAKWSA-N Arg-Gly-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O PNIGSVZJNVUVJA-BQBZGAKWSA-N 0.000 description 1
- PPPXVIBMLFWNSK-BQBZGAKWSA-N Arg-Gly-Cys Chemical compound C(C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N PPPXVIBMLFWNSK-BQBZGAKWSA-N 0.000 description 1
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 1
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 1
- DNUKXVMPARLPFN-XUXIUFHCSA-N Arg-Leu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DNUKXVMPARLPFN-XUXIUFHCSA-N 0.000 description 1
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 1
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 1
- CZUHPNLXLWMYMG-UBHSHLNASA-N Arg-Phe-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 CZUHPNLXLWMYMG-UBHSHLNASA-N 0.000 description 1
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 1
- UULLJGQFCDXVTQ-CYDGBPFRSA-N Arg-Pro-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UULLJGQFCDXVTQ-CYDGBPFRSA-N 0.000 description 1
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 1
- VRTWYUYCJGNFES-CIUDSAMLSA-N Arg-Ser-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O VRTWYUYCJGNFES-CIUDSAMLSA-N 0.000 description 1
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 1
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 1
- ZPWMEWYQBWSGAO-ZJDVBMNYSA-N Arg-Thr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZPWMEWYQBWSGAO-ZJDVBMNYSA-N 0.000 description 1
- DRDWXKWUSIKKOB-PJODQICGSA-N Arg-Trp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O DRDWXKWUSIKKOB-PJODQICGSA-N 0.000 description 1
- ZUVMUOOHJYNJPP-XIRDDKMYSA-N Arg-Trp-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZUVMUOOHJYNJPP-XIRDDKMYSA-N 0.000 description 1
- NVPHRWNWTKYIST-BPNCWPANSA-N Arg-Tyr-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 NVPHRWNWTKYIST-BPNCWPANSA-N 0.000 description 1
- QMQZYILAWUOLPV-JYJNAYRXSA-N Arg-Tyr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)CC1=CC=C(O)C=C1 QMQZYILAWUOLPV-JYJNAYRXSA-N 0.000 description 1
- PJOPLXOCKACMLK-KKUMJFAQSA-N Arg-Tyr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O PJOPLXOCKACMLK-KKUMJFAQSA-N 0.000 description 1
- VLIJAPRTSXSGFY-STQMWFEESA-N Arg-Tyr-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 VLIJAPRTSXSGFY-STQMWFEESA-N 0.000 description 1
- PSUXEQYPYZLNER-QXEWZRGKSA-N Arg-Val-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PSUXEQYPYZLNER-QXEWZRGKSA-N 0.000 description 1
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 1
- FMYQECOAIFGQGU-CYDGBPFRSA-N Arg-Val-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMYQECOAIFGQGU-CYDGBPFRSA-N 0.000 description 1
- ANAHQDPQQBDOBM-UHFFFAOYSA-N Arg-Val-Tyr Natural products CC(C)C(NC(=O)C(N)CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O ANAHQDPQQBDOBM-UHFFFAOYSA-N 0.000 description 1
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 1
- NXVGBGZQQFDUTM-XVYDVKMFSA-N Asn-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N NXVGBGZQQFDUTM-XVYDVKMFSA-N 0.000 description 1
- JJGRJMKUOYXZRA-LPEHRKFASA-N Asn-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O JJGRJMKUOYXZRA-LPEHRKFASA-N 0.000 description 1
- POOCJCRBHHMAOS-FXQIFTODSA-N Asn-Arg-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O POOCJCRBHHMAOS-FXQIFTODSA-N 0.000 description 1
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 1
- KSBHCUSPLWRVEK-ZLUOBGJFSA-N Asn-Asn-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KSBHCUSPLWRVEK-ZLUOBGJFSA-N 0.000 description 1
- NVGWESORMHFISY-SRVKXCTJSA-N Asn-Asn-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NVGWESORMHFISY-SRVKXCTJSA-N 0.000 description 1
- NLCDVZJDEXIDDL-BIIVOSGPSA-N Asn-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O NLCDVZJDEXIDDL-BIIVOSGPSA-N 0.000 description 1
- QHBMKQWOIYJYMI-BYULHYEWSA-N Asn-Asn-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QHBMKQWOIYJYMI-BYULHYEWSA-N 0.000 description 1
- XVVOVPFMILMHPX-ZLUOBGJFSA-N Asn-Asp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XVVOVPFMILMHPX-ZLUOBGJFSA-N 0.000 description 1
- JZRLLSOWDYUKOK-SRVKXCTJSA-N Asn-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N JZRLLSOWDYUKOK-SRVKXCTJSA-N 0.000 description 1
- VYLVOMUVLMGCRF-ZLUOBGJFSA-N Asn-Asp-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VYLVOMUVLMGCRF-ZLUOBGJFSA-N 0.000 description 1
- IYVSIZAXNLOKFQ-BYULHYEWSA-N Asn-Asp-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IYVSIZAXNLOKFQ-BYULHYEWSA-N 0.000 description 1
- AYKKKGFJXIDYLX-ACZMJKKPSA-N Asn-Gln-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AYKKKGFJXIDYLX-ACZMJKKPSA-N 0.000 description 1
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 1
- UBKOVSLDWIHYSY-ACZMJKKPSA-N Asn-Glu-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UBKOVSLDWIHYSY-ACZMJKKPSA-N 0.000 description 1
- GFFRWIJAFFMQGM-NUMRIWBASA-N Asn-Glu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFFRWIJAFFMQGM-NUMRIWBASA-N 0.000 description 1
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 1
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 1
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 1
- OFQPMRDJVWLMNJ-CIUDSAMLSA-N Asn-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N OFQPMRDJVWLMNJ-CIUDSAMLSA-N 0.000 description 1
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 1
- OLISTMZJGQUOGS-GMOBBJLQSA-N Asn-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OLISTMZJGQUOGS-GMOBBJLQSA-N 0.000 description 1
- ANPFQTJEPONRPL-UGYAYLCHSA-N Asn-Ile-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O ANPFQTJEPONRPL-UGYAYLCHSA-N 0.000 description 1
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 1
- XLZCLJRGGMBKLR-PCBIJLKTSA-N Asn-Ile-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XLZCLJRGGMBKLR-PCBIJLKTSA-N 0.000 description 1
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 1
- HFPXZWPUVFVNLL-GUBZILKMSA-N Asn-Leu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFPXZWPUVFVNLL-GUBZILKMSA-N 0.000 description 1
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 1
- FTSAJSADJCMDHH-CIUDSAMLSA-N Asn-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FTSAJSADJCMDHH-CIUDSAMLSA-N 0.000 description 1
- LZLCLRQMUQWUHJ-GUBZILKMSA-N Asn-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N LZLCLRQMUQWUHJ-GUBZILKMSA-N 0.000 description 1
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 description 1
- GIQCDTKOIPUDSG-GARJFASQSA-N Asn-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N)C(=O)O GIQCDTKOIPUDSG-GARJFASQSA-N 0.000 description 1
- COWITDLVHMZSIW-CIUDSAMLSA-N Asn-Lys-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O COWITDLVHMZSIW-CIUDSAMLSA-N 0.000 description 1
- XFJKRRCWLTZIQA-XIRDDKMYSA-N Asn-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N XFJKRRCWLTZIQA-XIRDDKMYSA-N 0.000 description 1
- MDDXKBHIMYYJLW-FXQIFTODSA-N Asn-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N MDDXKBHIMYYJLW-FXQIFTODSA-N 0.000 description 1
- QGABLMITFKUQDF-DCAQKATOSA-N Asn-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N QGABLMITFKUQDF-DCAQKATOSA-N 0.000 description 1
- RTFWCVDISAMGEQ-SRVKXCTJSA-N Asn-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N RTFWCVDISAMGEQ-SRVKXCTJSA-N 0.000 description 1
- YUUIAUXBNOHFRJ-IHRRRGAJSA-N Asn-Phe-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O YUUIAUXBNOHFRJ-IHRRRGAJSA-N 0.000 description 1
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 1
- UYCPJVYQYARFGB-YDHLFZDLSA-N Asn-Phe-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UYCPJVYQYARFGB-YDHLFZDLSA-N 0.000 description 1
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 1
- JTXVXGXTRXMOFJ-FXQIFTODSA-N Asn-Pro-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O JTXVXGXTRXMOFJ-FXQIFTODSA-N 0.000 description 1
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 1
- YWFLXGZHZXXINF-BPUTZDHNSA-N Asn-Pro-Trp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CNC2=CC=CC=C12 YWFLXGZHZXXINF-BPUTZDHNSA-N 0.000 description 1
- JWQWPRCDYWNVNM-ACZMJKKPSA-N Asn-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N JWQWPRCDYWNVNM-ACZMJKKPSA-N 0.000 description 1
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 1
- ZUFPUBYQYWCMDB-NUMRIWBASA-N Asn-Thr-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZUFPUBYQYWCMDB-NUMRIWBASA-N 0.000 description 1
- PUUPMDXIHCOPJU-HJGDQZAQSA-N Asn-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O PUUPMDXIHCOPJU-HJGDQZAQSA-N 0.000 description 1
- WUQXMTITJLFXAU-JIOCBJNQSA-N Asn-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N)O WUQXMTITJLFXAU-JIOCBJNQSA-N 0.000 description 1
- KZYSHAMXEBPJBD-JRQIVUDYSA-N Asn-Thr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZYSHAMXEBPJBD-JRQIVUDYSA-N 0.000 description 1
- IPPFAOCLQSGHJV-WFBYXXMGSA-N Asn-Trp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O IPPFAOCLQSGHJV-WFBYXXMGSA-N 0.000 description 1
- QIRJQYQOIKBPBZ-IHRRRGAJSA-N Asn-Tyr-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QIRJQYQOIKBPBZ-IHRRRGAJSA-N 0.000 description 1
- NSTBNYOKCZKOMI-AVGNSLFASA-N Asn-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O NSTBNYOKCZKOMI-AVGNSLFASA-N 0.000 description 1
- QUCCLIXMVPIVOB-BZSNNMDCSA-N Asn-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N QUCCLIXMVPIVOB-BZSNNMDCSA-N 0.000 description 1
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 1
- ZAESWDKAMDVHLL-RCOVLWMOSA-N Asn-Val-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O ZAESWDKAMDVHLL-RCOVLWMOSA-N 0.000 description 1
- LMIWYCWRJVMAIQ-NHCYSSNCSA-N Asn-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N LMIWYCWRJVMAIQ-NHCYSSNCSA-N 0.000 description 1
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 1
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 1
- HBUJSDCLZCXXCW-YDHLFZDLSA-N Asn-Val-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HBUJSDCLZCXXCW-YDHLFZDLSA-N 0.000 description 1
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 1
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 1
- ZVTDYGWRRPMFCL-WFBYXXMGSA-N Asp-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N ZVTDYGWRRPMFCL-WFBYXXMGSA-N 0.000 description 1
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 1
- UQBGYPFHWFZMCD-ZLUOBGJFSA-N Asp-Asn-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UQBGYPFHWFZMCD-ZLUOBGJFSA-N 0.000 description 1
- QRULNKJGYQQZMW-ZLUOBGJFSA-N Asp-Asn-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QRULNKJGYQQZMW-ZLUOBGJFSA-N 0.000 description 1
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 1
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 1
- UGIBTKGQVWFTGX-BIIVOSGPSA-N Asp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O UGIBTKGQVWFTGX-BIIVOSGPSA-N 0.000 description 1
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 1
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 1
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 1
- ZCKYZTGLXIEOKS-CIUDSAMLSA-N Asp-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N ZCKYZTGLXIEOKS-CIUDSAMLSA-N 0.000 description 1
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 1
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 1
- LKIYSIYBKYLKPU-BIIVOSGPSA-N Asp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O LKIYSIYBKYLKPU-BIIVOSGPSA-N 0.000 description 1
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 1
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 1
- CSEJMKNZDCJYGJ-XHNCKOQMSA-N Asp-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O CSEJMKNZDCJYGJ-XHNCKOQMSA-N 0.000 description 1
- JRBVWZLHBGYZNY-QEJZJMRPSA-N Asp-Gln-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRBVWZLHBGYZNY-QEJZJMRPSA-N 0.000 description 1
- OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 1
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 1
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 1
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 1
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 1
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 1
- LDGUZSIPGSPBJP-XVYDVKMFSA-N Asp-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N LDGUZSIPGSPBJP-XVYDVKMFSA-N 0.000 description 1
- WSXDIZFNQYTUJB-SRVKXCTJSA-N Asp-His-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O WSXDIZFNQYTUJB-SRVKXCTJSA-N 0.000 description 1
- SWTQDYFZVOJVLL-KKUMJFAQSA-N Asp-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N)O SWTQDYFZVOJVLL-KKUMJFAQSA-N 0.000 description 1
- LDLZOAJRXXBVGF-GMOBBJLQSA-N Asp-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N LDLZOAJRXXBVGF-GMOBBJLQSA-N 0.000 description 1
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 1
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 1
- SCQIQCWLOMOEFP-DCAQKATOSA-N Asp-Leu-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SCQIQCWLOMOEFP-DCAQKATOSA-N 0.000 description 1
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 1
- OEDJQRXNDRUGEU-SRVKXCTJSA-N Asp-Leu-His Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O OEDJQRXNDRUGEU-SRVKXCTJSA-N 0.000 description 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 1
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 1
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 1
- ZXRQJQCXPSMNMR-XIRDDKMYSA-N Asp-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N ZXRQJQCXPSMNMR-XIRDDKMYSA-N 0.000 description 1
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 1
- SARSTIZOZFBDOM-FXQIFTODSA-N Asp-Met-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O SARSTIZOZFBDOM-FXQIFTODSA-N 0.000 description 1
- GYWQGGUCMDCUJE-DLOVCJGASA-N Asp-Phe-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O GYWQGGUCMDCUJE-DLOVCJGASA-N 0.000 description 1
- WOPJVEMFXYHZEE-SRVKXCTJSA-N Asp-Phe-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WOPJVEMFXYHZEE-SRVKXCTJSA-N 0.000 description 1
- KRQFMDNIUOVRIF-KKUMJFAQSA-N Asp-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)O)N KRQFMDNIUOVRIF-KKUMJFAQSA-N 0.000 description 1
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 1
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 1
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 1
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 1
- XYPJXLLXNSAWHZ-SRVKXCTJSA-N Asp-Ser-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XYPJXLLXNSAWHZ-SRVKXCTJSA-N 0.000 description 1
- UTLCRGFJFSZWAW-OLHMAJIHSA-N Asp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UTLCRGFJFSZWAW-OLHMAJIHSA-N 0.000 description 1
- KBJVTFWQWXCYCQ-IUKAMOBKSA-N Asp-Thr-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KBJVTFWQWXCYCQ-IUKAMOBKSA-N 0.000 description 1
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 1
- XOASPVGNFAMYBD-WFBYXXMGSA-N Asp-Trp-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O XOASPVGNFAMYBD-WFBYXXMGSA-N 0.000 description 1
- YUELDQUPTAYEGM-XIRDDKMYSA-N Asp-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)O)N YUELDQUPTAYEGM-XIRDDKMYSA-N 0.000 description 1
- FIRWLDUOFOULCA-XIRDDKMYSA-N Asp-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N FIRWLDUOFOULCA-XIRDDKMYSA-N 0.000 description 1
- OYSYWMMZGJSQRB-AVGNSLFASA-N Asp-Tyr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O OYSYWMMZGJSQRB-AVGNSLFASA-N 0.000 description 1
- AWPWHMVCSISSQK-QWRGUYRKSA-N Asp-Tyr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O AWPWHMVCSISSQK-QWRGUYRKSA-N 0.000 description 1
- BJDHEININLSZOT-KKUMJFAQSA-N Asp-Tyr-Lys Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(O)=O BJDHEININLSZOT-KKUMJFAQSA-N 0.000 description 1
- OQMGSMNZVHYDTQ-ZKWXMUAHSA-N Asp-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N OQMGSMNZVHYDTQ-ZKWXMUAHSA-N 0.000 description 1
- XQFLFQWOBXPMHW-NHCYSSNCSA-N Asp-Val-His Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O XQFLFQWOBXPMHW-NHCYSSNCSA-N 0.000 description 1
- 101100075927 Aspergillus aculeatus mndA gene Proteins 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 241000193752 Bacillus circulans Species 0.000 description 1
- 241000193749 Bacillus coagulans Species 0.000 description 1
- 241000193422 Bacillus lentus Species 0.000 description 1
- 241000194108 Bacillus licheniformis Species 0.000 description 1
- 241000194107 Bacillus megaterium Species 0.000 description 1
- 241000194106 Bacillus mycoides Species 0.000 description 1
- 241000194103 Bacillus pumilus Species 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 101100508888 Bacillus subtilis (strain 168) iolJ gene Proteins 0.000 description 1
- 241000186000 Bifidobacterium Species 0.000 description 1
- 241000186015 Bifidobacterium longum subsp. infantis Species 0.000 description 1
- 241000193417 Brevibacillus laterosporus Species 0.000 description 1
- 240000001817 Cereus hexagonus Species 0.000 description 1
- 241000588919 Citrobacter freundii Species 0.000 description 1
- 241000193403 Clostridium Species 0.000 description 1
- 241000193401 Clostridium acetobutylicum Species 0.000 description 1
- 208000002881 Colic Diseases 0.000 description 1
- 229910021591 Copper(I) chloride Inorganic materials 0.000 description 1
- 241000186216 Corynebacterium Species 0.000 description 1
- 241000186226 Corynebacterium glutamicum Species 0.000 description 1
- 108010051219 Cre recombinase Proteins 0.000 description 1
- YFXFOZPXVFPBDH-VZFHVOOUSA-N Cys-Ala-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)CS)C(O)=O YFXFOZPXVFPBDH-VZFHVOOUSA-N 0.000 description 1
- NDUSUIGBMZCOIL-ZKWXMUAHSA-N Cys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N NDUSUIGBMZCOIL-ZKWXMUAHSA-N 0.000 description 1
- PORWNQWEEIOIRH-XHNCKOQMSA-N Cys-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N)C(=O)O PORWNQWEEIOIRH-XHNCKOQMSA-N 0.000 description 1
- DZSICRGTVPDCRN-YUMQZZPRSA-N Cys-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N DZSICRGTVPDCRN-YUMQZZPRSA-N 0.000 description 1
- VPQZSNQICFCCSO-BJDJZHNGSA-N Cys-Leu-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VPQZSNQICFCCSO-BJDJZHNGSA-N 0.000 description 1
- IDZDFWJNPOOOHE-KKUMJFAQSA-N Cys-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N IDZDFWJNPOOOHE-KKUMJFAQSA-N 0.000 description 1
- MJOYUXLETJMQGG-IHRRRGAJSA-N Cys-Tyr-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MJOYUXLETJMQGG-IHRRRGAJSA-N 0.000 description 1
- UGPCUUWZXRMCIJ-KKUMJFAQSA-N Cys-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CS)N UGPCUUWZXRMCIJ-KKUMJFAQSA-N 0.000 description 1
- 108010084372 D-arabinose isomerase Proteins 0.000 description 1
- XPYBSIWDXQFNMH-UHFFFAOYSA-N D-fructose 1,6-bisphosphate Natural products OP(=O)(O)OCC(O)C(O)C(O)C(=O)COP(O)(O)=O XPYBSIWDXQFNMH-UHFFFAOYSA-N 0.000 description 1
- AEMOLEFTQBMNLQ-VANFPWTGSA-N D-mannopyranuronic acid Chemical compound OC1O[C@H](C(O)=O)[C@@H](O)[C@H](O)[C@@H]1O AEMOLEFTQBMNLQ-VANFPWTGSA-N 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 241000194033 Enterococcus Species 0.000 description 1
- 241000194031 Enterococcus faecium Species 0.000 description 1
- 241000588698 Erwinia Species 0.000 description 1
- 101100022282 Escherichia coli O157:H7 manC2 gene Proteins 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- 108010046276 FLP recombinase Proteins 0.000 description 1
- 102000001390 Fructose-Bisphosphate Aldolase Human genes 0.000 description 1
- 108010068561 Fructose-Bisphosphate Aldolase Proteins 0.000 description 1
- 102100024515 GDP-L-fucose synthase Human genes 0.000 description 1
- 108030006298 GDP-L-fucose synthases Proteins 0.000 description 1
- 108010062427 GDP-mannose 4,6-dehydratase Proteins 0.000 description 1
- 102000002312 GDPmannose 4,6-dehydratase Human genes 0.000 description 1
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 1
- WUAYFMZULZDSLB-ACZMJKKPSA-N Gln-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O WUAYFMZULZDSLB-ACZMJKKPSA-N 0.000 description 1
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 1
- LKUWAWGNJYJODH-KBIXCLLPSA-N Gln-Ala-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKUWAWGNJYJODH-KBIXCLLPSA-N 0.000 description 1
- KJRXLVZYJJLUCV-DCAQKATOSA-N Gln-Arg-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O KJRXLVZYJJLUCV-DCAQKATOSA-N 0.000 description 1
- OETQLUYCMBARHJ-CIUDSAMLSA-N Gln-Asn-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OETQLUYCMBARHJ-CIUDSAMLSA-N 0.000 description 1
- MGJMFSBEMSNYJL-AVGNSLFASA-N Gln-Asn-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MGJMFSBEMSNYJL-AVGNSLFASA-N 0.000 description 1
- LMPBBFWHCRURJD-LAEOZQHASA-N Gln-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LMPBBFWHCRURJD-LAEOZQHASA-N 0.000 description 1
- CYTSBCIIEHUPDU-ACZMJKKPSA-N Gln-Asp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O CYTSBCIIEHUPDU-ACZMJKKPSA-N 0.000 description 1
- IKDOHQHEFPPGJG-FXQIFTODSA-N Gln-Asp-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IKDOHQHEFPPGJG-FXQIFTODSA-N 0.000 description 1
- JFSNBQJNDMXMQF-XHNCKOQMSA-N Gln-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JFSNBQJNDMXMQF-XHNCKOQMSA-N 0.000 description 1
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 1
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 1
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 1
- XKBASPWPBXNVLQ-WDSKDSINSA-N Gln-Gly-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XKBASPWPBXNVLQ-WDSKDSINSA-N 0.000 description 1
- GNMQDOGFWYWPNM-LAEOZQHASA-N Gln-Gly-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)CNC(=O)[C@@H](N)CCC(N)=O)C(O)=O GNMQDOGFWYWPNM-LAEOZQHASA-N 0.000 description 1
- NSORZJXKUQFEKL-JGVFFNPUSA-N Gln-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)N)N)C(=O)O NSORZJXKUQFEKL-JGVFFNPUSA-N 0.000 description 1
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 1
- XQEAVUJIRZRLQQ-SZMVWBNQSA-N Gln-His-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC3=CN=CN3)NC(=O)[C@H](CCC(=O)N)N XQEAVUJIRZRLQQ-SZMVWBNQSA-N 0.000 description 1
- TWTWUBHEWQPMQW-ZPFDUUQYSA-N Gln-Ile-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWTWUBHEWQPMQW-ZPFDUUQYSA-N 0.000 description 1
- HXOLDXKNWKLDMM-YVNDNENWSA-N Gln-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HXOLDXKNWKLDMM-YVNDNENWSA-N 0.000 description 1
- MTCXQQINVAFZKW-MNXVOIDGSA-N Gln-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MTCXQQINVAFZKW-MNXVOIDGSA-N 0.000 description 1
- MWERYIXRDZDXOA-QEWYBTABSA-N Gln-Ile-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MWERYIXRDZDXOA-QEWYBTABSA-N 0.000 description 1
- ITZWDGBYBPUZRG-KBIXCLLPSA-N Gln-Ile-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O ITZWDGBYBPUZRG-KBIXCLLPSA-N 0.000 description 1
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 1
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 1
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 1
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 1
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 1
- DQLVHRFFBQOWFL-JYJNAYRXSA-N Gln-Lys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)O DQLVHRFFBQOWFL-JYJNAYRXSA-N 0.000 description 1
- DRNMNLKUUKKPIA-HTUGSXCWSA-N Gln-Phe-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)CCC(N)=O)C(O)=O DRNMNLKUUKKPIA-HTUGSXCWSA-N 0.000 description 1
- FNAJNWPDTIXYJN-CIUDSAMLSA-N Gln-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O FNAJNWPDTIXYJN-CIUDSAMLSA-N 0.000 description 1
- PIUPHASDUFSHTF-CIUDSAMLSA-N Gln-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O PIUPHASDUFSHTF-CIUDSAMLSA-N 0.000 description 1
- HMIXCETWRYDVMO-GUBZILKMSA-N Gln-Pro-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O HMIXCETWRYDVMO-GUBZILKMSA-N 0.000 description 1
- OKARHJKJTKFQBM-ACZMJKKPSA-N Gln-Ser-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OKARHJKJTKFQBM-ACZMJKKPSA-N 0.000 description 1
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 1
- ZGHMRONFHDVXEF-AVGNSLFASA-N Gln-Ser-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZGHMRONFHDVXEF-AVGNSLFASA-N 0.000 description 1
- SYZZMPFLOLSMHL-XHNCKOQMSA-N Gln-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SYZZMPFLOLSMHL-XHNCKOQMSA-N 0.000 description 1
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 1
- ININBLZFFVOQIO-JHEQGTHGSA-N Gln-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O ININBLZFFVOQIO-JHEQGTHGSA-N 0.000 description 1
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 1
- OUBUHIODTNUUTC-WDCWCFNPSA-N Gln-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O OUBUHIODTNUUTC-WDCWCFNPSA-N 0.000 description 1
- ARYKRXHBIPLULY-XKBZYTNZSA-N Gln-Thr-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ARYKRXHBIPLULY-XKBZYTNZSA-N 0.000 description 1
- STHSGOZLFLFGSS-SUSMZKCASA-N Gln-Thr-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STHSGOZLFLFGSS-SUSMZKCASA-N 0.000 description 1
- XMWNHGKDDIFXQJ-NWLDYVSISA-N Gln-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O XMWNHGKDDIFXQJ-NWLDYVSISA-N 0.000 description 1
- RBSKVTZUFMIWFU-XEGUGMAKSA-N Gln-Trp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O RBSKVTZUFMIWFU-XEGUGMAKSA-N 0.000 description 1
- YJCZUTXLPXBNIO-BHYGNILZSA-N Gln-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CCC(=O)N)N)C(=O)O YJCZUTXLPXBNIO-BHYGNILZSA-N 0.000 description 1
- CTJRFALAOYAJBX-NWLDYVSISA-N Gln-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)N)N)O CTJRFALAOYAJBX-NWLDYVSISA-N 0.000 description 1
- NVHJGTGTUGEWCG-ZVZYQTTQSA-N Gln-Trp-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O NVHJGTGTUGEWCG-ZVZYQTTQSA-N 0.000 description 1
- VYOILACOFPPNQH-UMNHJUIQSA-N Gln-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N VYOILACOFPPNQH-UMNHJUIQSA-N 0.000 description 1
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 1
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 1
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 1
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 1
- BPDVTFBJZNBHEU-HGNGGELXSA-N Glu-Ala-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 BPDVTFBJZNBHEU-HGNGGELXSA-N 0.000 description 1
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 1
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 1
- PBEQPAZRHDVJQI-SRVKXCTJSA-N Glu-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N PBEQPAZRHDVJQI-SRVKXCTJSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 1
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 1
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 description 1
- PKYAVRMYTBBRLS-FXQIFTODSA-N Glu-Cys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O PKYAVRMYTBBRLS-FXQIFTODSA-N 0.000 description 1
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 1
- GYCPQVFKCPPRQB-GUBZILKMSA-N Glu-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N GYCPQVFKCPPRQB-GUBZILKMSA-N 0.000 description 1
- RFDHKPSHTXZKLL-IHRRRGAJSA-N Glu-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N RFDHKPSHTXZKLL-IHRRRGAJSA-N 0.000 description 1
- WPLGNDORMXTMQS-FXQIFTODSA-N Glu-Gln-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O WPLGNDORMXTMQS-FXQIFTODSA-N 0.000 description 1
- MIQCYAJSDGNCNK-BPUTZDHNSA-N Glu-Gln-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O MIQCYAJSDGNCNK-BPUTZDHNSA-N 0.000 description 1
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 1
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 1
- WRNAXCVRSBBKGS-BQBZGAKWSA-N Glu-Gly-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O WRNAXCVRSBBKGS-BQBZGAKWSA-N 0.000 description 1
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 1
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 1
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 1
- XMPAXPSENRSOSV-RYUDHWBXSA-N Glu-Gly-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XMPAXPSENRSOSV-RYUDHWBXSA-N 0.000 description 1
- WVTIBGWZUMJBFY-GUBZILKMSA-N Glu-His-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O WVTIBGWZUMJBFY-GUBZILKMSA-N 0.000 description 1
- WDTAKCUOIKHCTB-NKIYYHGXSA-N Glu-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N)O WDTAKCUOIKHCTB-NKIYYHGXSA-N 0.000 description 1
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 1
- WVYJNPCWJYBHJG-YVNDNENWSA-N Glu-Ile-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O WVYJNPCWJYBHJG-YVNDNENWSA-N 0.000 description 1
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 1
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 1
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 1
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 1
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 1
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 1
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 1
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 1
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 1
- NPMSEUWUMOSEFM-CIUDSAMLSA-N Glu-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N NPMSEUWUMOSEFM-CIUDSAMLSA-N 0.000 description 1
- HQOGXFLBAKJUMH-CIUDSAMLSA-N Glu-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N HQOGXFLBAKJUMH-CIUDSAMLSA-N 0.000 description 1
- YHOJJFFTSMWVGR-HJGDQZAQSA-N Glu-Met-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YHOJJFFTSMWVGR-HJGDQZAQSA-N 0.000 description 1
- DXVOKNVIKORTHQ-GUBZILKMSA-N Glu-Pro-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O DXVOKNVIKORTHQ-GUBZILKMSA-N 0.000 description 1
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 1
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 1
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 1
- QOXDAWODGSIDDI-GUBZILKMSA-N Glu-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N QOXDAWODGSIDDI-GUBZILKMSA-N 0.000 description 1
- JWNZHMSRZXXGTM-XKBZYTNZSA-N Glu-Ser-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWNZHMSRZXXGTM-XKBZYTNZSA-N 0.000 description 1
- TZXOPHFCAATANZ-QEJZJMRPSA-N Glu-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N TZXOPHFCAATANZ-QEJZJMRPSA-N 0.000 description 1
- DTLLNDVORUEOTM-WDCWCFNPSA-N Glu-Thr-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DTLLNDVORUEOTM-WDCWCFNPSA-N 0.000 description 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 1
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 1
- QVXWAFZDWRLXTI-NWLDYVSISA-N Glu-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QVXWAFZDWRLXTI-NWLDYVSISA-N 0.000 description 1
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 1
- ZTNHPMZHAILHRB-JSGCOSHPSA-N Glu-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)NCC(O)=O)=CNC2=C1 ZTNHPMZHAILHRB-JSGCOSHPSA-N 0.000 description 1
- QEJKKJNDDDPSMU-KKUMJFAQSA-N Glu-Tyr-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(O)=O QEJKKJNDDDPSMU-KKUMJFAQSA-N 0.000 description 1
- PMSDOVISAARGAV-FHWLQOOXSA-N Glu-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 PMSDOVISAARGAV-FHWLQOOXSA-N 0.000 description 1
- HBMRTXJZQDVRFT-DZKIICNBSA-N Glu-Tyr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HBMRTXJZQDVRFT-DZKIICNBSA-N 0.000 description 1
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 1
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 1
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 1
- XIJOPMSILDNVNJ-ZVZYQTTQSA-N Glu-Val-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XIJOPMSILDNVNJ-ZVZYQTTQSA-N 0.000 description 1
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 1
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 1
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 1
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 1
- KFMBRBPXHVMDFN-UWVGGRQHSA-N Gly-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCNC(N)=N KFMBRBPXHVMDFN-UWVGGRQHSA-N 0.000 description 1
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 1
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 1
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 1
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 1
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 1
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 1
- LURCIJSJAKFCRO-QWRGUYRKSA-N Gly-Asn-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LURCIJSJAKFCRO-QWRGUYRKSA-N 0.000 description 1
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 1
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 1
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 1
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 1
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 1
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 1
- QGZSAHIZRQHCEQ-QWRGUYRKSA-N Gly-Asp-Tyr Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QGZSAHIZRQHCEQ-QWRGUYRKSA-N 0.000 description 1
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 1
- VNBNZUAPOYGRDB-ZDLURKLDSA-N Gly-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)O VNBNZUAPOYGRDB-ZDLURKLDSA-N 0.000 description 1
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 1
- BPQYBFAXRGMGGY-LAEOZQHASA-N Gly-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN BPQYBFAXRGMGGY-LAEOZQHASA-N 0.000 description 1
- GNPVTZJUUBPZKW-WDSKDSINSA-N Gly-Gln-Ser Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GNPVTZJUUBPZKW-WDSKDSINSA-N 0.000 description 1
- NPSWCZIRBAYNSB-JHEQGTHGSA-N Gly-Gln-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPSWCZIRBAYNSB-JHEQGTHGSA-N 0.000 description 1
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 1
- FIQQRCFQXGLOSZ-WDSKDSINSA-N Gly-Glu-Asp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FIQQRCFQXGLOSZ-WDSKDSINSA-N 0.000 description 1
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 1
- LHRXAHLCRMQBGJ-RYUDHWBXSA-N Gly-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN LHRXAHLCRMQBGJ-RYUDHWBXSA-N 0.000 description 1
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 1
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 1
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 1
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 1
- VAXIVIPMCTYSHI-YUMQZZPRSA-N Gly-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN VAXIVIPMCTYSHI-YUMQZZPRSA-N 0.000 description 1
- HPAIKDPJURGQLN-KBPBESRZSA-N Gly-His-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CNC=N1 HPAIKDPJURGQLN-KBPBESRZSA-N 0.000 description 1
- SIYTVHWNKGIGMD-HOTGVXAUSA-N Gly-His-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC3=CN=CN3)NC(=O)CN SIYTVHWNKGIGMD-HOTGVXAUSA-N 0.000 description 1
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 1
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 1
- VIIBEIQMLJEUJG-LAEOZQHASA-N Gly-Ile-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O VIIBEIQMLJEUJG-LAEOZQHASA-N 0.000 description 1
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 1
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 1
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 1
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 1
- YSDLIYZLOTZZNP-UWVGGRQHSA-N Gly-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN YSDLIYZLOTZZNP-UWVGGRQHSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- BXICSAQLIHFDDL-YUMQZZPRSA-N Gly-Lys-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BXICSAQLIHFDDL-YUMQZZPRSA-N 0.000 description 1
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 1
- IUKIDFVOUHZRAK-QWRGUYRKSA-N Gly-Lys-His Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IUKIDFVOUHZRAK-QWRGUYRKSA-N 0.000 description 1
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 1
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 1
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 1
- YHYDTTUSJXGTQK-UWVGGRQHSA-N Gly-Met-Leu Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(C)C)C(O)=O YHYDTTUSJXGTQK-UWVGGRQHSA-N 0.000 description 1
- WZSHYFGOLPXPLL-RYUDHWBXSA-N Gly-Phe-Glu Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(O)=O)C(O)=O WZSHYFGOLPXPLL-RYUDHWBXSA-N 0.000 description 1
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 1
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 1
- YLEIWGJJBFBFHC-KBPBESRZSA-N Gly-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 YLEIWGJJBFBFHC-KBPBESRZSA-N 0.000 description 1
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 1
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 1
- HJARVELKOSZUEW-YUMQZZPRSA-N Gly-Pro-Gln Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJARVELKOSZUEW-YUMQZZPRSA-N 0.000 description 1
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 1
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 1
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- RHRLHXQWHCNJKR-PMVVWTBXSA-N Gly-Thr-His Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 RHRLHXQWHCNJKR-PMVVWTBXSA-N 0.000 description 1
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 1
- BXDLTKLPPKBVEL-FJXKBIBVSA-N Gly-Thr-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O BXDLTKLPPKBVEL-FJXKBIBVSA-N 0.000 description 1
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 1
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 1
- WSWWTQYHFCBKBT-DVJZZOLTSA-N Gly-Thr-Trp Chemical compound C[C@@H](O)[C@H](NC(=O)CN)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O WSWWTQYHFCBKBT-DVJZZOLTSA-N 0.000 description 1
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 1
- RCHFYMASWAZQQZ-ZANVPECISA-N Gly-Trp-Ala Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)CN)=CNC2=C1 RCHFYMASWAZQQZ-ZANVPECISA-N 0.000 description 1
- UMRIXLHPZZIOML-OALUTQOASA-N Gly-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)CN UMRIXLHPZZIOML-OALUTQOASA-N 0.000 description 1
- UMBDRSMLCUYIRI-DVJZZOLTSA-N Gly-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN)O UMBDRSMLCUYIRI-DVJZZOLTSA-N 0.000 description 1
- MREVELMMFOLESM-HOCLYGCPSA-N Gly-Trp-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O MREVELMMFOLESM-HOCLYGCPSA-N 0.000 description 1
- UIQGJYUEQDOODF-KWQFWETISA-N Gly-Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 UIQGJYUEQDOODF-KWQFWETISA-N 0.000 description 1
- PNUFMLXHOLFRLD-KBPBESRZSA-N Gly-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 PNUFMLXHOLFRLD-KBPBESRZSA-N 0.000 description 1
- IHDKKJVBLGXLEL-STQMWFEESA-N Gly-Tyr-Met Chemical compound CSCC[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)CN)C(O)=O IHDKKJVBLGXLEL-STQMWFEESA-N 0.000 description 1
- OCRQUYDOYKCOQG-IRXDYDNUSA-N Gly-Tyr-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 OCRQUYDOYKCOQG-IRXDYDNUSA-N 0.000 description 1
- LYZYGGWCBLBDMC-QWHCGFSZSA-N Gly-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)CN)C(=O)O LYZYGGWCBLBDMC-QWHCGFSZSA-N 0.000 description 1
- JYGYNWYVKXENNE-OALUTQOASA-N Gly-Tyr-Trp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JYGYNWYVKXENNE-OALUTQOASA-N 0.000 description 1
- NGBGZCUWFVVJKC-IRXDYDNUSA-N Gly-Tyr-Tyr Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NGBGZCUWFVVJKC-IRXDYDNUSA-N 0.000 description 1
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 1
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 1
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- 229930186217 Glycolipid Natural products 0.000 description 1
- 102000003886 Glycoproteins Human genes 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- 101100511168 Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) lex1 gene Proteins 0.000 description 1
- QIVPRLJQQVXCIY-HGNGGELXSA-N His-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCC(N)=O)C(O)=O QIVPRLJQQVXCIY-HGNGGELXSA-N 0.000 description 1
- DCRODRAURLJOFY-XPUUQOCRSA-N His-Ala-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)NCC(O)=O DCRODRAURLJOFY-XPUUQOCRSA-N 0.000 description 1
- HTZKFIYQMHJWSQ-INTQDDNPSA-N His-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HTZKFIYQMHJWSQ-INTQDDNPSA-N 0.000 description 1
- IDNNYVGVSZMQTK-IHRRRGAJSA-N His-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N IDNNYVGVSZMQTK-IHRRRGAJSA-N 0.000 description 1
- QZAFGJNKLMNDEM-DCAQKATOSA-N His-Asn-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 QZAFGJNKLMNDEM-DCAQKATOSA-N 0.000 description 1
- VBOFRJNDIOPNDO-YUMQZZPRSA-N His-Gly-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N VBOFRJNDIOPNDO-YUMQZZPRSA-N 0.000 description 1
- FDQYIRHBVVUTJF-ZETCQYMHSA-N His-Gly-Gly Chemical compound [O-]C(=O)CNC(=O)CNC(=O)[C@@H]([NH3+])CC1=CN=CN1 FDQYIRHBVVUTJF-ZETCQYMHSA-N 0.000 description 1
- RGPWUJOMKFYFSR-QWRGUYRKSA-N His-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RGPWUJOMKFYFSR-QWRGUYRKSA-N 0.000 description 1
- NTXIJPDAHXSHNL-ONGXEEELSA-N His-Gly-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NTXIJPDAHXSHNL-ONGXEEELSA-N 0.000 description 1
- KAFZDWMZKGQDEE-SRVKXCTJSA-N His-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KAFZDWMZKGQDEE-SRVKXCTJSA-N 0.000 description 1
- 108010093488 His-His-His-His-His-His Proteins 0.000 description 1
- NDKSHNQINMRKHT-PEXQALLHSA-N His-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N NDKSHNQINMRKHT-PEXQALLHSA-N 0.000 description 1
- PGRPSOUCWRBWKZ-DLOVCJGASA-N His-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 PGRPSOUCWRBWKZ-DLOVCJGASA-N 0.000 description 1
- LDFWDDVELNOGII-MXAVVETBSA-N His-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N LDFWDDVELNOGII-MXAVVETBSA-N 0.000 description 1
- CKRJBQJIGOEKMC-SRVKXCTJSA-N His-Lys-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O CKRJBQJIGOEKMC-SRVKXCTJSA-N 0.000 description 1
- VDHOMPFVSABJKU-ULQDDVLXSA-N His-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N VDHOMPFVSABJKU-ULQDDVLXSA-N 0.000 description 1
- PYNPBMCLAKTHJL-SRVKXCTJSA-N His-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O PYNPBMCLAKTHJL-SRVKXCTJSA-N 0.000 description 1
- PLCAEMGSYOYIPP-GUBZILKMSA-N His-Ser-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 PLCAEMGSYOYIPP-GUBZILKMSA-N 0.000 description 1
- XHQYFGPIRUHQIB-PBCZWWQYSA-N His-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CN=CN1 XHQYFGPIRUHQIB-PBCZWWQYSA-N 0.000 description 1
- MDOBWSFNSNPENN-PMVVWTBXSA-N His-Thr-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O MDOBWSFNSNPENN-PMVVWTBXSA-N 0.000 description 1
- YSMZBYPVVYSGOT-SZMVWBNQSA-N His-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N YSMZBYPVVYSGOT-SZMVWBNQSA-N 0.000 description 1
- ZNTSGDNUITWTRA-WDSOQIARSA-N His-Trp-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O ZNTSGDNUITWTRA-WDSOQIARSA-N 0.000 description 1
- RNVUQLOKVIPNEM-BZSNNMDCSA-N His-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O RNVUQLOKVIPNEM-BZSNNMDCSA-N 0.000 description 1
- CSTDQOOBZBAJKE-BWAGICSOSA-N His-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N)O CSTDQOOBZBAJKE-BWAGICSOSA-N 0.000 description 1
- FFYYUUWROYYKFY-IHRRRGAJSA-N His-Val-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O FFYYUUWROYYKFY-IHRRRGAJSA-N 0.000 description 1
- 102000004157 Hydrolases Human genes 0.000 description 1
- 108090000604 Hydrolases Proteins 0.000 description 1
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 1
- HLYBGMZJVDHJEO-CYDGBPFRSA-N Ile-Arg-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HLYBGMZJVDHJEO-CYDGBPFRSA-N 0.000 description 1
- SACHLUOUHCVIKI-GMOBBJLQSA-N Ile-Arg-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SACHLUOUHCVIKI-GMOBBJLQSA-N 0.000 description 1
- NULSANWBUWLTKN-NAKRPEOUSA-N Ile-Arg-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N NULSANWBUWLTKN-NAKRPEOUSA-N 0.000 description 1
- ZZHGKECPZXPXJF-PCBIJLKTSA-N Ile-Asn-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZZHGKECPZXPXJF-PCBIJLKTSA-N 0.000 description 1
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 1
- QIHJTGSVGIPHIW-QSFUFRPTSA-N Ile-Asn-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N QIHJTGSVGIPHIW-QSFUFRPTSA-N 0.000 description 1
- IDAHFEPYTJJZFD-PEFMBERDSA-N Ile-Asp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IDAHFEPYTJJZFD-PEFMBERDSA-N 0.000 description 1
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 1
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 1
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 1
- NPROWIBAWYMPAZ-GUDRVLHUSA-N Ile-Asp-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N NPROWIBAWYMPAZ-GUDRVLHUSA-N 0.000 description 1
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 1
- REJKOQYVFDEZHA-SLBDDTMCSA-N Ile-Asp-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N REJKOQYVFDEZHA-SLBDDTMCSA-N 0.000 description 1
- AQTWDZDISVGCAC-CFMVVWHZSA-N Ile-Asp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AQTWDZDISVGCAC-CFMVVWHZSA-N 0.000 description 1
- VCYVLFAWCJRXFT-HJPIBITLSA-N Ile-Cys-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N VCYVLFAWCJRXFT-HJPIBITLSA-N 0.000 description 1
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 1
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 1
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 1
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 1
- IGJWJGIHUFQANP-LAEOZQHASA-N Ile-Gly-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N IGJWJGIHUFQANP-LAEOZQHASA-N 0.000 description 1
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 1
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 1
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 1
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 1
- UAQSZXGJGLHMNV-XEGUGMAKSA-N Ile-Gly-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N UAQSZXGJGLHMNV-XEGUGMAKSA-N 0.000 description 1
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 1
- JLWLMGADIQFKRD-QSFUFRPTSA-N Ile-His-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CN=CN1 JLWLMGADIQFKRD-QSFUFRPTSA-N 0.000 description 1
- VNDQNDYEPSXHLU-JUKXBJQTSA-N Ile-His-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N VNDQNDYEPSXHLU-JUKXBJQTSA-N 0.000 description 1
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 1
- RQQCJTLBSJMVCR-DSYPUSFNSA-N Ile-Leu-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N RQQCJTLBSJMVCR-DSYPUSFNSA-N 0.000 description 1
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 1
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 1
- FFJQAEYLAQMGDL-MGHWNKPDSA-N Ile-Lys-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FFJQAEYLAQMGDL-MGHWNKPDSA-N 0.000 description 1
- RVNOXPZHMUWCLW-GMOBBJLQSA-N Ile-Met-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N RVNOXPZHMUWCLW-GMOBBJLQSA-N 0.000 description 1
- VOCZPDONPURUHV-QEWYBTABSA-N Ile-Phe-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VOCZPDONPURUHV-QEWYBTABSA-N 0.000 description 1
- OWSWUWDMSNXTNE-GMOBBJLQSA-N Ile-Pro-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N OWSWUWDMSNXTNE-GMOBBJLQSA-N 0.000 description 1
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 1
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 1
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 1
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 1
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 1
- HJDZMPFEXINXLO-QPHKQPEJSA-N Ile-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N HJDZMPFEXINXLO-QPHKQPEJSA-N 0.000 description 1
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 1
- PRTZQMBYUZFSFA-XEGUGMAKSA-N Ile-Tyr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)NCC(=O)O)N PRTZQMBYUZFSFA-XEGUGMAKSA-N 0.000 description 1
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 1
- NXRNRBOKDBIVKQ-CXTHYWKRSA-N Ile-Tyr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N NXRNRBOKDBIVKQ-CXTHYWKRSA-N 0.000 description 1
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 1
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 1
- 108091030087 Initiator element Proteins 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 108090000769 Isomerases Proteins 0.000 description 1
- 241000170280 Kluyveromyces sp. Species 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 1
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 1
- 101710186049 L-fuculokinase Proteins 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- 241000186660 Lactobacillus Species 0.000 description 1
- 240000001046 Lactobacillus acidophilus Species 0.000 description 1
- 235000013956 Lactobacillus acidophilus Nutrition 0.000 description 1
- 241000218492 Lactobacillus crispatus Species 0.000 description 1
- 241000186606 Lactobacillus gasseri Species 0.000 description 1
- 241001561398 Lactobacillus jensenii Species 0.000 description 1
- 240000006024 Lactobacillus plantarum Species 0.000 description 1
- 235000013965 Lactobacillus plantarum Nutrition 0.000 description 1
- 241000186604 Lactobacillus reuteri Species 0.000 description 1
- 241000186869 Lactobacillus salivarius Species 0.000 description 1
- 241000194036 Lactococcus Species 0.000 description 1
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 1
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 1
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 1
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 1
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 1
- WXHFZJFZWNCDNB-KKUMJFAQSA-N Leu-Asn-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXHFZJFZWNCDNB-KKUMJFAQSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 1
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- NFHJQETXTSDZSI-DCAQKATOSA-N Leu-Cys-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NFHJQETXTSDZSI-DCAQKATOSA-N 0.000 description 1
- HUEBCHPSXSQUGN-GARJFASQSA-N Leu-Cys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N HUEBCHPSXSQUGN-GARJFASQSA-N 0.000 description 1
- DLCXCECTCPKKCD-GUBZILKMSA-N Leu-Gln-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DLCXCECTCPKKCD-GUBZILKMSA-N 0.000 description 1
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 1
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 1
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 1
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 1
- LLBQJYDYOLIQAI-JYJNAYRXSA-N Leu-Glu-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LLBQJYDYOLIQAI-JYJNAYRXSA-N 0.000 description 1
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 1
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 1
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 1
- XQXGNBFMAXWIGI-MXAVVETBSA-N Leu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 XQXGNBFMAXWIGI-MXAVVETBSA-N 0.000 description 1
- WRLPVDVHNWSSCL-MELADBBJSA-N Leu-His-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N WRLPVDVHNWSSCL-MELADBBJSA-N 0.000 description 1
- SGIIOQQGLUUMDQ-IHRRRGAJSA-N Leu-His-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N SGIIOQQGLUUMDQ-IHRRRGAJSA-N 0.000 description 1
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 1
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 1
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 description 1
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 1
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 1
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 1
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 1
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 1
- KXCMQWMNYQOAKA-SRVKXCTJSA-N Leu-Met-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KXCMQWMNYQOAKA-SRVKXCTJSA-N 0.000 description 1
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 1
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 1
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 1
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 1
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 1
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 1
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 1
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 1
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 1
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 1
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 1
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- HOMFINRJHIIZNJ-HOCLYGCPSA-N Leu-Trp-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O HOMFINRJHIIZNJ-HOCLYGCPSA-N 0.000 description 1
- UIIMIKFNIYPDJF-WDSOQIARSA-N Leu-Trp-Met Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CCSC)C(O)=O)NC(=O)[C@@H](N)CC(C)C)=CNC2=C1 UIIMIKFNIYPDJF-WDSOQIARSA-N 0.000 description 1
- LFXSPAIBSZSTEM-PMVMPFDFSA-N Leu-Trp-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O)N LFXSPAIBSZSTEM-PMVMPFDFSA-N 0.000 description 1
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 1
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 1
- MPOHDJKRBLVGCT-CIUDSAMLSA-N Lys-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N MPOHDJKRBLVGCT-CIUDSAMLSA-N 0.000 description 1
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 1
- JCFYLFOCALSNLQ-GUBZILKMSA-N Lys-Ala-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JCFYLFOCALSNLQ-GUBZILKMSA-N 0.000 description 1
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 1
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 1
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 1
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 1
- ALSRJRIWBNENFY-DCAQKATOSA-N Lys-Arg-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O ALSRJRIWBNENFY-DCAQKATOSA-N 0.000 description 1
- BYPMOIFBQPEWOH-CIUDSAMLSA-N Lys-Asn-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BYPMOIFBQPEWOH-CIUDSAMLSA-N 0.000 description 1
- DEFGUIIUYAUEDU-ZPFDUUQYSA-N Lys-Asn-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DEFGUIIUYAUEDU-ZPFDUUQYSA-N 0.000 description 1
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 1
- KPJJOZUXFOLGMQ-CIUDSAMLSA-N Lys-Asp-Asn Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N KPJJOZUXFOLGMQ-CIUDSAMLSA-N 0.000 description 1
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 description 1
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 1
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 1
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 1
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 1
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 1
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 1
- KZOHPCYVORJBLG-AVGNSLFASA-N Lys-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N KZOHPCYVORJBLG-AVGNSLFASA-N 0.000 description 1
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 1
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 1
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 1
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 1
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 1
- SQJSXOQXJYAVRV-SRVKXCTJSA-N Lys-His-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N SQJSXOQXJYAVRV-SRVKXCTJSA-N 0.000 description 1
- WOEDRPCHKPSFDT-MXAVVETBSA-N Lys-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N WOEDRPCHKPSFDT-MXAVVETBSA-N 0.000 description 1
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 1
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 1
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 1
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 1
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 1
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 1
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 1
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 1
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 1
- URGPVYGVWLIRGT-DCAQKATOSA-N Lys-Met-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O URGPVYGVWLIRGT-DCAQKATOSA-N 0.000 description 1
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 1
- CRIODIGWCUPXKU-AVGNSLFASA-N Lys-Pro-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O CRIODIGWCUPXKU-AVGNSLFASA-N 0.000 description 1
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 1
- MIROMRNASYKZNL-ULQDDVLXSA-N Lys-Pro-Tyr Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MIROMRNASYKZNL-ULQDDVLXSA-N 0.000 description 1
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 1
- MGKFCQFVPKOWOL-CIUDSAMLSA-N Lys-Ser-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N MGKFCQFVPKOWOL-CIUDSAMLSA-N 0.000 description 1
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 1
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 1
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 1
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 1
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 1
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 1
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 1
- BDFHWFUAQLIMJO-KXNHARMFSA-N Lys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N)O BDFHWFUAQLIMJO-KXNHARMFSA-N 0.000 description 1
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 1
- RMKJOQSYLQQRFN-KKUMJFAQSA-N Lys-Tyr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O RMKJOQSYLQQRFN-KKUMJFAQSA-N 0.000 description 1
- XATKLFSXFINPSB-JYJNAYRXSA-N Lys-Tyr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O XATKLFSXFINPSB-JYJNAYRXSA-N 0.000 description 1
- XYLSGAWRCZECIQ-JYJNAYRXSA-N Lys-Tyr-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 XYLSGAWRCZECIQ-JYJNAYRXSA-N 0.000 description 1
- RQILLQOQXLZTCK-KBPBESRZSA-N Lys-Tyr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O RQILLQOQXLZTCK-KBPBESRZSA-N 0.000 description 1
- WINFHLHJTRGLCV-BZSNNMDCSA-N Lys-Tyr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 WINFHLHJTRGLCV-BZSNNMDCSA-N 0.000 description 1
- VVURYEVJJTXWNE-ULQDDVLXSA-N Lys-Tyr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O VVURYEVJJTXWNE-ULQDDVLXSA-N 0.000 description 1
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 1
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 1
- 239000006154 MacConkey agar Substances 0.000 description 1
- 102000003939 Membrane transport proteins Human genes 0.000 description 1
- 108090000301 Membrane transport proteins Proteins 0.000 description 1
- 201000009906 Meningitis Diseases 0.000 description 1
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 1
- MUYQDMBLDFEVRJ-LSJOCFKGSA-N Met-Ala-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 MUYQDMBLDFEVRJ-LSJOCFKGSA-N 0.000 description 1
- ULNXMMYXQKGNPG-LPEHRKFASA-N Met-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N ULNXMMYXQKGNPG-LPEHRKFASA-N 0.000 description 1
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 1
- BVXXDMUMHMXFER-BPNCWPANSA-N Met-Ala-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVXXDMUMHMXFER-BPNCWPANSA-N 0.000 description 1
- CWFYZYQMUDWGTI-GUBZILKMSA-N Met-Arg-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O CWFYZYQMUDWGTI-GUBZILKMSA-N 0.000 description 1
- OBVHKUFUDCPZDW-JYJNAYRXSA-N Met-Arg-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OBVHKUFUDCPZDW-JYJNAYRXSA-N 0.000 description 1
- SBSIKVMCCJUCBZ-GUBZILKMSA-N Met-Asn-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N SBSIKVMCCJUCBZ-GUBZILKMSA-N 0.000 description 1
- DRXODWRPPUFIAY-DCAQKATOSA-N Met-Asn-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN DRXODWRPPUFIAY-DCAQKATOSA-N 0.000 description 1
- HKRYNJSKVLZIFP-IHRRRGAJSA-N Met-Asn-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HKRYNJSKVLZIFP-IHRRRGAJSA-N 0.000 description 1
- KQBJYJXPZBNEIK-DCAQKATOSA-N Met-Glu-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQBJYJXPZBNEIK-DCAQKATOSA-N 0.000 description 1
- PQPMMGQTRQFSDA-SRVKXCTJSA-N Met-Glu-His Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O PQPMMGQTRQFSDA-SRVKXCTJSA-N 0.000 description 1
- YCUSPBPZVJDMII-YUMQZZPRSA-N Met-Gly-Glu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O YCUSPBPZVJDMII-YUMQZZPRSA-N 0.000 description 1
- SXWQMBGNFXAGAT-FJXKBIBVSA-N Met-Gly-Thr Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SXWQMBGNFXAGAT-FJXKBIBVSA-N 0.000 description 1
- BMHIFARYXOJDLD-WPRPVWTQSA-N Met-Gly-Val Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O BMHIFARYXOJDLD-WPRPVWTQSA-N 0.000 description 1
- FTQOFRPGLYXRFM-CYDGBPFRSA-N Met-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCSC)N FTQOFRPGLYXRFM-CYDGBPFRSA-N 0.000 description 1
- JYPITOUIQVSCKM-IHRRRGAJSA-N Met-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCSC)N JYPITOUIQVSCKM-IHRRRGAJSA-N 0.000 description 1
- ZRACLHJYVRBJFC-ULQDDVLXSA-N Met-Lys-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZRACLHJYVRBJFC-ULQDDVLXSA-N 0.000 description 1
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 1
- LHXFNWBNRBWMNV-DCAQKATOSA-N Met-Ser-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LHXFNWBNRBWMNV-DCAQKATOSA-N 0.000 description 1
- QYIGOFGUOVTAHK-ZJDVBMNYSA-N Met-Thr-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QYIGOFGUOVTAHK-ZJDVBMNYSA-N 0.000 description 1
- NSMXRFMGZYTFEX-KJEVXHAQSA-N Met-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCSC)N)O NSMXRFMGZYTFEX-KJEVXHAQSA-N 0.000 description 1
- ZBLSZPYQQRIHQU-RCWTZXSCSA-N Met-Thr-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ZBLSZPYQQRIHQU-RCWTZXSCSA-N 0.000 description 1
- RMLWDZINJUDMEB-IHRRRGAJSA-N Met-Tyr-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N RMLWDZINJUDMEB-IHRRRGAJSA-N 0.000 description 1
- VYDLZDRMOFYOGV-TUAOUCFPSA-N Met-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N VYDLZDRMOFYOGV-TUAOUCFPSA-N 0.000 description 1
- 241000192041 Micrococcus Species 0.000 description 1
- 241000187708 Micromonospora Species 0.000 description 1
- 101100519658 Mus musculus Pfkm gene Proteins 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- CILYIEBUXJIHCO-UITFWXMXSA-N N-acetyl-alpha-neuraminyl-(2->3)-beta-D-galactosyl-(1->4)-beta-D-glucose Chemical compound O1[C@@H]([C@H](O)[C@H](O)CO)[C@H](NC(=O)C)[C@@H](O)C[C@@]1(C(O)=O)O[C@@H]1[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](O)[C@H](O)O[C@@H]2CO)O)O[C@H](CO)[C@@H]1O CILYIEBUXJIHCO-UITFWXMXSA-N 0.000 description 1
- OVRNDRQMDRJTHS-FMDGEEDCSA-N N-acetyl-beta-D-glucosamine Chemical compound CC(=O)N[C@H]1[C@H](O)O[C@H](CO)[C@@H](O)[C@@H]1O OVRNDRQMDRJTHS-FMDGEEDCSA-N 0.000 description 1
- OIZGSVFYNBZVIK-UHFFFAOYSA-N N-acetylneuraminosyl-D-lactose Natural products O1C(C(O)C(O)CO)C(NC(=O)C)C(O)CC1(C(O)=O)OC1C(O)C(OC(C(O)CO)C(O)C(O)C=O)OC(CO)C1O OIZGSVFYNBZVIK-UHFFFAOYSA-N 0.000 description 1
- 108010047562 NGR peptide Proteins 0.000 description 1
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 1
- 241000588653 Neisseria Species 0.000 description 1
- 241000080590 Niso Species 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 241000588912 Pantoea agglomerans Species 0.000 description 1
- 241000588701 Pectobacterium carotovorum Species 0.000 description 1
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 1
- CYZBFPYMSJGBRL-DRZSPHRISA-N Phe-Ala-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CYZBFPYMSJGBRL-DRZSPHRISA-N 0.000 description 1
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 1
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 1
- LGBVMDMZZFYSFW-HJWJTTGWSA-N Phe-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CC=CC=C1)N LGBVMDMZZFYSFW-HJWJTTGWSA-N 0.000 description 1
- YYRCPTVAPLQRNC-ULQDDVLXSA-N Phe-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CC1=CC=CC=C1 YYRCPTVAPLQRNC-ULQDDVLXSA-N 0.000 description 1
- IWRZUGHCHFZYQZ-UFYCRDLUSA-N Phe-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 IWRZUGHCHFZYQZ-UFYCRDLUSA-N 0.000 description 1
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 1
- HCTXJGRYAACKOB-SRVKXCTJSA-N Phe-Asn-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HCTXJGRYAACKOB-SRVKXCTJSA-N 0.000 description 1
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 description 1
- WIVCOAKLPICYGY-KKUMJFAQSA-N Phe-Asp-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N WIVCOAKLPICYGY-KKUMJFAQSA-N 0.000 description 1
- DJPXNKUDJKGQEE-BZSNNMDCSA-N Phe-Asp-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DJPXNKUDJKGQEE-BZSNNMDCSA-N 0.000 description 1
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 1
- SWZKMTDPQXLQRD-XVSYOHENSA-N Phe-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWZKMTDPQXLQRD-XVSYOHENSA-N 0.000 description 1
- UMKYAYXCMYYNHI-AVGNSLFASA-N Phe-Gln-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N UMKYAYXCMYYNHI-AVGNSLFASA-N 0.000 description 1
- RJYBHZVWJPUSLB-QEWYBTABSA-N Phe-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N RJYBHZVWJPUSLB-QEWYBTABSA-N 0.000 description 1
- WYPVCIACUMJRIB-JYJNAYRXSA-N Phe-Gln-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N WYPVCIACUMJRIB-JYJNAYRXSA-N 0.000 description 1
- RLUMIJXNHJVUCO-JBACZVJFSA-N Phe-Gln-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 RLUMIJXNHJVUCO-JBACZVJFSA-N 0.000 description 1
- LWPMGKSZPKFKJD-DZKIICNBSA-N Phe-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O LWPMGKSZPKFKJD-DZKIICNBSA-N 0.000 description 1
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 1
- HGNGAMWHGGANAU-WHOFXGATSA-N Phe-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HGNGAMWHGGANAU-WHOFXGATSA-N 0.000 description 1
- MJQFZGOIVBDIMZ-WHOFXGATSA-N Phe-Ile-Gly Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O MJQFZGOIVBDIMZ-WHOFXGATSA-N 0.000 description 1
- KPEIBEPEUAZWNS-ULQDDVLXSA-N Phe-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KPEIBEPEUAZWNS-ULQDDVLXSA-N 0.000 description 1
- MSHZERMPZKCODG-ACRUOGEOSA-N Phe-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MSHZERMPZKCODG-ACRUOGEOSA-N 0.000 description 1
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 1
- ZIQQNOXKEFDPBE-BZSNNMDCSA-N Phe-Lys-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N ZIQQNOXKEFDPBE-BZSNNMDCSA-N 0.000 description 1
- DOXQMJCSSYZSNM-BZSNNMDCSA-N Phe-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O DOXQMJCSSYZSNM-BZSNNMDCSA-N 0.000 description 1
- YOFKMVUAZGPFCF-IHRRRGAJSA-N Phe-Met-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O YOFKMVUAZGPFCF-IHRRRGAJSA-N 0.000 description 1
- CBENHWCORLVGEQ-HJOGWXRNSA-N Phe-Phe-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CBENHWCORLVGEQ-HJOGWXRNSA-N 0.000 description 1
- MGLBSROLWAWCKN-FCLVOEFKSA-N Phe-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MGLBSROLWAWCKN-FCLVOEFKSA-N 0.000 description 1
- XOHJOMKCRLHGCY-UNQGMJICSA-N Phe-Pro-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOHJOMKCRLHGCY-UNQGMJICSA-N 0.000 description 1
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 1
- JXQVYPWVGUOIDV-MXAVVETBSA-N Phe-Ser-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JXQVYPWVGUOIDV-MXAVVETBSA-N 0.000 description 1
- MRWOVVNKSXXLRP-IHPCNDPISA-N Phe-Ser-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O MRWOVVNKSXXLRP-IHPCNDPISA-N 0.000 description 1
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 1
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- MSSXKZBDKZAHCX-UNQGMJICSA-N Phe-Thr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O MSSXKZBDKZAHCX-UNQGMJICSA-N 0.000 description 1
- BPIFSOUEUYDJRM-DCPHZVHLSA-N Phe-Trp-Ala Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](C)C(O)=O)C1=CC=CC=C1 BPIFSOUEUYDJRM-DCPHZVHLSA-N 0.000 description 1
- ZYNBEWGJFXTBDU-ACRUOGEOSA-N Phe-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N ZYNBEWGJFXTBDU-ACRUOGEOSA-N 0.000 description 1
- ZOGICTVLQDWPER-UFYCRDLUSA-N Phe-Tyr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O ZOGICTVLQDWPER-UFYCRDLUSA-N 0.000 description 1
- JSGWNFKWZNPDAV-YDHLFZDLSA-N Phe-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JSGWNFKWZNPDAV-YDHLFZDLSA-N 0.000 description 1
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 1
- GNZCMRRSXOBHLC-JYJNAYRXSA-N Phe-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N GNZCMRRSXOBHLC-JYJNAYRXSA-N 0.000 description 1
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 1
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 1
- GAMLAXHLYGLQBJ-UFYCRDLUSA-N Phe-Val-Tyr Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC1=CC=C(C=C1)O)C(C)C)CC1=CC=CC=C1 GAMLAXHLYGLQBJ-UFYCRDLUSA-N 0.000 description 1
- 102000030605 Phosphomannomutase Human genes 0.000 description 1
- 102000004160 Phosphoric Monoester Hydrolases Human genes 0.000 description 1
- 108090000608 Phosphoric Monoester Hydrolases Proteins 0.000 description 1
- 241000235648 Pichia Species 0.000 description 1
- 241000235061 Pichia sp. Species 0.000 description 1
- 235000010582 Pisum sativum Nutrition 0.000 description 1
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 1
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 1
- SSSFPISOZOLQNP-GUBZILKMSA-N Pro-Arg-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSFPISOZOLQNP-GUBZILKMSA-N 0.000 description 1
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 1
- UVKNEILZSJMKSR-FXQIFTODSA-N Pro-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 UVKNEILZSJMKSR-FXQIFTODSA-N 0.000 description 1
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 1
- SBYVDRLQAGENMY-DCAQKATOSA-N Pro-Asn-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O SBYVDRLQAGENMY-DCAQKATOSA-N 0.000 description 1
- FUVBEZJCRMHWEM-FXQIFTODSA-N Pro-Asn-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FUVBEZJCRMHWEM-FXQIFTODSA-N 0.000 description 1
- UTAUEDINXUMHLG-FXQIFTODSA-N Pro-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 UTAUEDINXUMHLG-FXQIFTODSA-N 0.000 description 1
- CJZTUKSFZUSNCC-FXQIFTODSA-N Pro-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 CJZTUKSFZUSNCC-FXQIFTODSA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- DEDANIDYQAPTFI-IHRRRGAJSA-N Pro-Asp-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DEDANIDYQAPTFI-IHRRRGAJSA-N 0.000 description 1
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 1
- LANQLYHLMYDWJP-SRVKXCTJSA-N Pro-Gln-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O LANQLYHLMYDWJP-SRVKXCTJSA-N 0.000 description 1
- SKICPQLTOXGWGO-GARJFASQSA-N Pro-Gln-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O SKICPQLTOXGWGO-GARJFASQSA-N 0.000 description 1
- UAYHMOIGIQZLFR-NHCYSSNCSA-N Pro-Gln-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UAYHMOIGIQZLFR-NHCYSSNCSA-N 0.000 description 1
- VDGTVWFMRXVQCT-GUBZILKMSA-N Pro-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 VDGTVWFMRXVQCT-GUBZILKMSA-N 0.000 description 1
- VPFGPKIWSDVTOY-SRVKXCTJSA-N Pro-Glu-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O VPFGPKIWSDVTOY-SRVKXCTJSA-N 0.000 description 1
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 1
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 1
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 1
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 1
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 1
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 1
- KWMUAKQOVYCQJQ-ZPFDUUQYSA-N Pro-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 KWMUAKQOVYCQJQ-ZPFDUUQYSA-N 0.000 description 1
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 description 1
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 1
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 1
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 1
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 1
- SXMSEHDMNIUTSP-DCAQKATOSA-N Pro-Lys-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SXMSEHDMNIUTSP-DCAQKATOSA-N 0.000 description 1
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 1
- WOIFYRZPIORBRY-AVGNSLFASA-N Pro-Lys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WOIFYRZPIORBRY-AVGNSLFASA-N 0.000 description 1
- KLOQCCRTPHPIFN-DCAQKATOSA-N Pro-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 KLOQCCRTPHPIFN-DCAQKATOSA-N 0.000 description 1
- MLKVIVZCFYRTIR-KKUMJFAQSA-N Pro-Phe-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLKVIVZCFYRTIR-KKUMJFAQSA-N 0.000 description 1
- JIWJRKNYLSHONY-KKUMJFAQSA-N Pro-Phe-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JIWJRKNYLSHONY-KKUMJFAQSA-N 0.000 description 1
- AJBQTGZIZQXBLT-STQMWFEESA-N Pro-Phe-Gly Chemical compound C([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 AJBQTGZIZQXBLT-STQMWFEESA-N 0.000 description 1
- JLMZKEQFMVORMA-SRVKXCTJSA-N Pro-Pro-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 JLMZKEQFMVORMA-SRVKXCTJSA-N 0.000 description 1
- NAIPAPCKKRCMBL-JYJNAYRXSA-N Pro-Pro-Phe Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=CC=C1 NAIPAPCKKRCMBL-JYJNAYRXSA-N 0.000 description 1
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 1
- UGDMQJSXSSZUKL-IHRRRGAJSA-N Pro-Ser-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O UGDMQJSXSSZUKL-IHRRRGAJSA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 1
- JDJMFMVVJHLWDP-UNQGMJICSA-N Pro-Thr-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JDJMFMVVJHLWDP-UNQGMJICSA-N 0.000 description 1
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 1
- BNUKRHFCHHLIGR-JYJNAYRXSA-N Pro-Trp-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CC(=O)O)C(=O)O BNUKRHFCHHLIGR-JYJNAYRXSA-N 0.000 description 1
- GNFHQWNCSSPOBT-ULQDDVLXSA-N Pro-Trp-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CCC(=O)N)C(=O)O GNFHQWNCSSPOBT-ULQDDVLXSA-N 0.000 description 1
- DMNANGOFEUVBRV-GJZGRUSLSA-N Pro-Trp-Gly Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(=O)O)C(=O)[C@@H]1CCCN1 DMNANGOFEUVBRV-GJZGRUSLSA-N 0.000 description 1
- VVEQUISRWJDGMX-VKOGCVSHSA-N Pro-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 VVEQUISRWJDGMX-VKOGCVSHSA-N 0.000 description 1
- ZAUHSLVPDLNTRZ-QXEWZRGKSA-N Pro-Val-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZAUHSLVPDLNTRZ-QXEWZRGKSA-N 0.000 description 1
- DGDCSVGVWWAJRS-AVGNSLFASA-N Pro-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 DGDCSVGVWWAJRS-AVGNSLFASA-N 0.000 description 1
- 102000001253 Protein Kinase Human genes 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- 101100214699 Pseudomonas aeruginosa aacC1 gene Proteins 0.000 description 1
- 241000589540 Pseudomonas fluorescens Species 0.000 description 1
- 108010079005 RDV peptide Proteins 0.000 description 1
- 108010052388 RGES peptide Proteins 0.000 description 1
- 102000009572 RNA Polymerase II Human genes 0.000 description 1
- 108010009460 RNA Polymerase II Proteins 0.000 description 1
- 241000316848 Rhodococcus <scale insect> Species 0.000 description 1
- 241000223252 Rhodotorula Species 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 1
- 241000235088 Saccharomyces sp. Species 0.000 description 1
- 241001360381 Saccharomycopsis sp. Species 0.000 description 1
- 241001138501 Salmonella enterica Species 0.000 description 1
- 241000720795 Schizosaccharomyces sp. Species 0.000 description 1
- FIXILCYTSAUERA-FXQIFTODSA-N Ser-Ala-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIXILCYTSAUERA-FXQIFTODSA-N 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 1
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 1
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 1
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 1
- PZZJMBYSYAKYPK-UWJYBYFXSA-N Ser-Ala-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PZZJMBYSYAKYPK-UWJYBYFXSA-N 0.000 description 1
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 1
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 1
- JJKSSJVYOVRJMZ-FXQIFTODSA-N Ser-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)CN=C(N)N JJKSSJVYOVRJMZ-FXQIFTODSA-N 0.000 description 1
- YUSRGTQIPCJNHQ-CIUDSAMLSA-N Ser-Arg-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YUSRGTQIPCJNHQ-CIUDSAMLSA-N 0.000 description 1
- VQBLHWSPVYYZTB-DCAQKATOSA-N Ser-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N VQBLHWSPVYYZTB-DCAQKATOSA-N 0.000 description 1
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 1
- WXUBSIDKNMFAGS-IHRRRGAJSA-N Ser-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXUBSIDKNMFAGS-IHRRRGAJSA-N 0.000 description 1
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 1
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 1
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 1
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 1
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 1
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 1
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 1
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 1
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 1
- KJMOINFQVCCSDX-XKBZYTNZSA-N Ser-Gln-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KJMOINFQVCCSDX-XKBZYTNZSA-N 0.000 description 1
- VDVYTKZBMFADQH-AVGNSLFASA-N Ser-Gln-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 VDVYTKZBMFADQH-AVGNSLFASA-N 0.000 description 1
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 1
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 1
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 1
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 1
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- SVWQEIRZHHNBIO-WHFBIAKZSA-N Ser-Gly-Cys Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CS)C(O)=O SVWQEIRZHHNBIO-WHFBIAKZSA-N 0.000 description 1
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 1
- IXCHOHLPHNGFTJ-YUMQZZPRSA-N Ser-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N IXCHOHLPHNGFTJ-YUMQZZPRSA-N 0.000 description 1
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 1
- CXBFHZLODKPIJY-AAEUAGOBSA-N Ser-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N CXBFHZLODKPIJY-AAEUAGOBSA-N 0.000 description 1
- MOQDPPUMFSMYOM-KKUMJFAQSA-N Ser-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CO)N MOQDPPUMFSMYOM-KKUMJFAQSA-N 0.000 description 1
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 1
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 1
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 1
- LWMQRHDTXHQQOV-MXAVVETBSA-N Ser-Ile-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LWMQRHDTXHQQOV-MXAVVETBSA-N 0.000 description 1
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 1
- GVIGVIOEYBOTCB-XIRDDKMYSA-N Ser-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC(C)C)C(O)=O)=CNC2=C1 GVIGVIOEYBOTCB-XIRDDKMYSA-N 0.000 description 1
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 1
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 1
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 1
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 1
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- XNXRTQZTFVMJIJ-DCAQKATOSA-N Ser-Met-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNXRTQZTFVMJIJ-DCAQKATOSA-N 0.000 description 1
- RXSWQCATLWVDLI-XGEHTFHBSA-N Ser-Met-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RXSWQCATLWVDLI-XGEHTFHBSA-N 0.000 description 1
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 1
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 1
- RWDVVSKYZBNDCO-MELADBBJSA-N Ser-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CO)N)C(=O)O RWDVVSKYZBNDCO-MELADBBJSA-N 0.000 description 1
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 1
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- WNDUPCKKKGSKIQ-CIUDSAMLSA-N Ser-Pro-Gln Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O WNDUPCKKKGSKIQ-CIUDSAMLSA-N 0.000 description 1
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 1
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 1
- DINQYZRMXGWWTG-GUBZILKMSA-N Ser-Pro-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DINQYZRMXGWWTG-GUBZILKMSA-N 0.000 description 1
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 1
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 1
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 1
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 1
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- DYEGLQRVMBWQLD-IXOXFDKPSA-N Ser-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CO)N)O DYEGLQRVMBWQLD-IXOXFDKPSA-N 0.000 description 1
- STIAINRLUUKYKM-WFBYXXMGSA-N Ser-Trp-Ala Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CO)=CNC2=C1 STIAINRLUUKYKM-WFBYXXMGSA-N 0.000 description 1
- BCAVNDNYOGTQMQ-AAEUAGOBSA-N Ser-Trp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O BCAVNDNYOGTQMQ-AAEUAGOBSA-N 0.000 description 1
- XTWXRUWACCXBMU-XIRDDKMYSA-N Ser-Trp-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)NC(=O)[C@H](CO)N XTWXRUWACCXBMU-XIRDDKMYSA-N 0.000 description 1
- ZWSZBWAFDZRBNM-UBHSHLNASA-N Ser-Trp-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ZWSZBWAFDZRBNM-UBHSHLNASA-N 0.000 description 1
- FGBLCMLXHRPVOF-IHRRRGAJSA-N Ser-Tyr-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FGBLCMLXHRPVOF-IHRRRGAJSA-N 0.000 description 1
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 1
- PLQWGQUNUPMNOD-KKUMJFAQSA-N Ser-Tyr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PLQWGQUNUPMNOD-KKUMJFAQSA-N 0.000 description 1
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- IAOHCSQDQDWRQU-GUBZILKMSA-N Ser-Val-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IAOHCSQDQDWRQU-GUBZILKMSA-N 0.000 description 1
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- QUOQJNYANJQSDA-MHQSSNGYSA-N Sialyllacto-N-tetraose a Chemical compound O1C([C@H](O)[C@H](O)CO)[C@H](NC(=O)C)[C@@H](O)C[C@@]1(C(O)=O)O[C@@H]1[C@@H](O)[C@H](OC2[C@H]([C@H](OC3[C@H]([C@H](O[C@H]([C@H](O)CO)[C@H](O)[C@@H](O)C=O)O[C@H](CO)[C@@H]3O)O)O[C@H](CO)[C@H]2O)NC(C)=O)O[C@H](CO)[C@@H]1O QUOQJNYANJQSDA-MHQSSNGYSA-N 0.000 description 1
- SFMRPVLZMVJKGZ-JRZQLMJNSA-N Sialyllacto-N-tetraose b Chemical compound O1[C@@H]([C@H](O)[C@H](O)CO)[C@H](NC(=O)C)[C@@H](O)C[C@@]1(C(O)=O)OC[C@@H]1[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O2)O)[C@@H](NC(C)=O)[C@H](O[C@@H]2[C@H]([C@H](O[C@H]([C@H](O)CO)[C@H](O)[C@@H](O)C=O)O[C@H](CO)[C@@H]2O)O)O1 SFMRPVLZMVJKGZ-JRZQLMJNSA-N 0.000 description 1
- 241000204117 Sporolactobacillus Species 0.000 description 1
- 241000194017 Streptococcus Species 0.000 description 1
- 244000057717 Streptococcus lactis Species 0.000 description 1
- 101710161145 Sugar efflux transporter Proteins 0.000 description 1
- 108700026226 TATA Box Proteins 0.000 description 1
- 241000520244 Tatumella citrea Species 0.000 description 1
- 102000002933 Thioredoxin Human genes 0.000 description 1
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 1
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 1
- DGDCHPCRMWEOJR-FQPOAREZSA-N Thr-Ala-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DGDCHPCRMWEOJR-FQPOAREZSA-N 0.000 description 1
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 1
- XYEXCEPTALHNEV-RCWTZXSCSA-N Thr-Arg-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XYEXCEPTALHNEV-RCWTZXSCSA-N 0.000 description 1
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 1
- JNQZPAWOPBZGIX-RCWTZXSCSA-N Thr-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N JNQZPAWOPBZGIX-RCWTZXSCSA-N 0.000 description 1
- VIBXMCZWVUOZLA-OLHMAJIHSA-N Thr-Asn-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VIBXMCZWVUOZLA-OLHMAJIHSA-N 0.000 description 1
- QGXCWPNQVCYJEL-NUMRIWBASA-N Thr-Asn-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGXCWPNQVCYJEL-NUMRIWBASA-N 0.000 description 1
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 1
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 1
- JTEICXDKGWKRRV-HJGDQZAQSA-N Thr-Asn-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JTEICXDKGWKRRV-HJGDQZAQSA-N 0.000 description 1
- LXWZOMSOUAMOIA-JIOCBJNQSA-N Thr-Asn-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O LXWZOMSOUAMOIA-JIOCBJNQSA-N 0.000 description 1
- JVTHIXKSVYEWNI-JRQIVUDYSA-N Thr-Asn-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JVTHIXKSVYEWNI-JRQIVUDYSA-N 0.000 description 1
- NLJKZUGAIIRWJN-LKXGYXEUSA-N Thr-Asp-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O NLJKZUGAIIRWJN-LKXGYXEUSA-N 0.000 description 1
- GKMYGVQDGVYCPC-IUKAMOBKSA-N Thr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H]([C@@H](C)O)N GKMYGVQDGVYCPC-IUKAMOBKSA-N 0.000 description 1
- KRPKYGOFYUNIGM-XVSYOHENSA-N Thr-Asp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O KRPKYGOFYUNIGM-XVSYOHENSA-N 0.000 description 1
- DCLBXIWHLVEPMQ-JRQIVUDYSA-N Thr-Asp-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DCLBXIWHLVEPMQ-JRQIVUDYSA-N 0.000 description 1
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 1
- UHBPFYOQQPFKQR-JHEQGTHGSA-N Thr-Gln-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UHBPFYOQQPFKQR-JHEQGTHGSA-N 0.000 description 1
- XXNLGZRRSKPSGF-HTUGSXCWSA-N Thr-Gln-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O XXNLGZRRSKPSGF-HTUGSXCWSA-N 0.000 description 1
- CQNFRKAKGDSJFR-NUMRIWBASA-N Thr-Glu-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CQNFRKAKGDSJFR-NUMRIWBASA-N 0.000 description 1
- WDFPMSHYMRBLKM-NKIYYHGXSA-N Thr-Glu-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O WDFPMSHYMRBLKM-NKIYYHGXSA-N 0.000 description 1
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 1
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 1
- VULNJDORNLBPNG-SWRJLBSHSA-N Thr-Glu-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VULNJDORNLBPNG-SWRJLBSHSA-N 0.000 description 1
- LKEKWDJCJSPXNI-IRIUXVKKSA-N Thr-Glu-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LKEKWDJCJSPXNI-IRIUXVKKSA-N 0.000 description 1
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 1
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 1
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 1
- ZTPXSEUVYNNZRB-CDMKHQONSA-N Thr-Gly-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZTPXSEUVYNNZRB-CDMKHQONSA-N 0.000 description 1
- YSXYEJWDHBCTDJ-DVJZZOLTSA-N Thr-Gly-Trp Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O YSXYEJWDHBCTDJ-DVJZZOLTSA-N 0.000 description 1
- JQAWYCUUFIMTHE-WLTAIBSBSA-N Thr-Gly-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JQAWYCUUFIMTHE-WLTAIBSBSA-N 0.000 description 1
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 description 1
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 1
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 1
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 1
- ISLDRLHVPXABBC-IEGACIPQSA-N Thr-Leu-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ISLDRLHVPXABBC-IEGACIPQSA-N 0.000 description 1
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 1
- QNCFWHZVRNXAKW-OEAJRASXSA-N Thr-Lys-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNCFWHZVRNXAKW-OEAJRASXSA-N 0.000 description 1
- XSEPSRUDSPHMPX-KATARQTJSA-N Thr-Lys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O XSEPSRUDSPHMPX-KATARQTJSA-N 0.000 description 1
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 1
- UJQVSMNQMQHVRY-KZVJFYERSA-N Thr-Met-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UJQVSMNQMQHVRY-KZVJFYERSA-N 0.000 description 1
- QFEYTTHKPSOFLV-OSUNSFLBSA-N Thr-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H]([C@@H](C)O)N QFEYTTHKPSOFLV-OSUNSFLBSA-N 0.000 description 1
- KPNSNVTUVKSBFL-ZJDVBMNYSA-N Thr-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KPNSNVTUVKSBFL-ZJDVBMNYSA-N 0.000 description 1
- WVVOFCVMHAXGLE-LFSVMHDDSA-N Thr-Phe-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O WVVOFCVMHAXGLE-LFSVMHDDSA-N 0.000 description 1
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 description 1
- VEIKMWOMUYMMMK-FCLVOEFKSA-N Thr-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VEIKMWOMUYMMMK-FCLVOEFKSA-N 0.000 description 1
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 1
- NYQIZWROIMIQSL-VEVYYDQMSA-N Thr-Pro-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O NYQIZWROIMIQSL-VEVYYDQMSA-N 0.000 description 1
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 1
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 1
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 1
- PRTHQBSMXILLPC-XGEHTFHBSA-N Thr-Ser-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PRTHQBSMXILLPC-XGEHTFHBSA-N 0.000 description 1
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 1
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 1
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 1
- BCYUHPXBHCUYBA-CUJWVEQBSA-N Thr-Ser-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BCYUHPXBHCUYBA-CUJWVEQBSA-N 0.000 description 1
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 1
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 1
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 1
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 1
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 1
- QYDKSNXSBXZPFK-ZJDVBMNYSA-N Thr-Thr-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYDKSNXSBXZPFK-ZJDVBMNYSA-N 0.000 description 1
- AAZOYLQUEQRUMZ-GSSVUCPTSA-N Thr-Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O AAZOYLQUEQRUMZ-GSSVUCPTSA-N 0.000 description 1
- MFMGPEKYBXFIRF-SUSMZKCASA-N Thr-Thr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFMGPEKYBXFIRF-SUSMZKCASA-N 0.000 description 1
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 1
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 1
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 1
- CSZFFQBUTMGHAH-UAXMHLISSA-N Thr-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O CSZFFQBUTMGHAH-UAXMHLISSA-N 0.000 description 1
- BJJRNAVDQGREGC-HOUAVDHOSA-N Thr-Trp-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O BJJRNAVDQGREGC-HOUAVDHOSA-N 0.000 description 1
- MYNYCUXMIIWUNW-IEGACIPQSA-N Thr-Trp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MYNYCUXMIIWUNW-IEGACIPQSA-N 0.000 description 1
- IJKNKFJZOJCKRR-GBALPHGKSA-N Thr-Trp-Ser Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 IJKNKFJZOJCKRR-GBALPHGKSA-N 0.000 description 1
- LXXCHJKHJYRMIY-FQPOAREZSA-N Thr-Tyr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O LXXCHJKHJYRMIY-FQPOAREZSA-N 0.000 description 1
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 1
- LVRFMARKDGGZMX-IZPVPAKOSA-N Thr-Tyr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=C(O)C=C1 LVRFMARKDGGZMX-IZPVPAKOSA-N 0.000 description 1
- YOPQYBJJNSIQGZ-JNPHEJMOSA-N Thr-Tyr-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 YOPQYBJJNSIQGZ-JNPHEJMOSA-N 0.000 description 1
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 1
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 1
- AXEJRUGTOJPZKG-XGEHTFHBSA-N Thr-Val-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N)O AXEJRUGTOJPZKG-XGEHTFHBSA-N 0.000 description 1
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 1
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 1
- BTAJAOWZCWOHBU-HSHDSVGOSA-N Thr-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)C(C)C)C(O)=O)=CNC2=C1 BTAJAOWZCWOHBU-HSHDSVGOSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- 108010022394 Threonine synthase Proteins 0.000 description 1
- 102000004357 Transferases Human genes 0.000 description 1
- 108090000992 Transferases Proteins 0.000 description 1
- QAXCHNZDPLSFPC-PJODQICGSA-N Trp-Ala-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QAXCHNZDPLSFPC-PJODQICGSA-N 0.000 description 1
- BDWDMRSGCXEDMR-WFBYXXMGSA-N Trp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N BDWDMRSGCXEDMR-WFBYXXMGSA-N 0.000 description 1
- CXUFDWZBHKUGKK-CABZTGNLSA-N Trp-Ala-Gly Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O)=CNC2=C1 CXUFDWZBHKUGKK-CABZTGNLSA-N 0.000 description 1
- ADBFWLXCCKIXBQ-XIRDDKMYSA-N Trp-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ADBFWLXCCKIXBQ-XIRDDKMYSA-N 0.000 description 1
- WEAPHMIKOICYAU-QEJZJMRPSA-N Trp-Cys-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O WEAPHMIKOICYAU-QEJZJMRPSA-N 0.000 description 1
- BORCDLUWGBGTKL-XIRDDKMYSA-N Trp-Gln-Met Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O)=CNC2=C1 BORCDLUWGBGTKL-XIRDDKMYSA-N 0.000 description 1
- CZWIHKFGHICAJX-BPUTZDHNSA-N Trp-Glu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 CZWIHKFGHICAJX-BPUTZDHNSA-N 0.000 description 1
- HXNVJPQADLRHGR-JBACZVJFSA-N Trp-Glu-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N HXNVJPQADLRHGR-JBACZVJFSA-N 0.000 description 1
- FNOQJVHFVLVMOS-AAEUAGOBSA-N Trp-Gly-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N FNOQJVHFVLVMOS-AAEUAGOBSA-N 0.000 description 1
- BEWOXKJJMBKRQL-AAEUAGOBSA-N Trp-Gly-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N BEWOXKJJMBKRQL-AAEUAGOBSA-N 0.000 description 1
- WVHUFSCKCBQKJW-HKUYNNGSSA-N Trp-Gly-Tyr Chemical compound C([C@H](NC(=O)CNC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 WVHUFSCKCBQKJW-HKUYNNGSSA-N 0.000 description 1
- HLDFBNPSURDYEN-VHWLVUOQSA-N Trp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N HLDFBNPSURDYEN-VHWLVUOQSA-N 0.000 description 1
- AIISTODACBDQLW-WDSOQIARSA-N Trp-Leu-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 AIISTODACBDQLW-WDSOQIARSA-N 0.000 description 1
- OGZRZMJASKKMJZ-XIRDDKMYSA-N Trp-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N OGZRZMJASKKMJZ-XIRDDKMYSA-N 0.000 description 1
- GWBWCGITOYODER-YTQUADARSA-N Trp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GWBWCGITOYODER-YTQUADARSA-N 0.000 description 1
- NWQCKAPDGQMZQN-IHPCNDPISA-N Trp-Lys-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O NWQCKAPDGQMZQN-IHPCNDPISA-N 0.000 description 1
- GRSCONMARGNYHA-PMVMPFDFSA-N Trp-Lys-Phe Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GRSCONMARGNYHA-PMVMPFDFSA-N 0.000 description 1
- GQEXFCQNAJHJTI-IHPCNDPISA-N Trp-Phe-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GQEXFCQNAJHJTI-IHPCNDPISA-N 0.000 description 1
- XGFOXYJQBRTJPO-PJODQICGSA-N Trp-Pro-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XGFOXYJQBRTJPO-PJODQICGSA-N 0.000 description 1
- BIBZRFIKOLGWFQ-XIRDDKMYSA-N Trp-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O BIBZRFIKOLGWFQ-XIRDDKMYSA-N 0.000 description 1
- IKUMWSDCGQVGHC-UMPQAUOISA-N Trp-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)O IKUMWSDCGQVGHC-UMPQAUOISA-N 0.000 description 1
- IQIRAJGHFRVFEL-UBHSHLNASA-N Trp-Ser-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N IQIRAJGHFRVFEL-UBHSHLNASA-N 0.000 description 1
- KXFYAQUYJKOQMI-QEJZJMRPSA-N Trp-Ser-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 KXFYAQUYJKOQMI-QEJZJMRPSA-N 0.000 description 1
- UIRPULWLRODAEQ-QEJZJMRPSA-N Trp-Ser-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 UIRPULWLRODAEQ-QEJZJMRPSA-N 0.000 description 1
- GEGYPBOPIGNZIF-CWRNSKLLSA-N Trp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O GEGYPBOPIGNZIF-CWRNSKLLSA-N 0.000 description 1
- GSCPHMSPGQSZJT-JYBASQMISA-N Trp-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O GSCPHMSPGQSZJT-JYBASQMISA-N 0.000 description 1
- DDHFMBDACJYSKW-AQZXSJQPSA-N Trp-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O DDHFMBDACJYSKW-AQZXSJQPSA-N 0.000 description 1
- YCQKQFKXBPJXRY-PMVMPFDFSA-N Trp-Tyr-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)N[C@@H](CCCCN)C(=O)O)N YCQKQFKXBPJXRY-PMVMPFDFSA-N 0.000 description 1
- UIRVSEPRMWDVEW-RNXOBYDBSA-N Trp-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N UIRVSEPRMWDVEW-RNXOBYDBSA-N 0.000 description 1
- ICPRIGUXAFULPH-ILWGZMRPSA-N Trp-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N)C(=O)O ICPRIGUXAFULPH-ILWGZMRPSA-N 0.000 description 1
- SSSDKJMQMZTMJP-BVSLBCMMSA-N Trp-Tyr-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=C(O)C=C1 SSSDKJMQMZTMJP-BVSLBCMMSA-N 0.000 description 1
- RKISDJMICOREEL-QRTARXTBSA-N Trp-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RKISDJMICOREEL-QRTARXTBSA-N 0.000 description 1
- AKFLVKKWVZMFOT-IHRRRGAJSA-N Tyr-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AKFLVKKWVZMFOT-IHRRRGAJSA-N 0.000 description 1
- XHALUUQSNXSPLP-UFYCRDLUSA-N Tyr-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XHALUUQSNXSPLP-UFYCRDLUSA-N 0.000 description 1
- PZXUIGWOEWWFQM-SRVKXCTJSA-N Tyr-Asn-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O PZXUIGWOEWWFQM-SRVKXCTJSA-N 0.000 description 1
- MBFJIHUHHCJBSN-AVGNSLFASA-N Tyr-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MBFJIHUHHCJBSN-AVGNSLFASA-N 0.000 description 1
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 1
- VTFWAGGJDRSQFG-MELADBBJSA-N Tyr-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O VTFWAGGJDRSQFG-MELADBBJSA-N 0.000 description 1
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 1
- BARBHMSSVWPKPZ-IHRRRGAJSA-N Tyr-Asp-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BARBHMSSVWPKPZ-IHRRRGAJSA-N 0.000 description 1
- JWHOIHCOHMZSAR-QWRGUYRKSA-N Tyr-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JWHOIHCOHMZSAR-QWRGUYRKSA-N 0.000 description 1
- QHEGAOPHISYNDF-XDTLVQLUSA-N Tyr-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHEGAOPHISYNDF-XDTLVQLUSA-N 0.000 description 1
- UXUFNBVCPAWACG-SIUGBPQLSA-N Tyr-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N UXUFNBVCPAWACG-SIUGBPQLSA-N 0.000 description 1
- WVRUKYLYMFGKAN-IHRRRGAJSA-N Tyr-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 WVRUKYLYMFGKAN-IHRRRGAJSA-N 0.000 description 1
- NZFCWALTLNFHHC-JYJNAYRXSA-N Tyr-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NZFCWALTLNFHHC-JYJNAYRXSA-N 0.000 description 1
- FMOSEWZYZPMJAL-KKUMJFAQSA-N Tyr-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N FMOSEWZYZPMJAL-KKUMJFAQSA-N 0.000 description 1
- CNLKDWSAORJEMW-KWQFWETISA-N Tyr-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O CNLKDWSAORJEMW-KWQFWETISA-N 0.000 description 1
- PMDWYLVWHRTJIW-STQMWFEESA-N Tyr-Gly-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PMDWYLVWHRTJIW-STQMWFEESA-N 0.000 description 1
- LTSIAOZUVISRAQ-QWRGUYRKSA-N Tyr-Gly-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)O LTSIAOZUVISRAQ-QWRGUYRKSA-N 0.000 description 1
- GIOBXJSONRQHKQ-RYUDHWBXSA-N Tyr-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GIOBXJSONRQHKQ-RYUDHWBXSA-N 0.000 description 1
- FNWGDMZVYBVAGJ-XEGUGMAKSA-N Tyr-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CC=C(C=C1)O)N FNWGDMZVYBVAGJ-XEGUGMAKSA-N 0.000 description 1
- NOOMDULIORCDNF-IRXDYDNUSA-N Tyr-Gly-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NOOMDULIORCDNF-IRXDYDNUSA-N 0.000 description 1
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 1
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 1
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 1
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 1
- YKCXQOBTISTQJD-BZSNNMDCSA-N Tyr-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YKCXQOBTISTQJD-BZSNNMDCSA-N 0.000 description 1
- QHLIUFUEUDFAOT-MGHWNKPDSA-N Tyr-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHLIUFUEUDFAOT-MGHWNKPDSA-N 0.000 description 1
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 1
- WDGDKHLSDIOXQC-ACRUOGEOSA-N Tyr-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WDGDKHLSDIOXQC-ACRUOGEOSA-N 0.000 description 1
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 1
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 1
- FMXFHNSFABRVFZ-BZSNNMDCSA-N Tyr-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FMXFHNSFABRVFZ-BZSNNMDCSA-N 0.000 description 1
- CNNVVEPJTFOGHI-ACRUOGEOSA-N Tyr-Lys-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNNVVEPJTFOGHI-ACRUOGEOSA-N 0.000 description 1
- PMHLLBKTDHQMCY-ULQDDVLXSA-N Tyr-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMHLLBKTDHQMCY-ULQDDVLXSA-N 0.000 description 1
- UBKKNELWDCBNCF-STQMWFEESA-N Tyr-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UBKKNELWDCBNCF-STQMWFEESA-N 0.000 description 1
- LMKKMCGTDANZTR-BZSNNMDCSA-N Tyr-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LMKKMCGTDANZTR-BZSNNMDCSA-N 0.000 description 1
- RGYCVIZZTUBSSG-JYJNAYRXSA-N Tyr-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O RGYCVIZZTUBSSG-JYJNAYRXSA-N 0.000 description 1
- VYQQQIRHIFALGE-UWJYBYFXSA-N Tyr-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VYQQQIRHIFALGE-UWJYBYFXSA-N 0.000 description 1
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 1
- NHOVZGFNTGMYMI-KKUMJFAQSA-N Tyr-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NHOVZGFNTGMYMI-KKUMJFAQSA-N 0.000 description 1
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 1
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 1
- XUIOBCQESNDTDE-FQPOAREZSA-N Tyr-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XUIOBCQESNDTDE-FQPOAREZSA-N 0.000 description 1
- ITDWWLTTWRRLCC-KJEVXHAQSA-N Tyr-Thr-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ITDWWLTTWRRLCC-KJEVXHAQSA-N 0.000 description 1
- UUBKSZNKJUJQEJ-JRQIVUDYSA-N Tyr-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UUBKSZNKJUJQEJ-JRQIVUDYSA-N 0.000 description 1
- NZBSVMQZQMEUHI-WZLNRYEVSA-N Tyr-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NZBSVMQZQMEUHI-WZLNRYEVSA-N 0.000 description 1
- CLEGSEJVGBYZBJ-MEYUZBJRSA-N Tyr-Thr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CLEGSEJVGBYZBJ-MEYUZBJRSA-N 0.000 description 1
- XFEMMSGONWQACR-KJEVXHAQSA-N Tyr-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XFEMMSGONWQACR-KJEVXHAQSA-N 0.000 description 1
- GAKBTSMAPGLQFA-JNPHEJMOSA-N Tyr-Thr-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 GAKBTSMAPGLQFA-JNPHEJMOSA-N 0.000 description 1
- HMPMGPISLMLHSI-JBACZVJFSA-N Tyr-Trp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N HMPMGPISLMLHSI-JBACZVJFSA-N 0.000 description 1
- HZDQUVQEVVYDDA-ACRUOGEOSA-N Tyr-Tyr-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HZDQUVQEVVYDDA-ACRUOGEOSA-N 0.000 description 1
- RGJZPXFZIUUQDN-BPNCWPANSA-N Tyr-Val-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O RGJZPXFZIUUQDN-BPNCWPANSA-N 0.000 description 1
- AEOFMCAKYIQQFY-YDHLFZDLSA-N Tyr-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AEOFMCAKYIQQFY-YDHLFZDLSA-N 0.000 description 1
- RVGVIWNHABGIFH-IHRRRGAJSA-N Tyr-Val-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O RVGVIWNHABGIFH-IHRRRGAJSA-N 0.000 description 1
- YKBUNNNRNZZUID-UFYCRDLUSA-N Tyr-Val-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YKBUNNNRNZZUID-UFYCRDLUSA-N 0.000 description 1
- DJIJBQYBDKGDIS-JYJNAYRXSA-N Tyr-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O DJIJBQYBDKGDIS-JYJNAYRXSA-N 0.000 description 1
- 101710196080 UDP-glucose:undecaprenyl-phosphate glucose-1-phosphate transferase Proteins 0.000 description 1
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 1
- LTFLDDDGWOVIHY-NAKRPEOUSA-N Val-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N LTFLDDDGWOVIHY-NAKRPEOUSA-N 0.000 description 1
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 1
- LABUITCFCAABSV-BPNCWPANSA-N Val-Ala-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LABUITCFCAABSV-BPNCWPANSA-N 0.000 description 1
- UUYCNAXCCDNULB-QXEWZRGKSA-N Val-Arg-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O UUYCNAXCCDNULB-QXEWZRGKSA-N 0.000 description 1
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 1
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 1
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 1
- AUMNPAUHKUNHHN-BYULHYEWSA-N Val-Asn-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N AUMNPAUHKUNHHN-BYULHYEWSA-N 0.000 description 1
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 description 1
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 1
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 1
- NMPXRFYMZDIBRF-ZOBUZTSGSA-N Val-Asn-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N NMPXRFYMZDIBRF-ZOBUZTSGSA-N 0.000 description 1
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 1
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 1
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 1
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 1
- COSLEEOIYRPTHD-YDHLFZDLSA-N Val-Asp-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 COSLEEOIYRPTHD-YDHLFZDLSA-N 0.000 description 1
- PWRITNSESKQTPW-NRPADANISA-N Val-Gln-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N PWRITNSESKQTPW-NRPADANISA-N 0.000 description 1
- UZDHNIJRRTUKKC-DLOVCJGASA-N Val-Gln-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UZDHNIJRRTUKKC-DLOVCJGASA-N 0.000 description 1
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 1
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 1
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 1
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 1
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 1
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 1
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 1
- BEGDZYNDCNEGJZ-XVKPBYJWSA-N Val-Gly-Gln Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O BEGDZYNDCNEGJZ-XVKPBYJWSA-N 0.000 description 1
- FXVDGDZRYLFQKY-WPRPVWTQSA-N Val-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C FXVDGDZRYLFQKY-WPRPVWTQSA-N 0.000 description 1
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 1
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 1
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 1
- APEBUJBRGCMMHP-HJWJTTGWSA-N Val-Ile-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 APEBUJBRGCMMHP-HJWJTTGWSA-N 0.000 description 1
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 1
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 1
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 1
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 1
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 1
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 1
- HPANGHISDXDUQY-ULQDDVLXSA-N Val-Lys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HPANGHISDXDUQY-ULQDDVLXSA-N 0.000 description 1
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 1
- OJOMXGVLFKYDKP-QXEWZRGKSA-N Val-Met-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OJOMXGVLFKYDKP-QXEWZRGKSA-N 0.000 description 1
- MGVYZTPLGXPVQB-CYDGBPFRSA-N Val-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MGVYZTPLGXPVQB-CYDGBPFRSA-N 0.000 description 1
- WSUWDIVCPOJFCX-TUAOUCFPSA-N Val-Met-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N WSUWDIVCPOJFCX-TUAOUCFPSA-N 0.000 description 1
- MJFSRZZJQWZHFQ-SRVKXCTJSA-N Val-Met-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N MJFSRZZJQWZHFQ-SRVKXCTJSA-N 0.000 description 1
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 1
- UXODSMTVPWXHBT-ULQDDVLXSA-N Val-Phe-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N UXODSMTVPWXHBT-ULQDDVLXSA-N 0.000 description 1
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 1
- LGXUZJIQCGXKGZ-QXEWZRGKSA-N Val-Pro-Asn Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N LGXUZJIQCGXKGZ-QXEWZRGKSA-N 0.000 description 1
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 1
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 1
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 1
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 1
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 1
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 1
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 1
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 1
- YLBNZCJFSVJDRJ-KJEVXHAQSA-N Val-Thr-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O YLBNZCJFSVJDRJ-KJEVXHAQSA-N 0.000 description 1
- ZLMFVXMJFIWIRE-FHWLQOOXSA-N Val-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](C(C)C)N ZLMFVXMJFIWIRE-FHWLQOOXSA-N 0.000 description 1
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 1
- MIAZWUMFUURQNP-YDHLFZDLSA-N Val-Tyr-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N MIAZWUMFUURQNP-YDHLFZDLSA-N 0.000 description 1
- VTIAEOKFUJJBTC-YDHLFZDLSA-N Val-Tyr-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VTIAEOKFUJJBTC-YDHLFZDLSA-N 0.000 description 1
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 1
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 1
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 1
- LMVWCLDJNSBOEA-FKBYEOEOSA-N Val-Tyr-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N LMVWCLDJNSBOEA-FKBYEOEOSA-N 0.000 description 1
- ZNGPROMGGGFOAA-JYJNAYRXSA-N Val-Tyr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 ZNGPROMGGGFOAA-JYJNAYRXSA-N 0.000 description 1
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 1
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 1
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 1
- OIRDTQYFTABQOQ-UHTZMRCNSA-N Vidarabine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@@H]1O OIRDTQYFTABQOQ-UHTZMRCNSA-N 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 241000589636 Xanthomonas campestris Species 0.000 description 1
- 241000157303 Xanthomonas phaseoli pv. manihotis Species 0.000 description 1
- 241000490645 Yarrowia sp. Species 0.000 description 1
- 241000779672 Yersinia bercovieri ATCC 43970 Species 0.000 description 1
- 241000193453 [Clostridium] cellulolyticum Species 0.000 description 1
- RCITVHFNWJIDNA-UHFFFAOYSA-K [NH4+].[NH4+].[NH4+].[Fe+3].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O.[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O Chemical compound [NH4+].[NH4+].[NH4+].[Fe+3].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O.[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O RCITVHFNWJIDNA-UHFFFAOYSA-K 0.000 description 1
- 101150095147 aacC1 gene Proteins 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 238000005903 acid hydrolysis reaction Methods 0.000 description 1
- 238000005273 aeration Methods 0.000 description 1
- 238000013019 agitation Methods 0.000 description 1
- 108010078114 alanyl-tryptophyl-alanine Proteins 0.000 description 1
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- RNBGYGVWRKECFJ-ZXXMMSQZSA-N alpha-D-fructofuranose 1,6-bisphosphate Chemical compound O[C@H]1[C@H](O)[C@](O)(COP(O)(O)=O)O[C@@H]1COP(O)(O)=O RNBGYGVWRKECFJ-ZXXMMSQZSA-N 0.000 description 1
- HXXFSFRBOHSIMQ-RWOPYEJCSA-L alpha-D-mannose 1-phosphate(2-) Chemical compound OC[C@H]1O[C@H](OP([O-])([O-])=O)[C@@H](O)[C@@H](O)[C@@H]1O HXXFSFRBOHSIMQ-RWOPYEJCSA-L 0.000 description 1
- RQNFGIWYOACERD-OCQMRBNYSA-N alpha-L-Fucp-(1->4)-[alpha-L-Fucp-(1->2)-beta-D-Galp-(1->3)]-beta-D-GlcpNAc-(1->3)-beta-D-Galp-(1->4)-D-Glcp Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@H]1O[C@H]1[C@H](O[C@H]2[C@@H]([C@@H](CO)O[C@@H](O[C@@H]3[C@H]([C@H](O[C@@H]4[C@H](OC(O)[C@H](O)[C@H]4O)CO)O[C@H](CO)[C@@H]3O)O)[C@@H]2NC(C)=O)O[C@H]2[C@H]([C@H](O)[C@H](O)[C@H](C)O2)O)O[C@H](CO)[C@H](O)[C@@H]1O RQNFGIWYOACERD-OCQMRBNYSA-N 0.000 description 1
- 150000001408 amides Chemical class 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- OIRDTQYFTABQOQ-UHFFFAOYSA-N ara-adenosine Natural products Nc1ncnc2n(cnc12)C1OC(CO)C(O)C1O OIRDTQYFTABQOQ-UHFFFAOYSA-N 0.000 description 1
- 101150035354 araA gene Proteins 0.000 description 1
- PYMYPHUHKUWMLA-WDCZJNDASA-N arabinose Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)C=O PYMYPHUHKUWMLA-WDCZJNDASA-N 0.000 description 1
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 1
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 1
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 1
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 108010036533 arginylvaline Proteins 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 108010064886 beta-D-galactoside alpha 2-6-sialyltransferase Proteins 0.000 description 1
- 102000005936 beta-Galactosidase Human genes 0.000 description 1
- 108010005774 beta-Galactosidase Proteins 0.000 description 1
- 102000007478 beta-N-Acetylhexosaminidases Human genes 0.000 description 1
- 108010085377 beta-N-Acetylhexosaminidases Proteins 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 239000006172 buffering agent Substances 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 230000003930 cognitive ability Effects 0.000 description 1
- 230000000112 colonic effect Effects 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- OXBLHERUFWYNTN-UHFFFAOYSA-M copper(I) chloride Chemical compound [Cu]Cl OXBLHERUFWYNTN-UHFFFAOYSA-M 0.000 description 1
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000000593 degrading effect Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 235000007882 dietary composition Nutrition 0.000 description 1
- 108010009297 diglycyl-histidine Proteins 0.000 description 1
- 150000002016 disaccharides Chemical class 0.000 description 1
- FCIROHDMPFOSFG-LAVSNGQLSA-N disialyllacto-N-tetraose Chemical compound O1[C@@H]([C@H](O)[C@H](O)CO)[C@H](NC(=O)C)[C@@H](O)C[C@@]1(C(O)=O)OC[C@@H]1[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](O[C@]3(O[C@H]([C@H](NC(C)=O)[C@@H](O)C3)[C@H](O)[C@H](O)CO)C(O)=O)[C@@H](O)[C@@H](CO)O2)O)[C@@H](NC(C)=O)[C@H](O[C@@H]2[C@H]([C@H](O[C@H]3[C@@H]([C@@H](O)C(O)O[C@@H]3CO)O)O[C@H](CO)[C@@H]2O)O)O1 FCIROHDMPFOSFG-LAVSNGQLSA-N 0.000 description 1
- 239000003480 eluent Substances 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 210000001723 extracellular space Anatomy 0.000 description 1
- 101150111583 fda gene Proteins 0.000 description 1
- 235000013350 formula milk Nutrition 0.000 description 1
- RNBGYGVWRKECFJ-UHFFFAOYSA-N fructose-1,6-phosphate Natural products OC1C(O)C(O)(COP(O)(O)=O)OC1COP(O)(O)=O RNBGYGVWRKECFJ-UHFFFAOYSA-N 0.000 description 1
- 101150025078 fucK gene Proteins 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 108091008053 gene clusters Proteins 0.000 description 1
- 238000012224 gene deletion Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 235000003869 genetically modified organism Nutrition 0.000 description 1
- 238000010362 genome editing Methods 0.000 description 1
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 1
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 1
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 1
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 1
- 108010033719 glycyl-histidyl-glycine Proteins 0.000 description 1
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 1
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 1
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 1
- 108010020688 glycylhistidine Proteins 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- QGWNDRXFNXRZMB-UHFFFAOYSA-N guanidine diphosphate Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1OC(COP(O)(=O)OP(O)(O)=O)C(O)C1O QGWNDRXFNXRZMB-UHFFFAOYSA-N 0.000 description 1
- 108010040030 histidinoalanine Proteins 0.000 description 1
- 108010050343 histidyl-alanyl-glutamine Proteins 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 210000000987 immune system Anatomy 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 235000021125 infant nutrition Nutrition 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 239000002054 inoculum Substances 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000005342 ion exchange Methods 0.000 description 1
- 150000002500 ions Chemical group 0.000 description 1
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 101150001899 lacY gene Proteins 0.000 description 1
- 101150066555 lacZ gene Proteins 0.000 description 1
- 229930187173 lacto-N-Difucosylhexaose Natural products 0.000 description 1
- RQNFGIWYOACERD-UHFFFAOYSA-N lacto-N-Difucosylhexaose I Natural products OC1C(O)C(O)C(C)OC1OC1C(OC2C(C(CO)OC(OC3C(C(OC4C(OC(O)C(O)C4O)CO)OC(CO)C3O)O)C2NC(C)=O)OC2C(C(O)C(O)C(C)O2)O)OC(CO)C(O)C1O RQNFGIWYOACERD-UHFFFAOYSA-N 0.000 description 1
- OQIUPKPUOLIHHS-UHFFFAOYSA-N lacto-N-difucohexaose I Natural products OC1C(O)C(O)C(C)OC1OC1C(OC2C(C(CO)OC(OC3C(C(OC(C(O)CO)C(O)C(O)C=O)OC(CO)C3O)O)C2NC(C)=O)OC2C(C(O)C(O)C(C)O2)O)OC(CO)C(O)C1O OQIUPKPUOLIHHS-UHFFFAOYSA-N 0.000 description 1
- 229940039696 lactobacillus Drugs 0.000 description 1
- 229940039695 lactobacillus acidophilus Drugs 0.000 description 1
- 229940072205 lactobacillus plantarum Drugs 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 1
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 1
- 108010091871 leucylmethionine Proteins 0.000 description 1
- 101150006217 lex1 gene Proteins 0.000 description 1
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010012988 lysyl-glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 108010076718 lysyl-glutamyl-tryptophan Proteins 0.000 description 1
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 1
- 101150088678 manB gene Proteins 0.000 description 1
- 101150032120 manC gene Proteins 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- BCVXHSPFUWZLGQ-UHFFFAOYSA-N mecn acetonitrile Chemical compound CC#N.CC#N BCVXHSPFUWZLGQ-UHFFFAOYSA-N 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 238000012269 metabolic engineering Methods 0.000 description 1
- 230000037353 metabolic pathway Effects 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 229950006780 n-acetylglucosamine Drugs 0.000 description 1
- 238000002414 normal-phase solid-phase extraction Methods 0.000 description 1
- 238000010899 nucleation Methods 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 239000012071 phase Substances 0.000 description 1
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 1
- 108010065135 phenylalanyl-phenylalanyl-phenylalanine Proteins 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 235000021317 phosphate Nutrition 0.000 description 1
- 108091000115 phosphomannomutase Proteins 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 239000011148 porous material Substances 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 230000035755 proliferation Effects 0.000 description 1
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 238000004064 recycling Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000037425 regulation of transcription Effects 0.000 description 1
- 230000008439 repair process Effects 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- SXMGGNXBTZBGLU-UHFFFAOYSA-N sialyllacto-n-tetraose c Chemical compound OCC1OC(OC2C(C(OC(C(O)CO)C(O)C(O)C=O)OC(CO)C2O)O)C(NC(=O)C)C(O)C1OC(C(C(O)C1O)O)OC1COC1(C(O)=O)CC(O)C(NC(C)=O)C(C(O)C(O)CO)O1 SXMGGNXBTZBGLU-UHFFFAOYSA-N 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 239000001488 sodium phosphate Substances 0.000 description 1
- 229910000162 sodium phosphate Inorganic materials 0.000 description 1
- 238000003756 stirring Methods 0.000 description 1
- 229960005322 streptomycin Drugs 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 150000004044 tetrasaccharides Chemical class 0.000 description 1
- 108060008226 thioredoxin Proteins 0.000 description 1
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 1
- 239000011573 trace mineral Substances 0.000 description 1
- 235000013619 trace mineral Nutrition 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 108091006107 transcriptional repressors Proteins 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- IEDVJHCEMCRBQM-UHFFFAOYSA-N trimethoprim Chemical compound COC1=C(OC)C(OC)=CC(CC=2C(=NC(N)=NC=2)N)=C1 IEDVJHCEMCRBQM-UHFFFAOYSA-N 0.000 description 1
- 229960001082 trimethoprim Drugs 0.000 description 1
- 108700004896 tripeptide FEG Proteins 0.000 description 1
- 150000004043 trisaccharides Chemical class 0.000 description 1
- RYFMWSXOAZQYPI-UHFFFAOYSA-K trisodium phosphate Chemical compound [Na+].[Na+].[Na+].[O-]P([O-])([O-])=O RYFMWSXOAZQYPI-UHFFFAOYSA-K 0.000 description 1
- 108010058119 tryptophyl-glycyl-glycine Proteins 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 108010072644 valyl-alanyl-prolyl-glycine Proteins 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- 230000035899 viability Effects 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 239000003643 water by type Substances 0.000 description 1
- 108010027345 wheylin-1 peptide Proteins 0.000 description 1
Description
The present invention relates to the production of oligosaccharides by microbial fermentation. More specifically, the present invention relates to the use of glycosidases to improve the production of target oligosaccharides by microbial fermentation.
Prior Art
Human breast milk contains a unique mixture of different oligosaccharides called Human Milk Oligosaccharides (HMOs). To date, more than 150 structurally different oligosaccharides have been identified in human breast milk. With very few exceptions, HMOs are characterized by a lactose moiety at their reducing end, and many HMOs contain a fucose residue and/or an N-acetylneuraminic acid residue at their non-reducing end. Typically, the monosaccharide residues of HMOs are derived from D-glucose, D-galactose, N-acetylglucosamine, L-fucose and N-acetylneuraminic acid. The importance of HMOs for infant nutrition is directly related to their unique biological activities, including protecting the newborn from pathogens and supporting the development of the infant's immune system and cognitive abilities. Thus, there is great interest in producing HMOs on a commercial scale.
In addition to the chemical synthesis of individual HMOs, significant progress has been made in developing the production of HMOs by microbial fermentation using genetically modified microorganisms that overexpress a heterologous glycosyltransferase. When such microorganisms are cultivated in a medium and under conditions that are permissive for the microorganisms to express said heterologous glycosyltransferase, an HMO can be produced by said microorganism and isolated from the culture medium or cell lysate.
However, glycosyltransferases often have enzymatic side activities, so that their overexpression to produce the target oligosaccharide usually results in undesirable by-products. Typically, these by-products are also oligosaccharides, but they must be removed from the target oligosaccharide preparation before the product can be used commercially. Removing such by-products from the target oligosaccharide is difficult and laborious. One approach to removing such by-products involves the use of glycosidases, which are either added exogenously to the reaction mixture/cell medium containing the desired and undesired oligosaccharides, or are produced by a genetically engineered microorganism upon induction at a specific point in time at the end of the fermentation process for producing the target oligosaccharide.
International Publication WO 2015/032412 A1 relates to the use of fucose and discloses a method in which a genetically modified cell expressing a heterologous fucosyltransferase is cultured in the presence of lactose to produce and secrete a mixture of 2'-fucosyllactose (2'-FL) and difucosyllactose (DFL) into the extracellular space of the culture medium in high yield. The saccharides are separated and hydrolyzed by acid or by a fucosidase to obtain fucose in high yield.
International Publication No. WO 2104/090261 A1 discloses a method for forming a mixture containing at least one of 2'-FL and 3-fucosyllactose (3-FL), wherein DFL undergoes partial hydrolysis, for example enzymatic hydrolysis or acid hydrolysis. In enzymatic hydrolysis, DFL is exposed to a fucosidase that can release one of the fucose residues from DFL. DFL (10 mM) was incubated with 1,2-α-L-fucosidase from Xanthomonas manihotis at 37°C in incubation buffer, and DFL hydrolysis was monitored by HPLC (high-performance liquid chromatography). After 18 hours, DFL had been partially hydrolyzed to 3-FL and fucose. No lactose was detected.
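The linkage-specific hydrolysis described in this paragraph can be illustrated with a minimal sketch. It assumes the standard structures of the three fucosylated lactoses (2'-FL carries an α1,2-linked fucose, 3-FL an α1,3-linked fucose, and DFL both), which are not spelled out in the text above. The names `NAME_BY_LINKAGES`, `LINKAGES_BY_NAME` and `hydrolyze`, and the linkage labels `"a1-2"` and `"a1-3"`, are hypothetical and serve illustration only; the sketch models substrate specificity, not reaction kinetics or yields.

```python
# Toy model (illustrative assumption, not taken from the cited publication):
# each compound is a lactose core carrying zero, one, or two fucose residues,
# identified by the type of fucosidic linkage.
NAME_BY_LINKAGES = {
    frozenset(): "lactose",
    frozenset({"a1-2"}): "2'-FL",
    frozenset({"a1-3"}): "3-FL",
    frozenset({"a1-2", "a1-3"}): "DFL",
}
LINKAGES_BY_NAME = {name: links for links, name in NAME_BY_LINKAGES.items()}


def hydrolyze(substrate, specificity):
    """Apply a linkage-specific fucosidase to one substrate molecule.

    Returns (product name, number of fucose residues released).
    The substrate is returned unchanged if it lacks the targeted linkage.
    """
    links = LINKAGES_BY_NAME[substrate]
    if specificity not in links:
        return substrate, 0
    return NAME_BY_LINKAGES[links - {specificity}], 1


if __name__ == "__main__":
    # A 1,2-alpha-L-fucosidase removes only the a1-2 fucose from DFL.
    product, fucose = hydrolyze("DFL", "a1-2")
    print(product, fucose)  # prints: 3-FL 1
    # The resulting 3-FL has no a1-2 linkage, so it is not cleaved further.
    print(hydrolyze(product, "a1-2"))  # prints: ('3-FL', 0)
```

Under these assumptions, the model is consistent with the reported observation that 3-FL and fucose were formed while no lactose was detected: the remaining α1,3-linked fucose is not a substrate for a 1,2-specific fucosidase.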
European Patent Application No. EP 2845905 A1 relates to the production of oligosaccharides and discloses the use of one or more glycosidases in a process for the production and/or purification of an oligosaccharide. The method comprises a) cultivating a host microorganism suitable for the production of the target oligosaccharide under conditions and in a medium that are permissive for the production of said target oligosaccharide, whereby the oligosaccharide and, where applicable, biosynthetic saccharide intermediates and/or by-products are produced; b) using a glycosidase in the medium in which the host microorganism is cultivated in order to degrade biosynthetic saccharide intermediates and/or saccharide by-products and/or unused saccharide substrates; and c) isolating the target oligosaccharide. In one embodiment, said glycosidase is endogenously produced in the host microorganism, wherein the glycosidase is one that does not naturally occur in the host microorganism, and wherein expression of said glycosidase in said host microorganism is inducible, such that the expression can be initiated after a sufficient and/or substantially maximum amount of the target oligosaccharide has been produced during cultivation of the host microorganism.
In summary, the prior art discloses the use of glycosidases to remove non-target oligosaccharides from a mixture of target and non-target oligosaccharides by hydrolyzing the non-target oligosaccharides in the reaction mixture/cell medium. However, these approaches involve the biosynthesis of non-target oligosaccharides by the microorganism, which consumes substrates and energy, and they require the degradation products of the non-target oligosaccharides to be removed from the target oligosaccharides.
Thus, the object of the present invention was to provide a method for producing a target oligosaccharide by microbial fermentation without the concomitant production/accumulation of non-target saccharide by-products, namely non-target oligosaccharides, in the cell medium containing the microorganism to be fermented.
The object is achieved by providing a genetically engineered microbial host cell capable of producing the target oligosaccharide, wherein said microbial host cell expresses a heterologous glycosidase that is capable of intracellularly degrading metabolic by-products formed during the intracellular biosynthesis of the target oligosaccharide, thereby preventing the formation of a mixture of target and non-target saccharides in the culture medium. These degradation products can then be used in the metabolism of the microbial host cell, for example for the biosynthesis of the target oligosaccharide.
Table 1 provides a detailed overview of the target oligosaccharides and of the possible precursors that are added and/or the non-target saccharide by-products that are formed during the production of the target oligosaccharide.
Summary of the Invention
In a first aspect, a method for producing a target oligosaccharide is disclosed that uses a genetically engineered microbial host cell capable of producing the target oligosaccharide, wherein said microbial host cell expresses a heterologous glycosidase capable of intracellularly degrading the saccharide by-products of metabolism that are formed during the intracellular biosynthesis of the target oligosaccharide.
Во втором аспекте раскрыта генетически сконструированная микробная клетка-хозяин для получения целевого олигосахарида, где указанная микробная клетка-хозяин способна продуцировать целевой олигосахарид, и где указанная микробная клетка-хозяин была генетически сконструирована с возможностью экспрессии гетерологичной гликозидазы, которая может осуществлять внутриклеточную деградацию побочных продуктов метаболизма - сахаридов, которые образуются в процессе внутриклеточного биосинтеза целевого олигосахарида.A second aspect discloses a genetically engineered microbial host cell for producing a target oligosaccharide, wherein said microbial host cell is capable of producing the target oligosaccharide, and wherein said microbial host cell has been genetically engineered to express a heterologous glycosidase that can perform intracellular degradation of metabolic byproducts - saccharides that are formed during the intracellular biosynthesis of the target oligosaccharide.
В третьем аспекте раскрыто применение генетически сконструированной микробной клетки-хозяина согласно второму аспекту для получения целевого олигосахарида.The third aspect discloses the use of a genetically engineered microbial host cell according to the second aspect to produce a target oligosaccharide.
В четвертом аспекте раскрыты олигосахариды, а именно желательные олигосахариды, которые получают способом согласно первому аспекту и/или посредством использования генетически сконструированной микробной клетки-хозяина согласно второму аспекту.In a fourth aspect, oligosaccharides are disclosed, namely, desired oligosaccharides that are produced by the method of the first aspect and/or by using a genetically engineered microbial host cell according to the second aspect.
В пятом аспекте раскрыто применение целевых олигосахаридов согласно четвертому аспекту для получения питательной композиции.The fifth aspect discloses the use of the target oligosaccharides according to the fourth aspect to produce a nutritional composition.
В шестом аспекте раскрыты питательные композиции, содержащие целевой олигосахарид согласно четвертому аспекту.The sixth aspect discloses nutritional compositions containing the target oligosaccharide according to the fourth aspect.
Brief description of the drawings
Fig. 1 shows a schematic representation of an embodiment of a microbial host cell expressing a heterologous glycosidase (e.g., an alpha-1,3-fucosidase) that can degrade saccharide by-products of metabolism (e.g., 3-fucosyllactose and 2'3-difucosyllactose) that are formed during intracellular biosynthesis of the target oligosaccharide (2'-fucosyllactose), and wherein the microbial host cell can reuse the degradation products (e.g., fucose and lactose) resulting from the enzymatic activity of said glycosidase for production of the target oligosaccharide.
Detailed description
According to the first aspect, a method for producing a target oligosaccharide using a genetically engineered microbial host cell is provided, the method comprising the following steps:
(i) providing a genetically engineered microbial host cell that is capable of producing the target oligosaccharide, wherein the microbial host cell has been genetically engineered to express a heterologous glycosidase that can intracellularly degrade the saccharide by-products of metabolism that are formed during intracellular biosynthesis of the target oligosaccharide, and wherein the microbial host cell is capable of reusing the degradation products resulting from the enzymatic activity of said glycosidase;
(ii) culturing the genetically engineered microbial host cell under conditions and in a medium that are permissive for production of the target oligosaccharide, thereby producing the target oligosaccharide; and
(iii) optionally isolating the target oligosaccharide.
The term "target", as used herein with respect to oligosaccharides, refers to an oligosaccharide that is intended to be produced by the microbial host cell. The term "target" is used to distinguish the oligosaccharide whose production is intended from other oligosaccharides that the microbial host cell may produce. These other oligosaccharides are considered "undesired", regardless of whether or not they have a biological function, whether or not they participate in the biosynthesis of other cellular compounds such as glycolipids, glycoproteins or polysaccharides, and whether or not they are saccharide products of metabolism that are formed during intracellular biosynthesis of the target oligosaccharide, either through side (non-target) enzymatic activities of one or more enzymes involved in the biosynthesis of the target oligosaccharide, or through the enzymatic activity of one or more enzymes that are not directly involved in the biosynthesis of the target oligosaccharide but use as a substrate an oligosaccharide that is formed as an intermediate in the metabolic pathway leading to the target oligosaccharide.
The term "oligosaccharide", as used herein, refers to a saccharide molecule consisting of three to twenty monosaccharide residues, wherein each of said monosaccharide residues is linked to at least one other of said monosaccharide residues by a glycosidic bond. The oligosaccharide may be a linear chain of monosaccharide residues or a branched chain of monosaccharide residues.
In an additional and/or alternative embodiment, the target oligosaccharide is a human milk oligosaccharide (HMO).
In an additional and/or alternative embodiment, the target oligosaccharide is an HMO selected from the group consisting of 2'-fucosyllactose (2'-FL), 3-fucosyllactose (3-FL), 2'3-difucosyllactose (DFL), lacto-N-triose II, lacto-N-tetraose (LNT), lacto-N-neotetraose (LNnT), lacto-N-fucopentaose I (LNFP-I), lacto-N-neofucopentaose I (LNnFP-I), lacto-N-fucopentaose II (LNFP-II), lacto-N-fucopentaose III (LNFP-III), lacto-N-fucopentaose V (LNFP-V), lacto-N-neofucopentaose V (LNnFP-V), lacto-N-difucohexaose I, lacto-N-difucosylhexaose II, para-lacto-N-fucosylhexaose, fucosyl-lacto-N-sialylpentaose b, fucosyl-lacto-N-sialylpentaose c, disialyl-lacto-N-fucopentaose, 3-fucosyl-3'-sialyllactose, 3-fucosyl-6'-sialyllactose, lacto-N-neodifucohexaose I, 3'-sialyllactose (3'-SL), 6'-sialyllactose (6'-SL), sialyllacto-N-tetraose a (LST-a), sialyllacto-N-tetraose b (LST-b), sialyllacto-N-tetraose c (LST-c) and disialyllacto-N-tetraose.
The method comprises providing a genetically engineered microbial host cell that is capable of producing the target oligosaccharide.
The term "genetically engineered", as used herein, refers to the modification of the genetic makeup of a cell using molecular biology methods. Modification of the genetic makeup of a cell may include the transfer of genes within and/or across species boundaries, and the insertion, deletion, substitution and/or modification of nucleotides, triplets, genes, open reading frames, promoters, enhancers, terminators and other nucleotide sequences that mediate and/or control gene expression. Modification of the genetic makeup of a cell aims to create a genetically modified organism having particular, desired properties. A genetically engineered microbial host cell may contain one or more genes that are absent from the native (not genetically engineered) form of the cell. Techniques for introducing exogenous nucleic acid molecules and/or inserting exogenous nucleic acid molecules (recombinant, heterologous) into the hereditary information of a cell in order to insert, delete or alter the nucleotide sequence of the cell's genetic information are known to the skilled person. Genetically engineered cells may contain one or more genes that are present in the native form of the cell, wherein said genes are modified and reintroduced into the cell by artificial means. The term "genetically engineered" also encompasses cells that contain a nucleic acid molecule that is endogenous to the cell and that has been modified without removing the nucleic acid molecule from the cell. Such modifications include those obtained by gene replacement, site-specific mutation, and related techniques, including those commonly referred to as "genome editing".
The genetically engineered microbial host cell may be a prokaryotic cell or a eukaryotic cell. Suitable microbial host cells include yeast cells, bacterial cells, archaebacterial cells and fungal cells.
In an additional and/or alternative embodiment, the prokaryotic cell is a bacterial cell, preferably a bacterial cell selected from bacteria of a genus selected from the group consisting of Bacillus, Bifidobacterium, Clostridium, Corynebacterium, Enterococcus, Lactobacillus, Lactococcus, Micrococcus, Micromonospora, Pseudomonas, Rhodococcus and Sporolactobacillus. Suitable bacterial species are Bacillus subtilis, B. licheniformis, B. coagulans, B. thermophilus, B. laterosporus, B. megaterium, B. mycoides, B. pumilus, B. lentus, B. cereus, B. circulans, Bifidobacterium longum, B. infantis, B. bifidum, Citrobacter freundii, Clostridium cellulolyticum, C. ljungdahlii, C. autoethanogenum, C. acetobutylicum, Corynebacterium glutamicum, Enterococcus faecium, E. thermophilus, Escherichia coli, Erwinia herbicola (Pantoea agglomerans), Lactobacillus acidophilus, L. salivarius, L. plantarum, L. helveticus, L. delbrueckii, L. rhamnosus, L. bulgaricus, L. crispatus, L. gasseri, L. casei, L. reuteri, L. jensenii, L. lactis, Pantoea citrea, Pectobacterium carotovorum, Propionibacterium freudenreichii, Pseudomonas fluorescens, P. aeruginosa, Streptococcus thermophilus and Xanthomonas campestris.
In an additional and/or alternative embodiment, the eukaryotic cell is a yeast cell, preferably a yeast cell selected from the group consisting of Saccharomyces sp., in particular Saccharomyces cerevisiae, Saccharomycopsis sp., Pichia sp., in particular Pichia pastoris, Hansenula sp., Kluyveromyces sp., Yarrowia sp., Rhodotorula sp. and Schizosaccharomyces sp.
The genetically engineered microbial host cell is able to produce the target oligosaccharide. The term "able to produce", as used herein, refers to the ability of the genetically engineered microbial host cell to produce the target oligosaccharide, provided that the microbial host cell is cultured under conditions and in a medium that are permissive for production of the target oligosaccharide by the microbial host cell. Thus, the medium must have a pH within a certain range and a suitable composition of ions and nutrients, and must contain the compounds required to maintain the viability and metabolic activity of the microbial host cell. Where required for production of the target oligosaccharide, the medium must also contain sufficient amounts of any precursor needed for biosynthesis of the target oligosaccharide by the microbial host cell. Likewise, the conditions (e.g., temperature, pH, oxygen supply, agitation, nutrient supply, etc.) for culturing the microbial host cell for production of the target oligosaccharide must be maintained such that the microbial host cell can be or remain metabolically active in order to produce the target oligosaccharide.
In an additional and/or alternative embodiment, the genetically engineered microbial host cell capable of producing the target oligosaccharide is a microbial host cell that has been genetically engineered to have the ability to produce the target oligosaccharide. In an additional and/or alternative embodiment, the genetically engineered microbial host cell has been genetically engineered to express a heterologous glycosyltransferase. The heterologous glycosidase is expressed in the genetically engineered microbial host cell during fermentation, namely during production or biosynthesis of the target oligosaccharide. In an additional and/or alternative embodiment, expression of the heterologous glycosidase is constitutive in the genetically engineered microbial host.
The term "heterologous", as used herein, refers to a nucleotide sequence, nucleic acid molecule or polypeptide that is foreign to a cell or organism, namely a nucleotide sequence, nucleic acid molecule or polypeptide that does not naturally occur in said cell or organism. A "heterologous sequence", "heterologous nucleic acid" or "heterologous polypeptide", as used herein, is a sequence, nucleic acid or polypeptide that originates from a source foreign to the particular host cell (e.g., from a different species) or, if from the same source, is modified from its original form. Thus, a heterologous nucleic acid operably linked to a promoter originates from a source different from that of the promoter or, if from the same source, is modified from its original form. The heterologous sequence may be stably introduced into the genome of the microbial host cell, for example by transfection, transformation, conjugation or transduction, thereby yielding a genetically modified host cell. The technique applied will depend on the host cell and on the sequence to be introduced. Various techniques are known to the person skilled in the art and are disclosed, for example, in Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989). Accordingly, a "heterologous polypeptide" is a polypeptide that does not naturally occur in the wild-type cell from which the genetically engineered cell is derived, and a "heterologous glycosyltransferase" is a glycosyltransferase that does not naturally occur in the wild-type cell from which the genetically engineered cell is derived.
In an additional and/or alternative embodiment, the heterologous glycosyltransferase is selected from the group consisting of fucosyltransferases, preferably α-1,2-fucosyltransferases and α-1,3-fucosyltransferases, glucosyltransferases, galactosyltransferases, preferably β-1,3-galactosyltransferases and β-1,4-galactosyltransferases, sialyltransferases, preferably α-2,3-sialyltransferases and α-2,6-sialyltransferases, and N-acetylglucosaminyltransferases.
Fucosyltransferases catalyze the transfer of fucose residues from the donor, guanosine diphosphate-activated L-fucose (GDP-fucose), to several acceptor molecules. Fucosyltransferases are expressed in animals, plants, fungi and bacteria, and are categorized according to the linkage by which fucose is attached to the acceptor substrate. Accordingly, α-1,2-, α-1,3/4- and α-1,6-fucosyltransferases are distinguished from one another. Suitable fucosyltransferases for heterologous expression in the genetically engineered microbial host cell are disclosed, for example, in European patent application No. 17180176.
Sialyltransferases catalyze the transfer of N-acetylneuraminic acid (Neu5Ac) residues from the donor CMP-Neu5Ac to acceptor molecules. Sialyltransferases have been found to be expressed in animals, plants, fungi and bacteria. Sialyltransferases are categorized according to the linkage that is formed between Neu5Ac and the acceptor molecule. Accordingly, α-2,3-, α-2,6- and α-2,8-sialyltransferases are distinguished from one another. Suitable sialyltransferases for heterologous expression in the genetically engineered microbial host cell are disclosed, for example, in European patent application No. 17183391.
Galactosyltransferases catalyze the transfer of a galactose residue from the donor, UDP (uridine diphosphate)-galactose, to acceptor substrates. Galactosyltransferases are distinguished on the basis of the linkage that is formed between galactose and the acceptor molecule. Accordingly, β-1,3- and β-1,4-galactosyltransferases are distinguished from one another. A suitable β-1,3-galactosyltransferase for heterologous expression in the genetically engineered microbial host cell is encoded by the wbdO gene of Salmonella enterica. A suitable β-1,4-galactosyltransferase for heterologous expression in the genetically engineered microbial host cell is encoded by the lex1 gene of Aggregatibacter aphrophilus.
The genetically engineered microbial host cell has been genetically engineered to express a heterologous glycosidase that can intracellularly degrade the saccharide by-products of metabolism that are formed during intracellular biosynthesis of the target oligosaccharide. Suitable glycosidases are glycosidases that are specific with respect to the glycosidic bond that is hydrolyzed by their enzymatic activity and/or with respect to the substrate that is hydrolyzed by the glycosidase. Owing to this specificity, the glycosidase hydrolyzes the undesired by-products but not the target oligosaccharide to be produced. In an additional and/or alternative embodiment, the glycosidase does not hydrolyze one or more precursors that are taken up or synthesized by the microbial host cell for production of the target oligosaccharide. Preferably, the glycosidase is an exoglycosidase.
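The selection criterion described above can be illustrated with a minimal Python sketch. It uses the Fig. 1 scenario (target 2'-fucosyllactose, by-products 3-fucosyllactose and 2'3-difucosyllactose, precursor lactose). The data model, in which each saccharide is reduced to the set of fucose linkages it carries on a lactose core, and the names `FUCOSE_LINKAGES`, `is_substrate` and `glycosidase_is_suitable` are assumptions made purely for illustration and are not part of the disclosure.

```python
# Hypothetical, simplified model: each saccharide is described only by the
# fucose linkages it carries on a lactose core.
FUCOSE_LINKAGES = {
    "lactose": frozenset(),
    "2'-FL": frozenset({"alpha-1,2"}),
    "3-FL": frozenset({"alpha-1,3"}),
    "2'3-DFL": frozenset({"alpha-1,2", "alpha-1,3"}),
}


def is_substrate(saccharide, linkage):
    """A linkage-specific fucosidase acts on any saccharide carrying that linkage."""
    return linkage in FUCOSE_LINKAGES[saccharide]


def glycosidase_is_suitable(linkage, target, precursors, by_products):
    """Check the criterion from the text: the glycosidase must hydrolyze the
    by-products, but neither the target nor (in the stricter embodiment)
    the precursors."""
    spares_target = not is_substrate(target, linkage)
    spares_precursors = not any(is_substrate(p, linkage) for p in precursors)
    degrades_by_products = all(is_substrate(b, linkage) for b in by_products)
    return spares_target and spares_precursors and degrades_by_products


# Fig. 1 scenario: an alpha-1,3-fucosidase fits 2'-FL production,
# whereas an alpha-1,2-fucosidase would attack the target itself.
print(glycosidase_is_suitable("alpha-1,3", "2'-FL", ["lactose"], ["3-FL", "2'3-DFL"]))
print(glycosidase_is_suitable("alpha-1,2", "2'-FL", ["lactose"], ["3-FL", "2'3-DFL"]))
```

The sketch deliberately ignores kinetics, expression levels and transport; it only encodes the linkage-specificity argument.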
Экзогликозидазы представляют собой ферменты гликозид гидролазы, которые разрушают гликозидные связи на концевом остатке структуры олигосахарида.Exoglycosidases are glycoside hydrolase enzymes that break down glycosidic bonds at the terminal residue of the oligosaccharide structure.
In a further and/or alternative embodiment, the heterologous glycosidase is selected from the group consisting of fucosidases, including α-1,2-fucosidases and α-1,3-fucosidases; sialidases, such as α-2,3-sialidases, α-2,6-sialidases and α-2,8-sialidases; galactosidases, such as β-1,3-galactosidases, β-1,4-galactosidases and β-1,6-galactosidases; β-N-acetylhexosaminidases; and glucosidases, such as β-1,3-glucosidases.
A suitable fucosidase is an α-1,2-fucosidase. An α-1,2-fucosidase is a highly specific exoglycosidase that catalyzes the hydrolysis of linear α-1,2-linked L-fucopyranosyl residues from oligosaccharides. A preferred α-1,2-fucosidase is AfcA of Bifidobacterium bifidum (SEQ ID NO: 2).
In a further and/or alternative embodiment, a genetically engineered microbial host cell is provided that is capable of producing 3-FL, wherein said genetically engineered microbial host cell expresses an α-1,2-fucosidase. To be able to produce 3-FL, the genetically engineered microbial host cell expresses an α-1,3-fucosyltransferase. Said α-1,3-fucosyltransferase can transfer a fucose residue from GDP-fucose to the glucose moiety of lactose as the acceptor substrate, thereby synthesizing 3-FL as the target oligosaccharide. 2'-FL and 2'3-DFL are undesired saccharide by-products in the production of 3-FL.
By expressing a heterologous α-1,2-fucosidase in a genetically engineered microbial host cell that is capable of producing 3-FL, the formation of the by-products 2'-FL and 2'3-DFL can be abolished or at least reduced, because these by-products are hydrolyzed within the genetically engineered microbial host cell by the heterologous α-1,2-fucosidase. The resulting degradation products are fucose and lactose. Both fucose and lactose can be used by the genetically engineered microbial host cell to produce the target 3-FL.
In a further and/or alternative embodiment, the genetically engineered microbial host cell is genetically engineered to express an α-1,2-fucosidase. In a further and/or alternative embodiment, the genetically engineered microbial host cell is genetically engineered to contain a nucleic acid molecule comprising a nucleotide sequence that encodes an α-1,2-fucosidase, for expression thereof. Preferably, the nucleotide sequence encoding the α-1,2-fucosidase is a nucleotide sequence selected from the group consisting of:
- the nucleotide sequence as represented by SEQ ID NO: 1;
- nucleotide sequences that are complementary to a nucleotide sequence that hybridizes to the nucleotide sequence as represented by SEQ ID NO: 1 under stringent conditions;
- nucleotide sequences having a sequence identity of at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% to the nucleotide sequence as represented by SEQ ID NO: 1;
- nucleotide sequences encoding a polypeptide having the amino acid sequence as represented by SEQ ID NO: 2; and
- nucleotide sequences encoding a functional variant of the polypeptide as represented by SEQ ID NO: 2, wherein the amino acid sequence of the functional variant has a sequence identity of at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% to the amino acid sequence as represented by SEQ ID NO: 2.
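The identity thresholds in the list above can be illustrated with a short calculation. The following is a minimal sketch only: the text does not state which alignment method or identity definition is meant, so the sketch assumes two sequences that have already been aligned (gaps written as "-") and assumes identity is counted over all aligned columns. The function names and the two short example sequences are hypothetical and are not taken from SEQ ID NO: 1.

```python
# Identity thresholds named in the list above (percent).
THRESHOLDS = (70, 75, 80, 85, 90, 95, 96, 97, 98, 99)


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity of two pre-aligned sequences.

    Assumption: the denominator is the number of aligned columns,
    excluding columns in which both sequences carry a gap.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a, aligned_b)
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def highest_threshold_met(identity: float):
    """Return the largest listed threshold that the identity reaches, or None."""
    met = [t for t in THRESHOLDS if identity >= t]
    return max(met) if met else None


# Hypothetical 20-nucleotide example differing at two positions.
reference = "ATGGCTAAAGGTACCGTTAG"
candidate = "ATGGCTAAGGGTACCGTAAG"

identity = percent_identity(reference, candidate)
print(identity, highest_threshold_met(identity))
```

In practice the alignment itself would come from a dedicated alignment tool; the sketch only shows how a computed identity value maps onto the listed thresholds.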
The term "hybridizes" or "hybridization" as used herein means hybridization under conventional conditions as described in Sambrook et al. (1989) "Molecular Cloning, A Laboratory Manual" (Cold Spring Harbor Laboratory Press, New York), preferably under stringent conditions. Stringent hybridization conditions are, for example: hybridization in 4 × SSC at 65°C followed by multiple washes in 0.1 × SSC at 65°C for a total of approximately 1 hour. Less stringent hybridization conditions are, for example: hybridization in 4 × SSC at 37°C followed by multiple washes in 1 × SSC at room temperature (approximately 21°C). "Stringent hybridization conditions" may also mean: hybridization at 68°C in 0.25 M sodium phosphate, pH 7.2, 7% SDS (sodium dodecyl sulfate), 1 mM EDTA (ethylenediaminetetraacetic acid) and 1% BSA (bovine serum albumin) for 16 hours, followed by two washes with 2 × SSC and 0.1% SDS at 68°C.
For expression of the nucleotide sequence encoding the α-1,2-fucosidase or a functional variant thereof, said nucleotide sequence is operably linked to expression control sequences that mediate expression of the nucleotide sequence encoding the α-1,2-fucosidase or its functional variant in the genetically engineered microbial host cell.
"Expression control sequences" are regulatory nucleotide sequences that are not part of the protein-coding nucleotide sequence but mediate expression of the protein-coding nucleotide sequence. Regulatory element nucleotide sequences include promoters, cis-regulatory elements, enhancers, introns and terminators. Depending on the type of regulatory element, it is located on the nucleic acid molecule in front of the protein-coding nucleotide sequence (i.e. 5' of it) or behind the protein-coding nucleotide sequence (i.e. 3' of it). The regulatory elements are functional in the microbial host cell.
The term "operably linked" means that a regulatory element is connected to the protein-coding nucleotide sequence in such a way, i.e. is positioned relative to the protein-coding nucleotide sequence on, for example, a nucleic acid molecule in such a way, that expression of the protein-coding nucleotide sequence under the control of the regulatory element can occur in a living cell.
For the purposes of the present invention, a "promoter" is a nucleotide sequence that regulates the expression of a gene, that is usually located at the 5' end of the gene, and that mediates the initiation of transcription by RNA polymerase through interaction with specific DNA-binding proteins.
In addition, suitable promoters include synthetic promoters. These are promoters that have been created by molecular biology techniques and that do not occur in nature in the given configuration. A synthetic promoter is a minimalistic promoter that contains, in addition to a minimal promoter, only one or more selected, defined cis elements. These cis elements are binding sites for DNA-binding proteins such as transcription factors; they are isolated from natural promoters, derived from previously isolated cis elements, or produced by random recombination techniques and selected by appropriate methods. Compared with a natural promoter, a synthetic promoter, owing to its less complex construction, is activated by only a few exogenous and endogenous factors and is therefore regulated more specifically.
A "minimal promoter" or "core promoter" is a nucleotide sequence that contains the binding sites for the basal transcription factor complex and permits accurate initiation of transcription by RNA polymerase II. Characteristic sequence motifs of the minimal promoter are the TATA box, the initiator element (Inr), the "TFIIB recognition element" (BRE) and the "downstream core promoter element" (DPE). In a minimal promoter, these elements may occur alone or in combination. The minimal promoter or its sequence motifs are obtainable, for example, from a gene of a bacterium, a fungus or a virus.
"Cis elements" are nucleotide sequences that are located on the same nucleic acid molecule as the protein-coding nucleotide sequence to be expressed. Cis elements need not encode RNA or protein and, in the direction of transcription, may be located upstream or downstream of the protein-coding nucleotide sequence to be expressed. Cis elements located upstream of the protein-coding nucleotide sequence to be expressed often provide the necessary binding motifs, in particular for transcription factors, which, as trans-acting elements (from Latin trans, "beyond"), take part at the molecular level in regulating transcription of the gene from the other side. Cis elements that lead to inhibition of transcription are called silencers. Cis elements that lead to enhancement of transcription are called enhancers. The totality of cis/trans activities at the promoter determines the intensity with which RNA polymerase carries out transcription.
Furthermore, the promoter may be a chimeric promoter and/or a promoter that has been modified with cis elements. Modification of a promoter may also mean the additional incorporation of a cis element into a promoter that, for example, already naturally contains a cis element. In addition, modification also includes multimerization of a cis element, in particular multimerization of a naturally occurring cis element. Compared with the native version, such a modified promoter may, for example, have altered properties with regard to specificity, expression level or background activity.
Terminators are nucleotide sequences on the DNA that usually mark the end of a gene and lead to termination of transcription.
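The relative arrangement of promoter, protein-coding sequence and terminator described above can be pictured with a small data structure. This is an illustrative sketch under stated assumptions, not part of the disclosure: the element names ("P_example", "fucosidase_cds", "T_example") and all coordinates are hypothetical, and correct positioning is treated only as a necessary layout check, not as proof that expression actually occurs in a living cell.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Element:
    name: str
    kind: str   # "promoter", "cds" or "terminator"
    start: int  # 0-based position on the coding strand, read 5' to 3'
    end: int    # exclusive end position


def is_operably_arranged(elements) -> bool:
    """Check that one promoter lies 5' of one CDS and one terminator lies 3' of it."""
    by_kind = {}
    for element in elements:
        by_kind.setdefault(element.kind, []).append(element)
    required = ("promoter", "cds", "terminator")
    if any(len(by_kind.get(kind, [])) != 1 for kind in required):
        return False
    promoter, cds, terminator = (by_kind[kind][0] for kind in required)
    # Promoter must end before the CDS starts; terminator must start after it ends.
    return promoter.end <= cds.start and cds.end <= terminator.start


# Hypothetical expression cassette with made-up coordinates.
cassette = [
    Element("P_example", "promoter", 0, 120),
    Element("fucosidase_cds", "cds", 150, 1650),
    Element("T_example", "terminator", 1700, 1760),
]

# Same elements, but with the promoter placed 3' of the coding sequence.
misplaced = [
    Element("P_example", "promoter", 1700, 1820),
    Element("fucosidase_cds", "cds", 150, 1650),
    Element("T_example", "terminator", 1900, 1960),
]

print(is_operably_arranged(cassette), is_operably_arranged(misplaced))
```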
Another suitable fucosidase is an α-1,3-fucosidase. An α-1,3-fucosidase is a highly specific glycosidase that catalyzes the hydrolysis of α-1,3-linked L-fucopyranosyl residues from oligosaccharides. A preferred α-1,3-fucosidase is AfcB of Bifidobacterium bifidum (SEQ ID NO: 4).
In a further and/or alternative embodiment, a genetically engineered microbial host cell is provided that is capable of producing 2'-FL, wherein said genetically engineered microbial host cell expresses an α-1,3-fucosidase. To be able to produce 2'-FL, the genetically engineered microbial host cell expresses an α-1,2-fucosyltransferase. Said α-1,2-fucosyltransferase can transfer a fucose residue from GDP-fucose to the galactose moiety of lactose as the acceptor substrate, thereby synthesizing 2'-FL as the target oligosaccharide. 3-FL and 2'3-DFL are undesired saccharide by-products in the production of 2'-FL.
By expressing a heterologous α-1,3-fucosidase in a genetically engineered microbial host cell that is capable of producing 2'-FL, the formation of the by-products 3-FL and 2'3-DFL can be abolished or at least reduced, because these by-products are hydrolyzed within the genetically engineered microbial host cell by the heterologous α-1,3-fucosidase. The resulting degradation products are fucose and lactose. Both fucose and lactose can be used by the genetically engineered microbial host cell to produce the target 2'-FL.
In a further and/or alternative embodiment, the genetically engineered microbial host cell is genetically engineered to express an α-1,3-fucosidase. In a further and/or alternative embodiment, the genetically engineered microbial host cell is genetically engineered to contain a nucleic acid molecule comprising a nucleotide sequence that encodes an α-1,3-fucosidase, for expression thereof. Preferably, the nucleotide sequence encoding the α-1,3-fucosidase is a nucleotide sequence selected from the group consisting of:
- the nucleotide sequence as represented by SEQ ID NO: 3;
- nucleotide sequences that are complementary to a nucleotide sequence that hybridizes to the nucleotide sequence as represented by SEQ ID NO: 3 under stringent conditions;
- nucleotide sequences having a sequence identity of at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% to the nucleotide sequence as represented by SEQ ID NO: 3;
- nucleotide sequences encoding a polypeptide having the amino acid sequence as represented by SEQ ID NO: 4; and
- nucleotide sequences encoding a functional variant of the polypeptide as represented by SEQ ID NO: 4, wherein the amino acid sequence of the functional variant has a sequence identity of at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% to the amino acid sequence as represented by SEQ ID NO: 4.
For expression of the nucleotide sequence encoding the α-1,3-fucosidase or a functional variant thereof, said nucleotide sequence is operably linked to expression control sequences that mediate expression of the nucleotide sequence encoding the α-1,3-fucosidase or its functional variant in the genetically engineered microbial host cell.
In a further and/or alternative embodiment, a genetically engineered microbial host cell is provided that is capable of producing LNFP-I, wherein said genetically engineered microbial host cell expresses an α-1,3-fucosidase. To be able to produce LNFP-I, the genetically engineered microbial host cell expresses a β-1,3-N-acetylglucosaminyltransferase, a β-1,3-galactosyltransferase and an α-1,2-fucosyltransferase. Said β-1,3-N-acetylglucosaminyltransferase can transfer a GlcNAc residue from UDP-GlcNAc to the galactose moiety of lactose, thereby synthesizing lacto-N-triose II (LNT-II). Said β-1,3-galactosyltransferase can transfer a galactose residue from UDP-galactose to the GlcNAc moiety of LNT-II, thereby synthesizing lacto-N-tetraose (LNT). Said α-1,2-fucosyltransferase can transfer a fucose residue from GDP-fucose to the terminal galactose moiety of LNT, thereby synthesizing LNFP-I. 3-FL and 2'3-DFL would be undesired by-products in the production of LNFP-I. By expressing an α-1,3-fucosidase in a genetically engineered microbial host cell that is capable of producing LNFP-I, the formation of the by-products 3-FL and 2'3-DFL can be abolished or at least reduced, because these by-products are hydrolyzed by the heterologous α-1,3-fucosidase within the genetically engineered microbial host cell. The resulting degradation products are fucose, lactose and 2'-FL. Fucose and lactose can be used by the genetically engineered microbial host cell to produce the target LNFP-I.
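The linkage specificity described above can be made concrete with a toy model. This is only a sketch under simplifying assumptions: each fucosylated lactose is represented by the set of fucose linkages it carries, the labels "a1,2" and "a1,3" and the function name `hydrolyze` are hypothetical, and a fucosidase is assumed to remove exactly the one linkage type it is specific for and nothing else. The target LNFP-I is not modeled; it carries only an α-1,2-linked fucose and so falls outside the assumed α-1,3 specificity.

```python
from collections import Counter

# Toy representation: a saccharide is lactose plus a set of fucose linkages.
SACCHARIDES = {
    "lactose": frozenset(),
    "2'-FL": frozenset({"a1,2"}),
    "3-FL": frozenset({"a1,3"}),
    "2'3-DFL": frozenset({"a1,2", "a1,3"}),
}
NAME_BY_LINKAGES = {linkages: name for name, linkages in SACCHARIDES.items()}


def hydrolyze(saccharide: str, fucosidase_linkage: str) -> Counter:
    """Return the products after a linkage-specific fucosidase acts once.

    A saccharide that lacks the targeted linkage is returned unchanged.
    """
    linkages = SACCHARIDES[saccharide]
    if fucosidase_linkage not in linkages:
        return Counter({saccharide: 1})
    remaining = linkages - {fucosidase_linkage}
    return Counter({NAME_BY_LINKAGES[remaining]: 1, "fucose": 1})


# An alpha-1,3-fucosidase acting on the two by-products named in the text.
products = Counter()
for byproduct in ("3-FL", "2'3-DFL"):
    products += hydrolyze(byproduct, "a1,3")

print(sorted(products))  # ["2'-FL", 'fucose', 'lactose']
```

Under these assumptions the model reproduces the degradation products listed above for the α-1,3-fucosidase: 3-FL yields fucose and lactose, and 2'3-DFL yields fucose and 2'-FL.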
In a further and/or alternative embodiment, the genetically engineered microbial host cell is genetically engineered to express an α-1,3-fucosidase. In a further and/or alternative embodiment, the genetically engineered microbial host cell is genetically engineered to contain a nucleic acid molecule comprising a nucleotide sequence that encodes an α-1,3-fucosidase, for expression thereof. Preferably, the nucleotide sequence encoding the α-1,3-fucosidase is a nucleotide sequence selected from the group consisting of:
- the nucleotide sequence as represented by SEQ ID NO: 3;
- nucleotide sequences that are complementary to a nucleotide sequence that hybridizes to the nucleotide sequence as represented by SEQ ID NO: 3 under stringent conditions;
- nucleotide sequences having a sequence identity of at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% to the nucleotide sequence as represented by SEQ ID NO: 3;
- nucleotide sequences encoding a polypeptide having the amino acid sequence as represented by SEQ ID NO: 4; and
- nucleotide sequences encoding a functional variant of the polypeptide as represented by SEQ ID NO: 4, wherein the amino acid sequence of the functional variant has a sequence identity of at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% to the amino acid sequence as represented by SEQ ID NO: 4.
For expression of the nucleotide sequence encoding the α-1,3-fucosidase or a functional variant thereof, said nucleotide sequence is operably linked to expression control sequences that mediate expression of the nucleotide sequence encoding the α-1,3-fucosidase or its functional variant in the genetically engineered microbial host cell.
A suitable sialidase is an α-2,3-sialidase. An α-2,3-sialidase is a highly specific exoglycosidase that catalyzes the hydrolysis of linear α-2,3-linked sialyl residues from oligosaccharides. A preferred α-2,3-sialidase is NanB of Streptococcus pneumoniae (SEQ ID NO: 6).
In an additional and/or alternative embodiment, a genetically engineered microbial host cell is provided that is capable of producing 6'-SL, wherein said genetically engineered microbial host cell expresses an α-2,3-sialidase. To be able to produce 6'-SL, the genetically engineered microbial host cell expresses an α-2,6-sialyltransferase. Said α-2,6-sialyltransferase can transfer a Neu5Ac residue from CMP-Neu5Ac to the galactose moiety of lactose as the acceptor substrate, thereby synthesizing 6'-SL. 3'-SL is an undesired by-product in the production of 6'-SL.
By expressing an α-2,3-sialidase in a genetically engineered microbial host cell that can produce 6'-SL, formation of the by-product 3'-SL can be abolished or at least reduced, because this by-product is hydrolyzed by the α-2,3-sialidase in the genetically engineered microbial host cell. The resulting degradation products are N-acetylneuraminic acid and lactose. Both N-acetylneuraminic acid and lactose can be used by the genetically engineered microbial host cell to produce the target 6'-SL.
In an additional and/or alternative embodiment, the genetically engineered microbial host cell has been genetically engineered to express an α-2,3-sialidase. In an additional and/or alternative embodiment, the genetically engineered microbial host cell has been genetically engineered to contain a nucleic acid molecule comprising a nucleotide sequence that encodes an α-2,3-sialidase for its expression. Preferably, the nucleotide sequence encoding the α-2,3-sialidase is a nucleotide sequence selected from the group consisting of:
- the nucleotide sequence as represented by SEQ ID NO: 5;
- nucleotide sequences that are complementary to a nucleotide sequence that hybridizes to the nucleotide sequence as represented by SEQ ID NO: 5 under stringent conditions;
- nucleotide sequences having at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% sequence identity with the nucleotide sequence as represented by SEQ ID NO: 5;
- nucleotide sequences encoding a polypeptide having an amino acid sequence as represented by SEQ ID NO: 6; and
- nucleotide sequences encoding a functional variant of the polypeptide sequence as represented by SEQ ID NO: 6, wherein the amino acid sequence of the functional variant has at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% sequence identity with the amino acid sequence as represented by SEQ ID NO: 6.
For expression of the nucleotide sequence encoding the α-2,3-sialidase or a functional variant thereof, said nucleotide sequence is operably linked to expression control sequences that mediate expression of the nucleotide sequence encoding the α-2,3-sialidase or its functional variant in the genetically engineered microbial host cell.
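The sequence-identity thresholds recited in the lists above (at least 70% up to 99%) can be illustrated with a short calculation. The following is a minimal sketch only. It assumes that the two sequences have already been aligned to equal length with "-" marking gaps, and that identity is counted as identical positions divided by aligned columns. This section does not specify the alignment method or the denominator, so both are assumptions, and the function names and the toy sequences are hypothetical.

```python
# Illustrative sketch: percent identity of two pre-aligned sequences.
# Assumption: identity = identical columns / aligned columns (gap-only columns skipped).

THRESHOLDS = (70, 75, 80, 85, 90, 95, 96, 97, 98, 99)  # values recited in the text


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Return the percent identity of two aligned sequences of equal length."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns in which both sequences carry a gap.
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("no aligned columns to compare")
    # A column counts as a match only if both residues are identical and not gaps.
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def thresholds_met(identity: float) -> list:
    """Return every recited threshold that the given identity value satisfies."""
    return [t for t in THRESHOLDS if identity >= t]


if __name__ == "__main__":
    reference = "MKTAYIAKQR"  # hypothetical toy sequence, not a SEQ ID from the text
    variant = "MKTAYLAKQR"    # differs from the reference at one position
    identity = percent_identity(reference, variant)
    print(identity, thresholds_met(identity))
```

In practice the alignment itself would come from a dedicated alignment tool; the sketch only shows how a computed identity value maps onto the recited thresholds.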
A suitable galactosidase is a β-1,3-galactosidase. A β-1,3-galactosidase is an enzyme that catalyzes the hydrolysis of β-1,3-linked galactose residues from oligosaccharides. A preferred β-1,3-galactosidase is Bga42A of Bifidobacterium longum (SEQ ID NO: 8).
In an additional and/or alternative embodiment, a genetically engineered microbial host cell is provided that can produce LNnT, wherein said genetically engineered microbial host cell expresses a β-1,3-galactosidase. To be able to produce LNnT, the genetically engineered microbial host cell expresses a β-1,3-N-acetylglucosaminyltransferase and a β-1,4-galactosyltransferase. Said β-1,3-N-acetylglucosaminyltransferase can transfer a GlcNAc residue from UDP-GlcNAc to the galactose moiety of lactose, thereby synthesizing LNT-II. Said β-1,4-galactosyltransferase can transfer a galactose residue from UDP-galactose to the GlcNAc moiety of LNT-II, thereby synthesizing LNnT as the target oligosaccharide.
LNT is an undesired by-product in the production of LNnT. By expressing a β-1,3-galactosidase in a genetically engineered microbial host cell capable of producing LNnT, formation of the by-product LNT can be abolished or at least reduced, because this by-product is hydrolyzed in the genetically engineered microbial host cell by the heterologous β-1,3-galactosidase. The resulting degradation products are galactose and LNT-II. Galactose as well as LNT-II can be used by the genetically engineered microbial host cell to produce the target LNnT.
In an additional and/or alternative embodiment, the genetically engineered microbial host cell has been genetically engineered to express a β-1,3-galactosidase. In an additional and/or alternative embodiment, the genetically engineered microbial host cell has been genetically engineered to contain a nucleic acid molecule comprising a nucleotide sequence that encodes a β-1,3-galactosidase for its expression. Preferably, the nucleotide sequence encoding the β-1,3-galactosidase is a nucleotide sequence selected from the group consisting of:
- the nucleotide sequence as represented by SEQ ID NO: 7;
- nucleotide sequences that are complementary to a nucleotide sequence that hybridizes to the nucleotide sequence as represented by SEQ ID NO: 7 under stringent conditions;
- nucleotide sequences having at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% sequence identity with the nucleotide sequence as represented by SEQ ID NO: 7;
- nucleotide sequences encoding a polypeptide having an amino acid sequence as represented by SEQ ID NO: 8; and
- nucleotide sequences encoding a functional variant of the polypeptide sequence as represented by SEQ ID NO: 8, wherein the amino acid sequence of the functional variant has at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% sequence identity with the amino acid sequence as represented by SEQ ID NO: 8.
For expression of the nucleotide sequence encoding the β-1,3-galactosidase or a functional variant thereof, said nucleotide sequence is operably linked to expression control sequences that mediate expression of the nucleotide sequence encoding the β-1,3-galactosidase or its functional variant in the genetically engineered microbial host cell.
Another suitable galactosidase is a galactan β-1,3-galactosidase. A galactan β-1,3-galactosidase is an enzyme that catalyzes the hydrolysis of β-1,3-linked galactose residues from galactose-bearing oligosaccharide chains. A preferred galactan β-1,3-galactosidase is Ct1,3Gal43A of Clostridium thermocellum (SEQ ID NO: 10).
In an additional and/or alternative embodiment, the genetically engineered microbial host cell has been genetically engineered to express a galactan β-1,3-galactosidase. In an additional and/or alternative embodiment, the genetically engineered microbial host cell has been genetically engineered to contain a nucleic acid molecule comprising a nucleotide sequence that encodes a galactan β-1,3-galactosidase for its expression. Preferably, the nucleotide sequence encoding the galactan β-1,3-galactosidase is a nucleotide sequence selected from the group consisting of:
- the nucleotide sequence as represented by SEQ ID NO: 9;
- nucleotide sequences that are complementary to a nucleotide sequence that hybridizes to the nucleotide sequence as represented by SEQ ID NO: 9 under stringent conditions;
- nucleotide sequences having at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% sequence identity with the nucleotide sequence as represented by SEQ ID NO: 9;
- nucleotide sequences encoding a polypeptide having an amino acid sequence as represented by SEQ ID NO: 10; and
- nucleotide sequences encoding a functional variant of the polypeptide sequence as represented by SEQ ID NO: 10, wherein the amino acid sequence of the functional variant has at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% sequence identity with the amino acid sequence as represented by SEQ ID NO: 10.
For expression of the nucleotide sequence encoding the galactan β-1,3-galactosidase or a functional variant thereof, said nucleotide sequence is operably linked to expression control sequences that mediate expression of the nucleotide sequence encoding the galactan β-1,3-galactosidase or its functional variant in the genetically engineered microbial host cell.
A suitable glucosidase is a β-1,3-glucosidase. A β-1,3-glucosidase is a highly specific exoglycosidase that catalyzes the hydrolysis of β-1,3-linked glucose residues from oligosaccharides. A preferred β-1,3-glucosidase is PgIA of Paenibacillus sp. (SEQ ID NO: 12).
In an additional and/or alternative embodiment, a genetically engineered microbial host cell is provided that can produce LNT or LNnT, wherein said genetically engineered microbial host cell expresses a β-1,3-glucosidase and/or a β-1,3-galactosidase. To be able to produce LNT, the genetically engineered microbial host cell expresses a β-1,3-N-acetylglucosaminyltransferase and a β-1,3-galactosyltransferase. Said β-1,3-N-acetylglucosaminyltransferase can transfer a GlcNAc residue from UDP-GlcNAc to the galactose moiety of lactose, thereby synthesizing lacto-N-triose II (LNT-II). Said β-1,3-galactosyltransferase can transfer a galactose residue from UDP-galactose to the GlcNAc moiety of LNT-II, thereby synthesizing lacto-N-tetraose (LNT). To be able to produce LNnT, the genetically engineered microbial host cell expresses a β-1,3-N-acetylglucosaminyltransferase and a β-1,4-galactosyltransferase. Said β-1,3-N-acetylglucosaminyltransferase can synthesize LNT-II. Said β-1,4-galactosyltransferase can transfer a galactose residue from UDP-galactose to the GlcNAc moiety of LNT-II, thereby synthesizing LNnT as the target oligosaccharide.
The skilled artisan is aware that β-1,3-N-acetylglucosaminyltransferases, such as LgtA of Neisseria meningitidis, accept a broad range of donor substrates. While it typically transfers GlcNAc from UDP-GlcNAc to the corresponding acceptor saccharide, LgtA can also use UDP-galactose or UDP-glucose as donor substrates. In a genetically engineered microbial host cell capable of producing LNT or LNnT as described, said β-1,3-N-acetylglucosaminyltransferase can therefore also transfer a galactose residue from UDP-galactose, or a glucose residue from UDP-glucose, to the galactose moiety of lactose, thereby synthesizing the undesired by-products Gal(β1,3)Gal(β1,4)Glc and Glc(β1,3)Gal(β1,4)Glc, respectively.
By expressing a galactan β-1,3-galactosidase and/or a β-1,3-glucosidase in a genetically engineered microbial host cell capable of producing LNT or LNnT, formation of the by-products Gal(β1,3)Gal(β1,4)Glc and Glc(β1,3)Gal(β1,4)Glc can be abolished or at least reduced, because these by-products are hydrolyzed in the genetically engineered microbial host cell by the galactan β-1,3-galactosidase and/or the β-1,3-glucosidase. The resulting degradation products are galactose and/or glucose, and lactose. Both monosaccharides as well as lactose can be used by the genetically engineered microbial host cell to produce the target LNT or LNnT.
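The by-product recycling described in this section can be summarized as a small lookup table. The sketch below is illustrative only: the enzyme, by-product and degradation-product names are taken from the text, whereas the class and function names are hypothetical. The pairing of the galactan β-1,3-galactosidase with Gal(β1,3)Gal(β1,4)Glc and of the β-1,3-glucosidase with Glc(β1,3)Gal(β1,4)Glc is an assumption inferred from the linkage specificities stated above.

```python
# Illustrative summary of which heterologous glycosidase removes which by-product.
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class Hydrolysis:
    glycosidase: str           # heterologous enzyme expressed in the host cell
    target: str                # oligosaccharide the host cell is meant to produce
    by_product: str            # undesired by-product hydrolyzed by the enzyme
    products: Tuple[str, ...]  # degradation products the host cell can reuse


RECYCLING = (
    Hydrolysis("α-2,3-sialidase", "6'-SL", "3'-SL",
               ("N-acetylneuraminic acid", "lactose")),
    Hydrolysis("β-1,3-galactosidase", "LNnT", "LNT",
               ("galactose", "LNT-II")),
    # The two pairings below are inferred, see the lead-in paragraph.
    Hydrolysis("galactan β-1,3-galactosidase", "LNT or LNnT",
               "Gal(β1,3)Gal(β1,4)Glc", ("galactose", "lactose")),
    Hydrolysis("β-1,3-glucosidase", "LNT or LNnT",
               "Glc(β1,3)Gal(β1,4)Glc", ("glucose", "lactose")),
)


def degradation_products(by_product: str) -> Tuple[str, ...]:
    """Return the degradation products listed for a given by-product."""
    for entry in RECYCLING:
        if entry.by_product == by_product:
            return entry.products
    raise KeyError(f"no glycosidase listed for by-product {by_product!r}")


if __name__ == "__main__":
    for entry in RECYCLING:
        print(f"{entry.glycosidase}: {entry.by_product} -> "
              + " + ".join(entry.products))
```

Each listed degradation product is either an acceptor substrate (lactose, LNT-II) or a monosaccharide that the host cell can reactivate, which is why hydrolysis of the by-product does not waste carbon.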
In an additional and/or alternative embodiment, the genetically engineered microbial host cell has been genetically engineered to express a β-1,3-glucosidase. In an additional and/or alternative embodiment, the genetically engineered microbial host cell has been genetically engineered to contain a nucleic acid molecule comprising a nucleotide sequence that encodes a β-1,3-glucosidase for its expression. Preferably, the nucleotide sequence encoding the β-1,3-glucosidase is a nucleotide sequence selected from the group consisting of:
- the nucleotide sequence as represented by SEQ ID NO: 11;
- nucleotide sequences that are complementary to a nucleotide sequence that hybridizes to the nucleotide sequence as represented by SEQ ID NO: 11 under stringent conditions;
- nucleotide sequences having at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% sequence identity with the nucleotide sequence as represented by SEQ ID NO: 11;
- nucleotide sequences encoding a polypeptide having an amino acid sequence as represented by SEQ ID NO: 12; and
- nucleotide sequences encoding a functional variant of the polypeptide sequence as represented by SEQ ID NO: 12, wherein the amino acid sequence of the functional variant has at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% sequence identity with the amino acid sequence as represented by SEQ ID NO: 12.
For expression of the nucleotide sequence encoding the β-1,3-glucosidase or a functional variant thereof, said nucleotide sequence is operably linked to expression control sequences that mediate expression of the nucleotide sequence encoding the β-1,3-glucosidase or its functional variant in the genetically engineered microbial host cell.
The genetically engineered microbial host cell may reuse at least one of the degradation products resulting from the enzymatic activity of the heterologous glycosidase in the genetically engineered microbial host cell. Thus, the genetically engineered microbial host cell can use at least one of the degradation products generated by the enzymatic activity of the heterologous glycosidase to produce the target oligosaccharide. For example, a monosaccharide residue released from an undesired saccharide by-product by the heterologous glycosidase can be reactivated, that is, bound to a nucleotide, and then transferred from the resulting nucleotide-activated monosaccharide to an acceptor substrate by an appropriate glycosyltransferase to give the target oligosaccharide or a precursor of the target oligosaccharide.
The method comprises the step of cultivating the genetically engineered microbial host cell in a medium that is permissive for the production of the target oligosaccharide by said genetically engineered microbial host cell, and under conditions that are permissive for the production of the target oligosaccharide by said genetically engineered microbial host cell.
The medium that is permissive for the production of the target oligosaccharide by the genetically engineered microbial host cell contains nutrients, at least one energy source, essential metals and minerals, and a buffering agent. The medium optionally contains a precursor of the target oligosaccharide, provided that the genetically engineered microbial host cell cannot synthesize said precursor itself. In that case, the genetically engineered microbial host cell takes up the precursor and feeds it into the biosynthesis of the target oligosaccharide. For example, lactose can be regarded as a precursor of 2'-fucosyllactose.
Permissive conditions are maintained while the genetically engineered microbial host cells are cultivated to produce the target oligosaccharide. Conditions are "permissive" if the genetically engineered microbial host cells cultivated under them remain alive and produce the target oligosaccharide. Preferably, permissive cultivation conditions also allow the genetically engineered microbial host cells to proliferate. Conditions that have to be maintained at a certain value or within a certain range include pH, temperature, oxygen, and the concentrations of nutrients, energy sources, and essential metals and minerals.
In an additional and/or alternative embodiment, the method comprises the step of isolating the target oligosaccharide. The target oligosaccharide can be isolated from the fermentation broth and/or from the genetically engineered host microorganism.
The method as described hereinbefore is advantageous in that fewer or no off-target by-products are formed during production of the target oligosaccharide. It is therefore less laborious and less costly to isolate and purify the target oligosaccharide from the fermentation broth or cell lysate.
In addition, much more of the substrate is used specifically for production of the target oligosaccharide, instead of becoming unavailable for it by being incorporated into undesired by-products that cannot be metabolized by the microbial host cell.
According to a second aspect, genetically engineered microbial host cells for producing a target oligosaccharide are provided, wherein the microbial host cell is able to produce the target oligosaccharide, and wherein the microbial host cell has been genetically engineered to express a heterologous glycosidase that is able to degrade intracellularly the metabolic by-products formed during intracellular biosynthesis of the target oligosaccharide.
According to a third aspect, the genetically engineered microbial host cells described hereinbefore are used to produce the target oligosaccharide. Using these genetically engineered microbial host cells to produce a target oligosaccharide by fermentation is advantageous because formation of off-target saccharide by-products is reduced or even abolished. This saves resources and makes it less laborious to isolate the target oligosaccharide from the fermentation broth, since the target oligosaccharide need not be separated from off-target oligosaccharide by-products. In addition, much more of the starting material (educt) and of the energy sources supplied to the genetically engineered microbial host cells of the present invention is converted into the target product than with a native microbial host cell that has not been genetically engineered to express the heterologous glycosidase.
According to a fourth aspect, the target oligosaccharides obtained by the method and/or by use of the genetically engineered microbial host cells described hereinbefore are preferably selected from the group of human milk oligosaccharides (HMOs).
The target oligosaccharides obtained by the method and/or by use of the genetically engineered microbial host cells described herein can be used to prepare a nutritional composition.
The nutritional composition is a medicinal composition, a dietary composition, an infant formula, or the like.
The present invention will be described with respect to particular embodiments and with reference to drawings, but the invention is limited only by the claims and not by these embodiments or drawings. Furthermore, the terms first, second and the like in the description and in the claims are used to distinguish between similar elements and not necessarily to describe a sequence, whether in time, in space, in ranking or in any other manner. It is to be understood that the terms so used are interchangeable under appropriate circumstances, and that the embodiments of the invention described herein are capable of operation in sequences other than those described or illustrated herein.
It is to be understood that the term "comprising", as used in the claims, should not be interpreted as being restricted to the means listed thereafter; it does not exclude other elements or steps. It is thus to be interpreted as specifying the presence of the stated features, integers, steps or components as referred to, but it does not preclude the presence or addition of one or more other features, integers, steps or components, or groups thereof. Thus, the scope of the expression "a device comprising means A and B" should not be limited to devices consisting only of components A and B. It means that, with respect to the present invention, the only relevant components of the device are A and B.
Reference throughout this specification to "one embodiment" or "an embodiment" means that a particular feature, structure or characteristic described in connection with the embodiment is included in at least one embodiment of the present invention. Thus, appearances of the phrases "in one embodiment" or "in an embodiment" in various places throughout this specification do not necessarily all refer to the same embodiment, although they may. Furthermore, the particular features, structures or characteristics may be combined in any suitable manner in one or more embodiments, as would be apparent to one of ordinary skill in the art from this disclosure.
Similarly, it should be appreciated that in the description of illustrative embodiments of the invention, various features of the invention are sometimes grouped together in a single embodiment, figure, or description thereof for the purpose of streamlining the disclosure and aiding the understanding of one or more of the various aspects of the invention. This method of disclosure, however, is not to be interpreted as reflecting an intention that the claimed invention requires more features than are expressly recited in each claim. Rather, as the following claims reflect, aspects of the invention lie in fewer than all features of a single foregoing disclosed embodiment. Thus, the claims following the detailed description are hereby expressly incorporated into this detailed description, with each claim standing on its own as a separate embodiment of this invention.
Furthermore, while some embodiments described herein include some but not other features included in other embodiments, combinations of features of different embodiments are meant to be within the scope of the invention and to form different embodiments, as would be understood by those skilled in the art. For example, in the following claims, any of the claimed embodiments can be used in any combination.
Furthermore, some of the embodiments are described herein as a method or a combination of elements of a method that can be implemented by a processor of a computer system or by other means of carrying out the function. Thus, a processor with the necessary instructions for carrying out such a method or element of a method forms a means for carrying out the method or element of a method. Furthermore, an element of an apparatus embodiment described herein is an example of a means for carrying out the function performed by that element for the purpose of carrying out the invention.
Numerous specific details are set forth in the description and drawings provided herein. It is understood, however, that embodiments of the invention may be practiced without these specific details. In other instances, well-known methods, structures and techniques have not been shown in detail in order not to obscure an understanding of this description.
The invention will now be described by a detailed description of several embodiments of the invention. It is clear that other embodiments of the invention can be configured according to the knowledge of persons skilled in the art without departing from the true spirit or technical teaching of the invention, the invention being limited only by the terms of the appended claims.
Example 1: Metabolic engineering of E. coli strain BL21(DE3) for production of 2'-fucosyllactose
E. coli BL21(DE3) (Novagen) was used as the parental strain for constructing the host strain for 2'-FL production. Genetic engineering of the parental strain comprised gene disruption and deletion events and the integration of heterologous genes.
Because 2'-fucosyllactose is synthesized from lactose, which is supplied to the bacterial culture, and from GDP-L-fucose, which is produced by the living cells, the wild-type copy of the lacZ gene encoding the endogenous β-galactosidase was first inactivated by mutagenesis using mismatch oligonucleotides (Ellis et al., "High efficiency mutagenesis, repair, and engineering of chromosomal DNA using single-stranded oligonucleotides", Proc. Natl. Acad. Sci. USA 98:6742-6746 (2001)). The arabinose isomerase gene araA was disrupted by the same method.
A lacZΩ gene fragment was introduced under the control of the temperature-sensitive transcriptional repressor cI857. The lacZα gene fragment is expressed in this strain under the control of the E. coli BL21(DE3) PgbA promoter, giving a LacZ+ strain.
Genomic deletions were carried out by λ Red-mediated recombination according to the method of Datsenko and Wanner ("One-step inactivation of chromosomal genes in Escherichia coli K-12 using PCR products", Proc. Natl. Acad. Sci. USA 97:6640-6645 (2000)). The genes fucI and fucK, encoding L-fucose isomerase and L-fuculose kinase, respectively, were deleted to prevent degradation of L-fucose. The genes wzxC-wcaJ were also deleted. wcaJ probably encodes a UDP-glucose:undecaprenyl phosphate glucose-1-phosphate transferase that catalyzes the first step in colanic acid synthesis (Stevenson et al., "Organization of the Escherichia coli K-12 gene cluster responsible for production of the extracellular polysaccharide colanic acid", J. Bacteriol. 178:4885-4893 (1996)); colanic acid production would compete with the fucosyltransferase reaction for GDP-fucose.
Genomic integration of heterologous genes was carried out by transposition. Large gene clusters were integrated into the genome by means of the hyperactive C9 mutant of the mariner transposase Himar1 (Lampe et al., "Hyperactive transposase mutants of the Himar1 mariner transposon", Proc. Natl. Acad. Sci. USA 96:11428-11433 (1999)), which was inserted into the plasmid pEcomar under transcriptional control of the Para promoter. To enhance de novo synthesis of GDP-fucose, the genes encoding phosphomannomutase (manB), mannose-1-phosphate guanosyltransferase (manC), GDP-mannose-4,6-dehydratase (gmd) and GDP-L-fucose synthase (wcaG) from E. coli K12 DH5α were overexpressed in E. coli strain BL21(DE3); the manCB operon was placed under the control of the constitutive promoter Ptet, and the gmd, wcaG operon is transcribed from the constitutive promoter PT5. The transposon cassette <Ptet-manCB-PT5-gmd, wcaG-FRT-dhfr-FRT> (SEQ ID NO: 13), which includes the dihydrofolate reductase gene for trimethoprim resistance and is flanked by the inverted terminal repeats specifically recognized by the mariner-like element Himar1 transposase, was inserted into the E. coli genome from pEcomar C9-manCB-gmd, wcaG-dhfr.
For chromosomal integration of single genes, the EZ-Tn5™ transposase (Epicentre, USA) was used. To produce EZ-Tn5 transposomes, the gene of interest together with an FRT-site-flanked antibiotic resistance cassette was amplified with primers that carried, at both ends, the 19-bp Mosaic End recognition sites (5'-CTGTCTCTTATACACATCT, SEQ ID NO: 21) for the EZ-Tn5 transposase. Using the EZ-Tn5™ transposase, the gene for the lactose importer LacY from E. coli K12 TG1 (acc. no. ABN72583), the 2-fucosyltransferase gene wbgL from E. coli:O126 (acc. no. ADN43847), and the gene yberc0001_9420, encoding a sugar efflux transporter of the major facilitator superfamily from Yersinia bercovieri ATCC 43970 (acc. no. EEQ08298), were integrated using the respective integration cassettes: <Ptet-lacY-FRT-aadA-FRT> (SEQ ID NO: 14), <Ptet-wbgLco-FRT-neo-FRT> (SEQ ID NO: 15) and <Ptet-yberc0001_9420co-FRT-cat-FRT> (SEQ ID NO: 16). The genes wbgL and yberc0001_9420 were synthesized and codon-optimized (co) by GenScript Cooperation (USA). After successful integration of the lacY gene, the resistance gene was eliminated from streptomycin-resistant clones by the FLP recombinase encoded on plasmid pCP20 (Datsenko and Wanner, "One-step inactivation of chromosomal genes in Escherichia coli K-12 using PCR products", Proc. Natl. Acad. Sci. USA 97:6640-6645 (2000)).
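The primer design described above, in which each amplification primer carries the 19-bp Mosaic End at its 5' end, can be sketched as follows. This is only an illustration: the Mosaic End sequence is the one quoted in the text (SEQ ID NO: 21), while the helper names, the 20-nt annealing length and the toy cassette sequence are assumptions and do not come from the patent.

```python
from typing import Tuple

# 19-bp Mosaic End (ME) recognition sequence quoted in the text (SEQ ID NO: 21).
MOSAIC_END = "CTGTCTCTTATACACATCT"

_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def me_tailed_primers(cassette: str, anneal_len: int = 20) -> Tuple[str, str]:
    """Build forward and reverse primers that each carry a 5' ME tail.

    cassette   : top-strand sequence to amplify (gene plus resistance marker).
    anneal_len : length of the template-specific part (assumed value).
    """
    if len(cassette) < anneal_len:
        raise ValueError("cassette is shorter than the annealing region")
    forward = MOSAIC_END + cassette[:anneal_len]
    reverse = MOSAIC_END + reverse_complement(cassette[-anneal_len:])
    return forward, reverse


# Hypothetical toy sequence, used only to show the call.
toy_cassette = "ATGGCTAGCTTGACCGGTAACGTTCAGGATCCGTTAAGCTTGCATGCCTGA"
fwd, rev = me_tailed_primers(toy_cassette)
print(fwd)
print(rev)
```

Because both primers start with the same Mosaic End, the amplified product carries the recognition site at each end in inverted orientation, which is the arrangement the transposase needs in order to form a transposome.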
Since E. coli BL21(DE3) lacks a functional gal operon, a natively regulated copy of the galETKM operon from E. coli K was integrated into the B strain by EZ transposition using the integration cassette <Pgal-galE-galT-galK-galM> (SEQ ID NO: 17). Integrants were selected as red colonies on MacConkey agar containing 1% galactose. The resulting strain is able to metabolize the monosaccharides glucose and galactose that arise from lactose hydrolysis.
A further improvement in 2'-fucosyllactose synthesis by the E. coli strain was achieved by deleting the pfkA gene, which encodes phosphofructokinase A. When E. coli is cultivated on a gluconeogenic substrate such as glycerol, phosphorylation of fructose-6-phosphate by PfkA is a treadmill reaction with high ATP consumption and, in addition, competes with ManA for the substrate. The pfkA gene was deleted by homologous recombination according to Datsenko and Wanner (2000) using a gentamicin resistance cassette (aacC1) flanked by lox71/66 sites (Lambert, JM et al. (2007) Cre-lox-based system for multiple gene deletions and selectable-marker removal in Lactobacillus plantarum. Appl. Environ. Microbiol. 73:1126-1135). After successful deletion of the pfkA gene, the antibiotic resistance gene was removed from the E. coli genome using the Cre recombinase (Abremski, K et al. (1983) Studies on the properties of P1 site-specific recombination: evidence for topologically unlinked products following recombination. Cell 32:1301-1311), which had been cloned under the control of the Para promoter into the pKD46 backbone (Datsenko and Wanner, 2000).
For various fucosyltransferases, a GDP-L-fucose hydrolase activity has been demonstrated in addition to the transferase activity. This hydrolytic activity has also been shown for wbgL, the α-1,2-fucosyltransferase used herein for 2'-fucosyllactose synthesis (see EP 3 050 973 A1). To salvage free L-fucose for 2'-fucosyllactose production and to eliminate contaminating L-fucose from the culture broth, the fkp gene, encoding the bifunctional L-fucokinase/L-fucose-1-phosphate guanylyltransferase of Bacteroides fragilis, under transcriptional control of the Ptet promoter, was chromosomally integrated together with the lox71/66-flanked aacC1 gene by transposition using the EZ-Tn5™ transposase, <Ptet-fkp-lox-aacC1-lox> (SEQ ID NO: 18). After successful integration, the gentamicin resistance gene was removed from the genome as described above.
To enhance the flux of the metabolized carbon source glycerol through the gluconeogenic pathway from triose phosphates to fructose-6-phosphate, and thereby feed GDP-L-fucose biosynthesis, the genes encoding fructose-1,6-bisphosphate aldolase (fbaB) and a heterologous fructose-1,6-bisphosphate phosphatase (fbpase) from Pisum sativum were overexpressed. The fbaB gene from E. coli BL21(DE3) was fused to the Ptet promoter. The activity of the chloroplast FBPase of P. sativum is allosterically regulated by a disulfide-dithiol exchange brought about by reduction by thioredoxins. Replacing cysteine residue 153 with serine results in a constitutively active enzyme. The gene encoding the chloroplast FBPase from P. sativum (acc. no. AAD10213) was purchased from GenScript codon-optimized for expression in E. coli, tagged with an N-terminal hexahistidine tag, and modified to encode the C153S variant of the enzyme. The fbpase gene is transcribed from promoter 17. The cassette <Ptet-fbaB-P17-His6-fbpase-lox-aacC1-lox> (SEQ ID NO: 19) was used for EZ-Tn5™ transposase-mediated integration into the host strain. After removal of the gentamicin resistance gene from the E. coli genome, the strain was used for production of 2'-fucosyllactose. This strain is referred to hereinafter as "strain A".
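As a reading aid, the modifications that lead from E. coli BL21(DE3) to strain A can be collected in a small data structure. The sketch below only restates what Example 1 says (inactivated genes and the integration cassettes of SEQ ID NO: 13 to 19); the class and function names are illustrative assumptions, and the structure is a summary rather than a formal genotype notation.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Cassette:
    seq_id: int       # SEQ ID NO as given in Example 1
    design: str       # cassette layout as written in the text
    transposase: str  # enzyme used for genomic integration


# Genes inactivated in the parental strain, with the method named in the text.
INACTIVATED = {
    "lacZ": "mismatch-oligonucleotide mutagenesis",
    "araA": "mismatch-oligonucleotide mutagenesis",
    "fucI": "lambda Red recombination",
    "fucK": "lambda Red recombination",
    "wzxC-wcaJ": "lambda Red recombination",
    "pfkA": "homologous recombination (Datsenko and Wanner)",
}

# Integration cassettes, in the order in which Example 1 introduces them.
CASSETTES = [
    Cassette(13, "Ptet-manCB-PT5-gmd,wcaG-FRT-dhfr-FRT", "Himar1"),
    Cassette(14, "Ptet-lacY-FRT-aadA-FRT", "EZ-Tn5"),
    Cassette(15, "Ptet-wbgLco-FRT-neo-FRT", "EZ-Tn5"),
    Cassette(16, "Ptet-yberc0001_9420co-FRT-cat-FRT", "EZ-Tn5"),
    Cassette(17, "Pgal-galE-galT-galK-galM", "EZ-Tn5"),
    Cassette(18, "Ptet-fkp-lox-aacC1-lox", "EZ-Tn5"),
    Cassette(19, "Ptet-fbaB-P17-His6-fbpase-lox-aacC1-lox", "EZ-Tn5"),
]


def by_transposase(cassettes):
    """Group SEQ ID numbers by the transposase used for integration."""
    groups = defaultdict(list)
    for cassette in cassettes:
        groups[cassette.transposase].append(cassette.seq_id)
    return dict(groups)


print(by_transposase(CASSETTES))
```

Grouping by transposase mirrors the distinction drawn in the text: the large GDP-fucose gene cluster is integrated with the Himar1 mariner transposase, while the remaining cassettes are integrated with EZ-Tn5.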
Example 2: Construction of an E. coli BL21(DE3) strain for the production of 2'-fucosyllactose with high purity
Fed-batch cultivations using strain A for the production of 2'-fucosyllactose revealed the presence of the by-products 3-fucosyllactose and 2'3-difucosyllactose in the culture broth. To minimize the formation of these by-products, and to improve the carbon yield, an α-1,3-fucosidase was subcloned behind a constitutive promoter and integrated into the genome of strain A. To this end, the afcB gene of Bifidobacterium bifidum (accession no. AB474964) was fused to the constitutive promoter Pand and to the gentamicin resistance gene. The resulting transposon cassette <Pand-afcB-lox-aacC1-lox> (SEQ ID NO: 20), flanked by the inverted terminal repeats specifically recognized by the mariner-like element Himar1 transposase, was inserted into the E. coli genome from pEcomar afcB-aacC1, yielding "strain B".
Example 3: HPLC analysis for the detection of 2'-fucosyllactose in the culture supernatant
HPLC analyses were performed using a refractive index detector (RID-10A) (Shimadzu, Germany) and a Waters XBridge Amide column, 3.5 µm (250 × 4.6 mm) (Eschborn, Germany), connected to an HPLC system (Shimadzu, Germany). Elution was performed isocratically with 30% solvent A (50% (v/v) acetonitrile (ACN) in ddH2O, 0.1% (v/v) NH4OH) and 70% solvent B (80% (v/v) ACN in ddH2O, 0.1% (v/v) NH4OH) as eluent at 35°C and a flow rate of 1.4 mL/min. HPLC samples were sterile-filtered (0.22 µm pore size) and cleaned up by solid-phase extraction on an ion exchange matrix (Strata ABW, Phenomenex). 10 µL of each sample was applied to the column, and the 2'-fucosyllactose concentration was calculated from a standard curve. Other sugars, such as L-fucose and/or other monosaccharides, lactose and/or other disaccharides, 3-fucosyllactose and/or other trisaccharides, 2'3-difucosyllactose and/or other tetrasaccharides, as well as glycerol, can also be detected under these analysis conditions. The relative amounts of the detected sugars can be determined by comparing the area under the curve (AUC) of all peaks in the chromatogram. Peaks that are also present in the water control are excluded from this calculation.
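The two calculations described above (quantification against a standard curve, and relative amounts from peak areas with water-control peaks excluded) can be sketched as follows. This is only an illustrative sketch: the function names, the retention-time tolerance used to match blank peaks, and all numeric values are hypothetical and are not taken from the patent.

```python
from typing import Dict, List, Tuple


def relative_amounts(
    sample_peaks: Dict[str, Tuple[float, float]],
    blank_retention_times: List[float],
    tolerance_min: float = 0.05,  # assumed matching window, not from the source
) -> Dict[str, float]:
    """Return each peak's share of the total AUC in percent.

    sample_peaks maps a peak label to (retention time in min, AUC).
    Peaks whose retention time matches a peak of the water control
    are excluded before the shares are computed.
    """
    kept = {
        name: area
        for name, (rt, area) in sample_peaks.items()
        if not any(abs(rt - b) <= tolerance_min for b in blank_retention_times)
    }
    total = sum(kept.values())
    return {name: 100.0 * area / total for name, area in kept.items()}


def fit_standard_curve(concs: List[float], areas: List[float]) -> Tuple[float, float]:
    """Least-squares fit of area = slope * concentration + intercept."""
    n = len(concs)
    mean_x = sum(concs) / n
    mean_y = sum(areas) / n
    sxx = sum((x - mean_x) ** 2 for x in concs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(concs, areas))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def concentration(area: float, slope: float, intercept: float) -> float:
    """Invert the standard curve to obtain a concentration from a peak area."""
    return (area - intercept) / slope


# Hypothetical chromatogram: label -> (retention time, AUC)
example_peaks = {
    "2'-fucosyllactose": (9.8, 9400.0),
    "lactose": (7.1, 300.0),
    "3-fucosyllactose": (10.6, 300.0),
    "system peak": (2.0, 500.0),  # also seen in the water control
}
example_shares = relative_amounts(example_peaks, blank_retention_times=[2.0])

# Hypothetical standards (g/L) and their peak areas
slope, intercept = fit_standard_curve([1.0, 2.0, 4.0], [100.0, 200.0, 400.0])
example_conc = concentration(250.0, slope, intercept)
```

A linear fit is assumed here for simplicity; the source only states that a standard curve was used, not its form.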
Example 4: Production of 2'-fucosyllactose by fermentation
Fermentations were carried out in 3 L fermenters (New Brunswick, Edison, USA) at 33°C, starting with 1000 mL of mineral salts medium containing 3 g/L KH2PO4, 12 g/L K2HPO4, 5 g/L (NH4)2SO4, 0.3 g/L citric acid, 2 g/L MgSO4·7H2O, 0.1 g/L NaCl and 0.015 g/L CaCl2·6H2O, supplemented with 1 g/L trace element solution (54.4 g/L ammonium iron(III) citrate, 9.8 g/L MnCl2·4H2O, 1.6 g/L CoCl2·6H2O, 1 g/L CuCl2·2H2O, 1.9 g/L H3BO3, 9 g/L ZnSO4·7H2O, 1.1 g/L Na2MoO4·2H2O, 1.5 g/L Na2SeO3, 1.5 g/L NiSO4·6H2O) and containing 2% (v/v) glycerol as the carbon source, 60 mM lactose and the antibiotic kanamycin (25 µg/mL). Aeration was maintained at 3 L/min. Dissolved oxygen was kept at 20-30% saturation by controlling the stirring rate. The pH was maintained at 7.0 by adding 25% ammonia solution. Cultivation was started with a 2.5% (v/v) inoculum from a preculture grown in the same glycerol-containing medium but without lactose. After the end of the batch phase, indicated by a rise in the dissolved oxygen level, a glycerol feed (60% (v/v), supplemented with 2 g/L MgSO4·7H2O, 0.015 g/L CaCl2·6H2O and 1 mL/L trace element solution) was applied at flow rates of 7.0-8.0 mL/h, relative to the starting volume. A lactose feed (0.66 M) was applied throughout the cultivation and was adjusted intuitively to maintain a constant supply of lactose in the culture broth. The lactose feed was stopped towards the end of the fermentation, and cultivation was continued until the lactose had been completely converted to 2'-fucosyllactose. Approximately 94 hours after inoculation of the fermenter, a 2'-fucosyllactose titer of approximately 150 g/L in the culture medium was reached with the strain described in Example 1 (strain A).
The 2'-fucosyllactose-producing strain genetically modified as described in Example 2 (strain B) was cultivated in the same way and likewise reached a 2'-fucosyllactose titer of approximately 150 g/L. However, the amount of by-products was significantly lower than after cultivation of strain A (Table 2). Whereas 2'-fucosyllactose accounted for only 94.22% of the saccharide content in the culture supernatant of strain A, this share rose by 5.50 percentage points in the culture supernatant of strain B, corresponding to a purity of 99.72%.
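The purity figures reported above can be checked with a few lines of arithmetic. The sketch below assumes that everything in the saccharide fraction other than 2'-fucosyllactose counts as by-product; that reading is an assumption made for illustration, and the variable and function names are hypothetical.

```python
def byproduct_share(purity_percent: float) -> float:
    """Share of the saccharide content that is not 2'-fucosyllactose (assumption)."""
    return 100.0 - purity_percent


purity_strain_a = 94.22  # % 2'-fucosyllactose, strain A (from the text)
purity_strain_b = 99.72  # % 2'-fucosyllactose, strain B (from the text)

# Absolute gain in purity, in percentage points
purity_gain_points = purity_strain_b - purity_strain_a

# Relative reduction of the non-2'-FL fraction, in percent
byprod_a = byproduct_share(purity_strain_a)
byprod_b = byproduct_share(purity_strain_b)
relative_byproduct_reduction = 100.0 * (byprod_a - byprod_b) / byprod_a

print(f"purity gain: {purity_gain_points:.2f} percentage points")
print(f"by-product fraction reduced by {relative_byproduct_reduction:.1f}%")
```

Under that assumption, the non-2'-fucosyllactose fraction falls from 5.78% to 0.28% of the saccharide content.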
SEQUENCE LISTING
<110> Jennewein Biotechnologie GmbH
<120> USE OF GLYCOSIDASES IN OLIGOSACCHARIDE PRODUCTION
<130> P 1804 WO
<160> 20
<170> PatentIn version 3.5
<210> 1
<211> 5880
<212> DNA
<213> Bifidobacterium bifidum
<400> 1
atgaaacata gagcgatgtc atcgcgtctg atgccactgg tggcgtcctg cgcgacggtc 60
ggcatgctgc tggccggact acctgtgtcg gccgtcgcgg tcggcacgac gagagcggca 120
gcgtccgacg cctcgtcctc caccacagca accatcaccc cctccgccga taccacgttg 180
cagacatgga cgagcgagaa gaattcctca atggcgtcca agccgtacat cggcacactg 240
caagggccct cgcaaggcgt gttcggcgag aagttcgagt ccacggatgc cgcggacacc 300
accgatctga agaccggcct gctgacgttc gacctgagcg cctacgacca tgcccccgat 360
tccgcaacgt tcgagatgac gtacctcggc taccgcggca acccgacggc caccgacacc 420
gacaccatca aggtgacccc cgtcgacacc accgtgtgca ccaataacgc cacagactgc 480
ggcgcgaatg tcgcgaccgg cgcgaccaag ccgaagttca gcatcaacga ctcctcattc 540
gtcgccgagt ccaagccgtt cgagtacggt acgacggttt acacgggcga cgccatcacc 600
gtggttcccg ccaataccaa gaaggtcacc gtagatgtga ccgaaatcgt gcgccagcag 660
ttcgccgaag gcaagaaggt catcaccctg gccgtgggcg agaccaagaa gaccgaggtt 720
cgtttcgcca gttccgaagg cacgacgtcc ctgaacggcg cgaccgcaga catggctccg 780
aagctgaccg tttccgtgtc caccaaggac gatctcaagc cctccgccga caccacgttg 840
caggcatggg ccagcgagaa gaacgagaag aagaacactg cggcctatgt cggcgcgctg 900
cagccggaag gcgattacgg cgacttcggt gagaagttca agtccaccga cgtccacgat 960
gtcacagacg ccaagatggg tctgatgacg ttcgacctgt ccgattacac cgcggcgccc 1020
gagcactcca tcctcacctt gacgtatctg ggctacgccg gtgcagacaa gaccgccacg 1080
gccaccgata aggtcaaggt ggtcgctgtt gacacgtcgc ggtgcaccgg caccgctccc 1140
tgcgacacca acaatgccac gtgggcgaac cgcccggact tcgaggtgac cgataccacg 1200
aagaccgcga cgtcccatgc gttcgcttat ggatctaaga agtattccga tggcatgacc 1260
gtcgaatcgg gcaacgccaa gaaggtcctg ctcgacgtgt ccgatgtcat caaggcagag 1320
ttcgccaagt tcagcgccgg cgccaccgag aagaagatca cgctggccct gggcgagctc 1380
aacaagtccg acatgcgttt cggcagcaag gaagtcacct cgctgaccgg cgccaccgaa 1440
gccatgcagc cgaccttgtc cgtcaccaag aagccgaagg catacacgct gagcatcgaa 1500
ggcccgacca aggtcaagta ccagaagggc gaggcgttcg acaaggccgg actcgtggtc 1560
aaggccacca gcacggctga cggcacggtc aagacgctga ccgaaggcaa cggtgaggat 1620
aactacacca tcgacaccag cgctttcgat agtgccagca tcggcgtata ccctgttacc 1680
gtgaagtaca acaaggaccc cgaaatcgcc gcttcgttca acgcctatgt catcgccagt 1740
gtcgaggacg gcggagacgg cgacaccagc aaagacgact ggctgtggta caagcagccc 1800
gcgtcgcaga ccgacgccac cgccaccgcc ggcggcaatt acggcaaccc cgacaacaac 1860
cgttggcagc agaccacctt gccgttcggc aacggcaaga tcggcggcac cgtctggggc 1920
gaggtcagcc gtgaacgcgt caccttcaac gaggagacgc tgtggaccgg cggccccgga 1980
tcctcgacca gctacaacgg cggcaacaac gagaccaagg gtcagaacgg cgccacgctg 2040
cgcgcgctca acaagcagct cgcgaacggc gccgagacgg tcaatcccgg caacctgacc 2100
ggcggcgaga acgcggccga gcagggcaac tacctgaact ggggcgacat ctacctcgac 2160
tacgggttca acgatacgac cgtcaccgaa taccgccgcg acctgaacct gagcaagggc 2220
aaggccgacg tcacgttcaa gcatgacggc gtcacctaca cgcgcgaata cttcgcgtcg 2280
aaccccgaca atgtcatggt cgcccgcctc acggccagca aagccggcaa gctgaacttc 2340
aacgtcagca tgccgaccaa cacgaactac tccaagaccg gcgaaaccac gacggtcaag 2400
ggtgacacgc tcaccgtcaa gggcgctctc ggcaacaacg gcctgctgta caactcgcag 2460
atcaaggtcg tcctcgacaa cggtgagggc acgctctccg aaggctccga cggcgcttcg 2520
ctgaaggtct ccgacgcgaa ggcggtcacg ctgtacatcg ccgccgcgac ggactacaag 2580
cagaagtatc cgtcctaccg caccggcgaa accgccgccg aggtgaacac ccgcgtcgcc 2640
aaggtcgtgc aggacgccgc caacaagggc tacaccgccg tcaagaaagc gcacatcgac 2700
gatcattccg ccatctacga ccgcgtgaag atcgatttgg gccagtccgg ccacagctcc 2760
gacggcgccg tcgccaccga cgcgctgctc aaggcgtacc agagaggctc cgcaaccacc 2820
gcgcagaagc gcgagctgga gacgctggtg tacaagtacg gccgctactt gaccatcggc 2880
tcctcccgtg agaacagcca gctgcccagc aacctgcagg gcatctggtc ggtcaccgcg 2940
ggcgacaacg cccacggcaa cacgccttgg ggctccgact tccacatgaa cgtgaacctc 3000
cagatgaact actggccgac ctattcggcc aacatgggag agctcgccga gccgctcatc 3060
gagtatgtgg agggtctggt caagcccggc cgtgtgaccg ccaaggtcta cgcgggcgcg 3120
gagacgacga accccgagac cacgccgatc ggcgagggcg agggctacat ggcccacacc 3180
gagaacaccg cctacggctg gaccgcaccc ggtcaatcgt tctcgtgggg ttggagcccg 3240
gccgccgtgc cgtggatcct gcagaacgtg tacgaggcgt acgagtactc cggcgaccct 3300
gccctgcttg atcgcgtgta cgcgctgctc aaggaggaat cgcacttcta cgtcaactac 3360
atgctgcaca aggccggctc cagctccggt gaccgcctga ctaccggcgt cgcgtactcg 3420
cccgaacagg gcccgctggg caccgacggc aacacgtacg agagctcgct cgtgtggcag 3480
atgctcaacg acgccatcga ggcggccaag gccaagggag atccggacgg tctggtcggc 3540
aataccaccg actgctcggc cgacaactgg gccaagaatg acagcggcaa cttcaccgat 3600
gcgaacgcca accgttcctg gagctgcgcc aagagcctgc tcaagccgat cgaggtcggc 3660
gactccggcc agatcaagga atggtacttc gaaggtgcgc tcggcaagaa gaaggatgga 3720
tccaccatca gcggctacca ggcggacaac cagcaccgtc acatgtccca cctgctcgga 3780
ctgttccccg gtgatttgat caccatcgac aactccgagt acatggatgc ggccaagacc 3840
tcgctgaggt accgctgctt caagggcaac gtgctgcagt ccaacaccgg ctgggccatt 3900
ggccagcgca tcaattcgtg ggctcgcacc ggcgacggca acaccacgta ccagctggtc 3960
gagctgcagc tcaagaacgc gatgtatgca aacctgttcg attaccatgc gccgttccag 4020
atcgacggca acttcggcaa cacctccggt gtcgacgaaa tgctgctgca gtccaactcc 4080
accttcaccg acaccgccgg caagaagtac gtgaactaca cgaacatcct gcccgccctg 4140
cccgatgcct gggcgggcgg ctcggtgagc ggcctcgtgg cccgcggcaa cttcaccgtc 4200
ggcacgacat ggaagaacgg caaggccacc gaagtcaggc tgacctccaa caagggcaag 4260
caggcggccg tcaagatcac cgccggcggc gcccagaact acgaggtcaa gaacggtgac 4320
accgccgtga acgccaaggt cgtgaccaac gcggacggcg cctcgctgct cgtgttcgat 4380
accaccgcag gcaccacgta cacgatcacg aagaaggcga gcgccaacgt gcccgtcacc 4440
ggcgtgaccg tgaccggcgc caacaccgcc accgcaggcg acaccgtcac tcttacggct 4500
accgtcgccc cggccaatgc gaccgacaag tccgtcacct ggtcgacctc cgacgccgcc 4560
gtagctacgg tcaacgccaa cggcgtggtg accacgaaga aggccggcaa ggtgaccatc 4620
accgccacgt cgaacggcga caagacgaag ttcggttcca tcgagatcac cgtctccgcc 4680
gcgaccgtgc ccgtcaccag cgtcaccgtt gccggcgacg ccgcgatgac cgtcgatgga 4740
gagcagaccc tgacggcgac cgtcgccccg gccactgcga ccgacaagac ggtcacgtgg 4800
aagtcctccg acgccactgt ggcgacggtt gacgccaacg gcaaggtcgt cgcgaagaag 4860
gccggcgaag tgacgatcac cgccacggcc ggtggcgtgt ccggcacgct gaagatcacg 4920
gtgagcgaca aggccccgac cgtcatcccg gtccagtccg tgaccgtgac aggcaagcag 4980
gagctcgtcg aaggcgcctc cacgaccctg acggcgaccg tcgccccggc tgacgcgacc 5040
gacaagacgg ttacgtggaa gtcgagcgac gagtccgtcg ccacggtcga caaggacggc 5100
gtcgtgaccg ccaagaaggc cggcacggtg accatcaccg ccacggccgg tggcgtgtcc 5160
ggcacgctcc acatcaccgt gacggccaag cccgtcgaga ccgtccccgt caccagcgtg 5220
gaggtcaccg tcgaggccgg caccaccgtc tccgtcggca agacactcca ggccaccgcg 5280
accgtcaagc ccggcaacgc caccaacaag aaggtgacgt ggaagtcgag cgacgaatcc 5340
atcgcgacgg tcgacgccaa cggcgtcatc accgcgaaga aggccggcaa ggtcgtcatc 5400
acggccacct cgaccgacgg cacggacaag tccggcagcg tcgagatcac cgtcgtggat 5460
gagaccaagc cgacgcccga ccacaagtcc gtcaaggccg ataccggcga cgtgaccgcc 5520
ggcaagaccg gtacggtcac cgagccgaag gacgtggcgg gctggaagag ccgctccatc 5580
atcaagcaag gcaagctcgg caaggccgaa atcgccgacg gcacgctcgt gtatgcggcc 5640
ggcgacaaga ccggtgacga cagcttcgtc gtgcagtaca cgatggccga cggcacggtc 5700
atcgacgtga cctacagcgt cacggtcaag gccgccgaaa ccggcaagaa cgacggcgac 5760
ggcaagggcg acggtgtcgc gaagaccggc gccgccgtcg gcgcgctcgc cggcctcggc 5820
ttgatgctgc tcgccgtcgg agtgagcgtg gtgatgattc gccgcaagca ctccgcctga 5880
<210> 2
<211> 1959
<212> PRT
<213> Bifidobacterium bifidum
<400> 2
Met Lys His Arg Ala Met Ser Ser Arg Leu Met Pro Leu Val Ala Ser
1 5 10 15
Cys Ala Thr Val Gly Met Leu Leu Ala Gly Leu Pro Val Ser Ala Val
20 25 30
Ala Val Gly Thr Thr Arg Ala Ala Ala Ser Asp Ala Ser Ser Ser Thr
35 40 45
Thr Ala Thr Ile Thr Pro Ser Ala Asp Thr Thr Leu Gln Thr Trp Thr
50 55 60
Ser Glu Lys Asn Ser Ser Met Ala Ser Lys Pro Tyr Ile Gly Thr Leu
65 70 75 80
Gln Gly Pro Ser Gln Gly Val Phe Gly Glu Lys Phe Glu Ser Thr Asp
85 90 95
Ala Ala Asp Thr Thr Asp Leu Lys Thr Gly Leu Leu Thr Phe Asp Leu
100 105 110
Ser Ala Tyr Asp His Ala Pro Asp Ser Ala Thr Phe Glu Met Thr Tyr
115 120 125
Leu Gly Tyr Arg Gly Asn Pro Thr Ala Thr Asp Thr Asp Thr Ile Lys
130 135 140
Val Thr Pro Val Asp Thr Thr Val Cys Thr Asn Asn Ala Thr Asp Cys
145 150 155 160
Gly Ala Asn Val Ala Thr Gly Ala Thr Lys Pro Lys Phe Ser Ile Asn
165 170 175
Asp Ser Ser Phe Val Ala Glu Ser Lys Pro Phe Glu Tyr Gly Thr Thr
180 185 190
Val Tyr Thr Gly Asp Ala Ile Thr Val Val Pro Ala Asn Thr Lys Lys
195 200 205
Val Thr Val Asp Val Thr Glu Ile Val Arg Gln Gln Phe Ala Glu Gly
210 215 220
Lys Lys Val Ile Thr Leu Ala Val Gly Glu Thr Lys Lys Thr Glu Val
225 230 235 240
Arg Phe Ala Ser Ser Glu Gly Thr Thr Ser Leu Asn Gly Ala Thr Ala
245 250 255
Asp Met Ala Pro Lys Leu Thr Val Ser Val Ser Thr Lys Asp Asp Leu
260 265 270
Lys Pro Ser Ala Asp Thr Thr Leu Gln Ala Trp Ala Ser Glu Lys Asn
275 280 285
Glu Lys Lys Asn Thr Ala Ala Tyr Val Gly Ala Leu Gln Pro Glu Gly
290 295 300
Asp Tyr Gly Asp Phe Gly Glu Lys Phe Lys Ser Thr Asp Val His Asp
305 310 315 320
Val Thr Asp Ala Lys Met Gly Leu Met Thr Phe Asp Leu Ser Asp Tyr
325 330 335
Thr Ala Ala Pro Glu His Ser Ile Leu Thr Leu Thr Tyr Leu Gly Tyr
340 345 350
Ala Gly Ala Asp Lys Thr Ala Thr Ala Thr Asp Lys Val Lys Val Val
355 360 365
Ala Val Asp Thr Ser Arg Cys Thr Gly Thr Ala Pro Cys Asp Thr Asn
370 375 380
Asn Ala Thr Trp Ala Asn Arg Pro Asp Phe Glu Val Thr Asp Thr Thr
385 390 395 400
Lys Thr Ala Thr Ser His Ala Phe Ala Tyr Gly Ser Lys Lys Tyr Ser
405 410 415
Asp Gly Met Thr Val Glu Ser Gly Asn Ala Lys Lys Val Leu Leu Asp
420 425 430
Val Ser Asp Val Ile Lys Ala Glu Phe Ala Lys Phe Ser Ala Gly Ala
435 440 445
Thr Glu Lys Lys Ile Thr Leu Ala Leu Gly Glu Leu Asn Lys Ser Asp
450 455 460
Met Arg Phe Gly Ser Lys Glu Val Thr Ser Leu Thr Gly Ala Thr Glu
465 470 475 480
Ala Met Gln Pro Thr Leu Ser Val Thr Lys Lys Pro Lys Ala Tyr Thr
485 490 495
Leu Ser Ile Glu Gly Pro Thr Lys Val Lys Tyr Gln Lys Gly Glu Ala
500 505 510
Phe Asp Lys Ala Gly Leu Val Val Lys Ala Thr Ser Thr Ala Asp Gly
515 520 525
Thr Val Lys Thr Leu Thr Glu Gly Asn Gly Glu Asp Asn Tyr Thr Ile
530 535 540
Asp Thr Ser Ala Phe Asp Ser Ala Ser Ile Gly Val Tyr Pro Val Thr
545 550 555 560
Val Lys Tyr Asn Lys Asp Pro Glu Ile Ala Ala Ser Phe Asn Ala Tyr
565 570 575
Val Ile Ala Ser Val Glu Asp Gly Gly Asp Gly Asp Thr Ser Lys Asp
580 585 590
Asp Trp Leu Trp Tyr Lys Gln Pro Ala Ser Gln Thr Asp Ala Thr Ala
595 600 605
Thr Ala Gly Gly Asn Tyr Gly Asn Pro Asp Asn Asn Arg Trp Gln Gln
610 615 620
Thr Thr Leu Pro Phe Gly Asn Gly Lys Ile Gly Gly Thr Val Trp Gly
625 630 635 640
Glu Val Ser Arg Glu Arg Val Thr Phe Asn Glu Glu Thr Leu Trp Thr
645 650 655
Gly Gly Pro Gly Ser Ser Thr Ser Tyr Asn Gly Gly Asn Asn Glu Thr
660 665 670
Lys Gly Gln Asn Gly Ala Thr Leu Arg Ala Leu Asn Lys Gln Leu Ala
675 680 685
Asn Gly Ala Glu Thr Val Asn Pro Gly Asn Leu Thr Gly Gly Glu Asn
690 695 700
Ala Ala Glu Gln Gly Asn Tyr Leu Asn Trp Gly Asp Ile Tyr Leu Asp
705 710 715 720
Tyr Gly Phe Asn Asp Thr Thr Val Thr Glu Tyr Arg Arg Asp Leu Asn
725 730 735
Leu Ser Lys Gly Lys Ala Asp Val Thr Phe Lys His Asp Gly Val Thr
740 745 750
Tyr Thr Arg Glu Tyr Phe Ala Ser Asn Pro Asp Asn Val Met Val Ala
755 760 765
Arg Leu Thr Ala Ser Lys Ala Gly Lys Leu Asn Phe Asn Val Ser Met
770 775 780
Pro Thr Asn Thr Asn Tyr Ser Lys Thr Gly Glu Thr Thr Thr Val Lys
785 790 795 800
Gly Asp Thr Leu Thr Val Lys Gly Ala Leu Gly Asn Asn Gly Leu Leu
805 810 815
Tyr Asn Ser Gln Ile Lys Val Val Leu Asp Asn Gly Glu Gly Thr Leu
820 825 830
Ser Glu Gly Ser Asp Gly Ala Ser Leu Lys Val Ser Asp Ala Lys Ala
835 840 845
Val Thr Leu Tyr Ile Ala Ala Ala Thr Asp Tyr Lys Gln Lys Tyr Pro
850 855 860
Ser Tyr Arg Thr Gly Glu Thr Ala Ala Glu Val Asn Thr Arg Val Ala
865 870 875 880
Lys Val Val Gln Asp Ala Ala Asn Lys Gly Tyr Thr Ala Val Lys Lys
885 890 895
Ala His Ile Asp Asp His Ser Ala Ile Tyr Asp Arg Val Lys Ile Asp
900 905 910
Leu Gly Gln Ser Gly His Ser Ser Asp Gly Ala Val Ala Thr Asp Ala
915 920 925
Leu Leu Lys Ala Tyr Gln Arg Gly Ser Ala Thr Thr Ala Gln Lys Arg
930 935 940
Glu Leu Glu Thr Leu Val Tyr Lys Tyr Gly Arg Tyr Leu Thr Ile Gly
945 950 955 960
Ser Ser Arg Glu Asn Ser Gln Leu Pro Ser Asn Leu Gln Gly Ile Trp Ser Ser Arg Glu Asn Ser Gln Leu Pro Ser Asn Leu Gln Gly Ile Trp
965 970 975 965 970 975
Ser Val Thr Ala Gly Asp Asn Ala His Gly Asn Thr Pro Trp Gly Ser Ser Val Thr Ala Gly Asp Asn Ala His Gly Asn Thr Pro Trp Gly Ser
980 985 990 980 985 990
Asp Phe His Met Asn Val Asn Leu Gln Met Asn Tyr Trp Pro Thr Tyr Asp Phe His Met Asn Val Asn Leu Gln Met Asn Tyr Trp Pro Thr Tyr
995 1000 1005 995 1000 1005
Ser Ala Asn Met Gly Glu Leu Ala Glu Pro Leu Ile Glu Tyr Val
1010 1015 1020
Glu Gly Leu Val Lys Pro Gly Arg Val Thr Ala Lys Val Tyr Ala
1025 1030 1035
Gly Ala Glu Thr Thr Asn Pro Glu Thr Thr Pro Ile Gly Glu Gly
1040 1045 1050
Glu Gly Tyr Met Ala His Thr Glu Asn Thr Ala Tyr Gly Trp Thr
1055 1060 1065
Ala Pro Gly Gln Ser Phe Ser Trp Gly Trp Ser Pro Ala Ala Val
1070 1075 1080
Pro Trp Ile Leu Gln Asn Val Tyr Glu Ala Tyr Glu Tyr Ser Gly
1085 1090 1095
Asp Pro Ala Leu Leu Asp Arg Val Tyr Ala Leu Leu Lys Glu Glu
1100 1105 1110
Ser His Phe Tyr Val Asn Tyr Met Leu His Lys Ala Gly Ser Ser
1115 1120 1125
Ser Gly Asp Arg Leu Thr Thr Gly Val Ala Tyr Ser Pro Glu Gln
1130 1135 1140
Gly Pro Leu Gly Thr Asp Gly Asn Thr Tyr Glu Ser Ser Leu Val
1145 1150 1155
Trp Gln Met Leu Asn Asp Ala Ile Glu Ala Ala Lys Ala Lys Gly
1160 1165 1170
Asp Pro Asp Gly Leu Val Gly Asn Thr Thr Asp Cys Ser Ala Asp
1175 1180 1185
Asn Trp Ala Lys Asn Asp Ser Gly Asn Phe Thr Asp Ala Asn Ala
1190 1195 1200
Asn Arg Ser Trp Ser Cys Ala Lys Ser Leu Leu Lys Pro Ile Glu
1205 1210 1215
Val Gly Asp Ser Gly Gln Ile Lys Glu Trp Tyr Phe Glu Gly Ala
1220 1225 1230
Leu Gly Lys Lys Lys Asp Gly Ser Thr Ile Ser Gly Tyr Gln Ala
1235 1240 1245
Asp Asn Gln His Arg His Met Ser His Leu Leu Gly Leu Phe Pro
1250 1255 1260
Gly Asp Leu Ile Thr Ile Asp Asn Ser Glu Tyr Met Asp Ala Ala
1265 1270 1275
Lys Thr Ser Leu Arg Tyr Arg Cys Phe Lys Gly Asn Val Leu Gln
1280 1285 1290
Ser Asn Thr Gly Trp Ala Ile Gly Gln Arg Ile Asn Ser Trp Ala
1295 1300 1305
Arg Thr Gly Asp Gly Asn Thr Thr Tyr Gln Leu Val Glu Leu Gln
1310 1315 1320
Leu Lys Asn Ala Met Tyr Ala Asn Leu Phe Asp Tyr His Ala Pro
1325 1330 1335
Phe Gln Ile Asp Gly Asn Phe Gly Asn Thr Ser Gly Val Asp Glu
1340 1345 1350
Met Leu Leu Gln Ser Asn Ser Thr Phe Thr Asp Thr Ala Gly Lys
1355 1360 1365
Lys Tyr Val Asn Tyr Thr Asn Ile Leu Pro Ala Leu Pro Asp Ala
1370 1375 1380
Trp Ala Gly Gly Ser Val Ser Gly Leu Val Ala Arg Gly Asn Phe
1385 1390 1395
Thr Val Gly Thr Thr Trp Lys Asn Gly Lys Ala Thr Glu Val Arg
1400 1405 1410
Leu Thr Ser Asn Lys Gly Lys Gln Ala Ala Val Lys Ile Thr Ala
1415 1420 1425
Gly Gly Ala Gln Asn Tyr Glu Val Lys Asn Gly Asp Thr Ala Val
1430 1435 1440
Asn Ala Lys Val Val Thr Asn Ala Asp Gly Ala Ser Leu Leu Val
1445 1450 1455
Phe Asp Thr Thr Ala Gly Thr Thr Tyr Thr Ile Thr Lys Lys Ala
1460 1465 1470
Ser Ala Asn Val Pro Val Thr Gly Val Thr Val Thr Gly Ala Asn
1475 1480 1485
Thr Ala Thr Ala Gly Asp Thr Val Thr Leu Thr Ala Thr Val Ala
1490 1495 1500
Pro Ala Asn Ala Thr Asp Lys Ser Val Thr Trp Ser Thr Ser Asp
1505 1510 1515
Ala Ala Val Ala Thr Val Asn Ala Asn Gly Val Val Thr Thr Lys
1520 1525 1530
Lys Ala Gly Lys Val Thr Ile Thr Ala Thr Ser Asn Gly Asp Lys
1535 1540 1545
Thr Lys Phe Gly Ser Ile Glu Ile Thr Val Ser Ala Ala Thr Val
1550 1555 1560
Pro Val Thr Ser Val Thr Val Ala Gly Asp Ala Ala Met Thr Val
1565 1570 1575
Asp Gly Glu Gln Thr Leu Thr Ala Thr Val Ala Pro Ala Thr Ala
1580 1585 1590
Thr Asp Lys Thr Val Thr Trp Lys Ser Ser Asp Ala Thr Val Ala
1595 1600 1605
Thr Val Asp Ala Asn Gly Lys Val Val Ala Lys Lys Ala Gly Glu
1610 1615 1620
Val Thr Ile Thr Ala Thr Ala Gly Gly Val Ser Gly Thr Leu Lys
1625 1630 1635
Ile Thr Val Ser Asp Lys Ala Pro Thr Val Ile Pro Val Gln Ser
1640 1645 1650
Val Thr Val Thr Gly Lys Gln Glu Leu Val Glu Gly Ala Ser Thr
1655 1660 1665
Thr Leu Thr Ala Thr Val Ala Pro Ala Asp Ala Thr Asp Lys Thr
1670 1675 1680
Val Thr Trp Lys Ser Ser Asp Glu Ser Val Ala Thr Val Asp Lys
1685 1690 1695
Asp Gly Val Val Thr Ala Lys Lys Ala Gly Thr Val Thr Ile Thr
1700 1705 1710
Ala Thr Ala Gly Gly Val Ser Gly Thr Leu His Ile Thr Val Thr
1715 1720 1725
Ala Lys Pro Val Glu Thr Val Pro Val Thr Ser Val Glu Val Thr
1730 1735 1740
Val Glu Ala Gly Thr Thr Val Ser Val Gly Lys Thr Leu Gln Ala
1745 1750 1755
Thr Ala Thr Val Lys Pro Gly Asn Ala Thr Asn Lys Lys Val Thr
1760 1765 1770
Trp Lys Ser Ser Asp Glu Ser Ile Ala Thr Val Asp Ala Asn Gly
1775 1780 1785
Val Ile Thr Ala Lys Lys Ala Gly Lys Val Val Ile Thr Ala Thr
1790 1795 1800
Ser Thr Asp Gly Thr Asp Lys Ser Gly Ser Val Glu Ile Thr Val
1805 1810 1815
Val Asp Glu Thr Lys Pro Thr Pro Asp His Lys Ser Val Lys Ala
1820 1825 1830
Asp Thr Gly Asp Val Thr Ala Gly Lys Thr Gly Thr Val Thr Glu
1835 1840 1845
Pro Lys Asp Val Ala Gly Trp Lys Ser Arg Ser Ile Ile Lys Gln
1850 1855 1860
Gly Lys Leu Gly Lys Ala Glu Ile Ala Asp Gly Thr Leu Val Tyr
1865 1870 1875
Ala Ala Gly Asp Lys Thr Gly Asp Asp Ser Phe Val Val Gln Tyr
1880 1885 1890
Thr Met Ala Asp Gly Thr Val Ile Asp Val Thr Tyr Ser Val Thr
1895 1900 1905
Val Lys Ala Ala Glu Thr Gly Lys Asn Asp Gly Asp Gly Lys Gly
1910 1915 1920
Asp Gly Val Ala Lys Thr Gly Ala Ala Val Gly Ala Leu Ala Gly
1925 1930 1935
Leu Gly Leu Met Leu Leu Ala Val Gly Val Ser Val Val Met Ile
1940 1945 1950
Arg Arg Lys His Ser Ala
1955
<210> 3
<211> 4482
<212> DNA
<213> Bifidobacterium bifidum
<400> 3
atgctacaca cagcatcaag aggatgctcg cgttcgtggc tgcgcagact caccgcattg 60
atagcggtct cggcgctcgc gttcgtggca ttgccgaacg tcgcggtggc ggcggatccg 120
atggaatacc tcgatgtgtc gttcggcggc acgttcgctg cagacaccta caccacaggt 180
ggcgacgagg tggcgaaggg ccccgtgacc aagcacggca gcataccgac caagcttgac 240
ggcggcggca tcaccctcgc tggcggcacc aacggcgtga cattcacctc gaccgcgagc 300
ttcagcgaga gtgggaaggt gaacaaggga ttccgcgccg aaatggagta ccgtacgacg 360
cagacgccca gcaacctcgc cacattgttc tccgccatgg gcaacatctt cgtgcgggcg 420
aacggcagca acctcgaata cggcttctcc acgaaccctt ccggcagtac atggaacgac 480
tacacaaagt ccgtgacgct gccttccaac aatgtgaagc acatcatcca gctgacatat 540
ctgccgggag ccgacggcgc tgcctcgacg ttgcagttgt cggtggatgg cgtggccggc 600
gagaccgcca cctccgcggc cggcgagctc gcggccgtca gcgattccgt cgggaacaag 660
ttcgggatcg gctacgaggt gaaccccgct tccggcgcgg cgagccgcgg tcttgccggt 720
gacgtgttcc gcgcgcgtgt cgccgattcg gacgccccgt gggagattct tgacgcatcc 780
cagctgctgc atgtcaattt caacggcacg ttcagcggca cctcatatac cgcggcgagc 840
ggcgagcaga tgctgggctc gctggtgtcg cgctcggcca atccgtccat ctcgaactcc 900
gccgtcacgc tgggcggcgg cacggccgga ttcgatttca cgcccacgga cttcaccctc 960
ggtgacaacg aggccatcac ccgcccgctg gtcgcggagc tgcgcttcac cccgacgcag 1020
accggcgaca accagaccct gttcggcgcg ggcggcaacc tgttcctgcg ctacgagtcg 1080
aacaagctcg tgttcggcgc ctccaccaag tccggcgata attggaccga ccacaagatc 1140
gagtccgcgg ccgccacggg tgcggagcac gtcgtgtcgg tggcgtacgt gcccaataag 1200
gccggcaccg gcgcgaagct tgtcatgcgc gtggatggcg gcgacgccca gaccaaggac 1260
atcactggtc tggcttacct gaattcgagc atcaagggca aggtcggctt cggcaacgac 1320
gtgcataccg acgcgctcag ccgcggcttc gtcggctcgc tgagcgagat ccgcctggcc 1380
gaaacctccg cgaacttcac caccaacgaa ttcaagctgg tctactctca ggtcagctgc 1440
gacacgtcgg gcatcaagga ggcgaatacc ttcgacgtgg agcccgccga gtgcgaggcc 1500
gcgcttaaga ccaagctgtc caagctgcgt ccgaccgaag ggcaggccga ctacatcgac 1560
tggggtcaga tcggattcct ccattacggc atcaacacgt actacaacca ggagtggggt 1620
cacggtaacg aggatccctc ccgcatcaac ccgaccggcc tcgacaccga ccagtgggcg 1680
aagtccttcg ccgacggtgg cttcaagatg atcatggtga cggtcaagca ccatgacggt 1740
ttcgagctgt acgactcgcg gtacaacacc gagcacgact gggcaaacac cgccgtcgcc 1800
aagcgcacgg gggagaagga cctgttccgc aagattgtcg cctcggcgaa gaaatacggc 1860
ctgaaggtcg gcatctacta ttcgccggcc gattcctaca tggagaggaa gggcgtctgg 1920
ggcaacaact ccgcacgcgt cgagcgcacg atccccacgc tggtggagaa cgacgaccgc 1980
gccggcaagg tggcttccgg caaactgccc acgttcaagt acaaggccac ggattacggc 2040
gcctacatgc tcaaccagct ctatgagctg ctgactgagt acggcgacat ctccgaggtc 2100
tggttcgacg gtgcccaagg caacaccgca ggcactgagc attacgacta tggcgtgttc 2160
tacgagatga tccgccggct tcagccccag gcaattcagg ccaacgccgc atacgatgcc 2220
cgatgggtgg gcaacgagga cggctgggcc cgtcagaccg agtggagccc gcaggcggca 2280
tacaacgacg gcgtggacaa ggtgtcgctc aagcctggcc agatggcccc cgacggtaag 2340
cttggcagca tgtcgagcgt gctgtccgag atccgcagcg gcgccgccaa ccagctgcac 2400
tggtatccgg ccgaagtcga cgccaagaac cggcccggat ggttctaccg tgccagccaa 2460
tcgccggcgt ccgtagccga agtcgtgaag tactacgagc agtccacggg acgcaactcg 2520
cagtatctgc tgaacgtccc accgtccgat accggcaagc tcgccgatgc ggatgccgcg 2580
ggacttaagg ggctgggcga ggagctcgcc cgacgctacg gcaccgatct tgccctgggc 2640
aagagcgcga ccgtcgccgc gtccgcgaac gacactgcgg tagcggcccc gaagctgacc 2700
gacggttcga agctctcctc cgacaaggcc gtgggcaata cgccgacgta caccatcgat 2760
ctgggcagca ctgtcgccgt ggatgcagtg aagatctccg aggacgtgcg caatgccggc 2820
cagcagatcg aaagcgccac tctgcaggga cgagtcaatg gaacatggac gaatctggcg 2880
actatgacga cggtcgggca gcagcgcgac cttcgcttca cgtcccagaa catcgatgcc 2940
atccgtctgg tggtcaactc ctcccgcggt ccggtgcgtc tgagccgtct tgaggtgttc 3000
cacaccgaat ccgagattca gaccggcgcc cgcgcctact acatcgatcc gacggcgcag 3060
accgcgggag atggattcac gaaggacaag cccatgacgt cgatcgagca gctgcacgat 3120
gtgaccgtcg cgccaggctc cgtgatcttc gtcaaggcgg gcaccgagct gaccggggac 3180
ttcgccgtct tcggctacgg caccaaggac gagcccatca ccgtgacgac atacggcgaa 3240
agcgacaaag ccaccaccgc gagcttcgac ggcatgaccg ccgggctgac gctgaagcag 3300
gcgctgaagg cgctcggcaa ggacgacgcc ggctgggtcg tggccgattc cgccactgca 3360
ccggcctccc gcgtgtatgt cccgcaggat gagatcagcg tgcacgccca gtcgtcgcag 3420
aactccggcg cagaggcggc gagggcgctc gacggcgact cgtcgacgag ctggcactcc 3480
cagtacagcc cgaccaccgc gtctgctccg cattgggtga ctctcgatct cggcaaatcg 3540
cgtgagaacg tcgcctactt cgactacctc gcccgtatcg acggcaacaa taacggtgcc 3600
gccaaggatt acgaggtgta tgtctccgac gatcccaacg attttggagc ccctgtggcc 3660
tcgggcacgt tgaagaacgt cgcctacacg cagcgcatca agctgacccc caagaacgga 3720
cggtacgtca agttcgtcat caagaccgat tattccggat cgaacttcgg ctccgcggcg 3780
gaaatgaatg tcgagttgct gcccacggcc gtagaggagg acaaggtcgc caccccgcag 3840
aagccgacag tggacgatga tgccgataca tacaccatcc ccgacatcga gggagtcgtg 3900
tacaaggtcg acggcaaggt gttggccgct ggttccgtag tgaacgtggg cgatgaggac 3960
gtgaccgtca cggtcaccgc cgagcccgcc gacggatacc gcttcccgga tggtgtgacg 4020
tccccagtca cgtatgagct gacgttcacc aagaagggtg gcgagaagcc tccgaccgaa 4080
gtcaacaagg acaagctgca cgccacgatc accaaggctc aggcgatcga ccgttccgcc 4140
tatacggacg agtcgctcaa ggtgcttgat gacaagctcg ccgcagcgct caaggtctat 4200
gacgatgaca aggtgagcca ggatgatgtc gatgccgccg aggcggctct gtctgcggcg 4260
atcgacgcgc tgaagaccaa gccgacgacc cccggcggtg aaggtgagaa gcctggtgaa 4320
ggtgaaaagc ccggtgacgg caacaagccc ggtgacggca agaagcccgg cgacgtgatc 4380
gcaaagaccg gcgcctccac aatgggcgtt gtcttcgctg cactcgcgat ggtagcgggt 4440
gcggtcgtga cgcttgaagc caagcgtaag tccaaccggt aa 4482
<210> 4
<211> 1493
<212> PRT
<213> Bifidobacterium bifidum
<400> 4
Met Leu His Thr Ala Ser Arg Gly Cys Ser Arg Ser Trp Leu Arg Arg
1 5 10 15
Leu Thr Ala Leu Ile Ala Val Ser Ala Leu Ala Phe Val Ala Leu Pro
20 25 30
Asn Val Ala Val Ala Ala Asp Pro Met Glu Tyr Leu Asp Val Ser Phe
35 40 45
Gly Gly Thr Phe Ala Ala Asp Thr Tyr Thr Thr Gly Gly Asp Glu Val
50 55 60
Ala Lys Gly Pro Val Thr Lys His Gly Ser Ile Pro Thr Lys Leu Asp
65 70 75 80
Gly Gly Gly Ile Thr Leu Ala Gly Gly Thr Asn Gly Val Thr Phe Thr
85 90 95
Ser Thr Ala Ser Phe Ser Glu Ser Gly Lys Val Asn Lys Gly Phe Arg
100 105 110
Ala Glu Met Glu Tyr Arg Thr Thr Gln Thr Pro Ser Asn Leu Ala Thr
115 120 125
Leu Phe Ser Ala Met Gly Asn Ile Phe Val Arg Ala Asn Gly Ser Asn
130 135 140
Leu Glu Tyr Gly Phe Ser Thr Asn Pro Ser Gly Ser Thr Trp Asn Asp
145 150 155 160
Tyr Thr Lys Ser Val Thr Leu Pro Ser Asn Asn Val Lys His Ile Ile
165 170 175
Gln Leu Thr Tyr Leu Pro Gly Ala Asp Gly Ala Ala Ser Thr Leu Gln
180 185 190
Leu Ser Val Asp Gly Val Ala Gly Glu Thr Ala Thr Ser Ala Ala Gly
195 200 205
Glu Leu Ala Ala Val Ser Asp Ser Val Gly Asn Lys Phe Gly Ile Gly
210 215 220
Tyr Glu Val Asn Pro Ala Ser Gly Ala Ala Ser Arg Gly Leu Ala Gly
225 230 235 240
Asp Val Phe Arg Ala Arg Val Ala Asp Ser Asp Ala Pro Trp Glu Ile
245 250 255
Leu Asp Ala Ser Gln Leu Leu His Val Asn Phe Asn Gly Thr Phe Ser
260 265 270
Gly Thr Ser Tyr Thr Ala Ala Ser Gly Glu Gln Met Leu Gly Ser Leu
275 280 285
Val Ser Arg Ser Ala Asn Pro Ser Ile Ser Asn Ser Ala Val Thr Leu
290 295 300
Gly Gly Gly Thr Ala Gly Phe Asp Phe Thr Pro Thr Asp Phe Thr Leu
305 310 315 320
Gly Asp Asn Glu Ala Ile Thr Arg Pro Leu Val Ala Glu Leu Arg Phe
325 330 335
Thr Pro Thr Gln Thr Gly Asp Asn Gln Thr Leu Phe Gly Ala Gly Gly
340 345 350
Asn Leu Phe Leu Arg Tyr Glu Ser Asn Lys Leu Val Phe Gly Ala Ser
355 360 365
Thr Lys Ser Gly Asp Asn Trp Thr Asp His Lys Ile Glu Ser Ala Ala
370 375 380
Ala Thr Gly Ala Glu His Val Val Ser Val Ala Tyr Val Pro Asn Lys
385 390 395 400
Ala Gly Thr Gly Ala Lys Leu Val Met Arg Val Asp Gly Gly Asp Ala
405 410 415
Gln Thr Lys Asp Ile Thr Gly Leu Ala Tyr Leu Asn Ser Ser Ile Lys
420 425 430
Gly Lys Val Gly Phe Gly Asn Asp Val His Thr Asp Ala Leu Ser Arg
435 440 445
Gly Phe Val Gly Ser Leu Ser Glu Ile Arg Leu Ala Glu Thr Ser Ala
450 455 460
Asn Phe Thr Thr Asn Glu Phe Lys Leu Val Tyr Ser Gln Val Ser Cys
465 470 475 480
Asp Thr Ser Gly Ile Lys Glu Ala Asn Thr Phe Asp Val Glu Pro Ala
485 490 495
Glu Cys Glu Ala Ala Leu Lys Thr Lys Leu Ser Lys Leu Arg Pro Thr
500 505 510
Glu Gly Gln Ala Asp Tyr Ile Asp Trp Gly Gln Ile Gly Phe Leu His
515 520 525
Tyr Gly Ile Asn Thr Tyr Tyr Asn Gln Glu Trp Gly His Gly Asn Glu
530 535 540
Asp Pro Ser Arg Ile Asn Pro Thr Gly Leu Asp Thr Asp Gln Trp Ala
545 550 555 560
Lys Ser Phe Ala Asp Gly Gly Phe Lys Met Ile Met Val Thr Val Lys
565 570 575
His His Asp Gly Phe Glu Leu Tyr Asp Ser Arg Tyr Asn Thr Glu His
580 585 590
Asp Trp Ala Asn Thr Ala Val Ala Lys Arg Thr Gly Glu Lys Asp Leu
595 600 605
Phe Arg Lys Ile Val Ala Ser Ala Lys Lys Tyr Gly Leu Lys Val Gly
610 615 620
Ile Tyr Tyr Ser Pro Ala Asp Ser Tyr Met Glu Arg Lys Gly Val Trp
625 630 635 640
Gly Asn Asn Ser Ala Arg Val Glu Arg Thr Ile Pro Thr Leu Val Glu
645 650 655
Asn Asp Asp Arg Ala Gly Lys Val Ala Ser Gly Lys Leu Pro Thr Phe
660 665 670
Lys Tyr Lys Ala Thr Asp Tyr Gly Ala Tyr Met Leu Asn Gln Leu Tyr
675 680 685
Glu Leu Leu Thr Glu Tyr Gly Asp Ile Ser Glu Val Trp Phe Asp Gly
690 695 700
Ala Gln Gly Asn Thr Ala Gly Thr Glu His Tyr Asp Tyr Gly Val Phe
705 710 715 720
Tyr Glu Met Ile Arg Arg Leu Gln Pro Gln Ala Ile Gln Ala Asn Ala
725 730 735
Ala Tyr Asp Ala Arg Trp Val Gly Asn Glu Asp Gly Trp Ala Arg Gln
740 745 750
Thr Glu Trp Ser Pro Gln Ala Ala Tyr Asn Asp Gly Val Asp Lys Val
755 760 765
Ser Leu Lys Pro Gly Gln Met Ala Pro Asp Gly Lys Leu Gly Ser Met
770 775 780
Ser Ser Val Leu Ser Glu Ile Arg Ser Gly Ala Ala Asn Gln Leu His
785 790 795 800
Trp Tyr Pro Ala Glu Val Asp Ala Lys Asn Arg Pro Gly Trp Phe Tyr
805 810 815
Arg Ala Ser Gln Ser Pro Ala Ser Val Ala Glu Val Val Lys Tyr Tyr
820 825 830
Glu Gln Ser Thr Gly Arg Asn Ser Gln Tyr Leu Leu Asn Val Pro Pro
835 840 845
Ser Asp Thr Gly Lys Leu Ala Asp Ala Asp Ala Ala Gly Leu Lys Gly
850 855 860
Leu Gly Glu Glu Leu Ala Arg Arg Tyr Gly Thr Asp Leu Ala Leu Gly
865 870 875 880
Lys Ser Ala Thr Val Ala Ala Ser Ala Asn Asp Thr Ala Val Ala Ala
885 890 895
Pro Lys Leu Thr Asp Gly Ser Lys Leu Ser Ser Asp Lys Ala Val Gly
900 905 910
Asn Thr Pro Thr Tyr Thr Ile Asp Leu Gly Ser Thr Val Ala Val Asp
915 920 925
Ala Val Lys Ile Ser Glu Asp Val Arg Asn Ala Gly Gln Gln Ile Glu
930 935 940
Ser Ala Thr Leu Gln Gly Arg Val Asn Gly Thr Trp Thr Asn Leu Ala
945 950 955 960
Thr Met Thr Thr Val Gly Gln Gln Arg Asp Leu Arg Phe Thr Ser Gln
965 970 975
Asn Ile Asp Ala Ile Arg Leu Val Val Asn Ser Ser Arg Gly Pro Val
980 985 990
Arg Leu Ser Arg Leu Glu Val Phe His Thr Glu Ser Glu Ile Gln Thr
995 1000 1005
Gly Ala Arg Ala Tyr Tyr Ile Asp Pro Thr Ala Gln Thr Ala Gly
1010 1015 1020
Asp Gly Phe Thr Lys Asp Lys Pro Met Thr Ser Ile Glu Gln Leu
1025 1030 1035
His Asp Val Thr Val Ala Pro Gly Ser Val Ile Phe Val Lys Ala
1040 1045 1050
Gly Thr Glu Leu Thr Gly Asp Phe Ala Val Phe Gly Tyr Gly Thr
1055 1060 1065
Lys Asp Glu Pro Ile Thr Val Thr Thr Tyr Gly Glu Ser Asp Lys
1070 1075 1080
Ala Thr Thr Ala Ser Phe Asp Gly Met Thr Ala Gly Leu Thr Leu
1085 1090 1095
Lys Gln Ala Leu Lys Ala Leu Gly Lys Asp Asp Ala Gly Trp Val
1100 1105 1110
Val Ala Asp Ser Ala Thr Ala Pro Ala Ser Arg Val Tyr Val Pro
1115 1120 1125
Gln Asp Glu Ile Ser Val His Ala Gln Ser Ser Gln Asn Ser Gly
1130 1135 1140
Ala Glu Ala Ala Arg Ala Leu Asp Gly Asp Ser Ser Thr Ser Trp
1145 1150 1155
His Ser Gln Tyr Ser Pro Thr Thr Ala Ser Ala Pro His Trp Val
1160 1165 1170
Thr Leu Asp Leu Gly Lys Ser Arg Glu Asn Val Ala Tyr Phe Asp
1175 1180 1185
Tyr Leu Ala Arg Ile Asp Gly Asn Asn Asn Gly Ala Ala Lys Asp
1190 1195 1200
Tyr Glu Val Tyr Val Ser Asp Asp Pro Asn Asp Phe Gly Ala Pro
1205 1210 1215
Val Ala Ser Gly Thr Leu Lys Asn Val Ala Tyr Thr Gln Arg Ile
1220 1225 1230
Lys Leu Thr Pro Lys Asn Gly Arg Tyr Val Lys Phe Val Ile Lys
1235 1240 1245
Thr Asp Tyr Ser Gly Ser Asn Phe Gly Ser Ala Ala Glu Met Asn
1250 1255 1260
Val Glu Leu Leu Pro Thr Ala Val Glu Glu Asp Lys Val Ala Thr
1265 1270 1275
Pro Gln Lys Pro Thr Val Asp Asp Asp Ala Asp Thr Tyr Thr Ile
1280 1285 1290
Pro Asp Ile Glu Gly Val Val Tyr Lys Val Asp Gly Lys Val Leu
1295 1300 1305 1295 1300 1305
Ala Ala Gly Ser Val Val Asn Val Gly Asp Glu Asp Val Thr Val Ala Ala Gly Ser Val Val Asn Val Gly Asp Glu Asp Val Thr Val
1310 1315 1320 1310 1315 1320
Thr Val Thr Ala Glu Pro Ala Asp Gly Tyr Arg Phe Pro Asp Gly Thr Val Thr Ala Glu Pro Ala Asp Gly Tyr Arg Phe Pro Asp Gly
1325 1330 1335 1325 1330 1335
Val Thr Ser Pro Val Thr Tyr Glu Leu Thr Phe Thr Lys Lys Gly Val Thr Ser Pro Val Thr Tyr Glu Leu Thr Phe Thr Lys Lys Gly
1340 1345 1350 1340 1345 1350
Gly Glu Lys Pro Pro Thr Glu Val Asn Lys Asp Lys Leu His Ala Gly Glu Lys Pro Pro Thr Glu Val Asn Lys Asp Lys Leu His Ala
1355 1360 1365 1355 1360 1365
Thr Ile Thr Lys Ala Gln Ala Ile Asp Arg Ser Ala Tyr Thr Asp Thr Ile Thr Lys Ala Gln Ala Ile Asp Arg Ser Ala Tyr Thr Asp
1370 1375 1380 1370 1375 1380
Glu Ser Leu Lys Val Leu Asp Asp Lys Leu Ala Ala Ala Leu Lys Glu Ser Leu Lys Val Leu Asp Asp Lys Leu Ala Ala Ala Leu Lys
1385 1390 1395 1385 1390 1395
Val Tyr Asp Asp Asp Lys Val Ser Gln Asp Asp Val Asp Ala Ala Val Tyr Asp Asp Asp Lys Val Ser Gln Asp Asp Val Asp Ala Ala
1400 1405 1410 1400 1405 1410
Glu Ala Ala Leu Ser Ala Ala Ile Asp Ala Leu Lys Thr Lys Pro Glu Ala Ala Leu Ser Ala Ala Ile Asp Ala Leu Lys Thr Lys Pro
1415 1420 1425 1415 1420 1425
Thr Thr Pro Gly Gly Glu Gly Glu Lys Pro Gly Glu Gly Glu Lys Thr Thr Pro Gly Gly Glu Gly Glu Lys Pro Gly Glu Gly Glu Lys
1430 1435 1440 1430 1435 1440
Pro Gly Asp Gly Asn Lys Pro Gly Asp Gly Lys Lys Pro Gly Asp Pro Gly Asp Gly Asn Lys Pro Gly Asp Gly Lys Lys Pro Gly Asp
1445 1450 1455 1445 1450 1455
Val Ile Ala Lys Thr Gly Ala Ser Thr Met Gly Val Val Phe Ala Val Ile Ala Lys Thr Gly Ala Ser Thr Met Gly Val Val Phe Ala
1460 1465 1470 1460 1465 1470
Ala Leu Ala Met Val Ala Gly Ala Val Val Thr Leu Glu Ala Lys Ala Leu Ala Met Val Ala Gly Ala Val Val Thr Leu Glu Ala Lys
1475 1480 1485 1475 1480 1485
Arg Lys Ser Asn Arg Arg Lys Ser Asn Arg
1490 1490
<210> 5
<211> 2094
<212> DNA
<213> Streptococcus pneumoniae
<400> 5
atgaataaaa gaggtcttta ttcaaaacta ggaatttctg ttgtaggcat tagtctttta 60
atgggagtcc ccactttgat tcatgcgaat gaattaaact atggtcaact gtccatatct 120
cctatttttc aaggaggttc atatcaactg aacaataaga gtatagatat cagctctttg 180
ttattagata aattgtctgg agagagtcag acagtagtaa tgaaatttaa agcagataaa 240
ccaaactctc ttcaagcttt gtttggccta tctaatagta aagcaggctt taaaaataat 300
tacttttcaa ttttcatgag agattctggt gagataggtg tagaaataag agacgcccaa 360
gagggaataa attatttatt ttctagacca gcttcattat ggggaaagca taaaggacag 420
gcagttgaaa atacactagt atttgtatct gattctaaag ataaaacata cacaatgtat 480
gttaatggaa tagaagtgtt ctctgaaaca gttgatacat ttttgccaat ttcaaatata 540
aatggtatag ataaggcaac actaggagct gttaatcgtg aaggtaagga acattacctc 600
gcaaaaggaa gtattggtga aatcagtcta tttaacaaag caattagtga tcaggaagtt 660
tcaaatattc ccttgtcaaa tccatttcag ttaattttcc aatcaggaga ttctactcaa 720
gctaactatt ttagaatacc gacactatat acattaagta gtggaagagt tctatcaagt 780
attgatgcac gttatggtgg gactcatgat tctaaaagta agattaatat tgccacttct 840
tatagtgatg ataatgggaa aacgtggagt gagccaattt ttgctatgaa gtttaatgac 900
tatgaggagc agttagttta ctggccacga gataataaat taaagaatag tcaaattagt 960
ggaagtgctt cattcataga ttcatccatt gttgaagata aaaaatctgg gaaaacgata 1020
ttactagctg atgttatgcc tgcgggtatt ggaaataata atgcaaataa agccgactca 1080
ggttttaaag aaataaatgg tcattattat ttaaaactaa agaagaatgg agataacgat 1140
ttccgttata cagttagaga aaatggtgtc gtttatgatg aaacaactaa taaacctaca 1200
aattatacta taaatgataa gtatgaagtt ttggagggag gaaagtcttt aacagtcgaa 1260
caatattcgg ttgattttga tagtggctct ttaagagaaa ggcataatgg aaaacaggtt 1320
cctatgaatg ttttctacaa agattcgtta tttaaagtga ctcctactaa ttatatagca 1380
atgacaacta gtcagaatag aggagagagt tgggaacaat ttaagttgtt gcctccgttc 1440
ttaggagaaa aacataatgg aacttacttg tgtcctggac aaggtttagc attaaaatca 1500
agtaacagat tgatttttgc aacatatact agtggagaac taacctatct catttcggat 1560
gatagtggtc aaacatggaa gaaatcctca gcttcaattc cgtttaaaaa tgcaacagca 1620
gaagcacaaa tggttgaact gagagatggt gtgattagaa cattctttag aaccactaca 1680
ggtaagatag cttatatgac tagtagagat tctggagaaa catggtcgaa agtttcgtat 1740
attgatggaa ttcaacaaac ttcatatggc acacaagtat ctgcaattaa atactctcaa 1800
ttaattgatg gaaaagaagc agtcattttg agtacaccaa attctagaag tggccgtaag 1860
ggaggccaat tagttgtcgg tttggtcaat aaagaagatg atagtattga ttggagatac 1920
cactatgata ttgatttgcc ttcgtatggt tatgcctatt ctgcgattac agaattgcca 1980
aatcatcaca taggtgtact gtttgaaaaa tatgattcgt ggtcgagaaa tgaattgcat 2040
ttaagcaatg tagttcagta tatagatttg gaaattaatg atttaacaaa ataa 2094
<210> 6
<211> 697
<212> PRT
<213> Streptococcus pneumoniae
<400> 6
Met Asn Lys Arg Gly Leu Tyr Ser Lys Leu Gly Ile Ser Val Val Gly
1 5 10 15
Ile Ser Leu Leu Met Gly Val Pro Thr Leu Ile His Ala Asn Glu Leu
20 25 30
Asn Tyr Gly Gln Leu Ser Ile Ser Pro Ile Phe Gln Gly Gly Ser Tyr
35 40 45
Gln Leu Asn Asn Lys Ser Ile Asp Ile Ser Ser Leu Leu Leu Asp Lys
50 55 60
Leu Ser Gly Glu Ser Gln Thr Val Val Met Lys Phe Lys Ala Asp Lys
65 70 75 80
Pro Asn Ser Leu Gln Ala Leu Phe Gly Leu Ser Asn Ser Lys Ala Gly
85 90 95
Phe Lys Asn Asn Tyr Phe Ser Ile Phe Met Arg Asp Ser Gly Glu Ile
100 105 110
Gly Val Glu Ile Arg Asp Ala Gln Lys Gly Ile Asn Tyr Leu Phe Ser
115 120 125
Arg Pro Ala Ser Leu Trp Gly Lys His Lys Gly Gln Ala Val Glu Asn
130 135 140
Thr Leu Val Phe Val Ser Asp Ser Lys Asp Lys Thr Tyr Thr Met Tyr
145 150 155 160
Val Asn Gly Ile Glu Val Phe Ser Glu Thr Val Asp Thr Phe Leu Pro
165 170 175
Ile Ser Asn Ile Asn Gly Ile Asp Lys Ala Thr Leu Gly Ala Val Asn
180 185 190
Arg Glu Gly Lys Glu His Tyr Leu Ala Lys Gly Ser Ile Asp Glu Ile
195 200 205
Ser Leu Phe Asn Lys Ala Ile Ser Asp Gln Glu Val Ser Thr Ile Pro
210 215 220
Leu Ser Asn Pro Phe Gln Leu Ile Phe Gln Ser Gly Asp Ser Thr Gln
225 230 235 240
Ala Asn Tyr Phe Arg Ile Pro Thr Leu Tyr Thr Leu Ser Ser Gly Arg
245 250 255
Val Leu Ser Ser Ile Asp Ala Arg Tyr Gly Gly Thr His Asp Ser Lys
260 265 270
Ser Lys Ile Asn Ile Ala Thr Ser Tyr Ser Asp Asp Asn Gly Lys Thr
275 280 285
Trp Ser Glu Pro Ile Phe Ala Met Lys Phe Asn Asp Tyr Glu Glu Gln
290 295 300
Leu Val Tyr Trp Pro Arg Asp Asn Lys Leu Lys Asn Ser Gln Ile Ser
305 310 315 320
Gly Ser Ala Ser Phe Ile Asp Ser Ser Ile Val Glu Asp Lys Lys Ser
325 330 335
Gly Lys Thr Ile Leu Leu Ala Asp Val Met Pro Ala Gly Ile Gly Asn
340 345 350
Asn Asn Ala Asn Lys Ala Asp Ser Gly Phe Lys Glu Ile Asn Gly His
355 360 365
Tyr Tyr Leu Lys Leu Lys Lys Asn Gly Asp Asn Asp Phe Arg Tyr Thr
370 375 380
Val Arg Glu Asn Gly Val Val Tyr Asn Glu Thr Thr Asn Lys Pro Thr
385 390 395 400
Asn Tyr Thr Ile Asn Asp Lys Tyr Glu Val Leu Glu Gly Gly Lys Ser
405 410 415
Leu Thr Val Glu Gln Tyr Ser Val Asp Phe Asp Ser Gly Ser Leu Arg
420 425 430
Glu Arg His Asn Gly Lys Gln Val Pro Met Asn Val Phe Tyr Lys Asp
435 440 445
Ser Leu Phe Lys Val Thr Pro Thr Asn Tyr Ile Ala Met Thr Thr Ser
450 455 460
Gln Asn Arg Gly Glu Ser Trp Glu Gln Phe Lys Leu Leu Pro Pro Phe
465 470 475 480
Leu Gly Glu Lys His Asn Gly Thr Tyr Leu Cys Pro Gly Gln Gly Leu
485 490 495
Ala Leu Lys Ser Ser Asn Arg Leu Ile Phe Ala Thr Tyr Thr Ser Gly
500 505 510
Glu Leu Thr Tyr Leu Ile Ser Asp Asp Ser Gly Gln Thr Trp Lys Lys
515 520 525
Ser Ser Ala Ser Ile Pro Phe Lys Asn Ala Thr Ala Glu Ala Gln Met
530 535 540
Val Glu Leu Arg Asp Gly Val Ile Arg Thr Phe Phe Arg Thr Thr Thr
545 550 555 560
Gly Lys Ile Ala Tyr Met Thr Ser Arg Asp Ser Gly Glu Thr Trp Ser
565 570 575
Lys Val Ser Tyr Ile Asp Gly Ile Gln Gln Thr Ser Tyr Gly Thr Gln
580 585 590
Val Ser Ala Ile Lys Tyr Ser Gln Leu Ile Asp Gly Lys Glu Ala Val
595 600 605
Ile Leu Ser Thr Pro Asn Ser Arg Ser Gly Arg Lys Gly Gly Gln Leu
610 615 620
Val Val Gly Leu Val Asn Lys Glu Asp Asp Ser Ile Asp Trp Lys Tyr
625 630 635 640
His Tyr Asp Ile Asp Leu Pro Ser Tyr Gly Tyr Ala Tyr Ser Ala Ile
645 650 655
Thr Glu Leu Pro Asn His His Ile Gly Val Leu Phe Glu Lys Tyr Asp
660 665 670
Ser Trp Ser Arg Asn Glu Leu His Leu Ser Asn Val Val Gln Tyr Ile
675 680 685
Asp Leu Glu Ile Asn Asp Leu Thr Lys
690 695
<210> 7
<211> 2076
<212> DNA
<213> Bifidobacterium longum
<400> 7
atggaacata gagcgttcaa gtggccgcag ccacttgcgg gcaacaagcc ccgcatctgg 60
tacggcggcg attacaaccc cgaccaatgg cctgaggaag tgtgggacga agatgtagcc 120
ctcatgcagc aggccggcgt caacctcgtc tccgtagcca tcttctcctg ggccaagctt 180
gagcccgaag aaggcgtgta cgacttcgat tggctcgacc gcgtcatcga caagctcggc 240
aaggccggca tcgccgtcga tctcgcctcc ggcaccgcat ccccgccgat gtggatgacc 300
caggcccacc cggagatcct ctgggtcgac taccgcggcg acgtctgcca gcccggtgcc 360
cgccagcact ggcgcgccac cagcccggtc ttccttgact acgcgctcaa cctgtgccgc 420
aagatggccg agcactacaa ggacaacccc tatgtggtct cttggcatgt gagcaacgag 480
tacggctgcc acaaccgctt cgactattcc gaagacgccg agcgcgcctt ccagaagtgg 540
tgcgagaaga agtacggcac catcgacgct gtcaacgacg cctggggcac cgccttctgg 600
gcgcagcgca tgaacaattt ctccgagatc atcccgccgc gattcatcgg cgacggcaac 660
ttcatgaacc cgggcaagct gcttgattgg aagcgtttca gctccgacgc gctgctggac 720
ttctacaagg ccgagcgcga cgccctgctc gagatcgccc ccaagccgca gaccaccaac 780
ttcatggtct ccgcgggctg caccgtcctc gactacgaca agtggggtca tgacgtggac 840
ttcgtgtcca acgaccatta cttctcgccc ggcgaggccc acttcgacga gatggcctac 900
gcggcctgcc tcaccgacgg catcgcccgc aagaacccgt ggttcctcat ggaacattcc 960
acgtccgccg tcaactggcg cccgaccaac taccggctcg agcccggcga gctggtgcgc 1020
gactccctgg cccatctggc catgggcgcc gacgccatct gctacttcca gtggcgtcag 1080
tccaaggccg gcgccgagaa gtggcattcc gccatggtgc cccacgcagg ccccgactcc 1140
cagatcttcc gcgatgtgtg cgagctgggt gccgacctca acaagcttgc tgacgagggc 1200
ctgctgagca ccaagctggt caagtccaag gtcgccatcg tcttcgacta cgagtcccag 1260
tgggccaccg agcacaccgc cacccccacg caggaggtgc gccactggac cgagccgctg 1320
gactggttcc gcgcgctggc ggacaatggc ctgaccgccg acgtggtgcc ggtccgcggt 1380
ccttgggatg agtacgaggc cgtcgtgttg ccgagcctgg ccatcctgtc cgagcagacc 1440
acgcgccgcg tgcgcgagta tgtggcgaac ggcggcaagc tgttcgtgac ctactacacc 1500
ggtctggtgg acgacaggga tcacgtctgg ctgggcggct accccggctc cattcgcgac 1560
gtggtgggcg tgcgcgtcga ggaattcgcc ccgatgggca ccgacgcccc cggcaccatg 1620
gaccaccttg acttggacaa cggaaccgtg gcgcacgatt tcgccgacgt gatcacctcc 1680
gtggccgata ccgctcacgt ggtcgcctcc ttcaaggcag ataagtggac cggtttcgac 1740
ggcgctcccg ccatcaccgt caacgacttc ggcgacggca aggccgcata cgtcggtgcc 1800
cgtctcgggc gtgagggctt ggccaagagc ctgcccgcgc tgctggagga actcggcatc 1860
gagacttcgg ctgaggacga tcgtggtgaa gtgctgcgcg tcgagcgtgc ggacgaaact 1920
ggcgagaacc acttcgtgtt cctgttcaac cgcacccacg atgttgcggt cgtggacgtg 1980
gaaggcgaac cgctggtcgc ctcgctggcc caggtcaacg agtccgagca cacggccgcc 2040
atccagccca acggcgtact cgtcgtcaag ctgtaa 2076
<210> 8
<211> 691
<212> PRT
<213> Bifidobacterium longum
<400> 8
Met Glu His Arg Ala Phe Lys Trp Pro Gln Pro Leu Ala Gly Asn Lys
1 5 10 15
Pro Arg Ile Trp Tyr Gly Gly Asp Tyr Asn Pro Asp Gln Trp Pro Glu
20 25 30
Glu Val Trp Asp Glu Asp Val Ala Leu Met Gln Gln Ala Gly Val Asn
35 40 45
Leu Val Ser Val Ala Ile Phe Ser Trp Ala Lys Leu Glu Pro Glu Glu
50 55 60
Gly Val Tyr Asp Phe Asp Trp Leu Asp Arg Val Ile Asp Lys Leu Gly
65 70 75 80
Lys Ala Gly Ile Ala Val Asp Leu Ala Ser Gly Thr Ala Ser Pro Pro
85 90 95
Met Trp Met Thr Gln Ala His Pro Glu Ile Leu Trp Val Asp Tyr Arg
100 105 110
Gly Asp Val Cys Gln Pro Gly Ala Arg Gln His Trp Arg Ala Thr Ser
115 120 125
Pro Val Phe Leu Asp Tyr Ala Leu Asn Leu Cys Arg Lys Met Ala Glu
130 135 140
His Tyr Lys Asp Asn Pro Tyr Val Val Ser Trp His Val Ser Asn Glu
145 150 155 160
Tyr Gly Cys His Asn Arg Phe Asp Tyr Ser Glu Asp Ala Glu Arg Ala
165 170 175
Phe Gln Lys Trp Cys Glu Lys Lys Tyr Gly Thr Ile Asp Ala Val Asn
180 185 190
Asp Ala Trp Gly Thr Ala Phe Trp Ala Gln Arg Met Asn Asn Phe Ser
195 200 205
Glu Ile Ile Pro Pro Arg Phe Ile Gly Asp Gly Asn Phe Met Asn Pro
210 215 220
Gly Lys Leu Leu Asp Trp Lys Arg Phe Ser Ser Asp Ala Leu Leu Asp
225 230 235 240
Phe Tyr Lys Ala Glu Arg Asp Ala Leu Leu Glu Ile Ala Pro Lys Pro
245 250 255
Gln Thr Thr Asn Phe Met Val Ser Ala Gly Cys Thr Val Leu Asp Tyr
260 265 270
Asp Lys Trp Gly His Asp Val Asp Phe Val Ser Asn Asp His Tyr Phe
275 280 285
Ser Pro Gly Glu Ala His Phe Asp Glu Met Ala Tyr Ala Ala Cys Leu
290 295 300
Thr Asp Gly Ile Ala Arg Lys Asn Pro Trp Phe Leu Met Glu His Ser
305 310 315 320
Thr Ser Ala Val Asn Trp Arg Pro Thr Asn Tyr Arg Leu Glu Pro Gly
325 330 335
Glu Leu Val Arg Asp Ser Leu Ala His Leu Ala Met Gly Ala Asp Ala
340 345 350
Ile Cys Tyr Phe Gln Trp Arg Gln Ser Lys Ala Gly Ala Glu Lys Trp
355 360 365
His Ser Ala Met Val Pro His Ala Gly Pro Asp Ser Gln Ile Phe Arg
370 375 380
Asp Val Cys Glu Leu Gly Ala Asp Leu Asn Lys Leu Ala Asp Glu Gly
385 390 395 400
Leu Leu Ser Thr Lys Leu Val Lys Ser Lys Val Ala Ile Val Phe Asp
405 410 415
Tyr Glu Ser Gln Trp Ala Thr Glu His Thr Ala Thr Pro Thr Gln Glu
420 425 430
Val Arg His Trp Thr Glu Pro Leu Asp Trp Phe Arg Ala Leu Ala Asp
435 440 445
Asn Gly Leu Thr Ala Asp Val Val Pro Val Arg Gly Pro Trp Asp Glu
450 455 460
Tyr Glu Ala Val Val Leu Pro Ser Leu Ala Ile Leu Ser Glu Gln Thr
465 470 475 480
Thr Arg Arg Val Arg Glu Tyr Val Ala Asn Gly Gly Lys Leu Phe Val
485 490 495
Thr Tyr Tyr Thr Gly Leu Val Asp Asp Arg Asp His Val Trp Leu Gly
500 505 510
Gly Tyr Pro Gly Ser Ile Arg Asp Val Val Gly Val Arg Val Glu Glu
515 520 525
Phe Ala Pro Met Gly Thr Asp Ala Pro Gly Thr Met Asp His Leu Asp
530 535 540
Leu Asp Asn Gly Thr Val Ala His Asp Phe Ala Asp Val Ile Thr Ser
545 550 555 560
Val Ala Asp Thr Ala His Val Val Ala Ser Phe Lys Ala Asp Lys Trp
565 570 575
Thr Gly Phe Asp Gly Ala Pro Ala Ile Thr Val Asn Asp Phe Gly Asp
580 585 590
Gly Lys Ala Ala Tyr Val Gly Ala Arg Leu Gly Arg Glu Gly Leu Ala
595 600 605
Lys Ser Leu Pro Ala Leu Leu Glu Glu Leu Gly Ile Glu Thr Ser Ala
610 615 620
Glu Asp Asp Arg Gly Glu Val Leu Arg Val Glu Arg Ala Asp Glu Thr
625 630 635 640
Gly Glu Asn His Phe Val Phe Leu Phe Asn Arg Thr His Asp Val Ala
645 650 655
Val Val Asp Val Glu Gly Glu Pro Leu Val Ala Ser Leu Ala Gln Val
660 665 670
Asn Glu Ser Glu His Thr Ala Ala Ile Gln Pro Asn Gly Val Leu Val
675 680 685
Val Lys Leu
690
<210> 9
<211> 1626
<212> DNA
<213> Clostridium thermocellum
<400> 9
atggcagaag gggttatagt caacggaact cagtttaaag acacatcggg aaatgtgata 60
catgcccatg ggggaggcat gttaaagcat ggtgactatt attactggta cggtgaatac 120
cgggacgact ccaacttgtt tttgggtgta agttgctaca ggtcaaaaga tcttgtaaac 180
tgggaataca gaggagaagt gctgagccga aattccgctc ctgaactgaa tcactgcaat 240
attgaaagac cgaaagtcat gtacaacgca tcaaccggtg aatttgtcat gtggatgcac 300
tgggagaacg gcataaacta cggtcaggca agagcagctg ttgcgtattc caaaacgccc 360
gacggcaaat tcacatacat tcgaagcttt cgtcccatgc aggataccgg cgttatggat 420
catggccttc cgggatatat gtcaagggac tgcaatgtat ttgtggacac tgacggcaag 480
ggatatttta tatccgcagc caatgagaac atggacctgc acctttatga gctgacacct 540
gactataaaa atattgcatc ccttaaggca aagctgtttg tcggacagca gagggaagca 600
ccatgcctta taaagagaaa cggctactat taccttatta cttccggttg tacaggttgg 660
aacccgaatc aggctaaata cgcatattcc aaagatttgg ccagtggctg gtcccagctt 720
tacaatcttg gtaattcaac cacctacagg tcacagccga cttttatcat tcccgttcag 780
ggaagctcgg gaaccagtta tctttatatg ggtgaccgtt gggccggtgc ctggggagga 840
aaggttaatg actcccaata tgtatggctt cccttaaact tcatatccga tacaacactt 900
gaactgccct attatgactc tgtaaagatt gatgcttctt caggaataat ttccgagtac 960
ataccggaca ctacacgcta caagctggta aacaaaaaca gcggaaaagt cctggatgtt 1020
cttgacggtt ctgtcgataa tgcagcccag atagtccaat ggaccgataa cgggtctttg 1080
agtcaacagt ggtaccttgt ggacgtgggc ggtggttata aaaagattgt aaatgtaaag 1140
agcggaagag ccttggatgt aaaagacgaa tccaaggaag acggtggagt attaatacaa 1200
tataccagca acggcggata taatcagcac tggaaattca cagacatagg tgacgggtat 1260
tacaagattt ccagccgcca ctgcggaaaa cttatagatg tgcgaaaatg gtcaacggaa 1320
gacggcggaa taattcagca gtggtccgat gccggaggaa caaatcagca ttggaagctg 1380
gtgcttgtat caagtcccga gccttcacca tcaccttctc cccaagtggt taaaggagat 1440
gtaaacggcg acttgaaagt aaattcaacg gatttttcca tgttaagaag atatttactt 1500
aaaaccattg acaattttcc gacagaaaac ggaaaacagg ctgccgattt gaacggagac 1560
ggcagaataa actcttcgga tcttacaatg ctgaaaagat acttgcttat ggaagtggat 1620
ttgtaa 1626
<210> 10
<211> 541
<212> PRT
<213> Clostridium thermocellum
<400> 10
Met Ala Glu Gly Val Ile Val Asn Gly Thr Gln Phe Lys Asp Thr Ser
1 5 10 15
Gly Asn Val Ile His Ala His Gly Gly Gly Met Leu Lys His Gly Asp
20 25 30
Tyr Tyr Tyr Trp Tyr Gly Glu Tyr Arg Asp Asp Ser Asn Leu Phe Leu
35 40 45
Gly Val Ser Cys Tyr Arg Ser Lys Asp Leu Val Asn Trp Glu Tyr Arg
50 55 60
Gly Glu Val Leu Ser Arg Asn Ser Ala Pro Glu Leu Asn His Cys Asn
65 70 75 80
Ile Glu Arg Pro Lys Val Met Tyr Asn Ala Ser Thr Gly Glu Phe Val
85 90 95
Met Trp Met His Trp Glu Asn Gly Ile Asn Tyr Gly Gln Ala Arg Ala
100 105 110
Ala Val Ala Tyr Ser Lys Thr Pro Asp Gly Lys Phe Thr Tyr Ile Arg
115 120 125
Ser Phe Arg Pro Met Gln Asp Thr Gly Val Met Asp His Gly Leu Pro
130 135 140
Gly Tyr Met Ser Arg Asp Cys Asn Val Phe Val Asp Thr Asp Gly Lys
145 150 155 160
Gly Tyr Phe Ile Ser Ala Ala Asn Glu Asn Met Asp Leu His Leu Tyr
165 170 175
Glu Leu Thr Pro Asp Tyr Lys Asn Ile Ala Ser Leu Lys Ala Lys Leu
180 185 190
Phe Val Gly Gln Gln Arg Glu Ala Pro Cys Leu Ile Lys Arg Asn Gly
195 200 205
Tyr Tyr Tyr Leu Ile Thr Ser Gly Cys Thr Gly Trp Asn Pro Asn Gln
210 215 220
Ala Lys Tyr Ala Tyr Ser Lys Asp Leu Ala Ser Gly Trp Ser Gln Leu
225 230 235 240
Tyr Asn Leu Gly Asn Ser Thr Thr Tyr Arg Ser Gln Pro Thr Phe Ile
245 250 255
Ile Pro Val Gln Gly Ser Ser Gly Thr Ser Tyr Leu Tyr Met Gly Asp
260 265 270
Arg Trp Ala Gly Ala Trp Gly Gly Lys Val Asn Asp Ser Gln Tyr Val
275 280 285
Trp Leu Pro Leu Asn Phe Ile Ser Asp Thr Thr Leu Glu Leu Pro Tyr
290 295 300
Tyr Asp Ser Val Lys Ile Asp Ala Ser Ser Gly Ile Ile Ser Glu Tyr
305 310 315 320
Ile Pro Asp Thr Thr Arg Tyr Lys Leu Val Asn Lys Asn Ser Gly Lys
325 330 335
Val Leu Asp Val Leu Asp Gly Ser Val Asp Asn Ala Ala Gln Ile Val
340 345 350
Gln Trp Thr Asp Asn Gly Ser Leu Ser Gln Gln Trp Tyr Leu Val Asp
355 360 365
Val Gly Gly Gly Tyr Lys Lys Ile Val Asn Val Lys Ser Gly Arg Ala
370 375 380
Leu Asp Val Lys Asp Glu Ser Lys Glu Asp Gly Gly Val Leu Ile Gln
385 390 395 400
Tyr Thr Ser Asn Gly Gly Tyr Asn Gln His Trp Lys Phe Thr Asp Ile
405 410 415
Gly Asp Gly Tyr Tyr Lys Ile Ser Ser Arg His Cys Gly Lys Leu Ile
420 425 430
Asp Val Arg Lys Trp Ser Thr Glu Asp Gly Gly Ile Ile Gln Gln Trp
435 440 445
Ser Asp Ala Gly Gly Thr Asn Gln His Trp Lys Leu Val Leu Val Ser
450 455 460
Ser Pro Glu Pro Ser Pro Ser Pro Ser Pro Gln Val Val Lys Gly Asp
465 470 475 480
Val Asn Gly Asp Leu Lys Val Asn Ser Thr Asp Phe Ser Met Leu Arg
485 490 495
Arg Tyr Leu Leu Lys Thr Ile Asp Asn Phe Pro Thr Glu Asn Gly Lys
500 505 510
Gln Ala Ala Asp Leu Asn Gly Asp Gly Arg Ile Asn Ser Ser Asp Leu
515 520 525
Thr Met Leu Lys Arg Tyr Leu Leu Met Glu Val Asp Leu
530 535 540
<210> 11
<211> 2631
<212> DNA
<213> Paenibacillus sp.
<400> 11
atgaatcgac acgtcctgct tcatccgtat ctccaccgga aggcgttgcc tctgctcctg 60
gccttgacgc tgctgacggg catcgccctg ttcccggcct ccaccgcgca ggcggcgacg 120
accgtgacgt cgatgacgta cttctctgcc aatgacggtc ccgtcatctc caaatccggc 180
gtcgggcaag ccagctacgg tttcgtcatg ccgatcttca acggaggcgc tgcgacctgg 240
aacgatgtcg ccgatgacgt cggcgttcgc gtcaaggtcg gcggcagctg ggtcgacatt 300
gacagcgttg gcggctatgt gtacaaccag aactggggcc attggaacga cagcggcacc 360
tatggctact ggttcaccct ctccgccacg accgagctgc agctctactc caaggcgaac 420
agcagcgtca cactcaacta cacgctcgtc ttccagaatg tcaatgaaac gaccattacc 480
tcgatgacac cgacccaggg cccgcaattg accgcagggt ataccggcgg cgcaggcttc 540
acctatccgg tcttcaacaa cgatccctcc atcccgtatg cagccgtagc cggcgatctg 600
aaggtgtacg tcaagccagt cgccagcagt acctggatcg atatcgacaa caacgcggcg 660
agcggctgga tctacgacag caacttcggc cagttcaccg aaggcggcgg cggctactgg 720
ttcaccgtca ccgagtcgat caacgtcaag ctcgagtcca ggacgtcctc ggccaacgtc 780
gtctatacga tcaacttccc gcagccgacg cgcagcagct acacactctc cgcctatgac 840
ggcacgacct acagcgccga tgcgagcggc gcgatcggta tcccgctgcc gcggatcgac 900
ggcaccccgg cgatcggcag cgagctcggc aacttcgtct accagatcta ccggaacggc 960
cagtgggtcg agatgagcaa ctcggcgcag agcagcttcg tctactcggc caatggctac 1020
aacaacatgt ccgacgccaa tcaatggggc tactgggccg actacatcta cggcctctgg 1080
ttccggccga tccaggagga tatgcagatc cgcatcggct atccgctgaa tggccagtcc 1140
ggcggcagcg tcggcagcaa cttcgtcacc tatacgctga tcggcaaccc gaacgcgccg 1200
cgacccgatg tgagcgacca gggcgacgtc gagatcggca cgcccaccga tccggccatc 1260
gcaggatgga atctgtattg gcaggatgaa ttcgccggca gcgcgctcga tctgaacaag 1320
tggaactacg agaccggcta ctacatcggc aacgacccca atctgtgggg ctggggcaac 1380
gccgagatgc agcactatac gacgagcacg caaaatgtct tcgtcgctga cggcaaactc 1440
aacatccgag cgctccacga ttaccaatcg ttcccgcagg acccgaaccg ctacgcgacc 1500
tactcctccg gcaagatcaa caccaaggac aacatgtcgc tgcagtacgg ccgcgtcgat 1560
atccgcgcca agctgccgac tggcgatggc gtctggccgg cactgtggat gctgccggag 1620
gactccgtct acggcgcatg ggcggcatca ggagagatcg acatcatgga ggcgaagggc 1680
cgtctgcccg gcacgacgag cggcgcgatc cactacggcg gccaatggcc ggtcaaccgc 1740
tacctcgccg gagaatgcta cctcccgcaa ggtacgacat tcgccgacga ctttaatgtg 1800
tacacgatga tctgggaaga ggacaacatg aagtggtacg ttaacggtga gtttttcttc 1860
aaggtgacgc gcgagcagtg gtactccgtc gccgccccca acaatccgga cgcgccgttc 1920
gaccagccgt tctatctgat catgaacctg gcggtcggcg gccacttcga cggcgggcgt 1980
acgcccgacc cgtccgacat cccggcgacg atgcagatcg actacgtgcg ggtgtacaaa 2040
gagggcgcgg gcggcggtcc gggcaacccg ggcggcaacg tcgcggtgac cggcgttagc 2100
gtgaccccgg caacggcgca ggtgcaggtc ggtcagaccg tctcgctgag cgccaacgtc 2160
gcgccagcca atgcaacgaa caagcaagtg acctggtcag tcgccaatgg cagcatcgcc 2220
tcggtgagcg ccagcggcgt cgtcagtgga ctcgctgctg gcacgacgac cgtaaccgcc 2280
acgaccgcag acggcaaccg caccgcctcg gcgacgatca ccgtcgtgcc gccaccgacg 2340
acgaccgtca tcatcggcga tagcgtgcgc ggcatccgaa agaccggcga caacctgctc 2400
ttctacgtca acggcgcaac ctacgccgac ctgcactaca aggtgaacgg cggcggtcag 2460
cctaatgtcg cgatgacgca cacaggaggc ggcaactaca cctacccggt gcatggcctc 2520
caacaaggcg ataccgtcga atacttcttc acctacaacc ccggcaacgg cgcgctagac 2580
acgccttggc agacttatgt gcatggggta acacaaggtg ttgttgagta a 2631
<210> 12
<211> 876
<212> PRT
<213> Paenibacillus sp.
<400> 12
Met Asn Arg His Val Leu Leu His Pro Tyr Leu His Arg Lys Ala Leu
1 5 10 15
Pro Leu Leu Leu Ala Leu Thr Leu Leu Thr Gly Ile Ala Leu Phe Pro
20 25 30
Ala Ser Thr Ala Gln Ala Ala Thr Thr Val Thr Ser Met Thr Tyr Phe
35 40 45
Ser Ala Asn Asp Gly Pro Val Ile Ser Lys Ser Gly Val Gly Gln Ala
50 55 60
Ser Tyr Gly Phe Val Met Pro Ile Phe Asn Gly Gly Ala Ala Thr Trp
65 70 75 80
Asn Asp Val Ala Asp Asp Val Gly Val Arg Val Lys Val Gly Gly Ser
85 90 95
Trp Val Asp Ile Asp Ser Val Gly Gly Tyr Val Tyr Asn Gln Asn Trp
100 105 110
Gly His Trp Asn Asp Ser Gly Thr Tyr Gly Tyr Trp Phe Thr Leu Ser
115 120 125
Ala Thr Thr Glu Leu Gln Leu Tyr Ser Lys Ala Asn Ser Ser Val Thr
130 135 140
Leu Asn Tyr Thr Leu Val Phe Gln Asn Val Asn Glu Thr Thr Ile Thr
145 150 155 160
Ser Met Thr Pro Thr Gln Gly Pro Gln Leu Thr Ala Gly Tyr Thr Gly
165 170 175
Gly Ala Gly Phe Thr Tyr Pro Val Phe Asn Asn Asp Pro Ser Ile Pro
180 185 190
Tyr Ala Ala Val Ala Gly Asp Leu Lys Val Tyr Val Lys Pro Val Ala
195 200 205
Ser Ser Thr Trp Ile Asp Ile Asp Asn Asn Ala Ala Ser Gly Trp Ile
210 215 220
Tyr Asp Ser Asn Phe Gly Gln Phe Thr Glu Gly Gly Gly Gly Tyr Trp
225 230 235 240
Phe Thr Val Thr Glu Ser Ile Asn Val Lys Leu Glu Ser Arg Thr Ser
245 250 255
Ser Ala Asn Val Val Tyr Thr Ile Asn Phe Pro Gln Pro Thr Arg Ser
260 265 270
Ser Tyr Thr Leu Ser Ala Tyr Asp Gly Thr Thr Tyr Ser Ala Asp Ala
275 280 285
Ser Gly Ala Ile Gly Ile Pro Leu Pro Arg Ile Asp Gly Thr Pro Ala
290 295 300
Ile Gly Ser Glu Leu Gly Asn Phe Val Tyr Gln Ile Tyr Arg Asn Gly
305 310 315 320
Gln Trp Val Glu Met Ser Asn Ser Ala Gln Ser Ser Phe Val Tyr Ser
325 330 335
Ala Asn Gly Tyr Asn Asn Met Ser Asp Ala Asn Gln Trp Gly Tyr Trp
340 345 350
Ala Asp Tyr Ile Tyr Gly Leu Trp Phe Arg Pro Ile Gln Glu Asp Met
355 360 365
Gln Ile Arg Ile Gly Tyr Pro Leu Asn Gly Gln Ser Gly Gly Ser Val
370 375 380
Gly Ser Asn Phe Val Thr Tyr Thr Leu Ile Gly Asn Pro Asn Ala Pro
385 390 395 400
Arg Pro Asp Val Ser Asp Gln Gly Asp Val Glu Ile Gly Thr Pro Thr
405 410 415
Asp Pro Ala Ile Ala Gly Trp Asn Leu Tyr Trp Gln Asp Glu Phe Ala
420 425 430
Gly Ser Ala Leu Asp Leu Asn Lys Trp Asn Tyr Glu Thr Gly Tyr Tyr
435 440 445
Ile Gly Asn Asp Pro Asn Leu Trp Gly Trp Gly Asn Ala Glu Met Gln
450 455 460
His Tyr Thr Thr Ser Thr Gln Asn Val Phe Val Ala Asp Gly Lys Leu
465 470 475 480
Asn Ile Arg Ala Leu His Asp Tyr Gln Ser Phe Pro Gln Asp Pro Asn
485 490 495
Arg Tyr Ala Thr Tyr Ser Ser Gly Lys Ile Asn Thr Lys Asp Asn Met
500 505 510
Ser Leu Gln Tyr Gly Arg Val Asp Ile Arg Ala Lys Leu Pro Thr Gly
515 520 525
Asp Gly Val Trp Pro Ala Leu Trp Met Leu Pro Glu Asp Ser Val Tyr
530 535 540
Gly Ala Trp Ala Ala Ser Gly Glu Ile Asp Ile Met Glu Ala Lys Gly
545 550 555 560
Arg Leu Pro Gly Thr Thr Ser Gly Ala Ile His Tyr Gly Gly Gln Trp
565 570 575
Pro Val Asn Arg Tyr Leu Ala Gly Glu Cys Tyr Leu Pro Gln Gly Thr
580 585 590
Thr Phe Ala Asp Asp Phe Asn Val Tyr Thr Met Ile Trp Glu Glu Asp
595 600 605
Asn Met Lys Trp Tyr Val Asn Gly Glu Phe Phe Phe Lys Val Thr Arg
610 615 620
Glu Gln Trp Tyr Ser Val Ala Ala Pro Asn Asn Pro Asp Ala Pro Phe
625 630 635 640
Asp Gln Pro Phe Tyr Leu Ile Met Asn Leu Ala Val Gly Gly His Phe
645 650 655
Asp Gly Gly Arg Thr Pro Asp Pro Ser Asp Ile Pro Ala Thr Met Gln
660 665 670
Ile Asp Tyr Val Arg Val Tyr Lys Glu Gly Ala Gly Gly Gly Pro Gly
675 680 685
Asn Pro Gly Gly Asn Val Ala Val Thr Gly Val Ser Val Thr Pro Ala
690 695 700
Thr Ala Gln Val Gln Val Gly Gln Thr Val Ser Leu Ser Ala Asn Val
705 710 715 720
Ala Pro Ala Asn Ala Thr Asn Lys Gln Val Thr Trp Ser Val Ala Asn
725 730 735
Gly Ser Ile Ala Ser Val Ser Ala Ser Gly Val Val Ser Gly Leu Ala
740 745 750
Ala Gly Thr Thr Thr Val Thr Ala Thr Thr Ala Asp Gly Asn Arg Thr
755 760 765
Ala Ser Ala Thr Ile Thr Val Val Pro Pro Pro Thr Thr Thr Val Ile
770 775 780
Ile Gly Asp Ser Val Arg Gly Ile Arg Lys Thr Gly Asp Asn Leu Leu
785 790 795 800
Phe Tyr Val Asn Gly Ala Thr Tyr Ala Asp Leu His Tyr Lys Val Asn
805 810 815
Gly Gly Gly Gln Pro Asn Val Ala Met Thr His Thr Gly Gly Gly Asn
820 825 830
Tyr Thr Tyr Pro Val His Gly Leu Gln Gln Gly Asp Thr Val Glu Tyr
835 840 845
Phe Phe Thr Tyr Asn Pro Gly Asn Gly Ala Leu Asp Thr Pro Trp Gln
850 855 860
Thr Tyr Val His Gly Val Thr Gln Gly Val Val Glu
865 870 875
<210> 13
<211> 6783
<212> DNA
<213> Artificial sequence
<220>
<223> Transposon cassette
<400> 13
gccagatgat taattcctaa tttttgttga cactctatca ttgatagagt tattttacca 60
ctccctatca gtgatagaga aaagtgaaat gaatagttcg acaaaaatct agaaataatt 120
ttgtttaact ttaagaagga gatatacaat ttcgtcgaca cacaggaaac atattaaaaa 180
ttaaaacctg caggagtttg aaggagatag aaccatggcg cagtcgaaac tctatccagt 240
tgtgatggca ggtggctccg gtagccgctt atggccgctt tcccgcgtac tttatcccaa 300
gcagttttta tgcctgaaag gcgatctcac catgctgcaa accaccatct gccgcctgaa 360
cggcgtggag tgcgaaagcc cggtggtgat ttgcaatgag cagcaccgct ttattgtcgc 420
ggaacagctg cgtcaactga acaaacttac cgagaacatt attctcgaac cggcagggcg 480
aaacacggca cctgccattg cgctggcggc gctggcggca aaacgtcata gcccggagag 540
cgacccgtta atgctggtat tggcggcgga tcatgtgatt gccgatgaag acgcgttccg 600
tgccgccgtg cgtaatgcca tgccatatgc cgaagcgggc aagctggtga ccttcggcat 660
tgtgccggat ctaccagaaa ccggttatgg ctatattcgt cgcggtgaag tgtctgcggg 720
tgagcaggat atggtggcct ttgaagtggc gcagtttgtc gaaaaaccga atctggaaac 780
cgctcaggcc tatgtggcaa gcggcgaata ttactggaac agcggtatgt tcctgttccg 840
cgccggacgc tatctcgaag aactgaaaaa atatcgcccg gatatcctcg atgcctgtga 900
aaaagcgatg agcgccgtcg atccggatct caattttatt cgcgtggatg aagaagcgtt 960
tctcgcctgc ccggaagagt cggtggatta cgcggtcatg gaacgtacgg cagatgctgt 1020
tgtggtgccg atggatgcgg gctggagcga tgttggctcc tggtcttcat tatgggagat 1080
cagcgcccac accgccgagg gcaacgtttg ccacggcgat gtgattaatc acaaaactga 1140
aaacagctat gtgtatgctg aatctggcct ggtcaccacc gtcggggtga aagatctggt 1200
agtggtgcag accaaagatg cggtgctgat tgccgaccgt aacgcggtac aggatgtgaa 1260
aaaagtggtc gagcagatca aagccgatgg tcgccatgag catcgggtgc atcgcgaagt 1320
gtatcgtccg tggggcaaat atgactctat cgacgcgggc gaccgctacc aggtgaaacg 1380
catcaccgtg aaaccgggcg agggcttgtc ggtacagatg caccatcacc gcgcggaaca 1440
ctgggtggtt gtcgcgggaa cggcaaaagt caccattgat ggtgatatca aactgcttgg 1500
tgaaaacgag tccatttata ttccgctggg ggcgacgcat tgcctggaaa acccggggaa 1560
aattccgctc gatttaattg aagtgcgctc cggctcttat ctcgaagagg atgatgtggt 1620
gcgtttcgcg gatcgctacg gacgggtgta aacgtcgcat caggcaatga atgcgaaacc 1680
gcggtgtaaa taacgacaaa aataaaattg gccgcttcgg tcagggccaa ctattgcctg 1740
aaaaagggta acgatatgaa aaaattaacc tgctttaaag cctatgatat tcgcgggaaa 1800
ttaggcgaag aactgaatga agatatcgcc tggcgcattg gtcgcgccta tggcgaattt 1860
ctcaaaccga aaaccattgt gttaggcggt gatgtccgcc tcaccagcga aaccttaaaa 1920
ctggcgctgg cgaaaggttt acaggatgcg ggcgttgacg tgctggatat tggtatgtcc 1980
ggcaccgaag agatctattt cgccacgttc catctcggcg tggatggcgg cattgaagtt 2040
accgccagcc ataatccgat ggattataac ggcatgaagc tggttcgcga gggggctcgc 2100
ccgatcagcg gagataccgg actgcgcgac gtccagcgtc tggctgaagc caacgacttt 2160
cctcccgtcg atgaaaccaa acgcggtcgc tatcagcaaa tcaacctgcg tgacgcttac 2220
gttgatcacc tgttcggtta tatcaatgtc aaaaacctca cgccgctcaa gctggtgatc 2280
aactccggga acggcgcagc gggtccggtg gtggacgcca ttgaagcccg ctttaaagcc 2340
ctcggcgcgc ccgtggaatt aatcaaagtg cacaacacgc cggacggcaa tttccccaac 2400
ggtattccta acccactact gccggaatgc cgcgacgaca cccgcaatgc ggtcatcaaa 2460
cacggcgcgg atatgggcat tgcttttgat ggcgattttg accgctgttt cctgtttgac 2520
gaaaaagggc agtttattga gggctactac attgtcggcc tgttggcaga agcattcctc 2580
gaaaaaaatc ccggcgcgaa gatcatccac gatccacgtc tctcctggaa caccgttgat 2640
gtggtgactg ccgcaggtgg cacgccggta atgtcgaaaa ccggacacgc ctttattaaa 2700
gaacgtatgc gcaaggaaga cgccatctat ggtggcgaaa tgagcgccca ccattacttc 2760
cgtgatttcg cttactgcga cagcggcatg atcccgtggc tgctggtcgc cgaactggtg 2820
tgcctgaaag ataaaacgct gggcgaactg gtacgcgacc ggatggcggc gtttccggca 2880
agcggtgaga tcaacagcaa actggcgcaa cccgttgagg cgattaaccg cgtggaacag 2940
cattttagcc gtgaggcgct ggcggtggat cgcaccgatg gcatcagcat gacctttgcc 3000
gactggcgct ttaacctgcg cacctccaat accgaaccgg tggtgcgcct gaatgtggaa 3060
tcgcgcggtg atgtgccgct gatggaagcg cgaacgcgaa ctctgctgac gttgctgaac 3120
gagtaaaaac gcggccgcga tatcgttgta aaacgacggc cagtgcaaga atcataaaaa 3180
atttatttgc tttcaggaaa atttttctgt ataatagatt cataaatttg agagaggagt 3240
ttttgtgagc ggataacaat tccccatctt agtatattag ttaagtataa atacaccgcg 3300
gaggacgaag gagatagaac catgtcaaaa gtcgctctca tcaccggtgt aaccggacaa 3360
gacggttctt acctggcaga gtttctgctg gaaaaaggtt acgaggtgca tggtattaag 3420
cgtcgcgcat cgtcattcaa caccgagcgc gtggatcaca tttatcagga tccgcacacc 3480
tgcaacccga aattccatct gcattatggc gacctgagtg atacctctaa cctgacgcgc 3540
attttgcgtg aagtacagcc ggatgaagtg tacaacctgg gcgcaatgag ccacgttgcg 3600
gtctcttttg agtcaccaga atataccgct gacgtcgacg cgatgggtac gctgcgcctg 3660
ctggaggcga tccgcttcct cggtctggaa aagaaaactc gtttctatca ggcttccacc 3720
tctgaactgt atggtctggt gcaggaaatt ccgcagaaag agaccacgcc gttctacccg 3780
cgatctccgt atgcggtcgc caaactgtac gcctactgga tcaccgttaa ctaccgtgaa 3840
tcctacggca tgtacgcctg taacggaatt ctcttcaacc atgaatcccc gcgccgcggc 3900
gaaaccttcg ttacccgcaa aatcacccgc gcaatcgcca acatcgccca ggggctggag 3960
tcgtgcctgt acctcggcaa tatggattcc ctgcgtgact ggggccacgc caaagactac 4020
gtaaaaatgc agtggatgat gctgcagcag gaacagccgg aagatttcgt tatcgcgacc 4080
ggcgttcagt actccgtgcg tcagttcgtg gaaatggcgg cagcacagct gggcatcaaa 4140
ctgcgctttg aaggcacggg cgttgaagag aagggcattg tggtttccgt caccgggcat 4200
gacgcgccgg gcgttaaacc gggtgatgtg attatcgctg ttgacccgcg ttacttccgt 4260
ccggctgaag ttgaaacgct gctcggcgac ccgaccaaag cgcacgaaaa actgggctgg 4320
aaaccggaaa tcaccctcag agagatggtg tctgaaatgg tggctaatga cctcgaagcg 4380
gcgaaaaaac actctctgct gaaatctcac ggctacgacg tggcgatcgc gctggagtca 4440
taagcatgag taaacaacga gtttttattg ctggtcatcg cgggatggtc ggttccgcca 4500
tcaggcggca gctcgaacag cgcggtgatg tggaactggt attacgcacc cgcgacgagc 4560
tgaacctgct ggacagccgc gccgtgcatg atttctttgc cagcgaacgt attgaccagg 4620
tctatctggc ggcggcgaaa gtgggcggca ttgttgccaa caacacctat ccggcggatt 4680
tcatctacca gaacatgatg attgagagca acatcattca cgccgcgcat cagaacgacg 4740tcatctacca gaacatgatg attgagagca acatcattca cgccgcgcat cagaacgacg 4740
tgaacaaact gctgtttctc ggatcgtcct gcatctaccc gaaactggca aaacagccga 4800tgaacaaact gctgtttctc ggatcgtcct gcatctaccc gaaactggca aaacagccga 4800
tggcagaaag cgagttgttg cagggcacgc tggagccgac taacgagcct tatgctattg 4860tggcagaaag cgagttgttg cagggcacgc tggagccgac taacgagcct tatgctattg 4860
ccaaaatcgc cgggatcaaa ctgtgcgaat catacaaccg ccagtacgga cgcgattacc 4920ccaaaatcgc cgggatcaaa ctgtgcgaat catacaaccg ccagtacgga cgcgattacc 4920
gctcagtcat gccgaccaac ctgtacgggc cacacgacaa cttccacccg agtaattcgc 4980gctcagtcat gccgaccaac ctgtacgggc cacacgacaa cttccacccg agtaattcgc 4980
atgtgatccc agcattgctg cgtcgcttcc acgaggcgac ggcacagaat gcgccggacg 5040atgtgatccc agcattgctg cgtcgcttcc acgaggcgac ggcacagaat gcgccggacg 5040
tggtggtatg gggcagcggt acaccgatgc gcgaatttct gcacgtcgat gatatggcgg 5100tggtggtatg gggcagcggt acaccgatgc gcgaatttct gcacgtcgat gatatggcgg 5100
cggcgagcat tcatgtcatg gagctggcgc atgaagtctg gctggagaac acccagccga 5160cggcgagcat tcatgtcatg gagctggcgc atgaagtctg gctggagaac acccagccga 5160
tgttgtcgca cattaacgtc ggcacgggcg ttgactgcac tatccgcgag ctggcgcaaa 5220tgttgtcgca cattaacgtc ggcacggggcg ttgactgcac tatccgcgag ctggcgcaaa 5220
ccatcgccaa agtggtgggt tacaaaggcc gggtggtttt tgatgccagc aaaccggatg 5280ccatcgccaa agtggtgggt tacaaaggcc gggtggtttt tgatgccagc aaaccggatg 5280
gcacgccgcg caaactgctg gatgtgacgc gcctgcatca gcttggctgg tatcacgaaa 5340gcacgccgcg caaactgctg gatgtgacgc gcctgcatca gcttggctgg tatcacgaaa 5340
tctcactgga agcggggctt gccagcactt accagtggtt ccttgagaat caagaccgct 5400tctcactgga agcggggctt gccagcactt accagtggtt ccttgagaat caagaccgct 5400
ttcggggggg gagctaacgc gccatttaaa tcaacctcag cggtcatagc tgtttcctgt 5460ttcgggggggg gagctaacgc gccatttaaa tcaacctcag cggtcatagc tgtttcctgt 5460
gactgagcaa taactagcat aaccccttgg ggcctctaaa cgggtcttga ggggtttttt 5520gactgagcaa taactagcat aaccccttgg ggcctctaaa cgggtcttga ggggtttttt 5520
gctgaaacca atttgcctgg cggcagtagc gcggtggtcc cacctgaccc catgccgaac 5580gctgaaacca atttgcctgg cggcagtagc gcggtggtcc cacctgaccc catgccgaac 5580
tcagaagtga aacgccgtag cgccgatggt agtgtggggt ctccccatgc gagagtaggg 5640tcagaagtga aacgccgtag cgccgatggt agtgtggggt ctccccatgc gagagtaggg 5640
aactgccagg catcaaataa aacgaaaggc tcagtcgaaa gactgggcct ttcgggatcc 5700
aggccggcct gttaacgaat taatcttccg cggcggtatc gataagcttg atatcgaatt 5760
ccgaagttcc tattctctag aaagtatagg aacttcaggt ctgaagagga gtttacgtcc 5820
agccaagcta gcttggctgc aggtcgtcga aattctaccg ggtaggggag gcgcttttcc 5880
caaggcagtc tggagcatgc gctttagcag ccccgctggg cacttggcgc tacacaagtg 5940
gcctctggcc tcgcacacat tccacatcca ccggtaggcg ccaaccggct ccgttctttg 6000
gtggcccctt cgcgccacct tctactcctc ccctagtcag gaagttcccc cccgccccgc 6060
agctcgcgtc gtgcaggacg tgacaaatgg aagtagcacg tctcactagt ctcgtgcaga 6120
tggacagcac cgctgagcaa tggaagcggg taggcctttg gggcagcggc caatagcagc 6180
tttgctcctt cgctttctgg gctcagaggc tgggaagggg tgggtccggg ggcgggctca 6240
ggggcgggct caggggcggg gcgggcgccc gaaggtcctc cggaggcccg gcattctgca 6300
cgcttcaaaa gcgcacgtct gccgcgctgt tctcctcttc ctcatctccg ggcctttcga 6360
cctgcagcct gttgacaatt aatcatcggc atagtatatc ggcatagtat aatacgacaa 6420
ggtgaggaac taaaccatgg gtcaaagtag cgatgaagcc aacgctcccg ttgcagggca 6480
gtttgcgctt cccctgagtg ccacctttgg cttaggggat cgcgtacgca agaaatctgg 6540
tgccgcttgg cagggtcaag tcgtcggttg gtattgcaca aaactcactc ctgaaggcta 6600
tgcggtcgag tccgaatccc acccaggctc agtgcaaatt tatcctgtgg ctgcacttga 6660
acgtgtggcc taatgagggg atcaattctc tagagctcgc tgatcagaag ttcctattct 6720
ctagaaagta taggaacttc gatggcgcct catccctgaa gccaataggg ataacagggt 6780
aat 6783
<210> 14
<211> 2851
<212> DNA
<213> Artificial sequence
<220>
<223> Integration cassette
<400> 14
tggccagatg attaattcct aatttttgtt gacactctat cattgataga gttattttac 60
cactccctat cagtgataga gaaaagtgaa atgaatagtt cgacaaaaat ctagaaataa 120
ttttgtttaa ctttaagaag gagatataca aatgtactat ttaaaaaaca caaacttttg 180
gatgttcggt ttattctttt tcttttactt ttttatcatg ggagcctact tcccgttttt 240
cccgatttgg ctacatgaca tcaaccatat cagcaaaagt gatacgggta ttatttttgc 300
cgctatttct ctgttctcgc tattattcca accgctgttt ggtctgcttt ctgacaaact 360
cgggctgcgc aaatacctgc tgtggattat taccggcatg ttagtgatgt ttgcgccgtt 420
ctttattttt atcttcgggc cactgttaca atacaacatt ttagtaggat cgattgttgg 480
tggtatttat ctaggctttt gttttaacgc cggtgcgcca gcagtagagg catttattga 540
gaaagtcagc cgtcgcagta atttcgaatt tggtcgcgcg cggatgtttg gctgtgttgg 600
ctgggcgctg tgtgcctcga ttgtcggcat catgttcacc atcaataatc agtttgtttt 660
ctggctgggc tctggctgtg cactcatcct cgccgtttta ctctttttcg ccaaaacgga 720
tgcgccctct tctgccacgg ttgccaatgc ggtaggtgcc aaccattcgg catttagcct 780
taagctggca ctggaactgt tcagacagcc aaaactgtgg tttttgtcac tgtatgttat 840
tggcgtttcc tgcacctacg atgtttttga ccaacagttt gctaatttct ttacttcgtt 900
ctttgctacc ggtgaacagg gtacgcgggt atttggctac gtaacgacaa tgggcgaatt 960
acttaacgcc tcgattatgt tctttgcgcc actgatcatt aatcgcatcg gtgggaaaaa 1020
cgccctgctg ctggctggca ctattatgtc tgtacgtatt attggctcat cgttcgccac 1080
ctcagcgctg gaagtggtta ttctgaaaac gctgcatatg tttgaagtac cgttcctgct 1140
ggtgggctgc tttaaatata ttaccagcca gtttgaagtg cgtttttcag cgacgattta 1200
tctggtctgt ttctgcttct ttaagcaact ggcgatgatt tttatgtctg tactggcggg 1260
caatatgtat gaaagcatcg gtttccaggg cgcttatctg gtgctgggtc tggtggcgct 1320
gggcttcacc ttaatttccg tgttcacgct tagcggcccc ggcccgcttt ccctgctgcg 1380
tcgtcaggtg aatgaagtcg ctgggagcta agcggccgcg tcgacacgca aaaaggccat 1440
ccgtcaggat ggccttctgc ttaatttgat gcctggcagt ttatggcggg cgtcctgccc 1500
gccaccctcc gggccgttgc ttcgcaacgt tcaaatccgc tcccggcgga tttgtcctac 1560
tcaggagagc gttcaccgac aaacaacaga taaaacgaaa ggcccagtct ttcgactgag 1620
cctttcgttt tatttgatgc ctggcagttc cctactctcg catggggaga ccccacacta 1680
ccatcatgta tgaatatcct ccttagttcc tattccgaag ttcctattct ctagaaagta 1740
taggaacttc ggcgcgtcct acctgtgaca cgcgtgccgc agtctcacgc ccggagcgta 1800
gcgaccgagt gagctagcta tttgtttatt tttctaaata cattcaaata tgtatccgct 1860
catgagacaa taaccctgat aaatgcttca ataatattga aaaaggaaga gtatgaggga 1920
agcggtgatc gccgaagtat cgactcaact atcagaggta gttggcgtca tcgagcgcca 1980
tctcgaaccg acgttgctgg ccgtacattt gtacggctcc gcagtggatg gcggcctgaa 2040
gccacacagt gatattgatt tgctggttac ggtgaccgta aggcttgatg aaacaacgcg 2100
gcgagctttg atcaacgacc ttttggaaac ttcggcttcc cctggagaga gcgagattct 2160
ccgcgctgta gaagtcacca ttgttgtgca cgacgacatc attccgtggc gttatccagc 2220
taagcgcgaa ctgcaatttg gagaatggca gcgcaatgac attcttgcag gtatcttcga 2280
gccagccacg atcgacattg atctggctat cttgctgaca aaagcaagag aacatagcgt 2340
tgccttggta ggtccagcgg cggaggaact ctttgatccg gttcctgaac aggatctatt 2400
tgaggcgcta aatgaaacct taacgctatg gaactcgccg cccgactggg ctggcgatga 2460
gcgaaatgta gtgcttacgt tgtcccgcat ttggtacagc gcagtaaccg gcaaaatcgc 2520
gccgaaggat gtcgctgccg actgggcaat ggagcgcctg ccggcccagt atcagcccgt 2580
catacttgaa gctagacagg cttatcttgg acaagaagaa gatcgcttgg cctcgcgcgc 2640
agatcagttg gaagaatttg tccactacgt gaaaggcgag atcaccaagg tagtcggcaa 2700
ataatgtcta acaattcgtt caagccgagg ggccgcaaga tccggccacg atgacccggt 2760
cgtcgggtac cggcagggcg gggcgtaagg cgcgccattt aaatgaagtt cctattccga 2820
agttcctatt ctctagaaag tataggaact t 2851
<210> 15
<211> 2858
<212> DNA
<213> Artificial sequence
<220>
<223> Integration cassette
<400> 15
ggccagatga ttaattccta atttttgttg acactctatc attgatagag ttattttacc 60
actccctatc agtgatagag aaaagtgaaa tgaatagttc gacaaaaatc tagaaataat 120
tttgtttaac tttaagaagg agatatacaa atgggcagca ttattcgtct gcagggtggt 180
ctgggtaatc agctgtttca gtttagcttt ggttatgccc tgagcaaaat taatggtaca 240
ccgctgtatt tcgacattag ccattatgcc gaaaacgatg atcatggtgg ttatcgtctg 300
aataatctgc agattccgga agaatatctg cagtattata ccccgaaaat taataatatt 360
tataaactgc tggtgcgtgg cagccgtctg tatccggata tttttctgtt tctgggcttt 420
tgcaacgaat ttcatgccta tggctacgat tttgaatata ttgcccagaa atggaaaagc 480
aaaaaataca ttggctactg gcagagcgaa cacttttttc ataaacatat tctggacctg 540
aaagaatttt ttattccgaa aaatgtgagc gaacaggcaa atctgctggc agcaaaaatt 600
ctggaaagcc agagcagcct gagcattcat attcgtcgtg gcgattatat taaaaacaaa 660
accgcaaccc tgacacatgg tgtttgtagc ctggaatatt ataaaaaagc cctgaacaaa 720
atccgcgatc tggcaatgat tcgtgatgtg tttatcttta gcgacgatat cttctggtgc 780
aaagaaaata ttgaaaccct gctgagcaaa aaatataata tttattatag cgaagatctg 840
agccaagaag aggatctgtg gctgatgagc ctggcaaatc atcatattat tgccaatagc 900
agctttagtt ggtggggtgc atatctgggt agcagcgcaa gccagattgt tatttatccg 960
accccgtggt atgatattac cccgaaaaac acctatatcc cgattgtgaa ccattggatc 1020
aacgttgata aacatagcag ctgctaagcg gccgcgtcga cacgcaaaaa ggccatccgt 1080
caggatggcc ttctgcttaa tttgatgcct ggcagtttat ggcgggcgtc ctgcccgcca 1140
ccctccgggc cgttgcttcg caacgttcaa atccgctccc ggcggatttg tcctactcag 1200
gagagcgttc accgacaaac aacagataaa acgaaaggcc cagtctttcg actgagcctt 1260
tcgttttatt tgatgcctgg cagttcccta ctctcgcatg gggagacccc acactaccat 1320
catgtatgaa tatcctcctt agttcctatt ccgaagttcc tattctctag aaagtatagg 1380
aacttcggcg cgtcctacct gtgacacgcg tcaagatccc ctcacgctgc cgcaagcact 1440
cagggcgcaa gggctgctaa aggaagcgga acacgtagaa agccagtccg cagaaacggt 1500
gctgaccccg gatgaatgtc agctactggg ctatctggac aagggaaaac gcaagcgcaa 1560
agagaaagca ggtagcttgc agtgggctta catggcgata gctagactgg gcggttttat 1620
ggacagcaag cgaaccggaa ttgccagctg gggcgccctc tggtaaggtt gggaagccct 1680
gcaaagtaaa ctggatggct ttcttgccgc caaggatctg atggcgcagg ggatcaagat 1740
ctgatcaaga gacaggatga ggatcgtttc gcatgattga acaagatgga ttgcacgcag 1800
gttctccggc cgcttgggtg gagaggctat tcggctatga ctgggcacaa cagacaatcg 1860
gctgctctga tgccgccgtg ttccggctgt cagcgcaggg gcgcccggtt ctttttgtca 1920
agaccgacct gtccggtgcc ctgaatgaac tgcaggacga ggcagcgcgg ctatcgtggc 1980
tggccacgac gggcgttcct tgcgcagctg tgctcgacgt tgtcactgaa gcgggaaggg 2040
actggctgct attgggcgaa gtgccggggc aggatctcct gtcatctcac cttgctcctg 2100
ccgagaaagt atccatcatg gctgatgcaa tgcggcggct gcatacgctt gatccggcta 2160
cctgcccatt cgaccaccaa gcgaaacatc gcatcgagcg agcacgtact cggatggaag 2220
ccggtcttgt cgatcaggat gatctggacg aagagcatca ggggctcgcg ccagccgaac 2280
tgttcgccag gctcaaggcg cgcatgcccg acggcgagga tctcgtcgtg acccatggcg 2340
atgcctgctt gccgaatatc atggtggaaa atggccgctt ttctggattc atcgactgtg 2400
gccggctggg tgtggcggac cgctatcagg acatagcgtt ggctacccgt gatattgctg 2460
aagagcttgg cggcgaatgg gctgaccgct tcctcgtgct ttacggtatc gccgctcccg 2520
attcgcagcg catcgccttc tatcgccttc ttgacgagtt cttctgagcg ggactctggg 2580
gttcgaaatg accgaccaag cgacgcccaa cctgccatca cgagatttcg attccaccgc 2640
cgccttctat gaaaggttgg gcttcggaat cgttttccgg gacgccggct ggatgatcct 2700
ccagcgcggg gatctcatgc tggagttctt cgcccacccc agcttcaaaa gcgctctcgg 2760
taccggcagg gcggggcgta aggcgcgcca tttaaatgaa gttcctattc cgaagttcct 2820
attctctaga aagtatagga acttcgaagc agctccag 2858
<210> 16
<211> 2631
<212> DNA
<213> Artificial sequence
<220>
<223> Integration cassette
<400> 16
ggccagatga ttaattccta atttttgttg acactctatc attgatagag ttattttacc 60
actccctatc agtgatagag aaaagtgaaa tgaatagttc gacaaaaatc tagaaataat 120
tttgtttaac tttaagaagg agatatacaa atgaagtcgg cactgacctt ttcccgtcgc 180
atcaatccgg tgtttctggc gttctttgtc gttgcttttc tgagcggtat cgcaggcgca 240
ctgcaggctc cgaccctgag tctgtttctg tccacggaag tgaaagttcg tccgctgtgg 300
gttggtctgt tctataccgt caacgcaatc gctggcatta cggttagctt tatcctggcg 360
aaacgttcag attcgcgcgg tgaccgtcgc aagctgatta tggtgtgcta tctgatggcg 420
gttggcaact gtctgctgtt tgccttcaat cgtgattacc tgaccctgat cacggcaggt 480
gtgctgctgg cgagcgttgc caacaccgca atgccgcaga ttttcgcgct ggcccgtgaa 540
tatgccgaca gctctgcacg cgaagtggtt atgtttagtt ccatcatgcg cgctcaactg 600
agtctggcat gggtgattgg tccgccgctg tcctttatgc tggcgctgaa ttacggtttt 660
accctgatgt tctcaatcgc ggccggcatt ttcgttctgt cggccctggt cgtgtggttt 720
atcctgccga gtgtcccgcg tgcagaaccg gttgtcgatg caccggtggt tgtccagggt 780
tcactgttcg cagacaaaaa cgttctgctg ctgtttatcg cgtcgatgct gatgtggacc 840
tgcaatacga tgtatattat cgatatgccg ctgtacatta ccgcaagcct gggtctgccg 900
gaacgtctgg ctggtctgct gatgggtacc gcagctggcc tggaaattcc gatcatgctg 960
ctggcgggtt attctgtgcg ttactttggc aaacgcaaga ttatgctgtt cgctgttctg 1020
gcgggtgtcc tgttttatac cggcctggtt ctgtttaaat tcaagacggc cctgatgctg 1080
ctgcagatct ttaacgcaat tttcatcggt attgtggctg gcattggtat gctgtacttc 1140
caagatctga tgccgggtcg tgcaggtgca gcaaccacgc tgtttaccaa tagcatctct 1200
acgggtgtca ttctggcagg cgtgctgcaa ggcggtctga ccgaaacgtg gggccatgac 1260
agcgtctatg tgatggcgat ggtcctgtct attctggccc tgattatctg tgcacgtgtg 1320
cgcgaagctt aaatcgatac tagcataacc ccttggggcc tctaaacgcg tcgacacgca 1380
aaaaggccat ccgtcaggat ggccttctgc ttaatttgat gcctggcagt ttatggcggg 1440
cgtcctgccc gccaccctcc gggccgttgc ttcgcaacgt tcaaatccgc tcccggcgga 1500
tttgtcctac tcaggagagc gttcaccgac aaacaacaga taaaacgaaa ggcccagtct 1560
ttcgactgag cctttcgttt tatttgatgc ctggcagttc cctactctcg catggggaga 1620
ccccacacta ccatcatgta tgaatatcct ccttagttcc tattccgaag ttcctattct 1680
ctagaaagta taggaacttc ggcgcgtcct acctgtgacg gaagatcact tcgcagaata 1740
aataaatcct ggtgtccctg ttgataccgg gaagccctgg gccaactttt ggcgaaaatg 1800
agacgttgat cggcacgtaa gaggttccaa ctttcaccat aatgaaataa gatcactacc 1860
gggcgtattt tttgagttgt cgagattttc aggagctaag gaagctaaaa tggagaaaaa 1920
aatcactgga tataccaccg ttgatatatc ccaatggcat cgtaaagaac attttgaggc 1980
atttcagtca gttgctcaat gtacctataa ccagaccgtt cagctggata ttacggcctt 2040
tttaaagacc gtaaagaaaa ataagcacaa gttttatccg gcctttattc acattcttgc 2100
ccgcctgatg aatgctcatc cggaattacg tatggcaatg aaagacggtg agctggtgat 2160
atgggatagt gttcaccctt gttacaccgt tttccatgag caaactgaaa cgttttcatc 2220
gctctggagt gaataccacg acgatttccg gcagtttcta cacatatatt cgcaagatgt 2280
ggcgtgttac ggtgaaaacc tggcctattt ccctaaaggg tttattgaga atatgttttt 2340
cgtctcagcc aatccctggg tgagtttcac cagttttgat ttaaacgtgg ccaatatgga 2400
caacttcttc gcccccgttt tcaccatggg caaatattat acgcaaggcg acaaggtgct 2460
gatgccgctg gcgattcagg ttcatcatgc cgtttgtgat ggcttccatg tcggcagatg 2520
cttaatgaat acaacagtac tgcgatgagt ggcagggcgg ggcgtaaggc gcgccattta 2580
aatgaagttc ctattccgaa gttcctattc tctagaaagt ataggaactt c 2631
<210> 17
<211> 4259
<212> DNA
<213> Artificial sequence
<220>
<223> Integration cassette
<400> 17
ttactcagca ataaactgat attccgtcag gctggaatac tcttcgccag gacgcaggaa 60
gcagtccggt tgcggccatt cagggtggtt cgggctgtcc ggtagaaact cgctttccag 120
agccagccct tgccagtcgg cgtaaggttc ggttccccgc gacggtgtgc cgccgaggaa 180
gttgccggag tagaattgca gagccggagc ggtggtgtag accttcagct gcaatttttc 240
atctgctgac cagacatgcg ccgccacttt cttgccatcg cctttggcct gtaacaagaa 300
tgcgtgatcg taacctttca ctttgcgctg atcgtcgtcg gcaagaaact cactggcgat 360
gattttggcg ctgcggaaat caaaagacgt tccggcgaca gatttcaggc cgtcgtgcgg 420
aatgccgcct tcatcaaccg gcagatattc gtccgccaga atctgcaact tgtgattgcg 480
cacgtcagac tgctcgccgt caagattgaa atagacgtga ttagtcatat tcaccgggca 540
aggtttatca actgtggcgc gataagtaat ggagatacgg ttatcgtcgg tcagacgata 600
ttgcaccgtc gcgccgagat tacccgggaa gccctgatca ccatcatctg aactcagggc 660
aaacagcacc tgacgatcgt tctggttcac aatctgccag cgacgtttgt cgaacccttc 720
cggcccgccg tgcagctggt taacgccctg acttggcgaa agcgtcacgg tttcaccgtc 780
aaaggtataa cggctattgg cgatacggtt ggcataacga ccaatagagg cccccagaaa 840
cgcggcctga tcctgatagc attccgggct ggcacagccg agcagcgcct cgcggacgct 900
gccatcggaa agcggaatac gggcggaaag taaagtcgca ccccagtcca tcagcgtgac 960
taccatccct gcgttgttac gcaaagttaa cagtcggtac ggctgaccat cgggtgccag 1020
tgcgggagtt tcgttcagca ctgtcctgct ccttgtgatg gtttacaaac gtaaaaagtc 1080
tctttaatac ctgtttttgc ttcatattgt tcagcgacag cttgctgtac ggcaggcacc 1140
agctcttccg ggatcagcgc gacgatacag ccgccaaatc cgccgccggt catgcgtacg 1200
ccacctttgt cgccaatcac agctttgacg atttctacca gagtgtcaat ttgcggcacg 1260
gtgatttcga aatcatcgcg catagaggca tgagactccg ccatcaactc gcccatacgt 1320
ttcaggtcgc cttgctccag cgcgctggca gcttcaacgg tgcgggcgtt ttcagtcagt 1380
atatgacgca cgcgttttgc cacgatcggg tccagttcat gcgcaacagc gttgaactct 1440
tcaatggtga catcacgcag ggctggctgc tggaagaaac gcgcaccggt ttcgcactgt 1500
tcacgacggg tgttgtattc gctgccaacc agggtacgtt tgaagttact gttgatgatg 1560
acgacagcca cacctttggg catggaaact gctttggtcc ccagtgagcg gcaatcgatc 1620
agcaaggcat gatctttctt gccgagcgcg gaaattagct gatccatgat cccgcagtta 1680
cagcctacaa actggttttc tgcttcctga ccgttaagcg cgatttgtgc gccgtccagc 1740
ggcagatgat aaagctgctg caatacggtt ccgaccgcga cttccagtga agcggaagaa 1800
cttaacccgg caccctgcgg cacattgccg ctgatcacca tgtccacgcc gccgaagctg 1860
ttgttacgca gttgcagatg tttcaccacg ccacgaacgt agttagccca ttgatagttt 1920
tcatgtgcga caatgggcgc atcgagggaa aactcgtcga gctgattttc ataatcggct 1980
gccatcacgc gaactttacg gtcatcgcgt ggtgcacaac tgatcacggt ttgataatca 2040
atcgcgcagg gcagaacgaa accgtcgttg tagtcggtgt gttcaccaat caaattcacg 2100
cggccaggcg cctgaatggt gtgagtggca gggtagccaa atgcgttggc aaacagagat 2160
tgtgtttttt ctttcagact catttcttac actccggatt cgcgaaaatg gatatcgctg 2220
actgcgcgca aacgctctgc tgcctgttct gcggtcaggt ctcgctgggt ctctgccagc 2280
atttcataac caaccataaa tttacgtacg gtggcggagc gcagcagagg cggataaaag 2340
tgcgcgtgca gctgccagtg ttgattctct tcgccattaa atggcgcgcc gtgccagccc 2400
atagagtagg ggaaggagca ctggaagagg ttgtcataac gactggtcag ctttttcaac 2460
gccagcgcca gatcgctgcg ctgggcgtcg gtcaaatcgg tgatccgtaa aacgtgggct 2520
ttgggcagca gtagcgtttc gaacggccag gcagcccagt aaggcacgac ggctaaccag 2580
tgttcggttt cgacaacggt acggctaccg tctgccagct cgcgctgaac ataatccacc 2640
agcattggtg atttctgttc ggcaaaatat tctttttgca ggcggtcttc gcgctcagct 2700
tcgttaggca ggaagctatt tgcccaaatc tgaccgtgcg gatgcgggtt agagcagccc 2760
atcgccgcgc ctttgttttc aaaaacctgc acccatgggt acgttttccc cagttctgcg 2820
gtttgctcct gccaggtttt gacgatttcc gtcaatgctg caacgctgag ctctggcagc 2880
gttttactgt gatccggtga aaagcagatc acccggctgg tgccgcgcgc gctctggcaa 2940
cgcatcagcg gatcgtgact ttctggcgca tctggcgtgt cagacatcaa agccgcaaag 3000
tcattagtga aaacgtaagt cccggtgtaa tcggggtttt tatcgcctgt cacccgcaca 3060
ttacctgcgc agaggaagca atctggatcg tgcgcaggta acacctgttt ggctggcgtt 3120
tcctgcgccc cctgccaggg gcgcttagcg cggtgcggtg aaaccagaat ccattgcccg 3180
gtgagcgggt tgtagcggcg atgtggatga tcaacgggat taaattgcgt catggtcgtt 3240
ccttaatcgg gatatccctg tggatggcgt gactgccagt gccaggtgtc ctgcgccatt 3300
tcatcgagtg tgcgcgttac gcgccagttc agttcacggt cggctttgct ggcgtccgcc 3360
cagtaggccg gaaggtcgcc ctcgcgacgc ggtgcaaaat gataattaac cggtttgccg 3420
caggctttgc tgaaggcatt aaccacgtcc agcacgctgt tgcctacgcc agcgccgagg 3480
ttgtagatgt gtacgcctgg cttgttcgcc agtttttcca tcgccacgac gtgaccgtcc 3540
gccagatcca ttacgtggat gtaatcgcgt acgccagtac catcttcggt cggataatcg 3600
ttaccaaaaa tcgccagcga gtcgcgacgg cctacagcaa cctgggcgat gtatggcatc 3660
aggttattcg gaatgccttg cggatcttcg cccatatcgc ccgacggatg cgcgccaacc 3720
gggttgaagt agcgcagcag ggcaatgctc cagtccggct gggctttttg cagatcggtg 3780
aggatctgtt ccaccatcag cttgcttttg ccgtaagggc tttgcggtgt gccggtcggg 3840
aagctttcaa cgtatggaat tttgggctga tcgccataaa cggtggcgga ggagctaaaa 3900
ataaagtttt tgacgttagc ggcgcgcatg gcgctaatca ggcgcagagt gccgttgaca 3960
ttgttgtcgt aatattccag cggtttttgt accgattcgc ccacggcttt cagcccggcg 4020
aagtggatca cggtgtcgat agcgtgatcg tgcaggatct cggtcatcaa cgcttcgtta 4080
cgaatatcgc cttcaacaaa cgttggatgt ttgccgccta aacgctcgat aacaggcagt 4140
acgctgcgct tactgttaca gaggttatca agaatgatga catcatgacc gttttgcagt 4200
aattgcacac aggtatgact tccaatgtaa ccgctaccac cggtaaccag aactctcat 4259
<210> 18
<211> 4223
<212> DNA
<213> Artificial sequence
<220>
<223> Integration cassette
<400> 18
tggccagatg attaattcct aatttttgtt gacactctat cattgataga gttattttac 60
cactccctat cagtgataga gaaaagtgaa atgaatagtt cgacaaaaat ctagaaataa 120
ttttgtttaa ctttaagaag gagatataca aatgcaaaaa ctactatctt taccgtccaa 180
tctggttcag tcttttcatg aactggagag ggtgaatcgt accgattggt tttgtacttc 240
cgacccggta ggtaagaaac ttggttccgg tggtggaaca tcctggctgc ttgaagaatg 300
ttataatgaa tattcagatg gtgctacttt tggagagtgg cttgaaaaag aaaaaagaat 360
tcttcttcat gcgggtgggc aaagccgtcg tttacccggc tatgcacctt ctggaaagat 420
tctcactccg gttcctgtgt tccggtggga gagagggcaa catctgggac aaaatctgct 480
ttctctgcaa cttcccctat atgaaaaaat catgtctttg gctccggata aactccatac 540
actgattgcg agtggtgatg tctatattcg ttcggagaaa cctttgcaga gtattcccga 600
agcggatgtg gtttgttatg gactgtgggt agatccgtct ctggctaccc atcatggcgt 660
gtttgcttcc gatcgcaaac atcccgaaca actcgacttt atgcttcaga agccttcgtt 720
ggcagaattg gaatctttat cgaagaccca tttgttcctg atggacatcg gtatatggct 780
tttgagtgac cgtgccgtag aaatcttgat gaaacgttct cataaagaaa gctctgaaga 840
actaaagtat tatgatcttt attccgattt tggattagct ttgggaactc atccccgtat 900
tgaagacgaa gaggtcaata cgctatccgt tgctattctg cctttgccgg gaggagagtt 960
ctatcattac gggaccagta aagaactgat ttcttcaact ctttccgtac agaataaggt 1020
ttacgatcag cgtcgtatca tgcaccgtaa agtaaagccc aatccggcta tgtttgtcca 1080
aaatgctgtc gtgcggatac ctctttgtgc cgagaatgct gatttatgga tcgagaacag 1140
tcatatcgga ccaaagtgga agattgcttc acgacatatt attaccgggg ttccggaaaa 1200
tgactggtca ttggctgtgc ctgccggagt gtgtgtagat gtggttccga tgggtgataa 1260
gggctttgtt gcccgtccat acggtctgga cgatgttttc aaaggagatt tgagagattc 1320
caaaacaacc ctgacgggta ttccttttgg tgaatggatg tccaaacgcg gtttgtcata 1380
tacagatttg aaaggacgta cggacgattt acaggcagtt tccgtattcc ctatggttaa 1440
ttctgtagaa gagttgggat tggtgttgag gtggatgttg tccgaacccg aactggagga 1500
aggaaagaat atctggttac gttccgaaca tttttctgcg gacgaaattt cggcaggtgc 1560
caatctgaag cgtttgtatg cacaacgtga agagttcaga aaaggaaact ggaaagcatt 1620
ggccgttaat catgaaaaaa gtgtttttta tcaacttgat ttggccgatg cagctgaaga 1680
ttttgtacgt cttggtttgg atatgcctga attattgcct gaggatgctc tgcagatgtc 1740
acgcatccat aaccggatgt tgcgtgcgcg tattttgaaa ttagacggga aagattatcg 1800
tccggaagaa caggctgctt ttgatttgct tcgtgacggc ttgctggacg ggatcagtaa 1860
tcgtaagagt accccaaaat tggatgtata ttccgatcag attgtttggg gacgtagccc 1920
cgtgcgcatc gatatggcag gtggatggac cgatactcct ccttattcac tttattcggg 1980
aggaaatgtg gtgaatctag ccattgagtt gaacggacaa cctcccttac aggtctatgt 2040
gaagccgtgt aaagacttcc atatcgtcct gcgttctatc gatatgggtg ctatggaaat 2100
agtatctacg tttgatgaat tgcaagatta taagaagatc ggttcacctt tctctattcc 2160
gaaagccgct ctgtcattgg caggctttgc acctgcgttt tctgctgtat cttatgcttc 2220
attagaggaa cagcttaaag atttcggtgc aggtattgaa gtgactttat tggctgctat 2280
tcctgccggt tccggtttgg gcaccagttc cattctggct tctaccgtac ttggtgccat 2340
taacgatttc tgtggtttag cctgggataa aaatgagatt tgtcaacgta ctcttgttct 2400
tgaacaattg ctgactaccg gaggtggatg gcaggatcag tatggaggtg tgttgcaggg 2460
tgtgaagctt cttcagaccg aggccggctt tgctcaaagt ccattggtgc gttggctacc 2520
cgatcattta tttacgcatc ctgaatacaa agactgtcac ttgctttatt ataccggtat 2580
aactcgtacg gcaaaaggga tcttggcaga aatagtcagt tccatgttcc tcaattcatc 2640
gttgcatctc aatttacttt cggaaatgaa ggcgcatgca ttggatatga atgaagctat 2700
acagcgtgga agttttgttg agtttggccg tttggtagga aaaacctggg aacaaaacaa 2760
agcattggat agcggaacaa atcctccggc tgtggaggca attatcgatc tgataaaaga 2820
ttataccttg ggatataaat tgccgggagc cggtggtggc gggtacttat atatggtagc 2880
gaaagatccg caagctgctg ttcgtattcg taagatactg acagaaaacg ctccgaatcc 2940
gcgggcacgt tttgtcgaaa tgacgttatc tgataaggga ttccaagtat cacgatcata 3000
actgaaacca atttgcctgg cggcagtagc gcggtggtcc cacctgaccc catgccgaac 3060
tcagaagtga aacgccgtag cgccgatggt agtgtggggt ctccccatgc gagagtaggg 3120
aactgccagg catcaaataa aacgaaaggc tcagtcgaaa gactgggcct ttcgggatcc 3180
aggccggcct gttaagacgg ccagtgaatt cgagctcggt acctaccgtt cgtataatgt 3240
atgctatacg aagttatcga gctctagaga atgatcccct cattaggcca cacgttcaag 3300
tgcagcgcac accgtggaaa cggatgaagg cacgaaccca gttgacataa gcctgttcgg 3360
ttcgtaaact gtaatgcaag tagcgtatgc gctcacgcaa ctggtccaga accttgaccg 3420
aacgcagcgg tggtaacggc gcagtggcgg ttttcatggc ttgttatgac tgtttttttg 3480
tacagtctat gcctcgggca tccaagcagc aagcgcgtta cgccgtgggt cgatgtttga 3540
tgttatggag cagcaacgat gttacgcagc agcaacgatg ttacgcagca gggcagtcgc 3600
cctaaaacaa agttaggtgg ctcaagtatg ggcatcattc gcacatgtag gctcggccct 3660
gaccaagtca aatccatgcg ggctgctctt gatcttttcg gtcgtgagtt cggagacgta 3720
gccacctact cccaacatca gccggactcc gattacctcg ggaacttgct ccgtagtaag 3780
acattcatcg cgcttgctgc cttcgaccaa gaagcggttg ttggcgctct cgcggcttac 3840
gttctgccca ggtttgagca gccgcgtagt gagatctata tctatgatct cgcagtctcc 3900
ggcgagcacc ggaggcaggg cattgccacc gcgctcatca atctcctcaa gcatgaggcc 3960
aacgcgcttg gtgcttatgt gatctacgtg caagcagatt acggtgacga tcccgcagtg 4020
gctctctata caaagttggg catacgggaa gaagtgatgc actttgatat cgacccaagt 4080
accgccacct aacaattcgt tcaagccgag atcgtagaat ttcgacgacc tgcagccaag 4140
cataacttcg tataatgtat gctatacgaa cggtaggatc ctctagagtc gacctgcagg 4200
catgagatgt gtataagaga cag 4223
<210> 19
<211> 3792
<212> DNA
<213> Artificial sequence
<220>
<223> Integration cassette
<400> 19
gggaattgat tctggtacca aatgagtcga ccggccagat gattaattcc taatttttgt 60
tgacactcta tcattgatag agttatttta ccactcccta tcagtgatag agaaaagtga 120
aatgaatagt tcgacaaaaa tctagaaata attttgttta actttaagaa ggagatatac 180
aaatgattac ccgcaaaagg cgggccagga caatccatag ccgatatcca atcggaattt 240
acgggagcat agtaatgaca gatattgcac agttgcttgg caaagacgcc gacaaccttt 300
tacagcaccg ttgtatgact attccttctg accagcttta tctccccgga catgactacg 360
tagaccgcgt gatgattgac aataatcgcc cgccagcggt gttacgtaat atgcagacgt 420
tgtacaacac tgggcgtctg gctggcacag gatatctttc tattctgccg gttgaccagg 480
gcgttgagca ctctgccgga gcttcatttg ctgctaaccc gctctacttt gacccgaaaa 540
acattgttga actggcgatc gaagcgggct gtaactgtgt ggcatcaact tacggcgtgt 600
tggcgtcggt atcgcggcgc tatgcgcatc gcattccatt cctcgtcaaa cttaatcaca 660
acgagacgct aagttacccg aacacctacg atcaaacgct gtatgccagc gtggagcagg 720
ccttcaacat gggcgcggtg gcggttggtg cgactatcta ttttggttcg gaagagtcac 780
gtcgccagat tgaagaaatt tctgcggctt ttgaacgtgc gcacgagctg ggcatggtga 840
cagtgctgtg ggcctatttg cgtaactccg cctttaagaa agatggcgtt gattaccatg 900
tttccgccga cctgaccggt caggcaaacc atctggcggc gaccataggt gcagatatcg 960
tcaaacaaaa aatggcggaa aataacggcg gctataaagc aattaattac ggttataccg 1020
acgatcgcgt gtacagcaag ttaaccagcg aaaacccgat tgatctggtg cgttatcagt 1080
tagctaactg ctatatgggc cgggccgggt tgataaactc cggcggtgct gcaggcggtg 1140
aaactgacct cagcgatgca gtgcgtactg cggttatcaa caaacgcgct ggcggaatgg 1200
ggctgattct tggacgtaag gcgttcaaga aatcgatggc tgacggcgtg aaactgatta 1260
acgccgtgca ggatgtttat ctcgatagca aaattactat cgcctaagag gatcgagatc 1320
tcgatcccgc gaaattaata cgactcacta taggggaatt gtgagcggat aacaattccc 1380
ctctagaaat aattttgttt aactttaaga aggagatata ccatgggcca tcatcatcat 1440
catcatcatc atcatcacag cagcggccat atcgaaggtc gtcatatggc ggtgaaagaa 1500
gcgaccagcg agaccaagaa gcgtagcggt tacgagatca ttaccctgac cagctggctg 1560
ctgcaacaag aacagaaggg tatcattgac gcggaactga ccatcgttct gagcagcatt 1620
agcatggcgt gcaaacagat cgcgagcctg gtgcaacgtg cgaacattag caacctgacc 1680
ggtacccaag gcgcggttaa catccagggt gaagaccaaa agaaactgga tgttattagc 1740
aacgaggtgt tcagcaactg cctgcgtagc agcggtcgta ccggcatcat tgcgagcgag 1800
gaagaggacg tggcggttgc ggtggaagag agctacagcg gtaactatat cgtggttttt 1860
gacccgctgg atggcagcag caacctggat gcggctgtga gcaccggtag catcttcggc 1920
atttacagcc cgaacgacga gagcctgccg gattttggtg acgatagcga cgataacacc 1980
ctgggcaccg aagagcaacg ttgcatcgtt aacgtgtgcc aaccgggtag caacctgctg 2040
gcggcgggct actgcatgta tagcagcagc gttgcgttcg tgctgaccat tggcaagggc 2100
gttttcgtgt ttaccctgga cccgctgtac ggtgaattcg tgctgaccca ggagaacctg 2160
caaatcccga agagcggtga aatttacagc tttaacgagg gcaactataa actgtgggat 2220
gaaaacctga agaaatatat cgacgatctg aaggaaccgg gtccgagcgg taaaccgtac 2280
agcgcgcgtt atatcggtag cctggttggc gacttccacc gtaccctgct gtacggtggc 2340
atttacggtt atccgcgtga taagaaaagc aagaacggca aactgcgtct gctgtatgaa 2400
tgcgcgccga tgagctttat tgttgagcag gcgggtggca aaggtagcga cggccaccag 2460
cgtgtgctgg atatccaacc gaccgaaatt caccagcgtg ttccgctgta cattggtagc 2520
accgaagagg ttgaaaaagt tgaaaagtat ctggcgtaat cgagtctggt aaagaaaccg 2580
ctgctgcgaa atttgaacgc cagcacatgg actcgtctac tagcgcagct taattaacct 2640
aggctgctgc caccgctgag caataactag cataacccct tggggcctct aaacgggtct 2700
tgaggggttt tttgctgaaa ggaggaacta tatccggatt ggcgaatggg acgcgccctg 2760
tagcggcgca ttaagcgcgg cgggtggacg gccagtgaat tcgagctcgg tacctaccgt 2820
tcgtataatg tatgctatac gaagttatcg agctctagag aatgatcccc tcattaggcc 2880
acacgttcaa gtgcagcgca caccgtggaa acggatgaag gcacgaaccc agttgacata 2940
agcctgttcg gttcgtaaac tgtaatgcaa gtagcgtatg cgctcacgca actggtccag 3000
aaccttgacc gaacgcagcg gtggtaacgg cgcagtggcg gttttcatgg cttgttatga 3060
ctgttttttt gtacagtcta tgcctcgggc atccaagcag caagcgcgtt acgccgtggg 3120
tcgatgtttg atgttatgga gcagcaacga tgttacgcag cagcaacgat gttacgcagc 3180
agggcagtcg ccctaaaaca aagttaggtg gctcaagtat gggcatcatt cgcacatgta 3240
ggctcggccc tgaccaagtc aaatccatgc gggctgctct tgatcttttc ggtcgtgagt 3300
tcggagacgt agccacctac tcccaacatc agccggactc cgattacctc gggaacttgc 3360
tccgtagtaa gacattcatc gcgcttgctg ccttcgacca agaagcggtt gttggcgctc 3420
tcgcggctta cgttctgccc aggtttgagc agccgcgtag tgagatctat atctatgatc 3480
tcgcagtctc cggcgagcac cggaggcagg gcattgccac cgcgctcatc aatctcctca 3540
agcatgaggc caacgcgctt ggtgcttatg tgatctacgt gcaagcagat tacggtgacg 3600
atcccgcagt ggctctctat acaaagttgg gcatacggga agaagtgatg cactttgata 3660
tcgacccaag taccgccacc taacaattcg ttcaagccga gatcgtagaa tttcgacgac 3720
ctgcagccaa gcataacttc gtataatgta tgctatacga acggtaggat cctctagagt 3780
cgacctgcag gc 3792
<210> 20
<211> 5917
<212> DNA
<213> Artificial sequence
<220>
<223> Transposon cassette
<400> 20
acaggttggc tgataagtcc ccggtctagc ttgcatgcag attgcagcat tacacgtctt 60
gatttgacgg ctagctcagt cctaggtaca gtgctagcac tgctttgtgg aaggagatag 120
acttatggcg gatccgatgg aatacctcga tgtgtcgttc ggcggcacgt tcgctgcaga 180
cacctacacc acaggtggcg acgaggtggc gaagggcccc gtgaccaagc acggcagcat 240
accgaccaag cttgacggcg gcggcatcac cctcgctggc ggcaccaacg gcgtgacatt 300
cacctcgacc gcgagcttca gcgagagtgg gaaggtgaac aagggattcc gcgccgaaat 360
ggagtaccgt acgacgcaga cgcccagcaa cctcgccaca ttgttctccg ccatgggcaa 420
catcttcgtg cgggcgaacg gcagcaacct cgaatacggc ttctccacga acccttccgg 480
cagtacatgg aacgactaca caaagtccgt gacgctgcct tccaacaatg tgaagcacat 540
catccagctg acatatctgc cgggagccga cggcgctgcc tcgacgttgc agttgtcggt 600
ggatggcgtg gccggcgaga ccgccacctc cgcggccggc gagctcgcgg ccgtcagcga 660
ttccgtcggg aacaagttcg ggatcggcta cgaggtgaac cccgcttccg gcgcggcgag 720
ccgcggtctt gccggtgacg tgttccgcgc gcgtgtcgcc gattcggacg ccccgtggga 780
gattcttgac gcatcccagc tgctgcatgt caatttcaac ggcacgttca gcggcacctc 840
atataccgcg gcgagcggcg agcagatgct gggctcgctg gtgtcgcgct cggccaatcc 900
gtccatctcg aactccgccg tcacgctggg cggcggcacg gccggattcg atttcacgcc 960
cacggacttc accctcggtg acaacgaggc catcacccgc ccgctggtcg cggagctgcg 1020
cttcaccccg acgcagaccg gcgacaacca gaccctgttc ggcgcgggcg gcaacctgtt 1080
cctgcgctac gagtcgaaca agctcgtgtt cggcgcctcc accaagtccg gcgataattg 1140
gaccgaccac aagatcgagt ccgcggccgc cacgggtgcg gagcacgtcg tgtcggtggc 1200
gtacgtgccc aataaggccg gcaccggcgc gaagcttgtc atgcgcgtgg atggcggcga 1260
cgcccagacc aaggacatca ctggtctggc ttacctgaat tcgagcatca agggcaaggt 1320
cggcttcggc aacgacgtgc ataccgacgc gctcagccgc ggcttcgtcg gctcgctgag 1380
cgagatccgc ctggccgaaa cctccgcgaa cttcaccacc aacgaattca agctggtcta 1440
ctctcaggtc agctgcgaca cgtcgggcat caaggaggcg aataccttcg acgtggagcc 1500
cgccgagtgc gaggccgcgc ttaagaccaa gctgtccaag ctgcgtccga ccgaagggca 1560
ggccgactac atcgactggg gtcagatcgg attcctccat tacggcatca acacgtacta 1620
caaccaggag tggggtcacg gtaacgagga tccctcccgc atcaacccga ccggcctcga 1680
caccgaccag tgggcgaagt ccttcgccga cggtggcttc aagatgatca tggtgacggt 1740
caagcaccat gacggtttcg agctgtacga ctcgcggtac aacaccgagc acgactgggc 1800
aaacaccgcc gtcgccaagc gcacggggga gaaggacctg ttccgcaaga ttgtcgcctc 1860
ggcgaagaaa tacggcctga aggtcggcat ctactattcg ccggccgatt cctacatgga 1920
gaggaagggc gtctggggca acaactccgc acgcgtcgag cgcacgatcc ccacgctggt 1980
ggagaacgac gaccgcgccg gcaaggtggc ttccggcaaa ctgcccacgt tcaagtacaa 2040
ggccacggat tacggcgcct acatgctcaa ccagctctat gagctgctga ctgagtacgg 2100
cgacatctcc gaggtctggt tcgacggtgc ccaaggcaac accgcaggca ctgagcatta 2160
cgactatggc gtgttctacg agatgatccg ccggcttcag ccccaggcaa ttcaggccaa 2220
cgccgcatac gatgcccgat gggtgggcaa cgaggacggc tgggcccgtc agaccgagtg 2280
gagcccgcag gcggcataca acgacggcgt ggacaaggtg tcgctcaagc ctggccagat 2340
ggcccccgac ggtaagcttg gcagcatgtc gagcgtgctg tccgagatcc gcagcggcgc 2400
cgccaaccag ctgcactggt atccggccga agtcgacgcc aagaaccggc ccggatggtt 2460
ctaccgtgcc agccaatcgc cggcgtccgt agccgaagtc gtgaagtact acgagcagtc 2520
cacgggacgc aactcgcagt atctgctgaa cgtcccaccg tccgataccg gcaagctcgc 2580
cgatgcggat gccgcgggac ttaaggggct gggcgaggag ctcgcccgac gctacggcac 2640
cgatcttgcc ctgggcaaga gcgcgaccgt cgccgcgtcc gcgaacgaca ctgcggtagc 2700
ggccccgaag ctgaccgacg gttcgaagct ctcctccgac aaggccgtgg gcaatacgcc 2760
gacgtacacc atcgatctgg gcagcactgt cgccgtggat gcagtgaaga tctccgagga 2820
cgtgcgcaat gccggccagc agatcgaaag cgccactctg cagggacgag tcaatggaac 2880
atggacgaat ctggcgacta tgacgacggt cgggcagcag cgcgaccttc gcttcacgtc 2940
ccagaacatc gatgccatcc gtctggtggt caactcctcc cgcggtccgg tgcgtctgag 3000
ccgtcttgag gtgttccaca ccgaatccga gattcagacc ggcgcccgcg cctactacat 3060
cgatccgacg gcgcagaccg cgggagatgg attcacgaag gacaagccca tgacgtcgat 3120
cgagcagctg cacgatgtga ccgtcgcgcc aggctccgtg atcttcgtca aggcgggcac 3180
cgagctgacc ggggacttcg ccgtcttcgg ctacggcacc aaggacgagc ccatcaccgt 3240
gacgacatac ggcgaaagcg acaaagccac caccgcgagc ttcgacggca tgaccgccgg 3300
gctgacgctg aagcaggcgc tgaaggcgct cggcaaggac gacgccggct gggtcgtggc 3360
cgattccgcc actgcaccgg cctcccgcgt gtatgtcccg caggatgaga tcagcgtgca 3420
cgcccagtcg tcgcagaact ccggcgcaga ggcggcgagg gcgctcgacg gcgactcgtc 3480
gacgagctgg cactcccagt acagcccgac caccgcgtct gctccgcatt gggtgactct 3540
cgatctcggc aaatcgcgtg agaacgtcgc ctacttcgac tacctcgccc gtatcgacgg 3600
caacaataac ggtgccgcca aggattacga ggtgtatgtc tccgacgatc ccaacgattt 3660
tggagcccct gtggcctcgg gcacgttgaa gaacgtcgcc tacacgcagc gcatcaagct 3720
gacccccaag aacggacggt acgtcaagtt cgtcatcaag accgattatt ccggatcgaa 3780
cttcggctcc gcggcggaaa tgaatgtcga gttgctgccc acggccgtag aggaggacaa 3840
ggtcgccacc ccgcagaagc cgacagtgga cgatgatgcc gatacataca ccatccccga 3900
catcgaggga gtcgtgtaca aggtcgacgg caaggtgttg gccgctggtt ccgtagtgaa 3960
cgtgggcgat gaggacgtga ccgtcacggt caccgccgag cccgccgacg gataccgctt 4020
cccggatggt gtgacgtccc cagtcacgta tgagctgacg ttcaccaaga agggtggcga 4080
gaagcctccg accgaagtca acaaggacaa gctgcacgcc acgatcacca aggctcaggc 4140
gatcgaccgt tccgcctata cggacgagtc gctcaaggtg cttgatgaca agctcgccgc 4200
agcgctcaag gtctatgacg atgacaaggt gagccaggat gatgtcgatg ccgccgaggc 4260
ggctctgtct gcggcgatcg acgcgctgaa gaccaagccg acgacccccg gcggtgaagg 4320
tgagaagcct ggtgaaggtg aaaagcccgg tgacggcaac aagcccggtg acggcaagaa 4380
gcccggcgac gtgatcgcaa agaccggcgc ctccacaatg taactagcat aaccccttgg 4440
ggcctctaaa cgggtcttga ggggtttttt gctgaaacca atttgcctgg cggcagtagc 4500
gcggtggtcc cacctgaccc catgccgaac tcagaagtga aacgccgtag cgccgatggt 4560
agtgtggggt ctccccatgc gagagtaggg aactgccagg catcaaataa aacgaaaggc 4620
tcagtcgaaa gactgggcct ttcgggatcc aggccggcct gttaacgaat taatcttccg 4680
cggcggtatc gataagcttg atatcgaatt ccgaagttcc tattctctag acgccattca 4740
ggctgcgcaa ctgttgggaa gggcgatcgg tgcgggcctc ttcgctatta cgccagctgg 4800
cgaaaggggg atgtgctgca aggcgattaa gttgggtaac gccagggttt tcccagtcac 4860
gacgttgtaa aacgacggcc agtgaattcg agctcggtac ctaccgttcg tataatgtat 4920
gctatacgaa gttatcgagc tctagagaat gatcccctca ttaggccaca cgttcaagtg 4980
cagcgcacac cgtggaaacg gatgaaggca cgaacccagt tgacataagc ctgttcggtt 5040
cgtaaactgt aatgcaagta gcgtatgcgc tcacgcaact ggtccagaac cttgaccgaa 5100
cgcagcggtg gtaacggcgc agtggcggtt ttcatggctt gttatgactg tttttttgta 5160
cagtctatgc ctcgggcatc caagcagcaa gcgcgttacg ccgtgggtcg atgtttgatg 5220
ttatggagca gcaacgatgt tacgcagcag caacgatgtt acgcagcagg gcagtcgccc 5280
taaaacaaag ttaggtggct caagtatggg catcattcgc acatgtaggc tcggccctga 5340
ccaagtcaaa tccatgcggg ctgctcttga tcttttcggt cgtgagttcg gagacgtagc 5400
cacctactcc caacatcagc cggactccga ttacctcggg aacttgctcc gtagtaagac 5460
attcatcgcg cttgctgcct tcgaccaaga agcggttgtt ggcgctctcg cggcttacgt 5520
tctgcccaga tttgagcagc cgcgtagtga gatctatatc tatgatctcg cagtctccgg 5580
cgagcaccgg aggcagggca ttgccaccgc gctcatcaat ctcctcaagc atgaggccaa 5640
cgcgcttggt gcttatgtga tctacgtgca agcagattac ggtgacgatc ccgcagtggc 5700cgcgcttggt gcttatgtga tctacgtgca agcagattac ggtgacgatc ccgcagtggc 5700
tctctataca aagttgggca tacgggaaga agtgatgcac tttgatatcg acccaagtac 5760tctctataca aagttgggca tacgggaaga agtgatgcac tttgatatcg acccaagtac 5760
cgccacctaa caattcgttc aagccgagat cgtagaattt cgacgacctg cagccaagca 5820cgccacctaa caattcgttc aagccgagat cgtagaattt cgacgacctg cagccaagca 5820
taacttcgta taatgtatgc tatacgaacg gtaggatcct ctagagtcga ccaggtggca 5880taacttcgta taatgtatgc tatacgaacg gtaggatcct ctagagtcga ccaggtggca 5880
cttttcgggc agaccgggga cttatcagcc aacctgt 5917cttttcgggc agaccgggga cttatcagcc aacctgt 5917
Claims (15)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP18172658.9 | 2018-05-16 |
Publications (2)
Publication Number | Publication Date |
---|---|
RU2020138045A (en) | 2022-06-17 |
RU2810730C2 (en) | 2023-12-28 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2845905A1 (en) * | 2013-09-10 | 2015-03-11 | Jennewein Biotechnologie GmbH | Production of oligosaccharides |
RU2570556C2 (en) * | 2010-07-29 | 2015-12-10 | BASF SE | Method of producing glycosides of acrylate derivatives using polysaccharides and glycosidases or glycosyltransferases |
RU2591123C2 (en) * | 2011-05-13 | 2016-07-10 | Nestec S.A. | Food products containing human milk oligosaccharides and methods for production thereof |
EP3131912A1 (en) * | 2014-01-20 | 2017-02-22 | Jennewein Biotechnologie GmbH | PROCESS FOR EFFICIENT PURIFICATION OF NEUTRAL HUMAN MILK OLIGOSACCHARIDES (HMOs) FROM MICROBIAL FERMENTATION |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
CN112119164A (en) | Use of glycosidases in oligosaccharide production | |
AU2018296557B2 (en) | Fucosyltransferases and their use in producing fucosylated oligosaccharides | |
KR102554781B1 (en) | Improved process for production of fucosylated oligosaccharides | |
CN113166789A (en) | Synthesis of fucosylated oligosaccharide LNFP-V | |
KR101544184B1 (en) | Variant Microorganism for Producing 2-Fucosyllactose and Method of Producing 2-Fucosyllactose by Using the Same | |
US10519475B1 (en) | Biosynthesis of compounds in yeast | |
WO2023099680A1 (en) | Cells with tri-, tetra- or pentasaccharide importers useful in oligosaccharide production | |
EP4341417A1 (en) | Identification of an alpha -1,2-fucosyltransferase for the in vivo production of pure lnfp-i | |
RU2810730C2 (en) | Use of glycosidases in oligosacharide production | |
JP2024516207A (en) | Microbial strains expressing invertase/sucrose hydrolase | |
CN116676243A (en) | Construction method and application of recombinant escherichia coli producing 2' -fucosyllactose | |
CN116802286A (en) | Bacterial strain for producing DFL | |
RU2790445C2 (en) | Improved method for production of fucosylated oligosaccharides | |
RU2818835C2 (en) | Fucosyltransferases and their use for obtaining fucosylated oligosaccharides | |
JP7331278B1 (en) | A novel sialyltransferase for the in vivo synthesis of 3'SL | |
WO2024133702A2 (en) | New fucosyltransferases for production of 3fl | |
DK202200591A1 (en) | New sialyltransferases for in vivo synthesis of lst-c | |
WO2024175777A1 (en) | Product specific transporter for in vivo synthesis of human milk oligosaccharides | |
WO2023166035A2 (en) | New sialyltransferases for in vivo synthesis of 3'sl and 6'sl | |
WO2024046994A1 (en) | Fermentative production of oligosaccharides by microbial cells utilizing glycerol | |
WO2023166034A1 (en) | New sialyltransferases for in vivo synthesis of lst-a | |
WO2023209098A1 (en) | Hmo producing microorganism with increased robustness towards glucose gradients | |
CN117321071A (en) | Microbial strains expressing invertase/sucrose hydrolase |