CN112119164A - 糖苷酶在低聚糖生产中的用途 - Google Patents
糖苷酶在低聚糖生产中的用途 Download PDFInfo
- Publication number
- CN112119164A CN112119164A CN201980032286.2A CN201980032286A CN112119164A CN 112119164 A CN112119164 A CN 112119164A CN 201980032286 A CN201980032286 A CN 201980032286A CN 112119164 A CN112119164 A CN 112119164A
- Authority
- CN
- China
- Prior art keywords
- ala
- gly
- thr
- val
- ser
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 150000002482 oligosaccharides Chemical class 0.000 title claims abstract description 145
- 229920001542 oligosaccharide Polymers 0.000 title claims abstract description 143
- 238000004519 manufacturing process Methods 0.000 title claims abstract description 50
- 102000005744 Glycoside Hydrolases Human genes 0.000 title claims description 46
- 108010031186 Glycoside Hydrolases Proteins 0.000 title claims description 46
- 230000000813 microbial effect Effects 0.000 claims abstract description 161
- 239000006227 byproduct Substances 0.000 claims abstract description 45
- 235000000346 sugar Nutrition 0.000 claims abstract description 29
- 230000015572 biosynthetic process Effects 0.000 claims abstract description 26
- 230000003834 intracellular effect Effects 0.000 claims abstract description 12
- 230000002503 metabolic effect Effects 0.000 claims abstract description 12
- 102000004190 Enzymes Human genes 0.000 claims abstract description 11
- 108090000790 Enzymes Proteins 0.000 claims abstract description 11
- 230000010039 intracellular degradation Effects 0.000 claims abstract 2
- 238000000034 method Methods 0.000 claims description 50
- SNFSYLYCDAVZGP-UHFFFAOYSA-N UNPD26986 Natural products OC1C(O)C(O)C(C)OC1OC1C(OC2C(OC(O)C(O)C2O)CO)OC(CO)C(O)C1O SNFSYLYCDAVZGP-UHFFFAOYSA-N 0.000 claims description 28
- 229940062827 2'-fucosyllactose Drugs 0.000 claims description 17
- HWHQUWQCBPAQQH-UHFFFAOYSA-N 2-O-alpha-L-Fucosyl-lactose Natural products OC1C(O)C(O)C(C)OC1OC1C(O)C(O)C(CO)OC1OC(C(O)CO)C(O)C(O)C=O HWHQUWQCBPAQQH-UHFFFAOYSA-N 0.000 claims description 17
- WJPIUUDKRHCAEL-UHFFFAOYSA-N 3FL Natural products OC1C(O)C(O)C(C)OC1OC1C(OC2C(C(O)C(O)C(CO)O2)O)C(CO)OC(O)C1O WJPIUUDKRHCAEL-UHFFFAOYSA-N 0.000 claims description 17
- IEQCXFNWPAHHQR-UHFFFAOYSA-N lacto-N-neotetraose Natural products OCC1OC(OC2C(C(OC3C(OC(O)C(O)C3O)CO)OC(CO)C2O)O)C(NC(=O)C)C(O)C1OC1OC(CO)C(O)C(O)C1O IEQCXFNWPAHHQR-UHFFFAOYSA-N 0.000 claims description 15
- 229940062780 lacto-n-neotetraose Drugs 0.000 claims description 15
- RBMYDHMFFAVMMM-PLQWBNBWSA-N neolactotetraose Chemical compound O([C@H]1[C@H](O)[C@H]([C@@H](O[C@@H]1CO)O[C@@H]1[C@H]([C@H](O[C@H]([C@H](O)CO)[C@H](O)[C@@H](O)C=O)O[C@H](CO)[C@@H]1O)O)NC(=O)C)[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O RBMYDHMFFAVMMM-PLQWBNBWSA-N 0.000 claims description 15
- 239000007857 degradation product Substances 0.000 claims description 14
- 239000000203 mixture Substances 0.000 claims description 14
- 101000893749 Arabidopsis thaliana Alpha-L-fucosidase 3 Proteins 0.000 claims description 13
- AXQLFFDZXPOFPO-UHFFFAOYSA-N UNPD216 Natural products O1C(CO)C(O)C(OC2C(C(O)C(O)C(CO)O2)O)C(NC(=O)C)C1OC(C1O)C(O)C(CO)OC1OC1C(O)C(O)C(O)OC1CO AXQLFFDZXPOFPO-UHFFFAOYSA-N 0.000 claims description 13
- AXQLFFDZXPOFPO-UNTPKZLMSA-N beta-D-Galp-(1->3)-beta-D-GlcpNAc-(1->3)-beta-D-Galp-(1->4)-beta-D-Glcp Chemical compound O([C@@H]1O[C@H](CO)[C@H](O)[C@@H]([C@H]1O)O[C@H]1[C@@H]([C@H]([C@H](O)[C@@H](CO)O1)O[C@H]1[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O1)O)NC(=O)C)[C@H]1[C@H](O)[C@@H](O)[C@H](O)O[C@@H]1CO AXQLFFDZXPOFPO-UNTPKZLMSA-N 0.000 claims description 13
- USIPEGYTBGEPJN-UHFFFAOYSA-N lacto-N-tetraose Natural products O1C(CO)C(O)C(OC2C(C(O)C(O)C(CO)O2)O)C(NC(=O)C)C1OC1C(O)C(CO)OC(OC(C(O)CO)C(O)C(O)C=O)C1O USIPEGYTBGEPJN-UHFFFAOYSA-N 0.000 claims description 13
- 108700023372 Glycosyltransferases Proteins 0.000 claims description 10
- 230000002255 enzymatic effect Effects 0.000 claims description 10
- 102100021700 Glycoprotein-N-acetylgalactosamine 3-beta-galactosyltransferase 1 Human genes 0.000 claims description 9
- 101000896564 Homo sapiens Glycoprotein-N-acetylgalactosamine 3-beta-galactosyltransferase 1 Proteins 0.000 claims description 9
- 102000012086 alpha-L-Fucosidase Human genes 0.000 claims description 9
- 108010061314 alpha-L-Fucosidase Proteins 0.000 claims description 9
- 101710098620 Alpha-1,2-fucosyltransferase Proteins 0.000 claims description 8
- 108010019236 Fucosyltransferases Proteins 0.000 claims description 8
- 102000006471 Fucosyltransferases Human genes 0.000 claims description 8
- 108010046068 N-Acetyllactosamine Synthase Proteins 0.000 claims description 8
- FZIVHOUANIQOMU-YIHIYSSUSA-N alpha-L-Fucp-(1->2)-beta-D-Galp-(1->3)-beta-D-GlcpNAc-(1->3)-beta-D-Galp-(1->4)-D-Glcp Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@H]1O[C@H]1[C@H](O[C@@H]2[C@H]([C@H](O[C@@H]3[C@H]([C@H](O[C@@H]4[C@H](OC(O)[C@H](O)[C@H]4O)CO)O[C@H](CO)[C@@H]3O)O)O[C@H](CO)[C@H]2O)NC(C)=O)O[C@H](CO)[C@H](O)[C@@H]1O FZIVHOUANIQOMU-YIHIYSSUSA-N 0.000 claims description 8
- 230000000593 degrading effect Effects 0.000 claims description 8
- FZIVHOUANIQOMU-UHFFFAOYSA-N lacto-N-fucopentaose I Natural products OC1C(O)C(O)C(C)OC1OC1C(OC2C(C(OC3C(C(OC4C(OC(O)C(O)C4O)CO)OC(CO)C3O)O)OC(CO)C2O)NC(C)=O)OC(CO)C(O)C1O FZIVHOUANIQOMU-UHFFFAOYSA-N 0.000 claims description 8
- 239000002253 acid Substances 0.000 claims description 7
- TYALNJQZQRNQNQ-JLYOMPFMSA-N alpha-Neup5Ac-(2->6)-beta-D-Galp-(1->4)-beta-D-Glcp Chemical compound O1[C@@H]([C@H](O)[C@H](O)CO)[C@H](NC(=O)C)[C@@H](O)C[C@@]1(C(O)=O)OC[C@@H]1[C@H](O)[C@H](O)[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](O)[C@H](O)O[C@@H]2CO)O)O1 TYALNJQZQRNQNQ-JLYOMPFMSA-N 0.000 claims description 7
- 102000002464 Galactosidases Human genes 0.000 claims description 6
- 108010093031 Galactosidases Proteins 0.000 claims description 6
- 102000051366 Glycosyltransferases Human genes 0.000 claims description 6
- 108090000141 Sialyltransferases Proteins 0.000 claims description 6
- 102000003838 Sialyltransferases Human genes 0.000 claims description 6
- 235000020256 human milk Nutrition 0.000 claims description 6
- 210000004251 human milk Anatomy 0.000 claims description 6
- 235000016709 nutrition Nutrition 0.000 claims description 6
- DVGKRPYUFRZAQW-UHFFFAOYSA-N 3 prime Natural products CC(=O)NC1OC(CC(O)C1C(O)C(O)CO)(OC2C(O)C(CO)OC(OC3C(O)C(O)C(O)OC3CO)C2O)C(=O)O DVGKRPYUFRZAQW-UHFFFAOYSA-N 0.000 claims description 5
- 239000001963 growth medium Substances 0.000 claims description 5
- 108060003306 Galactosyltransferase Proteins 0.000 claims description 4
- 102000030902 Galactosyltransferase Human genes 0.000 claims description 4
- 238000012258 culturing Methods 0.000 claims description 4
- 108010001671 galactoside 3-fucosyltransferase Proteins 0.000 claims description 4
- 102000004366 Glucosidases Human genes 0.000 claims description 3
- 108010056771 Glucosidases Proteins 0.000 claims description 3
- 102000005348 Neuraminidase Human genes 0.000 claims description 3
- 108010006232 Neuraminidase Proteins 0.000 claims description 3
- TYALNJQZQRNQNQ-UHFFFAOYSA-N #alpha;2,6-sialyllactose Natural products O1C(C(O)C(O)CO)C(NC(=O)C)C(O)CC1(C(O)=O)OCC1C(O)C(O)C(O)C(OC2C(C(O)C(O)OC2CO)O)O1 TYALNJQZQRNQNQ-UHFFFAOYSA-N 0.000 claims description 2
- CILYIEBUXJIHCO-UHFFFAOYSA-N 102778-91-6 Natural products O1C(C(O)C(O)CO)C(NC(=O)C)C(O)CC1(C(O)=O)OC1C(O)C(OC2C(C(O)C(O)OC2CO)O)OC(CO)C1O CILYIEBUXJIHCO-UHFFFAOYSA-N 0.000 claims description 2
- 108010055629 Glucosyltransferases Proteins 0.000 claims description 2
- 102000000340 Glucosyltransferases Human genes 0.000 claims description 2
- CILYIEBUXJIHCO-UITFWXMXSA-N N-acetyl-alpha-neuraminyl-(2->3)-beta-D-galactosyl-(1->4)-beta-D-glucose Chemical compound O1[C@@H]([C@H](O)[C@H](O)CO)[C@H](NC(=O)C)[C@@H](O)C[C@@]1(C(O)=O)O[C@@H]1[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](O)[C@H](O)O[C@@H]2CO)O)O[C@H](CO)[C@@H]1O CILYIEBUXJIHCO-UITFWXMXSA-N 0.000 claims description 2
- OIZGSVFYNBZVIK-UHFFFAOYSA-N N-acetylneuraminosyl-D-lactose Natural products O1C(C(O)C(O)CO)C(NC(=O)C)C(O)CC1(C(O)=O)OC1C(O)C(OC(C(O)CO)C(O)C(O)C=O)OC(CO)C1O OIZGSVFYNBZVIK-UHFFFAOYSA-N 0.000 claims description 2
- 102000007478 beta-N-Acetylhexosaminidases Human genes 0.000 claims description 2
- 108010085377 beta-N-Acetylhexosaminidases Proteins 0.000 claims description 2
- 238000002360 preparation method Methods 0.000 claims description 2
- 238000004977 Hueckel calculation Methods 0.000 claims 3
- 125000005630 sialyl group Chemical group 0.000 claims 3
- HWHQUWQCBPAQQH-BWRPKUOHSA-N 2-fucosyllactose Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@H]([C@H](O)CO)[C@H](O)[C@@H](O)C=O HWHQUWQCBPAQQH-BWRPKUOHSA-N 0.000 claims 1
- 102000002268 Hexosaminidases Human genes 0.000 claims 1
- 108010000540 Hexosaminidases Proteins 0.000 claims 1
- TVVLIFCVJJSLBL-SEHWTJTBSA-N Lacto-N-fucopentaose V Chemical compound O[C@H]1C(O)C(O)[C@H](C)O[C@H]1OC([C@@H](O)C=O)[C@@H](C(O)CO)O[C@H]1[C@H](O)[C@@H](OC2[C@@H](C(OC3[C@@H](C(O)C(O)[C@@H](CO)O3)O)[C@H](O)[C@@H](CO)O2)NC(C)=O)[C@@H](O)[C@@H](CO)O1 TVVLIFCVJJSLBL-SEHWTJTBSA-N 0.000 claims 1
- 102000002493 N-Acetylglucosaminyltransferases Human genes 0.000 claims 1
- 108010093077 N-Acetylglucosaminyltransferases Proteins 0.000 claims 1
- CMQZRJBJDCVIEY-JEOLMMCMSA-N alpha-L-Fucp-(1->3)-[beta-D-Galp-(1->4)]-beta-D-GlcpNAc-(1->3)-beta-D-Galp-(1->4)-D-Glcp Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@H]1O[C@H]1[C@H](O[C@H]2[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O2)O)[C@@H](CO)O[C@@H](O[C@@H]2[C@H]([C@H](O[C@@H]3[C@H](OC(O)[C@H](O)[C@H]3O)CO)O[C@H](CO)[C@@H]2O)O)[C@@H]1NC(C)=O CMQZRJBJDCVIEY-JEOLMMCMSA-N 0.000 claims 1
- RQNFGIWYOACERD-OCQMRBNYSA-N alpha-L-Fucp-(1->4)-[alpha-L-Fucp-(1->2)-beta-D-Galp-(1->3)]-beta-D-GlcpNAc-(1->3)-beta-D-Galp-(1->4)-D-Glcp Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@H]1O[C@H]1[C@H](O[C@H]2[C@@H]([C@@H](CO)O[C@@H](O[C@@H]3[C@H]([C@H](O[C@@H]4[C@H](OC(O)[C@H](O)[C@H]4O)CO)O[C@H](CO)[C@@H]3O)O)[C@@H]2NC(C)=O)O[C@H]2[C@H]([C@H](O)[C@H](O)[C@H](C)O2)O)O[C@H](CO)[C@H](O)[C@@H]1O RQNFGIWYOACERD-OCQMRBNYSA-N 0.000 claims 1
- DUKURNFHYQXCJG-JEOLMMCMSA-N alpha-L-Fucp-(1->4)-[beta-D-Galp-(1->3)]-beta-D-GlcpNAc-(1->3)-beta-D-Galp-(1->4)-D-Glcp Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@H]1O[C@H]1[C@H](O[C@H]2[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O2)O)[C@@H](NC(C)=O)[C@H](O[C@@H]2[C@H]([C@H](O[C@@H]3[C@H](OC(O)[C@H](O)[C@H]3O)CO)O[C@H](CO)[C@@H]2O)O)O[C@@H]1CO DUKURNFHYQXCJG-JEOLMMCMSA-N 0.000 claims 1
- RQNFGIWYOACERD-UHFFFAOYSA-N lacto-N-Difucosylhexaose I Natural products OC1C(O)C(O)C(C)OC1OC1C(OC2C(C(CO)OC(OC3C(C(OC4C(OC(O)C(O)C4O)CO)OC(CO)C3O)O)C2NC(C)=O)OC2C(C(O)C(O)C(C)O2)O)OC(CO)C(O)C1O RQNFGIWYOACERD-UHFFFAOYSA-N 0.000 claims 1
- OQIUPKPUOLIHHS-UHFFFAOYSA-N lacto-N-difucohexaose I Natural products OC1C(O)C(O)C(C)OC1OC1C(OC2C(C(CO)OC(OC3C(C(OC(C(O)CO)C(O)C(O)C=O)OC(CO)C3O)O)C2NC(C)=O)OC2C(C(O)C(O)C(C)O2)O)OC(CO)C(O)C1O OQIUPKPUOLIHHS-UHFFFAOYSA-N 0.000 claims 1
- FKADDOYBRRMBPP-UHFFFAOYSA-N lacto-N-fucopentaose II Natural products OC1C(O)C(O)C(C)OC1OC1C(OC2C(C(O)C(O)C(CO)O2)O)C(NC(C)=O)C(OC2C(C(OC(C(O)CO)C(O)C(O)C=O)OC(CO)C2O)O)OC1CO FKADDOYBRRMBPP-UHFFFAOYSA-N 0.000 claims 1
- CMQZRJBJDCVIEY-UHFFFAOYSA-N lacto-N-fucopentaose III Natural products OC1C(O)C(O)C(C)OC1OC1C(OC2C(C(O)C(O)C(CO)O2)O)C(CO)OC(OC2C(C(OC3C(OC(O)C(O)C3O)CO)OC(CO)C2O)O)C1NC(C)=O CMQZRJBJDCVIEY-UHFFFAOYSA-N 0.000 claims 1
- 210000004027 cell Anatomy 0.000 description 148
- 239000002773 nucleotide Substances 0.000 description 115
- 125000003729 nucleotide group Chemical group 0.000 description 115
- 230000014509 gene expression Effects 0.000 description 42
- 108090000623 proteins and genes Proteins 0.000 description 42
- GUBGYTABKSRVRQ-QKKXKWKRSA-N lactose group Chemical group OC1[C@H](O)[C@@H](O)[C@H](O[C@H]2[C@H](O)[C@@H](O)[C@@H](O)[C@H](O2)CO)[C@H](O1)CO GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 32
- 239000008101 lactose Substances 0.000 description 31
- SHZGCJCMOBCMKK-DHVFOXMCSA-N L-fucopyranose Chemical group C[C@@H]1OC(O)[C@@H](O)[C@H](O)[C@@H]1O SHZGCJCMOBCMKK-DHVFOXMCSA-N 0.000 description 23
- WQZGKKKJIJFFOK-PHYPRBDBSA-N alpha-D-galactose Chemical group OC[C@H]1O[C@H](O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-PHYPRBDBSA-N 0.000 description 22
- 229920001184 polypeptide Polymers 0.000 description 22
- 108090000765 processed proteins & peptides Proteins 0.000 description 22
- 102000004196 processed proteins & peptides Human genes 0.000 description 22
- 108020004707 nucleic acids Proteins 0.000 description 20
- 102000039446 nucleic acids Human genes 0.000 description 20
- 150000007523 nucleic acids Chemical class 0.000 description 20
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 18
- SHZGCJCMOBCMKK-UHFFFAOYSA-N D-mannomethylose Natural products CC1OC(O)C(O)C(O)C1O SHZGCJCMOBCMKK-UHFFFAOYSA-N 0.000 description 17
- PHTAQVMXYWFMHF-GJGMMKECSA-N alpha-L-Fucp-(1->2)-beta-D-Galp-(1->4)-D-GlcpNAc Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@H]1O[C@H]1[C@H](O[C@@H]2[C@H](OC(O)[C@H](NC(C)=O)[C@H]2O)CO)O[C@H](CO)[C@H](O)[C@@H]1O PHTAQVMXYWFMHF-GJGMMKECSA-N 0.000 description 17
- 238000000855 fermentation Methods 0.000 description 16
- 230000004151 fermentation Effects 0.000 description 16
- 108020004414 DNA Proteins 0.000 description 15
- 239000000758 substrate Substances 0.000 description 15
- 108010061238 threonyl-glycine Proteins 0.000 description 15
- AUNPEJDACLEKSC-ZAYDSPBTSA-N 3-fucosyllactose Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](O)[C@H](O[C@@H]2[C@H](OC(O)[C@H](O)[C@H]2O)CO)O[C@H](CO)[C@@H]1O AUNPEJDACLEKSC-ZAYDSPBTSA-N 0.000 description 14
- 125000003275 alpha amino acid group Chemical group 0.000 description 14
- 230000010354 integration Effects 0.000 description 14
- 108010047857 aspartylglycine Proteins 0.000 description 13
- 244000005700 microbiome Species 0.000 description 13
- 230000002194 synthesizing effect Effects 0.000 description 13
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 12
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 12
- 108010050848 glycylleucine Proteins 0.000 description 12
- 239000002609 medium Substances 0.000 description 12
- SNFSYLYCDAVZGP-OLAZETNGSA-N 2'-fucosyllactose Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@H]1O[C@H]1[C@H](O[C@@H]2[C@H](OC(O)[C@H](O)[C@H]2O)CO)O[C@H](CO)[C@H](O)[C@@H]1O SNFSYLYCDAVZGP-OLAZETNGSA-N 0.000 description 11
- PNNNRSAQSRJVSB-SLPGGIOYSA-N Fucose Natural products C[C@H](O)[C@@H](O)[C@H](O)[C@H](O)C=O PNNNRSAQSRJVSB-SLPGGIOYSA-N 0.000 description 11
- 108010047495 alanylglycine Proteins 0.000 description 11
- 150000002772 monosaccharides Chemical group 0.000 description 11
- 241000588724 Escherichia coli Species 0.000 description 10
- 241000880493 Leptailurus serval Species 0.000 description 10
- HSCJRCZFDFQWRP-UHFFFAOYSA-N Uridindiphosphoglukose Natural products OC1C(O)C(O)C(CO)OC1OP(O)(=O)OP(O)(=O)OCC1C(O)C(O)C(N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-UHFFFAOYSA-N 0.000 description 10
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 10
- 239000002243 precursor Substances 0.000 description 10
- 238000012546 transfer Methods 0.000 description 10
- 241000186016 Bifidobacterium bifidum Species 0.000 description 9
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 9
- LKOHREGGXUJGKC-UHFFFAOYSA-N Lactodifucotetraose Natural products OC1C(O)C(O)C(C)OC1OC1C(OC2C(C(O)C(O)OC2CO)OC2C(C(O)C(O)C(C)O2)O)OC(CO)C(O)C1O LKOHREGGXUJGKC-UHFFFAOYSA-N 0.000 description 9
- LKOHREGGXUJGKC-GXSKDVPZSA-N alpha-L-Fucp-(1->3)-[alpha-L-Fucp-(1->2)-beta-D-Galp-(1->4)]-beta-D-Glcp Chemical compound C[C@@H]1O[C@@H](O[C@@H]2[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]2O[C@@H]2[C@@H](CO)O[C@@H](O)[C@H](O)[C@H]2O[C@@H]2O[C@@H](C)[C@@H](O)[C@@H](O)[C@@H]2O)[C@@H](O)[C@H](O)[C@@H]1O LKOHREGGXUJGKC-GXSKDVPZSA-N 0.000 description 9
- 230000007062 hydrolysis Effects 0.000 description 9
- 238000006460 hydrolysis reaction Methods 0.000 description 9
- 108010057821 leucylproline Proteins 0.000 description 9
- OVRNDRQMDRJTHS-RTRLPJTCSA-N N-acetyl-D-glucosamine Chemical group CC(=O)N[C@H]1C(O)O[C@H](CO)[C@@H](O)[C@@H]1O OVRNDRQMDRJTHS-RTRLPJTCSA-N 0.000 description 8
- SQVRNKJHWKZAKO-PFQGKNLYSA-N N-acetyl-beta-neuraminic acid Chemical group CC(=O)N[C@@H]1[C@@H](O)C[C@@](O)(C(O)=O)O[C@H]1[C@H](O)[C@H](O)CO SQVRNKJHWKZAKO-PFQGKNLYSA-N 0.000 description 8
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 8
- 108010020764 Transposases Proteins 0.000 description 8
- 102000008579 Transposases Human genes 0.000 description 8
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 8
- 108010077245 asparaginyl-proline Proteins 0.000 description 8
- 229940002008 bifidobacterium bifidum Drugs 0.000 description 8
- 230000001105 regulatory effect Effects 0.000 description 8
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 7
- USAZACJQJDHAJH-KDEXOMDGSA-N [[(2r,3s,4r,5s)-5-(2,4-dioxo-1h-pyrimidin-6-yl)-3,4-dihydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl] [(2r,3r,4s,5r,6r)-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl] hydrogen phosphate Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](C=2NC(=O)NC(=O)C=2)O1 USAZACJQJDHAJH-KDEXOMDGSA-N 0.000 description 7
- 230000001580 bacterial effect Effects 0.000 description 7
- 230000000295 complement effect Effects 0.000 description 7
- 229930182830 galactose Natural products 0.000 description 7
- 108010037850 glycylvaline Proteins 0.000 description 7
- 238000009396 hybridization Methods 0.000 description 7
- 230000004048 modification Effects 0.000 description 7
- 238000012986 modification Methods 0.000 description 7
- 238000003786 synthesis reaction Methods 0.000 description 7
- 238000013518 transcription Methods 0.000 description 7
- 230000035897 transcription Effects 0.000 description 7
- 108010065920 Insulin Lispro Proteins 0.000 description 6
- SHZGCJCMOBCMKK-PQMKYFCFSA-N L-Fucose Natural products C[C@H]1O[C@H](O)[C@@H](O)[C@@H](O)[C@@H]1O SHZGCJCMOBCMKK-PQMKYFCFSA-N 0.000 description 6
- PNNNRSAQSRJVSB-UHFFFAOYSA-N L-rhamnose Natural products CC(O)C(O)C(O)C(O)C=O PNNNRSAQSRJVSB-UHFFFAOYSA-N 0.000 description 6
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 6
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 6
- 108091028043 Nucleic acid sequence Proteins 0.000 description 6
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 6
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 6
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 6
- 108010005233 alanylglutamic acid Proteins 0.000 description 6
- 238000012217 deletion Methods 0.000 description 6
- 230000037430 deletion Effects 0.000 description 6
- 230000000694 effects Effects 0.000 description 6
- 108010015792 glycyllysine Proteins 0.000 description 6
- 238000004128 high performance liquid chromatography Methods 0.000 description 6
- 108010034529 leucyl-lysine Proteins 0.000 description 6
- 108010054155 lysyllysine Proteins 0.000 description 6
- 108010051242 phenylalanylserine Proteins 0.000 description 6
- 108010029020 prolylglycine Proteins 0.000 description 6
- 108010073969 valyllysine Proteins 0.000 description 6
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 5
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 5
- 241001608472 Bifidobacterium longum Species 0.000 description 5
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 5
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 5
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 5
- 108010041407 alanylaspartic acid Proteins 0.000 description 5
- 108010087924 alanylproline Proteins 0.000 description 5
- 108010092854 aspartyllysine Proteins 0.000 description 5
- SQVRNKJHWKZAKO-UHFFFAOYSA-N beta-N-Acetyl-D-neuraminic acid Natural products CC(=O)NC1C(O)CC(O)(C(O)=O)OC1C(O)C(O)CO SQVRNKJHWKZAKO-UHFFFAOYSA-N 0.000 description 5
- 229940009291 bifidobacterium longum Drugs 0.000 description 5
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 5
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 5
- 108010089804 glycyl-threonine Proteins 0.000 description 5
- 108010010147 glycylglutamine Proteins 0.000 description 5
- 108010064235 lysylglycine Proteins 0.000 description 5
- 239000000047 product Substances 0.000 description 5
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 4
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 4
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 4
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 4
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 4
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 4
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 4
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 4
- 241001646716 Escherichia coli K-12 Species 0.000 description 4
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 4
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 4
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 4
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 4
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 4
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 4
- 240000004713 Pisum sativum Species 0.000 description 4
- 235000010582 Pisum sativum Nutrition 0.000 description 4
- 241000831652 Salinivibrio sharmensis Species 0.000 description 4
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 4
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 4
- GRIUMVXCJDKVPI-IZPVPAKOSA-N Thr-Thr-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GRIUMVXCJDKVPI-IZPVPAKOSA-N 0.000 description 4
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 4
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 4
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 4
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 4
- 108010068265 aspartyltyrosine Proteins 0.000 description 4
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 4
- 239000006143 cell culture medium Substances 0.000 description 4
- 239000012228 culture supernatant Substances 0.000 description 4
- 230000002068 genetic effect Effects 0.000 description 4
- 108010049041 glutamylalanine Proteins 0.000 description 4
- 102000045442 glycosyltransferase activity proteins Human genes 0.000 description 4
- 108700014210 glycosyltransferase activity proteins Proteins 0.000 description 4
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 4
- -1 guanosine diphosphate-activated L-fucose Chemical class 0.000 description 4
- 108010038320 lysylphenylalanine Proteins 0.000 description 4
- 235000015097 nutrients Nutrition 0.000 description 4
- 229910052760 oxygen Inorganic materials 0.000 description 4
- 239000001301 oxygen Substances 0.000 description 4
- 108010053725 prolylvaline Proteins 0.000 description 4
- 235000018102 proteins Nutrition 0.000 description 4
- 102000004169 proteins and genes Human genes 0.000 description 4
- 230000006798 recombination Effects 0.000 description 4
- 238000005215 recombination Methods 0.000 description 4
- 108010071207 serylmethionine Proteins 0.000 description 4
- 108010038745 tryptophylglycine Proteins 0.000 description 4
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 3
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 3
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 3
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 3
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 3
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 3
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 3
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 3
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 3
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 3
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 3
- XEGZSHSPQNDNRH-JRQIVUDYSA-N Asn-Tyr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XEGZSHSPQNDNRH-JRQIVUDYSA-N 0.000 description 3
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 3
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 3
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 3
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 3
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 3
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 3
- 108010090461 DFG peptide Proteins 0.000 description 3
- LQEBEXMHBLQMDB-UHFFFAOYSA-N GDP-L-fucose Natural products OC1C(O)C(O)C(C)OC1OP(O)(=O)OP(O)(=O)OCC1C(O)C(O)C(N2C3=C(C(N=C(N)N3)=O)N=C2)O1 LQEBEXMHBLQMDB-UHFFFAOYSA-N 0.000 description 3
- 229930182566 Gentamicin Natural products 0.000 description 3
- CEAZRRDELHUEMR-URQXQFDESA-N Gentamicin Chemical compound O1[C@H](C(C)NC)CC[C@@H](N)[C@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](NC)[C@@](C)(O)CO2)O)[C@H](N)C[C@@H]1N CEAZRRDELHUEMR-URQXQFDESA-N 0.000 description 3
- NTHIHAUEXVTXQG-KKUMJFAQSA-N Glu-Tyr-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O NTHIHAUEXVTXQG-KKUMJFAQSA-N 0.000 description 3
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 3
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 3
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 3
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 3
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 3
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 3
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 3
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 3
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 3
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 3
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 3
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 3
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 3
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 3
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 3
- 241000193448 Ruminiclostridium thermocellum Species 0.000 description 3
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 3
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 3
- 241000193998 Streptococcus pneumoniae Species 0.000 description 3
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 3
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 3
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 3
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 3
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 3
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 3
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 3
- 108091023040 Transcription factor Proteins 0.000 description 3
- 102000040945 Transcription factor Human genes 0.000 description 3
- ULHASJWZGUEUNN-XIRDDKMYSA-N Trp-Lys-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O ULHASJWZGUEUNN-XIRDDKMYSA-N 0.000 description 3
- QAYSODICXVZUIA-WLTAIBSBSA-N Tyr-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QAYSODICXVZUIA-WLTAIBSBSA-N 0.000 description 3
- HSCJRCZFDFQWRP-JZMIEXBBSA-N UDP-alpha-D-glucose Chemical group O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-JZMIEXBBSA-N 0.000 description 3
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 3
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 3
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 3
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 3
- 108010013835 arginine glutamate Proteins 0.000 description 3
- 108010062796 arginyllysine Proteins 0.000 description 3
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 3
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 3
- 230000027455 binding Effects 0.000 description 3
- 230000003115 biocidal effect Effects 0.000 description 3
- 229910052799 carbon Inorganic materials 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 230000002759 chromosomal effect Effects 0.000 description 3
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 3
- 239000003623 enhancer Substances 0.000 description 3
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 3
- 229960002518 gentamicin Drugs 0.000 description 3
- 125000002791 glucosyl group Chemical group C1([C@H](O)[C@@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 3
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 3
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 3
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 3
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 3
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 3
- 108010081551 glycylphenylalanine Proteins 0.000 description 3
- 229910052500 inorganic mineral Inorganic materials 0.000 description 3
- 238000003780 insertion Methods 0.000 description 3
- 230000037431 insertion Effects 0.000 description 3
- 239000000543 intermediate Substances 0.000 description 3
- 108010017391 lysylvaline Proteins 0.000 description 3
- 230000001404 mediated effect Effects 0.000 description 3
- 108010056582 methionylglutamic acid Proteins 0.000 description 3
- 239000011707 mineral Substances 0.000 description 3
- 235000010755 mineral Nutrition 0.000 description 3
- 101150038284 pfkA gene Proteins 0.000 description 3
- 108010031719 prolyl-serine Proteins 0.000 description 3
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 3
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 3
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 3
- 241000894007 species Species 0.000 description 3
- 229940031000 streptococcus pneumoniae Drugs 0.000 description 3
- 150000008163 sugars Chemical class 0.000 description 3
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 3
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 3
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 3
- 108010020532 tyrosyl-proline Proteins 0.000 description 3
- 108010003137 tyrosyltyrosine Proteins 0.000 description 3
- 210000005253 yeast cell Anatomy 0.000 description 3
- WQZGKKKJIJFFOK-SVZMEOIVSA-N (+)-Galactose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-SVZMEOIVSA-N 0.000 description 2
- 101710099475 3'-phosphoadenosine 5'-phosphate phosphatase Proteins 0.000 description 2
- OIZGSVFYNBZVIK-FHHHURIISA-N 3'-sialyllactose Chemical compound O1[C@@H]([C@H](O)[C@H](O)CO)[C@H](NC(=O)C)[C@@H](O)C[C@@]1(C(O)=O)O[C@@H]1[C@@H](O)[C@H](O[C@H]([C@H](O)CO)[C@H](O)[C@@H](O)C=O)O[C@H](CO)[C@@H]1O OIZGSVFYNBZVIK-FHHHURIISA-N 0.000 description 2
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 2
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 2
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 2
- PXKLCFFSVLKOJM-ACZMJKKPSA-N Ala-Asn-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXKLCFFSVLKOJM-ACZMJKKPSA-N 0.000 description 2
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 2
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 2
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 2
- AWAXZRDKUHOPBO-GUBZILKMSA-N Ala-Gln-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O AWAXZRDKUHOPBO-GUBZILKMSA-N 0.000 description 2
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 2
- UHMQKOBNPRAZGB-CIUDSAMLSA-N Ala-Glu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N UHMQKOBNPRAZGB-CIUDSAMLSA-N 0.000 description 2
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 2
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 2
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 2
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 2
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 2
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 2
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 2
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 2
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 2
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 2
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 2
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 2
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 2
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 2
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 2
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 2
- FRMQITGHXMUNDF-GMOBBJLQSA-N Arg-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FRMQITGHXMUNDF-GMOBBJLQSA-N 0.000 description 2
- YKBHOXLMMPZPHQ-GMOBBJLQSA-N Arg-Ile-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O YKBHOXLMMPZPHQ-GMOBBJLQSA-N 0.000 description 2
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 2
- OVQJAKFLFTZDNC-GUBZILKMSA-N Arg-Pro-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O OVQJAKFLFTZDNC-GUBZILKMSA-N 0.000 description 2
- AWMAZIIEFPFHCP-RCWTZXSCSA-N Arg-Pro-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWMAZIIEFPFHCP-RCWTZXSCSA-N 0.000 description 2
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 2
- CGWVCWFQGXOUSJ-ULQDDVLXSA-N Arg-Tyr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O CGWVCWFQGXOUSJ-ULQDDVLXSA-N 0.000 description 2
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 2
- XQQVCUIBGYFKDC-OLHMAJIHSA-N Asn-Asp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XQQVCUIBGYFKDC-OLHMAJIHSA-N 0.000 description 2
- SJPZTWAYTJPPBI-GUBZILKMSA-N Asn-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SJPZTWAYTJPPBI-GUBZILKMSA-N 0.000 description 2
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 2
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 2
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 2
- JEEFEQCRXKPQHC-KKUMJFAQSA-N Asn-Leu-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JEEFEQCRXKPQHC-KKUMJFAQSA-N 0.000 description 2
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 2
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 2
- PPCORQFLAZWUNO-QWRGUYRKSA-N Asn-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N PPCORQFLAZWUNO-QWRGUYRKSA-N 0.000 description 2
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 2
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 2
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 2
- YSYTWUMRHSFODC-QWRGUYRKSA-N Asn-Tyr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O YSYTWUMRHSFODC-QWRGUYRKSA-N 0.000 description 2
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 2
- XPGVTUBABLRGHY-BIIVOSGPSA-N Asp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N XPGVTUBABLRGHY-BIIVOSGPSA-N 0.000 description 2
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 2
- ATYWBXGNXZYZGI-ACZMJKKPSA-N Asp-Asn-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ATYWBXGNXZYZGI-ACZMJKKPSA-N 0.000 description 2
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 2
- RQYMKRMRZWJGHC-BQBZGAKWSA-N Asp-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N RQYMKRMRZWJGHC-BQBZGAKWSA-N 0.000 description 2
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 2
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 2
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 2
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 2
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 2
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 2
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 2
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 2
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 2
- 241000193830 Bacillus <bacterium> Species 0.000 description 2
- 241000193422 Bacillus lentus Species 0.000 description 2
- 241000194107 Bacillus megaterium Species 0.000 description 2
- 241000194106 Bacillus mycoides Species 0.000 description 2
- 241000194103 Bacillus pumilus Species 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 241000186015 Bifidobacterium longum subsp. infantis Species 0.000 description 2
- TXCIAUNLDRJGJZ-UHFFFAOYSA-N CMP-N-acetyl neuraminic acid Natural products O1C(C(O)C(O)CO)C(NC(=O)C)C(O)CC1(C(O)=O)OP(O)(=O)OCC1C(O)C(O)C(N2C(N=C(N)C=C2)=O)O1 TXCIAUNLDRJGJZ-UHFFFAOYSA-N 0.000 description 2
- TXCIAUNLDRJGJZ-BILDWYJOSA-N CMP-N-acetyl-beta-neuraminic acid Chemical compound O1[C@@H]([C@H](O)[C@H](O)CO)[C@H](NC(=O)C)[C@@H](O)C[C@]1(C(O)=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(N=C(N)C=C2)=O)O1 TXCIAUNLDRJGJZ-BILDWYJOSA-N 0.000 description 2
- 241000588919 Citrobacter freundii Species 0.000 description 2
- GSXOAOHZAIYLCY-UHFFFAOYSA-N D-F6P Natural products OCC(=O)C(O)C(O)C(O)COP(O)(O)=O GSXOAOHZAIYLCY-UHFFFAOYSA-N 0.000 description 2
- 102000052510 DNA-Binding Proteins Human genes 0.000 description 2
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 2
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 2
- 241000196324 Embryophyta Species 0.000 description 2
- 101100280818 Escherichia coli (strain K12) fcl gene Proteins 0.000 description 2
- 101100022282 Escherichia coli O157:H7 manC2 gene Proteins 0.000 description 2
- 101710196411 Fructose-1,6-bisphosphatase Proteins 0.000 description 2
- 101710186733 Fructose-1,6-bisphosphatase, chloroplastic Proteins 0.000 description 2
- 101710109119 Fructose-1,6-bisphosphatase, cytosolic Proteins 0.000 description 2
- 101710198902 Fructose-1,6-bisphosphate aldolase/phosphatase Proteins 0.000 description 2
- RTVRUWIBAVHRQX-PMEZUWKYSA-N Fucosyllactose Chemical compound C([C@H]1O[C@@H]([C@H]([C@@H](O[C@@H]2[C@H]([C@@H](O)[C@H](O)[C@@H](CO)O2)O)[C@@H]1O)O)OC)O[C@H]1OC[C@@H](O)[C@H](O)[C@@H]1O RTVRUWIBAVHRQX-PMEZUWKYSA-N 0.000 description 2
- 241000233866 Fungi Species 0.000 description 2
- LQEBEXMHBLQMDB-JGQUBWHWSA-N GDP-beta-L-fucose Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C3=C(C(NC(N)=N3)=O)N=C2)O1 LQEBEXMHBLQMDB-JGQUBWHWSA-N 0.000 description 2
- NKCZYEDZTKOFBG-GUBZILKMSA-N Gln-Gln-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NKCZYEDZTKOFBG-GUBZILKMSA-N 0.000 description 2
- QFJPFPCSXOXMKI-BPUTZDHNSA-N Gln-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N QFJPFPCSXOXMKI-BPUTZDHNSA-N 0.000 description 2
- NSNUZSPSADIMJQ-WDSKDSINSA-N Gln-Gly-Asp Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NSNUZSPSADIMJQ-WDSKDSINSA-N 0.000 description 2
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 2
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 2
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 2
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 2
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 2
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 2
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 2
- UZWUBBRJWFTHTD-LAEOZQHASA-N Glu-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O UZWUBBRJWFTHTD-LAEOZQHASA-N 0.000 description 2
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 2
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 2
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 2
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 2
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 2
- LERGJIVJIIODPZ-ZANVPECISA-N Gly-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)CN)C)C(O)=O)=CNC2=C1 LERGJIVJIIODPZ-ZANVPECISA-N 0.000 description 2
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 2
- XUORRGAFUQIMLC-STQMWFEESA-N Gly-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN)O XUORRGAFUQIMLC-STQMWFEESA-N 0.000 description 2
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 2
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 2
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 2
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 2
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 2
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 2
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 2
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 2
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 2
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 2
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 2
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 2
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 2
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 2
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 2
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 2
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 2
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 2
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 2
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 2
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 2
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 2
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 2
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 2
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 2
- JFFAPRNXXLRINI-NHCYSSNCSA-N His-Asp-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JFFAPRNXXLRINI-NHCYSSNCSA-N 0.000 description 2
- KAXZXLSXFWSNNZ-XVYDVKMFSA-N His-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KAXZXLSXFWSNNZ-XVYDVKMFSA-N 0.000 description 2
- LPBWRHRHEIYAIP-KKUMJFAQSA-N His-Tyr-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LPBWRHRHEIYAIP-KKUMJFAQSA-N 0.000 description 2
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 2
- HZMLFETXHFHGBB-UGYAYLCHSA-N Ile-Asn-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZMLFETXHFHGBB-UGYAYLCHSA-N 0.000 description 2
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 2
- NBJAAWYRLGCJOF-UGYAYLCHSA-N Ile-Asp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NBJAAWYRLGCJOF-UGYAYLCHSA-N 0.000 description 2
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 2
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 2
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 2
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 2
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 2
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 2
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 2
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 2
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 2
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 2
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 2
- 244000199885 Lactobacillus bulgaricus Species 0.000 description 2
- 244000199866 Lactobacillus casei Species 0.000 description 2
- 241000218492 Lactobacillus crispatus Species 0.000 description 2
- 241000186606 Lactobacillus gasseri Species 0.000 description 2
- 240000006024 Lactobacillus plantarum Species 0.000 description 2
- 241000186604 Lactobacillus reuteri Species 0.000 description 2
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 2
- MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 description 2
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 2
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 2
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 2
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 2
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 2
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 2
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 2
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 2
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 2
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 2
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 2
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 2
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 2
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 2
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 2
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 2
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 2
- DFXQCCBKGUNYGG-GUBZILKMSA-N Lys-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN DFXQCCBKGUNYGG-GUBZILKMSA-N 0.000 description 2
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 2
- VLMNBMFYRMGEMB-QWRGUYRKSA-N Lys-His-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 VLMNBMFYRMGEMB-QWRGUYRKSA-N 0.000 description 2
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 2
- BXPHMHQHYHILBB-BZSNNMDCSA-N Lys-Lys-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BXPHMHQHYHILBB-BZSNNMDCSA-N 0.000 description 2
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 2
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 2
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 2
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 2
- OHMKUHXCDSCOMT-QXEWZRGKSA-N Met-Asn-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHMKUHXCDSCOMT-QXEWZRGKSA-N 0.000 description 2
- SODXFJOPSCXOHE-IHRRRGAJSA-N Met-Leu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O SODXFJOPSCXOHE-IHRRRGAJSA-N 0.000 description 2
- DOQXHOUYYSPISL-SZMVWBNQSA-N Met-Trp-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCSC)C(=O)O)N DOQXHOUYYSPISL-SZMVWBNQSA-N 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 2
- OVRNDRQMDRJTHS-UHFFFAOYSA-N N-acelyl-D-glucosamine Natural products CC(=O)NC1C(O)OC(CO)C(O)C1O OVRNDRQMDRJTHS-UHFFFAOYSA-N 0.000 description 2
- MBLBDJOUHNCFQT-LXGUWJNJSA-N N-acetylglucosamine Natural products CC(=O)N[C@@H](C=O)[C@@H](O)[C@H](O)[C@H](O)CO MBLBDJOUHNCFQT-LXGUWJNJSA-N 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- 108010066427 N-valyltryptophan Proteins 0.000 description 2
- 108091034117 Oligonucleotide Proteins 0.000 description 2
- 241000592795 Paenibacillus sp. Species 0.000 description 2
- 241000588912 Pantoea agglomerans Species 0.000 description 2
- CUMXHKAOHNWRFQ-BZSNNMDCSA-N Phe-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CUMXHKAOHNWRFQ-BZSNNMDCSA-N 0.000 description 2
- RMKGXGPQIPLTFC-KKUMJFAQSA-N Phe-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RMKGXGPQIPLTFC-KKUMJFAQSA-N 0.000 description 2
- VGTJSEYTVMAASM-RPTUDFQQSA-N Phe-Thr-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VGTJSEYTVMAASM-RPTUDFQQSA-N 0.000 description 2
- DBNGDEAQXGFGRA-ACRUOGEOSA-N Phe-Tyr-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DBNGDEAQXGFGRA-ACRUOGEOSA-N 0.000 description 2
- CDHURCQGUDNBMA-UBHSHLNASA-N Phe-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDHURCQGUDNBMA-UBHSHLNASA-N 0.000 description 2
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 2
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 2
- BCNRNJWSRFDPTQ-HJWJTTGWSA-N Pro-Ile-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BCNRNJWSRFDPTQ-HJWJTTGWSA-N 0.000 description 2
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 2
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 2
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 2
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 2
- WVXQQUWOKUZIEG-VEVYYDQMSA-N Pro-Thr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O WVXQQUWOKUZIEG-VEVYYDQMSA-N 0.000 description 2
- HRIXMVRZRGFKNQ-HJGDQZAQSA-N Pro-Thr-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HRIXMVRZRGFKNQ-HJGDQZAQSA-N 0.000 description 2
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 2
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 2
- 241000589516 Pseudomonas Species 0.000 description 2
- 241000589540 Pseudomonas fluorescens Species 0.000 description 2
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 2
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 2
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 2
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 2
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 2
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 2
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 2
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 2
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 2
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 2
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 2
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 2
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 2
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 2
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 2
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 2
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 2
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 2
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 2
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 2
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 2
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 2
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 2
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 2
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 2
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 2
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 2
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 2
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 2
- WFAUDCSNCWJJAA-KXNHARMFSA-N Thr-Lys-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(O)=O WFAUDCSNCWJJAA-KXNHARMFSA-N 0.000 description 2
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 2
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 2
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 2
- HUPLKEHTTQBXSC-YJRXYDGGSA-N Thr-Ser-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUPLKEHTTQBXSC-YJRXYDGGSA-N 0.000 description 2
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 2
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 2
- 102000004357 Transferases Human genes 0.000 description 2
- 108090000992 Transferases Proteins 0.000 description 2
- ZKVANNIVSDOQMG-HKUYNNGSSA-N Trp-Tyr-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)NCC(=O)O)N ZKVANNIVSDOQMG-HKUYNNGSSA-N 0.000 description 2
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 2
- ZRPLVTZTKPPSBT-AVGNSLFASA-N Tyr-Glu-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZRPLVTZTKPPSBT-AVGNSLFASA-N 0.000 description 2
- UNUZEBFXGWVAOP-DZKIICNBSA-N Tyr-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UNUZEBFXGWVAOP-DZKIICNBSA-N 0.000 description 2
- USYGMBIIUDLYHJ-GVARAGBVSA-N Tyr-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 USYGMBIIUDLYHJ-GVARAGBVSA-N 0.000 description 2
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 2
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 2
- OFHKXNKJXURPSY-ULQDDVLXSA-N Tyr-Met-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O OFHKXNKJXURPSY-ULQDDVLXSA-N 0.000 description 2
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 2
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 2
- RMRFSFXLFWWAJZ-HJOGWXRNSA-N Tyr-Tyr-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 RMRFSFXLFWWAJZ-HJOGWXRNSA-N 0.000 description 2
- LFTYTUAZOPRMMI-CFRASDGPSA-N UDP-N-acetyl-alpha-D-glucosamine Chemical compound O1[C@H](CO)[C@@H](O)[C@H](O)[C@@H](NC(=O)C)[C@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(NC(=O)C=C2)=O)O1 LFTYTUAZOPRMMI-CFRASDGPSA-N 0.000 description 2
- LFTYTUAZOPRMMI-UHFFFAOYSA-N UNPD164450 Natural products O1C(CO)C(O)C(O)C(NC(=O)C)C1OP(O)(=O)OP(O)(=O)OCC1C(O)C(O)C(N2C(NC(=O)C=C2)=O)O1 LFTYTUAZOPRMMI-UHFFFAOYSA-N 0.000 description 2
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 2
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 2
- LABUITCFCAABSV-UHFFFAOYSA-N Val-Ala-Tyr Natural products CC(C)C(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LABUITCFCAABSV-UHFFFAOYSA-N 0.000 description 2
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 2
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 2
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 2
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 2
- AAOPYWQQBXHINJ-DZKIICNBSA-N Val-Gln-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AAOPYWQQBXHINJ-DZKIICNBSA-N 0.000 description 2
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 2
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 2
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 2
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 2
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 2
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 2
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 2
- CXWJFWAZIVWBOS-XQQFMLRXSA-N Val-Lys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CXWJFWAZIVWBOS-XQQFMLRXSA-N 0.000 description 2
- UOUIMEGEPSBZIV-ULQDDVLXSA-N Val-Lys-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UOUIMEGEPSBZIV-ULQDDVLXSA-N 0.000 description 2
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 2
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 2
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 2
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 2
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 2
- DVLWZWNAQUBZBC-ZNSHCXBVSA-N Val-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N)O DVLWZWNAQUBZBC-ZNSHCXBVSA-N 0.000 description 2
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 2
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 2
- UEXPMFIAZZHEAD-HSHDSVGOSA-N Val-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](C(C)C)N)O UEXPMFIAZZHEAD-HSHDSVGOSA-N 0.000 description 2
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 2
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 description 2
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 2
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 2
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 2
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 2
- 108010044940 alanylglutamine Proteins 0.000 description 2
- HXXFSFRBOHSIMQ-RWOPYEJCSA-L alpha-D-mannose 1-phosphate(2-) Chemical compound OC[C@H]1O[C@H](OP([O-])([O-])=O)[C@@H](O)[C@@H](O)[C@@H]1O HXXFSFRBOHSIMQ-RWOPYEJCSA-L 0.000 description 2
- 108010008355 arginyl-glutamine Proteins 0.000 description 2
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 2
- PEOXGOPDODNWAZ-KZBIEJSGSA-N beta-D-Galp-(1->3)-[alpha-L-Fucp-(1->4)]-beta-D-GlcpNAc-(1->3)-beta-D-Galp-(1->4)-beta-D-Glcp Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@H]1O[C@H]1[C@H](O[C@H]2[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O2)O)[C@@H](O)[C@H](O[C@@H]2[C@H]([C@H](O[C@@H]3[C@H](O[C@@H](O)[C@H](O)[C@H]3O)CO)O[C@H](CO)[C@@H]2O)NC(C)=O)O[C@@H]1CO PEOXGOPDODNWAZ-KZBIEJSGSA-N 0.000 description 2
- BGWGXPAPYGQALX-ARQDHWQXSA-N beta-D-fructofuranose 6-phosphate Chemical compound OC[C@@]1(O)O[C@H](COP(O)(O)=O)[C@@H](O)[C@@H]1O BGWGXPAPYGQALX-ARQDHWQXSA-N 0.000 description 2
- 102000005936 beta-Galactosidase Human genes 0.000 description 2
- 108010005774 beta-Galactosidase Proteins 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 239000013592 cell lysate Substances 0.000 description 2
- 210000003763 chloroplast Anatomy 0.000 description 2
- RPKLZQLYODPWTM-KBMWBBLPSA-N cholanoic acid Chemical compound C1CC2CCCC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@@H](CCC(O)=O)C)[C@@]1(C)CC2 RPKLZQLYODPWTM-KBMWBBLPSA-N 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 2
- 230000007071 enzymatic hydrolysis Effects 0.000 description 2
- 238000006047 enzymatic hydrolysis reaction Methods 0.000 description 2
- 101150050376 fbaB gene Proteins 0.000 description 2
- 239000012634 fragment Substances 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 230000002538 fungal effect Effects 0.000 description 2
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 2
- 108091008053 gene clusters Proteins 0.000 description 2
- 239000008103 glucose Substances 0.000 description 2
- 108010078144 glutaminyl-glycine Proteins 0.000 description 2
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 2
- 150000004676 glycans Chemical class 0.000 description 2
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 2
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 2
- 108010087823 glycyltyrosine Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 230000003301 hydrolyzing effect Effects 0.000 description 2
- 230000000977 initiatory effect Effects 0.000 description 2
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 108010012058 leucyltyrosine Proteins 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 2
- 108010009298 lysylglutamic acid Proteins 0.000 description 2
- 101150032120 manC gene Proteins 0.000 description 2
- 239000002184 metal Substances 0.000 description 2
- 229910052751 metal Inorganic materials 0.000 description 2
- 150000002739 metals Chemical class 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 238000002703 mutagenesis Methods 0.000 description 2
- 231100000350 mutagenesis Toxicity 0.000 description 2
- 108010084572 phenylalanyl-valine Proteins 0.000 description 2
- 108010024607 phenylalanylalanine Proteins 0.000 description 2
- 239000013612 plasmid Substances 0.000 description 2
- 239000011541 reaction mixture Substances 0.000 description 2
- 238000011084 recovery Methods 0.000 description 2
- 238000004064 recycling Methods 0.000 description 2
- 108010026333 seryl-proline Proteins 0.000 description 2
- 239000011734 sodium Substances 0.000 description 2
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 2
- 235000013619 trace mineral Nutrition 0.000 description 2
- 239000011573 trace mineral Substances 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 230000017105 transposition Effects 0.000 description 2
- 108010044292 tryptophyltyrosine Proteins 0.000 description 2
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 2
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 2
- RSLLXTJELTWVHR-FNORWQNLSA-N (3e)-undeca-1,3-diene Chemical compound CCCCCCC\C=C\C=C RSLLXTJELTWVHR-FNORWQNLSA-N 0.000 description 1
- 108010083651 3-galactosyl-N-acetylglucosaminide 4-alpha-L-fucosyltransferase Proteins 0.000 description 1
- 101710157736 ATP-dependent 6-phosphofructokinase Proteins 0.000 description 1
- 102100033647 Activity-regulated cytoskeleton-associated protein Human genes 0.000 description 1
- 241000606828 Aggregatibacter aphrophilus Species 0.000 description 1
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- ODWSTKXGQGYHSH-FXQIFTODSA-N Ala-Arg-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O ODWSTKXGQGYHSH-FXQIFTODSA-N 0.000 description 1
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 1
- DWINFPQUSSHSFS-UVBJJODRSA-N Ala-Arg-Trp Chemical compound N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C12)C(=O)O DWINFPQUSSHSFS-UVBJJODRSA-N 0.000 description 1
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 1
- ZEXDYVGDZJBRMO-ACZMJKKPSA-N Ala-Asn-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZEXDYVGDZJBRMO-ACZMJKKPSA-N 0.000 description 1
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 1
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 1
- XQJAFSDFQZPYCU-UWJYBYFXSA-N Ala-Asn-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N XQJAFSDFQZPYCU-UWJYBYFXSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 1
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 1
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 1
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- BVSGPHDECMJBDE-HGNGGELXSA-N Ala-Glu-His Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BVSGPHDECMJBDE-HGNGGELXSA-N 0.000 description 1
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 1
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 1
- VBRDBGCROKWTPV-XHNCKOQMSA-N Ala-Glu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N VBRDBGCROKWTPV-XHNCKOQMSA-N 0.000 description 1
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 1
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 1
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 1
- JEPNLGMEZMCFEX-QSFUFRPTSA-N Ala-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C)N JEPNLGMEZMCFEX-QSFUFRPTSA-N 0.000 description 1
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 1
- GSHKMNKPMLXSQW-KBIXCLLPSA-N Ala-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C)N GSHKMNKPMLXSQW-KBIXCLLPSA-N 0.000 description 1
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 1
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- FUKFQILQFQKHLE-DCAQKATOSA-N Ala-Lys-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O FUKFQILQFQKHLE-DCAQKATOSA-N 0.000 description 1
- BLTRAARCJYVJKV-QEJZJMRPSA-N Ala-Lys-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(O)=O BLTRAARCJYVJKV-QEJZJMRPSA-N 0.000 description 1
- CHFFHQUVXHEGBY-GARJFASQSA-N Ala-Lys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CHFFHQUVXHEGBY-GARJFASQSA-N 0.000 description 1
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 1
- KQESEZXHYOUIIM-CQDKDKBSSA-N Ala-Lys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KQESEZXHYOUIIM-CQDKDKBSSA-N 0.000 description 1
- PVQLRJRPUTXFFX-CIUDSAMLSA-N Ala-Met-Gln Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O PVQLRJRPUTXFFX-CIUDSAMLSA-N 0.000 description 1
- OMDNCNKNEGFOMM-BQBZGAKWSA-N Ala-Met-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O OMDNCNKNEGFOMM-BQBZGAKWSA-N 0.000 description 1
- GKAZXNDATBWNBI-DCAQKATOSA-N Ala-Met-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N GKAZXNDATBWNBI-DCAQKATOSA-N 0.000 description 1
- GFEDXKNBZMPEDM-KZVJFYERSA-N Ala-Met-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFEDXKNBZMPEDM-KZVJFYERSA-N 0.000 description 1
- FVNAUOZKIPAYNA-BPNCWPANSA-N Ala-Met-Tyr Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FVNAUOZKIPAYNA-BPNCWPANSA-N 0.000 description 1
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 1
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 1
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 1
- WEZNQZHACPSMEF-QEJZJMRPSA-N Ala-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 WEZNQZHACPSMEF-QEJZJMRPSA-N 0.000 description 1
- DYJJJCHDHLEFDW-FXQIFTODSA-N Ala-Pro-Cys Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N DYJJJCHDHLEFDW-FXQIFTODSA-N 0.000 description 1
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 1
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 1
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 1
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 1
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 1
- HIIJOGIBQXHFKE-HHKYUTTNSA-N Ala-Thr-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O HIIJOGIBQXHFKE-HHKYUTTNSA-N 0.000 description 1
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 1
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 1
- ISCYZXFOCXWUJU-KZVJFYERSA-N Ala-Thr-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O ISCYZXFOCXWUJU-KZVJFYERSA-N 0.000 description 1
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- LTTLSZVJTDSACD-OWLDWWDNSA-N Ala-Thr-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O LTTLSZVJTDSACD-OWLDWWDNSA-N 0.000 description 1
- ZVWXMTTZJKBJCI-BHDSKKPTSA-N Ala-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 ZVWXMTTZJKBJCI-BHDSKKPTSA-N 0.000 description 1
- AOAKQKVICDWCLB-UWJYBYFXSA-N Ala-Tyr-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AOAKQKVICDWCLB-UWJYBYFXSA-N 0.000 description 1
- YCTIYBUTCKNOTI-UWJYBYFXSA-N Ala-Tyr-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCTIYBUTCKNOTI-UWJYBYFXSA-N 0.000 description 1
- BHFOJPDOQPWJRN-XDTLVQLUSA-N Ala-Tyr-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCC(N)=O)C(O)=O BHFOJPDOQPWJRN-XDTLVQLUSA-N 0.000 description 1
- BGGAIXWIZCIFSG-XDTLVQLUSA-N Ala-Tyr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O BGGAIXWIZCIFSG-XDTLVQLUSA-N 0.000 description 1
- XKXAZPSREVUCRT-BPNCWPANSA-N Ala-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=C(O)C=C1 XKXAZPSREVUCRT-BPNCWPANSA-N 0.000 description 1
- JPOQZCHGOTWRTM-FQPOAREZSA-N Ala-Tyr-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPOQZCHGOTWRTM-FQPOAREZSA-N 0.000 description 1
- MUGAESARFRGOTQ-IGNZVWTISA-N Ala-Tyr-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N MUGAESARFRGOTQ-IGNZVWTISA-N 0.000 description 1
- YEBZNKPPOHFZJM-BPNCWPANSA-N Ala-Tyr-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O YEBZNKPPOHFZJM-BPNCWPANSA-N 0.000 description 1
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- NLXLAEXVIDQMFP-UHFFFAOYSA-N Ammonium chloride Substances [NH4+].[Cl-] NLXLAEXVIDQMFP-UHFFFAOYSA-N 0.000 description 1
- VHUUQVKOLVNVRT-UHFFFAOYSA-N Ammonium hydroxide Chemical compound [NH4+].[OH-] VHUUQVKOLVNVRT-UHFFFAOYSA-N 0.000 description 1
- DFCIPNHFKOQAME-FXQIFTODSA-N Arg-Ala-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFCIPNHFKOQAME-FXQIFTODSA-N 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 1
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 1
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 1
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 1
- YUIGJDNAGKJLDO-JYJNAYRXSA-N Arg-Arg-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YUIGJDNAGKJLDO-JYJNAYRXSA-N 0.000 description 1
- RVDVDRUZWZIBJQ-CIUDSAMLSA-N Arg-Asn-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RVDVDRUZWZIBJQ-CIUDSAMLSA-N 0.000 description 1
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 1
- YFBGNGASPGRWEM-DCAQKATOSA-N Arg-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFBGNGASPGRWEM-DCAQKATOSA-N 0.000 description 1
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 1
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 1
- KBBKCNHWCDJPGN-GUBZILKMSA-N Arg-Gln-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KBBKCNHWCDJPGN-GUBZILKMSA-N 0.000 description 1
- BEXGZLUHRXTZCC-CIUDSAMLSA-N Arg-Gln-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N BEXGZLUHRXTZCC-CIUDSAMLSA-N 0.000 description 1
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 1
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 1
- PNIGSVZJNVUVJA-BQBZGAKWSA-N Arg-Gly-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O PNIGSVZJNVUVJA-BQBZGAKWSA-N 0.000 description 1
- PPPXVIBMLFWNSK-BQBZGAKWSA-N Arg-Gly-Cys Chemical compound C(C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N PPPXVIBMLFWNSK-BQBZGAKWSA-N 0.000 description 1
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 1
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 1
- DNUKXVMPARLPFN-XUXIUFHCSA-N Arg-Leu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DNUKXVMPARLPFN-XUXIUFHCSA-N 0.000 description 1
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 1
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 1
- CZUHPNLXLWMYMG-UBHSHLNASA-N Arg-Phe-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 CZUHPNLXLWMYMG-UBHSHLNASA-N 0.000 description 1
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 1
- UULLJGQFCDXVTQ-CYDGBPFRSA-N Arg-Pro-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UULLJGQFCDXVTQ-CYDGBPFRSA-N 0.000 description 1
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 1
- VRTWYUYCJGNFES-CIUDSAMLSA-N Arg-Ser-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O VRTWYUYCJGNFES-CIUDSAMLSA-N 0.000 description 1
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 1
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 1
- ZPWMEWYQBWSGAO-ZJDVBMNYSA-N Arg-Thr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZPWMEWYQBWSGAO-ZJDVBMNYSA-N 0.000 description 1
- DRDWXKWUSIKKOB-PJODQICGSA-N Arg-Trp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O DRDWXKWUSIKKOB-PJODQICGSA-N 0.000 description 1
- ZUVMUOOHJYNJPP-XIRDDKMYSA-N Arg-Trp-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZUVMUOOHJYNJPP-XIRDDKMYSA-N 0.000 description 1
- NVPHRWNWTKYIST-BPNCWPANSA-N Arg-Tyr-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 NVPHRWNWTKYIST-BPNCWPANSA-N 0.000 description 1
- QMQZYILAWUOLPV-JYJNAYRXSA-N Arg-Tyr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)CC1=CC=C(O)C=C1 QMQZYILAWUOLPV-JYJNAYRXSA-N 0.000 description 1
- PJOPLXOCKACMLK-KKUMJFAQSA-N Arg-Tyr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O PJOPLXOCKACMLK-KKUMJFAQSA-N 0.000 description 1
- VLIJAPRTSXSGFY-STQMWFEESA-N Arg-Tyr-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 VLIJAPRTSXSGFY-STQMWFEESA-N 0.000 description 1
- PSUXEQYPYZLNER-QXEWZRGKSA-N Arg-Val-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PSUXEQYPYZLNER-QXEWZRGKSA-N 0.000 description 1
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 1
- FMYQECOAIFGQGU-CYDGBPFRSA-N Arg-Val-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMYQECOAIFGQGU-CYDGBPFRSA-N 0.000 description 1
- ANAHQDPQQBDOBM-UHFFFAOYSA-N Arg-Val-Tyr Natural products CC(C)C(NC(=O)C(N)CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O ANAHQDPQQBDOBM-UHFFFAOYSA-N 0.000 description 1
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 1
- NXVGBGZQQFDUTM-XVYDVKMFSA-N Asn-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N NXVGBGZQQFDUTM-XVYDVKMFSA-N 0.000 description 1
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 1
- JJGRJMKUOYXZRA-LPEHRKFASA-N Asn-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O JJGRJMKUOYXZRA-LPEHRKFASA-N 0.000 description 1
- POOCJCRBHHMAOS-FXQIFTODSA-N Asn-Arg-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O POOCJCRBHHMAOS-FXQIFTODSA-N 0.000 description 1
- GOVUDFOGXOONFT-VEVYYDQMSA-N Asn-Arg-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GOVUDFOGXOONFT-VEVYYDQMSA-N 0.000 description 1
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 1
- KSBHCUSPLWRVEK-ZLUOBGJFSA-N Asn-Asn-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KSBHCUSPLWRVEK-ZLUOBGJFSA-N 0.000 description 1
- NVGWESORMHFISY-SRVKXCTJSA-N Asn-Asn-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NVGWESORMHFISY-SRVKXCTJSA-N 0.000 description 1
- NLCDVZJDEXIDDL-BIIVOSGPSA-N Asn-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O NLCDVZJDEXIDDL-BIIVOSGPSA-N 0.000 description 1
- QHBMKQWOIYJYMI-BYULHYEWSA-N Asn-Asn-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QHBMKQWOIYJYMI-BYULHYEWSA-N 0.000 description 1
- XVVOVPFMILMHPX-ZLUOBGJFSA-N Asn-Asp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XVVOVPFMILMHPX-ZLUOBGJFSA-N 0.000 description 1
- JZRLLSOWDYUKOK-SRVKXCTJSA-N Asn-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N JZRLLSOWDYUKOK-SRVKXCTJSA-N 0.000 description 1
- VYLVOMUVLMGCRF-ZLUOBGJFSA-N Asn-Asp-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VYLVOMUVLMGCRF-ZLUOBGJFSA-N 0.000 description 1
- IYVSIZAXNLOKFQ-BYULHYEWSA-N Asn-Asp-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IYVSIZAXNLOKFQ-BYULHYEWSA-N 0.000 description 1
- AYKKKGFJXIDYLX-ACZMJKKPSA-N Asn-Gln-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AYKKKGFJXIDYLX-ACZMJKKPSA-N 0.000 description 1
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 1
- UBKOVSLDWIHYSY-ACZMJKKPSA-N Asn-Glu-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UBKOVSLDWIHYSY-ACZMJKKPSA-N 0.000 description 1
- GFFRWIJAFFMQGM-NUMRIWBASA-N Asn-Glu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFFRWIJAFFMQGM-NUMRIWBASA-N 0.000 description 1
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 1
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 1
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 1
- OFQPMRDJVWLMNJ-CIUDSAMLSA-N Asn-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N OFQPMRDJVWLMNJ-CIUDSAMLSA-N 0.000 description 1
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 1
- OLISTMZJGQUOGS-GMOBBJLQSA-N Asn-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OLISTMZJGQUOGS-GMOBBJLQSA-N 0.000 description 1
- ANPFQTJEPONRPL-UGYAYLCHSA-N Asn-Ile-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O ANPFQTJEPONRPL-UGYAYLCHSA-N 0.000 description 1
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 1
- XLZCLJRGGMBKLR-PCBIJLKTSA-N Asn-Ile-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XLZCLJRGGMBKLR-PCBIJLKTSA-N 0.000 description 1
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 1
- HFPXZWPUVFVNLL-GUBZILKMSA-N Asn-Leu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFPXZWPUVFVNLL-GUBZILKMSA-N 0.000 description 1
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 1
- FTSAJSADJCMDHH-CIUDSAMLSA-N Asn-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FTSAJSADJCMDHH-CIUDSAMLSA-N 0.000 description 1
- LZLCLRQMUQWUHJ-GUBZILKMSA-N Asn-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N LZLCLRQMUQWUHJ-GUBZILKMSA-N 0.000 description 1
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 description 1
- GIQCDTKOIPUDSG-GARJFASQSA-N Asn-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N)C(=O)O GIQCDTKOIPUDSG-GARJFASQSA-N 0.000 description 1
- COWITDLVHMZSIW-CIUDSAMLSA-N Asn-Lys-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O COWITDLVHMZSIW-CIUDSAMLSA-N 0.000 description 1
- XFJKRRCWLTZIQA-XIRDDKMYSA-N Asn-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N XFJKRRCWLTZIQA-XIRDDKMYSA-N 0.000 description 1
- MDDXKBHIMYYJLW-FXQIFTODSA-N Asn-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N MDDXKBHIMYYJLW-FXQIFTODSA-N 0.000 description 1
- QGABLMITFKUQDF-DCAQKATOSA-N Asn-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N QGABLMITFKUQDF-DCAQKATOSA-N 0.000 description 1
- RTFWCVDISAMGEQ-SRVKXCTJSA-N Asn-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N RTFWCVDISAMGEQ-SRVKXCTJSA-N 0.000 description 1
- YUUIAUXBNOHFRJ-IHRRRGAJSA-N Asn-Phe-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O YUUIAUXBNOHFRJ-IHRRRGAJSA-N 0.000 description 1
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 1
- UYCPJVYQYARFGB-YDHLFZDLSA-N Asn-Phe-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UYCPJVYQYARFGB-YDHLFZDLSA-N 0.000 description 1
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 1
- JTXVXGXTRXMOFJ-FXQIFTODSA-N Asn-Pro-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O JTXVXGXTRXMOFJ-FXQIFTODSA-N 0.000 description 1
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 1
- YWFLXGZHZXXINF-BPUTZDHNSA-N Asn-Pro-Trp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CNC2=CC=CC=C12 YWFLXGZHZXXINF-BPUTZDHNSA-N 0.000 description 1
- JWQWPRCDYWNVNM-ACZMJKKPSA-N Asn-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N JWQWPRCDYWNVNM-ACZMJKKPSA-N 0.000 description 1
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 1
- ZUFPUBYQYWCMDB-NUMRIWBASA-N Asn-Thr-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZUFPUBYQYWCMDB-NUMRIWBASA-N 0.000 description 1
- PUUPMDXIHCOPJU-HJGDQZAQSA-N Asn-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O PUUPMDXIHCOPJU-HJGDQZAQSA-N 0.000 description 1
- WUQXMTITJLFXAU-JIOCBJNQSA-N Asn-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N)O WUQXMTITJLFXAU-JIOCBJNQSA-N 0.000 description 1
- KZYSHAMXEBPJBD-JRQIVUDYSA-N Asn-Thr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZYSHAMXEBPJBD-JRQIVUDYSA-N 0.000 description 1
- IPPFAOCLQSGHJV-WFBYXXMGSA-N Asn-Trp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O IPPFAOCLQSGHJV-WFBYXXMGSA-N 0.000 description 1
- QIRJQYQOIKBPBZ-IHRRRGAJSA-N Asn-Tyr-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QIRJQYQOIKBPBZ-IHRRRGAJSA-N 0.000 description 1
- NSTBNYOKCZKOMI-AVGNSLFASA-N Asn-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O NSTBNYOKCZKOMI-AVGNSLFASA-N 0.000 description 1
- QUCCLIXMVPIVOB-BZSNNMDCSA-N Asn-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N QUCCLIXMVPIVOB-BZSNNMDCSA-N 0.000 description 1
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 1
- ZAESWDKAMDVHLL-RCOVLWMOSA-N Asn-Val-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O ZAESWDKAMDVHLL-RCOVLWMOSA-N 0.000 description 1
- LMIWYCWRJVMAIQ-NHCYSSNCSA-N Asn-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N LMIWYCWRJVMAIQ-NHCYSSNCSA-N 0.000 description 1
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 1
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 1
- HBUJSDCLZCXXCW-YDHLFZDLSA-N Asn-Val-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HBUJSDCLZCXXCW-YDHLFZDLSA-N 0.000 description 1
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 1
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 1
- ZVTDYGWRRPMFCL-WFBYXXMGSA-N Asp-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N ZVTDYGWRRPMFCL-WFBYXXMGSA-N 0.000 description 1
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 1
- UQBGYPFHWFZMCD-ZLUOBGJFSA-N Asp-Asn-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UQBGYPFHWFZMCD-ZLUOBGJFSA-N 0.000 description 1
- QRULNKJGYQQZMW-ZLUOBGJFSA-N Asp-Asn-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QRULNKJGYQQZMW-ZLUOBGJFSA-N 0.000 description 1
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 1
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 1
- UGIBTKGQVWFTGX-BIIVOSGPSA-N Asp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O UGIBTKGQVWFTGX-BIIVOSGPSA-N 0.000 description 1
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 1
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 1
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 1
- ZCKYZTGLXIEOKS-CIUDSAMLSA-N Asp-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N ZCKYZTGLXIEOKS-CIUDSAMLSA-N 0.000 description 1
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 1
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 1
- LKIYSIYBKYLKPU-BIIVOSGPSA-N Asp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O LKIYSIYBKYLKPU-BIIVOSGPSA-N 0.000 description 1
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 1
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 1
- CSEJMKNZDCJYGJ-XHNCKOQMSA-N Asp-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O CSEJMKNZDCJYGJ-XHNCKOQMSA-N 0.000 description 1
- JRBVWZLHBGYZNY-QEJZJMRPSA-N Asp-Gln-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRBVWZLHBGYZNY-QEJZJMRPSA-N 0.000 description 1
- OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 1
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 1
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 1
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 1
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 1
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 1
- LDGUZSIPGSPBJP-XVYDVKMFSA-N Asp-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N LDGUZSIPGSPBJP-XVYDVKMFSA-N 0.000 description 1
- WSXDIZFNQYTUJB-SRVKXCTJSA-N Asp-His-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O WSXDIZFNQYTUJB-SRVKXCTJSA-N 0.000 description 1
- SWTQDYFZVOJVLL-KKUMJFAQSA-N Asp-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N)O SWTQDYFZVOJVLL-KKUMJFAQSA-N 0.000 description 1
- LDLZOAJRXXBVGF-GMOBBJLQSA-N Asp-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N LDLZOAJRXXBVGF-GMOBBJLQSA-N 0.000 description 1
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 1
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 1
- SCQIQCWLOMOEFP-DCAQKATOSA-N Asp-Leu-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SCQIQCWLOMOEFP-DCAQKATOSA-N 0.000 description 1
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 1
- OEDJQRXNDRUGEU-SRVKXCTJSA-N Asp-Leu-His Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O OEDJQRXNDRUGEU-SRVKXCTJSA-N 0.000 description 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 1
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 1
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 1
- ZXRQJQCXPSMNMR-XIRDDKMYSA-N Asp-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N ZXRQJQCXPSMNMR-XIRDDKMYSA-N 0.000 description 1
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 1
- SARSTIZOZFBDOM-FXQIFTODSA-N Asp-Met-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O SARSTIZOZFBDOM-FXQIFTODSA-N 0.000 description 1
- GYWQGGUCMDCUJE-DLOVCJGASA-N Asp-Phe-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O GYWQGGUCMDCUJE-DLOVCJGASA-N 0.000 description 1
- WOPJVEMFXYHZEE-SRVKXCTJSA-N Asp-Phe-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WOPJVEMFXYHZEE-SRVKXCTJSA-N 0.000 description 1
- KRQFMDNIUOVRIF-KKUMJFAQSA-N Asp-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)O)N KRQFMDNIUOVRIF-KKUMJFAQSA-N 0.000 description 1
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 1
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 1
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 1
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 1
- XYPJXLLXNSAWHZ-SRVKXCTJSA-N Asp-Ser-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XYPJXLLXNSAWHZ-SRVKXCTJSA-N 0.000 description 1
- UTLCRGFJFSZWAW-OLHMAJIHSA-N Asp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UTLCRGFJFSZWAW-OLHMAJIHSA-N 0.000 description 1
- KBJVTFWQWXCYCQ-IUKAMOBKSA-N Asp-Thr-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KBJVTFWQWXCYCQ-IUKAMOBKSA-N 0.000 description 1
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 1
- XOASPVGNFAMYBD-WFBYXXMGSA-N Asp-Trp-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O XOASPVGNFAMYBD-WFBYXXMGSA-N 0.000 description 1
- YUELDQUPTAYEGM-XIRDDKMYSA-N Asp-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)O)N YUELDQUPTAYEGM-XIRDDKMYSA-N 0.000 description 1
- FIRWLDUOFOULCA-XIRDDKMYSA-N Asp-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N FIRWLDUOFOULCA-XIRDDKMYSA-N 0.000 description 1
- OYSYWMMZGJSQRB-AVGNSLFASA-N Asp-Tyr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O OYSYWMMZGJSQRB-AVGNSLFASA-N 0.000 description 1
- AWPWHMVCSISSQK-QWRGUYRKSA-N Asp-Tyr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O AWPWHMVCSISSQK-QWRGUYRKSA-N 0.000 description 1
- BJDHEININLSZOT-KKUMJFAQSA-N Asp-Tyr-Lys Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(O)=O BJDHEININLSZOT-KKUMJFAQSA-N 0.000 description 1
- OQMGSMNZVHYDTQ-ZKWXMUAHSA-N Asp-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N OQMGSMNZVHYDTQ-ZKWXMUAHSA-N 0.000 description 1
- XQFLFQWOBXPMHW-NHCYSSNCSA-N Asp-Val-His Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O XQFLFQWOBXPMHW-NHCYSSNCSA-N 0.000 description 1
- 101100075927 Aspergillus aculeatus mndA gene Proteins 0.000 description 1
- 241000193755 Bacillus cereus Species 0.000 description 1
- 241000193752 Bacillus circulans Species 0.000 description 1
- 241000193749 Bacillus coagulans Species 0.000 description 1
- 241000194108 Bacillus licheniformis Species 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 101100508888 Bacillus subtilis (strain 168) iolJ gene Proteins 0.000 description 1
- 241000770536 Bacillus thermophilus Species 0.000 description 1
- 108700003860 Bacterial Genes Proteins 0.000 description 1
- 241000606124 Bacteroides fragilis Species 0.000 description 1
- 241000186000 Bifidobacterium Species 0.000 description 1
- 241000193417 Brevibacillus laterosporus Species 0.000 description 1
- 240000001817 Cereus hexagonus Species 0.000 description 1
- 241000193403 Clostridium Species 0.000 description 1
- 241000193401 Clostridium acetobutylicum Species 0.000 description 1
- 241001656809 Clostridium autoethanogenum Species 0.000 description 1
- 241000186566 Clostridium ljungdahlii Species 0.000 description 1
- 229910021591 Copper(I) chloride Inorganic materials 0.000 description 1
- 241000186216 Corynebacterium Species 0.000 description 1
- 241000186226 Corynebacterium glutamicum Species 0.000 description 1
- 108010051219 Cre recombinase Proteins 0.000 description 1
- YFXFOZPXVFPBDH-VZFHVOOUSA-N Cys-Ala-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)CS)C(O)=O YFXFOZPXVFPBDH-VZFHVOOUSA-N 0.000 description 1
- NDUSUIGBMZCOIL-ZKWXMUAHSA-N Cys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N NDUSUIGBMZCOIL-ZKWXMUAHSA-N 0.000 description 1
- PORWNQWEEIOIRH-XHNCKOQMSA-N Cys-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N)C(=O)O PORWNQWEEIOIRH-XHNCKOQMSA-N 0.000 description 1
- DZSICRGTVPDCRN-YUMQZZPRSA-N Cys-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N DZSICRGTVPDCRN-YUMQZZPRSA-N 0.000 description 1
- VPQZSNQICFCCSO-BJDJZHNGSA-N Cys-Leu-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VPQZSNQICFCCSO-BJDJZHNGSA-N 0.000 description 1
- IDZDFWJNPOOOHE-KKUMJFAQSA-N Cys-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N IDZDFWJNPOOOHE-KKUMJFAQSA-N 0.000 description 1
- MJOYUXLETJMQGG-IHRRRGAJSA-N Cys-Tyr-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MJOYUXLETJMQGG-IHRRRGAJSA-N 0.000 description 1
- UGPCUUWZXRMCIJ-KKUMJFAQSA-N Cys-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CS)N UGPCUUWZXRMCIJ-KKUMJFAQSA-N 0.000 description 1
- 108010084372 D-arabinose isomerase Proteins 0.000 description 1
- XPYBSIWDXQFNMH-UHFFFAOYSA-N D-fructose 1,6-bisphosphate Natural products OP(=O)(O)OCC(O)C(O)C(O)C(=O)COP(O)(O)=O XPYBSIWDXQFNMH-UHFFFAOYSA-N 0.000 description 1
- LXJXRIRHZLFYRP-VKHMYHEASA-N D-glyceraldehyde 3-phosphate Chemical compound O=C[C@H](O)COP(O)(O)=O LXJXRIRHZLFYRP-VKHMYHEASA-N 0.000 description 1
- AEMOLEFTQBMNLQ-VANFPWTGSA-N D-mannopyranuronic acid Chemical compound OC1O[C@H](C(O)=O)[C@@H](O)[C@H](O)[C@@H]1O AEMOLEFTQBMNLQ-VANFPWTGSA-N 0.000 description 1
- 108700020911 DNA-Binding Proteins Proteins 0.000 description 1
- 101710096438 DNA-binding protein Proteins 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 241000194033 Enterococcus Species 0.000 description 1
- 241000194031 Enterococcus faecium Species 0.000 description 1
- 108010046276 FLP recombinase Proteins 0.000 description 1
- 238000012366 Fed-batch cultivation Methods 0.000 description 1
- 102000001390 Fructose-Bisphosphate Aldolase Human genes 0.000 description 1
- 108010068561 Fructose-Bisphosphate Aldolase Proteins 0.000 description 1
- 108700005088 Fungal Genes Proteins 0.000 description 1
- 102100024515 GDP-L-fucose synthase Human genes 0.000 description 1
- 108030006298 GDP-L-fucose synthases Proteins 0.000 description 1
- 108010062427 GDP-mannose 4,6-dehydratase Proteins 0.000 description 1
- 102000002312 GDPmannose 4,6-dehydratase Human genes 0.000 description 1
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 1
- WUAYFMZULZDSLB-ACZMJKKPSA-N Gln-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O WUAYFMZULZDSLB-ACZMJKKPSA-N 0.000 description 1
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 1
- LKUWAWGNJYJODH-KBIXCLLPSA-N Gln-Ala-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKUWAWGNJYJODH-KBIXCLLPSA-N 0.000 description 1
- MWLYSLMKFXWZPW-ZPFDUUQYSA-N Gln-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCC(N)=O MWLYSLMKFXWZPW-ZPFDUUQYSA-N 0.000 description 1
- KJRXLVZYJJLUCV-DCAQKATOSA-N Gln-Arg-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O KJRXLVZYJJLUCV-DCAQKATOSA-N 0.000 description 1
- OETQLUYCMBARHJ-CIUDSAMLSA-N Gln-Asn-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OETQLUYCMBARHJ-CIUDSAMLSA-N 0.000 description 1
- MGJMFSBEMSNYJL-AVGNSLFASA-N Gln-Asn-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MGJMFSBEMSNYJL-AVGNSLFASA-N 0.000 description 1
- LMPBBFWHCRURJD-LAEOZQHASA-N Gln-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LMPBBFWHCRURJD-LAEOZQHASA-N 0.000 description 1
- CYTSBCIIEHUPDU-ACZMJKKPSA-N Gln-Asp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O CYTSBCIIEHUPDU-ACZMJKKPSA-N 0.000 description 1
- IKDOHQHEFPPGJG-FXQIFTODSA-N Gln-Asp-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IKDOHQHEFPPGJG-FXQIFTODSA-N 0.000 description 1
- JFSNBQJNDMXMQF-XHNCKOQMSA-N Gln-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JFSNBQJNDMXMQF-XHNCKOQMSA-N 0.000 description 1
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 1
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 1
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 1
- XKBASPWPBXNVLQ-WDSKDSINSA-N Gln-Gly-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XKBASPWPBXNVLQ-WDSKDSINSA-N 0.000 description 1
- GNMQDOGFWYWPNM-LAEOZQHASA-N Gln-Gly-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)CNC(=O)[C@@H](N)CCC(N)=O)C(O)=O GNMQDOGFWYWPNM-LAEOZQHASA-N 0.000 description 1
- NSORZJXKUQFEKL-JGVFFNPUSA-N Gln-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)N)N)C(=O)O NSORZJXKUQFEKL-JGVFFNPUSA-N 0.000 description 1
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 1
- XQEAVUJIRZRLQQ-SZMVWBNQSA-N Gln-His-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC3=CN=CN3)NC(=O)[C@H](CCC(=O)N)N XQEAVUJIRZRLQQ-SZMVWBNQSA-N 0.000 description 1
- TWTWUBHEWQPMQW-ZPFDUUQYSA-N Gln-Ile-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWTWUBHEWQPMQW-ZPFDUUQYSA-N 0.000 description 1
- HXOLDXKNWKLDMM-YVNDNENWSA-N Gln-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HXOLDXKNWKLDMM-YVNDNENWSA-N 0.000 description 1
- MTCXQQINVAFZKW-MNXVOIDGSA-N Gln-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MTCXQQINVAFZKW-MNXVOIDGSA-N 0.000 description 1
- MWERYIXRDZDXOA-QEWYBTABSA-N Gln-Ile-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MWERYIXRDZDXOA-QEWYBTABSA-N 0.000 description 1
- ITZWDGBYBPUZRG-KBIXCLLPSA-N Gln-Ile-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O ITZWDGBYBPUZRG-KBIXCLLPSA-N 0.000 description 1
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 1
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 1
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 1
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 1
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 1
- DQLVHRFFBQOWFL-JYJNAYRXSA-N Gln-Lys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)O DQLVHRFFBQOWFL-JYJNAYRXSA-N 0.000 description 1
- DRNMNLKUUKKPIA-HTUGSXCWSA-N Gln-Phe-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)CCC(N)=O)C(O)=O DRNMNLKUUKKPIA-HTUGSXCWSA-N 0.000 description 1
- FNAJNWPDTIXYJN-CIUDSAMLSA-N Gln-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O FNAJNWPDTIXYJN-CIUDSAMLSA-N 0.000 description 1
- PIUPHASDUFSHTF-CIUDSAMLSA-N Gln-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O PIUPHASDUFSHTF-CIUDSAMLSA-N 0.000 description 1
- HMIXCETWRYDVMO-GUBZILKMSA-N Gln-Pro-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O HMIXCETWRYDVMO-GUBZILKMSA-N 0.000 description 1
- OKARHJKJTKFQBM-ACZMJKKPSA-N Gln-Ser-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OKARHJKJTKFQBM-ACZMJKKPSA-N 0.000 description 1
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 1
- ZGHMRONFHDVXEF-AVGNSLFASA-N Gln-Ser-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZGHMRONFHDVXEF-AVGNSLFASA-N 0.000 description 1
- SYZZMPFLOLSMHL-XHNCKOQMSA-N Gln-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SYZZMPFLOLSMHL-XHNCKOQMSA-N 0.000 description 1
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 1
- ININBLZFFVOQIO-JHEQGTHGSA-N Gln-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O ININBLZFFVOQIO-JHEQGTHGSA-N 0.000 description 1
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 1
- OUBUHIODTNUUTC-WDCWCFNPSA-N Gln-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O OUBUHIODTNUUTC-WDCWCFNPSA-N 0.000 description 1
- ARYKRXHBIPLULY-XKBZYTNZSA-N Gln-Thr-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ARYKRXHBIPLULY-XKBZYTNZSA-N 0.000 description 1
- STHSGOZLFLFGSS-SUSMZKCASA-N Gln-Thr-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STHSGOZLFLFGSS-SUSMZKCASA-N 0.000 description 1
- XMWNHGKDDIFXQJ-NWLDYVSISA-N Gln-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O XMWNHGKDDIFXQJ-NWLDYVSISA-N 0.000 description 1
- RBSKVTZUFMIWFU-XEGUGMAKSA-N Gln-Trp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O RBSKVTZUFMIWFU-XEGUGMAKSA-N 0.000 description 1
- YJCZUTXLPXBNIO-BHYGNILZSA-N Gln-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CCC(=O)N)N)C(=O)O YJCZUTXLPXBNIO-BHYGNILZSA-N 0.000 description 1
- CTJRFALAOYAJBX-NWLDYVSISA-N Gln-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)N)N)O CTJRFALAOYAJBX-NWLDYVSISA-N 0.000 description 1
- NVHJGTGTUGEWCG-ZVZYQTTQSA-N Gln-Trp-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O NVHJGTGTUGEWCG-ZVZYQTTQSA-N 0.000 description 1
- VYOILACOFPPNQH-UMNHJUIQSA-N Gln-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N VYOILACOFPPNQH-UMNHJUIQSA-N 0.000 description 1
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 1
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 1
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 1
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 1
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 1
- BPDVTFBJZNBHEU-HGNGGELXSA-N Glu-Ala-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 BPDVTFBJZNBHEU-HGNGGELXSA-N 0.000 description 1
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 1
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 1
- PBEQPAZRHDVJQI-SRVKXCTJSA-N Glu-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N PBEQPAZRHDVJQI-SRVKXCTJSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 1
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 1
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 description 1
- PKYAVRMYTBBRLS-FXQIFTODSA-N Glu-Cys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O PKYAVRMYTBBRLS-FXQIFTODSA-N 0.000 description 1
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 1
- GYCPQVFKCPPRQB-GUBZILKMSA-N Glu-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N GYCPQVFKCPPRQB-GUBZILKMSA-N 0.000 description 1
- RFDHKPSHTXZKLL-IHRRRGAJSA-N Glu-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N RFDHKPSHTXZKLL-IHRRRGAJSA-N 0.000 description 1
- WPLGNDORMXTMQS-FXQIFTODSA-N Glu-Gln-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O WPLGNDORMXTMQS-FXQIFTODSA-N 0.000 description 1
- MIQCYAJSDGNCNK-BPUTZDHNSA-N Glu-Gln-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O MIQCYAJSDGNCNK-BPUTZDHNSA-N 0.000 description 1
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 1
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 1
- WRNAXCVRSBBKGS-BQBZGAKWSA-N Glu-Gly-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O WRNAXCVRSBBKGS-BQBZGAKWSA-N 0.000 description 1
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 1
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 1
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 1
- XMPAXPSENRSOSV-RYUDHWBXSA-N Glu-Gly-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XMPAXPSENRSOSV-RYUDHWBXSA-N 0.000 description 1
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 1
- WVTIBGWZUMJBFY-GUBZILKMSA-N Glu-His-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O WVTIBGWZUMJBFY-GUBZILKMSA-N 0.000 description 1
- WDTAKCUOIKHCTB-NKIYYHGXSA-N Glu-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N)O WDTAKCUOIKHCTB-NKIYYHGXSA-N 0.000 description 1
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 1
- WVYJNPCWJYBHJG-YVNDNENWSA-N Glu-Ile-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O WVYJNPCWJYBHJG-YVNDNENWSA-N 0.000 description 1
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 1
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 1
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 1
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 1
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 1
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 1
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 1
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 1
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 1
- NPMSEUWUMOSEFM-CIUDSAMLSA-N Glu-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N NPMSEUWUMOSEFM-CIUDSAMLSA-N 0.000 description 1
- HQOGXFLBAKJUMH-CIUDSAMLSA-N Glu-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N HQOGXFLBAKJUMH-CIUDSAMLSA-N 0.000 description 1
- YHOJJFFTSMWVGR-HJGDQZAQSA-N Glu-Met-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YHOJJFFTSMWVGR-HJGDQZAQSA-N 0.000 description 1
- DXVOKNVIKORTHQ-GUBZILKMSA-N Glu-Pro-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O DXVOKNVIKORTHQ-GUBZILKMSA-N 0.000 description 1
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 1
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 1
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 1
- QOXDAWODGSIDDI-GUBZILKMSA-N Glu-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N QOXDAWODGSIDDI-GUBZILKMSA-N 0.000 description 1
- JWNZHMSRZXXGTM-XKBZYTNZSA-N Glu-Ser-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWNZHMSRZXXGTM-XKBZYTNZSA-N 0.000 description 1
- TZXOPHFCAATANZ-QEJZJMRPSA-N Glu-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N TZXOPHFCAATANZ-QEJZJMRPSA-N 0.000 description 1
- DTLLNDVORUEOTM-WDCWCFNPSA-N Glu-Thr-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DTLLNDVORUEOTM-WDCWCFNPSA-N 0.000 description 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 1
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 1
- QVXWAFZDWRLXTI-NWLDYVSISA-N Glu-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QVXWAFZDWRLXTI-NWLDYVSISA-N 0.000 description 1
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 1
- ZTNHPMZHAILHRB-JSGCOSHPSA-N Glu-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)NCC(O)=O)=CNC2=C1 ZTNHPMZHAILHRB-JSGCOSHPSA-N 0.000 description 1
- QEJKKJNDDDPSMU-KKUMJFAQSA-N Glu-Tyr-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(O)=O QEJKKJNDDDPSMU-KKUMJFAQSA-N 0.000 description 1
- PMSDOVISAARGAV-FHWLQOOXSA-N Glu-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 PMSDOVISAARGAV-FHWLQOOXSA-N 0.000 description 1
- HBMRTXJZQDVRFT-DZKIICNBSA-N Glu-Tyr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HBMRTXJZQDVRFT-DZKIICNBSA-N 0.000 description 1
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 1
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 1
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 1
- XIJOPMSILDNVNJ-ZVZYQTTQSA-N Glu-Val-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XIJOPMSILDNVNJ-ZVZYQTTQSA-N 0.000 description 1
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 1
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 1
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 1
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 1
- KFMBRBPXHVMDFN-UWVGGRQHSA-N Gly-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCNC(N)=N KFMBRBPXHVMDFN-UWVGGRQHSA-N 0.000 description 1
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 1
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 1
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 1
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 1
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 1
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 1
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 1
- LURCIJSJAKFCRO-QWRGUYRKSA-N Gly-Asn-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LURCIJSJAKFCRO-QWRGUYRKSA-N 0.000 description 1
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 1
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 1
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 1
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 1
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 1
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 1
- QGZSAHIZRQHCEQ-QWRGUYRKSA-N Gly-Asp-Tyr Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QGZSAHIZRQHCEQ-QWRGUYRKSA-N 0.000 description 1
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 1
- VNBNZUAPOYGRDB-ZDLURKLDSA-N Gly-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)O VNBNZUAPOYGRDB-ZDLURKLDSA-N 0.000 description 1
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 1
- BPQYBFAXRGMGGY-LAEOZQHASA-N Gly-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN BPQYBFAXRGMGGY-LAEOZQHASA-N 0.000 description 1
- GNPVTZJUUBPZKW-WDSKDSINSA-N Gly-Gln-Ser Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GNPVTZJUUBPZKW-WDSKDSINSA-N 0.000 description 1
- NPSWCZIRBAYNSB-JHEQGTHGSA-N Gly-Gln-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPSWCZIRBAYNSB-JHEQGTHGSA-N 0.000 description 1
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 1
- FIQQRCFQXGLOSZ-WDSKDSINSA-N Gly-Glu-Asp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FIQQRCFQXGLOSZ-WDSKDSINSA-N 0.000 description 1
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 1
- LHRXAHLCRMQBGJ-RYUDHWBXSA-N Gly-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN LHRXAHLCRMQBGJ-RYUDHWBXSA-N 0.000 description 1
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 1
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 1
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 1
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 1
- VAXIVIPMCTYSHI-YUMQZZPRSA-N Gly-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN VAXIVIPMCTYSHI-YUMQZZPRSA-N 0.000 description 1
- HPAIKDPJURGQLN-KBPBESRZSA-N Gly-His-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CNC=N1 HPAIKDPJURGQLN-KBPBESRZSA-N 0.000 description 1
- SIYTVHWNKGIGMD-HOTGVXAUSA-N Gly-His-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC3=CN=CN3)NC(=O)CN SIYTVHWNKGIGMD-HOTGVXAUSA-N 0.000 description 1
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 1
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 1
- VIIBEIQMLJEUJG-LAEOZQHASA-N Gly-Ile-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O VIIBEIQMLJEUJG-LAEOZQHASA-N 0.000 description 1
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 1
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 1
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 1
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 1
- YSDLIYZLOTZZNP-UWVGGRQHSA-N Gly-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN YSDLIYZLOTZZNP-UWVGGRQHSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- BXICSAQLIHFDDL-YUMQZZPRSA-N Gly-Lys-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BXICSAQLIHFDDL-YUMQZZPRSA-N 0.000 description 1
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 1
- IUKIDFVOUHZRAK-QWRGUYRKSA-N Gly-Lys-His Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IUKIDFVOUHZRAK-QWRGUYRKSA-N 0.000 description 1
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 1
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 1
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 1
- YHYDTTUSJXGTQK-UWVGGRQHSA-N Gly-Met-Leu Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(C)C)C(O)=O YHYDTTUSJXGTQK-UWVGGRQHSA-N 0.000 description 1
- WZSHYFGOLPXPLL-RYUDHWBXSA-N Gly-Phe-Glu Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(O)=O)C(O)=O WZSHYFGOLPXPLL-RYUDHWBXSA-N 0.000 description 1
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 1
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 1
- YLEIWGJJBFBFHC-KBPBESRZSA-N Gly-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 YLEIWGJJBFBFHC-KBPBESRZSA-N 0.000 description 1
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 1
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 1
- HJARVELKOSZUEW-YUMQZZPRSA-N Gly-Pro-Gln Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJARVELKOSZUEW-YUMQZZPRSA-N 0.000 description 1
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 1
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 1
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- RHRLHXQWHCNJKR-PMVVWTBXSA-N Gly-Thr-His Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 RHRLHXQWHCNJKR-PMVVWTBXSA-N 0.000 description 1
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- BXDLTKLPPKBVEL-FJXKBIBVSA-N Gly-Thr-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O BXDLTKLPPKBVEL-FJXKBIBVSA-N 0.000 description 1
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 1
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 1
- WSWWTQYHFCBKBT-DVJZZOLTSA-N Gly-Thr-Trp Chemical compound C[C@@H](O)[C@H](NC(=O)CN)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O WSWWTQYHFCBKBT-DVJZZOLTSA-N 0.000 description 1
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 1
- RCHFYMASWAZQQZ-ZANVPECISA-N Gly-Trp-Ala Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)CN)=CNC2=C1 RCHFYMASWAZQQZ-ZANVPECISA-N 0.000 description 1
- UMRIXLHPZZIOML-OALUTQOASA-N Gly-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)CN UMRIXLHPZZIOML-OALUTQOASA-N 0.000 description 1
- UMBDRSMLCUYIRI-DVJZZOLTSA-N Gly-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN)O UMBDRSMLCUYIRI-DVJZZOLTSA-N 0.000 description 1
- MREVELMMFOLESM-HOCLYGCPSA-N Gly-Trp-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O MREVELMMFOLESM-HOCLYGCPSA-N 0.000 description 1
- UIQGJYUEQDOODF-KWQFWETISA-N Gly-Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 UIQGJYUEQDOODF-KWQFWETISA-N 0.000 description 1
- PNUFMLXHOLFRLD-KBPBESRZSA-N Gly-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 PNUFMLXHOLFRLD-KBPBESRZSA-N 0.000 description 1
- IHDKKJVBLGXLEL-STQMWFEESA-N Gly-Tyr-Met Chemical compound CSCC[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)CN)C(O)=O IHDKKJVBLGXLEL-STQMWFEESA-N 0.000 description 1
- OCRQUYDOYKCOQG-IRXDYDNUSA-N Gly-Tyr-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 OCRQUYDOYKCOQG-IRXDYDNUSA-N 0.000 description 1
- LYZYGGWCBLBDMC-QWHCGFSZSA-N Gly-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)CN)C(=O)O LYZYGGWCBLBDMC-QWHCGFSZSA-N 0.000 description 1
- JYGYNWYVKXENNE-OALUTQOASA-N Gly-Tyr-Trp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JYGYNWYVKXENNE-OALUTQOASA-N 0.000 description 1
- NGBGZCUWFVVJKC-IRXDYDNUSA-N Gly-Tyr-Tyr Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NGBGZCUWFVVJKC-IRXDYDNUSA-N 0.000 description 1
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 1
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 1
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- 229930186217 Glycolipid Natural products 0.000 description 1
- 102000003886 Glycoproteins Human genes 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- 101100511168 Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) lex1 gene Proteins 0.000 description 1
- QIVPRLJQQVXCIY-HGNGGELXSA-N His-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCC(N)=O)C(O)=O QIVPRLJQQVXCIY-HGNGGELXSA-N 0.000 description 1
- DCRODRAURLJOFY-XPUUQOCRSA-N His-Ala-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)NCC(O)=O DCRODRAURLJOFY-XPUUQOCRSA-N 0.000 description 1
- HTZKFIYQMHJWSQ-INTQDDNPSA-N His-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HTZKFIYQMHJWSQ-INTQDDNPSA-N 0.000 description 1
- IDNNYVGVSZMQTK-IHRRRGAJSA-N His-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N IDNNYVGVSZMQTK-IHRRRGAJSA-N 0.000 description 1
- QZAFGJNKLMNDEM-DCAQKATOSA-N His-Asn-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 QZAFGJNKLMNDEM-DCAQKATOSA-N 0.000 description 1
- VBOFRJNDIOPNDO-YUMQZZPRSA-N His-Gly-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N VBOFRJNDIOPNDO-YUMQZZPRSA-N 0.000 description 1
- FDQYIRHBVVUTJF-ZETCQYMHSA-N His-Gly-Gly Chemical compound [O-]C(=O)CNC(=O)CNC(=O)[C@@H]([NH3+])CC1=CN=CN1 FDQYIRHBVVUTJF-ZETCQYMHSA-N 0.000 description 1
- RGPWUJOMKFYFSR-QWRGUYRKSA-N His-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RGPWUJOMKFYFSR-QWRGUYRKSA-N 0.000 description 1
- NTXIJPDAHXSHNL-ONGXEEELSA-N His-Gly-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NTXIJPDAHXSHNL-ONGXEEELSA-N 0.000 description 1
- KAFZDWMZKGQDEE-SRVKXCTJSA-N His-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KAFZDWMZKGQDEE-SRVKXCTJSA-N 0.000 description 1
- 108010093488 His-His-His-His-His-His Proteins 0.000 description 1
- NDKSHNQINMRKHT-PEXQALLHSA-N His-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N NDKSHNQINMRKHT-PEXQALLHSA-N 0.000 description 1
- PGRPSOUCWRBWKZ-DLOVCJGASA-N His-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 PGRPSOUCWRBWKZ-DLOVCJGASA-N 0.000 description 1
- LDFWDDVELNOGII-MXAVVETBSA-N His-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N LDFWDDVELNOGII-MXAVVETBSA-N 0.000 description 1
- CKRJBQJIGOEKMC-SRVKXCTJSA-N His-Lys-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O CKRJBQJIGOEKMC-SRVKXCTJSA-N 0.000 description 1
- VDHOMPFVSABJKU-ULQDDVLXSA-N His-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N VDHOMPFVSABJKU-ULQDDVLXSA-N 0.000 description 1
- PYNPBMCLAKTHJL-SRVKXCTJSA-N His-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O PYNPBMCLAKTHJL-SRVKXCTJSA-N 0.000 description 1
- PLCAEMGSYOYIPP-GUBZILKMSA-N His-Ser-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 PLCAEMGSYOYIPP-GUBZILKMSA-N 0.000 description 1
- XHQYFGPIRUHQIB-PBCZWWQYSA-N His-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CN=CN1 XHQYFGPIRUHQIB-PBCZWWQYSA-N 0.000 description 1
- MDOBWSFNSNPENN-PMVVWTBXSA-N His-Thr-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O MDOBWSFNSNPENN-PMVVWTBXSA-N 0.000 description 1
- YSMZBYPVVYSGOT-SZMVWBNQSA-N His-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N YSMZBYPVVYSGOT-SZMVWBNQSA-N 0.000 description 1
- ZNTSGDNUITWTRA-WDSOQIARSA-N His-Trp-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O ZNTSGDNUITWTRA-WDSOQIARSA-N 0.000 description 1
- RNVUQLOKVIPNEM-BZSNNMDCSA-N His-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O RNVUQLOKVIPNEM-BZSNNMDCSA-N 0.000 description 1
- CSTDQOOBZBAJKE-BWAGICSOSA-N His-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N)O CSTDQOOBZBAJKE-BWAGICSOSA-N 0.000 description 1
- FFYYUUWROYYKFY-IHRRRGAJSA-N His-Val-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O FFYYUUWROYYKFY-IHRRRGAJSA-N 0.000 description 1
- 102000004157 Hydrolases Human genes 0.000 description 1
- 108090000604 Hydrolases Proteins 0.000 description 1
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 1
- HLYBGMZJVDHJEO-CYDGBPFRSA-N Ile-Arg-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HLYBGMZJVDHJEO-CYDGBPFRSA-N 0.000 description 1
- SACHLUOUHCVIKI-GMOBBJLQSA-N Ile-Arg-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SACHLUOUHCVIKI-GMOBBJLQSA-N 0.000 description 1
- NULSANWBUWLTKN-NAKRPEOUSA-N Ile-Arg-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N NULSANWBUWLTKN-NAKRPEOUSA-N 0.000 description 1
- ZZHGKECPZXPXJF-PCBIJLKTSA-N Ile-Asn-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZZHGKECPZXPXJF-PCBIJLKTSA-N 0.000 description 1
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 1
- QIHJTGSVGIPHIW-QSFUFRPTSA-N Ile-Asn-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N QIHJTGSVGIPHIW-QSFUFRPTSA-N 0.000 description 1
- IDAHFEPYTJJZFD-PEFMBERDSA-N Ile-Asp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IDAHFEPYTJJZFD-PEFMBERDSA-N 0.000 description 1
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 1
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 1
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 1
- NPROWIBAWYMPAZ-GUDRVLHUSA-N Ile-Asp-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N NPROWIBAWYMPAZ-GUDRVLHUSA-N 0.000 description 1
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 1
- REJKOQYVFDEZHA-SLBDDTMCSA-N Ile-Asp-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N REJKOQYVFDEZHA-SLBDDTMCSA-N 0.000 description 1
- AQTWDZDISVGCAC-CFMVVWHZSA-N Ile-Asp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AQTWDZDISVGCAC-CFMVVWHZSA-N 0.000 description 1
- VCYVLFAWCJRXFT-HJPIBITLSA-N Ile-Cys-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N VCYVLFAWCJRXFT-HJPIBITLSA-N 0.000 description 1
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 1
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 1
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 1
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 1
- IGJWJGIHUFQANP-LAEOZQHASA-N Ile-Gly-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N IGJWJGIHUFQANP-LAEOZQHASA-N 0.000 description 1
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 1
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 1
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 1
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 1
- UAQSZXGJGLHMNV-XEGUGMAKSA-N Ile-Gly-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N UAQSZXGJGLHMNV-XEGUGMAKSA-N 0.000 description 1
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 1
- JLWLMGADIQFKRD-QSFUFRPTSA-N Ile-His-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CN=CN1 JLWLMGADIQFKRD-QSFUFRPTSA-N 0.000 description 1
- VNDQNDYEPSXHLU-JUKXBJQTSA-N Ile-His-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N VNDQNDYEPSXHLU-JUKXBJQTSA-N 0.000 description 1
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 1
- RQQCJTLBSJMVCR-DSYPUSFNSA-N Ile-Leu-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N RQQCJTLBSJMVCR-DSYPUSFNSA-N 0.000 description 1
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 1
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 1
- FFJQAEYLAQMGDL-MGHWNKPDSA-N Ile-Lys-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FFJQAEYLAQMGDL-MGHWNKPDSA-N 0.000 description 1
- RVNOXPZHMUWCLW-GMOBBJLQSA-N Ile-Met-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N RVNOXPZHMUWCLW-GMOBBJLQSA-N 0.000 description 1
- VOCZPDONPURUHV-QEWYBTABSA-N Ile-Phe-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VOCZPDONPURUHV-QEWYBTABSA-N 0.000 description 1
- OWSWUWDMSNXTNE-GMOBBJLQSA-N Ile-Pro-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N OWSWUWDMSNXTNE-GMOBBJLQSA-N 0.000 description 1
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 1
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 1
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 1
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 1
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 1
- HJDZMPFEXINXLO-QPHKQPEJSA-N Ile-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N HJDZMPFEXINXLO-QPHKQPEJSA-N 0.000 description 1
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 1
- PRTZQMBYUZFSFA-XEGUGMAKSA-N Ile-Tyr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)NCC(=O)O)N PRTZQMBYUZFSFA-XEGUGMAKSA-N 0.000 description 1
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 1
- NXRNRBOKDBIVKQ-CXTHYWKRSA-N Ile-Tyr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N NXRNRBOKDBIVKQ-CXTHYWKRSA-N 0.000 description 1
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 1
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 239000007836 KH2PO4 Substances 0.000 description 1
- 241000170280 Kluyveromyces sp. Species 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 1
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 1
- 102100040648 L-fucose kinase Human genes 0.000 description 1
- 101710091950 L-fucose kinase Proteins 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- 241000186660 Lactobacillus Species 0.000 description 1
- 240000001046 Lactobacillus acidophilus Species 0.000 description 1
- 235000013956 Lactobacillus acidophilus Nutrition 0.000 description 1
- 235000013960 Lactobacillus bulgaricus Nutrition 0.000 description 1
- 235000013958 Lactobacillus casei Nutrition 0.000 description 1
- 241000186673 Lactobacillus delbrueckii Species 0.000 description 1
- 240000002605 Lactobacillus helveticus Species 0.000 description 1
- 241001561398 Lactobacillus jensenii Species 0.000 description 1
- 235000013965 Lactobacillus plantarum Nutrition 0.000 description 1
- 241000218588 Lactobacillus rhamnosus Species 0.000 description 1
- 241000186869 Lactobacillus salivarius Species 0.000 description 1
- 241000194036 Lactococcus Species 0.000 description 1
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 1
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 1
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 1
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 1
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 1
- WXHFZJFZWNCDNB-KKUMJFAQSA-N Leu-Asn-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXHFZJFZWNCDNB-KKUMJFAQSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 1
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- NFHJQETXTSDZSI-DCAQKATOSA-N Leu-Cys-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NFHJQETXTSDZSI-DCAQKATOSA-N 0.000 description 1
- HUEBCHPSXSQUGN-GARJFASQSA-N Leu-Cys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N HUEBCHPSXSQUGN-GARJFASQSA-N 0.000 description 1
- DLCXCECTCPKKCD-GUBZILKMSA-N Leu-Gln-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DLCXCECTCPKKCD-GUBZILKMSA-N 0.000 description 1
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 1
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 1
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 1
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 1
- LLBQJYDYOLIQAI-JYJNAYRXSA-N Leu-Glu-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LLBQJYDYOLIQAI-JYJNAYRXSA-N 0.000 description 1
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 1
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 1
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 1
- XQXGNBFMAXWIGI-MXAVVETBSA-N Leu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 XQXGNBFMAXWIGI-MXAVVETBSA-N 0.000 description 1
- WRLPVDVHNWSSCL-MELADBBJSA-N Leu-His-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N WRLPVDVHNWSSCL-MELADBBJSA-N 0.000 description 1
- SGIIOQQGLUUMDQ-IHRRRGAJSA-N Leu-His-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N SGIIOQQGLUUMDQ-IHRRRGAJSA-N 0.000 description 1
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 1
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 1
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 description 1
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 1
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 1
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 1
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 1
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 1
- KXCMQWMNYQOAKA-SRVKXCTJSA-N Leu-Met-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KXCMQWMNYQOAKA-SRVKXCTJSA-N 0.000 description 1
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 1
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 1
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 1
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 1
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 1
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 1
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 1
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 1
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 1
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 1
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 1
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- HOMFINRJHIIZNJ-HOCLYGCPSA-N Leu-Trp-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O HOMFINRJHIIZNJ-HOCLYGCPSA-N 0.000 description 1
- UIIMIKFNIYPDJF-WDSOQIARSA-N Leu-Trp-Met Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CCSC)C(O)=O)NC(=O)[C@@H](N)CC(C)C)=CNC2=C1 UIIMIKFNIYPDJF-WDSOQIARSA-N 0.000 description 1
- LFXSPAIBSZSTEM-PMVMPFDFSA-N Leu-Trp-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O)N LFXSPAIBSZSTEM-PMVMPFDFSA-N 0.000 description 1
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 1
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 1
- MPOHDJKRBLVGCT-CIUDSAMLSA-N Lys-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N MPOHDJKRBLVGCT-CIUDSAMLSA-N 0.000 description 1
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 1
- JCFYLFOCALSNLQ-GUBZILKMSA-N Lys-Ala-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JCFYLFOCALSNLQ-GUBZILKMSA-N 0.000 description 1
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 1
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 1
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 1
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 1
- ALSRJRIWBNENFY-DCAQKATOSA-N Lys-Arg-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O ALSRJRIWBNENFY-DCAQKATOSA-N 0.000 description 1
- BYPMOIFBQPEWOH-CIUDSAMLSA-N Lys-Asn-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BYPMOIFBQPEWOH-CIUDSAMLSA-N 0.000 description 1
- DEFGUIIUYAUEDU-ZPFDUUQYSA-N Lys-Asn-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DEFGUIIUYAUEDU-ZPFDUUQYSA-N 0.000 description 1
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 1
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 1
- KPJJOZUXFOLGMQ-CIUDSAMLSA-N Lys-Asp-Asn Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N KPJJOZUXFOLGMQ-CIUDSAMLSA-N 0.000 description 1
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 description 1
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 1
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 1
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 1
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 1
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 1
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 1
- KZOHPCYVORJBLG-AVGNSLFASA-N Lys-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N KZOHPCYVORJBLG-AVGNSLFASA-N 0.000 description 1
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 1
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 1
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 1
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 1
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 1
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 1
- SQJSXOQXJYAVRV-SRVKXCTJSA-N Lys-His-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N SQJSXOQXJYAVRV-SRVKXCTJSA-N 0.000 description 1
- WOEDRPCHKPSFDT-MXAVVETBSA-N Lys-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N WOEDRPCHKPSFDT-MXAVVETBSA-N 0.000 description 1
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 1
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 1
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 1
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 1
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 1
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 1
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 1
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 1
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 1
- URGPVYGVWLIRGT-DCAQKATOSA-N Lys-Met-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O URGPVYGVWLIRGT-DCAQKATOSA-N 0.000 description 1
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 1
- CRIODIGWCUPXKU-AVGNSLFASA-N Lys-Pro-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O CRIODIGWCUPXKU-AVGNSLFASA-N 0.000 description 1
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 1
- MIROMRNASYKZNL-ULQDDVLXSA-N Lys-Pro-Tyr Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MIROMRNASYKZNL-ULQDDVLXSA-N 0.000 description 1
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 1
- MGKFCQFVPKOWOL-CIUDSAMLSA-N Lys-Ser-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N MGKFCQFVPKOWOL-CIUDSAMLSA-N 0.000 description 1
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 1
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 1
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 1
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 1
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 1
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 1
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 1
- BDFHWFUAQLIMJO-KXNHARMFSA-N Lys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N)O BDFHWFUAQLIMJO-KXNHARMFSA-N 0.000 description 1
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 1
- RMKJOQSYLQQRFN-KKUMJFAQSA-N Lys-Tyr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O RMKJOQSYLQQRFN-KKUMJFAQSA-N 0.000 description 1
- XATKLFSXFINPSB-JYJNAYRXSA-N Lys-Tyr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O XATKLFSXFINPSB-JYJNAYRXSA-N 0.000 description 1
- XYLSGAWRCZECIQ-JYJNAYRXSA-N Lys-Tyr-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 XYLSGAWRCZECIQ-JYJNAYRXSA-N 0.000 description 1
- RQILLQOQXLZTCK-KBPBESRZSA-N Lys-Tyr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O RQILLQOQXLZTCK-KBPBESRZSA-N 0.000 description 1
- WINFHLHJTRGLCV-BZSNNMDCSA-N Lys-Tyr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 WINFHLHJTRGLCV-BZSNNMDCSA-N 0.000 description 1
- VVURYEVJJTXWNE-ULQDDVLXSA-N Lys-Tyr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O VVURYEVJJTXWNE-ULQDDVLXSA-N 0.000 description 1
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 1
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 1
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 1
- 239000006154 MacConkey agar Substances 0.000 description 1
- 108050004064 Major facilitator superfamily Proteins 0.000 description 1
- 102000015841 Major facilitator superfamily Human genes 0.000 description 1
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 1
- MUYQDMBLDFEVRJ-LSJOCFKGSA-N Met-Ala-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 MUYQDMBLDFEVRJ-LSJOCFKGSA-N 0.000 description 1
- ULNXMMYXQKGNPG-LPEHRKFASA-N Met-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N ULNXMMYXQKGNPG-LPEHRKFASA-N 0.000 description 1
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 1
- BVXXDMUMHMXFER-BPNCWPANSA-N Met-Ala-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVXXDMUMHMXFER-BPNCWPANSA-N 0.000 description 1
- CWFYZYQMUDWGTI-GUBZILKMSA-N Met-Arg-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O CWFYZYQMUDWGTI-GUBZILKMSA-N 0.000 description 1
- OBVHKUFUDCPZDW-JYJNAYRXSA-N Met-Arg-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OBVHKUFUDCPZDW-JYJNAYRXSA-N 0.000 description 1
- SBSIKVMCCJUCBZ-GUBZILKMSA-N Met-Asn-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N SBSIKVMCCJUCBZ-GUBZILKMSA-N 0.000 description 1
- DRXODWRPPUFIAY-DCAQKATOSA-N Met-Asn-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN DRXODWRPPUFIAY-DCAQKATOSA-N 0.000 description 1
- HKRYNJSKVLZIFP-IHRRRGAJSA-N Met-Asn-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HKRYNJSKVLZIFP-IHRRRGAJSA-N 0.000 description 1
- KQBJYJXPZBNEIK-DCAQKATOSA-N Met-Glu-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQBJYJXPZBNEIK-DCAQKATOSA-N 0.000 description 1
- PQPMMGQTRQFSDA-SRVKXCTJSA-N Met-Glu-His Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O PQPMMGQTRQFSDA-SRVKXCTJSA-N 0.000 description 1
- YCUSPBPZVJDMII-YUMQZZPRSA-N Met-Gly-Glu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O YCUSPBPZVJDMII-YUMQZZPRSA-N 0.000 description 1
- SXWQMBGNFXAGAT-FJXKBIBVSA-N Met-Gly-Thr Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SXWQMBGNFXAGAT-FJXKBIBVSA-N 0.000 description 1
- BMHIFARYXOJDLD-WPRPVWTQSA-N Met-Gly-Val Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O BMHIFARYXOJDLD-WPRPVWTQSA-N 0.000 description 1
- FTQOFRPGLYXRFM-CYDGBPFRSA-N Met-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCSC)N FTQOFRPGLYXRFM-CYDGBPFRSA-N 0.000 description 1
- JYPITOUIQVSCKM-IHRRRGAJSA-N Met-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCSC)N JYPITOUIQVSCKM-IHRRRGAJSA-N 0.000 description 1
- ZRACLHJYVRBJFC-ULQDDVLXSA-N Met-Lys-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZRACLHJYVRBJFC-ULQDDVLXSA-N 0.000 description 1
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 1
- LHXFNWBNRBWMNV-DCAQKATOSA-N Met-Ser-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LHXFNWBNRBWMNV-DCAQKATOSA-N 0.000 description 1
- QYIGOFGUOVTAHK-ZJDVBMNYSA-N Met-Thr-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QYIGOFGUOVTAHK-ZJDVBMNYSA-N 0.000 description 1
- NSMXRFMGZYTFEX-KJEVXHAQSA-N Met-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCSC)N)O NSMXRFMGZYTFEX-KJEVXHAQSA-N 0.000 description 1
- ZBLSZPYQQRIHQU-RCWTZXSCSA-N Met-Thr-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ZBLSZPYQQRIHQU-RCWTZXSCSA-N 0.000 description 1
- RMLWDZINJUDMEB-IHRRRGAJSA-N Met-Tyr-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N RMLWDZINJUDMEB-IHRRRGAJSA-N 0.000 description 1
- VYDLZDRMOFYOGV-TUAOUCFPSA-N Met-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N VYDLZDRMOFYOGV-TUAOUCFPSA-N 0.000 description 1
- 241000192041 Micrococcus Species 0.000 description 1
- 241000187708 Micromonospora Species 0.000 description 1
- 101100519658 Mus musculus Pfkm gene Proteins 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- OVRNDRQMDRJTHS-FMDGEEDCSA-N N-acetyl-beta-D-glucosamine Chemical compound CC(=O)N[C@H]1[C@H](O)O[C@H](CO)[C@@H](O)[C@@H]1O OVRNDRQMDRJTHS-FMDGEEDCSA-N 0.000 description 1
- 102100023315 N-acetyllactosaminide beta-1,6-N-acetylglucosaminyl-transferase Human genes 0.000 description 1
- 108010056664 N-acetyllactosaminide beta-1,6-N-acetylglucosaminyltransferase Proteins 0.000 description 1
- 108010047562 NGR peptide Proteins 0.000 description 1
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 1
- 241000588650 Neisseria meningitidis Species 0.000 description 1
- 241000080590 Niso Species 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 241000179039 Paenibacillus Species 0.000 description 1
- 241000588701 Pectobacterium carotovorum Species 0.000 description 1
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 1
- CYZBFPYMSJGBRL-DRZSPHRISA-N Phe-Ala-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CYZBFPYMSJGBRL-DRZSPHRISA-N 0.000 description 1
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 1
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 1
- LGBVMDMZZFYSFW-HJWJTTGWSA-N Phe-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CC=CC=C1)N LGBVMDMZZFYSFW-HJWJTTGWSA-N 0.000 description 1
- YYRCPTVAPLQRNC-ULQDDVLXSA-N Phe-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CC1=CC=CC=C1 YYRCPTVAPLQRNC-ULQDDVLXSA-N 0.000 description 1
- ZWJKVFAYPLPCQB-UNQGMJICSA-N Phe-Arg-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O ZWJKVFAYPLPCQB-UNQGMJICSA-N 0.000 description 1
- IWRZUGHCHFZYQZ-UFYCRDLUSA-N Phe-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 IWRZUGHCHFZYQZ-UFYCRDLUSA-N 0.000 description 1
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 1
- HCTXJGRYAACKOB-SRVKXCTJSA-N Phe-Asn-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HCTXJGRYAACKOB-SRVKXCTJSA-N 0.000 description 1
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 description 1
- WIVCOAKLPICYGY-KKUMJFAQSA-N Phe-Asp-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N WIVCOAKLPICYGY-KKUMJFAQSA-N 0.000 description 1
- DJPXNKUDJKGQEE-BZSNNMDCSA-N Phe-Asp-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DJPXNKUDJKGQEE-BZSNNMDCSA-N 0.000 description 1
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 1
- SWZKMTDPQXLQRD-XVSYOHENSA-N Phe-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWZKMTDPQXLQRD-XVSYOHENSA-N 0.000 description 1
- UMKYAYXCMYYNHI-AVGNSLFASA-N Phe-Gln-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N UMKYAYXCMYYNHI-AVGNSLFASA-N 0.000 description 1
- RJYBHZVWJPUSLB-QEWYBTABSA-N Phe-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N RJYBHZVWJPUSLB-QEWYBTABSA-N 0.000 description 1
- WYPVCIACUMJRIB-JYJNAYRXSA-N Phe-Gln-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N WYPVCIACUMJRIB-JYJNAYRXSA-N 0.000 description 1
- RLUMIJXNHJVUCO-JBACZVJFSA-N Phe-Gln-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 RLUMIJXNHJVUCO-JBACZVJFSA-N 0.000 description 1
- LWPMGKSZPKFKJD-DZKIICNBSA-N Phe-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O LWPMGKSZPKFKJD-DZKIICNBSA-N 0.000 description 1
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 1
- HGNGAMWHGGANAU-WHOFXGATSA-N Phe-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HGNGAMWHGGANAU-WHOFXGATSA-N 0.000 description 1
- MJQFZGOIVBDIMZ-WHOFXGATSA-N Phe-Ile-Gly Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O MJQFZGOIVBDIMZ-WHOFXGATSA-N 0.000 description 1
- KPEIBEPEUAZWNS-ULQDDVLXSA-N Phe-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KPEIBEPEUAZWNS-ULQDDVLXSA-N 0.000 description 1
- MSHZERMPZKCODG-ACRUOGEOSA-N Phe-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MSHZERMPZKCODG-ACRUOGEOSA-N 0.000 description 1
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 1
- ZIQQNOXKEFDPBE-BZSNNMDCSA-N Phe-Lys-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N ZIQQNOXKEFDPBE-BZSNNMDCSA-N 0.000 description 1
- DOXQMJCSSYZSNM-BZSNNMDCSA-N Phe-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O DOXQMJCSSYZSNM-BZSNNMDCSA-N 0.000 description 1
- YOFKMVUAZGPFCF-IHRRRGAJSA-N Phe-Met-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O YOFKMVUAZGPFCF-IHRRRGAJSA-N 0.000 description 1
- CBENHWCORLVGEQ-HJOGWXRNSA-N Phe-Phe-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CBENHWCORLVGEQ-HJOGWXRNSA-N 0.000 description 1
- MGLBSROLWAWCKN-FCLVOEFKSA-N Phe-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MGLBSROLWAWCKN-FCLVOEFKSA-N 0.000 description 1
- XOHJOMKCRLHGCY-UNQGMJICSA-N Phe-Pro-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOHJOMKCRLHGCY-UNQGMJICSA-N 0.000 description 1
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 1
- JXQVYPWVGUOIDV-MXAVVETBSA-N Phe-Ser-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JXQVYPWVGUOIDV-MXAVVETBSA-N 0.000 description 1
- MRWOVVNKSXXLRP-IHPCNDPISA-N Phe-Ser-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O MRWOVVNKSXXLRP-IHPCNDPISA-N 0.000 description 1
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 1
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- MSSXKZBDKZAHCX-UNQGMJICSA-N Phe-Thr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O MSSXKZBDKZAHCX-UNQGMJICSA-N 0.000 description 1
- BPIFSOUEUYDJRM-DCPHZVHLSA-N Phe-Trp-Ala Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](C)C(O)=O)C1=CC=CC=C1 BPIFSOUEUYDJRM-DCPHZVHLSA-N 0.000 description 1
- ZYNBEWGJFXTBDU-ACRUOGEOSA-N Phe-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N ZYNBEWGJFXTBDU-ACRUOGEOSA-N 0.000 description 1
- ZOGICTVLQDWPER-UFYCRDLUSA-N Phe-Tyr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O ZOGICTVLQDWPER-UFYCRDLUSA-N 0.000 description 1
- JSGWNFKWZNPDAV-YDHLFZDLSA-N Phe-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JSGWNFKWZNPDAV-YDHLFZDLSA-N 0.000 description 1
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 1
- GNZCMRRSXOBHLC-JYJNAYRXSA-N Phe-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N GNZCMRRSXOBHLC-JYJNAYRXSA-N 0.000 description 1
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 1
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 1
- GAMLAXHLYGLQBJ-UFYCRDLUSA-N Phe-Val-Tyr Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC1=CC=C(C=C1)O)C(C)C)CC1=CC=CC=C1 GAMLAXHLYGLQBJ-UFYCRDLUSA-N 0.000 description 1
- 102000004160 Phosphoric Monoester Hydrolases Human genes 0.000 description 1
- 108090000608 Phosphoric Monoester Hydrolases Proteins 0.000 description 1
- 241000235648 Pichia Species 0.000 description 1
- 241000235061 Pichia sp. Species 0.000 description 1
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 1
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 1
- SSSFPISOZOLQNP-GUBZILKMSA-N Pro-Arg-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSFPISOZOLQNP-GUBZILKMSA-N 0.000 description 1
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 1
- UVKNEILZSJMKSR-FXQIFTODSA-N Pro-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 UVKNEILZSJMKSR-FXQIFTODSA-N 0.000 description 1
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 1
- SBYVDRLQAGENMY-DCAQKATOSA-N Pro-Asn-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O SBYVDRLQAGENMY-DCAQKATOSA-N 0.000 description 1
- FUVBEZJCRMHWEM-FXQIFTODSA-N Pro-Asn-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FUVBEZJCRMHWEM-FXQIFTODSA-N 0.000 description 1
- UTAUEDINXUMHLG-FXQIFTODSA-N Pro-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 UTAUEDINXUMHLG-FXQIFTODSA-N 0.000 description 1
- CJZTUKSFZUSNCC-FXQIFTODSA-N Pro-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 CJZTUKSFZUSNCC-FXQIFTODSA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- GDXZRWYXJSGWIV-GMOBBJLQSA-N Pro-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 GDXZRWYXJSGWIV-GMOBBJLQSA-N 0.000 description 1
- DEDANIDYQAPTFI-IHRRRGAJSA-N Pro-Asp-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DEDANIDYQAPTFI-IHRRRGAJSA-N 0.000 description 1
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 1
- LANQLYHLMYDWJP-SRVKXCTJSA-N Pro-Gln-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O LANQLYHLMYDWJP-SRVKXCTJSA-N 0.000 description 1
- SKICPQLTOXGWGO-GARJFASQSA-N Pro-Gln-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O SKICPQLTOXGWGO-GARJFASQSA-N 0.000 description 1
- VDGTVWFMRXVQCT-GUBZILKMSA-N Pro-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 VDGTVWFMRXVQCT-GUBZILKMSA-N 0.000 description 1
- VPFGPKIWSDVTOY-SRVKXCTJSA-N Pro-Glu-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O VPFGPKIWSDVTOY-SRVKXCTJSA-N 0.000 description 1
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 1
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 1
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 1
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 1
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 1
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 1
- KWMUAKQOVYCQJQ-ZPFDUUQYSA-N Pro-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 KWMUAKQOVYCQJQ-ZPFDUUQYSA-N 0.000 description 1
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 description 1
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 1
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 1
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 1
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 1
- SXMSEHDMNIUTSP-DCAQKATOSA-N Pro-Lys-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SXMSEHDMNIUTSP-DCAQKATOSA-N 0.000 description 1
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 1
- WOIFYRZPIORBRY-AVGNSLFASA-N Pro-Lys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WOIFYRZPIORBRY-AVGNSLFASA-N 0.000 description 1
- KLOQCCRTPHPIFN-DCAQKATOSA-N Pro-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 KLOQCCRTPHPIFN-DCAQKATOSA-N 0.000 description 1
- MLKVIVZCFYRTIR-KKUMJFAQSA-N Pro-Phe-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLKVIVZCFYRTIR-KKUMJFAQSA-N 0.000 description 1
- JIWJRKNYLSHONY-KKUMJFAQSA-N Pro-Phe-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JIWJRKNYLSHONY-KKUMJFAQSA-N 0.000 description 1
- AJBQTGZIZQXBLT-STQMWFEESA-N Pro-Phe-Gly Chemical compound C([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 AJBQTGZIZQXBLT-STQMWFEESA-N 0.000 description 1
- JLMZKEQFMVORMA-SRVKXCTJSA-N Pro-Pro-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 JLMZKEQFMVORMA-SRVKXCTJSA-N 0.000 description 1
- NAIPAPCKKRCMBL-JYJNAYRXSA-N Pro-Pro-Phe Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=CC=C1 NAIPAPCKKRCMBL-JYJNAYRXSA-N 0.000 description 1
- UGDMQJSXSSZUKL-IHRRRGAJSA-N Pro-Ser-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O UGDMQJSXSSZUKL-IHRRRGAJSA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 1
- JDJMFMVVJHLWDP-UNQGMJICSA-N Pro-Thr-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JDJMFMVVJHLWDP-UNQGMJICSA-N 0.000 description 1
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 1
- BNUKRHFCHHLIGR-JYJNAYRXSA-N Pro-Trp-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CC(=O)O)C(=O)O BNUKRHFCHHLIGR-JYJNAYRXSA-N 0.000 description 1
- GNFHQWNCSSPOBT-ULQDDVLXSA-N Pro-Trp-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CCC(=O)N)C(=O)O GNFHQWNCSSPOBT-ULQDDVLXSA-N 0.000 description 1
- DMNANGOFEUVBRV-GJZGRUSLSA-N Pro-Trp-Gly Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(=O)O)C(=O)[C@@H]1CCCN1 DMNANGOFEUVBRV-GJZGRUSLSA-N 0.000 description 1
- VVEQUISRWJDGMX-VKOGCVSHSA-N Pro-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 VVEQUISRWJDGMX-VKOGCVSHSA-N 0.000 description 1
- ZAUHSLVPDLNTRZ-QXEWZRGKSA-N Pro-Val-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZAUHSLVPDLNTRZ-QXEWZRGKSA-N 0.000 description 1
- DGDCSVGVWWAJRS-AVGNSLFASA-N Pro-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 DGDCSVGVWWAJRS-AVGNSLFASA-N 0.000 description 1
- 241000589517 Pseudomonas aeruginosa Species 0.000 description 1
- 108010079005 RDV peptide Proteins 0.000 description 1
- 108010052388 RGES peptide Proteins 0.000 description 1
- 102000009572 RNA Polymerase II Human genes 0.000 description 1
- 108010009460 RNA Polymerase II Proteins 0.000 description 1
- 241000316848 Rhodococcus <scale insect> Species 0.000 description 1
- 241001030146 Rhodotorula sp. Species 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 1
- 241000235088 Saccharomyces sp. Species 0.000 description 1
- 241001360381 Saccharomycopsis sp. Species 0.000 description 1
- 241001138501 Salmonella enterica Species 0.000 description 1
- 241000720795 Schizosaccharomyces sp. Species 0.000 description 1
- FIXILCYTSAUERA-FXQIFTODSA-N Ser-Ala-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIXILCYTSAUERA-FXQIFTODSA-N 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 1
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 1
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 1
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 1
- PZZJMBYSYAKYPK-UWJYBYFXSA-N Ser-Ala-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PZZJMBYSYAKYPK-UWJYBYFXSA-N 0.000 description 1
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 1
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 1
- JJKSSJVYOVRJMZ-FXQIFTODSA-N Ser-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)CN=C(N)N JJKSSJVYOVRJMZ-FXQIFTODSA-N 0.000 description 1
- YUSRGTQIPCJNHQ-CIUDSAMLSA-N Ser-Arg-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YUSRGTQIPCJNHQ-CIUDSAMLSA-N 0.000 description 1
- VQBLHWSPVYYZTB-DCAQKATOSA-N Ser-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N VQBLHWSPVYYZTB-DCAQKATOSA-N 0.000 description 1
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 1
- WXUBSIDKNMFAGS-IHRRRGAJSA-N Ser-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXUBSIDKNMFAGS-IHRRRGAJSA-N 0.000 description 1
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 1
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 1
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 1
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 1
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 1
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 1
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 1
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 1
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 1
- KJMOINFQVCCSDX-XKBZYTNZSA-N Ser-Gln-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KJMOINFQVCCSDX-XKBZYTNZSA-N 0.000 description 1
- VDVYTKZBMFADQH-AVGNSLFASA-N Ser-Gln-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 VDVYTKZBMFADQH-AVGNSLFASA-N 0.000 description 1
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 1
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 1
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 1
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 1
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- SVWQEIRZHHNBIO-WHFBIAKZSA-N Ser-Gly-Cys Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CS)C(O)=O SVWQEIRZHHNBIO-WHFBIAKZSA-N 0.000 description 1
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 1
- IXCHOHLPHNGFTJ-YUMQZZPRSA-N Ser-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N IXCHOHLPHNGFTJ-YUMQZZPRSA-N 0.000 description 1
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 1
- CXBFHZLODKPIJY-AAEUAGOBSA-N Ser-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N CXBFHZLODKPIJY-AAEUAGOBSA-N 0.000 description 1
- MOQDPPUMFSMYOM-KKUMJFAQSA-N Ser-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CO)N MOQDPPUMFSMYOM-KKUMJFAQSA-N 0.000 description 1
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 1
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 1
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 1
- LWMQRHDTXHQQOV-MXAVVETBSA-N Ser-Ile-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LWMQRHDTXHQQOV-MXAVVETBSA-N 0.000 description 1
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 1
- GVIGVIOEYBOTCB-XIRDDKMYSA-N Ser-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC(C)C)C(O)=O)=CNC2=C1 GVIGVIOEYBOTCB-XIRDDKMYSA-N 0.000 description 1
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 1
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 1
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 1
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 1
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- XNXRTQZTFVMJIJ-DCAQKATOSA-N Ser-Met-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNXRTQZTFVMJIJ-DCAQKATOSA-N 0.000 description 1
- RXSWQCATLWVDLI-XGEHTFHBSA-N Ser-Met-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RXSWQCATLWVDLI-XGEHTFHBSA-N 0.000 description 1
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 1
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 1
- RWDVVSKYZBNDCO-MELADBBJSA-N Ser-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CO)N)C(=O)O RWDVVSKYZBNDCO-MELADBBJSA-N 0.000 description 1
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 1
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- WNDUPCKKKGSKIQ-CIUDSAMLSA-N Ser-Pro-Gln Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O WNDUPCKKKGSKIQ-CIUDSAMLSA-N 0.000 description 1
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 1
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 1
- DINQYZRMXGWWTG-GUBZILKMSA-N Ser-Pro-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DINQYZRMXGWWTG-GUBZILKMSA-N 0.000 description 1
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 1
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 1
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 1
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 1
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- DYEGLQRVMBWQLD-IXOXFDKPSA-N Ser-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CO)N)O DYEGLQRVMBWQLD-IXOXFDKPSA-N 0.000 description 1
- STIAINRLUUKYKM-WFBYXXMGSA-N Ser-Trp-Ala Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CO)=CNC2=C1 STIAINRLUUKYKM-WFBYXXMGSA-N 0.000 description 1
- BCAVNDNYOGTQMQ-AAEUAGOBSA-N Ser-Trp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O BCAVNDNYOGTQMQ-AAEUAGOBSA-N 0.000 description 1
- XTWXRUWACCXBMU-XIRDDKMYSA-N Ser-Trp-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)NC(=O)[C@H](CO)N XTWXRUWACCXBMU-XIRDDKMYSA-N 0.000 description 1
- ZWSZBWAFDZRBNM-UBHSHLNASA-N Ser-Trp-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ZWSZBWAFDZRBNM-UBHSHLNASA-N 0.000 description 1
- FGBLCMLXHRPVOF-IHRRRGAJSA-N Ser-Tyr-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FGBLCMLXHRPVOF-IHRRRGAJSA-N 0.000 description 1
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 1
- PLQWGQUNUPMNOD-KKUMJFAQSA-N Ser-Tyr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PLQWGQUNUPMNOD-KKUMJFAQSA-N 0.000 description 1
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- IAOHCSQDQDWRQU-GUBZILKMSA-N Ser-Val-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IAOHCSQDQDWRQU-GUBZILKMSA-N 0.000 description 1
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 241000204117 Sporolactobacillus Species 0.000 description 1
- 241000194017 Streptococcus Species 0.000 description 1
- 244000057717 Streptococcus lactis Species 0.000 description 1
- 101710161145 Sugar efflux transporter Proteins 0.000 description 1
- 108700026226 TATA Box Proteins 0.000 description 1
- 241000520244 Tatumella citrea Species 0.000 description 1
- 102000002933 Thioredoxin Human genes 0.000 description 1
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 1
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 1
- DGDCHPCRMWEOJR-FQPOAREZSA-N Thr-Ala-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DGDCHPCRMWEOJR-FQPOAREZSA-N 0.000 description 1
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 1
- XYEXCEPTALHNEV-RCWTZXSCSA-N Thr-Arg-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XYEXCEPTALHNEV-RCWTZXSCSA-N 0.000 description 1
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 1
- JNQZPAWOPBZGIX-RCWTZXSCSA-N Thr-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N JNQZPAWOPBZGIX-RCWTZXSCSA-N 0.000 description 1
- VIBXMCZWVUOZLA-OLHMAJIHSA-N Thr-Asn-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VIBXMCZWVUOZLA-OLHMAJIHSA-N 0.000 description 1
- QGXCWPNQVCYJEL-NUMRIWBASA-N Thr-Asn-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGXCWPNQVCYJEL-NUMRIWBASA-N 0.000 description 1
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 1
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 1
- JTEICXDKGWKRRV-HJGDQZAQSA-N Thr-Asn-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JTEICXDKGWKRRV-HJGDQZAQSA-N 0.000 description 1
- LXWZOMSOUAMOIA-JIOCBJNQSA-N Thr-Asn-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O LXWZOMSOUAMOIA-JIOCBJNQSA-N 0.000 description 1
- JVTHIXKSVYEWNI-JRQIVUDYSA-N Thr-Asn-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JVTHIXKSVYEWNI-JRQIVUDYSA-N 0.000 description 1
- NLJKZUGAIIRWJN-LKXGYXEUSA-N Thr-Asp-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O NLJKZUGAIIRWJN-LKXGYXEUSA-N 0.000 description 1
- GKMYGVQDGVYCPC-IUKAMOBKSA-N Thr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H]([C@@H](C)O)N GKMYGVQDGVYCPC-IUKAMOBKSA-N 0.000 description 1
- KRPKYGOFYUNIGM-XVSYOHENSA-N Thr-Asp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O KRPKYGOFYUNIGM-XVSYOHENSA-N 0.000 description 1
- DCLBXIWHLVEPMQ-JRQIVUDYSA-N Thr-Asp-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DCLBXIWHLVEPMQ-JRQIVUDYSA-N 0.000 description 1
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 1
- UHBPFYOQQPFKQR-JHEQGTHGSA-N Thr-Gln-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UHBPFYOQQPFKQR-JHEQGTHGSA-N 0.000 description 1
- XXNLGZRRSKPSGF-HTUGSXCWSA-N Thr-Gln-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O XXNLGZRRSKPSGF-HTUGSXCWSA-N 0.000 description 1
- CQNFRKAKGDSJFR-NUMRIWBASA-N Thr-Glu-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CQNFRKAKGDSJFR-NUMRIWBASA-N 0.000 description 1
- WDFPMSHYMRBLKM-NKIYYHGXSA-N Thr-Glu-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O WDFPMSHYMRBLKM-NKIYYHGXSA-N 0.000 description 1
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 1
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 1
- VULNJDORNLBPNG-SWRJLBSHSA-N Thr-Glu-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VULNJDORNLBPNG-SWRJLBSHSA-N 0.000 description 1
- LKEKWDJCJSPXNI-IRIUXVKKSA-N Thr-Glu-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LKEKWDJCJSPXNI-IRIUXVKKSA-N 0.000 description 1
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 1
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 1
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 1
- ZTPXSEUVYNNZRB-CDMKHQONSA-N Thr-Gly-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZTPXSEUVYNNZRB-CDMKHQONSA-N 0.000 description 1
- YSXYEJWDHBCTDJ-DVJZZOLTSA-N Thr-Gly-Trp Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O YSXYEJWDHBCTDJ-DVJZZOLTSA-N 0.000 description 1
- JQAWYCUUFIMTHE-WLTAIBSBSA-N Thr-Gly-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JQAWYCUUFIMTHE-WLTAIBSBSA-N 0.000 description 1
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 description 1
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 1
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 1
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 1
- ISLDRLHVPXABBC-IEGACIPQSA-N Thr-Leu-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ISLDRLHVPXABBC-IEGACIPQSA-N 0.000 description 1
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 1
- QNCFWHZVRNXAKW-OEAJRASXSA-N Thr-Lys-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNCFWHZVRNXAKW-OEAJRASXSA-N 0.000 description 1
- XSEPSRUDSPHMPX-KATARQTJSA-N Thr-Lys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O XSEPSRUDSPHMPX-KATARQTJSA-N 0.000 description 1
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 1
- UJQVSMNQMQHVRY-KZVJFYERSA-N Thr-Met-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UJQVSMNQMQHVRY-KZVJFYERSA-N 0.000 description 1
- QFEYTTHKPSOFLV-OSUNSFLBSA-N Thr-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H]([C@@H](C)O)N QFEYTTHKPSOFLV-OSUNSFLBSA-N 0.000 description 1
- KPNSNVTUVKSBFL-ZJDVBMNYSA-N Thr-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KPNSNVTUVKSBFL-ZJDVBMNYSA-N 0.000 description 1
- WVVOFCVMHAXGLE-LFSVMHDDSA-N Thr-Phe-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O WVVOFCVMHAXGLE-LFSVMHDDSA-N 0.000 description 1
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 description 1
- VEIKMWOMUYMMMK-FCLVOEFKSA-N Thr-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VEIKMWOMUYMMMK-FCLVOEFKSA-N 0.000 description 1
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 1
- NYQIZWROIMIQSL-VEVYYDQMSA-N Thr-Pro-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O NYQIZWROIMIQSL-VEVYYDQMSA-N 0.000 description 1
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 1
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 1
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 1
- PRTHQBSMXILLPC-XGEHTFHBSA-N Thr-Ser-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PRTHQBSMXILLPC-XGEHTFHBSA-N 0.000 description 1
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 1
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 1
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 1
- BCYUHPXBHCUYBA-CUJWVEQBSA-N Thr-Ser-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BCYUHPXBHCUYBA-CUJWVEQBSA-N 0.000 description 1
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 1
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 1
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 1
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 1
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 1
- QYDKSNXSBXZPFK-ZJDVBMNYSA-N Thr-Thr-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYDKSNXSBXZPFK-ZJDVBMNYSA-N 0.000 description 1
- AAZOYLQUEQRUMZ-GSSVUCPTSA-N Thr-Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O AAZOYLQUEQRUMZ-GSSVUCPTSA-N 0.000 description 1
- MFMGPEKYBXFIRF-SUSMZKCASA-N Thr-Thr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFMGPEKYBXFIRF-SUSMZKCASA-N 0.000 description 1
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 1
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 1
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 1
- CSZFFQBUTMGHAH-UAXMHLISSA-N Thr-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O CSZFFQBUTMGHAH-UAXMHLISSA-N 0.000 description 1
- BJJRNAVDQGREGC-HOUAVDHOSA-N Thr-Trp-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O BJJRNAVDQGREGC-HOUAVDHOSA-N 0.000 description 1
- MYNYCUXMIIWUNW-IEGACIPQSA-N Thr-Trp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MYNYCUXMIIWUNW-IEGACIPQSA-N 0.000 description 1
- IJKNKFJZOJCKRR-GBALPHGKSA-N Thr-Trp-Ser Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 IJKNKFJZOJCKRR-GBALPHGKSA-N 0.000 description 1
- LXXCHJKHJYRMIY-FQPOAREZSA-N Thr-Tyr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O LXXCHJKHJYRMIY-FQPOAREZSA-N 0.000 description 1
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 1
- LVRFMARKDGGZMX-IZPVPAKOSA-N Thr-Tyr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=C(O)C=C1 LVRFMARKDGGZMX-IZPVPAKOSA-N 0.000 description 1
- YOPQYBJJNSIQGZ-JNPHEJMOSA-N Thr-Tyr-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 YOPQYBJJNSIQGZ-JNPHEJMOSA-N 0.000 description 1
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 1
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 1
- AXEJRUGTOJPZKG-XGEHTFHBSA-N Thr-Val-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N)O AXEJRUGTOJPZKG-XGEHTFHBSA-N 0.000 description 1
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 1
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 1
- BTAJAOWZCWOHBU-HSHDSVGOSA-N Thr-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)C(C)C)C(O)=O)=CNC2=C1 BTAJAOWZCWOHBU-HSHDSVGOSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- 108010022394 Threonine synthase Proteins 0.000 description 1
- QAXCHNZDPLSFPC-PJODQICGSA-N Trp-Ala-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QAXCHNZDPLSFPC-PJODQICGSA-N 0.000 description 1
- BDWDMRSGCXEDMR-WFBYXXMGSA-N Trp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N BDWDMRSGCXEDMR-WFBYXXMGSA-N 0.000 description 1
- CXUFDWZBHKUGKK-CABZTGNLSA-N Trp-Ala-Gly Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O)=CNC2=C1 CXUFDWZBHKUGKK-CABZTGNLSA-N 0.000 description 1
- ADBFWLXCCKIXBQ-XIRDDKMYSA-N Trp-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ADBFWLXCCKIXBQ-XIRDDKMYSA-N 0.000 description 1
- WEAPHMIKOICYAU-QEJZJMRPSA-N Trp-Cys-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O WEAPHMIKOICYAU-QEJZJMRPSA-N 0.000 description 1
- BORCDLUWGBGTKL-XIRDDKMYSA-N Trp-Gln-Met Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O)=CNC2=C1 BORCDLUWGBGTKL-XIRDDKMYSA-N 0.000 description 1
- CZWIHKFGHICAJX-BPUTZDHNSA-N Trp-Glu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 CZWIHKFGHICAJX-BPUTZDHNSA-N 0.000 description 1
- HXNVJPQADLRHGR-JBACZVJFSA-N Trp-Glu-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N HXNVJPQADLRHGR-JBACZVJFSA-N 0.000 description 1
- FNOQJVHFVLVMOS-AAEUAGOBSA-N Trp-Gly-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N FNOQJVHFVLVMOS-AAEUAGOBSA-N 0.000 description 1
- BEWOXKJJMBKRQL-AAEUAGOBSA-N Trp-Gly-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N BEWOXKJJMBKRQL-AAEUAGOBSA-N 0.000 description 1
- WVHUFSCKCBQKJW-HKUYNNGSSA-N Trp-Gly-Tyr Chemical compound C([C@H](NC(=O)CNC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 WVHUFSCKCBQKJW-HKUYNNGSSA-N 0.000 description 1
- HLDFBNPSURDYEN-VHWLVUOQSA-N Trp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N HLDFBNPSURDYEN-VHWLVUOQSA-N 0.000 description 1
- AIISTODACBDQLW-WDSOQIARSA-N Trp-Leu-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 AIISTODACBDQLW-WDSOQIARSA-N 0.000 description 1
- OGZRZMJASKKMJZ-XIRDDKMYSA-N Trp-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N OGZRZMJASKKMJZ-XIRDDKMYSA-N 0.000 description 1
- GWBWCGITOYODER-YTQUADARSA-N Trp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GWBWCGITOYODER-YTQUADARSA-N 0.000 description 1
- NWQCKAPDGQMZQN-IHPCNDPISA-N Trp-Lys-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O NWQCKAPDGQMZQN-IHPCNDPISA-N 0.000 description 1
- GRSCONMARGNYHA-PMVMPFDFSA-N Trp-Lys-Phe Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GRSCONMARGNYHA-PMVMPFDFSA-N 0.000 description 1
- GQEXFCQNAJHJTI-IHPCNDPISA-N Trp-Phe-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GQEXFCQNAJHJTI-IHPCNDPISA-N 0.000 description 1
- XGFOXYJQBRTJPO-PJODQICGSA-N Trp-Pro-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XGFOXYJQBRTJPO-PJODQICGSA-N 0.000 description 1
- BIBZRFIKOLGWFQ-XIRDDKMYSA-N Trp-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O BIBZRFIKOLGWFQ-XIRDDKMYSA-N 0.000 description 1
- IKUMWSDCGQVGHC-UMPQAUOISA-N Trp-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)O IKUMWSDCGQVGHC-UMPQAUOISA-N 0.000 description 1
- IQIRAJGHFRVFEL-UBHSHLNASA-N Trp-Ser-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N IQIRAJGHFRVFEL-UBHSHLNASA-N 0.000 description 1
- KXFYAQUYJKOQMI-QEJZJMRPSA-N Trp-Ser-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 KXFYAQUYJKOQMI-QEJZJMRPSA-N 0.000 description 1
- UIRPULWLRODAEQ-QEJZJMRPSA-N Trp-Ser-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 UIRPULWLRODAEQ-QEJZJMRPSA-N 0.000 description 1
- GEGYPBOPIGNZIF-CWRNSKLLSA-N Trp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O GEGYPBOPIGNZIF-CWRNSKLLSA-N 0.000 description 1
- GSCPHMSPGQSZJT-JYBASQMISA-N Trp-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O GSCPHMSPGQSZJT-JYBASQMISA-N 0.000 description 1
- DDHFMBDACJYSKW-AQZXSJQPSA-N Trp-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O DDHFMBDACJYSKW-AQZXSJQPSA-N 0.000 description 1
- YCQKQFKXBPJXRY-PMVMPFDFSA-N Trp-Tyr-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)N[C@@H](CCCCN)C(=O)O)N YCQKQFKXBPJXRY-PMVMPFDFSA-N 0.000 description 1
- UIRVSEPRMWDVEW-RNXOBYDBSA-N Trp-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N UIRVSEPRMWDVEW-RNXOBYDBSA-N 0.000 description 1
- ICPRIGUXAFULPH-ILWGZMRPSA-N Trp-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N)C(=O)O ICPRIGUXAFULPH-ILWGZMRPSA-N 0.000 description 1
- SSSDKJMQMZTMJP-BVSLBCMMSA-N Trp-Tyr-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=C(O)C=C1 SSSDKJMQMZTMJP-BVSLBCMMSA-N 0.000 description 1
- RKISDJMICOREEL-QRTARXTBSA-N Trp-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RKISDJMICOREEL-QRTARXTBSA-N 0.000 description 1
- AKFLVKKWVZMFOT-IHRRRGAJSA-N Tyr-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AKFLVKKWVZMFOT-IHRRRGAJSA-N 0.000 description 1
- XHALUUQSNXSPLP-UFYCRDLUSA-N Tyr-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XHALUUQSNXSPLP-UFYCRDLUSA-N 0.000 description 1
- PZXUIGWOEWWFQM-SRVKXCTJSA-N Tyr-Asn-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O PZXUIGWOEWWFQM-SRVKXCTJSA-N 0.000 description 1
- MBFJIHUHHCJBSN-AVGNSLFASA-N Tyr-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MBFJIHUHHCJBSN-AVGNSLFASA-N 0.000 description 1
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 1
- VTFWAGGJDRSQFG-MELADBBJSA-N Tyr-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O VTFWAGGJDRSQFG-MELADBBJSA-N 0.000 description 1
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 1
- BARBHMSSVWPKPZ-IHRRRGAJSA-N Tyr-Asp-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BARBHMSSVWPKPZ-IHRRRGAJSA-N 0.000 description 1
- JWHOIHCOHMZSAR-QWRGUYRKSA-N Tyr-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JWHOIHCOHMZSAR-QWRGUYRKSA-N 0.000 description 1
- QHEGAOPHISYNDF-XDTLVQLUSA-N Tyr-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHEGAOPHISYNDF-XDTLVQLUSA-N 0.000 description 1
- UXUFNBVCPAWACG-SIUGBPQLSA-N Tyr-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N UXUFNBVCPAWACG-SIUGBPQLSA-N 0.000 description 1
- WVRUKYLYMFGKAN-IHRRRGAJSA-N Tyr-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 WVRUKYLYMFGKAN-IHRRRGAJSA-N 0.000 description 1
- NZFCWALTLNFHHC-JYJNAYRXSA-N Tyr-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NZFCWALTLNFHHC-JYJNAYRXSA-N 0.000 description 1
- FMOSEWZYZPMJAL-KKUMJFAQSA-N Tyr-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N FMOSEWZYZPMJAL-KKUMJFAQSA-N 0.000 description 1
- CNLKDWSAORJEMW-KWQFWETISA-N Tyr-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O CNLKDWSAORJEMW-KWQFWETISA-N 0.000 description 1
- PMDWYLVWHRTJIW-STQMWFEESA-N Tyr-Gly-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PMDWYLVWHRTJIW-STQMWFEESA-N 0.000 description 1
- LTSIAOZUVISRAQ-QWRGUYRKSA-N Tyr-Gly-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)O LTSIAOZUVISRAQ-QWRGUYRKSA-N 0.000 description 1
- GIOBXJSONRQHKQ-RYUDHWBXSA-N Tyr-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GIOBXJSONRQHKQ-RYUDHWBXSA-N 0.000 description 1
- FNWGDMZVYBVAGJ-XEGUGMAKSA-N Tyr-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CC=C(C=C1)O)N FNWGDMZVYBVAGJ-XEGUGMAKSA-N 0.000 description 1
- NOOMDULIORCDNF-IRXDYDNUSA-N Tyr-Gly-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NOOMDULIORCDNF-IRXDYDNUSA-N 0.000 description 1
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 1
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 1
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 1
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 1
- YKCXQOBTISTQJD-BZSNNMDCSA-N Tyr-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YKCXQOBTISTQJD-BZSNNMDCSA-N 0.000 description 1
- QHLIUFUEUDFAOT-MGHWNKPDSA-N Tyr-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHLIUFUEUDFAOT-MGHWNKPDSA-N 0.000 description 1
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 1
- WDGDKHLSDIOXQC-ACRUOGEOSA-N Tyr-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WDGDKHLSDIOXQC-ACRUOGEOSA-N 0.000 description 1
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 1
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 1
- FMXFHNSFABRVFZ-BZSNNMDCSA-N Tyr-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FMXFHNSFABRVFZ-BZSNNMDCSA-N 0.000 description 1
- CNNVVEPJTFOGHI-ACRUOGEOSA-N Tyr-Lys-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNNVVEPJTFOGHI-ACRUOGEOSA-N 0.000 description 1
- PMHLLBKTDHQMCY-ULQDDVLXSA-N Tyr-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMHLLBKTDHQMCY-ULQDDVLXSA-N 0.000 description 1
- UBKKNELWDCBNCF-STQMWFEESA-N Tyr-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UBKKNELWDCBNCF-STQMWFEESA-N 0.000 description 1
- LMKKMCGTDANZTR-BZSNNMDCSA-N Tyr-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LMKKMCGTDANZTR-BZSNNMDCSA-N 0.000 description 1
- RGYCVIZZTUBSSG-JYJNAYRXSA-N Tyr-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O RGYCVIZZTUBSSG-JYJNAYRXSA-N 0.000 description 1
- VYQQQIRHIFALGE-UWJYBYFXSA-N Tyr-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VYQQQIRHIFALGE-UWJYBYFXSA-N 0.000 description 1
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 1
- NHOVZGFNTGMYMI-KKUMJFAQSA-N Tyr-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NHOVZGFNTGMYMI-KKUMJFAQSA-N 0.000 description 1
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 1
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 1
- XUIOBCQESNDTDE-FQPOAREZSA-N Tyr-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XUIOBCQESNDTDE-FQPOAREZSA-N 0.000 description 1
- ITDWWLTTWRRLCC-KJEVXHAQSA-N Tyr-Thr-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ITDWWLTTWRRLCC-KJEVXHAQSA-N 0.000 description 1
- UUBKSZNKJUJQEJ-JRQIVUDYSA-N Tyr-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UUBKSZNKJUJQEJ-JRQIVUDYSA-N 0.000 description 1
- NZBSVMQZQMEUHI-WZLNRYEVSA-N Tyr-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NZBSVMQZQMEUHI-WZLNRYEVSA-N 0.000 description 1
- CLEGSEJVGBYZBJ-MEYUZBJRSA-N Tyr-Thr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CLEGSEJVGBYZBJ-MEYUZBJRSA-N 0.000 description 1
- XFEMMSGONWQACR-KJEVXHAQSA-N Tyr-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XFEMMSGONWQACR-KJEVXHAQSA-N 0.000 description 1
- GAKBTSMAPGLQFA-JNPHEJMOSA-N Tyr-Thr-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 GAKBTSMAPGLQFA-JNPHEJMOSA-N 0.000 description 1
- HMPMGPISLMLHSI-JBACZVJFSA-N Tyr-Trp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N HMPMGPISLMLHSI-JBACZVJFSA-N 0.000 description 1
- HZDQUVQEVVYDDA-ACRUOGEOSA-N Tyr-Tyr-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HZDQUVQEVVYDDA-ACRUOGEOSA-N 0.000 description 1
- RGJZPXFZIUUQDN-BPNCWPANSA-N Tyr-Val-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O RGJZPXFZIUUQDN-BPNCWPANSA-N 0.000 description 1
- AEOFMCAKYIQQFY-YDHLFZDLSA-N Tyr-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AEOFMCAKYIQQFY-YDHLFZDLSA-N 0.000 description 1
- RVGVIWNHABGIFH-IHRRRGAJSA-N Tyr-Val-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O RVGVIWNHABGIFH-IHRRRGAJSA-N 0.000 description 1
- YKBUNNNRNZZUID-UFYCRDLUSA-N Tyr-Val-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YKBUNNNRNZZUID-UFYCRDLUSA-N 0.000 description 1
- DJIJBQYBDKGDIS-JYJNAYRXSA-N Tyr-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O DJIJBQYBDKGDIS-JYJNAYRXSA-N 0.000 description 1
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 1
- LTFLDDDGWOVIHY-NAKRPEOUSA-N Val-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N LTFLDDDGWOVIHY-NAKRPEOUSA-N 0.000 description 1
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 1
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 1
- LABUITCFCAABSV-BPNCWPANSA-N Val-Ala-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LABUITCFCAABSV-BPNCWPANSA-N 0.000 description 1
- UUYCNAXCCDNULB-QXEWZRGKSA-N Val-Arg-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O UUYCNAXCCDNULB-QXEWZRGKSA-N 0.000 description 1
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 1
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 1
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 1
- AUMNPAUHKUNHHN-BYULHYEWSA-N Val-Asn-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N AUMNPAUHKUNHHN-BYULHYEWSA-N 0.000 description 1
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 description 1
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 1
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 1
- NMPXRFYMZDIBRF-ZOBUZTSGSA-N Val-Asn-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N NMPXRFYMZDIBRF-ZOBUZTSGSA-N 0.000 description 1
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 1
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 1
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 1
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 1
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 1
- COSLEEOIYRPTHD-YDHLFZDLSA-N Val-Asp-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 COSLEEOIYRPTHD-YDHLFZDLSA-N 0.000 description 1
- PWRITNSESKQTPW-NRPADANISA-N Val-Gln-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N PWRITNSESKQTPW-NRPADANISA-N 0.000 description 1
- UZDHNIJRRTUKKC-DLOVCJGASA-N Val-Gln-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UZDHNIJRRTUKKC-DLOVCJGASA-N 0.000 description 1
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 1
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 1
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 1
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 1
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 1
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 1
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 1
- BEGDZYNDCNEGJZ-XVKPBYJWSA-N Val-Gly-Gln Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O BEGDZYNDCNEGJZ-XVKPBYJWSA-N 0.000 description 1
- FXVDGDZRYLFQKY-WPRPVWTQSA-N Val-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C FXVDGDZRYLFQKY-WPRPVWTQSA-N 0.000 description 1
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 1
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 1
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 1
- APEBUJBRGCMMHP-HJWJTTGWSA-N Val-Ile-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 APEBUJBRGCMMHP-HJWJTTGWSA-N 0.000 description 1
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 1
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 1
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 1
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 1
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 1
- HPANGHISDXDUQY-ULQDDVLXSA-N Val-Lys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HPANGHISDXDUQY-ULQDDVLXSA-N 0.000 description 1
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 1
- OJOMXGVLFKYDKP-QXEWZRGKSA-N Val-Met-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OJOMXGVLFKYDKP-QXEWZRGKSA-N 0.000 description 1
- MGVYZTPLGXPVQB-CYDGBPFRSA-N Val-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MGVYZTPLGXPVQB-CYDGBPFRSA-N 0.000 description 1
- WSUWDIVCPOJFCX-TUAOUCFPSA-N Val-Met-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N WSUWDIVCPOJFCX-TUAOUCFPSA-N 0.000 description 1
- MJFSRZZJQWZHFQ-SRVKXCTJSA-N Val-Met-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N MJFSRZZJQWZHFQ-SRVKXCTJSA-N 0.000 description 1
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 1
- UXODSMTVPWXHBT-ULQDDVLXSA-N Val-Phe-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N UXODSMTVPWXHBT-ULQDDVLXSA-N 0.000 description 1
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 1
- LGXUZJIQCGXKGZ-QXEWZRGKSA-N Val-Pro-Asn Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N LGXUZJIQCGXKGZ-QXEWZRGKSA-N 0.000 description 1
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 1
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 1
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 1
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 1
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 1
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 1
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 1
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 1
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 1
- YLBNZCJFSVJDRJ-KJEVXHAQSA-N Val-Thr-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O YLBNZCJFSVJDRJ-KJEVXHAQSA-N 0.000 description 1
- ZLMFVXMJFIWIRE-FHWLQOOXSA-N Val-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](C(C)C)N ZLMFVXMJFIWIRE-FHWLQOOXSA-N 0.000 description 1
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 1
- MIAZWUMFUURQNP-YDHLFZDLSA-N Val-Tyr-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N MIAZWUMFUURQNP-YDHLFZDLSA-N 0.000 description 1
- VTIAEOKFUJJBTC-YDHLFZDLSA-N Val-Tyr-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VTIAEOKFUJJBTC-YDHLFZDLSA-N 0.000 description 1
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 1
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 1
- LMVWCLDJNSBOEA-FKBYEOEOSA-N Val-Tyr-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N LMVWCLDJNSBOEA-FKBYEOEOSA-N 0.000 description 1
- ZNGPROMGGGFOAA-JYJNAYRXSA-N Val-Tyr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 ZNGPROMGGGFOAA-JYJNAYRXSA-N 0.000 description 1
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 1
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 1
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 1
- OIRDTQYFTABQOQ-UHTZMRCNSA-N Vidarabine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@@H]1O OIRDTQYFTABQOQ-UHTZMRCNSA-N 0.000 description 1
- 108700005077 Viral Genes Proteins 0.000 description 1
- 241000589636 Xanthomonas campestris Species 0.000 description 1
- 241000157303 Xanthomonas phaseoli pv. manihotis Species 0.000 description 1
- 241000490645 Yarrowia sp. Species 0.000 description 1
- 241000779672 Yersinia bercovieri ATCC 43970 Species 0.000 description 1
- GBXZONVFWYCRPT-KVTDHHQDSA-N [(2s,3s,4r,5r)-3,4,5,6-tetrahydroxy-1-oxohexan-2-yl] dihydrogen phosphate Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@@H](C=O)OP(O)(O)=O GBXZONVFWYCRPT-KVTDHHQDSA-N 0.000 description 1
- 241000193453 [Clostridium] cellulolyticum Species 0.000 description 1
- 101150095147 aacC1 gene Proteins 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 238000005903 acid hydrolysis reaction Methods 0.000 description 1
- 238000005273 aeration Methods 0.000 description 1
- 238000013019 agitation Methods 0.000 description 1
- 108010078114 alanyl-tryptophyl-alanine Proteins 0.000 description 1
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- RNBGYGVWRKECFJ-ZXXMMSQZSA-N alpha-D-fructofuranose 1,6-bisphosphate Chemical compound O[C@H]1[C@H](O)[C@](O)(COP(O)(O)=O)O[C@@H]1COP(O)(O)=O RNBGYGVWRKECFJ-ZXXMMSQZSA-N 0.000 description 1
- 150000001408 amides Chemical class 0.000 description 1
- FRHBOQMZUOWXQL-UHFFFAOYSA-L ammonium ferric citrate Chemical compound [NH4+].[Fe+3].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O FRHBOQMZUOWXQL-UHFFFAOYSA-L 0.000 description 1
- 235000011114 ammonium hydroxide Nutrition 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- OIRDTQYFTABQOQ-UHFFFAOYSA-N ara-adenosine Natural products Nc1ncnc2n(cnc12)C1OC(CO)C(O)C1O OIRDTQYFTABQOQ-UHFFFAOYSA-N 0.000 description 1
- 101150035354 araA gene Proteins 0.000 description 1
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 1
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 1
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 1
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 108010036533 arginylvaline Proteins 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- XMQFTWRPUQYINF-UHFFFAOYSA-N bensulfuron-methyl Chemical compound COC(=O)C1=CC=CC=C1CS(=O)(=O)NC(=O)NC1=NC(OC)=CC(OC)=N1 XMQFTWRPUQYINF-UHFFFAOYSA-N 0.000 description 1
- 108010064886 beta-D-galactoside alpha 2-6-sialyltransferase Proteins 0.000 description 1
- PTVXQARCLQPGIR-SXUWKVJYSA-N beta-L-fucose 1-phosphate Chemical compound C[C@@H]1O[C@H](OP(O)(O)=O)[C@@H](O)[C@H](O)[C@@H]1O PTVXQARCLQPGIR-SXUWKVJYSA-N 0.000 description 1
- 229940004120 bifidobacterium infantis Drugs 0.000 description 1
- 230000001588 bifunctional effect Effects 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 238000010170 biological method Methods 0.000 description 1
- 230000001851 biosynthetic effect Effects 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 230000003930 cognitive ability Effects 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- OXBLHERUFWYNTN-UHFFFAOYSA-M copper(I) chloride Chemical compound [Cu]Cl OXBLHERUFWYNTN-UHFFFAOYSA-M 0.000 description 1
- 125000004122 cyclic group Chemical group 0.000 description 1
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 1
- 230000002559 cytogenic effect Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 235000007882 dietary composition Nutrition 0.000 description 1
- 108010009297 diglycyl-histidine Proteins 0.000 description 1
- 102000004419 dihydrofolate reductase Human genes 0.000 description 1
- 150000002016 disaccharides Chemical class 0.000 description 1
- 239000003480 eluent Substances 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 210000001723 extracellular space Anatomy 0.000 description 1
- 101150111583 fda gene Proteins 0.000 description 1
- 210000003608 fece Anatomy 0.000 description 1
- 229960004642 ferric ammonium citrate Drugs 0.000 description 1
- 230000004907 flux Effects 0.000 description 1
- 235000013350 formula milk Nutrition 0.000 description 1
- RNBGYGVWRKECFJ-UHFFFAOYSA-N fructose-1,6-phosphate Natural products OC1C(O)C(O)(COP(O)(O)=O)OC1COP(O)(O)=O RNBGYGVWRKECFJ-UHFFFAOYSA-N 0.000 description 1
- 238000012239 gene modification Methods 0.000 description 1
- 238000003208 gene overexpression Methods 0.000 description 1
- 230000005017 genetic modification Effects 0.000 description 1
- 235000013617 genetically modified food Nutrition 0.000 description 1
- 235000003869 genetically modified organism Nutrition 0.000 description 1
- 238000010362 genome editing Methods 0.000 description 1
- 230000004110 gluconeogenesis Effects 0.000 description 1
- 230000001890 gluconeogenic effect Effects 0.000 description 1
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 1
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 1
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 1
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 1
- 108010033719 glycyl-histidyl-glycine Proteins 0.000 description 1
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 1
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 1
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 1
- 108010020688 glycylhistidine Proteins 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- QGWNDRXFNXRZMB-UHFFFAOYSA-N guanidine diphosphate Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1OC(COP(O)(=O)OP(O)(O)=O)C(O)C1O QGWNDRXFNXRZMB-UHFFFAOYSA-N 0.000 description 1
- 108010040030 histidinoalanine Proteins 0.000 description 1
- 108010050343 histidyl-alanyl-glutamine Proteins 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 210000000987 immune system Anatomy 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000000415 inactivating effect Effects 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 235000021125 infant nutrition Nutrition 0.000 description 1
- 238000011081 inoculation Methods 0.000 description 1
- 239000002054 inoculum Substances 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000005342 ion exchange Methods 0.000 description 1
- 239000004313 iron ammonium citrate Substances 0.000 description 1
- 235000000011 iron ammonium citrate Nutrition 0.000 description 1
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 101150001899 lacY gene Proteins 0.000 description 1
- 101150066555 lacZ gene Proteins 0.000 description 1
- 229940039696 lactobacillus Drugs 0.000 description 1
- 229940039695 lactobacillus acidophilus Drugs 0.000 description 1
- 229940004208 lactobacillus bulgaricus Drugs 0.000 description 1
- 229940017800 lactobacillus casei Drugs 0.000 description 1
- 229940072205 lactobacillus plantarum Drugs 0.000 description 1
- 229940001882 lactobacillus reuteri Drugs 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 1
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 1
- 108010091871 leucylmethionine Proteins 0.000 description 1
- 101150006217 lex1 gene Proteins 0.000 description 1
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010012988 lysyl-glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 108010076718 lysyl-glutamyl-tryptophan Proteins 0.000 description 1
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 1
- 101150088678 manB gene Proteins 0.000 description 1
- 102000016470 mariner transposase Human genes 0.000 description 1
- 108060004631 mariner transposase Proteins 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 238000012269 metabolic engineering Methods 0.000 description 1
- 230000037353 metabolic pathway Effects 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 239000002207 metabolite Substances 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 238000001823 molecular biology technique Methods 0.000 description 1
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 1
- 235000019796 monopotassium phosphate Nutrition 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 229950006780 n-acetylglucosamine Drugs 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 238000002414 normal-phase solid-phase extraction Methods 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 239000008194 pharmaceutical composition Substances 0.000 description 1
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 1
- 108010065135 phenylalanyl-phenylalanyl-phenylalanine Proteins 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 229920001282 polysaccharide Polymers 0.000 description 1
- 239000005017 polysaccharide Substances 0.000 description 1
- 239000011148 porous material Substances 0.000 description 1
- GNSKLFRGEWLPPA-UHFFFAOYSA-M potassium dihydrogen phosphate Chemical compound [K+].OP(O)([O-])=O GNSKLFRGEWLPPA-UHFFFAOYSA-M 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000008439 repair process Effects 0.000 description 1
- 230000000630 rising effect Effects 0.000 description 1
- 239000011435 rock Substances 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 239000001488 sodium phosphate Substances 0.000 description 1
- 229910000162 sodium phosphate Inorganic materials 0.000 description 1
- 238000003756 stirring Methods 0.000 description 1
- 229960005322 streptomycin Drugs 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 230000008093 supporting effect Effects 0.000 description 1
- 150000004044 tetrasaccharides Chemical class 0.000 description 1
- 108060008226 thioredoxin Proteins 0.000 description 1
- 229940094937 thioredoxin Drugs 0.000 description 1
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 1
- 108091006107 transcriptional repressors Proteins 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- IEDVJHCEMCRBQM-UHFFFAOYSA-N trimethoprim Chemical compound COC1=C(OC)C(OC)=CC(CC=2C(=NC(N)=NC=2)N)=C1 IEDVJHCEMCRBQM-UHFFFAOYSA-N 0.000 description 1
- 229960001082 trimethoprim Drugs 0.000 description 1
- 108700004896 tripeptide FEG Proteins 0.000 description 1
- 150000004043 trisaccharides Chemical class 0.000 description 1
- RYFMWSXOAZQYPI-UHFFFAOYSA-K trisodium phosphate Chemical compound [Na+].[Na+].[Na+].[O-]P([O-])([O-])=O RYFMWSXOAZQYPI-UHFFFAOYSA-K 0.000 description 1
- 108010058119 tryptophyl-glycyl-glycine Proteins 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 108010072644 valyl-alanyl-prolyl-glycine Proteins 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- 230000035899 viability Effects 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 239000003643 water by type Substances 0.000 description 1
- 108010027345 wheylin-1 peptide Proteins 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
-
- A—HUMAN NECESSITIES
- A23—FOODS OR FOODSTUFFS; TREATMENT THEREOF, NOT COVERED BY OTHER CLASSES
- A23L—FOODS, FOODSTUFFS OR NON-ALCOHOLIC BEVERAGES, NOT OTHERWISE PROVIDED FOR; PREPARATION OR TREATMENT THEREOF
- A23L33/00—Modifying nutritive qualities of foods; Dietetic products; Preparation or treatment thereof
- A23L33/40—Complete food formulations for specific consumer groups or specific purposes, e.g. infant formula
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N1/00—Microorganisms, e.g. protozoa; Compositions thereof; Processes of propagating, maintaining or preserving microorganisms or compositions thereof; Processes of preparing or isolating a composition containing a microorganism; Culture media therefor
- C12N1/20—Bacteria; Culture media therefor
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1048—Glycosyltransferases (2.4)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1048—Glycosyltransferases (2.4)
- C12N9/1051—Hexosyltransferases (2.4.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/24—Hydrolases (3) acting on glycosyl compounds (3.2)
- C12N9/2402—Hydrolases (3) acting on glycosyl compounds (3.2) hydrolysing O- and S- glycosyl compounds (3.2.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/24—Hydrolases (3) acting on glycosyl compounds (3.2)
- C12N9/2402—Hydrolases (3) acting on glycosyl compounds (3.2) hydrolysing O- and S- glycosyl compounds (3.2.1)
- C12N9/2405—Glucanases
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Tropical Medicine & Parasitology (AREA)
- Virology (AREA)
- Polymers & Plastics (AREA)
- Plant Pathology (AREA)
- Nutrition Science (AREA)
- Food Science & Technology (AREA)
- Pediatric Medicine (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Mycology (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Coloring Foods And Improving Nutritive Qualities (AREA)
- Saccharide Compounds (AREA)
- Pharmaceuticals Containing Other Organic And Inorganic Compounds (AREA)
- Polysaccharides And Polysaccharide Derivatives (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
公开了一种使用基因工程微生物宿主细胞生产所需低聚糖的方法,所述基因工程微生物宿主细胞已被基因工程改造以表达异源糖苷酶,其能够在细胞内降解在所需低聚糖的细胞内生物合成过程中产生的代谢糖副产物。
Description
本发明涉及通过微生物发酵生产低聚糖。更具体地,本发明涉及糖苷酶通过微生物发酵改善所需低聚糖的生产的用途。
背景技术
人乳含有称为人乳低聚糖(HMO)的不同低聚糖的独特混合物。迄今为止,已在人乳中鉴定出150多种结构不同的低聚糖。除极少数例外以外,HMO的特征在于在其还原端为乳糖部分,许多HMO在其非还原端含有岩藻糖残基和/或N-乙酰神经氨酸残基。通常,HMO的单糖残基来源于D-葡萄糖、D-半乳糖、N-乙酰葡糖胺、L-岩藻糖和N-乙酰神经氨酸。HMO对婴儿营养的重要性与其独特的生物活性直接相关,所述生物活性包括保护新生儿免受病原体的侵害,支持婴儿免疫系统和认知能力的发育。因此,人们对以商业规模制备HMO具有强烈的兴趣。
除了单个HMO的化学合成外,在使用过表达异源糖基转移酶的基因修饰的微生物通过微生物发酵生产HMO的开发中也取得了相当大的进展。在允许微生物表达所述异源糖基转移酶的培养基中和条件下培养此类微生物时,所述微生物可以产生HMO,并从培养基或细胞裂解物中回收。
然而,糖基转移酶通常具有酶的副活性,使得它们产生所需低聚糖的过表达通常会导致不需要的副产物。通常,这些副产物也是低聚糖,但必须从所需低聚糖的制备中去除,以用于产品的商业用途。然而,从所需低聚糖中去除此类副产物是困难和麻烦的。去除此类副产物的一种方法涉及使用糖苷酶,其是外源添加到含有所需和不需要的低聚糖的反应混合物/细胞培养基中,或者是由基因工程微生物在用于生产所需低聚糖的发酵过程结束时经在特定时间点诱导产生的。
国际公布号WO 2015/032412 A1涉及岩藻糖的用途,并公开了一种方法,其中在乳糖存在下培养表达异源岩藻糖基转移酶的基因修饰细胞,以高产率产生和分泌2’-岩藻糖基乳糖(2’-FL)和二岩藻糖基乳糖(DFL)的混合物至培养基的细胞外空间。分离糖,并通过酸或岩藻糖苷酶进行水解以高产率产生岩藻糖。
国际公布号WO 2104/090261 A1公开了一种形成含有2’-FL和3-岩藻糖基乳糖(3-FL)的至少一种的混合物的方法,其中将DFL部分水解,例如酶水解或酸水解。在酶水解中,将DFL暴露于岩藻糖苷酶中,该酶可从DFL中释放一个岩藻糖残基。将DFL(10mM)与来自曼尼霍蒂斯黄单胞菌(Xanthomonas manihotis)的1,2-α-L-岩藻糖苷酶在37℃下在孵育缓冲液中孵育,水解DFL,然后通过HPLC。在18小时后,DFL被部分水解为3-FL和岩藻糖。未检出乳糖。
欧洲专利申请号EP 2 845 905 A1涉及低聚糖的生产,并公开了在生产和/或纯化低聚糖的方法中使用一种或多种糖苷酶。所述方法包括:a)在允许生产所述所需低聚糖的条件下和培养基中培育适合生产所需低聚糖的宿主微生物,从而生产低聚糖,并在适用的情况下生产生物合成糖中间体和/或副产物;b)在培养宿主微生物的培养基中使用糖苷酶,以降解生物合成的糖中间体和/或糖副产物和/或未使用的糖底物;和c)回收所需低聚糖。在实施方案中,所述糖苷酶是在宿主微生物中内源生产的,其中所述糖苷酶是在宿主微生物中非天然存在的糖苷酶,并且其中所述糖苷酶在所述宿主微生物中的表达是可诱导的,使得在宿主微生物的培养过程中已经产生了足够和/或基本上最大量的所需低聚糖后,可以启动该表达。
总之,现有技术公开了糖苷酶在反应混合物/细胞培养基中通过水解不需要的低聚糖从所需和不需要的低聚糖的混合物中去除不需要的低聚糖的用途。然而,这些方法包括通过微生物(包括使用底物和能量)对不需要的低聚糖的生物合成,并且这些方法需要从所需低聚糖中去除不需要的低聚糖的降解产物。
因此,本发明的目的是提供一种生产所需低聚糖的方法,其在含有待发酵的微生物的细胞培养基中通过微生物发酵进行,而不同时产生/积累不需要的糖副产物,即不需要的低聚糖。
所述目的是通过提供能够产生所需低聚糖的基因工程微生物宿主细胞来解决的,其中所述微生物宿主细胞表达异源糖苷酶,其能够降解细胞内生物合成所需低聚糖过程中产生的胞内代谢产物,从而防止在培养基中形成所需和不需要的糖的混合物。然后,所述降解产物可通过微生物宿主细胞的代谢利用,例如用于生物合成所需低聚糖。
表1提供了所需低聚糖和在生产所需低聚糖的过程中添加的可想到的前体和/或产生的不需要的糖副产物的综合概述。
表1:所需低聚糖和在生产所需低聚糖的过程中添加的可想到的前体和/或产生的不需要的糖副产物的概述。
发明内容
在第一方面,公开了一种使用能够产生所需低聚糖的基因工程微生物宿主细胞生产所需低聚糖的方法,所述微生物宿主细胞表达异源糖苷酶,其能够在细胞内降解在所需低聚糖的细胞内生物合成过程中产生的代谢糖副产物。
在第二方面,公开了一种用于生产所需低聚糖的基因工程微生物宿主细胞,其中所述微生物宿主细胞能够产生所需低聚糖,并且其中所述微生物宿主细胞已被基因工程改造以表达异源糖苷酶,其能够在细胞内降解在所需低聚糖的细胞内生物合成过程中产生的代谢糖副产物。
在第三方面,公开了根据第二方面的基因工程微生物宿主细胞用于生产所需低聚糖的用途。
在第四方面,公开了由根据第一方面的方法和/或使用根据第二方面的基因工程微生物宿主细胞生产的低聚糖,即所需低聚糖。
在第五方面,公开了根据第四方面的所需低聚糖用于生产营养组合物的用途。
在第六方面,公开了包含根据第四方面的所需低聚糖的营养组合物。
附图说明
图1示出了表达异源糖苷酶(如α-1,3-岩藻糖苷酶)的微生物宿主细胞的实施方案的示意图,所述异源糖苷酶能够降解在所需低聚糖(2’-岩藻糖基乳糖)的细胞内生物合成过程中产生的代谢糖副产物(例如3-岩藻糖基乳糖和2’3-二岩藻糖基乳糖),其中微生物宿主细胞能够回收由所述糖苷酶的酶活性产生的降解产物(例如岩藻糖和乳糖),用于产生所需低聚糖。
具体实施方式
根据第一方面,提供了一种使用基因工程微生物宿主细胞生产所需低聚糖的方法,所述方法包括以下步骤:
(i)提供能够产生所需低聚糖的基因工程微生物宿主细胞,其中微生物宿主细胞已被基因工程改造以表达异源糖苷酶,其能够在细胞内降解在所需低聚糖的细胞内生物合成过程中产生的代谢糖副产物,其中微生物宿主细胞能够回收由所述糖苷酶的酶活性产生的降解产物;
(ii)在允许产生所需低聚糖的条件下和培养基中培养基因工程微生物宿主细胞,从而产生所需低聚糖;和
(iii)任选地,回收所需低聚糖。
本文使用的关于低聚糖的术语“所需”是指旨在由微生物宿主细胞产生的低聚糖。术语“所需”用于区分有意产生的低聚糖和微生物宿主细胞可能产生的其他低聚糖。所述其他低聚糖被认为是“不需要的”,无论这些其他低聚糖是否具有生物功能,是否参与其他细胞化合物(如糖脂、糖蛋白或多糖)的生物合成,或者是否是在所需低聚糖的细胞内生物合成过程中产生的代谢糖产物,这些代谢糖产物要么是由于参与所需低聚糖的生物合成的一种或多种酶的次要(不需要的)酶活性而产生的,要么是由于一种或多种不直接参与所需低聚糖的生物合成,而是使用低聚糖——在产生所需低聚糖的代谢途径中作为中间体而生成的——作为底物的酶的酶活性而产生的。
本文使用的术语“低聚糖”是指由三至二十个单糖残基组成的糖分子,其中每个所述单糖残基通过糖苷键与所述单糖单元中的至少另一个结合。低聚糖可为单糖残基的直链,或单糖残基的支链。
在另一个和/或替代的实施方案中,所需低聚糖为人乳低聚糖(HMO)。
在另一个和/或替代的实施方案中,所需低聚糖为选自以下的HMO:2’-岩藻糖基乳糖(2’-FL)、3-岩藻糖基乳糖(3-FL)、2’3-二岩藻糖基乳糖(DFL)、乳-N-三糖II、乳-N-四糖(LNT)、乳-N-新四糖(LNnT)、乳-N-岩藻戊糖I(LNFP-I)、乳-N-新岩藻戊糖I(LNnFP-I)、乳-N-岩藻戊糖II(LNFP-II)、乳-N-岩藻戊糖III(LNFP-III)、乳-N-岩藻戊糖V(LNFP-V)、乳-N-新岩藻戊糖V(LNnFP-V)、乳-N-二岩藻己糖I、乳-N-二岩藻糖基己糖II、对-乳-N-岩藻糖基己糖、岩藻糖基-乳-N-唾液酸戊糖b、岩藻糖基-乳-N-唾液酸戊糖c、岩藻糖基-乳-N-唾液酸戊糖c、二唾液酸-乳-N-岩藻戊糖、3-岩藻糖基-3’-唾液酸乳糖、3-岩藻糖基-6’-唾液酸乳糖、乳-N-新二岩藻己糖I、3’-唾液酸乳糖(3-SL)、6’-唾液酸乳糖(6-SL)、唾液酸乳-N-四糖a(LST-a)、唾液酸乳-N-四糖b(LST-b)、唾液酸乳-N-四糖c(LST-c)和二唾液酸乳-N-四糖。
所述方法包括提供能够产生所需低聚糖的基因工程微生物宿主细胞。
本文使用的术语“基因工程(genetically-engineered)”是指使用分子生物学方法进行的对细胞遗传组成的修饰。细胞遗传组成的修饰可包括基因在物种界限内和/或跨越物种界限的转移、插入、缺失、替换和/或修饰核苷酸、三联体、基因、开放阅读框、启动子、增强子、终止子和其他介导和/或控制基因表达的核苷酸序列。细胞遗传组成的修饰旨在产生具有特定的、所需特性的基因修饰生物体。基因工程微生物宿主细胞可以含有一个或多个在天然(非基因工程)形式的细胞中不存在的基因。本领域技术人员已知用于将外源核酸分子引入和/或将外源核酸分子(重组、异源)插入到细胞的遗传信息中来插入、缺失或改变细胞遗传信息的核甘酸序列的技术。基因工程细胞可以含有一个或多个存在于天然形式的细胞中的基因,其中所述基因通过人工手段被修饰并重新导入到细胞中。术语“基因工程”还涵盖这样的细胞,其含有对于细胞而言是内源性的核酸分子,并且已经被修饰而未将核酸分子从细胞中移除。此类修饰包括通过基因替换、位点特异性突变和包括通常称为“基因编辑”的相关技术获得的那些修饰。
基因工程微生物宿主细胞可为原核细胞或真核细胞。合适的微生物宿主细胞包括酵母细胞、细菌细胞、古细菌细胞和真菌细胞。
在另一个和/或替代的实施方案中,原核细胞是细菌细胞,优选选自以下细菌属的细菌细胞:芽孢杆菌属(Bacillus)、双歧杆菌属(Bifidobacterium)、梭菌属(Clostridium)、棒状杆菌属(Corynebacterium)、肠球菌属(Enterococcus)、乳杆菌属(Lactobacillus)、乳球菌属(Lactococcus)、微球菌属(Micrococcus)、小单孢菌属(Micromonospora)、假单胞菌属(Pseudomonas)、红球菌属(Rhodococcus)和芽胞乳杆菌属(Sporolactobacillus)。适合的细菌种为枯草芽孢杆菌(Bacillus subtilis)、地衣芽孢杆菌(B.licheniformis)、凝结芽孢杆菌(B.coagulans)、嗜热芽孢杆菌(B.thermophilus)、侧孢芽孢杆菌(B.laterosporus)、巨大芽孢杆菌(B.megaterium)、蕈状芽孢杆菌(B.mycoides)、短小芽孢杆菌(B.pumilus)、迟缓芽孢杆菌(B.lentus)、蜡样芽孢杆菌(B.cereus)、环状芽孢杆菌(B.circulans)、长双歧杆菌(Bifidobacterium longum)、婴儿双歧杆菌(B.infantis)、两歧双歧杆菌(B.bifidum)、弗氏柠檬酸杆菌(Citrobacterfreundii)、解纤维素梭菌(Clostridium cellulolyticum)、永达尔梭菌(C.ljungdahlii)、自产乙醇梭菌(C.autoethanogenum)、丙酮丁醇梭菌(C.acetobutylicum)、谷氨酸棒状杆菌(Corynebacterium glutamicum)、屎肠球菌(Enterococcus faecium)、嗜热肠球菌(E.thermophiles)、大肠杆菌(Escherichia coli)、草生欧文氏菌(Erwinia herbicola)(成团泛菌(Pantoea agglomerans))、嗜酸乳杆菌(Lactobacillus acidophilus)、唾液乳杆菌(L.salivarius)、胚牙乳杆菌(L.plantarum)、瑞士乳杆菌(L.helveticus)、德氏乳杆菌(L.delbrueckii)、鼠李糖乳杆菌(L.rhamnosus)、保加利亚乳杆菌(L.bulgaricus)、卷曲乳杆菌(L.crispatus)、加氏乳杆菌(L.gasseri)、干酪乳杆菌(L.casei)、罗伊氏乳杆菌(L.reuteri)、詹氏乳杆菌(L.jensenii)、乳酸乳球菌(L.lactis)、柠檬泛菌(Pantoeacitrea)、胡萝卜软腐果胶杆菌(Pectobacterium carotovorum)、费氏丙酸杆菌(Proprionibacterium freudenreichii)、荧光假单胞菌(Pseudomonas fluorescens)、铜绿假单胞菌(P.aeruginosa)、嗜热链球菌(Streptococcus thermophiles)和野油菜黄单胞菌(Xanthomonas campestris)。
在另一个和/或替代的实施方案中,真核细胞是酵母细胞,优选选自以下的酵母细胞:酵母属某些种(Saccharomyces sp.),特别是酿酒酵母(Saccharomyces cerevisiae);复膜孢酵母属某些种(Saccharomycopsis sp.);毕赤酵母属某些种(Pichia sp.),特别是巴斯德毕赤酵母(Pichia pastoris);汉森酵母属某些种(Hansenula sp.)、克鲁维酵母属种(Kluyveromyces sp.);亚罗酵母属某些种(Yarrowia sp.);红酵母属某些种(Rhodotorula sp.)和裂殖酵母属某些种(Schizosaccharomyces sp.)。
基因工程微生物宿主细胞能够产生所需低聚糖。本文使用的术语“能够产生”是指基因工程微生物宿主细胞产生所需低聚糖的能力,条件是微生物宿主细胞是在允许微生物宿主细胞产生所需低聚糖的条件下和培养基中培养的。因此,培养基必须含有在规定范围内的pH值、离子和营养物的组合物以及维持微生物宿主细胞活力和代谢活性所需的化合物。如果对于产生所需低聚糖是必需的,则培养基还必须含有足够量的用于通过微生物宿主细胞生物合成所需低聚糖所需要的任何前体。同样,必须保持用于培养产生所需低聚糖的微生物宿主细胞的条件(例如温度、pH、供氧、搅拌、营养物供应等),使得微生物宿主细胞能够为代谢活性的或保持代谢活性以产生所需低聚糖。
在另一个和/或替代的实施方案中,能够生产所需低聚糖的基因工程微生物宿主细胞为已被基因工程改造以能够产生所需低聚糖的微生物宿主细胞。在另一个和/或替代的实施方案中,基因工程微生物宿主细胞已被基因工程改造以表达异源糖基转移酶。异源糖苷酶在发酵过程中(即在所需低聚糖的产生或生物合成过程中)在基因工程微生物宿主细胞中表达。在另一个和/或替代的实施方案中,异源糖苷酶的表达在基因工程微生物宿主中为组成型。
本文使用的术语“异源”是指对于细胞或生物体而言是外来的核苷酸序列、核酸分子或多肽,即是指非天然存在于所述细胞或生物体中的核苷酸序列、核酸分子或多肽。本文所用的“异源序列”或“异源核酸”或“异源多肽”是源自特定宿主细胞以外的来源(例如来自不同物种)的序列、核酸分子或多肽,或如果来自同一来源,则从其原始形式被修饰的序列、核酸分子或多肽。因此,可操作地连接到启动子的异源核酸来自不同于该启动子的来源,或者,如果来自同一来源,则从其原始形式被修饰。所述异源序列可以例如通过转染、转化、接合或转导稳定地引入宿主微生物宿主细胞的基因组中,从而代表基因工程宿主细胞。可以应用技术,这些技术将取决于将要引入序列的宿主细胞。各种技术对于本领域的技术人员来说是已知的并且例如公开于Sambrook et al.,Molecular Cloning:A LaboratoryManual,2nd Ed.,Cold Spring Harbor Laboratory Press,Cold Spring Harbor,N.Y.(1989)。因此,“异源多肽”是一种非天然存在于基因工程细胞所源自的野生型细胞中的多肽,“异源糖基转移酶”是一种非天然存在于基因工程细胞所源自的野生型细胞中的糖基转移酶。
在另一个和/或替代的实施方案中,异源糖基转移酶选自岩藻糖基转移酶(优选α-1,2-岩藻糖基转移酶和α-1,3-岩藻糖基转移酶)、葡糖基转移酶、半乳糖基转移酶(优选β-1,3-半乳糖基转移酶和β-1,4-半乳糖基转移酶)、唾液酸转移酶(优选α-2,3-唾液酸转移酶和α-2,6-唾液酸转移酶)和N-乙酰葡糖胺基转移酶。
岩藻糖基转移酶催化岩藻糖残基从供体鸟苷二磷酸活化的L-岩藻糖(GDP-岩藻糖)转移到几个受体分子。岩藻糖基转移酶在动物、植物、真菌和细菌中表达,并根据受体底物上的岩藻糖键进行分类。因此,α-1,2-岩藻糖基转移酶、α-1,3/4-岩藻糖基转移酶和α-1,6-岩藻糖基转移酶是彼此区别的。例如,在欧洲专利申请号17 180 176中公开了合适的用于在基因工程微生物宿主细胞中异源表达的岩藻糖基转移酶。
唾液酸转移酶催化N-乙酰神经氨酸(Neu5Ac)残基从供体CMP-Neu5Ac转移到受体分子。发现唾液酸转移酶在动物、植物、真菌和细菌中表达。唾液酸转移酶根据Neu5Ac与受体分子之间形成的键进行分类。因此,α-2,3-唾液酸转移酶、α-2,6-唾液酸转移酶和α-2,8-唾液酸转移酶是彼此区别的。例如,在欧洲专利申请号17 183 391中公开了合适的用于在基因工程微生物宿主细胞中异源表达的唾液酸转移酶。
半乳糖基转移酶催化半乳糖残基从供体UDP-半乳糖转移到受体底物。根据半乳糖和受体分子之间形成的键来区分半乳糖基转移酶。因此,β-1,3-半乳糖基转移酶和β-1,4-半乳糖基转移酶是彼此区别的。合适的用于在基因工程微生物宿主细胞中异源表达的β-1,3-半乳糖基转移酶由肠道沙门氏菌(Salmonella enterica)wbdO基因编码。合适的用于在基因工程微生物宿主细胞中异源表达的β-1,4-半乳糖基转移酶由嗜沫聚集杆菌(Aggregatibacter aphrophilus)的lex1基因编码的。
基因工程微生物宿主细胞已被基因工程改造以表达异源糖苷酶,其能够在细胞内降解在所需低聚糖的细胞内生物合成过程中产生的代谢糖副产物。合适的糖苷酶是相对于被酶活性水解的糖苷键和/或相对于被糖苷酶水解的底物具有特异性的糖苷酶。由于所述特异性,糖苷酶水解不需要的副产物,但不水解要生产的所需低聚糖。在另一个和/或替代的实施方案中,糖苷酶不水解由微生物宿主细胞内化或合成以产生所需低聚糖的一种或多种前体。优选地,所述糖苷酶为外切糖苷酶。
外切糖苷酶为糖苷水解酶,其破坏了低聚糖结构末端残基处的糖苷键。
在另一个和/或替代的实施方案中,异源糖苷酶选自岩藻糖苷酶(包括α-1,2-岩藻糖苷酶和α-1,3-岩藻糖苷酶)、唾液酸酶(如α-2,3-唾液酸酶、α-2,6-唾液酸酶、α-2,8-唾液酸酶)、半乳糖苷酶(如β-1,3-半乳糖苷酶、β-1,4-半乳糖苷酶和β-1,6-半乳糖苷酶)、β-N-乙酰己糖胺酶和葡糖苷酶(如β-1,3-葡糖苷酶)。
合适的岩藻糖苷酶为α-1,2-岩藻糖苷酶。α-1,2-岩藻糖苷酶是高度特异性的外切糖苷酶,其催化低聚糖中线性α-1,2-连接的L-岩藻吡喃糖基残基的水解。优选的α-1,2-岩藻糖苷酶为两岐双岐杆菌的AfcA(SEQ ID NO:2)。
在另一个和/或替代的实施方案中,提供了能够产生3-FL的基因工程微生物宿主细胞,其中所述基因工程微生物宿主细胞表达α-1,2-岩藻糖苷酶。为了能够产生3-FL,基因工程微生物宿主细胞表达α-1,3-岩藻糖基转移酶。所述α-1,3-岩藻糖基转移酶能够将岩藻糖残基从GDP-岩藻糖转移到乳糖(作为受体底物)的葡萄糖部分,从而合成3-FL,作为所需低聚糖。2’-FL和2’3-DFL是3-FL生产中不需要的糖副产物。
通过在能够产生3-FL的基因工程微生物宿主细胞中表达异源α-1,2-岩藻糖苷酶,可以消除或至少减少副产物2’-FL和2’3-DFL的产生,因为这些副产物在基因工程微生物宿主细胞内被异源α-1,2-岩藻糖苷酶水解。得到的降解产物为岩藻糖和乳糖。岩藻糖和乳糖都可以被基因工程微生物宿主细胞利用以产生所需的3-FL。
在另一个和/或替代的实施方案中,基因工程微生物宿主细胞已被基因工程改造以表达α-1,2-岩藻糖苷酶。在另一个和/或替代的实施方案中,基因工程微生物宿主细胞已被基因工程改造以含有包含编码α-1,2-岩藻糖苷酶用于其表达的核苷酸序列的核酸分子。优选地,编码α-1,2-岩藻糖苷酶的核苷酸序列为选自以下的核苷酸序列:
-由SEQ ID NO:1表示的核苷酸序列;
-与在严格的条件下与由SEQ ID NO:1表示的核苷酸序列杂交的核苷酸序列互补的核苷酸序列;
-与由SEQ ID NO:1表示的核苷酸序列具有至少70%、75%、80%、85%、90%、95%、96%、97%、98%或99%的序列同一性的核苷酸序列;
-编码具有由SEQ ID NO:2表示的氨基酸序列的多肽的核苷酸序列;和
-编码由SEQ ID NO:2表示的多肽序列的功能性变体的核苷酸序列,其中功能性变体的氨基酸序列与由SEQ ID NO:2表示的氨基酸序列具有至少70%、75%、80%、85%、90%、95%、96%、97%、98%或99%的序列同一性。
本文使用的术语“杂交(hybridize)”或“杂交(hybridizing)”是指在常规条件下杂交,如Sambrook et al.(1989)"Molecular Cloning,A Laboratory Manual"(ColdSpring Harbor Laboratory Press,New York)中描述的,优选在严格条件下。严格的杂交条件为例如:在65℃下在4x SSC中杂交,随后在65℃下在0.1x SSC中进行多次洗涤,共持续约1小时。较不严格的杂交条件为例如:在37℃下在4x SSC中杂交,随后在室温(约21℃)下在1x SSC中多次洗涤。“严格的杂交条件”还指:在68℃下在0.25M磷酸钠、pH 7.2、7%SDS、1mM EDTA和1%BSA中杂交16小时,随后在68℃下用2x SSC和01%SDS洗涤两次。
为了表达编码α-1,2-岩藻糖苷酶或其功能性变体的核苷酸序列,所述核苷酸序列可操作地连接到表达控制序列,其介导编码α-1,2-岩藻糖苷酶或其功能性变体的核苷酸序列在基因工程微生物宿主细胞中的表达。
“表达控制序列”为不是编码蛋白质的核苷酸序列的一部分,而是介导编码蛋白质的核苷酸序列的表达的调控核苷酸序列。调控元件核苷酸序列包括启动子、顺式调控元件、增强子、内含子和终止子。根据调控元件的类型,它在编码蛋白质的核苷酸序列之前(即3’)或编码蛋白质的核苷酸序列之后(即5’)存在于核酸分子上。调控元件在微生物宿主细胞中是功能性的。
术语“可操作地连接”是指调控元件以这种方式与编码蛋白质的核苷酸序列连接,即相对于编码蛋白质的核苷酸序列以这种方式定位在例如核酸分子上,在调控元件的控制下,编码蛋白质的核苷酸序列的表达可以发生在活细胞中。
为了本发明的目的,“启动子”是调控核苷酸序列的表达基因,其通常位于基因的5’端,并通过与特定的DNA结合蛋白的相互作用介导RNA聚合酶转录启动。
此外,合适的启动子包括合成启动子。这些是通过分子生物学技术创建的的启动子,在自然界中没有发现这种构型的启动子。合成启动子是除了最小启动子之外只包含一个或多个选择的、定义的顺式元件的极简启动子。这些顺式元件是DNA结合蛋白(如转录因子)的结合位点,分离自天然启动子,源自先前分离的顺式元件,或通过随机重组技术技术性产生,并通过适当的方法选择;与天然启动子相比,由于合成启动子的结构不太复杂,其仅由少数外源和内源因子激活,因此受到更具体的调控。
“最小启动子”或“核心”启动子是含有基础转录因子复合物的结合位点,并允许通过RNA聚合酶II准确启动转录的核苷酸序列。最小启动子的特征序列基序为TATA盒、启动子元件(Inr)、“TFBII识别元件”(BRE)和“下游核心启动子元件”(OPE)。在最小启动子中,这些元件可以单独或组合存在。最小启动子或其序列基序是例如可从细菌、真菌或病毒基因中获得的。
“顺式元件”是指与待表达的编码蛋白质的核苷酸序列位于同一核酸分子上的核苷酸序列。顺式元件不必编码RNA或蛋白质,在转录方向上可以位于待表达的编码蛋白质的核苷酸序列之前或之后。在待表达的编码蛋白质的核苷酸序列之前的上游顺式元件通常提供必要的结合基序,特别是对于转录因子而言是必要的,所述转录因子在分子水平上从另一端作为(Lat.trans,'beyond'的)反式作用元件参与该基因的转录调控。此外,如果顺式元件导致转录受到抑制,那么它们称为沉默子。导致转录增强的顺式元件称为增强子。启动子中顺式/反式活性的总和决定了RNA聚合酶进行转录的强度。
此外,启动子可为嵌合启动子和/或已被顺式元件修饰的启动子。启动子的修饰还可指在启动子中额外引入顺式元件,所述启动子例如已经天然具有顺式元件。此外,修饰还包括顺式元件的多聚化,特别是天然存在的顺式元件的多聚化。与天然形式相比,这种修饰的启动子的特性例如关于特异性、表达水平或背景活性方面可能已发生了改变。
终止子为DNA上的核苷酸序列,其通常标记基因末端,导致转录终止。
另一种合适的岩藻糖苷酶为α-1,3-岩藻糖苷酶。α-1,3-岩藻糖苷酶为高度特异性糖苷酶,其催化低聚糖中α-1,3-连接的L-岩藻吡喃糖基残基的水解。优选的α-1,3-岩藻糖苷酶为来自两歧双歧杆菌的AfcB(SEQ ID NO:4)。
在另一个和/或替代的实施方案中,提供了能够产生2’-FL的基因工程微生物宿主细胞,其中所述基因工程微生物宿主生物体表达α-1,3-岩藻糖苷酶。为了能够产生2’-FL,基因工程微生物宿主细胞表达α-1,2-岩藻糖基转移酶。所述α-1,2-岩藻糖基转移酶能够将岩藻糖残基从GDP-岩藻糖转移到乳糖(作为受体底物)的半乳糖部分,从而合成2’-FL,作为所需低聚糖。3-FL和2’3-DFL是2’-FL生产中不需要的糖副产物。
通过在能够产生2’-FL的基因工程微生物宿主细胞中表达异源α-1,3-岩藻糖苷酶,可以消除或至少减少副产物3-FL和2’3-DFL的产生,因为这些副产物在基因工程微生物宿主细胞中被异源α-1,3-岩藻糖苷酶水解。得到的降解产物为岩藻糖和乳糖。岩藻糖和乳糖都可以被基因工程微生物宿主生物体利用以产生所需的2’-FL。
在另一个和/或替代的实施方案中,基因工程微生物宿主细胞已被基因工程改造以表达α-1,3-岩藻糖苷酶。在另一个和/或替代的实施方案中,基因工程微生物宿主细胞已被基因工程改造以含有包含编码α-1,3-岩藻糖苷酶用于其表达的核苷酸序列的核酸分子。优选地,编码α-1,3-岩藻糖苷酶的核苷酸序列为选自以下的核苷酸序列;
-由SEQ ID NO:3表示的核苷酸序列;
-与在严格的条件下与由SEQ ID NO:3表示的核苷酸序列杂交的核苷酸序列互补的核苷酸序列;
-与由SEQ ID NO:3表示的核苷酸序列具有至少70%、75%、80%、85%、90%、95%、96%、97%、98%或99%的序列同一性的核苷酸序列;
-编码具有由SEQ ID NO:4表示的氨基酸序列的多肽的核苷酸序列;和
-编码由SEQ ID NO:4表示的多肽序列的功能性变体的核苷酸序列,其中功能性变体的氨基酸序列与由SEQ ID NO:4表示的氨基酸序列具有至少70%、75%、80%、85%、90%、95%、96%、97%、98%或99%的序列同一性。
为了表达编码α-1,3-岩藻糖苷酶或其功能性变体的核苷酸序列,所述核苷酸序列可操作地连接到表达控制序列,其介导编码α-1,3-岩藻糖苷酶或其功能性变体的核苷酸序列在基因工程微生物宿主细胞中的表达。
在另一个和/或替代的实施方案中,提供了能够产生LNFP-I的基因工程微生物宿主细胞,其中所述基因工程微生物宿主细胞表达α-1,3-岩藻糖苷酶。为了能够产生LNFP-I,基因工程微生物宿主细胞表达β-1,3-N-乙酰葡糖胺基转移酶、β-1,3-半乳糖基转移酶和α-1,2-岩藻糖基转移酶。所述β-1,3-N-乙酰葡糖胺基转移酶能够将GlcNAc残基从UDP-GlcNAc转移到乳糖的半乳糖部分,从而合成乳-N-三糖-II(LNT-II)。所述β-1,3-半乳糖基转移酶能够将半乳糖残基从UDP-半乳糖转移到LNT-II的GlcNAc部分,从而合成乳-N-四糖(LNT)。所述α-1,2-岩藻糖基转移酶能够将岩藻糖残基从GDP-岩藻糖转移到LNT的末端半乳糖部分,从而合成LNFP-I。3-FL和2’3-DFL将会是LNFP-I生产中不需要的副产物。通过在能够产生LNFP-I的基因工程微生物宿主细胞中表达α-1,3-岩藻糖苷酶,可以消除或至少减少副产物3-FL和2’3-DFL的产生,因为这些副产物在基因工程微生物宿主细胞中被α-1,3-岩藻糖苷酶水解。得到的降解产物为岩藻糖、乳糖和2’-FL。岩藻糖和乳糖可以被基因工程微生物宿主生物体利用以产生所需的LNFP-I。
在另一个和/或替代的实施方案中,基因工程微生物宿主细胞已被基因工程改造以表达α-1,3-岩藻糖苷酶。在另一个和/或替代的实施方案中,基因工程微生物宿主细胞已被基因工程改造以含有包含编码α-1,3-岩藻糖苷酶用于其表达的核苷酸序列的核酸分子。优选地,编码α-1,3-岩藻糖苷酶的核苷酸序列为选自以下的核苷酸序列;
-SEQ ID NO:3所表示的核苷酸序列;
-与在严格的条件下与SEQ ID NO:3所表示的核苷酸序列杂交的核苷酸序列互补的核苷酸序列;
-与SEQ ID NO:3所表示的核苷酸序列具有至少70%、75%、80%、85%、90%、95%、96%、97%、98%或99%的序列同一性的核苷酸序列;
-编码具有SEQ ID NO:4所表示的氨基酸序列的多肽的核苷酸序列;和
-编码SEQ ID NO:4所表示的多肽序列的功能性变体的核苷酸序列,其中功能性变体的氨基酸序列与SEQ ID NO:4所表示的氨基酸序列具有至少70%、75%、80%、85%、90%、95%、96%、97%、98%或99%的序列同一性。
为了表达编码α-1,3-岩藻糖苷酶或其功能性变体的核苷酸序列,所述核苷酸序列可操作地连接到表达控制序列,其介导编码α-1,3-岩藻糖苷酶或其功能性变体的核苷酸序列在基因工程微生物宿主细胞中的表达。
合适的唾液酸酶为α-2,3-唾液酸酶。α-2,3-唾液酸酶为高度特异性外切糖苷酶,其催化来自低聚糖的线性α-2,3-连接的L-唾液酸残基的水解。优选的α-2,3-唾液酸酶为肺炎链球菌(Streptococcus pneumoniae)的NanB(SEQ ID NO:6)。
在另一个和/或替代的实施方案中,提供了能够产生6’-SL的基因工程微生物宿主细胞,其中所述基因工程微生物宿主细胞表达α-2,3-唾液酸酶。为了能够产生6’-SL,基因工程微生物宿主细胞表达α-2,6-唾液酸转移酶。所述α-2,6-唾液酸转移酶能够将Neu5Ac残基从CMP-Neu5Ac转移到乳糖(作为底物)的半乳糖部分,从而合成6’-SL。3’-SL是6’-SL产生中不需要的副产物。
通过在能够产生6’-SL的基因工程微生物宿主细胞中表达α-2,3-唾液酸酶,可以消除或至少减少副产物3’-SL的产生,因为该副产物在遗传修饰的微生物宿主细胞中被α-2,3-唾液酸酶水解。得到的降解产物为N-乙酰神经氨酸和乳糖。N-乙酰神经氨酸和乳糖都可以被基因工程微生物宿主生物体利用以产生所需的6’-SL。
在另一个和/或替代的实施方案中,基因工程微生物宿主细胞已被基因工程改造以表达α-2,3-唾液酸酶。在另一个和/或替代的实施方案中,基因工程微生物宿主细胞已被基因工程改造以含有包含编码α-2,3-唾液酸酶的核苷酸序列的核酸分子,用于其表达。优选地,编码α-2,3-唾液酸酶的核苷酸序列为选自以下的核苷酸序列;
-SEQ ID NO:5所表示的核苷酸序列;
-与在严格的条件下与SEQ ID NO:5所表示的核苷酸序列杂交的核苷酸序列互补的核苷酸序列;
-与SEQ ID NO:5所表示的核苷酸序列具有至少70%、75%、80%、85%、90%、95%、96%、97%、98%或99%的序列同一性的核苷酸序列;
-编码具有SEQ ID NO:6所表示的氨基酸序列的多肽的核苷酸序列;和
-编码SEQ ID NO:6所表示的多肽序列的功能性变体的核苷酸序列,其中功能性变体的氨基酸序列与SEQ ID NO:6所表示的氨基酸序列具有至少70%、75%、80%、85%、90%、95%、96%、97%、98%或99%的序列同一性。
为了表达编码α-2,3-唾液酸酶或其功能性变体的核苷酸序列,所述核苷酸序列可操作地连接到表达控制序列,其介导编码α-2,3-唾液酸酶或其功能性变体的核苷酸序列在基因工程微生物宿主细胞中的表达。
合适的半乳糖苷酶为β-1,3-半乳糖苷酶。β-1,3-半乳糖苷酶为催化低聚糖中β-1,3-连接的半乳糖残基的水解的酶。优选的β-1,3-半乳糖苷酶为长双歧杆菌的Bga42A(SEQID NO:8)。
在另一个和/或替代的实施方案中,提供了能够产生LNnT的基因工程微生物宿主细胞,其中所述基因工程微生物宿主细胞表达β-1,3-半乳糖苷酶。为了能够产生LNnT,基因工程微生物宿主细胞表达β-1,3-N-乙酰葡糖胺基转移酶和β-1,4-半乳糖基转移酶。所述β-1,3-N-乙酰葡糖胺基转移酶能够将GlcNAc残基从UDP-GIcNAc转移到乳糖的半乳糖部分,从而合成LNT-II。所述β-1,4-半乳糖基转移酶能够将半乳糖残基从UDP-半乳糖转移到LNT-II的GlcNAc部分,从而合成LNnT,作为所需低聚糖。
LNT是LNnT生产中不需要的副产物。通过在能够生产LNnT的基因工程微生物宿主细胞中表达β-1,3-半乳糖苷酶,可以消除或至少减少副产物LNT的产生,因为该副产物在基因工程微生物宿主细胞中被异源β-1,3-半乳糖苷酶水解。得到的降解产物为半乳糖和LNT-II。半乳糖和LNT-II都可以被基因工程微生物宿主生物体利用以产生所需的LNnT。
在另一个和/或替代的实施方案中,基因工程微生物宿主细胞已被基因工程改造以表达β-1,3-半乳糖苷酶。在另一个和/或替代的实施方案中,基因工程微生物宿主细胞已被基因工程改造以含有包含编码β-1,3-半乳糖苷酶用于其表达的核苷酸序列的核酸分子。
在另一个和/或替代的实施方案中,基因工程微生物宿主细胞已被基因工程改造以表达β-1,3-半乳糖苷酶。在另一个和/或替代的实施方案中,基因工程微生物宿主细胞已被基因工程改造以含有包含编码β-1,3-半乳糖苷酶用于其表达的核苷酸序列的核酸分子。
优选地,编码β-1,3-半乳糖苷酶的核苷酸序列为选自以下的核苷酸序列;
-SEQ ID NO:7所表示的核苷酸序列;
-与在严格的条件下与SEQ ID NO:7所表示的核苷酸序列杂交的核苷酸序列互补的核苷酸序列;
-与SEQ ID NO:7所表示的核苷酸序列具有至少70%、75%、80%、85%、90%、95%、96%、97%、98%或99%的序列同一性的核苷酸序列;
-编码具有SEQ ID NO:8所表示的氨基酸序列的多肽的核苷酸序列;和
-编码SEQ ID NO:8所表示的多肽序列的功能性变体的核苷酸序列,其中功能性变体的氨基酸序列与SEQ ID NO:8所表示的氨基酸序列具有至少70%、75%、80%、85%、90%、95%、96%、97%、98%或99%的序列同一性。
为了表达编码β-1,3-半乳糖苷酶或其功能性变体的核苷酸序列,所述核苷酸序列可操作地连接到表达控制序列,其介导编码β-1,3-半乳糖苷酶或其功能性变体的核苷酸序列在基因工程微生物宿主细胞中的表达。
另一种合适的半乳糖苷酶为半乳聚糖β-1,3-半乳糖苷酶。所述半乳聚糖β-1,3-半乳糖苷酶为催化具有低聚糖链的半乳糖中β-1,3-连接的半乳糖残基的水解的酶。优选的半乳聚糖β-1,3-半乳糖苷酶为热纤梭菌(Clostridium thermocellum)的Ct1、3Gal43A(SEQID NO:10)。
在另一个和/或替代的实施方案中,基因工程微生物宿主细胞已被基因工程改造以表达半乳聚糖β-1,3-半乳糖苷酶。在另一个和/或替代的实施方案中,基因工程微生物宿主细胞已被基因工程改造以含有包含编码半乳聚糖β-1,3-半乳糖苷酶用于其表达的核苷酸序列的核酸分子。优选地,编码半乳聚糖β-1,3-半乳糖苷酶的核苷酸序列为选自以下的核苷酸序列;
-SEQ ID NO:9所表示的核苷酸序列;
-与在严格的条件下与SEQ ID NO:9所表示的核苷酸序列杂交的核苷酸序列互补的核苷酸序列;
-与SEQ ID NO:9所表示的核苷酸序列具有至少70%、75%、80%、85%、90%、95%、96%、97%、98%或99%的序列同一性的核苷酸序列;
-编码具有由SEQ ID NO:10所表示的氨基酸序列的多肽的核苷酸序列;和
-编码SEQ ID NO:10所表示的多肽序列的功能性变体的核苷酸序列,其中功能性变体的氨基酸序列与SEQ ID NO:10所表示的氨基酸序列具有至少70%、75%、80%、85%、90%、95%、96%、97%、98%或99%的序列同一性。
为了表达编码半乳聚糖β-1,3-半乳糖苷酶或其功能性变体的核苷酸序列,所述核苷酸序列可操作地连接到表达控制序列,其介导编码半乳聚糖β-1,3-葡糖苷酶或其功能性变体的核苷酸序列在基因工程微生物宿主细胞中的表达。
合适的葡糖苷酶为β-1,3-葡糖苷酶。所述β-1,3-葡糖苷酶为高度特异性外切糖苷酶,其催化低聚糖中β-1,3-连接的葡萄糖残基的水解。优选的β-1,3-葡糖苷酶为类芽孢杆菌属(Paenibacillus sp.)的PglA(SEQ ID NO:12)。
在另一个和/或替代的实施方案中,提供了能够产生LNT或LNnT的基因工程微生物宿主细胞,其中所述基因工程微生物宿主细胞表达β-1,3-葡糖苷酶和/或β-1,3-半乳糖苷酶。为了能够产生LNT,基因工程微生物宿主细胞表达β-1,3-N-乙酰葡糖胺基转移酶和β-1,3-半乳糖基转移酶。所述β-1,3-N-乙酰葡糖胺基转移酶能够将GlcNAc残基从UDP-GlcNAc转移到乳糖的半乳糖部分,从而合成乳-N-三糖-II(LNT-II)。所述β-1,3-半乳糖基转移酶能够将半乳糖残基从UDP-半乳糖转移到LNT-II的GlcNAc部分,从而合成乳-N-四糖(LNT)。为了能够产生LNnT,基因工程微生物宿主细胞表达β-1,3-N-乙酰葡糖胺基转移酶和β-1,4-半乳糖基转移酶。所述β-1,3-N-乙酰葡糖胺基转移酶能够合成LNT-II。所述β-1,4-半乳糖基转移酶能够将半乳糖残基从UDP-半乳糖转移到LNT-II的GlcNAc部分,从而合成LNnT,作为所需低聚糖。
技术人员已知,β-1,3-N-乙酰葡糖胺基转移酶(如脑膜炎奈瑟氏菌(Neisseriameningitidis)的LgtA)接受广谱的供体底物。虽然主要将GlcNAc从UDP-GIcNAc转移到适当的受体糖,但LgtA还能够使用UDP-半乳糖或UDP-葡萄糖作为供体底物。使用能够生产所述的LNT或LNnT的基因工程微生物宿主生物体,所述β-1,3-N-乙酰葡糖胺基转移酶还能够将UDP-半乳糖的半乳糖残基以及UDP-葡萄糖的葡萄糖残基转移到乳糖的半乳糖部分,从而分别合成不需要的副产物Gal(β1,3)Gal(β1,4)Glc和Glc(β1,3)Gal(β1,4)Glc。
通过在能够产生LNT或LNnT的基因工程微生物宿主细胞中表达半乳聚糖β-1,3-半乳糖苷酶和/或β-1,3-葡糖苷酶,可以消除或至少减少副产物Gal(β1,3)Gal(β1,4)Glc和Glc(β1,3)Gal(β1,4)Glc的产生,因为这些副产物在基因工程微生物宿主细胞中被半乳聚糖β-1,3-半乳糖苷酶和/或β-1,3-葡糖苷酶水解。得到的降解产物为半乳糖和/或葡萄糖和乳糖。单糖和乳糖都可以被基因工程微生物宿主细胞利用以产生所需的LNT或LNnT。
在另一个和/或替代的实施方案中,基因工程微生物宿主细胞已被基因工程改造以表达β-1,3-葡糖苷酶。在另一个和/或替代的实施方案中,基因工程微生物宿主细胞已被基因工程改造以含有包含编码β-1,3-葡糖苷酶用于其表达的核苷酸序列的核酸分子。优选地,编码β-1,3-葡糖苷酶的核苷酸序列为选自以下的核苷酸序列;
-SEQ ID NO:11所表示的核苷酸序列;
-与在严格的条件下与SEQ ID NO:11所表示的核苷酸序列杂交的核苷酸序列互补的核苷酸序列;
-与SEQ ID NO:11所表示的核苷酸序列具有至少70%、75%、80%、85%、90%、95%、96%、97%、98%或99%的序列同一性的核苷酸序列;
-编码具有SEQ ID NO:12所表示的氨基酸序列的多肽的核苷酸序列;和
-编码SEQ ID NO:12所表示的多肽序列的功能性变体的核苷酸序列,其中功能性变体的氨基酸序列与SEQ ID NO:10所表示的氨基酸序列具有至少70%、75%、80%、85%、90%、95%、96%、97%、98%或99%的序列同一性。
为了表达编码β-1,3-葡糖苷酶或其功能性变体的核苷酸序列,所述核苷酸序列可操作地连接到表达控制序列,其介导编码β-1,3-葡糖苷酶或其功能性变体的核苷酸序列在基因工程微生物宿主细胞中的表达。
基因工程微生物宿主细胞能够回收至少一种由基因工程微生物宿主细胞中的异源糖苷酶的酶活性产生的降解产物。因此,基因工程微生物宿主细胞可以使用至少一种由异源糖苷酶的酶活性产生的降解产物来生产所需低聚糖。例如,从不需要的糖副产物中释放出的单糖残基可以被异源糖苷酶再激活,即结合到核苷酸上,由各自的糖基转移酶从产生的核苷酸激活的单糖转移到受体底物上,以获得所需低聚糖或所需低聚糖的前体。
所述方法包括在允许所述基因工程微生物宿主生物体产生所需低聚糖的培养基中,以及在允许所述基因工程微生物宿主生物体产生所需低聚糖的条件下培养基因工程微生物宿主细胞的步骤。
允许基因工程微生物宿主细胞产生所需低聚糖的培养基含有营养物、至少一种能量来源、必需金属和矿物质以及缓冲剂。所述培养基任选地含有所需低聚糖的前体,所述前体可由基因工程微生物宿主细胞内化,并用于产生所需低聚糖,条件是基因工程微生物宿主细胞不能自行合成所述前体。然后,基因工程微生物宿主细胞内化前体,并使前体进行所需低聚糖的生物合成。例如,乳糖可以被认为是2’-岩藻糖基乳糖的前体。
在培养用于产生所需低聚糖的基因工程微生物宿主细胞的过程中,保持允许的条件。如果在这些条件下培养的基因工程微生物宿主细胞保持活力并产生所需低聚糖,则条件是“允许的”。优选地,允许的培养条件使基因工程微生物宿主细胞增殖。需要保持在一定值或一定范围内的条件包括pH、温度、氧和营养物浓度、能量来源以及必需的金属和矿物质。
在另一个和/或替代的实施方案中,所述方法包括回收所需低聚糖的步骤。所需低聚糖可以从发酵液和/或基因工程微生物宿主生物体中回收。
本文前述的方法是有利的,因为在生产所需低聚糖的过程中很少或没有产生不需要的副产物。因此,从发酵液或细胞裂解液中回收和纯化所需低聚糖是不那么麻烦和昂贵的。
此外,更多的底物被专门用于生产所需低聚糖,而不是因为它被并入不需要的副产物中,而这些副产物不能被微生物宿主细胞代谢而变得无法用于生产所需低聚糖。
根据第二方面,提供了用于生产所需低聚糖的基因工程微生物宿主细胞,其中所述微生物宿主细胞能够产生所需低聚糖,并且其中所述微生物宿主细胞已被基因工程改造以表达异源糖苷酶,其能够在细胞内降解在所需低聚糖的细胞内生物合成过程中产生的代谢副产物。
根据第三方面,本文前述的基因工程微生物宿主细胞用于生产所需低聚糖。使用这些基因工程微生物宿主细胞通过发酵生产所需低聚糖是有利的,因为防止甚至消除了不需要的糖副产物的产生。因此,从发酵液中回收所需低聚糖既节省了资源,又不那么麻烦,因为可以避免将所需低聚糖从不需要的低聚糖副产物中分离出来。此外,与未被基因工程改造以表达异源糖苷酶的天然微生物宿主细胞相比,向本发明的基因工程微生物宿主细胞提供的更多的离析物和能量来源被转化为所需的产物。
根据第四方面,通过本文前述的方法和/或基因工程微生物宿主细胞的用途生产的所需低聚糖优选选自HMO。
通过本文所述的方法和/或基因工程微生物宿主细胞的用途生产的所需低聚糖可用于生产营养组合物。
所述营养组合物为药用组合物、膳食组合物、婴儿配方物等。
将参照具体实施方案并参照附图来描述本发明,但本发明不仅限于此,而仅通过权利要求来限定。此外,说明书和权利要求中的术语第一、第二等用于区分相似的要素,而并不必然用于在时间、空间、排序或以任何其他方式描述顺序。应该理解,如此使用的术语在适当的情况下是可互换的,并且本文所述的本发明的实施方案能够以不同于本文所述或所示的其他顺序操作。
应当注意的是,权利要求中使用的术语“包括”不应被解释为限于其后列出的方法;它不排除其他元素或步骤。因此,它被解释为指明所述特征、整数、步骤或成分的存在,但不排除存在或添加一个或多个其他特征、整数、步骤或成分或其群组。因此,表述“包括装置A和B的设备”的范围不应限于仅由组件A和B组成的设备。这意味着,就本发明而言,该装置仅有的相关组件是A和B。
在本说明书中提及“一个实施方案(one embodiment)”或“实施方案(anembodiment)”是指在本发明的至少一个实施方案中包括与实施方案相关描述的特定的特性、结构或特征。因此,短语“在一个实施方案中”或“在实施方案中”在本说明书中各处出现不一定都是指同一实施方案,而是可能。此外,在一个或多个实施方案中,可以以任何适当的方式结合特定的特性、结构或特征,这对于本领域普通技术人员来说,根据本公开将是显而易见的。
类似地,应当理解,在本发明的示例性实施方案的描述中,出于简化公开内容并帮助理解本发明的一个或多个方面的目的,有时将本发明的各种特征组合在单个实施方案、附图或其描述。然而,这种公开方法不应被解释为反映了所要求保护的发明需要比在每个权利要求中明确陈述的更多的特征的意图。相反,正如所附权利要求所反映的,本发明的方面在于小于单个前述公开实施方案的所有特征。因此,在详细说明书之后的权利要求被明确地并入本详细说明书中,每一项权利要求本身作为本发明的单独的实施方案。
此外,虽然本文描述的一些实施方案包括其他实施方案中包括的一些特征但不包括其他特征,但不同实施方案的特征的组合意于在本发明的范围内,并形成不同的实施方案,正如本领域技术人员所理解的那样。例如,在所附权利要求中,任何要求保护的实施方案都可以以任何组合使用。
此外,本文将一些实施方案描述为可以由计算机系统的处理器或通过执行该功能的其他手段来实现的方法或方法的要素的组合。因此,具有执行这种方法或方法的要素的必要指令的处理器构成用于执行方法或方法的要素的手段。此外,本文描述的装置实施方案的元件是为了实施本发明,用于执行由元件行使的功能的装置的实例。
在本文提供的说明书和附图中,列出了许多具体的细节。然而,应当理解,本发明的实施方案可以在没有这些具体细节的情况下实施。在其他情况下,公知的方法、结构和技术还没有被详细地示出,以避免混淆对本说明书的理解。
现在将通过对本发明的几个实施方案的详细描述来描述本发明。显然,在不偏离本发明的真正精神或技术教导的情况下,可以根据本领域技术人员的知识来设置本发明的其他实施方案,本发明仅由所附权利要求的条款来限制。
实施例1:用于生产2’-岩藻糖基乳糖的大肠杆菌(E.coli)BL21(DE3)菌株的代谢改造
将大肠杆菌BL21(DE3)(Novagen)用作亲本菌株,用于构建用于生产2’-FL的宿主菌株。亲本菌株的遗传改造包括基因破坏和缺失事件以及异源基因的整合。
由于2’-岩藻糖基乳糖是由乳糖(其应用于细菌培养)和GDP-L-岩藻糖(其是由活细胞产生)合成的,首先通过使用错配寡核苷酸的诱变,使编码内源性β-半乳糖苷酶的lacZ基因的野生型拷贝失活(Ellis et al.,“High efficiency mutagenesis,repair,andengineering of chromosomal DNA using single-stranded oligonucleotides”,Proc.Natl.Acad.Sci.USA 98:6742-6746(2001))。使用同样的方法,破坏了阿拉伯糖-异构酶araA的基因。
在温度敏感的转录抑制子cl857的控制下,引入了lacZΩ基因片段。在菌株中大肠杆菌BL21(DE3)PgbA启动子的控制下表达了lacZα片段基因,显示为LacZ+菌株。
基因组缺失按照Datsenko和Warner的方法通过λRed介导的重组进行(“One-stepinactivation of chromosomal genes in Escherichia coli K-12 using PCRproducts”,Proc.Natl.Acad.Sci.USA 97:6640-6645(2000))。为了防止L-岩藻糖的降解,分别缺失了编码L-岩藻糖异构酶和L-墨角藻糖激酶的基因fucl和fucK。还缺失了基因wzxC-wcaJ。WcaJ可能编码UDP-葡萄糖:
十一碳二烯磷酸葡萄糖-1-磷酸转移酶,其催化荚膜异多糖酸(colanic acid)合成的第一步(Stevenson et al.,“Organization of the Escherichia coli K-12 genecluster responsible for production of the extracellular polysaccharidecolonic acid”,J.Bacteriol.178:4885-4893;(1996));产生的荚膜异多糖酸将与岩藻糖基转移酶反应竞争GDP-岩藻糖。
通过转座进行异源基因的基因组整合。大的基因簇被整合到由水手(mariner)转座酶Himar1的超活性C9突变体介导的基因组中(Lampe et al.,“Hyperactivetransposase mutants of the Himar1 mariner transposon”,Proc.Natl.Acad.Sci.USA96:11428-11433(1999)),其在Para启动子的转录控制下被插入到质粒pEcomar中。为增强GDP-岩藻糖的从头合成,编码来自大肠杆菌K12 DH5α的磷酸甘露糖变位酶(manB)、甘露糖-1-磷酸鸟苷酰转移酶(mannose-1-phosphate guanosyltransferase,manC)、GDP-甘露糖-4,6-脱水酶(gmd)和GDP-L-岩藻糖合酶(wcaG)的基因在大肠杆菌BL21(DE3)菌株中过表达;将操纵子manCB置于组成型启动子Ptet的控制下,操纵子gmd、wcaG从组成型PT5启动子转录而来。将转座子盒<Ptet-manCB-PT5-gmd,wcaG-FRT-dhfr-FRT>(SEQ ID NO:13)从pEcomar C9-manCB-gmd、wcaG-dhfr插入到大肠杆菌基因组中,所述转座子盒包含对甲氧苄氨嘧啶抗性的二氢叶酸还原酶的基因,侧翼为被水手状元件Himar1转座酶特异性识别的反向末端重复序列。
对于单个基因的染色体整合,使用EZ-Tn5TM转座酶(Epicentre,USA)。为了产生EZ-Tn5转座体,用引物一起扩增目的基因和侧翼为FRT位点的抗生素抗性盒,所述引物在两个位点上携带EZ-Tn5转座酶的19-bp嵌合端识别位点(5’-CTGTCTCTTATACACATCT,SEQ IDNO:21)。使用EZ-Tn5TM转座酶,将来自大肠杆菌K12 TG1的乳糖内向转运蛋白(importer)LacY的基因(登录号ABN72583)、来自大肠杆菌O126的2-岩藻糖基转移酶基因wbgL(登录号ADN43847)和编码来自Yersinia bercovieri ATCC 43970的主要易化子超家族的糖外排转运蛋白的基因yberc0001_9420(登录号EEQ08298)使用各自的整合盒:<Ptet-lacY-FRT-aadA-FRT>(SEQ ID NO:14)、<Ptet-wbgLco-FRT-neo-FRT>(SEQ ID NO:15)和<Ptet-yberc0001_9420co-FRT-cat-FRT>(SEQ ID NO:16)整合。基因wbgL和yberc0001_9420由GenScript公司(USA)合成,并进行密码子优化(co)。在成功整合lacY基因后,通过在质粒pCP20上编码的FLP重组酶从链霉素抗性克隆中消除抗性基因(Datsenko and Warner,“One-step inactivation of chromosomal genes in Escherichia coli K-12 usingPCR products”,Proc.Natl.Acad.Sci.USA 97:6640-6645(2000))。
由于大肠杆菌BL21(DE3)缺乏功能性gal-操纵子,因此将来自大肠杆菌K的galETKM操纵子的天然调控拷贝用整合盒<Pgal-galE-galT-galK-galM>(SEQ ID NO:17)通过EZ-转座整合到B菌株中。从含有1%半乳糖的MacConkey-agar中选择整合体,为红色菌落。所得菌株能够代谢源自乳糖水解的单糖葡萄糖和半乳糖。
通过缺失编码磷酸果糖激酶A的pfkA基因,实现了关于大肠杆菌菌株合成2’-岩藻糖基乳糖的进一步改善。当在葡萄糖异生作用底物(如甘油)上培养大肠杆菌时,PfkA对果糖-6-磷酸的磷酸化是高度消耗ATP的平板(treadmill)反应,此外,它还与ManA竞争底物。根据Datsenko和Wanner(2000),使用侧翼为lox71/66位点的庆大霉素抗性盒(aacC1),通过同源重组缺失pfkA基因(Lambert,JM et al.(2007)Cre-lox-based system for multiplegene deletions and selectable-marker removal in Lactobacillus plantarum.Appl.Environ.Microbiol 73:1126-1135)。成功缺失pfkA基因后,使用在pKD46(Datsenko和Wanner,2000)底盘(chassis)中的Para启动子控制下克隆的Cre重组酶(Abremski,K et al.(1983)Studies on the properties of P1 site-specific recombination:evidencefor topologically unlinked products following recombination.Cell 32:1301-1311)从大肠杆菌基因组中去除抗生素抗性基因。
对于不同的岩藻糖基转移酶,除了转移酶活性外,还显示了GDP-L-岩藻糖水解酶活性。此外,对于wbgL,本文用于2’-岩藻糖基乳糖合成的α-1,2-岩藻糖基转移酶显示了这种水解活性(见EP 3 050 973 A1)。为了挽救用于2’-岩藻糖基乳糖生产的游离L-岩藻糖,并消除来自发酵液的污染L-岩藻糖,将编码脆弱拟杆菌(Bacteroides fragilis)的双功能L-岩藻糖激酶/L-岩藻糖1-磷酸鸟苷基转移酶的fkp基因,在Ptet启动子的转录控制下,与侧翼为lox71/66的aacC1基因一起,使用EZ-Tn5TM转座酶<Ptet-fkp-lox-aacC1-lox>(SEQ IDNO:18)通过转座进行染色体整合。在成功整合后,从上述基因组中去除庆大霉素抗性基因。
为了提高代谢碳源甘油通过从丙糖-磷酸到果糖-6-磷酸的糖异生途径来供给GDP-L-岩藻糖生物合成的通量,将编码来自豌豆(Pisum sativum)的果糖-1,6-二磷酸醛缩酶(fbaB)和异源果糖-1,6-二磷酸磷酸酶(fbpase)的基因过表达。将大肠杆菌BL21(DE3)的fbaB基因与Ptet启动子融合。由于硫氧还蛋白的还原作用,豌豆叶绿体FBPase的活性受到二硫化物-二巯基化物交换的别构调控。半胱氨酸残基153与丝氨酸的交换产生组成型活性酶。购买编码来自豌豆的叶绿体FBPase的基因(登录号AAD10213),进行密码子优化以在大肠杆菌中表达,N端用六聚组氨酸标记进行标记,并进行修饰以编码来自Genescript的酶的C153S变体。从T7启动子转录fbpase基因。盒<Ptet-fbaB-PT7-His6-fbpase-lox-aacC1-lox>(SEQ ID NO:19)用于EZ-Tn5TM转座酶介导的宿主菌株中的整合。从大肠杆菌基因组中去除庆大霉素抗性基因后,该菌株用于2’-岩藻糖基乳糖生产。随后,该菌株被命名为“菌株A”。
实施例2:改造大肠杆菌BL21(DE3)菌株以生产高纯度的2’-岩藻糖基乳糖
用菌株A进行2’-岩藻糖基乳糖生产的分批补料培养显示,在发酵液中存在副产物(3-岩藻糖基乳糖和2’3-二岩藻糖基乳糖)。为了最大限度地减少这些副产物的产生并提高碳产量,将α-1,3-岩藻糖苷酶亚克隆到组成型启动子后面,并整合到菌株A的基因组中。因此,两岐双歧杆菌的afcB基因(登录号AB474964)与组成型Pand启动子和庆大霉素抗性基因融合。将得到的转座子盒<Pand-afcB-lox-aacC1-lox>(SEQ ID NO:20)——其侧翼为被水手状元件Himar1转座酶特异性识别的反向末端重复序列——从pEcomar afcB-aacC1插入到大肠杆菌基因组中,产生“菌株B”。
实施例3:HPLC分析检测培养上清液中的2’-岩藻糖基乳糖
用与HPLC系统(Shimadzu,Germany)连接的折射率检测器(RID-10A)(Shimadzu,Germany)和Waters XBridge Amide柱3.5μm(250×4.6mm)(Eschborn,Germany)进行HPLC分析。等比例地用30%A:50%(v/v)ACN于ddH2O中、0.1%(v/v)NH4OH和70%B:80%(v/v)ACN于ddH2O中、0.1%(v/v)NH4OH(v/v)作为洗脱剂,在35℃下,以1.4ml·min-1的流速进行洗脱。对HPLC样品进行无菌过滤(0.22μm孔径),并在离子交换基质(Strata ABW,Phenotex)上通过固相萃取清除。将10μl的样品上样于柱上,并根据标准曲线计算2’-岩藻糖基乳糖浓度。其他糖,如L-岩藻糖和/或其他单糖、乳糖和/或其他二糖、3-岩藻糖基乳糖和/或其他三糖、2’3-二岩藻糖基乳糖和/或其他四糖以及甘油,用这些分析条件也是可检测到的。通过比较色谱图中所有峰的AUC(曲线下面积),可以确定检测到的糖的相对量。将水对照中也存在的峰排除在该计算之外。
实施例4:在发酵过程中产生2’-岩藻糖基乳糖
在33℃下在3L-发酵罐中进行发酵(New Brunswick,Edison,USA),开始于含有3g/L KH2PO4、12g/L K2HPO4、5g/L(NH4)2SO4、0.3g/L柠檬酸、2g/L MgSO4×7·H2O、0.1g/L NaCl和0.015g/L CaCl2×6·H2O,补充1g/L微量元素溶液(54.4g·L-1柠檬酸铁铵、9.8g/L MnCl2×4·H2O、1.6g/L CoCl2×6·H2O、1g/L CuCl2×2·H2O、1.9g/L H3BO3、9g/L ZnSO4×7·H2O、1.1g/L Na2MoO4×2·H2O、1.5g/L Na2SeO3、1.5g/L NiSO4×6·H2O),并含有2%(v/v)甘油作为碳源、60mM乳糖和抗生素卡那霉素(25μg/mL)的1000mL矿物盐培养基。曝气维持在3L/min。通过控制搅拌速度,使溶解氧保持在20-30%的饱和度。通过加入25%氨溶液使pH维持在7.0。用2.5%(v/v)接种物开始培养,该接种物来自在含有相同的甘油但缺乏乳糖的培养基中生长的预培养物。离开分批阶段后,以溶解氧水平上升为指示,进行甘油补料(60%(v/v),补充有2g/L MgSO4×7·H2O、0.015g/L CaCl2×6·H2O和1mL/L微量元素溶液),流速为7.0-8.0mL/h,参照起始体积。在整个培养过程中进行乳糖补料(0.66M),并直观地进行调整,以实现发酵液中恒定的乳糖供应。在发酵快要结束时停止乳糖补料,并继续培养,直到乳糖完全转化为2’-岩藻糖基乳糖。当使用实施例1中描述的菌株(菌株A)时,在接种发酵罐后约94小时,在细胞培养基中达到约150g/L的2’-岩藻糖基乳糖效价(titer)。将如实施例2中描述的基因修饰的2’-岩藻糖基乳糖生产菌株(菌株B)进行同等地培养,也产生了约150g/L的2’-岩藻糖基乳糖效价。然而,副产物的量明显低于培养菌株A后的量(表2)。尽管菌株A的培养上清液中的糖含量2’-岩藻糖基乳糖仅占94.22%,但在菌株B的培养上清液中增加了5.50%,纯度为99.72%。
表2:培养94小时后,菌株A和菌株B的培养上清液中可检测糖的相对量的定性HPLC分析(n.d.:未检测出的)。
序列表
<110> 詹尼温生物技术有限责任公司
<120> 糖苷酶在低聚糖生产中的用途
<130> CP1200849P
<160> 20
<170> PatentIn version 3.5
<210> 1
<211> 5880
<212> DNA
<213> 两岐双岐杆菌(Bifidobacterium bifidum)
<400> 1
atgaaacata gagcgatgtc atcgcgtctg atgccactgg tggcgtcctg cgcgacggtc 60
ggcatgctgc tggccggact acctgtgtcg gccgtcgcgg tcggcacgac gagagcggca 120
gcgtccgacg cctcgtcctc caccacagca accatcaccc cctccgccga taccacgttg 180
cagacatgga cgagcgagaa gaattcctca atggcgtcca agccgtacat cggcacactg 240
caagggccct cgcaaggcgt gttcggcgag aagttcgagt ccacggatgc cgcggacacc 300
accgatctga agaccggcct gctgacgttc gacctgagcg cctacgacca tgcccccgat 360
tccgcaacgt tcgagatgac gtacctcggc taccgcggca acccgacggc caccgacacc 420
gacaccatca aggtgacccc cgtcgacacc accgtgtgca ccaataacgc cacagactgc 480
ggcgcgaatg tcgcgaccgg cgcgaccaag ccgaagttca gcatcaacga ctcctcattc 540
gtcgccgagt ccaagccgtt cgagtacggt acgacggttt acacgggcga cgccatcacc 600
gtggttcccg ccaataccaa gaaggtcacc gtagatgtga ccgaaatcgt gcgccagcag 660
ttcgccgaag gcaagaaggt catcaccctg gccgtgggcg agaccaagaa gaccgaggtt 720
cgtttcgcca gttccgaagg cacgacgtcc ctgaacggcg cgaccgcaga catggctccg 780
aagctgaccg tttccgtgtc caccaaggac gatctcaagc cctccgccga caccacgttg 840
caggcatggg ccagcgagaa gaacgagaag aagaacactg cggcctatgt cggcgcgctg 900
cagccggaag gcgattacgg cgacttcggt gagaagttca agtccaccga cgtccacgat 960
gtcacagacg ccaagatggg tctgatgacg ttcgacctgt ccgattacac cgcggcgccc 1020
gagcactcca tcctcacctt gacgtatctg ggctacgccg gtgcagacaa gaccgccacg 1080
gccaccgata aggtcaaggt ggtcgctgtt gacacgtcgc ggtgcaccgg caccgctccc 1140
tgcgacacca acaatgccac gtgggcgaac cgcccggact tcgaggtgac cgataccacg 1200
aagaccgcga cgtcccatgc gttcgcttat ggatctaaga agtattccga tggcatgacc 1260
gtcgaatcgg gcaacgccaa gaaggtcctg ctcgacgtgt ccgatgtcat caaggcagag 1320
ttcgccaagt tcagcgccgg cgccaccgag aagaagatca cgctggccct gggcgagctc 1380
aacaagtccg acatgcgttt cggcagcaag gaagtcacct cgctgaccgg cgccaccgaa 1440
gccatgcagc cgaccttgtc cgtcaccaag aagccgaagg catacacgct gagcatcgaa 1500
ggcccgacca aggtcaagta ccagaagggc gaggcgttcg acaaggccgg actcgtggtc 1560
aaggccacca gcacggctga cggcacggtc aagacgctga ccgaaggcaa cggtgaggat 1620
aactacacca tcgacaccag cgctttcgat agtgccagca tcggcgtata ccctgttacc 1680
gtgaagtaca acaaggaccc cgaaatcgcc gcttcgttca acgcctatgt catcgccagt 1740
gtcgaggacg gcggagacgg cgacaccagc aaagacgact ggctgtggta caagcagccc 1800
gcgtcgcaga ccgacgccac cgccaccgcc ggcggcaatt acggcaaccc cgacaacaac 1860
cgttggcagc agaccacctt gccgttcggc aacggcaaga tcggcggcac cgtctggggc 1920
gaggtcagcc gtgaacgcgt caccttcaac gaggagacgc tgtggaccgg cggccccgga 1980
tcctcgacca gctacaacgg cggcaacaac gagaccaagg gtcagaacgg cgccacgctg 2040
cgcgcgctca acaagcagct cgcgaacggc gccgagacgg tcaatcccgg caacctgacc 2100
ggcggcgaga acgcggccga gcagggcaac tacctgaact ggggcgacat ctacctcgac 2160
tacgggttca acgatacgac cgtcaccgaa taccgccgcg acctgaacct gagcaagggc 2220
aaggccgacg tcacgttcaa gcatgacggc gtcacctaca cgcgcgaata cttcgcgtcg 2280
aaccccgaca atgtcatggt cgcccgcctc acggccagca aagccggcaa gctgaacttc 2340
aacgtcagca tgccgaccaa cacgaactac tccaagaccg gcgaaaccac gacggtcaag 2400
ggtgacacgc tcaccgtcaa gggcgctctc ggcaacaacg gcctgctgta caactcgcag 2460
atcaaggtcg tcctcgacaa cggtgagggc acgctctccg aaggctccga cggcgcttcg 2520
ctgaaggtct ccgacgcgaa ggcggtcacg ctgtacatcg ccgccgcgac ggactacaag 2580
cagaagtatc cgtcctaccg caccggcgaa accgccgccg aggtgaacac ccgcgtcgcc 2640
aaggtcgtgc aggacgccgc caacaagggc tacaccgccg tcaagaaagc gcacatcgac 2700
gatcattccg ccatctacga ccgcgtgaag atcgatttgg gccagtccgg ccacagctcc 2760
gacggcgccg tcgccaccga cgcgctgctc aaggcgtacc agagaggctc cgcaaccacc 2820
gcgcagaagc gcgagctgga gacgctggtg tacaagtacg gccgctactt gaccatcggc 2880
tcctcccgtg agaacagcca gctgcccagc aacctgcagg gcatctggtc ggtcaccgcg 2940
ggcgacaacg cccacggcaa cacgccttgg ggctccgact tccacatgaa cgtgaacctc 3000
cagatgaact actggccgac ctattcggcc aacatgggag agctcgccga gccgctcatc 3060
gagtatgtgg agggtctggt caagcccggc cgtgtgaccg ccaaggtcta cgcgggcgcg 3120
gagacgacga accccgagac cacgccgatc ggcgagggcg agggctacat ggcccacacc 3180
gagaacaccg cctacggctg gaccgcaccc ggtcaatcgt tctcgtgggg ttggagcccg 3240
gccgccgtgc cgtggatcct gcagaacgtg tacgaggcgt acgagtactc cggcgaccct 3300
gccctgcttg atcgcgtgta cgcgctgctc aaggaggaat cgcacttcta cgtcaactac 3360
atgctgcaca aggccggctc cagctccggt gaccgcctga ctaccggcgt cgcgtactcg 3420
cccgaacagg gcccgctggg caccgacggc aacacgtacg agagctcgct cgtgtggcag 3480
atgctcaacg acgccatcga ggcggccaag gccaagggag atccggacgg tctggtcggc 3540
aataccaccg actgctcggc cgacaactgg gccaagaatg acagcggcaa cttcaccgat 3600
gcgaacgcca accgttcctg gagctgcgcc aagagcctgc tcaagccgat cgaggtcggc 3660
gactccggcc agatcaagga atggtacttc gaaggtgcgc tcggcaagaa gaaggatgga 3720
tccaccatca gcggctacca ggcggacaac cagcaccgtc acatgtccca cctgctcgga 3780
ctgttccccg gtgatttgat caccatcgac aactccgagt acatggatgc ggccaagacc 3840
tcgctgaggt accgctgctt caagggcaac gtgctgcagt ccaacaccgg ctgggccatt 3900
ggccagcgca tcaattcgtg ggctcgcacc ggcgacggca acaccacgta ccagctggtc 3960
gagctgcagc tcaagaacgc gatgtatgca aacctgttcg attaccatgc gccgttccag 4020
atcgacggca acttcggcaa cacctccggt gtcgacgaaa tgctgctgca gtccaactcc 4080
accttcaccg acaccgccgg caagaagtac gtgaactaca cgaacatcct gcccgccctg 4140
cccgatgcct gggcgggcgg ctcggtgagc ggcctcgtgg cccgcggcaa cttcaccgtc 4200
ggcacgacat ggaagaacgg caaggccacc gaagtcaggc tgacctccaa caagggcaag 4260
caggcggccg tcaagatcac cgccggcggc gcccagaact acgaggtcaa gaacggtgac 4320
accgccgtga acgccaaggt cgtgaccaac gcggacggcg cctcgctgct cgtgttcgat 4380
accaccgcag gcaccacgta cacgatcacg aagaaggcga gcgccaacgt gcccgtcacc 4440
ggcgtgaccg tgaccggcgc caacaccgcc accgcaggcg acaccgtcac tcttacggct 4500
accgtcgccc cggccaatgc gaccgacaag tccgtcacct ggtcgacctc cgacgccgcc 4560
gtagctacgg tcaacgccaa cggcgtggtg accacgaaga aggccggcaa ggtgaccatc 4620
accgccacgt cgaacggcga caagacgaag ttcggttcca tcgagatcac cgtctccgcc 4680
gcgaccgtgc ccgtcaccag cgtcaccgtt gccggcgacg ccgcgatgac cgtcgatgga 4740
gagcagaccc tgacggcgac cgtcgccccg gccactgcga ccgacaagac ggtcacgtgg 4800
aagtcctccg acgccactgt ggcgacggtt gacgccaacg gcaaggtcgt cgcgaagaag 4860
gccggcgaag tgacgatcac cgccacggcc ggtggcgtgt ccggcacgct gaagatcacg 4920
gtgagcgaca aggccccgac cgtcatcccg gtccagtccg tgaccgtgac aggcaagcag 4980
gagctcgtcg aaggcgcctc cacgaccctg acggcgaccg tcgccccggc tgacgcgacc 5040
gacaagacgg ttacgtggaa gtcgagcgac gagtccgtcg ccacggtcga caaggacggc 5100
gtcgtgaccg ccaagaaggc cggcacggtg accatcaccg ccacggccgg tggcgtgtcc 5160
ggcacgctcc acatcaccgt gacggccaag cccgtcgaga ccgtccccgt caccagcgtg 5220
gaggtcaccg tcgaggccgg caccaccgtc tccgtcggca agacactcca ggccaccgcg 5280
accgtcaagc ccggcaacgc caccaacaag aaggtgacgt ggaagtcgag cgacgaatcc 5340
atcgcgacgg tcgacgccaa cggcgtcatc accgcgaaga aggccggcaa ggtcgtcatc 5400
acggccacct cgaccgacgg cacggacaag tccggcagcg tcgagatcac cgtcgtggat 5460
gagaccaagc cgacgcccga ccacaagtcc gtcaaggccg ataccggcga cgtgaccgcc 5520
ggcaagaccg gtacggtcac cgagccgaag gacgtggcgg gctggaagag ccgctccatc 5580
atcaagcaag gcaagctcgg caaggccgaa atcgccgacg gcacgctcgt gtatgcggcc 5640
ggcgacaaga ccggtgacga cagcttcgtc gtgcagtaca cgatggccga cggcacggtc 5700
atcgacgtga cctacagcgt cacggtcaag gccgccgaaa ccggcaagaa cgacggcgac 5760
ggcaagggcg acggtgtcgc gaagaccggc gccgccgtcg gcgcgctcgc cggcctcggc 5820
ttgatgctgc tcgccgtcgg agtgagcgtg gtgatgattc gccgcaagca ctccgcctga 5880
<210> 2
<211> 1959
<212> PRT
<213> 两岐双岐杆菌
<400> 2
Met Lys His Arg Ala Met Ser Ser Arg Leu Met Pro Leu Val Ala Ser
1 5 10 15
Cys Ala Thr Val Gly Met Leu Leu Ala Gly Leu Pro Val Ser Ala Val
20 25 30
Ala Val Gly Thr Thr Arg Ala Ala Ala Ser Asp Ala Ser Ser Ser Thr
35 40 45
Thr Ala Thr Ile Thr Pro Ser Ala Asp Thr Thr Leu Gln Thr Trp Thr
50 55 60
Ser Glu Lys Asn Ser Ser Met Ala Ser Lys Pro Tyr Ile Gly Thr Leu
65 70 75 80
Gln Gly Pro Ser Gln Gly Val Phe Gly Glu Lys Phe Glu Ser Thr Asp
85 90 95
Ala Ala Asp Thr Thr Asp Leu Lys Thr Gly Leu Leu Thr Phe Asp Leu
100 105 110
Ser Ala Tyr Asp His Ala Pro Asp Ser Ala Thr Phe Glu Met Thr Tyr
115 120 125
Leu Gly Tyr Arg Gly Asn Pro Thr Ala Thr Asp Thr Asp Thr Ile Lys
130 135 140
Val Thr Pro Val Asp Thr Thr Val Cys Thr Asn Asn Ala Thr Asp Cys
145 150 155 160
Gly Ala Asn Val Ala Thr Gly Ala Thr Lys Pro Lys Phe Ser Ile Asn
165 170 175
Asp Ser Ser Phe Val Ala Glu Ser Lys Pro Phe Glu Tyr Gly Thr Thr
180 185 190
Val Tyr Thr Gly Asp Ala Ile Thr Val Val Pro Ala Asn Thr Lys Lys
195 200 205
Val Thr Val Asp Val Thr Glu Ile Val Arg Gln Gln Phe Ala Glu Gly
210 215 220
Lys Lys Val Ile Thr Leu Ala Val Gly Glu Thr Lys Lys Thr Glu Val
225 230 235 240
Arg Phe Ala Ser Ser Glu Gly Thr Thr Ser Leu Asn Gly Ala Thr Ala
245 250 255
Asp Met Ala Pro Lys Leu Thr Val Ser Val Ser Thr Lys Asp Asp Leu
260 265 270
Lys Pro Ser Ala Asp Thr Thr Leu Gln Ala Trp Ala Ser Glu Lys Asn
275 280 285
Glu Lys Lys Asn Thr Ala Ala Tyr Val Gly Ala Leu Gln Pro Glu Gly
290 295 300
Asp Tyr Gly Asp Phe Gly Glu Lys Phe Lys Ser Thr Asp Val His Asp
305 310 315 320
Val Thr Asp Ala Lys Met Gly Leu Met Thr Phe Asp Leu Ser Asp Tyr
325 330 335
Thr Ala Ala Pro Glu His Ser Ile Leu Thr Leu Thr Tyr Leu Gly Tyr
340 345 350
Ala Gly Ala Asp Lys Thr Ala Thr Ala Thr Asp Lys Val Lys Val Val
355 360 365
Ala Val Asp Thr Ser Arg Cys Thr Gly Thr Ala Pro Cys Asp Thr Asn
370 375 380
Asn Ala Thr Trp Ala Asn Arg Pro Asp Phe Glu Val Thr Asp Thr Thr
385 390 395 400
Lys Thr Ala Thr Ser His Ala Phe Ala Tyr Gly Ser Lys Lys Tyr Ser
405 410 415
Asp Gly Met Thr Val Glu Ser Gly Asn Ala Lys Lys Val Leu Leu Asp
420 425 430
Val Ser Asp Val Ile Lys Ala Glu Phe Ala Lys Phe Ser Ala Gly Ala
435 440 445
Thr Glu Lys Lys Ile Thr Leu Ala Leu Gly Glu Leu Asn Lys Ser Asp
450 455 460
Met Arg Phe Gly Ser Lys Glu Val Thr Ser Leu Thr Gly Ala Thr Glu
465 470 475 480
Ala Met Gln Pro Thr Leu Ser Val Thr Lys Lys Pro Lys Ala Tyr Thr
485 490 495
Leu Ser Ile Glu Gly Pro Thr Lys Val Lys Tyr Gln Lys Gly Glu Ala
500 505 510
Phe Asp Lys Ala Gly Leu Val Val Lys Ala Thr Ser Thr Ala Asp Gly
515 520 525
Thr Val Lys Thr Leu Thr Glu Gly Asn Gly Glu Asp Asn Tyr Thr Ile
530 535 540
Asp Thr Ser Ala Phe Asp Ser Ala Ser Ile Gly Val Tyr Pro Val Thr
545 550 555 560
Val Lys Tyr Asn Lys Asp Pro Glu Ile Ala Ala Ser Phe Asn Ala Tyr
565 570 575
Val Ile Ala Ser Val Glu Asp Gly Gly Asp Gly Asp Thr Ser Lys Asp
580 585 590
Asp Trp Leu Trp Tyr Lys Gln Pro Ala Ser Gln Thr Asp Ala Thr Ala
595 600 605
Thr Ala Gly Gly Asn Tyr Gly Asn Pro Asp Asn Asn Arg Trp Gln Gln
610 615 620
Thr Thr Leu Pro Phe Gly Asn Gly Lys Ile Gly Gly Thr Val Trp Gly
625 630 635 640
Glu Val Ser Arg Glu Arg Val Thr Phe Asn Glu Glu Thr Leu Trp Thr
645 650 655
Gly Gly Pro Gly Ser Ser Thr Ser Tyr Asn Gly Gly Asn Asn Glu Thr
660 665 670
Lys Gly Gln Asn Gly Ala Thr Leu Arg Ala Leu Asn Lys Gln Leu Ala
675 680 685
Asn Gly Ala Glu Thr Val Asn Pro Gly Asn Leu Thr Gly Gly Glu Asn
690 695 700
Ala Ala Glu Gln Gly Asn Tyr Leu Asn Trp Gly Asp Ile Tyr Leu Asp
705 710 715 720
Tyr Gly Phe Asn Asp Thr Thr Val Thr Glu Tyr Arg Arg Asp Leu Asn
725 730 735
Leu Ser Lys Gly Lys Ala Asp Val Thr Phe Lys His Asp Gly Val Thr
740 745 750
Tyr Thr Arg Glu Tyr Phe Ala Ser Asn Pro Asp Asn Val Met Val Ala
755 760 765
Arg Leu Thr Ala Ser Lys Ala Gly Lys Leu Asn Phe Asn Val Ser Met
770 775 780
Pro Thr Asn Thr Asn Tyr Ser Lys Thr Gly Glu Thr Thr Thr Val Lys
785 790 795 800
Gly Asp Thr Leu Thr Val Lys Gly Ala Leu Gly Asn Asn Gly Leu Leu
805 810 815
Tyr Asn Ser Gln Ile Lys Val Val Leu Asp Asn Gly Glu Gly Thr Leu
820 825 830
Ser Glu Gly Ser Asp Gly Ala Ser Leu Lys Val Ser Asp Ala Lys Ala
835 840 845
Val Thr Leu Tyr Ile Ala Ala Ala Thr Asp Tyr Lys Gln Lys Tyr Pro
850 855 860
Ser Tyr Arg Thr Gly Glu Thr Ala Ala Glu Val Asn Thr Arg Val Ala
865 870 875 880
Lys Val Val Gln Asp Ala Ala Asn Lys Gly Tyr Thr Ala Val Lys Lys
885 890 895
Ala His Ile Asp Asp His Ser Ala Ile Tyr Asp Arg Val Lys Ile Asp
900 905 910
Leu Gly Gln Ser Gly His Ser Ser Asp Gly Ala Val Ala Thr Asp Ala
915 920 925
Leu Leu Lys Ala Tyr Gln Arg Gly Ser Ala Thr Thr Ala Gln Lys Arg
930 935 940
Glu Leu Glu Thr Leu Val Tyr Lys Tyr Gly Arg Tyr Leu Thr Ile Gly
945 950 955 960
Ser Ser Arg Glu Asn Ser Gln Leu Pro Ser Asn Leu Gln Gly Ile Trp
965 970 975
Ser Val Thr Ala Gly Asp Asn Ala His Gly Asn Thr Pro Trp Gly Ser
980 985 990
Asp Phe His Met Asn Val Asn Leu Gln Met Asn Tyr Trp Pro Thr Tyr
995 1000 1005
Ser Ala Asn Met Gly Glu Leu Ala Glu Pro Leu Ile Glu Tyr Val
1010 1015 1020
Glu Gly Leu Val Lys Pro Gly Arg Val Thr Ala Lys Val Tyr Ala
1025 1030 1035
Gly Ala Glu Thr Thr Asn Pro Glu Thr Thr Pro Ile Gly Glu Gly
1040 1045 1050
Glu Gly Tyr Met Ala His Thr Glu Asn Thr Ala Tyr Gly Trp Thr
1055 1060 1065
Ala Pro Gly Gln Ser Phe Ser Trp Gly Trp Ser Pro Ala Ala Val
1070 1075 1080
Pro Trp Ile Leu Gln Asn Val Tyr Glu Ala Tyr Glu Tyr Ser Gly
1085 1090 1095
Asp Pro Ala Leu Leu Asp Arg Val Tyr Ala Leu Leu Lys Glu Glu
1100 1105 1110
Ser His Phe Tyr Val Asn Tyr Met Leu His Lys Ala Gly Ser Ser
1115 1120 1125
Ser Gly Asp Arg Leu Thr Thr Gly Val Ala Tyr Ser Pro Glu Gln
1130 1135 1140
Gly Pro Leu Gly Thr Asp Gly Asn Thr Tyr Glu Ser Ser Leu Val
1145 1150 1155
Trp Gln Met Leu Asn Asp Ala Ile Glu Ala Ala Lys Ala Lys Gly
1160 1165 1170
Asp Pro Asp Gly Leu Val Gly Asn Thr Thr Asp Cys Ser Ala Asp
1175 1180 1185
Asn Trp Ala Lys Asn Asp Ser Gly Asn Phe Thr Asp Ala Asn Ala
1190 1195 1200
Asn Arg Ser Trp Ser Cys Ala Lys Ser Leu Leu Lys Pro Ile Glu
1205 1210 1215
Val Gly Asp Ser Gly Gln Ile Lys Glu Trp Tyr Phe Glu Gly Ala
1220 1225 1230
Leu Gly Lys Lys Lys Asp Gly Ser Thr Ile Ser Gly Tyr Gln Ala
1235 1240 1245
Asp Asn Gln His Arg His Met Ser His Leu Leu Gly Leu Phe Pro
1250 1255 1260
Gly Asp Leu Ile Thr Ile Asp Asn Ser Glu Tyr Met Asp Ala Ala
1265 1270 1275
Lys Thr Ser Leu Arg Tyr Arg Cys Phe Lys Gly Asn Val Leu Gln
1280 1285 1290
Ser Asn Thr Gly Trp Ala Ile Gly Gln Arg Ile Asn Ser Trp Ala
1295 1300 1305
Arg Thr Gly Asp Gly Asn Thr Thr Tyr Gln Leu Val Glu Leu Gln
1310 1315 1320
Leu Lys Asn Ala Met Tyr Ala Asn Leu Phe Asp Tyr His Ala Pro
1325 1330 1335
Phe Gln Ile Asp Gly Asn Phe Gly Asn Thr Ser Gly Val Asp Glu
1340 1345 1350
Met Leu Leu Gln Ser Asn Ser Thr Phe Thr Asp Thr Ala Gly Lys
1355 1360 1365
Lys Tyr Val Asn Tyr Thr Asn Ile Leu Pro Ala Leu Pro Asp Ala
1370 1375 1380
Trp Ala Gly Gly Ser Val Ser Gly Leu Val Ala Arg Gly Asn Phe
1385 1390 1395
Thr Val Gly Thr Thr Trp Lys Asn Gly Lys Ala Thr Glu Val Arg
1400 1405 1410
Leu Thr Ser Asn Lys Gly Lys Gln Ala Ala Val Lys Ile Thr Ala
1415 1420 1425
Gly Gly Ala Gln Asn Tyr Glu Val Lys Asn Gly Asp Thr Ala Val
1430 1435 1440
Asn Ala Lys Val Val Thr Asn Ala Asp Gly Ala Ser Leu Leu Val
1445 1450 1455
Phe Asp Thr Thr Ala Gly Thr Thr Tyr Thr Ile Thr Lys Lys Ala
1460 1465 1470
Ser Ala Asn Val Pro Val Thr Gly Val Thr Val Thr Gly Ala Asn
1475 1480 1485
Thr Ala Thr Ala Gly Asp Thr Val Thr Leu Thr Ala Thr Val Ala
1490 1495 1500
Pro Ala Asn Ala Thr Asp Lys Ser Val Thr Trp Ser Thr Ser Asp
1505 1510 1515
Ala Ala Val Ala Thr Val Asn Ala Asn Gly Val Val Thr Thr Lys
1520 1525 1530
Lys Ala Gly Lys Val Thr Ile Thr Ala Thr Ser Asn Gly Asp Lys
1535 1540 1545
Thr Lys Phe Gly Ser Ile Glu Ile Thr Val Ser Ala Ala Thr Val
1550 1555 1560
Pro Val Thr Ser Val Thr Val Ala Gly Asp Ala Ala Met Thr Val
1565 1570 1575
Asp Gly Glu Gln Thr Leu Thr Ala Thr Val Ala Pro Ala Thr Ala
1580 1585 1590
Thr Asp Lys Thr Val Thr Trp Lys Ser Ser Asp Ala Thr Val Ala
1595 1600 1605
Thr Val Asp Ala Asn Gly Lys Val Val Ala Lys Lys Ala Gly Glu
1610 1615 1620
Val Thr Ile Thr Ala Thr Ala Gly Gly Val Ser Gly Thr Leu Lys
1625 1630 1635
Ile Thr Val Ser Asp Lys Ala Pro Thr Val Ile Pro Val Gln Ser
1640 1645 1650
Val Thr Val Thr Gly Lys Gln Glu Leu Val Glu Gly Ala Ser Thr
1655 1660 1665
Thr Leu Thr Ala Thr Val Ala Pro Ala Asp Ala Thr Asp Lys Thr
1670 1675 1680
Val Thr Trp Lys Ser Ser Asp Glu Ser Val Ala Thr Val Asp Lys
1685 1690 1695
Asp Gly Val Val Thr Ala Lys Lys Ala Gly Thr Val Thr Ile Thr
1700 1705 1710
Ala Thr Ala Gly Gly Val Ser Gly Thr Leu His Ile Thr Val Thr
1715 1720 1725
Ala Lys Pro Val Glu Thr Val Pro Val Thr Ser Val Glu Val Thr
1730 1735 1740
Val Glu Ala Gly Thr Thr Val Ser Val Gly Lys Thr Leu Gln Ala
1745 1750 1755
Thr Ala Thr Val Lys Pro Gly Asn Ala Thr Asn Lys Lys Val Thr
1760 1765 1770
Trp Lys Ser Ser Asp Glu Ser Ile Ala Thr Val Asp Ala Asn Gly
1775 1780 1785
Val Ile Thr Ala Lys Lys Ala Gly Lys Val Val Ile Thr Ala Thr
1790 1795 1800
Ser Thr Asp Gly Thr Asp Lys Ser Gly Ser Val Glu Ile Thr Val
1805 1810 1815
Val Asp Glu Thr Lys Pro Thr Pro Asp His Lys Ser Val Lys Ala
1820 1825 1830
Asp Thr Gly Asp Val Thr Ala Gly Lys Thr Gly Thr Val Thr Glu
1835 1840 1845
Pro Lys Asp Val Ala Gly Trp Lys Ser Arg Ser Ile Ile Lys Gln
1850 1855 1860
Gly Lys Leu Gly Lys Ala Glu Ile Ala Asp Gly Thr Leu Val Tyr
1865 1870 1875
Ala Ala Gly Asp Lys Thr Gly Asp Asp Ser Phe Val Val Gln Tyr
1880 1885 1890
Thr Met Ala Asp Gly Thr Val Ile Asp Val Thr Tyr Ser Val Thr
1895 1900 1905
Val Lys Ala Ala Glu Thr Gly Lys Asn Asp Gly Asp Gly Lys Gly
1910 1915 1920
Asp Gly Val Ala Lys Thr Gly Ala Ala Val Gly Ala Leu Ala Gly
1925 1930 1935
Leu Gly Leu Met Leu Leu Ala Val Gly Val Ser Val Val Met Ile
1940 1945 1950
Arg Arg Lys His Ser Ala
1955
<210> 3
<211> 4482
<212> DNA
<213> 两岐双岐杆菌
<400> 3
atgctacaca cagcatcaag aggatgctcg cgttcgtggc tgcgcagact caccgcattg 60
atagcggtct cggcgctcgc gttcgtggca ttgccgaacg tcgcggtggc ggcggatccg 120
atggaatacc tcgatgtgtc gttcggcggc acgttcgctg cagacaccta caccacaggt 180
ggcgacgagg tggcgaaggg ccccgtgacc aagcacggca gcataccgac caagcttgac 240
ggcggcggca tcaccctcgc tggcggcacc aacggcgtga cattcacctc gaccgcgagc 300
ttcagcgaga gtgggaaggt gaacaaggga ttccgcgccg aaatggagta ccgtacgacg 360
cagacgccca gcaacctcgc cacattgttc tccgccatgg gcaacatctt cgtgcgggcg 420
aacggcagca acctcgaata cggcttctcc acgaaccctt ccggcagtac atggaacgac 480
tacacaaagt ccgtgacgct gccttccaac aatgtgaagc acatcatcca gctgacatat 540
ctgccgggag ccgacggcgc tgcctcgacg ttgcagttgt cggtggatgg cgtggccggc 600
gagaccgcca cctccgcggc cggcgagctc gcggccgtca gcgattccgt cgggaacaag 660
ttcgggatcg gctacgaggt gaaccccgct tccggcgcgg cgagccgcgg tcttgccggt 720
gacgtgttcc gcgcgcgtgt cgccgattcg gacgccccgt gggagattct tgacgcatcc 780
cagctgctgc atgtcaattt caacggcacg ttcagcggca cctcatatac cgcggcgagc 840
ggcgagcaga tgctgggctc gctggtgtcg cgctcggcca atccgtccat ctcgaactcc 900
gccgtcacgc tgggcggcgg cacggccgga ttcgatttca cgcccacgga cttcaccctc 960
ggtgacaacg aggccatcac ccgcccgctg gtcgcggagc tgcgcttcac cccgacgcag 1020
accggcgaca accagaccct gttcggcgcg ggcggcaacc tgttcctgcg ctacgagtcg 1080
aacaagctcg tgttcggcgc ctccaccaag tccggcgata attggaccga ccacaagatc 1140
gagtccgcgg ccgccacggg tgcggagcac gtcgtgtcgg tggcgtacgt gcccaataag 1200
gccggcaccg gcgcgaagct tgtcatgcgc gtggatggcg gcgacgccca gaccaaggac 1260
atcactggtc tggcttacct gaattcgagc atcaagggca aggtcggctt cggcaacgac 1320
gtgcataccg acgcgctcag ccgcggcttc gtcggctcgc tgagcgagat ccgcctggcc 1380
gaaacctccg cgaacttcac caccaacgaa ttcaagctgg tctactctca ggtcagctgc 1440
gacacgtcgg gcatcaagga ggcgaatacc ttcgacgtgg agcccgccga gtgcgaggcc 1500
gcgcttaaga ccaagctgtc caagctgcgt ccgaccgaag ggcaggccga ctacatcgac 1560
tggggtcaga tcggattcct ccattacggc atcaacacgt actacaacca ggagtggggt 1620
cacggtaacg aggatccctc ccgcatcaac ccgaccggcc tcgacaccga ccagtgggcg 1680
aagtccttcg ccgacggtgg cttcaagatg atcatggtga cggtcaagca ccatgacggt 1740
ttcgagctgt acgactcgcg gtacaacacc gagcacgact gggcaaacac cgccgtcgcc 1800
aagcgcacgg gggagaagga cctgttccgc aagattgtcg cctcggcgaa gaaatacggc 1860
ctgaaggtcg gcatctacta ttcgccggcc gattcctaca tggagaggaa gggcgtctgg 1920
ggcaacaact ccgcacgcgt cgagcgcacg atccccacgc tggtggagaa cgacgaccgc 1980
gccggcaagg tggcttccgg caaactgccc acgttcaagt acaaggccac ggattacggc 2040
gcctacatgc tcaaccagct ctatgagctg ctgactgagt acggcgacat ctccgaggtc 2100
tggttcgacg gtgcccaagg caacaccgca ggcactgagc attacgacta tggcgtgttc 2160
tacgagatga tccgccggct tcagccccag gcaattcagg ccaacgccgc atacgatgcc 2220
cgatgggtgg gcaacgagga cggctgggcc cgtcagaccg agtggagccc gcaggcggca 2280
tacaacgacg gcgtggacaa ggtgtcgctc aagcctggcc agatggcccc cgacggtaag 2340
cttggcagca tgtcgagcgt gctgtccgag atccgcagcg gcgccgccaa ccagctgcac 2400
tggtatccgg ccgaagtcga cgccaagaac cggcccggat ggttctaccg tgccagccaa 2460
tcgccggcgt ccgtagccga agtcgtgaag tactacgagc agtccacggg acgcaactcg 2520
cagtatctgc tgaacgtccc accgtccgat accggcaagc tcgccgatgc ggatgccgcg 2580
ggacttaagg ggctgggcga ggagctcgcc cgacgctacg gcaccgatct tgccctgggc 2640
aagagcgcga ccgtcgccgc gtccgcgaac gacactgcgg tagcggcccc gaagctgacc 2700
gacggttcga agctctcctc cgacaaggcc gtgggcaata cgccgacgta caccatcgat 2760
ctgggcagca ctgtcgccgt ggatgcagtg aagatctccg aggacgtgcg caatgccggc 2820
cagcagatcg aaagcgccac tctgcaggga cgagtcaatg gaacatggac gaatctggcg 2880
actatgacga cggtcgggca gcagcgcgac cttcgcttca cgtcccagaa catcgatgcc 2940
atccgtctgg tggtcaactc ctcccgcggt ccggtgcgtc tgagccgtct tgaggtgttc 3000
cacaccgaat ccgagattca gaccggcgcc cgcgcctact acatcgatcc gacggcgcag 3060
accgcgggag atggattcac gaaggacaag cccatgacgt cgatcgagca gctgcacgat 3120
gtgaccgtcg cgccaggctc cgtgatcttc gtcaaggcgg gcaccgagct gaccggggac 3180
ttcgccgtct tcggctacgg caccaaggac gagcccatca ccgtgacgac atacggcgaa 3240
agcgacaaag ccaccaccgc gagcttcgac ggcatgaccg ccgggctgac gctgaagcag 3300
gcgctgaagg cgctcggcaa ggacgacgcc ggctgggtcg tggccgattc cgccactgca 3360
ccggcctccc gcgtgtatgt cccgcaggat gagatcagcg tgcacgccca gtcgtcgcag 3420
aactccggcg cagaggcggc gagggcgctc gacggcgact cgtcgacgag ctggcactcc 3480
cagtacagcc cgaccaccgc gtctgctccg cattgggtga ctctcgatct cggcaaatcg 3540
cgtgagaacg tcgcctactt cgactacctc gcccgtatcg acggcaacaa taacggtgcc 3600
gccaaggatt acgaggtgta tgtctccgac gatcccaacg attttggagc ccctgtggcc 3660
tcgggcacgt tgaagaacgt cgcctacacg cagcgcatca agctgacccc caagaacgga 3720
cggtacgtca agttcgtcat caagaccgat tattccggat cgaacttcgg ctccgcggcg 3780
gaaatgaatg tcgagttgct gcccacggcc gtagaggagg acaaggtcgc caccccgcag 3840
aagccgacag tggacgatga tgccgataca tacaccatcc ccgacatcga gggagtcgtg 3900
tacaaggtcg acggcaaggt gttggccgct ggttccgtag tgaacgtggg cgatgaggac 3960
gtgaccgtca cggtcaccgc cgagcccgcc gacggatacc gcttcccgga tggtgtgacg 4020
tccccagtca cgtatgagct gacgttcacc aagaagggtg gcgagaagcc tccgaccgaa 4080
gtcaacaagg acaagctgca cgccacgatc accaaggctc aggcgatcga ccgttccgcc 4140
tatacggacg agtcgctcaa ggtgcttgat gacaagctcg ccgcagcgct caaggtctat 4200
gacgatgaca aggtgagcca ggatgatgtc gatgccgccg aggcggctct gtctgcggcg 4260
atcgacgcgc tgaagaccaa gccgacgacc cccggcggtg aaggtgagaa gcctggtgaa 4320
ggtgaaaagc ccggtgacgg caacaagccc ggtgacggca agaagcccgg cgacgtgatc 4380
gcaaagaccg gcgcctccac aatgggcgtt gtcttcgctg cactcgcgat ggtagcgggt 4440
gcggtcgtga cgcttgaagc caagcgtaag tccaaccggt aa 4482
<210> 4
<211> 1493
<212> PRT
<213> 两岐双岐杆菌
<400> 4
Met Leu His Thr Ala Ser Arg Gly Cys Ser Arg Ser Trp Leu Arg Arg
1 5 10 15
Leu Thr Ala Leu Ile Ala Val Ser Ala Leu Ala Phe Val Ala Leu Pro
20 25 30
Asn Val Ala Val Ala Ala Asp Pro Met Glu Tyr Leu Asp Val Ser Phe
35 40 45
Gly Gly Thr Phe Ala Ala Asp Thr Tyr Thr Thr Gly Gly Asp Glu Val
50 55 60
Ala Lys Gly Pro Val Thr Lys His Gly Ser Ile Pro Thr Lys Leu Asp
65 70 75 80
Gly Gly Gly Ile Thr Leu Ala Gly Gly Thr Asn Gly Val Thr Phe Thr
85 90 95
Ser Thr Ala Ser Phe Ser Glu Ser Gly Lys Val Asn Lys Gly Phe Arg
100 105 110
Ala Glu Met Glu Tyr Arg Thr Thr Gln Thr Pro Ser Asn Leu Ala Thr
115 120 125
Leu Phe Ser Ala Met Gly Asn Ile Phe Val Arg Ala Asn Gly Ser Asn
130 135 140
Leu Glu Tyr Gly Phe Ser Thr Asn Pro Ser Gly Ser Thr Trp Asn Asp
145 150 155 160
Tyr Thr Lys Ser Val Thr Leu Pro Ser Asn Asn Val Lys His Ile Ile
165 170 175
Gln Leu Thr Tyr Leu Pro Gly Ala Asp Gly Ala Ala Ser Thr Leu Gln
180 185 190
Leu Ser Val Asp Gly Val Ala Gly Glu Thr Ala Thr Ser Ala Ala Gly
195 200 205
Glu Leu Ala Ala Val Ser Asp Ser Val Gly Asn Lys Phe Gly Ile Gly
210 215 220
Tyr Glu Val Asn Pro Ala Ser Gly Ala Ala Ser Arg Gly Leu Ala Gly
225 230 235 240
Asp Val Phe Arg Ala Arg Val Ala Asp Ser Asp Ala Pro Trp Glu Ile
245 250 255
Leu Asp Ala Ser Gln Leu Leu His Val Asn Phe Asn Gly Thr Phe Ser
260 265 270
Gly Thr Ser Tyr Thr Ala Ala Ser Gly Glu Gln Met Leu Gly Ser Leu
275 280 285
Val Ser Arg Ser Ala Asn Pro Ser Ile Ser Asn Ser Ala Val Thr Leu
290 295 300
Gly Gly Gly Thr Ala Gly Phe Asp Phe Thr Pro Thr Asp Phe Thr Leu
305 310 315 320
Gly Asp Asn Glu Ala Ile Thr Arg Pro Leu Val Ala Glu Leu Arg Phe
325 330 335
Thr Pro Thr Gln Thr Gly Asp Asn Gln Thr Leu Phe Gly Ala Gly Gly
340 345 350
Asn Leu Phe Leu Arg Tyr Glu Ser Asn Lys Leu Val Phe Gly Ala Ser
355 360 365
Thr Lys Ser Gly Asp Asn Trp Thr Asp His Lys Ile Glu Ser Ala Ala
370 375 380
Ala Thr Gly Ala Glu His Val Val Ser Val Ala Tyr Val Pro Asn Lys
385 390 395 400
Ala Gly Thr Gly Ala Lys Leu Val Met Arg Val Asp Gly Gly Asp Ala
405 410 415
Gln Thr Lys Asp Ile Thr Gly Leu Ala Tyr Leu Asn Ser Ser Ile Lys
420 425 430
Gly Lys Val Gly Phe Gly Asn Asp Val His Thr Asp Ala Leu Ser Arg
435 440 445
Gly Phe Val Gly Ser Leu Ser Glu Ile Arg Leu Ala Glu Thr Ser Ala
450 455 460
Asn Phe Thr Thr Asn Glu Phe Lys Leu Val Tyr Ser Gln Val Ser Cys
465 470 475 480
Asp Thr Ser Gly Ile Lys Glu Ala Asn Thr Phe Asp Val Glu Pro Ala
485 490 495
Glu Cys Glu Ala Ala Leu Lys Thr Lys Leu Ser Lys Leu Arg Pro Thr
500 505 510
Glu Gly Gln Ala Asp Tyr Ile Asp Trp Gly Gln Ile Gly Phe Leu His
515 520 525
Tyr Gly Ile Asn Thr Tyr Tyr Asn Gln Glu Trp Gly His Gly Asn Glu
530 535 540
Asp Pro Ser Arg Ile Asn Pro Thr Gly Leu Asp Thr Asp Gln Trp Ala
545 550 555 560
Lys Ser Phe Ala Asp Gly Gly Phe Lys Met Ile Met Val Thr Val Lys
565 570 575
His His Asp Gly Phe Glu Leu Tyr Asp Ser Arg Tyr Asn Thr Glu His
580 585 590
Asp Trp Ala Asn Thr Ala Val Ala Lys Arg Thr Gly Glu Lys Asp Leu
595 600 605
Phe Arg Lys Ile Val Ala Ser Ala Lys Lys Tyr Gly Leu Lys Val Gly
610 615 620
Ile Tyr Tyr Ser Pro Ala Asp Ser Tyr Met Glu Arg Lys Gly Val Trp
625 630 635 640
Gly Asn Asn Ser Ala Arg Val Glu Arg Thr Ile Pro Thr Leu Val Glu
645 650 655
Asn Asp Asp Arg Ala Gly Lys Val Ala Ser Gly Lys Leu Pro Thr Phe
660 665 670
Lys Tyr Lys Ala Thr Asp Tyr Gly Ala Tyr Met Leu Asn Gln Leu Tyr
675 680 685
Glu Leu Leu Thr Glu Tyr Gly Asp Ile Ser Glu Val Trp Phe Asp Gly
690 695 700
Ala Gln Gly Asn Thr Ala Gly Thr Glu His Tyr Asp Tyr Gly Val Phe
705 710 715 720
Tyr Glu Met Ile Arg Arg Leu Gln Pro Gln Ala Ile Gln Ala Asn Ala
725 730 735
Ala Tyr Asp Ala Arg Trp Val Gly Asn Glu Asp Gly Trp Ala Arg Gln
740 745 750
Thr Glu Trp Ser Pro Gln Ala Ala Tyr Asn Asp Gly Val Asp Lys Val
755 760 765
Ser Leu Lys Pro Gly Gln Met Ala Pro Asp Gly Lys Leu Gly Ser Met
770 775 780
Ser Ser Val Leu Ser Glu Ile Arg Ser Gly Ala Ala Asn Gln Leu His
785 790 795 800
Trp Tyr Pro Ala Glu Val Asp Ala Lys Asn Arg Pro Gly Trp Phe Tyr
805 810 815
Arg Ala Ser Gln Ser Pro Ala Ser Val Ala Glu Val Val Lys Tyr Tyr
820 825 830
Glu Gln Ser Thr Gly Arg Asn Ser Gln Tyr Leu Leu Asn Val Pro Pro
835 840 845
Ser Asp Thr Gly Lys Leu Ala Asp Ala Asp Ala Ala Gly Leu Lys Gly
850 855 860
Leu Gly Glu Glu Leu Ala Arg Arg Tyr Gly Thr Asp Leu Ala Leu Gly
865 870 875 880
Lys Ser Ala Thr Val Ala Ala Ser Ala Asn Asp Thr Ala Val Ala Ala
885 890 895
Pro Lys Leu Thr Asp Gly Ser Lys Leu Ser Ser Asp Lys Ala Val Gly
900 905 910
Asn Thr Pro Thr Tyr Thr Ile Asp Leu Gly Ser Thr Val Ala Val Asp
915 920 925
Ala Val Lys Ile Ser Glu Asp Val Arg Asn Ala Gly Gln Gln Ile Glu
930 935 940
Ser Ala Thr Leu Gln Gly Arg Val Asn Gly Thr Trp Thr Asn Leu Ala
945 950 955 960
Thr Met Thr Thr Val Gly Gln Gln Arg Asp Leu Arg Phe Thr Ser Gln
965 970 975
Asn Ile Asp Ala Ile Arg Leu Val Val Asn Ser Ser Arg Gly Pro Val
980 985 990
Arg Leu Ser Arg Leu Glu Val Phe His Thr Glu Ser Glu Ile Gln Thr
995 1000 1005
Gly Ala Arg Ala Tyr Tyr Ile Asp Pro Thr Ala Gln Thr Ala Gly
1010 1015 1020
Asp Gly Phe Thr Lys Asp Lys Pro Met Thr Ser Ile Glu Gln Leu
1025 1030 1035
His Asp Val Thr Val Ala Pro Gly Ser Val Ile Phe Val Lys Ala
1040 1045 1050
Gly Thr Glu Leu Thr Gly Asp Phe Ala Val Phe Gly Tyr Gly Thr
1055 1060 1065
Lys Asp Glu Pro Ile Thr Val Thr Thr Tyr Gly Glu Ser Asp Lys
1070 1075 1080
Ala Thr Thr Ala Ser Phe Asp Gly Met Thr Ala Gly Leu Thr Leu
1085 1090 1095
Lys Gln Ala Leu Lys Ala Leu Gly Lys Asp Asp Ala Gly Trp Val
1100 1105 1110
Val Ala Asp Ser Ala Thr Ala Pro Ala Ser Arg Val Tyr Val Pro
1115 1120 1125
Gln Asp Glu Ile Ser Val His Ala Gln Ser Ser Gln Asn Ser Gly
1130 1135 1140
Ala Glu Ala Ala Arg Ala Leu Asp Gly Asp Ser Ser Thr Ser Trp
1145 1150 1155
His Ser Gln Tyr Ser Pro Thr Thr Ala Ser Ala Pro His Trp Val
1160 1165 1170
Thr Leu Asp Leu Gly Lys Ser Arg Glu Asn Val Ala Tyr Phe Asp
1175 1180 1185
Tyr Leu Ala Arg Ile Asp Gly Asn Asn Asn Gly Ala Ala Lys Asp
1190 1195 1200
Tyr Glu Val Tyr Val Ser Asp Asp Pro Asn Asp Phe Gly Ala Pro
1205 1210 1215
Val Ala Ser Gly Thr Leu Lys Asn Val Ala Tyr Thr Gln Arg Ile
1220 1225 1230
Lys Leu Thr Pro Lys Asn Gly Arg Tyr Val Lys Phe Val Ile Lys
1235 1240 1245
Thr Asp Tyr Ser Gly Ser Asn Phe Gly Ser Ala Ala Glu Met Asn
1250 1255 1260
Val Glu Leu Leu Pro Thr Ala Val Glu Glu Asp Lys Val Ala Thr
1265 1270 1275
Pro Gln Lys Pro Thr Val Asp Asp Asp Ala Asp Thr Tyr Thr Ile
1280 1285 1290
Pro Asp Ile Glu Gly Val Val Tyr Lys Val Asp Gly Lys Val Leu
1295 1300 1305
Ala Ala Gly Ser Val Val Asn Val Gly Asp Glu Asp Val Thr Val
1310 1315 1320
Thr Val Thr Ala Glu Pro Ala Asp Gly Tyr Arg Phe Pro Asp Gly
1325 1330 1335
Val Thr Ser Pro Val Thr Tyr Glu Leu Thr Phe Thr Lys Lys Gly
1340 1345 1350
Gly Glu Lys Pro Pro Thr Glu Val Asn Lys Asp Lys Leu His Ala
1355 1360 1365
Thr Ile Thr Lys Ala Gln Ala Ile Asp Arg Ser Ala Tyr Thr Asp
1370 1375 1380
Glu Ser Leu Lys Val Leu Asp Asp Lys Leu Ala Ala Ala Leu Lys
1385 1390 1395
Val Tyr Asp Asp Asp Lys Val Ser Gln Asp Asp Val Asp Ala Ala
1400 1405 1410
Glu Ala Ala Leu Ser Ala Ala Ile Asp Ala Leu Lys Thr Lys Pro
1415 1420 1425
Thr Thr Pro Gly Gly Glu Gly Glu Lys Pro Gly Glu Gly Glu Lys
1430 1435 1440
Pro Gly Asp Gly Asn Lys Pro Gly Asp Gly Lys Lys Pro Gly Asp
1445 1450 1455
Val Ile Ala Lys Thr Gly Ala Ser Thr Met Gly Val Val Phe Ala
1460 1465 1470
Ala Leu Ala Met Val Ala Gly Ala Val Val Thr Leu Glu Ala Lys
1475 1480 1485
Arg Lys Ser Asn Arg
1490
<210> 5
<211> 2094
<212> DNA
<213> 肺炎链球菌(Streptococcus pneumoniae)
<400> 5
atgaataaaa gaggtcttta ttcaaaacta ggaatttctg ttgtaggcat tagtctttta 60
atgggagtcc ccactttgat tcatgcgaat gaattaaact atggtcaact gtccatatct 120
cctatttttc aaggaggttc atatcaactg aacaataaga gtatagatat cagctctttg 180
ttattagata aattgtctgg agagagtcag acagtagtaa tgaaatttaa agcagataaa 240
ccaaactctc ttcaagcttt gtttggccta tctaatagta aagcaggctt taaaaataat 300
tacttttcaa ttttcatgag agattctggt gagataggtg tagaaataag agacgcccaa 360
gagggaataa attatttatt ttctagacca gcttcattat ggggaaagca taaaggacag 420
gcagttgaaa atacactagt atttgtatct gattctaaag ataaaacata cacaatgtat 480
gttaatggaa tagaagtgtt ctctgaaaca gttgatacat ttttgccaat ttcaaatata 540
aatggtatag ataaggcaac actaggagct gttaatcgtg aaggtaagga acattacctc 600
gcaaaaggaa gtattggtga aatcagtcta tttaacaaag caattagtga tcaggaagtt 660
tcaaatattc ccttgtcaaa tccatttcag ttaattttcc aatcaggaga ttctactcaa 720
gctaactatt ttagaatacc gacactatat acattaagta gtggaagagt tctatcaagt 780
attgatgcac gttatggtgg gactcatgat tctaaaagta agattaatat tgccacttct 840
tatagtgatg ataatgggaa aacgtggagt gagccaattt ttgctatgaa gtttaatgac 900
tatgaggagc agttagttta ctggccacga gataataaat taaagaatag tcaaattagt 960
ggaagtgctt cattcataga ttcatccatt gttgaagata aaaaatctgg gaaaacgata 1020
ttactagctg atgttatgcc tgcgggtatt ggaaataata atgcaaataa agccgactca 1080
ggttttaaag aaataaatgg tcattattat ttaaaactaa agaagaatgg agataacgat 1140
ttccgttata cagttagaga aaatggtgtc gtttatgatg aaacaactaa taaacctaca 1200
aattatacta taaatgataa gtatgaagtt ttggagggag gaaagtcttt aacagtcgaa 1260
caatattcgg ttgattttga tagtggctct ttaagagaaa ggcataatgg aaaacaggtt 1320
cctatgaatg ttttctacaa agattcgtta tttaaagtga ctcctactaa ttatatagca 1380
atgacaacta gtcagaatag aggagagagt tgggaacaat ttaagttgtt gcctccgttc 1440
ttaggagaaa aacataatgg aacttacttg tgtcctggac aaggtttagc attaaaatca 1500
agtaacagat tgatttttgc aacatatact agtggagaac taacctatct catttcggat 1560
gatagtggtc aaacatggaa gaaatcctca gcttcaattc cgtttaaaaa tgcaacagca 1620
gaagcacaaa tggttgaact gagagatggt gtgattagaa cattctttag aaccactaca 1680
ggtaagatag cttatatgac tagtagagat tctggagaaa catggtcgaa agtttcgtat 1740
attgatggaa ttcaacaaac ttcatatggc acacaagtat ctgcaattaa atactctcaa 1800
ttaattgatg gaaaagaagc agtcattttg agtacaccaa attctagaag tggccgtaag 1860
ggaggccaat tagttgtcgg tttggtcaat aaagaagatg atagtattga ttggagatac 1920
cactatgata ttgatttgcc ttcgtatggt tatgcctatt ctgcgattac agaattgcca 1980
aatcatcaca taggtgtact gtttgaaaaa tatgattcgt ggtcgagaaa tgaattgcat 2040
ttaagcaatg tagttcagta tatagatttg gaaattaatg atttaacaaa ataa 2094
<210> 6
<211> 697
<212> PRT
<213> 肺炎链球菌
<400> 6
Met Asn Lys Arg Gly Leu Tyr Ser Lys Leu Gly Ile Ser Val Val Gly
1 5 10 15
Ile Ser Leu Leu Met Gly Val Pro Thr Leu Ile His Ala Asn Glu Leu
20 25 30
Asn Tyr Gly Gln Leu Ser Ile Ser Pro Ile Phe Gln Gly Gly Ser Tyr
35 40 45
Gln Leu Asn Asn Lys Ser Ile Asp Ile Ser Ser Leu Leu Leu Asp Lys
50 55 60
Leu Ser Gly Glu Ser Gln Thr Val Val Met Lys Phe Lys Ala Asp Lys
65 70 75 80
Pro Asn Ser Leu Gln Ala Leu Phe Gly Leu Ser Asn Ser Lys Ala Gly
85 90 95
Phe Lys Asn Asn Tyr Phe Ser Ile Phe Met Arg Asp Ser Gly Glu Ile
100 105 110
Gly Val Glu Ile Arg Asp Ala Gln Lys Gly Ile Asn Tyr Leu Phe Ser
115 120 125
Arg Pro Ala Ser Leu Trp Gly Lys His Lys Gly Gln Ala Val Glu Asn
130 135 140
Thr Leu Val Phe Val Ser Asp Ser Lys Asp Lys Thr Tyr Thr Met Tyr
145 150 155 160
Val Asn Gly Ile Glu Val Phe Ser Glu Thr Val Asp Thr Phe Leu Pro
165 170 175
Ile Ser Asn Ile Asn Gly Ile Asp Lys Ala Thr Leu Gly Ala Val Asn
180 185 190
Arg Glu Gly Lys Glu His Tyr Leu Ala Lys Gly Ser Ile Asp Glu Ile
195 200 205
Ser Leu Phe Asn Lys Ala Ile Ser Asp Gln Glu Val Ser Thr Ile Pro
210 215 220
Leu Ser Asn Pro Phe Gln Leu Ile Phe Gln Ser Gly Asp Ser Thr Gln
225 230 235 240
Ala Asn Tyr Phe Arg Ile Pro Thr Leu Tyr Thr Leu Ser Ser Gly Arg
245 250 255
Val Leu Ser Ser Ile Asp Ala Arg Tyr Gly Gly Thr His Asp Ser Lys
260 265 270
Ser Lys Ile Asn Ile Ala Thr Ser Tyr Ser Asp Asp Asn Gly Lys Thr
275 280 285
Trp Ser Glu Pro Ile Phe Ala Met Lys Phe Asn Asp Tyr Glu Glu Gln
290 295 300
Leu Val Tyr Trp Pro Arg Asp Asn Lys Leu Lys Asn Ser Gln Ile Ser
305 310 315 320
Gly Ser Ala Ser Phe Ile Asp Ser Ser Ile Val Glu Asp Lys Lys Ser
325 330 335
Gly Lys Thr Ile Leu Leu Ala Asp Val Met Pro Ala Gly Ile Gly Asn
340 345 350
Asn Asn Ala Asn Lys Ala Asp Ser Gly Phe Lys Glu Ile Asn Gly His
355 360 365
Tyr Tyr Leu Lys Leu Lys Lys Asn Gly Asp Asn Asp Phe Arg Tyr Thr
370 375 380
Val Arg Glu Asn Gly Val Val Tyr Asn Glu Thr Thr Asn Lys Pro Thr
385 390 395 400
Asn Tyr Thr Ile Asn Asp Lys Tyr Glu Val Leu Glu Gly Gly Lys Ser
405 410 415
Leu Thr Val Glu Gln Tyr Ser Val Asp Phe Asp Ser Gly Ser Leu Arg
420 425 430
Glu Arg His Asn Gly Lys Gln Val Pro Met Asn Val Phe Tyr Lys Asp
435 440 445
Ser Leu Phe Lys Val Thr Pro Thr Asn Tyr Ile Ala Met Thr Thr Ser
450 455 460
Gln Asn Arg Gly Glu Ser Trp Glu Gln Phe Lys Leu Leu Pro Pro Phe
465 470 475 480
Leu Gly Glu Lys His Asn Gly Thr Tyr Leu Cys Pro Gly Gln Gly Leu
485 490 495
Ala Leu Lys Ser Ser Asn Arg Leu Ile Phe Ala Thr Tyr Thr Ser Gly
500 505 510
Glu Leu Thr Tyr Leu Ile Ser Asp Asp Ser Gly Gln Thr Trp Lys Lys
515 520 525
Ser Ser Ala Ser Ile Pro Phe Lys Asn Ala Thr Ala Glu Ala Gln Met
530 535 540
Val Glu Leu Arg Asp Gly Val Ile Arg Thr Phe Phe Arg Thr Thr Thr
545 550 555 560
Gly Lys Ile Ala Tyr Met Thr Ser Arg Asp Ser Gly Glu Thr Trp Ser
565 570 575
Lys Val Ser Tyr Ile Asp Gly Ile Gln Gln Thr Ser Tyr Gly Thr Gln
580 585 590
Val Ser Ala Ile Lys Tyr Ser Gln Leu Ile Asp Gly Lys Glu Ala Val
595 600 605
Ile Leu Ser Thr Pro Asn Ser Arg Ser Gly Arg Lys Gly Gly Gln Leu
610 615 620
Val Val Gly Leu Val Asn Lys Glu Asp Asp Ser Ile Asp Trp Lys Tyr
625 630 635 640
His Tyr Asp Ile Asp Leu Pro Ser Tyr Gly Tyr Ala Tyr Ser Ala Ile
645 650 655
Thr Glu Leu Pro Asn His His Ile Gly Val Leu Phe Glu Lys Tyr Asp
660 665 670
Ser Trp Ser Arg Asn Glu Leu His Leu Ser Asn Val Val Gln Tyr Ile
675 680 685
Asp Leu Glu Ile Asn Asp Leu Thr Lys
690 695
<210> 7
<211> 2076
<212> DNA
<213> 长双歧杆菌(Bifidobacterium longum)
<400> 7
atggaacata gagcgttcaa gtggccgcag ccacttgcgg gcaacaagcc ccgcatctgg 60
tacggcggcg attacaaccc cgaccaatgg cctgaggaag tgtgggacga agatgtagcc 120
ctcatgcagc aggccggcgt caacctcgtc tccgtagcca tcttctcctg ggccaagctt 180
gagcccgaag aaggcgtgta cgacttcgat tggctcgacc gcgtcatcga caagctcggc 240
aaggccggca tcgccgtcga tctcgcctcc ggcaccgcat ccccgccgat gtggatgacc 300
caggcccacc cggagatcct ctgggtcgac taccgcggcg acgtctgcca gcccggtgcc 360
cgccagcact ggcgcgccac cagcccggtc ttccttgact acgcgctcaa cctgtgccgc 420
aagatggccg agcactacaa ggacaacccc tatgtggtct cttggcatgt gagcaacgag 480
tacggctgcc acaaccgctt cgactattcc gaagacgccg agcgcgcctt ccagaagtgg 540
tgcgagaaga agtacggcac catcgacgct gtcaacgacg cctggggcac cgccttctgg 600
gcgcagcgca tgaacaattt ctccgagatc atcccgccgc gattcatcgg cgacggcaac 660
ttcatgaacc cgggcaagct gcttgattgg aagcgtttca gctccgacgc gctgctggac 720
ttctacaagg ccgagcgcga cgccctgctc gagatcgccc ccaagccgca gaccaccaac 780
ttcatggtct ccgcgggctg caccgtcctc gactacgaca agtggggtca tgacgtggac 840
ttcgtgtcca acgaccatta cttctcgccc ggcgaggccc acttcgacga gatggcctac 900
gcggcctgcc tcaccgacgg catcgcccgc aagaacccgt ggttcctcat ggaacattcc 960
acgtccgccg tcaactggcg cccgaccaac taccggctcg agcccggcga gctggtgcgc 1020
gactccctgg cccatctggc catgggcgcc gacgccatct gctacttcca gtggcgtcag 1080
tccaaggccg gcgccgagaa gtggcattcc gccatggtgc cccacgcagg ccccgactcc 1140
cagatcttcc gcgatgtgtg cgagctgggt gccgacctca acaagcttgc tgacgagggc 1200
ctgctgagca ccaagctggt caagtccaag gtcgccatcg tcttcgacta cgagtcccag 1260
tgggccaccg agcacaccgc cacccccacg caggaggtgc gccactggac cgagccgctg 1320
gactggttcc gcgcgctggc ggacaatggc ctgaccgccg acgtggtgcc ggtccgcggt 1380
ccttgggatg agtacgaggc cgtcgtgttg ccgagcctgg ccatcctgtc cgagcagacc 1440
acgcgccgcg tgcgcgagta tgtggcgaac ggcggcaagc tgttcgtgac ctactacacc 1500
ggtctggtgg acgacaggga tcacgtctgg ctgggcggct accccggctc cattcgcgac 1560
gtggtgggcg tgcgcgtcga ggaattcgcc ccgatgggca ccgacgcccc cggcaccatg 1620
gaccaccttg acttggacaa cggaaccgtg gcgcacgatt tcgccgacgt gatcacctcc 1680
gtggccgata ccgctcacgt ggtcgcctcc ttcaaggcag ataagtggac cggtttcgac 1740
ggcgctcccg ccatcaccgt caacgacttc ggcgacggca aggccgcata cgtcggtgcc 1800
cgtctcgggc gtgagggctt ggccaagagc ctgcccgcgc tgctggagga actcggcatc 1860
gagacttcgg ctgaggacga tcgtggtgaa gtgctgcgcg tcgagcgtgc ggacgaaact 1920
ggcgagaacc acttcgtgtt cctgttcaac cgcacccacg atgttgcggt cgtggacgtg 1980
gaaggcgaac cgctggtcgc ctcgctggcc caggtcaacg agtccgagca cacggccgcc 2040
atccagccca acggcgtact cgtcgtcaag ctgtaa 2076
<210> 8
<211> 691
<212> PRT
<213> 长双歧杆菌
<400> 8
Met Glu His Arg Ala Phe Lys Trp Pro Gln Pro Leu Ala Gly Asn Lys
1 5 10 15
Pro Arg Ile Trp Tyr Gly Gly Asp Tyr Asn Pro Asp Gln Trp Pro Glu
20 25 30
Glu Val Trp Asp Glu Asp Val Ala Leu Met Gln Gln Ala Gly Val Asn
35 40 45
Leu Val Ser Val Ala Ile Phe Ser Trp Ala Lys Leu Glu Pro Glu Glu
50 55 60
Gly Val Tyr Asp Phe Asp Trp Leu Asp Arg Val Ile Asp Lys Leu Gly
65 70 75 80
Lys Ala Gly Ile Ala Val Asp Leu Ala Ser Gly Thr Ala Ser Pro Pro
85 90 95
Met Trp Met Thr Gln Ala His Pro Glu Ile Leu Trp Val Asp Tyr Arg
100 105 110
Gly Asp Val Cys Gln Pro Gly Ala Arg Gln His Trp Arg Ala Thr Ser
115 120 125
Pro Val Phe Leu Asp Tyr Ala Leu Asn Leu Cys Arg Lys Met Ala Glu
130 135 140
His Tyr Lys Asp Asn Pro Tyr Val Val Ser Trp His Val Ser Asn Glu
145 150 155 160
Tyr Gly Cys His Asn Arg Phe Asp Tyr Ser Glu Asp Ala Glu Arg Ala
165 170 175
Phe Gln Lys Trp Cys Glu Lys Lys Tyr Gly Thr Ile Asp Ala Val Asn
180 185 190
Asp Ala Trp Gly Thr Ala Phe Trp Ala Gln Arg Met Asn Asn Phe Ser
195 200 205
Glu Ile Ile Pro Pro Arg Phe Ile Gly Asp Gly Asn Phe Met Asn Pro
210 215 220
Gly Lys Leu Leu Asp Trp Lys Arg Phe Ser Ser Asp Ala Leu Leu Asp
225 230 235 240
Phe Tyr Lys Ala Glu Arg Asp Ala Leu Leu Glu Ile Ala Pro Lys Pro
245 250 255
Gln Thr Thr Asn Phe Met Val Ser Ala Gly Cys Thr Val Leu Asp Tyr
260 265 270
Asp Lys Trp Gly His Asp Val Asp Phe Val Ser Asn Asp His Tyr Phe
275 280 285
Ser Pro Gly Glu Ala His Phe Asp Glu Met Ala Tyr Ala Ala Cys Leu
290 295 300
Thr Asp Gly Ile Ala Arg Lys Asn Pro Trp Phe Leu Met Glu His Ser
305 310 315 320
Thr Ser Ala Val Asn Trp Arg Pro Thr Asn Tyr Arg Leu Glu Pro Gly
325 330 335
Glu Leu Val Arg Asp Ser Leu Ala His Leu Ala Met Gly Ala Asp Ala
340 345 350
Ile Cys Tyr Phe Gln Trp Arg Gln Ser Lys Ala Gly Ala Glu Lys Trp
355 360 365
His Ser Ala Met Val Pro His Ala Gly Pro Asp Ser Gln Ile Phe Arg
370 375 380
Asp Val Cys Glu Leu Gly Ala Asp Leu Asn Lys Leu Ala Asp Glu Gly
385 390 395 400
Leu Leu Ser Thr Lys Leu Val Lys Ser Lys Val Ala Ile Val Phe Asp
405 410 415
Tyr Glu Ser Gln Trp Ala Thr Glu His Thr Ala Thr Pro Thr Gln Glu
420 425 430
Val Arg His Trp Thr Glu Pro Leu Asp Trp Phe Arg Ala Leu Ala Asp
435 440 445
Asn Gly Leu Thr Ala Asp Val Val Pro Val Arg Gly Pro Trp Asp Glu
450 455 460
Tyr Glu Ala Val Val Leu Pro Ser Leu Ala Ile Leu Ser Glu Gln Thr
465 470 475 480
Thr Arg Arg Val Arg Glu Tyr Val Ala Asn Gly Gly Lys Leu Phe Val
485 490 495
Thr Tyr Tyr Thr Gly Leu Val Asp Asp Arg Asp His Val Trp Leu Gly
500 505 510
Gly Tyr Pro Gly Ser Ile Arg Asp Val Val Gly Val Arg Val Glu Glu
515 520 525
Phe Ala Pro Met Gly Thr Asp Ala Pro Gly Thr Met Asp His Leu Asp
530 535 540
Leu Asp Asn Gly Thr Val Ala His Asp Phe Ala Asp Val Ile Thr Ser
545 550 555 560
Val Ala Asp Thr Ala His Val Val Ala Ser Phe Lys Ala Asp Lys Trp
565 570 575
Thr Gly Phe Asp Gly Ala Pro Ala Ile Thr Val Asn Asp Phe Gly Asp
580 585 590
Gly Lys Ala Ala Tyr Val Gly Ala Arg Leu Gly Arg Glu Gly Leu Ala
595 600 605
Lys Ser Leu Pro Ala Leu Leu Glu Glu Leu Gly Ile Glu Thr Ser Ala
610 615 620
Glu Asp Asp Arg Gly Glu Val Leu Arg Val Glu Arg Ala Asp Glu Thr
625 630 635 640
Gly Glu Asn His Phe Val Phe Leu Phe Asn Arg Thr His Asp Val Ala
645 650 655
Val Val Asp Val Glu Gly Glu Pro Leu Val Ala Ser Leu Ala Gln Val
660 665 670
Asn Glu Ser Glu His Thr Ala Ala Ile Gln Pro Asn Gly Val Leu Val
675 680 685
Val Lys Leu
690
<210> 9
<211> 1626
<212> DNA
<213> 热纤梭菌(Clostridium thermocellum)
<400> 9
atggcagaag gggttatagt caacggaact cagtttaaag acacatcggg aaatgtgata 60
catgcccatg ggggaggcat gttaaagcat ggtgactatt attactggta cggtgaatac 120
cgggacgact ccaacttgtt tttgggtgta agttgctaca ggtcaaaaga tcttgtaaac 180
tgggaataca gaggagaagt gctgagccga aattccgctc ctgaactgaa tcactgcaat 240
attgaaagac cgaaagtcat gtacaacgca tcaaccggtg aatttgtcat gtggatgcac 300
tgggagaacg gcataaacta cggtcaggca agagcagctg ttgcgtattc caaaacgccc 360
gacggcaaat tcacatacat tcgaagcttt cgtcccatgc aggataccgg cgttatggat 420
catggccttc cgggatatat gtcaagggac tgcaatgtat ttgtggacac tgacggcaag 480
ggatatttta tatccgcagc caatgagaac atggacctgc acctttatga gctgacacct 540
gactataaaa atattgcatc ccttaaggca aagctgtttg tcggacagca gagggaagca 600
ccatgcctta taaagagaaa cggctactat taccttatta cttccggttg tacaggttgg 660
aacccgaatc aggctaaata cgcatattcc aaagatttgg ccagtggctg gtcccagctt 720
tacaatcttg gtaattcaac cacctacagg tcacagccga cttttatcat tcccgttcag 780
ggaagctcgg gaaccagtta tctttatatg ggtgaccgtt gggccggtgc ctggggagga 840
aaggttaatg actcccaata tgtatggctt cccttaaact tcatatccga tacaacactt 900
gaactgccct attatgactc tgtaaagatt gatgcttctt caggaataat ttccgagtac 960
ataccggaca ctacacgcta caagctggta aacaaaaaca gcggaaaagt cctggatgtt 1020
cttgacggtt ctgtcgataa tgcagcccag atagtccaat ggaccgataa cgggtctttg 1080
agtcaacagt ggtaccttgt ggacgtgggc ggtggttata aaaagattgt aaatgtaaag 1140
agcggaagag ccttggatgt aaaagacgaa tccaaggaag acggtggagt attaatacaa 1200
tataccagca acggcggata taatcagcac tggaaattca cagacatagg tgacgggtat 1260
tacaagattt ccagccgcca ctgcggaaaa cttatagatg tgcgaaaatg gtcaacggaa 1320
gacggcggaa taattcagca gtggtccgat gccggaggaa caaatcagca ttggaagctg 1380
gtgcttgtat caagtcccga gccttcacca tcaccttctc cccaagtggt taaaggagat 1440
gtaaacggcg acttgaaagt aaattcaacg gatttttcca tgttaagaag atatttactt 1500
aaaaccattg acaattttcc gacagaaaac ggaaaacagg ctgccgattt gaacggagac 1560
ggcagaataa actcttcgga tcttacaatg ctgaaaagat acttgcttat ggaagtggat 1620
ttgtaa 1626
<210> 10
<211> 541
<212> PRT
<213> 热纤梭菌(Clostridium thermocellum)
<400> 10
Met Ala Glu Gly Val Ile Val Asn Gly Thr Gln Phe Lys Asp Thr Ser
1 5 10 15
Gly Asn Val Ile His Ala His Gly Gly Gly Met Leu Lys His Gly Asp
20 25 30
Tyr Tyr Tyr Trp Tyr Gly Glu Tyr Arg Asp Asp Ser Asn Leu Phe Leu
35 40 45
Gly Val Ser Cys Tyr Arg Ser Lys Asp Leu Val Asn Trp Glu Tyr Arg
50 55 60
Gly Glu Val Leu Ser Arg Asn Ser Ala Pro Glu Leu Asn His Cys Asn
65 70 75 80
Ile Glu Arg Pro Lys Val Met Tyr Asn Ala Ser Thr Gly Glu Phe Val
85 90 95
Met Trp Met His Trp Glu Asn Gly Ile Asn Tyr Gly Gln Ala Arg Ala
100 105 110
Ala Val Ala Tyr Ser Lys Thr Pro Asp Gly Lys Phe Thr Tyr Ile Arg
115 120 125
Ser Phe Arg Pro Met Gln Asp Thr Gly Val Met Asp His Gly Leu Pro
130 135 140
Gly Tyr Met Ser Arg Asp Cys Asn Val Phe Val Asp Thr Asp Gly Lys
145 150 155 160
Gly Tyr Phe Ile Ser Ala Ala Asn Glu Asn Met Asp Leu His Leu Tyr
165 170 175
Glu Leu Thr Pro Asp Tyr Lys Asn Ile Ala Ser Leu Lys Ala Lys Leu
180 185 190
Phe Val Gly Gln Gln Arg Glu Ala Pro Cys Leu Ile Lys Arg Asn Gly
195 200 205
Tyr Tyr Tyr Leu Ile Thr Ser Gly Cys Thr Gly Trp Asn Pro Asn Gln
210 215 220
Ala Lys Tyr Ala Tyr Ser Lys Asp Leu Ala Ser Gly Trp Ser Gln Leu
225 230 235 240
Tyr Asn Leu Gly Asn Ser Thr Thr Tyr Arg Ser Gln Pro Thr Phe Ile
245 250 255
Ile Pro Val Gln Gly Ser Ser Gly Thr Ser Tyr Leu Tyr Met Gly Asp
260 265 270
Arg Trp Ala Gly Ala Trp Gly Gly Lys Val Asn Asp Ser Gln Tyr Val
275 280 285
Trp Leu Pro Leu Asn Phe Ile Ser Asp Thr Thr Leu Glu Leu Pro Tyr
290 295 300
Tyr Asp Ser Val Lys Ile Asp Ala Ser Ser Gly Ile Ile Ser Glu Tyr
305 310 315 320
Ile Pro Asp Thr Thr Arg Tyr Lys Leu Val Asn Lys Asn Ser Gly Lys
325 330 335
Val Leu Asp Val Leu Asp Gly Ser Val Asp Asn Ala Ala Gln Ile Val
340 345 350
Gln Trp Thr Asp Asn Gly Ser Leu Ser Gln Gln Trp Tyr Leu Val Asp
355 360 365
Val Gly Gly Gly Tyr Lys Lys Ile Val Asn Val Lys Ser Gly Arg Ala
370 375 380
Leu Asp Val Lys Asp Glu Ser Lys Glu Asp Gly Gly Val Leu Ile Gln
385 390 395 400
Tyr Thr Ser Asn Gly Gly Tyr Asn Gln His Trp Lys Phe Thr Asp Ile
405 410 415
Gly Asp Gly Tyr Tyr Lys Ile Ser Ser Arg His Cys Gly Lys Leu Ile
420 425 430
Asp Val Arg Lys Trp Ser Thr Glu Asp Gly Gly Ile Ile Gln Gln Trp
435 440 445
Ser Asp Ala Gly Gly Thr Asn Gln His Trp Lys Leu Val Leu Val Ser
450 455 460
Ser Pro Glu Pro Ser Pro Ser Pro Ser Pro Gln Val Val Lys Gly Asp
465 470 475 480
Val Asn Gly Asp Leu Lys Val Asn Ser Thr Asp Phe Ser Met Leu Arg
485 490 495
Arg Tyr Leu Leu Lys Thr Ile Asp Asn Phe Pro Thr Glu Asn Gly Lys
500 505 510
Gln Ala Ala Asp Leu Asn Gly Asp Gly Arg Ile Asn Ser Ser Asp Leu
515 520 525
Thr Met Leu Lys Arg Tyr Leu Leu Met Glu Val Asp Leu
530 535 540
<210> 11
<211> 2631
<212> DNA
<213> 类芽孢杆菌属(Paenibacillus sp.)
<400> 11
atgaatcgac acgtcctgct tcatccgtat ctccaccgga aggcgttgcc tctgctcctg 60
gccttgacgc tgctgacggg catcgccctg ttcccggcct ccaccgcgca ggcggcgacg 120
accgtgacgt cgatgacgta cttctctgcc aatgacggtc ccgtcatctc caaatccggc 180
gtcgggcaag ccagctacgg tttcgtcatg ccgatcttca acggaggcgc tgcgacctgg 240
aacgatgtcg ccgatgacgt cggcgttcgc gtcaaggtcg gcggcagctg ggtcgacatt 300
gacagcgttg gcggctatgt gtacaaccag aactggggcc attggaacga cagcggcacc 360
tatggctact ggttcaccct ctccgccacg accgagctgc agctctactc caaggcgaac 420
agcagcgtca cactcaacta cacgctcgtc ttccagaatg tcaatgaaac gaccattacc 480
tcgatgacac cgacccaggg cccgcaattg accgcagggt ataccggcgg cgcaggcttc 540
acctatccgg tcttcaacaa cgatccctcc atcccgtatg cagccgtagc cggcgatctg 600
aaggtgtacg tcaagccagt cgccagcagt acctggatcg atatcgacaa caacgcggcg 660
agcggctgga tctacgacag caacttcggc cagttcaccg aaggcggcgg cggctactgg 720
ttcaccgtca ccgagtcgat caacgtcaag ctcgagtcca ggacgtcctc ggccaacgtc 780
gtctatacga tcaacttccc gcagccgacg cgcagcagct acacactctc cgcctatgac 840
ggcacgacct acagcgccga tgcgagcggc gcgatcggta tcccgctgcc gcggatcgac 900
ggcaccccgg cgatcggcag cgagctcggc aacttcgtct accagatcta ccggaacggc 960
cagtgggtcg agatgagcaa ctcggcgcag agcagcttcg tctactcggc caatggctac 1020
aacaacatgt ccgacgccaa tcaatggggc tactgggccg actacatcta cggcctctgg 1080
ttccggccga tccaggagga tatgcagatc cgcatcggct atccgctgaa tggccagtcc 1140
ggcggcagcg tcggcagcaa cttcgtcacc tatacgctga tcggcaaccc gaacgcgccg 1200
cgacccgatg tgagcgacca gggcgacgtc gagatcggca cgcccaccga tccggccatc 1260
gcaggatgga atctgtattg gcaggatgaa ttcgccggca gcgcgctcga tctgaacaag 1320
tggaactacg agaccggcta ctacatcggc aacgacccca atctgtgggg ctggggcaac 1380
gccgagatgc agcactatac gacgagcacg caaaatgtct tcgtcgctga cggcaaactc 1440
aacatccgag cgctccacga ttaccaatcg ttcccgcagg acccgaaccg ctacgcgacc 1500
tactcctccg gcaagatcaa caccaaggac aacatgtcgc tgcagtacgg ccgcgtcgat 1560
atccgcgcca agctgccgac tggcgatggc gtctggccgg cactgtggat gctgccggag 1620
gactccgtct acggcgcatg ggcggcatca ggagagatcg acatcatgga ggcgaagggc 1680
cgtctgcccg gcacgacgag cggcgcgatc cactacggcg gccaatggcc ggtcaaccgc 1740
tacctcgccg gagaatgcta cctcccgcaa ggtacgacat tcgccgacga ctttaatgtg 1800
tacacgatga tctgggaaga ggacaacatg aagtggtacg ttaacggtga gtttttcttc 1860
aaggtgacgc gcgagcagtg gtactccgtc gccgccccca acaatccgga cgcgccgttc 1920
gaccagccgt tctatctgat catgaacctg gcggtcggcg gccacttcga cggcgggcgt 1980
acgcccgacc cgtccgacat cccggcgacg atgcagatcg actacgtgcg ggtgtacaaa 2040
gagggcgcgg gcggcggtcc gggcaacccg ggcggcaacg tcgcggtgac cggcgttagc 2100
gtgaccccgg caacggcgca ggtgcaggtc ggtcagaccg tctcgctgag cgccaacgtc 2160
gcgccagcca atgcaacgaa caagcaagtg acctggtcag tcgccaatgg cagcatcgcc 2220
tcggtgagcg ccagcggcgt cgtcagtgga ctcgctgctg gcacgacgac cgtaaccgcc 2280
acgaccgcag acggcaaccg caccgcctcg gcgacgatca ccgtcgtgcc gccaccgacg 2340
acgaccgtca tcatcggcga tagcgtgcgc ggcatccgaa agaccggcga caacctgctc 2400
ttctacgtca acggcgcaac ctacgccgac ctgcactaca aggtgaacgg cggcggtcag 2460
cctaatgtcg cgatgacgca cacaggaggc ggcaactaca cctacccggt gcatggcctc 2520
caacaaggcg ataccgtcga atacttcttc acctacaacc ccggcaacgg cgcgctagac 2580
acgccttggc agacttatgt gcatggggta acacaaggtg ttgttgagta a 2631
<210> 12
<211> 876
<212> PRT
<213> 类芽孢杆菌属
<400> 12
Met Asn Arg His Val Leu Leu His Pro Tyr Leu His Arg Lys Ala Leu
1 5 10 15
Pro Leu Leu Leu Ala Leu Thr Leu Leu Thr Gly Ile Ala Leu Phe Pro
20 25 30
Ala Ser Thr Ala Gln Ala Ala Thr Thr Val Thr Ser Met Thr Tyr Phe
35 40 45
Ser Ala Asn Asp Gly Pro Val Ile Ser Lys Ser Gly Val Gly Gln Ala
50 55 60
Ser Tyr Gly Phe Val Met Pro Ile Phe Asn Gly Gly Ala Ala Thr Trp
65 70 75 80
Asn Asp Val Ala Asp Asp Val Gly Val Arg Val Lys Val Gly Gly Ser
85 90 95
Trp Val Asp Ile Asp Ser Val Gly Gly Tyr Val Tyr Asn Gln Asn Trp
100 105 110
Gly His Trp Asn Asp Ser Gly Thr Tyr Gly Tyr Trp Phe Thr Leu Ser
115 120 125
Ala Thr Thr Glu Leu Gln Leu Tyr Ser Lys Ala Asn Ser Ser Val Thr
130 135 140
Leu Asn Tyr Thr Leu Val Phe Gln Asn Val Asn Glu Thr Thr Ile Thr
145 150 155 160
Ser Met Thr Pro Thr Gln Gly Pro Gln Leu Thr Ala Gly Tyr Thr Gly
165 170 175
Gly Ala Gly Phe Thr Tyr Pro Val Phe Asn Asn Asp Pro Ser Ile Pro
180 185 190
Tyr Ala Ala Val Ala Gly Asp Leu Lys Val Tyr Val Lys Pro Val Ala
195 200 205
Ser Ser Thr Trp Ile Asp Ile Asp Asn Asn Ala Ala Ser Gly Trp Ile
210 215 220
Tyr Asp Ser Asn Phe Gly Gln Phe Thr Glu Gly Gly Gly Gly Tyr Trp
225 230 235 240
Phe Thr Val Thr Glu Ser Ile Asn Val Lys Leu Glu Ser Arg Thr Ser
245 250 255
Ser Ala Asn Val Val Tyr Thr Ile Asn Phe Pro Gln Pro Thr Arg Ser
260 265 270
Ser Tyr Thr Leu Ser Ala Tyr Asp Gly Thr Thr Tyr Ser Ala Asp Ala
275 280 285
Ser Gly Ala Ile Gly Ile Pro Leu Pro Arg Ile Asp Gly Thr Pro Ala
290 295 300
Ile Gly Ser Glu Leu Gly Asn Phe Val Tyr Gln Ile Tyr Arg Asn Gly
305 310 315 320
Gln Trp Val Glu Met Ser Asn Ser Ala Gln Ser Ser Phe Val Tyr Ser
325 330 335
Ala Asn Gly Tyr Asn Asn Met Ser Asp Ala Asn Gln Trp Gly Tyr Trp
340 345 350
Ala Asp Tyr Ile Tyr Gly Leu Trp Phe Arg Pro Ile Gln Glu Asp Met
355 360 365
Gln Ile Arg Ile Gly Tyr Pro Leu Asn Gly Gln Ser Gly Gly Ser Val
370 375 380
Gly Ser Asn Phe Val Thr Tyr Thr Leu Ile Gly Asn Pro Asn Ala Pro
385 390 395 400
Arg Pro Asp Val Ser Asp Gln Gly Asp Val Glu Ile Gly Thr Pro Thr
405 410 415
Asp Pro Ala Ile Ala Gly Trp Asn Leu Tyr Trp Gln Asp Glu Phe Ala
420 425 430
Gly Ser Ala Leu Asp Leu Asn Lys Trp Asn Tyr Glu Thr Gly Tyr Tyr
435 440 445
Ile Gly Asn Asp Pro Asn Leu Trp Gly Trp Gly Asn Ala Glu Met Gln
450 455 460
His Tyr Thr Thr Ser Thr Gln Asn Val Phe Val Ala Asp Gly Lys Leu
465 470 475 480
Asn Ile Arg Ala Leu His Asp Tyr Gln Ser Phe Pro Gln Asp Pro Asn
485 490 495
Arg Tyr Ala Thr Tyr Ser Ser Gly Lys Ile Asn Thr Lys Asp Asn Met
500 505 510
Ser Leu Gln Tyr Gly Arg Val Asp Ile Arg Ala Lys Leu Pro Thr Gly
515 520 525
Asp Gly Val Trp Pro Ala Leu Trp Met Leu Pro Glu Asp Ser Val Tyr
530 535 540
Gly Ala Trp Ala Ala Ser Gly Glu Ile Asp Ile Met Glu Ala Lys Gly
545 550 555 560
Arg Leu Pro Gly Thr Thr Ser Gly Ala Ile His Tyr Gly Gly Gln Trp
565 570 575
Pro Val Asn Arg Tyr Leu Ala Gly Glu Cys Tyr Leu Pro Gln Gly Thr
580 585 590
Thr Phe Ala Asp Asp Phe Asn Val Tyr Thr Met Ile Trp Glu Glu Asp
595 600 605
Asn Met Lys Trp Tyr Val Asn Gly Glu Phe Phe Phe Lys Val Thr Arg
610 615 620
Glu Gln Trp Tyr Ser Val Ala Ala Pro Asn Asn Pro Asp Ala Pro Phe
625 630 635 640
Asp Gln Pro Phe Tyr Leu Ile Met Asn Leu Ala Val Gly Gly His Phe
645 650 655
Asp Gly Gly Arg Thr Pro Asp Pro Ser Asp Ile Pro Ala Thr Met Gln
660 665 670
Ile Asp Tyr Val Arg Val Tyr Lys Glu Gly Ala Gly Gly Gly Pro Gly
675 680 685
Asn Pro Gly Gly Asn Val Ala Val Thr Gly Val Ser Val Thr Pro Ala
690 695 700
Thr Ala Gln Val Gln Val Gly Gln Thr Val Ser Leu Ser Ala Asn Val
705 710 715 720
Ala Pro Ala Asn Ala Thr Asn Lys Gln Val Thr Trp Ser Val Ala Asn
725 730 735
Gly Ser Ile Ala Ser Val Ser Ala Ser Gly Val Val Ser Gly Leu Ala
740 745 750
Ala Gly Thr Thr Thr Val Thr Ala Thr Thr Ala Asp Gly Asn Arg Thr
755 760 765
Ala Ser Ala Thr Ile Thr Val Val Pro Pro Pro Thr Thr Thr Val Ile
770 775 780
Ile Gly Asp Ser Val Arg Gly Ile Arg Lys Thr Gly Asp Asn Leu Leu
785 790 795 800
Phe Tyr Val Asn Gly Ala Thr Tyr Ala Asp Leu His Tyr Lys Val Asn
805 810 815
Gly Gly Gly Gln Pro Asn Val Ala Met Thr His Thr Gly Gly Gly Asn
820 825 830
Tyr Thr Tyr Pro Val His Gly Leu Gln Gln Gly Asp Thr Val Glu Tyr
835 840 845
Phe Phe Thr Tyr Asn Pro Gly Asn Gly Ala Leu Asp Thr Pro Trp Gln
850 855 860
Thr Tyr Val His Gly Val Thr Gln Gly Val Val Glu
865 870 875
<210> 13
<211> 6783
<212> DNA
<213> 人工序列
<220>
<223> 转座子盒
<400> 13
gccagatgat taattcctaa tttttgttga cactctatca ttgatagagt tattttacca 60
ctccctatca gtgatagaga aaagtgaaat gaatagttcg acaaaaatct agaaataatt 120
ttgtttaact ttaagaagga gatatacaat ttcgtcgaca cacaggaaac atattaaaaa 180
ttaaaacctg caggagtttg aaggagatag aaccatggcg cagtcgaaac tctatccagt 240
tgtgatggca ggtggctccg gtagccgctt atggccgctt tcccgcgtac tttatcccaa 300
gcagttttta tgcctgaaag gcgatctcac catgctgcaa accaccatct gccgcctgaa 360
cggcgtggag tgcgaaagcc cggtggtgat ttgcaatgag cagcaccgct ttattgtcgc 420
ggaacagctg cgtcaactga acaaacttac cgagaacatt attctcgaac cggcagggcg 480
aaacacggca cctgccattg cgctggcggc gctggcggca aaacgtcata gcccggagag 540
cgacccgtta atgctggtat tggcggcgga tcatgtgatt gccgatgaag acgcgttccg 600
tgccgccgtg cgtaatgcca tgccatatgc cgaagcgggc aagctggtga ccttcggcat 660
tgtgccggat ctaccagaaa ccggttatgg ctatattcgt cgcggtgaag tgtctgcggg 720
tgagcaggat atggtggcct ttgaagtggc gcagtttgtc gaaaaaccga atctggaaac 780
cgctcaggcc tatgtggcaa gcggcgaata ttactggaac agcggtatgt tcctgttccg 840
cgccggacgc tatctcgaag aactgaaaaa atatcgcccg gatatcctcg atgcctgtga 900
aaaagcgatg agcgccgtcg atccggatct caattttatt cgcgtggatg aagaagcgtt 960
tctcgcctgc ccggaagagt cggtggatta cgcggtcatg gaacgtacgg cagatgctgt 1020
tgtggtgccg atggatgcgg gctggagcga tgttggctcc tggtcttcat tatgggagat 1080
cagcgcccac accgccgagg gcaacgtttg ccacggcgat gtgattaatc acaaaactga 1140
aaacagctat gtgtatgctg aatctggcct ggtcaccacc gtcggggtga aagatctggt 1200
agtggtgcag accaaagatg cggtgctgat tgccgaccgt aacgcggtac aggatgtgaa 1260
aaaagtggtc gagcagatca aagccgatgg tcgccatgag catcgggtgc atcgcgaagt 1320
gtatcgtccg tggggcaaat atgactctat cgacgcgggc gaccgctacc aggtgaaacg 1380
catcaccgtg aaaccgggcg agggcttgtc ggtacagatg caccatcacc gcgcggaaca 1440
ctgggtggtt gtcgcgggaa cggcaaaagt caccattgat ggtgatatca aactgcttgg 1500
tgaaaacgag tccatttata ttccgctggg ggcgacgcat tgcctggaaa acccggggaa 1560
aattccgctc gatttaattg aagtgcgctc cggctcttat ctcgaagagg atgatgtggt 1620
gcgtttcgcg gatcgctacg gacgggtgta aacgtcgcat caggcaatga atgcgaaacc 1680
gcggtgtaaa taacgacaaa aataaaattg gccgcttcgg tcagggccaa ctattgcctg 1740
aaaaagggta acgatatgaa aaaattaacc tgctttaaag cctatgatat tcgcgggaaa 1800
ttaggcgaag aactgaatga agatatcgcc tggcgcattg gtcgcgccta tggcgaattt 1860
ctcaaaccga aaaccattgt gttaggcggt gatgtccgcc tcaccagcga aaccttaaaa 1920
ctggcgctgg cgaaaggttt acaggatgcg ggcgttgacg tgctggatat tggtatgtcc 1980
ggcaccgaag agatctattt cgccacgttc catctcggcg tggatggcgg cattgaagtt 2040
accgccagcc ataatccgat ggattataac ggcatgaagc tggttcgcga gggggctcgc 2100
ccgatcagcg gagataccgg actgcgcgac gtccagcgtc tggctgaagc caacgacttt 2160
cctcccgtcg atgaaaccaa acgcggtcgc tatcagcaaa tcaacctgcg tgacgcttac 2220
gttgatcacc tgttcggtta tatcaatgtc aaaaacctca cgccgctcaa gctggtgatc 2280
aactccggga acggcgcagc gggtccggtg gtggacgcca ttgaagcccg ctttaaagcc 2340
ctcggcgcgc ccgtggaatt aatcaaagtg cacaacacgc cggacggcaa tttccccaac 2400
ggtattccta acccactact gccggaatgc cgcgacgaca cccgcaatgc ggtcatcaaa 2460
cacggcgcgg atatgggcat tgcttttgat ggcgattttg accgctgttt cctgtttgac 2520
gaaaaagggc agtttattga gggctactac attgtcggcc tgttggcaga agcattcctc 2580
gaaaaaaatc ccggcgcgaa gatcatccac gatccacgtc tctcctggaa caccgttgat 2640
gtggtgactg ccgcaggtgg cacgccggta atgtcgaaaa ccggacacgc ctttattaaa 2700
gaacgtatgc gcaaggaaga cgccatctat ggtggcgaaa tgagcgccca ccattacttc 2760
cgtgatttcg cttactgcga cagcggcatg atcccgtggc tgctggtcgc cgaactggtg 2820
tgcctgaaag ataaaacgct gggcgaactg gtacgcgacc ggatggcggc gtttccggca 2880
agcggtgaga tcaacagcaa actggcgcaa cccgttgagg cgattaaccg cgtggaacag 2940
cattttagcc gtgaggcgct ggcggtggat cgcaccgatg gcatcagcat gacctttgcc 3000
gactggcgct ttaacctgcg cacctccaat accgaaccgg tggtgcgcct gaatgtggaa 3060
tcgcgcggtg atgtgccgct gatggaagcg cgaacgcgaa ctctgctgac gttgctgaac 3120
gagtaaaaac gcggccgcga tatcgttgta aaacgacggc cagtgcaaga atcataaaaa 3180
atttatttgc tttcaggaaa atttttctgt ataatagatt cataaatttg agagaggagt 3240
ttttgtgagc ggataacaat tccccatctt agtatattag ttaagtataa atacaccgcg 3300
gaggacgaag gagatagaac catgtcaaaa gtcgctctca tcaccggtgt aaccggacaa 3360
gacggttctt acctggcaga gtttctgctg gaaaaaggtt acgaggtgca tggtattaag 3420
cgtcgcgcat cgtcattcaa caccgagcgc gtggatcaca tttatcagga tccgcacacc 3480
tgcaacccga aattccatct gcattatggc gacctgagtg atacctctaa cctgacgcgc 3540
attttgcgtg aagtacagcc ggatgaagtg tacaacctgg gcgcaatgag ccacgttgcg 3600
gtctcttttg agtcaccaga atataccgct gacgtcgacg cgatgggtac gctgcgcctg 3660
ctggaggcga tccgcttcct cggtctggaa aagaaaactc gtttctatca ggcttccacc 3720
tctgaactgt atggtctggt gcaggaaatt ccgcagaaag agaccacgcc gttctacccg 3780
cgatctccgt atgcggtcgc caaactgtac gcctactgga tcaccgttaa ctaccgtgaa 3840
tcctacggca tgtacgcctg taacggaatt ctcttcaacc atgaatcccc gcgccgcggc 3900
gaaaccttcg ttacccgcaa aatcacccgc gcaatcgcca acatcgccca ggggctggag 3960
tcgtgcctgt acctcggcaa tatggattcc ctgcgtgact ggggccacgc caaagactac 4020
gtaaaaatgc agtggatgat gctgcagcag gaacagccgg aagatttcgt tatcgcgacc 4080
ggcgttcagt actccgtgcg tcagttcgtg gaaatggcgg cagcacagct gggcatcaaa 4140
ctgcgctttg aaggcacggg cgttgaagag aagggcattg tggtttccgt caccgggcat 4200
gacgcgccgg gcgttaaacc gggtgatgtg attatcgctg ttgacccgcg ttacttccgt 4260
ccggctgaag ttgaaacgct gctcggcgac ccgaccaaag cgcacgaaaa actgggctgg 4320
aaaccggaaa tcaccctcag agagatggtg tctgaaatgg tggctaatga cctcgaagcg 4380
gcgaaaaaac actctctgct gaaatctcac ggctacgacg tggcgatcgc gctggagtca 4440
taagcatgag taaacaacga gtttttattg ctggtcatcg cgggatggtc ggttccgcca 4500
tcaggcggca gctcgaacag cgcggtgatg tggaactggt attacgcacc cgcgacgagc 4560
tgaacctgct ggacagccgc gccgtgcatg atttctttgc cagcgaacgt attgaccagg 4620
tctatctggc ggcggcgaaa gtgggcggca ttgttgccaa caacacctat ccggcggatt 4680
tcatctacca gaacatgatg attgagagca acatcattca cgccgcgcat cagaacgacg 4740
tgaacaaact gctgtttctc ggatcgtcct gcatctaccc gaaactggca aaacagccga 4800
tggcagaaag cgagttgttg cagggcacgc tggagccgac taacgagcct tatgctattg 4860
ccaaaatcgc cgggatcaaa ctgtgcgaat catacaaccg ccagtacgga cgcgattacc 4920
gctcagtcat gccgaccaac ctgtacgggc cacacgacaa cttccacccg agtaattcgc 4980
atgtgatccc agcattgctg cgtcgcttcc acgaggcgac ggcacagaat gcgccggacg 5040
tggtggtatg gggcagcggt acaccgatgc gcgaatttct gcacgtcgat gatatggcgg 5100
cggcgagcat tcatgtcatg gagctggcgc atgaagtctg gctggagaac acccagccga 5160
tgttgtcgca cattaacgtc ggcacgggcg ttgactgcac tatccgcgag ctggcgcaaa 5220
ccatcgccaa agtggtgggt tacaaaggcc gggtggtttt tgatgccagc aaaccggatg 5280
gcacgccgcg caaactgctg gatgtgacgc gcctgcatca gcttggctgg tatcacgaaa 5340
tctcactgga agcggggctt gccagcactt accagtggtt ccttgagaat caagaccgct 5400
ttcggggggg gagctaacgc gccatttaaa tcaacctcag cggtcatagc tgtttcctgt 5460
gactgagcaa taactagcat aaccccttgg ggcctctaaa cgggtcttga ggggtttttt 5520
gctgaaacca atttgcctgg cggcagtagc gcggtggtcc cacctgaccc catgccgaac 5580
tcagaagtga aacgccgtag cgccgatggt agtgtggggt ctccccatgc gagagtaggg 5640
aactgccagg catcaaataa aacgaaaggc tcagtcgaaa gactgggcct ttcgggatcc 5700
aggccggcct gttaacgaat taatcttccg cggcggtatc gataagcttg atatcgaatt 5760
ccgaagttcc tattctctag aaagtatagg aacttcaggt ctgaagagga gtttacgtcc 5820
agccaagcta gcttggctgc aggtcgtcga aattctaccg ggtaggggag gcgcttttcc 5880
caaggcagtc tggagcatgc gctttagcag ccccgctggg cacttggcgc tacacaagtg 5940
gcctctggcc tcgcacacat tccacatcca ccggtaggcg ccaaccggct ccgttctttg 6000
gtggcccctt cgcgccacct tctactcctc ccctagtcag gaagttcccc cccgccccgc 6060
agctcgcgtc gtgcaggacg tgacaaatgg aagtagcacg tctcactagt ctcgtgcaga 6120
tggacagcac cgctgagcaa tggaagcggg taggcctttg gggcagcggc caatagcagc 6180
tttgctcctt cgctttctgg gctcagaggc tgggaagggg tgggtccggg ggcgggctca 6240
ggggcgggct caggggcggg gcgggcgccc gaaggtcctc cggaggcccg gcattctgca 6300
cgcttcaaaa gcgcacgtct gccgcgctgt tctcctcttc ctcatctccg ggcctttcga 6360
cctgcagcct gttgacaatt aatcatcggc atagtatatc ggcatagtat aatacgacaa 6420
ggtgaggaac taaaccatgg gtcaaagtag cgatgaagcc aacgctcccg ttgcagggca 6480
gtttgcgctt cccctgagtg ccacctttgg cttaggggat cgcgtacgca agaaatctgg 6540
tgccgcttgg cagggtcaag tcgtcggttg gtattgcaca aaactcactc ctgaaggcta 6600
tgcggtcgag tccgaatccc acccaggctc agtgcaaatt tatcctgtgg ctgcacttga 6660
acgtgtggcc taatgagggg atcaattctc tagagctcgc tgatcagaag ttcctattct 6720
ctagaaagta taggaacttc gatggcgcct catccctgaa gccaataggg ataacagggt 6780
aat 6783
<210> 14
<211> 2851
<212> DNA
<213> 人工序列
<220>
<223> 整合盒
<400> 14
tggccagatg attaattcct aatttttgtt gacactctat cattgataga gttattttac 60
cactccctat cagtgataga gaaaagtgaa atgaatagtt cgacaaaaat ctagaaataa 120
ttttgtttaa ctttaagaag gagatataca aatgtactat ttaaaaaaca caaacttttg 180
gatgttcggt ttattctttt tcttttactt ttttatcatg ggagcctact tcccgttttt 240
cccgatttgg ctacatgaca tcaaccatat cagcaaaagt gatacgggta ttatttttgc 300
cgctatttct ctgttctcgc tattattcca accgctgttt ggtctgcttt ctgacaaact 360
cgggctgcgc aaatacctgc tgtggattat taccggcatg ttagtgatgt ttgcgccgtt 420
ctttattttt atcttcgggc cactgttaca atacaacatt ttagtaggat cgattgttgg 480
tggtatttat ctaggctttt gttttaacgc cggtgcgcca gcagtagagg catttattga 540
gaaagtcagc cgtcgcagta atttcgaatt tggtcgcgcg cggatgtttg gctgtgttgg 600
ctgggcgctg tgtgcctcga ttgtcggcat catgttcacc atcaataatc agtttgtttt 660
ctggctgggc tctggctgtg cactcatcct cgccgtttta ctctttttcg ccaaaacgga 720
tgcgccctct tctgccacgg ttgccaatgc ggtaggtgcc aaccattcgg catttagcct 780
taagctggca ctggaactgt tcagacagcc aaaactgtgg tttttgtcac tgtatgttat 840
tggcgtttcc tgcacctacg atgtttttga ccaacagttt gctaatttct ttacttcgtt 900
ctttgctacc ggtgaacagg gtacgcgggt atttggctac gtaacgacaa tgggcgaatt 960
acttaacgcc tcgattatgt tctttgcgcc actgatcatt aatcgcatcg gtgggaaaaa 1020
cgccctgctg ctggctggca ctattatgtc tgtacgtatt attggctcat cgttcgccac 1080
ctcagcgctg gaagtggtta ttctgaaaac gctgcatatg tttgaagtac cgttcctgct 1140
ggtgggctgc tttaaatata ttaccagcca gtttgaagtg cgtttttcag cgacgattta 1200
tctggtctgt ttctgcttct ttaagcaact ggcgatgatt tttatgtctg tactggcggg 1260
caatatgtat gaaagcatcg gtttccaggg cgcttatctg gtgctgggtc tggtggcgct 1320
gggcttcacc ttaatttccg tgttcacgct tagcggcccc ggcccgcttt ccctgctgcg 1380
tcgtcaggtg aatgaagtcg ctgggagcta agcggccgcg tcgacacgca aaaaggccat 1440
ccgtcaggat ggccttctgc ttaatttgat gcctggcagt ttatggcggg cgtcctgccc 1500
gccaccctcc gggccgttgc ttcgcaacgt tcaaatccgc tcccggcgga tttgtcctac 1560
tcaggagagc gttcaccgac aaacaacaga taaaacgaaa ggcccagtct ttcgactgag 1620
cctttcgttt tatttgatgc ctggcagttc cctactctcg catggggaga ccccacacta 1680
ccatcatgta tgaatatcct ccttagttcc tattccgaag ttcctattct ctagaaagta 1740
taggaacttc ggcgcgtcct acctgtgaca cgcgtgccgc agtctcacgc ccggagcgta 1800
gcgaccgagt gagctagcta tttgtttatt tttctaaata cattcaaata tgtatccgct 1860
catgagacaa taaccctgat aaatgcttca ataatattga aaaaggaaga gtatgaggga 1920
agcggtgatc gccgaagtat cgactcaact atcagaggta gttggcgtca tcgagcgcca 1980
tctcgaaccg acgttgctgg ccgtacattt gtacggctcc gcagtggatg gcggcctgaa 2040
gccacacagt gatattgatt tgctggttac ggtgaccgta aggcttgatg aaacaacgcg 2100
gcgagctttg atcaacgacc ttttggaaac ttcggcttcc cctggagaga gcgagattct 2160
ccgcgctgta gaagtcacca ttgttgtgca cgacgacatc attccgtggc gttatccagc 2220
taagcgcgaa ctgcaatttg gagaatggca gcgcaatgac attcttgcag gtatcttcga 2280
gccagccacg atcgacattg atctggctat cttgctgaca aaagcaagag aacatagcgt 2340
tgccttggta ggtccagcgg cggaggaact ctttgatccg gttcctgaac aggatctatt 2400
tgaggcgcta aatgaaacct taacgctatg gaactcgccg cccgactggg ctggcgatga 2460
gcgaaatgta gtgcttacgt tgtcccgcat ttggtacagc gcagtaaccg gcaaaatcgc 2520
gccgaaggat gtcgctgccg actgggcaat ggagcgcctg ccggcccagt atcagcccgt 2580
catacttgaa gctagacagg cttatcttgg acaagaagaa gatcgcttgg cctcgcgcgc 2640
agatcagttg gaagaatttg tccactacgt gaaaggcgag atcaccaagg tagtcggcaa 2700
ataatgtcta acaattcgtt caagccgagg ggccgcaaga tccggccacg atgacccggt 2760
cgtcgggtac cggcagggcg gggcgtaagg cgcgccattt aaatgaagtt cctattccga 2820
agttcctatt ctctagaaag tataggaact t 2851
<210> 15
<211> 2858
<212> DNA
<213> 人工序列
<220>
<223> 整合盒
<400> 15
ggccagatga ttaattccta atttttgttg acactctatc attgatagag ttattttacc 60
actccctatc agtgatagag aaaagtgaaa tgaatagttc gacaaaaatc tagaaataat 120
tttgtttaac tttaagaagg agatatacaa atgggcagca ttattcgtct gcagggtggt 180
ctgggtaatc agctgtttca gtttagcttt ggttatgccc tgagcaaaat taatggtaca 240
ccgctgtatt tcgacattag ccattatgcc gaaaacgatg atcatggtgg ttatcgtctg 300
aataatctgc agattccgga agaatatctg cagtattata ccccgaaaat taataatatt 360
tataaactgc tggtgcgtgg cagccgtctg tatccggata tttttctgtt tctgggcttt 420
tgcaacgaat ttcatgccta tggctacgat tttgaatata ttgcccagaa atggaaaagc 480
aaaaaataca ttggctactg gcagagcgaa cacttttttc ataaacatat tctggacctg 540
aaagaatttt ttattccgaa aaatgtgagc gaacaggcaa atctgctggc agcaaaaatt 600
ctggaaagcc agagcagcct gagcattcat attcgtcgtg gcgattatat taaaaacaaa 660
accgcaaccc tgacacatgg tgtttgtagc ctggaatatt ataaaaaagc cctgaacaaa 720
atccgcgatc tggcaatgat tcgtgatgtg tttatcttta gcgacgatat cttctggtgc 780
aaagaaaata ttgaaaccct gctgagcaaa aaatataata tttattatag cgaagatctg 840
agccaagaag aggatctgtg gctgatgagc ctggcaaatc atcatattat tgccaatagc 900
agctttagtt ggtggggtgc atatctgggt agcagcgcaa gccagattgt tatttatccg 960
accccgtggt atgatattac cccgaaaaac acctatatcc cgattgtgaa ccattggatc 1020
aacgttgata aacatagcag ctgctaagcg gccgcgtcga cacgcaaaaa ggccatccgt 1080
caggatggcc ttctgcttaa tttgatgcct ggcagtttat ggcgggcgtc ctgcccgcca 1140
ccctccgggc cgttgcttcg caacgttcaa atccgctccc ggcggatttg tcctactcag 1200
gagagcgttc accgacaaac aacagataaa acgaaaggcc cagtctttcg actgagcctt 1260
tcgttttatt tgatgcctgg cagttcccta ctctcgcatg gggagacccc acactaccat 1320
catgtatgaa tatcctcctt agttcctatt ccgaagttcc tattctctag aaagtatagg 1380
aacttcggcg cgtcctacct gtgacacgcg tcaagatccc ctcacgctgc cgcaagcact 1440
cagggcgcaa gggctgctaa aggaagcgga acacgtagaa agccagtccg cagaaacggt 1500
gctgaccccg gatgaatgtc agctactggg ctatctggac aagggaaaac gcaagcgcaa 1560
agagaaagca ggtagcttgc agtgggctta catggcgata gctagactgg gcggttttat 1620
ggacagcaag cgaaccggaa ttgccagctg gggcgccctc tggtaaggtt gggaagccct 1680
gcaaagtaaa ctggatggct ttcttgccgc caaggatctg atggcgcagg ggatcaagat 1740
ctgatcaaga gacaggatga ggatcgtttc gcatgattga acaagatgga ttgcacgcag 1800
gttctccggc cgcttgggtg gagaggctat tcggctatga ctgggcacaa cagacaatcg 1860
gctgctctga tgccgccgtg ttccggctgt cagcgcaggg gcgcccggtt ctttttgtca 1920
agaccgacct gtccggtgcc ctgaatgaac tgcaggacga ggcagcgcgg ctatcgtggc 1980
tggccacgac gggcgttcct tgcgcagctg tgctcgacgt tgtcactgaa gcgggaaggg 2040
actggctgct attgggcgaa gtgccggggc aggatctcct gtcatctcac cttgctcctg 2100
ccgagaaagt atccatcatg gctgatgcaa tgcggcggct gcatacgctt gatccggcta 2160
cctgcccatt cgaccaccaa gcgaaacatc gcatcgagcg agcacgtact cggatggaag 2220
ccggtcttgt cgatcaggat gatctggacg aagagcatca ggggctcgcg ccagccgaac 2280
tgttcgccag gctcaaggcg cgcatgcccg acggcgagga tctcgtcgtg acccatggcg 2340
atgcctgctt gccgaatatc atggtggaaa atggccgctt ttctggattc atcgactgtg 2400
gccggctggg tgtggcggac cgctatcagg acatagcgtt ggctacccgt gatattgctg 2460
aagagcttgg cggcgaatgg gctgaccgct tcctcgtgct ttacggtatc gccgctcccg 2520
attcgcagcg catcgccttc tatcgccttc ttgacgagtt cttctgagcg ggactctggg 2580
gttcgaaatg accgaccaag cgacgcccaa cctgccatca cgagatttcg attccaccgc 2640
cgccttctat gaaaggttgg gcttcggaat cgttttccgg gacgccggct ggatgatcct 2700
ccagcgcggg gatctcatgc tggagttctt cgcccacccc agcttcaaaa gcgctctcgg 2760
taccggcagg gcggggcgta aggcgcgcca tttaaatgaa gttcctattc cgaagttcct 2820
attctctaga aagtatagga acttcgaagc agctccag 2858
<210> 16
<211> 2631
<212> DNA
<213> 人工序列
<220>
<223> 整合盒
<400> 16
ggccagatga ttaattccta atttttgttg acactctatc attgatagag ttattttacc 60
actccctatc agtgatagag aaaagtgaaa tgaatagttc gacaaaaatc tagaaataat 120
tttgtttaac tttaagaagg agatatacaa atgaagtcgg cactgacctt ttcccgtcgc 180
atcaatccgg tgtttctggc gttctttgtc gttgcttttc tgagcggtat cgcaggcgca 240
ctgcaggctc cgaccctgag tctgtttctg tccacggaag tgaaagttcg tccgctgtgg 300
gttggtctgt tctataccgt caacgcaatc gctggcatta cggttagctt tatcctggcg 360
aaacgttcag attcgcgcgg tgaccgtcgc aagctgatta tggtgtgcta tctgatggcg 420
gttggcaact gtctgctgtt tgccttcaat cgtgattacc tgaccctgat cacggcaggt 480
gtgctgctgg cgagcgttgc caacaccgca atgccgcaga ttttcgcgct ggcccgtgaa 540
tatgccgaca gctctgcacg cgaagtggtt atgtttagtt ccatcatgcg cgctcaactg 600
agtctggcat gggtgattgg tccgccgctg tcctttatgc tggcgctgaa ttacggtttt 660
accctgatgt tctcaatcgc ggccggcatt ttcgttctgt cggccctggt cgtgtggttt 720
atcctgccga gtgtcccgcg tgcagaaccg gttgtcgatg caccggtggt tgtccagggt 780
tcactgttcg cagacaaaaa cgttctgctg ctgtttatcg cgtcgatgct gatgtggacc 840
tgcaatacga tgtatattat cgatatgccg ctgtacatta ccgcaagcct gggtctgccg 900
gaacgtctgg ctggtctgct gatgggtacc gcagctggcc tggaaattcc gatcatgctg 960
ctggcgggtt attctgtgcg ttactttggc aaacgcaaga ttatgctgtt cgctgttctg 1020
gcgggtgtcc tgttttatac cggcctggtt ctgtttaaat tcaagacggc cctgatgctg 1080
ctgcagatct ttaacgcaat tttcatcggt attgtggctg gcattggtat gctgtacttc 1140
caagatctga tgccgggtcg tgcaggtgca gcaaccacgc tgtttaccaa tagcatctct 1200
acgggtgtca ttctggcagg cgtgctgcaa ggcggtctga ccgaaacgtg gggccatgac 1260
agcgtctatg tgatggcgat ggtcctgtct attctggccc tgattatctg tgcacgtgtg 1320
cgcgaagctt aaatcgatac tagcataacc ccttggggcc tctaaacgcg tcgacacgca 1380
aaaaggccat ccgtcaggat ggccttctgc ttaatttgat gcctggcagt ttatggcggg 1440
cgtcctgccc gccaccctcc gggccgttgc ttcgcaacgt tcaaatccgc tcccggcgga 1500
tttgtcctac tcaggagagc gttcaccgac aaacaacaga taaaacgaaa ggcccagtct 1560
ttcgactgag cctttcgttt tatttgatgc ctggcagttc cctactctcg catggggaga 1620
ccccacacta ccatcatgta tgaatatcct ccttagttcc tattccgaag ttcctattct 1680
ctagaaagta taggaacttc ggcgcgtcct acctgtgacg gaagatcact tcgcagaata 1740
aataaatcct ggtgtccctg ttgataccgg gaagccctgg gccaactttt ggcgaaaatg 1800
agacgttgat cggcacgtaa gaggttccaa ctttcaccat aatgaaataa gatcactacc 1860
gggcgtattt tttgagttgt cgagattttc aggagctaag gaagctaaaa tggagaaaaa 1920
aatcactgga tataccaccg ttgatatatc ccaatggcat cgtaaagaac attttgaggc 1980
atttcagtca gttgctcaat gtacctataa ccagaccgtt cagctggata ttacggcctt 2040
tttaaagacc gtaaagaaaa ataagcacaa gttttatccg gcctttattc acattcttgc 2100
ccgcctgatg aatgctcatc cggaattacg tatggcaatg aaagacggtg agctggtgat 2160
atgggatagt gttcaccctt gttacaccgt tttccatgag caaactgaaa cgttttcatc 2220
gctctggagt gaataccacg acgatttccg gcagtttcta cacatatatt cgcaagatgt 2280
ggcgtgttac ggtgaaaacc tggcctattt ccctaaaggg tttattgaga atatgttttt 2340
cgtctcagcc aatccctggg tgagtttcac cagttttgat ttaaacgtgg ccaatatgga 2400
caacttcttc gcccccgttt tcaccatggg caaatattat acgcaaggcg acaaggtgct 2460
gatgccgctg gcgattcagg ttcatcatgc cgtttgtgat ggcttccatg tcggcagatg 2520
cttaatgaat acaacagtac tgcgatgagt ggcagggcgg ggcgtaaggc gcgccattta 2580
aatgaagttc ctattccgaa gttcctattc tctagaaagt ataggaactt c 2631
<210> 17
<211> 4259
<212> DNA
<213> 人工序列
<220>
<223> 整合盒
<400> 17
ttactcagca ataaactgat attccgtcag gctggaatac tcttcgccag gacgcaggaa 60
gcagtccggt tgcggccatt cagggtggtt cgggctgtcc ggtagaaact cgctttccag 120
agccagccct tgccagtcgg cgtaaggttc ggttccccgc gacggtgtgc cgccgaggaa 180
gttgccggag tagaattgca gagccggagc ggtggtgtag accttcagct gcaatttttc 240
atctgctgac cagacatgcg ccgccacttt cttgccatcg cctttggcct gtaacaagaa 300
tgcgtgatcg taacctttca ctttgcgctg atcgtcgtcg gcaagaaact cactggcgat 360
gattttggcg ctgcggaaat caaaagacgt tccggcgaca gatttcaggc cgtcgtgcgg 420
aatgccgcct tcatcaaccg gcagatattc gtccgccaga atctgcaact tgtgattgcg 480
cacgtcagac tgctcgccgt caagattgaa atagacgtga ttagtcatat tcaccgggca 540
aggtttatca actgtggcgc gataagtaat ggagatacgg ttatcgtcgg tcagacgata 600
ttgcaccgtc gcgccgagat tacccgggaa gccctgatca ccatcatctg aactcagggc 660
aaacagcacc tgacgatcgt tctggttcac aatctgccag cgacgtttgt cgaacccttc 720
cggcccgccg tgcagctggt taacgccctg acttggcgaa agcgtcacgg tttcaccgtc 780
aaaggtataa cggctattgg cgatacggtt ggcataacga ccaatagagg cccccagaaa 840
cgcggcctga tcctgatagc attccgggct ggcacagccg agcagcgcct cgcggacgct 900
gccatcggaa agcggaatac gggcggaaag taaagtcgca ccccagtcca tcagcgtgac 960
taccatccct gcgttgttac gcaaagttaa cagtcggtac ggctgaccat cgggtgccag 1020
tgcgggagtt tcgttcagca ctgtcctgct ccttgtgatg gtttacaaac gtaaaaagtc 1080
tctttaatac ctgtttttgc ttcatattgt tcagcgacag cttgctgtac ggcaggcacc 1140
agctcttccg ggatcagcgc gacgatacag ccgccaaatc cgccgccggt catgcgtacg 1200
ccacctttgt cgccaatcac agctttgacg atttctacca gagtgtcaat ttgcggcacg 1260
gtgatttcga aatcatcgcg catagaggca tgagactccg ccatcaactc gcccatacgt 1320
ttcaggtcgc cttgctccag cgcgctggca gcttcaacgg tgcgggcgtt ttcagtcagt 1380
atatgacgca cgcgttttgc cacgatcggg tccagttcat gcgcaacagc gttgaactct 1440
tcaatggtga catcacgcag ggctggctgc tggaagaaac gcgcaccggt ttcgcactgt 1500
tcacgacggg tgttgtattc gctgccaacc agggtacgtt tgaagttact gttgatgatg 1560
acgacagcca cacctttggg catggaaact gctttggtcc ccagtgagcg gcaatcgatc 1620
agcaaggcat gatctttctt gccgagcgcg gaaattagct gatccatgat cccgcagtta 1680
cagcctacaa actggttttc tgcttcctga ccgttaagcg cgatttgtgc gccgtccagc 1740
ggcagatgat aaagctgctg caatacggtt ccgaccgcga cttccagtga agcggaagaa 1800
cttaacccgg caccctgcgg cacattgccg ctgatcacca tgtccacgcc gccgaagctg 1860
ttgttacgca gttgcagatg tttcaccacg ccacgaacgt agttagccca ttgatagttt 1920
tcatgtgcga caatgggcgc atcgagggaa aactcgtcga gctgattttc ataatcggct 1980
gccatcacgc gaactttacg gtcatcgcgt ggtgcacaac tgatcacggt ttgataatca 2040
atcgcgcagg gcagaacgaa accgtcgttg tagtcggtgt gttcaccaat caaattcacg 2100
cggccaggcg cctgaatggt gtgagtggca gggtagccaa atgcgttggc aaacagagat 2160
tgtgtttttt ctttcagact catttcttac actccggatt cgcgaaaatg gatatcgctg 2220
actgcgcgca aacgctctgc tgcctgttct gcggtcaggt ctcgctgggt ctctgccagc 2280
atttcataac caaccataaa tttacgtacg gtggcggagc gcagcagagg cggataaaag 2340
tgcgcgtgca gctgccagtg ttgattctct tcgccattaa atggcgcgcc gtgccagccc 2400
atagagtagg ggaaggagca ctggaagagg ttgtcataac gactggtcag ctttttcaac 2460
gccagcgcca gatcgctgcg ctgggcgtcg gtcaaatcgg tgatccgtaa aacgtgggct 2520
ttgggcagca gtagcgtttc gaacggccag gcagcccagt aaggcacgac ggctaaccag 2580
tgttcggttt cgacaacggt acggctaccg tctgccagct cgcgctgaac ataatccacc 2640
agcattggtg atttctgttc ggcaaaatat tctttttgca ggcggtcttc gcgctcagct 2700
tcgttaggca ggaagctatt tgcccaaatc tgaccgtgcg gatgcgggtt agagcagccc 2760
atcgccgcgc ctttgttttc aaaaacctgc acccatgggt acgttttccc cagttctgcg 2820
gtttgctcct gccaggtttt gacgatttcc gtcaatgctg caacgctgag ctctggcagc 2880
gttttactgt gatccggtga aaagcagatc acccggctgg tgccgcgcgc gctctggcaa 2940
cgcatcagcg gatcgtgact ttctggcgca tctggcgtgt cagacatcaa agccgcaaag 3000
tcattagtga aaacgtaagt cccggtgtaa tcggggtttt tatcgcctgt cacccgcaca 3060
ttacctgcgc agaggaagca atctggatcg tgcgcaggta acacctgttt ggctggcgtt 3120
tcctgcgccc cctgccaggg gcgcttagcg cggtgcggtg aaaccagaat ccattgcccg 3180
gtgagcgggt tgtagcggcg atgtggatga tcaacgggat taaattgcgt catggtcgtt 3240
ccttaatcgg gatatccctg tggatggcgt gactgccagt gccaggtgtc ctgcgccatt 3300
tcatcgagtg tgcgcgttac gcgccagttc agttcacggt cggctttgct ggcgtccgcc 3360
cagtaggccg gaaggtcgcc ctcgcgacgc ggtgcaaaat gataattaac cggtttgccg 3420
caggctttgc tgaaggcatt aaccacgtcc agcacgctgt tgcctacgcc agcgccgagg 3480
ttgtagatgt gtacgcctgg cttgttcgcc agtttttcca tcgccacgac gtgaccgtcc 3540
gccagatcca ttacgtggat gtaatcgcgt acgccagtac catcttcggt cggataatcg 3600
ttaccaaaaa tcgccagcga gtcgcgacgg cctacagcaa cctgggcgat gtatggcatc 3660
aggttattcg gaatgccttg cggatcttcg cccatatcgc ccgacggatg cgcgccaacc 3720
gggttgaagt agcgcagcag ggcaatgctc cagtccggct gggctttttg cagatcggtg 3780
aggatctgtt ccaccatcag cttgcttttg ccgtaagggc tttgcggtgt gccggtcggg 3840
aagctttcaa cgtatggaat tttgggctga tcgccataaa cggtggcgga ggagctaaaa 3900
ataaagtttt tgacgttagc ggcgcgcatg gcgctaatca ggcgcagagt gccgttgaca 3960
ttgttgtcgt aatattccag cggtttttgt accgattcgc ccacggcttt cagcccggcg 4020
aagtggatca cggtgtcgat agcgtgatcg tgcaggatct cggtcatcaa cgcttcgtta 4080
cgaatatcgc cttcaacaaa cgttggatgt ttgccgccta aacgctcgat aacaggcagt 4140
acgctgcgct tactgttaca gaggttatca agaatgatga catcatgacc gttttgcagt 4200
aattgcacac aggtatgact tccaatgtaa ccgctaccac cggtaaccag aactctcat 4259
<210> 18
<211> 4223
<212> DNA
<213> 人工序列
<220>
<223> 整合盒
<400> 18
tggccagatg attaattcct aatttttgtt gacactctat cattgataga gttattttac 60
cactccctat cagtgataga gaaaagtgaa atgaatagtt cgacaaaaat ctagaaataa 120
ttttgtttaa ctttaagaag gagatataca aatgcaaaaa ctactatctt taccgtccaa 180
tctggttcag tcttttcatg aactggagag ggtgaatcgt accgattggt tttgtacttc 240
cgacccggta ggtaagaaac ttggttccgg tggtggaaca tcctggctgc ttgaagaatg 300
ttataatgaa tattcagatg gtgctacttt tggagagtgg cttgaaaaag aaaaaagaat 360
tcttcttcat gcgggtgggc aaagccgtcg tttacccggc tatgcacctt ctggaaagat 420
tctcactccg gttcctgtgt tccggtggga gagagggcaa catctgggac aaaatctgct 480
ttctctgcaa cttcccctat atgaaaaaat catgtctttg gctccggata aactccatac 540
actgattgcg agtggtgatg tctatattcg ttcggagaaa cctttgcaga gtattcccga 600
agcggatgtg gtttgttatg gactgtgggt agatccgtct ctggctaccc atcatggcgt 660
gtttgcttcc gatcgcaaac atcccgaaca actcgacttt atgcttcaga agccttcgtt 720
ggcagaattg gaatctttat cgaagaccca tttgttcctg atggacatcg gtatatggct 780
tttgagtgac cgtgccgtag aaatcttgat gaaacgttct cataaagaaa gctctgaaga 840
actaaagtat tatgatcttt attccgattt tggattagct ttgggaactc atccccgtat 900
tgaagacgaa gaggtcaata cgctatccgt tgctattctg cctttgccgg gaggagagtt 960
ctatcattac gggaccagta aagaactgat ttcttcaact ctttccgtac agaataaggt 1020
ttacgatcag cgtcgtatca tgcaccgtaa agtaaagccc aatccggcta tgtttgtcca 1080
aaatgctgtc gtgcggatac ctctttgtgc cgagaatgct gatttatgga tcgagaacag 1140
tcatatcgga ccaaagtgga agattgcttc acgacatatt attaccgggg ttccggaaaa 1200
tgactggtca ttggctgtgc ctgccggagt gtgtgtagat gtggttccga tgggtgataa 1260
gggctttgtt gcccgtccat acggtctgga cgatgttttc aaaggagatt tgagagattc 1320
caaaacaacc ctgacgggta ttccttttgg tgaatggatg tccaaacgcg gtttgtcata 1380
tacagatttg aaaggacgta cggacgattt acaggcagtt tccgtattcc ctatggttaa 1440
ttctgtagaa gagttgggat tggtgttgag gtggatgttg tccgaacccg aactggagga 1500
aggaaagaat atctggttac gttccgaaca tttttctgcg gacgaaattt cggcaggtgc 1560
caatctgaag cgtttgtatg cacaacgtga agagttcaga aaaggaaact ggaaagcatt 1620
ggccgttaat catgaaaaaa gtgtttttta tcaacttgat ttggccgatg cagctgaaga 1680
ttttgtacgt cttggtttgg atatgcctga attattgcct gaggatgctc tgcagatgtc 1740
acgcatccat aaccggatgt tgcgtgcgcg tattttgaaa ttagacggga aagattatcg 1800
tccggaagaa caggctgctt ttgatttgct tcgtgacggc ttgctggacg ggatcagtaa 1860
tcgtaagagt accccaaaat tggatgtata ttccgatcag attgtttggg gacgtagccc 1920
cgtgcgcatc gatatggcag gtggatggac cgatactcct ccttattcac tttattcggg 1980
aggaaatgtg gtgaatctag ccattgagtt gaacggacaa cctcccttac aggtctatgt 2040
gaagccgtgt aaagacttcc atatcgtcct gcgttctatc gatatgggtg ctatggaaat 2100
agtatctacg tttgatgaat tgcaagatta taagaagatc ggttcacctt tctctattcc 2160
gaaagccgct ctgtcattgg caggctttgc acctgcgttt tctgctgtat cttatgcttc 2220
attagaggaa cagcttaaag atttcggtgc aggtattgaa gtgactttat tggctgctat 2280
tcctgccggt tccggtttgg gcaccagttc cattctggct tctaccgtac ttggtgccat 2340
taacgatttc tgtggtttag cctgggataa aaatgagatt tgtcaacgta ctcttgttct 2400
tgaacaattg ctgactaccg gaggtggatg gcaggatcag tatggaggtg tgttgcaggg 2460
tgtgaagctt cttcagaccg aggccggctt tgctcaaagt ccattggtgc gttggctacc 2520
cgatcattta tttacgcatc ctgaatacaa agactgtcac ttgctttatt ataccggtat 2580
aactcgtacg gcaaaaggga tcttggcaga aatagtcagt tccatgttcc tcaattcatc 2640
gttgcatctc aatttacttt cggaaatgaa ggcgcatgca ttggatatga atgaagctat 2700
acagcgtgga agttttgttg agtttggccg tttggtagga aaaacctggg aacaaaacaa 2760
agcattggat agcggaacaa atcctccggc tgtggaggca attatcgatc tgataaaaga 2820
ttataccttg ggatataaat tgccgggagc cggtggtggc gggtacttat atatggtagc 2880
gaaagatccg caagctgctg ttcgtattcg taagatactg acagaaaacg ctccgaatcc 2940
gcgggcacgt tttgtcgaaa tgacgttatc tgataaggga ttccaagtat cacgatcata 3000
actgaaacca atttgcctgg cggcagtagc gcggtggtcc cacctgaccc catgccgaac 3060
tcagaagtga aacgccgtag cgccgatggt agtgtggggt ctccccatgc gagagtaggg 3120
aactgccagg catcaaataa aacgaaaggc tcagtcgaaa gactgggcct ttcgggatcc 3180
aggccggcct gttaagacgg ccagtgaatt cgagctcggt acctaccgtt cgtataatgt 3240
atgctatacg aagttatcga gctctagaga atgatcccct cattaggcca cacgttcaag 3300
tgcagcgcac accgtggaaa cggatgaagg cacgaaccca gttgacataa gcctgttcgg 3360
ttcgtaaact gtaatgcaag tagcgtatgc gctcacgcaa ctggtccaga accttgaccg 3420
aacgcagcgg tggtaacggc gcagtggcgg ttttcatggc ttgttatgac tgtttttttg 3480
tacagtctat gcctcgggca tccaagcagc aagcgcgtta cgccgtgggt cgatgtttga 3540
tgttatggag cagcaacgat gttacgcagc agcaacgatg ttacgcagca gggcagtcgc 3600
cctaaaacaa agttaggtgg ctcaagtatg ggcatcattc gcacatgtag gctcggccct 3660
gaccaagtca aatccatgcg ggctgctctt gatcttttcg gtcgtgagtt cggagacgta 3720
gccacctact cccaacatca gccggactcc gattacctcg ggaacttgct ccgtagtaag 3780
acattcatcg cgcttgctgc cttcgaccaa gaagcggttg ttggcgctct cgcggcttac 3840
gttctgccca ggtttgagca gccgcgtagt gagatctata tctatgatct cgcagtctcc 3900
ggcgagcacc ggaggcaggg cattgccacc gcgctcatca atctcctcaa gcatgaggcc 3960
aacgcgcttg gtgcttatgt gatctacgtg caagcagatt acggtgacga tcccgcagtg 4020
gctctctata caaagttggg catacgggaa gaagtgatgc actttgatat cgacccaagt 4080
accgccacct aacaattcgt tcaagccgag atcgtagaat ttcgacgacc tgcagccaag 4140
cataacttcg tataatgtat gctatacgaa cggtaggatc ctctagagtc gacctgcagg 4200
catgagatgt gtataagaga cag 4223
<210> 19
<211> 3792
<212> DNA
<213> 人工序列
<220>
<223> 整合盒
<400> 19
gggaattgat tctggtacca aatgagtcga ccggccagat gattaattcc taatttttgt 60
tgacactcta tcattgatag agttatttta ccactcccta tcagtgatag agaaaagtga 120
aatgaatagt tcgacaaaaa tctagaaata attttgttta actttaagaa ggagatatac 180
aaatgattac ccgcaaaagg cgggccagga caatccatag ccgatatcca atcggaattt 240
acgggagcat agtaatgaca gatattgcac agttgcttgg caaagacgcc gacaaccttt 300
tacagcaccg ttgtatgact attccttctg accagcttta tctccccgga catgactacg 360
tagaccgcgt gatgattgac aataatcgcc cgccagcggt gttacgtaat atgcagacgt 420
tgtacaacac tgggcgtctg gctggcacag gatatctttc tattctgccg gttgaccagg 480
gcgttgagca ctctgccgga gcttcatttg ctgctaaccc gctctacttt gacccgaaaa 540
acattgttga actggcgatc gaagcgggct gtaactgtgt ggcatcaact tacggcgtgt 600
tggcgtcggt atcgcggcgc tatgcgcatc gcattccatt cctcgtcaaa cttaatcaca 660
acgagacgct aagttacccg aacacctacg atcaaacgct gtatgccagc gtggagcagg 720
ccttcaacat gggcgcggtg gcggttggtg cgactatcta ttttggttcg gaagagtcac 780
gtcgccagat tgaagaaatt tctgcggctt ttgaacgtgc gcacgagctg ggcatggtga 840
cagtgctgtg ggcctatttg cgtaactccg cctttaagaa agatggcgtt gattaccatg 900
tttccgccga cctgaccggt caggcaaacc atctggcggc gaccataggt gcagatatcg 960
tcaaacaaaa aatggcggaa aataacggcg gctataaagc aattaattac ggttataccg 1020
acgatcgcgt gtacagcaag ttaaccagcg aaaacccgat tgatctggtg cgttatcagt 1080
tagctaactg ctatatgggc cgggccgggt tgataaactc cggcggtgct gcaggcggtg 1140
aaactgacct cagcgatgca gtgcgtactg cggttatcaa caaacgcgct ggcggaatgg 1200
ggctgattct tggacgtaag gcgttcaaga aatcgatggc tgacggcgtg aaactgatta 1260
acgccgtgca ggatgtttat ctcgatagca aaattactat cgcctaagag gatcgagatc 1320
tcgatcccgc gaaattaata cgactcacta taggggaatt gtgagcggat aacaattccc 1380
ctctagaaat aattttgttt aactttaaga aggagatata ccatgggcca tcatcatcat 1440
catcatcatc atcatcacag cagcggccat atcgaaggtc gtcatatggc ggtgaaagaa 1500
gcgaccagcg agaccaagaa gcgtagcggt tacgagatca ttaccctgac cagctggctg 1560
ctgcaacaag aacagaaggg tatcattgac gcggaactga ccatcgttct gagcagcatt 1620
agcatggcgt gcaaacagat cgcgagcctg gtgcaacgtg cgaacattag caacctgacc 1680
ggtacccaag gcgcggttaa catccagggt gaagaccaaa agaaactgga tgttattagc 1740
aacgaggtgt tcagcaactg cctgcgtagc agcggtcgta ccggcatcat tgcgagcgag 1800
gaagaggacg tggcggttgc ggtggaagag agctacagcg gtaactatat cgtggttttt 1860
gacccgctgg atggcagcag caacctggat gcggctgtga gcaccggtag catcttcggc 1920
atttacagcc cgaacgacga gagcctgccg gattttggtg acgatagcga cgataacacc 1980
ctgggcaccg aagagcaacg ttgcatcgtt aacgtgtgcc aaccgggtag caacctgctg 2040
gcggcgggct actgcatgta tagcagcagc gttgcgttcg tgctgaccat tggcaagggc 2100
gttttcgtgt ttaccctgga cccgctgtac ggtgaattcg tgctgaccca ggagaacctg 2160
caaatcccga agagcggtga aatttacagc tttaacgagg gcaactataa actgtgggat 2220
gaaaacctga agaaatatat cgacgatctg aaggaaccgg gtccgagcgg taaaccgtac 2280
agcgcgcgtt atatcggtag cctggttggc gacttccacc gtaccctgct gtacggtggc 2340
atttacggtt atccgcgtga taagaaaagc aagaacggca aactgcgtct gctgtatgaa 2400
tgcgcgccga tgagctttat tgttgagcag gcgggtggca aaggtagcga cggccaccag 2460
cgtgtgctgg atatccaacc gaccgaaatt caccagcgtg ttccgctgta cattggtagc 2520
accgaagagg ttgaaaaagt tgaaaagtat ctggcgtaat cgagtctggt aaagaaaccg 2580
ctgctgcgaa atttgaacgc cagcacatgg actcgtctac tagcgcagct taattaacct 2640
aggctgctgc caccgctgag caataactag cataacccct tggggcctct aaacgggtct 2700
tgaggggttt tttgctgaaa ggaggaacta tatccggatt ggcgaatggg acgcgccctg 2760
tagcggcgca ttaagcgcgg cgggtggacg gccagtgaat tcgagctcgg tacctaccgt 2820
tcgtataatg tatgctatac gaagttatcg agctctagag aatgatcccc tcattaggcc 2880
acacgttcaa gtgcagcgca caccgtggaa acggatgaag gcacgaaccc agttgacata 2940
agcctgttcg gttcgtaaac tgtaatgcaa gtagcgtatg cgctcacgca actggtccag 3000
aaccttgacc gaacgcagcg gtggtaacgg cgcagtggcg gttttcatgg cttgttatga 3060
ctgttttttt gtacagtcta tgcctcgggc atccaagcag caagcgcgtt acgccgtggg 3120
tcgatgtttg atgttatgga gcagcaacga tgttacgcag cagcaacgat gttacgcagc 3180
agggcagtcg ccctaaaaca aagttaggtg gctcaagtat gggcatcatt cgcacatgta 3240
ggctcggccc tgaccaagtc aaatccatgc gggctgctct tgatcttttc ggtcgtgagt 3300
tcggagacgt agccacctac tcccaacatc agccggactc cgattacctc gggaacttgc 3360
tccgtagtaa gacattcatc gcgcttgctg ccttcgacca agaagcggtt gttggcgctc 3420
tcgcggctta cgttctgccc aggtttgagc agccgcgtag tgagatctat atctatgatc 3480
tcgcagtctc cggcgagcac cggaggcagg gcattgccac cgcgctcatc aatctcctca 3540
agcatgaggc caacgcgctt ggtgcttatg tgatctacgt gcaagcagat tacggtgacg 3600
atcccgcagt ggctctctat acaaagttgg gcatacggga agaagtgatg cactttgata 3660
tcgacccaag taccgccacc taacaattcg ttcaagccga gatcgtagaa tttcgacgac 3720
ctgcagccaa gcataacttc gtataatgta tgctatacga acggtaggat cctctagagt 3780
cgacctgcag gc 3792
<210> 20
<211> 5917
<212> DNA
<213> 人工序列
<220>
<223> 转座子盒
<400> 20
acaggttggc tgataagtcc ccggtctagc ttgcatgcag attgcagcat tacacgtctt 60
gatttgacgg ctagctcagt cctaggtaca gtgctagcac tgctttgtgg aaggagatag 120
acttatggcg gatccgatgg aatacctcga tgtgtcgttc ggcggcacgt tcgctgcaga 180
cacctacacc acaggtggcg acgaggtggc gaagggcccc gtgaccaagc acggcagcat 240
accgaccaag cttgacggcg gcggcatcac cctcgctggc ggcaccaacg gcgtgacatt 300
cacctcgacc gcgagcttca gcgagagtgg gaaggtgaac aagggattcc gcgccgaaat 360
ggagtaccgt acgacgcaga cgcccagcaa cctcgccaca ttgttctccg ccatgggcaa 420
catcttcgtg cgggcgaacg gcagcaacct cgaatacggc ttctccacga acccttccgg 480
cagtacatgg aacgactaca caaagtccgt gacgctgcct tccaacaatg tgaagcacat 540
catccagctg acatatctgc cgggagccga cggcgctgcc tcgacgttgc agttgtcggt 600
ggatggcgtg gccggcgaga ccgccacctc cgcggccggc gagctcgcgg ccgtcagcga 660
ttccgtcggg aacaagttcg ggatcggcta cgaggtgaac cccgcttccg gcgcggcgag 720
ccgcggtctt gccggtgacg tgttccgcgc gcgtgtcgcc gattcggacg ccccgtggga 780
gattcttgac gcatcccagc tgctgcatgt caatttcaac ggcacgttca gcggcacctc 840
atataccgcg gcgagcggcg agcagatgct gggctcgctg gtgtcgcgct cggccaatcc 900
gtccatctcg aactccgccg tcacgctggg cggcggcacg gccggattcg atttcacgcc 960
cacggacttc accctcggtg acaacgaggc catcacccgc ccgctggtcg cggagctgcg 1020
cttcaccccg acgcagaccg gcgacaacca gaccctgttc ggcgcgggcg gcaacctgtt 1080
cctgcgctac gagtcgaaca agctcgtgtt cggcgcctcc accaagtccg gcgataattg 1140
gaccgaccac aagatcgagt ccgcggccgc cacgggtgcg gagcacgtcg tgtcggtggc 1200
gtacgtgccc aataaggccg gcaccggcgc gaagcttgtc atgcgcgtgg atggcggcga 1260
cgcccagacc aaggacatca ctggtctggc ttacctgaat tcgagcatca agggcaaggt 1320
cggcttcggc aacgacgtgc ataccgacgc gctcagccgc ggcttcgtcg gctcgctgag 1380
cgagatccgc ctggccgaaa cctccgcgaa cttcaccacc aacgaattca agctggtcta 1440
ctctcaggtc agctgcgaca cgtcgggcat caaggaggcg aataccttcg acgtggagcc 1500
cgccgagtgc gaggccgcgc ttaagaccaa gctgtccaag ctgcgtccga ccgaagggca 1560
ggccgactac atcgactggg gtcagatcgg attcctccat tacggcatca acacgtacta 1620
caaccaggag tggggtcacg gtaacgagga tccctcccgc atcaacccga ccggcctcga 1680
caccgaccag tgggcgaagt ccttcgccga cggtggcttc aagatgatca tggtgacggt 1740
caagcaccat gacggtttcg agctgtacga ctcgcggtac aacaccgagc acgactgggc 1800
aaacaccgcc gtcgccaagc gcacggggga gaaggacctg ttccgcaaga ttgtcgcctc 1860
ggcgaagaaa tacggcctga aggtcggcat ctactattcg ccggccgatt cctacatgga 1920
gaggaagggc gtctggggca acaactccgc acgcgtcgag cgcacgatcc ccacgctggt 1980
ggagaacgac gaccgcgccg gcaaggtggc ttccggcaaa ctgcccacgt tcaagtacaa 2040
ggccacggat tacggcgcct acatgctcaa ccagctctat gagctgctga ctgagtacgg 2100
cgacatctcc gaggtctggt tcgacggtgc ccaaggcaac accgcaggca ctgagcatta 2160
cgactatggc gtgttctacg agatgatccg ccggcttcag ccccaggcaa ttcaggccaa 2220
cgccgcatac gatgcccgat gggtgggcaa cgaggacggc tgggcccgtc agaccgagtg 2280
gagcccgcag gcggcataca acgacggcgt ggacaaggtg tcgctcaagc ctggccagat 2340
ggcccccgac ggtaagcttg gcagcatgtc gagcgtgctg tccgagatcc gcagcggcgc 2400
cgccaaccag ctgcactggt atccggccga agtcgacgcc aagaaccggc ccggatggtt 2460
ctaccgtgcc agccaatcgc cggcgtccgt agccgaagtc gtgaagtact acgagcagtc 2520
cacgggacgc aactcgcagt atctgctgaa cgtcccaccg tccgataccg gcaagctcgc 2580
cgatgcggat gccgcgggac ttaaggggct gggcgaggag ctcgcccgac gctacggcac 2640
cgatcttgcc ctgggcaaga gcgcgaccgt cgccgcgtcc gcgaacgaca ctgcggtagc 2700
ggccccgaag ctgaccgacg gttcgaagct ctcctccgac aaggccgtgg gcaatacgcc 2760
gacgtacacc atcgatctgg gcagcactgt cgccgtggat gcagtgaaga tctccgagga 2820
cgtgcgcaat gccggccagc agatcgaaag cgccactctg cagggacgag tcaatggaac 2880
atggacgaat ctggcgacta tgacgacggt cgggcagcag cgcgaccttc gcttcacgtc 2940
ccagaacatc gatgccatcc gtctggtggt caactcctcc cgcggtccgg tgcgtctgag 3000
ccgtcttgag gtgttccaca ccgaatccga gattcagacc ggcgcccgcg cctactacat 3060
cgatccgacg gcgcagaccg cgggagatgg attcacgaag gacaagccca tgacgtcgat 3120
cgagcagctg cacgatgtga ccgtcgcgcc aggctccgtg atcttcgtca aggcgggcac 3180
cgagctgacc ggggacttcg ccgtcttcgg ctacggcacc aaggacgagc ccatcaccgt 3240
gacgacatac ggcgaaagcg acaaagccac caccgcgagc ttcgacggca tgaccgccgg 3300
gctgacgctg aagcaggcgc tgaaggcgct cggcaaggac gacgccggct gggtcgtggc 3360
cgattccgcc actgcaccgg cctcccgcgt gtatgtcccg caggatgaga tcagcgtgca 3420
cgcccagtcg tcgcagaact ccggcgcaga ggcggcgagg gcgctcgacg gcgactcgtc 3480
gacgagctgg cactcccagt acagcccgac caccgcgtct gctccgcatt gggtgactct 3540
cgatctcggc aaatcgcgtg agaacgtcgc ctacttcgac tacctcgccc gtatcgacgg 3600
caacaataac ggtgccgcca aggattacga ggtgtatgtc tccgacgatc ccaacgattt 3660
tggagcccct gtggcctcgg gcacgttgaa gaacgtcgcc tacacgcagc gcatcaagct 3720
gacccccaag aacggacggt acgtcaagtt cgtcatcaag accgattatt ccggatcgaa 3780
cttcggctcc gcggcggaaa tgaatgtcga gttgctgccc acggccgtag aggaggacaa 3840
ggtcgccacc ccgcagaagc cgacagtgga cgatgatgcc gatacataca ccatccccga 3900
catcgaggga gtcgtgtaca aggtcgacgg caaggtgttg gccgctggtt ccgtagtgaa 3960
cgtgggcgat gaggacgtga ccgtcacggt caccgccgag cccgccgacg gataccgctt 4020
cccggatggt gtgacgtccc cagtcacgta tgagctgacg ttcaccaaga agggtggcga 4080
gaagcctccg accgaagtca acaaggacaa gctgcacgcc acgatcacca aggctcaggc 4140
gatcgaccgt tccgcctata cggacgagtc gctcaaggtg cttgatgaca agctcgccgc 4200
agcgctcaag gtctatgacg atgacaaggt gagccaggat gatgtcgatg ccgccgaggc 4260
ggctctgtct gcggcgatcg acgcgctgaa gaccaagccg acgacccccg gcggtgaagg 4320
tgagaagcct ggtgaaggtg aaaagcccgg tgacggcaac aagcccggtg acggcaagaa 4380
gcccggcgac gtgatcgcaa agaccggcgc ctccacaatg taactagcat aaccccttgg 4440
ggcctctaaa cgggtcttga ggggtttttt gctgaaacca atttgcctgg cggcagtagc 4500
gcggtggtcc cacctgaccc catgccgaac tcagaagtga aacgccgtag cgccgatggt 4560
agtgtggggt ctccccatgc gagagtaggg aactgccagg catcaaataa aacgaaaggc 4620
tcagtcgaaa gactgggcct ttcgggatcc aggccggcct gttaacgaat taatcttccg 4680
cggcggtatc gataagcttg atatcgaatt ccgaagttcc tattctctag acgccattca 4740
ggctgcgcaa ctgttgggaa gggcgatcgg tgcgggcctc ttcgctatta cgccagctgg 4800
cgaaaggggg atgtgctgca aggcgattaa gttgggtaac gccagggttt tcccagtcac 4860
gacgttgtaa aacgacggcc agtgaattcg agctcggtac ctaccgttcg tataatgtat 4920
gctatacgaa gttatcgagc tctagagaat gatcccctca ttaggccaca cgttcaagtg 4980
cagcgcacac cgtggaaacg gatgaaggca cgaacccagt tgacataagc ctgttcggtt 5040
cgtaaactgt aatgcaagta gcgtatgcgc tcacgcaact ggtccagaac cttgaccgaa 5100
cgcagcggtg gtaacggcgc agtggcggtt ttcatggctt gttatgactg tttttttgta 5160
cagtctatgc ctcgggcatc caagcagcaa gcgcgttacg ccgtgggtcg atgtttgatg 5220
ttatggagca gcaacgatgt tacgcagcag caacgatgtt acgcagcagg gcagtcgccc 5280
taaaacaaag ttaggtggct caagtatggg catcattcgc acatgtaggc tcggccctga 5340
ccaagtcaaa tccatgcggg ctgctcttga tcttttcggt cgtgagttcg gagacgtagc 5400
cacctactcc caacatcagc cggactccga ttacctcggg aacttgctcc gtagtaagac 5460
attcatcgcg cttgctgcct tcgaccaaga agcggttgtt ggcgctctcg cggcttacgt 5520
tctgcccaga tttgagcagc cgcgtagtga gatctatatc tatgatctcg cagtctccgg 5580
cgagcaccgg aggcagggca ttgccaccgc gctcatcaat ctcctcaagc atgaggccaa 5640
cgcgcttggt gcttatgtga tctacgtgca agcagattac ggtgacgatc ccgcagtggc 5700
tctctataca aagttgggca tacgggaaga agtgatgcac tttgatatcg acccaagtac 5760
cgccacctaa caattcgttc aagccgagat cgtagaattt cgacgacctg cagccaagca 5820
taacttcgta taatgtatgc tatacgaacg gtaggatcct ctagagtcga ccaggtggca 5880
cttttcgggc agaccgggga cttatcagcc aacctgt 5917
Claims (15)
1.一种使用基因工程微生物宿主细胞生产所需低聚糖的方法,所述方法包括以下步骤:
(i)提供能够产生所需低聚糖的基因工程微生物宿主细胞,其中所述微生物宿主细胞已被基因工程改造以表达至少一种异源糖苷酶,其能够在细胞内降解在所需低聚糖的细胞内生物合成过程中产生的代谢糖副产物,其中所述微生物宿主细胞能够回收由所述糖苷酶的酶活性产生的降解产物,用于产生所需低聚糖;
(ii)在允许产生所需低聚糖的条件下和培养基中培养所述基因工程微生物宿主细胞,从而产生所需低聚糖;和
(iii)任选地,回收所需低聚糖。
2.一种用于生产所需低聚糖的基因工程微生物宿主细胞,其中所述微生物宿主细胞
a)能够产生所需低聚糖;
b)已被基因工程改造以表达至少一种异源糖苷酶,其能够在细胞内降解在所需低聚糖的细胞内生物合成过程中产生的代谢糖副产物;和
c)能够回收由所述糖苷酶的酶活性产生的降解产物,以产生所需低聚糖。
3.根据权利要求1所述的方法或根据权利要求2所述的基因工程微生物宿主细胞,其中所述异源糖苷酶选自岩藻糖苷酶、唾液酸酶、己糖胺酶、半乳糖苷酶和葡糖苷酶。
4.根据权利要求1至3中任一项所述的方法或基因工程微生物宿主细胞,其中所述异源糖苷酶选自α-1,2-岩藻糖苷酶、α-1,3-岩藻糖苷酶、α-2,3-唾液酸酶、α-2,6-唾液酸酶、α-2,8-唾液酸酶、β-1,3-半乳糖苷酶、β-1,4-半乳糖苷酶、β-1,6-半乳糖苷酶、β-N-乙酰己糖胺酶和β-1,3-葡糖苷酶。
5.根据权利要求1至4中任一项所述的方法或基因工程微生物宿主细胞,其中所述基因工程微生物宿主细胞已被基因工程改造以表达异源糖基转移酶,优选选自岩藻糖基转移酶、唾液酸转移酶、半乳糖基转移酶、N-乙酰葡糖胺基转移酶和葡糖基转移酶的糖基转移酶。
6.根据权利要求1至5中任一项所述的方法或基因工程微生物宿主细胞,其中所述微生物宿主细胞已被基因工程改造以表达异源α-1,3-岩藻糖基转移酶和异源α-1,2-岩藻糖苷酶。
7.根据权利要求1至5中任一项所述的方法或基因工程微生物宿主细胞,其中所述微生物宿主细胞已被基因工程改造以表达异源α-1,2-岩藻糖基转移酶和异源α-1,3-岩藻糖苷酶。
8.根据权利要求1至5中任一项所述的方法或基因工程微生物宿主细胞,其中所述微生物宿主细胞已被基因工程改造以表达异源β-1,3-N-乙酰葡糖胺基转移酶、异源α-1,2-岩藻糖基转移酶、异源β-1,3-半乳糖基转移酶和异源α-1,3-岩藻糖苷酶。
9.根据权利要求1至5中任一项所述的方法或基因工程微生物宿主细胞,其中所述微生物宿主细胞已被基因工程改造以表达异源α-2,6-唾液酸转移酶和异源α-2,3-唾液酸酶。
10.根据权利要求1至5中任一项所述的方法或基因工程微生物宿主细胞,其中所述微生物宿主细胞已被基因工程改造以表达异源β-1,3-N-乙酰葡糖胺基转移酶、异源β-1,4-半乳糖基转移酶和异源β-1,3-半乳糖苷酶和/或β-1,3-葡糖苷酶和/或半乳聚糖-β-1,3-半乳糖苷酶。
11.根据权利要求1至5中任一项所述的方法或基因工程微生物宿主细胞,其中所述微生物宿主细胞已被基因工程改造以表达异源β-1,3-N-乙酰葡糖胺基转移酶、异源β-1,3-半乳糖基转移酶和异源β-1,3-葡糖苷酶和/或半乳聚糖-β-1,3-半乳糖苷酶。
12.根据权利要求1至11中任一项所述的方法或基因工程微生物宿主细胞,其中所述所需低聚糖为人乳低聚糖,优选选自以下的人乳低聚糖:2’-岩藻糖基乳糖(2’-FL)、3-岩藻糖基乳糖(3-FL)、2’,3-二岩藻糖基乳糖、乳-N-三糖II、乳-N-四糖、乳-N-新四糖、乳-N-岩藻戊糖I、乳-N-新岩藻戊糖I、乳-N-岩藻戊糖II、乳-N-岩藻戊糖III、乳-N-岩藻戊糖V、乳-N-新岩藻戊糖V、乳-N-二岩藻己糖I、乳-N-二岩藻糖基己糖II、对-乳-N-岩藻糖基己糖、岩藻糖基-乳-N-唾液酸戊糖b、岩藻糖基-乳-N-唾液酸戊糖c、岩藻糖基-乳-N-唾液酸戊糖c、二唾液酸-乳-N-岩藻戊糖、3-岩藻糖基-3’-唾液酸乳糖、3-岩藻糖基-6’-唾液酸乳糖、乳-N-新二岩藻己糖I、3’-唾液酸乳糖、6’-唾液酸乳糖、唾液酸乳-N-四糖a(LST-a)、唾液酸乳-N-四糖b(LST-b)、唾液酸乳-N-四糖c(LST-c)和二唾液酸乳-N-四糖。
13.根据权利要求2至12中任一项所述的基因工程微生物宿主细胞用于生产所需低聚糖、优选选自HMO的低聚糖的用途。
14.一种低聚糖,优选选自HMO的低聚糖,其通过权利要求1至12中任一项所述的方法或通过使用权利要求1至12中任一项所述的基因工程微生物宿主细胞生产,其用于制备营养组合物。
15.一种营养组合物,其含有通过权利要求1至12中任一项所述的方法或通过使用权利要求1至12中任一项所述的基因工程微生物宿主细胞生产的至少一种低聚糖,其中所述至少一种低聚糖优选为HMO。
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP18172658.9A EP3569713A1 (en) | 2018-05-16 | 2018-05-16 | Use of glycosidases in the production of oligosaccharides |
EP18172658.9 | 2018-05-16 | ||
PCT/EP2019/062160 WO2019219578A1 (en) | 2018-05-16 | 2019-05-13 | Use of glycosidases in the production of oligosaccharides |
Publications (1)
Publication Number | Publication Date |
---|---|
CN112119164A true CN112119164A (zh) | 2020-12-22 |
Family
ID=62186367
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201980032286.2A Pending CN112119164A (zh) | 2018-05-16 | 2019-05-13 | 糖苷酶在低聚糖生产中的用途 |
Country Status (11)
Country | Link |
---|---|
US (1) | US20210363557A1 (zh) |
EP (2) | EP3569713A1 (zh) |
JP (4) | JP2021524232A (zh) |
KR (1) | KR20210010472A (zh) |
CN (1) | CN112119164A (zh) |
AU (2) | AU2019270211B2 (zh) |
BR (1) | BR112020023228A2 (zh) |
MX (1) | MX2020012152A (zh) |
PH (1) | PH12020551940A1 (zh) |
SG (1) | SG11202010730SA (zh) |
WO (1) | WO2019219578A1 (zh) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115003687A (zh) * | 2020-01-23 | 2022-09-02 | 格礼卡姆股份公司 | Hmo生产 |
CN117321217A (zh) * | 2021-05-17 | 2023-12-29 | 帝斯曼知识产权资产管理有限公司 | 用于体内生产纯LNFP-I的α-1,2-岩藻糖基转移酶的识别 |
CN117597451A (zh) * | 2021-05-17 | 2024-02-23 | 帝斯曼知识产权资产管理有限公司 | 通过修饰细胞内乳糖的输入来增强hmo的形成 |
CN118834856A (zh) * | 2024-07-25 | 2024-10-25 | 合肥中科健康生物产业技术研究院有限公司 | 一种α-L-岩藻糖苷酶突变体及其应用 |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
MX2020000969A (es) * | 2017-07-26 | 2020-09-28 | Jennewein Biotechnologie Gmbh | Sialil-transferasas y su uso en la produccion de oligosacaridos sialilados. |
WO2019222391A1 (en) * | 2018-05-15 | 2019-11-21 | The Board Of Trustees Of The University Of Illinois | Engineered microorganisms for production of 2' fucosyllactose and l-fucose |
US11162102B2 (en) | 2019-05-13 | 2021-11-02 | Dna Twopointo Inc. | Modifications of mammalian cells using artificial micro-RNA to alter their properties and the compositions of their products |
CN110804577B (zh) * | 2019-11-28 | 2021-05-28 | 江南大学 | 一种高效生产2’-岩藻糖基乳糖的重组菌的构建方法及其应用 |
JP2023511523A (ja) * | 2020-01-23 | 2023-03-20 | グリコム・アクティーゼルスカブ | Hmoの産生 |
US20230109661A1 (en) * | 2020-01-23 | 2023-04-06 | Glycom A/S | Hmo production |
KR20230048380A (ko) * | 2020-08-10 | 2023-04-11 | 인바이오스 엔.브이. | 세포에 의한 올리고당 혼합물의 제조 |
US20230313187A1 (en) * | 2020-09-04 | 2023-10-05 | Dna Twopointo Inc. | Modifications of mammalian cells using artificial micro-rna to alter their properties and the compositions of their products |
EP4281577A1 (en) * | 2021-01-20 | 2023-11-29 | Inbiose N.V. | Production of oligosaccharides comprising ln3 as core structure in host cells |
WO2024042235A1 (en) * | 2022-08-25 | 2024-02-29 | Dsm Ip Assets B.V. | Hybrid method for producing complex hmos |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2015150328A1 (en) * | 2014-03-31 | 2015-10-08 | Jennewein Biotechnologie Gmbh | Total fermentation of oligosaccharides |
CN105722991A (zh) * | 2013-09-10 | 2016-06-29 | 詹尼温生物技术有限责任公司 | 寡糖的生产 |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102459605A (zh) * | 2009-06-08 | 2012-05-16 | 詹尼温生物技术有限责任公司 | Hmo合成 |
ES2439507T3 (es) * | 2011-01-20 | 2014-01-23 | Jennewein Biotechnologie Gmbh | Fucosiltransferasas novedosas y sus aplicaciones |
DK2728009T3 (da) * | 2012-10-31 | 2017-11-06 | Jennewein Biotechnologie Gmbh | Fremgangsmåde til fremstilling af monosaccharider |
EP2931737A4 (en) | 2012-12-14 | 2016-11-16 | Glycom As | MIXTURE FROM FUCOSYLATE LACTOSES |
CN111172220B (zh) | 2013-09-06 | 2023-08-04 | 格礼卡姆股份公司 | 寡糖的发酵生产 |
CN106715076B (zh) | 2014-08-11 | 2020-03-13 | 圣万提注塑工业(苏州)有限公司 | 能够实现多个活塞速度的致动装置和方法 |
CA2958294C (en) | 2014-08-19 | 2019-02-19 | Atlas James RUSSELL | System, method and apparatus for recycling asphalt shingles and producing asphalt mix |
EP3050973A1 (en) | 2015-01-30 | 2016-08-03 | Jennewein Biotechnologie GmbH | Fermentation process for producing monosaccharides in free form from nucleotide-activated sugars |
-
2018
- 2018-05-16 EP EP18172658.9A patent/EP3569713A1/en not_active Ceased
-
2019
- 2019-05-13 EP EP19725062.4A patent/EP3794134A1/en active Pending
- 2019-05-13 SG SG11202010730SA patent/SG11202010730SA/en unknown
- 2019-05-13 KR KR1020207033599A patent/KR20210010472A/ko not_active Application Discontinuation
- 2019-05-13 WO PCT/EP2019/062160 patent/WO2019219578A1/en active Application Filing
- 2019-05-13 CN CN201980032286.2A patent/CN112119164A/zh active Pending
- 2019-05-13 AU AU2019270211A patent/AU2019270211B2/en active Active
- 2019-05-13 MX MX2020012152A patent/MX2020012152A/es unknown
- 2019-05-13 JP JP2020564406A patent/JP2021524232A/ja active Pending
- 2019-05-13 US US17/054,950 patent/US20210363557A1/en active Pending
- 2019-05-13 BR BR112020023228-9A patent/BR112020023228A2/pt unknown
-
2020
- 2020-11-13 PH PH12020551940A patent/PH12020551940A1/en unknown
-
2024
- 2024-04-03 JP JP2024059942A patent/JP2024081771A/ja active Pending
- 2024-04-03 JP JP2024059943A patent/JP2024081772A/ja active Pending
- 2024-04-03 JP JP2024059945A patent/JP2024081773A/ja active Pending
-
2025
- 2025-01-07 AU AU2025200092A patent/AU2025200092A1/en active Pending
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105722991A (zh) * | 2013-09-10 | 2016-06-29 | 詹尼温生物技术有限责任公司 | 寡糖的生产 |
WO2015150328A1 (en) * | 2014-03-31 | 2015-10-08 | Jennewein Biotechnologie Gmbh | Total fermentation of oligosaccharides |
Non-Patent Citations (1)
Title |
---|
HISASHI ASHIDA等: "Two distinct α-L-fucosidases from Bifidobacterium bifidum are essential for the utilization of fucosylated milk oligosaccharides and glycoconjugates", 《GLYCOBIOLOGY》, vol. 9, no. 9, 11 June 2009 (2009-06-11) * |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115003687A (zh) * | 2020-01-23 | 2022-09-02 | 格礼卡姆股份公司 | Hmo生产 |
CN115003687B (zh) * | 2020-01-23 | 2024-12-06 | 格礼卡姆股份公司 | Hmo生产 |
CN117321217A (zh) * | 2021-05-17 | 2023-12-29 | 帝斯曼知识产权资产管理有限公司 | 用于体内生产纯LNFP-I的α-1,2-岩藻糖基转移酶的识别 |
CN117597451A (zh) * | 2021-05-17 | 2024-02-23 | 帝斯曼知识产权资产管理有限公司 | 通过修饰细胞内乳糖的输入来增强hmo的形成 |
CN118834856A (zh) * | 2024-07-25 | 2024-10-25 | 合肥中科健康生物产业技术研究院有限公司 | 一种α-L-岩藻糖苷酶突变体及其应用 |
Also Published As
Publication number | Publication date |
---|---|
JP2021524232A (ja) | 2021-09-13 |
WO2019219578A1 (en) | 2019-11-21 |
JP2024081772A (ja) | 2024-06-18 |
SG11202010730SA (en) | 2020-11-27 |
PH12020551940A1 (en) | 2021-06-21 |
BR112020023228A2 (pt) | 2021-02-23 |
EP3569713A1 (en) | 2019-11-20 |
AU2019270211B2 (en) | 2024-10-10 |
EP3794134A1 (en) | 2021-03-24 |
AU2025200092A1 (en) | 2025-01-30 |
JP2024081773A (ja) | 2024-06-18 |
AU2019270211A1 (en) | 2020-11-26 |
KR20210010472A (ko) | 2021-01-27 |
MX2020012152A (es) | 2021-01-29 |
US20210363557A1 (en) | 2021-11-25 |
JP2024081771A (ja) | 2024-06-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112119164A (zh) | 糖苷酶在低聚糖生产中的用途 | |
KR102554781B1 (ko) | 푸코실화 올리고당(fucosylated oligosaccharide)의 생산을 위한 향상된 공정 | |
JP2024028710A (ja) | フコシル基転移酵素及びフコシル化オリゴ糖の生産におけるその使用 | |
CN113166789A (zh) | 岩藻糖基化寡糖lnfp-v的合成 | |
WO2021148618A9 (en) | New major facilitator superfamily (mfs) protein (bad) in hmo production | |
TW201839139A (zh) | 用於生產塔格糖的組成物與利用其生產塔格糖的方法 | |
US10519475B1 (en) | Biosynthesis of compounds in yeast | |
JP2024516207A (ja) | インベルターゼ/スクロースヒドロラーゼを発現する微生物株 | |
WO2022243312A1 (en) | IDENTIFICATION OF AN α-1,2-FUCOSYLTRANSFERASE FOR THE IN VIVO PRODUCTION OF PURE LNFP-I | |
WO2023099680A1 (en) | Cells with tri-, tetra- or pentasaccharide importers useful in oligosaccharide production | |
WO2004081216A1 (ja) | 酢酸菌のアルコール脱水素酵素遺伝子 | |
CN117321210A (zh) | 产生以lnfp-i和lnt作为主要化合物的hmo共混物的方法 | |
RU2810730C2 (ru) | Применение гликозидаз в получении олигосахаридов | |
CN116802286A (zh) | 一种产生dfl的菌株 | |
RU2790445C2 (ru) | Улучшенный способ получения фукозилированных олигосахаридов | |
JP7331278B1 (ja) | 3’slのインビボ合成のための新規なシアリルトランスフェラーゼ | |
RU2818835C2 (ru) | Фукозилтрансферазы и их применение для получения фукозилированных олигосахаридов | |
WO2024046994A1 (en) | Fermentative production of oligosaccharides by microbial cells utilizing glycerol | |
CN118804979A (zh) | 用于体内合成lst-a的新型唾液酸转移酶 | |
CN118786221A (zh) | 用于体内合成3'sl和6'sl的新型唾液酸转移酶 | |
WO2024133702A2 (en) | New fucosyltransferases for production of 3fl | |
DK202200591A1 (en) | New sialyltransferases for in vivo synthesis of lst-c | |
CN117321071A (zh) | 表达转化酶/蔗糖水解酶的微生物菌株 | |
CN118440916A (zh) | 载体、转化子、唾液酸转移酶突变体及其应用 | |
CN117355613A (zh) | 产生以lnfp-i和2’-fl作为主要化合物的hmo共混物分布的方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
REG | Reference to a national code |
Ref country code: HK Ref legal event code: DE Ref document number: 40038438 Country of ref document: HK |
|
CB02 | Change of applicant information |
Address after: Bright Bach, Rhine, Germany Applicant after: Kohansen breast milk oligosaccharides Co.,Ltd. Address before: Bright Bach, Rhine, Germany Applicant before: JENNEWEIN BIOTECHNOLOGIE GmbH |
|
CB02 | Change of applicant information |