KR20200091458A - 에틸렌 글리콜의 생물학적 생성을 위한 미생물 및 방법 - Google Patents
에틸렌 글리콜의 생물학적 생성을 위한 미생물 및 방법 Download PDFInfo
- Publication number
- KR20200091458A KR20200091458A KR1020207020008A KR20207020008A KR20200091458A KR 20200091458 A KR20200091458 A KR 20200091458A KR 1020207020008 A KR1020207020008 A KR 1020207020008A KR 20207020008 A KR20207020008 A KR 20207020008A KR 20200091458 A KR20200091458 A KR 20200091458A
- Authority
- KR
- South Korea
- Prior art keywords
- ala
- leu
- glu
- gly
- val
- Prior art date
Links
- LYCAIKOWRPUZTN-UHFFFAOYSA-N Ethylene glycol Chemical compound OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 title claims abstract description 403
- 244000005700 microbiome Species 0.000 title claims abstract description 315
- 238000000034 method Methods 0.000 title claims abstract description 51
- 238000004519 manufacturing process Methods 0.000 title abstract description 42
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 claims abstract description 60
- KRKNYBCHXYNGOX-UHFFFAOYSA-K Citrate Chemical compound [O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O KRKNYBCHXYNGOX-UHFFFAOYSA-K 0.000 claims abstract description 44
- KHPXUQMNIQBQEV-UHFFFAOYSA-N oxaloacetic acid Chemical compound OC(=O)CC(=O)C(O)=O KHPXUQMNIQBQEV-UHFFFAOYSA-N 0.000 claims abstract description 42
- 239000004471 Glycine Substances 0.000 claims abstract description 27
- 239000002243 precursor Substances 0.000 claims abstract description 21
- 229940049920 malate Drugs 0.000 claims abstract description 13
- BJEPYKJPYRNKOW-UHFFFAOYSA-N malic acid Chemical compound OC(=O)C(O)CC(O)=O BJEPYKJPYRNKOW-UHFFFAOYSA-N 0.000 claims abstract description 13
- QYNUQALWYRSVHF-OLZOCXBDSA-N (6R)-5,10-methylenetetrahydrofolic acid Chemical compound C([C@H]1CNC=2N=C(NC(=O)C=2N1C1)N)N1C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 QYNUQALWYRSVHF-OLZOCXBDSA-N 0.000 claims abstract description 11
- 102000004190 Enzymes Human genes 0.000 claims description 340
- 108090000790 Enzymes Proteins 0.000 claims description 340
- WGCNASOHLSPBMP-UHFFFAOYSA-N Glycolaldehyde Chemical compound OCC=O WGCNASOHLSPBMP-UHFFFAOYSA-N 0.000 claims description 61
- AEMRFAOFKBGASW-UHFFFAOYSA-M Glycolate Chemical compound OCC([O-])=O AEMRFAOFKBGASW-UHFFFAOYSA-M 0.000 claims description 56
- HHLFWLYXYJOTON-UHFFFAOYSA-N glyoxylic acid Chemical compound OC(=O)C=O HHLFWLYXYJOTON-UHFFFAOYSA-N 0.000 claims description 53
- 239000000758 substrate Substances 0.000 claims description 53
- 241000193403 Clostridium Species 0.000 claims description 52
- LCTONWCANYUPML-UHFFFAOYSA-M Pyruvate Chemical compound CC(=O)C([O-])=O LCTONWCANYUPML-UHFFFAOYSA-M 0.000 claims description 38
- 229910002091 carbon monoxide Inorganic materials 0.000 claims description 36
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 claims description 34
- ZSLZBFCDCINBPY-ZSJPKINUSA-N acetyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 ZSLZBFCDCINBPY-ZSJPKINUSA-N 0.000 claims description 34
- 102000005369 Aldehyde Dehydrogenase Human genes 0.000 claims description 27
- 108020002663 Aldehyde Dehydrogenase Proteins 0.000 claims description 25
- 230000035772 mutation Effects 0.000 claims description 25
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 claims description 24
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 claims description 24
- 230000014509 gene expression Effects 0.000 claims description 24
- 229960001153 serine Drugs 0.000 claims description 24
- 230000037361 pathway Effects 0.000 claims description 23
- ODBLHEXUDAPZAU-UHFFFAOYSA-N isocitric acid Chemical compound OC(=O)C(O)C(C(O)=O)CC(O)=O ODBLHEXUDAPZAU-UHFFFAOYSA-N 0.000 claims description 22
- HHDDCCUIIUWNGJ-UHFFFAOYSA-N 3-hydroxypyruvic acid Chemical compound OCC(=O)C(O)=O HHDDCCUIIUWNGJ-UHFFFAOYSA-N 0.000 claims description 21
- 229930029653 phosphoenolpyruvate Natural products 0.000 claims description 21
- DTBNBXWJWCWCIK-UHFFFAOYSA-N phosphoenolpyruvic acid Chemical compound OC(=O)C(=C)OP(O)(O)=O DTBNBXWJWCWCIK-UHFFFAOYSA-N 0.000 claims description 21
- 229940091179 aconitate Drugs 0.000 claims description 20
- GTZCVFVGUGFEME-UHFFFAOYSA-N aconitic acid Chemical compound OC(=O)CC(C(O)=O)=CC(O)=O GTZCVFVGUGFEME-UHFFFAOYSA-N 0.000 claims description 20
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical group CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 claims description 15
- 241001656809 Clostridium autoethanogenum Species 0.000 claims description 15
- 108010068673 Glycolaldehyde dehydrogenase Proteins 0.000 claims description 14
- 108020003285 Isocitrate lyase Proteins 0.000 claims description 14
- VZCYOOQTPOCHFL-UPHRSURJSA-N maleic acid Chemical compound OC(=O)\C=C/C(O)=O VZCYOOQTPOCHFL-UPHRSURJSA-N 0.000 claims description 14
- 235000015097 nutrients Nutrition 0.000 claims description 14
- VZCYOOQTPOCHFL-UHFFFAOYSA-N trans-butenedioic acid Natural products OC(=O)C=CC(O)=O VZCYOOQTPOCHFL-UHFFFAOYSA-N 0.000 claims description 14
- GXIURPTVHJPJLF-UWTATZPHSA-N 2-phospho-D-glyceric acid Chemical compound OC[C@H](C(O)=O)OP(O)(O)=O GXIURPTVHJPJLF-UWTATZPHSA-N 0.000 claims description 12
- OSJPPGNTCRNQQC-UWTATZPHSA-N 3-phospho-D-glyceric acid Chemical compound OC(=O)[C@H](O)COP(O)(O)=O OSJPPGNTCRNQQC-UWTATZPHSA-N 0.000 claims description 12
- LFLUCDOSQPJJBE-UHFFFAOYSA-N 3-phosphonooxypyruvic acid Chemical compound OC(=O)C(=O)COP(O)(O)=O LFLUCDOSQPJJBE-UHFFFAOYSA-N 0.000 claims description 12
- 108010000445 Glycerate dehydrogenase Proteins 0.000 claims description 12
- BZQFBWGGLXLEPQ-REOHCLBHSA-N phosphoserine Chemical compound OC(=O)[C@@H](N)COP(O)(O)=O BZQFBWGGLXLEPQ-REOHCLBHSA-N 0.000 claims description 12
- 102000004316 Oxidoreductases Human genes 0.000 claims description 11
- 108090000854 Oxidoreductases Proteins 0.000 claims description 11
- 108010060800 serine-pyruvate aminotransferase Proteins 0.000 claims description 11
- 108010033918 Alanine-glyoxylate transaminase Proteins 0.000 claims description 10
- 241000193830 Bacillus <bacterium> Species 0.000 claims description 10
- 108030005073 Glycolate dehydrogenases Proteins 0.000 claims description 9
- 150000001299 aldehydes Chemical class 0.000 claims description 9
- OWBTYPJTUOEWEK-UHFFFAOYSA-N butane-2,3-diol Chemical compound CC(O)C(C)O OWBTYPJTUOEWEK-UHFFFAOYSA-N 0.000 claims description 8
- IHVFHZGGMJDGGZ-UHFFFAOYSA-N 2-[2-[2-[3-[[4-[[[5-(6-aminopurin-9-yl)-4-hydroxy-3-phosphonooxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-hydroxyphosphoryl]oxy-2-hydroxy-3,3-dimethylbutanoyl]amino]propanoylamino]ethylsulfanyl]-2-oxoethyl]-2-hydroxybutanedioic acid Chemical compound OC1C(OP(O)(O)=O)C(COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCSC(=O)CC(O)(CC(O)=O)C(O)=O)OC1N1C2=NC=NC(N)=C2N=C1 IHVFHZGGMJDGGZ-UHFFFAOYSA-N 0.000 claims description 7
- 108010036684 Glycine Dehydrogenase Proteins 0.000 claims description 7
- 102100033495 Glycine dehydrogenase (decarboxylating), mitochondrial Human genes 0.000 claims description 7
- 108010075869 Isocitrate Dehydrogenase Proteins 0.000 claims description 7
- 102000012011 Isocitrate Dehydrogenase Human genes 0.000 claims description 7
- RBNPOMFGQQGHHO-UHFFFAOYSA-N -2,3-Dihydroxypropanoic acid Natural products OCC(O)C(O)=O RBNPOMFGQQGHHO-UHFFFAOYSA-N 0.000 claims description 6
- RBNPOMFGQQGHHO-UWTATZPHSA-M D-glycerate Chemical compound OC[C@@H](O)C([O-])=O RBNPOMFGQQGHHO-UWTATZPHSA-M 0.000 claims description 6
- 241000037909 Alkalibaculum Species 0.000 claims description 5
- 241000588722 Escherichia Species 0.000 claims description 5
- 241000178986 Oxobacter Species 0.000 claims description 5
- 241000589516 Pseudomonas Species 0.000 claims description 5
- 241000187747 Streptomyces Species 0.000 claims description 5
- 239000003513 alkali Substances 0.000 claims description 5
- 108010031025 Alanine Dehydrogenase Proteins 0.000 claims description 4
- 241001202853 Blautia Species 0.000 claims description 4
- 241001171821 Clostridium coskatii Species 0.000 claims description 4
- 241000178985 Moorella Species 0.000 claims description 4
- 108030001025 Serine-glyoxylate transaminases Proteins 0.000 claims description 4
- 108010084086 Succinate-Semialdehyde Dehydrogenase Proteins 0.000 claims description 4
- 102100023673 Succinate-semialdehyde dehydrogenase, mitochondrial Human genes 0.000 claims description 4
- 241000186339 Thermoanaerobacter Species 0.000 claims description 4
- 125000000480 butynyl group Chemical group [*]C#CC([H])([H])C([H])([H])[H] 0.000 claims description 4
- 239000000543 intermediate Substances 0.000 claims description 4
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 claims description 4
- 108010015450 2,5-dioxovalerate dehydrogenase Proteins 0.000 claims description 3
- 102000004146 ATP citrate synthases Human genes 0.000 claims description 3
- 108090000662 ATP citrate synthases Proteins 0.000 claims description 3
- 241001468161 Acetobacterium Species 0.000 claims description 3
- 102100024085 Alpha-aminoadipic semialdehyde dehydrogenase Human genes 0.000 claims description 3
- 108010049668 Betaine-Aldehyde Dehydrogenase Proteins 0.000 claims description 3
- 108030002234 Citrate (Re)-synthases Proteins 0.000 claims description 3
- 241000193161 Clostridium formicaceticum Species 0.000 claims description 3
- 241000186566 Clostridium ljungdahlii Species 0.000 claims description 3
- 241001468167 Clostridium magnum Species 0.000 claims description 3
- 241001611023 Clostridium ragsdalei Species 0.000 claims description 3
- 241000186587 Clostridium scatologenes Species 0.000 claims description 3
- 241000186394 Eubacterium Species 0.000 claims description 3
- 241000186398 Eubacterium limosum Species 0.000 claims description 3
- 241000589236 Gluconobacter Species 0.000 claims description 3
- 108010011689 Glycine transaminase Proteins 0.000 claims description 3
- 108030001064 Glycine-oxaloacetate transaminases Proteins 0.000 claims description 3
- 108010057617 Lactaldehyde dehydrogenase Proteins 0.000 claims description 3
- 241000568397 Lysinibacillus Species 0.000 claims description 3
- 241000778933 Sedimenticola Species 0.000 claims description 3
- 241000204388 Sporomusa Species 0.000 claims description 3
- 241000186547 Sporosarcina Species 0.000 claims description 3
- FZWLAAWBMGSTSO-UHFFFAOYSA-N Thiazole Chemical compound C1=CSC=N1 FZWLAAWBMGSTSO-UHFFFAOYSA-N 0.000 claims description 3
- 239000000571 coke Substances 0.000 claims description 3
- 238000012258 culturing Methods 0.000 claims description 3
- KDYFGRWQOYBRFD-UHFFFAOYSA-L succinate(2-) Chemical compound [O-]C(=O)CCC([O-])=O KDYFGRWQOYBRFD-UHFFFAOYSA-L 0.000 claims description 3
- 241001468163 Acetobacterium woodii Species 0.000 claims description 2
- 241001464894 Blautia producta Species 0.000 claims description 2
- 241001656810 Clostridium aceticum Species 0.000 claims description 2
- 241001611022 Clostridium carboxidivorans Species 0.000 claims description 2
- 241000328950 Clostridium drakei Species 0.000 claims description 2
- 241000862974 Hyphomicrobium Species 0.000 claims description 2
- 241000193459 Moorella thermoacetica Species 0.000 claims description 2
- 241001509483 Oxobacter pfennigii Species 0.000 claims description 2
- 241000179039 Paenibacillus Species 0.000 claims description 2
- 241000204376 Sporomusa ovata Species 0.000 claims description 2
- 241000543642 Sporomusa silvacetica Species 0.000 claims description 2
- 241000217849 Sporomusa sphaeroides Species 0.000 claims description 2
- 241000204652 Thermotoga Species 0.000 claims description 2
- 241000605118 Thiobacillus Species 0.000 claims description 2
- LCCNCVORNKJIRZ-UHFFFAOYSA-N parathion Chemical compound CCOP(=S)(OCC)OC1=CC=C([N+]([O-])=O)C=C1 LCCNCVORNKJIRZ-UHFFFAOYSA-N 0.000 claims description 2
- 125000001487 glyoxylate group Chemical group O=C([O-])C(=O)[*] 0.000 claims 2
- 241000606860 Pasteurella Species 0.000 claims 1
- 239000000203 mixture Substances 0.000 abstract description 20
- 229920000642 polymer Polymers 0.000 abstract description 16
- -1 polyethylene terephthalate Polymers 0.000 abstract description 6
- 229920000139 polyethylene terephthalate Polymers 0.000 abstract description 3
- 239000005020 polyethylene terephthalate Substances 0.000 abstract description 3
- 238000006243 chemical reaction Methods 0.000 description 127
- 125000003729 nucleotide group Chemical group 0.000 description 63
- 239000002773 nucleotide Substances 0.000 description 60
- 108090000623 proteins and genes Proteins 0.000 description 58
- 150000007523 nucleic acids Chemical class 0.000 description 43
- 150000001413 amino acids Chemical class 0.000 description 42
- 102000039446 nucleic acids Human genes 0.000 description 42
- 108020004707 nucleic acids Proteins 0.000 description 42
- 239000000047 product Substances 0.000 description 40
- 102000004169 proteins and genes Human genes 0.000 description 32
- 238000000855 fermentation Methods 0.000 description 31
- 235000018102 proteins Nutrition 0.000 description 31
- 230000004151 fermentation Effects 0.000 description 30
- 108020004414 DNA Proteins 0.000 description 29
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 28
- 239000007789 gas Substances 0.000 description 28
- 241000894007 species Species 0.000 description 28
- 108010047495 alanylglycine Proteins 0.000 description 25
- 108010050848 glycylleucine Proteins 0.000 description 23
- 108010049041 glutamylalanine Proteins 0.000 description 22
- 229910052799 carbon Inorganic materials 0.000 description 21
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 21
- 241000588724 Escherichia coli Species 0.000 description 20
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 20
- 108091005461 Nucleic proteins Proteins 0.000 description 19
- 108010005233 alanylglutamic acid Proteins 0.000 description 19
- 230000000670 limiting effect Effects 0.000 description 19
- 108010017391 lysylvaline Proteins 0.000 description 18
- 108010087924 alanylproline Proteins 0.000 description 17
- 108010034529 leucyl-lysine Proteins 0.000 description 17
- 108010061238 threonyl-glycine Proteins 0.000 description 16
- 102000014898 transaminase activity proteins Human genes 0.000 description 16
- 241000880493 Leptailurus serval Species 0.000 description 15
- 108010057821 leucylproline Proteins 0.000 description 15
- 239000002609 medium Substances 0.000 description 15
- ULGZDMOVFRHVEP-RWJQBGPGSA-N Erythromycin Chemical compound O([C@@H]1[C@@H](C)C(=O)O[C@@H]([C@@]([C@H](O)[C@@H](C)C(=O)[C@H](C)C[C@@](C)(O)[C@H](O[C@H]2[C@@H]([C@H](C[C@@H](C)O2)N(C)C)O)[C@H]1C)(C)O)CC)[C@H]1C[C@@](C)(OC)[C@@H](O)[C@H](C)O1 ULGZDMOVFRHVEP-RWJQBGPGSA-N 0.000 description 14
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 14
- 210000004027 cell Anatomy 0.000 description 14
- 239000013604 expression vector Substances 0.000 description 14
- 108010009298 lysylglutamic acid Proteins 0.000 description 14
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 13
- 230000000694 effects Effects 0.000 description 13
- 108010078144 glutaminyl-glycine Proteins 0.000 description 13
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 13
- 239000013612 plasmid Substances 0.000 description 13
- 101710088194 Dehydrogenase Proteins 0.000 description 12
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 12
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 12
- 108090000340 Transaminases Proteins 0.000 description 12
- 108010044940 alanylglutamine Proteins 0.000 description 12
- 108010062796 arginyllysine Proteins 0.000 description 12
- 108010047857 aspartylglycine Proteins 0.000 description 12
- VNWKTOKETHGBQD-UHFFFAOYSA-N methane Chemical compound C VNWKTOKETHGBQD-UHFFFAOYSA-N 0.000 description 12
- 108010079364 N-glycylalanine Proteins 0.000 description 11
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 11
- 108010038633 aspartylglutamate Proteins 0.000 description 11
- 239000008280 blood Substances 0.000 description 11
- 210000004369 blood Anatomy 0.000 description 11
- 108010037850 glycylvaline Proteins 0.000 description 11
- 108010073969 valyllysine Proteins 0.000 description 11
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 10
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 10
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 10
- 229940024606 amino acid Drugs 0.000 description 10
- 235000001014 amino acid Nutrition 0.000 description 10
- 108010013835 arginine glutamate Proteins 0.000 description 10
- 108010015792 glycyllysine Proteins 0.000 description 10
- 108010064235 lysylglycine Proteins 0.000 description 10
- 108010053725 prolylvaline Proteins 0.000 description 10
- 239000013598 vector Substances 0.000 description 10
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 9
- 108020004705 Codon Proteins 0.000 description 9
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 9
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 9
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 9
- 230000015572 biosynthetic process Effects 0.000 description 9
- 230000037430 deletion Effects 0.000 description 9
- 238000012217 deletion Methods 0.000 description 9
- 238000012239 gene modification Methods 0.000 description 9
- 230000005017 genetic modification Effects 0.000 description 9
- 235000013617 genetically modified food Nutrition 0.000 description 9
- 108010089804 glycyl-threonine Proteins 0.000 description 9
- 108010070643 prolylglutamic acid Proteins 0.000 description 9
- 239000002028 Biomass Substances 0.000 description 8
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 8
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 8
- GPAHWYRSHCKICP-GUBZILKMSA-N Met-Glu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GPAHWYRSHCKICP-GUBZILKMSA-N 0.000 description 8
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 8
- 230000012010 growth Effects 0.000 description 8
- 108010040030 histidinoalanine Proteins 0.000 description 8
- 239000013642 negative control Substances 0.000 description 8
- 238000005457 optimization Methods 0.000 description 8
- 229910052760 oxygen Inorganic materials 0.000 description 8
- 239000001301 oxygen Substances 0.000 description 8
- 108010012581 phenylalanylglutamate Proteins 0.000 description 8
- 108091033319 polynucleotide Proteins 0.000 description 8
- 239000002157 polynucleotide Substances 0.000 description 8
- 102000040430 polynucleotide Human genes 0.000 description 8
- 108010080629 tryptophan-leucine Proteins 0.000 description 8
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 7
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 7
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 7
- 108010071536 Citrate (Si)-synthase Proteins 0.000 description 7
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 7
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 7
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 7
- 108010068265 aspartyltyrosine Proteins 0.000 description 7
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 7
- AGOYDEPGAOXOCK-KCBOHYOISA-N clarithromycin Chemical compound O([C@@H]1[C@@H](C)C(=O)O[C@@H]([C@@]([C@H](O)[C@@H](C)C(=O)[C@H](C)C[C@](C)([C@H](O[C@H]2[C@@H]([C@H](C[C@@H](C)O2)N(C)C)O)[C@H]1C)OC)(C)O)CC)[C@H]1C[C@@](C)(OC)[C@@H](O)[C@H](C)O1 AGOYDEPGAOXOCK-KCBOHYOISA-N 0.000 description 7
- 229960002626 clarithromycin Drugs 0.000 description 7
- 230000021615 conjugation Effects 0.000 description 7
- 229960003276 erythromycin Drugs 0.000 description 7
- 230000006870 function Effects 0.000 description 7
- 108010087823 glycyltyrosine Proteins 0.000 description 7
- 108010092114 histidylphenylalanine Proteins 0.000 description 7
- 108010056582 methionylglutamic acid Proteins 0.000 description 7
- 230000000813 microbial effect Effects 0.000 description 7
- 108010051242 phenylalanylserine Proteins 0.000 description 7
- 230000008569 process Effects 0.000 description 7
- 108010031719 prolyl-serine Proteins 0.000 description 7
- 108010015796 prolylisoleucine Proteins 0.000 description 7
- 108010038745 tryptophylglycine Proteins 0.000 description 7
- 108010030844 2-methylcitrate synthase Proteins 0.000 description 6
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 6
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 6
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 6
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 6
- 241000894006 Bacteria Species 0.000 description 6
- UGFAIRIUMAVXCW-UHFFFAOYSA-N Carbon monoxide Chemical compound [O+]#[C-] UGFAIRIUMAVXCW-UHFFFAOYSA-N 0.000 description 6
- 102000006732 Citrate synthase Human genes 0.000 description 6
- SRBFZHDQGSBBOR-IOVATXLUSA-N D-xylopyranose Chemical compound O[C@@H]1COC(O)[C@H](O)[C@H]1O SRBFZHDQGSBBOR-IOVATXLUSA-N 0.000 description 6
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 6
- YSDLIYZLOTZZNP-UWVGGRQHSA-N Gly-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN YSDLIYZLOTZZNP-UWVGGRQHSA-N 0.000 description 6
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 6
- SCHZQZPYHBWYEQ-PEFMBERDSA-N Ile-Asn-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SCHZQZPYHBWYEQ-PEFMBERDSA-N 0.000 description 6
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 6
- GODBLDDYHFTUAH-CIUDSAMLSA-N Met-Asp-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O GODBLDDYHFTUAH-CIUDSAMLSA-N 0.000 description 6
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 6
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 6
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 6
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 6
- 102100026842 Serine-pyruvate aminotransferase Human genes 0.000 description 6
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 6
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 6
- UUYCNAXCCDNULB-QXEWZRGKSA-N Val-Arg-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O UUYCNAXCCDNULB-QXEWZRGKSA-N 0.000 description 6
- GMOLURHJBLOBFW-ONGXEEELSA-N Val-Gly-His Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GMOLURHJBLOBFW-ONGXEEELSA-N 0.000 description 6
- 235000004279 alanine Nutrition 0.000 description 6
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 6
- 108010060035 arginylproline Proteins 0.000 description 6
- 108010077245 asparaginyl-proline Proteins 0.000 description 6
- 108010092854 aspartyllysine Proteins 0.000 description 6
- 108010054812 diprotin A Proteins 0.000 description 6
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 6
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 6
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 6
- 108010020688 glycylhistidine Proteins 0.000 description 6
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 6
- 108010012058 leucyltyrosine Proteins 0.000 description 6
- 108010005942 methionylglycine Proteins 0.000 description 6
- 235000016709 nutrition Nutrition 0.000 description 6
- 108010029020 prolylglycine Proteins 0.000 description 6
- 230000002829 reductive effect Effects 0.000 description 6
- 108010048818 seryl-histidine Proteins 0.000 description 6
- 108010026333 seryl-proline Proteins 0.000 description 6
- 235000000346 sugar Nutrition 0.000 description 6
- 238000003786 synthesis reaction Methods 0.000 description 6
- 238000012546 transfer Methods 0.000 description 6
- 108010000998 wheylin-2 peptide Proteins 0.000 description 6
- QTBSBXVTEAMEQO-UHFFFAOYSA-M Acetate Chemical compound CC([O-])=O QTBSBXVTEAMEQO-UHFFFAOYSA-M 0.000 description 5
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 5
- DWINFPQUSSHSFS-UVBJJODRSA-N Ala-Arg-Trp Chemical compound N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C12)C(=O)O DWINFPQUSSHSFS-UVBJJODRSA-N 0.000 description 5
- WYPUMLRSQMKIJU-BPNCWPANSA-N Ala-Arg-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WYPUMLRSQMKIJU-BPNCWPANSA-N 0.000 description 5
- FVSOUJZKYWEFOB-KBIXCLLPSA-N Ala-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)N FVSOUJZKYWEFOB-KBIXCLLPSA-N 0.000 description 5
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 5
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 5
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 5
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 5
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 5
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 5
- RDLYUKRPEJERMM-XIRDDKMYSA-N Asn-Trp-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O RDLYUKRPEJERMM-XIRDDKMYSA-N 0.000 description 5
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 5
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 5
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 5
- 102100034229 Citramalyl-CoA lyase, mitochondrial Human genes 0.000 description 5
- GNMQDOGFWYWPNM-LAEOZQHASA-N Gln-Gly-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)CNC(=O)[C@@H](N)CCC(N)=O)C(O)=O GNMQDOGFWYWPNM-LAEOZQHASA-N 0.000 description 5
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 5
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 5
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 5
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 5
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 5
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 5
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 5
- KZTLOHBDLMIFSH-XVYDVKMFSA-N His-Ala-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O KZTLOHBDLMIFSH-XVYDVKMFSA-N 0.000 description 5
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 5
- QHGBCRCMBCWMBJ-UHFFFAOYSA-N Ile-Glu-Ala-Lys Natural products CCC(C)C(N)C(=O)NC(CCC(O)=O)C(=O)NC(C)C(=O)NC(C(O)=O)CCCCN QHGBCRCMBCWMBJ-UHFFFAOYSA-N 0.000 description 5
- 108010044467 Isoenzymes Proteins 0.000 description 5
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 5
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 5
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 5
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 5
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 5
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 5
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 5
- 108020004687 Malate Synthase Proteins 0.000 description 5
- NMKJFZCBCIUYHI-UHFFFAOYSA-N N-(1,3-benzodioxol-5-ylmethyl)-2,6-dichlorobenzamide Chemical compound ClC1=CC=CC(Cl)=C1C(=O)NCC1=CC=C(OCO2)C2=C1 NMKJFZCBCIUYHI-UHFFFAOYSA-N 0.000 description 5
- ULECEJGNDHWSKD-QEJZJMRPSA-N Phe-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 ULECEJGNDHWSKD-QEJZJMRPSA-N 0.000 description 5
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 5
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 5
- DGHFNYXVIXNNMC-GUBZILKMSA-N Ser-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N DGHFNYXVIXNNMC-GUBZILKMSA-N 0.000 description 5
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 5
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 5
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 5
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 5
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 5
- 108010070944 alanylhistidine Proteins 0.000 description 5
- 230000001651 autotrophic effect Effects 0.000 description 5
- 230000008901 benefit Effects 0.000 description 5
- 150000002009 diols Chemical class 0.000 description 5
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 5
- 238000004821 distillation Methods 0.000 description 5
- 238000002309 gasification Methods 0.000 description 5
- 108010036413 histidylglycine Proteins 0.000 description 5
- 108010025306 histidylleucine Proteins 0.000 description 5
- 229910052739 hydrogen Inorganic materials 0.000 description 5
- 230000001965 increasing effect Effects 0.000 description 5
- 108091022889 lactaldehyde reductase Proteins 0.000 description 5
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 5
- 108090000765 processed proteins & peptides Proteins 0.000 description 5
- 239000004055 small Interfering RNA Substances 0.000 description 5
- 238000013519 translation Methods 0.000 description 5
- 108010051110 tyrosyl-lysine Proteins 0.000 description 5
- UGLPMYSCWHTZQU-AUTRQRHGSA-N Ala-Ala-Tyr Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UGLPMYSCWHTZQU-AUTRQRHGSA-N 0.000 description 4
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 4
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 4
- UHMQKOBNPRAZGB-CIUDSAMLSA-N Ala-Glu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N UHMQKOBNPRAZGB-CIUDSAMLSA-N 0.000 description 4
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 4
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 4
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 4
- JPOQZCHGOTWRTM-FQPOAREZSA-N Ala-Tyr-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPOQZCHGOTWRTM-FQPOAREZSA-N 0.000 description 4
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 4
- OCOZPTHLDVSFCZ-BPUTZDHNSA-N Arg-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N OCOZPTHLDVSFCZ-BPUTZDHNSA-N 0.000 description 4
- JOADBFCFJGNIKF-GUBZILKMSA-N Arg-Met-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O JOADBFCFJGNIKF-GUBZILKMSA-N 0.000 description 4
- INOIAEUXVVNJKA-XGEHTFHBSA-N Arg-Thr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O INOIAEUXVVNJKA-XGEHTFHBSA-N 0.000 description 4
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 4
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 4
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 4
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 4
- JZLFYAAGGYMRIK-BYULHYEWSA-N Asn-Val-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O JZLFYAAGGYMRIK-BYULHYEWSA-N 0.000 description 4
- SDHFVYLZFBDSQT-DCAQKATOSA-N Asp-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N SDHFVYLZFBDSQT-DCAQKATOSA-N 0.000 description 4
- NYLBGYLHBDFRHL-VEVYYDQMSA-N Asp-Arg-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NYLBGYLHBDFRHL-VEVYYDQMSA-N 0.000 description 4
- 235000014469 Bacillus subtilis Nutrition 0.000 description 4
- 108010090461 DFG peptide Proteins 0.000 description 4
- SXGMGNZEHFORAV-IUCAKERBSA-N Gln-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXGMGNZEHFORAV-IUCAKERBSA-N 0.000 description 4
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 4
- QPRZKNOOOBWXSU-CIUDSAMLSA-N Glu-Asp-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N QPRZKNOOOBWXSU-CIUDSAMLSA-N 0.000 description 4
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 4
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 4
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 4
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 4
- WVYJNPCWJYBHJG-YVNDNENWSA-N Glu-Ile-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O WVYJNPCWJYBHJG-YVNDNENWSA-N 0.000 description 4
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 4
- CQAHWYDHKUWYIX-YUMQZZPRSA-N Glu-Pro-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O CQAHWYDHKUWYIX-YUMQZZPRSA-N 0.000 description 4
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 4
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 4
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 4
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 4
- HFXJIZNEXNIZIJ-BQBZGAKWSA-N Gly-Glu-Gln Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFXJIZNEXNIZIJ-BQBZGAKWSA-N 0.000 description 4
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 4
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 4
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 4
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 4
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 4
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 4
- GNNJKUYDWFIBTK-QWRGUYRKSA-N Gly-Tyr-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O GNNJKUYDWFIBTK-QWRGUYRKSA-N 0.000 description 4
- MUGLKCQHTUFLGF-WPRPVWTQSA-N Gly-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)CN MUGLKCQHTUFLGF-WPRPVWTQSA-N 0.000 description 4
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 4
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 4
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 4
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 4
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 4
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 4
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 4
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 4
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 4
- KXODZBLFVFSLAI-AVGNSLFASA-N Leu-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KXODZBLFVFSLAI-AVGNSLFASA-N 0.000 description 4
- XBCWOTOCBXXJDG-BZSNNMDCSA-N Leu-His-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 XBCWOTOCBXXJDG-BZSNNMDCSA-N 0.000 description 4
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 4
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 4
- BJWKOATWNQJPSK-SRVKXCTJSA-N Leu-Met-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BJWKOATWNQJPSK-SRVKXCTJSA-N 0.000 description 4
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 4
- JVTYXRRFZCEPPK-RHYQMDGZSA-N Leu-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)N)O JVTYXRRFZCEPPK-RHYQMDGZSA-N 0.000 description 4
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 4
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 4
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 4
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 4
- WPTHAGXMYDRPFD-SRVKXCTJSA-N Met-Lys-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O WPTHAGXMYDRPFD-SRVKXCTJSA-N 0.000 description 4
- AXHNAGAYRGCDLG-UWVGGRQHSA-N Met-Lys-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AXHNAGAYRGCDLG-UWVGGRQHSA-N 0.000 description 4
- LRHPLDYGYMQRHN-UHFFFAOYSA-N N-Butanol Chemical compound CCCCO LRHPLDYGYMQRHN-UHFFFAOYSA-N 0.000 description 4
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 4
- MGBRZXXGQBAULP-DRZSPHRISA-N Phe-Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGBRZXXGQBAULP-DRZSPHRISA-N 0.000 description 4
- MPFGIYLYWUCSJG-AVGNSLFASA-N Phe-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MPFGIYLYWUCSJG-AVGNSLFASA-N 0.000 description 4
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 4
- 241001570392 Sedimenticola thiotaurini Species 0.000 description 4
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 4
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 4
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 4
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 4
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 4
- ZTPXSEUVYNNZRB-CDMKHQONSA-N Thr-Gly-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZTPXSEUVYNNZRB-CDMKHQONSA-N 0.000 description 4
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 4
- HDSKHCBAVVWPCQ-FHWLQOOXSA-N Tyr-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HDSKHCBAVVWPCQ-FHWLQOOXSA-N 0.000 description 4
- BYOHPUZJVXWHAE-BYULHYEWSA-N Val-Asn-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BYOHPUZJVXWHAE-BYULHYEWSA-N 0.000 description 4
- VUTHNLMCXKLLFI-LAEOZQHASA-N Val-Asp-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VUTHNLMCXKLLFI-LAEOZQHASA-N 0.000 description 4
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 4
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 4
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 4
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 4
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 4
- 108010041407 alanylaspartic acid Proteins 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 4
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 4
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 4
- 108010093581 aspartyl-proline Proteins 0.000 description 4
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 4
- 150000001720 carbohydrates Chemical class 0.000 description 4
- 235000014633 carbohydrates Nutrition 0.000 description 4
- 238000010276 construction Methods 0.000 description 4
- 238000006114 decarboxylation reaction Methods 0.000 description 4
- 239000000835 fiber Substances 0.000 description 4
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 4
- 239000001963 growth medium Substances 0.000 description 4
- 108010028295 histidylhistidine Proteins 0.000 description 4
- 108010085325 histidylproline Proteins 0.000 description 4
- 230000037431 insertion Effects 0.000 description 4
- 238000003780 insertion Methods 0.000 description 4
- 239000007788 liquid Substances 0.000 description 4
- 108010003700 lysyl aspartic acid Proteins 0.000 description 4
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 4
- 108010054155 lysyllysine Proteins 0.000 description 4
- 108020004999 messenger RNA Proteins 0.000 description 4
- 108010034507 methionyltryptophan Proteins 0.000 description 4
- 238000005373 pervaporation Methods 0.000 description 4
- 239000012071 phase Substances 0.000 description 4
- 108010024607 phenylalanylalanine Proteins 0.000 description 4
- 108010073101 phenylalanylleucine Proteins 0.000 description 4
- 108010004914 prolylarginine Proteins 0.000 description 4
- 238000011084 recovery Methods 0.000 description 4
- 108010071207 serylmethionine Proteins 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- 239000006228 supernatant Substances 0.000 description 4
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 4
- 230000009466 transformation Effects 0.000 description 4
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 3
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 3
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 description 3
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 3
- MBWYUTNBYSSUIQ-HERUPUMHSA-N Ala-Asn-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N MBWYUTNBYSSUIQ-HERUPUMHSA-N 0.000 description 3
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 3
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 3
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 3
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 3
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 3
- NIZKGBJVCMRDKO-KWQFWETISA-N Ala-Gly-Tyr Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NIZKGBJVCMRDKO-KWQFWETISA-N 0.000 description 3
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 3
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 3
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 3
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 3
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 3
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 3
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 3
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 3
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 3
- OQWQTGBOFPJOIF-DLOVCJGASA-N Ala-Lys-His Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N OQWQTGBOFPJOIF-DLOVCJGASA-N 0.000 description 3
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 3
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 3
- DEWWPUNXRNGMQN-LPEHRKFASA-N Ala-Met-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N DEWWPUNXRNGMQN-LPEHRKFASA-N 0.000 description 3
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 3
- CNQAFFMNJIQYGX-DRZSPHRISA-N Ala-Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 CNQAFFMNJIQYGX-DRZSPHRISA-N 0.000 description 3
- IHMCQESUJVZTKW-UBHSHLNASA-N Ala-Phe-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 IHMCQESUJVZTKW-UBHSHLNASA-N 0.000 description 3
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 3
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 3
- AOAKQKVICDWCLB-UWJYBYFXSA-N Ala-Tyr-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AOAKQKVICDWCLB-UWJYBYFXSA-N 0.000 description 3
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 3
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 3
- OBFTYSPXDRROQO-SRVKXCTJSA-N Arg-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCN=C(N)N OBFTYSPXDRROQO-SRVKXCTJSA-N 0.000 description 3
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 3
- CVKOQHYVDVYJSI-QTKMDUPCSA-N Arg-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N)O CVKOQHYVDVYJSI-QTKMDUPCSA-N 0.000 description 3
- UAOSDDXCTBIPCA-QXEWZRGKSA-N Arg-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UAOSDDXCTBIPCA-QXEWZRGKSA-N 0.000 description 3
- JCROZIFVIYMXHM-GUBZILKMSA-N Arg-Met-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCCN=C(N)N JCROZIFVIYMXHM-GUBZILKMSA-N 0.000 description 3
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 3
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 description 3
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 3
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 description 3
- MYRLSKYSMXNLLA-LAEOZQHASA-N Asn-Val-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MYRLSKYSMXNLLA-LAEOZQHASA-N 0.000 description 3
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 3
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 3
- QHAJMRDEWNAIBQ-FXQIFTODSA-N Asp-Arg-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O QHAJMRDEWNAIBQ-FXQIFTODSA-N 0.000 description 3
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 3
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 3
- LDGUZSIPGSPBJP-XVYDVKMFSA-N Asp-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N LDGUZSIPGSPBJP-XVYDVKMFSA-N 0.000 description 3
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 3
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 3
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 3
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 3
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 3
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 3
- 241001112696 Clostridia Species 0.000 description 3
- 241000193401 Clostridium acetobutylicum Species 0.000 description 3
- XABFFGOGKOORCG-CIUDSAMLSA-N Cys-Asp-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XABFFGOGKOORCG-CIUDSAMLSA-N 0.000 description 3
- WZJLBUPPZRZNTO-CIUDSAMLSA-N Cys-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N WZJLBUPPZRZNTO-CIUDSAMLSA-N 0.000 description 3
- IKDOHQHEFPPGJG-FXQIFTODSA-N Gln-Asp-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IKDOHQHEFPPGJG-FXQIFTODSA-N 0.000 description 3
- JXBZEDIQFFCHPZ-PEFMBERDSA-N Gln-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JXBZEDIQFFCHPZ-PEFMBERDSA-N 0.000 description 3
- YRWWJCDWLVXTHN-LAEOZQHASA-N Gln-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N YRWWJCDWLVXTHN-LAEOZQHASA-N 0.000 description 3
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 3
- PIUPHASDUFSHTF-CIUDSAMLSA-N Gln-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O PIUPHASDUFSHTF-CIUDSAMLSA-N 0.000 description 3
- YJSCHRBERYWPQL-DCAQKATOSA-N Gln-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N YJSCHRBERYWPQL-DCAQKATOSA-N 0.000 description 3
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 3
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 3
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 3
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 description 3
- SBYVDRJAXWSXQL-AVGNSLFASA-N Glu-Asn-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SBYVDRJAXWSXQL-AVGNSLFASA-N 0.000 description 3
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 description 3
- UENPHLAAKDPZQY-XKBZYTNZSA-N Glu-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N)O UENPHLAAKDPZQY-XKBZYTNZSA-N 0.000 description 3
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 3
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 3
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 3
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 3
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 3
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 3
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 3
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 3
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 3
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 3
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 3
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 3
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 3
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 3
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 3
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 3
- LUJVWKKYHSLULQ-ZKWXMUAHSA-N Gly-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN LUJVWKKYHSLULQ-ZKWXMUAHSA-N 0.000 description 3
- HKSNHPVETYYJBK-LAEOZQHASA-N Gly-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN HKSNHPVETYYJBK-LAEOZQHASA-N 0.000 description 3
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 3
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 3
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 3
- HJARVELKOSZUEW-YUMQZZPRSA-N Gly-Pro-Gln Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJARVELKOSZUEW-YUMQZZPRSA-N 0.000 description 3
- UVTSZKIATYSKIR-RYUDHWBXSA-N Gly-Tyr-Glu Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O UVTSZKIATYSKIR-RYUDHWBXSA-N 0.000 description 3
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 3
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 3
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 3
- 108010038519 Glyoxylate reductase Proteins 0.000 description 3
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 3
- PDSUIXMZYNURGI-AVGNSLFASA-N His-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 PDSUIXMZYNURGI-AVGNSLFASA-N 0.000 description 3
- VFBZWZXKCVBTJR-SRVKXCTJSA-N His-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VFBZWZXKCVBTJR-SRVKXCTJSA-N 0.000 description 3
- OQDLKDUVMTUPPG-AVGNSLFASA-N His-Leu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OQDLKDUVMTUPPG-AVGNSLFASA-N 0.000 description 3
- UWSMZKRTOZEGDD-CUJWVEQBSA-N His-Thr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O UWSMZKRTOZEGDD-CUJWVEQBSA-N 0.000 description 3
- QICVAHODWHIWIS-HTFCKZLJSA-N Ile-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N QICVAHODWHIWIS-HTFCKZLJSA-N 0.000 description 3
- DPTBVFUDCPINIP-JURCDPSOSA-N Ile-Ala-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DPTBVFUDCPINIP-JURCDPSOSA-N 0.000 description 3
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 3
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 3
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 3
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 3
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 3
- NBJAAWYRLGCJOF-UGYAYLCHSA-N Ile-Asp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NBJAAWYRLGCJOF-UGYAYLCHSA-N 0.000 description 3
- FFAUOCITXBMRBT-YTFOTSKYSA-N Ile-Lys-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FFAUOCITXBMRBT-YTFOTSKYSA-N 0.000 description 3
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 3
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 3
- 108010065920 Insulin Lispro Proteins 0.000 description 3
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 3
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 3
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 3
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 3
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 3
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 3
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 3
- KUEVMUXNILMJTK-JYJNAYRXSA-N Leu-Gln-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KUEVMUXNILMJTK-JYJNAYRXSA-N 0.000 description 3
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 3
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 3
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 3
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 3
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 3
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 3
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 3
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 3
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 3
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 3
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 3
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 3
- LMDVGHQPPPLYAR-IHRRRGAJSA-N Leu-Val-His Chemical compound N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O LMDVGHQPPPLYAR-IHRRRGAJSA-N 0.000 description 3
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 3
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 3
- GQUDMNDPQTXZRV-DCAQKATOSA-N Lys-Arg-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GQUDMNDPQTXZRV-DCAQKATOSA-N 0.000 description 3
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 3
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 3
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 3
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 3
- JZMGVXLDOQOKAH-UWVGGRQHSA-N Lys-Gly-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O JZMGVXLDOQOKAH-UWVGGRQHSA-N 0.000 description 3
- HQXSFFSLXFHWOX-IXOXFDKPSA-N Lys-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N)O HQXSFFSLXFHWOX-IXOXFDKPSA-N 0.000 description 3
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 3
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 3
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 3
- WLXGMVVHTIUPHE-ULQDDVLXSA-N Lys-Phe-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O WLXGMVVHTIUPHE-ULQDDVLXSA-N 0.000 description 3
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 3
- RQILLQOQXLZTCK-KBPBESRZSA-N Lys-Tyr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O RQILLQOQXLZTCK-KBPBESRZSA-N 0.000 description 3
- QLFAPXUXEBAWEK-NHCYSSNCSA-N Lys-Val-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QLFAPXUXEBAWEK-NHCYSSNCSA-N 0.000 description 3
- XABXVVSWUVCZST-GVXVVHGQSA-N Lys-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN XABXVVSWUVCZST-GVXVVHGQSA-N 0.000 description 3
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 3
- UYAKZHGIPRCGPF-CIUDSAMLSA-N Met-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N UYAKZHGIPRCGPF-CIUDSAMLSA-N 0.000 description 3
- MTBVQFFQMXHCPC-CIUDSAMLSA-N Met-Glu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MTBVQFFQMXHCPC-CIUDSAMLSA-N 0.000 description 3
- JPCHYAUKOUGOIB-HJGDQZAQSA-N Met-Glu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPCHYAUKOUGOIB-HJGDQZAQSA-N 0.000 description 3
- BCRQJDMZQUHQSV-STQMWFEESA-N Met-Gly-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BCRQJDMZQUHQSV-STQMWFEESA-N 0.000 description 3
- RIIFMEBFDDXGCV-VEVYYDQMSA-N Met-Thr-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O RIIFMEBFDDXGCV-VEVYYDQMSA-N 0.000 description 3
- 241000460112 Murella Species 0.000 description 3
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 3
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 3
- XJLXINKUBYWONI-NNYOXOHSSA-O NADP(+) Chemical compound NC(=O)C1=CC=C[N+]([C@H]2[C@@H]([C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](OP(O)(O)=O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 XJLXINKUBYWONI-NNYOXOHSSA-O 0.000 description 3
- WSXKXSBOJXEZDV-DLOVCJGASA-N Phe-Ala-Asn Chemical compound NC(=O)C[C@@H](C([O-])=O)NC(=O)[C@H](C)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 WSXKXSBOJXEZDV-DLOVCJGASA-N 0.000 description 3
- WKTSCAXSYITIJJ-PCBIJLKTSA-N Phe-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O WKTSCAXSYITIJJ-PCBIJLKTSA-N 0.000 description 3
- PEFJUUYFEGBXFA-BZSNNMDCSA-N Phe-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 PEFJUUYFEGBXFA-BZSNNMDCSA-N 0.000 description 3
- 239000002202 Polyethylene glycol Substances 0.000 description 3
- NHDVNAKDACFHPX-GUBZILKMSA-N Pro-Arg-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O NHDVNAKDACFHPX-GUBZILKMSA-N 0.000 description 3
- JARJPEMLQAWNBR-GUBZILKMSA-N Pro-Asp-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JARJPEMLQAWNBR-GUBZILKMSA-N 0.000 description 3
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 3
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 3
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 3
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 3
- PUQRDHNIOONJJN-AVGNSLFASA-N Pro-Lys-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O PUQRDHNIOONJJN-AVGNSLFASA-N 0.000 description 3
- HBBBLSVBQGZKOZ-GUBZILKMSA-N Pro-Met-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O HBBBLSVBQGZKOZ-GUBZILKMSA-N 0.000 description 3
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 3
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 3
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 3
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 3
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 3
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 3
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 3
- XXNYYSXNXCJYKX-DCAQKATOSA-N Ser-Leu-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O XXNYYSXNXCJYKX-DCAQKATOSA-N 0.000 description 3
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 3
- NIOYDASGXWLHEZ-CIUDSAMLSA-N Ser-Met-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O NIOYDASGXWLHEZ-CIUDSAMLSA-N 0.000 description 3
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 3
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 3
- 108020004459 Small interfering RNA Proteins 0.000 description 3
- 241000605176 Thermithiobacillus tepidarius Species 0.000 description 3
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 3
- NFMPFBCXABPALN-OWLDWWDNSA-N Thr-Ala-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O NFMPFBCXABPALN-OWLDWWDNSA-N 0.000 description 3
- XSEPSRUDSPHMPX-KATARQTJSA-N Thr-Lys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O XSEPSRUDSPHMPX-KATARQTJSA-N 0.000 description 3
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 3
- FOAJSVIXYCLTSC-PJODQICGSA-N Trp-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N FOAJSVIXYCLTSC-PJODQICGSA-N 0.000 description 3
- PALLCTDPFINNMM-JQHSSLGASA-N Trp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N PALLCTDPFINNMM-JQHSSLGASA-N 0.000 description 3
- FJBCEFPCVPHPPM-STECZYCISA-N Tyr-Ile-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O FJBCEFPCVPHPPM-STECZYCISA-N 0.000 description 3
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 3
- LVFZXRQQQDTBQH-IRIUXVKKSA-N Tyr-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LVFZXRQQQDTBQH-IRIUXVKKSA-N 0.000 description 3
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 3
- JFAWZADYPRMRCO-UBHSHLNASA-N Val-Ala-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JFAWZADYPRMRCO-UBHSHLNASA-N 0.000 description 3
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 3
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 3
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 3
- DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 3
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 3
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 3
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 3
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 3
- ZXYPHBKIZLAQTL-QXEWZRGKSA-N Val-Pro-Asp Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZXYPHBKIZLAQTL-QXEWZRGKSA-N 0.000 description 3
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 3
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 3
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 3
- 239000002253 acid Substances 0.000 description 3
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 3
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 3
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 3
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 3
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 3
- BTANRVKWQNVYAZ-UHFFFAOYSA-N butan-2-ol Chemical compound CCC(C)O BTANRVKWQNVYAZ-UHFFFAOYSA-N 0.000 description 3
- 230000003247 decreasing effect Effects 0.000 description 3
- 108010054813 diprotin B Proteins 0.000 description 3
- 238000004134 energy conservation Methods 0.000 description 3
- 238000010353 genetic engineering Methods 0.000 description 3
- 239000008103 glucose Substances 0.000 description 3
- 108010079547 glutamylmethionine Proteins 0.000 description 3
- 108010081551 glycylphenylalanine Proteins 0.000 description 3
- 108010077515 glycylproline Proteins 0.000 description 3
- 108010018006 histidylserine Proteins 0.000 description 3
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 3
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 3
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 3
- 239000007791 liquid phase Substances 0.000 description 3
- 108010038320 lysylphenylalanine Proteins 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 230000002503 metabolic effect Effects 0.000 description 3
- 230000004060 metabolic process Effects 0.000 description 3
- 108010085203 methionylmethionine Proteins 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 229910052757 nitrogen Inorganic materials 0.000 description 3
- 239000012466 permeate Substances 0.000 description 3
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 3
- 229920001223 polyethylene glycol Polymers 0.000 description 3
- 229920001184 polypeptide Polymers 0.000 description 3
- 102000004196 processed proteins & peptides Human genes 0.000 description 3
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 3
- BDERNNFJNOPAEC-UHFFFAOYSA-N propan-1-ol Chemical compound CCCO BDERNNFJNOPAEC-UHFFFAOYSA-N 0.000 description 3
- 238000001223 reverse osmosis Methods 0.000 description 3
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 3
- 238000006467 substitution reaction Methods 0.000 description 3
- 150000008163 sugars Chemical class 0.000 description 3
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- 108010027345 wheylin-1 peptide Proteins 0.000 description 3
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- PUPZLCDOIYMWBV-UHFFFAOYSA-N (+/-)-1,3-Butanediol Chemical compound CC(O)CCO PUPZLCDOIYMWBV-UHFFFAOYSA-N 0.000 description 2
- JNTMAZFVYNDPLB-PEDHHIEDSA-N (2S,3S)-2-[[[(2S)-1-[(2S,3S)-2-amino-3-methyl-1-oxopentyl]-2-pyrrolidinyl]-oxomethyl]amino]-3-methylpentanoic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNTMAZFVYNDPLB-PEDHHIEDSA-N 0.000 description 2
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 2
- AZQWKYJCGOJGHM-UHFFFAOYSA-N 1,4-benzoquinone Chemical compound O=C1C=CC(=O)C=C1 AZQWKYJCGOJGHM-UHFFFAOYSA-N 0.000 description 2
- SCVJRXQHFJXZFZ-KVQBGUIXSA-N 2-amino-9-[(2r,4s,5r)-4-hydroxy-5-(hydroxymethyl)oxolan-2-yl]-3h-purine-6-thione Chemical compound C1=2NC(N)=NC(=S)C=2N=CN1[C@H]1C[C@H](O)[C@@H](CO)O1 SCVJRXQHFJXZFZ-KVQBGUIXSA-N 0.000 description 2
- ALRHLSYJTWAHJZ-UHFFFAOYSA-M 3-hydroxypropionate Chemical compound OCCC([O-])=O ALRHLSYJTWAHJZ-UHFFFAOYSA-M 0.000 description 2
- 229920001817 Agar Polymers 0.000 description 2
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 2
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 2
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 2
- VBDMWOKJZDCFJM-FXQIFTODSA-N Ala-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N VBDMWOKJZDCFJM-FXQIFTODSA-N 0.000 description 2
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 2
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 2
- SDMAQFGBPOJFOM-GUBZILKMSA-N Ala-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SDMAQFGBPOJFOM-GUBZILKMSA-N 0.000 description 2
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 2
- ZEXDYVGDZJBRMO-ACZMJKKPSA-N Ala-Asn-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZEXDYVGDZJBRMO-ACZMJKKPSA-N 0.000 description 2
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 2
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 2
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 2
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 2
- NKJBKNVQHBZUIX-ACZMJKKPSA-N Ala-Gln-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKJBKNVQHBZUIX-ACZMJKKPSA-N 0.000 description 2
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 2
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 2
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 2
- ZPXCNXMJEZKRLU-LSJOCFKGSA-N Ala-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 ZPXCNXMJEZKRLU-LSJOCFKGSA-N 0.000 description 2
- HUUOZYZWNCXTFK-INTQDDNPSA-N Ala-His-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N HUUOZYZWNCXTFK-INTQDDNPSA-N 0.000 description 2
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 2
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 2
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 2
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 2
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 2
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 2
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 2
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 2
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 2
- DWYROCSXOOMOEU-CIUDSAMLSA-N Ala-Met-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DWYROCSXOOMOEU-CIUDSAMLSA-N 0.000 description 2
- OMDNCNKNEGFOMM-BQBZGAKWSA-N Ala-Met-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O OMDNCNKNEGFOMM-BQBZGAKWSA-N 0.000 description 2
- KYDYGANDJHFBCW-DRZSPHRISA-N Ala-Phe-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KYDYGANDJHFBCW-DRZSPHRISA-N 0.000 description 2
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 2
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 2
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 2
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 2
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 2
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 2
- VYMJAWXRWHJIMS-LKTVYLICSA-N Ala-Tyr-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VYMJAWXRWHJIMS-LKTVYLICSA-N 0.000 description 2
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 2
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 2
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 2
- DHONNEYAZPNGSG-UBHSHLNASA-N Ala-Val-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DHONNEYAZPNGSG-UBHSHLNASA-N 0.000 description 2
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 2
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 2
- 102100036475 Alanine aminotransferase 1 Human genes 0.000 description 2
- 108010082126 Alanine transaminase Proteins 0.000 description 2
- 108010058065 Aminomethyltransferase Proteins 0.000 description 2
- QGZKDVFQNNGYKY-UHFFFAOYSA-N Ammonia Chemical compound N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 description 2
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 2
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 2
- DBKNLHKEVPZVQC-LPEHRKFASA-N Arg-Ala-Pro Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O DBKNLHKEVPZVQC-LPEHRKFASA-N 0.000 description 2
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 2
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 description 2
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 2
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 2
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 2
- YUGFLWBWAJFGKY-BQBZGAKWSA-N Arg-Cys-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O YUGFLWBWAJFGKY-BQBZGAKWSA-N 0.000 description 2
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 2
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 2
- VRZDJJWOFXMFRO-ZFWWWQNUSA-N Arg-Gly-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O VRZDJJWOFXMFRO-ZFWWWQNUSA-N 0.000 description 2
- FRMQITGHXMUNDF-GMOBBJLQSA-N Arg-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FRMQITGHXMUNDF-GMOBBJLQSA-N 0.000 description 2
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 2
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 2
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 2
- OGSQONVYSTZIJB-WDSOQIARSA-N Arg-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O OGSQONVYSTZIJB-WDSOQIARSA-N 0.000 description 2
- BNYNOWJESJJIOI-XUXIUFHCSA-N Arg-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N BNYNOWJESJJIOI-XUXIUFHCSA-N 0.000 description 2
- MTYLORHAQXVQOW-AVGNSLFASA-N Arg-Lys-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O MTYLORHAQXVQOW-AVGNSLFASA-N 0.000 description 2
- ADPACBMPYWJJCE-FXQIFTODSA-N Arg-Ser-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O ADPACBMPYWJJCE-FXQIFTODSA-N 0.000 description 2
- ZUVDFJXRAICIAJ-BPUTZDHNSA-N Arg-Trp-Asp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC(O)=O)C(O)=O)=CNC2=C1 ZUVDFJXRAICIAJ-BPUTZDHNSA-N 0.000 description 2
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 2
- RZVVKNIACROXRM-ZLUOBGJFSA-N Asn-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N RZVVKNIACROXRM-ZLUOBGJFSA-N 0.000 description 2
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 2
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 2
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 2
- QQEWINYJRFBLNN-DLOVCJGASA-N Asn-Ala-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QQEWINYJRFBLNN-DLOVCJGASA-N 0.000 description 2
- IOTKDTZEEBZNCM-UGYAYLCHSA-N Asn-Asn-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOTKDTZEEBZNCM-UGYAYLCHSA-N 0.000 description 2
- XVVOVPFMILMHPX-ZLUOBGJFSA-N Asn-Asp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XVVOVPFMILMHPX-ZLUOBGJFSA-N 0.000 description 2
- QISZHYWZHJRDAO-CIUDSAMLSA-N Asn-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N QISZHYWZHJRDAO-CIUDSAMLSA-N 0.000 description 2
- BGINHSZTXRJIPP-FXQIFTODSA-N Asn-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BGINHSZTXRJIPP-FXQIFTODSA-N 0.000 description 2
- IYVSIZAXNLOKFQ-BYULHYEWSA-N Asn-Asp-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IYVSIZAXNLOKFQ-BYULHYEWSA-N 0.000 description 2
- MSBDSTRUMZFSEU-PEFMBERDSA-N Asn-Glu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MSBDSTRUMZFSEU-PEFMBERDSA-N 0.000 description 2
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 2
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 2
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 2
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 2
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 2
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 2
- RVHGJNGNKGDCPX-KKUMJFAQSA-N Asn-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N RVHGJNGNKGDCPX-KKUMJFAQSA-N 0.000 description 2
- RBOBTTLFPRSXKZ-BZSNNMDCSA-N Asn-Phe-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RBOBTTLFPRSXKZ-BZSNNMDCSA-N 0.000 description 2
- GMUOCGCDOYYWPD-FXQIFTODSA-N Asn-Pro-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O GMUOCGCDOYYWPD-FXQIFTODSA-N 0.000 description 2
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 2
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 2
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 2
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 2
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 2
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 2
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 2
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 2
- XPGVTUBABLRGHY-BIIVOSGPSA-N Asp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N XPGVTUBABLRGHY-BIIVOSGPSA-N 0.000 description 2
- OERMIMJQPQUIPK-FXQIFTODSA-N Asp-Arg-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O OERMIMJQPQUIPK-FXQIFTODSA-N 0.000 description 2
- GVPSCJQLUGIKAM-GUBZILKMSA-N Asp-Arg-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GVPSCJQLUGIKAM-GUBZILKMSA-N 0.000 description 2
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 2
- UQBGYPFHWFZMCD-ZLUOBGJFSA-N Asp-Asn-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UQBGYPFHWFZMCD-ZLUOBGJFSA-N 0.000 description 2
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 2
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 2
- LTXGDRFJRZSZAV-CIUDSAMLSA-N Asp-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N LTXGDRFJRZSZAV-CIUDSAMLSA-N 0.000 description 2
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 2
- HAFCJCDJGIOYPW-WDSKDSINSA-N Asp-Gly-Gln Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O HAFCJCDJGIOYPW-WDSKDSINSA-N 0.000 description 2
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 2
- KFAFUJMGHVVYRC-DCAQKATOSA-N Asp-Leu-Met Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O KFAFUJMGHVVYRC-DCAQKATOSA-N 0.000 description 2
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 2
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 2
- WDMNFNXKGSLIOB-GUBZILKMSA-N Asp-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N WDMNFNXKGSLIOB-GUBZILKMSA-N 0.000 description 2
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 2
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 2
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 2
- 241000194107 Bacillus megaterium Species 0.000 description 2
- 241000194110 Bacillus sp. (in: Bacteria) Species 0.000 description 2
- 244000063299 Bacillus subtilis Species 0.000 description 2
- 101100138718 Bacillus subtilis (strain 168) pucG gene Proteins 0.000 description 2
- KAKZBPTYRLMSJV-UHFFFAOYSA-N Butadiene Chemical compound C=CC=C KAKZBPTYRLMSJV-UHFFFAOYSA-N 0.000 description 2
- 101100505161 Caenorhabditis elegans mel-32 gene Proteins 0.000 description 2
- 241000193167 Clostridium cochlearium Species 0.000 description 2
- 241000186570 Clostridium kluyveri Species 0.000 description 2
- HQZGVYJBRSISDT-BQBZGAKWSA-N Cys-Gly-Arg Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQZGVYJBRSISDT-BQBZGAKWSA-N 0.000 description 2
- ZLHPWFSAUJEEAN-KBIXCLLPSA-N Cys-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N ZLHPWFSAUJEEAN-KBIXCLLPSA-N 0.000 description 2
- KCSDYJSCUWLILX-BJDJZHNGSA-N Cys-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N KCSDYJSCUWLILX-BJDJZHNGSA-N 0.000 description 2
- SRIRHERUAMYIOQ-CIUDSAMLSA-N Cys-Leu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SRIRHERUAMYIOQ-CIUDSAMLSA-N 0.000 description 2
- IQXSTXKVEMRMMB-XAVMHZPKSA-N Cys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N)O IQXSTXKVEMRMMB-XAVMHZPKSA-N 0.000 description 2
- 241000588914 Enterobacter Species 0.000 description 2
- CWYNVVGOOAEACU-UHFFFAOYSA-N Fe2+ Chemical compound [Fe+2] CWYNVVGOOAEACU-UHFFFAOYSA-N 0.000 description 2
- 229930091371 Fructose Natural products 0.000 description 2
- 239000005715 Fructose Substances 0.000 description 2
- RFSUNEUAIZKAJO-ARQDHWQXSA-N Fructose Chemical compound OC[C@H]1O[C@](O)(CO)[C@@H](O)[C@@H]1O RFSUNEUAIZKAJO-ARQDHWQXSA-N 0.000 description 2
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 2
- KZKBJEUWNMQTLV-XDTLVQLUSA-N Gln-Ala-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZKBJEUWNMQTLV-XDTLVQLUSA-N 0.000 description 2
- INFBPLSHYFALDE-ACZMJKKPSA-N Gln-Asn-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O INFBPLSHYFALDE-ACZMJKKPSA-N 0.000 description 2
- KQOPMGBHNQBCEL-HVTMNAMFSA-N Gln-His-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KQOPMGBHNQBCEL-HVTMNAMFSA-N 0.000 description 2
- HXOLDXKNWKLDMM-YVNDNENWSA-N Gln-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HXOLDXKNWKLDMM-YVNDNENWSA-N 0.000 description 2
- KKCJHBXMYYVWMX-KQXIARHKSA-N Gln-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N KKCJHBXMYYVWMX-KQXIARHKSA-N 0.000 description 2
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 2
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 2
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 2
- XIYWAJQIWLXXAF-XKBZYTNZSA-N Gln-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O XIYWAJQIWLXXAF-XKBZYTNZSA-N 0.000 description 2
- HNAUFGBKJLTWQE-IFFSRLJSSA-N Gln-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N)O HNAUFGBKJLTWQE-IFFSRLJSSA-N 0.000 description 2
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 2
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 2
- KBKGRMNVKPSQIF-XDTLVQLUSA-N Glu-Ala-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KBKGRMNVKPSQIF-XDTLVQLUSA-N 0.000 description 2
- PBEQPAZRHDVJQI-SRVKXCTJSA-N Glu-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N PBEQPAZRHDVJQI-SRVKXCTJSA-N 0.000 description 2
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 2
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 2
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 2
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 2
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 2
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 2
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 2
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 2
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 2
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 2
- IOUQWHIEQYQVFD-JYJNAYRXSA-N Glu-Leu-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IOUQWHIEQYQVFD-JYJNAYRXSA-N 0.000 description 2
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 2
- YGLCLCMAYUYZSG-AVGNSLFASA-N Glu-Lys-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 YGLCLCMAYUYZSG-AVGNSLFASA-N 0.000 description 2
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 2
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 2
- GMAGZGCAYLQBKF-NHCYSSNCSA-N Glu-Met-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O GMAGZGCAYLQBKF-NHCYSSNCSA-N 0.000 description 2
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 2
- JYXKPJVDCAWMDG-ZPFDUUQYSA-N Glu-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N JYXKPJVDCAWMDG-ZPFDUUQYSA-N 0.000 description 2
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 2
- ZTNHPMZHAILHRB-JSGCOSHPSA-N Glu-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)NCC(O)=O)=CNC2=C1 ZTNHPMZHAILHRB-JSGCOSHPSA-N 0.000 description 2
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 2
- YQPFCZVKMUVZIN-AUTRQRHGSA-N Glu-Val-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQPFCZVKMUVZIN-AUTRQRHGSA-N 0.000 description 2
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 2
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 2
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 2
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 2
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 2
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 2
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 2
- QCTLGOYODITHPQ-WHFBIAKZSA-N Gly-Cys-Ser Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O QCTLGOYODITHPQ-WHFBIAKZSA-N 0.000 description 2
- AQLHORCVPGXDJW-IUCAKERBSA-N Gly-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN AQLHORCVPGXDJW-IUCAKERBSA-N 0.000 description 2
- GNPVTZJUUBPZKW-WDSKDSINSA-N Gly-Gln-Ser Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GNPVTZJUUBPZKW-WDSKDSINSA-N 0.000 description 2
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 2
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 2
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 2
- NTOWAXLMQFKJPT-YUMQZZPRSA-N Gly-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN NTOWAXLMQFKJPT-YUMQZZPRSA-N 0.000 description 2
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 2
- JNGJGFMFXREJNF-KBPBESRZSA-N Gly-Glu-Trp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JNGJGFMFXREJNF-KBPBESRZSA-N 0.000 description 2
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 2
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 2
- QPCVIQJVRGXUSA-LURJTMIESA-N Gly-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QPCVIQJVRGXUSA-LURJTMIESA-N 0.000 description 2
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 2
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 2
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 2
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 2
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 2
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 2
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 2
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 2
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 2
- YYXJFBMCOUSYSF-RYUDHWBXSA-N Gly-Phe-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYXJFBMCOUSYSF-RYUDHWBXSA-N 0.000 description 2
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 2
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 2
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 2
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 2
- NWOSHVVPKDQKKT-RYUDHWBXSA-N Gly-Tyr-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O NWOSHVVPKDQKKT-RYUDHWBXSA-N 0.000 description 2
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 2
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 2
- NGRPGJGKJMUGDM-XVKPBYJWSA-N Gly-Val-Gln Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NGRPGJGKJMUGDM-XVKPBYJWSA-N 0.000 description 2
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 2
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 2
- AEMRFAOFKBGASW-UHFFFAOYSA-N Glycolic acid Chemical class OCC(O)=O AEMRFAOFKBGASW-UHFFFAOYSA-N 0.000 description 2
- 241000186530 Gottschalkia acidurici Species 0.000 description 2
- 229920002488 Hemicellulose Polymers 0.000 description 2
- AWHJQEYGWRKPHE-LSJOCFKGSA-N His-Ala-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AWHJQEYGWRKPHE-LSJOCFKGSA-N 0.000 description 2
- VSLXGYMEHVAJBH-DLOVCJGASA-N His-Ala-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O VSLXGYMEHVAJBH-DLOVCJGASA-N 0.000 description 2
- MVADCDSCFTXCBT-CIUDSAMLSA-N His-Asp-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MVADCDSCFTXCBT-CIUDSAMLSA-N 0.000 description 2
- NWGXCPUKPVISSJ-AVGNSLFASA-N His-Gln-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N NWGXCPUKPVISSJ-AVGNSLFASA-N 0.000 description 2
- FYTCLUIYTYFGPT-YUMQZZPRSA-N His-Gly-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FYTCLUIYTYFGPT-YUMQZZPRSA-N 0.000 description 2
- BDFCIKANUNMFGB-PMVVWTBXSA-N His-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 BDFCIKANUNMFGB-PMVVWTBXSA-N 0.000 description 2
- LBQAHBIVXQSBIR-HVTMNAMFSA-N His-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N LBQAHBIVXQSBIR-HVTMNAMFSA-N 0.000 description 2
- AIPUZFXMXAHZKY-QWRGUYRKSA-N His-Leu-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AIPUZFXMXAHZKY-QWRGUYRKSA-N 0.000 description 2
- RNMNYMDTESKEAJ-KKUMJFAQSA-N His-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 RNMNYMDTESKEAJ-KKUMJFAQSA-N 0.000 description 2
- WCHONUZTYDQMBY-PYJNHQTQSA-N His-Pro-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WCHONUZTYDQMBY-PYJNHQTQSA-N 0.000 description 2
- PGXZHYYGOPKYKM-IHRRRGAJSA-N His-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CN=CN2)N)C(=O)N[C@@H](CCCCN)C(=O)O PGXZHYYGOPKYKM-IHRRRGAJSA-N 0.000 description 2
- BRQKGRLDDDQWQJ-MBLNEYKQSA-N His-Thr-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O BRQKGRLDDDQWQJ-MBLNEYKQSA-N 0.000 description 2
- QTMKFZAYZKBFRC-BZSNNMDCSA-N His-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N)O QTMKFZAYZKBFRC-BZSNNMDCSA-N 0.000 description 2
- 108010059247 Hydroxypyruvate reductase Proteins 0.000 description 2
- 241000862983 Hyphomicrobium methylovorum Species 0.000 description 2
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 2
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 2
- DMHGKBGOUAJRHU-RVMXOQNASA-N Ile-Arg-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N DMHGKBGOUAJRHU-RVMXOQNASA-N 0.000 description 2
- HZMLFETXHFHGBB-UGYAYLCHSA-N Ile-Asn-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZMLFETXHFHGBB-UGYAYLCHSA-N 0.000 description 2
- FJWYJQRCVNGEAQ-ZPFDUUQYSA-N Ile-Asn-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N FJWYJQRCVNGEAQ-ZPFDUUQYSA-N 0.000 description 2
- AQTWDZDISVGCAC-CFMVVWHZSA-N Ile-Asp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AQTWDZDISVGCAC-CFMVVWHZSA-N 0.000 description 2
- DMZOUKXXHJQPTL-GRLWGSQLSA-N Ile-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N DMZOUKXXHJQPTL-GRLWGSQLSA-N 0.000 description 2
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 2
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 2
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 2
- UAQSZXGJGLHMNV-XEGUGMAKSA-N Ile-Gly-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N UAQSZXGJGLHMNV-XEGUGMAKSA-N 0.000 description 2
- URWXDJAEEGBADB-TUBUOCAGSA-N Ile-His-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N URWXDJAEEGBADB-TUBUOCAGSA-N 0.000 description 2
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 2
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 2
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 2
- DBXXASNNDTXOLU-MXAVVETBSA-N Ile-Leu-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DBXXASNNDTXOLU-MXAVVETBSA-N 0.000 description 2
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 2
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 2
- PWUMCBLVWPCKNO-MGHWNKPDSA-N Ile-Leu-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PWUMCBLVWPCKNO-MGHWNKPDSA-N 0.000 description 2
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 2
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 2
- UYNXBNHVWFNVIN-HJWJTTGWSA-N Ile-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 UYNXBNHVWFNVIN-HJWJTTGWSA-N 0.000 description 2
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 2
- BATWGBRIZANGPN-ZPFDUUQYSA-N Ile-Pro-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BATWGBRIZANGPN-ZPFDUUQYSA-N 0.000 description 2
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 2
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 2
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 2
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 2
- RRHGJUQNOFWUDK-UHFFFAOYSA-N Isoprene Chemical compound CC(=C)C=C RRHGJUQNOFWUDK-UHFFFAOYSA-N 0.000 description 2
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 2
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 2
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 2
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 2
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 2
- 241000736479 Lentibacillus Species 0.000 description 2
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 2
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 2
- REPPKAMYTOJTFC-DCAQKATOSA-N Leu-Arg-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O REPPKAMYTOJTFC-DCAQKATOSA-N 0.000 description 2
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 2
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 2
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 2
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 2
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 2
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 2
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 2
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 2
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 2
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 2
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 2
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 2
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 2
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 2
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 2
- VZBIUJURDLFFOE-IHRRRGAJSA-N Leu-His-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VZBIUJURDLFFOE-IHRRRGAJSA-N 0.000 description 2
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 2
- AOFYPTOHESIBFZ-KKUMJFAQSA-N Leu-His-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O AOFYPTOHESIBFZ-KKUMJFAQSA-N 0.000 description 2
- KVOFSTUWVSQMDK-KKUMJFAQSA-N Leu-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KVOFSTUWVSQMDK-KKUMJFAQSA-N 0.000 description 2
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 2
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 2
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 2
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 2
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 2
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 2
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 2
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 2
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 2
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 2
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 description 2
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 2
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 2
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 2
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 2
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 2
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 2
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 2
- VHFFQUSNFFIZBT-CIUDSAMLSA-N Lys-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N VHFFQUSNFFIZBT-CIUDSAMLSA-N 0.000 description 2
- WQWZXKWOEVSGQM-DCAQKATOSA-N Lys-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN WQWZXKWOEVSGQM-DCAQKATOSA-N 0.000 description 2
- YIBOAHAOAWACDK-QEJZJMRPSA-N Lys-Ala-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YIBOAHAOAWACDK-QEJZJMRPSA-N 0.000 description 2
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 2
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 2
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 2
- QYOXSYXPHUHOJR-GUBZILKMSA-N Lys-Asn-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYOXSYXPHUHOJR-GUBZILKMSA-N 0.000 description 2
- NCTDKZKNBDZDOL-GARJFASQSA-N Lys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O NCTDKZKNBDZDOL-GARJFASQSA-N 0.000 description 2
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 2
- IBQMEXQYZMVIFU-SRVKXCTJSA-N Lys-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N IBQMEXQYZMVIFU-SRVKXCTJSA-N 0.000 description 2
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 2
- QQUJSUFWEDZQQY-AVGNSLFASA-N Lys-Gln-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN QQUJSUFWEDZQQY-AVGNSLFASA-N 0.000 description 2
- GRADYHMSAUIKPS-DCAQKATOSA-N Lys-Glu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRADYHMSAUIKPS-DCAQKATOSA-N 0.000 description 2
- DUTMKEAPLLUGNO-JYJNAYRXSA-N Lys-Glu-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DUTMKEAPLLUGNO-JYJNAYRXSA-N 0.000 description 2
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 2
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 2
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 2
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 2
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 2
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 2
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 2
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 2
- YDDDRTIPNTWGIG-SRVKXCTJSA-N Lys-Lys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O YDDDRTIPNTWGIG-SRVKXCTJSA-N 0.000 description 2
- ZZHPLPSLBVBWOA-WDSOQIARSA-N Lys-Met-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N ZZHPLPSLBVBWOA-WDSOQIARSA-N 0.000 description 2
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 2
- DYJOORGDQIGZAS-DCAQKATOSA-N Lys-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N DYJOORGDQIGZAS-DCAQKATOSA-N 0.000 description 2
- YRNRVKTYDSLKMD-KKUMJFAQSA-N Lys-Ser-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YRNRVKTYDSLKMD-KKUMJFAQSA-N 0.000 description 2
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 2
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 2
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 2
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 2
- NROQVSYLPRLJIP-PMVMPFDFSA-N Lys-Trp-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NROQVSYLPRLJIP-PMVMPFDFSA-N 0.000 description 2
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 2
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 2
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 2
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 2
- QAHFGYLFLVGBNW-DCAQKATOSA-N Met-Ala-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN QAHFGYLFLVGBNW-DCAQKATOSA-N 0.000 description 2
- NKDSBBBPGIVWEI-RCWTZXSCSA-N Met-Arg-Thr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NKDSBBBPGIVWEI-RCWTZXSCSA-N 0.000 description 2
- DCHHUGLTVLJYKA-FXQIFTODSA-N Met-Asn-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DCHHUGLTVLJYKA-FXQIFTODSA-N 0.000 description 2
- DZTDEZSHBVRUCQ-FXQIFTODSA-N Met-Asp-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N DZTDEZSHBVRUCQ-FXQIFTODSA-N 0.000 description 2
- OSOLWRWQADPDIQ-DCAQKATOSA-N Met-Asp-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OSOLWRWQADPDIQ-DCAQKATOSA-N 0.000 description 2
- JACAKCWAOHKQBV-UWVGGRQHSA-N Met-Gly-Lys Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN JACAKCWAOHKQBV-UWVGGRQHSA-N 0.000 description 2
- WXJXYMFUTRXRGO-UWVGGRQHSA-N Met-His-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 WXJXYMFUTRXRGO-UWVGGRQHSA-N 0.000 description 2
- XDGFFEZAZHRZFR-RHYQMDGZSA-N Met-Leu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDGFFEZAZHRZFR-RHYQMDGZSA-N 0.000 description 2
- XPVCDCMPKCERFT-GUBZILKMSA-N Met-Ser-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XPVCDCMPKCERFT-GUBZILKMSA-N 0.000 description 2
- SMVTWPOATVIXTN-NAKRPEOUSA-N Met-Ser-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SMVTWPOATVIXTN-NAKRPEOUSA-N 0.000 description 2
- DSZFTPCSFVWMKP-DCAQKATOSA-N Met-Ser-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN DSZFTPCSFVWMKP-DCAQKATOSA-N 0.000 description 2
- FZDOBWIKRQORAC-ULQDDVLXSA-N Met-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N FZDOBWIKRQORAC-ULQDDVLXSA-N 0.000 description 2
- FSTWDRPCQQUJIT-NHCYSSNCSA-N Met-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N FSTWDRPCQQUJIT-NHCYSSNCSA-N 0.000 description 2
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 2
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 2
- 108010066427 N-valyltryptophan Proteins 0.000 description 2
- 108010016852 Orthophosphate Dikinase Pyruvate Proteins 0.000 description 2
- 241000206591 Peptococcus Species 0.000 description 2
- AJOKKVTWEMXZHC-DRZSPHRISA-N Phe-Ala-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 AJOKKVTWEMXZHC-DRZSPHRISA-N 0.000 description 2
- XMPUYNHKEPFERE-IHRRRGAJSA-N Phe-Asp-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMPUYNHKEPFERE-IHRRRGAJSA-N 0.000 description 2
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 2
- CUMXHKAOHNWRFQ-BZSNNMDCSA-N Phe-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CUMXHKAOHNWRFQ-BZSNNMDCSA-N 0.000 description 2
- LXUJDHOKVUYHRC-KKUMJFAQSA-N Phe-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N LXUJDHOKVUYHRC-KKUMJFAQSA-N 0.000 description 2
- VLZGUAUYZGQKPM-DRZSPHRISA-N Phe-Gln-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VLZGUAUYZGQKPM-DRZSPHRISA-N 0.000 description 2
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 2
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 2
- RVRRHFPCEOVRKQ-KKUMJFAQSA-N Phe-His-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC(=O)N)C(=O)O)N RVRRHFPCEOVRKQ-KKUMJFAQSA-N 0.000 description 2
- MJQFZGOIVBDIMZ-WHOFXGATSA-N Phe-Ile-Gly Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O MJQFZGOIVBDIMZ-WHOFXGATSA-N 0.000 description 2
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 2
- LRBSWBVUCLLRLU-BZSNNMDCSA-N Phe-Leu-Lys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(O)=O LRBSWBVUCLLRLU-BZSNNMDCSA-N 0.000 description 2
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 2
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 2
- AUJWXNGCAQWLEI-KBPBESRZSA-N Phe-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AUJWXNGCAQWLEI-KBPBESRZSA-N 0.000 description 2
- AXIOGMQCDYVTNY-ACRUOGEOSA-N Phe-Phe-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 AXIOGMQCDYVTNY-ACRUOGEOSA-N 0.000 description 2
- SHUFSZDAIPLZLF-BEAPCOKYSA-N Phe-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O SHUFSZDAIPLZLF-BEAPCOKYSA-N 0.000 description 2
- VDTYRPWRWRCROL-UFYCRDLUSA-N Phe-Val-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VDTYRPWRWRCROL-UFYCRDLUSA-N 0.000 description 2
- DRVIASBABBMZTF-GUBZILKMSA-N Pro-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1 DRVIASBABBMZTF-GUBZILKMSA-N 0.000 description 2
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 2
- OYEUSRAZOGIDBY-JYJNAYRXSA-N Pro-Arg-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OYEUSRAZOGIDBY-JYJNAYRXSA-N 0.000 description 2
- RETPETNFPLNLRV-JYJNAYRXSA-N Pro-Asn-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O RETPETNFPLNLRV-JYJNAYRXSA-N 0.000 description 2
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 2
- QCARZLHECSFOGG-CIUDSAMLSA-N Pro-Glu-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O QCARZLHECSFOGG-CIUDSAMLSA-N 0.000 description 2
- WFHYFCWBLSKEMS-KKUMJFAQSA-N Pro-Glu-Phe Chemical compound N([C@@H](CCC(=O)O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 WFHYFCWBLSKEMS-KKUMJFAQSA-N 0.000 description 2
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 2
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 2
- IBGCFJDLCYTKPW-NAKRPEOUSA-N Pro-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 IBGCFJDLCYTKPW-NAKRPEOUSA-N 0.000 description 2
- SOACYAXADBWDDT-CYDGBPFRSA-N Pro-Ile-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SOACYAXADBWDDT-CYDGBPFRSA-N 0.000 description 2
- BBFRBZYKHIKFBX-GMOBBJLQSA-N Pro-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BBFRBZYKHIKFBX-GMOBBJLQSA-N 0.000 description 2
- MRYUJHGPZQNOAD-IHRRRGAJSA-N Pro-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 MRYUJHGPZQNOAD-IHRRRGAJSA-N 0.000 description 2
- SMFQZMGHCODUPQ-ULQDDVLXSA-N Pro-Lys-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SMFQZMGHCODUPQ-ULQDDVLXSA-N 0.000 description 2
- DSGSTPRKNYHGCL-JYJNAYRXSA-N Pro-Phe-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DSGSTPRKNYHGCL-JYJNAYRXSA-N 0.000 description 2
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 2
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 2
- CNUIHOAISPKQPY-HSHDSVGOSA-N Pro-Thr-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O CNUIHOAISPKQPY-HSHDSVGOSA-N 0.000 description 2
- QKWYXRPICJEQAJ-KJEVXHAQSA-N Pro-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@@H]2CCCN2)O QKWYXRPICJEQAJ-KJEVXHAQSA-N 0.000 description 2
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 2
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 2
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 2
- 108010079005 RDV peptide Proteins 0.000 description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 2
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 2
- 241001042899 Salimicrobium Species 0.000 description 2
- YUSRGTQIPCJNHQ-CIUDSAMLSA-N Ser-Arg-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YUSRGTQIPCJNHQ-CIUDSAMLSA-N 0.000 description 2
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 2
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 2
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 2
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 2
- BRIZMMZEYSAKJX-QEJZJMRPSA-N Ser-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N BRIZMMZEYSAKJX-QEJZJMRPSA-N 0.000 description 2
- LQESNKGTTNHZPZ-GHCJXIJMSA-N Ser-Ile-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O LQESNKGTTNHZPZ-GHCJXIJMSA-N 0.000 description 2
- MQQBBLVOUUJKLH-HJPIBITLSA-N Ser-Ile-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQQBBLVOUUJKLH-HJPIBITLSA-N 0.000 description 2
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 2
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 2
- GDUZTEQRAOXYJS-SRVKXCTJSA-N Ser-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GDUZTEQRAOXYJS-SRVKXCTJSA-N 0.000 description 2
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 2
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 2
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 2
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 2
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 2
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 2
- DYEGLQRVMBWQLD-IXOXFDKPSA-N Ser-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CO)N)O DYEGLQRVMBWQLD-IXOXFDKPSA-N 0.000 description 2
- VAIWUNAAPZZGRI-IHPCNDPISA-N Ser-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CO)N VAIWUNAAPZZGRI-IHPCNDPISA-N 0.000 description 2
- QYBRQMLZDDJBSW-AVGNSLFASA-N Ser-Tyr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYBRQMLZDDJBSW-AVGNSLFASA-N 0.000 description 2
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 2
- 108091027967 Small hairpin RNA Proteins 0.000 description 2
- 241000246348 Sporosarcina sp. Species 0.000 description 2
- 229920002472 Starch Polymers 0.000 description 2
- 241000187432 Streptomyces coelicolor Species 0.000 description 2
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 2
- 241000204666 Thermotoga maritima Species 0.000 description 2
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 2
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 2
- XYEXCEPTALHNEV-RCWTZXSCSA-N Thr-Arg-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XYEXCEPTALHNEV-RCWTZXSCSA-N 0.000 description 2
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 2
- LXWZOMSOUAMOIA-JIOCBJNQSA-N Thr-Asn-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O LXWZOMSOUAMOIA-JIOCBJNQSA-N 0.000 description 2
- UCCNDUPVIFOOQX-CUJWVEQBSA-N Thr-Cys-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 UCCNDUPVIFOOQX-CUJWVEQBSA-N 0.000 description 2
- LAFLAXHTDVNVEL-WDCWCFNPSA-N Thr-Gln-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O LAFLAXHTDVNVEL-WDCWCFNPSA-N 0.000 description 2
- CQNFRKAKGDSJFR-NUMRIWBASA-N Thr-Glu-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CQNFRKAKGDSJFR-NUMRIWBASA-N 0.000 description 2
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 2
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 2
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 2
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 2
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 2
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 2
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 2
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 2
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 2
- XIULAFZYEKSGAJ-IXOXFDKPSA-N Thr-Leu-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 XIULAFZYEKSGAJ-IXOXFDKPSA-N 0.000 description 2
- FIFDDJFLNVAVMS-RHYQMDGZSA-N Thr-Leu-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O FIFDDJFLNVAVMS-RHYQMDGZSA-N 0.000 description 2
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 2
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 2
- ZXIHABSKUITPTN-IXOXFDKPSA-N Thr-Lys-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O ZXIHABSKUITPTN-IXOXFDKPSA-N 0.000 description 2
- KPNSNVTUVKSBFL-ZJDVBMNYSA-N Thr-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KPNSNVTUVKSBFL-ZJDVBMNYSA-N 0.000 description 2
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 2
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 2
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 2
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 2
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 2
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 2
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 2
- XGUAUKUYQHBUNY-SWRJLBSHSA-N Thr-Trp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O XGUAUKUYQHBUNY-SWRJLBSHSA-N 0.000 description 2
- LXXCHJKHJYRMIY-FQPOAREZSA-N Thr-Tyr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O LXXCHJKHJYRMIY-FQPOAREZSA-N 0.000 description 2
- QAXCHNZDPLSFPC-PJODQICGSA-N Trp-Ala-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QAXCHNZDPLSFPC-PJODQICGSA-N 0.000 description 2
- OFCKFBGRYHOKFP-IHPCNDPISA-N Trp-Asp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N OFCKFBGRYHOKFP-IHPCNDPISA-N 0.000 description 2
- RXEQOXHCHQJMSO-IHPCNDPISA-N Trp-His-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O RXEQOXHCHQJMSO-IHPCNDPISA-N 0.000 description 2
- CXPJPTFWKXNDKV-NUTKFTJISA-N Trp-Leu-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CXPJPTFWKXNDKV-NUTKFTJISA-N 0.000 description 2
- HHPSUFUXXBOFQY-AQZXSJQPSA-N Trp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O HHPSUFUXXBOFQY-AQZXSJQPSA-N 0.000 description 2
- UGFOSENEZHEQKX-PJODQICGSA-N Trp-Val-Ala Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](C)C(O)=O UGFOSENEZHEQKX-PJODQICGSA-N 0.000 description 2
- LNGFWVPNKLWATF-ZVZYQTTQSA-N Trp-Val-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LNGFWVPNKLWATF-ZVZYQTTQSA-N 0.000 description 2
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 description 2
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 2
- GFZQWWDXJVGEMW-ULQDDVLXSA-N Tyr-Arg-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GFZQWWDXJVGEMW-ULQDDVLXSA-N 0.000 description 2
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 2
- XMNDQSYABVWZRK-BZSNNMDCSA-N Tyr-Asn-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XMNDQSYABVWZRK-BZSNNMDCSA-N 0.000 description 2
- CRHFOYCJGVJPLE-AVGNSLFASA-N Tyr-Gln-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CRHFOYCJGVJPLE-AVGNSLFASA-N 0.000 description 2
- HVHJYXDXRIWELT-RYUDHWBXSA-N Tyr-Glu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O HVHJYXDXRIWELT-RYUDHWBXSA-N 0.000 description 2
- SLCSPPCQWUHPPO-JYJNAYRXSA-N Tyr-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SLCSPPCQWUHPPO-JYJNAYRXSA-N 0.000 description 2
- CNLKDWSAORJEMW-KWQFWETISA-N Tyr-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O CNLKDWSAORJEMW-KWQFWETISA-N 0.000 description 2
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 2
- YYZPVPJCOGGQPC-JYJNAYRXSA-N Tyr-His-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYZPVPJCOGGQPC-JYJNAYRXSA-N 0.000 description 2
- GFJXBLSZOFWHAW-JYJNAYRXSA-N Tyr-His-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GFJXBLSZOFWHAW-JYJNAYRXSA-N 0.000 description 2
- WSFXJLFSJSXGMQ-MGHWNKPDSA-N Tyr-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WSFXJLFSJSXGMQ-MGHWNKPDSA-N 0.000 description 2
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 2
- WDGDKHLSDIOXQC-ACRUOGEOSA-N Tyr-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WDGDKHLSDIOXQC-ACRUOGEOSA-N 0.000 description 2
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 2
- WOAQYWUEUYMVGK-ULQDDVLXSA-N Tyr-Lys-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOAQYWUEUYMVGK-ULQDDVLXSA-N 0.000 description 2
- BYAKMYBZADCNMN-JYJNAYRXSA-N Tyr-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYAKMYBZADCNMN-JYJNAYRXSA-N 0.000 description 2
- MXFPBNFKVBHIRW-BZSNNMDCSA-N Tyr-Lys-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O MXFPBNFKVBHIRW-BZSNNMDCSA-N 0.000 description 2
- LRHBBGDMBLFYGL-FHWLQOOXSA-N Tyr-Phe-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LRHBBGDMBLFYGL-FHWLQOOXSA-N 0.000 description 2
- PHKQVWWHRYUCJL-HJOGWXRNSA-N Tyr-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PHKQVWWHRYUCJL-HJOGWXRNSA-N 0.000 description 2
- GOPQNCQSXBJAII-ULQDDVLXSA-N Tyr-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GOPQNCQSXBJAII-ULQDDVLXSA-N 0.000 description 2
- DJIJBQYBDKGDIS-JYJNAYRXSA-N Tyr-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O DJIJBQYBDKGDIS-JYJNAYRXSA-N 0.000 description 2
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 2
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 2
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 2
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 2
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 description 2
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 2
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 2
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 2
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 2
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 2
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 2
- VXCAZHCVDBQMTP-NRPADANISA-N Val-Cys-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VXCAZHCVDBQMTP-NRPADANISA-N 0.000 description 2
- AAOPYWQQBXHINJ-DZKIICNBSA-N Val-Gln-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AAOPYWQQBXHINJ-DZKIICNBSA-N 0.000 description 2
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 2
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 2
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 2
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 2
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 2
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 2
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 2
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 2
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 2
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 2
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 2
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 2
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 2
- OJPRSVJGNCAKQX-SRVKXCTJSA-N Val-Met-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N OJPRSVJGNCAKQX-SRVKXCTJSA-N 0.000 description 2
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 2
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 2
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 2
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 2
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 2
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 2
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 2
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 2
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 2
- USXYVSTVPHELAF-RCWTZXSCSA-N Val-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N)O USXYVSTVPHELAF-RCWTZXSCSA-N 0.000 description 2
- JXCOEPXCBVCTRD-JYJNAYRXSA-N Val-Tyr-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JXCOEPXCBVCTRD-JYJNAYRXSA-N 0.000 description 2
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 2
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- 241000209149 Zea Species 0.000 description 2
- 240000008042 Zea mays Species 0.000 description 2
- 230000000789 acetogenic effect Effects 0.000 description 2
- 150000007513 acids Chemical class 0.000 description 2
- 239000008272 agar Substances 0.000 description 2
- 108010078114 alanyl-tryptophyl-alanine Proteins 0.000 description 2
- 150000001298 alcohols Chemical class 0.000 description 2
- 230000002528 anti-freeze Effects 0.000 description 2
- 239000007864 aqueous solution Substances 0.000 description 2
- 108010008355 arginyl-glutamine Proteins 0.000 description 2
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 2
- 239000006227 byproduct Substances 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 239000001913 cellulose Substances 0.000 description 2
- 229920002678 cellulose Polymers 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 101150028648 citZ gene Proteins 0.000 description 2
- 239000003245 coal Substances 0.000 description 2
- 239000002299 complementary DNA Substances 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 229920001577 copolymer Polymers 0.000 description 2
- 108010016616 cysteinylglycine Proteins 0.000 description 2
- 108010060199 cysteinylproline Proteins 0.000 description 2
- 239000012024 dehydrating agents Substances 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 238000005553 drilling Methods 0.000 description 2
- 239000010408 film Substances 0.000 description 2
- 239000012530 fluid Substances 0.000 description 2
- 238000005194 fractionation Methods 0.000 description 2
- 239000012634 fragment Substances 0.000 description 2
- 238000002290 gas chromatography-mass spectrometry Methods 0.000 description 2
- 239000011521 glass Substances 0.000 description 2
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 2
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 2
- 108010014977 glycine cleavage system Proteins 0.000 description 2
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 2
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 2
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 2
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 2
- 108010023364 glycyl-histidyl-arginine Proteins 0.000 description 2
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 2
- 108010010147 glycylglutamine Proteins 0.000 description 2
- 108010084389 glycyltryptophan Proteins 0.000 description 2
- 238000004128 high performance liquid chromatography Methods 0.000 description 2
- 229920001519 homopolymer Polymers 0.000 description 2
- 239000002440 industrial waste Substances 0.000 description 2
- 238000002347 injection Methods 0.000 description 2
- 239000007924 injection Substances 0.000 description 2
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 2
- 108010027338 isoleucylcysteine Proteins 0.000 description 2
- 108010053037 kyotorphin Proteins 0.000 description 2
- 238000002372 labelling Methods 0.000 description 2
- JVTAAEKCZFNVCJ-UHFFFAOYSA-N lactic acid Chemical compound CC(O)C(O)=O JVTAAEKCZFNVCJ-UHFFFAOYSA-N 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 2
- 108010000761 leucylarginine Proteins 0.000 description 2
- 229920005610 lignin Polymers 0.000 description 2
- 239000002207 metabolite Substances 0.000 description 2
- 108091070501 miRNA Proteins 0.000 description 2
- 239000002679 microRNA Substances 0.000 description 2
- 229930014626 natural product Natural products 0.000 description 2
- 239000003960 organic solvent Substances 0.000 description 2
- 230000002018 overexpression Effects 0.000 description 2
- 230000036961 partial effect Effects 0.000 description 2
- 238000005191 phase separation Methods 0.000 description 2
- 108010069184 phenylalanyl-leucyl-glutamyl-glutamyl-isoleucine Proteins 0.000 description 2
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 2
- 230000000243 photosynthetic effect Effects 0.000 description 2
- 239000004033 plastic Substances 0.000 description 2
- 229920003023 plastic Polymers 0.000 description 2
- 239000003755 preservative agent Substances 0.000 description 2
- 230000002335 preservative effect Effects 0.000 description 2
- 108010079317 prolyl-tyrosine Proteins 0.000 description 2
- 238000011002 quantification Methods 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 239000011347 resin Substances 0.000 description 2
- 229920005989 resin Polymers 0.000 description 2
- 239000000523 sample Substances 0.000 description 2
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 2
- 239000013605 shuttle vector Substances 0.000 description 2
- 238000001179 sorption measurement Methods 0.000 description 2
- 229960000268 spectinomycin Drugs 0.000 description 2
- UNFWWIHTNXNPBV-WXKVUWSESA-N spectinomycin Chemical compound O([C@@H]1[C@@H](NC)[C@@H](O)[C@H]([C@@H]([C@H]1O1)O)NC)[C@]2(O)[C@H]1O[C@H](C)CC2=O UNFWWIHTNXNPBV-WXKVUWSESA-N 0.000 description 2
- 108010005652 splenotritin Proteins 0.000 description 2
- 239000008107 starch Substances 0.000 description 2
- 235000019698 starch Nutrition 0.000 description 2
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 2
- 108010029384 tryptophyl-histidine Proteins 0.000 description 2
- 108010020532 tyrosyl-proline Proteins 0.000 description 2
- 108010003137 tyrosyltyrosine Proteins 0.000 description 2
- 108010009962 valyltyrosine Proteins 0.000 description 2
- 239000011782 vitamin Substances 0.000 description 2
- 235000013343 vitamin Nutrition 0.000 description 2
- 229930003231 vitamin Natural products 0.000 description 2
- 229940088594 vitamin Drugs 0.000 description 2
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 1
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 1
- NTUPOKHATNSWCY-PMPSAXMXSA-N (2s)-2-[[(2s)-1-[(2r)-2-amino-3-phenylpropanoyl]pyrrolidine-2-carbonyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound C([C@@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=CC=C1 NTUPOKHATNSWCY-PMPSAXMXSA-N 0.000 description 1
- JBFQOLHAGBKPTP-NZATWWQASA-N (2s)-2-[[(2s)-4-carboxy-2-[[3-carboxy-2-[[(2s)-2,6-diaminohexanoyl]amino]propanoyl]amino]butanoyl]amino]-4-methylpentanoic acid Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)C(CC(O)=O)NC(=O)[C@@H](N)CCCCN JBFQOLHAGBKPTP-NZATWWQASA-N 0.000 description 1
- HPYLHFWTUAGUNX-BGZSDMPXSA-N (3s)-4-[[(1s)-1-carboxy-2-hydroxyethyl]amino]-4-oxo-3-[[(2s,6s)-2,6,10-triamino-4-[(diaminomethylideneamino)methyl]-5-oxodecanoyl]amino]butanoic acid Chemical compound NCCCC[C@H](N)C(=O)C(CN=C(N)N)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O HPYLHFWTUAGUNX-BGZSDMPXSA-N 0.000 description 1
- DNIAPMSPPWPWGF-GSVOUGTGSA-N (R)-(-)-Propylene glycol Chemical compound C[C@@H](O)CO DNIAPMSPPWPWGF-GSVOUGTGSA-N 0.000 description 1
- VXNZUUAINFGPBY-UHFFFAOYSA-N 1-Butene Chemical compound CCC=C VXNZUUAINFGPBY-UHFFFAOYSA-N 0.000 description 1
- 229940044613 1-propanol Drugs 0.000 description 1
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 1
- 108020004465 16S ribosomal RNA Proteins 0.000 description 1
- 101710082371 2-methylcitrate synthase, mitochondrial Proteins 0.000 description 1
- KPGXRSRHYNQIFN-UHFFFAOYSA-N 2-oxoglutaric acid Chemical compound OC(=O)CCC(=O)C(O)=O KPGXRSRHYNQIFN-UHFFFAOYSA-N 0.000 description 1
- WHBMMWSBFZVSSR-UHFFFAOYSA-M 3-hydroxybutyrate Chemical compound CC(O)CC([O-])=O WHBMMWSBFZVSSR-UHFFFAOYSA-M 0.000 description 1
- 102100027278 4-trimethylaminobutyraldehyde dehydrogenase Human genes 0.000 description 1
- 108010092060 Acetate kinase Proteins 0.000 description 1
- 241000589291 Acinetobacter Species 0.000 description 1
- 108010009924 Aconitate hydratase Proteins 0.000 description 1
- 102000009836 Aconitate hydratase Human genes 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 241000589158 Agrobacterium Species 0.000 description 1
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 1
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 1
- IMMKUCQIKKXKNP-DCAQKATOSA-N Ala-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCN=C(N)N IMMKUCQIKKXKNP-DCAQKATOSA-N 0.000 description 1
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 1
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 description 1
- PXKLCFFSVLKOJM-ACZMJKKPSA-N Ala-Asn-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXKLCFFSVLKOJM-ACZMJKKPSA-N 0.000 description 1
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 1
- JYEBJTDTPNKQJG-FXQIFTODSA-N Ala-Asn-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N JYEBJTDTPNKQJG-FXQIFTODSA-N 0.000 description 1
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 1
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 1
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 1
- XQJAFSDFQZPYCU-UWJYBYFXSA-N Ala-Asn-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N XQJAFSDFQZPYCU-UWJYBYFXSA-N 0.000 description 1
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 1
- MCKSLROAGSDNFC-ACZMJKKPSA-N Ala-Asp-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MCKSLROAGSDNFC-ACZMJKKPSA-N 0.000 description 1
- FOWHQTWRLFTELJ-FXQIFTODSA-N Ala-Asp-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N FOWHQTWRLFTELJ-FXQIFTODSA-N 0.000 description 1
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 1
- DAEFQZCYZKRTLR-ZLUOBGJFSA-N Ala-Cys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O DAEFQZCYZKRTLR-ZLUOBGJFSA-N 0.000 description 1
- WCBVQNZTOKJWJS-ACZMJKKPSA-N Ala-Cys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O WCBVQNZTOKJWJS-ACZMJKKPSA-N 0.000 description 1
- CXQODNIBUNQWAS-CIUDSAMLSA-N Ala-Gln-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CXQODNIBUNQWAS-CIUDSAMLSA-N 0.000 description 1
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 1
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 1
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 1
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 1
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 1
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 1
- ROLXPVQSRCPVGK-XDTLVQLUSA-N Ala-Glu-Tyr Chemical compound N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O ROLXPVQSRCPVGK-XDTLVQLUSA-N 0.000 description 1
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 1
- VWEWCZSUWOEEFM-WDSKDSINSA-N Ala-Gly-Ala-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(=O)NCC(O)=O VWEWCZSUWOEEFM-WDSKDSINSA-N 0.000 description 1
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 1
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 1
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 1
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 1
- CWEAKSWWKHGTRJ-BQBZGAKWSA-N Ala-Gly-Met Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O CWEAKSWWKHGTRJ-BQBZGAKWSA-N 0.000 description 1
- QHASENCZLDHBGX-ONGXEEELSA-N Ala-Gly-Phe Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QHASENCZLDHBGX-ONGXEEELSA-N 0.000 description 1
- FDAZDMAFZYTHGS-XVYDVKMFSA-N Ala-His-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O FDAZDMAFZYTHGS-XVYDVKMFSA-N 0.000 description 1
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 1
- GSHKMNKPMLXSQW-KBIXCLLPSA-N Ala-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C)N GSHKMNKPMLXSQW-KBIXCLLPSA-N 0.000 description 1
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 1
- XCZXVTHYGSMQGH-NAKRPEOUSA-N Ala-Ile-Met Chemical compound C[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C([O-])=O XCZXVTHYGSMQGH-NAKRPEOUSA-N 0.000 description 1
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 1
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 1
- LBYMZCVBOKYZNS-CIUDSAMLSA-N Ala-Leu-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O LBYMZCVBOKYZNS-CIUDSAMLSA-N 0.000 description 1
- DCUCOIYYUBILPS-GUBZILKMSA-N Ala-Leu-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DCUCOIYYUBILPS-GUBZILKMSA-N 0.000 description 1
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 1
- VHVVPYOJIIQCKS-QEJZJMRPSA-N Ala-Leu-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VHVVPYOJIIQCKS-QEJZJMRPSA-N 0.000 description 1
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 1
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 1
- PIXQDIGKDNNOOV-GUBZILKMSA-N Ala-Lys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O PIXQDIGKDNNOOV-GUBZILKMSA-N 0.000 description 1
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 1
- FUKFQILQFQKHLE-DCAQKATOSA-N Ala-Lys-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O FUKFQILQFQKHLE-DCAQKATOSA-N 0.000 description 1
- BLTRAARCJYVJKV-QEJZJMRPSA-N Ala-Lys-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(O)=O BLTRAARCJYVJKV-QEJZJMRPSA-N 0.000 description 1
- CHFFHQUVXHEGBY-GARJFASQSA-N Ala-Lys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CHFFHQUVXHEGBY-GARJFASQSA-N 0.000 description 1
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 1
- KQESEZXHYOUIIM-CQDKDKBSSA-N Ala-Lys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KQESEZXHYOUIIM-CQDKDKBSSA-N 0.000 description 1
- IHRGVZXPTIQNIP-NAKRPEOUSA-N Ala-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C)N IHRGVZXPTIQNIP-NAKRPEOUSA-N 0.000 description 1
- GKAZXNDATBWNBI-DCAQKATOSA-N Ala-Met-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N GKAZXNDATBWNBI-DCAQKATOSA-N 0.000 description 1
- HYIDEIQUCBKIPL-CQDKDKBSSA-N Ala-Phe-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N HYIDEIQUCBKIPL-CQDKDKBSSA-N 0.000 description 1
- RUXQNKVQSKOOBS-JURCDPSOSA-N Ala-Phe-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RUXQNKVQSKOOBS-JURCDPSOSA-N 0.000 description 1
- WEZNQZHACPSMEF-QEJZJMRPSA-N Ala-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 WEZNQZHACPSMEF-QEJZJMRPSA-N 0.000 description 1
- JAQNUEWEJWBVAY-WBAXXEDZSA-N Ala-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 JAQNUEWEJWBVAY-WBAXXEDZSA-N 0.000 description 1
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 1
- DXTYEWAQOXYRHZ-KKXDTOCCSA-N Ala-Phe-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N DXTYEWAQOXYRHZ-KKXDTOCCSA-N 0.000 description 1
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 1
- XAXHGSOBFPIRFG-LSJOCFKGSA-N Ala-Pro-His Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O XAXHGSOBFPIRFG-LSJOCFKGSA-N 0.000 description 1
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 1
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- NHWYNIZWLJYZAG-XVYDVKMFSA-N Ala-Ser-His Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N NHWYNIZWLJYZAG-XVYDVKMFSA-N 0.000 description 1
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 1
- OMCKWYSDUQBYCN-FXQIFTODSA-N Ala-Ser-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O OMCKWYSDUQBYCN-FXQIFTODSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 1
- AAWLEICNDUHIJM-MBLNEYKQSA-N Ala-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C)N)O AAWLEICNDUHIJM-MBLNEYKQSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 1
- PXAFZDXYEIIUTF-LKTVYLICSA-N Ala-Trp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXAFZDXYEIIUTF-LKTVYLICSA-N 0.000 description 1
- YXXPVUOMPSZURS-ZLIFDBKOSA-N Ala-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 YXXPVUOMPSZURS-ZLIFDBKOSA-N 0.000 description 1
- XPBVBZPVNFIHOA-UVBJJODRSA-N Ala-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 XPBVBZPVNFIHOA-UVBJJODRSA-N 0.000 description 1
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 1
- MTDDMSUUXNQMKK-BPNCWPANSA-N Ala-Tyr-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N MTDDMSUUXNQMKK-BPNCWPANSA-N 0.000 description 1
- BGGAIXWIZCIFSG-XDTLVQLUSA-N Ala-Tyr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O BGGAIXWIZCIFSG-XDTLVQLUSA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 1
- CLOMBHBBUKAUBP-LSJOCFKGSA-N Ala-Val-His Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N CLOMBHBBUKAUBP-LSJOCFKGSA-N 0.000 description 1
- ZDILXFDENZVOTL-BPNCWPANSA-N Ala-Val-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZDILXFDENZVOTL-BPNCWPANSA-N 0.000 description 1
- 108010021809 Alcohol dehydrogenase Proteins 0.000 description 1
- 102000007698 Alcohol dehydrogenase Human genes 0.000 description 1
- 108010053754 Aldehyde reductase Proteins 0.000 description 1
- 102100027265 Aldo-keto reductase family 1 member B1 Human genes 0.000 description 1
- 241001147780 Alicyclobacillus Species 0.000 description 1
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 1
- 108010065763 Aminobutyraldehyde dehydrogenase Proteins 0.000 description 1
- 102100039338 Aminomethyltransferase, mitochondrial Human genes 0.000 description 1
- 241000379991 Anaerococcus Species 0.000 description 1
- 241001626813 Anoxybacillus Species 0.000 description 1
- 108020005544 Antisense RNA Proteins 0.000 description 1
- 241000203069 Archaea Species 0.000 description 1
- 241001135163 Arcobacter Species 0.000 description 1
- DFCIPNHFKOQAME-FXQIFTODSA-N Arg-Ala-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFCIPNHFKOQAME-FXQIFTODSA-N 0.000 description 1
- YYOVLDPHIJAOSY-DCAQKATOSA-N Arg-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N YYOVLDPHIJAOSY-DCAQKATOSA-N 0.000 description 1
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 1
- BHSYMWWMVRPCPA-CYDGBPFRSA-N Arg-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N BHSYMWWMVRPCPA-CYDGBPFRSA-N 0.000 description 1
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 1
- DCGLNNVKIZXQOJ-FXQIFTODSA-N Arg-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DCGLNNVKIZXQOJ-FXQIFTODSA-N 0.000 description 1
- NUBPTCMEOCKWDO-DCAQKATOSA-N Arg-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N NUBPTCMEOCKWDO-DCAQKATOSA-N 0.000 description 1
- QPOARHANPULOTM-GMOBBJLQSA-N Arg-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N QPOARHANPULOTM-GMOBBJLQSA-N 0.000 description 1
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 1
- ZTKHZAXGTFXUDD-VEVYYDQMSA-N Arg-Asn-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZTKHZAXGTFXUDD-VEVYYDQMSA-N 0.000 description 1
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 1
- YFBGNGASPGRWEM-DCAQKATOSA-N Arg-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFBGNGASPGRWEM-DCAQKATOSA-N 0.000 description 1
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 1
- TTXYKSADPSNOIF-IHRRRGAJSA-N Arg-Asp-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O TTXYKSADPSNOIF-IHRRRGAJSA-N 0.000 description 1
- MFAMTAVAFBPXDC-LPEHRKFASA-N Arg-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O MFAMTAVAFBPXDC-LPEHRKFASA-N 0.000 description 1
- OANWAFQRNQEDSY-DCAQKATOSA-N Arg-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N OANWAFQRNQEDSY-DCAQKATOSA-N 0.000 description 1
- RWDVGVPHEWOZMO-GUBZILKMSA-N Arg-Cys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCCNC(N)=N)C(O)=O RWDVGVPHEWOZMO-GUBZILKMSA-N 0.000 description 1
- PTVGLOCPAVYPFG-CIUDSAMLSA-N Arg-Gln-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PTVGLOCPAVYPFG-CIUDSAMLSA-N 0.000 description 1
- KBBKCNHWCDJPGN-GUBZILKMSA-N Arg-Gln-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KBBKCNHWCDJPGN-GUBZILKMSA-N 0.000 description 1
- RYRQZJVFDVWURI-SRVKXCTJSA-N Arg-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N RYRQZJVFDVWURI-SRVKXCTJSA-N 0.000 description 1
- BEXGZLUHRXTZCC-CIUDSAMLSA-N Arg-Gln-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N BEXGZLUHRXTZCC-CIUDSAMLSA-N 0.000 description 1
- MTANSHNQTWPZKP-KKUMJFAQSA-N Arg-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O MTANSHNQTWPZKP-KKUMJFAQSA-N 0.000 description 1
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 1
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- NXDXECQFKHXHAM-HJGDQZAQSA-N Arg-Glu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NXDXECQFKHXHAM-HJGDQZAQSA-N 0.000 description 1
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 1
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 1
- PNIGSVZJNVUVJA-BQBZGAKWSA-N Arg-Gly-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O PNIGSVZJNVUVJA-BQBZGAKWSA-N 0.000 description 1
- IYMAXBFPHPZYIK-BQBZGAKWSA-N Arg-Gly-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IYMAXBFPHPZYIK-BQBZGAKWSA-N 0.000 description 1
- QKSAZKCRVQYYGS-UWVGGRQHSA-N Arg-Gly-His Chemical compound N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O QKSAZKCRVQYYGS-UWVGGRQHSA-N 0.000 description 1
- RFXXUWGNVRJTNQ-QXEWZRGKSA-N Arg-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N RFXXUWGNVRJTNQ-QXEWZRGKSA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 1
- BMNVSPMWMICFRV-DCAQKATOSA-N Arg-His-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CN=CN1 BMNVSPMWMICFRV-DCAQKATOSA-N 0.000 description 1
- MSILNNHVVMMTHZ-UWVGGRQHSA-N Arg-His-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 MSILNNHVVMMTHZ-UWVGGRQHSA-N 0.000 description 1
- UPKMBGAAEZGHOC-RWMBFGLXSA-N Arg-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O UPKMBGAAEZGHOC-RWMBFGLXSA-N 0.000 description 1
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 1
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 1
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 1
- LKDHUGLXOHYINY-XUXIUFHCSA-N Arg-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LKDHUGLXOHYINY-XUXIUFHCSA-N 0.000 description 1
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 1
- YKZJPIPFKGYHKY-DCAQKATOSA-N Arg-Leu-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKZJPIPFKGYHKY-DCAQKATOSA-N 0.000 description 1
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 1
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 1
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 1
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 1
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 1
- RIIVUOJDDQXHRV-SRVKXCTJSA-N Arg-Lys-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O RIIVUOJDDQXHRV-SRVKXCTJSA-N 0.000 description 1
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 description 1
- CZUHPNLXLWMYMG-UBHSHLNASA-N Arg-Phe-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 CZUHPNLXLWMYMG-UBHSHLNASA-N 0.000 description 1
- FKQITMVNILRUCQ-IHRRRGAJSA-N Arg-Phe-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O FKQITMVNILRUCQ-IHRRRGAJSA-N 0.000 description 1
- KZXPVYVSHUJCEO-ULQDDVLXSA-N Arg-Phe-Lys Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 KZXPVYVSHUJCEO-ULQDDVLXSA-N 0.000 description 1
- MNBHKGYCLBUIBC-UFYCRDLUSA-N Arg-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCNC(N)=N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MNBHKGYCLBUIBC-UFYCRDLUSA-N 0.000 description 1
- YFHATWYGAAXQCF-JYJNAYRXSA-N Arg-Pro-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YFHATWYGAAXQCF-JYJNAYRXSA-N 0.000 description 1
- VRTWYUYCJGNFES-CIUDSAMLSA-N Arg-Ser-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O VRTWYUYCJGNFES-CIUDSAMLSA-N 0.000 description 1
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 1
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 1
- WCZXPVPHUMYLMS-VEVYYDQMSA-N Arg-Thr-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O WCZXPVPHUMYLMS-VEVYYDQMSA-N 0.000 description 1
- HRCIIMCTUIAKQB-XGEHTFHBSA-N Arg-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O HRCIIMCTUIAKQB-XGEHTFHBSA-N 0.000 description 1
- LYJXHXGPWDTLKW-HJGDQZAQSA-N Arg-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O LYJXHXGPWDTLKW-HJGDQZAQSA-N 0.000 description 1
- AUZAXCPWMDBWEE-HJGDQZAQSA-N Arg-Thr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O AUZAXCPWMDBWEE-HJGDQZAQSA-N 0.000 description 1
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 1
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 1
- ZPWMEWYQBWSGAO-ZJDVBMNYSA-N Arg-Thr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZPWMEWYQBWSGAO-ZJDVBMNYSA-N 0.000 description 1
- UGJLILSJKSBVIR-ZFWWWQNUSA-N Arg-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)NCC(O)=O)=CNC2=C1 UGJLILSJKSBVIR-ZFWWWQNUSA-N 0.000 description 1
- QJWLLRZTJFPCHA-STECZYCISA-N Arg-Tyr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QJWLLRZTJFPCHA-STECZYCISA-N 0.000 description 1
- CGWVCWFQGXOUSJ-ULQDDVLXSA-N Arg-Tyr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O CGWVCWFQGXOUSJ-ULQDDVLXSA-N 0.000 description 1
- LFWOQHSQNCKXRU-UFYCRDLUSA-N Arg-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 LFWOQHSQNCKXRU-UFYCRDLUSA-N 0.000 description 1
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 1
- PSUXEQYPYZLNER-QXEWZRGKSA-N Arg-Val-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PSUXEQYPYZLNER-QXEWZRGKSA-N 0.000 description 1
- FTMRPIVPSDVGCC-GUBZILKMSA-N Arg-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FTMRPIVPSDVGCC-GUBZILKMSA-N 0.000 description 1
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- PFOYSEIHFVKHNF-FXQIFTODSA-N Asn-Ala-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PFOYSEIHFVKHNF-FXQIFTODSA-N 0.000 description 1
- PDQBXRSOSCTGKY-ACZMJKKPSA-N Asn-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PDQBXRSOSCTGKY-ACZMJKKPSA-N 0.000 description 1
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 1
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 1
- NTXNUXPCNRDMAF-WFBYXXMGSA-N Asn-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC(N)=O)C)C(O)=O)=CNC2=C1 NTXNUXPCNRDMAF-WFBYXXMGSA-N 0.000 description 1
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 1
- HOIFSHOLNKQCSA-FXQIFTODSA-N Asn-Arg-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O HOIFSHOLNKQCSA-FXQIFTODSA-N 0.000 description 1
- YJRORCOAFUZVKA-FXQIFTODSA-N Asn-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N YJRORCOAFUZVKA-FXQIFTODSA-N 0.000 description 1
- MFFOYNGMOYFPBD-DCAQKATOSA-N Asn-Arg-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MFFOYNGMOYFPBD-DCAQKATOSA-N 0.000 description 1
- DQTIWTULBGLJBL-DCAQKATOSA-N Asn-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N DQTIWTULBGLJBL-DCAQKATOSA-N 0.000 description 1
- CQMQJWRCRQSBAF-BPUTZDHNSA-N Asn-Arg-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N CQMQJWRCRQSBAF-BPUTZDHNSA-N 0.000 description 1
- JEPNYDRDYNSFIU-QXEWZRGKSA-N Asn-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(N)=O)C(O)=O JEPNYDRDYNSFIU-QXEWZRGKSA-N 0.000 description 1
- RCENDENBBJFJHZ-ACZMJKKPSA-N Asn-Asn-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCENDENBBJFJHZ-ACZMJKKPSA-N 0.000 description 1
- APHUDFFMXFYRKP-CIUDSAMLSA-N Asn-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N APHUDFFMXFYRKP-CIUDSAMLSA-N 0.000 description 1
- DXZNJWFECGJCQR-FXQIFTODSA-N Asn-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N DXZNJWFECGJCQR-FXQIFTODSA-N 0.000 description 1
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 1
- BHQQRVARKXWXPP-ACZMJKKPSA-N Asn-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BHQQRVARKXWXPP-ACZMJKKPSA-N 0.000 description 1
- ZWASIOHRQWRWAS-UGYAYLCHSA-N Asn-Asp-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZWASIOHRQWRWAS-UGYAYLCHSA-N 0.000 description 1
- JZRLLSOWDYUKOK-SRVKXCTJSA-N Asn-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N JZRLLSOWDYUKOK-SRVKXCTJSA-N 0.000 description 1
- VYLVOMUVLMGCRF-ZLUOBGJFSA-N Asn-Asp-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VYLVOMUVLMGCRF-ZLUOBGJFSA-N 0.000 description 1
- ZPMNECSEJXXNBE-CIUDSAMLSA-N Asn-Cys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ZPMNECSEJXXNBE-CIUDSAMLSA-N 0.000 description 1
- CZIXHXIJJZLYRJ-SRVKXCTJSA-N Asn-Cys-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CZIXHXIJJZLYRJ-SRVKXCTJSA-N 0.000 description 1
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 1
- PPMTUXJSQDNUDE-CIUDSAMLSA-N Asn-Glu-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PPMTUXJSQDNUDE-CIUDSAMLSA-N 0.000 description 1
- XVAPVJNJGLWGCS-ACZMJKKPSA-N Asn-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVAPVJNJGLWGCS-ACZMJKKPSA-N 0.000 description 1
- HCAUEJAQCXVQQM-ACZMJKKPSA-N Asn-Glu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HCAUEJAQCXVQQM-ACZMJKKPSA-N 0.000 description 1
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 1
- OGMDXNFGPOPZTK-GUBZILKMSA-N Asn-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N OGMDXNFGPOPZTK-GUBZILKMSA-N 0.000 description 1
- BKDDABUWNKGZCK-XHNCKOQMSA-N Asn-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O BKDDABUWNKGZCK-XHNCKOQMSA-N 0.000 description 1
- GFFRWIJAFFMQGM-NUMRIWBASA-N Asn-Glu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFFRWIJAFFMQGM-NUMRIWBASA-N 0.000 description 1
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 1
- DDPXDCKYWDGZAL-BQBZGAKWSA-N Asn-Gly-Arg Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N DDPXDCKYWDGZAL-BQBZGAKWSA-N 0.000 description 1
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 1
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 1
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 1
- YGHCVNQOZZMHRZ-DJFWLOJKSA-N Asn-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)N)N YGHCVNQOZZMHRZ-DJFWLOJKSA-N 0.000 description 1
- IKLAUGBIDCDFOY-SRVKXCTJSA-N Asn-His-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O IKLAUGBIDCDFOY-SRVKXCTJSA-N 0.000 description 1
- AITGTTNYKAWKDR-CIUDSAMLSA-N Asn-His-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O AITGTTNYKAWKDR-CIUDSAMLSA-N 0.000 description 1
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 1
- OLISTMZJGQUOGS-GMOBBJLQSA-N Asn-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OLISTMZJGQUOGS-GMOBBJLQSA-N 0.000 description 1
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 1
- LVHMEJJWEXBMKK-GMOBBJLQSA-N Asn-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N LVHMEJJWEXBMKK-GMOBBJLQSA-N 0.000 description 1
- JQBCANGGAVVERB-CFMVVWHZSA-N Asn-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N JQBCANGGAVVERB-CFMVVWHZSA-N 0.000 description 1
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 1
- ALKWEXBKAHPJAQ-NAKRPEOUSA-N Asn-Leu-Asp-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ALKWEXBKAHPJAQ-NAKRPEOUSA-N 0.000 description 1
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 1
- UBGGJTMETLEXJD-DCAQKATOSA-N Asn-Leu-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O UBGGJTMETLEXJD-DCAQKATOSA-N 0.000 description 1
- ALHMNHZJBYBYHS-DCAQKATOSA-N Asn-Lys-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ALHMNHZJBYBYHS-DCAQKATOSA-N 0.000 description 1
- FTSAJSADJCMDHH-CIUDSAMLSA-N Asn-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FTSAJSADJCMDHH-CIUDSAMLSA-N 0.000 description 1
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 1
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 1
- VOGCFWDZYYTEOY-DCAQKATOSA-N Asn-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N VOGCFWDZYYTEOY-DCAQKATOSA-N 0.000 description 1
- GIQCDTKOIPUDSG-GARJFASQSA-N Asn-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N)C(=O)O GIQCDTKOIPUDSG-GARJFASQSA-N 0.000 description 1
- MDDXKBHIMYYJLW-FXQIFTODSA-N Asn-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N MDDXKBHIMYYJLW-FXQIFTODSA-N 0.000 description 1
- KSGAFDTYQPKUAP-GMOBBJLQSA-N Asn-Met-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KSGAFDTYQPKUAP-GMOBBJLQSA-N 0.000 description 1
- QGABLMITFKUQDF-DCAQKATOSA-N Asn-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N QGABLMITFKUQDF-DCAQKATOSA-N 0.000 description 1
- KEUNWIXNKVWCFL-FXQIFTODSA-N Asn-Met-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O KEUNWIXNKVWCFL-FXQIFTODSA-N 0.000 description 1
- VITDJIPIJZAVGC-VEVYYDQMSA-N Asn-Met-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VITDJIPIJZAVGC-VEVYYDQMSA-N 0.000 description 1
- OROMFUQQTSWUTI-IHRRRGAJSA-N Asn-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OROMFUQQTSWUTI-IHRRRGAJSA-N 0.000 description 1
- RTFWCVDISAMGEQ-SRVKXCTJSA-N Asn-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N RTFWCVDISAMGEQ-SRVKXCTJSA-N 0.000 description 1
- ZJIFRAPZHAGLGR-MELADBBJSA-N Asn-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZJIFRAPZHAGLGR-MELADBBJSA-N 0.000 description 1
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 1
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 1
- XMHFCUKJRCQXGI-CIUDSAMLSA-N Asn-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O XMHFCUKJRCQXGI-CIUDSAMLSA-N 0.000 description 1
- NJSNXIOKBHPFMB-GMOBBJLQSA-N Asn-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)N)N NJSNXIOKBHPFMB-GMOBBJLQSA-N 0.000 description 1
- AWXDRZJQCVHCIT-DCAQKATOSA-N Asn-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O AWXDRZJQCVHCIT-DCAQKATOSA-N 0.000 description 1
- VCJCPARXDBEGNE-GUBZILKMSA-N Asn-Pro-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 VCJCPARXDBEGNE-GUBZILKMSA-N 0.000 description 1
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 1
- HPNDKUOLNRVRAY-BIIVOSGPSA-N Asn-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N)C(=O)O HPNDKUOLNRVRAY-BIIVOSGPSA-N 0.000 description 1
- ZUFPUBYQYWCMDB-NUMRIWBASA-N Asn-Thr-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZUFPUBYQYWCMDB-NUMRIWBASA-N 0.000 description 1
- HCZQKHSRYHCPSD-IUKAMOBKSA-N Asn-Thr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HCZQKHSRYHCPSD-IUKAMOBKSA-N 0.000 description 1
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 1
- PIABYSIYPGLLDQ-XVSYOHENSA-N Asn-Thr-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PIABYSIYPGLLDQ-XVSYOHENSA-N 0.000 description 1
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 1
- XCBKBPRFACFFOO-AQZXSJQPSA-N Asn-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O XCBKBPRFACFFOO-AQZXSJQPSA-N 0.000 description 1
- KZYSHAMXEBPJBD-JRQIVUDYSA-N Asn-Thr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZYSHAMXEBPJBD-JRQIVUDYSA-N 0.000 description 1
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 1
- BIGRHVNFFJTHEB-UBHSHLNASA-N Asn-Trp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O BIGRHVNFFJTHEB-UBHSHLNASA-N 0.000 description 1
- TZQWZQSMHDVLQL-QEJZJMRPSA-N Asn-Trp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N TZQWZQSMHDVLQL-QEJZJMRPSA-N 0.000 description 1
- QIRJQYQOIKBPBZ-IHRRRGAJSA-N Asn-Tyr-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QIRJQYQOIKBPBZ-IHRRRGAJSA-N 0.000 description 1
- RTFXPCYMDYBZNQ-SRVKXCTJSA-N Asn-Tyr-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O RTFXPCYMDYBZNQ-SRVKXCTJSA-N 0.000 description 1
- SKQTXVZTCGSRJS-SRVKXCTJSA-N Asn-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O SKQTXVZTCGSRJS-SRVKXCTJSA-N 0.000 description 1
- LRCIOEVFVGXZKB-BZSNNMDCSA-N Asn-Tyr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LRCIOEVFVGXZKB-BZSNNMDCSA-N 0.000 description 1
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 1
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 description 1
- AECPDLSSUMDUAA-ZKWXMUAHSA-N Asn-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N AECPDLSSUMDUAA-ZKWXMUAHSA-N 0.000 description 1
- XZFONYMRYTVLPL-NHCYSSNCSA-N Asn-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N XZFONYMRYTVLPL-NHCYSSNCSA-N 0.000 description 1
- JNCRAQVYJZGIOW-QSFUFRPTSA-N Asn-Val-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNCRAQVYJZGIOW-QSFUFRPTSA-N 0.000 description 1
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 1
- QXNGSPZMGFEZNO-QRTARXTBSA-N Asn-Val-Trp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QXNGSPZMGFEZNO-QRTARXTBSA-N 0.000 description 1
- HBUJSDCLZCXXCW-YDHLFZDLSA-N Asn-Val-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HBUJSDCLZCXXCW-YDHLFZDLSA-N 0.000 description 1
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 1
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 1
- ZVTDYGWRRPMFCL-WFBYXXMGSA-N Asp-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N ZVTDYGWRRPMFCL-WFBYXXMGSA-N 0.000 description 1
- BLQBMRNMBAYREH-UWJYBYFXSA-N Asp-Ala-Tyr Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O BLQBMRNMBAYREH-UWJYBYFXSA-N 0.000 description 1
- ZLGKHJHFYSRUBH-FXQIFTODSA-N Asp-Arg-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLGKHJHFYSRUBH-FXQIFTODSA-N 0.000 description 1
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 description 1
- HMQDRBKQMLRCCG-GMOBBJLQSA-N Asp-Arg-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HMQDRBKQMLRCCG-GMOBBJLQSA-N 0.000 description 1
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 description 1
- ILJQISGMGXRZQQ-IHRRRGAJSA-N Asp-Arg-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ILJQISGMGXRZQQ-IHRRRGAJSA-N 0.000 description 1
- CASGONAXMZPHCK-FXQIFTODSA-N Asp-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N CASGONAXMZPHCK-FXQIFTODSA-N 0.000 description 1
- ZELQAFZSJOBEQS-ACZMJKKPSA-N Asp-Asn-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZELQAFZSJOBEQS-ACZMJKKPSA-N 0.000 description 1
- RYEWQKQXRJCHIO-SRVKXCTJSA-N Asp-Asn-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 RYEWQKQXRJCHIO-SRVKXCTJSA-N 0.000 description 1
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 1
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 1
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 1
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 1
- MJKBOVWWADWLHV-ZLUOBGJFSA-N Asp-Cys-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)C(=O)O MJKBOVWWADWLHV-ZLUOBGJFSA-N 0.000 description 1
- UWOPETAWXDZUJR-ACZMJKKPSA-N Asp-Cys-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O UWOPETAWXDZUJR-ACZMJKKPSA-N 0.000 description 1
- LJRPYAZQQWHEEV-FXQIFTODSA-N Asp-Gln-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O LJRPYAZQQWHEEV-FXQIFTODSA-N 0.000 description 1
- VHQOCWWKXIOAQI-WDSKDSINSA-N Asp-Gln-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VHQOCWWKXIOAQI-WDSKDSINSA-N 0.000 description 1
- PMEHKVHZQKJACS-PEFMBERDSA-N Asp-Gln-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PMEHKVHZQKJACS-PEFMBERDSA-N 0.000 description 1
- OEUQMKNNOWJREN-AVGNSLFASA-N Asp-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N OEUQMKNNOWJREN-AVGNSLFASA-N 0.000 description 1
- CSEJMKNZDCJYGJ-XHNCKOQMSA-N Asp-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O CSEJMKNZDCJYGJ-XHNCKOQMSA-N 0.000 description 1
- IJHUZMGJRGNXIW-CIUDSAMLSA-N Asp-Glu-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IJHUZMGJRGNXIW-CIUDSAMLSA-N 0.000 description 1
- XJQRWGXKUSDEFI-ACZMJKKPSA-N Asp-Glu-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XJQRWGXKUSDEFI-ACZMJKKPSA-N 0.000 description 1
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 1
- HSWYMWGDMPLTTH-FXQIFTODSA-N Asp-Glu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HSWYMWGDMPLTTH-FXQIFTODSA-N 0.000 description 1
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 1
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 1
- ZEDBMCPXPIYJLW-XHNCKOQMSA-N Asp-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O ZEDBMCPXPIYJLW-XHNCKOQMSA-N 0.000 description 1
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 1
- GISFCCXBVJKGEO-QEJZJMRPSA-N Asp-Glu-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O GISFCCXBVJKGEO-QEJZJMRPSA-N 0.000 description 1
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 1
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 1
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 1
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 1
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 1
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 1
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 1
- OGTCOKZFOJIZFG-CIUDSAMLSA-N Asp-His-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O OGTCOKZFOJIZFG-CIUDSAMLSA-N 0.000 description 1
- UBPMOJLRVMGTOQ-GARJFASQSA-N Asp-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N)C(=O)O UBPMOJLRVMGTOQ-GARJFASQSA-N 0.000 description 1
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 1
- MFTVXYMXSAQZNL-DJFWLOJKSA-N Asp-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)O)N MFTVXYMXSAQZNL-DJFWLOJKSA-N 0.000 description 1
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 1
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 1
- XLILXFRAKOYEJX-GUBZILKMSA-N Asp-Leu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLILXFRAKOYEJX-GUBZILKMSA-N 0.000 description 1
- XSXVLWBWIPKUSN-UHFFFAOYSA-N Asp-Leu-Glu-Asp Chemical compound OC(=O)CC(N)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(O)=O)C(O)=O XSXVLWBWIPKUSN-UHFFFAOYSA-N 0.000 description 1
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 1
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 1
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 1
- CZECQDPEMSVPDH-MNXVOIDGSA-N Asp-Leu-Val-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CZECQDPEMSVPDH-MNXVOIDGSA-N 0.000 description 1
- UZFHNLYQWMGUHU-DCAQKATOSA-N Asp-Lys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZFHNLYQWMGUHU-DCAQKATOSA-N 0.000 description 1
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 1
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 1
- YTXCCDCOHIYQFC-GUBZILKMSA-N Asp-Met-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTXCCDCOHIYQFC-GUBZILKMSA-N 0.000 description 1
- IOXWDLNHXZOXQP-FXQIFTODSA-N Asp-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N IOXWDLNHXZOXQP-FXQIFTODSA-N 0.000 description 1
- IDDMGSKZQDEDGA-SRVKXCTJSA-N Asp-Phe-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 IDDMGSKZQDEDGA-SRVKXCTJSA-N 0.000 description 1
- GWIJZUVQVDJHDI-AVGNSLFASA-N Asp-Phe-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GWIJZUVQVDJHDI-AVGNSLFASA-N 0.000 description 1
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 1
- NONWUQAWAANERO-BZSNNMDCSA-N Asp-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 NONWUQAWAANERO-BZSNNMDCSA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 1
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 1
- HJZLUGQGJWXJCJ-CIUDSAMLSA-N Asp-Pro-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJZLUGQGJWXJCJ-CIUDSAMLSA-N 0.000 description 1
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 1
- ZVGRHIRJLWBWGJ-ACZMJKKPSA-N Asp-Ser-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVGRHIRJLWBWGJ-ACZMJKKPSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 1
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 1
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 1
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 1
- NAAAPCLFJPURAM-HJGDQZAQSA-N Asp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O NAAAPCLFJPURAM-HJGDQZAQSA-N 0.000 description 1
- GXHDGYOXPNQCKM-XVSYOHENSA-N Asp-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GXHDGYOXPNQCKM-XVSYOHENSA-N 0.000 description 1
- KCOPOPKJRHVGPE-AQZXSJQPSA-N Asp-Thr-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O KCOPOPKJRHVGPE-AQZXSJQPSA-N 0.000 description 1
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 1
- DKQCWCQRAMAFLN-UBHSHLNASA-N Asp-Trp-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O DKQCWCQRAMAFLN-UBHSHLNASA-N 0.000 description 1
- BOXNGMVEVOGXOJ-UBHSHLNASA-N Asp-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N BOXNGMVEVOGXOJ-UBHSHLNASA-N 0.000 description 1
- LEYKQPDPZJIRTA-AQZXSJQPSA-N Asp-Trp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LEYKQPDPZJIRTA-AQZXSJQPSA-N 0.000 description 1
- OYSYWMMZGJSQRB-AVGNSLFASA-N Asp-Tyr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O OYSYWMMZGJSQRB-AVGNSLFASA-N 0.000 description 1
- 108010083946 Asp-Tyr-Leu-Lys Proteins 0.000 description 1
- BJDHEININLSZOT-KKUMJFAQSA-N Asp-Tyr-Lys Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(O)=O BJDHEININLSZOT-KKUMJFAQSA-N 0.000 description 1
- SQIARYGNVQWOSB-BZSNNMDCSA-N Asp-Tyr-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQIARYGNVQWOSB-BZSNNMDCSA-N 0.000 description 1
- BPAUXFVCSYQDQX-JRQIVUDYSA-N Asp-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)O)N)O BPAUXFVCSYQDQX-JRQIVUDYSA-N 0.000 description 1
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 1
- GFYOIYJJMSHLSN-QXEWZRGKSA-N Asp-Val-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GFYOIYJJMSHLSN-QXEWZRGKSA-N 0.000 description 1
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 1
- WAEDSQFVZJUHLI-BYULHYEWSA-N Asp-Val-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WAEDSQFVZJUHLI-BYULHYEWSA-N 0.000 description 1
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 1
- XQFLFQWOBXPMHW-NHCYSSNCSA-N Asp-Val-His Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O XQFLFQWOBXPMHW-NHCYSSNCSA-N 0.000 description 1
- GXIUDSXIUSTSLO-QXEWZRGKSA-N Asp-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N GXIUDSXIUSTSLO-QXEWZRGKSA-N 0.000 description 1
- 241000193755 Bacillus cereus Species 0.000 description 1
- 241001661602 Bacillus infantis Species 0.000 description 1
- 241001226316 Bacillus infantis NRRL B-14911 Species 0.000 description 1
- 108010029692 Bisphosphoglycerate mutase Proteins 0.000 description 1
- 241000588807 Bordetella Species 0.000 description 1
- 241000283725 Bos Species 0.000 description 1
- 241000186146 Brevibacterium Species 0.000 description 1
- FERIUCNNQQJTOY-UHFFFAOYSA-M Butyrate Chemical compound CCCC([O-])=O FERIUCNNQQJTOY-UHFFFAOYSA-M 0.000 description 1
- FERIUCNNQQJTOY-UHFFFAOYSA-N Butyric acid Natural products CCCC(O)=O FERIUCNNQQJTOY-UHFFFAOYSA-N 0.000 description 1
- 108091033409 CRISPR Proteins 0.000 description 1
- 238000010354 CRISPR gene editing Methods 0.000 description 1
- 241000589876 Campylobacter Species 0.000 description 1
- 108090000489 Carboxy-Lyases Proteins 0.000 description 1
- 102000004031 Carboxy-Lyases Human genes 0.000 description 1
- 108090000994 Catalytic RNA Proteins 0.000 description 1
- 102000053642 Catalytic RNA Human genes 0.000 description 1
- 241001660259 Cereus <cactus> Species 0.000 description 1
- 240000001817 Cereus hexagonus Species 0.000 description 1
- 108010020495 Citrate CoA-transferase Proteins 0.000 description 1
- 101710192913 Citrate synthase 2 Proteins 0.000 description 1
- 102100037021 Citrate synthase, mitochondrial Human genes 0.000 description 1
- 108010050785 Citryl-CoA lyase Proteins 0.000 description 1
- 241001112695 Clostridiales Species 0.000 description 1
- 241000193454 Clostridium beijerinckii Species 0.000 description 1
- 241001508458 Clostridium saccharoperbutylacetonicum Species 0.000 description 1
- 241000193464 Clostridium sp. Species 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 241000186216 Corynebacterium Species 0.000 description 1
- 241000989055 Cronobacter Species 0.000 description 1
- 241001528480 Cupriavidus Species 0.000 description 1
- 241000928573 Cutibacterium Species 0.000 description 1
- XEEIQMGZRFFSRD-XVYDVKMFSA-N Cys-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N XEEIQMGZRFFSRD-XVYDVKMFSA-N 0.000 description 1
- PRXCTTWKGJAPMT-ZLUOBGJFSA-N Cys-Ala-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O PRXCTTWKGJAPMT-ZLUOBGJFSA-N 0.000 description 1
- DCXGXDGGXVZVMY-GHCJXIJMSA-N Cys-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CS DCXGXDGGXVZVMY-GHCJXIJMSA-N 0.000 description 1
- ASHTVGGFIMESRD-LKXGYXEUSA-N Cys-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N)O ASHTVGGFIMESRD-LKXGYXEUSA-N 0.000 description 1
- HNNGTYHNYDOSKV-FXQIFTODSA-N Cys-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CS)N HNNGTYHNYDOSKV-FXQIFTODSA-N 0.000 description 1
- BVFQOPGFOQVZTE-ACZMJKKPSA-N Cys-Gln-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O BVFQOPGFOQVZTE-ACZMJKKPSA-N 0.000 description 1
- BDWIZLQVVWQMTB-XKBZYTNZSA-N Cys-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N)O BDWIZLQVVWQMTB-XKBZYTNZSA-N 0.000 description 1
- VIRYODQIWJNWNU-NRPADANISA-N Cys-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N VIRYODQIWJNWNU-NRPADANISA-N 0.000 description 1
- RWAZRMXTVSIVJR-YUMQZZPRSA-N Cys-Gly-His Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC1=CNC=N1)C(O)=O RWAZRMXTVSIVJR-YUMQZZPRSA-N 0.000 description 1
- XTHUKRLJRUVVBF-WHFBIAKZSA-N Cys-Gly-Ser Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O XTHUKRLJRUVVBF-WHFBIAKZSA-N 0.000 description 1
- UVZFZTWNHOQWNK-NAKRPEOUSA-N Cys-Ile-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UVZFZTWNHOQWNK-NAKRPEOUSA-N 0.000 description 1
- LKUCSUGWHYVYLP-GHCJXIJMSA-N Cys-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N LKUCSUGWHYVYLP-GHCJXIJMSA-N 0.000 description 1
- PDRMRVHPAQKTLT-NAKRPEOUSA-N Cys-Ile-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O PDRMRVHPAQKTLT-NAKRPEOUSA-N 0.000 description 1
- XLLSMEFANRROJE-GUBZILKMSA-N Cys-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N XLLSMEFANRROJE-GUBZILKMSA-N 0.000 description 1
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 1
- XZKJEOMFLDVXJG-KATARQTJSA-N Cys-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)N)O XZKJEOMFLDVXJG-KATARQTJSA-N 0.000 description 1
- OHLLDUNVMPPUMD-DCAQKATOSA-N Cys-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N OHLLDUNVMPPUMD-DCAQKATOSA-N 0.000 description 1
- ZXCAQANTQWBICD-DCAQKATOSA-N Cys-Lys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N ZXCAQANTQWBICD-DCAQKATOSA-N 0.000 description 1
- UIKLEGZPIOXFHJ-DLOVCJGASA-N Cys-Phe-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O UIKLEGZPIOXFHJ-DLOVCJGASA-N 0.000 description 1
- YYLBXQJGWOQZOU-IHRRRGAJSA-N Cys-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N YYLBXQJGWOQZOU-IHRRRGAJSA-N 0.000 description 1
- ABLQPNMKLMFDQU-BIIVOSGPSA-N Cys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CS)N)C(=O)O ABLQPNMKLMFDQU-BIIVOSGPSA-N 0.000 description 1
- YNJBLTDKTMKEET-ZLUOBGJFSA-N Cys-Ser-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O YNJBLTDKTMKEET-ZLUOBGJFSA-N 0.000 description 1
- IXPSSIBVVKSOIE-SRVKXCTJSA-N Cys-Ser-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N)O IXPSSIBVVKSOIE-SRVKXCTJSA-N 0.000 description 1
- NDNZRWUDUMTITL-FXQIFTODSA-N Cys-Ser-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NDNZRWUDUMTITL-FXQIFTODSA-N 0.000 description 1
- ALNKNYKSZPSLBD-ZDLURKLDSA-N Cys-Thr-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ALNKNYKSZPSLBD-ZDLURKLDSA-N 0.000 description 1
- FNXOZWPPOJRBRE-XGEHTFHBSA-N Cys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CS)N)O FNXOZWPPOJRBRE-XGEHTFHBSA-N 0.000 description 1
- 102000018832 Cytochromes Human genes 0.000 description 1
- 108010052832 Cytochromes Proteins 0.000 description 1
- 102100037579 D-3-phosphoglycerate dehydrogenase Human genes 0.000 description 1
- 108030001081 D-amino-acid transaminases Proteins 0.000 description 1
- RGHNJXZEOKUKBD-SQOUGZDYSA-M D-gluconate Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@@H](O)C([O-])=O RGHNJXZEOKUKBD-SQOUGZDYSA-M 0.000 description 1
- SHZGCJCMOBCMKK-UHFFFAOYSA-N D-mannomethylose Natural products CC1OC(O)C(O)C(O)C1O SHZGCJCMOBCMKK-UHFFFAOYSA-N 0.000 description 1
- 108010028127 Dihydrolipoamide Dehydrogenase Proteins 0.000 description 1
- 102000028526 Dihydrolipoamide Dehydrogenase Human genes 0.000 description 1
- 108700034189 EC 2.3.3.16 Proteins 0.000 description 1
- 241001379910 Ephemera danica Species 0.000 description 1
- 241000588698 Erwinia Species 0.000 description 1
- 101000717864 Escherichia coli (strain K12) Lactaldehyde dehydrogenase Proteins 0.000 description 1
- 241001131785 Escherichia coli HB101 Species 0.000 description 1
- VGGSQFUCUMXWEO-UHFFFAOYSA-N Ethene Chemical compound C=C VGGSQFUCUMXWEO-UHFFFAOYSA-N 0.000 description 1
- 239000005977 Ethylene Substances 0.000 description 1
- 240000002395 Euphorbia pulcherrima Species 0.000 description 1
- 208000033962 Fontaine progeroid syndrome Diseases 0.000 description 1
- 241000589601 Francisella Species 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 230000005526 G1 to G0 transition Effects 0.000 description 1
- 241000287826 Gallus Species 0.000 description 1
- 241000626621 Geobacillus Species 0.000 description 1
- 101000892220 Geobacillus thermodenitrificans (strain NG80-2) Long-chain-alcohol dehydrogenase 1 Proteins 0.000 description 1
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 1
- REJJNXODKSHOKA-ACZMJKKPSA-N Gln-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N REJJNXODKSHOKA-ACZMJKKPSA-N 0.000 description 1
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 1
- UWZLBXOBVKRUFE-HGNGGELXSA-N Gln-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N UWZLBXOBVKRUFE-HGNGGELXSA-N 0.000 description 1
- LKUWAWGNJYJODH-KBIXCLLPSA-N Gln-Ala-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKUWAWGNJYJODH-KBIXCLLPSA-N 0.000 description 1
- MLZRSFQRBDNJON-GUBZILKMSA-N Gln-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MLZRSFQRBDNJON-GUBZILKMSA-N 0.000 description 1
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 1
- DLOHWQXXGMEZDW-CIUDSAMLSA-N Gln-Arg-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DLOHWQXXGMEZDW-CIUDSAMLSA-N 0.000 description 1
- KJRXLVZYJJLUCV-DCAQKATOSA-N Gln-Arg-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O KJRXLVZYJJLUCV-DCAQKATOSA-N 0.000 description 1
- PHZYLYASFWHLHJ-FXQIFTODSA-N Gln-Asn-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PHZYLYASFWHLHJ-FXQIFTODSA-N 0.000 description 1
- WQWMZOIPXWSZNE-WDSKDSINSA-N Gln-Asp-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O WQWMZOIPXWSZNE-WDSKDSINSA-N 0.000 description 1
- IXFVOPOHSRKJNG-LAEOZQHASA-N Gln-Asp-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IXFVOPOHSRKJNG-LAEOZQHASA-N 0.000 description 1
- PKVWNYGXMNWJSI-CIUDSAMLSA-N Gln-Gln-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKVWNYGXMNWJSI-CIUDSAMLSA-N 0.000 description 1
- LVNILKSSFHCSJZ-IHRRRGAJSA-N Gln-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LVNILKSSFHCSJZ-IHRRRGAJSA-N 0.000 description 1
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 1
- KCJJFESQRXGTGC-BQBZGAKWSA-N Gln-Glu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O KCJJFESQRXGTGC-BQBZGAKWSA-N 0.000 description 1
- KDXKFBSNIJYNNR-YVNDNENWSA-N Gln-Glu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDXKFBSNIJYNNR-YVNDNENWSA-N 0.000 description 1
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 1
- MAGNEQBFSBREJL-DCAQKATOSA-N Gln-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N MAGNEQBFSBREJL-DCAQKATOSA-N 0.000 description 1
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 1
- VSXBYIJUAXPAAL-WDSKDSINSA-N Gln-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O VSXBYIJUAXPAAL-WDSKDSINSA-N 0.000 description 1
- NSNUZSPSADIMJQ-WDSKDSINSA-N Gln-Gly-Asp Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NSNUZSPSADIMJQ-WDSKDSINSA-N 0.000 description 1
- CLPQUWHBWXFJOX-BQBZGAKWSA-N Gln-Gly-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O CLPQUWHBWXFJOX-BQBZGAKWSA-N 0.000 description 1
- YXQCLIVLWCKCRS-RYUDHWBXSA-N Gln-Gly-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N)O YXQCLIVLWCKCRS-RYUDHWBXSA-N 0.000 description 1
- ORYMMTRPKVTGSJ-XVKPBYJWSA-N Gln-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O ORYMMTRPKVTGSJ-XVKPBYJWSA-N 0.000 description 1
- GIVHPCWYVWUUSG-HVTMNAMFSA-N Gln-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GIVHPCWYVWUUSG-HVTMNAMFSA-N 0.000 description 1
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 1
- LGIKBBLQVSWUGK-DCAQKATOSA-N Gln-Leu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGIKBBLQVSWUGK-DCAQKATOSA-N 0.000 description 1
- SHAUZYVSXAMYAZ-JYJNAYRXSA-N Gln-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SHAUZYVSXAMYAZ-JYJNAYRXSA-N 0.000 description 1
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 1
- IHSGESFHTMFHRB-GUBZILKMSA-N Gln-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O IHSGESFHTMFHRB-GUBZILKMSA-N 0.000 description 1
- GURIQZQSTBBHRV-SRVKXCTJSA-N Gln-Lys-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GURIQZQSTBBHRV-SRVKXCTJSA-N 0.000 description 1
- TWIAMTNJOMRDAK-GUBZILKMSA-N Gln-Lys-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O TWIAMTNJOMRDAK-GUBZILKMSA-N 0.000 description 1
- UWKPRVKWEKEMSY-DCAQKATOSA-N Gln-Lys-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWKPRVKWEKEMSY-DCAQKATOSA-N 0.000 description 1
- LURQDGKYBFWWJA-MNXVOIDGSA-N Gln-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N LURQDGKYBFWWJA-MNXVOIDGSA-N 0.000 description 1
- VHLZDSUANXBJHW-QWRGUYRKSA-N Gln-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VHLZDSUANXBJHW-QWRGUYRKSA-N 0.000 description 1
- SWDSRANUCKNBLA-AVGNSLFASA-N Gln-Phe-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SWDSRANUCKNBLA-AVGNSLFASA-N 0.000 description 1
- WHVLABLIJYGVEK-QEWYBTABSA-N Gln-Phe-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WHVLABLIJYGVEK-QEWYBTABSA-N 0.000 description 1
- XUMFMAVDHQDATI-DCAQKATOSA-N Gln-Pro-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XUMFMAVDHQDATI-DCAQKATOSA-N 0.000 description 1
- MFORDNZDKAVNSR-SRVKXCTJSA-N Gln-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O MFORDNZDKAVNSR-SRVKXCTJSA-N 0.000 description 1
- VNTGPISAOMAXRK-CIUDSAMLSA-N Gln-Pro-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O VNTGPISAOMAXRK-CIUDSAMLSA-N 0.000 description 1
- NYCVMJGIJYQWDO-CIUDSAMLSA-N Gln-Ser-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NYCVMJGIJYQWDO-CIUDSAMLSA-N 0.000 description 1
- OKARHJKJTKFQBM-ACZMJKKPSA-N Gln-Ser-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OKARHJKJTKFQBM-ACZMJKKPSA-N 0.000 description 1
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 1
- KPNWAJMEMRCLAL-GUBZILKMSA-N Gln-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N KPNWAJMEMRCLAL-GUBZILKMSA-N 0.000 description 1
- OKQLXOYFUPVEHI-CIUDSAMLSA-N Gln-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N OKQLXOYFUPVEHI-CIUDSAMLSA-N 0.000 description 1
- DUGYCMAIAKAQPB-GLLZPBPUSA-N Gln-Thr-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DUGYCMAIAKAQPB-GLLZPBPUSA-N 0.000 description 1
- NVHJGTGTUGEWCG-ZVZYQTTQSA-N Gln-Trp-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O NVHJGTGTUGEWCG-ZVZYQTTQSA-N 0.000 description 1
- OACQOWPRWGNKTP-AVGNSLFASA-N Gln-Tyr-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O OACQOWPRWGNKTP-AVGNSLFASA-N 0.000 description 1
- UBRQJXFDVZNYJP-AVGNSLFASA-N Gln-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UBRQJXFDVZNYJP-AVGNSLFASA-N 0.000 description 1
- ICRKQMRFXYDYMK-LAEOZQHASA-N Gln-Val-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ICRKQMRFXYDYMK-LAEOZQHASA-N 0.000 description 1
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 1
- BBFCMGBMYIAGRS-AUTRQRHGSA-N Gln-Val-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BBFCMGBMYIAGRS-AUTRQRHGSA-N 0.000 description 1
- QGWXAMDECCKGRU-XVKPBYJWSA-N Gln-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(N)=O)C(=O)NCC(O)=O QGWXAMDECCKGRU-XVKPBYJWSA-N 0.000 description 1
- QZQYITIKPAUDGN-GVXVVHGQSA-N Gln-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N QZQYITIKPAUDGN-GVXVVHGQSA-N 0.000 description 1
- SDSMVVSHLAAOJL-UKJIMTQDSA-N Gln-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N SDSMVVSHLAAOJL-UKJIMTQDSA-N 0.000 description 1
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 1
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 1
- ZMXZGYLINVNTKH-DZKIICNBSA-N Gln-Val-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZMXZGYLINVNTKH-DZKIICNBSA-N 0.000 description 1
- FTMLQFPULNGION-ZVZYQTTQSA-N Gln-Val-Trp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O FTMLQFPULNGION-ZVZYQTTQSA-N 0.000 description 1
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 1
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 1
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 1
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 1
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 1
- RSUVOPBMWMTVDI-XEGUGMAKSA-N Glu-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCC(O)=O)C)C(O)=O)=CNC2=C1 RSUVOPBMWMTVDI-XEGUGMAKSA-N 0.000 description 1
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 1
- WOMUDRVDJMHTCV-DCAQKATOSA-N Glu-Arg-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOMUDRVDJMHTCV-DCAQKATOSA-N 0.000 description 1
- CVPXINNKRTZBMO-CIUDSAMLSA-N Glu-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N CVPXINNKRTZBMO-CIUDSAMLSA-N 0.000 description 1
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 1
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 description 1
- SYDJILXOZNEEDK-XIRDDKMYSA-N Glu-Arg-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O SYDJILXOZNEEDK-XIRDDKMYSA-N 0.000 description 1
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- LJLPOZGRPLORTF-CIUDSAMLSA-N Glu-Asn-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O LJLPOZGRPLORTF-CIUDSAMLSA-N 0.000 description 1
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 1
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 1
- VAIWPXWHWAPYDF-FXQIFTODSA-N Glu-Asp-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O VAIWPXWHWAPYDF-FXQIFTODSA-N 0.000 description 1
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 1
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 1
- GZWOBWMOMPFPCD-CIUDSAMLSA-N Glu-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N GZWOBWMOMPFPCD-CIUDSAMLSA-N 0.000 description 1
- CYHBMLHCQXXCCT-AVGNSLFASA-N Glu-Asp-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CYHBMLHCQXXCCT-AVGNSLFASA-N 0.000 description 1
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 1
- LSTFYPOGBGFIPP-FXQIFTODSA-N Glu-Cys-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O LSTFYPOGBGFIPP-FXQIFTODSA-N 0.000 description 1
- XKPOCESCRTVRPL-KBIXCLLPSA-N Glu-Cys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XKPOCESCRTVRPL-KBIXCLLPSA-N 0.000 description 1
- FKGNJUCQKXQNRA-NRPADANISA-N Glu-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(O)=O FKGNJUCQKXQNRA-NRPADANISA-N 0.000 description 1
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 1
- XHUCVVHRLNPZSZ-CIUDSAMLSA-N Glu-Gln-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XHUCVVHRLNPZSZ-CIUDSAMLSA-N 0.000 description 1
- LVCHEMOPBORRLB-DCAQKATOSA-N Glu-Gln-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O LVCHEMOPBORRLB-DCAQKATOSA-N 0.000 description 1
- RFDHKPSHTXZKLL-IHRRRGAJSA-N Glu-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N RFDHKPSHTXZKLL-IHRRRGAJSA-N 0.000 description 1
- WLIPTFCZLHCNFD-LPEHRKFASA-N Glu-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O WLIPTFCZLHCNFD-LPEHRKFASA-N 0.000 description 1
- WPLGNDORMXTMQS-FXQIFTODSA-N Glu-Gln-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O WPLGNDORMXTMQS-FXQIFTODSA-N 0.000 description 1
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 1
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 1
- KASDBWKLWJKTLJ-GUBZILKMSA-N Glu-Glu-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O KASDBWKLWJKTLJ-GUBZILKMSA-N 0.000 description 1
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 1
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 1
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 1
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 1
- CAVMESABQIKFKT-IUCAKERBSA-N Glu-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N CAVMESABQIKFKT-IUCAKERBSA-N 0.000 description 1
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 1
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 1
- VXQOONWNIWFOCS-HGNGGELXSA-N Glu-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N VXQOONWNIWFOCS-HGNGGELXSA-N 0.000 description 1
- XIKYNVKEUINBGL-IUCAKERBSA-N Glu-His-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O XIKYNVKEUINBGL-IUCAKERBSA-N 0.000 description 1
- BIHMNDPWRUROFZ-JYJNAYRXSA-N Glu-His-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BIHMNDPWRUROFZ-JYJNAYRXSA-N 0.000 description 1
- WVTIBGWZUMJBFY-GUBZILKMSA-N Glu-His-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O WVTIBGWZUMJBFY-GUBZILKMSA-N 0.000 description 1
- WDTAKCUOIKHCTB-NKIYYHGXSA-N Glu-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N)O WDTAKCUOIKHCTB-NKIYYHGXSA-N 0.000 description 1
- ZPASCJBSSCRWMC-GVXVVHGQSA-N Glu-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N ZPASCJBSSCRWMC-GVXVVHGQSA-N 0.000 description 1
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 1
- YVYVMJNUENBOOL-KBIXCLLPSA-N Glu-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N YVYVMJNUENBOOL-KBIXCLLPSA-N 0.000 description 1
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 1
- GRHXUHCFENOCOS-ZPFDUUQYSA-N Glu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N GRHXUHCFENOCOS-ZPFDUUQYSA-N 0.000 description 1
- GXMXPCXXKVWOSM-KQXIARHKSA-N Glu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N GXMXPCXXKVWOSM-KQXIARHKSA-N 0.000 description 1
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 1
- KRRFFAHEAOCBCQ-SIUGBPQLSA-N Glu-Ile-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KRRFFAHEAOCBCQ-SIUGBPQLSA-N 0.000 description 1
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 1
- DWBBKNPKDHXIAC-SRVKXCTJSA-N Glu-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCC(O)=O DWBBKNPKDHXIAC-SRVKXCTJSA-N 0.000 description 1
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 1
- SWRVAQHFBRZVNX-GUBZILKMSA-N Glu-Lys-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SWRVAQHFBRZVNX-GUBZILKMSA-N 0.000 description 1
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 1
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 1
- AQNYKMCFCCZEEL-JYJNAYRXSA-N Glu-Lys-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AQNYKMCFCCZEEL-JYJNAYRXSA-N 0.000 description 1
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 1
- ZQYZDDXTNQXUJH-CIUDSAMLSA-N Glu-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)O)N ZQYZDDXTNQXUJH-CIUDSAMLSA-N 0.000 description 1
- ZWMYUDZLXAQHCK-CIUDSAMLSA-N Glu-Met-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O ZWMYUDZLXAQHCK-CIUDSAMLSA-N 0.000 description 1
- CBEUFCJRFNZMCU-SRVKXCTJSA-N Glu-Met-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O CBEUFCJRFNZMCU-SRVKXCTJSA-N 0.000 description 1
- XEKAJTCACGEBOK-KKUMJFAQSA-N Glu-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XEKAJTCACGEBOK-KKUMJFAQSA-N 0.000 description 1
- ZIYGTCDTJJCDDP-JYJNAYRXSA-N Glu-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZIYGTCDTJJCDDP-JYJNAYRXSA-N 0.000 description 1
- CBWKURKPYSLMJV-SOUVJXGZSA-N Glu-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CBWKURKPYSLMJV-SOUVJXGZSA-N 0.000 description 1
- ITVBKCZZLJUUHI-HTUGSXCWSA-N Glu-Phe-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ITVBKCZZLJUUHI-HTUGSXCWSA-N 0.000 description 1
- KXTAGESXNQEZKB-DZKIICNBSA-N Glu-Phe-Val Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 KXTAGESXNQEZKB-DZKIICNBSA-N 0.000 description 1
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 1
- TWYFJOHWGCCRIR-DCAQKATOSA-N Glu-Pro-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYFJOHWGCCRIR-DCAQKATOSA-N 0.000 description 1
- HLYCMRDRWGSTPZ-CIUDSAMLSA-N Glu-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CS)C(=O)O HLYCMRDRWGSTPZ-CIUDSAMLSA-N 0.000 description 1
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 1
- ZAPFAWQHBOHWLL-GUBZILKMSA-N Glu-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N ZAPFAWQHBOHWLL-GUBZILKMSA-N 0.000 description 1
- QOXDAWODGSIDDI-GUBZILKMSA-N Glu-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N QOXDAWODGSIDDI-GUBZILKMSA-N 0.000 description 1
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 1
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 1
- RGJKYNUINKGPJN-RWRJDSDZSA-N Glu-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(=O)O)N RGJKYNUINKGPJN-RWRJDSDZSA-N 0.000 description 1
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 1
- ZGXGVBYEJGVJMV-HJGDQZAQSA-N Glu-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O ZGXGVBYEJGVJMV-HJGDQZAQSA-N 0.000 description 1
- PEKRLYMGPZFTCB-WNHJNPCNSA-N Glu-Trp-Asp-Arg Chemical compound N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O PEKRLYMGPZFTCB-WNHJNPCNSA-N 0.000 description 1
- BPCLDCNZBUYGOD-BPUTZDHNSA-N Glu-Trp-Glu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 BPCLDCNZBUYGOD-BPUTZDHNSA-N 0.000 description 1
- HVKAAUOFFTUSAA-XDTLVQLUSA-N Glu-Tyr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O HVKAAUOFFTUSAA-XDTLVQLUSA-N 0.000 description 1
- HJTSRYLPAYGEEC-SIUGBPQLSA-N Glu-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N HJTSRYLPAYGEEC-SIUGBPQLSA-N 0.000 description 1
- HBMRTXJZQDVRFT-DZKIICNBSA-N Glu-Tyr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HBMRTXJZQDVRFT-DZKIICNBSA-N 0.000 description 1
- UZWUBBRJWFTHTD-LAEOZQHASA-N Glu-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O UZWUBBRJWFTHTD-LAEOZQHASA-N 0.000 description 1
- KCCNSVHJSMMGFS-NRPADANISA-N Glu-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KCCNSVHJSMMGFS-NRPADANISA-N 0.000 description 1
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 1
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 1
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 1
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 1
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 1
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 1
- 241000589232 Gluconobacter oxydans Species 0.000 description 1
- 108030000917 Glutamine-pyruvate transaminases Proteins 0.000 description 1
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 1
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 1
- FKJQNJCQTKUBCD-XPUUQOCRSA-N Gly-Ala-His Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O FKJQNJCQTKUBCD-XPUUQOCRSA-N 0.000 description 1
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 1
- PHONXOACARQMPM-BQBZGAKWSA-N Gly-Ala-Met Chemical compound [H]NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O PHONXOACARQMPM-BQBZGAKWSA-N 0.000 description 1
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 1
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 1
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 1
- XUDLUKYPXQDCRX-BQBZGAKWSA-N Gly-Arg-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O XUDLUKYPXQDCRX-BQBZGAKWSA-N 0.000 description 1
- RQZGFWKQLPJOEQ-YUMQZZPRSA-N Gly-Arg-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)CN)CN=C(N)N RQZGFWKQLPJOEQ-YUMQZZPRSA-N 0.000 description 1
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 1
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 1
- DJTXYXZNNDDEOU-WHFBIAKZSA-N Gly-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)C(=O)N DJTXYXZNNDDEOU-WHFBIAKZSA-N 0.000 description 1
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 1
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 1
- GNBMOZPQUXTCRW-STQMWFEESA-N Gly-Asn-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)CN)C(O)=O)=CNC2=C1 GNBMOZPQUXTCRW-STQMWFEESA-N 0.000 description 1
- IWAXHBCACVWNHT-BQBZGAKWSA-N Gly-Asp-Arg Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IWAXHBCACVWNHT-BQBZGAKWSA-N 0.000 description 1
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 1
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 1
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 1
- RPLLQZBOVIVGMX-QWRGUYRKSA-N Gly-Asp-Phe Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RPLLQZBOVIVGMX-QWRGUYRKSA-N 0.000 description 1
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 1
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 1
- JPWIMMUNWUKOAD-STQMWFEESA-N Gly-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN JPWIMMUNWUKOAD-STQMWFEESA-N 0.000 description 1
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 1
- BPQYBFAXRGMGGY-LAEOZQHASA-N Gly-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN BPQYBFAXRGMGGY-LAEOZQHASA-N 0.000 description 1
- NPSWCZIRBAYNSB-JHEQGTHGSA-N Gly-Gln-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPSWCZIRBAYNSB-JHEQGTHGSA-N 0.000 description 1
- QPDUVFSVVAOUHE-XVKPBYJWSA-N Gly-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CN)C(O)=O QPDUVFSVVAOUHE-XVKPBYJWSA-N 0.000 description 1
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 1
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 1
- LHRXAHLCRMQBGJ-RYUDHWBXSA-N Gly-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN LHRXAHLCRMQBGJ-RYUDHWBXSA-N 0.000 description 1
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 1
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 1
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 1
- FQKKPCWTZZEDIC-XPUUQOCRSA-N Gly-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 FQKKPCWTZZEDIC-XPUUQOCRSA-N 0.000 description 1
- TVDHVLGFJSHPAX-UWVGGRQHSA-N Gly-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 TVDHVLGFJSHPAX-UWVGGRQHSA-N 0.000 description 1
- UUWOBINZFGTFMS-UWVGGRQHSA-N Gly-His-Met Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(O)=O UUWOBINZFGTFMS-UWVGGRQHSA-N 0.000 description 1
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 1
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 1
- DENRBIYENOKSEX-PEXQALLHSA-N Gly-Ile-His Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DENRBIYENOKSEX-PEXQALLHSA-N 0.000 description 1
- FCKPEGOCSVZPNC-WHOFXGATSA-N Gly-Ile-Phe Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FCKPEGOCSVZPNC-WHOFXGATSA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 1
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 1
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 1
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 1
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 1
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 1
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 1
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 1
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 1
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 1
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 1
- DBJYVKDPGIFXFO-BQBZGAKWSA-N Gly-Met-Ala Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O DBJYVKDPGIFXFO-BQBZGAKWSA-N 0.000 description 1
- ICUTTWWCDIIIEE-BQBZGAKWSA-N Gly-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN ICUTTWWCDIIIEE-BQBZGAKWSA-N 0.000 description 1
- HHRODZSXDXMUHS-LURJTMIESA-N Gly-Met-Gly Chemical compound CSCC[C@H](NC(=O)C[NH3+])C(=O)NCC([O-])=O HHRODZSXDXMUHS-LURJTMIESA-N 0.000 description 1
- RVGMVLVBDRQVKB-UWVGGRQHSA-N Gly-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN RVGMVLVBDRQVKB-UWVGGRQHSA-N 0.000 description 1
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 1
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 1
- DHNXGWVNLFPOMQ-KBPBESRZSA-N Gly-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN DHNXGWVNLFPOMQ-KBPBESRZSA-N 0.000 description 1
- FEUPVVCGQLNXNP-IRXDYDNUSA-N Gly-Phe-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FEUPVVCGQLNXNP-IRXDYDNUSA-N 0.000 description 1
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 1
- WDXLKVQATNEAJQ-BQBZGAKWSA-N Gly-Pro-Asp Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WDXLKVQATNEAJQ-BQBZGAKWSA-N 0.000 description 1
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 1
- IXHQLZIWBCQBLQ-STQMWFEESA-N Gly-Pro-Phe Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IXHQLZIWBCQBLQ-STQMWFEESA-N 0.000 description 1
- GLACUWHUYFBSPJ-FJXKBIBVSA-N Gly-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GLACUWHUYFBSPJ-FJXKBIBVSA-N 0.000 description 1
- JNGHLWWFPGIJER-STQMWFEESA-N Gly-Pro-Tyr Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JNGHLWWFPGIJER-STQMWFEESA-N 0.000 description 1
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 1
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 1
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 1
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 1
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 1
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 1
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 1
- PYFHPYDQHCEVIT-KBPBESRZSA-N Gly-Trp-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O PYFHPYDQHCEVIT-KBPBESRZSA-N 0.000 description 1
- ONSARSFSJHTMFJ-STQMWFEESA-N Gly-Trp-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ONSARSFSJHTMFJ-STQMWFEESA-N 0.000 description 1
- IROABALAWGJQGM-OALUTQOASA-N Gly-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)CN IROABALAWGJQGM-OALUTQOASA-N 0.000 description 1
- YJDALMUYJIENAG-QWRGUYRKSA-N Gly-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN)O YJDALMUYJIENAG-QWRGUYRKSA-N 0.000 description 1
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 1
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 1
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 1
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 1
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 1
- 102000000587 Glycerolphosphate Dehydrogenase Human genes 0.000 description 1
- 108010041921 Glycerolphosphate Dehydrogenase Proteins 0.000 description 1
- 108010043428 Glycine hydroxymethyltransferase Proteins 0.000 description 1
- 102100030648 Glyoxylate reductase/hydroxypyruvate reductase Human genes 0.000 description 1
- 241000193004 Halobacillus Species 0.000 description 1
- 241000589989 Helicobacter Species 0.000 description 1
- BIAKMWKJMQLZOJ-ZKWXMUAHSA-N His-Ala-Ala Chemical compound C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O BIAKMWKJMQLZOJ-ZKWXMUAHSA-N 0.000 description 1
- DCRODRAURLJOFY-XPUUQOCRSA-N His-Ala-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)NCC(O)=O DCRODRAURLJOFY-XPUUQOCRSA-N 0.000 description 1
- IPIVXQQRZXEUGW-UWJYBYFXSA-N His-Ala-His Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IPIVXQQRZXEUGW-UWJYBYFXSA-N 0.000 description 1
- DZMVESFTHXSSPZ-XVYDVKMFSA-N His-Ala-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DZMVESFTHXSSPZ-XVYDVKMFSA-N 0.000 description 1
- YPLYIXGKCRQZGW-SRVKXCTJSA-N His-Arg-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YPLYIXGKCRQZGW-SRVKXCTJSA-N 0.000 description 1
- MWAJSVTZZOUOBU-IHRRRGAJSA-N His-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 MWAJSVTZZOUOBU-IHRRRGAJSA-N 0.000 description 1
- SOFSRBYHDINIRG-QTKMDUPCSA-N His-Arg-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N)O SOFSRBYHDINIRG-QTKMDUPCSA-N 0.000 description 1
- TTZAWSKKNCEINZ-AVGNSLFASA-N His-Arg-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O TTZAWSKKNCEINZ-AVGNSLFASA-N 0.000 description 1
- MAABHGXCIBEYQR-XVYDVKMFSA-N His-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N MAABHGXCIBEYQR-XVYDVKMFSA-N 0.000 description 1
- UOAVQQRILDGZEN-SRVKXCTJSA-N His-Asp-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UOAVQQRILDGZEN-SRVKXCTJSA-N 0.000 description 1
- UPGJWSUYENXOPV-HGNGGELXSA-N His-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N UPGJWSUYENXOPV-HGNGGELXSA-N 0.000 description 1
- VYMGAXSNYUFVCK-GUBZILKMSA-N His-Gln-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N VYMGAXSNYUFVCK-GUBZILKMSA-N 0.000 description 1
- ZNNNYCXPCKACHX-DCAQKATOSA-N His-Gln-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZNNNYCXPCKACHX-DCAQKATOSA-N 0.000 description 1
- SWSVTNGMKBDTBM-DCAQKATOSA-N His-Gln-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SWSVTNGMKBDTBM-DCAQKATOSA-N 0.000 description 1
- FLYSHWAAHYNKRT-JYJNAYRXSA-N His-Gln-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FLYSHWAAHYNKRT-JYJNAYRXSA-N 0.000 description 1
- SDTPKSOWFXBACN-GUBZILKMSA-N His-Glu-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O SDTPKSOWFXBACN-GUBZILKMSA-N 0.000 description 1
- FIMNVXRZGUAGBI-AVGNSLFASA-N His-Glu-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FIMNVXRZGUAGBI-AVGNSLFASA-N 0.000 description 1
- PQKCQZHAGILVIM-NKIYYHGXSA-N His-Glu-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O PQKCQZHAGILVIM-NKIYYHGXSA-N 0.000 description 1
- FDQYIRHBVVUTJF-ZETCQYMHSA-N His-Gly-Gly Chemical compound [O-]C(=O)CNC(=O)CNC(=O)[C@@H]([NH3+])CC1=CN=CN1 FDQYIRHBVVUTJF-ZETCQYMHSA-N 0.000 description 1
- PGTISAJTWZPFGN-PEXQALLHSA-N His-Gly-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O PGTISAJTWZPFGN-PEXQALLHSA-N 0.000 description 1
- JIUYRPFQJJRSJB-QWRGUYRKSA-N His-His-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)NCC(O)=O)C1=CN=CN1 JIUYRPFQJJRSJB-QWRGUYRKSA-N 0.000 description 1
- MPXGJGBXCRQQJE-MXAVVETBSA-N His-Ile-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O MPXGJGBXCRQQJE-MXAVVETBSA-N 0.000 description 1
- ZSKJIISDJXJQPV-BZSNNMDCSA-N His-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 ZSKJIISDJXJQPV-BZSNNMDCSA-N 0.000 description 1
- GJMHMDKCJPQJOI-IHRRRGAJSA-N His-Lys-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 GJMHMDKCJPQJOI-IHRRRGAJSA-N 0.000 description 1
- JUIOPCXACJLRJK-AVGNSLFASA-N His-Lys-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N JUIOPCXACJLRJK-AVGNSLFASA-N 0.000 description 1
- LDFWDDVELNOGII-MXAVVETBSA-N His-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N LDFWDDVELNOGII-MXAVVETBSA-N 0.000 description 1
- BKOVCRUIXDIWFV-IXOXFDKPSA-N His-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 BKOVCRUIXDIWFV-IXOXFDKPSA-N 0.000 description 1
- YXXKBPJEIYFGOD-MGHWNKPDSA-N His-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N YXXKBPJEIYFGOD-MGHWNKPDSA-N 0.000 description 1
- XJFITURPHAKKAI-SRVKXCTJSA-N His-Pro-Gln Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CN=CN1 XJFITURPHAKKAI-SRVKXCTJSA-N 0.000 description 1
- PYNPBMCLAKTHJL-SRVKXCTJSA-N His-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O PYNPBMCLAKTHJL-SRVKXCTJSA-N 0.000 description 1
- PBVQWNDMFFCPIZ-ULQDDVLXSA-N His-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 PBVQWNDMFFCPIZ-ULQDDVLXSA-N 0.000 description 1
- LNDVNHOSZQPJGI-AVGNSLFASA-N His-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CN=CN1 LNDVNHOSZQPJGI-AVGNSLFASA-N 0.000 description 1
- JMSONHOUHFDOJH-GUBZILKMSA-N His-Ser-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 JMSONHOUHFDOJH-GUBZILKMSA-N 0.000 description 1
- BFOGZWSSGMLYKV-DCAQKATOSA-N His-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CN=CN1)N BFOGZWSSGMLYKV-DCAQKATOSA-N 0.000 description 1
- IAYPZSHNZQHQNO-KKUMJFAQSA-N His-Ser-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N IAYPZSHNZQHQNO-KKUMJFAQSA-N 0.000 description 1
- GIRSNERMXCMDBO-GARJFASQSA-N His-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O GIRSNERMXCMDBO-GARJFASQSA-N 0.000 description 1
- XVZJRZQIHJMUBG-TUBUOCAGSA-N His-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CN=CN1)N XVZJRZQIHJMUBG-TUBUOCAGSA-N 0.000 description 1
- ZHMZWSFQRUGLEC-JYJNAYRXSA-N His-Tyr-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZHMZWSFQRUGLEC-JYJNAYRXSA-N 0.000 description 1
- 108010020056 Hydrogenase Proteins 0.000 description 1
- 108010073073 Hydroxypyruvate decarboxylase Proteins 0.000 description 1
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 1
- JXUGDUWBMKIJDC-NAKRPEOUSA-N Ile-Ala-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JXUGDUWBMKIJDC-NAKRPEOUSA-N 0.000 description 1
- YPWHUFAAMNHMGS-QSFUFRPTSA-N Ile-Ala-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YPWHUFAAMNHMGS-QSFUFRPTSA-N 0.000 description 1
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 1
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 1
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 1
- TZCGZYWNIDZZMR-NAKRPEOUSA-N Ile-Arg-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)O)N TZCGZYWNIDZZMR-NAKRPEOUSA-N 0.000 description 1
- SACHLUOUHCVIKI-GMOBBJLQSA-N Ile-Arg-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SACHLUOUHCVIKI-GMOBBJLQSA-N 0.000 description 1
- FVEWRQXNISSYFO-ZPFDUUQYSA-N Ile-Arg-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FVEWRQXNISSYFO-ZPFDUUQYSA-N 0.000 description 1
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 1
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 1
- YKRIXHPEIZUDDY-GMOBBJLQSA-N Ile-Asn-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKRIXHPEIZUDDY-GMOBBJLQSA-N 0.000 description 1
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 1
- IPYVXYDYLHVWHU-GMOBBJLQSA-N Ile-Asn-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N IPYVXYDYLHVWHU-GMOBBJLQSA-N 0.000 description 1
- HDODQNPMSHDXJT-GHCJXIJMSA-N Ile-Asn-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O HDODQNPMSHDXJT-GHCJXIJMSA-N 0.000 description 1
- QIHJTGSVGIPHIW-QSFUFRPTSA-N Ile-Asn-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N QIHJTGSVGIPHIW-QSFUFRPTSA-N 0.000 description 1
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 1
- UDLAWRKOVFDKFL-PEFMBERDSA-N Ile-Asp-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UDLAWRKOVFDKFL-PEFMBERDSA-N 0.000 description 1
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 1
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 1
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 1
- PPSQSIDMOVPKPI-BJDJZHNGSA-N Ile-Cys-Leu Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O PPSQSIDMOVPKPI-BJDJZHNGSA-N 0.000 description 1
- VQUCKIAECLVLAD-SVSWQMSJSA-N Ile-Cys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VQUCKIAECLVLAD-SVSWQMSJSA-N 0.000 description 1
- JHCVYQKVKOLAIU-NAKRPEOUSA-N Ile-Cys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)O)N JHCVYQKVKOLAIU-NAKRPEOUSA-N 0.000 description 1
- KUHFPGIVBOCRMV-MNXVOIDGSA-N Ile-Gln-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N KUHFPGIVBOCRMV-MNXVOIDGSA-N 0.000 description 1
- LKACSKJPTFSBHR-MNXVOIDGSA-N Ile-Gln-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N LKACSKJPTFSBHR-MNXVOIDGSA-N 0.000 description 1
- WNQKUUQIVDDAFA-ZPFDUUQYSA-N Ile-Gln-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N WNQKUUQIVDDAFA-ZPFDUUQYSA-N 0.000 description 1
- YBJWJQQBWRARLT-KBIXCLLPSA-N Ile-Gln-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O YBJWJQQBWRARLT-KBIXCLLPSA-N 0.000 description 1
- MVLDERGQICFFLL-ZQINRCPSSA-N Ile-Gln-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 MVLDERGQICFFLL-ZQINRCPSSA-N 0.000 description 1
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 1
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 1
- LGMUPVWZEYYUMU-YVNDNENWSA-N Ile-Glu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LGMUPVWZEYYUMU-YVNDNENWSA-N 0.000 description 1
- IXEFKXAGHRQFAF-HVTMNAMFSA-N Ile-Glu-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N IXEFKXAGHRQFAF-HVTMNAMFSA-N 0.000 description 1
- XLCZWMJPVGRWHJ-KQXIARHKSA-N Ile-Glu-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N XLCZWMJPVGRWHJ-KQXIARHKSA-N 0.000 description 1
- JXMSHKFPDIUYGS-SIUGBPQLSA-N Ile-Glu-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N JXMSHKFPDIUYGS-SIUGBPQLSA-N 0.000 description 1
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 1
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 1
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 1
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 1
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 description 1
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 1
- JLWLMGADIQFKRD-QSFUFRPTSA-N Ile-His-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CN=CN1 JLWLMGADIQFKRD-QSFUFRPTSA-N 0.000 description 1
- UASTVUQJMLZWGG-PEXQALLHSA-N Ile-His-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N UASTVUQJMLZWGG-PEXQALLHSA-N 0.000 description 1
- YBGTWSFIGHUWQE-MXAVVETBSA-N Ile-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CN=CN1 YBGTWSFIGHUWQE-MXAVVETBSA-N 0.000 description 1
- KEKTTYCXKGBAAL-VGDYDELISA-N Ile-His-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N KEKTTYCXKGBAAL-VGDYDELISA-N 0.000 description 1
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 1
- CSQNHSGHAPRGPQ-YTFOTSKYSA-N Ile-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)O)N CSQNHSGHAPRGPQ-YTFOTSKYSA-N 0.000 description 1
- DMSVBUWGDLYNLC-IAVJCBSLSA-N Ile-Ile-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DMSVBUWGDLYNLC-IAVJCBSLSA-N 0.000 description 1
- UWLHDGMRWXHFFY-HPCHECBXSA-N Ile-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N1CCC[C@@H]1C(=O)O)N UWLHDGMRWXHFFY-HPCHECBXSA-N 0.000 description 1
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 1
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 1
- NUKXXNFEUZGPRO-BJDJZHNGSA-N Ile-Leu-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NUKXXNFEUZGPRO-BJDJZHNGSA-N 0.000 description 1
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 1
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 1
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 1
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 1
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 1
- MASWXTFJVNRZPT-NAKRPEOUSA-N Ile-Met-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)O)N MASWXTFJVNRZPT-NAKRPEOUSA-N 0.000 description 1
- IALVDKNUFSTICJ-GMOBBJLQSA-N Ile-Met-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IALVDKNUFSTICJ-GMOBBJLQSA-N 0.000 description 1
- ZUPJCJINYQISSN-XUXIUFHCSA-N Ile-Met-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUPJCJINYQISSN-XUXIUFHCSA-N 0.000 description 1
- IIWQTXMUALXGOV-PCBIJLKTSA-N Ile-Phe-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IIWQTXMUALXGOV-PCBIJLKTSA-N 0.000 description 1
- VOCZPDONPURUHV-QEWYBTABSA-N Ile-Phe-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VOCZPDONPURUHV-QEWYBTABSA-N 0.000 description 1
- WYUHAXJAMDTOAU-IAVJCBSLSA-N Ile-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N WYUHAXJAMDTOAU-IAVJCBSLSA-N 0.000 description 1
- LRAUKBMYHHNADU-DKIMLUQUSA-N Ile-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 LRAUKBMYHHNADU-DKIMLUQUSA-N 0.000 description 1
- VEPIBPGLTLPBDW-URLPEUOOSA-N Ile-Phe-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VEPIBPGLTLPBDW-URLPEUOOSA-N 0.000 description 1
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 1
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 1
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 1
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 1
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 1
- XMYURPUVJSKTMC-KBIXCLLPSA-N Ile-Ser-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XMYURPUVJSKTMC-KBIXCLLPSA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- SHVFUCSSACPBTF-VGDYDELISA-N Ile-Ser-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SHVFUCSSACPBTF-VGDYDELISA-N 0.000 description 1
- QQVXERGIFIRCGW-NAKRPEOUSA-N Ile-Ser-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N QQVXERGIFIRCGW-NAKRPEOUSA-N 0.000 description 1
- WLRJHVNFGAOYPS-HJPIBITLSA-N Ile-Ser-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N WLRJHVNFGAOYPS-HJPIBITLSA-N 0.000 description 1
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 1
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 1
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 1
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 1
- VBGCPJBKUXRYDA-DSYPUSFNSA-N Ile-Trp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCCN)C(=O)O)N VBGCPJBKUXRYDA-DSYPUSFNSA-N 0.000 description 1
- DTPGSUQHUMELQB-GVARAGBVSA-N Ile-Tyr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 DTPGSUQHUMELQB-GVARAGBVSA-N 0.000 description 1
- FXJLRZFMKGHYJP-CFMVVWHZSA-N Ile-Tyr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FXJLRZFMKGHYJP-CFMVVWHZSA-N 0.000 description 1
- REXAUQBGSGDEJY-IGISWZIWSA-N Ile-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N REXAUQBGSGDEJY-IGISWZIWSA-N 0.000 description 1
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 1
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 1
- ZGKVPOSSTGHJAF-HJPIBITLSA-N Ile-Tyr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CO)C(=O)O)N ZGKVPOSSTGHJAF-HJPIBITLSA-N 0.000 description 1
- WRDTXMBPHMBGIB-STECZYCISA-N Ile-Tyr-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 WRDTXMBPHMBGIB-STECZYCISA-N 0.000 description 1
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 1
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 1
- QSXSHZIRKTUXNG-STECZYCISA-N Ile-Val-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QSXSHZIRKTUXNG-STECZYCISA-N 0.000 description 1
- 241000588748 Klebsiella Species 0.000 description 1
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 1
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 1
- 108030000198 L-amino-acid dehydrogenases Proteins 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 1
- SHZGCJCMOBCMKK-JFNONXLTSA-N L-rhamnopyranose Chemical compound C[C@@H]1OC(O)[C@H](O)[C@H](O)[C@H]1O SHZGCJCMOBCMKK-JFNONXLTSA-N 0.000 description 1
- PNNNRSAQSRJVSB-UHFFFAOYSA-N L-rhamnose Natural products CC(O)C(O)C(O)C(O)C=O PNNNRSAQSRJVSB-UHFFFAOYSA-N 0.000 description 1
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 1
- JVTAAEKCZFNVCJ-UHFFFAOYSA-M Lactate Chemical compound CC(O)C([O-])=O JVTAAEKCZFNVCJ-UHFFFAOYSA-M 0.000 description 1
- 241000186660 Lactobacillus Species 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- 241000589248 Legionella Species 0.000 description 1
- 208000007764 Legionnaires' Disease Diseases 0.000 description 1
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 1
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 1
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 1
- XIRYQRLFHWWWTC-QEJZJMRPSA-N Leu-Ala-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XIRYQRLFHWWWTC-QEJZJMRPSA-N 0.000 description 1
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 1
- SUPVSFFZWVOEOI-CQDKDKBSSA-N Leu-Ala-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-CQDKDKBSSA-N 0.000 description 1
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- FJUKMPUELVROGK-IHRRRGAJSA-N Leu-Arg-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N FJUKMPUELVROGK-IHRRRGAJSA-N 0.000 description 1
- UILIPCLTHRPCRB-XUXIUFHCSA-N Leu-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(C)C)N UILIPCLTHRPCRB-XUXIUFHCSA-N 0.000 description 1
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 1
- BAJIJEGGUYXZGC-CIUDSAMLSA-N Leu-Asn-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N BAJIJEGGUYXZGC-CIUDSAMLSA-N 0.000 description 1
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 1
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 1
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 1
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 1
- FGNQZXKVAZIMCI-CIUDSAMLSA-N Leu-Asp-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N FGNQZXKVAZIMCI-CIUDSAMLSA-N 0.000 description 1
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 1
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 1
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 1
- IIKJNQWOQIWWMR-CIUDSAMLSA-N Leu-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)N IIKJNQWOQIWWMR-CIUDSAMLSA-N 0.000 description 1
- PPTAQBNUFKTJKA-BJDJZHNGSA-N Leu-Cys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PPTAQBNUFKTJKA-BJDJZHNGSA-N 0.000 description 1
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 1
- DLCXCECTCPKKCD-GUBZILKMSA-N Leu-Gln-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DLCXCECTCPKKCD-GUBZILKMSA-N 0.000 description 1
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 1
- AXZGZMGRBDQTEY-SRVKXCTJSA-N Leu-Gln-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O AXZGZMGRBDQTEY-SRVKXCTJSA-N 0.000 description 1
- GLBNEGIOFRVRHO-JYJNAYRXSA-N Leu-Gln-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLBNEGIOFRVRHO-JYJNAYRXSA-N 0.000 description 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 1
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 1
- VBZOAGIPCULURB-QWRGUYRKSA-N Leu-Gly-His Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VBZOAGIPCULURB-QWRGUYRKSA-N 0.000 description 1
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 1
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 1
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 1
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 1
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 1
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 description 1
- NRFGTHFONZYFNY-MGHWNKPDSA-N Leu-Ile-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NRFGTHFONZYFNY-MGHWNKPDSA-N 0.000 description 1
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 1
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 description 1
- PPQRKXHCLYCBSP-IHRRRGAJSA-N Leu-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N PPQRKXHCLYCBSP-IHRRRGAJSA-N 0.000 description 1
- UBZGNBKMIJHOHL-BZSNNMDCSA-N Leu-Leu-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 UBZGNBKMIJHOHL-BZSNNMDCSA-N 0.000 description 1
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 1
- DCGXHWINSHEPIR-SRVKXCTJSA-N Leu-Lys-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N DCGXHWINSHEPIR-SRVKXCTJSA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 1
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 1
- VVQJGYPTIYOFBR-IHRRRGAJSA-N Leu-Lys-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N VVQJGYPTIYOFBR-IHRRRGAJSA-N 0.000 description 1
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 1
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- KXCMQWMNYQOAKA-SRVKXCTJSA-N Leu-Met-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KXCMQWMNYQOAKA-SRVKXCTJSA-N 0.000 description 1
- DDVHDMSBLRAKNV-IHRRRGAJSA-N Leu-Met-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O DDVHDMSBLRAKNV-IHRRRGAJSA-N 0.000 description 1
- IBSGMIPRBMPMHE-IHRRRGAJSA-N Leu-Met-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O IBSGMIPRBMPMHE-IHRRRGAJSA-N 0.000 description 1
- NJMXCOOEFLMZSR-AVGNSLFASA-N Leu-Met-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O NJMXCOOEFLMZSR-AVGNSLFASA-N 0.000 description 1
- ZDBMWELMUCLUPL-QEJZJMRPSA-N Leu-Phe-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ZDBMWELMUCLUPL-QEJZJMRPSA-N 0.000 description 1
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 1
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 1
- SYRTUBLKWNDSDK-DKIMLUQUSA-N Leu-Phe-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYRTUBLKWNDSDK-DKIMLUQUSA-N 0.000 description 1
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 1
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 1
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 1
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 1
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 1
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 1
- YUTNOGOMBNYPFH-XUXIUFHCSA-N Leu-Pro-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YUTNOGOMBNYPFH-XUXIUFHCSA-N 0.000 description 1
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 1
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 1
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 1
- IDGRADDMTTWOQC-WDSOQIARSA-N Leu-Trp-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IDGRADDMTTWOQC-WDSOQIARSA-N 0.000 description 1
- YWFZWQKWNDOWPA-XIRDDKMYSA-N Leu-Trp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O YWFZWQKWNDOWPA-XIRDDKMYSA-N 0.000 description 1
- SNOUHRPNNCAOPI-SZMVWBNQSA-N Leu-Trp-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N SNOUHRPNNCAOPI-SZMVWBNQSA-N 0.000 description 1
- HQBOMRTVKVKFMN-WDSOQIARSA-N Leu-Trp-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O HQBOMRTVKVKFMN-WDSOQIARSA-N 0.000 description 1
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 description 1
- WUHBLPVELFTPQK-KKUMJFAQSA-N Leu-Tyr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O WUHBLPVELFTPQK-KKUMJFAQSA-N 0.000 description 1
- VHTIZYYHIUHMCA-JYJNAYRXSA-N Leu-Tyr-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VHTIZYYHIUHMCA-JYJNAYRXSA-N 0.000 description 1
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 1
- RDFIVFHPOSOXMW-ACRUOGEOSA-N Leu-Tyr-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RDFIVFHPOSOXMW-ACRUOGEOSA-N 0.000 description 1
- BGGTYDNTOYRTTR-MEYUZBJRSA-N Leu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(C)C)N)O BGGTYDNTOYRTTR-MEYUZBJRSA-N 0.000 description 1
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 1
- FMFNIDICDKEMOE-XUXIUFHCSA-N Leu-Val-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMFNIDICDKEMOE-XUXIUFHCSA-N 0.000 description 1
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 1
- NTXYXFDMIHXTHE-WDSOQIARSA-N Leu-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 NTXYXFDMIHXTHE-WDSOQIARSA-N 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 241000186781 Listeria Species 0.000 description 1
- MPOHDJKRBLVGCT-CIUDSAMLSA-N Lys-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N MPOHDJKRBLVGCT-CIUDSAMLSA-N 0.000 description 1
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 1
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 1
- CLBGMWIYPYAZPR-AVGNSLFASA-N Lys-Arg-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O CLBGMWIYPYAZPR-AVGNSLFASA-N 0.000 description 1
- FUKDBQGFSJUXGX-RWMBFGLXSA-N Lys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)C(=O)O FUKDBQGFSJUXGX-RWMBFGLXSA-N 0.000 description 1
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 1
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 1
- BYPMOIFBQPEWOH-CIUDSAMLSA-N Lys-Asn-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BYPMOIFBQPEWOH-CIUDSAMLSA-N 0.000 description 1
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 1
- MKBIVWXCFINCLE-SRVKXCTJSA-N Lys-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N MKBIVWXCFINCLE-SRVKXCTJSA-N 0.000 description 1
- DEFGUIIUYAUEDU-ZPFDUUQYSA-N Lys-Asn-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DEFGUIIUYAUEDU-ZPFDUUQYSA-N 0.000 description 1
- ZQCVMVCVPFYXHZ-SRVKXCTJSA-N Lys-Asn-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN ZQCVMVCVPFYXHZ-SRVKXCTJSA-N 0.000 description 1
- HGZHSNBZDOLMLH-DCAQKATOSA-N Lys-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N HGZHSNBZDOLMLH-DCAQKATOSA-N 0.000 description 1
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 1
- HIIZIQUUHIXUJY-GUBZILKMSA-N Lys-Asp-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HIIZIQUUHIXUJY-GUBZILKMSA-N 0.000 description 1
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 1
- SSJBMGCZZXCGJJ-DCAQKATOSA-N Lys-Asp-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O SSJBMGCZZXCGJJ-DCAQKATOSA-N 0.000 description 1
- NRQRKMYZONPCTM-CIUDSAMLSA-N Lys-Asp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O NRQRKMYZONPCTM-CIUDSAMLSA-N 0.000 description 1
- WTZUSCUIVPVCRH-SRVKXCTJSA-N Lys-Gln-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WTZUSCUIVPVCRH-SRVKXCTJSA-N 0.000 description 1
- PGBPWPTUOSCNLE-JYJNAYRXSA-N Lys-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N PGBPWPTUOSCNLE-JYJNAYRXSA-N 0.000 description 1
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 1
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 1
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 1
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 1
- PAMDBWYMLWOELY-SDDRHHMPSA-N Lys-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O PAMDBWYMLWOELY-SDDRHHMPSA-N 0.000 description 1
- GHOIOYHDDKXIDX-SZMVWBNQSA-N Lys-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 GHOIOYHDDKXIDX-SZMVWBNQSA-N 0.000 description 1
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 1
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 1
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 1
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 1
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 1
- RFQATBGBLDAKGI-VHSXEESVSA-N Lys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCCN)N)C(=O)O RFQATBGBLDAKGI-VHSXEESVSA-N 0.000 description 1
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 1
- WOEDRPCHKPSFDT-MXAVVETBSA-N Lys-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N WOEDRPCHKPSFDT-MXAVVETBSA-N 0.000 description 1
- OIYWBDBHEGAVST-BZSNNMDCSA-N Lys-His-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OIYWBDBHEGAVST-BZSNNMDCSA-N 0.000 description 1
- MXMDJEJWERYPMO-XUXIUFHCSA-N Lys-Ile-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MXMDJEJWERYPMO-XUXIUFHCSA-N 0.000 description 1
- YWJQHDDBFAXNIR-MXAVVETBSA-N Lys-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N YWJQHDDBFAXNIR-MXAVVETBSA-N 0.000 description 1
- JYXBNQOKPRQNQS-YTFOTSKYSA-N Lys-Ile-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JYXBNQOKPRQNQS-YTFOTSKYSA-N 0.000 description 1
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 1
- CBNMHRCLYBJIIZ-XUXIUFHCSA-N Lys-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCCN)N CBNMHRCLYBJIIZ-XUXIUFHCSA-N 0.000 description 1
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 1
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 1
- PINHPJWGVBKQII-SRVKXCTJSA-N Lys-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N PINHPJWGVBKQII-SRVKXCTJSA-N 0.000 description 1
- QKXZCUCBFPEXNK-KKUMJFAQSA-N Lys-Leu-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 QKXZCUCBFPEXNK-KKUMJFAQSA-N 0.000 description 1
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 1
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 1
- VUTWYNQUSJWBHO-BZSNNMDCSA-N Lys-Leu-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VUTWYNQUSJWBHO-BZSNNMDCSA-N 0.000 description 1
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 1
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 1
- AHFOKDZWPPGJAZ-SRVKXCTJSA-N Lys-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N AHFOKDZWPPGJAZ-SRVKXCTJSA-N 0.000 description 1
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- KJIXWRWPOCKYLD-IHRRRGAJSA-N Lys-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N KJIXWRWPOCKYLD-IHRRRGAJSA-N 0.000 description 1
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 1
- PLDJDCJLRCYPJB-VOAKCMCISA-N Lys-Lys-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PLDJDCJLRCYPJB-VOAKCMCISA-N 0.000 description 1
- URGPVYGVWLIRGT-DCAQKATOSA-N Lys-Met-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O URGPVYGVWLIRGT-DCAQKATOSA-N 0.000 description 1
- ZCWWVXAXWUAEPZ-SRVKXCTJSA-N Lys-Met-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZCWWVXAXWUAEPZ-SRVKXCTJSA-N 0.000 description 1
- JYVCOTWSRGFABJ-DCAQKATOSA-N Lys-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N JYVCOTWSRGFABJ-DCAQKATOSA-N 0.000 description 1
- KFSALEZVQJYHCE-AVGNSLFASA-N Lys-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)N KFSALEZVQJYHCE-AVGNSLFASA-N 0.000 description 1
- ALEVUGKHINJNIF-QEJZJMRPSA-N Lys-Phe-Ala Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ALEVUGKHINJNIF-QEJZJMRPSA-N 0.000 description 1
- PIXVFCBYEGPZPA-JYJNAYRXSA-N Lys-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N PIXVFCBYEGPZPA-JYJNAYRXSA-N 0.000 description 1
- MSSJJDVQTFTLIF-KBPBESRZSA-N Lys-Phe-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O MSSJJDVQTFTLIF-KBPBESRZSA-N 0.000 description 1
- OBZHNHBAAVEWKI-DCAQKATOSA-N Lys-Pro-Asn Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O OBZHNHBAAVEWKI-DCAQKATOSA-N 0.000 description 1
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 description 1
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 1
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 1
- MGKFCQFVPKOWOL-CIUDSAMLSA-N Lys-Ser-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N MGKFCQFVPKOWOL-CIUDSAMLSA-N 0.000 description 1
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 1
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 1
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 1
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 1
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 1
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 1
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 1
- GIKFNMZSGYAPEJ-HJGDQZAQSA-N Lys-Thr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O GIKFNMZSGYAPEJ-HJGDQZAQSA-N 0.000 description 1
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 1
- YFQSSOAGMZGXFT-MEYUZBJRSA-N Lys-Thr-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YFQSSOAGMZGXFT-MEYUZBJRSA-N 0.000 description 1
- KXYLFJIQDIMURW-IHPCNDPISA-N Lys-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CCCCN)=CNC2=C1 KXYLFJIQDIMURW-IHPCNDPISA-N 0.000 description 1
- ZFNYWKHYUMEZDZ-WDSOQIARSA-N Lys-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCCCN)N ZFNYWKHYUMEZDZ-WDSOQIARSA-N 0.000 description 1
- IMDJSVBFQKDDEQ-MGHWNKPDSA-N Lys-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCCN)N IMDJSVBFQKDDEQ-MGHWNKPDSA-N 0.000 description 1
- NQOQDINRVQCAKD-ULQDDVLXSA-N Lys-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCCN)N NQOQDINRVQCAKD-ULQDDVLXSA-N 0.000 description 1
- IEIHKHYMBIYQTH-YESZJQIVSA-N Lys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCCN)N)C(=O)O IEIHKHYMBIYQTH-YESZJQIVSA-N 0.000 description 1
- SQRLLZAQNOQCEG-KKUMJFAQSA-N Lys-Tyr-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 SQRLLZAQNOQCEG-KKUMJFAQSA-N 0.000 description 1
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 description 1
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 1
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 1
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 1
- VWJFOUBDZIUXGA-AVGNSLFASA-N Lys-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCCN)N VWJFOUBDZIUXGA-AVGNSLFASA-N 0.000 description 1
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 1
- XBAJINCXDBTJRH-WDSOQIARSA-N Lys-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N XBAJINCXDBTJRH-WDSOQIARSA-N 0.000 description 1
- 241001472591 Lysinibacillus sp. Species 0.000 description 1
- 241000407429 Maja Species 0.000 description 1
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 1
- 241000604449 Megasphaera Species 0.000 description 1
- 241000970829 Mesorhizobium Species 0.000 description 1
- LMKSBGIUPVRHEH-FXQIFTODSA-N Met-Ala-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(N)=O LMKSBGIUPVRHEH-FXQIFTODSA-N 0.000 description 1
- QRHWTCJBCLGYRB-FXQIFTODSA-N Met-Ala-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O QRHWTCJBCLGYRB-FXQIFTODSA-N 0.000 description 1
- VTKPSXWRUGCOAC-GUBZILKMSA-N Met-Ala-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCSC VTKPSXWRUGCOAC-GUBZILKMSA-N 0.000 description 1
- DLAFCQWUMFMZSN-GUBZILKMSA-N Met-Arg-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N DLAFCQWUMFMZSN-GUBZILKMSA-N 0.000 description 1
- OLWAOWXIADGIJG-AVGNSLFASA-N Met-Arg-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(O)=O OLWAOWXIADGIJG-AVGNSLFASA-N 0.000 description 1
- RJEFZSIVBHGRQJ-SRVKXCTJSA-N Met-Arg-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O RJEFZSIVBHGRQJ-SRVKXCTJSA-N 0.000 description 1
- ACYHZNZHIZWLQF-BQBZGAKWSA-N Met-Asn-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ACYHZNZHIZWLQF-BQBZGAKWSA-N 0.000 description 1
- QXEVZBXTDTVPCP-GMOBBJLQSA-N Met-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCSC)N QXEVZBXTDTVPCP-GMOBBJLQSA-N 0.000 description 1
- YNOVBMBQSQTLFM-DCAQKATOSA-N Met-Asn-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O YNOVBMBQSQTLFM-DCAQKATOSA-N 0.000 description 1
- DRXODWRPPUFIAY-DCAQKATOSA-N Met-Asn-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN DRXODWRPPUFIAY-DCAQKATOSA-N 0.000 description 1
- OHMKUHXCDSCOMT-QXEWZRGKSA-N Met-Asn-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHMKUHXCDSCOMT-QXEWZRGKSA-N 0.000 description 1
- MCNGIXXCMJAURZ-VEVYYDQMSA-N Met-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCSC)N)O MCNGIXXCMJAURZ-VEVYYDQMSA-N 0.000 description 1
- CNUPMMXDISGXMU-CIUDSAMLSA-N Met-Cys-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O CNUPMMXDISGXMU-CIUDSAMLSA-N 0.000 description 1
- AWOMRHGUWFBDNU-ZPFDUUQYSA-N Met-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N AWOMRHGUWFBDNU-ZPFDUUQYSA-N 0.000 description 1
- GPVLSVCBKUCEBI-KKUMJFAQSA-N Met-Gln-Phe Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GPVLSVCBKUCEBI-KKUMJFAQSA-N 0.000 description 1
- PHWSCIFNNLLUFJ-NHCYSSNCSA-N Met-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N PHWSCIFNNLLUFJ-NHCYSSNCSA-N 0.000 description 1
- AETNZPKUUYYYEK-CIUDSAMLSA-N Met-Glu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AETNZPKUUYYYEK-CIUDSAMLSA-N 0.000 description 1
- YORIKIDJCPKBON-YUMQZZPRSA-N Met-Glu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YORIKIDJCPKBON-YUMQZZPRSA-N 0.000 description 1
- PQPMMGQTRQFSDA-SRVKXCTJSA-N Met-Glu-His Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O PQPMMGQTRQFSDA-SRVKXCTJSA-N 0.000 description 1
- HLQWFLJOJRFXHO-CIUDSAMLSA-N Met-Glu-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O HLQWFLJOJRFXHO-CIUDSAMLSA-N 0.000 description 1
- DGNZGCQSVGGYJS-BQBZGAKWSA-N Met-Gly-Asp Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O DGNZGCQSVGGYJS-BQBZGAKWSA-N 0.000 description 1
- YCUSPBPZVJDMII-YUMQZZPRSA-N Met-Gly-Glu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O YCUSPBPZVJDMII-YUMQZZPRSA-N 0.000 description 1
- LQMHZERGCQJKAH-STQMWFEESA-N Met-Gly-Phe Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LQMHZERGCQJKAH-STQMWFEESA-N 0.000 description 1
- SXWQMBGNFXAGAT-FJXKBIBVSA-N Met-Gly-Thr Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SXWQMBGNFXAGAT-FJXKBIBVSA-N 0.000 description 1
- BKIFWLQFOOKUCA-DCAQKATOSA-N Met-His-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N BKIFWLQFOOKUCA-DCAQKATOSA-N 0.000 description 1
- DJBCKVNHEIJLQA-GMOBBJLQSA-N Met-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCSC)N DJBCKVNHEIJLQA-GMOBBJLQSA-N 0.000 description 1
- QGRJTULYDZUBAY-ZPFDUUQYSA-N Met-Ile-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGRJTULYDZUBAY-ZPFDUUQYSA-N 0.000 description 1
- RRIHXWPHQSXHAQ-XUXIUFHCSA-N Met-Ile-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O RRIHXWPHQSXHAQ-XUXIUFHCSA-N 0.000 description 1
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 1
- UROWNMBTQGGTHB-DCAQKATOSA-N Met-Leu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UROWNMBTQGGTHB-DCAQKATOSA-N 0.000 description 1
- HGAJNEWOUHDUMZ-SRVKXCTJSA-N Met-Leu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O HGAJNEWOUHDUMZ-SRVKXCTJSA-N 0.000 description 1
- KMSMNUFBNCHMII-IHRRRGAJSA-N Met-Leu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN KMSMNUFBNCHMII-IHRRRGAJSA-N 0.000 description 1
- LBNFTWKGISQVEE-AVGNSLFASA-N Met-Leu-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCSC LBNFTWKGISQVEE-AVGNSLFASA-N 0.000 description 1
- CHDYFPCQVUOJEB-ULQDDVLXSA-N Met-Leu-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CHDYFPCQVUOJEB-ULQDDVLXSA-N 0.000 description 1
- AWGBEIYZPAXXSX-RWMBFGLXSA-N Met-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N AWGBEIYZPAXXSX-RWMBFGLXSA-N 0.000 description 1
- YYEIFXZOBZVDPH-DCAQKATOSA-N Met-Lys-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O YYEIFXZOBZVDPH-DCAQKATOSA-N 0.000 description 1
- MSSJHBAKDDIRMJ-SRVKXCTJSA-N Met-Lys-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MSSJHBAKDDIRMJ-SRVKXCTJSA-N 0.000 description 1
- UFOWQBYMUILSRK-IHRRRGAJSA-N Met-Lys-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 UFOWQBYMUILSRK-IHRRRGAJSA-N 0.000 description 1
- HOZNVKDCKZPRER-XUXIUFHCSA-N Met-Lys-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HOZNVKDCKZPRER-XUXIUFHCSA-N 0.000 description 1
- HAQLBBVZAGMESV-IHRRRGAJSA-N Met-Lys-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O HAQLBBVZAGMESV-IHRRRGAJSA-N 0.000 description 1
- LCPUWQLULVXROY-RHYQMDGZSA-N Met-Lys-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LCPUWQLULVXROY-RHYQMDGZSA-N 0.000 description 1
- UDOYVQQKQHZYMB-DCAQKATOSA-N Met-Met-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDOYVQQKQHZYMB-DCAQKATOSA-N 0.000 description 1
- CRVSHEPROQHVQT-AVGNSLFASA-N Met-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N CRVSHEPROQHVQT-AVGNSLFASA-N 0.000 description 1
- LNXGEYIEEUZGGH-JYJNAYRXSA-N Met-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)CC1=CC=CC=C1 LNXGEYIEEUZGGH-JYJNAYRXSA-N 0.000 description 1
- IILAGWCGKJSBGB-IHRRRGAJSA-N Met-Phe-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IILAGWCGKJSBGB-IHRRRGAJSA-N 0.000 description 1
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 1
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 1
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 1
- DBMLDOWSVHMQQN-XGEHTFHBSA-N Met-Ser-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DBMLDOWSVHMQQN-XGEHTFHBSA-N 0.000 description 1
- GMMLGMFBYCFCCX-KZVJFYERSA-N Met-Thr-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMMLGMFBYCFCCX-KZVJFYERSA-N 0.000 description 1
- KYXDADPHSNFWQX-VEVYYDQMSA-N Met-Thr-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O KYXDADPHSNFWQX-VEVYYDQMSA-N 0.000 description 1
- QQPMHUCGDRJFQK-RHYQMDGZSA-N Met-Thr-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QQPMHUCGDRJFQK-RHYQMDGZSA-N 0.000 description 1
- NDJSSFWDYDUQID-YTWAJWBKSA-N Met-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N)O NDJSSFWDYDUQID-YTWAJWBKSA-N 0.000 description 1
- XLTSAUGGDYRFLS-UMPQAUOISA-N Met-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCSC)N)O XLTSAUGGDYRFLS-UMPQAUOISA-N 0.000 description 1
- ZBLSZPYQQRIHQU-RCWTZXSCSA-N Met-Thr-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ZBLSZPYQQRIHQU-RCWTZXSCSA-N 0.000 description 1
- YGNUDKAPJARTEM-GUBZILKMSA-N Met-Val-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O YGNUDKAPJARTEM-GUBZILKMSA-N 0.000 description 1
- CQRGINSEMFBACV-WPRPVWTQSA-N Met-Val-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O CQRGINSEMFBACV-WPRPVWTQSA-N 0.000 description 1
- VEKRTVRZDMUOQN-AVGNSLFASA-N Met-Val-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 VEKRTVRZDMUOQN-AVGNSLFASA-N 0.000 description 1
- KPVLLNDCBYXKNV-CYDGBPFRSA-N Met-Val-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KPVLLNDCBYXKNV-CYDGBPFRSA-N 0.000 description 1
- 241000589323 Methylobacterium Species 0.000 description 1
- 241000699666 Mus <mouse, genus> Species 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- 241000186359 Mycobacterium Species 0.000 description 1
- 241000896548 Mycobacterium chelonae group Species 0.000 description 1
- 101710127007 NAD-dependent malic enzyme Proteins 0.000 description 1
- 108010047562 NGR peptide Proteins 0.000 description 1
- 241000588653 Neisseria Species 0.000 description 1
- 108010065395 Neuropep-1 Proteins 0.000 description 1
- 108091092724 Noncoding DNA Proteins 0.000 description 1
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 241000224207 Ornithinibacillus Species 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 241000592795 Paenibacillus sp. Species 0.000 description 1
- 241000287127 Passeridae Species 0.000 description 1
- 241000531155 Pectobacterium Species 0.000 description 1
- 241001112694 Peptococcaceae Species 0.000 description 1
- CYZBFPYMSJGBRL-DRZSPHRISA-N Phe-Ala-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CYZBFPYMSJGBRL-DRZSPHRISA-N 0.000 description 1
- DFEVBOYEUQJGER-JURCDPSOSA-N Phe-Ala-Ile Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O DFEVBOYEUQJGER-JURCDPSOSA-N 0.000 description 1
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 1
- CGOMLCQJEMWMCE-STQMWFEESA-N Phe-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CGOMLCQJEMWMCE-STQMWFEESA-N 0.000 description 1
- LGBVMDMZZFYSFW-HJWJTTGWSA-N Phe-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CC=CC=C1)N LGBVMDMZZFYSFW-HJWJTTGWSA-N 0.000 description 1
- AGYXCMYVTBYGCT-ULQDDVLXSA-N Phe-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O AGYXCMYVTBYGCT-ULQDDVLXSA-N 0.000 description 1
- ZWJKVFAYPLPCQB-UNQGMJICSA-N Phe-Arg-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O ZWJKVFAYPLPCQB-UNQGMJICSA-N 0.000 description 1
- HXSUFWQYLPKEHF-IHRRRGAJSA-N Phe-Asn-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HXSUFWQYLPKEHF-IHRRRGAJSA-N 0.000 description 1
- HTTYNOXBBOWZTB-SRVKXCTJSA-N Phe-Asn-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HTTYNOXBBOWZTB-SRVKXCTJSA-N 0.000 description 1
- HHOOEUSPFGPZFP-QWRGUYRKSA-N Phe-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HHOOEUSPFGPZFP-QWRGUYRKSA-N 0.000 description 1
- KAHUBGWSIQNZQQ-KKUMJFAQSA-N Phe-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KAHUBGWSIQNZQQ-KKUMJFAQSA-N 0.000 description 1
- IUVYJBMTHARMIP-PCBIJLKTSA-N Phe-Asp-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IUVYJBMTHARMIP-PCBIJLKTSA-N 0.000 description 1
- RIYZXJVARWJLKS-KKUMJFAQSA-N Phe-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RIYZXJVARWJLKS-KKUMJFAQSA-N 0.000 description 1
- WIVCOAKLPICYGY-KKUMJFAQSA-N Phe-Asp-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N WIVCOAKLPICYGY-KKUMJFAQSA-N 0.000 description 1
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 description 1
- CPTJPDZTFNKFOU-MXAVVETBSA-N Phe-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N CPTJPDZTFNKFOU-MXAVVETBSA-N 0.000 description 1
- RJYBHZVWJPUSLB-QEWYBTABSA-N Phe-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N RJYBHZVWJPUSLB-QEWYBTABSA-N 0.000 description 1
- MFQXSDWKUXTOPZ-DZKIICNBSA-N Phe-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N MFQXSDWKUXTOPZ-DZKIICNBSA-N 0.000 description 1
- CDQCFGOQNYOICK-IHRRRGAJSA-N Phe-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDQCFGOQNYOICK-IHRRRGAJSA-N 0.000 description 1
- MGECUMGTSHYHEJ-QEWYBTABSA-N Phe-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGECUMGTSHYHEJ-QEWYBTABSA-N 0.000 description 1
- BFYHIHGIHGROAT-HTUGSXCWSA-N Phe-Glu-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFYHIHGIHGROAT-HTUGSXCWSA-N 0.000 description 1
- UAMFZRNCIFFMLE-FHWLQOOXSA-N Phe-Glu-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N UAMFZRNCIFFMLE-FHWLQOOXSA-N 0.000 description 1
- JEBWZLWTRPZQRX-QWRGUYRKSA-N Phe-Gly-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O JEBWZLWTRPZQRX-QWRGUYRKSA-N 0.000 description 1
- YYKZDTVQHTUKDW-RYUDHWBXSA-N Phe-Gly-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N YYKZDTVQHTUKDW-RYUDHWBXSA-N 0.000 description 1
- XEXSSIBQYNKFBX-KBPBESRZSA-N Phe-Gly-His Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CC=CC=C1 XEXSSIBQYNKFBX-KBPBESRZSA-N 0.000 description 1
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 1
- HBGFEEQFVBWYJQ-KBPBESRZSA-N Phe-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HBGFEEQFVBWYJQ-KBPBESRZSA-N 0.000 description 1
- NHCKESBLOMHIIE-IRXDYDNUSA-N Phe-Gly-Phe Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 NHCKESBLOMHIIE-IRXDYDNUSA-N 0.000 description 1
- YZJKNDCEPDDIDA-BZSNNMDCSA-N Phe-His-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CN=CN1 YZJKNDCEPDDIDA-BZSNNMDCSA-N 0.000 description 1
- BEEVXUYVEHXWRQ-YESZJQIVSA-N Phe-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O BEEVXUYVEHXWRQ-YESZJQIVSA-N 0.000 description 1
- ONORAGIFHNAADN-LLLHUVSDSA-N Phe-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N ONORAGIFHNAADN-LLLHUVSDSA-N 0.000 description 1
- BYAIIACBWBOJCU-URLPEUOOSA-N Phe-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BYAIIACBWBOJCU-URLPEUOOSA-N 0.000 description 1
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 1
- RMKGXGPQIPLTFC-KKUMJFAQSA-N Phe-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RMKGXGPQIPLTFC-KKUMJFAQSA-N 0.000 description 1
- MJAYDXWQQUOURZ-JYJNAYRXSA-N Phe-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MJAYDXWQQUOURZ-JYJNAYRXSA-N 0.000 description 1
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 1
- ZUQACJLOHYRVPJ-DKIMLUQUSA-N Phe-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZUQACJLOHYRVPJ-DKIMLUQUSA-N 0.000 description 1
- GPSMLZQVIIYLDK-ULQDDVLXSA-N Phe-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O GPSMLZQVIIYLDK-ULQDDVLXSA-N 0.000 description 1
- SZYBZVANEAOIPE-UBHSHLNASA-N Phe-Met-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O SZYBZVANEAOIPE-UBHSHLNASA-N 0.000 description 1
- IEOHQGFKHXUALJ-JYJNAYRXSA-N Phe-Met-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IEOHQGFKHXUALJ-JYJNAYRXSA-N 0.000 description 1
- VHDNDCPMHQMXIR-IHRRRGAJSA-N Phe-Met-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VHDNDCPMHQMXIR-IHRRRGAJSA-N 0.000 description 1
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 1
- RVEVENLSADZUMS-IHRRRGAJSA-N Phe-Pro-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RVEVENLSADZUMS-IHRRRGAJSA-N 0.000 description 1
- AAERWTUHZKLDLC-IHRRRGAJSA-N Phe-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O AAERWTUHZKLDLC-IHRRRGAJSA-N 0.000 description 1
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 1
- NJJBATPLUQHRBM-IHRRRGAJSA-N Phe-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CO)C(=O)O NJJBATPLUQHRBM-IHRRRGAJSA-N 0.000 description 1
- ODGNUUUDJONJSC-UFYCRDLUSA-N Phe-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O ODGNUUUDJONJSC-UFYCRDLUSA-N 0.000 description 1
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 1
- GMWNQSGWWGKTSF-LFSVMHDDSA-N Phe-Thr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMWNQSGWWGKTSF-LFSVMHDDSA-N 0.000 description 1
- BPIMVBKDLSBKIJ-FCLVOEFKSA-N Phe-Thr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BPIMVBKDLSBKIJ-FCLVOEFKSA-N 0.000 description 1
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 1
- OAAWNUBFRMVIQS-IHPCNDPISA-N Phe-Trp-Cys Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CS)CC1=CNC2=CC=CC=C12)CC1=CC=CC=C1 OAAWNUBFRMVIQS-IHPCNDPISA-N 0.000 description 1
- QTDBZORPVYTRJU-KKXDTOCCSA-N Phe-Tyr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O QTDBZORPVYTRJU-KKXDTOCCSA-N 0.000 description 1
- VFDRDMOMHBJGKD-UFYCRDLUSA-N Phe-Tyr-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N VFDRDMOMHBJGKD-UFYCRDLUSA-N 0.000 description 1
- MMPBPRXOFJNCCN-ZEWNOJEFSA-N Phe-Tyr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MMPBPRXOFJNCCN-ZEWNOJEFSA-N 0.000 description 1
- JSGWNFKWZNPDAV-YDHLFZDLSA-N Phe-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JSGWNFKWZNPDAV-YDHLFZDLSA-N 0.000 description 1
- FXEKNHAJIMHRFJ-ULQDDVLXSA-N Phe-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N FXEKNHAJIMHRFJ-ULQDDVLXSA-N 0.000 description 1
- JTKGCYOOJLUETJ-ULQDDVLXSA-N Phe-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JTKGCYOOJLUETJ-ULQDDVLXSA-N 0.000 description 1
- VIIRRNQMMIHYHQ-XHSDSOJGSA-N Phe-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N VIIRRNQMMIHYHQ-XHSDSOJGSA-N 0.000 description 1
- 108700023175 Phosphate acetyltransferases Proteins 0.000 description 1
- 108090000472 Phosphoenolpyruvate carboxykinase (ATP) Proteins 0.000 description 1
- 102000004035 Phosphoenolpyruvate carboxykinase (ATP) Human genes 0.000 description 1
- 102000011025 Phosphoglycerate Mutase Human genes 0.000 description 1
- 108010038555 Phosphoglycerate dehydrogenase Proteins 0.000 description 1
- 108010022181 Phosphopyruvate Hydratase Proteins 0.000 description 1
- 102000012288 Phosphopyruvate Hydratase Human genes 0.000 description 1
- OAICVXFJPJFONN-UHFFFAOYSA-N Phosphorus Chemical compound [P] OAICVXFJPJFONN-UHFFFAOYSA-N 0.000 description 1
- 102100021762 Phosphoserine phosphatase Human genes 0.000 description 1
- 241000193804 Planococcus <bacterium> Species 0.000 description 1
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 1
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 1
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 1
- CQZNGNCAIXMAIQ-UBHSHLNASA-N Pro-Ala-Phe Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O CQZNGNCAIXMAIQ-UBHSHLNASA-N 0.000 description 1
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 1
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 1
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 1
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 1
- ICTZKEXYDDZZFP-SRVKXCTJSA-N Pro-Arg-Pro Chemical compound N([C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 ICTZKEXYDDZZFP-SRVKXCTJSA-N 0.000 description 1
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 1
- ORPZXBQTEHINPB-SRVKXCTJSA-N Pro-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H]1CCCN1)C(O)=O ORPZXBQTEHINPB-SRVKXCTJSA-N 0.000 description 1
- UVKNEILZSJMKSR-FXQIFTODSA-N Pro-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 UVKNEILZSJMKSR-FXQIFTODSA-N 0.000 description 1
- INXAPZFIOVGHSV-CIUDSAMLSA-N Pro-Asn-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 INXAPZFIOVGHSV-CIUDSAMLSA-N 0.000 description 1
- AMBLXEMWFARNNQ-DCAQKATOSA-N Pro-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 AMBLXEMWFARNNQ-DCAQKATOSA-N 0.000 description 1
- MTHRMUXESFIAMS-DCAQKATOSA-N Pro-Asn-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O MTHRMUXESFIAMS-DCAQKATOSA-N 0.000 description 1
- MLQVJYMFASXBGZ-IHRRRGAJSA-N Pro-Asn-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O MLQVJYMFASXBGZ-IHRRRGAJSA-N 0.000 description 1
- AHXPYZRZRMQOAU-QXEWZRGKSA-N Pro-Asn-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1)C(O)=O AHXPYZRZRMQOAU-QXEWZRGKSA-N 0.000 description 1
- UTAUEDINXUMHLG-FXQIFTODSA-N Pro-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 UTAUEDINXUMHLG-FXQIFTODSA-N 0.000 description 1
- CJZTUKSFZUSNCC-FXQIFTODSA-N Pro-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 CJZTUKSFZUSNCC-FXQIFTODSA-N 0.000 description 1
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 1
- GDXZRWYXJSGWIV-GMOBBJLQSA-N Pro-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 GDXZRWYXJSGWIV-GMOBBJLQSA-N 0.000 description 1
- XKHCJJPNXFBADI-DCAQKATOSA-N Pro-Asp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O XKHCJJPNXFBADI-DCAQKATOSA-N 0.000 description 1
- QVIZLAUEAMQKGS-GUBZILKMSA-N Pro-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 QVIZLAUEAMQKGS-GUBZILKMSA-N 0.000 description 1
- DEDANIDYQAPTFI-IHRRRGAJSA-N Pro-Asp-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DEDANIDYQAPTFI-IHRRRGAJSA-N 0.000 description 1
- FKKHDBFNOLCYQM-FXQIFTODSA-N Pro-Cys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O FKKHDBFNOLCYQM-FXQIFTODSA-N 0.000 description 1
- HJSCRFZVGXAGNG-SRVKXCTJSA-N Pro-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 HJSCRFZVGXAGNG-SRVKXCTJSA-N 0.000 description 1
- LQZZPNDMYNZPFT-KKUMJFAQSA-N Pro-Gln-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LQZZPNDMYNZPFT-KKUMJFAQSA-N 0.000 description 1
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 1
- VPFGPKIWSDVTOY-SRVKXCTJSA-N Pro-Glu-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O VPFGPKIWSDVTOY-SRVKXCTJSA-N 0.000 description 1
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 1
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 1
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 1
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 1
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 1
- WSRWHZRUOCACLJ-UWVGGRQHSA-N Pro-Gly-His Chemical compound C([C@@H](C(=O)O)NC(=O)CNC(=O)[C@H]1NCCC1)C1=CN=CN1 WSRWHZRUOCACLJ-UWVGGRQHSA-N 0.000 description 1
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 1
- FEVDNIBDCRKMER-IUCAKERBSA-N Pro-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEVDNIBDCRKMER-IUCAKERBSA-N 0.000 description 1
- XQSREVQDGCPFRJ-STQMWFEESA-N Pro-Gly-Phe Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XQSREVQDGCPFRJ-STQMWFEESA-N 0.000 description 1
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 1
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 1
- XQHGISDMVBTGAL-ULQDDVLXSA-N Pro-His-Phe Chemical compound C([C@@H](C(=O)[O-])NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H]1[NH2+]CCC1)C1=CC=CC=C1 XQHGISDMVBTGAL-ULQDDVLXSA-N 0.000 description 1
- KWMUAKQOVYCQJQ-ZPFDUUQYSA-N Pro-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 KWMUAKQOVYCQJQ-ZPFDUUQYSA-N 0.000 description 1
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 1
- LXLFEIHKWGHJJB-XUXIUFHCSA-N Pro-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 LXLFEIHKWGHJJB-XUXIUFHCSA-N 0.000 description 1
- BCNRNJWSRFDPTQ-HJWJTTGWSA-N Pro-Ile-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BCNRNJWSRFDPTQ-HJWJTTGWSA-N 0.000 description 1
- RYJRPPUATSKNAY-STECZYCISA-N Pro-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@@H]2CCCN2 RYJRPPUATSKNAY-STECZYCISA-N 0.000 description 1
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 1
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 1
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 1
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 1
- CPRLKHJUFAXVTD-ULQDDVLXSA-N Pro-Leu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CPRLKHJUFAXVTD-ULQDDVLXSA-N 0.000 description 1
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 description 1
- JUJCUYWRJMFJJF-AVGNSLFASA-N Pro-Lys-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 JUJCUYWRJMFJJF-AVGNSLFASA-N 0.000 description 1
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 1
- INDVYIOKMXFQFM-SRVKXCTJSA-N Pro-Lys-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O INDVYIOKMXFQFM-SRVKXCTJSA-N 0.000 description 1
- VWHJZETTZDAGOM-XUXIUFHCSA-N Pro-Lys-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VWHJZETTZDAGOM-XUXIUFHCSA-N 0.000 description 1
- NTXFLJULRHQMDC-GUBZILKMSA-N Pro-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 NTXFLJULRHQMDC-GUBZILKMSA-N 0.000 description 1
- XZBYTHCRAVAXQQ-DCAQKATOSA-N Pro-Met-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O XZBYTHCRAVAXQQ-DCAQKATOSA-N 0.000 description 1
- AWQGDZBKQTYNMN-IHRRRGAJSA-N Pro-Phe-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)O)C(=O)O AWQGDZBKQTYNMN-IHRRRGAJSA-N 0.000 description 1
- MHBSUKYVBZVQRW-HJWJTTGWSA-N Pro-Phe-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MHBSUKYVBZVQRW-HJWJTTGWSA-N 0.000 description 1
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 1
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 1
- AJNGQVUFQUVRQT-JYJNAYRXSA-N Pro-Pro-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 AJNGQVUFQUVRQT-JYJNAYRXSA-N 0.000 description 1
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 1
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- QUBVFEANYYWBTM-VEVYYDQMSA-N Pro-Thr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUBVFEANYYWBTM-VEVYYDQMSA-N 0.000 description 1
- JDJMFMVVJHLWDP-UNQGMJICSA-N Pro-Thr-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JDJMFMVVJHLWDP-UNQGMJICSA-N 0.000 description 1
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 1
- CWZUFLWPEFHWEI-IHRRRGAJSA-N Pro-Tyr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O CWZUFLWPEFHWEI-IHRRRGAJSA-N 0.000 description 1
- DYJTXTCEXMCPBF-UFYCRDLUSA-N Pro-Tyr-Phe Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O DYJTXTCEXMCPBF-UFYCRDLUSA-N 0.000 description 1
- VEUACYMXJKXALX-IHRRRGAJSA-N Pro-Tyr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VEUACYMXJKXALX-IHRRRGAJSA-N 0.000 description 1
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 1
- WWXNZNWZNZPDIF-SRVKXCTJSA-N Pro-Val-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 WWXNZNWZNZPDIF-SRVKXCTJSA-N 0.000 description 1
- OOZJHTXCLJUODH-QXEWZRGKSA-N Pro-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 OOZJHTXCLJUODH-QXEWZRGKSA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- 241000205160 Pyrococcus Species 0.000 description 1
- 108010011939 Pyruvate Decarboxylase Proteins 0.000 description 1
- 102000013009 Pyruvate Kinase Human genes 0.000 description 1
- 108020005115 Pyruvate Kinase Proteins 0.000 description 1
- 108010031852 Pyruvate Synthase Proteins 0.000 description 1
- WHBMMWSBFZVSSR-UHFFFAOYSA-N R3HBA Natural products CC(O)CC(O)=O WHBMMWSBFZVSSR-UHFFFAOYSA-N 0.000 description 1
- 241000232299 Ralstonia Species 0.000 description 1
- 241000700159 Rattus Species 0.000 description 1
- 241000589180 Rhizobium Species 0.000 description 1
- 241000316848 Rhodococcus <scale insect> Species 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- 241000235070 Saccharomyces Species 0.000 description 1
- 240000000111 Saccharum officinarum Species 0.000 description 1
- 235000007201 Saccharum officinarum Nutrition 0.000 description 1
- 241000607142 Salmonella Species 0.000 description 1
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 1
- FIXILCYTSAUERA-FXQIFTODSA-N Ser-Ala-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIXILCYTSAUERA-FXQIFTODSA-N 0.000 description 1
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 1
- BKOKTRCZXRIQPX-ZLUOBGJFSA-N Ser-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N BKOKTRCZXRIQPX-ZLUOBGJFSA-N 0.000 description 1
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 1
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 1
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 1
- VQBLHWSPVYYZTB-DCAQKATOSA-N Ser-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N VQBLHWSPVYYZTB-DCAQKATOSA-N 0.000 description 1
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- RZUOXAKGNHXZTB-GUBZILKMSA-N Ser-Arg-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O RZUOXAKGNHXZTB-GUBZILKMSA-N 0.000 description 1
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 1
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 1
- WXUBSIDKNMFAGS-IHRRRGAJSA-N Ser-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXUBSIDKNMFAGS-IHRRRGAJSA-N 0.000 description 1
- OBXVZEAMXFSGPU-FXQIFTODSA-N Ser-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)CN=C(N)N OBXVZEAMXFSGPU-FXQIFTODSA-N 0.000 description 1
- UBRXAVQWXOWRSJ-ZLUOBGJFSA-N Ser-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)C(=O)N UBRXAVQWXOWRSJ-ZLUOBGJFSA-N 0.000 description 1
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 1
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 1
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 1
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 1
- HEQPKICPPDOSIN-SRVKXCTJSA-N Ser-Asp-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HEQPKICPPDOSIN-SRVKXCTJSA-N 0.000 description 1
- MOVJSUIKUNCVMG-ZLUOBGJFSA-N Ser-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)O MOVJSUIKUNCVMG-ZLUOBGJFSA-N 0.000 description 1
- WKLJLEXEENIYQE-SRVKXCTJSA-N Ser-Cys-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WKLJLEXEENIYQE-SRVKXCTJSA-N 0.000 description 1
- VMVNCJDKFOQOHM-GUBZILKMSA-N Ser-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N VMVNCJDKFOQOHM-GUBZILKMSA-N 0.000 description 1
- VDVYTKZBMFADQH-AVGNSLFASA-N Ser-Gln-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 VDVYTKZBMFADQH-AVGNSLFASA-N 0.000 description 1
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 1
- PVDTYLHUWAEYGY-CIUDSAMLSA-N Ser-Glu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PVDTYLHUWAEYGY-CIUDSAMLSA-N 0.000 description 1
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 1
- GRSLLFZTTLBOQX-CIUDSAMLSA-N Ser-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N GRSLLFZTTLBOQX-CIUDSAMLSA-N 0.000 description 1
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 1
- WBINSDOPZHQPPM-AVGNSLFASA-N Ser-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)O WBINSDOPZHQPPM-AVGNSLFASA-N 0.000 description 1
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 1
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 1
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 1
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 1
- ZFVFHHZBCVNLGD-GUBZILKMSA-N Ser-His-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFVFHHZBCVNLGD-GUBZILKMSA-N 0.000 description 1
- UGHCUDLCCVVIJR-VGDYDELISA-N Ser-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N UGHCUDLCCVVIJR-VGDYDELISA-N 0.000 description 1
- CICQXRWZNVXFCU-SRVKXCTJSA-N Ser-His-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O CICQXRWZNVXFCU-SRVKXCTJSA-N 0.000 description 1
- DJACUBDEDBZKLQ-KBIXCLLPSA-N Ser-Ile-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O DJACUBDEDBZKLQ-KBIXCLLPSA-N 0.000 description 1
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 1
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 1
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 1
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 1
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 1
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 1
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 1
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 1
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 1
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 1
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 1
- OCWWJBZQXGYQCA-DCAQKATOSA-N Ser-Lys-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O OCWWJBZQXGYQCA-DCAQKATOSA-N 0.000 description 1
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 1
- UGTZYIPOBYXWRW-SRVKXCTJSA-N Ser-Phe-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O UGTZYIPOBYXWRW-SRVKXCTJSA-N 0.000 description 1
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 1
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- NVNPWELENFJOHH-CIUDSAMLSA-N Ser-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)N NVNPWELENFJOHH-CIUDSAMLSA-N 0.000 description 1
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 1
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 1
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 1
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 1
- ATEQEHCGZKBEMU-GQGQLFGLSA-N Ser-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N ATEQEHCGZKBEMU-GQGQLFGLSA-N 0.000 description 1
- FVFUOQIYDPAIJR-XIRDDKMYSA-N Ser-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N FVFUOQIYDPAIJR-XIRDDKMYSA-N 0.000 description 1
- XPVIVVLLLOFBRH-XIRDDKMYSA-N Ser-Trp-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H](N)CO)C(O)=O XPVIVVLLLOFBRH-XIRDDKMYSA-N 0.000 description 1
- HAUVENOGHPECML-BPUTZDHNSA-N Ser-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CO)=CNC2=C1 HAUVENOGHPECML-BPUTZDHNSA-N 0.000 description 1
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 1
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 description 1
- VEVYMLNYMULSMS-AVGNSLFASA-N Ser-Tyr-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEVYMLNYMULSMS-AVGNSLFASA-N 0.000 description 1
- PLQWGQUNUPMNOD-KKUMJFAQSA-N Ser-Tyr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PLQWGQUNUPMNOD-KKUMJFAQSA-N 0.000 description 1
- HKHCTNFKZXAMIF-KKUMJFAQSA-N Ser-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=C(O)C=C1 HKHCTNFKZXAMIF-KKUMJFAQSA-N 0.000 description 1
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 1
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 1
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 1
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 description 1
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 1
- 108030000194 Serine 2-dehydrogenases Proteins 0.000 description 1
- 102100021225 Serine hydroxymethyltransferase, cytosolic Human genes 0.000 description 1
- 241000607720 Serratia Species 0.000 description 1
- 241000863430 Shewanella Species 0.000 description 1
- 241000607768 Shigella Species 0.000 description 1
- 241001571329 Solibacillus Species 0.000 description 1
- 241000191940 Staphylococcus Species 0.000 description 1
- 229910000831 Steel Inorganic materials 0.000 description 1
- 241000194017 Streptococcus Species 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- 241001134777 Sulfobacillus Species 0.000 description 1
- 241000580834 Sulfurospirillum Species 0.000 description 1
- 241000282890 Sus Species 0.000 description 1
- 241000207198 Symbiobacterium Species 0.000 description 1
- 241001265507 Thermaerobacter Species 0.000 description 1
- 241000205188 Thermococcus Species 0.000 description 1
- JZRWCGZRTZMZEH-UHFFFAOYSA-N Thiamine Natural products CC1=C(CCO)SC=[N+]1CC1=CN=C(C)N=C1N JZRWCGZRTZMZEH-UHFFFAOYSA-N 0.000 description 1
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 1
- DFTCYYILCSQGIZ-GCJQMDKQSA-N Thr-Ala-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFTCYYILCSQGIZ-GCJQMDKQSA-N 0.000 description 1
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 1
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 1
- DGDCHPCRMWEOJR-FQPOAREZSA-N Thr-Ala-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DGDCHPCRMWEOJR-FQPOAREZSA-N 0.000 description 1
- CAGTXGDOIFXLPC-KZVJFYERSA-N Thr-Arg-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N CAGTXGDOIFXLPC-KZVJFYERSA-N 0.000 description 1
- UKBSDLHIKIXJKH-HJGDQZAQSA-N Thr-Arg-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UKBSDLHIKIXJKH-HJGDQZAQSA-N 0.000 description 1
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 1
- MQBTXMPQNCGSSZ-OSUNSFLBSA-N Thr-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N MQBTXMPQNCGSSZ-OSUNSFLBSA-N 0.000 description 1
- XVNZSJIKGJLQLH-RCWTZXSCSA-N Thr-Arg-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCSC)C(=O)O)N)O XVNZSJIKGJLQLH-RCWTZXSCSA-N 0.000 description 1
- JNQZPAWOPBZGIX-RCWTZXSCSA-N Thr-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N JNQZPAWOPBZGIX-RCWTZXSCSA-N 0.000 description 1
- YLXAMFZYJTZXFH-OLHMAJIHSA-N Thr-Asn-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YLXAMFZYJTZXFH-OLHMAJIHSA-N 0.000 description 1
- CTONFVDJYCAMQM-IUKAMOBKSA-N Thr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H]([C@@H](C)O)N CTONFVDJYCAMQM-IUKAMOBKSA-N 0.000 description 1
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 description 1
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 1
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 1
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 1
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 1
- DCLBXIWHLVEPMQ-JRQIVUDYSA-N Thr-Asp-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DCLBXIWHLVEPMQ-JRQIVUDYSA-N 0.000 description 1
- ODSAPYVQSLDRSR-LKXGYXEUSA-N Thr-Cys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O ODSAPYVQSLDRSR-LKXGYXEUSA-N 0.000 description 1
- DSLHSTIUAPKERR-XGEHTFHBSA-N Thr-Cys-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O DSLHSTIUAPKERR-XGEHTFHBSA-N 0.000 description 1
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 1
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 1
- DIPIPFHFLPTCLK-LOKLDPHHSA-N Thr-Gln-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O DIPIPFHFLPTCLK-LOKLDPHHSA-N 0.000 description 1
- FHDLKMFZKRUQCE-HJGDQZAQSA-N Thr-Glu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHDLKMFZKRUQCE-HJGDQZAQSA-N 0.000 description 1
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 1
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 1
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 1
- KBLYJPQSNGTDIU-LOKLDPHHSA-N Thr-Glu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O KBLYJPQSNGTDIU-LOKLDPHHSA-N 0.000 description 1
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 1
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 1
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 1
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 1
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 1
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 1
- VUSAEKOXGNEYNE-PBCZWWQYSA-N Thr-His-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VUSAEKOXGNEYNE-PBCZWWQYSA-N 0.000 description 1
- SXAGUVRFGJSFKC-ZEILLAHLSA-N Thr-His-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SXAGUVRFGJSFKC-ZEILLAHLSA-N 0.000 description 1
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 1
- JRAUIKJSEAKTGD-TUBUOCAGSA-N Thr-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N JRAUIKJSEAKTGD-TUBUOCAGSA-N 0.000 description 1
- GMXIJHCBTZDAPD-QPHKQPEJSA-N Thr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N GMXIJHCBTZDAPD-QPHKQPEJSA-N 0.000 description 1
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 1
- LCCSEJSPBWKBNT-OSUNSFLBSA-N Thr-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N LCCSEJSPBWKBNT-OSUNSFLBSA-N 0.000 description 1
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 1
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 1
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 1
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 1
- TZJSEJOXAIWOST-RHYQMDGZSA-N Thr-Lys-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N TZJSEJOXAIWOST-RHYQMDGZSA-N 0.000 description 1
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 description 1
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 1
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 1
- DCRHJDRLCFMEBI-RHYQMDGZSA-N Thr-Lys-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O DCRHJDRLCFMEBI-RHYQMDGZSA-N 0.000 description 1
- UJQVSMNQMQHVRY-KZVJFYERSA-N Thr-Met-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UJQVSMNQMQHVRY-KZVJFYERSA-N 0.000 description 1
- PCMDGXKXVMBIFP-VEVYYDQMSA-N Thr-Met-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMDGXKXVMBIFP-VEVYYDQMSA-N 0.000 description 1
- YJVJPJPHHFOVMG-VEVYYDQMSA-N Thr-Met-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YJVJPJPHHFOVMG-VEVYYDQMSA-N 0.000 description 1
- UXUAZXWKIGPUCH-RCWTZXSCSA-N Thr-Met-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O UXUAZXWKIGPUCH-RCWTZXSCSA-N 0.000 description 1
- SIEZEMFJLYRUMK-YTWAJWBKSA-N Thr-Met-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N)O SIEZEMFJLYRUMK-YTWAJWBKSA-N 0.000 description 1
- WVVOFCVMHAXGLE-LFSVMHDDSA-N Thr-Phe-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O WVVOFCVMHAXGLE-LFSVMHDDSA-N 0.000 description 1
- WRQLCVIALDUQEQ-UNQGMJICSA-N Thr-Phe-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WRQLCVIALDUQEQ-UNQGMJICSA-N 0.000 description 1
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 1
- NWECYMJLJGCBOD-UNQGMJICSA-N Thr-Phe-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O NWECYMJLJGCBOD-UNQGMJICSA-N 0.000 description 1
- NYQIZWROIMIQSL-VEVYYDQMSA-N Thr-Pro-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O NYQIZWROIMIQSL-VEVYYDQMSA-N 0.000 description 1
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 1
- JAJOFWABAUKAEJ-QTKMDUPCSA-N Thr-Pro-His Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O JAJOFWABAUKAEJ-QTKMDUPCSA-N 0.000 description 1
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 1
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 1
- GFRIEEKFXOVPIR-RHYQMDGZSA-N Thr-Pro-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O GFRIEEKFXOVPIR-RHYQMDGZSA-N 0.000 description 1
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 1
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 1
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 1
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 1
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 1
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 1
- QJIODPFLAASXJC-JHYOHUSXSA-N Thr-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O QJIODPFLAASXJC-JHYOHUSXSA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- BJJRNAVDQGREGC-HOUAVDHOSA-N Thr-Trp-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O BJJRNAVDQGREGC-HOUAVDHOSA-N 0.000 description 1
- RPECVQBNONKZAT-WZLNRYEVSA-N Thr-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H]([C@@H](C)O)N RPECVQBNONKZAT-WZLNRYEVSA-N 0.000 description 1
- VMSSYINFMOFLJM-KJEVXHAQSA-N Thr-Tyr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCSC)C(=O)O)N)O VMSSYINFMOFLJM-KJEVXHAQSA-N 0.000 description 1
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 1
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 1
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 1
- SBYQHZCMVSPQCS-RCWTZXSCSA-N Thr-Val-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O SBYQHZCMVSPQCS-RCWTZXSCSA-N 0.000 description 1
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 1
- 108020004566 Transfer RNA Proteins 0.000 description 1
- CXUFDWZBHKUGKK-CABZTGNLSA-N Trp-Ala-Gly Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O)=CNC2=C1 CXUFDWZBHKUGKK-CABZTGNLSA-N 0.000 description 1
- NXAPHBHZCMQORW-FDARSICLSA-N Trp-Arg-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NXAPHBHZCMQORW-FDARSICLSA-N 0.000 description 1
- MHNHRNHJMXAVHZ-AAEUAGOBSA-N Trp-Asn-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N MHNHRNHJMXAVHZ-AAEUAGOBSA-N 0.000 description 1
- PTAWAMWPRFTACW-SZMVWBNQSA-N Trp-Gln-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PTAWAMWPRFTACW-SZMVWBNQSA-N 0.000 description 1
- AFSYEUHJBVCPEL-JBACZVJFSA-N Trp-Gln-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 AFSYEUHJBVCPEL-JBACZVJFSA-N 0.000 description 1
- LJCLHMPCYYXVPR-VJBMBRPKSA-N Trp-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N LJCLHMPCYYXVPR-VJBMBRPKSA-N 0.000 description 1
- JLTQXEOXIJMCLZ-ZVZYQTTQSA-N Trp-Gln-Val Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O)=CNC2=C1 JLTQXEOXIJMCLZ-ZVZYQTTQSA-N 0.000 description 1
- VMBBTANKMSRJSS-JSGCOSHPSA-N Trp-Glu-Gly Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VMBBTANKMSRJSS-JSGCOSHPSA-N 0.000 description 1
- UDCHKDYNMRJYMI-QEJZJMRPSA-N Trp-Glu-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UDCHKDYNMRJYMI-QEJZJMRPSA-N 0.000 description 1
- RPVDDQYNBOVWLR-HOCLYGCPSA-N Trp-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RPVDDQYNBOVWLR-HOCLYGCPSA-N 0.000 description 1
- WSGPBCAGEGHKQJ-BBRMVZONSA-N Trp-Gly-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WSGPBCAGEGHKQJ-BBRMVZONSA-N 0.000 description 1
- XLVRTKPAIXJYOH-HOCLYGCPSA-N Trp-His-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)NCC(=O)O)N XLVRTKPAIXJYOH-HOCLYGCPSA-N 0.000 description 1
- GQHAIUPYZPTADF-FDARSICLSA-N Trp-Ile-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 GQHAIUPYZPTADF-FDARSICLSA-N 0.000 description 1
- MKDXQPMIQPTTAW-SIXJUCDHSA-N Trp-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N MKDXQPMIQPTTAW-SIXJUCDHSA-N 0.000 description 1
- AIISTODACBDQLW-WDSOQIARSA-N Trp-Leu-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 AIISTODACBDQLW-WDSOQIARSA-N 0.000 description 1
- YVXIAOOYAKBAAI-SZMVWBNQSA-N Trp-Leu-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 YVXIAOOYAKBAAI-SZMVWBNQSA-N 0.000 description 1
- IQXWAJUIAQLZNX-IHPCNDPISA-N Trp-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N IQXWAJUIAQLZNX-IHPCNDPISA-N 0.000 description 1
- RWAYYYOZMHMEGD-XIRDDKMYSA-N Trp-Leu-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 RWAYYYOZMHMEGD-XIRDDKMYSA-N 0.000 description 1
- OSYOKZZRVGUDMO-HSCHXYMDSA-N Trp-Lys-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OSYOKZZRVGUDMO-HSCHXYMDSA-N 0.000 description 1
- OFTGYORHQMSPAI-PJODQICGSA-N Trp-Met-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O OFTGYORHQMSPAI-PJODQICGSA-N 0.000 description 1
- LFMLXCJYCFZBKE-IHPCNDPISA-N Trp-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N LFMLXCJYCFZBKE-IHPCNDPISA-N 0.000 description 1
- CSOBBJWWODOYGW-ILWGZMRPSA-N Trp-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N)C(=O)O CSOBBJWWODOYGW-ILWGZMRPSA-N 0.000 description 1
- RNDWCRUOGGQDKN-UBHSHLNASA-N Trp-Ser-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RNDWCRUOGGQDKN-UBHSHLNASA-N 0.000 description 1
- UJGDFQRPYGJBEH-AAEUAGOBSA-N Trp-Ser-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N UJGDFQRPYGJBEH-AAEUAGOBSA-N 0.000 description 1
- ARKBYVBCEOWRNR-UBHSHLNASA-N Trp-Ser-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O ARKBYVBCEOWRNR-UBHSHLNASA-N 0.000 description 1
- QHWMVGCEQAPQDK-UMPQAUOISA-N Trp-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O QHWMVGCEQAPQDK-UMPQAUOISA-N 0.000 description 1
- ZZDFLJFVSNQINX-HWHUXHBOSA-N Trp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)O ZZDFLJFVSNQINX-HWHUXHBOSA-N 0.000 description 1
- WNGMGTMSUBARLB-RXVVDRJESA-N Trp-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC=3C4=CC=CC=C4NC=3)N)C(=O)NCC(O)=O)=CNC2=C1 WNGMGTMSUBARLB-RXVVDRJESA-N 0.000 description 1
- XLMDWQNAOKLKCP-XDTLVQLUSA-N Tyr-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XLMDWQNAOKLKCP-XDTLVQLUSA-N 0.000 description 1
- OOEUVMFKKZYSRX-LEWSCRJBSA-N Tyr-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OOEUVMFKKZYSRX-LEWSCRJBSA-N 0.000 description 1
- LGEYOIQBBIPHQN-UWJYBYFXSA-N Tyr-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LGEYOIQBBIPHQN-UWJYBYFXSA-N 0.000 description 1
- NOXKHHXSHQFSGJ-FQPOAREZSA-N Tyr-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NOXKHHXSHQFSGJ-FQPOAREZSA-N 0.000 description 1
- HSVPZJLMPLMPOX-BPNCWPANSA-N Tyr-Arg-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O HSVPZJLMPLMPOX-BPNCWPANSA-N 0.000 description 1
- MICSYKFECRFCTJ-IHRRRGAJSA-N Tyr-Arg-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O MICSYKFECRFCTJ-IHRRRGAJSA-N 0.000 description 1
- HTHCZRWCFXMENJ-KKUMJFAQSA-N Tyr-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HTHCZRWCFXMENJ-KKUMJFAQSA-N 0.000 description 1
- CRWOSTCODDFEKZ-HRCADAONSA-N Tyr-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O CRWOSTCODDFEKZ-HRCADAONSA-N 0.000 description 1
- DYEGCOJHFNJBKB-UFYCRDLUSA-N Tyr-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 DYEGCOJHFNJBKB-UFYCRDLUSA-N 0.000 description 1
- CYDVHRFXDMDMGX-KKUMJFAQSA-N Tyr-Asn-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O CYDVHRFXDMDMGX-KKUMJFAQSA-N 0.000 description 1
- BEIGSKUPTIFYRZ-SRVKXCTJSA-N Tyr-Asp-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O BEIGSKUPTIFYRZ-SRVKXCTJSA-N 0.000 description 1
- JFDGVHXRCKEBAU-KKUMJFAQSA-N Tyr-Asp-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JFDGVHXRCKEBAU-KKUMJFAQSA-N 0.000 description 1
- TZXFLDNBYYGLKA-BZSNNMDCSA-N Tyr-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 TZXFLDNBYYGLKA-BZSNNMDCSA-N 0.000 description 1
- QHEGAOPHISYNDF-XDTLVQLUSA-N Tyr-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHEGAOPHISYNDF-XDTLVQLUSA-N 0.000 description 1
- RYSNTWVRSLCAJZ-RYUDHWBXSA-N Tyr-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RYSNTWVRSLCAJZ-RYUDHWBXSA-N 0.000 description 1
- FXYOYUMPUJONGW-FHWLQOOXSA-N Tyr-Gln-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 FXYOYUMPUJONGW-FHWLQOOXSA-N 0.000 description 1
- WAPFQMXRSDEGOE-IHRRRGAJSA-N Tyr-Glu-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O WAPFQMXRSDEGOE-IHRRRGAJSA-N 0.000 description 1
- WVRUKYLYMFGKAN-IHRRRGAJSA-N Tyr-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 WVRUKYLYMFGKAN-IHRRRGAJSA-N 0.000 description 1
- NZFCWALTLNFHHC-JYJNAYRXSA-N Tyr-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NZFCWALTLNFHHC-JYJNAYRXSA-N 0.000 description 1
- ZRPLVTZTKPPSBT-AVGNSLFASA-N Tyr-Glu-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZRPLVTZTKPPSBT-AVGNSLFASA-N 0.000 description 1
- NJLQMKZSXYQRTO-FHWLQOOXSA-N Tyr-Glu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NJLQMKZSXYQRTO-FHWLQOOXSA-N 0.000 description 1
- HIINQLBHPIQYHN-JTQLQIEISA-N Tyr-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HIINQLBHPIQYHN-JTQLQIEISA-N 0.000 description 1
- IJUTXXAXQODRMW-KBPBESRZSA-N Tyr-Gly-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O IJUTXXAXQODRMW-KBPBESRZSA-N 0.000 description 1
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 1
- NOOMDULIORCDNF-IRXDYDNUSA-N Tyr-Gly-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NOOMDULIORCDNF-IRXDYDNUSA-N 0.000 description 1
- QAYSODICXVZUIA-WLTAIBSBSA-N Tyr-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QAYSODICXVZUIA-WLTAIBSBSA-N 0.000 description 1
- FIRUOPRJKCBLST-KKUMJFAQSA-N Tyr-His-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O FIRUOPRJKCBLST-KKUMJFAQSA-N 0.000 description 1
- WPXKRJVHBXYLDT-JUKXBJQTSA-N Tyr-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N WPXKRJVHBXYLDT-JUKXBJQTSA-N 0.000 description 1
- USYGMBIIUDLYHJ-GVARAGBVSA-N Tyr-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 USYGMBIIUDLYHJ-GVARAGBVSA-N 0.000 description 1
- PJWCWGXAVIVXQC-STECZYCISA-N Tyr-Ile-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PJWCWGXAVIVXQC-STECZYCISA-N 0.000 description 1
- NXRGXTBPMOGFID-CFMVVWHZSA-N Tyr-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O NXRGXTBPMOGFID-CFMVVWHZSA-N 0.000 description 1
- GGXUDPQWAWRINY-XEGUGMAKSA-N Tyr-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GGXUDPQWAWRINY-XEGUGMAKSA-N 0.000 description 1
- HHFMNAVFGBYSAT-IGISWZIWSA-N Tyr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N HHFMNAVFGBYSAT-IGISWZIWSA-N 0.000 description 1
- AZZLDIDWPZLCCW-ZEWNOJEFSA-N Tyr-Ile-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O AZZLDIDWPZLCCW-ZEWNOJEFSA-N 0.000 description 1
- OHOVFPKXPZODHS-SJWGOKEGSA-N Tyr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OHOVFPKXPZODHS-SJWGOKEGSA-N 0.000 description 1
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 1
- GITNQBVCEQBDQC-KKUMJFAQSA-N Tyr-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O GITNQBVCEQBDQC-KKUMJFAQSA-N 0.000 description 1
- VTCKHZJKWQENKX-KBPBESRZSA-N Tyr-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O VTCKHZJKWQENKX-KBPBESRZSA-N 0.000 description 1
- FMXFHNSFABRVFZ-BZSNNMDCSA-N Tyr-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FMXFHNSFABRVFZ-BZSNNMDCSA-N 0.000 description 1
- OFHKXNKJXURPSY-ULQDDVLXSA-N Tyr-Met-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O OFHKXNKJXURPSY-ULQDDVLXSA-N 0.000 description 1
- OGPKMBOPMDTEDM-IHRRRGAJSA-N Tyr-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N OGPKMBOPMDTEDM-IHRRRGAJSA-N 0.000 description 1
- LMKKMCGTDANZTR-BZSNNMDCSA-N Tyr-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LMKKMCGTDANZTR-BZSNNMDCSA-N 0.000 description 1
- NVZVJIUDICCMHZ-BZSNNMDCSA-N Tyr-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O NVZVJIUDICCMHZ-BZSNNMDCSA-N 0.000 description 1
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 1
- VPEFOFYNHBWFNQ-UFYCRDLUSA-N Tyr-Pro-Tyr Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 VPEFOFYNHBWFNQ-UFYCRDLUSA-N 0.000 description 1
- VYQQQIRHIFALGE-UWJYBYFXSA-N Tyr-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VYQQQIRHIFALGE-UWJYBYFXSA-N 0.000 description 1
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 1
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 1
- XUIOBCQESNDTDE-FQPOAREZSA-N Tyr-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XUIOBCQESNDTDE-FQPOAREZSA-N 0.000 description 1
- ITDWWLTTWRRLCC-KJEVXHAQSA-N Tyr-Thr-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ITDWWLTTWRRLCC-KJEVXHAQSA-N 0.000 description 1
- ZZDYJFVIKVSUFA-WLTAIBSBSA-N Tyr-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ZZDYJFVIKVSUFA-WLTAIBSBSA-N 0.000 description 1
- CLEGSEJVGBYZBJ-MEYUZBJRSA-N Tyr-Thr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CLEGSEJVGBYZBJ-MEYUZBJRSA-N 0.000 description 1
- LDKDSFQSEUOCOO-RPTUDFQQSA-N Tyr-Thr-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LDKDSFQSEUOCOO-RPTUDFQQSA-N 0.000 description 1
- AOIZTZRWMSPPAY-KAOXEZKKSA-N Tyr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O AOIZTZRWMSPPAY-KAOXEZKKSA-N 0.000 description 1
- GPLTZEMVOCZVAV-UFYCRDLUSA-N Tyr-Tyr-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 GPLTZEMVOCZVAV-UFYCRDLUSA-N 0.000 description 1
- RMRFSFXLFWWAJZ-HJOGWXRNSA-N Tyr-Tyr-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 RMRFSFXLFWWAJZ-HJOGWXRNSA-N 0.000 description 1
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 1
- PQPWEALFTLKSEB-DZKIICNBSA-N Tyr-Val-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PQPWEALFTLKSEB-DZKIICNBSA-N 0.000 description 1
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 1
- 108010064997 VPY tripeptide Proteins 0.000 description 1
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 1
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 1
- LTFLDDDGWOVIHY-NAKRPEOUSA-N Val-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N LTFLDDDGWOVIHY-NAKRPEOUSA-N 0.000 description 1
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 1
- LABUITCFCAABSV-UHFFFAOYSA-N Val-Ala-Tyr Natural products CC(C)C(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LABUITCFCAABSV-UHFFFAOYSA-N 0.000 description 1
- JIODCDXKCJRMEH-NHCYSSNCSA-N Val-Arg-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N JIODCDXKCJRMEH-NHCYSSNCSA-N 0.000 description 1
- HNWQUBBOBKSFQV-AVGNSLFASA-N Val-Arg-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HNWQUBBOBKSFQV-AVGNSLFASA-N 0.000 description 1
- PFNZJEPSCBAVGX-CYDGBPFRSA-N Val-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N PFNZJEPSCBAVGX-CYDGBPFRSA-N 0.000 description 1
- CVUDMNSZAIZFAE-TUAOUCFPSA-N Val-Arg-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N CVUDMNSZAIZFAE-TUAOUCFPSA-N 0.000 description 1
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 1
- CWOSXNKDOACNJN-BZSNNMDCSA-N Val-Arg-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N CWOSXNKDOACNJN-BZSNNMDCSA-N 0.000 description 1
- WKWJJQZZZBBWKV-JYJNAYRXSA-N Val-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WKWJJQZZZBBWKV-JYJNAYRXSA-N 0.000 description 1
- UDLYXGYWTVOIKU-QXEWZRGKSA-N Val-Asn-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UDLYXGYWTVOIKU-QXEWZRGKSA-N 0.000 description 1
- AUMNPAUHKUNHHN-BYULHYEWSA-N Val-Asn-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N AUMNPAUHKUNHHN-BYULHYEWSA-N 0.000 description 1
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 1
- QGFPYRPIUXBYGR-YDHLFZDLSA-N Val-Asn-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N QGFPYRPIUXBYGR-YDHLFZDLSA-N 0.000 description 1
- IDKGBVZGNTYYCC-QXEWZRGKSA-N Val-Asn-Pro Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(O)=O IDKGBVZGNTYYCC-QXEWZRGKSA-N 0.000 description 1
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 1
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 1
- ZSZFTYVFQLUWBF-QXEWZRGKSA-N Val-Asp-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N ZSZFTYVFQLUWBF-QXEWZRGKSA-N 0.000 description 1
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 1
- XKVXSCHXGJOQND-ZOBUZTSGSA-N Val-Asp-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N XKVXSCHXGJOQND-ZOBUZTSGSA-N 0.000 description 1
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 1
- WBUOKGBHGDPYMH-GUBZILKMSA-N Val-Cys-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)C(C)C WBUOKGBHGDPYMH-GUBZILKMSA-N 0.000 description 1
- IRLYZKKNBFPQBW-XGEHTFHBSA-N Val-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N)O IRLYZKKNBFPQBW-XGEHTFHBSA-N 0.000 description 1
- OUUBKKIJQIAPRI-LAEOZQHASA-N Val-Gln-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OUUBKKIJQIAPRI-LAEOZQHASA-N 0.000 description 1
- XEYUMGGWQCIWAR-XVKPBYJWSA-N Val-Gln-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N XEYUMGGWQCIWAR-XVKPBYJWSA-N 0.000 description 1
- PWRITNSESKQTPW-NRPADANISA-N Val-Gln-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N PWRITNSESKQTPW-NRPADANISA-N 0.000 description 1
- UZDHNIJRRTUKKC-DLOVCJGASA-N Val-Gln-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UZDHNIJRRTUKKC-DLOVCJGASA-N 0.000 description 1
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 1
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 1
- YDPFWRVQHFWBKI-GVXVVHGQSA-N Val-Glu-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YDPFWRVQHFWBKI-GVXVVHGQSA-N 0.000 description 1
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 1
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 1
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 1
- OXGVAUFVTOPFFA-XPUUQOCRSA-N Val-Gly-Cys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N OXGVAUFVTOPFFA-XPUUQOCRSA-N 0.000 description 1
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 1
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 1
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 1
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 1
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 1
- KVRLNEILGGVBJX-IHRRRGAJSA-N Val-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CN=CN1 KVRLNEILGGVBJX-IHRRRGAJSA-N 0.000 description 1
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 1
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 1
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 1
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 1
- FTKXYXACXYOHND-XUXIUFHCSA-N Val-Ile-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O FTKXYXACXYOHND-XUXIUFHCSA-N 0.000 description 1
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 1
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 1
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 1
- RFKJNTRMXGCKFE-FHWLQOOXSA-N Val-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC(C)C)C(O)=O)=CNC2=C1 RFKJNTRMXGCKFE-FHWLQOOXSA-N 0.000 description 1
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 1
- GVJUTBOZZBTBIG-AVGNSLFASA-N Val-Lys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N GVJUTBOZZBTBIG-AVGNSLFASA-N 0.000 description 1
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 1
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 1
- OJOMXGVLFKYDKP-QXEWZRGKSA-N Val-Met-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OJOMXGVLFKYDKP-QXEWZRGKSA-N 0.000 description 1
- JVGHIFMSFBZDHH-WPRPVWTQSA-N Val-Met-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N JVGHIFMSFBZDHH-WPRPVWTQSA-N 0.000 description 1
- MGVYZTPLGXPVQB-CYDGBPFRSA-N Val-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MGVYZTPLGXPVQB-CYDGBPFRSA-N 0.000 description 1
- UEPLNXPLHJUYPT-AVGNSLFASA-N Val-Met-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O UEPLNXPLHJUYPT-AVGNSLFASA-N 0.000 description 1
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 1
- UZFNHAXYMICTBU-DZKIICNBSA-N Val-Phe-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UZFNHAXYMICTBU-DZKIICNBSA-N 0.000 description 1
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 1
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 1
- FMQGYTMERWBMSI-HJWJTTGWSA-N Val-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N FMQGYTMERWBMSI-HJWJTTGWSA-N 0.000 description 1
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 1
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 1
- JMCOXFSCTGKLLB-FKBYEOEOSA-N Val-Phe-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N JMCOXFSCTGKLLB-FKBYEOEOSA-N 0.000 description 1
- MJOUSKQHAIARKI-JYJNAYRXSA-N Val-Phe-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 MJOUSKQHAIARKI-JYJNAYRXSA-N 0.000 description 1
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 1
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 description 1
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 1
- QIVPZSWBBHRNBA-JYJNAYRXSA-N Val-Pro-Phe Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O QIVPZSWBBHRNBA-JYJNAYRXSA-N 0.000 description 1
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 1
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 1
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 1
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 1
- TVGWMCTYUFBXAP-QTKMDUPCSA-N Val-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N)O TVGWMCTYUFBXAP-QTKMDUPCSA-N 0.000 description 1
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 1
- DVLWZWNAQUBZBC-ZNSHCXBVSA-N Val-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N)O DVLWZWNAQUBZBC-ZNSHCXBVSA-N 0.000 description 1
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 1
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 1
- ZLMFVXMJFIWIRE-FHWLQOOXSA-N Val-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](C(C)C)N ZLMFVXMJFIWIRE-FHWLQOOXSA-N 0.000 description 1
- JPBGMZDTPVGGMQ-ULQDDVLXSA-N Val-Tyr-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N JPBGMZDTPVGGMQ-ULQDDVLXSA-N 0.000 description 1
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 1
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 1
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- 241000607598 Vibrio Species 0.000 description 1
- 241001659629 Virgibacillus Species 0.000 description 1
- 241000589634 Xanthomonas Species 0.000 description 1
- 241000269370 Xenopus <genus> Species 0.000 description 1
- 241000607734 Yersinia <bacteria> Species 0.000 description 1
- 235000007244 Zea mays Nutrition 0.000 description 1
- 108010081404 acein-2 Proteins 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 239000003463 adsorbent Substances 0.000 description 1
- 108010045023 alanyl-prolyl-tyrosine Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- 108010058033 alcohol dehydrogenase (NADP+) Proteins 0.000 description 1
- PYMYPHUHKUWMLA-VPENINKCSA-N aldehydo-D-xylose Chemical compound OC[C@@H](O)[C@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-VPENINKCSA-N 0.000 description 1
- 229930094280 aldehydo-D-xylose Natural products 0.000 description 1
- WQZGKKKJIJFFOK-PHYPRBDBSA-N alpha-D-galactose Chemical compound OC[C@H]1O[C@H](O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-PHYPRBDBSA-N 0.000 description 1
- 229910021529 ammonia Inorganic materials 0.000 description 1
- 230000003698 anagen phase Effects 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 239000012736 aqueous medium Substances 0.000 description 1
- PYMYPHUHKUWMLA-WDCZJNDASA-N arabinose Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)C=O PYMYPHUHKUWMLA-WDCZJNDASA-N 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 1
- 108010024668 arginyl-glutamyl-aspartyl-valine Proteins 0.000 description 1
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 1
- 108010089442 arginyl-leucyl-alanyl-arginine Proteins 0.000 description 1
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 210000004507 artificial chromosome Anatomy 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 239000012298 atmosphere Substances 0.000 description 1
- 238000010533 azeotropic distillation Methods 0.000 description 1
- 235000021028 berry Nutrition 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- GUBGYTABKSRVRQ-QUYVBRFLSA-N beta-maltose Chemical compound OC[C@H]1O[C@H](O[C@H]2[C@H](O)[C@@H](O)[C@H](O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@@H]1O GUBGYTABKSRVRQ-QUYVBRFLSA-N 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- 238000009835 boiling Methods 0.000 description 1
- 238000006664 bond formation reaction Methods 0.000 description 1
- RFAZFSACZIVZDV-UHFFFAOYSA-N butan-2-one Chemical compound CCC(C)=O.CCC(C)=O RFAZFSACZIVZDV-UHFFFAOYSA-N 0.000 description 1
- IAQRGUVFOMOMEM-UHFFFAOYSA-N butene Natural products CC=CC IAQRGUVFOMOMEM-UHFFFAOYSA-N 0.000 description 1
- KVNRLNFWIYMESJ-UHFFFAOYSA-N butyronitrile Chemical compound CCCC#N KVNRLNFWIYMESJ-UHFFFAOYSA-N 0.000 description 1
- 239000006229 carbon black Substances 0.000 description 1
- 239000001569 carbon dioxide Substances 0.000 description 1
- 229910002092 carbon dioxide Inorganic materials 0.000 description 1
- CURLTUGMZLYLDI-UHFFFAOYSA-N carbon dioxide Natural products O=C=O CURLTUGMZLYLDI-UHFFFAOYSA-N 0.000 description 1
- 150000001735 carboxylic acids Chemical class 0.000 description 1
- 239000012159 carrier gas Substances 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 239000012459 cleaning agent Substances 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 239000003184 complementary RNA Substances 0.000 description 1
- 239000008139 complexing agent Substances 0.000 description 1
- 238000005094 computer simulation Methods 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 230000034994 death Effects 0.000 description 1
- 230000000593 degrading effect Effects 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 235000014113 dietary fatty acids Nutrition 0.000 description 1
- 238000007865 diluting Methods 0.000 description 1
- LDCRTTXIJACKKU-ARJAWSKDSA-N dimethyl maleate Chemical compound COC(=O)\C=C/C(=O)OC LDCRTTXIJACKKU-ARJAWSKDSA-N 0.000 description 1
- 239000000428 dust Substances 0.000 description 1
- 238000004043 dyeing Methods 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 230000006862 enzymatic digestion Effects 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 238000001704 evaporation Methods 0.000 description 1
- 230000008020 evaporation Effects 0.000 description 1
- 229930195729 fatty acid Natural products 0.000 description 1
- 239000000194 fatty acid Substances 0.000 description 1
- 150000004665 fatty acids Chemical class 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000004907 flux Effects 0.000 description 1
- 239000011888 foil Substances 0.000 description 1
- 239000013505 freshwater Substances 0.000 description 1
- 239000000446 fuel Substances 0.000 description 1
- 229930182830 galactose Natural products 0.000 description 1
- 230000004077 genetic alteration Effects 0.000 description 1
- 231100000118 genetic alteration Toxicity 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 229940050410 gluconate Drugs 0.000 description 1
- 230000004110 gluconeogenesis Effects 0.000 description 1
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 1
- 108010083145 glycine-oxaloacetate aminotransferase Proteins 0.000 description 1
- 150000002334 glycols Chemical class 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 1
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 1
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 1
- 239000001307 helium Substances 0.000 description 1
- 229910052734 helium Inorganic materials 0.000 description 1
- SWQJXJOGLNCZEY-UHFFFAOYSA-N helium atom Chemical compound [He] SWQJXJOGLNCZEY-UHFFFAOYSA-N 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 108010050343 histidyl-alanyl-glutamine Proteins 0.000 description 1
- 108010045383 histidyl-glycyl-glutamic acid Proteins 0.000 description 1
- 229930195733 hydrocarbon Natural products 0.000 description 1
- 150000002430 hydrocarbons Chemical class 0.000 description 1
- 239000012535 impurity Substances 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 238000011081 inoculation Methods 0.000 description 1
- 239000002054 inoculum Substances 0.000 description 1
- 229910052500 inorganic mineral Inorganic materials 0.000 description 1
- 238000005342 ion exchange Methods 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 125000001449 isopropyl group Chemical group [H]C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 1
- 229940097413 isopropyl maleate Drugs 0.000 description 1
- 239000004310 lactic acid Substances 0.000 description 1
- 235000014655 lactic acid Nutrition 0.000 description 1
- 229940039696 lactobacillus Drugs 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 1
- 239000012978 lignocellulosic material Substances 0.000 description 1
- 230000029226 lipidation Effects 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 238000000622 liquid--liquid extraction Methods 0.000 description 1
- 108010056787 lysyl-arginyl-glutamyl-glutamic acid Proteins 0.000 description 1
- 108010089256 lysyl-aspartyl-glutamyl-leucine Proteins 0.000 description 1
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010076718 lysyl-glutamyl-tryptophan Proteins 0.000 description 1
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 238000006241 metabolic reaction Methods 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 108700023046 methionyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 239000011707 mineral Substances 0.000 description 1
- DNIAPMSPPWPWGF-UHFFFAOYSA-N monopropylene glycol Natural products CC(O)CO DNIAPMSPPWPWGF-UHFFFAOYSA-N 0.000 description 1
- 239000010813 municipal solid waste Substances 0.000 description 1
- 239000003345 natural gas Substances 0.000 description 1
- 239000002853 nucleic acid probe Substances 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 125000004430 oxygen atom Chemical group O* 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 150000002972 pentoses Chemical class 0.000 description 1
- 239000000816 peptidomimetic Substances 0.000 description 1
- 239000000575 pesticide Substances 0.000 description 1
- 238000005504 petroleum refining Methods 0.000 description 1
- 108010025593 phenylalanine (histidine) aminotransferase Proteins 0.000 description 1
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 1
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 1
- 108010089198 phenylalanyl-prolyl-arginine Proteins 0.000 description 1
- 108010084572 phenylalanyl-valine Proteins 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 229910052698 phosphorus Inorganic materials 0.000 description 1
- 239000011574 phosphorus Substances 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 108010088694 phosphoserine aminotransferase Proteins 0.000 description 1
- 102000030592 phosphoserine aminotransferase Human genes 0.000 description 1
- 108010076573 phosphoserine phosphatase Proteins 0.000 description 1
- 230000004962 physiological condition Effects 0.000 description 1
- 108010025488 pinealon Proteins 0.000 description 1
- 238000006116 polymerization reaction Methods 0.000 description 1
- 244000144977 poultry Species 0.000 description 1
- 230000019525 primary metabolic process Effects 0.000 description 1
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 235000013772 propylene glycol Nutrition 0.000 description 1
- 108040006686 pyruvate synthase activity proteins Proteins 0.000 description 1
- 238000002708 random mutagenesis Methods 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000002407 reforming Methods 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 108020004418 ribosomal RNA Proteins 0.000 description 1
- 108091092562 ribozyme Proteins 0.000 description 1
- 238000007790 scraping Methods 0.000 description 1
- 239000013049 sediment Substances 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000002910 solid waste Substances 0.000 description 1
- 238000000638 solvent extraction Methods 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 239000004071 soot Substances 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 239000010959 steel Substances 0.000 description 1
- 238000003756 stirring Methods 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 125000000446 sulfanediyl group Chemical group *S* 0.000 description 1
- 239000013501 sustainable material Substances 0.000 description 1
- 235000019157 thiamine Nutrition 0.000 description 1
- KYMBYSLLVAOCFI-UHFFFAOYSA-N thiamine Chemical compound CC1=C(CCO)SCN1CC1=CN=C(C)N=C1N KYMBYSLLVAOCFI-UHFFFAOYSA-N 0.000 description 1
- 229960003495 thiamine Drugs 0.000 description 1
- 239000011721 thiamine Substances 0.000 description 1
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 1
- 239000003053 toxin Substances 0.000 description 1
- 231100000765 toxin Toxicity 0.000 description 1
- 108700012359 toxins Proteins 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 238000011282 treatment Methods 0.000 description 1
- 230000001228 trophic effect Effects 0.000 description 1
- 108010044292 tryptophyltyrosine Proteins 0.000 description 1
- 108010005834 tyrosyl-alanyl-glycine Proteins 0.000 description 1
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 238000005292 vacuum distillation Methods 0.000 description 1
- FGQOOHJZONJGDT-UHFFFAOYSA-N vanillin Natural products COC1=CC(O)=CC(C=O)=C1 FGQOOHJZONJGDT-UHFFFAOYSA-N 0.000 description 1
- MWOOGOJBHIARFG-UHFFFAOYSA-N vanillin Chemical compound COC1=CC(C=O)=CC=C1O MWOOGOJBHIARFG-UHFFFAOYSA-N 0.000 description 1
- 235000012141 vanillin Nutrition 0.000 description 1
- 230000035899 viability Effects 0.000 description 1
- 235000019156 vitamin B Nutrition 0.000 description 1
- 239000011720 vitamin B Substances 0.000 description 1
- 239000002699 waste material Substances 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/02—Preparation of oxygen-containing organic compounds containing a hydroxy group
- C12P7/04—Preparation of oxygen-containing organic compounds containing a hydroxy group acyclic
- C12P7/18—Preparation of oxygen-containing organic compounds containing a hydroxy group acyclic polyhydric
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N1/00—Microorganisms, e.g. protozoa; Compositions thereof; Processes of propagating, maintaining or preserving microorganisms or compositions thereof; Processes of preparing or isolating a composition containing a microorganism; Culture media therefor
- C12N1/20—Bacteria; Culture media therefor
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/52—Genes encoding for enzymes or proenzymes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0008—Oxidoreductases (1.) acting on the aldehyde or oxo group of donors (1.2)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0012—Oxidoreductases (1.) acting on nitrogen containing compounds as donors (1.4, 1.5, 1.6, 1.7)
- C12N9/0014—Oxidoreductases (1.) acting on nitrogen containing compounds as donors (1.4, 1.5, 1.6, 1.7) acting on the CH-NH2 group of donors (1.4)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1025—Acyltransferases (2.3)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1096—Transferases (2.) transferring nitrogenous groups (2.6)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/88—Lyases (4.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/02—Preparation of oxygen-containing organic compounds containing a hydroxy group
- C12P7/04—Preparation of oxygen-containing organic compounds containing a hydroxy group acyclic
- C12P7/06—Ethanol, i.e. non-beverage
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/40—Preparation of oxygen-containing organic compounds containing a carboxyl group including Peroxycarboxylic acids
- C12P7/42—Hydroxy-carboxylic acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/40—Preparation of oxygen-containing organic compounds containing a carboxyl group including Peroxycarboxylic acids
- C12P7/44—Polycarboxylic acids
- C12P7/46—Dicarboxylic acids having four or less carbon atoms, e.g. fumaric acid, maleic acid
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y102/00—Oxidoreductases acting on the aldehyde or oxo group of donors (1.2)
- C12Y102/01—Oxidoreductases acting on the aldehyde or oxo group of donors (1.2) with NAD+ or NADP+ as acceptor (1.2.1)
- C12Y102/01003—Aldehyde dehydrogenase (NAD+) (1.2.1.3)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y102/00—Oxidoreductases acting on the aldehyde or oxo group of donors (1.2)
- C12Y102/01—Oxidoreductases acting on the aldehyde or oxo group of donors (1.2) with NAD+ or NADP+ as acceptor (1.2.1)
- C12Y102/01021—Glycolaldehyde dehydrogenase (1.2.1.21)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y203/00—Acyltransferases (2.3)
- C12Y203/03—Acyl groups converted into alkyl on transfer (2.3.3)
- C12Y203/03001—Citrate (Si)-synthase (2.3.3.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y206/00—Transferases transferring nitrogenous groups (2.6)
- C12Y206/01—Transaminases (2.6.1)
- C12Y206/01044—Alanine--glyoxylate transaminase (2.6.1.44)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y401/00—Carbon-carbon lyases (4.1)
- C12Y401/03—Oxo-acid-lyases (4.1.3)
- C12Y401/03001—Isocitrate lyase (4.1.3.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/22—Vectors comprising a coding region that has been codon optimised for expression in a respective host
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02E—REDUCTION OF GREENHOUSE GAS [GHG] EMISSIONS, RELATED TO ENERGY GENERATION, TRANSMISSION OR DISTRIBUTION
- Y02E50/00—Technologies for the production of fuel of non-fossil origin
- Y02E50/10—Biofuels, e.g. bio-diesel
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02E—REDUCTION OF GREENHOUSE GAS [GHG] EMISSIONS, RELATED TO ENERGY GENERATION, TRANSMISSION OR DISTRIBUTION
- Y02E50/00—Technologies for the production of fuel of non-fossil origin
- Y02E50/30—Fuel from waste, e.g. synthetic alcohol or diesel
Abstract
본 발명은 에틸렌 글리콜 및 에틸렌 글리콜 전구체의 생물학적 생성을 위한 유전자 조작된 미생물 및 방법을 제공한다. 특히, 본 발명의 미생물은 5,10-메틸렌테트라하이드로폴레이트, 옥살로아세테이트, 시트레이트, 말레이트 및 글리신 중 하나 이상을 통해 에틸렌 글리콜 또는 에틸렌 글리콜 전구체를 생성한다. 나아가, 본 발명은 에틸렌 글리콜을 포함하는 조성물 또는 에틸렌 글리콜의 중합체, 예컨대 폴리에틸렌 테레프탈레이트를 제공한다.
Description
본 발명은 미생물 발효에 의한, 특히 가스성(gaseous) 기질의 미생물 발효에 의한 에틸렌 글리콜 및 에틸렌 글리콜 전구체의 생성을 위한 유전자 조작된 미생물 및 방법에 관한 것이다.
모노에틸렌 글리콜(MEG)로도 알려진 에틸렌 글리콜은 미국에서 현재 330억 달러가 넘는 시장 가치를 가지고, 매우 다양한 산업적, 의료적 및 소비자 제품의 중요한 구성성분이다. 에틸렌 글리콜은 현재, 다량의 에너지 및 물을 필요로 하며 많은 바람직하지 못한 부산물을 발생시키고 석유화학 공급 원료에 의존하는 화학적 촉매작용 과정을 사용하여 생성된다. 지속 가능한 물질에 대한 요구는 일부 기술적 진보, 예컨대 사탕수수 유래의 에탄올로부터 에틸렌 글리콜의 촉매작용적 생성을 이끌었다.
에틸렌 글리콜 전구체는 또한, 상업적으로 가치가 있다. 예를 들어, 글리콜레이트는 피부 관리, 개인 생활(personal care), 염색, 태닝에서, 그리고 세정제로서 사용된다. 글리옥실레이트는 바닐린, 농약, 항생제, 알란토인 및 착화제(complexing agent)에 대한 중간산물이다.
그러나, 에틸렌 글리콜을 생물학적으로 생성할 수 있는 미생물은 알려져 있지 않고, 에틸렌 글리콜을 생성하기 위한 어떠한 완전한 생물학적 경로도 잘 확립되어 있지 않았다. 에틸렌 글리콜로의 일부 생물학적 경로는 문헌에서 당(sugar)으로부터 기재되어 있다. 예를 들어, 문헌[Alkim et al., Microb Cell Fact, 14: 127, 2015]는 이. 콜라이(E. coli)에서 (D)-자일로스로부터의 에틸렌 글리콜 생성을 실증하였으나, 높은 수율을 달성하기 위해서는 호기성 조건이 필요하였음을 주지하였다. 유사하게는, 문헌[Pereira et al., Metab Eng, 34: 80-87, 2016]은 이. 콜라이에서 펜토스로부터의 에틸렌 글리콜 생성을 달성하였다. 펜토스로부터의 에틸렌 글리콜 생성에 대한 소수의 연구가 또한 에스. 세레비지애(S. cerevisiae)에서 수행되긴 했지만, 일관되지 않은 결과를 보여주었다. 예를 들어, 문헌[Uranukul et al., Metab Eng, 51: 20-31, 2018]을 참조한다.
가스 발효는 화학물질 및 연료로의 광범위한, 쉽게 입수 가능하며 저렴한 C1 공급 원료, 예컨대 산업적 폐가스(waste gas), 합성 가스(syngas) 또는 개질된(reformed) 메탄을 사용하는 경로를 제공한다. 가스 발효 대사가 당-발효 대사와 상당히 다르고, 상기 언급된 경로는 포도당신생합성(gluconeogenesis), 에너지 음성(negative) 과정을 통한 가스로부터 당 전구체의 생성을 필요로 하기 때문에 이들 경로의 사용은 실용적이지 않을 것이다. 현재까지, 가스성 기질로부터 에틸렌 글리콜을 생성하기 위해 이용가능한 경로는 없다.
탐구적 활동에서, 문헌[Islam et al., Metab Eng, 41: 173-181, 2017]은 화학정보학 툴(cheminformatics tool)을 사용하여 엠. 테르모아세티아(M. thermoacetia)에서 합성 가스로부터 에틸렌 글리콜을 생성하는 수백 개의 가상 경로를 예측하였다. 그러나, 많은 경로가 열역학적 또는 다른 제약으로 인해 실현 불가능하므로, 당업자라도 이들 경로를 가스 발효 유기체에 혼입하는 것은 가능하지 않다. 예를 들어, 거의 2,000 가지의 산소 또는 산소 라디칼-의존적 반응이 Islam 등에 포함되었으며, 이들은 엄격하게 혐기성인 시스템에서는 실현 가능하지 않을 것이다. 기지의(known) 반응을 갖는, Islam 등에 의해 유일하게 식별된 가상 경로는 포도당신생합성 또는 중간산물로서 에탄올을 필요로 한다. 따라서, 가스성 기질로부터 고수율의 에틸렌 글리콜 및 에틸렌 글리콜 전구체를 생성할 수 있는 유효하며(validated) 에너지적으로 선호할 만한 재조합형 생성 시스템에 대한 필요성이 남아 있다.
상기 배경 기술에 대하여 본 발명은 종래 기술에 비해 특정 장점 및 진보를 제공한다.
본원에 개시된 본 발명이 구체적인 이점 또는 작용성으로 제한되진 않지만, 본 발명은 가스성 기질로부터 에틸렌 글리콜 또는 에틸렌 글리콜 전구체를 생성할 수 있는 유전자 조작된 미생물을 제공한다.
본원에 개시된 미생물의 일부 양태에서, 상기 미생물은 5,10-메틸렌테트라하이드로폴레이트, 옥살로아세테이트, 시트레이트, 말레이트 및 글리신으로 이루어진 군으로부터 선택되는 하나 이상의 중간산물을 통해 에틸렌 글리콜 또는 에틸렌 글리콜 전구체를 생성한다.
본원에 개시된 미생물의 일부 양태에서, 상기 미생물은 옥살로아세테이트를 시트레이트로 전환할 수 있는 이종성 효소, 글리신을 글리옥실레이트로 전환할 수 있는 이종성 효소, 이소-시트레이트를 글리옥실레이트로 전환할 수 있는 이종성 효소, 및 글리콜레이트를 글리코알데하이드로 전환할 수 있는 이종성 효소 중 하나 이상을 포함한다.
본원에 개시된 미생물의 일부 양태에서, 옥살로아세테이트를 시트레이트로 전환할 수 있는 이종성 효소는 시트레이트 [Si]-신타제(synthase)[2.3.3.1], ATP 시트레이트 신타제[2.3.3.8]; 또는 시트레이트(Re)-신타제[2.3.3.3]이며; 글리신을 글리옥실레이트로 전환할 수 있는 이종성 효소는 알라닌-글리옥실레이트 트랜스아미나제[2.6.1.44], 세린-글리옥실레이트 트랜스아미나제[2.6.1.45], 세린-피루베이트 트랜스아미나제[2.6.1.51], 글리신-옥살로아세테이트 트랜스아미나제[2.6.1.35], 글리신 트랜스아미나제[2.6.1.4], 글리신 데하이드로게나제[1.4.1.10], 알라닌 데하이드로게나제[1.4.1.1] 또는 글리신 데하이드로게나제[1.4.2.1]이며; 이소-시트레이트를 글리옥실레이트로 전환할 수 있는 이종성 효소는 이소시트레이트 리아제(lyase)[4.1.3.1]이며; 및/또는 글리콜레이트를 글리코알데하이드로 전환할 수 있는 이종성 효소는 글리콜알데하이드 데하이드로게나제[1.2.1.21], 락트알데하이드 데하이드로게나제[1.2.1.22], 숙시네이트-세미알데하이드 데하이드로게나제[1.2.1.24], 2,5-디옥소발레레이트 데하이드로게나제[1.2.1.26], 알데하이드 데하이드로게나제[1.2.1.3/4/5], 베타인-알데하이드 데하이드로게나제[1.2.1.8] 또는 알데하이드 페레독신 옥시도리덕타제[1.2.7.5]이다.
본원에 개시된 미생물의 일부 양태에서, 이종성 효소는 바실러스(Bacillus), 클로스트리디움(Clostridium), 에스케리키아(Escherichia), 글루코노박터(Gluconobacter), 하이포미크로비움(Hyphomicrobium), 리시니바실러스(Lysinibacillus), 패니바실러스(Paenibacillus), 슈도모나스(Pseudomonas), 세디멘티콜라(Sedimenticola), 스포로사르치나(Sporosarcina), 스트렙토마이세스(Streptomyces), 테르미티오바실러스(Thermithio), 테르모토가(Thermotoga) 및 제아(Zea)로 이루어진 군으로부터 선택되는 속(genus)으로부터 유래된다.
본원에 개시된 미생물의 일부 양태에서, 하나 이상의 이종성 효소는 미생물에서 발현을 위해 코돈-최적화된다.
본원에 개시된 미생물의 일부 양태에서, 미생물은 아세틸-CoA를 피루베이트로 전환할 수 있는 효소; 피루베이트를 옥살로아세테이트로 전환할 수 있는 효소; 피루베이트를 말레이트로 전환할 수 있는 효소; 피루베이트를 포스페놀피루베이트로 전환할 수 있는 효소; 옥살로아세테이트를 시트릴-CoA로 전환할 수 있는 효소; 시트릴-CoA를 시트레이트로 전환할 수 있는 효소; 시트레이트를 아코니테이트로 그리고 아코니테이트를 이소-시트레이트로 전환할 수 있는 효소; 포스포에놀피루베이트를 옥살로아세테이트로 전환할 수 있는 효소; 포스포에놀피루베이트를 2-포스포-D-글리세레이트로 전환할 수 있는 효소; 2-포스포-D-글리세레이트를 3-포스포-D-글리세레이트로 전환할 수 있는 효소; 3-포스포-D-글리세레이트를 3-포스포노옥시피루베이트로 전환할 수 있는 효소; 3-포스포노옥시피루베이트를 3-포스포-L-세린으로 전환할 수 있는 효소; 3-포스포-L-세린을 세린으로 전환할 수 있는 효소; 세린을 글리신으로 전환할 수 있는 효소; 5,10-메틸렌테트라하이드로폴레이트를 글리신으로 전환할 수 있는 효소; 세린을 하이드록시피루베이트로 전환할 수 있는 효소; D-글리세레이트를 하이드록시피루베이트로 전환할 수 있는 효소; 말레이트를 글리옥실레이트로 전환할 수 있는 효소; 글리옥실레이트를 글리콜레이트로 전환할 수 있는 효소; 하이드록시피루베이트를 글리코알데하이드로 전환할 수 있는 효소; 및/또는 글리코알데하이드를 에틸렌 글리콜로 전환할 수 있는 효소 중 하나 이상을 추가로 포함한다.
본원에 개시된 미생물의 일부 양태에서, 미생물은 옥살로아세테이트를 시트레이트로 전환할 수 있는 이종성 효소, 글리신을 글리옥실레이트로 전환할 수 있는 이종성 효소, 및/또는 글리콜레이트를 글리코알데하이드로 전환할 수 있는 이종성 효소를 과발현한다.
본원에 개시된 미생물의 일부 양태에서, 미생물은 피루베이트를 옥살로아세테이트로 전환할 수 있는 효소, 시트레이트를 아코니테이트로 그리고 아코니테이트를 이소-시트레이트로 전환할 수 있는 효소, 포스포에놀피루베이트를 옥살로아세테이트로 전환할 수 있는 효소, 세린을 글리신으로 전환할 수 있는 효소, 5,10-메틸렌테트라하이드로폴레이트를 글리신으로 전환할 수 있는 효소, 글리옥실레이트를 글리콜레이트로 전환할 수 있는 효소; 및/또는 글리코알데하이드를 에틸렌 글리콜로 전환할 수 있는 효소를 과발현한다.
본원에 개시된 미생물의 일부 양태에서, 미생물은 이소시트레이트 데하이드로게나제, 글리세레이트 데하이드로게나제, 글리콜레이트 데하이드로게나제, 글리세레이트 데하이드로게나제, 글리콜레이트 데하이드로게나제, 알데하이드 페레독신 옥시도리덕타제 및 알데하이드 데하이드로게나제로 이루어진 군으로부터 선택되는 하나 이상의 효소에서 교란적 돌연변이(disruptive mutation)를 포함한다.
본원에 개시된 미생물의 일부 양태에서, 미생물은 아세토박테리움(Acetobacterium), 알칼리바쿨룸(Alkalibaculum), 블라우티아(Blautia), 부티리박테리움(Butyribacterium), 클로스트리디움, 유박테리움(Eubacterium), 무렐라(Moorella), 옥소박터(Oxobacter), 스포로무사(Sporomusa) 및 테르모아내로박터(Thermoanaerobacter)로 이루어진 군으로부터 선택되는 속의 구성원이다.
본원에 개시된 미생물의 일부 양태에서, 미생물은 아세토박테리움 우디이(Acetobacterium woodii), 알칼리바쿨룸 박키이(Alkalibaculum bacchii), 블라우티아 프로덕타(Blautia producta), 부티리박테리움 메틸로트로피쿰(Butyribacterium methylotrophicum), 클로스트리디움 아세티쿰(Clostridium aceticum), 클로스트리디움 아우토에타노게눔(Clostridium autoethanogenum), 클로스트리디움 카르복시디보란스(Clostridium carboxidivorans), 클로스트리디움 코스카티이(Clostridium coskatii), 클로스트리디움 드라케이(Clostridium drakei), 클로스트리디움 포르미코아세티쿰(Clostridium formicoaceticum), 클로스트리디움 륭달리이(Clostridium ljungdahlii), 클로스트리디움 마그눔(Clostridium magnum), 클로스트리디움 라그스달레이(Clostridium ragsdalei), 클로스트리디움 스카톨로게네스(Clostridium scatologenes), 유박테리움 리모숨(Eubacterium limosum), 무렐라 테르마우토트로피카(Moorella thermautotrophica), 무렐라 테르모아세티카(Moorella thermoacetica), 옥소박터 펜니기이(Oxobacter pfennigii), 스포로무사 오바타(Sporomusa ovata), 스포로무사 실바세티카(Sporomusa silvacetica), 스포로무사 스패로이데스(Sporomusa sphaeroides) 및 테르모아내로박터 키우비(Thermoanaerobacter kiuvi)로 이루어진 군으로부터 선택되는 부모 미생물로부터 유래된다.
본원에 개시된 미생물의 일부 양태에서, 미생물은 클로스트리디움 아우토에타노게눔, 클로스트리디움 륭달리이 및 클로스트리디움 라그스달레이로 이루어진 군으로부터 선택되는 부모 미생물로부터 유래된다.
본원에 개시된 미생물의 일부 양태에서, 미생물은 본연의(native) 또는 이종성 우드-륭달(Wood-Ljungdahl) 경로를 포함한다.
본원에 개시된 미생물의 일부 양태에서, 미생물은 글리옥실레이트 또는 글리콜레이트를 에틸렌 글리콜 전구체로서 생성한다.
나아가, 본 발명은 에틸렌 글리콜 또는 에틸렌 글리콜 전구체를 생성하는 방법을 제공하며, 상기 방법은 본원에 개시된 미생물을 영양소 배지에서 기질의 존재 하에 배양하여, 미생물이 에틸렌 글리콜 또는 에틸렌 글리콜 전구체를 생성하는 단계를 포함한다.
본원에 개시된 방법의 일부 양태에서, 기질은 CO, CO2 및 H2 중 하나 이상을 포함한다.
본원에 개시된 방법의 일부 양태에서, 기질의 적어도 일부는 산업적 폐가스, 산업적 오프 가스, 또는 합성 가스이다.
본원에 개시된 방법의 일부 양태에서, 미생물은 글리옥실레이트 또는 글리콜레이트를 에틸렌 글리콜 전구체로서 생성한다.
본원에 개시된 방법의 일부 양태에서, 상기 방법은 에틸렌 글리콜 또는 에틸렌 글리콜 전구체를 영양소 배지로부터 분리하는 단계를 추가로 포함한다.
본원에 개시된 방법의 일부 양태에서, 미생물은 에탄올, 2,3-부탄디올 및 숙시네이트 중 하나 이상을 추가로 생성한다.
나아가, 본 발명은 본원에 기재된 방법에 의해 생성된 에틸렌 글리콜을 포함하는 조성물을 제공한다. 일부 양태에서, 상기 조성물은 부동제(antifreeze), 보존제, 탈수제 또는 드릴링액(drilling fluid)이다.
나아가, 본 발명은 본원에 기재된 방법에 의해 생성되는 에틸렌 글리콜을 포함하는 중합체를 제공한다. 일부 양태에서, 상기 중합체는 동종중합체 또는 공중합체이다. 일부 양태에서, 상기 중합체는 폴리에틸렌 글리콜 또는 폴리에틸렌 테레프탈레이트이다.
나아가, 본 발명은 본원에 기재된 중합체를 포함하는 조성물을 제공한다. 일부 양태에서, 상기 조성물은 섬유, 수지, 필름 또는 플라스틱이다.
본 발명의 이들 및 다른 특징 및 장점은 첨부된 청구범위와 함께 하기의 상세한 설명으로부터 보다 완전히 이해될 것이다. 청구범위의 범위는 본 설명에 기재된 특징 및 장점에 대한 구체적인 논의에 의해서가 아니라 청구범위 내의 기재 내용에 의해 정의된다는 점에 유의한다.
본 발명의 도면의 하기의 상세한 설명은 하기의 도면과 함께 읽을 때 가장 잘 이해될 수 있으며, 도면에서 유사한 구조는 유사한 참조 숫자로 표시되고, 도면에서:
도 1은 CO, CO2 및/또는 H2를 포함하는 가스성 기질로부터 에틸렌 글리콜, 글리콜레이트 및 글리옥실레이트의 생성을 위한 경로를 보여주는 반응식이다.
도 2a 내지 도 2e는 실시예 1 내지 4에서 사용되는 플라스미드의 지도(map)이다. 도 2a는 실시예 1에 기재된 바와 같은 발현 셔틀 벡터, pIPL12의 지도이다. 도 2b는 실시예 1에 기재된 바와 같은 비. 서브틸리스(B. subtilis) 시트레이트 신타제, 이. 콜라이 이소시트레이트 리아제 및 지. 옥시단스(G. oxydans) 글리콜알데하이드 데하이드로게나제를 포함하는 플라스미드 pMEG042의 지도이다. 도 2c는 실시예 2에 기재된 바와 같은 에스. 티오타우리니(S. thiotaurini) 알라닌-글리옥실레이트 아미노트랜스퍼라제 및 피. 플루오레센스(P. fluorescens) 알데하이드 데하이드로게나제를 포함하는 플라스미드 pMEG058의 지도이다. 도 2d는 실시예 3에 기재된 바와 같은 에스. 티오타우리니 알라닌-글리옥실레이트 아미노트랜스퍼라제 및 지. 옥시단스 알데하이드 데하이드로게나제를 포함하는 플라스미드 pMEG059의 지도이다. 도 2e는 실시예 4에 기재된 바와 같은 씨. 악시두리치(C. acidurici) 부류(class) V 아미노트랜스퍼라제 및 피. 플루오레센스 알데하이드 데하이드로게나제를 포함하는 플라스미드 pMEG061의 지도이다.
도 3a는 pMEG042(클론 1-3)를 발현하는 씨. 아우토에타노게눔 또는 빈 벡터(음성 대조군)를 포함하는 씨. 아우토에타노게눔의 바이오매스 수준(g 건조 세포 중량/L)을 보여준다. 도 3b는 독립영양적으로 성장하고 발현 벡터 pMEG042를 옮기는 씨. 아우토에타노게눔에서 시간 경과에 따라 생성되는 에틸렌 글리콜을 음성 대조군(빈 벡터)와 비교하여 보여준다. 도 3c는 독립영양적으로 성장하고 발현 벡터 pMEG042를 옮기는 씨. 아우토에타노게눔에서 시간 경과에 따라 생성되는 글리콜레이트를 보여준다. 실시예 1을 참조한다.
도 4a는 pMEG058(클론 1-2)을 발현하는 씨. 아우토에타노게눔 또는 빈 벡터(음성 대조군)를 포함하는 씨. 아우토에타노게눔의 바이오매스 수준(g 건조 세포 중량/L)을 보여준다. 도 4b는 독립영양적으로 성장하고 발현 벡터 pMEG058을 옮기는 씨. 아우토에타노게눔에서 시간 경과에 따라 생성되는 에틸렌 글리콜을 음성 대조군(빈 벡터)와 비교하여 보여준다. 실시예 2를 참조한다.
도 5a는 pMEG059(클론 1-3)를 발현하는 씨. 아우토에타노게눔 또는 빈 벡터(음성 대조군)를 포함하는 씨. 아우토에타노게눔의 바이오매스 수준(g 건조 세포 중량/L)을 보여준다. 도 5b는 독립영양적으로 성장하고 발현 벡터 pMEG059를 옮기는 씨. 아우토에타노게눔에서 시간 경과에 따라 생성되는 에틸렌 글리콜을 음성 대조군(빈 벡터)와 비교하여 보여준다. 실시예 3을 참조한다.
도 6a는 pMEG061(클론 1)을 발현하는 씨. 아우토에타노게눔 또는 빈 벡터(음성 대조군)를 포함하는 씨. 아우토에타노게눔의 바이오매스 수준(g 건조 세포 중량/L)을 보여준다. 도 6b는 독립영양적으로 성장하고 발현 벡터 pMEG061을 옮기는 씨. 아우토에타노게눔에서 시간 경과에 따라 생성되는 에틸렌 글리콜을 음성 대조군(빈 벡터)와 비교하여 보여준다. 실시예 4를 참조한다.
도 1은 CO, CO2 및/또는 H2를 포함하는 가스성 기질로부터 에틸렌 글리콜, 글리콜레이트 및 글리옥실레이트의 생성을 위한 경로를 보여주는 반응식이다.
도 2a 내지 도 2e는 실시예 1 내지 4에서 사용되는 플라스미드의 지도(map)이다. 도 2a는 실시예 1에 기재된 바와 같은 발현 셔틀 벡터, pIPL12의 지도이다. 도 2b는 실시예 1에 기재된 바와 같은 비. 서브틸리스(B. subtilis) 시트레이트 신타제, 이. 콜라이 이소시트레이트 리아제 및 지. 옥시단스(G. oxydans) 글리콜알데하이드 데하이드로게나제를 포함하는 플라스미드 pMEG042의 지도이다. 도 2c는 실시예 2에 기재된 바와 같은 에스. 티오타우리니(S. thiotaurini) 알라닌-글리옥실레이트 아미노트랜스퍼라제 및 피. 플루오레센스(P. fluorescens) 알데하이드 데하이드로게나제를 포함하는 플라스미드 pMEG058의 지도이다. 도 2d는 실시예 3에 기재된 바와 같은 에스. 티오타우리니 알라닌-글리옥실레이트 아미노트랜스퍼라제 및 지. 옥시단스 알데하이드 데하이드로게나제를 포함하는 플라스미드 pMEG059의 지도이다. 도 2e는 실시예 4에 기재된 바와 같은 씨. 악시두리치(C. acidurici) 부류(class) V 아미노트랜스퍼라제 및 피. 플루오레센스 알데하이드 데하이드로게나제를 포함하는 플라스미드 pMEG061의 지도이다.
도 3a는 pMEG042(클론 1-3)를 발현하는 씨. 아우토에타노게눔 또는 빈 벡터(음성 대조군)를 포함하는 씨. 아우토에타노게눔의 바이오매스 수준(g 건조 세포 중량/L)을 보여준다. 도 3b는 독립영양적으로 성장하고 발현 벡터 pMEG042를 옮기는 씨. 아우토에타노게눔에서 시간 경과에 따라 생성되는 에틸렌 글리콜을 음성 대조군(빈 벡터)와 비교하여 보여준다. 도 3c는 독립영양적으로 성장하고 발현 벡터 pMEG042를 옮기는 씨. 아우토에타노게눔에서 시간 경과에 따라 생성되는 글리콜레이트를 보여준다. 실시예 1을 참조한다.
도 4a는 pMEG058(클론 1-2)을 발현하는 씨. 아우토에타노게눔 또는 빈 벡터(음성 대조군)를 포함하는 씨. 아우토에타노게눔의 바이오매스 수준(g 건조 세포 중량/L)을 보여준다. 도 4b는 독립영양적으로 성장하고 발현 벡터 pMEG058을 옮기는 씨. 아우토에타노게눔에서 시간 경과에 따라 생성되는 에틸렌 글리콜을 음성 대조군(빈 벡터)와 비교하여 보여준다. 실시예 2를 참조한다.
도 5a는 pMEG059(클론 1-3)를 발현하는 씨. 아우토에타노게눔 또는 빈 벡터(음성 대조군)를 포함하는 씨. 아우토에타노게눔의 바이오매스 수준(g 건조 세포 중량/L)을 보여준다. 도 5b는 독립영양적으로 성장하고 발현 벡터 pMEG059를 옮기는 씨. 아우토에타노게눔에서 시간 경과에 따라 생성되는 에틸렌 글리콜을 음성 대조군(빈 벡터)와 비교하여 보여준다. 실시예 3을 참조한다.
도 6a는 pMEG061(클론 1)을 발현하는 씨. 아우토에타노게눔 또는 빈 벡터(음성 대조군)를 포함하는 씨. 아우토에타노게눔의 바이오매스 수준(g 건조 세포 중량/L)을 보여준다. 도 6b는 독립영양적으로 성장하고 발현 벡터 pMEG061을 옮기는 씨. 아우토에타노게눔에서 시간 경과에 따라 생성되는 에틸렌 글리콜을 음성 대조군(빈 벡터)와 비교하여 보여준다. 실시예 4를 참조한다.
본 발명은 에틸렌 글리콜의 생물학적 생성을 위한 미생물을 제공한다. "미생물"은 현미경적 유기체(microscopic organism), 특히, 박테리아, 고세균, 바이러스 또는 진균류이다. 바람직한 구현예에서, 본 발명의 미생물은 박테리아이다.
용어 "비-천연 발생"은 미생물에 관하여 사용되는 경우, 상기 미생물이 기준 종(species)의 야생형 균주를 포함한 기준 종의 천연 발생 균주에서 발견되지 않는 유전적 변형을 적어도 하나 가짐을 의미한다. 비-천연 발생 미생물은 전형적으로, 실험실 또는 연구 시설에서 개발된다. 본 발명의 미생물은 비-천연 발생이다.
용어 "유전적 변형," "유전적 변경," 또는 "유전적 조작"은 광범위하게는, 인간에 의한 미생물의 게놈 또는 핵산의 조작을 지칭한다. 마찬가지로, 용어 "유전적으로 변형된," "유전적으로 변경된," 또는 "유전자 조작된"은 이러한 유전적 변형, 유전적 변경 또는 유전적 조작을 함유하는 미생물을 지칭한다. 이들 용어는 실험실-발생된 미생물을 천연-발생 미생물로부터 구별하는 데 사용될 수 있다. 유전적 변형의 방법은 예를 들어, 이종성 유전자 발현, 유전자 또는 프로모터 삽입 또는 결실, 핵산 돌연변이, 변경된 유전자 발현 또는 불활성화, 효소 조작, 지향 진화(directed evolution), 지식-기초 설계, 무작위 돌연변이발생 방법, 유전자 셔플링, 및 코돈 최적화를 포함한다. 본 발명의 미생물은 유전자 조작된다.
"재조합형(recombinant)"은, 핵산, 단백질 또는 미생물이 유전적 변형, 조작 또는 재조합의 생성물임을 나타낸다. 일반적으로, 용어 "재조합형"은, 다수의 공급원들, 예컨대 2 이상의 상이한 균주 또는 종의 미생물로부터 유래되는 유전적 물질을 함유하거나 이에 의해 인코딩되는 핵산, 단백질 또는 미생물을 지칭한다. 본 발명의 미생물은 일반적으로, 재조합형이다.
"야생형"은, 이것이 자연에서 발생하며 돌연변이체 또는 변이체 형태로부터 구분되는 바와 같이 유기체, 균주, 유전자 또는 특징의 전형적인 형태를 지칭한다.
"내인성"은 본 발명의 미생물이 유래되는 야생형 또는 부모 미생물에 존재하거나 발현되는 핵산 또는 단백질을 지칭한다. 예를 들어, 내인성 유전자는 본 발명의 미생물이 유래되는 야생형 또는 부모 미생물에 천연적으로 존재하는 유전자이다. 일 구현예에서, 내인성 유전자의 발현은 외인성 조절 요소, 예컨대 외인성 프로모터에 의해 제어될 수 있다.
"외인성"은 본 발명의 미생물의 외부에서 기원하는 핵산 또는 단백질을 지칭한다. 예를 들어, 외인성 유전자 또는 효소는 인공적으로 또는 재조합적으로 생성되고 본 발명의 미생물에 도입되거나 발현될 수 있다. 외인성 유전자 또는 효소는 또한, 이종성 미생물로부터 단리되고 본 발명의 미생물에 도입되거나 발현될 수 있다. 외인성 핵산은 본 발명의 미생물의 게놈 내로 통합되거나 본 발명의 미생물에서 염색체-외 상태, 예를 들어, 플라스미드에 잔류하도록 적응될 수 있다.
"이종성"은 본 발명의 미생물이 유래하는 야생형 또는 부모 미생물 내에 존재하지 않는 핵산 또는 단백질을 지칭한다. 예를 들어, 이종성 유전자 또는 효소는 상이한 균주 또는 종으로부터 유래되고 본 발명의 미생물 내에 도입 또는 발현될 수 있다. 이종성 유전자 또는 효소는, 이것이 상이한 균주 또는 종에서 발생하는 형태로 본 발명의 미생물 내에 도입 또는 발현될 수 있다. 대안적으로, 이종성 유전자 또는 효소는 일부 방식으로, 예를 들어, 본 발명의 미생물에서의 발현을 위해 이를 코돈-최적화함으로써 또는 기능을 변경시키기 위해, 예컨대 효소 활성의 방향을 역전시키거나 기질 특이성을 변경시키기 위해 이를 조작함으로써 변형될 수 있다.
특히, 본원에 기재된 미생물에서 발현되는 이종성 핵산 또는 단백질은 바실러스, 클로스트리디움, 에스케리키아, 글루코노박터, 하이포미크로비움, 리시니바실러스, 패니바실러스, 슈도모나스, 세디멘티콜라, 스포로사르치나, 스트렙토마이세스, 테르미티오바실러스, 테르모토가, 제아, 클레브시엘라(Klebsiella), 미코박테리움(Mycobacterium), 살모넬라(Salmonella), 미코박테로이데스(Mycobacteroides), 스타필로콕커스(Staphylococcus), 부르콜데리아(Burkholderia), 리스테리아(Listeria), 악시네토박터(Acinetobacter), 시겔라(Shigella), 네이쎄리아(Neisseria), 보르데텔라(Bordetella), 스트렙토콕커스(Streptococcus), 엔테로박터(Enterobacter), 비브리오(Vibrio), 레지오넬라(Legionella), 잔토모나스(Xanthomonas), 세라티아(Serratia), 크로노박터(Cronobacter), 쿠프리아비두스(Cupriavidus), 헬리코박터(Helicobacter), 예르시니아(Yersinia), 쿠티박테리움(Cutibacterium), 프란치셀라(Francisella), 펙토박테리움(Pectobacterium), 아르코박터(Arcobacter), 락토바실러스, 쉐와넬라(Shewanella), 에르위니아(Erwinia), 술푸로스피릴룸(Sulfurospirillum), 펩토콕카세애(Peptococcaceae), 테르모콕커스(Thermococcus), 사카로마이세스(Saccharomyces), 피로콕커스(Pyrococcus), 글리신, 호모(Homo), 랄스토니아(Ralstonia), 브레비박테리움(Brevibacterium), 메틸로박테리움(Methylobacterium), 제오바실러스(Geobacillus), 보스(bos), 갈루스(gallus), 아내로콕커스(Anaerococcus), 제노푸스(Xenopus), 암블리린쿠스(Amblyrhynchus), 라투스(rattus), 무스(mus), 수스(sus), 로도콕커스(Rhodococcus), 리조비움(Rhizobium), 메가스패라(Megasphaera), 메조리조비움(Mesorhizobium), 펩토콕커스(Peptococcus), 아그로박테리움(Agrobacterium), 캄필로박터(Campylobacter), 아세토박테리움, 알칼리바쿨룸, 블라우티아, 부티리박테리움, 유박테리움, 무렐라, 옥소박터, 스포로무사, 테르모아내로박터, 스키조사카로마이세스(Schizosaccharomyces), 패니바실러스, 픽티바실러스(Fictibacillus), 리시니바실러스, 오르니티니바실러스(Ornithinibacillus), 할로바실러스(Halobacillus), 쿠르티아(Kurthia), 렌티바실러스(Lentibacillus), 아녹시바실러스(Anoxybacillus), 솔리바실러스(Solibacillus), 비르기바실러스(Virgibacillus), 알리사이클로바실러스(Alicyclobacillus), 스포로사르치나, 살리미크로비움(Salimicrobium), 스포로사르치나, 플란코콕커스(Planococcus), 코리네박테리움(Corynebacterium), 테르매로박터(Thermaerobacter), 설포바실러스(Sulfobacillus) 또는 심비오박테리움(Symbiobacterium)으로부터 유래될 수 있다.
용어 "폴리뉴클레오타이드," "뉴클레오타이드," "뉴클레오타이드 서열," "핵산," 및 "올리고뉴클레오타이드"는 상호교환적으로 사용된다. 이들은 임의의 길이의 뉴클레오타이드(데옥시리보뉴클레오타이드 또는 리보뉴클레오타이드)의 중합체성 형태, 또는 이들의 유사체를 지칭한다. 폴리뉴클레오타이드는 임의의 3차원 구조를 가질 수 있고, 기지의(known) 또는 미지의(unknown) 임의의 기능을 수행할 수 있다. 하기는 폴리뉴클레오타이드의 비제한적인 예이다: 유전자 또는 유전자 단편의 코딩 또는 비-코딩 영역, 연결 분석으로부터 정의된 유전자좌들(유전자좌), 엑손, 인트론, 메신저 RNA(mRNA), 전달 RNA, 리보솜 RNA, 짧은 간섭 RNA(siRNA), 짧은-헤어핀 RNA(shRNA), 마이크로-RNA(miRNA), 리보자임, cDNA, 재조합형 폴리뉴클레오타이드, 분지형 폴리뉴클레오타이드, 플라스미드, 벡터, 임의의 서열의 단리된 DNA, 임의의 서열의 단리된 RNA, 핵산 프로브, 및 프라이머. 폴리뉴클레오타이드는 하나 이상의 변형된 뉴클레오타이드, 예컨대 메틸화된 뉴클레오타이드 또는 뉴클레오타이드 유사체를 포함할 수 있다. 존재한다면, 뉴클레오타이드 구조에 대한 변형은 중합체의 조립 전 또는 후에 주어질 수 있다. 뉴클레오타이드의 서열은 비-뉴클레오타이드 구성성분에 의해 간섭될 수 있다. 폴리뉴클레오타이드는 중합 후에, 예컨대 표지화 구성성분과의 컨쥬게이션에 의해 추가로 변형될 수 있다.
본원에 사용된 바와 같이, "발현"은, 폴리뉴클레오타이드가 DNA 주형으로부터 (예컨대 mRNA 또는 다른 RNA 전사체로) 전사되는 과정, 및/또는 전사된 mRNA가 후속하여 펩타이드, 폴리펩타이드 또는 단백질로 번역되는 과정을 지칭한다. 전사체 및 인코딩된 폴리펩타이드는 통틀어 "유전자 생성물"로 지칭될 수 있다.
용어 "폴리펩타이드", "펩타이드," 및 "단백질"는 본원에서 상호교환적으로 사용되어, 임의의 길이의 아미노산의 중합체를 지칭한다. 중합체는 선형 또는 분지형일 수 있으며, 중합체는 변형된 아미노산을 포함할 수 있고, 중합체는 비-아미노산에 의해 간섭될 수 있다. 상기 용어는 또한, 예를 들어, 디설파이드 결합 형성, 글리코실화, 지질화, 아세틸화, 인산화, 또는 임의의 다른 조작, 예컨대 표지화 구성성분과의 컨쥬게이션에 의해 변형되었던 아미노산 중합체를 포괄한다. 본원에 사용된 바와 같이, 용어 "아미노산"은 글리신 및 D 또는 L 광학 이성질체 둘 모두를 포함한 천연 및/또는 비천연 또는 합성 아미노산, 및 아미노산 유사체 및 펩티도모방체(peptidomimetic)를 포함한다.
"효소 활성," 또는 간단히 "활성"은 광범위하게는, 비제한적으로 반응을 촉매하기 위한 효소의 활성, 효소의 양 또는 효소의 유용성을 포함한 효소적 활성을 지칭한다. 이에, 효소 활성을 "증가시키는" 것은 반응을 촉매화하기 위해 효소의 활성을 증가시키거나, 효소의 양을 증가시키거나, 효소의 유용성을 증가시키는 것을 포함한다. 유사하게는, 효소 활성을 "저하시키는" 것은 반응을 촉매화하기 위해 효소의 활성을 저하시키거나, 효소의 양을 저하시키거나, 효소의 유용성을 저하시키는 것을 포함한다.
"돌연변이화된"은, 본 발명의 미생물이 유래되는 야생형 또는 부모 미생물과 비교하여 본 발명의 미생물에서 변형되었던 핵산 또는 단백질을 지칭한다. 일 구현예에서, 돌연변이는 효소를 인코딩하는 유전자에서 결실, 삽입 또는 치환일 수 있다. 또 다른 구현예에서, 돌연변이는 효소에서 하나 이상의 아미노산의 결실, 삽입 또는 치환일 수 있다.
"부모 미생물"은 본 발명의 미생물을 발생시키는 데 사용되는 미생물이다. 부모 미생물은 천연-발생 미생물(즉, 야생형 미생물) 또는 이전에 변형되었던 미생물(즉, 돌연변이체 또는 재조합형 미생물)일 수 있다. 본 발명의 미생물은 부모 미생물에서 발현되지 않거나 과발현되지 않았던 하나 이상의 효소를 발현하도록 과발현하도록 변형될 수 있다. 유사하게는, 본 발명의 미생물은 부모 미생물에 의해 함유되지 않은 하나 이상의 유전자를 함유하도록 변형될 수 있다. 본 발명의 미생물은 또한, 부모 미생물에서 발현되었던 하나 이상의 효소를 발현하지 않거나 더 적은 양으로 발현하도록 변형될 수 있다.
본 발명의 미생물은 본질적으로 임의의 부모 미생물로부터 유래될 수 있다. 일 구현예에서, 본 발명의 미생물은 클로스트리디움 아세토부틸리쿰(Clostridium acetobutylicum), 클로스트리디움 베이예린키이(Clostridium beijerinckii), 에스케리키아 콜라이 및 사카로마이세스 세레비지애(Saccharomyces cerevisiae)로 이루어진 군으로부터 선택되는 부모 미생물로부터 유래될 수 있다. 다른 구현예에서, 상기 미생물은 아세토박테리움 우디이, 알칼리바쿨룸 박키이, 블라우티아 프로덕트(Blautia product), 부티리박테리움 메틸로트로피쿰, 클로스트리디움 아세티쿰, 클로스트리디움 아우토에타노게눔, 클로스트리디움 카르복시디보란스, 클로스트리디움 코스카티이, 클로스트리디움 드라케이, 클로스트리디움 포르미코아세티쿰, 클로스트리디움 륭달리이, 클로스트리디움 마그눔, 클로스트리디움 라그스달레이, 클로스트리디움 스카톨로게네스, 유박테리움 리모숨, 무렐라 테르마우토트로피카, 무렐라 테르모아세티카, 옥소박터 펜니기이, 스포로무사 오바타, 스포로무사 실바세티카, 스포로무사 스패로이데스 및 테르모아내로박터 키우비로 이루어진 군으로부터 선택되는 부모 미생물로부터 유래된다. 바람직한 구현예에서, 부모 미생물은 클로스트리디움 아우토에타노게눔, 클로스트리디움 륭달리이 또는 클로스트리디움 라그스달레이이다. 특히 바람직한 구현예에서, 부모 미생물은 2010년 6월 7일에 부다페스트 조약의 협약 및 합의된 기탁 번호 DSM23693 하에 독일 브라운슈바이크 D-38124, 인호펜스트라쎄에 위치한 Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH(DSMZ)에 2010년 6월 7일에 기탁되었던 클로스트리디움 아우토에타노게눔 LZ1561이다. 이 균주는 WO 2012/015317로서 공개된 국제 특허 출원 PCT/NZ2011/000144에 기재되어 있다.
용어 "~로부터 유래된"은, 핵산, 단백질 또는 미생물이 상이한(예를 들어, 부모 또는 야생형) 핵산, 단백질 또는 미생물로부터 변형되거나 적응되어 새로운 핵산, 단백질 또는 미생물을 생성함을 나타낸다. 이러한 변형 또는 적응은 전형적으로, 핵산 또는 유전자의 삽입, 결실, 돌연변이 또는 치환을 포함한다. 일반적으로, 본 발명의 미생물은 부모 미생물로부터 유래된다. 일 구현예에서, 본 발명의 미생물은 클로스트리디움 아우토에타노게눔, 클로스트리디움 륭달리이 또는 클로스트리디움 라그스달레이로부터 유래된다. 바람직한 구현예에서, 본 발명의 미생물은 DSMZ 기탁 번호 DSM23693 하에 기탁된 클로스트리디움 아우토에타노게눔 LZ1561로부터 유래된다.
본 발명의 미생물은 기능적 특징에 기초하여 추가로 분류될 수 있다. 예를 들어, 본 발명의 미생물은 C1-고정 미생물, 혐기성 생물, 아세토겐, 에탄올로겐, 일산화탄소 영양 생물 및/또는 메탄영양체(methanotroph)일 수 있거나 이로부터 유래될 수 있다.
표 1은 미생물의 대표적인 목록을 제공하고, 이들 미생물의 기능적인 특징을 식별한다.
"우드-륭달"은 예를 들어, 문헌[Ragsdale, Biochim Biophys Acta, 1784: 1873-1898, 2008]에 기재된 바와 같은 탄소 고정의 우드-륭달 경로를 지칭한다. "우드-륭달 미생물"은, 예상대로, 우드-륭달 경로를 포함하는 미생물을 의미한다. 종종, 본 발명의 미생물은 본연의 우드-륭달 경로를 함유한다. 본원에서, 우드-륭달 경로는 본연의, 비변형된 우드-륭달 경로일 수 있거나, 우드-륭달 경로는 이 경로가 CO, CO2 및/또는 H2를 아세틸-CoA로 전환하는 작용을 여전히 하는 한, 어느 정도의 유전적 변형(예를 들어, 과발현, 이종성 발현, 녹아웃 등)을 갖는 우드-륭달 경로일 수 있다.
"C1"은 1-탄소 분자, 예를 들어, CO, CO2, CH4 또는 CH3OH를 지칭한다. "C1-옥시게네이트"는, 적어도 하나의 산소 원자를 또한 포함하는 1-탄소 분자, 예를 들어, CO, CO2 또는 CH3OH를 지칭한다. "C1-탄소 공급원"은 본 발명의 미생물에 대한 부분적인 또는 단독(sole) 탄소 공급원으로서 역할을 하는 1-탄소 분자를 지칭한다. 예를 들어, C1-탄소 공급원은 CO, CO2, CH4, CH3OH 또는 CH2O2 중 하나 이상을 포함할 수 있다. 바람직하게는, C1-탄소 공급원은 CO와 CO2 중 하나 또는 둘 모두를 포함한다. "C1-고정 미생물"은 C1-탄소 공급원으로부터 하나 이상의 생성물을 생성하는 능력을 갖는 미생물이다. 종종, 본 발명의 미생물은 C1-고정 박테리아이다. 바람직한 구현예에서, 본 발명의 미생물은 표 1에서 식별된 C1-고정 미생물로부터 유래된다.
"혐기성 생물(anaerobe)"은 성장을 위해 산소를 필요로 하지 않는 미생물이다. 혐기성 생물은 산소가 소정의 역치보다 높게 존재한다면 부정적으로 반응하거나 심지어 사멸할 수 있다. 그러나, 일부 혐기성 생물은 이따금 "미소산소(microoxic) 조건"으로 지칭되는 저수준의 산소(예를 들어 0.000001 내지 5% 산소)를 관용할 수 있다. 종종, 본 발명의 미생물은 혐기성 생물이다. 바람직한 구현예에서, 본 발명의 미생물은 표 1에서 식별된 혐기성 미생물로부터 유래된다.
"아세토겐"은 에너지 보존 및 아세틸-CoA 및 아세틸-CoA-유래 생성물, 예컨대 아세테이트의 합성을 위해 우드-륭달 경로를 이들의 주요 기전으로서 사용하는 절대적(obligately) 혐기성 박테리아이다(문헌[Ragsdale, Biochim Biophys Acta, 1784: 1873-1898, 2008]). 특히, 아세토겐은 (1) CO2로부터 아세틸-CoA의 환원적 합성을 위한 기전, (2) 말단 전자-수용, 에너지 보존 과정, (3) 세포 탄소의 합성에서 CO2의 고정(동화)을 위한 기전으로서 우드-륭달 경로를 사용한다(문헌[Drake, Acetogenic Prokaryotes, In: The Prokaryotes, 3rd edition, p. 354, New York, NY, 2006]). 모든 천연 발생 아세토겐들은 C1-고정, 혐기성, 독립 영양 생물, 및 비-메탄영양체이다. 종종, 본 발명의 미생물은 아세토겐이다. 바람직한 구현예에서, 본 발명의 미생물은 표 1에서 식별된 아세토겐으로부터 유래된다.
"에탄올로겐"은 에탄올을 생성하거나 생성할 수 있는 미생물이다. 종종, 본 발명의 미생물은 에탄올로겐이다. 바람직한 구현예에서, 본 발명의 미생물은 표 1에서 식별된 에탄올로겐으로부터 유래된다.
"독립 영양 생물"은 유기 탄소의 부재 하에 성장할 수 있는 미생물이다. 대신에, 독립 영양 생물은 무기 탄소 공급원, 예컨대 CO 및/또는 CO2를 사용한다. 종종, 본 발명의 미생물은 독립 영양 생물이다. 바람직한 구현예에서, 본 발명의 미생물은 표 1에서 식별된 독립 영양 생물로부터 유래된다.
"일산화탄소 영양 생물"은 탄소 및 에너지의 단독 공급원으로서 CO를 이용할 수 있는 미생물이다. 종종, 본 발명의 미생물은 일산화탄소 영양 생물이다. 바람직한 구현예에서, 본 발명의 미생물은 표 1에서 식별된 일산화탄소 영양 생물로부터 유래된다.
"메탄영양체"는 탄소 및 에너지의 단독 공급원으로서 메탄을 이용할 수 있는 미생물이다. 소정의 구현예에서, 본 발명의 미생물은 메탄영양체이거나 메탄영양체로부터 유래된다. 다른 구현예에서, 본 발명의 미생물은 메탄영양체가 아니거나 메탄영양체로부터 유래되지 않는다.
바람직한 구현예에서, 본 발명의 미생물은 클로스트리디움 아우토에타노게눔, 클로스트리디움 륭달리이 및 클로스트리디움 라그스달레이 종을 포함하는 클로스트리디아(Clostridia)의 클러스터로부터 유래된다. 이들 종은 문헌[Abrini, Arch Microbiol, 161: 345-351, 1994(클로스트리디움 아우토에타노게눔)], [Tanner, Int J System Bacteriol, 43: 232-236, 1993(클로스트리디움 륭달리이)] 및 Huhnke, WO 2008/028055(클로스트리디움 라그스달레이)에 의해 처음으로 보고되었고 특징화되었다.
이들 3개 종들은 많은 유사성을 가진다. 특히, 이들 종은 모두 클로스트리디움 속의 C1-고정, 혐기성, 아세토겐성, 에탄올로겐성, 및 일산화탄소 영양 생물 구성원이다. 이들 종은 유사한 유전자형 및 표현형, 및 에너지 보존 및 발효 대사 방식을 가진다. 더욱이, 이들 종은, 99% 초과 동일하며, 약 22 내지 30 몰%의 DNA G + C 함량을 가지며, 그람-양성이며, 유사한 형태 및 크기를 갖고(0.5~0.7 x 3~5 μm 사이에서 대수 성장하는 세포), 중온성(mesophilic)이며(30℃ 내지 37℃에서 최적으로 성장함), 약 4 내지 7.5의 유사한 pH 범위를 갖고(약 5.5 내지 6의 최적 pH를 가짐), 시토크롬이 결여되어 있고, Rnf 복합체를 통해 에너지를 보존하는 16S rRNA DNA를 갖는 클로스트리디움성(clostridial) rRNA 상동성 그룹 I에서 클러스터화된다. 또한, 카르복실산으로부터 이들의 상응하는 알코올로의 환원은 이들 종에서 나타났다(문헌[Perez, Biotechnol Bioeng, 110:1066-1077, 2012]). 중요하게는, 이들 종은 또한 모두, 소정의 조건 하에 CO-함유 가스 상에서 강한 독립 영양 생물 성장을 보여주며, 에탄올 및 아세테이트(또는 아세트산)를 주 발효 생성물로서 생성하고, 소량의 2,3-부탄디올 및 락트산을 생성한다.
그러나, 이들 3개 종들은 또한, 많은 차이를 가진다. 이들 종은 상이한 공급원들로부터 단리되었다: 토끼 소화관으로부터 클로스트리디움 아우토에타노게눔, 양계장 폐기물로부터 클로스트리디움 륭달리이, 및 담수 침강물로부터 클로스트리디움 라그스달레이. 이들 종은 다양한 당(예를 들어, 람노스, 아라비노스), 산(예를 들어, 글루코네이트, 시트레이트), 아미노산(예를 들어, 아르기닌, 히스티딘), 및 다른 기질(예를 들어, 베타인, 부탄올)의 이용 면에서 상이하다. 더욱이, 이들 종은 소정의 비타민(예를 들어, 티아민, 비오틴)에 대한 영양요구성 면에서 상이하다. 이들 종은 우드-륭달 경로 유전자 및 단백질의 핵산 및 아미노산 서열에서 차이를 갖지만, 이들 유전자 및 단백질의 일반적인 구조화 및 수는 모든 종들에서 동일한 것으로 밝혀졌다(문헌[Kφpke, Curr Opin Biotechnol, 22: 320-325, 2011]).
따라서, 요약하자면, 클로스트리디움 아우토에타노게눔, 클로스트리디움 륭달리이 또는 클로스트리디움 라그스달레이의 특징들 중 많은 것들은 해당 종에 특이적이지 않지만, 그보다는 클로스트리디움 속의 C1-고정, 혐기성, 아세토겐성, 에탄올로겐성, 및 일산화탄소 영양 생물 구성원의 이 클러스터에 대해 일반적인 특징이다. 그러나, 이들 종이 사실상 별개이기 때문에, 이들 종 중 하나의 유전적 변형 또는 조작은 이들 종 중 또 다른 것에서 동일한 효과를 갖지 않을 수 있다. 예를 들어, 성장, 성능 또는 생성물 생성에서의 차이가 관찰될 수 있다.
본 발명의 미생물은 또한, 클로스트리디움 아우토에타노게눔, 클로스트리디움 륭달리이 또는 클로스트리디움 라그스달레이의 단리물 또는 돌연변이체로부터 유래될 수 있다.클로스트리디움 아우토에타노게눔의 단리물 및 돌연변이체는 JA1-1(DSM10061)(문헌[Abrini, Arch Microbiol, 161: 345-351, 1994]), LBS1560(DSM19630)(WO 2009/064200) 및 LZ1561(DSM23693)(WO 2012/015317)을 포함한다.클로스트리디움 륭달리이의 단리물 및 돌연변이체는 ATCC 49587(문헌[Tanner, Int J Syst Bacteriol, 43: 232-236, 1993]), PETCT(DSM13528, ATCC 55383), ERI-2(ATCC 55380)(US 5,593,886), C-01(ATCC 55988)(US 6,368,819), O-52(ATCC 55989)(US 6,368,819), 및 OTA-1(문헌[Tirado-Acevedo, Production of bioethanol from synthesis gas using Clostridium ljungdahlii, PhD thesis, North Carolina State University, 2010])을 포함한다.클로스트리디움 라그스달레이의 단리물 및 돌연변이체는 PI 1(ATCC BAA-622, ATCC PTA-7826)(WO 2008/028055)을 포함한다.
그러나, 상기에 기재된 바와 같이, 본 발명의 미생물은 또한, 본질적으로 임의의 부모 미생물, 예컨대 클로스트리디움 아세토부틸리쿰, 클로스트리디움 베이예린키이, 에스케리키아 콜라이 및 사카로마이세스 세레비지애로 이루어진 군으로부터 선택되는 부모 미생물로부터 유래될 수 있다.
본 발명은 본 발명의 미생물을 기질의 존재 하에 배양하여 상기 미생물이 에틸렌 글리콜을 생성하는 단계를 포함하는 에틸렌 글리콜, 글리옥실레이트, 및 글리콜레이트를 생성하는 방법뿐만 아니라 에틸렌 글리콜, 글리옥실레이트, 및 글리콜레이트를 생성할 수 있는 미생물을 제공한다.
본 발명의 미생물은 아세틸-CoA, 예컨대 우드-륭달 경로에 의해 생성되는 아세틸-CoA를 피루베이트로 전환하는 효소를 포함할 수 있다(도 1의 반응 1). 이러한 효소는 피루베이트 신타제(PFOR)[1.2.7.1] 또는 ATP:피루베이트, 오르토포스페이트 포스포트랜스퍼라제[1.2.7.1]일 수 있다. 일부 구현예에서, 아세틸-CoA를 피루베이트로 전환하는 효소는 내인성 효소이다.
본 발명의 미생물은 피루베이트를 옥살로아세테이트로 전환하는 효소를 포함할 수 있다(도 1의 반응 2). 이 효소는 피루베이트:이산화탄소 리가제(ligase)[ADP-형성][6.4.1.1]일 수 있다. 일부 구현예에서, 피루베이트를 옥살로아세테이트로 전환하는 효소는 내인성 효소이다. 일부 구현예에서, 피루베이트를 옥살로아세테이트로 전환하는 효소는 과발현된다.
본 발명의 미생물은 옥살로아세테이트를 시트릴-CoA로 전환하는 효소를 포함할 수 있다(도 1의 반응 3). 이 효소는 시트릴-CoA 리아제[4.1.3.34]일 수 있다. 일부 구현예에서, 옥살로아세테이트를 시트릴-CoA로 전환하는 효소는 내인성 효소이다.
본 발명의 미생물은 시트릴-CoA를 시트레이트로 전환하는 효소를 포함할 수 있다(도 1의 반응 4). 이 효소는 시트레이트-CoA 트랜스퍼라제[2.8.3.10]일 수 있다. 일부 구현예에서, 시트릴-CoA를 시트레이트로 전환하는 효소는 내인성 효소이다.
본 발명의 미생물은 옥살로아세테이트를 시트레이트로 전환하는 효소를 포함할 수 있다(도 1의 반응 5). 이 효소는 시트레이트 [Si]-신타제[2.3.3.1], ATP 시트레이트 신타제[2.3.3.8], 또는 시트레이트(Re)-신타제[2.3.3.3]일 수 있다. 일부 구현예에서, 옥살로아세테이트를 시트레이트로 전환하는 효소는 내인성 효소이다. 다른 구현예에서, 옥살로아세테이트를 시트레이트로 전환하는 효소는 이종성 효소이다. 예를 들어, 일부 구현예에서, 본 발명의 미생물은 비. 서브틸리스로부터의 시트레이트 신타제 1[EC 2.3.3.16]을 포함하여, 상기 미생물은 SEQ ID NO: 2로 제시된 아미노산 서열을 인코딩하는 SEQ ID NO: 1로 제시된 뉴클레오타이드 서열을 포함한다. 일부 구현예에서, 본 발명의 미생물은 씨. 클루이베리(C. kluyveri)로부터의 시트레이트(Re)-신타제를 포함하여, 상기 미생물은 SEQ ID NO: 4로 제시된 아미노산 서열을 인코딩하는 SEQ ID NO: 3으로 제시된 뉴클레오타이드 서열을 포함한다. 일부 구현예에서, 본 발명의 미생물은 클로스트리디움 종으로부터의 시트레이트(Si)-신타제를 포함하여, 상기 미생물은 SEQ ID NO: 6으로 제시된 아미노산 서열을 인코딩하는 SEQ ID NO: 5로 제시된 뉴클레오타이드 서열을 포함한다. 일부 구현예에서, 본 발명의 미생물은 비. 서브틸리스로부터의 시트레이트 신타제 2를 포함하여, 상기 미생물은 SEQ ID NO: 8로 제시된 아미노산 서열을 인코딩하는 SEQ ID NO: 7로 제시된 뉴클레오타이드 서열을 포함한다. 일부 구현예에서, 옥살로아세테이트를 시트레이트로 전환하는 효소는 과발현된다.
본 발명의 미생물은 시트레이트를 아코니테이트로 그리고 아코니테이트를 이소-시트레이트로 전환하는 효소를 포함할 수 있다(도 1의 반응 6). 이 효소는 아코니테이트 하이드라타제[4.2.1.3]일 수 있다. 일부 구현예에서, 시트레이트를 아코니테이트로 그리고 아코니테이트를 이소-시트레이트로 전환하는 효소는 내인성 효소이다. 일부 구현예에서, 시트레이트를 아코니테이트로 그리고 아코니테이트를 이소-시트레이트로 전환하는 효소는 과발현된다.
본 발명의 미생물은 이소시트레이트를 글리옥실레이트로 전환하는 효소를 포함할 수 있다(도 1의 반응 7). 이 효소는 이소시트레이트 리아제[4.1.3.1]일 수 있다. 일부 구현예에서, 본 발명의 미생물은 제트. 마이스(Z. mays)로부터의 이소시트레이트 리아제를 포함하여, 상기 미생물은 SEQ ID NO: 10으로 제시된 아미노산 서열을 인코딩하는 SEQ ID NO: 9로 제시된 뉴클레오타이드 서열을 포함한다. 일부 구현예에서, 본 발명의 미생물은 이. 콜라이로부터의 이소시트레이트 리아제를 포함하여, 상기 미생물은 SEQ ID NO: 12로 제시된 아미노산 서열을 인코딩하는 SEQ ID NO: 11로 제시된 뉴클레오타이드 서열을 포함한다. 일부 구현예에서
본 발명의 미생물은 글리옥실레이트를 글리콜레이트로 전환하는 효소를 포함할 수 있다(도 1의 반응 8). 이 효소는 글리세레이트 데하이드로게나제[1.1.1.29], 글리옥실레이트 리덕타제[1.1.1.26/79], 또는 글리콜레이트 데하이드로게나제[1.1.99.14]일 수 있다. 일부 구현예에서, 글리옥실레이트를 글리콜레이트로 전환하는 효소는 내인성 효소이다. 일부 구현예에서, 글리옥실레이트를 글리콜레이트로 전환하는 효소는 과발현된다.
본 발명의 미생물은 글리콜레이트를 글리코알데하이드로 전환하는 효소를 포함할 수 있다(도 1의 반응 9). 이 효소는 글리콜알데하이드 데하이드로게나제[1.2.1.21], 락트알데하이드 데하이드로게나제[1.2.1.22], 숙시네이트-세미알데하이드 데하이드로게나제[1.2.1.24], 2,5-디옥소발레레이트 데하이드로게나제[1.2.1.26], 알데하이드 데하이드로게나제[1.2.1.3/4/5], 베타인-알데하이드 데하이드로게나제[1.2.1.8], 또는 알데하이드 페레독신 옥시도리덕타제[1.2.7.5]일 수 있다. 일부 구현예에서, 글리콜레이트를 글리코알데하이드로 전환하는 효소는 내인성 효소이다. 다른 구현예에서, 글리콜레이트를 글리코알데하이드로 전환하는 효소는 이종성 효소이다. 예를 들어, 일부 구현예에서, 본 발명의 미생물은 이. 콜라이로부터의 감마-아미노부티르알데하이드 데하이드로게나제를 포함하여, 상기 미생물은 SEQ ID NO: 50으로 제시된 아미노산 서열을 인코딩하는 SEQ ID NO: 49로 제시된 뉴클레오타이드 서열을 포함한다. 일부 구현예에서, 본 발명의 미생물은 이. 콜라이로부터의 알데하이드 데하이드로게나제를 포함하여, 상기 미생물은 SEQ ID NO: 52로 제시된 아미노산 서열을 인코딩하는 SEQ ID NO: 51로 제시된 뉴클레오타이드 서열을 포함한다. 일부 구현예에서, 본 발명의 미생물은 이. 콜라이로부터의 NADP-의존적 숙시네이트-세미알데하이드 데하이드로게나제 I를 포함하여, 상기 미생물은 SEQ ID NO: 54로 제시된 아미노산 서열을 인코딩하는 SEQ ID NO: 53으로 제시된 뉴클레오타이드 서열을 포함한다. 일부 구현예에서, 본 발명의 미생물은 지. 옥시단스로부터의 락트알데하이드 데하이드로게나제/글리콜알데하이드 데하이드로게나제를 포함하여, 상기 미생물은 SEQ ID NO: 56으로 제시된 아미노산 서열을 인코딩하는 SEQ ID NO: 55로 제시된 뉴클레오타이드 서열을 포함한다. 일부 구현예에서, 본 발명의 미생물은 피. 플루오레센스로부터의 알데하이드 데하이드로게나제 A를 포함하여, 상기 미생물은 SEQ ID NO: 58 또는 SEQ ID NO: 60으로 제시된 아미노산 서열을 각각 인코딩하는 SEQ ID NO: 57 또는 SEQ ID NO: 59로 제시된 뉴클레오타이드 서열을 포함한다. 글리콜레이트를 글리코알데하이드로 전환하는 효소의 부가적인 비제한적인 예는 GenBank 기탁 번호 WP_003202098, WP_003182567, ACT39044, ACT39074, WP_041112005, 및 ACT40170에서 찾을 수 있다. 일부 구현예에서, 글리콜레이트를 글리코알데하이드로 전환하는 효소는 과발현된다.
본 발명의 미생물은 글리코알데하이드를 에틸렌 글리콜로 전환하는 효소를 포함할 수 있다(도 1의 반응 10). 이 효소는 락트알데하이드 리덕타제[1.1.1.77], 알코올 데하이드로게나제[1.1.1.1], 알코올 데하이드로게나제(NADP+)[1.1.1.2], 글리세롤 데하이드로게나제[1.1.1.72], 글리세롤-3-포스페이트 데하이드로게나제[1.1.1.8], 또는 알데하이드 리덕타제[1.1.1.21]일 수 있다. 일부 구현예에서, 글리코알데하이드를 에틸렌 글리콜로 전환하는 효소는 내인성 효소이다. 일부 구현예에서, 글리코알데하이드를 에틸렌 글리콜로 전환하는 내인성 효소는 과발현된다. 다른 구현예에서, 글리코알데하이드를 에틸렌 글리콜로 전환하는 효소는 이종성 효소이다. 일부 구현예에서, 본 발명의 미생물은 씨. 사카로페르부틸아세토니쿰(C. saccharoperbutylacetonicum)으로부터의 락트알데하이드 리덕타제를 포함하여, 상기 미생물은 SEQ ID NO: 62로 제시된 아미노산 서열을 인코딩하는 SEQ ID NO: 61로 제시된 뉴클레오타이드 서열을 포함한다. 일부 구현예에서, 본 발명의 미생물은 씨. 륭달리이로부터의 락트알데하이드 리덕타제를 포함하여, 상기 미생물은 SEQ ID NO: 64로 제시된 아미노산 서열을 인코딩하는 SEQ ID NO: 63으로 제시된 뉴클레오타이드 서열을 포함한다. 일부 구현예에서, 본 발명의 미생물은 이. 콜라이로부터의 락트알데하이드 리덕타제를 포함하여, 상기 미생물은 SEQ ID NO: 66으로 제시된 아미노산 서열을 인코딩하는 SEQ ID NO: 65로 제시된 뉴클레오타이드 서열을 포함한다. 일부 구현예에서, 본 발명의 미생물은 씨. 베이예린키이로부터의 락트알데하이드 리덕타제를 포함하여, 상기 미생물은 SEQ ID NO: 68로 제시된 아미노산 서열을 인코딩하는 SEQ ID NO: 67로 제시된 뉴클레오타이드 서열을 포함한다. 일부 구현예에서, 이종성 글리코알데하이드를 에틸렌 글리콜로 전환하는 효소는 과발현된다.
본 발명의 미생물은 피루베이트를 말레이트로 전환하는 효소를 포함할 수 있다(도 1의 반응 11). 이 효소는 말레이트 데하이드로게나제[1.1.1.37], 말레이트 데하이드로게나제(옥살로아세테이트-탈카르복실화)[1.1.1.38], 말레이트 데하이드로게나제(탈카르복실화)[1.1.1.39], 말레이트 데하이드로게나제(옥살로아세테이트-탈카르복실화)(NADP+)[1.1.1.40], 말레이트 데하이드로게나제(NADP+)[1.1.1.82], D-말레이트 데하이드로게나제(탈카르복실화)[1.1.1.83], 디메틸말레이트 데하이드로게나제[1.1.1.84], 3-이소프로필말레이트 데하이드로게나제[1.1.1.85], 말레이트 데하이드로게나제[NAD(P)+][1.1.1.299], 또는 말레이트 데하이드로게나제(퀴논)[1.1.5.4]일 수 있다. 일부 구현예에서, 피루베이트를 말레이트로 전환하는 효소는 내인성 효소이다. 다른 구현예에서, 피루베이트를 말레이트로 전환하는 효소는 이종성 효소이다. 예를 들어, 일부 구현예에서, 본 발명의 미생물은 씨. 아우토에타노게눔으로부터의 말레이트 데하이드로게나제를 포함하여, 상기 미생물은 SEQ ID NO: 24로 제시된 아미노산 서열을 인코딩하는 SEQ ID NO: 23으로 제시된 뉴클레오타이드 서열을 포함한다. 일부 구현예에서, 본 발명의 미생물은 씨. 아우토에타노게눔으로부터의 NAD-의존적 말산(malic) 효소를 포함하여, 상기 미생물은 SEQ ID NO: 26으로 제시된 아미노산 서열을 인코딩하는 SEQ ID NO: 25로 제시된 뉴클레오타이드 서열을 포함한다.
본 발명의 미생물은 말레이트를 글리옥실레이트로 전환하는 효소를 포함할 수 있다(도 1의 반응 12). 이 효소는 말레이트 신타제[2.3.3.9] 또는 이소시트레이트 리아제[4.1.3.1]일 수 있다. 일부 구현예에서, 말레이트를 글리옥실레이트로 전환하는 효소는 이종성 효소이다. 예를 들어, 일부 구현예에서, 본 발명의 미생물은 스포로사르치나 종으로부터의 말레이트 신타제 G를 포함하여, 상기 미생물은 SEQ ID NO: 28 또는 SEQ ID NO: 34로 제시된 아미노산 서열을 각각 인코딩하는 SEQ ID NO: 27 또는 SEQ ID NO: 33으로 제시된 뉴클레오타이드 서열을 포함한다. 일부 구현예에서, 본 발명의 미생물은 바실러스 종으로부터의 말레이트 신타제 G를 포함하여, 상기 미생물은 SEQ ID NO: 30 또는 SEQ ID NO: 36으로 제시된 아미노산 서열을 각각 인코딩하는 SEQ ID NO: 29 또는 SEQ ID NO: 35로 제시된 뉴클레오타이드 서열을 포함한다. 일부 구현예에서, 본 발명의 미생물은 에스. 코엘리콜로르(S. coelicolor)로부터의 말레이트 신타제를 포함하여, 상기 미생물은 SEQ ID NO: 32로 제시된 아미노산 서열을 인코딩하는 SEQ ID NO: 31로 제시된 뉴클레오타이드 서열을 포함한다. 일부 구현예에서, 본 발명의 미생물은 비. 인판티스(B. infantis)로부터의 말레이트 신타제 G를 포함하여, 상기 미생물은 SEQ ID NO: 38로 제시된 아미노산 서열을 인코딩하는 SEQ ID NO: 37로 제시된 뉴클레오타이드 서열을 포함한다. 일부 구현예에서, 본 발명의 미생물은 씨. 코클레아리움(C. cochlearium)으로부터의 말레이트 신타제를 포함하여, 상기 미생물은 SEQ ID NO: 40으로 제시된 아미노산 서열을 인코딩하는 SEQ ID NO: 39로 제시된 뉴클레오타이드 서열을 포함한다. 일부 구현예에서, 본 발명의 미생물은 비. 메가테리움(B. megaterium)으로부터의 말레이트 신타제 G를 포함하여, 상기 미생물은 SEQ ID NO: 42로 제시된 아미노산 서열을 인코딩하는 SEQ ID NO: 41로 제시된 뉴클레오타이드 서열을 포함한다. 일부 구현예에서, 본 발명의 미생물은 패니바실러스 종으로부터의 말레이트 신타제를 포함하여, 상기 미생물은 SEQ ID NO: 44로 제시된 아미노산 서열을 인코딩하는 SEQ ID NO: 43으로 제시된 뉴클레오타이드 서열을 포함한다. 일부 구현예에서, 본 발명의 미생물은 리시니바실러스 종으로부터의 말레이트 신타제를 포함하여, 상기 미생물은 SEQ ID NO: 46으로 제시된 아미노산 서열을 인코딩하는 SEQ ID NO: 45로 제시된 뉴클레오타이드 서열을 포함한다. 일부 구현예에서, 본 발명의 미생물은 비. 세레우스(B. cereus)로부터의 말레이트 신타제를 포함하여, 상기 미생물은 SEQ ID NO: 48로 제시된 아미노산 서열을 인코딩하는 SEQ ID NO: 47로 제시된 뉴클레오타이드 서열을 포함한다.
본 발명의 미생물은 피루베이트를 포스포에놀피루베이트로 전환하는 효소를 포함할 수 있다(도 1의 반응 13). 이 효소는 피루베이트 키나제[2.7.1.40], 피루베이트, 포스페이트 디키나제(dikinase)[2.7.9.1], 또는 피루베이트, 물(water) 디키나제[2.7.9.2]일 수 있다. 일부 구현예에서, 피루베이트를 포스포에놀피루베이트로 전환하는 효소는 내인성 효소이다.
본 발명의 미생물은 포스포에놀피루베이트를 2-포스포-D-글리세레이트로 전환하는 효소를 포함할 수 있다(도 1의 반응 14). 이 효소는 포스포피루베이트 하이드라타제(hydratase)[4.2.1.11]일 수 있다. 일부 구현예에서, 포스포에놀피루베이트를 2-포스포-D-글리세레이트로 전환하는 효소는 내인성 효소이다.
본 발명의 미생물은 2-포스포-D-글리세레이트를 3-포스포-D-글리세레이트로 전환하는 효소를 포함할 수 있다(도 1의 반응 15). 이 효소는 포스포글리세레이트 뮤타제(mutase)[5.4.2.11/12]일 수 있다. 일부 구현예에서, 2-포스포-D-글리세레이트를 3-포스포-D-글리세레이트로 전환하는 효소는 내인성 효소이다.
본 발명의 미생물은 3-포스포-D-글리세레이트를 3-포스포노옥시피루베이트로 전환하는 효소를 포함할 수 있다(도 1의 반응 16). 이 효소는 포스포글리세레이트 데하이드로게나제[1.1.1.95]일 수 있다. 일부 구현예에서, 3-포스포-D-글리세레이트를 3-포스포노옥시피루베이트로 전환하는 효소는 내인성 효소이다.
본 발명의 미생물은 3-포스포노옥시피루베이트를 3-포스포-L-세린으로 전환하는 효소를 포함할 수 있다(도 1의 반응 17). 이 효소는 포스포세린 트랜스아미나제[2.6.1.52]일 수 있다. 일부 구현예에서, 3-포스포노옥시피루베이트를 3-포스포-L-세린으로 전환하는 효소는 내인성 효소이다.
본 발명의 미생물은 3-포스포-L-세린을 세린으로 전환하는 효소를 포함할 수 있다(도 1의 반응 18). 이 효소는 포스포세린 포스파타제[3.1.3.3]일 수 있다. 일부 구현예에서, 3-포스포-L-세린을 세린으로 전환하는 효소는 내인성 효소이다.
본 발명의 미생물은 세린을 글리신으로 전환하는 효소를 포함할 수 있다(도 1의 반응 19). 이 효소는 글리신 하이드록시메틸트랜스퍼라제[2.1.2.1]일 수 있다. 일부 구현예에서, 세린을 글리신으로 전환하는 효소는 내인성 효소이다. 일부 구현예에서, 세린을 글리신으로 전환하는 효소는 과발현된다.
본 발명의 미생물은 글리신을 글리옥실레이트로 전환시키는 효소를 포함할 수 있다(도 1의 반응 20). 이 효소는 알라닌-글리옥실레이트 아미노트랜스퍼라제/트랜스아미나제[2.6.1.44], 세린-글리옥실레이트 아미노트랜스퍼라제/트랜스아미나제[2.6.1.45], 세린-피루베이트 아미노트랜스퍼라제/트랜스아미나제[2.6.1.51], 글리신-옥살로아세테이트 아미노트랜스퍼라제/트랜스아미나제[2.6.1.35], 글리신 트랜스아미나제[2.6.1.4], 글리신 데하이드로게나제[1.4.1.10], 알라닌 데하이드로게나제[1.4.1.1], 또는 글리신 데하이드로게나제[1.4.2.1]일 수 있다. 일부 구현예에서, 글리신을 글리옥실레이트로 전환시키는 효소는 내인성 효소이다. 다른 구현예에서, 글리신을 글리옥실레이트로 전환시키는 효소는 이종성 효소이다. 예를 들어, 일부 구현예에서, 본 발명의 미생물은 에이치. 메틸로보룸(H. methylovorum)으로부터의 세린-글리옥실레이트 아미노트랜스퍼라제를 포함하여, 상기 미생물은 SEQ ID NO: 14로 제시된 아미노산 서열을 인코딩하는 SEQ ID NO: 13으로 제시된 뉴클레오타이드 서열을 포함한다. 일부 구현예에서, 본 발명의 미생물은 에스. 티오타우리니로부터의 알라닌-글리옥실레이트 아미노트랜스퍼라제를 포함하여, 상기 미생물은 SEQ ID NO: 16으로 제시된 아미노산 서열을 인코딩하는 SEQ ID NO: 15로 제시된 뉴클레오타이드 서열을 포함한다. 일부 구현예에서, 본 발명의 미생물은 티. 테피다리우스(T. tepidarius)로부터의 알라닌-글리옥실레이트 아미노트랜스퍼라제를 포함하여, 상기 미생물은 SEQ ID NO: 18로 제시된 아미노산 서열을 인코딩하는 SEQ ID NO: 17로 제시된 뉴클레오타이드 서열을 포함한다. 일부 구현예에서, 본 발명의 미생물은 씨. 악시두리치로부터의 부류 V 아미노트랜스퍼라제를 포함하여, 상기 미생물은 SEQ ID NO: 20으로 제시된 아미노산 서열을 인코딩하는 SEQ ID NO: 19로 제시된 뉴클레오타이드 서열을 포함한다. 일부 구현예에서, 본 발명의 미생물은 티. 마르티마(T. maritima)로부터의 세린-피루베이트 아미노트랜스퍼라제를 포함하여, 상기 미생물은 SEQ ID NO: 22로 제시된 아미노산 서열을 인코딩하는 SEQ ID NO: 21로 제시된 뉴클레오타이드 서열을 포함한다. 일부 구현예에서, 글리신을 글리옥실레이트로 전환시키는 효소는 과발현된다.
본 발명의 미생물은 세린을 하이드록시피루베이트로 전환하는 효소를 포함할 수 있다(도 1의 반응 21). 이 효소는 세린-피루베이트 트랜스아미나제[2.6.1.51], 세린-글리옥실레이트 트랜스아미나제[2.6.1.45], 알라닌 데하이드로게나제[1.4.1.1], L-아미노산 데하이드로게나제[1.4.1.5], 세린 2-데하이드로게나제[1.4.1.7], 알라닌 트랜스아미나제[2.6.1.2], 글루타민-피루베이트 트랜스아미나제[2.6.1.15], D-아미노산 트랜스아미나제[2.6.1.21], 알라닌-글리옥실레이트 트랜스아미나제[2.6.1.44], 또는 세린-피루베이트 트랜스아미나제[2.6.1.51]일 수 있다. 일부 구현예에서, 세린을 하이드록시피루베이트로 전환하는 효소는 내인성 효소이다. 다른 구현예에서, 세린을 하이드록시피루베이트로 전환하는 효소는 이종성 효소이다. 세린을 하이드록시피루베이트로 전환할 수 있는 효소의 비제한적인 예는 GenBank 기탁 번호 WP_009989311 및 NP_511062.1에서 찾을 수 있다. 일부 구현예에서, 세린을 하이드록시피루베이트로 전환하는 효소는 과발현된다.
본 발명의 미생물은 하이드록시피루베이트를 글리코알데하이드로 전환하는 효소를 포함할 수 있다(도 1의 반응 22). 이 효소는 하이드록시피루베이트 데카르복실라제[4.1.1.40] 또는 피루베이트 데카르복실라제[4.1.1.1]일 수 있다. 이 효소는 또한, 임의의 다른 데카르복실라제[4.1.1.-]일 수 있다. 일부 구현예에서, 하이드록시피루베이트를 글리코알데하이드로 전환하는 효소는 이종성 효소이다. 하이드록시피루베이트를 글리코알데하이드로 전환할 수 있는 효소의 비제한적인 예는 GenBank 기탁 번호 CCG28866, SVF98953, PA0096, CAA54522, KRU13460 및 KLA26356에서 찾을 수 있다.
본 발명의 미생물은 D-글리세레이트를 하이드록시피루베이트로 전환하는 효소를 포함할 수 있다(도 1의 반응 23). 이 효소는 글리옥실레이트 리덕타제[EC 1.1.1.26], 글리세레이트 데하이드로게나제[EC 1.1.1.29], 또는 하이드록시피루베이트 리덕타제[EC 1.1.1.81]일 수 있다. 일부 구현예에서, D-글리세레이트를 하이드록시피루베이트로 전환하는 효소는 이종성 효소이다. D-글리세레이트를 하이드록시피루베이트로 전환할 수 있는 효소의 비제한적인 예는 GenBank 기탁 번호 SUK16841, RPK22618, KPA02240, AGW90762, CAC11987, Q9CA90 및 Q9UBQ7에서 찾을 수 있다.
본 발명의 미생물은 5,10-메틸렌테트라하이드로폴레이트를 글리신으로 전환시키는 효소의 복합체를 포함할 수 있다(도 1의 반응 24). 5,10-메틸렌테트라하이드로폴레이트는 우드-륭달 경로의 환원적 분지(reductive branch)에서 보조인자이고, 아세틸-CoA의 생성에서 스캐폴드로서 작용한다. 이 복합체는 글리신 데하이드로게나제[1.4.4.2], 디하이드로리포일 데하이드로게나제[1.8.1.4], 및 아미노메틸트랜스퍼라제(글리신 신타제)[2.1.2.10]를 포함하는 글리신 절단 시스템일 수 있다. 일부 구현예에서, 5,10-메틸렌테트라하이드로폴레이트를 글리신으로 전환하는 복합체의 효소는 내인성 효소이다. 일부 구현예에서, 글리신 절단 시스템의 효소는 과발현된다.
본 발명의 미생물은 포스포에놀피루베이트를 옥살로아세테이트로 전환하는 효소(도 1의 반응 25)를 포함할 수 있다. 이 효소는 포스포에놀피루베이트 카르복시키나제(ATP)[4.1.1.49] 또는 (GTP)[4.1.1.32]일 수 있다. 일부 구현예에서, 포스포에놀피루베이트를 옥살로아세테이트로 전환하는 효소는 내인성 효소이다. 다른 구현예에서, 포스포에놀피루베이트를 옥살로아세테이트로 전환하는 효소는 이종성 효소이다. 일부 구현예에서, 포스포에놀피루베이트를 옥살로아세테이트로 전환하는 효소는 과발현된다.
일부 구현예에서, 아세틸-CoA를 피루베이트로 전환하는 효소(도 1의 반응 1), 피루베이트를 옥살로아세테이트로 전환하는 효소(도 1의 반응 2), 옥살로아세테이트를 시트레이트로 전환하는 효소(도 1의 반응 5), 시트레이트를 아코니테이트로 그리고 아코니테이트를 이소-시트레이트로 전환하는 효소(도 1의 반응 6), 이소시트레이트를 글리옥실레이트로 전환하는 효소(도 1의 반응 7), 글리옥실레이트를 글리콜레이트로 전환하는 효소(도 1의 반응 8), 글리콜레이트를 글리코알데하이드로 전환하는 효소(도 1의 반응 9), 및 글리코알데하이드를 에틸렌 글리콜로 전환하는 효소(도 1의 반응 10)를 포함하는 미생물은 에틸렌 글리콜을 생성한다. 비제한적인 예에서, 옥살로아세테이트를 시트레이트로 전환하는 효소는 비. 서브틸리스로부터의 시트레이트 신타제(SEQ ID NO: 1-2)일 수 있다. 비제한적인 예에서, 이소-시트레이트를 글리옥실레이트로 전환하는 효소는 이. 콜라이로부터의 이소시트레이트 리아제(SEQ ID NO: 11-12)일 수 있다. 비제한적인 예에서, 글리콜레이트를 글리코알데하이드로 전환하는 효소는 지. 옥시단스로부터의 글리콜알데하이드 데하이드로게나제(SEQ ID NO: 55-56) 또는 피. 플루오레센스로부터의 알데하이드 데하이드로게나제(SEQ ID NO: 57-58)일 수 있다. 도 1에 제시된 바와 같이 반응 2, 5, 6, 8, 9 및 10을 촉매화하는 하나 이상의 효소는 과발현될 수 있다. 예를 들어, 실시예 1 및 도 3b를 참조한다.
일부 구현예에서, 아세틸-CoA를 피루베이트로 전환하는 효소(도 1의 반응 1), 피루베이트를 포스포에놀피루베이트로 전환하는 효소(도 1의 반응 13), 포스포에놀피루베이트를 2-포스포-D-글리세레이트로 전환하는 효소(도 1의 반응 14), 2-포스포-D-글리세레이트를 3-포스포-D-글리세레이트로 전환하는 효소(도 1의 반응 15), 3-포스포-D-글리세레이트를 3-포스포노옥시피루베이트로 전환하는 효소(도 1의 반응 16), 3-포스포노옥시피루베이트를 3-포스포-L-세린으로 전환하는 효소(도 1의 반응 17), 3-포스포-L-세린을 세린으로 전환하는 효소(도 1의 반응 18), 세린을 글리신으로 전환하는 효소(도 1의 반응 19), 글리신을 글리옥실레이트로 전환시키는 효소(도 1의 반응 20), 글리옥실레이트를 글리콜레이트로 전환하는 효소(도 1의 반응 8), 글리콜레이트를 글리코알데하이드로 전환하는 효소(도 1의 반응 9), 및 글리코알데하이드를 에틸렌 글리콜로 전환하는 효소(도 1의 반응 10)를 포함하는 미생물은 에틸렌 글리콜을 생성한다. 비제한적인 예에서, 글리신을 글리옥실레이트로 전환시키는 효소는 에스. 티오타우리니로부터의 알라닌-글리옥실레이트 아미노트랜스퍼라제(SEQ ID NO: 15-16) 또는 씨. 악시두리치로부터의 부류 V 아미노트랜스퍼라제(SEQ ID NO: 19-20)일 수 있다. 비제한적인 예에서, 글리콜레이트를 글리코알데하이드로 전환하는 효소는 지. 옥시단스로부터의 글리콜알데하이드 데하이드로게나제(SEQ ID NO: 55-56) 또는 피. 플루오레센스로부터의 알데하이드 데하이드로게나제(SEQ ID NO: 57-58)일 수 있다. 도 1에 제시된 바와 같이 반응 19, 20, 8, 9 및 10을 촉매화하는 하나 이상의 효소는 과발현될 수 있다. 예를 들어, 실시예 2 내지 4 및 도 4b, 5b 및 6b를 참조한다.
일부 구현예에서, 아세틸-CoA를 피루베이트로 전환하는 효소(도 1의 반응 1), 피루베이트를 옥살로아세테이트로 전환하는 효소(도 1의 반응 2), 옥살로아세테이트를 시트릴-CoA로 전환하는 효소(도 1의 반응 3), 시트릴-CoA를 시트레이트로 전환하는 효소(도 1의 반응 4), 시트레이트를 아코니테이트로 그리고 아코니테이트를 이소-시트레이트로 전환하는 효소(도 1의 반응 6), 이소시트레이트를 글리옥실레이트로 전환하는 효소(도 1의 반응 7), 글리옥실레이트를 글리콜레이트로 전환하는 효소(도 1의 반응 8), 글리콜레이트를 글리코알데하이드로 전환하는 효소(도 1의 반응 9), 및 글리코알데하이드를 에틸렌 글리콜로 전환하는 효소(도 1의 반응 10)를 포함하는 미생물은 에틸렌 글리콜을 생성한다. 비제한적인 예에서, 이소-시트레이트를 글리옥실레이트로 전환하는 효소는 이. 콜라이로부터의 이소시트레이트 리아제(SEQ ID NO: 11-12)일 수 있다. 비제한적인 예에서, 이소-시트레이트를 글리옥실레이트로 전환하는 효소는 이. 콜라이로부터의 이소시트레이트 리아제(SEQ ID NO: 11-12)일 수 있다. 비제한적인 예에서, 글리콜레이트를 글리코알데하이드로 전환하는 효소는 지. 옥시단스로부터의 글리콜알데하이드 데하이드로게나제(SEQ ID NO: 55-56) 또는 피. 플루오레센스로부터의 알데하이드 데하이드로게나제(SEQ ID NO: 57-58)일 수 있다. 도 1에 제시된 바와 같이 반응 2, 6, 8, 9 및 10을 촉매화하는 하나 이상의 효소는 과발현될 수 있다.
일부 구현예에서, 아세틸-CoA를 피루베이트로 전환하는 효소(도 1의 반응 1), 피루베이트를 말레이트로 전환하는 효소(도 1의 반응 11), 말레이트를 글리옥실레이트로 전환하는 효소(도 1의 반응 12), 글리옥실레이트를 글리콜레이트로 전환하는 효소(도 1의 반응 8), 글리콜레이트를 글리코알데하이드로 전환하는 효소(도 1의 반응 9), 및 글리코알데하이드를 에틸렌 글리콜로 전환하는 효소(도 1의 반응 10)를 포함하는 미생물은 에틸렌 글리콜을 생성한다. 비제한적인 예에서, 글리콜레이트를 글리코알데하이드로 전환하는 효소는 지. 옥시단스로부터의 글리콜알데하이드 데하이드로게나제(SEQ ID NO: 55-56) 또는 피. 플루오레센스로부터의 알데하이드 데하이드로게나제(SEQ ID NO: 57-58)일 수 있다. 도 1에 제시된 바와 같이 반응 8, 9 및 10을 촉매화하는 하나 이상의 효소는 과발현될 수 있다.
일부 구현예에서, 5,10-메틸렌테트라하이드로폴레이트를 글리신으로 전환시키는 효소(도 1의 반응 24), 글리신을 글리옥실레이트로 전환시키는 효소(도 1의 반응 20), 글리옥실레이트를 글리콜레이트로 전환하는 효소(도 1의 반응 8), 글리콜레이트를 글리코알데하이드로 전환하는 효소(도 1의 반응 9), 및 글리코알데하이드를 에틸렌 글리콜로 전환하는 효소(도 1의 반응 10)의 복합체를 포함하는 미생물은 에틸렌 글리콜을 생성한다. 비제한적인 예에서, 글리신을 글리옥실레이트로 전환시키는 효소는 에스. 티오타우리니로부터의 알라닌-글리옥실레이트 아미노트랜스퍼라제(SEQ ID NO: 15-16) 또는 씨. 악시두리치로부터의 부류 V 아미노트랜스퍼라제(SEQ ID NO: 19-20)일 수 있다. 비제한적인 예에서, 글리콜레이트를 글리코알데하이드로 전환하는 효소는 지. 옥시단스로부터의 글리콜알데하이드 데하이드로게나제(SEQ ID NO: 55-56) 또는 피. 플루오레센스로부터의 알데하이드 데하이드로게나제(SEQ ID NO: 57-58)일 수 있다. 도 1에 제시된 바와 같이 반응 8, 9, 10, 20 및 24를 촉매화하는 하나 이상의 효소는 과발현될 수 있다.
일부 구현예에서, 아세틸-CoA를 피루베이트로 전환하는 효소(도 1의 반응 1), 피루베이트를 포스포에놀피루베이트로 전환하는 효소(도 1의 반응 13), 포스포에놀피루베이트를 옥살로아세테이트로 전환하는 효소(도 1의 반응 25), 옥살로아세테이트를 시트릴-CoA로 전환하는 효소(도 1의 반응 3), 시트릴-CoA를 시트레이트로 전환하는 효소(도 1의 반응 4), 시트레이트를 아코니테이트로 그리고 아코니테이트를 이소-시트레이트로 전환하는 효소(도 1의 반응 6), 이소시트레이트를 글리옥실레이트로 전환하는 효소(도 1의 반응 7), 글리옥실레이트를 글리콜레이트로 전환하는 효소(도 1의 반응 8), 글리콜레이트를 글리코알데하이드로 전환하는 효소(도 1의 반응 9), 및 글리코알데하이드를 에틸렌 글리콜로 전환하는 효소(도 1의 반응 10)를 포함하는 미생물은 에틸렌 글리콜을 생성한다. 비제한적인 예에서, 이소-시트레이트를 글리옥실레이트로 전환하는 효소는 이. 콜라이로부터의 이소시트레이트 리아제(SEQ ID NO: 11-12)일 수 있다. 비제한적인 예에서, 글리콜레이트를 글리코알데하이드로 전환하는 효소는 지. 옥시단스로부터의 글리콜알데하이드 데하이드로게나제(SEQ ID NO: 55-56) 또는 피. 플루오레센스로부터의 알데하이드 데하이드로게나제(SEQ ID NO: 57-58)일 수 있다. 도 1에 제시된 바와 같이 반응 2, 6, 8, 9, 10 및 25를 촉매화하는 하나 이상의 효소는 과발현될 수 있다.
일부 구현예에서, 아세틸-CoA를 피루베이트로 전환하는 효소(도 1의 반응 1), 피루베이트를 포스포에놀피루베이트로 전환하는 효소(도 1의 반응 13), 포스포에놀피루베이트를 옥살로아세테이트로 전환하는 효소(도 1의 반응 25), 옥살로아세테이트를 시트레이트로 전환하는 효소(도 1의 반응 5), 시트레이트를 아코니테이트로 그리고 아코니테이트를 이소-시트레이트로 전환하는 효소(도 1의 반응 6), 이소시트레이트를 글리옥실레이트로 전환하는 효소(도 1의 반응 7), 글리옥실레이트를 글리콜레이트로 전환하는 효소(도 1의 반응 8), 글리콜레이트를 글리코알데하이드로 전환하는 효소(도 1의 반응 9), 및 글리코알데하이드를 에틸렌 글리콜로 전환하는 효소(도 1의 반응 10)를 포함하는 미생물은 에틸렌 글리콜을 생성한다. 비제한적인 예에서, 옥살로아세테이트를 시트레이트로 전환하는 효소는 비. 서브틸리스로부터의 시트레이트 신타제(SEQ ID NO: 1-2)일 수 있다. 비제한적인 예에서, 이소-시트레이트를 글리옥실레이트로 전환하는 효소는 이. 콜라이로부터의 이소시트레이트 리아제(SEQ ID NO: 11-12)일 수 있다. 비제한적인 예에서, 글리콜레이트를 글리코알데하이드로 전환하는 효소는 지. 옥시단스로부터의 글리콜알데하이드 데하이드로게나제(SEQ ID NO: 55-56) 또는 피. 플루오레센스로부터의 알데하이드 데하이드로게나제(SEQ ID NO: 57-58)일 수 있다. 도 1에 제시된 바와 같이 반응 5, 6, 8, 9, 10 및 25를 촉매화하는 하나 이상의 효소는 과발현될 수 있다.
일부 구현예에서, 아세틸-CoA를 피루베이트로 전환하는 효소(도 1의 반응 1), 피루베이트를 포스포에놀피루베이트로 전환하는 효소(도 1의 반응 13), 포스포에놀피루베이트를 2-포스포-D-글리세레이트로 전환하는 효소(도 1의 반응 14), 2-포스포-D-글리세레이트를 3-포스포-D-글리세레이트로 전환하는 효소(도 1의 반응 15), 3-포스포-D-글리세레이트를 3-포스포노옥시피루베이트로 전환하는 효소(도 1의 반응 16), 3-포스포노옥시피루베이트를 3-포스포-L-세린으로 전환하는 효소(도 1의 반응 17), 3-포스포-L-세린을 세린으로 전환하는 효소(도 1의 반응 18), 세린을 하이드록시피루베이트로 전환하는 효소(도 1의 반응 21), 하이드록시피루베이트를 글리코알데하이드로 전환하는 효소(도 1의 반응 22), 및 글리코알데하이드를 에틸렌 글리콜로 전환하는 효소(도 1의 반응 10)를 포함하는 미생물은 에틸렌 글리콜을 생성한다. 글리코알데하이드로부터 에틸렌 글리콜로의 전환을 촉매화하는 효소는 과발현될 수 있다.
일부 구현예에서, D-글리세레이트를 하이드록시피루베이트로 전환하는 효소(도 1의 반응 23), 하이드록시피루베이트를 글리코알데하이드로 전환하는 효소(도 1의 반응 22), 및 글리코알데하이드를 에틸렌 글리콜로 전환하는 효소(도 1의 반응 10)를 포함하는 미생물은 에틸렌 글리콜을 생성한다. 글리코알데하이드로부터 에틸렌 글리콜로의 전환을 촉매화하는 효소는 과발현될 수 있다.
본 발명의 효소는 본 발명의 미생물에서 발현을 위해 코돈 최적화될 수 있다. "코돈 최적화"는 특정 균주 또는 종에서 핵산의 최적화된 또는 향상된 번역을 위한 핵산, 예컨대 유전자의 돌연변이를 지칭한다. 코돈 최적화는 더 신속한 번역 속도(rate) 또는 더 높은 번역 정확도를 초래할 수 있다. 바람직한 구현예에서, 본 발명의 유전자는 본 발명의 미생물에서의 발현을 위해 코돈 최적화된다. 코돈 최적화가 기저(underlying) 유전자 서열을 지칭하긴 하지만, 코돈 최적화는 종종 향상된 번역, 따라서 향상된 효소 발현을 초래한다. 이에, 본 발명의 효소 또한, 코돈 최적화되는 것으로 기재될 수 있다.
본 발명의 하나 이상의 효소는 과발현될 수 있다. "과발현된"은, 본 발명의 미생물이 유래되는 야생형 또는 부모 미생물과 비교하여 본 발명의 미생물에서 핵산 또는 단백질의 발현의 증가를 지칭한다. 과발현은 유전자 복사 수, 유전자 전사 속도, 유전자 번역 속도 또는 효소 분해 속도를 변형시키는 것을 포함한 당업계에 알려진 임의의 수단에 의해 달성될 수 있다. 상기 기재된 바와 같이, 도 1의 반응 2, 5, 6, 8, 9, 10, 19, 20, 24 또는 25를 촉매화하는 효소 중 하나 이상이 과발현될 수 있다.
본 발명의 효소는 교란적 돌연변이를 포함할 수 있다. "교란적 돌연변이"는, 유전자 또는 효소의 발현 또는 활성을 감소시키거나 없애는(즉, "교란시키는") 돌연변이를 지칭한다. 교란적 돌연변이는 유전자 또는 효소를 부분적으로 불활성화시키거나, 완전히 불활성화시키거나 결실시킬 수 있다. 교란적 돌연변이는 녹아웃(KO) 돌연변이일 수 있다. 교란적 돌연변이는 또한, 효소에 의해 생성되는 생성물의 생합성을 감소시키거나, 예방하거나 차단하는 임의의 돌연변이일 수 있다. 교란적 돌연변이는 예를 들어, 효소를 인코딩하는 유전자에서의 돌연변이, 효소를 인코딩하는 유전자의 발현에 관여하는 유전적 조절 요소에서의 돌연변이, 효소의 활성을 감소시키거나 저해하는 단백질을 생성하는 핵산의 도입, 또는 효소의 발현을 저해하는 핵산(예를 들어, 안티센스 RNA, siRNA, CRISPR) 또는 단백질의 도입을 포함할 수 있다. 교란적 돌연변이는 당업계에 알려진 임의의 방법을 사용하여 도입될 수 있다.
일부 구현예에서, 본 발명의 미생물은 이소시트레이트 데하이드로게나제[1.1.1.41]에서 교란적 돌연변이를 포함한다. 이소시트레이트 데하이드로게나제는 이소-시트레이트를 2-옥소글루타레이트로 전환한다. 예컨대 이소시트레이트 데하이드로게나제의 결실에 의한 상기 이소시트레이트 데하이드로게나제의 교란은 이소-시트레이트의 수준을 증가시킨다.
일부 구현예에서, 본 발명의 미생물은 글리세레이트 데하이드로게나제[1.1.1.29]에서 교란적 돌연변이를 포함한다. 글리세레이트 데하이드로게나제는 글리옥실레이트를 글리콜레이트로 전환한다. 예컨대 이소시트레이트 데하이드로게나제의 결실에 의한 글리세레이트 데하이드로게나제의 교란은 글리옥실레이트의 수준을 증가시킨다.
일부 구현예에서, 본 발명의 미생물은 글리콜레이트 데하이드로게나제[1.1.99.14]에서 교란적 돌연변이를 포함한다. 글리콜레이트 데하이드로게나제는 글리옥실레이트를 글리콜레이트로 전환한다. 예컨대 글리콜레이트 데하이드로게나제의 결실에 의한 글리콜레이트 데하이드로게나제의 교란은 글리옥실레이트의 수준을 증가시킨다.
일부 구현예에서, 본 발명의 미생물은 알데하이드 페레독신 옥시도리덕타제[1.2.7.5]에서 교란적 돌연변이를 포함한다. 알데하이드 페레독신 옥시도리덕타제는 글리콜레이트를 글리코알데하이드로 전환한다. 예컨대 알데하이드 페레독신 옥시도리덕타제의 결실에 의한 알데하이드 페레독신 옥시도리덕타제의 교란은 글리콜레이트의 수준을 증가시킨다.
일부 구현예에서, 본 발명의 미생물은 알데하이드 데하이드로게나제[1.2.1.3/1.2.3.4/1.2.3.5]에서 교란적 돌연변이를 포함한다. 알데하이드 데하이드로게나제는 글리콜레이트를 글리코알데하이드로 전환한다.. 예컨대 알데하이드 데하이드로게나제의 결실에 의한 알데하이드 데하이드로게나제의 교란은 글리콜레이트의 수준을 증가시킨다.
교란적 돌연변이의 도입은, 표적 생성물을 생성하지 않거나 표적 생성물을 실질적으로 생성하지 않거나, 본 발명의 미생물이 유래되는 부모 미생물과 비교하여 표적 생성물을 감소된 양으로 생성하는 본 발명의 미생물을 초래한다. 예를 들어, 본 발명의 미생물은 표적 생성물을 생성하지 않거나, 부모 미생물보다 적어도 약 1%, 3%, 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90% 또는 95% 적은 표적 생성물을 생성할 수 있다. 예를 들어, 본 발명의 미생물은 약 0.001, 0.01, 0.10, 0.30, 0.50 또는 1.0 g/L 미만의 표적 생성물을 생성할 수 있다.
효소에 대한 예시적인 서열 및 공급원이 본원에 제공되긴 하지만, 본 발명은 이들 서열 및 공급원으로 제한되는 것이 아니며 - 본 발명은 변이체를 또한 포괄한다. 용어 "변이체"는, 핵산 및 단백질의 서열이 기준 핵산 및 단백질의 서열, 예컨대 선행 기술에 개시되거나 본원에 예시된 기준 핵산 및 단백질의 서열로부터 달라진 핵산 및 단백질을 포함한다. 본 발명은 기준 핵산 또는 단백질과 실질적으로 동일한 기능을 수행하는 변이체 핵산 또는 단백질을 사용하여 실시될 수 있다. 예를 들어, 변이체 단백질은 기준 단백질과 실질적으로 동일한 기능을 수행하거나 실질적으로 동일한 반응을 촉매화할 수 있다. 변이체 유전자는 기준 유전자와 동일한 단백질 또는 실질적으로 동일한 단백질을 인코딩할 수 있다. 변이체 프로모터는 하나 이상의 유전자의 발현을 촉진하는 기준 프로모터와 실질적으로 동일한 능력을 가질 수 있다.
이러한 핵산 또는 단백질은 본원에서 "기능적으로 등가인 변이체"로서 지칭될 수 있다. 예로서, 핵산의 기능적으로 등가인 변이체는 대립유전자(allelic) 변이체, 유전자의 단편, 돌연변이화된 유전자, 다형성(polymorphism) 등을 포함할 수 있다. 다른 미생물로부터의 상동성 유전자 또한, 기능적으로 등가인 변이체의 예이다. 이들은 종, 예컨대 클로스트리디움 아세토부틸리쿰, 클로스트리디움 베이예린키이 또는 클로스트리디움 륭달리이 내의 상동성 유전자를 포함하고, 이들의 세부사항은 웹사이트, 예컨대 Genbank 또는 NCBI 상에서 공개적으로 입수 가능하다. 기능적으로 등가인 변이체는 또한, 핵산의 서열이 특정 미생물에 대한 코돈 최적화의 결과 달라지는 핵산을 포함한다. 핵산의 기능적으로 등가인 변이체는 바람직하게는, 기준 핵산과 적어도 대략 70%, 대략 80%, 대략 85%, 대략 90%, 대략 95%, 대략 98% 또는 그 이상의 핵산 서열 동일성(상동성 퍼센트)을 가질 것이다. 단백질의 기능적으로 등가인 변이체는 바람직하게는, 기준 단백질과 적어도 대략 70%, 대략 80%, 대략 85%, 대략 90%, 대략 95%, 대략 98% 또는 그 이상의 아미노산 동일성(상동성 퍼센트)을 가질 것이다. 변이체 핵산 또는 단백질의 기능적 등가성(equivalence)은 당업계에 알려진 임의의 방법을 사용하여 평가될 수 있다.
핵산은 당업계에 알려진 임의의 방법을 사용하여 본 발명의 미생물에 전달될 수 있다. 예를 들어, 핵산은 노출(naked) 핵산으로서 전달될 수 있거나, 하나 이상의 작용제, 예컨대 리포솜으로 제제화될 수 있다. 핵산은 적절하다면, DNA, RNA, cDNA 또는 이들의 조합일 수 있다. 제한 저해제는 소정의 구현예에서 사용될 수 있다. 추가의 벡터는 플라스미드, 바이러스, 박테리오파지, 코스미드 및 인공 염색체를 포함할 수 있다. 바람직한 구현예에서, 핵산은 플라스미드를 사용하여 본 발명의 미생물에 전달될 수 있다. 예로서, 형질전환(형질도입 또는 형질감염을 포함함)은 전기천공, 초음파처리, 폴리에틸렌 글리콜-매개 형질전환, 화학적 또는 천연 적격(competence), 원생동물 형질전환, 프로파지(prophage) 유도 또는 컨쥬게이션에 의해 달성될 수 있다. 활성 제한 효소 시스템을 갖는 소정의 구현예에서, 핵산을 미생물 내로 도입하기 전에 핵산을 메틸화하는 것이 필요할 수 있다.
더욱이, 핵산은 특정 핵산의 발현을 증가시키거나 그렇지 않으면 제어하기 위해 조절 요소, 예컨대 프로모터를 포함하도록 설계될 수 있다. 프로모터는 구성적(constitutive) 프로모터 또는 유도적 프로모터일 수 있다. 이상적으로, 프로모터는 우드-륭달 경로 프로모터, 페레독신 프로모터, 피루베이트 페레독신 옥시도리덕타제 프로모터, Rnf 복합체 오페론 프로모터, ATP 신타제 오페론 프로모터, 또는 포스포트랜스아세틸라제/아세테이트 키나제 오페론 프로모터이다.
"기질"은 본 발명의 미생물에 대한 탄소 및/또는 에너지 공급원을 지칭한다. 종종, 기질은 가스성이고, C1-탄소 공급원, 예를 들어, CO, CO2 및/또는 CH4를 포함한다. 바람직하게는, 기질은 CO 또는 CO + CO2의 C1-탄소 공급원을 포함한다. 기질은 다른 비-탄소 구성성분, 예컨대 H2, N2 또는 전자를 추가로 포함할 수 있다. 그러나, 다른 구현예에서, 기질은 탄수화물, 예컨대 당(sugar), 전분, 섬유, 리그닌, 셀룰로스 또는 헤미셀룰로스 또는 이들의 조합일 수 있다. 예를 들어, 탄수화물은 프룩토스, 갈락토스, 글루코스, 락토스, 말토스, 수크로스, 자일로스 또는 이들의 일부 조합일 수 있다. 일부 구현예에서, 기질은 (D)-자일로스를 포함하지 않는다(문헌[Alkim, Microb Cell Fact, 14: 127, 2015]). 일부 구현예에서, 기질은 펜토스, 예컨대 자일로스를 포함하지 않는다(문헌[Pereira, Metab Eng, 34: 80-87, 2016]). 일부 구현예에서, 기질은 가스성 기질과 탄수화물 기질 둘 모두를 포함할 수 있다(혼합영양(mixotrophic) 발효).
가스성 기질은 일반적으로, 적어도 일부 양의 CO, 예컨대 약 1, 2, 5, 10, 20, 30, 40, 50, 60, 70, 80, 90 또는 100 몰% CO를 포함한다. 가스성 기질은 CO 범위, 예컨대 약 20 내지 80 몰%, 30 내지 70 몰% 또는 40 내지 60 몰% CO를 포함할 수 있다. 바람직하게는, 가스성 기질은 약 40 내지 70 몰% CO(예를 들어, 제철소 또는 용광로 가스), 약 20 내지 30 몰% CO(예를 들어, 기본적인(basic) 산소로(oxygen furnace) 가스), 또는 약 15 내지 45 몰% CO(예를 들어, 합성가스)를 포함한다. 일부 구현예에서, 가스성 기질은 상대적으로 소량의 CO, 예컨대 약 1 내지 10 몰% 또는 1 내지 20 몰% CO를 포함할 수 있다. 본 발명의 미생물은 전형적으로, 가스성 기질 내 적어도 일부의 CO를 생성물로 전환시킨다. 일부 구현예에서, 가스성 기질은 CO를 포함하지 않거나 CO를 실질적으로(1 몰% 미만) 포함하지 않는다.
가스성 기질은 일부 양의 H2를 포함할 수 있다. 예를 들어, 가스성 기질은 약 1, 2, 5, 10, 15, 20 또는 30 몰% H2를 포함할 수 있다. 일부 구현예에서, 가스성 기질은 상대적으로 다량의 H2, 예컨대 약 60, 70, 80 또는 90 몰% H2를 포함할 수 있다. 추가의 구현예에서, 가스성 기질은 H2를 포함하지 않거나 H2를 실질적으로(1 몰% 미만) 포함하지 않는다.
가스성 기질은 일부 양의 CO2를 포함할 수 있다. 예를 들어, 가스성 기질은 약 1 내지 80 몰% 또는 1 내지 30 몰% CO2를 포함할 수 있다. 일부 구현예에서, 가스성 기질은 약 20, 15, 10 또는 5 몰% 미만의 CO2를 포함할 수 있다. 또 다른 구현예에서, 가스성 기질은 CO2를 포함하지 않거나 CO2를 실질적으로(1 몰% 미만) 포함하지 않는다.
가스성 기질은 또한, 대안적인 형태로 제공될 수 있다. 예를 들어, 가스성 기질은 액체 내에 용해되거나 고체 지지체 상으로 흡착될 수 있다.
가스성 기질 및/또는 C1-탄소 공급원은 산업적 과정의 부산물로서 또는 일부 다른 공급원으로부터, 예컨대 자동차 매연 또는 바이오매스 가스화로부터 수득되는 폐가스 또는 오프가스일 수 있다. 소정의 구현예에서, 산업적 과정은 페러스(ferrous) 금속 생성물 제조, 예컨대 제철소 제조, 비-페러스 생성물 제조, 석유 정제, 석탄 가스화, 전력 생산, 카본 블랙 생산, 암모니아 생산, 메탄올 생산 및 코크스 제조로 이루어진 군으로부터 선택된다. 이들 구현예에서, 가스성 기질 및/또는 C1-탄소 공급원은, 이것이 대기 중으로 배출되기 전에 임의의 편리한 방법을 사용하여 산업적 과정으로부터 포착될 수 있다.
가스성 기질 및/또는 C1-탄소 공급원은 합성가스, 예컨대 석탄 또는 정제 잔류물(residue)의 가스화, 바이오매스 또는 리그노셀룰로스성 물질의 가스화, 또는 천연 가스의 개질(reforming)에 의해 수득되는 합성가스일 수 있다. 또 다른 구현예에서, 합성가스는 도시(municipal) 고체 폐기물 또는 산업용 고체 폐기물의 가스화로부터 수득될 수 있다.
가스성 기질의 조성물은 반응의 효율 및/또는 비용에 유의한 영향을 가질 수 있다. 예를 들어, 산소(O2)의 존재는 혐기성 발효 과정의 효율을 감소시킬 수 있다. 기질의 조성물에 따라, 기질을 처리하거나, 스크럽하거나 여과하여, 요망되지 않는 불순물, 예컨대 독소, 요망되지 않는 구성성분, 또는 먼지 입자를 제거하고/하거나 바람직한 구성성분의 농도를 증가시키는 것이 바람직할 수 있다.
소정의 구현예에서, 발효는 탄수화물 기질, 예컨대 당, 전분, 섬유질, 리그닌, 셀룰로스 또는 헤미셀룰로스의 부재 하에 수행된다.
일부 구현예에서, CO 및 H2로부터 에틸렌 글리콜(MEG)로의 전반적인 에너지론은 하기 제시된 바와 같이 글루코스로부터 에틸렌 글리콜로의 것들에 비해 바람직하며, 여기서, CO 및 H2에 대한 더 음성의 깁스 자유 에너지, ΔrG'm 값은 에틸렌 글리콜로의 더 큰 구동력을 나타낸다.기질로서 글루코스 대 CO의 비교를 위한 전반적인 반응 델타 G의 계산은 평형 장치(equilibrator)(http://equilibrator.weizmann.ac.il/)를 사용하여 수행되었으며, 이는 생물학적 시스템에서 경로의 전반적인 실행 가능성 또는 경로에서의 개별적인 단계를 평가하는 표준 방법이다(문헌[Flamholz, E. Noor, A. Bar-Even, R. Milo (2012) eQuilibrator - the biochemical thermodynamics calculator Nucleic Acids Res 40:D770-5; Noor, A. Bar-Even, A. Flamholz, Y. Lubling, D. Davidi, R. Milo (2012) An integrated open framework for thermodynamics of reactions that combines accuracy and coverageBioinformatics 28:2037-2044; Noor, H.S. Haraldsdottir, R. Milo, R.M.T. Fleming (2013) Consistent Estimation of Gibbs Energy Using Component Contributions PLoS Comput Biol 9(7): e1003098; Noor, A. Bar-Even, A. Flamholz, E. Reznik, W. Liebermeister, R. Milo (2014) Pathway Thermodynamics Highlights Kinetic Obstacles in Central Metabolism PLoS Comput Biol 10(2):e1003483]). 계산은 하기와 같다:
생리학적 조건:
에틸렌 글리콜, 글리옥실레이트, 및/또는 글리콜레이트 외에도, 본 발명의 미생물은 하나 이상의 공동-생성물 생성물을 생성하도록 배양될 수 있다. 예를 들어, 본 발명의 미생물은 에탄올(WO 2007/117157), 아세테이트(WO 2007/117157), 부탄올(WO 2008/115080 및 WO 2012/053905), 부티레이트(WO 2008/115080), 2,3-부탄디올(WO 2009/151342 및 WO 2016/094334), 락테이트(WO 2011/112103), 부텐(WO 2012/024522), 부타디엔(WO 2012/024522), 메틸 에틸 케톤(2-부타논)(WO 2012/024522 및 WO 2013/185123), 에틸렌(WO 2012/026833), 아세톤(WO 2012/115527), 이소프로판올(WO 2012/115527), 지질(WO 2013/036147), 3-하이드록시프로피오네이트(3-HP)(WO 2013/180581), 이소프렌(WO 2013/180584), 지방산(WO 2013/191567), 2-부탄올(WO 2013/185123), 1,2-프로판디올(WO 2014/036152), 1-프로판올(WO 2014/0369152), 코리스메이트-유래 생성물(WO 2016/191625), 3-하이드록시부티레이트(WO 2017/066498) 및 1,3-부탄디올(WO 2017/0066498)을 생성할 수 있거나 이들을 생성하도록 조작될 수 있다. 일부 구현예에서, 에틸렌 글리콜 외에도, 본 발명의 미생물은 또한, 에탄올, 2,3-부탄디올 및/또는 숙시네이트를 생성한다. 소정의 구현예에서, 미생물 바이오매스 자체가 생성물인 것으로 여겨질 수 있다.
"본연의 생성물"은 유전적으로 비변형된 미생물에 의해 생성되는 생성물이다. 예를 들어, 에탄올, 아세테이트 및 2,3-부탄디올은 클로스트리디움 아우토에타노게눔, 클로스트리디움 륭달리이, 및 클로스트리디움 라그스달레이의 본연의 생성물이다. "비-본연의 생성물"은 유전적으로 변형된 미생물에 의해 생성되지만, 유전적으로 변형된 미생물이 유래되는 유전적으로 비변형된 미생물에 의해서는 생성되지 않는 생성물이다. 에틸렌 글리콜은 임의의 천연-발생 미생물에 의해 생성되는 것으로 알려져 있지 않아서, 이는 모든 미생물의 비(non)-본연의 생성물이다.
"선택도"는 미생물에 의해 생성되는 모든 발효 생성물들의 생성에 대한 표적 생성물의 생성의 비를 지칭한다. 본 발명의 미생물은 소정의 선택도에서 또는 최소 선택도에서 생성물을 생성하도록 조작될 수 있다. 일 구현예에서, 표적 생성물, 예컨대 에틸렌 글리콜은 본 발명의 미생물에 의해 생성되는 모든 발효 생성물의 적어도 약 5%, 10%, 15%, 20%, 30%, 50% 또는 75%를 차지한다. 일 구현예에서, 에틸렌 글리콜은 본 발명의 미생물에 의해 생성되는 모든 발효 생성물들의 적어도 약 10%를 차지하여, 본 발명의 미생물은 적어도 10%의, 에틸렌 글리콜에 대한 선택도를 갖는다. 또 다른 구현예에서, 에틸렌 글리콜은 본 발명의 미생물에 의해 생성되는 모든 발효 생성물들의 적어도 약 30%를 차지하여, 본 발명의 미생물은 적어도 30%의, 에틸렌 글리콜에 대한 선택도를 갖는다.
전형적으로, 배양은 생물반응기에서 수행된다. 용어 "생물반응기"는 하나 이상의 용기, 탑(tower), 또는 배관(piping) 배열, 예컨대 연속 교반 탱크 반응기(CSTR), 고정형 세포 반응기(ICR), 트리클 베드 반응기(TBR; trickle bed reactor), 기포 컬럼, 가스 리프트 발효기, 정적 혼합기, 또는 가스-액체 접촉에 적합한 다른 용기 또는 다른 장치로 이루어진 배양/발효 장치를 포함한다. 일부 구현예에서, 생물반응기는 제1 성장 반응기 및 제2 배양/발효 반응기를 포함할 수 있다. 기질은 이들 반응기 중 하나 또는 둘 모두에 제공될 수 있다. 본원에 사용된 바와 같이, 용어 "배양" 및 "발효"는 상호교환적으로 사용된다. 이들 용어는 배양/발효 과정의 성장기(growth phase)와 생성물 생합성기(biosynthesis phase) 둘 모두를 포괄한다.
배양물은 일반적으로, 미생물의 성장을 가능하게 하기에 충분한 영양소, 비타민 및/또는 미네랄을 함유하는 수성 배양 배지에서 유지된다. 바람직하게는 수성 배양 배지는 혐기성 미생물 성장 배지, 예컨대 최소 혐기성 미생물 성장 배지이다. 적합한 배지는 당업계에 잘 알려져 있다.
배양/발효는 바람직하게는, 에틸렌 글리콜의 생성을 위한 적절한 조건 하에 수행되어야 한다. 필요한 경우, 배양/발효는 혐기성 조건 하에 수행된다. 고려되는 반응 조건은, 압력(또는 분압), 온도, 가스 유동 속도, 액체 유동 속도, 배지 pH, 배지 산화환원 전위, 교반 속도(연속 교반 탱크 반응기를 사용하는 경우), 접종물 수준, 액체상 내 가스가 제한적으로 되지 않도록 보장하기 위한 최대 가스 기질 농도, 및 생성물 저해를 피하기 위한 최대 생성물 농도를 포함한다. 특히, 기질의 도입 속도는 액체상 내 가스의 농도가 제한적으로 되지 않도록 보장하기 위해 제어될 수 있다.
승압에서 생물반응기를 작동시키는 것은 가스상으로부터 액체상으로의 가스 매스(mass) 전달의 증가된 속도를 가능하게 한다. 이에, 대기압보다 높은 압력에서 배양/발효를 수행하는 것이 일반적으로 바람직할 수 있다. 또한, 주어진 가스 전환 속도가 부분적으로, 기질 체류 시간, 및 생물반응기의 필요한 부피를 나타내는 체류 시간의 함수이기 때문에, 가압된 시스템의 사용은 필요한 생물반응기의 부피, 결과적으로 배양/발효 장비의 자본 비용을 크게 감소시킬 수 있다. 이는 다시 말해, 생물반응기가 대기압보다 승압에서 유지되는 경우, 투입 가스 유동 속도로 나눈 생물반응기 내 액체 부피로서 정의되는 체류 시간이 감소될 수 있음을 의미한다. 최적 반응 조건은 부분적으로는, 사용되는 특정 미생물에 따라 다를 것이다. 그러나, 일반적으로, 발효를 대기압보다 높은 압력에서 조작하는 것이 바람직하다. 또한, 주어진 가스 전환 속도가 부분적으로는 기질 체류 시간의 함수이고 요망되는 체류 시간을 달성하는 것이 다시 말해 생물반응기의 필요한 시간을 나타내기 때문에, 가압된 시스템의 사용은 필요한 생물반응기의 부피, 결과적으로 발효 장비의 자본 비용을 크게 감소시킬 수 있다.
소정의 구현예에서, 발효는 광의 부재 하에, 또는 광합성 미생물의 에너지 요건을 충족시키기에 불충분한 양의 광의 존재 하에 수행된다. 소정의 구현예에서, 본 발명의 미생물은 비-광합성 미생물이다.
본 발명의 방법은 에틸렌 글리콜을 발효 브로쓰(broth)로부터 분리하는 단계를 추가로 포함할 수 있다. 에틸렌 글리콜은 예를 들어, 증류, 모사 이동상(simulated moving bed) 과정, 막 처리, 증발, 투과증발(pervaporation), 가스 탈리(gas stripping), 상분리, 이온 교환, 또는 예를 들어 액체-액체 추출을 포함한 추출 발효를 포함하는 당업계에 알려진 임의의 방법 또는 방법들의 조합을 사용하여 발효 브로쓰로부터 분리 또는 정제될 수 있다. 일 구현예에서, 에틸렌 글리콜은 역삼투 및/또는 투과증발을 사용하여 발효 브로쓰로부터 농축될 수 있다(US 5,552,023). 물은 증류에 의해 제거될 수 있고, 그 후에 하부(bottom)(높은 비율의 에틸렌 글리콜을 함유함)는 증류 또는 진공 증류에 의해 회수되어 고순도의 에틸렌 글리콜 스트림을 생성할 수 있다. 대안적으로, 역삼투 및/또는 투과증발에 의한 농축과 함께 또는 없이, 에틸렌 글리콜은 알데하이드(문헌[Atul, Chem Eng Sci, 59: 2881-2890, 2004])와의 반응성 증류 또는 탄화수소(US 2,218,234)를 사용한 공비 증류에 의해 추가로 정제될 수 있다. 또 다른 접근법에서, 에틸렌 글리콜은 (역삼투 및/또는 투과증발과 함께 또는 없이) 수용액으로부터 활성탄 또는 중합체 흡착제 상에 포획(trap)되고, 저비점 유기 용매를 사용하여 회수될 수 있다(문헌[Chinn, Recovery of Glycols, Sugars, and Related Multiple -OH Compounds from Dilute-Aqueous Solution by Regenerable Adsorption onto Activated Carbons, University of California Berkeley, 1999]). 그 후에, 에틸렌 글리콜은 유기 용매로부터 증류에 의해 회수될 수 있다. 소정의 구현예에서, 에틸렌 글리콜은 발효 브로쓰의 일부를 생물반응기로부터 연속적으로 제거하며 미생물 세포를 상기 브로쓰로부터 (편리하게는 여과에 의해) 분리하고 에틸렌 글리콜을 상기 브로쓰로부터 회수함으로써 발효 브로쓰로부터 회수된다. 공동-생성물, 예컨대 알코올 또는 산은 또한, 브로쓰로부터 분리 또는 정제될 수 있다. 알코올은 예를 들어, 증류에 의해 회수될 수 있다. 산은 예를 들어, 활성탄 상에서의 흡착에 의해 회수될 수 있다. 분리된 미생물 세포는 소정의 구현예에서 생물반응기로 되돌려질 수 있다. 표적 생성물이 제거된 후 잔류하는 세포-무함유 침투물 또한 바람직하게는 생물반응기로 전부 또는 일부 되돌려진다. 추가의 영양소(예컨대 B 비타민)는 세포-무함유 침투물이 생물반응기로 되돌려지기 전에 배지를 보충하기 위해 상기 세포-무함유 침투물에 첨가될 수 있다.
수성 배지로부터 디올의 회수는 많은 방식으로 실증되어 왔다. 모사 이동상(SMB) 기술은 에탄올과 연관된 옥시게네이트의 수성 혼합물로부터 2,3-부탄디올을 회수하는 데 사용되어 왔다(미국 특허 제8,658.845호). 반응성 분리 또한, 효과적인 디올 회수에 대해 실증되어 왔다. 일부 구현예에서, 에틸렌 글리콜의 회수는 디올-함유 스트림과 알데하이드의 반응, 상기 디올의 분획화 및 재생, 농축된 디올 스트림을 회수하기 위한 최종 분획화에 의해 수행된다. 예를 들어, 미국 특허 제7,951,980호를 참조한다.
본 발명은 본원에 기재된 미생물에 의해 본 방법에 따라 생성된 에틸렌 글리콜을 포함하는 조성물을 제공한다. 예를 들어, 에틸렌 글리콜을 포함하는 조성물은 부동제, 보존제, 탈수제 또는 드릴링액일 수 있다.
본 발명은 또한, 본원에 기재된 미생물에 의해 본 방법에 따라 생성된 에틸렌 글리콜을 포함하는 중합체를 제공한다. 이러한 중합체는 예를 들어, 동종중합체, 예컨대 폴리에틸렌 글리콜 또는 공중합체, 예컨대 폴리에틸렌 테레프탈레이트일 수 있다. 이들 중합체의 합성 방법은 당업계에 잘 알려져 있다. 예를 들어, 문헌[Herzberger et al., Chem Rev., 116(4): 2170-2243 (2016) 및 Xiao et al., Ind Eng Chem Res. 54(22): 5862-5869 (2015)]를 참조한다.
나아가, 본 발명은 본원에 기재된 미생물에 의해 본 방법에 따라 생성된 에틸렌 글리콜을 포함하는 중합체를 포함하는 조성물을 제공한다. 예를 들어, 상기 조성물은 섬유, 수지, 필름 또는 플라스틱일 수 있다.
실시예
하기 실시예는 본 발명을 추가로 예시하지만, 당연하게도, 본 발명의 범위를 임의의 방식으로 제한하는 것으로 간주되어서는 안 된다.
실시예 1: 씨. 아우토에타노게눔에서 CO 및/또는 CO
2
및 H
2
로부터 에틸렌 글리콜의 생성을 위한, 비. 서브틸리스 시트레이트 신타제, 이. 콜라이 이소시트레이트 리아제, 및 지. 옥시단스 글리콜알데하이드 데하이드로게나제를 포함하는 이종성 발현 벡터의 구축
비. 서브틸리스로부터의 시트레이트 신타제(citZ; SEQ ID NO: 1-2), 이. 콜라이로부터의 이소시트레이트 리아제(icl; SEQ ID NO: 11-12), 및 지. 옥시단스로부터의 글리콜알데하이드 데하이드로게나제(aldA1; SEQ ID NO: 55-56)를 코딩하는 유전자를 씨. 아우토에타노게눔에서 발현을 위해 코돈-적응시키고 합성하였다. 적응된 유전자를 표준 BsaI 골든 게이트 클로닝 키트(New England Biolabs, 미국 매사추세츠주 입스위치 소재)를 사용하여 발현 셔틀 벡터, pIPL12 내로 클로닝하였다. pIPL12는 이. 콜라이와 씨. 아우토에타노게눔 둘 모두에 대한 복제 기원을 포함하여, pIPL12가 두 종 모두에서 복제되고 유지될 수 있게 하며; pIPL12는 또한, 대부분의 클로스트리디아에서 작용한다. pIPL12는 양성 선별을 위해 에리트로마이신/클래리스로마이신(clarithromycin) 내성을 부여하는 23S rRNA(아데닌(2058)-N(6))-메틸트랜스퍼라제 Erm(B), 이. 콜라이로부터의 접합성 전달(conjugative transfer)을 위한 TraJ, 및 이종성 유전자의 발현을 위한 프로모터를 추가로 포함한다. 도 2a를 참조한다. pIPL12 내로의 citZ, icl 및 aldA1의 클로닝 시 생성되는 발현 벡터는 본원에서 pMEG042로 지칭된다(도 2b).
pMEG042 구축물을 컨쥬게이션을 통해 씨. 아우토에타노게눔 내로 형질전환하였다. 발현 벡터를 우선, 표준 열 충격 형질전환을 사용하여 접합성 공여체 균주, 이. 콜라이 HB101+R702(CA434)(Williams 등 1990)(공여체) 내로 도입하였다. 공여체 세포를 37℃에서 1시간 동안 SOC 배지로 회수한 후, 100 μg/mL 스펙티노마이신(spectinomycin) 및 500 μg/mL 에리트로마이신을 포함하는 LB 배지 플레이트 상으로 평판배양하고 37℃에서 밤새 인큐베이션하였다. 다음날, 100 μg/mL 스펙티노마이신 및 500 μg/mL 에리트로마이신을 포함하는 5 mL LB 분취액을 몇몇 공여체 콜로니에 접종하고, 37℃에서 쉐이킹(shaking)하면서 대략 4시간 동안 또는 배양물이 시각적으로 빽빽(dense)하지만 아직까지는 정지기(stationary phase)에 도달하지 않았을 때까지 인큐베이션하였다. 1.5 mL의 공여체 배양물을 4000 rpm 및 20℃ 내지 25℃에서 2분 동안 원심분리에 의해 수확하고, 상층액을 폐기하였다. 공여체 세포를 500 μL 멸균 PBS 완충제에서 부드럽게 재현탁시키고, 4000 rpm에서 2분 동안 원심분리하고, PBS 상층액을 폐기하였다.
펠렛을 혐기성 챔버 내로 도입하고, 씨. 아우토에타노게눔 배양물(수혜물)의 후기 지수성장기(late exponential phase) 동안 200 μL에서 부드럽게 재현탁시켰다. 씨. 아우토에타노게눔 DSM10061 및 DSM23693(DSM10061의 유도체)을 DSMZ로부터 공급받았다(독일 미생물 및 세포 배양물 컬렉션(The German Collection of Microorganisms and Cell Cultures), 독일 38124 브라운슈바이크 인호펜스트라쎄 7 B 소재). 균주를 표준 혐기성 기법(Hungate 1969; Wolfe 1971)을 사용하여 pH 5.6에서 37℃에서 PETC 배지(미국 특허 제9,738,875호 참조)에서 성장시켰다.
컨쥬게이션 혼합물(공여체 세포와 수혜물 세포의 믹스(mix))을 PETC-MES + 프룩토스 한천 플레이트 상으로 스폿(spot)하고, 건조하였다. 스폿이 더 이상 시각적으로 젖어있지 않았을 때, 플레이트를 압력병(pressure jar) 내로 도입하고, 합성 가스(50% CO, 10% N2, 30% CO2, 10% H2)로 25 내지 30 psi까지 가압하고, 37℃에서 약 24시간 동안 인큐베이션하였다. 그 후에, 컨쥬게이션 혼합물을 10 μL 접종 루프를 사용한 부드러운 스크래핑(scraping)에 의해 플레이트로부터 제거하였다. 제거된 혼합물을 200 내지 300 μL PETC 배지에 현탁시켰다. 컨쥬게이션 혼합물의 100 μL 분취액을 5 μg/mL 클래리스로마이신이 보충된 PETC 배지 한천 플레이트 상으로 평판배양하여, 플라스미드를 갖고 있는 형질전환체를 선별하였다.
pMEG042 플라스미드를 갖고 있는 씨. 아우토에타노게눔의 3개의 구별되는 콜로니를 5 μg/mL 클래리스로마이신과 함께 2 mL의 PETC-MES 배지에 접종하고, 50% CO, 10% N2, 30% CO2, 10% H2와 함께 37℃에서 100 rpm 오비탈 쉐이킹하면서 3일 동안 독립영양적으로 성장시켰다. 배양물을 혈청병(serum bottle)에서 5 μg/mL 클래리스로마이신과 함께 10 mL PETC-MES 배지에서 0.05의 OD600까지 희석시키고, 50% CO, 10% N2, 30% CO2, 10% H2와 함께 37℃에서 100 rpm 오비탈 쉐이킹하면서 20일 이하 동안 독립영양적으로 성장시켜, 매일 샘플링하여 바이오매스 및 대사물을 측정하였다(도 3a 및 도 3b). 하기 기재된 바와 같이, 에틸렌 글리콜의 생성을 가스 크로마토그래피 질량 분광법(GC-MS)을 사용하여 측정하고, 다른 대사물을 고성능 액체 크로마토그래피(HPLC)를 사용하여 측정하였다.
에틸렌 글리콜 농도를 Agilent VF-WAXms 컬럼(15 m Х 0.25 μm Х 0.25 μm) 및 RSH 자동시료화기(autosampler)가 장착된 Thermo Scientific ISQ LT GCMS로 측정하였다. 200 μL의 브로쓰를 200 μL의 메탄올로 희석시킴으로써 시료를 제조하였다. 상기 시료를 보텍스(vortex)한 다음, 14,000 rpm에서 3분 동안 원심분리하였고; 200 μL의 상층액을 인서트(insert)와 함께 유리 바이얼로 옮겼다. 시료를 1.0 μL 주입, 5:1의 분할비, 및 240℃의 유입구 온도를 사용하여 분석용 자동시료화기로 옮겼다. 크로마토그래피를 80℃에서 0.5분 유지(hold)로부터 10℃/분의 램프(ramp)로 150℃까지, 그리고 25℃/분의 램프로 220℃까지, 3분간의 최종 유지의 오븐 프로그램으로 수행하였다. 컬럼 유속은 0.5분 유지와 함께 4.0 mL/분이었고, 그 후에 헬륨을 담체 가스로 사용하여 100 mL/분/분의 속도로 1.5 mL/분까지 하락하였다. MS 이온 공급원을 240℃에서 설정된 전달선(transfer line)을 이용하여 260℃에서 유지시켰다. 33.0 m/z를 정량화 피크로서 그리고 31.0 + 62.0 m/z를 확증 피크(confirming peak)로서 사용하는 선형 외부 표준 보정을 사용하여 정량화를 수행하였다.
에탄올, 아세테이트, 2,3-부탄디올, 글리옥실레이트 및 글리콜레이트 농도를 35℃에서 굴절률(RI) 검출과 함께 Agilent 1260 Infinity LC 상에서 HPLC에 의해 측정하였다. 시료를 80℃에서 5분 동안 가열한 후, 14,000 rpm에서 3분간의 원심분리에 의해 제조하였고; 상층액을 분석용 유리 바이얼로 옮겼다. 등용매(isocratic) 조건 하에 5 mM 황산 이동상을 사용하여 0.7 mL/분 및 35℃에서 Phenomenex RezexTM ROA-Organic Acid H+(8%) 컬럼(300 mm x 7.8 mm x 8 μm) 상으로 10 μL 주입과 함께 분리를 수행하였다.
대략 3일의 독립영양적 성장 후, 에틸렌 글리콜 전구체 글리콜레이트가 관찰되었고, 10일 후, 에틸렌 글리콜의 생성이 관찰되었다(도 3b).
실시예 2: 씨. 아우토에타노게눔에서 CO 및/또는 CO
2
및 H
2
로부터 에틸렌 글리콜의 생성을 위한, 에스. 티오타우리니 알라닌-글리옥실레이트 아미노트랜스퍼라제 및 피. 플루오레센스 알데하이드 데하이드로게나제를 포함하는 이종성 발현 벡터의 구축.
에스. 티오타우리니로부터의 알라닌-글리옥실레이트 아미노트랜스퍼라제(pucG; SEQ ID NO: 15-16) 및 피. 플루오레센스로부터의 알데하이드 데하이드로게나제 Q8r1-96(aldA1; SEQ ID NO: 57-58)를 코딩하는 유전자를 씨. 아우토에타노게눔에서 발현을 위해 코돈-적응시키고 합성하였다. 코돈-적응된 유전자를 pIPL12 내로 클로닝하고(도 2a), 생성된 발현 벡터, pMEG058을 실시예 1에 기재된 바와 같이 씨. 아우토에타노게눔 내로 도입하였다. 도 2c를 참조한다.
실시예 1에 기재된 바와 같이, pMEG058 플라스미드를 갖고 있는 씨. 아우토에타노게눔의 2개의 구별되는 콜로니를 5 μg/mL 클래리스로마이신과 함께 2 mL의 PETC-MES 배지에 접종하고, 독립영양적으로 성장시켰다. 도 4a를 참조한다. 대략 3일의 독립영양적 성장 후, 글리콜레이트가 관찰되었고, 8일 후, 에틸렌 글리콜의 생성이 관찰되었다(도 4b).
실시예 3: 씨. 아우토에타노게눔에서 CO 및/또는 CO
2
및 H
2
로부터 에틸렌 글리콜의 생성을 위한, 에스. 티오타우리니 알라닌-글리옥실레이트 아미노트랜스퍼라제 및 지. 옥시단스 글리콜알데하이드 데하이드로게나제를 포함하는 이종성 발현 벡터의 구축.
에스. 티오타우리니로부터의 알라닌-글리옥실레이트 아미노트랜스퍼라제(pucG; SEQ ID NO: 15-16) 및 지. 옥시단스로부터의 글리콜알데하이드 데하이드로게나제(aldA1; SEQ ID NO: 55-56)를 코딩하는 유전자를 씨. 아우토에타노게눔에서 발현을 위해 코돈-적응시키고 합성하였다. 코돈-적응된 유전자를 pIPL12 내로 클로닝하고(도 2a), 생성된 발현 벡터, pMEG059를 실시예 1에 기재된 바와 같이 씨. 아우토에타노게눔 내로 도입하였다. 도 2d를 참조한다.
실시예 1에 기재된 바와 같이, pMEG059 플라스미드를 갖고 있는 씨. 아우토에타노게눔의 2개의 구별되는 콜로니를 5 μg/mL 클래리스로마이신과 함께 2 mL의 PETC-MES 배지에 접종하고, 독립영양적으로 성장시켰다. 도 5a를 참조한다. 대략 3일의 독립영양적 성장 후, 글리콜레이트가 관찰되었고, 10일 후, 에틸렌 글리콜의 생성이 관찰되었다(도 5b).
실시예 4: 씨. 아우토에타노게눔에서 CO 및/또는 CO
2
및 H
2
로부터 에틸렌 글리콜의 생성을 위한, 알라닌-글리옥실레이트 아미노트랜스퍼라제 및 알데하이드 데하이드로게나제를 포함하는 이종성 발현 벡터의 구축.
씨. 악시두리치로부터의 부류 V 아미노트랜스퍼라제(SgA; SEQ ID NO: 19, 20) 및 피. 플루오레센스로부터의 알데하이드 데하이드로게나제 Q8r1-96(aldA1; SEQ ID NO: 57-58)를 코딩하는 유전자를 씨. 아우토에타노게눔에서 발현을 위해 코돈-적응시키고 합성하였다. 코돈-적응된 유전자를 pIPL12 내로 클로닝하고(도 2a), 생성된 발현 벡터, pMEG061을 실시예 1에 기재된 바와 같이 씨. 아우토에타노게눔 내로 도입하였다. 도 2e를 참조한다.
실시예 1에 기재된 바와 같이, pMEG061 플라스미드를 갖고 있는 씨. 아우토에타노게눔의 3개의 구별되는 콜로니를 5 μg/mL 클래리스로마이신과 함께 2 mL의 PETC-MES 배지에 접종하고, 독립영양적으로 성장시켰다. 도 6a를 참조한다. 대략 3일의 독립영양적 성장 후, 글리콜레이트가 관찰되었고, 16일 후, 에틸렌 글리콜의 생성이 관찰되었다(도 6b).
실시예 5: 에틸렌 글리콜로의 상이한 경로의 최대 수율의 모델링
문헌[Marcellin, Green Chem, 18: 3020-3028, 2016]에 기재된 것과 같은 클로스트리디움 아우토에타노게눔의 게놈-규모 대사 모델을 이용하여, 에틸렌 글리콜로의 상이한 경로의 최대 수율을 예측하였다. 이종성 대사 반응을 야생형 클로스트리디움 아우토에타노게눔 모델 구조에 첨가하여, 비-본연의 화합물 생성 경로의 혼입을 나타내었다. 본원에 기재된 실험 작업에 사용된 모델이 클로스트리디움 아우토에타노게눔에 기초하긴 하지만, 그 결과는, 대사에서의 유사성을 고려한다면, 다른 우드-륭달 미생물에도 적용될 것으로 합리적으로 예상될 수 있다.
솔버 인터페이스(solver interface)로서 옵트랑(optlang) 버전 1.2.3(문헌[Jensen, Optlang: An Algebraic Modeling Language for Mathematical Optimization," The Journal of Open Source Software, 2, doi:10.21105/joss.00139, 2017]) 및 최적화 솔버(optimization solver)로서 Gurobi Optimizer 버전 7.0.2와 함께 코브라피(cobrapy) 버전 0.8.2(문헌[Ebrahim., COBRApy: COnstraints-Based Reconstruction and Analysis for Python, BMC Syst Biol, 7: 74, 2013])을 사용하는 제약-기초 컴퓨터 모델링(constraint-based computational modeling) 기법 유량 균형 분석(FBA; flux balance analysis) 및 대사 조정의 선형 최소화(LMOMA; linear minimization of metabolic adjustment)를 사용하여 에틸렌 글리콜 생성을 모사하였다(문헌[Maia, Proceedings of the Genetic and Evolutionary Computation Conference Companion on - GECCO '17, New York, New York, ACM Press, 1661-1668, 2017])
모델링은 본원의 실시예 1 내지 4에 기재된 경로에 의해 0.37 mol 에틸렌 글리콜/mol CO의 예측된 수율을 드러내었다. 이는 포도당신생합성을 필요로 하는 문헌[Islam et al. Metab Eng, 41: 173-181, 2017]에 의해 기재된 가상 경로에 의한 예측된 수율의 2배 초과이고; 최고 예측된 수율은 약 0.44 g 에틸렌 글리콜/g CO인 것으로 밝혀졌고, 이는 약 0.18 mol 에틸렌 글리콜/mol CO와 동일하다.
본원에서 인용된 공보, 특허 출원 및 특허를 포함한 모든 참조문헌들은, 각각의 참조문헌이 참조에 의해 개별적으로 그리고 구체적으로 포함되는 것으로 나타내고 그 전문이 본원에 제시된 것과 동일한 정도로 참조에 의해 본 명세서에 포함된다. 이 명세서에서 임의의 선행 기술에 대한 참조문헌은, 선행 기술이 임의의 나라에서 노력 분야에서 보편적인 일반적 지식의 일부를 형성함을 인정하는 것이 아니고, 그리고 인정하는 것으로 간주되어서는 안 된다.
본 발명을 기재하는 맥락에서(특히 하기 청구항의 맥락에서) 용어 단수형("a", "an" 및 "the") 및 유사한 지시어의 사용은, 본원에서 다르게 나타내거나 문맥에 의해 명확하게 상충되지 않는 한 단수형과 복수형 둘 모두를 망라하는 것으로 여겨져야 한다. 용어 "포함하는(comprising)," "갖는," "포함하는(including)" 및 "함유하는"은 다르게 주지되지 않는 한, 열린-말단(open-ended) 용어(즉, "비제한적으로 ~을 포함하는"을 의미함)로서 여겨져야 한다. 용어 "~로 본질적으로 구성된"은, 조성물, 과정 또는 방법의 범위를 명시된 물질 또는 단계로, 또는 조성물, 과정 또는 방법의 기본적인 그리고 신규 특징에 물질적으로 영향을 주지 않는 것들로 제한한다. 대안(예를 들어, "또는")의 사용은 대안 중 하나, 둘 모두, 또는 이들의 임의의 조합을 의미하는 것으로 이해되어야 한다. 본원에 사용된 바와 같이, 용어 "약"은 다르게 나타내지 않는 한, 표시된 범위, 값 또는 구조의 ±20%를 의미한다.
본원에서 값들의 범위의 언급은 단지, 본원에서 다르게 나타내지 않는 한, 해당 범위 내에 속하는 각각의 개별 값을 개별적으로 지칭하는 약칭 방법으로서 역할을 하는 것일 뿐이고, 각각의 개별 값은 이 값이 본원에서 개별적으로 언급된 것처럼 명세서 내에 포함된다. 예를 들어, 임의의 농도 범위, 퍼센트 범위, 비 범위, 정수 범위, 크기 범위, 또는 두께 범위는 다르게 나타내지 않는 한, 언급된 범위 내의 임의의 정수 값, 및 적절하다면 이의 부분(fraction)(예컨대 정수의 1/10 및 1/100)을 포함하는 것으로 이해되어야 한다.
본원에서 기재된 모든 방법들은 본원에서 다르게 나타내지 않는 한 또는 문맥에 명확하게 상충되지 않는 한 임의의 적합한 순서로 수행될 수 있다. 본원에서 제공된 임의의 그리고 모든 예, 또는 예시적인 언어(예를 들어, "예컨대")의 사용은 본 발명을 보다 양호하게 예시하기 위한 것일 뿐이고, 다르게 청구되지 않는 한 본 발명의 범위에 제한을 두는 것은 아니다. 본 명세서의 어떤 언어도 임의의 청구되지 않은 구성 요소를 본 발명의 실시에 필수적인 것으로 명시하는 것으로 해석되지 않아야 한다.
본 발명의 바람직한 구현예가 본원에 기재되어 있다. 이들 바람직한 구현예들의 변이는 상기 상세한 설명을 읽을 때 당업자에게 명백해질 수 있다. 본 발명자들은 당업자가 이러한 변이를 적절하게 이용할 것임을 예상하고, 본 발명자들은 본 발명이 본원에서 구체적으로 기재된 것보다 다르게 실시되고자 한다. 이에, 본 발명은 적용 가능한 법에 의해 허용되는 한 첨부된 청구항에서 언급된 주제의 모든 변형들 및 등가물들을 포함한다. 더욱이, 이의 모든 가능한 변이들에서 상기 기재된 요소들의 임의의 조합은, 본원에서 다르게 나타내지 않는 한 또는 문맥에 의해 명확하게 상충되지 않는 한 본 발명에 의해 포괄된다.
SEQUENCE LISTING
<110> LanzaTech, Inc.
<120> MICROORGANISMS AND METHODS FOR THE BIOLOGICAL PRODUCTION OF
ETHYLENE GLYCOL
<130> LT133WO1
<150> US 62/607,446
<151> 2017-12-19
<150> US 62/683,454
<151> 2018-06-11
<160> 82
<170> PatentIn version 3.5
<210> 1
<211> 1101
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 1
atggtacatt atggattaaa gggaataact tgtgtagaaa cttctatatc tcatatagat 60
ggagaaaagg gaaggcttat atacagagga catcatgcta aggacatagc actaaatcat 120
agctttgaag aggctgctta tttaatctta tttggaaagc tcccaagtac agaagagctt 180
caagtcttca aagacaaatt ggcagcagaa agaaatttac cagaacatat agaaagactt 240
attcaatcct taccaaataa tatggatgat atgtcagttt taagaactgt tgtaagtgca 300
cttggtgaaa atacctatac atttcatcct aaaacagaag aggctataag acttatagca 360
ataactcctt ccataattgc ttatagaaaa agatggacaa gaggtgaaca agcaatagca 420
ccatcatcac aatatggaca tgttgaaaat tattattaca tgcttacagg agaacagcct 480
agtgaggcta agaaaaaagc acttgaaacc tatatgatat tagctacaga acatggcatg 540
aatgcttcta ctttttctgc aagagtaact ttaagcactg aatcagattt agtatcagca 600
gtaacagcag cattaggtac tatgaaggga ccactacatg gcggcgctcc ctctgcagtt 660
acaaagatgt tagaagacat aggagaaaag gaacatgcag aggcttatct aaaagaaaaa 720
cttgaaaagg gagagagact catgggtttt ggacatagag tatacaagac taaagatcct 780
agagcagaag cattaagaca aaaggcagaa gaagtggcag gaaatgatag agatcttgat 840
cttgcattgc acgttgaagc agaggctata agattacttg aaatatataa accaggaaga 900
aaactttata ctaatgttga attttatgca gctgctgtta tgagggctat agactttgac 960
gatgaattat ttactcctac tttttccgct tctcgtatgg ttggatggtg tgcgcatgtg 1020
cttgaacagg cagagaataa catgattttt agaccatctg cacaatatac aggtgctatc 1080
ccagaagaag tactttctta a 1101
<210> 2
<211> 366
<212> PRT
<213> Bacillus subtilis
<400> 2
Met Val His Tyr Gly Leu Lys Gly Ile Thr Cys Val Glu Thr Ser Ile
1 5 10 15
Ser His Ile Asp Gly Glu Lys Gly Arg Leu Ile Tyr Arg Gly His His
20 25 30
Ala Lys Asp Ile Ala Leu Asn His Ser Phe Glu Glu Ala Ala Tyr Leu
35 40 45
Ile Leu Phe Gly Lys Leu Pro Ser Thr Glu Glu Leu Gln Val Phe Lys
50 55 60
Asp Lys Leu Ala Ala Glu Arg Asn Leu Pro Glu His Ile Glu Arg Leu
65 70 75 80
Ile Gln Ser Leu Pro Asn Asn Met Asp Asp Met Ser Val Leu Arg Thr
85 90 95
Val Val Ser Ala Leu Gly Glu Asn Thr Tyr Thr Phe His Pro Lys Thr
100 105 110
Glu Glu Ala Ile Arg Leu Ile Ala Ile Thr Pro Ser Ile Ile Ala Tyr
115 120 125
Arg Lys Arg Trp Thr Arg Gly Glu Gln Ala Ile Ala Pro Ser Ser Gln
130 135 140
Tyr Gly His Val Glu Asn Tyr Tyr Tyr Met Leu Thr Gly Glu Gln Pro
145 150 155 160
Ser Glu Ala Lys Lys Lys Ala Leu Glu Thr Tyr Met Ile Leu Ala Thr
165 170 175
Glu His Gly Met Asn Ala Ser Thr Phe Ser Ala Arg Val Thr Leu Ser
180 185 190
Thr Glu Ser Asp Leu Val Ser Ala Val Thr Ala Ala Leu Gly Thr Met
195 200 205
Lys Gly Pro Leu His Gly Gly Ala Pro Ser Ala Val Thr Lys Met Leu
210 215 220
Glu Asp Ile Gly Glu Lys Glu His Ala Glu Ala Tyr Leu Lys Glu Lys
225 230 235 240
Leu Glu Lys Gly Glu Arg Leu Met Gly Phe Gly His Arg Val Tyr Lys
245 250 255
Thr Lys Asp Pro Arg Ala Glu Ala Leu Arg Gln Lys Ala Glu Glu Val
260 265 270
Ala Gly Asn Asp Arg Asp Leu Asp Leu Ala Leu His Val Glu Ala Glu
275 280 285
Ala Ile Arg Leu Leu Glu Ile Tyr Lys Pro Gly Arg Lys Leu Tyr Thr
290 295 300
Asn Val Glu Phe Tyr Ala Ala Ala Val Met Arg Ala Ile Asp Phe Asp
305 310 315 320
Asp Glu Leu Phe Thr Pro Thr Phe Ser Ala Ser Arg Met Val Gly Trp
325 330 335
Cys Ala His Val Leu Glu Gln Ala Glu Asn Asn Met Ile Phe Arg Pro
340 345 350
Ser Ala Gln Tyr Thr Gly Ala Ile Pro Glu Glu Val Leu Ser
355 360 365
<210> 3
<211> 1362
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 3
atgaaaaaat gttcttacga ctataaatta aataatgtaa atgatcctaa cttctataaa 60
gatatattcc cttatgaaga agtacctaaa atagtattta ataatattca attaccaatg 120
gatctgcctg ataacatata cataactgat actaccttcc gtgatggaca acaatcaatg 180
cctccttata caagtagaga aatagtaagg atttttgatt atttgcatga attagacaac 240
aattcaggaa taataaaaca aacagaattt tttttatata ccaaaaaaga tagaaaagca 300
gctgaagttt gtatggaaag aggatacgag ttccctgaag ttacttcttg gattagggca 360
gataaagagg acttaaaatt agttaaggat atgggcataa aggaaacagg tatgttaatg 420
agttgttcag actatcacat atttaagaaa ttaaaaatga caagaaaaga gacaatggat 480
atgtatcttg atttagctag agaggctcta aataatggta ttagacctag atgtcattta 540
gaagatatta caagagcaga tttttatgga tttgtagtac cttttgtaaa tgaacttatg 600
aaaatgagca aagaggcaaa catcccaata aaaataaggg cttgtgatac tcttggatta 660
ggggtacctt ataatggagt tgaaatacca agatctgtac agggaataat tcatggtttg 720
agaaacatat gtgaagttcc ttctgaatct attgaatggc atggacataa tgatttctat 780
ggagtagtaa ctaactcctc cacggcatgg ctatatggag caagcagcat aaacacttcc 840
ttcttgggaa taggagaaag aacaggaaac tgtccacttg aagcaatgat atttgaatat 900
gctcaaataa aaggaaatac taaaaatatg aaacttcatg taataacgga gcttgctcaa 960
tattttgaaa aggaaataaa atattctgta cctgttagaa ctccttttgt tggaactgat 1020
tttaatgtaa caagggctgg catacatgca gatggtatcc taaaagatga agaaatatat 1080
aatatttttg atacagataa gatactggga aggcctgtag tagtagctgt ttcccagtat 1140
tcaggaaggg ctggaatagc agcatgggtg aacacttatt ataggcttaa agatgaagat 1200
aaagttaata aaaatgacag cagaatagat caaattaaaa tgtgggtaga tgagcaatac 1260
cgcgctggta ggacatcagt aattggaaac aatgaactag aacttttagt ttcaaaagta 1320
atgccagaag taatagaaaa aacagaagaa agggcttctt aa 1362
<210> 4
<211> 453
<212> PRT
<213> Clostridium kluyveri
<400> 4
Met Lys Lys Cys Ser Tyr Asp Tyr Lys Leu Asn Asn Val Asn Asp Pro
1 5 10 15
Asn Phe Tyr Lys Asp Ile Phe Pro Tyr Glu Glu Val Pro Lys Ile Val
20 25 30
Phe Asn Asn Ile Gln Leu Pro Met Asp Leu Pro Asp Asn Ile Tyr Ile
35 40 45
Thr Asp Thr Thr Phe Arg Asp Gly Gln Gln Ser Met Pro Pro Tyr Thr
50 55 60
Ser Arg Glu Ile Val Arg Ile Phe Asp Tyr Leu His Glu Leu Asp Asn
65 70 75 80
Asn Ser Gly Ile Ile Lys Gln Thr Glu Phe Phe Leu Tyr Thr Lys Lys
85 90 95
Asp Arg Lys Ala Ala Glu Val Cys Met Glu Arg Gly Tyr Glu Phe Pro
100 105 110
Glu Val Thr Ser Trp Ile Arg Ala Asp Lys Glu Asp Leu Lys Leu Val
115 120 125
Lys Asp Met Gly Ile Lys Glu Thr Gly Met Leu Met Ser Cys Ser Asp
130 135 140
Tyr His Ile Phe Lys Lys Leu Lys Met Thr Arg Lys Glu Thr Met Asp
145 150 155 160
Met Tyr Leu Asp Leu Ala Arg Glu Ala Leu Asn Asn Gly Ile Arg Pro
165 170 175
Arg Cys His Leu Glu Asp Ile Thr Arg Ala Asp Phe Tyr Gly Phe Val
180 185 190
Val Pro Phe Val Asn Glu Leu Met Lys Met Ser Lys Glu Ala Asn Ile
195 200 205
Pro Ile Lys Ile Arg Ala Cys Asp Thr Leu Gly Leu Gly Val Pro Tyr
210 215 220
Asn Gly Val Glu Ile Pro Arg Ser Val Gln Gly Ile Ile His Gly Leu
225 230 235 240
Arg Asn Ile Cys Glu Val Pro Ser Glu Ser Ile Glu Trp His Gly His
245 250 255
Asn Asp Phe Tyr Gly Val Val Thr Asn Ser Ser Thr Ala Trp Leu Tyr
260 265 270
Gly Ala Ser Ser Ile Asn Thr Ser Phe Leu Gly Ile Gly Glu Arg Thr
275 280 285
Gly Asn Cys Pro Leu Glu Ala Met Ile Phe Glu Tyr Ala Gln Ile Lys
290 295 300
Gly Asn Thr Lys Asn Met Lys Leu His Val Ile Thr Glu Leu Ala Gln
305 310 315 320
Tyr Phe Glu Lys Glu Ile Lys Tyr Ser Val Pro Val Arg Thr Pro Phe
325 330 335
Val Gly Thr Asp Phe Asn Val Thr Arg Ala Gly Ile His Ala Asp Gly
340 345 350
Ile Leu Lys Asp Glu Glu Ile Tyr Asn Ile Phe Asp Thr Asp Lys Ile
355 360 365
Leu Gly Arg Pro Val Val Val Ala Val Ser Gln Tyr Ser Gly Arg Ala
370 375 380
Gly Ile Ala Ala Trp Val Asn Thr Tyr Tyr Arg Leu Lys Asp Glu Asp
385 390 395 400
Lys Val Asn Lys Asn Asp Ser Arg Ile Asp Gln Ile Lys Met Trp Val
405 410 415
Asp Glu Gln Tyr Arg Ala Gly Arg Thr Ser Val Ile Gly Asn Asn Glu
420 425 430
Leu Glu Leu Leu Val Ser Lys Val Met Pro Glu Val Ile Glu Lys Thr
435 440 445
Glu Glu Arg Ala Ser
450
<210> 5
<211> 1359
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 5
atgtcaataa acaacatagg tccttttact aaatcccact tagatatgtg tattaaaaac 60
aattcaattg atgatgcctt gtatgaaaag tatggagtaa agagatcact tagagatctt 120
aatggtattg gaataaatgc tgggataaca aatgtcagtt tgtcaaagtc ttttactaca 180
gatgaaaatg gtaacagagt accttgtgca ggagagttat attatagagg atacgagatt 240
catgatctta taaagggatt ttttttggac aatagatttg gatttgagga atgtacttat 300
ttgttacttt ttggcgtact tcctgacgaa aaagaacttc aaaatttcaa acaagtctta 360
aatatctctt acgatttacc tcatcatttt atacaagatg ttataatgaa atctcctaca 420
gcagacataa tagctaatat gactaaatcc acgcttgcac taggttccta tgataaaaag 480
atgggagata actcacttga aaatgtcctt caacaatgta ttcaattaat atctatgttt 540
ccaaggcttg ctgtatactc ctatcagggt tatagacatt atgaattagg taaatcttgc 600
tatatacaca aacctcttcc agaattaagt tttgcagaaa atatattatc aactcttaga 660
tcaaatagaa aatatacaag attggaagca agagtacttg atcttgccct agttttacac 720
atggaacatg gcggcggctc aaattctact tttactacaa gggtagttac ttcatcagga 780
agtgatacgt atgcaactat ggcagcagca ttatgttcat taaaaggacc tttaaatggc 840
ggcggcgatt atcaagtaat gggtatgatg aagaatataa gagataatgt aagtgatata 900
actgacgaag aagaagttgg tgaatatatt agaaaaattg taaaccgtga agcgtatgat 960
aaaacaggaa tagtatacgg aatgggtcat ccattctata gcatatctga cccaagggct 1020
ttagagttca agaaatatgt aaaattactt gcagcagaaa aaggaatgga tgaagaatat 1080
gcattatatg aaatgataga aaggattgca ccagaaatta tcgcagaaga aaggaagata 1140
tataaaggag tatgtattaa tatagattat tattctggtt tgctttataa aatgttaaag 1200
atcccagcag agatgtttac tccattattt gctattgcca gagttgtagg atggtcggca 1260
catagaatgg aagaacttgt aaattcttac aaaatcataa gacctgctta tacatctata 1320
gcagagataa aggaatacgt acctataaat gaaagataa 1359
<210> 6
<211> 452
<212> PRT
<213> Clostridium sp. L2-50
<400> 6
Met Ser Ile Asn Asn Ile Gly Pro Phe Thr Lys Ser His Leu Asp Met
1 5 10 15
Cys Ile Lys Asn Asn Ser Ile Asp Asp Ala Leu Tyr Glu Lys Tyr Gly
20 25 30
Val Lys Arg Ser Leu Arg Asp Leu Asn Gly Ile Gly Ile Asn Ala Gly
35 40 45
Ile Thr Asn Val Ser Leu Ser Lys Ser Phe Thr Thr Asp Glu Asn Gly
50 55 60
Asn Arg Val Pro Cys Ala Gly Glu Leu Tyr Tyr Arg Gly Tyr Glu Ile
65 70 75 80
His Asp Leu Ile Lys Gly Phe Phe Leu Asp Asn Arg Phe Gly Phe Glu
85 90 95
Glu Cys Thr Tyr Leu Leu Leu Phe Gly Val Leu Pro Asp Glu Lys Glu
100 105 110
Leu Gln Asn Phe Lys Gln Val Leu Asn Ile Ser Tyr Asp Leu Pro His
115 120 125
His Phe Ile Gln Asp Val Ile Met Lys Ser Pro Thr Ala Asp Ile Ile
130 135 140
Ala Asn Met Thr Lys Ser Thr Leu Ala Leu Gly Ser Tyr Asp Lys Lys
145 150 155 160
Met Gly Asp Asn Ser Leu Glu Asn Val Leu Gln Gln Cys Ile Gln Leu
165 170 175
Ile Ser Met Phe Pro Arg Leu Ala Val Tyr Ser Tyr Gln Gly Tyr Arg
180 185 190
His Tyr Glu Leu Gly Lys Ser Cys Tyr Ile His Lys Pro Leu Pro Glu
195 200 205
Leu Ser Phe Ala Glu Asn Ile Leu Ser Thr Leu Arg Ser Asn Arg Lys
210 215 220
Tyr Thr Arg Leu Glu Ala Arg Val Leu Asp Leu Ala Leu Val Leu His
225 230 235 240
Met Glu His Gly Gly Gly Ser Asn Ser Thr Phe Thr Thr Arg Val Val
245 250 255
Thr Ser Ser Gly Ser Asp Thr Tyr Ala Thr Met Ala Ala Ala Leu Cys
260 265 270
Ser Leu Lys Gly Pro Leu Asn Gly Gly Gly Asp Tyr Gln Val Met Gly
275 280 285
Met Met Lys Asn Ile Arg Asp Asn Val Ser Asp Ile Thr Asp Glu Glu
290 295 300
Glu Val Gly Glu Tyr Ile Arg Lys Ile Val Asn Arg Glu Ala Tyr Asp
305 310 315 320
Lys Thr Gly Ile Val Tyr Gly Met Gly His Pro Phe Tyr Ser Ile Ser
325 330 335
Asp Pro Arg Ala Leu Glu Phe Lys Lys Tyr Val Lys Leu Leu Ala Ala
340 345 350
Glu Lys Gly Met Asp Glu Glu Tyr Ala Leu Tyr Glu Met Ile Glu Arg
355 360 365
Ile Ala Pro Glu Ile Ile Ala Glu Glu Arg Lys Ile Tyr Lys Gly Val
370 375 380
Cys Ile Asn Ile Asp Tyr Tyr Ser Gly Leu Leu Tyr Lys Met Leu Lys
385 390 395 400
Ile Pro Ala Glu Met Phe Thr Pro Leu Phe Ala Ile Ala Arg Val Val
405 410 415
Gly Trp Ser Ala His Arg Met Glu Glu Leu Val Asn Ser Tyr Lys Ile
420 425 430
Ile Arg Pro Ala Tyr Thr Ser Ile Ala Glu Ile Lys Glu Tyr Val Pro
435 440 445
Ile Asn Glu Arg
450
<210> 7
<211> 1119
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 7
atgacagcaa caaggggcct tgaaggggta gtagcgacta ctagtagtgt aagttcaatt 60
atagatgata ctttgactta tgttggatat gatatagatg atcttacgga aaatgcaagc 120
tttgaagaaa taatatattt attgtggcat ttgagattac caaacaaaaa ggaattagaa 180
gaattaaaac aacaattagc caaagaggca gctgttcctc aggaaataat agaacatttc 240
aaatcctata gcttagaaaa tgttcatcct atggctgcac ttagaactgc tatatccctc 300
ttaggtcttt tggattctga ggcagatact atgaatccag aggctaacta tagaaaagca 360
ataagattac aggctaaagt cccaggatta gttgcagcat tttcaagaat acgaaaagga 420
ttagaaccag tagagccaag agaagattac ggaatagcag agaatttttt gtatactttg 480
aatggcgaag agcctagtcc aatagaagtt gaagcattta ataaagcact tatacttcat 540
gctgaccatg aacttaacgc atctacattt acagctagag tttgtgtagc cactctttct 600
gatatttatt ccggcattac tgctgcaatt ggggctctta agggacctct acatggcggc 660
gccaacgagg gtgtaatgaa gatgttaaca gagattggag aggttgaaaa tgctgaacct 720
tatataagag ccaaacttga aaaaaaggaa aaaataatgg gatttggtca tagagtatac 780
aaacatggag atcctagagc aaaacatctt aaagaaatgt caaagagact tacaaattta 840
acaggtgaat caaaatggta tgaaatgagt attcgtattg aagatatagt tacgtcagag 900
aagaaacttc cccctaatgt agatttttac agtgcatctg tttatcattc gcttggaatc 960
gatcacgatt tatttacgcc tatatttgct gtaagtagaa tgagcggatg gttagctcat 1020
attctcgaac agtacgacaa taacagactt ataagaccac gtgctgatta tacaggtcct 1080
gacaaacaaa aatttgtacc tatagaagaa agagcataa 1119
<210> 8
<211> 372
<212> PRT
<213> Bacillus subtilis
<400> 8
Met Thr Ala Thr Arg Gly Leu Glu Gly Val Val Ala Thr Thr Ser Ser
1 5 10 15
Val Ser Ser Ile Ile Asp Asp Thr Leu Thr Tyr Val Gly Tyr Asp Ile
20 25 30
Asp Asp Leu Thr Glu Asn Ala Ser Phe Glu Glu Ile Ile Tyr Leu Leu
35 40 45
Trp His Leu Arg Leu Pro Asn Lys Lys Glu Leu Glu Glu Leu Lys Gln
50 55 60
Gln Leu Ala Lys Glu Ala Ala Val Pro Gln Glu Ile Ile Glu His Phe
65 70 75 80
Lys Ser Tyr Ser Leu Glu Asn Val His Pro Met Ala Ala Leu Arg Thr
85 90 95
Ala Ile Ser Leu Leu Gly Leu Leu Asp Ser Glu Ala Asp Thr Met Asn
100 105 110
Pro Glu Ala Asn Tyr Arg Lys Ala Ile Arg Leu Gln Ala Lys Val Pro
115 120 125
Gly Leu Val Ala Ala Phe Ser Arg Ile Arg Lys Gly Leu Glu Pro Val
130 135 140
Glu Pro Arg Glu Asp Tyr Gly Ile Ala Glu Asn Phe Leu Tyr Thr Leu
145 150 155 160
Asn Gly Glu Glu Pro Ser Pro Ile Glu Val Glu Ala Phe Asn Lys Ala
165 170 175
Leu Ile Leu His Ala Asp His Glu Leu Asn Ala Ser Thr Phe Thr Ala
180 185 190
Arg Val Cys Val Ala Thr Leu Ser Asp Ile Tyr Ser Gly Ile Thr Ala
195 200 205
Ala Ile Gly Ala Leu Lys Gly Pro Leu His Gly Gly Ala Asn Glu Gly
210 215 220
Val Met Lys Met Leu Thr Glu Ile Gly Glu Val Glu Asn Ala Glu Pro
225 230 235 240
Tyr Ile Arg Ala Lys Leu Glu Lys Lys Glu Lys Ile Met Gly Phe Gly
245 250 255
His Arg Val Tyr Lys His Gly Asp Pro Arg Ala Lys His Leu Lys Glu
260 265 270
Met Ser Lys Arg Leu Thr Asn Leu Thr Gly Glu Ser Lys Trp Tyr Glu
275 280 285
Met Ser Ile Arg Ile Glu Asp Ile Val Thr Ser Glu Lys Lys Leu Pro
290 295 300
Pro Asn Val Asp Phe Tyr Ser Ala Ser Val Tyr His Ser Leu Gly Ile
305 310 315 320
Asp His Asp Leu Phe Thr Pro Ile Phe Ala Val Ser Arg Met Ser Gly
325 330 335
Trp Leu Ala His Ile Leu Glu Gln Tyr Asp Asn Asn Arg Leu Ile Arg
340 345 350
Pro Arg Ala Asp Tyr Thr Gly Pro Asp Lys Gln Lys Phe Val Pro Ile
355 360 365
Glu Glu Arg Ala
370
<210> 9
<211> 1785
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 9
atgcaaatta tggaagaaga aggaagattt gaagcagaag tggcagaagt agaaagttgg 60
tggggaacag agcgttttag gcttactaaa aggccttata cggcaaggga cgttgtactt 120
ttaagaggaa ccttgagaca gtcttatgcc agtggcgaga tggctaagaa attatggaga 180
actttaaaag cgcatcaggc tggcggcact gcttcaagaa cttttggtgc tttagatcca 240
gttcaagtta caatgatggc taagcaccta gatactattt atgtaagcgg atggcagtgt 300
tcatctacac acacatcaac aaatgaacct ggcccagatc ttgcagacta tccttatgat 360
actgtgccaa ataaggtaga acatcttttt tttgctcaat tatatcatga ccgcaagcaa 420
agagaggcaa gaatgagtct tccgcgagca gaaagagccc gtgctcctta tgtagatttt 480
ttaaaaccta taatagcaga tggagatact ggatttggcg gcgccacagc tacagttaaa 540
ctttgtaaac tttttgtaga gagaggtgct gcgggagttc accttgagga tcaatcatct 600
gttacaaaaa aatgtggaca catggctgga aaagttttag tggcagtttc agagcatgtt 660
aataggcttg tagctgctag acttcaattt gacgttatgg gcgtggagac agttttagtg 720
gcaaggacag atgcagtagc agctacactt atacaaacta atgtagatgc cagggatcac 780
caattcatag taggagccac aaatccagga ttgagaggtc agtctcttgc agctgtatta 840
tctgctggta tgtcagctgg taagagcgga agagaattgc aagcaatcga agatgaatgg 900
ctagcagcag cacaattaaa gacttttagc gaatgtgtac gagatgctat tgcaggacta 960
ggcgtggcag caaaggaaaa gcaaagaaga ctccaagaat gggacagggc aacaggcggc 1020
tatgatagat gtgtaagcaa tgatcaagca agagatatcg cagcatccct tggagtaact 1080
tctgtattct gggattggga tttgcctaga actagagaag gtttttacag attcagaggc 1140
tcagtagctg ccgcagtagt tagaggcaga gcatttgctc cacatgcaga tgtattatgg 1200
atggaaacat cttcaccaaa tgtggcagaa tgtactgcat tttcagaagg agttaaggca 1260
gcatgtccag aagcaatgct cgcgtataat ttgtcaccat cctttaactg ggacgcaagt 1320
ggcatgacag atgcagaaat ggcagcattt attccatctg tagctagatt gggatatgta 1380
tggcaattta taactcttgc tggttttcat gctgatgcct tggttacaga tacttttgct 1440
agggattttg ctagaagagg tatgttagct tatgttgaaa gaatacagag agaagaaaga 1500
ataaatggtg tagaaactct tgaacatcaa aaatggtcag gagcaaattt ttacgaccgt 1560
gtgttgaaag cagtacaagg cggcataagc agtactgcag ctatgggaaa aggtaaagta 1620
cctcacttcc cagcattctt tttttgctta gaaaaaaata agccatcatt cgttcacagt 1680
tttgatgtag tactttttac aggtgttaca gaggaacaat tcaaagatcc aaggcctgcc 1740
actggttcaa gtggacttca ggttatggcc aaatcacgta tttaa 1785
<210> 10
<211> 594
<212> PRT
<213> Zea mays
<400> 10
Met Gln Ile Met Glu Glu Glu Gly Arg Phe Glu Ala Glu Val Ala Glu
1 5 10 15
Val Glu Ser Trp Trp Gly Thr Glu Arg Phe Arg Leu Thr Lys Arg Pro
20 25 30
Tyr Thr Ala Arg Asp Val Val Leu Leu Arg Gly Thr Leu Arg Gln Ser
35 40 45
Tyr Ala Ser Gly Glu Met Ala Lys Lys Leu Trp Arg Thr Leu Lys Ala
50 55 60
His Gln Ala Gly Gly Thr Ala Ser Arg Thr Phe Gly Ala Leu Asp Pro
65 70 75 80
Val Gln Val Thr Met Met Ala Lys His Leu Asp Thr Ile Tyr Val Ser
85 90 95
Gly Trp Gln Cys Ser Ser Thr His Thr Ser Thr Asn Glu Pro Gly Pro
100 105 110
Asp Leu Ala Asp Tyr Pro Tyr Asp Thr Val Pro Asn Lys Val Glu His
115 120 125
Leu Phe Phe Ala Gln Leu Tyr His Asp Arg Lys Gln Arg Glu Ala Arg
130 135 140
Met Ser Leu Pro Arg Ala Glu Arg Ala Arg Ala Pro Tyr Val Asp Phe
145 150 155 160
Leu Lys Pro Ile Ile Ala Asp Gly Asp Thr Gly Phe Gly Gly Ala Thr
165 170 175
Ala Thr Val Lys Leu Cys Lys Leu Phe Val Glu Arg Gly Ala Ala Gly
180 185 190
Val His Leu Glu Asp Gln Ser Ser Val Thr Lys Lys Cys Gly His Met
195 200 205
Ala Gly Lys Val Leu Val Ala Val Ser Glu His Val Asn Arg Leu Val
210 215 220
Ala Ala Arg Leu Gln Phe Asp Val Met Gly Val Glu Thr Val Leu Val
225 230 235 240
Ala Arg Thr Asp Ala Val Ala Ala Thr Leu Ile Gln Thr Asn Val Asp
245 250 255
Ala Arg Asp His Gln Phe Ile Val Gly Ala Thr Asn Pro Gly Leu Arg
260 265 270
Gly Gln Ser Leu Ala Ala Val Leu Ser Ala Gly Met Ser Ala Gly Lys
275 280 285
Ser Gly Arg Glu Leu Gln Ala Ile Glu Asp Glu Trp Leu Ala Ala Ala
290 295 300
Gln Leu Lys Thr Phe Ser Glu Cys Val Arg Asp Ala Ile Ala Gly Leu
305 310 315 320
Gly Val Ala Ala Lys Glu Lys Gln Arg Arg Leu Gln Glu Trp Asp Arg
325 330 335
Ala Thr Gly Gly Tyr Asp Arg Cys Val Ser Asn Asp Gln Ala Arg Asp
340 345 350
Ile Ala Ala Ser Leu Gly Val Thr Ser Val Phe Trp Asp Trp Asp Leu
355 360 365
Pro Arg Thr Arg Glu Gly Phe Tyr Arg Phe Arg Gly Ser Val Ala Ala
370 375 380
Ala Val Val Arg Gly Arg Ala Phe Ala Pro His Ala Asp Val Leu Trp
385 390 395 400
Met Glu Thr Ser Ser Pro Asn Val Ala Glu Cys Thr Ala Phe Ser Glu
405 410 415
Gly Val Lys Ala Ala Cys Pro Glu Ala Met Leu Ala Tyr Asn Leu Ser
420 425 430
Pro Ser Phe Asn Trp Asp Ala Ser Gly Met Thr Asp Ala Glu Met Ala
435 440 445
Ala Phe Ile Pro Ser Val Ala Arg Leu Gly Tyr Val Trp Gln Phe Ile
450 455 460
Thr Leu Ala Gly Phe His Ala Asp Ala Leu Val Thr Asp Thr Phe Ala
465 470 475 480
Arg Asp Phe Ala Arg Arg Gly Met Leu Ala Tyr Val Glu Arg Ile Gln
485 490 495
Arg Glu Glu Arg Ile Asn Gly Val Glu Thr Leu Glu His Gln Lys Trp
500 505 510
Ser Gly Ala Asn Phe Tyr Asp Arg Val Leu Lys Ala Val Gln Gly Gly
515 520 525
Ile Ser Ser Thr Ala Ala Met Gly Lys Gly Lys Val Pro His Phe Pro
530 535 540
Ala Phe Phe Phe Cys Leu Glu Lys Asn Lys Pro Ser Phe Val His Ser
545 550 555 560
Phe Asp Val Val Leu Phe Thr Gly Val Thr Glu Glu Gln Phe Lys Asp
565 570 575
Pro Arg Pro Ala Thr Gly Ser Ser Gly Leu Gln Val Met Ala Lys Ser
580 585 590
Arg Ile
<210> 11
<211> 1305
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 11
atgaaaacaa gaactcaaca aatagaagaa ttacaaaaag aatggacgca accaagatgg 60
gaaggtatta cgaggcctta ttctgcagaa gatgtagtaa aattaagagg ttctgtaaat 120
ccagaatgta ctcttgccca gcttggagca gctaaaatgt ggagactttt gcacggtgaa 180
tcaaagaagg gttatataaa ctctcttggc gctttaacag gcggccaggc acttcaacag 240
gctaaggcag gaatagaagc agtttatctt tctggatggc aagtagcagc agatgcaaat 300
ttagcagcat caatgtatcc tgatcagagc ttatacccag caaattcagt cccagctgta 360
gtagagagaa taaataatac ctttagaagg gcagatcaaa ttcaatggtc tgctggtatt 420
gaaccaggtg atccaagata cgtggattat tttttgccaa ttgtagcaga tgctgaggct 480
ggttttggcg gcgtattaaa tgcatttgaa ttaatgaaag caatgataga ggctggtgct 540
gcagctgtcc attttgaaga tcagttagct tcagttaaga aatgtggaca catgggcggc 600
aaggtattag ttccaaccca agaagcaata caaaaattag tggcagctag acttgcagct 660
gatgtaacag gtgtgcctac attactagtt gcaagaacag atgcagatgc tgcagatctt 720
attactagtg actgtgatcc ttatgattca gaatttatta caggagaaag aaccagtgag 780
ggatttttta gaactcatgc aggaatagaa caggctatat caagaggatt agcttatgct 840
ccttatgcag atcttgtttg gtgtgaaaca tctacaccag atctcgaact tgcccgtaga 900
tttgcccagg caatacatgc taagtatcca ggaaaattat tagcgtacaa ttgttctcct 960
tcatttaatt ggcagaagaa cttagatgac aaaacaatag caagttttca gcaacaatta 1020
tcagatatgg gatacaaatt tcagttcata acattagctg gaatacatag tatgtggttt 1080
aatatgtttg atcttgcaaa tgcttatgca caaggagaag gcatgaagca ttatgtagaa 1140
aaagtacaac agccagaatt tgcagctgcc aaggatggat atactttcgt ttctcatcaa 1200
caagaggttg gaactggata ttttgataag gttacaacaa ttatacaggg cggcacatcg 1260
tctgttactg cactaacagg ttcaactgaa gaatctcaat tttaa 1305
<210> 12
<211> 434
<212> PRT
<213> Escherichia coli
<400> 12
Met Lys Thr Arg Thr Gln Gln Ile Glu Glu Leu Gln Lys Glu Trp Thr
1 5 10 15
Gln Pro Arg Trp Glu Gly Ile Thr Arg Pro Tyr Ser Ala Glu Asp Val
20 25 30
Val Lys Leu Arg Gly Ser Val Asn Pro Glu Cys Thr Leu Ala Gln Leu
35 40 45
Gly Ala Ala Lys Met Trp Arg Leu Leu His Gly Glu Ser Lys Lys Gly
50 55 60
Tyr Ile Asn Ser Leu Gly Ala Leu Thr Gly Gly Gln Ala Leu Gln Gln
65 70 75 80
Ala Lys Ala Gly Ile Glu Ala Val Tyr Leu Ser Gly Trp Gln Val Ala
85 90 95
Ala Asp Ala Asn Leu Ala Ala Ser Met Tyr Pro Asp Gln Ser Leu Tyr
100 105 110
Pro Ala Asn Ser Val Pro Ala Val Val Glu Arg Ile Asn Asn Thr Phe
115 120 125
Arg Arg Ala Asp Gln Ile Gln Trp Ser Ala Gly Ile Glu Pro Gly Asp
130 135 140
Pro Arg Tyr Val Asp Tyr Phe Leu Pro Ile Val Ala Asp Ala Glu Ala
145 150 155 160
Gly Phe Gly Gly Val Leu Asn Ala Phe Glu Leu Met Lys Ala Met Ile
165 170 175
Glu Ala Gly Ala Ala Ala Val His Phe Glu Asp Gln Leu Ala Ser Val
180 185 190
Lys Lys Cys Gly His Met Gly Gly Lys Val Leu Val Pro Thr Gln Glu
195 200 205
Ala Ile Gln Lys Leu Val Ala Ala Arg Leu Ala Ala Asp Val Thr Gly
210 215 220
Val Pro Thr Leu Leu Val Ala Arg Thr Asp Ala Asp Ala Ala Asp Leu
225 230 235 240
Ile Thr Ser Asp Cys Asp Pro Tyr Asp Ser Glu Phe Ile Thr Gly Glu
245 250 255
Arg Thr Ser Glu Gly Phe Phe Arg Thr His Ala Gly Ile Glu Gln Ala
260 265 270
Ile Ser Arg Gly Leu Ala Tyr Ala Pro Tyr Ala Asp Leu Val Trp Cys
275 280 285
Glu Thr Ser Thr Pro Asp Leu Glu Leu Ala Arg Arg Phe Ala Gln Ala
290 295 300
Ile His Ala Lys Tyr Pro Gly Lys Leu Leu Ala Tyr Asn Cys Ser Pro
305 310 315 320
Ser Phe Asn Trp Gln Lys Asn Leu Asp Asp Lys Thr Ile Ala Ser Phe
325 330 335
Gln Gln Gln Leu Ser Asp Met Gly Tyr Lys Phe Gln Phe Ile Thr Leu
340 345 350
Ala Gly Ile His Ser Met Trp Phe Asn Met Phe Asp Leu Ala Asn Ala
355 360 365
Tyr Ala Gln Gly Glu Gly Met Lys His Tyr Val Glu Lys Val Gln Gln
370 375 380
Pro Glu Phe Ala Ala Ala Lys Asp Gly Tyr Thr Phe Val Ser His Gln
385 390 395 400
Gln Glu Val Gly Thr Gly Tyr Phe Asp Lys Val Thr Thr Ile Ile Gln
405 410 415
Gly Gly Thr Ser Ser Val Thr Ala Leu Thr Gly Ser Thr Glu Glu Ser
420 425 430
Gln Phe
<210> 13
<211> 1218
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 13
atgacagtta ctccacattt atttataccg ggcccaacaa acataccaga tgcagtacgt 60
atggcaatga atatacctat ggaagacatg cgttcaccag agttcccaaa atttacatta 120
cctttatttg aggatttaaa aaaagcattt aagatgaaag atggaagagt ttttatattt 180
ccatcttcag gaacaggcgc atgggaatca gctgtagaaa acactcttgc cactggagat 240
aaggttttaa tgtcaagatt tggacaattt tctttgctat gggtagatat gtgtgaaaga 300
ttgggattaa aagttgaagt atgtgatgaa gaatggggaa caggagtgcc agtagaaaaa 360
tatgctgata tacttgctaa agataaaaat catgaaataa aggctgtttt tgtaactcac 420
aatgaaacag caacaggtgt ttcttcagat gtggctggtg taagaaaagc acttgacgca 480
gcaaagcatc cagcactttt gatggtggat ggagtatcat cagttggttc tcttgatatg 540
agaatgggtg aatggggagt tgattgctgt gtatctggaa gccaaaaggg ttttatgctt 600
cctacaggtt tgggcatttt agctgtgtca cagaaggcat tagatattaa taaatcaaag 660
aatggcagaa tgaatagatg ctttttttcc tttgaggata tgataaaaac taatgatcag 720
ggtttttttc cttatacccc cgccactcaa ttattgagag gattaagaac ttctctcgat 780
cttttgttcg cagaaggact agataatgta tttgcaagac atactagatt agctagtgga 840
gttagggctg ccgtagatgc atggggatta aaattgtgtg caaaagaacc taaatggtat 900
tccgatactg tatcagcaat tttagttcca gaaggtattg attccaatgc tataacaaaa 960
acagcttatt atagatataa tacaagtttt ggtcttggat taaataaggt tgcaggaaaa 1020
gtattcagaa taggccattt aggtatgtta gatgaagtaa tgataggcgg cgctttattt 1080
gcagcagaga tggcacttaa agataatgga gtaaatctaa aattaggatc tggaacaggt 1140
gcagctgctg aatattttag taaaaatgct acaaagtctg ctactgcttt aactccaaaa 1200
caagcaaaag cggcataa 1218
<210> 14
<211> 405
<212> PRT
<213> Hyphomicrobium methylovorum
<400> 14
Met Thr Val Thr Pro His Leu Phe Ile Pro Gly Pro Thr Asn Ile Pro
1 5 10 15
Asp Ala Val Arg Met Ala Met Asn Ile Pro Met Glu Asp Met Arg Ser
20 25 30
Pro Glu Phe Pro Lys Phe Thr Leu Pro Leu Phe Glu Asp Leu Lys Lys
35 40 45
Ala Phe Lys Met Lys Asp Gly Arg Val Phe Ile Phe Pro Ser Ser Gly
50 55 60
Thr Gly Ala Trp Glu Ser Ala Val Glu Asn Thr Leu Ala Thr Gly Asp
65 70 75 80
Lys Val Leu Met Ser Arg Phe Gly Gln Phe Ser Leu Leu Trp Val Asp
85 90 95
Met Cys Glu Arg Leu Gly Leu Lys Val Glu Val Cys Asp Glu Glu Trp
100 105 110
Gly Thr Gly Val Pro Val Glu Lys Tyr Ala Asp Ile Leu Ala Lys Asp
115 120 125
Lys Asn His Glu Ile Lys Ala Val Phe Val Thr His Asn Glu Thr Ala
130 135 140
Thr Gly Val Ser Ser Asp Val Ala Gly Val Arg Lys Ala Leu Asp Ala
145 150 155 160
Ala Lys His Pro Ala Leu Leu Met Val Asp Gly Val Ser Ser Val Gly
165 170 175
Ser Leu Asp Met Arg Met Gly Glu Trp Gly Val Asp Cys Cys Val Ser
180 185 190
Gly Ser Gln Lys Gly Phe Met Leu Pro Thr Gly Leu Gly Ile Leu Ala
195 200 205
Val Ser Gln Lys Ala Leu Asp Ile Asn Lys Ser Lys Asn Gly Arg Met
210 215 220
Asn Arg Cys Phe Phe Ser Phe Glu Asp Met Ile Lys Thr Asn Asp Gln
225 230 235 240
Gly Phe Phe Pro Tyr Thr Pro Ala Thr Gln Leu Leu Arg Gly Leu Arg
245 250 255
Thr Ser Leu Asp Leu Leu Phe Ala Glu Gly Leu Asp Asn Val Phe Ala
260 265 270
Arg His Thr Arg Leu Ala Ser Gly Val Arg Ala Ala Val Asp Ala Trp
275 280 285
Gly Leu Lys Leu Cys Ala Lys Glu Pro Lys Trp Tyr Ser Asp Thr Val
290 295 300
Ser Ala Ile Leu Val Pro Glu Gly Ile Asp Ser Asn Ala Ile Thr Lys
305 310 315 320
Thr Ala Tyr Tyr Arg Tyr Asn Thr Ser Phe Gly Leu Gly Leu Asn Lys
325 330 335
Val Ala Gly Lys Val Phe Arg Ile Gly His Leu Gly Met Leu Asp Glu
340 345 350
Val Met Ile Gly Gly Ala Leu Phe Ala Ala Glu Met Ala Leu Lys Asp
355 360 365
Asn Gly Val Asn Leu Lys Leu Gly Ser Gly Thr Gly Ala Ala Ala Glu
370 375 380
Tyr Phe Ser Lys Asn Ala Thr Lys Ser Ala Thr Ala Leu Thr Pro Lys
385 390 395 400
Gln Ala Lys Ala Ala
405
<210> 15
<211> 1185
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 15
atgcaattta ggccttttaa tccaccagtt agaactctta tgggaccagg accaagcgat 60
gtacacccaa gaatattaga ggctatgagc cgtcctacaa taggacattt ggatcctgct 120
tttatacaga tgatggaaga agtaaaaact ttacttcagt atgcatttca aactaaaaat 180
gaacttacta tgccagtaag tgccccaggc tctgcaggca tggaaacatg ctttgccaac 240
ttagtagaac caggtgatca ggttatagtt tgccagaatg gtgtatttgg cggcagaatg 300
aaagaaaatg tagaaagatg tggcggcata cctataatgg ttgaagatac ttggggagag 360
gctgttgatc cagataaatt ggagactgca ttaaaggcta atccagaggc ttgtatagtg 420
gcatttgttc atgctgaaac tagtactggt gcacaaagtg atgctgaaac attggtaaaa 480
ttagctcatc agtatgattg tcttactata gttgatgctg ttacatcact tggcggcact 540
ccaataaagg tagatgaatg ggaaatagat gctatttata gtggaactca gaaatgcctt 600
tcatgtactc caggactttc accagtaagt ttcaatgaaa gggctcttga aaaaattagg 660
aacagaaaac aaaaagttca gtcgtggttt atggatttaa atctagttat gggatattgg 720
ggcggcggcg caaagcgtgc ttatcatcat acagcaccaa ttaatgcttt atatggactt 780
catgaggcac ttttgatgct tcaggaagag ggattagaga acgcatgggc aaggcaccaa 840
aaaaatcatc ttgctttacg ggctggactg gaagcaatgg gcctcacttt tatagtaaat 900
gaaggagata gactgcctca gttaaatgct gtatctatac cagagggagt tgatgatggt 960
gctgttagat caaggcttct aaacgaatat aacttagaaa ttggtgctgg gttaggtgct 1020
ttagctggga aggtatggag aataggctta atgggtcatg caagtagagc agaaaatatt 1080
ctcttatgca taagttcatt agaggctata ttaagtgaga tgggtgctga catatctcaa 1140
ggtgtggcta ttccagcaat gcagaaggca cttgcgcaag cataa 1185
<210> 16
<211> 394
<212> PRT
<213> Sedimenticola thiotaurini
<400> 16
Met Gln Phe Arg Pro Phe Asn Pro Pro Val Arg Thr Leu Met Gly Pro
1 5 10 15
Gly Pro Ser Asp Val His Pro Arg Ile Leu Glu Ala Met Ser Arg Pro
20 25 30
Thr Ile Gly His Leu Asp Pro Ala Phe Ile Gln Met Met Glu Glu Val
35 40 45
Lys Thr Leu Leu Gln Tyr Ala Phe Gln Thr Lys Asn Glu Leu Thr Met
50 55 60
Pro Val Ser Ala Pro Gly Ser Ala Gly Met Glu Thr Cys Phe Ala Asn
65 70 75 80
Leu Val Glu Pro Gly Asp Gln Val Ile Val Cys Gln Asn Gly Val Phe
85 90 95
Gly Gly Arg Met Lys Glu Asn Val Glu Arg Cys Gly Gly Ile Pro Ile
100 105 110
Met Val Glu Asp Thr Trp Gly Glu Ala Val Asp Pro Asp Lys Leu Glu
115 120 125
Thr Ala Leu Lys Ala Asn Pro Glu Ala Cys Ile Val Ala Phe Val His
130 135 140
Ala Glu Thr Ser Thr Gly Ala Gln Ser Asp Ala Glu Thr Leu Val Lys
145 150 155 160
Leu Ala His Gln Tyr Asp Cys Leu Thr Ile Val Asp Ala Val Thr Ser
165 170 175
Leu Gly Gly Thr Pro Ile Lys Val Asp Glu Trp Glu Ile Asp Ala Ile
180 185 190
Tyr Ser Gly Thr Gln Lys Cys Leu Ser Cys Thr Pro Gly Leu Ser Pro
195 200 205
Val Ser Phe Asn Glu Arg Ala Leu Glu Lys Ile Arg Asn Arg Lys Gln
210 215 220
Lys Val Gln Ser Trp Phe Met Asp Leu Asn Leu Val Met Gly Tyr Trp
225 230 235 240
Gly Gly Gly Ala Lys Arg Ala Tyr His His Thr Ala Pro Ile Asn Ala
245 250 255
Leu Tyr Gly Leu His Glu Ala Leu Leu Met Leu Gln Glu Glu Gly Leu
260 265 270
Glu Asn Ala Trp Ala Arg His Gln Lys Asn His Leu Ala Leu Arg Ala
275 280 285
Gly Leu Glu Ala Met Gly Leu Thr Phe Ile Val Asn Glu Gly Asp Arg
290 295 300
Leu Pro Gln Leu Asn Ala Val Ser Ile Pro Glu Gly Val Asp Asp Gly
305 310 315 320
Ala Val Arg Ser Arg Leu Leu Asn Glu Tyr Asn Leu Glu Ile Gly Ala
325 330 335
Gly Leu Gly Ala Leu Ala Gly Lys Val Trp Arg Ile Gly Leu Met Gly
340 345 350
His Ala Ser Arg Ala Glu Asn Ile Leu Leu Cys Ile Ser Ser Leu Glu
355 360 365
Ala Ile Leu Ser Glu Met Gly Ala Asp Ile Ser Gln Gly Val Ala Ile
370 375 380
Pro Ala Met Gln Lys Ala Leu Ala Gln Ala
385 390
<210> 17
<211> 1185
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 17
atgcggactc attcatttca cccaccagtt agaactctta tgggaccagg accttctgat 60
gtaaatccaa gagtacttga ggcaatgtca cgacctacaa ttggacactt agatcctgta 120
tttgtagata tgatggaaga attaaagagt ttgcttcaat atgcatttca aacaggaaat 180
caattaacta tgcctgtaag tggacctggc tcagctggaa tggaaacatg ttttgttaat 240
ctagttgaac ctggagataa agtaatagtt tgtcaaaatg gagtatttgg cggcaggatg 300
aaagaaaatg tagaaagatg tggcggcaca gcagtcatgg tggaagatgc atggggttcc 360
gcagttgacc cacaaaaact taaagatgca cttcaggcac atcctgatgc taaattagtt 420
gcttttgttc atgctgaaac tagtacagga gcacaaagcg atgcaaaggc tttagtagaa 480
attgctcata gacatgactg cttagtaatt gtggatacag ttacctcatt aggcggcact 540
cctgtaaaag tagatgaatg gggaatagat gcagtttatt caggaaccca aaaatgctta 600
tcatgtaccc caggtctttc accagtatct ttctctgaaa gggctatgga aagaataaaa 660
cataggaaaa ctaaagtaca gtcttggttt atggatttaa atcttgttat gggctattgg 720
ggatcaggag caaaaagggc ttatcatcat actgctccta taaatgcatt gtacggtctt 780
cacgaagcat tagttatact tcaagaagag gggttagaaa atgcatgggc aagacatgct 840
catgctcata gagcactatt agctggtatt gaagcaatgg gattaaaatt tgtagtaaag 900
gaagatgaac ggttaccgca attaaatgct gtaggtattc cagaaggcgt agatgatgca 960
gctgtgcgtg cccagctcct tcaagattat aaccacgaaa taggtgctgg tcttggacct 1020
atggcaggaa aaatctggag aataggtctt atgggctatg gtgctaatcc taaaaatgta 1080
cttttctgct taggagcatt agaggatgta ctttcgcgca tgagggctcc tatagaaaga 1140
ggtgctgctc ttccagcagc tcatgctgca cttggcgctg cataa 1185
<210> 18
<211> 394
<212> PRT
<213> Thermithiobacillus tepidarius
<400> 18
Met Arg Thr His Ser Phe His Pro Pro Val Arg Thr Leu Met Gly Pro
1 5 10 15
Gly Pro Ser Asp Val Asn Pro Arg Val Leu Glu Ala Met Ser Arg Pro
20 25 30
Thr Ile Gly His Leu Asp Pro Val Phe Val Asp Met Met Glu Glu Leu
35 40 45
Lys Ser Leu Leu Gln Tyr Ala Phe Gln Thr Gly Asn Gln Leu Thr Met
50 55 60
Pro Val Ser Gly Pro Gly Ser Ala Gly Met Glu Thr Cys Phe Val Asn
65 70 75 80
Leu Val Glu Pro Gly Asp Lys Val Ile Val Cys Gln Asn Gly Val Phe
85 90 95
Gly Gly Arg Met Lys Glu Asn Val Glu Arg Cys Gly Gly Thr Ala Val
100 105 110
Met Val Glu Asp Ala Trp Gly Ser Ala Val Asp Pro Gln Lys Leu Lys
115 120 125
Asp Ala Leu Gln Ala His Pro Asp Ala Lys Leu Val Ala Phe Val His
130 135 140
Ala Glu Thr Ser Thr Gly Ala Gln Ser Asp Ala Lys Ala Leu Val Glu
145 150 155 160
Ile Ala His Arg His Asp Cys Leu Val Ile Val Asp Thr Val Thr Ser
165 170 175
Leu Gly Gly Thr Pro Val Lys Val Asp Glu Trp Gly Ile Asp Ala Val
180 185 190
Tyr Ser Gly Thr Gln Lys Cys Leu Ser Cys Thr Pro Gly Leu Ser Pro
195 200 205
Val Ser Phe Ser Glu Arg Ala Met Glu Arg Ile Lys His Arg Lys Thr
210 215 220
Lys Val Gln Ser Trp Phe Met Asp Leu Asn Leu Val Met Gly Tyr Trp
225 230 235 240
Gly Ser Gly Ala Lys Arg Ala Tyr His His Thr Ala Pro Ile Asn Ala
245 250 255
Leu Tyr Gly Leu His Glu Ala Leu Val Ile Leu Gln Glu Glu Gly Leu
260 265 270
Glu Asn Ala Trp Ala Arg His Ala His Ala His Arg Ala Leu Leu Ala
275 280 285
Gly Ile Glu Ala Met Gly Leu Lys Phe Val Val Lys Glu Asp Glu Arg
290 295 300
Leu Pro Gln Leu Asn Ala Val Gly Ile Pro Glu Gly Val Asp Asp Ala
305 310 315 320
Ala Val Arg Ala Gln Leu Leu Gln Asp Tyr Asn His Glu Ile Gly Ala
325 330 335
Gly Leu Gly Pro Met Ala Gly Lys Ile Trp Arg Ile Gly Leu Met Gly
340 345 350
Tyr Gly Ala Asn Pro Lys Asn Val Leu Phe Cys Leu Gly Ala Leu Glu
355 360 365
Asp Val Leu Ser Arg Met Arg Ala Pro Ile Glu Arg Gly Ala Ala Leu
370 375 380
Pro Ala Ala His Ala Ala Leu Gly Ala Ala
385 390
<210> 19
<211> 1125
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 19
atgagaactc catttattat gaccccagga ccaacacaag ttcatgaaga agtaagaaag 60
gctatgtcca gagaagcaac taatcctgat ttagatgaaa atttctacga gttctataaa 120
aatacctgta ataagataaa aagattatta aatacagaaa atcaggtatt aattcttgat 180
ggcgaaggta ttttaggttt ggaagcagct tgtgcaagct taactgaaca aggagataga 240
gtactttgta tagataatgg tatttttgga aagggttttg gtgatttttc taaaatgtat 300
ggcggcgaag ttgtatactt cgagtctgat tatagaaagg gtatagatgt agaaaaactt 360
gaagagttcc ttaaaagaga ttctaacttc aaatacgcga cactagtaca ctgtgaaaca 420
ccagcgggta taactaatcc tatagataag atatgtactt tattaaataa atatggtgtg 480
ctttcagtag tagatagtgt aagttcagta ggcggcgatg aaataaatgt agatgagtgg 540
aaaatagata tagctttagg cggctctcaa aagtgtatat cagcgccatc aggattaact 600
ttcctttcaa tttcagaaaa agcaatggat actatgataa atagaaaaac tcctatagca 660
gcattttatt gtaatcttac aatttggaaa ggttggtatg aagaaaagtg gttcccttat 720
actcagccaa ttaatgcaat atatgcactt gattgtgctt tagatagact tttagaaaca 780
gattatataa atagacataa aacaatagct aatgctacaa gagaagccct tgtaaaaagt 840
ggacttgaat tgtatccttt agattcctat tcaaatactg taactacttt tcttgtacca 900
gaaggaataa attttgaaga tgtatttgaa gatatgatga aagatcacaa cataatgata 960
ggcggcgctt ttgattattt aaaaggaaaa gttattagaa taggacacat gggcgaaaac 1020
tgctatgaag aaaaaatata tataacttta aaggcacttg atacagtttt aaaaaaatat 1080
ggagcaaaac taaacggaga gatttacaag cactttgtag attag 1125
<210> 20
<211> 374
<212> PRT
<213> Clostridium acidi-urici
<400> 20
Met Arg Thr Pro Phe Ile Met Thr Pro Gly Pro Thr Gln Val His Glu
1 5 10 15
Glu Val Arg Lys Ala Met Ser Arg Glu Ala Thr Asn Pro Asp Leu Asp
20 25 30
Glu Asn Phe Tyr Glu Phe Tyr Lys Asn Thr Cys Asn Lys Ile Lys Arg
35 40 45
Leu Leu Asn Thr Glu Asn Gln Val Leu Ile Leu Asp Gly Glu Gly Ile
50 55 60
Leu Gly Leu Glu Ala Ala Cys Ala Ser Leu Thr Glu Gln Gly Asp Arg
65 70 75 80
Val Leu Cys Ile Asp Asn Gly Ile Phe Gly Lys Gly Phe Gly Asp Phe
85 90 95
Ser Lys Met Tyr Gly Gly Glu Val Val Tyr Phe Glu Ser Asp Tyr Arg
100 105 110
Lys Gly Ile Asp Val Glu Lys Leu Glu Glu Phe Leu Lys Arg Asp Ser
115 120 125
Asn Phe Lys Tyr Ala Thr Leu Val His Cys Glu Thr Pro Ala Gly Ile
130 135 140
Thr Asn Pro Ile Asp Lys Ile Cys Thr Leu Leu Asn Lys Tyr Gly Val
145 150 155 160
Leu Ser Val Val Asp Ser Val Ser Ser Val Gly Gly Asp Glu Ile Asn
165 170 175
Val Asp Glu Trp Lys Ile Asp Ile Ala Leu Gly Gly Ser Gln Lys Cys
180 185 190
Ile Ser Ala Pro Ser Gly Leu Thr Phe Leu Ser Ile Ser Glu Lys Ala
195 200 205
Met Asp Thr Met Ile Asn Arg Lys Thr Pro Ile Ala Ala Phe Tyr Cys
210 215 220
Asn Leu Thr Ile Trp Lys Gly Trp Tyr Glu Glu Lys Trp Phe Pro Tyr
225 230 235 240
Thr Gln Pro Ile Asn Ala Ile Tyr Ala Leu Asp Cys Ala Leu Asp Arg
245 250 255
Leu Leu Glu Thr Asp Tyr Ile Asn Arg His Lys Thr Ile Ala Asn Ala
260 265 270
Thr Arg Glu Ala Leu Val Lys Ser Gly Leu Glu Leu Tyr Pro Leu Asp
275 280 285
Ser Tyr Ser Asn Thr Val Thr Thr Phe Leu Val Pro Glu Gly Ile Asn
290 295 300
Phe Glu Asp Val Phe Glu Asp Met Met Lys Asp His Asn Ile Met Ile
305 310 315 320
Gly Gly Ala Phe Asp Tyr Leu Lys Gly Lys Val Ile Arg Ile Gly His
325 330 335
Met Gly Glu Asn Cys Tyr Glu Glu Lys Ile Tyr Ile Thr Leu Lys Ala
340 345 350
Leu Asp Thr Val Leu Lys Lys Tyr Gly Ala Lys Leu Asn Gly Glu Ile
355 360 365
Tyr Lys His Phe Val Asp
370
<210> 21
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 21
atgggaaaat ttttaaaaaa gcactatata atggcgccag gacctacacc agtaccaaat 60
gatatattaa ctgaaggggc taaagaaact atacaccatc gcacgcccca atttgtatct 120
ataatggaag agacactgga atcagccaaa tatatcttcc aaactaagca caatgtttat 180
gcatttgcat ctacaggtac aggtgctatg gaagcagcag ttgctaactt ggtaagtcca 240
ggtgacaagg ttatagtagt agttgcagga aaatttgggg agagatggag agaactttgt 300
caggcttatg gtgctgatat agtagagatt gccttggagt ggggagatgc tgttactcct 360
gaacaaattg aagaagcctt aaataaaaat cctgatgcta aagtagtatt tacaacttat 420
tctgaaacat caactggaac agttatagat cttgaaggaa tagctagagt tactaaagaa 480
aaagatgtgg ttctggttac agatgcagtt tcggcattag gtgctgagcc attaaaaatg 540
gatgaatggg gagtagactt agtggttaca ggttctcaaa agggacttat gcttccacca 600
ggacttgcat taataagctt aaatgataaa gcatggggat tagtagaaaa atccagatca 660
ccaagatatt actttgatct tagagcatac agaaaaagct atccagataa cccatacaca 720
ccagcagtaa atatgatata tatgctgaga aaggctcttc agatgataaa ggaagaaggt 780
attgaaaatg tatgggaaag gcatagaata ctgggtgatg ctaccagagc agcagttaaa 840
gcattagggt tagaattact gtcaaagcgt ccgggaaatg tagttacagc tgtaaaagtt 900
ccagaaggta ttgatggtaa acaaatacct aaaataatga gagataaata tggagttacc 960
attgcaggcg gccaggctaa attaaaaggt aaaattttcc gtattgccca tttaggatat 1020
atgagtccat ttgatactat cactgctata tctgcattag aacttacatt aaaggaactt 1080
ggatatgaat ttgaattagg agttggagta aaggctgcag aggcagtatt tgctaaagaa 1140
tttataggag aataa 1155
<210> 22
<211> 384
<212> PRT
<213> Thermotoga maritima
<400> 22
Met Gly Lys Phe Leu Lys Lys His Tyr Ile Met Ala Pro Gly Pro Thr
1 5 10 15
Pro Val Pro Asn Asp Ile Leu Thr Glu Gly Ala Lys Glu Thr Ile His
20 25 30
His Arg Thr Pro Gln Phe Val Ser Ile Met Glu Glu Thr Leu Glu Ser
35 40 45
Ala Lys Tyr Ile Phe Gln Thr Lys His Asn Val Tyr Ala Phe Ala Ser
50 55 60
Thr Gly Thr Gly Ala Met Glu Ala Ala Val Ala Asn Leu Val Ser Pro
65 70 75 80
Gly Asp Lys Val Ile Val Val Val Ala Gly Lys Phe Gly Glu Arg Trp
85 90 95
Arg Glu Leu Cys Gln Ala Tyr Gly Ala Asp Ile Val Glu Ile Ala Leu
100 105 110
Glu Trp Gly Asp Ala Val Thr Pro Glu Gln Ile Glu Glu Ala Leu Asn
115 120 125
Lys Asn Pro Asp Ala Lys Val Val Phe Thr Thr Tyr Ser Glu Thr Ser
130 135 140
Thr Gly Thr Val Ile Asp Leu Glu Gly Ile Ala Arg Val Thr Lys Glu
145 150 155 160
Lys Asp Val Val Leu Val Thr Asp Ala Val Ser Ala Leu Gly Ala Glu
165 170 175
Pro Leu Lys Met Asp Glu Trp Gly Val Asp Leu Val Val Thr Gly Ser
180 185 190
Gln Lys Gly Leu Met Leu Pro Pro Gly Leu Ala Leu Ile Ser Leu Asn
195 200 205
Asp Lys Ala Trp Gly Leu Val Glu Lys Ser Arg Ser Pro Arg Tyr Tyr
210 215 220
Phe Asp Leu Arg Ala Tyr Arg Lys Ser Tyr Pro Asp Asn Pro Tyr Thr
225 230 235 240
Pro Ala Val Asn Met Ile Tyr Met Leu Arg Lys Ala Leu Gln Met Ile
245 250 255
Lys Glu Glu Gly Ile Glu Asn Val Trp Glu Arg His Arg Ile Leu Gly
260 265 270
Asp Ala Thr Arg Ala Ala Val Lys Ala Leu Gly Leu Glu Leu Leu Ser
275 280 285
Lys Arg Pro Gly Asn Val Val Thr Ala Val Lys Val Pro Glu Gly Ile
290 295 300
Asp Gly Lys Gln Ile Pro Lys Ile Met Arg Asp Lys Tyr Gly Val Thr
305 310 315 320
Ile Ala Gly Gly Gln Ala Lys Leu Lys Gly Lys Ile Phe Arg Ile Ala
325 330 335
His Leu Gly Tyr Met Ser Pro Phe Asp Thr Ile Thr Ala Ile Ser Ala
340 345 350
Leu Glu Leu Thr Leu Lys Glu Leu Gly Tyr Glu Phe Glu Leu Gly Val
355 360 365
Gly Val Lys Ala Ala Glu Ala Val Phe Ala Lys Glu Phe Ile Gly Glu
370 375 380
<210> 23
<211> 1230
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 23
atgaatttaa gagaaactgc actgaaattt cataaagata acgaaggtaa aatagcacta 60
aaatgcaaag ttccagtaaa aaataaagaa gatttgacac ttgcctatac accaggagtt 120
gctgaacctt gtctagaaat aaataagaat cctgaatgca tatatgatta tacatctaaa 180
ggtaactggg tagcagtagt aacaaatgga accgcagtat taggcttagg aaatattggt 240
gctggggctg gtcttccagt tatggaaggt aaatctgtcc ttttcaaaac ttttgctggt 300
gtagatgcat ttccaatctg cttggaatca aaagatataa atgaaatagt agctgcagta 360
aaattaatgg aacctacatt tggcggcata aatttagagg atataaaggc accagaatgt 420
tttgaaatag aatcaaaact taaagaggtc tgtaatatac cagtattcca tgatgatcag 480
catggtactg cagttgtatc ttctgcatgt cttataaatg cactaaaaat agtaaataag 540
aaatttgagg acctaaaaat agtagtaaat ggtgcgggtg ctgctggaac agctattact 600
aaattactta taaaaatggg tacaaaaaat gtaatacttt gtgacactaa gggcgctatt 660
tataagagaa ggcctatagg catgaataag ttcaaagatg aaatggctga aataacaaat 720
ccaaatcttc aaaaaggcac actagcagat gtattaaaag gtgctgatgt cttccttgga 780
gtttctgctg caaattgtgt tacagaagaa atggtaaaat caatgaataa ggattcaata 840
ataatggcaa tggctaatcc aaacccagaa atattaccag atttagctat aaaggctggt 900
gctaaagtag tatgtactgg acggagtgac tttcctaacc aagtaaacaa tgttttagct 960
tttcccggta tatttagagg agcgttggat gtaagagcat cagaaataaa tgatgaaatg 1020
aaaattgctg ctgcttatgc tatagcagaa ttagtttcag aagaagaatt aaaacctgat 1080
tatattatac caaatgcatt tgatttgaga atagctccta aagtagcagc ttatgtagca 1140
aaagcagcaa tagatacagg agtggcaaga aagaaagatg ttacaccaga aatggttgaa 1200
aagcacacaa aaactttgct tggcatttaa 1230
<210> 24
<211> 409
<212> PRT
<213> Clostridium autoethanogenum
<400> 24
Met Asn Leu Arg Glu Thr Ala Leu Lys Phe His Lys Asp Asn Glu Gly
1 5 10 15
Lys Ile Ala Leu Lys Cys Lys Val Pro Val Lys Asn Lys Glu Asp Leu
20 25 30
Thr Leu Ala Tyr Thr Pro Gly Val Ala Glu Pro Cys Leu Glu Ile Asn
35 40 45
Lys Asn Pro Glu Cys Ile Tyr Asp Tyr Thr Ser Lys Gly Asn Trp Val
50 55 60
Ala Val Val Thr Asn Gly Thr Ala Val Leu Gly Leu Gly Asn Ile Gly
65 70 75 80
Ala Gly Ala Gly Leu Pro Val Met Glu Gly Lys Ser Val Leu Phe Lys
85 90 95
Thr Phe Ala Gly Val Asp Ala Phe Pro Ile Cys Leu Glu Ser Lys Asp
100 105 110
Ile Asn Glu Ile Val Ala Ala Val Lys Leu Met Glu Pro Thr Phe Gly
115 120 125
Gly Ile Asn Leu Glu Asp Ile Lys Ala Pro Glu Cys Phe Glu Ile Glu
130 135 140
Ser Lys Leu Lys Glu Val Cys Asn Ile Pro Val Phe His Asp Asp Gln
145 150 155 160
His Gly Thr Ala Val Val Ser Ser Ala Cys Leu Ile Asn Ala Leu Lys
165 170 175
Ile Val Asn Lys Lys Phe Glu Asp Leu Lys Ile Val Val Asn Gly Ala
180 185 190
Gly Ala Ala Gly Thr Ala Ile Thr Lys Leu Leu Ile Lys Met Gly Thr
195 200 205
Lys Asn Val Ile Leu Cys Asp Thr Lys Gly Ala Ile Tyr Lys Arg Arg
210 215 220
Pro Ile Gly Met Asn Lys Phe Lys Asp Glu Met Ala Glu Ile Thr Asn
225 230 235 240
Pro Asn Leu Gln Lys Gly Thr Leu Ala Asp Val Leu Lys Gly Ala Asp
245 250 255
Val Phe Leu Gly Val Ser Ala Ala Asn Cys Val Thr Glu Glu Met Val
260 265 270
Lys Ser Met Asn Lys Asp Ser Ile Ile Met Ala Met Ala Asn Pro Asn
275 280 285
Pro Glu Ile Leu Pro Asp Leu Ala Ile Lys Ala Gly Ala Lys Val Val
290 295 300
Cys Thr Gly Arg Ser Asp Phe Pro Asn Gln Val Asn Asn Val Leu Ala
305 310 315 320
Phe Pro Gly Ile Phe Arg Gly Ala Leu Asp Val Arg Ala Ser Glu Ile
325 330 335
Asn Asp Glu Met Lys Ile Ala Ala Ala Tyr Ala Ile Ala Glu Leu Val
340 345 350
Ser Glu Glu Glu Leu Lys Pro Asp Tyr Ile Ile Pro Asn Ala Phe Asp
355 360 365
Leu Arg Ile Ala Pro Lys Val Ala Ala Tyr Val Ala Lys Ala Ala Ile
370 375 380
Asp Thr Gly Val Ala Arg Lys Lys Asp Val Thr Pro Glu Met Val Glu
385 390 395 400
Lys His Thr Lys Thr Leu Leu Gly Ile
405
<210> 25
<211> 1173
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 25
atgaatgtaa aagaaaaatc acttaagctg catagagaaa aacatggaac aatagaaata 60
gtaggaacaa tgcctttaag aaatggtgat gatcttgcag tagcttatac tcctggagta 120
gctggtcctt gcttagaaat agctaaggat gaagaaaagg cttatgaata tactataaaa 180
ggaaaaacag ttgctgtagt tactaatggt acagctgttc ttggacttgg aaatatagga 240
cctgctgcag gacttcctgt tgtagaagga aaggctttac ttttgaaaag atttgcaaat 300
gtaaatgcta tacctatatg tgtagattct acagatccag atgatatcgt taatacaata 360
aaaaatatag ctccaggatt tggcggcata catctggaag atataaaggc tccagaatgt 420
ttctacatag aagataaact taaggaagaa ttagatatac ctatatacca tgatgatcaa 480
catggtactg ccatcgctgt tttagctgga ttgtataatg cattaaaaat agttaacaaa 540
gatatatcag atataaaagt tgtaataaat ggtgctggtg ctagtggtat agctacagca 600
aaacttctca tatctgcagg agtaaaaaat attgtccttt gtgacattaa tggaatagtt 660
tatgaaggtg acaattgctt aaatgagcct cagaaacaaa tagcaaaagt aactaacaga 720
ggactggcaa agggaacatt aaaagatgct atgaaaaatg cagatgtatt cattggagtt 780
tctgctggta atgtggtaac tggagaaatg gttgaaggta tgaataaaga ttctataata 840
tttgctttag ctaatcctac accagaaatt atgcctgaag aagcaaaaaa ggctggtgct 900
aaagttatag caacaggaag atctgatttt ccaaaccaaa ttaacaatgt tcttgtattc 960
cctggtatct tcaaaggtgc tctttcagta agggctaagg aaatatgtga cgaaatgaaa 1020
atagcagctg caaagggact agcaaatcta gtaaagaagg acgagcttaa tgaagaatat 1080
ataataccat cagttttcaa tagaaatgta tgtgatgcag tttccaaggc tgttatggat 1140
gtagcacaaa aaaataataa atttactgca taa 1173
<210> 26
<211> 390
<212> PRT
<213> Clostridium autoethanogenum
<400> 26
Met Asn Val Lys Glu Lys Ser Leu Lys Leu His Arg Glu Lys His Gly
1 5 10 15
Thr Ile Glu Ile Val Gly Thr Met Pro Leu Arg Asn Gly Asp Asp Leu
20 25 30
Ala Val Ala Tyr Thr Pro Gly Val Ala Gly Pro Cys Leu Glu Ile Ala
35 40 45
Lys Asp Glu Glu Lys Ala Tyr Glu Tyr Thr Ile Lys Gly Lys Thr Val
50 55 60
Ala Val Val Thr Asn Gly Thr Ala Val Leu Gly Leu Gly Asn Ile Gly
65 70 75 80
Pro Ala Ala Gly Leu Pro Val Val Glu Gly Lys Ala Leu Leu Leu Lys
85 90 95
Arg Phe Ala Asn Val Asn Ala Ile Pro Ile Cys Val Asp Ser Thr Asp
100 105 110
Pro Asp Asp Ile Val Asn Thr Ile Lys Asn Ile Ala Pro Gly Phe Gly
115 120 125
Gly Ile His Leu Glu Asp Ile Lys Ala Pro Glu Cys Phe Tyr Ile Glu
130 135 140
Asp Lys Leu Lys Glu Glu Leu Asp Ile Pro Ile Tyr His Asp Asp Gln
145 150 155 160
His Gly Thr Ala Ile Ala Val Leu Ala Gly Leu Tyr Asn Ala Leu Lys
165 170 175
Ile Val Asn Lys Asp Ile Ser Asp Ile Lys Val Val Ile Asn Gly Ala
180 185 190
Gly Ala Ser Gly Ile Ala Thr Ala Lys Leu Leu Ile Ser Ala Gly Val
195 200 205
Lys Asn Ile Val Leu Cys Asp Ile Asn Gly Ile Val Tyr Glu Gly Asp
210 215 220
Asn Cys Leu Asn Glu Pro Gln Lys Gln Ile Ala Lys Val Thr Asn Arg
225 230 235 240
Gly Leu Ala Lys Gly Thr Leu Lys Asp Ala Met Lys Asn Ala Asp Val
245 250 255
Phe Ile Gly Val Ser Ala Gly Asn Val Val Thr Gly Glu Met Val Glu
260 265 270
Gly Met Asn Lys Asp Ser Ile Ile Phe Ala Leu Ala Asn Pro Thr Pro
275 280 285
Glu Ile Met Pro Glu Glu Ala Lys Lys Ala Gly Ala Lys Val Ile Ala
290 295 300
Thr Gly Arg Ser Asp Phe Pro Asn Gln Ile Asn Asn Val Leu Val Phe
305 310 315 320
Pro Gly Ile Phe Lys Gly Ala Leu Ser Val Arg Ala Lys Glu Ile Cys
325 330 335
Asp Glu Met Lys Ile Ala Ala Ala Lys Gly Leu Ala Asn Leu Val Lys
340 345 350
Lys Asp Glu Leu Asn Glu Glu Tyr Ile Ile Pro Ser Val Phe Asn Arg
355 360 365
Asn Val Cys Asp Ala Val Ser Lys Ala Val Met Asp Val Ala Gln Lys
370 375 380
Asn Asn Lys Phe Thr Ala
385 390
<210> 27
<211> 2187
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 27
atgactaact atgaaaaggt aggtaaatta caagtagcaa cggaattata taactttgta 60
aaggaagaag ttttaccagg acttgaaata caaaatgagc aattctggac aaattttgat 120
tcgcttattc atgaacttgc cccagaaaat aaggcacttt tggaaaaaag ggacgagctt 180
cagaagacca tatcagaatg gcatcaaaat aataaaggag aaatagattt tgctaaatac 240
aaagagttct tacaagaaat aggatatctt gaaccagttc cagaagattt caaagttact 300
acagctaatg tagacaatga agtggctaat caggctggtt ctcaattagt tgtacctata 360
gataatgcaa gatatgctct aaacgctgct aatgcccgct ggggatcact ttatgatgca 420
ttatatggta gtgacgttat aagcgatgag gctggagcag aggctggtgt ccagtataat 480
cctataagag gtcaaaaggt aatagatttt gcaaaaaatt tattagatca agcagctcct 540
cttgcagaag gttctcatgc tgatgtaacc gcctacaaaa ttgttgaagg aaaacttcag 600
gttactttgg aatctggtaa tactgcttta cttcaagatg aatccaaatt tgtaggatat 660
aatggaagtg aggatgcacc gacggcagta ctccttgtaa acaacgggct tcatattgaa 720
atagcaatag ataaaaataa tcctatagga aaatctgaca aggctggtgt taaggacctt 780
gttttagagg ctgcactttc gactttaatg gactgtgagg attcaattgc tgcagtagat 840
gcagaggata aagtaggcgt atatagaaat tggcttggac ttatgaaagg agatttagaa 900
agcactttta agagaggatc aaaaactgtt acaagaaagc tgaacgctga cagaacctat 960
acaggtgatg gtaaacaatt aactctcagg ggacgtagtc ttatgtttgt gagaaatgtg 1020
ggacatttaa tgactaacaa tgctatattg gatgaaaacg gaaatgaagt tccagaaggt 1080
atcttagatg gagtattaac aagtcttata gcaactcata atttcaaaga aaatgcagag 1140
ttcaaaaaca gccttcacaa gagtatatat attgttaaac caaaaatgca ttcaccagca 1200
gaagcagctt ttgctaataa gttatttgat agaatagaag atttacttgg agtagaaaga 1260
aatactatta aaattggtgt tatggatgaa gaaagaagaa tgtcattaaa tttaaagtct 1320
gcaataaatg aagttaaaga aagaatagct tttattaata caggattcct tgatagaact 1380
ggagatgaaa tacacacttc tatggaagca ggacctgtaa ttagaaaggc tgacatgaag 1440
acttcagaat ggctttcttc ttatgaatca gctaatgtag ctgtaggaat aggagcagga 1500
ttaccaggac atgcacagat tggaaaggga atgtgggcaa tgccagacct tatggcagca 1560
atgcttgaac aaaaaatagc acatcctaag gctggggctt caacagcatg ggttccttct 1620
ccaactgcag ctatattgca tgcccttcac tatcatgagg taaacgttaa agaagttcag 1680
gctggtattg atagttctat agattataga gatggaatat tagatatacc tcttgctcca 1740
aatgcagact ggagcgctga ggaagttcag tctgaattag acaacaatgc acaaggaata 1800
cttggatatg ttgtgcgctg gattgatcaa ggtgtaggat gcagcactgt accagatatt 1860
aatgatgttg gtcttatgga agatagggct actctccgta tttcaagtca gcatatagct 1920
aattggctta gacatggtgt gtgtactaaa gaacaggtag aggaaacttt agagagaatg 1980
gctaaagttg tagaccaaca aaatgcagat gatgaacttt atcaaccaat ggcaccaaac 2040
tacgacgatt caattgcatt ccaggctgca tcagacttaa ttttcaaagg agcagagcaa 2100
cctagtgggt atactgagcc aatcctacat gcaagaagaa tagaagcaaa ggctaaggct 2160
aaacaaaaag caacagtaca gaattag 2187
<210> 28
<211> 728
<212> PRT
<213> Sporosarcina sp. P30
<400> 28
Met Thr Asn Tyr Glu Lys Val Gly Lys Leu Gln Val Ala Thr Glu Leu
1 5 10 15
Tyr Asn Phe Val Lys Glu Glu Val Leu Pro Gly Leu Glu Ile Gln Asn
20 25 30
Glu Gln Phe Trp Thr Asn Phe Asp Ser Leu Ile His Glu Leu Ala Pro
35 40 45
Glu Asn Lys Ala Leu Leu Glu Lys Arg Asp Glu Leu Gln Lys Thr Ile
50 55 60
Ser Glu Trp His Gln Asn Asn Lys Gly Glu Ile Asp Phe Ala Lys Tyr
65 70 75 80
Lys Glu Phe Leu Gln Glu Ile Gly Tyr Leu Glu Pro Val Pro Glu Asp
85 90 95
Phe Lys Val Thr Thr Ala Asn Val Asp Asn Glu Val Ala Asn Gln Ala
100 105 110
Gly Ser Gln Leu Val Val Pro Ile Asp Asn Ala Arg Tyr Ala Leu Asn
115 120 125
Ala Ala Asn Ala Arg Trp Gly Ser Leu Tyr Asp Ala Leu Tyr Gly Ser
130 135 140
Asp Val Ile Ser Asp Glu Ala Gly Ala Glu Ala Gly Val Gln Tyr Asn
145 150 155 160
Pro Ile Arg Gly Gln Lys Val Ile Asp Phe Ala Lys Asn Leu Leu Asp
165 170 175
Gln Ala Ala Pro Leu Ala Glu Gly Ser His Ala Asp Val Thr Ala Tyr
180 185 190
Lys Ile Val Glu Gly Lys Leu Gln Val Thr Leu Glu Ser Gly Asn Thr
195 200 205
Ala Leu Leu Gln Asp Glu Ser Lys Phe Val Gly Tyr Asn Gly Ser Glu
210 215 220
Asp Ala Pro Thr Ala Val Leu Leu Val Asn Asn Gly Leu His Ile Glu
225 230 235 240
Ile Ala Ile Asp Lys Asn Asn Pro Ile Gly Lys Ser Asp Lys Ala Gly
245 250 255
Val Lys Asp Leu Val Leu Glu Ala Ala Leu Ser Thr Leu Met Asp Cys
260 265 270
Glu Asp Ser Ile Ala Ala Val Asp Ala Glu Asp Lys Val Gly Val Tyr
275 280 285
Arg Asn Trp Leu Gly Leu Met Lys Gly Asp Leu Glu Ser Thr Phe Lys
290 295 300
Arg Gly Ser Lys Thr Val Thr Arg Lys Leu Asn Ala Asp Arg Thr Tyr
305 310 315 320
Thr Gly Asp Gly Lys Gln Leu Thr Leu Arg Gly Arg Ser Leu Met Phe
325 330 335
Val Arg Asn Val Gly His Leu Met Thr Asn Asn Ala Ile Leu Asp Glu
340 345 350
Asn Gly Asn Glu Val Pro Glu Gly Ile Leu Asp Gly Val Leu Thr Ser
355 360 365
Leu Ile Ala Thr His Asn Phe Lys Glu Asn Ala Glu Phe Lys Asn Ser
370 375 380
Leu His Lys Ser Ile Tyr Ile Val Lys Pro Lys Met His Ser Pro Ala
385 390 395 400
Glu Ala Ala Phe Ala Asn Lys Leu Phe Asp Arg Ile Glu Asp Leu Leu
405 410 415
Gly Val Glu Arg Asn Thr Ile Lys Ile Gly Val Met Asp Glu Glu Arg
420 425 430
Arg Met Ser Leu Asn Leu Lys Ser Ala Ile Asn Glu Val Lys Glu Arg
435 440 445
Ile Ala Phe Ile Asn Thr Gly Phe Leu Asp Arg Thr Gly Asp Glu Ile
450 455 460
His Thr Ser Met Glu Ala Gly Pro Val Ile Arg Lys Ala Asp Met Lys
465 470 475 480
Thr Ser Glu Trp Leu Ser Ser Tyr Glu Ser Ala Asn Val Ala Val Gly
485 490 495
Ile Gly Ala Gly Leu Pro Gly His Ala Gln Ile Gly Lys Gly Met Trp
500 505 510
Ala Met Pro Asp Leu Met Ala Ala Met Leu Glu Gln Lys Ile Ala His
515 520 525
Pro Lys Ala Gly Ala Ser Thr Ala Trp Val Pro Ser Pro Thr Ala Ala
530 535 540
Ile Leu His Ala Leu His Tyr His Glu Val Asn Val Lys Glu Val Gln
545 550 555 560
Ala Gly Ile Asp Ser Ser Ile Asp Tyr Arg Asp Gly Ile Leu Asp Ile
565 570 575
Pro Leu Ala Pro Asn Ala Asp Trp Ser Ala Glu Glu Val Gln Ser Glu
580 585 590
Leu Asp Asn Asn Ala Gln Gly Ile Leu Gly Tyr Val Val Arg Trp Ile
595 600 605
Asp Gln Gly Val Gly Cys Ser Thr Val Pro Asp Ile Asn Asp Val Gly
610 615 620
Leu Met Glu Asp Arg Ala Thr Leu Arg Ile Ser Ser Gln His Ile Ala
625 630 635 640
Asn Trp Leu Arg His Gly Val Cys Thr Lys Glu Gln Val Glu Glu Thr
645 650 655
Leu Glu Arg Met Ala Lys Val Val Asp Gln Gln Asn Ala Asp Asp Glu
660 665 670
Leu Tyr Gln Pro Met Ala Pro Asn Tyr Asp Asp Ser Ile Ala Phe Gln
675 680 685
Ala Ala Ser Asp Leu Ile Phe Lys Gly Ala Glu Gln Pro Ser Gly Tyr
690 695 700
Thr Glu Pro Ile Leu His Ala Arg Arg Ile Glu Ala Lys Ala Lys Ala
705 710 715 720
Lys Gln Lys Ala Thr Val Gln Asn
725
<210> 29
<211> 2181
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 29
atggaaaatt atgtaaaagt aggctcatta caagtagcaa gtgaacttta tgaatttatt 60
aactcagagg ctctacctgg aagtgatttg gaaccagaga aattttggag tggatttgaa 120
aaattagttc atgatcttac tcctaaaaat aagcaacttc ttgcccgtag agatgaaata 180
caaagtaaaa taaatacttg gcacagagag aacaatcaat cctttaactt cgaaacttat 240
aagagtttcc tagaagaaat aggatattta gaaacagaag tagaggattt tgatataaaa 300
acagaaggtg tagatgatga aatagctgta caggctggtc cacagcttgt agtacctgta 360
aacaatgcaa gatatgcaat aaatgctgca aatgctagat ggggttcact atatgatgct 420
ttatatggta cagatgctat aagtgaagaa ggcggcgcca cacgtgcagg cggctataat 480
cctgttagag gagaaaaggt aatagatttt gcaagagaat ttttagatca agcagtccct 540
cttaatggtt tttcccacaa agaagcaaca agttatttag tagtagatgg aaaacttaca 600
gttaagctga aaaatggaga atctacagga ttaaagaatg aggaaaaatt tgcaggatat 660
cagggtgcac cggaacaacc ttctgcagtt cttttaaaga acaatggcct tcactttgaa 720
attcaaatag atagatctca tccaatagga caaactgatg aagcgggagt taaagatttg 780
ttacttgaat ctgctgtaac tactataatg gactgtgaag attctgttac tgcagtagat 840
gcagaagaca aagttttagt ttatagaaat tggcttggat taatgaaagg ggatttggaa 900
gcatctttct caaagggtaa taaatcaatg atgagaaaat taaatgcaga cagaaaatac 960
tcctctccaa ctggcggcga attaagtttg aagggaagaa gtttgttatt tgtaagaaat 1020
gttggccatc ttatgtctat aaatgcaata cttgatcaag acggtgaaga aatacaggaa 1080
ggtattttag acactgttat gacatcgctt atagctaaac atacattact tggaaacggt 1140
tcataccaaa atacttcaaa gggttctgtt tatatagtta aacctaagat gcatggttct 1200
gaagaagtag catttgcaaa tgaattattt gatagagtag aagatttact tgaattacag 1260
agaaatacat tgaaaatagg agtaatggat gaagaaagaa ggacatctct aaacttaaaa 1320
gcatgtatta gacaagttaa agatcgtatt gtatttataa atacaggatt ccttgacagg 1380
acaggtgatg agattcatac aagtatggaa gcaggacctg tagtaagaaa aaatgaaatg 1440
aaatcttcaa aatggcttca agcctatgaa caaagtaatg ttattgctgg attatcatca 1500
ggatttcaag gacaggcaca aataggaaaa ggaatgtggg ctatgccaga tttaatgaaa 1560
gagatgatgg aacagaagat aggacatcta aaaactggtg ctaatactgc ctgggttcca 1620
agccctacag cggctacatt gcatgcactt cattatcatc aagttgacat tacaaaagtt 1680
caagatgaac gtgccaacga taaaagagat ttaagagatg atattttaga atttccagta 1740
gtaactaatc cacagtggac gcccgaagaa atacagaatg aattagataa taatgcacaa 1800
tccatacttg gatacgttgt tagatgggtt gaacagggag ttggttgttc aaaagtacct 1860
gacataaaca atgttggatt aatggaagac agggctacat taagaataag cagtcagcat 1920
gtagctaatt ggcttcatca tggaatatgt aagaaggaac aagttattga aacacttcaa 1980
aggatggcaa aggttgtaga tgaacaaaat gctggaaatt tggcttatag gcctatggca 2040
gcaaattatg atgactcagt agcatttcag gctgcctgtg atttaatttt acaaggatat 2100
gatcagccat ctggatacac agagcctata ctacacagaa ggcgtataga ggctaaggct 2160
aaatttgcaa ttaaacaata a 2181
<210> 30
<211> 726
<212> PRT
<213> Bacillus sp. cl95
<400> 30
Met Glu Asn Tyr Val Lys Val Gly Ser Leu Gln Val Ala Ser Glu Leu
1 5 10 15
Tyr Glu Phe Ile Asn Ser Glu Ala Leu Pro Gly Ser Asp Leu Glu Pro
20 25 30
Glu Lys Phe Trp Ser Gly Phe Glu Lys Leu Val His Asp Leu Thr Pro
35 40 45
Lys Asn Lys Gln Leu Leu Ala Arg Arg Asp Glu Ile Gln Ser Lys Ile
50 55 60
Asn Thr Trp His Arg Glu Asn Asn Gln Ser Phe Asn Phe Glu Thr Tyr
65 70 75 80
Lys Ser Phe Leu Glu Glu Ile Gly Tyr Leu Glu Thr Glu Val Glu Asp
85 90 95
Phe Asp Ile Lys Thr Glu Gly Val Asp Asp Glu Ile Ala Val Gln Ala
100 105 110
Gly Pro Gln Leu Val Val Pro Val Asn Asn Ala Arg Tyr Ala Ile Asn
115 120 125
Ala Ala Asn Ala Arg Trp Gly Ser Leu Tyr Asp Ala Leu Tyr Gly Thr
130 135 140
Asp Ala Ile Ser Glu Glu Gly Gly Ala Thr Arg Ala Gly Gly Tyr Asn
145 150 155 160
Pro Val Arg Gly Glu Lys Val Ile Asp Phe Ala Arg Glu Phe Leu Asp
165 170 175
Gln Ala Val Pro Leu Asn Gly Phe Ser His Lys Glu Ala Thr Ser Tyr
180 185 190
Leu Val Val Asp Gly Lys Leu Thr Val Lys Leu Lys Asn Gly Glu Ser
195 200 205
Thr Gly Leu Lys Asn Glu Glu Lys Phe Ala Gly Tyr Gln Gly Ala Pro
210 215 220
Glu Gln Pro Ser Ala Val Leu Leu Lys Asn Asn Gly Leu His Phe Glu
225 230 235 240
Ile Gln Ile Asp Arg Ser His Pro Ile Gly Gln Thr Asp Glu Ala Gly
245 250 255
Val Lys Asp Leu Leu Leu Glu Ser Ala Val Thr Thr Ile Met Asp Cys
260 265 270
Glu Asp Ser Val Thr Ala Val Asp Ala Glu Asp Lys Val Leu Val Tyr
275 280 285
Arg Asn Trp Leu Gly Leu Met Lys Gly Asp Leu Glu Ala Ser Phe Ser
290 295 300
Lys Gly Asn Lys Ser Met Met Arg Lys Leu Asn Ala Asp Arg Lys Tyr
305 310 315 320
Ser Ser Pro Thr Gly Gly Glu Leu Ser Leu Lys Gly Arg Ser Leu Leu
325 330 335
Phe Val Arg Asn Val Gly His Leu Met Ser Ile Asn Ala Ile Leu Asp
340 345 350
Gln Asp Gly Glu Glu Ile Gln Glu Gly Ile Leu Asp Thr Val Met Thr
355 360 365
Ser Leu Ile Ala Lys His Thr Leu Leu Gly Asn Gly Ser Tyr Gln Asn
370 375 380
Thr Ser Lys Gly Ser Val Tyr Ile Val Lys Pro Lys Met His Gly Ser
385 390 395 400
Glu Glu Val Ala Phe Ala Asn Glu Leu Phe Asp Arg Val Glu Asp Leu
405 410 415
Leu Glu Leu Gln Arg Asn Thr Leu Lys Ile Gly Val Met Asp Glu Glu
420 425 430
Arg Arg Thr Ser Leu Asn Leu Lys Ala Cys Ile Arg Gln Val Lys Asp
435 440 445
Arg Ile Val Phe Ile Asn Thr Gly Phe Leu Asp Arg Thr Gly Asp Glu
450 455 460
Ile His Thr Ser Met Glu Ala Gly Pro Val Val Arg Lys Asn Glu Met
465 470 475 480
Lys Ser Ser Lys Trp Leu Gln Ala Tyr Glu Gln Ser Asn Val Ile Ala
485 490 495
Gly Leu Ser Ser Gly Phe Gln Gly Gln Ala Gln Ile Gly Lys Gly Met
500 505 510
Trp Ala Met Pro Asp Leu Met Lys Glu Met Met Glu Gln Lys Ile Gly
515 520 525
His Leu Lys Thr Gly Ala Asn Thr Ala Trp Val Pro Ser Pro Thr Ala
530 535 540
Ala Thr Leu His Ala Leu His Tyr His Gln Val Asp Ile Thr Lys Val
545 550 555 560
Gln Asp Glu Arg Ala Asn Asp Lys Arg Asp Leu Arg Asp Asp Ile Leu
565 570 575
Glu Phe Pro Val Val Thr Asn Pro Gln Trp Thr Pro Glu Glu Ile Gln
580 585 590
Asn Glu Leu Asp Asn Asn Ala Gln Ser Ile Leu Gly Tyr Val Val Arg
595 600 605
Trp Val Glu Gln Gly Val Gly Cys Ser Lys Val Pro Asp Ile Asn Asn
610 615 620
Val Gly Leu Met Glu Asp Arg Ala Thr Leu Arg Ile Ser Ser Gln His
625 630 635 640
Val Ala Asn Trp Leu His His Gly Ile Cys Lys Lys Glu Gln Val Ile
645 650 655
Glu Thr Leu Gln Arg Met Ala Lys Val Val Asp Glu Gln Asn Ala Gly
660 665 670
Asn Leu Ala Tyr Arg Pro Met Ala Ala Asn Tyr Asp Asp Ser Val Ala
675 680 685
Phe Gln Ala Ala Cys Asp Leu Ile Leu Gln Gly Tyr Asp Gln Pro Ser
690 695 700
Gly Tyr Thr Glu Pro Ile Leu His Arg Arg Arg Ile Glu Ala Lys Ala
705 710 715 720
Lys Phe Ala Ile Lys Gln
725
<210> 31
<211> 1623
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 31
atgtcagcac cagcaccatc aactttagct atagtagatg cagaaccatt accaagacaa 60
gaggaagtgc ttacagatgc tgcacttgct tttgttgctg aattgcacag aagatttaca 120
ccacgtagag atgaattatt agcaagaagg gcagaaagaa gagcggaaat agctagaact 180
tctacactgg atttcttgcc agaaacagca gctatacgtg ctgatgacag ctggaaggta 240
gcccctgctc cagctgctct caacgacaga agagtagaaa taacaggacc tacagataga 300
aagatgacta taaacgctct aaatagtggt gctaaagttt ggctagcaga ttttgaagat 360
gcttcagctc caacttggga aaatgttgtt ttgggacaat taaatcttgc atcagcttat 420
actagatcca ttgactttac agatgagaga actggaaaga gttatgcact tcgtccggat 480
gctgaattag caacggtagt tatgaggcct agaggttggc atcttgatga aagacatctt 540
caggtagacg gtaggcctgt acctggtgca ttagtggact ttgggcttta tttttttcat 600
aatgcacaaa gattgcttga tctaggtaag ggaccatact tctatttacc taaaactgaa 660
tctcatcttg aagcaagact atggaatgaa gtatttgtat ttgcacagga ttatgtaggt 720
ataccacagg gaactgtcag agcaactgta cttatagaaa ctattacagc agcctatgaa 780
atggaagaaa tactttacga gcttagggac catgcaagtg gcttaaatgc aggaagatgg 840
gattatctat tttccatagt taaaaatttt agggacggcg gcgctaaatt tgttttacct 900
gatagaaatg cagttactat gactgctcca tttatgcgtg cttatacaga attattagta 960
cgtacctgtc acaagagagg agcacatgct ataggcggca tggcagcatt tatacctagt 1020
agaagggatg cagaggtaaa taaagtagca tttgaaaaag taagagcaga taaggaccgt 1080
gaggctggtg atggttttga tggcagctgg gttgctcatc cggatcttgt acctatagca 1140
atggagagtt ttgataaggt acttggagat aaaccaaacc aaaaggacag gcttagagaa 1200
gatgtagatg taaaagcagc tgatttaatt gccgtagatt cacttgaggc taaacctacc 1260
tatgcaggat tagttaatgc agttcaagta ggtattagat atattgaagc atggcttaga 1320
ggattaggtg ctgtagctat atttaactta atggaagatg ctgctactgc agaaatatca 1380
aggagtcaga tttggcaatg gattaatgct gaggtagttc ttgataatgg tgaacaggta 1440
acagctgatt tagcccgtaa agtagctgca gaagaattgg caggaataag agcagaaata 1500
ggtgaagagg catttgcagc gggcaactgg caacaggctc atgatttgtt acttactgta 1560
tctttagatg aagattatgc agattttttg actttaccag cttatgaaca acttaaagga 1620
taa 1623
<210> 32
<211> 540
<212> PRT
<213> Streptomyces coelicolor
<400> 32
Met Ser Ala Pro Ala Pro Ser Thr Leu Ala Ile Val Asp Ala Glu Pro
1 5 10 15
Leu Pro Arg Gln Glu Glu Val Leu Thr Asp Ala Ala Leu Ala Phe Val
20 25 30
Ala Glu Leu His Arg Arg Phe Thr Pro Arg Arg Asp Glu Leu Leu Ala
35 40 45
Arg Arg Ala Glu Arg Arg Ala Glu Ile Ala Arg Thr Ser Thr Leu Asp
50 55 60
Phe Leu Pro Glu Thr Ala Ala Ile Arg Ala Asp Asp Ser Trp Lys Val
65 70 75 80
Ala Pro Ala Pro Ala Ala Leu Asn Asp Arg Arg Val Glu Ile Thr Gly
85 90 95
Pro Thr Asp Arg Lys Met Thr Ile Asn Ala Leu Asn Ser Gly Ala Lys
100 105 110
Val Trp Leu Ala Asp Phe Glu Asp Ala Ser Ala Pro Thr Trp Glu Asn
115 120 125
Val Val Leu Gly Gln Leu Asn Leu Ala Ser Ala Tyr Thr Arg Ser Ile
130 135 140
Asp Phe Thr Asp Glu Arg Thr Gly Lys Ser Tyr Ala Leu Arg Pro Asp
145 150 155 160
Ala Glu Leu Ala Thr Val Val Met Arg Pro Arg Gly Trp His Leu Asp
165 170 175
Glu Arg His Leu Gln Val Asp Gly Arg Pro Val Pro Gly Ala Leu Val
180 185 190
Asp Phe Gly Leu Tyr Phe Phe His Asn Ala Gln Arg Leu Leu Asp Leu
195 200 205
Gly Lys Gly Pro Tyr Phe Tyr Leu Pro Lys Thr Glu Ser His Leu Glu
210 215 220
Ala Arg Leu Trp Asn Glu Val Phe Val Phe Ala Gln Asp Tyr Val Gly
225 230 235 240
Ile Pro Gln Gly Thr Val Arg Ala Thr Val Leu Ile Glu Thr Ile Thr
245 250 255
Ala Ala Tyr Glu Met Glu Glu Ile Leu Tyr Glu Leu Arg Asp His Ala
260 265 270
Ser Gly Leu Asn Ala Gly Arg Trp Asp Tyr Leu Phe Ser Ile Val Lys
275 280 285
Asn Phe Arg Asp Gly Gly Ala Lys Phe Val Leu Pro Asp Arg Asn Ala
290 295 300
Val Thr Met Thr Ala Pro Phe Met Arg Ala Tyr Thr Glu Leu Leu Val
305 310 315 320
Arg Thr Cys His Lys Arg Gly Ala His Ala Ile Gly Gly Met Ala Ala
325 330 335
Phe Ile Pro Ser Arg Arg Asp Ala Glu Val Asn Lys Val Ala Phe Glu
340 345 350
Lys Val Arg Ala Asp Lys Asp Arg Glu Ala Gly Asp Gly Phe Asp Gly
355 360 365
Ser Trp Val Ala His Pro Asp Leu Val Pro Ile Ala Met Glu Ser Phe
370 375 380
Asp Lys Val Leu Gly Asp Lys Pro Asn Gln Lys Asp Arg Leu Arg Glu
385 390 395 400
Asp Val Asp Val Lys Ala Ala Asp Leu Ile Ala Val Asp Ser Leu Glu
405 410 415
Ala Lys Pro Thr Tyr Ala Gly Leu Val Asn Ala Val Gln Val Gly Ile
420 425 430
Arg Tyr Ile Glu Ala Trp Leu Arg Gly Leu Gly Ala Val Ala Ile Phe
435 440 445
Asn Leu Met Glu Asp Ala Ala Thr Ala Glu Ile Ser Arg Ser Gln Ile
450 455 460
Trp Gln Trp Ile Asn Ala Glu Val Val Leu Asp Asn Gly Glu Gln Val
465 470 475 480
Thr Ala Asp Leu Ala Arg Lys Val Ala Ala Glu Glu Leu Ala Gly Ile
485 490 495
Arg Ala Glu Ile Gly Glu Glu Ala Phe Ala Ala Gly Asn Trp Gln Gln
500 505 510
Ala His Asp Leu Leu Leu Thr Val Ser Leu Asp Glu Asp Tyr Ala Asp
515 520 525
Phe Leu Thr Leu Pro Ala Tyr Glu Gln Leu Lys Gly
530 535 540
<210> 33
<211> 2190
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 33
atgaccaatt atgaaaaagt aggtaagtta caggtagcaa ctgaattagt aaattttgta 60
aatgaggaag tattacctgg cttagaaata cagaaagatc aattctggac caatttcgat 120
tcactgatcc atgaattagc tccagaaaat aaagcacttt tagaaaaaag atcagaactt 180
cagaatgcaa tttctgaatg gcatcagcaa aataaaggac aaatagatgc tgcaaaatat 240
aaggaatttc tggaagaaat aggatattta gagccagttg ctgaagattt tcaggtaact 300
acaagcaatg tagataatga aattgctaat caggctggtt ctcaattagt tgtaccaatt 360
gataatgcaa gatatgcttt aaatgcagct aatgctagat ggggttcact atatgatgca 420
ttatatggaa cagatgttat atctgatgaa gatggagcac aggcaggagc agagtataat 480
cctaaaagag gacaaaaagt tattgctttt gctaagaatt tacttgatca ggctgctcct 540
ttagctgagg gatctcatgc agatgcagct gcttataaaa ttgcagatgg aacattacag 600
gttactttag aaaatggaaa aacaactgca cttcaggatg aaagcaagct ggcaggatat 660
aacggaagtg aagatgcccc agaagcagtg ttactagtaa ataatggact tcatattgaa 720
attgcaatag atagaaatca tcctataggt aaagatgata aggctggtgt aaaagaccta 780
gtgcttgaag cagctttatc tacattaatg gattgtgaag atagtatagc agcagtagat 840
gcagaagaca aagtaggtgt ttatagaaat tggttagggc ttatgaaagg agatttagag 900
gcttcattta agagaggaaa taagacagta actagaagaa tgaatgcaga tagaaaatat 960
aaaactgcag atggtaaaga atttacattg cacggaaggt cattgatgtt tgtaagaaat 1020
gtaggacatc ttatgacaaa taatgcaatc ctagatgaaa acggaaatga agttccagaa 1080
ggtatacttg atggagttat aacatcttta attgcaactc ataacttcaa atcagataca 1140
gaatttaaga attcaagaca cggatcaatt tatatagtta agcctaaaat gcatagtcca 1200
gcagaggctg cttttgcaaa taaattattc gatagaatag aggatttatt agggttagag 1260
agaaatacta taaaaatagg attgatggac gaggaacgta gaatgtcctt aaatcttaaa 1320
tctgctataa atgaagttaa agaacgtatt gcttttatta atactggatt ccttgataga 1380
acaggagatg aaatacacac tagcatggaa gcaggacctg taataagaaa agcagacatg 1440
aaggcttcaa actggttaag ttcctatgaa gcaagcaatg ttgcagtagg tataaaagca 1500
ggattaccgg gacatgcaca aataggtaaa ggaatgtggg caatgccaga tatgatggca 1560
gcaatgttag aacagaaggt agctcatcca aaagcaggag catccactgc atgggtacca 1620
tcaccaactg cagctaccct tcatgcacta cattatcatg aagtaaatgt aaaagatgtt 1680
caggctggaa tagattcctc tgtagattat agggatggaa tattagagat acctttggca 1740
ccgtcggtag attggacacc agaagaagtt caatctgaat tagataataa tgcccaagga 1800
atattaggat atgtagtaag atggatagat caaggtgtag gatgttctaa ggtaccagat 1860
ataaatgatg tgggccttat ggaagacagg gcaacattac gaatatctag tcagcatata 1920
gcaaattggc ttagacacgg aatatgtaca aaagaacaag ttcaagaaac attagaaaga 1980
atggctaaag ttgtagatgg tcaaaatgca gatgacgaat tgtaccaacc tatggcacca 2040
aattatgatg attctatagc attccaggct gcttgtgact taatattcaa aggagcagaa 2100
cagccaagtg gatatactga accaattcta catgctagaa gaatagaggc taaggctaaa 2160
gccaagcaaa aagcaactgt acagaattag 2190
<210> 34
<211> 729
<212> PRT
<213> Sporosarcina sp. P35
<400> 34
Met Thr Asn Tyr Glu Lys Val Gly Lys Leu Gln Val Ala Thr Glu Leu
1 5 10 15
Val Asn Phe Val Asn Glu Glu Val Leu Pro Gly Leu Glu Ile Gln Lys
20 25 30
Asp Gln Phe Trp Thr Asn Phe Asp Ser Leu Ile His Glu Leu Ala Pro
35 40 45
Glu Asn Lys Ala Leu Leu Glu Lys Arg Ser Glu Leu Gln Asn Ala Ile
50 55 60
Ser Glu Trp His Gln Gln Asn Lys Gly Gln Ile Asp Ala Ala Lys Tyr
65 70 75 80
Lys Glu Phe Leu Glu Glu Ile Gly Tyr Leu Glu Pro Val Ala Glu Asp
85 90 95
Phe Gln Val Thr Thr Ser Asn Val Asp Asn Glu Ile Ala Asn Gln Ala
100 105 110
Gly Ser Gln Leu Val Val Pro Ile Asp Asn Ala Arg Tyr Ala Leu Asn
115 120 125
Ala Ala Asn Ala Arg Trp Gly Ser Leu Tyr Asp Ala Leu Tyr Gly Thr
130 135 140
Asp Val Ile Ser Asp Glu Asp Gly Ala Gln Ala Gly Ala Glu Tyr Asn
145 150 155 160
Pro Lys Arg Gly Gln Lys Val Ile Ala Phe Ala Lys Asn Leu Leu Asp
165 170 175
Gln Ala Ala Pro Leu Ala Glu Gly Ser His Ala Asp Ala Ala Ala Tyr
180 185 190
Lys Ile Ala Asp Gly Thr Leu Gln Val Thr Leu Glu Asn Gly Lys Thr
195 200 205
Thr Ala Leu Gln Asp Glu Ser Lys Leu Ala Gly Tyr Asn Gly Ser Glu
210 215 220
Asp Ala Pro Glu Ala Val Leu Leu Val Asn Asn Gly Leu His Ile Glu
225 230 235 240
Ile Ala Ile Asp Arg Asn His Pro Ile Gly Lys Asp Asp Lys Ala Gly
245 250 255
Val Lys Asp Leu Val Leu Glu Ala Ala Leu Ser Thr Leu Met Asp Cys
260 265 270
Glu Asp Ser Ile Ala Ala Val Asp Ala Glu Asp Lys Val Gly Val Tyr
275 280 285
Arg Asn Trp Leu Gly Leu Met Lys Gly Asp Leu Glu Ala Ser Phe Lys
290 295 300
Arg Gly Asn Lys Thr Val Thr Arg Arg Met Asn Ala Asp Arg Lys Tyr
305 310 315 320
Lys Thr Ala Asp Gly Lys Glu Phe Thr Leu His Gly Arg Ser Leu Met
325 330 335
Phe Val Arg Asn Val Gly His Leu Met Thr Asn Asn Ala Ile Leu Asp
340 345 350
Glu Asn Gly Asn Glu Val Pro Glu Gly Ile Leu Asp Gly Val Ile Thr
355 360 365
Ser Leu Ile Ala Thr His Asn Phe Lys Ser Asp Thr Glu Phe Lys Asn
370 375 380
Ser Arg His Gly Ser Ile Tyr Ile Val Lys Pro Lys Met His Ser Pro
385 390 395 400
Ala Glu Ala Ala Phe Ala Asn Lys Leu Phe Asp Arg Ile Glu Asp Leu
405 410 415
Leu Gly Leu Glu Arg Asn Thr Ile Lys Ile Gly Leu Met Asp Glu Glu
420 425 430
Arg Arg Met Ser Leu Asn Leu Lys Ser Ala Ile Asn Glu Val Lys Glu
435 440 445
Arg Ile Ala Phe Ile Asn Thr Gly Phe Leu Asp Arg Thr Gly Asp Glu
450 455 460
Ile His Thr Ser Met Glu Ala Gly Pro Val Ile Arg Lys Ala Asp Met
465 470 475 480
Lys Ala Ser Asn Trp Leu Ser Ser Tyr Glu Ala Ser Asn Val Ala Val
485 490 495
Gly Ile Lys Ala Gly Leu Pro Gly His Ala Gln Ile Gly Lys Gly Met
500 505 510
Trp Ala Met Pro Asp Met Met Ala Ala Met Leu Glu Gln Lys Val Ala
515 520 525
His Pro Lys Ala Gly Ala Ser Thr Ala Trp Val Pro Ser Pro Thr Ala
530 535 540
Ala Thr Leu His Ala Leu His Tyr His Glu Val Asn Val Lys Asp Val
545 550 555 560
Gln Ala Gly Ile Asp Ser Ser Val Asp Tyr Arg Asp Gly Ile Leu Glu
565 570 575
Ile Pro Leu Ala Pro Ser Val Asp Trp Thr Pro Glu Glu Val Gln Ser
580 585 590
Glu Leu Asp Asn Asn Ala Gln Gly Ile Leu Gly Tyr Val Val Arg Trp
595 600 605
Ile Asp Gln Gly Val Gly Cys Ser Lys Val Pro Asp Ile Asn Asp Val
610 615 620
Gly Leu Met Glu Asp Arg Ala Thr Leu Arg Ile Ser Ser Gln His Ile
625 630 635 640
Ala Asn Trp Leu Arg His Gly Ile Cys Thr Lys Glu Gln Val Gln Glu
645 650 655
Thr Leu Glu Arg Met Ala Lys Val Val Asp Gly Gln Asn Ala Asp Asp
660 665 670
Glu Leu Tyr Gln Pro Met Ala Pro Asn Tyr Asp Asp Ser Ile Ala Phe
675 680 685
Gln Ala Ala Cys Asp Leu Ile Phe Lys Gly Ala Glu Gln Pro Ser Gly
690 695 700
Tyr Thr Glu Pro Ile Leu His Ala Arg Arg Ile Glu Ala Lys Ala Lys
705 710 715 720
Ala Lys Gln Lys Ala Thr Val Gln Asn
725
<210> 35
<211> 2181
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 35
atggtagcgt ataaacaaat aggaaaactt caggtagctc cagttttata taattttata 60
aatgaagaag cattacctga aacaggactt caggaagaag cgttctgggc gggttttgaa 120
cagttaattc atgaattgac tcctgaaaat aaggctctac ttgctaaaag agatgaatta 180
caagcaaaac taaacagatg gtacagagaa aatagggact cattcgattt tgaagcatac 240
aaggcttttt taacatctat tggatatctt gaagcagatg ttgcagattt tcaaatatca 300
actgctaatg tagatgatga aattgcttta caggctggtc ctcaattagt tgtaccagta 360
aataatgcaa gatatgctat aaatgctgca aatgcaagat ggggttcttt gtatgatgcc 420
ctctacggaa ctgatgcaat atcttctgaa aatggagcag gcgtgcaaag tcaatataat 480
cctattcgag gtgagaaggt aataactttt gctaaaagct ttttaaatca cactattccc 540
ttaaaagaag gaaagcatga agatgtagtt caatacgtgg taacaaataa gatggaagca 600
ttgcttcaag atggaactac tacagagtta aaagaaccat caaaatgggt tggctatcaa 660
ggggatggtt caaatccatc agcactttta tttaagaata atggacttca ctttgaaata 720
cagatagata gacaggatgc cataggtaaa tcagatgatg ctggtgtaaa agatgtattg 780
ttagagtcag ctgtaacaac tattatggat tgtgaagata gtgtagctgc cgtagatgca 840
gaagataaag ttgaagtata caggaactgg ttgggattaa tgaaaggtga tctgaaggca 900
agatttaaga aaggtgcaaa aactatgaca agaacattga atgatgacag acagtataaa 960
actgcaaatg gagatactgt aacattatca ggtagatcct taatgtttgt tagaaatgta 1020
ggacatttga tgtcaaattc tgctatttta gatgcaaatg gagatgaaat acaggaagga 1080
atacttgatt caataataac ttcacttata gctaaacata ctttattagg aacaggaaaa 1140
taccaaaaca gccaaaaggg aagtgtttat attgtaaaac ctaaaatgca tggttcagaa 1200
gaagtagctt ttgctaataa actttttgat agagttgaag atcttgtagg actaccaaga 1260
catactttaa aaataggtgt catggatgaa gaaagaagaa cttcattaaa tttaaaagca 1320
tgcatagaga aagtaaagaa tagggtagct tttataaaca ctggtttttt ggatagaact 1380
ggagatgaaa tgcataccag tatggaagca ggagttatga taagaaaaaa tgacatgaaa 1440
tcaagtgttt ggttggcagg atacgaaaaa agcaatgtat taaccggatt agcttcaggc 1500
tttcagggaa aagcccagat aggtaaaggc atgtgggcaa tgcctgatct tatggcagaa 1560
atgttaaaac aaaaagtagg acatcttcag gctggagcca atacagcatg ggtaccttca 1620
ccaacagcag ccactttaca tgccttgcac tatcatgaag tatccgtagt tgatgtacag 1680
aatcaacttg ctaacaattc tacaaatttg agggatgata ttttacaggt acctcttgca 1740
aaagagccaa attggacaaa agaggaagtt caacaggaat tggacaacaa tgcgcaaggc 1800
attttaggat acgtggtaag atgggtagac caaggtatag gttgttctaa agtgcctgac 1860
ataaatgatg ttggacttat ggaagatagg gcaactctaa gaatatcatc acaacatgta 1920
gcaaattggc ttcatcacgg aatatgtact aaggaacagg tacttgctac tcttcagaga 1980
atggccaaag tagtggattc tcaaaatgct ggtgatgcta attatcagcc aatggctcct 2040
cactacgagg aatctatagc attccaggca gcctgtgatt tagtattcaa aggctatgat 2100
cagccaaatg gatatacaga gcctatattg catgcaagaa gaatagaggc taaggcaaaa 2160
caagcaatag aacagaaata a 2181
<210> 36
<211> 726
<212> PRT
<213> Bacillus sp. VT 712
<400> 36
Met Val Ala Tyr Lys Gln Ile Gly Lys Leu Gln Val Ala Pro Val Leu
1 5 10 15
Tyr Asn Phe Ile Asn Glu Glu Ala Leu Pro Glu Thr Gly Leu Gln Glu
20 25 30
Glu Ala Phe Trp Ala Gly Phe Glu Gln Leu Ile His Glu Leu Thr Pro
35 40 45
Glu Asn Lys Ala Leu Leu Ala Lys Arg Asp Glu Leu Gln Ala Lys Leu
50 55 60
Asn Arg Trp Tyr Arg Glu Asn Arg Asp Ser Phe Asp Phe Glu Ala Tyr
65 70 75 80
Lys Ala Phe Leu Thr Ser Ile Gly Tyr Leu Glu Ala Asp Val Ala Asp
85 90 95
Phe Gln Ile Ser Thr Ala Asn Val Asp Asp Glu Ile Ala Leu Gln Ala
100 105 110
Gly Pro Gln Leu Val Val Pro Val Asn Asn Ala Arg Tyr Ala Ile Asn
115 120 125
Ala Ala Asn Ala Arg Trp Gly Ser Leu Tyr Asp Ala Leu Tyr Gly Thr
130 135 140
Asp Ala Ile Ser Ser Glu Asn Gly Ala Gly Val Gln Ser Gln Tyr Asn
145 150 155 160
Pro Ile Arg Gly Glu Lys Val Ile Thr Phe Ala Lys Ser Phe Leu Asn
165 170 175
His Thr Ile Pro Leu Lys Glu Gly Lys His Glu Asp Val Val Gln Tyr
180 185 190
Val Val Thr Asn Lys Met Glu Ala Leu Leu Gln Asp Gly Thr Thr Thr
195 200 205
Glu Leu Lys Glu Pro Ser Lys Trp Val Gly Tyr Gln Gly Asp Gly Ser
210 215 220
Asn Pro Ser Ala Leu Leu Phe Lys Asn Asn Gly Leu His Phe Glu Ile
225 230 235 240
Gln Ile Asp Arg Gln Asp Ala Ile Gly Lys Ser Asp Asp Ala Gly Val
245 250 255
Lys Asp Val Leu Leu Glu Ser Ala Val Thr Thr Ile Met Asp Cys Glu
260 265 270
Asp Ser Val Ala Ala Val Asp Ala Glu Asp Lys Val Glu Val Tyr Arg
275 280 285
Asn Trp Leu Gly Leu Met Lys Gly Asp Leu Lys Ala Arg Phe Lys Lys
290 295 300
Gly Ala Lys Thr Met Thr Arg Thr Leu Asn Asp Asp Arg Gln Tyr Lys
305 310 315 320
Thr Ala Asn Gly Asp Thr Val Thr Leu Ser Gly Arg Ser Leu Met Phe
325 330 335
Val Arg Asn Val Gly His Leu Met Ser Asn Ser Ala Ile Leu Asp Ala
340 345 350
Asn Gly Asp Glu Ile Gln Glu Gly Ile Leu Asp Ser Ile Ile Thr Ser
355 360 365
Leu Ile Ala Lys His Thr Leu Leu Gly Thr Gly Lys Tyr Gln Asn Ser
370 375 380
Gln Lys Gly Ser Val Tyr Ile Val Lys Pro Lys Met His Gly Ser Glu
385 390 395 400
Glu Val Ala Phe Ala Asn Lys Leu Phe Asp Arg Val Glu Asp Leu Val
405 410 415
Gly Leu Pro Arg His Thr Leu Lys Ile Gly Val Met Asp Glu Glu Arg
420 425 430
Arg Thr Ser Leu Asn Leu Lys Ala Cys Ile Glu Lys Val Lys Asn Arg
435 440 445
Val Ala Phe Ile Asn Thr Gly Phe Leu Asp Arg Thr Gly Asp Glu Met
450 455 460
His Thr Ser Met Glu Ala Gly Val Met Ile Arg Lys Asn Asp Met Lys
465 470 475 480
Ser Ser Val Trp Leu Ala Gly Tyr Glu Lys Ser Asn Val Leu Thr Gly
485 490 495
Leu Ala Ser Gly Phe Gln Gly Lys Ala Gln Ile Gly Lys Gly Met Trp
500 505 510
Ala Met Pro Asp Leu Met Ala Glu Met Leu Lys Gln Lys Val Gly His
515 520 525
Leu Gln Ala Gly Ala Asn Thr Ala Trp Val Pro Ser Pro Thr Ala Ala
530 535 540
Thr Leu His Ala Leu His Tyr His Glu Val Ser Val Val Asp Val Gln
545 550 555 560
Asn Gln Leu Ala Asn Asn Ser Thr Asn Leu Arg Asp Asp Ile Leu Gln
565 570 575
Val Pro Leu Ala Lys Glu Pro Asn Trp Thr Lys Glu Glu Val Gln Gln
580 585 590
Glu Leu Asp Asn Asn Ala Gln Gly Ile Leu Gly Tyr Val Val Arg Trp
595 600 605
Val Asp Gln Gly Ile Gly Cys Ser Lys Val Pro Asp Ile Asn Asp Val
610 615 620
Gly Leu Met Glu Asp Arg Ala Thr Leu Arg Ile Ser Ser Gln His Val
625 630 635 640
Ala Asn Trp Leu His His Gly Ile Cys Thr Lys Glu Gln Val Leu Ala
645 650 655
Thr Leu Gln Arg Met Ala Lys Val Val Asp Ser Gln Asn Ala Gly Asp
660 665 670
Ala Asn Tyr Gln Pro Met Ala Pro His Tyr Glu Glu Ser Ile Ala Phe
675 680 685
Gln Ala Ala Cys Asp Leu Val Phe Lys Gly Tyr Asp Gln Pro Asn Gly
690 695 700
Tyr Thr Glu Pro Ile Leu His Ala Arg Arg Ile Glu Ala Lys Ala Lys
705 710 715 720
Gln Ala Ile Glu Gln Lys
725
<210> 37
<211> 2181
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 37
atggcaaact atagaaaaat aggaaattta caggtagacg aggcacttca tcaatttctt 60
caaaaagagg ctttaccagg tacaggactt gaagaaaagg ctttttggaa tggatttgag 120
aaacttatag aagtattaac tccagaaaat aaaagacttc ttgcaaagag agaagagctt 180
caaagagaac ttgatagata tcactcagag aaaagagatg atttttcatt tgaagcatac 240
aagcaatttt tacttgattt aggatatctt ttacctgaac ctggagagtt caaaataagg 300
acagaaaatg tagatgatga gattgctctt caagcaggac cacaattggt cgttcctgtc 360
aataattcaa gatattcaat aaacgcagca aatgctcgct ggggtagctt atatgatgcc 420
ttgtatggaa cagatgctat aagcgaagaa ggcggcgctg agagatctat agagtacaat 480
agagttagag gaaataaagt tatagaattt gcaaagggat tcttagatca ggcagctgca 540
cttgacggtg catcccacaa agaagcagtt agatattccg caaaggaagg ttctttagtt 600
ataactttga aagatggaag ttcctctaaa ttaaaagatc aagaggcttt tgctgggtat 660
agaggagata aagaccatcc agaggctgta ttacttaaac atcatggatt gcattttgaa 720
atacagatag atagggcaag tgacatcgga aagtcagatc ctgctggtat taaagatata 780
ttattggaag cagcagtaac tgttataatg gattgtgaag attctgtagc tgctgtagat 840
gctgaagata aggtacttgt atatagaaat tggcttggat tgatgaaagg agaactttcc 900
gcagatttta gcaagggcgg caaaataata tcaagaaaat taaatggtgt acgtcattat 960
agagatcctg aaggaaatct tttttcattg cctggaagat cattactttt cgtaagaaat 1020
gtaggtcatc ttatgactaa cccagctgtt ttggataaag aaggaaatga agtttatgaa 1080
ggtattctag atgcagtatt cacatcttta gctggaatgc acagcttatt aaatactgaa 1140
gagcccgcaa actcaagaaa aggatctata tatatagtta agccaaaaat gcacgggcca 1200
gaagaagttg cttatgcagg agaactattt gataaaactg aagatctttt aggacttgac 1260
agaaacactc ttaaaattgg attaatggat gaagaaagga gaacttcatt aaatttaaag 1320
tcttgtataa aagaagtaaa agatcgtatt gtatttataa atacaggttt tttagataga 1380
acaggtgatg aaatacattc atctatggaa gcaggaccta tggtgagaaa gggagaaatg 1440
aaaaaatcaa actggcttca ggcttatgaa acttcaaatg tttccacggg tctttcagca 1500
ggattttctg gtaaggcaca gatcggaaag ggtatgtggg caatgccaga taaaatgaaa 1560
gaaatgctgg aacagaaagg tgcccagttg aaaactggtg ctaatacagc atgggttcca 1620
tctccatctg cagcagtact tcatgcccta cattatcatc aaataaatgt taaaggtata 1680
caagagaaag aatgccaaaa tccgtctctt tatcgtgacg aaatgctgtc aataccagtt 1740
gaaacctgtg gttcttggtc aagtgaagaa attcaagttg aaatagaaaa taatgcacaa 1800
ggtatattgg gatacgtagt tagatgggta gaacagggta taggatgctc taaagtccct 1860
gatattcatg atgtaggcct catggaagat agagcaactt taagaataag tagtcagcat 1920
cttgctaatt ggatacatca caagatagtt tcaagagaac aggtaatgaa tgctttaaaa 1980
aagatggcta aaattgtaga tgcacaaaat gaaaatgaac cgggctataa aagaatgagc 2040
gatgacttct ctacatctgt tgcattccag gctgcctgtg aattaatatt tgaaggcaga 2100
aatcaaccta atggatatac ggaacctatt ctccacaaga gaagattaga ggctaaatcc 2160
aaaatggcag taagacaata a 2181
<210> 38
<211> 726
<212> PRT
<213> Bacillus infantis NRRL B-14911
<400> 38
Met Ala Asn Tyr Arg Lys Ile Gly Asn Leu Gln Val Asp Glu Ala Leu
1 5 10 15
His Gln Phe Leu Gln Lys Glu Ala Leu Pro Gly Thr Gly Leu Glu Glu
20 25 30
Lys Ala Phe Trp Asn Gly Phe Glu Lys Leu Ile Glu Val Leu Thr Pro
35 40 45
Glu Asn Lys Arg Leu Leu Ala Lys Arg Glu Glu Leu Gln Arg Glu Leu
50 55 60
Asp Arg Tyr His Ser Glu Lys Arg Asp Asp Phe Ser Phe Glu Ala Tyr
65 70 75 80
Lys Gln Phe Leu Leu Asp Leu Gly Tyr Leu Leu Pro Glu Pro Gly Glu
85 90 95
Phe Lys Ile Arg Thr Glu Asn Val Asp Asp Glu Ile Ala Leu Gln Ala
100 105 110
Gly Pro Gln Leu Val Val Pro Val Asn Asn Ser Arg Tyr Ser Ile Asn
115 120 125
Ala Ala Asn Ala Arg Trp Gly Ser Leu Tyr Asp Ala Leu Tyr Gly Thr
130 135 140
Asp Ala Ile Ser Glu Glu Gly Gly Ala Glu Arg Ser Ile Glu Tyr Asn
145 150 155 160
Arg Val Arg Gly Asn Lys Val Ile Glu Phe Ala Lys Gly Phe Leu Asp
165 170 175
Gln Ala Ala Ala Leu Asp Gly Ala Ser His Lys Glu Ala Val Arg Tyr
180 185 190
Ser Ala Lys Glu Gly Ser Leu Val Ile Thr Leu Lys Asp Gly Ser Ser
195 200 205
Ser Lys Leu Lys Asp Gln Glu Ala Phe Ala Gly Tyr Arg Gly Asp Lys
210 215 220
Asp His Pro Glu Ala Val Leu Leu Lys His His Gly Leu His Phe Glu
225 230 235 240
Ile Gln Ile Asp Arg Ala Ser Asp Ile Gly Lys Ser Asp Pro Ala Gly
245 250 255
Ile Lys Asp Ile Leu Leu Glu Ala Ala Val Thr Val Ile Met Asp Cys
260 265 270
Glu Asp Ser Val Ala Ala Val Asp Ala Glu Asp Lys Val Leu Val Tyr
275 280 285
Arg Asn Trp Leu Gly Leu Met Lys Gly Glu Leu Ser Ala Asp Phe Ser
290 295 300
Lys Gly Gly Lys Ile Ile Ser Arg Lys Leu Asn Gly Val Arg His Tyr
305 310 315 320
Arg Asp Pro Glu Gly Asn Leu Phe Ser Leu Pro Gly Arg Ser Leu Leu
325 330 335
Phe Val Arg Asn Val Gly His Leu Met Thr Asn Pro Ala Val Leu Asp
340 345 350
Lys Glu Gly Asn Glu Val Tyr Glu Gly Ile Leu Asp Ala Val Phe Thr
355 360 365
Ser Leu Ala Gly Met His Ser Leu Leu Asn Thr Glu Glu Pro Ala Asn
370 375 380
Ser Arg Lys Gly Ser Ile Tyr Ile Val Lys Pro Lys Met His Gly Pro
385 390 395 400
Glu Glu Val Ala Tyr Ala Gly Glu Leu Phe Asp Lys Thr Glu Asp Leu
405 410 415
Leu Gly Leu Asp Arg Asn Thr Leu Lys Ile Gly Leu Met Asp Glu Glu
420 425 430
Arg Arg Thr Ser Leu Asn Leu Lys Ser Cys Ile Lys Glu Val Lys Asp
435 440 445
Arg Ile Val Phe Ile Asn Thr Gly Phe Leu Asp Arg Thr Gly Asp Glu
450 455 460
Ile His Ser Ser Met Glu Ala Gly Pro Met Val Arg Lys Gly Glu Met
465 470 475 480
Lys Lys Ser Asn Trp Leu Gln Ala Tyr Glu Thr Ser Asn Val Ser Thr
485 490 495
Gly Leu Ser Ala Gly Phe Ser Gly Lys Ala Gln Ile Gly Lys Gly Met
500 505 510
Trp Ala Met Pro Asp Lys Met Lys Glu Met Leu Glu Gln Lys Gly Ala
515 520 525
Gln Leu Lys Thr Gly Ala Asn Thr Ala Trp Val Pro Ser Pro Ser Ala
530 535 540
Ala Val Leu His Ala Leu His Tyr His Gln Ile Asn Val Lys Gly Ile
545 550 555 560
Gln Glu Lys Glu Cys Gln Asn Pro Ser Leu Tyr Arg Asp Glu Met Leu
565 570 575
Ser Ile Pro Val Glu Thr Cys Gly Ser Trp Ser Ser Glu Glu Ile Gln
580 585 590
Val Glu Ile Glu Asn Asn Ala Gln Gly Ile Leu Gly Tyr Val Val Arg
595 600 605
Trp Val Glu Gln Gly Ile Gly Cys Ser Lys Val Pro Asp Ile His Asp
610 615 620
Val Gly Leu Met Glu Asp Arg Ala Thr Leu Arg Ile Ser Ser Gln His
625 630 635 640
Leu Ala Asn Trp Ile His His Lys Ile Val Ser Arg Glu Gln Val Met
645 650 655
Asn Ala Leu Lys Lys Met Ala Lys Ile Val Asp Ala Gln Asn Glu Asn
660 665 670
Glu Pro Gly Tyr Lys Arg Met Ser Asp Asp Phe Ser Thr Ser Val Ala
675 680 685
Phe Gln Ala Ala Cys Glu Leu Ile Phe Glu Gly Arg Asn Gln Pro Asn
690 695 700
Gly Tyr Thr Glu Pro Ile Leu His Lys Arg Arg Leu Glu Ala Lys Ser
705 710 715 720
Lys Met Ala Val Arg Gln
725
<210> 39
<211> 855
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 39
atgtatttag tagataaaga agtaattcat gaaacatttg gcaaaggttc agtagtaaat 60
tataatgata attatattaa gattgacttt gaatcaggcg caaagaaatt tgtatttcct 120
gacgtatttg ggaaatatat gactcttgta gatcaggaag cagtaaactt agttaatatg 180
aaaatacaga aaagagaaga agaaaagaaa aaagaggaac ttaagttaat taaagaaaaa 240
gatcttgaaa gagaaagaca gcatatactg gagcaaaaaa aaactatgca atccaggaaa 300
attcatccaa aacaacaggt agtattctgg tgtgaaaccg gagaggaaga taaaatattt 360
actgagggta ggatatttat aggtaaggta aagagtggag aaaataaggg tcagccgaag 420
agattagcaa gaatgacctg gaaatcaggc tgcttactaa caaggcgtga accaggtatg 480
cctgaaaaag acagaaggat attaggagta tttatggctg aagaaggttt caatggtcaa 540
acctgtaagg atggctatat tccagcccat cctgaatata aacttagact tagtgaacaa 600
gaatcagata aaatgttatt ttggaattat tatataaata agaacttccc tactagaatg 660
acttggaatt caggcagaca gagatatttt aacaatattt ggatggcaca aatacttcaa 720
gatattgtaa gcttaaaaaa taaacctgaa gaaagggaaa atgcacagag attctttgaa 780
cacttctgta aagttaacca tataaatgaa gataaacttc ctaaggcaaa tggtgccttg 840
atgcaaattc aataa 855
<210> 40
<211> 284
<212> PRT
<213> Clostridium cochlearium
<400> 40
Met Tyr Leu Val Asp Lys Glu Val Ile His Glu Thr Phe Gly Lys Gly
1 5 10 15
Ser Val Val Asn Tyr Asn Asp Asn Tyr Ile Lys Ile Asp Phe Glu Ser
20 25 30
Gly Ala Lys Lys Phe Val Phe Pro Asp Val Phe Gly Lys Tyr Met Thr
35 40 45
Leu Val Asp Gln Glu Ala Val Asn Leu Val Asn Met Lys Ile Gln Lys
50 55 60
Arg Glu Glu Glu Lys Lys Lys Glu Glu Leu Lys Leu Ile Lys Glu Lys
65 70 75 80
Asp Leu Glu Arg Glu Arg Gln His Ile Leu Glu Gln Lys Lys Thr Met
85 90 95
Gln Ser Arg Lys Ile His Pro Lys Gln Gln Val Val Phe Trp Cys Glu
100 105 110
Thr Gly Glu Glu Asp Lys Ile Phe Thr Glu Gly Arg Ile Phe Ile Gly
115 120 125
Lys Val Lys Ser Gly Glu Asn Lys Gly Gln Pro Lys Arg Leu Ala Arg
130 135 140
Met Thr Trp Lys Ser Gly Cys Leu Leu Thr Arg Arg Glu Pro Gly Met
145 150 155 160
Pro Glu Lys Asp Arg Arg Ile Leu Gly Val Phe Met Ala Glu Glu Gly
165 170 175
Phe Asn Gly Gln Thr Cys Lys Asp Gly Tyr Ile Pro Ala His Pro Glu
180 185 190
Tyr Lys Leu Arg Leu Ser Glu Gln Glu Ser Asp Lys Met Leu Phe Trp
195 200 205
Asn Tyr Tyr Ile Asn Lys Asn Phe Pro Thr Arg Met Thr Trp Asn Ser
210 215 220
Gly Arg Gln Arg Tyr Phe Asn Asn Ile Trp Met Ala Gln Ile Leu Gln
225 230 235 240
Asp Ile Val Ser Leu Lys Asn Lys Pro Glu Glu Arg Glu Asn Ala Gln
245 250 255
Arg Phe Phe Glu His Phe Cys Lys Val Asn His Ile Asn Glu Asp Lys
260 265 270
Leu Pro Lys Ala Asn Gly Ala Leu Met Gln Ile Gln
275 280
<210> 41
<211> 2178
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 41
atgactaact ataaacaagt aggcaattta aaagtagcac cagtactata tcaattcata 60
aatgaagaag cattaccggg cagtggactt tccacggaaa acttttggtc tgattttgag 120
gctttagtaa ctgagcttac tcctgttaat aaaagactcc ttgaaaaaag ggatcagctt 180
caggcacaaa taaatgcatg gcatcaagaa aatccagatg gtgatttctc tgaatacaag 240
agtttcctaa ctcgtattgg atatcttgag gataaaacag aggatttttt aattggaacg 300
gaaggtgttg acagtgaaat tgcttatcag gctggtcctc aattagtggt tccggtgaat 360
aacgcaaggt atgcaataaa tgctgctaat gcaagatggg gaagtttgta tgatgcttta 420
tatggcactg atgctatttc agaagaaaat ggtgcgtcaa gaactagttc ctacaatcct 480
attaggggag aaaaagttat agcttttgca aaaaatttcc ttgatgaagt tgtaccttta 540
gtccagagct ctcatgcaga ggttgttcaa tacagtttgg aaaatgaaaa attagtagca 600
caattaaatg atggtagctt aacagaactt caagaagaag aaaaattcgt tggatatcag 660
ggagaagaag aatcaccaga tgccttgtta ttcaaaaaca atggacttca ttttgaagtt 720
caaatagata gaacagattc cataggaaaa acagacgatg caggagttaa agatatactt 780
atggaagcag cacttacaac tataatggat tgcgaagatt ctgtagctgc tgttgatgca 840
gaagacaagg ttgacgtgta tagaaactgg ttaggtctta tgaaaggaga tttaactagt 900
acatttaaga agggatctca aaatatgaca agaagattaa atccggatag aacttatata 960
agtccagata agaaaaagat attattgtcg ggaagatcac ttatgtttgt aagaaatgtt 1020
ggacatctta tgactaattc tgctgtatta gatagaaatg gtaacgaaat atacgagggt 1080
attttggatt ctgttattac atctttaatt gcaaaacata ccttattaaa gaatggtact 1140
tatcaaaatt ctaagaaatc aagtatatac attgttaaac caaaaatgca tggatcaaaa 1200
gaagttgctt ttgccaacac attatttaac tctatagaag atatgttagg gttagagcgt 1260
catactataa aaattggagt tatggatgag gaaagaagaa caactttaaa tcttaaagcc 1320
tgtataaagg aagtaaagga cagagtagct tttataaata ctggttttct tgacagaact 1380
ggagatgaaa tacacacatc aatggaagcc ggagcagtta taagaaaaaa cgatatgaag 1440
gcttcaaaat ggcttcaagg atatgaacaa tcaaatgtaa atgtaggatt agctagtgga 1500
tttcaaggaa gggcacaaat aggtaaggga atgtgggcta tgccggatat gatggcagaa 1560
atgcttaaac aaaaagtagg tcatcttaaa gcaggagcca atacggcatg ggttcctagt 1620
cctacagcag caacccttca tgccctacat tatcatcaaa ttgatgttag agatgtacaa 1680
aacgagttac ttacacaatc cacagatctt caggatgata tattacaaat tccagttgct 1740
gaaaagccta attggtctaa agatgaaata cagcaagaat tagataataa tgcacaagga 1800
atacttggat atgtagttag atgggtagat cagggtgtag gttgttcaaa agttccagat 1860
ataaataatg taggacttat ggaagatcgg gctacactgc gcatctcaag tcagcatgta 1920
gcaaattggt tgcatcatgg tatttgtact aaagaacaag ttactgaaac attaaaaaga 1980
atggcgaaag ttgtagatca gcaaaatgaa aatgatccat tatatcagcc tatgagttca 2040
aattacagtg catcaatagc atttcaggct gcgtgcgatc ttgtattcca gggatacgac 2100
caacctaatg gatacacaga accaatattg catagaagaa ggattgaagc aaaggctaaa 2160
gcagcaataa aacaataa 2178
<210> 42
<211> 725
<212> PRT
<213> Bacillus megaterium
<400> 42
Met Thr Asn Tyr Lys Gln Val Gly Asn Leu Lys Val Ala Pro Val Leu
1 5 10 15
Tyr Gln Phe Ile Asn Glu Glu Ala Leu Pro Gly Ser Gly Leu Ser Thr
20 25 30
Glu Asn Phe Trp Ser Asp Phe Glu Ala Leu Val Thr Glu Leu Thr Pro
35 40 45
Val Asn Lys Arg Leu Leu Glu Lys Arg Asp Gln Leu Gln Ala Gln Ile
50 55 60
Asn Ala Trp His Gln Glu Asn Pro Asp Gly Asp Phe Ser Glu Tyr Lys
65 70 75 80
Ser Phe Leu Thr Arg Ile Gly Tyr Leu Glu Asp Lys Thr Glu Asp Phe
85 90 95
Leu Ile Gly Thr Glu Gly Val Asp Ser Glu Ile Ala Tyr Gln Ala Gly
100 105 110
Pro Gln Leu Val Val Pro Val Asn Asn Ala Arg Tyr Ala Ile Asn Ala
115 120 125
Ala Asn Ala Arg Trp Gly Ser Leu Tyr Asp Ala Leu Tyr Gly Thr Asp
130 135 140
Ala Ile Ser Glu Glu Asn Gly Ala Ser Arg Thr Ser Ser Tyr Asn Pro
145 150 155 160
Ile Arg Gly Glu Lys Val Ile Ala Phe Ala Lys Asn Phe Leu Asp Glu
165 170 175
Val Val Pro Leu Val Gln Ser Ser His Ala Glu Val Val Gln Tyr Ser
180 185 190
Leu Glu Asn Glu Lys Leu Val Ala Gln Leu Asn Asp Gly Ser Leu Thr
195 200 205
Glu Leu Gln Glu Glu Glu Lys Phe Val Gly Tyr Gln Gly Glu Glu Glu
210 215 220
Ser Pro Asp Ala Leu Leu Phe Lys Asn Asn Gly Leu His Phe Glu Val
225 230 235 240
Gln Ile Asp Arg Thr Asp Ser Ile Gly Lys Thr Asp Asp Ala Gly Val
245 250 255
Lys Asp Ile Leu Met Glu Ala Ala Leu Thr Thr Ile Met Asp Cys Glu
260 265 270
Asp Ser Val Ala Ala Val Asp Ala Glu Asp Lys Val Asp Val Tyr Arg
275 280 285
Asn Trp Leu Gly Leu Met Lys Gly Asp Leu Thr Ser Thr Phe Lys Lys
290 295 300
Gly Ser Gln Asn Met Thr Arg Arg Leu Asn Pro Asp Arg Thr Tyr Ile
305 310 315 320
Ser Pro Asp Lys Lys Lys Ile Leu Leu Ser Gly Arg Ser Leu Met Phe
325 330 335
Val Arg Asn Val Gly His Leu Met Thr Asn Ser Ala Val Leu Asp Arg
340 345 350
Asn Gly Asn Glu Ile Tyr Glu Gly Ile Leu Asp Ser Val Ile Thr Ser
355 360 365
Leu Ile Ala Lys His Thr Leu Leu Lys Asn Gly Thr Tyr Gln Asn Ser
370 375 380
Lys Lys Ser Ser Ile Tyr Ile Val Lys Pro Lys Met His Gly Ser Lys
385 390 395 400
Glu Val Ala Phe Ala Asn Thr Leu Phe Asn Ser Ile Glu Asp Met Leu
405 410 415
Gly Leu Glu Arg His Thr Ile Lys Ile Gly Val Met Asp Glu Glu Arg
420 425 430
Arg Thr Thr Leu Asn Leu Lys Ala Cys Ile Lys Glu Val Lys Asp Arg
435 440 445
Val Ala Phe Ile Asn Thr Gly Phe Leu Asp Arg Thr Gly Asp Glu Ile
450 455 460
His Thr Ser Met Glu Ala Gly Ala Val Ile Arg Lys Asn Asp Met Lys
465 470 475 480
Ala Ser Lys Trp Leu Gln Gly Tyr Glu Gln Ser Asn Val Asn Val Gly
485 490 495
Leu Ala Ser Gly Phe Gln Gly Arg Ala Gln Ile Gly Lys Gly Met Trp
500 505 510
Ala Met Pro Asp Met Met Ala Glu Met Leu Lys Gln Lys Val Gly His
515 520 525
Leu Lys Ala Gly Ala Asn Thr Ala Trp Val Pro Ser Pro Thr Ala Ala
530 535 540
Thr Leu His Ala Leu His Tyr His Gln Ile Asp Val Arg Asp Val Gln
545 550 555 560
Asn Glu Leu Leu Thr Gln Ser Thr Asp Leu Gln Asp Asp Ile Leu Gln
565 570 575
Ile Pro Val Ala Glu Lys Pro Asn Trp Ser Lys Asp Glu Ile Gln Gln
580 585 590
Glu Leu Asp Asn Asn Ala Gln Gly Ile Leu Gly Tyr Val Val Arg Trp
595 600 605
Val Asp Gln Gly Val Gly Cys Ser Lys Val Pro Asp Ile Asn Asn Val
610 615 620
Gly Leu Met Glu Asp Arg Ala Thr Leu Arg Ile Ser Ser Gln His Val
625 630 635 640
Ala Asn Trp Leu His His Gly Ile Cys Thr Lys Glu Gln Val Thr Glu
645 650 655
Thr Leu Lys Arg Met Ala Lys Val Val Asp Gln Gln Asn Glu Asn Asp
660 665 670
Pro Leu Tyr Gln Pro Met Ser Ser Asn Tyr Ser Ala Ser Ile Ala Phe
675 680 685
Gln Ala Ala Cys Asp Leu Val Phe Gln Gly Tyr Asp Gln Pro Asn Gly
690 695 700
Tyr Thr Glu Pro Ile Leu His Arg Arg Arg Ile Glu Ala Lys Ala Lys
705 710 715 720
Ala Ala Ile Lys Gln
725
<210> 43
<211> 1581
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 43
atgtcaagac cagcagcagg acttgcagta ttaggaccac cactttcgtc agcagcacaa 60
gaattattag gtaaacgcgc attagcattc gttcaattac tagaacagca atttggacat 120
agaagaagag aattacttca ggctagacag cacagacaac agagatttga cggcggcgaa 180
aagcctgatt ttagatctga tactcttgca gttaggacgg gagaatggag tgtagctcca 240
gctccagcag aattacgcga caggagagtt gaaattactg gtcctgctgg agatagaaag 300
atggttataa atgctttaaa ttccggagca agagtattca tgtgtgatct tgaagacgct 360
aattcaccaa cttgggctaa cactatgaat ggtcagttaa atataagaga tgctgaggca 420
ggaactatag cttatgaatc accagaagga aaggcttata gacttgctcc agatcatgca 480
gtaattaaaa taagaccaag aggatggcat cttgaagaat ctcatgtagc atgggaagga 540
caaagtgttt ctgcagcttt atttgacttt ggaatggctg catttcataa tgcaagagaa 600
aaagcaagaa gaggatctgg cttgtacttc tatttaccta agttagaatc tatggaagaa 660
gcagaactat gggaagacgt attcacattt gcagaaagag agcttggtct tgaaagaggt 720
atgtttaggg ctacagtttt aatagaaacc ctaccagctg cctttgaaat ggaagaaata 780
ctttttgttc ttagagatca tgccgacgga ttgaattgtg gaagatggga ttacatattt 840
agttatatta aaaagttaag agcacaccca gaggctatat taccagatag aagtttggtt 900
actatggata gcccttttat ggcagcttat gctagacttg cagtacagac ttgtcataga 960
agaggcgcat tctgcatagg cggcatggct gcacagattc caatcaagaa tgattctgct 1020
gccaacgaac aagcactgga taaggtaaga cttgacaaat taagagaggt tagattaggg 1080
catgatggta cttgggttgc tcatcctgga cttgtagcag ttgctgaaaa agtatttaat 1140
gaacacatgc caggagataa tcaacttttc ttccatcctg atggttctgt tggtgctgaa 1200
caattgcttg aggctcctag aggaccaatt actgaggctg gagttagatt aaatttgtca 1260
gtttcacttc aatacattga ggcatggttg agaggtacag gtgcagttcc aataaacagc 1320
cttatggaag atgcagcaac tgctgaaatt tcaagagcac agttatggca gtggatacgg 1380
catccacaag gcatattaga agatggaaga aaaatgagtg cagatttata cagaaaatta 1440
ttagaagaag agcttggaaa attaccagca gcagcatcag gtgcttatgg acgggcagaa 1500
gaacttctta cagcaatgac tcttgccgat acttttgctg agttccttac tgtagacgct 1560
tatagatatc ttcaagatta g 1581
<210> 44
<211> 526
<212> PRT
<213> Paenibacillus sp. RU4X
<400> 44
Met Ser Arg Pro Ala Ala Gly Leu Ala Val Leu Gly Pro Pro Leu Ser
1 5 10 15
Ser Ala Ala Gln Glu Leu Leu Gly Lys Arg Ala Leu Ala Phe Val Gln
20 25 30
Leu Leu Glu Gln Gln Phe Gly His Arg Arg Arg Glu Leu Leu Gln Ala
35 40 45
Arg Gln His Arg Gln Gln Arg Phe Asp Gly Gly Glu Lys Pro Asp Phe
50 55 60
Arg Ser Asp Thr Leu Ala Val Arg Thr Gly Glu Trp Ser Val Ala Pro
65 70 75 80
Ala Pro Ala Glu Leu Arg Asp Arg Arg Val Glu Ile Thr Gly Pro Ala
85 90 95
Gly Asp Arg Lys Met Val Ile Asn Ala Leu Asn Ser Gly Ala Arg Val
100 105 110
Phe Met Cys Asp Leu Glu Asp Ala Asn Ser Pro Thr Trp Ala Asn Thr
115 120 125
Met Asn Gly Gln Leu Asn Ile Arg Asp Ala Glu Ala Gly Thr Ile Ala
130 135 140
Tyr Glu Ser Pro Glu Gly Lys Ala Tyr Arg Leu Ala Pro Asp His Ala
145 150 155 160
Val Ile Lys Ile Arg Pro Arg Gly Trp His Leu Glu Glu Ser His Val
165 170 175
Ala Trp Glu Gly Gln Ser Val Ser Ala Ala Leu Phe Asp Phe Gly Met
180 185 190
Ala Ala Phe His Asn Ala Arg Glu Lys Ala Arg Arg Gly Ser Gly Leu
195 200 205
Tyr Phe Tyr Leu Pro Lys Leu Glu Ser Met Glu Glu Ala Glu Leu Trp
210 215 220
Glu Asp Val Phe Thr Phe Ala Glu Arg Glu Leu Gly Leu Glu Arg Gly
225 230 235 240
Met Phe Arg Ala Thr Val Leu Ile Glu Thr Leu Pro Ala Ala Phe Glu
245 250 255
Met Glu Glu Ile Leu Phe Val Leu Arg Asp His Ala Asp Gly Leu Asn
260 265 270
Cys Gly Arg Trp Asp Tyr Ile Phe Ser Tyr Ile Lys Lys Leu Arg Ala
275 280 285
His Pro Glu Ala Ile Leu Pro Asp Arg Ser Leu Val Thr Met Asp Ser
290 295 300
Pro Phe Met Ala Ala Tyr Ala Arg Leu Ala Val Gln Thr Cys His Arg
305 310 315 320
Arg Gly Ala Phe Cys Ile Gly Gly Met Ala Ala Gln Ile Pro Ile Lys
325 330 335
Asn Asp Ser Ala Ala Asn Glu Gln Ala Leu Asp Lys Val Arg Leu Asp
340 345 350
Lys Leu Arg Glu Val Arg Leu Gly His Asp Gly Thr Trp Val Ala His
355 360 365
Pro Gly Leu Val Ala Val Ala Glu Lys Val Phe Asn Glu His Met Pro
370 375 380
Gly Asp Asn Gln Leu Phe Phe His Pro Asp Gly Ser Val Gly Ala Glu
385 390 395 400
Gln Leu Leu Glu Ala Pro Arg Gly Pro Ile Thr Glu Ala Gly Val Arg
405 410 415
Leu Asn Leu Ser Val Ser Leu Gln Tyr Ile Glu Ala Trp Leu Arg Gly
420 425 430
Thr Gly Ala Val Pro Ile Asn Ser Leu Met Glu Asp Ala Ala Thr Ala
435 440 445
Glu Ile Ser Arg Ala Gln Leu Trp Gln Trp Ile Arg His Pro Gln Gly
450 455 460
Ile Leu Glu Asp Gly Arg Lys Met Ser Ala Asp Leu Tyr Arg Lys Leu
465 470 475 480
Leu Glu Glu Glu Leu Gly Lys Leu Pro Ala Ala Ala Ser Gly Ala Tyr
485 490 495
Gly Arg Ala Glu Glu Leu Leu Thr Ala Met Thr Leu Ala Asp Thr Phe
500 505 510
Ala Glu Phe Leu Thr Val Asp Ala Tyr Arg Tyr Leu Gln Asp
515 520 525
<210> 45
<211> 1599
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 45
atgaaacaag caacaacagg aaaacttaaa atagttggag aacaaaatga gcatacaaac 60
gaaatactta ccccagaggc tttagaattt gttttagcac ttcatgaaaa atttgatgca 120
agaagaaagg aattattaaa tgcaagacaa aagagacaga agagattaga tgctggtgaa 180
aagctagatt tccttccaga gacaaaacat attagagaag gtgactggtc tatagctcct 240
cttccacaag atcttcagga tagacgtgtg gaaataactg gaccagtaga tagaaagatg 300
gtaataaatg ccttaaattc aggcgcaaag atgtttatgg catgttttga agatgcttca 360
agcccaactt gggaaaatat gataggcggc caaataaata tgagagatgc tataaataag 420
acaattgaat ttactcaggc ttcaaacggt aagacataca agctcaatgc ggaaactgct 480
gtattattag ttaggcctag aggattacat cttttagaaa agcacgtttt agttcatgac 540
gaacctatat caggctcatt ttttgacttt ggattatatt tatttcataa tgccaaaaat 600
gcactagcta aaggaacagg tccttatttt tatttaccaa aacttgaatc acatctcgaa 660
gcaagacttt ggaatgatgt atttgtattt gcccaggatt atataggcat accacaagga 720
actataaagg ctactgtact cattgaaact atccttgctg catttgaaat ggatgaaatc 780
ctatatgaat tgagagaaca ttcagctgga cttaactgtg gaagatggga ttatatattc 840
agctatataa aaagacttag aaatcaggca gatgtaatac ttcctgatag gggacaagtt 900
actatgacag tgccttttat gaaggcttat acatcacttt gtattcaaac ctgtcacaaa 960
aggaatgctc ctgctatggg cggcatggct gcacaaatac ctataaaaaa cgatgatgaa 1020
gcgaatgctg tggcatttgc aaaggttgct gaggataaaa ggagagaggc tacagaagga 1080
catgatggta catgggttgc ccatccagga atggttgcaa ctgcaatgga acaatttgat 1140
gctattatga ctactcctaa tcaaatacat aaaaagagag aagatgtaca agttactgca 1200
gatgacctag ttgcagttcc agaaggtact ataactcttg aaggacttag agtaaattgt 1260
tcggttggag tacagtatat tgcaagttgg cttaggggaa atggggctgc ccctataaat 1320
aatcttatgg aagatgcagc aacagcagaa atttcaagaa ctcaagtatg gcaatgggtg 1380
agacacccaa aaggaatatt agatgatggc agaggaataa ctttagcttt tgttcttgaa 1440
atattggaag aagaattagt taaaattaaa gaggctgttg gtgaacaggc ttataattct 1500
ggaagatttg aagaggctgc tgaattattc aaatccctca tagaacaaga tgaatttgca 1560
gagttcctta cactaccagg atacgaaaaa ttggcataa 1599
<210> 46
<211> 532
<212> PRT
<213> Lysinibacillus sp. A1
<400> 46
Met Lys Gln Ala Thr Thr Gly Lys Leu Lys Ile Val Gly Glu Gln Asn
1 5 10 15
Glu His Thr Asn Glu Ile Leu Thr Pro Glu Ala Leu Glu Phe Val Leu
20 25 30
Ala Leu His Glu Lys Phe Asp Ala Arg Arg Lys Glu Leu Leu Asn Ala
35 40 45
Arg Gln Lys Arg Gln Lys Arg Leu Asp Ala Gly Glu Lys Leu Asp Phe
50 55 60
Leu Pro Glu Thr Lys His Ile Arg Glu Gly Asp Trp Ser Ile Ala Pro
65 70 75 80
Leu Pro Gln Asp Leu Gln Asp Arg Arg Val Glu Ile Thr Gly Pro Val
85 90 95
Asp Arg Lys Met Val Ile Asn Ala Leu Asn Ser Gly Ala Lys Met Phe
100 105 110
Met Ala Cys Phe Glu Asp Ala Ser Ser Pro Thr Trp Glu Asn Met Ile
115 120 125
Gly Gly Gln Ile Asn Met Arg Asp Ala Ile Asn Lys Thr Ile Glu Phe
130 135 140
Thr Gln Ala Ser Asn Gly Lys Thr Tyr Lys Leu Asn Ala Glu Thr Ala
145 150 155 160
Val Leu Leu Val Arg Pro Arg Gly Leu His Leu Leu Glu Lys His Val
165 170 175
Leu Val His Asp Glu Pro Ile Ser Gly Ser Phe Phe Asp Phe Gly Leu
180 185 190
Tyr Leu Phe His Asn Ala Lys Asn Ala Leu Ala Lys Gly Thr Gly Pro
195 200 205
Tyr Phe Tyr Leu Pro Lys Leu Glu Ser His Leu Glu Ala Arg Leu Trp
210 215 220
Asn Asp Val Phe Val Phe Ala Gln Asp Tyr Ile Gly Ile Pro Gln Gly
225 230 235 240
Thr Ile Lys Ala Thr Val Leu Ile Glu Thr Ile Leu Ala Ala Phe Glu
245 250 255
Met Asp Glu Ile Leu Tyr Glu Leu Arg Glu His Ser Ala Gly Leu Asn
260 265 270
Cys Gly Arg Trp Asp Tyr Ile Phe Ser Tyr Ile Lys Arg Leu Arg Asn
275 280 285
Gln Ala Asp Val Ile Leu Pro Asp Arg Gly Gln Val Thr Met Thr Val
290 295 300
Pro Phe Met Lys Ala Tyr Thr Ser Leu Cys Ile Gln Thr Cys His Lys
305 310 315 320
Arg Asn Ala Pro Ala Met Gly Gly Met Ala Ala Gln Ile Pro Ile Lys
325 330 335
Asn Asp Asp Glu Ala Asn Ala Val Ala Phe Ala Lys Val Ala Glu Asp
340 345 350
Lys Arg Arg Glu Ala Thr Glu Gly His Asp Gly Thr Trp Val Ala His
355 360 365
Pro Gly Met Val Ala Thr Ala Met Glu Gln Phe Asp Ala Ile Met Thr
370 375 380
Thr Pro Asn Gln Ile His Lys Lys Arg Glu Asp Val Gln Val Thr Ala
385 390 395 400
Asp Asp Leu Val Ala Val Pro Glu Gly Thr Ile Thr Leu Glu Gly Leu
405 410 415
Arg Val Asn Cys Ser Val Gly Val Gln Tyr Ile Ala Ser Trp Leu Arg
420 425 430
Gly Asn Gly Ala Ala Pro Ile Asn Asn Leu Met Glu Asp Ala Ala Thr
435 440 445
Ala Glu Ile Ser Arg Thr Gln Val Trp Gln Trp Val Arg His Pro Lys
450 455 460
Gly Ile Leu Asp Asp Gly Arg Gly Ile Thr Leu Ala Phe Val Leu Glu
465 470 475 480
Ile Leu Glu Glu Glu Leu Val Lys Ile Lys Glu Ala Val Gly Glu Gln
485 490 495
Ala Tyr Asn Ser Gly Arg Phe Glu Glu Ala Ala Glu Leu Phe Lys Ser
500 505 510
Leu Ile Glu Gln Asp Glu Phe Ala Glu Phe Leu Thr Leu Pro Gly Tyr
515 520 525
Glu Lys Leu Ala
530
<210> 47
<211> 1590
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 47
atgtcaacaa gaacatcaag agttacatta cctggagaaa tgttaccagc ttataacgaa 60
atacttaccc cagaagtttt atcattcctt aaagaattac atgaaaattt taatgaaaga 120
cgaacggaat tacttcaaaa aagggttgaa aaacaaaaaa ggattgatgc gggtgaattt 180
ccaaaatttt tagaagaaac aaagcacatc agagaggctg attggacaat cgccaatctt 240
cctaaagacc ttgaagacag aagagtagaa ataacaggtc ctgtagatcg taaaatggtt 300
attaatgcat tgaattcagg agcacactta tttatggctg attttgaaga ttccaattca 360
ccaacttggg aaaatactat agaaggacaa ataaatttaa gagatgcagt aaaagggaca 420
ataagtcata aaaatgataa gggaaaagaa tataggttaa atgacaaaac agcagtttta 480
atagttaggc ctagaggatg gcacttagaa gaaaagcaca tgcaggttga tggaaagaat 540
atgtcgggat ctcttgtaga ttttggatta tatttttttc ataatgcaaa ggctctatta 600
gaaaaaggtt caggaccata cttctattta cctaaaatgg aatcttatct tgaagcaaga 660
ctttggaacg atgtatttgt atttgctcaa aagtatatag gtataccaaa tggaactatc 720
aaggcaactg tattattgga aactatccat gcatcatttg aaatggatga aattctttat 780
gaattaaaag atcattcagc aggattaaat tgtggacgct gggattatat tttttctttc 840
ctaaaaggat ttagaaacca caatgaattt cttttaccag atagggctca agtaactatg 900
actgctcctt ttatgagggc ttattctctc aaggtaatcc aaacttgtca tagaagaaat 960
gcaccagcta taggcggcat ggctgcacaa attcctataa aaaataatcc agaggctaat 1020
gaagcagcat ttgaaaaagt aagagcagat aaagaaagag aagcattaga tggtcatgac 1080
ggtacttggg tagcacatcc tggcttagtt cccgttgcta tggaagtatt taatcatatc 1140
atgaaaactc ctaatcagat atttcgcaaa agagaagaga taagagttac ggaaaaggat 1200
ttacttgaag ttcctgtagg tacaatcact gaagaagggt taagaactaa catatctgtt 1260
ggaatacagt acatagcatc atggttatca ggaagagggg ctgcccctat atataatctc 1320
atggaagatg cagctactgc agaaatttcc agggctcaaa tttggcaatg gataagacat 1380
gaaggcggca aactaaacga tggtagaaat attacattgg aattaatgga agaatggaaa 1440
gaagaagaat tggtaaagat agaacgggaa ataggaaaag aggcattcaa aaaaggcaga 1500
tttcaagagg ctactacatt atttacaaat ttgataagaa atgatgaatt tgtcccattc 1560
cttactttac ctggatacga gatattataa 1590
<210> 48
<211> 529
<212> PRT
<213> Bacillus cereus
<400> 48
Met Ser Thr Arg Thr Ser Arg Val Thr Leu Pro Gly Glu Met Leu Pro
1 5 10 15
Ala Tyr Asn Glu Ile Leu Thr Pro Glu Val Leu Ser Phe Leu Lys Glu
20 25 30
Leu His Glu Asn Phe Asn Glu Arg Arg Thr Glu Leu Leu Gln Lys Arg
35 40 45
Val Glu Lys Gln Lys Arg Ile Asp Ala Gly Glu Phe Pro Lys Phe Leu
50 55 60
Glu Glu Thr Lys His Ile Arg Glu Ala Asp Trp Thr Ile Ala Asn Leu
65 70 75 80
Pro Lys Asp Leu Glu Asp Arg Arg Val Glu Ile Thr Gly Pro Val Asp
85 90 95
Arg Lys Met Val Ile Asn Ala Leu Asn Ser Gly Ala His Leu Phe Met
100 105 110
Ala Asp Phe Glu Asp Ser Asn Ser Pro Thr Trp Glu Asn Thr Ile Glu
115 120 125
Gly Gln Ile Asn Leu Arg Asp Ala Val Lys Gly Thr Ile Ser His Lys
130 135 140
Asn Asp Lys Gly Lys Glu Tyr Arg Leu Asn Asp Lys Thr Ala Val Leu
145 150 155 160
Ile Val Arg Pro Arg Gly Trp His Leu Glu Glu Lys His Met Gln Val
165 170 175
Asp Gly Lys Asn Met Ser Gly Ser Leu Val Asp Phe Gly Leu Tyr Phe
180 185 190
Phe His Asn Ala Lys Ala Leu Leu Glu Lys Gly Ser Gly Pro Tyr Phe
195 200 205
Tyr Leu Pro Lys Met Glu Ser Tyr Leu Glu Ala Arg Leu Trp Asn Asp
210 215 220
Val Phe Val Phe Ala Gln Lys Tyr Ile Gly Ile Pro Asn Gly Thr Ile
225 230 235 240
Lys Ala Thr Val Leu Leu Glu Thr Ile His Ala Ser Phe Glu Met Asp
245 250 255
Glu Ile Leu Tyr Glu Leu Lys Asp His Ser Ala Gly Leu Asn Cys Gly
260 265 270
Arg Trp Asp Tyr Ile Phe Ser Phe Leu Lys Gly Phe Arg Asn His Asn
275 280 285
Glu Phe Leu Leu Pro Asp Arg Ala Gln Val Thr Met Thr Ala Pro Phe
290 295 300
Met Arg Ala Tyr Ser Leu Lys Val Ile Gln Thr Cys His Arg Arg Asn
305 310 315 320
Ala Pro Ala Ile Gly Gly Met Ala Ala Gln Ile Pro Ile Lys Asn Asn
325 330 335
Pro Glu Ala Asn Glu Ala Ala Phe Glu Lys Val Arg Ala Asp Lys Glu
340 345 350
Arg Glu Ala Leu Asp Gly His Asp Gly Thr Trp Val Ala His Pro Gly
355 360 365
Leu Val Pro Val Ala Met Glu Val Phe Asn His Ile Met Lys Thr Pro
370 375 380
Asn Gln Ile Phe Arg Lys Arg Glu Glu Ile Arg Val Thr Glu Lys Asp
385 390 395 400
Leu Leu Glu Val Pro Val Gly Thr Ile Thr Glu Glu Gly Leu Arg Thr
405 410 415
Asn Ile Ser Val Gly Ile Gln Tyr Ile Ala Ser Trp Leu Ser Gly Arg
420 425 430
Gly Ala Ala Pro Ile Tyr Asn Leu Met Glu Asp Ala Ala Thr Ala Glu
435 440 445
Ile Ser Arg Ala Gln Ile Trp Gln Trp Ile Arg His Glu Gly Gly Lys
450 455 460
Leu Asn Asp Gly Arg Asn Ile Thr Leu Glu Leu Met Glu Glu Trp Lys
465 470 475 480
Glu Glu Glu Leu Val Lys Ile Glu Arg Glu Ile Gly Lys Glu Ala Phe
485 490 495
Lys Lys Gly Arg Phe Gln Glu Ala Thr Thr Leu Phe Thr Asn Leu Ile
500 505 510
Arg Asn Asp Glu Phe Val Pro Phe Leu Thr Leu Pro Gly Tyr Glu Ile
515 520 525
Leu
<210> 49
<211> 1425
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 49
atgcagcaca aattattaat taacggagaa cttgtaagtg gagaaggaga aaaacaacca 60
gtatataacc cagcaactgg agatgtatta ttagaaatag cagaggcatc agcagaacag 120
gtagatgctg cagttagggc agcagacgca gcatttgcag agtggggaca aactactcct 180
aaagtgcgtg cagaatgtct tctaaaactt gcagacgtta tagaggaaaa tggacaagta 240
tttgctgaat tggagtcgag aaactgcggt aaacctttac attcagcatt taatgatgaa 300
ataccagcaa tagtagatgt attcagattt tttgctggtg cagctaggtg tcttaacgga 360
ctagcagctg gagagtatct tgaaggacat acatcaatga taagaagaga tccattaggt 420
gtagttgcca gtatagctcc ttggaactat cctttgatga tggcagcatg gaaacttgcc 480
cccgcccttg cagcaggaaa ttgtgttgta ttgaaaccaa gtgaaataac ccctcttaca 540
gcattaaaat tagctgaatt agcaaaggac atcttcccag ctggtgttat aaatatacta 600
tttggaagag gcaaaacagt tggtgatcct ttgacaggac atcctaaggt aaggatggtt 660
agccttacag gctcaatagc aacaggcgaa catattatat cacacacggc atcttctata 720
aaacgcacgc acatggaatt gggcggcaaa gccccggtta ttgtatttga tgatgcagat 780
atagaggcag tagtagaagg agttagaact tttggatatt ataatgctgg ccaagattgt 840
actgctgctt gtaggattta tgctcaaaaa ggtatttatg atacacttgt tgaaaagcta 900
ggtgctgcag ttgcaaccct taagtctggt gcaccagatg atgaatctac agaattggga 960
cctttatctt ctttagcaca ccttgaaaga gttagcaaag cagttgaaga ggctaaggct 1020
actggacata taaaggtaat aacaggcggc gaaaagagaa agggaaatgg atattattat 1080
gctcctacgc ttttagctgg tgcccttcag gatgatgcta tagtacagaa agaagtattt 1140
ggaccagtag taagtgtaac tccttttgat aatgaagaac aggtagttaa ctgggccaat 1200
gatagccagt acggattagc gtcttctgta tggacaaagg atgtaggcag agcacatagg 1260
gtatcagcaa gacttcaata tggatgtact tgggtaaata ctcactttat gttagtaagt 1320
gagatgccac atggcggcca aaagttgtca ggatatggaa aagatatgag cttatacggt 1380
ttggaagact atacagtagt aagacacgta atggtaaaac attag 1425
<210> 50
<211> 474
<212> PRT
<213> Escherichia coli
<400> 50
Met Gln His Lys Leu Leu Ile Asn Gly Glu Leu Val Ser Gly Glu Gly
1 5 10 15
Glu Lys Gln Pro Val Tyr Asn Pro Ala Thr Gly Asp Val Leu Leu Glu
20 25 30
Ile Ala Glu Ala Ser Ala Glu Gln Val Asp Ala Ala Val Arg Ala Ala
35 40 45
Asp Ala Ala Phe Ala Glu Trp Gly Gln Thr Thr Pro Lys Val Arg Ala
50 55 60
Glu Cys Leu Leu Lys Leu Ala Asp Val Ile Glu Glu Asn Gly Gln Val
65 70 75 80
Phe Ala Glu Leu Glu Ser Arg Asn Cys Gly Lys Pro Leu His Ser Ala
85 90 95
Phe Asn Asp Glu Ile Pro Ala Ile Val Asp Val Phe Arg Phe Phe Ala
100 105 110
Gly Ala Ala Arg Cys Leu Asn Gly Leu Ala Ala Gly Glu Tyr Leu Glu
115 120 125
Gly His Thr Ser Met Ile Arg Arg Asp Pro Leu Gly Val Val Ala Ser
130 135 140
Ile Ala Pro Trp Asn Tyr Pro Leu Met Met Ala Ala Trp Lys Leu Ala
145 150 155 160
Pro Ala Leu Ala Ala Gly Asn Cys Val Val Leu Lys Pro Ser Glu Ile
165 170 175
Thr Pro Leu Thr Ala Leu Lys Leu Ala Glu Leu Ala Lys Asp Ile Phe
180 185 190
Pro Ala Gly Val Ile Asn Ile Leu Phe Gly Arg Gly Lys Thr Val Gly
195 200 205
Asp Pro Leu Thr Gly His Pro Lys Val Arg Met Val Ser Leu Thr Gly
210 215 220
Ser Ile Ala Thr Gly Glu His Ile Ile Ser His Thr Ala Ser Ser Ile
225 230 235 240
Lys Arg Thr His Met Glu Leu Gly Gly Lys Ala Pro Val Ile Val Phe
245 250 255
Asp Asp Ala Asp Ile Glu Ala Val Val Glu Gly Val Arg Thr Phe Gly
260 265 270
Tyr Tyr Asn Ala Gly Gln Asp Cys Thr Ala Ala Cys Arg Ile Tyr Ala
275 280 285
Gln Lys Gly Ile Tyr Asp Thr Leu Val Glu Lys Leu Gly Ala Ala Val
290 295 300
Ala Thr Leu Lys Ser Gly Ala Pro Asp Asp Glu Ser Thr Glu Leu Gly
305 310 315 320
Pro Leu Ser Ser Leu Ala His Leu Glu Arg Val Ser Lys Ala Val Glu
325 330 335
Glu Ala Lys Ala Thr Gly His Ile Lys Val Ile Thr Gly Gly Glu Lys
340 345 350
Arg Lys Gly Asn Gly Tyr Tyr Tyr Ala Pro Thr Leu Leu Ala Gly Ala
355 360 365
Leu Gln Asp Asp Ala Ile Val Gln Lys Glu Val Phe Gly Pro Val Val
370 375 380
Ser Val Thr Pro Phe Asp Asn Glu Glu Gln Val Val Asn Trp Ala Asn
385 390 395 400
Asp Ser Gln Tyr Gly Leu Ala Ser Ser Val Trp Thr Lys Asp Val Gly
405 410 415
Arg Ala His Arg Val Ser Ala Arg Leu Gln Tyr Gly Cys Thr Trp Val
420 425 430
Asn Thr His Phe Met Leu Val Ser Glu Met Pro His Gly Gly Gln Lys
435 440 445
Leu Ser Gly Tyr Gly Lys Asp Met Ser Leu Tyr Gly Leu Glu Asp Tyr
450 455 460
Thr Val Val Arg His Val Met Val Lys His
465 470
<210> 51
<211> 1440
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 51
atgtcagttc cggttcagca cccaatgtat attgatggac aatttgtaac ttggcgagga 60
gatgcatgga tagatgttgt gaatccagcg actgaggcag ttatctctag gattcctgat 120
ggtcaggcag aggatgccag aaaagcaata gatgctgcag aaagggctca accagaatgg 180
gaagcgttac ctgctattga aagggcttcc tggttacgaa aaatttcagc aggaataaga 240
gaaagagcat cagaaatatc agcactaata gttgaagaag gcggcaaaat tcaacaactt 300
gcagaggttg aagtagcatt tacagcggat tatattgatt acatggctga atgggcaaga 360
agatacgaag gagagattat tcaatctgat agaccaggag aaaatatctt attattcaaa 420
agagcattag gtgttacaac aggcattctt ccttggaatt ttccattctt cctaattgca 480
agaaagatgg ccccagcact acttacagga aatactattg taataaaacc ttcagaattt 540
actcctaata atgctatagc ttttgctaaa attgtagatg aaataggact tccaagaggt 600
gtatttaatc tagtactagg acgtggtgaa actgtaggac aagaattagc tggaaatccg 660
aaggtagcaa tggtttctat gactggatca gtttccgctg gtgaaaaaat aatggcgact 720
gcagctaaaa acattacaaa agtatgcttg gagcttggcg gcaaagcacc agcaattgta 780
atggatgatg cagatttaga acttgcagta aaggctattg tagattcaag agtaataaac 840
agtggtcagg tatgcaattg tgctgaacgt atttatgtac aaaaaggtat atatgatcaa 900
tttgtaaatc gattgggtga agcaatgcaa gcagtacaat ttggaaaccc agctgaacgg 960
aacgatatag cgatgggacc tttaataaat gcagcagcac ttgaaagagt tgaacaaaaa 1020
gtagctaggg ctgtggaaga aggagcaaga gttgcattgg gcggcaaggc agttgaaggt 1080
aaaggatatt attatcctcc tacactttta ctagatgttc ttcaagaaat gagtataatg 1140
catgaagaaa cttttggacc tgtattacca gttgtagctt ttgatacttt agaagaggct 1200
atatcaatgg caaatgattc tgactatggc ttaactagca gcatatacac tcaaaatcta 1260
aacgtagcta tgaaggctat taaagggtta aaatttggtg agacttatat aaatagagaa 1320
aactttgagg ctatgcaagg ttttcatgct ggatggagaa aaagtggtat tggcggcgct 1380
gacggaaagc atggacttca tgaatattta cagactcagg ttgtttatct tcaatcttaa 1440
<210> 52
<211> 479
<212> PRT
<213> Escherichia coli
<400> 52
Met Ser Val Pro Val Gln His Pro Met Tyr Ile Asp Gly Gln Phe Val
1 5 10 15
Thr Trp Arg Gly Asp Ala Trp Ile Asp Val Val Asn Pro Ala Thr Glu
20 25 30
Ala Val Ile Ser Arg Ile Pro Asp Gly Gln Ala Glu Asp Ala Arg Lys
35 40 45
Ala Ile Asp Ala Ala Glu Arg Ala Gln Pro Glu Trp Glu Ala Leu Pro
50 55 60
Ala Ile Glu Arg Ala Ser Trp Leu Arg Lys Ile Ser Ala Gly Ile Arg
65 70 75 80
Glu Arg Ala Ser Glu Ile Ser Ala Leu Ile Val Glu Glu Gly Gly Lys
85 90 95
Ile Gln Gln Leu Ala Glu Val Glu Val Ala Phe Thr Ala Asp Tyr Ile
100 105 110
Asp Tyr Met Ala Glu Trp Ala Arg Arg Tyr Glu Gly Glu Ile Ile Gln
115 120 125
Ser Asp Arg Pro Gly Glu Asn Ile Leu Leu Phe Lys Arg Ala Leu Gly
130 135 140
Val Thr Thr Gly Ile Leu Pro Trp Asn Phe Pro Phe Phe Leu Ile Ala
145 150 155 160
Arg Lys Met Ala Pro Ala Leu Leu Thr Gly Asn Thr Ile Val Ile Lys
165 170 175
Pro Ser Glu Phe Thr Pro Asn Asn Ala Ile Ala Phe Ala Lys Ile Val
180 185 190
Asp Glu Ile Gly Leu Pro Arg Gly Val Phe Asn Leu Val Leu Gly Arg
195 200 205
Gly Glu Thr Val Gly Gln Glu Leu Ala Gly Asn Pro Lys Val Ala Met
210 215 220
Val Ser Met Thr Gly Ser Val Ser Ala Gly Glu Lys Ile Met Ala Thr
225 230 235 240
Ala Ala Lys Asn Ile Thr Lys Val Cys Leu Glu Leu Gly Gly Lys Ala
245 250 255
Pro Ala Ile Val Met Asp Asp Ala Asp Leu Glu Leu Ala Val Lys Ala
260 265 270
Ile Val Asp Ser Arg Val Ile Asn Ser Gly Gln Val Cys Asn Cys Ala
275 280 285
Glu Arg Ile Tyr Val Gln Lys Gly Ile Tyr Asp Gln Phe Val Asn Arg
290 295 300
Leu Gly Glu Ala Met Gln Ala Val Gln Phe Gly Asn Pro Ala Glu Arg
305 310 315 320
Asn Asp Ile Ala Met Gly Pro Leu Ile Asn Ala Ala Ala Leu Glu Arg
325 330 335
Val Glu Gln Lys Val Ala Arg Ala Val Glu Glu Gly Ala Arg Val Ala
340 345 350
Leu Gly Gly Lys Ala Val Glu Gly Lys Gly Tyr Tyr Tyr Pro Pro Thr
355 360 365
Leu Leu Leu Asp Val Leu Gln Glu Met Ser Ile Met His Glu Glu Thr
370 375 380
Phe Gly Pro Val Leu Pro Val Val Ala Phe Asp Thr Leu Glu Glu Ala
385 390 395 400
Ile Ser Met Ala Asn Asp Ser Asp Tyr Gly Leu Thr Ser Ser Ile Tyr
405 410 415
Thr Gln Asn Leu Asn Val Ala Met Lys Ala Ile Lys Gly Leu Lys Phe
420 425 430
Gly Glu Thr Tyr Ile Asn Arg Glu Asn Phe Glu Ala Met Gln Gly Phe
435 440 445
His Ala Gly Trp Arg Lys Ser Gly Ile Gly Gly Ala Asp Gly Lys His
450 455 460
Gly Leu His Glu Tyr Leu Gln Thr Gln Val Val Tyr Leu Gln Ser
465 470 475
<210> 53
<211> 1449
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 53
atgaaattaa atgattcaaa actttttaga caacaagcct taataaatgg agaatggtta 60
gatgcaaata acggagaagt aatagatgtt actaatccag caaatggtga taaacttggt 120
tctgttccaa agatgggagc agatgaaacc agggctgcta tagatgcagc aaatagagca 180
cttccagcat ggagagcact tacagcaaaa gaacgggcaa atatacttag aaattggttt 240
aatcttttaa tggaacatca ggatgatcta gcaaggctta tgacgcttga acagggaaaa 300
cctcttgctg aggctaaagg agagatcagt tatgcagcgt catttataga atggtttgct 360
gaagaaggaa aaaggattta tggagatact ataccaggac atcaggcaga caaaagactt 420
atagttatta aacaacctat aggtgtaact gctgctataa ctccttggaa cttcccagca 480
gctatgataa ctagaaaagc aggaccagct cttgctgctg gttgcactat ggttttaaaa 540
cctgcttccc agactccttt tagtgccctt gcacttgctg aattagctat tcgtgctggt 600
attccagcgg gtgtattcaa tgtagttact ggatctgctg gtgcggttgg aaatgagctt 660
acatcaaatc cgcttgtaag aaaactttca tttacaggaa gtacagaaat aggtaggcaa 720
ttaatggaac aatgtgctaa agatattaag aaagtttcac tggagttagg cggcaatgcc 780
ccttttattg tatttgatga tgcagactta gataaagcag ttgaaggtgc tttaagttct 840
aaatttagga atgctggaca aacttgtgta tgtgcgaata gattatacgt ccaagacgga 900
gtttacgata gatttgcaga aaaacttcaa caggctgtat ctaaattaca cattggagat 960
gggttagaga aaggcgttac aattggccca ttgatagatg aaaaagcagt agctaaagtt 1020
gaggaacaca ttgctgatgc acttgaaaaa ggtgctagag ttgtttgcgg cggcaaggct 1080
gatgaaagag gcggcaactt tttccagcct actatacttg tagacgttcc agctaatgca 1140
aaggtatcaa aagaggaaac ctttggtcca cttgctcctt tatttagatt taaggatgag 1200
gcagatgtta tagcacaggc aaatgatacc gaatttggac ttgcagctta tttctatgct 1260
agggatttat ccagggtttt tagagttggt gaggctttag agtacggcat tgttggaata 1320
aatactggaa taatatcaaa tgaagttgca ccatttggcg gcataaaggc tagtggatta 1380
gggagagaag gctcaaaata tggaatagaa gactatttgg aaataaaata tatgtgcatt 1440
ggcttataa 1449
<210> 54
<211> 482
<212> PRT
<213> Escherichia coli
<400> 54
Met Lys Leu Asn Asp Ser Lys Leu Phe Arg Gln Gln Ala Leu Ile Asn
1 5 10 15
Gly Glu Trp Leu Asp Ala Asn Asn Gly Glu Val Ile Asp Val Thr Asn
20 25 30
Pro Ala Asn Gly Asp Lys Leu Gly Ser Val Pro Lys Met Gly Ala Asp
35 40 45
Glu Thr Arg Ala Ala Ile Asp Ala Ala Asn Arg Ala Leu Pro Ala Trp
50 55 60
Arg Ala Leu Thr Ala Lys Glu Arg Ala Asn Ile Leu Arg Asn Trp Phe
65 70 75 80
Asn Leu Leu Met Glu His Gln Asp Asp Leu Ala Arg Leu Met Thr Leu
85 90 95
Glu Gln Gly Lys Pro Leu Ala Glu Ala Lys Gly Glu Ile Ser Tyr Ala
100 105 110
Ala Ser Phe Ile Glu Trp Phe Ala Glu Glu Gly Lys Arg Ile Tyr Gly
115 120 125
Asp Thr Ile Pro Gly His Gln Ala Asp Lys Arg Leu Ile Val Ile Lys
130 135 140
Gln Pro Ile Gly Val Thr Ala Ala Ile Thr Pro Trp Asn Phe Pro Ala
145 150 155 160
Ala Met Ile Thr Arg Lys Ala Gly Pro Ala Leu Ala Ala Gly Cys Thr
165 170 175
Met Val Leu Lys Pro Ala Ser Gln Thr Pro Phe Ser Ala Leu Ala Leu
180 185 190
Ala Glu Leu Ala Ile Arg Ala Gly Ile Pro Ala Gly Val Phe Asn Val
195 200 205
Val Thr Gly Ser Ala Gly Ala Val Gly Asn Glu Leu Thr Ser Asn Pro
210 215 220
Leu Val Arg Lys Leu Ser Phe Thr Gly Ser Thr Glu Ile Gly Arg Gln
225 230 235 240
Leu Met Glu Gln Cys Ala Lys Asp Ile Lys Lys Val Ser Leu Glu Leu
245 250 255
Gly Gly Asn Ala Pro Phe Ile Val Phe Asp Asp Ala Asp Leu Asp Lys
260 265 270
Ala Val Glu Gly Ala Leu Ser Ser Lys Phe Arg Asn Ala Gly Gln Thr
275 280 285
Cys Val Cys Ala Asn Arg Leu Tyr Val Gln Asp Gly Val Tyr Asp Arg
290 295 300
Phe Ala Glu Lys Leu Gln Gln Ala Val Ser Lys Leu His Ile Gly Asp
305 310 315 320
Gly Leu Glu Lys Gly Val Thr Ile Gly Pro Leu Ile Asp Glu Lys Ala
325 330 335
Val Ala Lys Val Glu Glu His Ile Ala Asp Ala Leu Glu Lys Gly Ala
340 345 350
Arg Val Val Cys Gly Gly Lys Ala Asp Glu Arg Gly Gly Asn Phe Phe
355 360 365
Gln Pro Thr Ile Leu Val Asp Val Pro Ala Asn Ala Lys Val Ser Lys
370 375 380
Glu Glu Thr Phe Gly Pro Leu Ala Pro Leu Phe Arg Phe Lys Asp Glu
385 390 395 400
Ala Asp Val Ile Ala Gln Ala Asn Asp Thr Glu Phe Gly Leu Ala Ala
405 410 415
Tyr Phe Tyr Ala Arg Asp Leu Ser Arg Val Phe Arg Val Gly Glu Ala
420 425 430
Leu Glu Tyr Gly Ile Val Gly Ile Asn Thr Gly Ile Ile Ser Asn Glu
435 440 445
Val Ala Pro Phe Gly Gly Ile Lys Ala Ser Gly Leu Gly Arg Glu Gly
450 455 460
Ser Lys Tyr Gly Ile Glu Asp Tyr Leu Glu Ile Lys Tyr Met Cys Ile
465 470 475 480
Gly Leu
<210> 55
<211> 1443
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 55
atgactgaaa aaaataattt attcataaat ggatcttggg ttgctcctaa aggcggcgaa 60
tggattaaag ttgaaaaccc agctacaaag gcagtagtgg cagaagtagc aaagggcggc 120
caggctgacg tagatgctgc tgtatcagca gctaagtcag catttattgg atggtcaaga 180
aggatggcaa ctgagagagc agattatata catgcattaa aagatcttgt gaaaagggat 240
aaagaaaaat tagcagctat tataactagt gaaatgggga aaccattgaa agaggctaga 300
atagaagtag attttgcaat tggattactt agatttgcag cagaaaatgt tttaagactt 360
cagggagaaa taataccagg atcttctcca gaagaaaaga tattaattga tagggtacct 420
ttgggagtaa taggtgctat aacagcatgg aattttcctc ttgcactttg tgcaagaaag 480
attggacctg ctgtggcagc gggaaatact atagttgtaa aaccacatga attaacgcca 540
ttagcttgtc tacatcttgc taaattagtt gaagaggcaa agatcccaca tggagttata 600
aatgttgtaa caggtgatgg caaagatgta ggagtacctc tagtagcaca taaagatatt 660
aaattaataa ctatgacagg ttccacgcct gctggaaaaa aaattatggc agcagctagt 720
gagacactta aagaagttag gttagaactt ggcggcaaag caccatttat ggttatggaa 780
gatgctgata ttgacagggc agcagatgct gccgttacag caagatttaa taatgcggga 840
caggtatgta cttgtaatga aagaacctac attcatgaag cagtttacga caaatttgtt 900
caaaaagtta gagaaaaaat agaagcatta aaagtaggac tgccaacaga tccatctaca 960
gatatgggac ctaaagtatc tgaggacgaa cttaataaag ttcatgagat ggttgaacat 1020
gctgtaagac aaggagcaag attagctata ggcggcaaaa ggttaactgg cggcgtttat 1080
gataagggat acttctatgc accaacactg ttgacagatg taactcaaga tatggacata 1140
gttcacaatg aggtatttgg tcctgtaatg tcattgatta gagttaaaga ttttgatcag 1200
gctatagcat gggcaaatga ttgtagatac gggctaagtg cttatctttt cactaatgat 1260
ctttcaagga tacttaggat gacaagagat cttgaatttg gagaagtata cgtgaaccgt 1320
ccgggcggcg aagcgccaca aggatttcat catggataca aagaatctgg acttggcggc 1380
gaggacggac agcacggaat ggaagcatac gtacagacaa aaacaatata tctaaatgca 1440
taa 1443
<210> 56
<211> 480
<212> PRT
<213> Gluconobacter oxydans
<400> 56
Met Thr Glu Lys Asn Asn Leu Phe Ile Asn Gly Ser Trp Val Ala Pro
1 5 10 15
Lys Gly Gly Glu Trp Ile Lys Val Glu Asn Pro Ala Thr Lys Ala Val
20 25 30
Val Ala Glu Val Ala Lys Gly Gly Gln Ala Asp Val Asp Ala Ala Val
35 40 45
Ser Ala Ala Lys Ser Ala Phe Ile Gly Trp Ser Arg Arg Met Ala Thr
50 55 60
Glu Arg Ala Asp Tyr Ile His Ala Leu Lys Asp Leu Val Lys Arg Asp
65 70 75 80
Lys Glu Lys Leu Ala Ala Ile Ile Thr Ser Glu Met Gly Lys Pro Leu
85 90 95
Lys Glu Ala Arg Ile Glu Val Asp Phe Ala Ile Gly Leu Leu Arg Phe
100 105 110
Ala Ala Glu Asn Val Leu Arg Leu Gln Gly Glu Ile Ile Pro Gly Ser
115 120 125
Ser Pro Glu Glu Lys Ile Leu Ile Asp Arg Val Pro Leu Gly Val Ile
130 135 140
Gly Ala Ile Thr Ala Trp Asn Phe Pro Leu Ala Leu Cys Ala Arg Lys
145 150 155 160
Ile Gly Pro Ala Val Ala Ala Gly Asn Thr Ile Val Val Lys Pro His
165 170 175
Glu Leu Thr Pro Leu Ala Cys Leu His Leu Ala Lys Leu Val Glu Glu
180 185 190
Ala Lys Ile Pro His Gly Val Ile Asn Val Val Thr Gly Asp Gly Lys
195 200 205
Asp Val Gly Val Pro Leu Val Ala His Lys Asp Ile Lys Leu Ile Thr
210 215 220
Met Thr Gly Ser Thr Pro Ala Gly Lys Lys Ile Met Ala Ala Ala Ser
225 230 235 240
Glu Thr Leu Lys Glu Val Arg Leu Glu Leu Gly Gly Lys Ala Pro Phe
245 250 255
Met Val Met Glu Asp Ala Asp Ile Asp Arg Ala Ala Asp Ala Ala Val
260 265 270
Thr Ala Arg Phe Asn Asn Ala Gly Gln Val Cys Thr Cys Asn Glu Arg
275 280 285
Thr Tyr Ile His Glu Ala Val Tyr Asp Lys Phe Val Gln Lys Val Arg
290 295 300
Glu Lys Ile Glu Ala Leu Lys Val Gly Leu Pro Thr Asp Pro Ser Thr
305 310 315 320
Asp Met Gly Pro Lys Val Ser Glu Asp Glu Leu Asn Lys Val His Glu
325 330 335
Met Val Glu His Ala Val Arg Gln Gly Ala Arg Leu Ala Ile Gly Gly
340 345 350
Lys Arg Leu Thr Gly Gly Val Tyr Asp Lys Gly Tyr Phe Tyr Ala Pro
355 360 365
Thr Leu Leu Thr Asp Val Thr Gln Asp Met Asp Ile Val His Asn Glu
370 375 380
Val Phe Gly Pro Val Met Ser Leu Ile Arg Val Lys Asp Phe Asp Gln
385 390 395 400
Ala Ile Ala Trp Ala Asn Asp Cys Arg Tyr Gly Leu Ser Ala Tyr Leu
405 410 415
Phe Thr Asn Asp Leu Ser Arg Ile Leu Arg Met Thr Arg Asp Leu Glu
420 425 430
Phe Gly Glu Val Tyr Val Asn Arg Pro Gly Gly Glu Ala Pro Gln Gly
435 440 445
Phe His His Gly Tyr Lys Glu Ser Gly Leu Gly Gly Glu Asp Gly Gln
450 455 460
His Gly Met Glu Ala Tyr Val Gln Thr Lys Thr Ile Tyr Leu Asn Ala
465 470 475 480
<210> 57
<211> 1434
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 57
atgtcttcag tgcctgtatt ccagaacttt ataaatggac aatttacgca tagtgaagcc 60
catcttgatg tttataatcc cgccacagga gcacttttat caagggtacc agcaagtact 120
tgtgcagatg tagatcaggc tcttgctggt gcaagagcag ctcaaaaagc atggtcagca 180
aaaccagcaa tagaaagggc aggatacctt agacgtattg cttcaaaact tagagaaaat 240
gttgctcatc ttgcaagaac tataactcta gaacaaggaa aaatatcagc attagcagaa 300
gttgaagtaa acttcacagc tgactacctt gattatatgg cagaatgggc tagaagaata 360
gaaggcgaaa taataacttc agatcgccca ggggaaaaca tattcctttt tcgtaaacct 420
ttaggagtag tggcaggaat acttccttgg aatttccctt tcttcttaat cgcaagaaaa 480
atggcaccag cattgcttac aggcaataca attgttataa aaccaagtga agagacacca 540
aataattgtt ttgaatttgc tagacttgta gctgagactg atttacctcc aggagttttt 600
aatgttgtat gtggagatgg aagagtagga gcagcattaa gtgggcataa aggagtagat 660
atgataagct ttacaggctc agttgacaca ggatcacgaa taatgactgc agcagcgact 720
aatattacaa aattaaattt ggaacttggc ggcaaggcac cagctatagt tttggcagat 780
gcagatcttg cattggcagt aaaagcaata agagattcaa gaataataaa tactggacaa 840
gtatgtaatt gtgctgaaag agtatatgtt gagagaaaag tagctgatca atttatagaa 900
agaataagtg ctgcaatgtc agctacaaga tacggagatc cattagctga accggatgta 960
gagatgggac cattaataaa caggcaagga cttgattctg tagaaagaaa agtacgtatt 1020
gctcttcaac agggtgcttc tcttattagt ggcggccgag tagcagatag acctgatgga 1080
ttccattttg agccaactgt attagcagga tgtaatgctt caatggatat tatgagagaa 1140
gaaatatttg ggccagtttt accaatccaa atagtagatg atttagatga agcaatcgct 1200
ttagctaacg actgcgatta tggattaact tcatctgtat atacaaggga ccttggacgt 1260
gctatgcatg ctataagagg attagatttt ggtgaaactt atgttaatag ggaaaatttt 1320
gaggctatgc agggattcca tgctggtgta agaaagtcag gagtaggcgg cgcagatggc 1380
aagcatggat tatatgaata tactcatact catgcagtat atctccagtc ttaa 1434
<210> 58
<211> 477
<212> PRT
<213> Pseudomonas fluorescens
<400> 58
Met Ser Ser Val Pro Val Phe Gln Asn Phe Ile Asn Gly Gln Phe Thr
1 5 10 15
His Ser Glu Ala His Leu Asp Val Tyr Asn Pro Ala Thr Gly Ala Leu
20 25 30
Leu Ser Arg Val Pro Ala Ser Thr Cys Ala Asp Val Asp Gln Ala Leu
35 40 45
Ala Gly Ala Arg Ala Ala Gln Lys Ala Trp Ser Ala Lys Pro Ala Ile
50 55 60
Glu Arg Ala Gly Tyr Leu Arg Arg Ile Ala Ser Lys Leu Arg Glu Asn
65 70 75 80
Val Ala His Leu Ala Arg Thr Ile Thr Leu Glu Gln Gly Lys Ile Ser
85 90 95
Ala Leu Ala Glu Val Glu Val Asn Phe Thr Ala Asp Tyr Leu Asp Tyr
100 105 110
Met Ala Glu Trp Ala Arg Arg Ile Glu Gly Glu Ile Ile Thr Ser Asp
115 120 125
Arg Pro Gly Glu Asn Ile Phe Leu Phe Arg Lys Pro Leu Gly Val Val
130 135 140
Ala Gly Ile Leu Pro Trp Asn Phe Pro Phe Phe Leu Ile Ala Arg Lys
145 150 155 160
Met Ala Pro Ala Leu Leu Thr Gly Asn Thr Ile Val Ile Lys Pro Ser
165 170 175
Glu Glu Thr Pro Asn Asn Cys Phe Glu Phe Ala Arg Leu Val Ala Glu
180 185 190
Thr Asp Leu Pro Pro Gly Val Phe Asn Val Val Cys Gly Asp Gly Arg
195 200 205
Val Gly Ala Ala Leu Ser Gly His Lys Gly Val Asp Met Ile Ser Phe
210 215 220
Thr Gly Ser Val Asp Thr Gly Ser Arg Ile Met Thr Ala Ala Ala Thr
225 230 235 240
Asn Ile Thr Lys Leu Asn Leu Glu Leu Gly Gly Lys Ala Pro Ala Ile
245 250 255
Val Leu Ala Asp Ala Asp Leu Ala Leu Ala Val Lys Ala Ile Arg Asp
260 265 270
Ser Arg Ile Ile Asn Thr Gly Gln Val Cys Asn Cys Ala Glu Arg Val
275 280 285
Tyr Val Glu Arg Lys Val Ala Asp Gln Phe Ile Glu Arg Ile Ser Ala
290 295 300
Ala Met Ser Ala Thr Arg Tyr Gly Asp Pro Leu Ala Glu Pro Asp Val
305 310 315 320
Glu Met Gly Pro Leu Ile Asn Arg Gln Gly Leu Asp Ser Val Glu Arg
325 330 335
Lys Val Arg Ile Ala Leu Gln Gln Gly Ala Ser Leu Ile Ser Gly Gly
340 345 350
Arg Val Ala Asp Arg Pro Asp Gly Phe His Phe Glu Pro Thr Val Leu
355 360 365
Ala Gly Cys Asn Ala Ser Met Asp Ile Met Arg Glu Glu Ile Phe Gly
370 375 380
Pro Val Leu Pro Ile Gln Ile Val Asp Asp Leu Asp Glu Ala Ile Ala
385 390 395 400
Leu Ala Asn Asp Cys Asp Tyr Gly Leu Thr Ser Ser Val Tyr Thr Arg
405 410 415
Asp Leu Gly Arg Ala Met His Ala Ile Arg Gly Leu Asp Phe Gly Glu
420 425 430
Thr Tyr Val Asn Arg Glu Asn Phe Glu Ala Met Gln Gly Phe His Ala
435 440 445
Gly Val Arg Lys Ser Gly Val Gly Gly Ala Asp Gly Lys His Gly Leu
450 455 460
Tyr Glu Tyr Thr His Thr His Ala Val Tyr Leu Gln Ser
465 470 475
<210> 59
<211> 1434
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 59
atgtctcatg ctatatatca gaactatata gctaatgcat ttgtagcatc agatgaacac 60
ttagaggtac acaatccagc gaatggacaa ttgcttgctc atgtacctca gggttcttct 120
gctgaagttg aaagggctat agctgctgca agacaagccc aaaaagcatg ggctgctaga 180
ccagcaatag aaagggctgg atatttaaga aaaatagcat caaaaataag agaacacgga 240
gaaagattag cccgtataat aacagcagaa cagggaaaag ttttagaact ggcaagagtt 300
gaagtaaatt ttacagctga ttatttagac tacatggctg agtgggcaag aagattggaa 360
ggagaggtct tgagttcaga tagaccagga gaatctatat ttttgttaag aaaacctctt 420
ggagttgtcg ctggaatact tccttggaat tttcctttct tccttatagc tagaaaaatg 480
gctccagcac tgcttacagg aaatactata gttataaagc cttctgaaga gactcctata 540
aattgttttg aatttgcaag actggtagca gagacagatc ttccagcggg agtatttaat 600
gttgtatgtg gaactggagc gactgtagga aatgctttaa ctagtcatcc tggaatagat 660
ttgataagct ttacaggctc agttggaaca ggaagtagaa taatggcagc agcagcacca 720
aatataacaa aattgaatct tgaacttggc ggcaaggcac cagccattgt actagctgat 780
gctgatcttg atcttgcagt tagagcaata actgcatcaa gggtaatcaa tacaggtcag 840
gtatgtaact gtgctgaaag agtatacgtg gagagaaagg ttgcagatgc atttattgaa 900
aggattgctg cagcaatggc aggaactaga tatggtgatc cattagcaga aaatgggttg 960
gatatgggtc cacttataaa tagggctgcg ttggacaaag ttgcacaaat ggtaagaact 1020
gcaagtggtc agggtgccca ggttataaca ggcggcgcag ttgccgactt aggacaagga 1080
ttccactacc aacctacagt attagctggc tgctctgcag atatggaaat tatgagaaag 1140
gaaatatttg gtcctgtact tcctatacaa atagtagatg acttagatga ggctattgca 1200
ttatcaaatg attccgaata tggattaaca agctccatat ataccgccag cttaagtgca 1260
gctatgcagg ctacaagaag ccttgatttt ggagaaacct acataaatcg tgaaaacttt 1320
gaagcaatgc aaggttttca tgctggtaca agaaagtctg gcataggcgg cgctgacgga 1380
aagcacgggt tatatgaata tacgcatacc catgtagttt atatccaagc ataa 1434
<210> 60
<211> 477
<212> PRT
<213> Pseudomonas fluorescens
<400> 60
Met Ser His Ala Ile Tyr Gln Asn Tyr Ile Ala Asn Ala Phe Val Ala
1 5 10 15
Ser Asp Glu His Leu Glu Val His Asn Pro Ala Asn Gly Gln Leu Leu
20 25 30
Ala His Val Pro Gln Gly Ser Ser Ala Glu Val Glu Arg Ala Ile Ala
35 40 45
Ala Ala Arg Gln Ala Gln Lys Ala Trp Ala Ala Arg Pro Ala Ile Glu
50 55 60
Arg Ala Gly Tyr Leu Arg Lys Ile Ala Ser Lys Ile Arg Glu His Gly
65 70 75 80
Glu Arg Leu Ala Arg Ile Ile Thr Ala Glu Gln Gly Lys Val Leu Glu
85 90 95
Leu Ala Arg Val Glu Val Asn Phe Thr Ala Asp Tyr Leu Asp Tyr Met
100 105 110
Ala Glu Trp Ala Arg Arg Leu Glu Gly Glu Val Leu Ser Ser Asp Arg
115 120 125
Pro Gly Glu Ser Ile Phe Leu Leu Arg Lys Pro Leu Gly Val Val Ala
130 135 140
Gly Ile Leu Pro Trp Asn Phe Pro Phe Phe Leu Ile Ala Arg Lys Met
145 150 155 160
Ala Pro Ala Leu Leu Thr Gly Asn Thr Ile Val Ile Lys Pro Ser Glu
165 170 175
Glu Thr Pro Ile Asn Cys Phe Glu Phe Ala Arg Leu Val Ala Glu Thr
180 185 190
Asp Leu Pro Ala Gly Val Phe Asn Val Val Cys Gly Thr Gly Ala Thr
195 200 205
Val Gly Asn Ala Leu Thr Ser His Pro Gly Ile Asp Leu Ile Ser Phe
210 215 220
Thr Gly Ser Val Gly Thr Gly Ser Arg Ile Met Ala Ala Ala Ala Pro
225 230 235 240
Asn Ile Thr Lys Leu Asn Leu Glu Leu Gly Gly Lys Ala Pro Ala Ile
245 250 255
Val Leu Ala Asp Ala Asp Leu Asp Leu Ala Val Arg Ala Ile Thr Ala
260 265 270
Ser Arg Val Ile Asn Thr Gly Gln Val Cys Asn Cys Ala Glu Arg Val
275 280 285
Tyr Val Glu Arg Lys Val Ala Asp Ala Phe Ile Glu Arg Ile Ala Ala
290 295 300
Ala Met Ala Gly Thr Arg Tyr Gly Asp Pro Leu Ala Glu Asn Gly Leu
305 310 315 320
Asp Met Gly Pro Leu Ile Asn Arg Ala Ala Leu Asp Lys Val Ala Gln
325 330 335
Met Val Arg Thr Ala Ser Gly Gln Gly Ala Gln Val Ile Thr Gly Gly
340 345 350
Ala Val Ala Asp Leu Gly Gln Gly Phe His Tyr Gln Pro Thr Val Leu
355 360 365
Ala Gly Cys Ser Ala Asp Met Glu Ile Met Arg Lys Glu Ile Phe Gly
370 375 380
Pro Val Leu Pro Ile Gln Ile Val Asp Asp Leu Asp Glu Ala Ile Ala
385 390 395 400
Leu Ser Asn Asp Ser Glu Tyr Gly Leu Thr Ser Ser Ile Tyr Thr Ala
405 410 415
Ser Leu Ser Ala Ala Met Gln Ala Thr Arg Ser Leu Asp Phe Gly Glu
420 425 430
Thr Tyr Ile Asn Arg Glu Asn Phe Glu Ala Met Gln Gly Phe His Ala
435 440 445
Gly Thr Arg Lys Ser Gly Ile Gly Gly Ala Asp Gly Lys His Gly Leu
450 455 460
Tyr Glu Tyr Thr His Thr His Val Val Tyr Ile Gln Ala
465 470 475
<210> 61
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 61
atggccaaca gaatgatctt aaatgaaaca agttatattg gagcaggagc aatagaaaat 60
atagtggcag aggctaaggt tagaggttat aaaaaggctc ttgcagttac tgatagggac 120
cttattaaat ttaatgtagc aaccaaagtt acagatcttt taaaggcaaa caatcttgct 180
tttgaaatat ttgatgaagt aaaagcaaat cccactatta atgttgtttt agctggtatt 240
gaaaaattta aggcagcagg agcagattac ttattagcta taggcggcgg ctcgagtatc 300
gatacggcaa aagcaatagg tattatagta aagaaccctg aatttagtga tgttagatct 360
cttgaaggag ttgccgatac aaaaaataaa tgtgttgata ttatagctgt acctactact 420
gctggcacag cagctgaggt aactataaac tatgtaataa cagatgaaga aaaaaagaga 480
aaatttgtct gtgttgatcc tcatgatata cctgtaatag ccgtagtaga ttcagaaatg 540
atgtcaagta tgccaaaagg actaacagca gcaacaggaa tggatgcact tacgcatgct 600
atagaaggat atataacaaa aggagcctgg gaacttacag atgcactaca tcttaaggct 660
atagaaataa ttggaagatc ccttagatca gcagttaata atgaaccaaa aggaagagaa 720
gatatggctt taggacaata cgtggcagga atgggattta gcaatgttgg tttgggaata 780
gtccatggta tggctcatcc tcttggagca ttctatgata ctcctcatgg tatagcaaat 840
gcagtactcc ttccttatgt tatggagtat aatgcagagg caacaggata caaatataga 900
gaaattgccc gtgcaatggg tgttcaaggt gtagactcaa tgagccagga tgaatacaga 960
aaagcggcta ttgatgctgt aaagaaatta agtgaagatg ttggtattcc taaggtatta 1020
aatgagattg gagtaaagga agaagattta caggctcttt ctgaatcagc atttgcagat 1080
gcttgtactc caggaaatcc tagagatact tctgttgaag aaatacttgc catatataag 1140
aaggcattca aataa 1155
<210> 62
<211> 384
<212> PRT
<213> Clostridium saccharoperbutylacetonicum
<400> 62
Met Ala Asn Arg Met Ile Leu Asn Glu Thr Ser Tyr Ile Gly Ala Gly
1 5 10 15
Ala Ile Glu Asn Ile Val Ala Glu Ala Lys Val Arg Gly Tyr Lys Lys
20 25 30
Ala Leu Ala Val Thr Asp Arg Asp Leu Ile Lys Phe Asn Val Ala Thr
35 40 45
Lys Val Thr Asp Leu Leu Lys Ala Asn Asn Leu Ala Phe Glu Ile Phe
50 55 60
Asp Glu Val Lys Ala Asn Pro Thr Ile Asn Val Val Leu Ala Gly Ile
65 70 75 80
Glu Lys Phe Lys Ala Ala Gly Ala Asp Tyr Leu Leu Ala Ile Gly Gly
85 90 95
Gly Ser Ser Ile Asp Thr Ala Lys Ala Ile Gly Ile Ile Val Lys Asn
100 105 110
Pro Glu Phe Ser Asp Val Arg Ser Leu Glu Gly Val Ala Asp Thr Lys
115 120 125
Asn Lys Cys Val Asp Ile Ile Ala Val Pro Thr Thr Ala Gly Thr Ala
130 135 140
Ala Glu Val Thr Ile Asn Tyr Val Ile Thr Asp Glu Glu Lys Lys Arg
145 150 155 160
Lys Phe Val Cys Val Asp Pro His Asp Ile Pro Val Ile Ala Val Val
165 170 175
Asp Ser Glu Met Met Ser Ser Met Pro Lys Gly Leu Thr Ala Ala Thr
180 185 190
Gly Met Asp Ala Leu Thr His Ala Ile Glu Gly Tyr Ile Thr Lys Gly
195 200 205
Ala Trp Glu Leu Thr Asp Ala Leu His Leu Lys Ala Ile Glu Ile Ile
210 215 220
Gly Arg Ser Leu Arg Ser Ala Val Asn Asn Glu Pro Lys Gly Arg Glu
225 230 235 240
Asp Met Ala Leu Gly Gln Tyr Val Ala Gly Met Gly Phe Ser Asn Val
245 250 255
Gly Leu Gly Ile Val His Gly Met Ala His Pro Leu Gly Ala Phe Tyr
260 265 270
Asp Thr Pro His Gly Ile Ala Asn Ala Val Leu Leu Pro Tyr Val Met
275 280 285
Glu Tyr Asn Ala Glu Ala Thr Gly Tyr Lys Tyr Arg Glu Ile Ala Arg
290 295 300
Ala Met Gly Val Gln Gly Val Asp Ser Met Ser Gln Asp Glu Tyr Arg
305 310 315 320
Lys Ala Ala Ile Asp Ala Val Lys Lys Leu Ser Glu Asp Val Gly Ile
325 330 335
Pro Lys Val Leu Asn Glu Ile Gly Val Lys Glu Glu Asp Leu Gln Ala
340 345 350
Leu Ser Glu Ser Ala Phe Ala Asp Ala Cys Thr Pro Gly Asn Pro Arg
355 360 365
Asp Thr Ser Val Glu Glu Ile Leu Ala Ile Tyr Lys Lys Ala Phe Lys
370 375 380
<210> 63
<211> 504
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 63
atggtatcaa gtggagtttt ttctcttcat ctcaaactta taaacagaat attatcagct 60
ttagccgtat gtaaacaaat ttcccagata tttgatttag ctatagtggc tttagctgta 120
tgtgatggcg gcataatggc tggatctcat agaataaatg gaatggaaca tcctgtaagt 180
gatttatatg atgcagttca tggtaaggga ttggctgctt taactcctat aatagttgaa 240
aaatcctgga aaagtgatat agaaaaatat gatgatataa gcaaattgat tggatgttca 300
tcagcaaaaa attgtgcaga tgctatacgg tcattccttg aaaagataaa tctaaacgta 360
acccttggtg aattaggtgt taaagaaaaa gatgtagaat ggatgtcaga aaattgcatg 420
aaagtgtcaa aaccttccat aattaatcac ccaagggaat ttactctaga agaaattaag 480
aacatttatt atgaagaatt ataa 504
<210> 64
<211> 167
<212> PRT
<213> Clostridium ljungdahlii
<400> 64
Met Val Ser Ser Gly Val Phe Ser Leu His Leu Lys Leu Ile Asn Arg
1 5 10 15
Ile Leu Ser Ala Leu Ala Val Cys Lys Gln Ile Ser Gln Ile Phe Asp
20 25 30
Leu Ala Ile Val Ala Leu Ala Val Cys Asp Gly Gly Ile Met Ala Gly
35 40 45
Ser His Arg Ile Asn Gly Met Glu His Pro Val Ser Asp Leu Tyr Asp
50 55 60
Ala Val His Gly Lys Gly Leu Ala Ala Leu Thr Pro Ile Ile Val Glu
65 70 75 80
Lys Ser Trp Lys Ser Asp Ile Glu Lys Tyr Asp Asp Ile Ser Lys Leu
85 90 95
Ile Gly Cys Ser Ser Ala Lys Asn Cys Ala Asp Ala Ile Arg Ser Phe
100 105 110
Leu Glu Lys Ile Asn Leu Asn Val Thr Leu Gly Glu Leu Gly Val Lys
115 120 125
Glu Lys Asp Val Glu Trp Met Ser Glu Asn Cys Met Lys Val Ser Lys
130 135 140
Pro Ser Ile Ile Asn His Pro Arg Glu Phe Thr Leu Glu Glu Ile Lys
145 150 155 160
Asn Ile Tyr Tyr Glu Glu Leu
165
<210> 65
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 65
atggcaaata gaatgatatt aaatgaaaca gcatggtttg gaagaggggc tgtaggtgca 60
ctaacagatg aagtaaagag aagaggatat cagaaggctt taatagtaac tgataagacg 120
cttgtacaat gtggtgtagt tgctaaagta acagataaaa tggatgctgc aggacttgca 180
tgggctattt atgatggtgt agtccctaat cctactataa ctgtagtaaa agagggcctt 240
ggagtatttc aaaattcagg tgcagattat ttgatagcta taggcggcgg ctctcctcaa 300
gatacttgta aagccattgg aataattagc aacaatcctg aatttgccga cgttagatca 360
cttgaaggat tatctcctac aaataaacca agcgtaccta tacttgcaat acctactaca 420
gcgggtactg cagctgaagt tacaataaac tatgtaatta cagacgaaga aaagagaaga 480
aaatttgtat gtgtagaccc tcatgacata cctcaagtag catttattga tgcagacatg 540
atggatggaa tgccccctgc tttaaaagca gcaactggtg tagatgcatt gacccatgct 600
atagaaggat atattactcg cggggcatgg gctttaaccg atgcactgca tataaaggct 660
atagaaataa tagctggggc attgagaggt tctgtagctg gtgacaaaga tgctggtgaa 720
gagatggcgt taggtcagta tgtagcggga atgggatttt caaatgtagg gttaggatta 780
gttcacggga tggctcatcc tttaggtgca ttctataata caccacatgg agtagctaat 840
gctatactac taccacatgt tatgagatat aatgcagatt ttaccggaga aaaatataga 900
gatatagcac gagttatggg tgtaaaagta gaaggaatga gcttagaaga ggctagaaat 960
gcagcagtag aagcagtatt tgctttaaat agagatgtag gaataccacc acatttaaga 1020
gatgttggtg taagaaaaga ggatattcca gcactggcac aggcagcatt ggatgatgta 1080
tgtacaggcg gcaatccaag agaggctaca cttgaagata tagtagagct ttatcatact 1140
gcatggtaa 1149
<210> 66
<211> 382
<212> PRT
<213> Escherichia coli
<400> 66
Met Ala Asn Arg Met Ile Leu Asn Glu Thr Ala Trp Phe Gly Arg Gly
1 5 10 15
Ala Val Gly Ala Leu Thr Asp Glu Val Lys Arg Arg Gly Tyr Gln Lys
20 25 30
Ala Leu Ile Val Thr Asp Lys Thr Leu Val Gln Cys Gly Val Val Ala
35 40 45
Lys Val Thr Asp Lys Met Asp Ala Ala Gly Leu Ala Trp Ala Ile Tyr
50 55 60
Asp Gly Val Val Pro Asn Pro Thr Ile Thr Val Val Lys Glu Gly Leu
65 70 75 80
Gly Val Phe Gln Asn Ser Gly Ala Asp Tyr Leu Ile Ala Ile Gly Gly
85 90 95
Gly Ser Pro Gln Asp Thr Cys Lys Ala Ile Gly Ile Ile Ser Asn Asn
100 105 110
Pro Glu Phe Ala Asp Val Arg Ser Leu Glu Gly Leu Ser Pro Thr Asn
115 120 125
Lys Pro Ser Val Pro Ile Leu Ala Ile Pro Thr Thr Ala Gly Thr Ala
130 135 140
Ala Glu Val Thr Ile Asn Tyr Val Ile Thr Asp Glu Glu Lys Arg Arg
145 150 155 160
Lys Phe Val Cys Val Asp Pro His Asp Ile Pro Gln Val Ala Phe Ile
165 170 175
Asp Ala Asp Met Met Asp Gly Met Pro Pro Ala Leu Lys Ala Ala Thr
180 185 190
Gly Val Asp Ala Leu Thr His Ala Ile Glu Gly Tyr Ile Thr Arg Gly
195 200 205
Ala Trp Ala Leu Thr Asp Ala Leu His Ile Lys Ala Ile Glu Ile Ile
210 215 220
Ala Gly Ala Leu Arg Gly Ser Val Ala Gly Asp Lys Asp Ala Gly Glu
225 230 235 240
Glu Met Ala Leu Gly Gln Tyr Val Ala Gly Met Gly Phe Ser Asn Val
245 250 255
Gly Leu Gly Leu Val His Gly Met Ala His Pro Leu Gly Ala Phe Tyr
260 265 270
Asn Thr Pro His Gly Val Ala Asn Ala Ile Leu Leu Pro His Val Met
275 280 285
Arg Tyr Asn Ala Asp Phe Thr Gly Glu Lys Tyr Arg Asp Ile Ala Arg
290 295 300
Val Met Gly Val Lys Val Glu Gly Met Ser Leu Glu Glu Ala Arg Asn
305 310 315 320
Ala Ala Val Glu Ala Val Phe Ala Leu Asn Arg Asp Val Gly Ile Pro
325 330 335
Pro His Leu Arg Asp Val Gly Val Arg Lys Glu Asp Ile Pro Ala Leu
340 345 350
Ala Gln Ala Ala Leu Asp Asp Val Cys Thr Gly Gly Asn Pro Arg Glu
355 360 365
Ala Thr Leu Glu Asp Ile Val Glu Leu Tyr His Thr Ala Trp
370 375 380
<210> 67
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon-adapted nucleotide sequence
<400> 67
atgacaaata gaatgatatt aaatgaaact agttatatag gtgctggagc aatagaaaac 60
atagtaacag aggcaaaaac acgaggttat aaaaaggcac ttgttgtaac agataaagaa 120
ttaattaaat ttaatgttgc cagcaaagta accaatttgt taaataaaaa tgatctaata 180
tttgagattt ttgatgaagt aaaagcaaat ccaactataa atgtagtatt agctggtata 240
gaaagattta aggcttcagg agcagattat cttatagcta taggcggcgg ctcttcaata 300
gatactgcta aagcaattgg tataataata aataatccag aatttagtga tgttagatca 360
cttgaaggtg ctgtagaaac aaaaaataaa tgtgtagata taatagcagt tccaactaca 420
gcaggcactg ctgctgaagt aactataaat tatgttataa cagatgaaga aagaaagaga 480
aaatttgtat gtgttgatcc tcatgatatt ccagttattg cagtagtaga tagtgagatg 540
atgtcaagca tgcctaaggg attaacagct gcaactggaa tggatgcttt aactcatgct 600
atagaaggat atattacaaa aggagcatgg gaactaacag atactctaca tttaaaggct 660
attgaaataa taggaagaag cttaaggtca gctgtaaata atgaacctaa aggaagagaa 720
gatatggcat taggacaata tatagcagga atgggttttt ccaatgttgg attgggaata 780
gttcattcta tggcgcaccc attgggtgct ttttatgata ctcttcacgg aatagcaaat 840
gctgtacttt taccttatgt aatggagtat aatgcagagg ctactgatga aaagtacagg 900
gaaatagcga gagtaatggg tgtagaaggt gtagataaca tgtctcaaaa agaatacaga 960
aaggctgcaa ttgatgctgt taaaaagctc tccgaagatg taggtatacc aaaggtactt 1020
aatgaaatcg gagtaaaaga agaggatctt caatctttag cagaatcagc ctttgtagat 1080
gcatgcacgc ctggtaaccc aagggatact tcagttgtag aaatactgga aatatataaa 1140
aaggcattca aataa 1155
<210> 68
<211> 384
<212> PRT
<213> Clostridium beijerinckii
<400> 68
Met Thr Asn Arg Met Ile Leu Asn Glu Thr Ser Tyr Ile Gly Ala Gly
1 5 10 15
Ala Ile Glu Asn Ile Val Thr Glu Ala Lys Thr Arg Gly Tyr Lys Lys
20 25 30
Ala Leu Val Val Thr Asp Lys Glu Leu Ile Lys Phe Asn Val Ala Ser
35 40 45
Lys Val Thr Asn Leu Leu Asn Lys Asn Asp Leu Ile Phe Glu Ile Phe
50 55 60
Asp Glu Val Lys Ala Asn Pro Thr Ile Asn Val Val Leu Ala Gly Ile
65 70 75 80
Glu Arg Phe Lys Ala Ser Gly Ala Asp Tyr Leu Ile Ala Ile Gly Gly
85 90 95
Gly Ser Ser Ile Asp Thr Ala Lys Ala Ile Gly Ile Ile Ile Asn Asn
100 105 110
Pro Glu Phe Ser Asp Val Arg Ser Leu Glu Gly Ala Val Glu Thr Lys
115 120 125
Asn Lys Cys Val Asp Ile Ile Ala Val Pro Thr Thr Ala Gly Thr Ala
130 135 140
Ala Glu Val Thr Ile Asn Tyr Val Ile Thr Asp Glu Glu Arg Lys Arg
145 150 155 160
Lys Phe Val Cys Val Asp Pro His Asp Ile Pro Val Ile Ala Val Val
165 170 175
Asp Ser Glu Met Met Ser Ser Met Pro Lys Gly Leu Thr Ala Ala Thr
180 185 190
Gly Met Asp Ala Leu Thr His Ala Ile Glu Gly Tyr Ile Thr Lys Gly
195 200 205
Ala Trp Glu Leu Thr Asp Thr Leu His Leu Lys Ala Ile Glu Ile Ile
210 215 220
Gly Arg Ser Leu Arg Ser Ala Val Asn Asn Glu Pro Lys Gly Arg Glu
225 230 235 240
Asp Met Ala Leu Gly Gln Tyr Ile Ala Gly Met Gly Phe Ser Asn Val
245 250 255
Gly Leu Gly Ile Val His Ser Met Ala His Pro Leu Gly Ala Phe Tyr
260 265 270
Asp Thr Leu His Gly Ile Ala Asn Ala Val Leu Leu Pro Tyr Val Met
275 280 285
Glu Tyr Asn Ala Glu Ala Thr Asp Glu Lys Tyr Arg Glu Ile Ala Arg
290 295 300
Val Met Gly Val Glu Gly Val Asp Asn Met Ser Gln Lys Glu Tyr Arg
305 310 315 320
Lys Ala Ala Ile Asp Ala Val Lys Lys Leu Ser Glu Asp Val Gly Ile
325 330 335
Pro Lys Val Leu Asn Glu Ile Gly Val Lys Glu Glu Asp Leu Gln Ser
340 345 350
Leu Ala Glu Ser Ala Phe Val Asp Ala Cys Thr Pro Gly Asn Pro Arg
355 360 365
Asp Thr Ser Val Val Glu Ile Leu Glu Ile Tyr Lys Lys Ala Phe Lys
370 375 380
<210> 69
<211> 37
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic oligo
<400> 69
cacaccaggt ctcaaaccat ggagatctcg aggcctg 37
<210> 70
<211> 37
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic oligo
<400> 70
cacaccaggt ctcacatatg ataagaagac tcttggc 37
<210> 71
<211> 36
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic oligo
<400> 71
cacaccaggt ctcacatatg acagcaacaa ggggcc 36
<210> 72
<211> 69
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic oligo
<400> 72
cacaccaggt ctcaattgta acacctcctt aattagttat gctctttctt ctataggtac 60
aaatttttg 69
<210> 73
<211> 41
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic oligo
<400> 73
cacaccaggt ctcacaatga aaacaagaac tcaacaaata g 41
<210> 74
<211> 62
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic oligo
<400> 74
cacaccaggt ctcagtgttc ctcctatgtg ttcttaaaat tgagattctt cagttgaacc 60
tg 62
<210> 75
<211> 62
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic oligo
<400> 75
cacaccaggt ctcagtgttc ctcctatgtg ttcttaaaat tgagattctt cagttgaacc 60
tg 62
<210> 76
<211> 50
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic oligo
<400> 76
cacaccaggt ctcaggttat gcatttagat atattgtttt tgtctgtacg 50
<210> 77
<211> 44
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic oligo
<400> 77
cacaccaggt ctcacatatg caatttaggc cttttaatcc acca 44
<210> 78
<211> 53
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic oligo
<400> 78
cacaccaggt ctcagtgttc ctcctatgtg ttcttatgct tgcgcaagtg cct 53
<210> 79
<211> 44
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic oligo
<400> 79
cacaccaggt ctcaacacat atgtcttcag tgcctgtatt ccag 44
<210> 80
<211> 42
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic oligo
<400> 80
cacaccaggt ctcaggttaa gactggagat atactgcatg ag 42
<210> 81
<211> 40
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic oligo
<400> 81
cacaccaggt ctcacatatg agaactccat ttattatgac 40
<210> 82
<211> 52
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic oligo
<400> 82
cacaccaggt ctcagtgttc ctcctatgtg ttcctaatct acaaagtgct tg 52
Claims (20)
- 가스성(gaseous) 기질로부터 에틸렌 글리콜 또는 에틸렌 글리콜 전구체를 생성할 수 있는 유전자 조작된 미생물.
- 제1항에 있어서, 상기 미생물은 5,10-메틸렌테트라하이드로폴레이트, 옥살로아세테이트, 시트레이트, 말레이트 및 글리신으로 이루어진 군으로부터 선택되는 하나 이상의 중간산물을 통해 에틸렌 글리콜 또는 에틸렌 글리콜 전구체를 생성하는, 미생물.
- 제1항에 있어서, 상기 미생물은 하기의 효소 중 하나 이상을 포함하는, 미생물:
a. 옥살로아세테이트를 시트레이트로 전환할 수 있는 이종성 효소;
b. 글리신을 글리옥실레이트로 전환할 수 있는 이종성 효소;
c. 이소-시트레이트를 글리옥실레이트로 전환할 수 있는 이종성 효소; 및
d. 글리콜레이트를 글리코알데하이드로 전환할 수 있는 이종성 효소. - 제3항에 있어서,
a. 옥살로아세테이트를 시트레이트로 전환할 수 있는 이종성 효소는 시트레이트 [Si]-신타제(synthase)[2.3.3.1], ATP 시트레이트 신타제[2.3.3.8]; 또는 시트레이트(Re)-신타제[2.3.3.3]이며;
b. 글리신을 글리옥실레이트로 전환할 수 있는 이종성 효소는 알라닌-글리옥실레이트 트랜스아미나제[2.6.1.44], 세린-글리옥실레이트 트랜스아미나제[2.6.1.45], 세린-피루베이트 트랜스아미나제[2.6.1.51], 글리신-옥살로아세테이트 트랜스아미나제[2.6.1.35], 글리신 트랜스아미나제[2.6.1.4], 글리신 데하이드로게나제[1.4.1.10], 알라닌 데하이드로게나제[1.4.1.1] 또는 글리신 데하이드로게나제[1.4.2.1]이며;
c. 이소-시트레이트를 글리옥실레이트로 전환할 수 있는 이종성 효소는 이소시트레이트 리아제(lyase)[4.1.3.1]이며; 및/또는
d. 글리콜레이트를 글리코알데하이드로 전환할 수 있는 이종성 효소는 글리콜알데하이드 데하이드로게나제[1.2.1.21], 락트알데하이드 데하이드로게나제[1.2.1.22], 숙시네이트-세미알데하이드 데하이드로게나제[1.2.1.24], 2,5-디옥소발레레이트 데하이드로게나제[1.2.1.26], 알데하이드 데하이드로게나제[1.2.1.3/4/5], 베타인-알데하이드 데하이드로게나제[1.2.1.8] 또는 알데하이드 페레독신 옥시도리덕타제[1.2.7.5]인, 미생물. - 제3항에 있어서, 하나 이상의 이종성 효소는 바실러스(Bacillus), 클로스트리디움(Clostridium), 에스케리키아(Escherichia), 글루코노박터(Gluconobacter), 하이포미크로비움(Hyphomicrobium), 리시니바실러스(Lysinibacillus), 패니바실러스(Paenibacillus), 슈도모나스(Pseudomonas), 세디멘티콜라(Sedimenticola), 스포로사르치나(Sporosarcina), 스트렙토마이세스(Streptomyces), 테르미티오바실러스(Thermithio), 테르모토가(Thermotoga) 및 제아(Zea)로 이루어진 군으로부터 선택되는 속(genus)으로부터 유래되는, 미생물.
- 제3항에 있어서, 하나 이상의 이종성 효소는 상기 미생물에서 발현을 위해 코돈-최적화되는(codon-optimized), 미생물.
- 제3항에 있어서, 상기 미생물은 아세틸-CoA를 피루베이트로 전환할 수 있는 효소; 피루베이트를 옥살로아세테이트로 전환할 수 있는 효소; 피루베이트를 말레이트로 전환할 수 있는 효소; 피루베이트를 포스페놀피루베이트로 전환할 수 있는 효소; 옥살로아세테이트를 시트릴-CoA로 전환할 수 있는 효소; 시트릴-CoA를 시트레이트로 전환할 수 있는 효소; 시트레이트를 아코니테이트로 그리고 아코니테이트를 이소-시트레이트로 전환할 수 있는 효소; 포스포에놀피루베이트를 옥살로아세테이트로 전환할 수 있는 효소; 포스포에놀피루베이트를 2-포스포-D-글리세레이트로 전환할 수 있는 효소; 2-포스포-D-글리세레이트를 3-포스포-D-글리세레이트로 전환할 수 있는 효소; 3-포스포-D-글리세레이트를 3-포스포노옥시피루베이트로 전환할 수 있는 효소; 3-포스포노옥시피루베이트를 3-포스포-L-세린으로 전환할 수 있는 효소; 3-포스포-L-세린을 세린으로 전환할 수 있는 효소; 세린을 글리신으로 전환할 수 있는 효소; 5,10-메틸렌테트라하이드로폴레이트를 글리신으로 전환할 수 있는 효소; 세린을 하이드록시피루베이트로 전환할 수 있는 효소; D-글리세레이트를 하이드록시피루베이트로 전환할 수 있는 효소; 말레이트를 글리옥실레이트로 전환할 수 있는 효소; 글리옥실레이트를 글리콜레이트로 전환할 수 있는 효소; 하이드록시피루베이트를 글리코알데하이드로 전환할 수 있는 효소; 및 글리코알데하이드를 에틸렌 글리콜로 전환할 수 있는 효소 중 하나 이상을 추가로 포함하는, 미생물.
- 제3항에 있어서, 상기 미생물은 하기의 효소를 과발현하는, 미생물:
a. 옥살로아세테이트를 시트레이트로 전환할 수 있는 이종성 효소;
b. 글리신을 글리옥실레이트로 전환할 수 있는 이종성 효소; 및/또는
c. 글리콜레이트를 글리코알데하이드로 전환할 수 있는 이종성 효소. - 제7항에 있어서, 상기 미생물은 하기의 효소를 과발현하는, 미생물:
a. 피루베이트를 옥살로아세테이트로 전환할 수 있는 효소;
b. 시트레이트를 아코니테이트로 그리고 아코니테이트를 이소-시트레이트로 전환할 수 있는 효소;
c. 포스포에놀피루베이트를 옥살로아세테이트로 전환할 수 있는 효소;
d. 세린을 글리신으로 전환할 수 있는 효소;
e. 5,10-메틸렌테트라하이드로폴레이트를 글리신으로 전환할 수 있는 효소;
f. 글리옥실레이트를 글리콜레이트로 전환할 수 있는 효소; 및/또는
g. 글리코알데하이드를 에틸렌 글리콜로 전환할 수 있는 효소. - 제1항에 있어서, 상기 미생물은 이소시트레이트 데하이드로게나제, 글리세레이트 데하이드로게나제, 글리콜레이트 데하이드로게나제, 글리세레이트 데하이드로게나제, 글리콜레이트 데하이드로게나제, 알데하이드 페레독신 옥시도리덕타제 및 알데하이드 데하이드로게나제 중 하나 이상에서 교란적 돌연변이(disruptive mutation)를 포함하는, 미생물.
- 제1항에 있어서, 상기 미생물은 아세토박테리움(Acetobacterium), 알칼리바쿨룸(Alkalibaculum), 블라우티아(Blautia), 부티리박테리움(Butyribacterium), 클로스트리디움, 유박테리움(Eubacterium), 무렐라(Moorella), 옥소박터(Oxobacter), 스포로무사(Sporomusa) 및 테르모아내로박터(Thermoanaerobacter)로 이루어진 군으로부터 선택되는 속의 구성원인, 미생물.
- 제1항에 있어서, 상기 미생물은 아세토박테리움 우디이(Acetobacterium woodii), 알칼리바쿨룸 박키이(Alkalibaculum bacchii), 블라우티아 프로덕타(Blautia producta), 부티리박테리움 메틸로트로피쿰(Butyribacterium methylotrophicum), 클로스트리디움 아세티쿰(Clostridium aceticum), 클로스트리디움 아우토에타노게눔(Clostridium autoethanogenum), 클로스트리디움 카르복시디보란스(Clostridium carboxidivorans), 클로스트리디움 코스카티이(Clostridium coskatii), 클로스트리디움 드라케이(Clostridium drakei), 클로스트리디움 포르미코아세티쿰(Clostridium formicoaceticum), 클로스트리디움 륭달리이(Clostridium ljungdahlii), 클로스트리디움 마그눔(Clostridium magnum), 클로스트리디움 라그스달레이(Clostridium ragsdalei), 클로스트리디움 스카톨로게네스(Clostridium scatologenes), 유박테리움 리모숨(Eubacterium limosum), 무렐라 테르마우토트로피카(Moorella thermautotrophica), 무렐라 테르모아세티카(Moorella thermoacetica), 옥소박터 펜니기이(Oxobacter pfennigii), 스포로무사 오바타(Sporomusa ovata), 스포로무사 실바세티카(Sporomusa silvacetica), 스포로무사 스패로이데스(Sporomusa sphaeroides) 및 테르모아내로박터 키우비(Thermoanaerobacter kiuvi)로 이루어진 군으로부터 선택되는 부모 미생물로부터 유래되는, 미생물.
- 제12항에 있어서, 상기 미생물은 클로스트리디움 아우토에타노게눔, 클로스트리디움 륭달리이 및 클로스트리디움 라그스달레이로 이루어진 군으로부터 선택되는 부모 미생물로부터 유래되는, 미생물.
- 제1항에 있어서, 상기 미생물은 본연의(native) 또는 이종성 우드-륭달(Wood-Ljungdahl) 경로를 포함하는, 미생물.
- 제1항에 있어서, 상기 에틸렌 글리콜 전구체는 글리옥실레이트 또는 글리콜레이트인, 미생물.
- 에틸렌 글리콜 또는 에틸렌 글리콜 전구체를 생성하는 방법으로서, 상기 방법은 제1항의 미생물을 영양소 배지에서 기질의 존재 하에 배양하여, 미생물이 에틸렌 글리콜 또는 에틸렌 글리콜 전구체를 생성하는 단계를 포함하는, 방법.
- 제16항에 있어서, 상기 가스성 기질은 CO, CO2 및 H2 중 하나 이상을 포함하는, 방법.
- 제16항에 있어서, 상기 에틸렌 글리콜 전구체는 글리옥실레이트 또는 글리콜레이트인, 방법.
- 제16항에 있어서, 상기 에틸렌 글리콜 또는 에틸렌 글리콜 전구체를 영양소 배지로부터 분리하는 단계를 추가로 포함하는, 방법.
- 제16항에 있어서, 상기 미생물은 에탄올, 2,3-부탄디올 및 숙시네이트 중 하나 이상을 추가로 생성하는, 방법.
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201762607446P | 2017-12-19 | 2017-12-19 | |
US62/607,446 | 2017-12-19 | ||
US201862683454P | 2018-06-11 | 2018-06-11 | |
US62/683,454 | 2018-06-11 | ||
PCT/US2018/066619 WO2019126400A1 (en) | 2017-12-19 | 2018-12-19 | Microorganisms and methods for the biological production of ethylene glycol |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20200091458A true KR20200091458A (ko) | 2020-07-30 |
Family
ID=66815024
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020207020008A KR20200091458A (ko) | 2017-12-19 | 2018-12-19 | 에틸렌 글리콜의 생물학적 생성을 위한 미생물 및 방법 |
Country Status (11)
Country | Link |
---|---|
US (2) | US11555209B2 (ko) |
EP (1) | EP3728614A4 (ko) |
JP (2) | JP7304859B2 (ko) |
KR (1) | KR20200091458A (ko) |
CN (1) | CN111936631A (ko) |
AU (2) | AU2018393075B2 (ko) |
BR (1) | BR112020008718A2 (ko) |
CA (1) | CA3079761C (ko) |
MY (1) | MY196897A (ko) |
WO (1) | WO2019126400A1 (ko) |
ZA (1) | ZA202004080B (ko) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111936631A (zh) * | 2017-12-19 | 2020-11-13 | 朗泽科技有限公司 | 用于生物产生乙二醇的微生物和方法 |
MY197373A (en) | 2019-07-11 | 2023-06-14 | Lanzatech Inc | Methods for optimizing gas utilization |
MY197253A (en) | 2020-06-06 | 2023-06-08 | Lanzatech Inc | Microorganism with knock-in at acetolactate decarboxylase gene locus |
EP4179099A1 (en) * | 2020-07-09 | 2023-05-17 | Evonik Operations GmbH | Method for the fermentative production of guanidinoacetic acid |
WO2022008280A1 (en) * | 2020-07-09 | 2022-01-13 | Evonik Operations Gmbh | Method for the fermentative production of guanidinoacetic acid |
CN114574529A (zh) * | 2020-12-01 | 2022-06-03 | 中国科学院天津工业生物技术研究所 | 一种乙醇酸在酶的作用下生成目标产物的方法 |
US11788092B2 (en) | 2021-02-08 | 2023-10-17 | Lanzatech, Inc. | Recombinant microorganisms and uses therefor |
TW202307202A (zh) * | 2021-08-06 | 2023-02-16 | 美商朗澤科技有限公司 | 用於改良乙二醇之生物產生的微生物及方法 |
Family Cites Families (38)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US2218234A (en) | 1937-12-09 | 1940-10-15 | Eastman Kodak Co | Process for the recovery of ethylene glycol from aqueous solutions |
US5593886A (en) | 1992-10-30 | 1997-01-14 | Gaddy; James L. | Clostridium stain which produces acetic acid from waste gases |
US5552023A (en) | 1993-12-15 | 1996-09-03 | Alliedsignal Inc. | Recovery of spent deicing fluid |
UA72220C2 (uk) | 1998-09-08 | 2005-02-15 | Байоенджініерінг Рісорсиз, Інк. | Незмішувана з водою суміш розчинник/співрозчинник для екстрагування оцтової кислоти, спосіб одержання оцтової кислоти (варіанти), спосіб анаеробного мікробного бродіння для одержання оцтової кислоти (варіанти), модифікований розчинник та спосіб його одержання |
NZ546496A (en) | 2006-04-07 | 2008-09-26 | Lanzatech New Zealand Ltd | Gas treatment process |
US7704723B2 (en) | 2006-08-31 | 2010-04-27 | The Board Of Regents For Oklahoma State University | Isolation and characterization of novel clostridial species |
NZ553984A (en) | 2007-03-19 | 2009-07-31 | Lanzatech New Zealand Ltd | Alcohol production process |
US20200048665A1 (en) | 2007-10-28 | 2020-02-13 | Lanzatech New Zealand Limited | Carbon capture in fermentation |
EA022710B1 (ru) | 2007-11-13 | 2016-02-29 | Ланзатек Нью Зиленд Лимитед | Штамм бактерии clostridium autoethanogenum, способный продуцировать этанол и ацетат путем анаэробной ферментации субстрата, содержащего co |
CN101978042B (zh) * | 2008-01-22 | 2017-03-08 | 基因组股份公司 | 利用合成气或其它气态碳源和甲醇的方法和有机体 |
EA018720B1 (ru) | 2008-06-09 | 2013-10-30 | Ланзатек Нью Зиленд Лимитед | Способ получения бутандиола с помощью анаэробной микробной ферментации |
DE102008044440B4 (de) | 2008-08-18 | 2011-03-03 | Lurgi Zimmer Gmbh | Verfahren und Vorrichtung zur Rückgewinnung von Ethylenglykol bei der Polyethylenterephthalatherstellung |
US8039239B2 (en) | 2008-12-16 | 2011-10-18 | Coskata, Inc. | Recombinant microorganisms having modified production of alcohols and acids |
NZ594199A (en) | 2009-01-26 | 2013-05-31 | Xyleco Inc | Method of producing ethanol |
US8445244B2 (en) * | 2010-02-23 | 2013-05-21 | Genomatica, Inc. | Methods for increasing product yields |
CN102791869B (zh) | 2010-03-10 | 2015-08-19 | 朗泽科技新西兰有限公司 | 通过发酵的酸产生 |
BR112012025846A2 (pt) * | 2010-04-13 | 2015-10-06 | Genomatica Inc | organismo microbiano de ocorrência não natural e método para produzir etilenoglicol. |
US20130217096A1 (en) | 2010-07-28 | 2013-08-22 | Lanzatech New Zealand Limited | Novel bacteria and methods of use thereof |
CN103415618A (zh) | 2010-08-19 | 2013-11-27 | 新西兰郎泽科技公司 | 使用对含一氧化碳的底物的微生物发酵来生产化学物质的方法 |
CN103282505A (zh) | 2010-08-26 | 2013-09-04 | 新西兰郎泽科技公司 | 通过发酵产生乙醇和乙烯的方法 |
US20110236941A1 (en) | 2010-10-22 | 2011-09-29 | Lanzatech New Zealand Limited | Recombinant microorganism and methods of production thereof |
US9410130B2 (en) | 2011-02-25 | 2016-08-09 | Lanzatech New Zealand Limited | Recombinant microorganisms and uses therefor |
US9914947B2 (en) * | 2011-07-27 | 2018-03-13 | Alliance For Sustainable Energy, Llc | Biological production of organic compounds |
PL2753700T3 (pl) | 2011-09-08 | 2020-07-27 | Lanzatech New Zealand Limited | Sposób fermentacji |
KR101351879B1 (ko) | 2012-02-06 | 2014-01-22 | 명지대학교 산학협력단 | 에탄―1,2―디올 생산 미생물 및 이를 이용한 에탄―1,2―디올 생산 방법 |
US8658845B2 (en) | 2012-05-23 | 2014-02-25 | Orochem Technologies, Inc. | Process and adsorbent for separating ethanol and associated oxygenates from a biofermentation system |
WO2013180581A1 (en) | 2012-05-30 | 2013-12-05 | Lanzatech New Zealand Limited | Recombinant microorganisms and uses therefor |
US20130323820A1 (en) | 2012-06-01 | 2013-12-05 | Lanzatech New Zealand Limited | Recombinant microorganisms and uses therefor |
CN104919037A (zh) * | 2012-06-08 | 2015-09-16 | 朗泽科技新西兰有限公司 | 重组微生物和其用途 |
US9347076B2 (en) | 2012-06-21 | 2016-05-24 | Lanzatech New Zealand Limited | Recombinant microorganisms that make biodiesel |
WO2014004625A1 (en) | 2012-06-26 | 2014-01-03 | Genomatica, Inc. | Microorganisms for producing ethylene glycol using synthesis gas |
KR102121888B1 (ko) | 2012-08-28 | 2020-06-12 | 란자테크 뉴질랜드 리미티드 | 재조합 미생물 및 이에 대한 용도 |
JP6862349B2 (ja) | 2014-12-08 | 2021-04-21 | ランザテク・ニュージーランド・リミテッド | 発酵経路を経由するフラックスの増大を示す組み換え微生物体 |
EA036334B9 (ru) | 2015-05-27 | 2020-12-24 | Ланцатек Нью Зилэнд Лимитед | Генетически модифицированная бактерия clostridium для получения салицилата из хоризмата |
US10294481B2 (en) * | 2015-10-02 | 2019-05-21 | Massachusetts Institute Of Technology | Microbial production of renewable glycolate |
MX2018004642A (es) | 2015-10-13 | 2019-04-15 | Lanzatech New Zealand Ltd | Bacteria diseñada por ingeniería genética que comprende una trayectoria de fermentación que genera energía. |
BR112018067323A2 (pt) | 2016-03-09 | 2019-01-29 | Braskem Sa | micro-organismos e métodos para a coprodução de etileno glicol e três compostos de carbono |
CN111936631A (zh) * | 2017-12-19 | 2020-11-13 | 朗泽科技有限公司 | 用于生物产生乙二醇的微生物和方法 |
-
2018
- 2018-12-19 CN CN201880080546.9A patent/CN111936631A/zh active Pending
- 2018-12-19 AU AU2018393075A patent/AU2018393075B2/en active Active
- 2018-12-19 BR BR112020008718-1A patent/BR112020008718A2/pt unknown
- 2018-12-19 WO PCT/US2018/066619 patent/WO2019126400A1/en unknown
- 2018-12-19 EP EP18891435.2A patent/EP3728614A4/en active Pending
- 2018-12-19 CA CA3079761A patent/CA3079761C/en active Active
- 2018-12-19 KR KR1020207020008A patent/KR20200091458A/ko not_active Application Discontinuation
- 2018-12-19 JP JP2020532013A patent/JP7304859B2/ja active Active
- 2018-12-19 US US16/226,603 patent/US11555209B2/en active Active
- 2018-12-19 MY MYPI2020003030A patent/MY196897A/en unknown
-
2020
- 2020-07-03 ZA ZA2020/04080A patent/ZA202004080B/en unknown
-
2022
- 2022-09-14 US US17/932,204 patent/US20230084118A1/en active Pending
-
2023
- 2023-06-27 JP JP2023105168A patent/JP2023123701A/ja active Pending
-
2024
- 2024-03-05 AU AU2024201435A patent/AU2024201435A1/en active Pending
Also Published As
Publication number | Publication date |
---|---|
US20190185888A1 (en) | 2019-06-20 |
EP3728614A1 (en) | 2020-10-28 |
JP7304859B2 (ja) | 2023-07-07 |
JP2023123701A (ja) | 2023-09-05 |
AU2018393075A1 (en) | 2020-07-30 |
CN111936631A (zh) | 2020-11-13 |
AU2024201435A1 (en) | 2024-03-21 |
EP3728614A4 (en) | 2021-11-24 |
JP2021506247A (ja) | 2021-02-22 |
AU2018393075B2 (en) | 2024-03-21 |
WO2019126400A1 (en) | 2019-06-27 |
US20230084118A1 (en) | 2023-03-16 |
ZA202004080B (en) | 2023-12-20 |
US11555209B2 (en) | 2023-01-17 |
CA3079761C (en) | 2023-09-19 |
BR112020008718A2 (pt) | 2020-11-24 |
CA3079761A1 (en) | 2019-06-27 |
MY196897A (en) | 2023-05-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR20200091458A (ko) | 에틸렌 글리콜의 생물학적 생성을 위한 미생물 및 방법 | |
KR101640325B1 (ko) | 재조합 미생물에 의한 일산화탄소로부터 부탄올의 제조 | |
KR102493197B1 (ko) | 발효 경로를 통해 플럭스 증가를 나타내는 재조합 미생물 | |
KR101511639B1 (ko) | 재조합 미생물 및 이의 사용 방법 | |
KR102281806B1 (ko) | 3-hp를 생산할 수 있는 재조합 효모 및 이를 이용한 3-hp의 제조방법 | |
KR102290728B1 (ko) | Nadph 의존적 효소를 포함하는 재조합 미생물 및 이의 생산방법 | |
JP2017534268A (ja) | 有用産物の生産のための改変微生物および方法 | |
CN107223152B (zh) | 具有改变的一氧化碳脱氢酶(codh)活性的遗传工程细菌 | |
KR102648170B1 (ko) | 높은 글리세린 함량을 갖는 배양 배지 상에서의 발효에 의한 개선된 1,3-프로판디올 생산을 위한 미생물 및 방법 | |
KR102208963B1 (ko) | 신규한 아크릴산 생성 경로를 갖는 미생물 및 이를 이용한 아크릴산 생산 방법 | |
KR20190097250A (ko) | 신규한 효소를 사용한 메틸글리옥살의 히드록시아세톤으로의 전환 및 그의 적용 | |
US11952607B2 (en) | Microorganisms and methods for improved biological production of ethylene glycol | |
EA042922B1 (ru) | Микроорганизмы и способы для биологического производства этиленгликоля | |
CN117693588A (zh) | 用于改进乙二醇的生物产生的微生物和方法 | |
BRPI0709758A2 (pt) | organismos termofÍlicos para conversço de biomassa lignocelulàsica em etanol | |
KR20160111936A (ko) | 부티르산에 대한 증가된 내성을 갖는 박테리아 | |
KR20230079454A (ko) | 재조합 미생물 및 이에 대한 용도 | |
TW201816109A (zh) | 包括產能醱酵路徑之經基因工程改造之細菌 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
E902 | Notification of reason for refusal |