KR20160006030A - CoA 아실화 알데히드 데히드게나제 활성이 증가된 신규한 아크릴산 생성 경로를 갖는 미생물 및 이를 이용한 아크릴산 생산 방법 - Google Patents
CoA 아실화 알데히드 데히드게나제 활성이 증가된 신규한 아크릴산 생성 경로를 갖는 미생물 및 이를 이용한 아크릴산 생산 방법 Download PDFInfo
- Publication number
- KR20160006030A KR20160006030A KR1020140085356A KR20140085356A KR20160006030A KR 20160006030 A KR20160006030 A KR 20160006030A KR 1020140085356 A KR1020140085356 A KR 1020140085356A KR 20140085356 A KR20140085356 A KR 20140085356A KR 20160006030 A KR20160006030 A KR 20160006030A
- Authority
- KR
- South Korea
- Prior art keywords
- ala
- leu
- gly
- glu
- val
- Prior art date
Links
- 244000005700 microbiome Species 0.000 title claims abstract description 68
- 230000000694 effects Effects 0.000 title claims abstract description 37
- 230000037361 pathway Effects 0.000 title claims abstract description 17
- 108020002663 Aldehyde Dehydrogenase Proteins 0.000 title claims description 31
- 102000005369 Aldehyde Dehydrogenase Human genes 0.000 title claims description 26
- 238000000034 method Methods 0.000 title claims description 23
- SMZOUWXMTYCWNB-UHFFFAOYSA-N 2-(2-methoxy-5-methylphenyl)ethanamine Chemical compound COC1=CC=C(C)C=C1CCN SMZOUWXMTYCWNB-UHFFFAOYSA-N 0.000 title abstract description 30
- NIXOWILDQLNWCW-UHFFFAOYSA-N 2-Propenoic acid Natural products OC(=O)C=C NIXOWILDQLNWCW-UHFFFAOYSA-N 0.000 title abstract description 30
- 230000015572 biosynthetic process Effects 0.000 title description 3
- 238000003786 synthesis reaction Methods 0.000 title description 2
- BERBFZCUSMQABM-IEXPHMLFSA-N 3-hydroxypropanoyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)CCO)O[C@H]1N1C2=NC=NC(N)=C2N=C1 BERBFZCUSMQABM-IEXPHMLFSA-N 0.000 claims abstract description 34
- 238000004519 manufacturing process Methods 0.000 claims abstract description 32
- AKXKFZDCRYJKTF-UHFFFAOYSA-N 3-Hydroxypropionaldehyde Chemical compound OCCC=O AKXKFZDCRYJKTF-UHFFFAOYSA-N 0.000 claims abstract description 28
- 108090000623 proteins and genes Proteins 0.000 claims description 97
- 108090000790 Enzymes Proteins 0.000 claims description 54
- NIXOWILDQLNWCW-UHFFFAOYSA-M Acrylate Chemical compound [O-]C(=O)C=C NIXOWILDQLNWCW-UHFFFAOYSA-M 0.000 claims description 50
- 102000004190 Enzymes Human genes 0.000 claims description 48
- 108090001042 Hydro-Lyases Proteins 0.000 claims description 42
- 102000004867 Hydro-Lyases Human genes 0.000 claims description 39
- 238000006243 chemical reaction Methods 0.000 claims description 32
- 108010023922 Enoyl-CoA hydratase Proteins 0.000 claims description 31
- 241000588724 Escherichia coli Species 0.000 claims description 25
- 108010025885 Glycerol dehydratase Proteins 0.000 claims description 22
- 230000014509 gene expression Effects 0.000 claims description 22
- 108091033319 polynucleotide Proteins 0.000 claims description 21
- 102000040430 polynucleotide Human genes 0.000 claims description 21
- 239000002157 polynucleotide Substances 0.000 claims description 21
- POODSGUMUCVRTR-IEXPHMLFSA-N acryloyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C=C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 POODSGUMUCVRTR-IEXPHMLFSA-N 0.000 claims description 18
- 101710088194 Dehydrogenase Proteins 0.000 claims description 11
- 108030005660 3-hydroxybutyryl-CoA dehydratases Proteins 0.000 claims description 10
- 101100407403 Citrobacter freundii pduP gene Proteins 0.000 claims description 8
- 230000003247 decreasing effect Effects 0.000 claims description 6
- 102100034767 3-hydroxyisobutyryl-CoA hydrolase, mitochondrial Human genes 0.000 claims description 5
- 241000894006 Bacteria Species 0.000 claims description 4
- 108090000604 Hydrolases Proteins 0.000 claims description 4
- 238000012258 culturing Methods 0.000 claims description 4
- 108010077268 3-hydroxyisobutyryl-CoA hydrolase Proteins 0.000 claims description 3
- 241000186146 Brevibacterium Species 0.000 claims description 2
- 241000186216 Corynebacterium Species 0.000 claims description 2
- 108030005878 Phosphinomethylmalate isomerases Proteins 0.000 claims description 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims 3
- 102100021834 3-hydroxyacyl-CoA dehydrogenase Human genes 0.000 claims 1
- IJNJLGFTSIAHEA-UHFFFAOYSA-N prop-2-ynal Chemical compound O=CC#C IJNJLGFTSIAHEA-UHFFFAOYSA-N 0.000 claims 1
- 108010061238 threonyl-glycine Proteins 0.000 description 60
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 55
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 48
- 210000004027 cell Anatomy 0.000 description 41
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 37
- 108010079364 N-glycylalanine Proteins 0.000 description 33
- 108010047495 alanylglycine Proteins 0.000 description 33
- 108010015792 glycyllysine Proteins 0.000 description 32
- 108010005233 alanylglutamic acid Proteins 0.000 description 31
- 102000011426 Enoyl-CoA hydratase Human genes 0.000 description 30
- 108010050848 glycylleucine Proteins 0.000 description 30
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 29
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 29
- 108010034529 leucyl-lysine Proteins 0.000 description 28
- 150000001413 amino acids Chemical class 0.000 description 26
- 239000013598 vector Substances 0.000 description 26
- 108020004414 DNA Proteins 0.000 description 25
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 25
- 108010006027 2-hydroxyglutaryl-CoA dehydratase Proteins 0.000 description 23
- 108010044940 alanylglutamine Proteins 0.000 description 23
- 108010049041 glutamylalanine Proteins 0.000 description 23
- 108010092854 aspartyllysine Proteins 0.000 description 21
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 21
- 108010026333 seryl-proline Proteins 0.000 description 21
- 108010073969 valyllysine Proteins 0.000 description 21
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 20
- 108010037850 glycylvaline Proteins 0.000 description 20
- 108010089804 glycyl-threonine Proteins 0.000 description 19
- 229920001184 polypeptide Polymers 0.000 description 19
- 108090000765 processed proteins & peptides Proteins 0.000 description 19
- 102000004196 processed proteins & peptides Human genes 0.000 description 19
- NBBJYMSMWIIQGU-UHFFFAOYSA-N Propionic aldehyde Chemical compound CCC=O NBBJYMSMWIIQGU-UHFFFAOYSA-N 0.000 description 18
- 108010038633 aspartylglutamate Proteins 0.000 description 18
- 108010009298 lysylglutamic acid Proteins 0.000 description 18
- 108010078274 isoleucylvaline Proteins 0.000 description 16
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 15
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 15
- 108010077245 asparaginyl-proline Proteins 0.000 description 15
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 15
- 108010054155 lysyllysine Proteins 0.000 description 15
- 108010017391 lysylvaline Proteins 0.000 description 15
- 239000002609 medium Substances 0.000 description 15
- 108010005942 methionylglycine Proteins 0.000 description 15
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 14
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 14
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 14
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 14
- 108010087924 alanylproline Proteins 0.000 description 14
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 14
- 108010090894 prolylleucine Proteins 0.000 description 14
- 102000004169 proteins and genes Human genes 0.000 description 14
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 13
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 13
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 13
- 241000880493 Leptailurus serval Species 0.000 description 13
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 13
- HDNOQCZWJGGHSS-VEVYYDQMSA-N Met-Asn-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HDNOQCZWJGGHSS-VEVYYDQMSA-N 0.000 description 13
- UBDDORVPVLEECX-FJXKBIBVSA-N Thr-Gly-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UBDDORVPVLEECX-FJXKBIBVSA-N 0.000 description 13
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 13
- 108010047857 aspartylglycine Proteins 0.000 description 13
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 13
- 108010077515 glycylproline Proteins 0.000 description 13
- 108010057821 leucylproline Proteins 0.000 description 13
- 108010064235 lysylglycine Proteins 0.000 description 13
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 12
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 12
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 12
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 12
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 12
- 108010078144 glutaminyl-glycine Proteins 0.000 description 12
- 108010079547 glutamylmethionine Proteins 0.000 description 12
- 108010003700 lysyl aspartic acid Proteins 0.000 description 12
- 108010029020 prolylglycine Proteins 0.000 description 12
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 11
- VWEWCZSUWOEEFM-WDSKDSINSA-N Ala-Gly-Ala-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(=O)NCC(O)=O VWEWCZSUWOEEFM-WDSKDSINSA-N 0.000 description 11
- KJGNDQCYBNBXDA-GUBZILKMSA-N Arg-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N KJGNDQCYBNBXDA-GUBZILKMSA-N 0.000 description 11
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 11
- VCJCPARXDBEGNE-GUBZILKMSA-N Asn-Pro-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 VCJCPARXDBEGNE-GUBZILKMSA-N 0.000 description 11
- XDGBFDYXZCMYEX-NUMRIWBASA-N Asp-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)O XDGBFDYXZCMYEX-NUMRIWBASA-N 0.000 description 11
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 11
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 11
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 11
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 11
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 11
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 11
- 108010008355 arginyl-glutamine Proteins 0.000 description 11
- 108010093581 aspartyl-proline Proteins 0.000 description 11
- 108010068265 aspartyltyrosine Proteins 0.000 description 11
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 11
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 11
- ALRHLSYJTWAHJZ-UHFFFAOYSA-N 3-hydroxypropionic acid Chemical compound OCCC(O)=O ALRHLSYJTWAHJZ-UHFFFAOYSA-N 0.000 description 10
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 10
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 10
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 10
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 10
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 10
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 10
- LIXWIUAORXJNBH-QWRGUYRKSA-N Gly-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN LIXWIUAORXJNBH-QWRGUYRKSA-N 0.000 description 10
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 10
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 10
- BRQKGRLDDDQWQJ-MBLNEYKQSA-N His-Thr-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O BRQKGRLDDDQWQJ-MBLNEYKQSA-N 0.000 description 10
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 10
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 10
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 10
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 10
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 10
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 10
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 10
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 10
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 10
- 108010068380 arginylarginine Proteins 0.000 description 10
- 230000001419 dependent effect Effects 0.000 description 10
- 108010073093 leucyl-glycyl-glycyl-glycine Proteins 0.000 description 10
- 108010053725 prolylvaline Proteins 0.000 description 10
- 108010005834 tyrosyl-alanyl-glycine Proteins 0.000 description 10
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 9
- STACJSVFHSEZJV-GHCJXIJMSA-N Ala-Asn-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STACJSVFHSEZJV-GHCJXIJMSA-N 0.000 description 9
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 9
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 9
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 9
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 9
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 9
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 9
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 9
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 9
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 9
- GNRPTBRHRRZCMA-RWMBFGLXSA-N Leu-Met-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N GNRPTBRHRRZCMA-RWMBFGLXSA-N 0.000 description 9
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 9
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 9
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 9
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 9
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 9
- UGTZYIPOBYXWRW-SRVKXCTJSA-N Ser-Phe-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O UGTZYIPOBYXWRW-SRVKXCTJSA-N 0.000 description 9
- DLZKEQQWXODGGZ-KWQFWETISA-N Tyr-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DLZKEQQWXODGGZ-KWQFWETISA-N 0.000 description 9
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 9
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 9
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 description 9
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 9
- 108010070944 alanylhistidine Proteins 0.000 description 9
- 108010062796 arginyllysine Proteins 0.000 description 9
- 108010081551 glycylphenylalanine Proteins 0.000 description 9
- 108010028295 histidylhistidine Proteins 0.000 description 9
- 108010027338 isoleucylcysteine Proteins 0.000 description 9
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 9
- 108010038320 lysylphenylalanine Proteins 0.000 description 9
- 108010051242 phenylalanylserine Proteins 0.000 description 9
- 108010031719 prolyl-serine Proteins 0.000 description 9
- 108010051110 tyrosyl-lysine Proteins 0.000 description 9
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 8
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 8
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 8
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 8
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 8
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 8
- GBSUGIXJAAKZOW-GMOBBJLQSA-N Asp-Ile-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GBSUGIXJAAKZOW-GMOBBJLQSA-N 0.000 description 8
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 8
- 241000193469 Clostridium pasteurianum Species 0.000 description 8
- WMOMPXKOKASNBK-PEFMBERDSA-N Gln-Asn-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WMOMPXKOKASNBK-PEFMBERDSA-N 0.000 description 8
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 8
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 8
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 8
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 8
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 8
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 8
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 8
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 8
- NUKXXNFEUZGPRO-BJDJZHNGSA-N Ile-Leu-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NUKXXNFEUZGPRO-BJDJZHNGSA-N 0.000 description 8
- XQLGNKLSPYCRMZ-HJWJTTGWSA-N Ile-Phe-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)O)N XQLGNKLSPYCRMZ-HJWJTTGWSA-N 0.000 description 8
- 241000411974 Ilyobacter polytropus Species 0.000 description 8
- 108010065920 Insulin Lispro Proteins 0.000 description 8
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 8
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 8
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 8
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 8
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 8
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 8
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 8
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 8
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 8
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 8
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 8
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 8
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 8
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 8
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 8
- 108010013835 arginine glutamate Proteins 0.000 description 8
- ZTQSAGDEMFDKMZ-UHFFFAOYSA-N butyric aldehyde Natural products CCCC=O ZTQSAGDEMFDKMZ-UHFFFAOYSA-N 0.000 description 8
- -1 for example Proteins 0.000 description 8
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 8
- 239000008103 glucose Substances 0.000 description 8
- 108010010147 glycylglutamine Proteins 0.000 description 8
- 108010085325 histidylproline Proteins 0.000 description 8
- 108010070643 prolylglutamic acid Proteins 0.000 description 8
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 7
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 7
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 7
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 7
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 7
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 7
- PVQLRJRPUTXFFX-CIUDSAMLSA-N Ala-Met-Gln Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O PVQLRJRPUTXFFX-CIUDSAMLSA-N 0.000 description 7
- SGAUXNZEFIEAAI-GARJFASQSA-N Asn-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N)C(=O)O SGAUXNZEFIEAAI-GARJFASQSA-N 0.000 description 7
- HCZQKHSRYHCPSD-IUKAMOBKSA-N Asn-Thr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HCZQKHSRYHCPSD-IUKAMOBKSA-N 0.000 description 7
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 7
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 7
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 7
- MCGNJCNXIMQCMN-DCAQKATOSA-N Glu-Met-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCC(O)=O MCGNJCNXIMQCMN-DCAQKATOSA-N 0.000 description 7
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 7
- QPCVIQJVRGXUSA-LURJTMIESA-N Gly-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QPCVIQJVRGXUSA-LURJTMIESA-N 0.000 description 7
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 7
- FCKPEGOCSVZPNC-WHOFXGATSA-N Gly-Ile-Phe Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FCKPEGOCSVZPNC-WHOFXGATSA-N 0.000 description 7
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 7
- CNHSMSFYVARZLI-YJRXYDGGSA-N His-His-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CNHSMSFYVARZLI-YJRXYDGGSA-N 0.000 description 7
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 7
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 7
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 7
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 7
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 7
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 7
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 description 7
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 7
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 7
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 7
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 7
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 7
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 7
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 7
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 7
- HHCOOFPGNXKFGR-HJGDQZAQSA-N Met-Gln-Thr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HHCOOFPGNXKFGR-HJGDQZAQSA-N 0.000 description 7
- HLQWFLJOJRFXHO-CIUDSAMLSA-N Met-Glu-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O HLQWFLJOJRFXHO-CIUDSAMLSA-N 0.000 description 7
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 7
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 7
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 7
- HNFUGJUZJRYUHN-JSGCOSHPSA-N Phe-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HNFUGJUZJRYUHN-JSGCOSHPSA-N 0.000 description 7
- LRBSWBVUCLLRLU-BZSNNMDCSA-N Phe-Leu-Lys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(O)=O LRBSWBVUCLLRLU-BZSNNMDCSA-N 0.000 description 7
- APMXLWHMIVWLLR-BZSNNMDCSA-N Phe-Tyr-Ser Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(O)=O)C1=CC=CC=C1 APMXLWHMIVWLLR-BZSNNMDCSA-N 0.000 description 7
- RGMLUHANLDVMPB-ULQDDVLXSA-N Phe-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RGMLUHANLDVMPB-ULQDDVLXSA-N 0.000 description 7
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 7
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 7
- LXWZOMSOUAMOIA-JIOCBJNQSA-N Thr-Asn-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O LXWZOMSOUAMOIA-JIOCBJNQSA-N 0.000 description 7
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 7
- QJIODPFLAASXJC-JHYOHUSXSA-N Thr-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O QJIODPFLAASXJC-JHYOHUSXSA-N 0.000 description 7
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 7
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 7
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 7
- 108010036533 arginylvaline Proteins 0.000 description 7
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 7
- 108010060199 cysteinylproline Proteins 0.000 description 7
- 108010040030 histidinoalanine Proteins 0.000 description 7
- 108010036413 histidylglycine Proteins 0.000 description 7
- 239000002773 nucleotide Substances 0.000 description 7
- 125000003729 nucleotide group Chemical group 0.000 description 7
- 230000001105 regulatory effect Effects 0.000 description 7
- XWTNPSHCJMZAHQ-QMMMGPOBSA-N 2-[[2-[[2-[[(2s)-2-amino-4-methylpentanoyl]amino]acetyl]amino]acetyl]amino]acetic acid Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(=O)NCC(O)=O XWTNPSHCJMZAHQ-QMMMGPOBSA-N 0.000 description 6
- 108030005924 3-hydroxypropionyl-CoA dehydratases Proteins 0.000 description 6
- 108010035023 4-hydroxybutyryl-CoA dehydratase Proteins 0.000 description 6
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 6
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 6
- SSSROGPPPVTHLX-FXQIFTODSA-N Ala-Arg-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSROGPPPVTHLX-FXQIFTODSA-N 0.000 description 6
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 6
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 6
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 6
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 6
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 6
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 6
- IRRMIGDCPOPZJW-ULQDDVLXSA-N Arg-His-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IRRMIGDCPOPZJW-ULQDDVLXSA-N 0.000 description 6
- CTAPSNCVKPOOSM-KKUMJFAQSA-N Arg-Tyr-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O CTAPSNCVKPOOSM-KKUMJFAQSA-N 0.000 description 6
- JJGRJMKUOYXZRA-LPEHRKFASA-N Asn-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O JJGRJMKUOYXZRA-LPEHRKFASA-N 0.000 description 6
- JQSWHKKUZMTOIH-QWRGUYRKSA-N Asn-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N JQSWHKKUZMTOIH-QWRGUYRKSA-N 0.000 description 6
- GMUOCGCDOYYWPD-FXQIFTODSA-N Asn-Pro-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O GMUOCGCDOYYWPD-FXQIFTODSA-N 0.000 description 6
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 6
- PSLSTUMPZILTAH-BYULHYEWSA-N Asp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PSLSTUMPZILTAH-BYULHYEWSA-N 0.000 description 6
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 6
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 6
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 6
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 6
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 6
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 6
- 101001028272 Escherichia coli (strain K12) Long-chain acyl-CoA thioesterase FadM Proteins 0.000 description 6
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 6
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 6
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 6
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 6
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 6
- PLCAEMGSYOYIPP-GUBZILKMSA-N His-Ser-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 PLCAEMGSYOYIPP-GUBZILKMSA-N 0.000 description 6
- QTUSJASXLGLJSR-OSUNSFLBSA-N Ile-Arg-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N QTUSJASXLGLJSR-OSUNSFLBSA-N 0.000 description 6
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 6
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 6
- LWWILHPVAKKLQS-QXEWZRGKSA-N Ile-Gly-Met Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N LWWILHPVAKKLQS-QXEWZRGKSA-N 0.000 description 6
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 6
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 6
- 102000003855 L-lactate dehydrogenase Human genes 0.000 description 6
- 108700023483 L-lactate dehydrogenases Proteins 0.000 description 6
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 6
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 6
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 6
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 6
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 6
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 6
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 6
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 6
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 6
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 6
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 description 6
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 6
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 6
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 6
- BDFHWFUAQLIMJO-KXNHARMFSA-N Lys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N)O BDFHWFUAQLIMJO-KXNHARMFSA-N 0.000 description 6
- QQPMHUCGDRJFQK-RHYQMDGZSA-N Met-Thr-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QQPMHUCGDRJFQK-RHYQMDGZSA-N 0.000 description 6
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 6
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 6
- 238000012408 PCR amplification Methods 0.000 description 6
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 6
- ZUZINZIJHJFJRN-UBHSHLNASA-N Pro-Phe-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 ZUZINZIJHJFJRN-UBHSHLNASA-N 0.000 description 6
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 6
- LZHHZYDPMZEMRX-STQMWFEESA-N Pro-Tyr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O LZHHZYDPMZEMRX-STQMWFEESA-N 0.000 description 6
- 108010065027 Propanediol Dehydratase Proteins 0.000 description 6
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 6
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 6
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 6
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 6
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 6
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 6
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 6
- LHNNQVXITHUCAB-QTKMDUPCSA-N Thr-Met-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O LHNNQVXITHUCAB-QTKMDUPCSA-N 0.000 description 6
- WKCFCVBOFKEVKY-HSCHXYMDSA-N Trp-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WKCFCVBOFKEVKY-HSCHXYMDSA-N 0.000 description 6
- JLKVWTICWVWGSK-JYJNAYRXSA-N Tyr-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JLKVWTICWVWGSK-JYJNAYRXSA-N 0.000 description 6
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 6
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 6
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 6
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 6
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 6
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 6
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 6
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 6
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 6
- 230000003321 amplification Effects 0.000 description 6
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 6
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 6
- 108010004073 cysteinylcysteine Proteins 0.000 description 6
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 6
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 6
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 6
- 108010025306 histidylleucine Proteins 0.000 description 6
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 6
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 6
- 108010012058 leucyltyrosine Proteins 0.000 description 6
- 238000003199 nucleic acid amplification method Methods 0.000 description 6
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 6
- 108010012581 phenylalanylglutamate Proteins 0.000 description 6
- 108010048818 seryl-histidine Proteins 0.000 description 6
- 108010071207 serylmethionine Proteins 0.000 description 6
- 108010027345 wheylin-1 peptide Proteins 0.000 description 6
- 101100070612 Acidaminococcus fermentans (strain ATCC 25085 / DSM 20731 / CCUG 9996 / CIP 106432 / VR4) hgdA gene Proteins 0.000 description 5
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 5
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 5
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 5
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 5
- VBRDBGCROKWTPV-XHNCKOQMSA-N Ala-Glu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N VBRDBGCROKWTPV-XHNCKOQMSA-N 0.000 description 5
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 5
- XCZXVTHYGSMQGH-NAKRPEOUSA-N Ala-Ile-Met Chemical compound C[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C([O-])=O XCZXVTHYGSMQGH-NAKRPEOUSA-N 0.000 description 5
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 5
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 5
- OMDNCNKNEGFOMM-BQBZGAKWSA-N Ala-Met-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O OMDNCNKNEGFOMM-BQBZGAKWSA-N 0.000 description 5
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 5
- DDPKBJZLAXLQGZ-KBIXCLLPSA-N Ala-Val-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DDPKBJZLAXLQGZ-KBIXCLLPSA-N 0.000 description 5
- BOKLLPVAQDSLHC-FXQIFTODSA-N Ala-Val-Cys Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N BOKLLPVAQDSLHC-FXQIFTODSA-N 0.000 description 5
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 5
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 5
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 5
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 5
- FXGMURPOWCKNAZ-JYJNAYRXSA-N Arg-Val-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FXGMURPOWCKNAZ-JYJNAYRXSA-N 0.000 description 5
- ZWASIOHRQWRWAS-UGYAYLCHSA-N Asn-Asp-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZWASIOHRQWRWAS-UGYAYLCHSA-N 0.000 description 5
- QISZHYWZHJRDAO-CIUDSAMLSA-N Asn-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N QISZHYWZHJRDAO-CIUDSAMLSA-N 0.000 description 5
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 5
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 5
- NLDNNZKUSLAYFW-NHCYSSNCSA-N Asn-Lys-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLDNNZKUSLAYFW-NHCYSSNCSA-N 0.000 description 5
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 5
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 5
- TZOZNVLBTAFJRW-UGYAYLCHSA-N Asp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N TZOZNVLBTAFJRW-UGYAYLCHSA-N 0.000 description 5
- WOPJVEMFXYHZEE-SRVKXCTJSA-N Asp-Phe-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WOPJVEMFXYHZEE-SRVKXCTJSA-N 0.000 description 5
- 101100098786 Bacillus subtilis (strain 168) tapA gene Proteins 0.000 description 5
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 5
- 241000193163 Clostridioides difficile Species 0.000 description 5
- NQSUTVRXXBGVDQ-LKXGYXEUSA-N Cys-Asn-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NQSUTVRXXBGVDQ-LKXGYXEUSA-N 0.000 description 5
- IIGHQOPGMGKDMT-SRVKXCTJSA-N Cys-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N IIGHQOPGMGKDMT-SRVKXCTJSA-N 0.000 description 5
- 101100321116 Escherichia coli (strain K12) yqhD gene Proteins 0.000 description 5
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 5
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 5
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 5
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 5
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 5
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 5
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 5
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 5
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 5
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 5
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 5
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 5
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 5
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 5
- IDOGEHIWMJMAHT-BYPYZUCNSA-N Gly-Gly-Cys Chemical compound NCC(=O)NCC(=O)N[C@@H](CS)C(O)=O IDOGEHIWMJMAHT-BYPYZUCNSA-N 0.000 description 5
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 5
- HHRODZSXDXMUHS-LURJTMIESA-N Gly-Met-Gly Chemical compound CSCC[C@H](NC(=O)C[NH3+])C(=O)NCC([O-])=O HHRODZSXDXMUHS-LURJTMIESA-N 0.000 description 5
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 5
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 5
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 5
- WUEIUSDAECDLQO-NAKRPEOUSA-N Ile-Ala-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)O)N WUEIUSDAECDLQO-NAKRPEOUSA-N 0.000 description 5
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 5
- SVBAHOMTJRFSIC-SXTJYALSSA-N Ile-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVBAHOMTJRFSIC-SXTJYALSSA-N 0.000 description 5
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 5
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 5
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 5
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 5
- 241000588749 Klebsiella oxytoca Species 0.000 description 5
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 5
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 5
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 5
- SGIIOQQGLUUMDQ-IHRRRGAJSA-N Leu-His-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N SGIIOQQGLUUMDQ-IHRRRGAJSA-N 0.000 description 5
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 5
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 5
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 5
- KQFZKDITNUEVFJ-JYJNAYRXSA-N Leu-Phe-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=CC=C1 KQFZKDITNUEVFJ-JYJNAYRXSA-N 0.000 description 5
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 5
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 5
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 5
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 5
- VZBXCMCHIHEPBL-SRVKXCTJSA-N Met-Glu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN VZBXCMCHIHEPBL-SRVKXCTJSA-N 0.000 description 5
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 5
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 5
- FGWUALWGCZJQDJ-URLPEUOOSA-N Phe-Thr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGWUALWGCZJQDJ-URLPEUOOSA-N 0.000 description 5
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 5
- VWXGFAIZUQBBBG-UWVGGRQHSA-N Pro-His-Gly Chemical compound C([C@@H](C(=O)NCC(=O)[O-])NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 VWXGFAIZUQBBBG-UWVGGRQHSA-N 0.000 description 5
- 108010025216 RVF peptide Proteins 0.000 description 5
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 5
- RNFKSBPHLTZHLU-WHFBIAKZSA-N Ser-Cys-Gly Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N)O RNFKSBPHLTZHLU-WHFBIAKZSA-N 0.000 description 5
- QUGRFWPMPVIAPW-IHRRRGAJSA-N Ser-Pro-Phe Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QUGRFWPMPVIAPW-IHRRRGAJSA-N 0.000 description 5
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 5
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 5
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 5
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 5
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 5
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 5
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 5
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 5
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 5
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 5
- UDLYXGYWTVOIKU-QXEWZRGKSA-N Val-Asn-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UDLYXGYWTVOIKU-QXEWZRGKSA-N 0.000 description 5
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 5
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 5
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 5
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 5
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 5
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 5
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 5
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 5
- 108010041407 alanylaspartic acid Proteins 0.000 description 5
- 108010011559 alanylphenylalanine Proteins 0.000 description 5
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 5
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 5
- 229910052799 carbon Inorganic materials 0.000 description 5
- 108010054813 diprotin B Proteins 0.000 description 5
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 5
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 5
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 5
- 108010087823 glycyltyrosine Proteins 0.000 description 5
- 239000001963 growth medium Substances 0.000 description 5
- 238000004128 high performance liquid chromatography Methods 0.000 description 5
- 108010018006 histidylserine Proteins 0.000 description 5
- 108010056582 methionylglutamic acid Proteins 0.000 description 5
- 108010084572 phenylalanyl-valine Proteins 0.000 description 5
- 108010009962 valyltyrosine Proteins 0.000 description 5
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 4
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 4
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 4
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 4
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 4
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 4
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 4
- XSTZMVAYYCJTNR-DCAQKATOSA-N Ala-Met-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XSTZMVAYYCJTNR-DCAQKATOSA-N 0.000 description 4
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 4
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 4
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 4
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 4
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 4
- ZDILXFDENZVOTL-BPNCWPANSA-N Ala-Val-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZDILXFDENZVOTL-BPNCWPANSA-N 0.000 description 4
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 4
- DNUKXVMPARLPFN-XUXIUFHCSA-N Arg-Leu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DNUKXVMPARLPFN-XUXIUFHCSA-N 0.000 description 4
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 4
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 4
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 4
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 4
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 4
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 4
- NYQHSUGFEWDWPD-ACZMJKKPSA-N Asp-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N NYQHSUGFEWDWPD-ACZMJKKPSA-N 0.000 description 4
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 4
- JOCQXVJCTCEFAZ-CIUDSAMLSA-N Asp-His-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O JOCQXVJCTCEFAZ-CIUDSAMLSA-N 0.000 description 4
- SEMWSADZTMJELF-BYULHYEWSA-N Asp-Ile-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O SEMWSADZTMJELF-BYULHYEWSA-N 0.000 description 4
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 4
- HJCGDIGVVWETRO-ZPFDUUQYSA-N Asp-Lys-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O)C(O)=O HJCGDIGVVWETRO-ZPFDUUQYSA-N 0.000 description 4
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 4
- 101100224392 Bacillus subtilis (strain 168) dpaA gene Proteins 0.000 description 4
- 241000588919 Citrobacter freundii Species 0.000 description 4
- 101100277683 Citrobacter freundii dhaB gene Proteins 0.000 description 4
- 101100506210 Clostridioides difficile hadC gene Proteins 0.000 description 4
- 241000193470 Clostridium sporogenes Species 0.000 description 4
- UPURLDIGQGTUPJ-ZKWXMUAHSA-N Cys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N UPURLDIGQGTUPJ-ZKWXMUAHSA-N 0.000 description 4
- 241001646716 Escherichia coli K-12 Species 0.000 description 4
- MINZLORERLNSPP-ACZMJKKPSA-N Gln-Asn-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N MINZLORERLNSPP-ACZMJKKPSA-N 0.000 description 4
- GHYJGDCPHMSFEJ-GUBZILKMSA-N Gln-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N GHYJGDCPHMSFEJ-GUBZILKMSA-N 0.000 description 4
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 4
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 4
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 4
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 4
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 4
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 4
- LVCHEMOPBORRLB-DCAQKATOSA-N Glu-Gln-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O LVCHEMOPBORRLB-DCAQKATOSA-N 0.000 description 4
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 4
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 4
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 4
- KRRFFAHEAOCBCQ-SIUGBPQLSA-N Glu-Ile-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KRRFFAHEAOCBCQ-SIUGBPQLSA-N 0.000 description 4
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 4
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 4
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 4
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 4
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 4
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 4
- BULIVUZUDBHKKZ-WDSKDSINSA-N Gly-Gln-Asn Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BULIVUZUDBHKKZ-WDSKDSINSA-N 0.000 description 4
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 4
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 4
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 4
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 4
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 4
- FXLVSYVJDPCIHH-STQMWFEESA-N Gly-Phe-Arg Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FXLVSYVJDPCIHH-STQMWFEESA-N 0.000 description 4
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 4
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 4
- UIQGJYUEQDOODF-KWQFWETISA-N Gly-Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 UIQGJYUEQDOODF-KWQFWETISA-N 0.000 description 4
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 4
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 4
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 4
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 4
- UPGJWSUYENXOPV-HGNGGELXSA-N His-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N UPGJWSUYENXOPV-HGNGGELXSA-N 0.000 description 4
- VJJSDSNFXCWCEJ-DJFWLOJKSA-N His-Ile-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O VJJSDSNFXCWCEJ-DJFWLOJKSA-N 0.000 description 4
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 4
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 4
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 4
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 4
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 4
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 4
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 4
- NGKPIPCGMLWHBX-WZLNRYEVSA-N Ile-Tyr-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NGKPIPCGMLWHBX-WZLNRYEVSA-N 0.000 description 4
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 4
- 241001180676 Lachnoanaerobaculum saburreum Species 0.000 description 4
- 241000535428 Lactobacillus reuteri DSM 20016 Species 0.000 description 4
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 4
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 4
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 4
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 4
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 4
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 4
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 4
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 4
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 4
- WUHBLPVELFTPQK-KKUMJFAQSA-N Leu-Tyr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O WUHBLPVELFTPQK-KKUMJFAQSA-N 0.000 description 4
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 4
- ZQCVMVCVPFYXHZ-SRVKXCTJSA-N Lys-Asn-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN ZQCVMVCVPFYXHZ-SRVKXCTJSA-N 0.000 description 4
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 4
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 4
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 4
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 4
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 4
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 4
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 4
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 4
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 4
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 4
- MIMXMVDLMDMOJD-BZSNNMDCSA-N Lys-Tyr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O MIMXMVDLMDMOJD-BZSNNMDCSA-N 0.000 description 4
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 4
- WWWGMQHQSAUXBU-BQBZGAKWSA-N Met-Gly-Asn Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O WWWGMQHQSAUXBU-BQBZGAKWSA-N 0.000 description 4
- 241000684246 Peptostreptococcus stomatis Species 0.000 description 4
- CYZBFPYMSJGBRL-DRZSPHRISA-N Phe-Ala-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CYZBFPYMSJGBRL-DRZSPHRISA-N 0.000 description 4
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 description 4
- MGBRZXXGQBAULP-DRZSPHRISA-N Phe-Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGBRZXXGQBAULP-DRZSPHRISA-N 0.000 description 4
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 4
- HWLKHNDRXWTFTN-GUBZILKMSA-N Pro-Pro-Cys Chemical compound C1C[C@H](NC1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CS)C(=O)O HWLKHNDRXWTFTN-GUBZILKMSA-N 0.000 description 4
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 4
- FIXILCYTSAUERA-FXQIFTODSA-N Ser-Ala-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIXILCYTSAUERA-FXQIFTODSA-N 0.000 description 4
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 4
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 4
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 4
- PVDTYLHUWAEYGY-CIUDSAMLSA-N Ser-Glu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PVDTYLHUWAEYGY-CIUDSAMLSA-N 0.000 description 4
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 4
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 4
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 4
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 4
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 4
- 241000160715 Sulfolobus tokodaii Species 0.000 description 4
- MQBTXMPQNCGSSZ-OSUNSFLBSA-N Thr-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N MQBTXMPQNCGSSZ-OSUNSFLBSA-N 0.000 description 4
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 4
- ZQUKYJOKQBRBCS-GLLZPBPUSA-N Thr-Gln-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O ZQUKYJOKQBRBCS-GLLZPBPUSA-N 0.000 description 4
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 4
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 4
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 4
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 4
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 4
- PJWCWGXAVIVXQC-STECZYCISA-N Tyr-Ile-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PJWCWGXAVIVXQC-STECZYCISA-N 0.000 description 4
- ZZDYJFVIKVSUFA-WLTAIBSBSA-N Tyr-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ZZDYJFVIKVSUFA-WLTAIBSBSA-N 0.000 description 4
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 4
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 4
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 4
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 4
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 4
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 4
- 239000003242 anti bacterial agent Substances 0.000 description 4
- 229940088710 antibiotic agent Drugs 0.000 description 4
- 108010016616 cysteinylglycine Proteins 0.000 description 4
- 108010069495 cysteinyltyrosine Proteins 0.000 description 4
- 238000004520 electroporation Methods 0.000 description 4
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 4
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 4
- 108010000761 leucylarginine Proteins 0.000 description 4
- 108010091871 leucylmethionine Proteins 0.000 description 4
- 108010085203 methionylmethionine Proteins 0.000 description 4
- 229910052757 nitrogen Inorganic materials 0.000 description 4
- 108010025488 pinealon Proteins 0.000 description 4
- 108010015796 prolylisoleucine Proteins 0.000 description 4
- 230000002829 reductive effect Effects 0.000 description 4
- 108700004896 tripeptide FEG Proteins 0.000 description 4
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 4
- 101150103853 yciA gene Proteins 0.000 description 4
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 3
- 101100070613 Acidaminococcus fermentans (strain ATCC 25085 / DSM 20731 / CCUG 9996 / CIP 106432 / VR4) hgdB gene Proteins 0.000 description 3
- 101100070614 Acidaminococcus fermentans (strain ATCC 25085 / DSM 20731 / CCUG 9996 / CIP 106432 / VR4) hgdC gene Proteins 0.000 description 3
- PIPTUBPKYFRLCP-NHCYSSNCSA-N Ala-Ala-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PIPTUBPKYFRLCP-NHCYSSNCSA-N 0.000 description 3
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 3
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 3
- FOWHQTWRLFTELJ-FXQIFTODSA-N Ala-Asp-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N FOWHQTWRLFTELJ-FXQIFTODSA-N 0.000 description 3
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 3
- DAEFQZCYZKRTLR-ZLUOBGJFSA-N Ala-Cys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O DAEFQZCYZKRTLR-ZLUOBGJFSA-N 0.000 description 3
- ZPXCNXMJEZKRLU-LSJOCFKGSA-N Ala-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 ZPXCNXMJEZKRLU-LSJOCFKGSA-N 0.000 description 3
- HUUOZYZWNCXTFK-INTQDDNPSA-N Ala-His-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N HUUOZYZWNCXTFK-INTQDDNPSA-N 0.000 description 3
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 3
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 3
- PEIBBAXIKUAYGN-UBHSHLNASA-N Ala-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 PEIBBAXIKUAYGN-UBHSHLNASA-N 0.000 description 3
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 3
- CNQAFFMNJIQYGX-DRZSPHRISA-N Ala-Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 CNQAFFMNJIQYGX-DRZSPHRISA-N 0.000 description 3
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 3
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 3
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 3
- HCBKAOZYACJUEF-XQXXSGGOSA-N Ala-Thr-Gln Chemical compound N[C@@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(N)=O)C(=O)O HCBKAOZYACJUEF-XQXXSGGOSA-N 0.000 description 3
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 3
- CREYEAPXISDKSB-FQPOAREZSA-N Ala-Thr-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CREYEAPXISDKSB-FQPOAREZSA-N 0.000 description 3
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 3
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 3
- LYILPUNCKACNGF-NAKRPEOUSA-N Ala-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)N LYILPUNCKACNGF-NAKRPEOUSA-N 0.000 description 3
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 3
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 3
- RWDVGVPHEWOZMO-GUBZILKMSA-N Arg-Cys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCCNC(N)=N)C(O)=O RWDVGVPHEWOZMO-GUBZILKMSA-N 0.000 description 3
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 3
- RFXXUWGNVRJTNQ-QXEWZRGKSA-N Arg-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N RFXXUWGNVRJTNQ-QXEWZRGKSA-N 0.000 description 3
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 3
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 3
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 3
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 3
- MOGMYRUNTKYZFB-UNQGMJICSA-N Arg-Thr-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MOGMYRUNTKYZFB-UNQGMJICSA-N 0.000 description 3
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 3
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 3
- DQTIWTULBGLJBL-DCAQKATOSA-N Asn-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N DQTIWTULBGLJBL-DCAQKATOSA-N 0.000 description 3
- POOCJCRBHHMAOS-FXQIFTODSA-N Asn-Arg-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O POOCJCRBHHMAOS-FXQIFTODSA-N 0.000 description 3
- IOTKDTZEEBZNCM-UGYAYLCHSA-N Asn-Asn-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOTKDTZEEBZNCM-UGYAYLCHSA-N 0.000 description 3
- FAEFJTCTNZTPHX-ACZMJKKPSA-N Asn-Gln-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FAEFJTCTNZTPHX-ACZMJKKPSA-N 0.000 description 3
- GJFYPBDMUGGLFR-NKWVEPMBSA-N Asn-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC(=O)N)N)C(=O)O GJFYPBDMUGGLFR-NKWVEPMBSA-N 0.000 description 3
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 3
- LZLCLRQMUQWUHJ-GUBZILKMSA-N Asn-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N LZLCLRQMUQWUHJ-GUBZILKMSA-N 0.000 description 3
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 3
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 description 3
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 3
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 3
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 3
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 3
- XPGVTUBABLRGHY-BIIVOSGPSA-N Asp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N XPGVTUBABLRGHY-BIIVOSGPSA-N 0.000 description 3
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 3
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 3
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 3
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 3
- QNFRBNZGVVKBNJ-PEFMBERDSA-N Asp-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QNFRBNZGVVKBNJ-PEFMBERDSA-N 0.000 description 3
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 3
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 3
- GWIJZUVQVDJHDI-AVGNSLFASA-N Asp-Phe-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GWIJZUVQVDJHDI-AVGNSLFASA-N 0.000 description 3
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 3
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 3
- NJLLRXWFPQQPHV-SRVKXCTJSA-N Asp-Tyr-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJLLRXWFPQQPHV-SRVKXCTJSA-N 0.000 description 3
- 241000620137 Carboxydothermus hydrogenoformans Species 0.000 description 3
- 108091026890 Coding region Proteins 0.000 description 3
- KIQKJXYVGSYDFS-ZLUOBGJFSA-N Cys-Asn-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O KIQKJXYVGSYDFS-ZLUOBGJFSA-N 0.000 description 3
- WDQXKVCQXRNOSI-GHCJXIJMSA-N Cys-Asp-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WDQXKVCQXRNOSI-GHCJXIJMSA-N 0.000 description 3
- CFQVGYWKSLKWFX-KBIXCLLPSA-N Cys-Glu-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CFQVGYWKSLKWFX-KBIXCLLPSA-N 0.000 description 3
- PDRMRVHPAQKTLT-NAKRPEOUSA-N Cys-Ile-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O PDRMRVHPAQKTLT-NAKRPEOUSA-N 0.000 description 3
- CHRCKSPMGYDLIA-SRVKXCTJSA-N Cys-Phe-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O CHRCKSPMGYDLIA-SRVKXCTJSA-N 0.000 description 3
- KFYPRIGJTICABD-XGEHTFHBSA-N Cys-Thr-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N)O KFYPRIGJTICABD-XGEHTFHBSA-N 0.000 description 3
- 241001618315 Escherichia fergusonii ATCC 35469 Species 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- REJJNXODKSHOKA-ACZMJKKPSA-N Gln-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N REJJNXODKSHOKA-ACZMJKKPSA-N 0.000 description 3
- MWLYSLMKFXWZPW-ZPFDUUQYSA-N Gln-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCC(N)=O MWLYSLMKFXWZPW-ZPFDUUQYSA-N 0.000 description 3
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 3
- LMPBBFWHCRURJD-LAEOZQHASA-N Gln-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LMPBBFWHCRURJD-LAEOZQHASA-N 0.000 description 3
- QFTRCUPCARNIPZ-XHNCKOQMSA-N Gln-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N)C(=O)O QFTRCUPCARNIPZ-XHNCKOQMSA-N 0.000 description 3
- HXOLDXKNWKLDMM-YVNDNENWSA-N Gln-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HXOLDXKNWKLDMM-YVNDNENWSA-N 0.000 description 3
- CAXXTYYGFYTBPV-IUCAKERBSA-N Gln-Leu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CAXXTYYGFYTBPV-IUCAKERBSA-N 0.000 description 3
- KHNJVFYHIKLUPD-SRVKXCTJSA-N Gln-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHNJVFYHIKLUPD-SRVKXCTJSA-N 0.000 description 3
- YGNPTRVNRUKVLA-DCAQKATOSA-N Gln-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N YGNPTRVNRUKVLA-DCAQKATOSA-N 0.000 description 3
- HMIXCETWRYDVMO-GUBZILKMSA-N Gln-Pro-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O HMIXCETWRYDVMO-GUBZILKMSA-N 0.000 description 3
- SYZZMPFLOLSMHL-XHNCKOQMSA-N Gln-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SYZZMPFLOLSMHL-XHNCKOQMSA-N 0.000 description 3
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 3
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 3
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 3
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 3
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 3
- QPRZKNOOOBWXSU-CIUDSAMLSA-N Glu-Asp-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N QPRZKNOOOBWXSU-CIUDSAMLSA-N 0.000 description 3
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 3
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 description 3
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 3
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 3
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 3
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 3
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 3
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 3
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 3
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 3
- BDISFWMLMNBTGP-NUMRIWBASA-N Glu-Thr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O BDISFWMLMNBTGP-NUMRIWBASA-N 0.000 description 3
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 3
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 3
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 3
- FKJQNJCQTKUBCD-XPUUQOCRSA-N Gly-Ala-His Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O FKJQNJCQTKUBCD-XPUUQOCRSA-N 0.000 description 3
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 3
- MZZSCEANQDPJER-ONGXEEELSA-N Gly-Ala-Phe Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MZZSCEANQDPJER-ONGXEEELSA-N 0.000 description 3
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 3
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 3
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 3
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 3
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 3
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 3
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 3
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 3
- WMGHDYWNHNLGBV-ONGXEEELSA-N Gly-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WMGHDYWNHNLGBV-ONGXEEELSA-N 0.000 description 3
- CQMFNTVQVLQRLT-JHEQGTHGSA-N Gly-Thr-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CQMFNTVQVLQRLT-JHEQGTHGSA-N 0.000 description 3
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 3
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 3
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 3
- JWLWNCVBBSBCEM-NKIYYHGXSA-N His-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N)O JWLWNCVBBSBCEM-NKIYYHGXSA-N 0.000 description 3
- YADRBUZBKHHDAO-XPUUQOCRSA-N His-Gly-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C)C(O)=O YADRBUZBKHHDAO-XPUUQOCRSA-N 0.000 description 3
- QCBYAHHNOHBXIH-UWVGGRQHSA-N His-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CN=CN1 QCBYAHHNOHBXIH-UWVGGRQHSA-N 0.000 description 3
- 102000004157 Hydrolases Human genes 0.000 description 3
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 3
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 3
- BOTVMTSMOUSDRW-GMOBBJLQSA-N Ile-Arg-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O BOTVMTSMOUSDRW-GMOBBJLQSA-N 0.000 description 3
- FADXGVVLSPPEQY-GHCJXIJMSA-N Ile-Cys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FADXGVVLSPPEQY-GHCJXIJMSA-N 0.000 description 3
- SPQWWEZBHXHUJN-KBIXCLLPSA-N Ile-Glu-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O SPQWWEZBHXHUJN-KBIXCLLPSA-N 0.000 description 3
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 3
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 3
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 3
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 3
- DMSVBUWGDLYNLC-IAVJCBSLSA-N Ile-Ile-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DMSVBUWGDLYNLC-IAVJCBSLSA-N 0.000 description 3
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 3
- AYLAAGNJNVZDPY-CYDGBPFRSA-N Ile-Met-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)O)N AYLAAGNJNVZDPY-CYDGBPFRSA-N 0.000 description 3
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 3
- KTNGVMMGIQWIDV-OSUNSFLBSA-N Ile-Pro-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O KTNGVMMGIQWIDV-OSUNSFLBSA-N 0.000 description 3
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 3
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 3
- HJDZMPFEXINXLO-QPHKQPEJSA-N Ile-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N HJDZMPFEXINXLO-QPHKQPEJSA-N 0.000 description 3
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 3
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 3
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 3
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 3
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 3
- 241000505525 Kosakonia radicincitans DSM 16656 Species 0.000 description 3
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 3
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 3
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 3
- 240000002648 Lactobacillus brevis ATCC 367 Species 0.000 description 3
- 235000007048 Lactobacillus brevis ATCC 367 Nutrition 0.000 description 3
- 241001468197 Lactobacillus collinoides Species 0.000 description 3
- 101000743006 Lactococcus lactis subsp. cremoris UPF0177 protein in abiGi 5'region Proteins 0.000 description 3
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 3
- KVRKAGGMEWNURO-CIUDSAMLSA-N Leu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N KVRKAGGMEWNURO-CIUDSAMLSA-N 0.000 description 3
- DQPQTXMIRBUWKO-DCAQKATOSA-N Leu-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N DQPQTXMIRBUWKO-DCAQKATOSA-N 0.000 description 3
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 3
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 3
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 3
- AOFYPTOHESIBFZ-KKUMJFAQSA-N Leu-His-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O AOFYPTOHESIBFZ-KKUMJFAQSA-N 0.000 description 3
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 3
- ZALAVHVPPOHAOL-XUXIUFHCSA-N Leu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N ZALAVHVPPOHAOL-XUXIUFHCSA-N 0.000 description 3
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 3
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 3
- HGUUMQWGYCVPKG-DCAQKATOSA-N Leu-Pro-Cys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N HGUUMQWGYCVPKG-DCAQKATOSA-N 0.000 description 3
- UCXQIIIFOOGYEM-ULQDDVLXSA-N Leu-Pro-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCXQIIIFOOGYEM-ULQDDVLXSA-N 0.000 description 3
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 3
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 3
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 3
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 3
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 3
- FPFOYSCDUWTZBF-IHPCNDPISA-N Leu-Trp-Leu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H]([NH3+])CC(C)C)C(=O)N[C@@H](CC(C)C)C([O-])=O)=CNC2=C1 FPFOYSCDUWTZBF-IHPCNDPISA-N 0.000 description 3
- RIHIGSWBLHSGLV-CQDKDKBSSA-N Leu-Tyr-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O RIHIGSWBLHSGLV-CQDKDKBSSA-N 0.000 description 3
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 3
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 3
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 3
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 3
- 241000432054 Listeria innocua Clip11262 Species 0.000 description 3
- 241000693756 Listeria ivanovii subsp. ivanovii PAM 55 Species 0.000 description 3
- 241001389728 Listeria marthii FSL S4-120 Species 0.000 description 3
- 241000708064 Listeria monocytogenes ATCC 19117 Species 0.000 description 3
- 241000534259 Listeria welshimeri serovar 6b str. SLCC5334 Species 0.000 description 3
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 3
- YIBOAHAOAWACDK-QEJZJMRPSA-N Lys-Ala-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YIBOAHAOAWACDK-QEJZJMRPSA-N 0.000 description 3
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 3
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 3
- BYPMOIFBQPEWOH-CIUDSAMLSA-N Lys-Asn-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BYPMOIFBQPEWOH-CIUDSAMLSA-N 0.000 description 3
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 3
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 3
- KPJJOZUXFOLGMQ-CIUDSAMLSA-N Lys-Asp-Asn Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N KPJJOZUXFOLGMQ-CIUDSAMLSA-N 0.000 description 3
- IBQMEXQYZMVIFU-SRVKXCTJSA-N Lys-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N IBQMEXQYZMVIFU-SRVKXCTJSA-N 0.000 description 3
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 3
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 3
- VQXAVLQBQJMENB-SRVKXCTJSA-N Lys-Glu-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O VQXAVLQBQJMENB-SRVKXCTJSA-N 0.000 description 3
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 3
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 3
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 3
- MSSJJDVQTFTLIF-KBPBESRZSA-N Lys-Phe-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O MSSJJDVQTFTLIF-KBPBESRZSA-N 0.000 description 3
- WZVSHTFTCYOFPL-GARJFASQSA-N Lys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N)C(=O)O WZVSHTFTCYOFPL-GARJFASQSA-N 0.000 description 3
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 3
- VVURYEVJJTXWNE-ULQDDVLXSA-N Lys-Tyr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O VVURYEVJJTXWNE-ULQDDVLXSA-N 0.000 description 3
- 241000191113 Marivirga tractuosa Species 0.000 description 3
- 241000604448 Megasphaera elsdenii Species 0.000 description 3
- 241000420773 Megasphaera elsdenii DSM 20460 Species 0.000 description 3
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 3
- PNDCUTDWYVKBHX-IHRRRGAJSA-N Met-Asp-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PNDCUTDWYVKBHX-IHRRRGAJSA-N 0.000 description 3
- UYAKZHGIPRCGPF-CIUDSAMLSA-N Met-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N UYAKZHGIPRCGPF-CIUDSAMLSA-N 0.000 description 3
- GPAHWYRSHCKICP-GUBZILKMSA-N Met-Glu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GPAHWYRSHCKICP-GUBZILKMSA-N 0.000 description 3
- RNAGAJXCSPDPRK-KKUMJFAQSA-N Met-Glu-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 RNAGAJXCSPDPRK-KKUMJFAQSA-N 0.000 description 3
- SXWQMBGNFXAGAT-FJXKBIBVSA-N Met-Gly-Thr Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SXWQMBGNFXAGAT-FJXKBIBVSA-N 0.000 description 3
- LBNFTWKGISQVEE-AVGNSLFASA-N Met-Leu-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCSC LBNFTWKGISQVEE-AVGNSLFASA-N 0.000 description 3
- CGUYGMFQZCYJSG-DCAQKATOSA-N Met-Lys-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O CGUYGMFQZCYJSG-DCAQKATOSA-N 0.000 description 3
- ILKCLLLOGPDNIP-RCWTZXSCSA-N Met-Met-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ILKCLLLOGPDNIP-RCWTZXSCSA-N 0.000 description 3
- FNYBIOGBMWFQRJ-SRVKXCTJSA-N Met-Pro-Met Chemical compound CSCC[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)O)N FNYBIOGBMWFQRJ-SRVKXCTJSA-N 0.000 description 3
- NHXXGBXJTLRGJI-GUBZILKMSA-N Met-Pro-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O NHXXGBXJTLRGJI-GUBZILKMSA-N 0.000 description 3
- PCTFVQATEGYHJU-FXQIFTODSA-N Met-Ser-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O PCTFVQATEGYHJU-FXQIFTODSA-N 0.000 description 3
- RMLLCGYYVZKKRT-CIUDSAMLSA-N Met-Ser-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O RMLLCGYYVZKKRT-CIUDSAMLSA-N 0.000 description 3
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 3
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 3
- 241000191998 Pediococcus acidilactici Species 0.000 description 3
- 241001378071 Pediococcus claussenii ATCC BAA-344 Species 0.000 description 3
- YMORXCKTSSGYIG-IHRRRGAJSA-N Phe-Arg-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N YMORXCKTSSGYIG-IHRRRGAJSA-N 0.000 description 3
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 3
- CUMXHKAOHNWRFQ-BZSNNMDCSA-N Phe-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CUMXHKAOHNWRFQ-BZSNNMDCSA-N 0.000 description 3
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 3
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 3
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 3
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 3
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 3
- KWYUFKZDYYNOTN-UHFFFAOYSA-M Potassium hydroxide Chemical compound [OH-].[K+] KWYUFKZDYYNOTN-UHFFFAOYSA-M 0.000 description 3
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 3
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 3
- JARJPEMLQAWNBR-GUBZILKMSA-N Pro-Asp-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JARJPEMLQAWNBR-GUBZILKMSA-N 0.000 description 3
- GQLOZEMWEBDEAY-NAKRPEOUSA-N Pro-Cys-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GQLOZEMWEBDEAY-NAKRPEOUSA-N 0.000 description 3
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 3
- XQSREVQDGCPFRJ-STQMWFEESA-N Pro-Gly-Phe Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XQSREVQDGCPFRJ-STQMWFEESA-N 0.000 description 3
- LNOWDSPAYBWJOR-PEDHHIEDSA-N Pro-Ile-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LNOWDSPAYBWJOR-PEDHHIEDSA-N 0.000 description 3
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 3
- AUQGUYPHJSMAKI-CYDGBPFRSA-N Pro-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 AUQGUYPHJSMAKI-CYDGBPFRSA-N 0.000 description 3
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 3
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 3
- DWGFLKQSGRUQTI-IHRRRGAJSA-N Pro-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 DWGFLKQSGRUQTI-IHRRRGAJSA-N 0.000 description 3
- MHBSUKYVBZVQRW-HJWJTTGWSA-N Pro-Phe-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MHBSUKYVBZVQRW-HJWJTTGWSA-N 0.000 description 3
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 3
- 101100016013 Pseudomonas aeruginosa (strain ATCC 15692 / DSM 22644 / CIP 104116 / JCM 14847 / LMG 12228 / 1C / PRS 101 / PAO1) xcpU gene Proteins 0.000 description 3
- 241000293869 Salmonella enterica subsp. enterica serovar Typhimurium Species 0.000 description 3
- 241000456202 Salmonella enterica subsp. enterica serovar Urbana str. ATCC 9261 Species 0.000 description 3
- 101100135914 Salmonella typhimurium (strain LT2 / SGSC1412 / ATCC 700720) pduD gene Proteins 0.000 description 3
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 3
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 3
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 3
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 3
- DJACUBDEDBZKLQ-KBIXCLLPSA-N Ser-Ile-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O DJACUBDEDBZKLQ-KBIXCLLPSA-N 0.000 description 3
- BEAFYHFQTOTVFS-VGDYDELISA-N Ser-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N BEAFYHFQTOTVFS-VGDYDELISA-N 0.000 description 3
- LWMQRHDTXHQQOV-MXAVVETBSA-N Ser-Ile-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LWMQRHDTXHQQOV-MXAVVETBSA-N 0.000 description 3
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 3
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 3
- QPPYAWVLAVXISR-DCAQKATOSA-N Ser-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QPPYAWVLAVXISR-DCAQKATOSA-N 0.000 description 3
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 3
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 3
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 3
- 241001333726 Shewanella putrefaciens CN-32 Species 0.000 description 3
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 3
- UKBSDLHIKIXJKH-HJGDQZAQSA-N Thr-Arg-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UKBSDLHIKIXJKH-HJGDQZAQSA-N 0.000 description 3
- QILPDQCTQZDHFM-HJGDQZAQSA-N Thr-Gln-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QILPDQCTQZDHFM-HJGDQZAQSA-N 0.000 description 3
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 3
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 3
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 3
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 3
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 3
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 3
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 3
- NZRUWPIYECBYRK-HTUGSXCWSA-N Thr-Phe-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O NZRUWPIYECBYRK-HTUGSXCWSA-N 0.000 description 3
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 3
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 3
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 3
- 241000322994 Tolumonas auensis DSM 9187 Species 0.000 description 3
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 description 3
- ZNFPUOSTMUMUDR-JRQIVUDYSA-N Tyr-Asn-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZNFPUOSTMUMUDR-JRQIVUDYSA-N 0.000 description 3
- PMDWYLVWHRTJIW-STQMWFEESA-N Tyr-Gly-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PMDWYLVWHRTJIW-STQMWFEESA-N 0.000 description 3
- ILTXFANLDMJWPR-SIUGBPQLSA-N Tyr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N ILTXFANLDMJWPR-SIUGBPQLSA-N 0.000 description 3
- XUIOBCQESNDTDE-FQPOAREZSA-N Tyr-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XUIOBCQESNDTDE-FQPOAREZSA-N 0.000 description 3
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 3
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 3
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 3
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 3
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 3
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 3
- FXVDGDZRYLFQKY-WPRPVWTQSA-N Val-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C FXVDGDZRYLFQKY-WPRPVWTQSA-N 0.000 description 3
- VHRLUTIMTDOVCG-PEDHHIEDSA-N Val-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](C(C)C)N VHRLUTIMTDOVCG-PEDHHIEDSA-N 0.000 description 3
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 3
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 3
- UEPLNXPLHJUYPT-AVGNSLFASA-N Val-Met-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O UEPLNXPLHJUYPT-AVGNSLFASA-N 0.000 description 3
- YDVDTCJGBBJGRT-GUBZILKMSA-N Val-Met-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N YDVDTCJGBBJGRT-GUBZILKMSA-N 0.000 description 3
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 3
- LGXUZJIQCGXKGZ-QXEWZRGKSA-N Val-Pro-Asn Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N LGXUZJIQCGXKGZ-QXEWZRGKSA-N 0.000 description 3
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 3
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 3
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 3
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 3
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 3
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 3
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 3
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 3
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 3
- 241000863377 Yersinia enterocolitica subsp. enterocolitica 8081 Species 0.000 description 3
- 241000779673 Yersinia mollaretii ATCC 43969 Species 0.000 description 3
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 3
- 101150055425 aldh gene Proteins 0.000 description 3
- 108010060035 arginylproline Proteins 0.000 description 3
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 3
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 3
- 101150081680 fldB gene Proteins 0.000 description 3
- 238000010353 genetic engineering Methods 0.000 description 3
- 108010084264 glycyl-glycyl-cysteine Proteins 0.000 description 3
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 3
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 3
- 229930027917 kanamycin Natural products 0.000 description 3
- 229960000318 kanamycin Drugs 0.000 description 3
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 3
- 229930182823 kanamycin A Natural products 0.000 description 3
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 108010068488 methionylphenylalanine Proteins 0.000 description 3
- 150000007523 nucleic acids Chemical group 0.000 description 3
- 239000003208 petroleum Substances 0.000 description 3
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 3
- 108010024607 phenylalanylalanine Proteins 0.000 description 3
- 108010004914 prolylarginine Proteins 0.000 description 3
- ULWHHBHJGPPBCO-UHFFFAOYSA-N propane-1,1-diol Chemical compound CCC(O)O ULWHHBHJGPPBCO-UHFFFAOYSA-N 0.000 description 3
- 239000002994 raw material Substances 0.000 description 3
- 230000007420 reactivation Effects 0.000 description 3
- 230000002441 reversible effect Effects 0.000 description 3
- 150000003839 salts Chemical class 0.000 description 3
- 238000000926 separation method Methods 0.000 description 3
- 239000000243 solution Substances 0.000 description 3
- 241000894007 species Species 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- 239000000758 substrate Substances 0.000 description 3
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 3
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 3
- GJLXVWOMRRWCIB-MERZOTPQSA-N (2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-acetamido-5-(diaminomethylideneamino)pentanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanamide Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(N)=O)C1=CC=C(O)C=C1 GJLXVWOMRRWCIB-MERZOTPQSA-N 0.000 description 2
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 2
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 2
- PKAUICCNAWQPAU-UHFFFAOYSA-N 2-(4-chloro-2-methylphenoxy)acetic acid;n-methylmethanamine Chemical compound CNC.CC1=CC(Cl)=CC=C1OCC(O)=O PKAUICCNAWQPAU-UHFFFAOYSA-N 0.000 description 2
- 101710160406 Acetaldehyde dehydrogenase (acetylating) EutE Proteins 0.000 description 2
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 2
- ODWSTKXGQGYHSH-FXQIFTODSA-N Ala-Arg-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O ODWSTKXGQGYHSH-FXQIFTODSA-N 0.000 description 2
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 2
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 2
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 2
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 2
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 2
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 2
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 2
- IYCZBJXFSZSHPN-DLOVCJGASA-N Ala-Cys-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IYCZBJXFSZSHPN-DLOVCJGASA-N 0.000 description 2
- CXQODNIBUNQWAS-CIUDSAMLSA-N Ala-Gln-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CXQODNIBUNQWAS-CIUDSAMLSA-N 0.000 description 2
- JPGBXANAQYHTLA-DRZSPHRISA-N Ala-Gln-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JPGBXANAQYHTLA-DRZSPHRISA-N 0.000 description 2
- CZPAHAKGPDUIPJ-CIUDSAMLSA-N Ala-Gln-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CZPAHAKGPDUIPJ-CIUDSAMLSA-N 0.000 description 2
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 2
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 2
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 2
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 2
- ROLXPVQSRCPVGK-XDTLVQLUSA-N Ala-Glu-Tyr Chemical compound N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O ROLXPVQSRCPVGK-XDTLVQLUSA-N 0.000 description 2
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 2
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 2
- CWEAKSWWKHGTRJ-BQBZGAKWSA-N Ala-Gly-Met Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O CWEAKSWWKHGTRJ-BQBZGAKWSA-N 0.000 description 2
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 2
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 2
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 2
- CFPQUJZTLUQUTJ-HTFCKZLJSA-N Ala-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](C)N CFPQUJZTLUQUTJ-HTFCKZLJSA-N 0.000 description 2
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 2
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 2
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 2
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 2
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 2
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 2
- PIXQDIGKDNNOOV-GUBZILKMSA-N Ala-Lys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O PIXQDIGKDNNOOV-GUBZILKMSA-N 0.000 description 2
- OQWQTGBOFPJOIF-DLOVCJGASA-N Ala-Lys-His Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N OQWQTGBOFPJOIF-DLOVCJGASA-N 0.000 description 2
- BLTRAARCJYVJKV-QEJZJMRPSA-N Ala-Lys-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(O)=O BLTRAARCJYVJKV-QEJZJMRPSA-N 0.000 description 2
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 2
- NLOMBWNGESDVJU-GUBZILKMSA-N Ala-Met-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLOMBWNGESDVJU-GUBZILKMSA-N 0.000 description 2
- DWYROCSXOOMOEU-CIUDSAMLSA-N Ala-Met-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DWYROCSXOOMOEU-CIUDSAMLSA-N 0.000 description 2
- AWNAEZICPNGAJK-FXQIFTODSA-N Ala-Met-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O AWNAEZICPNGAJK-FXQIFTODSA-N 0.000 description 2
- GFEDXKNBZMPEDM-KZVJFYERSA-N Ala-Met-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFEDXKNBZMPEDM-KZVJFYERSA-N 0.000 description 2
- BDQNLQSWRAPHGU-DLOVCJGASA-N Ala-Phe-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N BDQNLQSWRAPHGU-DLOVCJGASA-N 0.000 description 2
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 2
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 2
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 2
- SYIFFFHSXBNPMC-UWJYBYFXSA-N Ala-Ser-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N SYIFFFHSXBNPMC-UWJYBYFXSA-N 0.000 description 2
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 2
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 2
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 2
- BGGAIXWIZCIFSG-XDTLVQLUSA-N Ala-Tyr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O BGGAIXWIZCIFSG-XDTLVQLUSA-N 0.000 description 2
- QGZKDVFQNNGYKY-UHFFFAOYSA-N Ammonia Chemical compound N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 description 2
- NLXLAEXVIDQMFP-UHFFFAOYSA-N Ammonia chloride Chemical compound [NH4+].[Cl-] NLXLAEXVIDQMFP-UHFFFAOYSA-N 0.000 description 2
- SBVJJNJLFWSJOV-UBHSHLNASA-N Arg-Ala-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SBVJJNJLFWSJOV-UBHSHLNASA-N 0.000 description 2
- IJPNNYWHXGADJG-GUBZILKMSA-N Arg-Ala-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O IJPNNYWHXGADJG-GUBZILKMSA-N 0.000 description 2
- BHSYMWWMVRPCPA-CYDGBPFRSA-N Arg-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N BHSYMWWMVRPCPA-CYDGBPFRSA-N 0.000 description 2
- KWTVWJPNHAOREN-IHRRRGAJSA-N Arg-Asn-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KWTVWJPNHAOREN-IHRRRGAJSA-N 0.000 description 2
- MFAMTAVAFBPXDC-LPEHRKFASA-N Arg-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O MFAMTAVAFBPXDC-LPEHRKFASA-N 0.000 description 2
- GIVWETPOBCRTND-DCAQKATOSA-N Arg-Gln-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GIVWETPOBCRTND-DCAQKATOSA-N 0.000 description 2
- BEXGZLUHRXTZCC-CIUDSAMLSA-N Arg-Gln-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N BEXGZLUHRXTZCC-CIUDSAMLSA-N 0.000 description 2
- PHHRSPBBQUFULD-UWVGGRQHSA-N Arg-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N PHHRSPBBQUFULD-UWVGGRQHSA-N 0.000 description 2
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 2
- UAOSDDXCTBIPCA-QXEWZRGKSA-N Arg-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UAOSDDXCTBIPCA-QXEWZRGKSA-N 0.000 description 2
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 2
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 2
- NOZYDJOPOGKUSR-AVGNSLFASA-N Arg-Leu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O NOZYDJOPOGKUSR-AVGNSLFASA-N 0.000 description 2
- BNYNOWJESJJIOI-XUXIUFHCSA-N Arg-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N BNYNOWJESJJIOI-XUXIUFHCSA-N 0.000 description 2
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 2
- MTYLORHAQXVQOW-AVGNSLFASA-N Arg-Lys-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O MTYLORHAQXVQOW-AVGNSLFASA-N 0.000 description 2
- GSUFZRURORXYTM-STQMWFEESA-N Arg-Phe-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 GSUFZRURORXYTM-STQMWFEESA-N 0.000 description 2
- HGKHPCFTRQDHCU-IUCAKERBSA-N Arg-Pro-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HGKHPCFTRQDHCU-IUCAKERBSA-N 0.000 description 2
- OWSMKCJUBAPHED-JYJNAYRXSA-N Arg-Pro-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OWSMKCJUBAPHED-JYJNAYRXSA-N 0.000 description 2
- LFAUVOXPCGJKTB-DCAQKATOSA-N Arg-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N LFAUVOXPCGJKTB-DCAQKATOSA-N 0.000 description 2
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 2
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 2
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 2
- WOZDCBHUGJVJPL-AVGNSLFASA-N Arg-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WOZDCBHUGJVJPL-AVGNSLFASA-N 0.000 description 2
- NUHQMYUWLUSRJX-BIIVOSGPSA-N Asn-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N NUHQMYUWLUSRJX-BIIVOSGPSA-N 0.000 description 2
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 2
- VDCIPFYVCICPEC-FXQIFTODSA-N Asn-Arg-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O VDCIPFYVCICPEC-FXQIFTODSA-N 0.000 description 2
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 2
- QHBMKQWOIYJYMI-BYULHYEWSA-N Asn-Asn-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QHBMKQWOIYJYMI-BYULHYEWSA-N 0.000 description 2
- IYVSIZAXNLOKFQ-BYULHYEWSA-N Asn-Asp-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IYVSIZAXNLOKFQ-BYULHYEWSA-N 0.000 description 2
- QYXNFROWLZPWPC-FXQIFTODSA-N Asn-Glu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QYXNFROWLZPWPC-FXQIFTODSA-N 0.000 description 2
- OGMDXNFGPOPZTK-GUBZILKMSA-N Asn-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N OGMDXNFGPOPZTK-GUBZILKMSA-N 0.000 description 2
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 2
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 2
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 2
- PHJPKNUWWHRAOC-PEFMBERDSA-N Asn-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PHJPKNUWWHRAOC-PEFMBERDSA-N 0.000 description 2
- XVBDDUPJVQXDSI-PEFMBERDSA-N Asn-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVBDDUPJVQXDSI-PEFMBERDSA-N 0.000 description 2
- ACKNRKFVYUVWAC-ZPFDUUQYSA-N Asn-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ACKNRKFVYUVWAC-ZPFDUUQYSA-N 0.000 description 2
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 2
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 2
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 2
- JEEFEQCRXKPQHC-KKUMJFAQSA-N Asn-Leu-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JEEFEQCRXKPQHC-KKUMJFAQSA-N 0.000 description 2
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 2
- MDDXKBHIMYYJLW-FXQIFTODSA-N Asn-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N MDDXKBHIMYYJLW-FXQIFTODSA-N 0.000 description 2
- UYRPHDGXHKBZHJ-CIUDSAMLSA-N Asn-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N UYRPHDGXHKBZHJ-CIUDSAMLSA-N 0.000 description 2
- VITDJIPIJZAVGC-VEVYYDQMSA-N Asn-Met-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VITDJIPIJZAVGC-VEVYYDQMSA-N 0.000 description 2
- PBFXCUOEGVJTMV-QXEWZRGKSA-N Asn-Met-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O PBFXCUOEGVJTMV-QXEWZRGKSA-N 0.000 description 2
- PPCORQFLAZWUNO-QWRGUYRKSA-N Asn-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N PPCORQFLAZWUNO-QWRGUYRKSA-N 0.000 description 2
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 2
- XMHFCUKJRCQXGI-CIUDSAMLSA-N Asn-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O XMHFCUKJRCQXGI-CIUDSAMLSA-N 0.000 description 2
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 2
- HPNDKUOLNRVRAY-BIIVOSGPSA-N Asn-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N)C(=O)O HPNDKUOLNRVRAY-BIIVOSGPSA-N 0.000 description 2
- MYTHOBCLNIOFBL-SRVKXCTJSA-N Asn-Ser-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYTHOBCLNIOFBL-SRVKXCTJSA-N 0.000 description 2
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 2
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 2
- BUVNWKQBMZLCDW-UGYAYLCHSA-N Asp-Asn-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BUVNWKQBMZLCDW-UGYAYLCHSA-N 0.000 description 2
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 2
- LKIYSIYBKYLKPU-BIIVOSGPSA-N Asp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O LKIYSIYBKYLKPU-BIIVOSGPSA-N 0.000 description 2
- SNAWMGHSCHKSDK-GUBZILKMSA-N Asp-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N SNAWMGHSCHKSDK-GUBZILKMSA-N 0.000 description 2
- RRKCPMGSRIDLNC-AVGNSLFASA-N Asp-Glu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RRKCPMGSRIDLNC-AVGNSLFASA-N 0.000 description 2
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 2
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 2
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 2
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 2
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 2
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 2
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 2
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 2
- HXVILZUZXFLVEN-DCAQKATOSA-N Asp-Met-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O HXVILZUZXFLVEN-DCAQKATOSA-N 0.000 description 2
- PCJOFZYFFMBZKC-PCBIJLKTSA-N Asp-Phe-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PCJOFZYFFMBZKC-PCBIJLKTSA-N 0.000 description 2
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 2
- SXLCDCZHNCLFGZ-BPUTZDHNSA-N Asp-Pro-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O SXLCDCZHNCLFGZ-BPUTZDHNSA-N 0.000 description 2
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 2
- MJJIHRWNWSQTOI-VEVYYDQMSA-N Asp-Thr-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MJJIHRWNWSQTOI-VEVYYDQMSA-N 0.000 description 2
- PLNJUJGNLDSFOP-UWJYBYFXSA-N Asp-Tyr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PLNJUJGNLDSFOP-UWJYBYFXSA-N 0.000 description 2
- OTKUAVXGMREHRX-CFMVVWHZSA-N Asp-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 OTKUAVXGMREHRX-CFMVVWHZSA-N 0.000 description 2
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 2
- 101100228546 Bacillus subtilis (strain 168) folE2 gene Proteins 0.000 description 2
- 102100021277 Beta-secretase 2 Human genes 0.000 description 2
- 101710150190 Beta-secretase 2 Proteins 0.000 description 2
- 101100512078 Caenorhabditis elegans lys-1 gene Proteins 0.000 description 2
- CURLTUGMZLYLDI-UHFFFAOYSA-N Carbon dioxide Chemical compound O=C=O CURLTUGMZLYLDI-UHFFFAOYSA-N 0.000 description 2
- 241001110437 Citrobacter koseri ATCC BAA-895 Species 0.000 description 2
- 241000193401 Clostridium acetobutylicum Species 0.000 description 2
- 101100446691 Clostridium sporogenes fldC gene Proteins 0.000 description 2
- 101100446694 Clostridium sporogenes fldI gene Proteins 0.000 description 2
- 241000186226 Corynebacterium glutamicum Species 0.000 description 2
- KKZHXOOZHFABQQ-UWJYBYFXSA-N Cys-Ala-Tyr Chemical compound SC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KKZHXOOZHFABQQ-UWJYBYFXSA-N 0.000 description 2
- SZQCDCKIGWQAQN-FXQIFTODSA-N Cys-Arg-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O SZQCDCKIGWQAQN-FXQIFTODSA-N 0.000 description 2
- KABHAOSDMIYXTR-GUBZILKMSA-N Cys-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N KABHAOSDMIYXTR-GUBZILKMSA-N 0.000 description 2
- LYSHSHHDBVKJRN-JBDRJPRFSA-N Cys-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CS)N LYSHSHHDBVKJRN-JBDRJPRFSA-N 0.000 description 2
- ZLHPWFSAUJEEAN-KBIXCLLPSA-N Cys-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N ZLHPWFSAUJEEAN-KBIXCLLPSA-N 0.000 description 2
- XZKJEOMFLDVXJG-KATARQTJSA-N Cys-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)N)O XZKJEOMFLDVXJG-KATARQTJSA-N 0.000 description 2
- OZHXXYOHPLLLMI-CIUDSAMLSA-N Cys-Lys-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OZHXXYOHPLLLMI-CIUDSAMLSA-N 0.000 description 2
- IDFVDSBJNMPBSX-SRVKXCTJSA-N Cys-Lys-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O IDFVDSBJNMPBSX-SRVKXCTJSA-N 0.000 description 2
- MKVKKORBPTUSNX-LPEHRKFASA-N Cys-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N MKVKKORBPTUSNX-LPEHRKFASA-N 0.000 description 2
- KZZYVYWSXMFYEC-DCAQKATOSA-N Cys-Val-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KZZYVYWSXMFYEC-DCAQKATOSA-N 0.000 description 2
- 241000186540 Desulfosporosinus orientis Species 0.000 description 2
- 241000168726 Dictyostelium discoideum Species 0.000 description 2
- 241000588722 Escherichia Species 0.000 description 2
- 108010046276 FLP recombinase Proteins 0.000 description 2
- 241000233866 Fungi Species 0.000 description 2
- 241001135751 Geobacter metallireducens Species 0.000 description 2
- NUMFTVCBONFQIQ-DRZSPHRISA-N Gln-Ala-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NUMFTVCBONFQIQ-DRZSPHRISA-N 0.000 description 2
- LTLXPHKSQQILNF-CIUDSAMLSA-N Gln-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N LTLXPHKSQQILNF-CIUDSAMLSA-N 0.000 description 2
- DXMPMSWUZVNBSG-QEJZJMRPSA-N Gln-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N DXMPMSWUZVNBSG-QEJZJMRPSA-N 0.000 description 2
- CRRFJBGUGNNOCS-PEFMBERDSA-N Gln-Asp-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CRRFJBGUGNNOCS-PEFMBERDSA-N 0.000 description 2
- LWDGZZGWDMHBOF-FXQIFTODSA-N Gln-Glu-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LWDGZZGWDMHBOF-FXQIFTODSA-N 0.000 description 2
- GNMQDOGFWYWPNM-LAEOZQHASA-N Gln-Gly-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)CNC(=O)[C@@H](N)CCC(N)=O)C(O)=O GNMQDOGFWYWPNM-LAEOZQHASA-N 0.000 description 2
- SBHVGKBYOQKAEA-SDDRHHMPSA-N Gln-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SBHVGKBYOQKAEA-SDDRHHMPSA-N 0.000 description 2
- JKGHMESJHRTHIC-SIUGBPQLSA-N Gln-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JKGHMESJHRTHIC-SIUGBPQLSA-N 0.000 description 2
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 2
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 2
- HPCOBEHVEHWREJ-DCAQKATOSA-N Gln-Lys-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HPCOBEHVEHWREJ-DCAQKATOSA-N 0.000 description 2
- NMYFPKCIGUJMIK-GUBZILKMSA-N Gln-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N NMYFPKCIGUJMIK-GUBZILKMSA-N 0.000 description 2
- DOMHVQBSRJNNKD-ZPFDUUQYSA-N Gln-Met-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DOMHVQBSRJNNKD-ZPFDUUQYSA-N 0.000 description 2
- ZXGLLNZQSBLQLT-SRVKXCTJSA-N Gln-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZXGLLNZQSBLQLT-SRVKXCTJSA-N 0.000 description 2
- RONJIBWTGKVKFY-HTUGSXCWSA-N Gln-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O RONJIBWTGKVKFY-HTUGSXCWSA-N 0.000 description 2
- OACQOWPRWGNKTP-AVGNSLFASA-N Gln-Tyr-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O OACQOWPRWGNKTP-AVGNSLFASA-N 0.000 description 2
- AKDOUBMVLRCHBD-SIUGBPQLSA-N Gln-Tyr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AKDOUBMVLRCHBD-SIUGBPQLSA-N 0.000 description 2
- SDSMVVSHLAAOJL-UKJIMTQDSA-N Gln-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N SDSMVVSHLAAOJL-UKJIMTQDSA-N 0.000 description 2
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 2
- RSUVOPBMWMTVDI-XEGUGMAKSA-N Glu-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCC(O)=O)C)C(O)=O)=CNC2=C1 RSUVOPBMWMTVDI-XEGUGMAKSA-N 0.000 description 2
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 2
- KEBACWCLVOXFNC-DCAQKATOSA-N Glu-Arg-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O KEBACWCLVOXFNC-DCAQKATOSA-N 0.000 description 2
- LJLPOZGRPLORTF-CIUDSAMLSA-N Glu-Asn-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O LJLPOZGRPLORTF-CIUDSAMLSA-N 0.000 description 2
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 description 2
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 2
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 2
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 2
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 2
- XHUCVVHRLNPZSZ-CIUDSAMLSA-N Glu-Gln-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XHUCVVHRLNPZSZ-CIUDSAMLSA-N 0.000 description 2
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 2
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 2
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 2
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 2
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 2
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 2
- VOORMNJKNBGYGK-YUMQZZPRSA-N Glu-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N VOORMNJKNBGYGK-YUMQZZPRSA-N 0.000 description 2
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 2
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 2
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 2
- GRHXUHCFENOCOS-ZPFDUUQYSA-N Glu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N GRHXUHCFENOCOS-ZPFDUUQYSA-N 0.000 description 2
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 2
- DWBBKNPKDHXIAC-SRVKXCTJSA-N Glu-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCC(O)=O DWBBKNPKDHXIAC-SRVKXCTJSA-N 0.000 description 2
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 2
- OHWJUIXZHVIXJJ-GUBZILKMSA-N Glu-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N OHWJUIXZHVIXJJ-GUBZILKMSA-N 0.000 description 2
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 2
- AQNYKMCFCCZEEL-JYJNAYRXSA-N Glu-Lys-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AQNYKMCFCCZEEL-JYJNAYRXSA-N 0.000 description 2
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 2
- ZWMYUDZLXAQHCK-CIUDSAMLSA-N Glu-Met-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O ZWMYUDZLXAQHCK-CIUDSAMLSA-N 0.000 description 2
- LHIPZASLKPYDPI-AVGNSLFASA-N Glu-Phe-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LHIPZASLKPYDPI-AVGNSLFASA-N 0.000 description 2
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 2
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 2
- QOXDAWODGSIDDI-GUBZILKMSA-N Glu-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N QOXDAWODGSIDDI-GUBZILKMSA-N 0.000 description 2
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 2
- RGJKYNUINKGPJN-RWRJDSDZSA-N Glu-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(=O)O)N RGJKYNUINKGPJN-RWRJDSDZSA-N 0.000 description 2
- UMZHHILWZBFPGL-LOKLDPHHSA-N Glu-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O UMZHHILWZBFPGL-LOKLDPHHSA-N 0.000 description 2
- HVKAAUOFFTUSAA-XDTLVQLUSA-N Glu-Tyr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O HVKAAUOFFTUSAA-XDTLVQLUSA-N 0.000 description 2
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 2
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 2
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 2
- FVGOGEGGQLNZGH-DZKIICNBSA-N Glu-Val-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FVGOGEGGQLNZGH-DZKIICNBSA-N 0.000 description 2
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 2
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 2
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 2
- LURCIJSJAKFCRO-QWRGUYRKSA-N Gly-Asn-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LURCIJSJAKFCRO-QWRGUYRKSA-N 0.000 description 2
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 2
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 2
- LGQZOQRDEUIZJY-YUMQZZPRSA-N Gly-Cys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CS)NC(=O)CN)C(O)=O LGQZOQRDEUIZJY-YUMQZZPRSA-N 0.000 description 2
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 2
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 2
- DENRBIYENOKSEX-PEXQALLHSA-N Gly-Ile-His Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DENRBIYENOKSEX-PEXQALLHSA-N 0.000 description 2
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 2
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 2
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 2
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 2
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 2
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 2
- BXICSAQLIHFDDL-YUMQZZPRSA-N Gly-Lys-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BXICSAQLIHFDDL-YUMQZZPRSA-N 0.000 description 2
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 2
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 2
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 2
- DBJYVKDPGIFXFO-BQBZGAKWSA-N Gly-Met-Ala Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O DBJYVKDPGIFXFO-BQBZGAKWSA-N 0.000 description 2
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 2
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 2
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 2
- RZEDHGORCKRINR-STQMWFEESA-N Gly-Trp-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN RZEDHGORCKRINR-STQMWFEESA-N 0.000 description 2
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 2
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 2
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 2
- 241001055502 Gordonia terrae C-6 Species 0.000 description 2
- 241000329363 Halalkalicoccus Species 0.000 description 2
- MJNWEIMBXKKCSF-XVYDVKMFSA-N His-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N MJNWEIMBXKKCSF-XVYDVKMFSA-N 0.000 description 2
- MDBYBTWRMOAJAY-NHCYSSNCSA-N His-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N MDBYBTWRMOAJAY-NHCYSSNCSA-N 0.000 description 2
- BDHUXUFYNUOUIT-SRVKXCTJSA-N His-Asp-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BDHUXUFYNUOUIT-SRVKXCTJSA-N 0.000 description 2
- YVCGJPIKRMGNPA-LSJOCFKGSA-N His-Met-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O YVCGJPIKRMGNPA-LSJOCFKGSA-N 0.000 description 2
- SOYCWSKCUVDLMC-AVGNSLFASA-N His-Pro-Arg Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CCCNC(=N)N)C(=O)O SOYCWSKCUVDLMC-AVGNSLFASA-N 0.000 description 2
- BZAQOPHNBFOOJS-DCAQKATOSA-N His-Pro-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O BZAQOPHNBFOOJS-DCAQKATOSA-N 0.000 description 2
- VCBWXASUBZIFLQ-IHRRRGAJSA-N His-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O VCBWXASUBZIFLQ-IHRRRGAJSA-N 0.000 description 2
- XVZJRZQIHJMUBG-TUBUOCAGSA-N His-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CN=CN1)N XVZJRZQIHJMUBG-TUBUOCAGSA-N 0.000 description 2
- WSXNWASHQNSMRX-GVXVVHGQSA-N His-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WSXNWASHQNSMRX-GVXVVHGQSA-N 0.000 description 2
- CISBRYJZMFWOHJ-JBDRJPRFSA-N Ile-Ala-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O)N CISBRYJZMFWOHJ-JBDRJPRFSA-N 0.000 description 2
- YPWHUFAAMNHMGS-QSFUFRPTSA-N Ile-Ala-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YPWHUFAAMNHMGS-QSFUFRPTSA-N 0.000 description 2
- DPTBVFUDCPINIP-JURCDPSOSA-N Ile-Ala-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DPTBVFUDCPINIP-JURCDPSOSA-N 0.000 description 2
- FVEWRQXNISSYFO-ZPFDUUQYSA-N Ile-Arg-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FVEWRQXNISSYFO-ZPFDUUQYSA-N 0.000 description 2
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 2
- ATXGFMOBVKSOMK-PEDHHIEDSA-N Ile-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N ATXGFMOBVKSOMK-PEDHHIEDSA-N 0.000 description 2
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 2
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 2
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 2
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 2
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 2
- LGMUPVWZEYYUMU-YVNDNENWSA-N Ile-Glu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LGMUPVWZEYYUMU-YVNDNENWSA-N 0.000 description 2
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 2
- KOPIAUWNLKKELG-SIGLWIIPSA-N Ile-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N KOPIAUWNLKKELG-SIGLWIIPSA-N 0.000 description 2
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 2
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 2
- KBAPKNDWAGVGTH-IGISWZIWSA-N Ile-Ile-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KBAPKNDWAGVGTH-IGISWZIWSA-N 0.000 description 2
- QZZIBQZLWBOOJH-PEDHHIEDSA-N Ile-Ile-Val Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)O QZZIBQZLWBOOJH-PEDHHIEDSA-N 0.000 description 2
- GAZGFPOZOLEYAJ-YTFOTSKYSA-N Ile-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N GAZGFPOZOLEYAJ-YTFOTSKYSA-N 0.000 description 2
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 2
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 2
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 2
- FFAUOCITXBMRBT-YTFOTSKYSA-N Ile-Lys-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FFAUOCITXBMRBT-YTFOTSKYSA-N 0.000 description 2
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 2
- VOCZPDONPURUHV-QEWYBTABSA-N Ile-Phe-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VOCZPDONPURUHV-QEWYBTABSA-N 0.000 description 2
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 2
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 2
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 2
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 2
- RKQAYOWLSFLJEE-SVSWQMSJSA-N Ile-Thr-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)O)N RKQAYOWLSFLJEE-SVSWQMSJSA-N 0.000 description 2
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 2
- OMDWJWGZGMCQND-CFMVVWHZSA-N Ile-Tyr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMDWJWGZGMCQND-CFMVVWHZSA-N 0.000 description 2
- PRTZQMBYUZFSFA-XEGUGMAKSA-N Ile-Tyr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)NCC(=O)O)N PRTZQMBYUZFSFA-XEGUGMAKSA-N 0.000 description 2
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 2
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 2
- 101100498826 Klebsiella oxytoca (strain ATCC 8724 / DSM 4798 / JCM 20051 / NBRC 3318 / NRRL B-199 / KCTC 1686) ddrB gene Proteins 0.000 description 2
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 2
- JVTAAEKCZFNVCJ-UHFFFAOYSA-M Lactate Chemical compound CC(O)C([O-])=O JVTAAEKCZFNVCJ-UHFFFAOYSA-M 0.000 description 2
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 2
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 2
- XIRYQRLFHWWWTC-QEJZJMRPSA-N Leu-Ala-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XIRYQRLFHWWWTC-QEJZJMRPSA-N 0.000 description 2
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 2
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 2
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 2
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 2
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 2
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 2
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 2
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 2
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 2
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 2
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 2
- RRSLQOLASISYTB-CIUDSAMLSA-N Leu-Cys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O RRSLQOLASISYTB-CIUDSAMLSA-N 0.000 description 2
- PPTAQBNUFKTJKA-BJDJZHNGSA-N Leu-Cys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PPTAQBNUFKTJKA-BJDJZHNGSA-N 0.000 description 2
- FOEHRHOBWFQSNW-KATARQTJSA-N Leu-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)N)O FOEHRHOBWFQSNW-KATARQTJSA-N 0.000 description 2
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 2
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 2
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 2
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 2
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 2
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 2
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 2
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 2
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 2
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 2
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 2
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 2
- KXCMQWMNYQOAKA-SRVKXCTJSA-N Leu-Met-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KXCMQWMNYQOAKA-SRVKXCTJSA-N 0.000 description 2
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 2
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 2
- ADJWHHZETYAAAX-SRVKXCTJSA-N Leu-Ser-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ADJWHHZETYAAAX-SRVKXCTJSA-N 0.000 description 2
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 2
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 2
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 2
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 2
- QQXJROOJCMIHIV-AVGNSLFASA-N Leu-Val-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O QQXJROOJCMIHIV-AVGNSLFASA-N 0.000 description 2
- 241001293864 Listeria seeligeri serovar 1/2b str. SLCC3954 Species 0.000 description 2
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 2
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 2
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 2
- DEFGUIIUYAUEDU-ZPFDUUQYSA-N Lys-Asn-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DEFGUIIUYAUEDU-ZPFDUUQYSA-N 0.000 description 2
- SSJBMGCZZXCGJJ-DCAQKATOSA-N Lys-Asp-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O SSJBMGCZZXCGJJ-DCAQKATOSA-N 0.000 description 2
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 2
- OPTCSTACHGNULU-DCAQKATOSA-N Lys-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCCCN OPTCSTACHGNULU-DCAQKATOSA-N 0.000 description 2
- YFGWNAROEYWGNL-GUBZILKMSA-N Lys-Gln-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YFGWNAROEYWGNL-GUBZILKMSA-N 0.000 description 2
- NDORZBUHCOJQDO-GVXVVHGQSA-N Lys-Gln-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O NDORZBUHCOJQDO-GVXVVHGQSA-N 0.000 description 2
- WGLAORUKDGRINI-WDCWCFNPSA-N Lys-Glu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGLAORUKDGRINI-WDCWCFNPSA-N 0.000 description 2
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 2
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 2
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 2
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 2
- IUWMQCZOTYRXPL-ZPFDUUQYSA-N Lys-Ile-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O IUWMQCZOTYRXPL-ZPFDUUQYSA-N 0.000 description 2
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 2
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 2
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 2
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 2
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 2
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 2
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 2
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 2
- YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 description 2
- LKDXINHHSWFFJC-SRVKXCTJSA-N Lys-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N LKDXINHHSWFFJC-SRVKXCTJSA-N 0.000 description 2
- JOSAKOKSPXROGQ-BJDJZHNGSA-N Lys-Ser-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JOSAKOKSPXROGQ-BJDJZHNGSA-N 0.000 description 2
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 2
- IEVXCWPVBYCJRZ-IXOXFDKPSA-N Lys-Thr-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IEVXCWPVBYCJRZ-IXOXFDKPSA-N 0.000 description 2
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 2
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 2
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 2
- SUZVLFWOCKHWET-CQDKDKBSSA-N Lys-Tyr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O SUZVLFWOCKHWET-CQDKDKBSSA-N 0.000 description 2
- ZVZRQKJOQQAFCF-ULQDDVLXSA-N Lys-Tyr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZVZRQKJOQQAFCF-ULQDDVLXSA-N 0.000 description 2
- IEIHKHYMBIYQTH-YESZJQIVSA-N Lys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCCN)N)C(=O)O IEIHKHYMBIYQTH-YESZJQIVSA-N 0.000 description 2
- QFSYGUMEANRNJE-DCAQKATOSA-N Lys-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N QFSYGUMEANRNJE-DCAQKATOSA-N 0.000 description 2
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 2
- CSNNHWWHGAXBCP-UHFFFAOYSA-L Magnesium sulfate Chemical compound [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 2
- QAHFGYLFLVGBNW-DCAQKATOSA-N Met-Ala-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN QAHFGYLFLVGBNW-DCAQKATOSA-N 0.000 description 2
- VTKPSXWRUGCOAC-GUBZILKMSA-N Met-Ala-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCSC VTKPSXWRUGCOAC-GUBZILKMSA-N 0.000 description 2
- CTVJSFRHUOSCQQ-DCAQKATOSA-N Met-Arg-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CTVJSFRHUOSCQQ-DCAQKATOSA-N 0.000 description 2
- AHZNUGRZHMZGFL-GUBZILKMSA-N Met-Arg-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCNC(N)=N AHZNUGRZHMZGFL-GUBZILKMSA-N 0.000 description 2
- OXHSZBRPUGNMKW-DCAQKATOSA-N Met-Gln-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OXHSZBRPUGNMKW-DCAQKATOSA-N 0.000 description 2
- MYAPQOBHGWJZOM-UWVGGRQHSA-N Met-Gly-Leu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C MYAPQOBHGWJZOM-UWVGGRQHSA-N 0.000 description 2
- UZVKFARGHHMQGX-IUCAKERBSA-N Met-Gly-Met Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCSC UZVKFARGHHMQGX-IUCAKERBSA-N 0.000 description 2
- BKIFWLQFOOKUCA-DCAQKATOSA-N Met-His-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N BKIFWLQFOOKUCA-DCAQKATOSA-N 0.000 description 2
- JHDNAOVJJQSMMM-GMOBBJLQSA-N Met-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N JHDNAOVJJQSMMM-GMOBBJLQSA-N 0.000 description 2
- RRIHXWPHQSXHAQ-XUXIUFHCSA-N Met-Ile-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O RRIHXWPHQSXHAQ-XUXIUFHCSA-N 0.000 description 2
- UNPGTBHYKJOCCZ-DCAQKATOSA-N Met-Lys-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O UNPGTBHYKJOCCZ-DCAQKATOSA-N 0.000 description 2
- WPTHAGXMYDRPFD-SRVKXCTJSA-N Met-Lys-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O WPTHAGXMYDRPFD-SRVKXCTJSA-N 0.000 description 2
- LCPUWQLULVXROY-RHYQMDGZSA-N Met-Lys-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LCPUWQLULVXROY-RHYQMDGZSA-N 0.000 description 2
- FMYLZGQFKPHXHI-GUBZILKMSA-N Met-Met-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O FMYLZGQFKPHXHI-GUBZILKMSA-N 0.000 description 2
- WUYLWZRHRLLEGB-AVGNSLFASA-N Met-Met-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O WUYLWZRHRLLEGB-AVGNSLFASA-N 0.000 description 2
- XOFDBXYPKZUAAM-GUBZILKMSA-N Met-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N XOFDBXYPKZUAAM-GUBZILKMSA-N 0.000 description 2
- QEDGNYFHLXXIDC-DCAQKATOSA-N Met-Pro-Gln Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O QEDGNYFHLXXIDC-DCAQKATOSA-N 0.000 description 2
- BJPQKNHZHUCQNQ-SRVKXCTJSA-N Met-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCSC)N BJPQKNHZHUCQNQ-SRVKXCTJSA-N 0.000 description 2
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 2
- GMMLGMFBYCFCCX-KZVJFYERSA-N Met-Thr-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMMLGMFBYCFCCX-KZVJFYERSA-N 0.000 description 2
- CIIJWIAORKTXAH-FJXKBIBVSA-N Met-Thr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O CIIJWIAORKTXAH-FJXKBIBVSA-N 0.000 description 2
- OVTOTTGZBWXLFU-QXEWZRGKSA-N Met-Val-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O OVTOTTGZBWXLFU-QXEWZRGKSA-N 0.000 description 2
- FSTWDRPCQQUJIT-NHCYSSNCSA-N Met-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N FSTWDRPCQQUJIT-NHCYSSNCSA-N 0.000 description 2
- QAVZUKIPOMBLMC-AVGNSLFASA-N Met-Val-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C QAVZUKIPOMBLMC-AVGNSLFASA-N 0.000 description 2
- IIHMNTBFPMRJCN-RCWTZXSCSA-N Met-Val-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IIHMNTBFPMRJCN-RCWTZXSCSA-N 0.000 description 2
- 241000157876 Metallosphaera sedula Species 0.000 description 2
- 241000589308 Methylobacterium extorquens Species 0.000 description 2
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 2
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 2
- LZDIENNKWVXJMX-JYJNAYRXSA-N Phe-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CC=CC=C1 LZDIENNKWVXJMX-JYJNAYRXSA-N 0.000 description 2
- BRDYYVQTEJVRQT-HRCADAONSA-N Phe-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BRDYYVQTEJVRQT-HRCADAONSA-N 0.000 description 2
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 2
- VUYCNYVLKACHPA-KKUMJFAQSA-N Phe-Asp-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VUYCNYVLKACHPA-KKUMJFAQSA-N 0.000 description 2
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 2
- RFEXGCASCQGGHZ-STQMWFEESA-N Phe-Gly-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O RFEXGCASCQGGHZ-STQMWFEESA-N 0.000 description 2
- YYKZDTVQHTUKDW-RYUDHWBXSA-N Phe-Gly-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N YYKZDTVQHTUKDW-RYUDHWBXSA-N 0.000 description 2
- KRYSMKKRRRWOCZ-QEWYBTABSA-N Phe-Ile-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KRYSMKKRRRWOCZ-QEWYBTABSA-N 0.000 description 2
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 2
- KPEIBEPEUAZWNS-ULQDDVLXSA-N Phe-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KPEIBEPEUAZWNS-ULQDDVLXSA-N 0.000 description 2
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 description 2
- IWZRODDWOSIXPZ-IRXDYDNUSA-N Phe-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 IWZRODDWOSIXPZ-IRXDYDNUSA-N 0.000 description 2
- RBRNEFJTEHPDSL-ACRUOGEOSA-N Phe-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 RBRNEFJTEHPDSL-ACRUOGEOSA-N 0.000 description 2
- CZQZSMJXFGGBHM-KKUMJFAQSA-N Phe-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O CZQZSMJXFGGBHM-KKUMJFAQSA-N 0.000 description 2
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 2
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 2
- JHSRGEODDALISP-XVSYOHENSA-N Phe-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JHSRGEODDALISP-XVSYOHENSA-N 0.000 description 2
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 2
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 2
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 2
- VIIRRNQMMIHYHQ-XHSDSOJGSA-N Phe-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N VIIRRNQMMIHYHQ-XHSDSOJGSA-N 0.000 description 2
- NBIIXXVUZAFLBC-UHFFFAOYSA-N Phosphoric acid Chemical compound OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 2
- 241001252411 Polynucleobacter necessarius Species 0.000 description 2
- 241000605862 Porphyromonas gingivalis Species 0.000 description 2
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 2
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 2
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 2
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 2
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 2
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 2
- HXOLCSYHGRNXJJ-IHRRRGAJSA-N Pro-Asp-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HXOLCSYHGRNXJJ-IHRRRGAJSA-N 0.000 description 2
- HJSCRFZVGXAGNG-SRVKXCTJSA-N Pro-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 HJSCRFZVGXAGNG-SRVKXCTJSA-N 0.000 description 2
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 2
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 2
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 2
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 2
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 2
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 2
- FFSLAIOXRMOFIZ-GJZGRUSLSA-N Pro-Gly-Trp Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)O)C(=O)CNC(=O)[C@@H]1CCCN1 FFSLAIOXRMOFIZ-GJZGRUSLSA-N 0.000 description 2
- AJCRQOHDLCBHFA-SRVKXCTJSA-N Pro-His-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AJCRQOHDLCBHFA-SRVKXCTJSA-N 0.000 description 2
- BFXZQMWKTYWGCF-PYJNHQTQSA-N Pro-His-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BFXZQMWKTYWGCF-PYJNHQTQSA-N 0.000 description 2
- STASJMBVVHNWCG-IHRRRGAJSA-N Pro-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 STASJMBVVHNWCG-IHRRRGAJSA-N 0.000 description 2
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 2
- MRYUJHGPZQNOAD-IHRRRGAJSA-N Pro-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 MRYUJHGPZQNOAD-IHRRRGAJSA-N 0.000 description 2
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 description 2
- AJBQTGZIZQXBLT-STQMWFEESA-N Pro-Phe-Gly Chemical compound C([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 AJBQTGZIZQXBLT-STQMWFEESA-N 0.000 description 2
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 2
- SEZGGSHLMROBFX-CIUDSAMLSA-N Pro-Ser-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O SEZGGSHLMROBFX-CIUDSAMLSA-N 0.000 description 2
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 2
- VVEQUISRWJDGMX-VKOGCVSHSA-N Pro-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 VVEQUISRWJDGMX-VKOGCVSHSA-N 0.000 description 2
- CWZUFLWPEFHWEI-IHRRRGAJSA-N Pro-Tyr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O CWZUFLWPEFHWEI-IHRRRGAJSA-N 0.000 description 2
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 2
- 101100505872 Pseudomonas aeruginosa (strain ATCC 15692 / DSM 22644 / CIP 104116 / JCM 14847 / LMG 12228 / 1C / PRS 101 / PAO1) xcpT gene Proteins 0.000 description 2
- LCTONWCANYUPML-UHFFFAOYSA-M Pyruvate Chemical compound CC(=O)C([O-])=O LCTONWCANYUPML-UHFFFAOYSA-M 0.000 description 2
- 102000018120 Recombinases Human genes 0.000 description 2
- 108010091086 Recombinases Proteins 0.000 description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 2
- 241000448969 Salmonella enterica subsp. enterica serovar Mbandaka str. ATCC 51958 Species 0.000 description 2
- 101100135913 Salmonella typhimurium (strain LT2 / SGSC1412 / ATCC 700720) pduC gene Proteins 0.000 description 2
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 2
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 2
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 2
- PZZJMBYSYAKYPK-UWJYBYFXSA-N Ser-Ala-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PZZJMBYSYAKYPK-UWJYBYFXSA-N 0.000 description 2
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 2
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 2
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 2
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 2
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 2
- COLJZWUVZIXSSS-CIUDSAMLSA-N Ser-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N COLJZWUVZIXSSS-CIUDSAMLSA-N 0.000 description 2
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 2
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 2
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 2
- GRSLLFZTTLBOQX-CIUDSAMLSA-N Ser-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N GRSLLFZTTLBOQX-CIUDSAMLSA-N 0.000 description 2
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 2
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 2
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 2
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 2
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 2
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 2
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 2
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 2
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 2
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 2
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 2
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 2
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 description 2
- 241000205091 Sulfolobus solfataricus Species 0.000 description 2
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 2
- 241000264843 Syntrophobacter fumaroxidans Species 0.000 description 2
- 241000589017 Thermomicrobium roseum Species 0.000 description 2
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 2
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 2
- YRNBANYVJJBGDI-VZFHVOOUSA-N Thr-Ala-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O)N)O YRNBANYVJJBGDI-VZFHVOOUSA-N 0.000 description 2
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 2
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 2
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 2
- CAGTXGDOIFXLPC-KZVJFYERSA-N Thr-Arg-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N CAGTXGDOIFXLPC-KZVJFYERSA-N 0.000 description 2
- GKMYGVQDGVYCPC-IUKAMOBKSA-N Thr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H]([C@@H](C)O)N GKMYGVQDGVYCPC-IUKAMOBKSA-N 0.000 description 2
- JXKMXEBNZCKSDY-JIOCBJNQSA-N Thr-Asp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O JXKMXEBNZCKSDY-JIOCBJNQSA-N 0.000 description 2
- ZUUDNCOCILSYAM-KKHAAJSZSA-N Thr-Asp-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZUUDNCOCILSYAM-KKHAAJSZSA-N 0.000 description 2
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 2
- GKWNLDNXMMLRMC-GLLZPBPUSA-N Thr-Glu-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O GKWNLDNXMMLRMC-GLLZPBPUSA-N 0.000 description 2
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 2
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 2
- BIENEHRYNODTLP-HJGDQZAQSA-N Thr-Glu-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N)O BIENEHRYNODTLP-HJGDQZAQSA-N 0.000 description 2
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 2
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 2
- GMXIJHCBTZDAPD-QPHKQPEJSA-N Thr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N GMXIJHCBTZDAPD-QPHKQPEJSA-N 0.000 description 2
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 2
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 2
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 2
- KRDSCBLRHORMRK-JXUBOQSCSA-N Thr-Lys-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O KRDSCBLRHORMRK-JXUBOQSCSA-N 0.000 description 2
- JLNMFGCJODTXDH-WEDXCCLWSA-N Thr-Lys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O JLNMFGCJODTXDH-WEDXCCLWSA-N 0.000 description 2
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 2
- PUEWAXRPXOEQOW-HJGDQZAQSA-N Thr-Met-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(O)=O PUEWAXRPXOEQOW-HJGDQZAQSA-N 0.000 description 2
- WVVOFCVMHAXGLE-LFSVMHDDSA-N Thr-Phe-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O WVVOFCVMHAXGLE-LFSVMHDDSA-N 0.000 description 2
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 2
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 description 2
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 2
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 2
- DOBIBIXIHJKVJF-XKBZYTNZSA-N Thr-Ser-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DOBIBIXIHJKVJF-XKBZYTNZSA-N 0.000 description 2
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 2
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 2
- RPECVQBNONKZAT-WZLNRYEVSA-N Thr-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H]([C@@H](C)O)N RPECVQBNONKZAT-WZLNRYEVSA-N 0.000 description 2
- XVHAUVJXBFGUPC-RPTUDFQQSA-N Thr-Tyr-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XVHAUVJXBFGUPC-RPTUDFQQSA-N 0.000 description 2
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 2
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 2
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 2
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 2
- WBZOZLNLXVBCNW-LTHWPDAASA-N Trp-Thr-Ile Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)[C@@H](C)O)=CNC2=C1 WBZOZLNLXVBCNW-LTHWPDAASA-N 0.000 description 2
- CRCHQCUINSOGFD-JBACZVJFSA-N Trp-Tyr-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N CRCHQCUINSOGFD-JBACZVJFSA-N 0.000 description 2
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 2
- TVOGEPLDNYTAHD-CQDKDKBSSA-N Tyr-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TVOGEPLDNYTAHD-CQDKDKBSSA-N 0.000 description 2
- ZWZOCUWOXSDYFZ-CQDKDKBSSA-N Tyr-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZWZOCUWOXSDYFZ-CQDKDKBSSA-N 0.000 description 2
- DXYWRYQRKPIGGU-BPNCWPANSA-N Tyr-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DXYWRYQRKPIGGU-BPNCWPANSA-N 0.000 description 2
- SGFIXFAHVWJKTD-KJEVXHAQSA-N Tyr-Arg-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SGFIXFAHVWJKTD-KJEVXHAQSA-N 0.000 description 2
- GFHYISDTIWZUSU-QWRGUYRKSA-N Tyr-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GFHYISDTIWZUSU-QWRGUYRKSA-N 0.000 description 2
- YRBHLWWGSSQICE-IHRRRGAJSA-N Tyr-Asp-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O YRBHLWWGSSQICE-IHRRRGAJSA-N 0.000 description 2
- NQJDICVXXIMMMB-XDTLVQLUSA-N Tyr-Glu-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O NQJDICVXXIMMMB-XDTLVQLUSA-N 0.000 description 2
- XQYHLZNPOTXRMQ-KKUMJFAQSA-N Tyr-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XQYHLZNPOTXRMQ-KKUMJFAQSA-N 0.000 description 2
- FMOSEWZYZPMJAL-KKUMJFAQSA-N Tyr-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N FMOSEWZYZPMJAL-KKUMJFAQSA-N 0.000 description 2
- UNUZEBFXGWVAOP-DZKIICNBSA-N Tyr-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UNUZEBFXGWVAOP-DZKIICNBSA-N 0.000 description 2
- HIINQLBHPIQYHN-JTQLQIEISA-N Tyr-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HIINQLBHPIQYHN-JTQLQIEISA-N 0.000 description 2
- FMXFHNSFABRVFZ-BZSNNMDCSA-N Tyr-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FMXFHNSFABRVFZ-BZSNNMDCSA-N 0.000 description 2
- GZOCMHSZGGJBCX-ULQDDVLXSA-N Tyr-Lys-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O GZOCMHSZGGJBCX-ULQDDVLXSA-N 0.000 description 2
- AOIZTZRWMSPPAY-KAOXEZKKSA-N Tyr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O AOIZTZRWMSPPAY-KAOXEZKKSA-N 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- 108010064997 VPY tripeptide Proteins 0.000 description 2
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 2
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 2
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 2
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 2
- HURRXSNHCCSJHA-AUTRQRHGSA-N Val-Gln-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HURRXSNHCCSJHA-AUTRQRHGSA-N 0.000 description 2
- MHAHQDBEIDPFQS-NHCYSSNCSA-N Val-Glu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)C(C)C MHAHQDBEIDPFQS-NHCYSSNCSA-N 0.000 description 2
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 2
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 2
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 2
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 2
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 description 2
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 2
- SDSCOOZQQGUQFC-GVXVVHGQSA-N Val-His-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N SDSCOOZQQGUQFC-GVXVVHGQSA-N 0.000 description 2
- BZWUSZGQOILYEU-STECZYCISA-N Val-Ile-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BZWUSZGQOILYEU-STECZYCISA-N 0.000 description 2
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 2
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 2
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 2
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 2
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 2
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 2
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 2
- QPPZEDOTPZOSEC-RCWTZXSCSA-N Val-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N)O QPPZEDOTPZOSEC-RCWTZXSCSA-N 0.000 description 2
- MJFSRZZJQWZHFQ-SRVKXCTJSA-N Val-Met-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N MJFSRZZJQWZHFQ-SRVKXCTJSA-N 0.000 description 2
- MJOUSKQHAIARKI-JYJNAYRXSA-N Val-Phe-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 MJOUSKQHAIARKI-JYJNAYRXSA-N 0.000 description 2
- ZXYPHBKIZLAQTL-QXEWZRGKSA-N Val-Pro-Asp Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZXYPHBKIZLAQTL-QXEWZRGKSA-N 0.000 description 2
- NSUUANXHLKKHQB-BZSNNMDCSA-N Val-Pro-Trp Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CNC2=CC=CC=C12 NSUUANXHLKKHQB-BZSNNMDCSA-N 0.000 description 2
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 2
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 2
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 2
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 2
- YLBNZCJFSVJDRJ-KJEVXHAQSA-N Val-Thr-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O YLBNZCJFSVJDRJ-KJEVXHAQSA-N 0.000 description 2
- QHSSPPHOHJSTML-HOCLYGCPSA-N Val-Trp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N QHSSPPHOHJSTML-HOCLYGCPSA-N 0.000 description 2
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 2
- SSKKGOWRPNIVDW-AVGNSLFASA-N Val-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SSKKGOWRPNIVDW-AVGNSLFASA-N 0.000 description 2
- 108010081404 acein-2 Proteins 0.000 description 2
- 108010028939 alanyl-alanyl-lysyl-alanine Proteins 0.000 description 2
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 2
- 239000007864 aqueous solution Substances 0.000 description 2
- 108010089442 arginyl-leucyl-alanyl-arginine Proteins 0.000 description 2
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 2
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 2
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 2
- 230000003115 biocidal effect Effects 0.000 description 2
- 229960005091 chloramphenicol Drugs 0.000 description 2
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 2
- 238000003776 cleavage reaction Methods 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 230000006378 damage Effects 0.000 description 2
- 230000000593 degrading effect Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 235000014113 dietary fatty acids Nutrition 0.000 description 2
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 2
- 101150032129 egsA gene Proteins 0.000 description 2
- 235000019441 ethanol Nutrition 0.000 description 2
- 229930195729 fatty acid Natural products 0.000 description 2
- 239000000194 fatty acid Substances 0.000 description 2
- 150000004665 fatty acids Chemical class 0.000 description 2
- 101150033931 gldA gene Proteins 0.000 description 2
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 2
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 2
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 2
- 108010084389 glycyltryptophan Proteins 0.000 description 2
- IPCSVZSSVZVIGE-UHFFFAOYSA-N hexadecanoic acid Chemical compound CCCCCCCCCCCCCCCC(O)=O IPCSVZSSVZVIGE-UHFFFAOYSA-N 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 230000003834 intracellular effect Effects 0.000 description 2
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 2
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 2
- 108010089256 lysyl-aspartyl-glutamyl-leucine Proteins 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 102000039446 nucleic acids Human genes 0.000 description 2
- 108020004707 nucleic acids Proteins 0.000 description 2
- 150000007524 organic acids Chemical class 0.000 description 2
- 235000005985 organic acids Nutrition 0.000 description 2
- 229910052760 oxygen Inorganic materials 0.000 description 2
- 239000001301 oxygen Substances 0.000 description 2
- 101150097419 pduH gene Proteins 0.000 description 2
- 108010082795 phenylalanyl-arginyl-arginine Proteins 0.000 description 2
- 108010018625 phenylalanylarginine Proteins 0.000 description 2
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 2
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 2
- 108010079317 prolyl-tyrosine Proteins 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 238000011084 recovery Methods 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- 230000028327 secretion Effects 0.000 description 2
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 2
- 239000011734 sodium Substances 0.000 description 2
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 2
- 108010038745 tryptophylglycine Proteins 0.000 description 2
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 2
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 2
- JNTMAZFVYNDPLB-PEDHHIEDSA-N (2S,3S)-2-[[[(2S)-1-[(2S,3S)-2-amino-3-methyl-1-oxopentyl]-2-pyrrolidinyl]-oxomethyl]amino]-3-methylpentanoic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNTMAZFVYNDPLB-PEDHHIEDSA-N 0.000 description 1
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 1
- OGILYBDMVOATLU-CQJMVLFOSA-N (2s)-2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]-n-[(2s)-1-[[(2s)-1-amino-1-oxo-3-phenylpropan-2-yl]amino]-4-methyl-1-oxopentan-2-yl]-4-methylpentanamide Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(N)=O)C1=CC=C(O)C=C1 OGILYBDMVOATLU-CQJMVLFOSA-N 0.000 description 1
- JBFQOLHAGBKPTP-NZATWWQASA-N (2s)-2-[[(2s)-4-carboxy-2-[[3-carboxy-2-[[(2s)-2,6-diaminohexanoyl]amino]propanoyl]amino]butanoyl]amino]-4-methylpentanoic acid Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)C(CC(O)=O)NC(=O)[C@@H](N)CCCCN JBFQOLHAGBKPTP-NZATWWQASA-N 0.000 description 1
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 1
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 1
- PAWQVTBBRAZDMG-UHFFFAOYSA-N 2-(3-bromo-2-fluorophenyl)acetic acid Chemical compound OC(=O)CC1=CC=CC(Br)=C1F PAWQVTBBRAZDMG-UHFFFAOYSA-N 0.000 description 1
- UZDMJOILBYFRMP-UHFFFAOYSA-N 2-[2-[2-[(2-amino-3-methylpentanoyl)amino]propanoylamino]propanoylamino]-3-methylpentanoic acid Chemical compound CCC(C)C(N)C(=O)NC(C)C(=O)NC(C)C(=O)NC(C(O)=O)C(C)CC UZDMJOILBYFRMP-UHFFFAOYSA-N 0.000 description 1
- ZWZOCNTYMUOGPQ-UHFFFAOYSA-N 2-[[2-[[1-(2-amino-3-methylpentanoyl)pyrrolidine-2-carbonyl]amino]acetyl]amino]-3-methylpentanoic acid Chemical compound CCC(C)C(N)C(=O)N1CCCC1C(=O)NCC(=O)NC(C(C)CC)C(O)=O ZWZOCNTYMUOGPQ-UHFFFAOYSA-N 0.000 description 1
- LLMSELCKURJSJI-UHFFFAOYSA-N 2-[[2-[[2-[(2-amino-4-methylsulfanylbutanoyl)amino]-3-methylpentanoyl]amino]-4-methylpentanoyl]amino]-3-methylpentanoic acid Chemical compound CCC(C)C(C(O)=O)NC(=O)C(CC(C)C)NC(=O)C(C(C)CC)NC(=O)C(N)CCSC LLMSELCKURJSJI-UHFFFAOYSA-N 0.000 description 1
- 101710135866 50S ribosomal protein L29 Proteins 0.000 description 1
- QEVHRUUCFGRFIF-UHFFFAOYSA-N 6,18-dimethoxy-17-[oxo-(3,4,5-trimethoxyphenyl)methoxy]-1,3,11,12,14,15,16,17,18,19,20,21-dodecahydroyohimban-19-carboxylic acid methyl ester Chemical compound C1C2CN3CCC(C4=CC=C(OC)C=C4N4)=C4C3CC2C(C(=O)OC)C(OC)C1OC(=O)C1=CC(OC)=C(OC)C(OC)=C1 QEVHRUUCFGRFIF-UHFFFAOYSA-N 0.000 description 1
- 108010044087 AS-I toxin Proteins 0.000 description 1
- 241000604450 Acidaminococcus fermentans Species 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 241001217929 Advenella kashmirensis Species 0.000 description 1
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 1
- LGQPPBQRUBVTIF-JBDRJPRFSA-N Ala-Ala-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LGQPPBQRUBVTIF-JBDRJPRFSA-N 0.000 description 1
- VBDMWOKJZDCFJM-FXQIFTODSA-N Ala-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N VBDMWOKJZDCFJM-FXQIFTODSA-N 0.000 description 1
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 1
- SKHCUBQVZJHOFM-NAKRPEOUSA-N Ala-Arg-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SKHCUBQVZJHOFM-NAKRPEOUSA-N 0.000 description 1
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 1
- CCUAQNUWXLYFRA-IMJSIDKUSA-N Ala-Asn Chemical compound C[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CC(N)=O CCUAQNUWXLYFRA-IMJSIDKUSA-N 0.000 description 1
- XEXJJJRVTFGWIC-FXQIFTODSA-N Ala-Asn-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XEXJJJRVTFGWIC-FXQIFTODSA-N 0.000 description 1
- SHYYAQLDNVHPFT-DLOVCJGASA-N Ala-Asn-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SHYYAQLDNVHPFT-DLOVCJGASA-N 0.000 description 1
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 1
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 description 1
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 1
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 1
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 1
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 1
- RCQRKPUXJAGEEC-ZLUOBGJFSA-N Ala-Cys-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O RCQRKPUXJAGEEC-ZLUOBGJFSA-N 0.000 description 1
- KRHRBKYBJXMYBB-WHFBIAKZSA-N Ala-Cys-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O KRHRBKYBJXMYBB-WHFBIAKZSA-N 0.000 description 1
- UQJUGHFKNKGHFQ-VZFHVOOUSA-N Ala-Cys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UQJUGHFKNKGHFQ-VZFHVOOUSA-N 0.000 description 1
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 1
- CSAHOYQKNHGDHX-ACZMJKKPSA-N Ala-Gln-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CSAHOYQKNHGDHX-ACZMJKKPSA-N 0.000 description 1
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 1
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 1
- AWAXZRDKUHOPBO-GUBZILKMSA-N Ala-Gln-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O AWAXZRDKUHOPBO-GUBZILKMSA-N 0.000 description 1
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 1
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 1
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 1
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 1
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 1
- UHMQKOBNPRAZGB-CIUDSAMLSA-N Ala-Glu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N UHMQKOBNPRAZGB-CIUDSAMLSA-N 0.000 description 1
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 1
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 1
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 1
- BTBUEVAGZCKULD-XPUUQOCRSA-N Ala-Gly-His Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CN=CN1 BTBUEVAGZCKULD-XPUUQOCRSA-N 0.000 description 1
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 1
- QHASENCZLDHBGX-ONGXEEELSA-N Ala-Gly-Phe Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QHASENCZLDHBGX-ONGXEEELSA-N 0.000 description 1
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 1
- JDIQCVUDDFENPU-ZKWXMUAHSA-N Ala-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CNC=N1 JDIQCVUDDFENPU-ZKWXMUAHSA-N 0.000 description 1
- FDAZDMAFZYTHGS-XVYDVKMFSA-N Ala-His-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O FDAZDMAFZYTHGS-XVYDVKMFSA-N 0.000 description 1
- LTSBJNNXPBBNDT-HGNGGELXSA-N Ala-His-Gln Chemical compound N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)O LTSBJNNXPBBNDT-HGNGGELXSA-N 0.000 description 1
- SHKGHIFSEAGTNL-DLOVCJGASA-N Ala-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 SHKGHIFSEAGTNL-DLOVCJGASA-N 0.000 description 1
- CBCCCLMNOBLBSC-XVYDVKMFSA-N Ala-His-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CBCCCLMNOBLBSC-XVYDVKMFSA-N 0.000 description 1
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 1
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 1
- FOHXUHGZZKETFI-JBDRJPRFSA-N Ala-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N FOHXUHGZZKETFI-JBDRJPRFSA-N 0.000 description 1
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 1
- QCTFKEJEIMPOLW-JURCDPSOSA-N Ala-Ile-Phe Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QCTFKEJEIMPOLW-JURCDPSOSA-N 0.000 description 1
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 1
- LBYMZCVBOKYZNS-CIUDSAMLSA-N Ala-Leu-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O LBYMZCVBOKYZNS-CIUDSAMLSA-N 0.000 description 1
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 1
- VHVVPYOJIIQCKS-QEJZJMRPSA-N Ala-Leu-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VHVVPYOJIIQCKS-QEJZJMRPSA-N 0.000 description 1
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 1
- OMFMCIVBKCEMAK-CYDGBPFRSA-N Ala-Leu-Val-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O OMFMCIVBKCEMAK-CYDGBPFRSA-N 0.000 description 1
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 1
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- CHFFHQUVXHEGBY-GARJFASQSA-N Ala-Lys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CHFFHQUVXHEGBY-GARJFASQSA-N 0.000 description 1
- XUCHENWTTBFODJ-FXQIFTODSA-N Ala-Met-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O XUCHENWTTBFODJ-FXQIFTODSA-N 0.000 description 1
- VHEVVUZDDUCAKU-FXQIFTODSA-N Ala-Met-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O VHEVVUZDDUCAKU-FXQIFTODSA-N 0.000 description 1
- GKAZXNDATBWNBI-DCAQKATOSA-N Ala-Met-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N GKAZXNDATBWNBI-DCAQKATOSA-N 0.000 description 1
- VEAPAYQQLSEKEM-GUBZILKMSA-N Ala-Met-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O VEAPAYQQLSEKEM-GUBZILKMSA-N 0.000 description 1
- FVNAUOZKIPAYNA-BPNCWPANSA-N Ala-Met-Tyr Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FVNAUOZKIPAYNA-BPNCWPANSA-N 0.000 description 1
- DRARURMRLANNLS-GUBZILKMSA-N Ala-Met-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O DRARURMRLANNLS-GUBZILKMSA-N 0.000 description 1
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 1
- KYDYGANDJHFBCW-DRZSPHRISA-N Ala-Phe-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KYDYGANDJHFBCW-DRZSPHRISA-N 0.000 description 1
- RUXQNKVQSKOOBS-JURCDPSOSA-N Ala-Phe-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RUXQNKVQSKOOBS-JURCDPSOSA-N 0.000 description 1
- WEZNQZHACPSMEF-QEJZJMRPSA-N Ala-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 WEZNQZHACPSMEF-QEJZJMRPSA-N 0.000 description 1
- DXTYEWAQOXYRHZ-KKXDTOCCSA-N Ala-Phe-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N DXTYEWAQOXYRHZ-KKXDTOCCSA-N 0.000 description 1
- IHMCQESUJVZTKW-UBHSHLNASA-N Ala-Phe-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 IHMCQESUJVZTKW-UBHSHLNASA-N 0.000 description 1
- FEGOCLZUJUFCHP-CIUDSAMLSA-N Ala-Pro-Gln Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FEGOCLZUJUFCHP-CIUDSAMLSA-N 0.000 description 1
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 1
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 1
- CQJHFKKGZXKZBC-BPNCWPANSA-N Ala-Pro-Tyr Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CQJHFKKGZXKZBC-BPNCWPANSA-N 0.000 description 1
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 1
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- OMCKWYSDUQBYCN-FXQIFTODSA-N Ala-Ser-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O OMCKWYSDUQBYCN-FXQIFTODSA-N 0.000 description 1
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 1
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 1
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 1
- ISCYZXFOCXWUJU-KZVJFYERSA-N Ala-Thr-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O ISCYZXFOCXWUJU-KZVJFYERSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 1
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 1
- FSXDWQGEWZQBPJ-HERUPUMHSA-N Ala-Trp-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FSXDWQGEWZQBPJ-HERUPUMHSA-N 0.000 description 1
- YXXPVUOMPSZURS-ZLIFDBKOSA-N Ala-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 YXXPVUOMPSZURS-ZLIFDBKOSA-N 0.000 description 1
- AOAKQKVICDWCLB-UWJYBYFXSA-N Ala-Tyr-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AOAKQKVICDWCLB-UWJYBYFXSA-N 0.000 description 1
- KLKARCOHVHLAJP-UWJYBYFXSA-N Ala-Tyr-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CS)C(O)=O KLKARCOHVHLAJP-UWJYBYFXSA-N 0.000 description 1
- XKXAZPSREVUCRT-BPNCWPANSA-N Ala-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=C(O)C=C1 XKXAZPSREVUCRT-BPNCWPANSA-N 0.000 description 1
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 1
- YEBZNKPPOHFZJM-BPNCWPANSA-N Ala-Tyr-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O YEBZNKPPOHFZJM-BPNCWPANSA-N 0.000 description 1
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 1
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 1
- XKHLBBQNPSOGPI-GUBZILKMSA-N Ala-Val-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)N XKHLBBQNPSOGPI-GUBZILKMSA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- 102000007698 Alcohol dehydrogenase Human genes 0.000 description 1
- 108010021809 Alcohol dehydrogenase Proteins 0.000 description 1
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 1
- ATRRKUHOCOJYRX-UHFFFAOYSA-N Ammonium bicarbonate Chemical compound [NH4+].OC([O-])=O ATRRKUHOCOJYRX-UHFFFAOYSA-N 0.000 description 1
- VHUUQVKOLVNVRT-UHFFFAOYSA-N Ammonium hydroxide Chemical compound [NH4+].[OH-] VHUUQVKOLVNVRT-UHFFFAOYSA-N 0.000 description 1
- 239000004254 Ammonium phosphate Substances 0.000 description 1
- 241001468259 Anoxybacillus flavithermus Species 0.000 description 1
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 1
- DFCIPNHFKOQAME-FXQIFTODSA-N Arg-Ala-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFCIPNHFKOQAME-FXQIFTODSA-N 0.000 description 1
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 1
- YYOVLDPHIJAOSY-DCAQKATOSA-N Arg-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N YYOVLDPHIJAOSY-DCAQKATOSA-N 0.000 description 1
- VYSRNGOMGHOJCK-GUBZILKMSA-N Arg-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N VYSRNGOMGHOJCK-GUBZILKMSA-N 0.000 description 1
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 1
- OLDOLPWZEMHNIA-PJODQICGSA-N Arg-Ala-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OLDOLPWZEMHNIA-PJODQICGSA-N 0.000 description 1
- KGSJCPBERYUXCN-BPNCWPANSA-N Arg-Ala-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KGSJCPBERYUXCN-BPNCWPANSA-N 0.000 description 1
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 1
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 1
- WOPFJPHVBWKZJH-SRVKXCTJSA-N Arg-Arg-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O WOPFJPHVBWKZJH-SRVKXCTJSA-N 0.000 description 1
- DCGLNNVKIZXQOJ-FXQIFTODSA-N Arg-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DCGLNNVKIZXQOJ-FXQIFTODSA-N 0.000 description 1
- QPOARHANPULOTM-GMOBBJLQSA-N Arg-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N QPOARHANPULOTM-GMOBBJLQSA-N 0.000 description 1
- MAISCYVJLBBRNU-DCAQKATOSA-N Arg-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N MAISCYVJLBBRNU-DCAQKATOSA-N 0.000 description 1
- OCOZPTHLDVSFCZ-BPUTZDHNSA-N Arg-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N OCOZPTHLDVSFCZ-BPUTZDHNSA-N 0.000 description 1
- OTUQSEPIIVBYEM-IHRRRGAJSA-N Arg-Asn-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OTUQSEPIIVBYEM-IHRRRGAJSA-N 0.000 description 1
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 1
- NTAZNGWBXRVEDJ-FXQIFTODSA-N Arg-Asp-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NTAZNGWBXRVEDJ-FXQIFTODSA-N 0.000 description 1
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 1
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 1
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 1
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 1
- JVMKBJNSRZWDBO-FXQIFTODSA-N Arg-Cys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O JVMKBJNSRZWDBO-FXQIFTODSA-N 0.000 description 1
- DGFGDPVSDQPANQ-XGEHTFHBSA-N Arg-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N)O DGFGDPVSDQPANQ-XGEHTFHBSA-N 0.000 description 1
- SNBHMYQRNCJSOJ-CIUDSAMLSA-N Arg-Gln-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SNBHMYQRNCJSOJ-CIUDSAMLSA-N 0.000 description 1
- KBBKCNHWCDJPGN-GUBZILKMSA-N Arg-Gln-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KBBKCNHWCDJPGN-GUBZILKMSA-N 0.000 description 1
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 1
- BJNUAWGXPSHQMJ-DCAQKATOSA-N Arg-Gln-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O BJNUAWGXPSHQMJ-DCAQKATOSA-N 0.000 description 1
- MTANSHNQTWPZKP-KKUMJFAQSA-N Arg-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O MTANSHNQTWPZKP-KKUMJFAQSA-N 0.000 description 1
- RKRSYHCNPFGMTA-CIUDSAMLSA-N Arg-Glu-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O RKRSYHCNPFGMTA-CIUDSAMLSA-N 0.000 description 1
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 1
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 1
- UFBURHXMKFQVLM-CIUDSAMLSA-N Arg-Glu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UFBURHXMKFQVLM-CIUDSAMLSA-N 0.000 description 1
- NXDXECQFKHXHAM-HJGDQZAQSA-N Arg-Glu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NXDXECQFKHXHAM-HJGDQZAQSA-N 0.000 description 1
- JAYIQMNQDMOBFY-KKUMJFAQSA-N Arg-Glu-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JAYIQMNQDMOBFY-KKUMJFAQSA-N 0.000 description 1
- GOWZVQXTHUCNSQ-NHCYSSNCSA-N Arg-Glu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GOWZVQXTHUCNSQ-NHCYSSNCSA-N 0.000 description 1
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 1
- YNSGXDWWPCGGQS-YUMQZZPRSA-N Arg-Gly-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O YNSGXDWWPCGGQS-YUMQZZPRSA-N 0.000 description 1
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 1
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 1
- QKSAZKCRVQYYGS-UWVGGRQHSA-N Arg-Gly-His Chemical compound N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O QKSAZKCRVQYYGS-UWVGGRQHSA-N 0.000 description 1
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 1
- UPKMBGAAEZGHOC-RWMBFGLXSA-N Arg-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O UPKMBGAAEZGHOC-RWMBFGLXSA-N 0.000 description 1
- CRCCTGPNZUCAHE-DCAQKATOSA-N Arg-His-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CN=CN1 CRCCTGPNZUCAHE-DCAQKATOSA-N 0.000 description 1
- CVKOQHYVDVYJSI-QTKMDUPCSA-N Arg-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N)O CVKOQHYVDVYJSI-QTKMDUPCSA-N 0.000 description 1
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 1
- FLYANDHDFRGGTM-PYJNHQTQSA-N Arg-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FLYANDHDFRGGTM-PYJNHQTQSA-N 0.000 description 1
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 1
- OKKMBOSPBDASEP-CYDGBPFRSA-N Arg-Ile-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O OKKMBOSPBDASEP-CYDGBPFRSA-N 0.000 description 1
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 1
- XUGATJVGQUGQKY-ULQDDVLXSA-N Arg-Lys-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XUGATJVGQUGQKY-ULQDDVLXSA-N 0.000 description 1
- QBQVKUNBCAFXSV-ULQDDVLXSA-N Arg-Lys-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QBQVKUNBCAFXSV-ULQDDVLXSA-N 0.000 description 1
- PSOPJDUQUVFSLS-GUBZILKMSA-N Arg-Met-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N PSOPJDUQUVFSLS-GUBZILKMSA-N 0.000 description 1
- HIMXTOIXVXWHTB-DCAQKATOSA-N Arg-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N HIMXTOIXVXWHTB-DCAQKATOSA-N 0.000 description 1
- OISWSORSLQOGFV-AVGNSLFASA-N Arg-Met-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCCN=C(N)N OISWSORSLQOGFV-AVGNSLFASA-N 0.000 description 1
- KSUALAGYYLQSHJ-RCWTZXSCSA-N Arg-Met-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KSUALAGYYLQSHJ-RCWTZXSCSA-N 0.000 description 1
- CZUHPNLXLWMYMG-UBHSHLNASA-N Arg-Phe-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 CZUHPNLXLWMYMG-UBHSHLNASA-N 0.000 description 1
- YTMKMRSYXHBGER-IHRRRGAJSA-N Arg-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YTMKMRSYXHBGER-IHRRRGAJSA-N 0.000 description 1
- NIELFHOLFTUZME-HJWJTTGWSA-N Arg-Phe-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NIELFHOLFTUZME-HJWJTTGWSA-N 0.000 description 1
- RATVAFHGEFAWDH-JYJNAYRXSA-N Arg-Phe-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCCN=C(N)N)N RATVAFHGEFAWDH-JYJNAYRXSA-N 0.000 description 1
- SLQQPJBDBVPVQV-JYJNAYRXSA-N Arg-Phe-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O SLQQPJBDBVPVQV-JYJNAYRXSA-N 0.000 description 1
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 1
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 description 1
- ATABBWFGOHKROJ-GUBZILKMSA-N Arg-Pro-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O ATABBWFGOHKROJ-GUBZILKMSA-N 0.000 description 1
- AUIJUTGLPVHIRT-FXQIFTODSA-N Arg-Ser-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N AUIJUTGLPVHIRT-FXQIFTODSA-N 0.000 description 1
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 1
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 1
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 1
- DRDWXKWUSIKKOB-PJODQICGSA-N Arg-Trp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O DRDWXKWUSIKKOB-PJODQICGSA-N 0.000 description 1
- NVPHRWNWTKYIST-BPNCWPANSA-N Arg-Tyr-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 NVPHRWNWTKYIST-BPNCWPANSA-N 0.000 description 1
- BWMMKQPATDUYKB-IHRRRGAJSA-N Arg-Tyr-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=C(O)C=C1 BWMMKQPATDUYKB-IHRRRGAJSA-N 0.000 description 1
- QHUOOCKNNURZSL-IHRRRGAJSA-N Arg-Tyr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O QHUOOCKNNURZSL-IHRRRGAJSA-N 0.000 description 1
- PSUXEQYPYZLNER-QXEWZRGKSA-N Arg-Val-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PSUXEQYPYZLNER-QXEWZRGKSA-N 0.000 description 1
- QTAIIXQCOPUNBQ-QXEWZRGKSA-N Arg-Val-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QTAIIXQCOPUNBQ-QXEWZRGKSA-N 0.000 description 1
- LLQIAIUAKGNOSE-NHCYSSNCSA-N Arg-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N LLQIAIUAKGNOSE-NHCYSSNCSA-N 0.000 description 1
- FMYQECOAIFGQGU-CYDGBPFRSA-N Arg-Val-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMYQECOAIFGQGU-CYDGBPFRSA-N 0.000 description 1
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 1
- RZVVKNIACROXRM-ZLUOBGJFSA-N Asn-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N RZVVKNIACROXRM-ZLUOBGJFSA-N 0.000 description 1
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 1
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 1
- NPDLYUOYAGBHFB-WDSKDSINSA-N Asn-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NPDLYUOYAGBHFB-WDSKDSINSA-N 0.000 description 1
- XHFXZQHTLJVZBN-FXQIFTODSA-N Asn-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N XHFXZQHTLJVZBN-FXQIFTODSA-N 0.000 description 1
- YJRORCOAFUZVKA-FXQIFTODSA-N Asn-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N YJRORCOAFUZVKA-FXQIFTODSA-N 0.000 description 1
- MEFGKQUUYZOLHM-GMOBBJLQSA-N Asn-Arg-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MEFGKQUUYZOLHM-GMOBBJLQSA-N 0.000 description 1
- MFFOYNGMOYFPBD-DCAQKATOSA-N Asn-Arg-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MFFOYNGMOYFPBD-DCAQKATOSA-N 0.000 description 1
- PTNFNTOBUDWHNZ-GUBZILKMSA-N Asn-Arg-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O PTNFNTOBUDWHNZ-GUBZILKMSA-N 0.000 description 1
- HUZGPXBILPMCHM-IHRRRGAJSA-N Asn-Arg-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HUZGPXBILPMCHM-IHRRRGAJSA-N 0.000 description 1
- DNYRZPOWBTYFAF-IHRRRGAJSA-N Asn-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)O DNYRZPOWBTYFAF-IHRRRGAJSA-N 0.000 description 1
- JEPNYDRDYNSFIU-QXEWZRGKSA-N Asn-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(N)=O)C(O)=O JEPNYDRDYNSFIU-QXEWZRGKSA-N 0.000 description 1
- KSBHCUSPLWRVEK-ZLUOBGJFSA-N Asn-Asn-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KSBHCUSPLWRVEK-ZLUOBGJFSA-N 0.000 description 1
- DXZNJWFECGJCQR-FXQIFTODSA-N Asn-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N DXZNJWFECGJCQR-FXQIFTODSA-N 0.000 description 1
- NLCDVZJDEXIDDL-BIIVOSGPSA-N Asn-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O NLCDVZJDEXIDDL-BIIVOSGPSA-N 0.000 description 1
- KXEGPPNPXOKKHK-ZLUOBGJFSA-N Asn-Asp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KXEGPPNPXOKKHK-ZLUOBGJFSA-N 0.000 description 1
- CUQUEHYSSFETRD-ACZMJKKPSA-N Asn-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N CUQUEHYSSFETRD-ACZMJKKPSA-N 0.000 description 1
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 1
- VYLVOMUVLMGCRF-ZLUOBGJFSA-N Asn-Asp-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VYLVOMUVLMGCRF-ZLUOBGJFSA-N 0.000 description 1
- XQQVCUIBGYFKDC-OLHMAJIHSA-N Asn-Asp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XQQVCUIBGYFKDC-OLHMAJIHSA-N 0.000 description 1
- VWJFQGXPYOPXJH-ZLUOBGJFSA-N Asn-Cys-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)C(=O)N VWJFQGXPYOPXJH-ZLUOBGJFSA-N 0.000 description 1
- DHVMIHWNDBFTHB-FXQIFTODSA-N Asn-Cys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N DHVMIHWNDBFTHB-FXQIFTODSA-N 0.000 description 1
- QRHYAUYXBVVDSB-LKXGYXEUSA-N Asn-Cys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QRHYAUYXBVVDSB-LKXGYXEUSA-N 0.000 description 1
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 1
- QPTAGIPWARILES-AVGNSLFASA-N Asn-Gln-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QPTAGIPWARILES-AVGNSLFASA-N 0.000 description 1
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 1
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 1
- GNKVBRYFXYWXAB-WDSKDSINSA-N Asn-Glu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O GNKVBRYFXYWXAB-WDSKDSINSA-N 0.000 description 1
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 1
- KLKHFFMNGWULBN-VKHMYHEASA-N Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)NCC(O)=O KLKHFFMNGWULBN-VKHMYHEASA-N 0.000 description 1
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 1
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 1
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 1
- QEQVUHQQYDZUEN-GUBZILKMSA-N Asn-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N QEQVUHQQYDZUEN-GUBZILKMSA-N 0.000 description 1
- SXNJBDYEBOUYOJ-DCAQKATOSA-N Asn-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)N)N SXNJBDYEBOUYOJ-DCAQKATOSA-N 0.000 description 1
- UYXXMIZGHYKYAT-NHCYSSNCSA-N Asn-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)N)N UYXXMIZGHYKYAT-NHCYSSNCSA-N 0.000 description 1
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 1
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 1
- YYSYDIYQTUPNQQ-SXTJYALSSA-N Asn-Ile-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YYSYDIYQTUPNQQ-SXTJYALSSA-N 0.000 description 1
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 1
- LVHMEJJWEXBMKK-GMOBBJLQSA-N Asn-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N LVHMEJJWEXBMKK-GMOBBJLQSA-N 0.000 description 1
- LTZIRYMWOJHRCH-GUDRVLHUSA-N Asn-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N LTZIRYMWOJHRCH-GUDRVLHUSA-N 0.000 description 1
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 1
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 1
- TZFQICWZWFNIKU-KKUMJFAQSA-N Asn-Leu-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 TZFQICWZWFNIKU-KKUMJFAQSA-N 0.000 description 1
- KHCNTVRVAYCPQE-CIUDSAMLSA-N Asn-Lys-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O KHCNTVRVAYCPQE-CIUDSAMLSA-N 0.000 description 1
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 1
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 1
- COWITDLVHMZSIW-CIUDSAMLSA-N Asn-Lys-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O COWITDLVHMZSIW-CIUDSAMLSA-N 0.000 description 1
- NTWOPSIUJBMNRI-KKUMJFAQSA-N Asn-Lys-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTWOPSIUJBMNRI-KKUMJFAQSA-N 0.000 description 1
- HMUKKNAMNSXDBB-CIUDSAMLSA-N Asn-Met-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HMUKKNAMNSXDBB-CIUDSAMLSA-N 0.000 description 1
- QGABLMITFKUQDF-DCAQKATOSA-N Asn-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N QGABLMITFKUQDF-DCAQKATOSA-N 0.000 description 1
- LSJQOMAZIKQMTJ-SRVKXCTJSA-N Asn-Phe-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LSJQOMAZIKQMTJ-SRVKXCTJSA-N 0.000 description 1
- RVHGJNGNKGDCPX-KKUMJFAQSA-N Asn-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N RVHGJNGNKGDCPX-KKUMJFAQSA-N 0.000 description 1
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 1
- QXOPPIDJKPEKCW-GUBZILKMSA-N Asn-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O QXOPPIDJKPEKCW-GUBZILKMSA-N 0.000 description 1
- OSZBYGVKAFZWKC-FXQIFTODSA-N Asn-Pro-Cys Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(O)=O OSZBYGVKAFZWKC-FXQIFTODSA-N 0.000 description 1
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 1
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 1
- AWXDRZJQCVHCIT-DCAQKATOSA-N Asn-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O AWXDRZJQCVHCIT-DCAQKATOSA-N 0.000 description 1
- UWFOMGUWGPRVBW-GUBZILKMSA-N Asn-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)N)N UWFOMGUWGPRVBW-GUBZILKMSA-N 0.000 description 1
- VHQSGALUSWIYOD-QXEWZRGKSA-N Asn-Pro-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O VHQSGALUSWIYOD-QXEWZRGKSA-N 0.000 description 1
- VWADICJNCPFKJS-ZLUOBGJFSA-N Asn-Ser-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O VWADICJNCPFKJS-ZLUOBGJFSA-N 0.000 description 1
- KYQJHBWHRASMKG-ZLUOBGJFSA-N Asn-Ser-Cys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O KYQJHBWHRASMKG-ZLUOBGJFSA-N 0.000 description 1
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 1
- DOURAOODTFJRIC-CIUDSAMLSA-N Asn-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N DOURAOODTFJRIC-CIUDSAMLSA-N 0.000 description 1
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 1
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 1
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 1
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 1
- QYRMBFWDSFGSFC-OLHMAJIHSA-N Asn-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QYRMBFWDSFGSFC-OLHMAJIHSA-N 0.000 description 1
- ZUFPUBYQYWCMDB-NUMRIWBASA-N Asn-Thr-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZUFPUBYQYWCMDB-NUMRIWBASA-N 0.000 description 1
- KZYSHAMXEBPJBD-JRQIVUDYSA-N Asn-Thr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZYSHAMXEBPJBD-JRQIVUDYSA-N 0.000 description 1
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 1
- JPPLRQVZMZFOSX-UWJYBYFXSA-N Asn-Tyr-Ala Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 JPPLRQVZMZFOSX-UWJYBYFXSA-N 0.000 description 1
- YNQMEIJEWSHOEO-SRVKXCTJSA-N Asn-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O YNQMEIJEWSHOEO-SRVKXCTJSA-N 0.000 description 1
- NSTBNYOKCZKOMI-AVGNSLFASA-N Asn-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O NSTBNYOKCZKOMI-AVGNSLFASA-N 0.000 description 1
- XEGZSHSPQNDNRH-JRQIVUDYSA-N Asn-Tyr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XEGZSHSPQNDNRH-JRQIVUDYSA-N 0.000 description 1
- XLDMSQYOYXINSZ-QXEWZRGKSA-N Asn-Val-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XLDMSQYOYXINSZ-QXEWZRGKSA-N 0.000 description 1
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 description 1
- ZAESWDKAMDVHLL-RCOVLWMOSA-N Asn-Val-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O ZAESWDKAMDVHLL-RCOVLWMOSA-N 0.000 description 1
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 1
- DBLPNHGKMDHWNZ-UHFFFAOYSA-N Asp Gly Arg Asn Chemical compound OC(=O)CC(N)C(=O)NCC(=O)NC(CCCN=C(N)N)C(=O)NC(CC(N)=O)C(O)=O DBLPNHGKMDHWNZ-UHFFFAOYSA-N 0.000 description 1
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 1
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 1
- CXBOKJPLEYUPGB-FXQIFTODSA-N Asp-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N CXBOKJPLEYUPGB-FXQIFTODSA-N 0.000 description 1
- NECWUSYTYSIFNC-DLOVCJGASA-N Asp-Ala-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 NECWUSYTYSIFNC-DLOVCJGASA-N 0.000 description 1
- BLQBMRNMBAYREH-UWJYBYFXSA-N Asp-Ala-Tyr Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O BLQBMRNMBAYREH-UWJYBYFXSA-N 0.000 description 1
- OERMIMJQPQUIPK-FXQIFTODSA-N Asp-Arg-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O OERMIMJQPQUIPK-FXQIFTODSA-N 0.000 description 1
- SOYOSFXLXYZNRG-CIUDSAMLSA-N Asp-Arg-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O SOYOSFXLXYZNRG-CIUDSAMLSA-N 0.000 description 1
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 description 1
- AXXCUABIFZPKPM-BQBZGAKWSA-N Asp-Arg-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O AXXCUABIFZPKPM-BQBZGAKWSA-N 0.000 description 1
- MFMJRYHVLLEMQM-DCAQKATOSA-N Asp-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N MFMJRYHVLLEMQM-DCAQKATOSA-N 0.000 description 1
- HMQDRBKQMLRCCG-GMOBBJLQSA-N Asp-Arg-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HMQDRBKQMLRCCG-GMOBBJLQSA-N 0.000 description 1
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 1
- SDHFVYLZFBDSQT-DCAQKATOSA-N Asp-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N SDHFVYLZFBDSQT-DCAQKATOSA-N 0.000 description 1
- MRQQMVZUHXUPEV-IHRRRGAJSA-N Asp-Arg-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MRQQMVZUHXUPEV-IHRRRGAJSA-N 0.000 description 1
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 description 1
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 1
- UQBGYPFHWFZMCD-ZLUOBGJFSA-N Asp-Asn-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UQBGYPFHWFZMCD-ZLUOBGJFSA-N 0.000 description 1
- QRULNKJGYQQZMW-ZLUOBGJFSA-N Asp-Asn-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QRULNKJGYQQZMW-ZLUOBGJFSA-N 0.000 description 1
- MUWDILPCTSMUHI-ZLUOBGJFSA-N Asp-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)O MUWDILPCTSMUHI-ZLUOBGJFSA-N 0.000 description 1
- ZELQAFZSJOBEQS-ACZMJKKPSA-N Asp-Asn-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZELQAFZSJOBEQS-ACZMJKKPSA-N 0.000 description 1
- JDHOJQJMWBKHDB-CIUDSAMLSA-N Asp-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N JDHOJQJMWBKHDB-CIUDSAMLSA-N 0.000 description 1
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 1
- FRSGNOZCTWDVFZ-ACZMJKKPSA-N Asp-Asp-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRSGNOZCTWDVFZ-ACZMJKKPSA-N 0.000 description 1
- VZNOVQKGJQJOCS-SRVKXCTJSA-N Asp-Asp-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VZNOVQKGJQJOCS-SRVKXCTJSA-N 0.000 description 1
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 1
- VHQOCWWKXIOAQI-WDSKDSINSA-N Asp-Gln-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VHQOCWWKXIOAQI-WDSKDSINSA-N 0.000 description 1
- HRGGPWBIMIQANI-GUBZILKMSA-N Asp-Gln-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HRGGPWBIMIQANI-GUBZILKMSA-N 0.000 description 1
- CSEJMKNZDCJYGJ-XHNCKOQMSA-N Asp-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O CSEJMKNZDCJYGJ-XHNCKOQMSA-N 0.000 description 1
- ZSJFGGSPCCHMNE-LAEOZQHASA-N Asp-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N ZSJFGGSPCCHMNE-LAEOZQHASA-N 0.000 description 1
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 1
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 1
- LTXGDRFJRZSZAV-CIUDSAMLSA-N Asp-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N LTXGDRFJRZSZAV-CIUDSAMLSA-N 0.000 description 1
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 1
- GISFCCXBVJKGEO-QEJZJMRPSA-N Asp-Glu-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O GISFCCXBVJKGEO-QEJZJMRPSA-N 0.000 description 1
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 1
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- RQYMKRMRZWJGHC-BQBZGAKWSA-N Asp-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N RQYMKRMRZWJGHC-BQBZGAKWSA-N 0.000 description 1
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 1
- LNENWJXDHCFVOF-DCAQKATOSA-N Asp-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N LNENWJXDHCFVOF-DCAQKATOSA-N 0.000 description 1
- YRBGRUOSJROZEI-NHCYSSNCSA-N Asp-His-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O YRBGRUOSJROZEI-NHCYSSNCSA-N 0.000 description 1
- LBFYTUPYYZENIR-GHCJXIJMSA-N Asp-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N LBFYTUPYYZENIR-GHCJXIJMSA-N 0.000 description 1
- MFTVXYMXSAQZNL-DJFWLOJKSA-N Asp-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)O)N MFTVXYMXSAQZNL-DJFWLOJKSA-N 0.000 description 1
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 1
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 1
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 1
- XLILXFRAKOYEJX-GUBZILKMSA-N Asp-Leu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLILXFRAKOYEJX-GUBZILKMSA-N 0.000 description 1
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 1
- HKEZZWQWXWGASX-KKUMJFAQSA-N Asp-Leu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HKEZZWQWXWGASX-KKUMJFAQSA-N 0.000 description 1
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 1
- UZFHNLYQWMGUHU-DCAQKATOSA-N Asp-Lys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZFHNLYQWMGUHU-DCAQKATOSA-N 0.000 description 1
- YVHGKXAOSVBGJV-CIUDSAMLSA-N Asp-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N YVHGKXAOSVBGJV-CIUDSAMLSA-N 0.000 description 1
- FQHBAQLBIXLWAG-DCAQKATOSA-N Asp-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N FQHBAQLBIXLWAG-DCAQKATOSA-N 0.000 description 1
- NZWDWXSWUQCNMG-GARJFASQSA-N Asp-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)C(=O)O NZWDWXSWUQCNMG-GARJFASQSA-N 0.000 description 1
- RXBGWGRSWXOBGK-KKUMJFAQSA-N Asp-Lys-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RXBGWGRSWXOBGK-KKUMJFAQSA-N 0.000 description 1
- YTXCCDCOHIYQFC-GUBZILKMSA-N Asp-Met-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTXCCDCOHIYQFC-GUBZILKMSA-N 0.000 description 1
- VWWAFGHMPWBKEP-GMOBBJLQSA-N Asp-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(=O)O)N VWWAFGHMPWBKEP-GMOBBJLQSA-N 0.000 description 1
- RRUWMFBLFLUZSI-LPEHRKFASA-N Asp-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N RRUWMFBLFLUZSI-LPEHRKFASA-N 0.000 description 1
- GYWQGGUCMDCUJE-DLOVCJGASA-N Asp-Phe-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O GYWQGGUCMDCUJE-DLOVCJGASA-N 0.000 description 1
- JUWISGAGWSDGDH-KKUMJFAQSA-N Asp-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=CC=C1 JUWISGAGWSDGDH-KKUMJFAQSA-N 0.000 description 1
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 1
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 1
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 1
- RVMXMLSYBTXCAV-VEVYYDQMSA-N Asp-Pro-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMXMLSYBTXCAV-VEVYYDQMSA-N 0.000 description 1
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 description 1
- HRVQDZOWMLFAOD-BIIVOSGPSA-N Asp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N)C(=O)O HRVQDZOWMLFAOD-BIIVOSGPSA-N 0.000 description 1
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 1
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 1
- NAAAPCLFJPURAM-HJGDQZAQSA-N Asp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O NAAAPCLFJPURAM-HJGDQZAQSA-N 0.000 description 1
- OYSYWMMZGJSQRB-AVGNSLFASA-N Asp-Tyr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O OYSYWMMZGJSQRB-AVGNSLFASA-N 0.000 description 1
- KNDCWFXCFKSEBM-AVGNSLFASA-N Asp-Tyr-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O KNDCWFXCFKSEBM-AVGNSLFASA-N 0.000 description 1
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 1
- VHUKCUHLFMRHOD-MELADBBJSA-N Asp-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O VHUKCUHLFMRHOD-MELADBBJSA-N 0.000 description 1
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 1
- WAEDSQFVZJUHLI-BYULHYEWSA-N Asp-Val-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WAEDSQFVZJUHLI-BYULHYEWSA-N 0.000 description 1
- OQMGSMNZVHYDTQ-ZKWXMUAHSA-N Asp-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N OQMGSMNZVHYDTQ-ZKWXMUAHSA-N 0.000 description 1
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 1
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 1
- GGBQDSHTXKQSLP-NHCYSSNCSA-N Asp-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N GGBQDSHTXKQSLP-NHCYSSNCSA-N 0.000 description 1
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 1
- 101150097247 CRT1 gene Proteins 0.000 description 1
- 101100505161 Caenorhabditis elegans mel-32 gene Proteins 0.000 description 1
- 101100315624 Caenorhabditis elegans tyr-1 gene Proteins 0.000 description 1
- 241001325302 Chitinophaga pinensis Species 0.000 description 1
- 241001643775 Chloroflexus aggregans Species 0.000 description 1
- 241000192731 Chloroflexus aurantiacus Species 0.000 description 1
- 241000588923 Citrobacter Species 0.000 description 1
- 101100135918 Citrobacter freundii pduG gene Proteins 0.000 description 1
- 101100506213 Clostridioides difficile hadI gene Proteins 0.000 description 1
- 101100385313 Clostridium acetobutylicum (strain ATCC 824 / DSM 792 / JCM 1419 / LMG 5710 / VKM B-1787) crt gene Proteins 0.000 description 1
- 101100446690 Clostridium sporogenes fldB gene Proteins 0.000 description 1
- 206010010144 Completed suicide Diseases 0.000 description 1
- 229910021591 Copper(I) chloride Inorganic materials 0.000 description 1
- 241000186249 Corynebacterium sp. Species 0.000 description 1
- 241000337023 Corynebacterium thermoaminogenes Species 0.000 description 1
- 241000988642 Cronobacter turicensis Species 0.000 description 1
- TVYMKYUSZSVOAG-ZLUOBGJFSA-N Cys-Ala-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O TVYMKYUSZSVOAG-ZLUOBGJFSA-N 0.000 description 1
- RRIJEABIXPKSGP-FXQIFTODSA-N Cys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CS RRIJEABIXPKSGP-FXQIFTODSA-N 0.000 description 1
- JIVJXVJMOBVCJF-ZLUOBGJFSA-N Cys-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)C(=O)N JIVJXVJMOBVCJF-ZLUOBGJFSA-N 0.000 description 1
- CPTUXCUWQIBZIF-ZLUOBGJFSA-N Cys-Asn-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CPTUXCUWQIBZIF-ZLUOBGJFSA-N 0.000 description 1
- XABFFGOGKOORCG-CIUDSAMLSA-N Cys-Asp-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XABFFGOGKOORCG-CIUDSAMLSA-N 0.000 description 1
- BIVLWXQGXJLGKG-BIIVOSGPSA-N Cys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N)C(=O)O BIVLWXQGXJLGKG-BIIVOSGPSA-N 0.000 description 1
- ASHTVGGFIMESRD-LKXGYXEUSA-N Cys-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N)O ASHTVGGFIMESRD-LKXGYXEUSA-N 0.000 description 1
- GGIHYKLJUIZYGH-ZLUOBGJFSA-N Cys-Cys-Asp Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CS)N)C(=O)O GGIHYKLJUIZYGH-ZLUOBGJFSA-N 0.000 description 1
- MBILEVLLOHJZMG-FXQIFTODSA-N Cys-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N MBILEVLLOHJZMG-FXQIFTODSA-N 0.000 description 1
- UUOYKFNULIOCGJ-GUBZILKMSA-N Cys-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N UUOYKFNULIOCGJ-GUBZILKMSA-N 0.000 description 1
- GCDLPNRHPWBKJJ-WDSKDSINSA-N Cys-Gly-Glu Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GCDLPNRHPWBKJJ-WDSKDSINSA-N 0.000 description 1
- OXOQBEVULIBOSH-ZDLURKLDSA-N Cys-Gly-Thr Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O OXOQBEVULIBOSH-ZDLURKLDSA-N 0.000 description 1
- YKKHFPGOZXQAGK-QWRGUYRKSA-N Cys-Gly-Tyr Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YKKHFPGOZXQAGK-QWRGUYRKSA-N 0.000 description 1
- RRJOQIBQVZDVCW-SRVKXCTJSA-N Cys-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N RRJOQIBQVZDVCW-SRVKXCTJSA-N 0.000 description 1
- XLLSMEFANRROJE-GUBZILKMSA-N Cys-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N XLLSMEFANRROJE-GUBZILKMSA-N 0.000 description 1
- SRIRHERUAMYIOQ-CIUDSAMLSA-N Cys-Leu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SRIRHERUAMYIOQ-CIUDSAMLSA-N 0.000 description 1
- OHLLDUNVMPPUMD-DCAQKATOSA-N Cys-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N OHLLDUNVMPPUMD-DCAQKATOSA-N 0.000 description 1
- NIXHTNJAGGFBAW-CIUDSAMLSA-N Cys-Lys-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N NIXHTNJAGGFBAW-CIUDSAMLSA-N 0.000 description 1
- ZXCAQANTQWBICD-DCAQKATOSA-N Cys-Lys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N ZXCAQANTQWBICD-DCAQKATOSA-N 0.000 description 1
- UIKLEGZPIOXFHJ-DLOVCJGASA-N Cys-Phe-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O UIKLEGZPIOXFHJ-DLOVCJGASA-N 0.000 description 1
- BBQIWFFTTQTNOC-AVGNSLFASA-N Cys-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N BBQIWFFTTQTNOC-AVGNSLFASA-N 0.000 description 1
- GFMJUESGWILPEN-MELADBBJSA-N Cys-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CS)N)C(=O)O GFMJUESGWILPEN-MELADBBJSA-N 0.000 description 1
- SMEYEQDCCBHTEF-FXQIFTODSA-N Cys-Pro-Ala Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O SMEYEQDCCBHTEF-FXQIFTODSA-N 0.000 description 1
- KJJASVYBTKRYSN-FXQIFTODSA-N Cys-Pro-Asp Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CC(=O)O)C(=O)O KJJASVYBTKRYSN-FXQIFTODSA-N 0.000 description 1
- TXCCRYAZQBUCOV-CIUDSAMLSA-N Cys-Pro-Gln Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O TXCCRYAZQBUCOV-CIUDSAMLSA-N 0.000 description 1
- TXGDWPBLUFQODU-XGEHTFHBSA-N Cys-Pro-Thr Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O TXGDWPBLUFQODU-XGEHTFHBSA-N 0.000 description 1
- BCWIFCLVCRAIQK-ZLUOBGJFSA-N Cys-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)O BCWIFCLVCRAIQK-ZLUOBGJFSA-N 0.000 description 1
- SRZZZTMJARUVPI-JBDRJPRFSA-N Cys-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N SRZZZTMJARUVPI-JBDRJPRFSA-N 0.000 description 1
- YNJBLTDKTMKEET-ZLUOBGJFSA-N Cys-Ser-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O YNJBLTDKTMKEET-ZLUOBGJFSA-N 0.000 description 1
- JAHCWGSVNZXHRR-SVSWQMSJSA-N Cys-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CS)N JAHCWGSVNZXHRR-SVSWQMSJSA-N 0.000 description 1
- WTXCNOPZMQRTNN-BWBBJGPYSA-N Cys-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)O WTXCNOPZMQRTNN-BWBBJGPYSA-N 0.000 description 1
- YFKWIIRWHGKSQQ-WFBYXXMGSA-N Cys-Trp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CS)N YFKWIIRWHGKSQQ-WFBYXXMGSA-N 0.000 description 1
- IRDBEBCCTCNXGZ-AVGNSLFASA-N Cys-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N)O IRDBEBCCTCNXGZ-AVGNSLFASA-N 0.000 description 1
- CLEFUAZULXANBU-MELADBBJSA-N Cys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CS)N)C(=O)O CLEFUAZULXANBU-MELADBBJSA-N 0.000 description 1
- 241001338036 Desulfosporosinus meridiei Species 0.000 description 1
- 241000343034 Desulfosporosinus youngiae DSM 17734 Species 0.000 description 1
- 101100533283 Dictyostelium discoideum serp gene Proteins 0.000 description 1
- 108010042407 Endonucleases Proteins 0.000 description 1
- 102000004533 Endonucleases Human genes 0.000 description 1
- 101100350708 Escherichia coli (strain K12) paaF gene Proteins 0.000 description 1
- 229930091371 Fructose Natural products 0.000 description 1
- 239000005715 Fructose Substances 0.000 description 1
- RFSUNEUAIZKAJO-ARQDHWQXSA-N Fructose Chemical compound OC[C@H]1O[C@](O)(CO)[C@@H](O)[C@@H]1O RFSUNEUAIZKAJO-ARQDHWQXSA-N 0.000 description 1
- 241000059370 Fusobacterium necrophorum subsp. funduliforme Fnf 1007 Species 0.000 description 1
- 108010072062 GEKG peptide Proteins 0.000 description 1
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 1
- 101000892220 Geobacillus thermodenitrificans (strain NG80-2) Long-chain-alcohol dehydrogenase 1 Proteins 0.000 description 1
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 1
- NNQHEEQNPQYPGL-FXQIFTODSA-N Gln-Ala-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NNQHEEQNPQYPGL-FXQIFTODSA-N 0.000 description 1
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 1
- LKUWAWGNJYJODH-KBIXCLLPSA-N Gln-Ala-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKUWAWGNJYJODH-KBIXCLLPSA-N 0.000 description 1
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 1
- IGNGBUVODQLMRJ-CIUDSAMLSA-N Gln-Ala-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IGNGBUVODQLMRJ-CIUDSAMLSA-N 0.000 description 1
- XXLBHPPXDUWYAG-XQXXSGGOSA-N Gln-Ala-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XXLBHPPXDUWYAG-XQXXSGGOSA-N 0.000 description 1
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 1
- DTMLKCYOQKZXKZ-HJGDQZAQSA-N Gln-Arg-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DTMLKCYOQKZXKZ-HJGDQZAQSA-N 0.000 description 1
- SOBBAYVQSNXYPQ-ACZMJKKPSA-N Gln-Asn-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SOBBAYVQSNXYPQ-ACZMJKKPSA-N 0.000 description 1
- PHZYLYASFWHLHJ-FXQIFTODSA-N Gln-Asn-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PHZYLYASFWHLHJ-FXQIFTODSA-N 0.000 description 1
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 1
- KWLMLNHADZIJIS-CIUDSAMLSA-N Gln-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N KWLMLNHADZIJIS-CIUDSAMLSA-N 0.000 description 1
- RMOCFPBLHAOTDU-ACZMJKKPSA-N Gln-Asn-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RMOCFPBLHAOTDU-ACZMJKKPSA-N 0.000 description 1
- CYTSBCIIEHUPDU-ACZMJKKPSA-N Gln-Asp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O CYTSBCIIEHUPDU-ACZMJKKPSA-N 0.000 description 1
- QYTKAVBFRUGYAU-ACZMJKKPSA-N Gln-Asp-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QYTKAVBFRUGYAU-ACZMJKKPSA-N 0.000 description 1
- IKDOHQHEFPPGJG-FXQIFTODSA-N Gln-Asp-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IKDOHQHEFPPGJG-FXQIFTODSA-N 0.000 description 1
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 1
- FJAYYNIXQNERSO-ACZMJKKPSA-N Gln-Cys-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FJAYYNIXQNERSO-ACZMJKKPSA-N 0.000 description 1
- QYKBTDOAMKORGL-FXQIFTODSA-N Gln-Gln-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N QYKBTDOAMKORGL-FXQIFTODSA-N 0.000 description 1
- RRBLZNIIMHSHQF-FXQIFTODSA-N Gln-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N RRBLZNIIMHSHQF-FXQIFTODSA-N 0.000 description 1
- AJDMYLOISOCHHC-YVNDNENWSA-N Gln-Gln-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AJDMYLOISOCHHC-YVNDNENWSA-N 0.000 description 1
- IVCOYUURLWQDJQ-LPEHRKFASA-N Gln-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O IVCOYUURLWQDJQ-LPEHRKFASA-N 0.000 description 1
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 1
- MAGNEQBFSBREJL-DCAQKATOSA-N Gln-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N MAGNEQBFSBREJL-DCAQKATOSA-N 0.000 description 1
- PXAFHUATEHLECW-GUBZILKMSA-N Gln-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N PXAFHUATEHLECW-GUBZILKMSA-N 0.000 description 1
- LFIVHGMKWFGUGK-IHRRRGAJSA-N Gln-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N LFIVHGMKWFGUGK-IHRRRGAJSA-N 0.000 description 1
- JHPFPROFOAJRFN-IHRRRGAJSA-N Gln-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O JHPFPROFOAJRFN-IHRRRGAJSA-N 0.000 description 1
- NSNUZSPSADIMJQ-WDSKDSINSA-N Gln-Gly-Asp Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NSNUZSPSADIMJQ-WDSKDSINSA-N 0.000 description 1
- HVQCEQTUSWWFOS-WDSKDSINSA-N Gln-Gly-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N HVQCEQTUSWWFOS-WDSKDSINSA-N 0.000 description 1
- LVSYIKGMLRHKME-IUCAKERBSA-N Gln-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N LVSYIKGMLRHKME-IUCAKERBSA-N 0.000 description 1
- XSBGUANSZDGULP-IUCAKERBSA-N Gln-Gly-Lys Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O XSBGUANSZDGULP-IUCAKERBSA-N 0.000 description 1
- HDUDGCZEOZEFOA-KBIXCLLPSA-N Gln-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HDUDGCZEOZEFOA-KBIXCLLPSA-N 0.000 description 1
- YRWWJCDWLVXTHN-LAEOZQHASA-N Gln-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N YRWWJCDWLVXTHN-LAEOZQHASA-N 0.000 description 1
- ITZWDGBYBPUZRG-KBIXCLLPSA-N Gln-Ile-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O ITZWDGBYBPUZRG-KBIXCLLPSA-N 0.000 description 1
- ZNTDJIMJKNNSLR-RWRJDSDZSA-N Gln-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZNTDJIMJKNNSLR-RWRJDSDZSA-N 0.000 description 1
- FFVXLVGUJBCKRX-UKJIMTQDSA-N Gln-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N FFVXLVGUJBCKRX-UKJIMTQDSA-N 0.000 description 1
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 1
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 1
- IHSGESFHTMFHRB-GUBZILKMSA-N Gln-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O IHSGESFHTMFHRB-GUBZILKMSA-N 0.000 description 1
- TWIAMTNJOMRDAK-GUBZILKMSA-N Gln-Lys-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O TWIAMTNJOMRDAK-GUBZILKMSA-N 0.000 description 1
- UWKPRVKWEKEMSY-DCAQKATOSA-N Gln-Lys-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWKPRVKWEKEMSY-DCAQKATOSA-N 0.000 description 1
- JRHPEMVLTRADLJ-AVGNSLFASA-N Gln-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JRHPEMVLTRADLJ-AVGNSLFASA-N 0.000 description 1
- XZLLTYBONVKGLO-SDDRHHMPSA-N Gln-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XZLLTYBONVKGLO-SDDRHHMPSA-N 0.000 description 1
- CELXWPDNIGWCJN-WDCWCFNPSA-N Gln-Lys-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CELXWPDNIGWCJN-WDCWCFNPSA-N 0.000 description 1
- QKWBEMCLYTYBNI-GVXVVHGQSA-N Gln-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O QKWBEMCLYTYBNI-GVXVVHGQSA-N 0.000 description 1
- KLKYKPXITJBSNI-CIUDSAMLSA-N Gln-Met-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O KLKYKPXITJBSNI-CIUDSAMLSA-N 0.000 description 1
- LUGUNEGJNDEBLU-DCAQKATOSA-N Gln-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LUGUNEGJNDEBLU-DCAQKATOSA-N 0.000 description 1
- ROHVCXBMIAAASL-HJGDQZAQSA-N Gln-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)N)N)O ROHVCXBMIAAASL-HJGDQZAQSA-N 0.000 description 1
- SWDSRANUCKNBLA-AVGNSLFASA-N Gln-Phe-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SWDSRANUCKNBLA-AVGNSLFASA-N 0.000 description 1
- BZULIEARJFRINC-IHRRRGAJSA-N Gln-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N BZULIEARJFRINC-IHRRRGAJSA-N 0.000 description 1
- QBEWLBKBGXVVPD-RYUDHWBXSA-N Gln-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N QBEWLBKBGXVVPD-RYUDHWBXSA-N 0.000 description 1
- JUUNNOLZGVYCJT-JYJNAYRXSA-N Gln-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JUUNNOLZGVYCJT-JYJNAYRXSA-N 0.000 description 1
- DBNLXHGDGBUCDV-KKUMJFAQSA-N Gln-Phe-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DBNLXHGDGBUCDV-KKUMJFAQSA-N 0.000 description 1
- DOQUICBEISTQHE-CIUDSAMLSA-N Gln-Pro-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O DOQUICBEISTQHE-CIUDSAMLSA-N 0.000 description 1
- PBYFVIQRFLNQCO-GUBZILKMSA-N Gln-Pro-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O PBYFVIQRFLNQCO-GUBZILKMSA-N 0.000 description 1
- OREPWMPAUWIIAM-ZPFDUUQYSA-N Gln-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N OREPWMPAUWIIAM-ZPFDUUQYSA-N 0.000 description 1
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 description 1
- LGWNISYVKDNJRP-FXQIFTODSA-N Gln-Ser-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGWNISYVKDNJRP-FXQIFTODSA-N 0.000 description 1
- KVQOVQVGVKDZNW-GUBZILKMSA-N Gln-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N KVQOVQVGVKDZNW-GUBZILKMSA-N 0.000 description 1
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 1
- BYKZWDGMJLNFJY-XKBZYTNZSA-N Gln-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)O BYKZWDGMJLNFJY-XKBZYTNZSA-N 0.000 description 1
- UXXIVIQGOODKQC-NUMRIWBASA-N Gln-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UXXIVIQGOODKQC-NUMRIWBASA-N 0.000 description 1
- VOUSELYGTNGEPB-NUMRIWBASA-N Gln-Thr-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O VOUSELYGTNGEPB-NUMRIWBASA-N 0.000 description 1
- VLOLPWWCNKWRNB-LOKLDPHHSA-N Gln-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VLOLPWWCNKWRNB-LOKLDPHHSA-N 0.000 description 1
- STHSGOZLFLFGSS-SUSMZKCASA-N Gln-Thr-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STHSGOZLFLFGSS-SUSMZKCASA-N 0.000 description 1
- IIMZHVKZBGSEKZ-SZMVWBNQSA-N Gln-Trp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O IIMZHVKZBGSEKZ-SZMVWBNQSA-N 0.000 description 1
- GTBXHETZPUURJE-KKUMJFAQSA-N Gln-Tyr-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GTBXHETZPUURJE-KKUMJFAQSA-N 0.000 description 1
- WIMVKDYAKRAUCG-IHRRRGAJSA-N Gln-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O WIMVKDYAKRAUCG-IHRRRGAJSA-N 0.000 description 1
- ZMXZGYLINVNTKH-DZKIICNBSA-N Gln-Val-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZMXZGYLINVNTKH-DZKIICNBSA-N 0.000 description 1
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 1
- JZDHUJAFXGNDSB-WHFBIAKZSA-N Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O JZDHUJAFXGNDSB-WHFBIAKZSA-N 0.000 description 1
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 1
- HUWSBFYAGXCXKC-CIUDSAMLSA-N Glu-Ala-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O HUWSBFYAGXCXKC-CIUDSAMLSA-N 0.000 description 1
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 1
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 1
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 1
- KBKGRMNVKPSQIF-XDTLVQLUSA-N Glu-Ala-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KBKGRMNVKPSQIF-XDTLVQLUSA-N 0.000 description 1
- DIXKFOPPGWKZLY-CIUDSAMLSA-N Glu-Arg-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O DIXKFOPPGWKZLY-CIUDSAMLSA-N 0.000 description 1
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 1
- LTUVYLVIZHJCOQ-KKUMJFAQSA-N Glu-Arg-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LTUVYLVIZHJCOQ-KKUMJFAQSA-N 0.000 description 1
- VPKBCVUDBNINAH-GARJFASQSA-N Glu-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VPKBCVUDBNINAH-GARJFASQSA-N 0.000 description 1
- SRZLHYPAOXBBSB-HJGDQZAQSA-N Glu-Arg-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SRZLHYPAOXBBSB-HJGDQZAQSA-N 0.000 description 1
- GCYFUZJHAXJKKE-KKUMJFAQSA-N Glu-Arg-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GCYFUZJHAXJKKE-KKUMJFAQSA-N 0.000 description 1
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 1
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 1
- MLCPTRRNICEKIS-FXQIFTODSA-N Glu-Asn-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLCPTRRNICEKIS-FXQIFTODSA-N 0.000 description 1
- YYOBUPFZLKQUAX-FXQIFTODSA-N Glu-Asn-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YYOBUPFZLKQUAX-FXQIFTODSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- SVZIKUHLRKVZIF-GUBZILKMSA-N Glu-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N SVZIKUHLRKVZIF-GUBZILKMSA-N 0.000 description 1
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 description 1
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 1
- VAIWPXWHWAPYDF-FXQIFTODSA-N Glu-Asp-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O VAIWPXWHWAPYDF-FXQIFTODSA-N 0.000 description 1
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 1
- CYHBMLHCQXXCCT-AVGNSLFASA-N Glu-Asp-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CYHBMLHCQXXCCT-AVGNSLFASA-N 0.000 description 1
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 1
- OBIHEDRRSMRKLU-ACZMJKKPSA-N Glu-Cys-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OBIHEDRRSMRKLU-ACZMJKKPSA-N 0.000 description 1
- ZZIFPJZQHRJERU-WDSKDSINSA-N Glu-Cys-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O ZZIFPJZQHRJERU-WDSKDSINSA-N 0.000 description 1
- KVBPDJIFRQUQFY-ACZMJKKPSA-N Glu-Cys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O KVBPDJIFRQUQFY-ACZMJKKPSA-N 0.000 description 1
- CLROYXHHUZELFX-FXQIFTODSA-N Glu-Gln-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CLROYXHHUZELFX-FXQIFTODSA-N 0.000 description 1
- PXHABOCPJVTGEK-BQBZGAKWSA-N Glu-Gln-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O PXHABOCPJVTGEK-BQBZGAKWSA-N 0.000 description 1
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 1
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 1
- YLJHCWNDBKKOEB-IHRRRGAJSA-N Glu-Glu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YLJHCWNDBKKOEB-IHRRRGAJSA-N 0.000 description 1
- KUTPGXNAAOQSPD-LPEHRKFASA-N Glu-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KUTPGXNAAOQSPD-LPEHRKFASA-N 0.000 description 1
- BUAKRRKDHSSIKK-IHRRRGAJSA-N Glu-Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BUAKRRKDHSSIKK-IHRRRGAJSA-N 0.000 description 1
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 1
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 1
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 1
- DVLZZEPUNFEUBW-AVGNSLFASA-N Glu-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N DVLZZEPUNFEUBW-AVGNSLFASA-N 0.000 description 1
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 description 1
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 1
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 1
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 1
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 1
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 1
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 1
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 1
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 1
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 1
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 1
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 1
- ZGEJRLJEAMPEDV-SRVKXCTJSA-N Glu-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N ZGEJRLJEAMPEDV-SRVKXCTJSA-N 0.000 description 1
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 1
- FMBWLLMUPXTXFC-SDDRHHMPSA-N Glu-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N)C(=O)O FMBWLLMUPXTXFC-SDDRHHMPSA-N 0.000 description 1
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 1
- CBEUFCJRFNZMCU-SRVKXCTJSA-N Glu-Met-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O CBEUFCJRFNZMCU-SRVKXCTJSA-N 0.000 description 1
- HQOGXFLBAKJUMH-CIUDSAMLSA-N Glu-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N HQOGXFLBAKJUMH-CIUDSAMLSA-N 0.000 description 1
- YHOJJFFTSMWVGR-HJGDQZAQSA-N Glu-Met-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YHOJJFFTSMWVGR-HJGDQZAQSA-N 0.000 description 1
- GMAGZGCAYLQBKF-NHCYSSNCSA-N Glu-Met-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O GMAGZGCAYLQBKF-NHCYSSNCSA-N 0.000 description 1
- YRMZCZIRHYCNHX-RYUDHWBXSA-N Glu-Phe-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O YRMZCZIRHYCNHX-RYUDHWBXSA-N 0.000 description 1
- CBWKURKPYSLMJV-SOUVJXGZSA-N Glu-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CBWKURKPYSLMJV-SOUVJXGZSA-N 0.000 description 1
- ITVBKCZZLJUUHI-HTUGSXCWSA-N Glu-Phe-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ITVBKCZZLJUUHI-HTUGSXCWSA-N 0.000 description 1
- KXTAGESXNQEZKB-DZKIICNBSA-N Glu-Phe-Val Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 KXTAGESXNQEZKB-DZKIICNBSA-N 0.000 description 1
- UDEPRBFQTWGLCW-CIUDSAMLSA-N Glu-Pro-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O UDEPRBFQTWGLCW-CIUDSAMLSA-N 0.000 description 1
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 1
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 1
- WIKMTDVSCUJIPJ-CIUDSAMLSA-N Glu-Ser-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WIKMTDVSCUJIPJ-CIUDSAMLSA-N 0.000 description 1
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 1
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 1
- GUOWMVFLAJNPDY-CIUDSAMLSA-N Glu-Ser-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O GUOWMVFLAJNPDY-CIUDSAMLSA-N 0.000 description 1
- BXSZPACYCMNKLS-AVGNSLFASA-N Glu-Ser-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BXSZPACYCMNKLS-AVGNSLFASA-N 0.000 description 1
- JWNZHMSRZXXGTM-XKBZYTNZSA-N Glu-Ser-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWNZHMSRZXXGTM-XKBZYTNZSA-N 0.000 description 1
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 1
- TWYSSILQABLLME-HJGDQZAQSA-N Glu-Thr-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYSSILQABLLME-HJGDQZAQSA-N 0.000 description 1
- DTLLNDVORUEOTM-WDCWCFNPSA-N Glu-Thr-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DTLLNDVORUEOTM-WDCWCFNPSA-N 0.000 description 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 1
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 1
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 1
- BPCLDCNZBUYGOD-BPUTZDHNSA-N Glu-Trp-Glu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 BPCLDCNZBUYGOD-BPUTZDHNSA-N 0.000 description 1
- ZTNHPMZHAILHRB-JSGCOSHPSA-N Glu-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)NCC(O)=O)=CNC2=C1 ZTNHPMZHAILHRB-JSGCOSHPSA-N 0.000 description 1
- UCZXXMREFIETQW-AVGNSLFASA-N Glu-Tyr-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O UCZXXMREFIETQW-AVGNSLFASA-N 0.000 description 1
- HHSKZJZWQFPSKN-AVGNSLFASA-N Glu-Tyr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O HHSKZJZWQFPSKN-AVGNSLFASA-N 0.000 description 1
- QGAJQIGFFIQJJK-IHRRRGAJSA-N Glu-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QGAJQIGFFIQJJK-IHRRRGAJSA-N 0.000 description 1
- RXJFSLQVMGYQEL-IHRRRGAJSA-N Glu-Tyr-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 RXJFSLQVMGYQEL-IHRRRGAJSA-N 0.000 description 1
- HAGKYCXGTRUUFI-RYUDHWBXSA-N Glu-Tyr-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)O HAGKYCXGTRUUFI-RYUDHWBXSA-N 0.000 description 1
- UUTGYDAKPISJAO-JYJNAYRXSA-N Glu-Tyr-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 UUTGYDAKPISJAO-JYJNAYRXSA-N 0.000 description 1
- QEJKKJNDDDPSMU-KKUMJFAQSA-N Glu-Tyr-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(O)=O QEJKKJNDDDPSMU-KKUMJFAQSA-N 0.000 description 1
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 1
- UZWUBBRJWFTHTD-LAEOZQHASA-N Glu-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O UZWUBBRJWFTHTD-LAEOZQHASA-N 0.000 description 1
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 1
- KCCNSVHJSMMGFS-NRPADANISA-N Glu-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KCCNSVHJSMMGFS-NRPADANISA-N 0.000 description 1
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 1
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 1
- NTNUEBVGKMVANB-NHCYSSNCSA-N Glu-Val-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O NTNUEBVGKMVANB-NHCYSSNCSA-N 0.000 description 1
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 1
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 1
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 1
- GZUKEVBTYNNUQF-WDSKDSINSA-N Gly-Ala-Gln Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GZUKEVBTYNNUQF-WDSKDSINSA-N 0.000 description 1
- PHONXOACARQMPM-BQBZGAKWSA-N Gly-Ala-Met Chemical compound [H]NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O PHONXOACARQMPM-BQBZGAKWSA-N 0.000 description 1
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 1
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 1
- CLODWIOAKCSBAN-BQBZGAKWSA-N Gly-Arg-Asp Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O CLODWIOAKCSBAN-BQBZGAKWSA-N 0.000 description 1
- OGCIHJPYKVSMTE-YUMQZZPRSA-N Gly-Arg-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OGCIHJPYKVSMTE-YUMQZZPRSA-N 0.000 description 1
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 1
- DJTXYXZNNDDEOU-WHFBIAKZSA-N Gly-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)C(=O)N DJTXYXZNNDDEOU-WHFBIAKZSA-N 0.000 description 1
- NZAFOTBEULLEQB-WDSKDSINSA-N Gly-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN NZAFOTBEULLEQB-WDSKDSINSA-N 0.000 description 1
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 1
- DUYYPIRFTLOAJQ-YUMQZZPRSA-N Gly-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN DUYYPIRFTLOAJQ-YUMQZZPRSA-N 0.000 description 1
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 1
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 1
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 1
- IWAXHBCACVWNHT-BQBZGAKWSA-N Gly-Asp-Arg Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IWAXHBCACVWNHT-BQBZGAKWSA-N 0.000 description 1
- XEJTYSCIXKYSHR-WDSKDSINSA-N Gly-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN XEJTYSCIXKYSHR-WDSKDSINSA-N 0.000 description 1
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 1
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 1
- XXGQRGQPGFYECI-WDSKDSINSA-N Gly-Cys-Glu Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCC(O)=O XXGQRGQPGFYECI-WDSKDSINSA-N 0.000 description 1
- UEGIPZAXNBYCCP-NKWVEPMBSA-N Gly-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)CN)C(=O)O UEGIPZAXNBYCCP-NKWVEPMBSA-N 0.000 description 1
- QCTLGOYODITHPQ-WHFBIAKZSA-N Gly-Cys-Ser Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O QCTLGOYODITHPQ-WHFBIAKZSA-N 0.000 description 1
- QPDUVFSVVAOUHE-XVKPBYJWSA-N Gly-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CN)C(O)=O QPDUVFSVVAOUHE-XVKPBYJWSA-N 0.000 description 1
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 1
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 1
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 1
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 1
- CUYLIWAAAYJKJH-RYUDHWBXSA-N Gly-Glu-Tyr Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUYLIWAAAYJKJH-RYUDHWBXSA-N 0.000 description 1
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 1
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 1
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 1
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 1
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 1
- UPADCCSMVOQAGF-LBPRGKRZSA-N Gly-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)CN)C(O)=O)=CNC2=C1 UPADCCSMVOQAGF-LBPRGKRZSA-N 0.000 description 1
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 1
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 1
- FQKKPCWTZZEDIC-XPUUQOCRSA-N Gly-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 FQKKPCWTZZEDIC-XPUUQOCRSA-N 0.000 description 1
- YFGONBOFGGWKKY-VHSXEESVSA-N Gly-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)CN)C(=O)O YFGONBOFGGWKKY-VHSXEESVSA-N 0.000 description 1
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 1
- HKSNHPVETYYJBK-LAEOZQHASA-N Gly-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN HKSNHPVETYYJBK-LAEOZQHASA-N 0.000 description 1
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 1
- ZOTGXWMKUFSKEU-QXEWZRGKSA-N Gly-Ile-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O ZOTGXWMKUFSKEU-QXEWZRGKSA-N 0.000 description 1
- XVYKMNXXJXQKME-XEGUGMAKSA-N Gly-Ile-Tyr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XVYKMNXXJXQKME-XEGUGMAKSA-N 0.000 description 1
- DKEXFJVMVGETOO-LURJTMIESA-N Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CN DKEXFJVMVGETOO-LURJTMIESA-N 0.000 description 1
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 1
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 1
- YIFUFYZELCMPJP-YUMQZZPRSA-N Gly-Leu-Cys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O YIFUFYZELCMPJP-YUMQZZPRSA-N 0.000 description 1
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 1
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 1
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 1
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 1
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 1
- VLIJYPMATZSOLL-YUMQZZPRSA-N Gly-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN VLIJYPMATZSOLL-YUMQZZPRSA-N 0.000 description 1
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 1
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 1
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 1
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 1
- ICUTTWWCDIIIEE-BQBZGAKWSA-N Gly-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN ICUTTWWCDIIIEE-BQBZGAKWSA-N 0.000 description 1
- QLQDIJBYJZKQPR-BQBZGAKWSA-N Gly-Met-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN QLQDIJBYJZKQPR-BQBZGAKWSA-N 0.000 description 1
- LPHQAFLNEHWKFF-QXEWZRGKSA-N Gly-Met-Ile Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LPHQAFLNEHWKFF-QXEWZRGKSA-N 0.000 description 1
- QGDOOCIPHSSADO-STQMWFEESA-N Gly-Met-Phe Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QGDOOCIPHSSADO-STQMWFEESA-N 0.000 description 1
- UWQDKRIZSROAKS-FJXKBIBVSA-N Gly-Met-Thr Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UWQDKRIZSROAKS-FJXKBIBVSA-N 0.000 description 1
- FJWSJWACLMTDMI-WPRPVWTQSA-N Gly-Met-Val Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O FJWSJWACLMTDMI-WPRPVWTQSA-N 0.000 description 1
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 1
- QVDGHDFFYHKJPN-QWRGUYRKSA-N Gly-Phe-Cys Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CS)C(O)=O QVDGHDFFYHKJPN-QWRGUYRKSA-N 0.000 description 1
- YLEIWGJJBFBFHC-KBPBESRZSA-N Gly-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 YLEIWGJJBFBFHC-KBPBESRZSA-N 0.000 description 1
- QAMMIGULQSIRCD-IRXDYDNUSA-N Gly-Phe-Tyr Chemical compound C([C@H](NC(=O)C[NH3+])C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C([O-])=O)C1=CC=CC=C1 QAMMIGULQSIRCD-IRXDYDNUSA-N 0.000 description 1
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 1
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 1
- JJGBXTYGTKWGAT-YUMQZZPRSA-N Gly-Pro-Glu Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O JJGBXTYGTKWGAT-YUMQZZPRSA-N 0.000 description 1
- GLACUWHUYFBSPJ-FJXKBIBVSA-N Gly-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GLACUWHUYFBSPJ-FJXKBIBVSA-N 0.000 description 1
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 1
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 1
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 1
- YABRDIBSPZONIY-BQBZGAKWSA-N Gly-Ser-Met Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O YABRDIBSPZONIY-BQBZGAKWSA-N 0.000 description 1
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 1
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 1
- ZKJZBRHRWKLVSJ-ZDLURKLDSA-N Gly-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)O ZKJZBRHRWKLVSJ-ZDLURKLDSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 1
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 1
- GULGDABMYTYMJZ-STQMWFEESA-N Gly-Trp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O GULGDABMYTYMJZ-STQMWFEESA-N 0.000 description 1
- WTUSRDZLLWGYAT-KCTSRDHCSA-N Gly-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN WTUSRDZLLWGYAT-KCTSRDHCSA-N 0.000 description 1
- ONSARSFSJHTMFJ-STQMWFEESA-N Gly-Trp-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ONSARSFSJHTMFJ-STQMWFEESA-N 0.000 description 1
- UVTSZKIATYSKIR-RYUDHWBXSA-N Gly-Tyr-Glu Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O UVTSZKIATYSKIR-RYUDHWBXSA-N 0.000 description 1
- KBBFOULZCHWGJX-KBPBESRZSA-N Gly-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN)O KBBFOULZCHWGJX-KBPBESRZSA-N 0.000 description 1
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 1
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 1
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 1
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 1
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 1
- COZMNNJEGNPDED-HOCLYGCPSA-N Gly-Val-Trp Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O COZMNNJEGNPDED-HOCLYGCPSA-N 0.000 description 1
- 244000068988 Glycine max Species 0.000 description 1
- 235000010469 Glycine max Nutrition 0.000 description 1
- VPZXBVLAVMBEQI-VKHMYHEASA-N Glycyl-alanine Chemical compound OC(=O)[C@H](C)NC(=O)CN VPZXBVLAVMBEQI-VKHMYHEASA-N 0.000 description 1
- 241000630665 Hada Species 0.000 description 1
- 241001600172 Haliangium ochraceum Species 0.000 description 1
- AWHJQEYGWRKPHE-LSJOCFKGSA-N His-Ala-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AWHJQEYGWRKPHE-LSJOCFKGSA-N 0.000 description 1
- AFPFGFUGETYOSY-HGNGGELXSA-N His-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AFPFGFUGETYOSY-HGNGGELXSA-N 0.000 description 1
- DCRODRAURLJOFY-XPUUQOCRSA-N His-Ala-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)NCC(O)=O DCRODRAURLJOFY-XPUUQOCRSA-N 0.000 description 1
- XINDHUAGVGCNSF-QSFUFRPTSA-N His-Ala-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XINDHUAGVGCNSF-QSFUFRPTSA-N 0.000 description 1
- HTZKFIYQMHJWSQ-INTQDDNPSA-N His-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HTZKFIYQMHJWSQ-INTQDDNPSA-N 0.000 description 1
- DZMVESFTHXSSPZ-XVYDVKMFSA-N His-Ala-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DZMVESFTHXSSPZ-XVYDVKMFSA-N 0.000 description 1
- GMIWMPUGTFQFHK-KCTSRDHCSA-N His-Ala-Trp Chemical compound C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O GMIWMPUGTFQFHK-KCTSRDHCSA-N 0.000 description 1
- HXKZJLWGSWQKEA-LSJOCFKGSA-N His-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CN=CN1 HXKZJLWGSWQKEA-LSJOCFKGSA-N 0.000 description 1
- NIKBMHGRNAPJFW-IUCAKERBSA-N His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 NIKBMHGRNAPJFW-IUCAKERBSA-N 0.000 description 1
- TVQGUFGDVODUIF-LSJOCFKGSA-N His-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N TVQGUFGDVODUIF-LSJOCFKGSA-N 0.000 description 1
- HDXNWVLQSQFJOX-SRVKXCTJSA-N His-Arg-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HDXNWVLQSQFJOX-SRVKXCTJSA-N 0.000 description 1
- ZIMTWPHIKZEHSE-UWVGGRQHSA-N His-Arg-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O ZIMTWPHIKZEHSE-UWVGGRQHSA-N 0.000 description 1
- MWAJSVTZZOUOBU-IHRRRGAJSA-N His-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 MWAJSVTZZOUOBU-IHRRRGAJSA-N 0.000 description 1
- UCDWNBFOZCZSNV-AVGNSLFASA-N His-Arg-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O UCDWNBFOZCZSNV-AVGNSLFASA-N 0.000 description 1
- CJGDTAHEMXLRMB-ULQDDVLXSA-N His-Arg-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CJGDTAHEMXLRMB-ULQDDVLXSA-N 0.000 description 1
- JWTKVPMQCCRPQY-SRVKXCTJSA-N His-Asn-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JWTKVPMQCCRPQY-SRVKXCTJSA-N 0.000 description 1
- OBTMRGFRLJBSFI-GARJFASQSA-N His-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O OBTMRGFRLJBSFI-GARJFASQSA-N 0.000 description 1
- VOEGKUNRHYKYSU-XVYDVKMFSA-N His-Asp-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O VOEGKUNRHYKYSU-XVYDVKMFSA-N 0.000 description 1
- ZJSMFRTVYSLKQU-DJFWLOJKSA-N His-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ZJSMFRTVYSLKQU-DJFWLOJKSA-N 0.000 description 1
- LIEIYPBMQJLASB-SRVKXCTJSA-N His-Gln-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 LIEIYPBMQJLASB-SRVKXCTJSA-N 0.000 description 1
- DVHGLDYMGWTYKW-GUBZILKMSA-N His-Gln-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DVHGLDYMGWTYKW-GUBZILKMSA-N 0.000 description 1
- HIAHVKLTHNOENC-HGNGGELXSA-N His-Glu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HIAHVKLTHNOENC-HGNGGELXSA-N 0.000 description 1
- JCOSMKPAOYDKRO-AVGNSLFASA-N His-Glu-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N JCOSMKPAOYDKRO-AVGNSLFASA-N 0.000 description 1
- ZYDYEPDFFVCUBI-SRVKXCTJSA-N His-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ZYDYEPDFFVCUBI-SRVKXCTJSA-N 0.000 description 1
- FDQYIRHBVVUTJF-ZETCQYMHSA-N His-Gly-Gly Chemical compound [O-]C(=O)CNC(=O)CNC(=O)[C@@H]([NH3+])CC1=CN=CN1 FDQYIRHBVVUTJF-ZETCQYMHSA-N 0.000 description 1
- KAFZDWMZKGQDEE-SRVKXCTJSA-N His-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KAFZDWMZKGQDEE-SRVKXCTJSA-N 0.000 description 1
- FSOXZQBMPBQKGJ-QSFUFRPTSA-N His-Ile-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]([NH3+])CC1=CN=CN1 FSOXZQBMPBQKGJ-QSFUFRPTSA-N 0.000 description 1
- MPXGJGBXCRQQJE-MXAVVETBSA-N His-Ile-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O MPXGJGBXCRQQJE-MXAVVETBSA-N 0.000 description 1
- QMUHTRISZMFKAY-MXAVVETBSA-N His-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N QMUHTRISZMFKAY-MXAVVETBSA-N 0.000 description 1
- IWXMHXYOACDSIA-PYJNHQTQSA-N His-Ile-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O IWXMHXYOACDSIA-PYJNHQTQSA-N 0.000 description 1
- JENKOCSDMSVWPY-SRVKXCTJSA-N His-Leu-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JENKOCSDMSVWPY-SRVKXCTJSA-N 0.000 description 1
- MJUUWJJEUOBDGW-IHRRRGAJSA-N His-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 MJUUWJJEUOBDGW-IHRRRGAJSA-N 0.000 description 1
- LVWIJITYHRZHBO-IXOXFDKPSA-N His-Leu-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LVWIJITYHRZHBO-IXOXFDKPSA-N 0.000 description 1
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 1
- TTYKEFZRLKQTHH-MELADBBJSA-N His-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O TTYKEFZRLKQTHH-MELADBBJSA-N 0.000 description 1
- TVMNTHXFRSXZGR-IHRRRGAJSA-N His-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O TVMNTHXFRSXZGR-IHRRRGAJSA-N 0.000 description 1
- VUUFXXGKMPLKNH-BZSNNMDCSA-N His-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N VUUFXXGKMPLKNH-BZSNNMDCSA-N 0.000 description 1
- GNBHSMFBUNEWCJ-DCAQKATOSA-N His-Pro-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GNBHSMFBUNEWCJ-DCAQKATOSA-N 0.000 description 1
- KAXZXLSXFWSNNZ-XVYDVKMFSA-N His-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KAXZXLSXFWSNNZ-XVYDVKMFSA-N 0.000 description 1
- JMSONHOUHFDOJH-GUBZILKMSA-N His-Ser-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 JMSONHOUHFDOJH-GUBZILKMSA-N 0.000 description 1
- ZHHLTWUOWXHVQJ-YUMQZZPRSA-N His-Ser-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZHHLTWUOWXHVQJ-YUMQZZPRSA-N 0.000 description 1
- CUEQQFOGARVNHU-VGDYDELISA-N His-Ser-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUEQQFOGARVNHU-VGDYDELISA-N 0.000 description 1
- VIJMRAIWYWRXSR-CIUDSAMLSA-N His-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 VIJMRAIWYWRXSR-CIUDSAMLSA-N 0.000 description 1
- DQZCEKQPSOBNMJ-NKIYYHGXSA-N His-Thr-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DQZCEKQPSOBNMJ-NKIYYHGXSA-N 0.000 description 1
- NBWATNYAUVSAEQ-ZEILLAHLSA-N His-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O NBWATNYAUVSAEQ-ZEILLAHLSA-N 0.000 description 1
- RNVUQLOKVIPNEM-BZSNNMDCSA-N His-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O RNVUQLOKVIPNEM-BZSNNMDCSA-N 0.000 description 1
- JATYGDHMDRAISQ-KKUMJFAQSA-N His-Tyr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O JATYGDHMDRAISQ-KKUMJFAQSA-N 0.000 description 1
- WYKXJGWSJUULSL-AVGNSLFASA-N His-Val-Arg Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCCNC(=N)N)C(=O)O WYKXJGWSJUULSL-AVGNSLFASA-N 0.000 description 1
- RCFDOSNHHZGBOY-ACZMJKKPSA-N Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(O)=O RCFDOSNHHZGBOY-ACZMJKKPSA-N 0.000 description 1
- JXUGDUWBMKIJDC-NAKRPEOUSA-N Ile-Ala-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JXUGDUWBMKIJDC-NAKRPEOUSA-N 0.000 description 1
- TZCGZYWNIDZZMR-NAKRPEOUSA-N Ile-Arg-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)O)N TZCGZYWNIDZZMR-NAKRPEOUSA-N 0.000 description 1
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 1
- YKRIXHPEIZUDDY-GMOBBJLQSA-N Ile-Asn-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKRIXHPEIZUDDY-GMOBBJLQSA-N 0.000 description 1
- HZMLFETXHFHGBB-UGYAYLCHSA-N Ile-Asn-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZMLFETXHFHGBB-UGYAYLCHSA-N 0.000 description 1
- PJLLMGWWINYQPB-PEFMBERDSA-N Ile-Asn-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PJLLMGWWINYQPB-PEFMBERDSA-N 0.000 description 1
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 1
- FJWYJQRCVNGEAQ-ZPFDUUQYSA-N Ile-Asn-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N FJWYJQRCVNGEAQ-ZPFDUUQYSA-N 0.000 description 1
- LEDRIAHEWDJRMF-CFMVVWHZSA-N Ile-Asn-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LEDRIAHEWDJRMF-CFMVVWHZSA-N 0.000 description 1
- IDAHFEPYTJJZFD-PEFMBERDSA-N Ile-Asp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IDAHFEPYTJJZFD-PEFMBERDSA-N 0.000 description 1
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 1
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 1
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 1
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 1
- LOXMWQOKYBGCHF-JBDRJPRFSA-N Ile-Cys-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O LOXMWQOKYBGCHF-JBDRJPRFSA-N 0.000 description 1
- AWTDTFXPVCTHAK-BJDJZHNGSA-N Ile-Cys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N AWTDTFXPVCTHAK-BJDJZHNGSA-N 0.000 description 1
- DURWCDDDAWVPOP-JBDRJPRFSA-N Ile-Cys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N DURWCDDDAWVPOP-JBDRJPRFSA-N 0.000 description 1
- DMZOUKXXHJQPTL-GRLWGSQLSA-N Ile-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N DMZOUKXXHJQPTL-GRLWGSQLSA-N 0.000 description 1
- KUHFPGIVBOCRMV-MNXVOIDGSA-N Ile-Gln-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N KUHFPGIVBOCRMV-MNXVOIDGSA-N 0.000 description 1
- LKACSKJPTFSBHR-MNXVOIDGSA-N Ile-Gln-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N LKACSKJPTFSBHR-MNXVOIDGSA-N 0.000 description 1
- UBHUJPVCJHPSEU-GRLWGSQLSA-N Ile-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N UBHUJPVCJHPSEU-GRLWGSQLSA-N 0.000 description 1
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 1
- TVSPLSZTKTUYLV-ZPFDUUQYSA-N Ile-Glu-Met Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O TVSPLSZTKTUYLV-ZPFDUUQYSA-N 0.000 description 1
- FUOYNOXRWPJPAN-QEWYBTABSA-N Ile-Glu-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FUOYNOXRWPJPAN-QEWYBTABSA-N 0.000 description 1
- XLCZWMJPVGRWHJ-KQXIARHKSA-N Ile-Glu-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N XLCZWMJPVGRWHJ-KQXIARHKSA-N 0.000 description 1
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 1
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 description 1
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 1
- JLWLMGADIQFKRD-QSFUFRPTSA-N Ile-His-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CN=CN1 JLWLMGADIQFKRD-QSFUFRPTSA-N 0.000 description 1
- AREBLHSMLMRICD-PYJNHQTQSA-N Ile-His-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AREBLHSMLMRICD-PYJNHQTQSA-N 0.000 description 1
- HYLIOBDWPQNLKI-HVTMNAMFSA-N Ile-His-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HYLIOBDWPQNLKI-HVTMNAMFSA-N 0.000 description 1
- YKLOMBNBQUTJDT-HVTMNAMFSA-N Ile-His-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YKLOMBNBQUTJDT-HVTMNAMFSA-N 0.000 description 1
- CMNMPCTVCWWYHY-MXAVVETBSA-N Ile-His-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(C)C)C(=O)O)N CMNMPCTVCWWYHY-MXAVVETBSA-N 0.000 description 1
- KEKTTYCXKGBAAL-VGDYDELISA-N Ile-His-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N KEKTTYCXKGBAAL-VGDYDELISA-N 0.000 description 1
- APDIECQNNDGFPD-PYJNHQTQSA-N Ile-His-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N APDIECQNNDGFPD-PYJNHQTQSA-N 0.000 description 1
- KYLIZSDYWQQTFM-PEDHHIEDSA-N Ile-Ile-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N KYLIZSDYWQQTFM-PEDHHIEDSA-N 0.000 description 1
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 1
- UWLHDGMRWXHFFY-HPCHECBXSA-N Ile-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N1CCC[C@@H]1C(=O)O)N UWLHDGMRWXHFFY-HPCHECBXSA-N 0.000 description 1
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 1
- DBXXASNNDTXOLU-MXAVVETBSA-N Ile-Leu-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DBXXASNNDTXOLU-MXAVVETBSA-N 0.000 description 1
- PWUMCBLVWPCKNO-MGHWNKPDSA-N Ile-Leu-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PWUMCBLVWPCKNO-MGHWNKPDSA-N 0.000 description 1
- NZGTYCMLUGYMCV-XUXIUFHCSA-N Ile-Lys-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N NZGTYCMLUGYMCV-XUXIUFHCSA-N 0.000 description 1
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 description 1
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 1
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 1
- UFRXVQGGPNSJRY-CYDGBPFRSA-N Ile-Met-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N UFRXVQGGPNSJRY-CYDGBPFRSA-N 0.000 description 1
- RVNOXPZHMUWCLW-GMOBBJLQSA-N Ile-Met-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N RVNOXPZHMUWCLW-GMOBBJLQSA-N 0.000 description 1
- NNVXABCGXOLIEB-PYJNHQTQSA-N Ile-Met-His Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NNVXABCGXOLIEB-PYJNHQTQSA-N 0.000 description 1
- MSASLZGZQAXVFP-PEDHHIEDSA-N Ile-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N MSASLZGZQAXVFP-PEDHHIEDSA-N 0.000 description 1
- ZUPJCJINYQISSN-XUXIUFHCSA-N Ile-Met-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUPJCJINYQISSN-XUXIUFHCSA-N 0.000 description 1
- FTUZWJVSNZMLPI-RVMXOQNASA-N Ile-Met-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N FTUZWJVSNZMLPI-RVMXOQNASA-N 0.000 description 1
- BKPPWVSPSIUXHZ-OSUNSFLBSA-N Ile-Met-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N BKPPWVSPSIUXHZ-OSUNSFLBSA-N 0.000 description 1
- IIWQTXMUALXGOV-PCBIJLKTSA-N Ile-Phe-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IIWQTXMUALXGOV-PCBIJLKTSA-N 0.000 description 1
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 1
- LRAUKBMYHHNADU-DKIMLUQUSA-N Ile-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 LRAUKBMYHHNADU-DKIMLUQUSA-N 0.000 description 1
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 1
- VEPIBPGLTLPBDW-URLPEUOOSA-N Ile-Phe-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VEPIBPGLTLPBDW-URLPEUOOSA-N 0.000 description 1
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 1
- BJECXJHLUJXPJQ-PYJNHQTQSA-N Ile-Pro-His Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N BJECXJHLUJXPJQ-PYJNHQTQSA-N 0.000 description 1
- CZWANIQKACCEKW-CYDGBPFRSA-N Ile-Pro-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)O)N CZWANIQKACCEKW-CYDGBPFRSA-N 0.000 description 1
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 1
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 1
- XMYURPUVJSKTMC-KBIXCLLPSA-N Ile-Ser-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XMYURPUVJSKTMC-KBIXCLLPSA-N 0.000 description 1
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 1
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 1
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 1
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 1
- NAFIFZNBSPWYOO-RWRJDSDZSA-N Ile-Thr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NAFIFZNBSPWYOO-RWRJDSDZSA-N 0.000 description 1
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 1
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 1
- DGTOKVBDZXJHNZ-WZLNRYEVSA-N Ile-Thr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N DGTOKVBDZXJHNZ-WZLNRYEVSA-N 0.000 description 1
- GNXGAVNTVNOCLL-SIUGBPQLSA-N Ile-Tyr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GNXGAVNTVNOCLL-SIUGBPQLSA-N 0.000 description 1
- REXAUQBGSGDEJY-IGISWZIWSA-N Ile-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N REXAUQBGSGDEJY-IGISWZIWSA-N 0.000 description 1
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 1
- DZMWFIRHFFVBHS-ZEWNOJEFSA-N Ile-Tyr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N DZMWFIRHFFVBHS-ZEWNOJEFSA-N 0.000 description 1
- NSPNUMNLZNOPAQ-SJWGOKEGSA-N Ile-Tyr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N NSPNUMNLZNOPAQ-SJWGOKEGSA-N 0.000 description 1
- IPFKIGNDTUOFAF-CYDGBPFRSA-N Ile-Val-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IPFKIGNDTUOFAF-CYDGBPFRSA-N 0.000 description 1
- AUIYHFRUOOKTGX-UKJIMTQDSA-N Ile-Val-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N AUIYHFRUOOKTGX-UKJIMTQDSA-N 0.000 description 1
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 1
- DLEBSGAVWRPTIX-PEDHHIEDSA-N Ile-Val-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)[C@@H](C)CC DLEBSGAVWRPTIX-PEDHHIEDSA-N 0.000 description 1
- UYODHPPSCXBNCS-XUXIUFHCSA-N Ile-Val-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C UYODHPPSCXBNCS-XUXIUFHCSA-N 0.000 description 1
- ZSESFIFAYQEKRD-CYDGBPFRSA-N Ile-Val-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N ZSESFIFAYQEKRD-CYDGBPFRSA-N 0.000 description 1
- WIYDLTIBHZSPKY-HJWJTTGWSA-N Ile-Val-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WIYDLTIBHZSPKY-HJWJTTGWSA-N 0.000 description 1
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 1
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 1
- PWWVAXIEGOYWEE-UHFFFAOYSA-N Isophenergan Chemical compound C1=CC=C2N(CC(C)N(C)C)C3=CC=CC=C3SC2=C1 PWWVAXIEGOYWEE-UHFFFAOYSA-N 0.000 description 1
- 101100498822 Klebsiella oxytoca (strain ATCC 8724 / DSM 4798 / JCM 20051 / NBRC 3318 / NRRL B-199 / KCTC 1686) ddrA gene Proteins 0.000 description 1
- 241000721596 Klebsiella oxytoca 10-5245 Species 0.000 description 1
- 241000588747 Klebsiella pneumoniae Species 0.000 description 1
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 1
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 1
- 241000186604 Lactobacillus reuteri Species 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 1
- SUPVSFFZWVOEOI-CQDKDKBSSA-N Leu-Ala-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-CQDKDKBSSA-N 0.000 description 1
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 1
- JUWJEAPUNARGCF-DCAQKATOSA-N Leu-Arg-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JUWJEAPUNARGCF-DCAQKATOSA-N 0.000 description 1
- NTRAGDHVSGKUSF-AVGNSLFASA-N Leu-Arg-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NTRAGDHVSGKUSF-AVGNSLFASA-N 0.000 description 1
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 1
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 1
- FJUKMPUELVROGK-IHRRRGAJSA-N Leu-Arg-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N FJUKMPUELVROGK-IHRRRGAJSA-N 0.000 description 1
- UILIPCLTHRPCRB-XUXIUFHCSA-N Leu-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(C)C)N UILIPCLTHRPCRB-XUXIUFHCSA-N 0.000 description 1
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 1
- GPXFZVUVPCFTMG-AVGNSLFASA-N Leu-Arg-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(C)C GPXFZVUVPCFTMG-AVGNSLFASA-N 0.000 description 1
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 1
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 1
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 1
- MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 description 1
- WXHFZJFZWNCDNB-KKUMJFAQSA-N Leu-Asn-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXHFZJFZWNCDNB-KKUMJFAQSA-N 0.000 description 1
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 1
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 1
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 1
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 1
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 1
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 1
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 1
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 1
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 1
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 1
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- JRJLGNFWYFSJHB-HOCLYGCPSA-N Leu-Gly-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRJLGNFWYFSJHB-HOCLYGCPSA-N 0.000 description 1
- BKTXKJMNTSMJDQ-AVGNSLFASA-N Leu-His-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BKTXKJMNTSMJDQ-AVGNSLFASA-N 0.000 description 1
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 1
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 1
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 1
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 description 1
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 1
- NRFGTHFONZYFNY-MGHWNKPDSA-N Leu-Ile-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NRFGTHFONZYFNY-MGHWNKPDSA-N 0.000 description 1
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 1
- FAELBUXXFQLUAX-AJNGGQMLSA-N Leu-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C FAELBUXXFQLUAX-AJNGGQMLSA-N 0.000 description 1
- UBZGNBKMIJHOHL-BZSNNMDCSA-N Leu-Leu-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 UBZGNBKMIJHOHL-BZSNNMDCSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 1
- REPBGZHJKYWFMJ-KKUMJFAQSA-N Leu-Lys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N REPBGZHJKYWFMJ-KKUMJFAQSA-N 0.000 description 1
- VVQJGYPTIYOFBR-IHRRRGAJSA-N Leu-Lys-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N VVQJGYPTIYOFBR-IHRRRGAJSA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- ARRIJPQRBWRNLT-DCAQKATOSA-N Leu-Met-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ARRIJPQRBWRNLT-DCAQKATOSA-N 0.000 description 1
- WXZOHBVPVKABQN-DCAQKATOSA-N Leu-Met-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WXZOHBVPVKABQN-DCAQKATOSA-N 0.000 description 1
- NHRINZSPIUXYQZ-DCAQKATOSA-N Leu-Met-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)O)N NHRINZSPIUXYQZ-DCAQKATOSA-N 0.000 description 1
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 1
- DDVHDMSBLRAKNV-IHRRRGAJSA-N Leu-Met-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O DDVHDMSBLRAKNV-IHRRRGAJSA-N 0.000 description 1
- JVTYXRRFZCEPPK-RHYQMDGZSA-N Leu-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)N)O JVTYXRRFZCEPPK-RHYQMDGZSA-N 0.000 description 1
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 1
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 1
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 1
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 1
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 1
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 1
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 1
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 1
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 1
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 1
- KIZIOFNVSOSKJI-CIUDSAMLSA-N Leu-Ser-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N KIZIOFNVSOSKJI-CIUDSAMLSA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 1
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 1
- ZGGVHTQAPHVMKM-IHPCNDPISA-N Leu-Trp-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCCN)C(=O)O)N ZGGVHTQAPHVMKM-IHPCNDPISA-N 0.000 description 1
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 1
- RDFIVFHPOSOXMW-ACRUOGEOSA-N Leu-Tyr-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RDFIVFHPOSOXMW-ACRUOGEOSA-N 0.000 description 1
- BGGTYDNTOYRTTR-MEYUZBJRSA-N Leu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(C)C)N)O BGGTYDNTOYRTTR-MEYUZBJRSA-N 0.000 description 1
- YIRIDPUGZKHMHT-ACRUOGEOSA-N Leu-Tyr-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YIRIDPUGZKHMHT-ACRUOGEOSA-N 0.000 description 1
- VQHUBNVKFFLWRP-ULQDDVLXSA-N Leu-Tyr-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 VQHUBNVKFFLWRP-ULQDDVLXSA-N 0.000 description 1
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 1
- XOEDPXDZJHBQIX-ULQDDVLXSA-N Leu-Val-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOEDPXDZJHBQIX-ULQDDVLXSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- 241000186807 Listeria seeligeri Species 0.000 description 1
- RVOMPSJXSRPFJT-DCAQKATOSA-N Lys-Ala-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVOMPSJXSRPFJT-DCAQKATOSA-N 0.000 description 1
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 1
- VHFFQUSNFFIZBT-CIUDSAMLSA-N Lys-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N VHFFQUSNFFIZBT-CIUDSAMLSA-N 0.000 description 1
- JCFYLFOCALSNLQ-GUBZILKMSA-N Lys-Ala-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JCFYLFOCALSNLQ-GUBZILKMSA-N 0.000 description 1
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 1
- WQWZXKWOEVSGQM-DCAQKATOSA-N Lys-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN WQWZXKWOEVSGQM-DCAQKATOSA-N 0.000 description 1
- IXHKPDJKKCUKHS-GARJFASQSA-N Lys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IXHKPDJKKCUKHS-GARJFASQSA-N 0.000 description 1
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 1
- WXJKFRMKJORORD-DCAQKATOSA-N Lys-Arg-Ala Chemical compound NC(=N)NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CCCCN WXJKFRMKJORORD-DCAQKATOSA-N 0.000 description 1
- ALSRJRIWBNENFY-DCAQKATOSA-N Lys-Arg-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O ALSRJRIWBNENFY-DCAQKATOSA-N 0.000 description 1
- GAOJCVKPIGHTGO-UWVGGRQHSA-N Lys-Arg-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O GAOJCVKPIGHTGO-UWVGGRQHSA-N 0.000 description 1
- VHNOAIFVYUQOOY-XUXIUFHCSA-N Lys-Arg-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VHNOAIFVYUQOOY-XUXIUFHCSA-N 0.000 description 1
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 1
- SJNZALDHDUYDBU-IHRRRGAJSA-N Lys-Arg-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(O)=O SJNZALDHDUYDBU-IHRRRGAJSA-N 0.000 description 1
- FUKDBQGFSJUXGX-RWMBFGLXSA-N Lys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)C(=O)O FUKDBQGFSJUXGX-RWMBFGLXSA-N 0.000 description 1
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 1
- HIIZIQUUHIXUJY-GUBZILKMSA-N Lys-Asp-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HIIZIQUUHIXUJY-GUBZILKMSA-N 0.000 description 1
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 1
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 1
- NTBFKPBULZGXQL-KKUMJFAQSA-N Lys-Asp-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTBFKPBULZGXQL-KKUMJFAQSA-N 0.000 description 1
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 1
- NDSNUWJPZKTFAR-DCAQKATOSA-N Lys-Cys-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCCCN NDSNUWJPZKTFAR-DCAQKATOSA-N 0.000 description 1
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 1
- HEWWNLVEWBJBKA-WDCWCFNPSA-N Lys-Gln-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN HEWWNLVEWBJBKA-WDCWCFNPSA-N 0.000 description 1
- QFGVDCBPDGLVTA-SZMVWBNQSA-N Lys-Gln-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 QFGVDCBPDGLVTA-SZMVWBNQSA-N 0.000 description 1
- KZOHPCYVORJBLG-AVGNSLFASA-N Lys-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N KZOHPCYVORJBLG-AVGNSLFASA-N 0.000 description 1
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 1
- DUTMKEAPLLUGNO-JYJNAYRXSA-N Lys-Glu-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DUTMKEAPLLUGNO-JYJNAYRXSA-N 0.000 description 1
- PAMDBWYMLWOELY-SDDRHHMPSA-N Lys-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O PAMDBWYMLWOELY-SDDRHHMPSA-N 0.000 description 1
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 1
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 1
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 1
- RFQATBGBLDAKGI-VHSXEESVSA-N Lys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCCN)N)C(=O)O RFQATBGBLDAKGI-VHSXEESVSA-N 0.000 description 1
- NNKLKUUGESXCBS-KBPBESRZSA-N Lys-Gly-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NNKLKUUGESXCBS-KBPBESRZSA-N 0.000 description 1
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 1
- KNKJPYAZQUFLQK-IHRRRGAJSA-N Lys-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCCCN)N KNKJPYAZQUFLQK-IHRRRGAJSA-N 0.000 description 1
- SQJSXOQXJYAVRV-SRVKXCTJSA-N Lys-His-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N SQJSXOQXJYAVRV-SRVKXCTJSA-N 0.000 description 1
- GTAXSKOXPIISBW-AVGNSLFASA-N Lys-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N GTAXSKOXPIISBW-AVGNSLFASA-N 0.000 description 1
- OWRUUFUVXFREBD-KKUMJFAQSA-N Lys-His-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O OWRUUFUVXFREBD-KKUMJFAQSA-N 0.000 description 1
- OIYWBDBHEGAVST-BZSNNMDCSA-N Lys-His-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OIYWBDBHEGAVST-BZSNNMDCSA-N 0.000 description 1
- SPCHLZUWJTYZFC-IHRRRGAJSA-N Lys-His-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O SPCHLZUWJTYZFC-IHRRRGAJSA-N 0.000 description 1
- KEPWSUPUFAPBRF-DKIMLUQUSA-N Lys-Ile-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KEPWSUPUFAPBRF-DKIMLUQUSA-N 0.000 description 1
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 1
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 1
- PFZWARWVRNTPBR-IHPCNDPISA-N Lys-Leu-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N PFZWARWVRNTPBR-IHPCNDPISA-N 0.000 description 1
- VUTWYNQUSJWBHO-BZSNNMDCSA-N Lys-Leu-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VUTWYNQUSJWBHO-BZSNNMDCSA-N 0.000 description 1
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 1
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 1
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 1
- PLDJDCJLRCYPJB-VOAKCMCISA-N Lys-Lys-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PLDJDCJLRCYPJB-VOAKCMCISA-N 0.000 description 1
- BXPHMHQHYHILBB-BZSNNMDCSA-N Lys-Lys-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BXPHMHQHYHILBB-BZSNNMDCSA-N 0.000 description 1
- URBJRJKWSUFCKS-AVGNSLFASA-N Lys-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCCCN)N URBJRJKWSUFCKS-AVGNSLFASA-N 0.000 description 1
- GOVDTWNJCBRRBJ-DCAQKATOSA-N Lys-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N GOVDTWNJCBRRBJ-DCAQKATOSA-N 0.000 description 1
- MTBLFIQZECOEBY-IHRRRGAJSA-N Lys-Met-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O MTBLFIQZECOEBY-IHRRRGAJSA-N 0.000 description 1
- ALEVUGKHINJNIF-QEJZJMRPSA-N Lys-Phe-Ala Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ALEVUGKHINJNIF-QEJZJMRPSA-N 0.000 description 1
- JPYPRVHMKRFTAT-KKUMJFAQSA-N Lys-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N JPYPRVHMKRFTAT-KKUMJFAQSA-N 0.000 description 1
- ZJSZPXISKMDJKQ-JYJNAYRXSA-N Lys-Phe-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=CC=C1 ZJSZPXISKMDJKQ-JYJNAYRXSA-N 0.000 description 1
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 1
- LUAJJLPHUXPQLH-KKUMJFAQSA-N Lys-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N LUAJJLPHUXPQLH-KKUMJFAQSA-N 0.000 description 1
- WLXGMVVHTIUPHE-ULQDDVLXSA-N Lys-Phe-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O WLXGMVVHTIUPHE-ULQDDVLXSA-N 0.000 description 1
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 1
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 1
- MSSABBQOBUZFKZ-IHRRRGAJSA-N Lys-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCCN)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O MSSABBQOBUZFKZ-IHRRRGAJSA-N 0.000 description 1
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 1
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 1
- LECIJRIRMVOFMH-ULQDDVLXSA-N Lys-Pro-Phe Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LECIJRIRMVOFMH-ULQDDVLXSA-N 0.000 description 1
- UQJOKDAYFULYIX-AVGNSLFASA-N Lys-Pro-Pro Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 UQJOKDAYFULYIX-AVGNSLFASA-N 0.000 description 1
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 1
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 1
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 1
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 1
- YRNRVKTYDSLKMD-KKUMJFAQSA-N Lys-Ser-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YRNRVKTYDSLKMD-KKUMJFAQSA-N 0.000 description 1
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 1
- TVHCDSBMFQYPNA-RHYQMDGZSA-N Lys-Thr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TVHCDSBMFQYPNA-RHYQMDGZSA-N 0.000 description 1
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 1
- KTINOHQFVVCEGQ-XIRDDKMYSA-N Lys-Trp-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CC(O)=O)C(O)=O KTINOHQFVVCEGQ-XIRDDKMYSA-N 0.000 description 1
- ZJSXCIMWLPSTMG-HSCHXYMDSA-N Lys-Trp-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZJSXCIMWLPSTMG-HSCHXYMDSA-N 0.000 description 1
- NROQVSYLPRLJIP-PMVMPFDFSA-N Lys-Trp-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NROQVSYLPRLJIP-PMVMPFDFSA-N 0.000 description 1
- ZFNYWKHYUMEZDZ-WDSOQIARSA-N Lys-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCCCN)N ZFNYWKHYUMEZDZ-WDSOQIARSA-N 0.000 description 1
- PELXPRPDQRFBGQ-KKUMJFAQSA-N Lys-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O PELXPRPDQRFBGQ-KKUMJFAQSA-N 0.000 description 1
- HONVOXINDBETTI-KKUMJFAQSA-N Lys-Tyr-Cys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CS)C(O)=O)CC1=CC=C(O)C=C1 HONVOXINDBETTI-KKUMJFAQSA-N 0.000 description 1
- XYLSGAWRCZECIQ-JYJNAYRXSA-N Lys-Tyr-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 XYLSGAWRCZECIQ-JYJNAYRXSA-N 0.000 description 1
- IMDJSVBFQKDDEQ-MGHWNKPDSA-N Lys-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCCN)N IMDJSVBFQKDDEQ-MGHWNKPDSA-N 0.000 description 1
- WINFHLHJTRGLCV-BZSNNMDCSA-N Lys-Tyr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 WINFHLHJTRGLCV-BZSNNMDCSA-N 0.000 description 1
- NQOQDINRVQCAKD-ULQDDVLXSA-N Lys-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCCN)N NQOQDINRVQCAKD-ULQDDVLXSA-N 0.000 description 1
- LMMBAXJRYSXCOQ-ACRUOGEOSA-N Lys-Tyr-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O LMMBAXJRYSXCOQ-ACRUOGEOSA-N 0.000 description 1
- QLFAPXUXEBAWEK-NHCYSSNCSA-N Lys-Val-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QLFAPXUXEBAWEK-NHCYSSNCSA-N 0.000 description 1
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 1
- VWJFOUBDZIUXGA-AVGNSLFASA-N Lys-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCCN)N VWJFOUBDZIUXGA-AVGNSLFASA-N 0.000 description 1
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 1
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 1
- HMZPYMSEAALNAE-ULQDDVLXSA-N Lys-Val-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMZPYMSEAALNAE-ULQDDVLXSA-N 0.000 description 1
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 1
- 241001105696 Marinithermus hydrothermalis Species 0.000 description 1
- GAELMDJMQDUDLJ-BQBZGAKWSA-N Met-Ala-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O GAELMDJMQDUDLJ-BQBZGAKWSA-N 0.000 description 1
- MUYQDMBLDFEVRJ-LSJOCFKGSA-N Met-Ala-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 MUYQDMBLDFEVRJ-LSJOCFKGSA-N 0.000 description 1
- QGQGAIBGTUJRBR-NAKRPEOUSA-N Met-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCSC QGQGAIBGTUJRBR-NAKRPEOUSA-N 0.000 description 1
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 1
- WYEXWKAWMNJKPN-UBHSHLNASA-N Met-Ala-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCSC)N WYEXWKAWMNJKPN-UBHSHLNASA-N 0.000 description 1
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 1
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 1
- DLAFCQWUMFMZSN-GUBZILKMSA-N Met-Arg-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N DLAFCQWUMFMZSN-GUBZILKMSA-N 0.000 description 1
- WDTLNWHPIPCMMP-AVGNSLFASA-N Met-Arg-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O WDTLNWHPIPCMMP-AVGNSLFASA-N 0.000 description 1
- DCHHUGLTVLJYKA-FXQIFTODSA-N Met-Asn-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DCHHUGLTVLJYKA-FXQIFTODSA-N 0.000 description 1
- FRWZTWWOORIIBA-FXQIFTODSA-N Met-Asn-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FRWZTWWOORIIBA-FXQIFTODSA-N 0.000 description 1
- IVCPHARVJUYDPA-FXQIFTODSA-N Met-Asn-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IVCPHARVJUYDPA-FXQIFTODSA-N 0.000 description 1
- ACYHZNZHIZWLQF-BQBZGAKWSA-N Met-Asn-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ACYHZNZHIZWLQF-BQBZGAKWSA-N 0.000 description 1
- QXEVZBXTDTVPCP-GMOBBJLQSA-N Met-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCSC)N QXEVZBXTDTVPCP-GMOBBJLQSA-N 0.000 description 1
- SQUTUWHAAWJYES-GUBZILKMSA-N Met-Asp-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SQUTUWHAAWJYES-GUBZILKMSA-N 0.000 description 1
- XMMWDTUFTZMQFD-GMOBBJLQSA-N Met-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCSC XMMWDTUFTZMQFD-GMOBBJLQSA-N 0.000 description 1
- FBQMBZLJHOQAIH-GUBZILKMSA-N Met-Asp-Met Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O FBQMBZLJHOQAIH-GUBZILKMSA-N 0.000 description 1
- XOMXAVJBLRROMC-IHRRRGAJSA-N Met-Asp-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOMXAVJBLRROMC-IHRRRGAJSA-N 0.000 description 1
- FVKRQMQQFGBXHV-QXEWZRGKSA-N Met-Asp-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FVKRQMQQFGBXHV-QXEWZRGKSA-N 0.000 description 1
- AVTWKENDGGUWDC-BQBZGAKWSA-N Met-Cys-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O AVTWKENDGGUWDC-BQBZGAKWSA-N 0.000 description 1
- IZLCDZDNZFEDHB-DCAQKATOSA-N Met-Cys-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N IZLCDZDNZFEDHB-DCAQKATOSA-N 0.000 description 1
- RCMDUFDXDYTXOK-CIUDSAMLSA-N Met-Gln-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(O)=O RCMDUFDXDYTXOK-CIUDSAMLSA-N 0.000 description 1
- JYCQGAGDJQYEDB-GUBZILKMSA-N Met-Gln-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O JYCQGAGDJQYEDB-GUBZILKMSA-N 0.000 description 1
- FWTBMGAKKPSTBT-GUBZILKMSA-N Met-Gln-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FWTBMGAKKPSTBT-GUBZILKMSA-N 0.000 description 1
- GXYYFDKJHLRNSI-SRVKXCTJSA-N Met-Gln-His Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O GXYYFDKJHLRNSI-SRVKXCTJSA-N 0.000 description 1
- AWOMRHGUWFBDNU-ZPFDUUQYSA-N Met-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N AWOMRHGUWFBDNU-ZPFDUUQYSA-N 0.000 description 1
- KQBJYJXPZBNEIK-DCAQKATOSA-N Met-Glu-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQBJYJXPZBNEIK-DCAQKATOSA-N 0.000 description 1
- AETNZPKUUYYYEK-CIUDSAMLSA-N Met-Glu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AETNZPKUUYYYEK-CIUDSAMLSA-N 0.000 description 1
- DJDFBVNNDAUPRW-GUBZILKMSA-N Met-Glu-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O DJDFBVNNDAUPRW-GUBZILKMSA-N 0.000 description 1
- CHQWUYSNAOABIP-ZPFDUUQYSA-N Met-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N CHQWUYSNAOABIP-ZPFDUUQYSA-N 0.000 description 1
- SJDQOYTYNGZZJX-SRVKXCTJSA-N Met-Glu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SJDQOYTYNGZZJX-SRVKXCTJSA-N 0.000 description 1
- RAAVFTFEAUAVIY-DCAQKATOSA-N Met-Glu-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N RAAVFTFEAUAVIY-DCAQKATOSA-N 0.000 description 1
- JPCHYAUKOUGOIB-HJGDQZAQSA-N Met-Glu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPCHYAUKOUGOIB-HJGDQZAQSA-N 0.000 description 1
- OOSPRDCGTLQLBP-NHCYSSNCSA-N Met-Glu-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OOSPRDCGTLQLBP-NHCYSSNCSA-N 0.000 description 1
- SLQDSYZHHOKQSR-QXEWZRGKSA-N Met-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCSC SLQDSYZHHOKQSR-QXEWZRGKSA-N 0.000 description 1
- JACAKCWAOHKQBV-UWVGGRQHSA-N Met-Gly-Lys Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN JACAKCWAOHKQBV-UWVGGRQHSA-N 0.000 description 1
- BMHIFARYXOJDLD-WPRPVWTQSA-N Met-Gly-Val Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O BMHIFARYXOJDLD-WPRPVWTQSA-N 0.000 description 1
- CUICVBQQHMKBRJ-LSJOCFKGSA-N Met-His-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](C)C(O)=O CUICVBQQHMKBRJ-LSJOCFKGSA-N 0.000 description 1
- JZNGSNMTXAHMSV-AVGNSLFASA-N Met-His-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JZNGSNMTXAHMSV-AVGNSLFASA-N 0.000 description 1
- RKIIYGUHIQJCBW-SRVKXCTJSA-N Met-His-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RKIIYGUHIQJCBW-SRVKXCTJSA-N 0.000 description 1
- AEQVPPGEJJBFEE-CYDGBPFRSA-N Met-Ile-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEQVPPGEJJBFEE-CYDGBPFRSA-N 0.000 description 1
- DJBCKVNHEIJLQA-GMOBBJLQSA-N Met-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCSC)N DJBCKVNHEIJLQA-GMOBBJLQSA-N 0.000 description 1
- RVYDCISQIGHAFC-ZPFDUUQYSA-N Met-Ile-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O RVYDCISQIGHAFC-ZPFDUUQYSA-N 0.000 description 1
- QGRJTULYDZUBAY-ZPFDUUQYSA-N Met-Ile-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGRJTULYDZUBAY-ZPFDUUQYSA-N 0.000 description 1
- ORRNBLTZBBESPN-HJWJTTGWSA-N Met-Ile-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ORRNBLTZBBESPN-HJWJTTGWSA-N 0.000 description 1
- FWAHLGXNBLWIKB-NAKRPEOUSA-N Met-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCSC FWAHLGXNBLWIKB-NAKRPEOUSA-N 0.000 description 1
- HWROAFGWPQUPTE-OSUNSFLBSA-N Met-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCSC)N HWROAFGWPQUPTE-OSUNSFLBSA-N 0.000 description 1
- ZIIMORLEZLVRIP-SRVKXCTJSA-N Met-Leu-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZIIMORLEZLVRIP-SRVKXCTJSA-N 0.000 description 1
- OSZTUONKUMCWEP-XUXIUFHCSA-N Met-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC OSZTUONKUMCWEP-XUXIUFHCSA-N 0.000 description 1
- CHDYFPCQVUOJEB-ULQDDVLXSA-N Met-Leu-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CHDYFPCQVUOJEB-ULQDDVLXSA-N 0.000 description 1
- DBXMFHGGHMXYHY-DCAQKATOSA-N Met-Leu-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O DBXMFHGGHMXYHY-DCAQKATOSA-N 0.000 description 1
- UFOWQBYMUILSRK-IHRRRGAJSA-N Met-Lys-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 UFOWQBYMUILSRK-IHRRRGAJSA-N 0.000 description 1
- HSJIGJRZYUADSS-IHRRRGAJSA-N Met-Lys-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HSJIGJRZYUADSS-IHRRRGAJSA-N 0.000 description 1
- HAQLBBVZAGMESV-IHRRRGAJSA-N Met-Lys-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O HAQLBBVZAGMESV-IHRRRGAJSA-N 0.000 description 1
- VAGCEUUEMMXFEX-GUBZILKMSA-N Met-Met-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O VAGCEUUEMMXFEX-GUBZILKMSA-N 0.000 description 1
- JKXVPNCSAMWUEJ-GUBZILKMSA-N Met-Met-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O JKXVPNCSAMWUEJ-GUBZILKMSA-N 0.000 description 1
- WXUUEPIDLLQBLJ-DCAQKATOSA-N Met-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N WXUUEPIDLLQBLJ-DCAQKATOSA-N 0.000 description 1
- CRVSHEPROQHVQT-AVGNSLFASA-N Met-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N CRVSHEPROQHVQT-AVGNSLFASA-N 0.000 description 1
- XGIQKEAKUSPCBU-SRVKXCTJSA-N Met-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCSC)N XGIQKEAKUSPCBU-SRVKXCTJSA-N 0.000 description 1
- KBTQZYASLSUFJR-KKUMJFAQSA-N Met-Phe-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KBTQZYASLSUFJR-KKUMJFAQSA-N 0.000 description 1
- FBLBCGLSRXBANI-KKUMJFAQSA-N Met-Phe-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FBLBCGLSRXBANI-KKUMJFAQSA-N 0.000 description 1
- JQHYVIKEFYETEW-IHRRRGAJSA-N Met-Phe-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=CC=C1 JQHYVIKEFYETEW-IHRRRGAJSA-N 0.000 description 1
- VQILILSLEFDECU-GUBZILKMSA-N Met-Pro-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O VQILILSLEFDECU-GUBZILKMSA-N 0.000 description 1
- BQHLZUMZOXUWNU-DCAQKATOSA-N Met-Pro-Glu Chemical compound CSCC[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BQHLZUMZOXUWNU-DCAQKATOSA-N 0.000 description 1
- VSJAPSMRFYUOKS-IUCAKERBSA-N Met-Pro-Gly Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O VSJAPSMRFYUOKS-IUCAKERBSA-N 0.000 description 1
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 1
- XPVCDCMPKCERFT-GUBZILKMSA-N Met-Ser-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XPVCDCMPKCERFT-GUBZILKMSA-N 0.000 description 1
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 1
- WRXOPYNEKGZWAZ-FXQIFTODSA-N Met-Ser-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O WRXOPYNEKGZWAZ-FXQIFTODSA-N 0.000 description 1
- ZDJICAUBMUKVEJ-CIUDSAMLSA-N Met-Ser-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O ZDJICAUBMUKVEJ-CIUDSAMLSA-N 0.000 description 1
- LXCSZPUQKMTXNW-BQBZGAKWSA-N Met-Ser-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O LXCSZPUQKMTXNW-BQBZGAKWSA-N 0.000 description 1
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 1
- DSZFTPCSFVWMKP-DCAQKATOSA-N Met-Ser-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN DSZFTPCSFVWMKP-DCAQKATOSA-N 0.000 description 1
- FDGAMQVRGORBDV-GUBZILKMSA-N Met-Ser-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCSC FDGAMQVRGORBDV-GUBZILKMSA-N 0.000 description 1
- GGXZOTSDJJTDGB-GUBZILKMSA-N Met-Ser-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O GGXZOTSDJJTDGB-GUBZILKMSA-N 0.000 description 1
- RIIFMEBFDDXGCV-VEVYYDQMSA-N Met-Thr-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O RIIFMEBFDDXGCV-VEVYYDQMSA-N 0.000 description 1
- KYXDADPHSNFWQX-VEVYYDQMSA-N Met-Thr-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O KYXDADPHSNFWQX-VEVYYDQMSA-N 0.000 description 1
- FXBKQTOGURNXSL-HJGDQZAQSA-N Met-Thr-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O FXBKQTOGURNXSL-HJGDQZAQSA-N 0.000 description 1
- IHRFZLQEQVHXFA-RHYQMDGZSA-N Met-Thr-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCCN IHRFZLQEQVHXFA-RHYQMDGZSA-N 0.000 description 1
- YIGCDRZMZNDENK-UNQGMJICSA-N Met-Thr-Phe Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YIGCDRZMZNDENK-UNQGMJICSA-N 0.000 description 1
- AOLKTFKKSSMRTA-WDSOQIARSA-N Met-Trp-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N AOLKTFKKSSMRTA-WDSOQIARSA-N 0.000 description 1
- VOAKKHOIAFKOQZ-JYJNAYRXSA-N Met-Tyr-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)CC1=CC=C(O)C=C1 VOAKKHOIAFKOQZ-JYJNAYRXSA-N 0.000 description 1
- YJNDFEWPGLNLNH-IHRRRGAJSA-N Met-Tyr-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CS)C(O)=O)CC1=CC=C(O)C=C1 YJNDFEWPGLNLNH-IHRRRGAJSA-N 0.000 description 1
- XTSBLBXAUIBMLW-KKUMJFAQSA-N Met-Tyr-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N XTSBLBXAUIBMLW-KKUMJFAQSA-N 0.000 description 1
- CULGJGUDIJATIP-STQMWFEESA-N Met-Tyr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 CULGJGUDIJATIP-STQMWFEESA-N 0.000 description 1
- ANCPZNHGZUCSSC-ULQDDVLXSA-N Met-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)CC1=CC=C(O)C=C1 ANCPZNHGZUCSSC-ULQDDVLXSA-N 0.000 description 1
- PNHRPOWKRRJATF-IHRRRGAJSA-N Met-Tyr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 PNHRPOWKRRJATF-IHRRRGAJSA-N 0.000 description 1
- JHVNNUIQXOGAHI-KJEVXHAQSA-N Met-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N)O JHVNNUIQXOGAHI-KJEVXHAQSA-N 0.000 description 1
- YGNUDKAPJARTEM-GUBZILKMSA-N Met-Val-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O YGNUDKAPJARTEM-GUBZILKMSA-N 0.000 description 1
- VWFHWJGVLVZVIS-QXEWZRGKSA-N Met-Val-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O VWFHWJGVLVZVIS-QXEWZRGKSA-N 0.000 description 1
- CQRGINSEMFBACV-WPRPVWTQSA-N Met-Val-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O CQRGINSEMFBACV-WPRPVWTQSA-N 0.000 description 1
- CKAVKDJBSNTJDB-SRVKXCTJSA-N Met-Val-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCSC CKAVKDJBSNTJDB-SRVKXCTJSA-N 0.000 description 1
- VYDLZDRMOFYOGV-TUAOUCFPSA-N Met-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N VYDLZDRMOFYOGV-TUAOUCFPSA-N 0.000 description 1
- 101100395468 Metallosphaera sedula (strain ATCC 51363 / DSM 5348 / JCM 9185 / NBRC 15509 / TH2) Msed_2001 gene Proteins 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- 108010047562 NGR peptide Proteins 0.000 description 1
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 1
- 108010065395 Neuropep-1 Proteins 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- YEOLZNKREIGAHB-BLPRJPCASA-N OC(=O)C=C.O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCS)O[C@H]1N1C2=NC=NC(N)=C2N=C1 Chemical compound OC(=O)C=C.O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCS)O[C@H]1N1C2=NC=NC(N)=C2N=C1 YEOLZNKREIGAHB-BLPRJPCASA-N 0.000 description 1
- CFNPCSNXESBNGR-XGGCCDIMSA-N O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C(O)CC(C)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C(O)CC(C)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 CFNPCSNXESBNGR-XGGCCDIMSA-N 0.000 description 1
- 241000121202 Oligotropha carboxidovorans Species 0.000 description 1
- 241001042460 Oscillibacter valericigenes Species 0.000 description 1
- 235000021314 Palmitic acid Nutrition 0.000 description 1
- 239000001888 Peptone Substances 0.000 description 1
- 108010080698 Peptones Proteins 0.000 description 1
- 241000841063 Peptoniphilus indolicus ATCC 29427 Species 0.000 description 1
- 241001465962 Peptostreptococcus anaerobius CAG:621 Species 0.000 description 1
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 1
- JVTMTFMMMHAPCR-UBHSHLNASA-N Phe-Ala-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JVTMTFMMMHAPCR-UBHSHLNASA-N 0.000 description 1
- FPTXMUIBLMGTQH-ONGXEEELSA-N Phe-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 FPTXMUIBLMGTQH-ONGXEEELSA-N 0.000 description 1
- DFEVBOYEUQJGER-JURCDPSOSA-N Phe-Ala-Ile Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O DFEVBOYEUQJGER-JURCDPSOSA-N 0.000 description 1
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 1
- SEPNOAFMZLLCEW-UBHSHLNASA-N Phe-Ala-Val Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O SEPNOAFMZLLCEW-UBHSHLNASA-N 0.000 description 1
- AYPMIIKUMNADSU-IHRRRGAJSA-N Phe-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AYPMIIKUMNADSU-IHRRRGAJSA-N 0.000 description 1
- VHWOBXIWBDWZHK-IHRRRGAJSA-N Phe-Arg-Asp Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 VHWOBXIWBDWZHK-IHRRRGAJSA-N 0.000 description 1
- XWBJLKDCHJVKAK-KKUMJFAQSA-N Phe-Arg-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XWBJLKDCHJVKAK-KKUMJFAQSA-N 0.000 description 1
- OXUMFAOVGFODPN-KKUMJFAQSA-N Phe-Asn-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N OXUMFAOVGFODPN-KKUMJFAQSA-N 0.000 description 1
- KIEPQOIQHFKQLK-PCBIJLKTSA-N Phe-Asn-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KIEPQOIQHFKQLK-PCBIJLKTSA-N 0.000 description 1
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 1
- WMGVYPPIMZPWPN-SRVKXCTJSA-N Phe-Asp-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N WMGVYPPIMZPWPN-SRVKXCTJSA-N 0.000 description 1
- UEEVBGHEGJMDDV-AVGNSLFASA-N Phe-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEEVBGHEGJMDDV-AVGNSLFASA-N 0.000 description 1
- RIYZXJVARWJLKS-KKUMJFAQSA-N Phe-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RIYZXJVARWJLKS-KKUMJFAQSA-N 0.000 description 1
- WIVCOAKLPICYGY-KKUMJFAQSA-N Phe-Asp-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N WIVCOAKLPICYGY-KKUMJFAQSA-N 0.000 description 1
- UEHNWRNADDPYNK-DLOVCJGASA-N Phe-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N UEHNWRNADDPYNK-DLOVCJGASA-N 0.000 description 1
- QEPZQAPZKIPVDV-KKUMJFAQSA-N Phe-Cys-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N QEPZQAPZKIPVDV-KKUMJFAQSA-N 0.000 description 1
- IDUCUXTUHHIQIP-SOUVJXGZSA-N Phe-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O IDUCUXTUHHIQIP-SOUVJXGZSA-N 0.000 description 1
- CDQCFGOQNYOICK-IHRRRGAJSA-N Phe-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDQCFGOQNYOICK-IHRRRGAJSA-N 0.000 description 1
- FIRWJEJVFFGXSH-RYUDHWBXSA-N Phe-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 FIRWJEJVFFGXSH-RYUDHWBXSA-N 0.000 description 1
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 1
- XXAOSEUPEMQJOF-KKUMJFAQSA-N Phe-Glu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XXAOSEUPEMQJOF-KKUMJFAQSA-N 0.000 description 1
- BFYHIHGIHGROAT-HTUGSXCWSA-N Phe-Glu-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFYHIHGIHGROAT-HTUGSXCWSA-N 0.000 description 1
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 1
- JEBWZLWTRPZQRX-QWRGUYRKSA-N Phe-Gly-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O JEBWZLWTRPZQRX-QWRGUYRKSA-N 0.000 description 1
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 1
- HGNGAMWHGGANAU-WHOFXGATSA-N Phe-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HGNGAMWHGGANAU-WHOFXGATSA-N 0.000 description 1
- NHCKESBLOMHIIE-IRXDYDNUSA-N Phe-Gly-Phe Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 NHCKESBLOMHIIE-IRXDYDNUSA-N 0.000 description 1
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 1
- WFHRXJOZEXUKLV-IRXDYDNUSA-N Phe-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 WFHRXJOZEXUKLV-IRXDYDNUSA-N 0.000 description 1
- SPXWRYVHOZVYBU-ULQDDVLXSA-N Phe-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N SPXWRYVHOZVYBU-ULQDDVLXSA-N 0.000 description 1
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 1
- DVOCGBNHAUHKHJ-DKIMLUQUSA-N Phe-Ile-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O DVOCGBNHAUHKHJ-DKIMLUQUSA-N 0.000 description 1
- JQLQUPIYYJXZLJ-ZEWNOJEFSA-N Phe-Ile-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 JQLQUPIYYJXZLJ-ZEWNOJEFSA-N 0.000 description 1
- XMQSOOJRRVEHRO-ULQDDVLXSA-N Phe-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMQSOOJRRVEHRO-ULQDDVLXSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- KZRQONDKKJCAOL-DKIMLUQUSA-N Phe-Leu-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZRQONDKKJCAOL-DKIMLUQUSA-N 0.000 description 1
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 1
- DMEYUTSDVRCWRS-ULQDDVLXSA-N Phe-Lys-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DMEYUTSDVRCWRS-ULQDDVLXSA-N 0.000 description 1
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 1
- MJAYDXWQQUOURZ-JYJNAYRXSA-N Phe-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MJAYDXWQQUOURZ-JYJNAYRXSA-N 0.000 description 1
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 1
- AUJWXNGCAQWLEI-KBPBESRZSA-N Phe-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AUJWXNGCAQWLEI-KBPBESRZSA-N 0.000 description 1
- DOXQMJCSSYZSNM-BZSNNMDCSA-N Phe-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O DOXQMJCSSYZSNM-BZSNNMDCSA-N 0.000 description 1
- PEFJUUYFEGBXFA-BZSNNMDCSA-N Phe-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 PEFJUUYFEGBXFA-BZSNNMDCSA-N 0.000 description 1
- UXQFHEKRGHYJRA-STQMWFEESA-N Phe-Met-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O UXQFHEKRGHYJRA-STQMWFEESA-N 0.000 description 1
- ACJULKNZOCRWEI-ULQDDVLXSA-N Phe-Met-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O ACJULKNZOCRWEI-ULQDDVLXSA-N 0.000 description 1
- JKJSIYKSGIDHPM-WBAXXEDZSA-N Phe-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O JKJSIYKSGIDHPM-WBAXXEDZSA-N 0.000 description 1
- OWSLLRKCHLTUND-BZSNNMDCSA-N Phe-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OWSLLRKCHLTUND-BZSNNMDCSA-N 0.000 description 1
- DEZCWWXTRAKZKJ-UFYCRDLUSA-N Phe-Phe-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DEZCWWXTRAKZKJ-UFYCRDLUSA-N 0.000 description 1
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 1
- JLLJTMHNXQTMCK-UBHSHLNASA-N Phe-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 JLLJTMHNXQTMCK-UBHSHLNASA-N 0.000 description 1
- QARPMYDMYVLFMW-KKUMJFAQSA-N Phe-Pro-Glu Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 QARPMYDMYVLFMW-KKUMJFAQSA-N 0.000 description 1
- ZVRJWDUPIDMHDN-ULQDDVLXSA-N Phe-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 ZVRJWDUPIDMHDN-ULQDDVLXSA-N 0.000 description 1
- ZLAKUZDMKVKFAI-JYJNAYRXSA-N Phe-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O ZLAKUZDMKVKFAI-JYJNAYRXSA-N 0.000 description 1
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 1
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 1
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 1
- JXQVYPWVGUOIDV-MXAVVETBSA-N Phe-Ser-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JXQVYPWVGUOIDV-MXAVVETBSA-N 0.000 description 1
- GKRCCTYAGQPMMP-IHRRRGAJSA-N Phe-Ser-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O GKRCCTYAGQPMMP-IHRRRGAJSA-N 0.000 description 1
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 1
- GMWNQSGWWGKTSF-LFSVMHDDSA-N Phe-Thr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMWNQSGWWGKTSF-LFSVMHDDSA-N 0.000 description 1
- LTAWNJXSRUCFAN-UNQGMJICSA-N Phe-Thr-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LTAWNJXSRUCFAN-UNQGMJICSA-N 0.000 description 1
- XNQMZHLAYFWSGJ-HTUGSXCWSA-N Phe-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XNQMZHLAYFWSGJ-HTUGSXCWSA-N 0.000 description 1
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 1
- PTDAGKJHZBGDKD-OEAJRASXSA-N Phe-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O PTDAGKJHZBGDKD-OEAJRASXSA-N 0.000 description 1
- VGTJSEYTVMAASM-RPTUDFQQSA-N Phe-Thr-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VGTJSEYTVMAASM-RPTUDFQQSA-N 0.000 description 1
- VFDRDMOMHBJGKD-UFYCRDLUSA-N Phe-Tyr-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N VFDRDMOMHBJGKD-UFYCRDLUSA-N 0.000 description 1
- GTMSCDVFQLNEOY-BZSNNMDCSA-N Phe-Tyr-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N GTMSCDVFQLNEOY-BZSNNMDCSA-N 0.000 description 1
- CDHURCQGUDNBMA-UBHSHLNASA-N Phe-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDHURCQGUDNBMA-UBHSHLNASA-N 0.000 description 1
- GOUWCZRDTWTODO-YDHLFZDLSA-N Phe-Val-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O GOUWCZRDTWTODO-YDHLFZDLSA-N 0.000 description 1
- KUSYCSMTTHSZOA-DZKIICNBSA-N Phe-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N KUSYCSMTTHSZOA-DZKIICNBSA-N 0.000 description 1
- FXEKNHAJIMHRFJ-ULQDDVLXSA-N Phe-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N FXEKNHAJIMHRFJ-ULQDDVLXSA-N 0.000 description 1
- GNZCMRRSXOBHLC-JYJNAYRXSA-N Phe-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N GNZCMRRSXOBHLC-JYJNAYRXSA-N 0.000 description 1
- GAMLAXHLYGLQBJ-UFYCRDLUSA-N Phe-Val-Tyr Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC1=CC=C(C=C1)O)C(C)C)CC1=CC=CC=C1 GAMLAXHLYGLQBJ-UFYCRDLUSA-N 0.000 description 1
- APZNYJFGVAGFCF-JYJNAYRXSA-N Phe-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccccc1)C(C)C)C(O)=O APZNYJFGVAGFCF-JYJNAYRXSA-N 0.000 description 1
- OAICVXFJPJFONN-UHFFFAOYSA-N Phosphorus Chemical compound [P] OAICVXFJPJFONN-UHFFFAOYSA-N 0.000 description 1
- WUUNPBLZLWVARQ-QAETUUGQSA-N Postin Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 WUUNPBLZLWVARQ-QAETUUGQSA-N 0.000 description 1
- ALJGSKMBIUEJOB-FXQIFTODSA-N Pro-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@@H]1CCCN1 ALJGSKMBIUEJOB-FXQIFTODSA-N 0.000 description 1
- AJLVKXCNXIJHDV-CIUDSAMLSA-N Pro-Ala-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O AJLVKXCNXIJHDV-CIUDSAMLSA-N 0.000 description 1
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 1
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 1
- CQZNGNCAIXMAIQ-UBHSHLNASA-N Pro-Ala-Phe Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O CQZNGNCAIXMAIQ-UBHSHLNASA-N 0.000 description 1
- NHDVNAKDACFHPX-GUBZILKMSA-N Pro-Arg-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O NHDVNAKDACFHPX-GUBZILKMSA-N 0.000 description 1
- OCSACVPBMIYNJE-GUBZILKMSA-N Pro-Arg-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O OCSACVPBMIYNJE-GUBZILKMSA-N 0.000 description 1
- XWYXZPHPYKRYPA-GMOBBJLQSA-N Pro-Asn-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XWYXZPHPYKRYPA-GMOBBJLQSA-N 0.000 description 1
- VOHFZDSRPZLXLH-IHRRRGAJSA-N Pro-Asn-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VOHFZDSRPZLXLH-IHRRRGAJSA-N 0.000 description 1
- FUVBEZJCRMHWEM-FXQIFTODSA-N Pro-Asn-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FUVBEZJCRMHWEM-FXQIFTODSA-N 0.000 description 1
- GDXZRWYXJSGWIV-GMOBBJLQSA-N Pro-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 GDXZRWYXJSGWIV-GMOBBJLQSA-N 0.000 description 1
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 1
- XKHCJJPNXFBADI-DCAQKATOSA-N Pro-Asp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O XKHCJJPNXFBADI-DCAQKATOSA-N 0.000 description 1
- QVIZLAUEAMQKGS-GUBZILKMSA-N Pro-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 QVIZLAUEAMQKGS-GUBZILKMSA-N 0.000 description 1
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 1
- SFECXGVELZFBFJ-VEVYYDQMSA-N Pro-Asp-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFECXGVELZFBFJ-VEVYYDQMSA-N 0.000 description 1
- XUSDDSLCRPUKLP-QXEWZRGKSA-N Pro-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 XUSDDSLCRPUKLP-QXEWZRGKSA-N 0.000 description 1
- NOXSEHJOXCWRHK-DCAQKATOSA-N Pro-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 NOXSEHJOXCWRHK-DCAQKATOSA-N 0.000 description 1
- CKXMGSJPDQXBPG-JYJNAYRXSA-N Pro-Cys-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O CKXMGSJPDQXBPG-JYJNAYRXSA-N 0.000 description 1
- UPJGUQPLYWTISV-GUBZILKMSA-N Pro-Gln-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UPJGUQPLYWTISV-GUBZILKMSA-N 0.000 description 1
- SKICPQLTOXGWGO-GARJFASQSA-N Pro-Gln-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O SKICPQLTOXGWGO-GARJFASQSA-N 0.000 description 1
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- VDGTVWFMRXVQCT-GUBZILKMSA-N Pro-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 VDGTVWFMRXVQCT-GUBZILKMSA-N 0.000 description 1
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 1
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 1
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 1
- FEVDNIBDCRKMER-IUCAKERBSA-N Pro-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEVDNIBDCRKMER-IUCAKERBSA-N 0.000 description 1
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 1
- IBGCFJDLCYTKPW-NAKRPEOUSA-N Pro-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 IBGCFJDLCYTKPW-NAKRPEOUSA-N 0.000 description 1
- KWMUAKQOVYCQJQ-ZPFDUUQYSA-N Pro-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 KWMUAKQOVYCQJQ-ZPFDUUQYSA-N 0.000 description 1
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 description 1
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 1
- DRKAXLDECUGLFE-ULQDDVLXSA-N Pro-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O DRKAXLDECUGLFE-ULQDDVLXSA-N 0.000 description 1
- JUJCUYWRJMFJJF-AVGNSLFASA-N Pro-Lys-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 JUJCUYWRJMFJJF-AVGNSLFASA-N 0.000 description 1
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 1
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 1
- WCNVGGZRTNHOOS-ULQDDVLXSA-N Pro-Lys-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O WCNVGGZRTNHOOS-ULQDDVLXSA-N 0.000 description 1
- WIPAMEKBSHNFQE-IUCAKERBSA-N Pro-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@@H]1CCCN1 WIPAMEKBSHNFQE-IUCAKERBSA-N 0.000 description 1
- RPLMFKUKFZOTER-AVGNSLFASA-N Pro-Met-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 RPLMFKUKFZOTER-AVGNSLFASA-N 0.000 description 1
- QGLFRQCECIWXFA-RCWTZXSCSA-N Pro-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1)O QGLFRQCECIWXFA-RCWTZXSCSA-N 0.000 description 1
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 1
- XYAFCOJKICBRDU-JYJNAYRXSA-N Pro-Phe-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O XYAFCOJKICBRDU-JYJNAYRXSA-N 0.000 description 1
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 1
- GFHOSBYCLACKEK-GUBZILKMSA-N Pro-Pro-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GFHOSBYCLACKEK-GUBZILKMSA-N 0.000 description 1
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 1
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 1
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 1
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 1
- BJCXXMGGPHRSHV-GUBZILKMSA-N Pro-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BJCXXMGGPHRSHV-GUBZILKMSA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- AJJDPGVVNPUZCR-RHYQMDGZSA-N Pro-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1)O AJJDPGVVNPUZCR-RHYQMDGZSA-N 0.000 description 1
- GXWRTSIVLSQACD-RCWTZXSCSA-N Pro-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1)O GXWRTSIVLSQACD-RCWTZXSCSA-N 0.000 description 1
- JDJMFMVVJHLWDP-UNQGMJICSA-N Pro-Thr-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JDJMFMVVJHLWDP-UNQGMJICSA-N 0.000 description 1
- VVAWNPIOYXAMAL-KJEVXHAQSA-N Pro-Thr-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VVAWNPIOYXAMAL-KJEVXHAQSA-N 0.000 description 1
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 1
- XNJVJEHDZPDPQL-BZSNNMDCSA-N Pro-Trp-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H]1CCCN1)C(O)=O XNJVJEHDZPDPQL-BZSNNMDCSA-N 0.000 description 1
- BNUKRHFCHHLIGR-JYJNAYRXSA-N Pro-Trp-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CC(=O)O)C(=O)O BNUKRHFCHHLIGR-JYJNAYRXSA-N 0.000 description 1
- VGFFUEVZKRNRHT-ULQDDVLXSA-N Pro-Trp-Glu Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CCC(=O)O)C(=O)O VGFFUEVZKRNRHT-ULQDDVLXSA-N 0.000 description 1
- DMNANGOFEUVBRV-GJZGRUSLSA-N Pro-Trp-Gly Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(=O)O)C(=O)[C@@H]1CCCN1 DMNANGOFEUVBRV-GJZGRUSLSA-N 0.000 description 1
- BXHRXLMCYSZSIY-STECZYCISA-N Pro-Tyr-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H]1CCCN1)C(O)=O BXHRXLMCYSZSIY-STECZYCISA-N 0.000 description 1
- VEUACYMXJKXALX-IHRRRGAJSA-N Pro-Tyr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VEUACYMXJKXALX-IHRRRGAJSA-N 0.000 description 1
- QKWYXRPICJEQAJ-KJEVXHAQSA-N Pro-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@@H]2CCCN2)O QKWYXRPICJEQAJ-KJEVXHAQSA-N 0.000 description 1
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 1
- ZAUHSLVPDLNTRZ-QXEWZRGKSA-N Pro-Val-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZAUHSLVPDLNTRZ-QXEWZRGKSA-N 0.000 description 1
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 1
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 1
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 1
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- 101100016023 Pseudomonas aeruginosa (strain ATCC 15692 / DSM 22644 / CIP 104116 / JCM 14847 / LMG 12228 / 1C / PRS 101 / PAO1) xcpV gene Proteins 0.000 description 1
- 241000589614 Pseudomonas stutzeri Species 0.000 description 1
- 108010003201 RGH 0205 Proteins 0.000 description 1
- 241000191023 Rhodobacter capsulatus Species 0.000 description 1
- 101100066772 Rhodobacter capsulatus (strain ATCC BAA-309 / NBRC 16581 / SB1003) nifF gene Proteins 0.000 description 1
- 241001478212 Riemerella anatipestifer Species 0.000 description 1
- 241001026379 Ruegeria pomeroyi DSS-3 Species 0.000 description 1
- 241001138501 Salmonella enterica Species 0.000 description 1
- 241001437645 Salmonella enterica subsp. enterica serovar Mbandaka Species 0.000 description 1
- 101100135915 Salmonella typhimurium (strain LT2 / SGSC1412 / ATCC 700720) pduE gene Proteins 0.000 description 1
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 1
- IDCKUIWEIZYVSO-WFBYXXMGSA-N Ser-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C)C(O)=O)=CNC2=C1 IDCKUIWEIZYVSO-WFBYXXMGSA-N 0.000 description 1
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 1
- QWZIOCFPXMAXET-CIUDSAMLSA-N Ser-Arg-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QWZIOCFPXMAXET-CIUDSAMLSA-N 0.000 description 1
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 1
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 1
- QGMLKFGTGXWAHF-IHRRRGAJSA-N Ser-Arg-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QGMLKFGTGXWAHF-IHRRRGAJSA-N 0.000 description 1
- COAHUSQNSVFYBW-FXQIFTODSA-N Ser-Asn-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O COAHUSQNSVFYBW-FXQIFTODSA-N 0.000 description 1
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 1
- CTRHXXXHUJTTRZ-ZLUOBGJFSA-N Ser-Asp-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O CTRHXXXHUJTTRZ-ZLUOBGJFSA-N 0.000 description 1
- VAIZFHMTBFYJIA-ACZMJKKPSA-N Ser-Asp-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O VAIZFHMTBFYJIA-ACZMJKKPSA-N 0.000 description 1
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 1
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 1
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 1
- KCFKKAQKRZBWJB-ZLUOBGJFSA-N Ser-Cys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O KCFKKAQKRZBWJB-ZLUOBGJFSA-N 0.000 description 1
- UCOYFSCEIWQYNL-FXQIFTODSA-N Ser-Cys-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCSC)C(O)=O UCOYFSCEIWQYNL-FXQIFTODSA-N 0.000 description 1
- XSYJDGIDKRNWFX-SRVKXCTJSA-N Ser-Cys-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XSYJDGIDKRNWFX-SRVKXCTJSA-N 0.000 description 1
- RFBKULCUBJAQFT-BIIVOSGPSA-N Ser-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CO)N)C(=O)O RFBKULCUBJAQFT-BIIVOSGPSA-N 0.000 description 1
- CDVFZMOFNJPUDD-ACZMJKKPSA-N Ser-Gln-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CDVFZMOFNJPUDD-ACZMJKKPSA-N 0.000 description 1
- VDVYTKZBMFADQH-AVGNSLFASA-N Ser-Gln-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 VDVYTKZBMFADQH-AVGNSLFASA-N 0.000 description 1
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 1
- YQQKYAZABFEYAF-FXQIFTODSA-N Ser-Glu-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQQKYAZABFEYAF-FXQIFTODSA-N 0.000 description 1
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 1
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 1
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 1
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 1
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 1
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 1
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 1
- UAJAYRMZGNQILN-BQBZGAKWSA-N Ser-Gly-Met Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UAJAYRMZGNQILN-BQBZGAKWSA-N 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- HMRAQFJFTOLDKW-GUBZILKMSA-N Ser-His-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O HMRAQFJFTOLDKW-GUBZILKMSA-N 0.000 description 1
- ZUDXUJSYCCNZQJ-DCAQKATOSA-N Ser-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N ZUDXUJSYCCNZQJ-DCAQKATOSA-N 0.000 description 1
- YMDNFPNTIPQMJP-NAKRPEOUSA-N Ser-Ile-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O YMDNFPNTIPQMJP-NAKRPEOUSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- HDBOEVPDIDDEPC-CIUDSAMLSA-N Ser-Lys-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O HDBOEVPDIDDEPC-CIUDSAMLSA-N 0.000 description 1
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 1
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 1
- OCWWJBZQXGYQCA-DCAQKATOSA-N Ser-Lys-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O OCWWJBZQXGYQCA-DCAQKATOSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- QJKPECIAWNNKIT-KKUMJFAQSA-N Ser-Lys-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QJKPECIAWNNKIT-KKUMJFAQSA-N 0.000 description 1
- AMRRYKHCILPAKD-FXQIFTODSA-N Ser-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N AMRRYKHCILPAKD-FXQIFTODSA-N 0.000 description 1
- FOOZNBRFRWGBNU-DCAQKATOSA-N Ser-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N FOOZNBRFRWGBNU-DCAQKATOSA-N 0.000 description 1
- XNXRTQZTFVMJIJ-DCAQKATOSA-N Ser-Met-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNXRTQZTFVMJIJ-DCAQKATOSA-N 0.000 description 1
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 1
- BUYHXYIUQUBEQP-AVGNSLFASA-N Ser-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N BUYHXYIUQUBEQP-AVGNSLFASA-N 0.000 description 1
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 1
- WNDUPCKKKGSKIQ-CIUDSAMLSA-N Ser-Pro-Gln Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O WNDUPCKKKGSKIQ-CIUDSAMLSA-N 0.000 description 1
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 description 1
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 1
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 1
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 1
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 1
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 1
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 1
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 1
- FGBLCMLXHRPVOF-IHRRRGAJSA-N Ser-Tyr-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FGBLCMLXHRPVOF-IHRRRGAJSA-N 0.000 description 1
- HXPNJVLVHKABMJ-KKUMJFAQSA-N Ser-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CO)N)O HXPNJVLVHKABMJ-KKUMJFAQSA-N 0.000 description 1
- HAYADTTXNZFUDM-IHRRRGAJSA-N Ser-Tyr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HAYADTTXNZFUDM-IHRRRGAJSA-N 0.000 description 1
- IAOHCSQDQDWRQU-GUBZILKMSA-N Ser-Val-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IAOHCSQDQDWRQU-GUBZILKMSA-N 0.000 description 1
- PCMZJFMUYWIERL-ZKWXMUAHSA-N Ser-Val-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMZJFMUYWIERL-ZKWXMUAHSA-N 0.000 description 1
- LLSLRQOEAFCZLW-NRPADANISA-N Ser-Val-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LLSLRQOEAFCZLW-NRPADANISA-N 0.000 description 1
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 1
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 1
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 1
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 1
- 241000314075 Shigella flexneri 1235-66 Species 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 235000021355 Stearic acid Nutrition 0.000 description 1
- 241001468227 Streptomyces avermitilis Species 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- 235000019486 Sunflower oil Nutrition 0.000 description 1
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 1
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 1
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 1
- GFDUZZACIWNMPE-KZVJFYERSA-N Thr-Ala-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O GFDUZZACIWNMPE-KZVJFYERSA-N 0.000 description 1
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 1
- XYEXCEPTALHNEV-RCWTZXSCSA-N Thr-Arg-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XYEXCEPTALHNEV-RCWTZXSCSA-N 0.000 description 1
- JMZKMSTYXHFYAK-VEVYYDQMSA-N Thr-Arg-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O JMZKMSTYXHFYAK-VEVYYDQMSA-N 0.000 description 1
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 1
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 1
- VOGXLRKCWFLJBY-HSHDSVGOSA-N Thr-Arg-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VOGXLRKCWFLJBY-HSHDSVGOSA-N 0.000 description 1
- JNQZPAWOPBZGIX-RCWTZXSCSA-N Thr-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N JNQZPAWOPBZGIX-RCWTZXSCSA-N 0.000 description 1
- CTONFVDJYCAMQM-IUKAMOBKSA-N Thr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H]([C@@H](C)O)N CTONFVDJYCAMQM-IUKAMOBKSA-N 0.000 description 1
- JBHMLZSKIXMVFS-XVSYOHENSA-N Thr-Asn-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JBHMLZSKIXMVFS-XVSYOHENSA-N 0.000 description 1
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 1
- YOSLMIPKOUAHKI-OLHMAJIHSA-N Thr-Asp-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YOSLMIPKOUAHKI-OLHMAJIHSA-N 0.000 description 1
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 1
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 1
- APIQKJYZDWVOCE-VEVYYDQMSA-N Thr-Asp-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O APIQKJYZDWVOCE-VEVYYDQMSA-N 0.000 description 1
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 1
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 1
- ODSAPYVQSLDRSR-LKXGYXEUSA-N Thr-Cys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O ODSAPYVQSLDRSR-LKXGYXEUSA-N 0.000 description 1
- UCCNDUPVIFOOQX-CUJWVEQBSA-N Thr-Cys-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 UCCNDUPVIFOOQX-CUJWVEQBSA-N 0.000 description 1
- WLDUCKSCDRIVLJ-NUMRIWBASA-N Thr-Gln-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O WLDUCKSCDRIVLJ-NUMRIWBASA-N 0.000 description 1
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 1
- UHBPFYOQQPFKQR-JHEQGTHGSA-N Thr-Gln-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UHBPFYOQQPFKQR-JHEQGTHGSA-N 0.000 description 1
- XXNLGZRRSKPSGF-HTUGSXCWSA-N Thr-Gln-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O XXNLGZRRSKPSGF-HTUGSXCWSA-N 0.000 description 1
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 1
- OQCXTUQTKQFDCX-HTUGSXCWSA-N Thr-Glu-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O OQCXTUQTKQFDCX-HTUGSXCWSA-N 0.000 description 1
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 1
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 1
- WYKJENSCCRJLRC-ZDLURKLDSA-N Thr-Gly-Cys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)O WYKJENSCCRJLRC-ZDLURKLDSA-N 0.000 description 1
- VYEHBMMAJFVTOI-JHEQGTHGSA-N Thr-Gly-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VYEHBMMAJFVTOI-JHEQGTHGSA-N 0.000 description 1
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 1
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 1
- CYVQBKQYQGEELV-NKIYYHGXSA-N Thr-His-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O CYVQBKQYQGEELV-NKIYYHGXSA-N 0.000 description 1
- XSTGOZBBXFKGHA-YJRXYDGGSA-N Thr-His-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O XSTGOZBBXFKGHA-YJRXYDGGSA-N 0.000 description 1
- WBCCCPZIJIJTSD-TUBUOCAGSA-N Thr-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H]([C@@H](C)O)N WBCCCPZIJIJTSD-TUBUOCAGSA-N 0.000 description 1
- YUOCMLNTUZAGNF-KLHWPWHYSA-N Thr-His-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N)O YUOCMLNTUZAGNF-KLHWPWHYSA-N 0.000 description 1
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 1
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 1
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 1
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 1
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 1
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 1
- ISLDRLHVPXABBC-IEGACIPQSA-N Thr-Leu-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ISLDRLHVPXABBC-IEGACIPQSA-N 0.000 description 1
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 1
- CJXURNZYNHCYFD-WDCWCFNPSA-N Thr-Lys-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O CJXURNZYNHCYFD-WDCWCFNPSA-N 0.000 description 1
- ZXIHABSKUITPTN-IXOXFDKPSA-N Thr-Lys-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O ZXIHABSKUITPTN-IXOXFDKPSA-N 0.000 description 1
- QNCFWHZVRNXAKW-OEAJRASXSA-N Thr-Lys-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNCFWHZVRNXAKW-OEAJRASXSA-N 0.000 description 1
- XSEPSRUDSPHMPX-KATARQTJSA-N Thr-Lys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O XSEPSRUDSPHMPX-KATARQTJSA-N 0.000 description 1
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 1
- CGCMNOIQVAXYMA-UNQGMJICSA-N Thr-Met-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CGCMNOIQVAXYMA-UNQGMJICSA-N 0.000 description 1
- KZURUCDWKDEAFZ-XVSYOHENSA-N Thr-Phe-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O KZURUCDWKDEAFZ-XVSYOHENSA-N 0.000 description 1
- JAJOFWABAUKAEJ-QTKMDUPCSA-N Thr-Pro-His Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O JAJOFWABAUKAEJ-QTKMDUPCSA-N 0.000 description 1
- GFRIEEKFXOVPIR-RHYQMDGZSA-N Thr-Pro-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O GFRIEEKFXOVPIR-RHYQMDGZSA-N 0.000 description 1
- BDENGIGFTNYZSJ-RCWTZXSCSA-N Thr-Pro-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O BDENGIGFTNYZSJ-RCWTZXSCSA-N 0.000 description 1
- OLFOOYQTTQSSRK-UNQGMJICSA-N Thr-Pro-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLFOOYQTTQSSRK-UNQGMJICSA-N 0.000 description 1
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 1
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 1
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 1
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 1
- MFMGPEKYBXFIRF-SUSMZKCASA-N Thr-Thr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFMGPEKYBXFIRF-SUSMZKCASA-N 0.000 description 1
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 1
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 1
- CSNBWOJOEOPYIJ-UVOCVTCTSA-N Thr-Thr-Lys Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O CSNBWOJOEOPYIJ-UVOCVTCTSA-N 0.000 description 1
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 1
- XGUAUKUYQHBUNY-SWRJLBSHSA-N Thr-Trp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O XGUAUKUYQHBUNY-SWRJLBSHSA-N 0.000 description 1
- BZTSQFWJNJYZSX-JRQIVUDYSA-N Thr-Tyr-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O BZTSQFWJNJYZSX-JRQIVUDYSA-N 0.000 description 1
- KAJRRNHOVMZYBL-IRIUXVKKSA-N Thr-Tyr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAJRRNHOVMZYBL-IRIUXVKKSA-N 0.000 description 1
- ABCLYRRGTZNIFU-BWAGICSOSA-N Thr-Tyr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O ABCLYRRGTZNIFU-BWAGICSOSA-N 0.000 description 1
- VMSSYINFMOFLJM-KJEVXHAQSA-N Thr-Tyr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCSC)C(=O)O)N)O VMSSYINFMOFLJM-KJEVXHAQSA-N 0.000 description 1
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 1
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 1
- SPIFGZFZMVLPHN-UNQGMJICSA-N Thr-Val-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SPIFGZFZMVLPHN-UNQGMJICSA-N 0.000 description 1
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 1
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 1
- 241000209140 Triticum Species 0.000 description 1
- 235000021307 Triticum Nutrition 0.000 description 1
- BDWDMRSGCXEDMR-WFBYXXMGSA-N Trp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N BDWDMRSGCXEDMR-WFBYXXMGSA-N 0.000 description 1
- PNHABSVRPFBUJY-UMPQAUOISA-N Trp-Arg-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O PNHABSVRPFBUJY-UMPQAUOISA-N 0.000 description 1
- PXQPYPMSLBQHJJ-WFBYXXMGSA-N Trp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N PXQPYPMSLBQHJJ-WFBYXXMGSA-N 0.000 description 1
- FKAPNDWDLDWZNF-QEJZJMRPSA-N Trp-Asp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FKAPNDWDLDWZNF-QEJZJMRPSA-N 0.000 description 1
- JISIQDCOHJOOPU-WFBYXXMGSA-N Trp-Cys-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O JISIQDCOHJOOPU-WFBYXXMGSA-N 0.000 description 1
- AFSYEUHJBVCPEL-JBACZVJFSA-N Trp-Gln-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 AFSYEUHJBVCPEL-JBACZVJFSA-N 0.000 description 1
- NXJZCPKZIKTYLX-XEGUGMAKSA-N Trp-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NXJZCPKZIKTYLX-XEGUGMAKSA-N 0.000 description 1
- DVIIYMVCSUQOJG-QEJZJMRPSA-N Trp-Glu-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DVIIYMVCSUQOJG-QEJZJMRPSA-N 0.000 description 1
- CZWIHKFGHICAJX-BPUTZDHNSA-N Trp-Glu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 CZWIHKFGHICAJX-BPUTZDHNSA-N 0.000 description 1
- WSGPBCAGEGHKQJ-BBRMVZONSA-N Trp-Gly-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WSGPBCAGEGHKQJ-BBRMVZONSA-N 0.000 description 1
- RRVUOLRWIZXBRQ-IHPCNDPISA-N Trp-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RRVUOLRWIZXBRQ-IHPCNDPISA-N 0.000 description 1
- VDUJEEQMRQCLHB-YTQUADARSA-N Trp-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O VDUJEEQMRQCLHB-YTQUADARSA-N 0.000 description 1
- LVTKHGUGBGNBPL-UHFFFAOYSA-N Trp-P-1 Chemical compound N1C2=CC=CC=C2C2=C1C(C)=C(N)N=C2C LVTKHGUGBGNBPL-UHFFFAOYSA-N 0.000 description 1
- JGLXHHQUSIULAK-OYDLWJJNSA-N Trp-Pro-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H]3CCCN3C(=O)[C@H](CC=3C4=CC=CC=C4NC=3)N)C(O)=O)=CNC2=C1 JGLXHHQUSIULAK-OYDLWJJNSA-N 0.000 description 1
- LORJKYIPJIRIRT-BVSLBCMMSA-N Trp-Pro-Tyr Chemical compound C([C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 LORJKYIPJIRIRT-BVSLBCMMSA-N 0.000 description 1
- KBKTUNYBNJWFRL-UBHSHLNASA-N Trp-Ser-Asn Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 KBKTUNYBNJWFRL-UBHSHLNASA-N 0.000 description 1
- RPTAWXPQXXCUGL-OYDLWJJNSA-N Trp-Trp-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O RPTAWXPQXXCUGL-OYDLWJJNSA-N 0.000 description 1
- QJBWZNTWJSZUOY-UWJYBYFXSA-N Tyr-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QJBWZNTWJSZUOY-UWJYBYFXSA-N 0.000 description 1
- NSOMQRHZMJMZIE-GVARAGBVSA-N Tyr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NSOMQRHZMJMZIE-GVARAGBVSA-N 0.000 description 1
- OOEUVMFKKZYSRX-LEWSCRJBSA-N Tyr-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OOEUVMFKKZYSRX-LEWSCRJBSA-N 0.000 description 1
- AKXBNSZMYAOGLS-STQMWFEESA-N Tyr-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AKXBNSZMYAOGLS-STQMWFEESA-N 0.000 description 1
- KDGFPPHLXCEQRN-STECZYCISA-N Tyr-Arg-Ile Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDGFPPHLXCEQRN-STECZYCISA-N 0.000 description 1
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 1
- CRWOSTCODDFEKZ-HRCADAONSA-N Tyr-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O CRWOSTCODDFEKZ-HRCADAONSA-N 0.000 description 1
- XMNDQSYABVWZRK-BZSNNMDCSA-N Tyr-Asn-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XMNDQSYABVWZRK-BZSNNMDCSA-N 0.000 description 1
- NSTPFWRAIDTNGH-BZSNNMDCSA-N Tyr-Asn-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NSTPFWRAIDTNGH-BZSNNMDCSA-N 0.000 description 1
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 1
- YGKVNUAKYPGORG-AVGNSLFASA-N Tyr-Asp-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YGKVNUAKYPGORG-AVGNSLFASA-N 0.000 description 1
- NRFTYDWKWGJLAR-MELADBBJSA-N Tyr-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O NRFTYDWKWGJLAR-MELADBBJSA-N 0.000 description 1
- MNMYOSZWCKYEDI-JRQIVUDYSA-N Tyr-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MNMYOSZWCKYEDI-JRQIVUDYSA-N 0.000 description 1
- UABYBEBXFFNCIR-YDHLFZDLSA-N Tyr-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UABYBEBXFFNCIR-YDHLFZDLSA-N 0.000 description 1
- FQNUWOHNGJWNLM-QWRGUYRKSA-N Tyr-Cys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FQNUWOHNGJWNLM-QWRGUYRKSA-N 0.000 description 1
- QUILOGWWLXMSAT-IHRRRGAJSA-N Tyr-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QUILOGWWLXMSAT-IHRRRGAJSA-N 0.000 description 1
- LOOCQRRBKZTPKO-AVGNSLFASA-N Tyr-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LOOCQRRBKZTPKO-AVGNSLFASA-N 0.000 description 1
- HVHJYXDXRIWELT-RYUDHWBXSA-N Tyr-Glu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O HVHJYXDXRIWELT-RYUDHWBXSA-N 0.000 description 1
- NZFCWALTLNFHHC-JYJNAYRXSA-N Tyr-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NZFCWALTLNFHHC-JYJNAYRXSA-N 0.000 description 1
- SLCSPPCQWUHPPO-JYJNAYRXSA-N Tyr-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SLCSPPCQWUHPPO-JYJNAYRXSA-N 0.000 description 1
- NJLQMKZSXYQRTO-FHWLQOOXSA-N Tyr-Glu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NJLQMKZSXYQRTO-FHWLQOOXSA-N 0.000 description 1
- JWGXUKHIKXZWNG-RYUDHWBXSA-N Tyr-Gly-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JWGXUKHIKXZWNG-RYUDHWBXSA-N 0.000 description 1
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 1
- QAYSODICXVZUIA-WLTAIBSBSA-N Tyr-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QAYSODICXVZUIA-WLTAIBSBSA-N 0.000 description 1
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 1
- ADECJAKCRKPSOR-ULQDDVLXSA-N Tyr-His-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O ADECJAKCRKPSOR-ULQDDVLXSA-N 0.000 description 1
- NENACTSCXYHPOX-ULQDDVLXSA-N Tyr-His-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(O)=O NENACTSCXYHPOX-ULQDDVLXSA-N 0.000 description 1
- USYGMBIIUDLYHJ-GVARAGBVSA-N Tyr-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 USYGMBIIUDLYHJ-GVARAGBVSA-N 0.000 description 1
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 1
- AXWBYOVVDRBOGU-SIUGBPQLSA-N Tyr-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N AXWBYOVVDRBOGU-SIUGBPQLSA-N 0.000 description 1
- GGXUDPQWAWRINY-XEGUGMAKSA-N Tyr-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GGXUDPQWAWRINY-XEGUGMAKSA-N 0.000 description 1
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 1
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 1
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 1
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 1
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 1
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 1
- KGSDLCMCDFETHU-YESZJQIVSA-N Tyr-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O KGSDLCMCDFETHU-YESZJQIVSA-N 0.000 description 1
- SBLZVFCEOCWRLS-BPNCWPANSA-N Tyr-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=C(C=C1)O)N SBLZVFCEOCWRLS-BPNCWPANSA-N 0.000 description 1
- YSGAPESOXHFTQY-IHRRRGAJSA-N Tyr-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N YSGAPESOXHFTQY-IHRRRGAJSA-N 0.000 description 1
- OFHKXNKJXURPSY-ULQDDVLXSA-N Tyr-Met-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O OFHKXNKJXURPSY-ULQDDVLXSA-N 0.000 description 1
- OKDNSNWJEXAMSU-IRXDYDNUSA-N Tyr-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 OKDNSNWJEXAMSU-IRXDYDNUSA-N 0.000 description 1
- WURLIFOWSMBUAR-SLFFLAALSA-N Tyr-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O WURLIFOWSMBUAR-SLFFLAALSA-N 0.000 description 1
- NVZVJIUDICCMHZ-BZSNNMDCSA-N Tyr-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O NVZVJIUDICCMHZ-BZSNNMDCSA-N 0.000 description 1
- CDBXVDXSLPLFMD-BPNCWPANSA-N Tyr-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDBXVDXSLPLFMD-BPNCWPANSA-N 0.000 description 1
- ARMNWLJYHCOSHE-KKUMJFAQSA-N Tyr-Pro-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O ARMNWLJYHCOSHE-KKUMJFAQSA-N 0.000 description 1
- QKXAEWMHAAVVGS-KKUMJFAQSA-N Tyr-Pro-Glu Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O QKXAEWMHAAVVGS-KKUMJFAQSA-N 0.000 description 1
- SOEGLGLDSUHWTI-STECZYCISA-N Tyr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 SOEGLGLDSUHWTI-STECZYCISA-N 0.000 description 1
- MNWINJDPGBNOED-ULQDDVLXSA-N Tyr-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 MNWINJDPGBNOED-ULQDDVLXSA-N 0.000 description 1
- RGYCVIZZTUBSSG-JYJNAYRXSA-N Tyr-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O RGYCVIZZTUBSSG-JYJNAYRXSA-N 0.000 description 1
- SOAUMCDLIUGXJJ-SRVKXCTJSA-N Tyr-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O SOAUMCDLIUGXJJ-SRVKXCTJSA-N 0.000 description 1
- QFXVAFIHVWXXBJ-AVGNSLFASA-N Tyr-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O QFXVAFIHVWXXBJ-AVGNSLFASA-N 0.000 description 1
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 1
- NHOVZGFNTGMYMI-KKUMJFAQSA-N Tyr-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NHOVZGFNTGMYMI-KKUMJFAQSA-N 0.000 description 1
- XYBNMHRFAUKPAW-IHRRRGAJSA-N Tyr-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XYBNMHRFAUKPAW-IHRRRGAJSA-N 0.000 description 1
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 1
- ITDWWLTTWRRLCC-KJEVXHAQSA-N Tyr-Thr-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ITDWWLTTWRRLCC-KJEVXHAQSA-N 0.000 description 1
- WYOBRXPIZVKNMF-IRXDYDNUSA-N Tyr-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 WYOBRXPIZVKNMF-IRXDYDNUSA-N 0.000 description 1
- KHPLUFDSWGDRHD-SLFFLAALSA-N Tyr-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O KHPLUFDSWGDRHD-SLFFLAALSA-N 0.000 description 1
- MJUTYRIMFIICKL-JYJNAYRXSA-N Tyr-Val-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MJUTYRIMFIICKL-JYJNAYRXSA-N 0.000 description 1
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 1
- PQPWEALFTLKSEB-DZKIICNBSA-N Tyr-Val-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PQPWEALFTLKSEB-DZKIICNBSA-N 0.000 description 1
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 1
- NXPDPYYCIRDUHO-ULQDDVLXSA-N Tyr-Val-His Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=C(O)C=C1 NXPDPYYCIRDUHO-ULQDDVLXSA-N 0.000 description 1
- OBKOPLHSRDATFO-XHSDSOJGSA-N Tyr-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OBKOPLHSRDATFO-XHSDSOJGSA-N 0.000 description 1
- RVGVIWNHABGIFH-IHRRRGAJSA-N Tyr-Val-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O RVGVIWNHABGIFH-IHRRRGAJSA-N 0.000 description 1
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 1
- WOCYUGQDXPTQPY-FXQIFTODSA-N Val-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N WOCYUGQDXPTQPY-FXQIFTODSA-N 0.000 description 1
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 1
- SMKXLHVZIFKQRB-GUBZILKMSA-N Val-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N SMKXLHVZIFKQRB-GUBZILKMSA-N 0.000 description 1
- JIODCDXKCJRMEH-NHCYSSNCSA-N Val-Arg-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N JIODCDXKCJRMEH-NHCYSSNCSA-N 0.000 description 1
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 1
- IVXJODPZRWHCCR-JYJNAYRXSA-N Val-Arg-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N IVXJODPZRWHCCR-JYJNAYRXSA-N 0.000 description 1
- UBTBGUDNDFZLGP-SRVKXCTJSA-N Val-Arg-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UBTBGUDNDFZLGP-SRVKXCTJSA-N 0.000 description 1
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 1
- AUMNPAUHKUNHHN-BYULHYEWSA-N Val-Asn-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N AUMNPAUHKUNHHN-BYULHYEWSA-N 0.000 description 1
- XQVRMLRMTAGSFJ-QXEWZRGKSA-N Val-Asp-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XQVRMLRMTAGSFJ-QXEWZRGKSA-N 0.000 description 1
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 1
- KXUKIBHIVRYOIP-ZKWXMUAHSA-N Val-Asp-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N KXUKIBHIVRYOIP-ZKWXMUAHSA-N 0.000 description 1
- ZQGPWORGSNRQLN-NHCYSSNCSA-N Val-Asp-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZQGPWORGSNRQLN-NHCYSSNCSA-N 0.000 description 1
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 1
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 1
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 1
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 1
- COSLEEOIYRPTHD-YDHLFZDLSA-N Val-Asp-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 COSLEEOIYRPTHD-YDHLFZDLSA-N 0.000 description 1
- CWSIBTLMMQLPPZ-FXQIFTODSA-N Val-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N CWSIBTLMMQLPPZ-FXQIFTODSA-N 0.000 description 1
- FPCIBLUVDNXPJO-XPUUQOCRSA-N Val-Cys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FPCIBLUVDNXPJO-XPUUQOCRSA-N 0.000 description 1
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 1
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 1
- YDPFWRVQHFWBKI-GVXVVHGQSA-N Val-Glu-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YDPFWRVQHFWBKI-GVXVVHGQSA-N 0.000 description 1
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 1
- NVPOPSZOSXDRSP-UHFFFAOYSA-N Val-Glu-Ile-Pro-Glu Natural products CC(C)C(N)C(=O)NC(CCC(O)=O)C(=O)NC(C(C)CC)C(=O)N1CCCC1C(=O)NC(CCC(O)=O)C(O)=O NVPOPSZOSXDRSP-UHFFFAOYSA-N 0.000 description 1
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 1
- FOADDSDHGRFUOC-DZKIICNBSA-N Val-Glu-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FOADDSDHGRFUOC-DZKIICNBSA-N 0.000 description 1
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 1
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 1
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 1
- BEGDZYNDCNEGJZ-XVKPBYJWSA-N Val-Gly-Gln Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O BEGDZYNDCNEGJZ-XVKPBYJWSA-N 0.000 description 1
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 1
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 1
- JVYIGCARISMLMV-HOCLYGCPSA-N Val-Gly-Trp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N JVYIGCARISMLMV-HOCLYGCPSA-N 0.000 description 1
- WJVLTYSHNXRCLT-NHCYSSNCSA-N Val-His-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WJVLTYSHNXRCLT-NHCYSSNCSA-N 0.000 description 1
- PTFPUAXGIKTVNN-ONGXEEELSA-N Val-His-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N PTFPUAXGIKTVNN-ONGXEEELSA-N 0.000 description 1
- CHWRZUGUMAMTFC-IHRRRGAJSA-N Val-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CNC=N1 CHWRZUGUMAMTFC-IHRRRGAJSA-N 0.000 description 1
- ZIGZPYJXIWLQFC-QTKMDUPCSA-N Val-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C(C)C)N)O ZIGZPYJXIWLQFC-QTKMDUPCSA-N 0.000 description 1
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 1
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 1
- PYXQBKJPHNCTNW-CYDGBPFRSA-N Val-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N PYXQBKJPHNCTNW-CYDGBPFRSA-N 0.000 description 1
- APEBUJBRGCMMHP-HJWJTTGWSA-N Val-Ile-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 APEBUJBRGCMMHP-HJWJTTGWSA-N 0.000 description 1
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 1
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 1
- GVJUTBOZZBTBIG-AVGNSLFASA-N Val-Lys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N GVJUTBOZZBTBIG-AVGNSLFASA-N 0.000 description 1
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 1
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 1
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 1
- IEBGHUMBJXIXHM-AVGNSLFASA-N Val-Lys-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N IEBGHUMBJXIXHM-AVGNSLFASA-N 0.000 description 1
- MBGFDZDWMDLXHQ-GUBZILKMSA-N Val-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MBGFDZDWMDLXHQ-GUBZILKMSA-N 0.000 description 1
- OJPRSVJGNCAKQX-SRVKXCTJSA-N Val-Met-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N OJPRSVJGNCAKQX-SRVKXCTJSA-N 0.000 description 1
- VENKIVFKIPGEJN-NHCYSSNCSA-N Val-Met-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VENKIVFKIPGEJN-NHCYSSNCSA-N 0.000 description 1
- JVGHIFMSFBZDHH-WPRPVWTQSA-N Val-Met-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N JVGHIFMSFBZDHH-WPRPVWTQSA-N 0.000 description 1
- MGVYZTPLGXPVQB-CYDGBPFRSA-N Val-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MGVYZTPLGXPVQB-CYDGBPFRSA-N 0.000 description 1
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 1
- NZGOVKLVQNOEKP-YDHLFZDLSA-N Val-Phe-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NZGOVKLVQNOEKP-YDHLFZDLSA-N 0.000 description 1
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 1
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 1
- AIWLHFZYOUUJGB-UFYCRDLUSA-N Val-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 AIWLHFZYOUUJGB-UFYCRDLUSA-N 0.000 description 1
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 1
- BGXVHVMJZCSOCA-AVGNSLFASA-N Val-Pro-Lys Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N BGXVHVMJZCSOCA-AVGNSLFASA-N 0.000 description 1
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 1
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 1
- QWCZXKIFPWPQHR-JYJNAYRXSA-N Val-Pro-Tyr Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QWCZXKIFPWPQHR-JYJNAYRXSA-N 0.000 description 1
- JQTYTBPCSOAZHI-FXQIFTODSA-N Val-Ser-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N JQTYTBPCSOAZHI-FXQIFTODSA-N 0.000 description 1
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 1
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 1
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 1
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 1
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 1
- VTIAEOKFUJJBTC-YDHLFZDLSA-N Val-Tyr-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VTIAEOKFUJJBTC-YDHLFZDLSA-N 0.000 description 1
- CFIBZQOLUDURST-IHRRRGAJSA-N Val-Tyr-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CS)C(=O)O)N CFIBZQOLUDURST-IHRRRGAJSA-N 0.000 description 1
- GUIYPEKUEMQBIK-JSGCOSHPSA-N Val-Tyr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(O)=O GUIYPEKUEMQBIK-JSGCOSHPSA-N 0.000 description 1
- PDASTHRLDFOZMG-JYJNAYRXSA-N Val-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 PDASTHRLDFOZMG-JYJNAYRXSA-N 0.000 description 1
- PGBMPFKFKXYROZ-UFYCRDLUSA-N Val-Tyr-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N PGBMPFKFKXYROZ-UFYCRDLUSA-N 0.000 description 1
- PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 description 1
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 description 1
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 1
- ZHWZDZFWBXWPDW-GUBZILKMSA-N Val-Val-Cys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O ZHWZDZFWBXWPDW-GUBZILKMSA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- XNLUVJPMPAZHCY-JYJNAYRXSA-N Val-Val-Phe Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 XNLUVJPMPAZHCY-JYJNAYRXSA-N 0.000 description 1
- 240000008042 Zea mays Species 0.000 description 1
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 1
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 1
- 241000319304 [Brevibacterium] flavum Species 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 238000005273 aeration Methods 0.000 description 1
- 108010045023 alanyl-prolyl-tyrosine Proteins 0.000 description 1
- 108010078114 alanyl-tryptophyl-alanine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 1
- 229910000147 aluminium phosphate Inorganic materials 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 229910021529 ammonia Inorganic materials 0.000 description 1
- 239000001099 ammonium carbonate Substances 0.000 description 1
- 235000012501 ammonium carbonate Nutrition 0.000 description 1
- 235000019270 ammonium chloride Nutrition 0.000 description 1
- 239000000908 ammonium hydroxide Substances 0.000 description 1
- 229910000148 ammonium phosphate Inorganic materials 0.000 description 1
- 235000019289 ammonium phosphates Nutrition 0.000 description 1
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 1
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 1
- 235000011130 ammonium sulphate Nutrition 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 239000002518 antifoaming agent Substances 0.000 description 1
- 108010080488 arginyl-arginyl-leucine Proteins 0.000 description 1
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 1
- 108010038850 arginyl-isoleucyl-tyrosine Proteins 0.000 description 1
- 239000012298 atmosphere Substances 0.000 description 1
- 230000002238 attenuated effect Effects 0.000 description 1
- 108010066270 beta-lactorphin Proteins 0.000 description 1
- GUBGYTABKSRVRQ-QUYVBRFLSA-N beta-maltose Chemical compound OC[C@H]1O[C@H](O[C@H]2[C@H](O)[C@@H](O)[C@H](O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@@H]1O GUBGYTABKSRVRQ-QUYVBRFLSA-N 0.000 description 1
- 230000031018 biological processes and functions Effects 0.000 description 1
- 229940041514 candida albicans extract Drugs 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 235000013877 carbamide Nutrition 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 239000001569 carbon dioxide Substances 0.000 description 1
- 229910002092 carbon dioxide Inorganic materials 0.000 description 1
- 239000004359 castor oil Substances 0.000 description 1
- 235000019438 castor oil Nutrition 0.000 description 1
- 239000003054 catalyst Substances 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 235000010980 cellulose Nutrition 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 238000001311 chemical methods and process Methods 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- FDJOLVPMNUYSCM-UVKKECPRSA-L cobalt(3+);[(2r,3s,4r,5s)-5-(5,6-dimethylbenzimidazol-1-yl)-4-hydroxy-2-(hydroxymethyl)oxolan-3-yl] [(2r)-1-[3-[(2r,3r,4z,7s,9z,12s,13s,14z,17s,18s,19r)-2,13,18-tris(2-amino-2-oxoethyl)-7,12,17-tris(3-amino-3-oxopropyl)-3,5,8,8,13,15,18,19-octamethyl-2,7, Chemical compound [Co+3].N#[C-].C1([C@H](CC(N)=O)[C@@]2(C)CCC(=O)NC[C@@H](C)OP([O-])(=O)O[C@H]3[C@H]([C@H](O[C@@H]3CO)N3C4=CC(C)=C(C)C=C4N=C3)O)[N-]\C2=C(C)/C([C@H](C\2(C)C)CCC(N)=O)=N/C/2=C\C([C@H]([C@@]/2(CC(N)=O)C)CCC(N)=O)=N\C\2=C(C)/C2=N[C@]1(C)[C@@](C)(CC(N)=O)[C@@H]2CCC(N)=O FDJOLVPMNUYSCM-UVKKECPRSA-L 0.000 description 1
- 239000003240 coconut oil Substances 0.000 description 1
- 235000019864 coconut oil Nutrition 0.000 description 1
- 230000002860 competitive effect Effects 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- OXBLHERUFWYNTN-UHFFFAOYSA-M copper(I) chloride Chemical compound [Cu]Cl OXBLHERUFWYNTN-UHFFFAOYSA-M 0.000 description 1
- 235000005822 corn Nutrition 0.000 description 1
- 239000010779 crude oil Substances 0.000 description 1
- 238000012136 culture method Methods 0.000 description 1
- 101150011756 ddrA gene Proteins 0.000 description 1
- 101150073964 ddrB gene Proteins 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 108010033011 des-Arg- enterostatin Proteins 0.000 description 1
- 230000000368 destabilizing effect Effects 0.000 description 1
- MNNHAPBLZZVQHP-UHFFFAOYSA-N diammonium hydrogen phosphate Chemical compound [NH4+].[NH4+].OP([O-])([O-])=O MNNHAPBLZZVQHP-UHFFFAOYSA-N 0.000 description 1
- ZPWVASYFFYYZEW-UHFFFAOYSA-L dipotassium hydrogen phosphate Chemical compound [K+].[K+].OP([O-])([O-])=O ZPWVASYFFYYZEW-UHFFFAOYSA-L 0.000 description 1
- 108010054812 diprotin A Proteins 0.000 description 1
- 238000005553 drilling Methods 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 101150041588 eutE gene Proteins 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 101150115850 fadB1 gene Proteins 0.000 description 1
- 239000003925 fat Substances 0.000 description 1
- 235000019197 fats Nutrition 0.000 description 1
- 238000000855 fermentation Methods 0.000 description 1
- 230000004151 fermentation Effects 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 101150019247 fldA gene Proteins 0.000 description 1
- 239000000446 fuel Substances 0.000 description 1
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 1
- 235000011187 glycerol Nutrition 0.000 description 1
- ZEMPKEQAKRGZGQ-XOQCFJPHSA-N glycerol triricinoleate Natural products CCCCCC[C@@H](O)CC=CCCCCCCCC(=O)OC[C@@H](COC(=O)CCCCCCCC=CC[C@@H](O)CCCCCC)OC(=O)CCCCCCCC=CC[C@H](O)CCCCCC ZEMPKEQAKRGZGQ-XOQCFJPHSA-N 0.000 description 1
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 1
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 1
- 108010023364 glycyl-histidyl-arginine Proteins 0.000 description 1
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 1
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010079413 glycyl-prolyl-glutamic acid Proteins 0.000 description 1
- 108010020688 glycylhistidine Proteins 0.000 description 1
- 235000013882 gravy Nutrition 0.000 description 1
- 101150008488 hadB gene Proteins 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 239000000411 inducer Substances 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 229910000358 iron sulfate Inorganic materials 0.000 description 1
- BAUYGSIQEAFULO-UHFFFAOYSA-L iron(2+) sulfate (anhydrous) Chemical compound [Fe+2].[O-]S([O-])(=O)=O BAUYGSIQEAFULO-UHFFFAOYSA-L 0.000 description 1
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 229940001882 lactobacillus reuteri Drugs 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 101150104734 ldh gene Proteins 0.000 description 1
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 1
- 108010087810 leucyl-seryl-glutamyl-leucine Proteins 0.000 description 1
- 108010072591 lysyl-leucyl-alanyl-arginine Proteins 0.000 description 1
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 1
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 1
- 235000019341 magnesium sulphate Nutrition 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 239000002207 metabolite Substances 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 1
- 108700023046 methionyl-leucyl-phenylalanine Proteins 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 1
- 235000019796 monopotassium phosphate Nutrition 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- WQEPLUUGTLDZJY-UHFFFAOYSA-N n-Pentadecanoic acid Natural products CCCCCCCCCCCCCCC(O)=O WQEPLUUGTLDZJY-UHFFFAOYSA-N 0.000 description 1
- QIQXTHQIDYTFRH-UHFFFAOYSA-N octadecanoic acid Chemical compound CCCCCCCCCCCCCCCCCC(O)=O QIQXTHQIDYTFRH-UHFFFAOYSA-N 0.000 description 1
- OQCDKBAXFALNLD-UHFFFAOYSA-N octadecanoic acid Natural products CCCCCCCC(C)CCCCCCCCC(O)=O OQCDKBAXFALNLD-UHFFFAOYSA-N 0.000 description 1
- 125000001477 organic nitrogen group Chemical group 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 235000019319 peptone Nutrition 0.000 description 1
- 210000001322 periplasm Anatomy 0.000 description 1
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 1
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 1
- PJNZPQUBCPKICU-UHFFFAOYSA-N phosphoric acid;potassium Chemical compound [K].OP(O)(O)=O PJNZPQUBCPKICU-UHFFFAOYSA-N 0.000 description 1
- 229910052698 phosphorus Inorganic materials 0.000 description 1
- 239000011574 phosphorus Substances 0.000 description 1
- 229920000058 polyacrylate Polymers 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 229920001522 polyglycol ester Polymers 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 108010077112 prolyl-proline Proteins 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 102000037983 regulatory factors Human genes 0.000 description 1
- 108091008025 regulatory factors Proteins 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 229910052708 sodium Inorganic materials 0.000 description 1
- 239000003549 soybean oil Substances 0.000 description 1
- 235000012424 soybean oil Nutrition 0.000 description 1
- 108010005652 splenotritin Proteins 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 239000008117 stearic acid Substances 0.000 description 1
- 238000003756 stirring Methods 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 239000002600 sunflower oil Substances 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 1
- 239000011573 trace mineral Substances 0.000 description 1
- 235000013619 trace mineral Nutrition 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 108010058119 tryptophyl-glycyl-glycine Proteins 0.000 description 1
- 108010015666 tryptophyl-leucyl-glutamic acid Proteins 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- 108010044292 tryptophyltyrosine Proteins 0.000 description 1
- 108010079202 tyrosyl-alanyl-cysteine Proteins 0.000 description 1
- 108010025432 tyrosyl-alanyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 108010072695 valyl-valyl-tyrosyl-proline Proteins 0.000 description 1
- 239000003981 vehicle Substances 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 239000003643 water by type Substances 0.000 description 1
- 239000012138 yeast extract Substances 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/62—Carboxylic acid esters
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/52—Genes encoding for enzymes or proenzymes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0008—Oxidoreductases (1.) acting on the aldehyde or oxo group of donors (1.2)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/88—Lyases (4.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/40—Preparation of oxygen-containing organic compounds containing a carboxyl group including Peroxycarboxylic acids
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Genetics & Genomics (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Biomedical Technology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Medicinal Chemistry (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
3-HPA로부터 3-HP-CoA 및 AA-CoA를 거쳐, AA를 생성하는 경로의 활성이 증가된, 아크릴산 생산능을 갖는 미생물 및 그를 이용하여 아크릴산을 생산하는 방법을 제공한다.
Description
신규한 아크릴산 생성 경로를 갖는 미생물 및 이를 이용한 아크릴산 생산 방법에 관한 것이다.
최근 원유가격의 급등에 의한 불안정성과 탄소배출저감이 글로벌 이슈화됨에 따라 기존 석유를 원료로 하여 화학 공정을 거쳐 생산하던 연료나 화학물질들을 탄소중립적인 생물학적 공정으로 대체하여 생산하려는 노력들이 지속되고 있다.
아크릴산은 10조 원 이상의 시장을 지닌 벌크 화합물 (bulk chemical)이다. 최근에는 친환경적 생산 방법에 대한 요구로 석유계 이외의 경로를 통한 아크릴산 생산 방법의 필요성이 커지고 있다.
비석유계 아크릴산 생산 경로로는, 글리세롤 또는 포도당으로부터 3-히드록시프로피온산 (3-HP)를 생산하고, 이를 화학적으로 분리 및 정제하는 방법이 있을 수 있다. 그러나, 이 방법은 생산된 3-HP를 배양물로부터 분리하고, 정제한 후 촉매를 사용하여 화학적으로 전환하는 단계를 포함한다. 따라서, 3-HP 생산 비용에 대하여 분리, 정제 및 전환 비용이 부가되어, 석유계 화합물 유래 아크릴산 생산법에 경쟁력이 높지 않을 수 있다.
종래의 기술에 의하더라도, 아크릴산 생산능을 갖는 대안적 미생물 및 이를 이용한 아크릴산 생산방법이 요구된다.
일 양상은 유전적으로 조작되지 않은 세포에 비하여 증가된 아크릴레이트 생산능을 갖는 미생물을 제공하는 것이다.
다른 양상은 상기 미생물을 배지 중에서 배양하는 단계;를 포함하는, 아크릴레이트를 생산하는 방법을 제공한다.
본 명세서에서 사용된 용어 효소 또는 폴리펩티드 또는 단백질의 "활성 증가" 또는 "증가된 활성"은 효소 또는 폴리펩티드 또는 단백질이 활성을 나타낼 수 있도록 충분한 정도로 증가된 것일 수 있으며, 세포 또는 단리된 폴리펩티드가 비교 가능한 동일 종의 세포 또는 그의 본래 폴리펩티드에서 측정된 활성 수준과 비교하여 높은 활성 수준을 나타냄을 의미한다. 즉 해당 폴리펩티드의 활성이 본래 조작되지 않은 폴리펩티드에 의한 동일한 생화학적 활성보다 약 5% 이상, 약 10% 이상, 약 15% 이상, 약 20% 이상, 약 30% 이상, 약 50% 이상, 약 60% 이상, 약 70% 이상, 또는 약 100% 이상 증가된 것일 수 있다. 증가된 활성을 갖는 폴리펩티드는 당업계에 공지된 임의의 방법을 사용하여 확인될 수 있다.
폴리펩티드의 활성 증가는 폴리펩티드의 발현증가 또는 비활성 (specific activity)의 증가에 의하여 얻을 수 있다. 상기 발현 증가는 폴리펩티드를 코딩하는 폴리뉴클레오티드가 세포에 도입되거나 세포 내 카피 수가 증가되거나, 또는 상기 폴리뉴클레오티드의 조절 영역의 변이에 의한 것일 수 있다. 외부에서 도입되거나 또는 카피 수가 증가되는 폴리뉴클레오티드는 내인성 (endogenous) 또는 외인성 (exogenous)일 수 있다. 상기 내인성 유전자는 미생물 내부에 포함된 유전물질 상에 존재하던 유전자를 말한다. 외인성 유전자는 숙주 세포 게놈으로 도입 (integration)되는 등의 숙주 세포 내로 유전자가 도입되는 것을 의미하며, 도입되는 유전자는 도입되는 숙주세포에 대해 동종 (homologous) 또는 이종 (heterologous)일 수 있다.
용어 "카피 수 증가 (copy number increase)"는 상기 유전자의 도입 또는 증폭에 의한 것일 수 있으며, 조작되지 않은 세포에 존재하지 않는 유전자를 유전적 조작에 의해 갖게 되는 경우도 포함한다. 상기 유전자의 도입은 벡터와 같은 비히클을 매개하여 이루어질 수 있다. 상기 도입은 상기 유전자가 게놈에 통합되지 않은 임시적 (transient) 도입이거나 게놈에 삽입되는 것일 수 있다. 상기 도입은 예를 들면, 목적하는 폴리펩티드를 코딩하는 폴리뉴클레오티드가 삽입된 벡터를 상기 세포로 도입한 후, 상기 벡터가 세포 내에서 복제되거나 상기 폴리뉴클레오티드가 게놈으로 통합됨으로써 이루어질 수 있다.
용어 "유전자 (gene)"는 특정 단백질을 발현하는 핵산 단편을 의미하며, 코딩영역 또는 코딩영역 외 5'-비코딩 서열 (5'-non coding 서열)과 3'-비코딩 서열 (3'-non coding 서열) 등의 조절 (regulatory) 서열을 포함할 수 있다. 상기 조절 영역은 프로모터, 인핸서, 오퍼레이터, 리보좀 결합 부위, polyA 결합 서열, 터미네이터 영역 등을 포함할 수 있다.
"이종성 (heterologous)"은 천연 (native)이 아닌 외인성 (foreign)을 의미한다.
"분비 (secretion)"는 물질이 세포 내부에서 주변세포질공간 (periplasmic space)이나 세포 외 환경으로 이동되는 것을 의미한다.
"세포 (cell)", "균주 (strain)", 또는 "미생물 (microorganism)"은 교체 사용이 가능한 것으로서, 박테리아, 효모, 곰팡이 등을 포함한다.
"아크릴산 (acrylic acid)"은 아크릴산 또는 아크릴레이트, 또는 그 염을 포함하며 이들 서로 교환가능하게 사용될 수 있다. 아크릴산은 미생물의 발효 또는 효소 반응에 의해 생산될 수 있다.
효소 또는 폴리펩티드의 "활성의 감소" 또는 "감소된 활성"은 세포 또는 단리된 효소 또는 폴리펩티드가 비교 가능한 동일 종의 세포 또는 그의 본래 폴리펩티드에서 측정된 활성 수준과 비교하여 낮은 활성 수준을 나타내거나 활성을 나타내지 않는 것을 의미한다. 즉 해당 폴리펩티드의 활성이 본래 조작되지 않은 폴리펩티드에 의한 동일한 생화학적 활성보다 약 10%이상, 약 20%이상, 약 30%이상, 약 40%이상, 약 50% 이상, 약 55% 이상, 약 60% 이상, 약 70% 이상, 약 75% 이상, 약 80% 이상, 약 85% 이상, 약 90% 이상, 약 95% 이상, 또는 약 100% 감소된 것일 수 있다. 감소된 효소 활성은 당업계에 공지된 임의의 방법을 사용하여 확인될 수 있다. 상기 활성의 감소는 효소가 발현되더라도 효소의 활성이 없거나 감소된 경우 또는 효소를 코딩하는 유전자가 발현되지 않거나 발현되더라도 본래 조작이 되지 않은 유전자에 비하여 발현량이 감소된 경우를 포함한다.
상기 효소의 활성이 감소되는 것은 상기 효소를 코딩하는 유전자의 제거 또는 파괴에 의한 것일 수 있다. 유전자의 "제거 (deletion)" 또는 "파괴 (disruption)"는 유전자가 발현되지 않거나 발현량이 감소되거나 발현되어도 효소 활성을 나타내지 않거나 활성이 감소되도록, 유전자의 일부 또는 전부가, 또는 그 프로모터, 그 터미네이터 영역 등의 조절 인자의 일부 또는 전부가 변이, 치환, 삭제되거나 유전자에 하나 이상의 염기가 삽입되는 것을 말한다. 상기 유전자의 제거 또는 파괴는 상동 재조합과 같은 유전자 조작, 돌연변이 유발, 분자 진화를 통해 달성될 수 있다. 세포가 복수 개의 같은 유전자를 포함하거나 2개 이상의 다른 폴리펩티드 동종상동유전자 (paralog)를 포함하는 경우, 하나 또는 그 이상의 유전자가 제거 또는 파괴될 수 있다.
본 발명의 핵산 또는 폴리펩티드의 "서열 동일성 (sequence identity)"은 특정 비교 영역에서 양 서열을 최대한 일치되도록 얼라인시킨 후 서열간의 염기 또는 아미노산 잔기의 동일한 정도를 의미한다. 서열 동일성은 특정 비교 영역에서 2개의 서열을 최적으로 얼라인하여 비교함으로써 측정되는 값으로서, 비교 영역 내에서 서열의 일부는 대조 서열 (reference sequence)과 비교하여 부가 또는 삭제되어 있을 수 있다. 서열 동일성 백분율은 예를 들면, 비교 영역 전체에서 두 개의 최적으로 정렬된 서열을 비교하는 단계, 두 서열 모두에서 동일한 아미노산 또는 핵산이 나타나는 위치의 갯수를 결정하여 일치된 (matched) 위치의 갯수를 수득하는 단계, 상기 일치된 위치의 갯수를 비교 범위 내의 위치의 총 갯수 (즉, 범위 크기)로 나누는 단계, 및 상기 결과에 100을 곱하여 서열 동일성의 백분율을 수득하는 단계에 의해 계산될 수 있다. 상기 서열 동일성의 퍼센트는 공지의 서열 비교 프로그램을 사용하여 결정될 수 있으며, 상기 프로그램의 일례로 BLASTN (NCBI), CLC Main Workbench (CLC bio), MegAlignTM (DNASTAR Inc) 등을 들 수 있다.
여러 종의 동일하거나 유사한 기능이나 활성을 가지는 폴리펩티드 또는 폴리뉴클레오티드를 확인하는데 있어서 여러 수준의 서열 동일성을 사용할 수 있다. 예를 들어, 50%이상, 55%이상, 60%이상, 65%이상, 70%이상, 75%이상, 80%이상, 85%이상, 90%이상, 95%이상, 96%이상, 97%이상, 98%이상, 99%이상 또는 100% 등을 포함하는 서열 동일성이다.
일 양상은 유전적으로 조작되지 않은 세포에 비하여 3-히드록시프로피온알데히드 (3-HPA)를 3-히드록시프로피오닐-CoA (3-HP-CoA)로 전환하는 것을 촉매하는 CoA acylating aldehyde dehydrogenase (ALDH), 및 3-HP-CoA를 아크릴릴-CoA (acrylyl-CoA)로 전환하는 것을 촉매하는 3-HP-CoA 데히드라타제의 활성이 증가되어 있는, 아크릴레이트 생산능을 갖는 미생물을 제공한다.
상기 ALDH는 EC 1.2.1.10, 또는 EC 1.2.1.87에 속하는 것일 수 있다. 상기 ALDH는 3-히드록시프로피온알데히드 (3-HPA)를 3-히드록시프로피오닐-CoA (3-HP-CoA)로 전환하는 것을 촉매하는 활성이 그 역반응을 촉매하는 것보다 높은 것일 수 있다. 상기 ALDH는 서열번호 1 내지 20의 아미노산 서열과 65%이상, 예를 들어, 70%이상, 80%이상, 85%이상, 90%이상, 91%이상, 92%이상, 93%이상, 94%이상, 95%이상, 96%이상, 97%이상, 98%이상, 99%이상, 또는 100%의 서열 동일성을 가지는 아미노산 서열을 포함할 수 있다. 상기 ALDH를 코딩하는 폴리뉴클레오티드는 서열번호 1 내지 20의 아미노산 서열과 65%이상, 예를 들어, 70%이상, 80%이상, 85%이상, 90%이상, 91%이상, 92%이상, 93%이상, 94%이상, 95%이상, 96%이상, 97%이상, 98%이상, 99%이상, 또는 100%의 서열 동일성을 가지는 아미노산 서열을 코딩하는 것일 수 있다. 상기 ALDH를 코딩하는 폴리뉴클레오티드는 서열번호 21 내지 40의 뉴클레오티드 서열과 95%이상의 서열 동일성을 갖는 것일 수 있다. 상기 ALDH는 표 1 및 2에 나타낸 것 중 하나 이상인 것일 수 있다. 상기 ALDH는 그 명칭과 관계없이, 하기 반응을 촉매하는 것일 수 있다. 상기 ALDH는 CoA-acylating propionaldehyde dehydrogenase, aldehyde dehydrogenase, alcohol dehydrogenase, CoA-dependent aldehyde dehydrogenase 또는 그 조합일 수 있다. 상기 ALDH는 pduP, 예를 들면, Lactobacillus reuteri 유래의 pduP일 수 있다.
3-HPA + CoA + NAD(P)+ -> 3-HP-CoA + NAD(P)H
번호 | EC | 카테고리 | 소스균주 | 유전자명 | 구입처 | 서열* |
1 | 1.2.1.10 | 50S ribosomal protein L29 | Lactobacillus reuteri DSM 20016 | Lreu_1735 | KCTC 3594 | 1/21 |
2 | 1.2.1.10 | CoA-dependent propionaldehyde dehydrogenase | Lactobacillus brevis ATCC 367 | LVIS_1603 | ATCC 367 | 2/22 |
3 | 1.2.1.10 | aldehyde dehydrogenase | Pediococcus acidilactici | HMPREF9024_01049 | KCTC 1626 | 3/23 |
4 | 1.2.1.10 | CoA-dependent propionaldehyde dehydrogenase | Pediococcus claussenii ATCC BAA-344 | pduP | DSM 14800 | 4/24 |
5 | 1.2.1.10 | PduP protein | Lactobacillus collinoides | pduP | KCTC 5050 | 5/25 |
6 | 1.2.1.10 | CoA-dependent propionaldehyde dehydrogenase | Listeria welshimeri serovar 6b str. SLCC5334 | NC_008555.1:1134599..1136008 | ATCC 35897 | 6/26 |
7 | 1.2.1.10 | hypothetical protein lin1129 | Listeria innocua Clip11262 | NC_003212.1:1144168..1145577 | ATCC 33090 | 7/27 |
8 | 1.2.1.10 | propanediol utilization Co-A dependent propionaldehyde dehydrogenase | Listeria monocytogenes ATCC 19117 | pduP | ATCC 19117 | 8/28 |
9 | 1.2.1.10 | ethanolamine utilization protein EutE | Listeria marthii FSL S4-120 | NT05LM_1376 | ATCC BAA-1595 | 9/29 |
10 | 1.2.1.10 | putative ethanolamine utilization protein EutE | Listeria ivanovii subsp. ivanovii PAM 55 | LIV_1097 | ATCC BAA-678 | 10/30 |
11 | 1.2.1.10 | CoA-dependent propionaldehyde dehydrogenase | Listeria seeligeri serovar 1/2b str. SLCC3954 | pduP | ATCC 35967 | 11/31 |
12 | 1.2.1.10 | aldehyde dehydrogenase | Shewanella putrefaciens CN-32 | NC_009438.1:221466..222860 | ATCC BAA-453 | 12/32 |
13 | 1.2.1.10 | aldehyde dehydrogenase family protein | Kosakonia radicincitans DSM 16656 | Y71_5889 | DSM 16656 | 13/33 |
14 | 1.2.1.10 | Aldehyde Dehydrogenase | Tolumonas auensis DSM 9187 | NC_012691.1:1861535..1862938 | DSM 9187 | 14/34 |
15 | 1.2.1.10 | hypothetical protein CKO_00785 | Citrobacter koseri ATCC BAA-895 | NC_009792.1:757825..759210 | ATCC BAA-895 | 15/35 |
16 | 1.2.1.10 | propanediol utilization CoA-dependent propionaldehyde dehydrogenase | Yersinia enterocolitica subsp. enterocolitica 8081 | NC_008800.1:2975153..2976541 | ATCC 9610 | 16/36 |
17 | 1.2.1.10 | aldehyde dehydrogenase EutE | Salmonella enterica subsp. enterica serovar Mbandaka str. ATCC 51958 | SEEM1958_22984 | ATCC 51958 | 17/37 |
18 | 1.2.1.10 | putative propanediol utilization protein:CoA-dependent propionaldehyde dehydrogenase | Yersinia mollaretii ATCC 43969 | ymoll0001_15900 | ATCC 43969 | 18/38 |
19 | 1.2.1.10 | CoA-dependent proprionaldehyde dehydrogenase pduP | Escherichia fergusonii ATCC 35469 | NC_011740.1:2070780..2072162 | ATCC 35469 | 19/39 |
20 | 1.2.1.10 | putative CoA-dependent proprionaldehyde dehydrogenase | Salmonella enterica subsp. enterica serovar Urbana str. ATCC 9261 | eutE | ATCC 9261 | 20/40 |
* 서열은 아미노산 서열번호/뉴클레오티드 서열번호를 나타낸다.
상기 3-HP-CoA 데히드라타제는 EC 4.2.1.17, EC 4.2.1.55, 및 EC 4.2.1.166를 포함한, EC 4.2.1.-에 속하는 것일 수 있다. 상기 3-HP-CoA 데히드라타제는 3-HP-CoA를 아크릴릴-CoA (acrylyl-CoA)로 전환하는 것을 촉매하는 활성이 그 역반응을 촉매하는 것보다 높은 것일 수 있다. 상기 3-HP-CoA 데히드라타제는 서열번호 41 내지 119의 아미노산 서열과 65%이상, 예를 들어, 70%이상, 80%이상, 85%이상, 90%이상, 91%이상, 92%이상, 93%이상, 94%이상, 95%이상, 96%이상, 97%이상, 98%이상, 99%이상, 또는 100%의 서열 동일성을 가지는 아미노산 서열을 포함할 수 있다. 상기 3-HP-CoA 데히드라타제를 코딩하는 폴리뉴클레오티드는 서열번호 41 내지 119의 아미노산 서열과 65%이상, 예를 들어, 70%이상, 80%이상, 85%이상, 90%이상, 91%이상, 92%이상, 93%이상, 94%이상, 95%이상, 96%이상, 97%이상, 98%이상, 99%이상, 또는 100%의 서열 동일성을 가지는 아미노산 서열을 코딩하는 것일 수 있다. 상기 3-HP-CoA 데히드라타제를 코딩하는 폴리뉴클레오티드는 서열번호 120 내지 198의 뉴클레오티드 서열과 95%이상의 서열 동일성을 갖는 것일 수 있다. 상기 3-HP-CoA 데히드라타제는 표 3 내지 표 6에 나타낸 것 중 하나 이상인 것일 수 있다. 표 3 내지 표 6에 나타낸 효소는 E2 타입일 수 있다. 표 3 내지 6에서 "서열*"은 아미노산 서열번호/뉴클레오티드 서열번호를 나타낸다.
번호 | EC | 카테고리 | 소스 균주 | 유전자 | 구입체 | 서열* |
1 | 4.2.1.- | 3-hydroxybutyryl-CoA dehydratase(Crotonase) | Dictyostelium discoideum (Slime mold) | Q869N6 | DSM947 | 41/120 |
2 | 4.2.1.55 | 3-hydroxybutyryl-CoA dehydratase(Crotonase) | Clostridium acetobutylicum | crt CA_C2712 | KCTC1790 | 42/121 |
3 | 4.2.1.55 | 3-hydroxybutyryl-CoA dehydratase(Crotonase) | Clostridium difficile | crt ech | KCTC5009 | 43/122 |
4 | 4.2.1.55 | 3-hydroxybutyryl-CoA dehydratase(Crotonase) | Clostridium pasteurianum | F502_09038 | KCTC1674 | 44/123 |
5 | 4.2.1.55 | 3-hydroxybutyryl-CoA dehydratase(Crotonase) | Clostridium pasteurianum | F502_06297 | KCTC1674 | 45/124 |
6 | 4.2.1.55 | 3-hydroxybutyryl-CoA dehydratase(Crotonase) | Megasphaera elsdenii | MELS_1449 | KCTC5187 | 46/125 |
7 | 4.2.1.116 | 3-hydroxybutyryl-CoA dehydratase(Crotonase) | Metallosphaera sedula | Msed_2001 | DSM5348 | 47/126 |
8 | 4.2.1.55 | 3-hydroxybutyryl-CoA dehydratase(Crotonase) | Clostridicum kluyvery | crt1 | DSM555 | 48/127 |
9 | 4.2.1.- | 4-hydroxybutyryl-CoA dehydratase | Sulfolobus tokodaii | STK_16590 | DSM16993 | 49/128 |
10 | 4.2.1.- | 4-hydroxybutyryl-CoA dehydratase | Geobacter metallireducens | Gmet_2215 | DSM7210 | 50/129 |
11 | 4.2.1.- | 4-hydroxybutyryl-CoA dehydratase | Sulfolobus solfataricus | abfD-1 | DSM1617 | 51/130 |
12 | 4.2.1.- | 4-hydroxybutyryl-CoA dehydratase | Syntrophobacter fumaroxidans | Sfum_3141 | DSM10017 | 52/131 |
13 | 4.2.1.- | 4-hydroxybutyryl-CoA dehydratase | Porphyromonas gingivalis | PGN_0727 | DSM20709 | 53/132 |
14 | 4.2.1.- | 4-hydroxybutyryl-CoA dehydratase | Polynucleobacter necessarius subsp. Asymbioticus | Pnuc_0370 | DSM18221 | 54/133 |
15 | 4.2.1.116 | 3-hydroxypropionyl-CoA dehydratase | Sulfolobus tokodaii | STK_15160 | DSM16993 | 55/134 |
16 | 4.2.1.- | 3-hydroxypropionyl-CoA dehydratase | Gordonia terrae C-6 | GTC6_11571 | KCTC9807 | 56/135 |
17 | 4.2.1.- | 3-hydroxypropionyl-CoA dehydratase | Halalkalicoccus jeotgali | HacjB3_17558 C497_07209 | DSM18796 | 57/136 |
18 | 4.2.1.- | 3-hydroxypropionyl-CoA dehydratase | Carboxydothermus hydrogenoformans | CHY_1739 | DSM6008 | 58/137 |
19 | 4.2.1.55 | 3-hydroxypropionyl-CoA dehydratase | Thermomicrobium roseum | trd_0041 | DSM5159 | 59/138 |
20 | 4.2.1.17 | 3-hydroxypropionyl-CoA dehydratase | Methylobacterium extorquens | croA METDI5699 | DSM1337 | 60/139 |
번호 | EC | 카테고리 | 소스 균주 | 유전자 | 구입처 | 서열* |
21 | 4.2.1.- | R-phenyllactate dehydratase | Clostridium sporogenes |
fldB | KCTC5654 | 61/140 |
22 | 4.2.1.- | R-phenyllactate dehydratase | fldC | KCTC5654 | 62/141 | |
23 | 4.2.1.- | R-phenyllactate dehydratase | fldI | KCTC5654 | 63/142 | |
24 | 4.2.1.- | R-phenyllactate dehydratase | fldA | KCTC5654 | 64/143 | |
25 | 4.2.1.- | R-phenyllactate dehydratase | Lachnoanaerobaculum saburreum |
fldC HMPREF0381_2734 | DSM3986 | 65/144 |
26 | 4.2.1.- | R-phenyllactate dehydratase | fldB HMPREF0381_2735 | DSM3986 | 66/145 | |
27 | 4.2.1.- | R-phenyllactate dehydratase | fldI2 HMPREF0381_2736 | DSM3986 | 67/146 | |
28 | 4.2.1.- | R-phenyllactate dehydratase | Peptostreptococcus stomatis |
fldI HMPREF0634_1391 | DSM17678 | 68/147 |
29 | 4.2.1.- | R-phenyllactate dehydratase | HMPREF0634_1028 | DSM17678 | 69/148 | |
30 | 4.2.1.- | R-phenyllactate dehydratase | fldB HMPREF0634_1029 | DSM17678 | 70/149 | |
31 | 4.2.1.- | 2-hydroxyisocaproyl-CoA dehydratase | Clostridium difficile |
hadB | KCTC5009 | 71/150 |
32 | 4.2.1.- | 2-hydroxyisocaproyl-CoA dehydratase | hadC | KCTC5009 | 72/151 | |
33 | 4.2.1.- | 2-hydroxyisocaproyl-CoA dehydratase | hadI | KCTC5009 | 73/152 | |
34 | 4.2.1.- | 2-hydroxyisocaproyl-CoA dehydratase | hadA | KCTC5009 | 74/153 | |
35 | 4.2.1.17 | Enoyl-CoA hydratase | Escherichia coli (strain K12) | paaF | 보유 | 75/154 |
36 | 4.2.1.17 | Enoyl-CoA hydratase | Rhodobacter capsulatus | fadB1 | KCTC2583 | 76/155 |
37 | 4.2.1.- | Enoyl-CoA hydratase | Pseudomonas stutzeri | PSTAA_0117 | DSM4166 | 77/156 |
38 | 4.2.1.- | Enoyl-CoA hydratase | Haliangium ochraceum | Hoch_4602 | DSM14365 | 78/157 |
39 | 4.2.1.- | Enoyl-CoA hydratase | Anoxybacillus flavithermus | Aflv_0566 | DSM21510 | 79/158 |
40 | 4.2.1.- | Enoyl-CoA hydratase | Streptomyces avermitilis | echA3 SAV_717 | DSM46492 | 80/159 |
41 | 4.2.1.- | Enoyl-CoA hydratase | Advenella kashmirensis | TKWG_10020 | DSM17095 | 81/160 |
번호 | EC | 카테고리 | 소스 균주 | 유전자 | 구입처 | 서열* |
42 | 4.2.1.- | Enoyl-CoA hydratase | Oligotropha carboxidovorans | OCA5_c12950 OCAR_6780 | DSM1227 | 82/161 |
43 | 4.2.1.- | Enoyl-CoA hydratase | Riemerella anatipestifer | Riean_1526 RA0C_1812 | DSM15868 | 83/162 |
44 | 4.2.1.- | Enoyl-CoA hydratase | Fusobacterium necrophorum subsp. funduliforme Fnf 1007 |
HMPREF1127_1435 | DSM19678 | 84/163 |
45 | 4.2.1.- | Enoyl-CoA hydratase | HMPREF1127_1434 | DSM19678 | 85/164 | |
46 | 4.2.1.- | Enoyl-CoA hydratase | HMPREF1127_1436 | DSM19678 | 86/165 | |
47 | 4.2.1.- | Enoyl-CoA hydratase | Desulfosporosinus youngiae DSM 17734 |
DesyoDRAFT_3696 | DSM17734 | 87/166 |
48 | 4.2.1.- | Enoyl-CoA hydratase | DesyoDRAFT_3695 | DSM17734 | 88/167 | |
49 | 4.2.1.- | Enoyl-CoA hydratase | DesyoDRAFT_3697 | DSM17734 | 89/168 | |
50 | 4.2.1.- | Enoyl-CoA hydratase | Peptoniphilus indolicus ATCC 29427 |
fldB HMPREF9129_0353 | KCTC15023 | 90/169 |
51 | 4.2.1.- | Enoyl-CoA hydratase | HMPREF9129_0354 | KCTC15023 | 91/170 | |
52 | 4.2.1.- | Enoyl-CoA hydratase | HMPREF9129_0352 | KCTC15023 | 92/171 | |
53 | 4.2.1.- | Enoyl-CoA hydratase | Desulfosporosinus meridiei (strain ATCC BAA-275 / DSM 13257 / NCIMB 13706 / S10) |
Desmer_1800 | DSM13257 | 93/172 |
54 | 4.2.1.- | Enoyl-CoA hydratase | Desmer_1801 | DSM13257 | 94/173 | |
55 | 4.2.1.- | Enoyl-CoA hydratase | Desmer_1799 | DSM13257 | 95/174 | |
56 | 4.2.1.- | 2-hydroxyglutaryl-CoA dehydratase | Acidaminococcus fermentans |
hgdA Acfer_1815 | DSM20731 | 96/175 |
57 | 4.2.1.- | 2-hydroxyglutaryl-CoA dehydratase | hgdB Acfer_1815 | DSM20731 | 97/176 | |
58 | 4.2.1.- | 2-hydroxyglutaryl-CoA dehydratase | hgdC Acfer_1815 | DSM20731 | 98/177 | |
59 | 4.2.1.- | 2-hydroxyglutaryl-CoA dehydratase | Carboxydothermus hydrogenoformans |
hgdB CHY_0846 | DSM6008 | 99/178 |
60 | 4.2.1.- | 2-hydroxyglutaryl-CoA dehydratase | hgdA CHY_0847 | DSM6008 | 100/179 | |
61 | 4.2.1.- | 2-hydroxyglutaryl-CoA dehydratase | hgdC CHY_0848 | DSM6008 | 101/180 | |
62 | 4.2.1.- | 2-hydroxyglutaryl-CoA dehydratase | Oscillibacter valericigenes |
hgdC OBV_10870 | DSM18026 | 102/181 |
63 | 4.2.1.- | 2-hydroxyglutaryl-CoA dehydratase | hgdA OBV_10880 | DSM18026 | 103/182 | |
64 | 4.2.1.- | 2-hydroxyglutaryl-CoA dehydratase | hgdB OBV_10890 | DSM18026 | 104/183 |
번호 | EC | 카테고리 | 소스 균주 | 유전자 | 구입처 | 서열* |
65 | 4.2.1.- | 2-hydroxyglutaryl-CoA dehydratase | Desulfosporosinus orientis (strain ATCC 19365 / DSM 765 / NCIMB 8382 / VKM B-1628) (Desulfotomaculum orientis) |
Desor_3092 | DSM765 | 105/184 |
66 | 4.2.1.- | 2-hydroxyglutaryl-CoA dehydratase | Desor_3093 | DSM765 | 106/185 | |
67 | 4.2.1.- | 2-hydroxyglutaryl-CoA dehydratase | Desor_3091 | DSM765 | 107/186 | |
68 | 4.2.1.- | 2-hydroxyglutaryl-CoA dehydratase | Peptostreptococcus anaerobius CAG:621 |
BN738_00824 | KCTC5182 | 108/187 |
69 | 4.2.1.- | 2-hydroxyglutaryl-CoA dehydratase | BN738_00823 | KCTC5182 | 109/188 | |
70 | 4.2.1.- | 2-hydroxyglutaryl-CoA dehydratase | BN738_00825 | KCTC5182 | 110/189 | |
71 | 4.2.1.- | 2-hydroxyglutaryl-CoA dehydratase | Chloroflexus aggregans (strain MD-66 / DSM 9485) | Cagg_1174 | DSM9485 | 111/190 |
72 | 4.2.1.17 | 2-hydroxyglutaryl-CoA dehydratase | Marivirga tractuosa (strain ATCC 23168 / DSM 4126 / NBRC 15989 / NCIMB 1408 / VKM B-1430 / H-43) (Microscilla tractuosa) (Flexibacter tractuosus) | Ftrac_3721 | KCTC2958 | 112/191 |
73 | 4.2.1.- | 2-hydroxyglutaryl-CoA dehydratase | Marinithermus hydrothermalis (strain DSM 14884 / JCM 11576 / T1) | Marky_1278 | DSM14884 | 113/192 |
74 | 4.2.1.- | 2-hydroxyglutaryl-CoA dehydratase | Chitinophaga pinensis (strain ATCC 43595 / DSM 2588 / NCIB 11800 / UQM 2034) | Cpin_6304 | KCTC3412 | 114/193 |
75 | 4.2.1.- | 2-hydroxyglutaryl-CoA dehydratase | Megasphaera elsdenii DSM 20460 | MELS_0744 | KCTC5187 | 115/194 |
76 | 4.2.1.- | 2-hydroxyglutaryl-CoA dehydratase | Megasphaera elsdenii DSM 20460 | MELS_0745 | KCTC5187 | 116/195 |
77 | 4.2.1.- | 2-hydroxyglutaryl-CoA dehydratase | Megasphaera elsdenii DSM 20460 | MELS_0746 | KCTC5187 | 117/196 |
78 | 4.2.1.- | 2-hydroxyglutaryl-CoA dehydratase | Chloroflexus aurantiacus (strain ATCC 29364 / DSM 637 / Y-400-fl) | Chy400_0108 | DSM635 | 118/197 |
79 | 4.2.1.- | enoyl-CoA hydrastase | Ruegeria pomeroyi DSS-3 | SP00147 | DSM15171 | 119/198 |
상기 미생물은 아크릴릴-CoA를 아크릴레이트로 전환하는 것을 촉매하는 효소의 활성이 더 증가되어 있는 것일 수 있다.
상기 아크릴릴-CoA를 아크릴레이트로 전환하는 것을 촉매하는 효소는 EC 3.1.2.4를 포함한, EC 3.1.2-에 속하는 것일 수 있다. 상기 아크릴릴-CoA를 아크릴레이트로 전환하는 것을 촉매하는 효소는 3-HP-CoA hydrolase, 또는 3-hydroxyisobutyryl-CoA hydrolase인 것일 수 있다. 상기 아크릴릴-CoA를 아크릴레이트로 전환하는 것을 촉매하는 효소는 아크릴릴-CoA를 아크릴레이트로 전환하는 것을 촉매하는 활성이 그 역반응을 촉매하는 것보다 높은 것일 수 있다. 상기 아크릴릴-CoA를 아크릴레이트로 전환하는 것을 촉매하는 효소는 서열번호 199 내지 204의 아미노산 서열과 65%이상, 예를 들어, 70%이상, 80%이상, 85%이상, 90%이상, 91%이상, 92%이상, 93%이상, 94%이상, 95%이상, 96%이상, 97%이상, 98%이상, 99%이상, 또는 100%의 서열 동일성을 가지는 아미노산 서열을 포함할 수 있다. 상기 아크릴릴-CoA를 아크릴레이트로 전환하는 것을 촉매하는 효소를 코딩하는 폴리뉴클레오티드는 서열번호 199 내지 204의 아미노산 서열과 65%이상, 예를 들어, 70%이상, 80%이상, 85%이상, 90%이상, 91%이상, 92%이상, 93%이상, 94%이상, 95%이상, 96%이상, 97%이상, 98%이상, 99%이상, 또는 100%의 서열 동일성을 가지는 아미노산 서열을 코딩하는 것일 수 있다. 상기 아크릴릴-CoA를 아크릴레이트로 전환하는 것을 촉매하는 효소를 코딩하는 폴리뉴클레오티드는 서열번호 205 내지 210의 뉴클레오티드 서열과 95%이상의 서열 동일성을 갖는 것일 수 있다. 상기 아크릴릴-CoA를 아크릴레이트로 전환하는 것을 촉매하는 효소는 표 7에 나타낸 것 중 하나 이상인 것일 수 있다. 표 7에 나타낸 효소는 E3 타입일 수 있다. 표 7에서, "서열*"은 아미노산 서열번호/뉴클레오티드 서열번호를 나타낸다.
번호 | EC | 카테고리 | 소스 균주 | 유전자명 | 구입 | 서열* |
1 | 3.1.2.- | Acyl - CoA thioester hydrolase | E. coli | yciA | 보유 | 199/205 |
2 | 3.1.2.- | Acyl-CoA thioester hydrolase | Klebsiella oxytoca 10-5245 | HMPREF9689_01673 | KCTC1686 | 200/206 |
3 | 3.1.2.- | Acyl-CoA thioester hydrolase | Cronobacter turicensis | yciA | 보유 | 201/207 |
4 | 3.1.2.- | Acyl-CoA thioester hydrolase | Citrobacter freundii | D186_20262 | 보유 | 202/208 |
5 | 3.1.2.- | Acyl-CoA thioester hydrolase | Salmonella enterica | SeI_A1458 | DSM5569 | 203/209 |
6 | 3.1.2.- | Acyl-CoA thioester hydrolase | Shigella flexneri 1235-66 | SF123566_2028 | 보유 | 204/210 |
상기 미생물은 유전적으로 조작되지 않은 세포에 비하여 상기 2개 효소 또는 3개 효소의 유전자의 발현이 증가되도록 유전적으로 조작된 미생물일 수 있다. 상기 3개 효소의 활성이 모(parent) 세포에 이미 존재하는 경우에는 유전적 조작을 통해 상기 2개 효소 또는 3개 효소의의 발현을 더욱 증가시킬 수 있거나, 야생형 미생물에 존재하지 않는 경우에는 유전적 조작의 방법으로 상기 2개 효소 또는 3개 효소를 코딩하는 유전자를 모세포에 도입하여 발현 또는 과발현되도록 할 수 있다. 상기 유전적으로 조작되지 않은 세포는 야생형 또는 상기 미생물이 유래된 모세포를 의미한다.
상기 2개 또는 3개 효소 유전자의 발현 또는 과발현은 당업자에게 알려진 여러 방법에 의해 달성될 수 있다. 일례로 유전자 카피 수를 증가시키거나, 유도제 (inducer) 또는 억제제 (repressor)와 같은 조절 물질을 사용하여 발현을 증가시킬 수 있다. 상기 카피 수 증가는 상기 유전자의 도입 또는 증폭에 의한 것일 수 있다. 즉, 작동조절 가능하도록 연결된 조절 인자 및 상기 2개 또는 3개 효소 유전자를 포함하는 벡터, 발현 카세트 등을 숙주세포에 도입함으로써 달성될 수 있다.
또는 2개 또는 3개 효소의 활성의 증가는 상기 유전자의 발현 조절 서열의 변형에 의한 것일 수 있다. 상기 조절 서열은 상기 유전자 발현을 위한 프로모터 서열 또는 전사 종결자 서열일 수 있다. 또한, 상기 조절 서열은 유전자 발현에 영향을 줄 수 있는 모티프를 코딩하는 서열일 수 있다. 상기 모티프는 예를 들면, 이차 구조-안정화 모티프, RNA 불안정화 모티프, 스플라이스-활성화 모티프, 폴리아데닐화 모티프, 아데닌-풍부 서열 (adenine-rich sequence), 또는 엔도뉴클레아제 인식 부위일 수 있다.
상기 미생물은 박테리아, 효모, 및 곰팡이로 이루어진 군으로부터 선택될 수 있다. 예를 들면, 에세리키아 (Escherichia), 루멘박테리아, 코리네박테리움 (Corynebacterium) 속 및 브레비박테리움 (Brevibacterium) 속으로 구성된 군으로부터 선택되는 것일 수 있다. 상기 세포는 코리네박테리움 속일 수 있다. 상기 미생물은 대장균 (E.coli), 코리네박테리움 글루타미쿰 (Corynebacterium glutamicum), 코리네박테리움 써모아미노게네스 (Corynebacterium thermoaminogenes), 브리비박테리움 플라붐 (Brevibacterium flavum) 및 브리비박테리움 락토페르멘툼 (Brevibacterium lactofermentum)으로 구성된 군으로부터 선택되는 미생물일 수 있다.
상기 미생물은 아크릴산을 천연으로 생산하거나 또는 재조합 방법으로 아크릴산을 생산하도록 유전적으로 조작된 것일 수 있다. 또한, 상기 미생물은 3-HPA 생산능을 갖는 것일 수 있다. 상기 미생물이 천연적으로 3-HPA를 생산하지 하지 않는 경우, 3-HPA를 생산하도록 유전적으로 조작된 것일 수 있다. 상기 미생물은 글리세롤을 3-프로피온산 알데히드 (3-propionic aldehyde: 3-HPA)로 전환하는 반응을 촉매하는 효소를 코딩하는 유전자가 도입되어 있어, 3-HPA 생산능을 갖는 것일 수 있다. 상기 미생물은 예를 들면, 대장균을 포함한 에세리키아 속 미생물일 수 있다. 상기 글리세롤을 3-HPA로 전환하는 반응을 촉매하는 효소는 글리세롤 데히드라타제 (glycerol dehydratase: GDH)일 수 있다.
상기 GDH는 글리세롤을 3-HPA로의 전환을 촉매하는 것이면 어느 것이 포함될 수 있다. 상기 GDH는 EC 4.2.1.30 또는 디올 데히드라타제 (EC 4.2.1.28)에 속하는 것일 수 있다. 상기 GDH 및 그를 코딩하는 뉴클레오티드는 Ilyobacter polytropus, Klebsiella pneumoniae, Citrobacter freundii, Clostritidium pasteurianum, Salmonella typhimurium, 또는 Klebsiella oxytoca로부터 유래한 것일 수 있다. 각 경우에 있어서, 상기 GDH는 세 개 서브유닛으로 구성될 수 있다: 큰 (large) 또는 "α" 서브유닛, 중간 (medium) 또는 "β" 서브유닛, 및 작은 (small) 또는 "γ" 서브유닛. GDH의 큰 (large) 또는 "α" 서브유닛을 코딩하는 유전자는 dhaB1, gldA 및 dhaB를 포함할 수 있다. 중간 (medium) 또는 "β" 서브유닛를 코딩하는 유전자는 dhaB2, gldB 및 dhaC를 포함할 수 있다. 작은 (small) 또는 "γ" 서브유닛을 코딩하는 유전자는 dhaB3, gldC 및 dhaE를 포함할 수 있다. 디올 데히드라타제의 큰 (large) 또는 "α" 서브유닛을 코딩하는 유전자는 pduC 및 pddA를 포함할 수 있다. 디올 데히드라타제의 중간 (medium) 또는 "β" 서브유닛를 코딩하는 유전자는 pduD 및 pddB를 포함할 수 있다. 디올 데히드라타제의 작은 (small) 또는 "γ" 서브유닛을 코딩하는 유전자는 pduE 및 pddC를 포함할 수 있다. 표 8 및 표 9는 GDH 및 GDH 연결된 기능에 대한 유전자 명칭 및 GenBank 참조를 비교한 것이다. 상기 GDH는 Ilyobacter polytropus 유래의 dhaB1, dhaB2, 및 dhaB3를 포함하는 것일 수 있다. Ilyobacter polytropus 유래의 DhaB1, DhaB2, 및 DhaB3는 각각 서열번호 211,212,및 213의 아미노산 서열을 갖는 것일 수 있다. dhaB1 유전자, dhaB2 유전자, 및 dhaB3 유전자는 각각 서열번호 211,212,및 213의 아미노산 서열을 코딩하는 것일 수 있다. Ilyobacter polytropus 유래의 dhaB1 유전자, dhaB2 유전자, 및 dhaB3 유전자는 각각 서열번호 214, 215, 및 216을 갖는 것일 수 있다.
개체 (GenBank참조번호) |
유전자 기능 | |||||||
조절 |
미지 |
재활성화 (reactivation) |
미지 |
|||||
유전자 | 염기쌍 | 유전자 | 염기쌍 | 유전자 | 염기쌍 | 유전자 | 염기쌍 | |
K.pneumoniae(U30903) | orf2c | 7116-7646 | orf2b | 6762-7115 | orf2a | 5125-5556 | ||
K.pneumoniae(U60992) | GdrB | |||||||
C.freundii(U09771) | dhaR | 3746-5671 | orfW | 5649-6179 | orfX | 6180-6533 | orfY | 7736-8164 |
C.pasteurianum(AF051373) | ||||||||
C.pasteurianum(AF026270) | orfW | 210-731 | orfX | 1-196 | orfY | 746-1177 | ||
S.typhimurium(AF026270) | pduH | 8274-8645 | ||||||
K.oxytoca(AF017781) | DdrB | 2063-2440 | ||||||
K.oxytoca(AF051373) |
개체 (GenBank참조번호) |
유전자 기능 | |||||||
데히드라타제,α | 데히드라타제,α |
데히드라타제,α | 재활성화 (reactivation) |
|||||
유전자 | 염기쌍 | 유전자 | 염기쌍 | 유전자 | 염기쌍 | 유전자 | 염기쌍 | |
K.pneumoniae(U30903) | dhaB1 | 3047-4714 | dhaB2 | 2450-2890 | dhaB3 | 2022-2447 | orf2a | 186-2009 |
K.pneumoniae(U60992) | gldA | 121-1788 | gldB | 1801-2382 | gldB | 2388-2813 | gdrA | |
C.freundii(U09771) | dhaB | 8556-10223 | dhaC | 10235-10819 | dhaC | 10822-11250 | orfY | 11261-13072 |
C.pasteurianum(AF051373) | dhaB | 84-1748 | dhaC | 1779-2318 | dhaC | 2333-2773 | 2790-4598 | |
C.pasteurianum(AF026270) | orfY | |||||||
S.typhimurium(AF026270) | pduC | 3557-5221 | pduD | 5232-5906 | pduD | 5921-6442 | 6452-8284 | |
K.oxytoca(AF017781) | 241-2073 | |||||||
K.oxytoca(AF051373) | pddA | 121-1785 | pddB | 1796-2470 | pddB | 2485-3006 |
상기 GDH는 Ilyobacter polytropus 유래의 dhaB1, dhaB2, 및 dhaB3의 각 서열과 65%이상, 예를 들어, 70%이상, 80%이상, 85%이상, 90%이상, 91%이상, 92%이상, 93%이상, 94%이상, 95%이상, 96%이상, 97%이상, 98%이상, 99%이상, 또는 100%의 서열 동일성을 가지는 아미노산 서열을 포함할 수 있다.
상기 미생물은 글리세롤 데하이드라타제 재활성화효소 (glycerol dehydratase reactivase: GDR)를 코딩하는 폴리뉴클레오티드를 더 포함하는 것일 수 있다. 글리세롤 및 디올 데히드라타제는 글리세롤 및 다른 일부 기질에 의한 기작-근거한 자살 불활성화의 대상이 된다 (Daniel et al., FEMS Microbiol. Rev. 22, 553(1999)). 용어 "글리세롤 데하이드라타제 재활성화효소 (glycerol dehydratase reactivase: GDR)"는 상기 데히드라타제 활성을 재활성화하는 단백질을 나타낸다. 용어 "데히드라타제 재활성화 활성 (dehydratase reactivating activity)"은 기질을 촉매할 수 없는 데히드라타제를 기질을 촉매할 수 있는 것으로 전환하는 현상 또는 데히드라타제의 저해를 억제하는 현상 또는 인 비보에서 데히드라타제 효소의 유용한 반감기 (half-life)를 연장시키는 현상을 나타낸다. 상기 GDR은 dhaB, gdrA, pduG 및 ddrA 중 하나 이상일 수 있다. 또한, GDR은 orfX, orf2b, gdrB, pduH 및 ddrB 중 하나 이상일 수 있다.
상기 GDR은 K.pneumoniae (U60992) 유래의 gdrA 및 gdrB로서 각각 서열번호 217 및 218의 아미노산 서열을 갖는 것일 수 있다. 또는, 상기 GDR은 Ilyobacter polytropus 유래의 gdrA 및 gdrB로서 각각 서열번호 219 및 220의 아미노산 서열을 갖는 것일 수 있다. 상기 GDR은 서열번호 217 내지 220의 아미노산 서열과 각각 65%이상, 예를 들어, 70%이상, 80%이상, 85%이상, 90%이상, 91%이상, 92%이상, 93%이상, 94%이상, 95%이상, 96%이상, 97%이상, 98%이상, 99%이상, 또는 100%의 서열 동일성을 가지는 아미노산 서열을 포함할 수 있다. GdrA 및 GdrB을 코딩하는 유전자는 각각 서열번호 217 내지 220의 아미노산 서열을 코딩하는 서열을 갖는 것, 예를 들면, 서열번호 221 내지 224의 각 뉴클레오티드 서열을 갖는 것일 수 있다.
상기 미생물에 있어서, GDH를 코딩하는 폴리뉴클레오티드, 및 GDR을 코딩하는 폴리뉴클레오티드 중 하나 이상은 유전적으로 조작되지 않은 미생물에 비하여 더 높은 수준으로 발현되는 것일 수 있다. 상기 발현 수준은 mRNA 또는 단백질 수준의 발현일 수 있다. 상기 단백질 수준의 발현은 발현된 단백질의 양 또는 활성에 근거한 것일 수 있다. 상기 발현 수준은 약 5% 이상, 약 10% 이상, 약 15% 이상, 약 20% 이상, 약 30% 이상, 약 50% 이상, 약 60% 이상, 약 70% 이상, 약 100% 이상, 200% 이상, 또는 300% 이상 증가된 것일 수 있다.
상기 미생물은 3-HPA 생산능을 갖는 것일 수 있다. 상기 미생물에 있어서, 상기 발현 증가는 유전적으로 조작되지 않은 미생물에 비하여 더 높은 수준의 3-HPA를 생산하도록 하는 것일 수 있다. 상기 3-HPA의 생산은 세포 내에서 생산되는 것, 세포 내에서 생산되어 세포 외부로 분비되는 것, 또는 그 조합을 포함한다. 세포 내에서 생산된 3-HPA은 아크릴산과 같은 다른 대사 산물로부터 전환될 수 있다. 상기 3-HPA 생산은 약 5% 이상, 약 10% 이상, 약 15% 이상, 약 20% 이상, 약 30% 이상, 약 50% 이상, 약 60% 이상, 약 70% 이상, 약 100% 이상, 200% 이상, 또는 300% 이상 증가된 것일 수 있다.
상기 발현 증가는 폴리펩티드를 코딩하는 폴리뉴클레오티드가 세포에 도입되거나 세포 내 카피 수가 증가되거나, 또는 상기 폴리뉴클레오티드의 조절 영역의 변이에 의한 것일 수 있다. 외부에서 도입되거나 또는 카피 수가 증가되는 폴리뉴클레오티드는 내인성 (endogenous) 또는 외인성 (exogenous)일 수 있다. 상기 내인성 유전자는 미생물 내부에 포함된 유전물질 상에 존재하던 유전자를 말한다. 외인성 유전자는 숙주 세포 게놈으로 도입 (integration)되는 등의 숙주 세포 내로 유전자가 도입되는 것을 의미하며, 도입되는 유전자는 도입되는 숙주세포에 대해 동종 (homologous) 또는 이종 (heterologous)일 수 있다.
상기 미생물은 아크릴레이트를 분해하거나 다른 산물로 전환하는 경로에 관여하는 하나 이상의 효소의 활성이 감소된 것일 수 있다. 상기 미생물은 아크릴레이트를 분해하거나 다른 산물로 전환하는 경로에 관여하는 하나 이상의 효소를 코딩하는 유전자가 제거 또는 파괴되어 있는 것일 수 있다.
또한, 상기 미생물은 아크릴레이트를 다른 산물로 전환하는 경로를 더 포함하는 것일 수 있다. 상기 미생물에 있어서, 아크릴산의 생산은 세포내, 또는 세포 내에서 생성되어 분비되는 것일 수 있다. 따라서, 상기 미생물은 아크릴산을 세포 내에 생성하고, 이를 다른 산물로 전환하는데 관여하는 경로, 예를 들면, 효소 유전자 및 그 발현물을 더 포함할 수 있다. 상기 다른 산물은 아크릴레이트 에스테르일 수 있다.
상기 미생물은 피루베이트로부터 락테이트를 합성하는 경로가 불활성화 또는 감쇄된 것일 수 있다. 상기 미생물은 락테이트 데히드로게나아제(lactate dehydrogenase, LDH)의 활성이 제거되거나 감소된 것일 수 있다. 상기 LDH는 피루베이트를 락테이트로 전환하는 반응을 촉매하는 활성을 가질 수 있다. 상기 LDH는 EC.1.1.1.27로 분류되는 효소일 수 있다. 일례로 상기 LDH는 서열번호 225의 아미노산 서열과 65%이상, 예를 들어, 70%이상, 80%이상, 85%이상, 90%이상, 91%이상, 92%이상, 93%이상, 94%이상, 95%이상, 96%이상, 97%이상, 98%이상, 99%이상, 또는 100%의 서열 동일성을 가지는 아미노산 서열을 포함할 수 있다. 상기 미생물은 락테이트 데히드로게나아제를 코딩하는 유전자가 파괴 또는 제거된 것일 수 있다. 상기 LDH 유전자는 서열번호 225의 아미노산 서열과 65%이상, 예를 들어, 70%이상, 80%이상, 85%이상, 90%이상, 91%이상, 92%이상, 93%이상, 94%이상, 95%이상, 96%이상, 97%이상, 98%이상, 99%이상, 또는 100%의 서열 동일성을 가지는 아미노산 서열을 코딩하는 것일 수 있다.
다른 양상은 상기한 바와 같은 미생물을 배지 중에서 배양하는 단계;를 포함하는, 아크릴레이트를 생산하는 방법을 제공한다.
상기 배양은 당업계에 알려진 적당한 배지와 배양조건에 따라 이루어질 수 있다. 통상의 기술자라면 선택되는 미생물에 따라 배지 및 배양조건을 용이하게 조정하여 사용할 수 있다. 배양 방법은 회분식, 연속식, 유가식, 또는 이들의 조합 배양을 포함할 수 있다. 상기 미생물은 아크릴레이트를 세포 외부로 분비하는 것일 수 있다.
상기 배지는 다양한 탄소원, 질소원 및 미량원소 성분을 포함할 수 있다.
상기 탄소원은, 예를 들면, 포도당, 자당, 유당, 과당, 말토오스, 전분, 셀룰로오스와 같은 탄수화물, 대두유, 해바라기유, 피마자유, 코코넛유와 같은 지방, 팔미트산, 스테아린산, 리놀레산과 같은 지방산, 글리세롤 및 에탄올과 같은 알코올, 아세트산과 같은 유기산, 또는 이들의 조합을 포함할 수 있다. 상기 배양은 글루코스를 탄소원으로 하여 수행될 수 있다. 상기 질소원은, 펩톤, 효모 추출물, 육즙, 맥아 추출물, 옥수수 침지액 (CSL), 및 대두밀과 같은 유기 질소원 및 요소, 황산암모늄, 염화암모늄, 인산암모늄, 탄산암모늄 및 질산암모늄과 같은 무기 질소원, 또는 이들의 조합을 포함할 수 있다. 상기 배지는 인의 공급원으로서, 예를 들면, 인산이수소칼륨, 인산수소이칼륨 및 상응하는 소듐-함유 염, 황산마그네슘 또는 황산철과 같은 금속염을 포함할 수 있다. 또한, 아미노산, 비타민, 및 적절한 전구체 등이 배지에 포함될 수 있다. 상기 배지 또는 개별 성분은 배양액에 회분식, 유가식 또는 연속식으로 첨가될 수 있다.
또한, 배양 중에 달리 pH를 조절하지 않거나, 수산화암모늄, 수산화칼륨, 암모니아, 인산 및 황산과 같은 화합물을 미생물 배양액에 적절한 방식으로 첨가하여 배양액의 pH를 조정할 수 있다. 또한, 배양 중에 지방산 폴리글리콜 에스테르와 같은 소포제를 사용하여 기포 생성을 억제할 수 있다.
상기 배양은 미호기성 조건에서 배양하는 것일 수 있다. 용어 "미호기성 조건 (microaerobic condition)"은 정상 대기상태보다 산소의 함량이 적은 양의 산소를 가진 공기가 배양액과 접촉하는 상태에서 배양액에 공급되는 산소의 양을 의미한다. 미호기성 조건은 예를 들면, 대기중 공기에 이산화탄소 또는 질소를 약 0.1 내지 0.4 vvm, 약 0.2 내지 0.3 vvm, 또는 약 0.25 vvm의 유속으로 공급하여 조성될 수 있다. 또한, 미호기성 조건은 통기 속도가 약 0 내지 0.4 vvm, 약 0.1 내지 0.3 vvm, 또는 약 0.15 내지 0.25 vvm인 것일 수 있다. 상기 배양은 글리세롤 예를 들면, 1-20중량%, 1-10중량% 또는 2-10중량%를 포함하는 배지 중에서 배양하는 것일 수 있다.
상기 방법은 배양물로부터 아크릴레이트를 회수하는 단계를 더 포함할 수 있다. 상기 회수는 세포 또는 세포를 제외한 배양액, 또는 둘 모두에서 분리될 수 있다. 배양물로부터 아크릴산의 분리는 당업계에 알려진 분리 및 정제방법을 사용하여 수행될 수 있다. 상기 회수는 원심분리, 크로마토그래피, 추출, 여과, 침전, 또는 이들의 조합에 의하여 이루어질 수 있다.
상기 방법에 있어서, 상기 미생물은 아크릴레이트를 다른 산물로 전환하는 경로를 더 포함하는 것이고, 생산된 아크릴레이트를 다른 산물로 전환하는 단계를 더 포함하는 것일 수 있다. 다른 산물은 폴리아크릴레이트를 포함한 아크릴레이트 에스테르인 것일 수 있다.
일 양상에 따른 미생물에 의하면, 3-아크릴산의 증가된 생산능을 갖는다.
다른 양상에 따른 아크릴산을 생산하는 방법에 의하면, 아크릴산을 효율적으로 생산할 수 있다.
도 1은 pET-iBAB_PduP 벡터의 개열 지도를 나타낸다.
도 2는 글리세롤 함유 배지에서 대장균 SH3, 및 ALDH 유전자와 3-HP-CoA dehydratase 유전자가 도입된 두 재조합 대장균 균주를 48시간 배양한 경우, 배양물 중의 아크릴레이트를 질량 분석기에 의하여 분석한 결과를 나타낸다.
도 3은 48 시간 경과시에 도 1에서 측정된 아크릴레이트의 양을 나타낸다.
도 4는 대장균 SH3와 SH3/pET-iBAB-eP(PduP)/pACYC-E2(#12) 균주를 발효기 (fermentor)에서 48 시간 동안 배양한 후 배양액 중의 아크릴레이트를 측정한 결과를 나타낸다.
도 5는 대장균 SH3와 SH3/pET-iBAB-eP(PduP)/pACYC-E2(#12) 균주를 발효기 (fermentor)에서 48 시간 동안 배양한 후 배양액 중의 3-HP를 측정한 결과를 나타낸다.
도 6은 일 실시예의 대장균에서 포도당 또는 글리세롤로부터 아크릴산의 생산의 예측 경로를 나타낸 도면이다.
도 2는 글리세롤 함유 배지에서 대장균 SH3, 및 ALDH 유전자와 3-HP-CoA dehydratase 유전자가 도입된 두 재조합 대장균 균주를 48시간 배양한 경우, 배양물 중의 아크릴레이트를 질량 분석기에 의하여 분석한 결과를 나타낸다.
도 3은 48 시간 경과시에 도 1에서 측정된 아크릴레이트의 양을 나타낸다.
도 4는 대장균 SH3와 SH3/pET-iBAB-eP(PduP)/pACYC-E2(#12) 균주를 발효기 (fermentor)에서 48 시간 동안 배양한 후 배양액 중의 아크릴레이트를 측정한 결과를 나타낸다.
도 5는 대장균 SH3와 SH3/pET-iBAB-eP(PduP)/pACYC-E2(#12) 균주를 발효기 (fermentor)에서 48 시간 동안 배양한 후 배양액 중의 3-HP를 측정한 결과를 나타낸다.
도 6은 일 실시예의 대장균에서 포도당 또는 글리세롤로부터 아크릴산의 생산의 예측 경로를 나타낸 도면이다.
이하 본 발명을 실시예를 통하여 보다 상세하게 설명한다. 그러나, 이들 실시예는 본 발명을 예시적으로 설명하기 위한 것으로 본 발명의 범위가 이들 실시예에 한정되는 것은 아니다.
재료 및 방법
달리 언급이 없으면, 이하 실시예에서는 다음의 재료 및 방법이 사용되었다.
(1) 3-
HPA
생산능을
갖는 대장균 세포의 제조
3-HPA 생산능을 갖는 대장균 균주 즉, E.coli K12 (DE3) (ㅿyqhD ㅿack-pta/pET-iBAB)를 다음과 같은 과정에 의하여 제조하였다. ackA-pta 및 yqhD 유전자가 결실된 균주는 Red recombinase 효소 발현에 의한 방법으로 다음과 같은 과정으로 제작되었다. 우선, ackA-pta를 결실시키기 위해 pKD4 벡터 (서열번호 226)를 주형으로 하고 ackAKF 프라이머 (서열번호 227) 및 ackAKR 프라이머 (서열번호 228)의 프라이머 세트를 사용한 PCR 증폭에 의해 45 bp의 ackA-pta 유전자 양 말단과 상동성을 가지는 증폭 산물을 얻었다. 이 DNA를 Red recombinase를 발현하는 pKD46 벡터 (서열번호 229)를 가지는 대장균 K12 (DE3) 균주에 전기천공 (electroporation)에 의하여 도입하여 카나마이신 항생제에 저항성 (KmR)을 가지는 균주를 선별한 후 해당 균주의 게놈의 ackA-pta 유전자 부위가 카나마이신 항생제에 대한 저항성을 부여하는 유전자로 치환되었음을 확인하였다.
여기서 얻어진 균주에 고온에서 발현되는 Flp recombinase의 유전자를 가지는 pCP20 벡터 (서열번호 230)를 도입하여 Flp recombinase를 발현시켜 게놈 내부 KmR 유전자를 제거한 후, PCR을 통해 ackA-pta 유전자가 결실되고 KmR 유전자를 가지지 않은 균주를 얻었음을 확인하였다. 동일한 실험과정을 통해, pKD4 벡터를 주형으로 하고 yqhDKF 프라이머(서열번호 231) 및 yqhDKR 프라이머(서열번호 232)의 프라이머 세트를 통한 PCR 증폭 산물을 상기 ackA-pta 유전자가 결실되고 KmR 유전자를 가지지 않은 균주에 도입한 후 KmR 유전자를 제거하여 최종적으로 ackA-pta 및 yqhD 유전자가 결실된 SH3 균주를 획득하였다.
pET-iBAB 벡터는 다음의 과정으로 제작하였다.
Ilyobacter polytropus의 게놈 DNA로부터 글리세롤 데하이드라타제 (glycerol dehydratase: GDH)를 코딩하는 유전자 (dhaB1, dhaB2, 및 dhaB3) (서열번호 214, 215,및 216) 및 글리세롤 데히드라타제 재활성화효소 인자 (glycerol dehydratase reactivase: GDR)를 코딩하는 유전자 (gdrA, 및 gdrB)(서열번호 223 및 224)를 확보하였다. 상기 dhaB1, dhaB2, 및 dhaB3 유전자는 Ilyobacter polytropus의 게놈 DNA를 주형으로 하고 dhaB123_F (서열번호 233) 및 dhaB123_R(서열번호 234)의 프라이머 세트를 사용한 PCR 증폭에 의하여 dhaB123를 한 증폭 산물로 얻었다. 상기 gdrA 및 gdrB 유전자는 Ilyobacter polytropus의 게놈 DNA를 주형으로 하고 gdrAB_F(서열번호 235) 및 gdrAB_R(서열번호 236)의 프라이머 세트를 사용한 PCR 증폭에 의하여 gdrAB를 한 증폭 산물로 얻었다. 얻어진 PCR 산물을 BamHI 및 SacI 제한효소로 처리한 후, pETDuetTM-1 벡터 (Novagen, Cat. No. 71146-3)에 클로닝하여 pET-iBAB 벡터를 얻었다.
(2) 3-
HPA
생산능을
갖는 대장균 세포에
ALDH
및 3-
HP
-
CoA
데히드라타제
유전자가 도입된 균주의 제조
글리세롤로부터 3-HPA를 거쳐 3-HP-CoA를 생산하기 위한 벡터 (pET-iBAB-PduP)는 다음의 과정을 거쳐 제작되었다. 상기 pET-iBAB 벡터를 주형으로 하여 iBAB_Up 및 iBAB_Dn 프라이머 세트(서열번호 237 및 238)를 사용한 PCR 증폭에 의해 dhaB123 및 gdrAB를 포함하는 선형의 벡터를 얻었다. PCR은 Primestar Max (Takara Inc., R045A)를 사용하여 95℃에서 15초, 50℃에서 15초, 및 72℃에서 2분을 30회 (cycle) 수행하였다. 또한, Lactobacillus reuteri DSM 20016의 게놈 DNA로부터 CoA acylating aldehyde dehydrogenase (ALDH)를 코딩하는 유전자 (PduP)를 pduP_F 및 pduP_R의 프라이머 세트(서열번호 239 및 240)를 사용한 PCR 증폭에 의하여 얻었다. 얻어진 PCR 산물을 In-FusionTM HD Cloning Kit (Clontech Laboratories, Inc.)를 사용하여 상기의 선형 벡터에 클로닝하였다. 그 결과, pET-iBAB_PduP (pETDuet-1/dhaB_gdrAB_pduP) 벡터를 얻었다.
도 1은 pET-iBAB_PduP 벡터의 개열 지도를 나타낸다.
E.coli K12 (DE3) (ㅿyqhD ㅿack-pta/pET-iBAB-PduP)에 3-HP-CoA 데히드라타제 유전자로서 MELS_1449 유전자를 도입하였다.
구체적으로, Megasphaera elsdenii 균주의 게놈을 주형으로 하고 HindIII 및 BamHI 자리를 가진 각 프라이머 세트 (서열번호 241 및 242)를 사용하여 PCR을 통하여 증폭하였으며, PCR은 Primestar Max (Takara Inc., R045A)를 사용하여 95℃에서 15초, 50℃에서 15초, 및 72℃에서 2분을 30회 (cycle) 수행하였다. 얻어진 각 증폭 산물을 HindIII 및 BamHI으로 소화시키고, 이를 pACYCDuetTM-1 벡터 (Novagen, cat. no. 71147-3)의 HindIII 및 BamHI 자리에 연결시켜 pACYC-MDH 벡터를 제조하였다.
pET-iBAB-PduP 벡터 및 pACYC-MDH 벡터를 대장균 SH3 균주에 전기천공에 의하여 도입하였다. 구체적으로, 전기 천공용으로 제작된 0.05mL의 SH3 세포액에 두 벡터를 200 내지 300ng이 되도록 넣어준 후, electroporation cuvette (Bio-rad Inc., cat. No. 165-2802)에 넣고 Gene Pulser XcellTM Total System (Bio-rad Inc., cat. No. 165-2660)을 사용하여 2.5 kV의 pulse를 인가하여 형질도입하였다. 이렇게 형질 도입된 세포에서 카나마이신 항생제 및 클로람페니콜 항생제에 동시에 내성을 가지는 균주를 선별하여, 최종적으로 SH3/pET-iBAB-PduP/pACYC-MDH 균주를 제조하였다.
(3) 3-
HPA
생산능을
갖는 대장균 세포에
ALDH
및 3-
HP
-
CoA
데히드라타제
및 아크릴릴-
CoA
를
아크릴레이트로
전환하는 것을
촉매하는
효소 유전자가 도입된 균주의 제조
E.coli K12 (DE3) (ㅿyqhD ㅿack-pta/pET-iBAB-PduP)에 3-HP-CoA 데히드라타제 및 아크릴릴-CoA를 아크릴레이트로 전환하는 것을 촉매하는 효소 유전자로서 M.elsdenii 유래의 3-HP-CoA dehydratase를 코딩하는 MELS_1449 유전자, 및 대장균 유래의 CoA hydrolase yciA 유전자를 도입하였다.
구체적으로, 대장균 유래의 CoA hydrolase yciA 유전자를 대장균 (K12 MG1655)의 게놈을 주형으로 하고, yciA_F 및 yciA_R 프라이머 세트 (서열번호 243 및 244)를 사용하여 PCR로 증폭하여 얻고, 이를 BglII 및 XhoI 제한 효소를 사용하여 각각 소화시킨 후, 동일한 효소로 소화된 pACYC-MDH 벡터에 도입하여, 2개 유전자 발현용 벡터 (pACYC-MDH-yciA)를 제조하였다.
다음으로, (2)에 기재된 pET-iBAB-PduP 벡터 및 pACYC-MDH-YciA 벡터를 대장균 SH3/pET-iBAB-PduP/pACYC-MDH 균주를 제작할 때와 동일한 방법으로 electroporation 기법으로 형질도입하였으며, 카나마이신 항생제 및 클로람페니콜 항생제에 동시에 내성을 가지는 균주를 선별하여, 대장균 SH3/pET-iBAB-PduP/pACYC-MDH-YciA 균주를 제조하였다.
실시예
1:
ALDH
, 및 3-
HP
-
CoA
를
아크릴릴
-
CoA
(
acrylyl
-
CoA
)로 전환하는 것을
촉매하는
3-
HP
-
CoA
데히드라타제
유전자가 도입된 미생물의
아크릴레트
생산성의 확인
재료 및 방법에 제조된 대장균 SH3, SH3/pET-iBAB-PduP/pACYC-MDH 균주 및 SH3/pET-iBAB-PduP/pACYC-MDH-YciA 균주를 각각 250mL 플라스크에서 RM 최소 배지 (MgSO4 ㆍ7H2O 1.4 g/L, K2HPO4 17.4 g/L, KH2PO4 3g/L, (NH4)2HPO4 4 g/L, 시트르산 1.7 g/L, ZnCl2 0.014 g/L, FeCl2ㆍ4H2O 0.041 g/L, MnCl2 0.015 g/L, CuCl2 0.0015 g/L, H3BO3 0.003 g/L, Na2MoO4 0.0025 g/L, 비타민 B12 10 uM, 포도당 1.0 g/L, 및 글리세롤 30 g/L) 20mL에서 OD600=0.25가 되게 접종하고 30℃에서 OD600=0.6이 될 때까지 배양한 후 0.03mM IPTG를 첨가하고 33℃에서 48 시간 동안 배양하였다. 배양은 220mL 플라스크에서 진탕하면서 48 시간 동안 이루어졌다.
다음으로, 배양물 중의 아크릴산 및 기타 유기산의 농도는 HPLC를 사용하여 측정하였다. 구체적으로, 배양이 종료된 후, 배양액 중 일부를 채취하여 흡광도를 측정하고, 세포가 제외된 배양액에 대하여 굴절율 검출기(refractive index detector) 및 포토다이오드 어레이 검출기 (photodiode array detector)가 부착된 HPLC (Waters)에 Aminex HPX-87H 컬럼을 장착하여 5 mM H2SO4 수용액을 사용, 0.1 ml/min의 flow rate로 흘려주어 아크릴레이트(AA)가 생산되는지를 확인하였다. 생성된 AA의 정량은 포토다이오드 검출기의 210 nm 파장에서 정제된 AA 시료 (Sigma Aldrich)와의 정량 비교를 통해 이루어졌다. HPLC 분석 결과, ALDH 유전자와 3-HP-CoA dehydratase 유전자가 도입된 두 재조합 대장균 균주, 즉 SH3/pET-iBAB-PduP/pACYC-MDH 균주 및 SH3/pET-iBAB-PduP/pACYC-MDH-YciA 균주는 48 시간 동안 배양한 경우, 약 6 mg/L의 아크릴산을 생산하였다 (도 2 참조).
도 2는 글리세롤 함유 배지에서 ALDH 유전자와 3-HP-CoA dehydratase 유전자가 도입된 두 재조합 대장균 균주를 48시간 배양한 경우, 배양물 중의 아크릴레이트를 HPLC에 의하여 분석한 결과를 나타낸다. 도 2에서, A는 대장균 SH3/pET-iBAB-PduP/pACYC-MDH 균주를 나타내고, B는 SH3/pET-iBAB-PduP/pACYC-MDH-YciA 균주, 및 C는 2.8mg/L 아크릴레이트 표준물을 각각 나타낸다. 도 2에서 가로축은 HPLC에 연결된 Aminex HPX-87H 컬럼에 배약액을 주입 후 0.1 ml/min의 속도로 5 mM H2SO4 수용액을 흘려 주었을 때 포토다이오드 어레이 검출기에 도달한 시간 (min)을 나타내고, 세로축은 포토다이오드 어레이 검출기에서 210 nm 파장 범위에서 측정된 전압 (uV)을 나타낸다. 아크릴레이트 농도는 아크릴레이트 표준물 기준으로 두 시료 모두 약 6 mg/L로 측정되었다.
도 3은 대장균 SH3/pET-iBAB-PduP/pACYC-MDH 균주를 발효기 (fermentor)에서 48 시간 동안 배양한 후 배양액 중의 아크릴레이트를 측정한 결과를 나타낸다. 도 3에서, 배양은 1.5 L 배양기 (Biotron)에 상기의 RM 최소 배지 600 mL에서 상기 균주를 OD600이 0.1이 되게 접종하고, 33 ℃에서 600 rpm으로 교반하면서 48 시간 동안 배양한 것이다. 도 3에 나타낸 바와 같이, SH3/pET-iBAB-PduP/pACYC-MDH 균주는 아크릴레이트를 현저하게 증가된 양으로 생산하였다. 최대 생산은 40 시간에서 44 mg/L 아크릴레이트를 생산하였다.
도 4는 본 실시예의 대장균에서 포도당 또는 글리세롤로부터 아크릴산의 생산의 예측 경로를 나타낸 도면이다. 본 실시예에서 아크릴산은 도 4에 나타낸 경로에 따라 생산될 것으로 여겨지지만, 청구된 발명이 특별한 기작에 한정되는 것은 아니다. 도 4에서, 포도당 또는 글리세롤로부터 전환된 3-HPA는 PduP에 의하여 3-HP-CoA로 전환되는 것이 촉매되며, 3-HP-CoA는 MELS_1449에 의하여 AA-CoA로 전환되는 것이 촉매된다. AA-CoA로부터 AA로의 전환은 세포에 내재적으로 존재하는 효소, 예를 들면, YciA에 의하여 촉매되거나, 외부에서 도입된 효소 유전자, 예를 들면, YciA 유전자의 발현 산물에 의하여 전환이 촉매될 수 있다. 대장균의 경우 YciA 유전자는 내재적으로 존재할 수 있으며, 그에 따라 외부에서 도입하지 않아도 AA-CoA로부터 AA로 전환될 수 있다. 탄소원, 예를 들면, 포도당 또는 글리세롤로부터 3-HPA로 전환하는 경로를 갖는 균주 즉, 3-HPA 생산능을 갖는 균주는, 본 실시예에 기재된 SH3/pET-iBAB-PduP/pACYC-MDH-YciA 균주, 및 SH3/pET-iBAB-PduP/pACYC-MDH 균주뿐만 아니라, 당업계에 알려진 균주가 사용될 수 있다.
<110> Samsung Electronics Co., Ltd.
<120> Microorganism having novel acrylic acid synthesis pathway having
enhanced activity of CoA acylating aldehyde dehydrogenase and
method of producing acrylic acid using the same
<130> PN105322KR
<160> 244
<170> KopatentIn 2.0
<210> 1
<211> 477
<212> PRT
<213> Lactobacillus reuteri DSM 20016
<400> 1
Met Gln Ile Asn Asp Ile Glu Ser Ala Val Arg Lys Ile Leu Ala Glu
1 5 10 15
Glu Leu Asp Asn Ala Ser Ser Ser Ser Ala Asn Val Ala Ala Thr Thr
20 25 30
Asp Asn Gly His Arg Gly Ile Phe Thr Asn Val Asn Asp Ala Ile Ala
35 40 45
Ala Ala Lys Ala Ala Gln Glu Ile Tyr Arg Asp Lys Pro Ile Ala Val
50 55 60
Arg Gln Gln Val Ile Asp Ala Ile Lys Glu Gly Phe Arg Pro Tyr Ile
65 70 75 80
Glu Lys Met Ala Lys Asp Ile Lys Glu Glu Thr Gly Met Gly Thr Val
85 90 95
Glu Ala Lys Ile Ala Lys Leu Asn Asn Ala Leu Tyr Asn Thr Pro Gly
100 105 110
Pro Glu Ile Leu Glu Pro Val Val Glu Asn Gly Asp Gly Gly Met Val
115 120 125
Met Tyr Glu Arg Leu Pro Tyr Gly Val Ile Gly Ala Val Gly Pro Ser
130 135 140
Thr Asn Pro Ser Glu Thr Val Ile Ala Asn Ala Ile Met Met Leu Ala
145 150 155 160
Gly Gly Asn Thr Leu Tyr Phe Gly Ala His Pro Gly Ala Lys Asn Val
165 170 175
Thr Arg Trp Thr Ile Glu Lys Met Asn Asp Phe Ile Ala Asp Ala Thr
180 185 190
Gly Leu His Asn Leu Val Val Ser Ile Glu Thr Pro Thr Ile Glu Ser
195 200 205
Val Gln Gln Met Met Lys His Pro Asp Ile Ala Met Leu Ala Val Thr
210 215 220
Gly Gly Pro Ala Val Val His Gln Ala Met Thr Ser Gly Lys Lys Ala
225 230 235 240
Val Gly Ala Gly Pro Gly Asn Pro Pro Ala Met Val Asp Ala Thr Ala
245 250 255
Asp Ile Asp Leu Ala Ala His Asn Ile Ile Thr Ser Ala Ser Phe Asp
260 265 270
Asn Asp Ile Leu Cys Thr Ala Glu Lys Glu Val Val Ala Glu Ser Ser
275 280 285
Ile Lys Asp Glu Leu Ile Arg Lys Met Gln Asp Glu Gly Ala Phe Val
290 295 300
Val Asn Arg Glu Gln Ala Asp Lys Leu Ala Asp Met Cys Ile Gln Glu
305 310 315 320
Asn Gly Ala Pro Asp Arg Lys Phe Val Gly Lys Asp Ala Thr Tyr Ile
325 330 335
Leu Asp Gln Ala Asn Ile Pro Tyr Thr Gly His Pro Val Glu Ile Ile
340 345 350
Cys Glu Leu Pro Lys Glu His Pro Leu Val Met Thr Glu Met Leu Met
355 360 365
Pro Ile Leu Pro Val Val Ser Cys Pro Thr Phe Asp Asp Val Leu Lys
370 375 380
Thr Ala Val Glu Val Glu Lys Gly Asn His His Thr Ala Thr Ile His
385 390 395 400
Ser Asn Asn Leu Lys His Ile Asn Asn Ala Ala His Arg Met Gln Cys
405 410 415
Ser Ile Phe Val Val Asn Gly Pro Ser Tyr Val Gly Thr Gly Val Ala
420 425 430
Asp Asn Gly Ala His Ser Gly Ala Ser Ala Leu Thr Ile Ala Thr Pro
435 440 445
Thr Gly Glu Gly Thr Cys Thr Ala Arg Thr Phe Thr Arg Arg Val Arg
450 455 460
Leu Asn Ser Pro Gln Gly Phe Ser Val Arg Asn Trp Tyr
465 470 475
<210> 2
<211> 477
<212> PRT
<213> Lactobacillus brevis ATCC 367
<400> 2
Met Asn Thr Glu Asn Ile Glu Gln Ala Ile Arg Lys Ile Leu Ser Glu
1 5 10 15
Glu Leu Ser Asn Pro Gln Ser Ser Thr Ala Thr Asn Thr Thr Val Pro
20 25 30
Gly Lys Asn Gly Ile Phe Lys Thr Val Asn Glu Ala Ile Ala Ala Thr
35 40 45
Lys Ala Ala Gln Glu Asn Tyr Ala Asp Gln Pro Ile Ser Val Arg Asn
50 55 60
Lys Val Ile Asp Ala Ile Arg Glu Gly Phe Arg Pro Tyr Ile Glu Asp
65 70 75 80
Met Ala Lys Arg Ile His Asp Glu Thr Gly Met Gly Thr Val Ser Ala
85 90 95
Lys Ile Ala Lys Leu Asn Asn Ala Leu Tyr Asn Thr Pro Gly Pro Glu
100 105 110
Ile Leu Gln Pro Glu Ala Glu Thr Gly Asp Gly Gly Leu Val Met Tyr
115 120 125
Glu Tyr Ala Pro Phe Gly Val Ile Gly Ala Val Gly Pro Ser Thr Asn
130 135 140
Pro Ser Glu Thr Val Ile Ala Asn Ala Ile Met Met Leu Ala Gly Gly
145 150 155 160
Asn Thr Leu Phe Phe Gly Ala His Pro Gly Ala Lys Asn Ile Thr Arg
165 170 175
Trp Thr Ile Glu Lys Leu Asn Glu Leu Val Ala Asp Ala Thr Gly Leu
180 185 190
His Asn Leu Val Val Ser Leu Glu Thr Pro Ser Ile Glu Ser Val Gln
195 200 205
Glu Val Met Gln His Pro Asp Val Ala Met Leu Ser Ile Thr Gly Gly
210 215 220
Pro Ala Val Val His Gln Ala Leu Ile Ser Gly Lys Lys Ala Val Gly
225 230 235 240
Ala Gly Ala Gly Asn Pro Pro Ala Met Val Asp Ala Thr Ala Asn Ile
245 250 255
Ala Leu Ala Ala His Asn Ile Val Asp Ser Ala Ala Phe Asp Asn Asn
260 265 270
Ile Leu Cys Thr Ala Glu Lys Glu Val Val Val Glu Ala Ala Val Lys
275 280 285
Asp Glu Leu Ile Met Arg Met Gln Gln Glu Gly Ala Phe Leu Val Thr
290 295 300
Asp Ser Ala Asp Ile Glu Lys Leu Ala Gln Met Thr Ile Gly Pro Lys
305 310 315 320
Gly Ala Pro Asp Arg Lys Phe Val Gly Lys Asp Ala Thr Tyr Ile Leu
325 330 335
Asp Gln Ala Gly Ile Ser Tyr Thr Gly Thr Pro Thr Leu Ile Ile Leu
340 345 350
Glu Ala Ala Lys Asp His Pro Leu Val Thr Thr Glu Met Leu Met Pro
355 360 365
Ile Leu Pro Val Val Cys Cys Pro Asp Phe Asp Ser Val Leu Ala Thr
370 375 380
Ala Thr Glu Val Glu Gly Gly Leu His His Thr Ala Ser Ile His Ser
385 390 395 400
Glu Asn Leu Pro His Ile Asn Lys Ala Ala His Arg Leu Asn Thr Ser
405 410 415
Ile Phe Val Val Asn Gly Pro Thr Tyr Cys Gly Thr Gly Val Ala Thr
420 425 430
Asn Gly Ala His Ser Gly Ala Ser Ala Leu Thr Ile Ala Thr Pro Thr
435 440 445
Gly Glu Gly Thr Ala Thr Ser Lys Thr Tyr Thr Arg Arg Arg Arg Leu
450 455 460
Asn Ser Pro Glu Gly Phe Ser Leu Arg Thr Trp Glu Ala
465 470 475
<210> 3
<211> 477
<212> PRT
<213> Pediococcus acidilactici
<400> 3
Met Glu Ile Gln Asn Leu Glu Glu Asp Ile Arg Arg Ile Leu Ser Glu
1 5 10 15
Glu Leu Lys Lys Ser Gly Thr Ser Gln Thr Ala Ser Thr Ser Asp Ala
20 25 30
Gly Gln Asn Gly Ile Phe Lys Thr Val Asp Glu Ala Ile Ala Ala Ala
35 40 45
Lys Ala Ala Glu Asp Val Tyr Ile Asp Lys Pro Leu Ala Phe Arg Glu
50 55 60
Lys Val Leu Thr Ala Ile Arg Glu Gly Phe Arg Pro Tyr Ile Glu Lys
65 70 75 80
Met Ala Lys Asp Ile Lys Asp Glu Thr Gly Met Gly Thr Val Glu Ala
85 90 95
Lys Ile Ala Lys Leu Asn Asn Ala Leu Tyr Asn Thr Pro Gly Thr Glu
100 105 110
Ile Leu Gln Pro Glu Ala Glu Thr Gly Asp Gly Gly Leu Val Met Tyr
115 120 125
Glu Tyr Ala Pro Phe Gly Val Ile Gly Ala Val Gly Pro Ser Thr Asn
130 135 140
Pro Ser Glu Thr Val Ile Ala Asn Ala Ile Met Met Leu Ala Gly Gly
145 150 155 160
Asn Thr Leu Tyr Phe Gly Ala His Pro Gly Ala Lys Lys Ile Thr Arg
165 170 175
Trp Thr Ile Glu Lys Leu Asn Lys Leu Val Tyr Glu Ala Thr Gly Met
180 185 190
Lys Asn Leu Val Val Ser Ile Glu Glu Pro Ser Ile Glu Ser Val Gln
195 200 205
Glu Met Met Gln His Pro Asp Ile Ala Met Leu Ser Ile Thr Gly Gly
210 215 220
Pro Ala Val Val His Gln Ala Leu Val Ser Gly Lys Lys Ala Val Gly
225 230 235 240
Ala Gly Ala Gly Asn Pro Pro Ala Ile Val Asp Ala Thr Ala Asn Val
245 250 255
Ala Leu Ala Ala His Asn Ile Val Asp Ser Ala Ser Phe Asp Asn Asn
260 265 270
Ile Leu Cys Thr Ala Glu Lys Glu Val Val Val Glu Ser Ser Val Lys
275 280 285
Asp Glu Leu Ile Lys Lys Met Gln Glu Glu Gly Ala Phe Leu Val Thr
290 295 300
Asn Ala Ser Asp Ile Asp Lys Leu Ala Glu Met Thr Ile Gly Lys Asn
305 310 315 320
Gly Ala Pro Asp Arg Gln Phe Val Gly Lys Asp Ala Thr Tyr Ile Leu
325 330 335
Asp Lys Ala Gly Ile Ala Tyr Thr Gly Thr Pro Lys Leu Ile Ile Met
340 345 350
Glu Ala Gln Lys Asp His Pro Leu Val Thr Thr Glu Met Leu Met Pro
355 360 365
Ile Val Pro Val Val Ser Cys Pro Thr Phe Asp Gln Val Leu Ala Thr
370 375 380
Ala Val Glu Val Glu Gln Gly Leu His His Thr Ala Ser Ile His Ser
385 390 395 400
Glu Asn Leu Pro Asn Ile Asn Arg Ala Ala His Arg Met Asn Thr Ser
405 410 415
Ile Phe Val Val Asn Gly Ala Thr Tyr Val Gly Thr Gly Val Gly Ala
420 425 430
Asn Gly Ala His Ala Gly Ala Ser Ala Leu Thr Ile Ala Thr Pro Thr
435 440 445
Gly Glu Gly Thr Ala Thr Ala Lys Thr Phe Thr Arg Arg Arg Arg Leu
450 455 460
Asn Ser Pro Glu Ala Phe Ser Leu Arg Ser Trp Glu Ala
465 470 475
<210> 4
<211> 476
<212> PRT
<213> Pediococcus claussenii ATCC BAA-344
<400> 4
Met Glu Met Asp Lys Leu Glu Gln Asp Ile Arg Arg Ile Leu Ser Glu
1 5 10 15
Glu Leu Gln Asp Ser Asp Asn Ser Val Ser Ala Ser Ser Asp Asn Gly
20 25 30
Thr Asn Gly Ile Phe Lys Thr Val Asp Glu Ala Ile Ala Ala Ala Lys
35 40 45
Ala Ala Gln Glu Ile Tyr Val Asp Lys Ser Leu Ala Phe Arg Asn Gln
50 55 60
Val Leu Asp Ala Ile Lys Glu Gly Phe Arg Pro Tyr Ile Glu Gln Met
65 70 75 80
Ala Lys Asp Ile Lys Glu Glu Thr Gly Met Gly Thr Val Glu Ala Lys
85 90 95
Ile Ala Lys Leu Asn Asn Ala Leu Tyr Asn Thr Pro Gly Thr Glu Ile
100 105 110
Leu Glu Pro Glu Ala Glu Thr Gly Asp Gly Gly Leu Val Leu Tyr Glu
115 120 125
Tyr Ala Pro Phe Gly Val Ile Gly Ala Val Gly Pro Ser Thr Asn Pro
130 135 140
Ser Glu Thr Val Ile Ala Asn Ala Leu Met Met Leu Ala Gly Gly Asn
145 150 155 160
Thr Val Tyr Phe Gly Ala His Pro Gly Ala Lys Lys Ile Thr Arg Trp
165 170 175
Thr Ile Glu Lys Leu Asn Glu Phe Val Phe Lys Ala Thr Gly Met Arg
180 185 190
Asn Met Val Val Ser Ile Glu Glu Pro Ser Ile Glu Ser Val Gln Gln
195 200 205
Met Met Gln His Pro Asp Ile Ala Met Leu Ser Ile Thr Gly Gly Pro
210 215 220
Gly Val Val His Gln Ala Met Ile Ser Gly Lys Lys Ala Val Gly Ala
225 230 235 240
Gly Ala Gly Asn Pro Pro Ala Ile Val Asp Ala Thr Ala Asn Ile Asp
245 250 255
Leu Ala Ala His Asn Ile Val Asp Ser Ser Ser Phe Asp Asn Asn Ile
260 265 270
Leu Cys Thr Ala Glu Lys Glu Val Val Val Glu Glu Ser Val Lys Asp
275 280 285
Glu Leu Ile Ser Lys Met Gln Asn Glu Gly Ala Phe Leu Val Thr Ser
290 295 300
Ala His Asp Ile Glu Lys Ile Val Gln Ile Thr Ile Gly Lys Asn Gly
305 310 315 320
Ala Pro Asp Arg Lys Phe Val Gly Lys Asp Ala Thr Phe Ile Leu Asp
325 330 335
Ser Ala Gly Ile Asn Tyr Thr Gly Thr Pro Lys Leu Ile Ile Leu Glu
340 345 350
Ala His Lys Asn His Pro Leu Val Thr Thr Glu Met Leu Met Pro Ile
355 360 365
Leu Pro Val Val Ser Cys Pro Thr Phe Asp Arg Ala Leu Ala Thr Ala
370 375 380
Val Glu Val Glu Gln Gly Leu His His Thr Ala Ser Ile His Ser Glu
385 390 395 400
Asn Leu Pro His Ile Asn Gln Ala Ala His Arg Met Asn Thr Ser Ile
405 410 415
Phe Val Val Asn Gly Ala Thr Tyr Val Gly Thr Gly Val Gly Ala Asn
420 425 430
Gly Ala His Ala Gly Ala Ser Ala Leu Thr Ile Ala Thr Pro Thr Gly
435 440 445
Glu Gly Thr Ala Thr Ala Lys Thr Phe Thr Arg Arg Arg Arg Leu Asn
450 455 460
Ser Pro Glu Ala Phe Ser Leu Arg Ser Trp Glu Ala
465 470 475
<210> 5
<211> 481
<212> PRT
<213> Lactobacillus collinoides
<400> 5
Met Ala Asp Gln Asn Ile Glu Ala Glu Ile Arg Arg Ile Leu Gln Glu
1 5 10 15
Glu Leu Ser Gly Asn Ala Ser Ser Ser Ala Ala Gly Thr Thr Thr Ser
20 25 30
Gln Pro Asp Gly Leu Gly Asn Arg Ile Phe Thr Asn Val Asn Asp Ala
35 40 45
Ile Ala Ala Ala Lys Gln Ala Gln Ala Ile Tyr Gln Asp Lys Pro Leu
50 55 60
Ala Phe Arg Lys Lys Val Val Gln Ala Ile Lys Asp Gly Phe Gly Pro
65 70 75 80
Tyr Ile Glu Tyr Met Ala Lys Gln Thr Arg Glu Glu Thr Gly Met Gly
85 90 95
Thr Ala Glu Ala Lys Ile Ala Lys Leu Lys Asn Ala Leu Tyr Asn Thr
100 105 110
Pro Gly Val Glu Leu Leu Asp Pro Glu Val Glu Thr Gly Asp Gly Gly
115 120 125
Met Val Met Tyr Glu Tyr Thr Pro Phe Gly Val Ile Gly Ala Val Gly
130 135 140
Pro Ser Thr Asn Pro Cys Glu Thr Val Leu Asn Asn Ser Ile Met Met
145 150 155 160
Met Ser Ala Gly Asn Ala Leu Phe Phe Gly Ala His Pro Gly Ala Lys
165 170 175
Asn Ile Thr Arg Trp Ala Val Glu Lys Leu Asn Glu Phe Val Tyr Lys
180 185 190
Ala Thr Gly Leu Lys Asn Leu Leu Val Ser Leu Asp Thr Pro Ser Ile
195 200 205
Glu Ser Val Gln Glu Met Met Gln His Pro Asp Val Ala Met Leu Ala
210 215 220
Val Thr Gly Gly Pro Ala Val Val His Gln Ala Leu Thr Ser Gly Lys
225 230 235 240
Lys Ala Val Gly Ala Gly Ala Gly Asn Pro Pro Ala Met Val Asp Ala
245 250 255
Thr Ala Asp Ile Asp Leu Ala Ala His Asn Leu Phe Thr Ser Ala Lys
260 265 270
Phe Asp Asn Glu Ile Leu Cys Thr Ser Glu Lys Glu Ile Ile Ala Glu
275 280 285
Asp Ser Ile Lys Asp Glu Leu Leu Gln Lys Ile Val Ala Lys Gly Ala
290 295 300
Cys Leu Val Thr Asp Pro Lys Asp Ile Lys His Leu Ala Asp Met Thr
305 310 315 320
Ile Gly Asp Asn Gly Ala Pro Asp Arg Lys Tyr Val Gly Lys Asp Ala
325 330 335
Thr Val Ile Leu Asp Ala Ala Gly Ile Ser Tyr Thr Gly Asp Pro Lys
340 345 350
Leu Ile Met Met Asp Val Asp Lys Asp Asn Pro Leu Val Lys Thr Glu
355 360 365
Met Leu Met Pro Ile Leu Pro Ile Val Gly Cys Pro Asp Phe Asp Ala
370 375 380
Val Leu Ala Thr Ala Ile Glu Val Glu Gly Gly Asn His His Thr Ala
385 390 395 400
Ser Ile His Ser Asn Asn Ile Leu His Ile Asn Lys Ala Ala His Arg
405 410 415
Met Asn Thr Ser Ile Phe Val Ala Asn Gly Pro Thr Phe Ala Ala Thr
420 425 430
Gly Val Gly Asp Asn Gly Tyr Tyr Ser Gly Ala Ala Ala Leu Thr Ile
435 440 445
Ala Thr Pro Thr Gly Glu Gly Thr Thr Thr Thr Lys Thr Phe Thr Arg
450 455 460
Arg Arg Arg Phe Asn Cys Pro Gln Gly Phe Ser Leu Arg Ser Trp Glu
465 470 475 480
Val
<210> 6
<211> 469
<212> PRT
<213> Listeria welshimeri serovar 6b str. SLCC5334
<400> 6
Met Glu Ser Leu Glu Leu Glu Gln Leu Val Lys Lys Val Leu Leu Glu
1 5 10 15
Lys Leu Ala Glu Gln Lys Asp Val Pro Val Lys Thr Thr Thr Gln Gly
20 25 30
Ala Lys Ser Gly Ile Phe Asp Thr Val Asp Glu Ala Val Gln Ala Ala
35 40 45
Val Gln Ala Gln Asn Ser Tyr Lys Glu Lys Ser Leu Glu Glu Arg Arg
50 55 60
Asn Val Val Lys Ala Ile Arg Glu Ala Leu Tyr Pro Glu Ile Glu Ser
65 70 75 80
Ile Ala Thr Arg Ala Val Ala Glu Thr Gly Met Gly Asn Val Thr Asp
85 90 95
Lys Ile Leu Lys Asn Thr Leu Ala Ile Glu Lys Thr Pro Gly Val Glu
100 105 110
Asp Leu Tyr Thr Glu Val Ala Thr Gly Asp Asn Gly Met Thr Leu Tyr
115 120 125
Glu Leu Ser Pro Tyr Gly Val Ile Gly Ala Val Ala Pro Ser Thr Asn
130 135 140
Pro Thr Glu Thr Leu Ile Cys Asn Thr Ile Gly Met Leu Ala Ala Gly
145 150 155 160
Asn Ala Val Phe Tyr Ser Pro His Pro Gly Ala Lys Asn Ile Ser Leu
165 170 175
Trp Leu Ile Glu Lys Leu Asn Thr Ile Val Arg Glu Ser Cys Gly Ile
180 185 190
Asp Asn Leu Val Val Thr Val Glu Lys Pro Ser Ile Gln Ala Ala Gln
195 200 205
Glu Met Met Asn His Pro Lys Val Pro Leu Leu Val Ile Thr Gly Gly
210 215 220
Pro Gly Val Val Leu Gln Ala Met Gln Ser Gly Lys Lys Val Ile Gly
225 230 235 240
Ala Gly Ala Gly Asn Pro Pro Ser Ile Val Asp Glu Thr Ala Asn Ile
245 250 255
Glu Lys Ala Ala Ala Asp Ile Val Asp Gly Ala Ser Phe Asp His Asn
260 265 270
Ile Leu Cys Ile Ala Glu Lys Ser Val Val Ala Val Asp Ser Ile Thr
275 280 285
Asp Phe Leu Leu Phe Gln Met Glu Lys Asn Gly Ala Leu His Val Thr
290 295 300
Asn Pro Ser Asp Ile Lys Lys Leu Glu Lys Val Ala Val Thr Asp Lys
305 310 315 320
Gly Val Thr Asn Lys Lys Leu Val Gly Lys Ser Ala Ser Glu Ile Leu
325 330 335
Lys Glu Ala Gly Ile Thr Cys Asp Phe Thr Pro Arg Leu Ile Ile Val
340 345 350
Glu Thr Asp Lys Ser His Pro Phe Ala Thr Val Glu Leu Leu Met Pro
355 360 365
Ile Val Pro Val Val Arg Val Pro Asp Phe Asp Glu Ala Leu Lys Val
370 375 380
Ala Ile Glu Leu Glu Gln Gly Leu His His Thr Ala Thr Met His Ser
385 390 395 400
Gln Asn Ile Ser Arg Leu Asn Lys Ala Ala Arg Asp Met Gln Thr Ser
405 410 415
Ile Phe Val Lys Asn Gly Pro Ser Phe Ala Gly Leu Gly Phe Arg Gly
420 425 430
Glu Gly Ser Thr Thr Phe Thr Ile Ala Thr Pro Thr Gly Glu Gly Thr
435 440 445
Thr Thr Ala Arg His Phe Ala Arg Arg Arg Arg Cys Val Leu Thr Asp
450 455 460
Gly Phe Ser Ile Arg
465
<210> 7
<211> 469
<212> PRT
<213> Listeria innocua Clip11262
<400> 7
Met Glu Ser Leu Glu Leu Glu Gln Leu Val Lys Lys Val Leu Leu Glu
1 5 10 15
Lys Leu Ala Glu Gln Lys Glu Val Pro Thr Lys Thr Thr Thr Gln Gly
20 25 30
Ala Lys Ser Gly Val Phe Asp Thr Val Asp Glu Ala Val Gln Ala Ala
35 40 45
Val Ile Ala Gln Asn Cys Tyr Lys Glu Lys Ser Leu Glu Glu Arg Arg
50 55 60
Asn Val Val Lys Ala Ile Arg Glu Ala Leu Tyr Pro Glu Ile Glu Thr
65 70 75 80
Ile Ala Thr Arg Ala Val Ala Glu Thr Gly Met Gly Asn Val Thr Asp
85 90 95
Lys Ile Leu Lys Asn Thr Leu Ala Ile Glu Lys Thr Pro Gly Val Glu
100 105 110
Asp Leu Tyr Thr Glu Val Ala Thr Gly Asp Asn Gly Met Thr Leu Tyr
115 120 125
Glu Leu Ser Pro Tyr Gly Val Ile Gly Ala Val Ala Pro Ser Thr Asn
130 135 140
Pro Thr Glu Thr Leu Ile Cys Asn Ser Ile Gly Met Leu Ala Ala Gly
145 150 155 160
Asn Ala Val Phe Tyr Ser Pro His Pro Gly Ala Lys Asn Ile Ser Leu
165 170 175
Trp Leu Ile Glu Lys Leu Asn Thr Ile Val Arg Asp Ser Cys Gly Ile
180 185 190
Asp Asn Leu Ile Val Thr Val Ala Lys Pro Ser Ile Gln Ala Ala Gln
195 200 205
Glu Met Met Asn His Pro Lys Val Pro Leu Leu Val Ile Thr Gly Gly
210 215 220
Pro Gly Val Val Leu Gln Ala Met Gln Ser Gly Lys Lys Val Ile Gly
225 230 235 240
Ala Gly Ala Gly Asn Pro Pro Ser Ile Val Asp Glu Thr Ala Asn Ile
245 250 255
Glu Lys Ala Ala Ala Asp Ile Val Asp Gly Ala Ser Phe Asp His Asn
260 265 270
Ile Leu Cys Ile Ala Glu Lys Ser Val Val Ala Val Asp Ser Ile Ala
275 280 285
Asp Phe Leu Leu Phe Gln Met Glu Lys Asn Gly Ala Leu His Val Thr
290 295 300
Asn Pro Ser Asp Ile Gln Lys Leu Glu Lys Val Ala Val Thr Asp Lys
305 310 315 320
Gly Val Thr Asn Lys Lys Leu Val Gly Lys Ser Ala Thr Glu Ile Leu
325 330 335
Lys Glu Ala Gly Ile Ala Cys Asp Phe Thr Pro Arg Leu Ile Ile Val
340 345 350
Glu Thr Glu Lys Ser His Pro Phe Ala Thr Val Glu Leu Leu Met Pro
355 360 365
Ile Val Pro Val Val Arg Val Pro Asp Phe Asp Glu Ala Leu Glu Val
370 375 380
Ala Ile Glu Leu Glu Gln Gly Leu His His Thr Ala Thr Met His Ser
385 390 395 400
Gln Asn Ile Ser Arg Leu Asn Lys Ala Ala Arg Asp Met Gln Thr Ser
405 410 415
Ile Phe Val Lys Asn Gly Pro Ser Phe Ala Gly Leu Gly Phe Arg Gly
420 425 430
Glu Gly Ser Thr Thr Phe Thr Ile Ala Thr Pro Thr Gly Glu Gly Thr
435 440 445
Thr Thr Ala Arg His Phe Ala Arg Arg Arg Arg Cys Val Leu Thr Asp
450 455 460
Gly Phe Ser Ile Arg
465
<210> 8
<211> 469
<212> PRT
<213> Listeria monocytogenes ATCC 19117
<400> 8
Met Glu Ser Leu Glu Leu Glu Gln Leu Val Lys Lys Val Leu Leu Glu
1 5 10 15
Lys Leu Ala Glu Gln Lys Asp Ala Pro Val Lys Thr Thr Val Lys Gly
20 25 30
Ala Lys Ser Gly Val Phe Asp Thr Val Asp Glu Ala Val Gln Ala Ala
35 40 45
Val Ile Ala Gln Asn Asn Tyr Lys Glu Lys Ser Leu Glu Glu Arg Arg
50 55 60
Asn Val Val Lys Ala Ile Arg Glu Ala Leu Tyr Pro Glu Ile Glu Ser
65 70 75 80
Ile Ala Ala Arg Ala Val Ala Glu Thr Gly Met Gly Asn Val Ala Asp
85 90 95
Lys Ile Leu Lys Asn Thr Leu Ala Ile Glu Lys Thr Pro Gly Val Glu
100 105 110
Asp Leu Tyr Thr Glu Val Ala Thr Gly Asp Asn Gly Met Thr Leu Tyr
115 120 125
Glu Leu Ser Pro Tyr Gly Val Ile Gly Ala Val Ala Pro Ser Thr Asn
130 135 140
Pro Thr Glu Thr Leu Ile Cys Asn Thr Ile Gly Met Leu Ala Ala Gly
145 150 155 160
Asn Ala Val Phe Tyr Ser Pro His Pro Gly Ala Lys Asn Ile Ser Leu
165 170 175
Trp Leu Ile Glu Lys Leu Asn Thr Ile Val Arg Glu Ser Cys Gly Ile
180 185 190
Asp Asn Leu Val Val Thr Val Glu Lys Pro Ser Ile Gln Ala Ala Gln
195 200 205
Glu Met Met Asn His Pro Lys Val Pro Leu Leu Val Ile Thr Gly Gly
210 215 220
Pro Gly Val Val Leu Gln Ala Met Gln Ser Gly Lys Lys Val Ile Gly
225 230 235 240
Ala Gly Ala Gly Asn Pro Pro Ser Ile Val Asp Glu Thr Ala Asn Ile
245 250 255
Glu Lys Ala Ala Ala Asp Ile Val Asp Gly Ala Ser Phe Asp His Asn
260 265 270
Ile Leu Cys Ile Ala Glu Lys Ser Ile Val Ala Val Asp Ser Ile Ala
275 280 285
Asp Phe Leu Met Phe Gln Met Glu Lys Asn Gly Ala Leu His Val Thr
290 295 300
Asn Pro Ser Asp Ile Gln Lys Leu Glu Lys Val Ala Val Thr Asp Lys
305 310 315 320
Gly Val Thr Asn Lys Lys Leu Val Gly Lys Ser Ala Ser Glu Ile Leu
325 330 335
Lys Glu Ala Gly Ile Val Cys Asp Phe Ser Pro Arg Leu Ile Ile Val
340 345 350
Glu Thr Glu Lys Thr His Pro Phe Ala Thr Val Glu Leu Leu Met Pro
355 360 365
Ile Val Pro Val Val Arg Val Pro Asn Phe Asp Glu Ala Leu Asp Val
370 375 380
Ala Ile Glu Leu Glu Gln Gly Leu His His Thr Ala Thr Met His Ser
385 390 395 400
Gln Asn Ile Ser Arg Leu Asn Lys Ala Ala Arg Asp Met Gln Thr Ser
405 410 415
Ile Phe Val Lys Asn Gly Pro Ser Phe Ala Gly Leu Gly Phe Arg Gly
420 425 430
Glu Gly Ser Thr Thr Phe Thr Ile Ala Thr Pro Thr Gly Glu Gly Thr
435 440 445
Thr Thr Ala Arg His Phe Ala Arg Arg Arg Arg Cys Val Leu Thr Asp
450 455 460
Gly Phe Ser Ile Arg
465
<210> 9
<211> 469
<212> PRT
<213> Listeria marthii FSL S4-120
<400> 9
Met Glu Ser Leu Glu Leu Glu Gln Leu Val Lys Lys Val Leu Leu Glu
1 5 10 15
Lys Leu Ala Glu Gln Lys Glu Ala Pro Ala Lys Pro Ile Thr Gln Gly
20 25 30
Ala Lys Ser Gly Ile Phe Asp Thr Val Asp Glu Ala Val Gln Ala Ala
35 40 45
Val Ile Ala Gln Asn Cys Tyr Lys Glu Lys Ser Leu Glu Glu Arg Arg
50 55 60
Asn Val Val Lys Ala Ile Arg Glu Thr Leu Tyr Pro Glu Ile Glu Thr
65 70 75 80
Ile Ala Thr Lys Ala Val Ala Glu Thr Gly Met Gly Asn Val Ala Asp
85 90 95
Lys Ile Leu Lys Asn Thr Leu Ala Ile Glu Lys Thr Pro Gly Val Glu
100 105 110
Asp Leu Tyr Thr Glu Val Ala Thr Gly Asp Asn Gly Met Thr Leu Tyr
115 120 125
Glu Leu Ser Pro Tyr Gly Val Ile Gly Ala Val Ala Pro Ser Thr Asn
130 135 140
Pro Thr Glu Thr Leu Ile Cys Asn Thr Ile Gly Met Leu Ala Ala Gly
145 150 155 160
Asn Ala Val Phe Tyr Ser Pro His Pro Gly Ala Lys Asn Ile Ser Leu
165 170 175
Trp Leu Ile Glu Lys Leu Asn Thr Ile Val Arg Glu Ser Cys Gly Ile
180 185 190
Asp Asn Leu Val Val Thr Val Glu Lys Pro Ser Ile Gln Ala Ala Gln
195 200 205
Glu Met Met Asn His Pro Lys Val Pro Leu Leu Val Ile Thr Gly Gly
210 215 220
Pro Gly Val Val Leu Gln Ala Met Gln Ser Gly Lys Lys Val Ile Gly
225 230 235 240
Ala Gly Ala Gly Asn Pro Pro Ser Ile Val Asp Glu Thr Ala Asn Ile
245 250 255
Glu Lys Ala Ala Ala Asp Ile Val Asp Gly Ala Ser Phe Asp His Asn
260 265 270
Ile Leu Cys Ile Ala Glu Lys Ser Ile Val Ala Val Glu Ser Ile Ala
275 280 285
Asp Phe Leu Leu Phe Gln Met Glu Lys Asn Gly Ala Leu His Val Thr
290 295 300
Asn Pro Ser Asp Ile Gln Lys Leu Glu Lys Val Ala Val Thr Asp Lys
305 310 315 320
Gly Val Thr Asn Lys Lys Leu Val Gly Lys Ser Ala Ala Glu Ile Leu
325 330 335
Lys Glu Ala Gly Ile Thr Cys Asp Phe Thr Pro Arg Leu Ile Ile Val
340 345 350
Glu Thr Thr Lys Thr His Pro Phe Ala Thr Val Glu Leu Leu Met Pro
355 360 365
Ile Val Pro Leu Val Arg Val Pro Asp Phe Asp Glu Ala Leu Glu Val
370 375 380
Ala Ile Glu Leu Glu Gln Gly Leu His His Thr Ala Thr Met His Ser
385 390 395 400
Gln Asn Ile Ser Arg Leu Asn Lys Ala Ala Arg Asp Met Gln Thr Ser
405 410 415
Ile Phe Val Lys Asn Gly Pro Ser Phe Ala Gly Leu Gly Phe Arg Gly
420 425 430
Glu Gly Ser Thr Thr Phe Thr Ile Ala Thr Pro Thr Gly Glu Gly Thr
435 440 445
Thr Thr Ala Arg His Phe Ala Arg Arg Arg Arg Cys Val Leu Thr Asp
450 455 460
Gly Phe Ser Ile Arg
465
<210> 10
<211> 469
<212> PRT
<213> Listeria ivanovii subsp. ivanovii PAM 55
<400> 10
Met Glu Ser Leu Glu Leu Glu Gln Leu Val Lys Lys Val Leu Leu Glu
1 5 10 15
Lys Leu Ala Gly Gln Asn Glu Glu Thr Pro Lys Lys Pro Ser Gln Gly
20 25 30
Ala Lys Ser Gly Ile Phe Asp Thr Val Asp Glu Ala Val Gln Ala Ala
35 40 45
Val Ile Ala Gln Asn Cys Tyr Lys Glu Lys Ser Leu Glu Asp Arg Arg
50 55 60
Asn Val Val Lys Ala Ile Arg Glu Ala Leu Tyr Pro Glu Ile Glu Asn
65 70 75 80
Ile Ala Thr Arg Ala Ala Ala Glu Thr Gly Met Gly Asn Val Ala Asp
85 90 95
Lys Ile Leu Lys Asn Thr Leu Ala Ile Glu Lys Thr Pro Gly Val Glu
100 105 110
Asp Leu Tyr Thr Glu Val Ala Thr Gly Asp Asn Gly Met Thr Leu Tyr
115 120 125
Glu Leu Ser Pro Tyr Gly Val Ile Gly Ala Val Ala Pro Ser Thr Asn
130 135 140
Pro Thr Glu Thr Leu Ile Cys Asn Thr Ile Gly Met Leu Ala Ala Gly
145 150 155 160
Asn Ala Val Phe Tyr Ser Pro His Pro Gly Ala Lys Asn Ile Ser Leu
165 170 175
Trp Leu Ile Glu Lys Leu Asn Thr Ile Val Arg Glu Ser Cys Gly Ile
180 185 190
Asp Asn Leu Val Val Thr Val Glu Lys Pro Ser Ile Gln Ala Ala Gln
195 200 205
Glu Met Met Asn His Pro Lys Val Pro Leu Leu Val Ile Thr Gly Gly
210 215 220
Pro Gly Val Val Leu Gln Ala Met Gln Ser Gly Lys Lys Val Ile Gly
225 230 235 240
Ala Gly Ala Gly Asn Pro Pro Ser Ile Val Asp Glu Thr Ala Asn Ile
245 250 255
Glu Lys Ala Ala Ala Asp Ile Val Ala Gly Ala Ser Phe Asp His Asn
260 265 270
Ile Leu Cys Ile Ala Glu Lys Ser Val Val Ala Val Asp Ser Ile Thr
275 280 285
Asp Phe Leu Leu Phe Gln Met Glu Lys Asn Gly Ala Phe His Val Thr
290 295 300
Asn Pro Ser Asp Ile Arg Lys Leu Glu Lys Val Ala Val Thr Glu Lys
305 310 315 320
Gly Val Thr Asn Lys Lys Leu Val Gly Lys Ser Ala Ser Glu Ile Leu
325 330 335
Lys Glu Ala Gly Ile Ala Cys Asp Phe Thr Pro Arg Leu Ile Ile Ala
340 345 350
Glu Thr Asp Arg Ser His Pro Phe Ala Thr Val Glu Leu Leu Met Pro
355 360 365
Ile Val Pro Val Val Arg Val Ala Asp Phe Asp Gln Ala Leu Glu Val
370 375 380
Ala Leu Glu Leu Glu Gln Gly Leu His His Thr Ala Thr Met His Ser
385 390 395 400
Gln Asn Ile Ser Arg Leu Asn Lys Ala Ala Arg Asp Met Gln Thr Ser
405 410 415
Ile Phe Val Lys Asn Gly Pro Ser Phe Ala Gly Leu Gly Phe Gly Gly
420 425 430
Glu Gly Ser Ala Thr Phe Thr Ile Ala Thr Pro Thr Gly Glu Gly Thr
435 440 445
Thr Thr Ala Arg His Phe Ala Arg Arg Arg Arg Cys Val Leu Thr Asp
450 455 460
Gly Phe Ser Ile Arg
465
<210> 11
<211> 469
<212> PRT
<213> Listeria seeligeri serovar 1/2b str. SLCC3954
<400> 11
Met Glu Ser Leu Glu Leu Glu Gln Leu Val Lys Lys Val Leu Leu Glu
1 5 10 15
Lys Leu Ala Gly Gln Asn Glu Glu Thr Pro Lys Lys Pro Ser Gln Gly
20 25 30
Ala Lys Ser Gly Ile Phe Asp Thr Val Asp Glu Ala Val Gln Ala Ala
35 40 45
Val Ile Ala Gln Asn Cys Tyr Lys Glu Lys Ser Leu Glu Asp Arg Arg
50 55 60
Asn Val Val Lys Ala Ile Arg Glu Ala Leu Tyr Pro Glu Ile Lys Asn
65 70 75 80
Ile Ala Thr Arg Ala Val Ala Glu Thr Gly Met Gly Asn Val Ala Asp
85 90 95
Lys Ile Leu Lys Asn Thr Leu Ala Ile Glu Lys Thr Pro Gly Val Glu
100 105 110
Asp Leu Tyr Thr Glu Val Ala Thr Gly Asp Asn Gly Met Thr Leu Tyr
115 120 125
Glu Leu Ser Pro Tyr Gly Val Ile Gly Ala Val Ala Pro Ser Thr Asn
130 135 140
Pro Thr Glu Thr Leu Ile Cys Asn Thr Ile Gly Met Leu Ala Ala Gly
145 150 155 160
Asn Ala Val Phe Tyr Ser Pro His Pro Gly Ala Lys Asn Ile Ser Leu
165 170 175
Trp Leu Ile Glu Lys Leu Asn Thr Ile Val Arg Glu Ser Cys Gly Ile
180 185 190
Asp Asn Leu Val Val Thr Val Glu Lys Pro Ser Ile Gln Ala Ala Gln
195 200 205
Glu Met Met Asn His Pro Lys Val Pro Leu Leu Val Ile Thr Gly Gly
210 215 220
Pro Gly Val Val Leu Gln Ala Met Gln Ser Gly Lys Lys Val Ile Gly
225 230 235 240
Ala Gly Ala Gly Asn Pro Pro Ser Ile Val Asp Glu Thr Ala Asn Ile
245 250 255
Glu Lys Ala Ala Ala Asp Ile Val Ala Gly Ala Ser Phe Asp His Asn
260 265 270
Ile Leu Cys Ile Ala Glu Lys Ser Val Val Ala Val Asp Ser Ile Thr
275 280 285
Asp Phe Leu Leu Phe Gln Met Glu Lys Asn Gly Ala Leu His Val Thr
290 295 300
Asn Pro Ser Asp Ile Arg Lys Leu Glu Lys Val Ala Val Thr Glu Lys
305 310 315 320
Gly Val Thr Asn Lys Lys Leu Val Gly Lys Ser Ala Ser Glu Ile Leu
325 330 335
Lys Glu Ala Gly Ile Ala Cys Asp Phe Thr Pro Arg Leu Ile Ile Val
340 345 350
Glu Thr Asp Arg Ser His Pro Phe Ala Thr Val Glu Leu Leu Met Pro
355 360 365
Ile Val Pro Val Val Arg Val Ala Asp Phe Asp Gln Ala Leu Glu Val
370 375 380
Ala Leu Glu Leu Glu Gln Gly Leu His His Thr Ala Thr Met His Ser
385 390 395 400
Gln Asn Ile Ser Arg Leu Asn Lys Ala Ala Arg Asp Met Gln Thr Ser
405 410 415
Ile Phe Val Lys Asn Gly Pro Ser Phe Ala Gly Leu Gly Phe Gly Gly
420 425 430
Glu Gly Ser Ala Thr Phe Thr Ile Ala Thr Pro Thr Gly Glu Gly Thr
435 440 445
Thr Thr Ala Arg His Phe Ala Arg Arg Arg Arg Cys Val Leu Thr Asp
450 455 460
Gly Phe Ser Ile Arg
465
<210> 12
<211> 464
<212> PRT
<213> Shewanella putrefaciens CN-32
<400> 12
Met Asn Thr Thr Glu Leu Glu Asn Met Ile Arg Asn Ile Leu Ala Asp
1 5 10 15
Asn Leu Lys Gly Thr Ala Thr Ala Pro Gly Asn Ile Gln His Thr Ile
20 25 30
Phe Ala Arg Val Glu Asp Ala Ile Thr Ala Ser Tyr Asp Ala Tyr Lys
35 40 45
Lys Tyr Met Ala Glu Pro Leu Ala Leu Arg Thr Arg Ile Ile Thr Ala
50 55 60
Leu Lys Glu Glu Leu Ala Pro Trp Ile Lys Glu Met Ser Glu Arg Ala
65 70 75 80
Ala Glu Glu Thr Gly Met Gly Asn Ala Pro Asp Lys Ile Ser Lys Asn
85 90 95
Thr Ala Ala Leu Asn Asn Thr Pro Gly Ile Glu Asp Leu Thr Thr Ser
100 105 110
Ala Leu Thr Gly Asp Gly Gly Met Val Leu Phe Glu Leu Ser Pro Phe
115 120 125
Gly Val Ile Gly Ala Ile Ala Pro Ser Thr Asn Pro Thr Glu Thr Ile
130 135 140
Ile Asn Asn Thr Ile Ser Met Leu Ala Ala Gly Asn Ala Val Tyr Phe
145 150 155 160
Ser Pro His Pro Gly Ala Lys Lys Val Ser Leu Trp Leu Ile Glu Lys
165 170 175
Ile Glu Asp Ile Ile Tyr Arg Val Ser Gly Ile Arg Asn Leu Val Thr
180 185 190
Thr Val Ala Glu Pro Thr Phe Asp Ala Thr Arg Glu Met Met Ser Asp
195 200 205
Pro Arg Ile Ala Leu Leu Ala Val Thr Gly Gly Pro Ala Ile Val Asn
210 215 220
Met Ala Met Lys Thr Gly Lys Lys Val Ile Gly Ala Gly Pro Gly Asn
225 230 235 240
Pro Pro Val Leu Val Asp Glu Thr Ala Cys Pro Val Lys Ala Ala Lys
245 250 255
Asp Ile Val Asp Gly Ala Ser Phe Asp His Asn Val Leu Cys Ile Ala
260 265 270
Glu Lys Cys Val Ile Val Val Asp Ser Ile Ala Asp Arg Leu Met Asp
275 280 285
Asn Met Gln Lys Asn Asp Ala Phe Leu Val Lys Thr Pro Gly Asp Ile
290 295 300
Ala Arg Leu Arg Lys Val Val Ile Asn Asp Lys Gly Glu Ala Asn Lys
305 310 315 320
Lys Leu Val Gly Lys Ser Pro Ala Val Ile Leu Gln Ala Ala Asp Leu
325 330 335
Asn Thr Ser Thr Ala Pro Arg Leu Ile Ile Val Glu Val Glu Gln Asp
340 345 350
Asp Pro Leu Val Met Val Glu Gln Leu Met Pro Val Leu Pro Val Val
355 360 365
Arg Val Ser Asp Phe Glu Thr Gly Leu Ala Leu Ala Leu Lys Val Glu
370 375 380
Asn Glu Gln His His Thr Ala Ile Met His Ser Gln Asn Val Thr Arg
385 390 395 400
Leu Asn Leu Ala Ala Lys Thr Met Gln Thr Ser Ile Phe Val Lys Asn
405 410 415
Gly Pro Ser Tyr Ala Gly Leu Gly Ile Gly Ala Glu Gly Phe Thr Thr
420 425 430
Phe Thr Ile Ala Thr Pro Thr Gly Glu Gly Thr Thr Ser Ala Arg Ser
435 440 445
Phe Ala Arg Lys Arg Arg Cys Val Leu Thr Asn Gly Phe Ser Ile Arg
450 455 460
<210> 13
<211> 464
<212> PRT
<213> Kosakonia radicincitans DSM 16656
<400> 13
Met Asn Thr Thr Glu Leu Glu Asn Met Ile Arg Thr Ile Leu Ala Asp
1 5 10 15
Asn Leu Thr Gly Ile Ala Thr Ala Pro Gly Asn Ile Gln His Thr Ile
20 25 30
Phe Ala Arg Val Glu Asp Ala Ile Thr Ala Ser Tyr Asp Ala Tyr Lys
35 40 45
Lys Tyr Leu Ala Glu Pro Leu Ala Leu Arg Thr Arg Ile Ile Thr Ala
50 55 60
Leu Lys Glu Glu Leu Ala Pro Trp Ile Lys Glu Met Ser Glu Arg Ala
65 70 75 80
Ala Glu Glu Thr Gly Met Gly Asn Ala Leu Asp Lys Ile Ser Lys Asn
85 90 95
Thr Ala Ala Leu Asn Asn Thr Pro Gly Ile Glu Asp Leu Thr Thr Ser
100 105 110
Ala Leu Thr Gly Asp Gly Gly Met Val Leu Phe Glu Leu Ser Pro Phe
115 120 125
Gly Val Ile Gly Ala Ile Ala Pro Ser Thr Asn Pro Thr Glu Thr Ile
130 135 140
Ile Asn Asn Thr Ile Ser Met Leu Ala Ala Gly Asn Ala Val Tyr Phe
145 150 155 160
Ser Pro His Pro Gly Ala Lys Lys Val Ser Leu Trp Leu Ile Glu Lys
165 170 175
Ile Glu Asp Ile Ile Tyr Arg Val Ser Gly Ile Arg Asn Leu Val Thr
180 185 190
Thr Val Ala Glu Pro Thr Phe Asp Ala Thr Arg Glu Met Met Ser Asp
195 200 205
Pro Arg Ile Ala Leu Leu Val Val Thr Gly Gly Pro Ala Ile Val Asn
210 215 220
Met Ala Met Lys Thr Gly Lys Lys Val Ile Gly Ala Gly Pro Gly Asn
225 230 235 240
Pro Pro Val Leu Val Asp Glu Thr Ala Cys Pro Val Lys Ala Ala Lys
245 250 255
Asp Ile Val Asp Gly Ala Ser Phe Asp His Asn Val Leu Cys Ile Ala
260 265 270
Glu Lys Cys Val Ile Val Val Asp Ser Ile Ala Asp Arg Leu Val Glu
275 280 285
Asn Met Gln Lys Asn Asp Ala Phe Leu Val Lys Thr Pro Gly Asp Ile
290 295 300
Ala Arg Leu Arg Gln Val Val Ile Asn Asp Lys Gly Glu Ala Asn Lys
305 310 315 320
Lys Leu Val Gly Lys Ser Pro Ala Val Ile Leu Gln Ala Ala Asp Leu
325 330 335
Asn Thr Ser Thr Ala Pro Arg Leu Ile Ile Val Glu Val Glu Gln Asp
340 345 350
Asp Pro Leu Val Met Val Glu Gln Leu Met Pro Val Leu Pro Val Val
355 360 365
Arg Val Arg Asp Phe Glu Thr Gly Leu Ala Leu Ala Leu Lys Val Glu
370 375 380
Asn Asp Gln His His Thr Ala Ile Met His Ser Gln Asn Val Ser Arg
385 390 395 400
Leu Asn Leu Ala Ala Lys Thr Met Gln Thr Ser Ile Phe Val Lys Asn
405 410 415
Gly Pro Ser Tyr Ala Gly Leu Gly Ile Glu Ala Glu Gly Phe Thr Thr
420 425 430
Phe Thr Ile Ala Thr Pro Thr Gly Glu Gly Thr Thr Ser Ala Arg Ser
435 440 445
Phe Ala Arg Lys Arg Arg Cys Val Leu Thr Asn Gly Phe Ser Ile Arg
450 455 460
<210> 14
<211> 467
<212> PRT
<213> Tolumonas auensis DSM 9187
<400> 14
Met Asn Asn Thr Glu Leu Glu Ser Leu Ile Arg Thr Ile Leu Thr Glu
1 5 10 15
Gln Leu Thr Pro Ser Ala Thr Asp Thr Pro Ala Cys Thr Ala Ser Ser
20 25 30
Val Ala Leu Phe Asp Asp Val Asp Ser Ala Ile Cys Ala Ala His Ala
35 40 45
Ala Phe Leu Arg Tyr Gln Glu Ala Pro Leu Lys Thr Arg Ser Ala Ile
50 55 60
Ile Ala Ala Ile Arg Ala Glu Ile Ala Pro Cys Leu Ser Glu Leu Ala
65 70 75 80
Glu Arg Ala Ala Ala Glu Thr Gly Met Gly Asn Thr Ala Asp Lys Ile
85 90 95
Leu Lys Asn Lys Ala Ala Leu Glu Asn Thr Pro Gly Ile Glu Asp Leu
100 105 110
Lys Thr Thr Ala Leu Thr Gly Asp Glu Gly Met Val Leu Phe Glu Tyr
115 120 125
Ser Pro Phe Gly Val Val Gly Ala Val Ala Pro Ser Thr Asn Pro Thr
130 135 140
Glu Thr Ile Ile Asn Asn Ser Ile Ser Met Leu Ala Ala Gly Asn Ala
145 150 155 160
Ile Tyr Phe Ser Pro His Pro Gly Ala Lys Asn Ile Ser Leu Trp Leu
165 170 175
Ile Gln Lys Met Glu Glu Ile Ala Phe Lys Val Cys Gly Ile His Asn
180 185 190
Leu Ile Val Thr Val Lys Glu Pro Thr Phe Glu Ala Thr Gln Gln Met
195 200 205
Met Ala His Asp Lys Ile Ala Leu Leu Ala Ile Thr Gly Gly Pro Gly
210 215 220
Ile Val Asn Met Gly Leu Lys Ser Gly Lys Lys Val Ile Gly Ala Gly
225 230 235 240
Ala Gly Asn Pro Pro Cys Leu Val Asp Glu Thr Ala Glu Ile Val Lys
245 250 255
Ala Ala Gln Asp Ile Val Ala Gly Ala Ser Phe Asp Tyr Asn Leu Pro
260 265 270
Cys Ile Ala Glu Lys Ser Val Ile Ala Val Asp Cys Ile Ala Asp Gln
275 280 285
Leu Ile Gln Gln Met Arg Glu Phe Gly Ala Met Gln Ile Thr Asp Pro
290 295 300
Gln Gln Ile Ala Gln Leu Arg Glu Val Cys Ile Gln Lys Gly Ala Ala
305 310 315 320
Asn Lys Ser Leu Val Gly Lys Ser Pro Ala Thr Ile Leu Ala Ala Ala
325 330 335
Gly Ile Pro Cys Pro Ala Lys Glu Pro Arg Leu Ile Ile Leu Glu Val
340 345 350
Pro Ala Asn Asp Pro Phe Val Val Thr Glu Gln Leu Met Pro Val Leu
355 360 365
Pro Ile Val Arg Val Asp Asn Phe Glu Gln Gly Leu Gln Leu Ala Leu
370 375 380
Lys Val Glu Asp Gly Leu His His Thr Ala Met Met His Ser Gln Asn
385 390 395 400
Val Ser Arg Leu Asn Lys Ala Ala His Leu Met Gln Thr Ser Ile Phe
405 410 415
Val Lys Asn Gly Pro Ser Tyr Ala Gly Ile Gly Val Gly Ala Glu Gly
420 425 430
Phe Thr Thr Phe Thr Ile Ala Thr Pro Thr Gly Glu Gly Thr Thr Ser
435 440 445
Ala Arg Thr Phe Gly Arg Leu Arg Arg Cys Val Leu Thr Asn Gly Phe
450 455 460
Ser Ile Arg
465
<210> 15
<211> 461
<212> PRT
<213> Citrobacter koseri ATCC BAA-895
<400> 15
Met Asn Thr Ser Glu Leu Glu Thr Leu Ile Arg Asn Ile Leu Ser Glu
1 5 10 15
Gln Leu Ala Pro Ala Gln Ala Glu Thr Gln Gly His Gly Ile Phe Gln
20 25 30
Ser Val Gly Glu Ala Ile Asp Ala Ala His Gln Ala Phe Leu Arg Tyr
35 40 45
Gln Gln Cys Pro Leu Lys Thr Arg Ser Ala Ile Ile Ser Ala Leu Arg
50 55 60
Gln Glu Leu Thr Pro His Leu Ala Thr Leu Ala Ala Glu Ser Ala Ala
65 70 75 80
Glu Thr Gly Met Gly Asn Lys Glu Asp Lys Phe Leu Lys Asn Lys Ala
85 90 95
Ala Leu Asp Asn Thr Pro Gly Ile Glu Asp Leu Thr Thr Thr Ala Leu
100 105 110
Thr Gly Asp Gly Gly Met Val Leu Phe Glu Tyr Ser Pro Phe Gly Val
115 120 125
Ile Gly Ser Val Ala Pro Ser Thr Asn Pro Thr Glu Thr Ile Ile Asn
130 135 140
Asn Ser Ile Ser Met Leu Ala Ala Gly Asn Ser Val Tyr Phe Ser Pro
145 150 155 160
His Pro Gly Ala Lys Asn Val Ser Leu Lys Leu Ile Gly Met Ile Glu
165 170 175
Asp Ile Ala Phe Arg Cys Cys Gly Ile Arg Asn Leu Val Val Thr Val
180 185 190
Ala Glu Pro Thr Phe Glu Ala Thr Gln Gln Met Met Ala His Pro Asn
195 200 205
Ile Ala Val Leu Ala Ile Thr Gly Gly Pro Gly Ile Val Ala Met Gly
210 215 220
Met Lys Ser Gly Lys Lys Val Ile Gly Ala Gly Ala Gly Asn Pro Pro
225 230 235 240
Cys Ile Val Asp Glu Thr Ala Asp Ile Val Lys Ala Ala Glu Asp Ile
245 250 255
Ile Asn Gly Ala Ala Phe Asp Tyr Asn Leu Pro Cys Ile Ala Glu Lys
260 265 270
Ser Leu Ile Val Val Glu Ser Val Ala Glu Arg Leu Val Gln Gln Met
275 280 285
Gln Ala Phe Gly Ala Leu Leu Leu Asn Ala Ala Asp Ile Asp Lys Leu
290 295 300
Arg Ala Val Cys Leu Pro Glu Gly His Ala Asn Lys Lys Leu Val Gly
305 310 315 320
Lys Ser Pro Ala Ala Met Leu Glu Ala Ala Gly Ile Ala Val Pro Ala
325 330 335
Lys Pro Pro Arg Leu Leu Ile Gly Ile Val Ser Ala Asp Asp Pro Trp
340 345 350
Val Thr Ser Glu Gln Leu Met Pro Met Leu Pro Val Val Lys Val Asp
355 360 365
Asn Phe Asp Ser Ala Leu Ala Leu Ala Leu Lys Val Glu Glu Gly Leu
370 375 380
His His Thr Ala Ile Met His Ser Gln Asn Val Ser Arg Leu Asn Leu
385 390 395 400
Ala Ala Arg Thr Leu Gln Thr Ser Ile Phe Val Lys Asn Gly Pro Ser
405 410 415
Tyr Ala Gly Ile Gly Val Gly Gly Glu Gly Phe Thr Thr Phe Thr Ile
420 425 430
Ala Thr Pro Thr Gly Glu Gly Thr Thr Ser Ala Arg Thr Phe Ala Arg
435 440 445
Ser Arg Arg Cys Val Leu Thr Asn Gly Phe Ser Ile Arg
450 455 460
<210> 16
<211> 462
<212> PRT
<213> Yersinia enterocolitica subsp. enterocolitica 8081
<400> 16
Met Asn Thr Asn Asp Leu Glu Ser Leu Ile Arg Thr Ile Leu Thr Glu
1 5 10 15
Gln Leu Thr Pro Val Thr Ala Pro Ala Ser Ser Ala Ile Phe Ala Ser
20 25 30
Val Asp Glu Ala Ile Asn Ala Ala His Ser Ala Phe Leu Arg Tyr Gln
35 40 45
Gln Ser Pro Met Lys Thr Arg Ser Ala Ile Ile Arg Ala Ile Arg Glu
50 55 60
Gln Leu Lys Pro Gln Leu Val Ser Leu Ser Glu Arg Gly Ala Ser Glu
65 70 75 80
Thr Gly Met Gly Asn Lys Glu Asp Lys Phe Leu Lys Asn Lys Ala Ala
85 90 95
Leu Glu Asn Thr Pro Gly Ile Glu Asp Leu Ser Thr Thr Ala Leu Thr
100 105 110
Gly Asp Gly Gly Met Val Leu Phe Glu Tyr Ser Pro Phe Gly Val Ile
115 120 125
Gly Ser Val Thr Pro Ser Thr Asn Pro Thr Glu Thr Ile Ile Asn Asn
130 135 140
Ser Ile Ser Met Leu Ala Ala Gly Asn Ala Val Tyr Phe Ser Pro His
145 150 155 160
Pro Gly Ala Lys Ala Val Ser Leu Asp Leu Ile Ala Gln Ile Glu Glu
165 170 175
Ile Ile Phe Asn Ser Cys Gly Ile Arg Asn Leu Val Val Thr Val Lys
180 185 190
Glu Pro Ser Phe Glu Ala Thr Gln Gln Met Met Ala His Asp Lys Ile
195 200 205
Ala Leu Leu Ala Ile Thr Gly Gly Pro Ala Ile Val Ala Met Ser Met
210 215 220
Lys Ser Gly Lys Lys Val Ile Gly Ala Gly Ala Gly Asn Pro Pro Cys
225 230 235 240
Leu Val Asp Glu Thr Ala Glu Leu Val Lys Ala Ala Gln Asp Ile Val
245 250 255
Ala Gly Ala Ser Phe Asp Tyr Asn Leu Pro Cys Ile Ala Glu Lys Ser
260 265 270
Leu Ile Val Val Glu Ser Val Ala Asp Arg Leu Leu Gln Gln Met Gln
275 280 285
Ala Phe Asp Ala Leu Leu Ile Ser Asn Pro Gln Glu Ile Asp Ser Leu
290 295 300
Arg Lys Ala Cys Leu Thr Pro Gln Gly His Ala Asn Lys Asn Leu Val
305 310 315 320
Gly Lys Ser Pro Ile Glu Leu Leu Lys Ala Ala Gly Ile Thr Cys Pro
325 330 335
Ala Lys Ala Pro Arg Leu Leu Leu Val Glu Val Ala Gly Asp Asp Pro
340 345 350
Leu Val Thr Thr Glu Gln Leu Met Pro Leu Leu Pro Val Val Arg Val
355 360 365
Lys Asp Phe Asp Ala Ala Leu Thr Leu Ala Leu His Val Glu Gly Gly
370 375 380
Leu His His Thr Ala Thr Met His Ser Gln Asn Val Ser Arg Leu Asn
385 390 395 400
Leu Ala Ala Arg Leu Leu Gln Thr Ser Ile Phe Val Lys Asn Gly Pro
405 410 415
Ser Tyr Ala Gly Ile Gly Val Gly Gly Glu Gly Phe Thr Thr Phe Thr
420 425 430
Ile Ala Thr Pro Thr Gly Glu Gly Thr Thr Ser Ala Arg Thr Phe Ala
435 440 445
Arg Gln Arg Arg Cys Val Leu Thr Asn Gly Phe Ser Ile Arg
450 455 460
<210> 17
<211> 464
<212> PRT
<213> Salmonella enterica subsp. enterica serovar Mbandaka str. ATCC 51958
<400> 17
Met Asn Thr Ser Glu Leu Glu Thr Leu Ile Arg Thr Ile Leu Ser Glu
1 5 10 15
Gln Leu Thr Thr Pro Ala Gln Thr Thr Ala Gln Pro Gln Gly Lys Gly
20 25 30
Ile Phe Gln Ser Val Ser Glu Ala Ile Asp Ala Ala His Gln Ala Phe
35 40 45
Leu Arg Tyr Gln Gln Cys Pro Leu Lys Thr Arg Ser Ala Ile Ile Ser
50 55 60
Ala Met Arg Gln Glu Leu Thr Pro Leu Leu Ala Thr Leu Ala Glu Glu
65 70 75 80
Ser Ala Asn Glu Thr Gly Met Gly Asn Lys Glu Asp Lys Leu Leu Lys
85 90 95
Asn Lys Ala Ala Leu Asp Asn Thr Pro Gly Val Glu Asp Leu Thr Thr
100 105 110
Thr Ala Leu Thr Gly Asp Gly Gly Met Val Leu Phe Glu Tyr Ser Pro
115 120 125
Phe Gly Val Ile Gly Ser Val Ala Pro Ser Thr Asn Pro Thr Glu Thr
130 135 140
Ile Ile Asn Asn Ser Ile Ser Met Leu Ala Ala Gly Asn Ser Val Tyr
145 150 155 160
Phe Ser Pro His Pro Gly Ala Lys Lys Val Ser Leu Lys Leu Ile Ser
165 170 175
Leu Ile Glu Glu Ile Ala Phe Arg Cys Cys Gly Ile Arg Asn Leu Val
180 185 190
Val Thr Val Ala Glu Pro Thr Phe Glu Ala Thr Gln Gln Met Met Ala
195 200 205
His Pro Arg Ile Ala Val Leu Ala Ile Thr Gly Gly Pro Gly Ile Val
210 215 220
Ala Met Gly Met Lys Ser Gly Lys Lys Val Ile Gly Ala Gly Ala Gly
225 230 235 240
Asn Pro Pro Cys Ile Val Asp Glu Thr Ala Asp Leu Val Lys Ala Ala
245 250 255
Glu Asp Ile Ile Asn Gly Ala Ser Phe Asp Tyr Asn Leu Pro Cys Ile
260 265 270
Ala Glu Lys Ser Leu Ile Val Val Glu Ser Val Ala Glu Arg Leu Val
275 280 285
Gln Gln Met Gln Thr Phe Gly Ala Leu Leu Leu Ser Pro Ala Asp Thr
290 295 300
Asp Lys Leu Arg Ala Val Cys Leu Pro Glu Gly Gln Ala Asn Lys Lys
305 310 315 320
Leu Val Gly Lys Ser Pro Ser Ala Met Leu Glu Ala Ala Gly Ile Ala
325 330 335
Val Pro Ala Lys Ala Pro Arg Leu Leu Ile Ala Leu Val Ser Ala Asp
340 345 350
Asp Pro Trp Val Thr Ser Glu Gln Leu Met Pro Met Leu Pro Val Val
355 360 365
Lys Val Ser Asp Phe Asp Ser Ala Leu Ala Leu Ala Leu Lys Val Glu
370 375 380
Glu Gly Leu His His Thr Ala Ile Met His Ser Gln Asn Val Ser Arg
385 390 395 400
Leu Asn Leu Ala Ala Arg Thr Leu Gln Thr Ser Ile Phe Val Lys Asn
405 410 415
Gly Pro Ser Tyr Ala Gly Ile Gly Val Gly Gly Glu Gly Phe Thr Thr
420 425 430
Phe Thr Ile Ala Thr Pro Thr Gly Glu Gly Thr Thr Ser Ala Arg Thr
435 440 445
Phe Ala Arg Ser Arg Arg Cys Val Leu Thr Asn Gly Phe Ser Ile Arg
450 455 460
<210> 18
<211> 462
<212> PRT
<213> Yersinia mollaretii ATCC 43969
<400> 18
Met Asn Thr His Asp Ile Glu Ser Leu Ile Arg Thr Ile Leu Thr Glu
1 5 10 15
Gln Leu Thr Pro Ala Thr Ala Ser Ala Val Ser Ala Ile Phe Ala Ser
20 25 30
Val Asp Glu Ala Val Thr Ala Ala His Ser Ala Phe Leu Arg Tyr Gln
35 40 45
Gln Ser Pro Met Lys Thr Arg Ser Ala Ile Ile Ser Ala Leu Arg Glu
50 55 60
Gln Leu Ala Pro Gln Leu Ala Ser Leu Ser Glu Arg Gly Ala Ser Glu
65 70 75 80
Thr Gly Met Gly Asn Lys Glu Asp Lys Phe Leu Lys Asn Arg Ala Ala
85 90 95
Leu Glu Asn Thr Pro Gly Ile Glu Asp Leu Ser Thr Thr Ala Leu Thr
100 105 110
Gly Asp Gly Gly Met Val Leu Phe Glu Tyr Ser Pro Phe Gly Val Ile
115 120 125
Gly Ser Val Ala Pro Ser Thr Asn Pro Thr Glu Thr Ile Ile Asn Asn
130 135 140
Ser Ile Ser Met Leu Ala Ala Gly Asn Ala Val Tyr Phe Ser Pro His
145 150 155 160
Pro Gly Ala Lys Ala Val Ser Leu Asp Leu Ile Ala Gln Ile Glu Ala
165 170 175
Ile Ile Phe Asn Arg Cys Gly Ile Arg Asn Leu Val Val Thr Val Gln
180 185 190
Glu Pro Ser Phe Glu Ala Thr Gln Gln Met Met Ala His Asp Lys Ile
195 200 205
Ala Leu Leu Ala Ile Thr Gly Gly Pro Ala Ile Val Ala Met Gly Met
210 215 220
Lys Ser Gly Lys Lys Val Ile Gly Ala Gly Ala Gly Asn Pro Pro Cys
225 230 235 240
Leu Val Asp Glu Thr Ala Glu Leu Val Lys Ala Ala Gln Asp Ile Val
245 250 255
Ser Gly Ala Ser Phe Asp Tyr Asn Leu Pro Cys Ile Ala Glu Lys Ser
260 265 270
Leu Ile Val Val Glu Ser Val Ala Asp Arg Leu Leu Gln Gln Met Gln
275 280 285
Ala Phe Asp Ala Leu Leu Ile Thr Gln Pro Gln Glu Val Asp Ser Leu
290 295 300
Arg Lys Ala Cys Leu Thr Pro Gln Gly His Ala Asn Lys Asn Leu Val
305 310 315 320
Gly Lys Ser Pro Ala Glu Leu Leu Lys Ala Ala Gly Ile Thr Cys Pro
325 330 335
Ala Lys Ala Pro Arg Leu Leu Leu Val Glu Val Ala Gly Asp Asp Pro
340 345 350
Leu Val Thr Thr Glu Gln Leu Met Pro Leu Leu Pro Val Val Arg Val
355 360 365
Lys Asp Phe Asp Ala Ala Leu Thr Leu Ala Leu Gln Val Glu Gly Gly
370 375 380
Leu His His Thr Ala Thr Met His Ser Gln Asn Val Ser Arg Leu Asn
385 390 395 400
Leu Ala Ala Arg Leu Leu Gln Thr Ser Ile Phe Val Lys Asn Gly Pro
405 410 415
Ser Tyr Ala Gly Ile Gly Val Gly Gly Glu Gly Phe Thr Thr Phe Thr
420 425 430
Ile Ala Thr Pro Thr Gly Glu Gly Thr Thr Ser Ala Arg Thr Phe Ala
435 440 445
Arg Gln Arg Arg Cys Val Leu Thr Asn Gly Phe Ser Ile Arg
450 455 460
<210> 19
<211> 460
<212> PRT
<213> Escherichia fergusonii ATCC 35469
<400> 19
Met Asn Thr Arg Glu Leu Glu Asn Ile Ile Arg Asn Ile Leu Arg Glu
1 5 10 15
Gln Leu Ser Thr Thr Ala Asp Ala Pro Thr Asn Gly Ile Phe Asp Ser
20 25 30
Val Asp Glu Ala Ile Asn Ala Ala His Gln Ala Phe Leu Arg Tyr Gln
35 40 45
Gln Cys Pro Leu Lys Thr Arg Ser Ala Ile Ile Ser Ala Ile Arg Gln
50 55 60
Glu Leu Thr Pro His Leu Asp Met Leu Ala Thr Glu Ser Ala Asn Glu
65 70 75 80
Thr Gly Met Gly Asn Lys Glu Asp Lys Phe Leu Lys Asn Lys Ala Ala
85 90 95
Leu Asp Asn Thr Pro Gly Ile Glu Asp Leu Thr Thr Thr Ala Leu Thr
100 105 110
Gly Asp Gly Gly Met Val Leu Phe Glu Tyr Ser Pro Phe Gly Val Ile
115 120 125
Gly Ser Val Thr Pro Ser Thr Asn Pro Thr Glu Thr Ile Ile Asn Asn
130 135 140
Ser Ile Ser Met Leu Ala Ala Gly Asn Ser Val Tyr Phe Ser Pro His
145 150 155 160
Pro Gly Ala Lys Asn Ile Ser Leu Lys Leu Ile Ala Met Ile Glu Glu
165 170 175
Ile Ala Phe Arg Cys Ser Gly Ile His Asn Leu Ile Val Thr Val Ala
180 185 190
Glu Pro Thr Phe Glu Ala Thr Gln Gln Met Met Thr His Pro Asn Ile
195 200 205
Ala Val Leu Ala Ile Thr Gly Gly Pro Gly Ile Val Ala Met Gly Met
210 215 220
Lys Ser Gly Lys Lys Val Ile Gly Ala Gly Ala Gly Asn Pro Pro Cys
225 230 235 240
Ile Val Asp Glu Thr Ala Asp Leu Val Lys Ala Ala Glu Asp Ile Ile
245 250 255
Asn Gly Ala Ser Phe Asp Tyr Asn Leu Pro Cys Ile Ala Glu Lys Ser
260 265 270
Leu Ile Val Val Glu Glu Ile Ala Gly Thr Leu Val Gln Gln Met Gln
275 280 285
Asn Phe Gly Ala Leu Leu Leu Asn Lys Glu Glu Thr Asp Lys Leu Arg
290 295 300
Asp Val Cys Leu Pro Gln Gly Met Ala Asn Lys Gln Leu Val Gly Lys
305 310 315 320
Ser Pro Ala Ala Leu Leu Gln Ala Ala Gly Ile Ala Val Pro Leu Lys
325 330 335
Thr Pro Arg Leu Leu Ile Ala Leu Val Asp Ala Cys Asp Lys Trp Val
340 345 350
Thr Ser Glu Gln Leu Met Pro Met Leu Pro Ile Val Lys Val Lys Asp
355 360 365
Phe Asp Ser Ala Leu Thr Leu Ala Leu Lys Val Glu Glu Gly Leu His
370 375 380
His Thr Ala Ile Met His Ser Gln Asn Val Ser Arg Leu Asn Leu Ala
385 390 395 400
Ala Arg Thr Leu Gln Thr Ser Ile Phe Val Lys Asn Gly Pro Ser Tyr
405 410 415
Ala Gly Ile Gly Val Gly Gly Glu Gly Phe Thr Thr Phe Thr Ile Ala
420 425 430
Thr Pro Thr Gly Glu Gly Thr Thr Ser Ala Lys Thr Phe Ala Arg Ser
435 440 445
Arg Arg Cys Val Leu Thr Ser Gly Phe Ser Ile Arg
450 455 460
<210> 20
<211> 464
<212> PRT
<213> Salmonella enterica subsp. enterica serovar Urbana str. ATCC 9261
<400> 20
Met Asn Thr Ser Glu Leu Glu Thr Leu Ile Arg Thr Ile Leu Ser Glu
1 5 10 15
Gln Leu Thr Thr Pro Ala Gln Thr Pro Ala Gln Pro Lys Gly Lys Gly
20 25 30
Ile Phe Gln Ser Val Ser Glu Ala Ile Asp Ala Ala His Gln Ala Phe
35 40 45
Leu Arg Tyr Gln Gln Cys Pro Leu Lys Thr Arg Ser Ala Ile Ile Ser
50 55 60
Ala Met Arg Gln Glu Leu Thr Pro Leu Leu Ala Thr Leu Ala Glu Glu
65 70 75 80
Ser Ala Asn Glu Thr Gly Met Gly Asn Lys Glu Asp Lys Leu Leu Lys
85 90 95
Asn Lys Ala Ala Leu Asp Asn Thr Pro Gly Val Glu Asp Leu Thr Thr
100 105 110
Thr Ala Leu Thr Gly Asp Gly Gly Met Val Leu Phe Glu Tyr Ser Pro
115 120 125
Phe Gly Val Ile Gly Ser Val Ala Pro Ser Thr Asn Pro Thr Glu Thr
130 135 140
Ile Ile Asn Asn Ser Ile Ser Met Leu Ala Ala Gly Asn Ser Ile Tyr
145 150 155 160
Phe Ser Pro His Pro Gly Ala Lys Lys Val Ser Leu Lys Leu Ile Ser
165 170 175
Leu Ile Glu Glu Ile Ala Phe Arg Cys Cys Gly Ile Arg Asn Leu Val
180 185 190
Val Thr Val Ala Glu Pro Thr Phe Glu Ala Thr Gln Gln Met Met Ala
195 200 205
His Pro Arg Ile Ala Val Leu Ala Ile Thr Gly Gly Pro Gly Ile Val
210 215 220
Ala Met Gly Met Lys Ser Gly Lys Lys Val Ile Gly Ala Gly Ala Gly
225 230 235 240
Asn Pro Pro Cys Ile Val Asp Glu Thr Ala Asp Leu Val Lys Ala Ala
245 250 255
Glu Asp Ile Ile Asn Gly Ala Ser Phe Asp Tyr Asn Leu Pro Cys Ile
260 265 270
Ala Glu Lys Ser Leu Ile Val Val Glu Ser Val Ala Glu Arg Leu Val
275 280 285
Gln Gln Met Gln Thr Phe Gly Ala Leu Leu Leu Ser Pro Ala Asp Thr
290 295 300
Asp Lys Leu Arg Ala Val Cys Leu Pro Glu Gly Gln Ala Asn Lys Lys
305 310 315 320
Leu Val Gly Lys Ser Pro Ser Ala Met Leu Glu Ala Ala Gly Ile Ala
325 330 335
Val Pro Ala Lys Ala Pro Arg Leu Leu Ile Ala Leu Val Ser Ala Asp
340 345 350
Asp Pro Trp Val Thr Ser Glu Gln Leu Met Pro Met Leu Pro Val Val
355 360 365
Lys Val Ser Asp Phe Asp Ser Ala Leu Ala Leu Ala Leu Lys Val Glu
370 375 380
Glu Gly Leu His His Thr Ala Ile Met His Ser Gln Asn Val Ser Arg
385 390 395 400
Leu Asn Leu Ala Ala Arg Thr Leu Gln Thr Ser Ile Phe Val Lys Asn
405 410 415
Gly Pro Ser Tyr Ala Gly Ile Gly Val Gly Gly Glu Gly Phe Thr Thr
420 425 430
Phe Thr Ile Ala Thr Pro Thr Gly Glu Gly Thr Thr Ser Ala Arg Thr
435 440 445
Phe Ala Arg Ser Arg Arg Cys Val Leu Thr Asn Gly Phe Ser Ile Arg
450 455 460
<210> 21
<211> 1434
<212> DNA
<213> Lactobacillus reuteri DSM 20016
<400> 21
ttaataccag ttacgtactg agaatccttg tggtgagttc aaacgaaccc gacgagtaaa 60
tgttcgtgca gtacatgttc cttcaccagt tggcgtagca attgttaatg ctgaagcacc 120
tgagtgagct ccattatctg caacacctgt accaacatag gatgggccat taacaacaaa 180
gattgaacat tgcatccggt gagcagcatt attaatatgc ttaaggttat tggaatgaat 240
agtagctgtg tgatggttac ctttttcaac ttcaacagca gtcttcaaaa catcatcaaa 300
tgttggacaa gaaacaactg gtaaaattgg cattaacatt tcagtcatta ctaatggatg 360
ttccttagga agttcacaaa taatttcaac tgggtggcct gtgtaaggaa tattagcttg 420
gtctaagata taagttgcat ccttaccaac aaatttacga tcaggagcac cattttcttg 480
gatacacata tcagctaatt tatcggcttg ttcacggtta actacaaagg caccttcatc 540
ttgcatctta cgaattaatt catctttaat gctactttct gcaactactt ccttttcagc 600
agtacataaa atatcattat caaatgaagc agatgtaatg atattatgag cagctaaatc 660
aatatcagca gtagcatcaa ccattgcagg aggattacca ggaccagcac caaccgcttt 720
cttaccactg gtcattgctt ggtgaacaac agctgggcca ccagttactg ctaacattgc 780
aatgtcgggg tgcttcatca tttgttgaac tgattcaatt gttggtgttt caatacttac 840
aactaaatta tgaaggcctg ttgcatctgc aataaaatcg ttcatctttt caattgtcca 900
gcgagtaaca ttctttgcgc cagggtgagc accaaagtaa agagtattac caccggcaag 960
catcatgatc gcattagcaa ttacagtttc tgaagggttt gtacttgggc caaccgcacc 1020
aataacacca tatggtaacc gttcatacat aaccatccca ccgtcaccgt tttctacaac 1080
tggttcaaga atctcgggac caggagtgtt gtacaaggca ttgtttaact tagcaatttt 1140
ggcctctact gttcccattc ctgtttcttc tttgatatct ttagccattt tttcaatata 1200
tgggcggaat ccttccttaa tggcatcaat cacttgttgg cgaacagcaa ttggcttatc 1260
ccgatatatt tcttgagcag cttttgcagc agcaattgca tcattgacat tagtgaaaat 1320
tccgcgatga ccattatcag tagtagctgc aacgtttgca cttgaagagc tggcattatc 1380
tagttcttcg gcaagaattt tgcgtacagc actttcaata tcattaatct gcat 1434
<210> 22
<211> 1434
<212> DNA
<213> Lactobacillus brevis ATCC 367
<400> 22
ctaagcctcc caagtccgta atgagaaccc ttctggcgag ttaagccggc gccggcgcgt 60
gtaagtctta gacgttgccg ttccttcacc cgttggtgtg gcaatcgtta aggctgaagc 120
cccactatgc gcaccattcg ttgcaacacc agtcccacaa taagttgggc cgttaaccac 180
gaagattgac gtattcaacc ggtgcgctgc cttattgatg tgtggtaaat tctcggaatg 240
aatggaagcc gtgtggtgta acccaccttc aacttctgta gccgttgcta aaacgctatc 300
aaagtcaggg caacaaacga ctggcaaaat tggcatcaac atttctgtcg ttactaacgg 360
atgatcctta gctgcttcaa gaataatcag tgttggtgtc ccggtgtaag agattcctgc 420
ttgatccaaa atgtaagtgg catctttacc aacaaacttc cgatctggtg cgcccttagg 480
cccaatggtc atttgcgcta atttttcaat atcggcagaa tcggtaacca agaaggcccc 540
ttcttgttgc atccgcatga tgagttcatc cttgacagcg gcttcaacga caacttcctt 600
ttcggccgtg cagagaatgt tattatcaaa ggctgctgaa tcaacaatgt tgtgggctgc 660
taaagcaata ttggcagttg catccaccat tgccggtggg ttaccagcac cggcaccaac 720
cgccttctta ccactgataa gcgcttggtg gacaacagca ggccctccag tgattgacag 780
catggcaacg tcaggatgtt gcataacttc ttgcacggat tcaattgaag gcgtttccag 840
tgaaacgact aagttatgta acccagttgc atcagctacc aattcgttta atttttcgat 900
cgtccaacgg gtaatgttct tagcacctgg atgggcacca aagaacaacg tattcccacc 960
agccaacatc atgatggcat tggcaatcac cgtttcagag gggttggtac taggaccaac 1020
ggcaccaatg acaccaaatg gcgcgtattc atacataacc agtccaccgt caccggtttc 1080
ggcttctggc tgcagaattt ctggaccggg tgtgttataa agggcgttat tgagtttggc 1140
aattttcgcg ctaaccgttc ccatgccagt ttcgtcatga atccgcttag ccatatcctc 1200
aatgtatggc cggaaaccct cacggatcgc atcaatcact ttgttccgaa ctgagattgg 1260
ttggtcggcg tagttttctt gcgccgcttt tgtggccgca atggcttcat tgaccgtctt 1320
aaagatccca tttttgccgg gaacggtcgt attggtggcc gttgatgact gaggattgct 1380
aagttcttca ctcaaaattt tacggatggc ttgttcaatg ttttctgtgt tcat 1434
<210> 23
<211> 1434
<212> DNA
<213> Pediococcus acidilactici
<400> 23
ttatgcctcc cacgaacgta acgaaaatgc ttctggcgaa tttaaccgac gacggcgagt 60
gaaggtctta gctgttgcgg ttccttctcc agttggggtt gcaatcgtta atgctgaagc 120
acctgcatga gcaccgttag ctcctacccc ggttcccaca tacgttgcac cgttaactac 180
gaaaatggaa gtgttcattc ggtgtgccgc acggttaata tttggtaagt tttcagaatg 240
aattgaagcc gtgtgatgca atccttgttc aacttctacc gcagtcgcta aaacttgatc 300
aaacgttggg caagaaacta ctggtacaat tggcatcaac atttcggtag taaccaatgg 360
atgatccttt tgggcttcca aaataattaa ttttggtgtg ccagtgtatg caattcctgc 420
cttatctaaa atgtaggttg catctttacc gacgaattgg cggtcaggtg caccattttt 480
gccaatggtc atttctgcta acttatcgat atcactagcg ttagttacca aaaatgctcc 540
ttcttcttgc atctttttaa taagttcatc cttcacgcta gattcaacta ctacttcttt 600
ttctgcggta caaaggatgt tgttatcaaa agatgcggaa tctacaatgt tgtgtgcagc 660
taacgcaacg ttagcagtgg catcaacaat tgcaggaggg ttacctgcac cagcacccac 720
ggcttttttg ccactgacta atgcttggtg aaccactgct gggccgcccg tgatggacag 780
cattgcaatg tcaggatgtt gcatcatttc ttgaacagat tcaattgacg gttcttcaat 840
tgatacaact aaattcttca tcccggtagc ttcataaact aactcgttta acttttcaat 900
tgtccaacgg gtaatttttt tagcacctgg atgggcgccg aagtacaacg tatttccacc 960
agctaacatc ataatggcgt tagcaattac cgtttcagaa gggttagtac ttggaccaac 1020
tgccccaatt actccaaacg gagcgtattc gtacatcact aaaccaccgt cgccagtttc 1080
agcttcaggt tgcaaaattt cagttccagg agtgttatat aaagcattgt ttagcttagc 1140
aattttagct tcaaccgttc ccattcctgt ttcatcttta atgtccttag ccattttttc 1200
gatgtatgga cggaatcctt cacgaattgc tgttaaaacc ttttcccgaa aggctaatgt 1260
tttatcaatg taaacatctt cagctgcctt agctgccgcg attgcttcat ccacggtctt 1320
aaaaattccg ttttggcccg catcactagt tgacgcagtt tggctagtgc cacttttctt 1380
tagttcttca cttaaaatgc gtcgaatatc ttcttcaaga ttttgaattt ccat 1434
<210> 24
<211> 1431
<212> DNA
<213> Pediococcus claussenii ATCC BAA-344
<400> 24
ttatgcctcc catgaacgaa gtgagaatgc ctcaggagaa tttagacgac gacgtcttgt 60
aaatgtttta gcagttgctg taccttcacc agtcggtgtg gcaattgtca aagctgatgc 120
tccagcatgc gcgccgtttg cacctactcc tgtaccaaca tacgttgcac cattaacaac 180
aaaaattgat gtattcatac ggtgtgctgc ttgattaata tgtggtaagt tctctgaatg 240
aattgaagct gtatgatgta aaccttgttc gacctcaaca gctgttgcta atgcacggtc 300
aaaggttggg caggaaacaa ctggtaagat tggcatcaac atttccgttg ttactaatgg 360
atgattctta tgagcctcaa gaataattag tttgggtgtt ccagtataat taattcctgc 420
actatcaaga ataaatgttg cgtctttacc aacaaattta cggtccggag caccattctt 480
accaatcgta atttgaacta ttttttcaat atcatgggca cttgtaacca agaatgcacc 540
ctcgttttgc attttagaaa ttagttcatc tttaacgctt tcctcaacaa caacttcttt 600
ttctgctgta cacaaaatat tgttatcaaa agatgatgaa tccacaatat tatgtgcggc 660
taaatcgata ttagcagttg catcaacaat tgcaggtggg ttaccagctc cagcaccaac 720
cgccttctta ccactaatca tagcttgatg aacaacacca gggcctccag taattgaaag 780
catcgcaata tcaggatgtt gcatcatctg ttgaactgat tcaattgaag gttcttcgat 840
tgaaacaacc atgtttctca taccagttgc cttaaaaaca aattcattta acttttcaat 900
tgtccatcgt gtgatttttt tagcaccagg atgcgcacca aagtacacag tgttaccacc 960
agcaagcatc attaaagcat ttgcaataac tgtttcagat gggtttgtac ttggtcctac 1020
agccccaatc actccaaatg gtgcatattc atataatact aatccgccat cacccgtctc 1080
agcttctggt tcgagtattt ctgttccagg agtattgtat agagcattgt tcaatttagc 1140
gatctttgct tcaacagtac ccattcctgt ttcttctttg atatctttag ccatttgttc 1200
aatataaggc cggaaacctt ctttaatggc atctaatact tggtttcgaa aagccaatga 1260
tttgtcaaca tatatttctt gcgcagcttt agcggcagca atcgcttcgt caactgtttt 1320
gaagattccg ttcgttccat tatcacttga agcgcttaca gaattatcac tatcttgtag 1380
ttcctcactt aaaattcggc ggatatcttg ttctaactta tccatttcca t 1431
<210> 25
<211> 1446
<212> DNA
<213> Lactobacillus collinoides
<400> 25
atggcagatc aaaatattga agcagaaatc agacgaattt tacaagaaga attaagcggt 60
aacgcttcgt ccagcgctgc tggtacgact accagtcaac ctgatgggtt aggcaaccgg 120
atcttcacca acgtgaacga tgccattgct gctgctaagc aagctcaggc aatctaccaa 180
gataaaccac ttgccttccg taaaaaagtc gttcaagcaa ttaaagatgg tttcggccca 240
tacattgaat atatggcaaa gcagacccgt gaagaaactg gcatgggaac tgccgaagct 300
aagattgcta agttaaagaa cgccctctac aacaccccag gcgttgaatt actggaccca 360
gaagttgaaa ctggtgacgg cgggatggtc atgtatgaat acacgccatt cggtgttatc 420
ggtgccgttg gaccaagtac aaacccttgt gaaacggttc tgaacaactc catcatgatg 480
atgtctgctg ggaacgcatt gttctttggc gcccatcctg gtgcaaagaa cattactcgc 540
tgggcagttg aaaaattgaa cgaattcgtt tacaaggcta ctgggttgaa gaacctctta 600
gtttccttgg acacaccatc aattgaatcc gttcaagaaa tgatgcaaca tccagatgtt 660
gcaatgctgg ctgtaactgg tggcccagct gttgtgcatc aagcattaac gagtggtaaa 720
aaagccgttg gtgccggtgc tggtaacccg cctgcaatgg ttgatgcaac tgctgatatt 780
gatttagcag ctcataacct atttacttca gctaagtttg acaatgaaat tctgtgtact 840
tcagaaaagg aaatcattgc tgaagattca attaaggatg aacttcttca aaagattgtt 900
gctaagggcg cttgcctagt aactgatcct aaagacatca agcatttagc tgacatgacc 960
attggggaca acggtgcccc tgaccggaaa tatgttggta aggatgccac tgttatctta 1020
gatgccgctg gtatttcata caccggcgat cctaagttga tcatgatgga tgttgataaa 1080
gacaacccat tggttaagac agaaatgttg atgccaatct tgcctatcgt tgggtgccca 1140
gactttgacg ccgttttggc tacggctatt gaagttgaag gtggcaatca ccatactgct 1200
tcaattcact cgaacaacat cctgcacatc aacaaggctg ctcaccggat gaacacctcg 1260
atcttcgtcg caaatggccc aacatttgcc gcaactggtg tcggtgataa cggttattac 1320
agtggtgctg ctgcgctgac aattgctacc ccaaccggtg aaggtactac cactactaag 1380
acctttaccc gtcgtcgtcg tttcaactgt ccacaagggt tctcacttcg ttcttgggag 1440
gtttaa 1446
<210> 26
<211> 1410
<212> DNA
<213> Listeria welshimeri serovar 6b str. SLCC5334
<400> 26
atggaatcat tagaactcga acaactggtg aaaaaagttc tgttagaaaa attagctgaa 60
caaaaagatg taccagtaaa aacaactaca caaggcgcaa aaagtgggat ttttgataca 120
gtggatgagg cagttcaagc agctgtccaa gcacaaaata gttataaaga aaaatctctg 180
gaagaacgcc gcaatgtagt aaaagcaatt cgtgaagcac tttatccaga aattgagtca 240
attgccacaa gagcagttgc tgaaacagga atgggtaatg tgacagataa aattttgaaa 300
aatactttag cgattgaaaa aacgccgggc gtagaagatt tatatacaga agtagctact 360
ggtgataatg gcatgacgct ttatgaatta tctccgtatg gtgtaattgg tgctgtggcg 420
ccgagtacga atccaaccga aacgttaatt tgtaatacaa tcggtatgct tgcagctggg 480
aatgcagtgt tttatagccc acatcctggt gcaaaaaata tatctctttg gttgattgaa 540
aagttgaata cgattgttcg tgaaagttgt ggtattgata acttggttgt gacagtggaa 600
aaaccttcca ttcaagcagc gcaagaaatg atgaatcatc caaaagtacc attacttgtc 660
attacaggtg gaccaggcgt cgtgcttcaa gcaatgcaat caggtaaaaa agtcattgga 720
gctggtgccg gaaatccgcc ttccatcgta gacgaaacag ctaatatcga aaaagctgca 780
gccgatattg ttgacggagc ctcttttgac cacaatatct tatgtattgc tgaaaaaagc 840
gttgttgccg ttgatagcat tactgatttc ctattattcc aaatggaaaa aaatggagca 900
ctacatgtga ccaatccgag cgatattaaa aaattagaaa aagttgctgt aacggataaa 960
ggtgtaacga ataaaaaatt agtcggaaaa agcgcttctg aaattttaaa agaagctgga 1020
ataacttgtg attttacccc gcgattaatc attgtggaaa cagataaatc acatccattt 1080
gcaacagtag aattactaat gccaatcgtt ccagtggtaa gagtgcctga ttttgatgaa 1140
gcgcttaaag tagctattga attagaacaa ggactacatc atacagcaac aatgcattca 1200
caaaatattt ccagattaaa taaagctgca agagatatgc aaacatcgat ctttgtgaaa 1260
aatggtcctt cctttgcagg tttaggtttt agaggggaag gtagtactac atttactatt 1320
gcaaccccaa ctggagaagg aaccactaca gcacgtcatt ttgctagacg ccgccgttgt 1380
gttttaacag atggtttttc gattcgttaa 1410
<210> 27
<211> 1410
<212> DNA
<213> Listeria innocua Clip11262
<400> 27
atggaatcat tagaactcga acaactggta aaaaaagttc tcttagaaaa attagcagaa 60
caaaaagaag taccaacaaa aacaactaca caaggcgcga aaagtggcgt ttttgataca 120
gttgacgagg ctgttcaagc agcagttata gcgcagaatt gctataaaga aaaatcactt 180
gaagaacgcc gcaatgttgt aaaagcaatt cgtgaagcac tttatccaga aattgaaaca 240
attgcgacaa gagcagttgc agagactggt atgggaaatg tgacagataa aattttgaaa 300
aacacgttag caatcgaaaa aacgccaggg gtagaagatt tatatacaga agtagctaca 360
ggtgataacg gtatgacact atatgaactc tctccgtatg gcgtaattgg tgcagtagcg 420
ccgagcacaa acccaacgga aacattgatt tgtaattcaa tcggtatgct cgcagctgga 480
aatgccgttt tttatagccc tcatccaggg gcaaaaaaca tttcactgtg gttgattgaa 540
aaactaaaca caattgttcg cgatagttgt ggtatagata atctaattgt caccgtggct 600
aaaccatcca tccaagcagc tcaagaaatg atgaaccatc caaaagtacc gctacttgtt 660
attacaggtg gtccgggcgt tgttctccaa gcgatgcaat caggtaaaaa agtgattgga 720
gcaggagcag ggaacccgcc ttctattgtt gacgaaacag ctaatatcga aaaagcggct 780
gctgacatcg tagacggagc atcttttgac cataatattt tatgtattgc tgaaaaaagt 840
gtggtagctg ttgatagcat tgctgatttc ttgttattcc aaatggaaaa aaatggtgcc 900
cttcatgtta ctaatccaag tgatattcaa aaattagaaa aagtagccgt taccgataaa 960
ggtgtaacta ataaaaaatt agtcggaaaa agtgcaactg aaatcttaaa agaagcagga 1020
atagcttgtg attttacacc acgtttaatc attgtggaaa cggagaaatc tcatccattt 1080
gcaacagtag agctattaat gccaatcgtt ccagttgtaa gggtgcctga ttttgacgaa 1140
gcccttgaag tggctattga actcgaacaa ggcttacatc atacagcaac aatgcattca 1200
caaaatatct cgagattaaa caaagctgca agagatatgc aaacttccat ctttgtcaaa 1260
aatggtccgt cctttgcggg attaggcttt agaggagaag gtagtactac tttcactatt 1320
gcaacgccta ctggagaagg aacaactaca gcacgtcatt ttgctagacg ccgccgctgt 1380
gttttaacag atggtttttc gattcgttaa 1410
<210> 28
<211> 1410
<212> DNA
<213> Listeria monocytogenes ATCC 19117
<400> 28
atggaatcat tagaactcga acaactggta aaaaaagttc ttttagaaaa attagcagaa 60
caaaaagatg caccagtaaa aacaacggtc aaaggcgcga aaagtggggt ttttgataca 120
gttgacgagg ccgttcaagc agcagttata gcacaaaata actataaaga aaaatcatta 180
gaagaacgcc gcaacgttgt gaaagcaatt cgcgaagcac tttatccaga aattgaatcc 240
attgcagcgc gagcagttgc tgaaacaggt atgggaaatg tagcagataa aattttgaaa 300
aacacgttag cgattgaaaa aacgccaggt gtggaagatt tgtatacaga agttgctact 360
ggtgataatg gcatgacgct ttacgaactt tctccatatg gcgtaatcgg agctgttgca 420
ccaagcacga acccaacgga aaccttgatt tgcaatacaa tcggcatgct cgcagctggg 480
aatgcagtat tttatagccc gcatccaggt gcgaaaaata tttctctttg gttgattgaa 540
aagttgaata cgattgtccg tgaaagttgc ggcattgata atttagttgt tacagtcgaa 600
aaaccatcta ttcaagccgc gcaagaaatg atgaatcatc cgaaagtacc gctccttgtt 660
attacaggtg gccctggtgt agttcttcaa gccatgcaat ccggtaaaaa agttattggc 720
gcaggtgccg ggaatccgcc atctattgta gatgagacag caaacatcga aaaagcagct 780
gctgatatcg tagacggcgc atcttttgac cataatattc tatgtattgc ggagaaaagt 840
attgttgcag ttgatagcat cgcagatttc ttaatgttcc aaatggaaaa aaatggtgca 900
ctacatgtga ccaatccaag cgatattcaa aaactagaaa aagtagctgt cacagataaa 960
ggcgtaacaa acaaaaaact agtcggaaaa agtgcttcag aaattttaaa agaagcgggg 1020
attgtttgtg atttttcacc acgtttaatt attgtggaaa cagaaaaaac acatccgttt 1080
gcaactgtag aattattgat gccgattgtt cctgttgtaa gagttcctaa ttttgacgaa 1140
gcgcttgatg tcgctattga gttagagcaa ggcttgcatc acacagctac gatgcattca 1200
caaaatattt ctagattaaa caaagctgca cgagatatgc aaacatccat ctttgtcaaa 1260
aatggtcctt catttgcggg attaggcttt agaggagaag gtagcactac tttcactatt 1320
gcaacgccta ccggagaagg aaccactaca gcgcgccatt ttgctagacg tcgccgttgt 1380
gttttaacag atggtttttc gattcgttaa 1410
<210> 29
<211> 1410
<212> DNA
<213> Listeria marthii FSL S4-120
<400> 29
atggaatcat tagaactcga acaactggtg aaaaaagttc ttttagaaaa attagcagaa 60
caaaaagaag caccagcaaa accaataaca caaggtgcga aaagtggtat ttttgatacc 120
gtcgatgaag ccgttcaagc agcagtaata gcgcaaaatt gttataaaga aaaatcacta 180
gaagaacgcc gcaatgttgt gaaagcaatt cgcgaaactc tttatccaga aattgaaaca 240
atcgcgacga aagcagtagc agaaacagga atgggtaatg tagcagataa aattttgaaa 300
aacactttag cgattgaaaa aactccaggg gtagaagatt tatatacaga agtagctact 360
ggcgataatg gtatgacact ttatgaacta tctccgtatg gcgttattgg tgcagttgcg 420
ccgagcacga atccgactga aacattgatt tgtaatacga tcggcatgct cgctgcggga 480
aatgcagtat tttacagtcc gcatccaggg gcaaaaaata tttctctatg gttgattgaa 540
aaactaaata caattgttcg cgaaagttgc ggaattgata atttggtcgt tacagtcgaa 600
aaaccatcta ttcaagctgc acaagaaatg atgaatcatc cgaaagtacc gttacttgtg 660
attacaggtg gcccaggcgt agttctgcaa gcgatgcaat ccggtaagaa agtgattggt 720
gctggagccg gaaatccgcc gtcaatcgta gacgaaacag ctaatattga aaaagctgcg 780
gctgatatcg tggacggagc atcttttgac cataatattt tatgtatcgc ggaaaaaagt 840
attgtggcag tagagagcat tgctgatttc ttattattcc aaatggaaaa aaatggtgca 900
ctgcatgtga ccaatccaag tgatattcaa aaattagaaa aagtggcagt aacagataaa 960
ggcgtgacca ataaaaaatt agttgggaaa agtgccgcag aaattttaaa agaagctggc 1020
ataacttgtg actttacccc gcgtttaatc attgtagaaa cgacaaaaac gcatccattt 1080
gcaacagtgg aactattaat gccaatcgtt ccgcttgtaa gagtgcctga ttttgacgaa 1140
gcacttgaag tagcaattga gttagagcaa ggattacatc atactgcaac gatgcattca 1200
caaaatattt ccagattaaa caaagcggca agagacatgc aaacatccat ctttgtaaaa 1260
aatgggcctt catttgcagg attaggtttc agaggtgaag gtagcactac gtttaccatt 1320
gcaacgccta ccggagaagg aaccactaca gcacgtcatt ttgctagacg ccgccgttgt 1380
gttttaactg atggtttttc gattcgttaa 1410
<210> 30
<211> 1410
<212> DNA
<213> Listeria ivanovii subsp. ivanovii PAM 55
<400> 30
atggaatcat tagaactcga acaactggtg aaaaaagttc tcttagaaaa attagcagga 60
caaaacgaag aaacaccaaa aaaaccaagc caaggtgcca aaagtggcat ttttgacaca 120
gtggatgagg cagttcaagc agcagtaatt gcgcaaaact gctacaaaga aaagtcgcta 180
gaagaccgca gaaatgtagt aaaagcaatt cgcgaagcac tttatccgga aatcgaaaat 240
attgcgacac gtgcggctgc tgaaacaggt atgggtaatg tagccgataa aattttgaaa 300
aatacgttag caattgaaaa aacaccagga gtagaagatc tctatacaga agtagctact 360
ggcgataatg gtatgacgct ttatgaactt tctccttatg gtgttattgg tgctgttgct 420
ccaagtacga atccaacaga aacattaatt tgcaacacaa ttggaatgct tgcagctgga 480
aatgcagttt tttatagccc gcatccaggt gcaaaaaata tttcgctttg gttgattgaa 540
aaactaaata cgattgttcg tgaaagctgc ggaatcgata acctagtcgt tacagtagaa 600
aaaccatcta ttcaagcagc acaagaaatg atgaatcatc caaaagttcc gttactagtt 660
atcactggcg gccctggcgt tgttcttcaa gcgatgcaat ccggtaagaa agtaatcgga 720
gcaggcgctg gaaatccacc gtctatcgta gacgaaacag cgaatatcga aaaagcagct 780
gcagatatcg ttgcgggcgc atcttttgat cataatattt tatgtatcgc agaaaaaagc 840
gtagtagcag tggacagcat tactgatttt ctattattcc aaatggaaaa aaatggcgcc 900
tttcatgtta cgaatccaag cgatattcgc aaactggaaa aagtggcggt taccgaaaaa 960
ggcgttacca acaagaagtt agttggtaaa agcgcttcgg aaattttaaa agaagcaggg 1020
atagcatgtg attttacccc tcgattaatt attgctgaaa cagatagatc ccatccattt 1080
gcaacggtag aactgctaat gccaattgtt ccagttgtca gagtggctga ttttgatcaa 1140
gcacttgaag tagcacttga gttagaacaa ggcttgcatc atacggcaac aatgcattcg 1200
caaaatattt ctagactgaa caaagcagca agagatatgc aaacttctat ttttgtgaaa 1260
aatggaccat cgtttgctgg acttggcttt ggaggagaag gtagtgcgac tttcactatc 1320
gctaccccaa caggtgaagg aactactaca gcgcgacact ttgctagacg ccgtcgttgt 1380
gttttaacag atggtttttc gattcgttaa 1410
<210> 31
<211> 1410
<212> DNA
<213> Listeria seeligeri serovar 1/2b str. SLCC3954
<400> 31
atggaatcat tagaactcga acaactggtg aaaaaagttc tcttagaaaa attagcagga 60
caaaacgaag aaacaccaaa aaaaccaagc caaggtgcca aaagtggcat tttcgataca 120
gtggatgagg cagttcaagc agcagtaatt gcgcaaaact gctacaaaga gaagtcacta 180
gaagaccgca gaaatgttgt aaaagcaatt cgtgaagcac tttatccgga aatcaaaaat 240
attgcgacac gtgcggttgc tgaaacaggt atgggtaacg tagccgataa aattttgaaa 300
aatacgttag caattgaaaa aacaccagga gtagaagatc tctatacaga agtagctaca 360
ggcgataatg gtatgacgct ttatgaactt tctccttatg gtgttattgg tgctgttgct 420
ccaagtacga atccaacaga aacattaatt tgcaacacaa ttggaatgct tgcagctgga 480
aatgcagttt tttatagccc gcatccaggt gcaaaaaata tttcgctttg gttgattgaa 540
aaactaaata cgattgttcg cgaaagctgc gggattgata acctagtcgt tacagttgaa 600
aaaccatcta ttcaagcagc gcaagaaatg atgaatcatc caaaagtacc gttactagtt 660
atcactggcg gtcctggtgt tgttcttcaa gcgatgcaat ctggtaagaa agtaatcgga 720
gcaggtgcgg gaaatccacc ttctatcgta gacgaaacag cgaatatcga aaaagcagct 780
gctgatatcg ttgcgggtgc atcttttgat cataatattt tatgtatcgc agaaaaaagc 840
gtagtagcag tggatagcat cactgatttt ctcttattcc aaatggaaaa aaatggtgcg 900
ttgcatgtta cgaatccaag cgatattcgc aaactggaaa aagtggcagt taccgaaaaa 960
ggcgttacca ataagaagtt agttggtaaa agcgcttcgg aaattttaaa agaagcaggg 1020
atagcatgtg attttacccc tcgattaatt attgttgaaa cagatagatc ccatccattt 1080
gcaacggtag aacttttaat gccgattgtt ccagtggtac gagttgctga ttttgatcaa 1140
gcacttgaag tagcacttga gttagaacaa ggcttacatc acacggcaac aatgcattca 1200
caaaatatct ctagactgaa caaagcagca cgagatatgc aaacatccat tttcgtgaaa 1260
aatggaccat cgtttgctgg acttggcttt ggaggagaag gtagtgcaac tttcactatc 1320
gctaccccaa caggtgaagg aactactact gcgcgacact ttgctagacg ccgtcgttgt 1380
gttttaacag atggtttttc gattcgttaa 1410
<210> 32
<211> 1395
<212> DNA
<213> Shewanella putrefaciens CN-32
<400> 32
ttagcgaatg gaaaatccat tggtcagcac acaacggcgt ttacgggcga agctccgtgc 60
tgatgttgtc ccttcgccag tgggtgtcgc aatagtaaag gtggtaaaac cctcggcacc 120
tatgcccagt ccggcatagg aagggccatt tttcacaaat attgaagtct gcatggtctt 180
cgcagccagg ttcagacggg taacattctg agaatgcatt atggcggtat ggtgctgctc 240
gttttccacc ttcagagcca gtgccagtcc tgtctcgaaa tcactgaccc gtacaacagg 300
cagaactggc attaattgtt cgaccatcac cagcggatcg tcctgctcca cttctacaat 360
aatcagtcgc ggtgctgttg atgtgttcag atcagcagcc tgcaggatca ctgccggact 420
tttacctacc agtttcttgt ttgcttctcc tttgtcattg ataacgacct tgcgcagtct 480
ggcaatatca ccgggcgttt ttaccagaaa ggcatcgttt ttctgcatat tatccatgag 540
gcgatcggcg atgctatcta ctacgatgac gcatttttcg gcgatacaaa ggacgttatg 600
atcgaaagag gcaccatcaa cgatatcttt ggcagccttg accggacagg cagtctcatc 660
caccagtaca ggggggttac cagggccagc accgataact tttttaccgg ttttcatggc 720
catattgaca attgccggac cacctgtaac cgcaagcaga gcaatgcgcg gatctgacat 780
catttcgcga gtagcgtcaa aggtgggctc tgctacggtg gtaaccagat tgcggatccc 840
gcttacccga taaatgatgt cttcgatttt ttcaataagc cacaaggata ccttttttgc 900
gcccggatgg gggctgaagt agacagcatt accagcggcc agcatactga tggtgttgtt 960
aatgatagtt tcggtcggat tggtgctggg ggcgatagcg ccaatgactc cgaagggaga 1020
tagttcgaat aaaaccatgc caccatcacc agtcagggcg cttgtggtca aatcctcaat 1080
ccccggagta ttattcagtg ccgcagtgtt tttactgatt ttgtcgggtg cattacccat 1140
gccggtttct tctgcggcgc gttcggacat ctctttgatc cagggtgcca gctcttcctt 1200
cagggctgta atgatcctgg ttcgaagtgc cagaggttct gccatgtatt ttttataggc 1260
atcgtagctg gcagtgatag catcttccac acgagcgaag atagtgtgct gaatgtttcc 1320
aggggcagta gcggtccctt tcagattatc agcaagaata ttgcggatca tattctccag 1380
ttcagtggta ttcat 1395
<210> 33
<211> 1395
<212> DNA
<213> Kosakonia radicincitans DSM 16656
<400> 33
ttagcggatg gaaaatccat tagtcagtac acaacggcgt ttacgggcga agctccgtgc 60
agatgttgtc ccctcgccag tgggagttgc aatagtaaag gtggtaaaac cctcagcctc 120
aatgcccagc ccagcataag aagggccatt tttcacaaat attgaagttt gcatggtctt 180
cgcggccagg ttcagacggg aaacattctg cgaatgcatt atggctgtat ggtgctgatc 240
gttttccact ttcagcgcca gtgccagccc tgtctcaaaa tccctgaccc gcacaacagg 300
cagaaccggc atcaattgtt cgaccatcac cagcggatcg tcctgctcca cttccacaat 360
aatcagtcgc ggtgccgttg atgtgttcag atcagcagcc tgcaggatca ctgccgggct 420
tttgcctacc agtttcttgt ttgcttctcc tttgtcattg ataacgacct ggcgcagtct 480
ggcaatatca ccgggtgttt ttaccagaaa ggcatcgttt ttctgcatat tttctacgag 540
gcgatcggcg atgctatcga ccacgatgac gcatttttcg gcgatgcaaa ggacgttatg 600
atcgaaagag gcaccatcaa cgatatcttt ggcagctttg accggacagg cagtttcatc 660
caccagtaca ggggggttac ctgggccggc accgataact tttttaccgg ttttcatggc 720
catattgaca attgccgggc cgcctgtaac cacaagcaga gcaatgcgcg gatctgacat 780
catttcgcga gtggcgtcaa aggtgggctc tgcgacggtg gtaaccagat tccggatccc 840
gcttacccga taaatgatgt cttcgatttt ttcaataagc cacaaagata ccttctttgc 900
gccaggatgg gggctgaagt agacagcatt accagcggcc agcatactga tggtgttgtt 960
aatgatagtt tcggtcggat tggtgctggg ggcgatagcg ccaatgactc caaaggggga 1020
aagttcgaac aaaaccatgc caccatcgcc agtcagggcg cttgtggtca aatcctcaat 1080
ccctggagta ttattcagtg ccgcagtatt tttactgatt ttgtcgagtg cattacccat 1140
gccggtttct tctgcggcgc gttcggacat ctctttgatc cagggtgcca gctcttcctt 1200
cagggcagta ataatcctgg ttcgaagtgc cagaggttct gccaagtatt ttttataggc 1260
atcgtagctg gcagtgatag catcttccac acgagcgaag atagtgtgct gaatatttcc 1320
aggggcagta gcgatccctg tcagattatc agcaagaata gtgcggatca tattctccag 1380
ttcagtggta ttcat 1395
<210> 34
<211> 1404
<212> DNA
<213> Tolumonas auensis DSM 9187
<400> 34
atgaataaca ctgagttaga aagcttaatc cgcactattc tgactgaaca gctcacgcct 60
tccgctacgg acacgcctgc atgtaccgct tcgtctgttg cactgtttga tgatgtggac 120
agtgccatct gtgcagcgca tgccgccttc ctgcgttatc aggaagcacc gttaaaaacc 180
cgcagtgcca ttattgccgc cattcgtgct gagattgcgc cctgcctgtc tgaactggca 240
gaacgtgctg ccgcagaaac cggtatgggc aacaccgccg acaagatcct gaaaaacaaa 300
gcggcactgg aaaatactcc cggtatcgaa gatttgaaaa caactgctct gaccggtgat 360
gaaggtatgg tgttgtttga atactctccg tttggggtag ttggtgccgt ggcgccaagc 420
acaaatccga ccgaaaccat tatcaataac agcatcagta tgctggccgc cggaaatgcg 480
atctatttca gcccgcatcc cggtgcaaaa aatatctctt tgtggttaat ccagaaaatg 540
gaagagatcg ccttcaaagt ctgcggtatc cacaatctga tcgtgacggt caaagagccg 600
acttttgaag ccacccagca aatgatggca catgacaaaa tcgcgttgtt agccatcacc 660
ggtggccccg gtatcgtgaa tatggggctg aaaagcggga aaaaagtgat tggtgccggc 720
gccggtaatc cgccttgtct ggtggatgaa accgcagaga tcgtcaaagc cgcacaagac 780
atcgtcgcgg gagcctcttt tgactacaac ctgccctgca tcgcagaaaa aagcgtgatt 840
gccgttgatt gcatcgccga tcaactgatt cagcaaatgc gcgaattcgg cgccatgcag 900
atcacggatc ctcaacaaat cgcgcagtta cgcgaagtct gcattcagaa aggtgcggct 960
aataagagcc tggtcggcaa aagcccggca acgattctgg cagccgcagg tattccctgc 1020
ccggccaaag aaccgcgact gatcattctg gaagtcccgg ccaatgaccc gtttgttgtt 1080
accgaacaac tgatgccggt gctgccgatt gttcgcgttg ataactttga acaaggcctg 1140
cagctggcac tgaaagtgga agatggcctg caccatacgg ccatgatgca ttcacagaat 1200
gtttcccgcc tgaacaaggc tgcacatctg atgcaaactt caattttcgt gaaaaacggc 1260
ccttcctacg caggaattgg tgtgggagca gaaggattca ccaccttcac cattgccacc 1320
ccgaccggcg aaggcaccac atcagcccgc acgttcggtc gcttacgccg ctgtgtactg 1380
accaatggct tttcaattcg ctaa 1404
<210> 35
<211> 1386
<212> DNA
<213> Citrobacter koseri ATCC BAA-895
<400> 35
ttagcgaatt gaaaagccgt tggtcagtac gcagcgacgg gaacgggcaa atgtgcgtgc 60
tgaggttgtc ccttcgccgg tcggggtggc gatggtaaag gtggtaaacc cttcgccgcc 120
aacgccgata ccggcatagg aagggccgtt tttcacaaaa atagaggtct gtaaggtgcg 180
cgccgccaga ttcaggcgag agacattctg cgagtgcata atggcggtat ggtgcaggcc 240
ttcttcaact ttcagcgcca gtgccagcgc gctgtcgaaa ttatcgacct tcacgacggg 300
tagcattggc atcaactgtt cgctggttac ccacggatcg tcggcactga cgataccgat 360
aagcaaacgc ggcggttttg cgggaacggc aataccggcg gcttccagca tggcggcagg 420
gcttttcccg accagttttt tattcgcgtg accttccggg aggcagacgg cgcgtaattt 480
atcgatatcc gcagcgttta acagcaatgc gccgaaagcc tgcatttgct gaaccaggcg 540
ctcagcgacg ctctctacga caatcagact tttttctgcg atacagggca ggttgtaatc 600
gaacgctgcg ccgttgatga tatcttcggc cgctttgacg atatcggcgg tctcatcaac 660
gatgcagggc ggattgcccg ccccggcgcc aatgaccttt ttaccgcttt tcattcccat 720
cgcaacaatg ccggggccgc cagtaatagc cagcaccgca atattgggat gcgccatcat 780
ttgctgagtt gcctcaaagg tcggttctgc gacagtgacg accagattac ggatgccgca 840
gcagcggaag gcgatatctt cgatcatgcc gatcaatttg agtgagacgt tcttcgcgcc 900
aggatgcggg ctgaaataaa cgctgttgcc cgcagccagc atactgatgc tgttgttaat 960
aatggtttcg gtagggttgg tgctgggcgc gacggaacca atgacgccga acggtgaata 1020
ttcaaacagc accatgccgc catcgccggt gagggccgtg gttgtcaaat cctcaatgcc 1080
tggcgtgtta tccagcgcgg ctttgttttt aagaaattta tcttctttgt ttcccatccc 1140
cgtttccgct gcgctctccg ccgccagcgt ggcaagatgc ggcgtaagct cctggcgcag 1200
ggcgctgata atggcgctgc gcgttttgag cggacactgc tgataacgta agaaagcctg 1260
gtgcgccgcg tctatcgctt cgccgacgga ctgaaaaata ccgtgtcctt gcgtttcggc 1320
ctgcgcaggc gccaactgtt cgcttaaaat attacggatg agggtttcca gttcagaagt 1380
attcat 1386
<210> 36
<211> 1389
<212> DNA
<213> Yersinia enterocolitica subsp. enterocolitica 8081
<400> 36
atgaatacca atgaccttga atcgctcatt cgcactatcc tcaccgagca actgacgccg 60
gtcacggccc ctgcctccag cgccattttt gccagcgtgg atgaagccat taatgctgct 120
cacagcgcgt ttttgcgcta tcagcaaagc ccgatgaaaa ctcgcagcgc cattatccgc 180
gctatccgtg agcaattaaa gccacaactt gtctctctgt ccgagcgcgg tgccagtgaa 240
accggcatgg gtaataaaga agataaattc ctgaaaaaca aagctgcact ggaaaacaca 300
ccgggtattg aagacttatc taccaccgcc ctgaccggtg atggcggcat ggtgttattc 360
gagtattcac ccttcggcgt tattggttca gtcaccccca gcactaaccc gaccgaaacc 420
attattaata acagcatcag tatgttggca gcgggtaatg cagtctattt cagcccccac 480
cctggtgcta aagccgtgtc actggatctc atcgcccaaa ttgaagagat cattttcaac 540
agttgcggca ttcgcaatct ggtggtgaca gtaaaagaac cgagtttcga agccacccaa 600
cagatgatgg cacacgacaa aattgcctta ctcgcgatta ctggtggccc ggccattgtg 660
gcgatgagca tgaaaagcgg caagaaagtg attggtgccg gtgcgggtaa cccaccttgt 720
ctggtggatg aaaccgccga gttagtcaaa gcggcgcagg atatcgtggc gggagcttca 780
tttgactaca acctgccgtg cattgcagag aaaagcctga tcgtggtgga aagtgttgcc 840
gaccgtttat tgcaacagat gcaggccttc gatgcattac tgataagcaa tccgcaagag 900
atcgacagct tacgcaaagc ctgcctgacg ccgcagggcc atgccaataa aaatctggtg 960
ggtaaaagtc caattgaact gctgaaagca gccggcatca cctgcccagc taaagccccg 1020
cgcctgttat tggtcgaagt agctggtgac gatccactgg tcaccaccga acaattgatg 1080
ccgctgttac cggtggtgcg ggtaaaggat tttgatgcgg ccctgacatt ggcactgcac 1140
gtcgagggcg gcctgcatca taccgcaacc atgcactcac aaaatgtctc gcgcttgaat 1200
ctggctgcac gtttgttgca aacctccatt tttgtcaaaa atggcccgtc ctatgctggg 1260
ataggggtcg gcggtgaagg ctttaccacc tttactattg ccaccccaac cggggagggt 1320
accacttcgg cgcgtacctt tgcgcgtcaa cgccgctgtg tactgactaa tggtttctct 1380
attcgctga 1389
<210> 37
<211> 1395
<212> DNA
<213> Salmonella enterica subsp. enterica serovar Mbandaka str. ATCC 51958
<400> 37
ttagcgaata gaaaagccgt tggtcagtac gcagcgccgg gagcgggcaa aagtacgcgc 60
tgacgtggtc ccttcaccgg ttggcgtggc aatagtaaag gtggtaaagc cttcgccgcc 120
gacgccgatc ccggcataag aggggccgtt tttgacgaat atcgaggttt gcagggtgcg 180
ggctgcgagg ttcaggcgcg acacgttctg cgagtgcata atggcggtat gatgcagccc 240
ctcttcaacc ttcagggcca gcgccagcgc gctatcgaaa tcgctgactt ttaccaccgg 300
cagcatcggc atcagctgtt cgctggtgac ccacggatcg tcagcgctaa ccagcgcaat 360
cagcagacgc ggcgcttttg cagggacagc gatcccggcg gcttccagca tggccgatgg 420
gctcttgccg accagttttt tattcgcctg gccttcaggc aggcagacgg cgcggagttt 480
gtcggtatcg gcagggctta gcagcagcgc gccgaaggtt tgcatttgct gcaccagacg 540
ctcggcgacg ctctccacta cgatcaggct cttctcggca atgcagggca ggttataatc 600
gaatgacgcg ccgttgatga tatcttctgc cgctttcacc aggtccgctg tttcatcgac 660
gatgcagggc gggttacccg cgccagcgcc aatcaccttc ttaccgctct tcatgcccat 720
tgccacaatg cccgggccac cggtaatagc cagtaccgcg attcgcgggt gggccatcat 780
ctgctgggtc gcttcgaagg tgggttcagc tacagtcacc accagattgc ggatgccgca 840
gcagcggaag gcaatctctt caatcaggct aatcagcttc agagagacct ttttcgctcc 900
cggatgcggg ctaaagtaga cgctgttgcc cgccgccagc atgctgatgc tgttgttgat 960
gatggtttcc gtcgggttgg tgcttggggc gaccgaaccg atgacgccaa acggcgagta 1020
ttcaaagagc accatgccgc cgtcgccggt cagcgcggtg gtggtgagat cttctacgcc 1080
cggcgtgttg tccagcgcag ccttgttttt gaggagttta tcttctttgt tgcccatccc 1140
tgtttcattg gcactctctt ccgccagggt cgccagcagc ggcgtcagct cctgacgcat 1200
cgcgctgata atggcgctgc gggtttttag cgggcactgc tgataacgta agaacgcctg 1260
gtgcgcggca tcgatggcct cgctcacgga ctggaaaatc cctttgccct gaggctgggc 1320
cgtagtttgc gccggcgttg ttaattgctc gctaagaatg gtgcgaatca gggtttcgag 1380
ttcagaagta ttcat 1395
<210> 38
<211> 1389
<212> DNA
<213> Yersinia mollaretii ATCC 43969
<400> 38
atgaacaccc atgatattga atctctcatt cgcactatcc tcaccgagca actgacgcct 60
gcgacggcct ctgccgtcag cgccattttt gccagcgtgg atgaagccgt gactgccgcc 120
cacagcgcct ttttgcgcta tcagcaaagc ccgatgaaaa cccgtagcgc cattatcagc 180
gccctgcgtg agcagttagc ccctcagttg gcgtcactct ctgagcgtgg tgccagcgaa 240
accggtatgg gcaacaaaga agataaattc ctgaaaaaca gggccgcgct ggagaatacc 300
cccggcatcg aagacctctc caccacggct ctgacgggcg acggcggtat ggtgctgttc 360
gaatattcgc cgttcggcgt gattggctct gtcgccccca gcactaaccc caccgaaacc 420
attatcaata acagcatcag catgttagcc gcgggtaatg cggtctattt tagcccgcac 480
cccggcgcta aagccgtctc actggatctg attgcccaaa ttgaagcgat cattttcaac 540
cgttgcggca tccgcaattt ggtggtgacg gtgcaagaac cgagctttga ggccacccaa 600
cagatgatgg cccacgacaa aatcgctcta ctggcgatca ccggtgggcc agccattgtg 660
gcgatgggca tgaagagcgg caaaaaagtg attggtgcgg gcgcgggtaa tccgccttgt 720
ctggtggatg agactgccga actggtgaaa gcggcgcaag atatcgtgtc cggcgcgtca 780
ttcgactaca acctgccctg cattgccgag aagagtttga ttgtggtgga gagtgtcgcc 840
gaccgcctgt tgcagcagat gcaagctttc gacgcgctgc tgatcactca gccgcaagag 900
gtcgatagcc tacgcaaagc ctgcctgacc ccccaaggcc acgctaacaa aaatctggtg 960
ggcaaaagcc cggctgaact gctgaaagcg gcgggtatca cttgccctgc caaagcccca 1020
cgcctactgc tggtggaagt ggcgggtgac gatccgctag tgaccacgga acaactgatg 1080
ccgctgctgc cagtggtgcg ggtaaaggat tttgatgcgg cgctgacact ggcgctgcaa 1140
gtggaaggcg gcctgcatca caccgcaacc atgcactccc agaatgtctc gcgcctgaat 1200
ctggcggccc gcctattgca gacctccatt tttgtcaaaa atggcccctc ctatgcgggg 1260
atcggggtcg gcggcgaggg ctttaccacc ttcaccatcg ccacccccac cggagagggc 1320
accacctcgg cccgcacctt tgcgcgtcaa cgccgctgtg tgctgactaa cggtttctcc 1380
attcgctga 1389
<210> 39
<211> 1383
<212> DNA
<213> Escherichia fergusonii ATCC 35469
<400> 39
atgaataccc gcgaactgga aaacatcatc cgcaatattc tgcgcgaaca actgagcaca 60
acagcagatg ccccgacgaa tggcattttt gattctgttg atgaagcgat taatgccgcc 120
catcaggcct ttttgcgcta tcaacaatgc ccactgaaaa cccgtagcgc cattatcagc 180
gccattcgcc aggagctgac tccacatctc gatatgttgg cgacagaaag cgccaacgaa 240
acaggcatgg gcaataaaga ggataaattc ctcaaaaaca aagccgcgct cgataacaca 300
ccaggtattg aagacctgac cacaaccgcg ctcactggtg atggcggcat ggtgttattt 360
gaatattcgc cttttggtgt tattggttct gtgacgccga gcactaaccc aaccgaaacc 420
attattaaca acagtattag catgttagcc gctggaaaca gtgtctattt cagcccacat 480
ccgggggcaa aaaatatctc tttgaaattg attgccatga ttgaagagat cgcttttcgc 540
tgtagcggta tccacaacct gattgtcacc gttgctgaac caacatttga agccacacag 600
caaatgatga ctcaccccaa tatcgccgtt ctggcgatta ccggtggacc tggcattgtc 660
gcaatgggca tgaaaagcgg taaaaaagtc attggggctg gcgccggaaa tccgccatgc 720
atcgtagatg aaaccgcaga tctggtaaaa gctgcggaag atattattaa tggtgcctcg 780
tttgactaca acctgccctg cattgctgag aaaagcctga ttgtcgttga ggagattgca 840
ggtacgttgg tgcaacaaat gcagaatttt ggcgctctgc ttctcaacaa agaggaaacc 900
gataagttac gtgacgtttg tctgccacaa ggaatggcaa ataaacaact ggtaggtaaa 960
agtccggcag ctctgttgca ggcggcaggc attgctgtgc cgctaaaaac accacgtctg 1020
ttaattgccc ttgttgacgc ctgcgacaag tgggtaacca gcgaacaact tatgccaatg 1080
ctgccaatcg taaaagttaa ggatttcgat agcgcactga cgctggcact gaaagtggaa 1140
gaaggtttgc atcacaccgc cattatgcac tcgcaaaatg tttcgcgact caacctggca 1200
gcccggacct tacagacctc aatctttgtt aagaatggtc cgtcatatgc tggtatcggt 1260
gtcggtggtg aaggatttac cacctttacg atcgctaccc ccacgggtga aggtactacc 1320
tcggccaaaa cgtttgcccg ttcccgtcgt tgcgtgttga ccagcggttt ttcgatccgt 1380
taa 1383
<210> 40
<211> 1395
<212> DNA
<213> Salmonella enterica subsp. enterica serovar Urbana str. ATCC 9261
<400> 40
ttagcgaata gaaaagccgt tggtcagcac gcagcgccgg gagcgggcaa aagtacgcgc 60
tgacgtggtc ccttcaccgg ttggcgtggc gatagtgaag gtggtaaagc cttcgccgcc 120
gacgccgatc ccggcataag aggggccgtt tttgacgaat atcgaggttt gcagcgtgcg 180
ggccgcgagg ttcaggcgcg acacgttctg cgagtgcata atggcggtat gatgcagccc 240
ctcttcaacc ttcagggcca gcgccagcgc gctatcgaaa tcgctgactt ttaccaccgg 300
cagcatcggc atcagctgtt cgctggtgac ccacggatcg tcagcgctaa ccagcgcaat 360
cagcagacgc ggcgcttttg cagggacagc gatcccggcg gcttccagca tggccgatgg 420
gctcttgccg accagttttt tattggcctg accttcaggc aggcagacgg cgcggagttt 480
gtcggtatcg gccgggctta gcagcagcgc gccgaaggtt tgcatttgct gcaccagacg 540
ctcggcgacg ctctccacta cgatcaggct cttctcggca atgcagggca ggttgtaatc 600
gaatgacgcg ccgttgatga tatcttccgc cgctttcacc aggtctgctg tttcatcgac 660
gatgcagggc gggttacccg cgccagcgcc aatcaccttc ttaccgctct tcatgcccat 720
tgccacaatg cccgggccac cggtaatggc cagtaccgcg attcgcgggt gggccatcat 780
ctgctgggtc gcttcgaagg tgggttcagc cacggtcacc accagattgc ggatgccgca 840
gcagcggaag gcaatctctt caattaggct aatcagcttc agagagacct ttttcgcgcc 900
cggatgcggg ctaaagtaaa tactattgcc cgccgccaac atgctgatgc tgttattgat 960
gatggtttcc gtcgggttgg tgcttggggc gaccgaaccg atgacgccaa acggtgagta 1020
ttcaaacagc accatgccgc cgtcgccggt cagcgcggtg gtggtgagat cttctacgcc 1080
cggcgtgttg tccagcgcag ccttgttttt gaggagttta tcttctttgt tgcccatccc 1140
tgtttcattg gcactctctt ccgccagggt cgccagcagc ggcgtcagct cctgacgcat 1200
cgcgctgata atggcgctgc gggtttttag cgggcactgc tgataacgta agaacgcctg 1260
gtgcgcggca tcgatggcct cgctcacgga ctggaaaatc cctttgccct taggctgggc 1320
cggcgtttgc gctggcgtgg ttaattgctc gctaagaatg gtgcgaatca gggtttcgag 1380
ttcagaagta ttcat 1395
<210> 41
<211> 299
<212> PRT
<213> Dictyostelium discoideum (Slime mold)
<400> 41
Met Ile Asn Arg Leu Phe Ser Ile Asn Asn Ile Lys Asn Gly Ser Lys
1 5 10 15
Phe Phe Ser Ser Ser Thr Thr Val Glu Thr Lys Gln Pro Leu Val Leu
20 25 30
Leu Glu Lys His Leu Val Asn Gly Lys Tyr Thr Gly Ile Gln Ile Val
35 40 45
Lys Leu Asn Lys Pro Lys Gln Leu Asn Ala Leu Thr Phe Glu Met Gly
50 55 60
Val Asp Tyr Lys Lys Val Val Asp Thr Leu Ala Glu Asp Lys Asp Leu
65 70 75 80
Lys Cys Val Val Leu Thr Gly Glu Gly Lys Ala Phe Ser Ala Gly Gly
85 90 95
Asp Leu Asp Phe Leu Ile Glu Arg Thr Lys Asp Thr Pro Glu Asn Asn
100 105 110
Gln Arg Ile Met Glu Arg Phe Tyr Arg Thr Phe Leu Tyr Ile Arg Ser
115 120 125
Leu Pro Val Pro Ile Ile Ser Ala Ile Asn Gly Ala Ala Ile Gly Ala
130 135 140
Gly Phe Cys Leu Ala Leu Ala Thr Asp Ile Arg Val Val Ser Asn Lys
145 150 155 160
Ala Pro Val Gly Leu Thr Phe Thr Lys Leu Gly Ile His Pro Gly Met
165 170 175
Gly Val Thr His Ser Ile Thr Asn Ile Val Gly Gln Asp Val Ala Ser
180 185 190
Tyr Met Leu Leu Ser Ser Asp Ile Ile Lys Gly Asp Glu Ala Gln Arg
195 200 205
Leu Gly Leu Val Leu Lys Ser Val Glu Ser Asp Gln Val Leu Pro Thr
210 215 220
Ala Leu Asn Leu Ala Glu Thr Ile Ser Lys Asn Ser Thr Ile Ala Val
225 230 235 240
Asn Ser Thr Thr Lys Thr Leu Arg Asn Lys Tyr Asn Ser Asp Leu Asp
245 250 255
Lys Ser Leu Thr Arg Glu Ala Asp Ala Gln Ser Gln Cys Trp Ala Ser
260 265 270
Lys Asp Ile Val Glu Gly Ile Leu Ala Ile Arg Glu Ser Arg Asp Pro
275 280 285
Lys His Asn Tyr Leu Leu Phe Asp Asp Gln Lys
290 295
<210> 42
<211> 261
<212> PRT
<213> Clostridium acetobutylicum
<400> 42
Met Glu Leu Asn Asn Val Ile Leu Glu Lys Glu Gly Lys Val Ala Val
1 5 10 15
Val Thr Ile Asn Arg Pro Lys Ala Leu Asn Ala Leu Asn Ser Asp Thr
20 25 30
Leu Lys Glu Met Asp Tyr Val Ile Gly Glu Ile Glu Asn Asp Ser Glu
35 40 45
Val Leu Ala Val Ile Leu Thr Gly Ala Gly Glu Lys Ser Phe Val Ala
50 55 60
Gly Ala Asp Ile Ser Glu Met Lys Glu Met Asn Thr Ile Glu Gly Arg
65 70 75 80
Lys Phe Gly Ile Leu Gly Asn Lys Val Phe Arg Arg Leu Glu Leu Leu
85 90 95
Glu Lys Pro Val Ile Ala Ala Val Asn Gly Phe Ala Leu Gly Gly Gly
100 105 110
Cys Glu Ile Ala Met Ser Cys Asp Ile Arg Ile Ala Ser Ser Asn Ala
115 120 125
Arg Phe Gly Gln Pro Glu Val Gly Leu Gly Ile Thr Pro Gly Phe Gly
130 135 140
Gly Thr Gln Arg Leu Ser Arg Leu Val Gly Met Gly Met Ala Lys Gln
145 150 155 160
Leu Ile Phe Thr Ala Gln Asn Ile Lys Ala Asp Glu Ala Leu Arg Ile
165 170 175
Gly Leu Val Asn Lys Val Val Glu Pro Ser Glu Leu Met Asn Thr Ala
180 185 190
Lys Glu Ile Ala Asn Lys Ile Val Ser Asn Ala Pro Val Ala Val Lys
195 200 205
Leu Ser Lys Gln Ala Ile Asn Arg Gly Met Gln Cys Asp Ile Asp Thr
210 215 220
Ala Leu Ala Phe Glu Ser Glu Ala Phe Gly Glu Cys Phe Ser Thr Glu
225 230 235 240
Asp Gln Lys Asp Ala Met Thr Ala Phe Ile Glu Lys Arg Lys Ile Glu
245 250 255
Gly Phe Lys Asn Arg
260
<210> 43
<211> 155
<212> PRT
<213> Clostridium difficile
<400> 43
Asn Ser Lys Lys Val Val Ile Ala Ala Val Asn Gly Phe Ala Leu Gly
1 5 10 15
Gly Cys Glu Leu Ala Met Ala Cys Asp Ile Arg Ile Ala Ser Ala Lys
20 25 30
Ala Lys Phe Gly Gln Pro Glu Val Thr Leu Gly Ile Thr Pro Gly Tyr
35 40 45
Gly Gly Thr Gln Arg Leu Thr Arg Leu Val Gly Met Ala Lys Ala Lys
50 55 60
Glu Leu Ile Phe Thr Gly Gln Val Ile Lys Ala Asp Glu Ala Glu Lys
65 70 75 80
Ile Gly Leu Val Asn Arg Val Val Glu Pro Asp Ile Leu Ile Glu Glu
85 90 95
Val Glu Lys Leu Ala Lys Ile Ile Ala Lys Asn Ala Gln Leu Ala Val
100 105 110
Arg Tyr Ser Lys Glu Ala Ile Gln Leu Gly Ala Gln Thr Asp Ile Asn
115 120 125
Thr Gly Ile Asp Ile Glu Ser Asn Leu Phe Gly Leu Cys Phe Ser Thr
130 135 140
Lys Asp Gln Lys Glu Gly Ile Val Ser Phe Arg
145 150 155
<210> 44
<211> 258
<212> PRT
<213> Clostridium pasteurianum
<400> 44
Met Gly Asn Ile Ile Phe Glu Glu Glu Asp Gly Ile Glu Lys Val Thr
1 5 10 15
Ile Asn Arg Pro Lys Ala Leu Asn Ala Leu Asn Ser Glu Thr Leu Lys
20 25 30
Glu Leu Gly Thr Val Ile Asn Asp Ile Ser Val Asn Asp Gly Ile Lys
35 40 45
Ala Val Ile Ile Thr Gly Ser Gly Ser Lys Ala Phe Val Ala Gly Ala
50 55 60
Asp Ile Ala Glu Met Ser Thr Leu Asn Ser Ile Glu Ala Thr Asn Phe
65 70 75 80
Ser Arg Leu Ala Gln Asn Val Phe Ser Gln Ile Glu Asn Leu Pro Lys
85 90 95
Leu Val Val Ala Ala Val Asn Gly Phe Ala Leu Gly Gly Gly Cys Glu
100 105 110
Leu Ala Met Ala Cys Asp Val Arg Phe Ala Ser Lys Lys Ala Lys Phe
115 120 125
Gly Gln Pro Glu Val Asn Leu Gly Ile Leu Pro Ser Phe Gly Gly Thr
130 135 140
Gln Arg Leu Pro Lys Leu Val Gly Lys Gly Ile Ala Lys Glu Leu Ile
145 150 155 160
Phe Ser Thr Asp Met Ile Thr Ala Asp Glu Ala Tyr Arg Ile Gly Leu
165 170 175
Ala Asn Lys Val Tyr Glu Pro Glu Glu Leu Leu Val Lys Ser Gln Glu
180 185 190
Phe Ala Glu Lys Val Met Thr Lys Ser Pro Trp Gly Val Lys Leu Ala
195 200 205
Lys Ala Cys Ile Asn Asn Gly Leu Asp Val Asp Leu Glu Ala Gly Leu
210 215 220
Lys Tyr Glu Ala Asn Ser Phe Gly Leu Cys Phe Ser Thr Glu Asp Gln
225 230 235 240
Lys Glu Gly Met Lys Ala Phe Leu Glu Lys Arg Lys Ala Asp Phe Lys
245 250 255
Gly Leu
<210> 45
<211> 262
<212> PRT
<213> Clostridium pasteurianum
<400> 45
Met Asp Phe Asn Asn Ile Ile Leu Glu Lys Glu Glu Lys Ile Ala Val
1 5 10 15
Val Thr Ile Asn Arg Pro Lys Ala Leu Asn Ala Leu Asn Ser Glu Thr
20 25 30
Leu Thr Glu Leu Asp Ser Val Ile Asp Glu Ile Asp Lys Asp Asn Glu
35 40 45
Ile Leu Ala Val Val Leu Thr Gly Ala Gly Lys Ser Phe Val Ala Gly
50 55 60
Ala Asp Ile Ser Glu Met Lys Asp Met Asn Val Val Glu Gly Arg Lys
65 70 75 80
Phe Gly Ile Leu Gly Asn Lys Val Phe Arg Lys Leu Glu Asn Leu Glu
85 90 95
Lys Pro Val Ile Ala Ala Leu Asn Gly Phe Thr Leu Gly Gly Gly Cys
100 105 110
Glu Ile Ala Met Ser Cys Asp Ile Arg Ile Ala Ser Thr Lys Ala Lys
115 120 125
Phe Gly Gln Pro Glu Val Gln Leu Gly Ile Thr Pro Gly Phe Gly Gly
130 135 140
Thr Gln Arg Leu Ala Arg Leu Ile Gly Pro Gly Ala Ala Lys Glu Leu
145 150 155 160
Ile Tyr Thr Gly Lys Ile Ile Asn Ala Glu Glu Ala Tyr Arg Leu Gly
165 170 175
Leu Val Asn Arg Val Ile Glu Pro Glu Thr Leu Leu Asp Glu Ala Lys
180 185 190
Gln Leu Ala Asn Thr Ile Ala Ala Asn Ala Pro Ile Ala Val Lys Leu
195 200 205
Ala Lys Ser Ala Ile Asn Arg Gly Ile Gln Thr Asp Ile Asp Thr Gly
210 215 220
Val Ser Ile Glu Ser Glu Val Phe Gly Ala Cys Phe Ser Thr Glu Asp
225 230 235 240
Gln Lys Glu Gly Met Asn Thr Phe Leu Asn Asp Lys Lys Tyr Leu Thr
245 250 255
Gly Asn Phe Lys Asn Lys
260
<210> 46
<211> 260
<212> PRT
<213> Megasphaera elsdenii
<400> 46
Met Asp Tyr Gln Asn Ile Ile Phe Ala Val Glu Asp Gly Ile Ala Thr
1 5 10 15
Ile Thr Ile Asn Arg Pro Lys Ala Leu Asn Ala Leu Asn Gln Ala Thr
20 25 30
Val Ser Glu Leu Lys Asp Val Val Glu Lys Ile Ala Ala Asp Lys Ala
35 40 45
Ile Lys Val Val Ile Ile Thr Gly Ala Gly Ala Lys Ser Phe Val Ala
50 55 60
Gly Ala Asp Ile Lys Glu Met Ala Ser Lys Asn Ala Ala Glu Gly Arg
65 70 75 80
Glu Trp Gly Gln Phe Gly Gln Asn Val Phe Thr Glu Ile Glu Asn Leu
85 90 95
Pro Gln Pro Val Ile Ala Ala Ile Asn Gly Phe Ala Leu Gly Gly Gly
100 105 110
Cys Glu Leu Ser Cys Ala Cys Asp Ile Arg Tyr Ala Ala Glu Asn Ala
115 120 125
Lys Phe Gly Gln Pro Glu Val Gly Leu Gly Ile Thr Pro Gly Phe Gly
130 135 140
Gly Thr Gln Arg Leu Thr Arg Val Val Gly Arg Gly His Ala Lys Glu
145 150 155 160
Leu Ile Tyr Thr Gly Gly Met Ile Asp Ala Glu Lys Ala Lys Ala Ile
165 170 175
Gly Leu Val Asn Glu Val Phe Pro Gln Glu Glu Leu Met Pro Ala Ala
180 185 190
Val Lys Leu Ala Lys Lys Ile Ala Lys Asn Ala Pro Ile Ala Val Gln
195 200 205
Leu Ser Lys Ala Ala Ile Asn Arg Gly Ile Asn Cys Asp Val Val Thr
210 215 220
Gly Ile Ala Tyr Glu Ala Glu Val Phe Gly Leu Cys Phe Ser Thr Ala
225 230 235 240
Asp Gln Lys Glu Gly Met Ala Ala Phe Cys Glu Lys Arg Lys Ala Thr
245 250 255
Phe Glu Gly Lys
260
<210> 47
<211> 259
<212> PRT
<213> Metallosphaera sedula
<400> 47
Met Glu Phe Glu Thr Ile Glu Thr Lys Lys Glu Gly Asn Leu Phe Trp
1 5 10 15
Ile Thr Leu Asn Arg Pro Asp Lys Leu Asn Ala Leu Asn Ala Lys Leu
20 25 30
Leu Glu Glu Leu Asp Arg Ala Val Ser Gln Ala Glu Ser Asp Pro Glu
35 40 45
Ile Arg Val Ile Ile Ile Thr Gly Lys Gly Lys Ala Phe Cys Ala Gly
50 55 60
Ala Asp Ile Thr Gln Phe Asn Gln Leu Thr Pro Ala Glu Ala Trp Lys
65 70 75 80
Phe Ser Lys Lys Gly Arg Glu Ile Met Asp Lys Ile Glu Ala Leu Ser
85 90 95
Lys Pro Thr Ile Ala Met Ile Asn Gly Tyr Ala Leu Gly Gly Gly Leu
100 105 110
Glu Leu Ala Leu Ala Cys Asp Ile Arg Ile Ala Ala Glu Glu Ala Gln
115 120 125
Leu Gly Leu Pro Glu Ile Asn Leu Gly Ile Tyr Pro Gly Tyr Gly Gly
130 135 140
Thr Gln Arg Leu Thr Arg Val Ile Gly Lys Gly Arg Ala Leu Glu Met
145 150 155 160
Met Met Thr Gly Asp Arg Ile Pro Gly Lys Asp Ala Glu Lys Tyr Gly
165 170 175
Leu Val Asn Arg Val Val Pro Leu Ala Asn Leu Glu Gln Glu Thr Arg
180 185 190
Lys Leu Ala Glu Lys Ile Ala Lys Lys Ser Pro Ile Ser Leu Ala Leu
195 200 205
Ile Lys Glu Val Val Asn Arg Gly Leu Asp Ser Pro Leu Leu Ser Gly
210 215 220
Leu Ala Leu Glu Ser Val Gly Trp Gly Val Val Phe Ser Thr Glu Asp
225 230 235 240
Lys Lys Glu Gly Val Ser Ala Phe Leu Glu Lys Arg Glu Pro Thr Phe
245 250 255
Lys Gly Lys
<210> 48
<211> 259
<212> PRT
<213> Clostridicum kluyvery
<400> 48
Met Glu Phe Lys Asn Ile Ile Leu Glu Lys Asp Gly Asn Val Ala Ser
1 5 10 15
Ile Thr Leu Asn Arg Pro Lys Ala Leu Asn Ala Leu Asn Ala Ala Thr
20 25 30
Leu Lys Glu Ile Asp Ala Ala Ile Asn Asp Ile Ala Glu Asp Asp Asn
35 40 45
Val Tyr Ala Val Ile Ile Thr Gly Ser Gly Lys Ala Phe Val Ala Gly
50 55 60
Ala Asp Ile Ala Glu Met Lys Asp Leu Thr Ala Val Glu Gly Arg Lys
65 70 75 80
Phe Ser Val Leu Gly Asn Lys Ile Phe Arg Lys Leu Glu Asn Leu Glu
85 90 95
Lys Pro Val Ile Ala Ala Ile Asn Gly Phe Ala Leu Gly Gly Gly Cys
100 105 110
Glu Leu Ser Leu Ser Cys Asp Ile Arg Ile Ala Ser Ser Lys Ala Lys
115 120 125
Phe Gly Gln Pro Glu Val Gly Leu Gly Ile Thr Pro Gly Phe Gly Gly
130 135 140
Thr Gln Arg Leu Ala Arg Ala Ile Gly Val Gly Met Ala Lys Glu Leu
145 150 155 160
Ile Tyr Thr Gly Lys Val Ile Asn Ala Glu Glu Ala Leu Arg Ile Gly
165 170 175
Leu Val Asn Lys Val Val Glu Pro Asp Lys Leu Leu Glu Glu Ala Lys
180 185 190
Ala Leu Val Asp Ala Ile Ile Val Asn Ala Pro Ile Ala Val Arg Met
195 200 205
Cys Lys Ala Ala Ile Asn Gln Gly Leu Gln Cys Asp Ile Asp Thr Gly
210 215 220
Val Ala Tyr Glu Ala Glu Val Phe Gly Glu Cys Phe Ala Thr Glu Asp
225 230 235 240
Arg Val Glu Gly Met Thr Ala Phe Val Glu Lys Arg Asp Lys Ala Phe
245 250 255
Lys Asn Lys
<210> 49
<211> 502
<212> PRT
<213> Sulfolobus tokodaii
<400> 49
Met Ala Ile Arg Thr Gly Glu Gln Tyr Leu Asp Ser Ile Lys Ile Arg
1 5 10 15
Asn Lys Ala Glu Ile Tyr Val Met Gly Lys Glu Val Lys Asp Val Thr
20 25 30
Thr His Pro Phe Leu Lys Pro Ser Val Met Ala Phe Lys Ala Thr Phe
35 40 45
Asp Ala Ala Trp Glu Glu Asp Thr Lys Glu Leu Ala Arg Ala Trp Ser
50 55 60
Pro Phe Ile Asn Glu Glu Val Asn Arg Phe Asn His Ile His Arg Ser
65 70 75 80
Pro Glu Asp Leu Ala Ala Lys Val Lys Leu Leu Arg Lys Leu Ser His
85 90 95
Lys Thr Gly Ala Cys Phe Gln Arg Cys Val Gly Trp Asp Ala Leu Asn
100 105 110
Thr Leu Trp Ile Met Thr Asn Ile Met Ala Gln Lys Gly Lys Lys Glu
115 120 125
Tyr Lys Asp Arg Phe Val Glu Tyr Leu Ser Tyr Val Gln Lys Lys Asp
130 135 140
Leu Ala Leu Ala Gly Ala Met Thr Asp Ala Lys Gly Val Arg Thr Leu
145 150 155 160
Lys Pro His Gln Gln Pro Asn Lys Asn Ala Tyr Val Arg Ile Glu Glu
165 170 175
Val Thr Lys Asp Gly Ile Tyr Val Ser Gly Ala Lys Ala Asn Ile Thr
180 185 190
Gly Val Ala Ala Thr Glu Glu Ile Val Val Leu Pro Thr Arg Ala Met
195 200 205
Gly Pro Glu Asp Lys Asp Tyr Ala Val Ala Phe Ser Ile Pro Thr Asp
210 215 220
Thr Glu Gly Ile Lys Ile Ile Val Gly Arg Gln Leu Asn Asp Ala Arg
225 230 235 240
Arg Leu Glu Gly Gly Asp Ile Asp Ala Leu Pro Tyr Phe Tyr Asn His
245 250 255
Glu Gly Leu Val Ile Phe Asp His Val Phe Val Pro Met Asp Arg Val
260 265 270
Phe Leu Met Gly Glu Tyr Glu Phe Thr Ser Gln Leu Val Glu Val Phe
275 280 285
Ser Ala Tyr His Arg Gln Gly Tyr Gly Gly Cys Lys Ala Gly Leu Gly
290 295 300
Asp Val Ile Ile Gly Ala Ser Met Asn Leu Ala Lys Gln Leu Gly Val
305 310 315 320
Glu Lys Ala Ser His Val Gln Glu Lys Leu Thr Glu Met Ile Phe Leu
325 330 335
Thr Glu Thr Met Tyr Ser Ala Gly Ile Ala Ala Ser Leu Asn Ala Val
340 345 350
Lys Val Cys Asp Asn Cys Trp Trp Val Asn Pro Met His Ala Asn Val
355 360 365
Thr Lys His Leu Val Ala Arg Phe Pro Ala Gln Ile Ser Gln Leu Ser
370 375 380
Ile Asp Ile Ala Gly Gly Ile Ile Gly Thr Ala Pro Ser Glu Trp Asp
385 390 395 400
Leu Lys Asn Pro Lys Leu Arg Glu Tyr Ile Ala Lys Tyr Leu Gln Gly
405 410 415
Val Glu Gly Tyr Thr Ala Glu Asp Arg Leu Arg Met Val Arg Leu Leu
420 425 430
Glu Asn Val Ser Leu Gly Val Ala Phe Gln Ile Glu Ser Val His Gly
435 440 445
Ala Gly Ser Pro Ala Ala Gln Arg Ile Met Phe Ser Arg Leu Tyr Asp
450 455 460
Leu Asn Tyr Ala Glu Glu Val Ala Lys Arg Leu Ala Gly Lys Lys Thr
465 470 475 480
Asp Leu Gln Trp Lys Pro Lys Ala Glu Pro Trp Arg Glu Ser Glu Thr
485 490 495
Glu Lys Leu Val Lys Ser
500
<210> 50
<211> 483
<212> PRT
<213> Geobacter metallireducens
<400> 50
Met Ala Leu Arg Asp Gly Asn Ser Tyr Arg Glu Ser Leu Arg Ala Leu
1 5 10 15
Asn Ile Lys Val Tyr Ala Phe Gly Glu Lys Ile Asp Ser Ile Val Asp
20 25 30
His Pro Leu Phe Gln Pro His Ile Asn Ala Ala Ala Leu Thr Phe Asp
35 40 45
Leu Ala His Asp Pro Thr Thr Glu Ala Leu Val Thr Ala Thr Ser His
50 55 60
Leu Thr Gly Ser Lys Ile Ser Arg Phe Thr His Ile His Gln Ser Thr
65 70 75 80
Asp Asp Leu Ile Lys Lys Val Lys Met Leu Arg Leu Ile Ala Gly Lys
85 90 95
Thr Gly Ser Cys Tyr Gln Arg Cys Val Gly Trp Asp Ala Leu Asn Ala
100 105 110
Asn Tyr Thr Val Thr Tyr Glu Met Asp Gln Glu Leu Gly Thr Asp Tyr
115 120 125
His Gln Arg Phe Arg Arg Tyr Leu Glu Tyr Ile Gln Asp Asn Asp Leu
130 135 140
Met Val Ala Gly Ala Met Thr Asp Pro Lys Gly Asp Arg Gly Leu Pro
145 150 155 160
Pro Ala Lys Gln Lys Asp Pro Asp Met Phe Val His Val Val Ala Lys
165 170 175
Asn Asp Lys Gly Ile Val Ile Arg Gly Ala Lys Val His Gln Thr Gly
180 185 190
Ile Val Asn Ser His Glu Met Leu Ile Met Pro Thr Met Ala Met Gly
195 200 205
Glu Glu Asp Gly Asp Tyr Ala Val Ala Cys Ala Leu Pro Thr Asp Ser
210 215 220
Pro Gly Val Ile His Ile Phe Gly Arg Gln Thr Asn Asp Thr Arg Arg
225 230 235 240
Leu Glu Lys Gly Asp Leu Asp Gln Gly Asn Ala Glu Tyr Gly Thr Val
245 250 255
Gly Gly Glu Ala Leu Thr Ile Leu Glu Asp Val Phe Val Pro Trp Glu
260 265 270
Arg Val Phe Met Cys Gly Glu Tyr Lys Tyr Ala Gly Leu Leu Val Glu
275 280 285
Arg Phe Ala Ser Tyr His Arg Gln Asn Tyr Gly Gly Cys Lys Ala Gly
290 295 300
Val Ser Asp Val Ile Ile Gly Ala Thr Thr Ala Met Ala Glu Tyr Asn
305 310 315 320
Gly Ala Ala Lys Ala Ser His Val Arg Asp Lys Ile Val Glu Met Val
325 330 335
His Leu Thr Glu Thr Leu Tyr Cys Gly Ser Ile Ala Cys Ser Cys Glu
340 345 350
Gly Ala Pro Thr Pro Ser Gly Ala Tyr Phe Val Asn Pro Leu Leu Ala
355 360 365
Asn Thr Val Lys Gln Asn Val Thr Arg Phe Ile Tyr Glu Ile Ala Arg
370 375 380
Leu Ser His Asp Ile Ser Gly Gly Cys Met Ala Thr Met Pro Ser Glu
385 390 395 400
Lys Asp Leu His His Asp Glu Ile Gly Lys Tyr Val Glu Lys Tyr Phe
405 410 415
Arg Gly Val Asp Glu Ala Pro Thr Glu Glu Arg Met Arg Met Ala Arg
420 425 430
Leu Val Glu Asn Met Thr Gly Gly Thr Ala Leu Val Glu Ser Met His
435 440 445
Gly Ala Gly Ser Pro Gln Ala Gln Arg Val Met Ile Leu Arg Gln Ala
450 455 460
Asn Leu Gly His Lys Val Lys Leu Ala Lys Lys Leu Ala Gly Ile Lys
465 470 475 480
Glu Glu Lys
<210> 51
<211> 463
<212> PRT
<213> Sulfolobus solfataricus
<400> 51
Met Arg Ser Lys Glu Asp Phe Leu Lys Ser Leu Lys Asp Gly Arg Asn
1 5 10 15
Leu Tyr Tyr Arg Gly Lys Leu Val Glu Asp Ile Thr Thr His Gln Ile
20 25 30
Leu Lys Thr Ala Ala Leu His Ala Ala Lys Leu Tyr Glu Tyr Ala Asp
35 40 45
Arg Val Tyr Glu Asp Asn Lys Met Gly Lys Met Ser Lys Phe Phe Lys
50 55 60
Val Pro Trp Thr Ser Gln Asp Leu Leu Asp Arg His Lys Leu Ile Tyr
65 70 75 80
Asp Leu Thr Met Tyr Cys Asn Gly Val Phe Asn Ile Ser Gln Ala Ile
85 90 95
Gly Ser Asp Ala Ile Phe Ala Leu Met Ile Thr Ala Lys Gln Val Asp
100 105 110
Arg Lys Tyr Gly Thr Asp Tyr Ser Lys Arg Val Glu Lys Tyr Phe Glu
115 120 125
Arg Val Ala Lys Glu Asp Leu Thr Leu Ala Thr Ala Gln Thr Asp Val
130 135 140
Lys Gly Asp Arg Ser Lys Arg Pro Ser Glu Gln Val Asp Pro Asp Met
145 150 155 160
Tyr Val Arg Val Val Asp Val Lys Ser Asp Gly Ile Val Val Arg Gly
165 170 175
Ala Lys Ala His Thr Thr Gln Ser Ala Val Ser Asp Glu Ile Ile Val
180 185 190
Ile Pro Thr Arg Val Met Arg Asp Ser Asp Lys Asp Tyr Ala Val Ala
195 200 205
Phe Ala Val Pro Ala Asn Thr Lys Gly Leu Lys Met Tyr Ile Arg Pro
210 215 220
Ile Asp Glu Ile Glu Gly Asn Ser Ser Ser Val Leu Ser Arg Lys Asp
225 230 235 240
Tyr Glu Leu Glu Thr Leu Thr Val Phe Asn Asp Val Phe Val Pro Trp
245 250 255
Asp Arg Val Phe Leu Phe Lys Glu Tyr Asp Tyr Ala Gly Thr Leu Ala
260 265 270
Met Leu Phe Ala Thr Phe His Arg Phe Thr Ala Leu Ser Tyr Arg Ser
275 280 285
Ala Thr Met Asn Leu Tyr Leu Gly Ala Ser Lys Val Ala Ser Gln Val
290 295 300
Asn Gly Ile Glu Asn Glu Lys His Val Arg Asp Asp Ile Val Asp Ile
305 310 315 320
Ile Leu Tyr Lys Glu Ile Met Arg Ser Ser Ala Ile Ala Ala Ala Val
325 330 335
Tyr Pro Val Asn Met Glu Gly Ile Ala Val Pro Asn Pro Leu Phe Thr
340 345 350
Asn Val Gly Lys Leu Tyr Ser Asn Met His Phe His Asp Val Val Arg
355 360 365
Asp Leu Ile Asp Ile Ala Gly Gly Ile Ile Ala Thr Met Pro Ser Gln
370 375 380
Glu Asp Leu Glu Ser Asp Glu Gly Lys Asn Ile Val Lys Tyr Leu Arg
385 390 395 400
Gly Ser Val Asp Gly Glu Glu Arg Ala Lys Val Leu Lys Leu Ala Lys
405 410 415
Glu Leu Gly Ala Ser Thr Phe Thr Gly Tyr Leu Leu Thr Gly Met Ile
420 425 430
His Ala Glu Gly Ser Met Glu Ala Ser Lys Ile Glu Leu Phe Arg Ser
435 440 445
Tyr Asn Phe Lys Glu Ala Glu Asn Leu Val Lys Arg Val Leu Ser
450 455 460
<210> 52
<211> 479
<212> PRT
<213> Syntrophobacter fumaroxidans
<400> 52
Met Gly Leu Lys Thr Lys Ala Glu Tyr Ile Glu Ser Leu Arg Gly Met
1 5 10 15
Lys Pro Thr Val Tyr Met Phe Gly Glu Lys Ile Glu Ser Val Val Asp
20 25 30
Asn Pro Arg Leu Arg Ala Gly Ile Glu Ala Thr Gly Ala Thr Tyr Glu
35 40 45
Leu Ala Glu Thr Glu Glu Tyr Arg Pro Leu Ile Val Thr Glu Ser Pro
50 55 60
Leu Ile His Glu Pro Val Asn Arg Tyr Thr Leu Pro Pro Ser Ser Ile
65 70 75 80
Ala Asp Leu Val Ala Arg Val Lys Ile Asn Arg Leu Met Gly Thr Arg
85 90 95
Val Gly Thr Cys Phe Gln Arg Cys Thr Gly Leu Asp Cys Leu Ser Ala
100 105 110
Leu Ser Ile Val Thr Tyr Asp Ile Asp Ala Lys His Ser Thr Pro Tyr
115 120 125
Phe Lys Arg Phe Ile Glu Phe Leu Lys His Val Gln Lys Asn Asp Leu
130 135 140
Thr Cys Asn Ala Gly Val Thr Asp Val Lys Gly Asp Arg Ser Leu Ala
145 150 155 160
Pro His Glu Gln Glu Asp Lys Asp Met Tyr Val Arg Val Val Glu Arg
165 170 175
Asn Ala Asp Gly Ile Val Val Arg Gly Ala Lys Ala His Gln Thr Gly
180 185 190
Ser Leu Ser Ser His Glu Ile Ile Val Leu Pro Thr Arg Ala Leu Arg
195 200 205
Lys Gly Asp Glu Asp Tyr Ala Leu Ala Phe Ala Ile Pro Asn Asp Thr
210 215 220
Pro Gly Leu Ile His Val Val Gly Arg Ser Ser Leu Asp Thr Arg Gln
225 230 235 240
Leu Asp Gly Cys Asp Leu Gly Asn Leu His Tyr Ser Lys Tyr Cys Pro
245 250 255
Thr Val Ile Phe Lys Asp Val Phe Val Pro Trp Glu Arg Val Phe Met
260 265 270
Cys Gly Glu Val Glu Phe Ala Val Glu Met Val Asn Arg Phe Ser Ala
275 280 285
Tyr His Arg Gln Ser His Gly Gly Cys Lys Ser Gly Lys Ile Asp Cys
290 295 300
Met Val Gly Ala Ala Leu Thr Met Met Asp Tyr Asn Gly Thr Glu Lys
305 310 315 320
Ala Gly His Leu Lys Gln Lys Ala Ile Glu Met Val His Arg Ala Glu
325 330 335
Thr Leu Tyr Gly Cys Ser Leu Ala Ala Ser Tyr Glu Gly Lys Lys Glu
340 345 350
Pro Ser Gly Thr Tyr Phe Ile Asp Thr Val Leu Ala Asn Ala Ser Lys
355 360 365
Ile His Glu Gly Lys Glu Met Ser Glu Ala Gly Arg Leu Leu Val Asp
370 375 380
Ile Ala Gly Gly Phe Val Ala Asp Leu Pro Ser Asp Arg Asp Leu Ala
385 390 395 400
Ile Pro Glu Val Gly Glu Leu Leu Lys Lys Tyr Leu Lys Gly Val Ala
405 410 415
Ser Val Pro Val Glu Asp Arg Val Lys Met Tyr Arg Leu Ile Glu Lys
420 425 430
Leu Val Met Glu Ser Ala Asp Thr Ile Ser Asp Ile His Gly Gly Gly
435 440 445
Ser Pro Glu Ala His Arg Ile Thr Ile Leu Arg Glu Ser Asn Leu Lys
450 455 460
Ala Lys Lys Asp Ala Ala Lys Arg Leu Ala Gly Ile Glu Ser Lys
465 470 475
<210> 53
<211> 486
<212> PRT
<213> Porphyromonas gingivalis
<400> 53
Met Met Thr Ser Glu Gln Tyr Val Glu Ser Leu Arg Lys Leu Asn Leu
1 5 10 15
Lys Val Tyr Phe Met Gly Glu Arg Ile Glu Asn Pro Val Asp His Pro
20 25 30
Met Ile Arg Pro Ser Met Asn Ser Val Ala Met Thr Tyr Lys Leu Ala
35 40 45
Glu Met Asp Glu Tyr Lys His Leu Met Thr Ala Thr Ser Asn Leu Thr
50 55 60
Gly Lys Gln Val Asn Arg Phe Cys His Leu His Gln Ser Thr Glu Asp
65 70 75 80
Leu Lys Asp Lys Val Lys Met Gln Arg Leu Met Gly Gln Lys Thr Ala
85 90 95
Ser Cys Phe Gln Arg Cys Val Gly Met Asp Ala Phe Asn Ala Ile Tyr
100 105 110
Ser Thr Thr Tyr Glu Met Asp Gln Ala Leu Gly Thr Thr Tyr His Lys
115 120 125
Arg Phe Ile Glu Tyr Met Lys Tyr Val Gln Asp Asn Asp Leu Val Val
130 135 140
Asp Gly Ala Met Thr Asp Pro Lys Gly Asp Arg Gly Leu Ser Pro Ser
145 150 155 160
Glu Gln Ala Asp Pro Asp Leu Tyr Leu His Ile Val Glu Val Arg Glu
165 170 175
Asp Gly Ile Val Val Ser Gly Ala Lys Ala His Gln Thr Gly Ala Val
180 185 190
Asn Ser His Glu His Leu Ile Met Pro Thr Ile Ala Met Arg Glu Ala
195 200 205
Asp Ala Asp Tyr Ala Val Ser Phe Ala Val Pro Ser Asp Ala Glu Gly
210 215 220
Val Ile Met Ile Tyr Gly Arg Gln Ser Cys Asp Thr Arg Lys Met Glu
225 230 235 240
Glu Gly Ala Asp Ile Asp Leu Gly Asn Ser Glu Phe Gly Gly His Glu
245 250 255
Ala Leu Val Val Phe Asp Arg Val Phe Val Pro Asn Asp Arg Val Phe
260 265 270
Met Cys Lys Glu Tyr Gln Phe Ala Gly Met Met Val Glu Arg Phe Ala
275 280 285
Gly Tyr His Arg Gln Ser Tyr Gly Gly Cys Lys Val Gly Val Gly Asp
290 295 300
Val Leu Ile Gly Ala Ala Ala Leu Ala Ala Asp Tyr Asn Gly Val Pro
305 310 315 320
Lys Ala Ser His Ile Lys Asp Lys Leu Ile Glu Met Ile His Leu Asn
325 330 335
Glu Thr Leu Tyr Ala Cys Gly Ile Ala Cys Ser Ser Glu Gly Thr Gln
340 345 350
Met Lys Ala Gly Asn Tyr Met Ile Asp Leu Leu Leu Ala Asn Val Cys
355 360 365
Lys Gln Asn Ile Thr Arg Leu Pro Tyr Glu Ile Ala Arg Leu Ala Glu
370 375 380
Asp Ile Ala Gly Gly Leu Met Val Thr Met Pro Ser Gln Gln Asp Phe
385 390 395 400
Arg His Pro Glu Ile Gly Pro Ile Val Lys Lys Tyr Leu Ala Gly Ala
405 410 415
Thr Gly Lys Ser Thr Glu Asn Arg Met Arg Val Leu Arg Leu Ile Glu
420 425 430
Asn Ile Thr Leu Gly Thr Ala Ala Val Gly Tyr Arg Thr Glu Ser Met
435 440 445
His Gly Ala Gly Ser Pro Gln Ala Gln Arg Ile Met Ile Ala Arg Gln
450 455 460
Gly Asp Leu Glu Gly Lys Lys Lys Leu Ala Arg Ala Ile Ala His Ile
465 470 475 480
Asp Glu Ser Leu Asp Lys
485
<210> 54
<211> 529
<212> PRT
<213> Polynucleobacter necessarius subsp. Asymbioticus
<400> 54
Met Ser Gln Ser Thr Ser Gln Phe Met Asn Ser Lys Asp Tyr Gln Glu
1 5 10 15
Ser Leu Arg Ser Leu Lys Pro Thr Val Tyr Val Asp Gly Arg Leu Ile
20 25 30
Glu Ser Val Ala Asp Glu Pro Ser Leu Arg Pro Gly Val Gln Ala Leu
35 40 45
Gly Val Thr Tyr Asp Met Val His Asp Pro Ala Leu Ala Pro Leu Met
50 55 60
Leu Ala Asp Ser Asn Gly Thr Pro Val Pro Arg Met Leu His Ile Asn
65 70 75 80
Gln Ser Ser Gly Asp Leu Leu Asn Lys Leu Glu Ala Val Arg Val Leu
85 90 95
Cys Gln Glu Thr Gly Cys Ala Gln Arg Tyr Leu Ala His Asp Ala Leu
100 105 110
Asn Ala Ile Ala Gln Val Ser Ala Arg Ile Asp Asp Ala Lys Gly Ser
115 120 125
Asn Glu His Ser Ala Lys Phe Ser Glu Tyr Leu Ser His Val Gln Thr
130 135 140
Lys Asp Leu Ala Leu Gly Ile Ala Met Thr Asp Ala Lys Gly Asp Arg
145 150 155 160
Ser Arg Arg Pro His Glu Gln Glu Asn Pro Asp Thr Tyr Val His Ile
165 170 175
Val Ser Gln Asp Ala Lys Gly Val Val Ile Ser Gly Thr Lys Ala Ile
180 185 190
Val Thr Gly Ala Pro Tyr Met His Glu Phe Leu Val Met Pro Gly Arg
195 200 205
Asn Met Thr Lys Glu Asp Ala Ala Phe Ala Ile Cys Cys Ala Val Pro
210 215 220
Val Asp Ala Lys Gly Ile Thr Ile Val Ala Arg Pro Ala Gly Arg Pro
225 230 235 240
Gly Asp Lys Val Glu His Gly Lys Pro Ile Phe Ser Ser Lys Tyr Gly
245 250 255
Gln Ser Thr Gly Val Val Ile Phe Asp Lys Val Phe Val Pro Trp Asp
260 265 270
Arg Val Phe Tyr Ala Gly Glu Trp Glu His Ser Ser Val Leu Thr Tyr
275 280 285
Asn Tyr Ala Thr His His Arg His Ser Cys Ile Ala Ala Arg Ala Gly
290 295 300
Phe Gly Asp Leu Leu Ile Gly Ala Gly Ala Leu Met Cys Glu Ala Asn
305 310 315 320
Gly Leu Asp Pro Ala Thr Lys Ser Asn Leu Arg Asp Pro Met Val Glu
325 330 335
Leu Ile Lys Ile Thr Glu Gly Phe Tyr Ala Cys Gly Val Ala Ala Ser
340 345 350
Val Tyr Gly Thr Gln Asp Pro Tyr Ser Lys Ser Phe Met Pro Glu Pro
355 360 365
Val Phe Ser Asn Ile Gly Lys Leu Leu Leu Ala Thr Gln Ile Tyr Asp
370 375 380
Met His Arg Leu Ala His Glu Val Ser Gly Gly Leu Ile Val Ala Leu
385 390 395 400
Pro Gly Pro Asp Glu Asp His Asn Pro Ala Thr Ala Ala Thr Leu Ala
405 410 415
Glu Val Leu Arg Ala Asn Pro Ala Val Pro Tyr Asp Lys Arg Ile Glu
420 425 430
Val Ala Arg Phe Ile Glu Asp Leu Thr Ala Ser Tyr Gln Gly Gly Trp
435 440 445
Tyr Ser Val Ile Ser Leu His Gly Gly Gly Ser Pro Ala Ala Met Lys
450 455 460
Gln Glu Ile Tyr Arg Gln Tyr Pro Ile Gly Asn Lys Val Glu Leu Val
465 470 475 480
Glu Arg Leu Leu Asp Arg Gly Val Leu Thr Ser Ser Glu Glu Arg Ala
485 490 495
Ile Thr Lys Asn Lys Gln Pro Gly Arg Cys Cys Asp Gln Gly Cys Ser
500 505 510
Ala Pro Gly Gln Ala Val Met Val Pro Leu Pro Glu Pro Gly Arg Arg
515 520 525
Thr
<210> 55
<211> 257
<212> PRT
<213> Sulfolobus tokodaii
<400> 55
Met Glu Thr Ile Val Ile Lys Lys Glu Thr Pro Ile Gly Trp Ile Tyr
1 5 10 15
Leu Asn Arg Pro Asp Arg Leu Asn Ala Ile Asn Gln Gln Met Ile Lys
20 25 30
Glu Leu Arg Gln Gly Ile Asp Glu Met Val Tyr Asp Ser Asp Ile Lys
35 40 45
Val Ile Ile Ile Thr Gly Asn Gly Lys Ala Phe Ser Ala Gly Ala Asp
50 55 60
Ile Ser Arg Phe Lys Glu Leu Asn Gly Tyr Thr Ala Trp Gln Phe Ala
65 70 75 80
Lys Ser Gly Arg Glu Leu Met Asp Tyr Ile Glu Asn Ile Ser Lys Pro
85 90 95
Thr Ile Ala Met Val Asn Gly Tyr Ala Leu Gly Gly Gly Leu Glu Leu
100 105 110
Ala Met Ala Cys Asp Ile Arg Ile Ala Ala Glu Glu Ala Gln Leu Gly
115 120 125
Leu Pro Glu Ile Asn Leu Gly Ile Tyr Pro Gly Phe Gly Gly Thr Gln
130 135 140
Arg Leu Val Arg Leu Ile Gly Lys Gly Lys Ala Leu Glu Leu Met Leu
145 150 155 160
Thr Gly Asp Arg Ile Ser Ala Lys Glu Ala Glu Lys Ile Gly Leu Val
165 170 175
Asn Lys Val Val Pro Leu Ser Asn Leu Glu Gln Glu Thr Arg Asn Phe
180 185 190
Ala Leu Lys Leu Ala Glu Lys Pro Pro Ile Ser Ile Ala Leu Ile Lys
195 200 205
Leu Leu Val Asn Gln Gly Ile Asp Leu Pro Ile Leu Ala Gly Leu Asn
210 215 220
Met Glu Ser Leu Gly Trp Gly Val Val Phe Ser Thr Glu Asp Glu Lys
225 230 235 240
Glu Gly Val Ser Ala Phe Leu Glu Lys Arg Lys Ala Gln Phe Lys Gly
245 250 255
Lys
<210> 56
<211> 258
<212> PRT
<213> Gordonia terrae C-6
<400> 56
Met Thr Glu His Gln Thr Ile Val Val Glu Thr Ser Gly Arg Val Gly
1 5 10 15
Ile Ile Thr Leu Asn Arg Pro Lys Ala Leu Asn Ala Leu Asn Thr Glu
20 25 30
Leu Met Asn Glu Val Val Gly Ala Val Lys Glu Phe Asp Val Asp Gln
35 40 45
Gly Ile Gly Ala Ile Val Ile Thr Gly Ser Glu Lys Ala Phe Ala Ala
50 55 60
Gly Ala Asp Ile Lys Glu Met Ser Ser Lys Ser Tyr Ala Asp Val Val
65 70 75 80
Asn Glu Gln Phe Phe Gly Ala Trp Asp Glu Leu Ser Arg Ala Arg Thr
85 90 95
Pro Ile Ile Ala Ala Val Thr Gly Tyr Ala Leu Gly Gly Gly Cys Glu
100 105 110
Leu Ala Met Leu Cys Asp Thr Ile Ile Ala Gly Asp Asn Ala Val Phe
115 120 125
Gly Gln Pro Glu Ile Asn Leu Gly Val Ile Pro Gly Ile Gly Gly Ser
130 135 140
Gln Arg Leu Thr Arg Ala Val Gly Lys Ala Lys Ala Met Asp Met Val
145 150 155 160
Leu Thr Gly Arg Gln Met Lys Val Asp Glu Ala Glu Arg Leu Gly Leu
165 170 175
Val Ser Arg Val Val Pro Lys Glu Asp Cys Arg Ala Ala Ala Ile Glu
180 185 190
Val Ala Glu Ile Ile Ala Ser Lys Ser Leu Ile Ala Ala Ala Ala Ala
195 200 205
Lys Asp Ala Val Asn Arg Ala Phe Glu Ser Ser Leu Val Glu Gly Val
210 215 220
Arg Ala Glu Arg Ala Leu Phe Tyr Ser Thr Phe Ala Thr Asp Asp Gln
225 230 235 240
Thr Glu Gly Met Ala Ala Phe Val Glu Lys Arg Asp Pro Asn Phe Thr
245 250 255
His Arg
<210> 57
<211> 258
<212> PRT
<213> Halalkalicoccus jeotgali
<400> 57
Met Ala Asp Arg Val Leu Ile Glu Arg Glu Asn Asp Ile Ala Thr Ile
1 5 10 15
Ile Val Asn Arg Pro Glu Lys Arg Asn Ala Met Asp Ile Pro Thr Arg
20 25 30
Lys Ala Leu Tyr Ala Ala Phe Glu Glu Val Ser Glu Asp Asp Asp Val
35 40 45
Arg Ala Ile Val Leu Arg Gly Ala Gly Asp Gly Ser Phe Ile Ala Gly
50 55 60
Gly Asp Ile Asp Ser Phe Ala Asp Phe Asp His Met Asp Gly Met Glu
65 70 75 80
Tyr Ser Glu Lys Tyr Ala Gln Gly Leu Tyr Asn Tyr Val Ala Asp Arg
85 90 95
His Lys Pro Thr Ile Ala Ala Val Asp Gly Tyr Ala Leu Gly Gly Gly
100 105 110
Thr Glu Ile Ala Leu Ala Cys Asp Ile Arg Leu Ala Thr Asp Asp Ala
115 120 125
Lys Phe Gly Leu Pro Glu Val Gly Ile Gly Val Ile Pro Ala Gly Gly
130 135 140
Gly Thr Gln Arg Leu Val Gln Val Val Gly Ala Gly Leu Ala Ser Glu
145 150 155 160
Leu Ile Leu Thr Gly Arg Ile Ile Ser Ala Asp Glu Ala Lys Arg Ile
165 170 175
Gly Leu Ala Asn His Val Tyr Ala Ala Glu Glu Phe Asp Asn Glu Val
180 185 190
Arg Ala Met Ala Glu Asp Leu Ala Ser Lys Ala Pro Val Ala Gln Arg
195 200 205
Leu Ala Lys Glu Ser Ile Arg Arg Ser Leu Asp Ile Asp Ala Gly Leu
210 215 220
Glu Tyr Glu Arg Leu Ala Gly Ala Phe Leu Phe Gly Thr Asp Asp Gln
225 230 235 240
Lys Glu Gly Ala Asn Ala Phe Leu Glu Asp Arg Glu Pro Lys Tyr Arg
245 250 255
Asn Arg
<210> 58
<211> 257
<212> PRT
<213> Carboxydothermus hydrogenoformans
<400> 58
Met Glu Phe Glu Lys Ile Lys Phe Glu Val Thr Asp Gly Tyr Ala Val
1 5 10 15
Ile Tyr Leu Asn Asn Pro Pro Val Asn Ala Leu Gly Gln Lys Val Leu
20 25 30
Lys Asp Leu Gln Lys Ala Leu Gln Glu Ile Glu Lys Asn Pro Glu Ile
35 40 45
Arg Ala Val Ile Ile Ser Gly Glu Gly Ser Lys Val Phe Cys Ala Gly
50 55 60
Ala Asp Ile Thr Glu Phe Ala Asp Arg Ala Lys Gly Ile Leu Pro Glu
65 70 75 80
Val Glu Gly Ser Val Leu Phe Arg Gln Ile Glu Leu Phe Pro Lys Pro
85 90 95
Val Ile Ala Ala Leu Asn Gly Ser Ser Tyr Gly Gly Gly Thr Glu Leu
100 105 110
Ala Ile Ser Cys His Leu Arg Ile Leu Ala Asp Asp Ala Ser Met Ala
115 120 125
Leu Pro Glu Val Lys Leu Gly Ile Ile Pro Gly Trp Gly Gly Thr Gln
130 135 140
Arg Leu Pro Arg Leu Ile Gly Lys Thr Arg Ala Leu Glu Ala Met Leu
145 150 155 160
Thr Gly Glu Pro Ile Thr Ala Glu Glu Ala Leu Ser Tyr Gly Leu Val
165 170 175
Asn Lys Val Val Pro Lys Asp Gln Val Leu Thr Glu Ala Arg Ala Leu
180 185 190
Ala Ala Lys Leu Ala Lys Gly Ala Pro Ile Ala Met Arg Glu Ile Leu
195 200 205
Lys Ala Val Thr Leu Gly Leu Asp Thr Ser Ile Glu Glu Gly Leu Lys
210 215 220
Ile Glu Lys Glu Gly Ser Lys Val Ala Phe Ser Ser Glu Asp Ala Val
225 230 235 240
Glu Gly Arg Thr Ala Phe Phe Glu Lys Arg Pro Pro Asn Phe Lys Gly
245 250 255
Arg
<210> 59
<211> 257
<212> PRT
<213> Thermomicrobium roseum
<400> 59
Met Ser Val Arg Val Glu Arg Glu Gly Ala Ile Thr Leu Val Thr Val
1 5 10 15
Glu Arg Pro Glu Arg Leu Asn Ala Leu Asp Thr Ala Thr Leu Arg Ala
20 25 30
Leu Leu Ala Ala Val Gln Glu Leu Ala Thr Glu Glu Ala Ile Ala Val
35 40 45
Val Val Leu Thr Gly Ala Gly Asp Arg Ala Phe Ile Ala Gly Ala Asp
50 55 60
Ile Ser Glu Met Val Glu Lys Ser Pro Ala Glu Ala Leu Ala Phe Ala
65 70 75 80
Glu Leu Gly His Ala Val Cys Arg Ala Ile Glu Glu Ala Pro Gln Pro
85 90 95
Tyr Ile Ala Ala Val Asn Gly Tyr Ala Leu Gly Gly Gly Cys Glu Ile
100 105 110
Ala Leu Ala Cys Asp Ile Arg Leu Ala Ser Glu Arg Ala Val Phe Ala
115 120 125
Gln Pro Glu Val Thr Leu Gly Ile Pro Pro Gly Trp Gly Gly Ser Gln
130 135 140
Arg Leu Pro Arg Val Val Pro Pro Gly Ile Ala Arg Glu Leu Leu Tyr
145 150 155 160
Thr Gly Arg Arg Val Asp Ala Gln Glu Ala Leu Arg Ile Gly Leu Val
165 170 175
Asn Ala Val Tyr Pro Ala Asp Gln Leu Leu Glu Arg Ala Arg Glu Leu
180 185 190
Ala Asn Arg Ile Ala Ala Asn Gly Pro Leu Ala Val Arg Leu Thr Lys
195 200 205
Ala Ala Val Arg Phe Gly Leu Glu Gln Gly Leu Glu Ala Gly Leu Thr
210 215 220
Tyr Glu Arg Gln Val Phe Ala Tyr Ala Phe Thr Thr Glu Asp Gln Arg
225 230 235 240
Glu Gly Met Arg Ala Phe Leu Glu Lys Arg Arg Pro Ala Phe Arg Gly
245 250 255
Arg
<210> 60
<211> 274
<212> PRT
<213> Methylobacterium extorquens
<400> 60
Met Asn Ala Asp Ala Glu Thr Ala Ser Thr Asp Glu Leu Leu Phe Ala
1 5 10 15
Val Asp Ala Ala Gly Ile Ala Arg Ile Thr Leu Asn Arg Pro Lys Ala
20 25 30
Arg Asn Ala Leu Thr Phe Ala Met Tyr Arg Gly Leu Val Glu Leu Cys
35 40 45
Glu Arg Ile Glu Ala Asp His Ala Ile Lys Ala Val Ile Ile Thr Gly
50 55 60
Ala Gly Asp Lys Ala Phe Ala Ala Gly Thr Asp Ile Ala Gln Phe Arg
65 70 75 80
Ser Phe Ser Lys Pro Glu Asp Ala Ile Gly Tyr Glu Arg Phe Met Asp
85 90 95
Arg Val Leu Gly Gly Leu Glu Arg Leu Arg Val Pro Thr Ile Ala Ala
100 105 110
Val Ala Gly Ala Cys Thr Gly Gly Gly Ala Ala Ile Ala Ala Ala Cys
115 120 125
Asp Met Arg Ile Ala Ser Arg Asp Ala Arg Phe Gly Ile Pro Ile Ala
130 135 140
Arg Thr Leu Gly Asn Cys Leu Ser Gln Asn Thr Leu Arg Arg Leu Ala
145 150 155 160
Asn Leu Ile Gly Ala Pro Arg Val Lys Asp Ile Leu Phe Thr Ala Arg
165 170 175
Leu Val Glu Ala Gln Glu Ala Leu Ala Ile Gly Leu Val Asn Glu Val
180 185 190
Val Glu Asp Ala Ala Ala Val Ala Ala Arg Ala Asp Ala Leu Ala Thr
195 200 205
Leu Leu Ala Ser His Ala Pro Leu Thr Leu Gln Ala Thr Lys Glu Gly
210 215 220
Leu Arg Arg Ile Gly Glu Glu Gly Ala Ala Glu Ala Ala Glu Gly Glu
225 230 235 240
Arg Pro Gly Asp Asp Leu Ile Val Met Thr Tyr Met Ser Ala Asp Phe
245 250 255
Arg Glu Gly Met Glu Ala Phe Leu Gly Lys Arg Pro Pro Asn Phe Lys
260 265 270
Gly Arg
<210> 61
<211> 407
<212> PRT
<213> Clostridium sporogenes
<400> 61
Met Ser Asp Arg Asn Lys Glu Val Lys Glu Lys Lys Ala Lys His Tyr
1 5 10 15
Leu Arg Glu Ile Thr Ala Lys His Tyr Lys Glu Ala Leu Glu Ala Lys
20 25 30
Glu Arg Gly Glu Lys Val Gly Trp Cys Ala Ser Asn Phe Pro Gln Glu
35 40 45
Ile Ala Thr Thr Leu Gly Val Lys Val Val Tyr Pro Glu Asn His Ala
50 55 60
Ala Ala Val Ala Ala Arg Gly Asn Gly Gln Asn Met Cys Glu His Ala
65 70 75 80
Glu Ala Met Gly Phe Ser Asn Asp Val Cys Gly Tyr Ala Arg Val Asn
85 90 95
Leu Ala Val Met Asp Ile Gly His Ser Glu Asp Gln Pro Ile Pro Met
100 105 110
Pro Asp Phe Val Leu Cys Cys Asn Asn Ile Cys Asn Gln Met Ile Lys
115 120 125
Trp Tyr Glu His Ile Ala Lys Thr Leu Asp Ile Pro Met Ile Leu Ile
130 135 140
Asp Ile Pro Tyr Asn Thr Glu Asn Thr Val Ser Gln Asp Arg Ile Lys
145 150 155 160
Tyr Ile Arg Ala Gln Phe Asp Asp Ala Ile Lys Gln Leu Glu Glu Ile
165 170 175
Thr Gly Lys Lys Trp Asp Glu Asn Lys Phe Glu Glu Val Met Lys Ile
180 185 190
Ser Gln Glu Ser Ala Lys Gln Trp Leu Arg Ala Ala Ser Tyr Ala Lys
195 200 205
Tyr Lys Pro Ser Pro Phe Ser Gly Phe Asp Leu Phe Asn His Met Ala
210 215 220
Val Ala Val Cys Ala Arg Gly Thr Gln Glu Ala Ala Asp Ala Phe Lys
225 230 235 240
Met Leu Ala Asp Glu Tyr Glu Glu Asn Val Lys Thr Gly Lys Ser Thr
245 250 255
Tyr Arg Gly Glu Glu Lys Gln Arg Ile Leu Phe Glu Gly Ile Ala Cys
260 265 270
Trp Pro Tyr Leu Arg His Lys Leu Thr Lys Leu Ser Glu Tyr Gly Met
275 280 285
Asn Val Thr Ala Thr Val Tyr Ala Glu Ala Phe Gly Val Ile Tyr Glu
290 295 300
Asn Met Asp Glu Leu Met Ala Ala Tyr Asn Lys Val Pro Asn Ser Ile
305 310 315 320
Ser Phe Glu Asn Ala Leu Lys Met Arg Leu Asn Ala Val Thr Ser Thr
325 330 335
Asn Thr Glu Gly Ala Val Ile His Ile Asn Arg Ser Cys Lys Leu Trp
340 345 350
Ser Gly Phe Leu Tyr Glu Leu Ala Arg Arg Leu Glu Lys Glu Thr Gly
355 360 365
Ile Pro Val Val Ser Phe Asp Gly Asp Gln Ala Asp Pro Arg Asn Phe
370 375 380
Ser Glu Ala Gln Tyr Asp Thr Arg Ile Gln Gly Leu Asn Glu Val Met
385 390 395 400
Val Ala Lys Lys Glu Ala Glu
405
<210> 62
<211> 374
<212> PRT
<213> Clostridium sporogenes
<400> 62
Met Ser Asn Ser Asp Lys Phe Phe Asn Asp Phe Lys Asp Ile Val Glu
1 5 10 15
Asn Pro Lys Lys Tyr Ile Met Lys His Met Glu Gln Thr Gly Gln Lys
20 25 30
Ala Ile Gly Cys Met Pro Leu Tyr Thr Pro Glu Glu Leu Val Leu Ala
35 40 45
Ala Gly Met Phe Pro Val Gly Val Trp Gly Ser Asn Thr Glu Leu Ser
50 55 60
Lys Ala Lys Thr Tyr Phe Pro Ala Phe Ile Cys Ser Ile Leu Gln Thr
65 70 75 80
Thr Leu Glu Asn Ala Leu Asn Gly Glu Tyr Asp Met Leu Ser Gly Met
85 90 95
Met Ile Thr Asn Tyr Cys Asp Ser Leu Lys Cys Met Gly Gln Asn Phe
100 105 110
Lys Leu Thr Val Glu Asn Ile Glu Phe Ile Pro Val Thr Val Pro Gln
115 120 125
Asn Arg Lys Met Glu Ala Gly Lys Glu Phe Leu Lys Ser Gln Tyr Lys
130 135 140
Met Asn Ile Glu Gln Leu Glu Lys Ile Ser Gly Asn Lys Ile Thr Asp
145 150 155 160
Glu Ser Leu Glu Lys Ala Ile Glu Ile Tyr Asp Glu His Arg Lys Val
165 170 175
Met Asn Asp Phe Ser Met Leu Ala Ser Lys Tyr Pro Gly Ile Ile Thr
180 185 190
Pro Thr Lys Arg Asn Tyr Val Met Lys Ser Ala Tyr Tyr Met Asp Lys
195 200 205
Lys Glu His Thr Glu Lys Val Arg Gln Leu Met Asp Glu Ile Lys Ala
210 215 220
Ile Glu Pro Lys Pro Phe Glu Gly Lys Arg Val Ile Thr Thr Gly Ile
225 230 235 240
Ile Ala Asp Ser Glu Asp Leu Leu Lys Ile Leu Glu Glu Asn Asn Ile
245 250 255
Ala Ile Val Gly Asp Asp Ile Ala His Glu Ser Arg Gln Tyr Arg Thr
260 265 270
Leu Thr Pro Glu Ala Asn Thr Pro Met Asp Arg Leu Ala Glu Gln Phe
275 280 285
Ala Asn Arg Glu Cys Ser Thr Leu Tyr Asp Pro Glu Lys Lys Arg Gly
290 295 300
Gln Tyr Ile Val Glu Met Ala Lys Glu Arg Lys Ala Asp Gly Ile Ile
305 310 315 320
Phe Phe Met Thr Lys Phe Cys Asp Pro Glu Glu Tyr Asp Tyr Pro Gln
325 330 335
Met Lys Lys Asp Phe Glu Glu Ala Gly Ile Pro His Val Leu Ile Glu
340 345 350
Thr Asp Met Gln Met Lys Asn Tyr Glu Gln Ala Arg Thr Ala Ile Gln
355 360 365
Ala Phe Ser Glu Thr Leu
370
<210> 63
<211> 264
<212> PRT
<213> Clostridium sporogenes
<400> 63
Met Ala Asp Ile Tyr Thr Met Gly Val Asp Ile Gly Ser Thr Ala Ser
1 5 10 15
Lys Thr Val Val Leu Lys Asn Gly Lys Glu Ile Val Ser Gln Ala Val
20 25 30
Ile Ser Val Gly Ala Gly Thr Ser Gly Pro Lys Arg Ala Ile Asp Ser
35 40 45
Val Leu Lys Asp Ala Lys Leu Ser Ile Glu Asp Leu Asp Tyr Ile Val
50 55 60
Ser Thr Gly Tyr Gly Arg Asn Ser Phe Asp Phe Ala Asn Lys Gln Ile
65 70 75 80
Ser Glu Leu Ser Cys His Ala Lys Gly Val Tyr Phe Asp Asn Asn Lys
85 90 95
Ala Arg Thr Val Ile Asp Ile Gly Gly Gln Asp Ile Lys Val Leu Lys
100 105 110
Leu Ala Asp Ser Gly Arg Leu Leu Asn Phe Ile Met Asn Asp Lys Cys
115 120 125
Ala Ala Gly Thr Gly Arg Phe Leu Asp Val Met Ser Arg Val Ile Glu
130 135 140
Val Pro Val Asp Glu Leu Gly Lys Lys Ala Leu Glu Ser Lys Asn Pro
145 150 155 160
Cys Thr Ile Ser Ser Thr Cys Thr Val Phe Ala Glu Ser Glu Val Ile
165 170 175
Ser Gln Leu Ala Arg Gly Val Lys Thr Glu Asp Leu Ile Ala Gly Ile
180 185 190
Cys Lys Ser Val Ala Ser Arg Val Ala Ser Leu Ala Lys Arg Ser Gly
195 200 205
Ile Glu Glu Leu Val Val Met Ser Gly Gly Val Ala Lys Asn Ile Gly
210 215 220
Val Val Lys Ala Met Glu Ala Glu Leu Gly Arg Asp Ile Tyr Ile Ser
225 230 235 240
Lys Asn Ser Gln Leu Asn Gly Ala Leu Gly Ala Ser Leu Tyr Ala Tyr
245 250 255
Glu Ser Phe Gln Lys Glu Arg Ser
260
<210> 64
<211> 412
<212> PRT
<213> Clostridium sporogenes
<400> 64
Met Glu Asn Asn Thr Asn Met Phe Ser Gly Val Lys Val Ile Glu Leu
1 5 10 15
Ala Asn Phe Ile Ala Ala Pro Ala Ala Gly Arg Phe Phe Ala Asp Gly
20 25 30
Gly Ala Glu Val Ile Lys Ile Glu Ser Pro Ala Gly Asp Pro Leu Arg
35 40 45
Tyr Thr Ala Pro Ser Glu Gly Arg Pro Leu Ser Gln Glu Glu Asn Thr
50 55 60
Thr Tyr Asp Leu Glu Asn Ala Asn Lys Lys Ala Ile Val Leu Asn Leu
65 70 75 80
Lys Ser Glu Lys Gly Lys Lys Ile Leu His Glu Met Leu Ala Glu Ala
85 90 95
Asp Ile Leu Leu Thr Asn Trp Arg Thr Lys Ala Leu Val Lys Gln Gly
100 105 110
Leu Asp Tyr Glu Thr Leu Lys Glu Lys Tyr Pro Lys Leu Val Phe Ala
115 120 125
Gln Ile Thr Gly Tyr Gly Glu Lys Gly Pro Asp Lys Asp Leu Pro Gly
130 135 140
Phe Asp Tyr Thr Ala Phe Phe Ala Arg Gly Gly Val Ser Gly Thr Leu
145 150 155 160
Tyr Glu Lys Gly Thr Val Pro Pro Asn Val Val Pro Gly Leu Gly Asp
165 170 175
His Gln Ala Gly Met Phe Leu Ala Ala Gly Met Ala Gly Ala Leu Tyr
180 185 190
Lys Ala Lys Thr Thr Gly Gln Gly Asp Lys Val Thr Val Ser Leu Met
195 200 205
His Ser Ala Met Tyr Gly Leu Gly Ile Met Ile Gln Ala Ala Gln Tyr
210 215 220
Lys Asp His Gly Leu Val Tyr Pro Ile Asn Arg Asn Glu Thr Pro Asn
225 230 235 240
Pro Phe Ile Val Ser Tyr Lys Ser Lys Asp Asp Tyr Phe Val Gln Val
245 250 255
Cys Met Pro Pro Tyr Asp Val Phe Tyr Asp Arg Phe Met Thr Ala Leu
260 265 270
Gly Arg Glu Asp Leu Val Gly Asp Glu Arg Tyr Asn Lys Ile Glu Asn
275 280 285
Leu Lys Asp Gly Arg Ala Lys Glu Val Tyr Ser Ile Ile Glu Gln Gln
290 295 300
Met Val Thr Lys Thr Lys Asp Glu Trp Asp Asn Ile Phe Arg Asp Ala
305 310 315 320
Asp Ile Pro Phe Ala Ile Ala Gln Thr Trp Glu Asp Leu Leu Glu Asp
325 330 335
Glu Gln Ala Trp Ala Asn Asp Tyr Leu Tyr Lys Met Lys Tyr Pro Thr
340 345 350
Gly Asn Glu Arg Ala Leu Val Arg Leu Pro Val Phe Phe Lys Glu Ala
355 360 365
Gly Leu Pro Glu Tyr Asn Gln Ser Pro Gln Ile Ala Glu Asn Thr Val
370 375 380
Glu Val Leu Lys Glu Met Gly Tyr Thr Glu Gln Glu Ile Glu Glu Leu
385 390 395 400
Glu Lys Asp Lys Asp Ile Met Val Arg Lys Glu Lys
405 410
<210> 65
<211> 368
<212> PRT
<213> Lachnoanaerobaculum saburreum
<400> 65
Met Trp His Cys Leu Glu Thr Leu Lys Lys Ile Ser Ala Ser Pro Lys
1 5 10 15
Glu Gln Leu Asn Lys Tyr Leu Glu Glu Gly Lys Lys Val Ile Ala Val
20 25 30
Ala Pro Val Tyr Thr Pro Glu Glu Ile Ile His Ala Phe Gly Phe Val
35 40 45
Pro Met Gly Val Trp Gly Ala Asp Ile Glu Ile Asn Glu Ser Lys Lys
50 55 60
Tyr Tyr Pro Ala Phe Ile Cys Ser Ile Met Gln Thr Val Leu Glu Leu
65 70 75 80
Gly Ile Lys Gly Asn Tyr Asn Gly Val Ser Ala Ile Val Val Pro Ser
85 90 95
Leu Cys Asp Ser Leu Lys Thr Leu Gly Gln Asn Trp Lys Tyr Ala Val
100 105 110
Lys Asp Ile Pro Phe Ile Pro Met Thr Tyr Pro Gln Asn Arg Lys Ser
115 120 125
Asp Tyr Ala Val Asp Phe Thr Leu Glu Met Tyr Lys Arg Val Ile Ser
130 135 140
Asp Leu Glu Asn Ile Thr Gly Glu Lys Phe Asp Glu Gly Lys Leu Lys
145 150 155 160
Asn Thr Tyr Glu Ile Tyr Asn Glu His Asn Arg Val Met Arg Glu Phe
165 170 175
Thr Lys Val Ser Glu Glu Tyr Glu Val Ser Ala Thr Asp Arg Ser Ala
180 185 190
Val Phe Lys Ser Ala Trp Phe Met Leu Lys Glu Glu His Thr Glu Leu
195 200 205
Val Arg Glu Leu Ile Glu Leu Ile Lys Lys Glu Gly Lys Ile Ser Lys
210 215 220
Lys Leu Arg Ile Tyr Thr Thr Gly Ile Leu Ala Asp Ala Pro Asp Leu
225 230 235 240
Leu Asn Ile Phe Asp Ser Asn Asn Met Gln Ile Val Gly Asp Asp Ile
245 250 255
Ala Tyr Glu Ser Arg Gln Tyr Arg Thr Asp Ile Pro Asp Gly Asn Gly
260 265 270
Leu Tyr Ala Leu Ala Lys Lys Phe Ser Asn Met Asp Asn Cys Thr Leu
275 280 285
Leu Tyr Asp Lys Asp Lys Arg Arg Val Asp Phe Ile Ile Glu Glu Ala
290 295 300
Lys Lys Lys Arg Ala Asp Gly Ile Val Val Leu Met Thr Lys Phe Cys
305 310 315 320
Asp Pro Glu Glu Phe Asp Tyr Val Pro Ile Lys Arg Ala Ala Asn Glu
325 330 335
Ala Gly Ile Pro His Ile Asn Ile Glu Val Asp Arg Gln Met Lys Asn
340 345 350
Tyr Gln Gln Ala Asn Thr Met Leu Gln Thr Phe Ala Asp Met Leu Val
355 360 365
<210> 66
<211> 409
<212> PRT
<213> Lachnoanaerobaculum saburreum
<400> 66
Met Glu Glu Ala Lys Lys Gln Lys Pro Thr Val Asp Pro Asn Ser Ala
1 5 10 15
Lys Ala Arg Leu Gly Arg Ile Ala Ala Lys Ala Tyr Ser Asp Cys Val
20 25 30
Glu Ala Lys Lys Arg Gly Glu Leu Val Gly Trp Cys Ala Ser Asn Phe
35 40 45
Pro Val Glu Ile Pro Glu Thr Leu Gly Leu Tyr Val Cys Tyr Pro Glu
50 55 60
Asn Gln Ala Ala Gly Ile Ala Ala Arg Gly Gly Gly Glu Arg Met Cys
65 70 75 80
Ser Glu Ser Glu Gly Asp Gly Tyr Ser Asn Asp Ile Cys Ala Tyr Ala
85 90 95
Arg Ile Ser Leu Ala Tyr Met Lys Leu Lys Glu Ala Pro Glu Gln Asp
100 105 110
Met Pro Gln Pro Asp Phe Val Leu Cys Cys Asn Asn Ile Cys Asn Cys
115 120 125
Met Ile Lys Trp Tyr Glu Asn Ile Ala Lys Glu Leu Asn Ile Pro Met
130 135 140
Ile Met Ile Asp Ile Pro Phe Asn Pro Asp Tyr Glu Val Ser Asp Ala
145 150 155 160
Met Thr Ala Tyr Ile Arg Asn Gln Phe Trp Asp Ala Ile His Gln Leu
165 170 175
Glu Glu Ile Thr Gly Lys Lys Trp Ser Asn Glu Arg Tyr Glu Glu Val
180 185 190
Arg Lys Ile Ser Gly Arg Ser Ser Arg Ala Trp Leu Glu Ala Thr Ala
195 200 205
Thr Ala Lys Tyr Ser Pro Ser Pro Phe Asn Gly Phe Asp Leu Leu Asn
210 215 220
His Met Ala Val Met Val Thr Ala Arg Gly Lys Leu Glu Ala Ala Glu
225 230 235 240
Ala Met Glu Thr Leu Leu Gln Glu Tyr Lys Asp Asn His Glu Lys Gly
245 250 255
Glu Ser Thr Phe Lys Gly Glu Glu Lys Tyr Arg Ile Met Phe Glu Gly
260 265 270
Ile Ala Cys Trp Pro Trp Leu Arg Ala Thr Ala Thr Gly Leu Lys Ser
275 280 285
Arg Gly Ile Asn Met Val Thr Thr Ile Tyr Ala Asp Ala Phe Gly Phe
290 295 300
Ile Tyr Asp Asp Phe Asp Gly Met Cys Arg Ala Tyr Ala Asn Val Pro
305 310 315 320
Asn Cys Met Asn Ile Glu His Ala Arg Asp Lys Arg Ile Lys Leu Cys
325 330 335
Lys Asp Asn Ser Val Glu Gly Leu Leu Val His Thr Asn Arg Ser Cys
340 345 350
Lys Leu Trp Ser Gly Phe Met Ser Glu Met Ser Arg Gln Ile Gly Glu
355 360 365
Glu Cys Gly Ile Pro Val Val Ser Phe Asp Gly Asp Gln Ala Asp Pro
370 375 380
Arg Asn Phe Ser Glu Ala Gln Tyr Asp Thr Arg Val Gln Gly Leu Thr
385 390 395 400
Glu Ile Met Glu Ala Asn Lys Glu Ile
405
<210> 67
<211> 256
<212> PRT
<213> Lachnoanaerobaculum saburreum
<400> 67
Met Tyr Thr Leu Gly Val Asp Ile Gly Ser Thr Thr Ser Lys Ala Val
1 5 10 15
Ile Leu Glu Asp Gly Glu Asn Ile Val Ala Ser Ser Ile Val Ile Ala
20 25 30
Thr Val Gly Thr Ala Gly Val Glu Glu Ala Val Lys Asn Val Leu Asn
35 40 45
Phe Ser Lys Leu Glu Leu Asn Asp Ile Lys Ala Val Val Ala Thr Gly
50 55 60
Tyr Gly Arg Met Asn Tyr Asp Val Ala Asp Tyr Lys Val Ser Glu Leu
65 70 75 80
Thr Cys His Ala Leu Gly Val His Lys Glu Phe Pro Asn Val Arg Thr
85 90 95
Val Ile Asp Ile Gly Gly Gln Asp Ala Lys Val Ile Ser Leu Ala Ala
100 105 110
Asn Gly Lys Met Thr Asn Phe Val Met Asn Asp Lys Cys Ala Ala Gly
115 120 125
Thr Gly Arg Phe Leu Asp Val Met Ala Asn Ile Leu Asn Leu Asp Ile
130 135 140
Gln Asp Leu Glu Val Glu Ala Leu Lys Ser Asp Asn Pro Ala Asn Ile
145 150 155 160
Ser Ser Thr Cys Thr Val Phe Ala Glu Ser Glu Val Ile Ser Gln Leu
165 170 175
Ala Thr Gly Arg Asn Ile Pro Asp Leu Val Ala Gly Ile Cys Lys Ser
180 185 190
Val Ala Val Arg Val Ala Ala Leu Ala Lys Arg Val Gly Ile Val Glu
195 200 205
Glu Val Cys Met Ser Gly Gly Val Ala Lys Asn Ser Gly Val Arg Asn
210 215 220
Ala Met Ser Lys Glu Leu Gly Val Asp Ile Val Phe Ser Lys Asp Ala
225 230 235 240
Gln Leu Met Gly Ala Leu Gly Ala Ala Ile Tyr Gly Phe Lys Lys Leu
245 250 255
<210> 68
<211> 264
<212> PRT
<213> Peptostreptococcus stomatis
<400> 68
Met Ser Ser Val Tyr Thr Met Gly Ile Asp Ile Gly Ser Thr Ser Ser
1 5 10 15
Lys Cys Val Ile Met Lys Asp Gly Lys Glu Ile Val Ser Glu Gly Val
20 25 30
Val Ser Leu Gly Ala Gly Thr Lys Gly Ser Asp Leu Val Ile Glu Glu
35 40 45
Val Leu Gly Lys Ala Gly Met Thr Phe Asp Glu Ile Asp Leu Ile Val
50 55 60
Ser Thr Gly Tyr Gly Arg Asn Ser Tyr Glu Arg Ala Ala Lys Thr Val
65 70 75 80
Ser Glu Leu Ser Cys His Ala Lys Gly Gly Gly Tyr Ile Phe Gly Gly
85 90 95
Ala Gly Thr Ile Ile Asp Ile Gly Gly Gln Asp Ile Lys Val Leu Lys
100 105 110
Leu Asn Asp Lys Gly Gly Leu Val Asn Phe Leu Met Asn Asp Lys Cys
115 120 125
Ala Ala Gly Thr Gly Arg Phe Leu Glu Val Met Ser Gly Val Leu Asp
130 135 140
Val Lys Leu Asp Glu Leu Gly Glu Leu Asp Ala Lys Ala Thr Glu Val
145 150 155 160
Thr Pro Ile Ser Ser Thr Cys Thr Val Phe Ala Glu Ser Glu Val Ile
165 170 175
Ser Cys Met Ala Lys Lys Ile Pro Leu Glu Asn Ile Ile Arg Gly Ile
180 185 190
His Ala Ser Val Ala Thr Arg Val Ala Ser Leu Ala Arg Arg Gly Gly
195 200 205
Leu Lys Thr Pro Val Ala Met Thr Gly Gly Val Ser Lys Asn Lys Gly
210 215 220
Ile Val Arg Ala Leu Lys Glu Glu Leu Glu Cys Asp Ile Leu Ile Ser
225 230 235 240
Pro Asp Ser Gln Met Ala Gly Ala Ile Gly Ala Ala Leu Tyr Ala Tyr
245 250 255
Asp Glu Tyr Gln Lys Gln Asn Ala
260
<210> 69
<211> 372
<212> PRT
<213> Peptostreptococcus stomatis
<400> 69
Met Ser Asn Ile Asp Val Leu Leu Gly Lys Leu Asp Val Ser Leu Leu
1 5 10 15
Gly Gln Val Asp Lys Tyr Val Ser Glu Gly Lys Lys Val Ile Gly Cys
20 25 30
Ala Pro Val Tyr Thr Pro Glu Glu Leu Val Tyr Ala Ala Gly Met Val
35 40 45
Pro Ile Gly Val Trp Gly Ala Glu Gly Glu Val Gly Leu Ser Lys Glu
50 55 60
Tyr Phe Pro Ala Phe Tyr Ala Ala Ile Ile Leu Arg Leu Met Asp Leu
65 70 75 80
Gly Leu Glu Gly Lys Leu Asp Lys Met Ser Gly Met Ile Ile Pro Gly
85 90 95
Leu Ser Asp Gly Leu Lys Gly Leu Ser Gln Asn Trp Lys Arg Ala Ile
100 105 110
Lys Gln Val Pro Ala Leu Tyr Ile Gly Tyr Gly Gln Asn Arg Lys Ile
115 120 125
Glu Ala Gly Ile Thr Tyr Asn Glu Lys Gln Tyr Ile Lys Leu Arg Gly
130 135 140
Gln Leu Glu Glu Ile Ala Gly Cys Lys Ile Glu Asp Ala Lys Val Glu
145 150 155 160
Glu Ala Ile Val Leu Tyr Asn Lys His Arg Lys Ala Met Gln Glu Phe
165 170 175
Ser Ser Leu Ala Ala Ser His Leu Asn Thr Ile Thr Pro Ile Leu Arg
180 185 190
Ala Arg Val Met Thr Ser Ala Phe Leu Phe Asp Lys Ala Glu His Leu
195 200 205
Ala Ile Leu Glu Glu Leu Asn Lys Glu Leu Lys Ala Leu Pro Glu Glu
210 215 220
Lys Phe Ala Gly Lys Lys Val Val Thr Thr Gly Ile Leu Ala Asn Ser
225 230 235 240
Pro Gly Met Leu Glu Ile Leu Asp Glu Tyr Lys Leu Gly Ile Val Asp
245 250 255
Asp Asn Ile Asn His Glu Ser Gly Gln Phe Asp Tyr Leu Val Asp Glu
260 265 270
Gly Thr Gly Asn Pro Val Arg Ala Leu Ser Lys Trp Ile Ser Asp Ile
275 280 285
Glu Gly Ser Thr Leu Leu Tyr Asp Pro Glu Lys Leu Arg Gly Gln Ile
290 295 300
Ile Ile Asp Lys Val Lys Lys His Gln Ala Asp Gly Val Ile Tyr Leu
305 310 315 320
Met Thr Lys Phe Ser Asp Ser Asp Glu Phe Asp Tyr Pro Ile Ile Arg
325 330 335
Lys Glu Leu Glu Asn Ala Gly Ile Leu His Ile Leu Val Glu Val Asp
340 345 350
Gln Gln Met Thr Asn Phe Glu Gln Ala Lys Thr Ala Leu Gln Thr Phe
355 360 365
Ala Asp Met Ile
370
<210> 70
<211> 411
<212> PRT
<213> Peptostreptococcus stomatis
<400> 70
Met Ser Asn Thr Gly Met Val Glu Glu Lys Pro Ala Lys Val Leu Leu
1 5 10 15
Gly Glu Ile Val Ala Lys His Tyr Lys Glu Ala Trp Glu Ala Lys Asn
20 25 30
Asn Gly Glu Leu Val Gly Trp Cys Ala Ser Asn Phe Pro Gln Glu Ile
35 40 45
Phe Glu Thr Met Asp Ile Lys Val Val Tyr Pro Glu Asn Gln Ala Ala
50 55 60
Ala Ile Ser Ala Lys Gly Gly Gly Gln Arg Met Cys Glu Ile Ala Glu
65 70 75 80
Asn Glu Gly Tyr Ser Asn Asp Ile Cys Ala Tyr Ala Arg Ile Ser Leu
85 90 95
Ala Tyr Met Asp Val Lys Asp Ala Pro Glu Leu Asn Met Pro Gln Pro
100 105 110
Asp Phe Val Ala Cys Cys Asn Asn Ile Cys Asn Cys Met Ile Lys Trp
115 120 125
Tyr Glu Asn Ile Ala Lys Glu Leu Asn Ile Pro Leu Ile Leu Ile Asp
130 135 140
Val Pro Tyr Asn Asn Asp Tyr Glu Ala Glu Asp Asp Arg Val Glu Tyr
145 150 155 160
Leu Arg Gly Gln Phe Asp Tyr Ala Ile Lys Gln Leu Glu Glu Leu Thr
165 170 175
Gly Lys Lys Trp Asp Glu Lys Lys Phe Glu Glu Val Met Glu Val Ser
180 185 190
Gln Arg Thr Gly Arg Ala Trp Leu Lys Ala Thr Gly Tyr Ala Lys Tyr
195 200 205
Thr Pro Ser Pro Phe Ser Gly Phe Asp Val Phe Asn His Met Ala Val
210 215 220
Ala Val Cys Ala Arg Gly Lys Ile Glu Ser Ala Ile Ala Phe Glu Lys
225 230 235 240
Leu Ala Glu Glu Phe Asp Glu Asn Val Arg Thr Gly Lys Ser Thr Phe
245 250 255
Lys Gly Glu Glu Lys Phe Arg Val Leu Phe Glu Gly Ile Ala Cys Trp
260 265 270
Pro His Leu Arg His Thr Phe Lys Gln Leu Lys Asp Ala Gly Val Asn
275 280 285
Val Cys Gly Thr Val Tyr Ala Asp Ala Phe Gly Tyr Ile Tyr Asp Asn
290 295 300
Thr Tyr Gln Leu Met Gln Ala Tyr Cys Gly Thr Pro Asn Ala Ile Ser
305 310 315 320
Tyr Glu Arg Ala Thr Asp Met Arg Leu Lys Val Ile Glu Glu Asn Asn
325 330 335
Ile Asp Gly Met Leu Ile His Ile Asn Arg Ser Cys Lys Gln Trp Ser
340 345 350
Gly Ile Met Tyr Glu Met Glu Arg Asp Ile Arg Glu Lys Thr Gly Ile
355 360 365
Pro Thr Ala Thr Phe Asp Gly Asp Gln Ala Asp Pro Arg Asn Phe Ser
370 375 380
Glu Ala Gln Tyr Asp Thr Arg Val Gln Gly Leu Ile Glu Leu Met Glu
385 390 395 400
Ala Asn Lys Ala Ala Lys Met Lys Glu Ala His
405 410
<210> 71
<211> 408
<212> PRT
<213> Clostridium difficile
<400> 71
Met Ser Glu Lys Lys Glu Ala Arg Val Val Ile Asn Asp Leu Leu Ala
1 5 10 15
Glu Gln Tyr Ala Asn Ala Phe Lys Ala Lys Glu Glu Gly Arg Pro Val
20 25 30
Gly Trp Ser Thr Ser Val Phe Pro Gln Glu Leu Ala Glu Val Phe Asp
35 40 45
Leu Asn Val Leu Tyr Pro Glu Asn Gln Ala Ala Gly Val Ala Ala Lys
50 55 60
Lys Gly Ser Leu Glu Leu Cys Glu Ile Ala Glu Ser Lys Gly Tyr Ser
65 70 75 80
Ile Asp Leu Cys Ala Tyr Ala Arg Thr Asn Phe Gly Leu Leu Glu Asn
85 90 95
Gly Gly Cys Glu Ala Leu Asp Met Pro Ala Pro Asp Phe Leu Leu Cys
100 105 110
Cys Asn Asn Ile Cys Asn Gln Val Ile Lys Trp Tyr Glu Asn Ile Ser
115 120 125
Arg Glu Leu Asp Ile Pro Leu Ile Met Ile Asp Thr Thr Phe Asn Asn
130 135 140
Glu Asp Glu Val Thr Gln Ser Arg Ile Asp Tyr Ile Lys Ala Gln Phe
145 150 155 160
Glu Glu Ala Ile Lys Gln Leu Glu Ile Ile Ser Gly Lys Lys Phe Asp
165 170 175
Pro Lys Lys Phe Glu Glu Val Met Lys Ile Ser Ala Glu Asn Gly Arg
180 185 190
Leu Trp Lys Tyr Ser Met Ser Leu Pro Ala Asp Ser Ser Pro Ser Pro
195 200 205
Met Asn Gly Phe Asp Leu Phe Thr Tyr Met Ala Val Ile Val Cys Ala
210 215 220
Arg Gly Lys Lys Glu Thr Thr Glu Ala Phe Lys Leu Leu Ile Glu Glu
225 230 235 240
Leu Glu Asp Asn Met Lys Thr Gly Lys Ser Ser Phe Arg Gly Glu Glu
245 250 255
Lys Tyr Arg Ile Met Met Glu Gly Ile Pro Cys Trp Pro Tyr Ile Gly
260 265 270
Tyr Lys Met Lys Thr Leu Ala Lys Phe Gly Val Asn Met Thr Gly Ser
275 280 285
Val Tyr Pro His Ala Trp Ala Leu Gln Tyr Glu Val Asn Asp Leu Asp
290 295 300
Gly Met Ala Val Ala Tyr Ser Thr Met Phe Asn Asn Val Asn Leu Asp
305 310 315 320
Arg Met Thr Lys Tyr Arg Val Asp Ser Leu Val Glu Gly Lys Cys Asp
325 330 335
Gly Ala Phe Tyr His Met Asn Arg Ser Cys Lys Leu Met Ser Leu Ile
340 345 350
Gln Tyr Glu Met Gln Arg Arg Ala Ala Glu Glu Thr Gly Leu Pro Tyr
355 360 365
Ala Gly Phe Asp Gly Asp Gln Ala Asp Pro Arg Ala Phe Thr Asn Ala
370 375 380
Gln Phe Glu Thr Arg Ile Gln Gly Leu Val Glu Val Met Glu Glu Arg
385 390 395 400
Lys Lys Leu Asn Arg Gly Glu Ile
405
<210> 72
<211> 375
<212> PRT
<213> Clostridium difficile
<400> 72
Met Glu Ala Ile Leu Ser Lys Met Lys Glu Val Val Glu Asn Pro Asn
1 5 10 15
Ala Ala Val Lys Lys Tyr Lys Ser Glu Thr Gly Lys Lys Ala Ile Gly
20 25 30
Cys Phe Pro Val Tyr Cys Pro Glu Glu Ile Ile His Ala Ala Gly Met
35 40 45
Leu Pro Val Gly Ile Trp Gly Gly Gln Thr Glu Leu Asp Leu Ala Lys
50 55 60
Gln Tyr Phe Pro Ala Phe Ala Cys Ser Ile Met Gln Ser Cys Leu Glu
65 70 75 80
Tyr Gly Leu Lys Gly Ala Tyr Asp Glu Leu Ser Gly Val Ile Ile Pro
85 90 95
Gly Met Cys Asp Thr Leu Ile Cys Leu Gly Gln Asn Trp Lys Ser Ala
100 105 110
Val Pro His Ile Lys Tyr Ile Ser Leu Val His Pro Gln Asn Arg Lys
115 120 125
Leu Glu Ala Gly Val Lys Tyr Leu Ile Ser Glu Tyr Lys Gly Val Lys
130 135 140
Arg Glu Leu Glu Glu Ile Cys Gly Tyr Glu Ile Glu Glu Ala Lys Ile
145 150 155 160
His Glu Ser Ile Glu Val Tyr Asn Glu His Arg Lys Thr Met Arg Asp
165 170 175
Phe Val Glu Val Ala Tyr Lys His Ser Asn Thr Ile Lys Pro Ser Ile
180 185 190
Arg Ser Leu Val Ile Lys Ser Gly Phe Phe Met Arg Lys Glu Glu His
195 200 205
Thr Glu Leu Val Lys Asp Leu Ile Ala Lys Leu Asn Ala Met Pro Glu
210 215 220
Glu Val Cys Ser Gly Lys Lys Val Leu Leu Thr Gly Ile Leu Ala Asp
225 230 235 240
Ser Lys Asp Ile Leu Asp Ile Leu Glu Asp Asn Asn Ile Ser Val Val
245 250 255
Ala Asp Asp Leu Ala Gln Glu Thr Arg Gln Phe Arg Thr Asp Val Pro
260 265 270
Ala Gly Asp Asp Ala Leu Glu Arg Leu Ala Arg Gln Trp Ser Asn Ile
275 280 285
Glu Gly Cys Ser Leu Ala Tyr Asp Pro Lys Lys Lys Arg Gly Ser Leu
290 295 300
Ile Val Asp Glu Val Lys Lys Lys Asp Ile Asp Gly Val Ile Phe Cys
305 310 315 320
Met Met Lys Phe Cys Asp Pro Glu Glu Tyr Asp Tyr Pro Leu Val Arg
325 330 335
Lys Asp Ile Glu Asp Ser Gly Ile Pro Thr Leu Tyr Val Glu Ile Asp
340 345 350
Gln Gln Thr Gln Asn Asn Glu Gln Ala Arg Thr Arg Ile Gln Thr Phe
355 360 365
Ala Glu Met Met Ser Leu Ala
370 375
<210> 73
<211> 266
<212> PRT
<213> Clostridium difficile
<400> 73
Met Tyr Thr Met Gly Leu Asp Ile Gly Ser Thr Ala Ser Lys Gly Val
1 5 10 15
Ile Leu Lys Asn Gly Glu Asp Ile Val Ala Ser Glu Thr Ile Ser Ser
20 25 30
Gly Thr Gly Thr Thr Gly Pro Ser Arg Val Leu Glu Lys Leu Tyr Gly
35 40 45
Lys Thr Gly Leu Ala Arg Glu Asp Ile Lys Lys Val Val Val Thr Gly
50 55 60
Tyr Gly Arg Met Asn Tyr Ser Asp Ala Asp Lys Gln Ile Ser Glu Leu
65 70 75 80
Ser Cys His Ala Arg Gly Val Asn Phe Ile Ile Pro Glu Thr Arg Thr
85 90 95
Ile Ile Asp Ile Gly Gly Gln Asp Ala Lys Val Leu Lys Leu Asp Asn
100 105 110
Asn Gly Arg Leu Leu Asn Phe Leu Met Asn Asp Lys Cys Ala Ala Gly
115 120 125
Thr Gly Arg Phe Leu Asp Val Met Ala Lys Ile Ile Glu Val Asp Val
130 135 140
Ser Glu Leu Gly Ser Ile Ser Met Asn Ser Gln Asn Glu Val Ser Ile
145 150 155 160
Ser Ser Thr Cys Thr Val Phe Ala Glu Ser Glu Val Ile Ser His Leu
165 170 175
Ser Glu Asn Ala Lys Ile Glu Asp Ile Val Ala Gly Ile His Thr Ser
180 185 190
Val Ala Lys Arg Val Ser Ser Leu Val Lys Arg Ile Gly Val Gln Arg
195 200 205
Asn Val Val Met Val Gly Gly Val Ala Arg Asn Ser Gly Ile Val Arg
210 215 220
Ala Met Ala Arg Glu Ile Asn Thr Glu Ile Ile Val Pro Asp Ile Pro
225 230 235 240
Gln Leu Thr Gly Ala Leu Gly Ala Ala Leu Tyr Ala Phe Asp Glu Ala
245 250 255
Lys Glu Ser Gln Lys Glu Val Lys Asn Ile
260 265
<210> 74
<211> 399
<212> PRT
<213> Clostridium difficile
<400> 74
Met Leu Leu Glu Gly Val Lys Val Val Glu Leu Ser Ser Phe Ile Ala
1 5 10 15
Ala Pro Cys Cys Ala Lys Met Leu Gly Asp Trp Gly Ala Glu Val Ile
20 25 30
Lys Ile Glu Pro Ile Glu Gly Asp Gly Ile Arg Val Met Gly Gly Thr
35 40 45
Phe Lys Ser Pro Ala Ser Asp Asp Glu Asn Pro Met Phe Glu Leu Glu
50 55 60
Asn Gly Asn Lys Lys Gly Val Ser Ile Asn Val Lys Ser Lys Glu Gly
65 70 75 80
Val Glu Ile Leu His Lys Leu Leu Ser Glu Ala Asp Ile Phe Val Thr
85 90 95
Asn Val Arg Val Gln Ala Leu Glu Lys Met Gly Ile Ala Tyr Asp Gln
100 105 110
Ile Lys Asp Lys Tyr Pro Gly Leu Ile Phe Ser Gln Ile Leu Gly Tyr
115 120 125
Gly Glu Lys Gly Pro Leu Lys Asp Lys Pro Gly Phe Asp Tyr Thr Ala
130 135 140
Tyr Phe Ala Arg Gly Gly Val Ser Gln Ser Val Met Glu Lys Gly Thr
145 150 155 160
Ser Pro Ala Asn Thr Ala Ala Gly Phe Gly Asp His Tyr Ala Gly Leu
165 170 175
Ala Leu Ala Ala Gly Ser Leu Ala Ala Leu His Lys Lys Ala Gln Thr
180 185 190
Gly Lys Gly Glu Arg Val Thr Val Ser Leu Phe His Thr Ala Ile Tyr
195 200 205
Gly Met Gly Thr Met Ile Thr Thr Ala Gln Tyr Gly Asn Glu Met Pro
210 215 220
Leu Ser Arg Glu Asn Pro Asn Ser Pro Leu Met Thr Thr Tyr Lys Cys
225 230 235 240
Lys Asp Gly Arg Trp Ile Gln Leu Ala Leu Ile Gln Tyr Asn Lys Trp
245 250 255
Leu Gly Lys Phe Cys Lys Val Ile Asn Arg Glu Tyr Ile Leu Glu Asp
260 265 270
Asp Arg Tyr Asn Asn Ile Asp Ser Met Val Asn His Val Glu Asp Leu
275 280 285
Val Lys Ile Val Gly Glu Ala Met Leu Glu Lys Thr Leu Asp Glu Trp
290 295 300
Ser Ala Leu Leu Glu Glu Ala Asp Leu Pro Phe Glu Lys Ile Gln Ser
305 310 315 320
Cys Glu Asp Leu Leu Asp Asp Glu Gln Ala Trp Ala Asn Asp Phe Leu
325 330 335
Phe Lys Lys Thr Tyr Asp Ser Gly Asn Thr Gly Val Leu Val Asn Thr
340 345 350
Pro Val Met Phe Arg Asn Glu Gly Ile Lys Glu Tyr Thr Pro Ala Pro
355 360 365
Lys Val Gly Gln His Thr Val Glu Val Leu Lys Ser Leu Gly Tyr Asp
370 375 380
Glu Glu Lys Ile Asn Asn Phe Lys Asp Ser Lys Val Val Arg Tyr
385 390 395
<210> 75
<211> 255
<212> PRT
<213> Escherichia coli (strain K12)
<400> 75
Met Ser Glu Leu Ile Val Ser Arg Gln Gln Arg Val Leu Leu Leu Thr
1 5 10 15
Leu Asn Arg Pro Ala Ala Arg Asn Ala Leu Asn Asn Ala Leu Leu Thr
20 25 30
Gln Leu Val Asn Glu Leu Glu Ala Ala Ala Ile Asp Thr Ser Ile Ser
35 40 45
Val Cys Val Ile Thr Gly Asn Ala Arg Phe Phe Ala Ala Gly Ala Asp
50 55 60
Leu Asn Glu Met Ala Glu Lys Asp Leu Ala Ala Thr Leu Asn Asp Thr
65 70 75 80
Arg Pro Gln Leu Trp Ala Arg Leu Gln Ala Phe Asn Lys Pro Leu Ile
85 90 95
Ala Ala Val Asn Gly Tyr Ala Leu Gly Ala Gly Cys Glu Leu Ala Leu
100 105 110
Leu Cys Asp Val Val Val Ala Gly Glu Asn Ala Arg Phe Gly Leu Pro
115 120 125
Glu Ile Thr Leu Gly Ile Met Pro Gly Ala Gly Gly Thr Gln Arg Leu
130 135 140
Ile Arg Ser Val Gly Lys Ser Leu Ala Ser Lys Met Val Leu Ser Gly
145 150 155 160
Glu Ser Ile Thr Ala Arg Gln Ala Gln Gln Ala Gly Leu Val Ser Asp
165 170 175
Val Phe Pro Ser Asp Leu Thr Leu Glu Tyr Ala Leu Gln Leu Ala Ser
180 185 190
Lys Met Ala Arg His Ser Pro Leu Ala Leu Gln Ala Ala Lys Gln Ala
195 200 205
Leu Arg Gln Ser Gln Glu Val Ala Leu Gln Ala Gly Leu Ala Gln Glu
210 215 220
Arg Gln Leu Phe Thr Leu Leu Ala Ala Thr Glu Asp Arg His Glu Gly
225 230 235 240
Ile Ser Ala Phe Leu Gln Lys Arg Ser Pro Asp Phe Lys Gly Arg
245 250 255
<210> 76
<211> 257
<212> PRT
<213> Rhodobacter capsulatus
<400> 76
Met Ser Tyr His Thr Ile Arg Tyr Glu Ile Ser Glu Gly Leu Ala Val
1 5 10 15
Ile Thr Leu Asp Arg Pro Glu Val Met Asn Ala Leu Asn Ala Ala Met
20 25 30
Arg His Glu Leu Thr Ala Ala Leu His Arg Ala Arg Gly Glu Ala Arg
35 40 45
Ala Ile Val Leu Thr Gly Ser Gly Arg Ala Phe Cys Ser Gly Gln Asp
50 55 60
Leu Gly Asp Gly Ala Ala Glu Gly Leu Asn Leu Glu Thr Val Leu Arg
65 70 75 80
Glu Glu Tyr Glu Pro Leu Leu Gln Ala Ile Tyr Ser Cys Pro Leu Pro
85 90 95
Val Leu Ala Ala Val Asn Gly Ala Ala Ala Gly Ala Gly Ala Asn Leu
100 105 110
Ala Leu Ala Ala Asp Val Val Ile Ala Ala Gln Ser Ala Ala Phe Met
115 120 125
Gln Ala Phe Thr Arg Ile Gly Leu Met Pro Asp Ala Gly Gly Thr Trp
130 135 140
Trp Leu Pro Arg Gln Val Gly Met Ala Arg Ala Met Gly Met Ala Leu
145 150 155 160
Phe Ala Glu Lys Ile Gly Ala Glu Glu Ala Ala Arg Met Gly Leu Ile
165 170 175
Trp Glu Ala Val Pro Asp Val Asp Phe Glu His His Trp Arg Ala Arg
180 185 190
Ala Ala His Leu Ala Arg Gly Pro Ser Ala Ala Phe Ala Ala Val Lys
195 200 205
Lys Ala Phe His Ala Gly Leu Ser Asn Pro Leu Pro Ala Gln Leu Ala
210 215 220
Leu Glu Ala Arg Leu Gln Gly Glu Leu Gly Gln Ser Ala Asp Phe Arg
225 230 235 240
Glu Gly Val Gln Ala Phe Leu Glu Lys Arg Pro Pro His Phe Thr Gly
245 250 255
Arg
<210> 77
<211> 701
<212> PRT
<213> Pseudomonas stutzeri
<400> 77
Met Thr Asp Val Ile Arg Leu Glu Arg Arg Gly Asp Ile Ala Leu Ile
1 5 10 15
Leu Val Asn Asn Pro Pro Val Asn Ala Leu Gly His Ala Val Arg Lys
20 25 30
Gly Leu Leu Asp Ala Phe Gln Glu Ala Asp Glu Ala Pro Glu Val Thr
35 40 45
Ala Val Val Leu Val Cys Glu Gly Pro Thr Phe Met Ala Gly Ala Asp
50 55 60
Ile Lys Glu Phe Gly Lys Pro Pro Gln Ala Pro Ser Leu Pro Glu Val
65 70 75 80
Ile Glu Val Ile Glu Gly Cys Arg Lys Pro Ser Val Ala Val Ile His
85 90 95
Gly Thr Ala Leu Gly Gly Gly Leu Glu Val Ala Leu Gly Cys His Tyr
100 105 110
Arg Ile Ala Arg Ser Asp Ala Lys Val Gly Leu Pro Glu Val Lys Leu
115 120 125
Gly Leu Leu Pro Gly Ala Gly Gly Thr Gln Arg Leu Pro Arg Leu Ala
130 135 140
Gly Val Glu Lys Ala Leu Glu Met Ile Val Ser Gly Gln Pro Ile Gly
145 150 155 160
Ala Ala Glu Ala Leu Glu His Tyr Ile Val Asp Glu Leu Phe Glu Gly
165 170 175
Asp Leu Ile Glu Ala Gly Leu Thr Tyr Ala Arg Arg Leu Val Glu Glu
180 185 190
Gly Arg Gly Pro Arg Arg Ser Gly Glu Gln Thr Arg Gly Leu Glu Gly
195 200 205
Val Asp Asn Glu Ala Leu Ile Arg Ala Lys His Ala Glu Val Ala Lys
210 215 220
Arg Met Pro Gly Leu Phe Ser Pro Leu Arg Cys Ile Ala Ala Val Glu
225 230 235 240
Ala Ala Thr Arg Leu Pro Leu Ala Glu Gly Leu Lys Arg Glu Arg Glu
245 250 255
Leu Phe Thr Glu Cys Leu Asn Ser Pro Gln Arg Gly Ala Leu Ile His
260 265 270
Ser Phe Phe Ala Glu Arg Gln Ala Gly Lys Ile Asp Asp Leu Pro Ser
275 280 285
Asp Val Thr Pro Arg Pro Ile Arg Thr Ala Ala Val Ile Gly Gly Gly
290 295 300
Thr Met Gly Val Gly Ile Ala Leu Ser Phe Ala Asn Ala Gly Val Pro
305 310 315 320
Val Lys Leu Leu Glu Ile Asn Asp Glu Ala Leu Gln Arg Gly Leu Gln
325 330 335
Arg Ala Arg Glu Thr Tyr Ala Ala Ser Val Lys Arg Gly Ser Leu Thr
340 345 350
Glu Asp Ala Met Glu Gln Arg Leu Ala Leu Ile Ala Gly Val Thr Asp
355 360 365
Tyr Gly Ala Leu Ala Asp Ala Asp Val Val Val Glu Ala Val Phe Glu
370 375 380
Glu Met Gly Val Lys Gln Gln Val Phe Glu Gln Leu Asp Ala Val Cys
385 390 395 400
Lys Pro Gly Ala Ile Leu Ala Ser Asn Thr Ser Ser Leu Asp Leu Asn
405 410 415
Ala Ile Ala Gly Phe Thr Arg Arg Pro Glu Asp Val Val Gly Met His
420 425 430
Phe Phe Ser Pro Ala Asn Val Met Arg Leu Leu Glu Val Val Arg Gly
435 440 445
Glu Arg Thr Ser Asp Glu Val Leu Ala Ala Ala Met Ala Ile Gly Lys
450 455 460
Gln Leu Lys Lys Val Ser Val Val Val Gly Val Cys Asp Gly Phe Val
465 470 475 480
Gly Asn Arg Met Val Phe Gln Tyr Gly Arg Glu Ala Glu Phe Leu Leu
485 490 495
Glu Glu Gly Ala Thr Pro Gln Gln Val Asp Ala Ala Leu Arg Asn Phe
500 505 510
Gly Met Ala Met Gly Pro Phe Ala Met Arg Asp Leu Ser Gly Leu Asp
515 520 525
Ile Gly Gln Ala Ile Arg Lys Arg Gln Arg Ala Thr Leu Pro Ala His
530 535 540
Leu Asp Phe Pro Thr Val Ser Asp Lys Leu Cys Ala Ala Gly Met Leu
545 550 555 560
Gly Gln Lys Thr Gly Ala Gly Tyr Tyr Arg Tyr Glu Pro Gly Asn Arg
565 570 575
Thr Pro Gln Glu Asn Pro Asp Leu Ala Pro Met Leu Glu Ala Ala Ser
580 585 590
Arg Glu Lys Gly Ile Glu Arg Gln Ala Leu Asp Glu Gln Tyr Ile Val
595 600 605
Glu Arg Cys Ile Phe Ala Leu Val Asn Glu Gly Ala Lys Ile Leu Glu
610 615 620
Glu Gly Ile Ala Gln Arg Ser Ser Asp Ile Asp Val Ile Tyr Leu Asn
625 630 635 640
Gly Tyr Gly Phe Pro Ala Phe Arg Gly Gly Pro Met Tyr Tyr Ala Asp
645 650 655
Ser Val Gly Leu Asp Lys Val Leu Ala Arg Val Lys Glu Leu His Ala
660 665 670
Arg Cys Gly Asp Trp Trp Lys Pro Ala Pro Leu Leu Glu Lys Leu Ala
675 680 685
Ala Glu Gly Arg Thr Phe Thr Glu Trp Gln Ala Gly Gln
690 695 700
<210> 78
<211> 655
<212> PRT
<213> Haliangium ochraceum
<400> 78
Met Ile Val Gly Val Ile Gly Ser Gly Ala Ile Gly Pro Asp Leu Ala
1 5 10 15
Tyr Gly Phe Ala Ser Ala Leu Ala Ser Val Pro Gly Ala Arg Val Tyr
20 25 30
Leu His Asp Ile Lys Gln Glu Ala Leu Asp Ala Gly Met Gln Arg Ile
35 40 45
Arg Gly Tyr Ile Ala Lys Gly Leu Ala Arg Gly Lys Ile Ser Glu Arg
50 55 60
Val Ala Gly Ala Leu Glu Thr Val Leu Val Pro Thr Leu Ser Leu Ala
65 70 75 80
Asp Leu Ala Pro Cys Ser Tyr Val Leu Glu Ala Ala Thr Glu Glu Leu
85 90 95
Gly Val Lys Arg Ala Ile Leu Arg Ser Leu Glu Asp Thr Val Asp Ser
100 105 110
Glu Cys Leu Ile Gly Phe Ala Thr Ser Gly Leu Pro Arg Ala Ile Ile
115 120 125
Ala Ala Glu Val Lys His Pro Glu Arg Cys Phe Val Asn His Pro Phe
130 135 140
Tyr Pro Ala Trp Arg Ser Leu Pro Val Glu Val Val Leu Ser Gly Ser
145 150 155 160
Pro Ala His Gly Gln Arg Met Leu Ala Thr Leu Glu Ala Leu Gly Lys
165 170 175
Val Pro Val Ile Thr Ala Asp Ala Pro Cys Phe Ala Ala Asp Asp Ile
180 185 190
Phe Cys Asn Tyr Cys Ser Glu Ala Ala Arg Ile Val Glu Glu Gly Ile
195 200 205
Ala Asn Pro Ala Gln Val Asp Ala Ile Val His Gly Ala Ile Gly Gly
210 215 220
Gly Gly Pro Leu Asn Val Leu Asp Ala Thr Arg Gly Asn Leu Leu Thr
225 230 235 240
Val His Cys Gln Glu Leu Met Arg Asp Ala Asp Thr Gly Thr Pro Trp
245 250 255
Phe Glu Pro Pro Ala Ile Leu Arg Glu Arg Gly Asp Ala Leu Trp His
260 265 270
Asp Pro Lys Ala Pro His Asp Pro Ala Phe Asp Glu Ala Leu Arg Glu
275 280 285
Arg Val Leu Asp Arg Ile Leu Ala Val Leu Leu Ala Arg Thr Val Phe
290 295 300
Val Leu Asp His Gly Ile Cys Ala Ala Thr Glu Leu Asp Trp Met Thr
305 310 315 320
Arg Thr Ala Leu Gly Phe Arg Thr Gly Leu Val Asp Leu Val Asp Glu
325 330 335
Leu Gly Pro Glu Arg Val Ala Glu Leu Cys Gln Arg Tyr Ala Ala Glu
340 345 350
His Pro Gly Phe Val Ile Pro Asp Ser Ile Arg Glu Gln His Lys Pro
355 360 365
Arg Phe Tyr Gly Asn Leu Arg Val Thr Arg Gln Asp Glu Leu Ala Ile
370 375 380
Val Arg Ile Phe Arg Pro Glu Val Lys Asn Ala Leu Asp Arg Arg Thr
385 390 395 400
Leu Ser Glu Leu Asp His Leu Met Ala Ala Leu Ser Ala Asp Asp Ser
405 410 415
Val Glu Gly Val Val Leu Ser Ser Ala Gly Gly Ala Leu Ala Gly Ala
420 425 430
Asp Ile Thr Glu Leu Ala Arg Val Arg Thr Thr Glu Glu Ala Val Ser
435 440 445
Thr Cys Ala Phe Gly Gln Ala Val Leu Asn Arg Ile Ala Ala Met Asp
450 455 460
Lys Pro Val Val Ala Ala Val Asp Gly Pro Val Leu Gly Gly Gly Ala
465 470 475 480
Glu Leu Ser Met Ala Cys His Ala Arg Val Val Gly Pro Arg Leu Ser
485 490 495
Met Gly Gln Pro Glu Val Asn Leu Gly Ile Ile Pro Gly Tyr Gly Gly
500 505 510
Thr Gln Arg Leu Pro Arg Leu Ile Gly Val Glu Arg Ala Leu Ala Met
515 520 525
Met Arg Thr Ala Gln Ser Ile Asp Ala Gln Thr Ala Cys Glu Trp Gly
530 535 540
Trp Ala Ser Gly Thr Pro Met Val Asp Phe Val Gly Ala Ala Ala Thr
545 550 555 560
Leu Ile Arg Ser His Leu Ala Gly Glu Ala Glu Leu Ala Pro Leu Asp
565 570 575
Pro Ala Pro Met Ser Val Pro Ala Ala Ala Ala Pro Val Asp Ile Gly
580 585 590
His Arg Ser Arg Val Ile Asp Glu Ile Leu Val Asp Val Val Gln Ser
595 600 605
Gly Leu Arg Ala Pro Leu Ser Glu Gly Leu Ala Thr Glu Ala Ala Gly
610 615 620
Phe Gly Arg Cys Val Leu Thr Val Asp Leu Asp Ile Gly Leu Lys Asn
625 630 635 640
Phe Met Gln Asn Gly Pro Arg Val Pro Ala Leu Phe Leu His Glu
645 650 655
<210> 79
<211> 255
<212> PRT
<213> Anoxybacillus flavithermus
<400> 79
Met Phe Ser Ile Gln Gln Glu Gly Tyr Val Ala Ile Leu Ala Leu His
1 5 10 15
Arg Pro Pro Ala Asn Ala Leu Ala Ser Ser Val Leu Lys Glu Leu Ser
20 25 30
Glu Arg Leu Asp Ala Leu Lys Glu Asp Glu Gln Val Arg Val Ile Val
35 40 45
Leu His Gly Glu Gly Arg Phe Phe Ser Ala Gly Ala Asp Ile Lys Glu
50 55 60
Phe Thr Ala Ile Glu Ala Ser Glu Gln Ala Ala Glu Leu Ala Arg Ala
65 70 75 80
Gly Gln Gln Val Met Glu Lys Ile Glu Gln Phe Pro Lys Pro Ile Ile
85 90 95
Ala Ala Ile His Gly Ala Ala Leu Gly Gly Gly Leu Glu Leu Ala Met
100 105 110
Ser Cys His Leu Arg Ile Val Ala Glu Asn Ala Lys Leu Gly Leu Pro
115 120 125
Glu Leu Gln Leu Gly Ile Ile Pro Gly Phe Ala Gly Thr Gln Arg Leu
130 135 140
Leu Arg His Val Gly Met Ala Lys Ala Leu Glu Met Met Trp Thr Ser
145 150 155 160
Glu Pro Ile Thr Gly Ala Glu Ala Val Gln Trp Gly Leu Ala Asn Lys
165 170 175
Ala Val Pro Glu Glu Gln Leu Leu Asp Thr Ala Lys Gln Leu Ala Gln
180 185 190
Lys Ile Ala Gln Lys Ser Pro Ile Ser Val Gln Ala Val Leu Lys Leu
195 200 205
Val Asn Glu Ala Arg Thr Lys Thr Phe His Glu Cys Val Glu Lys Glu
210 215 220
Ala Gln Leu Phe Gly Gln Val Phe Val Thr Glu Asp Ala Lys Glu Gly
225 230 235 240
Ile Ser Ala Phe Ile Glu Lys Arg Thr Pro Gln Phe Gln Gly Lys
245 250 255
<210> 80
<211> 260
<212> PRT
<213> Streptomyces avermitilis
<400> 80
Met Ser Thr Ala Pro Glu Ala Ala Asp Leu Val Leu His Glu Arg His
1 5 10 15
Gly Gly Val Leu Thr Ile Thr Ile Asn Arg Pro Ala Gln Lys Asn Ala
20 25 30
Val Asp His Glu Ala Ala Val Gln Leu Ala Ala Ala Val Asp Leu Leu
35 40 45
Asp Ala Asp Pro Glu Leu Ser Val Gly Val Leu Thr Gly Ala Gly Gly
50 55 60
Val Phe Ser Ala Gly Met Asp Leu Lys Ala Phe Ala Lys Gly Glu Leu
65 70 75 80
Pro Leu Leu Pro Ser Arg Gly Leu Gly Gly Leu Thr Arg Ala Ser Val
85 90 95
Arg Lys Pro Leu Val Ala Ala Val Glu Gly Trp Ala Leu Gly Gly Gly
100 105 110
Phe Glu Leu Val Leu Ala Cys Asp Leu Ile Val Ala Ala Glu Asp Ala
115 120 125
Arg Phe Gly Phe Pro Glu Val Met Arg Gly Leu Val Ala Ala Glu Gly
130 135 140
Gly Leu Val Arg Leu Pro Arg Arg Leu Pro Tyr His Val Ala Ala Arg
145 150 155 160
Val Leu Leu Thr Gly Glu Pro Leu Thr Ala Val Glu Ala Lys Glu Tyr
165 170 175
Gly Leu Val Asn Glu Leu Thr Pro Pro Gly Ala Ala Leu Asp Ala Ala
180 185 190
Arg Glu Leu Ala Gly Arg Val Ala Arg Asn Ala Pro Leu Ala Leu Ala
195 200 205
Ala Val Lys Glu Val Leu Arg Glu Thr Gln Gly Leu Lys Glu Ser Asp
210 215 220
Ala Phe Arg Arg Gln Asp Glu Leu Thr Ser Gly Leu Ala Ala Ser Glu
225 230 235 240
Asp Ala Arg Glu Gly Ala Gln Ala Phe Ala Glu Lys Arg Ala Pro Val
245 250 255
Trp His Gly Arg
260
<210> 81
<211> 560
<212> PRT
<213> Advenella kashmirensis
<400> 81
Met Asp Asn Gly Arg Lys Leu Ile Glu Arg Gly Trp His Leu Phe Asn
1 5 10 15
Arg Ile Glu Lys Leu Ala Phe Pro Thr Leu Ala Leu Met His Gly Pro
20 25 30
Cys Leu Gly Gly Gly Leu Glu Leu Ala Leu Ala Cys Arg Tyr Arg Ile
35 40 45
Ala Ile Asp Ser Pro Lys Pro Val Ile Gly Leu Pro Glu Val Lys Leu
50 55 60
Gly Ile Phe Pro Ala Trp Gly Gly Leu Met Arg Leu Pro Arg Leu Ile
65 70 75 80
Gly Pro Gln Thr Ala Leu Asn Met Met Leu Thr Gly Arg Thr Leu Asp
85 90 95
Gly Arg Lys Ala Arg Ser Ala Gly Leu Val Asp Leu Leu Val Ala Pro
100 105 110
Arg Val Ala Glu Lys Ser Ala Ile Asp Leu Val Thr Ser Gly Lys Pro
115 120 125
Ala Arg Gln Ala Arg Gly Leu Ala Gly Leu Leu Asn Arg Ala Pro Phe
130 135 140
Lys Ser Leu Val Ala Ala Gln Ala Arg Lys Ser Val Lys Gln Lys Asp
145 150 155 160
Pro Tyr Gly His Tyr Pro Ala Thr Leu Thr Met Leu Asp Leu Trp Glu
165 170 175
Lys His Asp Gly Asp Pro Leu Ala Asp Pro Gln Ala Leu Thr Arg Leu
180 185 190
Leu Gln Ser Asp Val Thr Arg Asn Leu Ile Arg Val Phe His Leu Gln
195 200 205
Glu Arg Leu Lys Ala Phe Gly Lys Lys Asp Asn Ala Thr Pro Val Asn
210 215 220
His Val His Val Ile Gly Ala Gly Val Met Gly Gly Gly Ile Ala Ala
225 230 235 240
Trp Cys Ala Leu Gln Gly Ile Lys Thr Thr Leu Gln Asp Thr Asp Ala
245 250 255
Gln Arg Ile Ala Gly Ala Phe Lys Asn Ala Val Ser Ile Tyr Ala Arg
260 265 270
Lys Asp Arg Tyr Thr Ala Gln Ala Ala Arg Asp Arg Leu Ile Pro Asp
275 280 285
Leu Ala Gly His Gly Ile Ala Thr Ala Asp Leu Val Ile Glu Ala Ile
290 295 300
Ser Glu Asn Pro Gln Ala Lys Gln Ser Leu Tyr Gln Gln Ile Glu Pro
305 310 315 320
Lys Met Lys Glu Gly Ala Ile Leu Ala Thr Asn Thr Ser Ser Leu Ser
325 330 335
Ile Ala Gln Leu Arg Ser Val Leu Val His Pro Glu Arg Phe Val Gly
340 345 350
Ile His Phe Phe Asn Pro Val Ser Arg Met Pro Leu Val Glu Val Val
355 360 365
His Ala Asp Gly Ile Ala Gln Glu Thr Leu Asp Thr Ala Ala Ala Phe
370 375 380
Val Gly Lys Ile Gly Lys Leu Pro Leu Pro Val Gln Asp Thr Pro Gly
385 390 395 400
Phe Leu Val Asn Ala Val Leu Ala Pro Tyr Met Leu Gln Ala Met Arg
405 410 415
Cys Ile Asp Glu Gly Met Asp Pro Glu Val Ile Asp Thr Ala Met Leu
420 425 430
Glu Phe Gly Met Pro Met Gly Pro Ile Thr Leu Ala Asp Thr Val Gly
435 440 445
Leu Asp Ile Ala Met Ala Ala Gly Lys Gln Leu Ser Glu Gly Gln Glu
450 455 460
Pro Pro Arg Cys Leu Gln Glu Lys Ile Ala Gln Gly Lys Leu Gly Val
465 470 475 480
Lys Ser Gly Glu Gly Phe Tyr Val Trp Lys Asp Arg Lys His Asp Gln
485 490 495
Arg Ser Ser Lys Ala Ile Pro Gln Gly Leu Ala Gln Arg Leu Ile Lys
500 505 510
Pro Leu Ile Glu Gln Thr Glu Lys Gln Leu Ala Asn Asn Ile Val Gln
515 520 525
Asp Ala Asp Leu Ala Asp Ala Gly Val Ile Phe Gly Thr Gly Phe Ala
530 535 540
Pro Phe Thr Gly Gly Pro Ile His Tyr Lys Gln Ser Lys Gly Gly Leu
545 550 555 560
<210> 82
<211> 237
<212> PRT
<213> Oligotropha carboxidovorans
<400> 82
Met Ser Leu Ser Pro Leu Ala Asn Gly Val Arg Val Leu Thr Leu Asp
1 5 10 15
Arg Pro Ser Lys Ala Asn Ala Leu Asn Ala Glu Val Val Asp Gln Leu
20 25 30
Leu Ala Cys Val Ala Gln Ala Glu Ala Glu Asp Cys Arg Val Leu Ile
35 40 45
Leu Ala Ala Asn Gly Lys Ala Phe Cys Gly Gly Phe Asp Phe Gly Gly
50 55 60
Tyr Glu Ser Met Ser Ala Gly Asp Leu Leu Leu Arg Phe Val Arg Ile
65 70 75 80
Glu Glu Leu Leu Gln Arg Met Arg Gln Ser Ser Phe Val Ser Ile Ala
85 90 95
Leu Val His Gly Ala Ala Met Gly Ala Gly Ala Asp Ile Val Ala Ser
100 105 110
Cys Thr Tyr Arg Ile Gly Thr Asp Ala Ser Arg Phe Arg Phe Pro Gly
115 120 125
Phe Arg Phe Gly Val Ala Leu Gly Thr Arg His Leu Ala Gln Leu Val
130 135 140
Gly Pro Gln Arg Ala Arg Asp Ile Leu Leu Thr Asn Ala Thr Ile Asp
145 150 155 160
Ala Leu Thr Ala Val Asp Ile Gly Leu Leu Thr His Leu Val Asp Ala
165 170 175
Gly Ser Met Arg Gln Lys Ala Asp Glu Ile Ile Ala Gln Ile Gly Ser
180 185 190
Leu Asp Arg Val Ala Arg Asn Arg Ile Leu His Leu Thr Ser Ala Gln
195 200 205
Asn Asn Asp Gly Asp Met Ala Glu Leu Val Lys Ser Val Ser Ala Pro
210 215 220
Gly Leu His Glu Arg Ile Ala Gln Tyr Arg Ala Gly His
225 230 235
<210> 83
<211> 266
<212> PRT
<213> Riemerella anatipestifer
<400> 83
Met Tyr Lys Leu Ile Asp Val Asp Asn His Phe Glu Gly Lys Leu Gln
1 5 10 15
Ile Ala Tyr Ile Asn Gln Pro Glu Ser Phe Asn Ser Leu Asn Lys Val
20 25 30
Val Leu Glu Glu Leu Leu His Phe Ile Lys Ala Cys Asp Ala Asp Ser
35 40 45
Ser Val Arg Cys Ile Ala Ile Ser Gly Lys Gly Lys Ala Phe Cys Ser
50 55 60
Gly Gln Asn Leu Lys Glu Ala Leu Asp Tyr Lys Ala Glu Ala Asn Glu
65 70 75 80
Glu Arg Phe Ile Gln Arg Ile Val Ile Asp Tyr Tyr Asn Pro Leu Val
85 90 95
Lys Ala Ile Val Tyr Ala Lys Lys Pro Val Ile Ala Leu Val Asn Gly
100 105 110
Pro Ala Val Gly Ala Gly Ala Met Leu Ala Leu Ile Cys Asp Phe Ala
115 120 125
Val Ala Ser Glu Ser Ala Tyr Phe Ser Leu Ala Phe Ser Asn Ile Gly
130 135 140
Leu Val Pro Asp Thr Ala Gly Thr Tyr Tyr Leu Pro Lys Leu Leu Gly
145 150 155 160
Arg Ser Leu Ala Ser Tyr Leu Ala Phe Thr Gly Lys Lys Leu Ser Ala
165 170 175
Lys Glu Ser Leu Glu Arg Gly Leu Val Val Asp Val Phe Ser Asp Ala
180 185 190
Thr Phe Ser Glu Gln Ser Leu Gln Val Leu Glu His Ile Thr His Gln
195 200 205
Pro Thr Val Ala Leu Gly Leu Thr Lys Lys Ala Phe Asn Lys Ser Tyr
210 215 220
Gln Asn Ser Leu Ser Glu Gln Leu Asp Leu Glu Ser Ile Leu Gln Gln
225 230 235 240
Asp Ala Ala Glu Thr Trp Asp Phe Gln Glu Gly Ile Ala Ala Phe Leu
245 250 255
Ala Lys Arg Lys Pro Gln Tyr Lys Gly Lys
260 265
<210> 84
<211> 422
<212> PRT
<213> Fusobacterium necrophorum subsp. funduliforme Fnf 1007
<400> 84
Met Ser Glu Thr Ile Asn Leu Asp Glu Met Ser Ala Lys Gln Leu Leu
1 5 10 15
Gly Tyr Tyr Gln Glu Lys Leu Asp Glu Glu Ala Arg Gln Ala Lys Arg
20 25 30
Glu Gly Lys Leu Val Cys Trp Ser Ala Ser Val Ala Pro Pro Glu Phe
35 40 45
Cys Val Ala Met Asp Ile Ala Met Val Tyr Pro Glu Thr His Ala Ala
50 55 60
Gly Ile Gly Ala Arg Lys Gly Ser Leu Asp Leu Leu Glu Val Ala Asp
65 70 75 80
Glu Lys Gly Tyr Ser Leu Asp Ile Cys Ser Tyr Ala Arg Val Asn Leu
85 90 95
Gly Tyr Met Glu Leu Leu Lys Gln Gln Ala Leu Thr Gly Glu Thr Pro
100 105 110
Glu Lys Leu Ala Asn Ser Pro Ala Ala Lys Val Pro Leu Pro Asp Leu
115 120 125
Val Ile Thr Cys Asn Asn Ile Cys Asn Thr Leu Leu Lys Trp Tyr Glu
130 135 140
Asn Leu Ala Lys Glu Leu Asn Ile Pro Cys Ile Val Ile Asp Val Pro
145 150 155 160
Phe Asn His Thr Met Pro Ile Thr Lys His Ser Lys Glu Tyr Ile Ala
165 170 175
Asp Gln Phe Lys Tyr Ala Ile Gln Gln Leu Glu Glu Ile Thr Gly Lys
180 185 190
Lys Phe Asp Tyr Asp Lys Phe Leu Glu Val Gln Glu Gln Thr Gln Arg
195 200 205
Ser Val Tyr Gln Trp Asn Arg Leu Ala Ala Leu Ala His Tyr Lys Pro
210 215 220
Ser Pro Leu Asn Gly Phe Asp Leu Phe Asn Phe Met Ala Leu Ile Val
225 230 235 240
Cys Ala Arg Ser Arg Asp Tyr Ala Glu Ile Thr Phe Lys Lys Phe Ala
245 250 255
Asp Glu Leu Glu Glu Asn Leu Lys Asn Glu Val Tyr Ala Phe Lys Gly
260 265 270
Ala Glu Lys Asn Arg Val Thr Trp Glu Gly Ile Ala Val Trp Pro Tyr
275 280 285
Leu Gly His Thr Phe Lys Ser Leu Lys Gly Met Gly Ser Ile Met Thr
290 295 300
Gly Ser Ala Tyr Pro Gly Ile Trp Asn Leu Thr Tyr Thr Pro Gly Asp
305 310 315 320
Met Glu Ser Met Ala Glu Ala Tyr Thr Arg Val Tyr Ile Asn Thr Cys
325 330 335
Leu Gln Asn Lys Ala Asp Val Leu Ser Lys Ile Val Thr Asp Gly Lys
340 345 350
Cys Asp Gly Ile Leu Tyr His Leu Asn Arg Ser Cys Lys Leu Met Ser
355 360 365
Phe Leu Asn Val Glu Thr Ala Glu Leu Val Glu Lys Ala Thr Gly Val
370 375 380
Pro Tyr Val Ser Phe Asp Gly Asp Gln Thr Asp Pro Arg Asn Phe Ala
385 390 395 400
Pro Ala Gln Phe Asp Thr Arg Val Gln Ala Leu Asn Glu Met Met Glu
405 410 415
Val Asn Asn Glu Thr Lys
420
<210> 85
<211> 277
<212> PRT
<213> Fusobacterium necrophorum subsp. funduliforme Fnf 1007
<400> 85
Met Gln Asp Asp Arg Ser Phe Lys Lys Gly Lys Arg Arg Gly Met Tyr
1 5 10 15
Thr Val Gly Val Asp Ile Gly Ser Ser Ser Ser Lys Val Val Ile Leu
20 25 30
Lys Asp Gly Thr Glu Ile Val Ser Gln Ser Ala Ile Gln Ser Gly Ile
35 40 45
Gly Ser Asn Arg Ala Ile Val Ala Leu Glu Asp Asn Leu Lys Lys Ala
50 55 60
Asn Leu Thr Lys Glu Asp Ile Gly Phe Thr Val Val Thr Gly Tyr Gly
65 70 75 80
Arg Phe Thr Phe Glu Gly Ala Asp Lys Gln Ile Ser Glu Ile Ser Cys
85 90 95
His Ala Arg Gly Ile His Phe Leu Leu Pro Asn Val Arg Thr Ile Ile
100 105 110
Asp Ile Gly Gly Gln Asp Ala Lys Ala Ile Ser Leu Asp Glu Lys Gly
115 120 125
His Val Arg Gln Phe Phe Met Asn Asp Lys Cys Ala Ala Gly Thr Gly
130 135 140
Arg Phe Leu Thr Val Met Ala Arg Val Leu Glu Ile Ser Leu Asp Glu
145 150 155 160
Met Gly Thr Tyr Asp Ala Leu Ser Lys Asn Pro Cys Asn Ile Ser Ser
165 170 175
Thr Cys Ala Val Phe Ala Glu Ser Glu Val Ile Ser Gln Leu Ala Lys
180 185 190
Gly Asn Thr Lys Glu Asp Val Ile Ala Gly Val His Asn Ser Val Ala
195 200 205
His Lys Ile Leu Gly Leu Val Tyr Arg Thr Ser Met Glu Glu Lys Phe
210 215 220
Ala Ile Cys Gly Gly Val Ala Gln Asn Thr Gly Ala Leu Arg Ala Ile
225 230 235 240
Arg Glu Ala Leu Lys Lys Glu Val Ile Val Ala Pro Asn Pro Gln Leu
245 250 255
Thr Gly Ala Leu Gly Ala Ala Ile Phe Ala Tyr Asp Glu Leu Lys Lys
260 265 270
Leu Arg Lys Gly Glu
275
<210> 86
<211> 374
<212> PRT
<213> Fusobacterium necrophorum subsp. funduliforme Fnf 1007
<400> 86
Met Lys Gly Arg Leu Glu Glu Leu Ile His Ile Phe Glu Asp Val Ala
1 5 10 15
Asn Asn Pro Lys Lys Met Val Ala Glu Tyr Lys Lys Glu Val Gly Lys
20 25 30
Glu Val Ile Gly Val Met Pro Val Tyr Ala Pro Glu Glu Ile Ile His
35 40 45
Ala Ala Gly Cys Leu Pro Ile Gly Leu Trp Gly Gly Lys Lys Glu Val
50 55 60
Ser Lys Ala Arg Ala Tyr Leu Pro Pro Phe Ala Cys Ser Ile Met Gln
65 70 75 80
Thr Val Met Glu Leu Gln Ile Gly Gly Thr Tyr Asp Ile Leu Asp Ala
85 90 95
Val Leu Phe Ser Val Pro Cys Asp Thr Leu Lys Cys Leu Ser Gln Lys
100 105 110
Trp Lys Gly Lys Ser Pro Val Ile Val Phe Thr His Pro Gln Asn Arg
115 120 125
Val Ile Glu Gly Ala Asn Ala Tyr Leu Val Lys Glu Tyr Gln Ala Val
130 135 140
Lys Glu Lys Leu Glu Gly Ile Leu Gly Arg Thr Ile Pro Met Glu Ala
145 150 155 160
Ile Glu Glu Ser Val Lys Val Tyr Asn Glu Asn Arg Arg Val Met Arg
165 170 175
Glu Phe Val Glu Val Ala Ala Gln Tyr Pro Gln Ile Ile Asp Pro Ile
180 185 190
Val Arg His Asn Val Met Lys Ser Arg Trp Phe Leu Arg Lys Glu Lys
195 200 205
His Thr Glu Tyr Val Lys Glu Leu Ile Ala Glu Leu Lys Lys Glu Thr
210 215 220
Ile Val Pro Trp Asp Gly Lys Lys Val Ile Leu Thr Gly Ile Met Thr
225 230 235 240
Glu Pro Val Glu Leu Leu Gln Ile Phe Lys Asp Glu Lys Leu Ala Ile
245 250 255
Val Ala Asp Asp Leu Ala His Glu Ser Arg Gln Phe Arg Gly Asp Val
260 265 270
Pro Glu Glu Gly Gly Asp Val Leu Tyr Arg Met Ala Lys Trp Trp Gln
275 280 285
Asn Leu Glu Gly Cys Ser Leu Ala Thr Asp Thr Asn Lys Gly Arg Gly
290 295 300
Gln Met Leu Met Asp Met Cys Lys Asp Thr Lys Ala Asp Ala Val Ile
305 310 315 320
Val Cys Met Met Lys Phe Cys Asp Pro Glu Glu Phe Asp Tyr Pro Val
325 330 335
Tyr Tyr Arg Glu Phe Thr Glu Ser Gly Ile Lys Asn Ile Thr Val Glu
340 345 350
Val Asp Leu Glu Val Ser Ser Phe Glu Gln Ile Arg Thr Arg Ile Gln
355 360 365
Thr Phe Lys Asp Ile Leu
370
<210> 87
<211> 422
<212> PRT
<213> Desulfosporosinus youngiae DSM 17734
<400> 87
Met Thr Asp Thr Thr Thr Met Ser Ala Lys Glu Leu Leu Gly Phe Tyr
1 5 10 15
Gln Glu Glu Leu Tyr Glu Glu Ala Arg Gln Ala Lys Lys Glu Gly Lys
20 25 30
Leu Val Cys Trp Ser Ala Ser Val Ala Pro Ser Glu Phe Cys Val Ala
35 40 45
Met Asp Val Ala Met Ile Tyr Pro Glu Thr His Ala Ala Gly Ile Gly
50 55 60
Ala Arg Lys Gly Ala Leu Asp Val Leu Glu Val Ala Asp Glu Lys Gly
65 70 75 80
Tyr Asn Leu Asp Thr Cys Ser Tyr Ala Arg Val Asn Met Gly Tyr Met
85 90 95
Glu Leu Leu Lys Gln Glu Ala Leu Thr Gly Ile Thr Pro Glu Lys Leu
100 105 110
Glu Lys Ser Pro Ala Ala Arg Ile Pro Leu Pro Asp Phe Val Ile Thr
115 120 125
Cys Asn Asn Ile Cys Asn Thr Leu Leu Lys Trp Tyr Glu Asn Leu Ala
130 135 140
Val Glu Leu Asn Ile Pro Cys Ile Ile Ile Asp Val Pro Phe Asn His
145 150 155 160
Thr Met Pro Ile Pro Gln Tyr Ala Lys Asp Tyr Ile Ala Glu Gln Phe
165 170 175
Lys Glu Ala Ile Thr Gln Leu Glu Glu Ile Cys Gly Arg Lys Phe Asp
180 185 190
Tyr Asp Lys Phe Leu Lys Val Gln Glu Gln Thr Gln Arg Ser Val Ala
195 200 205
Gln Trp Asn Arg Ile Ala Ala Leu Ser Gly His Lys Pro Ser Pro Leu
210 215 220
Asn Gly Phe Asp Leu Phe Asn Tyr Met Ala Leu Ile Val Cys Ala Arg
225 230 235 240
Ser Arg Asp Tyr Ala Glu Ile Thr Phe Lys Lys Phe Ala Asp Glu Leu
245 250 255
Glu Glu Asn Leu Lys Asn Gly Ile Tyr Ala Phe Lys Gly Asn Glu Gln
260 265 270
Lys Arg Val Thr Trp Glu Gly Ile Ala Val Trp Pro His Leu Gly His
275 280 285
Thr Phe Lys Gly Leu Lys Asn Leu Gly Asn Ile Met Thr Gly Ser Ala
290 295 300
Tyr Pro Gly Leu Trp Asn Leu Thr Tyr Thr Pro Gly Asp Met Ser Ser
305 310 315 320
Met Ala Glu Ala Tyr Thr Arg Ile Tyr Ile Asn Thr Cys Leu Asp Asn
325 330 335
Lys Val Lys Val Leu Ser Asp Val Ile Ser Gly Gly Lys Cys Asp Gly
340 345 350
Val Ile Tyr His Gln Asn Arg Ser Cys Lys Leu Met Ser Leu Leu Asn
355 360 365
Val Glu Thr Ala Asp Ile Leu Gln Lys Gln Asn His Leu Pro Tyr Val
370 375 380
Ser Phe Asp Gly Asp Gln Thr Asp Pro Arg Asn Phe Ala Pro Ala Gln
385 390 395 400
Phe Asp Thr Arg Ile Gln Ala Leu Asp Glu Met Met Lys Gln Asn Lys
405 410 415
Glu Gly Val Ser Asn Glu
420
<210> 88
<211> 372
<212> PRT
<213> Desulfosporosinus youngiae DSM 17734
<400> 88
Met Ser Arg Ile Glu Thr Ile Ile Ser Glu Leu Thr Ser Ile Ala Asn
1 5 10 15
Asn Pro Arg Gln Ala Met Glu Asp Tyr Lys Lys Glu Thr Gly Lys Gly
20 25 30
Ser Val Gly Val Met Pro Tyr Tyr Ala Pro Glu Glu Ile Ile His Ala
35 40 45
Ala Gly Tyr Leu Pro Val Gly Ile Trp Gly Gly Gln Lys Ser Ile Ser
50 55 60
Lys Ala Arg Ala Tyr Leu Pro Pro Phe Ala Cys Ser Ile Met Gln Ser
65 70 75 80
Val Val Glu Met Gln Leu Glu Gly Val Tyr Asp Asp Leu Glu Ala Val
85 90 95
Leu Phe Pro Val Pro Cys Asp Thr Leu Lys Cys Leu Ser Gln Lys Trp
100 105 110
Lys Gly Thr Ser Pro Val Ile Val Leu Thr His Pro Gln Asn Arg Lys
115 120 125
Leu Glu Ala Ala Asn Lys Phe Leu Ala Glu Glu Tyr Arg Leu Val Arg
130 135 140
Glu Lys Leu Glu Lys Ile Leu Asn Val Lys Ile Thr Asp Glu Ala Leu
145 150 155 160
Asn Gln Ser Ile Glu Ile Tyr Asn Glu Asn Arg Lys Val Met Arg Glu
165 170 175
Phe Thr Glu Ile Ala Ala Asn Tyr Pro Asn Ile Ile Asp Pro Val Lys
180 185 190
Arg His Ala Leu Ile Lys Ala Arg Phe Phe Met Glu Lys Ala Lys His
195 200 205
Thr Ala Leu Val Lys Glu Leu Asn Ala Glu Leu Lys Ala Leu Pro Val
210 215 220
Glu Ala Phe Thr Gly Lys Lys Val Val Leu Thr Gly Ile Met Ala Glu
225 230 235 240
Pro Asn Glu Val Leu Asp Ile Leu Gln Asp Asn Gly Phe Ala Val Val
245 250 255
Ala Asp Asp Leu Ala Gln Glu Ser Arg Leu Phe Arg Asn Asp Val Pro
260 265 270
Ser Gly Thr Asp Pro Leu Tyr Arg Leu Ala Lys Trp Trp Gln Glu Phe
275 280 285
Asp Gly Cys Ser Leu Ala Val Asp Ala Lys Lys Pro Arg Gly Pro Met
290 295 300
Leu Met Asp Met Val Lys Ala Ser Lys Ala Asp Ala Val Val Val Cys
305 310 315 320
Met Met Lys Phe Cys Asp Pro Glu Glu Phe Asp Tyr Pro Ile Tyr Tyr
325 330 335
Arg Gln Phe Glu Glu Ala Gly Ile Lys Ser Leu Phe Ile Glu Ile Asp
340 345 350
Leu Glu Pro Thr Ser Phe Glu Gln Thr Lys Thr Arg Val Gln Ser Phe
355 360 365
Arg Glu Met Leu
370
<210> 89
<211> 272
<212> PRT
<213> Desulfosporosinus youngiae DSM 17734
<400> 89
Met Phe Thr Met Gly Ile Asp Ile Gly Ser Ser Ser Ser Lys Val Val
1 5 10 15
Ile Leu Glu Asp Gly Val Asn Ile Ile Ala Gly Glu Val Ile Gln Ile
20 25 30
Gly Thr Gly Ser Thr Gly Pro Lys Arg Val Leu Asp Glu Ala Leu Ala
35 40 45
Lys Ala Gly Leu Thr Leu Gln Asp Met Ala Lys Ile Ile Ala Thr Gly
50 55 60
Tyr Gly Arg Ser Ser Val Glu Glu Ala His Lys Gln Ile Ser Glu Ile
65 70 75 80
Ser Cys Gln Ala Lys Gly Val Phe Phe Leu Val Pro Ser Ala Lys Leu
85 90 95
Ile Ile Asp Ile Gly Gly Gln Asp Val Lys Ala Ile Lys Leu Asp Ser
100 105 110
Lys Gly Cys Val Lys Gln Phe Phe Met Asn Asp Lys Cys Ala Ala Gly
115 120 125
Thr Gly Arg Phe Leu Asp Val Met Ser Arg Val Leu Glu Val Asn Leu
130 135 140
Asp Glu Met Ala Glu Tyr Asp Ala Arg Ala Thr Glu Pro Ala Thr Val
145 150 155 160
Ser Ser Thr Cys Thr Val Phe Ala Glu Ser Glu Val Ile Ser Gln Leu
165 170 175
Ala Asn Gly Val Ala Lys Glu Asn Ile Ile Ala Gly Val His Gln Ser
180 185 190
Val Ala Ser Lys Ala Cys Gly Leu Ala Tyr Arg Cys Gly Val Glu Glu
195 200 205
Asp Ile Val Met Cys Gly Gly Val Ala Lys Asp Leu Gly Val Val Arg
210 215 220
Ala Ile Ser Lys Glu Leu Lys Lys Pro Val Ile Val Ala Pro Asn Pro
225 230 235 240
Gln Ile Thr Ala Ala Leu Gly Ala Ala Ile Phe Ala Phe Glu Glu Val
245 250 255
Met Glu Thr Val Met Val Ala Phe Glu Glu Val Arg Gly Ala Asn Lys
260 265 270
<210> 90
<211> 422
<212> PRT
<213> Peptoniphilus indolicus ATCC 29427
<400> 90
Met Asn Thr Ile Asp Ile Ser Asn Met Lys Ala Lys Glu Met Leu Gly
1 5 10 15
Tyr Phe Gln Asn Lys Leu Asp Glu Glu Ala Arg Glu Ala Lys Lys Asn
20 25 30
Gly Lys Leu Val Cys Trp Ser Ala Ser Val Ala Pro Ser Glu Phe Cys
35 40 45
Val Thr Met Asp Ile Ala Leu Val Tyr Pro Glu Thr His Ala Ala Gly
50 55 60
Ile Gly Ala Arg Lys Gly Ser Leu Ala Met Leu Asp Val Ala Asp Arg
65 70 75 80
Lys Gly Tyr Asn Thr Asp Ile Cys Ser Tyr Ala Arg Val Asn Leu Gly
85 90 95
Tyr Met Glu Leu Leu Lys Glu Tyr Ala Lys Thr Gly Val Lys Pro Lys
100 105 110
Glu Leu Glu Glu Ser Pro Ala Ala Asp Val Pro Leu Pro Asp Leu Val
115 120 125
Ile Thr Cys Asn Asn Ile Cys Asn Thr Leu Leu Lys Trp Tyr Glu Asn
130 135 140
Leu Ala Ala Glu Leu Asn Ile Pro Cys Ile Val Ile Asp Val Pro Phe
145 150 155 160
Asn His Thr Met Pro Ile Pro Lys Tyr Ser Lys Glu Tyr Ile Ala Asp
165 170 175
Gln Phe Lys Glu Ala Ile Arg Gln Leu Glu Glu Ile Thr Gly Lys Asp
180 185 190
Phe Asp Tyr Asp Lys Phe Leu Glu Val Gln Glu Gln Thr Gln Arg Ser
195 200 205
Val Ala Gln Trp Asn Arg Leu Ala Ala Leu Ser Lys Tyr Glu Pro Ser
210 215 220
Pro Leu Asn Gly Phe Asp Leu Phe Asn Tyr Met Ala Leu Ile Val Cys
225 230 235 240
Ala Arg Ser Lys Asn Tyr Ala Glu Leu Thr Phe Lys Lys Phe Ala Asp
245 250 255
Glu Leu Glu Glu Asn Met Gln Asn Gly Val Tyr Pro Tyr Lys Ala Gly
260 265 270
Glu Gln Ser Arg Ile Thr Trp Glu Gly Ile Ala Ile Trp Pro Tyr Leu
275 280 285
Gly His Thr Phe Lys Thr Leu Lys Gly Tyr Gly Ser Ile Met Thr Gly
290 295 300
Ser Ala Tyr Pro Gly Leu Trp Asn Leu Glu Tyr Thr Pro Gly Asp Met
305 310 315 320
Leu Ser Met Ala Glu Ala Tyr Thr Arg Ile Tyr Ile Asn Thr Cys Leu
325 330 335
Asp Asn Lys Val Asp Val Leu Arg Lys Ile Ile Lys Asn Gly Lys Cys
340 345 350
Asp Gly Val Ala Tyr His Leu Asn Arg Ser Cys Lys Leu Met Ser Leu
355 360 365
Leu Asn Val Glu Thr Ala Glu Ile Leu Asn Lys Glu Asn Asn Leu Pro
370 375 380
Tyr Val Ser Phe Asp Gly Asp Gln Thr Asp Pro Arg Asn Phe Ser Glu
385 390 395 400
Ala Gln Tyr Asp Asn Arg Ile Gln Thr Leu Thr Glu Met Met Ser Ala
405 410 415
Asn Lys Lys Met Arg Gly
420
<210> 91
<211> 263
<212> PRT
<213> Peptoniphilus indolicus ATCC 29427
<400> 91
Met Tyr Thr Met Gly Val Asp Ile Gly Ser Thr Ser Ser Lys Ile Ile
1 5 10 15
Ile Leu Glu Asp Gly Ile Lys Ile Ile Gly Asn Ile Val Val Gln Ser
20 25 30
Gly Thr Gly Thr Ser Gly Pro Thr Ile Ala Thr Ala Lys Ala Lys Ser
35 40 45
Phe Leu Ser Asn Asn Asn Leu Thr Leu Asp Asp Ile Ser Lys Ile Val
50 55 60
Val Thr Gly Tyr Gly Arg Phe Ser Phe Asp Ile Ala Asp Lys Gln Ile
65 70 75 80
Ser Glu Ile Thr Cys His Thr Lys Gly Ile Asn Phe Leu Val Pro Glu
85 90 95
Ala Arg Thr Ile Leu Asp Ile Gly Gly Gln Asp Thr Lys Ala Ile Ser
100 105 110
Val Asn Asp Lys Gly Gln Val Leu Gln Phe Phe Met Asn Asp Lys Cys
115 120 125
Ala Ala Gly Thr Gly Arg Phe Leu Glu Val Met Ala Lys Ile Leu Glu
130 135 140
Ile Pro Leu Glu Lys Met Gly Glu Tyr Asp Arg Leu Ser Thr Asn Pro
145 150 155 160
Val Ala Ile Ser Ser Thr Cys Thr Val Phe Ala Glu Ser Glu Val Ile
165 170 175
Ser Gln Leu Ser Lys Gly Ile Ser Lys Glu Asn Ile Leu Ala Gly Val
180 185 190
His Asn Ser Thr Ala Asn Lys Val Cys Gly Leu Leu Tyr Arg Thr Gly
195 200 205
Ile Lys Glu Lys Ile Val Leu Cys Gly Gly Val Ala Gln Asn Gln Gly
210 215 220
Val Val Arg Ala Leu Gln Glu Glu Leu Lys Lys Glu Ile Thr Ile Ala
225 230 235 240
Pro His Pro Gln Met Thr Gly Ala Ile Gly Ala Ala Leu Phe Ala Tyr
245 250 255
Glu Glu Ala Asn Lys Asn Leu
260
<210> 92
<211> 372
<212> PRT
<213> Peptoniphilus indolicus ATCC 29427
<400> 92
Met Asn Lys Ile Asn Glu Ile Ile Asn Leu Leu Asp Glu Val Ser Lys
1 5 10 15
Asp Pro Lys Leu Thr Val Lys Lys Tyr Lys Glu Lys Thr Gly Lys Gly
20 25 30
Val Val Gly Val Met Pro Leu Tyr Ala Pro Glu Glu Ile Ile His Ala
35 40 45
Ala Gly Phe Leu Pro Met Gly Leu Trp Gly Ala Gln Lys Glu Val Ser
50 55 60
Lys Ala Arg Ile Tyr Leu Pro Pro Phe Ala Cys Ser Ile Met Gln Thr
65 70 75 80
Asn Met Glu Leu Gln Ile Glu Gly Ala Tyr Asp Asp Leu Asp Ala Val
85 90 95
Val Phe Ser Val Pro Cys Asp Thr Leu Lys Cys Met Ser Gln Lys Trp
100 105 110
Lys Gly Lys Ser Pro Val Ile Val Phe Thr His Pro Gln Asn Arg Lys
115 120 125
Leu Glu Ser Ala Asn Lys Phe Leu Val Thr Glu Tyr Glu Ile Leu Lys
130 135 140
Asp Lys Leu Glu Lys Ile Leu Asn Val Lys Ile Ser Asp Glu Ser Ile
145 150 155 160
Thr Asn Ser Ile Glu Ile Tyr Asn Glu Asn Arg Lys Val Met Arg Glu
165 170 175
Phe Ser Asp Leu Ala Gly Gln Tyr Pro Asn Ile Ile Asp Pro Ile Gln
180 185 190
Arg His Ile Val Phe Lys Ser Arg Trp Phe Met Glu Lys Ser Glu His
195 200 205
Thr Lys Leu Val Lys Glu Leu Ile Ser Glu Ile Lys Lys Leu Pro Ile
210 215 220
Glu Glu Trp Asp Gly Tyr Lys Val Ile Ala Thr Gly Ile Met Ile Glu
225 230 235 240
Pro Glu Glu Ile Leu Gln Ile Phe Lys Asp Lys Lys Ile Ala Ile Val
245 250 255
Ala Asp Asp Leu Ala Gln Glu Ser Arg Gln Phe Arg His Asp Val Pro
260 265 270
Glu Gly Asp Gln Pro Leu Leu Arg Leu Ala Lys Trp Trp Gln Asn Leu
275 280 285
Glu Gly Cys Ala Leu Ala Thr Asp Thr Lys Lys Leu Arg Gly Gln Met
290 295 300
Leu Ile Asp Met Ala Lys Lys Tyr Asn Ala Asp Ala Val Leu Ile Cys
305 310 315 320
Met Met Lys Phe Cys Asp Pro Glu Glu Phe Asp Tyr Pro Val Tyr Tyr
325 330 335
Arg Glu Phe Gln Glu Ala Gly Ile Lys Asn Leu Leu Ile Glu Ile Asp
340 345 350
Leu Glu Met Thr Ala Phe Glu Gln Thr Asn Thr Arg Leu Gln Thr Leu
355 360 365
Val Glu Thr Leu
370
<210> 93
<211> 422
<212> PRT
<213> Desulfosporosinus meridiei (strain ATCC BAA-275 / DSM 13257 / NCIMB 13706 / S10)
<400> 93
Met Thr Asp Thr Thr Ala Met Ser Ala Lys Glu Leu Leu Gly Phe Tyr
1 5 10 15
Gln Glu Glu Leu Tyr Glu Glu Ala Arg Arg Ala Lys Lys Glu Gly Lys
20 25 30
Leu Val Cys Trp Ser Ala Ser Val Ala Pro Ser Glu Phe Cys Val Ala
35 40 45
Met Asp Val Ala Met Ile Tyr Pro Glu Thr His Ala Ala Gly Ile Gly
50 55 60
Ala Arg Lys Gly Ala Leu Asp Val Leu Glu Val Ala Asp Glu Lys Gly
65 70 75 80
Tyr Asn Val Asp Thr Cys Ser Tyr Ala Arg Val Asn Leu Gly Tyr Met
85 90 95
Glu Leu Leu Lys Gln Glu Ala Leu Thr Gly Ile Thr Pro Glu Lys Leu
100 105 110
Glu Lys Ser Pro Ala Ala Arg Ile Pro Leu Pro Asp Phe Val Ile Thr
115 120 125
Cys Asn Asn Ile Cys Asn Thr Leu Leu Lys Trp Tyr Glu Asn Leu Ala
130 135 140
Val Glu Leu Asn Ile Pro Cys Ile Ile Ile Asp Val Pro Phe Asn His
145 150 155 160
Thr Met Pro Ile Pro Gln Tyr Ala Lys Asp Tyr Ile Ala Glu Gln Phe
165 170 175
Lys Glu Ala Ile Thr Gln Leu Glu Glu Ile Cys Gly Lys Lys Phe Asp
180 185 190
Tyr Asp Lys Phe Leu Lys Val Gln Glu Gln Thr Gln Arg Ser Val Ala
195 200 205
Gln Trp Asn Arg Ile Ala Ala Leu Ser Ser His Lys Pro Ser Pro Leu
210 215 220
Asn Gly Phe Asp Leu Phe Asn Tyr Met Ala Leu Ile Val Cys Ala Arg
225 230 235 240
Ser Lys Asp Tyr Ala Glu Ile Thr Phe Lys Lys Phe Ala Asp Glu Leu
245 250 255
Glu Glu Asn Leu Asn Lys Gly Ile Phe Ala Phe Lys Gly Asn Glu Gln
260 265 270
Lys Arg Val Thr Trp Glu Gly Ile Ala Val Trp Pro His Leu Gly His
275 280 285
Thr Phe Lys Gly Leu Lys Asn Leu Gly Asn Ile Met Thr Gly Ser Ala
290 295 300
Tyr Pro Gly Leu Trp Asn Val Ser Tyr Thr Pro Gly Asp Met Ser Ser
305 310 315 320
Met Ala Glu Ala Tyr Thr Arg Ile Tyr Ile Asn Thr Cys Leu Asp Asn
325 330 335
Lys Val Lys Val Leu Ser Asp Val Ile Ser Gly Gly Lys Cys Asp Gly
340 345 350
Val Ile Tyr His Gln Asn Arg Ser Cys Lys Leu Met Ser Phe Leu Asn
355 360 365
Val Glu Thr Ala Asp Ile Leu Gln Lys Glu Asn Gly Leu Pro Tyr Val
370 375 380
Ser Phe Asp Gly Asp Gln Thr Asp Pro Arg Asn Phe Ser Pro Ala Gln
385 390 395 400
Phe Asp Thr Arg Ile Gln Ala Leu Asp Glu Met Met Lys Gln Asn Lys
405 410 415
Glu Gly Val Ser Asn Glu
420
<210> 94
<211> 372
<212> PRT
<213> Desulfosporosinus meridiei (strain ATCC BAA-275 / DSM 13257 / NCIMB 13706 / S10)
<400> 94
Met Ser Arg Ile Glu Thr Ile Ile Ser Glu Leu Ser Ser Ile Ser Asn
1 5 10 15
Asn Pro Arg Lys Ala Met Glu Asp Tyr Lys Lys Glu Thr Gly Lys Gly
20 25 30
Ser Val Gly Val Met Pro Tyr Tyr Ala Pro Glu Glu Ile Ile His Ala
35 40 45
Ala Gly Phe Leu Pro Val Gly Ile Trp Gly Gly Gln Lys Ser Ile Ser
50 55 60
Lys Ala Arg Ala Tyr Leu Pro Pro Phe Ala Cys Ser Ile Met Gln Ser
65 70 75 80
Val Met Glu Met Gln Leu Glu Gly Val Tyr Asp Asp Leu Glu Ala Val
85 90 95
Leu Phe Pro Val Pro Cys Asp Thr Leu Lys Cys Leu Ser Gln Lys Trp
100 105 110
Lys Gly Thr Ser Pro Val Ile Val Phe Thr His Pro Gln Asn Arg Lys
115 120 125
Leu Glu Ala Ala Asn Lys Phe Leu Ala Glu Glu Tyr Arg Leu Val Arg
130 135 140
Glu Lys Leu Glu Thr Ile Leu Asn Val Lys Ile Thr Asp Glu Ala Leu
145 150 155 160
Asn Gln Ser Ile Glu Thr Tyr Asn Glu Asn Arg Lys Val Met Arg Glu
165 170 175
Phe Thr Asp Leu Ala Ala Asn Tyr Pro Gln Ile Ile Asp Pro Arg Ile
180 185 190
Arg His Ala Ile Ile Lys Ala Arg Phe Phe Met Glu Lys Ser Lys His
195 200 205
Thr Ala Met Val Lys Glu Leu Asn Ser Glu Leu Lys Ser Leu Pro Val
210 215 220
Glu Ala Phe Thr Gly Lys Lys Val Val Leu Thr Gly Ile Met Ala Glu
225 230 235 240
Pro Asn Glu Val Leu Asp Ile Leu Lys Asp Asn Gly Phe Ala Val Val
245 250 255
Ala Asp Asp Leu Ala Gln Glu Ser Arg Leu Phe Arg Asn Asp Val Pro
260 265 270
Ser Gly Thr Asp Pro Leu Tyr Arg Leu Ala Lys Trp Trp Gln Glu Phe
275 280 285
Asp Gly Cys Ser Leu Ala Thr Asp Ala Lys Lys Ser Arg Gly Pro Met
290 295 300
Leu Met Glu Met Val Lys Gly Ser Lys Ala Asp Ala Val Val Val Cys
305 310 315 320
Met Met Lys Phe Cys Asp Pro Glu Glu Phe Asp Tyr Pro Ile Tyr Tyr
325 330 335
Arg Gln Phe Glu Glu Ala Gly Ile Lys Ser Leu Phe Ile Glu Ile Asp
340 345 350
Leu Glu Thr Thr Ser Phe Glu Gln Thr Lys Thr Arg Val Gln Ser Phe
355 360 365
Ser Glu Met Leu
370
<210> 95
<211> 261
<212> PRT
<213> Desulfosporosinus meridiei (strain ATCC BAA-275 / DSM 13257 / NCIMB 13706 / S10)
<400> 95
Met Phe Thr Met Gly Ile Asp Ile Gly Ser Ser Ser Ser Lys Val Val
1 5 10 15
Ile Leu Glu Asp Gly Val Asn Ile Ile Ala Gly Glu Val Ile Gln Ile
20 25 30
Gly Thr Gly Ser Thr Gly Pro Lys Arg Val Leu Asn Glu Ala Leu Ser
35 40 45
Lys Ala Gly Leu Lys Leu Glu Asp Met Ala Lys Ile Ile Ala Thr Gly
50 55 60
Tyr Gly Arg Ser Ser Val Glu Glu Ala His Lys Gln Ile Ser Glu Ile
65 70 75 80
Ser Cys Gln Ala Lys Gly Val Phe Phe Leu Val Pro Ser Ala Lys Leu
85 90 95
Ile Ile Asp Ile Gly Gly Gln Asp Val Lys Ala Ile Arg Leu Asp Ser
100 105 110
Lys Gly Gly Val Lys Gln Phe Phe Met Asn Asp Lys Cys Ala Ala Gly
115 120 125
Thr Gly Arg Phe Leu Asp Val Met Ser Arg Val Leu Glu Val Asn Leu
130 135 140
Asp Glu Met Ala Glu Tyr Asp Ala Arg Ala Thr Glu Pro Ala Thr Val
145 150 155 160
Ser Ser Thr Cys Thr Val Phe Ala Glu Ser Glu Val Ile Ser Gln Leu
165 170 175
Ser Asn Gly Val Ala Lys Glu Asn Ile Ile Ala Gly Val His Gln Ser
180 185 190
Val Ala Ser Lys Ala Cys Gly Leu Ala Tyr Arg Cys Gly Val Glu Glu
195 200 205
Asp Ile Val Met Cys Gly Gly Val Ala Lys Asp Leu Gly Val Val Arg
210 215 220
Ala Ile Ser Lys Glu Leu Lys Lys Pro Val Ile Val Ala Pro Asn Pro
225 230 235 240
Gln Ile Thr Ala Ala Leu Gly Ala Ala Ile Phe Ala Phe Glu Glu Val
245 250 255
Arg Gly Ala Asn Lys
260
<210> 96
<211> 477
<212> PRT
<213> Acidaminococcus fermentans
<400> 96
Met Pro Lys Thr Val Ser Pro Gly Val Gln Ala Leu Arg Asp Val Val
1 5 10 15
Glu Lys Val Tyr Arg Glu Leu Arg Glu Ala Lys Glu Arg Gly Glu Lys
20 25 30
Val Gly Trp Ser Ser Ser Lys Phe Pro Cys Glu Leu Ala Glu Ser Phe
35 40 45
Gly Leu His Val Gly Tyr Pro Glu Asn Gln Ala Ala Gly Ile Ala Ala
50 55 60
Asn Arg Asp Gly Glu Val Met Cys Gln Ala Ala Glu Asp Ile Gly Tyr
65 70 75 80
Asp Asn Asp Ile Cys Gly Tyr Ala Arg Ile Ser Leu Ala Tyr Ala Ala
85 90 95
Gly Phe Arg Gly Ala Asn Lys Met Asp Lys Asp Gly Asn Tyr Val Ile
100 105 110
Asn Pro His Ser Gly Lys Gln Met Lys Asp Ala Asn Gly Lys Lys Val
115 120 125
Phe Asp Ala Asp Gly Lys Pro Val Ile Asp Pro Lys Thr Leu Lys Pro
130 135 140
Phe Ala Thr Thr Asp Asn Ile Tyr Glu Ile Ala Ala Leu Pro Glu Gly
145 150 155 160
Glu Glu Lys Thr Arg Arg Gln Asn Ala Leu His Lys Tyr Arg Gln Met
165 170 175
Thr Met Pro Met Pro Asp Phe Val Leu Cys Cys Asn Asn Ile Cys Asn
180 185 190
Cys Met Thr Lys Trp Tyr Glu Asp Ile Ala Arg Arg His Asn Ile Pro
195 200 205
Leu Ile Met Ile Asp Val Pro Tyr Asn Glu Phe Asp His Val Asn Glu
210 215 220
Ala Asn Val Lys Tyr Ile Arg Ser Gln Leu Asp Thr Ala Ile Arg Gln
225 230 235 240
Met Glu Glu Ile Thr Gly Lys Lys Phe Asp Glu Asp Lys Phe Glu Gln
245 250 255
Cys Cys Gln Asn Ala Asn Arg Thr Ala Lys Ala Trp Leu Lys Val Cys
260 265 270
Asp Tyr Leu Gln Tyr Lys Pro Ala Pro Phe Asn Gly Phe Asp Leu Phe
275 280 285
Asn His Met Ala Asp Val Val Thr Ala Arg Gly Arg Val Glu Ala Ala
290 295 300
Glu Ala Phe Glu Leu Leu Ala Lys Glu Leu Glu Gln His Val Lys Glu
305 310 315 320
Gly Thr Thr Thr Ala Pro Phe Lys Glu Gln His Arg Ile Met Phe Glu
325 330 335
Gly Ile Pro Cys Trp Pro Lys Leu Pro Asn Leu Phe Lys Pro Leu Lys
340 345 350
Ala Asn Gly Leu Asn Ile Thr Gly Val Val Tyr Ala Pro Ala Phe Gly
355 360 365
Phe Val Tyr Asn Asn Leu Asp Glu Leu Val Lys Ala Tyr Cys Lys Ala
370 375 380
Pro Asn Ser Val Ser Ile Glu Gln Gly Val Ala Trp Arg Glu Gly Leu
385 390 395 400
Ile Arg Asp Asn Lys Val Asp Gly Val Leu Val His Tyr Asn Arg Ser
405 410 415
Cys Lys Pro Trp Ser Gly Tyr Met Pro Glu Met Gln Arg Arg Phe Thr
420 425 430
Lys Asp Met Gly Ile Pro Thr Ala Gly Phe Asp Gly Asp Gln Ala Asp
435 440 445
Pro Arg Asn Phe Asn Ala Ala Gln Tyr Glu Thr Arg Val Gln Gly Leu
450 455 460
Val Glu Ala Met Glu Ala Asn Asp Glu Lys Lys Gly Lys
465 470 475
<210> 97
<211> 379
<212> PRT
<213> Acidaminococcus fermentans
<400> 97
Met Ala Ile Ser Ala Leu Ile Glu Glu Phe Gln Lys Val Ser Ala Ser
1 5 10 15
Pro Lys Thr Met Leu Ala Lys Tyr Lys Ala Gln Gly Lys Lys Ala Ile
20 25 30
Gly Cys Leu Pro Tyr Tyr Val Pro Glu Glu Leu Val Tyr Ala Ala Gly
35 40 45
Met Val Pro Met Gly Val Trp Gly Cys Asn Gly Lys Gln Glu Val Arg
50 55 60
Ser Lys Glu Tyr Cys Ala Ser Phe Tyr Cys Thr Ile Ala Gln Gln Ser
65 70 75 80
Leu Glu Met Leu Leu Asp Gly Thr Leu Asp Gly Leu Asp Gly Ile Ile
85 90 95
Thr Pro Val Leu Cys Asp Thr Leu Arg Pro Met Ser Gln Asn Phe Lys
100 105 110
Val Ala Met Lys Asp Lys Met Pro Val Ile Phe Leu Ala His Pro Gln
115 120 125
Val Arg Gln Asn Ala Ala Gly Lys Gln Phe Thr Tyr Asp Ala Tyr Ser
130 135 140
Glu Val Lys Gly His Leu Glu Glu Ile Cys Gly His Glu Ile Thr Asn
145 150 155 160
Asp Ala Ile Leu Asp Ala Ile Lys Val Tyr Asn Lys Ser Arg Ala Ala
165 170 175
Arg Arg Glu Phe Cys Lys Leu Ala Asn Glu His Pro Asp Leu Ile Pro
180 185 190
Ala Ser Val Arg Ala Thr Val Leu Arg Ala Ala Tyr Phe Met Leu Lys
195 200 205
Asp Glu Tyr Thr Glu Lys Leu Glu Glu Leu Asn Lys Glu Leu Ala Ala
210 215 220
Ala Pro Ala Gly Lys Phe Asp Gly His Lys Val Val Val Ser Gly Ile
225 230 235 240
Ile Tyr Asn Met Pro Gly Ile Leu Lys Ala Met Asp Asp Asn Lys Leu
245 250 255
Ala Ile Ala Ala Asp Asp Cys Ala Tyr Glu Ser Arg Ser Phe Ala Val
260 265 270
Asp Ala Pro Glu Asp Leu Asp Asn Gly Leu Gln Ala Leu Ala Val Gln
275 280 285
Phe Ser Lys Gln Lys Asn Asp Val Leu Leu Tyr Asp Pro Glu Phe Ala
290 295 300
Lys Asn Thr Arg Ser Glu His Val Cys Asn Leu Val Lys Glu Ser Gly
305 310 315 320
Ala Glu Gly Leu Ile Val Phe Met Met Gln Phe Cys Asp Pro Glu Glu
325 330 335
Met Glu Tyr Pro Asp Leu Lys Lys Ala Leu Asp Ala His His Ile Pro
340 345 350
His Val Lys Ile Gly Val Asp Gln Met Thr Arg Asp Phe Gly Gln Ala
355 360 365
Gln Thr Ala Leu Glu Ala Phe Ala Glu Ser Leu
370 375
<210> 98
<211> 260
<212> PRT
<213> Acidaminococcus fermentans
<400> 98
Met Ser Ile Tyr Thr Leu Gly Ile Asp Val Gly Ser Thr Ala Ser Lys
1 5 10 15
Cys Ile Ile Leu Lys Asp Gly Lys Glu Ile Val Ala Lys Ser Leu Val
20 25 30
Ala Val Gly Thr Gly Thr Ser Gly Pro Ala Arg Ser Ile Ser Glu Val
35 40 45
Leu Glu Asn Ala His Met Lys Lys Glu Asp Met Ala Phe Thr Leu Ala
50 55 60
Thr Gly Tyr Gly Arg Asn Ser Leu Glu Gly Ile Ala Asp Lys Gln Met
65 70 75 80
Ser Glu Leu Ser Cys His Ala Met Gly Ala Ser Phe Ile Trp Pro Asn
85 90 95
Val His Thr Val Ile Asp Ile Gly Gly Gln Asp Val Lys Val Ile His
100 105 110
Val Glu Asn Gly Thr Met Thr Asn Phe Gln Met Asn Asp Lys Cys Ala
115 120 125
Ala Gly Thr Gly Arg Phe Leu Asp Val Met Ala Asn Ile Leu Glu Val
130 135 140
Lys Val Ser Asp Leu Ala Glu Leu Gly Ala Lys Ser Thr Lys Arg Val
145 150 155 160
Ala Ile Ser Ser Thr Cys Thr Val Phe Ala Glu Ser Glu Val Ile Ser
165 170 175
Gln Leu Ser Lys Gly Thr Asp Lys Ile Asp Ile Ile Ala Gly Ile His
180 185 190
Arg Ser Val Ala Ser Arg Val Ile Gly Leu Ala Asn Arg Val Gly Ile
195 200 205
Val Lys Asp Val Val Met Thr Gly Gly Val Ala Gln Asn Tyr Gly Val
210 215 220
Arg Gly Ala Leu Glu Glu Gly Leu Gly Val Glu Ile Lys Thr Ser Pro
225 230 235 240
Leu Ala Gln Tyr Asn Gly Ala Leu Gly Ala Ala Leu Tyr Ala Tyr Lys
245 250 255
Lys Ala Ala Lys
260
<210> 99
<211> 336
<212> PRT
<213> Carboxydothermus hydrogenoformans
<400> 99
Met Lys Leu Asn Tyr Phe Cys Ser Tyr Trp Pro Val Glu Ile Ser Glu
1 5 10 15
Gly Ala Gly Ile Ser Thr Val Arg Tyr Phe Pro Ser Asp Glu Ser Lys
20 25 30
Ala Pro Val Arg Leu Pro Ala Tyr Cys Cys Ser Tyr Ala Arg Gly Ser
35 40 45
Leu Ala Glu Ile Glu Glu Glu Gly Asp Gly Asp Phe Trp Gly Phe Ala
50 55 60
His Ser Cys Asp Thr Met Gln Ser Leu Tyr Gly Ile Thr Lys Ser Leu
65 70 75 80
Leu Gly Asp Asp Arg Val Phe Leu Phe Val Pro Pro Val Asp Leu Thr
85 90 95
Thr Ala Phe Ala Arg Glu Tyr Tyr Arg Glu Ala Leu Ile Tyr Leu Trp
100 105 110
Arg Glu Leu Ser Gln Lys Ser Gly Val Asn Gly Glu Glu Lys Leu Lys
115 120 125
Leu Thr Trp Glu Lys Leu Lys Glu Leu Arg Asn Lys Val Lys Ser Leu
130 135 140
Glu Asn Leu Thr Ser Ile Ile Pro Ser Ser Glu Ile Phe Glu Leu Leu
145 150 155 160
Lys Lys Leu Gln Thr Leu Pro Leu Asp Glu Ala Leu Asp Tyr Leu Glu
165 170 175
Ala Lys Lys Ala Glu Phe Thr Ser Leu Ser Val Ala Gln Lys Ala Ile
180 185 190
Gly Ile Ile Leu Thr Gly Ala Val Val Thr Asn Ser Lys Leu Tyr Leu
195 200 205
Ala Leu Glu Gln Gln Gly Phe Arg Val Val Tyr Asp Asp Thr Cys Thr
210 215 220
Gly Phe Arg His Phe Ala Gly Glu Ile Glu Asp Lys Asp Asp Ile Leu
225 230 235 240
Glu Ala Ile Val Ser Tyr Tyr Leu Ser Lys Pro Pro Cys Pro Cys Arg
245 250 255
His Lys Gly Val Trp Ala Arg Ala Glu Tyr Leu Lys Asn Leu Tyr His
260 265 270
Asn Lys Asn Ala Arg Ala Ile Val Leu Leu Gln Asn Lys Phe Cys Asp
275 280 285
Pro Phe Ala Trp Asp Val Pro Tyr Leu Val Asp Tyr Phe Lys Lys Gln
290 295 300
Gly Val Pro Val Leu Val Leu Glu Val Glu Gly Gly Glu Ile Gly Glu
305 310 315 320
Gln Asn Lys Thr Arg Leu Gln Ala Phe Arg Glu Ser Val Gly Gly Val
325 330 335
<210> 100
<211> 404
<212> PRT
<213> Carboxydothermus hydrogenoformans
<400> 100
Met Ala Lys Lys Ile Phe Lys Pro Leu Lys Ala Ser Glu Lys Ile Asn
1 5 10 15
Lys Ile Leu Lys Asn His Tyr Leu Lys Ala Lys Tyr Leu Pro Thr Leu
20 25 30
Gly Lys Phe Phe Gly Tyr Lys Thr Ala Trp Ile Thr Ser Gly Ala Pro
35 40 45
Val Glu Leu Leu Arg Ala Phe Gly Ile Glu Pro Val Tyr Pro Glu Asn
50 55 60
Tyr Gly Ala Ile Cys Gly Ala Arg Lys Val Ser Pro Ser Leu Cys Gln
65 70 75 80
Val Ala Glu Asn Arg Gly Tyr Ser Leu Asp Leu Cys Ser Tyr Ala Lys
85 90 95
Ser Asn Leu Gly Ser Ile Trp Asn Pro Lys Glu Ser Pro Phe Asn Gly
100 105 110
Leu Pro Arg Pro Asp Leu Leu Val Val Cys Asn Asn Ile Cys Gly Thr
115 120 125
Val Leu Lys Trp Tyr Glu Thr Leu Ser Arg Glu Phe Asn Ile Pro Leu
130 135 140
Phe Ile Ile Asp Thr Pro Phe Ile Thr Gly Glu Pro Gln Pro Trp Gln
145 150 155 160
Ile Gln Tyr Val Ala Lys Gln Ile Glu Lys Leu Ala Ile Glu Leu Glu
165 170 175
Lys Phe Phe Arg Lys Lys Leu Asp Leu Asn Arg Leu Glu Lys Val Ile
180 185 190
Leu Leu Ala Asn Glu Thr Val Asp Leu Trp Lys Gly Ile Arg Asn Phe
195 200 205
Ala Lys Asn Lys Pro Ser Pro Val Asn Val Thr Asp Leu Phe Ile Asn
210 215 220
Leu Gly Pro Met Val Val Leu Arg Gly Thr Glu Val Ala Arg Asp Phe
225 230 235 240
Tyr Glu Glu Val Tyr Arg Glu Val Glu Glu Arg Tyr Lys Ala Gly Val
245 250 255
Pro Ala Val Glu Gly Glu Lys Tyr Arg Leu Val Trp Asp Asn Ile Pro
260 265 270
Ile Trp Tyr Gly Leu Tyr Arg Phe Tyr Gly Tyr Phe Ala Glu Arg Gly
275 280 285
Ala Val Phe Val Thr Asp Ser Tyr Thr Gly Gly Trp Ala Val Asn Ile
290 295 300
Lys Lys Gly Pro Pro Phe Tyr Ala Leu Ala Glu Thr Tyr Ala Gly Val
305 310 315 320
Phe Leu Asn Arg Asp Leu Glu Phe Arg Lys Asn Gln Leu Gln Ser Phe
325 330 335
Ile Glu Glu Phe Ser Ala Asp Gly Phe Val Met His Ser Asn Arg Ser
340 345 350
Cys Lys Ala Tyr Ser Phe Val Gln Glu Glu Ile Arg Arg Gln Ile Met
355 360 365
Arg Ser Leu Gly Val Pro Gly Leu Ile Val Asp Ala Asp Met Thr Asp
370 375 380
Ser Arg Leu Tyr Ser Glu Glu Thr Val Leu Asn Arg Val Gln Ala Phe
385 390 395 400
Leu Glu Ser Leu
<210> 101
<211> 254
<212> PRT
<213> Carboxydothermus hydrogenoformans
<400> 101
Met Tyr Leu Gly Val Asp Ile Gly Ser Leu Thr Thr Lys Val Val Leu
1 5 10 15
Ile Asp Arg Gly Lys Asn Leu Ile Ala Tyr Arg Tyr Ser Lys Thr Gly
20 25 30
Pro Ala Gly Lys Glu Thr Ala Glu Arg Leu Ile Gln Glu Val Leu Ile
35 40 45
Lys Ala Asn Ile Ser Arg Asp Asp Ile Gln Gly Ile Val Ala Thr Gly
50 55 60
Tyr Gly Arg Val Leu Phe Ser Gly Lys Glu Phe Ser Glu Ile Thr Cys
65 70 75 80
Gln Ala Arg Gly Ile Gly His Leu Tyr Pro Glu Ala Lys Thr Ile Ile
85 90 95
Asp Ile Gly Gly Gln Asp Ser Lys Val Ile Ser Leu Gly Lys Asn Gly
100 105 110
Lys Val Leu Asp Phe Ala Met Asn Asp Lys Cys Ala Ala Gly Thr Gly
115 120 125
Arg Phe Leu Glu Val Met Ser Gln Ala Leu Glu Val Arg Leu Glu Glu
130 135 140
Ile Gly Glu Leu Ala Glu Lys Ser Gln Glu Ala Ala Lys Ile Ser Ser
145 150 155 160
Val Cys Thr Val Phe Ala Glu Ser Glu Val Ile Ser Asn Leu Ser Arg
165 170 175
Gly Gln Ser Arg Glu Ala Val Ala Arg Gly Ile Cys Glu Ala Val Ala
180 185 190
Ala Arg Thr Ala Ile Leu Ala Gln Lys Val Gly Val Val Glu Pro Val
195 200 205
Val Phe Thr Gly Gly Val Ala Lys Asn Thr Gly Val Val Ala Ala Leu
210 215 220
Glu Arg Lys Leu Gly Val Lys Leu Leu Ile Pro Glu Asp Ser Thr Ile
225 230 235 240
Thr Ala Ala Leu Gly Ala Ala Leu Leu Ala Ala Glu Asn Ser
245 250
<210> 102
<211> 261
<212> PRT
<213> Oscillibacter valericigenes
<400> 102
Met Asn Asn Ile Tyr Thr Met Gly Ile Asp Val Gly Ser Thr Ala Ser
1 5 10 15
Lys Cys Leu Ile Leu Lys Asp Gly Ser Glu Ile Val Ala Lys Ser Leu
20 25 30
Val Asp Val Gly Ala Gly Thr Ser Gly Pro Thr Arg Ala Ile Ala Glu
35 40 45
Val Leu Glu Ala Ala Gly Met Lys Lys Glu Asp Met Ala Phe Ile Leu
50 55 60
Ala Thr Gly Tyr Gly Arg Asn Ser Leu Asp Asp Ile Ala Asp His Gln
65 70 75 80
Met Ser Glu Leu Ser Cys His Ala Lys Gly Ala Phe Phe Leu Phe Pro
85 90 95
Asp Val His Thr Val Ile Asp Ile Gly Gly Gln Asp Val Lys Ile Leu
100 105 110
Glu Ile Glu Asn Gly Val Met Val Asn Phe Ala Met Asn Asp Lys Cys
115 120 125
Ala Ala Gly Thr Gly Arg Phe Leu Asp Val Met Ala Arg Val Leu Glu
130 135 140
Val Lys Val Glu Asp Leu Ala Asp Leu Gly Ala Gln Ser Thr Lys Asn
145 150 155 160
Val Glu Ile Ser Ser Thr Cys Thr Val Phe Ala Glu Ser Glu Val Ile
165 170 175
Ser Gln Leu Ala Lys Gly Ser Asp Lys Arg Asp Ile Ile His Gly Ile
180 185 190
His Lys Ser Val Ala Ser Arg Val Val Gly Leu Ala Asn Arg Ile Gly
195 200 205
Val Arg Asp Ala Val Val Met Thr Gly Gly Val Ala Gln Asn Gly Gly
210 215 220
Val Val Ser Ala Leu Gln Glu Ala Leu Gly His Pro Ile His Thr Ser
225 230 235 240
Pro Leu Thr Gln Tyr Asn Gly Ala Leu Gly Ala Ala Leu Phe Ala Trp
245 250 255
Gln Lys Ala Thr Lys
260
<210> 103
<211> 427
<212> PRT
<213> Oscillibacter valericigenes
<400> 103
Met Ala Glu Asn Glu Lys Ala Thr Ala Ala Ala Pro Glu Ala Ala Pro
1 5 10 15
Val Lys Lys Ala Pro Lys Pro Val Ser Pro Gly Thr Gln Ala Leu Arg
20 25 30
Asp Val Val Thr Lys Val Tyr Ala Ala Ala Trp Asp Ala Lys Lys Ala
35 40 45
Gly Arg Pro Val Gly Trp Ser Ser Ser Lys Phe Pro Cys Glu Ile Ala
50 55 60
Glu Ala Leu Gly Leu Ala Val Val Tyr Pro Glu Asn Gln Ala Ala Gly
65 70 75 80
Ile Gly Ala Gln His Asp Gly Gln Arg Met Cys Glu Ser Ala Glu Ser
85 90 95
Leu Gly Phe Asp Pro Asp Ile Cys Gly Tyr Ala Arg Ile Ser Leu Ala
100 105 110
Tyr Ser Ala Gly Val Glu Thr Thr Asn Glu Ser Arg Arg Val Pro Met
115 120 125
Pro Asp Phe Val Leu Cys Cys Asn Asn Ile Cys Asn Cys Met Thr Lys
130 135 140
Trp Tyr Glu Asn Ile Ala Arg Met His Asn Ile Pro Leu Ile Met Ile
145 150 155 160
Asp Val Pro Tyr Asn Asn Glu Val Thr Val Ser Asp Ser Gln Val Ala
165 170 175
Tyr Ile Arg Gly Gln Phe Asp Asp Ala Ile Lys Gln Met Glu Lys Ile
180 185 190
Ala Gly Val Lys Phe Asp Glu Lys Lys Phe Glu Gln Ala Cys Ala Asn
195 200 205
Ala Asn Arg Thr Ala Lys Ala Trp Leu Thr Val Cys Asp Tyr Leu Gln
210 215 220
Tyr Lys Pro Ala Pro Met Ser Gly Phe Asp Leu Phe Asn His Met Ala
225 230 235 240
Asp Val Val Thr Ala Arg Gly Lys Val Glu Thr Ala Glu Ala Phe Glu
245 250 255
Leu Leu Ala Ser Glu Leu Glu Gln His Val Lys Asn Gly Thr Ser Thr
260 265 270
Ala Pro Phe Pro Glu Gln Tyr Arg Val Met Phe Glu Gly Ile Pro Cys
275 280 285
Trp Pro Asn Leu Arg Thr Leu Phe Lys Pro Leu Lys Ala Asn Gly Val
290 295 300
Asn Val Thr Ala Val Val Tyr Ala Pro Ala Phe Gly Phe Val Tyr Asn
305 310 315 320
Gly Leu Asp Glu Met Ala Arg Ala Tyr Cys Lys Ala Pro Asn Ser Val
325 330 335
Cys Ile Glu Gln Gly Val Asp Trp Arg Glu Gly Ile Cys Arg Glu Asn
340 345 350
Lys Val Asp Gly Val Leu Val His Tyr Asn Arg Ser Cys Lys Pro Trp
355 360 365
Ser Gly Tyr Met Ala Glu Met Gln Arg Arg Phe Thr Lys Asp Leu Gly
370 375 380
Val Pro Cys Ala Gly Phe Asp Gly Asp Gln Ala Asp Pro Arg Asn Phe
385 390 395 400
Asn Glu Ala Gln Tyr Glu Thr Arg Val Gln Gly Leu Val Glu Ala Met
405 410 415
Glu Glu Asn Lys Lys Gln Lys Glu Ala Arg Ala
420 425
<210> 104
<211> 380
<212> PRT
<213> Oscillibacter valericigenes
<400> 104
Met Ser Ile Glu Thr Ile Val Lys Glu Phe Ala Asp Val Ala Ala Asp
1 5 10 15
Pro Lys Ala Gln Leu Lys Lys Tyr Lys Ala Glu Gly Lys Lys Cys Ile
20 25 30
Gly Val Met Pro Tyr Tyr Ala Pro Glu Glu Leu Val Ala Ala Ala Gly
35 40 45
Met Val Pro Phe Gly Met Trp Gly Ser Asn Asp Lys Thr Ile Ser Arg
50 55 60
Ala Lys Glu Tyr Cys Ala Thr Phe Tyr Cys Thr Ile Ala Gln Leu Asp
65 70 75 80
Leu Glu Met Leu Leu Asp Gly Thr Met Asp Leu Leu Asp Gly Val Ile
85 90 95
Thr Pro Thr Ile Cys Asp Thr Leu Arg Pro Met Ser Gln Asn Ile Arg
100 105 110
Val Ala Met Gly Glu Lys Leu Pro Cys Ile Phe Leu Ala His Pro Gln
115 120 125
Asn Arg Lys Pro Ala Tyr Gly Lys Lys Phe Cys Leu Asp Gln Tyr Thr
130 135 140
His Ile Lys Thr Glu Leu Glu Lys Ile Ala Gly Ala Pro Ile Thr Asp
145 150 155 160
Ala Ala Leu Ser Glu Thr Ile Lys Val Tyr Asn Lys Ser Arg Ala Ala
165 170 175
Arg Arg Glu Phe Val Lys Leu Val Ser Asp His Cys Asp Val Ile Thr
180 185 190
Pro Thr Lys Arg Ser Ala Val Leu Lys Ala Ala Trp Phe Met Pro Lys
195 200 205
Ala Glu Tyr Thr Glu Lys Leu Lys Ala Leu Asn Ala Glu Leu Lys Ala
210 215 220
Leu Pro Val Cys Asp Trp Lys Gly Thr Lys Val Val Thr Ser Gly Ile
225 230 235 240
Ile Cys Asp Asn Pro Lys Leu Leu Glu Ile Phe Glu Glu Asn Lys Ile
245 250 255
Ala Ile Ala Ala Asp Asp Val Ala His Glu Ser Arg Ser Phe Arg Val
260 265 270
Asp Ala Pro Glu Thr Gly Asp Pro Met Glu Ala Leu Ala Gln Gln Phe
275 280 285
Ala Asn Gln Asp Tyr Asp Val Leu Leu Tyr Asp Glu His Ser Ser Glu
290 295 300
Asn Arg Arg Gly Glu Phe Val Ala Lys Leu Val Lys Asp Ser Gly Ala
305 310 315 320
Lys Gly Leu Val Leu Phe Met Gln Gln Phe Cys Asp Pro Glu Glu Met
325 330 335
Glu Tyr Pro Ser Leu Lys Lys Ala Leu Asp Glu Ala Lys Ile Pro His
340 345 350
Ile Lys Leu Gly Val Asp Gln Gln Met Arg Asp Phe Gly Gln Ala Arg
355 360 365
Thr Ala Ile Gln Ala Phe Ala Asp Val Ile Ser Leu
370 375 380
<210> 105
<211> 422
<212> PRT
<213> Desulfosporosinus orientis (strain ATCC 19365 / DSM 765 / NCIMB 8382 / VKM B-1628)
<400> 105
Met Thr Asp Thr Ala Asn Met Ser Ala Lys Glu Leu Leu Gly Phe Tyr
1 5 10 15
Gln Glu Glu Leu Tyr Glu Glu Ala Arg Gln Ala Lys Lys Glu Gly Lys
20 25 30
Leu Val Cys Trp Ser Ala Ser Val Ala Pro Ser Glu Phe Cys Val Ala
35 40 45
Met Asp Val Ala Met Ile Tyr Pro Glu Thr His Ala Ala Gly Ile Gly
50 55 60
Ala Arg Lys Gly Ala Leu Asp Met Leu Glu Val Ala Asp Glu Lys Gly
65 70 75 80
Tyr Asn Leu Asp Thr Cys Ser Tyr Ala Arg Val Asn Leu Gly Tyr Met
85 90 95
Glu Leu Leu Lys Gln Glu Ala Leu Thr Gly Ile Thr Pro Glu Lys Leu
100 105 110
Glu Lys Ser Pro Ala Ala Arg Val Pro Leu Pro Asp Phe Val Ile Thr
115 120 125
Cys Asn Asn Ile Cys Asn Thr Leu Leu Lys Trp Tyr Glu Asn Leu Ala
130 135 140
Val Glu Leu Asn Ile Pro Cys Ile Val Ile Asp Val Pro Phe Asn His
145 150 155 160
Thr Met Pro Ile Pro Gln Tyr Ala Lys Asp Tyr Ile Ala Glu Gln Phe
165 170 175
Lys Glu Ala Ile Ala Gln Leu Glu Glu Ile Cys Gly Lys Lys Phe Asp
180 185 190
Tyr Asp Lys Phe Leu Gln Val Gln Glu Gln Thr Gln Arg Ser Val Ala
195 200 205
Gln Trp Asn Arg Ile Ala Ser Leu Ser Gly His Lys Pro Ser Pro Leu
210 215 220
Asn Gly Phe Asp Leu Phe Asn Tyr Met Ala Leu Ile Val Cys Ala Arg
225 230 235 240
Ser Arg Asp Cys Ala Glu Ile Thr Phe Lys Lys Phe Ala Asp Glu Leu
245 250 255
Glu Asp Asn Leu Ser Lys Gly Ile Tyr Ala Phe Lys Gly Asn Glu Gln
260 265 270
Lys Arg Ile Thr Trp Glu Gly Ile Ala Val Trp Pro His Leu Gly His
275 280 285
Thr Phe Lys Gly Leu Lys Asn Leu Gly Asn Ile Met Thr Gly Ser Ala
290 295 300
Tyr Pro Gly Leu Trp Asn Leu Ser Tyr Thr Pro Gly Asp Met Ser Ser
305 310 315 320
Met Ala Glu Ala Tyr Thr Arg Ile Tyr Ile Asn Thr Cys Leu Asp Asn
325 330 335
Lys Val Lys Val Leu Ser Asp Ile Ile Ser Gly Gly Lys Cys Asp Gly
340 345 350
Val Ile Tyr His Gln Asn Arg Ser Cys Lys Leu Met Ser Phe Leu Asn
355 360 365
Val Glu Thr Ala Asp Ile Leu Gln Gln Gln Asn His Leu Pro Tyr Val
370 375 380
Ser Phe Asp Gly Asp Gln Thr Asp Pro Arg Asn Phe Ala Pro Ala Gln
385 390 395 400
Phe Asp Thr Arg Ile Gln Ala Leu Asp Glu Met Met Lys Gln Asn Lys
405 410 415
Glu Gly Val Ser His Glu
420
<210> 106
<211> 372
<212> PRT
<213> Desulfosporosinus orientis (strain ATCC 19365 / DSM 765 / NCIMB 8382 / VKM B-1628)
<400> 106
Met Ser Arg Ile Glu Ala Ile Ile Ser Glu Leu Ser Ser Ile Ala Asn
1 5 10 15
Asn Pro Arg Lys Ala Met Glu Asp Tyr Lys Lys Glu Thr Gly Lys Gly
20 25 30
Ser Val Gly Ile Met Pro Tyr Tyr Ala Pro Glu Glu Ile Val His Ala
35 40 45
Ala Gly Tyr Leu Pro Val Gly Ile Trp Gly Gly Gln Lys Ser Ile Ser
50 55 60
Lys Ala Arg Ala Tyr Leu Pro Pro Phe Ala Cys Ser Ile Met Gln Ser
65 70 75 80
Val Val Glu Met Gln Leu Glu Gly Val Tyr Asn Asp Leu Ala Ala Val
85 90 95
Leu Phe Pro Val Pro Cys Asp Thr Leu Lys Cys Leu Ser Gln Lys Trp
100 105 110
Lys Gly Thr Ser Pro Val Ile Val Met Thr His Pro Gln Asn Arg Lys
115 120 125
Leu Glu Ala Ala Asn Lys Phe Leu Ala Glu Glu Tyr Arg Leu Val Arg
130 135 140
Glu Lys Leu Glu Lys Ile Leu Asn Val Gln Ile Thr Asp Glu Ala Leu
145 150 155 160
Asn His Ser Ile Asp Val Tyr Asn Glu Asn Arg Lys Ala Met Arg Glu
165 170 175
Phe Thr Asp Ile Ala Ala Asn Tyr Leu Asn Ile Ile Asp Pro Arg Lys
180 185 190
Arg His Glu Ile Ile Lys Ala Arg Phe Phe Met Glu Lys Ser Lys His
195 200 205
Thr Ala Leu Val Lys Glu Leu Asn Ser Glu Leu Lys Ser Leu Pro Val
210 215 220
Glu Asp Phe Thr Gly Lys Lys Val Ile Leu Thr Gly Ile Met Ala Glu
225 230 235 240
Pro Asn Glu Val Leu Asp Ile Leu Lys Glu Asn Asp Phe Ala Val Val
245 250 255
Ala Asp Asp Leu Ala Gln Glu Ser Arg Leu Phe Arg Ile Asp Val Pro
260 265 270
Ala Gly Pro Asp Pro Leu Tyr Arg Leu Ala Lys Trp Trp Gln Glu Phe
275 280 285
Asp Gly Cys Ser Leu Ala Val Asp Thr Lys Lys Leu Arg Gly Pro Met
290 295 300
Leu Met Asn Met Val Asn Val Asp Lys Ala Asp Ala Val Val Val Cys
305 310 315 320
Met Met Lys Phe Cys Asp Pro Glu Glu Phe Asp Tyr Pro Ile Tyr Tyr
325 330 335
Arg Gln Phe Glu Glu Ala Gly Ile Lys Ser Leu Phe Ile Glu Ile Asp
340 345 350
Leu Glu Pro Thr Ser Phe Glu Gln Thr Lys Thr Arg Val Gln Ser Phe
355 360 365
Arg Glu Met Leu
370
<210> 107
<211> 266
<212> PRT
<213> Desulfosporosinus orientis (strain ATCC 19365 / DSM 765 / NCIMB 8382 / VKM B-1628)
<400> 107
Met Tyr Thr Met Gly Ile Asp Ile Gly Ser Ser Ser Ser Lys Val Val
1 5 10 15
Ile Leu Glu Asp Gly Val Asn Leu Ile Ala Gly Glu Val Ile Gln Ile
20 25 30
Gly Thr Gly Ser Thr Gly Pro Lys Arg Val Leu Glu Glu Ala Leu Ala
35 40 45
Lys Thr Gly Leu Thr Leu Ala Asp Met Ala Lys Ile Ile Ala Thr Gly
50 55 60
Tyr Gly Arg Ser Ser Val Glu Val Ser Asp Lys Gln Ile Ser Glu Ile
65 70 75 80
Ser Cys Gln Ala Lys Gly Val Tyr Phe Leu Val Pro Thr Ala Lys Leu
85 90 95
Ile Ile Asp Ile Gly Gly Gln Asp Val Lys Ala Ile Arg Leu Asp Arg
100 105 110
Ile Gly Gly Val Arg Gln Phe Phe Met Asn Asp Lys Cys Ala Ala Gly
115 120 125
Thr Gly Arg Phe Leu Asp Val Met Ser Arg Val Leu Glu Val Asp Leu
130 135 140
Asp Glu Met Ala Glu Tyr Asp Ala Arg Ala Thr Glu Pro Ala Thr Val
145 150 155 160
Ser Ser Thr Cys Thr Val Phe Ala Glu Ser Glu Val Ile Ser Gln Leu
165 170 175
Ala Asn Gly Val Ala Lys Glu Asn Ile Ile Ala Gly Val His Gln Ser
180 185 190
Val Ala Ser Lys Ala Cys Gly Leu Ala Tyr Arg Cys Gly Val Glu Glu
195 200 205
Asp Val Val Met Cys Gly Gly Val Ala Lys Asp Leu Gly Val Val Arg
210 215 220
Ala Ile Ser Lys Glu Leu Lys Lys Pro Val Ile Val Ala Pro Asn Pro
225 230 235 240
Gln Ile Thr Ala Ala Leu Gly Ala Ala Leu Phe Ala Tyr Glu Glu Val
245 250 255
Met Glu Ala Asn Lys Leu Arg Lys Glu Val
260 265
<210> 108
<211> 411
<212> PRT
<213> Peptostreptococcus anaerobius CAG:621
<400> 108
Met Ser Asn Thr Gly Ala Val Glu Glu Lys Pro Ala Lys Val Leu Leu
1 5 10 15
Gly Glu Ile Val Ala Lys His Tyr Lys Glu Ala Trp Glu Ala Lys Glu
20 25 30
Arg Gly Glu Lys Val Gly Trp Cys Ala Ser Asn Phe Pro Gln Glu Ile
35 40 45
Phe Glu Thr Met Asp Ile Lys Val Val Phe Pro Glu Asn Gln Ala Ala
50 55 60
Ala Ile Ser Ala Lys Gly Gly Gly Gln Arg Met Cys Glu Ile Ala Glu
65 70 75 80
Asn Glu Gly Tyr Ser Asn Asp Ile Cys Ala Tyr Ala Arg Ile Ser Leu
85 90 95
Ala Tyr Met Asp Val Lys Asp Ala Pro Glu Leu Asn Met Pro Gln Pro
100 105 110
Asp Phe Val Ala Cys Cys Asn Asn Ile Cys Asn Cys Met Ile Lys Trp
115 120 125
Tyr Glu Asn Ile Ala Lys Glu Leu Asn Ile Pro Leu Ile Leu Val Asp
130 135 140
Val Pro Tyr Asn Asn Asp Tyr Glu Ala Gly Asp Asp Arg Val Glu Tyr
145 150 155 160
Leu Arg Gly Gln Phe Asp His Ala Ile Lys Gln Leu Glu Asp Leu Thr
165 170 175
Gly Lys Lys Trp Asp Glu Lys Lys Phe Glu Glu Val Met Ala Ile Ser
180 185 190
Gln Arg Thr Gly Arg Ala Trp Leu Lys Ala Thr Gly Tyr Ala Lys Tyr
195 200 205
Thr Pro Ser Pro Phe Ser Gly Phe Asp Val Phe Asn His Met Ala Val
210 215 220
Ala Val Cys Ala Arg Gly Lys Glu Glu Ser Ala Ile Ala Phe Glu Lys
225 230 235 240
Leu Ala Glu Glu Phe Asp Glu Asn Val Lys Thr Gly Lys Ser Thr Phe
245 250 255
Lys Gly Glu Glu Lys Tyr Arg Val Leu Phe Glu Gly Ile Ala Cys Trp
260 265 270
Pro His Leu Arg His Thr Phe Lys Gln Leu Lys Asp Ser Gly Val Asn
275 280 285
Val Cys Gly Thr Val Tyr Ala Asp Ala Phe Gly Tyr Ile Tyr Asp Asn
290 295 300
Thr Tyr Glu Leu Met Gln Ala Tyr Cys Gly Thr Pro Asn Ala Ile Ser
305 310 315 320
Tyr Glu Arg Ser Leu Asp Met Arg Leu Lys Val Ile Glu Glu Asn Asn
325 330 335
Ile Asp Gly Met Leu Ile His Ile Asn Arg Ser Cys Lys Gln Trp Ser
340 345 350
Gly Ile Met Tyr Glu Met Glu Arg Glu Ile Arg Glu Arg Thr Gly Ile
355 360 365
Pro Thr Ala Thr Phe Asp Gly Asp Gln Ala Asp Pro Arg Asn Phe Ser
370 375 380
Glu Ala Gln Tyr Asp Thr Arg Val Gln Gly Leu Ile Glu Val Met Glu
385 390 395 400
Ala Asn Lys Ala Ala Lys Met Lys Glu Glu Asn
405 410
<210> 109
<211> 372
<212> PRT
<213> Peptostreptococcus anaerobius CAG:621
<400> 109
Met Ser Asn Leu Glu Glu Leu Phe Gly Lys Leu Ala Val Cys Pro Leu
1 5 10 15
Glu Gln Ile Asp Lys Tyr Val Ala Asp Gly Lys Lys Val Ile Gly Cys
20 25 30
Ala Pro Val Tyr Ala Pro Glu Glu Leu Val Tyr Ala Ser Gly Met Ile
35 40 45
Pro Met Ala Ile Trp Gly Ala Glu Gly Glu Val Thr Leu Ala Lys Glu
50 55 60
Tyr Phe Pro Ala Phe Tyr Val Ser Ile Ile Leu Arg Leu Leu Asp Leu
65 70 75 80
Gly Leu Glu Gly Lys Leu Asp Lys Met Ser Gly Met Ile Leu Pro Gly
85 90 95
Leu Ser Asp Gly Leu Lys Gly Leu Ser Gln Asn Trp Lys Arg Ala Val
100 105 110
Lys Asn Val Pro Ala Leu Tyr Ile Gly Tyr Gly Gln Asn Arg Lys Ile
115 120 125
Glu Ala Gly Ile Val Tyr Asn Ala Arg Gln Tyr Glu Lys Leu Lys Val
130 135 140
Gln Leu Glu Glu Ile Ala Gly Lys Lys Ile Glu Asp Ala Gln Ile Glu
145 150 155 160
Glu Ala Ile Val Leu Tyr Asn Lys His Arg Lys Ala Met Gln Ala Phe
165 170 175
Ser Asp Leu Ala Ala Lys His Leu Asn Thr Val Thr Pro Ser Leu Arg
180 185 190
Ala Lys Val Met Ser Ser Ala Cys Leu Met Asp Lys Ala Glu His Leu
195 200 205
Glu Ile Val Glu Ala Ile Asn Ala Glu Leu Ser Ala Met Pro Glu Glu
210 215 220
Lys Phe Asp Gly Lys Lys Ile Val Thr Thr Gly Leu Leu Ala Asn Ser
225 230 235 240
Pro Glu Ile Leu Lys Ile Phe Glu Glu Phe Lys Leu Gly Ile Val Ala
245 250 255
Asp Asn Ile Asn His Glu Ser Gly Gln Phe Asp Tyr Leu Val Asp Glu
260 265 270
Ala Thr Gly Asn Pro Ile Lys Ala Leu Ser Lys Trp Ile Ser Asp Ile
275 280 285
Glu Gly Ser Thr Leu Leu Tyr Asp Pro Glu Lys Leu Arg Gly Gln Ile
290 295 300
Ile Ile Asp Lys Ala Lys Lys Tyr Asp Ala Asp Gly Val Val Tyr Leu
305 310 315 320
Leu Ser Lys Phe Ser Asp Ser Asp Glu Phe Asp Tyr Pro Ile Ile Arg
325 330 335
Lys Gln Leu Glu Glu Ala Gly Tyr Met His Ile Leu Val Glu Val Asp
340 345 350
Gln Gln Met Thr Asn Phe Glu Gln Ala Lys Thr Ala Leu Gln Thr Phe
355 360 365
Ala Asp Met Ile
370
<210> 110
<211> 263
<212> PRT
<213> Peptostreptococcus anaerobius CAG:621
<400> 110
Met Ser Asp Ile Tyr Thr Met Gly Ile Asp Ile Gly Ser Thr Ser Ser
1 5 10 15
Lys Cys Val Val Leu Lys Asn Gly Lys Asp Leu Val Ser Ser Gly Val
20 25 30
Val Asn Leu Gly Ala Gly Thr Lys Gly Ala Asp Gln Val Ile Glu Lys
35 40 45
Val Leu Ala Asp Cys Gly Ile Lys Phe Glu Asp Leu Asn Val Ile Val
50 55 60
Ser Thr Gly Tyr Gly Arg Asn Ser Tyr Asp Ser Ala Lys Lys Thr Met
65 70 75 80
Ser Glu Leu Ser Cys His Ala Lys Gly Gly Thr Tyr Ile Phe Gly Pro
85 90 95
Val Arg Thr Ile Ile Asp Ile Gly Gly Gln Asp Ile Lys Val Leu Lys
100 105 110
Leu Asn Asp Lys Gly Met Met Thr Asn Phe Leu Met Asn Asp Lys Cys
115 120 125
Ala Ala Gly Thr Gly Arg Phe Leu Glu Val Met Ala Gly Val Leu Asp
130 135 140
Val Lys Leu Ala Glu Leu Gly Asp Leu Asp Lys Leu Ala Thr Glu Lys
145 150 155 160
Thr Pro Ile Ser Ser Thr Cys Thr Val Phe Ala Glu Ser Glu Val Ile
165 170 175
Ser Cys Met Ala Lys Lys Ile Pro Ile Pro Asn Ile Ile Arg Gly Ile
180 185 190
His Ala Ser Val Ala Thr Arg Val Ala Gly Leu Ala Lys Arg Gly Gly
195 200 205
Leu Thr Thr Pro Val Ala Met Thr Gly Gly Val Thr Lys Asn Ser Gly
210 215 220
Ile Val Arg Ala Leu Ser Glu Glu Leu Glu Thr Asp Ile Met Ile Ser
225 230 235 240
Glu Ile Ser Gln Leu Ala Gly Ala Ile Gly Ala Ala Leu Tyr Ala Tyr
245 250 255
Asp Glu Tyr Leu Lys Glu Asn
260
<210> 111
<211> 258
<212> PRT
<213> Chloroflexus aggregans (strain MD-66 / DSM 9485)
<400> 111
Met Ser Asp Glu Thr Leu Val Leu Ser Thr Ile Glu Gly Pro Val Ala
1 5 10 15
Ile Leu Thr Leu Asn Arg Pro Gln Ala Leu Asn Ala Leu Ser Pro Ala
20 25 30
Leu Ile Asp Ala Leu Ile Arg His Leu Glu His Cys Asp Asn Asp Asp
35 40 45
Thr Ile Arg Val Ile Ile Ile Thr Gly Ala Gly Arg Ala Phe Ala Ala
50 55 60
Gly Ala Asp Ile Lys Ala Met Ala Asp Ala Thr Pro Ile Asp Met Leu
65 70 75 80
Thr Thr Asp Met Ile Ala Arg Trp Ala Arg Ile Ala Ala Val Arg Lys
85 90 95
Pro Val Ile Ala Ala Val Asn Gly Phe Ala Leu Gly Gly Gly Cys Glu
100 105 110
Leu Ala Met Met Cys Asp Ile Ile Leu Ala Ser Glu Thr Ala Gln Phe
115 120 125
Gly Gln Pro Glu Ile Asn Ile Gly Ile Ile Pro Gly Ala Gly Gly Thr
130 135 140
Gln Arg Leu Thr Arg Ala Ile Gly Pro Tyr Arg Ala Met Glu Met Val
145 150 155 160
Leu Thr Gly Ala Thr Ile Ser Ala Gln Glu Ala Tyr Ala Tyr Gly Leu
165 170 175
Val Asn Arg Val Cys Pro Pro Asp Ser Leu Leu Asp Glu Ala Arg Arg
180 185 190
Leu Ala Gln Thr Ile Ala Ala Lys Pro Pro Leu Ala Val Arg Leu Ala
195 200 205
Lys Glu Ala Val Arg Ala Ala Ala Glu Thr Thr Val Arg Glu Gly Leu
210 215 220
Ala Ile Glu Leu Arg Asn Phe Tyr Leu Leu Phe Ala Ser Ala Asp Gln
225 230 235 240
Lys Glu Gly Met Arg Ala Phe Ile Glu Lys Arg Thr Ala Asn Phe Ser
245 250 255
Gly Arg
<210> 112
<211> 257
<212> PRT
<213> Marivirga tractuosa
<400> 112
Met Glu Phe Ile Lys Val Asn Thr Gln Tyr Lys Lys His Ile Ala Leu
1 5 10 15
Ile Asn Leu Asn Arg Pro Lys Glu Leu Asn Ala Leu Asn Leu Gln Leu
20 25 30
Met Thr Glu Leu Lys Asp Thr Leu Lys Val Leu Asp Glu Asp Glu Asn
35 40 45
Val Arg Val Ile Ile Leu Thr Gly Asn Glu Lys Ala Phe Ala Ala Gly
50 55 60
Ala Asp Ile Lys Gln Met Ala Gly Lys Thr Ala Ile Asp Met Leu Asn
65 70 75 80
Val Asp Gln Phe Ser Thr Trp Asp Gln Ile Lys Lys Thr Lys Lys Pro
85 90 95
Leu Ile Ala Ala Val Ser Gly Phe Ala Leu Gly Gly Gly Cys Glu Leu
100 105 110
Ala Met Thr Cys Asp Met Ile Val Ala Ser Glu Ser Ala Lys Phe Gly
115 120 125
Gln Pro Glu Ile Lys Ile Gly Val Met Pro Gly Ala Gly Gly Thr Gln
130 135 140
Arg Leu Thr Arg Ala Ile Gly Lys Ala Lys Ala Met Glu Leu Val Leu
145 150 155 160
Thr Gly Asn Phe Ile Ser Ala Glu Glu Ala Met His Tyr Gly Leu Val
165 170 175
Asn Lys Val Val Pro Thr Glu Met Tyr Leu Glu Ala Ala Ala Glu Leu
180 185 190
Ala Glu Gln Ile Ala Gln Met Ser Pro Val Ala Ala Lys Leu Ala Lys
195 200 205
Glu Ser Val Asn Arg Ala Phe Glu Thr His Leu Asp Glu Gly Leu His
210 215 220
Phe Glu Arg Lys Asn Phe Tyr Leu Thr Phe Ala Ser Glu Asp Gln Thr
225 230 235 240
Glu Gly Met Glu Ala Phe Val Glu Lys Arg Lys Pro Glu Phe Lys Gly
245 250 255
Lys
<210> 113
<211> 257
<212> PRT
<213> Marinithermus hydrothermalis (strain DSM 14884/JCM 11576/T1)
<400> 113
Met Tyr Glu Asn Leu Ile Val Glu Thr Leu Glu Gly Gly Val Gly Leu
1 5 10 15
Ile Arg Ile His Arg Pro Lys Arg Leu Asn Ala Leu Asn Gln Ala Thr
20 25 30
Met Asp Glu Ile Val Arg Ala Val Arg Ala Phe Glu Ala Asp Asp Ala
35 40 45
Val Arg Ala Ile Val Leu Thr Gly Asp Glu Arg Ala Phe Ala Ala Gly
50 55 60
Ala Asp Val Thr Glu Met Asp Gly Ala Asn Val Pro Glu Met Leu Ser
65 70 75 80
Gly Tyr Arg Phe Glu Gln Trp Glu Thr Leu Arg Arg Thr Thr Lys Pro
85 90 95
Leu Ile Ala Ala Val Ser Gly Phe Ala Leu Gly Gly Gly Leu Glu Leu
100 105 110
Ala Met Leu Cys Asp Ile Ile Val Ala Ser Glu Thr Ala Arg Leu Gly
115 120 125
Gln Pro Glu Ile Asn Leu Gly Ile Met Pro Gly Ala Gly Gly Thr Gln
130 135 140
Arg Leu Thr Arg Gln Val Gly Lys Tyr Leu Ala Met Glu Met Val Leu
145 150 155 160
Thr Gly Arg Met Leu Thr Ala Glu Glu Ala Tyr Arg His Gly Leu Val
165 170 175
Asn Arg Val Val Pro Val Glu Phe Tyr Leu Glu Glu Ala Ile Gln Ile
180 185 190
Ala Arg Glu Ile Ala Lys Lys Ala Pro Val Ala Val Arg Leu Ala Lys
195 200 205
Asp Ala Ile Leu Lys Ala Glu Asp Thr Pro Leu Glu Val Gly Leu Ala
210 215 220
Tyr Glu Arg His Asn Phe Tyr Leu Leu Phe Gly Thr Glu Asp Lys Gln
225 230 235 240
Glu Gly Ile Arg Ala Phe Leu Glu Lys Arg Lys Pro Glu Trp Lys Gly
245 250 255
Arg
<210> 114
<211> 259
<212> PRT
<213> Chitinophaga pinensis (strain ATCC 43595/DSM 2588/NCIB11800/UQM 2034)
<400> 114
Met Gln Pro Gln Phe Ile Ile Ile His Arg Gln Val Ala Pro Tyr Val
1 5 10 15
Ala His Ile Gln Leu Asn Arg Pro Lys Glu Leu Asn Ala Leu Asn Leu
20 25 30
Glu Leu Met Ile Glu Leu Arg Asp Ala Leu Lys Met Leu Asp Ala Asp
35 40 45
Asp Asn Val Arg Ala Ile Val Ile Ser Gly Asn Glu Lys Ala Phe Ala
50 55 60
Ala Gly Ala Asp Ile Lys Gln Met Ala Gly Lys Thr Ala Met Asp Met
65 70 75 80
Tyr Asn Ile Asp Gln Phe Ser Thr Trp Asp Thr Ile Lys Lys Thr Lys
85 90 95
Lys Pro Leu Ile Ala Ala Val Ser Gly Phe Ala Leu Gly Gly Gly Cys
100 105 110
Glu Leu Val Met Leu Cys Asp Met Ile Val Ala Ser Glu Thr Ala Arg
115 120 125
Phe Gly Gln Pro Glu Ile Lys Ile Gly Val Met Pro Gly Ala Gly Gly
130 135 140
Thr Gln Arg Leu Thr Arg Ala Val Gly Lys Ala Leu Ala Met Glu Met
145 150 155 160
Val Leu Thr Gly Arg Phe Ile Thr Ala Gln Glu Ala Ala Arg Ala Gly
165 170 175
Leu Ile Asn Arg Val Ile Pro Val Glu Leu Phe Leu Gln Glu Ala Ile
180 185 190
Arg Leu Ala Thr Glu Val Ala Ala Leu Ser Pro Leu Ala Val Lys Met
195 200 205
Ala Lys Glu Ser Val Leu Lys Ala Phe Asp Ser Ser Leu Glu Glu Gly
210 215 220
Leu His Phe Glu Arg Lys Asn Phe Tyr Leu Leu Phe Ala Ser Glu Asp
225 230 235 240
Gln Lys Glu Gly Met Gln Ala Phe Val Asp Lys Arg Ser Pro Val Phe
245 250 255
Lys Gly Lys
<210> 115
<211> 258
<212> PRT
<213> Megasphaera elsdenii DSM 20460
<400> 115
Met Tyr Thr Leu Gly Ile Asp Val Gly Ser Ser Ser Ser Lys Ala Val
1 5 10 15
Ile Leu Glu Asp Gly Lys Lys Ile Val Ala His Ala Val Val Glu Ile
20 25 30
Gly Thr Gly Ser Thr Gly Pro Glu Arg Val Leu Asp Glu Val Phe Lys
35 40 45
Asp Thr Asn Leu Lys Ile Glu Asp Met Ala Asn Ile Ile Ala Thr Gly
50 55 60
Tyr Gly Arg Phe Asn Val Asp Cys Ala Lys Gly Glu Val Ser Glu Ile
65 70 75 80
Thr Cys His Ala Lys Gly Ala Leu Phe Glu Cys Pro Gly Thr Thr Thr
85 90 95
Ile Leu Asp Ile Gly Gly Gln Asp Val Lys Ser Ile Lys Leu Asn Gly
100 105 110
Gln Gly Leu Val Met Gln Phe Ala Met Asn Asp Lys Cys Ala Ala Gly
115 120 125
Thr Gly Arg Phe Leu Asp Val Met Ser Lys Val Leu Glu Ile Pro Met
130 135 140
Ser Glu Met Gly Asp Trp Tyr Phe Lys Ser Lys His Pro Ala Ala Val
145 150 155 160
Ser Ser Thr Cys Thr Val Phe Ala Glu Ser Glu Val Ile Ser Leu Leu
165 170 175
Ser Lys Asn Val Pro Lys Glu Asp Ile Val Ala Gly Val His Gln Ser
180 185 190
Ile Ala Ala Lys Ala Cys Ala Leu Val Arg Arg Val Gly Val Gly Glu
195 200 205
Asp Leu Thr Met Thr Gly Gly Gly Ser Arg Asp Pro Gly Val Val Asp
210 215 220
Ala Val Ser Lys Glu Leu Gly Ile Pro Val Arg Val Ala Leu His Pro
225 230 235 240
Gln Ala Val Gly Ala Leu Gly Ala Ala Leu Ile Ala Tyr Asp Lys Ile
245 250 255
Lys Lys
<210> 116
<211> 428
<212> PRT
<213> Megasphaera elsdenii DSM 20460
<400> 116
Met Ser Glu Glu Lys Thr Val Asp Ile Glu Ser Met Ser Ser Lys Glu
1 5 10 15
Ala Leu Gly Tyr Phe Leu Pro Lys Val Asp Glu Asp Ala Arg Lys Ala
20 25 30
Lys Lys Glu Gly Arg Leu Val Cys Trp Ser Ala Ser Val Ala Pro Pro
35 40 45
Glu Phe Cys Thr Ala Met Asp Ile Ala Ile Val Tyr Pro Glu Thr His
50 55 60
Ala Ala Gly Ile Gly Ala Arg His Gly Ala Pro Ala Met Leu Glu Val
65 70 75 80
Ala Glu Asn Lys Gly Tyr Asn Gln Asp Ile Cys Ser Tyr Cys Arg Val
85 90 95
Asn Met Gly Tyr Met Glu Leu Leu Lys Gln Gln Ala Leu Thr Gly Glu
100 105 110
Thr Pro Glu Val Leu Lys Asn Ser Pro Ala Ser Pro Ile Pro Leu Pro
115 120 125
Asp Val Val Leu Thr Cys Asn Asn Ile Cys Asn Thr Leu Leu Lys Trp
130 135 140
Tyr Glu Asn Leu Ala Lys Glu Leu Asn Val Pro Leu Ile Asn Ile Asp
145 150 155 160
Val Pro Phe Asn His Glu Phe Pro Val Thr Lys His Ala Lys Gln Tyr
165 170 175
Ile Val Gly Glu Phe Lys His Ala Ile Lys Gln Leu Glu Asp Leu Cys
180 185 190
Gly Arg Pro Phe Asp Tyr Asp Lys Phe Phe Glu Val Gln Lys Gln Thr
195 200 205
Gln Arg Ser Ile Ala Ala Trp Asn Lys Ile Ala Thr Tyr Phe Gln Tyr
210 215 220
Lys Pro Ser Pro Leu Asn Gly Phe Asp Leu Phe Asn Tyr Met Gly Leu
225 230 235 240
Ala Val Ala Ala Arg Ser Leu Asn Tyr Ser Glu Ile Thr Phe Asn Lys
245 250 255
Phe Leu Lys Glu Leu Asp Glu Lys Val Ala Asn Lys Lys Trp Ala Phe
260 265 270
Gly Glu Asn Glu Lys Ser Arg Val Thr Trp Glu Gly Ile Ala Val Trp
275 280 285
Ile Ala Leu Gly His Thr Phe Lys Glu Leu Lys Gly Gln Gly Ala Leu
290 295 300
Met Thr Gly Ser Ala Tyr Pro Gly Met Trp Asp Val Ser Tyr Glu Pro
305 310 315 320
Gly Asp Leu Glu Ser Met Ala Glu Ala Tyr Ser Arg Thr Tyr Ile Asn
325 330 335
Cys Cys Leu Glu Gln Arg Gly Ala Val Leu Glu Lys Val Val Arg Asp
340 345 350
Gly Lys Cys Asp Gly Leu Ile Met His Gln Asn Arg Ser Cys Lys Asn
355 360 365
Met Ser Leu Leu Asn Asn Glu Gly Gly Gln Arg Ile Gln Lys Asn Leu
370 375 380
Gly Val Pro Tyr Val Ile Phe Asp Gly Asp Gln Thr Asp Ala Arg Asn
385 390 395 400
Phe Ser Glu Ala Gln Phe Asp Thr Arg Val Glu Ala Leu Ala Glu Met
405 410 415
Met Ala Asp Lys Lys Ala Asn Glu Gly Gly Asn His
420 425
<210> 117
<211> 372
<212> PRT
<213> Megasphaera elsdenii DSM 20460
<400> 117
Met Ser Gln Ile Asp Glu Leu Ile Ser Lys Leu Gln Glu Val Ser Asn
1 5 10 15
His Pro Gln Lys Thr Val Leu Asn Tyr Lys Lys Gln Gly Lys Gly Leu
20 25 30
Val Gly Met Met Pro Tyr Tyr Ala Pro Glu Glu Ile Val Tyr Ala Ala
35 40 45
Gly Tyr Leu Pro Val Gly Met Phe Gly Ser Gln Asn Pro Gln Ile Ser
50 55 60
Ala Ala Arg Thr Tyr Leu Pro Pro Phe Ala Cys Ser Leu Met Gln Ala
65 70 75 80
Asp Met Glu Leu Gln Leu Asn Gly Thr Tyr Asp Cys Leu Asp Ala Val
85 90 95
Ile Phe Ser Val Pro Cys Asp Thr Leu Arg Cys Met Ser Gln Lys Trp
100 105 110
His Gly Lys Ala Pro Val Ile Val Phe Thr Gln Pro Gln Asn Arg Lys
115 120 125
Ile Arg Pro Ala Val Asp Phe Leu Lys Ala Glu Tyr Glu His Val Arg
130 135 140
Thr Glu Leu Glu Arg Ile Leu Asn Val Lys Ile Ser Asp Leu Ala Ile
145 150 155 160
Gln Glu Ala Ile Lys Val Tyr Asn Glu Asn Arg Gln Val Met Arg Glu
165 170 175
Phe Cys Asp Val Ala Ala Gln Tyr Pro Gln Ile Phe Thr Pro Val Lys
180 185 190
Arg His Asp Val Ile Lys Ala Arg Trp Phe Met Asp Lys Ala Glu His
195 200 205
Thr Ala Leu Val Arg Glu Leu Ile Asp Ala Val Lys Lys Glu Pro Val
210 215 220
Gln Pro Trp Asn Gly Lys Lys Val Ile Leu Ser Gly Ile Met Ala Glu
225 230 235 240
Pro Asp Glu Phe Leu Asp Ile Phe Ser Glu Phe Asn Ile Ala Val Val
245 250 255
Ala Asp Asp Leu Ala Gln Glu Ser Arg Gln Phe Arg Thr Asp Val Pro
260 265 270
Ser Gly Ile Asp Pro Leu Glu Gln Leu Ala Gln Gln Trp Gln Asp Phe
275 280 285
Asp Gly Cys Pro Leu Ala Leu Asn Glu Asp Lys Pro Arg Gly Gln Met
290 295 300
Leu Ile Asp Met Thr Lys Lys Tyr Asn Ala Asp Ala Val Val Ile Cys
305 310 315 320
Met Met Arg Phe Cys Asp Pro Glu Glu Phe Asp Tyr Pro Ile Tyr Lys
325 330 335
Pro Glu Phe Glu Ala Ala Gly Val Arg Tyr Thr Val Leu Asp Leu Asp
340 345 350
Ile Glu Ser Pro Ser Leu Glu Gln Leu Arg Thr Arg Ile Gln Ala Phe
355 360 365
Ser Glu Ile Leu
370
<210> 118
<211> 258
<212> PRT
<213> Chloroflexus aurantiacus (strain ATCC 29364 / DSM 637 / Y-400-fl)
<400> 118
Met Ser Glu Glu Ser Leu Val Leu Ser Thr Ile Glu Gly Pro Ile Ala
1 5 10 15
Ile Leu Thr Leu Asn Arg Pro Gln Ala Leu Asn Ala Leu Ser Pro Ala
20 25 30
Leu Ile Asp Asp Leu Ile Arg His Leu Glu Ala Cys Asp Ala Asp Asp
35 40 45
Thr Ile Arg Val Ile Ile Ile Thr Gly Ala Gly Arg Ala Phe Ala Ala
50 55 60
Gly Ala Asp Ile Lys Ala Met Ala Asn Ala Thr Pro Ile Asp Met Leu
65 70 75 80
Thr Ser Gly Met Ile Ala Arg Trp Ala Arg Ile Ala Ala Val Arg Lys
85 90 95
Pro Val Ile Ala Ala Val Asn Gly Tyr Ala Leu Gly Gly Gly Cys Glu
100 105 110
Leu Ala Met Met Cys Asp Ile Ile Ile Ala Ser Glu Asn Ala Gln Phe
115 120 125
Gly Gln Pro Glu Ile Asn Leu Gly Ile Ile Pro Gly Ala Gly Gly Thr
130 135 140
Gln Arg Leu Thr Arg Ala Leu Gly Pro Tyr Arg Ala Met Glu Leu Ile
145 150 155 160
Leu Thr Gly Ala Thr Ile Ser Ala Gln Glu Ala Leu Ala His Gly Leu
165 170 175
Val Cys Arg Val Cys Pro Pro Glu Ser Leu Leu Asp Glu Ala Arg Arg
180 185 190
Ile Ala Gln Thr Ile Ala Thr Lys Ser Pro Leu Ala Val Gln Leu Ala
195 200 205
Lys Glu Ala Val Arg Met Ala Ala Glu Thr Thr Val Arg Glu Gly Leu
210 215 220
Ala Ile Glu Leu Arg Asn Phe Tyr Leu Leu Phe Ala Ser Ala Asp Gln
225 230 235 240
Lys Glu Gly Met Gln Ala Phe Ile Glu Lys Arg Ala Pro Asn Phe Ser
245 250 255
Gly Arg
<210> 119
<211> 258
<212> PRT
<213> Ruegeria pomeroyi DSS-3
<400> 119
Met Ala Phe Glu Thr Ile Ile Val Glu Val Glu Asp His Val Ala Leu
1 5 10 15
Ile Arg Leu Asn Arg Pro Asp Ala Leu Asn Ala Leu Asn Thr Gln Leu
20 25 30
Leu Gly Glu Leu Cys Thr Ala Leu Glu Glu Ala Asp Gly Asn Asp Lys
35 40 45
Val Arg Cys Ile Val Ile Thr Gly Ser Asp Lys Ala Phe Ala Ala Gly
50 55 60
Ala Asp Ile Arg Glu Met Ser Gln Lys Thr Tyr Val Glu Val Tyr Ser
65 70 75 80
Glu Asn Leu Phe Ala Ala Ala Asn Asp Arg Val Ser Ala Ile Arg Lys
85 90 95
Pro Ile Ile Ala Ala Val Ala Gly Tyr Ala Leu Gly Gly Gly Cys Glu
100 105 110
Leu Ala Met Leu Cys Asp Phe Ile Ile Ala Ala Asp Thr Ala Lys Phe
115 120 125
Gly Gln Pro Glu Ile Asn Leu Gly Val Ile Ala Gly Ile Gly Gly Thr
130 135 140
Gln Arg Leu Thr Arg Leu Val Gly Lys Ser Lys Ser Met Asp Leu Asn
145 150 155 160
Leu Thr Gly Arg Phe Met Asp Ala Glu Glu Ala Glu Arg Ala Gly Leu
165 170 175
Val Ser Arg Val Val Pro Ala Lys Lys Leu Val Glu Glu Ala Leu Ser
180 185 190
Ala Ala Gln Lys Ile Ala Glu Lys Ser Met Ile Ser Ala Tyr Ala Val
195 200 205
Lys Glu Ala Val Asn Arg Ser Tyr Glu Thr Thr Leu Ser Glu Gly Leu
210 215 220
Leu Phe Glu Arg Arg Val Phe His Ser Met Phe Ala Thr Glu Asp Gln
225 230 235 240
Lys Glu Gly Met Ala Ala Phe Leu Glu Lys Arg Ala Ala Gln Phe Arg
245 250 255
Asp Lys
<210> 120
<211> 900
<212> DNA
<213> Dictyostelium discoideum (Slime mold)
<400> 120
atgattaata gattattttc aattaataat attaaaaatg gatcaaaatt ttttagttca 60
tcaacaacag ttgaaactaa acaaccatta gttttattag aaaaacattt agtaaatgga 120
aaatatacag gtattcaaat tgttaaatta aataaaccaa aacaattgaa tgcattaaca 180
tttgaaatgg gagttgatta taagaaggtg gtggatacat tagcagaaga taaagatttg 240
aaatgtgttg tattgacagg tgaaggtaag gcattttcgg caggtggtga tttagatttc 300
ttaattgaaa gaactaaaga cacaccagaa aacaatcaaa gaattatgga aagattctat 360
agaacatttt tatatattcg ttcattacca gtaccaatca tttctgcaat caatggtgca 420
gcaattggtg caggtttctg tttagcttta gcaactgata ttcgtgtcgt tagtaataaa 480
gcaccagtgg gtttaacatt caccaaatta ggtattcatc caggtatggg tgtaactcat 540
tcaattacaa atatagttgg tcaagatgtt gcatcctata tgttattatc aagtgatatt 600
atcaaaggtg atgaagctca aagattaggt ttagttttaa aatcggttga atctgatcaa 660
gttttaccaa ctgctttaaa tctcgctgaa acaatctcaa aaaattcaac tatcgctgta 720
aactctacaa caaaaacttt acgtaataaa tataattcag atttagataa aagtttaact 780
cgtgaagctg atgctcaaag tcaatgttgg gcttcaaaag atatagttga aggtatttta 840
gcaattagag aaagtagaga tccaaaacat aattatttat tatttgatga tcaaaaataa 900
900
<210> 121
<211> 786
<212> DNA
<213> Clostridium acetobutylicum
<400> 121
atggaactaa acaatgtcat ccttgaaaag gaaggtaaag ttgctgtagt taccattaac 60
agacctaaag cattaaatgc gttaaatagt gatacactaa aagaaatgga ttatgttata 120
ggtgaaattg aaaatgatag cgaagtactt gcagtaattt taactggagc aggagaaaaa 180
tcatttgtag caggagcaga tatttctgag atgaaggaaa tgaataccat tgaaggtaga 240
aaattcggga tacttggaaa taaagtgttt agaagattag aacttcttga aaagcctgta 300
atagcagctg ttaatggttt tgctttagga ggcggatgcg aaatagctat gtcttgtgat 360
ataagaatag cttcaagcaa cgcaagattt ggtcaaccag aagtaggtct cggaataaca 420
cctggttttg gtggtacaca aagactttca agattagttg gaatgggcat ggcaaagcag 480
cttatattta ctgcacaaaa tataaaggca gatgaagcat taagaatcgg acttgtaaat 540
aaggtagtag aacctagtga attaatgaat acagcaaaag aaattgcaaa caaaattgtg 600
agcaatgctc cagtagctgt taagttaagc aaacaggcta ttaatagagg aatgcagtgt 660
gatattgata ctgctttagc atttgaatca gaagcatttg gagaatgctt ttcaacagag 720
gatcaaaagg atgcaatgac agctttcata gagaaaagaa aaattgaagg cttcaaaaat 780
agatag 786
<210> 122
<211> 468
<212> DNA
<213> Clostridium difficile
<400> 122
aatagtaaaa aagtagtgat agctgctgta aacggatttg ctttaggtgg atgtgaactt 60
gcaatggcat gtgatataag aattgcatct gctaaagcta aatttggtca gccagaagta 120
actcttggaa taactccagg atatggagga actcaaaggc ttacaagatt ggttggaatg 180
gcaaaagcaa aagaattaat ctttacaggt caagttataa aagctgatga agctgaaaaa 240
atagggctag taaatagagt cgttgagcca gacattttaa tagaagaagt tgagaaatta 300
gctaagataa tagctaaaaa tgctcagctt gcagttagat actctaaaga agcaatacaa 360
cttggtgctc aaactgatat aaatactgga atagatatag aatctaattt atttggtctt 420
tgtttttcaa ctaaagacca aaaagaagga attgtcagct ttcgttga 468
<210> 123
<211> 777
<212> DNA
<213> Clostridium pasteurianum
<400> 123
atgggaaata ttatctttga agaagaagat ggaatagaaa aagttacaat taacagacct 60
aaagctctta atgcattaaa tagtgaaaca ttaaaagaac ttggtacagt aataaatgac 120
atatctgtaa acgatggaat aaaagctgta ataataacag gttcgggatc aaaagctttt 180
gtagctggtg cagatatagc tgaaatgagt actctaaatt caatagaggc aacaaatttt 240
tcaagacttg cccaaaatgt attttcacaa atagaaaatc tacctaaatt agtagtagca 300
gcagttaacg gttttgctct tggaggagga tgtgagcttg caatggcttg tgatgtaagg 360
tttgcttcaa aaaaagctaa atttggtcaa ccagaagtta atttaggaat attgccaagt 420
ttcggaggaa ctcaacggct tccaaaattg gttggaaagg gaatagcaaa agaattgata 480
ttttctacag atatgattac tgccgatgaa gcttatcgta taggacttgc taataaagtc 540
tatgaacctg aggaattatt agtaaaatca caggagtttg ctgaaaaggt aatgactaaa 600
tctccatggg gtgttaaatt agcaaaagca tgtataaata atggattaga tgtagatttg 660
gaagcaggac ttaaatatga agcaaattca tttggtctgt gtttttcaac ggaagatcaa 720
aaggaaggta tgaaagcatt tttagaaaaa agaaaagcag acttcaaagg actttaa 777
<210> 124
<211> 789
<212> DNA
<213> Clostridium pasteurianum
<400> 124
atggatttta ataatattat ccttgaaaaa gaggaaaaaa ttgccgtagt tacaattaat 60
agacctaaag ctcttaatgc tttgaacagt gaaacgttaa ctgagcttga ttctgtaatt 120
gatgaaattg acaaagataa tgaaatttta gcagtggtat taacgggagc gggaaaatcc 180
ttcgtagctg gagccgatat atcagaaatg aaagacatga atgtagtaga aggaagaaaa 240
tttggaatac taggtaataa ggtgttcaga aaacttgaaa atttagaaaa gccagtaata 300
gcagccctta atggatttac attgggtggt ggttgtgaaa ttgctatgtc ttgcgatata 360
agaatagctt ctactaaggc aaaatttgga cagccagagg tacagcttgg aataactcca 420
ggttttggcg gtactcaaag attagctaga ttaataggcc caggagctgc aaaggaactt 480
atatatactg gaaaaattat aaatgctgaa gaggcctata gattaggact tgttaataga 540
gttatagaac cagaaacttt attagatgaa gcaaaacaat tggcaaatac tatagcagcc 600
aatgcaccta tagctgttaa gttggctaaa tcagcaataa atagaggaat tcaaactgat 660
attgatacag gtgtgtcaat tgaatcagaa gtatttggag cttgtttctc tacagaagat 720
caaaaagaag gtatgaatac attcttgaat gataaaaaat atttaactgg taattttaag 780
aataaataa 789
<210> 125
<211> 783
<212> DNA
<213> Megasphaera elsdenii
<400> 125
atggattacc agaacattat ttttgctgta gaagacggta ttgcaacgat tacgatcaat 60
cgcccgaagg ctctgaacgc tttgaaccag gctacggtca gcgaattgaa agacgtcgtt 120
gaaaagattg cagctgataa agctatcaaa gtcgtcatca tcaccggtgc aggcgctaaa 180
tccttcgtcg ctggcgctga catcaaagaa atggcttcca agaacgctgc tgaaggccgc 240
gaatggggcc agttcggtca gaacgtcttc acggaaatcg aaaacctgcc gcagcctgtc 300
atcgcagcta tcaacggctt cgctctcggc ggcggctgcg aactctcctg cgcttgcgat 360
atccgctatg cagctgaaaa cgctaaattc ggccagccgg aagtcggctt gggcatcact 420
ccgggctttg gcggcacgca gcgcctgacc cgtgtcgtag gccgcggcca cgcgaaagaa 480
ctcatctaca cgggcggcat gatcgacgct gaaaaagcaa aagctatcgg cttggtcaat 540
gaagtcttcc cgcaggaaga actgatgccg gctgctgtta aattggctaa gaagatcgct 600
aagaacgctc ctattgcagt acagctctcc aaagctgcca tcaaccgcgg catcaactgc 660
gacgtcgtaa ccggtatcgc ttatgaagct gaagtcttcg gcctctgctt ctccacggct 720
gaccagaagg aaggcatggc tgctttctgc gaaaaacgca aagcaacgtt tgaaggtaaa 780
taa 783
<210> 126
<211> 780
<212> DNA
<213> Metallosphaera sedula
<400> 126
atggaatttg aaacaataga aactaaaaaa gaaggaaact tgttctggat tacgttaaat 60
agacccgata aactaaacgc actaaacgct aaattacttg aggagttaga tagggcagtc 120
tctcaggcag agtctgaccc agagattagg gttatcatca ttacagggaa aggaaaggcc 180
ttctgcgcag gggctgacat aacccagttt aaccagttaa ccccagcaga agcctggaaa 240
ttctctaaga aaggaagaga gatcatggac aagatagagg cactgagcaa acccaccatt 300
gccatgatca atggatatgc ccttgggggt ggactagagc tagccttagc ctgtgatata 360
aggatcgcag cggaggaggc ccaactaggc cttccagaga taaacctagg gatatatccg 420
gggtatgggg ggactcagag gttaaccaga gttataggaa agggaagagc cctggagatg 480
atgatgacgg gcgatcgtat tcctggtaag gatgctgaga aatatggtct cgtgaatagg 540
gttgtccccc tagctaactt ggagcaagag acaaggaagc tggcagaaaa gatagccaag 600
aagtctccta tctctctcgc cttaatcaag gaagttgtaa acaggggact agactctccc 660
ctactgtcag gtctagcgtt ggaaagcgta ggatggggag tcgtgttttc tacggaggac 720
aagaaggagg gggtaagtgc cttcctggag aagagagagc ctacgtttaa gggaaaatag 780
780
<210> 127
<211> 779
<212> DNA
<213> Clostridicum kluyvery
<400> 127
atggaattta aaaatatcat tcttgaaaag gatggaaatg tggcttcaat aacgttgaat 60
agacctaagg cattaaatgc attaaatgca gcaactttaa aagagataga tgccgcaata 120
aacgacattg ctgaagatga taacgtatat gctgtgataa ttactgggtc aggtaaagct 180
tttgtagcag gagcagatat agctgagatg aaagatctta ctgcagttga gggaagaaag 240
ttttcagttc ttggcaataa aatatttaga aaattagaaa atttagaaaa accagttata 300
gcagctataa atggatttgc actgggtggt ggctgtgaat tgtcattgtc ttgcgatata 360
agaatagctt catcaaaggc taagtttggt caaccagagg ttggtcttgg aattactcca 420
gggtttggag gtactcaaag acttgcaaga gcaataggcg ttggtatggc taaggaactt 480
atatataccg gaaaagtaat taatgctgaa gaggcattaa gaataggttt ggtaaataaa 540
gtagttgagc cagataaatt attggaagaa gctaaagctt tagtagatgc tattattgtt 600
aatgcaccta tagctgttag aatgtgtaag gctgctataa atcaaggact tcagtgtgat 660
atagatacag gtgtagctta tgaagcagaa gtatttgggg aatgttttgc tacagaagat 720
agagtagaag gaatgacagc atttgtagaa aaaagagaca aggcttttaa aaataagta 779
<210> 128
<211> 1509
<212> DNA
<213> Sulfolobus tokodaii
<400> 128
atggcaatta gaactggaga gcaatattta gattctataa aaattagaaa taaggctgaa 60
atttacgtaa tgggaaaaga agtaaaggat gtaaccactc atcccttctt gaaaccttct 120
gtaatggcat ttaaggcaac atttgatgct gcttgggaag aggacacaaa agaattagcc 180
agagcatgga gtcctttcat aaatgaagaa gtgaatagat ttaatcacat acacaggtca 240
ccagaagact tagctgctaa agtgaaatta ctgagaaaat taagccataa gaccggtgca 300
tgtttccaaa gatgtgtagg atgggacgct ctgaacactt tgtggattat gacgaatata 360
atggctcaaa aaggtaaaaa agaatataag gatagatttg tcgaatactt aagttacgtc 420
caaaagaagg atttagcatt agctggtgct atgacagatg caaaaggtgt aagaacatta 480
aaaccgcatc aacaaccaaa taagaacgct tatgttagaa ttgaggaagt taccaaagac 540
ggtatttatg tttctggtgc aaaggcaaat attactggtg tagctgcaac agaagaaatt 600
gtggttttac ctactagggc tatggggcca gaagataaag attatgctgt tgcattttca 660
ataccgacag atactgaggg tataaaaatt atagttggta gacaattaaa tgatgctaga 720
agattagaag gtggtgacat agatgcttta ccgtacttct ataaccacga gggtttagta 780
atctttgacc atgtttttgt accaatggat agagtattct taatgggaga atacgagttt 840
acttcacaat tagttgaagt attctcagca tatcatagac aaggatatgg tggttgcaag 900
gctggtttag gagatgtaat tattggtgca tcaatgaatt tagcaaaaca attaggagta 960
gaaaaagctt cacatgtaca agaaaaacta acggaaatga tattcttaac tgagaccatg 1020
tattctgcag gaattgcagc tagtttaaat gcagttaagg tctgcgataa ttgttggtgg 1080
gttaatccta tgcacgctaa tgttacaaaa catttagtag ctagatttcc agcccagatt 1140
tctcagttat ctatcgatat tgcaggtgga ataataggta ctgcaccaag tgagtgggat 1200
ctcaagaatc ctaaattaag agaatatatt gccaaatact tacaaggtgt tgagggttat 1260
acagctgaag atagattaag aatggttaga ttactggaaa acgttagtct gggtgttgca 1320
ttccaaattg aatctgtaca cggtgcagga agtccagcag cacaaagaat aatgtttagt 1380
agactttatg acttaaacta tgctgaggaa gtcgcaaaga ggttagctgg gaagaagact 1440
gatttacagt ggaaacctaa agcagagcct tggagagaaa gtgagacaga aaaattagta 1500
aaaagttaa 1509
<210> 129
<211> 1452
<212> DNA
<213> Geobacter metallireducens
<400> 129
atggcactaa gagatgggaa ttcctaccgg gaaagccttc gggcgctcaa tatcaaagtc 60
tatgcctttg gagagaagat tgacagcata gtagatcacc cattgttcca gccccatatc 120
aatgcggctg cattgacgtt cgacttggcc catgatccga ccacggaagc gctcgtcaca 180
gccacctcac acctgacggg gagtaaaatc agccgcttca cccatatcca ccagagcacc 240
gacgatctca taaaaaaggt gaagatgttg cggcttattg cagggaagac gggaagttgc 300
taccagcgct gtgtggggtg ggatgccctg aacgctaact atacggtaac ctatgagatg 360
gaccaggagc ttggtaccga ctatcaccag cgttttaggc gttacctcga atatatacag 420
gacaatgacc tgatggtggc gggagcaatg accgatccca agggggacag ggggctgcct 480
ccggcaaaac agaaagaccc ggacatgttc gtgcacgtgg tggcaaagaa tgacaagggg 540
atagtcattc gtggggcaaa ggttcaccag accggaattg tcaattccca tgaaatgctg 600
attatgccaa ccatggccat gggggaggag gacggcgact atgcggttgc ctgtgctctc 660
cccacggatt cccccggtgt catccatatc tttggtcgtc aaaccaacga tacacgccgt 720
ctggaaaagg gagaccttga tcagggtaat gctgagtatg gaactgtcgg aggcgaggct 780
ttgaccatac ttgaagatgt cttcgtcccg tgggaacgcg tcttcatgtg cggagagtac 840
aagtatgcgg ggctgctggt tgagcgtttc gcgagctatc atcgacagaa ctatggtgga 900
tgcaaggcag gcgtgagcga tgtgatcatc ggcgcaacta ccgctatggc agagtacaac 960
ggagcagcca aggcttccca cgtgcgtgac aagatcgtgg agatggtcca cctcaccgag 1020
accctttatt gcggttccat cgcctgctcc tgtgagggtg ctcccacgcc gtcaggggcc 1080
tatttcgtca atcccctgct ggccaatacg gttaagcaga acgtgacccg tttcatctat 1140
gagattgcac gcctttccca cgatatttcc ggtggctgca tggcaaccat gccttcggag 1200
aaggatctgc accacgatga gatcggcaaa tatgtagaga agtatttccg gggggtggac 1260
gaagctccca ctgaagagcg catgcggatg gcccggctcg ttgaaaatat gacgggcggc 1320
acggctttgg tggaaagcat gcatggtgcc ggctctcccc aggcgcagag agtcatgatc 1380
ctccgccagg caaatctcgg ccataaggta aagcttgcca agaaactggc cggcataaag 1440
gaagaaaaat ag 1452
<210> 130
<211> 1392
<212> DNA
<213> Sulfolobus solfataricus
<400> 130
atgagatcaa aagaagattt cctaaagtcc ttaaaagatg gaagaaattt gtattatagg 60
gggaagttag tagaagatat aacaacacat cagatcttaa agacagccgc attgcacgca 120
gctaagttat atgaatacgc tgatagagtc tatgaggata ataaaatggg aaaaatgagc 180
aagttcttta aggtaccttg gacatctcaa gatttgctag atagacataa actaatttac 240
gatttaacga tgtattgtaa tggggtattt aacatttcac aagcaatagg aagtgatgcg 300
atctttgccc ttatgatcac ggcaaaacaa gttgatagaa aatacggaac tgattactca 360
aaacgtgttg aaaaatattt tgagagagtt gctaaagaag atttaacgtt agccactgcc 420
cagactgacg ttaagggaga tcgaagtaag aggccttctg aacaagttga tccagatatg 480
tatgttagag tagttgatgt gaaaagcgat ggaatagttg ttagaggagc aaaggctcat 540
acaactcaat ctgcggtatc tgatgagatt attgtcatac caaccagagt aatgagggat 600
agcgataaag attacgcagt agcctttgcg gttccagcta atactaaagg tttgaagatg 660
tatattagac caattgatga aattgagggc aattcctcct cagtactcag tagaaaagat 720
tatgagctag aaacattaac cgtcttcaac gacgttttcg ttccttggga tagggtattt 780
ttatttaagg aatacgacta cgctggaaca ttggctatgc tatttgcaac cttccatagg 840
tttactgcat tatcgtatag gtcagcgacc atgaatctat atttgggagc atctaaagtg 900
gcatctcaag taaatggcat tgagaatgaa aagcatgtga gagatgatat agttgatata 960
attctctaca aggaaattat gaggagtagc gcgatagctg cggctgtgta tccagtaaac 1020
atggagggta tagctgtgcc caacccgctt tttactaatg ttggtaaatt atactccaat 1080
atgcatttcc atgatgttgt aagagattta attgacattg ctggggggat aatagctact 1140
atgccctctc aagaagattt ggaaagtgat gaaggaaaga atattgttaa atatttaagg 1200
ggctcagttg atggagagga aagagcaaaa gtgttaaaac tagctaagga attaggggct 1260
agtacgttta ctggctattt gctaactggt atgatacatg cggaaggttc tatggaagct 1320
agcaaaatag agctattcag aagttataat tttaaggagg ccgagaactt agttaaaagg 1380
gtattaagct ag 1392
<210> 131
<211> 1440
<212> DNA
<213> Syntrophobacter fumaroxidans
<400> 131
atgggactca aaacgaaggc ggaatatata gaatccttgc gaggcatgaa gccgacggtc 60
tacatgttcg gtgagaagat cgaaagcgtt gtggacaatc cacgcctgcg agcgggcatc 120
gaggcgacgg gggcgacgta cgaactggca gagacggagg agtatcgccc tctcattgtg 180
actgaaagtc ccctcattca cgaacccgtc aaccggtata cgttgccccc gtcgtccatc 240
gcggacctcg tcgccagggt gaagatcaat cgtctcatgg gcactcgtgt cgggacctgc 300
tttcaacggt gcacggggct ggactgcctg tccgcccttt ccatcgtgac ctacgacatc 360
gacgccaagc attccacccc ttacttcaaa cggttcatcg agtttctgaa gcatgttcag 420
aaaaacgacc tgacctgcaa cgccggcgtg accgacgtca agggcgaccg ttccctggcc 480
ccccacgagc aggaagacaa ggacatgtac gtgagggtcg tggaacgcaa tgcggacggc 540
atcgtcgtga ggggcgccaa ggcgcaccag accggttccc tctcctcgca cgaaatcatc 600
gtcctgccga cgcgtgccct gcgaaagggc gacgaggact acgcgctcgc ttttgccatc 660
cccaacgaca ctcccggcct gattcacgtc gtgggccgat cgagcctcga cacccgccag 720
ctggacggct gcgacctggg caaccttcac tattccaagt actgcccgac cgtgatcttc 780
aaggacgtgt tcgttccctg ggagcgggtc ttcatgtgcg gcgaggtgga attcgccgtg 840
gagatggtga accgcttttc ggcttatcac cgccagagcc acggcggctg caagtcgggc 900
aagatcgact gcatggtcgg agcggccctc accatgatgg actacaacgg gacggagaag 960
gccgggcatc tcaagcagaa ggccatcgag atggtccacc gggcggaaac cctctacggc 1020
tgcagcctgg ccgcgtccta cgagggcaaa aaagaacctt ccggaaccta cttcatcgac 1080
acggtgctgg ccaatgcgtc caagatccac gaaggcaagg aaatgagcga ggccggccgc 1140
ctgctggtgg acatcgccgg aggcttcgtg gccgatctgc cttcggatcg cgacctggcc 1200
attcccgaag tcggggaact gctgaaaaaa tacctgaagg gggtggcgtc ggtgccggtg 1260
gaagaccgcg tcaaaatgta ccggctgatc gaaaagctcg tcatggaaag cgccgatacg 1320
atttcggaca tccatggagg cggttctccc gaggcccaca ggatcacgat cctgcgggaa 1380
agcaacctca aggccaagaa ggacgcggcc aagcggttgg cgggaatcga atcgaagtag 1440
1440
<210> 132
<211> 1461
<212> DNA
<213> Porphyromonas gingivalis
<400> 132
atgatgacta gcgaacagta cgtagaaagt cttcggaaac ttaatctgaa ggtttacttc 60
atgggtgaaa ggatcgaaaa ccctgtagat catcccatga ttcgtccctc aatgaattca 120
gtagctatga cttataagct tgctgagatg gacgaataca agcatttaat gacagcaact 180
tcaaacttga ctggtaagca agtgaatcgt ttctgccatc tacatcagag cacagaggat 240
ctgaaagaca aagtgaagat gcagcgtctc atgggacaaa aaacagcttc atgcttccag 300
cgttgtgtgg gaatggatgc attcaatgcc atctattcta ctacttacga aatggatcaa 360
gctctgggta ccacttatca caagcgtttc atcgagtaca tgaaatatgt acaagacaac 420
gacttggtcg tagatggagc catgacagac cccaaagggg atcgcggttt atctccctca 480
gaacaagccg atccggatct ttatctgcac attgttgaag ttcgtgaaga tgggatcgtc 540
gtttccggtg caaaggcaca ccaaaccgga gcagtcaatt cgcacgagca tctgatcatg 600
cctacgatcg ctatgcgcga agctgatgct gactatgccg tttcttttgc cgttcccagt 660
gatgcagagg gcgttattat gatctatggc cgccagtcat gcgacactcg caaaatggaa 720
gaaggggcag acattgacct cggcaactct gaattcggcg gacatgaagc tcttgttgta 780
ttcgaccgcg tattcgtgcc caatgaccgc gtgttcatgt gcaaagaata ccagtttgca 840
ggtatgatgg tagaacgttt cgccggatac caccgtcagt cttatggagg atgtaaagta 900
ggtgttggtg atgtacttat cggtgcagct gctctcgcag cagactacaa tggagttcct 960
aaggcatctc acattaagga taaactcatt gagatgatcc acctgaatga aaccctttat 1020
gcttgcggta ttgcatgctc ttcagaggga actcagatga aagccggcaa ctatatgatc 1080
gatttgctgt tagctaatgt ttgtaagcaa aatatcaccc gccttcctta tgaaatagct 1140
cgcttggcag aagatattgc aggaggtttg atggtaacca tgccttctca acaagacttc 1200
cgccatccgg aaataggccc gatcgtaaag aaatatcttg caggggcaac aggcaaatcg 1260
acagaaaacc gtatgcgtgt tctgcgtttg atagagaata tcacgctggg aacagctgcc 1320
gtcggttatc gaaccgagtc tatgcacgga gccggatctc ctcaagctca gagaatcatg 1380
atcgctcgtc agggagatct tgagggcaag aaaaagcttg cacgggcgat tgctcatatc 1440
gacgaatcac tcgataagta a 1461
<210> 133
<211> 1590
<212> DNA
<213> Polynucleobacter necessarius subsp. Asymbioticus
<400> 133
atgagtcaaa gcacctccca gttcatgaat agcaaagact atcaagagtc attgcgctca 60
ctaaagccaa ctgtctatgt cgatggtcga ttgatcgaat ccgtcgccga tgagccttct 120
cttcgccctg gagtccaagc cttaggagtg acttatgaca tggtccatga cccagcgcta 180
gcaccgctca tgttggctga ctcgaatggc actcctgtac caagaatgct gcacattaat 240
cagtcttctg gagatctctt aaataaatta gaagcggtac gtgtactctg ccaagaaact 300
ggatgtgccc aacgctattt agcccatgat gcgttaaatg cgattgcaca agtttctgcg 360
cgcattgatg atgccaaagg aagtaatgag catagtgcta aattttctga gtatctatcg 420
catgtacaaa cgaaggactt ggcattaggc attgccatga cagatgcaaa aggagatcgc 480
tcccgcagac ctcatgagca agaaaatcca gatacttacg tacatatcgt ttctcaagat 540
gctaaagggg tcgtgatctc gggtacaaaa gcgattgtga ctggcgcccc ttacatgcat 600
gaattcttag tcatgccagg tcgcaatatg actaaagagg atgcagcctt tgcgatttgc 660
tgtgctgtcc ctgtggatgc caaaggtatt acgattgtgg cacgcccagc gggacgccca 720
ggcgacaagg tcgagcatgg taaaccgata ttttctagta aatatggtca atcgactggg 780
gtagtgatat tcgataaagt attcgttccc tgggatcgtg ttttttatgc tggcgaatgg 840
gaacactcta gcgtgctgac ttataactac gccacccatc atcgtcatag ctgcatcgcg 900
gcgcgagcag gctttggaga tctgttaatt ggtgctggcg ctttaatgtg cgaagcgaac 960
ggattggatc cagcaaccaa atctaattta cgtgatccga tggttgaact cattaagatc 1020
actgaaggat tttatgcttg cggtgtggct gctagcgtct atggaacgca agatccgtac 1080
agtaaatcat ttatgcctga gccggtattt tctaatatcg gaaaactctt attagcaacg 1140
cagatttatg acatgcatcg cttggcacat gaagtatcgg gaggattaat cgtagcgttg 1200
ccaggaccag acgaagatca caacccagca actgcagcca ctttggcaga ggtgttacga 1260
gccaatccag ccgtccctta tgacaagcga attgaagttg cacggtttat tgaagatctc 1320
acagcgtctt atcaaggcgg ttggtattcc gtcattagcc tacatggtgg cggctctcca 1380
gcagcaatga agcaagaaat ctatcgtcag taccctattg gcaataaagt agagctagtg 1440
gaacgtttat tagatcgcgg agtgctgact agtagcgaag agcgggcgat tacgaaaaat 1500
aaacaacctg ggcgctgctg cgatcaaggc tgtagcgcgc caggacaagc agtgatggta 1560
cctttgccag agcctggcag aagaacttaa 1590
<210> 134
<211> 777
<212> DNA
<213> Gordonia terrae C-6
<400> 134
gtgaccgaac accagaccat cgtcgtcgag accagcggcc gggtgggcat catcaccctc 60
aaccgcccga aagcgctgaa cgcgctcaac accgagttga tgaacgaagt ggtcggcgcc 120
gtcaaggagt tcgacgtcga ccaggggatc ggcgccatcg tgatcaccgg ttcggagaag 180
gcgttcgccg cgggcgccga catcaaggag atgtcatcga agtcctacgc ggatgtggtg 240
aacgagcagt tcttcggcgc ctgggatgag ctgtcgcggg cgcgtacgcc gatcatcgcc 300
gcagtgaccg gctacgccct cggcggcggc tgcgaactcg cgatgctgtg cgacaccatc 360
atcgccggcg acaacgccgt cttcggtcag cccgagatca acctcggcgt catccccggc 420
atcggtggtt cgcagcgcct cacccgcgcc gtcggcaagg ccaaggcgat ggacatggtg 480
ctcaccggcc ggcagatgaa ggtcgacgag gccgagcgtc tgggcctggt ctcgcgggtg 540
gtgcccaagg aggactgccg cgccgccgcg atcgaagtcg ccgagataat cgcctcgaag 600
tcgctgatcg ccgccgcggc cgccaaggac gcggtcaacc gtgccttcga gtcgagcctg 660
gtggagggtg tccgcgccga gcgcgcgctg ttctactcga cgttcgcgac cgacgaccag 720
accgagggca tggccgcctt cgtcgagaag cgggacccga acttcaccca ccgctga 777
<210> 135
<211> 777
<212> DNA
<213> Halalkalicoccus jeotgali
<400> 135
atggcagaca gagtactcat cgaacgagag aatgacatag cgacgatcat cgttaatcgg 60
cctgagaagc gtaatgcgat ggatatcccg acgcgaaaag ccctctatgc cgccttcgaa 120
gaggttagcg aggatgacga tgtgcgggca atcgtgctcc gcggagcagg agatgggtcg 180
tttatcgccg gtggcgatat tgattctttc gccgacttcg accacatgga cggcatggag 240
tacagcgaga agtacgccca agggctgtac aactatgttg cggaccgcca caaaccaacc 300
atcgccgcgg ttgacggcta cgctctcggt ggaggcaccg aaatcgccct cgcttgcgac 360
attcgcctcg ccacggacga cgcgaagttc ggcctgcccg aagtcggcat cggcgtcatc 420
ccagccggtg gtggaacaca gcgactcgtt caagtcgtcg gagccgggct tgcaagcgaa 480
cttatcctca ctggccgcat tatcagcgcc gacgaggcaa agagaattgg tcttgcaaac 540
catgtctacg ccgccgagga attcgataat gaagtccgag ccatggccga agatcttgcc 600
tcgaaggcgc ctgtcgccca gcgacttgca aaagaatcca tccgacgtag ccttgatatc 660
gacgccggcc ttgaatacga gcgactggcc ggagcgtttc tgttcggcac cgacgaccag 720
aaagagggtg caaacgcctt ccttgaggac cgagagccga agtaccgaaa ccggtaa 777
<210> 136
<211> 774
<212> DNA
<213> Carboxydothermus hydrogenoformans
<400> 136
gtggaatttg aaaaaattaa atttgaggtt acggacggtt atgccgttat ttacctaaac 60
aacccgccgg taaatgctct tggccagaaa gttttaaaag atttacaaaa agctttgcag 120
gaaattgaga aaaatcccga gattcgggcg gtaataatta gcggggaagg tagcaaggtt 180
ttctgtgccg gggcagatat cacggaattt gctgaccggg ctaaagggat tttaccggaa 240
gtggaaggaa gtgttctttt ccggcaaatt gagcttttcc ccaagccggt gattgctgcg 300
ctgaacggta gctcctacgg cggaggaacc gaattagcga taagctgtca cctgcgcatt 360
ttagcagatg atgcttccat ggctttgccc gaagtaaaac tgggcattat ccctggctgg 420
ggaggtaccc agaggttacc ccggttaatt ggtaaaacca gagccctgga agcaatgctt 480
accggagagc caataacggc agaagaagcc ttaagctacg gtctggtaaa caaagtcgta 540
cccaaagacc aggtactaac agaagcccgg gcgctggcag ctaagcttgc caaaggggcg 600
cccatcgcta tgcgggaaat tttaaaggcg gtaactttag ggctggatac ttcaatagaa 660
gaaggtttaa aaattgagaa agaaggttcc aaagtggcgt ttagcagtga agatgcggtg 720
gagggaagaa ctgctttctt tgaaaaacgg ccgccgaatt ttaaaggccg gtaa 774
<210> 137
<211> 774
<212> DNA
<213> Thermomicrobium roseum
<400> 137
atgagcgtgc gtgtcgagcg ggagggggcg atcaccctcg tcacggtcga gcgcccggaa 60
cgactgaacg cgctcgatac cgcgacgttg cgtgccttac tcgcggcagt gcaggaactg 120
gcaacggagg aggcgatcgc tgtcgtcgtc ctcaccgggg caggcgatcg cgcgttcatc 180
gccggagccg atatcagcga gatggtagag aagtcgccag ccgaggcgct cgccttcgcc 240
gagttgggac acgccgtttg ccgggcgatc gaggaagcgc cgcaaccgta catcgcagcg 300
gtcaatggct acgcgctagg aggcggctgc gagatcgcgc tggcgtgcga tatccgcctc 360
gccagcgagc gcgccgtctt cgcccagccg gaagtaacgc tgggtattcc accaggctgg 420
ggcggatcgc aacggctgcc gcgcgtcgtt cctcctggta tcgcgcgcga gttgctctat 480
acggggcgcc gcgtcgatgc gcaggaagca ctgcggatcg ggctcgtcaa tgccgtctat 540
ccggctgacc aactcctcga gcgagctcgg gaactggcga accggatcgc ggccaacggg 600
ccactcgcgg tccgcttgac caaggcggcg gttcgcttcg gtctcgagca ggggctggaa 660
gctggactga cctacgagcg gcaggtgttc gcgtacgcgt tcaccaccga ggatcagcgg 720
gaggggatgc gggcatttct ggaaaagcgt cgtccggctt ttcgcgggcg ctga 774
<210> 138
<211> 825
<212> DNA
<213> Methylobacterium extorquens
<400> 138
atgaacgctg acgccgagac cgcctcgacc gacgaactgc tcttcgcggt ggatgcggcg 60
ggcatcgccc gcatcaccct caaccggccg aaggcgcgca acgcgctgac cttcgcgatg 120
tatcgcgggc tggtggagtt gtgcgagcgg atcgaggcgg accacgcgat caaggcggtg 180
atcatcaccg gcgccgggga caaggcgttc gcggcgggta ccgacatcgc ccagttccgt 240
agcttcagca aaccggaaga cgcgatcggc tacgagcgct tcatggaccg ggtgctcggc 300
ggcctggagc gcctgcgggt gccgaccatc gcggcggtcg ccggagcctg caccgggggc 360
ggtgcagcga tcgctgcggc ctgcgacatg cgcatcgcca gccgcgacgc ccgcttcggc 420
atccccatcg cccgcacgct cggcaattgc ctctcgcaga acaccctgag gcggctggcg 480
aacctcattg gggcgccccg cgtgaaggac attctgttca ccgctcggct cgtcgaggcg 540
caggaggctc tggcgatcgg cctcgtcaac gaggtggtcg aggatgccgc ggccgtcgcg 600
gcccgagcgg atgcgctggc caccctgctc gcgagccacg cgcccctcac cctccaggcc 660
accaaggaag gcctgcgccg catcggcgag gagggcgcgg cggaggccgc cgagggcgag 720
cggcccggcg acgacctgat cgtgatgacc tatatgagcg cggatttccg ggagggcatg 780
gaagccttcc tgggcaagcg cccgccgaac ttcaaagggc gctga 825
<210> 139
<211> 1224
<212> DNA
<213> Clostridium sporogenes
<400> 139
atgagtgata gaaataagga agtaaaagaa aaaaaggcaa agcattatct tagagagatt 60
actgcaaagc attacaaaga agctctcgaa gcaaaagaaa ggggagaaaa ggttggttgg 120
tgtgcatcta acttcccaca agaaatagct acaacattgg gggtaaaagt tgtttatcca 180
gaaaatcatg cagcagctgt agcagctaga gggaatggac aaaatatgtg tgaacatgct 240
gaggctatgg gtttttctaa tgatgtatgt ggttatgcaa gagtaaattt agctgttatg 300
gacataggtc atagtgaaga tcaaccaata cctatgccag actttgtact ttgctgtaat 360
aacatttgta atcaaatgat taaatggtat gagcatatag caaaaacttt agatatacca 420
atgattctta tagatatacc atacaataca gaaaatactg tttcacaaga tagaattaaa 480
tatattagag cacaatttga tgatgcaata aaacaattgg aagaaataac aggcaaaaaa 540
tgggatgaaa ataaatttga agaagttatg aaaatatccc aagaaagtgc aaaacaatgg 600
ttaagagcag catcctatgc aaagtataaa ccttcaccat ttagcggatt tgatttattt 660
aatcatatgg ctgtagcagt ttgtgcaaga ggtacacaag aagctgcaga tgcatttaag 720
atgttagcag atgaatatga ggagaatgta aaaactggaa aatccactta taggggagaa 780
gaaaaacaac gtatattatt tgaagggatt gcctgttggc catatttgag acataaatta 840
actaagctta gtgaatatgg tatgaacgta actgcaactg tatacgcaga agcctttggt 900
gttatatatg agaatatgga tgaattaatg gctgcttata ataaagttcc taattcaatt 960
agttttgaaa acgcattaaa aatgagatta aatgctgtta caagcactaa tacagaaggt 1020
gctgttattc atataaatag aagctgtaaa ttatggagtg gatttttata tgagctagca 1080
agaagattag aaaaggaaac aggaattcct gtagtatcat ttgatgggga ccaggcagac 1140
ccaagaaatt tctcagaagc tcaatatgat actagaattc aaggacttaa tgaagtaatg 1200
gttgctaaaa aggaggctga ataa 1224
<210> 140
<211> 1125
<212> DNA
<213> Clostridium sporogenes
<400> 140
atgtcaaatt cagataaatt ttttaatgac tttaaggata ttgtagaaaa tcctaaaaaa 60
tatataatga agcatatgga acaaactgga caaaaggcta taggatgtat gccattatat 120
actcctgagg aacttgtatt agctgctgga atgtttccag taggggtatg gggaagcaat 180
acagaacttt caaaagctaa aacatatttc ccagcattta tttgttcaat attacaaaca 240
acattggaaa atgcattaaa tggagaatat gatatgttat ctggtatgat gattacaaat 300
tattgtgatt cattaaaatg catgggacaa aattttaaac taaccgttga aaatattgag 360
tttatcccag taacagttcc acaaaataga aaaatggaag ctggaaaaga gtttttaaaa 420
agtcaatata aaatgaatat tgagcaatta gaaaagattt ctggtaataa aataacagat 480
gaatctttag aaaaagctat agaaatatat gatgaacaca gaaaagtaat gaatgacttt 540
tcaatgttag catcaaaata tccaggtata ataacaccaa ctaaacgtaa ttatgttatg 600
aaatctgctt attatatgga taaaaaagaa catactgaaa aagttagaca attaatggat 660
gaaattaaag ctatagaacc aaaaccattt gaaggaaaga gagttataac tacaggtata 720
attgcagatt cagaagattt acttaaaata ttagaagaaa ataatatagc tatagttggt 780
gatgatatag cacatgaatc tagacaatat agaacattga ctccagaagc gaacacacca 840
atggataggt tagctgagca atttgctaat agagaatgta gtactttata tgatcctgaa 900
aagaaaaggg gtcaatatat agtagaaatg gctaaagaga gaaaagcaga tggaattata 960
tttttcatga caaaattctg tgacccagag gaatatgatt atccacaaat gaaaaaggat 1020
tttgaagaag caggcattcc acatgtacta atagaaactg atatgcaaat gaaaaattat 1080
gaacaagcta gaactgcaat tcaggctttt tcagaaacac tttaa 1125
<210> 141
<211> 795
<212> DNA
<213> Clostridium sporogenes
<400> 141
atggcagaca tttatactat gggtgtagac ataggttcaa ctgcatcaaa aacagtagta 60
ttaaaaaatg gtaaagaaat tgtaagtcaa gcagtaataa gtgtaggggc cggaacaagt 120
ggccccaaga gagctataga ttctgtatta aaagatgcta aattatccat tgaagattta 180
gactatattg tatccactgg atatggaaga aatagtttcg attttgctaa caaacaaatt 240
tctgaattaa gttgtcatgc aaaaggggtc tatttcgata acaataaagc tagaacagtt 300
attgatatag gcggacaaga tattaaagta ttaaaattag cggatagtgg aagactttta 360
aactttataa tgaatgataa atgtgctgca ggaacgggac gatttttaga tgtaatgtct 420
agagtaatag aagttccagt tgatgagtta ggaaaaaaag cattagaaag caaaaatcct 480
tgtactatta gttctacctg tacagtattt gcagagtcag aagtaatttc tcaacttgca 540
agaggagtta aaactgaaga tttgatagca ggaatttgta aatctgtagc atcaagagtg 600
gctagccttg caaagagaag tggtatagaa gaattagtag ttatgagtgg aggagtagct 660
aaaaatatag gtgtagtaaa ggcaatggaa gcagaattgg gaagagacat atatatatct 720
aaaaattctc aattaaatgg agcattggga gcaagtctat acgcttatga aagttttcaa 780
aaagaaagga gctaa 795
<210> 142
<211> 1239
<212> DNA
<213> Clostridium sporogenes
<400> 142
atggaaaaca atacaaatat gtttagtgga gtaaaggtta ttgaattagc aaattttata 60
gctgctccag cagcaggtag attttttgct gatggtggtg cagaggtaat aaaaattgaa 120
tcacctgctg gagatccttt aagatatact gctccttcag aaggaagacc attaagccaa 180
gaagaaaata ctacttatga tttggaaaat gcaaataaaa aagcaatagt attaaatctt 240
aaaagcgaaa aaggtaaaaa gatattacat gaaatgttag cagaagcaga tatattatta 300
actaattgga gaacaaaggc tttagttaaa caaggattag actatgaaac actaaaagaa 360
aaatatccta aattagtttt tgcacaaata actggttatg gtgaaaaagg accagataaa 420
gatcttccag gctttgatta tactgcattt ttcgctagag gcggtgtttc aggtactctt 480
tatgaaaaag gaactgtgcc tccaaatgtt gttccaggac ttggagacca tcaagctggg 540
atgtttttag cagcgggtat ggcaggagct ttatataaag caaaaacaac aggacaagga 600
gataaagtaa cagtaagttt aatgcatagt gctatgtatg gactaggtat tatgatacaa 660
gctgctcaat ataaagatca tggattagta tatccgataa atcgtaatga aactccaaat 720
ccttttatag tttcatataa atctaaggat gattactttg ttcaagtatg tatgccacca 780
tatgatgttt tctatgatag atttatgacc gctttaggaa gagaagattt agttggagac 840
gaaagataca ataaaataga aaatttaaaa gatggacgtg ctaaggaagt atacagtata 900
atcgaacaac aaatggttac aaagacaaag gatgaatggg ataacatatt tagagatgca 960
gacattccat ttgctatcgc acaaacttgg gaagatttat tagaagatga acaagcttgg 1020
gcaaatgatt atttgtataa gatgaaatat ccaacaggaa acgaaagagc attagtaaga 1080
cttccagtat tctttaaaga agcaggatta ccagaatata atcaatcacc acaaatagca 1140
gaaaatactg tagaagtttt aaaagaaatg ggatatacag aacaagagat tgaggaatta 1200
gaaaaagata aagatataat ggtaaggaag gaaaaataa 1239
<210> 143
<211> 1107
<212> DNA
<213> Lachnoanaerobaculum saburreum
<400> 143
atgtggcatt gtttagaaac tttaaaaaag attagtgcgt ctccaaagga acagcttaat 60
aaataccttg aagaaggaaa aaaagttatt gctgttgcac cggtttatac acctgaggag 120
attatccatg cttttggatt tgtacctatg ggggtatggg gcgcagatat tgaaattaat 180
gagtcaaaaa aatattatcc tgcatttatt tgctcaataa tgcagacagt attggagctg 240
ggaataaagg gaaattataa cggagttagt gctatagtgg ttccttcgct atgtgactca 300
ttaaaaactt tgggacaaaa ttggaaatat gcggtaaagg acattccttt tataccaatg 360
acctatccac aaaatagaaa atctgattat gctgttgatt tcacattgga gatgtataag 420
agagtgatca gtgatttgga aaatattacc ggagaaaagt ttgatgaagg taaactcaaa 480
aacacttatg aaatttataa tgagcataat agggttatga gagaatttac aaaagtttcg 540
gaagagtatg aagtttcggc aacagataga agtgcagtat ttaaaagtgc ttggtttatg 600
cttaaggagg aacatacaga acttgttagg gaattgatcg aacttataaa aaaagagggt 660
aaaatatcta agaagctaag aatttataca acaggaatat tggcggatgc accggattta 720
ctcaatattt ttgacagcaa taatatgcaa atcgtaggtg atgatattgc ttatgaatcc 780
agacagtata gaacagatat acccgatgga aatggtttat atgctcttgc aaagaagttt 840
tcaaatatgg acaactgtac tcttttatat gataaggata agagaagggt tgactttatt 900
attgaagaag caaagaaaaa aagagctgac ggaatagtag ttcttatgac caagttttgc 960
gatcctgaag aatttgacta tgtgcctata aagagggcgg caaatgaagc aggtattcca 1020
catatcaata tagaagtgga tagacaaatg aaaaattatc aacaggcaaa tactatgtta 1080
caaacatttg cagacatgtt ggtttag 1107
<210> 144
<211> 1230
<212> DNA
<213> Lachnoanaerobaculum saburreum
<400> 144
gtggaagaag ctaaaaaaca aaagcctaca gttgatccaa acagcgcaaa ggctagattg 60
ggcaggatag cagcaaaagc atatagtgac tgtgttgagg ctaaaaagcg aggagaattg 120
gtaggatggt gtgcaagtaa ttttccggtg gagatacctg agaccttggg attgtacgta 180
tgttaccctg agaatcaggc ggcaggtatt gctgccagag gcggtggaga acgaatgtgc 240
agtgagagtg aaggtgacgg atactctaat gatatatgcg catatgcaag aatttcgctt 300
gcatatatga agctgaagga agctcctgaa caggatatgc cacagcctga ctttgttcta 360
tgttgtaata atatatgcaa ctgcatgatt aagtggtatg aaaatatagc aaaagaactt 420
aatattccta tgattatgat tgatatacct tttaatcctg attatgaagt ttcagatgct 480
atgacagcat atatcagaaa tcagttttgg gatgcaatac atcaattgga ggaaattaca 540
ggcaaaaaat ggagtaatga aagatatgaa gaggtaagga aaatatcagg aagaagctcc 600
agagcatggc ttgaggctac agcgactgcc aaatattcac catctccgtt taacggattt 660
gatttattaa atcatatggc ggttatggtt actgccagag gaaaacttga agctgcagaa 720
gcaatggaaa cacttttgca ggagtacaag gataatcatg agaagggaga gtctacgttc 780
aagggagaag aaaaatatag aataatgttt gagggtatag catgctggcc atggcttcgt 840
gctactgcta caggacttaa gagtcgtgga atcaatatgg ttacaactat atatgcggat 900
gctttcggat ttatctatga tgactttgac ggaatgtgca gagcatatgc caatgttcct 960
aattgtatga atatagagca tgcaagagat aagagaataa aactttgtaa ggacaatagt 1020
gttgaagggc ttctcgttca cacaaacagg tcttgtaaac tttggtcagg atttatgtct 1080
gaaatgagca ggcaaatagg tgaagaatgt ggtattccgg ttgtaagctt tgatggagac 1140
caagcagatc caagaaattt ctcagaggct caatatgata cgagagttca gggattgaca 1200
gagataatgg aagcaaataa ggaaatttaa 1230
<210> 145
<211> 771
<212> DNA
<213> Lachnoanaerobaculum saburreum
<400> 145
atgtacacat tgggtgttga tataggctca actacatcca aagcggtaat attggaggat 60
ggagaaaata tagttgcatc ttcaattgtt atagcaactg taggaacggc aggagtagaa 120
gaggctgtaa aaaatgtact aaacttttca aaactcgaac taaatgacat taaagcagtg 180
gttgctacag gatatggaag aatgaattat gatgtagcag attacaaggt tagtgaattg 240
acatgtcatg cattaggtgt acataaggag ttcccgaatg tcagaactgt aattgatatc 300
ggaggtcagg atgccaaggt aatatctctt gcggcaaacg gtaagatgac aaattttgtt 360
atgaatgata aatgtgcggc agggacaggt agatttcttg atgtaatggc taatatatta 420
aatcttgata tacaggattt ggaggtggaa gccttaaaat cagataatcc ggcaaatata 480
tcaagtactt gtacagtttt tgcggaatcg gaagtcatat cacagcttgc tacaggaaga 540
aatattcctg atttggttgc agggatatgc aaatctgttg cagtaagggt tgccgccctg 600
gctaaacgag taggtatagt tgaagaagtg tgtatgagcg gcggagtggc aaaaaactcg 660
ggtgtgagga atgctatgag taaagagctt ggtgtagata tagtgtttag taaggatgct 720
caacttatgg gagcacttgg agccgcaata tacggtttta aaaagttata a 771
<210> 146
<211> 795
<212> DNA
<213> Peptostreptococcus stomatis
<400> 146
atgagcagtg tatacacaat gggtattgac attggatcaa catcatcaaa gtgtgtgata 60
atgaaggatg gtaaggaaat tgtaagtgaa ggtgtagtta gcttgggtgc tggaactaag 120
ggttctgacc tagttattga ggaagtgctt ggtaaggcag gaatgacttt cgatgaaata 180
gacctaatcg tatcgactgg atatggtaga aatagctatg aaagagctgc caagactgtt 240
agtgagctta gttgtcatgc caagggtggt ggatatatct ttggtggtgc cggaactatt 300
atagatatcg gtggtcagga tataaaggta ttgaagctaa atgacaaggg tggtcttgtt 360
aacttcctga tgaatgataa gtgtgctgcc ggtacaggta ggttcttgga agttatgtct 420
ggcgtattgg atgtaaagct agatgaacta ggggaactag atgccaaggc tacagaagtt 480
acaccaatca gttctacatg tacagttttt gctgagtcag aagttatatc atgtatggct 540
aagaagattc ctctagaaaa tatcataaga ggtatacacg catctgttgc aacaagggtt 600
gctagtttgg caagaagagg tggtttgaag actcctgtag ccatgacagg tggagttagt 660
aagaacaagg gtatagtaag ggctcttaaa gaagaactag aatgtgatat cttgatatct 720
cctgattctc agatggctgg tgctataggt gcagccctat atgcatatga cgaataccag 780
aagcaaaacg cttaa 795
<210> 147
<211> 1119
<212> DNA
<213> Peptostreptococcus stomatis
<400> 147
atgagtaata tagatgtatt gttaggtaaa cttgatgtaa gtcttttggg acaggtagac 60
aagtatgttt cagaaggtaa gaaggtaata ggttgcgcgc cagtttatac accagaagaa 120
ttagtatatg ctgcaggcat ggtaccaatt ggtgtatggg gtgcagaagg tgaagtaggt 180
ctatcaaagg aatacttccc agcattttat gcagctataa tccttagatt aatggacctt 240
ggtttagaag gtaagcttga caagatgtca ggtatgatta taccgggact aagtgacggt 300
ctaaagggac ttagccagaa ctggaagagg gctataaagc aggttccggc cctatacata 360
ggctatggtc agaacagaaa aattgaagct ggtattactt acaatgaaaa gcagtacatc 420
aagctaagag gacagttaga agaaatagct ggttgcaaga tagaagatgc taaggttgaa 480
gaggctatag ttctttacaa caagcacaga aaggcaatgc aggaattcag ttctctagca 540
gctagtcact taaatactat tacacctatt ctaagagcta gagtaatgac aagtgccttc 600
ttgttcgaca aggcagaaca tttagctata ttggaagaat tgaataaaga attaaaggcg 660
ttacctgaag aaaaatttgc tggcaagaag gtagttacta ctggtattct tgcaaatagc 720
ccaggtatgc tagaaatact agatgagtac aaacttggta tagttgatga caatatcaac 780
catgaatcag gccagtttga ctacctagtt gatgaaggta ctggtaatcc agttagagcc 840
ttatctaagt ggatttcaga tatagaagga agtactttgt tgtatgatcc agaaaaactt 900
aggggacaga taataattga caaggttaag aagcatcagg cagatggtgt tatataccta 960
atgactaagt ttagtgattc tgatgaattc gactatccaa tcatcagaaa agaattagaa 1020
aatgcaggta tcttgcatat actagttgag gttgatcagc aaatgactaa ctttgaacag 1080
gcgaaaacag cattacagac tttcgctgat atgatttaa 1119
<210> 148
<211> 1236
<212> DNA
<213> Peptostreptococcus stomatis
<400> 148
atgagtaata caggaatggt agaagaaaag ccggcaaaag tattgttagg agaaattgtt 60
gcaaagcact ataaggaagc ttgggaggca aagaataatg gtgaactagt tggatggtgt 120
gcatctaact tcccacagga aatattcgaa actatggata taaaggttgt ttatccagaa 180
aaccaggctg ctgctatatc tgctaagggt ggcggacaga gaatgtgcga aatagctgaa 240
aatgaaggat attcaaatga tatctgtgct tacgctagaa tatctttggc atacatggac 300
gttaaggatg ctccagaatt aaatatgcca cagccagact tcgttgcttg ctgtaacaat 360
atttgtaact gtatgatcaa gtggtatgaa aatatagcta aggaattgaa tataccttta 420
attttaatag acgttcctta caacaatgac tacgaggctg aagacgatag agttgaatat 480
ctaagaggtc agtttgatta tgctatcaag cagttagaag aactaactgg caagaagtgg 540
gatgaaaaga agtttgaaga agtaatggaa gtttctcaga gaacaggtag ggcttggtta 600
aaggctactg gatatgctaa gtatactcca tcaccattct caggctttga cgtattcaac 660
cacatggctg ttgcagtttg tgcaagaggt aagatagaat cagctatagc attcgaaaag 720
ctagctgaag aatttgacga aaacgtaaga actggtaagt caacatttaa gggcgaagaa 780
aagttcaggg tgttatttga aggtatagca tgttggccac acctaagaca tacattcaag 840
cagcttaagg atgctggtgt taatgtctgt ggtacagtat atgcggatgc tttcggatat 900
atctatgaca atacatatca gttaatgcag gcttactgcg gaactccaaa tgctatttca 960
tacgaaaggg caactgatat gagactaaag gttattgaag aaaacaatat agatggtatg 1020
ttaatccaca tcaacagaag ttgtaagcag tggtcaggta tcatgtacga gatggaaaga 1080
gatattagag aaaagactgg tataccaaca gctacattcg atggtgacca ggccgatcca 1140
agaaacttct ctgaagctca gtatgatact agagtacagg gtcttataga actaatggaa 1200
gctaataaag ctgcaaagat gaaggaggcg cactaa 1236
<210> 149
<211> 1227
<212> DNA
<213> Clostridium difficile
<400> 149
atgtctgaaa aaaaagaagc tagagtagta attaatgatt tattagctga acaatatgca 60
aatgcattta aagctaaaga agaaggaaga cctgtaggtt ggtcaacatc agtatttcct 120
caagagttag cagaagtatt tgacttaaac gtattatatc cagaaaacca agcagctgga 180
gtagcagcta aaaaaggttc tttagaatta tgtgaaatag ctgaatctaa aggatattct 240
attgacctat gtgcatatgc aagaacaaat tttggtcttt tagaaaatgg tggatgtgaa 300
gctttggata tgccagctcc agatttccta ctttgctgta acaatatatg taaccaagtt 360
ataaaatggt atgaaaatat ttcaagagaa ttagatatac ctttaataat gattgataca 420
actttcaata atgaagacga agttactcaa tcaagaatag attatattaa agctcaattt 480
gaagaagcta taaaacaact agaaattata tcaggaaaga aatttgaccc taagaagttt 540
gaagaagtaa tgaaaatatc agctgaaaac ggaagactat ggaagtattc tatgagttta 600
ccagcagatt cttctccttc tccaatgaat ggatttgact tatttactta catggctgta 660
atagtttgtg ctagaggtaa aaaagaaact acagaagcat ttaagttact tatagaagaa 720
ttagaggaca acatgaaaac tggtaaatct tctttcagag gggaagaaaa atacagaata 780
atgatggaag gtataccttg ttggccatat ataggataca agatgaaaac attagctaaa 840
tttggagtta acatgacagg tagtgtttac ccacatgctt gggcattaca atatgaagtt 900
aatgatttag atggaatggc agtagcatat agtactatgt ttaacaatgt aaacctagac 960
cgtatgacaa aatatagagt tgattcttta gtagagggta aatgtgatgg agcattctat 1020
catatgaaca gaagctgtaa acttatgagt ttaatacaat atgaaatgca aagaagagca 1080
gctgaagaaa ctggattacc atatgctgga tttgatggtg accaagcaga ccctagagct 1140
ttcactaatg ctcaatttga aacaagaatt caaggtttag ttgaagtaat ggaagaaaga 1200
aaaaaactta atagaggtga gatataa 1227
<210> 150
<211> 1125
<212> DNA
<213> Clostridium difficile
<400> 150
atggaagcta ttttatctaa aatgaaagaa gtagttgaaa atccaaatgc ggctgtaaaa 60
aaatataaaa gtgaaactgg taaaaaagct ataggttgtt tcccagttta ttgcccagaa 120
gaaattatac atgcagctgg aatgcttcca gttggtatat ggggaggaca aacagaatta 180
gatttagcta aacaatattt ccctgcattt gcatgttcaa taatgcaatc atgtttagaa 240
tatggattaa aaggtgctta tgatgaatta tctggagtta ttataccagg tatgtgtgat 300
acactaattt gtttaggaca aaactggaaa tcagcagtac ctcatataaa atatatatca 360
ttagtacacc cacaaaatag aaaacttgaa gctggtgtaa aatacttaat cagtgagtac 420
aaaggcgtaa aaagagaact tgaagaaatt tgtggatatg aaatagaaga agcaaaaatt 480
catgaaagta tagaagttta caatgaacat agaaaaacta tgagagactt tgttgaagta 540
gcttataaac attctaatac tataaaacca tcaataagaa gcttagtaat taagagtggg 600
ttctttatga gaaaagaaga acatactgag ctagtgaaag atttaatagc aaaattaaat 660
gctatgccag aagaagtctg ttctggaaag aaagttttat taacaggtat attagctgat 720
tctaaagata tattagacat tttagaagac aacaatatat cagttgtagc tgacgactta 780
gcacaagaaa caagacaatt cagaacagat gtaccagcag gtgatgatgc gttagagaga 840
ttagcaagac aatggtcaaa catagaagga tgttcattag cttatgaccc taagaaaaaa 900
cgtgggtcac ttatagtaga tgaagttaaa aagaaagata tagatggtgt tatcttctgt 960
atgatgaaat tctgtgaccc agaagaatac gattatcctt tagttagaaa agatatagaa 1020
gatagtggaa tacctacttt atatgttgaa atcgaccaac aaactcagaa taatgaacaa 1080
gccagaactc gtattcaaac ttttgctgag atgatgagtt tagcg 1125
<210> 151
<211> 798
<212> DNA
<213> Clostridium difficile
<400> 151
atgtacacaa tgggattaga tataggttca actgcatcaa agggagtaat cttaaagaat 60
ggggaagata ttgtagcttc tgaaacaata tcctctggta ctgggactac tggaccatca 120
agagttttag aaaaattata tggcaagaca ggtcttgcaa gagaagatat taaaaaagtt 180
gtagttacag gatatggaag aatgaactat tcagatgctg ataagcaaat aagtgaatta 240
agctgtcatg ctagaggggt aaatttcata attccagaga caagaaccat tattgacata 300
ggtggtcaag atgcaaaggt attaaaatta gataataatg gaagactatt aaactttctt 360
atgaatgaca aatgtgctgc aggtacagga agatttttag atgtaatggc aaaaataata 420
gaggttgatg tatctgaact cggaagtata tctatgaatt ctcaaaatga agtatcaata 480
agcagtacat gtacagtatt tgcagagtct gaggttatat cacatttatc tgaaaatgca 540
aaaattgaag atatagtggc aggtattcat acttcagtag caaagagagt ttctagccta 600
gtaaaaagaa taggagtaca aagaaatgta gttatggttg gtggggttgc tagaaatagt 660
ggtattgtaa gagctatggc aagagaaatc aacacagaaa ttattgtacc tgatatacct 720
caattaactg gtgctttagg agcagcgtta tatgcttttg atgaagcaaa agaatcacaa 780
aaagaagtga aaaatata 798
<210> 152
<211> 1194
<212> DNA
<213> Clostridium difficile
<400> 152
cttttagaag gagttaaagt agtagaactt tcaagtttca tcgcagcacc atgttgtgca 60
aaaatgttag gtgactgggg tgcagaggtt attaagattg aacctataga aggtgatgga 120
ataagagtta tgggtggaac atttaaatct ccagcatcag atgatgaaaa ccctatgttt 180
gaattagaaa atggaaataa aaagggtgta agtattaatg taaaatcaaa agaaggagta 240
gaaatattac ataaattatt atcagaagca gacatatttg taactaatgt tagagttcaa 300
gcattagaaa aaatgggtat agcttatgac caaataaaag ataagtatcc aggattaata 360
ttctctcaaa tattaggata tggtgaaaaa ggacctttaa aagataaacc aggatttgac 420
tatactgcat acttcgcaag aggaggagtt agccaatctg ttatggaaaa aggaacatct 480
ccagcaaata cagcagcagg atttggtgac cactatgcag gtctagcact agcagcagga 540
agtttagcag cattacataa aaaagctcaa actggtaaag gtgagagagt aacagtaagt 600
cttttccata cagctatata tggaatggga acaatgataa caacagcaca atacggaaat 660
gaaatgcctt tatcaagaga aaatccaaac agcccattaa tgactacata taaatgtaaa 720
gatggaagat ggattcaatt agctttaata caatacaaca agtggttagg caaattctgt 780
aaggttataa atagagaata tatattagaa gacgatagat ataataacat agattcaatg 840
gttaatcatg ttgaagattt agttaagata gttggagaag ctatgttaga aaaaacatta 900
gacgagtggt cagctttatt agaagaagca gacttaccat ttgaaaaaat tcaaagctgt 960
gaagatttat tagatgacga acaagcttgg gcaaatgact tcttatttaa gaaaacatac 1020
gatagcggaa atacaggtgt cttagttaat actccagtta tgtttagaaa tgaaggaatt 1080
aaagaatata caccagcacc aaaagtaggt caacatactg tagaagtatt aaaatcttta 1140
ggctacgatg aagagaaaat aaataacttt aaagatagta aagttgtaag atat 1194
<210> 153
<211> 768
<212> DNA
<213> Escherichia coli (strain K12)
<400> 153
atgagcgaac tgatcgtcag ccgtcagcaa caagtattgt tgctgaccct taaccgtccc 60
gccgcacgta atgcgctaaa taatgccctt ctgacgcaac tggtaaatga actggaagct 120
gcggctaccg atagcagcat ttcggtctgt gtgattaccg gtaatgcacg cttttttgcc 180
gctggggccg atctcaacga aatggcagaa aaagatctcg cggccacctt aaacgataca 240
cgcccgcagc tatgggcgcg attgcaggcc ttcaacaaac ctctcatcgc agccgtcaac 300
ggttacgcgc ttggtgcggg ttgcgaactg gcattgttgt gcgatgtggt ggttgccgga 360
gagaacgcgc gttttggttt gccggaaatc actctcggca tcatgccagg cgcaggagga 420
acgcaacgtt taatccgtag tgtcggtaaa tcgttagcca gcaaaatggt gctgagcgga 480
gaaagtatca ccgctcagca agcacagcag gccgggctgg ttagcgacgt cttccccagc 540
gatttaaccc tcgaatacgc cttacagctg gcatcgaaaa tggcacgtca ctcgccgctg 600
gccttacaag cggcaaagca agcgctgcgc cagtcgcagg aagtggcttt gcaagccgga 660
cttgcccagg agcgacagtt attcaccttg ctggcggcaa cagaagatcg tcatgaaggc 720
atctccgctt tcttacaaaa acgcacgccc gactttaaag gacgctaa 768
<210> 154
<211> 774
<212> DNA
<213> Rhodobacter capsulatus
<400> 154
atgagctatc acacgatccg ctacgagatc tccgaagggc tggcggtgat cacgctcgat 60
cgccccgagg tgatgaatgc gctgaacgcg gcgatgcggc acgaattgac cgcggcgctg 120
caccgcgcgc ggggcgaggc gcgggcgatc gtgctgaccg gatcggggcg ggccttttgc 180
tctgggcagg atctgggcga tggcgcggcc gaggggctga acctggaaac cgtgctgcgc 240
gaggaatacg agccgctttt gcaggcgatt tacagctgtc cgctgccggt tctggcggcg 300
gtgaacggcg cggcggcggg ggcgggggcc aatctggctc tggcggccga tgtggtgatc 360
gcggcgcaat ctgcggcctt catgcaggct ttcacccgga tcgggctgat gccggatgcg 420
ggcgggacct ggtggctgcc gcggcaggtc ggcatggccc gcgccatggg gatggccctg 480
ttcgccgaga agatcggcgc cgaagaggcc gcgcgcatgg ggctgatctg ggaagccgtg 540
cccgatgtcg atttcgagca tcactggcgg gcccgggcgg cgcatctggc gcggggccct 600
tcggcggcct ttgcggcggt gaagaaggcc tttcatgccg gtctgagcaa tcccctgccc 660
gcgcagctgg cgctggaagc ccggttgcag ggcgaactgg gccagagcgc ggatttccgc 720
gagggcgtgc aggcctttct ggaaaagcgc ccgccgcatt tcaccgggcg ctag 774
<210> 155
<211> 2106
<212> DNA
<213> Pseudomonas stutzeri
<400> 155
atgacggatg tcattcggct cgaacgccgg ggcgatatcg ctctgatcct ggtcaacaac 60
ccgccggtca acgcccttgg ccatgccgta cgaaaaggcc tgttggatgc ctttcaagag 120
gctgacgagg cgcccgaggt gacggccgtg gtgctggtct gcgaaggccc gaccttcatg 180
gccggcgccg atatcaagga gttcggcaaa ccgccgcagg caccgagcct gccggaggtg 240
atcgaggtga tcgagggctg ccgcaagccg agcgtcgcgg tgatccacgg caccgccctg 300
ggtggtgggc tggaggtcgc gctgggctgc cattaccgta tcgcccggtc ggacgccaag 360
gtcggcctgc cggaggtgaa gctgggcctg ctgcccggcg ccggcggtac ccagcgcttg 420
ccgcggctgg ccggtgtcga gaaggcgctg gagatgatcg tcagcggcca gcccatcggt 480
gcggcggagg cgctggagca ctatatcgtc gacgagctgt tcgaaggcga tctgatcgag 540
gccggtctga cctatgcgcg tcgccttgtc gaggagggcc gcggtccgcg ccgcagtggc 600
gagcagaccc gcggtctgga aggcgtcgac aacgaggcgc tgattcgcgc caagcacgcc 660
gaggtggcca agcgcatgcc ggggctgttc tcgccgctgc gctgcattgc cgcggtggaa 720
gccgccacca ggctgccgct ggccgaaggc ctcaagcgcg agcgcgagtt gttcaccgag 780
tgcctgaatt caccgcagcg cggcgcgctg atccattcgt tcttcgccga gcgtcaggcc 840
ggcaagatcg acgacctacc atccgacgtc accccccgcc cgatcaggac cgccgcggtg 900
atcggcggcg gcaccatggg cgtcggcatc gccttgagct tcgccaacgc cggggtgccg 960
gtgaagctgc tggaaatcaa tgacgaggcg ttgcaacgcg gcctgcagcg tgcccgcgaa 1020
acctacgcgg cgagcgtcaa gcgcggcagc ctgaccgagg atgcgatgga gcagcgcctc 1080
gcgctgatcg ctggcgtcac cgactacggc gccctggctg atgccgacgt ggtggtcgag 1140
gccgtgttcg aagagatggg cgtcaagcag caggtcttcg agcaactgga tgcggtgtgc 1200
aagccgggtg cgatcctcgc ctccaacacc tcgtcgctgg acctgaacgc catcgccggc 1260
ttcaccaggc gccccgagga tgtggtcggc atgcacttct tcagcccggc caatgtcatg 1320
cgcctgctgg aagtggtgcg cggtgagcgg accagcgatg aagtgctcgc cgccgccatg 1380
gcgatcggca agcagctgaa gaaggtctcg gtggtggtcg gcgtctgcga cggcttcgtc 1440
ggcaaccgca tggtcttcca gtacggccgc gaggcggagt tcctgctgga ggaaggcgcc 1500
acgccacaac aggtcgacgc tgccctgcgc aatttcggca tggccatggg accgttcgcc 1560
atgcgcgatc tgtccggtct cgacatcggc caggcgatcc gcaagcgcca gcgcgcgacg 1620
ctgccggcgc acctggattt tcccaccgtc tcggacaagc tctgcgccgc cggcatgctg 1680
gggcagaaga ccggtgccgg ctactaccgc tacgaacccg gcaaccgcac cccgcaggag 1740
aatcccgacc tcgcgcccat gctggaagcc gcgtcgcggg aaaagggcat cgagcggcag 1800
gcgctggacg agcagtacat cgtcgagcgc tgcatcttcg cgctggtcaa cgagggcgcg 1860
aagattctcg aggaaggcat tgcccagcgc tccagcgaca tcgacgtcat ctacctcaac 1920
ggctacggct tcccggcctt ccgcggcggg ccgatgtact acgccgacag cgtcggcctg 1980
gacaaggtgc tggcgcgagt aaaagaactg cacgcgcgtt gcggcgactg gtggaagccg 2040
gcgccactgc tggaaaaact ggccgccgaa ggccgcacct tcaccgaatg gcaggccggg 2100
caatga 2106
<210> 156
<211> 1968
<212> DNA
<213> Haliangium ochraceum
<400> 156
atgatcgtcg gagtcatcgg gtcgggcgcc atcggcccag acctcgccta cggattcgcc 60
tcggccctgg ccagcgttcc cggcgccagg gtctatctac acgatatcaa gcaggaggcc 120
ctcgacgccg gtatgcagcg catccgcggc tacatcgcca agggcctggc ccgcggcaag 180
atcagcgaac gcgtcgccgg cgccctggag acggtgctcg tgcccacgct ctcgctcgcc 240
gatctcgcgc cgtgcagcta cgtgctcgag gccgccaccg aggagctcgg ggtcaagcgc 300
gccatcttgc gcagcctcga ggatacagtc gatagcgagt gcctcatcgg cttcgccacc 360
tcgggcctgc cgcgcgcgat catcgccgcc gaggtcaaac atcccgagcg ctgcttcgtc 420
aatcacccct tctaccccgc ctggcgttcg ctgcccgtcg aggtcgtgct ctcgggtagc 480
ccggcgcacg gccagcgcat gctggccacc ctcgaggccc tgggcaaagt ccccgtcatc 540
accgcggacg cgccctgctt cgcggccgac gacatctttt gcaactactg ctcggaggcc 600
gcgcgcatcg tcgaggaagg catcgccaat cccgcccagg tcgacgccat cgtccacggc 660
gccatcggcg gcggcggccc gctcaacgtc ctcgacgcca cccgcggcaa cctgctcacc 720
gtgcactgcc aggagctgat gcgcgacgcc gacaccggca cgccgtggtt cgagccgccc 780
gccatcctgc gcgagcgcgg cgacgccctg tggcacgatc ccaaggcccc gcacgacccc 840
gccttcgacg aggccctgcg cgagcgcgtg ctcgaccgca tcctggccgt gctgctcgcg 900
cgcacagtgt tcgtgctcga tcacggcatc tgcgccgcca ccgagctcga ctggatgacg 960
cgcaccgcgc tcggcttccg caccggcttg gtcgacctgg tggacgaact cggccccgag 1020
cgcgtggccg agctgtgcca gcgctacgcc gccgagcacc ccggcttcgt catcccggac 1080
agcatccgcg agcagcacaa gccgcgcttc tacggcaacc tgcgcgtcac ccgccaggac 1140
gagctggcca tcgtgcgcat cttccgcccc gaggtgaaga acgcgctcga ccgccgcacc 1200
ctgagcgagc tcgaccacct catggccgcg ctgtcggccg acgacagcgt cgagggcgtg 1260
gtcctgagca gcgccggcgg cgcgctggcc ggcgccgaca tcaccgagct agcgcgcgtg 1320
cgcaccaccg aggaggcggt gtccacctgc gctttcggac aagcggtctt gaaccgcatc 1380
gcggccatgg acaagcccgt ggtcgccgcc gtcgacggcc cggtgctggg cggcggcgcc 1440
gagctgtcga tggcgtgcca tgcgcgcgtc gtcggcccgc gcctgagcat gggccaaccc 1500
gaggtcaacc tcggcatcat ccccggctac ggcggcaccc agcggctgcc gcggctcatc 1560
ggcgtggagc gcgcgctggc catgatgcgc acggcgcaga gcatcgacgc gcagaccgcg 1620
tgcgagtggg gctgggccag cggcacgccg atggtcgact tcgtcggcgc ggccgcgacc 1680
ctcatccgca gccacctcgc cggcgaggcc gagctcgcgc cgctcgaccc cgcgcccatg 1740
agcgtacccg ccgcggccgc ccccgtggac atcggccacc gctcgcgcgt catcgacgag 1800
atcctcgtgg atgtggtcca gtccggcttg cgcgcgccgc tgagcgaggg cctggccacc 1860
gaggccgccg gcttcggccg ctgcgtgctc accgtggacc tcgacatcgg actcaagaac 1920
ttcatgcaga acggcccccg ggttccggcg ctgttcctcc acgagtag 1968
<210> 157
<211> 768
<212> DNA
<213> Anoxybacillus flavithermus
<400> 157
atgttttcta ttcaacaaga ggggtatgtg gcgattttag cacttcatcg tccaccagca 60
aacgctttag catcttctgt tttgaaagag ctttcagaac ggcttgatgc attaaaagaa 120
gacgaacaag tacgtgtcat cgttcttcac ggagaaggaa gatttttctc agctggtgcc 180
gatattaaag agtttacagc gatcgaggcg agcgaacaag cggctgaact tgctcgagct 240
ggacaacaag tgatggagaa aattgaacag tttccgaaac cgattattgc cgcgattcac 300
ggtgctgcac ttggcggagg gctcgagtta gctatgagtt gccatctgcg catcgtagcg 360
gaaaacgcca aacttggctt accagaattg cagctcggca tcattccggg atttgcagga 420
acacaacgct tattgcgtca tgtcggtatg gcaaaagcgc tagaaatgat gtggacaagc 480
gaaccgatca caggtgcaga agctgtgcag tggggactag caaacaaagc cgtcccagaa 540
gaacaattgc ttgatacagc gaagcaactt gcacaaaaaa ttgctcaaaa gagcccgatt 600
tctgttcaag cggtattgaa actagttaat gaagctcgca caaaaacgtt ccatgaatgc 660
gttgaaaaag aggctcaact gtttggacaa gtctttgtaa cagaagatgc gaaagagggc 720
atttcggcat ttatcgaaaa acggacacca cagtttcaag gaaaataa 768
<210> 158
<211> 783
<212> DNA
<213> Streptomyces avermitilis
<400> 158
atgagcacgg cgcccgaagc tgccgacttg gtgctccacg agcgtcacgg cggcgtactg 60
accatcacca tcaaccgccc cgcgcagaag aacgccgtcg accacgaggc cgcggtacag 120
ctcgcggcgg ccgtggatct gctcgacgcg gacccggagc tgtcggtcgg cgtcctcacg 180
ggcgcgggcg gggtgttcag cgcgggcatg gacctgaagg cgttcgccaa gggcgagctg 240
cccttgctgc ccagccgggg cctgggcggg ctcacccgcg cgtcggtgcg aaagccgctg 300
gtcgccgcgg tcgagggctg ggcgctcggc ggtggcttcg agctggtcct cgcctgcgac 360
ctgatcgtcg ccgcggagga cgcccgcttc gggtttcccg aggtcatgcg tggtctcgtg 420
gcggcggagg gcggactggt caggctgccg cgccgacttc cgtaccacgt cgccgcgcgc 480
gtactgctga cgggcgagcc gctgaccgcc gtcgaagcca aggagtacgg gctcgtcaat 540
gagctgaccc cgcccggcgc cgcgctggac gcggcccggg agctcgcggg ccgcgtcgcg 600
cggaacgcac cgcttgcact ggcggccgtc aaggaggtcc tgcgcgagac acagggcctg 660
aaggagagcg acgcgttcag acgccaggac gagctcacga gcggactggc cgccagcgag 720
gacgcgcggg aaggcgcaca ggcgttcgcc gagaaacgcg ccccggtctg gcacggccgc 780
tga 783
<210> 159
<211> 1683
<212> DNA
<213> Advenella kashmirensis
<400> 159
gtggacaatg gccgtaagct gattgaacgt ggctggcatt tattcaaccg tatcgaaaag 60
ctagcctttc ctacactggc actcatgcac ggcccctgcc tgggtggcgg gctggaactg 120
gcactggcgt gccgttatcg aatcgcgatc gattctccca agccggtgat cggcctgcct 180
gaagtcaaat tgggcatctt ccccgcctgg ggcggcctga tgcgactacc ccgcctgatt 240
ggtccgcaaa ccgccctgaa catgatgctg accggtcgca cactggatgg ccgcaaggcc 300
aggtctgccg gtctggtaga tttgctggtc gcaccccgag ttgcagagaa atcggcgatc 360
gatctggtca cgtcgggcaa accggcgcgt caggctcgcg gcctggccgg cttgctcaat 420
cgtgcaccgt tcaagtcgct ggtggctgcc caggcacgca aaagcgtcaa gcaaaaagac 480
ccttatggcc actaccccgc caccctgacc atgctggatc tgtgggaaaa acatgatggc 540
gacccgttgg ccgatcccca ggcgctgacc cggctgctgc aatcggatgt cacccgcaat 600
ctgatccgtg tatttcacct gcaggagcgg ctcaaggcgt ttggcaagaa ggataatgcc 660
actcccgtca accatgttca tgtgatcggg gccggcgtga tgggcggtgg catcgctgcc 720
tggtgcgcgc tgcagggcat caaaaccacc ttgcaggata ccgacgccca gcgcatcgcc 780
ggggcgttca aaaacgccgt ctccatttat gcccgcaagg atcggtatac cgcgcaggca 840
gcccgcgatc gcctgattcc ggacctggcg ggccacggta tcgcgacggc tgatctggtg 900
attgaagcga tcagcgaaaa tccgcaagcc aagcaatcgc tctaccagca gatcgaacca 960
aaaatgaaag aaggcgccat tttagccacc aatacatcca gtctgtccat tgcgcagtta 1020
cgcagcgtgc tggtgcaccc cgaacgtttt gtcggtattc attttttcaa tccagtctca 1080
cgcatgccgc tggtagaagt ggtacatgcc gatggcatcg cccaggaaac tctggacacc 1140
gctgccgcct ttgtcggcaa aatcggcaaa ctgccgctgc cggttcagga cacgccgggc 1200
tttctggtca acgccgtgct tgctccctat atgctgcaag ccatgcggtg cattgacgaa 1260
ggcatggatc ccgaagtcat cgataccgca atgctggagt tcggcatgcc catggggccg 1320
atcacgctgg ccgatacggt tggtctggat attgccatgg cagccggcaa acagctgtcc 1380
gaaggccagg agccgccacg ctgcctgcaa gagaagattg cccaaggcaa gctgggtgtc 1440
aaaagcggcg aaggctttta cgtgtggaaa gaccgcaagc atgaccagcg cagtagcaaa 1500
gccatcccgc aaggcctggc acagcgcctg atcaagccgc tgatagagca gaccgaaaaa 1560
caacttgcga acaacatcgt gcaagatgca gatcttgccg atgcaggcgt gatattcgga 1620
accgggtttg cgccttttac cggaggaccc attcattaca aacaaagtaa aggaggacta 1680
tga 1683
<210> 160
<211> 714
<212> DNA
<213> Oligotropha carboxidovorans
<400> 160
gtgagccttt cgccgcttgc caacggcgta cgcgttctca cactggatcg tccgtccaag 60
gccaacgcgt tgaatgcgga ggtcgtggac cagttgcttg cgtgtgtcgc ccaggccgag 120
gcggaggatt gccgcgtgct gatcctcgcc gccaacggca aggcgttttg cggcgggttt 180
gatttcggtg gttatgaatc gatgtcggcg ggcgacctgc tgctgcgctt tgtccggatc 240
gaggagttgc tgcagcggat gcgccagtcg tcgtttgtca gcattgctct ggtgcatggt 300
gcggcgatgg gggcgggggc ggacatcgtc gcgtcttgca cctatcgcat cggcaccgac 360
gcaagccggt ttcgctttcc gggattccgt ttcggcgtgg cgcttggcac gcggcatctg 420
gcgcagcttg tcggcccgca acgggcgcgc gatatcctgc tgaccaatgc aacgatcgat 480
gcattgaccg ctgtcgatat cggattgctg acgcacctcg tcgatgccgg gagcatgcgg 540
cagaaagcgg acgagattat tgcgcagatt ggctcgctgg accgtgtcgc acgcaaccgg 600
attttgcatc tgacctcggc tcagaacaat gacggtgaca tggctgagct ggtgaaatcg 660
gtgagcgcgc ccgggctaca cgagcgcatt gcgcagtacc gcgccgggca ttga 714
<210> 161
<211> 801
<212> DNA
<213> Riemerella anatipestifer
<400> 161
atgtacaaat taatagatgt agataaccat tttgaaggaa agcttcaaat cgcatatatc 60
aatcagccag aatcgtttaa tagtcttaat aaggttgttt tagaagagtt attgcacttt 120
ataaaagctt gtgacgcaga ttctagtgta cgctgtattg caattagtgg caaaggtaag 180
gcgttttgtt ctggtcagaa tttaaaggag gctttagatt ataaagcaga agccaatgag 240
gaacgcttta tccaaaggat tgtgatagat tattataatc cgttagtgaa ggctattgtc 300
tatgctaaaa aaccagtaat tgcattggtt aatggtcctg cggttggtgc aggagcaatg 360
ttagctctca tctgtgattt tgcagtggcg tcagagtcag cgtatttttc cttagctttt 420
tctaatatag gactagtgcc agatacggca ggtacttact atttgcctaa acttttaggg 480
cgttccttag cgagttattt ggcatttaca gggaagaagc tatctgctaa agagtcttta 540
gaaagaggtt tggtggtaga tgttttttca gatgctactt tttcggaaca atctttacaa 600
gtcctagaac atattactca tcagcctact gtggcattgg ggcttacaaa aaaagccttt 660
aataaatctt atcagaatag tctatcggag cagttagatt tggagagtat tctccagcaa 720
gatgctgcag aaacttggga ttttcaagag gggatagccg cttttttagc aaaaagaaaa 780
cctcagtata aaggtaagta a 801
<210> 162
<211> 1269
<212> DNA
<213> Fusobacterium necrophorum subsp. funduliforme Fnf 1007
<400> 162
atgtcagaaa caatcaattt agatgaaatg tcagcaaaac aattattggg ttattatcaa 60
gaaaaattgg atgaagaagc aagacaggca aaaagagaag gaaaattagt ttgttggtct 120
gcttccgttg ctccaccaga attctgtgta gctatggata ttgccatggt gtatccagaa 180
actcatgcag cagggattgg agctagaaaa gggtcgttag atctgctaga agtagcagat 240
gaaaaagggt attctttaga tatttgttct tatgcaagag taaatttggg gtatatggaa 300
ttgttaaaac aacaagcctt aactggagaa actcctgaaa aattagcaaa ctctccggct 360
gcaaaagttc ctttaccgga tttagttatt acatgtaata acatttgtaa tactttgtta 420
aaatggtacg aaaatttggc aaaggaatta aatattccat gtattgtaat tgacgttccg 480
ttcaatcata ctatgccaat tacaaaacat tcaaaagaat atattgcaga tcaatttaaa 540
tatgcaattc aacaattaga agaaattaca ggaaagaaat ttgactatga taaattctta 600
gaagtgcaag agcaaacaca aagatctgta tatcaatgga atcgtttagc agctcttgct 660
cactacaaac cttctccatt aaatggtttc gatttattta acttcatggc tttaattgta 720
tgtgctagaa gtagagatta tgcagaaatc actttcaaga aatttgcaga tgaattggaa 780
gaaaacttga aaaatgaagt atatgcgttc aaaggagctg aaaagaacag agttacttgg 840
gaaggaattg cagtatggcc ttaccttgga cacactttca agtctttaaa aggaatggga 900
agtatcatga ccggttctgc atatccagga atctggaact tgacatatac tcctggagat 960
atggaatcta tggcggaagc atatacaaga gtctacatta atacttgctt acaaaataaa 1020
gcggatgtcc tttctaaaat tgtaacagac ggaaaatgtg atggaatact atatcatttg 1080
aatagaagtt gtaaactgat gagtttcttg aatgtggaaa ctgctgaatt agttgaaaaa 1140
gcgactggag tgccatatgt aagtttcgat ggagaccaaa cagatccgag aaatttcgca 1200
ccggctcaat ttgatacaag agtacaagct ttaaatgaaa tgatggaagt taataacgaa 1260
acaaaataa 1269
<210> 163
<211> 834
<212> DNA
<213> Fusobacterium necrophorum subsp. funduliforme Fnf 1007
<400> 163
gtgcaagatg acagaagttt taagaaagga aagagaagag gaatgtatac agttggagtg 60
gatataggtt cttcttcttc aaaagtagtg atattaaagg atggaacaga gattgtaagt 120
caatcggcaa ttcagtcggg aattggaagt aatcgagcca ttgttgcttt ggaagataat 180
ttaaaaaaag caaacttgac gaaggaagat attggtttta cagttgttac tggatatgga 240
cgctttactt ttgaaggagc agataaacaa atcagcgaga ttagttgtca tgccaggggg 300
attcattttt tattaccgaa tgtgagaacc attattgata ttggtggaca agatgccaaa 360
gcgatcagct tagatgaaaa aggtcatgta agacaatttt ttatgaatga caaatgtgca 420
gcaggaacag gacgattttt aactgtaatg gcacgcgtac tagagatttc cctagatgag 480
atgggaactt atgatgctct ttctaaaaat ccttgtaata ttagtagtac ttgtgctgta 540
tttgcagaat cagaagtcat ttctcaattg gcaaagggaa ataccaaaga ggatgtcatt 600
gcaggagtac ataattctgt cgctcataag atattaggtt tagtatatcg tacttctatg 660
gaagaaaaat ttgcgatttg tggtggtgtt gctcagaata caggtgcatt gcgtgcaata 720
cgggaagctt tgaaaaaaga agtaatcgtt gctcctaatc cacaattaac aggagcatta 780
ggagctgcaa tttttgctta tgatgagctg aaaaaattaa gaaagggtga ataa 834
<210> 164
<211> 1125
<212> DNA
<213> Fusobacterium necrophorum subsp. funduliforme Fnf 1007
<400> 164
atgaaaggca gattagaaga attaattcat atatttgaag atgttgcaaa caaccccaaa 60
aaaatggtag cagaatataa aaaagaagta gggaaagaag tgattggagt catgccagta 120
tatgctccag aagaaattat tcacgctgct ggatgtttac ctattggatt atggggagga 180
aaaaaagaag tttctaaagc aagagcatat ttacctcctt ttgcatgttc tattatgcaa 240
actgttatgg aattacaaat tggaggaaca tatgacattt tagatgcagt attattctct 300
gtaccttgtg atactttgaa atgtttaagt caaaaatgga aaggaaaatc tcctgtaatt 360
gtatttactc atcctcaaaa cagagtaatt gaaggagcaa atgcttactt agtaaaggaa 420
tatcaagcag taaaagaaaa attagaagga atcttaggaa gaaccattcc tatggaagcg 480
attgaagaaa gcgtaaaagt atataatgaa aatagaagag ttatgagaga atttgtagaa 540
gtggcggcac aatatccaca aattatcgat ccaattgtta gacataatgt gatgaaatcc 600
agatggttct taagaaaaga aaaacatact gaatatgtaa aagaattaat cgctgaatta 660
aaaaaagaaa ctattgttcc ttgggacgga aagaaagtaa tcttaacagg aattatgaca 720
gaaccagtag aattgttgca aatctttaaa gatgaaaaac ttgctattgt agccgatgat 780
ttagctcatg aaagccgaca atttagagga gatgttcctg aagaaggagg agatgttcta 840
tacagaatgg caaaatggtg gcaaaattta gaaggatgtt ctttagcaac ggatactaat 900
aaaggtagag gacaaatgct aatggatatg tgtaaggata cgaaagcaga tgccgttatc 960
gtgtgtatga tgaaattctg tgatcctgaa gaatttgact atccggtata ctatagagaa 1020
tttactgaat ccggaattaa aaatattaca gtggaagtgg acttagaagt ttcttctttt 1080
gaacaaatta gaacaagaat acaaacattt aaagatattt tataa 1125
<210> 165
<211> 1269
<212> DNA
<213> Desulfosporosinus youngiae DSM 17734
<400> 165
atgacggata caacaactat gagtgccaaa gaattgttag gtttctatca ggaagaattg 60
tatgaagaag cgagacaggc caaaaaagaa ggaaaacttg tttgttggtc tgcatcggtt 120
gctccttcgg agttttgtgt ggctatggat gtggcgatga tctatcctga aacacatgct 180
gcggggattg gggcaagaaa aggtgcctta gatgtgctgg aagttgccga tgaaaaaggc 240
tataacctgg atacttgctc ctatgcaaga gtcaatatgg gttatatgga acttctgaaa 300
caagaggctt taacaggaat aacgccggaa aagcttgaaa aatccccggc ggccagaata 360
ccgctgcccg attttgtcat aacctgcaac aacatttgca acaccttgct taagtggtat 420
gagaatcttg ccgttgaatt aaatattccc tgcatcatca ttgatgttcc ctttaatcat 480
accatgccca ttccccagta tgctaaggac tatattgcgg aacagtttaa ggaggctatt 540
actcagcttg aggaaatttg cggcaggaaa ttcgactacg acaaattttt gaaagtacag 600
gaacaaaccc agcgttctgt ggcccagtgg aacagaattg ctgctttgtc gggacataaa 660
ccatctcctt taaatggttt tgatcttttc aactatatgg ccctgatcgt ttgtgccaga 720
agcagagact acgcggaaat tacctttaaa aagtttgccg atgaacttga agaaaacctc 780
aaaaacggta tctacgcctt taaaggaaat gaacaaaagc gtgtaacttg ggagggcata 840
gctgtttggc cgcatctggg ccatacattt aaaggcttaa agaatctggg caatatcatg 900
acaggttcgg cttatcccgg tttgtggaat cttacctaca cacctgggga tatgagttcc 960
atggcggaag cttataccag aatttatatc aatacttgtc tcgataacaa agttaaggtg 1020
cttagtgacg tcatcagcgg cggaaagtgt gacggggtta tttatcatca gaacagaagc 1080
tgtaagctca tgagtcttct caatgtcgaa acggctgata tactccaaaa acaaaatcat 1140
ttaccctatg tcagctttga tggggaccaa acggatcctc gtaactttgc tcctgcccag 1200
tttgatacac gtatccaggc cttagatgaa atgatgaagc agaataagga gggagtttcc 1260
aatgagtag 1269
<210> 166
<211> 1119
<212> DNA
<213> Desulfosporosinus youngiae DSM 17734
<400> 166
atgagtagaa ttgaaacgat tatcagtgaa ttaacgtcca ttgccaataa tccccgccag 60
gctatggaag attataaaaa agaaaccggc aaagggtcgg ttggggttat gccttattat 120
gctcctgaag aaatcattca tgccgcaggg tatctgcccg taggtatttg gggaggacaa 180
aagagtattt ccaaggcccg ggcctatttg cctccctttg cttgttcaat tatgcaatcc 240
gtggtggaaa tgcagcttga aggggtctat gacgatttag aagcggtcct tttccctgtt 300
ccttgtgaca ccttaaaatg tcttagccaa aaatggaaag gaacctcccc tgtcatcgtt 360
ttaactcatc ctcaaaacag aaaactggaa gcagccaata agtttcttgc tgaggaatat 420
aggcttgtgc gtgaaaaact ggaaaaaatc ctgaatgtta agattacaga cgaggcactt 480
aaccaaagca ttgaaattta taacgaaaat cgtaaagtaa tgcgtgaatt tacagagata 540
gctgctaatt atcccaacat tattgatccc gtaaaacgtc atgcgcttat caaagccaga 600
ttctttatgg aaaaagccaa acataccgct ctggtcaaag aattgaatgc agagcttaaa 660
gcgttaccgg tggaagcctt tacaggcaaa aaggttgttt tgacaggcat tatggctgaa 720
cccaatgaag tattggacat tttgcaagat aacggttttg ctgttgtggc agatgacctg 780
gcccaggaat ccagactgtt cagaaatgat gttccctcag ggacagaccc actctatcgc 840
ttggctaaat ggtggcagga attcgatggt tgttctctgg ctgtcgatgc gaaaaaacca 900
agaggcccca tgctgatgga tatggttaaa gcatctaagg ccgatgccgt tgtggtttgc 960
atgatgaagt tctgtgaccc tgaagaattt gactatccaa tctactacag acagtttgaa 1020
gaagccggaa ttaagagctt atttatagaa attgacctgg aaccaacctc ctttgaacag 1080
actaaaacca gagttcaaag ttttagagaa atgctgtga 1119
<210> 167
<211> 819
<212> DNA
<213> Desulfosporosinus youngiae DSM 17734
<400> 167
atgtttacaa tggggattga tattgggtcc tcatcctcaa aggttgtaat ccttgaagat 60
ggagttaata ttatcgccgg agaagttatt cagattggaa caggttctac gggacctaaa 120
cgtgtactgg atgaagctct tgccaaagca ggtcttacat tgcaagacat ggctaaaatt 180
attgctacag gctatggaag atcgtctgtg gaagaagcac acaaacaaat cagcgaaatc 240
agttgtcagg ctaagggagt tttcttttta gttccttcag caaaattaat tattgatatt 300
ggcggtcagg atgttaaggc cattaaactt gacagtaaag gctgtgttaa gcagtttttt 360
atgaatgata aatgtgccgc cggaacagga cgttttctcg atgttatgtc gcgggtactg 420
gaagttaatc ttgatgaaat ggcggaatac gatgcccggg caacagaacc tgccacggtc 480
agcagcactt gcacagtttt tgcagaatct gaggtaatat ctcagcttgc caacggagtt 540
gctaaagaga acattattgc aggggttcac cagtcagttg ctagcaaagc ctgtggactt 600
gcctatcgat gtggggtgga agaggacatc gtgatgtgcg gaggcgttgc taaggactta 660
ggggttgtca gagcaatcag caaagaactg aaaaaaccgg tcattgtagc tcctaatcca 720
caaattacag ctgcacttgg agctgctata tttgccttcg aagaagttat ggaaactgtt 780
atggttgcct tcgaagaagt taggggagct aataaataa 819
<210> 168
<211> 1269
<212> DNA
<213> Peptoniphilus indolicus ATCC 29427
<400> 168
atgaatacta tagatatatc aaatatgaaa gctaaagaaa tgcttggata ttttcaaaac 60
aaacttgacg aagaagcacg tgaagctaaa aaaaatggaa aattagtttg ctggtcagcc 120
tctgtagctc catctgaatt ttgtgtaacc atggatatcg cattagttta tccagaaact 180
cacgcagccg gtataggtgc tagaaaaggc tctttagcta tgttagatgt tgctgataga 240
aaaggttata atacagatat atgttcttat gccagagtaa acttaggata tatggaactt 300
ttaaaagaat atgctaagac aggagtgaaa cctaaagaac ttgaagaatc tcctgctgca 360
gatgttcctc tacctgattt agtaataact tgcaataata tatgcaacac tttactaaaa 420
tggtatgaaa atttagctgc agaattaaat attccttgta tagttataga cgttcctttt 480
aatcatacta tgcctattcc taagtattct aaagaatata ttgctgacca atttaaggaa 540
gcaataagac aacttgaaga aataacagga aaagattttg actatgataa atttttagaa 600
gttcaagagc aaacgcaaag atctgttgct caatggaata gacttgctgc actttctaaa 660
tatgaaccgt ctcctctaaa tggatttgat ttatttaact atatggctct tatagtttgt 720
gcaagaagta aaaattatgc tgaattaact tttaaaaaat ttgccgatga acttgaagaa 780
aatatgcaaa atggagtgta tccttacaag gctggagaac aatccagaat tacttgggaa 840
ggtatagcta tttggccata tttaggacac acttttaaga ctcttaaagg ctatggctca 900
ataatgacag gctctgctta tcctggactt tggaacttag aatacacacc tggagatatg 960
ctttcaatgg cagaagctta tacgagaata tatataaaca cttgccttga caataaagtt 1020
gatgtattga gaaaaatcat taaaaacggt aaatgtgatg gggtcgcata ccatctaaat 1080
agaagttgta aattgatgag tcttctaaac gttgagacag ctgaaatttt aaataaagaa 1140
aataatcttc catatgttag ttttgatggt gatcaaactg atcctagaaa tttctcagaa 1200
gcacaatatg ataacagaat acaaactctt actgagatga tgtctgccaa taaaaaaatg 1260
aggggttga 1269
<210> 169
<211> 792
<212> DNA
<213> Peptoniphilus indolicus ATCC 29427
<400> 169
atgtacacta tgggagtaga tatcggttct acatcatcta aaatcataat acttgaagat 60
ggaataaaaa ttatcggaaa tattgtagta caatctggaa ccggtacaag tgggccaaca 120
attgctactg caaaagctaa gtcctttctt tcaaataata atttaacttt agatgatata 180
tctaaaatcg ttgtcacagg ttacggcaga ttttcatttg atattgccga taaacaaata 240
agtgaaataa cttgtcatac aaaaggtatt aactttttag tgcctgaagc tcgaactatt 300
ttagatatag gtggacaaga tacaaaagct atttcagtta atgataaagg tcaagttcta 360
caatttttca tgaatgacaa atgtgccgcc ggcactggca gatttttaga agtcatggct 420
aaaattttag aaataccttt agaaaaaatg ggtgaatatg atagattatc aactaatccg 480
gtagctataa gtagtacttg taccgttttt gctgagtctg aagttatttc tcagctatca 540
aagggcatat ctaaagaaaa tatattagcc ggtgtacata attcaactgc taacaaagtt 600
tgtggtcttt tatatcgtac aggaattaag gaaaaaatag ttttatgtgg aggagttgct 660
caaaaccaag gtgttgttag agcgctccaa gaggaattaa aaaaagaaat aaccatagct 720
cctcacccac aaatgacagg cgccataggt gctgctttat ttgcttatga agaggcgaat 780
aaaaatttat ag 792
<210> 170
<211> 1119
<212> DNA
<213> Peptoniphilus indolicus ATCC 29427
<400> 170
atgaacaaaa ttaatgaaat aataaattta ttggatgaag tttctaaaga tcctaaacta 60
acagttaaaa aatataaaga aaaaacagga aaaggtgttg taggtgtcat gccattatat 120
gcacctgaag aaattattca tgctgcaggt tttctaccta tgggactttg gggtgcacaa 180
aaagaagtat ctaaagcaag aatttattta cctccttttg catgttcaat aatgcaaact 240
aatatggaac ttcaaataga aggtgcctat gatgacttag atgcagttgt attttctgta 300
ccgtgcgata ctctaaaatg tatgagtcaa aaatggaagg gtaaaagtcc tgttatagta 360
tttactcatc ctcaaaacag aaaattagaa tctgcaaata aatttttggt tacagaatat 420
gaaatcttaa aagataaatt agaaaagata ttaaatgtaa aaatatctga tgaatccata 480
acaaatagta ttgaaattta caatgaaaat agaaaagtca tgagagaatt ttcagaccta 540
gctggtcaat atcctaatat aattgaccct attcaaagac atattgtatt taagtccaga 600
tggtttatgg aaaaatcaga acatactaaa ttagttaaag aactaatatc tgaaattaaa 660
aaattaccta ttgaagaatg ggatggctat aaagttatag caactggtat tatgatagaa 720
cctgaagaaa tacttcaaat atttaaagat aagaaaatag ctattgttgc agatgattta 780
gctcaagaat caagacaatt tagacatgac gtacctgaag gagatcaacc tcttttaaga 840
cttgctaagt ggtggcaaaa tttagaagga tgtgctcttg caactgatac aaaaaaatta 900
agaggccaaa tgctaattga tatggcgaaa aaatataatg ccgatgctgt attgatatgt 960
atgatgaaat tctgcgatcc tgaagaattt gactaccctg tatactatag agagttccaa 1020
gaagctggca taaagaattt actaattgaa attgacttag aaatgacagc ttttgaacaa 1080
actaacacaa gacttcaaac tcttgtagaa actctctaa 1119
<210> 171
<211> 1269
<212> DNA
<213> Desulfosporosinus meridiei (strain ATCC BAA-275/DSM 3257/ NCIMB 13706/S10)
<400> 171
atgactgata caacagctat gagcgccaaa gaattgttag gtttctatca ggaagaattg 60
tatgaagaag cgagacgggc aaaaaaagaa ggaaaacttg tttgttggtc tgcatccgtt 120
gctccttcgg agttttgtgt ggctatggat gtagctatga tatatcctga aacccatgct 180
gcgggtattg gggccagaaa aggtgcctta gatgtgcttg aagttgcgga tgaaaaaggc 240
tataacgtgg atacttgctc ctatgcaaga gtaaatcttg gttatatgga acttttaaaa 300
caggaggctt taacaggaat aacaccggaa aagcttgaaa aatccccagc ggccagaata 360
ccccttcccg attttgtcat aacctgtaac aacatttgta acaccttgct taagtggtat 420
gagaatcttg ccgttgaatt aaatattcct tgcatcatca ttgatgttcc ctttaatcat 480
acaatgccca ttccacagta tgccaaggat tatattgcgg aacagtttaa ggaagctatt 540
actcagcttg aggaaatttg cggcaagaaa ttcgactatg acaaattttt aaaagtacag 600
gaacaaaccc aacgttctgt tgcccaatgg aatagaatcg ctgctttgtc atcacataaa 660
ccatcccctt taaatggttt tgatcttttc aactatatgg ccctgatcgt ttgtgcaagg 720
agtaaagact acgcagaaat tacctttaaa aagtttgctg atgaacttga agaaaatctt 780
aataagggta tcttcgcctt taaaggaaat gaacaaaagc gggtaacttg ggaaggcata 840
gctgtttggc cgcacctggg acatacattt aaaggcttaa agaatcttgg caatataatg 900
acaggttcag cctatccggg tctgtggaat gttagttata caccaggtga tatgagttca 960
atggcggaag cttatactag aatttatatc aatacttgtc ttgataataa agttaaggtt 1020
cttagtgacg taattagtgg cggaaagtgt gacggtgtta tttatcatca gaacagaagc 1080
tgtaagctca tgagttttct gaatgtagaa actgctgata tcctccaaaa agaaaatggt 1140
ttaccctatg taagctttga tggagaccaa actgatcctc gtaacttttc tcctgcccag 1200
tttgacacac gtatccaggc cttagatgaa atgatgaagc agaataagga gggagtttcc 1260
aatgagtag 1269
<210> 172
<211> 1119
<212> DNA
<213> Desulfosporosinus meridiei (strain ATCC BAA-275/DSM 13257/NCIMB13706/S10)
<400> 172
atgagtagaa ttgaaactat tattagtgaa ttatcttcaa tttcaaataa tccccgcaag 60
gctatggaag attataaaaa agaaaccggt aaagggtcgg taggggttat gccttattat 120
gcccctgaag aaataattca tgctgctggt tttcttcccg taggtatttg gggaggacaa 180
aagagtattt caaaagcccg tgcctattta cctccctttg cttgttcaat tatgcaatca 240
gttatggaaa tgcagcttga aggggtatat gacgatttag aagcagtact tttccccgtt 300
ccttgtgaca ctttaaaatg tctcagccaa aaatggaaag gaacatcacc tgtcatcgta 360
tttactcatc ctcaaaacag aaaactcgaa gcagccaata agtttcttgc tgaggaatat 420
cgacttgttc gtgaaaagct ggaaacaata ttgaatgtaa agattactga tgaagcactc 480
aaccaaagta ttgaaactta taacgaaaat cgtaaagtaa tgcgtgaatt tacggaccta 540
gctgctaatt atcctcagat tattgatccc agaatacgtc atgcaattat aaaagctaga 600
ttttttatgg aaaaatctaa acataccgct atggtaaaag aattgaattc agagcttaaa 660
tcgttacctg ttgaagcctt tacaggtaaa aaggttgttt taacaggaat tatggctgaa 720
cccaatgaag tattagacat tttaaaagat aacggttttg ctgttgtggc agacgacctg 780
gcccaggaat ccagactgtt cagaaatgat gttccgtcag gtacagaccc actatatcga 840
ttggctaaat ggtggcaaga attcgatggt tgttctcttg ctacagatgc gaaaaaatca 900
agaggcccca tgctgatgga gatggttaaa gggtctaagg ccgatgcagt tgtggtttgc 960
atgatgaagt tctgtgaccc tgaagaattt gactatccaa tctactatag acagtttgaa 1020
gaagctggaa ttaagagcct atttatagaa attgacctgg aaacaacatc ctttgaacag 1080
actaaaacca gagttcaaag ttttagtgaa atgctgtga 1119
<210> 173
<211> 786
<212> DNA
<213> Desulfosporosinus meridiei (strain ATCC BAA-275/DSM 13257/NCIMB 13706/S10)
<400> 173
atgtttacaa tggggattga tattgggtcc tcatcctcaa aggttgtaat acttgaagat 60
ggagttaata ttatcgctgg agaagtcatt cagattggaa caggttcgac aggacctaaa 120
cgtgtactga atgaagctct ttccaaagca ggtcttaaat tggaagacat ggctaaaatt 180
attgctacag gctacggaag atcttctgtg gaagaagcac acaaacaaat tagcgaaatc 240
agttgtcagg ctaagggagt tttcttttta gttccttcag caaaattaat tattgatatc 300
ggcggtcaag atgttaaggc aattagactt gacagtaaag gcggcgttaa gcagtttttt 360
atgaatgata aatgtgccgc cggaacagga cgttttctcg atgttatgtc acgagtactt 420
gaagttaatc ttgatgaaat ggcagaatac gatgctcgtg caacagaacc tgccacggtc 480
agcagcactt gcacagtttt tgcagaatct gaggtaatat ctcagctttc caacggagtt 540
gctaaagaga atattattgc aggggttcac cagtcagttg ctagcaaagc ctgtggactt 600
gcctatagat gtggggtgga agaggacatt gttatgtgcg gaggtgttgc taaggactta 660
ggggttgtcc gggcaataag caaagaacta aaaaaacctg tcattgtagc tcctaatcca 720
caaattacag ctgcccttgg agctgctatc tttgccttcg aagaagtcag gggagctaat 780
aaataa 786
<210> 174
<211> 1434
<212> DNA
<213> Acidaminococcus fermentans
<400> 174
atgccaaaga cagtaagccc tggcgttcag gcattgagag atgtagttga aaaggtttac 60
agagaactgc gggaaccgaa agaaagagga gaaaaagtag gctggtcctc ttccaagttc 120
ccctgcgaac tggctgaatc ttttcggctg catgttgggt atccggaaaa ccaggctgct 180
ggtatcgctg ccaaccgtga cggcgaagtg atgtgccagg ctgcagaaga tatcggttat 240
gacaacgata tctgcggcta tgcccgtatt tccctggctt atgctgccgg gttccggggt 300
gccaacaaaa tggacaaaga tggcaactat gtcatcaacc cccacagcgg caaacagatg 360
aaagatgcca atggcaaaaa ggtattcgac gcagatggca aacccgtaat cgatcccaag 420
accctgaaac cctttgccac caccgacaac atctatgaaa tcgctgctct gccggaaggg 480
gaagaaaaga cccgccgcca gaatgccctg cacaaatatc gtcagatgac catgcccatg 540
ccggacttcg tgctgtgctg caacaacatc tgcaactgca tgaccaaatg gtatgaagac 600
attgcccgtc ggcacaacat tcctttgatc atgatcgacg ttccttacaa cgaattcgac 660
catgtcaacg aagccaacgt gaaatacatc cggtcccagc tggatacggc catccgtcaa 720
atggaagaaa tcaccggcaa gaagttcgat gaagacaaat tcgaacagtg ctgccagaac 780
gccaaccgta ctgccaaagc atggctgaag gtttgcgact acctgcagta caaaccggct 840
ccgttcaacg ggttcgacct gttcaaccat atggctgacg tggttaccgc ccgtggccgt 900
gtggaagctg ctgaagcttt cgaactgctg gccaaggaac tggaacagca tgtgaaggaa 960
ggcaccacca ccgctccctt caaagaacag catcgtatca tgttcgaagg gatcccctgc 1020
tggccgaaac tgccgaacct gttcaaaccg ctgaaagcca acggcctgaa catcaccggc 1080
gttgtatatg ctcctgcttt cgggttcgtg tacaacaacc tggacgaatt ggtcaaagcc 1140
tactgcaaag ccccgaactc cgtcagcatc gaacagggtg ttgcctggcg tgaaggcctg 1200
atccgcgaca acaaggttga cggcgtactg gttcactaca accggtcctg caaaccctgg 1260
agcggctaca tgcctgaaat gcagcgtcgt ttcaccaaag acatgggtat ccccactgct 1320
ggattcgacg gtgaccaggc tgacccgaga aacttcaacg cggctcagta tgagacccgt 1380
gttcagggct tggtcgaagc catggaagca aatgatgaaa agaaggggaa ataa 1434
<210> 175
<211> 1140
<212> DNA
<213> Acidaminococcus fermentans
<400> 175
atggctatca gtgcacttat tgaagagttc caaaaagtat ctgccagccc gaagaccatg 60
ctggccaaat ataaagccca gggcaaaaaa gccatcggct gcctgccgta ctatgttccg 120
gaagaactgg tctatgctgc aggcatggtt cccatgggtg tatggggctg caatggcaaa 180
caggaagtcc gttccaagga atactgtgct tccttctact gcaccattgc ccagcagtct 240
ctggaaatgc tgctggacgg gaccctggat gggttggacg ggatcatcac tccggtactg 300
tgtgataccc tgcgtcccat gagccagaac ttcaaagtgg ccatgaaaga caagatgccg 360
gttattttcc tggctcatcc ccaggtccgt cagaatgccg ccggcaagca gttcacctat 420
gatgcctaca gcgaagtgaa aggccatctg gaagaaatct gcggccatga aatcaccaat 480
gatgccatcc tggatgccat caaagtgtac aacaagagcc gtgctgcccg ccgcgaattc 540
tgcaaactgg ccaacgaaca tcctgatctg atcccggctt ccgtacgggc caccgtactg 600
cgtgccgctt acttcatgct gaaggatgaa tacaccgaaa agctggaaga actgaacaag 660
gaactggcag ctgctcctgc cggcaagttc gacggccaca aagtggttgt ttccggcatc 720
atctacaaca cgcccggcat cctgaaagcc atggatgaca acaaactggc cattgctgct 780
gatgactgcg cttatgaaag ccgcagcttt gccgtggatg ctccggaaga tctggacaac 840
ggactgcatg ctctggctgt acagttctcc aaacagaaga acgatgttct gctgtacgat 900
cctgaatttg ccaagaatac ccgttctgaa cacgttggca atctggtaaa agaaagcggc 960
gcagaaggac tgatcgtgtt catgatgcag ttctgcgatc cggaagaaat ggaatatcct 1020
gatctgaaga aggctctgga tgcccaccac attcctcatg tgaagattgg tgtggaccag 1080
atgacccggg actttggtca ggcccagacc gctctggaag ctttcgcaga aagcctgtaa 1140
1140
<210> 176
<211> 783
<212> DNA
<213> Acidaminococcus fermentans
<400> 176
atgagtatct ataccttggg aatcgatgtt ggatctactg catccaagtg cattatcctg 60
aaagatggaa aagaaatcgt ggcgaaatcc ctggtagccg tggggaccgg aacttccggt 120
cccgcacggt ctatttcgga agtcctggaa aatgcccaca tgaaaaaaga agacatggcc 180
tttaccctgg ctaccggcta cggacgcaat tcgctggaag gcattgccga caagcagatg 240
agcgaactga gctgccatgc catgggcgcc agctttatct ggcccaacgt ccataccgtc 300
atcgatatcg gcgggcagga tgtgaaggtc atccatgtgg aaaacgggac catgaccaat 360
ttccagatga atgataaatg cgctgccggg actggccgtt tcctggatgt tatggccaat 420
atcctggaag tgaaggtttc cgacctggct gagctgggag ccaaatccac caaacgggtg 480
gctatcagct ccacctgtac tgtgtttgca gaaagtgaag tcatcagcca gctgtccaaa 540
ggaaccgaca agatcgacat cattgccggg atccatcgtt ctgtagccag ccgggtcatt 600
ggtcttgcca atcgggtggg gattgtgaaa gacgtggtca tgaccggcgg tgtagcccag 660
aactatggcg tgagaggagc cctggaagaa ggccttggcg tggaaatcaa gacgtctccc 720
ctggctcagt acaacggtgc cctgggtgcc gctctgtatg cgtataaaaa agcagccaaa 780
taa 783
<210> 177
<211> 1011
<212> DNA
<213> Carboxydothermus hydrogenoformans
<400> 177
atgaaattaa actatttttg cagttactgg ccggtggaaa tatccgaagg agcggggatt 60
tctacggtcc gttatttccc gtccgatgaa agcaaagctc cggtaaggct tcctgcttac 120
tgctgttctt atgccagggg aagccttgcc gaaattgaag aagaaggaga cggtgacttt 180
tggggatttg cccacagttg cgacacgatg cagagtttat acggcattac taagagttta 240
ctgggagacg accgggtttt tcttttcgtt ccgccggttg acttaaccac cgcttttgcc 300
cgggaatact accgggaagc tttaatttat ctctggcggg aactttccca aaaaagcggg 360
gttaatggtg aggaaaagtt aaagcttacc tgggaaaagt tgaaggagtt aagaaataag 420
gttaaatctt tggaaaactt gacgtcaatt attccttcct ccgaaatttt tgagctttta 480
aaaaagcttc agaccctgcc gctggatgag gctttggatt acctcgaggc caaaaaagcg 540
gaatttacca gtttatctgt ggctcaaaag gctataggga ttattttaac gggagcggta 600
gtcactaaca gtaaacttta ccttgcttta gaacaacagg gatttagagt agtttatgat 660
gatacctgta ccggctttcg tcattttgct ggagagatag aggataaaga cgatattttg 720
gaggcaatag tttcttacta cctttcaaag cccccctgtc cctgcaggca taagggagta 780
tgggcgaggg cggagtattt aaaaaatctt tatcataaca aaaatgcccg ggccattgta 840
cttttacaaa ataaattttg tgaccccttt gcctgggatg ttccctattt agtggactac 900
tttaaaaaac agggagttcc ggttttagtt ttagaggtgg aaggcggaga aatcggcgag 960
caaaataaaa ctcgcctcca ggccttccgg gaaagcgtgg gtggagtgta a 1011
<210> 178
<211> 1215
<212> DNA
<213> Carboxydothermus hydrogenoformans
<400> 178
atggctaaaa aaatctttaa gcctcttaag gcttcagaga aaataaataa aattttaaaa 60
aatcattatt taaaagcaaa gtatttgcca acgcttggaa aattttttgg ttataaaacc 120
gcctggatta ccagcggagc tccggtggaa ctactgcggg cctttggtat agagccggtt 180
tatccggaga attacggtgc catttgcggt gcccgcaagg tttcgccgag tctttgccag 240
gtagcggaaa acaggggtta ttctctcgat ttgtgttctt atgccaagag taatctcgga 300
agtatctgga atccgaaaga aagtccattt aacggcttac cccggccgga tttactggtg 360
gtttgcaaca acatttgcgg gacggtttta aagtggtacg aaactttaag ccgggaattt 420
aatattcccc tttttatcat tgatacccct tttatcaccg gtgaacccca accctggcaa 480
atccagtatg tggccaaaca gatagaaaaa ctggcgattg aactggaaaa atttttccgg 540
aaaaagttgg atttaaaccg tttggaaaaa gtaattctcc ttgccaatga gacggtggat 600
ttatggaagg ggataagaaa ttttgccaaa aataaacctt cgccggtaaa cgttaccgat 660
ttatttatta atctggggcc aatggtggtt ttaaggggta ccgaagttgc ccgggatttt 720
tacgaggaag tttaccggga agtggaagaa aggtacaaag ccggggttcc ggcggtagag 780
ggagaaaaat accgtttagt ctgggacaac attcccatct ggtacggact gtaccgtttt 840
tacggttatt ttgccgaaag gggagcggtt tttgttaccg attcctatac cggtggctgg 900
gcggtcaaca taaaaaaggg tcctcccttt tatgcattag ccgagaccta tgccggcgtc 960
tttttaaatc gggatttaga atttcgcaaa aatcagttgc aatctttcat tgaggaattt 1020
tctgccgatg gctttgtcat gcactccaat cgttcgtgca aagcttattc ttttgtgcag 1080
gaggaaatcc ggcgccaaat catgaggtca ctaggagtgc cggggttaat agtggatgcc 1140
gatatgaccg acagccggct ttattccgaa gaaacggttt taaaccgggt ccaggctttc 1200
ctggagagcc tgtag 1215
<210> 179
<211> 765
<212> DNA
<213> Carboxydothermus hydrogenoformans
<400> 179
ttgtatcttg gagttgatat tggttcgctt acgaccaagg ttgtcttaat tgaccgggga 60
aaaaatctta ttgcttatcg ttacagtaaa accggacctg ccggaaagga aacggccgag 120
cggttaattc aagaggtttt gataaaagcg aatatttccc gggacgatat tcagggaata 180
gttgctaccg gttacggcag ggttctcttt tccggaaagg agttttcgga gataacctgt 240
caggcccggg ggattgggca tttatacccg gaggcaaaaa cgattatcga tattggtggc 300
caggatagca aagtaatttc tctgggaaaa aacggaaagg tactggactt tgccatgaac 360
gataaatgtg ctgctggcac cggacgtttt ttggaggtga tgagtcaggc ccttgaagtt 420
cgtctggaag agatagggga acttgccgaa aagagccagg aggcagctaa gatatcttcg 480
gtttgtaccg tttttgccga atcggaagtg atatccaatt tatcccgggg gcagagccgg 540
gaagcggtag cacggggaat ttgtgaggcg gtggcggccc gaacggctat actggcgcaa 600
aaagtggggg tggtagaacc ggtggttttt accggagggg tggccaaaaa tactggagtt 660
gtggcggctt tggagcgaaa gcttggggtt aagttattaa ttccggaaga ttccacgatt 720
accgcagctc tgggggcggc tttattagcc gctgaaaatt cttaa 765
<210> 180
<211> 786
<212> DNA
<213> Oscillibacter valericigenes
<400> 180
atgaacaata tttacacgat gggcatcgac gtggggtcca ccgcatccaa gtgcctcatc 60
ctgaaagacg gcagcgaaat cgttgccaag tctctggtag atgtgggcgc gggtaccagc 120
ggccctaccc gtgctattgc ggaggtactg gaagccgcgg ggatgaagaa ggaggacatg 180
gcttttattc tggctaccgg ctatggccgc aattcactgg acgacattgc cgaccaccag 240
atgagcgagc tgagctgcca tgccaaaggc gcgtttttcc tgtttccgga tgtccacacc 300
gtcatcgaca tcggcgggca ggatgtgaag attcttgaga ttgagaacgg cgttatggtg 360
aattttgcca tgaatgacaa gtgcgccgcc gggacgggcc ggttcctgga cgtgatggcc 420
cgggtgctgg aggtgaaggt ggaggatctg gcggacctgg gagcccagtc caccaagaat 480
gtggagatca gctccacatg caccgtgttc gctgagagcg aggtcatcag ccagctggcc 540
aagggcagcg acaagcgcga catcatccac ggcatccaca agtctgtggc atcccgggtg 600
gttggccttg ccaaccgtat cggtgtgcgg gacgcggtgg tgatgaccgg cggcgtcgcc 660
cagaacggcg gcgtggtctc cgcgcttcag gaggcgttgg gccatcccat tcacacttcg 720
cctctgacgc agtacaacgg cgcgctgggc gcggcgttgt ttgcatggca gaaggcaacc 780
aaataa 786
<210> 181
<211> 1284
<212> DNA
<213> Oscillibacter valericigenes
<400> 181
atggccgaaa acgaaaaagc cactgcggcc gctcccgagg cggctcctgt taagaaagct 60
ccgaagccgg tcagccccgg tacgcaggcg ctgcgcgacg ttgtcaccaa ggtgtacgcc 120
gccgcgtggg atgcgaaaaa ggcgggccgc cccgtgggct ggtcgtcttc caagttcccc 180
tgcgagatcg ccgaggcgct gggccttgca gtcgtatatc cggaaaacca ggctgccggt 240
atcggcgccc agcacgatgg ccagcggatg tgtgaatctg ccgagtcctt gggcttcgac 300
ccagatatct gcggatacgc ccggatttcc ctggcttatt ccgcgggcgt tgagacgacc 360
aatgagtccc gccgggttcc catgccggac ttcgtgctgt gctgcaacaa tatttgtaac 420
tgcatgacca agtggtatga gaatattgcc cggatgcaca acattcccct gattatgatc 480
gacgtgccct ataacaacga ggtcaccgtc agcgattccc aggtggctta cattcgcggc 540
cagttcgatg acgccattaa gcagatggag aagattgccg gcgtgaagtt cgacgaaaag 600
aagtttgaac aggcctgcgc caatgccaac cgcactgcca aggcgtggct gacggtctgt 660
gactatttgc agtataagcc cgctcccatg agcggcttcg atctgtttaa ccatatggct 720
gatgtggtga ctgcccgcgg caaggtggag actgccgagg cgttcgagct gctggcaagc 780
gagctggaac agcacgtaaa aaacggaacc agcaccgctc cgttccccga gcagtaccgc 840
gtcatgttcg agggcattcc ctgctggccc aacctaagga cgcttttcaa gcccctgaaa 900
gccaacggcg tcaacgtcac cgccgtggtg tacgcgcccg cgttcggttt tgtgtataac 960
gggctggacg agatggcccg cgcatactgc aaggccccca acagcgtgtg cattgagcag 1020
ggcgtggact ggcgcgaggg catctgtcgc gagaacaagg tagacggcgt gctggtgcac 1080
tataaccgat cctgcaagcc ctggtccggc tacatggccg agatgcagcg ccgtttcacc 1140
aaggatctgg gcgtcccctg cgccgggttc gacggagatc aggccgatcc ccgcaacttc 1200
aacgaggctc agtatgagac ccgtgtccag ggcctggtag aggctatgga ggagaataaa 1260
aagcagaagg aggcccgggc atga 1284
<210> 182
<211> 1143
<212> DNA
<213> Oscillibacter valericigenes
<400> 182
atgagtatcg aaacgattgt aaaggagttt gccgacgttg cggccgaccc gaaagcacag 60
ctgaagaaat acaaggcgga gggcaaaaaa tgcattggtg tgatgccgta ttacgcgccc 120
gaggagctgg tggccgccgc cggtatggtg ccgtttggta tgtggggcag caatgacaag 180
accatttctc gcgccaagga atactgcgct acattttact gcaccatcgc ccagctggat 240
cttgagatgc tgctggacgg caccatggat cttttagacg gagtcatcac ccccaccatc 300
tgcgacacgc tccgtcccat gagccagaac atccgcgtgg ccatgggcga gaagctcccc 360
tgcattttcc tggcccatcc ccagaaccgc aagcccgctt acggcaagaa gttctgcctg 420
gaccaatata cccacatcaa gactgagctt gagaagatcg ccggcgcgcc catcaccgac 480
gccgcactgt ccgagaccat caaggtctat aataagagcc gcgccgcccg ccgtgagttc 540
gtgaagctgg tcagcgacca ctgcgatgtt atcaccccca ccaaacgcag cgctgttttg 600
aaagccgcgt ggtttatgcc caaggcggag tacaccgaga agctgaaggc cctcaacgca 660
gagctgaagg ctctgcctgt gtgcgactgg aaggggacca aggtggtcac ctccggcatc 720
atatgcgaca accctaagct tctggagatc ttcgaggaga acaaaatcgc catcgccgcc 780
gacgacgtgg ctcatgagtc ccgctccttc cgcgtagacg ctcccgagac cggcgatccc 840
atggaggcac tcgcccagca gtttgccaat caggattacg atgttctgct gtacgatgag 900
cattccagcg agaaccgccg gggcgagttt gtggccaagc tggtgaagga cagcggcgcc 960
aaggggctgg tcctgtttat gcagcagttc tgcgacccgg aggagatgga gtatccctcc 1020
ctcaaaaagg cgctggacga agccaagatc ccccacatca agctgggtgt ggatcaacag 1080
atgcgggact tcggtcaggc tcgcaccgcg attcaggcgt ttgccgatgt gatctccctc 1140
taa 1143
<210> 183
<211> 1269
<212> DNA
<213> Desulfosporosinus orientis (strain ATCC 19365 / DSM 765 / NCIMB 8382 / VKM B-1628)
<400> 183
atgactgata cagccaatat gagtgctaaa gaattgttag gtttctatca ggaagaattg 60
tatgaagaag cgagacaggc caaaaaagaa ggaaaacttg tttgctggtc ggcttccgtt 120
gctccttcgg agttttgtgt agctatggac gtggccatga tctatcctga aacccatgct 180
gcagggatcg gggccagaaa aggcgcctta gatatgcttg aagttgccga tgaaaaaggg 240
tataacctgg acacttgctc ctatgccaga gtgaatctgg gttatatgga acttttaaaa 300
caagaggctt taaccggaat aaccccggag aaactggaaa aatctccggc ggccagagta 360
cccctgcctg attttgtcat aacctgcaac aacatttgta acaccttgct taagtggtat 420
gaaaatcttg ccgttgagct aaatattccc tgcatcgtca ttgatgttcc ctttaatcac 480
accatgccca ttccccagta tgctaaagac tatattgcgg aacagtttaa ggaggcaatt 540
gctcagcttg aagagatttg cggcaagaaa ttcgactatg acaaattctt gcaagtccag 600
gaacaaaccc agcgctctgt ggcccaatgg aaccggattg cttctttgtc agggcataaa 660
ccatccccct taaatggttt tgatcttttc aactatatgg ccctgatcgt ttgtgcccgc 720
agcagggact gcgcagaaat tacctttaaa aagtttgccg atgaactgga agacaatcta 780
agcaaaggaa tctacgcctt taaaggcaat gaacaaaagc gtatcacttg ggaaggcatc 840
gctgtttggc cgcacctggg ccataccttt aaaggcttaa agaatcttgg caatatcatg 900
accggttcag cctatcccgg tttgtggaat ctttcttata cgcccggtga tatgagttcc 960
atggcagaag cttacaccag aatttatatc aatacttgtc tggataacaa agttaaggtt 1020
cttagtgaca tcatcagcgg cggaaagtgt gacggtgtta tttatcatca gaacagaagc 1080
tgtaagctca tgagttttct caatgtcgaa acggccgata tcctccaaca acaaaatcat 1140
ttaccctatg tcagctttga tggagaccaa accgatcccc gtaactttgc tcctgcccag 1200
tttgatacac ggatccaagc cttagatgaa atgatgaagc agaataagga gggagtttcc 1260
catgagtag 1269
<210> 184
<211> 1119
<212> DNA
<213> Desulfosporosinus orientis (strain ATCC 19365 / DSM 765 / NCIMB 8382 / VKM B-1628)
<400> 184
atgagtagaa ttgaagcgat tatcagtgaa ttatcttcta ttgccaataa tccccgtaag 60
gccatggaag attataagaa agaaacgggc aaagggtcgg tagggattat gccttattat 120
gctccggaag aaatcgttca tgccgccggt tacctgcccg taggaatttg gggagggcaa 180
aagagtattt ctaaagcccg tgcttattta cctccttttg cttgttcaat catgcaatcc 240
gttgtggaaa tgcagctgga aggggtctat aacgacttag cggcggtcct tttccccgtt 300
ccttgtgaca ctttaaaatg tctcagccaa aaatggaaag gcacatcccc ggtcatcgtc 360
atgactcatc ctcaaaaccg aaaactcgaa gcagccaata agtttctggc tgaggaatat 420
cgccttgttc gtgaaaagct ggaaaaaatc ttaaatgttc agattaccga tgaggcactg 480
aaccacagca ttgatgttta taacgaaaat cgcaaggcaa tgcgtgaatt tacggacata 540
gccgctaatt atttgaacat tattgatccc agaaagcgtc atgagattat caaggccaga 600
ttctttatgg aaaaatccaa acataccgcc ttggtcaaag aattgaattc cgagcttaaa 660
tctttacctg tggaagattt tacaggcaaa aaggtgattt taaccggaat catggctgaa 720
cccaatgaag tattagacat tttgaaagag aatgattttg ctgttgtggc agatgacctg 780
gcccaggaat ccagactgtt caggattgat gttccggctg gtccagaccc actctaccgc 840
ttggctaaat ggtggcaaga attcgacggt tgttctctgg ctgtagatac gaaaaaatta 900
agaggaccca tgctgatgaa tatggttaac gtggataagg ccgatgccgt ggtggtttgc 960
atgatgaagt tctgtgaccc tgaagaattt gactatccca tctactacag acagtttgaa 1020
gaagccggaa ttaagagctt atttatagaa attgacctgg agccaacctc ctttgaacag 1080
actaaaacca gagttcaaag ttttcgtgaa atgctgtga 1119
<210> 185
<211> 801
<212> DNA
<213> Desulfosporosinus orientis (strain ATCC 19365 / DSM 765 / NCIMB 8382 / VKM B-1628)
<400> 185
atgtatacta tggggattga tatcggttcc tcatcctcaa aggttgtcat acttgaagat 60
ggagttaacc tcatcgccgg cgaagtcatt cagattggaa caggctcgac aggtcctaaa 120
cgggtactgg aggaagctct tgccaaaaca ggtctcacct tggcagacat ggctaaaatt 180
attgctaccg gctacggccg atcttctgtg gaagtatccg acaagcaaat cagcgaaatc 240
agctgtcagg ctaagggagt ttacttttta gttcctacag caaaattaat cattgatatc 300
ggcggtcagg atgtgaaggc cattagactt gaccgtatag gcggcgtcag gcagtttttt 360
atgaatgata aatgtgccgc cggaacagga cgttttctcg atgtgatgtc acgagtactg 420
gaagtggatc tggatgaaat ggcagaatac gatgcccggg ccacagaacc cgccacggtc 480
agcagcacct gcacagtgtt tgccgaatcc gaggtaatat ctcagcttgc caacggagtt 540
gctaaagaga atattattgc cggggttcac cagtccgttg ccagcaaagc ctgtggactc 600
gcctatcgat gcggggtgga agaggacgtt gtgatgtgcg gaggagttgc taaggactta 660
ggagttgtcc gggccatcag caaagaacta aaaaaaccgg tcattgtagc tcctaatccc 720
caaattacag ccgcccttgg cgctgcccta tttgcttatg aagaagttat ggaagctaat 780
aaattaagga aagaggtatg a 801
<210> 186
<211> 1236
<212> DNA
<213> Peptostreptococcus anaerobius CAG:621
<400> 186
atgagtaaca caggtgcagt tgaagaaaag ccggcaaaag tattgttagg cgagatagtt 60
gcaaaacatt ataaggaagc ttgggaagct aaagaaagag gcgaaaaagt tggttggtgt 120
gcttctaact tcccacagga aatatttgaa acaatggata tcaaggttgt attccctgaa 180
aaccaggcag cagcaatttc tgctaagggt ggtggacaga ggatgtgcga aatcgcagaa 240
aacgaaggat attcaaacga catatgtgct tacgctagaa tatctctagc atacatggac 300
gttaaagatg ctccagagtt aaatatgcct cagccagact ttgttgcatg ctgtaacaat 360
atctgtaact gtatgatcaa gtggtatgaa aatatagcta aagaactaaa tatacctcta 420
atccttgttg acgtgccata taacaatgac tatgaagcag gcgatgacag agtagaatac 480
ttaagaggac agttcgatca cgctataaag cagttagaag acttaactgg taaaaagtgg 540
gatgaaaaga agttcgaaga agtaatggca atatctcaga gaacaggtag agcttggtta 600
aaggctactg gatatgctaa gtacactcca tcaccattct caggatttga cgtattcaac 660
catatggcag ttgctgtatg tgctagaggt aaggaagaat cagcaatagc atttgaaaag 720
ctagctgaag aatttgatga aaatgtaaag actggtaagt ctacattcaa gggagaagaa 780
aagtacagag tactatttga aggtatagct tgttggccac acctaagaca tacatttaag 840
cagctaaagg attcaggagt aaacgtttgt ggtactgttt atgcagatgc attcggatac 900
atctacgaca atacttatga attaatgcag gcttattgtg gaactcctaa tgcaatatct 960
tatgaaagat cattagatat gagacttaag gttatagaag aaaataatat agacggtatg 1020
ttgatacata taaacagaag ctgtaagcag tggtctggta tcatgtacga aatggaaaga 1080
gaaataagag aaagaactgg tataccaaca gctacattcg atggtgatca ggctgaccca 1140
agaaacttct cagaagcaca gtacgacaca agagtacagg gtctaataga agttatggaa 1200
gcaaacaaag ctgcaaagat gaaggaggaa aactag 1236
<210> 187
<211> 1119
<212> DNA
<213> Megasphaera elsdenii DSM 20460
<400> 187
atgagtcaga tcgacgaact tatcagcaaa ttacaggaag tatccaacca tccccagaag 60
acggttttga attataaaaa acagggtaaa ggcctcgtag gcatgatgcc ctactacgct 120
ccggaagaaa tcgtatatgc tgcaggctac ctcccggtag gcatgttcgg ttcccagaac 180
ccgcagatct ccgcagctcg tacgtacctt cctccgttcg cttgctcctt gatgcaggct 240
gacatggaac tccagctcaa cggcacctat gactgcctcg acgctgttat cttctccgtt 300
ccttgcgaca ctctccgctg catgagccag aaatggcacg gcaaagctcc ggtcatcgtc 360
ttcacacagc cgcagaaccg taagatccgc ccggctgtcg atttcctcaa agctgaatac 420
gaacatgtcc gtacggaatt ggaacgtatc ctcaacgtaa aaatctccga cctggctatc 480
caggaagcta tcaaagtata taacgaaaac cgtcaggtta tgcgtgaatt ctgcgacgta 540
gctgctcagt acccgcagat cttcactccg gtaaaacgtc atgacgtcat caaagcccgc 600
tggttcatgg acaaagctga acacaccgct ttggtccgcg aactcatcga cgctgtcaag 660
aaagaaccgg tacagccgtg gaatggcaaa aaagtcatcc tctccggtat catggcagaa 720
ccggatgaat tcctcgatat cttcagcgaa ttcaacatcg ctgtcgtcgc tgacgacctc 780
gctcaggaat cccgccagtt ccgtacagac gtaccgtccg gcatcgatcc cctcgaacag 840
ctcgctcagc agtggcagga cttcgatggc tgcccgctcg ctttgaacga agacaaaccg 900
cgtggccaga tgctcatcga catgactaag aaatacaatg ctgacgccgt cgtcatctgc 960
atgatgcgtt tctgcgatcc tgaagaattc gactatccga tttacaaacc ggaatttgaa 1020
gctgctggcg ttcgttacac ggtcctcgac ctcgacatcg aatctccgtc cctcgaacag 1080
ctccgcaccc gtatccaggc tttctcggaa atcctctaa 1119
<210> 188
<211> 1119
<212> DNA
<213> Peptostreptococcus anaerobius CAG:621
<400> 188
atgagtaact tagaagaact atttggaaaa cttgctgtat gtccattaga gcagatagat 60
aaatatgttg ctgatggtaa gaaagttatt ggttgcgcgc cagtatatgc tccagaagaa 120
cttgtatacg catcaggtat gattcctatg gcaatatggg gagcagaggg tgaagtaact 180
cttgcaaaag aatatttccc agctttctac gtatcaatca tcttaagact tttagatcta 240
ggtctagaag gcaagcttga taagatgtca ggaatgattc taccaggtct aagtgacgga 300
ctaaagggac ttagccagaa ctggaaaaga gctgtaaaga atgttccagc attatatata 360
ggatatggac agaacagaaa gatagaagct ggtatagttt acaatgctag acagtatgaa 420
aagctaaaag tacagttaga agaaatagct ggaaagaaga tagaagatgc tcagatagaa 480
gaagcaatcg ttttatacaa caagcacaga aaagctatgc aggcattctc agaccttgca 540
gctaaacact taaatacagt tactcctagc ctaagagcta aggtaatgtc aagtgcatgc 600
ctaatggaca aggctgaaca tttagaaata gtagaagcaa tcaacgctga actttcagct 660
atgccagaag aaaaatttga tggtaagaag attgtaacta ctggactact agctaacagt 720
cctgaaatat taaagatatt tgaagaattt aaacttggta ttgttgctga caacataaac 780
cacgaatcag gacagtttga ttatttagtt gatgaagcta ctggtaaccc aataaaggcg 840
ttgtctaagt ggatttcaga tattgaagga agtactttgc tatacgatcc agaaaaacta 900
agaggacaga taatcatcga taaggctaaa aaatacgatg cagatggtgt agtataccta 960
ctatctaaat tctctgattc agatgaattt gactacccaa tcattagaaa acagctagaa 1020
gaggctggat atatgcacat cttagttgaa gtagatcagc aaatgactaa cttcgaacaa 1080
gcaaaaactg cattgcagac ttttgcagac atgatatag 1119
<210> 189
<211> 792
<212> DNA
<213> Peptostreptococcus anaerobius CAG:621
<400> 189
atgagtgata tatacacaat gggtattgac attggatcaa catcatctaa atgtgtagtg 60
cttaagaatg gtaaagattt agttagtagc ggcgtcgtca atcttggcgc cggtactaaa 120
ggtgccgatc aggttataga aaaggtacta gctgactgtg gtatcaagtt cgaagatctg 180
aatgtgattg tttccacagg atatggtaga aattcttacg acagtgcaaa gaagactatg 240
agtgaactta gctgtcatgc taagggtggt acatatatct tcggacctgt aagaactatt 300
atagatatag gcggacagga cataaaggta ctaaaactaa atgacaaagg tatgatgaca 360
aatttcttga tgaatgataa atgtgcagct ggtacaggta gattcttaga ggttatggct 420
ggagtacttg atgttaagct agcagaacta ggtgacttag acaagttagc aactgaaaaa 480
acaccaatat cttcaacttg tacagtattt gcagaatcag aagtaatatc ttgtatggct 540
aagaaaatac ctattcctaa tataattagg ggtatacacg cttctgttgc tacaagagtt 600
gcaggtcttg ctaagagagg tggattaaca actccagtcg ctatgactgg tggtgttact 660
aagaactcag gaatagtaag ggcacttagc gaagagttag aaacagatat catgatttcg 720
gaaatttctc agttggcagg cgcaattgga gcggcattgt acgcttacga tgagtatctg 780
aaggaaaatt ag 792
<210> 190
<211> 777
<212> DNA
<213> Chloroflexus aggregans (strain MD-66 / DSM 9485)
<400> 190
atgagcgatg aaacgcttgt gctcagcact atcgaaggcc ccgttgcaat ccttacgctc 60
aatcgaccac aagcactcaa tgcccttagc cctgccctca tcgacgcact catccgccat 120
cttgagcatt gcgataacga cgatacgatc cgggtgatca ttatcaccgg cgccggtcgc 180
gcctttgccg ccggcgccga catcaaggcg atggccgatg cgacgccgat cgatatgctt 240
acaaccgata tgattgcccg ctgggcgcgg attgcggcgg tgcgcaaacc cgtgatcgca 300
gccgtgaacg gatttgccct cggtggtggc tgcgagttgg ctatgatgtg tgacatcatt 360
cttgccagtg aaacagccca attcggtcaa cccgaaatca acatcggcat tatccccggc 420
gccggtggca cccaacgcct gacccgcgca attggcccat accgtgcaat ggagatggtc 480
ttaaccggtg ctaccatcag tgcccaagaa gcttacgcct acggcctggt gaatcgggta 540
tgcccacccg atagcctgct tgatgaagcc cgccggttgg cccagaccat tgcagccaag 600
ccgccgctcg ctgtgcgttt agccaaggaa gccgtgcgcg ctgcggctga aacgaccgtg 660
cgtgaagggt tagccattga attgcgtaac ttttatctgc tctttgccag tgccgatcag 720
aaagagggca tgcgagcctt tatcgaaaag cgtacagcca acttcagtgg tcgctaa 777
<210> 191
<211> 774
<212> DNA
<213> Marivirga tractuosa (strain ATCC 23168/DSM 4126/NBRC 15989/NCIMB 1408/VKM B-1430/H-43)
<400> 191
atggaattca taaaagtaaa cacacaatat aaaaagcata ttgcgctcat caatcttaac 60
agacctaaag aattaaatgc cttgaactta cagttaatga ctgaattgaa ggacacttta 120
aaggtcttgg atgaggatga aaatgttaga gttataattt taacaggtaa tgagaaggct 180
tttgccgctg gagcagacat taagcaaatg gcaggtaaaa cggctattga catgctcaat 240
gttgatcaat tcagcacttg ggatcaaatc aaaaaaacaa agaagccatt gattgcagcc 300
gtttcaggat ttgcattggg cggtggttgc gaattagcga tgacttgcga tatgattgta 360
gcgtcagaat ctgctaaatt cggtcagcct gaaataaaaa tcggagtaat gccgggagca 420
ggtggtacac aaaggttaac tagggcaatt ggtaaagcca aagcgatgga attagtcttg 480
actggtaatt ttattagtgc agaggaagca atgcattatg gcttagttaa taaagttgtt 540
cctacagaga tgtatctgga agcagctgct gaactggctg agcaaatagc acaaatgtct 600
cctgtagcag ctaagttggc aaaagaatca gttaacaggg cttttgaaac gcatttggac 660
gaaggcttgc actttgagag aaaaaacttc tatttaacat ttgcttcaga agatcagact 720
gaaggtatgg aagcttttgt agagaaaaga aagcctgaat tcaaggggaa ataa 774
<210> 192
<211> 774
<212> DNA
<213> Marinithermus hydrothermalis (strain DSM 14884 / JCM 11576 / T1)
<400> 192
atgtacgaga acctcatcgt ggagacgctc gagggcggcg tggggctcat tcgcatccac 60
cggcccaagc gcctcaacgc cctgaaccag gccaccatgg acgagatcgt ccgcgcagta 120
cgcgcgtttg aagcggatga cgcggtgcgc gcgatcgtcc tcacggggga cgagcgggcg 180
ttcgccgcgg gcgcggacgt caccgagatg gacggcgcga acgtgccgga gatgctctcc 240
gggtaccgct tcgagcagtg ggagaccctc cggcgcacca cgaaaccctt gatcgccgcg 300
gtctcggggt tcgcgctcgg gggcgggctc gagctcgcga tgctgtgcga catcatcgta 360
gcctcggaga ccgcgcggct cggccagccc gagatcaacc tcgggatcat gccgggggcg 420
ggcggcacgc aacggctcac gcggcaggtg ggcaagtacc tcgcgatgga gatggtcctc 480
acggggcgca tgctcaccgc ggaggaggcg taccgtcacg gcctggtgaa ccgggtcgtc 540
ccggtcgagt tctacctgga ggaagccatc cagatcgcgc gggagatcgc gaagaaagcc 600
ccggtggcgg tgcgcctggc caaggacgcg atcctcaagg cagaggacac gccgctcgag 660
gtgggcctcg cgtacgagcg ccacaacttc tacctgctct tcggcaccga ggacaagcaa 720
gaagggatcc gcgctttcct cgagaagcgc aagcccgaat ggaaagggag gtag 774
<210> 193
<211> 780
<212> DNA
<213> Chitinophaga pinensis (strain ATCC 43595 / DSM 2588 / NCIB 11800 / UQM 2034)
<400> 193
atgcaaccac aatttataat catacaccgg caggtagccc catatgtggc tcatatacag 60
ttaaaccgcc ccaaagaact caatgcactg aaccttgaac tgatgattga gctcagggat 120
gcattaaaaa tgttggatgc ggatgacaat gttcgtgcaa tcgtcatcag cggtaatgaa 180
aaagcattcg ctgcaggcgc ggatatcaaa cagatggcgg ggaaaactgc catggacatg 240
tataacattg accagttcag cacctgggac acaataaaaa aaactaaaaa gccgttgatt 300
gcggcagtaa gcggcttcgc gctgggaggg ggatgtgagc tggtgatgct atgcgatatg 360
atagtagcca gtgaaacagc gcggttcgga cagccggaaa taaaaattgg cgtcatgcct 420
ggcgcaggtg gtacacaacg cctgacccgc gccgtaggta aagccctggc catggaaatg 480
gtattgacag gtcgctttat cactgcacaa gaagctgcac gtgcaggtct tatcaaccgg 540
gtaataccgg tggaactttt cctgcaggaa gccatccggc tggcgactga agtagctgcg 600
cttagtccgt tggcagtaaa gatggctaaa gaatctgtac tgaaagcatt tgatagctcc 660
ctcgaagaag gactacattt tgaacgtaaa aacttttatc tgctgtttgc ctctgaagat 720
cagaaagaag gcatgcaggc ttttgttgat aagagatcac ctgtttttaa aggaaaataa 780
780
<210> 194
<211> 777
<212> DNA
<213> Megasphaera elsdenii DSM 20460
<400> 194
gtgtatactc tcggaatcga cgttggttct tcttcttcca aggcagtcat cctggaagat 60
ggcaagaaga tcgtcgccca tgccgtcgtt gaaatcggca ccggttcgac cggtccggaa 120
cgcgtcctgg acgaagtctt caaagatacc aacttaaaaa ttgaagacat ggcgaacatc 180
atcgccacag gctatggccg tttcaatgtc gactgcgcca aaggcgaagt cagcgaaatc 240
acgtgccatg ccaaaggggc cctctttgaa tgccccggta cgacgaccat cctcgatatc 300
ggcggtcagg acgtcaagtc catcaaattg aatggccagg gcctggtcat gcagtttgcc 360
atgaacgaca aatgcgccgc tggtacgggc cgtttcctcg acgtcatgtc gaaggtactg 420
gaaatcccca tgtctgaaat gggggactgg tacttcaaat cgaagcatcc cgctgccgtc 480
agcagtacct gcacggtttt tgctgaatcg gaagtcattt cccttctttc caagaatgtc 540
ccgaaagaag atatcgtagc cggtgtccat cagtccatcg ccgccaaagc ctgcgctctc 600
gtgcgccgcg tcggtgtcgg tgaagacctg accatgaccg gcggtggctc ccgcgatccc 660
ggcgtcgtcg atgccgtatc gaaagaatta ggtattcctg tcagagtcgc tctgcatccc 720
caagcggtgg gtgctctcgg agctgctttg attgcttatg ataaaatcaa gaaataa 777
<210> 195
<211> 1287
<212> DNA
<213> Megasphaera elsdenii DSM 20460
<400> 195
atgagtgaag aaaaaacagt agatattgaa agcatgagct ccaaggaagc ccttggttac 60
ttcttgccga aagtcgatga agacgcacgt aaagcgaaaa aagaaggccg cctcgtttgc 120
tggtccgctt ctgtcgctcc tccggaattc tgcacggcta tggacatcgc catcgtctat 180
ccggaaactc acgcagctgg tatcggtgcc cgtcacggtg ctccggccat gctcgaagtt 240
gctgaaaaca aaggttacaa ccaggacatc tgttcctact gccgcgtcaa catgggctac 300
atggaactcc tcaaacagca ggctctgaca ggcgaaacgc cggaagtcct caaaaactcc 360
ccggcttctc cgattcccct tccggatgtt gtcctcactt gcaacaacat ctgcaatacc 420
ttgctcaaat ggtatgaaaa cttggctaaa gaattgaacg tacctctcat caacatcgac 480
gtaccgttca accatgaatt ccctgttacg aaacacgcta aacagtacat cgtcggcgaa 540
ttcaaacatg ctatcaaaca gctcgaagac ctttgcggcc gtcccttcga ctatgacaaa 600
ttcttcgaag tacagaaaca gacacagcgc tccatcgctg cctggaacaa aatcgctacg 660
tacttccagt acaaaccgtc gccgctcaac ggcttcgacc tcttcaacta catgggcctc 720
gccgttgctg cccgctcctt gaactactcg gaaatcacgt tcaacaaatt cctcaaagaa 780
ttggacgaaa aagtagctaa taagaaatgg gctttcggtg aaaacgaaaa atcccgtgtt 840
acttgggaag gtatcgctgt ctggatcgct ctcggccaca ccttcaaaga actcaaaggt 900
cagggcgctc tcatgactgg ttccgcttat cctggcatgt gggacgtttc ctacgaaccg 960
ggcgacctcg aatccatggc agaagcttat tcccgtacat acatcaactg ctgcctcgaa 1020
cagcgcggtg ctgttcttga aaaagttgtc cgcgatggca aatgcgacgg cttgatcatg 1080
caccagaacc gttcctgcaa gaacatgagc ctcctcaaca acgaaggcgg ccagcgcatc 1140
cagaagaacc tcggcgtacc gtacgtcatc ttcgacggcg accagaccga tgctcgtaac 1200
ttctcggaag cacagttcga tacccgcgta gaagctttgg cagaaatgat ggcagacaaa 1260
aaagccaatg aaggaggaaa ccactaa 1287
<210> 196
<211> 1119
<212> DNA
<213> Megasphaera elsdenii DSM 20460
<400> 196
atgagtcaga tcgacgaact tatcagcaaa ttacaggaag tatccaacca tccccagaag 60
acggttttga attataaaaa acagggtaaa ggcctcgtag gcatgatgcc ctactacgct 120
ccggaagaaa tcgtatatgc tgcaggctac ctcccggtag gcatgttcgg ttcccagaac 180
ccgcagatct ccgcagctcg tacgtacctt cctccgttcg cttgctcctt gatgcaggct 240
gacatggaac tccagctcaa cggcacctat gactgcctcg acgctgttat cttctccgtt 300
ccttgcgaca ctctccgctg catgagccag aaatggcacg gcaaagctcc ggtcatcgtc 360
ttcacacagc cgcagaaccg taagatccgc ccggctgtcg atttcctcaa agctgaatac 420
gaacatgtcc gtacggaatt ggaacgtatc ctcaacgtaa aaatctccga cctggctatc 480
caggaagcta tcaaagtata taacgaaaac cgtcaggtta tgcgtgaatt ctgcgacgta 540
gctgctcagt acccgcagat cttcactccg gtaaaacgtc atgacgtcat caaagcccgc 600
tggttcatgg acaaagctga acacaccgct ttggtccgcg aactcatcga cgctgtcaag 660
aaagaaccgg tacagccgtg gaatggcaaa aaagtcatcc tctccggtat catggcagaa 720
ccggatgaat tcctcgatat cttcagcgaa ttcaacatcg ctgtcgtcgc tgacgacctc 780
gctcaggaat cccgccagtt ccgtacagac gtaccgtccg gcatcgatcc cctcgaacag 840
ctcgctcagc agtggcagga cttcgatggc tgcccgctcg ctttgaacga agacaaaccg 900
cgtggccaga tgctcatcga catgactaag aaatacaatg ctgacgccgt cgtcatctgc 960
atgatgcgtt tctgcgatcc tgaagaattc gactatccga tttacaaacc ggaatttgaa 1020
gctgctggcg ttcgttacac ggtcctcgac ctcgacatcg aatctccgtc cctcgaacag 1080
ctccgcaccc gtatccaggc tttctcggaa atcctctaa 1119
<210> 197
<211> 777
<212> DNA
<213> Chloroflexus aurantiacus (strain ATCC 29364 / DSM 637 / Y-400-fl)
<400> 197
atgagtgaag agtctctggt tctcagcaca attgaaggcc ccatcgccat cctcaccctc 60
aatcgccccc aggccctcaa tgcgctcagt ccggccttga ttgatgacct cattcgccat 120
ttagaagcct gcgatgccga tgacacaatc cgcgtgatca ttatcaccgg cgccggacgg 180
gcatttgctg ccggcgctga catcaaagcg atggccaatg ccacgcctat tgatatgctc 240
accagtggca tgattgcgcg ctgggcacgc atcgccgcgg tgcgcaaacc ggtgattgct 300
gccgtgaatg ggtatgcgct cggtggtggt tgtgaattgg caatgatgtg cgacatcatc 360
atcgccagtg aaaacgcgca gttcggacaa ccggaaatca atctgggcat cattcccggt 420
gctggtggca cccaacggct gacccgcgcc cttggcccgt atcgcgcaat ggaattgatc 480
ctgaccggcg cgaccatcag tgctcaggaa gctctcgccc acggcctggt gtgccgggtc 540
tgcccgcctg aaagcctgct cgatgaagcc cgtcggatcg cgcaaaccat tgccaccaaa 600
tcaccactgg ctgtacagtt ggcgaaagag gcagtccgta tggccgccga aaccactgtg 660
cgcgaggggt tggctatcga gctgcgtaac ttctatctgc tgtttgccag tgctgaccaa 720
aaagagggga tgcaggcatt tatcgagaaa cgcgctccca acttcagtgg tcgttga 777
<210> 198
<211> 777
<212> DNA
<213> Ruegeria pomeroyi DSS-3
<400> 198
atggcctttg agacgatcat cgtcgaagtt gaagaccacg tagccctgat caggctgaac 60
cgtcccgatg cgctcaatgc gctcaacacc cagttgctgg gcgagttgtg taccgcgctg 120
gaagaggccg acggcaatga caaggtgcgc tgcatcgtca tcaccggcag cgacaaggca 180
tttgccgccg gggccgatat ccgcgagatg tcccaaaaga cctatgtcga ggtgtatagc 240
gagaacctgt tcgcggccgc caacgaccgt gtcagcgcca tccgcaagcc gatcatcgcc 300
gcagtggcgg gctatgcgct gggcggtggc tgtgaactgg cgatgctgtg cgatttcatc 360
atcgcggcgg acaccgcaaa gttcggccag cccgagatca acctgggcgt gatcgccggt 420
atcggcggca cccagcgtct gacccggctg gtgggcaagt ccaagtcgat ggacctgaac 480
ctgaccgggc ggttcatgga tgccgaagag gccgagcgcg ccgggctggt cagccgcgtg 540
gttccggcca agaagctggt cgaagaggcg ctgagcgcag cccagaagat cgccgagaaa 600
tcgatgatct cggcctatgc ggtcaaggag gcggtcaacc gctcttacga gaccacgctg 660
agcgaggggc tgctgttcga gcgccgggtg ttccattcga tgttcgccac cgaagatcag 720
aaggaaggca tggccgcttt cctcgagaag cgggcggcac agttccgcga caagtga 777
<210> 199
<211> 132
<212> PRT
<213> E. coli
<400> 199
Met Ser Thr Thr His Asn Val Pro Gln Gly Asp Leu Val Leu Arg Thr
1 5 10 15
Leu Ala Met Pro Ala Asp Thr Asn Ala Asn Gly Asp Ile Phe Gly Gly
20 25 30
Trp Leu Met Ser Gln Met Asp Ile Gly Gly Ala Ile Leu Ala Lys Glu
35 40 45
Ile Ala His Gly Arg Val Val Thr Val Arg Val Glu Gly Met Thr Phe
50 55 60
Leu Arg Pro Val Ala Val Gly Asp Val Val Cys Cys Tyr Ala Arg Cys
65 70 75 80
Val Gln Lys Gly Thr Thr Ser Val Ser Ile Asn Ile Glu Val Trp Val
85 90 95
Lys Lys Val Ala Ser Glu Pro Ile Gly Gln Arg Tyr Lys Ala Thr Glu
100 105 110
Ala Leu Phe Lys Tyr Val Ala Val Asp Pro Glu Gly Lys Pro Arg Ala
115 120 125
Leu Pro Val Glu
130
<210> 200
<211> 132
<212> PRT
<213> Klebsiella oxytoca 10-5245
<400> 200
Met Thr Thr Thr Asp Leu Ala Pro Lys Gly Glu Leu Val Leu Arg Thr
1 5 10 15
Leu Ala Met Pro Ala Asp Thr Asn Ala Asn Gly Asp Ile Phe Gly Gly
20 25 30
Trp Leu Met Ser Gln Met Asp Ile Gly Gly Ala Ile Met Ala Lys Glu
35 40 45
Ile Ala His Gly Arg Val Val Thr Val Arg Val Asp Gly Met Thr Phe
50 55 60
Leu Arg Pro Val Ala Val Gly Asp Val Val Cys Cys Tyr Ala Asn Cys
65 70 75 80
Val Lys Arg Gly Asn Thr Ser Ile Thr Ile Asn Met Glu Val Trp Val
85 90 95
Lys Lys Val Ser Ser Glu Pro Ile Gly Gln Arg Tyr Lys Ala Thr Glu
100 105 110
Ala Leu Phe Ile Tyr Val Ala Val Asp Asn Gln Gly Lys Pro Arg Ala
115 120 125
Leu Pro Thr Leu
130
<210> 201
<211> 133
<212> PRT
<213> Cronobacter turicensis
<400> 201
Met Thr Thr Glu Gln Thr Thr Pro Gln Gly Glu Leu Val Leu Arg Thr
1 5 10 15
Leu Ala Met Pro Ala Asp Thr Asn Ala Asn Gly Asp Ile Phe Gly Gly
20 25 30
Trp Leu Met Ala Gln Met Asp Ile Gly Gly Ala Ile Leu Ala Lys Glu
35 40 45
Ile Ala His Gly Arg Val Val Thr Val Arg Val Asp Gly Met Thr Phe
50 55 60
Leu Arg Pro Val Ala Val Gly Asp Val Val Cys Cys Tyr Ala Arg Cys
65 70 75 80
Val Lys Arg Gly Asn Thr Ser Val Thr Ile Asn Ile Glu Val Trp Val
85 90 95
Lys Lys Val Ser Ser Glu Pro Leu Gly Gln Arg Tyr Arg Ala Thr Glu
100 105 110
Ala Leu Phe Ile Tyr Val Ala Val Asp Asp Asn Gly Lys Pro Arg Pro
115 120 125
Leu Pro Pro Val Ala
130
<210> 202
<211> 133
<212> PRT
<213> Citrobacter freundii
<400> 202
Met Thr Thr Thr Asn Asn Thr Pro Gln Gly Glu Leu Val Leu Arg Thr
1 5 10 15
Leu Ala Met Pro Ala Asp Thr Asn Ala Asn Gly Asp Ile Phe Gly Gly
20 25 30
Trp Leu Met Ser Gln Met Asp Ile Gly Gly Ala Ile Gln Ala Lys Glu
35 40 45
Ile Ala His Gly Arg Val Val Thr Val Arg Val Glu Gly Met Ser Phe
50 55 60
Leu Arg Pro Val Ala Val Gly Asp Val Val Cys Cys Tyr Ala Arg Cys
65 70 75 80
Val Lys Arg Gly Thr Thr Ser Ile Ser Ile Asn Ile Glu Val Trp Val
85 90 95
Lys Lys Val Ala Ser Glu Pro Ile Gly Gln Arg Tyr Lys Ala Thr Glu
100 105 110
Ala Leu Phe Ile Tyr Val Ala Val Asp Lys Asp Gly Lys Pro Arg Pro
115 120 125
Ile Pro Thr Leu Ala
130
<210> 203
<211> 130
<212> PRT
<213> Salmonella enterica
<400> 203
Met Asp Asn Thr Pro Gln Gly Glu Leu Val Leu Arg Thr Leu Ala Met
1 5 10 15
Pro Ala Asp Thr Asn Ala Asn Gly Asp Ile Phe Gly Gly Trp Leu Met
20 25 30
Ser Gln Met Asp Ile Gly Gly Ala Ile Leu Ala Lys Glu Ile Ala His
35 40 45
Gly Arg Val Val Thr Val Arg Val Glu Gly Met Thr Phe Leu Arg Pro
50 55 60
Val Ala Val Gly Asp Val Val Cys Cys Tyr Ala Arg Cys Val Lys Arg
65 70 75 80
Gly Thr Thr Ser Ile Ser Ile Asn Ile Glu Val Trp Val Lys Lys Val
85 90 95
Ala Ser Glu Pro Ile Gly Gln Arg Tyr Lys Ala Thr Glu Ala Leu Phe
100 105 110
Ile Tyr Val Ala Val Asp Pro Asp Gly Lys Pro Arg Pro Leu Pro Val
115 120 125
Gln Gly
130
<210> 204
<211> 133
<212> PRT
<213> Shigella flexneri 1235-66
<400> 204
Met Thr Thr Thr Asn Asn Thr Pro Gln Gly Glu Leu Val Leu Arg Thr
1 5 10 15
Leu Ala Met Pro Ala Asp Thr Asn Ala Asn Gly Asp Ile Phe Gly Gly
20 25 30
Trp Leu Met Ser Gln Met Asp Ile Gly Gly Ala Ile Gln Ala Lys Glu
35 40 45
Ile Ala His Gly Arg Val Val Thr Val Arg Val Glu Gly Met Ser Phe
50 55 60
Leu Arg Pro Val Ala Val Gly Asp Val Val Cys Cys Tyr Ala Arg Cys
65 70 75 80
Val Lys Arg Gly Thr Thr Ser Ile Ser Ile Asn Ile Glu Val Trp Val
85 90 95
Lys Lys Val Ala Ser Glu Pro Ile Gly Gln Arg Tyr Lys Ala Thr Glu
100 105 110
Ala Leu Phe Ile Tyr Val Ala Val Asp Lys Asp Gly Lys Pro Arg Pro
115 120 125
Ile Pro Lys Gln Val
130
<210> 205
<211> 399
<212> DNA
<213> E. coli
<400> 205
atgtctacaa cacataacgt ccctcagggc gatcttgttt tacgtacttt agccatgccc 60
gccgatacca atgccaatgg tgacatcttt ggtggttggt taatgtcaca aatggatatt 120
ggcggcgcta ttctggcaaa agaaattgcc cacggtcgcg tagtgactgt gcgggttgaa 180
ggaatgactt tcttacggcc ggttgcggtc ggcgatgtgg tgtgctgcta tgcacgctgt 240
gtccagaaag ggacgacatc ggtcagcatt aatattgaag tgtgggtgaa aaaagtagcg 300
tctgaaccaa ttgggcaacg ctataaagcg acagaagcat tatttaagta tgtcgcggtt 360
gatcctgaag gaaaacctcg cgccttacct gttgagtaa 399
<210> 206
<211> 399
<212> DNA
<213> Klebsiella oxytoca 10-5245
<400> 206
atgacaacaa cagatcttgc gccgaagggc gaattggttt tacgcaccct ggcgatgccg 60
gcggacacca acgcaaacgg cgatattttc ggcggctggc tgatgtcgca aatggatatt 120
ggcggggcca ttatggccaa agaaattgcc cacggtcgcg tcgtgaccgt gcgcgtcgac 180
ggcatgacct ttttgcgccc ggtggcggtc ggcgacgtcg tgtgctgcta cgccaactgc 240
gtgaagcgcg gcaatacgtc gataactatc aatatggaag tgtgggtcaa gaaagtgtcg 300
tctgagccca tcggccagcg ctacaaagcc accgaagcgc tgtttatcta cgtcgcggtg 360
gataatcagg gaaaaccgcg cgcactgccg actctgtga 399
<210> 207
<211> 402
<212> DNA
<213> Cronobacter turicensis
<400> 207
atgacgacag agcaaaccac gcctcaaggt gaactggttt tacgtaccct ggcgatgccc 60
gccgatacca acgccaatgg cgatattttt ggcggctggc tgatggccca gatggacatt 120
ggcggcgcga tccttgccaa agagatagcc catggccgcg tggtgacggt acgcgttgac 180
ggcatgacgt tcctgcgccc ggtcgcggtt ggcgatgtgg tgtgctgtta tgcccgttgc 240
gtgaagcgcg gcaatacatc ggtgacgatt aatattgaag tgtgggtgaa gaaggtttct 300
tccgagccgc ttggccagcg ctaccgcgcg accgaggcgc tgttcattta tgttgcggtc 360
gatgacaacg gcaaaccgcg cccgctgccg cctgtggcgt ga 402
<210> 208
<211> 402
<212> DNA
<213> Citrobacter freundii
<400> 208
atgacaacaa cgaataacac tccccagggt gaactggttt tacgcactct ggccatgcct 60
gccgatacca acgcgaacgg tgatattttt ggcggctggc tgatgtcaca aatggatata 120
ggtggcgcga ttcaggccaa agagatcgca catggtcgtg tggtaactgt gcgggttgaa 180
ggaatgagct ttttgcgccc ggtcgccgta ggtgatgtag tgtgttgcta tgctcgctgt 240
gtgaaacgcg ggacaacctc aatcagcatc aatattgaag tttgggtgaa gaaagtcgct 300
tctgaaccta ttggccagcg ttataaggcc accgaagctc tgtttatcta cgttgccgtt 360
gataaagacg ggaaaccgcg tccaatcccc acgttggcct ga 402
<210> 209
<211> 393
<212> DNA
<213> Salmonella enterica
<400> 209
atggataata ctcctcaggg cgagctggtt ttacgtacat tggccatgcc tgccgatacc 60
aatgcgaacg gcgatatttt tggcggctgg ctgatgtcgc aaatggatat tggcggcgcg 120
atactggcca aagagatcgc gcacggtcgg gttgtaaccg tacgcgtgga aggaatgaca 180
tttctgcgcc ccgtcgcggt tggcgatgtc gtatgctgct acgcgcgctg cgttaaacgc 240
ggtacgacgt ctattagcat aaatattgaa gtctgggtga aaaaagtcgc gtcagaaccg 300
attgggcagc gctacaaggc caccgaggcg ctgtttattt atgttgccgt cgatccggac 360
ggtaaacctc gcccgctccc ggttcagggt taa 393
<210> 210
<211> 402
<212> DNA
<213> Shigella flexneri 1235-66
<400> 210
atgacaacaa cgaataacac cccccagggt gaactggttt tacgcactct ggccatgcct 60
gccgatacca atgctaacgg tgatattttt ggcggctggc tgatgtcaca gatggatatt 120
ggtggcgcta ttcaggccaa agagatcgca cacggtcgcg tggtgacggt gcgagttgaa 180
ggaatgagct ttttgcgccc ggttgccgtg ggtgatgtgg tctgttgcta cgcacgctgc 240
gtaaaacgcg ggacgacgtc aatcagcatt aatattgaag tctgggtgaa gaaagtcgct 300
tcggaaccta ttggccagcg ttacaaagcc actgaagccc tgtttatcta cgtcgctgta 360
gataaagacg gtaaaccccg tccgatacct aaacaggtct ga 402
<210> 211
<211> 554
<212> PRT
<213> Ilyobacter polytropus DhaB1 protein
<400> 211
Met Lys Ser Lys Arg Phe Glu Val Leu Lys Glu Arg Pro Val Asn Lys
1 5 10 15
Asp Gly Phe Ile Ser Glu Trp Ile Glu Glu Gly Leu Ile Ala Met Glu
20 25 30
Ser Pro Asn Asp Pro Asn Pro Ser Leu Lys Ile Glu Asn Gly Gln Ile
35 40 45
Thr Glu Leu Asp Gly Lys Ser Arg Glu Glu Phe Asp Met Ile Asp Arg
50 55 60
Phe Ile Ala Asp Tyr Ala Ile Asn Met Glu Asn Ala Glu Lys Ala Met
65 70 75 80
Lys Met Ser Ser Met Glu Ile Ser Lys Lys Leu Val Asp Ile Asn Val
85 90 95
Ser Arg Asp Glu Val Leu Glu Ile Thr Thr Gly Ile Thr Pro Ala Lys
100 105 110
Ile Ile Lys Val Met Glu His Met Asn Val Val Glu Met Met Met Ala
115 120 125
Val Gln Lys Met Arg Ala Arg Lys Thr Pro Ser Asn Gln Cys His Val
130 135 140
Thr Asn Leu Arg Asp Asn Pro Val Leu Ile Ala Ala Asp Ala Ala Glu
145 150 155 160
Ala Ser Val Arg Gly Phe Asp Glu Gln Glu Thr Thr Ile Gly Ile Val
165 170 175
Arg Tyr Ala Pro Phe Asn Ala Ile Ser Ile Phe Val Gly Ser Gln Val
180 185 190
Gly Arg Gly Gly Ile Leu Thr Gln Cys Ser Val Glu Glu Ala Thr Glu
195 200 205
Leu Glu Leu Gly Met Lys Gly Phe Thr Ser Tyr Ala Glu Thr Val Ser
210 215 220
Val Tyr Gly Thr Glu Gln Val Phe Ile Asp Gly Asp Asp Thr Pro Trp
225 230 235 240
Ser Lys Ala Phe Leu Ala Ser Ala Tyr Ala Ser Arg Gly Leu Lys Met
245 250 255
Arg Phe Thr Ser Gly Thr Gly Ser Glu Ala Leu Met Gly Asn Ala Glu
260 265 270
Gly Lys Ser Met Leu Tyr Leu Glu Ala Arg Cys Ile Tyr Val Thr Arg
275 280 285
Gly Ser Gly Val Gln Gly Leu Gln Asn Gly Ser Val Ser Cys Ile Gly
290 295 300
Met Pro Gly Ser Leu Pro Gly Gly Ile Arg Ala Val Leu Ala Glu Asn
305 310 315 320
Leu Ile Ala Met Leu Leu Asp Leu Glu Cys Ala Ser Ala Asn Asp Gln
325 330 335
Thr Phe Ser His Ser Glu Tyr Arg Arg Thr Ala Arg Thr Leu Met Gln
340 345 350
Met Leu Pro Gly Thr Asp Phe Ile Phe Ser Gly Tyr Ser Ala Val Pro
355 360 365
Asn Cys Asp Asn Met Phe Ala Gly Ser Asn Phe Asp Ala Glu Asp Phe
370 375 380
Asp Asp Tyr Asn Ala Leu Gln Arg Asp Leu Lys Ile Asp Gly Gly Leu
385 390 395 400
Lys Pro Val Thr Glu Asp Glu Ile Val Lys Val Arg Asn Lys Ala Ala
405 410 415
Arg Ala Ile Gln Gly Leu Phe Lys Glu Leu Asp Leu Pro Glu Ile Thr
420 425 430
Asp Glu Glu Val Glu Ala Ala Thr Tyr Ala His Gly Ser Val Asp Met
435 440 445
Pro Ala Arg Asn Val Val Glu Asp Leu Lys Ala Ala Glu Glu Leu Leu
450 455 460
Ser Ser Gly Ile Thr Gly Val Asp Leu Val Lys Gly Leu Ser Arg Ser
465 470 475 480
Gly Phe Asp Asp Val Ala Glu His Val Leu Gly Met Leu Lys Gln Arg
485 490 495
Val Ser Gly Asp Tyr Leu Gln Thr Ser Ala Ile Leu Asp Lys Gly Phe
500 505 510
Lys Ile Lys Ser Ala Ile Asn Asp Arg Asn Asp Tyr Met Gly Pro Gly
515 520 525
Ser Gly Tyr Arg Ile Ser Glu Glu Arg Trp Glu Glu Ile Lys Asn Ile
530 535 540
Pro Ser Ala Ile Lys Pro Glu Ser Ile Glu
545 550
<210> 212
<211> 187
<212> PRT
<213> Ilyobacter polytropus DhaB2 protein
<400> 212
Met Glu Asn Lys Phe Val Pro Ser Val Lys Ile Glu Glu Ile Gly Glu
1 5 10 15
Ala Lys Lys Gly Ser Arg Ser Glu Glu Val Val Ile Gly Leu Ala Pro
20 25 30
Ala Phe Lys Lys Phe Gln His Lys Thr Ile Thr Asp Val Pro His Asp
35 40 45
Glu Val Leu Thr Glu Leu Ile Ala Gly Ile Glu Glu Glu Gly Leu Lys
50 55 60
Ala Arg Ile Val Arg Val Thr Arg Thr Ser Asp Val Ser Phe Met Ala
65 70 75 80
Leu Asp Ala Ala Lys Leu Ser Gly Ser Gly Ile Gly Ile Gly Ile Gln
85 90 95
Ser Lys Gly Thr Thr Val Ile His Gln Lys Asp Leu Leu Pro Leu Asn
100 105 110
Asn Leu Glu Leu Phe Pro Gln Ala Pro Leu Leu Thr Pro Glu Thr Phe
115 120 125
Arg Leu Ile Gly Lys Asn Ala Ala Lys Tyr Ala Lys Gly Glu Ser Pro
130 135 140
Asn Pro Val Pro Val Ala Ser Asp Gln Met Ala Arg Pro Lys Tyr Gln
145 150 155 160
Ala Lys Ala Ala Leu Leu His Ile Lys Glu Thr Lys His Val Val Gln
165 170 175
His Gly Lys Pro Val Glu Ile Lys Tyr Glu Phe
180 185
<210> 213
<211> 143
<212> PRT
<213> Ilyobacter polytropus DhaB3 protein
<400> 213
Met Asn Ile Asp Val Lys Asn Ile Asn Pro Ile Ser Asp Tyr Pro Leu
1 5 10 15
Gly Glu Lys Arg Lys Glu Trp Leu Lys Thr Ser Thr Gly Lys Thr Leu
20 25 30
Asp Glu Ile Thr Leu Glu Asn Val Ile Asn Gly Asp Ile Lys Pro Glu
35 40 45
Asp Ile Arg Ile Ser Pro Glu Thr Leu Lys Leu Gln Gly Glu Ile Ala
50 55 60
Lys Lys Gly Asn Arg Pro Thr Ile Thr Lys Asn Phe Glu Arg Ala Ser
65 70 75 80
Glu Met Val Ala Ile Pro Asp Asp Lys Ile Leu Ala Thr Tyr Asn Ala
85 90 95
Leu Arg Pro Tyr Arg Ser Ser Lys Glu Glu Leu Phe Glu Ile Ala Asp
100 105 110
Glu Leu Glu Ser Lys Tyr Ser Ala Val Val Ile Ser Ala Phe Ile Lys
115 120 125
Glu Ala Ala Glu Val Tyr Glu Gln Arg Gly Gln Leu Arg Lys Asp
130 135 140
<210> 214
<211> 1665
<212> DNA
<213> Ilyobacter polytropus dhaB1 gene
<400> 214
atgaaatcaa aaagatttga agtattgaag gaacgtcctg taaataaaga tggctttata 60
agtgaatgga tagaagaagg actaatcgca atggaaagtc ctaacgatcc taatccaagt 120
ttgaaaatag aaaatggtca aataacagag ttagacggta aaagcagaga agaatttgac 180
atgatcgaca gatttatagc agattatgca ataaatatgg aaaatgctga aaaagctatg 240
aaaatgtcat ctatggaaat atctaaaaaa ctagtagaca taaatgtatc aagagatgaa 300
gtgctggaaa taacaacagg aattacccca gcaaaaataa ttaaagttat ggaacacatg 360
aatgttgtag agatgatgat ggccgtacaa aaaatgagag ccagaaaaac tccttccaat 420
cagtgtcatg taactaactt gagagacaat cctgtattaa ttgccgctga tgctgccgaa 480
gcgtcagtaa gaggttttga tgaacaggag actacaatcg gtatagtaag atatgcacct 540
ttcaatgcca tctcaatatt tgtaggttca caagtaggta gaggaggaat actgactcag 600
tgttctgtag aagaagctac tgaattagag cttggaatga aaggattcac aagttatgca 660
gaaacagtgt ctgtatatgg tacagagcaa gtgtttatag acggtgacga cactccttgg 720
tcaaaagcct tccttgcttc agcatatgca tcaagaggat taaaaatgag atttacatct 780
ggaactggtt cagaggctct tatgggaaat gctgaaggga aatcaatgct ttaccttgaa 840
gcaagatgta tctacgtaac aagagggtct ggagtacaag gactacaaaa tggttctgta 900
agctgcatag ggatgcctgg gtcactacct ggaggaataa gggctgtact ggctgaaaac 960
ctgatagcaa tgttacttga cttagaatgt gcatcagcaa atgaccagac attctctcac 1020
tcagaatata gaaggacagc aagaactcta atgcagatgc ttcctggaac agacttcata 1080
ttctcaggat atagtgccgt accaaactgt gataacatgt ttgctggatc aaattttgat 1140
gcagaggatt ttgatgacta taatgctctt cagagagacc ttaaaataga cggtggttta 1200
aaacctgtaa ctgaagatga gattgtcaaa gtaagaaata aagcagccag agcaatacag 1260
gggttattca aagaacttga tcttcctgaa ataacagatg aagaagtgga agcagcaaca 1320
tatgcccacg gaagtgttga tatgcctgca agaaatgtgg ttgaagattt aaaagcggca 1380
gaagaacttt taagctctgg aataacagga gtagatcttg ttaaaggact tagcagaagc 1440
ggatttgacg atgtagctga gcatgtttta ggtatgttaa aacagagagt ttcaggagat 1500
tacctgcaaa cttcagctat attagacaaa ggctttaaaa taaagagtgc cataaacgat 1560
agaaatgatt acatgggtcc tggaagcgga tatagaataa gcgaggaaag atgggaagag 1620
atcaaaaata tcccatcagc tataaaacca gaaagtatag aatag 1665
<210> 215
<211> 564
<212> DNA
<213> Ilyobacter polytropus dhaB2 gene
<400> 215
atggaaaata aatttgtacc atctgtaaag atagaagaaa tcggagaagc aaaaaaagga 60
agcagatctg aagaagtagt tataggactg gctcctgcat ttaaaaaatt tcaacataaa 120
acaataacag atgtccctca cgatgaagtc ctgactgaac ttatcgcagg tatagaggaa 180
gagggattaa aggcaagaat cgtaagagta acaagaactt ctgatgtttc atttatggcg 240
ctggatgctg caaagttaag tggttctgga ataggaatag gaattcagtc aaagggaaca 300
acagtaatcc accaaaagga tctgcttcct ctaaacaatc tagaactttt cccacaggct 360
ccactattaa cacctgaaac attcagatta ataggaaaaa atgctgcaaa atatgcaaag 420
ggagaatctc caaatccagt acctgtagcc agtgaccaga tggcgagacc taaatatcag 480
gcaaaagcag cattactaca tataaaagag acaaaacatg tcgttcaaca cggaaaacca 540
gtagagataa agtatgaatt ttag 564
<210> 216
<211> 432
<212> DNA
<213> Ilyobacter polytropus dhaB3 gene
<400> 216
atgaatatag atgttaaaaa tataaatcca atctctgatt atccattagg agaaaagaga 60
aaagaatggt tgaaaacatc cacaggtaaa actttggatg aaataacttt agaaaatgta 120
ataaatggag atataaagcc tgaagatata agaatctcac ctgaaactct aaaattacag 180
ggagagatag caaagaaagg taacaggcca actataacaa agaactttga aagagccagt 240
gaaatggttg ccattccaga tgataaaata ttagcaactt acaacgcttt gagaccttac 300
agatcttcaa aggaagaatt atttgaaata gccgatgaac tagaaagtaa gtattcagct 360
gttgtaatat ctgcatttat caaggaagcc gcagaagttt atgaacaaag aggtcaactt 420
agaaaagatt ag 432
<210> 217
<211> 607
<212> PRT
<213> K.pneumoniae gdrA protein
<400> 217
Met Pro Leu Ile Ala Gly Ile Asp Ile Gly Asn Ala Thr Thr Glu Val
1 5 10 15
Ala Leu Ala Ser Asp Asp Pro Gln Ala Arg Ala Phe Val Ala Ser Gly
20 25 30
Ile Val Ala Thr Thr Gly Met Lys Gly Thr Arg Asp Asn Ile Ala Gly
35 40 45
Thr Leu Ala Ala Leu Glu Gln Ala Leu Ala Lys Thr Pro Trp Ser Val
50 55 60
Ser Asp Val Ser Arg Ile Tyr Leu Asn Glu Ala Ala Pro Val Ile Gly
65 70 75 80
Asp Val Ala Met Glu Thr Ile Thr Glu Thr Ile Ile Thr Glu Ser Thr
85 90 95
Met Ile Gly His Asn Pro Gln Thr Pro Gly Gly Val Gly Val Gly Val
100 105 110
Gly Thr Thr Ile Ala Leu Gly Arg Leu Ala Thr Leu Pro Ala Ala Gln
115 120 125
Tyr Ala Glu Gly Trp Ile Val Leu Ile Asp Asp Ala Val Asp Phe Leu
130 135 140
Asp Ala Val Trp Trp Leu Asn Glu Ala Leu Asp Arg Gly Ile Asn Val
145 150 155 160
Val Ala Ala Ile Leu Lys Lys Asp Asp Gly Val Leu Val Asn Asn Arg
165 170 175
Leu Arg Lys Thr Leu Pro Val Val Asp Glu Val Thr Leu Leu Glu Gln
180 185 190
Val Pro Glu Gly Val Met Ala Ala Val Glu Val Ala Ala Pro Gly Gln
195 200 205
Val Val Arg Ile Leu Ser Asn Pro Tyr Gly Ile Ala Thr Phe Phe Gly
210 215 220
Leu Ser Pro Glu Glu Thr Gln Ala Ile Val Pro Ile Ala Arg Ala Leu
225 230 235 240
Ile Gly Asn Arg Ser Ala Val Val Leu Lys Thr Pro Gln Gly Asp Val
245 250 255
Gln Ser Arg Val Ile Pro Ala Gly Asn Leu Tyr Ile Ser Gly Glu Lys
260 265 270
Arg Arg Gly Glu Ala Asp Val Ala Glu Gly Ala Glu Ala Ile Met Gln
275 280 285
Ala Met Ser Ala Cys Ala Pro Val Arg Asp Ile Arg Gly Glu Pro Gly
290 295 300
Thr His Ala Gly Gly Met Leu Glu Arg Val Arg Lys Val Met Ala Ser
305 310 315 320
Leu Thr Asp His Glu Met Ser Ala Ile Tyr Ile Gln Asp Leu Leu Ala
325 330 335
Val Asp Thr Phe Ile Pro Arg Lys Val Gln Gly Gly Met Ala Gly Glu
340 345 350
Cys Ala Met Glu Asn Ala Val Gly Met Ala Ala Met Val Lys Ala Asp
355 360 365
Arg Leu Gln Met Gln Val Ile Ala Arg Glu Leu Ser Ala Arg Leu Gln
370 375 380
Thr Glu Val Val Val Gly Gly Val Glu Ala Asn Met Ala Ile Ala Gly
385 390 395 400
Ala Leu Thr Thr Pro Gly Cys Ala Ala Pro Leu Ala Ile Leu Asp Leu
405 410 415
Gly Ala Gly Ser Thr Asp Ala Ala Ile Val Asn Ala Glu Gly Gln Ile
420 425 430
Thr Ala Val His Leu Ala Gly Ala Gly Asn Met Val Ser Leu Leu Ile
435 440 445
Lys Thr Glu Leu Gly Leu Glu Asp Leu Ser Leu Ala Glu Ala Ile Lys
450 455 460
Lys Tyr Pro Leu Ala Lys Val Glu Ser Leu Phe Ser Ile Arg His Glu
465 470 475 480
Asn Gly Ala Val Glu Phe Phe Arg Glu Ala Leu Ser Pro Ala Val Phe
485 490 495
Ala Lys Val Val Tyr Ile Lys Glu Gly Glu Leu Val Pro Ile Asp Asn
500 505 510
Ala Ser Pro Leu Glu Lys Ile Arg Leu Val Arg Arg Gln Ala Lys Glu
515 520 525
Lys Val Phe Val Thr Asn Cys Leu Arg Ala Leu Arg Gln Val Ser Pro
530 535 540
Gly Gly Ser Ile Arg Asp Ile Ala Phe Val Val Leu Val Gly Gly Ser
545 550 555 560
Ser Leu Asp Phe Glu Ile Pro Gln Leu Ile Thr Glu Ala Leu Ser His
565 570 575
Tyr Gly Val Val Ala Gly Gln Gly Asn Ile Arg Gly Thr Glu Gly Pro
580 585 590
Arg Asn Ala Val Ala Thr Gly Leu Leu Leu Ala Gly Gln Ala Asn
595 600 605
<210> 218
<211> 117
<212> PRT
<213> K.pneumoniae gdrB protein
<400> 218
Met Ser Leu Ser Pro Pro Gly Val Arg Leu Phe Tyr Asp Pro Arg Gly
1 5 10 15
His His Ala Gly Ala Ile Asn Glu Leu Cys Trp Gly Leu Glu Glu Gln
20 25 30
Gly Val Pro Cys Gln Thr Ile Thr Tyr Asp Gly Gly Gly Asp Ala Ala
35 40 45
Ala Leu Gly Ala Leu Ala Ala Arg Ser Ser Pro Leu Arg Val Gly Ile
50 55 60
Gly Leu Ser Ala Ser Gly Glu Ile Ala Leu Thr His Ala Gln Leu Pro
65 70 75 80
Ala Asp Ala Pro Leu Ala Thr Gly His Val Thr Asp Ser Asp Asp His
85 90 95
Leu Arg Thr Leu Gly Ala Asn Ala Gly Gln Leu Val Lys Val Leu Pro
100 105 110
Leu Ser Glu Arg Asn
115
<210> 219
<211> 607
<212> PRT
<213> Ilyobacter polytropus gdrA protien
<400> 219
Met Lys Ile Ile Val Gly Val Asp Ile Gly Asn Ala Thr Thr Glu Val
1 5 10 15
Ala Leu Ala Lys Val Asp Asn Ile Glu Cys Lys Phe Leu Ser Ser Ala
20 25 30
Leu His Glu Thr Thr Gly Leu Lys Gly Thr Lys Asp Asn Val Leu Gly
35 40 45
Ile Lys Arg Ala Ile Lys Lys Ala Met Lys Arg Ala Asp Leu Lys Asn
50 55 60
Ala Asp Leu Ser Leu Ile Arg Ile Asn Glu Ala Thr Pro Val Ile Gly
65 70 75 80
Asp Val Ser Met Glu Thr Ile Thr Glu Thr Ile Ile Thr Glu Ser Thr
85 90 95
Met Ile Gly His Asn Pro Ser Thr Pro Gly Gly Ile Gly Leu Gly Ile
100 105 110
Gly Glu Thr Ile Leu Phe Gln Glu Leu Gly Asn Phe Glu Asn Asp Lys
115 120 125
Asp Tyr Ile Val Ile Val Glu Lys Ser Phe Ser Phe Leu Glu Val Ala
130 135 140
His Arg Ile Asn Glu Ala Phe Lys Asn Gly Cys Lys Ile Lys Gly Ala
145 150 155 160
Ile Ile Gln Lys Asp Asp Gly Val Leu Ile Asn Asn Arg Leu Ile Asn
165 170 175
Lys Ile Pro Ile Val Asp Glu Val Leu Phe Val Lys Lys Val Pro Thr
180 185 190
Gly Met Lys Ala Ala Val Glu Val Ala Pro Gln Gly Lys Ile Ile Glu
195 200 205
Val Ile Ser Asn Pro Tyr Gly Ile Ala Thr Ile Phe Ser Leu Thr Ser
210 215 220
Glu Glu Thr Lys Lys Ile Val Pro Ile Ser Lys Ala Leu Ile Gly Asn
225 230 235 240
Arg Ser Gly Val Val Ile Lys Thr Pro His Gly Asp Val Lys Glu Lys
245 250 255
Val Ile Pro Ala Gly Arg Ile Gln Ile Asp Gly Asn Tyr Arg Ser Lys
260 265 270
Ser Val Asn Ile Glu Glu Gly Ser Lys Arg Ile Met Lys Ala Leu Gly
275 280 285
Ser Ile Glu His Val Gln Asp Ile Asn Gly Glu Ser Gly Thr Asn Ile
290 295 300
Gly Gly Met Leu Lys Asn Val Lys Ser Val Met Gly Asn Phe Thr Asn
305 310 315 320
Glu Ser Ile Asp Asn Ile Lys Ile Lys Asp Ile Leu Ala Val Asp Thr
325 330 335
Phe Val Pro Gln Lys Ile Lys Gly Gly Ile Ala Glu Glu Phe Val Phe
340 345 350
Glu Asn Ala Val Gly Ile Ala Ala Met Val Asn Thr Lys Lys Asn Gln
355 360 365
Met Ser Glu Val Ala Lys Glu Ile Glu Lys Glu Leu Gly Val Lys Val
370 375 380
Glu Val Gly Gly Val Glu Ala Asp Met Ala Ile Thr Gly Ala Leu Thr
385 390 395 400
Thr Pro Gly Thr Gly Thr Pro Leu Val Ile Val Asp Ile Gly Ala Gly
405 410 415
Ser Thr Asp Ala Cys Ser Ile Asp Arg Tyr Gly Asn Lys Glu Leu Val
420 425 430
His Leu Ala Gly Ala Gly Asn Met Thr Thr Leu Leu Ile Gln Lys Glu
435 440 445
Leu Gly Ile Glu Asp Phe Asn Leu Ala Glu Asp Ile Lys Lys Tyr Pro
450 455 460
Leu Ala Lys Val Glu Ser Leu Phe Tyr Ile Arg His Glu Asp Gly Asn
465 470 475 480
Val Gln Phe Phe Glu Asn Ser Leu Ser Pro Lys Val Phe Ala Lys Asn
485 490 495
Val Leu Ile Lys Glu Gly Glu Leu Ile Pro Ile Asp Leu Asp Met Ser
500 505 510
Leu Glu Lys Ile Arg Ile Ile Arg Arg Ser Ala Lys Arg Lys Ile Phe
515 520 525
Ile Thr Asn Val Leu Arg Ser Leu Arg Lys Val Ser His Thr Lys Asn
530 535 540
Ile Arg Asp Phe Glu Phe Val Val Ile Val Gly Gly Ser Ala Leu Asp
545 550 555 560
Phe Glu Ile Ser Gln Met Ile Thr Glu Ala Leu Ser Glu Tyr Gly Ile
565 570 575
Val Ala Gly Cys Gly Asn Ile Arg Gly Thr Glu Gly Pro Arg Asn Ala
580 585 590
Val Ala Thr Gly Leu Val Met Gly Val Asn Asp Gly Gln Gln Ala
595 600 605
<210> 220
<211> 107
<212> PRT
<213> Ilyobacter polytropus gdrB
<400> 220
Met Asp Asn Arg Pro Asn Ile Thr Leu Phe Cys Ser Asp Asn Ile Asp
1 5 10 15
Arg Glu Tyr Ile Asn Glu Ile Leu Trp Gly Ile Glu Glu Glu Glu Ile
20 25 30
Pro Tyr Leu Leu Lys Ile Val Pro Ser Lys Glu Val Val Lys Glu Asn
35 40 45
Tyr Val Ser Gly Thr Leu Glu Ile Gly Ile Gly Val Leu Glu Asn Gly
50 55 60
Asp Ala Leu Leu Thr Thr Arg Lys Tyr Asp Lys Glu Tyr Ile Gln Lys
65 70 75 80
Ala Asn Ile Phe Val Glu Lys Asn Lys Leu Arg Asp Leu Gly Ser Asn
85 90 95
Gly Ala Arg Leu Val Lys Gly Leu Pro Leu Arg
100 105
<210> 221
<211> 1824
<212> DNA
<213> Ilyobacter polytropus gdrA gene
<400> 221
atgaagatca tagtgggtgt agatattgga aatgctacaa cagaagtagc tttggcaaag 60
gtagacaata tagaatgtaa gtttttatcc agtgccttac atgaaacaac aggtttaaaa 120
ggtactaaag ataatgtttt gggaataaaa agagccatta agaaggcaat gaaaagagct 180
gatttaaaaa atgcagattt atctttaatc aggataaatg aagctactcc tgttatagga 240
gacgtttcta tggaaactat aacagaaaca ataattacag agtctactat gattggacat 300
aacccttcaa ctcctggggg aataggtctt gggataggag aaacaatcct attccaagag 360
cttggaaatt ttgaaaatga taaagattac atagtaatag tggaaaaaag tttcagcttc 420
ttagaggtag ctcacagaat caatgaagct tttaaaaatg gatgcaaaat aaagggtgct 480
attattcaaa aagatgatgg ggttctcata aataacagac tcataaataa aatccccata 540
gttgatgagg tactttttgt taaaaaagta cctacaggga tgaaggctgc tgtagaagta 600
gctccacagg gaaaaataat agaggttatt tcaaatccat atggcattgc cacaattttt 660
tccctcactt cagaagagac taaaaaaata gttcctattt ctaaagcact tataggcaac 720
aggtctggag tagttatcaa gacacctcac ggagatgtaa aagagaaggt tatccctgct 780
ggaaggatac agattgacgg aaactacagg tcaaaaagtg taaatataga agagggttcc 840
aaaagaataa tgaaagccct gggaagtatt gagcatgtcc aagatataaa tggagaatct 900
ggaaccaata tcggaggaat gctaaaaaat gtaaaaagtg taatggggaa tttcaccaat 960
gagtccattg ataatataaa aataaaagac atattggcag tagatacctt tgtcccacaa 1020
aagataaagg ggggaattgc agaagaattt gtatttgaaa atgctgtagg aatagctgca 1080
atggtaaata ccaaaaaaaa tcaaatgtcc gaagtagcga aagagattga aaaagaactg 1140
ggagtaaaag tagaagtagg aggagtagag gcagatatgg ctataaccgg tgctctaact 1200
actccaggca caggaacacc tctggtaatt gtagatatag gagcaggttc gacagatgca 1260
tgttccattg acagatatgg aaataaagaa ctggttcatc tggccggagc tggtaatatg 1320
acaacacttc ttattcaaaa agagctgggt atagaggatt ttaatcttgc tgaagatata 1380
aaaaaatatc ctctggcaaa agtagaatct ctattttata taagacacga ggatggaaat 1440
gttcaatttt ttgaaaactc tctttctccg aaagtatttg ctaaaaatgt ccttataaaa 1500
gaaggtgaac ttattccaat cgaccttgat atgtctctgg aaaaaatcag aattatcaga 1560
aggtctgcca aaagaaaaat ttttataacc aatgtactta gatcattaag gaaagtttct 1620
catacaaaaa atattaggga ttttgaattt gtagttattg ttggaggatc tgcattggat 1680
tttgaaatat ctcagatgat aactgaagct ttatctgagt atggaatagt agcaggatgc 1740
ggaaatataa gaggaacaga gggccctaga aatgctgtag ccactggact tgtaatgggg 1800
gtgaatgatg gacaacaggc ctaa 1824
<210> 222
<211> 324
<212> DNA
<213> Ilyobacter polytropus gdrB gene
<400> 222
atggacaaca ggcctaatat aacattattt tgctcagata atattgacag ggaatatatt 60
aatgaaattt tgtggggtat agaggaggaa gagataccat atcttctgaa aattgtacct 120
tctaaagaag ttgtcaaaga aaattatgtt tcaggaactc tagagatagg tatcggagta 180
ttagaaaatg gcgacgccct tctaacaaca aggaagtacg ataaggaata tatacaaaag 240
gcaaacattt ttgtagaaaa aaataaattg agagatttag gaagcaacgg agcaagactt 300
gtaaagggtc tgccacttag ataa 324
<210> 223
<211> 1824
<212> DNA
<213> K.pneumoniae gdrA gene
<400> 223
atgccgttaa tagccgggat tgatatcggc aacgccacca ccgaggtggc gctggcgtcc 60
gacgacccgc aggcgagggc gtttgttgcc agcgggatcg tcgcgacgac gggcatgaaa 120
gggacgcggg acaatatcgc cgggaccctc gccgcgctgg agcaggccct ggcgaaaaca 180
ccgtggtcgg tgagcgatgt ctctcgcatc tatcttaacg aagccgcgcc ggtgattggc 240
gatgtggcga tggagaccat caccgagacc attatcaccg aatcgaccat gatcggtcat 300
aacccgcaga cgccgggcgg ggtgggcgtt ggcgtgggga cgactatcgc cctcgggcgg 360
ctggcgacgc tgccggcggc gcagtatgcc gaggggtgga tcgtactgat tgacgacgcc 420
gtcgatttcc ttgacgccgt gtggtggctc aatgaggcgc tcgaccgggg gatcaacgtg 480
gtggcggcga tcctcaaaaa ggacgacggc gtgctggtga acaaccgcct gcgtaaaacc 540
ctgccggtgg tagatgaagt gacgctgctg gagcaggtcc ccgagggggt aatggcggcg 600
gtggaagtgg ccgcgccggg ccaggtggtg cggatcctgt cgaatcccta cgggatcgcc 660
accttcttcg ggctaagccc ggaagagacc caggccatcg tccccatcgc ccgcgccctg 720
attggcaacc gttcagcggt ggtgctcaag accccgcagg gggatgtgca gtcgcgggtg 780
atcccggcgg gcaacctcta cattagcggc gaaaagcgcc gcggagaggc cgatgtcgcc 840
gagggcgcgg aagccatcat gcaggcgatg agcgcctgcg ctccggtacg cgacatccgc 900
ggcgaaccgg gcactcacgc cggcggcatg cttgagcggg tgcgcaaggt aatggcgtcc 960
ctgaccgacc atgagatgag cgcgatatac atccaggatc tgctggcggt ggatacgttt 1020
attccgcgca aggtgcaggg cgggatggcc ggcgagtgcg ccatggaaaa tgccgtcggg 1080
atggcggcga tggtgaaagc ggatcgtctg caaatgcagg ttatcgcccg cgaactgagc 1140
gcccgactgc agaccgaggt ggtggtgggc ggcgtggagg ccaacatggc catcgccggg 1200
gcgttaacca ctcccggctg tgcggcgccg ctggcgatcc tcgacctcgg cgccggctcg 1260
acggatgcgg cgatcgtcaa cgcggagggg cagataacgg cggtccatct cgccggggcg 1320
gggaatatgg tcagcctgtt gattaaaacc gagctgggcc tcgaggatct ttcgctggcg 1380
gaagcgataa aaaaataccc gctggccaaa gtggaaagcc tgttcagtat tcgtcacgag 1440
aatggcgcgg tggagttctt tcgggaagcc ctcagcccgg cggtgttcgc caaagtggtg 1500
tacatcaagg agggcgaact ggtgccgatc gataacgcca gcccgctgga aaaaattcgt 1560
ctcgtgcgcc ggcaggcgaa agagaaagtg tttgtcacca actgcctgcg cgcgctgcgc 1620
caggtctcac ccggcggttc cattcgcgat atcgcctttg tggtgctggt gggcggctca 1680
tcgctggact ttgagatccc gcagcttatc acggaagcct tgtcgcacta tggcgtggtc 1740
gccgggcagg gcaatattcg gggaacagaa gggccgcgca acgcggtcgc caccgggctg 1800
ctactggccg gtcaggcgaa ttaa 1824
<210> 224
<211> 354
<212> DNA
<213> K.pneumoniae gdrB gene
<400> 224
atgtcgcttt caccgccagg cgtacgcctg ttttacgatc cgcgcgggca ccatgccggc 60
gccatcaatg agctgtgctg ggggctggag gagcaggggg tcccctgcca gaccataacc 120
tatgacggag gcggtgacgc cgctgcgctg ggcgccctgg cggccagaag ctcgcccctg 180
cgggtgggta ttgggctcag cgcgtccggc gagatagccc tcactcatgc ccagctgccg 240
gcggacgcgc cgctggctac cggacacgtc accgatagcg acgatcatct gcgtacgctc 300
ggcgccaacg ccgggcagct ggttaaagtc ctgccgttaa gtgagagaaa ctga 354
<210> 225
<211> 314
<212> PRT
<213> Corynebacterium glutamicum LDH
<400> 225
Met Lys Glu Thr Val Gly Asn Lys Ile Val Leu Ile Gly Ala Gly Asp
1 5 10 15
Val Gly Val Ala Tyr Ala Tyr Ala Leu Ile Asn Gln Gly Met Ala Asp
20 25 30
His Leu Ala Ile Ile Asp Ile Asp Glu Lys Lys Leu Glu Gly Asn Val
35 40 45
Met Asp Leu Asn His Gly Val Val Trp Ala Asp Ser Arg Thr Arg Val
50 55 60
Thr Lys Gly Thr Tyr Ala Asp Cys Glu Asp Ala Ala Met Val Val Ile
65 70 75 80
Cys Ala Gly Ala Ala Gln Lys Pro Gly Glu Thr Arg Leu Gln Leu Val
85 90 95
Asp Lys Asn Val Lys Ile Met Lys Ser Ile Val Gly Asp Val Met Asp
100 105 110
Ser Gly Phe Asp Gly Ile Phe Leu Val Ala Ser Asn Pro Val Asp Ile
115 120 125
Leu Thr Tyr Ala Val Trp Lys Phe Ser Gly Leu Glu Trp Asn Arg Val
130 135 140
Ile Gly Ser Gly Thr Val Leu Asp Ser Ala Arg Phe Arg Tyr Met Leu
145 150 155 160
Gly Glu Leu Tyr Glu Val Ala Pro Ser Ser Val His Ala Tyr Ile Ile
165 170 175
Gly Glu His Gly Asp Thr Glu Leu Pro Val Leu Ser Ser Ala Thr Ile
180 185 190
Ala Gly Val Ser Leu Ser Arg Met Leu Asp Lys Asp Pro Glu Leu Glu
195 200 205
Gly Arg Leu Glu Lys Ile Phe Glu Asp Thr Arg Asp Ala Ala Tyr His
210 215 220
Ile Ile Asp Ala Lys Gly Ser Thr Ser Tyr Gly Ile Gly Met Gly Leu
225 230 235 240
Ala Arg Ile Thr Arg Ala Ile Leu Gln Asn Gln Asp Val Ala Val Pro
245 250 255
Val Ser Ala Leu Leu His Gly Glu Tyr Gly Glu Glu Asp Ile Tyr Ile
260 265 270
Gly Thr Pro Ala Val Val Asn Arg Arg Gly Ile Arg Arg Val Val Glu
275 280 285
Leu Glu Ile Thr Asp His Glu Met Glu Arg Phe Lys His Ser Ala Asn
290 295 300
Thr Leu Arg Glu Ile Gln Lys Gln Phe Phe
305 310
<210> 226
<211> 3267
<212> DNA
<213> Artificial Sequence
<220>
<223> pKD4 vector
<400> 226
agattgcagc attacacgtc ttgagcgatt gtgtaggctg gagctgcttc gaagttccta 60
tactttctag agaataggaa cttcggaata ggaacttcaa gatcccctca cgctgccgca 120
agcactcagg gcgcaagggc tgctaaagga agcggaacac gtagaaagcc agtccgcaga 180
aacggtgctg accccggatg aatgtcagct actgggctat ctggacaagg gaaaacgcaa 240
gcgcaaagag aaagcaggta gcttgcagtg ggcttacatg gcgatagcta gactgggcgg 300
ttttatggac agcaagcgaa ccggaattgc cagctggggc gccctctggt aaggttggga 360
agccctgcaa agtaaactgg atggctttct tgccgccaag gatctgatgg cgcaggggat 420
caagatctga tcaagagaca ggatgaggat cgtttcgcat gattgaacaa gatggattgc 480
acgcaggttc tccggccgct tgggtggaga ggctattcgg ctatgactgg gcacaacaga 540
caatcggctg ctctgatgcc gccgtgttcc ggctgtcagc gcaggggcgc ccggttcttt 600
ttgtcaagac cgacctgtcc ggtgccctga atgaactgca ggacgaggca gcgcggctat 660
cgtggctggc cacgacgggc gttccttgcg cagctgtgct cgacgttgtc actgaagcgg 720
gaagggactg gctgctattg ggcgaagtgc cggggcagga tctcctgtca tctcaccttg 780
ctcctgccga gaaagtatcc atcatggctg atgcaatgcg gcggctgcat acgcttgatc 840
cggctacctg cccattcgac caccaagcga aacatcgcat cgagcgagca cgtactcgga 900
tggaagccgg tcttgtcgat caggatgatc tggacgaaga gcatcagggg ctcgcgccag 960
ccgaactgtt cgccaggctc aaggcgcgca tgcccgacgg cgaggatctc gtcgtgaccc 1020
atggcgatgc ctgcttgccg aatatcatgg tggaaaatgg ccgcttttct ggattcatcg 1080
actgtggccg gctgggtgtg gcggaccgct atcaggacat agcgttggct acccgtgata 1140
ttgctgaaga gcttggcggc gaatgggctg accgcttcct cgtgctttac ggtatcgccg 1200
ctcccgattc gcagcgcatc gccttctatc gccttcttga cgagttcttc tgagcgggac 1260
tctggggttc gaaatgaccg accaagcgac gcccaacctg ccatcacgag atttcgattc 1320
caccgccgcc ttctatgaaa ggttgggctt cggaatcgtt ttccgggacg ccggctggat 1380
gatcctccag cgcggggatc tcatgctgga gttcttcgcc caccccagct tcaaaagcgc 1440
tctgaagttc ctatactttc tagagaatag gaacttcgga ataggaacta aggaggatat 1500
tcatatggac catggctaat tcccatgtca gccgttaagt gttcctgtgt cactgaaaat 1560
tgctttgaga ggctctaagg gcttctcagt gcgttacatc cctggcttgt tgtccacaac 1620
cgttaaacct taaaagcttt aaaagcctta tatattcttt tttttcttat aaaacttaaa 1680
accttagagg ctatttaagt tgctgattta tattaatttt attgttcaaa catgagagct 1740
tagtacgtga aacatgagag cttagtacgt tagccatgag agcttagtac gttagccatg 1800
agggtttagt tcgttaaaca tgagagctta gtacgttaaa catgagagct tagtacgtga 1860
aacatgagag cttagtacgt actatcaaca ggttgaactg cggatcttgc ggccgcaaaa 1920
attaaaaatg aagttttaaa tcaatctaaa gtatatatga gtaaacttgg tctgacagtt 1980
accaatgctt aatcagtgag gcacctatct cagcgatctg tctatttcgt tcatccatag 2040
ttgcctgact ccccgtcgtg tagataacta cgatacggga gggcttacca tctggcccca 2100
gtgctgcaat gataccgcga gacccacgct caccggctcc agatttatca gcaataaacc 2160
agccagccgg aagggccgag cgcagaagtg gtcctgcaac tttatccgcc tccatccagt 2220
ctattaattg ttgccgggaa gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg 2280
ttgttgccat tgctacaggc atcgtggtgt cacgctcgtc gtttggtatg gcttcattca 2340
gctccggttc ccaacgatca aggcgagtta catgatcccc catgttgtgc aaaaaagcgg 2400
ttagctcctt cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg ttatcactca 2460
tggttatggc agcactgcat aattctctta ctgtcatgcc atccgtaaga tgcttttctg 2520
tgactggtga gtactcaacc aagtcattct gagaatagtg tatgcggcga ccgagttgct 2580
cttgcccggc gtcaatacgg gataataccg cgccacatag cagaacttta aaagtgctca 2640
tcattggaaa acgttcttcg gggcgaaaac tctcaaggat cttaccgctg ttgagatcca 2700
gttcgatgta acccactcgt gcacccaact gatcttcagc atcttttact ttcaccagcg 2760
tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata agggcgacac 2820
ggaaatgttg aatactcata ctcttccttt ttcaatatta ttgaagcatt tatcagggtt 2880
attgtctcat gagcggatac atatttgaat gtatttagaa aaataaacaa ataggggttc 2940
cgcgcacatt tccccgaaaa gtgccacctg catcgatggc cccccgatgg tagtgtgggg 3000
tctccccatg cgagagtagg gaactgccag gcatcaaata aaacgaaagg ctcagtcgaa 3060
agactgggcc tttcgtttta tctgttgttt gtcggtgaac gctctcctga gtaggacaaa 3120
tccgccggga gcggatttga acgttgcgaa gcaacggccc ggagggtggc gggcaggacg 3180
cccgccataa actgccaggc atcaaattaa gcagaaggcc atcctgacgg atggcctttt 3240
tgcgtggcca gtgccaagct tgcatgc 3267
<210> 227
<211> 58
<212> DNA
<213> Artificial Sequence
<220>
<223> ackAKF primer
<400> 227
cgtagtgatc gatgagtctg ttattcaggg tatcaaaggt gtaggctgga gctgcttc 58
<210> 228
<211> 56
<212> DNA
<213> Artificial Sequence
<220>
<223> ackAKR primer
<400> 228
caatccctgc acccagttct acaccctgag acgctgattc cggggatccg tcgacc 56
<210> 229
<211> 6329
<212> DNA
<213> Artificial Sequence
<220>
<223> pKD46 vector
<400> 229
catcgattta ttatgacaac ttgacggcta catcattcac tttttcttca caaccggcac 60
ggaactcgct cgggctggcc ccggtgcatt ttttaaatac ccgcgagaaa tagagttgat 120
cgtcaaaacc aacattgcga ccgacggtgg cgataggcat ccgggtggtg ctcaaaagca 180
gcttcgcctg gctgatacgt tggtcctcgc gccagcttaa gacgctaatc cctaactgct 240
ggcggaaaag atgtgacaga cgcgacggcg acaagcaaac atgctgtgcg acgctggcga 300
tatcaaaatt gctgtctgcc aggtgatcgc tgatgtactg acaagcctcg cgtacccgat 360
tatccatcgg tggatggagc gactcgttaa tcgcttccat gcgccgcagt aacaattgct 420
caagcagatt tatcgccagc agctccgaat agcgcccttc cccttgcccg gcgttaatga 480
tttgcccaaa caggtcgctg aaatgcggct ggtgcgcttc atccgggcga aagaaccccg 540
tattggcaaa tattgacggc cagttaagcc attcatgcca gtaggcgcgc ggacgaaagt 600
aaacccactg gtgataccat tcgcgagcct ccggatgacg accgtagtga tgaatctctc 660
ctggcgggaa cagcaaaata tcacccggtc ggcaaacaaa ttctcgtccc tgatttttca 720
ccaccccctg accgcgaatg gtgagattga gaatataacc tttcattccc agcggtcggt 780
cgataaaaaa atcgagataa ccgttggcct caatcggcgt taaacccgcc accagatggg 840
cattaaacga gtatcccggc agcaggggat cattttgcgc ttcagccata cttttcatac 900
tcccgccatt cagagaagaa accaattgtc catattgcat cagacattgc cgtcactgcg 960
tcttttactg gctcttctcg ctaaccaaac cggtaacccc gcttattaaa agcattctgt 1020
aacaaagcgg gaccaaagcc atgacaaaaa cgcgtaacaa aagtgtctat aatcacggca 1080
gaaaagtcca cattgattat ttgcacggcg tcacactttg ctatgccata gcatttttat 1140
ccataagatt agcggatcct acctgacgct ttttatcgca actctctact gtttctccat 1200
acccgttttt ttgggaattc gagctctaag gaggttataa aaaatggata ttaatactga 1260
aactgagatc aagcaaaagc attcactaac cccctttcct gttttcctaa tcagcccggc 1320
atttcgcggg cgatattttc acagctattt caggagttca gccatgaacg cttattacat 1380
tcaggatcgt cttgaggctc agagctgggc gcgtcactac cagcagctcg cccgtgaaga 1440
gaaagaggca gaactggcag acgacatgga aaaaggcctg ccccagcacc tgtttgaatc 1500
gctatgcatc gatcatttgc aacgccacgg ggccagcaaa aaatccatta cccgtgcgtt 1560
tgatgacgat gttgagtttc aggagcgcat ggcagaacac atccggtaca tggttgaaac 1620
cattgctcac caccaggttg atattgattc agaggtataa aacgaatgag tactgcactc 1680
gcaacgctgg ctgggaagct ggctgaacgt gtcggcatgg attctgtcga cccacaggaa 1740
ctgatcacca ctcttcgcca gacggcattt aaaggtgatg ccagcgatgc gcagttcatc 1800
gcattactga tcgttgccaa ccagtacggc cttaatccgt ggacgaaaga aatttacgcc 1860
tttcctgata agcagaatgg catcgttccg gtggtgggcg ttgatggctg gtcccgcatc 1920
atcaatgaaa accagcagtt tgatggcatg gactttgagc aggacaatga atcctgtaca 1980
tgccggattt accgcaagga ccgtaatcat ccgatctgcg ttaccgaatg gatggatgaa 2040
tgccgccgcg aaccattcaa aactcgcgaa ggcagagaaa tcacggggcc gtggcagtcg 2100
catcccaaac ggatgttacg tcataaagcc atgattcagt gtgcccgtct ggccttcgga 2160
tttgctggta tctatgacaa ggatgaagcc gagcgcattg tcgaaaatac tgcatacact 2220
gcagaacgtc agccggaacg cgacatcact ccggttaacg atgaaaccat gcaggagatt 2280
aacactctgc tgatcgccct ggataaaaca tgggatgacg acttattgcc gctctgttcc 2340
cagatatttc gccgcgacat tcgtgcatcg tcagaactga cacaggccga agcagtaaaa 2400
gctcttggat tcctgaaaca gaaagccgca gagcagaagg tggcagcatg acaccggaca 2460
ttatcctgca gcgtaccggg atcgatgtga gagctgtcga acagggggat gatgcgtggc 2520
acaaattacg gctcggcgtc atcaccgctt cagaagttca caacgtgata gcaaaacccc 2580
gctccggaaa gaagtggcct gacatgaaaa tgtcctactt ccacaccctg cttgctgagg 2640
tttgcaccgg tgtggctccg gaagttaacg ctaaagcact ggcctgggga aaacagtacg 2700
agaacgacgc cagaaccctg tttgaattca cttccggcgt gaatgttact gaatccccga 2760
tcatctatcg cgacgaaagt atgcgtaccg cctgctctcc cgatggttta tgcagtgacg 2820
gcaacggcct tgaactgaaa tgcccgttta cctcccggga tttcatgaag ttccggctcg 2880
gtggtttcga ggccataaag tcagcttaca tggcccaggt gcagtacagc atgtgggtga 2940
cgcgaaaaaa tgcctggtac tttgccaact atgacccgcg tatgaagcgt gaaggcctgc 3000
attatgtcgt gattgagcgg gatgaaaagt acatggcgag ttttgacgag atcgtgccgg 3060
agttcatcga aaaaatggac gaggcactgg ctgaaattgg ttttgtattt ggggagcaat 3120
ggcgatgacg catcctcacg ataatatccg ggtaggcgca atcactttcg tctactccgt 3180
tacaaagcga ggctgggtat ttcccggcct ttctgttatc cgaaatccac tgaaagcaca 3240
gcggctggct gaggagataa ataataaacg aggggctgta tgcacaaagc atcttctgtt 3300
gagttaagaa cgagtatcga gatggcacat agccttgctc aaattggaat caggtttgtg 3360
ccaataccag tagaaacaga cgaagaatcc atgggtatgg acagttttcc ctttgatatg 3420
taacggtgaa cagttgttct acttttgttt gttagtcttg atgcttcact gatagataca 3480
agagccataa gaacctcaga tccttccgta tttagccagt atgttctcta gtgtggttcg 3540
ttgtttttgc gtgagccatg agaacgaacc attgagatca tacttacttt gcatgtcact 3600
caaaaatttt gcctcaaaac tggtgagctg aatttttgca gttaaagcat cgtgtagtgt 3660
ttttcttagt ccgttacgta ggtaggaatc tgatgtaatg gttgttggta ttttgtcacc 3720
attcattttt atctggttgt tctcaagttc ggttacgaga tccatttgtc tatctagttc 3780
aacttggaaa atcaacgtat cagtcgggcg gcctcgctta tcaaccacca atttcatatt 3840
gctgtaagtg tttaaatctt tacttattgg tttcaaaacc cattggttaa gccttttaaa 3900
ctcatggtag ttattttcaa gcattaacat gaacttaaat tcatcaaggc taatctctat 3960
atttgccttg tgagttttct tttgtgttag ttcttttaat aaccactcat aaatcctcat 4020
agagtatttg ttttcaaaag acttaacatg ttccagatta tattttatga atttttttaa 4080
ctggaaaaga taaggcaata tctcttcact aaaaactaat tctaattttt cgcttgagaa 4140
cttggcatag tttgtccact ggaaaatctc aaagccttta accaaaggat tcctgatttc 4200
cacagttctc gtcatcagct ctctggttgc tttagctaat acaccataag cattttccct 4260
actgatgttc atcatctgag cgtattggtt ataagtgaac gataccgtcc gttctttcct 4320
tgtagggttt tcaatcgtgg ggttgagtag tgccacacag cataaaatta gcttggtttc 4380
atgctccgtt aagtcatagc gactaatcgc tagttcattt gctttgaaaa caactaattc 4440
agacatacat ctcaattggt ctaggtgatt ttaatcacta taccaattga gatgggctag 4500
tcaatgataa ttactagtcc ttttcctttg agttgtgggt atctgtaaat tctgctagac 4560
ctttgctgga aaacttgtaa attctgctag accctctgta aattccgcta gacctttgtg 4620
tgtttttttt gtttatattc aagtggttat aatttataga ataaagaaag aataaaaaaa 4680
gataaaaaga atagatccca gccctgtgta taactcacta ctttagtcag ttccgcagta 4740
ttacaaaagg atgtcgcaaa cgctgtttgc tcctctacaa aacagacctt aaaaccctaa 4800
aggcttaagt agcaccctcg caagctcggt tgcggccgca atcgggcaaa tcgctgaata 4860
ttccttttgt ctccgaccat caggcacctg agtcgctgtc tttttcgtga cattcagttc 4920
gctgcgctca cggctctggc agtgaatggg ggtaaatggc actacaggcg ccttttatgg 4980
attcatgcaa ggaaactacc cataatacaa gaaaagcccg tcacgggctt ctcagggcgt 5040
tttatggcgg gtctgctatg tggtgctatc tgactttttg ctgttcagca gttcctgccc 5100
tctgattttc cagtctgacc acttcggatt atcccgtgac aggtcattca gactggctaa 5160
tgcacccagt aaggcagcgg tatcatcaac ggggtctgac gctcagtgga acgaaaactc 5220
acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga tccttttaaa 5280
ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt ctgacagtta 5340
ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt catccatagt 5400
tgcctgactc cccgtcgtgt agataactac gatacgggag ggcttaccat ctggccccag 5460
tgctgcaatg ataccgcgag acccacgctc accggctcca gatttatcag caataaacca 5520
gccagccgga agggccgagc gcagaagtgg tcctgcaact ttatccgcct ccatccagtc 5580
tattaattgt tgccgggaag ctagagtaag tagttcgcca gttaatagtt tgcgcaacgt 5640
tgttgccatt gctacaggca tcgtggtgtc acgctcgtcg tttggtatgg cttcattcag 5700
ctccggttcc caacgatcaa ggcgagttac atgatccccc atgttgtgca aaaaagcggt 5760
tagctccttc ggtcctccga tcgttgtcag aagtaagttg gccgcagtgt tatcactcat 5820
ggttatggca gcactgcata attctcttac tgtcatgcca tccgtaagat gcttttctgt 5880
gactggtgag tactcaacca agtcattctg agaatagtgt atgcggcgac cgagttgctc 5940
ttgcccggcg tcaatacggg ataataccgc gccacatagc agaactttaa aagtgctcat 6000
cattggaaaa cgttcttcgg ggcgaaaact ctcaaggatc ttaccgctgt tgagatccag 6060
ttcgatgtaa cccactcgtg cacccaactg atcttcagca tcttttactt tcaccagcgt 6120
ttctgggtga gcaaaaacag gaaggcaaaa tgccgcaaaa aagggaataa gggcgacacg 6180
gaaatgttga atactcatac tcttcctttt tcaatattat tgaagcattt atcagggtta 6240
ttgtctcatg agcggataca tatttgaatg tatttagaaa aataaacaaa taggggttcc 6300
gcgcacattt ccccgaaaag tgccacctg 6329
<210> 230
<211> 9332
<212> DNA
<213> Artificial Sequence
<220>
<223> pCP20 vector
<400> 230
gagacacaac gtggctttgt tgaataaatc gaacttttgc tgagttgaag gatcagatca 60
cgcatcttcc cgacaacgca gaccgttccg tggcaaagca aaagttcaaa atcaccaact 120
ggtccaccta caacaaagct ctcatcaacc gtggctccct cactttctgg ctggatgatg 180
gggcgattca ggcctggtat gagtcagcaa caccttcttc acgaggcaga cctcagcgcc 240
acaggtgcgg ttgctggcgc taaccgtttt tatcaggctc tgggaggcag aataaatgat 300
catatcgtca attattacct ccacggggag agcctgagca aactggcctc aggcatttga 360
gaagcacacg gtcacactgc ttccggtagt caataaaccg gtaaaccagc aatagacata 420
agcggctatt taacgaccct gccctgaacc gacgaccggg tcgaatttgc tttcgaattt 480
ctgccattca tccgcttatt atcacttatt caggcgtagc aaccaggcgt ttaagggcac 540
caataactgc cttaaaaaaa ttacgccccg ccctgccact catcgcagta ctgttgtaat 600
tcattaagca ttctgccgac atggaagcca tcacaaacgg catgatgaac ctgaatcgcc 660
agcggcatca gcaccttgtc gccttgcgta taatatttgc ccatggtgaa aacgggggcg 720
aagaagttgt ccatattggc cacgtttaaa tcaaaactgg tgaaactcac ccagggattg 780
gctgagacga aaaacatatt ctcaataaac cctttaggga aataggccag gttttcaccg 840
taacacgcca catcttgcga atatatgtgt agaaactgcc ggaaatcgtc gtggtattca 900
ctccagagcg atgaaaacgt ttcagtttgc tcatggaaaa cggtgtaaca agggtgaaca 960
ctatcccata tcaccagctc accgtctttc attgccatac ggaattccgg atgagcattc 1020
atcaggcggg caagaatgtg aataaaggcc ggataaaact tgtgcttatt tttctttacg 1080
gtctttaaaa aggccgtaat atccagctga acggtctggt tataggtaca ttgagcaact 1140
gactgaaatg cctcaaaatg ttctttacga tgccattggg atatatcaac ggtggtatat 1200
ccagtgattt ttttctccat tttagcttcc ttagctcctg aaaatctcga taactcaaaa 1260
aatacgcccg gtagtgatct tatttcatta tggtgaaagt tggaacctct tacgtgccga 1320
tcaacgtctc attttcgcca aaagttggcc cagggcttcc cggtatcaac agggacacca 1380
ggatttattt attctgcgaa gtgatcttcc gtcacaggta tttattcggc gcaaagtgcg 1440
tcgggtgatg ctgccaactt actgatttag tgtatgatgg tgtttttgag gtgctccagt 1500
ggcttctgtt tctatcagct gtccctcctg ttcagctact gacggggtgg tgcgtaacgg 1560
caaaagcacc gccggacatc agcgcttgtt tcggcgtggg tatggtggca ggccccgtgg 1620
ccgggggact gttgggcgcc tgtagtgcca tttaccccca ttcactgcca gagccgtgag 1680
cgcagcgaac tgaatgtcac gaaaaagaca gcgactcagg tgcctgatgg tcggagacaa 1740
aaggaatatt cagcgatttg cccgagcttg cgagggtgct acttaagcct ttagggtttt 1800
aaggtctgtt ttgtagagga gcaaacagcg tttgcgacat ccttttgtaa tactgcggaa 1860
ctgactaaag tagtgagtta tacacagggc tgggatctat tctttttatc tttttttatt 1920
ctttctttat tctataaatt ataaccactt gaatataaac aaaaaaaaca cacaaaggtc 1980
tagcggaatt tacagagggt ctagcagaat ttacaagttt tccagcaaag gtctagcaga 2040
atttacagat acccacaact caaaggaaaa ggactagtaa ttatcattga ctagcccatc 2100
tcaattggta tagtgattaa aatcacctag accaattgag atgtatgtct gaattagttg 2160
ttttcaaagc aaatgaacta gcgattagtc gctatgactt aacggagcat gaaaccaagc 2220
taattttatg ctgtgtggca ctactcaacc ccacgattga aaaccctaca aggaaagaac 2280
ggacggtatc gttcacttat aaccaatacg ttcagatgat gaacatcagt agggaaaatg 2340
cttatggtgt attagctaaa gcaaccagag agctgatgac gagaactgtg gaaatcagga 2400
atcctttggt taaaggcttt gagattttcc agtggacaaa ctatgccaag ttctcaagcg 2460
aaaaattaga attagttttt agtgaagaga tattgcctta tcttttccag ttaaaaaaat 2520
tcataaaata taatctggaa catgttaagt cttttgaaaa caaatactct atgaggattt 2580
atgagtggtt attaaaagaa ctaacacaaa agaaaactca caaggcaaat atagagatta 2640
gccttgatga atttaagttc atgttaatgc ttgaaaataa ctaccatgag tttaaaaggc 2700
ttaaccaatg ggttttgaaa ccaataagta aagatttaaa cacttacagc aatatgaaat 2760
tggtggttga taagcgaggc cgcccgactg atacgttgat tttccaagtt gaactagata 2820
gacaaatgga tctcgtaacc gaacttgaga acaaccagat aaaaatgaat ggtgacaaaa 2880
taccaacaac cattacatca gattcctacc tacataacgg actaagaaaa acactacacg 2940
atgctttaac tgcaaaaatt cagctcacca gttttgaggc aaaatttttg agtgacatgc 3000
aaagtaagta tgatctcaat ggttcgttct catggctcac gcaaaaacaa cgaaccacac 3060
tagagaacat actggctaaa tacggaagga tctgaggttc ttatggctct tgtatctatc 3120
agtgaagcat caagactaac aaacaaaagt agaacaactg ttcaccgtta catatcaaag 3180
ggaaaactgt ccatatgcac agatgaaaac ggtgtaaaaa agatagatac atcagagctt 3240
ttacgagttt ttggtgcatt taaagctgtt caccatgaac agatcgacaa tgtaacagat 3300
gaacagcatg taacacctaa tagaacaggt gaaaccagta aaacaaagca actagaacat 3360
gaaattgaac acctgagaca acttgttaca gctcaacagt cacacataga cagcctgaaa 3420
caggcgatgc tgcttatcga atcaaagctg ccgacaacac gggagccagt gacgcctccc 3480
gtggggaaaa aatcatggca attctggaag aaatagcgcc tgtttcgttt caggcaggtt 3540
atcagggagt gtcagcgtcc tgcggttctc cggggcgttc gggtcatgca gcccgtaatg 3600
gtgatttacc agcgtctgcc aggcatcaat tctaggcctg tctgcgcggt cgtagtacgg 3660
ctggaggcgt tttccggtct gtagctccat gttcggaatg acaaaattca gctcaagccg 3720
tcccttgtcc tggtgctcca cccacaggat gctgtactga tttttttcga gaccgggcat 3780
cagtacacgc tcaaagctcg ccatcacttt ttcacgtcct cccggcggca gctccttctc 3840
cgcgaacgac agaacaccgg acgtgtattt cttcgcaaat ggcgtggcat cgatgagttc 3900
ccggacttct tccggattac cctgaagcac cgttgcgcct tcgcggttac gctccctccc 3960
cagcaggtaa tcaaccggac cactgccacc accttttccc ctggcatgaa atttaactat 4020
catcccgcgc cccctgttcc ctgacagcca gacgcagccg gcgcagctca tccccgatgg 4080
ccatcagtgc ggccaccacc tgaacccggt caccggaaga ccactgcccg ctgttcacct 4140
tacgggctgt ctgattcagg ttatttccga tggcggccag ctgacgcagt aacggcggtg 4200
ccagtgtcgg cagttttccg gaacgggcaa ccggctcccc caggcagacc cgccgcatcc 4260
ataccgccag ttgtttaccc tcacagcgtt caagtaaccg ggcatgttca tcatcagtaa 4320
cccgtattgt gagcatcctc tcgcgtttca tcggtatcat taccccatga acagaaatcc 4380
cccttacacg gaggcatcag tgactaaacg gggtctgacg ctcagtggaa cgaaaactca 4440
cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat 4500
taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc tgacagttac 4560
caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc atccatagtt 4620
gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc tggccccagt 4680
gctgcaatga taccgcgaga cccacgctca ccggctccag atttatcagc aataaaccag 4740
ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc catccagtct 4800
attaattgtt gccgggaagc tagagtaagt agttcgccag ttaatagttt gcgcaacgtt 4860
gttgccattg ctgcaggcat cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc 4920
tccggttccc aacgatcaag gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt 4980
agctccttcg gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg 5040
gttatggcag cactgcataa ttctcttact gtcatgccat ccgtaagatg cttttctgtg 5100
actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc gagttgctct 5160
tgcccggcgt caacacggga taataccgcg ccacatagca gaactttaaa agtgctcatc 5220
attggaaaac gttcttcggg gcgaaaactc tcaaggatct taccgctgtt gagatccagt 5280
tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt caccagcgtt 5340
tctgggtgag caaaaacagg aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg 5400
aaatgttgaa tactcatact cttccttttt caatattatt gaagcattta tcagggttat 5460
tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat aggggttccg 5520
cgcacatttc cccgaaaagt gccacctgac gtctaagaaa ccattattat catgacatta 5580
acctataaaa ataggcgtat cacgaggccc tttcgtcttc aagaatttta taaaccgtgg 5640
agcgggcaat actgagctga tgagcaattt ccgttgcacc agtgcccttc tgatgaagcg 5700
tcagcacgac gttcctgtcc acggtacgcc tgcggccaaa tttgattcct ttcagctttg 5760
cttcctgtcg gccctcattc gtgcgctcta ggatcctcta cgccggacgc atcgtggccg 5820
gcatcaccgg cgctgaggtc tgcctcgtga agaaggtgtt gctgactcat accaggcctg 5880
aatcgcccca tcatccagcc agaaagtgag ggagccacgg ttgatgagag ctttgttgta 5940
ggtggaccag ttggtgattt tgaacttttg ctttgccacg gaacggtctg cgttgtcggg 6000
aagatgcgtg atctgatcct tcaactcagc aaaagttcga tttattcaac aaagccgccg 6060
tcccgtcaag tcagcgtaat gctctgccag tgttacaacc aattaaccaa ttctgattag 6120
aaaaactcat cgagcatcaa atgaaactgc aatttattca tatcaggatt atcaatacca 6180
tatttttgaa aaagccgttt ctgtaatgaa ggagaaaact caccgaggca gttccatagg 6240
atggcaagat cctggtatcg gtctgcgatt ccgactcgtc caacatcaat acaacctatt 6300
aatttcccct cgtcaaaaat aaggttatca agtgagaaat caccatgagt gacgactgaa 6360
tccggtgaga atggcagaat aggaacttcg gaataggaac ttcaaagcgt ttccgaaaac 6420
gagcgcttcc gaaaatgcaa cgcgagctgc gcacatacag ctcactgttc acgtcgcacc 6480
tatatctgcg tgttgcctgt atatatatat acatgagaag aacggcatag tgcgtgttta 6540
tgcttaaatg cgtacttata tgcgtctatt tatgtaggat gaaaggtagt ctagtacctc 6600
ctgtgatatt atcccattcc atgcggggta tcgtatgctt ccttcagcac taccctttag 6660
ctgttctata tgctgccact cctcaattgg attagtctca tccttcaatg ctatcatttc 6720
ctttgatatt ggatcatatg catagtaccg agaaactagt gcgaagtagt gatcaggtat 6780
tgctgttatc tgatgagtat acgttgtcct ggccacggca gaagcacgct tatcgctcca 6840
atttcccaca acattagtca actccgttag gcccttcatt gaaagaaatg aggtcatcaa 6900
atgtcttcca atgtgagatt ttgggccatt ttttatagca aagattgaat aaggcgcatt 6960
tttcttcaaa gctttattgt acgatctgac taagttatct tttaataatt ggtattcctg 7020
tttattgctt gaagaattgc cggtcctatt tactcgtttt aggactggtt cagaattcct 7080
caaaaattca tccaaatata caagtggatc gatcctaccc cttgcgctaa agaagtatat 7140
gtgcctacta acgcttgtct ttgtctctgt cactaaacac tggattatta ctcccagata 7200
cttattttgg actaatttaa atgatttcgg atcaacgttc ttaatatcgc tgaatcttcc 7260
acaattgatg aaagtagcta ggaagaggaa ttggtataaa gtttttgttt ttgtaaatct 7320
cgaagtatac tcaaacgaat ttagtatttt ctcagtgatc tcccagatgc tttcaccctc 7380
acttagaagt gctttaagca tttttttact gtggctattt cccttatctg cttcttccga 7440
tgattcgaac tgtaattgca aactacttac aatatcagtg atatcagatt gatgtttttg 7500
tccatagtaa ggaataattg taaattccca agcaggaatc aatttcttta atgaggcttc 7560
cagaattgtt gctttttgcg tcttgtattt aaactggagt gatttattga caatatcgaa 7620
actcagcgaa ttgcttatga tagtattata gctcatgaat gtggctctct tgattgctgt 7680
tccgttatgt gtaatcatcc aacataaata ggttagttca gcagcacata atgctatttt 7740
ctcacctgaa ggtctttcaa acctttccac aaactgacga acaagcacct taggtggtgt 7800
tttacataat atatcaaatt gtggcataca acctccttag tacatgcaac cattatcacc 7860
gccagaggta aaatagtcaa cacgcacggt gttagatatt tatcccttgc ggtgatagat 7920
ttaacgtatg agcacaaaaa agaaaccatt aacacaagag cagcttgagg acgcacgtcg 7980
ccttaaagca atttatgaaa aaaagaaaaa tgaacttggc ttatcccagg aatctgtcgc 8040
agacaagatg gggatggggc agtcaggcgt tggtgcttta tttaatggca tcaatgcatt 8100
aaatgcttat aacgccgcat tgcttacaaa aattctcaaa gttagcgttg aagaatttag 8160
cccttcaatc gccagagaaa tctacgagat gtatgaagcg gttagtatgc agccgtcact 8220
tagaagtgag tatgagtacc ctgttttttc tcatgttcag gcagggatgt tctcacctaa 8280
gcttagaacc tttaccaaag gtgatgcgga gagatgggta agcacaacca aaaaagccag 8340
tgattctgca ttctggcttg aggttgaagg taattccatg accgcaccaa caggctccaa 8400
gccaagcttt cctgacggaa tgttaattct cgttgaccct gagcaggctg ttgagccagg 8460
tgatttctgc atagccagac ttgggggtga tgagtttacc ttcaagaaac tgatcaggga 8520
tagcggtcag gtgtttttac aaccactaaa cccacagtac ccaatgatcc catgcaatga 8580
gagttgttcc gttgtgggga aagttatcgc tagtcagtgg cctgaagaga cgtttggctg 8640
atcggcaagg tgttctggtc ggcgcatagc tgataacaat tgagcaagaa tctgcatttc 8700
tttccagact tgttcaacag gccagccatt acgctcgtca tcaaaatcac tcgcatcaac 8760
caaaccgtta ttcattcgtg attgcgcctg agcgagacga aatacgcgat cgctgttaaa 8820
aggacaatta caaacaggaa tcgaatgcaa ccggcgcagg aacactgcca gcgcatcaac 8880
aatattttca cctgaatcag gatattcttc taatacctgg aatgctgttt tcccggggat 8940
cgcagtggtg agtaaccatg catcatcagg agtacggata aaatgcttga tggtcggaag 9000
aggcataaat tccgtcagcc agtttagtct gaccatctca tctgtaacat cattggcaac 9060
gctacctttg ccatgtttca gaaacaactc tggcgcatcg ggcttcccat acaatcgata 9120
gattgtcgca cctgattgcc cgacattatc gcgagcccat ttatacccat ataaatcagc 9180
atccatgttg gaatttaatc gcggcctcga gcaagacgtt tcccgttgaa tatggctcat 9240
aacacccctt gtattactgt ttatgtaagc agacagtttt attgttcatg atgatatatt 9300
tttatcttgt gcaatgtaac atcagagatt tt 9332
<210> 231
<211> 65
<212> DNA
<213> Artificial Sequence
<220>
<223> yqhDKF primer
<400> 231
cgccatcatg gcggtgcggc gctgccttcc agttcggtta acacggtgta ggctggagct 60
gcttc 65
<210> 232
<211> 65
<212> DNA
<213> Artificial Sequence
<220>
<223> yqhDKR primer
<400> 232
gcgcgagttc tcaataatgg cgcgtttggt gcgaacttcg tggtaattcc ggggatccgt 60
cgacc 65
<210> 233
<211> 32
<212> DNA
<213> Artificial Sequence
<220>
<223> dhaB123_F primer
<400> 233
tcatgaaatc aaaaagattt gaagtattga ag 32
<210> 234
<211> 36
<212> DNA
<213> Artificial Sequence
<220>
<223> dhaB123_R primer
<400> 234
ggatccctaa tcttttctaa gttgacctct ttgttc 36
<210> 235
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> gdrAB_F primer
<400> 235
ggatccaaag gttcggggat agttatgaag 30
<210> 236
<211> 33
<212> DNA
<213> Artificial Sequence
<220>
<223> gdrAB_R primer
<400> 236
gagctcttat ctaagtggca gaccctttac aag 33
<210> 237
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> iBAB_Up primer
<400> 237
atgtatatct ccttcttata cttaactaat 30
<210> 238
<211> 35
<212> DNA
<213> Artificial Sequence
<220>
<223> iBAB_Dn primer
<400> 238
atcggccggc cacgcgatcg ctgacgtcgg taccc 35
<210> 239
<211> 35
<212> DNA
<213> Artificial Sequence
<220>
<223> pduP_F primer
<400> 239
gaaggagata tacatatgca gattaatgat attga 35
<210> 240
<211> 35
<212> DNA
<213> Artificial Sequence
<220>
<223> pduP_R primer
<400> 240
gcgtggccgg ccgatttaat accagttacg tactg 35
<210> 241
<211> 49
<212> DNA
<213> Artificial Sequence
<220>
<223> primer for MELS_1449 gene
<400> 241
tcatcaccac agccaggatc cgatggcact aagagatggg aattcctac 49
<210> 242
<211> 43
<212> DNA
<213> Artificial Sequence
<220>
<223> primer for MELS_1449 gene
<400> 242
gcattatgcg gccgcaagct tttatttttc ttcctttatg ccg 43
<210> 243
<211> 43
<212> DNA
<213> Artificial Sequence
<220>
<223> yciA_F primer
<400> 243
tcatcaccac agccaagatc tgatgtctac aacacataac gtc 43
<210> 244
<211> 40
<212> DNA
<213> Artificial Sequence
<220>
<223> yciA_R primer
<400> 244
gcattatgcg gccgcctcga gttactcaac aggtaaggcg 40
Claims (20)
- 유전적으로 조작되지 않은 세포에 비하여 3-히드록시프로피온알데히드 (3-HPA)를 3-히드록시프로피오닐-CoA (3-HP-CoA)로 전환하는 것을 촉매하는 CoA-아실화 알데히드 데히드로게나제(CoA acylating aldehyde dehydrogenase: ALDH), 및 3-HP-CoA를 아크릴릴-CoA (acrylyl-CoA)로 전환하는 것을 촉매하는 3-HP-CoA 데히드라타제의 활성이 증가되어 있는, 아크릴레이트 생산능을 갖는 미생물.
- 청구항 1에 있어서, 아크릴릴-CoA를 아크릴레이트로 전환하는 것을 촉매하는 효소의 활성이 더 증가되어 있는 것인 미생물.
- 청구항 1에 있어서, 상기 ALDH는 서열번호 1 내지 20의 아미노산 서열로 이루어진 군으로부터 선택된 하나 이상을 갖는 것인 미생물.
- 청구항 1에 있어서, 상기 ALDH는 EC 1.2.1.10, 또는 EC 1.2.1.87에 속하는 것인 미생물.
- 청구항 1에 있어서, 상기 ALDH는 프로피온알데히드 데히드로게나제 (propioaldehyde dehydrogenase: pduP)인 것인 미생물.
- 청구항 1에 있어서, 상기 3-HP-CoA 데히드라타제는 서열번호 41 내지 119의 아미노산 서열로 이루어진 군으로부터 선택된 하나 이상을 갖는 것인 미생물.
- 청구항 1에 있어서, 상기 3-HP-CoA 데히드라타제는 EC 4.2.1.17, EC 4.2.1.55, 및 EC 4.2.1.166를 포함한, EC 4.2.1.-에 속하는 것인 미생물.
- 청구항 2에 있어서, 상기 아크릴릴-CoA를 아크릴레이트로 전환하는 것을 촉매하는 효소는 서열번호 199 내지 204의 아미노산 서열로 이루어진 군으로부터 선택된 하나 이상을 갖는 것인 미생물.
- 청구항 2에 있어서, 상기 아크릴릴-CoA를 아크릴레이트로 전환하는 것을 촉매하는 효소는 EC 3.1.2.4를 포함한, EC 3.1.2-에 속하는 것인 미생물.
- 청구항 2에 있어서, 상기 아크릴릴-CoA를 아크릴레이트로 전환하는 것을 촉매하는 효소는 3-HP-CoA 히드롤라제 (3-HP-CoA hydrolase), 또는 3-히드록시이소부티릴-CoA 히드롤라제(3-hydroxyisobutyryl-CoA hydrolase)인 것인 미생물.
- 청구항 1에 있어서, ALDH, 및 3-HP-CoA 데히드라타제의 활성의 증가는 이들 효소를 코딩하는 폴리뉴클레오티드의 발현 증가에 의한 것인 미생물.
- 청구항 1에 있어서, 상기 미생물은 ALDH, 및 3-HP-CoA 데히드라타제, 또는 아크릴릴-CoA를 아크릴레이트로 전환하는 것을 촉매하는 효소를 코딩하는 폴리뉴클레오티드가 도입된 것인 미생물.
- 청구항 1에 있어서, 상기 미생물은 에세리키아 (Esherichia), 루멘박테리아, 코리네박테리움 (Corynebacterium) 속 및 브레비박테리움 (Brevibacterium) 속으로구성된 군으로부터 선택되는 것인 미생물.
- 청구항 1에 있어서, 상기 미생물은 아크릴레이트를 분해하거나 다른 산물로 전환하는 경로에 관여하는 하나 이상의 효소의 활성이 감소된 것인 미생물.
- 청구항 1에 있어서, 상기 미생물은 아크릴레이트를 분해하거나 다른 산물로 전환하는 경로에 관여하는 하나 이상의 효소를 코딩하는 유전자가 제거 또는 파괴되어 있는 것인 미생물.
- 청구항 1에 있어서, 상기 미생물은 3-HPA 생산능을 갖는 것인 미생물.
- 청구항 1에 있어서, 상기 3-HPA 생산능을 갖는 미생물은 대장균이고, 글리세롤 데히드라타제 (glycerol dehydratase: GDH)를 코딩하는 유전자, 및 글리세롤 데하이드라타제 재활성화효소 (glycerol dehydratase reactivase: GDR)를 코딩하는 유전자가 도입되어 있어 있는 것인 미생물.
- 청구항 1의 미생물을 배지 중에서 배양하는 단계;를 포함하는, 아크릴레이트를 생산하는 방법.
- 청구항 18에 있어서, 배양물로부터 아크릴레이트를 회수하는 단계를 더 포함하는 것인 방법.
- 청구항 18에 있어서, 상기 미생물은 아크릴레이트를 다른 산물로 전환하는 경로를 더 포함하는 것이고, 생산된 아크릴레이트를 다른 산물로 전환하는 단계를 더 포함하는 것인 방법.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020140085356A KR20160006030A (ko) | 2014-07-08 | 2014-07-08 | CoA 아실화 알데히드 데히드게나제 활성이 증가된 신규한 아크릴산 생성 경로를 갖는 미생물 및 이를 이용한 아크릴산 생산 방법 |
US14/620,002 US20160010124A1 (en) | 2014-07-08 | 2015-02-11 | Microorganism having novel acrylic acid synthesis pathway having enhanced activity of coa acylating aldehyde dehydrogenase and method of producing acrylic acid using the same |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020140085356A KR20160006030A (ko) | 2014-07-08 | 2014-07-08 | CoA 아실화 알데히드 데히드게나제 활성이 증가된 신규한 아크릴산 생성 경로를 갖는 미생물 및 이를 이용한 아크릴산 생산 방법 |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20160006030A true KR20160006030A (ko) | 2016-01-18 |
Family
ID=55067131
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020140085356A KR20160006030A (ko) | 2014-07-08 | 2014-07-08 | CoA 아실화 알데히드 데히드게나제 활성이 증가된 신규한 아크릴산 생성 경로를 갖는 미생물 및 이를 이용한 아크릴산 생산 방법 |
Country Status (2)
Country | Link |
---|---|
US (1) | US20160010124A1 (ko) |
KR (1) | KR20160006030A (ko) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR102173569B1 (ko) * | 2016-01-18 | 2020-11-04 | 한화솔루션 주식회사 | 아크릴산 생성능을 가지는 재조합 변이 미생물 및 이를 이용한 아크릴산의 제조방법 |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20150110144A (ko) * | 2014-03-24 | 2015-10-02 | 삼성전자주식회사 | 글리세롤, 3-hp, 또는 아크릴산 생산능을 갖는 재조합 미생물 및 그를 이용하여 글리세롤, 3-hp, 또는 아크릴산을 생산하는 방법 |
KR102208963B1 (ko) * | 2014-05-14 | 2021-01-28 | 삼성전자주식회사 | 신규한 아크릴산 생성 경로를 갖는 미생물 및 이를 이용한 아크릴산 생산 방법 |
-
2014
- 2014-07-08 KR KR1020140085356A patent/KR20160006030A/ko not_active Application Discontinuation
-
2015
- 2015-02-11 US US14/620,002 patent/US20160010124A1/en not_active Abandoned
Also Published As
Publication number | Publication date |
---|---|
US20160010124A1 (en) | 2016-01-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2021203008B2 (en) | Genetically engineered bacterium comprising energy-generating fermentation pathway | |
KR102493174B1 (ko) | 고암모니아혈증과 관련된 질병을 치료하기 위해 공학처리된 박테리아 | |
KR20100031525A (ko) | 메타크릴산 또는 메타크릴산 에스테르의 제조 방법 | |
KR101417146B1 (ko) | 이소프레노이드의 생산 방법 | |
KR101420889B1 (ko) | 생물 유기 화합물을 제조하기 위한 장치 | |
KR102303832B1 (ko) | 내산성을 갖는 효모 세포, 상기 효모 세포를 제조하는 방법 및 이의 용도 | |
CN101297042A (zh) | 四碳醇的发酵生产 | |
BRPI0719748A2 (pt) | Microrganismo modificados por engenharia para produzir n-butanol e métodos relacionados | |
KR20130117753A (ko) | 포스포케톨라아제를 포함하는 재조합 숙주 세포 | |
TW201040276A (en) | Protein production in microorganisms of the phylum labyrinthulomycota | |
KR20240005196A (ko) | 유전자 조작된 미생물로부터 개선된 뮤콘산 생산 | |
KR20150042856A (ko) | 클라빈-유형 알칼로이드의 생산을 위한 유전자 및 방법 | |
KR20220002348A (ko) | 아이소유제놀로부터의 바닐린의 생합성 | |
KR102384227B1 (ko) | Nadph 생성이 증가된 유전적으로 조작된 효모 세포, 효모 세포 중 nadph 수준을 증가시키는 방법, 상기 효모 세포를 제조하는 방법 및 상기 효모 세포를 이용하여 락테이트를 생산하는 방법 | |
KR20160006030A (ko) | CoA 아실화 알데히드 데히드게나제 활성이 증가된 신규한 아크릴산 생성 경로를 갖는 미생물 및 이를 이용한 아크릴산 생산 방법 | |
KR102311681B1 (ko) | 내산성을 갖는 효모 세포, 그를 이용하여 유기산을 생산하는 방법 및 상기 내산성 효모 세포를 생산하는 방법 | |
KR102558303B1 (ko) | 재조합 실크 제조를 위한 변형 균주 | |
CN101627109A (zh) | 用于生产正丁醇的工程化改造的微生物及相关方法 | |
KR102287346B1 (ko) | Rgt1의 활성이 감소된 효모 세포, 그를 제조하는 방법 및 그를 사용하여 산물을 생산하는 방법 | |
CN113832087B (zh) | 一种利用大肠杆菌全生物合成丙二酸的方法 | |
CN114231562A (zh) | 一种表达荧光素酶基因的淋巴脉络丛脑膜炎病毒及其构建方法和应用 | |
KR102287347B1 (ko) | Rim15 및 igo2의 활성이 감소된 락테이트 생산능을 갖는 효모 세포, 그를 제조하는 방법 및 그를 사용하여 락테이트를 생산하는 방법 | |
CN114250172B (zh) | 一种海运海杆菌及其应用 | |
KR102255306B1 (ko) | 아세트알데히드 데히드로게나제를 포함하는 락테이트 생산능을 갖는 유전적으로 조작된 효모 세포, 그를 제조하는 방법 및 그를 사용하여 락테이트를 생산하는 방법 | |
KR102331270B1 (ko) | 폴리하이드록시알카노에이트를 생산하는 형질전환 미생물 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WITN | Application deemed withdrawn, e.g. because no request for examination was filed or no examination fee was paid |