CN114150024B - Bifunctional enzyme biocatalyst, and preparation method and application thereof - Google Patents
Bifunctional enzyme biocatalyst, and preparation method and application thereof
- Publication number
- CN114150024B (application CN202111470133.1A)
- Authority
- CN
- China
- Prior art keywords
- gly
- ala
- val
- thr
- ile
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 102000004190 Enzymes Human genes 0.000 title claims abstract description 61
- 108090000790 Enzymes Proteins 0.000 title claims abstract description 61
- 238000002360 preparation method Methods 0.000 title abstract description 11
- 230000001588 bifunctional effect Effects 0.000 title abstract description 4
- 102000007698 Alcohol dehydrogenase Human genes 0.000 claims abstract description 118
- 108010021809 Alcohol dehydrogenase Proteins 0.000 claims abstract description 118
- PUPZLCDOIYMWBV-SCSAIBSYSA-N (R)-butane-1,3-diol Chemical compound C[C@@H](O)CCO PUPZLCDOIYMWBV-SCSAIBSYSA-N 0.000 claims abstract description 32
- 238000006555 catalytic reaction Methods 0.000 claims abstract description 22
- LVSQXDHWDCMMRJ-UHFFFAOYSA-N 4-hydroxybutan-2-one Chemical compound CC(=O)CCO LVSQXDHWDCMMRJ-UHFFFAOYSA-N 0.000 claims abstract description 18
- 238000000034 method Methods 0.000 claims abstract description 17
- 239000000758 substrate Substances 0.000 claims abstract description 4
- 238000010189 synthetic method Methods 0.000 claims abstract 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 36
- 241000607598 Vibrio Species 0.000 claims description 27
- 238000006243 chemical reaction Methods 0.000 claims description 25
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 claims description 23
- QMGVPVSNSZLJIA-UHFFFAOYSA-N Nux Vomica Natural products C1C2C3C4N(C=5C6=CC=CC=5)C(=O)CC3OCC=C2CN2C1C46CC2 QMGVPVSNSZLJIA-UHFFFAOYSA-N 0.000 claims description 13
- 244000107975 Strychnos nux-vomica Species 0.000 claims description 13
- 230000035772 mutation Effects 0.000 claims description 8
- 239000008363 phosphate buffer Substances 0.000 claims description 8
- 239000005515 coenzyme Substances 0.000 claims description 7
- 230000015572 biosynthetic process Effects 0.000 claims description 6
- 238000003786 synthesis reaction Methods 0.000 claims description 6
- 239000008176 lyophilized powder Substances 0.000 claims description 5
- 241000223996 Toxoplasma Species 0.000 claims description 4
- 102220021458 rs397509053 Human genes 0.000 claims description 4
- 102220554288 Ankyrin repeat domain-containing protein 42_N198D_mutation Human genes 0.000 claims description 3
- 102220098668 rs878853245 Human genes 0.000 claims description 3
- 238000001308 synthesis method Methods 0.000 claims description 3
- 239000007795 chemical reaction product Substances 0.000 claims description 2
- 210000000582 semen Anatomy 0.000 claims 1
- 230000002194 synthesizing effect Effects 0.000 abstract description 8
- 239000002994 raw material Substances 0.000 abstract 1
- 108090000623 proteins and genes Proteins 0.000 description 58
- 241000894006 Bacteria Species 0.000 description 51
- 108010061238 threonyl-glycine Proteins 0.000 description 32
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 31
- 210000004027 cell Anatomy 0.000 description 29
- 108010050848 glycylleucine Proteins 0.000 description 26
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 23
- 102000004169 proteins and genes Human genes 0.000 description 20
- 108010005942 methionylglycine Proteins 0.000 description 19
- 230000001546 nitrifying effect Effects 0.000 description 18
- 238000002741 site-directed mutagenesis Methods 0.000 description 18
- 108020004414 DNA Proteins 0.000 description 17
- MXIULRKNFSCJHT-STQMWFEESA-N Gly-Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 MXIULRKNFSCJHT-STQMWFEESA-N 0.000 description 17
- 230000004927 fusion Effects 0.000 description 17
- 241000222120 Candida <Saccharomycetales> Species 0.000 description 16
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 16
- UNPGTBHYKJOCCZ-DCAQKATOSA-N Met-Lys-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O UNPGTBHYKJOCCZ-DCAQKATOSA-N 0.000 description 15
- 108010011559 alanylphenylalanine Proteins 0.000 description 15
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 15
- 239000000243 solution Substances 0.000 description 15
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 14
- 108010085203 methionylmethionine Proteins 0.000 description 14
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 13
- IWAXHBCACVWNHT-BQBZGAKWSA-N Gly-Asp-Arg Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IWAXHBCACVWNHT-BQBZGAKWSA-N 0.000 description 13
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 13
- 108010079364 N-glycylalanine Proteins 0.000 description 13
- GNZCMRRSXOBHLC-JYJNAYRXSA-N Phe-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N GNZCMRRSXOBHLC-JYJNAYRXSA-N 0.000 description 13
- 108010005233 alanylglutamic acid Proteins 0.000 description 13
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 13
- 108091026890 Coding region Proteins 0.000 description 12
- SCJJPCQUJYPHRZ-BQBZGAKWSA-N Gly-Pro-Asn Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O SCJJPCQUJYPHRZ-BQBZGAKWSA-N 0.000 description 12
- QBHGXFQJFPWJIH-XUXIUFHCSA-N Lys-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN QBHGXFQJFPWJIH-XUXIUFHCSA-N 0.000 description 12
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 12
- DBMLDOWSVHMQQN-XGEHTFHBSA-N Met-Ser-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DBMLDOWSVHMQQN-XGEHTFHBSA-N 0.000 description 12
- YKQNVTOIYFQMLW-IHRRRGAJSA-N Pro-Cys-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 YKQNVTOIYFQMLW-IHRRRGAJSA-N 0.000 description 12
- MAWSJXHRLWVJEZ-ACZMJKKPSA-N Ser-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N MAWSJXHRLWVJEZ-ACZMJKKPSA-N 0.000 description 12
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 12
- 238000006477 desulfuration reaction Methods 0.000 description 12
- 230000023556 desulfurization Effects 0.000 description 12
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 12
- YIFUFYZELCMPJP-YUMQZZPRSA-N Gly-Leu-Cys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O YIFUFYZELCMPJP-YUMQZZPRSA-N 0.000 description 11
- HIAHVKLTHNOENC-HGNGGELXSA-N His-Glu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HIAHVKLTHNOENC-HGNGGELXSA-N 0.000 description 11
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 11
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 11
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 11
- MVYRJYISVJWKSX-KBPBESRZSA-N Tyr-His-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)NCC(=O)O)N)O MVYRJYISVJWKSX-KBPBESRZSA-N 0.000 description 11
- ZMKDQRJLMRZHRI-ACRUOGEOSA-N Tyr-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N ZMKDQRJLMRZHRI-ACRUOGEOSA-N 0.000 description 11
- 108010060199 cysteinylproline Proteins 0.000 description 11
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 11
- QTUSJASXLGLJSR-OSUNSFLBSA-N Ile-Arg-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N QTUSJASXLGLJSR-OSUNSFLBSA-N 0.000 description 10
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 10
- BAWFJGJZGIEFAR-NNYOXOHSSA-N NAD zwitterion Chemical compound NC(=O)C1=CC=C[N+]([C@H]2[C@@H]([C@H](O)[C@@H](COP([O-])(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 BAWFJGJZGIEFAR-NNYOXOHSSA-N 0.000 description 10
- 241001141101 Nitrospirales Species 0.000 description 10
- APIQKJYZDWVOCE-VEVYYDQMSA-N Thr-Asp-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O APIQKJYZDWVOCE-VEVYYDQMSA-N 0.000 description 10
- NZRUWPIYECBYRK-HTUGSXCWSA-N Thr-Phe-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O NZRUWPIYECBYRK-HTUGSXCWSA-N 0.000 description 10
- 108010056582 methionylglutamic acid Proteins 0.000 description 10
- 229950006238 nadide Drugs 0.000 description 10
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 9
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 9
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 9
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 9
- NPROWIBAWYMPAZ-GUDRVLHUSA-N Ile-Asp-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N NPROWIBAWYMPAZ-GUDRVLHUSA-N 0.000 description 9
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 9
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 9
- AUMNPAUHKUNHHN-BYULHYEWSA-N Val-Asn-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N AUMNPAUHKUNHHN-BYULHYEWSA-N 0.000 description 9
- FXVDGDZRYLFQKY-WPRPVWTQSA-N Val-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C FXVDGDZRYLFQKY-WPRPVWTQSA-N 0.000 description 9
- 108010092114 histidylphenylalanine Proteins 0.000 description 9
- 239000000843 powder Substances 0.000 description 9
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 8
- VDBKFYYIBLXEIF-GUBZILKMSA-N Arg-Gln-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VDBKFYYIBLXEIF-GUBZILKMSA-N 0.000 description 8
- TWVTVZUGEDBAJF-ACZMJKKPSA-N Asn-Cys-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N TWVTVZUGEDBAJF-ACZMJKKPSA-N 0.000 description 8
- 241000066531 Desulfonatronovibrio magnus Species 0.000 description 8
- ZTNHPMZHAILHRB-JSGCOSHPSA-N Glu-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)NCC(O)=O)=CNC2=C1 ZTNHPMZHAILHRB-JSGCOSHPSA-N 0.000 description 8
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 8
- BGGTYDNTOYRTTR-MEYUZBJRSA-N Leu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(C)C)N)O BGGTYDNTOYRTTR-MEYUZBJRSA-N 0.000 description 8
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 8
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 8
- 241000586786 Peptococcaceae bacterium BRH_c4b Species 0.000 description 8
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 8
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 8
- 108010093581 aspartyl-proline Proteins 0.000 description 8
- RMGVZKRVHHSUIM-UHFFFAOYSA-N dithionic acid Chemical compound OS(=O)(=O)S(O)(=O)=O RMGVZKRVHHSUIM-UHFFFAOYSA-N 0.000 description 8
- 108010087823 glycyltyrosine Proteins 0.000 description 8
- 108010003700 lysyl aspartic acid Proteins 0.000 description 8
- 108010038320 lysylphenylalanine Proteins 0.000 description 8
- 108010051242 phenylalanylserine Proteins 0.000 description 8
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 8
- STACJSVFHSEZJV-GHCJXIJMSA-N Ala-Asn-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STACJSVFHSEZJV-GHCJXIJMSA-N 0.000 description 7
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 7
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 7
- ACKNRKFVYUVWAC-ZPFDUUQYSA-N Asn-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ACKNRKFVYUVWAC-ZPFDUUQYSA-N 0.000 description 7
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 7
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 7
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 7
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 7
- 241000192001 Pediococcus Species 0.000 description 7
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 7
- GRSCONMARGNYHA-PMVMPFDFSA-N Trp-Lys-Phe Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GRSCONMARGNYHA-PMVMPFDFSA-N 0.000 description 7
- 108010092854 aspartyllysine Proteins 0.000 description 7
- 108010081551 glycylphenylalanine Proteins 0.000 description 7
- 108010077515 glycylproline Proteins 0.000 description 7
- 108010045383 histidyl-glycyl-glutamic acid Proteins 0.000 description 7
- 239000007788 liquid Substances 0.000 description 7
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 6
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 6
- ZMWDUIIACVLIHK-GHCJXIJMSA-N Asn-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N ZMWDUIIACVLIHK-GHCJXIJMSA-N 0.000 description 6
- ZPMNECSEJXXNBE-CIUDSAMLSA-N Asn-Cys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ZPMNECSEJXXNBE-CIUDSAMLSA-N 0.000 description 6
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 6
- HSWYMWGDMPLTTH-FXQIFTODSA-N Asp-Glu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HSWYMWGDMPLTTH-FXQIFTODSA-N 0.000 description 6
- 241000193830 Bacillus <bacterium> Species 0.000 description 6
- NITLUESFANGEIW-BQBZGAKWSA-N Cys-Pro-Gly Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O NITLUESFANGEIW-BQBZGAKWSA-N 0.000 description 6
- XEKOWRVHYACXOJ-UHFFFAOYSA-N Ethyl acetate Chemical compound CCOC(C)=O XEKOWRVHYACXOJ-UHFFFAOYSA-N 0.000 description 6
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 6
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 6
- ZOTGXWMKUFSKEU-QXEWZRGKSA-N Gly-Ile-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O ZOTGXWMKUFSKEU-QXEWZRGKSA-N 0.000 description 6
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 6
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 6
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 6
- CJGDTAHEMXLRMB-ULQDDVLXSA-N His-Arg-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CJGDTAHEMXLRMB-ULQDDVLXSA-N 0.000 description 6
- VXZZUXWAOMWWJH-QTKMDUPCSA-N His-Thr-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VXZZUXWAOMWWJH-QTKMDUPCSA-N 0.000 description 6
- CSQNHSGHAPRGPQ-YTFOTSKYSA-N Ile-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)O)N CSQNHSGHAPRGPQ-YTFOTSKYSA-N 0.000 description 6
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 description 6
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 6
- YIBOAHAOAWACDK-QEJZJMRPSA-N Lys-Ala-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YIBOAHAOAWACDK-QEJZJMRPSA-N 0.000 description 6
- ZAENPHCEQXALHO-GUBZILKMSA-N Lys-Cys-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZAENPHCEQXALHO-GUBZILKMSA-N 0.000 description 6
- CYZBFPYMSJGBRL-DRZSPHRISA-N Phe-Ala-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CYZBFPYMSJGBRL-DRZSPHRISA-N 0.000 description 6
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 6
- NHDVNAKDACFHPX-GUBZILKMSA-N Pro-Arg-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O NHDVNAKDACFHPX-GUBZILKMSA-N 0.000 description 6
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 6
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 6
- PVDTYLHUWAEYGY-CIUDSAMLSA-N Ser-Glu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PVDTYLHUWAEYGY-CIUDSAMLSA-N 0.000 description 6
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 6
- CNLKDWSAORJEMW-KWQFWETISA-N Tyr-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O CNLKDWSAORJEMW-KWQFWETISA-N 0.000 description 6
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 6
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 6
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 6
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 6
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 6
- 108010049041 glutamylalanine Proteins 0.000 description 6
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 6
- 108010010147 glycylglutamine Proteins 0.000 description 6
- 108010012581 phenylalanylglutamate Proteins 0.000 description 6
- 108010073969 valyllysine Proteins 0.000 description 6
- 239000013598 vector Substances 0.000 description 6
- 108010027345 wheylin-1 peptide Proteins 0.000 description 6
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 5
- QCTFKEJEIMPOLW-JURCDPSOSA-N Ala-Ile-Phe Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QCTFKEJEIMPOLW-JURCDPSOSA-N 0.000 description 5
- 241001136698 Anaerolineales Species 0.000 description 5
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 5
- HGKHPCFTRQDHCU-IUCAKERBSA-N Arg-Pro-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HGKHPCFTRQDHCU-IUCAKERBSA-N 0.000 description 5
- 241000665760 Desulfobacterales bacterium Species 0.000 description 5
- 241000894129 Desulfovibrio sulfodismutans Species 0.000 description 5
- NSORZJXKUQFEKL-JGVFFNPUSA-N Gln-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)N)N)C(=O)O NSORZJXKUQFEKL-JGVFFNPUSA-N 0.000 description 5
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 5
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 5
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 5
- 241000880493 Leptailurus serval Species 0.000 description 5
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 5
- TUIOUEWKFFVNLH-DCAQKATOSA-N Leu-Val-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O TUIOUEWKFFVNLH-DCAQKATOSA-N 0.000 description 5
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 5
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 5
- HZVXPUHLTZRQEL-UWVGGRQHSA-N Met-Leu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O HZVXPUHLTZRQEL-UWVGGRQHSA-N 0.000 description 5
- 108010047562 NGR peptide Proteins 0.000 description 5
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 5
- 241001642893 Planctomycetaceae bacterium Species 0.000 description 5
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 5
- 241001112743 Thermoanaerobacteraceae Species 0.000 description 5
- WPAKPLPGQNUXGN-OSUNSFLBSA-N Thr-Ile-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WPAKPLPGQNUXGN-OSUNSFLBSA-N 0.000 description 5
- DOBIBIXIHJKVJF-XKBZYTNZSA-N Thr-Ser-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DOBIBIXIHJKVJF-XKBZYTNZSA-N 0.000 description 5
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 5
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 5
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 5
- 108010047495 alanylglycine Proteins 0.000 description 5
- 108010038633 aspartylglutamate Proteins 0.000 description 5
- 108010068265 aspartyltyrosine Proteins 0.000 description 5
- 238000001514 detection method Methods 0.000 description 5
- 108010037850 glycylvaline Proteins 0.000 description 5
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 5
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 5
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 4
- FVSOUJZKYWEFOB-KBIXCLLPSA-N Ala-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)N FVSOUJZKYWEFOB-KBIXCLLPSA-N 0.000 description 4
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 4
- DDPXDCKYWDGZAL-BQBZGAKWSA-N Asn-Gly-Arg Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N DDPXDCKYWDGZAL-BQBZGAKWSA-N 0.000 description 4
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 4
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 4
- LMIWYCWRJVMAIQ-NHCYSSNCSA-N Asn-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N LMIWYCWRJVMAIQ-NHCYSSNCSA-N 0.000 description 4
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 4
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 4
- 241000816697 Candidatus Abyssubacteria bacterium SURF_5 Species 0.000 description 4
- 241000685637 Desulfotomaculum arcticum Species 0.000 description 4
- 241000194033 Enterococcus Species 0.000 description 4
- 241000588724 Escherichia coli Species 0.000 description 4
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 4
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 4
- GRHXUHCFENOCOS-ZPFDUUQYSA-N Glu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N GRHXUHCFENOCOS-ZPFDUUQYSA-N 0.000 description 4
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 4
- PMSDOVISAARGAV-FHWLQOOXSA-N Glu-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 PMSDOVISAARGAV-FHWLQOOXSA-N 0.000 description 4
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 4
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 4
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 4
- LKJCZEPXHOIAIW-HOTGVXAUSA-N Gly-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN LKJCZEPXHOIAIW-HOTGVXAUSA-N 0.000 description 4
- PGTISAJTWZPFGN-PEXQALLHSA-N His-Gly-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O PGTISAJTWZPFGN-PEXQALLHSA-N 0.000 description 4
- GYXDQXPCPASCNR-NHCYSSNCSA-N His-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N GYXDQXPCPASCNR-NHCYSSNCSA-N 0.000 description 4
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 4
- SVZFKLBRCYCIIY-CYDGBPFRSA-N Ile-Pro-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SVZFKLBRCYCIIY-CYDGBPFRSA-N 0.000 description 4
- OWSWUWDMSNXTNE-GMOBBJLQSA-N Ile-Pro-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N OWSWUWDMSNXTNE-GMOBBJLQSA-N 0.000 description 4
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 4
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 4
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 4
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 4
- CPONGMJGVIAWEH-DCAQKATOSA-N Leu-Met-Ala Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O CPONGMJGVIAWEH-DCAQKATOSA-N 0.000 description 4
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 4
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 4
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 4
- BAWFJGJZGIEFAR-NNYOXOHSSA-O NAD(+) Chemical compound NC(=O)C1=CC=C[N+]([C@H]2[C@@H]([C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 BAWFJGJZGIEFAR-NNYOXOHSSA-O 0.000 description 4
- XJLXINKUBYWONI-NNYOXOHSSA-O NADP(+) Chemical compound NC(=O)C1=CC=C[N+]([C@H]2[C@@H]([C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](OP(O)(O)=O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 XJLXINKUBYWONI-NNYOXOHSSA-O 0.000 description 4
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 4
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 4
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 4
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 4
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 4
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 4
- GYKDRHDMGQUZPU-MGHWNKPDSA-N Tyr-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GYKDRHDMGQUZPU-MGHWNKPDSA-N 0.000 description 4
- FMXFHNSFABRVFZ-BZSNNMDCSA-N Tyr-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FMXFHNSFABRVFZ-BZSNNMDCSA-N 0.000 description 4
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 4
- IRLYZKKNBFPQBW-XGEHTFHBSA-N Val-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N)O IRLYZKKNBFPQBW-XGEHTFHBSA-N 0.000 description 4
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 4
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 4
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 4
- 108010062796 arginyllysine Proteins 0.000 description 4
- YZBQHRLRFGPBSL-RXMQYKEDSA-N carbapenem Chemical compound C1C=CN2C(=O)C[C@H]21 YZBQHRLRFGPBSL-RXMQYKEDSA-N 0.000 description 4
- 108010016616 cysteinylglycine Proteins 0.000 description 4
- 239000013604 expression vector Substances 0.000 description 4
- 108010078144 glutaminyl-glycine Proteins 0.000 description 4
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 4
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 4
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 4
- 108010012988 lysyl-glutamyl-aspartyl-glycine Proteins 0.000 description 4
- 108010064235 lysylglycine Proteins 0.000 description 4
- 238000004519 manufacturing process Methods 0.000 description 4
- 108020004707 nucleic acids Proteins 0.000 description 4
- 102000039446 nucleic acids Human genes 0.000 description 4
- 150000007523 nucleic acids Chemical class 0.000 description 4
- 239000013612 plasmid Substances 0.000 description 4
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 3
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 3
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 3
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 3
- DHONNEYAZPNGSG-UBHSHLNASA-N Ala-Val-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DHONNEYAZPNGSG-UBHSHLNASA-N 0.000 description 3
- ZZZWQALDSQQBEW-STQMWFEESA-N Arg-Gly-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZZZWQALDSQQBEW-STQMWFEESA-N 0.000 description 3
- KXEGPPNPXOKKHK-ZLUOBGJFSA-N Asn-Asp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KXEGPPNPXOKKHK-ZLUOBGJFSA-N 0.000 description 3
- SQZIAWGBBUSSPJ-ZKWXMUAHSA-N Asn-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N SQZIAWGBBUSSPJ-ZKWXMUAHSA-N 0.000 description 3
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 3
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 3
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 3
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 3
- BIVYLQMZPHDUIH-WHFBIAKZSA-N Asp-Gly-Cys Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)C(=O)O BIVYLQMZPHDUIH-WHFBIAKZSA-N 0.000 description 3
- HAFCJCDJGIOYPW-WDSKDSINSA-N Asp-Gly-Gln Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O HAFCJCDJGIOYPW-WDSKDSINSA-N 0.000 description 3
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 3
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 3
- IZUNQDRIAOLWCN-YUMQZZPRSA-N Cys-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N IZUNQDRIAOLWCN-YUMQZZPRSA-N 0.000 description 3
- NAPULYCVEVVFRB-HEIBUPTGSA-N Cys-Thr-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CS NAPULYCVEVVFRB-HEIBUPTGSA-N 0.000 description 3
- WUAYFMZULZDSLB-ACZMJKKPSA-N Gln-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O WUAYFMZULZDSLB-ACZMJKKPSA-N 0.000 description 3
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 3
- UTKICHUQEQBDGC-ACZMJKKPSA-N Glu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UTKICHUQEQBDGC-ACZMJKKPSA-N 0.000 description 3
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 3
- OGCIHJPYKVSMTE-YUMQZZPRSA-N Gly-Arg-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OGCIHJPYKVSMTE-YUMQZZPRSA-N 0.000 description 3
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 3
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 3
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 3
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 3
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 3
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 3
- ORXZVPZCPMKHNR-IUCAKERBSA-N Gly-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 ORXZVPZCPMKHNR-IUCAKERBSA-N 0.000 description 3
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 3
- KBBFOULZCHWGJX-KBPBESRZSA-N Gly-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN)O KBBFOULZCHWGJX-KBPBESRZSA-N 0.000 description 3
- MUGLKCQHTUFLGF-WPRPVWTQSA-N Gly-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)CN MUGLKCQHTUFLGF-WPRPVWTQSA-N 0.000 description 3
- AKEDPWJFQULLPE-IUCAKERBSA-N His-Glu-Gly Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O AKEDPWJFQULLPE-IUCAKERBSA-N 0.000 description 3
- UXSATKFPUVZVDK-KKUMJFAQSA-N His-Lys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N UXSATKFPUVZVDK-KKUMJFAQSA-N 0.000 description 3
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 3
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 3
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 3
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 3
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 3
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 3
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 3
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 3
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 3
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 3
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 3
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 3
- FYRUJIJAUPHUNB-IUCAKERBSA-N Met-Gly-Arg Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N FYRUJIJAUPHUNB-IUCAKERBSA-N 0.000 description 3
- BEZJTLKUMFMITF-AVGNSLFASA-N Met-Lys-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCNC(N)=N BEZJTLKUMFMITF-AVGNSLFASA-N 0.000 description 3
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 3
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 3
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 3
- NADP+ Chemical compound 0.000 description 3
- MPGJIHFJCXTVEX-KKUMJFAQSA-N Phe-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O MPGJIHFJCXTVEX-KKUMJFAQSA-N 0.000 description 3
- YFNOUBWUIIJQHF-LPEHRKFASA-N Pro-Asp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O YFNOUBWUIIJQHF-LPEHRKFASA-N 0.000 description 3
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 3
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 3
- UGTZYIPOBYXWRW-SRVKXCTJSA-N Ser-Phe-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O UGTZYIPOBYXWRW-SRVKXCTJSA-N 0.000 description 3
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 3
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 3
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 3
- BIENEHRYNODTLP-HJGDQZAQSA-N Thr-Glu-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N)O BIENEHRYNODTLP-HJGDQZAQSA-N 0.000 description 3
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 3
- KWTRGSQOQHZKIA-PMVMPFDFSA-N Trp-Lys-Tyr Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)CCCCN)C(O)=O)C1=CC=C(O)C=C1 KWTRGSQOQHZKIA-PMVMPFDFSA-N 0.000 description 3
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 3
- LTFLDDDGWOVIHY-NAKRPEOUSA-N Val-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N LTFLDDDGWOVIHY-NAKRPEOUSA-N 0.000 description 3
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 3
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 3
- ZIGZPYJXIWLQFC-QTKMDUPCSA-N Val-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C(C)C)N)O ZIGZPYJXIWLQFC-QTKMDUPCSA-N 0.000 description 3
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 229940079593 drug Drugs 0.000 description 3
- 239000003814 drug Substances 0.000 description 3
- 210000003527 eukaryotic cell Anatomy 0.000 description 3
- 238000000855 fermentation Methods 0.000 description 3
- 238000004817 gas chromatography Methods 0.000 description 3
- 230000006698 induction Effects 0.000 description 3
- 108010057821 leucylproline Proteins 0.000 description 3
- 108010009298 lysylglutamic acid Proteins 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 239000008055 phosphate buffer solution Substances 0.000 description 3
- 239000000047 product Substances 0.000 description 3
- 210000001236 prokaryotic cell Anatomy 0.000 description 3
- 108010031719 prolyl-serine Proteins 0.000 description 3
- 230000008929 regeneration Effects 0.000 description 3
- 238000011069 regeneration method Methods 0.000 description 3
- 108010038745 tryptophylglycine Proteins 0.000 description 3
- SADYNMDJGAWAEW-JKQORVJESA-N (2s)-2-[[(2s)-3-carboxy-2-[[(2s)-2-[[(2s)-2,6-diaminohexanoyl]amino]-3-methylbutanoyl]amino]propanoyl]amino]-4-methylpentanoic acid Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN SADYNMDJGAWAEW-JKQORVJESA-N 0.000 description 2
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 description 2
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 2
- FOWHQTWRLFTELJ-FXQIFTODSA-N Ala-Asp-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N FOWHQTWRLFTELJ-FXQIFTODSA-N 0.000 description 2
- NJIFPLAJSVUQOZ-JBDRJPRFSA-N Ala-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C)N NJIFPLAJSVUQOZ-JBDRJPRFSA-N 0.000 description 2
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 2
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 2
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 2
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 2
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 2
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 2
- LFFOJBOTZUWINF-ZANVPECISA-N Ala-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O)=CNC2=C1 LFFOJBOTZUWINF-ZANVPECISA-N 0.000 description 2
- KLKARCOHVHLAJP-UWJYBYFXSA-N Ala-Tyr-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CS)C(O)=O KLKARCOHVHLAJP-UWJYBYFXSA-N 0.000 description 2
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 2
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 2
- BTJVOUQWFXABOI-IHRRRGAJSA-N Arg-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCNC(N)=N BTJVOUQWFXABOI-IHRRRGAJSA-N 0.000 description 2
- JCROZIFVIYMXHM-GUBZILKMSA-N Arg-Met-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCCN=C(N)N JCROZIFVIYMXHM-GUBZILKMSA-N 0.000 description 2
- RRVBEKYEFMCDIF-WHFBIAKZSA-N Asn-Cys-Gly Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N)C(=O)N RRVBEKYEFMCDIF-WHFBIAKZSA-N 0.000 description 2
- HJRBIWRXULGMOA-ACZMJKKPSA-N Asn-Gln-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJRBIWRXULGMOA-ACZMJKKPSA-N 0.000 description 2
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 2
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 2
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 description 2
- ZAESWDKAMDVHLL-RCOVLWMOSA-N Asn-Val-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O ZAESWDKAMDVHLL-RCOVLWMOSA-N 0.000 description 2
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 2
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 2
- PSLSTUMPZILTAH-BYULHYEWSA-N Asp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PSLSTUMPZILTAH-BYULHYEWSA-N 0.000 description 2
- RQYMKRMRZWJGHC-BQBZGAKWSA-N Asp-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N RQYMKRMRZWJGHC-BQBZGAKWSA-N 0.000 description 2
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 2
- YQKYLDVPCOGIRB-SEKJGCFDSA-N Asp-Leu-Thr-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YQKYLDVPCOGIRB-SEKJGCFDSA-N 0.000 description 2
- JUWISGAGWSDGDH-KKUMJFAQSA-N Asp-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=CC=C1 JUWISGAGWSDGDH-KKUMJFAQSA-N 0.000 description 2
- DZLQXIFVQFTFJY-BYPYZUCNSA-N Cys-Gly-Gly Chemical compound SC[C@H](N)C(=O)NCC(=O)NCC(O)=O DZLQXIFVQFTFJY-BYPYZUCNSA-N 0.000 description 2
- UIKLEGZPIOXFHJ-DLOVCJGASA-N Cys-Phe-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O UIKLEGZPIOXFHJ-DLOVCJGASA-N 0.000 description 2
- IPHGBVYWRKCGKG-FXQIFTODSA-N Gln-Cys-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O IPHGBVYWRKCGKG-FXQIFTODSA-N 0.000 description 2
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 2
- NKSGKPWXSWBRRX-ACZMJKKPSA-N Glu-Asn-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N NKSGKPWXSWBRRX-ACZMJKKPSA-N 0.000 description 2
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 2
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 2
- ALMBZBOCGSVSAI-ACZMJKKPSA-N Glu-Ser-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ALMBZBOCGSVSAI-ACZMJKKPSA-N 0.000 description 2
- DTLLNDVORUEOTM-WDCWCFNPSA-N Glu-Thr-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DTLLNDVORUEOTM-WDCWCFNPSA-N 0.000 description 2
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 2
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 2
- GZUKEVBTYNNUQF-WDSKDSINSA-N Gly-Ala-Gln Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GZUKEVBTYNNUQF-WDSKDSINSA-N 0.000 description 2
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 2
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 2
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 2
- LEGMTEAZGRRIMY-ZKWXMUAHSA-N Gly-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN LEGMTEAZGRRIMY-ZKWXMUAHSA-N 0.000 description 2
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 2
- DTRUBYPMMVPQPD-YUMQZZPRSA-N Gly-Gln-Arg Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DTRUBYPMMVPQPD-YUMQZZPRSA-N 0.000 description 2
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 2
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 2
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 2
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 2
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 2
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 2
- FQKKPCWTZZEDIC-XPUUQOCRSA-N Gly-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 FQKKPCWTZZEDIC-XPUUQOCRSA-N 0.000 description 2
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 2
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 2
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 2
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 2
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 2
- CQMFNTVQVLQRLT-JHEQGTHGSA-N Gly-Thr-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CQMFNTVQVLQRLT-JHEQGTHGSA-N 0.000 description 2
- MJNWEIMBXKKCSF-XVYDVKMFSA-N His-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N MJNWEIMBXKKCSF-XVYDVKMFSA-N 0.000 description 2
- DCRODRAURLJOFY-XPUUQOCRSA-N His-Ala-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)NCC(O)=O DCRODRAURLJOFY-XPUUQOCRSA-N 0.000 description 2
- HDXNWVLQSQFJOX-SRVKXCTJSA-N His-Arg-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HDXNWVLQSQFJOX-SRVKXCTJSA-N 0.000 description 2
- HQKADFMLECZIQJ-HVTMNAMFSA-N His-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N HQKADFMLECZIQJ-HVTMNAMFSA-N 0.000 description 2
- YADRBUZBKHHDAO-XPUUQOCRSA-N His-Gly-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C)C(O)=O YADRBUZBKHHDAO-XPUUQOCRSA-N 0.000 description 2
- IGBBXBFSLKRHJB-BZSNNMDCSA-N His-Lys-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 IGBBXBFSLKRHJB-BZSNNMDCSA-N 0.000 description 2
- KQJBFMJFUXAYPK-AVGNSLFASA-N His-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N KQJBFMJFUXAYPK-AVGNSLFASA-N 0.000 description 2
- CUEQQFOGARVNHU-VGDYDELISA-N His-Ser-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUEQQFOGARVNHU-VGDYDELISA-N 0.000 description 2
- VQUCKIAECLVLAD-SVSWQMSJSA-N Ile-Cys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VQUCKIAECLVLAD-SVSWQMSJSA-N 0.000 description 2
- YBGTWSFIGHUWQE-MXAVVETBSA-N Ile-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CN=CN1 YBGTWSFIGHUWQE-MXAVVETBSA-N 0.000 description 2
- URWXDJAEEGBADB-TUBUOCAGSA-N Ile-His-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N URWXDJAEEGBADB-TUBUOCAGSA-N 0.000 description 2
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 2
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 2
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 2
- FFAUOCITXBMRBT-YTFOTSKYSA-N Ile-Lys-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FFAUOCITXBMRBT-YTFOTSKYSA-N 0.000 description 2
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 2
- IIWQTXMUALXGOV-PCBIJLKTSA-N Ile-Phe-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IIWQTXMUALXGOV-PCBIJLKTSA-N 0.000 description 2
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 2
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 2
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 2
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 2
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 2
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 2
- SXOFUVGLPHCPRQ-KKUMJFAQSA-N Leu-Tyr-Cys Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(O)=O SXOFUVGLPHCPRQ-KKUMJFAQSA-N 0.000 description 2
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 2
- FMFNIDICDKEMOE-XUXIUFHCSA-N Leu-Val-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMFNIDICDKEMOE-XUXIUFHCSA-N 0.000 description 2
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 2
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 2
- MPOHDJKRBLVGCT-CIUDSAMLSA-N Lys-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N MPOHDJKRBLVGCT-CIUDSAMLSA-N 0.000 description 2
- VHNOAIFVYUQOOY-XUXIUFHCSA-N Lys-Arg-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VHNOAIFVYUQOOY-XUXIUFHCSA-N 0.000 description 2
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 2
- QQYRCUXKLDGCQN-SRVKXCTJSA-N Lys-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N QQYRCUXKLDGCQN-SRVKXCTJSA-N 0.000 description 2
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 2
- YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 description 2
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 2
- WPTDJKDGICUFCP-XUXIUFHCSA-N Met-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCSC)N WPTDJKDGICUFCP-XUXIUFHCSA-N 0.000 description 2
- XOFDBXYPKZUAAM-GUBZILKMSA-N Met-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N XOFDBXYPKZUAAM-GUBZILKMSA-N 0.000 description 2
- XPVCDCMPKCERFT-GUBZILKMSA-N Met-Ser-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XPVCDCMPKCERFT-GUBZILKMSA-N 0.000 description 2
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 2
- 229930182555 Penicillin Natural products 0.000 description 2
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 2
- GNUCSNWOCQFMMC-UFYCRDLUSA-N Phe-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 GNUCSNWOCQFMMC-UFYCRDLUSA-N 0.000 description 2
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 2
- YYKZDTVQHTUKDW-RYUDHWBXSA-N Phe-Gly-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N YYKZDTVQHTUKDW-RYUDHWBXSA-N 0.000 description 2
- PEFJUUYFEGBXFA-BZSNNMDCSA-N Phe-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 PEFJUUYFEGBXFA-BZSNNMDCSA-N 0.000 description 2
- APZNYJFGVAGFCF-JYJNAYRXSA-N Phe-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccccc1)C(C)C)C(O)=O APZNYJFGVAGFCF-JYJNAYRXSA-N 0.000 description 2
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 2
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 2
- ORPZXBQTEHINPB-SRVKXCTJSA-N Pro-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H]1CCCN1)C(O)=O ORPZXBQTEHINPB-SRVKXCTJSA-N 0.000 description 2
- VJLJGKQAOQJXJG-CIUDSAMLSA-N Pro-Asp-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJLJGKQAOQJXJG-CIUDSAMLSA-N 0.000 description 2
- ZBAGOWGNNAXMOY-IHRRRGAJSA-N Pro-Cys-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZBAGOWGNNAXMOY-IHRRRGAJSA-N 0.000 description 2
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 2
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 2
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 2
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 2
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 2
- 108010003201 RGH 0205 Proteins 0.000 description 2
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 2
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 2
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 2
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 2
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 2
- YPUSXTWURJANKF-KBIXCLLPSA-N Ser-Gln-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YPUSXTWURJANKF-KBIXCLLPSA-N 0.000 description 2
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 2
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 2
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 2
- QSHKTZVJGDVFEW-GUBZILKMSA-N Ser-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CO)N QSHKTZVJGDVFEW-GUBZILKMSA-N 0.000 description 2
- BUYHXYIUQUBEQP-AVGNSLFASA-N Ser-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N BUYHXYIUQUBEQP-AVGNSLFASA-N 0.000 description 2
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 2
- NINIDFKCEFEMDL-UHFFFAOYSA-N Sulfur Chemical compound [S] NINIDFKCEFEMDL-UHFFFAOYSA-N 0.000 description 2
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 2
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 2
- ZTPXSEUVYNNZRB-CDMKHQONSA-N Thr-Gly-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZTPXSEUVYNNZRB-CDMKHQONSA-N 0.000 description 2
- FKIGTIXHSRNKJU-IXOXFDKPSA-N Thr-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CN=CN1 FKIGTIXHSRNKJU-IXOXFDKPSA-N 0.000 description 2
- SXAGUVRFGJSFKC-ZEILLAHLSA-N Thr-His-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SXAGUVRFGJSFKC-ZEILLAHLSA-N 0.000 description 2
- KRDSCBLRHORMRK-JXUBOQSCSA-N Thr-Lys-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O KRDSCBLRHORMRK-JXUBOQSCSA-N 0.000 description 2
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 2
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 2
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 2
- PRONOHBTMLNXCZ-BZSNNMDCSA-N Tyr-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PRONOHBTMLNXCZ-BZSNNMDCSA-N 0.000 description 2
- WYOBRXPIZVKNMF-IRXDYDNUSA-N Tyr-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 WYOBRXPIZVKNMF-IRXDYDNUSA-N 0.000 description 2
- GOPQNCQSXBJAII-ULQDDVLXSA-N Tyr-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GOPQNCQSXBJAII-ULQDDVLXSA-N 0.000 description 2
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 2
- ZSZFTYVFQLUWBF-QXEWZRGKSA-N Val-Asp-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N ZSZFTYVFQLUWBF-QXEWZRGKSA-N 0.000 description 2
- XKVXSCHXGJOQND-ZOBUZTSGSA-N Val-Asp-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N XKVXSCHXGJOQND-ZOBUZTSGSA-N 0.000 description 2
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 2
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 2
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 2
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 2
- PTFPUAXGIKTVNN-ONGXEEELSA-N Val-His-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N PTFPUAXGIKTVNN-ONGXEEELSA-N 0.000 description 2
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 2
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 2
- GQMNEJMFMCJJTD-NHCYSSNCSA-N Val-Pro-Gln Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O GQMNEJMFMCJJTD-NHCYSSNCSA-N 0.000 description 2
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 2
- VSCIANXXVZOYOC-AVGNSLFASA-N Val-Pro-His Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VSCIANXXVZOYOC-AVGNSLFASA-N 0.000 description 2
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 2
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 2
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 description 2
- WBPFYNYTYASCQP-CYDGBPFRSA-N Val-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N WBPFYNYTYASCQP-CYDGBPFRSA-N 0.000 description 2
- 108010087924 alanylproline Proteins 0.000 description 2
- 108010013835 arginine glutamate Proteins 0.000 description 2
- 108010047857 aspartylglycine Proteins 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- 238000012258 culturing Methods 0.000 description 2
- 208000010643 digestive system disease Diseases 0.000 description 2
- 108010054812 diprotin A Proteins 0.000 description 2
- 108020001507 fusion proteins Proteins 0.000 description 2
- 102000037865 fusion proteins Human genes 0.000 description 2
- 208000018685 gastrointestinal system disease Diseases 0.000 description 2
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 2
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 2
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 2
- 108010084389 glycyltryptophan Proteins 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 108010076718 lysyl-glutamyl-tryptophan Proteins 0.000 description 2
- 108010075702 lysyl-valyl-aspartyl-leucine Proteins 0.000 description 2
- 230000014759 maintenance of location Effects 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 230000033116 oxidation-reduction process Effects 0.000 description 2
- 229940049954 penicillin Drugs 0.000 description 2
- 239000011541 reaction mixture Substances 0.000 description 2
- 238000003259 recombinant expression Methods 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 102220233287 rs762138576 Human genes 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 108010005652 splenotritin Proteins 0.000 description 2
- 238000011426 transformation method Methods 0.000 description 2
- 108700004896 tripeptide FEG Proteins 0.000 description 2
- 241001148471 unidentified anaerobic bacterium Species 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- 108010000998 wheylin-2 peptide Proteins 0.000 description 2
- PIPTUBPKYFRLCP-NHCYSSNCSA-N Ala-Ala-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PIPTUBPKYFRLCP-NHCYSSNCSA-N 0.000 description 1
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 1
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 1
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 description 1
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 1
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 1
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 1
- CXZFXHGJJPVUJE-CIUDSAMLSA-N Ala-Cys-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O)N CXZFXHGJJPVUJE-CIUDSAMLSA-N 0.000 description 1
- MIPWEZAIMPYQST-FXQIFTODSA-N Ala-Cys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O MIPWEZAIMPYQST-FXQIFTODSA-N 0.000 description 1
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 1
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 1
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 1
- RGQCNKIDEQJEBT-CQDKDKBSSA-N Ala-Leu-Tyr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 RGQCNKIDEQJEBT-CQDKDKBSSA-N 0.000 description 1
- IHMCQESUJVZTKW-UBHSHLNASA-N Ala-Phe-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 IHMCQESUJVZTKW-UBHSHLNASA-N 0.000 description 1
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 1
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 1
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 1
- SBVJJNJLFWSJOV-UBHSHLNASA-N Arg-Ala-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SBVJJNJLFWSJOV-UBHSHLNASA-N 0.000 description 1
- BIOCIVSVEDFKDJ-GUBZILKMSA-N Arg-Arg-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O BIOCIVSVEDFKDJ-GUBZILKMSA-N 0.000 description 1
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 1
- DPXDVGDLWJYZBH-GUBZILKMSA-N Arg-Asn-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DPXDVGDLWJYZBH-GUBZILKMSA-N 0.000 description 1
- MFAMTAVAFBPXDC-LPEHRKFASA-N Arg-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O MFAMTAVAFBPXDC-LPEHRKFASA-N 0.000 description 1
- OBFTYSPXDRROQO-SRVKXCTJSA-N Arg-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCN=C(N)N OBFTYSPXDRROQO-SRVKXCTJSA-N 0.000 description 1
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 1
- HAVKMRGWNXMCDR-STQMWFEESA-N Arg-Gly-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HAVKMRGWNXMCDR-STQMWFEESA-N 0.000 description 1
- YKBHOXLMMPZPHQ-GMOBBJLQSA-N Arg-Ile-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O YKBHOXLMMPZPHQ-GMOBBJLQSA-N 0.000 description 1
- OKKMBOSPBDASEP-CYDGBPFRSA-N Arg-Ile-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O OKKMBOSPBDASEP-CYDGBPFRSA-N 0.000 description 1
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 1
- DNUKXVMPARLPFN-XUXIUFHCSA-N Arg-Leu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DNUKXVMPARLPFN-XUXIUFHCSA-N 0.000 description 1
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 1
- MJINRRBEMOLJAK-DCAQKATOSA-N Arg-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N MJINRRBEMOLJAK-DCAQKATOSA-N 0.000 description 1
- HIMXTOIXVXWHTB-DCAQKATOSA-N Arg-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N HIMXTOIXVXWHTB-DCAQKATOSA-N 0.000 description 1
- NYDIVDKTULRINZ-AVGNSLFASA-N Arg-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NYDIVDKTULRINZ-AVGNSLFASA-N 0.000 description 1
- ZEBDYGZVMMKZNB-SRVKXCTJSA-N Arg-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N ZEBDYGZVMMKZNB-SRVKXCTJSA-N 0.000 description 1
- YTMKMRSYXHBGER-IHRRRGAJSA-N Arg-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YTMKMRSYXHBGER-IHRRRGAJSA-N 0.000 description 1
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 1
- SYFHFLGAROUHNT-VEVYYDQMSA-N Arg-Thr-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SYFHFLGAROUHNT-VEVYYDQMSA-N 0.000 description 1
- VLIJAPRTSXSGFY-STQMWFEESA-N Arg-Tyr-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 VLIJAPRTSXSGFY-STQMWFEESA-N 0.000 description 1
- SWLOHUMCUDRTCL-ZLUOBGJFSA-N Asn-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N SWLOHUMCUDRTCL-ZLUOBGJFSA-N 0.000 description 1
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 1
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 1
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 1
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 1
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 1
- NLDNNZKUSLAYFW-NHCYSSNCSA-N Asn-Lys-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLDNNZKUSLAYFW-NHCYSSNCSA-N 0.000 description 1
- NNDSLVWAQAUPPP-GUBZILKMSA-N Asn-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N NNDSLVWAQAUPPP-GUBZILKMSA-N 0.000 description 1
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 1
- AWXDRZJQCVHCIT-DCAQKATOSA-N Asn-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O AWXDRZJQCVHCIT-DCAQKATOSA-N 0.000 description 1
- BUVNWKQBMZLCDW-UGYAYLCHSA-N Asp-Asn-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BUVNWKQBMZLCDW-UGYAYLCHSA-N 0.000 description 1
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 1
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 1
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 1
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 1
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 1
- KFAFUJMGHVVYRC-DCAQKATOSA-N Asp-Leu-Met Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O KFAFUJMGHVVYRC-DCAQKATOSA-N 0.000 description 1
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 1
- UCHSVZYJKJLPHF-BZSNNMDCSA-N Asp-Phe-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UCHSVZYJKJLPHF-BZSNNMDCSA-N 0.000 description 1
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 1
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 1
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 1
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 1
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 1
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 1
- KBJVTFWQWXCYCQ-IUKAMOBKSA-N Asp-Thr-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KBJVTFWQWXCYCQ-IUKAMOBKSA-N 0.000 description 1
- LTARLVHGOGBRHN-AAEUAGOBSA-N Asp-Trp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O LTARLVHGOGBRHN-AAEUAGOBSA-N 0.000 description 1
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 1
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 1
- 241000222173 Candida parapsilosis Species 0.000 description 1
- 241000816681 Candidatus Abyssubacteria Species 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- QYKJOVAXAKTKBR-FXQIFTODSA-N Cys-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N QYKJOVAXAKTKBR-FXQIFTODSA-N 0.000 description 1
- BPHKULHWEIUDOB-FXQIFTODSA-N Cys-Gln-Gln Chemical compound SC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BPHKULHWEIUDOB-FXQIFTODSA-N 0.000 description 1
- MBILEVLLOHJZMG-FXQIFTODSA-N Cys-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N MBILEVLLOHJZMG-FXQIFTODSA-N 0.000 description 1
- 108020005199 Dehydrogenases Proteins 0.000 description 1
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 1
- 241000244332 Flavobacteriaceae Species 0.000 description 1
- JFOKLAPFYCTNHW-SRVKXCTJSA-N Gln-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N JFOKLAPFYCTNHW-SRVKXCTJSA-N 0.000 description 1
- JFSNBQJNDMXMQF-XHNCKOQMSA-N Gln-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JFSNBQJNDMXMQF-XHNCKOQMSA-N 0.000 description 1
- COYGBRTZEVWZBW-XKBZYTNZSA-N Gln-Cys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(N)=O COYGBRTZEVWZBW-XKBZYTNZSA-N 0.000 description 1
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 1
- GNMQDOGFWYWPNM-LAEOZQHASA-N Gln-Gly-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)CNC(=O)[C@@H](N)CCC(N)=O)C(O)=O GNMQDOGFWYWPNM-LAEOZQHASA-N 0.000 description 1
- ORYMMTRPKVTGSJ-XVKPBYJWSA-N Gln-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O ORYMMTRPKVTGSJ-XVKPBYJWSA-N 0.000 description 1
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 1
- MLCPTRRNICEKIS-FXQIFTODSA-N Glu-Asn-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLCPTRRNICEKIS-FXQIFTODSA-N 0.000 description 1
- LJLPOZGRPLORTF-CIUDSAMLSA-N Glu-Asn-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O LJLPOZGRPLORTF-CIUDSAMLSA-N 0.000 description 1
- QPRZKNOOOBWXSU-CIUDSAMLSA-N Glu-Asp-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N QPRZKNOOOBWXSU-CIUDSAMLSA-N 0.000 description 1
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 1
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 1
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 description 1
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 1
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 1
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 1
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 1
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 1
- FMBWLLMUPXTXFC-SDDRHHMPSA-N Glu-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N)C(=O)O FMBWLLMUPXTXFC-SDDRHHMPSA-N 0.000 description 1
- YRMZCZIRHYCNHX-RYUDHWBXSA-N Glu-Phe-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O YRMZCZIRHYCNHX-RYUDHWBXSA-N 0.000 description 1
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 1
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 1
- QOXDAWODGSIDDI-GUBZILKMSA-N Glu-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N QOXDAWODGSIDDI-GUBZILKMSA-N 0.000 description 1
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 1
- QEJKKJNDDDPSMU-KKUMJFAQSA-N Glu-Tyr-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(O)=O QEJKKJNDDDPSMU-KKUMJFAQSA-N 0.000 description 1
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 1
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 1
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 1
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 1
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 1
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 1
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 1
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 1
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 1
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 1
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 1
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 1
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 1
- RUDRIZRGOLQSMX-IUCAKERBSA-N Gly-Met-Met Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O RUDRIZRGOLQSMX-IUCAKERBSA-N 0.000 description 1
- IEGFSKKANYKBDU-QWHCGFSZSA-N Gly-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)CN)C(=O)O IEGFSKKANYKBDU-QWHCGFSZSA-N 0.000 description 1
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 1
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 1
- OCRQUYDOYKCOQG-IRXDYDNUSA-N Gly-Tyr-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 OCRQUYDOYKCOQG-IRXDYDNUSA-N 0.000 description 1
- NGBGZCUWFVVJKC-IRXDYDNUSA-N Gly-Tyr-Tyr Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NGBGZCUWFVVJKC-IRXDYDNUSA-N 0.000 description 1
- UCDWNBFOZCZSNV-AVGNSLFASA-N His-Arg-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O UCDWNBFOZCZSNV-AVGNSLFASA-N 0.000 description 1
- JFFAPRNXXLRINI-NHCYSSNCSA-N His-Asp-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JFFAPRNXXLRINI-NHCYSSNCSA-N 0.000 description 1
- CYHWWHKRCKHYGQ-GUBZILKMSA-N His-Cys-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N CYHWWHKRCKHYGQ-GUBZILKMSA-N 0.000 description 1
- MJUUWJJEUOBDGW-IHRRRGAJSA-N His-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 MJUUWJJEUOBDGW-IHRRRGAJSA-N 0.000 description 1
- BSVLMPMIXPQNKC-KBPBESRZSA-N His-Phe-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O BSVLMPMIXPQNKC-KBPBESRZSA-N 0.000 description 1
- ALPXXNRQBMRCPZ-MEYUZBJRSA-N His-Thr-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ALPXXNRQBMRCPZ-MEYUZBJRSA-N 0.000 description 1
- PUFNQIPSRXVLQJ-IHRRRGAJSA-N His-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N PUFNQIPSRXVLQJ-IHRRRGAJSA-N 0.000 description 1
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 1
- HGNUKGZQASSBKQ-PCBIJLKTSA-N Ile-Asp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HGNUKGZQASSBKQ-PCBIJLKTSA-N 0.000 description 1
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 1
- KYLIZSDYWQQTFM-PEDHHIEDSA-N Ile-Ile-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N KYLIZSDYWQQTFM-PEDHHIEDSA-N 0.000 description 1
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 1
- FQYQMFCIJNWDQZ-CYDGBPFRSA-N Ile-Pro-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 FQYQMFCIJNWDQZ-CYDGBPFRSA-N 0.000 description 1
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 1
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 1
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 1
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- 240000001929 Lactobacillus brevis Species 0.000 description 1
- 235000013957 Lactobacillus brevis Nutrition 0.000 description 1
- 240000000599 Lentinula edodes Species 0.000 description 1
- 235000001715 Lentinula edodes Nutrition 0.000 description 1
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 1
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- JQSXWJXBASFONF-KKUMJFAQSA-N Leu-Asp-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JQSXWJXBASFONF-KKUMJFAQSA-N 0.000 description 1
- GBDMISNMNXVTNV-XIRDDKMYSA-N Leu-Asp-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O GBDMISNMNXVTNV-XIRDDKMYSA-N 0.000 description 1
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 1
- FEHQLKKBVJHSEC-SZMVWBNQSA-N Leu-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FEHQLKKBVJHSEC-SZMVWBNQSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- ORWTWZXGDBYVCP-BJDJZHNGSA-N Leu-Ile-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(C)C ORWTWZXGDBYVCP-BJDJZHNGSA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 1
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 1
- PPQRKXHCLYCBSP-IHRRRGAJSA-N Leu-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N PPQRKXHCLYCBSP-IHRRRGAJSA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 1
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 1
- NHRINZSPIUXYQZ-DCAQKATOSA-N Leu-Met-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)O)N NHRINZSPIUXYQZ-DCAQKATOSA-N 0.000 description 1
- AUNMOHYWTAPQLA-XUXIUFHCSA-N Leu-Met-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AUNMOHYWTAPQLA-XUXIUFHCSA-N 0.000 description 1
- IBSGMIPRBMPMHE-IHRRRGAJSA-N Leu-Met-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O IBSGMIPRBMPMHE-IHRRRGAJSA-N 0.000 description 1
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 1
- 108090001060 Lipase Proteins 0.000 description 1
- 102000004882 Lipase Human genes 0.000 description 1
- 239000004367 Lipase Substances 0.000 description 1
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 1
- SJNZALDHDUYDBU-IHRRRGAJSA-N Lys-Arg-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(O)=O SJNZALDHDUYDBU-IHRRRGAJSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 1
- MLLKLNYPZRDIQG-GUBZILKMSA-N Lys-Cys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N MLLKLNYPZRDIQG-GUBZILKMSA-N 0.000 description 1
- OPTCSTACHGNULU-DCAQKATOSA-N Lys-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCCCN OPTCSTACHGNULU-DCAQKATOSA-N 0.000 description 1
- GHOIOYHDDKXIDX-SZMVWBNQSA-N Lys-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 GHOIOYHDDKXIDX-SZMVWBNQSA-N 0.000 description 1
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 1
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 1
- JZMGVXLDOQOKAH-UWVGGRQHSA-N Lys-Gly-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O JZMGVXLDOQOKAH-UWVGGRQHSA-N 0.000 description 1
- NNKLKUUGESXCBS-KBPBESRZSA-N Lys-Gly-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NNKLKUUGESXCBS-KBPBESRZSA-N 0.000 description 1
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- ALEVUGKHINJNIF-QEJZJMRPSA-N Lys-Phe-Ala Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ALEVUGKHINJNIF-QEJZJMRPSA-N 0.000 description 1
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 description 1
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 description 1
- JQEBITVYKUCBMC-SRVKXCTJSA-N Met-Arg-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JQEBITVYKUCBMC-SRVKXCTJSA-N 0.000 description 1
- WDTLNWHPIPCMMP-AVGNSLFASA-N Met-Arg-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O WDTLNWHPIPCMMP-AVGNSLFASA-N 0.000 description 1
- OLWAOWXIADGIJG-AVGNSLFASA-N Met-Arg-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(O)=O OLWAOWXIADGIJG-AVGNSLFASA-N 0.000 description 1
- YLLWCSDBVGZLOW-CIUDSAMLSA-N Met-Gln-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O YLLWCSDBVGZLOW-CIUDSAMLSA-N 0.000 description 1
- OXHSZBRPUGNMKW-DCAQKATOSA-N Met-Gln-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OXHSZBRPUGNMKW-DCAQKATOSA-N 0.000 description 1
- CRGKLOXHKICQOL-GARJFASQSA-N Met-Gln-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N CRGKLOXHKICQOL-GARJFASQSA-N 0.000 description 1
- KQBJYJXPZBNEIK-DCAQKATOSA-N Met-Glu-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQBJYJXPZBNEIK-DCAQKATOSA-N 0.000 description 1
- DGNZGCQSVGGYJS-BQBZGAKWSA-N Met-Gly-Asp Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O DGNZGCQSVGGYJS-BQBZGAKWSA-N 0.000 description 1
- BMHIFARYXOJDLD-WPRPVWTQSA-N Met-Gly-Val Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O BMHIFARYXOJDLD-WPRPVWTQSA-N 0.000 description 1
- QGRJTULYDZUBAY-ZPFDUUQYSA-N Met-Ile-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGRJTULYDZUBAY-ZPFDUUQYSA-N 0.000 description 1
- MVMNUCOHQGYYKB-PEDHHIEDSA-N Met-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCSC)N MVMNUCOHQGYYKB-PEDHHIEDSA-N 0.000 description 1
- PHKBGZKVOJCIMZ-SRVKXCTJSA-N Met-Pro-Arg Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PHKBGZKVOJCIMZ-SRVKXCTJSA-N 0.000 description 1
- KSIPKXNIQOWMIC-RCWTZXSCSA-N Met-Thr-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KSIPKXNIQOWMIC-RCWTZXSCSA-N 0.000 description 1
- QYIGOFGUOVTAHK-ZJDVBMNYSA-N Met-Thr-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QYIGOFGUOVTAHK-ZJDVBMNYSA-N 0.000 description 1
- LPNWWHBFXPNHJG-AVGNSLFASA-N Met-Val-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN LPNWWHBFXPNHJG-AVGNSLFASA-N 0.000 description 1
- 241000192121 Nitrospira <genus> Species 0.000 description 1
- 241001219697 Nitrospira sp. Species 0.000 description 1
- AJOKKVTWEMXZHC-DRZSPHRISA-N Phe-Ala-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 AJOKKVTWEMXZHC-DRZSPHRISA-N 0.000 description 1
- AWAYOWOUGVZXOB-BZSNNMDCSA-N Phe-Asn-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 AWAYOWOUGVZXOB-BZSNNMDCSA-N 0.000 description 1
- VJLLEKDQJSMHRU-STQMWFEESA-N Phe-Gly-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O VJLLEKDQJSMHRU-STQMWFEESA-N 0.000 description 1
- NHCKESBLOMHIIE-IRXDYDNUSA-N Phe-Gly-Phe Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 NHCKESBLOMHIIE-IRXDYDNUSA-N 0.000 description 1
- VADLTGVIOIOKGM-BZSNNMDCSA-N Phe-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CN=CN1 VADLTGVIOIOKGM-BZSNNMDCSA-N 0.000 description 1
- PBXYXOAEQQUVMM-ULQDDVLXSA-N Phe-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N PBXYXOAEQQUVMM-ULQDDVLXSA-N 0.000 description 1
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 1
- KLXQWABNAWDRAY-ACRUOGEOSA-N Phe-Lys-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 KLXQWABNAWDRAY-ACRUOGEOSA-N 0.000 description 1
- GPSMLZQVIIYLDK-ULQDDVLXSA-N Phe-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O GPSMLZQVIIYLDK-ULQDDVLXSA-N 0.000 description 1
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 1
- FGWUALWGCZJQDJ-URLPEUOOSA-N Phe-Thr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGWUALWGCZJQDJ-URLPEUOOSA-N 0.000 description 1
- PTDAGKJHZBGDKD-OEAJRASXSA-N Phe-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O PTDAGKJHZBGDKD-OEAJRASXSA-N 0.000 description 1
- QSKCKTUQPICLSO-AVGNSLFASA-N Pro-Arg-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O QSKCKTUQPICLSO-AVGNSLFASA-N 0.000 description 1
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 1
- WIPAMEKBSHNFQE-IUCAKERBSA-N Pro-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@@H]1CCCN1 WIPAMEKBSHNFQE-IUCAKERBSA-N 0.000 description 1
- AUYKOPJPKUCYHE-SRVKXCTJSA-N Pro-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 AUYKOPJPKUCYHE-SRVKXCTJSA-N 0.000 description 1
- AWQGDZBKQTYNMN-IHRRRGAJSA-N Pro-Phe-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)O)C(=O)O AWQGDZBKQTYNMN-IHRRRGAJSA-N 0.000 description 1
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 1
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 240000005319 Sedum acre Species 0.000 description 1
- 235000014327 Sedum acre Nutrition 0.000 description 1
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 1
- HVKMTOIAYDOJPL-NRPADANISA-N Ser-Gln-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVKMTOIAYDOJPL-NRPADANISA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 1
- JEHPKECJCALLRW-CUJWVEQBSA-N Ser-His-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEHPKECJCALLRW-CUJWVEQBSA-N 0.000 description 1
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 1
- VIIJCAQMJBHSJH-FXQIFTODSA-N Ser-Met-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O VIIJCAQMJBHSJH-FXQIFTODSA-N 0.000 description 1
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 1
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 1
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 1
- 239000005864 Sulphur Substances 0.000 description 1
- 241001052560 Thallis Species 0.000 description 1
- GFDUZZACIWNMPE-KZVJFYERSA-N Thr-Ala-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O GFDUZZACIWNMPE-KZVJFYERSA-N 0.000 description 1
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 1
- QNJZOAHSYPXTAB-VEVYYDQMSA-N Thr-Asn-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O QNJZOAHSYPXTAB-VEVYYDQMSA-N 0.000 description 1
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 1
- MQUZMZBFKCHVOB-HJGDQZAQSA-N Thr-Gln-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O MQUZMZBFKCHVOB-HJGDQZAQSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- NQVDGKYAUHTCME-QTKMDUPCSA-N Thr-His-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O NQVDGKYAUHTCME-QTKMDUPCSA-N 0.000 description 1
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 1
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 1
- KDGBLMDAPJTQIW-RHYQMDGZSA-N Thr-Met-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N)O KDGBLMDAPJTQIW-RHYQMDGZSA-N 0.000 description 1
- DNCUODYZAMHLCV-XGEHTFHBSA-N Thr-Pro-Cys Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N)O DNCUODYZAMHLCV-XGEHTFHBSA-N 0.000 description 1
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 1
- 241000223997 Toxoplasma gondii Species 0.000 description 1
- AOAMKFFPFOPMLX-BVSLBCMMSA-N Trp-Arg-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 AOAMKFFPFOPMLX-BVSLBCMMSA-N 0.000 description 1
- YXSSXUIBUJGHJY-SFJXLCSZSA-N Trp-Thr-Phe Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)[C@H](O)C)C(O)=O)C1=CC=CC=C1 YXSSXUIBUJGHJY-SFJXLCSZSA-N 0.000 description 1
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 1
- CWQZAUYFWRLITN-AVGNSLFASA-N Tyr-Gln-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O CWQZAUYFWRLITN-AVGNSLFASA-N 0.000 description 1
- OLWFDNLLBWQWCP-STQMWFEESA-N Tyr-Gly-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O OLWFDNLLBWQWCP-STQMWFEESA-N 0.000 description 1
- ZOBLBMGJKVJVEV-BZSNNMDCSA-N Tyr-Lys-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O ZOBLBMGJKVJVEV-BZSNNMDCSA-N 0.000 description 1
- CWVHKVVKAQIJKY-ACRUOGEOSA-N Tyr-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=C(C=C2)O)N CWVHKVVKAQIJKY-ACRUOGEOSA-N 0.000 description 1
- OFHKXNKJXURPSY-ULQDDVLXSA-N Tyr-Met-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O OFHKXNKJXURPSY-ULQDDVLXSA-N 0.000 description 1
- OKDNSNWJEXAMSU-IRXDYDNUSA-N Tyr-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 OKDNSNWJEXAMSU-IRXDYDNUSA-N 0.000 description 1
- SCZJKZLFSSPJDP-ACRUOGEOSA-N Tyr-Phe-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SCZJKZLFSSPJDP-ACRUOGEOSA-N 0.000 description 1
- KLOZTPOXVVRVAQ-DZKIICNBSA-N Tyr-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 KLOZTPOXVVRVAQ-DZKIICNBSA-N 0.000 description 1
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 description 1
- XEYUMGGWQCIWAR-XVKPBYJWSA-N Val-Gln-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N XEYUMGGWQCIWAR-XVKPBYJWSA-N 0.000 description 1
- PGBJAZDAEWPDAA-NHCYSSNCSA-N Val-Gln-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N PGBJAZDAEWPDAA-NHCYSSNCSA-N 0.000 description 1
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 1
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 1
- AHHJARQXFFGOKF-NRPADANISA-N Val-Glu-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N AHHJARQXFFGOKF-NRPADANISA-N 0.000 description 1
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 1
- RKIGNDAHUOOIMJ-BQFCYCMXSA-N Val-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 RKIGNDAHUOOIMJ-BQFCYCMXSA-N 0.000 description 1
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 1
- MDYSKHBSPXUOPV-JSGCOSHPSA-N Val-Gly-Phe Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MDYSKHBSPXUOPV-JSGCOSHPSA-N 0.000 description 1
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 1
- OPGWZDIYEYJVRX-AVGNSLFASA-N Val-His-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N OPGWZDIYEYJVRX-AVGNSLFASA-N 0.000 description 1
- OACSGBOREVRSME-NHCYSSNCSA-N Val-His-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CC(N)=O)C(O)=O OACSGBOREVRSME-NHCYSSNCSA-N 0.000 description 1
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 1
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 1
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 1
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 1
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 1
- UEPLNXPLHJUYPT-AVGNSLFASA-N Val-Met-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O UEPLNXPLHJUYPT-AVGNSLFASA-N 0.000 description 1
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 1
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 1
- YTNGABPUXFEOGU-SRVKXCTJSA-N Val-Pro-Arg Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTNGABPUXFEOGU-SRVKXCTJSA-N 0.000 description 1
- ZXYPHBKIZLAQTL-QXEWZRGKSA-N Val-Pro-Asp Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZXYPHBKIZLAQTL-QXEWZRGKSA-N 0.000 description 1
- BGXVHVMJZCSOCA-AVGNSLFASA-N Val-Pro-Lys Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N BGXVHVMJZCSOCA-AVGNSLFASA-N 0.000 description 1
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 1
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 1
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 1
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 239000003905 agrochemical Substances 0.000 description 1
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- MNFORVFSTILPAW-UHFFFAOYSA-N azetidin-2-one Chemical compound O=C1CCN1 MNFORVFSTILPAW-UHFFFAOYSA-N 0.000 description 1
- 239000003782 beta lactam antibiotic agent Substances 0.000 description 1
- 230000002210 biocatalytic effect Effects 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- HXCHCVDVKSCDHU-LULTVBGHSA-N calicheamicin Chemical compound C1[C@H](OC)[C@@H](NCC)CO[C@H]1O[C@H]1[C@H](O[C@@H]2C\3=C(NC(=O)OC)C(=O)C[C@](C/3=C/CSSSC)(O)C#C\C=C/C#C2)O[C@H](C)[C@@H](NO[C@@H]2O[C@H](C)[C@@H](SC(=O)C=3C(=C(OC)C(O[C@H]4[C@@H]([C@H](OC)[C@@H](O)[C@H](C)O4)O)=C(I)C=3C)OC)[C@@H](O)C2)[C@@H]1O HXCHCVDVKSCDHU-LULTVBGHSA-N 0.000 description 1
- 229930195731 calicheamicin Natural products 0.000 description 1
- 229940055022 candida parapsilosis Drugs 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 239000012295 chemical reaction liquid Substances 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 238000004140 cleaning Methods 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000005336 cracking Methods 0.000 description 1
- 108010069495 cysteinyltyrosine Proteins 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 1
- 235000019421 lipase Nutrition 0.000 description 1
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 230000003647 oxidation Effects 0.000 description 1
- 238000007254 oxidation reaction Methods 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- HHXMXAQDOUCLDN-RXMQYKEDSA-N penem Chemical compound S1C=CN2C(=O)C[C@H]21 HHXMXAQDOUCLDN-RXMQYKEDSA-N 0.000 description 1
- 230000001175 peptic effect Effects 0.000 description 1
- 239000002304 perfume Substances 0.000 description 1
- 239000000575 pesticide Substances 0.000 description 1
- 108010072637 phenylalanyl-arginyl-phenylalanine Proteins 0.000 description 1
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 1
- 108010025488 pinealon Proteins 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 230000035484 reaction time Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 108010071207 serylmethionine Proteins 0.000 description 1
- 230000001954 sterilising effect Effects 0.000 description 1
- 238000004659 sterilization and disinfection Methods 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 229910052717 sulfur Inorganic materials 0.000 description 1
- 239000011593 sulfur Substances 0.000 description 1
- 208000024891 symptom Diseases 0.000 description 1
- 230000009261 transgenic effect Effects 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
- 239000002132 β-lactam antibiotic Substances 0.000 description 1
- 229940124586 β-lactam antibiotics Drugs 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/02—Preparation of oxygen-containing organic compounds containing a hydroxy group
- C12P7/04—Preparation of oxygen-containing organic compounds containing a hydroxy group acyclic
- C12P7/18—Preparation of oxygen-containing organic compounds containing a hydroxy group acyclic polyhydric
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02E—REDUCTION OF GREENHOUSE GAS [GHG] EMISSIONS, RELATED TO ENERGY GENERATION, TRANSMISSION OR DISTRIBUTION
- Y02E50/00—Technologies for the production of fuel of non-fossil origin
- Y02E50/10—Biofuels, e.g. bio-diesel
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Biotechnology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Enzymes And Modification Thereof (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
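The invention discloses a bifunctional enzyme biocatalyst and a preparation method and application thereof. The method comprises the following step: using 4-hydroxy-2-butanone as the substrate, an alcohol dehydrogenase or a mutant of the alcohol dehydrogenase catalyzes the reaction to produce R-1,3-butanediol. The invention provides a method for synthesizing R-1,3-butanediol from 4-hydroxy-2-butanone by single-enzyme catalysis.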
Description
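Technical Field
The invention belongs to the field of biotechnology and relates to a method for synthesizing R-1,3-butanediol, in particular to a method for synthesizing R-1,3-butanediol by enzymatic catalysis.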
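Background Art
(R)-1,3-Butanediol is an important chiral synthon, widely used in the synthesis of azetidinone (the parent nucleus of carbapenem antibiotics), fragrances, pheromones, insecticides and other substances. Carbapenems, penems and penicillins all belong to the β-lactam antibiotics. The overuse of penicillin has led to the emergence of many drug-resistant strains, but carbapenems can effectively relieve the symptoms caused by these resistant strains and are widely used in China. Developing a green and efficient technology for synthesizing (R)-1,3-butanediol has therefore become an important research goal both in China and abroad.
At present there are few reports, in China or abroad, on the biocatalytic synthesis of (R)-1,3-butanediol. In 1993, Eguchi et al. prepared (R)-1,3-butanediol by resolving (R,S)-1,3-butanediol through repeated diacylation catalyzed by immobilized SP382 lipase; the optical purity exceeded 98%, but the yield was low (Tamotsu Eguchi, Kenichi Mochida. Lipase-catalyzed diacylation of 1,3-butanediol [J]. Biotechnol. Lett., 1993, 15(9): 955-960.). Akinobu Matsuyama reported the preparation of (R)-1,3-butanediol by resolution of the racemate with Candida parapsilosis IFO 1396, reaching an optical purity of 94% and a yield of 48%. However, the stereoselectivity of both reactions is low and can hardly meet the requirements of drug synthesis (Matsuyama A, Kobayashi Y, Ohnishi H. Microbial Production of Optically Active 1,3-Butanediol from 4-Hydroxy-2-butanone. Journal of the Agricultural Chemical Society, 1993, 57(4): 685-686.).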
Summary of the Invention
The technical problem to be solved by the invention is the synthesis of chiral R-1,3-butanediol.
To solve this technical problem, the invention first provides a method for synthesizing R-1,3-butanediol, comprising the following step:
Using 4-hydroxy-2-butanone as the substrate, a catalytic reaction is carried out with an alcohol dehydrogenase or a mutant of the alcohol dehydrogenase, and the reaction product is R-1,3-butanediol.
Preferably, the alcohol dehydrogenase is derived from: Desulfallas arcticus, Desulfonatronovibrio magnus, Peptococcaceae bacterium BRH_c4b, Thermoanaerobacteraceae bacterium, Desulfovibrio sulfodismutans, Candidatus Stahlbacteria, Desulfolutivibrio sulfoxidireducens, Desulfobacterales bacterium, Candidatus Abyssubacteria bacterium SURF_5, Nitrospirales bacterium, Planctomycetaceae bacterium or Anaerolineales bacterium.
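Preferably, the amino acid sequence of the alcohol dehydrogenase derived from Desulfallas arcticus is shown in SEQ ID No.2;
the amino acid sequence of the alcohol dehydrogenase derived from Desulfonatronovibrio magnus is shown in SEQ ID No.4;
the amino acid sequence of the alcohol dehydrogenase derived from Peptococcaceae bacterium BRH_c4b is shown in SEQ ID No.6;
the amino acid sequence of the alcohol dehydrogenase derived from Thermoanaerobacteraceae bacterium is shown in SEQ ID No.8;
the amino acid sequence of the alcohol dehydrogenase derived from Peptococcaceae bacterium BRH_c4b is shown in SEQ ID No.10;
the amino acid sequence of the alcohol dehydrogenase derived from Desulfovibrio sulfodismutans is shown in SEQ ID No.12;
the amino acid sequence of the alcohol dehydrogenase derived from Candidatus Stahlbacteria is shown in SEQ ID No.14;
the amino acid sequence of the alcohol dehydrogenase derived from Desulfolutivibrio sulfoxidireducens is shown in SEQ ID No.16;
the amino acid sequence of the alcohol dehydrogenase derived from Desulfobacterales bacterium is shown in SEQ ID No.18;
the amino acid sequence of the alcohol dehydrogenase derived from Candidatus Abyssubacteria bacterium SURF_5 is shown in SEQ ID No.20;
the amino acid sequence of the alcohol dehydrogenase derived from Nitrospirales bacterium is shown in SEQ ID No.22;
the amino acid sequence of the alcohol dehydrogenase derived from Nitrospirales bacterium is shown in SEQ ID No.24;
the amino acid sequence of the alcohol dehydrogenase derived from Nitrospirales bacterium is shown in SEQ ID No.26;
the amino acid sequence of the alcohol dehydrogenase derived from Planctomycetaceae bacterium is shown in SEQ ID No.28;
the amino acid sequence of the alcohol dehydrogenase derived from Anaerolineales bacterium is shown in SEQ ID No.30.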
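Preferably, a tag is attached to the N-terminus and/or C-terminus of the alcohol dehydrogenase.
Preferably, the mutants of the alcohol dehydrogenase include: the amino acid sequence shown in SEQ ID No.2 carrying the following point mutations: R80V and/or Y147N;
the amino acid sequence shown in SEQ ID No.4 carrying at least one of the following point mutations: A84G, I85T, K110T, N198D.
Preferably, the alcohol dehydrogenase or its mutant is added in the form of a crude enzyme solution, lyophilized crude enzyme powder, pure enzyme or whole cells.
Preferably, the temperature of the catalytic reaction is 20-35 °C, and the reaction time is 24-48 h.
Preferably, when the alcohol dehydrogenase is added as a crude enzyme solution, lyophilized crude enzyme powder or pure enzyme, its concentration in the reaction system is 0.1 g/L to 10 g/L;
when the alcohol dehydrogenase is added as whole cells, the wet weight of the whole cells is 100 g/L.
Preferably, the catalytic reaction is carried out in a phosphate buffer with a concentration of 50-100 mM and a pH of 6.5-8.5.
Preferably, when the alcohol dehydrogenase is added as a crude enzyme solution, lyophilized crude enzyme powder or pure enzyme, the reaction system contains isopropanol in addition to 4-hydroxy-2-butanone, the alcohol dehydrogenase and the coenzyme NAD+.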
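The method for synthesizing R-1,3-butanediol provided by the invention includes a cofactor regeneration system, in which the alcohol dehydrogenase catalyzes the oxidation of isopropanol to regenerate the cofactor. In this method, the alcohol dehydrogenase catalyzes the reduction of 4-hydroxy-2-butanone to R-1,3-butanediol while NAD(P)H is re-oxidized to NAD(P)+; at the same time, the same alcohol dehydrogenase catalyzes the oxidation of isopropanol to acetone while NAD(P)+ is reduced to NAD(P)H, and the NAD(P)H so generated re-enters the reduction of 4-hydroxy-2-butanone to R-1,3-butanediol.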
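Brief Description of the Drawings
Figure 1 is a schematic of the reaction principle for the alcohol dehydrogenase-catalyzed synthesis of R-1,3-butanediol from 4-hydroxy-2-butanone;
Figure 2 shows the gas chromatograms of the 4-hydroxy-2-butanone and R-1,3-butanediol standards;
Figure 3 shows the gas chromatogram of the reaction mixture.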
Detailed Description of the Embodiments
A method for the enzymatic synthesis of chiral R-1,3-butanediol may comprise the following step (for the reaction principle, see Figure 1): using 4-hydroxy-2-butanone as the substrate, an alcohol dehydrogenase and its coenzyme catalyze the reaction to produce R-1,3-butanediol;
Further, the alcohol dehydrogenase may be derived from any of the following microorganisms: Desulfallas arcticus, Desulfonatronovibrio magnus, Peptococcaceae bacterium BRH_c4b, Thermoanaerobacteraceae bacterium, Desulfovibrio sulfodismutans, Candidatus Stahlbacteria, Desulfolutivibrio sulfoxidireducens, Desulfobacterales bacterium, Candidatus Abyssubacteria bacterium SURF_5, Nitrospirales bacterium, Planctomycetaceae bacterium or Anaerolineales bacterium;
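Still further, the alcohol dehydrogenase may specifically be any one of the following (a1)-(a16):
(a1) an alcohol dehydrogenase derived from Desulfallas arcticus, with the amino acid sequence SEQ ID No.2;
(a2) an alcohol dehydrogenase derived from Desulfonatronovibrio magnus, with the amino acid sequence SEQ ID No.4;
(a3) an alcohol dehydrogenase derived from Peptococcaceae bacterium BRH_c4b, with the amino acid sequence SEQ ID No.6;
(a4) an alcohol dehydrogenase derived from Thermoanaerobacteraceae bacterium, with the amino acid sequence SEQ ID No.8;
(a5) an alcohol dehydrogenase derived from Peptococcaceae bacterium BRH_c4b, with the amino acid sequence SEQ ID No.10;
(a6) an alcohol dehydrogenase derived from Desulfovibrio sulfodismutans, with the amino acid sequence SEQ ID No.12;
(a7) an alcohol dehydrogenase derived from Candidatus Stahlbacteria, with the amino acid sequence SEQ ID No.14;
(a8) an alcohol dehydrogenase derived from Desulfolutivibrio sulfoxidireducens, with the amino acid sequence SEQ ID No.16;
(a9) an alcohol dehydrogenase derived from Desulfobacterales bacterium, with the amino acid sequence SEQ ID No.18;
(a10) an alcohol dehydrogenase derived from Candidatus Abyssubacteria bacterium SURF_5, with the amino acid sequence SEQ ID No.20;
(a11) an alcohol dehydrogenase derived from Nitrospirales bacterium, with the amino acid sequence SEQ ID No.22;
(a12) an alcohol dehydrogenase derived from Nitrospirales bacterium, with the amino acid sequence SEQ ID No.24;
(a13) an alcohol dehydrogenase derived from Nitrospirales bacterium, with the amino acid sequence SEQ ID No.26;
(a14) an alcohol dehydrogenase derived from Planctomycetaceae bacterium, with the amino acid sequence SEQ ID No.28;
(a15) an alcohol dehydrogenase derived from Anaerolineales bacterium, with the amino acid sequence SEQ ID No.30;
(a16) a fusion protein obtained by attaching a tag to the N-terminus and/or C-terminus of the protein defined in any one of (a1)-(a15);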
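Further, the mutant of the alcohol dehydrogenase may specifically be any one of the following (b1)-(b3):
(b1) compared with the alcohol dehydrogenase derived from Desulfallas arcticus shown in SEQ ID No.2, it contains, or contains only, at least one of the following mutations: R80V, Y147N;
(b2) compared with the alcohol dehydrogenase derived from Desulfonatronovibrio magnus shown in SEQ ID No.4, it contains, or contains only, at least one of the following mutations: A84G, I85T, K110T, N198D;
(b3) a fusion protein obtained by attaching a tag to the N-terminus and/or C-terminus of the protein defined in either (b1) or (b2).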
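In the method, the alcohol dehydrogenase exerts its catalytic action in the form of a crude enzyme solution, lyophilized crude enzyme powder, pure enzyme or whole cells;
Further, the crude enzyme solution, lyophilized crude enzyme powder and pure enzyme are each prepared by a method comprising the following steps: expressing the alcohol dehydrogenase in a host cell to obtain recombinant cells; lysing the recombinant cells to obtain the crude enzyme solution, lyophilized crude enzyme powder or pure enzyme;
Further, the whole cells are prepared by a method comprising the following step: expressing the alcohol dehydrogenase in a host cell; the resulting recombinant cells are the whole cells;
Still further, the recombinant cells are prepared by a method comprising the following steps: introducing into the host cell a nucleic acid molecule capable of expressing the alcohol dehydrogenase, and obtaining, after induction culture, the recombinant cells expressing the alcohol dehydrogenase;
Still further, the "nucleic acid molecule capable of expressing the alcohol dehydrogenase" is introduced into the host cell in the form of a recombinant vector; the recombinant vector is a bacterial plasmid (such as a T7 promoter-based vector for expression in bacteria, specifically pET-28a or the like), a bacteriophage, a yeast plasmid or a retroviral packaging plasmid carrying the gene encoding the alcohol dehydrogenase; and/or the host cell is a prokaryotic cell or a lower eukaryotic cell;
In one embodiment of the invention, the recombinant vector is specifically a recombinant plasmid obtained by replacing the small fragment between the NdeI and XhoI restriction sites of the pET-28a vector with the gene encoding the alcohol dehydrogenase or its mutant.
Further, the host cell may be a prokaryotic cell or a lower eukaryotic cell.
Still further, the prokaryotic cell may specifically be a bacterium, and the lower eukaryotic cell may specifically be a yeast cell.
In one embodiment of the invention, the host cell is specifically Escherichia coli, more specifically E. coli BL21(DE3). Correspondingly, the induction culture consists of adding IPTG to the culture to a final concentration of 0.1-0.5 mM (for example 0.1 mM) and inducing at 20-37 °C for 12-24 h (for example 16 h).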
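The sequence of the gene encoding the alcohol dehydrogenase derived from Desulfallas arcticus is SEQ ID No.1, or a fusion sequence obtained by attaching a tag-coding sequence to its 5' end and/or 3' end, or a randomly and/or site-directedly mutagenized sequence that retains function and encodes the same protein;
The sequence of the gene encoding the alcohol dehydrogenase derived from Desulfonatronovibrio magnus is SEQ ID No.3, or a fusion sequence obtained by attaching a tag-coding sequence to its 5' end and/or 3' end, or a randomly and/or site-directedly mutagenized sequence that retains function and encodes the same protein;
The sequence of the gene encoding the alcohol dehydrogenase derived from Peptococcaceae bacterium BRH_c4b is SEQ ID No.5, or a fusion sequence obtained by attaching a tag-coding sequence to its 5' end and/or 3' end, or a randomly and/or site-directedly mutagenized sequence that retains function and encodes the same protein;
The sequence of the gene encoding the alcohol dehydrogenase derived from Thermoanaerobacteraceae bacterium is SEQ ID No.7, or a fusion sequence obtained by attaching a tag-coding sequence to its 5' end and/or 3' end, or a randomly and/or site-directedly mutagenized sequence that retains function and encodes the same protein;
The sequence of the gene encoding the alcohol dehydrogenase derived from Peptococcaceae bacterium BRH_c4b is SEQ ID No.9, or a fusion sequence obtained by attaching a tag-coding sequence to its 5' end and/or 3' end, or a randomly and/or site-directedly mutagenized sequence that retains function and encodes the same protein;
The sequence of the gene encoding the alcohol dehydrogenase derived from Desulfovibrio sulfodismutans is SEQ ID No.11, or a fusion sequence obtained by attaching a tag-coding sequence to its 5' end and/or 3' end, or a randomly and/or site-directedly mutagenized sequence that retains function and encodes the same protein;
The sequence of the gene encoding the alcohol dehydrogenase derived from Candidatus Stahlbacteria is SEQ ID No.13, or a fusion sequence obtained by attaching a tag-coding sequence to its 5' end and/or 3' end, or a randomly and/or site-directedly mutagenized sequence that retains function and encodes the same protein;
The sequence of the gene encoding the alcohol dehydrogenase derived from Desulfolutivibrio sulfoxidireducens is SEQ ID No.15, or a fusion sequence obtained by attaching a tag-coding sequence to its 5' end and/or 3' end, or a randomly and/or site-directedly mutagenized sequence that retains function and encodes the same protein;
The sequence of the gene encoding the alcohol dehydrogenase derived from Desulfobacterales bacterium is SEQ ID No.17, or a fusion sequence obtained by attaching a tag-coding sequence to its 5' end and/or 3' end, or a randomly and/or site-directedly mutagenized sequence that retains function and encodes the same protein;
The sequence of the gene encoding the alcohol dehydrogenase derived from Candidatus Abyssubacteria bacterium SURF_5 is SEQ ID No.19, or a fusion sequence obtained by attaching a tag-coding sequence to its 5' end and/or 3' end, or a randomly and/or site-directedly mutagenized sequence that retains function and encodes the same protein;
The sequence of the gene encoding the alcohol dehydrogenase derived from Nitrospirales bacterium is SEQ ID No.21, or a fusion sequence obtained by attaching a tag-coding sequence to its 5' end and/or 3' end, or a randomly and/or site-directedly mutagenized sequence that retains function and encodes the same protein;
The sequence of the gene encoding the alcohol dehydrogenase derived from Nitrospirales bacterium is SEQ ID No.23, or a fusion sequence obtained by attaching a tag-coding sequence to its 5' end and/or 3' end, or a randomly and/or site-directedly mutagenized sequence that retains function and encodes the same protein;
The sequence of the gene encoding the alcohol dehydrogenase derived from Nitrospirales bacterium is SEQ ID No.25, or a fusion sequence obtained by attaching a tag-coding sequence to its 5' end and/or 3' end, or a randomly and/or site-directedly mutagenized sequence that retains function and encodes the same protein;
The sequence of the gene encoding the alcohol dehydrogenase derived from Planctomycetaceae bacterium is SEQ ID No.27, or a fusion sequence obtained by attaching a tag-coding sequence to its 5' end and/or 3' end, or a randomly and/or site-directedly mutagenized sequence that retains function and encodes the same protein;
The sequence of the gene encoding the alcohol dehydrogenase derived from Anaerolineales bacterium is SEQ ID No.29, or a fusion sequence obtained by attaching a tag-coding sequence to its 5' end and/or 3' end, or a randomly and/or site-directedly mutagenized sequence that retains function and encodes the same protein;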
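The sequence of the gene encoding the mutant of the alcohol dehydrogenase derived from Desulfallas arcticus is any one of the following (c1)-(c3): (c1) compared with SEQ ID No.1, it contains, or contains only, at least one of the following mutations: G239T, T439A; (c2) a fusion sequence obtained by attaching a tag-coding sequence to the 5' end and/or 3' end of the sequence defined in (c1); (c3) a randomly and/or site-directedly mutagenized sequence that, compared with the sequence defined in (c1) or (c2), retains function and encodes the same protein;
The sequence of the gene encoding the mutant of the alcohol dehydrogenase derived from Desulfonatronovibrio magnus is any one of the following (d1)-(d3): (d1) compared with SEQ ID No.3, it contains, or contains only, at least one of the following mutations: C251G, T253C, A329C, G593A; (d2) a fusion sequence obtained by attaching a tag-coding sequence to the 5' end and/or 3' end of the sequence defined in (d1); (d3) a randomly and/or site-directedly mutagenized sequence that, compared with the sequence defined in (d1) or (d2), retains function and encodes the same protein;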
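As a minimal sketch of how a single-nucleotide substitution in a coding sequence maps to an amino acid substitution, the helper below applies the standard genetic code to one codon. The function name `protein_change` is hypothetical and not part of the patent. The wild-type codons are read off the sequence listing (nt 439-441 of SEQ ID No.1; nt 250-252 and 328-330 of SEQ ID No.3), and only three of the listed substitutions are illustrated.

```python
# Standard genetic code, built in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def protein_change(wild_type_codon, residue_number, nt_mutation):
    """Return the protein-level change (e.g. 'Y147N') for one nucleotide change.

    wild_type_codon: the three bases encoding residue `residue_number`
    residue_number:  1-based position of the residue in the protein
    nt_mutation:     gene-level notation such as 'T439A' (1-based position)
    """
    codon = wild_type_codon.upper()
    old, pos, new = nt_mutation[0], int(nt_mutation[1:-1]), nt_mutation[-1]
    codon_start = 3 * (residue_number - 1) + 1  # first nucleotide of the codon
    offset = pos - codon_start
    if not 0 <= offset <= 2:
        raise ValueError("nucleotide position lies outside this codon")
    if codon[offset] != old:
        raise ValueError("wild-type base does not match the codon")
    mutated = codon[:offset] + new + codon[offset + 1:]
    return f"{CODON_TABLE[codon]}{residue_number}{CODON_TABLE[mutated]}"


# Codons taken from the sequence listing (SEQ ID No.1 and SEQ ID No.3).
print(protein_change("tac", 147, "T439A"))  # SEQ ID No.1, residue 147
print(protein_change("gcc", 84, "C251G"))   # SEQ ID No.3, residue 84
print(protein_change("aag", 110, "A329C"))  # SEQ ID No.3, residue 110
```

The helper raises an error when the stated wild-type base or position does not match the codon, which makes it usable as a quick consistency check between gene-level and protein-level mutation notation.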
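In the catalytic reaction, the temperature may be 20-35 °C, such as 25-30 °C, specifically 25 °C. The reaction time may be 24-48 h, such as 24 h.
When the alcohol dehydrogenase acts as a crude enzyme solution, lyophilized crude enzyme powder or pure enzyme, the catalytic reaction may be carried out in a phosphate buffer with a concentration of 50-100 mM and a pH of 6.5-8.0, specifically 50 mM phosphate buffer at pH 7.0; when the alcohol dehydrogenase acts as whole cells, the catalytic reaction may be carried out in a phosphate buffer with a concentration of 50-100 mM and a pH of 7.5-8.5, specifically 50 mM phosphate buffer at pH 7.5.
When the alcohol dehydrogenase acts as a crude enzyme solution, lyophilized crude enzyme powder or pure enzyme, its concentration in the respective reaction system may be 0.1 g/L to 10 g/L, such as 10 g/L. When the alcohol dehydrogenase acts as whole cells, the concentration of the whole cells in the reaction system is 100 g/L (that is, each liter of reaction system contains 100 g wet weight of whole cells).
In the invention, the coenzyme of the alcohol dehydrogenase may specifically be oxidized coenzyme I (NAD+) or oxidized coenzyme II (NADP+).
In the invention, when the alcohol dehydrogenase acts as a crude enzyme solution, lyophilized crude enzyme powder or pure enzyme, the concentration of the coenzyme in the respective reaction system may be 0.1-5 mM (specifically 5 mM).
When the alcohol dehydrogenase acts as a crude enzyme solution, lyophilized crude enzyme powder or pure enzyme, the reaction system contains isopropanol in addition to 4-hydroxy-2-butanone, the alcohol dehydrogenase and its coenzyme.
Specifically, the reaction system has the following composition: 50 mM phosphate buffer at pH 7.0; 4-hydroxy-2-butanone at a final concentration of 500 mM; oxidized coenzyme I (NAD+) or oxidized coenzyme II (NADP+) at a final concentration of 5 mM; isopropanol at a final concentration of 10% (v/v); and the crude enzyme solution, lyophilized crude enzyme powder or pure enzyme of the alcohol dehydrogenase at a final concentration of 10 g/L.
When the alcohol dehydrogenase acts as whole cells, the reaction system also contains isopropanol.
Specifically, the reaction system may have the following composition: 50 mM phosphate buffer at pH 7.5; 4-hydroxy-2-butanone at a final concentration of 500 mM; isopropanol at a final concentration of 10% (v/v); and the whole cells at a final concentration of 100 g/L (that is, 100 g wet weight of whole cells per liter of reaction system).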
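The stoichiometry implied by these compositions can be sketched with a short calculation. The physical constants below (isopropanol density and molar mass, 1,3-butanediol molar mass) are textbook values assumed for illustration and are not stated in the patent; the function names are hypothetical. The sketch estimates how many molar equivalents of isopropanol are available per mole of substrate, how many times each cofactor molecule would have to be recycled at full conversion, and the theoretical maximum product titer.

```python
# Assumed physical constants (not given in the source text).
ISOPROPANOL_DENSITY_G_PER_ML = 0.786  # approximate value near 20 °C
ISOPROPANOL_MW = 60.10                # g/mol
BUTANEDIOL_MW = 90.12                 # g/mol, C4H10O2


def isopropanol_mM(percent_v_v):
    """Convert an isopropanol content in % (v/v) to a concentration in mM."""
    ml_per_litre = percent_v_v * 10.0
    grams_per_litre = ml_per_litre * ISOPROPANOL_DENSITY_G_PER_ML
    return grams_per_litre / ISOPROPANOL_MW * 1000.0


def summarize(substrate_mM=500.0, cofactor_mM=5.0, isopropanol_percent=10.0):
    """Summarize the redox balance of the described reaction system."""
    ipa = isopropanol_mM(isopropanol_percent)
    return {
        "isopropanol_mM": ipa,
        # Each substrate molecule consumes one isopropanol via the cofactor.
        "isopropanol_equivalents": ipa / substrate_mM,
        # Recycling events per cofactor molecule if all substrate is reduced.
        "cofactor_turnovers_at_full_conversion": substrate_mM / cofactor_mM,
        "max_product_g_per_L": substrate_mM / 1000.0 * BUTANEDIOL_MW,
    }


for key, value in summarize().items():
    print(f"{key}: {value:.2f}")
```

Under these assumptions, 10% (v/v) isopropanol is in molar excess over 500 mM substrate, which is consistent with its role as the sacrificial reductant in the cofactor regeneration cycle.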
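The invention also provides an enzyme and related products.
The enzyme provided by the invention is the alcohol dehydrogenase described above. It may, of course, also include the coenzyme of the respective dehydrogenase.
The related product may be a nucleic acid molecule capable of expressing each enzyme of the enzyme system, or an expression cassette, recombinant vector, recombinant bacterium or transgenic cell line containing the nucleic acid molecule.
The use of the enzyme or the related product in the synthesis of R-1,3-butanediol also falls within the scope of protection of the invention.
To make the purpose, technical solutions and advantages of the invention clearer, the invention is described in detail below with reference to examples, which should not be construed as limiting its scope of protection.
Unless otherwise specified, the methods used in the following examples are conventional methods.
Unless otherwise specified, the materials, reagents and the like used in the following examples are commercially available.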
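Example 1 Preparation of engineered strains expressing the alcohol dehydrogenases or their mutants
The genes encoding the alcohol dehydrogenases were each obtained by total gene synthesis (codon-optimized for E. coli as the host where necessary). The synthesized genes were ligated into the pET28a(+) expression vector, and the recombinant vectors were obtained after verification by sequencing. The corresponding gene mutants were obtained by site-directed mutagenesis.
The sequence-verified recombinant expression vectors were transformed into the microbial host E. coli BL21(DE3) to obtain the genetically engineered strains of the invention. The transformation method is a conventional method in the art, preferably electroporation or chemical transformation.
The alcohol dehydrogenases and mutants used in this example are listed in Table 1.
Table 1 Alcohol dehydrogenases and their mutants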
No. | Organism | Gene sequence | Protein sequence |
a1 | Desulfallas arcticus | SEQ ID No.1 | SEQ ID No.2 |
a2 | Desulfonatronovibrio magnus | SEQ ID No.3 | SEQ ID No.4 |
a3 | Peptococcaceae bacterium BRH_c4b | SEQ ID No.5 | SEQ ID No.6 |
a4 | Thermoanaerobacteraceae bacterium | SEQ ID No.7 | SEQ ID No.8 |
a5 | Peptococcaceae bacterium BRH_c4b | SEQ ID No.9 | SEQ ID No.10 |
a6 | Desulfovibrio sulfodismutans | SEQ ID No.11 | SEQ ID No.12 |
a7 | Candidatus Stahlbacteria | SEQ ID No.13 | SEQ ID No.14 |
a8 | Desulfolutivibrio sulfoxidireducens | SEQ ID No.15 | SEQ ID No.16 |
a9 | Desulfobacterales bacterium | SEQ ID No.17 | SEQ ID No.18 |
a10 | Candidatus Abyssubacteria | SEQ ID No.19 | SEQ ID No.20 |
a11 | Nitrospirales bacterium | SEQ ID No.21 | SEQ ID No.22 |
a12 | Nitrospirales bacterium | SEQ ID No.23 | SEQ ID No.24 |
a13 | Nitrospira sp. | SEQ ID No.25 | SEQ ID No.26 |
a14 | Planctomycetaceae bacterium | SEQ ID No.27 | SEQ ID No.28 |
a15 | Anaerolineales bacterium | SEQ ID No.29 | SEQ ID No.30 |
b1 | Desulfallas arcticus | SEQ ID No.31 | SEQ ID No.32 |
b2 | Desulfonatronovibrio magnus | SEQ ID No.33 | SEQ ID No.34 |
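Example 2 Expression of the alcohol dehydrogenases or their mutants and preparation of crude enzyme
The recombinant expression vectors of the alcohol dehydrogenases or mutants constructed in Example 1 were transformed into E. coli BL21(DE3) competent cells and cultured at 37 °C for 12-16 h. Once transformants appeared, a single transformant was inoculated into 5 mL of LB medium containing kanamycin (50 μg/mL) and cultured overnight (12-16 h) at 37 °C and 220 rpm. The culture was then inoculated at 1% (v/v) into LB medium and grown at 37 °C and 220 rpm to an OD600 of about 0.6. IPTG was added to a final concentration of 0.1 mM, and expression was induced at 25 °C for 16 h. The cells were collected by centrifugation at 4 °C and 6000 rpm for 20 min, resuspended and washed once with 50 mM phosphate buffer (pH 7.0), then disrupted by sonication and processed into lyophilized enzyme powder.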
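The dilution arithmetic behind this protocol can be sketched as follows. The culture volume and the stock concentrations are hypothetical values chosen only for illustration (the patent specifies final concentrations but neither stock concentrations nor the expression culture volume), and the function names are not from the source.

```python
def stock_volume_ml(final_conc, stock_conc, final_volume_ml):
    """Volume of stock (mL) that brings `final_volume_ml` to `final_conc`.

    Uses C1*V1 = C2*V2; both concentrations must share the same unit.
    """
    if stock_conc <= 0 or final_conc > stock_conc:
        raise ValueError("stock must be more concentrated than the target")
    return final_conc * final_volume_ml / stock_conc


def inoculum_volume_ml(culture_volume_ml, percent_v_v=1.0):
    """Seed culture volume (mL) for a given inoculation ratio in % (v/v)."""
    return culture_volume_ml * percent_v_v / 100.0


# Hypothetical working values, not taken from the patent.
CULTURE_ML = 500.0               # expression culture volume
IPTG_STOCK_MM = 100.0            # IPTG stock concentration in mM
KAN_STOCK_UG_PER_ML = 50_000.0   # kanamycin stock, i.e. 50 mg/mL

seed_ml = inoculum_volume_ml(CULTURE_ML)                    # 1% (v/v) inoculum
iptg_ml = stock_volume_ml(0.1, IPTG_STOCK_MM, CULTURE_ML)   # 0.1 mM IPTG
kan_ml = stock_volume_ml(50.0, KAN_STOCK_UG_PER_ML, 5.0)    # 50 ug/mL in 5 mL

print(f"seed culture: {seed_ml:.1f} mL")
print(f"IPTG stock:   {iptg_ml:.2f} mL")
print(f"kanamycin:    {kan_ml * 1000:.1f} uL")
```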
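Example 3 Preparation of R-1,3-butanediol with crude enzyme of the alcohol dehydrogenases or mutants
The reaction system was assembled by adding, in order, 50 mM phosphate buffer (pH 7.0), 4-hydroxy-2-butanone to a final concentration of 500 mM, oxidized coenzyme I (NAD+) to a final concentration of 5 mM, isopropanol to a final concentration of 10% (v/v), and alcohol dehydrogenase lyophilized powder or enzyme solution to a final concentration of 10 g/L. The reaction system was incubated at 25 °C for 24 h.
After the reaction, an equal volume of ethyl acetate was added to the reaction mixture for extraction, followed by vortexing for 10 min and centrifugation (14000 rpm, 2 min). The upper phase was passed through a 0.22 μm membrane and analyzed by gas chromatography. GC conditions: chiral column Supelco β-120 (250*2.5mM); column temperature 140 °C; injector temperature 250 °C; FID detector temperature 300 °C.
The GC results are shown in Figures 2 and 3. Figure 2 shows the chromatograms of the 4-hydroxy-2-butanone and R-1,3-butanediol standards: the retention time of 4-hydroxy-2-butanone is about 7.6 min and that of R-1,3-butanediol is about 8.3 min. Figure 3 shows the chromatogram for the preparation of R-1,3-butanediol with crude enzyme of the alcohol dehydrogenases or mutants, and Table 2 gives the corresponding results.
Table 2 Results of the preparation of R-1,3-butanediol with crude enzyme of the alcohol dehydrogenases or their mutants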
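A rough way to turn such a chromatogram into a conversion estimate is sketched below, using the retention times reported for the standards (about 7.6 min and 8.3 min). This is an illustration only: the retention-time tolerance, the `Peak` type and the function names are hypothetical, the peak areas in the usage example are invented, and the calculation assumes equal FID response and equal extraction efficiency for ketone and diol. Quantitative work would normally rely on calibration curves instead.

```python
from dataclasses import dataclass

# Approximate retention times of the standards, as reported for Figure 2.
KETONE_RT_MIN = 7.6   # 4-hydroxy-2-butanone
DIOL_RT_MIN = 8.3     # R-1,3-butanediol


@dataclass
class Peak:
    retention_min: float
    area: float


def assign(peak, tolerance_min=0.2):
    """Label a peak by retention time, or return None if it matches neither."""
    if abs(peak.retention_min - KETONE_RT_MIN) <= tolerance_min:
        return "4-hydroxy-2-butanone"
    if abs(peak.retention_min - DIOL_RT_MIN) <= tolerance_min:
        return "R-1,3-butanediol"
    return None


def area_percent_conversion(peaks, tolerance_min=0.2):
    """Area-percent conversion: diol area / (ketone area + diol area) * 100."""
    ketone = sum(p.area for p in peaks
                 if assign(p, tolerance_min) == "4-hydroxy-2-butanone")
    diol = sum(p.area for p in peaks
               if assign(p, tolerance_min) == "R-1,3-butanediol")
    total = ketone + diol
    if total == 0:
        raise ValueError("no substrate or product peak found")
    return 100.0 * diol / total


# Invented peak list: a solvent peak plus substrate and product peaks.
example = [Peak(3.10, 9999.0), Peak(7.58, 250.0), Peak(8.31, 750.0)]
print(f"{area_percent_conversion(example):.1f} % conversion (area percent)")
```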
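Example 4 Preparation of R-1,3-butanediol with whole cells of the alcohol dehydrogenases or mutants
The reaction system was assembled by adding, in order, 50 mM phosphate buffer (pH 7.5), 4-hydroxy-2-butanone to a final concentration of 500 mM, oxidized coenzyme I (NAD+) to a final concentration of 5 mM, isopropanol to a final concentration of 10% (v/v), and alcohol dehydrogenase whole cells to a final concentration of 100 g/L. The reaction system was incubated at 25 °C for 24 h.
After the reaction, an equal volume of ethyl acetate was added to the reaction mixture for extraction, followed by vortexing for 10 min and centrifugation (14000 rpm, 2 min). The upper phase was passed through a 0.22 μm membrane and analyzed by gas chromatography. GC conditions: chiral column Supelco β-120 (250*2.5mM); column temperature 140 °C; injector temperature 250 °C; FID detector temperature 300 °C. Table 3 gives the results of the preparation of R-1,3-butanediol with whole cells of the alcohol dehydrogenases or their mutants.
Table 3 Results of the preparation of R-1,3-butanediol with whole cells of the alcohol dehydrogenases or their mutants
The above are only preferred embodiments of the invention. It should be noted that those of ordinary skill in the art may make several improvements and refinements without departing from the principle of the invention, and such improvements and refinements shall also be regarded as falling within the scope of protection of the invention.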
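Sequence Listing
<110> Institute of Biology, Hebei Academy of Sciences
<120> Bifunctional enzyme biocatalyst and preparation method and application thereof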
<160> 34
<170> SIPOSequenceListing 1.0
<210> 1
<211> 1059
<212> DNA
<213> Desulfotomaculum arcticum
<400> 1
atgaaagctt ttgtaatgca cggaataggg cgcgtgggca tcatggaaaa gccgctgccg 60
gcagatccgg gtccaaacga cgcgatcatt aaaaccaccg cggctttggt ttgtacctcg 120
gacgttcaca ccgtgcacgg cgcgattggt gagaaggtgg atctgaccct gggccatgaa 180
gcgttgggaa tcatccacaa gctgggttcc gcggtcgaga gcctgaaagt tggcgaccgc 240
gtcgttgtga acgccatcac cccgtgttat aagtgcgaaa actgccagcg tggttttacc 300
agccaatgta ccaccatgct gggcgggtgg aaattcgcta atatcaagga cggctgcttc 360
gccgaatatt tccacgtaaa cgacgcggat gcaaatctgg tcccggttcc gtcgagcgtg 420
agcgacgagg cggcgctcta cgctacggat atgatgtcaa ccggtttcat gggtgctgag 480
cacgccggta ttccgctggg cggtacggtt gcaatcttcg gtcaaggtcc ggttggcctg 540
atggcgactg cgggctctcg cctgttgggc gctggtctca ttattgcggt tgagacagtg 600
ccgcatcgtc agcagctgag caaatactac ggcgcagaca tcatcctgga ttttaagaaa 660
gtggatgttg tggaggaaat tttgcgcatc accaatggcc aaggtgtcga cagcgcgata 720
gaagcgttag gcacgcaggt taccttcgag aactgcatta aggcaacccg tcctggtggc 780
acgatcagca acattggtta tcatggcgag ggtgattacc tgaaaattcc gcgtgtggac 840
tggggtgtgg gtatgtccga caaaaccatc cgtaccggtc tgtgcccggg cggtcgtgaa 900
cgtatgagtc gtcttctgcg tttgatcgaa tctggtagaa ttgatccgac tgcgttgact 960
acgcataaat tccgctttga tgaaatcgag aaggcctttg agctgatggc tatcaagggt 1020
gataatgtta ttaaaccact tattctgttt gaagattaa 1059
<210> 2
<211> 352
<212> PRT
<213> Desulfotomaculum arcticum
<400> 2
Met Lys Ala Phe Val Met His Gly Ile Gly Arg Val Gly Ile Met Glu
1 5 10 15
Lys Pro Leu Pro Ala Asp Pro Gly Pro Asn Asp Ala Ile Ile Lys Thr
20 25 30
Thr Ala Ala Leu Val Cys Thr Ser Asp Val His Thr Val His Gly Ala
35 40 45
Ile Gly Glu Lys Val Asp Leu Thr Leu Gly His Glu Ala Leu Gly Ile
50 55 60
Ile His Lys Leu Gly Ser Ala Val Glu Ser Leu Lys Val Gly Asp Arg
65 70 75 80
Val Val Val Asn Ala Ile Thr Pro Cys Tyr Lys Cys Glu Asn Cys Gln
85 90 95
Arg Gly Phe Thr Ser Gln Cys Thr Thr Met Leu Gly Gly Trp Lys Phe
100 105 110
Ala Asn Ile Lys Asp Gly Cys Phe Ala Glu Tyr Phe His Val Asn Asp
115 120 125
Ala Asp Ala Asn Leu Val Pro Val Pro Ser Ser Val Ser Asp Glu Ala
130 135 140
Ala Leu Tyr Ala Thr Asp Met Met Ser Thr Gly Phe Met Gly Ala Glu
145 150 155 160
His Ala Gly Ile Pro Leu Gly Gly Thr Val Ala Ile Phe Gly Gln Gly
165 170 175
Pro Val Gly Leu Met Ala Thr Ala Gly Ser Arg Leu Leu Gly Ala Gly
180 185 190
Leu Ile Ile Ala Val Glu Thr Val Pro His Arg Gln Gln Leu Ser Lys
195 200 205
Tyr Tyr Gly Ala Asp Ile Ile Leu Asp Phe Lys Lys Val Asp Val Val
210 215 220
Glu Glu Ile Leu Arg Ile Thr Asn Gly Gln Gly Val Asp Ser Ala Ile
225 230 235 240
Glu Ala Leu Gly Thr Gln Val Thr Phe Glu Asn Cys Ile Lys Ala Thr
245 250 255
Arg Pro Gly Gly Thr Ile Ser Asn Ile Gly Tyr His Gly Glu Gly Asp
260 265 270
Tyr Leu Lys Ile Pro Arg Val Asp Trp Gly Val Gly Met Ser Asp Lys
275 280 285
Thr Ile Arg Thr Gly Leu Cys Pro Gly Gly Arg Glu Arg Met Ser Arg
290 295 300
Leu Leu Arg Leu Ile Glu Ser Gly Arg Ile Asp Pro Thr Ala Leu Thr
305 310 315 320
Thr His Lys Phe Arg Phe Asp Glu Ile Glu Lys Ala Phe Glu Leu Met
325 330 335
Ala Ile Lys Gly Asp Asn Val Ile Lys Pro Leu Ile Leu Phe Glu Asp
340 345 350
<210> 3
<211> 1053
<212> DNA
<213> Desulfonatronovibrio magnus
<400> 3
atgaaagcgt ttgttgttca cagtatcggt aaagttggaa taatggagaa accagttcct 60
gaaccagggc caaatgatgt aattgtaaag acaaccaacg ccttgatttg tacatctgat 120
gtacataccg ttgccggtgc tattggcgag aaaagcgacc tgactcttgg gcacgaaggt 180
gcaggtactg tttataagat tggcagtgcg gttaaagggt tcaaggaggg cgagagagtt 240
ctggttaatg ccataactcc ctgtttcaag tgtcacaact gccagcgcgg ctatacttca 300
caatgcggac aggccctggg tggatggaag tttgccaata ttaaggatgg ctgttttgct 360
gagtactttc atgttaatga tgctgaatcc aacttagtaa agattccaga ctcagtttct 420
gatgaagctg ctctatatac aacagatatg atgtccaccg gattcatggg ggctgagaat 480
ggaaatattc ctcttggagg gattgtagct gtttttggtc agggaccggt tggtcttatg 540
tcaacagcag gagcacgcct gttgggtgct ggtttggtca tagcagtgga aaacattcct 600
gccagacagg aactcgctaa attctacggt gctgacgtta ttgttgattt taccaaggtt 660
gatgcagtgg aagaaattat gaagctgact gatgggcagg gggtagatgc tgccattgag 720
gcactgggag ctcagatcac atttgagaac tgcattaaag tcaccaagcc tggaggtacc 780
atatccaata ttggctacca tggagaggga gattacataa aaataccgag agctgaatgg 840
ggtgtgggca tgtcggataa gactattcgg actggacttt gccctggagg cagcgaaaga 900
atgtccagac ttctgcggct tattgaaaac ggacgtattg atcccaccaa gctgaccact 960
catcgtttca gctttgatga gattgaaaaa ggatttcaca tgatggccaa taaagaggac 1020
ggagttatca aacccctggt aacattcagc tga 1053
<210> 4
<211> 350
<212> PRT
<213> Desulfonatronovibrio magnus
<400> 4
Met Lys Ala Phe Val Val His Ser Ile Gly Lys Val Gly Ile Met Glu
1 5 10 15
Lys Pro Val Pro Glu Pro Gly Pro Asn Asp Val Ile Val Lys Thr Thr
20 25 30
Asn Ala Leu Ile Cys Thr Ser Asp Val His Thr Val Ala Gly Ala Ile
35 40 45
Gly Glu Lys Ser Asp Leu Thr Leu Gly His Glu Gly Ala Gly Thr Val
50 55 60
Tyr Lys Ile Gly Ser Ala Val Lys Gly Phe Lys Glu Gly Glu Arg Val
65 70 75 80
Leu Val Asn Ala Ile Thr Pro Cys Phe Lys Cys His Asn Cys Gln Arg
85 90 95
Gly Tyr Thr Ser Gln Cys Gly Gln Ala Leu Gly Gly Trp Lys Phe Ala
100 105 110
Asn Ile Lys Asp Gly Cys Phe Ala Glu Tyr Phe His Val Asn Asp Ala
115 120 125
Glu Ser Asn Leu Val Lys Ile Pro Asp Ser Val Ser Asp Glu Ala Ala
130 135 140
Leu Tyr Thr Thr Asp Met Met Ser Thr Gly Phe Met Gly Ala Glu Asn
145 150 155 160
Gly Asn Ile Pro Leu Gly Gly Ile Val Ala Val Phe Gly Gln Gly Pro
165 170 175
Val Gly Leu Met Ser Thr Ala Gly Ala Arg Leu Leu Gly Ala Gly Leu
180 185 190
Val Ile Ala Val Glu Asn Ile Pro Ala Arg Gln Glu Leu Ala Lys Phe
195 200 205
Tyr Gly Ala Asp Val Ile Val Asp Phe Thr Lys Val Asp Ala Val Glu
210 215 220
Glu Ile Met Lys Leu Thr Asp Gly Gln Gly Val Asp Ala Ala Ile Glu
225 230 235 240
Ala Leu Gly Ala Gln Ile Thr Phe Glu Asn Cys Ile Lys Val Thr Lys
245 250 255
Pro Gly Gly Thr Ile Ser Asn Ile Gly Tyr His Gly Glu Gly Asp Tyr
260 265 270
Ile Lys Ile Pro Arg Ala Glu Trp Gly Val Gly Met Ser Asp Lys Thr
275 280 285
Ile Arg Thr Gly Leu Cys Pro Gly Gly Ser Glu Arg Met Ser Arg Leu
290 295 300
Leu Arg Leu Ile Glu Asn Gly Arg Ile Asp Pro Thr Lys Leu Thr Thr
305 310 315 320
His Arg Phe Ser Phe Asp Glu Ile Glu Lys Gly Phe His Met Met Ala
325 330 335
Asn Lys Glu Asp Gly Val Ile Lys Pro Leu Val Thr Phe Ser
340 345 350
<210> 5
<211> 1053
<212> DNA
<213> Peptococcaceae bacterium BRH_c4b
<400> 5
atgaaagcat tcgtaatgca tgagatcggc aaaattggta ttatggagaa gccgattcct 60
gaacctggtc ccaacgatgc gattcttaaa actaccggag cccttgtatg cacatctgac 120
gttcataccg ttcatggcgc agtgggcccg agggaaaata tgaccctagg tcacgaagca 180
gtaggaatca ttcataaact tggcagtgcc gtacaaggtc taaaagaagg tgatagagta 240
gtagtaaatg ctatcactcc ttgttacaag tgtgtaaatt gccagcgtgg atttacttcc 300
cagtgtacca atatgttggg aggctggaag tttgccaata ttaaggacgg tagtatggct 360
gagtatttcc atgtgaatga tgccgaggca aacctcattt taattcccga agaaatatcc 420
gacgaagaag ctctatatac tgtagacatg atcagtaccg gtttcatggg ggcggaacac 480
gcctgcattc ctcttggagg aaccgtggca atttttggtc agggaccggt tggcctgatg 540
gctaccgcag gggcgcgtct tctgggagca ggccttatta ttactgttga aacaaatcca 600
aaacggcagg agttgtccag aaagttcggg gctgacgtag tagttgattt taaagcggtc 660
gacgcggtcc aggaaatatt taacttaacc ggtggtgttg gtgtagattc agctattgaa 720
tgtttgggct cccagataac ctttgaaaat tgcattaaag caacaaggcc gggtggcact 780
atttccaatg tagggtatca tggtgaaggt gaatatctga tgatacctag ggctgaatgg 840
ggagtaggca tgagcgataa aacaatccgc acaggccttt gcccgggggg aagggagcgt 900
atggaacgtt tgcttcgcct tattgagaca ggccgcatag atccgaggcc gttaacaaca 960
catactttca aatttgccga gattgaaaaa gcctttaata tgatggcaac aaaggaagat 1020
ggtataatta aacctatgat actgtttgat taa 1053
<210> 6
<211> 350
<212> PRT
<213> Peptococcaceae bacterium BRH_c4b
<400> 6
Met Lys Ala Phe Val Met His Glu Ile Gly Lys Ile Gly Ile Met Glu
1 5 10 15
Lys Pro Ile Pro Glu Pro Gly Pro Asn Asp Ala Ile Leu Lys Thr Thr
20 25 30
Gly Ala Leu Val Cys Thr Ser Asp Val His Thr Val His Gly Ala Val
35 40 45
Gly Pro Arg Glu Asn Met Thr Leu Gly His Glu Ala Val Gly Ile Ile
50 55 60
His Lys Leu Gly Ser Ala Val Gln Gly Leu Lys Glu Gly Asp Arg Val
65 70 75 80
Val Val Asn Ala Ile Thr Pro Cys Tyr Lys Cys Val Asn Cys Gln Arg
85 90 95
Gly Phe Thr Ser Gln Cys Thr Asn Met Leu Gly Gly Trp Lys Phe Ala
100 105 110
Asn Ile Lys Asp Gly Ser Met Ala Glu Tyr Phe His Val Asn Asp Ala
115 120 125
Glu Ala Asn Leu Ile Leu Ile Pro Glu Glu Ile Ser Asp Glu Glu Ala
130 135 140
Leu Tyr Thr Val Asp Met Ile Ser Thr Gly Phe Met Gly Ala Glu His
145 150 155 160
Ala Cys Ile Pro Leu Gly Gly Thr Val Ala Ile Phe Gly Gln Gly Pro
165 170 175
Val Gly Leu Met Ala Thr Ala Gly Ala Arg Leu Leu Gly Ala Gly Leu
180 185 190
Ile Ile Thr Val Glu Thr Asn Pro Lys Arg Gln Glu Leu Ser Arg Lys
195 200 205
Phe Gly Ala Asp Val Val Val Asp Phe Lys Ala Val Asp Ala Val Gln
210 215 220
Glu Ile Phe Asn Leu Thr Gly Gly Val Gly Val Asp Ser Ala Ile Glu
225 230 235 240
Cys Leu Gly Ser Gln Ile Thr Phe Glu Asn Cys Ile Lys Ala Thr Arg
245 250 255
Pro Gly Gly Thr Ile Ser Asn Val Gly Tyr His Gly Glu Gly Glu Tyr
260 265 270
Leu Met Ile Pro Arg Ala Glu Trp Gly Val Gly Met Ser Asp Lys Thr
275 280 285
Ile Arg Thr Gly Leu Cys Pro Gly Gly Arg Glu Arg Met Glu Arg Leu
290 295 300
Leu Arg Leu Ile Glu Thr Gly Arg Ile Asp Pro Arg Pro Leu Thr Thr
305 310 315 320
His Thr Phe Lys Phe Ala Glu Ile Glu Lys Ala Phe Asn Met Met Ala
325 330 335
Thr Lys Glu Asp Gly Ile Ile Lys Pro Met Ile Leu Phe Asp
340 345 350
<210> 7
<211> 1050
<212> DNA
<213> Thermoanaerobacteraceae bacterium
<400> 7
atgaaagctt ttgttatgca tggaattaat aaagttggct ttatggaaaa gccaattccc 60
gaacccggac caaatgaggt cctgattaag actaccgcat gcttggtatg tacttccgac 120
gtccataccg taagtggcgc aattggagaa agaaaggatc tgactcttgg ccacgaagcc 180
gtgggaacta tagccaagtt aggtagtgct gttgaaggtt ttacgatcgg acagagagtt 240
gccgttaatg ccattacccc ttgttaccag tgtgaaaatt gccagcgcgg gtacacttct 300
caatgcacgc aaatgttagg gggatggcgt tttgccaatg ttaaagacgg tacctttgcc 360
gagtattttc tggttaacga cgcaaaggcc aacctcgcac ctatcccgga aactattact 420
gatgaacagg ccttatacac aaccgatatg atgagcaccg gttttatggg agccgaaaac 480
gcggatatcc ccttaggcgg gtcagtagcc atatttgccc agggggctgt cggtctaatg 540
gctactgccg gtgcgaggct tttaggagcc ggattaataa tagccgtaga atccgtgccc 600
gaacgcaaaa aactggcaaa acattttggt gctgattacg tgcttgattt taacgaagtt 660
gatgttattg aagaaattaa caagctgact gacggtcaag gagttgactc gtccatagaa 720
gcgctgggat cccaggtaac ttttgaaaat tgcataaagg taacccggcc tggcggaacc 780
atatctaata ttggctatca tggtgagggg gaatatctga aaataccgcg tttggactgg 840
ggcgttggca tgggtgaaaa gactatcaaa accggtctct gtcccggcgg ttctgaaaga 900
atgcgcaggt taatgcgcct aatcgcaaat ggccgcattg acccaactcc actgacaact 960
cacaagtttt catttgatga agtagataaa gcatttgagt atatgaaata taaaaaagat 1020
aacattatta aaccgctgat tactttttaa 1050
<210> 8
<211> 349
<212> PRT
<213> Thermoanaerobacteraceae bacterium
<400> 8
Met Lys Ala Phe Val Met His Gly Ile Asn Lys Val Gly Phe Met Glu
1 5 10 15
Lys Pro Ile Pro Glu Pro Gly Pro Asn Glu Val Leu Ile Lys Thr Thr
20 25 30
Ala Cys Leu Val Cys Thr Ser Asp Val His Thr Val Ser Gly Ala Ile
35 40 45
Gly Glu Arg Lys Asp Leu Thr Leu Gly His Glu Ala Val Gly Thr Ile
50 55 60
Ala Lys Leu Gly Ser Ala Val Glu Gly Phe Thr Ile Gly Gln Arg Val
65 70 75 80
Ala Val Asn Ala Ile Thr Pro Cys Tyr Gln Cys Glu Asn Cys Gln Arg
85 90 95
Gly Tyr Thr Ser Gln Cys Thr Gln Met Leu Gly Gly Trp Arg Phe Ala
100 105 110
Asn Val Lys Asp Gly Thr Phe Ala Glu Tyr Phe Leu Val Asn Asp Ala
115 120 125
Lys Ala Asn Leu Ala Pro Ile Pro Glu Thr Ile Thr Asp Glu Gln Ala
130 135 140
Leu Tyr Thr Thr Asp Met Met Ser Thr Gly Phe Met Gly Ala Glu Asn
145 150 155 160
Ala Asp Ile Pro Leu Gly Gly Ser Val Ala Ile Phe Ala Gln Gly Ala
165 170 175
Val Gly Leu Met Ala Thr Ala Gly Ala Arg Leu Leu Gly Ala Gly Leu
180 185 190
Ile Ile Ala Val Glu Ser Val Pro Glu Arg Lys Lys Leu Ala Lys His
195 200 205
Phe Gly Ala Asp Tyr Val Leu Asp Phe Asn Glu Val Asp Val Ile Glu
210 215 220
Glu Ile Asn Lys Leu Thr Asp Gly Gln Gly Val Asp Ser Ser Ile Glu
225 230 235 240
Ala Leu Gly Ser Gln Val Thr Phe Glu Asn Cys Ile Lys Val Thr Arg
245 250 255
Pro Gly Gly Thr Ile Ser Asn Ile Gly Tyr His Gly Glu Gly Glu Tyr
260 265 270
Leu Lys Ile Pro Arg Leu Asp Trp Gly Val Gly Met Gly Glu Lys Thr
275 280 285
Ile Lys Thr Gly Leu Cys Pro Gly Gly Ser Glu Arg Met Arg Arg Leu
290 295 300
Met Arg Leu Ile Ala Asn Gly Arg Ile Asp Pro Thr Pro Leu Thr Thr
305 310 315 320
His Lys Phe Ser Phe Asp Glu Val Asp Lys Ala Phe Glu Tyr Met Lys
325 330 335
Tyr Lys Lys Asp Asn Ile Ile Lys Pro Leu Ile Thr Phe
340 345
<210> 9
<211> 1056
<212> DNA
<213> Peptococcaceae bacterium BRH_c4b
<400> 9
atgaaagcat tcgtaatgca tgaaatcggc aaggtcggca tcatggacaa gccaatccct 60
gaagcgggtc ccaatgatgc ggttattaaa actaccgcag ctttcgtatg cacatctgac 120
gtacataccg ttcatggcgc attagggcca agggaagacc ggaccctggg tcacgaagca 180
gtaggaatca tctacaaact gggcagtgcc gtacataatc taaaagtagg ggatagagta 240
gcagtaaatg ctatcactcc ttgctataag tgtgaaaatt gccagcgtgg atttacttcc 300
cagtgtaaag gtatgttggg aggttacaag tttaccaact tagtggaagg aaatatggct 360
gagtatttcc acgtaaatga cgccgaggca aatgtcgttc caattcccga atcggtatcc 420
gacgaggaag ctctgtatac tgtagatatg atgactaccg gtttcatggg agcagaacac 480
gccagtattc ctcttggagg caccgtggca atttttggtc agggaccggt cggcctgatg 540
gctactgtag gcgcgcgtct tctgggagcc ggcctgatta taactgttga aacagatcca 600
aaacgacagg agttgtccag aaagtacggg gctgatatag tggttgattt taaagcggtc 660
gacgcagtcc aggaaataat gaatataact ggtggaatag gcgtagattc agctattgaa 720
tgcctgggcg ctcagataac ctttgaaaac tgtattaaag tgacaaggcc ggggggcact 780
atttccaatg tggggtatca cggtggtgac gctgaatata tgttggtacc gagacttgat 840
tggggagtgg gcatgagcga caaaactatc cgcaccggtc tatgcccggg aggaagggag 900
cgtatggttc gcttactccg ccttattgaa acaggccgcg tagatccaaa gccattaaca 960
acacacacct ttaaatttgc cgatgttgaa aaagccttcc agttaatgga atcaaaggaa 1020
gacggtataa ttaaaccaat ggtactgttt gattag 1056
<210> 10
<211> 351
<212> PRT
<213> Peptococcaceae bacterium BRH_c4b
<400> 10
Met Lys Ala Phe Val Met His Glu Ile Gly Lys Val Gly Ile Met Asp
1 5 10 15
Lys Pro Ile Pro Glu Ala Gly Pro Asn Asp Ala Val Ile Lys Thr Thr
20 25 30
Ala Ala Phe Val Cys Thr Ser Asp Val His Thr Val His Gly Ala Leu
35 40 45
Gly Pro Arg Glu Asp Arg Thr Leu Gly His Glu Ala Val Gly Ile Ile
50 55 60
Tyr Lys Leu Gly Ser Ala Val His Asn Leu Lys Val Gly Asp Arg Val
65 70 75 80
Ala Val Asn Ala Ile Thr Pro Cys Tyr Lys Cys Glu Asn Cys Gln Arg
85 90 95
Gly Phe Thr Ser Gln Cys Lys Gly Met Leu Gly Gly Tyr Lys Phe Thr
100 105 110
Asn Leu Val Glu Gly Asn Met Ala Glu Tyr Phe His Val Asn Asp Ala
115 120 125
Glu Ala Asn Val Val Pro Ile Pro Glu Ser Val Ser Asp Glu Glu Ala
130 135 140
Leu Tyr Thr Val Asp Met Met Thr Thr Gly Phe Met Gly Ala Glu His
145 150 155 160
Ala Ser Ile Pro Leu Gly Gly Thr Val Ala Ile Phe Gly Gln Gly Pro
165 170 175
Val Gly Leu Met Ala Thr Val Gly Ala Arg Leu Leu Gly Ala Gly Leu
180 185 190
Ile Ile Thr Val Glu Thr Asp Pro Lys Arg Gln Glu Leu Ser Arg Lys
195 200 205
Tyr Gly Ala Asp Ile Val Val Asp Phe Lys Ala Val Asp Ala Val Gln
210 215 220
Glu Ile Met Asn Ile Thr Gly Gly Ile Gly Val Asp Ser Ala Ile Glu
225 230 235 240
Cys Leu Gly Ala Gln Ile Thr Phe Glu Asn Cys Ile Lys Val Thr Arg
245 250 255
Pro Gly Gly Thr Ile Ser Asn Val Gly Tyr His Gly Gly Asp Ala Glu
260 265 270
Tyr Met Leu Val Pro Arg Leu Asp Trp Gly Val Gly Met Ser Asp Lys
275 280 285
Thr Ile Arg Thr Gly Leu Cys Pro Gly Gly Arg Glu Arg Met Val Arg
290 295 300
Leu Leu Arg Leu Ile Glu Thr Gly Arg Val Asp Pro Lys Pro Leu Thr
305 310 315 320
Thr His Thr Phe Lys Phe Ala Asp Val Glu Lys Ala Phe Gln Leu Met
325 330 335
Glu Ser Lys Glu Asp Gly Ile Ile Lys Pro Met Val Leu Phe Asp
340 345 350
<210> 11
<211> 1053
<212> DNA
<213> Desulfovibrio sulfodismutans
<400> 11
atgaaggcat tcgtcatgct aggcattgga aaggttggca tcatcgacaa gcccattcct 60
gagccaggtc caaatgatgt cattctcaag acgacaagcg cgctcatttg cacctccgat 120
gtacacaccg tcgggggagc catcggcgac agaaaaaatc tcaccttggg acatgaagcc 180
ggtggcgtcg tgtacaagat cggcagtgcg gtgaccgggt tcaaggttgg cgaccgttgt 240
gtggtcaacg ccatcacccc gtgctacaaa tgcgagaact gcctgcgcgg ctttacgtcc 300
cagtgcggtg aggcctgcgg cggctggaaa tacgccaaca tcaaggacgg ttccttcgcg 360
gaatatttcc atgtcaatga cgccatcgcc aacctggtca aggtgcccga cgacgtcacc 420
gacgaggcgg ccctgtacac cacggacatg atggccacgg gatttatggg cgcggaacac 480
ggcaacaccc ccctgggcgg cagcgtggcc gtcttcggcc agggcccggt ggggcttatg 540
gccaccgccg gggcgcggct tttgggcgcg ggactgatca tcgccgtgga aagcgtgccc 600
cagcggcagg agttggccaa gtactacggc gcggacgtga tcgtggactt cagcaaggtc 660
gacgccgtgg ccgagatcaa gcgcctgacc ggcggccagg gcgtggacac cgccatcgaa 720
tgtctgggca tgcaggccac cttcgagaac tgcgtcaagg ccacccggcc cggcggcgtg 780
atttccaact gcggctacca tggcaagggc gagtacgtga agattccccg cgtcgaatgg 840
ggctacggca tggccgacaa gaccatccgc accgggcttt gccccggcgg cagcgagcgc 900
atgggccgtc tgctgcggct catccagacc ggacgcattg atcccacgaa gttgaccacg 960
catcgcctgc ccttcgccga catcgaaaag ggcttccgca tcatggccaa caaggaagac 1020
aacgtgatca agccgatgat catcttctcc taa 1053
<210> 12
<211> 350
<212> PRT
<213> Desulfovibrio sulfodismutans
<400> 12
Met Lys Ala Phe Val Met Leu Gly Ile Gly Lys Val Gly Ile Ile Asp
1 5 10 15
Lys Pro Ile Pro Glu Pro Gly Pro Asn Asp Val Ile Leu Lys Thr Thr
20 25 30
Ser Ala Leu Ile Cys Thr Ser Asp Val His Thr Val Gly Gly Ala Ile
35 40 45
Gly Asp Arg Lys Asn Leu Thr Leu Gly His Glu Ala Gly Gly Val Val
50 55 60
Tyr Lys Ile Gly Ser Ala Val Thr Gly Phe Lys Val Gly Asp Arg Cys
65 70 75 80
Val Val Asn Ala Ile Thr Pro Cys Tyr Lys Cys Glu Asn Cys Leu Arg
85 90 95
Gly Phe Thr Ser Gln Cys Gly Glu Ala Cys Gly Gly Trp Lys Tyr Ala
100 105 110
Asn Ile Lys Asp Gly Ser Phe Ala Glu Tyr Phe His Val Asn Asp Ala
115 120 125
Ile Ala Asn Leu Val Lys Val Pro Asp Asp Val Thr Asp Glu Ala Ala
130 135 140
Leu Tyr Thr Thr Asp Met Met Ala Thr Gly Phe Met Gly Ala Glu His
145 150 155 160
Gly Asn Thr Pro Leu Gly Gly Ser Val Ala Val Phe Gly Gln Gly Pro
165 170 175
Val Gly Leu Met Ala Thr Ala Gly Ala Arg Leu Leu Gly Ala Gly Leu
180 185 190
Ile Ile Ala Val Glu Ser Val Pro Gln Arg Gln Glu Leu Ala Lys Tyr
195 200 205
Tyr Gly Ala Asp Val Ile Val Asp Phe Ser Lys Val Asp Ala Val Ala
210 215 220
Glu Ile Lys Arg Leu Thr Gly Gly Gln Gly Val Asp Thr Ala Ile Glu
225 230 235 240
Cys Leu Gly Met Gln Ala Thr Phe Glu Asn Cys Val Lys Ala Thr Arg
245 250 255
Pro Gly Gly Val Ile Ser Asn Cys Gly Tyr His Gly Lys Gly Glu Tyr
260 265 270
Val Lys Ile Pro Arg Val Glu Trp Gly Tyr Gly Met Ala Asp Lys Thr
275 280 285
Ile Arg Thr Gly Leu Cys Pro Gly Gly Ser Glu Arg Met Gly Arg Leu
290 295 300
Leu Arg Leu Ile Gln Thr Gly Arg Ile Asp Pro Thr Lys Leu Thr Thr
305 310 315 320
His Arg Leu Pro Phe Ala Asp Ile Glu Lys Gly Phe Arg Ile Met Ala
325 330 335
Asn Lys Glu Asp Asn Val Ile Lys Pro Met Ile Ile Phe Ser
340 345 350
<210> 13
<211> 1056
<212> DNA
<213> Candidatus Stahlbacteria
<400> 13
atgaaagcgt ttgtgatgaa aggcatcggg aaggtcggat tcatggagaa acccattcct 60
gaagatccag gaccaaatgg tgccatcatt aagacaacta aagcccttgt ctgcacctct 120
gatacccata cagttgctgg ggcaatcggc gacaggaaag acctcacgct tggtcatgaa 180
gcggttggga ttgtttacaa actgggtagc gaagtgaaag ggataaagga aggtgatcgc 240
gttgcagtta acgcaataac accttgctat aagtgcgaaa actgtttacg aggttacacg 300
tcgcaatgtc agcaaatgct tggtggatgg aagttcgcca acattaagga tggggtcttc 360
gcagaatatt ttcacgtgaa cgatgctgag gcgaatctgg cactaattcc agattccatt 420
cccgatgagg cagctgtata cactacagac atgatgtcca cgggctttat gggtgcagaa 480
catgcgaata tccctctagg gggaactgtt gcaatttttg ctcagggtcc gatagggctc 540
atgtgtaccg tgggtgcacg cctgctgggt gcaggtctcg ttattgcggt tgaaagcatg 600
tcgaaacgaa aggagcttgc taagcatttt ggggctgaca cagttgtgga tttcacaaaa 660
gtagatcctg taaaggatat tctccgattg actgagggca aaggtgttga ctcagccata 720
gaggccttgg gagctcaaga gacctttgaa gcttgcatca aagtaactcg cccaggtggg 780
acgatctcag tagtgggata ctttggcaaa ggagattatg tcaaaattcc aaggcttgaa 840
tggggcgttg gaatgagtga taagaccatc aggacaggac tctgtccggg tggtaaagag 900
cgcatgcagc ggttgcttat gttgattaaa acaggccggg ttgatcctac acctctgacc 960
actcacacat tcaagttcga tgaactcgaa agagccttcc acatgatgga aactaaagag 1020
gacgggataa taaagcccct cataatcttt gactaa 1056
<210> 14
<211> 351
<212> PRT
<213> Candidatus Stahlbacteria
<400> 14
Met Lys Ala Phe Val Met Lys Gly Ile Gly Lys Val Gly Phe Met Glu
1 5 10 15
Lys Pro Ile Pro Glu Asp Pro Gly Pro Asn Gly Ala Ile Ile Lys Thr
20 25 30
Thr Lys Ala Leu Val Cys Thr Ser Asp Thr His Thr Val Ala Gly Ala
35 40 45
Ile Gly Asp Arg Lys Asp Leu Thr Leu Gly His Glu Ala Val Gly Ile
50 55 60
Val Tyr Lys Leu Gly Ser Glu Val Lys Gly Ile Lys Glu Gly Asp Arg
65 70 75 80
Val Ala Val Asn Ala Ile Thr Pro Cys Tyr Lys Cys Glu Asn Cys Leu
85 90 95
Arg Gly Tyr Thr Ser Gln Cys Gln Gln Met Leu Gly Gly Trp Lys Phe
100 105 110
Ala Asn Ile Lys Asp Gly Val Phe Ala Glu Tyr Phe His Val Asn Asp
115 120 125
Ala Glu Ala Asn Leu Ala Leu Ile Pro Asp Ser Ile Pro Asp Glu Ala
130 135 140
Ala Val Tyr Thr Thr Asp Met Met Ser Thr Gly Phe Met Gly Ala Glu
145 150 155 160
His Ala Asn Ile Pro Leu Gly Gly Thr Val Ala Ile Phe Ala Gln Gly
165 170 175
Pro Ile Gly Leu Met Cys Thr Val Gly Ala Arg Leu Leu Gly Ala Gly
180 185 190
Leu Val Ile Ala Val Glu Ser Met Ser Lys Arg Lys Glu Leu Ala Lys
195 200 205
His Phe Gly Ala Asp Thr Val Val Asp Phe Thr Lys Val Asp Pro Val
210 215 220
Lys Asp Ile Leu Arg Leu Thr Glu Gly Lys Gly Val Asp Ser Ala Ile
225 230 235 240
Glu Ala Leu Gly Ala Gln Glu Thr Phe Glu Ala Cys Ile Lys Val Thr
245 250 255
Arg Pro Gly Gly Thr Ile Ser Val Val Gly Tyr Phe Gly Lys Gly Asp
260 265 270
Tyr Val Lys Ile Pro Arg Leu Glu Trp Gly Val Gly Met Ser Asp Lys
275 280 285
Thr Ile Arg Thr Gly Leu Cys Pro Gly Gly Lys Glu Arg Met Gln Arg
290 295 300
Leu Leu Met Leu Ile Lys Thr Gly Arg Val Asp Pro Thr Pro Leu Thr
305 310 315 320
Thr His Thr Phe Lys Phe Asp Glu Leu Glu Arg Ala Phe His Met Met
325 330 335
Glu Thr Lys Glu Asp Gly Ile Ile Lys Pro Leu Ile Ile Phe Asp
340 345 350
<210> 15
<211> 912
<212> DNA
<213> Desulfolutivibrio sulfoxidireducens
<400> 15
atgaaacaca cgcaatgcct cattgtcggc gcggggccag ccgggctttc ggcggccatc 60
tataccgcca gggccggcgt ggacaccctg gccctgggct gcaatcccaa ggtcgcgggc 120
gactacgaca tcgacaacta ttttggtttc ccggagaccg tcaccggccg tgagctcatc 180
gagcgcggca tggcccaggc caaacgtttc ggggccacgc tgcggtgcga gcgggtgatg 240
gccgtgcacc acggcgaaaa cggaaacttt gtggtcaaga gcgatacgga cgaatatgcg 300
gcggacgcgg tgatcatcgc cgccggggtg gcaagggtgc gaccgggcat cgccaacctc 360
gcggactacg agggcaaggg cgtgtcctac tgcgtcagtt gcgacggttt tttctaccgg 420
ggcaaatcgg tcctggtcct cggcgaaggg gactacgccg ccaaccaggc cctggaactg 480
accaccttca ccccccaggt ggccatctac acccagggca aggccccggt catatccgaa 540
ggcttcgcgg cgcgcctctc ccaggcggga atcccggtga tcgaacagac cgtggccacc 600
ctggccggcg aaccggccct ggccgcggcg cgtctggcca acggcaccga actccccgtg 660
gacggcatct tcatcgccat gggccaggcc tcggccctgg atttcgccaa gaccctgggt 720
ttgccgcttc gcggggcgtt catccaggcc gaccacgacc agaaaaccgc cctgcccggg 780
gtgttcgcgg ccggagactg cgtggggcgg tttctccaga tcagcgtggc cgtgggcgaa 840
ggggccaagg ccggaaaatc ggccatcgcc tatctcaaac aggcgggcgc ggcccataca 900
aagacatcgt ga 912
<210> 16
<211> 350
<212> PRT
<213> Desulfolutivibrio sulfoxidireducens
<400> 16
Met Lys Ala Phe Val Met His Gly Ile Gly Lys Val Gly Ile Val Asp
1 5 10 15
Lys Pro Ile Pro Glu Pro Gly Pro Asn Asp Val Ile Leu Lys Thr Thr
20 25 30
Ser Ala Leu Ile Cys Thr Ser Asp Val His Thr Val Gly Gly Ala Ile
35 40 45
Gly Asp Arg Lys Asn Leu Thr Leu Gly His Glu Ala Gly Gly Ile Val
50 55 60
Tyr Lys Ile Gly Ser Ala Val Thr Gly Phe Lys Glu Gly Asp Arg Cys
65 70 75 80
Val Val Asn Ala Ile Thr Pro Cys Tyr Lys Cys Gln Asn Cys Leu Arg
85 90 95
Gly Phe Pro Ser Gln Cys Gly Glu Ala Cys Gly Gly Trp Lys Tyr Ala
100 105 110
Asn Ile Lys Asp Gly Ser Phe Ala Glu Tyr Phe His Val Asn Asp Ala
115 120 125
Ile Ala Asn Leu Val Lys Val Pro Asp Asp Val Thr Asp Glu Ala Ala
130 135 140
Leu Tyr Thr Thr Asp Met Met Ala Thr Gly Phe Met Gly Ala Glu His
145 150 155 160
Gly Asn Thr Pro Leu Gly Gly Ser Val Ala Val Phe Gly Gln Gly Pro
165 170 175
Val Gly Leu Met Ala Thr Ala Gly Ala Arg Leu Leu Gly Ala Gly Leu
180 185 190
Ile Ile Ala Val Glu Ser Val Pro Lys Arg Gln Glu Leu Ala Lys Tyr
195 200 205
Tyr Gly Ala Asp Val Ile Val Asp Phe Ser Lys Val Asp Ala Val Ala
210 215 220
Glu Ile Lys Arg Leu Thr Gly Gly Glu Gly Val Asp Thr Ala Ile Glu
225 230 235 240
Ala Leu Gly Met Gln Pro Thr Phe Glu Asn Cys Val Lys Ala Thr Arg
245 250 255
Pro Gly Gly Thr Ile Ser Asn Cys Gly Tyr His Gly Lys Gly Glu Tyr
260 265 270
Val Lys Ile Pro Arg Val Glu Trp Gly Phe Gly Met Ala Asp Lys Thr
275 280 285
Ile Arg Thr Gly Leu Cys Pro Gly Gly Ser Glu Arg Met Gly Arg Leu
290 295 300
Leu Arg Leu Ile Gln Asn Gly Arg Ile Asp Pro Thr Lys Leu Thr Thr
305 310 315 320
His Arg Met Pro Phe Ser Asp Ile Glu Lys Gly Phe Arg Ile Met Ala
325 330 335
Asn Lys Glu Asp Gly Val Ile Lys Pro Met Ile Leu Phe Ser
340 345 350
<210> 17
<211> 1050
<212> DNA
<213> Desulfobacterales bacterium
<400> 17
atgaaagcat tcgttatgca gggtattggt aaggtaggta tcgtggataa acccattccg 60
gaacccggcc caaatgatgc cgtcatcaaa accaccacag cacttgtctg tacatctgac 120
gttcatactg tcgctggtgc gattggtgac agatacaacc tcacacttgg tcacgaaggt 180
atgggtgttg tttacaaatt ggggagcgaa gtgaaggggt ttaaggaagg cgaccgcgtt 240
gtgatcaatg cgattacccc gtgttaccag tgcacaaact gtcagcgggg gttcacctcc 300
caatgtgagg aagccttagg gggctggaag tatgcgaaca ttaaagatgg ttgctttgcc 360
gaatactttc atgtcaacga cgccaaggcc aatatggtca aggttcctga cgatgttacg 420
gatgaacagg cgctctacac aacagacatg atgtccacag gctttatggg ggctgagcac 480
gcaaacatcc ccttgggtgg cacggttgcc gtttttggac agggtcctgt tggcctgatg 540
gccacagcag gtgcaaaact tttaggcgcg ggtttgatta tcggtgttga atgcgtgcct 600
gaacgccaaa agctgtccag acaatttggt gcagacgaaa tagtggattt tacaaaggtt 660
gacaccattg aagaaatcat gcgcctcact gacggagaag gcgttgactc tgccatagaa 720
gccttgggtt cacaaattac ttttgaaaat tgtgttaagg ccactcgtcc tggtggcacc 780
atttccaatg cgggatatca tggtgatggc gactatgtca aaatacctcg cgcagaatgg 840
ggcgtaggta tggcggataa gacaattcga accggcctgt gccctggtgg aagcgaaaga 900
atggggcgtc tcctccgact cttgcaacgg aatcggattg atccgacggc catgaccaca 960
catcggttcc cctttgacga gatcgaaaaa ggatatcttc tcatgaagac caaagaagat 1020
aatgtattga aaccacttat agaattctag 1050
<210> 18
<211> 349
<212> PRT
<213> Desulfobacterales bacterium
<400> 18
Met Lys Ala Phe Val Met Gln Gly Ile Gly Lys Val Gly Ile Val Asp
1 5 10 15
Lys Pro Ile Pro Glu Pro Gly Pro Asn Asp Ala Val Ile Lys Thr Thr
20 25 30
Thr Ala Leu Val Cys Thr Ser Asp Val His Thr Val Ala Gly Ala Ile
35 40 45
Gly Asp Arg Tyr Asn Leu Thr Leu Gly His Glu Gly Met Gly Val Val
50 55 60
Tyr Lys Leu Gly Ser Glu Val Lys Gly Phe Lys Glu Gly Asp Arg Val
65 70 75 80
Val Ile Asn Ala Ile Thr Pro Cys Tyr Gln Cys Thr Asn Cys Gln Arg
85 90 95
Gly Phe Thr Ser Gln Cys Glu Glu Ala Leu Gly Gly Trp Lys Tyr Ala
100 105 110
Asn Ile Lys Asp Gly Cys Phe Ala Glu Tyr Phe His Val Asn Asp Ala
115 120 125
Lys Ala Asn Met Val Lys Val Pro Asp Asp Val Thr Asp Glu Gln Ala
130 135 140
Leu Tyr Thr Thr Asp Met Met Ser Thr Gly Phe Met Gly Ala Glu His
145 150 155 160
Ala Asn Ile Pro Leu Gly Gly Thr Val Ala Val Phe Gly Gln Gly Pro
165 170 175
Val Gly Leu Met Ala Thr Ala Gly Ala Lys Leu Leu Gly Ala Gly Leu
180 185 190
Ile Ile Gly Val Glu Cys Val Pro Glu Arg Gln Lys Leu Ser Arg Gln
195 200 205
Phe Gly Ala Asp Glu Ile Val Asp Phe Thr Lys Val Asp Thr Ile Glu
210 215 220
Glu Ile Met Arg Leu Thr Asp Gly Glu Gly Val Asp Ser Ala Ile Glu
225 230 235 240
Ala Leu Gly Ser Gln Ile Thr Phe Glu Asn Cys Val Lys Ala Thr Arg
245 250 255
Pro Gly Gly Thr Ile Ser Asn Ala Gly Tyr His Gly Asp Gly Asp Tyr
260 265 270
Val Lys Ile Pro Arg Ala Glu Trp Gly Val Gly Met Ala Asp Lys Thr
275 280 285
Ile Arg Thr Gly Leu Cys Pro Gly Gly Ser Glu Arg Met Gly Arg Leu
290 295 300
Leu Arg Leu Leu Gln Arg Asn Arg Ile Asp Pro Thr Ala Met Thr Thr
305 310 315 320
His Arg Phe Pro Phe Asp Glu Ile Glu Lys Gly Tyr Leu Leu Met Lys
325 330 335
Thr Lys Glu Asp Asn Val Leu Lys Pro Leu Ile Glu Phe
340 345
<210> 19
<211> 1050
<212> DNA
<213> Candidatus Abyssubacteria bacterium SURF_5
<400> 19
atgagagcct ttgtgatgaa agggatcggg gaggtgggca ttgttgaaaa gccgattccc 60
gaggatcccg gcccgaatgg ggcgatcata agaacgacca gggcgcttat ttgcacgtcg 120
gatgcgcaca cggtgttggg cggtatcgcc gagcgcaggg atctcaccct cggccatgag 180
gccgttggca tagtgcatcg gctcggcagc gaggtcaggc acgtcaaaga aggcgaccgc 240
gtcgccgtca atgcgattac gccgtgttat cactgtgaaa actgtcttcg cggctacact 300
tcacagtgca ccacgatgct cggcggttgg aagttcgcaa acacgaaaga cggcgtgttt 360
tccgacttct ttcatgtgaa tgacgccgaa gccaaccttg cgcctattcc cgattccgtt 420
cccgacgagg cggccgttta cacgtgcgac atgatgtcga caggcttcat gggcgccgag 480
aacgcgaata tccctgtcgg gggcattgtg gcggtattcg cgcaaggtcc ggtggggctg 540
atggcgaccg cgggtgcaaa gcttttggga gcgggactga tcatcactgt agaatcgatc 600
ccccggcgca aagagctttc caaacgatac ggagccgatt tggtgatcga cttcaaggat 660
cgtgatcccg tcaaggcgat acttgatctt accgatggca gcggagtgga ttcgagcatc 720
gaatcgctag gatcgcaagc gacgtttgag gcctgcataa aagtgacccg cccgggcggc 780
accatttcct cgatcgggta ctacggcaaa ggagactacg tcaagattcc gcgcgtcgaa 840
tggggggttg ggatgggtga taaaaccatc cggaccggcc tgtgcccggg cggcaaggag 900
cggatgaagc ggctgttgag gatgatcgag aacgggcgaa tcgacccgac gcctttgact 960
tcgcacacgt tcggttttga cgagatcgag aaggcgttcc acttgatgga aacgaaaggc 1020
gataacatca taaagccgct tatcatcttt 1050
<210> 20
<211> 351
<212> PRT
<213> Candidatus Abyssubacteria bacterium SURF_5
<400> 20
Met Arg Ala Phe Val Met Lys Gly Ile Gly Glu Val Gly Ile Val Glu
1 5 10 15
Lys Pro Ile Pro Glu Asp Pro Gly Pro Asn Gly Ala Ile Ile Arg Thr
20 25 30
Thr Arg Ala Leu Ile Cys Thr Ser Asp Ala His Thr Val Leu Gly Gly
35 40 45
Ile Ala Glu Arg Arg Asp Leu Thr Leu Gly His Glu Ala Val Gly Ile
50 55 60
Val His Arg Leu Gly Ser Glu Val Arg His Val Lys Glu Gly Asp Arg
65 70 75 80
Val Ala Val Asn Ala Ile Thr Pro Cys Tyr His Cys Glu Asn Cys Leu
85 90 95
Arg Gly Tyr Thr Ser Gln Cys Thr Thr Met Leu Gly Gly Trp Lys Phe
100 105 110
Ala Asn Thr Lys Asp Gly Val Phe Ser Asp Phe Phe His Val Asn Asp
115 120 125
Ala Glu Ala Asn Leu Ala Pro Ile Pro Asp Ser Val Pro Asp Glu Ala
130 135 140
Ala Val Tyr Thr Cys Asp Met Met Ser Thr Gly Phe Met Gly Ala Glu
145 150 155 160
Asn Ala Asn Ile Pro Val Gly Gly Ile Val Ala Val Phe Ala Gln Gly
165 170 175
Pro Val Gly Leu Met Ala Thr Ala Gly Ala Lys Leu Leu Gly Ala Gly
180 185 190
Leu Ile Ile Thr Val Glu Ser Ile Pro Arg Arg Lys Glu Leu Ser Lys
195 200 205
Arg Tyr Gly Ala Asp Leu Val Ile Asp Phe Lys Asp Arg Asp Pro Val
210 215 220
Lys Ala Ile Leu Asp Leu Thr Asp Gly Ser Gly Val Asp Ser Ser Ile
225 230 235 240
Glu Ser Leu Gly Ser Gln Ala Thr Phe Glu Ala Cys Ile Lys Val Thr
245 250 255
Arg Pro Gly Gly Thr Ile Ser Ser Ile Gly Tyr Tyr Gly Lys Gly Asp
260 265 270
Tyr Val Lys Ile Pro Arg Val Glu Trp Gly Val Gly Met Gly Asp Lys
275 280 285
Thr Ile Arg Thr Gly Leu Cys Pro Gly Gly Lys Glu Arg Met Lys Arg
290 295 300
Leu Leu Arg Met Ile Glu Asn Gly Arg Ile Asp Pro Thr Pro Leu Thr
305 310 315 320
Ser His Thr Phe Gly Phe Asp Glu Ile Glu Lys Ala Phe His Leu Met
325 330 335
Glu Thr Lys Gly Asp Asn Ile Ile Lys Pro Leu Ile Ile Phe Asp
340 345 350
<210> 21
<211> 1053
<212> DNA
<213> Nitrospirales bacterium
<400> 21
atgaaagcct ttgtgatgaa acggatcggc gaggtgggcg tgatggagaa gccaataccc 60
gatccgggtc ccaatgacgc aatcgttaag accactgccg cactcgtctg tacatcagac 120
atccatacgg tagctggttc cattggggaa cgggggaatt taactctcgg gcatgaagcg 180
gtcggagtcg tacacaaact cggccatgcg gtaaagggat ttcgagaagg tgatcgggtc 240
gtcgtgaatg ccattacccc atgttacctc tgtgagaatt gtttgcgagg gtacacctcc 300
caatgcaccg agatgcttgg aggttggaaa ttcgccaatg tgaaagatgg caacatggcg 360
gaatattttc atgtcaacag cgcgcaagcc aatctggcgc ccatcccggc tagtttgacg 420
gatgagcagg cactctattg cacggacatg atgtcgaccg gcttcatggg cgccgaacac 480
gccaacattc ccatcggtgg aagcgtcgcc atttttgcgc agggacccgt cggattgatg 540
gcgacagtcg gtgcaaaatt gttaggagca ggtcttatca tcgccgtaga aacagtgccg 600
catcggcaag aactggccaa gcggtttggt gcagatgtgg tcatcgattt caaaaaccaa 660
gaccctgtca ccgcgattct tgatctaacg gatgggctag gtgtggatgc ggctattgag 720
gcactgggac tccaggtaag tttcgagggc tgtatcaagg cgacgcgacc tgggggaacg 780
atttcaaaca ttggctatca cggcgatggg gagtttgtgg agattccaag ggcagcctgg 840
ggggtcggga tgggtgataa gaccattcga accgggcttt gtccgggagg agctgagcgg 900
atgaaacgac tcatgcgact ccttgtgatg ggtcgggtgg atccaacccc gctcacgact 960
catcgcttca acttcagcga ggtggaaaag gctttctcca tgatgaaaac caaagaggat 1020
ggcatgctga aacccttgat ccttttcgac taa 1053
<210> 22
<211> 350
<212> PRT
<213> Nitrospirales bacterium
<400> 22
Met Lys Ala Phe Val Met Lys Arg Ile Gly Glu Val Gly Val Met Glu
1 5 10 15
Lys Pro Ile Pro Asp Pro Gly Pro Asn Asp Ala Ile Val Lys Thr Thr
20 25 30
Ala Ala Leu Val Cys Thr Ser Asp Ile His Thr Val Ala Gly Ser Ile
35 40 45
Gly Glu Arg Gly Asn Leu Thr Leu Gly His Glu Ala Val Gly Val Val
50 55 60
His Lys Leu Gly His Ala Val Lys Gly Phe Arg Glu Gly Asp Arg Val
65 70 75 80
Val Val Asn Ala Ile Thr Pro Cys Tyr Leu Cys Glu Asn Cys Leu Arg
85 90 95
Gly Tyr Thr Ser Gln Cys Thr Glu Met Leu Gly Gly Trp Lys Phe Ala
100 105 110
Asn Val Lys Asp Gly Asn Met Ala Glu Tyr Phe His Val Asn Ser Ala
115 120 125
Gln Ala Asn Leu Ala Pro Ile Pro Ala Ser Leu Thr Asp Glu Gln Ala
130 135 140
Leu Tyr Cys Thr Asp Met Met Ser Thr Gly Phe Met Gly Ala Glu His
145 150 155 160
Ala Asn Ile Pro Ile Gly Gly Ser Val Ala Ile Phe Ala Gln Gly Pro
165 170 175
Val Gly Leu Met Ala Thr Val Gly Ala Lys Leu Leu Gly Ala Gly Leu
180 185 190
Ile Ile Ala Val Glu Thr Val Pro His Arg Gln Glu Leu Ala Lys Arg
195 200 205
Phe Gly Ala Asp Val Val Ile Asp Phe Lys Asn Gln Asp Pro Val Thr
210 215 220
Ala Ile Leu Asp Leu Thr Asp Gly Leu Gly Val Asp Ala Ala Ile Glu
225 230 235 240
Ala Leu Gly Leu Gln Val Ser Phe Glu Gly Cys Ile Lys Ala Thr Arg
245 250 255
Pro Gly Gly Thr Ile Ser Asn Ile Gly Tyr His Gly Asp Gly Glu Phe
260 265 270
Val Glu Ile Pro Arg Ala Ala Trp Gly Val Gly Met Gly Asp Lys Thr
275 280 285
Ile Arg Thr Gly Leu Cys Pro Gly Gly Ala Glu Arg Met Lys Arg Leu
290 295 300
Met Arg Leu Leu Val Met Gly Arg Val Asp Pro Thr Pro Leu Thr Thr
305 310 315 320
His Arg Phe Asn Phe Ser Glu Val Glu Lys Ala Phe Ser Met Met Lys
325 330 335
Thr Lys Glu Asp Gly Met Leu Lys Pro Leu Ile Leu Phe Asp
340 345 350
<210> 23
<211> 1053
<212> DNA
<213> Nitrospirales bacterium
<400> 23
atgaaagcct ttgtgatgaa acggatcggc gaggtgggcg tgatggagaa accaataccc 60
gatccgggtc ccaatgacgc aatcgttaag accactgccg cactcgtctg tacatcagac 120
atccatacgg tagctggttc cattgggcaa cgggggaatt taactctcgg gcatgaagcg 180
gtcggagtcg tacacaaact cggccatgcg gtaaagggat ttcgagaagg tgatcgggtc 240
gtcgtgaatg ccattacccc atgttacctc tgtgagaatt gtttgcgagg gtacacctcc 300
caatgcaccg agatgcttgg aggttggaaa ttcgccaatg tgaaagatgg caacatggcg 360
gaatattttc atgtcaacag cgcgcaagcc aatctggcgc ccatcccggc tagtttgacg 420
gatgagcagg cactctattg cacggacatg atgtcgaccg gcttcatggg cgccgaacac 480
gccaacattc ccatcggtgg aagcgtcgcc atttttgcgc agggacccgt cggattgatg 540
gcgacagtcg gtgcaaaatt gttaggagca ggtcttatca tcgccgtaga aacagtgccg 600
catcggcaag aactggccaa gcggtttggt gcagatgtgg tcatcgattt caaaaaccaa 660
gaccctgtca ccgcgattct tgatctaacg gatgggctag gtgtggatgc ggctattgag 720
gcactgggac tccaggtaag tttcgagggc tgtatcaagg cgacgcgacc tgggggaacg 780
atttcaaaca ttggctatca cggcgatggg gagtttgtgg agattccaag ggcagcctgg 840
ggggtcggga tgggtgataa gaccattcga accgggcttt gtccgggagg agctgagcgg 900
atgaaacgac tcatgcgact ccttgtgatg ggtcgggtgg atccaacccc gctcacgact 960
catcgcttca acttcagcga ggtggaaaag gctttctcca tgatgaaaac caaagaggat 1020
ggcatgctga aacccttgat ccttttcgac taa 1053
<210> 24
<211> 350
<212> PRT
<213> Nitrospirales bacterium
<400> 24
Met Lys Ala Phe Val Met Lys Arg Ile Gly Glu Val Gly Val Met Glu
1 5 10 15
Lys Pro Ile Pro Asp Pro Gly Pro Asn Asp Ala Ile Val Lys Thr Thr
20 25 30
Ala Ala Leu Val Cys Thr Ser Asp Ile His Thr Val Ala Gly Ser Ile
35 40 45
Gly Gln Arg Gly Asn Leu Thr Leu Gly His Glu Ala Val Gly Val Val
50 55 60
His Lys Leu Gly His Ala Val Lys Gly Phe Arg Glu Gly Asp Arg Val
65 70 75 80
Val Val Asn Ala Ile Thr Pro Cys Tyr Leu Cys Glu Asn Cys Leu Arg
85 90 95
Gly Tyr Thr Ser Gln Cys Thr Glu Met Leu Gly Gly Trp Lys Phe Ala
100 105 110
Asn Val Lys Asp Gly Asn Met Ala Glu Tyr Phe His Val Asn Ser Ala
115 120 125
Gln Ala Asn Leu Ala Pro Ile Pro Ala Ser Leu Thr Asp Glu Gln Ala
130 135 140
Leu Tyr Cys Thr Asp Met Met Ser Thr Gly Phe Met Gly Ala Glu His
145 150 155 160
Ala Asn Ile Pro Ile Gly Gly Ser Val Ala Ile Phe Ala Gln Gly Pro
165 170 175
Val Gly Leu Met Ala Thr Val Gly Ala Lys Leu Leu Gly Ala Gly Leu
180 185 190
Ile Ile Ala Val Glu Thr Val Pro His Arg Gln Glu Leu Ala Lys Arg
195 200 205
Phe Gly Ala Asp Val Val Ile Asp Phe Lys Asn Gln Asp Pro Val Thr
210 215 220
Ala Ile Leu Asp Leu Thr Asp Gly Leu Gly Val Asp Ala Ala Ile Glu
225 230 235 240
Ala Leu Gly Leu Gln Val Ser Phe Glu Gly Cys Ile Lys Ala Thr Arg
245 250 255
Pro Gly Gly Thr Ile Ser Asn Ile Gly Tyr His Gly Asp Gly Glu Phe
260 265 270
Val Glu Ile Pro Arg Ala Ala Trp Gly Val Gly Met Gly Asp Lys Thr
275 280 285
Ile Arg Thr Gly Leu Cys Pro Gly Gly Ala Glu Arg Met Lys Arg Leu
290 295 300
Met Arg Leu Leu Val Met Gly Arg Val Asp Pro Thr Pro Leu Thr Thr
305 310 315 320
His Arg Phe Asn Phe Ser Glu Val Glu Lys Ala Phe Ser Met Met Lys
325 330 335
Thr Lys Glu Asp Gly Met Leu Lys Pro Leu Ile Leu Phe Asp
340 345 350
<210> 25
<211> 1065
<212> DNA
<213> Nitrospirales bacterium
<400> 25
atgacccgga caatgaaagc ctttgtaatg cggaagctcg gctcggtggg agtgatggag 60
aagccgattc ctgatcctgg accgaatgat gcgatcgtca aaaccacggc cgcgctcatc 120
tgcacctcgg atgtccatac cgttgacggc gcgatcggag aacggaccaa cctgaccctg 180
ggccatgaag ccgtcggcgt catttacaaa ctcggaagcg ccgtcaaggg gcttcgtgag 240
ggagatcggg tcggagtcaa cgccattact ccctgctatc agtgcgagaa ttgcctgcgc 300
ggctacacct cgcagtgtca ggagatgctc ggcggctgga agttcgccaa cgtcaaggat 360
ggaaacctcg cggaatattt ccatgtcaat agcgcgcaag cgaacctggc gccgattcca 420
acgggtctta ccgacgaaca agtcgcgtac tgcgcggaca tgatgtccac cgggttcatg 480
ggggctgagc atgcgaacat tcctgttggg ggctcagtgg ccgtctttgc gcagggacca 540
gtcggactga tggccaccgt gggggcccgg ttactgggag ccggcttggt catcgcagtg 600
gaggccgtcc cgcagcgcaa gaaactggcg aaagagttcg gggcggatgt ggtgatcgac 660
ttcaaggaac aggatcccgt cgaggtcatt ctcggtttga cgggcggaca aggcgtggat 720
tcgtccatcg aagccctcgg ggctcaggcg acgttcgaag cctgtatcaa agcgacacgg 780
cccggcggga ccatctctaa cgtgggctat cacggagagg gagaatacgt gcagatgcct 840
cgtaaagaat ggggcgtcgg gatgagcgac aaggtcatcc gaacggggct gtgtcccggc 900
ggagcggaac ggatgaaacg gctgatgcgg ctcttagaaa ccgggcgcgt caatccactc 960
cctctgacga cccatcgatt caatttccac gatgtggaga aggcctttga gttgatgcga 1020
acgaaggcag acggcatgtt gaagccgctc atcacgtttg cgtga 1065
<210> 26
<211> 354
<212> PRT
<213> Nitrospirales bacterium
<400> 26
Met Thr Arg Thr Met Lys Ala Phe Val Met Arg Lys Leu Gly Ser Val
1 5 10 15
Gly Val Met Glu Lys Pro Ile Pro Asp Pro Gly Pro Asn Asp Ala Ile
20 25 30
Val Lys Thr Thr Ala Ala Leu Ile Cys Thr Ser Asp Val His Thr Val
35 40 45
Asp Gly Ala Ile Gly Glu Arg Thr Asn Leu Thr Leu Gly His Glu Ala
50 55 60
Val Gly Val Ile Tyr Lys Leu Gly Ser Ala Val Lys Gly Leu Arg Glu
65 70 75 80
Gly Asp Arg Val Gly Val Asn Ala Ile Thr Pro Cys Tyr Gln Cys Glu
85 90 95
Asn Cys Leu Arg Gly Tyr Thr Ser Gln Cys Gln Glu Met Leu Gly Gly
100 105 110
Trp Lys Phe Ala Asn Val Lys Asp Gly Asn Leu Ala Glu Tyr Phe His
115 120 125
Val Asn Ser Ala Gln Ala Asn Leu Ala Pro Ile Pro Thr Gly Leu Thr
130 135 140
Asp Glu Gln Val Ala Tyr Cys Ala Asp Met Met Ser Thr Gly Phe Met
145 150 155 160
Gly Ala Glu His Ala Asn Ile Pro Val Gly Gly Ser Val Ala Val Phe
165 170 175
Ala Gln Gly Pro Val Gly Leu Met Ala Thr Val Gly Ala Arg Leu Leu
180 185 190
Gly Ala Gly Leu Val Ile Ala Val Glu Ala Val Pro Gln Arg Lys Lys
195 200 205
Leu Ala Lys Glu Phe Gly Ala Asp Val Val Ile Asp Phe Lys Glu Gln
210 215 220
Asp Pro Val Glu Val Ile Leu Gly Leu Thr Gly Gly Gln Gly Val Asp
225 230 235 240
Ser Ser Ile Glu Ala Leu Gly Ala Gln Ala Thr Phe Glu Ala Cys Ile
245 250 255
Lys Ala Thr Arg Pro Gly Gly Thr Ile Ser Asn Val Gly Tyr His Gly
260 265 270
Glu Gly Glu Tyr Val Gln Met Pro Arg Lys Glu Trp Gly Val Gly Met
275 280 285
Ser Asp Lys Val Ile Arg Thr Gly Leu Cys Pro Gly Gly Ala Glu Arg
290 295 300
Met Lys Arg Leu Met Arg Leu Leu Glu Thr Gly Arg Val Asn Pro Leu
305 310 315 320
Pro Leu Thr Thr His Arg Phe Asn Phe His Asp Val Glu Lys Ala Phe
325 330 335
Glu Leu Met Arg Thr Lys Ala Asp Gly Met Leu Lys Pro Leu Ile Thr
340 345 350
Phe Ala
<210> 27
<211> 1053
<212> DNA
<213> Planctomycetaceae bacterium
<400> 27
atgaaagcat ttgtcatgaa gaagctcggc tcggtgggga tgatggagaa gccaattcct 60
gatcccggac caaacgatgc cgttgtcaag acgaccgccg cactcgtctg tacgtccgat 120
gtccatacgg tcggcggggc gatcggtgaa cgcaccaacc tgaccctggg ccatgaggcc 180
gtgggggtca tttacaaatt gggcagcgcc gtgaagggat ttcgtgaggg agatcgagtc 240
gcagtcaacg ccatcacgcc ctgttatcag tgcgagaact gtttgcgcgg ttacacctct 300
caatgtaccg agatgctcgg cgggtggaag ttcgcgaacg tgaaggatgg gaatctcgcc 360
gaatatttcc atgtaaacag cgcgcaagcg aacctggcga cgattcctcc gagtctcaca 420
gatgagcagg tggcttactg cacggacatg atgtcgaccg ggttcatggg agctgagcat 480
gcgaacattc ccgttggggg cgcagtggct gtcttcgcgc agggaccagt cggactgatg 540
gccaccgtgg gggcccggtt attgggggcc ggcttggtca ttgcggtgga ggtggtcccg 600
cagcgcaaga agctagcgaa agagttcggg gccgatgtcg tgctggactt caaggaacag 660
gatccggtca gggccattct ggatttgacc ggcggacagg gggtggactc gtccatcgag 720
gccttggggg ctcaggagac cttcgaggcc tgcgtcaagg ccacgcgtcc gggcgggacg 780
atctcgaata ttggatacca tggcgaagga gattatgtgc agatgcctcg caaggagtgg 840
ggggtcggga tgggcgataa ggtgatcagg acgggtctct gccccggggg agccgaacgg 900
atgaagcggc tgatgcgact cttggaaacc ggccgggtca atccactccc actgacgacg 960
catcggttcc gattcaatga catcgaaaag gccttcgacc tgatgagaac gaaggcagac 1020
ggcatattga agccgctcat cacgtttatg tga 1053
<210> 28
<211> 350
<212> PRT
<213> Planctomycetaceae bacterium
<400> 28
Met Lys Ala Phe Val Met Lys Lys Leu Gly Ser Val Gly Met Met Glu
1 5 10 15
Lys Pro Ile Pro Asp Pro Gly Pro Asn Asp Ala Val Val Lys Thr Thr
20 25 30
Ala Ala Leu Val Cys Thr Ser Asp Val His Thr Val Gly Gly Ala Ile
35 40 45
Gly Glu Arg Thr Asn Leu Thr Leu Gly His Glu Ala Val Gly Val Ile
50 55 60
Tyr Lys Leu Gly Ser Ala Val Lys Gly Phe Arg Glu Gly Asp Arg Val
65 70 75 80
Ala Val Asn Ala Ile Thr Pro Cys Tyr Gln Cys Glu Asn Cys Leu Arg
85 90 95
Gly Tyr Thr Ser Gln Cys Thr Glu Met Leu Gly Gly Trp Lys Phe Ala
100 105 110
Asn Val Lys Asp Gly Asn Leu Ala Glu Tyr Phe His Val Asn Ser Ala
115 120 125
Gln Ala Asn Leu Ala Thr Ile Pro Pro Ser Leu Thr Asp Glu Gln Val
130 135 140
Ala Tyr Cys Thr Asp Met Met Ser Thr Gly Phe Met Gly Ala Glu His
145 150 155 160
Ala Asn Ile Pro Val Gly Gly Ala Val Ala Val Phe Ala Gln Gly Pro
165 170 175
Val Gly Leu Met Ala Thr Val Gly Ala Arg Leu Leu Gly Ala Gly Leu
180 185 190
Val Ile Ala Val Glu Val Val Pro Gln Arg Lys Lys Leu Ala Lys Glu
195 200 205
Phe Gly Ala Asp Val Val Leu Asp Phe Lys Glu Gln Asp Pro Val Arg
210 215 220
Ala Ile Leu Asp Leu Thr Gly Gly Gln Gly Val Asp Ser Ser Ile Glu
225 230 235 240
Ala Leu Gly Ala Gln Glu Thr Phe Glu Ala Cys Val Lys Ala Thr Arg
245 250 255
Pro Gly Gly Thr Ile Ser Asn Ile Gly Tyr His Gly Glu Gly Asp Tyr
260 265 270
Val Gln Met Pro Arg Lys Glu Trp Gly Val Gly Met Gly Asp Lys Val
275 280 285
Ile Arg Thr Gly Leu Cys Pro Gly Gly Ala Glu Arg Met Lys Arg Leu
290 295 300
Met Arg Leu Leu Glu Thr Gly Arg Val Asn Pro Leu Pro Leu Thr Thr
305 310 315 320
His Arg Phe Arg Phe Asn Asp Ile Glu Lys Ala Phe Asp Leu Met Arg
325 330 335
Thr Lys Ala Asp Gly Ile Leu Lys Pro Leu Ile Thr Phe Met
340 345 350
<210> 29
<211> 1053
<212> DNA
<213> Anaerolineales bacterium
<400> 29
atgaaagcat tcgtcatgaa gaaaattggt gaagtaggtt ttaccgaaaa accaatccct 60
gaaccgggac ctaatgatgc tgtcataaaa acaaccaaag cgttggtttg cacctcggat 120
gttcataccg ttaaaggcgc aatcggtgaa agagagaacc aaacccttgg gcatgaggct 180
gttggtgtcg cttataaact agggagcgaa gtaaacgagg tgaaagaagg cgaccgcgtt 240
gccgtaaatg ctattacacc atgctataaa tgtgaaaact gcatacgagg ttttacatct 300
caatgtcaac aactgttagg tggttggaaa tttgctaata tcaaagatgg ggtcttttca 360
gaatatttcc atgtcaacga tgctgaagcc aatcttacgt taattccaga tagtgtccct 420
gatgaagcag cagtctacgc tgccgatatg ctctctactg gttttatggg agctgaaaat 480
gctgatatac ctatgggtgg cagcgttgct gtttttgcac aaggtccagt tgggcttatg 540
gctactgtag gcgcacgtct tcaaggagca agtctagtaa ttgcagtcga caccattccc 600
aggagaatgg aattggcaaa acaatacgga gctgattttg tgatcgattt tcgaaaagct 660
gaacccgtgg aagaaattct tcgtttgact gatggaaaag gcgtagattc tacaatagaa 720
gctttaggcg ctcaaatcac attcgaagca tgtattaaag taactaaacc tggaggcaca 780
atctcgatca ccggttactt cggcgaaggt gattatgtaa aaattcccag actcgaatgg 840
ggagtaggga tgggtgataa gaccatccgc accggtcttt gtccaggagg caaagtacgc 900
atgcagaggt tgctacggct gatcgaaaat ggtcgcgtag atcccactct gatgactacc 960
cataaattca aattcgatga ggtggataaa gcttttcatc tgatggatac caaagatgat 1020
gatatattaa aaccccttat cctattcgat tga 1053
<210> 30
<211> 350
<212> PRT
<213> Anaerolineales bacterium
<400> 30
Met Lys Ala Phe Val Met Lys Lys Ile Gly Glu Val Gly Phe Thr Glu
1 5 10 15
Lys Pro Ile Pro Glu Pro Gly Pro Asn Asp Ala Val Ile Lys Thr Thr
20 25 30
Lys Ala Leu Val Cys Thr Ser Asp Val His Thr Val Lys Gly Ala Ile
35 40 45
Gly Glu Arg Glu Asn Gln Thr Leu Gly His Glu Ala Val Gly Val Ala
50 55 60
Tyr Lys Leu Gly Ser Glu Val Asn Glu Val Lys Glu Gly Asp Arg Val
65 70 75 80
Ala Val Asn Ala Ile Thr Pro Cys Tyr Lys Cys Glu Asn Cys Ile Arg
85 90 95
Gly Phe Thr Ser Gln Cys Gln Gln Leu Leu Gly Gly Trp Lys Phe Ala
100 105 110
Asn Ile Lys Asp Gly Val Phe Ser Glu Tyr Phe His Val Asn Asp Ala
115 120 125
Glu Ala Asn Leu Thr Leu Ile Pro Asp Ser Val Pro Asp Glu Ala Ala
130 135 140
Val Tyr Ala Ala Asp Met Leu Ser Thr Gly Phe Met Gly Ala Glu Asn
145 150 155 160
Ala Asp Ile Pro Met Gly Gly Ser Val Ala Val Phe Ala Gln Gly Pro
165 170 175
Val Gly Leu Met Ala Thr Val Gly Ala Arg Leu Gln Gly Ala Ser Leu
180 185 190
Val Ile Ala Val Asp Thr Ile Pro Arg Arg Met Glu Leu Ala Lys Gln
195 200 205
Tyr Gly Ala Asp Phe Val Ile Asp Phe Arg Lys Ala Glu Pro Val Glu
210 215 220
Glu Ile Leu Arg Leu Thr Asp Gly Lys Gly Val Asp Ser Thr Ile Glu
225 230 235 240
Ala Leu Gly Ala Gln Ile Thr Phe Glu Ala Cys Ile Lys Val Thr Lys
245 250 255
Pro Gly Gly Thr Ile Ser Ile Thr Gly Tyr Phe Gly Glu Gly Asp Tyr
260 265 270
Val Lys Ile Pro Arg Leu Glu Trp Gly Val Gly Met Gly Asp Lys Thr
275 280 285
Ile Arg Thr Gly Leu Cys Pro Gly Gly Lys Val Arg Met Gln Arg Leu
290 295 300
Leu Arg Leu Ile Glu Asn Gly Arg Val Asp Pro Thr Leu Met Thr Thr
305 310 315 320
His Lys Phe Lys Phe Asp Glu Val Asp Lys Ala Phe His Leu Met Asp
325 330 335
Thr Lys Asp Asp Asp Ile Leu Lys Pro Leu Ile Leu Phe Asp
340 345 350
<210> 31
<211> 1059
<212> DNA
<213> Desulfotomaculum arcticum
<400> 31
atgaaagctt ttgtaatgca cggaataggg cgcgtgggca tcatggaaaa gccgctgccg 60
gcagatccgg gtccaaacga cgcgatcatt aaaaccaccg cggctttggt ttgtacctcg 120
gacgttcaca ccgtgcacgg cgcgattggt gagaaggtgg atctgaccct gggccatgaa 180
gcgttgggaa tcatccacaa gctgggttcc gcggtcgaga gcctgaaagt tggcgaccgc 240
gtcgttgtga acgccatcac cccgtgttat aagtgcgaaa actgccagcg tggttttacc 300
agccaatgta ccaccatgct gggcgggtgg aaattcgcta atatcaagga cggctgcttc 360
gccgaatatt tccacgtaaa cgacgcggat gcaaatctgg tcccggttcc gtcgagcgtg 420
agcgacgagg cggcgctcta cgctacggat atgatgtcaa ccggtttcat gggtgctgag 480
cacgccggta ttccgctggg cggtacggtt gcaatcttcg gtcaaggtcc ggttggcctg 540
atggcgactg cgggctctcg cctgttgggc gctggtctca ttattgcggt tgagacagtg 600
ccgcatcgtc agcagctgag caaatactac ggcgcagaca tcatcctgga ttttaagaaa 660
gtggatgttg tggaggaaat tttgcgcatc accaatggcc aaggtgtcga cagcgcgata 720
gaagcgttag gcacgcaggt taccttcgag aactgcatta aggcaacccg tcctggtggc 780
acgatcagca acattggtta tcatggcgag ggtgattacc tgaaaattcc gcgtgtggac 840
tggggtgtgg gtatgtccga caaaaccatc cgtaccggtc tgtgcccggg cggtcgtgaa 900
cgtatgagtc gtcttctgcg tttgatcgaa tctggtagaa ttgatccgac tgcgttgact 960
acgcataaat tccgctttga tgaaatcgag aaggcctttg agctgatggc tatcaagggt 1020
gataatgtta ttaaaccact tattctgttt gaagattaa 1059
<210> 32
<211> 352
<212> PRT
<213> Desulfotomaculum arcticum
<400> 32
Met Lys Ala Phe Val Met His Gly Ile Gly Arg Val Gly Ile Met Glu
1 5 10 15
Lys Pro Leu Pro Ala Asp Pro Gly Pro Asn Asp Ala Ile Ile Lys Thr
20 25 30
Thr Ala Ala Leu Val Cys Thr Ser Asp Val His Thr Val His Gly Ala
35 40 45
Ile Gly Glu Lys Val Asp Leu Thr Leu Gly His Glu Ala Leu Gly Ile
50 55 60
Ile His Lys Leu Gly Ser Ala Val Glu Ser Leu Lys Val Gly Asp Arg
65 70 75 80
Val Val Val Asn Ala Ile Thr Pro Cys Tyr Lys Cys Glu Asn Cys Gln
85 90 95
Val Gly Phe Thr Ser Gln Cys Thr Thr Met Leu Gly Gly Trp Lys Phe
100 105 110
Ala Asn Ile Lys Asp Gly Cys Phe Ala Glu Tyr Phe His Val Asn Asp
115 120 125
Ala Asp Ala Asn Leu Val Pro Val Pro Ser Ser Val Ser Asp Glu Ala
130 135 140
Ala Leu Lys Ala Thr Asp Met Met Ser Thr Gly Phe Met Gly Ala Glu
145 150 155 160
His Ala Gly Ile Pro Leu Gly Gly Thr Val Ala Ile Phe Gly Gln Gly
165 170 175
Pro Val Gly Leu Met Ala Thr Ala Gly Ser Arg Leu Leu Gly Ala Gly
180 185 190
Leu Ile Ile Ala Val Glu Thr Val Pro His Arg Gln Gln Leu Ser Lys
195 200 205
Tyr Tyr Gly Ala Asp Ile Ile Leu Asp Phe Lys Lys Val Asp Val Val
210 215 220
Glu Glu Ile Leu Arg Ile Thr Asn Gly Gln Gly Val Asp Ser Ala Ile
225 230 235 240
Glu Ala Leu Gly Thr Gln Val Thr Phe Glu Asn Cys Ile Lys Ala Thr
245 250 255
Arg Pro Gly Gly Thr Ile Ser Asn Ile Gly Tyr His Gly Glu Gly Asp
260 265 270
Tyr Leu Lys Ile Pro Arg Val Asp Trp Gly Val Gly Met Ser Asp Lys
275 280 285
Thr Ile Arg Thr Gly Leu Cys Pro Gly Gly Arg Glu Arg Met Ser Arg
290 295 300
Leu Leu Arg Leu Ile Glu Ser Gly Arg Ile Asp Pro Thr Ala Leu Thr
305 310 315 320
Thr His Lys Phe Arg Phe Asp Glu Ile Glu Lys Ala Phe Glu Leu Met
325 330 335
Ala Ile Lys Gly Asp Asn Val Ile Lys Pro Leu Ile Leu Phe Glu Asp
340 345 350
<210> 33
<211> 1053
<212> DNA
<213> Desulfonatronovibrio magnus
<400> 33
atgaaagcgt ttgttgttca cagtatcggt aaagttggaa taatggagaa accagttcct 60
gaaccagggc caaatgatgt aattgtaaag acaaccaacg ccttgatttg tacatctgat 120
gtacataccg ttgccggtgc tattggcgag aaaagcgacc tgactcttgg gcacgaaggt 180
gcaggtactg tttataagat tggcagtgcg gttaaagggt tcaaggaggg cgagagagtt 240
ctggttaatg ccataactcc ctgtttcaag tgtcacaact gccagcgcgg ctatacttca 300
caatgcggac aggccctggg tggatggaag tttgccaata ttaaggatgg ctgttttgct 360
gagtactttc atgttaatga tgctgaatcc aacttagtaa agattccaga ctcagtttct 420
gatgaagctg ctctatatac aacagatatg atgtccaccg gattcatggg ggctgagaat 480
ggaaatattc ctcttggagg gattgtagct gtttttggtc agggaccggt tggtcttatg 540
tcaacagcag gagcacgcct gttgggtgct ggtttggtca tagcagtgga aaacattcct 600
gccagacagg aactcgctaa attctacggt gctgacgtta ttgttgattt taccaaggtt 660
gatgcagtgg aagaaattat gaagctgact gatgggcagg gggtagatgc tgccattgag 720
gcactgggag ctcagatcac atttgagaac tgcattaaag tcaccaagcc tggaggtacc 780
atatccaata ttggctacca tggagaggga gattacataa aaataccgag agctgaatgg 840
ggtgtgggca tgtcggataa gactattcgg actggacttt gccctggagg cagcgaaaga 900
atgtccagac ttctgcggct tattgaaaac ggacgtattg atcccaccaa gctgaccact 960
catcgtttca gctttgatga gattgaaaaa ggatttcaca tgatggccaa taaagaggac 1020
ggagttatca aacccctggt aacattcagc tga 1053
<210> 34
<211> 350
<212> PRT
<213> Desulfonatronovibrio magnus
<400> 34
Met Lys Ala Phe Val Val His Ser Ile Gly Lys Val Gly Ile Met Glu
1 5 10 15
Lys Pro Val Pro Glu Pro Gly Pro Asn Asp Val Ile Val Lys Thr Thr
20 25 30
Asn Ala Leu Ile Cys Thr Ser Asp Val His Thr Val Ala Gly Ala Ile
35 40 45
Gly Glu Lys Ser Asp Leu Thr Leu Gly His Glu Gly Ala Gly Thr Val
50 55 60
Tyr Lys Ile Gly Ser Ala Val Lys Gly Phe Lys Glu Gly Glu Arg Val
65 70 75 80
Leu Val Asn Gly Thr Thr Pro Cys Phe Lys Cys His Asn Cys Gln Arg
85 90 95
Gly Tyr Thr Ser Gln Cys Gly Gln Ala Leu Gly Gly Trp Thr Phe Ala
100 105 110
Asn Ile Lys Asp Gly Cys Phe Ala Glu Tyr Phe His Val Asn Asp Ala
115 120 125
Glu Ser Asn Leu Val Lys Ile Pro Asp Ser Val Ser Asp Glu Ala Ala
130 135 140
Leu Tyr Thr Thr Asp Met Met Ser Thr Gly Phe Met Gly Ala Glu Asn
145 150 155 160
Gly Asn Ile Pro Leu Gly Gly Ile Val Ala Val Phe Gly Gln Gly Pro
165 170 175
Val Gly Leu Met Ser Thr Ala Gly Ala Arg Leu Leu Gly Ala Gly Leu
180 185 190
Val Ile Ala Val Glu Asp Ile Pro Ala Arg Gln Glu Leu Ala Lys Phe
195 200 205
Tyr Gly Ala Asp Val Ile Val Asp Phe Thr Lys Val Asp Ala Val Glu
210 215 220
Glu Ile Met Lys Leu Thr Asp Gly Gln Gly Val Asp Ala Ala Ile Glu
225 230 235 240
Ala Leu Gly Ala Gln Ile Thr Phe Glu Asn Cys Ile Lys Val Thr Lys
245 250 255
Pro Gly Gly Thr Ile Ser Asn Ile Gly Tyr His Gly Glu Gly Asp Tyr
260 265 270
Ile Lys Ile Pro Arg Ala Glu Trp Gly Val Gly Met Ser Asp Lys Thr
275 280 285
Ile Arg Thr Gly Leu Cys Pro Gly Gly Ser Glu Arg Met Ser Arg Leu
290 295 300
Leu Arg Leu Ile Glu Asn Gly Arg Ile Asp Pro Thr Lys Leu Thr Thr
305 310 315 320
His Arg Phe Ser Phe Asp Glu Ile Glu Lys Gly Phe His Met Met Ala
325 330 335
Asn Lys Glu Asp Gly Val Ile Lys Pro Leu Val Thr Phe Ser
340 345 350
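The listing above alternates residue lines with position rulers, and DNA lines end with a running base count. The following is a minimal sketch, not part of the patent, of how such lines could be converted to one-letter protein strings for comparison. The function names are hypothetical, and the standard genetic code and reading frame 1 are assumed.

```python
from itertools import product

# Three-letter to one-letter codes for the 20 standard amino acids.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

# Standard genetic code (assumption), codons enumerated in TCAG order.
_BASES = "TCAG"
_AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
          "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def protein_lines_to_one_letter(lines):
    """Convert PRT listing lines to a one-letter string, skipping ruler numbers."""
    residues = []
    for line in lines:
        for token in line.split():
            if token in THREE_TO_ONE:
                residues.append(THREE_TO_ONE[token])
            elif not token.isdigit():
                raise ValueError(f"unexpected token: {token!r}")
    return "".join(residues)


def dna_lines_to_protein(lines):
    """Translate DNA listing lines in frame 1, dropping the trailing base counts."""
    dna = "".join(t for line in lines for t in line.split() if not t.isdigit()).upper()
    usable = len(dna) - len(dna) % 3  # ignore an incomplete trailing codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First DNA line of SEQ ID No. 33 and first residue/ruler pair of SEQ ID No. 34.
FIRST_DNA_LINE = ["atgaaagcgt ttgttgttca cagtatcggt aaagttggaa taatggagaa accagttcct 60"]
FIRST_PRT_LINES = [
    "Met Lys Ala Phe Val Val His Ser Ile Gly Lys Val Gly Ile Met Glu",
    "1 5 10 15",
]

translated = dna_lines_to_protein(FIRST_DNA_LINE)          # "MKAFVVHSIGKVGIMEKPVP"
listed = protein_lines_to_one_letter(FIRST_PRT_LINES)      # "MKAFVVHSIGKVGIME"
```

For these first lines the translated DNA begins with the same 16 residues as the protein listing. The sketch makes no claim about agreement over the rest of the two sequences.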
Claims (6)
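1. A method for synthesizing R-1,3-butanediol, characterized by comprising the following steps:
using 4-hydroxy-2-butanone as the substrate, carrying out a catalytic reaction with a mutant of an alcohol dehydrogenase, the reaction product being R-1,3-butanediol;
the alcohol dehydrogenase is derived from: Desulfallas arcticus, Desulfonatronovibrio magnus;
the amino acid sequence of the alcohol dehydrogenase derived from Desulfallas arcticus is as shown in SEQ ID No. 2;
the amino acid sequence of the alcohol dehydrogenase derived from Desulfonatronovibrio magnus is as shown in SEQ ID No. 4;
the mutant of the alcohol dehydrogenase is: (1) the amino acid sequence shown in SEQ ID No. 2 with the following point mutations: R97V and Y147N;
(2) the amino acid sequence shown in SEQ ID No. 4 with the following point mutations: A84G, I85T, K110T and N198D;
a tag is attached to the N-terminus and/or the C-terminus of the alcohol dehydrogenase.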
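2. The synthesis method according to claim 1, characterized in that the mutant of the alcohol dehydrogenase is added in the form of a crude enzyme solution, a lyophilized powder of the crude enzyme solution, a pure enzyme, or whole cells.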
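3. The synthesis method according to claim 1, characterized in that the temperature of the catalytic reaction is 20-35 °C and the duration of the catalytic reaction is 24-48 h.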
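4. The synthesis method according to claim 1 or 3, characterized in that, when the mutant of the alcohol dehydrogenase is added in the form of a crude enzyme solution, a lyophilized powder of the crude enzyme solution, or a pure enzyme, the concentration of the mutant of the alcohol dehydrogenase in the reaction system is 0.1 g/L-10 g/L;
when the mutant of the alcohol dehydrogenase is added in the form of whole cells, the wet weight of the whole cells is 100 g/L.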
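5. The synthesis method according to claim 4, characterized in that the catalytic reaction is carried out in a phosphate buffer with a concentration of 50-100 mM and a pH of 6.5-8.5.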
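6. The synthesis method according to claim 1 or 3, characterized in that, when the mutant of the alcohol dehydrogenase is added in the form of a crude enzyme solution, a lyophilized powder of the crude enzyme solution, or a pure enzyme, the reaction system of the catalytic reaction contains isopropanol in addition to 4-hydroxy-2-butanone, the mutant of the alcohol dehydrogenase, and the coenzyme NAD+.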
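The point mutations in claim 1 use the usual substitution notation: wild-type residue, 1-based position, replacement residue (for example A84G). The following is a minimal sketch, not taken from the patent, of how such a list could be applied to a one-letter sequence while checking the wild-type residue at each position. The function name and the toy sequence are hypothetical.

```python
import re

# Wild-type residue, 1-based position, replacement residue, e.g. "A84G".
_MUTATION = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def apply_point_mutations(sequence, mutations):
    """Return a copy of `sequence` with substitutions such as "A84G" applied.

    Raises ValueError if a mutation cannot be parsed, if its position lies
    outside the sequence, or if the residue found there is not the stated
    wild-type residue.
    """
    residues = list(sequence)
    for mutation in mutations:
        match = _MUTATION.match(mutation)
        if match is None:
            raise ValueError(f"cannot parse mutation: {mutation!r}")
        wild_type, replacement = match.group(1), match.group(3)
        position = int(match.group(2))
        if not 1 <= position <= len(residues):
            raise ValueError(f"{mutation}: position outside sequence")
        found = residues[position - 1]
        if found != wild_type:
            raise ValueError(f"{mutation}: expected {wild_type}, found {found}")
        residues[position - 1] = replacement
    return "".join(residues)


# Toy sequence for illustration only, NOT one of the patent's SEQ ID entries.
toy = "MKAFVVHSIG"
mutant = apply_point_mutations(toy, ["K2T", "F4D"])  # "MTADVVHSIG"
```

Checking the wild-type residue before substituting catches numbering mistakes, such as applying a mutation list to the wrong reference sequence or to a sequence that still carries an N-terminal tag.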
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111470133.1A CN114150024B (zh) | 2021-12-03 | 2021-12-03 | Bifunctional enzyme biocatalyst and preparation method and application thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114150024A (zh) | 2022-03-08 |
CN114150024B (zh) | 2023-06-20 |
Family
ID=80452816
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111470133.1A Active CN114150024B (zh) | 2021-12-03 | 2021-12-03 | Bifunctional enzyme biocatalyst and preparation method and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114150024B (zh) |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1564298A1 (de) * | 2004-02-13 | 2005-08-17 | BioSpring GmbH | Enzymatic asymmetric decarboxylation of disubstituted malonic acids |
EP2065470A1 (en) * | 2007-11-28 | 2009-06-03 | Basf Se | New malonate decarboxylases for industrial applications |
CN102071174B (zh) * | 2010-11-24 | 2012-12-05 | Tianjin Institute of Industrial Biotechnology | (2R,3R)-2,3-butanediol dehydrogenase, its encoding gene and application |
AU2012273177A1 (en) * | 2011-06-22 | 2013-05-02 | Genomatica, Inc. | Microorganisms for producing 1,3-butanediol and methods related thereto |
US8778652B2 (en) * | 2011-06-30 | 2014-07-15 | Codexis, Inc. | Pentose fermentation by a recombinant microorganism |
EP3470512A1 (en) * | 2017-10-10 | 2019-04-17 | Metabolic Explorer | Mutant phosphoserine aminotransferase for the conversion of homoserine into 4-hydroxy-2-ketobutyrate |
CN109609473A (zh) * | 2019-01-29 | 2019-04-12 | Institute of Biology, Hebei Academy of Sciences | Carbonyl reductase DmCR, its encoding gene, recombinant expression vector, recombinant expression cell and application thereof |
CN112394050A (zh) * | 2019-08-19 | 2021-02-23 | Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences | Detection method for high-throughput screening of ketone compounds and its application in enzyme screening |
CN113249349B (zh) * | 2021-06-22 | 2022-01-25 | Suzhou Institute of Biomedical Engineering and Technology, Chinese Academy of Sciences | Mutant alcohol dehydrogenase, recombinant vector, and preparation method and application thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||