CN113755539B - 一种二氢嘧啶氨基水解酶及其应用 - Google Patents
一种二氢嘧啶氨基水解酶及其应用 Download PDFInfo
- Publication number
- CN113755539B CN113755539B CN202111224455.8A CN202111224455A CN113755539B CN 113755539 B CN113755539 B CN 113755539B CN 202111224455 A CN202111224455 A CN 202111224455A CN 113755539 B CN113755539 B CN 113755539B
- Authority
- CN
- China
- Prior art keywords
- ala
- gly
- dihydropyrimidine
- seq
- val
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 108091022884 dihydropyrimidinase Proteins 0.000 title claims abstract description 58
- HYHLWVJLJXARGY-UHFFFAOYSA-N 3-(aminomethyl)benzamide Chemical compound NCC1=CC=CC(C(N)=O)=C1 HYHLWVJLJXARGY-UHFFFAOYSA-N 0.000 claims abstract description 27
- 238000000034 method Methods 0.000 claims abstract description 19
- 239000000758 substrate Substances 0.000 claims abstract description 14
- FNAQPQLVCOZGRH-UHFFFAOYSA-N 4-(2-methylpropyl)piperidine-2,6-dione Chemical compound CC(C)CC1CC(=O)NC(=O)C1 FNAQPQLVCOZGRH-UHFFFAOYSA-N 0.000 claims abstract description 13
- 238000007142 ring opening reaction Methods 0.000 claims abstract description 5
- 238000006243 chemical reaction Methods 0.000 claims description 23
- 102000004190 Enzymes Human genes 0.000 claims description 21
- 108090000790 Enzymes Proteins 0.000 claims description 21
- 108090000623 proteins and genes Proteins 0.000 claims description 20
- 241000588724 Escherichia coli Species 0.000 claims description 14
- 244000005700 microbiome Species 0.000 claims description 11
- 108010093096 Immobilized Enzymes Proteins 0.000 claims description 8
- 239000002773 nucleotide Substances 0.000 claims description 4
- 125000003729 nucleotide group Chemical group 0.000 claims description 4
- 238000002360 preparation method Methods 0.000 claims description 4
- 239000003456 ion exchange resin Substances 0.000 claims description 3
- 244000063299 Bacillus subtilis Species 0.000 claims description 2
- 235000014469 Bacillus subtilis Nutrition 0.000 claims description 2
- 239000004593 Epoxy Substances 0.000 claims description 2
- 241000235058 Komagataella pastoris Species 0.000 claims description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims description 2
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 claims description 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims 1
- 239000000047 product Substances 0.000 description 11
- AYXYPKUFHZROOJ-ZETCQYMHSA-N pregabalin Chemical compound CC(C)C[C@H](CN)CC(O)=O AYXYPKUFHZROOJ-ZETCQYMHSA-N 0.000 description 10
- 150000001413 amino acids Chemical group 0.000 description 9
- 229960001233 pregabalin Drugs 0.000 description 9
- 108020004705 Codon Proteins 0.000 description 8
- 239000000543 intermediate Substances 0.000 description 8
- 239000013612 plasmid Substances 0.000 description 8
- 230000015572 biosynthetic process Effects 0.000 description 7
- 238000004128 high performance liquid chromatography Methods 0.000 description 7
- 238000003786 synthesis reaction Methods 0.000 description 7
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 6
- 241000589540 Pseudomonas fluorescens Species 0.000 description 6
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 6
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 6
- 239000000872 buffer Substances 0.000 description 6
- 238000001514 detection method Methods 0.000 description 6
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 6
- 239000000243 solution Substances 0.000 description 6
- 238000006555 catalytic reaction Methods 0.000 description 5
- 238000002474 experimental method Methods 0.000 description 5
- 238000000855 fermentation Methods 0.000 description 5
- 230000004151 fermentation Effects 0.000 description 5
- 239000002609 medium Substances 0.000 description 5
- 239000006228 supernatant Substances 0.000 description 5
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 5
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 4
- NBIIXXVUZAFLBC-UHFFFAOYSA-N Phosphoric acid Chemical compound OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 4
- 230000001580 bacterial effect Effects 0.000 description 4
- 239000012295 chemical reaction liquid Substances 0.000 description 4
- 230000002255 enzymatic effect Effects 0.000 description 4
- 238000006460 hydrolysis reaction Methods 0.000 description 4
- 238000004519 manufacturing process Methods 0.000 description 4
- 108020004414 DNA Proteins 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- AFPFGFUGETYOSY-HGNGGELXSA-N His-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AFPFGFUGETYOSY-HGNGGELXSA-N 0.000 description 3
- 238000010276 construction Methods 0.000 description 3
- -1 cyclic imides Chemical class 0.000 description 3
- 239000001963 growth medium Substances 0.000 description 3
- 230000000813 microbial effect Effects 0.000 description 3
- 238000005457 optimization Methods 0.000 description 3
- 239000008057 potassium phosphate buffer Substances 0.000 description 3
- 239000011780 sodium chloride Substances 0.000 description 3
- 108010061238 threonyl-glycine Proteins 0.000 description 3
- 229920001817 Agar Polymers 0.000 description 2
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 2
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 2
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 2
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 2
- XVWDJUROVRQKAE-KKUMJFAQSA-N Ser-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=CC=C1 XVWDJUROVRQKAE-KKUMJFAQSA-N 0.000 description 2
- LXXCHJKHJYRMIY-FQPOAREZSA-N Thr-Tyr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O LXXCHJKHJYRMIY-FQPOAREZSA-N 0.000 description 2
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 2
- 239000008272 agar Substances 0.000 description 2
- 229910000147 aluminium phosphate Inorganic materials 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 229940041514 candida albicans extract Drugs 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 238000001816 cooling Methods 0.000 description 2
- 238000012258 culturing Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 238000001914 filtration Methods 0.000 description 2
- 229960003692 gamma aminobutyric acid Drugs 0.000 description 2
- BTCSSZJGUNDROE-UHFFFAOYSA-N gamma-aminobutyric acid Chemical compound NCCCC(O)=O BTCSSZJGUNDROE-UHFFFAOYSA-N 0.000 description 2
- 238000010353 genetic engineering Methods 0.000 description 2
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 2
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 2
- 108010050848 glycylleucine Proteins 0.000 description 2
- 238000007689 inspection Methods 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 239000008363 phosphate buffer Substances 0.000 description 2
- 239000013600 plasmid vector Substances 0.000 description 2
- 239000000843 powder Substances 0.000 description 2
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 2
- 102000004169 proteins and genes Human genes 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 238000003756 stirring Methods 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 239000012137 tryptone Substances 0.000 description 2
- 239000012138 yeast extract Substances 0.000 description 2
- OGNSCSPNOLGXSM-UHFFFAOYSA-N (+/-)-DABA Natural products NCCC(N)C(O)=O OGNSCSPNOLGXSM-UHFFFAOYSA-N 0.000 description 1
- NWUYHJFMYQTDRP-UHFFFAOYSA-N 1,2-bis(ethenyl)benzene;1-ethenyl-2-ethylbenzene;styrene Chemical compound C=CC1=CC=CC=C1.CCC1=CC=CC=C1C=C.C=CC1=CC=CC=C1C=C NWUYHJFMYQTDRP-UHFFFAOYSA-N 0.000 description 1
- HBAQYPYDRFILMT-UHFFFAOYSA-N 8-[3-(1-cyclopropylpyrazol-4-yl)-1H-pyrazolo[4,3-d]pyrimidin-5-yl]-3-methyl-3,8-diazabicyclo[3.2.1]octan-2-one Chemical class C1(CC1)N1N=CC(=C1)C1=NNC2=C1N=C(N=C2)N1C2C(N(CC1CC2)C)=O HBAQYPYDRFILMT-UHFFFAOYSA-N 0.000 description 1
- 241000589220 Acetobacter Species 0.000 description 1
- 241000589212 Acetobacter pasteurianus Species 0.000 description 1
- 241000928015 Acidovorax citrulli AAC00-1 Species 0.000 description 1
- YHOPXCAOTRUGLV-XAMCCFCMSA-N Ala-Ala-Asp-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YHOPXCAOTRUGLV-XAMCCFCMSA-N 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- KQFRUSHJPKXBMB-BHDSKKPTSA-N Ala-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 KQFRUSHJPKXBMB-BHDSKKPTSA-N 0.000 description 1
- FSBCNCKIQZZASN-GUBZILKMSA-N Ala-Arg-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O FSBCNCKIQZZASN-GUBZILKMSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 1
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 1
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 1
- VBRDBGCROKWTPV-XHNCKOQMSA-N Ala-Glu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N VBRDBGCROKWTPV-XHNCKOQMSA-N 0.000 description 1
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 1
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 1
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- GSHKMNKPMLXSQW-KBIXCLLPSA-N Ala-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C)N GSHKMNKPMLXSQW-KBIXCLLPSA-N 0.000 description 1
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- UWIQWPWWZUHBAO-ZLIFDBKOSA-N Ala-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)CC(C)C)C(O)=O)=CNC2=C1 UWIQWPWWZUHBAO-ZLIFDBKOSA-N 0.000 description 1
- KQESEZXHYOUIIM-CQDKDKBSSA-N Ala-Lys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KQESEZXHYOUIIM-CQDKDKBSSA-N 0.000 description 1
- CYBJZLQSUJEMAS-LFSVMHDDSA-N Ala-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C)N)O CYBJZLQSUJEMAS-LFSVMHDDSA-N 0.000 description 1
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 1
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 1
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 1
- QKHWNPQNOHEFST-VZFHVOOUSA-N Ala-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N)O QKHWNPQNOHEFST-VZFHVOOUSA-N 0.000 description 1
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 1
- XAXMJQUMRJAFCH-CQDKDKBSSA-N Ala-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 XAXMJQUMRJAFCH-CQDKDKBSSA-N 0.000 description 1
- 241001614497 Aminomonas paucivorans DSM 12260 Species 0.000 description 1
- 241001069487 Anaerococcus prevotii DSM 20548 Species 0.000 description 1
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 1
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 1
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 1
- QKSAZKCRVQYYGS-UWVGGRQHSA-N Arg-Gly-His Chemical compound N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O QKSAZKCRVQYYGS-UWVGGRQHSA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- GNYUVVJYGJFKHN-RVMXOQNASA-N Arg-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GNYUVVJYGJFKHN-RVMXOQNASA-N 0.000 description 1
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 1
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 1
- NGTYEHIRESTSRX-UWVGGRQHSA-N Arg-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NGTYEHIRESTSRX-UWVGGRQHSA-N 0.000 description 1
- FIQKRDXFTANIEJ-ULQDDVLXSA-N Arg-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FIQKRDXFTANIEJ-ULQDDVLXSA-N 0.000 description 1
- IGFJVXOATGZTHD-UHFFFAOYSA-N Arg-Phe-His Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccccc1)C(=O)NC(Cc2c[nH]cn2)C(=O)O IGFJVXOATGZTHD-UHFFFAOYSA-N 0.000 description 1
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 1
- FVBZXNSRIDVYJS-AVGNSLFASA-N Arg-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N FVBZXNSRIDVYJS-AVGNSLFASA-N 0.000 description 1
- ATABBWFGOHKROJ-GUBZILKMSA-N Arg-Pro-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O ATABBWFGOHKROJ-GUBZILKMSA-N 0.000 description 1
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 1
- SYFHFLGAROUHNT-VEVYYDQMSA-N Arg-Thr-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SYFHFLGAROUHNT-VEVYYDQMSA-N 0.000 description 1
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 1
- XXAOXVBAWLMTDR-ZLUOBGJFSA-N Asn-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N XXAOXVBAWLMTDR-ZLUOBGJFSA-N 0.000 description 1
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 1
- OWUCNXMFJRFOFI-BQBZGAKWSA-N Asn-Gly-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O OWUCNXMFJRFOFI-BQBZGAKWSA-N 0.000 description 1
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 1
- XMHFCUKJRCQXGI-CIUDSAMLSA-N Asn-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O XMHFCUKJRCQXGI-CIUDSAMLSA-N 0.000 description 1
- JZLFYAAGGYMRIK-BYULHYEWSA-N Asn-Val-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O JZLFYAAGGYMRIK-BYULHYEWSA-N 0.000 description 1
- ZVTDYGWRRPMFCL-WFBYXXMGSA-N Asp-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N ZVTDYGWRRPMFCL-WFBYXXMGSA-N 0.000 description 1
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 1
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 1
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 1
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 1
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 1
- CMCIMCAQIULNDJ-CIUDSAMLSA-N Asp-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N CMCIMCAQIULNDJ-CIUDSAMLSA-N 0.000 description 1
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 1
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 1
- XLILXFRAKOYEJX-GUBZILKMSA-N Asp-Leu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLILXFRAKOYEJX-GUBZILKMSA-N 0.000 description 1
- GWIJZUVQVDJHDI-AVGNSLFASA-N Asp-Phe-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GWIJZUVQVDJHDI-AVGNSLFASA-N 0.000 description 1
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 1
- NONWUQAWAANERO-BZSNNMDCSA-N Asp-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 NONWUQAWAANERO-BZSNNMDCSA-N 0.000 description 1
- HJZLUGQGJWXJCJ-CIUDSAMLSA-N Asp-Pro-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJZLUGQGJWXJCJ-CIUDSAMLSA-N 0.000 description 1
- LLRJPYJQNBMOOO-QEJZJMRPSA-N Asp-Trp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N LLRJPYJQNBMOOO-QEJZJMRPSA-N 0.000 description 1
- AWPWHMVCSISSQK-QWRGUYRKSA-N Asp-Tyr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O AWPWHMVCSISSQK-QWRGUYRKSA-N 0.000 description 1
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 1
- 241000933166 Burkholderia ambifaria MC40-6 Species 0.000 description 1
- 241000132899 Burkholderia vietnamiensis G4 Species 0.000 description 1
- 241000237645 Chitinophaga pinensis DSM 2588 Species 0.000 description 1
- 241000947687 Comamonas testosteroni KF-1 Species 0.000 description 1
- 206010010904 Convulsion Diseases 0.000 description 1
- GRNOCLDFUNCIDW-ACZMJKKPSA-N Cys-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N GRNOCLDFUNCIDW-ACZMJKKPSA-N 0.000 description 1
- 241000933179 Delftia acidovorans SPH-1 Species 0.000 description 1
- 102100036238 Dihydropyrimidinase Human genes 0.000 description 1
- 241001452028 Escherichia coli DH1 Species 0.000 description 1
- 241001206367 Frankia inefficax Species 0.000 description 1
- RGXXLQWXBFNXTG-CIUDSAMLSA-N Gln-Arg-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O RGXXLQWXBFNXTG-CIUDSAMLSA-N 0.000 description 1
- CYTSBCIIEHUPDU-ACZMJKKPSA-N Gln-Asp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O CYTSBCIIEHUPDU-ACZMJKKPSA-N 0.000 description 1
- JKPGHIQCHIIRMS-AVGNSLFASA-N Gln-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N JKPGHIQCHIIRMS-AVGNSLFASA-N 0.000 description 1
- DRDSQGHKTLSNEA-GLLZPBPUSA-N Gln-Glu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DRDSQGHKTLSNEA-GLLZPBPUSA-N 0.000 description 1
- NNXIQPMZGZUFJJ-AVGNSLFASA-N Gln-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N NNXIQPMZGZUFJJ-AVGNSLFASA-N 0.000 description 1
- SBHVGKBYOQKAEA-SDDRHHMPSA-N Gln-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SBHVGKBYOQKAEA-SDDRHHMPSA-N 0.000 description 1
- IHSGESFHTMFHRB-GUBZILKMSA-N Gln-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O IHSGESFHTMFHRB-GUBZILKMSA-N 0.000 description 1
- HPCOBEHVEHWREJ-DCAQKATOSA-N Gln-Lys-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HPCOBEHVEHWREJ-DCAQKATOSA-N 0.000 description 1
- DBNLXHGDGBUCDV-KKUMJFAQSA-N Gln-Phe-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DBNLXHGDGBUCDV-KKUMJFAQSA-N 0.000 description 1
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 1
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 1
- OACPJRQRAHMQEQ-NHCYSSNCSA-N Gln-Val-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OACPJRQRAHMQEQ-NHCYSSNCSA-N 0.000 description 1
- BBFCMGBMYIAGRS-AUTRQRHGSA-N Gln-Val-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BBFCMGBMYIAGRS-AUTRQRHGSA-N 0.000 description 1
- SDSMVVSHLAAOJL-UKJIMTQDSA-N Gln-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N SDSMVVSHLAAOJL-UKJIMTQDSA-N 0.000 description 1
- 241000453422 Gloeocapsa sp. PCC 7428 Species 0.000 description 1
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 1
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 1
- IYAUFWMUCGBFMQ-CIUDSAMLSA-N Glu-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N IYAUFWMUCGBFMQ-CIUDSAMLSA-N 0.000 description 1
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 1
- FLLRAEJOLZPSMN-CIUDSAMLSA-N Glu-Asn-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLLRAEJOLZPSMN-CIUDSAMLSA-N 0.000 description 1
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 1
- QPRZKNOOOBWXSU-CIUDSAMLSA-N Glu-Asp-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N QPRZKNOOOBWXSU-CIUDSAMLSA-N 0.000 description 1
- KASDBWKLWJKTLJ-GUBZILKMSA-N Glu-Glu-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O KASDBWKLWJKTLJ-GUBZILKMSA-N 0.000 description 1
- XOIATPHFYVWFEU-DCAQKATOSA-N Glu-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XOIATPHFYVWFEU-DCAQKATOSA-N 0.000 description 1
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 1
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 1
- 241000423296 Gluconacetobacter diazotrophicus PA1 5 Species 0.000 description 1
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 1
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 1
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 1
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 1
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 1
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 1
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 1
- BPQYBFAXRGMGGY-LAEOZQHASA-N Gly-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN BPQYBFAXRGMGGY-LAEOZQHASA-N 0.000 description 1
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 1
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 1
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 1
- HHSOPSCKAZKQHQ-PEXQALLHSA-N Gly-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN HHSOPSCKAZKQHQ-PEXQALLHSA-N 0.000 description 1
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 1
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 1
- YXTFLTJYLIAZQG-FJXKBIBVSA-N Gly-Thr-Arg Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YXTFLTJYLIAZQG-FJXKBIBVSA-N 0.000 description 1
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 1
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 1
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 1
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 1
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 1
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- 241001258120 Granulicella tundricola MP5ACTX9 Species 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- 241000107014 Halomonas sp. KO116 Species 0.000 description 1
- VCDNHBNNPCDBKV-DLOVCJGASA-N His-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VCDNHBNNPCDBKV-DLOVCJGASA-N 0.000 description 1
- RXVOMIADLXPJGW-GUBZILKMSA-N His-Asp-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RXVOMIADLXPJGW-GUBZILKMSA-N 0.000 description 1
- UVUIXIVPKVMONA-CIUDSAMLSA-N His-Cys-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC1=CN=CN1 UVUIXIVPKVMONA-CIUDSAMLSA-N 0.000 description 1
- UPGJWSUYENXOPV-HGNGGELXSA-N His-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N UPGJWSUYENXOPV-HGNGGELXSA-N 0.000 description 1
- ZNNNYCXPCKACHX-DCAQKATOSA-N His-Gln-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZNNNYCXPCKACHX-DCAQKATOSA-N 0.000 description 1
- NKRWVZQTPXPNRZ-SRVKXCTJSA-N His-Met-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC1=CN=CN1 NKRWVZQTPXPNRZ-SRVKXCTJSA-N 0.000 description 1
- AJTBOTWDSRSUDV-ULQDDVLXSA-N His-Phe-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O AJTBOTWDSRSUDV-ULQDDVLXSA-N 0.000 description 1
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 1
- IXQGOKWTQPCIQM-YJRXYDGGSA-N His-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O IXQGOKWTQPCIQM-YJRXYDGGSA-N 0.000 description 1
- NBWATNYAUVSAEQ-ZEILLAHLSA-N His-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O NBWATNYAUVSAEQ-ZEILLAHLSA-N 0.000 description 1
- VXZZUXWAOMWWJH-QTKMDUPCSA-N His-Thr-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VXZZUXWAOMWWJH-QTKMDUPCSA-N 0.000 description 1
- CISBRYJZMFWOHJ-JBDRJPRFSA-N Ile-Ala-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O)N CISBRYJZMFWOHJ-JBDRJPRFSA-N 0.000 description 1
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 1
- ATXGFMOBVKSOMK-PEDHHIEDSA-N Ile-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N ATXGFMOBVKSOMK-PEDHHIEDSA-N 0.000 description 1
- AZEYWPUCOYXFOE-CYDGBPFRSA-N Ile-Arg-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N AZEYWPUCOYXFOE-CYDGBPFRSA-N 0.000 description 1
- AQTWDZDISVGCAC-CFMVVWHZSA-N Ile-Asp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AQTWDZDISVGCAC-CFMVVWHZSA-N 0.000 description 1
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 1
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 1
- AYLAAGNJNVZDPY-CYDGBPFRSA-N Ile-Met-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)O)N AYLAAGNJNVZDPY-CYDGBPFRSA-N 0.000 description 1
- OTSVBELRDMSPKY-PCBIJLKTSA-N Ile-Phe-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OTSVBELRDMSPKY-PCBIJLKTSA-N 0.000 description 1
- SAVXZJYTTQQQDD-QEWYBTABSA-N Ile-Phe-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SAVXZJYTTQQQDD-QEWYBTABSA-N 0.000 description 1
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 1
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 1
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 1
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 1
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 1
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 1
- 241001203194 Ilyobacter polytropus DSM 2926 Species 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- 239000007836 KH2PO4 Substances 0.000 description 1
- 241001468094 Komagataeibacter europaeus Species 0.000 description 1
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 1
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 1
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 1
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 1
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- GOFJOGXGMPHOGL-DCAQKATOSA-N Leu-Ser-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(C)C GOFJOGXGMPHOGL-DCAQKATOSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- LSLUTXRANSUGFY-XIRDDKMYSA-N Leu-Trp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O LSLUTXRANSUGFY-XIRDDKMYSA-N 0.000 description 1
- URJUVJDTPXCQFL-IHPCNDPISA-N Leu-Trp-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N URJUVJDTPXCQFL-IHPCNDPISA-N 0.000 description 1
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 description 1
- VQHUBNVKFFLWRP-ULQDDVLXSA-N Leu-Tyr-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 VQHUBNVKFFLWRP-ULQDDVLXSA-N 0.000 description 1
- LMDVGHQPPPLYAR-IHRRRGAJSA-N Leu-Val-His Chemical compound N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O LMDVGHQPPPLYAR-IHRRRGAJSA-N 0.000 description 1
- UETQMSASAVBGJY-QWRGUYRKSA-N Lys-Gly-His Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 UETQMSASAVBGJY-QWRGUYRKSA-N 0.000 description 1
- ORVFEGYUJITPGI-IHRRRGAJSA-N Lys-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN ORVFEGYUJITPGI-IHRRRGAJSA-N 0.000 description 1
- IEVXCWPVBYCJRZ-IXOXFDKPSA-N Lys-Thr-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IEVXCWPVBYCJRZ-IXOXFDKPSA-N 0.000 description 1
- VVURYEVJJTXWNE-ULQDDVLXSA-N Lys-Tyr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O VVURYEVJJTXWNE-ULQDDVLXSA-N 0.000 description 1
- 241001435983 Megasphaera sp. MJR8396C Species 0.000 description 1
- 241000343831 Meiothermus ruber DSM 1279 Species 0.000 description 1
- DTICLBJHRYSJLH-GUBZILKMSA-N Met-Ala-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O DTICLBJHRYSJLH-GUBZILKMSA-N 0.000 description 1
- RAAVFTFEAUAVIY-DCAQKATOSA-N Met-Glu-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N RAAVFTFEAUAVIY-DCAQKATOSA-N 0.000 description 1
- VSJAPSMRFYUOKS-IUCAKERBSA-N Met-Pro-Gly Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O VSJAPSMRFYUOKS-IUCAKERBSA-N 0.000 description 1
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 1
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 1
- 241001098757 Methylobacterium nodulans ORS 2060 Species 0.000 description 1
- 241001363857 Mitsuokella sp. oral taxon 131 str. W9106 Species 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 241001365234 Niastella koreensis GR20-10 Species 0.000 description 1
- 241001503642 Nocardia seriolae Species 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 241000179039 Paenibacillus Species 0.000 description 1
- 241000451371 Pandoraea thiooxydans Species 0.000 description 1
- 241000281792 Pantoea sp. At-9b Species 0.000 description 1
- 241001206540 Peptoniphilus asaccharolyticus DSM 20463 Species 0.000 description 1
- BRDYYVQTEJVRQT-HRCADAONSA-N Phe-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BRDYYVQTEJVRQT-HRCADAONSA-N 0.000 description 1
- BFYHIHGIHGROAT-HTUGSXCWSA-N Phe-Glu-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFYHIHGIHGROAT-HTUGSXCWSA-N 0.000 description 1
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 1
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 1
- PPHFTNABKQRAJV-JYJNAYRXSA-N Phe-His-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PPHFTNABKQRAJV-JYJNAYRXSA-N 0.000 description 1
- PBXYXOAEQQUVMM-ULQDDVLXSA-N Phe-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N PBXYXOAEQQUVMM-ULQDDVLXSA-N 0.000 description 1
- SPXWRYVHOZVYBU-ULQDDVLXSA-N Phe-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N SPXWRYVHOZVYBU-ULQDDVLXSA-N 0.000 description 1
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 1
- BQMFWUKNOCJDNV-HJWJTTGWSA-N Phe-Val-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BQMFWUKNOCJDNV-HJWJTTGWSA-N 0.000 description 1
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 1
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 1
- BODDREDDDRZUCF-QTKMDUPCSA-N Pro-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@@H]2CCCN2)O BODDREDDDRZUCF-QTKMDUPCSA-N 0.000 description 1
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 1
- CPRLKHJUFAXVTD-ULQDDVLXSA-N Pro-Leu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CPRLKHJUFAXVTD-ULQDDVLXSA-N 0.000 description 1
- DWGFLKQSGRUQTI-IHRRRGAJSA-N Pro-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 DWGFLKQSGRUQTI-IHRRRGAJSA-N 0.000 description 1
- DYMPSOABVJIFBS-IHRRRGAJSA-N Pro-Phe-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CS)C(=O)O DYMPSOABVJIFBS-IHRRRGAJSA-N 0.000 description 1
- AJBQTGZIZQXBLT-STQMWFEESA-N Pro-Phe-Gly Chemical compound C([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 AJBQTGZIZQXBLT-STQMWFEESA-N 0.000 description 1
- DSGSTPRKNYHGCL-JYJNAYRXSA-N Pro-Phe-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DSGSTPRKNYHGCL-JYJNAYRXSA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 1
- 241001209206 Pseudomonas fluorescens Pf0-1 Species 0.000 description 1
- 241001291885 Pseudomonas putida GB-1 Species 0.000 description 1
- 241000278003 Pseudomonas sp. GM41 Species 0.000 description 1
- 241000766350 Rhizobium favelukesii Species 0.000 description 1
- 241001503020 Rubrobacter radiotolerans DSM 5868 Species 0.000 description 1
- 241001458488 Segniliparus rotundus DSM 44985 Species 0.000 description 1
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 1
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 1
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 1
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 1
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 1
- DINQYZRMXGWWTG-GUBZILKMSA-N Ser-Pro-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DINQYZRMXGWWTG-GUBZILKMSA-N 0.000 description 1
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 1
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 1
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 1
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 description 1
- 241001475376 Sinorhizobium medicae WSM419 Species 0.000 description 1
- 241001186851 Starkeya novella DSM 506 Species 0.000 description 1
- 241001446311 Streptomyces coelicolor A3(2) Species 0.000 description 1
- 241001013926 Sulfobacillus acidophilus DSM 10332 Species 0.000 description 1
- 241001176802 Thalassospira sp. KO164 Species 0.000 description 1
- 241000065704 Thermanaeromonas toyohensis ToBE Species 0.000 description 1
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 1
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 1
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 1
- JXKMXEBNZCKSDY-JIOCBJNQSA-N Thr-Asp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O JXKMXEBNZCKSDY-JIOCBJNQSA-N 0.000 description 1
- KWQBJOUOSNJDRR-XAVMHZPKSA-N Thr-Cys-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N)O KWQBJOUOSNJDRR-XAVMHZPKSA-N 0.000 description 1
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 1
- XSTGOZBBXFKGHA-YJRXYDGGSA-N Thr-His-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O XSTGOZBBXFKGHA-YJRXYDGGSA-N 0.000 description 1
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 1
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 1
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 1
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- SCQBNMKLZVCXNX-ZFWWWQNUSA-N Trp-Arg-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N SCQBNMKLZVCXNX-ZFWWWQNUSA-N 0.000 description 1
- YTCNLMSUXPCFBW-SXNHZJKMSA-N Trp-Ile-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O YTCNLMSUXPCFBW-SXNHZJKMSA-N 0.000 description 1
- UIRPULWLRODAEQ-QEJZJMRPSA-N Trp-Ser-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 UIRPULWLRODAEQ-QEJZJMRPSA-N 0.000 description 1
- NRFTYDWKWGJLAR-MELADBBJSA-N Tyr-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O NRFTYDWKWGJLAR-MELADBBJSA-N 0.000 description 1
- HVHJYXDXRIWELT-RYUDHWBXSA-N Tyr-Glu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O HVHJYXDXRIWELT-RYUDHWBXSA-N 0.000 description 1
- ILTXFANLDMJWPR-SIUGBPQLSA-N Tyr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N ILTXFANLDMJWPR-SIUGBPQLSA-N 0.000 description 1
- WDGDKHLSDIOXQC-ACRUOGEOSA-N Tyr-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WDGDKHLSDIOXQC-ACRUOGEOSA-N 0.000 description 1
- CDBXVDXSLPLFMD-BPNCWPANSA-N Tyr-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDBXVDXSLPLFMD-BPNCWPANSA-N 0.000 description 1
- SZEIFUXUTBBQFQ-STQMWFEESA-N Tyr-Pro-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SZEIFUXUTBBQFQ-STQMWFEESA-N 0.000 description 1
- VYQQQIRHIFALGE-UWJYBYFXSA-N Tyr-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VYQQQIRHIFALGE-UWJYBYFXSA-N 0.000 description 1
- HIZMLPKDJAXDRG-FXQIFTODSA-N Val-Cys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N HIZMLPKDJAXDRG-FXQIFTODSA-N 0.000 description 1
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 1
- XBRMBDFYOFARST-AVGNSLFASA-N Val-His-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N XBRMBDFYOFARST-AVGNSLFASA-N 0.000 description 1
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 1
- RFKJNTRMXGCKFE-FHWLQOOXSA-N Val-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC(C)C)C(O)=O)=CNC2=C1 RFKJNTRMXGCKFE-FHWLQOOXSA-N 0.000 description 1
- GVJUTBOZZBTBIG-AVGNSLFASA-N Val-Lys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N GVJUTBOZZBTBIG-AVGNSLFASA-N 0.000 description 1
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 description 1
- AIWLHFZYOUUJGB-UFYCRDLUSA-N Val-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 AIWLHFZYOUUJGB-UFYCRDLUSA-N 0.000 description 1
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 1
- KRAHMIJVUPUOTQ-DCAQKATOSA-N Val-Ser-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KRAHMIJVUPUOTQ-DCAQKATOSA-N 0.000 description 1
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 1
- JPBGMZDTPVGGMQ-ULQDDVLXSA-N Val-Tyr-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N JPBGMZDTPVGGMQ-ULQDDVLXSA-N 0.000 description 1
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 1
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 1
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 1
- 241000287986 Veillonella sp. DNF00869 Species 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 238000009098 adjuvant therapy Methods 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 1
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 108010005233 alanylglutamic acid Proteins 0.000 description 1
- 108010047495 alanylglycine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 1
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- 238000011914 asymmetric synthesis Methods 0.000 description 1
- 239000011942 biocatalyst Substances 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 238000004140 cleaning Methods 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- 239000012149 elution buffer Substances 0.000 description 1
- 239000011982 enantioselective catalyst Substances 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 238000001976 enzyme digestion Methods 0.000 description 1
- 239000000706 filtrate Substances 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 108010078144 glutaminyl-glycine Proteins 0.000 description 1
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- 108010033719 glycyl-histidyl-glycine Proteins 0.000 description 1
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 1
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 150000001469 hydantoins Chemical class 0.000 description 1
- 230000007062 hydrolysis Effects 0.000 description 1
- 230000003301 hydrolyzing effect Effects 0.000 description 1
- 150000003949 imides Chemical class 0.000 description 1
- 238000011081 inoculation Methods 0.000 description 1
- 239000002054 inoculum Substances 0.000 description 1
- 229920003303 ion-exchange polymer Polymers 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- OOYGSFOGFJDDHP-KMCOLRRFSA-N kanamycin A sulfate Chemical compound OS(O)(=O)=O.O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N OOYGSFOGFJDDHP-KMCOLRRFSA-N 0.000 description 1
- 229960002064 kanamycin sulfate Drugs 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 238000009630 liquid culture Methods 0.000 description 1
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000011177 media preparation Methods 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 208000004296 neuralgia Diseases 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 230000000144 pharmacologic effect Effects 0.000 description 1
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- 108010025488 pinealon Proteins 0.000 description 1
- GNSKLFRGEWLPPA-UHFFFAOYSA-M potassium dihydrogen phosphate Chemical compound [K+].OP(O)([O-])=O GNSKLFRGEWLPPA-UHFFFAOYSA-M 0.000 description 1
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 238000005086 pumping Methods 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 239000008213 purified water Substances 0.000 description 1
- 239000000018 receptor agonist Substances 0.000 description 1
- 229940044601 receptor agonist Drugs 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 238000010257 thawing Methods 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- 108700004896 tripeptide FEG Proteins 0.000 description 1
- 108010058119 tryptophyl-glycyl-glycine Proteins 0.000 description 1
- 108010029384 tryptophyl-histidine Proteins 0.000 description 1
- 108010071635 tyrosyl-prolyl-arginine Proteins 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 239000013598 vector Substances 0.000 description 1
- 239000011534 wash buffer Substances 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P13/00—Preparation of nitrogen-containing organic compounds
- C12P13/02—Amides, e.g. chloramphenicol or polyamides; Imides or polyimides; Urethanes, i.e. compounds comprising N-C=O structural element or polyurethanes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/78—Hydrolases (3) acting on carbon to nitrogen bonds other than peptide bonds (3.5)
- C12N9/86—Hydrolases (3) acting on carbon to nitrogen bonds other than peptide bonds (3.5) acting on amide bonds in cyclic amides, e.g. penicillinase (3.5.2)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y305/00—Hydrolases acting on carbon-nitrogen bonds, other than peptide bonds (3.5)
- C12Y305/02—Hydrolases acting on carbon-nitrogen bonds, other than peptide bonds (3.5) in cyclic amides (3.5.2)
- C12Y305/02002—Dihydropyrimidinase (3.5.2.2), i.e. hydantoinase
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Genetics & Genomics (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Medicinal Chemistry (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Enzymes And Modification Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
本发明提供了一种制备(R)‑3‑(氨甲酰甲基)‑5‑甲基己酸的方法,包括如下步骤:以3‑异丁基戊二酰亚胺为底物,使用二氢嘧啶氨基水解酶SEQ ID NO:1或者SEQ ID NO:3催化开环反应,得到(R)‑3‑(氨甲酰甲基)‑5‑甲基己酸。
Description
技术领域
本发明属于生物催化技术领域,涉及一种制备普瑞巴林手性中间体的技术,具体涉及一种利用二氢嘧啶氨基水解酶制备(R)-3-(氨甲酰甲基)-5-甲基己酸的方法。
背景技术
普瑞巴林(Pregabalin)化学名为(S)-3-氨甲基-5-甲基己酸,结构式如下,是美国Warner-lambert公司研发的新一代γ-氨基丁酸(γ-GABA)受体激动剂,主要用于治疗外周神经痛以及辅助性治疗局限性部分癫痫发作,其中S-构型的药理活性是R-构型的10倍,因此合成光学纯的手性普瑞巴林对于增强药物疗效、减少非活性对映体引发的副作用具有重要意义。
目前已报道的普瑞巴林合成方法有不少,但主要是通过不对称催化剂、手性配体的化学不对称合成法,或者是使用化学试剂或商品酶对反应过程中的消旋中间体进行手性拆分。通过高效酶催化潜手性化合物直接获得手性中间体是最经济、有效的途径。其中二氢嘧啶氨基水解酶底物谱较为广泛,可以水解一些酰亚胺、二氢嘧啶和海因,然而它们更倾向于水解一些环酰亚胺类物质。普瑞巴林的手性中间体(R)-3-(氨甲酰甲基)-5-甲基己酸可以通过水解3-异丁基戊二酰亚胺获得,而3-异丁基戊二酰亚胺是环酰亚胺类化合物。
因此,推测高选择性、高活性的二氢嘧啶氨基水解酶直接水解3-异丁基戊二酰亚胺可以获得手性中间体(R)-3-(氨甲酰甲基)-5-甲基己酸。
发明内容
基于上述推测,发明人对于多种微生物来源的二氢嘧啶氨基水解酶(Dihydropyrimidinase,EC.3.5.2.2)进行了筛选,包括41种二氢嘧啶氨基水解酶。利用大肠杆菌表达这些二氢嘧啶氨基水解酶,并通过发酵菌体催化3-异丁基戊二酰亚胺发生水解反应,检测反应液中(R)-3-(氨甲酰甲基)-5-甲基己酸的含量,进行目标二氢嘧啶氨基水解酶的筛选。
经过筛选,发现荧光假单胞菌Pseudomonas fluorescens来源的两种二氢嘧啶氨基水解酶SEQ ID NO:1(NCBI登录号:WP_011030900.1)和SEQ ID NO:3(NCBI登录号:WP_011334810.1)能够催化3-异丁基戊二酰亚胺反应生成(R)-3-(氨甲酰甲基)-5-甲基己酸。
因此,本发明的第一个目的在于提供一种酶催化法制备(R)-3-(氨甲酰甲基)-5-甲基己酸的技术方案。
一种制备(R)-3-(氨甲酰甲基)-5-甲基己酸的方法,其包括如下步骤:以3-异丁基戊二酰亚胺为底物,使用二氢嘧啶氨基水解酶SEQ ID NO:1或者SEQ ID NO:3催化开环反应,得到(R)-3-(氨甲酰甲基)-5-甲基己酸。
其中,第一种二氢嘧啶氨基水解酶的氨基酸序列为SEQ ID NO:1:
MSSRTVIRGGLVITASDEIHADVLIEDGRVAALAATGTPAAEAFTAENVIDASGKYVIPGGVDGHTHMEMPFGGTYAADTFETGTRAAAWGGTTTIVDFAIQSVGHSLREGLDAWHAKAEGNCAIDYGFHMIVSDVNQETLKEMDLLVEEGVTSFKQFMAYPGVFYSDDGQILRAMQRAAENGGLIMMHAENGIAIDVLVEQALARGETDPRFHGEVRKALLEAEATHRAIRLAQVAGAPLYVVHVSATEAVAELTRARDEGLPVFGETCPQYLFLSTDNLAEPDFEGAKYVCSTPLRPKEHQAALWRGLRTNDLQVVSTDHCPFCFSGQKELGRGDFSRIPNGMPGVENRMDLLHQAVVEGHIGRRRWIEIACATPARMFGLYPKKGTIAPGADADIVVYDPHAEQVISAETHHMNVDYSAYEGRRITGRVETVLSRGEPVVTEREYTGRKGHGAYTPRATCQYLT(SEQ ID NO:1);
第二种二氢嘧啶氨基水解酶的氨基酸序列为SEQ ID NO:3:
MSLLIRGATIVTHDESYRADVYCADGVIKAIGENLDIPAGAEVLDGSGQYLMPGGIDPHTHMQLPFMGTVASEDFYSGTAAGLAGGTTSIIDFVIPNPQQSLLEAFHQWRGWAEKSASDYGFHVAITWWSEQVREEMAELVSHHGINSFKHFMAYKNAIMAADDTLVASFERCLELGAVPTVHAENGELVYHLQRKLMAQGITGPEAHPLSRPSQVEGEAASRAIRIAETIGTPLYLVHVSTKEALDEITYARSKGQPVYGEVLAGHLLLDDSVYQHPDWQTAAGYVMSPPFRPRGHQDALWHGLQSGNLHTTATDHCCFCAEQKAAGRDDFSKIPNGTAGIEDRMAVLWDEGVNSGRLSMQDFVALTSTNTAKIFNLYPRKGAIRVGADADLVLWDPQGTRTISAKTHHQQVDFNIFEGKTVTGVPSHTVSQGRVVWADGDLRAERGAGRYIERPAYPAVFDLLSKRAEQHKPTAVKR(SEQ ID NO:3)。
在一种实施方式中,上述二氢嘧啶氨基水解酶SEQ ID NO:1或者SEQ ID NO:3可以呈酶的形式例如游离酶或固定化酶,也可以呈其表达微生物菌体形式。
上述微生物可以是任何适合于表达上述二氢嘧啶氨基水解酶的微生物,例如选自枯草芽孢杆菌、毕赤酵母、酿酒酵母、大肠杆菌。
优选地,所述微生物是大肠杆菌,更优选大肠杆菌BL21(DE3)。
本发明的第二个方面提供了一种编码上述二氢嘧啶氨基水解酶SEQ ID NO:1或者SEQ ID NO:3的基因。当上述二氢嘧啶氨基水解酶的表达宿主是大肠杆菌时,上述二氢嘧啶氨基水解酶SEQ ID NO:1的编码基因的核苷酸序列可以为SEQ ID NO:2,但不限于此;上述二氢嘧啶氨基水解酶SEQ ID NO:3的编码基因的核苷酸序列可以是为SEQ ID NO:4,但不限于此。
上述编码基因可以克隆在一种质粒载体,用于转化微生物宿主细胞。
在一种实施方式中,上述质粒载体选自PET系列,例如选自pET22b、pET24a、pET28a等,但并不受限于此。
本发明的第三个方面提供了一种大肠杆菌,其转化了包含上述二氢嘧啶氨基水解酶编码基因的质粒。
上述酶促反应的反应体系可以是pH6-9的缓冲液,例如Tris-HCl缓冲液或者磷酸缓冲液等,但并不受限于此。
在反应过程中,控制反应体系pH7.0-9.0、优选pH7.5-8.5、更优选pH7.5左右;反应温度25-50℃、优选28-48℃、更优选30-45℃、更优选35-40℃、最优选40℃左右。
本发明筛选出的二氢嘧啶氨基水解酶SEQ ID NO:1和SEQ ID NO:3可以催化底物3-异丁基戊二酰亚胺发生水解反应,开环生成(R)-3-(氨甲酰甲基)-5-甲基己酸,开辟了一种生物催化法生产普瑞巴林手性中间体(R)-3-(氨甲酰甲基)-5-甲基己酸的新途径。
附图说明
图1是本发明实施例中底物反应8小时后的HPLC检测谱图。
图2是本发明实施例催化反应结束后HPLC检测的产物手性色谱图。
具体实施方式
发明人从已报到的41种二氢嘧啶氨基水解酶中筛选能用于合成(R)-3-(氨甲酰甲基)-5-甲基己酸的二氢嘧啶氨基水解酶。这41种二氢嘧啶氨基水解酶包括如下NCBI登录号/GenBank登录号:WP_011030900.1(Streptomyces coelicolor A3(2))、WP_011334810.1(Pseudomonas fluorescens PfO-1)、EUB75082.1(Pseudomonas sp.GM41)、ADU69230.1(Pantoea sp.At-9b)、ABY99529.1(Pseudomonas putida GB-1)、ARW18102.1(Komagataeibacter europaeus)、ADD28645.1(Meiothermus ruber DSM 1279)、SMC02439.1(Rubrobacter radiotolerans DSM 5868)、SMB90978.1(Peptoniphilusasaccharolyticus DSM20463)、SMB99550.1(Thermanaeromonas toyohensis ToBE)、ADO81913.1(Ilyobacter polytropus DSM 2926)、AEW05800.1(Sulfobacillusacidophilus DSM 10332)、EFQ24407.1(Aminomonas paucivorans DSM 12260)、AFZ32681.1(Gloeocapsa sp.PCC 7428)、AEV96856.1(Niastella koreensis GR20-10)、OCK09729.1(Thalassospira sp.KO164)、EED66725.1(Comamonas testosteroni KF-1)、APA99167.1(Nocardia seriolae)、CDM58815.1(Rhizobium favelukesii)、ADP81169.1(Frankia inefficax)、ARW10815.1(Acetobacter ascendens)、APR94628.1(Pandoraeathiooxydans)、ODP29750.1(Paenibacillus nuruki)、ACL62933.1(Methylobacteriumnodulans ORS 2060)、OAZ72086.1(Acetobacter pasteurianus)、KXA69515.1(Megasphaera sp.MJR8396C)、KXB88395.1(Veillonella sp.DNF00869)、AJY49223.1(Halomonas sp.KO116)、ADW67334.1(Granulicella tundricola MP5ACTX9)、ADH91532.1(Starkeya novella DSM 506)、ADG98040.1(Segniliparus rotundus DSM 44985)、ACX38503.1(Escherichia coli DH1)、ACI50717.1(Gluconacetobacter diazotrophicusPA1 5)、ACB68161.1(Burkholderia ambifaria MC40-6)、ABR61199.1(Sinorhizobiummedicae WSM419)、ABO58457.1(Burkholderia vietnamiensis G4)、ABM31567.1(Acidovorax citrulli AAC00-1)、ACU60912.1(Chitinophaga pinensis DSM2588)、ACV29453.1(Anaerococcus prevotii DSM 20548)、ABX35502.1(Delftia acidovoransSPH-1)、ERL25429.1(Mitsuokella sp.oral taxon 131str.W9106)。最终得到两种野生型二氢嘧啶氨基水解酶具有该功能,它们均来源于荧光假单胞菌Pseudomonas fluorescens,其中NCBI登录号为WP_011030900.1的二氢嘧啶氨基水解酶SEQ ID NO:1的氨基酸数量有467个,NCBI登录号为WP_011334810.1的二氢嘧啶氨基水解酶SEQ ID NO:3的氨基酸数量有479个。
由于它们的氨基酸序列明确,因此本领域技术人员很容易获得其编码基因、包含这些基因的表达盒和质粒、以及包含该质粒的转化体。
这些基因、表达盒、质粒、转化体可以通过本领域技术人员所熟知的基因工程构建方式获得。
为了在微生物宿主例如基因工程中最常用的大肠杆菌宿主中最佳地表达二氢嘧啶氨基水解酶SEQ ID NO:1或者SEQ ID NO:3,本发明对其表达基因进行了密码子优化。
密码子优化是可用于通过增加感兴趣基因的翻译效率使生物体中蛋白质表达最大化的一种技术。不同的生物体由于突变倾向和天然选择而通常示出对于编码相同氨基酸的一些密码子之一的特殊偏好性。例如,在生长快速的微生物如大肠杆菌中,优化密码子反映出其各自的基因组tRNA库的组成。因此,在生长快速的微生物中,氨基酸的低频率密码子可以用用于相同氨基酸的但高频率的密码子置换。因此,优化的DNA序列的表达在快速生长的微生物中得以改良。
经过密码子优化,二氢嘧啶氨基水解酶SEQ ID NO:1的编码基因可以是SEQ IDNO:2,二氢嘧啶氨基水解酶SEQ ID NO:3的编码基因可以是SEQ ID NO:4。
当作为生物催化剂用于制备(R)-3-(氨甲酰甲基)-5-甲基己酸时,本发明的二氢嘧啶氨基水解酶以及添加的葡萄糖二氢嘧啶氨基水解酶可以呈现酶的形式或者菌体的形式。所述酶的形式包括游离酶、固定化酶,包括纯化酶、粗酶、发酵液、载体固定的酶等;所述菌体的形式包括存活菌体和死亡菌体。
纯的二氢嘧啶氨基水解酶SEQ ID NO:1和SEQ ID NO:3用于催化水解反应时,一般只能一次性使用,使得(R)-3-(氨甲酰甲基)-5-甲基己酸的生产成本较高。为了提高反应工艺经济性,有必要重复利用该二氢嘧啶氨基水解酶。
生物催化领域公知,与游离酶法相比,应用固定化酶技术具有生产过程简化、生产效率提高等优点。同时,由于酶可多次使用,且酶的稳定性提高,从而有效提高了单位酶的生产力;其次,固定化酶极易与底物、产物分开,简化了提纯工艺,产率较高,产品质量较好。
本发明采用离子交换树脂作为固定化载体分别对二氢嘧啶氨基水解酶SEQ IDNO:1和SEQ ID NO:3进行固定化,能够重复多次用于底物3-异丁基戊二酰亚胺的水解开环反应。
以下结合具体实施例对本发明做进一步详细说明。应理解,以下实施例仅用于说明本发明而非用于限定本发明的范围。
实施例
实施例中涉及到多种物质的添加量、含量及浓度,其中所述的百分含量,除特别说明外,皆指质量百分含量。
材料和方法
实施例中的全基因合成、引物合成及测序皆由苏州金唯智生物科技有限公司完成。
实施例中的分子生物学实验包括质粒构建、酶切、连接、感受态细胞制备、转化、培养基配制等等,主要参照《分子克隆实验指南》第三版(J.萨姆布鲁克,D.W.拉塞尔(美)编著,黄培堂等译,科学出版社,北京,2002)进行。必要时可以通过简单试验确定具体实验条件。
PCR扩增实验根据质粒或DNA模板供应商提供的反应条件或试剂盒说明书进行。必要时可以通过简单试验予以调整。
LB培养基:10g/L胰蛋白胨,5g/L酵母提取物,10g/L氯化钠,pH7.2。(LB固体培养基另加20g/L琼脂粉。)
TB培养基:24g/L酵母提取物、12g/L胰蛋白胨、16.43g/L K2HPO4·3H2O、2.31g/LKH2PO4、5g/L甘油,pH7.0-7.5。(TB固体培养基另加20g/L琼脂粉。)
底物和产物HPLC检测方法:
1、含量检测
色谱仪:岛津LC-2010
柱子:C18 column
流动相:乙腈:水(20:80,v/v;pH 2.5,磷酸调pH)
流速:1.0ml/min
检查测器,UV 210
柱温箱:30℃。
2、手性ee检测
色谱仪:岛津LC-2010
柱子:Chiralpack AD-RH column(Daicel)
流动相:乙腈:水(50:50,v/v;pH 2.5,磷酸调pH)
流速:0.5ml/min
检查测器,UV 210
柱温箱:30℃。
实施例1:构建表达二氢嘧啶氨基水解酶的重组大肠杆菌
1.1根据Pseudomonas fluorescens(荧光假单胞菌)来源的二氢嘧啶氨基水解酶的氨基酸序列SEQ ID NO:1(NCBI登录号WP_011030900.1),进行适于大肠杆菌表达的密码子优化,优化后的基因序列为SEQ ID NO:2。全基因合成该基因序列,在两端设计酶切位点Nde I和XhoI,并亚克隆到载体pET24a(购自Novagen)上相应位点,从而获得重组质粒pET24a-IH-1。将构建好的重组质粒pET24a-IH-1用电转化法转入大肠杆菌BL21(DE3)感受态,得到表达二氢嘧啶氨基水解酶SEQ ID NO:1的重组大肠杆菌BL21(DE3)/pET24a-IH-1。
1.2参照步骤1.1,构建表达二氢嘧啶氨基水解酶SEQ ID NO:3的重组大肠杆菌BL21(DE3)/pET24a-IH-2。
1.3参照步骤1.1,分别构建表达其他微生物来源的二氢嘧啶氨基水解酶(GenBank登录号或NCBI登录号WP_011334810.1、EUB75082.1、ADU69230.1、ABY99529.1、ARW18102.1、ADD28645.1、SMC02439.1、SMB90978.1、SMB99550.1、ADO81913.1、AEW05800.1、EFQ24407.1、AFZ32681.1、AEV96856.1、OCK09729.1、EED66725.1、APA99167.1、CDM58815.1、ADP81169.1、ARW10815.1、APR94628.1、ODP29750.1、ACL62933.1、OAZ72086.1、KXA69515.1、KXB88395.1、AJY49223.1、ADW67334.1、ADH91532.1、ADG98040.1、ACX38503.1、ACI50717.1、ACB68161.1、ABR61199.1、ABO58457.1、ABM31567.1、ACU60912.1、ACV29453.1、ABX35502.1、ERL25429.1)的重组大肠杆菌BL21(DE3)/pET24a-IH-3至BL21(DE3)/pET24a-IH-41。
实施例2:二氢嘧啶氨基水解酶的筛选
2.1菌株发酵
分别挑取各个二氢嘧啶氨基水解酶表达菌株单菌落,分别接种至3mL含50μg/mL硫酸卡那霉素的LB培养基中,37℃,200rpm培养过夜。分别按照1v/v%接种量分别转接至200mL TB培养基中,37℃,250rpm培养至OD600 0.6-0.8时,加入0.5mM IPTG,28℃,200rpm培养过夜。然后4℃,10000rpm,离心10min,收集菌体冷冻备用。
2.2催化(R)-3-(氨甲酰甲基)-5-甲基己酸合成
反应体系:0.1M Tris-HCl缓冲液(pH7.5)1L、10wt%底物3-异丁基戊二酰亚胺、5%w/v冻融菌体,30℃反应24小时,HPLC检测底物样品浓度和产物,筛选有产物(R)-3-(氨甲酰甲基)-5-甲基己酸出现的菌株。
经过HPLC检测验证,发现BL21(DE3)/pET24a-IH-1和BL21(DE3)/pET24a-IH-2催化的反应液中出现了(R)-3-(氨甲酰甲基)-5-甲基己酸吸收峰。
实施例3:二氢嘧啶氨基水解酶的提取和固定化
3.1二氢嘧啶氨基水解酶表达菌株的摇瓶发酵:分别配制液体培养基TB(pH7.2),分装于500mL三角摇瓶,装液量100mL,然后在高压灭菌锅中于121℃加热灭菌20min。从工程菌BL21(DE3)/pET24a-IH-1或BL21(DE3)/pET24a-IH-2的LB平板上用接种环挑取数个菌落,接种于TB摇瓶中,接种前在TB培养基中加入100μg/mL卡那霉素,于37℃、220rpm摇床培养至OD600=5-6,加入0.2mM IPTG于28℃诱导24h左右。
3.2酶的提取分离:取发酵液50ml装入离心管中;8000rpm离心去上清液得到菌体,按菌体200g/L加纯化水重悬,悬浮菌体用冰浴冷却,进行超声破碎(电压400W,超声时间3s,间隔时间5s,工作次数80次)得粗酶液,放置于冰浴中待用。
粗酶液采用Ni-NTG亲和层析柱(南京金瑞斯生物,产品编号:L00250/L00250-C)纯化目的蛋白。缓冲液预平衡Ni-NTG亲和层析柱,使用10个柱体积的洗涤缓冲液(50mM pH8.0Tris-HCl,300mM NaCl,50mM咪唑)清洗,之后用洗脱缓冲液(50mM pH 8.0Tris-HCl,300mM NaCl,250mM咪唑)将目的蛋白从Ni柱中洗脱。
3.3酶固定化:采用环氧型离子交换树脂LX-1000EP对酶进行固定化,方法如下:用0.05M磷酸缓冲液(pH8.0)反复清洗固定化载体/>LX-1000EP(西安蓝晓科技有限公司)。按酶蛋白:载体比大约1:12-15的比例,置于0.1M磷酸钾缓冲液(pH8.0),在20-25℃条件下,150rpm振摇;1min后停止振摇,检测并调整pH8.0左右,继续振摇18h;停止振摇,在相同温度下静置24h;除去上清,用0.02M磷酸钾缓冲液(pH8.0)在20-25℃条件下振摇2min;除去上清,再次以相同缓冲液振摇洗涤45min;除去上清,再次用0.02M磷酸钾缓冲液(pH8.0)清洗,用真空泵抽干,即得固定化酶,在2℃-8℃条件下保存备用。
实施例4:酶法合成(R)-3-(氨甲酰甲基)-5-甲基己酸
反应体系(1L):0.1M Tris-HCl缓冲液(pH7.5)1L、10wt%底物3-异丁基戊二酰亚胺,10wt%固定化二氢嘧啶氨基水解酶SEQ ID NO:1,用6M NaOH校正pH7.5。在30℃的条件下反应8小时后,取100μl反应液,12000rpm离心5分钟,对上清液进行超滤,滤液进行HPLC检测产物的转化情况,结果见图1,反应8小时后有至少60%的底物已经转化为产物,12小时转化率达到95%以上。
固定化二氢嘧啶氨基水解酶SEQ ID NO:3的催化反应结果与上述结果相似。
实施例5:酶法合成(R)-3-(氨甲酰甲基)-5-甲基己酸
反应体系(1L):0.1M Tris-HCl缓冲液(pH7.5)1L、10wt%底物3-异丁基戊二酰亚胺,10wt%固定化二氢嘧啶氨基水解酶SEQ ID NO:1,用6M NaOH校正pH8.5。在40℃的条件下搅拌反应24个小时后,过滤获得反应液,反应液减压蒸馏,浓缩至剩余约1/5体积,降温至0-5℃,搅拌析晶lh,过滤、干燥得目标产物化合物。利用HPLC检测(R)-3-(氨甲酰甲基)-5-甲基己酸手性纯度。结果如图2所示,产物(R)-3-(氨甲酰甲基)-5-甲基己酸晶体中,R-构型的ee值为99.5%以上。说明固定化二氢嘧啶氨基水解酶SEQ ID NO:1或SEQ ID NO:3的立体选择性很高,其可以催化制备高手性纯度的普瑞巴林中间体(R)-3-(氨甲酰甲基)-5-甲基己酸。
固定化二氢嘧啶氨基水解酶SEQ ID NO:3的催化反应结果相似,也得到了高光学纯度的(R)-3-(氨甲酰甲基)-5-甲基己酸。
以上实验表明,二氢嘧啶氨基水解酶SEQ ID NO:1和SEQ ID NO:3能够催化3-异丁基戊二酰亚胺进行开环反应,得到(R)-3-(氨甲酰甲基)-5-甲基己酸,且都具有高度的立体选择性,可用于制备高手性纯度的普瑞巴林中间体。
序列表
<110> 杭州酶因生物技术有限公司
<120> 一种二氢嘧啶氨基水解酶及其应用
<130> SHPI2110434
<160> 4
<170> SIPOSequenceListing 1.0
<210> 1
<211> 467
<212> PRT
<213> Pseudomonas fluorescens
<400> 1
Met Ser Ser Arg Thr Val Ile Arg Gly Gly Leu Val Ile Thr Ala Ser
1 5 10 15
Asp Glu Ile His Ala Asp Val Leu Ile Glu Asp Gly Arg Val Ala Ala
20 25 30
Leu Ala Ala Thr Gly Thr Pro Ala Ala Glu Ala Phe Thr Ala Glu Asn
35 40 45
Val Ile Asp Ala Ser Gly Lys Tyr Val Ile Pro Gly Gly Val Asp Gly
50 55 60
His Thr His Met Glu Met Pro Phe Gly Gly Thr Tyr Ala Ala Asp Thr
65 70 75 80
Phe Glu Thr Gly Thr Arg Ala Ala Ala Trp Gly Gly Thr Thr Thr Ile
85 90 95
Val Asp Phe Ala Ile Gln Ser Val Gly His Ser Leu Arg Glu Gly Leu
100 105 110
Asp Ala Trp His Ala Lys Ala Glu Gly Asn Cys Ala Ile Asp Tyr Gly
115 120 125
Phe His Met Ile Val Ser Asp Val Asn Gln Glu Thr Leu Lys Glu Met
130 135 140
Asp Leu Leu Val Glu Glu Gly Val Thr Ser Phe Lys Gln Phe Met Ala
145 150 155 160
Tyr Pro Gly Val Phe Tyr Ser Asp Asp Gly Gln Ile Leu Arg Ala Met
165 170 175
Gln Arg Ala Ala Glu Asn Gly Gly Leu Ile Met Met His Ala Glu Asn
180 185 190
Gly Ile Ala Ile Asp Val Leu Val Glu Gln Ala Leu Ala Arg Gly Glu
195 200 205
Thr Asp Pro Arg Phe His Gly Glu Val Arg Lys Ala Leu Leu Glu Ala
210 215 220
Glu Ala Thr His Arg Ala Ile Arg Leu Ala Gln Val Ala Gly Ala Pro
225 230 235 240
Leu Tyr Val Val His Val Ser Ala Thr Glu Ala Val Ala Glu Leu Thr
245 250 255
Arg Ala Arg Asp Glu Gly Leu Pro Val Phe Gly Glu Thr Cys Pro Gln
260 265 270
Tyr Leu Phe Leu Ser Thr Asp Asn Leu Ala Glu Pro Asp Phe Glu Gly
275 280 285
Ala Lys Tyr Val Cys Ser Thr Pro Leu Arg Pro Lys Glu His Gln Ala
290 295 300
Ala Leu Trp Arg Gly Leu Arg Thr Asn Asp Leu Gln Val Val Ser Thr
305 310 315 320
Asp His Cys Pro Phe Cys Phe Ser Gly Gln Lys Glu Leu Gly Arg Gly
325 330 335
Asp Phe Ser Arg Ile Pro Asn Gly Met Pro Gly Val Glu Asn Arg Met
340 345 350
Asp Leu Leu His Gln Ala Val Val Glu Gly His Ile Gly Arg Arg Arg
355 360 365
Trp Ile Glu Ile Ala Cys Ala Thr Pro Ala Arg Met Phe Gly Leu Tyr
370 375 380
Pro Lys Lys Gly Thr Ile Ala Pro Gly Ala Asp Ala Asp Ile Val Val
385 390 395 400
Tyr Asp Pro His Ala Glu Gln Val Ile Ser Ala Glu Thr His His Met
405 410 415
Asn Val Asp Tyr Ser Ala Tyr Glu Gly Arg Arg Ile Thr Gly Arg Val
420 425 430
Glu Thr Val Leu Ser Arg Gly Glu Pro Val Val Thr Glu Arg Glu Tyr
435 440 445
Thr Gly Arg Lys Gly His Gly Ala Tyr Thr Pro Arg Ala Thr Cys Gln
450 455 460
Tyr Leu Thr
465
<210> 2
<211> 1404
<212> DNA
<213> 人工序列()
<400> 2
atgagcagcc gtaccgttat tcgtggtggt ctggttatta ccgcaagtga tgaaattcat 60
gccgatgtgc tgattgaaga tggtcgtgtt gcagcactgg cagcaaccgg tacaccggca 120
gcagaagcat ttaccgcaga aaatgttatt gatgccagcg gcaaatatgt tattccaggt 180
ggtgttgatg gtcataccca catggaaatg ccgtttggtg gcacctatgc agcagatacc 240
tttgaaaccg gtacgcgtgc agcagcatgg ggtggcacca ccaccattgt tgattttgca 300
attcagagcg ttggtcatag cctgcgtgaa ggtctggatg catggcatgc aaaagccgaa 360
ggtaattgtg caattgatta tggctttcac atgattgtga gcgacgttaa tcaagaaacc 420
ctgaaagaaa tggatctgct ggttgaagaa ggtgtgacca gctttaaaca gtttatggca 480
tatccgggtg tgttctatag tgatgatggt cagattctgc gtgcaatgca gcgtgcagcc 540
gaaaatggtg gcctgattat gatgcatgcg gaaaatggta ttgccattga tgttctggtt 600
gaacaggcac tggcacgtgg tgaaaccgat ccgcgttttc atggtgaagt tcgtaaagca 660
ctgctggaag ccgaagcaac ccatcgtgca attcgtctgg cacaggttgc gggtgcaccg 720
ctgtatgttg ttcatgttag cgcaaccgaa gcagttgcag aactgacccg tgcacgtgat 780
gaaggcctgc cggtttttgg cgaaacctgt ccgcagtacc tgtttctgag caccgataat 840
ctggccgaac cggattttga aggtgcaaaa tatgtttgta gcacaccgct gcgtccgaaa 900
gaacatcagg cagcactgtg gcgtggtctg cgtaccaatg atctgcaggt tgttagcacc 960
gatcattgtc cgttttgttt tagcggtcag aaagaattag gtcgcggtga ttttagccgt 1020
attccgaatg gtatgcctgg tgttgaaaat cgtatggacc tgctgcatca ggccgttgtg 1080
gaaggtcata ttggtcgtcg tcgttggatt gaaattgcat gtgcaacacc ggcacgtatg 1140
tttggtctgt atccgaaaaa aggcaccatt gcaccgggtg cagatgcaga tattgttgtt 1200
tatgatccgc atgccgaaca ggttattagc gcagaaaccc atcacatgaa tgttgattat 1260
agcgcctatg aaggtcgtcg tattaccggt cgtgtggaaa ccgttctgag ccgtggtgaa 1320
ccggttgtta ccgaacgtga atataccggt cgcaaaggtc atggtgcata tacaccgcgt 1380
gcaacctgtc agtatctgac ctaa 1404
<210> 3
<211> 479
<212> PRT
<213> Pseudomonas fluorescens
<400> 3
Met Ser Leu Leu Ile Arg Gly Ala Thr Ile Val Thr His Asp Glu Ser
1 5 10 15
Tyr Arg Ala Asp Val Tyr Cys Ala Asp Gly Val Ile Lys Ala Ile Gly
20 25 30
Glu Asn Leu Asp Ile Pro Ala Gly Ala Glu Val Leu Asp Gly Ser Gly
35 40 45
Gln Tyr Leu Met Pro Gly Gly Ile Asp Pro His Thr His Met Gln Leu
50 55 60
Pro Phe Met Gly Thr Val Ala Ser Glu Asp Phe Tyr Ser Gly Thr Ala
65 70 75 80
Ala Gly Leu Ala Gly Gly Thr Thr Ser Ile Ile Asp Phe Val Ile Pro
85 90 95
Asn Pro Gln Gln Ser Leu Leu Glu Ala Phe His Gln Trp Arg Gly Trp
100 105 110
Ala Glu Lys Ser Ala Ser Asp Tyr Gly Phe His Val Ala Ile Thr Trp
115 120 125
Trp Ser Glu Gln Val Arg Glu Glu Met Ala Glu Leu Val Ser His His
130 135 140
Gly Ile Asn Ser Phe Lys His Phe Met Ala Tyr Lys Asn Ala Ile Met
145 150 155 160
Ala Ala Asp Asp Thr Leu Val Ala Ser Phe Glu Arg Cys Leu Glu Leu
165 170 175
Gly Ala Val Pro Thr Val His Ala Glu Asn Gly Glu Leu Val Tyr His
180 185 190
Leu Gln Arg Lys Leu Met Ala Gln Gly Ile Thr Gly Pro Glu Ala His
195 200 205
Pro Leu Ser Arg Pro Ser Gln Val Glu Gly Glu Ala Ala Ser Arg Ala
210 215 220
Ile Arg Ile Ala Glu Thr Ile Gly Thr Pro Leu Tyr Leu Val His Val
225 230 235 240
Ser Thr Lys Glu Ala Leu Asp Glu Ile Thr Tyr Ala Arg Ser Lys Gly
245 250 255
Gln Pro Val Tyr Gly Glu Val Leu Ala Gly His Leu Leu Leu Asp Asp
260 265 270
Ser Val Tyr Gln His Pro Asp Trp Gln Thr Ala Ala Gly Tyr Val Met
275 280 285
Ser Pro Pro Phe Arg Pro Arg Gly His Gln Asp Ala Leu Trp His Gly
290 295 300
Leu Gln Ser Gly Asn Leu His Thr Thr Ala Thr Asp His Cys Cys Phe
305 310 315 320
Cys Ala Glu Gln Lys Ala Ala Gly Arg Asp Asp Phe Ser Lys Ile Pro
325 330 335
Asn Gly Thr Ala Gly Ile Glu Asp Arg Met Ala Val Leu Trp Asp Glu
340 345 350
Gly Val Asn Ser Gly Arg Leu Ser Met Gln Asp Phe Val Ala Leu Thr
355 360 365
Ser Thr Asn Thr Ala Lys Ile Phe Asn Leu Tyr Pro Arg Lys Gly Ala
370 375 380
Ile Arg Val Gly Ala Asp Ala Asp Leu Val Leu Trp Asp Pro Gln Gly
385 390 395 400
Thr Arg Thr Ile Ser Ala Lys Thr His His Gln Gln Val Asp Phe Asn
405 410 415
Ile Phe Glu Gly Lys Thr Val Thr Gly Val Pro Ser His Thr Val Ser
420 425 430
Gln Gly Arg Val Val Trp Ala Asp Gly Asp Leu Arg Ala Glu Arg Gly
435 440 445
Ala Gly Arg Tyr Ile Glu Arg Pro Ala Tyr Pro Ala Val Phe Asp Leu
450 455 460
Leu Ser Lys Arg Ala Glu Gln His Lys Pro Thr Ala Val Lys Arg
465 470 475
<210> 4
<211> 1440
<212> DNA
<213> 人工序列()
<400> 4
atgagcctgc tgattcgtgg tgcaaccatt gttacccatg atgaaagcta tcgtgccgat 60
gtttattgtg cagatggtgt gattaaagcc attggcgaaa atctggatat tcctgccggt 120
gccgaagttc tggatggtag cggtcagtat ctgatgcctg gtggtattga tccgcataca 180
cacatgcagc tgccgtttat gggcaccgtt gcaagcgaag atttctatag cggcaccgca 240
gcaggtctgg caggcggtac aaccagcatt attgattttg ttattccgaa tccgcagcaa 300
agcctgctgg aagcatttca tcagtggcgt ggttgggcag aaaaaagcgc aagcgattat 360
ggttttcatg ttgcaattac ctggtggtca gaacaggttc gtgaagaaat ggcagaactg 420
gttagccatc atggcattaa cagctttaaa cacttcatgg cctataaaaa cgcaatcatg 480
gcagcagatg ataccctggt tgccagcttt gaacgttgtc tggaactggg tgcagttccg 540
accgttcatg cagaaaatgg tgaactggtt tatcatttac agcgtaaact gatggcacag 600
ggtattaccg gtccggaagc acatccgctg agccgtccga gccaggttga aggtgaagca 660
gcaagccgtg caattcgtat tgcagaaacc attggtacac cgctgtatct ggttcatgtt 720
agcaccaaag aagcactgga cgaaatcacc tatgcacgta gcaaaggtca gccggtttat 780
ggtgaagtgc tggcaggtca tttactgctg gatgatagcg tttatcagca tccggattgg 840
cagaccgcag ccggttatgt tatgagccct ccgtttcgtc cgcgtggtca tcaggatgca 900
ctgtggcatg gtctgcagag cggtaatctg cataccaccg caaccgatca ttgttgtttt 960
tgtgccgaac agaaagcagc cggtcgtgat gattttagca aaattccgaa tggtacagcc 1020
ggtattgaag atcgtatggc agttctgtgg gatgaaggtg ttaatagcgg tcgtctgagc 1080
atgcaggatt ttgttgcact gaccagcacc aataccgcca aaatctttaa tctgtatccg 1140
cgtaaaggtg ccattcgtgt tggtgcagat gcagatctgg tgctgtggga cccgcagggc 1200
acccgtacca ttagcgcaaa aacccatcat cagcaggttg attttaacat ctttgaaggt 1260
aaaaccgtta ccggtgttcc gagccatacc gttagccagg gtcgtgttgt ttgggcagat 1320
ggggatctgc gtgcagaacg tggtgcaggt cgttatattg aacgtccggc atatccggca 1380
gtttttgatc tgctgagcaa acgtgcggaa cagcataaac cgaccgcagt taaacgttag 1440
Claims (10)
1.一种制备(R)-3-(氨甲酰甲基)-5-甲基己酸的方法,其特征在于,包括如下步骤:
以3-异丁基戊二酰亚胺为底物,使用氨基酸序列为SEQ ID NO:1或者SEQ ID NO:3的二氢嘧啶氨基水解酶催化开环反应,得到ee值为99.5%以上的(R)-3-(氨甲酰甲基)-5-甲基己酸。
2.如权利要求1所述的方法,其特征在于,所述二氢嘧啶氨基水解酶呈酶形式或者呈其表达微生物菌体形式。
3.如权利要求2所述的方法,其特征在于,所述酶为固定化酶。
4.如权利要求3所述的方法,其特征在于,所述固定化酶是采用环氧型离子交换树脂LX-1000EP进行固定化的二氢嘧啶氨基水解酶。
5.如权利要求2所述的方法,其特征在于,所述微生物选自枯草芽孢杆菌、毕赤酵母、酿酒酵母、大肠杆菌。
6.如权利要求5所述的方法,其特征在于,所述微生物是大肠杆菌BL21(DE3)。
7.如权利要求6所述的方法,其特征在于,所述二氢嘧啶氨基水解酶SEQ ID NO:1的编码基因的核苷酸序列为SEQ ID NO:2;或者二氢嘧啶氨基水解酶SEQ ID NO:3的编码基因的核苷酸序列为SEQ ID NO:4。
8.如权利要求1所述的方法,其特征在于,在反应过程中,控制反应体系pH7.0-9.0。
9.如权利要求1所述的方法,其特征在于,反应温度为25-50℃。
10.如权利要求4所述的方法,其特征在于,反应温度为40℃。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111224455.8A CN113755539B (zh) | 2021-10-19 | 2021-10-19 | 一种二氢嘧啶氨基水解酶及其应用 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111224455.8A CN113755539B (zh) | 2021-10-19 | 2021-10-19 | 一种二氢嘧啶氨基水解酶及其应用 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113755539A CN113755539A (zh) | 2021-12-07 |
CN113755539B true CN113755539B (zh) | 2024-05-28 |
Family
ID=78784188
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111224455.8A Active CN113755539B (zh) | 2021-10-19 | 2021-10-19 | 一种二氢嘧啶氨基水解酶及其应用 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113755539B (zh) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109554358A (zh) * | 2018-11-21 | 2019-04-02 | 安徽瑞达健康产业有限公司 | 多肽、dna分子、重组载体、转化体及其应用 |
CN111944856A (zh) * | 2020-08-10 | 2020-11-17 | 宁波酶赛生物工程有限公司 | 一种普瑞巴林中间体的合成方法 |
CN113502305A (zh) * | 2021-07-16 | 2021-10-15 | 台州学院 | 一种利用重组酰亚胺酶合成(r)-异丁基戊二酸单酰胺的方法 |
WO2023088077A1 (en) * | 2021-11-21 | 2023-05-25 | Enzymaster (Ningbo) Bio-Engineering Co., Ltd. | Biocatalysts and methods for the synthesis of pregabalin intermediates |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU2010321223A1 (en) * | 2009-11-18 | 2012-08-02 | Basf Plant Science Company Gmbh | Process for the production of fine chemicals |
-
2021
- 2021-10-19 CN CN202111224455.8A patent/CN113755539B/zh active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109554358A (zh) * | 2018-11-21 | 2019-04-02 | 安徽瑞达健康产业有限公司 | 多肽、dna分子、重组载体、转化体及其应用 |
CN111944856A (zh) * | 2020-08-10 | 2020-11-17 | 宁波酶赛生物工程有限公司 | 一种普瑞巴林中间体的合成方法 |
CN113502305A (zh) * | 2021-07-16 | 2021-10-15 | 台州学院 | 一种利用重组酰亚胺酶合成(r)-异丁基戊二酸单酰胺的方法 |
WO2023088077A1 (en) * | 2021-11-21 | 2023-05-25 | Enzymaster (Ningbo) Bio-Engineering Co., Ltd. | Biocatalysts and methods for the synthesis of pregabalin intermediates |
Non-Patent Citations (5)
Title |
---|
"Q3KAM5_PSEPF, AC:Q3KAM5";Bentley S.D.等;《uniprot》;第1-2页 * |
Bentley S.D.等."HYDA_STRCO,AC: O69809".《uniprot》.2020,第1-2页. * |
Bentley S.D.等."Q3KAM5_PSEPF, AC:Q3KAM5".《uniprot》.2020,第1-2页. * |
Imidase catalyzing desymmetric imide hydrolysis forming optically active 3-substituted glutaric acid monoamides for the synthesis of gamma-aminobutyric acid (GABA) analogs;Masutoshi Nojiri等;《Appl Microbiol Biotechnol》;第99卷;第9961-9969页 * |
生物催化技术制备手性酸医药化学品的研究进展;江千姿等;《合成化学》;第31卷(第6期);第476-496页 * |
Also Published As
Publication number | Publication date |
---|---|
CN113755539A (zh) | 2021-12-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112359036B (zh) | 一种催化活力和反应专一性提高的腈水解酶突变体及应用 | |
CN113462666B (zh) | 羰基还原酶突变体及其应用 | |
CN113355299B (zh) | 酮酸还原酶、基因、工程菌及在合成手性芳香2-羟酸中的应用 | |
CN117431228A (zh) | 一种高立体选择性转氨酶突变体、编码基因及其应用 | |
CN112852789B (zh) | 腈水解酶突变体及其应用 | |
CN111454918B (zh) | 一种烯醇还原酶突变体及其在制备(r)-香茅醛中的应用 | |
CN110358751B (zh) | 一种重组脂肪酶突变体、编码基因、重组工程菌及应用 | |
CN112175839A (zh) | 一种d-泛解酸内酯水解酶及其产生菌、基因工程菌及它们的应用 | |
CN109652470B (zh) | 一种脂肪酶在拆分(r,s)-扁桃酸甲酯中的应用 | |
CN113755539B (zh) | 一种二氢嘧啶氨基水解酶及其应用 | |
CN113322291A (zh) | 一种手性氨基醇类化合物的合成方法 | |
CN112143725B (zh) | 一种重组酯酶、编码基因、工程菌及在拆分甲霜灵中的应用 | |
CN116064447A (zh) | 转氨酶及其突变体在手性胺合成中的应用 | |
CN109943618B (zh) | 一种重组脂肪酶在拆分(R,S)-α-乙基-2-氧-1-吡咯烷乙酸甲酯中的应用 | |
CN110438194B (zh) | 一种脂肪酶在制备d-托品酸甲酯中的应用 | |
CN114908129A (zh) | 用于制备(r)-4-氯-3-羟基丁酸乙酯的脱氢酶 | |
CN110358804B (zh) | R-3-氨基正丁醇的酶法生产工艺 | |
CN114774491B (zh) | 制备(2s,3r)-2-(邻苯二甲酰亚胺基甲基)-3-羟基丁酸酯的方法 | |
CN113667704B (zh) | 一种两步酶法制备(r)-n-(2,6-二甲基苯基)氨基丙酸甲酯的方法 | |
CN115029329B (zh) | 一种羰基还原酶突变体及其在制备r-扁桃酸中的应用 | |
CN110272879B (zh) | 醛酮还原酶BcAKR及其突变体和应用 | |
CN112280756B (zh) | 异亮氨酸羟化酶突变体及在(2s,3r,4s)-4-羟基异亮氨酸合成中的应用 | |
CN111057736B (zh) | 一种脂肪酶在拆分boc-dl-脯氨酸甲酯中的应用 | |
CN110257350B (zh) | 醛酮还原酶BmAKR1及其突变体和应用 | |
CN109897836A (zh) | 一种来源于米曲霉的单胺氧化酶用于手性胺中间体的制备 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |