CN114717170A - 异源合成黄酮类化合物的宿主细胞及其应用 - Google Patents
异源合成黄酮类化合物的宿主细胞及其应用 Download PDFInfo
- Publication number
- CN114717170A CN114717170A CN202110009696.4A CN202110009696A CN114717170A CN 114717170 A CN114717170 A CN 114717170A CN 202110009696 A CN202110009696 A CN 202110009696A CN 114717170 A CN114717170 A CN 114717170A
- Authority
- CN
- China
- Prior art keywords
- leu
- ala
- val
- gly
- ser
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 230000002194 synthesizing effect Effects 0.000 title claims abstract description 27
- -1 flavonoid compound Chemical class 0.000 title claims description 37
- 229930003935 flavonoid Natural products 0.000 title description 11
- 235000017173 flavonoids Nutrition 0.000 title description 11
- FXNFHKRTJBSTCS-UHFFFAOYSA-N Baicalein Natural products C=1C(=O)C=2C(O)=C(O)C(O)=CC=2OC=1C1=CC=CC=C1 FXNFHKRTJBSTCS-UHFFFAOYSA-N 0.000 claims abstract description 78
- UDFLTIRFTXWNJO-UHFFFAOYSA-N baicalein Chemical compound O1C2=CC(=O)C(O)=C(O)C2=C(O)C=C1C1=CC=CC=C1 UDFLTIRFTXWNJO-UHFFFAOYSA-N 0.000 claims abstract description 78
- 229940015301 baicalein Drugs 0.000 claims abstract description 78
- JVXZRQGOGOXCEC-UHFFFAOYSA-N scutellarein Chemical compound C1=CC(O)=CC=C1C1=CC(=O)C2=C(O)C(O)=C(O)C=C2O1 JVXZRQGOGOXCEC-UHFFFAOYSA-N 0.000 claims abstract description 62
- RTIXKCRFFJGDFG-UHFFFAOYSA-N chrysin Chemical class C=1C(O)=CC(O)=C(C(C=2)=O)C=1OC=2C1=CC=CC=C1 RTIXKCRFFJGDFG-UHFFFAOYSA-N 0.000 claims abstract description 50
- 102000004190 Enzymes Human genes 0.000 claims abstract description 32
- 108090000790 Enzymes Proteins 0.000 claims abstract description 32
- 230000015572 biosynthetic process Effects 0.000 claims abstract description 26
- 238000003786 synthesis reaction Methods 0.000 claims abstract description 24
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 claims abstract description 21
- 239000008103 glucose Substances 0.000 claims abstract description 21
- 108090000623 proteins and genes Proteins 0.000 claims description 128
- 108700023158 Phenylalanine ammonia-lyases Proteins 0.000 claims description 72
- 108010016192 4-coumarate-CoA ligase Proteins 0.000 claims description 52
- 239000003446 ligand Substances 0.000 claims description 43
- NGSWKAQJJWESNS-UHFFFAOYSA-N 4-coumaric acid Chemical compound OC(=O)C=CC1=CC=C(O)C=C1 NGSWKAQJJWESNS-UHFFFAOYSA-N 0.000 claims description 30
- 210000001236 prokaryotic cell Anatomy 0.000 claims description 29
- 238000004519 manufacturing process Methods 0.000 claims description 28
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 claims description 26
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 claims description 26
- 238000000034 method Methods 0.000 claims description 23
- 101150023849 pheA gene Proteins 0.000 claims description 23
- NYCXYKOXLNBYID-UHFFFAOYSA-N 5,7-Dihydroxychromone Natural products O1C=CC(=O)C=2C1=CC(O)=CC=2O NYCXYKOXLNBYID-UHFFFAOYSA-N 0.000 claims description 21
- 210000004027 cell Anatomy 0.000 claims description 21
- 235000015838 chrysin Nutrition 0.000 claims description 21
- 229940043370 chrysin Drugs 0.000 claims description 21
- 108010004539 Chalcone isomerase Proteins 0.000 claims description 20
- 101150105575 ecpB gene Proteins 0.000 claims description 20
- 239000000758 substrate Substances 0.000 claims description 20
- 101150076125 aroG gene Proteins 0.000 claims description 19
- 230000004927 fusion Effects 0.000 claims description 18
- 101001112118 Homo sapiens NADPH-cytochrome P450 reductase Proteins 0.000 claims description 17
- 102100023897 NADPH-cytochrome P450 reductase Human genes 0.000 claims description 17
- 101150046913 ecpA gene Proteins 0.000 claims description 17
- 239000005516 coenzyme A Substances 0.000 claims description 16
- 229940093530 coenzyme a Drugs 0.000 claims description 16
- 108010060641 flavanone synthetase Proteins 0.000 claims description 16
- 229940093681 4-coumaric acid Drugs 0.000 claims description 15
- RGJOEKWQDUBAIZ-IBOSZNHHSA-N CoASH Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCS)O[C@H]1N1C2=NC=NC(N)=C2N=C1 RGJOEKWQDUBAIZ-IBOSZNHHSA-N 0.000 claims description 14
- RGJOEKWQDUBAIZ-UHFFFAOYSA-N coenzime A Natural products OC1C(OP(O)(O)=O)C(COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCS)OC1N1C2=NC=NC(N)=C2N=C1 RGJOEKWQDUBAIZ-UHFFFAOYSA-N 0.000 claims description 14
- KDTSHFARGAKYJN-UHFFFAOYSA-N dephosphocoenzyme A Natural products OC1C(O)C(COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCS)OC1N1C2=NC=NC(N)=C2N=C1 KDTSHFARGAKYJN-UHFFFAOYSA-N 0.000 claims description 14
- 229930003944 flavone Natural products 0.000 claims description 14
- 235000011949 flavones Nutrition 0.000 claims description 14
- 108090000364 Ligases Proteins 0.000 claims description 12
- 102000003960 Ligases Human genes 0.000 claims description 12
- 101100002724 Thermus thermophilus aroH gene Proteins 0.000 claims description 12
- GAMYVSCDDLXAQW-AOIWZFSPSA-N Thermopsosid Natural products O(C)c1c(O)ccc(C=2Oc3c(c(O)cc(O[C@H]4[C@H](O)[C@@H](O)[C@H](O)[C@H](CO)O4)c3)C(=O)C=2)c1 GAMYVSCDDLXAQW-AOIWZFSPSA-N 0.000 claims description 11
- 230000004850 protein–protein interaction Effects 0.000 claims description 11
- VHBFFQKBGNRLFZ-UHFFFAOYSA-N vitamin p Natural products O1C2=CC=CC=C2C(=O)C=C1C1=CC=CC=C1 VHBFFQKBGNRLFZ-UHFFFAOYSA-N 0.000 claims description 11
- 108090001036 flavone synthase I Proteins 0.000 claims description 9
- STBCOMYLYYPNLI-HZJYAPBZSA-N OC(=O)\C=C\C1=CC=C(O)C=C1.O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCS)O[C@H]1N1C2=NC=NC(N)=C2N=C1 Chemical compound OC(=O)\C=C\C1=CC=C(O)C=C1.O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCS)O[C@H]1N1C2=NC=NC(N)=C2N=C1 STBCOMYLYYPNLI-HZJYAPBZSA-N 0.000 claims description 8
- 102000000470 PDZ domains Human genes 0.000 claims description 8
- 108050008994 PDZ domains Proteins 0.000 claims description 8
- 102000000395 SH3 domains Human genes 0.000 claims description 8
- 108050008861 SH3 domains Proteins 0.000 claims description 8
- 108020001507 fusion proteins Proteins 0.000 claims description 7
- 102000037865 fusion proteins Human genes 0.000 claims description 7
- DQFBYFPFKXHELB-UHFFFAOYSA-N Chalcone Natural products C=1C=CC=CC=1C(=O)C=CC1=CC=CC=C1 DQFBYFPFKXHELB-UHFFFAOYSA-N 0.000 claims description 6
- 235000005513 chalcones Nutrition 0.000 claims description 6
- 150000002212 flavone derivatives Chemical class 0.000 claims description 6
- LTYOQGRJFJAKNA-KKIMTKSISA-N Malonyl CoA Natural products S(C(=O)CC(=O)O)CCNC(=O)CCNC(=O)[C@@H](O)C(CO[P@](=O)(O[P@](=O)(OC[C@H]1[C@@H](OP(=O)(O)O)[C@@H](O)[C@@H](n2c3ncnc(N)c3nc2)O1)O)O)(C)C LTYOQGRJFJAKNA-KKIMTKSISA-N 0.000 claims description 3
- LTYOQGRJFJAKNA-DVVLENMVSA-N malonyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)CC(O)=O)O[C@H]1N1C2=NC=NC(N)=C2N=C1 LTYOQGRJFJAKNA-DVVLENMVSA-N 0.000 claims description 3
- 230000035772 mutation Effects 0.000 claims description 2
- 102000005870 Coenzyme A Ligases Human genes 0.000 claims 2
- 108010011449 Long-chain-fatty-acid-CoA ligase Proteins 0.000 claims 2
- NGSWKAQJJWESNS-ZZXKWVIFSA-N trans-4-coumaric acid Chemical compound OC(=O)\C=C\C1=CC=C(O)C=C1 NGSWKAQJJWESNS-ZZXKWVIFSA-N 0.000 claims 1
- 238000012986 modification Methods 0.000 abstract description 25
- 230000004048 modification Effects 0.000 abstract description 25
- 241000894006 Bacteria Species 0.000 abstract description 17
- 238000001338 self-assembly Methods 0.000 abstract description 11
- 238000005457 optimization Methods 0.000 abstract description 4
- 238000005516 engineering process Methods 0.000 abstract description 3
- 239000013612 plasmid Substances 0.000 description 48
- 102000004169 proteins and genes Human genes 0.000 description 37
- 229960005190 phenylalanine Drugs 0.000 description 25
- 108090000765 processed proteins & peptides Proteins 0.000 description 23
- 238000000855 fermentation Methods 0.000 description 21
- 230000004151 fermentation Effects 0.000 description 21
- 150000001875 compounds Chemical class 0.000 description 20
- 108010050848 glycylleucine Proteins 0.000 description 20
- 241000588724 Escherichia coli Species 0.000 description 17
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 17
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 17
- 238000003752 polymerase chain reaction Methods 0.000 description 17
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 17
- 239000002243 precursor Substances 0.000 description 15
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 14
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 14
- 108010038633 aspartylglutamate Proteins 0.000 description 14
- 150000001413 amino acids Chemical group 0.000 description 13
- 230000014509 gene expression Effects 0.000 description 13
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 13
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 13
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 12
- 108010005233 alanylglutamic acid Proteins 0.000 description 12
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 11
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 11
- 108010062796 arginyllysine Proteins 0.000 description 11
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 10
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 10
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 10
- 108010093581 aspartyl-proline Proteins 0.000 description 10
- 238000012258 culturing Methods 0.000 description 10
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 10
- 108010064235 lysylglycine Proteins 0.000 description 10
- 239000000047 product Substances 0.000 description 10
- 241000196324 Embryophyta Species 0.000 description 9
- AKOYRLRUFBZOSP-BJDJZHNGSA-N Ile-Lys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N AKOYRLRUFBZOSP-BJDJZHNGSA-N 0.000 description 9
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 9
- WINFHLHJTRGLCV-BZSNNMDCSA-N Lys-Tyr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 WINFHLHJTRGLCV-BZSNNMDCSA-N 0.000 description 9
- 238000010276 construction Methods 0.000 description 9
- 238000001514 detection method Methods 0.000 description 9
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 9
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 9
- 108010057821 leucylproline Proteins 0.000 description 9
- 239000007788 liquid Substances 0.000 description 9
- 108010017391 lysylvaline Proteins 0.000 description 9
- 229920001184 polypeptide Polymers 0.000 description 9
- 102000004196 processed proteins & peptides Human genes 0.000 description 9
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 9
- HKZAAJSTFUZYTO-LURJTMIESA-N (2s)-2-[[2-[[2-[[2-[(2-aminoacetyl)amino]acetyl]amino]acetyl]amino]acetyl]amino]-3-hydroxypropanoic acid Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O HKZAAJSTFUZYTO-LURJTMIESA-N 0.000 description 8
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 8
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 8
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 8
- HUWSBFYAGXCXKC-CIUDSAMLSA-N Glu-Ala-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O HUWSBFYAGXCXKC-CIUDSAMLSA-N 0.000 description 8
- WTOAPTKSZJJWKK-HTFCKZLJSA-N Ile-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N WTOAPTKSZJJWKK-HTFCKZLJSA-N 0.000 description 8
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 8
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 8
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 8
- 241000589194 Rhizobium leguminosarum Species 0.000 description 8
- DKDHTRVDOUZZTP-IFFSRLJSSA-N Thr-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DKDHTRVDOUZZTP-IFFSRLJSSA-N 0.000 description 8
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 8
- 108010092854 aspartyllysine Proteins 0.000 description 8
- 239000001963 growth medium Substances 0.000 description 8
- 108010040030 histidinoalanine Proteins 0.000 description 8
- 108010034529 leucyl-lysine Proteins 0.000 description 8
- 239000012071 phase Substances 0.000 description 8
- DJSISFGPUUYILV-ZFORQUDYSA-N scutellarin Chemical class O1[C@H](C(O)=O)[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1OC(C(=C1O)O)=CC2=C1C(=O)C=C(C=1C=CC(O)=CC=1)O2 DJSISFGPUUYILV-ZFORQUDYSA-N 0.000 description 8
- 229960000268 spectinomycin Drugs 0.000 description 8
- UNFWWIHTNXNPBV-WXKVUWSESA-N spectinomycin Chemical compound O([C@@H]1[C@@H](NC)[C@@H](O)[C@H]([C@@H]([C@H]1O1)O)NC)[C@]2(O)[C@H]1O[C@H](C)CC2=O UNFWWIHTNXNPBV-WXKVUWSESA-N 0.000 description 8
- 108010061238 threonyl-glycine Proteins 0.000 description 8
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 7
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 7
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 7
- OHZIZVWQXJPBJS-IXOXFDKPSA-N Leu-His-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OHZIZVWQXJPBJS-IXOXFDKPSA-N 0.000 description 7
- DJSISFGPUUYILV-UHFFFAOYSA-N UNPD161792 Natural products O1C(C(O)=O)C(O)C(O)C(O)C1OC(C(=C1O)O)=CC2=C1C(=O)C=C(C=1C=CC(O)=CC=1)O2 DJSISFGPUUYILV-UHFFFAOYSA-N 0.000 description 7
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 7
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 7
- NPLTVGMLNDMOQE-UHFFFAOYSA-N carthamidin Natural products C1=CC(O)=CC=C1C1OC2=CC(O)=C(O)C(O)=C2C(=O)C1 NPLTVGMLNDMOQE-UHFFFAOYSA-N 0.000 description 7
- 238000005520 cutting process Methods 0.000 description 7
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 7
- 230000037361 pathway Effects 0.000 description 7
- 229930190376 scutellarin Natural products 0.000 description 7
- 108010000998 wheylin-2 peptide Proteins 0.000 description 7
- LBOLGUYQEPZSKM-YUMQZZPRSA-N Cys-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N LBOLGUYQEPZSKM-YUMQZZPRSA-N 0.000 description 6
- 108020004414 DNA Proteins 0.000 description 6
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 6
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 6
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 6
- OWYIDJCNRWRSJY-QTKMDUPCSA-N His-Pro-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O OWYIDJCNRWRSJY-QTKMDUPCSA-N 0.000 description 6
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 6
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 6
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 6
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 6
- WZVSHTFTCYOFPL-GARJFASQSA-N Lys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N)C(=O)O WZVSHTFTCYOFPL-GARJFASQSA-N 0.000 description 6
- JHDNAOVJJQSMMM-GMOBBJLQSA-N Met-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N JHDNAOVJJQSMMM-GMOBBJLQSA-N 0.000 description 6
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 6
- 241000208317 Petroselinum Species 0.000 description 6
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 6
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 6
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 6
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 6
- 229940024606 amino acid Drugs 0.000 description 6
- 229960000723 ampicillin Drugs 0.000 description 6
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 6
- 108010077245 asparaginyl-proline Proteins 0.000 description 6
- 229960005091 chloramphenicol Drugs 0.000 description 6
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 6
- 238000000605 extraction Methods 0.000 description 6
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 6
- 108010081551 glycylphenylalanine Proteins 0.000 description 6
- 238000004128 high performance liquid chromatography Methods 0.000 description 6
- 230000001965 increasing effect Effects 0.000 description 6
- 239000002609 medium Substances 0.000 description 6
- 150000007523 nucleic acids Chemical group 0.000 description 6
- 235000011197 perejil Nutrition 0.000 description 6
- 230000009465 prokaryotic expression Effects 0.000 description 6
- 108010073969 valyllysine Proteins 0.000 description 6
- 239000013598 vector Substances 0.000 description 6
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 5
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 5
- RXTBLQVXNIECFP-FXQIFTODSA-N Ala-Gln-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RXTBLQVXNIECFP-FXQIFTODSA-N 0.000 description 5
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 5
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 5
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 5
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 5
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 5
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 5
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 5
- SCQIQCWLOMOEFP-DCAQKATOSA-N Asp-Leu-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SCQIQCWLOMOEFP-DCAQKATOSA-N 0.000 description 5
- GXIUDSXIUSTSLO-QXEWZRGKSA-N Asp-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N GXIUDSXIUSTSLO-QXEWZRGKSA-N 0.000 description 5
- DTCCMDYODDPHBG-ACZMJKKPSA-N Gln-Ala-Cys Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O DTCCMDYODDPHBG-ACZMJKKPSA-N 0.000 description 5
- PGPJSRSLQNXBDT-YUMQZZPRSA-N Gln-Arg-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O PGPJSRSLQNXBDT-YUMQZZPRSA-N 0.000 description 5
- ILKYYKRAULNYMS-JYJNAYRXSA-N Gln-Lys-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ILKYYKRAULNYMS-JYJNAYRXSA-N 0.000 description 5
- YVYVMJNUENBOOL-KBIXCLLPSA-N Glu-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N YVYVMJNUENBOOL-KBIXCLLPSA-N 0.000 description 5
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 5
- SWRVAQHFBRZVNX-GUBZILKMSA-N Glu-Lys-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SWRVAQHFBRZVNX-GUBZILKMSA-N 0.000 description 5
- SOEPMWQCTJITPZ-SRVKXCTJSA-N Glu-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N SOEPMWQCTJITPZ-SRVKXCTJSA-N 0.000 description 5
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 5
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 5
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 5
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 5
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 5
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 5
- JMSONHOUHFDOJH-GUBZILKMSA-N His-Ser-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 JMSONHOUHFDOJH-GUBZILKMSA-N 0.000 description 5
- DMZOUKXXHJQPTL-GRLWGSQLSA-N Ile-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N DMZOUKXXHJQPTL-GRLWGSQLSA-N 0.000 description 5
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 5
- WSSGUVAKYCQSCT-XUXIUFHCSA-N Ile-Met-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)O)N WSSGUVAKYCQSCT-XUXIUFHCSA-N 0.000 description 5
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 5
- REXAUQBGSGDEJY-IGISWZIWSA-N Ile-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N REXAUQBGSGDEJY-IGISWZIWSA-N 0.000 description 5
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 5
- XBCWOTOCBXXJDG-BZSNNMDCSA-N Leu-His-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 XBCWOTOCBXXJDG-BZSNNMDCSA-N 0.000 description 5
- ZALAVHVPPOHAOL-XUXIUFHCSA-N Leu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N ZALAVHVPPOHAOL-XUXIUFHCSA-N 0.000 description 5
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 5
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 5
- LMKSBGIUPVRHEH-FXQIFTODSA-N Met-Ala-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(N)=O LMKSBGIUPVRHEH-FXQIFTODSA-N 0.000 description 5
- FMMIYCMOVGXZIP-AVGNSLFASA-N Phe-Glu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O FMMIYCMOVGXZIP-AVGNSLFASA-N 0.000 description 5
- RVEVENLSADZUMS-IHRRRGAJSA-N Phe-Pro-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RVEVENLSADZUMS-IHRRRGAJSA-N 0.000 description 5
- JSGWNFKWZNPDAV-YDHLFZDLSA-N Phe-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JSGWNFKWZNPDAV-YDHLFZDLSA-N 0.000 description 5
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 5
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 5
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 5
- FZXSYIPVAFVYBH-KKUMJFAQSA-N Pro-Tyr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O FZXSYIPVAFVYBH-KKUMJFAQSA-N 0.000 description 5
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 5
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 5
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 5
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 5
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 5
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 5
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 5
- AKXBNSZMYAOGLS-STQMWFEESA-N Tyr-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AKXBNSZMYAOGLS-STQMWFEESA-N 0.000 description 5
- OLWFDNLLBWQWCP-STQMWFEESA-N Tyr-Gly-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O OLWFDNLLBWQWCP-STQMWFEESA-N 0.000 description 5
- JXGUUJMPCRXMSO-HJOGWXRNSA-N Tyr-Phe-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 JXGUUJMPCRXMSO-HJOGWXRNSA-N 0.000 description 5
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 5
- JFAWZADYPRMRCO-UBHSHLNASA-N Val-Ala-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JFAWZADYPRMRCO-UBHSHLNASA-N 0.000 description 5
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 5
- 108010047495 alanylglycine Proteins 0.000 description 5
- 108010087924 alanylproline Proteins 0.000 description 5
- 239000003153 chemical reaction reagent Substances 0.000 description 5
- 108010004073 cysteinylcysteine Proteins 0.000 description 5
- 108010069495 cysteinyltyrosine Proteins 0.000 description 5
- 108010049041 glutamylalanine Proteins 0.000 description 5
- 108010079547 glutamylmethionine Proteins 0.000 description 5
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 5
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 5
- 108010038320 lysylphenylalanine Proteins 0.000 description 5
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 5
- 108010070643 prolylglutamic acid Proteins 0.000 description 5
- 108010053725 prolylvaline Proteins 0.000 description 5
- 108010026333 seryl-proline Proteins 0.000 description 5
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 5
- OZRFYUJEXYKQDV-UHFFFAOYSA-N 2-[[2-[[2-[(2-amino-3-carboxypropanoyl)amino]-3-carboxypropanoyl]amino]-3-carboxypropanoyl]amino]butanedioic acid Chemical compound OC(=O)CC(N)C(=O)NC(CC(O)=O)C(=O)NC(CC(O)=O)C(=O)NC(CC(O)=O)C(O)=O OZRFYUJEXYKQDV-UHFFFAOYSA-N 0.000 description 4
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 4
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 4
- FVSOUJZKYWEFOB-KBIXCLLPSA-N Ala-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)N FVSOUJZKYWEFOB-KBIXCLLPSA-N 0.000 description 4
- CWEAKSWWKHGTRJ-BQBZGAKWSA-N Ala-Gly-Met Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O CWEAKSWWKHGTRJ-BQBZGAKWSA-N 0.000 description 4
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 4
- JDIQCVUDDFENPU-ZKWXMUAHSA-N Ala-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CNC=N1 JDIQCVUDDFENPU-ZKWXMUAHSA-N 0.000 description 4
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 4
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 4
- GFEDXKNBZMPEDM-KZVJFYERSA-N Ala-Met-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFEDXKNBZMPEDM-KZVJFYERSA-N 0.000 description 4
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 4
- OMCKWYSDUQBYCN-FXQIFTODSA-N Ala-Ser-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O OMCKWYSDUQBYCN-FXQIFTODSA-N 0.000 description 4
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 4
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 4
- 241000219195 Arabidopsis thaliana Species 0.000 description 4
- LMPKCSXZJSXBBL-NHCYSSNCSA-N Arg-Gln-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O LMPKCSXZJSXBBL-NHCYSSNCSA-N 0.000 description 4
- NXDXECQFKHXHAM-HJGDQZAQSA-N Arg-Glu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NXDXECQFKHXHAM-HJGDQZAQSA-N 0.000 description 4
- KRQSPVKUISQQFS-FJXKBIBVSA-N Arg-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N KRQSPVKUISQQFS-FJXKBIBVSA-N 0.000 description 4
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 4
- AWMAZIIEFPFHCP-RCWTZXSCSA-N Arg-Pro-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWMAZIIEFPFHCP-RCWTZXSCSA-N 0.000 description 4
- AUZAXCPWMDBWEE-HJGDQZAQSA-N Arg-Thr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O AUZAXCPWMDBWEE-HJGDQZAQSA-N 0.000 description 4
- INOIAEUXVVNJKA-XGEHTFHBSA-N Arg-Thr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O INOIAEUXVVNJKA-XGEHTFHBSA-N 0.000 description 4
- QNYWYYNQSXANBL-WDSOQIARSA-N Arg-Trp-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N QNYWYYNQSXANBL-WDSOQIARSA-N 0.000 description 4
- FXGMURPOWCKNAZ-JYJNAYRXSA-N Arg-Val-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FXGMURPOWCKNAZ-JYJNAYRXSA-N 0.000 description 4
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 4
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 4
- QHBMKQWOIYJYMI-BYULHYEWSA-N Asn-Asn-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QHBMKQWOIYJYMI-BYULHYEWSA-N 0.000 description 4
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 4
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 4
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 4
- TZFQICWZWFNIKU-KKUMJFAQSA-N Asn-Leu-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 TZFQICWZWFNIKU-KKUMJFAQSA-N 0.000 description 4
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 4
- AYOAHKWVQLNPDM-HJGDQZAQSA-N Asn-Lys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AYOAHKWVQLNPDM-HJGDQZAQSA-N 0.000 description 4
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 4
- SZNGQSBRHFMZLT-IHRRRGAJSA-N Asn-Pro-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SZNGQSBRHFMZLT-IHRRRGAJSA-N 0.000 description 4
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 4
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 4
- UGIBTKGQVWFTGX-BIIVOSGPSA-N Asp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O UGIBTKGQVWFTGX-BIIVOSGPSA-N 0.000 description 4
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 4
- PMEHKVHZQKJACS-PEFMBERDSA-N Asp-Gln-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PMEHKVHZQKJACS-PEFMBERDSA-N 0.000 description 4
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 4
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 4
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 4
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 4
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 4
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 description 4
- PLNJUJGNLDSFOP-UWJYBYFXSA-N Asp-Tyr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PLNJUJGNLDSFOP-UWJYBYFXSA-N 0.000 description 4
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 4
- 101150023103 CHI gene Proteins 0.000 description 4
- 101150058917 CHS gene Proteins 0.000 description 4
- 108091026890 Coding region Proteins 0.000 description 4
- FCXJJTRGVAZDER-FXQIFTODSA-N Cys-Val-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O FCXJJTRGVAZDER-FXQIFTODSA-N 0.000 description 4
- 101150044894 ER gene Proteins 0.000 description 4
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 4
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 4
- MLZRSFQRBDNJON-GUBZILKMSA-N Gln-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MLZRSFQRBDNJON-GUBZILKMSA-N 0.000 description 4
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 4
- BTSPOOHJBYJRKO-CIUDSAMLSA-N Gln-Asp-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BTSPOOHJBYJRKO-CIUDSAMLSA-N 0.000 description 4
- NSNUZSPSADIMJQ-WDSKDSINSA-N Gln-Gly-Asp Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NSNUZSPSADIMJQ-WDSKDSINSA-N 0.000 description 4
- LKVCNGLNTAPMSZ-JYJNAYRXSA-N Gln-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N LKVCNGLNTAPMSZ-JYJNAYRXSA-N 0.000 description 4
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 4
- BYKZWDGMJLNFJY-XKBZYTNZSA-N Gln-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)O BYKZWDGMJLNFJY-XKBZYTNZSA-N 0.000 description 4
- UXXIVIQGOODKQC-NUMRIWBASA-N Gln-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UXXIVIQGOODKQC-NUMRIWBASA-N 0.000 description 4
- BBFCMGBMYIAGRS-AUTRQRHGSA-N Gln-Val-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BBFCMGBMYIAGRS-AUTRQRHGSA-N 0.000 description 4
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 4
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 4
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 4
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 4
- XOIATPHFYVWFEU-DCAQKATOSA-N Glu-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XOIATPHFYVWFEU-DCAQKATOSA-N 0.000 description 4
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 4
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 4
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 4
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 4
- ZGEJRLJEAMPEDV-SRVKXCTJSA-N Glu-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N ZGEJRLJEAMPEDV-SRVKXCTJSA-N 0.000 description 4
- UMHRCVCZUPBBQW-GARJFASQSA-N Glu-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UMHRCVCZUPBBQW-GARJFASQSA-N 0.000 description 4
- GQGAFTPXAPKSCF-WHFBIAKZSA-N Gly-Ala-Cys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O GQGAFTPXAPKSCF-WHFBIAKZSA-N 0.000 description 4
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 4
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 4
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 4
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 4
- YFGONBOFGGWKKY-VHSXEESVSA-N Gly-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)CN)C(=O)O YFGONBOFGGWKKY-VHSXEESVSA-N 0.000 description 4
- VIIBEIQMLJEUJG-LAEOZQHASA-N Gly-Ile-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O VIIBEIQMLJEUJG-LAEOZQHASA-N 0.000 description 4
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 4
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 4
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 4
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 4
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 4
- YYXJFBMCOUSYSF-RYUDHWBXSA-N Gly-Phe-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYXJFBMCOUSYSF-RYUDHWBXSA-N 0.000 description 4
- IBYOLNARKHMLBG-WHOFXGATSA-N Gly-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IBYOLNARKHMLBG-WHOFXGATSA-N 0.000 description 4
- IXHQLZIWBCQBLQ-STQMWFEESA-N Gly-Pro-Phe Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IXHQLZIWBCQBLQ-STQMWFEESA-N 0.000 description 4
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 4
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 4
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 4
- MUGLKCQHTUFLGF-WPRPVWTQSA-N Gly-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)CN MUGLKCQHTUFLGF-WPRPVWTQSA-N 0.000 description 4
- QSLKWWDKIXMWJV-SRVKXCTJSA-N His-Cys-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N QSLKWWDKIXMWJV-SRVKXCTJSA-N 0.000 description 4
- CTGZVVQVIBSOBB-AVGNSLFASA-N His-His-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O CTGZVVQVIBSOBB-AVGNSLFASA-N 0.000 description 4
- JIUYRPFQJJRSJB-QWRGUYRKSA-N His-His-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)NCC(O)=O)C1=CN=CN1 JIUYRPFQJJRSJB-QWRGUYRKSA-N 0.000 description 4
- PBVQWNDMFFCPIZ-ULQDDVLXSA-N His-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 PBVQWNDMFFCPIZ-ULQDDVLXSA-N 0.000 description 4
- KAXZXLSXFWSNNZ-XVYDVKMFSA-N His-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KAXZXLSXFWSNNZ-XVYDVKMFSA-N 0.000 description 4
- DEMIXZCKUXVEBO-BWAGICSOSA-N His-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O DEMIXZCKUXVEBO-BWAGICSOSA-N 0.000 description 4
- WSXNWASHQNSMRX-GVXVVHGQSA-N His-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WSXNWASHQNSMRX-GVXVVHGQSA-N 0.000 description 4
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 4
- AZEYWPUCOYXFOE-CYDGBPFRSA-N Ile-Arg-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N AZEYWPUCOYXFOE-CYDGBPFRSA-N 0.000 description 4
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 4
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 4
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 4
- UYODHPPSCXBNCS-XUXIUFHCSA-N Ile-Val-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C UYODHPPSCXBNCS-XUXIUFHCSA-N 0.000 description 4
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 4
- 101150042441 K gene Proteins 0.000 description 4
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 4
- 239000012880 LB liquid culture medium Substances 0.000 description 4
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 4
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 4
- IASQBRJGRVXNJI-YUMQZZPRSA-N Leu-Cys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)NCC(O)=O IASQBRJGRVXNJI-YUMQZZPRSA-N 0.000 description 4
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 4
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 4
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 4
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 4
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 4
- YWYQSLOTVIRCFE-SRVKXCTJSA-N Leu-His-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O YWYQSLOTVIRCFE-SRVKXCTJSA-N 0.000 description 4
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 4
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 4
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 4
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 4
- VVQJGYPTIYOFBR-IHRRRGAJSA-N Leu-Lys-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N VVQJGYPTIYOFBR-IHRRRGAJSA-N 0.000 description 4
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 4
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 4
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 4
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 4
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 4
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 4
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 4
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 4
- FGZVGOAAROXFAB-IXOXFDKPSA-N Leu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N)O FGZVGOAAROXFAB-IXOXFDKPSA-N 0.000 description 4
- RIHIGSWBLHSGLV-CQDKDKBSSA-N Leu-Tyr-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O RIHIGSWBLHSGLV-CQDKDKBSSA-N 0.000 description 4
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 4
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 4
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 4
- VHNOAIFVYUQOOY-XUXIUFHCSA-N Lys-Arg-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VHNOAIFVYUQOOY-XUXIUFHCSA-N 0.000 description 4
- HWMZUBUEOYAQSC-DCAQKATOSA-N Lys-Gln-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O HWMZUBUEOYAQSC-DCAQKATOSA-N 0.000 description 4
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 4
- NDORZBUHCOJQDO-GVXVVHGQSA-N Lys-Gln-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O NDORZBUHCOJQDO-GVXVVHGQSA-N 0.000 description 4
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 4
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 4
- PYFNONMJYNJENN-AVGNSLFASA-N Lys-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PYFNONMJYNJENN-AVGNSLFASA-N 0.000 description 4
- CTJUSALVKAWFFU-CIUDSAMLSA-N Lys-Ser-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N CTJUSALVKAWFFU-CIUDSAMLSA-N 0.000 description 4
- TVHCDSBMFQYPNA-RHYQMDGZSA-N Lys-Thr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TVHCDSBMFQYPNA-RHYQMDGZSA-N 0.000 description 4
- BWECSLVQIWEMSC-IHRRRGAJSA-N Lys-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N BWECSLVQIWEMSC-IHRRRGAJSA-N 0.000 description 4
- ULNXMMYXQKGNPG-LPEHRKFASA-N Met-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N ULNXMMYXQKGNPG-LPEHRKFASA-N 0.000 description 4
- UYAKZHGIPRCGPF-CIUDSAMLSA-N Met-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N UYAKZHGIPRCGPF-CIUDSAMLSA-N 0.000 description 4
- DGNZGCQSVGGYJS-BQBZGAKWSA-N Met-Gly-Asp Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O DGNZGCQSVGGYJS-BQBZGAKWSA-N 0.000 description 4
- QZPXMHVKPHJNTR-DCAQKATOSA-N Met-Leu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O QZPXMHVKPHJNTR-DCAQKATOSA-N 0.000 description 4
- AXHNAGAYRGCDLG-UWVGGRQHSA-N Met-Lys-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AXHNAGAYRGCDLG-UWVGGRQHSA-N 0.000 description 4
- GGXZOTSDJJTDGB-GUBZILKMSA-N Met-Ser-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O GGXZOTSDJJTDGB-GUBZILKMSA-N 0.000 description 4
- CIIJWIAORKTXAH-FJXKBIBVSA-N Met-Thr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O CIIJWIAORKTXAH-FJXKBIBVSA-N 0.000 description 4
- ZBLSZPYQQRIHQU-RCWTZXSCSA-N Met-Thr-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ZBLSZPYQQRIHQU-RCWTZXSCSA-N 0.000 description 4
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 4
- 108010079364 N-glycylalanine Proteins 0.000 description 4
- 108010065395 Neuropep-1 Proteins 0.000 description 4
- 101100391071 Petroselinum crispum FNSI gene Proteins 0.000 description 4
- 240000007377 Petunia x hybrida Species 0.000 description 4
- JEGFCFLCRSJCMA-IHRRRGAJSA-N Phe-Arg-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N JEGFCFLCRSJCMA-IHRRRGAJSA-N 0.000 description 4
- OYQBFWWQSVIHBN-FHWLQOOXSA-N Phe-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O OYQBFWWQSVIHBN-FHWLQOOXSA-N 0.000 description 4
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 4
- NPLGQVKZFGJWAI-QWHCGFSZSA-N Phe-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O NPLGQVKZFGJWAI-QWHCGFSZSA-N 0.000 description 4
- ZKSLXIGKRJMALF-MGHWNKPDSA-N Phe-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N ZKSLXIGKRJMALF-MGHWNKPDSA-N 0.000 description 4
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 4
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 4
- WDOCBGZHAQQIBL-IHPCNDPISA-N Phe-Trp-Ser Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CO)C(O)=O)C1=CC=CC=C1 WDOCBGZHAQQIBL-IHPCNDPISA-N 0.000 description 4
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 4
- PZSCUPVOJGKHEP-CIUDSAMLSA-N Pro-Gln-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PZSCUPVOJGKHEP-CIUDSAMLSA-N 0.000 description 4
- YSUZKYSRAFNLRB-ULQDDVLXSA-N Pro-Gln-Trp Chemical compound N([C@@H](CCC(=O)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C(=O)[C@@H]1CCCN1 YSUZKYSRAFNLRB-ULQDDVLXSA-N 0.000 description 4
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 4
- YTWNSIDWAFSEEI-RWMBFGLXSA-N Pro-His-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N3CCC[C@@H]3C(=O)O YTWNSIDWAFSEEI-RWMBFGLXSA-N 0.000 description 4
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 4
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 4
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 4
- CDGABSWLRMECHC-IHRRRGAJSA-N Pro-Lys-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O CDGABSWLRMECHC-IHRRRGAJSA-N 0.000 description 4
- WHNJMTHJGCEKGA-ULQDDVLXSA-N Pro-Phe-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WHNJMTHJGCEKGA-ULQDDVLXSA-N 0.000 description 4
- WWXNZNWZNZPDIF-SRVKXCTJSA-N Pro-Val-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 WWXNZNWZNZPDIF-SRVKXCTJSA-N 0.000 description 4
- 108010025216 RVF peptide Proteins 0.000 description 4
- 244000042430 Rhodiola rosea Species 0.000 description 4
- 235000003713 Rhodiola rosea Nutrition 0.000 description 4
- 240000004534 Scutellaria baicalensis Species 0.000 description 4
- 235000017089 Scutellaria baicalensis Nutrition 0.000 description 4
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 4
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 4
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 4
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 4
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 4
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 4
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 4
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 4
- GSCVDSBEYVGMJQ-SRVKXCTJSA-N Ser-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)O GSCVDSBEYVGMJQ-SRVKXCTJSA-N 0.000 description 4
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 4
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 4
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 4
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 4
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 4
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 4
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 4
- KWQBJOUOSNJDRR-XAVMHZPKSA-N Thr-Cys-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N)O KWQBJOUOSNJDRR-XAVMHZPKSA-N 0.000 description 4
- GARULAKWZGFIKC-RWRJDSDZSA-N Thr-Gln-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GARULAKWZGFIKC-RWRJDSDZSA-N 0.000 description 4
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 4
- CRZNCABIJLRFKZ-IUKAMOBKSA-N Thr-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N CRZNCABIJLRFKZ-IUKAMOBKSA-N 0.000 description 4
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 4
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 4
- UJQVSMNQMQHVRY-KZVJFYERSA-N Thr-Met-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UJQVSMNQMQHVRY-KZVJFYERSA-N 0.000 description 4
- MCDVZTRGHNXTGK-HJGDQZAQSA-N Thr-Met-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O MCDVZTRGHNXTGK-HJGDQZAQSA-N 0.000 description 4
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 4
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 4
- SBYQHZCMVSPQCS-RCWTZXSCSA-N Thr-Val-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O SBYQHZCMVSPQCS-RCWTZXSCSA-N 0.000 description 4
- NQJDICVXXIMMMB-XDTLVQLUSA-N Tyr-Glu-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O NQJDICVXXIMMMB-XDTLVQLUSA-N 0.000 description 4
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 4
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 4
- BIWVVOHTKDLRMP-ULQDDVLXSA-N Tyr-Pro-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BIWVVOHTKDLRMP-ULQDDVLXSA-N 0.000 description 4
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 4
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 4
- UUYCNAXCCDNULB-QXEWZRGKSA-N Val-Arg-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O UUYCNAXCCDNULB-QXEWZRGKSA-N 0.000 description 4
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 4
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 4
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 4
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 4
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 4
- GMOLURHJBLOBFW-ONGXEEELSA-N Val-Gly-His Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GMOLURHJBLOBFW-ONGXEEELSA-N 0.000 description 4
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 4
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 4
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 4
- AIWLHFZYOUUJGB-UFYCRDLUSA-N Val-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 AIWLHFZYOUUJGB-UFYCRDLUSA-N 0.000 description 4
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 4
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 4
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 4
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 4
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 4
- SSKKGOWRPNIVDW-AVGNSLFASA-N Val-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SSKKGOWRPNIVDW-AVGNSLFASA-N 0.000 description 4
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 4
- 108010066829 alanyl-glutamyl-aspartylprolyine Proteins 0.000 description 4
- KZNIFHPLKGYRTM-UHFFFAOYSA-N apigenin Chemical compound C1=CC(O)=CC=C1C1=CC(=O)C2=C(O)C=C(O)C=C2O1 KZNIFHPLKGYRTM-UHFFFAOYSA-N 0.000 description 4
- XADJWCRESPGUTB-UHFFFAOYSA-N apigenin Natural products C1=CC(O)=CC=C1C1=CC(=O)C2=CC(O)=C(O)C=C2O1 XADJWCRESPGUTB-UHFFFAOYSA-N 0.000 description 4
- 235000008714 apigenin Nutrition 0.000 description 4
- 229940117893 apigenin Drugs 0.000 description 4
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 4
- 108010060035 arginylproline Proteins 0.000 description 4
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 4
- 210000004899 c-terminal region Anatomy 0.000 description 4
- 238000003776 cleavage reaction Methods 0.000 description 4
- 238000001816 cooling Methods 0.000 description 4
- 108010016616 cysteinylglycine Proteins 0.000 description 4
- 108010054813 diprotin B Proteins 0.000 description 4
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 4
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 4
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 4
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 4
- 108010084389 glycyltryptophan Proteins 0.000 description 4
- 108010025306 histidylleucine Proteins 0.000 description 4
- 239000000411 inducer Substances 0.000 description 4
- 230000006698 induction Effects 0.000 description 4
- 108010027338 isoleucylcysteine Proteins 0.000 description 4
- 244000005700 microbiome Species 0.000 description 4
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 4
- 108091033319 polynucleotide Proteins 0.000 description 4
- 102000040430 polynucleotide Human genes 0.000 description 4
- 239000002157 polynucleotide Substances 0.000 description 4
- 108010004914 prolylarginine Proteins 0.000 description 4
- 238000005070 sampling Methods 0.000 description 4
- 230000007017 scission Effects 0.000 description 4
- 108010071207 serylmethionine Proteins 0.000 description 4
- 239000007787 solid Substances 0.000 description 4
- 108010080629 tryptophan-leucine Proteins 0.000 description 4
- 108010051110 tyrosyl-lysine Proteins 0.000 description 4
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 4
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 3
- 101150029435 4CL gene Proteins 0.000 description 3
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 3
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 3
- LKDHUGLXOHYINY-XUXIUFHCSA-N Arg-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LKDHUGLXOHYINY-XUXIUFHCSA-N 0.000 description 3
- CVXXSWQORBZAAA-SRVKXCTJSA-N Arg-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N CVXXSWQORBZAAA-SRVKXCTJSA-N 0.000 description 3
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 3
- XEKOWRVHYACXOJ-UHFFFAOYSA-N Ethyl acetate Chemical compound CCOC(C)=O XEKOWRVHYACXOJ-UHFFFAOYSA-N 0.000 description 3
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 3
- LZRMPXRYLLTAJX-GUBZILKMSA-N Gln-Arg-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZRMPXRYLLTAJX-GUBZILKMSA-N 0.000 description 3
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 3
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 3
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- UYNXBNHVWFNVIN-HJWJTTGWSA-N Ile-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 UYNXBNHVWFNVIN-HJWJTTGWSA-N 0.000 description 3
- NLZVTPYXYXMCIP-XUXIUFHCSA-N Ile-Pro-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O NLZVTPYXYXMCIP-XUXIUFHCSA-N 0.000 description 3
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 3
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 3
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 3
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- 108091028043 Nucleic acid sequence Proteins 0.000 description 3
- 241001165494 Rhodiola Species 0.000 description 3
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 3
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 3
- 238000010367 cloning Methods 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 239000003814 drug Substances 0.000 description 3
- 239000013604 expression vector Substances 0.000 description 3
- 150000002213 flavones Chemical class 0.000 description 3
- 150000002215 flavonoids Chemical class 0.000 description 3
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 3
- 108010078274 isoleucylvaline Proteins 0.000 description 3
- 108010009298 lysylglutamic acid Proteins 0.000 description 3
- 241000894007 species Species 0.000 description 3
- 108010005652 splenotritin Proteins 0.000 description 3
- 230000009466 transformation Effects 0.000 description 3
- 238000011144 upstream manufacturing Methods 0.000 description 3
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 3
- BYXHQQCXAJARLQ-ZLUOBGJFSA-N Ala-Ala-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O BYXHQQCXAJARLQ-ZLUOBGJFSA-N 0.000 description 2
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 2
- KRHRBKYBJXMYBB-WHFBIAKZSA-N Ala-Cys-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O KRHRBKYBJXMYBB-WHFBIAKZSA-N 0.000 description 2
- ROLXPVQSRCPVGK-XDTLVQLUSA-N Ala-Glu-Tyr Chemical compound N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O ROLXPVQSRCPVGK-XDTLVQLUSA-N 0.000 description 2
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 2
- OARAZORWIMYUPO-FXQIFTODSA-N Ala-Met-Cys Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CS)C(O)=O OARAZORWIMYUPO-FXQIFTODSA-N 0.000 description 2
- FEGOCLZUJUFCHP-CIUDSAMLSA-N Ala-Pro-Gln Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FEGOCLZUJUFCHP-CIUDSAMLSA-N 0.000 description 2
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 2
- PAYRUJLWNCNPSJ-UHFFFAOYSA-N Aniline Chemical compound NC1=CC=CC=C1 PAYRUJLWNCNPSJ-UHFFFAOYSA-N 0.000 description 2
- SYAUZLVLXCDRSH-IUCAKERBSA-N Arg-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N SYAUZLVLXCDRSH-IUCAKERBSA-N 0.000 description 2
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 2
- ZPWMEWYQBWSGAO-ZJDVBMNYSA-N Arg-Thr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZPWMEWYQBWSGAO-ZJDVBMNYSA-N 0.000 description 2
- JQSWHKKUZMTOIH-QWRGUYRKSA-N Asn-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N JQSWHKKUZMTOIH-QWRGUYRKSA-N 0.000 description 2
- PJERDVUTUDZPGX-ZKWXMUAHSA-N Asp-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC(O)=O PJERDVUTUDZPGX-ZKWXMUAHSA-N 0.000 description 2
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 2
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 2
- NZWDWXSWUQCNMG-GARJFASQSA-N Asp-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)C(=O)O NZWDWXSWUQCNMG-GARJFASQSA-N 0.000 description 2
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 2
- 108091033380 Coding strand Proteins 0.000 description 2
- UVZFZTWNHOQWNK-NAKRPEOUSA-N Cys-Ile-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UVZFZTWNHOQWNK-NAKRPEOUSA-N 0.000 description 2
- DVIHGGUODLILFN-GHCJXIJMSA-N Cys-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N DVIHGGUODLILFN-GHCJXIJMSA-N 0.000 description 2
- OXFOKRAFNYSREH-BJDJZHNGSA-N Cys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N OXFOKRAFNYSREH-BJDJZHNGSA-N 0.000 description 2
- VPQZSNQICFCCSO-BJDJZHNGSA-N Cys-Leu-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VPQZSNQICFCCSO-BJDJZHNGSA-N 0.000 description 2
- 101150068667 FNSI gene Proteins 0.000 description 2
- NVEASDQHBRZPSU-BQBZGAKWSA-N Gln-Gln-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O NVEASDQHBRZPSU-BQBZGAKWSA-N 0.000 description 2
- YXQCLIVLWCKCRS-RYUDHWBXSA-N Gln-Gly-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N)O YXQCLIVLWCKCRS-RYUDHWBXSA-N 0.000 description 2
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 2
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 2
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 2
- IFZWDJWERARYFC-WNHJNPCNSA-N Glu-Glu-Gln-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCC(O)=O)N)C(O)=O)=CNC2=C1 IFZWDJWERARYFC-WNHJNPCNSA-N 0.000 description 2
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 2
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 2
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 2
- JVYNYWXHZWVJEF-NUMRIWBASA-N Glu-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O JVYNYWXHZWVJEF-NUMRIWBASA-N 0.000 description 2
- XEJTYSCIXKYSHR-WDSKDSINSA-N Gly-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN XEJTYSCIXKYSHR-WDSKDSINSA-N 0.000 description 2
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 2
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 2
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 2
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 2
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 2
- XKIYNCLILDLGRS-QWRGUYRKSA-N His-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 XKIYNCLILDLGRS-QWRGUYRKSA-N 0.000 description 2
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 2
- IDAHFEPYTJJZFD-PEFMBERDSA-N Ile-Asp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IDAHFEPYTJJZFD-PEFMBERDSA-N 0.000 description 2
- CSQNHSGHAPRGPQ-YTFOTSKYSA-N Ile-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)O)N CSQNHSGHAPRGPQ-YTFOTSKYSA-N 0.000 description 2
- ZUPJCJINYQISSN-XUXIUFHCSA-N Ile-Met-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUPJCJINYQISSN-XUXIUFHCSA-N 0.000 description 2
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 2
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 2
- 241000880493 Leptailurus serval Species 0.000 description 2
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 2
- KTOIECMYZZGVSI-BZSNNMDCSA-N Leu-Phe-His Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 KTOIECMYZZGVSI-BZSNNMDCSA-N 0.000 description 2
- SYRTUBLKWNDSDK-DKIMLUQUSA-N Leu-Phe-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYRTUBLKWNDSDK-DKIMLUQUSA-N 0.000 description 2
- NTBFKPBULZGXQL-KKUMJFAQSA-N Lys-Asp-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTBFKPBULZGXQL-KKUMJFAQSA-N 0.000 description 2
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 2
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 2
- OWRUUFUVXFREBD-KKUMJFAQSA-N Lys-His-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O OWRUUFUVXFREBD-KKUMJFAQSA-N 0.000 description 2
- 241000219823 Medicago Species 0.000 description 2
- 240000004658 Medicago sativa Species 0.000 description 2
- 235000010624 Medicago sativa Nutrition 0.000 description 2
- 235000017587 Medicago sativa ssp. sativa Nutrition 0.000 description 2
- 102000008300 Mutant Proteins Human genes 0.000 description 2
- 108010021466 Mutant Proteins Proteins 0.000 description 2
- GYEPCBNTTRORKW-PCBIJLKTSA-N Phe-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O GYEPCBNTTRORKW-PCBIJLKTSA-N 0.000 description 2
- PEFJUUYFEGBXFA-BZSNNMDCSA-N Phe-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 PEFJUUYFEGBXFA-BZSNNMDCSA-N 0.000 description 2
- AMBLXEMWFARNNQ-DCAQKATOSA-N Pro-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 AMBLXEMWFARNNQ-DCAQKATOSA-N 0.000 description 2
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 2
- JUJCUYWRJMFJJF-AVGNSLFASA-N Pro-Lys-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 JUJCUYWRJMFJJF-AVGNSLFASA-N 0.000 description 2
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 2
- FIDNSJUXESUDOV-JYJNAYRXSA-N Pro-Tyr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O FIDNSJUXESUDOV-JYJNAYRXSA-N 0.000 description 2
- 241000207929 Scutellaria Species 0.000 description 2
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 2
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 2
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 2
- RNFZZCMCRDFNAE-WFBYXXMGSA-N Trp-Asn-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O RNFZZCMCRDFNAE-WFBYXXMGSA-N 0.000 description 2
- WOAQYWUEUYMVGK-ULQDDVLXSA-N Tyr-Lys-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOAQYWUEUYMVGK-ULQDDVLXSA-N 0.000 description 2
- IEWKKXZRJLTIOV-AVGNSLFASA-N Tyr-Ser-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O IEWKKXZRJLTIOV-AVGNSLFASA-N 0.000 description 2
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 2
- DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 2
- XQVRMLRMTAGSFJ-QXEWZRGKSA-N Val-Asp-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XQVRMLRMTAGSFJ-QXEWZRGKSA-N 0.000 description 2
- RQOMPQGUGBILAG-AVGNSLFASA-N Val-Met-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RQOMPQGUGBILAG-AVGNSLFASA-N 0.000 description 2
- 108010068380 arginylarginine Proteins 0.000 description 2
- 230000004071 biological effect Effects 0.000 description 2
- 230000003197 catalytic effect Effects 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 2
- 239000002131 composite material Substances 0.000 description 2
- 239000013256 coordination polymer Substances 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 239000012634 fragment Substances 0.000 description 2
- 229960001031 glucose Drugs 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- 108010092114 histidylphenylalanine Proteins 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 2
- 230000000813 microbial effect Effects 0.000 description 2
- 238000002156 mixing Methods 0.000 description 2
- 229930014626 natural product Natural products 0.000 description 2
- 108020004707 nucleic acids Proteins 0.000 description 2
- 102000039446 nucleic acids Human genes 0.000 description 2
- 108010030237 phenylalanyl-arginyl-valyl-phenylalanine Proteins 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 238000002390 rotary evaporation Methods 0.000 description 2
- 108010045269 tryptophyltryptophan Proteins 0.000 description 2
- 229960004441 tyrosine Drugs 0.000 description 2
- LIWOHUSRWUWRSX-ZJZGAYNASA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-phenylpropanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-methylbutanoyl]amino]-3-phenylpropanoic acid Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 LIWOHUSRWUWRSX-ZJZGAYNASA-N 0.000 description 1
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 1
- 108091064702 1 family Proteins 0.000 description 1
- HNSDLXPSAYFUHK-UHFFFAOYSA-N 1,4-bis(2-ethylhexyl) sulfosuccinate Chemical compound CCCCC(CC)COC(=O)CC(S(O)(=O)=O)C(=O)OCC(CC)CCCC HNSDLXPSAYFUHK-UHFFFAOYSA-N 0.000 description 1
- PIDRBUDUWHBYSR-UHFFFAOYSA-N 1-[2-[[2-[(2-amino-4-methylpentanoyl)amino]-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O PIDRBUDUWHBYSR-UHFFFAOYSA-N 0.000 description 1
- QMOQBVOBWVNSNO-UHFFFAOYSA-N 2-[[2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(O)=O QMOQBVOBWVNSNO-UHFFFAOYSA-N 0.000 description 1
- QDGAVODICPCDMU-UHFFFAOYSA-N 2-amino-3-[3-[bis(2-chloroethyl)amino]phenyl]propanoic acid Chemical compound OC(=O)C(N)CC1=CC=CC(N(CCCl)CCCl)=C1 QDGAVODICPCDMU-UHFFFAOYSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 1
- YEELWQSXYBJVSV-UWJYBYFXSA-N Ala-Cys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YEELWQSXYBJVSV-UWJYBYFXSA-N 0.000 description 1
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 1
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 1
- UHMQKOBNPRAZGB-CIUDSAMLSA-N Ala-Glu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N UHMQKOBNPRAZGB-CIUDSAMLSA-N 0.000 description 1
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 1
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 1
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 1
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 1
- VHVVPYOJIIQCKS-QEJZJMRPSA-N Ala-Leu-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VHVVPYOJIIQCKS-QEJZJMRPSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- BLTRAARCJYVJKV-QEJZJMRPSA-N Ala-Lys-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(O)=O BLTRAARCJYVJKV-QEJZJMRPSA-N 0.000 description 1
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 1
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 1
- IHMCQESUJVZTKW-UBHSHLNASA-N Ala-Phe-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 IHMCQESUJVZTKW-UBHSHLNASA-N 0.000 description 1
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 1
- XPSGESXVBSQZPL-SRVKXCTJSA-N Arg-Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XPSGESXVBSQZPL-SRVKXCTJSA-N 0.000 description 1
- DCGLNNVKIZXQOJ-FXQIFTODSA-N Arg-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DCGLNNVKIZXQOJ-FXQIFTODSA-N 0.000 description 1
- XUBLMYHWSFRACH-CYDGBPFRSA-N Arg-Asn-Gln-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XUBLMYHWSFRACH-CYDGBPFRSA-N 0.000 description 1
- RKRSYHCNPFGMTA-CIUDSAMLSA-N Arg-Glu-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O RKRSYHCNPFGMTA-CIUDSAMLSA-N 0.000 description 1
- IYMAXBFPHPZYIK-BQBZGAKWSA-N Arg-Gly-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IYMAXBFPHPZYIK-BQBZGAKWSA-N 0.000 description 1
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 1
- SYFHFLGAROUHNT-VEVYYDQMSA-N Arg-Thr-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SYFHFLGAROUHNT-VEVYYDQMSA-N 0.000 description 1
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 1
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 1
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 1
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 1
- ZDOQDYFZNGASEY-BIIVOSGPSA-N Asn-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZDOQDYFZNGASEY-BIIVOSGPSA-N 0.000 description 1
- XWFPGQVLOVGSLU-CIUDSAMLSA-N Asn-Gln-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XWFPGQVLOVGSLU-CIUDSAMLSA-N 0.000 description 1
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 1
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 1
- PHJPKNUWWHRAOC-PEFMBERDSA-N Asn-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PHJPKNUWWHRAOC-PEFMBERDSA-N 0.000 description 1
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 1
- JTXVXGXTRXMOFJ-FXQIFTODSA-N Asn-Pro-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O JTXVXGXTRXMOFJ-FXQIFTODSA-N 0.000 description 1
- HPNDKUOLNRVRAY-BIIVOSGPSA-N Asn-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N)C(=O)O HPNDKUOLNRVRAY-BIIVOSGPSA-N 0.000 description 1
- XPGVTUBABLRGHY-BIIVOSGPSA-N Asp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N XPGVTUBABLRGHY-BIIVOSGPSA-N 0.000 description 1
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 1
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 1
- FTNVLGCFIJEMQT-CIUDSAMLSA-N Asp-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N FTNVLGCFIJEMQT-CIUDSAMLSA-N 0.000 description 1
- SEMWSADZTMJELF-BYULHYEWSA-N Asp-Ile-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O SEMWSADZTMJELF-BYULHYEWSA-N 0.000 description 1
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 1
- RTXQQDVBACBSCW-CFMVVWHZSA-N Asp-Ile-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RTXQQDVBACBSCW-CFMVVWHZSA-N 0.000 description 1
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 1
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 1
- IDDMGSKZQDEDGA-SRVKXCTJSA-N Asp-Phe-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 IDDMGSKZQDEDGA-SRVKXCTJSA-N 0.000 description 1
- KBJVTFWQWXCYCQ-IUKAMOBKSA-N Asp-Thr-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KBJVTFWQWXCYCQ-IUKAMOBKSA-N 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- OXOQBEVULIBOSH-ZDLURKLDSA-N Cys-Gly-Thr Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O OXOQBEVULIBOSH-ZDLURKLDSA-N 0.000 description 1
- DIHCYBRLTVEPBW-SRVKXCTJSA-N Cys-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N DIHCYBRLTVEPBW-SRVKXCTJSA-N 0.000 description 1
- OZSBRCONEMXYOJ-AVGNSLFASA-N Cys-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N OZSBRCONEMXYOJ-AVGNSLFASA-N 0.000 description 1
- ZOMMHASZJQRLFS-IHRRRGAJSA-N Cys-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CS)N ZOMMHASZJQRLFS-IHRRRGAJSA-N 0.000 description 1
- 108010052832 Cytochromes Proteins 0.000 description 1
- 102000018832 Cytochromes Human genes 0.000 description 1
- 150000008574 D-amino acids Chemical class 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 1
- QWVMSYBGKWZIIE-RDFNRINOSA-N Flavochrome Natural products CC(=C/C=C/C=C(C)/C=C/C=C(C)/C1OC2(C)CCCC(C)(C)C2=C1)C=CC=C(/C)C=CC3C(=CCCC3(C)C)C QWVMSYBGKWZIIE-RDFNRINOSA-N 0.000 description 1
- KYFSMWLWHYZRNW-ACZMJKKPSA-N Gln-Asp-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N KYFSMWLWHYZRNW-ACZMJKKPSA-N 0.000 description 1
- UFNSPPFJOHNXRE-AUTRQRHGSA-N Gln-Gln-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UFNSPPFJOHNXRE-AUTRQRHGSA-N 0.000 description 1
- RGAOLBZBLOJUTP-GRLWGSQLSA-N Gln-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N RGAOLBZBLOJUTP-GRLWGSQLSA-N 0.000 description 1
- FYAULIGIFPPOAA-ZPFDUUQYSA-N Gln-Ile-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O FYAULIGIFPPOAA-ZPFDUUQYSA-N 0.000 description 1
- DQLVHRFFBQOWFL-JYJNAYRXSA-N Gln-Lys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)O DQLVHRFFBQOWFL-JYJNAYRXSA-N 0.000 description 1
- QFXNFFZTMFHPST-DZKIICNBSA-N Gln-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)N)N QFXNFFZTMFHPST-DZKIICNBSA-N 0.000 description 1
- ZZLDMBMFKZFQMU-NRPADANISA-N Gln-Val-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O ZZLDMBMFKZFQMU-NRPADANISA-N 0.000 description 1
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 1
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 1
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 1
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 1
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 1
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 1
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 1
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 1
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 1
- GGJOGFJIPPGNRK-JSGCOSHPSA-N Glu-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)N)C(O)=O)=CNC2=C1 GGJOGFJIPPGNRK-JSGCOSHPSA-N 0.000 description 1
- AQNYKMCFCCZEEL-JYJNAYRXSA-N Glu-Lys-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AQNYKMCFCCZEEL-JYJNAYRXSA-N 0.000 description 1
- ARIORLIIMJACKZ-KKUMJFAQSA-N Glu-Pro-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ARIORLIIMJACKZ-KKUMJFAQSA-N 0.000 description 1
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 1
- JWNZHMSRZXXGTM-XKBZYTNZSA-N Glu-Ser-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWNZHMSRZXXGTM-XKBZYTNZSA-N 0.000 description 1
- PMSDOVISAARGAV-FHWLQOOXSA-N Glu-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 PMSDOVISAARGAV-FHWLQOOXSA-N 0.000 description 1
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 1
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 1
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 1
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 1
- AIJAPFVDBFYNKN-WHFBIAKZSA-N Gly-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN)C(=O)N AIJAPFVDBFYNKN-WHFBIAKZSA-N 0.000 description 1
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 1
- SUDUYJOBLHQAMI-WHFBIAKZSA-N Gly-Asp-Cys Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O SUDUYJOBLHQAMI-WHFBIAKZSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 1
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 1
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 1
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 1
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 1
- UWQDKRIZSROAKS-FJXKBIBVSA-N Gly-Met-Thr Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UWQDKRIZSROAKS-FJXKBIBVSA-N 0.000 description 1
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 1
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 1
- SFOXOSKVTLDEDM-HOTGVXAUSA-N Gly-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)CN)=CNC2=C1 SFOXOSKVTLDEDM-HOTGVXAUSA-N 0.000 description 1
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 1
- WTJBVCUCLWFGAH-JUKXBJQTSA-N His-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WTJBVCUCLWFGAH-JUKXBJQTSA-N 0.000 description 1
- SKOKHBGDXGTDDP-MELADBBJSA-N His-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N SKOKHBGDXGTDDP-MELADBBJSA-N 0.000 description 1
- HYWZHNUGAYVEEW-KKUMJFAQSA-N His-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HYWZHNUGAYVEEW-KKUMJFAQSA-N 0.000 description 1
- MDOBWSFNSNPENN-PMVVWTBXSA-N His-Thr-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O MDOBWSFNSNPENN-PMVVWTBXSA-N 0.000 description 1
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 1
- LJKDGRWXYUTRSH-YVNDNENWSA-N Ile-Gln-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LJKDGRWXYUTRSH-YVNDNENWSA-N 0.000 description 1
- LKACSKJPTFSBHR-MNXVOIDGSA-N Ile-Gln-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N LKACSKJPTFSBHR-MNXVOIDGSA-N 0.000 description 1
- ODPKZZLRDNXTJZ-WHOFXGATSA-N Ile-Gly-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ODPKZZLRDNXTJZ-WHOFXGATSA-N 0.000 description 1
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 1
- AFERFBZLVUFWRA-HTFCKZLJSA-N Ile-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)O)N AFERFBZLVUFWRA-HTFCKZLJSA-N 0.000 description 1
- GAZGFPOZOLEYAJ-YTFOTSKYSA-N Ile-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N GAZGFPOZOLEYAJ-YTFOTSKYSA-N 0.000 description 1
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 1
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 1
- FFJQAEYLAQMGDL-MGHWNKPDSA-N Ile-Lys-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FFJQAEYLAQMGDL-MGHWNKPDSA-N 0.000 description 1
- IMRKCLXPYOIHIF-ZPFDUUQYSA-N Ile-Met-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N IMRKCLXPYOIHIF-ZPFDUUQYSA-N 0.000 description 1
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 1
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 1
- NAFIFZNBSPWYOO-RWRJDSDZSA-N Ile-Thr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NAFIFZNBSPWYOO-RWRJDSDZSA-N 0.000 description 1
- ZGKVPOSSTGHJAF-HJPIBITLSA-N Ile-Tyr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CO)C(=O)O)N ZGKVPOSSTGHJAF-HJPIBITLSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- 150000008575 L-amino acids Chemical class 0.000 description 1
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 1
- XIRYQRLFHWWWTC-QEJZJMRPSA-N Leu-Ala-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XIRYQRLFHWWWTC-QEJZJMRPSA-N 0.000 description 1
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 1
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 1
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 1
- GZAUZBUKDXYPEH-CIUDSAMLSA-N Leu-Cys-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N GZAUZBUKDXYPEH-CIUDSAMLSA-N 0.000 description 1
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 1
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 1
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 1
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 1
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 1
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 1
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 1
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 1
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- BJWKOATWNQJPSK-SRVKXCTJSA-N Leu-Met-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BJWKOATWNQJPSK-SRVKXCTJSA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- UFPLDOKWDNTTRP-ULQDDVLXSA-N Leu-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=C(O)C=C1 UFPLDOKWDNTTRP-ULQDDVLXSA-N 0.000 description 1
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 1
- DEFGUIIUYAUEDU-ZPFDUUQYSA-N Lys-Asn-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DEFGUIIUYAUEDU-ZPFDUUQYSA-N 0.000 description 1
- PGBPWPTUOSCNLE-JYJNAYRXSA-N Lys-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N PGBPWPTUOSCNLE-JYJNAYRXSA-N 0.000 description 1
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 1
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- PAMDBWYMLWOELY-SDDRHHMPSA-N Lys-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O PAMDBWYMLWOELY-SDDRHHMPSA-N 0.000 description 1
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 1
- NNKLKUUGESXCBS-KBPBESRZSA-N Lys-Gly-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NNKLKUUGESXCBS-KBPBESRZSA-N 0.000 description 1
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 1
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 1
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 1
- ORVFEGYUJITPGI-IHRRRGAJSA-N Lys-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN ORVFEGYUJITPGI-IHRRRGAJSA-N 0.000 description 1
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 1
- KVNLHIXLLZBAFQ-RWMBFGLXSA-N Lys-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N KVNLHIXLLZBAFQ-RWMBFGLXSA-N 0.000 description 1
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 description 1
- AEIIJFBQVGYVEV-YESZJQIVSA-N Lys-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCCN)N)C(=O)O AEIIJFBQVGYVEV-YESZJQIVSA-N 0.000 description 1
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 1
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 1
- MYTOTTSMVMWVJN-STQMWFEESA-N Lys-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MYTOTTSMVMWVJN-STQMWFEESA-N 0.000 description 1
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 1
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 1
- BMHIFARYXOJDLD-WPRPVWTQSA-N Met-Gly-Val Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O BMHIFARYXOJDLD-WPRPVWTQSA-N 0.000 description 1
- BKIFWLQFOOKUCA-DCAQKATOSA-N Met-His-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N BKIFWLQFOOKUCA-DCAQKATOSA-N 0.000 description 1
- ZEVPMOHYCQFWSE-NAKRPEOUSA-N Met-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCSC)N ZEVPMOHYCQFWSE-NAKRPEOUSA-N 0.000 description 1
- SODXFJOPSCXOHE-IHRRRGAJSA-N Met-Leu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O SODXFJOPSCXOHE-IHRRRGAJSA-N 0.000 description 1
- XDGFFEZAZHRZFR-RHYQMDGZSA-N Met-Leu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDGFFEZAZHRZFR-RHYQMDGZSA-N 0.000 description 1
- HOZNVKDCKZPRER-XUXIUFHCSA-N Met-Lys-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HOZNVKDCKZPRER-XUXIUFHCSA-N 0.000 description 1
- BQHLZUMZOXUWNU-DCAQKATOSA-N Met-Pro-Glu Chemical compound CSCC[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BQHLZUMZOXUWNU-DCAQKATOSA-N 0.000 description 1
- XIGAHPDZLAYQOS-SRVKXCTJSA-N Met-Pro-Pro Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 XIGAHPDZLAYQOS-SRVKXCTJSA-N 0.000 description 1
- LXCSZPUQKMTXNW-BQBZGAKWSA-N Met-Ser-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O LXCSZPUQKMTXNW-BQBZGAKWSA-N 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- 108091005461 Nucleic proteins Chemical group 0.000 description 1
- LBSARGIQACMGDF-WBAXXEDZSA-N Phe-Ala-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 LBSARGIQACMGDF-WBAXXEDZSA-N 0.000 description 1
- DJPXNKUDJKGQEE-BZSNNMDCSA-N Phe-Asp-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DJPXNKUDJKGQEE-BZSNNMDCSA-N 0.000 description 1
- MFQXSDWKUXTOPZ-DZKIICNBSA-N Phe-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N MFQXSDWKUXTOPZ-DZKIICNBSA-N 0.000 description 1
- RORUIHAWOLADSH-HJWJTTGWSA-N Phe-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 RORUIHAWOLADSH-HJWJTTGWSA-N 0.000 description 1
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 1
- AUJWXNGCAQWLEI-KBPBESRZSA-N Phe-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AUJWXNGCAQWLEI-KBPBESRZSA-N 0.000 description 1
- MGLBSROLWAWCKN-FCLVOEFKSA-N Phe-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MGLBSROLWAWCKN-FCLVOEFKSA-N 0.000 description 1
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- VGTJSEYTVMAASM-RPTUDFQQSA-N Phe-Thr-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VGTJSEYTVMAASM-RPTUDFQQSA-N 0.000 description 1
- VIIRRNQMMIHYHQ-XHSDSOJGSA-N Phe-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N VIIRRNQMMIHYHQ-XHSDSOJGSA-N 0.000 description 1
- APZNYJFGVAGFCF-JYJNAYRXSA-N Phe-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccccc1)C(C)C)C(O)=O APZNYJFGVAGFCF-JYJNAYRXSA-N 0.000 description 1
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 1
- OCSACVPBMIYNJE-GUBZILKMSA-N Pro-Arg-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O OCSACVPBMIYNJE-GUBZILKMSA-N 0.000 description 1
- UVKNEILZSJMKSR-FXQIFTODSA-N Pro-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 UVKNEILZSJMKSR-FXQIFTODSA-N 0.000 description 1
- GDXZRWYXJSGWIV-GMOBBJLQSA-N Pro-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 GDXZRWYXJSGWIV-GMOBBJLQSA-N 0.000 description 1
- AUQGUYPHJSMAKI-CYDGBPFRSA-N Pro-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 AUQGUYPHJSMAKI-CYDGBPFRSA-N 0.000 description 1
- FYPGHGXAOZTOBO-IHRRRGAJSA-N Pro-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 FYPGHGXAOZTOBO-IHRRRGAJSA-N 0.000 description 1
- DRKAXLDECUGLFE-ULQDDVLXSA-N Pro-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O DRKAXLDECUGLFE-ULQDDVLXSA-N 0.000 description 1
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 1
- BUEIYHBJHCDAMI-UFYCRDLUSA-N Pro-Phe-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BUEIYHBJHCDAMI-UFYCRDLUSA-N 0.000 description 1
- XYAFCOJKICBRDU-JYJNAYRXSA-N Pro-Phe-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O XYAFCOJKICBRDU-JYJNAYRXSA-N 0.000 description 1
- PCWLNNZTBJTZRN-AVGNSLFASA-N Pro-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 PCWLNNZTBJTZRN-AVGNSLFASA-N 0.000 description 1
- VEUACYMXJKXALX-IHRRRGAJSA-N Pro-Tyr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VEUACYMXJKXALX-IHRRRGAJSA-N 0.000 description 1
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 1
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 1
- 108010003201 RGH 0205 Proteins 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 241000221523 Rhodotorula toruloides Species 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 1
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 1
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 1
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 1
- DSSOYPJWSWFOLK-CIUDSAMLSA-N Ser-Cys-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O DSSOYPJWSWFOLK-CIUDSAMLSA-N 0.000 description 1
- HVKMTOIAYDOJPL-NRPADANISA-N Ser-Gln-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVKMTOIAYDOJPL-NRPADANISA-N 0.000 description 1
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 1
- GRSLLFZTTLBOQX-CIUDSAMLSA-N Ser-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N GRSLLFZTTLBOQX-CIUDSAMLSA-N 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 1
- BYCVMHKULKRVPV-GUBZILKMSA-N Ser-Lys-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYCVMHKULKRVPV-GUBZILKMSA-N 0.000 description 1
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 1
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 1
- XGQKSRGHEZNWIS-IHRRRGAJSA-N Ser-Pro-Tyr Chemical compound N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O XGQKSRGHEZNWIS-IHRRRGAJSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 1
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 1
- UYLKOSODXYSWMQ-XGEHTFHBSA-N Ser-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CO)N)O UYLKOSODXYSWMQ-XGEHTFHBSA-N 0.000 description 1
- FGBLCMLXHRPVOF-IHRRRGAJSA-N Ser-Tyr-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FGBLCMLXHRPVOF-IHRRRGAJSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- SWIKDOUVROTZCW-GCJQMDKQSA-N Thr-Asn-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O SWIKDOUVROTZCW-GCJQMDKQSA-N 0.000 description 1
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 1
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 1
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 1
- IGGFFPOIFHZYKC-PBCZWWQYSA-N Thr-His-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O IGGFFPOIFHZYKC-PBCZWWQYSA-N 0.000 description 1
- FKIGTIXHSRNKJU-IXOXFDKPSA-N Thr-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CN=CN1 FKIGTIXHSRNKJU-IXOXFDKPSA-N 0.000 description 1
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 1
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 1
- LCCSEJSPBWKBNT-OSUNSFLBSA-N Thr-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N LCCSEJSPBWKBNT-OSUNSFLBSA-N 0.000 description 1
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 1
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 1
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 1
- OMRWDMWXRWTQIU-YJRXYDGGSA-N Thr-Tyr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CS)C(=O)O)N)O OMRWDMWXRWTQIU-YJRXYDGGSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- IQXWAJUIAQLZNX-IHPCNDPISA-N Trp-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N IQXWAJUIAQLZNX-IHPCNDPISA-N 0.000 description 1
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 1
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 1
- BVDHHLMIZFCAAU-BZSNNMDCSA-N Tyr-Cys-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BVDHHLMIZFCAAU-BZSNNMDCSA-N 0.000 description 1
- IMXAAEFAIBRCQF-SIUGBPQLSA-N Tyr-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N IMXAAEFAIBRCQF-SIUGBPQLSA-N 0.000 description 1
- OHOVFPKXPZODHS-SJWGOKEGSA-N Tyr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OHOVFPKXPZODHS-SJWGOKEGSA-N 0.000 description 1
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 1
- VTCKHZJKWQENKX-KBPBESRZSA-N Tyr-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O VTCKHZJKWQENKX-KBPBESRZSA-N 0.000 description 1
- PMHLLBKTDHQMCY-ULQDDVLXSA-N Tyr-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMHLLBKTDHQMCY-ULQDDVLXSA-N 0.000 description 1
- XYNFFTNEQDWZNY-ULQDDVLXSA-N Tyr-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N XYNFFTNEQDWZNY-ULQDDVLXSA-N 0.000 description 1
- UMSZZGTXGKHTFJ-SRVKXCTJSA-N Tyr-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UMSZZGTXGKHTFJ-SRVKXCTJSA-N 0.000 description 1
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 1
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 1
- VDPRBUOZLIFUIM-GUBZILKMSA-N Val-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N VDPRBUOZLIFUIM-GUBZILKMSA-N 0.000 description 1
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 1
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 1
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 1
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 1
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 1
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 1
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- UOUIMEGEPSBZIV-ULQDDVLXSA-N Val-Lys-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UOUIMEGEPSBZIV-ULQDDVLXSA-N 0.000 description 1
- MGVYZTPLGXPVQB-CYDGBPFRSA-N Val-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MGVYZTPLGXPVQB-CYDGBPFRSA-N 0.000 description 1
- YDVDTCJGBBJGRT-GUBZILKMSA-N Val-Met-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N YDVDTCJGBBJGRT-GUBZILKMSA-N 0.000 description 1
- BCBFMJYTNKDALA-UFYCRDLUSA-N Val-Phe-Phe Chemical compound N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O BCBFMJYTNKDALA-UFYCRDLUSA-N 0.000 description 1
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 1
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 1
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 1
- XNLUVJPMPAZHCY-JYJNAYRXSA-N Val-Val-Phe Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 XNLUVJPMPAZHCY-JYJNAYRXSA-N 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 101150047711 acs gene Proteins 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 1
- 239000012620 biological material Substances 0.000 description 1
- 230000003570 biosynthesizing effect Effects 0.000 description 1
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 1
- 239000006143 cell culture medium Substances 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 231100000481 chemical toxicant Toxicity 0.000 description 1
- 210000001072 colon Anatomy 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- QWVMSYBGKWZIIE-FZKBJVJCSA-N flavochrome Chemical compound O1C2(C)CCCC(C)(C)C2=CC1C(\C)=C\C=C\C(\C)=C\C=C\C=C(/C)\C=C\C=C(/C)\C=C\C1C(C)=CCCC1(C)C QWVMSYBGKWZIIE-FZKBJVJCSA-N 0.000 description 1
- HQVFCQRVQFYGRJ-UHFFFAOYSA-N formic acid;hydrate Chemical compound O.OC=O HQVFCQRVQFYGRJ-UHFFFAOYSA-N 0.000 description 1
- 238000012215 gene cloning Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 108010078144 glutaminyl-glycine Proteins 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- 230000034659 glycolysis Effects 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 238000011031 large-scale manufacturing process Methods 0.000 description 1
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010012058 leucyltyrosine Proteins 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 238000004811 liquid chromatography Methods 0.000 description 1
- 239000007791 liquid phase Substances 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 230000037353 metabolic pathway Effects 0.000 description 1
- 239000002207 metabolite Substances 0.000 description 1
- 108010056582 methionylglutamic acid Proteins 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 239000003471 mutagenic agent Substances 0.000 description 1
- 231100000707 mutagenic chemical Toxicity 0.000 description 1
- 239000012074 organic phase Substances 0.000 description 1
- 230000003647 oxidation Effects 0.000 description 1
- 238000007254 oxidation reaction Methods 0.000 description 1
- 229910052760 oxygen Inorganic materials 0.000 description 1
- 239000001301 oxygen Substances 0.000 description 1
- 101150077062 pal gene Proteins 0.000 description 1
- 230000004108 pentose phosphate pathway Effects 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 230000001766 physiological effect Effects 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 1
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 108020001580 protein domains Proteins 0.000 description 1
- 238000002708 random mutagenesis Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- JXOHGGNKMLTUBP-HSUXUTPPSA-N shikimic acid Chemical compound O[C@@H]1CC(C(O)=O)=C[C@@H](O)[C@H]1O JXOHGGNKMLTUBP-HSUXUTPPSA-N 0.000 description 1
- JXOHGGNKMLTUBP-JKUQZMGJSA-N shikimic acid Natural products O[C@@H]1CC(C(O)=O)=C[C@H](O)[C@@H]1O JXOHGGNKMLTUBP-JKUQZMGJSA-N 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 108010084272 syntrophin alpha1 Proteins 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 239000003440 toxic substance Substances 0.000 description 1
- DQFBYFPFKXHELB-VAWYXSNFSA-N trans-chalcone Chemical compound C=1C=CC=CC=1C(=O)\C=C\C1=CC=CC=C1 DQFBYFPFKXHELB-VAWYXSNFSA-N 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 108700026220 vif Genes Proteins 0.000 description 1
- 108010027345 wheylin-1 peptide Proteins 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1025—Acyltransferases (2.3)
- C12N9/1029—Acyltransferases (2.3) transferring groups other than amino-acyl groups (2.3.1)
- C12N9/1037—Naringenin-chalcone synthase (2.3.1.74), i.e. chalcone synthase
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/415—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/52—Genes encoding for enzymes or proenzymes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/74—Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0012—Oxidoreductases (1.) acting on nitrogen containing compounds as donors (1.4, 1.5, 1.6, 1.7)
- C12N9/0036—Oxidoreductases (1.) acting on nitrogen containing compounds as donors (1.4, 1.5, 1.6, 1.7) acting on NADH or NADPH (1.6)
- C12N9/0038—Oxidoreductases (1.) acting on nitrogen containing compounds as donors (1.4, 1.5, 1.6, 1.7) acting on NADH or NADPH (1.6) with a heme protein as acceptor (1.6.2)
- C12N9/0042—NADPH-cytochrome P450 reductase (1.6.2.4)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0071—Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/88—Lyases (4.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/90—Isomerases (5.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/93—Ligases (6)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P17/00—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
- C12P17/02—Oxygen as only ring hetero atoms
- C12P17/06—Oxygen as only ring hetero atoms containing a six-membered hetero ring, e.g. fluorescein
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y106/00—Oxidoreductases acting on NADH or NADPH (1.6)
- C12Y106/02—Oxidoreductases acting on NADH or NADPH (1.6) with a heme protein as acceptor (1.6.2)
- C12Y106/02004—NADPH-hemoprotein reductase (1.6.2.4), i.e. NADP-cytochrome P450-reductase
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y114/00—Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14)
- C12Y114/11—Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14) with 2-oxoglutarate as one donor, and incorporation of one atom each of oxygen into both donors (1.14.11)
- C12Y114/11022—Flavone synthase (1.14.11.22)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y114/00—Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14)
- C12Y114/11—Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14) with 2-oxoglutarate as one donor, and incorporation of one atom each of oxygen into both donors (1.14.11)
- C12Y114/11023—Flavonol synthase (1.14.11.23)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y203/00—Acyltransferases (2.3)
- C12Y203/01—Acyltransferases (2.3) transferring groups other than amino-acyl groups (2.3.1)
- C12Y203/01074—Naringenin-chalcone synthase (2.3.1.74), i.e. chalcone synthase
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y403/00—Carbon-nitrogen lyases (4.3)
- C12Y403/01—Ammonia-lyases (4.3.1)
- C12Y403/01024—Phenylalanine ammonia-lyase (4.3.1.24)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y505/00—Intramolecular lyases (5.5)
- C12Y505/01—Intramolecular lyases (5.5.1)
- C12Y505/01006—Chalcone isomerase (5.5.1.6)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y602/00—Ligases forming carbon-sulfur bonds (6.2)
- C12Y602/01—Acid-Thiol Ligases (6.2.1)
- C12Y602/01012—4-Coumarate-CoA ligase (6.2.1.12)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12R—INDEXING SCHEME ASSOCIATED WITH SUBCLASSES C12C - C12Q, RELATING TO MICROORGANISMS
- C12R2001/00—Microorganisms ; Processes using microorganisms
- C12R2001/01—Bacteria or Actinomycetales ; using bacteria or Actinomycetales
- C12R2001/185—Escherichia
- C12R2001/19—Escherichia coli
Landscapes
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Medicinal Chemistry (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Botany (AREA)
- Gastroenterology & Hepatology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
本发明提供了一种异源合成黄芩素、野黄芩素类或白杨素类化合物的宿主细胞及其应用。本发明提供了新型的黄芩素、野黄芩素类化合物或白杨素类化合物生物合成的优化改造,可实现以原核生物为底盘利用酶自组装技术合成黄芩素、野黄芩素和白杨素类化合物,以及实现利用葡萄糖从头合成黄芩素类化合物。本发明也揭示了优化改造后的宿主细胞及其应用。
Description
技术领域
本发明涉及合成生物学及医药技术领域,具体地,本发明涉及异源合成黄酮类化合物的宿主细胞及其应用。
背景技术
黄芩素和野黄芩素是黄酮类化合物,主要存在于中药黄芩中。这两种活性黄酮仅在黄芩等相关药用植物的根中积累较少。黄芩素和黄芩素都是通过类黄酮生物合成途径合成的。黄芩素和野黄芩素具有抗氧化、抗肿瘤、抗菌、护心等重要生理活性。最近,黄芩素在体外被报道为SARS-CoV-2 3Clpro的抑制剂,显示了中药的巨大潜力。
目前,黄酮类化合物的主要来源是从植物中提取和化学合成。然而,由于使用有毒化学物质和极端的反应条件,植物萃取或化学合成无法提供大规模生产的绿色路线。因此,微生物合成黄酮类化合物的研究已经深入开展。由于从植物引入复杂的异质途径,酶的失衡和中间代谢物的积累,通常会导致产物滴度低。为了解决这些问题,采用多变量模块化方法,通过调节启动子强度和质粒拷贝数来合成黄酮类化合物。然而,它是费时的,总是需要大量的工作。之前的工作报道了在工程酵母和大肠杆菌中实现黄芩素和野黄芩素的合成,但黄芩素和野黄芩素的产量仍处于很低水平。
建立底物通道是提高酶在体内催化效率的有效途径。利用相互作用蛋白对组装多酶已被证明是提高催化效率的有效方法。脚手架也是增加产品滴度的有效策略,许多不同构象的连接子被开发用于构建各种相互作用蛋白。
在本领域中,多种天然产物的合成元件经过组装后实现了在微生物中的异源合成。但是利用生物体内酶组装策略生产黄芩素和野黄芩素这两种活性黄酮类化合物还尚未见报道;从葡萄糖合成黄芩素和野黄芩素的大肠杆菌还尚未见报道。
因此,本领域亟待优化能够高效地异源合成黄芩素和野黄芩素或类似化合物的微生物菌株。
发明内容
本发明的目的在于提供异源合成黄芩素、野黄芩素类或白杨素类化合物的宿主细胞及其应用。
在本发明的第一方面,提供一种用于合成黄芩素、野黄芩素类化合物(如黄芩素或野黄芩素)的原核细胞,其包括外源的下组酶的编码基因:黄酮6-羟化酶(F6H),细胞色素P450氧化还原酶(CPR),苯丙氨酸解氨酶(PAL)、4-香豆酸辅酶A连接酶(4CL)、查尔酮合成酶(CHS)、查尔酮异构酶(CHI)和黄酮合成酶I(FNSI);且所述酶被表达后,苯丙氨酸解氨酶(PAL)和4-香豆酸辅酶A连接酶(4CL)构成复合体(复合反应器)。
在本发明的另一方面,提供一种用于合成白杨素类化合物(如白杨素或芹菜素)的原核细胞,其包括外源的下组酶的编码基因:苯丙氨酸解氨酶(PAL)、4-香豆酸辅酶A连接酶(4CL)、查尔酮合成酶(CHS)、查尔酮异构酶(CHI)和黄酮合成酶I(FNSI);且所述酶被表达后,苯丙氨酸解氨酶(PAL)和4-香豆酸辅酶A连接酶(4CL)构成复合体(复合反应器)。
在一个优选例中,所述的苯丙氨酸解氨酶和4-香豆酸辅酶A连接酶的复合体包括:苯丙氨酸解氨酶和4-香豆酸辅酶A通过蛋白-蛋白相互作用结构域及其配体的结合而靠近,获得复合体。
在另一优选例中,所述的苯丙氨酸解氨酶和4-香豆酸辅酶A连接酶直接连接或通过连接子连接、获得融合蛋白形式的复合体。
在另一优选例中,所述蛋白-蛋白相互作用结构域包括选自下组的结构域:PDZ结构域,SH3结构域,WW结构域,LIM结构域,DD结构域,PH结构域,EH结构域,GBD结构域。
在另一优选例中,所述蛋白-蛋白相互作用结构域包括PDZ结构域,其配体为PDZligand;所述苯丙氨酸解氨酶和4-香豆酸辅酶A分别与所述PDZ结构域及其配体融合;较佳地,所述苯丙氨酸解氨酶与PDZ融合、所述4-香豆酸辅酶A与PDZ ligand融合;更佳地,所述苯丙氨酸解氨酶与PDZ融合时还包括以ER/K连接子连接(PAL-ER/K-PDZ),所述4-香豆酸辅酶A与PDZ ligand融合时还包括以(GGGGS)2连接子连接(PDZlig-(GGGGS)2-4CL)。
在另一优选例中,所述蛋白-蛋白相互作用结构域包括SH3结构域,其配体为SH3ligand;所述苯丙氨酸解氨酶和4-香豆酸辅酶A分别与所述SH3结构域及其配体融合;较佳地,所述苯丙氨酸解氨酶与SH3融合、所述4-香豆酸辅酶A与SH3 ligand融合;更佳地,所述苯丙氨酸解氨酶与SH3融合时还包括以ER/K连接子连接(PAL-ER/K-SH3),所述4-香豆酸辅酶A与SH3 ligand融合时还包括以(GGGGS)2连接子连接(SH3lig-(GGGGS)2-4CL)。
在另一优选例中,所述苯丙氨酸解氨酶与PDZ融合时,所述苯丙氨酸解氨酶位于N端,所述PDZ位于C端。
在另一优选例中,所述4-香豆酸辅酶A与PDZ ligand融合时,所述PDZ ligand位于N端,所述4-香豆酸辅酶A位于C端。
在另一优选例中,所述苯丙氨酸解氨酶与SH3融合时;所述苯丙氨酸解氨酶位于N端,所述SH3位于C端。
在另一优选例中,所述4-香豆酸辅酶A与SH3 ligand融合时,所述SH3 ligand位于N端,所述4-香豆酸辅酶A位于C端。
在另一优选例中,所述细胞中还包括外源的促进丙二酰CoA生成的酶的编码基因;较佳地,包括matC,matB,ACS,FabF。
在另一优选例中,所述的原核细胞为大肠杆菌细胞。
在另一优选例中,所述细胞中还包括外源的促进苯丙氨酸合成的酶的编码基因;较佳地,包括:aroG,pheA;更佳地,所述pheA为第976位由A突变为C的基因;更佳地,所述aroG为第436位由G突变为A的基因。
在另一优选例中,所述“促进”为统计学意义的“促进”,例如促进5%以上,10%以上,20%以上,50%以上,80%以上,100%以上或更高。
在本发明的另一方面,提供所述的原核细胞的应用,用于合成黄芩素或野黄芩素类化合物。
在本发明的另一方面,提供所述的原核细胞的应用,用于合成白杨素类化合物。
在本发明的另一方面,提供一种合成黄芩素或野黄芩素类化合物的方法,包括:提供所述的原核细胞(含有F6H和CPR),以式(I)为底物,合成黄芩素或野黄芩素类化合物;
其中,R包括H或OH。
在本发明的另一方面,提供一种合成白杨素类化合物的方法,包括:提供所述的原核细胞(可不含有F6H和CPR),以式(I)为底物,合成白杨素类化合物。
在本发明的另一方面,提供一种合成黄芩素或野黄芩素类化合物或白杨素类化合物的方法,包括:提供所述的原核细胞,以葡萄糖为底物,合成黄芩素或野黄芩素类化合物或白杨素类化合物。
在一个优选例中,在引入细胞时,所述PDZligand、4-香豆酸辅酶A连接酶、苯丙氨酸解氨酶、ER/K、PDZ、黄酮合成酶I、查尔酮合成酶、查尔酮异构酶的编码基因位于一个构建体(质粒)中。
在另一优选例中,所述黄酮6-羟化酶,细胞色素P450氧化还原酶的编码基因位于一个构建体中,较佳地还包括2B1(细胞色素P450 2B1家族可溶性蛋白)基因。
在另一优选例中,所述matC,matB,ACS,FabF的编码基因位于一个构建体中。
在另一优选例中,所述SH3lig,4-香豆酸辅酶A连接酶,苯丙氨酸解氨酶,ER/K,SH3,查尔酮合成酶的编码基因位于一个构建体中。
在另一优选例中,所述查尔酮异构酶,黄酮合成酶I的编码基因位于一个构建体中。
在另一优选例中,所述matC、matB、ACS、FabF的编码基因,第976位由A突变为C的pheA基因(pheAfbr),第436位由G突变为A的aroG基因(aroGfbr)位于一个构建体中。
在本发明的另一方面,提供一种用于生产黄芩素或野黄芩素类化合物(试剂盒中含有F6H和CPR)或白杨素类化合物(试剂盒中可不含有F6H和CPR)的试剂盒,其包括所述的重组的宿主细胞。
在本发明的另一方面,提供一种用于建立合成黄芩素或野黄芩素类化合物或白杨素类化合物的宿主细胞的试剂盒,其包括:包含PDZligand、4-香豆酸辅酶A连接酶、苯丙氨酸解氨酶、ER/K、PDZ、黄酮合成酶I、查尔酮合成酶、查尔酮异构酶的编码基因的构建体;包含matC,matB,ACS,FabF的编码基因的构建体;包含SH3lig,4-香豆酸辅酶A连接酶,苯丙氨酸解氨酶,ER/K,SH3,查尔酮合成酶的编码基因的构建体;包含查尔酮异构酶,黄酮合成酶I的编码基因的构建体;包含matC、matB、ACS、FabF的编码基因,第976位由A突变为C的pheA基因(pheAfbr),第436位由G突变为A的aroG基因(aroGfbr)的构建体;可选地,还包含黄酮6-羟化酶,细胞色素P450氧化还原酶的编码基因的构建体;较佳地所述构建体还包含2B1基因。
在一个优选例中,所述的试剂盒中还包括:葡萄糖;或,式(I)底物。
本发明的其它方面由于本文的公开内容,对本领域的技术人员而言是显而易见的。
附图说明
图1、质粒pZZ41的构建示意图。
图2、质粒pZZ55的构建示意图。
图3、非自组装菌株DN-1、自组装菌株DN-2的黄芩素产量的柱形图。
图4、非自组装菌株DN-1、自组装菌株DN-2产生的黄芩素的HPLC检测图谱。
图5、对照菌株DN-3相比,自组装菌株DN-4的黄芩素产量的柱形图。
图6、非自组装菌株DN-1、自组装菌株DN-2的野黄芩素产量的柱形图。
图7、自组装工程菌株DN-6、非自组装工程菌株DN-5的黄芩素产量的柱形图。
图8、以苯丙氨酸为前体,进行发酵生成黄芩素、野黄岑素的合成途径示意图。
图9、以葡萄糖为前体,进行发酵生成黄芩素的合成途径示意图。
图10、非自组装的菌株JH-0相比与自组装菌株DN-0的白杨素产量比较。
具体实施方式
本发明人经过深入的研究,提供了新型的黄芩素或野黄芩素类化合物/白杨素类化合物生物合成的优化改造,可实现以原核生物为底盘利用酶自组装技术合成黄芩素或野黄芩素类化合物/白杨素类化合物,以及实现利用葡萄糖从头合成黄芩素或野黄芩类化合物/白杨素类化合物。本发明也揭示了优化改造后的宿主细胞及其应用。
如本文所用,“外源的”或“异源的”是指来自不同来源的两条或多条核酸或蛋白质序列之间的关系。
如本文所用,所述的“可操作地连接(相连)”或“操作性连接(相连)”是指两个或多个核酸区域或核酸序列的功能性的空间排列。例如:启动子区被置于相对于目的基因核酸序列的特定位置,使得核酸序列的转录受到该启动子区域的引导,从而,启动子区域被“可操作地连接”到该核酸序列上。
如本文所用,所述的“表达构建物”是指重组DNA分子,它包含预期的核酸编码序列,其可以包含一个或多个基因表达盒。所述的“构建物”通常被包含在表达载体中。
如本文所用,所述的PAL、4CL、CHS、CHI和FNSI蛋白是在表达系统中形成白杨素或芹菜素合成途径的蛋白。
如本文所用,所述的F6H和CPR蛋白是在表达系统中转化白杨素或芹菜素、生成黄芩素或野黄芩素类化合物的蛋白。
如本文所用,所述的matC、matB、ACS和/或FabF蛋白在表达系统中促进丙二酰CoA生成的酶。
如本文所用,所述的aroG或其突变体,pheA或其突变体在表达系统中促进苯丙氨酸合成。
野生型的上述蛋白或基因为本领域已经鉴定的,因此,可以从公众途径获得和制备。作为本发明的优选方式,PAL来源于红景天(Rhodotorula toruloides),其具有GenBank登录号AAA33883.1所示的序列;4CL来源于欧芹(Petroselium crispum),其具有GenBank登录号KF765780.1所示的序列;CHS来源于矮牵牛(Petunia X hybrida),其具有GenBank登录号KF765781.1所示的序列;CHI基因来源于苜蓿(Medicago sativa),其具有GenBank登录号KF765782.1所示的序列;FNSI来源于欧芹(Petroselium crispum),其具有Swiss-Prot登录号Q7XZQ8.1所示的序列。
野生型的F6H和CPR也是本领域已经鉴定的。作为本发明的优选方式,F6H来源于黄岑(Scutellaria baicalensis),其具有GenBank登录号ASW21050.1所示的序列。作为本发明的优选方式,CPR来自于拟南芥(Arabidopsis thaliana),其具有GenBank登录号NP_849472.2所示的序列。
野生型的matC、matB、ACS、FabF蛋白也是本领域已经鉴定的。作为本发明的优选方式,matC来源于豆科根瘤菌(Rhizobium leguminosarum),其具有GenBank登录号KF765784.1所示的序列;matB来源于豆科根瘤菌(Rhizobium leguminosarum),其具有GenBank登录号AGZ04579.1所示的序列;ACS来源于大肠杆菌(Escherichia coli),其具有GenBank登录号CP062211.1所示的序列;FabF来源于大肠杆菌(Escherichia coli),其具有GenBank登录号AP023237.1所示的序列。
黄芩素和野黄芩素是二个结构相似且重要的黄酮类化合物。黄芩素的分子式为C15H10O5,分子量为270.24,而野黄芩素的分子量为C15H10O6,分子量为286.24。它们的结构如下所示:
本发明人发现,利用宿主细胞生产黄芩素、野黄岑素类化合物(如黄芩素或野黄芩素)或其前体白杨素类化合物(如白杨素或芹菜素)的过程中,仍然存在产物含量不够高的情形,因此对多个参与反应的蛋白进行了分析,经过大量筛选和实验,获得了一种优选的改造方案,极为显著地提高了微生物,尤其是原核表达系统(原核细胞)如大肠杆菌中化合物的产量。
作为本发明的改造方案的一个方面,本发明人利用酶组装技术发酵生产黄芩素或野黄芩素。该方案的原理为:利用相互作用的蛋白对(例如PDZ和PDZ ligand)与黄芩素合成途径中的酶PAL和4CL进行融合,使PAL和4CL能够在大肠杆菌体内进行自发组装,形成双酶复合反应器,从而提目标化合物的产量。
本发明人首次在合成黄岑类化合物/白杨素类化合物的原核表达系统中发现将PAL与4CL构建成复合体(复合反应器),可极为有效地提高表达系统的产量。适用于使得PAL与4CL构成有活性的复合体的任何生物材料或技术手段可被应用于本发明中。作为本发明的优选方式,所述的蛋白-蛋白相互作用结构域可包括选自下组的结构域:PDZ结构域,SH3结构域,WW结构域,LIM结构域,DD结构域,PH结构域,EH结构域。作为本发明的更优选的方式,所述的蛋白-蛋白相互作用结构域可包括选自下组的结构域:PDZ结构域,SH3结构域;它们的相应的配体为PDZ ligand(PDZlig)或SH3 ligand(SH3lig)。
蛋白质-蛋白质相互作用主要由蛋白质结构域来高效介导。PDZ、SH3、WW等结构域可通过一个或多个识别“口袋”来识别和结合配体蛋白的一段保守的短肽序列。就PDZ结构域而言,它通常结合配体蛋白C末端4-5个氨基酸残基,其也能够结合配体蛋白的中间序列,与自身或其他结构域聚合,或与膜上的脂类结合。
本发明中,也可包括其它的将PAL与4CL构成复合体、且能保留所述PAL与4CL的生物学活性的方法,例如将它们进行融合,构成具有适合的空间结构的融合蛋白;可通过实验测试来确定融合蛋白的活性。PAL与4CL之间的融合可以是直接连接,也可以利用连接子(Linker)来进行连接。
作为本发明的改造方案的另一个方面,本发明人在原核表达系统中过表达aroG、特别是其aroGfbr,以及pheA、特别是其pheAfbr基因,构建获得高产苯丙氨酸原核表达系统,在该原核表达系统中引入外源的黄芩素或野黄芩素类化合物/白杨素类化合物合成途径,使该菌株能够利用葡萄糖从头合成黄芩素化合物/白杨素类化合物。
常用的原核表达系统包括大肠杆菌、枯草杆菌等;例如可为大肠杆菌细胞(E.coli),如大肠杆菌BL21(DE3)。
在上述优选的蛋白(包括上述野生型的蛋白,突变型的蛋白)的基础上,本发明还包括它们的类似物。这些类似物与天然蛋白的差别可以是氨基酸序列上的差异,也可以是不影响序列的修饰形式上的差异,或者兼而有之。这些蛋白包括天然或诱导的遗传变异体。诱导变异体可以通过各种技术得到,如通过辐射或暴露于诱变剂而产生随机诱变,还可通过定点诱变法或其他已知分子生物学的技术。类似物还包括具有不同于天然L-氨基酸的残基(如D-氨基酸)的类似物,以及具有非天然存在的或合成的氨基酸(如β、γ-氨基酸)的类似物。应理解,本发明的蛋白并不限于上述例举的代表性的蛋白。
在上述优选的蛋白(包括上述野生型的蛋白,突变型的蛋白)的基础上,本发明还包括与所述的蛋白同源性高(比如与所列举的具体蛋白序列的同源性为70%或更高;优选地同源性为80%或更高;更优选地同源性为90%或更高,如同源性95%,98%或99%)的、且具有相应多肽相同功能的蛋白也包括在本发明内。
本发明中列举了来自特定物种的蛋白或基因。应理解,虽然本发明中优选研究了获自特定物种的蛋白或基因,但是获自其它物种的与所述蛋白或基因高度同源(如具有60%以上,如70%,80%,85%、90%、95%、甚至98%序列相同性)的其它蛋白或基因也在本发明考虑的范围之内。
发明还涉及本发明还提供了编码本发明的蛋白或其保守性变异蛋白的多核苷酸序列。本发明的多核苷酸可以是DNA形式或RNA形式。DNA形式包括cDNA、基因组DNA或人工合成的DNA。DNA可以是单链的或是双链的。DNA可以是编码链或非编码链。编码本发明的突变体成熟蛋白的多核苷酸包括:只编码成熟蛋白的编码序列;成熟蛋白的编码序列和各种附加编码序列;成熟蛋白的编码序列(和任选的附加编码序列)以及非编码序列。
本发明还包括针对所述基因的序列,进行密码子优化后形成的多核苷酸序列,例如,根据宿主细胞的偏好进行密码子优化。
本发明中,还构建了高产黄芩素或野黄芩素类化合物的工程菌株,其中包括外源的下组酶的编码基因:F6H,CPR,PAL、4CL、CHS、CHI和FNSI;且所述酶被表达后,PAL和4CL构成复合体(复合反应器)。培养该重组菌株,并以苯丙氨酸或酪氨酸为底物,生产黄芩素或野黄芩素类化合物。以苯丙氨酸或酪氨酸为底物的生产,适合于规模化的化合物生产。
本发明中,还构建了高产白杨素类化合物的工程菌株,其中包括外源的下组酶的编码基因:PAL、4CL、CHS、CHI和FNSI;且所述酶被表达后,PAL和4CL构成复合体(复合反应器)。培养该重组菌株,并以苯丙氨酸或酪氨酸为底物,生产黄芩素或野黄芩素类化合物。以苯丙氨酸或酪氨酸为底物的生产,适合于规模化的化合物生产。
为本发明的优选方式,所述F6H还包括与之融合的多肽标签,所述的多肽标签例如选自:8RP,Sumo,MBP,2B1,或它们的组合;较佳地为2B1。所述的多肽标签与所述F6H之间,可以包含或不包含连接肽,所述的连接肽不影响两者的生物学活性。F6H于2B1连接,可获得一种改进的F6H突变体2B1trF6H。
还可以在上述的工程菌株中,进一步引入上述底物(苯丙氨酸或酪氨酸)的上游生成途径,例如包括:由葡萄糖或甘油通过糖酵解、磷酸戊糖途径、莽草酸途径生成苯丙氨酸或酪氨酸。应理解,基于此类途径来形成苯丙氨酸或酪氨酸的方案也包含在本发明中。通过本领域已知手段来加强所述形成苯丙氨酸或酪氨酸途径的方法可包含在本发明中。
作为一种优选方式,可以在上述以苯丙氨酸或酪氨酸为底物的工程菌株中,进一步引入外源的aroG、特别是其aroGfbr,以及pheA、特别是其pheAfbr,获得另一种重组菌株,该菌株能够以葡萄糖为底物,生产黄芩素类化合物/白杨素类化合物。以葡萄糖为底物的生产,成本低廉,非常适合于规模化的化合物生产。
在建立如本发明优化的表达系统以及利用其进行生产的基础上,本领域人员还可系统研提高黄芩素、野黄芩素类化合物或白杨素类化合物产量的一系列因素,包括基因的效率和适宜性、基因剂量和培养基。此外,也可通过扩大生产规模来提高目标化合物的产量。例如,在摇瓶规模、简单培养条件下的产量基础上,当进一步扩大生产规模、进行培养基流加方案(可以源源不断提供充沛的底物)或给予良好的发酵罐水平生产条件(如温度的优化控制、溶氧的优化控制等)时,其产量通常可增加2~1000倍。这些操作和优化方式也应被包含在本发明中。可以预期,本发明的重组原核细胞,在一些优化的设备和操作工艺中,目标产物的量会发生长足的增长。
在获得了发酵产物后,从发酵产物中提取目标化合物可以采用本领域已知的技术。可以采用一些公知技术如高效液相色谱来对产物进行分析鉴定,以确定获得了所需的化合物。
本发明的菌株稳定性好,并可实现在生物反应器中规模性培养及生产黄芩素或野黄芩素类化合物/白杨素类化合物。本发明优选的菌株的目标化合物得率非常高。
相对于传统的植物提取手段,微生物发酵具有速度快、受外界因素影响较小等优势;部分化合物通过微生物合成的产量远高于植物提取,已经成为天然产物获得的一种重要手段。本发明中,通过大肠杆菌生产黄芩素或野黄芩素类化合物/白杨素类化合物,实现目标化合物更经济、更方便的制造。
本发明还提供了用于生产黄芩素或野黄芩素类化合物工程菌株的试剂盒。此外,其中还可包括原核细胞的培养基,用于合成的底物如苯丙氨酸、酪氨酸或葡萄糖,黄芩素或野黄芩素类化合物分离或检测试剂。更佳地,所述试剂盒中还可包括说明进行生物合成黄层素的方法的使用说明书等。
本发明还提供了用于构建所述生产黄芩素或野黄芩素类化合物/白杨素类化合物,工程菌株的试剂盒,所述试剂盒中可包括一系列构建体,例如可参考本发明的实施例中所提供的构建体,也可以为含有所述基因但基因排列或串联方式不同的其它构建体。表达载体(表达构建物)的建立可以采用本领域技术人员熟悉的技术。在得知了所需选择的酶以及所需表达的细胞体系之后,本领域技术人员可以进行表达构建物的建立。基因序列可以被插入到不同的表达构建物(如表达载体)中,也可以被插入到同一表达构建物中,只要在转入到细胞后其编码的多肽能够被有效地表达和发挥活性即可。所述试剂盒中还可包括原核细胞,原核细胞的培养基,用于合成的底物如苯丙氨酸、酪氨酸或葡萄糖,黄芩素或野黄芩素类化合物分离或检测试剂。更佳地,所述试剂盒中还可包括说明进行生物合成黄芩素或野黄芩素的方法的使用说明书等。
下面结合具体实施例,进一步阐述本发明。应理解,这些实施例仅用于说明本发明而不用于限制本发明的范围。下列实施例中未注明具体条件的实验方法,通常按照常规条件如J.萨姆布鲁克等编著,分子克隆实验指南,第三版,科学出版社,2002中所述的条件,或按照制造厂商所建议的条件。
1.实验材料
多聚酶链式反应(PCR)胶回收试剂盒,质粒抽提试剂盒均为美国Axygen产品;聚合酶链式反应(PCR)高保真酶PrimeSTAR Max DNA Polymerase为日本宝生物公司(TAKARA)产品;限制性内切酶均为NEB产品。
标准品化合物黄芩素和野黄芩素购自上海源叶生物科技有限公司。其他试剂为国产分析纯或色谱纯试剂,购自国药集团化学试剂有限公司。
PCR使用Arktik Thermal Cycler(Thermo Fisher Scientific);恒温培养使用ZXGP-A2050恒温培养箱和ZWY-211G恒温培养振荡器;离心使用5418R高速冷冻式离心机和5418小型离心机(Eppendorf)。真空浓缩使用Concentrator plus浓缩仪(Eppendorf);OD600使用UV-1200紫外可见分光光度计检测(上海美谱达仪器有限公司)。旋转蒸发系统由IKA RV 10digital旋转蒸发仪(IKA)和MZ 2C NT化学隔膜泵、CVC3000真空控制器(vacuubrand)组成。高效液相色谱使用Dionex UltiMate 3000液相色谱系统(ThermoFisher Scientific)。
2、本发明所涉及的菌株及质粒
大肠杆菌DH10B用于基因克隆,大肠杆菌BL21(DE3)菌株用于蛋白表达和黄芩素、野黄芩素的生产。
pCDFDuet-1、pETDuet-1、pACYCDuet-1载体用于代谢途径基因装配。
5nm rigid linker ER/K序列:KAKLKEEEERKQREEEERIKRLEELAKRKEEERKGT。
3、酶的选择
野生型的上述蛋白或基因均为本领域已经鉴定的,因此,可以从公众途径获得和制备,具体如下:
PAL:来源于红景天(Rhodotorula toruloides),其具有GenBank登录号AAA33883.1所示的序列(RtPAL);
4CL:来源于欧芹(Petroselium crispum),其具有GenBank登录号KF765780.1所示的序列(Pc4CL);
CHS:来源于矮牵牛(Petunia X hybrida),其具有GenBank登录号KF765781.1所示的序列;
CHI基因:来源于苜蓿(Medicago sativa),其具有GenBank登录号KF765782.1所示的序列;
FNS I:来源于欧芹(Petroselium crispum),其具有Swiss-Prot登录号Q7XZQ8.1所示的序列;
F6H:来源于黄岑(Scutellaria baicalensis),其具有GenBank登录号ASW21050.1所示的序列;
CPR:来自于拟南芥(Arabidopsis thaliana),其具有GenBank登录号NP_849472.2所示的序列。
PDZ结构域:来自于小鼠Mouseα-syntrophin(syn),77-171氨基酸序列,其具有GenBank登录号EDL06069所示的序列。
matB基因:来源于豆科根瘤菌(Rhizobium leguminosarum),其具有GenBank登录号AGZ04579.1所示的序列。
matC基因:来源于豆科根瘤菌(Rhizobium leguminosarum),其具有GenBank登录号KF765784.1所示的序列。
ACS基因:来源于大肠杆菌(Escherichia coli),其具有GenBank登录号CP062211.1所示的序列。
PDZ ligand:序列为GVKESLV(SEQ ID NO:12)。
SH3结构域:AEYVRALFDFNGNDEEDLPFKKGDILRIRDKPEEQWWNAEDSEGKRGMIPVPYVEKY(SEQ ID NO:13)。
SH3lig:PPPALPPKRRR(SEQ ID NO:14)。
4、质粒的构建
构建含单个基因的质粒
(1)PAL和PDZ的融合蛋白质粒的构建
首先在PAL基因N端增加NcoI酶切位点,PDZ序列C端增加EcoRI酶切位点,利用Over-Lap PCR方法将PAL,5nm rigid linker ER/K,PDZ进行基因融合,载体骨架选择pCDFDuet-1,将Over-Lap PCR后得到的融合基因PAL-ER/K-PDZ连接到pCDFDuet-1的NcoI和EcoRI位点,得到pCDFDuet1-T7 PAL-ER/K-PDZ。
(2)4CL和PDZ ligand的融合蛋白质粒的构建
将PDZ ligand序列设计在上游引物中,N端增加NcoI酶切位点,4CL序列C端增加BamHI酶切位点,PCR得到PDZlig-4CL融合基因,将融合基因构建到pCDFDuet-1的NcoI和BamHI位点,得到pCDFDuet1-T7 PDZlig-4CL。
(3)4CL和PDZ ligand的融合蛋白质粒的构建
将PDZ ligand序列设计在下游引物中,4CL N端增加NcoI酶切位点,PDZ ligand C端增加BamHI酶切位点,PCR得到4CL-PDZlig融合基因,将融合基因构建到pCDFDuet-1的NcoI和BamHI位点,得到pCDFDuet1-T7 4CL-PDZlig。
(4)pheA基因从BL21(DE3)基因组克隆得到,在pheA基因N端增加NcoI酶切位点,C端增加BamHI酶切位点。利用PCR引物,将pheA基因的976位由A突变为C得到pheAfbr,载体骨架选择pETDuet-1,将pheAfbr基因连接到pCDFDuet-1的NcoI和BamHI位点,得到pCDFDuet1-T7 pheAfbr。
(5)aroG基因从BL21(DE3)基因组克隆得到,在aroG基因N端增加NcoI酶切位点,C端增加BamHI酶切位点。利用PCR引物,将aroG基因的436位由G突变为A得到aroGfbr,载体骨架选择pETDuet-1,将aroGfbr基因连接到pCDFDuet-1的NcoI和BamHI位点,得到pCDFDuet1-T7 aroGfbr。
5、构建携带多基因的质粒
pYH57(pCDFDuet1-T74CL-T7PAL-T7FNSI-T7CHS-T7CHI)的建立:参见引用文献1:Production of plant-specific flavones baicalein and scutellarein in anengineered E.coli from available phenylalanine and tyrosine。
pYH66(pETDuet1-T72B1trF6H-T7CPR)的建立:参见引用文献1:Production ofplant-specific flavones baicalein and scutellarein in an engineered E.colifrom available phenylalanine and tyrosine。
pYH38(pACYCDuet1-T7matC-T7matB-T7ACS-T7FabF)的建立:利用PCR得到matC基因和matB基因;在matC基因N端加上NcoI酶切位点,C端加上HindIII酶切位点,将matC基因连到pACYCDuet1的NcoI和HindIII位点(该位点前具有载体自带的T7启动子),获得T7matC。在T7matC基因位点N端加上HindIII位点,C端加上Acc65I位点,利用PCR得到T7matC基因,将T7matC基因连到pACYCDuet1的HindIII和Acc65I位点。在T7ACS基因N端加上Acc65I位点,C端加上NotI位点,利用PCR得到T7ACS基因,将T7ACS基因连接到pACYCDuet1的Acc65I和NotI位点。在T7FabF基因的N端加上NotI位点,在C端加上XbaI位点,PCR得到T7 FabF基因,将T7FabF基因连接到pACYCDuet1的NotI和XbaI位点。得到pACYCDuet1-T7matC-T7matB-T7ACS-T7FabF质粒。
pZZ12(pCDFDuet1-T7-CHI-FNSI)的建立:在CHI基因N端加上NcoI位点,C端加上HindIII位点,PCR得到CHI基因,将CHI基因连接到pCDFDuet1的NcoI和HindIII位点。在FNSI基因N端加上HindIII位点,C端加上EcoRI位点,PCR得到FNSI基因,连接到pCDFDuet1的HindIII和EcoRI位点。得到pCDFDuet1-T7-CHI-FNSI。
pZZ22(pET28a-SH3lig-T7-4CL-PAL-ER/K-SH3-CHS)的建立:在SH3lig基因N端加上NcoI位点,4CL基因C端加上BamHI位点,以pET28a为载体,利用Over-lap PCR将SH3lig和4CL融合得到融合基因SH3lig-4CL,将SH3lig-4CL连接到pET28a的NcoI和BamHI位点。Over-Lap PCR得到融合基因PAL-ER/K,在PAL-ER/K基因N端加上BamHI位点,SH3基因C端加上EcoRI位点,利用Over-lap PCR将PAL-ER/K基因和SH3基因融合,得到PAL-ER/K-SH3基因,将PAL-ER/K-SH3连接到pET28a的BamHI和EcoRI位点。在CHS基因N端加上EcoRI位点,C端加上SalI位点,将CHS连接到pET28a的EcoRI和SalI位点。得到质粒pZZ22(pET28a-SH3lig-T7-4CL-PAL-ER/K-SH3-CHS)。
pZZ23(pET28a-T7-4CL-PAL-ER/K-SH3-CHS)的建立:在4CL基因N端加上NcoI位点,4CL基因C端加上BamHI位点,以pET28a为载体,将4CL连接到pET28a的NcoI和BamHI位点。在PAL-ER/K基因N端加上BamHI位点,SH3基因C端加上EcoRI位点,利用Over-Lap PCR将PAL-ER/K基因和SH3基因融合,得到PAL-ER/K-SH3基因,将PAL-ER/K-SH3连接到pET28a的BamHI和EcoRI位点。在CHS基因N端加上EcoRI位点,C端加上SalI位点,将CHS连接到pET28a的EcoRI和SalI位点。得到质粒pZZ23(pET28a-T7-4CL-PAL-ER/K-SH3-CHS)。
pZZ41(pCDFDuet1-T7PDZlig-4CL-T7PAL-ER/K-PDZ-T7FNSI-T7CHS-T7CHI)的建立:将4(1)中构建的PAL-ER/K-PDZ融合基因N端增加BamHI酶切位点,C端增加EcoRI酶切位点,插入到pCDFDuet1中,获得pCDFDuet1-T7PAL-ER/K-PDZ。以pCDFDuet1-T7PAL-ER/K-PDZ为模板,PCR后将基因PAL-ER/K-PDZ构建到pCDFDuet1-T7PDZlig-4CL质粒的BamHI和EcoRI位点,得到pCDFDuet1-T7PDZlig-4CL-T7PAL-ER/K-PDZ。将T7FNSI-T7CHS-T7CHI基因利用HindIII和AvrII双酶切从pYH57质粒上酶切下来,连接到上述质粒的HindIII和AvrII酶切位点得到pZZ41(pCDFDuet1-T7PDZlig-4CL-T7PAL-ER/K-PDZ-T7FNSI-T7CHS-T7CHI)质粒。
pZZ52(pETDuet1-T7pheAfbr-T7aroGfbr)的建立:在pCDFDuet1-T7 aroGfbr质粒T7启动子前设计引物,增加BamHI酶切位点,C端增加EcoRI酶切位点,以pETDuet1-T7aroGfbr为模板,PCR得到T7aroGfbr片段,将该片段连接到pCDFDuet1-T7pheAfbr质粒的BamHI和EcoRI位点,得到pZZ52(pETDuet1-T7pheAfbr-T7aroGfbr)质粒。
pZZ55(pACYCDuet1-T7matC-T7matB-T7ACS-T7FabF-T7pheAfbr-T7aroGfbr)的建立:以pZZ52(pETDuet1-T7pheAfbr-T7aroGfbr)质粒为模板,T7启动子上游设计引物增加AvrII质粒,在C端增加AvrII酶切位点,PCR克隆得到T7pheAfbr-T7aroGfbr片段,利用一步克隆试剂盒将T7pheAfbr-T7aroGfbr连接到pYH38(pACYCDuet1-T7matC-T7matB-T7ACS-T7FabF)的AvrII位点,得到pZZ55(pACYCDuet1-T7matC-T7matB-T7ACS-T7FabF-T7pheAfbr-T7aroGfbr)质粒。
质粒详细信息见表1,细胞信息见表2,质粒构建示意图见图1~图2。
表1
表2
5、大肠杆菌摇瓶发酵黄芩素和野黄芩素
发酵菌株的准备:构建好的质粒转化大肠杆菌BL21(DE3),37℃倒置培养12h后挑阳性克隆于2mL LB抗性的培养基中,37℃,250rpm培养10h制备发酵种子菌,发酵工程菌的详细信息见表2。
种子液1%转接至10mL含2%葡萄糖的M9培养基中(培养基中加入相应的抗生素),37℃,250rpm培养至菌株OD=0.5,加入0.1mM的IPTG及500mg/L的苯丙氨酸,22℃,220rpm发酵3天,取样1mL,菌液超声破碎3次,等体积乙酸乙酯混匀萃取两次,12000rpm,2min离心转移有机相至新管,室温或30℃旋转蒸干后,加200μL甲醇复溶(浓缩5倍)充分混匀,12000rpm,2min后转移上清HPLC检测。
6、HPLC检测
液相检测条件:A相:0.1%甲酸水,B相:乙腈;分离条件:0-20min 20%B相-55%B相,20-22min 55%B相-100%B相,22-27min 100%B相,27-35min 100%B相-20%B相,35-40min,20%B相;检测波长:340nm,柱温:30℃。
色谱柱:Thermo syncronis C18反相柱(250mm×4.6mm,5μm)。
实施例
实施例1、蛋白的优化改造
1、PAL蛋白序列的优化改造
来自红景天(Rhodotorula toruloides)的PAL(RtPAL)长度为693aa(GenBank登录号AAA33883.1),具体序列如下(SEQ ID NO:1):
MAPRPTSQSQARTCPTTQVTQVDIVEKMLAAPTDSTLELDGYSLNLGDVVSAARKGRPVRVKDSDEIRSKIDKSVEFLRSQLSMSVYGVTTGFGGSADTRTEDAISLQKALLEHQLCGVLPSSFDSFRLGRGLENSLPLEVVRGAMTIRVNSLTRGHSAVRLVVLEALTNFLNHGITPIVPLRGTISASGDLSPLSYIAAAISGHPDSKVHVVHEGKEKILYAREAMALFNLEPVVLGPKEGLGLVNGTAVSASMATLALHDAHMLSLLSQSLTAMTVEAMVGHAGSFHPFLHDVTRPHPTQIEVAGNIRKLLEGSRFAVHHEEEVKVKDDEGILRQDRYPLRTSPQWLGPLVSDLIHAHAVLTIEAGQSTTDNPLIDVENKTSHHGGNFQAAAVANTMEKTRLGLAQIGKLNFTQLTEMLNAGMNRGLPSCLAAEDPSLSYHCKGLDIAAAAYTSELGHLANPVTTHVQPAEMANQAVNSLALISARRTTESNDVLSLLLATHLYCVLQAIDLRAIEFEFKKQFGPAIVSLIDQHFGSAMTGSNLRDELVEKVNKTLAKRLEQTNSYDLVPRWHDAFSFAAGTVVEVLSSTSLSLAAVNAWKVAAAESAISLTRQVRETFWSAASTSSPALSYLSPRTQILYAFVREELGVKARRGDVFLGKQEVTIGSNVSKIYEAIKSGRINNVLLKMLA*
改造1:本发明人针对SEQ ID NO:1进行序列改造,氨基酸,在C端加上5nm rigidlinker ER/K连接子,获得改进的PAL突变体PAL-ER/K,具体序列如下(SEQ ID NO:2):
改造2:本发明人针对SEQ ID NO:2进行序列改造,再在C端加上PDZ的氨基酸序列,获得改进的PAL-ER/K突变体PAL-ER/K-PDZ,具体序列如下(SEQ ID NO:3):
其中,PAL位于SEQ ID NO:3的第1~693位;ER/K位于SEQ ID NO:3的第694~729位;PDZ位于SEQ ID NO:3的第730~824位。
改造3:本发明人针对SEQ ID NO:2进行序列改造,再在C端加上SH3的氨基酸序列,获得改进的PAL-ER/K突变体PAL-ER/K-SH3,具体序列如下(SEQ ID NO:4):
其中,PAL位于SEQ ID NO:4的第1~693位;ER/K位于SEQ ID NO:3的第694~729位;PDZ位于SEQ ID NO:3的第730~786位。
2、4CL蛋白序列的优化改造
来自欧芹(Petroselium crispum)的4CL(Pc4CL)长度为544aa(GenBank登录号KF765780.1),具体序列如下(SEQ ID NO:5):
MGDCVAPKEDLIFRSKLPDIYIPKHLPLHTYCFENISKVGDKSCLINGATGETFTYSQVELLSRKVASGLNKLGIQQGDTIMLLLPNSPEYFFAFLGASYRGAISTMANPFFTSAEVIKQLKASQAKLIITQACYVDKVKDYAAEKNIQIICIDDAPQDCLHFSKLMEADESEMPEVVINSDDVVALPYSSGTTGLPKGVMLTHKGLVTSVAQQVDGDNPNLYMHSEDVMICILPLFHIYSLNAVLCCGLRAGVTILIMQKFDIVPFLELIQKYKVTIGPFVPPIVLAIAKSPVVDKYDLSSVRTVMSGAAPLGKELEDAVRAKFPNAKLGQGYGMTEAGPVLAMCLAFAKEPYEIKSGACGTVVRNAEMKIVDPETNASLPRNQRGEICIRGDQIMKGYLNDPESTRTTIDEEGWLHTGDIGFIDDDDELFIVDRLKEIIKYKGFQVAPAELEALLLTHPTISDAAVVPMIDEKAGEVPVAFVVRTNGFTTTEEEIKQFVSKQVVFYKRIFRVFFVDAIPKSPSGKILRKDLRARIASGDLPK*
改造1:本发明人针对SEQ ID NO:5进行序列改造,去除其中第1位氨基酸,再在N端加上(GGGGS)2的氨基酸序列,获得改进的4CL突变体(GGGGS)2-4CL,具体序列如下(SEQ IDNO:6):
改造2:本发明人针对SEQ ID NO:6进行序列改造,再在N端加上PDZlig的氨基酸序列,再N端加上M氨基酸,获得改进的(GGGGS)2-4CL突变体PDZlig-(GGGGS)2-4CL,具体序列如下(SEQ ID NO:7):
其中,PDZlig位于SEQ ID NO:7的第1~8位;(GGGGS)2位于SEQ ID NO:7的第9~18位;4CL位于SEQ ID NO:7的第19~561位。
改造3:本发明人针对SEQ ID NO:5进行序列改造,在C端加上(GGGGS)2的氨基酸序列,获得改进的4CL突变体4CL-(GGGGS)2,具体序列如下(SEQ ID NO:8):
改造4:本发明人针对SEQ ID NO:8进行序列改造,在C端加上PDZlig的氨基酸序列,获得改进的4CL突变体4CL-(GGGGS)2-PDZlig,具体序列如下(SEQ ID NO:9):
其中,4CL位于SEQ ID NO:9的第1~544位;(GGGGS)2位于SEQ ID NO:10的第545~554位;PDZlig位于SEQ ID NO:9的第555~561位。
改造5:本发明人针对SEQ ID NO:6进行序列改造,再在N端加上SH3lig的氨基酸序列,再N端加上M氨基酸,获得改进的(GGGGS)2-4CL突变体SH3lig-(GGGGS)2-4CL(简称SH3lig-4CL),具体序列如下(SEQ ID NO:10):
其中,SH3lig位于SEQ ID NO:10的第1~12位;(GGGGS)2位于SEQ ID NO:10的第13~22位;4CL位于SEQ ID NO:10的第23~565位。
改造6:本发明人针对SEQ ID NO:8进行序列改造,在C端加上SH3lig的氨基酸序列,获得改进的4CL突变体4CL-(GGGGS)2-SH3lig,具体序列如下(SEQ ID NO:11):
其中,4CL位于SEQ ID NO:11的第1~544位;(GGGGS)2位于SEQ ID NO:10的第545~554位;SH3lig位于SEQ ID NO:10的第555~565位。
实施例2、发酵工程菌检测白杨素
1、非自组装菌株(JH-0)
将pYH57(pCDFDuet1-T74CL-T7PAL-T7FNSI-T7CHS-T7CHI)质粒转化入BL21(DE3),得到工程菌DN-1,用于以苯丙氨酸为前体,发酵白杨素。
2、4CL-改造2、PAL-改造2菌株(DN-0)
将pZZ41(pCDFDuet1-T7PDZlig-4CL-T7PAL-ER/K-PDZ-T7FNSI-T7CHS-T7CHI)质粒转化入BL21(DE3),得到自组装工程菌DN-0,用于以苯丙氨酸为前体,发酵白杨素。
发酵方法如下:菌株LB固体培养基(壮观霉素80μg/mL)37℃培养过夜。挑取单个克隆到2mL LB液体培养基(壮观霉素80μg/mL),转接过夜培养的菌液到新的10mL M9Y液体抗性培养基中37℃,250r/min培养至OD600=0.5~0.6,水浴降温至16℃左右,然后加入诱导剂IPTG至终浓度0.2mM,加入终浓度为500mg/L经灭菌的苯丙氨酸并转至22℃低温诱导培养,在摇床转速220r/min条件下继续培养72h。取样检测黄芩素产量。
结果显示,与非自组装的菌株JH-0相比,自组装菌株DN-0的白杨素产量,提高到1.6倍(图10)。
实施例3、发酵工程菌检测黄芩素
本实施例中,以苯丙氨酸为前体,进行发酵,合成途径如图8。
1、非自组装菌株(DN-1)
将pYH57(pCDFDuet1-T74CL-T7PAL-T7FNSI-T7CHS-T7CHI)质粒,pYH66(pETDuet1-T72B1-trF6H-T7CPR)质粒,pYH38(pACYCDuet1-T7matC-T7matB-T7ACS-T7FabF)质粒共同转化入BL21(DE3),得到工程菌DN-1,用于以苯丙氨酸为前体,发酵黄芩素。
2、4CL-改造2、PAL-改造2菌株(DN-2)
将pZZ41(pCDFDuet1-T7PDZlig-4CL-T7PAL-ER/K-PDZ-T7FNSI-T7CHS-T7CHI)质粒,pYH66(pETDuet1-T72B1-trF6H-T7CPR)质粒,pYH38(pACYCDuet1-T7matC-T7matB-T7ACS-T7FabF)质粒共同转化入BL21(DE3),得到自组装工程菌DN-2,用于以苯丙氨酸为前体发酵黄芩素。
3、4CL无改造、PAL-改造3菌株(DN-3,作为对照菌株)
将pZZ23(pET28a-T7-4CL-PAL-ER/K-SH3-CHS)质粒,pZZ12(pCDFDuet1-CHI-FNSI)质粒,质粒共同转化入BL21(DE3),得到对照工程菌DN-3,以苯丙氨酸为前体,用于以苯丙氨酸为前体发酵黄芩素。
4、4CL-改造5、PAL-改造3菌株(DN-4)
将pZZ22(pET28a-SH3lig-T7-4CL-PAL-ER/K-SH3-CHS)质粒,pZZ12(pCDFDuet1-T7-CHI-FNSI)质粒,质粒共同转化入BL21(DE3),得到自组装工程菌DN-4,用于以苯丙氨酸为前体发酵黄芩素。
上述四种菌株进行发酵,发酵方法如下:菌株LB固体培养基(壮观霉素80μg/mL,氨苄青霉素100μg/mL,氯霉素34μg/mL)37℃培养过夜。挑取单个克隆到2mL LB液体培养基(壮观霉素80μg/mL,氨苄青霉素100μg/mL,氯霉素34μg/mL),转接过夜培养的菌液到新的10mLM9Y液体抗性培养基中37℃,250r/min培养至OD600=0.5~0.6,水浴降温至16℃左右,然后加入诱导剂IPTG至终浓度0.2mM,加入终浓度为500mg/L经灭菌的苯丙氨酸并转至22℃低温诱导培养,在摇床转速220r/min条件下继续培养72h。取样检测黄芩素产量。
结果显示,与非自组装的菌株DN-1相比,自组装菌株DN-2的黄芩素产量,提高6.6倍(图3)。HPLC检测图谱如图4。
结果显示,与对照菌株DN-3相比,自组装菌株DN-4的黄芩素产量,提高2.5倍(图5)。
根据上述结果,说明本发明的改造方案能够显著提高黄芩素产量,如表3。
表3
对于多种改造方案进行比较,本发明人优选PAL改造2、4CL改造2或PAL改造3、4CL改造5。
实施例4、发酵工程菌检测野黄芩素
本实施例中,以酪氨酸为前体,进行发酵。
如前一实施例获得非组装的工程菌DN-1,用于以酪氨酸为前体发酵黄芩素。
如前一实施例得到自组装工程菌DN-2,用于以酪氨酸为前体发酵黄芩素。
上述两种菌株,在LB固体培养基(壮观霉素80μg/mL,氨苄青霉素100μg/mL,氯霉素34μg/mL)37℃培养过夜。挑取单个克隆到2mL LB液体培养基(壮观霉素80μg/mL,氨苄青霉素100μg/mL,氯霉素34μg/mL),转接过夜培养的菌液到新的10mL M9Y液体抗性培养基中37℃,250r/min培养至OD600=0.5-0.6,水浴降温至16℃左右,然后加入诱导剂IPTG至终浓度0.2mM,加入终浓度为500mg/L经灭菌的酪氨酸并转至22℃低温诱导培养,在摇床转速220r/min条件下继续培养72h。
取样检测生长情况,野黄芩素产量。与菌株DN-1相比,自组装菌株DN-2的野黄芩素产量,提高1.43倍(图6)。
实施例5、从葡萄糖合成黄芩素
本实施例中,以葡萄糖为前体,进行发酵,合成途径如图9。
1、非自组装菌株(DN-5)
将pYH57(pCDFDuet1-T74CL-T7PAL-T7FNSI-T7CHS-T7CHI)质粒,pYH66(pETDuet1-T72B1trF6H-T7CPR)质粒,pZZ55(pACYCDuet1-T7matC-T7matB-T7ACS-T7FabF-T7pheAfbr-T7aroGfbr)质粒共同转化入BL21(DE3),得到非自组装工程菌DN-5,用于从葡萄糖合成黄芩素。
2、4CL改造2,PAL改造2菌株(DN-5)
将pZZ41(pCDFDuet1-T7PDZlig-4CL-T7PAL-ER/K-PDZ-T7FNSI-T7CHS-T7CHI)质粒,pYH66(pETDuet1-T72B1trF6H-T7CPR)质粒,pZZ55(pACYCDuet1-T7matC-T7matB-T7ACS-T7FabF-T7pheAfbr-T7aroGfbr)质粒共同转化入BL21(DE3),得到采用PDZ和PDZlig互作方案的自组装工程菌DN-6,用于从葡萄糖合成黄芩素。
上述建立的两种菌株,LB固体培养基(壮观霉素80μg/mL,氨苄青霉素100μg/mL,氯霉素34μg/mL)37℃培养过夜。挑取单个克隆到2mL LB液体培养基(壮观霉素80μg/mL,氨苄青霉素100μg/mL,氯霉素34μg/mL),转接过夜培养的菌液到新的10mL M9Y液体抗性培养基中37℃,250r/min培养至OD600=0.5-0.6,水浴降温至16℃左右,然后加入诱导剂IPTG至终浓度0.2mM,并转至22℃低温诱导培养,在摇床转速220r/min条件下继续培养72h。
结果显示,非自组装工程菌DN-5能够从葡萄糖中合成黄芩素,自组装工程菌株DN-6能够从葡萄糖中合成黄芩素,自组装工程菌株DN-6比非自组装工程菌株DN-5提高了111.7%(图7)。
在本发明提及的所有文献都在本申请中引用作为参考,就如同每一篇文献被单独引用作为参考那样。此外应理解,在阅读了本发明的上述讲授内容之后,本领域技术人员可以对本发明作各种改动或修改,这些等价形式同样落于本申请所附权利要求书所限定的范围。
序列表
<110> 中国科学院分子植物科学卓越创新中心
<120> 异源合成黄酮类化合物的宿主细胞及其应用
<130> 209377
<160> 14
<170> SIPOSequenceListing 1.0
<210> 1
<211> 693
<212> PRT
<213> 红景天(Rhodotorula toruloides)
<400> 1
Met Ala Pro Arg Pro Thr Ser Gln Ser Gln Ala Arg Thr Cys Pro Thr
1 5 10 15
Thr Gln Val Thr Gln Val Asp Ile Val Glu Lys Met Leu Ala Ala Pro
20 25 30
Thr Asp Ser Thr Leu Glu Leu Asp Gly Tyr Ser Leu Asn Leu Gly Asp
35 40 45
Val Val Ser Ala Ala Arg Lys Gly Arg Pro Val Arg Val Lys Asp Ser
50 55 60
Asp Glu Ile Arg Ser Lys Ile Asp Lys Ser Val Glu Phe Leu Arg Ser
65 70 75 80
Gln Leu Ser Met Ser Val Tyr Gly Val Thr Thr Gly Phe Gly Gly Ser
85 90 95
Ala Asp Thr Arg Thr Glu Asp Ala Ile Ser Leu Gln Lys Ala Leu Leu
100 105 110
Glu His Gln Leu Cys Gly Val Leu Pro Ser Ser Phe Asp Ser Phe Arg
115 120 125
Leu Gly Arg Gly Leu Glu Asn Ser Leu Pro Leu Glu Val Val Arg Gly
130 135 140
Ala Met Thr Ile Arg Val Asn Ser Leu Thr Arg Gly His Ser Ala Val
145 150 155 160
Arg Leu Val Val Leu Glu Ala Leu Thr Asn Phe Leu Asn His Gly Ile
165 170 175
Thr Pro Ile Val Pro Leu Arg Gly Thr Ile Ser Ala Ser Gly Asp Leu
180 185 190
Ser Pro Leu Ser Tyr Ile Ala Ala Ala Ile Ser Gly His Pro Asp Ser
195 200 205
Lys Val His Val Val His Glu Gly Lys Glu Lys Ile Leu Tyr Ala Arg
210 215 220
Glu Ala Met Ala Leu Phe Asn Leu Glu Pro Val Val Leu Gly Pro Lys
225 230 235 240
Glu Gly Leu Gly Leu Val Asn Gly Thr Ala Val Ser Ala Ser Met Ala
245 250 255
Thr Leu Ala Leu His Asp Ala His Met Leu Ser Leu Leu Ser Gln Ser
260 265 270
Leu Thr Ala Met Thr Val Glu Ala Met Val Gly His Ala Gly Ser Phe
275 280 285
His Pro Phe Leu His Asp Val Thr Arg Pro His Pro Thr Gln Ile Glu
290 295 300
Val Ala Gly Asn Ile Arg Lys Leu Leu Glu Gly Ser Arg Phe Ala Val
305 310 315 320
His His Glu Glu Glu Val Lys Val Lys Asp Asp Glu Gly Ile Leu Arg
325 330 335
Gln Asp Arg Tyr Pro Leu Arg Thr Ser Pro Gln Trp Leu Gly Pro Leu
340 345 350
Val Ser Asp Leu Ile His Ala His Ala Val Leu Thr Ile Glu Ala Gly
355 360 365
Gln Ser Thr Thr Asp Asn Pro Leu Ile Asp Val Glu Asn Lys Thr Ser
370 375 380
His His Gly Gly Asn Phe Gln Ala Ala Ala Val Ala Asn Thr Met Glu
385 390 395 400
Lys Thr Arg Leu Gly Leu Ala Gln Ile Gly Lys Leu Asn Phe Thr Gln
405 410 415
Leu Thr Glu Met Leu Asn Ala Gly Met Asn Arg Gly Leu Pro Ser Cys
420 425 430
Leu Ala Ala Glu Asp Pro Ser Leu Ser Tyr His Cys Lys Gly Leu Asp
435 440 445
Ile Ala Ala Ala Ala Tyr Thr Ser Glu Leu Gly His Leu Ala Asn Pro
450 455 460
Val Thr Thr His Val Gln Pro Ala Glu Met Ala Asn Gln Ala Val Asn
465 470 475 480
Ser Leu Ala Leu Ile Ser Ala Arg Arg Thr Thr Glu Ser Asn Asp Val
485 490 495
Leu Ser Leu Leu Leu Ala Thr His Leu Tyr Cys Val Leu Gln Ala Ile
500 505 510
Asp Leu Arg Ala Ile Glu Phe Glu Phe Lys Lys Gln Phe Gly Pro Ala
515 520 525
Ile Val Ser Leu Ile Asp Gln His Phe Gly Ser Ala Met Thr Gly Ser
530 535 540
Asn Leu Arg Asp Glu Leu Val Glu Lys Val Asn Lys Thr Leu Ala Lys
545 550 555 560
Arg Leu Glu Gln Thr Asn Ser Tyr Asp Leu Val Pro Arg Trp His Asp
565 570 575
Ala Phe Ser Phe Ala Ala Gly Thr Val Val Glu Val Leu Ser Ser Thr
580 585 590
Ser Leu Ser Leu Ala Ala Val Asn Ala Trp Lys Val Ala Ala Ala Glu
595 600 605
Ser Ala Ile Ser Leu Thr Arg Gln Val Arg Glu Thr Phe Trp Ser Ala
610 615 620
Ala Ser Thr Ser Ser Pro Ala Leu Ser Tyr Leu Ser Pro Arg Thr Gln
625 630 635 640
Ile Leu Tyr Ala Phe Val Arg Glu Glu Leu Gly Val Lys Ala Arg Arg
645 650 655
Gly Asp Val Phe Leu Gly Lys Gln Glu Val Thr Ile Gly Ser Asn Val
660 665 670
Ser Lys Ile Tyr Glu Ala Ile Lys Ser Gly Arg Ile Asn Asn Val Leu
675 680 685
Leu Lys Met Leu Ala
690
<210> 2
<211> 729
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(729)
<223> PAL-ER/K
<400> 2
Met Ala Pro Arg Pro Thr Ser Gln Ser Gln Ala Arg Thr Cys Pro Thr
1 5 10 15
Thr Gln Val Thr Gln Val Asp Ile Val Glu Lys Met Leu Ala Ala Pro
20 25 30
Thr Asp Ser Thr Leu Glu Leu Asp Gly Tyr Ser Leu Asn Leu Gly Asp
35 40 45
Val Val Ser Ala Ala Arg Lys Gly Arg Pro Val Arg Val Lys Asp Ser
50 55 60
Asp Glu Ile Arg Ser Lys Ile Asp Lys Ser Val Glu Phe Leu Arg Ser
65 70 75 80
Gln Leu Ser Met Ser Val Tyr Gly Val Thr Thr Gly Phe Gly Gly Ser
85 90 95
Ala Asp Thr Arg Thr Glu Asp Ala Ile Ser Leu Gln Lys Ala Leu Leu
100 105 110
Glu His Gln Leu Cys Gly Val Leu Pro Ser Ser Phe Asp Ser Phe Arg
115 120 125
Leu Gly Arg Gly Leu Glu Asn Ser Leu Pro Leu Glu Val Val Arg Gly
130 135 140
Ala Met Thr Ile Arg Val Asn Ser Leu Thr Arg Gly His Ser Ala Val
145 150 155 160
Arg Leu Val Val Leu Glu Ala Leu Thr Asn Phe Leu Asn His Gly Ile
165 170 175
Thr Pro Ile Val Pro Leu Arg Gly Thr Ile Ser Ala Ser Gly Asp Leu
180 185 190
Ser Pro Leu Ser Tyr Ile Ala Ala Ala Ile Ser Gly His Pro Asp Ser
195 200 205
Lys Val His Val Val His Glu Gly Lys Glu Lys Ile Leu Tyr Ala Arg
210 215 220
Glu Ala Met Ala Leu Phe Asn Leu Glu Pro Val Val Leu Gly Pro Lys
225 230 235 240
Glu Gly Leu Gly Leu Val Asn Gly Thr Ala Val Ser Ala Ser Met Ala
245 250 255
Thr Leu Ala Leu His Asp Ala His Met Leu Ser Leu Leu Ser Gln Ser
260 265 270
Leu Thr Ala Met Thr Val Glu Ala Met Val Gly His Ala Gly Ser Phe
275 280 285
His Pro Phe Leu His Asp Val Thr Arg Pro His Pro Thr Gln Ile Glu
290 295 300
Val Ala Gly Asn Ile Arg Lys Leu Leu Glu Gly Ser Arg Phe Ala Val
305 310 315 320
His His Glu Glu Glu Val Lys Val Lys Asp Asp Glu Gly Ile Leu Arg
325 330 335
Gln Asp Arg Tyr Pro Leu Arg Thr Ser Pro Gln Trp Leu Gly Pro Leu
340 345 350
Val Ser Asp Leu Ile His Ala His Ala Val Leu Thr Ile Glu Ala Gly
355 360 365
Gln Ser Thr Thr Asp Asn Pro Leu Ile Asp Val Glu Asn Lys Thr Ser
370 375 380
His His Gly Gly Asn Phe Gln Ala Ala Ala Val Ala Asn Thr Met Glu
385 390 395 400
Lys Thr Arg Leu Gly Leu Ala Gln Ile Gly Lys Leu Asn Phe Thr Gln
405 410 415
Leu Thr Glu Met Leu Asn Ala Gly Met Asn Arg Gly Leu Pro Ser Cys
420 425 430
Leu Ala Ala Glu Asp Pro Ser Leu Ser Tyr His Cys Lys Gly Leu Asp
435 440 445
Ile Ala Ala Ala Ala Tyr Thr Ser Glu Leu Gly His Leu Ala Asn Pro
450 455 460
Val Thr Thr His Val Gln Pro Ala Glu Met Ala Asn Gln Ala Val Asn
465 470 475 480
Ser Leu Ala Leu Ile Ser Ala Arg Arg Thr Thr Glu Ser Asn Asp Val
485 490 495
Leu Ser Leu Leu Leu Ala Thr His Leu Tyr Cys Val Leu Gln Ala Ile
500 505 510
Asp Leu Arg Ala Ile Glu Phe Glu Phe Lys Lys Gln Phe Gly Pro Ala
515 520 525
Ile Val Ser Leu Ile Asp Gln His Phe Gly Ser Ala Met Thr Gly Ser
530 535 540
Asn Leu Arg Asp Glu Leu Val Glu Lys Val Asn Lys Thr Leu Ala Lys
545 550 555 560
Arg Leu Glu Gln Thr Asn Ser Tyr Asp Leu Val Pro Arg Trp His Asp
565 570 575
Ala Phe Ser Phe Ala Ala Gly Thr Val Val Glu Val Leu Ser Ser Thr
580 585 590
Ser Leu Ser Leu Ala Ala Val Asn Ala Trp Lys Val Ala Ala Ala Glu
595 600 605
Ser Ala Ile Ser Leu Thr Arg Gln Val Arg Glu Thr Phe Trp Ser Ala
610 615 620
Ala Ser Thr Ser Ser Pro Ala Leu Ser Tyr Leu Ser Pro Arg Thr Gln
625 630 635 640
Ile Leu Tyr Ala Phe Val Arg Glu Glu Leu Gly Val Lys Ala Arg Arg
645 650 655
Gly Asp Val Phe Leu Gly Lys Gln Glu Val Thr Ile Gly Ser Asn Val
660 665 670
Ser Lys Ile Tyr Glu Ala Ile Lys Ser Gly Arg Ile Asn Asn Val Leu
675 680 685
Leu Lys Met Leu Ala Lys Ala Lys Leu Lys Glu Glu Glu Glu Arg Lys
690 695 700
Gln Arg Glu Glu Glu Glu Arg Ile Lys Arg Leu Glu Glu Leu Ala Lys
705 710 715 720
Arg Lys Glu Glu Glu Arg Lys Gly Thr
725
<210> 3
<211> 824
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(786)
<223> PAL-ER/K-SH3
<400> 3
Met Ala Pro Arg Pro Thr Ser Gln Ser Gln Ala Arg Thr Cys Pro Thr
1 5 10 15
Thr Gln Val Thr Gln Val Asp Ile Val Glu Lys Met Leu Ala Ala Pro
20 25 30
Thr Asp Ser Thr Leu Glu Leu Asp Gly Tyr Ser Leu Asn Leu Gly Asp
35 40 45
Val Val Ser Ala Ala Arg Lys Gly Arg Pro Val Arg Val Lys Asp Ser
50 55 60
Asp Glu Ile Arg Ser Lys Ile Asp Lys Ser Val Glu Phe Leu Arg Ser
65 70 75 80
Gln Leu Ser Met Ser Val Tyr Gly Val Thr Thr Gly Phe Gly Gly Ser
85 90 95
Ala Asp Thr Arg Thr Glu Asp Ala Ile Ser Leu Gln Lys Ala Leu Leu
100 105 110
Glu His Gln Leu Cys Gly Val Leu Pro Ser Ser Phe Asp Ser Phe Arg
115 120 125
Leu Gly Arg Gly Leu Glu Asn Ser Leu Pro Leu Glu Val Val Arg Gly
130 135 140
Ala Met Thr Ile Arg Val Asn Ser Leu Thr Arg Gly His Ser Ala Val
145 150 155 160
Arg Leu Val Val Leu Glu Ala Leu Thr Asn Phe Leu Asn His Gly Ile
165 170 175
Thr Pro Ile Val Pro Leu Arg Gly Thr Ile Ser Ala Ser Gly Asp Leu
180 185 190
Ser Pro Leu Ser Tyr Ile Ala Ala Ala Ile Ser Gly His Pro Asp Ser
195 200 205
Lys Val His Val Val His Glu Gly Lys Glu Lys Ile Leu Tyr Ala Arg
210 215 220
Glu Ala Met Ala Leu Phe Asn Leu Glu Pro Val Val Leu Gly Pro Lys
225 230 235 240
Glu Gly Leu Gly Leu Val Asn Gly Thr Ala Val Ser Ala Ser Met Ala
245 250 255
Thr Leu Ala Leu His Asp Ala His Met Leu Ser Leu Leu Ser Gln Ser
260 265 270
Leu Thr Ala Met Thr Val Glu Ala Met Val Gly His Ala Gly Ser Phe
275 280 285
His Pro Phe Leu His Asp Val Thr Arg Pro His Pro Thr Gln Ile Glu
290 295 300
Val Ala Gly Asn Ile Arg Lys Leu Leu Glu Gly Ser Arg Phe Ala Val
305 310 315 320
His His Glu Glu Glu Val Lys Val Lys Asp Asp Glu Gly Ile Leu Arg
325 330 335
Gln Asp Arg Tyr Pro Leu Arg Thr Ser Pro Gln Trp Leu Gly Pro Leu
340 345 350
Val Ser Asp Leu Ile His Ala His Ala Val Leu Thr Ile Glu Ala Gly
355 360 365
Gln Ser Thr Thr Asp Asn Pro Leu Ile Asp Val Glu Asn Lys Thr Ser
370 375 380
His His Gly Gly Asn Phe Gln Ala Ala Ala Val Ala Asn Thr Met Glu
385 390 395 400
Lys Thr Arg Leu Gly Leu Ala Gln Ile Gly Lys Leu Asn Phe Thr Gln
405 410 415
Leu Thr Glu Met Leu Asn Ala Gly Met Asn Arg Gly Leu Pro Ser Cys
420 425 430
Leu Ala Ala Glu Asp Pro Ser Leu Ser Tyr His Cys Lys Gly Leu Asp
435 440 445
Ile Ala Ala Ala Ala Tyr Thr Ser Glu Leu Gly His Leu Ala Asn Pro
450 455 460
Val Thr Thr His Val Gln Pro Ala Glu Met Ala Asn Gln Ala Val Asn
465 470 475 480
Ser Leu Ala Leu Ile Ser Ala Arg Arg Thr Thr Glu Ser Asn Asp Val
485 490 495
Leu Ser Leu Leu Leu Ala Thr His Leu Tyr Cys Val Leu Gln Ala Ile
500 505 510
Asp Leu Arg Ala Ile Glu Phe Glu Phe Lys Lys Gln Phe Gly Pro Ala
515 520 525
Ile Val Ser Leu Ile Asp Gln His Phe Gly Ser Ala Met Thr Gly Ser
530 535 540
Asn Leu Arg Asp Glu Leu Val Glu Lys Val Asn Lys Thr Leu Ala Lys
545 550 555 560
Arg Leu Glu Gln Thr Asn Ser Tyr Asp Leu Val Pro Arg Trp His Asp
565 570 575
Ala Phe Ser Phe Ala Ala Gly Thr Val Val Glu Val Leu Ser Ser Thr
580 585 590
Ser Leu Ser Leu Ala Ala Val Asn Ala Trp Lys Val Ala Ala Ala Glu
595 600 605
Ser Ala Ile Ser Leu Thr Arg Gln Val Arg Glu Thr Phe Trp Ser Ala
610 615 620
Ala Ser Thr Ser Ser Pro Ala Leu Ser Tyr Leu Ser Pro Arg Thr Gln
625 630 635 640
Ile Leu Tyr Ala Phe Val Arg Glu Glu Leu Gly Val Lys Ala Arg Arg
645 650 655
Gly Asp Val Phe Leu Gly Lys Gln Glu Val Thr Ile Gly Ser Asn Val
660 665 670
Ser Lys Ile Tyr Glu Ala Ile Lys Ser Gly Arg Ile Asn Asn Val Leu
675 680 685
Leu Lys Met Leu Ala Lys Ala Lys Leu Lys Glu Glu Glu Glu Arg Lys
690 695 700
Gln Arg Glu Glu Glu Glu Arg Ile Lys Arg Leu Glu Glu Leu Ala Lys
705 710 715 720
Arg Lys Glu Glu Glu Arg Lys Gly Thr Leu Gln Arg Arg Arg Val Thr
725 730 735
Val Arg Lys Ala Asp Ala Gly Gly Leu Gly Ile Ser Ile Lys Gly Gly
740 745 750
Arg Glu Asn Lys Met Pro Ile Leu Ile Ser Lys Ile Phe Lys Gly Leu
755 760 765
Ala Ala Asp Gln Thr Glu Ala Leu Phe Val Gly Asp Ala Ile Leu Ser
770 775 780
Val Asn Gly Glu Asp Leu Ser Ser Ala Thr His Asp Glu Ala Val Gln
785 790 795 800
Ala Leu Lys Lys Thr Gly Lys Glu Val Val Leu Glu Val Lys Tyr Met
805 810 815
Lys Glu Val Ser Pro Tyr Phe Lys
820
<210> 4
<211> 786
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(786)
<223> PAL-ER/K-SH3
<400> 4
Met Ala Pro Arg Pro Thr Ser Gln Ser Gln Ala Arg Thr Cys Pro Thr
1 5 10 15
Thr Gln Val Thr Gln Val Asp Ile Val Glu Lys Met Leu Ala Ala Pro
20 25 30
Thr Asp Ser Thr Leu Glu Leu Asp Gly Tyr Ser Leu Asn Leu Gly Asp
35 40 45
Val Val Ser Ala Ala Arg Lys Gly Arg Pro Val Arg Val Lys Asp Ser
50 55 60
Asp Glu Ile Arg Ser Lys Ile Asp Lys Ser Val Glu Phe Leu Arg Ser
65 70 75 80
Gln Leu Ser Met Ser Val Tyr Gly Val Thr Thr Gly Phe Gly Gly Ser
85 90 95
Ala Asp Thr Arg Thr Glu Asp Ala Ile Ser Leu Gln Lys Ala Leu Leu
100 105 110
Glu His Gln Leu Cys Gly Val Leu Pro Ser Ser Phe Asp Ser Phe Arg
115 120 125
Leu Gly Arg Gly Leu Glu Asn Ser Leu Pro Leu Glu Val Val Arg Gly
130 135 140
Ala Met Thr Ile Arg Val Asn Ser Leu Thr Arg Gly His Ser Ala Val
145 150 155 160
Arg Leu Val Val Leu Glu Ala Leu Thr Asn Phe Leu Asn His Gly Ile
165 170 175
Thr Pro Ile Val Pro Leu Arg Gly Thr Ile Ser Ala Ser Gly Asp Leu
180 185 190
Ser Pro Leu Ser Tyr Ile Ala Ala Ala Ile Ser Gly His Pro Asp Ser
195 200 205
Lys Val His Val Val His Glu Gly Lys Glu Lys Ile Leu Tyr Ala Arg
210 215 220
Glu Ala Met Ala Leu Phe Asn Leu Glu Pro Val Val Leu Gly Pro Lys
225 230 235 240
Glu Gly Leu Gly Leu Val Asn Gly Thr Ala Val Ser Ala Ser Met Ala
245 250 255
Thr Leu Ala Leu His Asp Ala His Met Leu Ser Leu Leu Ser Gln Ser
260 265 270
Leu Thr Ala Met Thr Val Glu Ala Met Val Gly His Ala Gly Ser Phe
275 280 285
His Pro Phe Leu His Asp Val Thr Arg Pro His Pro Thr Gln Ile Glu
290 295 300
Val Ala Gly Asn Ile Arg Lys Leu Leu Glu Gly Ser Arg Phe Ala Val
305 310 315 320
His His Glu Glu Glu Val Lys Val Lys Asp Asp Glu Gly Ile Leu Arg
325 330 335
Gln Asp Arg Tyr Pro Leu Arg Thr Ser Pro Gln Trp Leu Gly Pro Leu
340 345 350
Val Ser Asp Leu Ile His Ala His Ala Val Leu Thr Ile Glu Ala Gly
355 360 365
Gln Ser Thr Thr Asp Asn Pro Leu Ile Asp Val Glu Asn Lys Thr Ser
370 375 380
His His Gly Gly Asn Phe Gln Ala Ala Ala Val Ala Asn Thr Met Glu
385 390 395 400
Lys Thr Arg Leu Gly Leu Ala Gln Ile Gly Lys Leu Asn Phe Thr Gln
405 410 415
Leu Thr Glu Met Leu Asn Ala Gly Met Asn Arg Gly Leu Pro Ser Cys
420 425 430
Leu Ala Ala Glu Asp Pro Ser Leu Ser Tyr His Cys Lys Gly Leu Asp
435 440 445
Ile Ala Ala Ala Ala Tyr Thr Ser Glu Leu Gly His Leu Ala Asn Pro
450 455 460
Val Thr Thr His Val Gln Pro Ala Glu Met Ala Asn Gln Ala Val Asn
465 470 475 480
Ser Leu Ala Leu Ile Ser Ala Arg Arg Thr Thr Glu Ser Asn Asp Val
485 490 495
Leu Ser Leu Leu Leu Ala Thr His Leu Tyr Cys Val Leu Gln Ala Ile
500 505 510
Asp Leu Arg Ala Ile Glu Phe Glu Phe Lys Lys Gln Phe Gly Pro Ala
515 520 525
Ile Val Ser Leu Ile Asp Gln His Phe Gly Ser Ala Met Thr Gly Ser
530 535 540
Asn Leu Arg Asp Glu Leu Val Glu Lys Val Asn Lys Thr Leu Ala Lys
545 550 555 560
Arg Leu Glu Gln Thr Asn Ser Tyr Asp Leu Val Pro Arg Trp His Asp
565 570 575
Ala Phe Ser Phe Ala Ala Gly Thr Val Val Glu Val Leu Ser Ser Thr
580 585 590
Ser Leu Ser Leu Ala Ala Val Asn Ala Trp Lys Val Ala Ala Ala Glu
595 600 605
Ser Ala Ile Ser Leu Thr Arg Gln Val Arg Glu Thr Phe Trp Ser Ala
610 615 620
Ala Ser Thr Ser Ser Pro Ala Leu Ser Tyr Leu Ser Pro Arg Thr Gln
625 630 635 640
Ile Leu Tyr Ala Phe Val Arg Glu Glu Leu Gly Val Lys Ala Arg Arg
645 650 655
Gly Asp Val Phe Leu Gly Lys Gln Glu Val Thr Ile Gly Ser Asn Val
660 665 670
Ser Lys Ile Tyr Glu Ala Ile Lys Ser Gly Arg Ile Asn Asn Val Leu
675 680 685
Leu Lys Met Leu Ala Lys Ala Lys Leu Lys Glu Glu Glu Glu Arg Lys
690 695 700
Gln Arg Glu Glu Glu Glu Arg Ile Lys Arg Leu Glu Glu Leu Ala Lys
705 710 715 720
Arg Lys Glu Glu Glu Arg Lys Gly Thr Ala Glu Tyr Val Arg Ala Leu
725 730 735
Phe Asp Phe Asn Gly Asn Asp Glu Glu Asp Leu Pro Phe Lys Lys Gly
740 745 750
Asp Ile Leu Arg Ile Arg Asp Lys Pro Glu Glu Gln Trp Trp Asn Ala
755 760 765
Glu Asp Ser Glu Gly Lys Arg Gly Met Ile Pro Val Pro Tyr Val Glu
770 775 780
Lys Tyr
785
<210> 5
<211> 544
<212> PRT
<213> 欧芹(Petroselium crispum)
<400> 5
Met Gly Asp Cys Val Ala Pro Lys Glu Asp Leu Ile Phe Arg Ser Lys
1 5 10 15
Leu Pro Asp Ile Tyr Ile Pro Lys His Leu Pro Leu His Thr Tyr Cys
20 25 30
Phe Glu Asn Ile Ser Lys Val Gly Asp Lys Ser Cys Leu Ile Asn Gly
35 40 45
Ala Thr Gly Glu Thr Phe Thr Tyr Ser Gln Val Glu Leu Leu Ser Arg
50 55 60
Lys Val Ala Ser Gly Leu Asn Lys Leu Gly Ile Gln Gln Gly Asp Thr
65 70 75 80
Ile Met Leu Leu Leu Pro Asn Ser Pro Glu Tyr Phe Phe Ala Phe Leu
85 90 95
Gly Ala Ser Tyr Arg Gly Ala Ile Ser Thr Met Ala Asn Pro Phe Phe
100 105 110
Thr Ser Ala Glu Val Ile Lys Gln Leu Lys Ala Ser Gln Ala Lys Leu
115 120 125
Ile Ile Thr Gln Ala Cys Tyr Val Asp Lys Val Lys Asp Tyr Ala Ala
130 135 140
Glu Lys Asn Ile Gln Ile Ile Cys Ile Asp Asp Ala Pro Gln Asp Cys
145 150 155 160
Leu His Phe Ser Lys Leu Met Glu Ala Asp Glu Ser Glu Met Pro Glu
165 170 175
Val Val Ile Asn Ser Asp Asp Val Val Ala Leu Pro Tyr Ser Ser Gly
180 185 190
Thr Thr Gly Leu Pro Lys Gly Val Met Leu Thr His Lys Gly Leu Val
195 200 205
Thr Ser Val Ala Gln Gln Val Asp Gly Asp Asn Pro Asn Leu Tyr Met
210 215 220
His Ser Glu Asp Val Met Ile Cys Ile Leu Pro Leu Phe His Ile Tyr
225 230 235 240
Ser Leu Asn Ala Val Leu Cys Cys Gly Leu Arg Ala Gly Val Thr Ile
245 250 255
Leu Ile Met Gln Lys Phe Asp Ile Val Pro Phe Leu Glu Leu Ile Gln
260 265 270
Lys Tyr Lys Val Thr Ile Gly Pro Phe Val Pro Pro Ile Val Leu Ala
275 280 285
Ile Ala Lys Ser Pro Val Val Asp Lys Tyr Asp Leu Ser Ser Val Arg
290 295 300
Thr Val Met Ser Gly Ala Ala Pro Leu Gly Lys Glu Leu Glu Asp Ala
305 310 315 320
Val Arg Ala Lys Phe Pro Asn Ala Lys Leu Gly Gln Gly Tyr Gly Met
325 330 335
Thr Glu Ala Gly Pro Val Leu Ala Met Cys Leu Ala Phe Ala Lys Glu
340 345 350
Pro Tyr Glu Ile Lys Ser Gly Ala Cys Gly Thr Val Val Arg Asn Ala
355 360 365
Glu Met Lys Ile Val Asp Pro Glu Thr Asn Ala Ser Leu Pro Arg Asn
370 375 380
Gln Arg Gly Glu Ile Cys Ile Arg Gly Asp Gln Ile Met Lys Gly Tyr
385 390 395 400
Leu Asn Asp Pro Glu Ser Thr Arg Thr Thr Ile Asp Glu Glu Gly Trp
405 410 415
Leu His Thr Gly Asp Ile Gly Phe Ile Asp Asp Asp Asp Glu Leu Phe
420 425 430
Ile Val Asp Arg Leu Lys Glu Ile Ile Lys Tyr Lys Gly Phe Gln Val
435 440 445
Ala Pro Ala Glu Leu Glu Ala Leu Leu Leu Thr His Pro Thr Ile Ser
450 455 460
Asp Ala Ala Val Val Pro Met Ile Asp Glu Lys Ala Gly Glu Val Pro
465 470 475 480
Val Ala Phe Val Val Arg Thr Asn Gly Phe Thr Thr Thr Glu Glu Glu
485 490 495
Ile Lys Gln Phe Val Ser Lys Gln Val Val Phe Tyr Lys Arg Ile Phe
500 505 510
Arg Val Phe Phe Val Asp Ala Ile Pro Lys Ser Pro Ser Gly Lys Ile
515 520 525
Leu Arg Lys Asp Leu Arg Ala Arg Ile Ala Ser Gly Asp Leu Pro Lys
530 535 540
<210> 6
<211> 553
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(553)
<223> (GGGGS)2-4CL
<400> 6
Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Asp Cys Val Ala Pro
1 5 10 15
Lys Glu Asp Leu Ile Phe Arg Ser Lys Leu Pro Asp Ile Tyr Ile Pro
20 25 30
Lys His Leu Pro Leu His Thr Tyr Cys Phe Glu Asn Ile Ser Lys Val
35 40 45
Gly Asp Lys Ser Cys Leu Ile Asn Gly Ala Thr Gly Glu Thr Phe Thr
50 55 60
Tyr Ser Gln Val Glu Leu Leu Ser Arg Lys Val Ala Ser Gly Leu Asn
65 70 75 80
Lys Leu Gly Ile Gln Gln Gly Asp Thr Ile Met Leu Leu Leu Pro Asn
85 90 95
Ser Pro Glu Tyr Phe Phe Ala Phe Leu Gly Ala Ser Tyr Arg Gly Ala
100 105 110
Ile Ser Thr Met Ala Asn Pro Phe Phe Thr Ser Ala Glu Val Ile Lys
115 120 125
Gln Leu Lys Ala Ser Gln Ala Lys Leu Ile Ile Thr Gln Ala Cys Tyr
130 135 140
Val Asp Lys Val Lys Asp Tyr Ala Ala Glu Lys Asn Ile Gln Ile Ile
145 150 155 160
Cys Ile Asp Asp Ala Pro Gln Asp Cys Leu His Phe Ser Lys Leu Met
165 170 175
Glu Ala Asp Glu Ser Glu Met Pro Glu Val Val Ile Asn Ser Asp Asp
180 185 190
Val Val Ala Leu Pro Tyr Ser Ser Gly Thr Thr Gly Leu Pro Lys Gly
195 200 205
Val Met Leu Thr His Lys Gly Leu Val Thr Ser Val Ala Gln Gln Val
210 215 220
Asp Gly Asp Asn Pro Asn Leu Tyr Met His Ser Glu Asp Val Met Ile
225 230 235 240
Cys Ile Leu Pro Leu Phe His Ile Tyr Ser Leu Asn Ala Val Leu Cys
245 250 255
Cys Gly Leu Arg Ala Gly Val Thr Ile Leu Ile Met Gln Lys Phe Asp
260 265 270
Ile Val Pro Phe Leu Glu Leu Ile Gln Lys Tyr Lys Val Thr Ile Gly
275 280 285
Pro Phe Val Pro Pro Ile Val Leu Ala Ile Ala Lys Ser Pro Val Val
290 295 300
Asp Lys Tyr Asp Leu Ser Ser Val Arg Thr Val Met Ser Gly Ala Ala
305 310 315 320
Pro Leu Gly Lys Glu Leu Glu Asp Ala Val Arg Ala Lys Phe Pro Asn
325 330 335
Ala Lys Leu Gly Gln Gly Tyr Gly Met Thr Glu Ala Gly Pro Val Leu
340 345 350
Ala Met Cys Leu Ala Phe Ala Lys Glu Pro Tyr Glu Ile Lys Ser Gly
355 360 365
Ala Cys Gly Thr Val Val Arg Asn Ala Glu Met Lys Ile Val Asp Pro
370 375 380
Glu Thr Asn Ala Ser Leu Pro Arg Asn Gln Arg Gly Glu Ile Cys Ile
385 390 395 400
Arg Gly Asp Gln Ile Met Lys Gly Tyr Leu Asn Asp Pro Glu Ser Thr
405 410 415
Arg Thr Thr Ile Asp Glu Glu Gly Trp Leu His Thr Gly Asp Ile Gly
420 425 430
Phe Ile Asp Asp Asp Asp Glu Leu Phe Ile Val Asp Arg Leu Lys Glu
435 440 445
Ile Ile Lys Tyr Lys Gly Phe Gln Val Ala Pro Ala Glu Leu Glu Ala
450 455 460
Leu Leu Leu Thr His Pro Thr Ile Ser Asp Ala Ala Val Val Pro Met
465 470 475 480
Ile Asp Glu Lys Ala Gly Glu Val Pro Val Ala Phe Val Val Arg Thr
485 490 495
Asn Gly Phe Thr Thr Thr Glu Glu Glu Ile Lys Gln Phe Val Ser Lys
500 505 510
Gln Val Val Phe Tyr Lys Arg Ile Phe Arg Val Phe Phe Val Asp Ala
515 520 525
Ile Pro Lys Ser Pro Ser Gly Lys Ile Leu Arg Lys Asp Leu Arg Ala
530 535 540
Arg Ile Ala Ser Gly Asp Leu Pro Lys
545 550
<210> 7
<211> 561
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(561)
<223> PDZlig-(GGGGS)2-4CL
<400> 7
Met Gly Val Lys Glu Ser Leu Val Gly Gly Gly Gly Ser Gly Gly Gly
1 5 10 15
Gly Ser Gly Asp Cys Val Ala Pro Lys Glu Asp Leu Ile Phe Arg Ser
20 25 30
Lys Leu Pro Asp Ile Tyr Ile Pro Lys His Leu Pro Leu His Thr Tyr
35 40 45
Cys Phe Glu Asn Ile Ser Lys Val Gly Asp Lys Ser Cys Leu Ile Asn
50 55 60
Gly Ala Thr Gly Glu Thr Phe Thr Tyr Ser Gln Val Glu Leu Leu Ser
65 70 75 80
Arg Lys Val Ala Ser Gly Leu Asn Lys Leu Gly Ile Gln Gln Gly Asp
85 90 95
Thr Ile Met Leu Leu Leu Pro Asn Ser Pro Glu Tyr Phe Phe Ala Phe
100 105 110
Leu Gly Ala Ser Tyr Arg Gly Ala Ile Ser Thr Met Ala Asn Pro Phe
115 120 125
Phe Thr Ser Ala Glu Val Ile Lys Gln Leu Lys Ala Ser Gln Ala Lys
130 135 140
Leu Ile Ile Thr Gln Ala Cys Tyr Val Asp Lys Val Lys Asp Tyr Ala
145 150 155 160
Ala Glu Lys Asn Ile Gln Ile Ile Cys Ile Asp Asp Ala Pro Gln Asp
165 170 175
Cys Leu His Phe Ser Lys Leu Met Glu Ala Asp Glu Ser Glu Met Pro
180 185 190
Glu Val Val Ile Asn Ser Asp Asp Val Val Ala Leu Pro Tyr Ser Ser
195 200 205
Gly Thr Thr Gly Leu Pro Lys Gly Val Met Leu Thr His Lys Gly Leu
210 215 220
Val Thr Ser Val Ala Gln Gln Val Asp Gly Asp Asn Pro Asn Leu Tyr
225 230 235 240
Met His Ser Glu Asp Val Met Ile Cys Ile Leu Pro Leu Phe His Ile
245 250 255
Tyr Ser Leu Asn Ala Val Leu Cys Cys Gly Leu Arg Ala Gly Val Thr
260 265 270
Ile Leu Ile Met Gln Lys Phe Asp Ile Val Pro Phe Leu Glu Leu Ile
275 280 285
Gln Lys Tyr Lys Val Thr Ile Gly Pro Phe Val Pro Pro Ile Val Leu
290 295 300
Ala Ile Ala Lys Ser Pro Val Val Asp Lys Tyr Asp Leu Ser Ser Val
305 310 315 320
Arg Thr Val Met Ser Gly Ala Ala Pro Leu Gly Lys Glu Leu Glu Asp
325 330 335
Ala Val Arg Ala Lys Phe Pro Asn Ala Lys Leu Gly Gln Gly Tyr Gly
340 345 350
Met Thr Glu Ala Gly Pro Val Leu Ala Met Cys Leu Ala Phe Ala Lys
355 360 365
Glu Pro Tyr Glu Ile Lys Ser Gly Ala Cys Gly Thr Val Val Arg Asn
370 375 380
Ala Glu Met Lys Ile Val Asp Pro Glu Thr Asn Ala Ser Leu Pro Arg
385 390 395 400
Asn Gln Arg Gly Glu Ile Cys Ile Arg Gly Asp Gln Ile Met Lys Gly
405 410 415
Tyr Leu Asn Asp Pro Glu Ser Thr Arg Thr Thr Ile Asp Glu Glu Gly
420 425 430
Trp Leu His Thr Gly Asp Ile Gly Phe Ile Asp Asp Asp Asp Glu Leu
435 440 445
Phe Ile Val Asp Arg Leu Lys Glu Ile Ile Lys Tyr Lys Gly Phe Gln
450 455 460
Val Ala Pro Ala Glu Leu Glu Ala Leu Leu Leu Thr His Pro Thr Ile
465 470 475 480
Ser Asp Ala Ala Val Val Pro Met Ile Asp Glu Lys Ala Gly Glu Val
485 490 495
Pro Val Ala Phe Val Val Arg Thr Asn Gly Phe Thr Thr Thr Glu Glu
500 505 510
Glu Ile Lys Gln Phe Val Ser Lys Gln Val Val Phe Tyr Lys Arg Ile
515 520 525
Phe Arg Val Phe Phe Val Asp Ala Ile Pro Lys Ser Pro Ser Gly Lys
530 535 540
Ile Leu Arg Lys Asp Leu Arg Ala Arg Ile Ala Ser Gly Asp Leu Pro
545 550 555 560
Lys
<210> 8
<211> 554
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(554)
<223> 4CL-(GGGGS)2
<400> 8
Met Gly Asp Cys Val Ala Pro Lys Glu Asp Leu Ile Phe Arg Ser Lys
1 5 10 15
Leu Pro Asp Ile Tyr Ile Pro Lys His Leu Pro Leu His Thr Tyr Cys
20 25 30
Phe Glu Asn Ile Ser Lys Val Gly Asp Lys Ser Cys Leu Ile Asn Gly
35 40 45
Ala Thr Gly Glu Thr Phe Thr Tyr Ser Gln Val Glu Leu Leu Ser Arg
50 55 60
Lys Val Ala Ser Gly Leu Asn Lys Leu Gly Ile Gln Gln Gly Asp Thr
65 70 75 80
Ile Met Leu Leu Leu Pro Asn Ser Pro Glu Tyr Phe Phe Ala Phe Leu
85 90 95
Gly Ala Ser Tyr Arg Gly Ala Ile Ser Thr Met Ala Asn Pro Phe Phe
100 105 110
Thr Ser Ala Glu Val Ile Lys Gln Leu Lys Ala Ser Gln Ala Lys Leu
115 120 125
Ile Ile Thr Gln Ala Cys Tyr Val Asp Lys Val Lys Asp Tyr Ala Ala
130 135 140
Glu Lys Asn Ile Gln Ile Ile Cys Ile Asp Asp Ala Pro Gln Asp Cys
145 150 155 160
Leu His Phe Ser Lys Leu Met Glu Ala Asp Glu Ser Glu Met Pro Glu
165 170 175
Val Val Ile Asn Ser Asp Asp Val Val Ala Leu Pro Tyr Ser Ser Gly
180 185 190
Thr Thr Gly Leu Pro Lys Gly Val Met Leu Thr His Lys Gly Leu Val
195 200 205
Thr Ser Val Ala Gln Gln Val Asp Gly Asp Asn Pro Asn Leu Tyr Met
210 215 220
His Ser Glu Asp Val Met Ile Cys Ile Leu Pro Leu Phe His Ile Tyr
225 230 235 240
Ser Leu Asn Ala Val Leu Cys Cys Gly Leu Arg Ala Gly Val Thr Ile
245 250 255
Leu Ile Met Gln Lys Phe Asp Ile Val Pro Phe Leu Glu Leu Ile Gln
260 265 270
Lys Tyr Lys Val Thr Ile Gly Pro Phe Val Pro Pro Ile Val Leu Ala
275 280 285
Ile Ala Lys Ser Pro Val Val Asp Lys Tyr Asp Leu Ser Ser Val Arg
290 295 300
Thr Val Met Ser Gly Ala Ala Pro Leu Gly Lys Glu Leu Glu Asp Ala
305 310 315 320
Val Arg Ala Lys Phe Pro Asn Ala Lys Leu Gly Gln Gly Tyr Gly Met
325 330 335
Thr Glu Ala Gly Pro Val Leu Ala Met Cys Leu Ala Phe Ala Lys Glu
340 345 350
Pro Tyr Glu Ile Lys Ser Gly Ala Cys Gly Thr Val Val Arg Asn Ala
355 360 365
Glu Met Lys Ile Val Asp Pro Glu Thr Asn Ala Ser Leu Pro Arg Asn
370 375 380
Gln Arg Gly Glu Ile Cys Ile Arg Gly Asp Gln Ile Met Lys Gly Tyr
385 390 395 400
Leu Asn Asp Pro Glu Ser Thr Arg Thr Thr Ile Asp Glu Glu Gly Trp
405 410 415
Leu His Thr Gly Asp Ile Gly Phe Ile Asp Asp Asp Asp Glu Leu Phe
420 425 430
Ile Val Asp Arg Leu Lys Glu Ile Ile Lys Tyr Lys Gly Phe Gln Val
435 440 445
Ala Pro Ala Glu Leu Glu Ala Leu Leu Leu Thr His Pro Thr Ile Ser
450 455 460
Asp Ala Ala Val Val Pro Met Ile Asp Glu Lys Ala Gly Glu Val Pro
465 470 475 480
Val Ala Phe Val Val Arg Thr Asn Gly Phe Thr Thr Thr Glu Glu Glu
485 490 495
Ile Lys Gln Phe Val Ser Lys Gln Val Val Phe Tyr Lys Arg Ile Phe
500 505 510
Arg Val Phe Phe Val Asp Ala Ile Pro Lys Ser Pro Ser Gly Lys Ile
515 520 525
Leu Arg Lys Asp Leu Arg Ala Arg Ile Ala Ser Gly Asp Leu Pro Lys
530 535 540
Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
545 550
<210> 9
<211> 561
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(561)
<223> 4CL-(GGGGS)2- PDZlig
<400> 9
Met Gly Asp Cys Val Ala Pro Lys Glu Asp Leu Ile Phe Arg Ser Lys
1 5 10 15
Leu Pro Asp Ile Tyr Ile Pro Lys His Leu Pro Leu His Thr Tyr Cys
20 25 30
Phe Glu Asn Ile Ser Lys Val Gly Asp Lys Ser Cys Leu Ile Asn Gly
35 40 45
Ala Thr Gly Glu Thr Phe Thr Tyr Ser Gln Val Glu Leu Leu Ser Arg
50 55 60
Lys Val Ala Ser Gly Leu Asn Lys Leu Gly Ile Gln Gln Gly Asp Thr
65 70 75 80
Ile Met Leu Leu Leu Pro Asn Ser Pro Glu Tyr Phe Phe Ala Phe Leu
85 90 95
Gly Ala Ser Tyr Arg Gly Ala Ile Ser Thr Met Ala Asn Pro Phe Phe
100 105 110
Thr Ser Ala Glu Val Ile Lys Gln Leu Lys Ala Ser Gln Ala Lys Leu
115 120 125
Ile Ile Thr Gln Ala Cys Tyr Val Asp Lys Val Lys Asp Tyr Ala Ala
130 135 140
Glu Lys Asn Ile Gln Ile Ile Cys Ile Asp Asp Ala Pro Gln Asp Cys
145 150 155 160
Leu His Phe Ser Lys Leu Met Glu Ala Asp Glu Ser Glu Met Pro Glu
165 170 175
Val Val Ile Asn Ser Asp Asp Val Val Ala Leu Pro Tyr Ser Ser Gly
180 185 190
Thr Thr Gly Leu Pro Lys Gly Val Met Leu Thr His Lys Gly Leu Val
195 200 205
Thr Ser Val Ala Gln Gln Val Asp Gly Asp Asn Pro Asn Leu Tyr Met
210 215 220
His Ser Glu Asp Val Met Ile Cys Ile Leu Pro Leu Phe His Ile Tyr
225 230 235 240
Ser Leu Asn Ala Val Leu Cys Cys Gly Leu Arg Ala Gly Val Thr Ile
245 250 255
Leu Ile Met Gln Lys Phe Asp Ile Val Pro Phe Leu Glu Leu Ile Gln
260 265 270
Lys Tyr Lys Val Thr Ile Gly Pro Phe Val Pro Pro Ile Val Leu Ala
275 280 285
Ile Ala Lys Ser Pro Val Val Asp Lys Tyr Asp Leu Ser Ser Val Arg
290 295 300
Thr Val Met Ser Gly Ala Ala Pro Leu Gly Lys Glu Leu Glu Asp Ala
305 310 315 320
Val Arg Ala Lys Phe Pro Asn Ala Lys Leu Gly Gln Gly Tyr Gly Met
325 330 335
Thr Glu Ala Gly Pro Val Leu Ala Met Cys Leu Ala Phe Ala Lys Glu
340 345 350
Pro Tyr Glu Ile Lys Ser Gly Ala Cys Gly Thr Val Val Arg Asn Ala
355 360 365
Glu Met Lys Ile Val Asp Pro Glu Thr Asn Ala Ser Leu Pro Arg Asn
370 375 380
Gln Arg Gly Glu Ile Cys Ile Arg Gly Asp Gln Ile Met Lys Gly Tyr
385 390 395 400
Leu Asn Asp Pro Glu Ser Thr Arg Thr Thr Ile Asp Glu Glu Gly Trp
405 410 415
Leu His Thr Gly Asp Ile Gly Phe Ile Asp Asp Asp Asp Glu Leu Phe
420 425 430
Ile Val Asp Arg Leu Lys Glu Ile Ile Lys Tyr Lys Gly Phe Gln Val
435 440 445
Ala Pro Ala Glu Leu Glu Ala Leu Leu Leu Thr His Pro Thr Ile Ser
450 455 460
Asp Ala Ala Val Val Pro Met Ile Asp Glu Lys Ala Gly Glu Val Pro
465 470 475 480
Val Ala Phe Val Val Arg Thr Asn Gly Phe Thr Thr Thr Glu Glu Glu
485 490 495
Ile Lys Gln Phe Val Ser Lys Gln Val Val Phe Tyr Lys Arg Ile Phe
500 505 510
Arg Val Phe Phe Val Asp Ala Ile Pro Lys Ser Pro Ser Gly Lys Ile
515 520 525
Leu Arg Lys Asp Leu Arg Ala Arg Ile Ala Ser Gly Asp Leu Pro Lys
530 535 540
Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Val Lys Glu Ser Leu
545 550 555 560
Val
<210> 10
<211> 565
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(565)
<223> SH3lig-(GGGGS)2-4CL
<400> 10
Met Pro Pro Pro Ala Leu Pro Pro Lys Arg Arg Arg Gly Gly Gly Gly
1 5 10 15
Ser Gly Gly Gly Gly Ser Gly Asp Cys Val Ala Pro Lys Glu Asp Leu
20 25 30
Ile Phe Arg Ser Lys Leu Pro Asp Ile Tyr Ile Pro Lys His Leu Pro
35 40 45
Leu His Thr Tyr Cys Phe Glu Asn Ile Ser Lys Val Gly Asp Lys Ser
50 55 60
Cys Leu Ile Asn Gly Ala Thr Gly Glu Thr Phe Thr Tyr Ser Gln Val
65 70 75 80
Glu Leu Leu Ser Arg Lys Val Ala Ser Gly Leu Asn Lys Leu Gly Ile
85 90 95
Gln Gln Gly Asp Thr Ile Met Leu Leu Leu Pro Asn Ser Pro Glu Tyr
100 105 110
Phe Phe Ala Phe Leu Gly Ala Ser Tyr Arg Gly Ala Ile Ser Thr Met
115 120 125
Ala Asn Pro Phe Phe Thr Ser Ala Glu Val Ile Lys Gln Leu Lys Ala
130 135 140
Ser Gln Ala Lys Leu Ile Ile Thr Gln Ala Cys Tyr Val Asp Lys Val
145 150 155 160
Lys Asp Tyr Ala Ala Glu Lys Asn Ile Gln Ile Ile Cys Ile Asp Asp
165 170 175
Ala Pro Gln Asp Cys Leu His Phe Ser Lys Leu Met Glu Ala Asp Glu
180 185 190
Ser Glu Met Pro Glu Val Val Ile Asn Ser Asp Asp Val Val Ala Leu
195 200 205
Pro Tyr Ser Ser Gly Thr Thr Gly Leu Pro Lys Gly Val Met Leu Thr
210 215 220
His Lys Gly Leu Val Thr Ser Val Ala Gln Gln Val Asp Gly Asp Asn
225 230 235 240
Pro Asn Leu Tyr Met His Ser Glu Asp Val Met Ile Cys Ile Leu Pro
245 250 255
Leu Phe His Ile Tyr Ser Leu Asn Ala Val Leu Cys Cys Gly Leu Arg
260 265 270
Ala Gly Val Thr Ile Leu Ile Met Gln Lys Phe Asp Ile Val Pro Phe
275 280 285
Leu Glu Leu Ile Gln Lys Tyr Lys Val Thr Ile Gly Pro Phe Val Pro
290 295 300
Pro Ile Val Leu Ala Ile Ala Lys Ser Pro Val Val Asp Lys Tyr Asp
305 310 315 320
Leu Ser Ser Val Arg Thr Val Met Ser Gly Ala Ala Pro Leu Gly Lys
325 330 335
Glu Leu Glu Asp Ala Val Arg Ala Lys Phe Pro Asn Ala Lys Leu Gly
340 345 350
Gln Gly Tyr Gly Met Thr Glu Ala Gly Pro Val Leu Ala Met Cys Leu
355 360 365
Ala Phe Ala Lys Glu Pro Tyr Glu Ile Lys Ser Gly Ala Cys Gly Thr
370 375 380
Val Val Arg Asn Ala Glu Met Lys Ile Val Asp Pro Glu Thr Asn Ala
385 390 395 400
Ser Leu Pro Arg Asn Gln Arg Gly Glu Ile Cys Ile Arg Gly Asp Gln
405 410 415
Ile Met Lys Gly Tyr Leu Asn Asp Pro Glu Ser Thr Arg Thr Thr Ile
420 425 430
Asp Glu Glu Gly Trp Leu His Thr Gly Asp Ile Gly Phe Ile Asp Asp
435 440 445
Asp Asp Glu Leu Phe Ile Val Asp Arg Leu Lys Glu Ile Ile Lys Tyr
450 455 460
Lys Gly Phe Gln Val Ala Pro Ala Glu Leu Glu Ala Leu Leu Leu Thr
465 470 475 480
His Pro Thr Ile Ser Asp Ala Ala Val Val Pro Met Ile Asp Glu Lys
485 490 495
Ala Gly Glu Val Pro Val Ala Phe Val Val Arg Thr Asn Gly Phe Thr
500 505 510
Thr Thr Glu Glu Glu Ile Lys Gln Phe Val Ser Lys Gln Val Val Phe
515 520 525
Tyr Lys Arg Ile Phe Arg Val Phe Phe Val Asp Ala Ile Pro Lys Ser
530 535 540
Pro Ser Gly Lys Ile Leu Arg Lys Asp Leu Arg Ala Arg Ile Ala Ser
545 550 555 560
Gly Asp Leu Pro Lys
565
<210> 11
<211> 565
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(565)
<223> 4CL-(GGGGS)2-SH3lig
<400> 11
Met Gly Asp Cys Val Ala Pro Lys Glu Asp Leu Ile Phe Arg Ser Lys
1 5 10 15
Leu Pro Asp Ile Tyr Ile Pro Lys His Leu Pro Leu His Thr Tyr Cys
20 25 30
Phe Glu Asn Ile Ser Lys Val Gly Asp Lys Ser Cys Leu Ile Asn Gly
35 40 45
Ala Thr Gly Glu Thr Phe Thr Tyr Ser Gln Val Glu Leu Leu Ser Arg
50 55 60
Lys Val Ala Ser Gly Leu Asn Lys Leu Gly Ile Gln Gln Gly Asp Thr
65 70 75 80
Ile Met Leu Leu Leu Pro Asn Ser Pro Glu Tyr Phe Phe Ala Phe Leu
85 90 95
Gly Ala Ser Tyr Arg Gly Ala Ile Ser Thr Met Ala Asn Pro Phe Phe
100 105 110
Thr Ser Ala Glu Val Ile Lys Gln Leu Lys Ala Ser Gln Ala Lys Leu
115 120 125
Ile Ile Thr Gln Ala Cys Tyr Val Asp Lys Val Lys Asp Tyr Ala Ala
130 135 140
Glu Lys Asn Ile Gln Ile Ile Cys Ile Asp Asp Ala Pro Gln Asp Cys
145 150 155 160
Leu His Phe Ser Lys Leu Met Glu Ala Asp Glu Ser Glu Met Pro Glu
165 170 175
Val Val Ile Asn Ser Asp Asp Val Val Ala Leu Pro Tyr Ser Ser Gly
180 185 190
Thr Thr Gly Leu Pro Lys Gly Val Met Leu Thr His Lys Gly Leu Val
195 200 205
Thr Ser Val Ala Gln Gln Val Asp Gly Asp Asn Pro Asn Leu Tyr Met
210 215 220
His Ser Glu Asp Val Met Ile Cys Ile Leu Pro Leu Phe His Ile Tyr
225 230 235 240
Ser Leu Asn Ala Val Leu Cys Cys Gly Leu Arg Ala Gly Val Thr Ile
245 250 255
Leu Ile Met Gln Lys Phe Asp Ile Val Pro Phe Leu Glu Leu Ile Gln
260 265 270
Lys Tyr Lys Val Thr Ile Gly Pro Phe Val Pro Pro Ile Val Leu Ala
275 280 285
Ile Ala Lys Ser Pro Val Val Asp Lys Tyr Asp Leu Ser Ser Val Arg
290 295 300
Thr Val Met Ser Gly Ala Ala Pro Leu Gly Lys Glu Leu Glu Asp Ala
305 310 315 320
Val Arg Ala Lys Phe Pro Asn Ala Lys Leu Gly Gln Gly Tyr Gly Met
325 330 335
Thr Glu Ala Gly Pro Val Leu Ala Met Cys Leu Ala Phe Ala Lys Glu
340 345 350
Pro Tyr Glu Ile Lys Ser Gly Ala Cys Gly Thr Val Val Arg Asn Ala
355 360 365
Glu Met Lys Ile Val Asp Pro Glu Thr Asn Ala Ser Leu Pro Arg Asn
370 375 380
Gln Arg Gly Glu Ile Cys Ile Arg Gly Asp Gln Ile Met Lys Gly Tyr
385 390 395 400
Leu Asn Asp Pro Glu Ser Thr Arg Thr Thr Ile Asp Glu Glu Gly Trp
405 410 415
Leu His Thr Gly Asp Ile Gly Phe Ile Asp Asp Asp Asp Glu Leu Phe
420 425 430
Ile Val Asp Arg Leu Lys Glu Ile Ile Lys Tyr Lys Gly Phe Gln Val
435 440 445
Ala Pro Ala Glu Leu Glu Ala Leu Leu Leu Thr His Pro Thr Ile Ser
450 455 460
Asp Ala Ala Val Val Pro Met Ile Asp Glu Lys Ala Gly Glu Val Pro
465 470 475 480
Val Ala Phe Val Val Arg Thr Asn Gly Phe Thr Thr Thr Glu Glu Glu
485 490 495
Ile Lys Gln Phe Val Ser Lys Gln Val Val Phe Tyr Lys Arg Ile Phe
500 505 510
Arg Val Phe Phe Val Asp Ala Ile Pro Lys Ser Pro Ser Gly Lys Ile
515 520 525
Leu Arg Lys Asp Leu Arg Ala Arg Ile Ala Ser Gly Asp Leu Pro Lys
530 535 540
Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Pro Pro Pro Ala Leu Pro
545 550 555 560
Pro Lys Arg Arg Arg
565
<210> 12
<211> 7
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(7)
<223> PDZ ligand
<400> 12
Gly Val Lys Glu Ser Leu Val
1 5
<210> 13
<211> 57
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(57)
<223> SH3结构域
<400> 13
Ala Glu Tyr Val Arg Ala Leu Phe Asp Phe Asn Gly Asn Asp Glu Glu
1 5 10 15
Asp Leu Pro Phe Lys Lys Gly Asp Ile Leu Arg Ile Arg Asp Lys Pro
20 25 30
Glu Glu Gln Trp Trp Asn Ala Glu Asp Ser Glu Gly Lys Arg Gly Met
35 40 45
Ile Pro Val Pro Tyr Val Glu Lys Tyr
50 55
<210> 14
<211> 11
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(11)
<223> SH3lig
<400> 14
Pro Pro Pro Ala Leu Pro Pro Lys Arg Arg Arg
1 5 10
Claims (20)
1.一种用于合成黄芩素、野黄芩素类化合物的原核细胞,其包括外源的下组酶的编码基因:黄酮6-羟化酶,细胞色素P450氧化还原酶,苯丙氨酸解氨酶、4-香豆酸辅酶A连接酶、查尔酮合成酶、查尔酮异构酶和黄酮合成酶I;且所述酶被表达后,苯丙氨酸解氨酶和4-香豆酸辅酶A连接酶构成复合体。
2.一种用于合成白杨素类化合物的原核细胞,其包括外源的下组酶的编码基因:苯丙氨酸解氨酶、4-香豆酸辅酶A连接酶、查尔酮合成酶、查尔酮异构酶和黄酮合成酶I;且所述酶被表达后,苯丙氨酸解氨酶和4-香豆酸辅酶A连接酶构成复合体。
3.如权利要求1或2所述的原核细胞,其特征在于,所述的苯丙氨酸解氨酶和4-香豆酸辅酶A连接酶的复合体包括:
苯丙氨酸解氨酶和4-香豆酸辅酶A通过蛋白-蛋白相互作用结构域及其配体的结合而靠近,获得复合体;或
苯丙氨酸解氨酶和4-香豆酸辅酶A连接酶直接连接或通过连接子连接、获得融合蛋白形式的复合体。
4.如权利要求3所述的原核细胞,其特征在于,所述蛋白-蛋白相互作用结构域包括选自下组的结构域:PDZ结构域,SH3结构域,WW结构域,LIM结构域,DD结构域,PH结构域,EH结构域,GBD结构域。
5.如权利要求3所述的原核细胞,其特征在于,所述蛋白-蛋白相互作用结构域包括PDZ结构域,其配体为PDZ ligand;所述苯丙氨酸解氨酶和4-香豆酸辅酶A分别与所述PDZ结构域及其配体融合;较佳地,所述苯丙氨酸解氨酶与PDZ融合、所述4-香豆酸辅酶A与PDZligand融合;更佳地,所述苯丙氨酸解氨酶与PDZ融合时还包括以ER/K连接子连接,所述4-香豆酸辅酶A与PDZ ligand融合时还包括以(GGGGS)2连接子连接。
6.如权利要求3所述的原核细胞,其特征在于,所述蛋白-蛋白相互作用结构域包括SH3结构域,其配体为SH3 ligand;所述苯丙氨酸解氨酶和4-香豆酸辅酶A分别与所述SH3结构域及其配体融合;较佳地,所述苯丙氨酸解氨酶与SH3融合、所述4-香豆酸辅酶A与SH3ligand融合;更佳地,所述苯丙氨酸解氨酶与SH3融合时还包括以ER/K连接子连接,所述4-香豆酸辅酶A与SH3 ligand融合时还包括以(GGGGS)2连接子连接。
7.如权利要求5或6所述的原核细胞,其特征在于,所述苯丙氨酸解氨酶与PDZ融合时,所述苯丙氨酸解氨酶位于N端,所述PDZ位于C端;或
所述4-香豆酸辅酶A与PDZ ligand融合时,所述PDZ ligand位于N端,所述4-香豆酸辅酶A位于C端;或
所述苯丙氨酸解氨酶与SH3融合时;所述苯丙氨酸解氨酶位于N端,所述SH3位于C端;或
所述4-香豆酸辅酶A与SH3 ligand融合时,所述SH3 ligand位于N端,所述4-香豆酸辅酶A位于C端。
8.如权利要求1所述的原核细胞,其特征在于,所述细胞中还包括外源的促进丙二酰CoA生成的酶的编码基因;较佳地,包括matC,matB,ACS,FabF。
9.如权利要求1所述的原核细胞,其特征在于,所述的原核细胞为大肠杆菌细胞。
10.如权利要求1~9任一所述的原核细胞,其特征在于,所述细胞中还包括外源的促进苯丙氨酸合成的酶的编码基因;较佳地,包括:aroG,pheA;更佳地,所述pheA为第976位由A突变为C的基因;更佳地,所述aroG为第436位由G突变为A的基因。
11.权利要求1、3~10任一所述的原核细胞的应用,用于合成黄芩素或野黄芩素类化合物。
12.权利要求2、3~10任一所述的原核细胞的应用,用于合成白杨素类化合物。
15.一种合成黄芩素类化合物或白杨素类化合物的方法,包括:提供权利要求10所述的原核细胞,以葡萄糖为底物,合成黄芩素类化合物或白杨素类化合物。
16.如权利要求13~15任一所述的方法,其特征在于,在引入细胞时,
所述PDZligand、4-香豆酸辅酶A连接酶、苯丙氨酸解氨酶、ER/K、PDZ、黄酮合成酶I、查尔酮合成酶、查尔酮异构酶的编码基因位于一个构建体中;
所述黄酮6-羟化酶,细胞色素P450氧化还原酶的编码基因位于一个构建体中,较佳地还包括2B1基因;
所述matC,matB,ACS,FabF的编码基因位于一个构建体中;
所述SH3lig,4-香豆酸辅酶A连接酶,苯丙氨酸解氨酶,ER/K,SH3,查尔酮合成酶的编码基因位于一个构建体中;
所述查尔酮异构酶,黄酮合成酶I的编码基因位于一个构建体中;或
所述matC、matB、ACS、FabF的编码基因,第976位由A突变为C的pheA基因,第436位由G突变为A的aroG基因位于一个构建体中。
17.一种试剂盒,其包括权利要求1~10任一所述的重组的宿主细胞。
18.一种试剂盒,其包括:包含PDZligand、4-香豆酸辅酶A连接酶、苯丙氨酸解氨酶、ER/K、PDZ、黄酮合成酶I、查尔酮合成酶、查尔酮异构酶的编码基因的构建体;
包含黄酮6-羟化酶,细胞色素P450氧化还原酶的编码基因的构建体;较佳地所述构建体还包含2B1基因;
包含matC,matB,ACS,FabF的编码基因的构建体;
包含SH3lig,4-香豆酸辅酶A连接酶,苯丙氨酸解氨酶,ER/K,SH3,查尔酮合成酶的编码基因的构建体;
包含查尔酮异构酶,黄酮合成酶I的编码基因的构建体;
包含matC、matB、ACS、FabF的编码基因,第976位由A突变为C的pheA基因,第436位由G突变为A的aroG基因的构建体。
19.一种试剂盒,其包括:包含PDZligand、4-香豆酸辅酶A连接酶、苯丙氨酸解氨酶、ER/K、PDZ、黄酮合成酶I、查尔酮合成酶、查尔酮异构酶的编码基因的构建体;
包含matC,matB,ACS,FabF的编码基因的构建体;
包含SH3lig,4-香豆酸辅酶A连接酶,苯丙氨酸解氨酶,ER/K,SH3,查尔酮合成酶的编码基因的构建体;
包含查尔酮异构酶,黄酮合成酶I的编码基因的构建体;
包含matC、matB、ACS、FabF的编码基因,第976位由A突变为C的pheA基因,第436位由G突变为A的aroG基因的构建体。
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110009696.4A CN114717170B (zh) | 2021-01-05 | 2021-01-05 | 异源合成黄酮类化合物的宿主细胞及其应用 |
PCT/CN2022/070316 WO2022148377A1 (zh) | 2021-01-05 | 2022-01-05 | 异源合成黄酮类化合物的宿主细胞及其应用 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110009696.4A CN114717170B (zh) | 2021-01-05 | 2021-01-05 | 异源合成黄酮类化合物的宿主细胞及其应用 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114717170A true CN114717170A (zh) | 2022-07-08 |
CN114717170B CN114717170B (zh) | 2024-06-04 |
Family
ID=82234033
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110009696.4A Active CN114717170B (zh) | 2021-01-05 | 2021-01-05 | 异源合成黄酮类化合物的宿主细胞及其应用 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN114717170B (zh) |
WO (1) | WO2022148377A1 (zh) |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107723317A (zh) * | 2017-11-29 | 2018-02-23 | 华东理工大学 | 一种在大肠杆菌中生产衣康酸的方法 |
CN110885846A (zh) * | 2018-09-07 | 2020-03-17 | 中国科学院上海生命科学研究院 | 合成黄芩素和野黄芩素的微生物、其制备方法及其应用 |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2013067173A1 (en) * | 2011-11-02 | 2013-05-10 | The Board Of Trustees Of The Leland Stanford Junior University | Systematic control of protein interaction using a modular er/k linker |
WO2018226757A1 (en) * | 2017-06-05 | 2018-12-13 | Sivaramakrishnan Sivaraj | Screening system and methods for identifying enzyme substrates and modulators of enzyme activity |
CN113621530A (zh) * | 2020-04-07 | 2021-11-09 | 华东理工大学 | 生产汉黄芩素类化合物的基因工程酵母菌、其构建方法及应用 |
-
2021
- 2021-01-05 CN CN202110009696.4A patent/CN114717170B/zh active Active
-
2022
- 2022-01-05 WO PCT/CN2022/070316 patent/WO2022148377A1/zh active Application Filing
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107723317A (zh) * | 2017-11-29 | 2018-02-23 | 华东理工大学 | 一种在大肠杆菌中生产衣康酸的方法 |
CN110885846A (zh) * | 2018-09-07 | 2020-03-17 | 中国科学院上海生命科学研究院 | 合成黄芩素和野黄芩素的微生物、其制备方法及其应用 |
Non-Patent Citations (2)
Title |
---|
LI,J.等: "Production of plant-specific flavones baicalein and scutellarein in an engineered E.coli from available phenylalanine and tyrosine", METABOLIC ENGINEERING * |
方从兵,宛晓春,江昌俊: "黄酮类化合物生物合成的研究进展(综述)", 安徽农业大学学报 * |
Also Published As
Publication number | Publication date |
---|---|
CN114717170B (zh) | 2024-06-04 |
WO2022148377A1 (zh) | 2022-07-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9181539B2 (en) | Strains for the production of flavonoids from glucose | |
CN113322288B (zh) | 新型黄酮羟基化酶、合成黄酮碳苷类化合物的微生物及其应用 | |
WO2020048523A1 (zh) | 合成黄芩素和野黄芩素的微生物、其制备方法及其应用 | |
CN108913672A (zh) | 一种新型异戊烯转移酶及其应用 | |
CN113265433A (zh) | 双功能碳苷糖基转移酶及其应用 | |
CN112391300B (zh) | 水飞蓟来源的黄酮3β-羟化酶及其辅酶的应用 | |
CN114717170B (zh) | 异源合成黄酮类化合物的宿主细胞及其应用 | |
CN110904062B (zh) | 一株高产l-丙氨酸的菌株 | |
CN108220260B (zh) | 一种催化柚皮素生成山萘酚的融合酶及其应用 | |
CN114657160B (zh) | 一种糖基转移酶突变体及其应用 | |
CN112877349B (zh) | 一种重组表达载体、包含其的基因工程菌及其应用 | |
CN114277024B (zh) | 一种新型三萜合酶及其应用 | |
CN114032222B (zh) | 糖链延伸糖基转移酶突变体及其编码基因以及基因工程菌和它们的应用 | |
KR20240032944A (ko) | 람노스가 고도로 특이적인 글리코실트랜스퍼라제 및 이의 응용 | |
CN116656641A (zh) | 咖啡酸o-甲基转移酶突变体及其应用 | |
CN113621629A (zh) | 一种基于丙二酰辅酶a再生的柚皮素体外酶促合成方法 | |
WO2023138679A1 (zh) | 异源合成黄酮类化合物的调控方法与应用 | |
CN113583983A (zh) | 一种融合蛋白或其变体及其在制备骨化二醇中的应用 | |
CN106906192B (zh) | 一种葡萄糖基转移酶及在合成藏红花酸葡萄糖酯中的应用 | |
CN114806999B (zh) | 一种基因工程菌及其在制备二氢大豆苷元中的应用 | |
CN113528471B (zh) | 一种从头合成黄烷酮的三功能酶及其合成方法和应用 | |
CN114250237B (zh) | 一种酪氨酸酚裂解酶突变体、工程菌及在催化合成左旋多巴中的应用 | |
CN114774442B (zh) | 一种产灯盏乙素的重组解脂耶氏酵母及其构建方法和用途 | |
CN109486789B (zh) | 一种立体选择性提高的菜豆环氧化物水解酶突变体 | |
CN117946988A (zh) | 小檗碱桥酶突变体及其应用 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |