CN112280758A - 一种甾体5β还原酶变体及其用途 - Google Patents
一种甾体5β还原酶变体及其用途 Download PDFInfo
- Publication number
- CN112280758A CN112280758A CN202011215626.6A CN202011215626A CN112280758A CN 112280758 A CN112280758 A CN 112280758A CN 202011215626 A CN202011215626 A CN 202011215626A CN 112280758 A CN112280758 A CN 112280758A
- Authority
- CN
- China
- Prior art keywords
- steroid
- 5beta
- glu
- lys
- gly
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108010051372 3-oxo-5 beta-steroid delta 4-dehydrogenase Proteins 0.000 title claims abstract description 79
- 229930027945 nicotinamide-adenine dinucleotide Natural products 0.000 claims abstract description 59
- BOPGDPNILDQYTO-NNYOXOHSSA-N nicotinamide-adenine dinucleotide Chemical compound C1=CCC(C(=O)N)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]2[C@H]([C@@H](O)[C@@H](O2)N2C3=NC=NC(N)=C3N=C2)O)O1 BOPGDPNILDQYTO-NNYOXOHSSA-N 0.000 claims abstract description 51
- 239000005515 coenzyme Substances 0.000 claims abstract description 47
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract description 24
- 230000035772 mutation Effects 0.000 claims abstract description 13
- 102200023390 rs11556797 Human genes 0.000 claims abstract description 9
- 102220216380 rs747938069 Human genes 0.000 claims abstract description 9
- 150000001413 amino acids Chemical class 0.000 claims abstract description 8
- 230000000694 effects Effects 0.000 claims abstract description 4
- XMRPGKVKISIQBV-UHFFFAOYSA-N (+-)-5- Pregnane-3,20-dione Natural products C1CC2CC(=O)CCC2(C)C2C1C1CCC(C(=O)C)C1(C)CC2 XMRPGKVKISIQBV-UHFFFAOYSA-N 0.000 claims description 37
- XMRPGKVKISIQBV-XWOJZHJZSA-N 5beta-pregnane-3,20-dione Chemical compound C([C@H]1CC2)C(=O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H](C(=O)C)[C@@]2(C)CC1 XMRPGKVKISIQBV-XWOJZHJZSA-N 0.000 claims description 26
- 210000004027 cell Anatomy 0.000 claims description 20
- 239000013604 expression vector Substances 0.000 claims description 19
- 238000000034 method Methods 0.000 claims description 16
- 239000000758 substrate Substances 0.000 claims description 16
- 108091033319 polynucleotide Proteins 0.000 claims description 15
- 239000002157 polynucleotide Substances 0.000 claims description 15
- 102000040430 polynucleotide Human genes 0.000 claims description 15
- 239000001257 hydrogen Substances 0.000 claims description 14
- 229910052739 hydrogen Inorganic materials 0.000 claims description 14
- -1 bile acid compound Chemical class 0.000 claims description 11
- 239000013612 plasmid Substances 0.000 claims description 10
- KXGVEGMKQFWNSR-UHFFFAOYSA-N deoxycholic acid Natural products C1CC2CC(O)CCC2(C)C2C1C1CCC(C(CCC(O)=O)C)C1(C)C(O)C2 KXGVEGMKQFWNSR-UHFFFAOYSA-N 0.000 claims description 9
- ACFIXJIJDZMPPO-NNYOXOHSSA-N NADPH Chemical compound C1=CCC(C(=O)N)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]2[C@H]([C@@H](OP(O)(O)=O)[C@@H](O2)N2C3=NC=NC(N)=C3N=C2)O)O1 ACFIXJIJDZMPPO-NNYOXOHSSA-N 0.000 claims description 8
- KXGVEGMKQFWNSR-LLQZFEROSA-N deoxycholic acid Chemical group C([C@H]1CC2)[C@H](O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H]([C@@H](CCC(O)=O)C)[C@@]2(C)[C@@H](O)C1 KXGVEGMKQFWNSR-LLQZFEROSA-N 0.000 claims description 8
- 241000588724 Escherichia coli Species 0.000 claims description 7
- 239000000203 mixture Substances 0.000 claims description 7
- 238000004519 manufacturing process Methods 0.000 claims description 6
- BHQCQFFYRZLCQQ-UHFFFAOYSA-N (3alpha,5alpha,7alpha,12alpha)-3,7,12-trihydroxy-cholan-24-oic acid Natural products OC1CC2CC(O)CCC2(C)C2C1C1CCC(C(CCC(O)=O)C)C1(C)C(O)C2 BHQCQFFYRZLCQQ-UHFFFAOYSA-N 0.000 claims description 5
- 239000004380 Cholic acid Substances 0.000 claims description 5
- 229960002471 cholic acid Drugs 0.000 claims description 5
- 235000019416 cholic acid Nutrition 0.000 claims description 5
- 239000003613 bile acid Substances 0.000 claims description 4
- 150000001875 compounds Chemical class 0.000 claims description 4
- 229960003964 deoxycholic acid Drugs 0.000 claims description 4
- 210000003527 eukaryotic cell Anatomy 0.000 claims description 4
- 238000002360 preparation method Methods 0.000 claims description 4
- 210000001236 prokaryotic cell Anatomy 0.000 claims description 4
- 241000894006 Bacteria Species 0.000 claims description 3
- UFHFLCQGNIYNRP-UHFFFAOYSA-N Hydrogen Chemical compound [H][H] UFHFLCQGNIYNRP-UHFFFAOYSA-N 0.000 claims description 3
- 238000006467 substitution reaction Methods 0.000 claims description 3
- 241000186046 Actinomyces Species 0.000 claims description 2
- 241000186063 Arthrobacter Species 0.000 claims description 2
- 244000063299 Bacillus subtilis Species 0.000 claims description 2
- 235000014469 Bacillus subtilis Nutrition 0.000 claims description 2
- 241000206602 Eukaryota Species 0.000 claims description 2
- 241000186359 Mycobacterium Species 0.000 claims description 2
- 241000589516 Pseudomonas Species 0.000 claims description 2
- 241000316848 Rhodococcus <scale insect> Species 0.000 claims description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims description 2
- 238000007792 addition Methods 0.000 claims description 2
- 210000000349 chromosome Anatomy 0.000 claims description 2
- 238000012217 deletion Methods 0.000 claims description 2
- 230000037430 deletion Effects 0.000 claims description 2
- 230000002538 fungal effect Effects 0.000 claims description 2
- 230000009465 prokaryotic expression Effects 0.000 claims description 2
- 239000000047 product Substances 0.000 abstract description 46
- 238000005984 hydrogenation reaction Methods 0.000 abstract description 7
- 230000009467 reduction Effects 0.000 abstract description 5
- 230000008901 benefit Effects 0.000 abstract description 2
- 239000012084 conversion product Substances 0.000 abstract description 2
- RJKFOVLPORLFTN-LEKSSAKUSA-N Progesterone Chemical compound C1CC2=CC(=O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H](C(=O)C)[C@@]1(C)CC2 RJKFOVLPORLFTN-LEKSSAKUSA-N 0.000 description 66
- YPJMOVVQKBFRNH-UHFFFAOYSA-N 1-(9-ethylcarbazol-3-yl)-n-(pyridin-2-ylmethyl)methanamine Chemical compound C=1C=C2N(CC)C3=CC=CC=C3C2=CC=1CNCC1=CC=CC=N1 YPJMOVVQKBFRNH-UHFFFAOYSA-N 0.000 description 33
- 239000000186 progesterone Substances 0.000 description 33
- 229960003387 progesterone Drugs 0.000 description 33
- 238000006243 chemical reaction Methods 0.000 description 28
- 108090000623 proteins and genes Proteins 0.000 description 27
- 101150090155 R gene Proteins 0.000 description 22
- 108700026215 vpr Genes Proteins 0.000 description 22
- XEKOWRVHYACXOJ-UHFFFAOYSA-N Ethyl acetate Chemical compound CCOC(C)=O XEKOWRVHYACXOJ-UHFFFAOYSA-N 0.000 description 18
- 238000001514 detection method Methods 0.000 description 14
- 108010050848 glycylleucine Proteins 0.000 description 12
- XMRPGKVKISIQBV-BJMCWZGWSA-N 5alpha-pregnane-3,20-dione Chemical compound C([C@@H]1CC2)C(=O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H](C(=O)C)[C@@]2(C)CC1 XMRPGKVKISIQBV-BJMCWZGWSA-N 0.000 description 11
- 239000012634 fragment Substances 0.000 description 11
- 238000004809 thin layer chromatography Methods 0.000 description 11
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 10
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 10
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 10
- 239000000126 substance Substances 0.000 description 10
- 102000004190 Enzymes Human genes 0.000 description 9
- 108090000790 Enzymes Proteins 0.000 description 9
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 8
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 8
- 108010041407 alanylaspartic acid Proteins 0.000 description 8
- 108010081551 glycylphenylalanine Proteins 0.000 description 8
- 102000004169 proteins and genes Human genes 0.000 description 8
- 108010071207 serylmethionine Proteins 0.000 description 8
- 101100381300 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) atp-5 gene Proteins 0.000 description 7
- FOHXUHGZZKETFI-JBDRJPRFSA-N Ala-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N FOHXUHGZZKETFI-JBDRJPRFSA-N 0.000 description 6
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 6
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 6
- OAIGZYFGCNNVIE-ZPFDUUQYSA-N Ala-Val-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O OAIGZYFGCNNVIE-ZPFDUUQYSA-N 0.000 description 6
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 6
- YUZPQIQWXLRFBW-ACZMJKKPSA-N Cys-Glu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O YUZPQIQWXLRFBW-ACZMJKKPSA-N 0.000 description 6
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 6
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 6
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 6
- IXHQLZIWBCQBLQ-STQMWFEESA-N Gly-Pro-Phe Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IXHQLZIWBCQBLQ-STQMWFEESA-N 0.000 description 6
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 6
- GGXUJBKENKVYNV-ULQDDVLXSA-N His-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N GGXUJBKENKVYNV-ULQDDVLXSA-N 0.000 description 6
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 6
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 6
- KEVYYIMVELOXCT-KBPBESRZSA-N Leu-Gly-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KEVYYIMVELOXCT-KBPBESRZSA-N 0.000 description 6
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 6
- KKFVKBWCXXLKIK-AVGNSLFASA-N Lys-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCCN)N KKFVKBWCXXLKIK-AVGNSLFASA-N 0.000 description 6
- MNGBICITWAPGAS-BPUTZDHNSA-N Met-Ser-Trp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O MNGBICITWAPGAS-BPUTZDHNSA-N 0.000 description 6
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 6
- 108090000854 Oxidoreductases Proteins 0.000 description 6
- 102000004316 Oxidoreductases Human genes 0.000 description 6
- FYXCBXDAMPEHIQ-FHWLQOOXSA-N Pro-Trp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CCCCN)C(=O)O FYXCBXDAMPEHIQ-FHWLQOOXSA-N 0.000 description 6
- ZWZOCUWOXSDYFZ-CQDKDKBSSA-N Tyr-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZWZOCUWOXSDYFZ-CQDKDKBSSA-N 0.000 description 6
- ABSXSJZNRAQDDI-KJEVXHAQSA-N Tyr-Val-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ABSXSJZNRAQDDI-KJEVXHAQSA-N 0.000 description 6
- GUIYPEKUEMQBIK-JSGCOSHPSA-N Val-Tyr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(O)=O GUIYPEKUEMQBIK-JSGCOSHPSA-N 0.000 description 6
- 108010038633 aspartylglutamate Proteins 0.000 description 6
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 6
- 108010037850 glycylvaline Proteins 0.000 description 6
- 108010043322 lysyl-tryptophyl-alpha-lysine Proteins 0.000 description 6
- 108010009298 lysylglutamic acid Proteins 0.000 description 6
- 108010017391 lysylvaline Proteins 0.000 description 6
- 238000003786 synthesis reaction Methods 0.000 description 6
- 108010061238 threonyl-glycine Proteins 0.000 description 6
- 108010045269 tryptophyltryptophan Proteins 0.000 description 6
- 108010000998 wheylin-2 peptide Proteins 0.000 description 6
- 230000015572 biosynthetic process Effects 0.000 description 5
- 238000004949 mass spectrometry Methods 0.000 description 5
- LZRNYBIJOSKKRJ-XVYDVKMFSA-N Ala-Asp-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LZRNYBIJOSKKRJ-XVYDVKMFSA-N 0.000 description 4
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 4
- UFBURHXMKFQVLM-CIUDSAMLSA-N Arg-Glu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UFBURHXMKFQVLM-CIUDSAMLSA-N 0.000 description 4
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 4
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 4
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 4
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 4
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 4
- REQUGIWGOGSOEZ-ZLUOBGJFSA-N Asn-Ser-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N REQUGIWGOGSOEZ-ZLUOBGJFSA-N 0.000 description 4
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 4
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 4
- PYXXJFRXIYAESU-PCBIJLKTSA-N Asp-Ile-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PYXXJFRXIYAESU-PCBIJLKTSA-N 0.000 description 4
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 4
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 4
- WVWRADGCZPIJJR-IHRRRGAJSA-N Cys-Val-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CS)N WVWRADGCZPIJJR-IHRRRGAJSA-N 0.000 description 4
- GQZDDFRXSDGUNG-YVNDNENWSA-N Gln-Ile-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O GQZDDFRXSDGUNG-YVNDNENWSA-N 0.000 description 4
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 4
- GZWOBWMOMPFPCD-CIUDSAMLSA-N Glu-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N GZWOBWMOMPFPCD-CIUDSAMLSA-N 0.000 description 4
- XHWLNISLUFEWNS-CIUDSAMLSA-N Glu-Gln-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XHWLNISLUFEWNS-CIUDSAMLSA-N 0.000 description 4
- RFDHKPSHTXZKLL-IHRRRGAJSA-N Glu-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N RFDHKPSHTXZKLL-IHRRRGAJSA-N 0.000 description 4
- KASDBWKLWJKTLJ-GUBZILKMSA-N Glu-Glu-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O KASDBWKLWJKTLJ-GUBZILKMSA-N 0.000 description 4
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 4
- MCGNJCNXIMQCMN-DCAQKATOSA-N Glu-Met-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCC(O)=O MCGNJCNXIMQCMN-DCAQKATOSA-N 0.000 description 4
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 4
- HAGKYCXGTRUUFI-RYUDHWBXSA-N Glu-Tyr-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)O HAGKYCXGTRUUFI-RYUDHWBXSA-N 0.000 description 4
- HKSNHPVETYYJBK-LAEOZQHASA-N Gly-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN HKSNHPVETYYJBK-LAEOZQHASA-N 0.000 description 4
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 4
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 4
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 4
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 4
- XHVONGZZVUUORG-WEDXCCLWSA-N Gly-Thr-Lys Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN XHVONGZZVUUORG-WEDXCCLWSA-N 0.000 description 4
- LMMPTUVWHCFTOT-GARJFASQSA-N His-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O LMMPTUVWHCFTOT-GARJFASQSA-N 0.000 description 4
- VGYOLSOFODKLSP-IHPCNDPISA-N His-Leu-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CN=CN1 VGYOLSOFODKLSP-IHPCNDPISA-N 0.000 description 4
- DAKSMIWQZPHRIB-BZSNNMDCSA-N His-Tyr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DAKSMIWQZPHRIB-BZSNNMDCSA-N 0.000 description 4
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 4
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 4
- AREBLHSMLMRICD-PYJNHQTQSA-N Ile-His-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AREBLHSMLMRICD-PYJNHQTQSA-N 0.000 description 4
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 4
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 4
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 4
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 4
- JDCQDJVYUXNCGF-SPOWBLRKSA-N Ile-Ser-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N JDCQDJVYUXNCGF-SPOWBLRKSA-N 0.000 description 4
- JTBFQNHKNRZJDS-SYWGBEHUSA-N Ile-Trp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](C)C(=O)O)N JTBFQNHKNRZJDS-SYWGBEHUSA-N 0.000 description 4
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 4
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 4
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 4
- YRWCPXOFBKTCFY-NUTKFTJISA-N Lys-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N YRWCPXOFBKTCFY-NUTKFTJISA-N 0.000 description 4
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 4
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 4
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 4
- YUTZYVTZDVZBJJ-IHPCNDPISA-N Lys-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 YUTZYVTZDVZBJJ-IHPCNDPISA-N 0.000 description 4
- DRXODWRPPUFIAY-DCAQKATOSA-N Met-Asn-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN DRXODWRPPUFIAY-DCAQKATOSA-N 0.000 description 4
- GMMLGMFBYCFCCX-KZVJFYERSA-N Met-Thr-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMMLGMFBYCFCCX-KZVJFYERSA-N 0.000 description 4
- 108010066427 N-valyltryptophan Proteins 0.000 description 4
- UUWCIPUVJJIEEP-SRVKXCTJSA-N Phe-Asn-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N UUWCIPUVJJIEEP-SRVKXCTJSA-N 0.000 description 4
- KIQUCMUULDXTAZ-HJOGWXRNSA-N Phe-Tyr-Tyr Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O KIQUCMUULDXTAZ-HJOGWXRNSA-N 0.000 description 4
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 4
- ICTZKEXYDDZZFP-SRVKXCTJSA-N Pro-Arg-Pro Chemical compound N([C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 ICTZKEXYDDZZFP-SRVKXCTJSA-N 0.000 description 4
- KQCCDMFIALWGTL-GUBZILKMSA-N Pro-Asn-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 KQCCDMFIALWGTL-GUBZILKMSA-N 0.000 description 4
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 4
- AQGUSRZKDZYGGV-GMOBBJLQSA-N Pro-Ile-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O AQGUSRZKDZYGGV-GMOBBJLQSA-N 0.000 description 4
- SPLBRAKYXGOFSO-UNQGMJICSA-N Pro-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@@H]2CCCN2)O SPLBRAKYXGOFSO-UNQGMJICSA-N 0.000 description 4
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 4
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 4
- XXNYYSXNXCJYKX-DCAQKATOSA-N Ser-Leu-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O XXNYYSXNXCJYKX-DCAQKATOSA-N 0.000 description 4
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 4
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 4
- XGQKSRGHEZNWIS-IHRRRGAJSA-N Ser-Pro-Tyr Chemical compound N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O XGQKSRGHEZNWIS-IHRRRGAJSA-N 0.000 description 4
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 description 4
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 4
- IJKNKFJZOJCKRR-GBALPHGKSA-N Thr-Trp-Ser Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 IJKNKFJZOJCKRR-GBALPHGKSA-N 0.000 description 4
- HHPSUFUXXBOFQY-AQZXSJQPSA-N Trp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O HHPSUFUXXBOFQY-AQZXSJQPSA-N 0.000 description 4
- CUHBVKUVJIXRFK-DVXDUOKCSA-N Trp-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC=3C4=CC=CC=C4NC=3)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CUHBVKUVJIXRFK-DVXDUOKCSA-N 0.000 description 4
- AOLQJUGGZLTUBD-WIRXVTQYSA-N Trp-Trp-Phe Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O AOLQJUGGZLTUBD-WIRXVTQYSA-N 0.000 description 4
- AXWBYOVVDRBOGU-SIUGBPQLSA-N Tyr-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N AXWBYOVVDRBOGU-SIUGBPQLSA-N 0.000 description 4
- JAGGEZACYAAMIL-CQDKDKBSSA-N Tyr-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JAGGEZACYAAMIL-CQDKDKBSSA-N 0.000 description 4
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 4
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 4
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 4
- 108010005233 alanylglutamic acid Proteins 0.000 description 4
- 108010047495 alanylglycine Proteins 0.000 description 4
- 108010060035 arginylproline Proteins 0.000 description 4
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 4
- 108010068265 aspartyltyrosine Proteins 0.000 description 4
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 4
- 238000004817 gas chromatography Methods 0.000 description 4
- 230000014509 gene expression Effects 0.000 description 4
- 108010085325 histidylproline Proteins 0.000 description 4
- 238000009776 industrial production Methods 0.000 description 4
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 4
- 239000006166 lysate Substances 0.000 description 4
- 108010054155 lysyllysine Proteins 0.000 description 4
- 239000002609 medium Substances 0.000 description 4
- 108010012581 phenylalanylglutamate Proteins 0.000 description 4
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 4
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 4
- 238000001228 spectrum Methods 0.000 description 4
- 150000003431 steroids Chemical class 0.000 description 4
- 108091005804 Peptidases Proteins 0.000 description 3
- 239000004365 Protease Substances 0.000 description 3
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 3
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 3
- 108010068380 arginylarginine Proteins 0.000 description 3
- 230000036983 biotransformation Effects 0.000 description 3
- 238000005520 cutting process Methods 0.000 description 3
- 229940079593 drug Drugs 0.000 description 3
- 239000003814 drug Substances 0.000 description 3
- 238000000605 extraction Methods 0.000 description 3
- 239000013558 reference substance Substances 0.000 description 3
- 239000002904 solvent Substances 0.000 description 3
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 2
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 2
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 2
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 2
- BVSGPHDECMJBDE-HGNGGELXSA-N Ala-Glu-His Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BVSGPHDECMJBDE-HGNGGELXSA-N 0.000 description 2
- CFPQUJZTLUQUTJ-HTFCKZLJSA-N Ala-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](C)N CFPQUJZTLUQUTJ-HTFCKZLJSA-N 0.000 description 2
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 2
- CJQAEJMHBAOQHA-DLOVCJGASA-N Ala-Phe-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CJQAEJMHBAOQHA-DLOVCJGASA-N 0.000 description 2
- UBTKNYUAMYRMKE-GOPGUHFVSA-N Ala-Trp-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N UBTKNYUAMYRMKE-GOPGUHFVSA-N 0.000 description 2
- 101100519158 Arabidopsis thaliana PCR2 gene Proteins 0.000 description 2
- 101100519159 Arabidopsis thaliana PCR3 gene Proteins 0.000 description 2
- IIABBYGHLYWVOS-FXQIFTODSA-N Arg-Asn-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O IIABBYGHLYWVOS-FXQIFTODSA-N 0.000 description 2
- DGFXIWKPTDKBLF-AVGNSLFASA-N Arg-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N DGFXIWKPTDKBLF-AVGNSLFASA-N 0.000 description 2
- HGKHPCFTRQDHCU-IUCAKERBSA-N Arg-Pro-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HGKHPCFTRQDHCU-IUCAKERBSA-N 0.000 description 2
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 2
- RZVVKNIACROXRM-ZLUOBGJFSA-N Asn-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N RZVVKNIACROXRM-ZLUOBGJFSA-N 0.000 description 2
- NKTLGLBAGUJEGA-BIIVOSGPSA-N Asn-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N)C(=O)O NKTLGLBAGUJEGA-BIIVOSGPSA-N 0.000 description 2
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 2
- XLZCLJRGGMBKLR-PCBIJLKTSA-N Asn-Ile-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XLZCLJRGGMBKLR-PCBIJLKTSA-N 0.000 description 2
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 2
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 2
- RBOBTTLFPRSXKZ-BZSNNMDCSA-N Asn-Phe-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RBOBTTLFPRSXKZ-BZSNNMDCSA-N 0.000 description 2
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 2
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 2
- VTYQAQFKMQTKQD-ACZMJKKPSA-N Asp-Ala-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O VTYQAQFKMQTKQD-ACZMJKKPSA-N 0.000 description 2
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 2
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 2
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 2
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 2
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 2
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 2
- ZVGRHIRJLWBWGJ-ACZMJKKPSA-N Asp-Ser-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVGRHIRJLWBWGJ-ACZMJKKPSA-N 0.000 description 2
- MJJIHRWNWSQTOI-VEVYYDQMSA-N Asp-Thr-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MJJIHRWNWSQTOI-VEVYYDQMSA-N 0.000 description 2
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 description 2
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 2
- HKALUUKHYNEDRS-GUBZILKMSA-N Cys-Leu-Gln Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HKALUUKHYNEDRS-GUBZILKMSA-N 0.000 description 2
- RJPKQCFHEPPTGL-ZLUOBGJFSA-N Cys-Ser-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RJPKQCFHEPPTGL-ZLUOBGJFSA-N 0.000 description 2
- RTZKZFJDLAIYFH-UHFFFAOYSA-N Diethyl ether Chemical compound CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 2
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 2
- LWDGZZGWDMHBOF-FXQIFTODSA-N Gln-Glu-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LWDGZZGWDMHBOF-FXQIFTODSA-N 0.000 description 2
- MAGNEQBFSBREJL-DCAQKATOSA-N Gln-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N MAGNEQBFSBREJL-DCAQKATOSA-N 0.000 description 2
- ZGHMRONFHDVXEF-AVGNSLFASA-N Gln-Ser-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZGHMRONFHDVXEF-AVGNSLFASA-N 0.000 description 2
- GHAXJVNBAKGWEJ-AVGNSLFASA-N Gln-Ser-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GHAXJVNBAKGWEJ-AVGNSLFASA-N 0.000 description 2
- NTBDVNJIWCKURJ-ACZMJKKPSA-N Glu-Asp-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NTBDVNJIWCKURJ-ACZMJKKPSA-N 0.000 description 2
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 2
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 2
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 2
- XIKYNVKEUINBGL-IUCAKERBSA-N Glu-His-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O XIKYNVKEUINBGL-IUCAKERBSA-N 0.000 description 2
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 2
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 2
- WXONSNSSBYQGNN-AVGNSLFASA-N Glu-Ser-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WXONSNSSBYQGNN-AVGNSLFASA-N 0.000 description 2
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 2
- RXJFSLQVMGYQEL-IHRRRGAJSA-N Glu-Tyr-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 RXJFSLQVMGYQEL-IHRRRGAJSA-N 0.000 description 2
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 2
- KFMBRBPXHVMDFN-UWVGGRQHSA-N Gly-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCNC(N)=N KFMBRBPXHVMDFN-UWVGGRQHSA-N 0.000 description 2
- NZAFOTBEULLEQB-WDSKDSINSA-N Gly-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN NZAFOTBEULLEQB-WDSKDSINSA-N 0.000 description 2
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 2
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 2
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 2
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 2
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 2
- 108010009504 Gly-Phe-Leu-Gly Proteins 0.000 description 2
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 2
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 2
- HTZKFIYQMHJWSQ-INTQDDNPSA-N His-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HTZKFIYQMHJWSQ-INTQDDNPSA-N 0.000 description 2
- RAVLQPXCMRCLKT-KBPBESRZSA-N His-Gly-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RAVLQPXCMRCLKT-KBPBESRZSA-N 0.000 description 2
- ORERHHPZDDEMSC-VGDYDELISA-N His-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ORERHHPZDDEMSC-VGDYDELISA-N 0.000 description 2
- WZBLRQQCDYYRTD-SIXJUCDHSA-N His-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N WZBLRQQCDYYRTD-SIXJUCDHSA-N 0.000 description 2
- VTMSUKSRIKCCAD-ULQDDVLXSA-N His-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N VTMSUKSRIKCCAD-ULQDDVLXSA-N 0.000 description 2
- WSAILOWUJZEAGC-DCAQKATOSA-N His-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WSAILOWUJZEAGC-DCAQKATOSA-N 0.000 description 2
- XDVKZSJODLMNLJ-GGQYPGDFSA-N Ile-Trp-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC=3C4=CC=CC=C4NC=3)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 XDVKZSJODLMNLJ-GGQYPGDFSA-N 0.000 description 2
- VKOAHIRLIUESLU-ULQDDVLXSA-N Leu-Arg-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VKOAHIRLIUESLU-ULQDDVLXSA-N 0.000 description 2
- WCTCIIAGNMFYAO-DCAQKATOSA-N Leu-Cys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O WCTCIIAGNMFYAO-DCAQKATOSA-N 0.000 description 2
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 2
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 2
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 2
- ONPJGOIVICHWBW-BZSNNMDCSA-N Leu-Lys-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 ONPJGOIVICHWBW-BZSNNMDCSA-N 0.000 description 2
- MJTOYIHCKVQICL-ULQDDVLXSA-N Leu-Met-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MJTOYIHCKVQICL-ULQDDVLXSA-N 0.000 description 2
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 2
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 2
- RNYLNYTYMXACRI-VFAJRCTISA-N Leu-Thr-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O RNYLNYTYMXACRI-VFAJRCTISA-N 0.000 description 2
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 2
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 2
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 2
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 2
- PAMDBWYMLWOELY-SDDRHHMPSA-N Lys-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O PAMDBWYMLWOELY-SDDRHHMPSA-N 0.000 description 2
- ZMMDPRTXLAEMOD-BZSNNMDCSA-N Lys-His-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZMMDPRTXLAEMOD-BZSNNMDCSA-N 0.000 description 2
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 2
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 2
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 2
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 2
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 2
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 2
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 2
- YNOVBMBQSQTLFM-DCAQKATOSA-N Met-Asn-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O YNOVBMBQSQTLFM-DCAQKATOSA-N 0.000 description 2
- HGAJNEWOUHDUMZ-SRVKXCTJSA-N Met-Leu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O HGAJNEWOUHDUMZ-SRVKXCTJSA-N 0.000 description 2
- AXHNAGAYRGCDLG-UWVGGRQHSA-N Met-Lys-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AXHNAGAYRGCDLG-UWVGGRQHSA-N 0.000 description 2
- LNXGEYIEEUZGGH-JYJNAYRXSA-N Met-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)CC1=CC=CC=C1 LNXGEYIEEUZGGH-JYJNAYRXSA-N 0.000 description 2
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 2
- 238000005481 NMR spectroscopy Methods 0.000 description 2
- 238000012408 PCR amplification Methods 0.000 description 2
- JEBWZLWTRPZQRX-QWRGUYRKSA-N Phe-Gly-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O JEBWZLWTRPZQRX-QWRGUYRKSA-N 0.000 description 2
- HNFUGJUZJRYUHN-JSGCOSHPSA-N Phe-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HNFUGJUZJRYUHN-JSGCOSHPSA-N 0.000 description 2
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 2
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 2
- WZEWCHQHNCMBEN-PMVMPFDFSA-N Phe-Lys-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N WZEWCHQHNCMBEN-PMVMPFDFSA-N 0.000 description 2
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 2
- AMBLXEMWFARNNQ-DCAQKATOSA-N Pro-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 AMBLXEMWFARNNQ-DCAQKATOSA-N 0.000 description 2
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 2
- BBFRBZYKHIKFBX-GMOBBJLQSA-N Pro-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BBFRBZYKHIKFBX-GMOBBJLQSA-N 0.000 description 2
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 2
- CDGABSWLRMECHC-IHRRRGAJSA-N Pro-Lys-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O CDGABSWLRMECHC-IHRRRGAJSA-N 0.000 description 2
- AJNGQVUFQUVRQT-JYJNAYRXSA-N Pro-Pro-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 AJNGQVUFQUVRQT-JYJNAYRXSA-N 0.000 description 2
- DLZBBDSPTJBOOD-BPNCWPANSA-N Pro-Tyr-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O DLZBBDSPTJBOOD-BPNCWPANSA-N 0.000 description 2
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 2
- XERQKTRGJIKTRB-CIUDSAMLSA-N Ser-His-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CN=CN1 XERQKTRGJIKTRB-CIUDSAMLSA-N 0.000 description 2
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 2
- AMRRYKHCILPAKD-FXQIFTODSA-N Ser-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N AMRRYKHCILPAKD-FXQIFTODSA-N 0.000 description 2
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 2
- AXKJPUBALUNJEO-UBHSHLNASA-N Ser-Trp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O AXKJPUBALUNJEO-UBHSHLNASA-N 0.000 description 2
- SYCFMSYTIFXWAJ-DCAQKATOSA-N Ser-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N SYCFMSYTIFXWAJ-DCAQKATOSA-N 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- GZYNMZQXFRWDFH-YTWAJWBKSA-N Thr-Arg-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O GZYNMZQXFRWDFH-YTWAJWBKSA-N 0.000 description 2
- CQNFRKAKGDSJFR-NUMRIWBASA-N Thr-Glu-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CQNFRKAKGDSJFR-NUMRIWBASA-N 0.000 description 2
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 2
- WYKJENSCCRJLRC-ZDLURKLDSA-N Thr-Gly-Cys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)O WYKJENSCCRJLRC-ZDLURKLDSA-N 0.000 description 2
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 2
- BJJRNAVDQGREGC-HOUAVDHOSA-N Thr-Trp-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O BJJRNAVDQGREGC-HOUAVDHOSA-N 0.000 description 2
- BDWDMRSGCXEDMR-WFBYXXMGSA-N Trp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N BDWDMRSGCXEDMR-WFBYXXMGSA-N 0.000 description 2
- LHHDBONOFZDWMW-AAEUAGOBSA-N Trp-Asp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N LHHDBONOFZDWMW-AAEUAGOBSA-N 0.000 description 2
- CZWIHKFGHICAJX-BPUTZDHNSA-N Trp-Glu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 CZWIHKFGHICAJX-BPUTZDHNSA-N 0.000 description 2
- FEZASNVQLJQBHW-CABZTGNLSA-N Trp-Gly-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O)=CNC2=C1 FEZASNVQLJQBHW-CABZTGNLSA-N 0.000 description 2
- HLDFBNPSURDYEN-VHWLVUOQSA-N Trp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N HLDFBNPSURDYEN-VHWLVUOQSA-N 0.000 description 2
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 2
- NOOMDULIORCDNF-IRXDYDNUSA-N Tyr-Gly-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NOOMDULIORCDNF-IRXDYDNUSA-N 0.000 description 2
- GYKDRHDMGQUZPU-MGHWNKPDSA-N Tyr-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GYKDRHDMGQUZPU-MGHWNKPDSA-N 0.000 description 2
- GQVZBMROTPEPIF-SRVKXCTJSA-N Tyr-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GQVZBMROTPEPIF-SRVKXCTJSA-N 0.000 description 2
- XYBNMHRFAUKPAW-IHRRRGAJSA-N Tyr-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XYBNMHRFAUKPAW-IHRRRGAJSA-N 0.000 description 2
- KLOZTPOXVVRVAQ-DZKIICNBSA-N Tyr-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 KLOZTPOXVVRVAQ-DZKIICNBSA-N 0.000 description 2
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 2
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 2
- FTKXYXACXYOHND-XUXIUFHCSA-N Val-Ile-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O FTKXYXACXYOHND-XUXIUFHCSA-N 0.000 description 2
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 2
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 2
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 2
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 2
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 2
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 108010011559 alanylphenylalanine Proteins 0.000 description 2
- 108010087924 alanylproline Proteins 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 108010077245 asparaginyl-proline Proteins 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- 230000003197 catalytic effect Effects 0.000 description 2
- 239000007795 chemical reaction product Substances 0.000 description 2
- 239000008367 deionised water Substances 0.000 description 2
- 229910021641 deionized water Inorganic materials 0.000 description 2
- 239000013613 expression plasmid Substances 0.000 description 2
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 2
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- 108010028295 histidylhistidine Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 125000004435 hydrogen atom Chemical group [H]* 0.000 description 2
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 2
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 2
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 2
- 230000014759 maintenance of location Effects 0.000 description 2
- 238000001819 mass spectrum Methods 0.000 description 2
- 108010005942 methionylglycine Proteins 0.000 description 2
- 238000007857 nested PCR Methods 0.000 description 2
- ZRSNZINYAWTAHE-UHFFFAOYSA-N p-methoxybenzaldehyde Chemical compound COC1=CC=C(C=O)C=C1 ZRSNZINYAWTAHE-UHFFFAOYSA-N 0.000 description 2
- 108010031719 prolyl-serine Proteins 0.000 description 2
- 108010079317 prolyl-tyrosine Proteins 0.000 description 2
- 108010029020 prolylglycine Proteins 0.000 description 2
- 108010053725 prolylvaline Proteins 0.000 description 2
- 238000000425 proton nuclear magnetic resonance spectrum Methods 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 238000010839 reverse transcription Methods 0.000 description 2
- 229920006395 saturated elastomer Polymers 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 238000002741 site-directed mutagenesis Methods 0.000 description 2
- 238000005507 spraying Methods 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 108010003137 tyrosyltyrosine Proteins 0.000 description 2
- 239000013598 vector Substances 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- BFZHCUBIASXHPK-QJSKAATBSA-N 11alpha-hydroxyprogesterone Chemical compound C1CC2=CC(=O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H](C(=O)C)[C@@]1(C)C[C@H]2O BFZHCUBIASXHPK-QJSKAATBSA-N 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- 101150096316 5 gene Proteins 0.000 description 1
- 241000219195 Arabidopsis thaliana Species 0.000 description 1
- OMFXVFTZEKFJBZ-UHFFFAOYSA-N Corticosterone Natural products O=C1CCC2(C)C3C(O)CC(C)(C(CC4)C(=O)CO)C4C3CCC2=C1 OMFXVFTZEKFJBZ-UHFFFAOYSA-N 0.000 description 1
- 206010064571 Gene mutation Diseases 0.000 description 1
- 101150102573 PCR1 gene Proteins 0.000 description 1
- 241000235546 Rhizopus stolonifer Species 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- 101710142587 Short-chain dehydrogenase/reductase Proteins 0.000 description 1
- 101150035122 VEP1 gene Proteins 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 125000000218 acetic acid group Chemical group C(C)(=O)* 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000003766 bioinformatics method Methods 0.000 description 1
- 229960000074 biopharmaceutical Drugs 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 229940041514 candida albicans extract Drugs 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- 230000006037 cell lysis Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- OMFXVFTZEKFJBZ-HJTSIMOOSA-N corticosterone Chemical compound O=C1CC[C@]2(C)[C@H]3[C@@H](O)C[C@](C)([C@H](CC4)C(=O)CO)[C@@H]4[C@@H]3CCC2=C1 OMFXVFTZEKFJBZ-HJTSIMOOSA-N 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000010812 external standard method Methods 0.000 description 1
- 239000000706 filtrate Substances 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 239000007789 gas Substances 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 239000003163 gonadal steroid hormone Substances 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- 239000012535 impurity Substances 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- OOYGSFOGFJDDHP-KMCOLRRFSA-N kanamycin A sulfate Chemical compound OS(O)(=O)=O.O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N OOYGSFOGFJDDHP-KMCOLRRFSA-N 0.000 description 1
- 229960002064 kanamycin sulfate Drugs 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 239000002773 nucleotide Substances 0.000 description 1
- 125000003729 nucleotide group Chemical group 0.000 description 1
- 239000012074 organic phase Substances 0.000 description 1
- 239000003208 petroleum Substances 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 230000001954 sterilising effect Effects 0.000 description 1
- 230000003637 steroidlike Effects 0.000 description 1
- 125000001424 substituent group Chemical group 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 239000012137 tryptone Substances 0.000 description 1
- 238000005303 weighing Methods 0.000 description 1
- 239000012138 yeast extract Substances 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/001—Oxidoreductases (1.) acting on the CH-CH group of donors (1.3)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P33/00—Preparation of steroids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y103/00—Oxidoreductases acting on the CH-CH group of donors (1.3)
- C12Y103/99—Oxidoreductases acting on the CH-CH group of donors (1.3) with other acceptors (1.3.99)
- C12Y103/99006—3-Oxo-5-beta-steroid 4-dehydrogenase (1.3.99.6)
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Health & Medical Sciences (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Genetics & Genomics (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
本发明涉及一种甾体5β还原酶变体及其用途。所述甾体5β还原酶变体具有甾体5β还原酶活性,其氨基酸序列包含R63K和R64H突变,能够利用NADH为辅酶,且具有选自以下的氨基酸序列:1)SEQ ID NO:1、3或5所示氨基酸序列;2)SEQ ID NO:1、3或5所示氨基酸序列经过取代、缺失或添加一个或多个氨基酸而得到的氨基酸序列;或3)与SEQ ID NO:1、3或5具有69%以上序列同一性的氨基酸序列。所述甾体5β还原酶变体的转化产物构象单一,只有β型还原产物出现,具有空间加氢特异性,产物中无α型结构,具有高效、稳定的特点。
Description
技术领域
本发明涉及基因工程和酶工程领域,更具体而言,本发明涉及一种甾体5β还原酶变体及其用途。
背景技术
甾体药物是广泛应用于临床的一大类药物,其大致分为类皮质激素和性激素两大类,对机体起着非常重要的调节作用,在医药工业中占有重要的地位。甾体药物的发现及成功合成是近半个世纪以来医药工业取得的最引人注目的两大进展之一。然而,利用化学法合成时,往往合成步骤多,效率低,价格昂贵。1950年Murray和Peterson利用黑根霉高效转化黄体酮为11α-羟基黄体酮,使从孕酮合成皮质酮只需要3步,这一研究成果引起了微生物学者的极大兴趣,此后开展了大量的微生物对甾体转化的研究工作。
有别于甾体化合物结构中的5α氢,胆酸类化合物5位上的氢为5β,是一个重要的化学结构,其化学合成难度大,急需高效专一的生物转化法。文献报道,所有甾体5β还原酶均以NADPH为辅酶(Herl V,Fischer G,Reva VA,Stiebritz M,Muller YA,Müller-Uri F,Kreis W:The VEP1 gene(At4g24220)encodes a short-chain dehydrogenase/reductasewith3-oxo-Delta4,5-steroid 5beta-reductase activity in Arabidopsis thalianaL.Biochimie 2009,91(4):517-525.),但是在实际工业生产中由于NADPH稳定性差,价格昂贵,不能工业化应用。
由于自然存在的酶在工业生产上会存在很多问题,例如对底物催化缓慢,稳定性差,或生产成本高等,不适用于工业应用。而对于甾体5β还原酶类,考虑最多的是酶对底物的催化速度和辅酶的选择性。因此,需要深入了解甾体5β还原酶的辅酶特异性问题。若甾体5β还原酶能够以NADH为辅酶,则将可以克服这些困难,因此甾体5β还原酶利用的辅酶由NADPH转变为NADH的研究具有重大应用价值。
发明内容
本发明的目的在于克服现有技术中存在的缺陷与不足,提供了一种甾体5β还原酶的变体,该变体的辅酶特异性由稳定性差的NADPH改变为价格低廉、性质稳定的NADH,该变体具有高效、稳定的特点,能够大大降低实际工业生产的成本。
不受具体理论约束,本发明提供的甾体5β还原酶变体将甾体转化为5β-甾体的方程式如下:
其中,R可为任何取代基,如羟丙基或乙酰基。
本发明通过对甾体5β还原酶进行改造,获得了一种甾体5β还原酶变体,所述甾体5β还原酶变体的氨基酸序列包括两个相邻位点的RR突变为相邻的KH。优选地,该变体的SEQID NO:1、3或5所示氨基酸序列具有R63K和R64H突变(R63K/R64H)。该突变使得甾体5β还原酶可利用NADPH和NADH为辅酶。
本发明提供了一种具有甾体5β还原酶活性的甾体5β还原酶变体,所述甾体5β还原酶变体的氨基酸序列包含R63K和R64H突变,能够利用NADH为辅酶,且具有选自以下的氨基酸序列:
1)SEQ ID NO:1、SEQ ID NO:3或SEQ ID NO:5所示氨基酸序列;
2)SEQ ID NO:1、SEQ ID NO:3或SEQ ID NO:5所示氨基酸序列经过取代、缺失或添加一个或多个氨基酸而得到的氨基酸序列;或
3)与SEQ ID NO:1、SEQ ID NO:3或SEQ ID NO:5具有69%以上序列同一性的氨基酸序列。
优选地,所述甾体5β还原酶变体能够利用NADPH和NADH为辅酶。
优选地,所述甾体5β还原酶变体是CmP5βR-LY,其氨基酸序列如SEQ ID NO:1所示,其编码多核苷酸序列如SEQ ID NO:2所示。具体序列如下:
SEQ ID NO:1
MSWWGAGAIGAAKKKLDDDEPTQSYESVALIIGVTGIVGNSLAEILPLSDTLGGPWKVYGVAKHPRPSWNADHPIDYIQCDVSNADDARSKLSPLTDVTHVFYVTWTNRESETENCEANGSMLRNVLRAVVPHAPNLRHVCLQTGTKHYLGPFTNVDGPHHDPPFTEDMPRLQIQNFYYTQEDVLFEEIKKKEGVTWSIHRPNMIFGFSPYSLMNIVGTLCVYAAICKHEGSPLMFPGSKKAWEGFMTASDADLIAEQQIWAAVDPYAKNEAFNCNNADIFKWKHLWKILAEQFGIEEYGFEEGKNLGLVEMMKGKERVWEEMVKENQLLEKKLDEVGVWWFADVILGVEGMIDSMNKSKEHGFLGFRNSNNSFISWIDKYKAFKIVP
SEQ ID NO:2
ATGAGCTGGTGGGGCGCCGGTGCGATTGGTGCGGCGAAAAAGAAACTGGACGATGACGAGCCGACCCAGAGCTACGAGAGTGTTGCGCTGATCATCGGCGTTACGGGCATCGTTGGCAACAGTCTGGCGGAAATTCTGCCACTGAGCGATACGCTGGGTGGCCCGTGGAAAGTGTATGGTGTTGCGAAACATCCACGTCCAAGCTGGAATGCCGACCACCCGATCGACTACATCCAGTGCGACGTGAGTAACGCCGATGATGCGCGCAGCAAACTGAGCCCGCTGACCGATGTTACCCACGTGTTTTACGTGACGTGGACCAACCGCGAAAGCGAAACGGAAAACTGCGAAGCGAACGGCAGCATGCTGCGCAATGTGCTGCGCGCCGTTGTGCCACATGCCCCAAATCTGCGCCATGTGTGTCTGCAGACCGGCACGAAACACTATCTGGGCCCGTTTACGAATGTGGATGGCCCACACCACGACCCACCGTTCACGGAAGATATGCCGCGCCTCCAGATCCAGAACTTCTACTACACCCAAGAAGATGTGCTCTTTGAGGAGATCAAGAAAAAAGAAGGCGTGACGTGGAGCATCCACCGCCCAAATATGATCTTCGGCTTCAGCCCGTACAGTCTGATGAATATCGTGGGCACGCTGTGTGTGTACGCCGCCATCTGCAAACATGAGGGTAGTCCGCTGATGTTCCCGGGCAGTAAAAAAGCGTGGGAGGGCTTCATGACCGCCAGCGATGCCGACCTCATCGCCGAACAGCAGATTTGGGCGGCCGTGGACCCGTATGCCAAGAACGAGGCGTTCAACTGCAACAACGCCGACATCTTCAAGTGGAAACACCTCTGGAAAATTCTGGCCGAGCAGTTTGGCATTGAGGAGTACGGCTTCGAAGAAGGCAAGAACCTCGGTCTGGTGGAGATGATGAAAGGCAAGGAACGCGTGTGGGAGGAGATGGTTAAGGAAAACCAGCTGCTCGAGAAAAAGCTCGACGAGGTTGGCGTTTGGTGGTTTGCGGACGTTATTCTGGGTGTTGAGGGCATGATCGACAGTATGAATAAGAGCAAGGAACACGGCTTTCTGGGCTTCCGCAACAGCAACAACAGCTTCATTAGCTGGATCGATAAGTATAAAGCCTTCAAAATTGTGCCG
优选地,所述甾体5β还原酶变体是AtP5βR-LY,其氨基酸序列如SEQ ID NO:3所示,其编码多核苷酸序列如SEQ ID NO:4所示。
SEQ ID NO:3
MSWWWAGAIGAAKKKLDEDEPSQSFESVALIIGVTGIVGNSLAEILPLSDTPGGPWKVYGVAKHPRPTWNADHPIDYIQCDVSDAEDTRSKLSPLTDVTHVFYVTWTNRESESENCEANGSMLRNVLQAIIPYAPNLRHVCLQTGTKHYLGPFTNVDGPRHDPPFTEDMPRLQIQNFYYTQEDILFEEIKKIETVTWSIHRPNMIFGFSPYSLMNIVGTLCVYAAICKHEGSPLLFPGSKKAWEGFMTASDADLIAEQQIWAAVDPYAKNEAFNCNNADIFKWKHLWKILAEQFGIEEYGFEEGKNLGLVEMMKGKERVWEEMVKENQLQEKKLEEVGVWWFADVILGVEGMIDSMNKSKEYGFLGFRNSNNSFISWIDKYKAFKIVP
SEQ ID NO:4
ATGAGCTGGTGGTGGGCAGGTGCAATTGGTGCCGCCAAGAAGAAGCTGGATGAGGATGAACCGAGCCAGAGCTTTGAAAGCGTGGCCCTGATCATCGGTGTGACCGGCATCGTTGGCAATAGCCTGGCCGAAATCCTGCCGCTGAGCGATACCCCTGGTGGTCCGTGGAAAGTTTATGGTGTGGCAAAACATCCTCGTCCGACCTGGAACGCAGATCACCCGATTGACTACATCCAATGCGACGTGAGCGATGCAGAAGACACCCGTAGCAAACTGAGCCCGCTGACAGATGTGACCCACGTGTTCTACGTGACCTGGACCAACCGTGAAAGCGAGAGCGAAAATTGTGAGGCCAACGGCAGCATGCTGCGCAATGTGCTGCAGGCAATTATCCCGTACGCACCGAATCTGCGTCACGTGTGTCTGCAGACAGGCACCAAGCATTACCTGGGCCCGTTTACCAACGTTGATGGCCCTCGCCATGATCCTCCGTTTACCGAGGACATGCCGCGCCTGCAGATCCAGAATTTCTACTACACCCAAGAAGATATTCTGTTTGAAGAAATCAAAAAGATCGAAACCGTGACCTGGAGCATCCACCGCCCGAACATGATCTTTGGCTTCAGCCCGTATAGCCTGATGAACATCGTGGGCACACTGTGCGTGTACGCAGCCATCTGCAAGCACGAAGGTAGCCCGCTGCTGTTTCCGGGTAGCAAGAAAGCCTGGGAGGGCTTTATGACAGCAAGCGATGCCGACCTGATTGCCGAACAGCAGATTTGGGCCGCCGTGGATCCGTATGCCAAAAACGAGGCCTTCAACTGCAATAACGCCGATATTTTTAAATGGAAACATCTGTGGAAAATCCTGGCCGAGCAGTTTGGCATCGAAGAATACGGCTTCGAAGAAGGCAAGAACCTGGGCCTGGTTGAGATGATGAAAGGCAAGGAGCGCGTGTGGGAAGAAATGGTTAAGGAGAACCAGCTGCAGGAGAAAAAGCTGGAGGAAGTGGGTGTGTGGTGGTTCGCCGATGTGATCCTGGGCGTTGAAGGCATGATCGATAGTATGAATAAAAGCAAGGAATATGGCTTCCTGGGCTTTCGCAACAGCAACAACAGCTTTATTAGCTGGATTGATAAATATAAAGCATTTAAGATTGTGCCT
优选地,所述甾体5β还原酶变体是DlP5βR-LY,其氨基酸序列如SEQ ID NO:5所示,其编码多核苷酸序列如SEQ ID NO:6所示。
SEQ ID NO:5
MSWWWAGAIGAAKKRLEEDDAQPKHSSVALIVGVTGIIGNSLAEILPLADTPGGPWKVYGVAKHTRPAWHEDNPINYVQCDISDPDDSQAKLSPLTDVTHVFYVTWANRSTEQENCEANSKMFRNVLDAVIPNCPNLKHISLQTGRKHYMGPFESYGKIESHDPPYTEDLPRLKYMNFYYDLEDIMLEEVEKKEGLTWSVHRPGNIFGFSPYSMMNLVGTLCVYAAICKHEGKVLRFTGCKAAWDGYSDCSDADLIAEHHIWAAVDPYAKNEAFNVSNGDVFKWKHFWKVLAEQFGVGCGEYEEGVDLKLQDLMKGKEPVWEEIVRENGLTPTKLKDVGIWWFGDVILGNECFLDSMNKSKEHGFLGFRNSKNAFISWIDKAKAYKIVP
SEQ ID NO:6
ATGAGCTGGTGGTGGGCCGGGGCGATTGGTGCCGCAAAAAAACGTCTGGAAGAAGATGATGCACAGCCGAAACATAGCAGCGTTGCACTGATTGTTGGTGTTACCGGTATTATTGGTAATAGCCTGGCAGAAATTCTGCCGCTGGCAGATACCCCGGGTGGTCCGTGGAAAGTTTATGGTGTTGCAAAACATACCCGTCCGGCATGGCATGAAGATAATCCGATTAATTATGTTCAGTGTGATATTAGCGATCCGGATGATAGCCAGGCAAAACTGAGCCCGCTGACCGATGTTACCCATGTTTTTTATGTTACCTGGGCAAATCGTAGCACCGAACAGGAAAATTGTGAAGCAAATAGCAAAATGTTTCGTAATGTTCTGGATGCAGTTATTCCGAATTGTCCGAATCTGAAACATATTAGCCTGCAGACCGGTCGTAAACATTATATGGGTCCGTTTGAAAGCTATGGTAAAATTGAAAGCCATGATCCGCCGTATACCGAAGATCTGCCGCGTCTGAAATATATGAATTTTTATTATGATCTGGAAGATATTATGCTGGAAGAAGTTGAAAAAAAAGAAGGTCTGACCTGGAGCGTTCATCGTCCGGGTAATATTTTTGGTTTTAGCCCGTATAGCATGATGAATCTGGTTGGTACCCTGTGTGTTTATGCAGCAATTTGTAAACATGAAGGTAAAGTTCTGCGTTTTACCGGTTGTAAAGCAGCATGGGATGGTTATAGCGATTGTAGCGATGCAGATCTGATTGCAGAACATCATATTTGGGCAGCAGTTGATCCGTATGCAAAAAATGAAGCATTTAATGTTAGCAATGGTGATGTTTTTAAATGGAAACATTTTTGGAAAGTTCTGGCAGAACAGTTTGGTGTTGGTTGTGGTGAATATGAAGAAGGTGTTGATCTGAAACTGCAGGATCTGATGAAAGGTAAAGAACCGGTTTGGGAAGAAATTGTTCGTGAAAATGGTCTGACCCCGACCAAACTGAAAGATGTTGGTATTTGGTGGTTTGGTGATGTTATTCTGGGTAATGAATGTTTTCTGGATAGCATGAATAAAAGCAAAGAACATGGTTTTCTGGGTTTTCGTAATAGCAAAAATGCATTTATTAGCTGGATTGATAAAGCAAAAGCATATAAAATTGTTCCG
优选地,所述甾体5β还原酶变体催化底物所获得的产物5位的氢构象为β型。此还原酶具有空间加氢特异性,产物中无5α型氢结构。
本发明还提供了编码所述甾体5β还原酶变体的分离的多核苷酸、包含所述多核苷酸的表达载体以及包含所述载体的宿主细胞。优选地,所述多核苷酸序列如SEQ ID NO:2、SEQ ID NO:4或SEQ ID NO:6所示。
优选地,所述表达载体为真核或原核生物的染色体。优选地,所述表达载体选自真核表达载体或原核表达载体。优选地,所述表达载体为质粒,优选为pcDNA3.1或pET26b(+)。
所述宿主细胞选自真核细胞或原核细胞。
更优选地,所述真核细胞是真菌细胞,更进一步优选为酵母菌。
更优选地,所述原核细胞选自大肠杆菌、分枝杆菌、假单胞菌、红球菌、节杆菌、枯草杆菌或放线菌细胞;更进一步优选为大肠杆菌T7 express或BL21(DE3)细胞。
本发明还提供了一种甾体5β还原酶组合物,其包含所述甾体5β还原酶变体。
本发明还提供了一种制备所述甾体5β还原酶变体的方法,所述方法包括以下步骤:
(1)在有助于生产所述甾体5β还原酶变体的条件下培养含有表达载体的宿主细胞,以及
(2)从得到的培养液中获得所述的甾体5β还原酶变体。
本发明还提供了所述甾体5β还原酶变体、所述多核苷酸、所述表达载体、所述宿主细胞或所述甾体5β还原酶组合物在制备5β-氢的甾体中的用途。优选地,所述5β-氢的甾体为5β-孕甾烷-3,20-二酮、20-羟甲基-5β-孕甾-3-酮(5β-PHM)或胆酸类化合物。更优选地,所述胆酸类化合物为具有脱氧胆酸骨架结构的化合物,例如脱氧胆酸。
本发明还提供了一种制备5β-氢的甾体的方法,所述方法包括使用所述甾体5β还原酶变体、所述多核苷酸、所述表达载体、所述宿主细胞或所述甾体5β还原酶组合物制备5β-氢的甾体。优选地,所述5β-氢的甾体为5β-孕甾烷-3,20-二酮、20-羟甲基-5β-孕甾-3-酮或胆酸类化合物。更优选地,所述胆酸类化合物为具有脱氧胆酸骨架结构的化合物,例如脱氧胆酸。
本发明的有益效果包括:
(1)本发明获得的以NADH为辅酶的甾体5β还原酶变体相较于利用NADPH为辅酶的甾体5β还原酶具有高效、稳定的特点,能够大大降低实际工业生产的成本;
(2)本发明的方法为微生物转化法,反应条件温和,对环境没有污染;
(3)应用本发明的甾体5β还原酶变体进行转化,生产成本低,工艺简单,经济效益可观;
(4)本发明的甾体5β还原酶变体可以以NADH为辅酶转化黄体酮生成5β-孕甾烷-3,20-二酮,或将20-羟甲基孕甾-3-酮(PHM)转化为20-羟甲基-5β-孕甾-3-酮(5β-PHM),且转化产物构象单一,只有β型还原产物出现,此还原酶具有空间加氢特异性,产物中无α型结构。
附图说明
图1显示了TLC法检测CmP5βR及CmP5βR-LY利用NADH转化黄体酮的产物点图,其中,“–”:CmP5βR以NADH为辅酶反应4h的产物情况;“+”:CmP5βR-LY以NADH为辅酶反应4h的产物情况;“5β”:5β-孕甾烷-3,20-二酮标准品(Rf=0.5);“Pro”:黄体酮标准品(Rf=0.4)。
图2显示了GC分析5α-孕甾烷-3,20-二酮标准品、5β-孕甾烷-3,20-二酮标准品和黄体酮标准品混合峰图,其中,1.5β-孕甾烷-3,20-二酮标准品,tR=25.106min;2.5α-孕甾烷-3,20-二酮标准品,tR=25.741min;3.黄体酮标准品,tR=26.776min。
图3显示了GC检测CmP5βR-LY利用NADH为辅酶,特异性转化黄体酮为5β-孕甾烷-3,20-二酮所获得的样品中各物质的色谱图,其中,1.产物峰5β-孕甾烷-3,20-二酮,tR=25.144min;3.底物峰黄体酮,tR=26.780min。
图4-1显示了CmP5βR-LY以NADH为辅酶转化黄体酮所得产物的质谱图。
图4-2显示了CmP5βR-LY利用NADH转化黄体酮的产物的1H谱图。
图4-3显示了CmP5βR-LY利用NADH转化黄体酮的产物的13C谱图。
图5显示了TLC法检测CmP5βR及CmP5βR-LY利用NADH转化20-羟甲基孕甾-3-酮(PHM)的产物点图,其中,“1”:20-羟甲基孕甾-3-酮(PHM)底物标准品(Rf=0.5);“2”:CmP5βR以NADH为辅酶反应4h的产物情况;“3”:CmP5βR-LY以NADH为辅酶反应4h的产物情况。
图6-1显示了CmP5βR-LY以NADH为辅酶转化PHM所得产物的质谱图。
图6-2显示了CmP5βR-LY利用NADH转化PHM的产物的1H谱图。
图6-3显示了CmP5βR-LY利用NADH转化PHM的产物的13C谱图。
具体实施方式
以下参照具体的实施例来说明本发明。本领域技术人员能够理解,这些实施例仅用于说明本发明,其不以任何方式限制本发明的范围。
下述实施例中的实验方法,如无特殊说明,均为常规方法。下述实施例中所用的菌株、质粒、试剂盒等,如无特殊说明,均为市售购买产品。
实施例1甾体5β还原酶变体CmP5βR-LY基因的获取
通过对甾体5β还原酶CmP5βR的生物信息学分析发现相应的第63/64位氨基酸参与辅酶的结合,与辅酶的专一性直接相关。替代该位点的氨基酸可使得甾体5β还原酶CmP5βR利用NADH为辅酶。通过基因的突变和表达,筛选获得可利用NADH为辅酶的突变体,可催化黄体酮转化为5β-孕甾烷-3,20-二酮。
采用以下步骤获取编码甾体5β还原酶变体CmP5βR-LY的基因:
(1)甾体5β还原酶CmP5βR基因全序列DNA的获得
以CmP5βR氨基酸序列SEQ ID NO:7(GenBank:ALB78110),通过反向翻译法转换成基因的核苷酸序列SEQ ID NO:8,全合成CmP5βR基因DNA。设计CmP5βR基因引物序列SEQ IDNO:9和SEQ ID NO:10,以CmP5βR基因DNA为模板,通过PCR扩增出具有酶切位点NdeI/HindIII的CmP5βR基因的全序列DNA片段。
SEQ ID NO:7
MSWWGAGAIGAAKKKLDDDEPTQSYESVALIIGVTGIVGNSLAEILPLSDTLGGPWKVYGVARRPRPSWNADHPIDYIQCDVSNADDARSKLSPLTDVTHVFYVTWTNRESETENCEANGSMLRNVLRAVVPHAPNLRHVCLQTGTKHYLGPFTNVDGPHHDPPFTEDMPRLQIQNFYYTQEDVLFEEIKKKEGVTWSIHRPNMIFGFSPYSLMNIVGTLCVYAAICKHEGSPLMFPGSKKAWEGFMTASDADLIAEQQIWAAVDPYAKNEAFNCNNADIFKWKHLWKILAEQFGIEEYGFEEGKNLGLVEMMKGKERVWEEMVKENQLLEKKLDEVGVWWFADVILGVEGMIDSMNKSKEHGFLGFRNSNNSFISWIDKYKAFKIVP
SEQ ID NO:8
ATGAGCTGGTGGGGCGCCGGTGCGATTGGTGCGGCGAAAAAGAAACTGGACGATGACGAGCCGACCCAGAGCTACGAGAGTGTTGCGCTGATCATCGGCGTTACGGGCATCGTTGGCAACAGTCTGGCGGAAATTCTGCCACTGAGCGATACGCTGGGTGGCCCGTGGAAAGTGTATGGTGTTGCGCGTCGCCCACGTCCAAGCTGGAATGCCGACCACCCGATCGACTACATCCAGTGCGACGTGAGTAACGCCGATGATGCGCGCAGCAAACTGAGCCCGCTGACCGATGTTACCCACGTGTTTTACGTGACGTGGACCAACCGCGAAAGCGAAACGGAAAACTGCGAAGCGAACGGCAGCATGCTGCGCAATGTGCTGCGCGCCGTTGTGCCACATGCCCCAAATCTGCGCCATGTGTGTCTGCAGACCGGCACGAAACACTATCTGGGCCCGTTTACGAATGTGGATGGCCCACACCACGACCCACCGTTCACGGAAGATATGCCGCGCCTCCAGATCCAGAACTTCTACTACACCCAAGAAGATGTGCTCTTTGAGGAGATCAAGAAAAAAGAAGGCGTGACGTGGAGCATCCACCGCCCAAATATGATCTTCGGCTTCAGCCCGTACAGTCTGATGAATATCGTGGGCACGCTGTGTGTGTACGCCGCCATCTGCAAACATGAGGGTAGTCCGCTGATGTTCCCGGGCAGTAAAAAAGCGTGGGAGGGCTTCATGACCGCCAGCGATGCCGACCTCATCGCCGAACAGCAGATTTGGGCGGCCGTGGACCCGTATGCCAAGAACGAGGCGTTCAACTGCAACAACGCCGACATCTTCAAGTGGAAACACCTCTGGAAAATTCTGGCCGAGCAGTTTGGCATTGAGGAGTACGGCTTCGAAGAAGGCAAGAACCTCGGTCTGGTGGAGATGATGAAAGGCAAGGAACGCGTGTGGGAGGAGATGGTTAAGGAAAACCAGCTGCTCGAGAAAAAGCTCGACGAGGTTGGCGTTTGGTGGTTTGCGGACGTTATTCTGGGTGTTGAGGGCATGATCGACAGTATGAATAAGAGCAAGGAACACGGCTTTCTGGGCTTCCGCAACAGCAACAACAGCTTCATTAGCTGGATCGATAAGTATAAAGCCTTCAAAATTGTGCCG
SEQ ID NO:9:
CmP5βR-F AAAAACATATGAGCTGGTGGGGCGCCGGTG
SEQ ID NO:10:
CmP5βR-R AAAAAAAGCTTCGGCACAATTTTGAAGGCTT
(2)表达载体pET26b-CmP5βR的构建
将甾体5β还原酶DNA片段和pET26b(+)质粒分别用NdeI/HindIII酶切并回收,连接,化学法转化E.coli DH5α感受态细胞,并筛选重组菌,提取质粒验证成功后命名为pET26b-CmP5βR。
(3)CmP5βR基因63/64位定点饱和突变
设计CmP5βR63/64位定点饱和突变引物SEQ ID NO:11和SEQ ID NO:12,以pET26b-CmP5βR质粒DNA为模板,通过重叠延伸PCR法将甾体5β还原酶CmP5βR进行定点饱和突变。
SEQ ID NO:11:
R(63X)m CCAGCTTGGACGTGGNNNNNNCGCAACACCATACAC
SEQ ID NO:12:
F(63X)m GGTGTATGGTGTTGCGNNNNNNCCACGTCCAAGCTGG
重叠延伸PCR法的具体方法如下:
A.以pET26b-CmP5βR质粒DNA为模板,以CmP5βR-F(SEQ ID NO:9)和R(63X)m(SEQID NO:11)为引物,通过PCR扩增获得突变基因片段PCR1;
B.以pET26b-CmP5βR质粒DNA为模板,以F(63X)m(SEQ ID NO:12)和CmP5βR-R(SEQID NO:10)为引物,通过PCR扩增获得突变基因片段PCR2;
C.以PCR1和PCR2的混合物DNA为模板,以CmP5βR-F(SEQ ID NO:9)和CmP5βR-R(SEQID NO:10)为引物,通过PCR扩增获得具有酶切位点NdeI/HindIII且含有定点饱和突变基因的DNA片段PCR3;
(4)CmP5βR基因定点饱和突变株的转化子筛选及检测分析
将定点饱和突变获得的DNA片段PCR3使用限制性内切酶NdeI/HindIII酶切后,酶连于已使用限制性内切酶NdeI/HindIII酶切后的pET26b(+)质粒,电转化于E.coli T7感受态细胞,获得大量转化子。将每个转化子进行基因表达,并进行底物的转化(实施例2),产物萃取后进行TLC和GC检测和定性定量分析,筛选成功可利用NADH为辅酶的突变质粒命名为pET26b-CmP5βR-LY,获得编码甾体5β还原酶变体CmP5βR-LY的基因(SEQ ID NO:2),该变体CmP5βR-LY具有R63K/R64H突变。
其中,提取产物5β-孕甾烷-3,20-二酮的方法如下:
将产物用2倍体积乙酸乙酯萃取一次,收集有机相,真空干燥后得5β-孕甾烷-3,20-二酮样品,将该样品进行TLC检测(实施例3)及GC检测(实施例4)。
实施例2CmP5βR-LY酶以NADH为辅酶制备5辅酶孕甾烷-3,20-二酮
采用以下步骤制备5β-孕甾烷-3,20-二酮:
(1)将含有CmP5βR-LY编码基因的pET26b(+)载体重组表达质粒转化进高效表达菌株E.coli T7感受态细胞中,获得重组菌。
(2)接重组菌单菌落于LBK(LB培养基含50μg/mL硫酸卡那霉素)培养基中,37℃过夜培养。其中,LB培养基由以下组分组成:
胰蛋白胨 10g/L
酵母提取物 5g/L
氯化钠 10g/L
LB培养基配制方法如下:定量称取上述培养基成分溶于800mL去离子水中,用5mol/L NaOH调pH至7.4,用去离子水定容至1L。在121℃高压下蒸汽灭菌20min,即得。
(3)取50μL菌液转接至5mL新鲜LBK培养基中,37℃培养2h至OD(600nm)值约等于0.6。随后加入IPTG至终浓度为1mM,于22℃培养过夜20h。
(4)离心收集诱导后菌体,以Tris-HCl(pH 5.5)缓冲液洗涤两次。将细胞低温超声波裂解,得裂解液。
(5)取上述裂解液300μL,再加入辅酶NADH至终浓度为3mg/mL,加入底物黄体酮至终浓度0.2mg/mL。整个反应体系在40℃条件下反应4h后,加入2倍体积乙酸乙酯终止反应过程。
(6)涡旋萃取上述反应产物,取上清挥干溶剂,复溶即可进行TLC检测、GC检测、MS检测及NMR检测,对产物定性和定量分析。
结果显示,CmP5βR-LY基因表达的蛋白酶以黄体酮为底物,NADH为辅酶,反应生成5β-孕甾烷-3,20-二酮,底物转化率为70%。
图4显示了CmP5βR-LY利用NADH转化黄体酮的产物的MS、NMR-1H、NMR-13C谱图。
由此可证,甾体5β还原酶变体CmP5βR-LY可以以NADH为辅酶转化黄体酮生成5β-孕甾烷-3,20-二酮,且转化产物构象单一,只有β型还原产物出现,此还原酶具有空间加氢特异性,产物中无α型结构。
实施例3TLC法检测生物转化产物
将对照品黄体酮和5β-孕甾烷-3,20-二酮标准品及生物转化获得的5β-孕甾烷-3,20-二酮样品点样于薄层层析板上,于石油醚:乙酸乙酯(3:1,V/V)的展开剂中展开30min。
将展开后的薄层层析板喷淋茴香醛试剂,于200℃高温喷烤显色。
如图1所示,“–”为CmP5βR基因以NADH为辅酶反应4h的产物情况;“+”为CmP5βR-LY以NADH为辅酶反应4h的产物情况;“5β”为5β-孕甾烷-3,20-二酮标准品(Rf=0.5);“Pro”为黄体酮标准品(Rf=0.4)。
由图1可知,CmP5βR基因不能以NADH为辅酶进行生物转化,CmP5βR-LY基因能以NADH为辅酶进行生物转化。
实施例4GC法检测生物转化产物
固体样品的制备方法:将生物转化获得的5β-孕甾烷-3,20-二酮样品用乙酸乙酯溶解,配制1mg/mL的溶液,并通过0.22μm的有机膜过滤除杂,滤液利用气相色谱法分析产物的含量。
GC分析条件包括:
仪器:Agilent Technologies 7890D
色谱柱为Agilent HP-5;检测器为FID检测器;进样量1μl,进样口温度为220℃;检测器温度为300℃;
采用程序升温:初温150℃,4min,升温速率5L/min,升到280℃,5min。
取样品1μl,注入气相色谱仪,记录色谱图;另取黄体酮、5α-孕甾烷-3,20-二酮和5β-孕甾烷-3,20-二酮标准品25mg,精密称定,置25ml容量瓶中,加乙酸乙酯溶解并稀释至刻度,摇匀,作为对照品溶液,同法测定,按外标法以峰面积计算出供试品中黄体酮、5α-孕甾烷-3,20-二酮标准品、5β-孕甾烷-3,20-二酮的浓度。
计算公式如下:
Ax为供试品中黄体酮(5α-孕甾烷-3,20-二酮标准品,5β-孕甾烷-3,20-二酮)峰面积,
Ar为对照品中黄体酮(5α-孕甾烷-3,20-二酮标准品,5β-孕甾烷-3,20-二酮)峰面积,
Cr为对照品中黄体酮(5α-孕甾烷-3,20-二酮标准品,5β-孕甾烷-3,20-二酮)的浓度(mg/mL),
进样要求:含量测定的样品,单样双针,平行测定。
利用各物质的出峰时间不同,对比黄体酮标准品、5α-孕甾烷-3,20-二酮标准品及5β-孕甾烷-3,20-二酮标准品来检测产物的生成。
图2显示了GC分析5α-孕甾烷-3,20-二酮标准品、5β-孕甾烷-3,20-二酮标准品和黄体酮标准品混合峰图,其中,1.5β-孕甾烷-3,20-二酮标准品,tR=25.106min;2.5α-孕甾烷-3,20-二酮标准品,tR=25.741min;3.黄体酮标准品,tR=26.776min。
图3显示了实施例1中CmP5βR-LY突变体利用NADH转化黄体酮生成5β-孕甾烷-3,20-二酮所获得的样品中各物质的出峰时间。其中,1:产物5β-孕甾烷-3,20-二酮,保留时间T=25.144min;3:底物黄体酮,保留时间T=26.780min。
由图3显示的各物质出峰时间可见,CmP5βR-LY突变体以NADH为辅酶,将黄体酮转化成5β-孕甾烷-3,20-二酮,而不转化为5α-孕甾烷-3,20-二酮,该CmP5βR-LY突变体具有空间加氢特异性。
实施例5_AtP5β5基因及DlP5及R基因产物的辅酶特异性转换
两个与CmP5βR同源的蛋白序列,AtP5βR和DlP5βR,与CmP5βR的同源性分别为94%和69%。通过与CmP5βR的序列比对,在AtP5βR和DlP5βR序列里找到了与CmP5βR的R63/R64相应氨基酸。
表2甾体5β还原酶的结合口袋和催化位点的序列分析
其中,AtP5βR(EF579963)蛋白的氨基酸序列如SEQ ID NO:13所示。DlP5βR(AY585867)蛋白的氨基酸序列如SEQ ID NO:14所示。AtP5βR与本发明的甾体5β还原酶CmP5βR具有94%的氨基酸同一性,是高度同源的蛋白。DlP5βR与本发明的甾体5β还原酶CmP5βR具有69%的氨基酸同一性,是低度同源的蛋白。
合成基因,将AtP5βR的反转录序列SEQ ID NO:19和DlP5βR的反转录序列SEQ IDNO:20进行全基因合成。设计AtP5βR基因引物SEQ ID NO:15和SEQ ID NO:16及DlP5βR基因引物SEQ ID NO:17和SEQ ID NO:18,以AtP5βR基因、DlP5βR基因DNA为模板,通过PCR扩增出具有酶切位点NdeI/HindIII的AtP5βR基因的全序列DNA片段和DlP5βR基因的全序列DNA片段,并克隆到表达质粒的NdeI/HindIII位点。
上述序列如下所示。
SEQ ID NO:13
MSWWWAGAIGAAKKKLDEDEPSQSFESVALIIGVTGIVGNSLAEILPLSDTPGGPWKVYGVARRPRPTWNADHPIDYIQCDVSDAEDTRSKLSPLTDVTHVFYVTWTNRESESENCEANGSMLRNVLQAIIPYAPNLRHVCLQTGTKHYLGPFTNVDGPRHDPPFTEDMPRLQIQNFYYTQEDILFEEIKKIETVTWSIHRPNMIFGFSPYSLMNIVGTLCVYAAICKHEGSPLLFPGSKKAWEGFMTASDADLIAEQQIWAAVDPYAKNEAFNCNNADIFKWKHLWKILAEQFGIEEYGFEEGKNLGLVEMMKGKERVWEEMVKENQLQEKKLEEVGVWWFADVILGVEGMIDSMNKSKEYGFLGFRNSNNSFISWIDKYKAFKIVP
SEQ ID NO:14
MSWWWAGAIGAAKKRLEEDDAQPKHSSVALIVGVTGIIGNSLAEILPLADTPGGPWKVYGVARRTRPAWHEDNPINYVQCDISDPDDSQAKLSPLTDVTHVFYVTWANRSTEQENCEANSKMFRNVLDAVIPNCPNLKHISLQTGRKHYMGPFESYGKIESHDPPYTEDLPRLKYMNFYYDLEDIMLEEVEKKEGLTWSVHRPGNIFGFSPYSMMNLVGTLCVYAAICKHEGKVLRFTGCKAAWDGYSDCSDADLIAEHHIWAAVDPYAKNEAFNVSNGDVFKWKHFWKVLAEQFGVGCGEYEEGVDLKLQDLMKGKEPVWEEIVRENGLTPTKLKDVGIWWFGDVILGNECFLDSMNKSKEHGFLGFRNSKNAFISWIDKAKAYKIVP
SEQ ID NO:15
AtP5βR-F AAAAACATATGAGCTGGTGGTGGGCAGGTG
SEQ ID NO:16
AtP5βR-R AAAAAAAGCTTAGGCACAATCTTAAATG
SEQ ID NO:17
DlP5βR-F AAAAACATATGAGCTGGTGGTGGGCCGGG
SEQ ID NO:18
DlP5βR-R AAAAAAAGCTTCGGAACAATTTTATATGC
SEQ ID NO:19
ATGAGCTGGTGGTGGGCAGGTGCAATTGGTGCCGCCAAGAAGAAGCTGGATGAGGATGAACCGAGCCAGAGCTTTGAAAGCGTGGCCCTGATCATCGGTGTGACCGGCATCGTTGGCAATAGCCTGGCCGAAATCCTGCCGCTGAGCGATACCCCTGGTGGTCCGTGGAAAGTTTATGGTGTGGCACGTCGCCCTCGTCCGACCTGGAACGCAGATCACCCGATTGACTACATCCAATGCGACGTGAGCGATGCAGAAGACACCCGTAGCAAACTGAGCCCGCTGACAGATGTGACCCACGTGTTCTACGTGACCTGGACCAACCGTGAAAGCGAGAGCGAAAATTGTGAGGCCAACGGCAGCATGCTGCGCAATGTGCTGCAGGCAATTATCCCGTACGCACCGAATCTGCGTCACGTGTGTCTGCAGACAGGCACCAAGCATTACCTGGGCCCGTTTACCAACGTTGATGGCCCTCGCCATGATCCTCCGTTTACCGAGGACATGCCGCGCCTGCAGATCCAGAATTTCTACTACACCCAAGAAGATATTCTGTTTGAAGAAATCAAAAAGATCGAAACCGTGACCTGGAGCATCCACCGCCCGAACATGATCTTTGGCTTCAGCCCGTATAGCCTGATGAACATCGTGGGCACACTGTGCGTGTACGCAGCCATCTGCAAGCACGAAGGTAGCCCGCTGCTGTTTCCGGGTAGCAAGAAAGCCTGGGAGGGCTTTATGACAGCAAGCGATGCCGACCTGATTGCCGAACAGCAGATTTGGGCCGCCGTGGATCCGTATGCCAAAAACGAGGCCTTCAACTGCAATAACGCCGATATTTTTAAATGGAAACATCTGTGGAAAATCCTGGCCGAGCAGTTTGGCATCGAAGAATACGGCTTCGAAGAAGGCAAGAACCTGGGCCTGGTTGAGATGATGAAAGGCAAGGAGCGCGTGTGGGAAGAAATGGTTAAGGAGAACCAGCTGCAGGAGAAAAAGCTGGAGGAAGTGGGTGTGTGGTGGTTCGCCGATGTGATCCTGGGCGTTGAAGGCATGATCGATAGTATGAATAAAAGCAAGGAATATGGCTTCCTGGGCTTTCGCAACAGCAACAACAGCTTTATTAGCTGGATTGATAAATATAAAGCATTTAAGATTGTGCCT
SEQ ID NO:20
ATGAGCTGGTGGTGGGCCGGGGCGATTGGTGCCGCAAAAAAACGTCTGGAAGAAGATGATGCACAGCCGAAACATAGCAGCGTTGCACTGATTGTTGGTGTTACCGGTATTATTGGTAATAGCCTGGCAGAAATTCTGCCGCTGGCAGATACCCCGGGTGGTCCGTGGAAAGTTTATGGTGTTGCACGTCGTACCCGTCCGGCATGGCATGAAGATAATCCGATTAATTATGTTCAGTGTGATATTAGCGATCCGGATGATAGCCAGGCAAAACTGAGCCCGCTGACCGATGTTACCCATGTTTTTTATGTTACCTGGGCAAATCGTAGCACCGAACAGGAAAATTGTGAAGCAAATAGCAAAATGTTTCGTAATGTTCTGGATGCAGTTATTCCGAATTGTCCGAATCTGAAACATATTAGCCTGCAGACCGGTCGTAAACATTATATGGGTCCGTTTGAAAGCTATGGTAAAATTGAAAGCCATGATCCGCCGTATACCGAAGATCTGCCGCGTCTGAAATATATGAATTTTTATTATGATCTGGAAGATATTATGCTGGAAGAAGTTGAAAAAAAAGAAGGTCTGACCTGGAGCGTTCATCGTCCGGGTAATATTTTTGGTTTTAGCCCGTATAGCATGATGAATCTGGTTGGTACCCTGTGTGTTTATGCAGCAATTTGTAAACATGAAGGTAAAGTTCTGCGTTTTACCGGTTGTAAAGCAGCATGGGATGGTTATAGCGATTGTAGCGATGCAGATCTGATTGCAGAACATCATATTTGGGCAGCAGTTGATCCGTATGCAAAAAATGAAGCATTTAATGTTAGCAATGGTGATGTTTTTAAATGGAAACATTTTTGGAAAGTTCTGGCAGAACAGTTTGGTGTTGGTTGTGGTGAATATGAAGAAGGTGTTGATCTGAAACTGCAGGATCTGATGAAAGGTAAAGAACCGGTTTGGGAAGAAATTGTTCGTGAAAATGGTCTGACCCCGACCAAACTGAAAGATGTTGGTATTTGGTGGTTTGGTGATGTTATTCTGGGTAATGAATGTTTTCTGGATAGCATGAATAAAAGCAAAGAACATGGTTTTCTGGGTTTTCGTAATAGCAAAAATGCATTTATTAGCTGGATTGATAAAGCAAAAGCATATAAAATTGTTCCG
按照与实施例1相似的方法将AtP5βR基因的全序列DNA片段和DlP5βR基因的全序列DNA片段进行定点突变,基因分别命名为AtP5βR-LY基因(其编码的甾体5β还原酶变体AtP5βR-LY的氨基酸序列如SEQ ID NO:3所示,其多核苷酸序列如SEQ ID NO:4所示)和DlP5βR-LY基因(其编码的甾体5β还原酶变体DlP5βR-LY的氨基酸序列如SEQ ID NO:5所示,其多核苷酸序列如SEQ ID NO:6所示)。突变体AtP5βR-LY和DlP5βR-LY序列均具有R63K/R64H氨基酸转换。
将AtP5βR基因、AtP5βR-LY基因、DlP5βR基因及DlP5βR-LY基因表达的蛋白,利用NADH为辅酶催化底物黄体酮进行反应,产物送至TLC和GC检测,结果见下表:
表3表达蛋白利用NADH转化产物积累统计表
结果显示,AtP5βR-LY基因、DlP5βR-LY基因表达的蛋白酶以黄体酮为底物,NADH为辅酶,反应生成5β-孕甾烷-3,20-二酮,且转化产物构象单一,只有β型还原产物出现,无α型结构,此还原酶具有空间加氢特异性;AtP5βR基因、DlP5βR基因表达的蛋白酶以黄体酮为底物,NADH为辅酶,无产物生成,其不可利用NADH为辅酶催化底物生成产物。这表明无论是具有与本发明的甾体5β还原酶CmP5βR高度同源性的甾体5β还原酶AtP5βR,还是具有与本发明的甾体5β还原酶CmP5βR低度同源性的甾体5β还原酶DlP5βR,在进行定点突变R63K/R64H后,均可利用NADH为辅酶,可见本发明R63K/R64H突变可成功改变甾体5β还原酶的辅酶专一性。
实施例6CmP5β5及CmP5例R-LY酶以NADH为辅酶转化20-羟甲基孕甾-3-酮(PHM)
采用以下步骤转化PHM:
(1)以实施例2的方法,获得CmP5βR及CmP5βR-LY基因表达蛋白的裂解液。
(2)取上述裂解液300μL,再加入辅酶NADH至终浓度为3mg/mL,加入底物PHM至终浓度0.2mg/mL。整个反应体系在40℃条件下反应4h后,加入2倍体积乙酸乙酯终止反应过程。
(3)涡旋萃取上述反应产物,取上清挥干溶剂,复溶即可进行TLC检测、MS检测及NMR检测,对产物定性和定量分析。
图5显示了TLC法检测CmP5βR及CmP5βR-LY利用NADH转化20-羟甲基孕甾-3-酮(PHM)的产物点图,其中,“1”:PHM底物标准品(Rf=0.5);“2”:CmP5βR以NADH为辅酶反应4h的产物情况;“3”:CmP5βR-LY以NADH为辅酶反应4h的产物情况。
图6显示了CmP5βR-LY利用NADH转化PHM的产物的MS、NMR-1H、NMR-13C谱图。
由此可证明,甾体5β还原酶变体CmP5βR-LY可以以NADH为辅酶转化PHM生成5β-PHM,且转化产物构象单一,只有β型还原产物出现,此还原酶具有空间加氢特异性,产物中无α型结构。
以上对本发明具体实施方式的描述并不限制本发明,本领域技术人员可以根据本发明作出各种改变或变形,只要不脱离本发明的精神,均应属于本发明所附权利要求的范围。
序列表
<110> 沈阳博泰生物制药有限公司
<120> 一种甾体5β还原酶变体及其用途
<130> DIC18110079R
<150> 2019111284283
<151> 2019-11-18
<160> 20
<170> SIPOSequenceListing 1.0
<210> 1
<211> 388
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 1
Met Ser Trp Trp Gly Ala Gly Ala Ile Gly Ala Ala Lys Lys Lys Leu
1 5 10 15
Asp Asp Asp Glu Pro Thr Gln Ser Tyr Glu Ser Val Ala Leu Ile Ile
20 25 30
Gly Val Thr Gly Ile Val Gly Asn Ser Leu Ala Glu Ile Leu Pro Leu
35 40 45
Ser Asp Thr Leu Gly Gly Pro Trp Lys Val Tyr Gly Val Ala Lys His
50 55 60
Pro Arg Pro Ser Trp Asn Ala Asp His Pro Ile Asp Tyr Ile Gln Cys
65 70 75 80
Asp Val Ser Asn Ala Asp Asp Ala Arg Ser Lys Leu Ser Pro Leu Thr
85 90 95
Asp Val Thr His Val Phe Tyr Val Thr Trp Thr Asn Arg Glu Ser Glu
100 105 110
Thr Glu Asn Cys Glu Ala Asn Gly Ser Met Leu Arg Asn Val Leu Arg
115 120 125
Ala Val Val Pro His Ala Pro Asn Leu Arg His Val Cys Leu Gln Thr
130 135 140
Gly Thr Lys His Tyr Leu Gly Pro Phe Thr Asn Val Asp Gly Pro His
145 150 155 160
His Asp Pro Pro Phe Thr Glu Asp Met Pro Arg Leu Gln Ile Gln Asn
165 170 175
Phe Tyr Tyr Thr Gln Glu Asp Val Leu Phe Glu Glu Ile Lys Lys Lys
180 185 190
Glu Gly Val Thr Trp Ser Ile His Arg Pro Asn Met Ile Phe Gly Phe
195 200 205
Ser Pro Tyr Ser Leu Met Asn Ile Val Gly Thr Leu Cys Val Tyr Ala
210 215 220
Ala Ile Cys Lys His Glu Gly Ser Pro Leu Met Phe Pro Gly Ser Lys
225 230 235 240
Lys Ala Trp Glu Gly Phe Met Thr Ala Ser Asp Ala Asp Leu Ile Ala
245 250 255
Glu Gln Gln Ile Trp Ala Ala Val Asp Pro Tyr Ala Lys Asn Glu Ala
260 265 270
Phe Asn Cys Asn Asn Ala Asp Ile Phe Lys Trp Lys His Leu Trp Lys
275 280 285
Ile Leu Ala Glu Gln Phe Gly Ile Glu Glu Tyr Gly Phe Glu Glu Gly
290 295 300
Lys Asn Leu Gly Leu Val Glu Met Met Lys Gly Lys Glu Arg Val Trp
305 310 315 320
Glu Glu Met Val Lys Glu Asn Gln Leu Leu Glu Lys Lys Leu Asp Glu
325 330 335
Val Gly Val Trp Trp Phe Ala Asp Val Ile Leu Gly Val Glu Gly Met
340 345 350
Ile Asp Ser Met Asn Lys Ser Lys Glu His Gly Phe Leu Gly Phe Arg
355 360 365
Asn Ser Asn Asn Ser Phe Ile Ser Trp Ile Asp Lys Tyr Lys Ala Phe
370 375 380
Lys Ile Val Pro
385
<210> 2
<211> 1164
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 2
atgagctggt ggggcgccgg tgcgattggt gcggcgaaaa agaaactgga cgatgacgag 60
ccgacccaga gctacgagag tgttgcgctg atcatcggcg ttacgggcat cgttggcaac 120
agtctggcgg aaattctgcc actgagcgat acgctgggtg gcccgtggaa agtgtatggt 180
gttgcgaaac atccacgtcc aagctggaat gccgaccacc cgatcgacta catccagtgc 240
gacgtgagta acgccgatga tgcgcgcagc aaactgagcc cgctgaccga tgttacccac 300
gtgttttacg tgacgtggac caaccgcgaa agcgaaacgg aaaactgcga agcgaacggc 360
agcatgctgc gcaatgtgct gcgcgccgtt gtgccacatg ccccaaatct gcgccatgtg 420
tgtctgcaga ccggcacgaa acactatctg ggcccgttta cgaatgtgga tggcccacac 480
cacgacccac cgttcacgga agatatgccg cgcctccaga tccagaactt ctactacacc 540
caagaagatg tgctctttga ggagatcaag aaaaaagaag gcgtgacgtg gagcatccac 600
cgcccaaata tgatcttcgg cttcagcccg tacagtctga tgaatatcgt gggcacgctg 660
tgtgtgtacg ccgccatctg caaacatgag ggtagtccgc tgatgttccc gggcagtaaa 720
aaagcgtggg agggcttcat gaccgccagc gatgccgacc tcatcgccga acagcagatt 780
tgggcggccg tggacccgta tgccaagaac gaggcgttca actgcaacaa cgccgacatc 840
ttcaagtgga aacacctctg gaaaattctg gccgagcagt ttggcattga ggagtacggc 900
ttcgaagaag gcaagaacct cggtctggtg gagatgatga aaggcaagga acgcgtgtgg 960
gaggagatgg ttaaggaaaa ccagctgctc gagaaaaagc tcgacgaggt tggcgtttgg 1020
tggtttgcgg acgttattct gggtgttgag ggcatgatcg acagtatgaa taagagcaag 1080
gaacacggct ttctgggctt ccgcaacagc aacaacagct tcattagctg gatcgataag 1140
tataaagcct tcaaaattgt gccg 1164
<210> 3
<211> 388
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 3
Met Ser Trp Trp Trp Ala Gly Ala Ile Gly Ala Ala Lys Lys Lys Leu
1 5 10 15
Asp Glu Asp Glu Pro Ser Gln Ser Phe Glu Ser Val Ala Leu Ile Ile
20 25 30
Gly Val Thr Gly Ile Val Gly Asn Ser Leu Ala Glu Ile Leu Pro Leu
35 40 45
Ser Asp Thr Pro Gly Gly Pro Trp Lys Val Tyr Gly Val Ala Lys His
50 55 60
Pro Arg Pro Thr Trp Asn Ala Asp His Pro Ile Asp Tyr Ile Gln Cys
65 70 75 80
Asp Val Ser Asp Ala Glu Asp Thr Arg Ser Lys Leu Ser Pro Leu Thr
85 90 95
Asp Val Thr His Val Phe Tyr Val Thr Trp Thr Asn Arg Glu Ser Glu
100 105 110
Ser Glu Asn Cys Glu Ala Asn Gly Ser Met Leu Arg Asn Val Leu Gln
115 120 125
Ala Ile Ile Pro Tyr Ala Pro Asn Leu Arg His Val Cys Leu Gln Thr
130 135 140
Gly Thr Lys His Tyr Leu Gly Pro Phe Thr Asn Val Asp Gly Pro Arg
145 150 155 160
His Asp Pro Pro Phe Thr Glu Asp Met Pro Arg Leu Gln Ile Gln Asn
165 170 175
Phe Tyr Tyr Thr Gln Glu Asp Ile Leu Phe Glu Glu Ile Lys Lys Ile
180 185 190
Glu Thr Val Thr Trp Ser Ile His Arg Pro Asn Met Ile Phe Gly Phe
195 200 205
Ser Pro Tyr Ser Leu Met Asn Ile Val Gly Thr Leu Cys Val Tyr Ala
210 215 220
Ala Ile Cys Lys His Glu Gly Ser Pro Leu Leu Phe Pro Gly Ser Lys
225 230 235 240
Lys Ala Trp Glu Gly Phe Met Thr Ala Ser Asp Ala Asp Leu Ile Ala
245 250 255
Glu Gln Gln Ile Trp Ala Ala Val Asp Pro Tyr Ala Lys Asn Glu Ala
260 265 270
Phe Asn Cys Asn Asn Ala Asp Ile Phe Lys Trp Lys His Leu Trp Lys
275 280 285
Ile Leu Ala Glu Gln Phe Gly Ile Glu Glu Tyr Gly Phe Glu Glu Gly
290 295 300
Lys Asn Leu Gly Leu Val Glu Met Met Lys Gly Lys Glu Arg Val Trp
305 310 315 320
Glu Glu Met Val Lys Glu Asn Gln Leu Gln Glu Lys Lys Leu Glu Glu
325 330 335
Val Gly Val Trp Trp Phe Ala Asp Val Ile Leu Gly Val Glu Gly Met
340 345 350
Ile Asp Ser Met Asn Lys Ser Lys Glu Tyr Gly Phe Leu Gly Phe Arg
355 360 365
Asn Ser Asn Asn Ser Phe Ile Ser Trp Ile Asp Lys Tyr Lys Ala Phe
370 375 380
Lys Ile Val Pro
385
<210> 4
<211> 1164
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 4
atgagctggt ggtgggcagg tgcaattggt gccgccaaga agaagctgga tgaggatgaa 60
ccgagccaga gctttgaaag cgtggccctg atcatcggtg tgaccggcat cgttggcaat 120
agcctggccg aaatcctgcc gctgagcgat acccctggtg gtccgtggaa agtttatggt 180
gtggcaaaac atcctcgtcc gacctggaac gcagatcacc cgattgacta catccaatgc 240
gacgtgagcg atgcagaaga cacccgtagc aaactgagcc cgctgacaga tgtgacccac 300
gtgttctacg tgacctggac caaccgtgaa agcgagagcg aaaattgtga ggccaacggc 360
agcatgctgc gcaatgtgct gcaggcaatt atcccgtacg caccgaatct gcgtcacgtg 420
tgtctgcaga caggcaccaa gcattacctg ggcccgttta ccaacgttga tggccctcgc 480
catgatcctc cgtttaccga ggacatgccg cgcctgcaga tccagaattt ctactacacc 540
caagaagata ttctgtttga agaaatcaaa aagatcgaaa ccgtgacctg gagcatccac 600
cgcccgaaca tgatctttgg cttcagcccg tatagcctga tgaacatcgt gggcacactg 660
tgcgtgtacg cagccatctg caagcacgaa ggtagcccgc tgctgtttcc gggtagcaag 720
aaagcctggg agggctttat gacagcaagc gatgccgacc tgattgccga acagcagatt 780
tgggccgccg tggatccgta tgccaaaaac gaggccttca actgcaataa cgccgatatt 840
tttaaatgga aacatctgtg gaaaatcctg gccgagcagt ttggcatcga agaatacggc 900
ttcgaagaag gcaagaacct gggcctggtt gagatgatga aaggcaagga gcgcgtgtgg 960
gaagaaatgg ttaaggagaa ccagctgcag gagaaaaagc tggaggaagt gggtgtgtgg 1020
tggttcgccg atgtgatcct gggcgttgaa ggcatgatcg atagtatgaa taaaagcaag 1080
gaatatggct tcctgggctt tcgcaacagc aacaacagct ttattagctg gattgataaa 1140
tataaagcat ttaagattgt gcct 1164
<210> 5
<211> 389
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 5
Met Ser Trp Trp Trp Ala Gly Ala Ile Gly Ala Ala Lys Lys Arg Leu
1 5 10 15
Glu Glu Asp Asp Ala Gln Pro Lys His Ser Ser Val Ala Leu Ile Val
20 25 30
Gly Val Thr Gly Ile Ile Gly Asn Ser Leu Ala Glu Ile Leu Pro Leu
35 40 45
Ala Asp Thr Pro Gly Gly Pro Trp Lys Val Tyr Gly Val Ala Lys His
50 55 60
Thr Arg Pro Ala Trp His Glu Asp Asn Pro Ile Asn Tyr Val Gln Cys
65 70 75 80
Asp Ile Ser Asp Pro Asp Asp Ser Gln Ala Lys Leu Ser Pro Leu Thr
85 90 95
Asp Val Thr His Val Phe Tyr Val Thr Trp Ala Asn Arg Ser Thr Glu
100 105 110
Gln Glu Asn Cys Glu Ala Asn Ser Lys Met Phe Arg Asn Val Leu Asp
115 120 125
Ala Val Ile Pro Asn Cys Pro Asn Leu Lys His Ile Ser Leu Gln Thr
130 135 140
Gly Arg Lys His Tyr Met Gly Pro Phe Glu Ser Tyr Gly Lys Ile Glu
145 150 155 160
Ser His Asp Pro Pro Tyr Thr Glu Asp Leu Pro Arg Leu Lys Tyr Met
165 170 175
Asn Phe Tyr Tyr Asp Leu Glu Asp Ile Met Leu Glu Glu Val Glu Lys
180 185 190
Lys Glu Gly Leu Thr Trp Ser Val His Arg Pro Gly Asn Ile Phe Gly
195 200 205
Phe Ser Pro Tyr Ser Met Met Asn Leu Val Gly Thr Leu Cys Val Tyr
210 215 220
Ala Ala Ile Cys Lys His Glu Gly Lys Val Leu Arg Phe Thr Gly Cys
225 230 235 240
Lys Ala Ala Trp Asp Gly Tyr Ser Asp Cys Ser Asp Ala Asp Leu Ile
245 250 255
Ala Glu His His Ile Trp Ala Ala Val Asp Pro Tyr Ala Lys Asn Glu
260 265 270
Ala Phe Asn Val Ser Asn Gly Asp Val Phe Lys Trp Lys His Phe Trp
275 280 285
Lys Val Leu Ala Glu Gln Phe Gly Val Gly Cys Gly Glu Tyr Glu Glu
290 295 300
Gly Val Asp Leu Lys Leu Gln Asp Leu Met Lys Gly Lys Glu Pro Val
305 310 315 320
Trp Glu Glu Ile Val Arg Glu Asn Gly Leu Thr Pro Thr Lys Leu Lys
325 330 335
Asp Val Gly Ile Trp Trp Phe Gly Asp Val Ile Leu Gly Asn Glu Cys
340 345 350
Phe Leu Asp Ser Met Asn Lys Ser Lys Glu His Gly Phe Leu Gly Phe
355 360 365
Arg Asn Ser Lys Asn Ala Phe Ile Ser Trp Ile Asp Lys Ala Lys Ala
370 375 380
Tyr Lys Ile Val Pro
385
<210> 6
<211> 1167
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 6
atgagctggt ggtgggccgg ggcgattggt gccgcaaaaa aacgtctgga agaagatgat 60
gcacagccga aacatagcag cgttgcactg attgttggtg ttaccggtat tattggtaat 120
agcctggcag aaattctgcc gctggcagat accccgggtg gtccgtggaa agtttatggt 180
gttgcaaaac atacccgtcc ggcatggcat gaagataatc cgattaatta tgttcagtgt 240
gatattagcg atccggatga tagccaggca aaactgagcc cgctgaccga tgttacccat 300
gttttttatg ttacctgggc aaatcgtagc accgaacagg aaaattgtga agcaaatagc 360
aaaatgtttc gtaatgttct ggatgcagtt attccgaatt gtccgaatct gaaacatatt 420
agcctgcaga ccggtcgtaa acattatatg ggtccgtttg aaagctatgg taaaattgaa 480
agccatgatc cgccgtatac cgaagatctg ccgcgtctga aatatatgaa tttttattat 540
gatctggaag atattatgct ggaagaagtt gaaaaaaaag aaggtctgac ctggagcgtt 600
catcgtccgg gtaatatttt tggttttagc ccgtatagca tgatgaatct ggttggtacc 660
ctgtgtgttt atgcagcaat ttgtaaacat gaaggtaaag ttctgcgttt taccggttgt 720
aaagcagcat gggatggtta tagcgattgt agcgatgcag atctgattgc agaacatcat 780
atttgggcag cagttgatcc gtatgcaaaa aatgaagcat ttaatgttag caatggtgat 840
gtttttaaat ggaaacattt ttggaaagtt ctggcagaac agtttggtgt tggttgtggt 900
gaatatgaag aaggtgttga tctgaaactg caggatctga tgaaaggtaa agaaccggtt 960
tgggaagaaa ttgttcgtga aaatggtctg accccgacca aactgaaaga tgttggtatt 1020
tggtggtttg gtgatgttat tctgggtaat gaatgttttc tggatagcat gaataaaagc 1080
aaagaacatg gttttctggg ttttcgtaat agcaaaaatg catttattag ctggattgat 1140
aaagcaaaag catataaaat tgttccg 1167
<210> 7
<211> 388
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 7
Met Ser Trp Trp Gly Ala Gly Ala Ile Gly Ala Ala Lys Lys Lys Leu
1 5 10 15
Asp Asp Asp Glu Pro Thr Gln Ser Tyr Glu Ser Val Ala Leu Ile Ile
20 25 30
Gly Val Thr Gly Ile Val Gly Asn Ser Leu Ala Glu Ile Leu Pro Leu
35 40 45
Ser Asp Thr Leu Gly Gly Pro Trp Lys Val Tyr Gly Val Ala Arg Arg
50 55 60
Pro Arg Pro Ser Trp Asn Ala Asp His Pro Ile Asp Tyr Ile Gln Cys
65 70 75 80
Asp Val Ser Asn Ala Asp Asp Ala Arg Ser Lys Leu Ser Pro Leu Thr
85 90 95
Asp Val Thr His Val Phe Tyr Val Thr Trp Thr Asn Arg Glu Ser Glu
100 105 110
Thr Glu Asn Cys Glu Ala Asn Gly Ser Met Leu Arg Asn Val Leu Arg
115 120 125
Ala Val Val Pro His Ala Pro Asn Leu Arg His Val Cys Leu Gln Thr
130 135 140
Gly Thr Lys His Tyr Leu Gly Pro Phe Thr Asn Val Asp Gly Pro His
145 150 155 160
His Asp Pro Pro Phe Thr Glu Asp Met Pro Arg Leu Gln Ile Gln Asn
165 170 175
Phe Tyr Tyr Thr Gln Glu Asp Val Leu Phe Glu Glu Ile Lys Lys Lys
180 185 190
Glu Gly Val Thr Trp Ser Ile His Arg Pro Asn Met Ile Phe Gly Phe
195 200 205
Ser Pro Tyr Ser Leu Met Asn Ile Val Gly Thr Leu Cys Val Tyr Ala
210 215 220
Ala Ile Cys Lys His Glu Gly Ser Pro Leu Met Phe Pro Gly Ser Lys
225 230 235 240
Lys Ala Trp Glu Gly Phe Met Thr Ala Ser Asp Ala Asp Leu Ile Ala
245 250 255
Glu Gln Gln Ile Trp Ala Ala Val Asp Pro Tyr Ala Lys Asn Glu Ala
260 265 270
Phe Asn Cys Asn Asn Ala Asp Ile Phe Lys Trp Lys His Leu Trp Lys
275 280 285
Ile Leu Ala Glu Gln Phe Gly Ile Glu Glu Tyr Gly Phe Glu Glu Gly
290 295 300
Lys Asn Leu Gly Leu Val Glu Met Met Lys Gly Lys Glu Arg Val Trp
305 310 315 320
Glu Glu Met Val Lys Glu Asn Gln Leu Leu Glu Lys Lys Leu Asp Glu
325 330 335
Val Gly Val Trp Trp Phe Ala Asp Val Ile Leu Gly Val Glu Gly Met
340 345 350
Ile Asp Ser Met Asn Lys Ser Lys Glu His Gly Phe Leu Gly Phe Arg
355 360 365
Asn Ser Asn Asn Ser Phe Ile Ser Trp Ile Asp Lys Tyr Lys Ala Phe
370 375 380
Lys Ile Val Pro
385
<210> 8
<211> 1164
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 8
atgagctggt ggggcgccgg tgcgattggt gcggcgaaaa agaaactgga cgatgacgag 60
ccgacccaga gctacgagag tgttgcgctg atcatcggcg ttacgggcat cgttggcaac 120
agtctggcgg aaattctgcc actgagcgat acgctgggtg gcccgtggaa agtgtatggt 180
gttgcgcgtc gcccacgtcc aagctggaat gccgaccacc cgatcgacta catccagtgc 240
gacgtgagta acgccgatga tgcgcgcagc aaactgagcc cgctgaccga tgttacccac 300
gtgttttacg tgacgtggac caaccgcgaa agcgaaacgg aaaactgcga agcgaacggc 360
agcatgctgc gcaatgtgct gcgcgccgtt gtgccacatg ccccaaatct gcgccatgtg 420
tgtctgcaga ccggcacgaa acactatctg ggcccgttta cgaatgtgga tggcccacac 480
cacgacccac cgttcacgga agatatgccg cgcctccaga tccagaactt ctactacacc 540
caagaagatg tgctctttga ggagatcaag aaaaaagaag gcgtgacgtg gagcatccac 600
cgcccaaata tgatcttcgg cttcagcccg tacagtctga tgaatatcgt gggcacgctg 660
tgtgtgtacg ccgccatctg caaacatgag ggtagtccgc tgatgttccc gggcagtaaa 720
aaagcgtggg agggcttcat gaccgccagc gatgccgacc tcatcgccga acagcagatt 780
tgggcggccg tggacccgta tgccaagaac gaggcgttca actgcaacaa cgccgacatc 840
ttcaagtgga aacacctctg gaaaattctg gccgagcagt ttggcattga ggagtacggc 900
ttcgaagaag gcaagaacct cggtctggtg gagatgatga aaggcaagga acgcgtgtgg 960
gaggagatgg ttaaggaaaa ccagctgctc gagaaaaagc tcgacgaggt tggcgtttgg 1020
tggtttgcgg acgttattct gggtgttgag ggcatgatcg acagtatgaa taagagcaag 1080
gaacacggct ttctgggctt ccgcaacagc aacaacagct tcattagctg gatcgataag 1140
tataaagcct tcaaaattgt gccg 1164
<210> 9
<211> 30
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 9
aaaaacatat gagctggtgg ggcgccggtg 30
<210> 10
<211> 31
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 10
aaaaaaagct tcggcacaat tttgaaggct t 31
<210> 11
<211> 36
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 11
ccagcttgga cgtggnnnnn ncgcaacacc atacac 36
<210> 12
<211> 37
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 12
ggtgtatggt gttgcgnnnn nnccacgtcc aagctgg 37
<210> 13
<211> 388
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 13
Met Ser Trp Trp Trp Ala Gly Ala Ile Gly Ala Ala Lys Lys Lys Leu
1 5 10 15
Asp Glu Asp Glu Pro Ser Gln Ser Phe Glu Ser Val Ala Leu Ile Ile
20 25 30
Gly Val Thr Gly Ile Val Gly Asn Ser Leu Ala Glu Ile Leu Pro Leu
35 40 45
Ser Asp Thr Pro Gly Gly Pro Trp Lys Val Tyr Gly Val Ala Arg Arg
50 55 60
Pro Arg Pro Thr Trp Asn Ala Asp His Pro Ile Asp Tyr Ile Gln Cys
65 70 75 80
Asp Val Ser Asp Ala Glu Asp Thr Arg Ser Lys Leu Ser Pro Leu Thr
85 90 95
Asp Val Thr His Val Phe Tyr Val Thr Trp Thr Asn Arg Glu Ser Glu
100 105 110
Ser Glu Asn Cys Glu Ala Asn Gly Ser Met Leu Arg Asn Val Leu Gln
115 120 125
Ala Ile Ile Pro Tyr Ala Pro Asn Leu Arg His Val Cys Leu Gln Thr
130 135 140
Gly Thr Lys His Tyr Leu Gly Pro Phe Thr Asn Val Asp Gly Pro Arg
145 150 155 160
His Asp Pro Pro Phe Thr Glu Asp Met Pro Arg Leu Gln Ile Gln Asn
165 170 175
Phe Tyr Tyr Thr Gln Glu Asp Ile Leu Phe Glu Glu Ile Lys Lys Ile
180 185 190
Glu Thr Val Thr Trp Ser Ile His Arg Pro Asn Met Ile Phe Gly Phe
195 200 205
Ser Pro Tyr Ser Leu Met Asn Ile Val Gly Thr Leu Cys Val Tyr Ala
210 215 220
Ala Ile Cys Lys His Glu Gly Ser Pro Leu Leu Phe Pro Gly Ser Lys
225 230 235 240
Lys Ala Trp Glu Gly Phe Met Thr Ala Ser Asp Ala Asp Leu Ile Ala
245 250 255
Glu Gln Gln Ile Trp Ala Ala Val Asp Pro Tyr Ala Lys Asn Glu Ala
260 265 270
Phe Asn Cys Asn Asn Ala Asp Ile Phe Lys Trp Lys His Leu Trp Lys
275 280 285
Ile Leu Ala Glu Gln Phe Gly Ile Glu Glu Tyr Gly Phe Glu Glu Gly
290 295 300
Lys Asn Leu Gly Leu Val Glu Met Met Lys Gly Lys Glu Arg Val Trp
305 310 315 320
Glu Glu Met Val Lys Glu Asn Gln Leu Gln Glu Lys Lys Leu Glu Glu
325 330 335
Val Gly Val Trp Trp Phe Ala Asp Val Ile Leu Gly Val Glu Gly Met
340 345 350
Ile Asp Ser Met Asn Lys Ser Lys Glu Tyr Gly Phe Leu Gly Phe Arg
355 360 365
Asn Ser Asn Asn Ser Phe Ile Ser Trp Ile Asp Lys Tyr Lys Ala Phe
370 375 380
Lys Ile Val Pro
385
<210> 14
<211> 389
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 14
Met Ser Trp Trp Trp Ala Gly Ala Ile Gly Ala Ala Lys Lys Arg Leu
1 5 10 15
Glu Glu Asp Asp Ala Gln Pro Lys His Ser Ser Val Ala Leu Ile Val
20 25 30
Gly Val Thr Gly Ile Ile Gly Asn Ser Leu Ala Glu Ile Leu Pro Leu
35 40 45
Ala Asp Thr Pro Gly Gly Pro Trp Lys Val Tyr Gly Val Ala Arg Arg
50 55 60
Thr Arg Pro Ala Trp His Glu Asp Asn Pro Ile Asn Tyr Val Gln Cys
65 70 75 80
Asp Ile Ser Asp Pro Asp Asp Ser Gln Ala Lys Leu Ser Pro Leu Thr
85 90 95
Asp Val Thr His Val Phe Tyr Val Thr Trp Ala Asn Arg Ser Thr Glu
100 105 110
Gln Glu Asn Cys Glu Ala Asn Ser Lys Met Phe Arg Asn Val Leu Asp
115 120 125
Ala Val Ile Pro Asn Cys Pro Asn Leu Lys His Ile Ser Leu Gln Thr
130 135 140
Gly Arg Lys His Tyr Met Gly Pro Phe Glu Ser Tyr Gly Lys Ile Glu
145 150 155 160
Ser His Asp Pro Pro Tyr Thr Glu Asp Leu Pro Arg Leu Lys Tyr Met
165 170 175
Asn Phe Tyr Tyr Asp Leu Glu Asp Ile Met Leu Glu Glu Val Glu Lys
180 185 190
Lys Glu Gly Leu Thr Trp Ser Val His Arg Pro Gly Asn Ile Phe Gly
195 200 205
Phe Ser Pro Tyr Ser Met Met Asn Leu Val Gly Thr Leu Cys Val Tyr
210 215 220
Ala Ala Ile Cys Lys His Glu Gly Lys Val Leu Arg Phe Thr Gly Cys
225 230 235 240
Lys Ala Ala Trp Asp Gly Tyr Ser Asp Cys Ser Asp Ala Asp Leu Ile
245 250 255
Ala Glu His His Ile Trp Ala Ala Val Asp Pro Tyr Ala Lys Asn Glu
260 265 270
Ala Phe Asn Val Ser Asn Gly Asp Val Phe Lys Trp Lys His Phe Trp
275 280 285
Lys Val Leu Ala Glu Gln Phe Gly Val Gly Cys Gly Glu Tyr Glu Glu
290 295 300
Gly Val Asp Leu Lys Leu Gln Asp Leu Met Lys Gly Lys Glu Pro Val
305 310 315 320
Trp Glu Glu Ile Val Arg Glu Asn Gly Leu Thr Pro Thr Lys Leu Lys
325 330 335
Asp Val Gly Ile Trp Trp Phe Gly Asp Val Ile Leu Gly Asn Glu Cys
340 345 350
Phe Leu Asp Ser Met Asn Lys Ser Lys Glu His Gly Phe Leu Gly Phe
355 360 365
Arg Asn Ser Lys Asn Ala Phe Ile Ser Trp Ile Asp Lys Ala Lys Ala
370 375 380
Tyr Lys Ile Val Pro
385
<210> 15
<211> 30
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 15
aaaaacatat gagctggtgg tgggcaggtg 30
<210> 16
<211> 28
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 16
aaaaaaagct taggcacaat cttaaatg 28
<210> 17
<211> 29
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 17
aaaaacatat gagctggtgg tgggccggg 29
<210> 18
<211> 29
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 18
aaaaaaagct tcggaacaat tttatatgc 29
<210> 19
<211> 1164
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 19
atgagctggt ggtgggcagg tgcaattggt gccgccaaga agaagctgga tgaggatgaa 60
ccgagccaga gctttgaaag cgtggccctg atcatcggtg tgaccggcat cgttggcaat 120
agcctggccg aaatcctgcc gctgagcgat acccctggtg gtccgtggaa agtttatggt 180
gtggcacgtc gccctcgtcc gacctggaac gcagatcacc cgattgacta catccaatgc 240
gacgtgagcg atgcagaaga cacccgtagc aaactgagcc cgctgacaga tgtgacccac 300
gtgttctacg tgacctggac caaccgtgaa agcgagagcg aaaattgtga ggccaacggc 360
agcatgctgc gcaatgtgct gcaggcaatt atcccgtacg caccgaatct gcgtcacgtg 420
tgtctgcaga caggcaccaa gcattacctg ggcccgttta ccaacgttga tggccctcgc 480
catgatcctc cgtttaccga ggacatgccg cgcctgcaga tccagaattt ctactacacc 540
caagaagata ttctgtttga agaaatcaaa aagatcgaaa ccgtgacctg gagcatccac 600
cgcccgaaca tgatctttgg cttcagcccg tatagcctga tgaacatcgt gggcacactg 660
tgcgtgtacg cagccatctg caagcacgaa ggtagcccgc tgctgtttcc gggtagcaag 720
aaagcctggg agggctttat gacagcaagc gatgccgacc tgattgccga acagcagatt 780
tgggccgccg tggatccgta tgccaaaaac gaggccttca actgcaataa cgccgatatt 840
tttaaatgga aacatctgtg gaaaatcctg gccgagcagt ttggcatcga agaatacggc 900
ttcgaagaag gcaagaacct gggcctggtt gagatgatga aaggcaagga gcgcgtgtgg 960
gaagaaatgg ttaaggagaa ccagctgcag gagaaaaagc tggaggaagt gggtgtgtgg 1020
tggttcgccg atgtgatcct gggcgttgaa ggcatgatcg atagtatgaa taaaagcaag 1080
gaatatggct tcctgggctt tcgcaacagc aacaacagct ttattagctg gattgataaa 1140
tataaagcat ttaagattgt gcct 1164
<210> 20
<211> 1167
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 20
atgagctggt ggtgggccgg ggcgattggt gccgcaaaaa aacgtctgga agaagatgat 60
gcacagccga aacatagcag cgttgcactg attgttggtg ttaccggtat tattggtaat 120
agcctggcag aaattctgcc gctggcagat accccgggtg gtccgtggaa agtttatggt 180
gttgcacgtc gtacccgtcc ggcatggcat gaagataatc cgattaatta tgttcagtgt 240
gatattagcg atccggatga tagccaggca aaactgagcc cgctgaccga tgttacccat 300
gttttttatg ttacctgggc aaatcgtagc accgaacagg aaaattgtga agcaaatagc 360
aaaatgtttc gtaatgttct ggatgcagtt attccgaatt gtccgaatct gaaacatatt 420
agcctgcaga ccggtcgtaa acattatatg ggtccgtttg aaagctatgg taaaattgaa 480
agccatgatc cgccgtatac cgaagatctg ccgcgtctga aatatatgaa tttttattat 540
gatctggaag atattatgct ggaagaagtt gaaaaaaaag aaggtctgac ctggagcgtt 600
catcgtccgg gtaatatttt tggttttagc ccgtatagca tgatgaatct ggttggtacc 660
ctgtgtgttt atgcagcaat ttgtaaacat gaaggtaaag ttctgcgttt taccggttgt 720
aaagcagcat gggatggtta tagcgattgt agcgatgcag atctgattgc agaacatcat 780
atttgggcag cagttgatcc gtatgcaaaa aatgaagcat ttaatgttag caatggtgat 840
gtttttaaat ggaaacattt ttggaaagtt ctggcagaac agtttggtgt tggttgtggt 900
gaatatgaag aaggtgttga tctgaaactg caggatctga tgaaaggtaa agaaccggtt 960
tgggaagaaa ttgttcgtga aaatggtctg accccgacca aactgaaaga tgttggtatt 1020
tggtggtttg gtgatgttat tctgggtaat gaatgttttc tggatagcat gaataaaagc 1080
aaagaacatg gttttctggg ttttcgtaat agcaaaaatg catttattag ctggattgat 1140
aaagcaaaag catataaaat tgttccg 1167
Claims (11)
1.一种具有甾体5β还原酶活性的甾体5β还原酶变体,所述甾体5β还原酶变体的氨基酸序列包括两个相邻位点的RR突变为相邻的KH。
2.根据权利要求1所述的甾体5β还原酶变体,其中所述甾体5β还原酶变体包含R63K和R64H突变,能够利用NADH为辅酶,且具有选自以下的氨基酸序列:
1)SEQ ID NO:1、SEQ ID NO:3或SEQ ID NO:5所示氨基酸序列;
2)SEQ ID NO:1、SEQ ID NO:3或SEQ ID NO:5所示氨基酸序列经过取代、缺失或添加一个或多个氨基酸而得到的氨基酸序列;或
3)与SEQ ID NO:1、SEQ ID NO:3或SEQ ID NO:5具有69%以上序列同一性的氨基酸序列;
优选地,所述甾体5β还原酶变体能够利用NADPH和NADH为辅酶。
3.根据权利要求1或2所述的甾体5β还原酶变体,其中所述甾体5β还原酶变体催化底物所获得的产物5位的氢构象为β型。
4.根据权利要求1-3中任一项所述的甾体5β还原酶变体,其中所述甾体5β还原酶变体的氨基酸序列如SEQ ID NO:1、SEQ ID NO:3或SEQ ID NO:5所示。
5.一种分离的多核苷酸,其编码权利要求1-4中任一项所述的甾体5β还原酶变体;优选地,所述多核苷酸序列如SEQ ID NO:2、SEQ ID NO:4或SEQ ID NO:6所示。
6.一种表达载体,其包含权利要求5所述的多核苷酸;优选地,所述表达载体为真核或原核生物的染色体;优选地,所述表达载体选自真核表达载体或原核表达载体;优选地,所述表达载体为质粒,优选为pcDNA3.1或pET26b(+)。
7.一种宿主细胞,其包含权利要求6所述的表达载体;
优选地,所述宿主细胞选自真核细胞或原核细胞;
更优选地,所述真核细胞是真菌细胞,更进一步优选为酵母菌;
更优选地,所述原核细胞选自大肠杆菌、分枝杆菌、假单胞菌、红球菌、节杆菌、枯草杆菌或放线菌细胞;更进一步优选为大肠杆菌T7 express或BL21(DE3)细胞。
8.一种甾体5β还原酶组合物,其包含权利要求1-4中任一项所述的甾体5β还原酶变体。
9.一种制备权利要求1-4中任一项所述的甾体5β还原酶变体的方法,所述方法包括以下步骤:
(1)在有助于生产所述甾体5β还原酶变体的条件下培养权利要求6所述的宿主细胞,以及
(2)从得到的培养液中获得所述甾体5β还原酶变体。
10.权利要求1-4中任一项所述的甾体5β还原酶变体、权利要求5所述的多核苷酸、权利要求6所述的表达载体、权利要求7所述的宿主细胞或权利要求8所述的组合物在制备5β-氢的甾体中的用途;
优选地,所述5β-氢的甾体为5β-孕甾烷-3,20-二酮、20-羟甲基-5β-孕甾-3-酮或胆酸类化合物;
更优选地,所述胆酸类化合物为具有去氧胆酸骨架结构的化合物,例如脱氧胆酸。
11.一种用于制备5β-氢的甾体的方法,其包括使用权利要求1-4中任一项所述的甾体5β还原酶变体、权利要求5所述的多核苷酸、权利要求6所述的表达载体、权利要求7所述的宿主细胞或权利要求8所述的组合物制备5β-氢的甾体;优选地,所述5β-氢的甾体为5β-孕甾烷-3,20-二酮、20-羟甲基-5β-孕甾-3-酮或胆酸类化合物;更优选地,所述胆酸类化合物为具有去氧胆酸骨架结构的化合物,例如脱氧胆酸。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911128428 | 2019-11-18 | ||
CN2019111284283 | 2019-11-18 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN112280758A true CN112280758A (zh) | 2021-01-29 |
Family
ID=74350827
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011215626.6A Pending CN112280758A (zh) | 2019-11-18 | 2020-11-04 | 一种甾体5β还原酶变体及其用途 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN112280758A (zh) |
WO (1) | WO2021098506A1 (zh) |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
FR2734839A1 (fr) * | 1995-06-01 | 1996-12-06 | Roussel Uclaf | Sequence d'adn codant pour une proteine d'a. thaliana ayant une activite delta-5,7 sterol, delta-7 reductase, proteine delta-7red, procede de production, souches de levures transformees, applications. |
CN108048416A (zh) * | 2017-12-25 | 2018-05-18 | 吉林凯莱英医药化学有限公司 | 改进的酮还原酶突变体及其制备方法和应用 |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE10233723A1 (de) * | 2002-07-24 | 2004-02-12 | Schering Ag | Mikrobiologische Verfahren zur Herstellung von 7α-substituierten 11α-Hydroxysteroiden, daraus herstellbare 7α,17α-substituierte 11β-Halogensteroide, deren Herstellungsverfahren und Verwendung sowie pharmazeutische Präparate, die diese Verbindungen enthalten, sowie daraus herstellbare 7α-substituierte Estra-1,3,5(10)-triene |
CN101565709B (zh) * | 2009-05-20 | 2010-12-29 | 华东理工大学 | 3-甾酮-9α-羟基化酶基因、3-甾酮-9α羟基化酶还原酶基因、相关载体和工程菌及应用 |
CN106434705B (zh) * | 2016-07-01 | 2019-11-26 | 浙江仙琚制药股份有限公司 | 一种酰基辅酶A-还原酶基因phsR及其应用 |
-
2020
- 2020-11-04 WO PCT/CN2020/126374 patent/WO2021098506A1/zh active Application Filing
- 2020-11-04 CN CN202011215626.6A patent/CN112280758A/zh active Pending
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
FR2734839A1 (fr) * | 1995-06-01 | 1996-12-06 | Roussel Uclaf | Sequence d'adn codant pour une proteine d'a. thaliana ayant une activite delta-5,7 sterol, delta-7 reductase, proteine delta-7red, procede de production, souches de levures transformees, applications. |
CN108048416A (zh) * | 2017-12-25 | 2018-05-18 | 吉林凯莱英医药化学有限公司 | 改进的酮还原酶突变体及其制备方法和应用 |
Also Published As
Publication number | Publication date |
---|---|
WO2021098506A1 (zh) | 2021-05-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112877307B (zh) | 一种氨基酸脱氢酶突变体及其应用 | |
CN111748537B (zh) | 一种尿苷磷酸酶突变体及其应用 | |
CN112029739A (zh) | 7β羟基类固醇脱氢酶突变体及其在制备UDCA中的应用 | |
CN113528606B (zh) | 一种酶催化制备17β-羟基类固醇的方法 | |
CN110564788A (zh) | 一种利用亚胺还原酶生产麻黄碱的方法 | |
CN111662888B (zh) | 一种具有高热稳定性的黄递酶突变体、基因及其制备方法 | |
CN109355265B (zh) | 一种羰基还原酶突变体mut-AcCR(I147V/G152L)及其应用与编码基因 | |
WO2022192688A1 (en) | Biosynthesis of mogrosides | |
CN111484961B (zh) | 一种产5α-雄烷二酮的基因工程菌及其应用 | |
Ming et al. | Engineering the activity of amine dehydrogenase in the asymmetric reductive amination of hydroxyl ketones | |
US11098287B2 (en) | 17β-hydroxysteroid dehydrogenase mutants and application thereof | |
US7402419B2 (en) | Phosphite dehydrogenase mutants for nicotinamide cofactor regeneration | |
CN107267474B (zh) | 一种二氢硫辛酰胺脱氢酶突变体蛋白及其制备方法和应用 | |
WO2024045796A1 (zh) | 一种溶剂耐受性提高的环糊精葡萄糖基转移酶及其制备 | |
CN112280758A (zh) | 一种甾体5β还原酶变体及其用途 | |
CN109468293B (zh) | 一种羰基还原酶突变体mut-AcCR(E144A/G152L)及其应用与编码基因 | |
CN110317765B (zh) | 一种高产香叶醇葡萄糖苷的大肠杆菌表达菌株及其应用 | |
CN107779459B (zh) | 葡萄糖脱氢酶dna分子、载体和菌株及应用 | |
CN112831532B (zh) | 一种酶促合成d-亮氨酸的方法 | |
Radoš et al. | Stereospecificity of Corynebacterium glutamicum 2, 3-butanediol dehydrogenase and implications for the stereochemical purity of bioproduced 2, 3-butanediol | |
CN110343728B (zh) | 一种生物转化合成六氢哒嗪-3-羧酸的方法 | |
CN113817704A (zh) | 一种有机溶剂耐受性提高的环糊精葡萄糖基转移酶及其制备方法 | |
Zhang et al. | Identification of a novel ene reductase from Pichia angusta with potential application in (R)-levodione production | |
CN114231507B (zh) | 一种胆分节杆菌胆碱氧化酶突变体及其应用 | |
CN113846082B (zh) | 一种卤醇脱卤酶突变体及其编码基因、重组载体、重组基因工程菌以及应用 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |