CN113832122A - 7β-HSDH enzyme mutant, encoding gene thereof, and application thereof - Google Patents
- Publication number
- CN113832122A (application CN202111215918.4A)
- Authority
- CN
- China
- Prior art keywords
- seq
- amino acid
- acid sequence
- gly
- beta
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 102000004190 Enzymes Human genes 0.000 title claims abstract description 139
- 108090000790 Enzymes Proteins 0.000 title claims abstract description 139
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 72
- 230000035772 mutation Effects 0.000 claims abstract description 35
- 238000006243 chemical reaction Methods 0.000 claims abstract description 22
- QYYDXDSPYPOWRO-JHMCBHKWSA-N (3r)-3-[(3r,5s,7s,8r,9s,10s,13r,14s,17r)-3,7-dihydroxy-10,13-dimethyl-2,3,4,5,6,7,8,9,11,12,14,15,16,17-tetradecahydro-1h-cyclopenta[a]phenanthren-17-yl]butanoic acid Chemical compound C([C@H]1C[C@@H]2O)[C@H](O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H]([C@@H](CC(O)=O)C)[C@@]2(C)CC1 QYYDXDSPYPOWRO-JHMCBHKWSA-N 0.000 claims abstract description 21
- 238000004128 high performance liquid chromatography Methods 0.000 claims abstract description 9
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract 48
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 claims description 50
- 235000009582 asparagine Nutrition 0.000 claims description 50
- 229960001230 asparagine Drugs 0.000 claims description 50
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 claims description 49
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 claims description 48
- 239000002773 nucleotide Substances 0.000 claims description 44
- 125000003729 nucleotide group Chemical group 0.000 claims description 44
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 claims description 30
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 claims description 18
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 claims description 17
- 235000013922 glutamic acid Nutrition 0.000 claims description 14
- 239000004220 glutamic acid Substances 0.000 claims description 14
- 239000004475 Arginine Substances 0.000 claims description 12
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 claims description 12
- 239000013604 expression vector Substances 0.000 claims description 12
- 238000000034 method Methods 0.000 claims description 11
- 239000013598 vector Substances 0.000 claims description 9
- 239000012064 sodium phosphate buffer Substances 0.000 claims description 8
- 108010050375 Glucose 1-Dehydrogenase Proteins 0.000 claims description 7
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 claims description 5
- 238000001514 detection method Methods 0.000 claims description 5
- 239000008103 glucose Substances 0.000 claims description 5
- 238000003756 stirring Methods 0.000 claims description 4
- 235000014469 Bacillus subtilis Nutrition 0.000 claims description 3
- 238000002360 preparation method Methods 0.000 claims description 3
- 244000063299 Bacillus subtilis Species 0.000 claims description 2
- 241000588724 Escherichia coli Species 0.000 claims description 2
- 241000235058 Komagataella pastoris Species 0.000 claims description 2
- 241000187747 Streptomyces Species 0.000 claims description 2
- XJLXINKUBYWONI-NNYOXOHSSA-O NADP(+) Chemical compound NC(=O)C1=CC=C[N+]([C@H]2[C@@H]([C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](OP(O)(O)=O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 XJLXINKUBYWONI-NNYOXOHSSA-O 0.000 claims 2
- 239000000758 substrate Substances 0.000 abstract description 10
- 230000003197 catalytic effect Effects 0.000 abstract description 7
- 239000011942 biocatalyst Substances 0.000 abstract description 2
- 208000012839 conversion disease Diseases 0.000 abstract description 2
- 230000002194 synthesizing effect Effects 0.000 abstract description 2
- 238000012795 verification Methods 0.000 abstract description 2
- 150000001413 amino acids Chemical group 0.000 description 76
- 241000222511 Coprinus Species 0.000 description 34
- 241001262170 Collinsella aerofaciens Species 0.000 description 33
- 108020004414 DNA Proteins 0.000 description 32
- 108010037850 glycylvaline Proteins 0.000 description 32
- 230000000694 effects Effects 0.000 description 18
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 16
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 16
- NKJBKNVQHBZUIX-ACZMJKKPSA-N Ala-Gln-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKJBKNVQHBZUIX-ACZMJKKPSA-N 0.000 description 16
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 16
- BDQNLQSWRAPHGU-DLOVCJGASA-N Ala-Phe-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N BDQNLQSWRAPHGU-DLOVCJGASA-N 0.000 description 16
- RUXQNKVQSKOOBS-JURCDPSOSA-N Ala-Phe-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RUXQNKVQSKOOBS-JURCDPSOSA-N 0.000 description 16
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 16
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 16
- SUEIIIFUBHDCCS-PBCZWWQYSA-N Asn-His-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SUEIIIFUBHDCCS-PBCZWWQYSA-N 0.000 description 16
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 description 16
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 16
- SAKCBXNPWDRWPE-BQBZGAKWSA-N Asp-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)O)N SAKCBXNPWDRWPE-BQBZGAKWSA-N 0.000 description 16
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 16
- DIHCYBRLTVEPBW-SRVKXCTJSA-N Cys-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N DIHCYBRLTVEPBW-SRVKXCTJSA-N 0.000 description 16
- OETOANMAHTWESF-KKUMJFAQSA-N Cys-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CS)N OETOANMAHTWESF-KKUMJFAQSA-N 0.000 description 16
- HUWSBFYAGXCXKC-CIUDSAMLSA-N Glu-Ala-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O HUWSBFYAGXCXKC-CIUDSAMLSA-N 0.000 description 16
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 16
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 16
- PKYAVRMYTBBRLS-FXQIFTODSA-N Glu-Cys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O PKYAVRMYTBBRLS-FXQIFTODSA-N 0.000 description 16
- FKGNJUCQKXQNRA-NRPADANISA-N Glu-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(O)=O FKGNJUCQKXQNRA-NRPADANISA-N 0.000 description 16
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 16
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 16
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 16
- YGLCLCMAYUYZSG-AVGNSLFASA-N Glu-Lys-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 YGLCLCMAYUYZSG-AVGNSLFASA-N 0.000 description 16
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 16
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 16
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 16
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 16
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 16
- DTRUBYPMMVPQPD-YUMQZZPRSA-N Gly-Gln-Arg Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DTRUBYPMMVPQPD-YUMQZZPRSA-N 0.000 description 16
- JLJLBWDKDRYOPA-RYUDHWBXSA-N Gly-Gln-Tyr Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JLJLBWDKDRYOPA-RYUDHWBXSA-N 0.000 description 16
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 16
- VTMSUKSRIKCCAD-ULQDDVLXSA-N His-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N VTMSUKSRIKCCAD-ULQDDVLXSA-N 0.000 description 16
- FVEWRQXNISSYFO-ZPFDUUQYSA-N Ile-Arg-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FVEWRQXNISSYFO-ZPFDUUQYSA-N 0.000 description 16
- GECLQMBTZCPAFY-PEFMBERDSA-N Ile-Gln-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GECLQMBTZCPAFY-PEFMBERDSA-N 0.000 description 16
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 16
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 16
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 16
- 241000880493 Leptailurus serval Species 0.000 description 16
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 16
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 16
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 16
- VVQJGYPTIYOFBR-IHRRRGAJSA-N Leu-Lys-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N VVQJGYPTIYOFBR-IHRRRGAJSA-N 0.000 description 16
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 16
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 16
- YNOVBMBQSQTLFM-DCAQKATOSA-N Met-Asn-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O YNOVBMBQSQTLFM-DCAQKATOSA-N 0.000 description 16
- OHMKUHXCDSCOMT-QXEWZRGKSA-N Met-Asn-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHMKUHXCDSCOMT-QXEWZRGKSA-N 0.000 description 16
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 16
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 16
- 108010065395 Neuropep-1 Proteins 0.000 description 16
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 16
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 16
- HBGFEEQFVBWYJQ-KBPBESRZSA-N Phe-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HBGFEEQFVBWYJQ-KBPBESRZSA-N 0.000 description 16
- LRBSWBVUCLLRLU-BZSNNMDCSA-N Phe-Leu-Lys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(O)=O LRBSWBVUCLLRLU-BZSNNMDCSA-N 0.000 description 16
- FQUUYTNBMIBOHS-IHRRRGAJSA-N Phe-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FQUUYTNBMIBOHS-IHRRRGAJSA-N 0.000 description 16
- VFDRDMOMHBJGKD-UFYCRDLUSA-N Phe-Tyr-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N VFDRDMOMHBJGKD-UFYCRDLUSA-N 0.000 description 16
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 16
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 16
- RSTWKJFWBKFOFC-JYJNAYRXSA-N Pro-Trp-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O RSTWKJFWBKFOFC-JYJNAYRXSA-N 0.000 description 16
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 16
- BQWCDDAISCPDQV-XHNCKOQMSA-N Ser-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N)C(=O)O BQWCDDAISCPDQV-XHNCKOQMSA-N 0.000 description 16
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 16
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 16
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 16
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 16
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 16
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 16
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 16
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 16
- IWAVRIPRTCJAQO-HSHDSVGOSA-N Thr-Pro-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O IWAVRIPRTCJAQO-HSHDSVGOSA-N 0.000 description 16
- PELIQFPESHBTMA-WLTAIBSBSA-N Thr-Tyr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 PELIQFPESHBTMA-WLTAIBSBSA-N 0.000 description 16
- RPVDDQYNBOVWLR-HOCLYGCPSA-N Trp-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RPVDDQYNBOVWLR-HOCLYGCPSA-N 0.000 description 16
- NLLARHRWSFNEMH-NUTKFTJISA-N Trp-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NLLARHRWSFNEMH-NUTKFTJISA-N 0.000 description 16
- GIOBXJSONRQHKQ-RYUDHWBXSA-N Tyr-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GIOBXJSONRQHKQ-RYUDHWBXSA-N 0.000 description 16
- PJWCWGXAVIVXQC-STECZYCISA-N Tyr-Ile-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PJWCWGXAVIVXQC-STECZYCISA-N 0.000 description 16
- RGJZPXFZIUUQDN-BPNCWPANSA-N Tyr-Val-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O RGJZPXFZIUUQDN-BPNCWPANSA-N 0.000 description 16
- WOCYUGQDXPTQPY-FXQIFTODSA-N Val-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N WOCYUGQDXPTQPY-FXQIFTODSA-N 0.000 description 16
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 16
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 16
- WJVLTYSHNXRCLT-NHCYSSNCSA-N Val-His-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WJVLTYSHNXRCLT-NHCYSSNCSA-N 0.000 description 16
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 16
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 16
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 16
- MJFSRZZJQWZHFQ-SRVKXCTJSA-N Val-Met-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N MJFSRZZJQWZHFQ-SRVKXCTJSA-N 0.000 description 16
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 16
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 16
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 16
- 108010005233 alanylglutamic acid Proteins 0.000 description 16
- 108010047495 alanylglycine Proteins 0.000 description 16
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 16
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 16
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 16
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 16
- 108010015792 glycyllysine Proteins 0.000 description 16
- 108010018006 histidylserine Proteins 0.000 description 16
- 108010057821 leucylproline Proteins 0.000 description 16
- 108010009298 lysylglutamic acid Proteins 0.000 description 16
- 108010051242 phenylalanylserine Proteins 0.000 description 16
- 108010029020 prolylglycine Proteins 0.000 description 16
- 108010026333 seryl-proline Proteins 0.000 description 16
- 108010071207 serylmethionine Proteins 0.000 description 16
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 16
- 238000005516 engineering process Methods 0.000 description 12
- 239000000243 solution Substances 0.000 description 11
- 102000004169 proteins and genes Human genes 0.000 description 8
- 235000018102 proteins Nutrition 0.000 description 7
- 108010032887 7 beta-hydroxysteroid dehydrogenase Proteins 0.000 description 6
- 239000013612 plasmid Substances 0.000 description 6
- FTSAJSADJCMDHH-CIUDSAMLSA-N Asn-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FTSAJSADJCMDHH-CIUDSAMLSA-N 0.000 description 5
- TWIAMTNJOMRDAK-GUBZILKMSA-N Gln-Lys-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O TWIAMTNJOMRDAK-GUBZILKMSA-N 0.000 description 5
- QEYUCKCWTMIERU-SRVKXCTJSA-N His-Lys-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N QEYUCKCWTMIERU-SRVKXCTJSA-N 0.000 description 5
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 5
- XJLXINKUBYWONI-DQQFMEOOSA-N [[(2r,3r,4r,5r)-5-(6-aminopurin-9-yl)-3-hydroxy-4-phosphonooxyoxolan-2-yl]methoxy-hydroxyphosphoryl] [(2s,3r,4s,5s)-5-(3-carbamoylpyridin-1-ium-1-yl)-3,4-dihydroxyoxolan-2-yl]methyl phosphate Chemical compound NC(=O)C1=CC=C[N+]([C@@H]2[C@H]([C@@H](O)[C@H](COP([O-])(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](OP(O)(O)=O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 XJLXINKUBYWONI-DQQFMEOOSA-N 0.000 description 5
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 5
- 229960000723 ampicillin Drugs 0.000 description 5
- 108010008355 arginyl-glutamine Proteins 0.000 description 5
- 230000015572 biosynthetic process Effects 0.000 description 5
- 238000010276 construction Methods 0.000 description 5
- 239000012634 fragment Substances 0.000 description 5
- 230000014509 gene expression Effects 0.000 description 5
- 238000004519 manufacturing process Methods 0.000 description 5
- 229930027945 nicotinamide-adenine dinucleotide Natural products 0.000 description 5
- RUDATBOHQWOJDD-UHFFFAOYSA-N (3beta,5beta,7alpha)-3,7-Dihydroxycholan-24-oic acid Natural products OC1CC2CC(O)CCC2(C)C2C1C1CCC(C(CCC(O)=O)C)C1(C)CC2 RUDATBOHQWOJDD-UHFFFAOYSA-N 0.000 description 4
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 4
- 239000007788 liquid Substances 0.000 description 4
- 108091008146 restriction endonucleases Proteins 0.000 description 4
- 238000012216 screening Methods 0.000 description 4
- 238000004088 simulation Methods 0.000 description 4
- RUDATBOHQWOJDD-UZVSRGJWSA-N ursodeoxycholic acid Chemical compound C([C@H]1C[C@@H]2O)[C@H](O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H]([C@@H](CCC(O)=O)C)[C@@]2(C)CC1 RUDATBOHQWOJDD-UZVSRGJWSA-N 0.000 description 4
- 229960001661 ursodiol Drugs 0.000 description 4
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- WSDDJLMGYRLUKR-WUEGHLCSSA-L disodium;[(2r,3r,4r,5r)-2-(6-aminopurin-9-yl)-5-[[[[(2r,3s,4r,5r)-5-(3-carbamoylpyridin-1-ium-1-yl)-3,4-dihydroxyoxolan-2-yl]methoxy-oxidophosphoryl]oxy-oxidophosphoryl]oxymethyl]-4-hydroxyoxolan-3-yl] hydrogen phosphate Chemical compound [Na+].[Na+].NC(=O)C1=CC=C[N+]([C@H]2[C@@H]([C@H](O)[C@@H](COP([O-])(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](OP([O-])([O-])=O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 WSDDJLMGYRLUKR-WUEGHLCSSA-L 0.000 description 3
- 239000000499 gel Substances 0.000 description 3
- 239000003262 industrial enzyme Substances 0.000 description 3
- 208000008338 non-alcoholic fatty liver disease Diseases 0.000 description 3
- 239000000843 powder Substances 0.000 description 3
- 230000009467 reduction Effects 0.000 description 3
- 238000002741 site-directed mutagenesis Methods 0.000 description 3
- 239000006228 supernatant Substances 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- 241000894006 Bacteria Species 0.000 description 2
- 101710088194 Dehydrogenase Proteins 0.000 description 2
- 108010062875 Hydroxysteroid Dehydrogenases Proteins 0.000 description 2
- 102000011145 Hydroxysteroid Dehydrogenases Human genes 0.000 description 2
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 2
- 238000012408 PCR amplification Methods 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 229940024606 amino acid Drugs 0.000 description 2
- 235000001014 amino acid Nutrition 0.000 description 2
- 235000003704 aspartic acid Nutrition 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 2
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 2
- DKPFZGUDAPQIHT-UHFFFAOYSA-N butyl acetate Chemical compound CCCCOC(C)=O DKPFZGUDAPQIHT-UHFFFAOYSA-N 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 230000002255 enzymatic effect Effects 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 238000000855 fermentation Methods 0.000 description 2
- 230000004151 fermentation Effects 0.000 description 2
- 108010029645 galactitol 2-dehydrogenase Proteins 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 210000004185 liver Anatomy 0.000 description 2
- 239000008176 lyophilized powder Substances 0.000 description 2
- 229920002521 macromolecule Polymers 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 239000000047 product Substances 0.000 description 2
- 230000009465 prokaryotic expression Effects 0.000 description 2
- 210000002966 serum Anatomy 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- RHCPKKNRWFXMAT-RRWYKFPJSA-N 3alpha,12alpha-dihydroxy-7-oxo-5beta-cholanic acid Chemical compound C1C[C@@H](O)C[C@H]2CC(=O)[C@H]3[C@@H]4CC[C@H]([C@@H](CCC(O)=O)C)[C@@]4(C)[C@@H](O)C[C@@H]3[C@]21C RHCPKKNRWFXMAT-RRWYKFPJSA-N 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- 206010008609 Cholangitis sclerosing Diseases 0.000 description 1
- 206010008635 Cholestasis Diseases 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 1
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 230000010757 Reduction Activity Effects 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- BHTRKEVKTKCXOH-UHFFFAOYSA-N Taurochenodesoxycholsaeure Natural products OC1CC2CC(O)CCC2(C)C2C1C1CCC(C(CCC(=O)NCCS(O)(=O)=O)C)C1(C)CC2 BHTRKEVKTKCXOH-UHFFFAOYSA-N 0.000 description 1
- 102000003929 Transaminases Human genes 0.000 description 1
- 108090000340 Transaminases Proteins 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 230000002300 anti-fibrosis Effects 0.000 description 1
- 230000003110 anti-inflammatory effect Effects 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 125000000613 asparagine group Chemical group N[C@@H](CC(N)=O)C(=O)* 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 230000002210 biocatalytic effect Effects 0.000 description 1
- 239000003054 catalyst Substances 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 230000007870 cholestasis Effects 0.000 description 1
- 231100000359 cholestasis Toxicity 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000009510 drug design Methods 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 239000000411 inducer Substances 0.000 description 1
- 238000009776 industrial production Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 125000001570 methylene group Chemical group [H]C([H])([*:1])[*:2] 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000003032 molecular docking Methods 0.000 description 1
- 201000000742 primary sclerosing cholangitis Diseases 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 230000002633 protecting effect Effects 0.000 description 1
- 230000035484 reaction time Effects 0.000 description 1
- 230000008707 rearrangement Effects 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 208000010157 sclerosing cholangitis Diseases 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- BHTRKEVKTKCXOH-LBSADWJPSA-N tauroursodeoxycholic acid Chemical compound C([C@H]1C[C@@H]2O)[C@H](O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H]([C@@H](CCC(=O)NCCS(O)(=O)=O)C)[C@@]2(C)CC1 BHTRKEVKTKCXOH-LBSADWJPSA-N 0.000 description 1
- UFTFJSFQGQCHQW-UHFFFAOYSA-N triformin Chemical compound O=COCC(OC=O)COC=O UFTFJSFQGQCHQW-UHFFFAOYSA-N 0.000 description 1
- 239000012137 tryptone Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0006—Oxidoreductases (1.) acting on CH-OH groups as donors (1.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/74—Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora
- C12N15/75—Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora for Bacillus
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/74—Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora
- C12N15/76—Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora for Actinomyces; for Streptomyces
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/80—Vectors or expression systems specially adapted for eukaryotic hosts for fungi
- C12N15/81—Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts
- C12N15/815—Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts for yeasts other than Saccharomyces
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P33/00—Preparation of steroids
- C12P33/06—Hydroxylating
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y101/00—Oxidoreductases acting on the CH-OH group of donors (1.1)
- C12Y101/01—Oxidoreductases acting on the CH-OH group of donors (1.1) with NAD+ or NADP+ as acceptor (1.1.1)
- C12Y101/01201—7-Beta-hydroxysteroid dehydrogenase (NADP+) (1.1.1.201)
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A50/00—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE in human health protection, e.g. against extreme weather
- Y02A50/30—Against vector-borne diseases, e.g. mosquito-borne, fly-borne, tick-borne or waterborne diseases whose impact is exacerbated by climate change
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Mycology (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Medicinal Chemistry (AREA)
- Enzymes And Modification Thereof (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
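The present invention discloses a 7β-HSDH enzyme mutant, its encoding gene, and uses thereof. Compared with the wild-type 7β-HSDH enzyme whose amino acid sequence is shown in SEQ ID NO:2, the amino acid sequence of the mutant carries any one of the single mutations or pairwise combined mutations at positions 237 and 240 of SEQ ID NO:2. The mutant can be used to synthesize 24-norursodeoxycholic acid: used as a biocatalyst, it converts the substrate 24-nor-7-ketolithocholic acid into 24-norursodeoxycholic acid, and HPLC analysis of the reaction product shows a conversion above 90%. Compared with the wild-type enzyme, the mutants constructed in this invention have significantly higher catalytic activity, which significantly reduces the amount of enzyme required, and they have broad prospects for large-scale industrial application.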
Description
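Technical Field
The present invention relates to the technical field of enzyme engineering, and in particular to a 7β-HSDH enzyme mutant, its encoding gene, and uses thereof.
Background Art
24-Norursodeoxycholic acid (norUDCA) is a homologue of ursodeoxycholic acid whose side chain is one methylene group shorter. It has hepatoprotective, anti-inflammatory and antifibrotic activity, and can improve serum alkaline phosphatase levels and other markers of cholestasis in patients with primary sclerosing cholangitis. A recent phase IIa randomized, double-blind, controlled study reported that high-dose norUDCA significantly improved serum transaminases, triglycerides and liver imaging indices in patients with non-alcoholic fatty liver disease (NAFLD), with good drug safety [Yang Ruixu, Fan Jiangao. AASLD 2017: 24-norursodeoxycholic acid in the treatment of non-alcoholic fatty liver disease].
No enzymatic synthesis of 24-norursodeoxycholic acid has been reported to date. In particular, when 24-nor-7-ketolithocholic acid is used as the substrate for asymmetric reduction at the 7-position, the catalytic efficiency of the 7β-hydroxysteroid dehydrogenases (7β-HSDH) reported so far is extremely low, less than 5% of their activity on the original substrate 7-ketolithocholic acid. This severely limits industrial enzymatic production of 24-norursodeoxycholic acid.
For example, Chinese patent CN109182284A discloses a 7β-hydroxysteroid dehydrogenase mutant, its coding sequence, a recombinant gene expression vector, a genetically engineered strain, and uses thereof. That patent mutates the 7β-hydroxysteroid dehydrogenase from Collinsella aerofaciens: mutating the glutamic acid at position 175 of the wild-type enzyme to aspartic acid gives mutant Ca7β-1, and mutating both the glutamic acid at position 175 and the asparagine at position 197 to aspartic acid gives mutant Ca7β-2. Both mutants have increased reducing activity and can be used to catalyze the synthesis of ursodeoxycholic acid (UDCA) and tauroursodeoxycholic acid (TUDCA). However, that patent contains no disclosure of using a 7β-hydroxysteroid dehydrogenase as a catalyst for the synthesis of norUDCA.
Three-dimensional protein structure modelling and directed protein evolution are techniques developed in recent years for artificially modifying native gene sequences to meet the needs of industrial application; directed evolution of proteins was recognized with the 2018 Nobel Prize in Chemistry. Combining these two techniques to find and develop new hydroxysteroid dehydrogenases suited to large-scale industrial production is therefore a current research focus.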
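Summary of the Invention
In view of the shortcomings of the prior art, the technical problem addressed by the present invention is to provide a 7β-HSDH enzyme mutant, its encoding gene, and uses thereof, so as to overcome the unsatisfactory activity of existing hydroxysteroid dehydrogenases, which makes industrial production difficult. Using three-dimensional protein structure modelling and directed protein evolution, the invention engineers the 7β-HSDH enzyme from Collinsella aerofaciens DSM 3979 (Luo Liu, Arno Aigner, Rolf D. Schmid. Appl Microbiol Biotechnol. 2011, 90:127-135) and significantly increases its catalytic activity toward 24-nor-7-ketolithocholic acid, which helps reduce the amount of enzyme needed in industry and lowers production cost.
To solve the above technical problem, the present invention provides the following technical solutions:
The amino acid sequence of the wild-type 7β-HSDH enzyme from Collinsella aerofaciens is shown in SEQ ID NO:2, and the nucleotide sequence of its encoding gene is shown in SEQ ID NO:1.
The nucleotide sequence encoding the 7β-HSDH enzyme was obtained by total gene synthesis from Changzhou Jiyu Biotechnology Co., Ltd. (常州基宇生物技术有限公司), with NdeI and HindIII restriction sites added at the two ends of the coding region. The target gene fragment was digested with NdeI and HindIII, ligated into pET21a(+) vector (Novagen) that had been digested with the same two enzymes, transformed, and screened. The positive plasmid obtained, 7β-HSDH-pET21a(+), was transferred into the BL21(DE3) host strain to build a heterologous in vitro expression system for the 7β-HSDH enzyme.
The 7β-HSDH enzyme mutants are constructed by directed evolution, that is, by directed-evolution techniques such as error-prone PCR, DNA shuffling, semi-rational design, and three-dimensional structure simulation by macromolecular modelling. Specifically, the present invention carries out directed evolution of the enzyme through three-dimensional structure simulation by macromolecular modelling. Homology modelling is used to simulate the three-dimensional structure of the 7β-HSDH enzyme; the minimum-energy principle and molecular docking are used to predict one or more active sites that may be involved in catalysis; these sites are then subjected to site-directed mutagenesis, and mutants with significantly improved activity are selected from the results.
In more detail: macromolecular modelling predicted two sites that may be related to catalytic activity, N237 and S240. Site-directed mutagenesis was performed at each of these two sites, and the mutants were screened by high-performance liquid chromatography (HPLC). More specifically: (1) when asparagine (N) at position 237 is mutated to histidine (H), the catalytic activity of the mutant is higher than that of the wild-type enzyme; (2) when asparagine (N) at position 237 is mutated to glutamine (Q), the enzyme activity of the mutant is increased; (3) when serine (S) at position 240 is mutated to asparagine (N), the enzyme activity of the mutant is higher than that of the wild-type enzyme; (4) when serine (S) at position 240 is mutated to arginine (R), the enzyme activity of the mutant is significantly increased; (5) when serine (S) at position 240 is mutated to glutamic acid (E), the enzyme activity of the mutant is significantly increased; (6) when serine (S) at position 240 is mutated to glutamine (Q), the enzyme activity of the mutant is significantly increased. When mutations at the two sites are combined pairwise, the catalytic activity of the resulting mutant is higher still than that of the single mutants.
Accordingly, in one aspect, the present invention claims a 7β-HSDH enzyme mutant whose amino acid sequence, compared with the wild-type 7β-HSDH enzyme whose amino acid sequence is shown in SEQ ID NO:2, carries any one of the single mutations or pairwise combined mutations at positions 237 and 240 of the amino acid sequence shown in SEQ ID NO:2.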
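Specifically, the single mutation is:
position 237 of the amino acid sequence shown in SEQ ID NO:2 is mutated from asparagine to histidine, and the amino acid sequence of the 7β-HSDH enzyme mutant is as shown in SEQ ID NO:6;
or, position 237 of the amino acid sequence shown in SEQ ID NO:2 is mutated from asparagine to glutamine, and the amino acid sequence of the 7β-HSDH enzyme mutant is as shown in SEQ ID NO:8;
or, position 240 of the amino acid sequence shown in SEQ ID NO:2 is mutated from serine to asparagine, and the amino acid sequence of the 7β-HSDH enzyme mutant is as shown in SEQ ID NO:10;
or, position 240 of the amino acid sequence shown in SEQ ID NO:2 is mutated from serine to arginine, and the amino acid sequence of the 7β-HSDH enzyme mutant is as shown in SEQ ID NO:12;
or, position 240 of the amino acid sequence shown in SEQ ID NO:2 is mutated from serine to glutamic acid, and the amino acid sequence of the 7β-HSDH enzyme mutant is as shown in SEQ ID NO:14;
or, position 240 of the amino acid sequence shown in SEQ ID NO:2 is mutated from serine to glutamine, and the amino acid sequence of the 7β-HSDH enzyme mutant is as shown in SEQ ID NO:16.
Specifically, the pairwise combined mutation is:
position 237 of the amino acid sequence shown in SEQ ID NO:2 is mutated from asparagine to histidine and position 240 from serine to asparagine, and the amino acid sequence of the 7β-HSDH enzyme mutant is as shown in SEQ ID NO:18;
or, position 237 of the amino acid sequence shown in SEQ ID NO:2 is mutated from asparagine to histidine and position 240 from serine to arginine, and the amino acid sequence of the 7β-HSDH enzyme mutant is as shown in SEQ ID NO:20;
or, position 237 of the amino acid sequence shown in SEQ ID NO:2 is mutated from asparagine to histidine and position 240 from serine to glutamic acid, and the amino acid sequence of the 7β-HSDH enzyme mutant is as shown in SEQ ID NO:22;
or, position 237 of the amino acid sequence shown in SEQ ID NO:2 is mutated from asparagine to histidine and position 240 from serine to glutamine, and the amino acid sequence of the 7β-HSDH enzyme mutant is as shown in SEQ ID NO:24;
or, position 237 of the amino acid sequence shown in SEQ ID NO:2 is mutated from asparagine to glutamine and position 240 from serine to asparagine, and the amino acid sequence of the 7β-HSDH enzyme mutant is as shown in SEQ ID NO:26;
or, position 237 of the amino acid sequence shown in SEQ ID NO:2 is mutated from asparagine to glutamine and position 240 from serine to arginine, and the amino acid sequence of the 7β-HSDH enzyme mutant is as shown in SEQ ID NO:28;
or, position 237 of the amino acid sequence shown in SEQ ID NO:2 is mutated from asparagine to glutamine and position 240 from serine to glutamic acid, and the amino acid sequence of the 7β-HSDH enzyme mutant is as shown in SEQ ID NO:30;
or, position 237 of the amino acid sequence shown in SEQ ID NO:2 is mutated from asparagine to glutamine and position 240 from serine to glutamine, and the amino acid sequence of the 7β-HSDH enzyme mutant is as shown in SEQ ID NO:32.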
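In another aspect, the present invention also claims the genes encoding the above 7β-HSDH enzyme mutants.
Specifically, the nucleotide sequence of the gene encoding the 7β-HSDH enzyme mutant whose amino acid sequence is shown in SEQ ID NO:6 is as shown in SEQ ID NO:5;
or, the nucleotide sequence of the gene encoding the 7β-HSDH enzyme mutant whose amino acid sequence is shown in SEQ ID NO:8 is as shown in SEQ ID NO:7;
or, the nucleotide sequence of the gene encoding the 7β-HSDH enzyme mutant whose amino acid sequence is shown in SEQ ID NO:10 is as shown in SEQ ID NO:9;
or, the nucleotide sequence of the gene encoding the 7β-HSDH enzyme mutant whose amino acid sequence is shown in SEQ ID NO:12 is as shown in SEQ ID NO:11;
or, the nucleotide sequence of the gene encoding the 7β-HSDH enzyme mutant whose amino acid sequence is shown in SEQ ID NO:14 is as shown in SEQ ID NO:13;
or, the nucleotide sequence of the gene encoding the 7β-HSDH enzyme mutant whose amino acid sequence is shown in SEQ ID NO:16 is as shown in SEQ ID NO:15;
or, the nucleotide sequence of the gene encoding the 7β-HSDH enzyme mutant whose amino acid sequence is shown in SEQ ID NO:18 is as shown in SEQ ID NO:17;
or, the nucleotide sequence of the gene encoding the 7β-HSDH enzyme mutant whose amino acid sequence is shown in SEQ ID NO:20 is as shown in SEQ ID NO:19;
or, the nucleotide sequence of the gene encoding the 7β-HSDH enzyme mutant whose amino acid sequence is shown in SEQ ID NO:22 is as shown in SEQ ID NO:21;
or, the nucleotide sequence of the gene encoding the 7β-HSDH enzyme mutant whose amino acid sequence is shown in SEQ ID NO:24 is as shown in SEQ ID NO:23;
or, the nucleotide sequence of the gene encoding the 7β-HSDH enzyme mutant whose amino acid sequence is shown in SEQ ID NO:26 is as shown in SEQ ID NO:25;
or, the nucleotide sequence of the gene encoding the 7β-HSDH enzyme mutant whose amino acid sequence is shown in SEQ ID NO:28 is as shown in SEQ ID NO:27;
or, the nucleotide sequence of the gene encoding the 7β-HSDH enzyme mutant whose amino acid sequence is shown in SEQ ID NO:30 is as shown in SEQ ID NO:29;
or, the nucleotide sequence of the gene encoding the 7β-HSDH enzyme mutant whose amino acid sequence is shown in SEQ ID NO:32 is as shown in SEQ ID NO:31.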
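The numbering of the claimed mutants follows a regular pattern: the six single mutants and eight pairwise mutants take the even protein SEQ ID NOs 6 to 32 in the order listed, and each encoding gene has the preceding odd number. The Python sketch below rebuilds that lookup table. The function and variable names are illustrative assumptions and do not come from the patent.

```python
from itertools import product

# Substitutions claimed at each site, in the order the text lists them.
SITE_237 = ["N237H", "N237Q"]
SITE_240 = ["S240N", "S240R", "S240E", "S240Q"]


def build_seq_id_table():
    """Return {mutant name: (protein SEQ ID NO, nucleotide SEQ ID NO)}."""
    names = SITE_237 + SITE_240  # six single mutants
    # Eight pairwise mutants: every 237 substitution with every 240 substitution.
    names += [f"{a}/{b}" for a, b in product(SITE_237, SITE_240)]
    table = {}
    for offset, name in enumerate(names):
        protein_id = 6 + 2 * offset  # even numbers 6, 8, ..., 32
        table[name] = (protein_id, protein_id - 1)  # gene is the preceding odd number
    return table


if __name__ == "__main__":
    for name, (protein_id, gene_id) in build_seq_id_table().items():
        print(f"{name}: protein SEQ ID NO:{protein_id}, gene SEQ ID NO:{gene_id}")
```

A table like this is convenient when cross-checking the claims against the sequence listing, since every mutant name resolves to exactly one protein and one gene entry.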
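According to common general knowledge, any gene, after manipulation or modification, can be ligated into various expression vectors, transformed into a suitable host cell, and induced under appropriate conditions to overexpress the target protein.
Accordingly, in a further aspect, the present invention also claims a vector containing the above encoding gene.
Specifically, the vector may be any of various expression vectors, including but not limited to a pET, pCW, pUC or pPIC9k expression vector.
In a further aspect, the present invention also claims a host cell containing the above encoding gene.
Specifically, the host cell may be any suitable host cell, including but not limited to Escherichia coli, Pichia pastoris, Streptomyces or Bacillus subtilis.
In a further aspect, the present invention also claims use of the above 7β-HSDH enzyme mutant, encoding gene, vector or host cell in the preparation of 24-norursodeoxycholic acid.
In yet another aspect, the present invention also provides a method for preparing 24-norursodeoxycholic acid, comprising the following steps:
S1. Prepare a reaction system containing: 1-10 g/L of the above 7β-HSDH enzyme mutant, 50 mM sodium phosphate buffer (pH 6.0-8.0), 0.2 mM NADP+, 10-50 g/L 24-nor-7-ketolithocholic acid, 5-40 g/L glucose, and 0.1-10 g/L glucose dehydrogenase; hold the reaction system at 25-40 °C and pH 6.0-8.0 and react with stirring;
S2. After 24 h of reaction, analyze by HPLC; 24-norursodeoxycholic acid is obtained.
Preferably, the method comprises the following steps:
S1. Prepare a reaction system containing: 1 g/L of the above 7β-HSDH enzyme mutant, 50 mM sodium phosphate buffer (pH 7.0), 0.2 mM NADP+, 10-50 g/L 24-nor-7-ketolithocholic acid, 5-40 g/L glucose, and 0.5 g/L glucose dehydrogenase; hold the reaction system at 30 °C and pH 7.0 and react with stirring;
S2. After 24 h of reaction, analyze by HPLC; 24-norursodeoxycholic acid is obtained.
HPLC analysis of the reaction product shows a conversion above 90%. This demonstrates that the enzyme mutant can serve as a biocatalyst for converting the substrate 24-nor-7-ketolithocholic acid into 24-norursodeoxycholic acid.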
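As a rough illustration of the preferred reaction system, the Python sketch below scales the stated concentrations (1 g/L mutant enzyme, 0.5 g/L glucose dehydrogenase, 0.2 mM NADP+, 10-50 g/L substrate, 5-40 g/L glucose) to a chosen reaction volume. The function name, the dictionary keys and the range checks are assumptions made for this example; the patent gives only the concentrations.

```python
def reaction_recipe(volume_l, substrate_g_per_l, glucose_g_per_l):
    """Scale the preferred reaction system to `volume_l` litres.

    Substrate and glucose loadings must fall inside the ranges stated in
    the method (10-50 g/L and 5-40 g/L respectively).
    """
    if volume_l <= 0:
        raise ValueError("volume must be positive")
    if not 10 <= substrate_g_per_l <= 50:
        raise ValueError("substrate loading outside the 10-50 g/L range")
    if not 5 <= glucose_g_per_l <= 40:
        raise ValueError("glucose loading outside the 5-40 g/L range")
    return {
        "mutant_enzyme_g": 1.0 * volume_l,       # 1 g/L 7beta-HSDH mutant
        "gdh_g": 0.5 * volume_l,                 # 0.5 g/L glucose dehydrogenase
        "nadp_umol": 0.2 * 1000.0 * volume_l,    # 0.2 mM = 200 umol per litre
        "substrate_g": substrate_g_per_l * volume_l,
        "glucose_g": glucose_g_per_l * volume_l,
    }


if __name__ == "__main__":
    # Hypothetical 100 mL batch at 20 g/L substrate and 15 g/L glucose.
    for component, amount in reaction_recipe(0.1, 20, 15).items():
        print(f"{component}: {amount:.3f}")
```

The cofactor is reported in micromoles rather than grams because the text does not specify which NADP+ salt form is weighed out in this step.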
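In addition, the enzyme used for the above biocatalytic reaction may be in any of several forms, including purified enzyme, resting cells of the corresponding recombinant strain, crude enzyme solution, or crude enzyme powder.
Compared with the prior art, the present invention has the following beneficial effects:
Compared with the wild-type enzyme, the 7β-HSDH enzyme mutants constructed in the present invention have greatly increased specific activity. This significantly speeds up the reaction, reduces the amount of enzyme required, and shortens reaction time and lowers production cost: 10-50 g/L of the substrate 24-nor-7-ketolithocholic acid can be converted into 24-norursodeoxycholic acid at room temperature within 24 h, with a conversion above 90%. The mutants remove an obstacle to industrial enzymatic production of 24-norursodeoxycholic acid and have broad prospects for industrial application.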
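Detailed Description of the Embodiments
The present invention is described in further detail below with reference to specific examples. The examples serve only to illustrate the invention and do not limit it. Unless otherwise specified, the materials and reagents used in the following examples are commercially available.
Experimental methods for which no specific conditions are given in the examples are generally carried out under conventional conditions, for example as described in Molecular Cloning: A Laboratory Manual (J. Sambrook, D. W. Russell; Chinese translation by Huang Peitang, Wang Jiaxi, Zhu Houchu et al., 3rd edition, Beijing: Science Press, 2002).
Example 1: Construction of the prokaryotic expression system
The 7β-HSDH gene fragment was synthesized by Changzhou Jiyu Biotechnology Co., Ltd. (常州基宇生物技术有限公司) and cloned into the pUC57 vector. It was double-digested with the restriction enzymes NdeI and HindIII (New England Biolabs, NEB) at 37 °C for 4 h, separated by 1% agarose gel electrophoresis, and recovered from the gel (gel extraction kit from Tiangen Biotech (Beijing) Co., Ltd.). It was then ligated overnight in a low-temperature ligation unit, using T4 DNA ligase (Takara), to the expression vector pET21a(+) (Novagen) that had been digested with the same two enzymes. The ligation mixture was transformed into DH5α competent cells (Tiangen Biotech (Beijing) Co., Ltd.), and colonies were screened by colony PCR and verified by sequencing, giving the positive recombinant plasmid 7β-HSDH-pET21a(+).
The positive recombinant plasmid 7β-HSDH-pET21a(+) was transformed into the expression host BL21(DE3) (Tiangen Biotech (Beijing) Co., Ltd.) to give the prokaryotic expression strain 7β-HSDH-pET21a(+)/BL21(DE3), which served as the parent strain for subsequent directed evolution and fermentation.
The glucose dehydrogenase used for NADPH regeneration (GDH, from B. subtilis) was synthesized by Changzhou Jiyu Biotechnology Co., Ltd. Its recombinant expression plasmid was constructed in the same way as the 7β-HSDH-pET21a(+) plasmid and transferred into BL21(DE3) to give the expression strain.
Example 2: Shake-flask fermentation and preparation of lyophilized enzyme powder
The expression strain 7β-HSDH-pET21a(+)/BL21(DE3) constructed above was grown overnight at 37 °C with shaking at 200 rpm in 5 mL of LB liquid medium [10 g/L tryptone (OXOID), 5 g/L yeast extract (OXOID), 10 g/L sodium chloride (Sinopharm Chemical Reagent)] containing ampicillin at a final concentration of 100 μg/mL. The culture was then inoculated at 1% (v/v) into 400 mL of LB liquid medium containing 100 μg/mL ampicillin and grown at 37 °C with shaking at 200 rpm. When the OD600 reached 0.8-1.0, the inducer IPTG (isopropyl-β-D-thiogalactoside) was added to a final concentration of 0.1 mM, and induction was carried out overnight at 30 °C. The cells were collected by centrifugation at 4 °C and 8000 rpm, resuspended in 50 mM sodium phosphate buffer (pH 7.0), disrupted by sonication (200 W, 3 s/5 s, 20 min), and centrifuged at 4 °C and 12000 rpm for 20 min. The supernatant was freeze-dried to give the lyophilized enzyme powder.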
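Example 3: Construction and screening of mutants
Mutant construction: macromolecular modelling predicted two potentially beneficial mutation sites, N237 and S240, and site-directed mutagenesis was carried out at each of them (N237K, N237H, N237Q, S240N, S240R, S240E, S240Q). Using the 7β-HSDH-pET21a(+) recombinant plasmid as the template and the corresponding synthetic primers, a first PCR amplified the mutated DNA fragment; a second PCR then used that fragment as the template to amplify the full-length mutant 7β-HSDH gene. (For the detailed mutagenesis procedure, see the instructions for the Stratagene Site-Directed Mutagenesis Kit.)
Where:
N237K mutation (asparagine at position 237 mutated to lysine)
Forward primer (SEQ ID NO:33): 5' TCGCCGGTCAACGTAAAAAAGATAGCGTCC 3',
Reverse primer (SEQ ID NO:34): 5' GGACGCTATCTTTTTTACGTTGACCGGCGA 3';
N237H mutation (asparagine at position 237 mutated to histidine)
Forward primer (SEQ ID NO:35): 5' TCGCCGGTCAACGTCATAAAGATAGCGTCCAT 3',
Reverse primer (SEQ ID NO:36): 5' ATGGACGCTATCTTTATGACGTTGACCGGCGA 3';
N237Q mutation (asparagine at position 237 mutated to glutamine)
Forward primer (SEQ ID NO:37): 5' TCGCCGGTCAACGTCAGAAAGATAGCGTCC 3',
Reverse primer (SEQ ID NO:38): 5' GGACGCTATCTTTCTGACGTTGACCGGCGA 3';
S240N mutation (serine at position 240 mutated to asparagine)
Forward primer (SEQ ID NO:39): 5' CGGTCAACGTAATAAAGATAATGTCCATGACTGG 3',
Reverse primer (SEQ ID NO:40): 5' CCAGTCATGGACATTATCTTTATTACGTTGACCG 3';
S240R mutation (serine at position 240 mutated to arginine)
Forward primer (SEQ ID NO:41): 5' CGTAATAAAGATCGAGTCCATGACTGG 3',
Reverse primer (SEQ ID NO:42): 5' CCAGTCATGGACTCGATCTTTATTACG 3';
S240E mutation (serine at position 240 mutated to glutamic acid)
Forward primer (SEQ ID NO:43): 5' CGTAATAAAGATGAAGTCCATGACTGG 3',
Reverse primer (SEQ ID NO:44): 5' CCAGTCATGGACTTCATCTTTATTACG 3';
S240Q mutation (serine at position 240 mutated to glutamine)
Forward primer (SEQ ID NO:45): 5' CGTAATAAAGATCAGGTCCATGACTGG 3',
Reverse primer (SEQ ID NO:46): 5' CCAGTCATGGACCTGATCTTTATTACG 3'.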
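Each forward and reverse primer listed above should be an exact reverse complement of its partner, as is usual for this style of site-directed mutagenesis. The Python sketch below checks that property for the seven primer pairs, with the sequences copied from SEQ ID NO:33-46. The helper names are assumptions made for this example.

```python
# Translation table mapping each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# (forward, reverse) primers, 5'->3', copied from SEQ ID NO:33-46.
PRIMERS = {
    "N237K": ("TCGCCGGTCAACGTAAAAAAGATAGCGTCC", "GGACGCTATCTTTTTTACGTTGACCGGCGA"),
    "N237H": ("TCGCCGGTCAACGTCATAAAGATAGCGTCCAT", "ATGGACGCTATCTTTATGACGTTGACCGGCGA"),
    "N237Q": ("TCGCCGGTCAACGTCAGAAAGATAGCGTCC", "GGACGCTATCTTTCTGACGTTGACCGGCGA"),
    "S240N": ("CGGTCAACGTAATAAAGATAATGTCCATGACTGG", "CCAGTCATGGACATTATCTTTATTACGTTGACCG"),
    "S240R": ("CGTAATAAAGATCGAGTCCATGACTGG", "CCAGTCATGGACTCGATCTTTATTACG"),
    "S240E": ("CGTAATAAAGATGAAGTCCATGACTGG", "CCAGTCATGGACTTCATCTTTATTACG"),
    "S240Q": ("CGTAATAAAGATCAGGTCCATGACTGG", "CCAGTCATGGACCTGATCTTTATTACG"),
}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def is_complementary_pair(forward, reverse):
    """True when `reverse` is exactly the reverse complement of `forward`."""
    return reverse_complement(forward) == reverse.upper()


if __name__ == "__main__":
    for name, (fwd, rev) in PRIMERS.items():
        status = "ok" if is_complementary_pair(fwd, rev) else "MISMATCH"
        print(f"{name}: {status}")
```

A check of this kind catches transcription errors in a primer order before synthesis; it says nothing about melting temperature or secondary structure.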
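Mutant culture: the mutated plasmids obtained above were transformed into the BL21(DE3) host strain, spread on LB solid medium containing 100 μg/mL ampicillin, and incubated inverted at 37 °C overnight. Single colonies were then picked from the plates into 5 mL of LB liquid medium containing 100 μg/mL ampicillin. The overnight cultures were inoculated at 1% (v/v) into 100 mL of LB liquid medium containing 100 μg/mL ampicillin and grown at 37 °C with shaking at 200 rpm for 4 h, after which IPTG was added to a final concentration of 0.1 mM for induction and the cultures were grown overnight at 30 °C. The cells were collected by centrifugation at 4 °C and 8000 rpm for 10 min, resuspended in 50 mM sodium phosphate buffer (pH 7.0), disrupted by sonication (200 W, 3 s/5 s, 30 min), and centrifuged at 4 °C and 12000 rpm for 20 min. The supernatant was used to measure specific activity.
Activity screening of mutants: the assay contained substrate at 2 g/L (prepared in DMSO), 0.2 mM NADPH, and an appropriate amount of the supernatant prepared above, made up to 3 mL with 50 mM sodium phosphate buffer (pH 7.0). The reaction was run at room temperature while the change in NADPH absorbance at 340 nm was monitored in real time. The specific activity of each mutant (U/mg) was calculated from the amount of NADPH consumed and its rate of decrease. 1 U is defined as the amount of enzyme that consumes 1 μmol of NADPH in 1 min.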
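The conversion from an absorbance trace to U/mg is not spelled out in the text. The Python sketch below shows one conventional way to do it via the Beer-Lambert law, assuming the commonly used molar extinction coefficient of NADPH at 340 nm (6.22 mM^-1 cm^-1) and a 1 cm cuvette. Both values, and the function name, are assumptions of this example rather than details given in the patent.

```python
# Assumed literature value for NADPH at 340 nm, in mM^-1 cm^-1.
NADPH_EXTINCTION_MM = 6.22


def specific_activity(delta_a340_per_min, total_volume_ml, protein_mg, path_cm=1.0):
    """Return specific activity in U/mg (1 U = 1 umol NADPH consumed per min).

    delta_a340_per_min: slope of the A340 trace (negative while NADPH is consumed)
    total_volume_ml:    assay volume (3 mL in the screening assay)
    protein_mg:         mass of protein added to the assay
    path_cm:            optical path length of the cuvette (assumed 1 cm)
    """
    if protein_mg <= 0:
        raise ValueError("protein mass must be positive")
    # Beer-Lambert: concentration change in mM/min, which equals umol/mL/min.
    rate_mm_per_min = abs(delta_a340_per_min) / (NADPH_EXTINCTION_MM * path_cm)
    umol_per_min = rate_mm_per_min * total_volume_ml
    return umol_per_min / protein_mg


if __name__ == "__main__":
    # Hypothetical reading: A340 falls by 0.0622 per minute with 0.05 mg protein.
    print(f"{specific_activity(-0.0622, 3.0, 0.05):.2f} U/mg")
```

In practice the slope should be taken from the initial linear part of the trace and corrected against a no-enzyme blank.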
The results are shown in Table 1 below.
Table 1. Enzyme activity of the wild type and different mutants
Amino acid sequence | Wild type / mutant | Specific activity (U/mg) | Activity relative to wild type (fold) |
SEQ ID NO:2 | Wild-type 7β-HSDH | 0.35 | --- |
SEQ ID NO:4 | N237K | 0.13 | --- |
SEQ ID NO:6 | N237H | 0.68 | 1.94 |
SEQ ID NO:8 | N237Q | 0.54 | 1.54 |
SEQ ID NO:10 | S240N | 1.00 | 2.86 |
SEQ ID NO:12 | S240R | 1.11 | 3.17 |
SEQ ID NO:14 | S240E | 0.54 | 1.54 |
SEQ ID NO:16 | S240Q | 0.95 | 2.71 |
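These results show that the clones with significantly increased enzyme activity carry the following mutations: asparagine (N) at position 237 mutated to histidine (H); asparagine (N) at position 237 mutated to glutamine (Q); serine (S) at position 240 mutated to asparagine (N); serine (S) at position 240 mutated to arginine (R); serine (S) at position 240 mutated to glutamic acid (E); serine (S) at position 240 mutated to glutamine (Q).
Where position 237 of the amino acid sequence shown in SEQ ID NO:2 is mutated from asparagine to histidine, the amino acid sequence of the 7β-HSDH enzyme mutant is as shown in SEQ ID NO:6 and the nucleotide sequence of its encoding gene is as shown in SEQ ID NO:5.
Where position 237 of the amino acid sequence shown in SEQ ID NO:2 is mutated from asparagine to glutamine, the amino acid sequence of the 7β-HSDH enzyme mutant is as shown in SEQ ID NO:8 and the nucleotide sequence of its encoding gene is as shown in SEQ ID NO:7.
Where position 240 of the amino acid sequence shown in SEQ ID NO:2 is mutated from serine to asparagine, the amino acid sequence of the 7β-HSDH enzyme mutant is as shown in SEQ ID NO:10 and the nucleotide sequence of its encoding gene is as shown in SEQ ID NO:9.
Where position 240 of the amino acid sequence shown in SEQ ID NO:2 is mutated from serine to arginine, the amino acid sequence of the 7β-HSDH enzyme mutant is as shown in SEQ ID NO:12 and the nucleotide sequence of its encoding gene is as shown in SEQ ID NO:11.
Where position 240 of the amino acid sequence shown in SEQ ID NO:2 is mutated from serine to glutamic acid, the amino acid sequence of the 7β-HSDH enzyme mutant is as shown in SEQ ID NO:14 and the nucleotide sequence of its encoding gene is as shown in SEQ ID NO:13.
Where position 240 of the amino acid sequence shown in SEQ ID NO:2 is mutated from serine to glutamine, the amino acid sequence of the 7β-HSDH enzyme mutant is as shown in SEQ ID NO:16 and the nucleotide sequence of its encoding gene is as shown in SEQ ID NO:15.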
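At the DNA level each single mutant differs from the wild-type gene by one codon. The Python sketch below illustrates this on nucleotides 661-720 of SEQ ID NO:1 (the row of the sequence listing that covers residues 221-240), substituting the codons that appear in SEQ ID NO:5 (N237H, `cat`) and SEQ ID NO:11 (S240R, `cga`). The function name and the choice to work on a 60-nucleotide segment instead of the full gene are assumptions made to keep the example short.

```python
# Nucleotides 661-720 of SEQ ID NO:1, copied from the sequence listing row.
WT_SEGMENT = "".join(
    "gcgtttgaaa aactgggcaa agaactgtca gttatcgccg gtcaacgtaa taaagatagc".split()
)
SEGMENT_START = 661  # 1-based position of the first nucleotide in WT_SEGMENT


def substitute_codon(segment, residue_number, new_codon, segment_start=SEGMENT_START):
    """Replace the codon for `residue_number` (1-based protein numbering)."""
    if len(new_codon) != 3:
        raise ValueError("a codon has exactly three nucleotides")
    # 0-based offset of the codon inside the segment.
    offset = (residue_number - 1) * 3 - (segment_start - 1)
    if offset < 0 or offset + 3 > len(segment):
        raise ValueError("residue lies outside this segment")
    return segment[:offset] + new_codon.lower() + segment[offset + 3:]


if __name__ == "__main__":
    print("wild type:", WT_SEGMENT)
    print("N237H    :", substitute_codon(WT_SEGMENT, 237, "cat"))
    print("S240R    :", substitute_codon(WT_SEGMENT, 240, "cga"))
```

Comparing the printed segments with the corresponding rows of SEQ ID NO:5 and SEQ ID NO:11 gives a quick consistency check between the claimed substitution and the listed gene sequence.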
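Example 4: Pairwise combination of mutation sites
The mutations identified above as significantly increasing activity were combined pairwise. The activity screening method and the definition of enzyme activity were the same as above. The results are shown in Table 2 below.
Table 2. Enzyme activity of the wild type and different mutants
Amino acid sequence | Wild type / mutant | Specific activity (U/mg) | Activity relative to wild type (fold) |
SEQ ID NO:2 | Wild-type 7β-HSDH | 0.35 | --- |
SEQ ID NO:18 | N237H/S240N | 2.61 | 7.46 |
SEQ ID NO:20 | N237H/S240R | 5.07 | 14.48 |
SEQ ID NO:22 | N237H/S240E | 1.51 | 4.31 |
SEQ ID NO:24 | N237H/S240Q | 2.70 | 7.71 |
SEQ ID NO:26 | N237Q/S240N | 1.58 | 4.51 |
SEQ ID NO:28 | N237Q/S240R | 2.31 | 6.60 |
SEQ ID NO:30 | N237Q/S240E | 1.13 | 3.23 |
SEQ ID NO:32 | N237Q/S240Q | 1.68 | 4.80 |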
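The statement that pairwise combined mutants outperform the single mutants can be checked directly from the specific activities reported in Tables 1 and 2. The Python sketch below compares each double mutant with both of its parent single mutants. The variable and function names are illustrative assumptions; the activity values are copied from the two tables.

```python
# Specific activities in U/mg, copied from Table 1 (single mutants).
SINGLE = {
    "N237H": 0.68, "N237Q": 0.54,
    "S240N": 1.00, "S240R": 1.11, "S240E": 0.54, "S240Q": 0.95,
}

# Specific activities in U/mg, copied from Table 2 (pairwise mutants).
DOUBLE = {
    "N237H/S240N": 2.61, "N237H/S240R": 5.07,
    "N237H/S240E": 1.51, "N237H/S240Q": 2.70,
    "N237Q/S240N": 1.58, "N237Q/S240R": 2.31,
    "N237Q/S240E": 1.13, "N237Q/S240Q": 1.68,
}


def doubles_beating_both_parents(single, double):
    """Return the double mutants more active than each of their single parents."""
    return [
        name
        for name, activity in double.items()
        if all(activity > single[parent] for parent in name.split("/"))
    ]


if __name__ == "__main__":
    better = doubles_beating_both_parents(SINGLE, DOUBLE)
    print(f"{len(better)} of {len(DOUBLE)} double mutants beat both parents")
    print("most active:", max(DOUBLE, key=DOUBLE.get))
```

On the tabulated values, all eight double mutants exceed both parent single mutants, and N237H/S240R is the most active.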
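Where position 237 of the amino acid sequence shown in SEQ ID NO:2 is mutated from asparagine to histidine and position 240 from serine to asparagine, the amino acid sequence of the 7β-HSDH enzyme mutant is as shown in SEQ ID NO:18 and the nucleotide sequence of its encoding gene is as shown in SEQ ID NO:17.
Where position 237 of the amino acid sequence shown in SEQ ID NO:2 is mutated from asparagine to histidine and position 240 from serine to arginine, the amino acid sequence of the 7β-HSDH enzyme mutant is as shown in SEQ ID NO:20 and the nucleotide sequence of its encoding gene is as shown in SEQ ID NO:19.
Where position 237 of the amino acid sequence shown in SEQ ID NO:2 is mutated from asparagine to histidine and position 240 from serine to glutamic acid, the amino acid sequence of the 7β-HSDH enzyme mutant is as shown in SEQ ID NO:22 and the nucleotide sequence of its encoding gene is as shown in SEQ ID NO:21.
Where position 237 of the amino acid sequence shown in SEQ ID NO:2 is mutated from asparagine to histidine and position 240 from serine to glutamine, the amino acid sequence of the 7β-HSDH enzyme mutant is as shown in SEQ ID NO:24 and the nucleotide sequence of its encoding gene is as shown in SEQ ID NO:23.
Where position 237 of the amino acid sequence shown in SEQ ID NO:2 is mutated from asparagine to glutamine and position 240 from serine to asparagine, the amino acid sequence of the 7β-HSDH enzyme mutant is as shown in SEQ ID NO:26 and the nucleotide sequence of its encoding gene is as shown in SEQ ID NO:25.
Where position 237 of the amino acid sequence shown in SEQ ID NO:2 is mutated from asparagine to glutamine and position 240 from serine to arginine, the amino acid sequence of the 7β-HSDH enzyme mutant is as shown in SEQ ID NO:28 and the nucleotide sequence of its encoding gene is as shown in SEQ ID NO:27.
Where position 237 of the amino acid sequence shown in SEQ ID NO:2 is mutated from asparagine to glutamine and position 240 from serine to glutamic acid, the amino acid sequence of the 7β-HSDH enzyme mutant is as shown in SEQ ID NO:30 and the nucleotide sequence of its encoding gene is as shown in SEQ ID NO:29.
Where position 237 of the amino acid sequence shown in SEQ ID NO:2 is mutated from asparagine to glutamine and position 240 from serine to glutamine, the amino acid sequence of the 7β-HSDH enzyme mutant is as shown in SEQ ID NO:32 and the nucleotide sequence of its encoding gene is as shown in SEQ ID NO:31.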
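Example 5: Biocatalysis with the mutants
1.6 g of the substrate 24-nor-7-ketolithocholic acid was dissolved in 32 mL of n-butyl acetate. Once the substrate had fully dissolved, 45 mL of 50 mM sodium phosphate buffer (pH 7.0), 1.15 g of glucose, 0.2 mM NADP disodium salt, 1 g/L enzyme mutant and 0.5 g/L GDH (glucose dehydrogenase) powder were added in that order, and the mixture was stirred at a constant 30 °C with a mechanical stirrer. During the reaction the pH was held at 7.0 with 2 M sodium hydroxide solution. After 24 h the reaction was analyzed by HPLC; the substrate conversion and product formation rates for the different mutants are given in Table 3 below.
Table 3. Substrate conversion and product formation rates of the wild type and different mutants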
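In the coupled system each ketone reduced consumes one NADPH, and glucose dehydrogenase regenerates it by oxidizing one glucose, so glucose is needed in at least a 1:1 molar ratio to substrate. The Python sketch below estimates the ratio used in Example 5 (1.6 g substrate, 1.15 g glucose). It assumes the substrate is C23H36O4, that is, 7-ketolithocholic acid (C24H38O4) less one CH2; that formula, the rounded atomic masses and the function names are assumptions of this example and are not stated in the patent.

```python
# Standard atomic masses, rounded to three decimals.
ATOMIC_MASS = {"C": 12.011, "H": 1.008, "O": 15.999}

# Assumed formula of 24-nor-7-ketolithocholic acid: C23H36O4.
SUBSTRATE = {"C": 23, "H": 36, "O": 4}
GLUCOSE = {"C": 6, "H": 12, "O": 6}


def molar_mass(formula):
    """Molar mass in g/mol from an {element: count} mapping."""
    return sum(ATOMIC_MASS[element] * count for element, count in formula.items())


def glucose_equivalents(substrate_g, glucose_g):
    """Moles of glucose supplied per mole of substrate."""
    substrate_mol = substrate_g / molar_mass(SUBSTRATE)
    glucose_mol = glucose_g / molar_mass(GLUCOSE)
    return glucose_mol / substrate_mol


if __name__ == "__main__":
    print(f"substrate molar mass: {molar_mass(SUBSTRATE):.2f} g/mol")
    print(f"glucose equivalents : {glucose_equivalents(1.6, 1.15):.2f}")
```

Under these assumptions the example works out to roughly 1.5 equivalents of glucose, a modest excess over the stoichiometric requirement.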
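Finally, it should be noted that the above content serves only to illustrate the technical solutions of the present invention and does not limit its scope of protection. Simple modifications or equivalent substitutions made to these technical solutions by a person of ordinary skill in the art do not depart from their substance and scope.
Sequence Listing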
<110> Zhongshan Bailing Biotechnology Co., Ltd. (中山百灵生物技术股份有限公司)
<120> A 7β-HSDH enzyme mutant, its encoding gene, and uses thereof
<160> 46
<170> SIPOSequenceListing 1.0
<210> 1
<211> 792
<212> DNA
<213> Collinsella aerofaciens
<400> 1
atgaatctgc gtgaaaaata cggtgaatgg ggtctgatcc tgggtgctac ggaaggtgtc 60
ggtaaagcgt tctgtgaaaa aatcgccgcg ggcggtatga acgtggttat ggtcggccgt 120
cgcgaagaaa aactgaatgt gctggcaggc gaaattcgtg aaacctatgg tgttgaaacg 180
aaagtcgtgc gtgctgattt ttcccagccg ggtgcagcag aaaccgtctt cgcagctacg 240
gaaggcctgg acatgggttt tatgtcttac gtggcctgcc tgcatagttt cggtaaaatt 300
caagataccc cgtgggaaaa acacgaagca atgatcaacg tgaatgttgt cacgtttctg 360
aaatgtttcc atcactatat gcgtatcttt gcggcccagg atcgcggtgc cgtgattaac 420
gttagctcta tgaccggcat cagttcctca ccgtggaatg gtcaatacgg cgcgggtaaa 480
gccttcattc tgaaaatgac cgaagcagtt gcttgcgaat gtgaaggcac gggtgtggac 540
gttgaagtca tcaccctggg taccacgctg acgccgtcgc tgctgagcaa cctgccgggc 600
ggtccgcagg gtgaagcagt gatgaaaatt gctctgaccc cggaagaatg cgttgatgaa 660
gcgtttgaaa aactgggcaa agaactgtca gttatcgccg gtcaacgtaa taaagatagc 720
gtccatgact ggaaagcaaa ccacaccgaa gacgaataca tccgctacat gggctcattt 780
taccgtgact aa 792
<210> 2
<211> 263
<212> PRT
<213> Collinsella aerofaciens
<400> 2
Met Asn Leu Arg Glu Lys Tyr Gly Glu Trp Gly Leu Ile Leu Gly Ala
1 5 10 15
Thr Glu Gly Val Gly Lys Ala Phe Cys Glu Lys Ile Ala Ala Gly Gly
20 25 30
Met Asn Val Val Met Val Gly Arg Arg Glu Glu Lys Leu Asn Val Leu
35 40 45
Ala Gly Glu Ile Arg Glu Thr Tyr Gly Val Glu Thr Lys Val Val Arg
50 55 60
Ala Asp Phe Ser Gln Pro Gly Ala Ala Glu Thr Val Phe Ala Ala Thr
65 70 75 80
Glu Gly Leu Asp Met Gly Phe Met Ser Tyr Val Ala Cys Leu His Ser
85 90 95
Phe Gly Lys Ile Gln Asp Thr Pro Trp Glu Lys His Glu Ala Met Ile
100 105 110
Asn Val Asn Val Val Thr Phe Leu Lys Cys Phe His His Tyr Met Arg
115 120 125
Ile Phe Ala Ala Gln Asp Arg Gly Ala Val Ile Asn Val Ser Ser Met
130 135 140
Thr Gly Ile Ser Ser Ser Pro Trp Asn Gly Gln Tyr Gly Ala Gly Lys
145 150 155 160
Ala Phe Ile Leu Lys Met Thr Glu Ala Val Ala Cys Glu Cys Glu Gly
165 170 175
Thr Gly Val Asp Val Glu Val Ile Thr Leu Gly Thr Thr Leu Thr Pro
180 185 190
Ser Leu Leu Ser Asn Leu Pro Gly Gly Pro Gln Gly Glu Ala Val Met
195 200 205
Lys Ile Ala Leu Thr Pro Glu Glu Cys Val Asp Glu Ala Phe Glu Lys
210 215 220
Leu Gly Lys Glu Leu Ser Val Ile Ala Gly Gln Arg Asn Lys Asp Ser
225 230 235 240
Val His Asp Trp Lys Ala Asn His Thr Glu Asp Glu Tyr Ile Arg Tyr
245 250 255
Met Gly Ser Phe Tyr Arg Asp
260
<210> 3
<211> 792
<212> DNA
<213> Collinsella aerofaciens
<400> 3
atgaatctgc gtgaaaaata cggtgaatgg ggtctgatcc tgggtgctac ggaaggtgtc 60
ggtaaagcgt tctgtgaaaa aatcgccgcg ggcggtatga acgtggttat ggtcggccgt 120
cgcgaagaaa aactgaatgt gctggcaggc gaaattcgtg aaacctatgg tgttgaaacg 180
aaagtcgtgc gtgctgattt ttcccagccg ggtgcagcag aaaccgtctt cgcagctacg 240
gaaggcctgg acatgggttt tatgtcttac gtggcctgcc tgcatagttt cggtaaaatt 300
caagataccc cgtgggaaaa acacgaagca atgatcaacg tgaatgttgt cacgtttctg 360
aaatgtttcc atcactatat gcgtatcttt gcggcccagg atcgcggtgc cgtgattaac 420
gttagctcta tgaccggcat cagttcctca ccgtggaatg gtcaatacgg cgcgggtaaa 480
gccttcattc tgaaaatgac cgaagcagtt gcttgcgaat gtgaaggcac gggtgtggac 540
gttgaagtca tcaccctggg taccacgctg acgccgtcgc tgctgagcaa cctgccgggc 600
ggtccgcagg gtgaagcagt gatgaaaatt gctctgaccc cggaagaatg cgttgatgaa 660
gcgtttgaaa aactgggcaa agaactgtca gttatcgccg gtcaacgtaa aaaagatagc 720
gtccatgact ggaaagcaaa ccacaccgaa gacgaataca tccgctacat gggctcattt 780
taccgtgact aa 792
<210> 4
<211> 263
<212> PRT
<213> Collinsella aerofaciens
<400> 4
Met Asn Leu Arg Glu Lys Tyr Gly Glu Trp Gly Leu Ile Leu Gly Ala
1 5 10 15
Thr Glu Gly Val Gly Lys Ala Phe Cys Glu Lys Ile Ala Ala Gly Gly
20 25 30
Met Asn Val Val Met Val Gly Arg Arg Glu Glu Lys Leu Asn Val Leu
35 40 45
Ala Gly Glu Ile Arg Glu Thr Tyr Gly Val Glu Thr Lys Val Val Arg
50 55 60
Ala Asp Phe Ser Gln Pro Gly Ala Ala Glu Thr Val Phe Ala Ala Thr
65 70 75 80
Glu Gly Leu Asp Met Gly Phe Met Ser Tyr Val Ala Cys Leu His Ser
85 90 95
Phe Gly Lys Ile Gln Asp Thr Pro Trp Glu Lys His Glu Ala Met Ile
100 105 110
Asn Val Asn Val Val Thr Phe Leu Lys Cys Phe His His Tyr Met Arg
115 120 125
Ile Phe Ala Ala Gln Asp Arg Gly Ala Val Ile Asn Val Ser Ser Met
130 135 140
Thr Gly Ile Ser Ser Ser Pro Trp Asn Gly Gln Tyr Gly Ala Gly Lys
145 150 155 160
Ala Phe Ile Leu Lys Met Thr Glu Ala Val Ala Cys Glu Cys Glu Gly
165 170 175
Thr Gly Val Asp Val Glu Val Ile Thr Leu Gly Thr Thr Leu Thr Pro
180 185 190
Ser Leu Leu Ser Asn Leu Pro Gly Gly Pro Gln Gly Glu Ala Val Met
195 200 205
Lys Ile Ala Leu Thr Pro Glu Glu Cys Val Asp Glu Ala Phe Glu Lys
210 215 220
Leu Gly Lys Glu Leu Ser Val Ile Ala Gly Gln Arg Lys Lys Asp Ser
225 230 235 240
Val His Asp Trp Lys Ala Asn His Thr Glu Asp Glu Tyr Ile Arg Tyr
245 250 255
Met Gly Ser Phe Tyr Arg Asp
260
<210> 5
<211> 792
<212> DNA
<213> Collinsella aerofaciens
<400> 5
atgaatctgc gtgaaaaata cggtgaatgg ggtctgatcc tgggtgctac ggaaggtgtc 60
ggtaaagcgt tctgtgaaaa aatcgccgcg ggcggtatga acgtggttat ggtcggccgt 120
cgcgaagaaa aactgaatgt gctggcaggc gaaattcgtg aaacctatgg tgttgaaacg 180
aaagtcgtgc gtgctgattt ttcccagccg ggtgcagcag aaaccgtctt cgcagctacg 240
gaaggcctgg acatgggttt tatgtcttac gtggcctgcc tgcatagttt cggtaaaatt 300
caagataccc cgtgggaaaa acacgaagca atgatcaacg tgaatgttgt cacgtttctg 360
aaatgtttcc atcactatat gcgtatcttt gcggcccagg atcgcggtgc cgtgattaac 420
gttagctcta tgaccggcat cagttcctca ccgtggaatg gtcaatacgg cgcgggtaaa 480
gccttcattc tgaaaatgac cgaagcagtt gcttgcgaat gtgaaggcac gggtgtggac 540
gttgaagtca tcaccctggg taccacgctg acgccgtcgc tgctgagcaa cctgccgggc 600
ggtccgcagg gtgaagcagt gatgaaaatt gctctgaccc cggaagaatg cgttgatgaa 660
gcgtttgaaa aactgggcaa agaactgtca gttatcgccg gtcaacgtca taaagatagc 720
gtccatgact ggaaagcaaa ccacaccgaa gacgaataca tccgctacat gggctcattt 780
taccgtgact aa 792
<210> 6
<211> 263
<212> PRT
<213> Collinsella aerofaciens
<400> 6
Met Asn Leu Arg Glu Lys Tyr Gly Glu Trp Gly Leu Ile Leu Gly Ala
1 5 10 15
Thr Glu Gly Val Gly Lys Ala Phe Cys Glu Lys Ile Ala Ala Gly Gly
20 25 30
Met Asn Val Val Met Val Gly Arg Arg Glu Glu Lys Leu Asn Val Leu
35 40 45
Ala Gly Glu Ile Arg Glu Thr Tyr Gly Val Glu Thr Lys Val Val Arg
50 55 60
Ala Asp Phe Ser Gln Pro Gly Ala Ala Glu Thr Val Phe Ala Ala Thr
65 70 75 80
Glu Gly Leu Asp Met Gly Phe Met Ser Tyr Val Ala Cys Leu His Ser
85 90 95
Phe Gly Lys Ile Gln Asp Thr Pro Trp Glu Lys His Glu Ala Met Ile
100 105 110
Asn Val Asn Val Val Thr Phe Leu Lys Cys Phe His His Tyr Met Arg
115 120 125
Ile Phe Ala Ala Gln Asp Arg Gly Ala Val Ile Asn Val Ser Ser Met
130 135 140
Thr Gly Ile Ser Ser Ser Pro Trp Asn Gly Gln Tyr Gly Ala Gly Lys
145 150 155 160
Ala Phe Ile Leu Lys Met Thr Glu Ala Val Ala Cys Glu Cys Glu Gly
165 170 175
Thr Gly Val Asp Val Glu Val Ile Thr Leu Gly Thr Thr Leu Thr Pro
180 185 190
Ser Leu Leu Ser Asn Leu Pro Gly Gly Pro Gln Gly Glu Ala Val Met
195 200 205
Lys Ile Ala Leu Thr Pro Glu Glu Cys Val Asp Glu Ala Phe Glu Lys
210 215 220
Leu Gly Lys Glu Leu Ser Val Ile Ala Gly Gln Arg His Lys Asp Ser
225 230 235 240
Val His Asp Trp Lys Ala Asn His Thr Glu Asp Glu Tyr Ile Arg Tyr
245 250 255
Met Gly Ser Phe Tyr Arg Asp
260
<210> 7
<211> 792
<212> DNA
<213> Collinsella aerofaciens
<400> 7
atgaatctgc gtgaaaaata cggtgaatgg ggtctgatcc tgggtgctac ggaaggtgtc 60
ggtaaagcgt tctgtgaaaa aatcgccgcg ggcggtatga acgtggttat ggtcggccgt 120
cgcgaagaaa aactgaatgt gctggcaggc gaaattcgtg aaacctatgg tgttgaaacg 180
aaagtcgtgc gtgctgattt ttcccagccg ggtgcagcag aaaccgtctt cgcagctacg 240
gaaggcctgg acatgggttt tatgtcttac gtggcctgcc tgcatagttt cggtaaaatt 300
caagataccc cgtgggaaaa acacgaagca atgatcaacg tgaatgttgt cacgtttctg 360
aaatgtttcc atcactatat gcgtatcttt gcggcccagg atcgcggtgc cgtgattaac 420
gttagctcta tgaccggcat cagttcctca ccgtggaatg gtcaatacgg cgcgggtaaa 480
gccttcattc tgaaaatgac cgaagcagtt gcttgcgaat gtgaaggcac gggtgtggac 540
gttgaagtca tcaccctggg taccacgctg acgccgtcgc tgctgagcaa cctgccgggc 600
ggtccgcagg gtgaagcagt gatgaaaatt gctctgaccc cggaagaatg cgttgatgaa 660
gcgtttgaaa aactgggcaa agaactgtca gttatcgccg gtcaacgtca gaaagatagc 720
gtccatgact ggaaagcaaa ccacaccgaa gacgaataca tccgctacat gggctcattt 780
taccgtgact aa 792
<210> 8
<211> 263
<212> PRT
<213> Collinsella aerofaciens
<400> 8
Met Asn Leu Arg Glu Lys Tyr Gly Glu Trp Gly Leu Ile Leu Gly Ala
1 5 10 15
Thr Glu Gly Val Gly Lys Ala Phe Cys Glu Lys Ile Ala Ala Gly Gly
20 25 30
Met Asn Val Val Met Val Gly Arg Arg Glu Glu Lys Leu Asn Val Leu
35 40 45
Ala Gly Glu Ile Arg Glu Thr Tyr Gly Val Glu Thr Lys Val Val Arg
50 55 60
Ala Asp Phe Ser Gln Pro Gly Ala Ala Glu Thr Val Phe Ala Ala Thr
65 70 75 80
Glu Gly Leu Asp Met Gly Phe Met Ser Tyr Val Ala Cys Leu His Ser
85 90 95
Phe Gly Lys Ile Gln Asp Thr Pro Trp Glu Lys His Glu Ala Met Ile
100 105 110
Asn Val Asn Val Val Thr Phe Leu Lys Cys Phe His His Tyr Met Arg
115 120 125
Ile Phe Ala Ala Gln Asp Arg Gly Ala Val Ile Asn Val Ser Ser Met
130 135 140
Thr Gly Ile Ser Ser Ser Pro Trp Asn Gly Gln Tyr Gly Ala Gly Lys
145 150 155 160
Ala Phe Ile Leu Lys Met Thr Glu Ala Val Ala Cys Glu Cys Glu Gly
165 170 175
Thr Gly Val Asp Val Glu Val Ile Thr Leu Gly Thr Thr Leu Thr Pro
180 185 190
Ser Leu Leu Ser Asn Leu Pro Gly Gly Pro Gln Gly Glu Ala Val Met
195 200 205
Lys Ile Ala Leu Thr Pro Glu Glu Cys Val Asp Glu Ala Phe Glu Lys
210 215 220
Leu Gly Lys Glu Leu Ser Val Ile Ala Gly Gln Arg Gln Lys Asp Ser
225 230 235 240
Val His Asp Trp Lys Ala Asn His Thr Glu Asp Glu Tyr Ile Arg Tyr
245 250 255
Met Gly Ser Phe Tyr Arg Asp
260
<210> 9
<211> 792
<212> DNA
<213> Collinsella aerofaciens
<400> 9
atgaatctgc gtgaaaaata cggtgaatgg ggtctgatcc tgggtgctac ggaaggtgtc 60
ggtaaagcgt tctgtgaaaa aatcgccgcg ggcggtatga acgtggttat ggtcggccgt 120
cgcgaagaaa aactgaatgt gctggcaggc gaaattcgtg aaacctatgg tgttgaaacg 180
aaagtcgtgc gtgctgattt ttcccagccg ggtgcagcag aaaccgtctt cgcagctacg 240
gaaggcctgg acatgggttt tatgtcttac gtggcctgcc tgcatagttt cggtaaaatt 300
caagataccc cgtgggaaaa acacgaagca atgatcaacg tgaatgttgt cacgtttctg 360
aaatgtttcc atcactatat gcgtatcttt gcggcccagg atcgcggtgc cgtgattaac 420
gttagctcta tgaccggcat cagttcctca ccgtggaatg gtcaatacgg cgcgggtaaa 480
gccttcattc tgaaaatgac cgaagcagtt gcttgcgaat gtgaaggcac gggtgtggac 540
gttgaagtca tcaccctggg taccacgctg acgccgtcgc tgctgagcaa cctgccgggc 600
ggtccgcagg gtgaagcagt gatgaaaatt gctctgaccc cggaagaatg cgttgatgaa 660
gcgtttgaaa aactgggcaa agaactgtca gttatcgccg gtcaacgtaa taaagataat 720
gtccatgact ggaaagcaaa ccacaccgaa gacgaataca tccgctacat gggctcattt 780
taccgtgact aa 792
<210> 10
<211> 263
<212> PRT
<213> Collinsella aerofaciens
<400> 10
Met Asn Leu Arg Glu Lys Tyr Gly Glu Trp Gly Leu Ile Leu Gly Ala
1 5 10 15
Thr Glu Gly Val Gly Lys Ala Phe Cys Glu Lys Ile Ala Ala Gly Gly
20 25 30
Met Asn Val Val Met Val Gly Arg Arg Glu Glu Lys Leu Asn Val Leu
35 40 45
Ala Gly Glu Ile Arg Glu Thr Tyr Gly Val Glu Thr Lys Val Val Arg
50 55 60
Ala Asp Phe Ser Gln Pro Gly Ala Ala Glu Thr Val Phe Ala Ala Thr
65 70 75 80
Glu Gly Leu Asp Met Gly Phe Met Ser Tyr Val Ala Cys Leu His Ser
85 90 95
Phe Gly Lys Ile Gln Asp Thr Pro Trp Glu Lys His Glu Ala Met Ile
100 105 110
Asn Val Asn Val Val Thr Phe Leu Lys Cys Phe His His Tyr Met Arg
115 120 125
Ile Phe Ala Ala Gln Asp Arg Gly Ala Val Ile Asn Val Ser Ser Met
130 135 140
Thr Gly Ile Ser Ser Ser Pro Trp Asn Gly Gln Tyr Gly Ala Gly Lys
145 150 155 160
Ala Phe Ile Leu Lys Met Thr Glu Ala Val Ala Cys Glu Cys Glu Gly
165 170 175
Thr Gly Val Asp Val Glu Val Ile Thr Leu Gly Thr Thr Leu Thr Pro
180 185 190
Ser Leu Leu Ser Asn Leu Pro Gly Gly Pro Gln Gly Glu Ala Val Met
195 200 205
Lys Ile Ala Leu Thr Pro Glu Glu Cys Val Asp Glu Ala Phe Glu Lys
210 215 220
Leu Gly Lys Glu Leu Ser Val Ile Ala Gly Gln Arg Asn Lys Asp Asn
225 230 235 240
Val His Asp Trp Lys Ala Asn His Thr Glu Asp Glu Tyr Ile Arg Tyr
245 250 255
Met Gly Ser Phe Tyr Arg Asp
260
<210> 11
<211> 792
<212> DNA
<213> Collinsella aerofaciens
<400> 11
atgaatctgc gtgaaaaata cggtgaatgg ggtctgatcc tgggtgctac ggaaggtgtc 60
ggtaaagcgt tctgtgaaaa aatcgccgcg ggcggtatga acgtggttat ggtcggccgt 120
cgcgaagaaa aactgaatgt gctggcaggc gaaattcgtg aaacctatgg tgttgaaacg 180
aaagtcgtgc gtgctgattt ttcccagccg ggtgcagcag aaaccgtctt cgcagctacg 240
gaaggcctgg acatgggttt tatgtcttac gtggcctgcc tgcatagttt cggtaaaatt 300
caagataccc cgtgggaaaa acacgaagca atgatcaacg tgaatgttgt cacgtttctg 360
aaatgtttcc atcactatat gcgtatcttt gcggcccagg atcgcggtgc cgtgattaac 420
gttagctcta tgaccggcat cagttcctca ccgtggaatg gtcaatacgg cgcgggtaaa 480
gccttcattc tgaaaatgac cgaagcagtt gcttgcgaat gtgaaggcac gggtgtggac 540
gttgaagtca tcaccctggg taccacgctg acgccgtcgc tgctgagcaa cctgccgggc 600
ggtccgcagg gtgaagcagt gatgaaaatt gctctgaccc cggaagaatg cgttgatgaa 660
gcgtttgaaa aactgggcaa agaactgtca gttatcgccg gtcaacgtaa taaagatcga 720
gtccatgact ggaaagcaaa ccacaccgaa gacgaataca tccgctacat gggctcattt 780
taccgtgact aa 792
<210> 12
<211> 263
<212> PRT
<213> Collinsella aerofaciens
<400> 12
Met Asn Leu Arg Glu Lys Tyr Gly Glu Trp Gly Leu Ile Leu Gly Ala
1 5 10 15
Thr Glu Gly Val Gly Lys Ala Phe Cys Glu Lys Ile Ala Ala Gly Gly
20 25 30
Met Asn Val Val Met Val Gly Arg Arg Glu Glu Lys Leu Asn Val Leu
35 40 45
Ala Gly Glu Ile Arg Glu Thr Tyr Gly Val Glu Thr Lys Val Val Arg
50 55 60
Ala Asp Phe Ser Gln Pro Gly Ala Ala Glu Thr Val Phe Ala Ala Thr
65 70 75 80
Glu Gly Leu Asp Met Gly Phe Met Ser Tyr Val Ala Cys Leu His Ser
85 90 95
Phe Gly Lys Ile Gln Asp Thr Pro Trp Glu Lys His Glu Ala Met Ile
100 105 110
Asn Val Asn Val Val Thr Phe Leu Lys Cys Phe His His Tyr Met Arg
115 120 125
Ile Phe Ala Ala Gln Asp Arg Gly Ala Val Ile Asn Val Ser Ser Met
130 135 140
Thr Gly Ile Ser Ser Ser Pro Trp Asn Gly Gln Tyr Gly Ala Gly Lys
145 150 155 160
Ala Phe Ile Leu Lys Met Thr Glu Ala Val Ala Cys Glu Cys Glu Gly
165 170 175
Thr Gly Val Asp Val Glu Val Ile Thr Leu Gly Thr Thr Leu Thr Pro
180 185 190
Ser Leu Leu Ser Asn Leu Pro Gly Gly Pro Gln Gly Glu Ala Val Met
195 200 205
Lys Ile Ala Leu Thr Pro Glu Glu Cys Val Asp Glu Ala Phe Glu Lys
210 215 220
Leu Gly Lys Glu Leu Ser Val Ile Ala Gly Gln Arg Asn Lys Asp Arg
225 230 235 240
Val His Asp Trp Lys Ala Asn His Thr Glu Asp Glu Tyr Ile Arg Tyr
245 250 255
Met Gly Ser Phe Tyr Arg Asp
260
<210> 13
<211> 792
<212> DNA
<213> Collinsella aerofaciens
<400> 13
atgaatctgc gtgaaaaata cggtgaatgg ggtctgatcc tgggtgctac ggaaggtgtc 60
ggtaaagcgt tctgtgaaaa aatcgccgcg ggcggtatga acgtggttat ggtcggccgt 120
cgcgaagaaa aactgaatgt gctggcaggc gaaattcgtg aaacctatgg tgttgaaacg 180
aaagtcgtgc gtgctgattt ttcccagccg ggtgcagcag aaaccgtctt cgcagctacg 240
gaaggcctgg acatgggttt tatgtcttac gtggcctgcc tgcatagttt cggtaaaatt 300
caagataccc cgtgggaaaa acacgaagca atgatcaacg tgaatgttgt cacgtttctg 360
aaatgtttcc atcactatat gcgtatcttt gcggcccagg atcgcggtgc cgtgattaac 420
gttagctcta tgaccggcat cagttcctca ccgtggaatg gtcaatacgg cgcgggtaaa 480
gccttcattc tgaaaatgac cgaagcagtt gcttgcgaat gtgaaggcac gggtgtggac 540
gttgaagtca tcaccctggg taccacgctg acgccgtcgc tgctgagcaa cctgccgggc 600
ggtccgcagg gtgaagcagt gatgaaaatt gctctgaccc cggaagaatg cgttgatgaa 660
gcgtttgaaa aactgggcaa agaactgtca gttatcgccg gtcaacgtaa taaagatgaa 720
gtccatgact ggaaagcaaa ccacaccgaa gacgaataca tccgctacat gggctcattt 780
taccgtgact aa 792
<210> 14
<211> 263
<212> PRT
<213> Collinsella aerofaciens
<400> 14
Met Asn Leu Arg Glu Lys Tyr Gly Glu Trp Gly Leu Ile Leu Gly Ala
1 5 10 15
Thr Glu Gly Val Gly Lys Ala Phe Cys Glu Lys Ile Ala Ala Gly Gly
20 25 30
Met Asn Val Val Met Val Gly Arg Arg Glu Glu Lys Leu Asn Val Leu
35 40 45
Ala Gly Glu Ile Arg Glu Thr Tyr Gly Val Glu Thr Lys Val Val Arg
50 55 60
Ala Asp Phe Ser Gln Pro Gly Ala Ala Glu Thr Val Phe Ala Ala Thr
65 70 75 80
Glu Gly Leu Asp Met Gly Phe Met Ser Tyr Val Ala Cys Leu His Ser
85 90 95
Phe Gly Lys Ile Gln Asp Thr Pro Trp Glu Lys His Glu Ala Met Ile
100 105 110
Asn Val Asn Val Val Thr Phe Leu Lys Cys Phe His His Tyr Met Arg
115 120 125
Ile Phe Ala Ala Gln Asp Arg Gly Ala Val Ile Asn Val Ser Ser Met
130 135 140
Thr Gly Ile Ser Ser Ser Pro Trp Asn Gly Gln Tyr Gly Ala Gly Lys
145 150 155 160
Ala Phe Ile Leu Lys Met Thr Glu Ala Val Ala Cys Glu Cys Glu Gly
165 170 175
Thr Gly Val Asp Val Glu Val Ile Thr Leu Gly Thr Thr Leu Thr Pro
180 185 190
Ser Leu Leu Ser Asn Leu Pro Gly Gly Pro Gln Gly Glu Ala Val Met
195 200 205
Lys Ile Ala Leu Thr Pro Glu Glu Cys Val Asp Glu Ala Phe Glu Lys
210 215 220
Leu Gly Lys Glu Leu Ser Val Ile Ala Gly Gln Arg Asn Lys Asp Glu
225 230 235 240
Val His Asp Trp Lys Ala Asn His Thr Glu Asp Glu Tyr Ile Arg Tyr
245 250 255
Met Gly Ser Phe Tyr Arg Asp
260
<210> 15
<211> 792
<212> DNA
<213> Collinsella aerofaciens
<400> 15
atgaatctgc gtgaaaaata cggtgaatgg ggtctgatcc tgggtgctac ggaaggtgtc 60
ggtaaagcgt tctgtgaaaa aatcgccgcg ggcggtatga acgtggttat ggtcggccgt 120
cgcgaagaaa aactgaatgt gctggcaggc gaaattcgtg aaacctatgg tgttgaaacg 180
aaagtcgtgc gtgctgattt ttcccagccg ggtgcagcag aaaccgtctt cgcagctacg 240
gaaggcctgg acatgggttt tatgtcttac gtggcctgcc tgcatagttt cggtaaaatt 300
caagataccc cgtgggaaaa acacgaagca atgatcaacg tgaatgttgt cacgtttctg 360
aaatgtttcc atcactatat gcgtatcttt gcggcccagg atcgcggtgc cgtgattaac 420
gttagctcta tgaccggcat cagttcctca ccgtggaatg gtcaatacgg cgcgggtaaa 480
gccttcattc tgaaaatgac cgaagcagtt gcttgcgaat gtgaaggcac gggtgtggac 540
gttgaagtca tcaccctggg taccacgctg acgccgtcgc tgctgagcaa cctgccgggc 600
ggtccgcagg gtgaagcagt gatgaaaatt gctctgaccc cggaagaatg cgttgatgaa 660
gcgtttgaaa aactgggcaa agaactgtca gttatcgccg gtcaacgtaa taaagatcag 720
gtccatgact ggaaagcaaa ccacaccgaa gacgaataca tccgctacat gggctcattt 780
taccgtgact aa 792
<210> 16
<211> 263
<212> PRT
<213> Collinsella aerofaciens
<400> 16
Met Asn Leu Arg Glu Lys Tyr Gly Glu Trp Gly Leu Ile Leu Gly Ala
1 5 10 15
Thr Glu Gly Val Gly Lys Ala Phe Cys Glu Lys Ile Ala Ala Gly Gly
20 25 30
Met Asn Val Val Met Val Gly Arg Arg Glu Glu Lys Leu Asn Val Leu
35 40 45
Ala Gly Glu Ile Arg Glu Thr Tyr Gly Val Glu Thr Lys Val Val Arg
50 55 60
Ala Asp Phe Ser Gln Pro Gly Ala Ala Glu Thr Val Phe Ala Ala Thr
65 70 75 80
Glu Gly Leu Asp Met Gly Phe Met Ser Tyr Val Ala Cys Leu His Ser
85 90 95
Phe Gly Lys Ile Gln Asp Thr Pro Trp Glu Lys His Glu Ala Met Ile
100 105 110
Asn Val Asn Val Val Thr Phe Leu Lys Cys Phe His His Tyr Met Arg
115 120 125
Ile Phe Ala Ala Gln Asp Arg Gly Ala Val Ile Asn Val Ser Ser Met
130 135 140
Thr Gly Ile Ser Ser Ser Pro Trp Asn Gly Gln Tyr Gly Ala Gly Lys
145 150 155 160
Ala Phe Ile Leu Lys Met Thr Glu Ala Val Ala Cys Glu Cys Glu Gly
165 170 175
Thr Gly Val Asp Val Glu Val Ile Thr Leu Gly Thr Thr Leu Thr Pro
180 185 190
Ser Leu Leu Ser Asn Leu Pro Gly Gly Pro Gln Gly Glu Ala Val Met
195 200 205
Lys Ile Ala Leu Thr Pro Glu Glu Cys Val Asp Glu Ala Phe Glu Lys
210 215 220
Leu Gly Lys Glu Leu Ser Val Ile Ala Gly Gln Arg Asn Lys Asp Gln
225 230 235 240
Val His Asp Trp Lys Ala Asn His Thr Glu Asp Glu Tyr Ile Arg Tyr
245 250 255
Met Gly Ser Phe Tyr Arg Asp
260
<210> 17
<211> 792
<212> DNA
<213> Collinsella aerofaciens
<400> 17
atgaatctgc gtgaaaaata cggtgaatgg ggtctgatcc tgggtgctac ggaaggtgtc 60
ggtaaagcgt tctgtgaaaa aatcgccgcg ggcggtatga acgtggttat ggtcggccgt 120
cgcgaagaaa aactgaatgt gctggcaggc gaaattcgtg aaacctatgg tgttgaaacg 180
aaagtcgtgc gtgctgattt ttcccagccg ggtgcagcag aaaccgtctt cgcagctacg 240
gaaggcctgg acatgggttt tatgtcttac gtggcctgcc tgcatagttt cggtaaaatt 300
caagataccc cgtgggaaaa acacgaagca atgatcaacg tgaatgttgt cacgtttctg 360
aaatgtttcc atcactatat gcgtatcttt gcggcccagg atcgcggtgc cgtgattaac 420
gttagctcta tgaccggcat cagttcctca ccgtggaatg gtcaatacgg cgcgggtaaa 480
gccttcattc tgaaaatgac cgaagcagtt gcttgcgaat gtgaaggcac gggtgtggac 540
gttgaagtca tcaccctggg taccacgctg acgccgtcgc tgctgagcaa cctgccgggc 600
ggtccgcagg gtgaagcagt gatgaaaatt gctctgaccc cggaagaatg cgttgatgaa 660
gcgtttgaaa aactgggcaa agaactgtca gttatcgccg gtcaacgtca taaagataat 720
gtccatgact ggaaagcaaa ccacaccgaa gacgaataca tccgctacat gggctcattt 780
taccgtgact aa 792
<210> 18
<211> 263
<212> PRT
<213> Collinsella aerofaciens
<400> 18
Met Asn Leu Arg Glu Lys Tyr Gly Glu Trp Gly Leu Ile Leu Gly Ala
1 5 10 15
Thr Glu Gly Val Gly Lys Ala Phe Cys Glu Lys Ile Ala Ala Gly Gly
20 25 30
Met Asn Val Val Met Val Gly Arg Arg Glu Glu Lys Leu Asn Val Leu
35 40 45
Ala Gly Glu Ile Arg Glu Thr Tyr Gly Val Glu Thr Lys Val Val Arg
50 55 60
Ala Asp Phe Ser Gln Pro Gly Ala Ala Glu Thr Val Phe Ala Ala Thr
65 70 75 80
Glu Gly Leu Asp Met Gly Phe Met Ser Tyr Val Ala Cys Leu His Ser
85 90 95
Phe Gly Lys Ile Gln Asp Thr Pro Trp Glu Lys His Glu Ala Met Ile
100 105 110
Asn Val Asn Val Val Thr Phe Leu Lys Cys Phe His His Tyr Met Arg
115 120 125
Ile Phe Ala Ala Gln Asp Arg Gly Ala Val Ile Asn Val Ser Ser Met
130 135 140
Thr Gly Ile Ser Ser Ser Pro Trp Asn Gly Gln Tyr Gly Ala Gly Lys
145 150 155 160
Ala Phe Ile Leu Lys Met Thr Glu Ala Val Ala Cys Glu Cys Glu Gly
165 170 175
Thr Gly Val Asp Val Glu Val Ile Thr Leu Gly Thr Thr Leu Thr Pro
180 185 190
Ser Leu Leu Ser Asn Leu Pro Gly Gly Pro Gln Gly Glu Ala Val Met
195 200 205
Lys Ile Ala Leu Thr Pro Glu Glu Cys Val Asp Glu Ala Phe Glu Lys
210 215 220
Leu Gly Lys Glu Leu Ser Val Ile Ala Gly Gln Arg His Lys Asp Asn
225 230 235 240
Val His Asp Trp Lys Ala Asn His Thr Glu Asp Glu Tyr Ile Arg Tyr
245 250 255
Met Gly Ser Phe Tyr Arg Asp
260
<210> 19
<211> 792
<212> DNA
<213> Collinsella aerofaciens
<400> 19
atgaatctgc gtgaaaaata cggtgaatgg ggtctgatcc tgggtgctac ggaaggtgtc 60
ggtaaagcgt tctgtgaaaa aatcgccgcg ggcggtatga acgtggttat ggtcggccgt 120
cgcgaagaaa aactgaatgt gctggcaggc gaaattcgtg aaacctatgg tgttgaaacg 180
aaagtcgtgc gtgctgattt ttcccagccg ggtgcagcag aaaccgtctt cgcagctacg 240
gaaggcctgg acatgggttt tatgtcttac gtggcctgcc tgcatagttt cggtaaaatt 300
caagataccc cgtgggaaaa acacgaagca atgatcaacg tgaatgttgt cacgtttctg 360
aaatgtttcc atcactatat gcgtatcttt gcggcccagg atcgcggtgc cgtgattaac 420
gttagctcta tgaccggcat cagttcctca ccgtggaatg gtcaatacgg cgcgggtaaa 480
gccttcattc tgaaaatgac cgaagcagtt gcttgcgaat gtgaaggcac gggtgtggac 540
gttgaagtca tcaccctggg taccacgctg acgccgtcgc tgctgagcaa cctgccgggc 600
ggtccgcagg gtgaagcagt gatgaaaatt gctctgaccc cggaagaatg cgttgatgaa 660
gcgtttgaaa aactgggcaa agaactgtca gttatcgccg gtcaacgtca taaagatcga 720
gtccatgact ggaaagcaaa ccacaccgaa gacgaataca tccgctacat gggctcattt 780
taccgtgact aa 792
<210> 20
<211> 263
<212> PRT
<213> Collinsella aerofaciens
<400> 20
Met Asn Leu Arg Glu Lys Tyr Gly Glu Trp Gly Leu Ile Leu Gly Ala
1 5 10 15
Thr Glu Gly Val Gly Lys Ala Phe Cys Glu Lys Ile Ala Ala Gly Gly
20 25 30
Met Asn Val Val Met Val Gly Arg Arg Glu Glu Lys Leu Asn Val Leu
35 40 45
Ala Gly Glu Ile Arg Glu Thr Tyr Gly Val Glu Thr Lys Val Val Arg
50 55 60
Ala Asp Phe Ser Gln Pro Gly Ala Ala Glu Thr Val Phe Ala Ala Thr
65 70 75 80
Glu Gly Leu Asp Met Gly Phe Met Ser Tyr Val Ala Cys Leu His Ser
85 90 95
Phe Gly Lys Ile Gln Asp Thr Pro Trp Glu Lys His Glu Ala Met Ile
100 105 110
Asn Val Asn Val Val Thr Phe Leu Lys Cys Phe His His Tyr Met Arg
115 120 125
Ile Phe Ala Ala Gln Asp Arg Gly Ala Val Ile Asn Val Ser Ser Met
130 135 140
Thr Gly Ile Ser Ser Ser Pro Trp Asn Gly Gln Tyr Gly Ala Gly Lys
145 150 155 160
Ala Phe Ile Leu Lys Met Thr Glu Ala Val Ala Cys Glu Cys Glu Gly
165 170 175
Thr Gly Val Asp Val Glu Val Ile Thr Leu Gly Thr Thr Leu Thr Pro
180 185 190
Ser Leu Leu Ser Asn Leu Pro Gly Gly Pro Gln Gly Glu Ala Val Met
195 200 205
Lys Ile Ala Leu Thr Pro Glu Glu Cys Val Asp Glu Ala Phe Glu Lys
210 215 220
Leu Gly Lys Glu Leu Ser Val Ile Ala Gly Gln Arg His Lys Asp Arg
225 230 235 240
Val His Asp Trp Lys Ala Asn His Thr Glu Asp Glu Tyr Ile Arg Tyr
245 250 255
Met Gly Ser Phe Tyr Arg Asp
260
<210> 21
<211> 792
<212> DNA
<213> Collinsella aerofaciens
<400> 21
atgaatctgc gtgaaaaata cggtgaatgg ggtctgatcc tgggtgctac ggaaggtgtc 60
ggtaaagcgt tctgtgaaaa aatcgccgcg ggcggtatga acgtggttat ggtcggccgt 120
cgcgaagaaa aactgaatgt gctggcaggc gaaattcgtg aaacctatgg tgttgaaacg 180
aaagtcgtgc gtgctgattt ttcccagccg ggtgcagcag aaaccgtctt cgcagctacg 240
gaaggcctgg acatgggttt tatgtcttac gtggcctgcc tgcatagttt cggtaaaatt 300
caagataccc cgtgggaaaa acacgaagca atgatcaacg tgaatgttgt cacgtttctg 360
aaatgtttcc atcactatat gcgtatcttt gcggcccagg atcgcggtgc cgtgattaac 420
gttagctcta tgaccggcat cagttcctca ccgtggaatg gtcaatacgg cgcgggtaaa 480
gccttcattc tgaaaatgac cgaagcagtt gcttgcgaat gtgaaggcac gggtgtggac 540
gttgaagtca tcaccctggg taccacgctg acgccgtcgc tgctgagcaa cctgccgggc 600
ggtccgcagg gtgaagcagt gatgaaaatt gctctgaccc cggaagaatg cgttgatgaa 660
gcgtttgaaa aactgggcaa agaactgtca gttatcgccg gtcaacgtca taaagatgaa 720
gtccatgact ggaaagcaaa ccacaccgaa gacgaataca tccgctacat gggctcattt 780
taccgtgact aa 792
<210> 22
<211> 263
<212> PRT
<213> Collinsella aerofaciens
<400> 22
Met Asn Leu Arg Glu Lys Tyr Gly Glu Trp Gly Leu Ile Leu Gly Ala
1 5 10 15
Thr Glu Gly Val Gly Lys Ala Phe Cys Glu Lys Ile Ala Ala Gly Gly
20 25 30
Met Asn Val Val Met Val Gly Arg Arg Glu Glu Lys Leu Asn Val Leu
35 40 45
Ala Gly Glu Ile Arg Glu Thr Tyr Gly Val Glu Thr Lys Val Val Arg
50 55 60
Ala Asp Phe Ser Gln Pro Gly Ala Ala Glu Thr Val Phe Ala Ala Thr
65 70 75 80
Glu Gly Leu Asp Met Gly Phe Met Ser Tyr Val Ala Cys Leu His Ser
85 90 95
Phe Gly Lys Ile Gln Asp Thr Pro Trp Glu Lys His Glu Ala Met Ile
100 105 110
Asn Val Asn Val Val Thr Phe Leu Lys Cys Phe His His Tyr Met Arg
115 120 125
Ile Phe Ala Ala Gln Asp Arg Gly Ala Val Ile Asn Val Ser Ser Met
130 135 140
Thr Gly Ile Ser Ser Ser Pro Trp Asn Gly Gln Tyr Gly Ala Gly Lys
145 150 155 160
Ala Phe Ile Leu Lys Met Thr Glu Ala Val Ala Cys Glu Cys Glu Gly
165 170 175
Thr Gly Val Asp Val Glu Val Ile Thr Leu Gly Thr Thr Leu Thr Pro
180 185 190
Ser Leu Leu Ser Asn Leu Pro Gly Gly Pro Gln Gly Glu Ala Val Met
195 200 205
Lys Ile Ala Leu Thr Pro Glu Glu Cys Val Asp Glu Ala Phe Glu Lys
210 215 220
Leu Gly Lys Glu Leu Ser Val Ile Ala Gly Gln Arg His Lys Asp Glu
225 230 235 240
Val His Asp Trp Lys Ala Asn His Thr Glu Asp Glu Tyr Ile Arg Tyr
245 250 255
Met Gly Ser Phe Tyr Arg Asp
260
<210> 23
<211> 792
<212> DNA
<213> Collinsella aerofaciens
<400> 23
atgaatctgc gtgaaaaata cggtgaatgg ggtctgatcc tgggtgctac ggaaggtgtc 60
ggtaaagcgt tctgtgaaaa aatcgccgcg ggcggtatga acgtggttat ggtcggccgt 120
cgcgaagaaa aactgaatgt gctggcaggc gaaattcgtg aaacctatgg tgttgaaacg 180
aaagtcgtgc gtgctgattt ttcccagccg ggtgcagcag aaaccgtctt cgcagctacg 240
gaaggcctgg acatgggttt tatgtcttac gtggcctgcc tgcatagttt cggtaaaatt 300
caagataccc cgtgggaaaa acacgaagca atgatcaacg tgaatgttgt cacgtttctg 360
aaatgtttcc atcactatat gcgtatcttt gcggcccagg atcgcggtgc cgtgattaac 420
gttagctcta tgaccggcat cagttcctca ccgtggaatg gtcaatacgg cgcgggtaaa 480
gccttcattc tgaaaatgac cgaagcagtt gcttgcgaat gtgaaggcac gggtgtggac 540
gttgaagtca tcaccctggg taccacgctg acgccgtcgc tgctgagcaa cctgccgggc 600
ggtccgcagg gtgaagcagt gatgaaaatt gctctgaccc cggaagaatg cgttgatgaa 660
gcgtttgaaa aactgggcaa agaactgtca gttatcgccg gtcaacgtca taaagatcag 720
gtccatgact ggaaagcaaa ccacaccgaa gacgaataca tccgctacat gggctcattt 780
taccgtgact aa 792
<210> 24
<211> 263
<212> PRT
<213> Collinsella aerofaciens
<400> 24
Met Asn Leu Arg Glu Lys Tyr Gly Glu Trp Gly Leu Ile Leu Gly Ala
1 5 10 15
Thr Glu Gly Val Gly Lys Ala Phe Cys Glu Lys Ile Ala Ala Gly Gly
20 25 30
Met Asn Val Val Met Val Gly Arg Arg Glu Glu Lys Leu Asn Val Leu
35 40 45
Ala Gly Glu Ile Arg Glu Thr Tyr Gly Val Glu Thr Lys Val Val Arg
50 55 60
Ala Asp Phe Ser Gln Pro Gly Ala Ala Glu Thr Val Phe Ala Ala Thr
65 70 75 80
Glu Gly Leu Asp Met Gly Phe Met Ser Tyr Val Ala Cys Leu His Ser
85 90 95
Phe Gly Lys Ile Gln Asp Thr Pro Trp Glu Lys His Glu Ala Met Ile
100 105 110
Asn Val Asn Val Val Thr Phe Leu Lys Cys Phe His His Tyr Met Arg
115 120 125
Ile Phe Ala Ala Gln Asp Arg Gly Ala Val Ile Asn Val Ser Ser Met
130 135 140
Thr Gly Ile Ser Ser Ser Pro Trp Asn Gly Gln Tyr Gly Ala Gly Lys
145 150 155 160
Ala Phe Ile Leu Lys Met Thr Glu Ala Val Ala Cys Glu Cys Glu Gly
165 170 175
Thr Gly Val Asp Val Glu Val Ile Thr Leu Gly Thr Thr Leu Thr Pro
180 185 190
Ser Leu Leu Ser Asn Leu Pro Gly Gly Pro Gln Gly Glu Ala Val Met
195 200 205
Lys Ile Ala Leu Thr Pro Glu Glu Cys Val Asp Glu Ala Phe Glu Lys
210 215 220
Leu Gly Lys Glu Leu Ser Val Ile Ala Gly Gln Arg His Lys Asp Gln
225 230 235 240
Val His Asp Trp Lys Ala Asn His Thr Glu Asp Glu Tyr Ile Arg Tyr
245 250 255
Met Gly Ser Phe Tyr Arg Asp
260
<210> 25
<211> 792
<212> DNA
<213> Collinsella aerofaciens
<400> 25
atgaatctgc gtgaaaaata cggtgaatgg ggtctgatcc tgggtgctac ggaaggtgtc 60
ggtaaagcgt tctgtgaaaa aatcgccgcg ggcggtatga acgtggttat ggtcggccgt 120
cgcgaagaaa aactgaatgt gctggcaggc gaaattcgtg aaacctatgg tgttgaaacg 180
aaagtcgtgc gtgctgattt ttcccagccg ggtgcagcag aaaccgtctt cgcagctacg 240
gaaggcctgg acatgggttt tatgtcttac gtggcctgcc tgcatagttt cggtaaaatt 300
caagataccc cgtgggaaaa acacgaagca atgatcaacg tgaatgttgt cacgtttctg 360
aaatgtttcc atcactatat gcgtatcttt gcggcccagg atcgcggtgc cgtgattaac 420
gttagctcta tgaccggcat cagttcctca ccgtggaatg gtcaatacgg cgcgggtaaa 480
gccttcattc tgaaaatgac cgaagcagtt gcttgcgaat gtgaaggcac gggtgtggac 540
gttgaagtca tcaccctggg taccacgctg acgccgtcgc tgctgagcaa cctgccgggc 600
ggtccgcagg gtgaagcagt gatgaaaatt gctctgaccc cggaagaatg cgttgatgaa 660
gcgtttgaaa aactgggcaa agaactgtca gttatcgccg gtcaacgtca gaaagataat 720
gtccatgact ggaaagcaaa ccacaccgaa gacgaataca tccgctacat gggctcattt 780
taccgtgact aa 792
<210> 26
<211> 263
<212> PRT
<213> Collinsella aerofaciens
<400> 26
Met Asn Leu Arg Glu Lys Tyr Gly Glu Trp Gly Leu Ile Leu Gly Ala
1 5 10 15
Thr Glu Gly Val Gly Lys Ala Phe Cys Glu Lys Ile Ala Ala Gly Gly
20 25 30
Met Asn Val Val Met Val Gly Arg Arg Glu Glu Lys Leu Asn Val Leu
35 40 45
Ala Gly Glu Ile Arg Glu Thr Tyr Gly Val Glu Thr Lys Val Val Arg
50 55 60
Ala Asp Phe Ser Gln Pro Gly Ala Ala Glu Thr Val Phe Ala Ala Thr
65 70 75 80
Glu Gly Leu Asp Met Gly Phe Met Ser Tyr Val Ala Cys Leu His Ser
85 90 95
Phe Gly Lys Ile Gln Asp Thr Pro Trp Glu Lys His Glu Ala Met Ile
100 105 110
Asn Val Asn Val Val Thr Phe Leu Lys Cys Phe His His Tyr Met Arg
115 120 125
Ile Phe Ala Ala Gln Asp Arg Gly Ala Val Ile Asn Val Ser Ser Met
130 135 140
Thr Gly Ile Ser Ser Ser Pro Trp Asn Gly Gln Tyr Gly Ala Gly Lys
145 150 155 160
Ala Phe Ile Leu Lys Met Thr Glu Ala Val Ala Cys Glu Cys Glu Gly
165 170 175
Thr Gly Val Asp Val Glu Val Ile Thr Leu Gly Thr Thr Leu Thr Pro
180 185 190
Ser Leu Leu Ser Asn Leu Pro Gly Gly Pro Gln Gly Glu Ala Val Met
195 200 205
Lys Ile Ala Leu Thr Pro Glu Glu Cys Val Asp Glu Ala Phe Glu Lys
210 215 220
Leu Gly Lys Glu Leu Ser Val Ile Ala Gly Gln Arg Gln Lys Asp Asn
225 230 235 240
Val His Asp Trp Lys Ala Asn His Thr Glu Asp Glu Tyr Ile Arg Tyr
245 250 255
Met Gly Ser Phe Tyr Arg Asp
260
<210> 27
<211> 792
<212> DNA
<213> Collinsella aerofaciens
<400> 27
atgaatctgc gtgaaaaata cggtgaatgg ggtctgatcc tgggtgctac ggaaggtgtc 60
ggtaaagcgt tctgtgaaaa aatcgccgcg ggcggtatga acgtggttat ggtcggccgt 120
cgcgaagaaa aactgaatgt gctggcaggc gaaattcgtg aaacctatgg tgttgaaacg 180
aaagtcgtgc gtgctgattt ttcccagccg ggtgcagcag aaaccgtctt cgcagctacg 240
gaaggcctgg acatgggttt tatgtcttac gtggcctgcc tgcatagttt cggtaaaatt 300
caagataccc cgtgggaaaa acacgaagca atgatcaacg tgaatgttgt cacgtttctg 360
aaatgtttcc atcactatat gcgtatcttt gcggcccagg atcgcggtgc cgtgattaac 420
gttagctcta tgaccggcat cagttcctca ccgtggaatg gtcaatacgg cgcgggtaaa 480
gccttcattc tgaaaatgac cgaagcagtt gcttgcgaat gtgaaggcac gggtgtggac 540
gttgaagtca tcaccctggg taccacgctg acgccgtcgc tgctgagcaa cctgccgggc 600
ggtccgcagg gtgaagcagt gatgaaaatt gctctgaccc cggaagaatg cgttgatgaa 660
gcgtttgaaa aactgggcaa agaactgtca gttatcgccg gtcaacgtca gaaagatcga 720
gtccatgact ggaaagcaaa ccacaccgaa gacgaataca tccgctacat gggctcattt 780
taccgtgact aa 792
<210> 28
<211> 263
<212> PRT
<213> Collinsella aerofaciens
<400> 28
Met Asn Leu Arg Glu Lys Tyr Gly Glu Trp Gly Leu Ile Leu Gly Ala
1 5 10 15
Thr Glu Gly Val Gly Lys Ala Phe Cys Glu Lys Ile Ala Ala Gly Gly
20 25 30
Met Asn Val Val Met Val Gly Arg Arg Glu Glu Lys Leu Asn Val Leu
35 40 45
Ala Gly Glu Ile Arg Glu Thr Tyr Gly Val Glu Thr Lys Val Val Arg
50 55 60
Ala Asp Phe Ser Gln Pro Gly Ala Ala Glu Thr Val Phe Ala Ala Thr
65 70 75 80
Glu Gly Leu Asp Met Gly Phe Met Ser Tyr Val Ala Cys Leu His Ser
85 90 95
Phe Gly Lys Ile Gln Asp Thr Pro Trp Glu Lys His Glu Ala Met Ile
100 105 110
Asn Val Asn Val Val Thr Phe Leu Lys Cys Phe His His Tyr Met Arg
115 120 125
Ile Phe Ala Ala Gln Asp Arg Gly Ala Val Ile Asn Val Ser Ser Met
130 135 140
Thr Gly Ile Ser Ser Ser Pro Trp Asn Gly Gln Tyr Gly Ala Gly Lys
145 150 155 160
Ala Phe Ile Leu Lys Met Thr Glu Ala Val Ala Cys Glu Cys Glu Gly
165 170 175
Thr Gly Val Asp Val Glu Val Ile Thr Leu Gly Thr Thr Leu Thr Pro
180 185 190
Ser Leu Leu Ser Asn Leu Pro Gly Gly Pro Gln Gly Glu Ala Val Met
195 200 205
Lys Ile Ala Leu Thr Pro Glu Glu Cys Val Asp Glu Ala Phe Glu Lys
210 215 220
Leu Gly Lys Glu Leu Ser Val Ile Ala Gly Gln Arg Gln Lys Asp Arg
225 230 235 240
Val His Asp Trp Lys Ala Asn His Thr Glu Asp Glu Tyr Ile Arg Tyr
245 250 255
Met Gly Ser Phe Tyr Arg Asp
260
<210> 29
<211> 792
<212> DNA
<213> Collinsella aerofaciens
<400> 29
atgaatctgc gtgaaaaata cggtgaatgg ggtctgatcc tgggtgctac ggaaggtgtc 60
ggtaaagcgt tctgtgaaaa aatcgccgcg ggcggtatga acgtggttat ggtcggccgt 120
cgcgaagaaa aactgaatgt gctggcaggc gaaattcgtg aaacctatgg tgttgaaacg 180
aaagtcgtgc gtgctgattt ttcccagccg ggtgcagcag aaaccgtctt cgcagctacg 240
gaaggcctgg acatgggttt tatgtcttac gtggcctgcc tgcatagttt cggtaaaatt 300
caagataccc cgtgggaaaa acacgaagca atgatcaacg tgaatgttgt cacgtttctg 360
aaatgtttcc atcactatat gcgtatcttt gcggcccagg atcgcggtgc cgtgattaac 420
gttagctcta tgaccggcat cagttcctca ccgtggaatg gtcaatacgg cgcgggtaaa 480
gccttcattc tgaaaatgac cgaagcagtt gcttgcgaat gtgaaggcac gggtgtggac 540
gttgaagtca tcaccctggg taccacgctg acgccgtcgc tgctgagcaa cctgccgggc 600
ggtccgcagg gtgaagcagt gatgaaaatt gctctgaccc cggaagaatg cgttgatgaa 660
gcgtttgaaa aactgggcaa agaactgtca gttatcgccg gtcaacgtca gaaagatgaa 720
gtccatgact ggaaagcaaa ccacaccgaa gacgaataca tccgctacat gggctcattt 780
taccgtgact aa 792
<210> 30
<211> 263
<212> PRT
<213> Collinsella aerofaciens
<400> 30
Met Asn Leu Arg Glu Lys Tyr Gly Glu Trp Gly Leu Ile Leu Gly Ala
1 5 10 15
Thr Glu Gly Val Gly Lys Ala Phe Cys Glu Lys Ile Ala Ala Gly Gly
20 25 30
Met Asn Val Val Met Val Gly Arg Arg Glu Glu Lys Leu Asn Val Leu
35 40 45
Ala Gly Glu Ile Arg Glu Thr Tyr Gly Val Glu Thr Lys Val Val Arg
50 55 60
Ala Asp Phe Ser Gln Pro Gly Ala Ala Glu Thr Val Phe Ala Ala Thr
65 70 75 80
Glu Gly Leu Asp Met Gly Phe Met Ser Tyr Val Ala Cys Leu His Ser
85 90 95
Phe Gly Lys Ile Gln Asp Thr Pro Trp Glu Lys His Glu Ala Met Ile
100 105 110
Asn Val Asn Val Val Thr Phe Leu Lys Cys Phe His His Tyr Met Arg
115 120 125
Ile Phe Ala Ala Gln Asp Arg Gly Ala Val Ile Asn Val Ser Ser Met
130 135 140
Thr Gly Ile Ser Ser Ser Pro Trp Asn Gly Gln Tyr Gly Ala Gly Lys
145 150 155 160
Ala Phe Ile Leu Lys Met Thr Glu Ala Val Ala Cys Glu Cys Glu Gly
165 170 175
Thr Gly Val Asp Val Glu Val Ile Thr Leu Gly Thr Thr Leu Thr Pro
180 185 190
Ser Leu Leu Ser Asn Leu Pro Gly Gly Pro Gln Gly Glu Ala Val Met
195 200 205
Lys Ile Ala Leu Thr Pro Glu Glu Cys Val Asp Glu Ala Phe Glu Lys
210 215 220
Leu Gly Lys Glu Leu Ser Val Ile Ala Gly Gln Arg Gln Lys Asp Glu
225 230 235 240
Val His Asp Trp Lys Ala Asn His Thr Glu Asp Glu Tyr Ile Arg Tyr
245 250 255
Met Gly Ser Phe Tyr Arg Asp
260
<210> 31
<211> 792
<212> DNA
<213> Collinsella aerofaciens
<400> 31
atgaatctgc gtgaaaaata cggtgaatgg ggtctgatcc tgggtgctac ggaaggtgtc 60
ggtaaagcgt tctgtgaaaa aatcgccgcg ggcggtatga acgtggttat ggtcggccgt 120
cgcgaagaaa aactgaatgt gctggcaggc gaaattcgtg aaacctatgg tgttgaaacg 180
aaagtcgtgc gtgctgattt ttcccagccg ggtgcagcag aaaccgtctt cgcagctacg 240
gaaggcctgg acatgggttt tatgtcttac gtggcctgcc tgcatagttt cggtaaaatt 300
caagataccc cgtgggaaaa acacgaagca atgatcaacg tgaatgttgt cacgtttctg 360
aaatgtttcc atcactatat gcgtatcttt gcggcccagg atcgcggtgc cgtgattaac 420
gttagctcta tgaccggcat cagttcctca ccgtggaatg gtcaatacgg cgcgggtaaa 480
gccttcattc tgaaaatgac cgaagcagtt gcttgcgaat gtgaaggcac gggtgtggac 540
gttgaagtca tcaccctggg taccacgctg acgccgtcgc tgctgagcaa cctgccgggc 600
ggtccgcagg gtgaagcagt gatgaaaatt gctctgaccc cggaagaatg cgttgatgaa 660
gcgtttgaaa aactgggcaa agaactgtca gttatcgccg gtcaacgtca gaaagatcag 720
gtccatgact ggaaagcaaa ccacaccgaa gacgaataca tccgctacat gggctcattt 780
taccgtgact aa 792
<210> 32
<211> 263
<212> PRT
<213> Collinsella aerofaciens
<400> 32
Met Asn Leu Arg Glu Lys Tyr Gly Glu Trp Gly Leu Ile Leu Gly Ala
1 5 10 15
Thr Glu Gly Val Gly Lys Ala Phe Cys Glu Lys Ile Ala Ala Gly Gly
20 25 30
Met Asn Val Val Met Val Gly Arg Arg Glu Glu Lys Leu Asn Val Leu
35 40 45
Ala Gly Glu Ile Arg Glu Thr Tyr Gly Val Glu Thr Lys Val Val Arg
50 55 60
Ala Asp Phe Ser Gln Pro Gly Ala Ala Glu Thr Val Phe Ala Ala Thr
65 70 75 80
Glu Gly Leu Asp Met Gly Phe Met Ser Tyr Val Ala Cys Leu His Ser
85 90 95
Phe Gly Lys Ile Gln Asp Thr Pro Trp Glu Lys His Glu Ala Met Ile
100 105 110
Asn Val Asn Val Val Thr Phe Leu Lys Cys Phe His His Tyr Met Arg
115 120 125
Ile Phe Ala Ala Gln Asp Arg Gly Ala Val Ile Asn Val Ser Ser Met
130 135 140
Thr Gly Ile Ser Ser Ser Pro Trp Asn Gly Gln Tyr Gly Ala Gly Lys
145 150 155 160
Ala Phe Ile Leu Lys Met Thr Glu Ala Val Ala Cys Glu Cys Glu Gly
165 170 175
Thr Gly Val Asp Val Glu Val Ile Thr Leu Gly Thr Thr Leu Thr Pro
180 185 190
Ser Leu Leu Ser Asn Leu Pro Gly Gly Pro Gln Gly Glu Ala Val Met
195 200 205
Lys Ile Ala Leu Thr Pro Glu Glu Cys Val Asp Glu Ala Phe Glu Lys
210 215 220
Leu Gly Lys Glu Leu Ser Val Ile Ala Gly Gln Arg Gln Lys Asp Gln
225 230 235 240
Val His Asp Trp Lys Ala Asn His Thr Glu Asp Glu Tyr Ile Arg Tyr
245 250 255
Met Gly Ser Phe Tyr Arg Asp
260
<210> 33
<211> 30
<212> DNA
<213> Artificial Sequence
<400> 33
tcgccggtca acgtaaaaaa gatagcgtcc 30
<210> 34
<211> 30
<212> DNA
<213> Artificial Sequence
<400> 34
ggacgctatc ttttttacgt tgaccggcga 30
<210> 35
<211> 32
<212> DNA
<213> Artificial Sequence
<400> 35
tcgccggtca acgtcataaa gatagcgtcc at 32
<210> 36
<211> 32
<212> DNA
<213> Artificial Sequence
<400> 36
atggacgcta tctttatgac gttgaccggc ga 32
<210> 37
<211> 30
<212> DNA
<213> Artificial Sequence
<400> 37
tcgccggtca acgtcagaaa gatagcgtcc 30
<210> 38
<211> 30
<212> DNA
<213> Artificial Sequence
<400> 38
ggacgctatc tttctgacgt tgaccggcga 30
<210> 39
<211> 34
<212> DNA
<213> Artificial Sequence
<400> 39
cggtcaacgt aataaagata atgtccatga ctgg 34
<210> 40
<211> 34
<212> DNA
<213> Artificial Sequence
<400> 40
ccagtcatgg acattatctt tattacgttg accg 34
<210> 41
<211> 27
<212> DNA
<213> Artificial Sequence
<400> 41
cgtaataaag atcgagtcca tgactgg 27
<210> 42
<211> 27
<212> DNA
<213> Artificial Sequence
<400> 42
ccagtcatgg actcgatctt tattacg 27
<210> 43
<211> 27
<212> DNA
<213> Artificial Sequence
<400> 43
cgtaataaag atgaagtcca tgactgg 27
<210> 44
<211> 27
<212> DNA
<213> Artificial Sequence
<400> 44
ccagtcatgg acttcatctt tattacg 27
<210> 45
<211> 27
<212> DNA
<213> Artificial Sequence
<400> 45
cgtaataaag atcaggtcca tgactgg 27
<210> 46
<211> 27
<212> DNA
<213> Artificial Sequence
<400> 46
ccagtcatgg acctgatctt tattacg 27
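The artificial sequences SEQ ID NO: 33 to 46 read as forward/reverse primer pairs, which is the usual layout for site-directed mutagenesis. The following is a minimal sketch, not part of the patent, that checks whether each even-numbered sequence is the reverse complement of the odd-numbered one before it. The sequences are copied from the listing above; the names `PRIMER_PAIRS`, `reverse_complement` and `check_pairs` are illustrative assumptions.

```python
# Primer pairs copied from SEQ ID NO: 33-46 in the listing (spaces kept as printed).
PRIMER_PAIRS = {
    (33, 34): ("tcgccggtca acgtaaaaaa gatagcgtcc", "ggacgctatc ttttttacgt tgaccggcga"),
    (35, 36): ("tcgccggtca acgtcataaa gatagcgtcc at", "atggacgcta tctttatgac gttgaccggc ga"),
    (37, 38): ("tcgccggtca acgtcagaaa gatagcgtcc", "ggacgctatc tttctgacgt tgaccggcga"),
    (39, 40): ("cggtcaacgt aataaagata atgtccatga ctgg", "ccagtcatgg acattatctt tattacgttg accg"),
    (41, 42): ("cgtaataaag atcgagtcca tgactgg", "ccagtcatgg actcgatctt tattacg"),
    (43, 44): ("cgtaataaag atgaagtcca tgactgg", "ccagtcatgg acttcatctt tattacg"),
    (45, 46): ("cgtaataaag atcaggtcca tgactgg", "ccagtcatgg acctgatctt tattacg"),
}

_COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a lowercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def check_pairs(pairs):
    """Map each (forward, reverse) SEQ ID pair to True if the two are reverse complements."""
    results = {}
    for ids, (forward, reverse) in pairs.items():
        forward = forward.replace(" ", "")
        reverse = reverse.replace(" ", "")
        results[ids] = reverse_complement(forward) == reverse
    return results


results = check_pairs(PRIMER_PAIRS)
for ids, ok in results.items():
    print(ids, "complementary" if ok else "MISMATCH")
```

A pair flagged as a mismatch would point to a transcription error in the listing rather than to a design choice, since this kind of mutagenesis normally uses fully complementary primers.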
Claims (10)
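1. A 7β-HSDH enzyme mutant, characterized in that, compared with the wild-type 7β-HSDH enzyme whose amino acid sequence is shown in SEQ ID NO: 2, the amino acid sequence of the mutant carries any one of the following mutations: a single mutation at position 237 or at position 240 of the amino acid sequence shown in SEQ ID NO: 2, or a combined double mutation at both positions.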
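2. The 7β-HSDH enzyme mutant of claim 1, characterized in that the single mutation is one of the following:
position 237 of the amino acid sequence shown in SEQ ID NO: 2 is mutated from asparagine to histidine, and the amino acid sequence of the 7β-HSDH enzyme mutant is shown in SEQ ID NO: 6;
or, position 237 of the amino acid sequence shown in SEQ ID NO: 2 is mutated from asparagine to glutamine, and the amino acid sequence of the 7β-HSDH enzyme mutant is shown in SEQ ID NO: 8;
or, position 240 of the amino acid sequence shown in SEQ ID NO: 2 is mutated from serine to asparagine, and the amino acid sequence of the 7β-HSDH enzyme mutant is shown in SEQ ID NO: 10;
or, position 240 of the amino acid sequence shown in SEQ ID NO: 2 is mutated from serine to arginine, and the amino acid sequence of the 7β-HSDH enzyme mutant is shown in SEQ ID NO: 12;
or, position 240 of the amino acid sequence shown in SEQ ID NO: 2 is mutated from serine to glutamic acid, and the amino acid sequence of the 7β-HSDH enzyme mutant is shown in SEQ ID NO: 14;
or, position 240 of the amino acid sequence shown in SEQ ID NO: 2 is mutated from serine to glutamine, and the amino acid sequence of the 7β-HSDH enzyme mutant is shown in SEQ ID NO: 16.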
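3. The 7β-HSDH enzyme mutant of claim 1, characterized in that the combined double mutation is one of the following:
position 237 of the amino acid sequence shown in SEQ ID NO: 2 is mutated from asparagine to histidine and position 240 from serine to asparagine, and the amino acid sequence of the 7β-HSDH enzyme mutant is shown in SEQ ID NO: 18;
or, position 237 of the amino acid sequence shown in SEQ ID NO: 2 is mutated from asparagine to histidine and position 240 from serine to arginine, and the amino acid sequence of the 7β-HSDH enzyme mutant is shown in SEQ ID NO: 20;
or, position 237 of the amino acid sequence shown in SEQ ID NO: 2 is mutated from asparagine to histidine and position 240 from serine to glutamic acid, and the amino acid sequence of the 7β-HSDH enzyme mutant is shown in SEQ ID NO: 22;
or, position 237 of the amino acid sequence shown in SEQ ID NO: 2 is mutated from asparagine to histidine and position 240 from serine to glutamine, and the amino acid sequence of the 7β-HSDH enzyme mutant is shown in SEQ ID NO: 24;
or, position 237 of the amino acid sequence shown in SEQ ID NO: 2 is mutated from asparagine to glutamine and position 240 from serine to asparagine, and the amino acid sequence of the 7β-HSDH enzyme mutant is shown in SEQ ID NO: 26;
or, position 237 of the amino acid sequence shown in SEQ ID NO: 2 is mutated from asparagine to glutamine and position 240 from serine to arginine, and the amino acid sequence of the 7β-HSDH enzyme mutant is shown in SEQ ID NO: 28;
or, position 237 of the amino acid sequence shown in SEQ ID NO: 2 is mutated from asparagine to glutamine and position 240 from serine to glutamic acid, and the amino acid sequence of the 7β-HSDH enzyme mutant is shown in SEQ ID NO: 30;
or, position 237 of the amino acid sequence shown in SEQ ID NO: 2 is mutated from asparagine to glutamine and position 240 from serine to glutamine, and the amino acid sequence of the 7β-HSDH enzyme mutant is shown in SEQ ID NO: 32.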
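The six single mutants and eight double mutants recited in claims 2 and 3 follow a regular pattern: two replacements at position 237, four at position 240, and every pairwise combination, each assigned the next even SEQ ID NO. The sketch below reproduces that mapping. The shorthand such as `N237H` (one-letter code, wild-type residue, position, new residue) and the function name are assumptions introduced for illustration and do not appear in the claims.

```python
from itertools import product

# One-letter codes for the replacements recited in claim 2.
POS237_REPLACEMENTS = ["H", "Q"]            # Asn237 -> His, Gln
POS240_REPLACEMENTS = ["N", "R", "E", "Q"]  # Ser240 -> Asn, Arg, Glu, Gln


def enumerate_variants(first_seq_id: int = 6) -> dict:
    """Return {variant label: protein SEQ ID NO} in the order used by claims 2 and 3."""
    labels = [f"N237{aa}" for aa in POS237_REPLACEMENTS]
    labels += [f"S240{aa}" for aa in POS240_REPLACEMENTS]
    labels += [
        f"N237{a}/S240{b}"
        for a, b in product(POS237_REPLACEMENTS, POS240_REPLACEMENTS)
    ]
    # Protein sequences take even SEQ ID numbers: 6, 8, ..., 32.
    return {label: first_seq_id + 2 * i for i, label in enumerate(labels)}


variants = enumerate_variants()
for label, seq_id in variants.items():
    print(f"{label:12s} SEQ ID NO: {seq_id}")
```

Enumerating the combinations this way makes it easy to confirm that the claims cover all 2 × 4 double mutants and skip none.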
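4. A gene encoding the 7β-HSDH enzyme mutant of claim 2 or 3, characterized in that the nucleotide sequence of the gene encoding the 7β-HSDH enzyme mutant whose amino acid sequence is shown in SEQ ID NO: 6 is shown in SEQ ID NO: 5;
or, the nucleotide sequence of the gene encoding the 7β-HSDH enzyme mutant whose amino acid sequence is shown in SEQ ID NO: 8 is shown in SEQ ID NO: 7;
or, the nucleotide sequence of the gene encoding the 7β-HSDH enzyme mutant whose amino acid sequence is shown in SEQ ID NO: 10 is shown in SEQ ID NO: 9;
or, the nucleotide sequence of the gene encoding the 7β-HSDH enzyme mutant whose amino acid sequence is shown in SEQ ID NO: 12 is shown in SEQ ID NO: 11;
or, the nucleotide sequence of the gene encoding the 7β-HSDH enzyme mutant whose amino acid sequence is shown in SEQ ID NO: 14 is shown in SEQ ID NO: 13;
or, the nucleotide sequence of the gene encoding the 7β-HSDH enzyme mutant whose amino acid sequence is shown in SEQ ID NO: 16 is shown in SEQ ID NO: 15;
or, the nucleotide sequence of the gene encoding the 7β-HSDH enzyme mutant whose amino acid sequence is shown in SEQ ID NO: 18 is shown in SEQ ID NO: 17;
or, the nucleotide sequence of the gene encoding the 7β-HSDH enzyme mutant whose amino acid sequence is shown in SEQ ID NO: 20 is shown in SEQ ID NO: 19;
or, the nucleotide sequence of the gene encoding the 7β-HSDH enzyme mutant whose amino acid sequence is shown in SEQ ID NO: 22 is shown in SEQ ID NO: 21;
or, the nucleotide sequence of the gene encoding the 7β-HSDH enzyme mutant whose amino acid sequence is shown in SEQ ID NO: 24 is shown in SEQ ID NO: 23;
or, the nucleotide sequence of the gene encoding the 7β-HSDH enzyme mutant whose amino acid sequence is shown in SEQ ID NO: 26 is shown in SEQ ID NO: 25;
or, the nucleotide sequence of the gene encoding the 7β-HSDH enzyme mutant whose amino acid sequence is shown in SEQ ID NO: 28 is shown in SEQ ID NO: 27;
or, the nucleotide sequence of the gene encoding the 7β-HSDH enzyme mutant whose amino acid sequence is shown in SEQ ID NO: 30 is shown in SEQ ID NO: 29;
or, the nucleotide sequence of the gene encoding the 7β-HSDH enzyme mutant whose amino acid sequence is shown in SEQ ID NO: 32 is shown in SEQ ID NO: 31.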
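Claim 4 pairs each protein sequence (even SEQ ID NO) with the coding sequence numbered one lower. One way to spot-check such a pairing is to translate the stretch of the coding sequence that contains codons 237 and 240. The sketch below does this for nucleotides 661 to 720 of SEQ ID NO: 17 and SEQ ID NO: 27, copied from the listing; that 60-nucleotide row encodes residues 221 to 240. It assumes the standard genetic code, and the names `CODON_TABLE`, `translate` and `residues_237_240` are illustrative.

```python
# Standard genetic code, built from the conventional TCAG ordering.
_BASES = "tcag"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: _AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(_BASES)
    for j, b in enumerate(_BASES)
    for k, c in enumerate(_BASES)
}

# Nucleotides 661-720 of two coding sequences, copied from the listing.
SEGMENTS = {
    17: "gcgtttgaaa aactgggcaa agaactgtca gttatcgccg gtcaacgtca taaagataat",
    27: "gcgtttgaaa aactgggcaa agaactgtca gttatcgccg gtcaacgtca gaaagatcga",
}


def translate(dna: str) -> str:
    """Translate an in-frame lowercase DNA string into one-letter amino acids."""
    dna = dna.replace(" ", "")
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def residues_237_240(segment: str, first_residue: int = 221) -> tuple:
    """Return the residues at positions 237 and 240 for a segment starting at first_residue."""
    protein = translate(segment)
    return protein[237 - first_residue], protein[240 - first_residue]


for seq_id, segment in SEGMENTS.items():
    print(seq_id, translate(segment), residues_237_240(segment))
```

For SEQ ID NO: 17 the two residues come out as His and Asn, and for SEQ ID NO: 27 as Gln and Arg, which agrees with the protein sequences SEQ ID NO: 18 and SEQ ID NO: 28 that claim 4 pairs them with.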
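5. A vector containing the encoding gene of claim 4.
6. The vector of claim 5, characterized in that the vector is a pET expression vector, a pCW expression vector, a pUC expression vector or a pPIC9k expression vector.
7. A host cell containing the encoding gene of claim 4, characterized in that the host cell is Escherichia coli, Pichia pastoris, Streptomyces or Bacillus subtilis.
8. Use of the 7β-HSDH enzyme mutant of any one of claims 1-3, the encoding gene of claim 4, the vector of claim 5 or 6, or the host cell of claim 7 in the preparation of 24-nor-ursodeoxycholic acid.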
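9. A method for preparing 24-nor-ursodeoxycholic acid, characterized in that the method comprises the following steps:
S1. Prepare a reaction system comprising: 1-10 g/L of the 7β-HSDH enzyme mutant of any one of claims 1-3, 50 mM sodium phosphate buffer at pH 6.0-8.0, 0.2 mM NADP+, 10-50 g/L 24-nor-7-ketolithocholic acid, 5-40 g/L glucose, and 0.1-10 g/L glucose dehydrogenase; maintain the reaction system at 25-40 °C and pH 6.0-8.0, and react with stirring;
S2. After 24 h of reaction, analyze by HPLC to obtain 24-nor-ursodeoxycholic acid.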
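Step S1 specifies the reaction system as concentrations only. As a rough illustration, the sketch below converts a chosen set of concentrations into absolute amounts for a given batch volume. The 100 mL volume, the function name `reaction_recipe` and its parameter names are assumptions made for this example; only the concentration values come from the claim.

```python
def reaction_recipe(volume_l, enzyme_g_per_l, substrate_g_per_l,
                    glucose_g_per_l, gdh_g_per_l,
                    nadp_mm=0.2, buffer_mm=50.0):
    """Convert S1 concentrations into amounts for a batch of volume_l litres."""
    return {
        "enzyme_g": enzyme_g_per_l * volume_l,          # 7β-HSDH mutant
        "substrate_g": substrate_g_per_l * volume_l,    # 24-nor-7-ketolithocholic acid
        "glucose_g": glucose_g_per_l * volume_l,
        "gdh_g": gdh_g_per_l * volume_l,                # glucose dehydrogenase
        "nadp_umol": nadp_mm * volume_l * 1000.0,       # mM x L = mmol, x 1000 = µmol
        "phosphate_mmol": buffer_mm * volume_l,         # sodium phosphate buffer
    }


# Hypothetical 100 mL batch at the low end of the claimed substrate and glucose ranges.
recipe = reaction_recipe(
    volume_l=0.1,
    enzyme_g_per_l=1.0,
    substrate_g_per_l=10.0,
    glucose_g_per_l=5.0,
    gdh_g_per_l=0.5,
)
for name, amount in recipe.items():
    print(f"{name}: {amount:.3g}")
```

Glucose and glucose dehydrogenase are present to regenerate the NADP+ cofactor, which is why NADP+ can be supplied at only 0.2 mM while the substrate is loaded at grams per litre.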
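10. The method of claim 9, characterized in that the method comprises the following steps:
S1. Prepare a reaction system comprising: 1 g/L of the 7β-HSDH enzyme mutant of any one of claims 1-3, 50 mM sodium phosphate buffer at pH 7.0, 0.2 mM NADP+, 10-50 g/L 24-nor-7-ketolithocholic acid, 5-40 g/L glucose, and 0.5 g/L glucose dehydrogenase; maintain the reaction system at 30 °C and pH 7.0, and react with stirring;
S2. After 24 h of reaction, analyze by HPLC to obtain 24-nor-ursodeoxycholic acid.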
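Claim 10 narrows several of the ranges given in claim 9 to single values. The sketch below encodes both sets of conditions as (low, high) intervals and checks that every condition of claim 10 lies inside the corresponding range of claim 9. The dictionary keys and the function name `within` are hypothetical names chosen for this illustration.

```python
# (low, high) intervals taken from step S1 of claim 9.
CLAIM_9 = {
    "enzyme_g_per_l": (1.0, 10.0),
    "buffer_ph": (6.0, 8.0),
    "substrate_g_per_l": (10.0, 50.0),
    "glucose_g_per_l": (5.0, 40.0),
    "gdh_g_per_l": (0.1, 10.0),
    "temperature_c": (25.0, 40.0),
    "reaction_ph": (6.0, 8.0),
}

# Claim 10 fixes some parameters to single values, written here as (x, x).
CLAIM_10 = {
    "enzyme_g_per_l": (1.0, 1.0),
    "buffer_ph": (7.0, 7.0),
    "substrate_g_per_l": (10.0, 50.0),
    "glucose_g_per_l": (5.0, 40.0),
    "gdh_g_per_l": (0.5, 0.5),
    "temperature_c": (30.0, 30.0),
    "reaction_ph": (7.0, 7.0),
}


def within(narrow: dict, broad: dict) -> bool:
    """True if every interval in narrow lies inside the matching interval in broad."""
    return all(
        broad[key][0] <= low and high <= broad[key][1]
        for key, (low, high) in narrow.items()
    )


print("claim 10 within claim 9:", within(CLAIM_10, CLAIM_9))
```

The same helper can be used to test whether a planned experimental condition falls inside either claim by passing it as the `narrow` argument.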
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111215918.4A CN113832122B (zh) | 2021-10-19 | 2021-10-19 | 7β-HSDH enzyme mutant, and encoding gene and application thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113832122A (zh) | 2021-12-24 |
CN113832122B (zh) | 2023-06-16 |
Family
ID=78965407
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111215918.4A (Active, CN113832122B (zh)) | 7β-HSDH enzyme mutant, and encoding gene and application thereof | 2021-10-19 | 2021-10-19 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113832122B (zh) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108546691A (zh) * | 2018-05-09 | 2018-09-18 | 华东理工大学 | 7β-hydroxysteroid dehydrogenase mutant and its application in preparing ursodeoxycholic acid |
US20200407766A1 (en) * | 2016-06-20 | 2020-12-31 | Pharmazell Gmbh | Coupled, Self-Sufficient Biotransformation of Chenodeoxcholic Acid to Ursodeoxycholic Acid and Novel Enzyme Mutants Applicable in said Process |
CN113388592A (zh) * | 2021-06-30 | 2021-09-14 | 中山百灵生物技术股份有限公司 | 7β-HSDH enzyme mutant, and encoding gene and application thereof |
CN113462665A (zh) * | 2021-06-30 | 2021-10-01 | 中山百灵生物技术股份有限公司 | 7α-HSDH enzyme mutant, and encoding gene and application thereof |
Non-Patent Citations (5)
Title |
---|
MING-MIN ZHENG ET AL.: "Engineering 7β-Hydroxysteroid Dehydrogenase for Enhanced Ursodeoxycholic Acid Production by Multiobjective Directed Evolution" * |
SIMONE SAVINO ET AL.: "Structural and biochemical insights into 7b-hydroxysteroid dehydrogenase stereoselectivity" * |
ZHI-NENG YOU ET AL.: "Switching Cofactor Dependence of 7β-Hydroxysteroid Dehydrogenase for Cost-Effective Production of Ursodeoxycholic Acid" * |
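董新星 et al.: "Research progress on 3β- and 17β-hydroxysteroid dehydrogenases" * |
贺俊斌 et al.: "Application of multi-enzyme catalytic cascade strategies in the synthesis of complex natural products" * |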
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114276401A (zh) * | 2021-12-27 | 2022-04-05 | 中山百灵生物技术股份有限公司 | Method for synthesizing 24-nor-ursodeoxycholic acid |
CN114480319A (zh) * | 2022-01-27 | 2022-05-13 | 南京桦冠生物技术有限公司 | Monoamine oxidase mutant and application thereof |
CN114752572A (zh) * | 2022-02-18 | 2022-07-15 | 深圳希吉亚生物技术有限公司 | Formate dehydrogenase mutant and application thereof |
CN114752572B (zh) * | 2022-02-18 | 2023-07-18 | 深圳希吉亚生物技术有限公司 | Formate dehydrogenase mutant and application thereof |
CN114854707A (zh) * | 2022-06-14 | 2022-08-05 | 苏州百福安酶技术有限公司 | 7β-hydroxysteroid dehydrogenase mutant |
CN114854707B (zh) * | 2022-06-14 | 2023-09-12 | 苏州百福安酶技术有限公司 | 7β-hydroxysteroid dehydrogenase mutant |
Also Published As
Publication number | Publication date |
---|---|
CN113832122B (zh) | 2023-06-16 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||