CN117897480A - Mouse Li Tanggao degree specific glycosyltransferase and application thereof - Google Patents
Mouse Li Tanggao degree specific glycosyltransferase and application thereof Download PDFInfo
- Publication number
- CN117897480A CN117897480A CN202280053383.1A CN202280053383A CN117897480A CN 117897480 A CN117897480 A CN 117897480A CN 202280053383 A CN202280053383 A CN 202280053383A CN 117897480 A CN117897480 A CN 117897480A
- Authority
- CN
- China
- Prior art keywords
- glycosyl
- rhamnose
- group
- seq
- ginsenoside
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108700023372 Glycosyltransferases Proteins 0.000 title claims abstract description 118
- 102000051366 Glycosyltransferases Human genes 0.000 title claims abstract description 115
- 125000003147 glycosyl group Chemical group 0.000 claims abstract description 61
- -1 triterpene compound Chemical class 0.000 claims abstract description 57
- SHZGCJCMOBCMKK-JFNONXLTSA-N L-rhamnopyranose Chemical compound C[C@@H]1OC(O)[C@H](O)[C@H](O)[C@H]1O SHZGCJCMOBCMKK-JFNONXLTSA-N 0.000 claims abstract description 38
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 94
- 229920001184 polypeptide Polymers 0.000 claims description 93
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 93
- 239000000348 glycosyl donor Substances 0.000 claims description 51
- 150000001875 compounds Chemical class 0.000 claims description 50
- YURJSTAIMNSZAE-UHFFFAOYSA-N UNPD89172 Natural products C1CC(C2(CC(C3C(C)(C)C(O)CCC3(C)C2CC2O)OC3C(C(O)C(O)C(CO)O3)O)C)(C)C2C1C(C)(CCC=C(C)C)OC1OC(CO)C(O)C(O)C1O YURJSTAIMNSZAE-UHFFFAOYSA-N 0.000 claims description 45
- 238000000034 method Methods 0.000 claims description 43
- AGBCLJAHARWNLA-DQUQINEDSA-N Ginsenoside RG2 Chemical compound O[C@@H]1[C@H](O)[C@@H](O)[C@H](C)O[C@H]1O[C@H]1[C@H](O[C@@H]2[C@H]3C(C)(C)[C@@H](O)CC[C@]3(C)[C@@H]3[C@@]([C@@]4(CC[C@@H]([C@H]4[C@H](O)C3)[C@@](C)(O)CCC=C(C)C)C)(C)C2)O[C@H](CO)[C@@H](O)[C@@H]1O AGBCLJAHARWNLA-DQUQINEDSA-N 0.000 claims description 42
- YURJSTAIMNSZAE-HHNZYBFYSA-N ginsenoside Rg1 Chemical compound O([C@@](C)(CCC=C(C)C)[C@@H]1[C@@H]2[C@@]([C@@]3(C[C@@H]([C@H]4C(C)(C)[C@@H](O)CC[C@]4(C)[C@H]3C[C@H]2O)O[C@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O2)O)C)(C)CC1)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O YURJSTAIMNSZAE-HHNZYBFYSA-N 0.000 claims description 33
- 108091033319 polynucleotide Proteins 0.000 claims description 33
- 102000040430 polynucleotide Human genes 0.000 claims description 33
- 239000002157 polynucleotide Substances 0.000 claims description 33
- 238000006243 chemical reaction Methods 0.000 claims description 30
- 239000013598 vector Substances 0.000 claims description 28
- RAQNTCRNSXYLAH-RFCGZQMISA-N (20S)-ginsenoside Rh1 Chemical compound O([C@@H]1[C@H]2C(C)(C)[C@@H](O)CC[C@]2(C)[C@H]2C[C@@H](O)[C@H]3[C@@]([C@@]2(C1)C)(C)CC[C@@H]3[C@@](C)(O)CCC=C(C)C)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O RAQNTCRNSXYLAH-RFCGZQMISA-N 0.000 claims description 24
- RAQNTCRNSXYLAH-UHFFFAOYSA-N Ginsenoside Rh1 Natural products CC(C)=CCCC(C)(O)C1CCC(C2(C3)C)(C)C1C(O)CC2C1(C)CCC(O)C(C)(C)C1C3OC1OC(CO)C(O)C(O)C1O RAQNTCRNSXYLAH-UHFFFAOYSA-N 0.000 claims description 24
- CBEHEBUBNAGGKC-UHFFFAOYSA-N ginsenoside Rg1 Natural products CC(=CCCC(C)(OC1OC(CO)C(O)C(O)C1O)C2CCC3(C)C2C(O)CC4C5(C)CCC(O)C(C)(C)C5CC(OC6OC(CO)C(O)C(O)C6O)C34C)C CBEHEBUBNAGGKC-UHFFFAOYSA-N 0.000 claims description 24
- 150000007523 nucleic acids Chemical class 0.000 claims description 23
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 22
- PWAOOJDMFUQOKB-WCZZMFLVSA-N ginsenoside Re Chemical compound O[C@@H]1[C@H](O)[C@@H](O)[C@H](C)O[C@H]1O[C@H]1[C@H](O[C@@H]2[C@H]3C(C)(C)[C@@H](O)CC[C@]3(C)[C@@H]3[C@@]([C@@]4(CC[C@@H]([C@H]4[C@H](O)C3)[C@](C)(CCC=C(C)C)O[C@H]3[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O3)O)C)(C)C2)O[C@H](CO)[C@@H](O)[C@@H]1O PWAOOJDMFUQOKB-WCZZMFLVSA-N 0.000 claims description 22
- AOGZLQUEBLOQCI-UHFFFAOYSA-N ginsenoside-Re Natural products CC1OC(OCC2OC(OC3CC4(C)C(CC(O)C5C(CCC45C)C(C)(CCC=C(C)C)OC6OC(CO)C(O)C(O)C6O)C7(C)CCC(O)C(C)(C)C37)C(O)C(O)C2O)C(O)C(O)C1O AOGZLQUEBLOQCI-UHFFFAOYSA-N 0.000 claims description 22
- 125000002791 glucosyl group Chemical group C1([C@H](O)[C@@H](O)[C@H](O)[C@H](O1)CO)* 0.000 claims description 22
- AGBCLJAHARWNLA-UHFFFAOYSA-N (20R)-ginsenoside Rg2 Natural products OC1C(O)C(O)C(C)OC1OC1C(OC2C3C(C)(C)C(O)CCC3(C)C3C(C4(CCC(C4C(O)C3)C(C)(O)CCC=C(C)C)C)(C)C2)OC(CO)C(O)C1O AGBCLJAHARWNLA-UHFFFAOYSA-N 0.000 claims description 21
- JURZHOVRCOWZFN-UHFFFAOYSA-N notoginsenoside R1 Natural products CC(=CCCC(C)(OC1OC(CO)C(O)C(O)C1O)C2CCC3(C)C2C(O)CC4C5(C)CCC(O)C(C)(C)C5C(CC34C)OC6OC(COC7OCC(O)C(O)C7O)C(O)C(O)C6O)C JURZHOVRCOWZFN-UHFFFAOYSA-N 0.000 claims description 21
- YPUHYSBFIMWSEC-OEHDJSDTSA-N (2r,3r,4s,5s,6r)-2-[[(2r,3s,4s,5r,6s)-6-[(2s)-2-[(3s,5r,6s,8r,9r,10r,12r,13r,14r,17s)-3,12-dihydroxy-4,4,8,10,14-pentamethyl-6-[(2r,3r,4s,5s,6r)-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl]oxy-2,3,5,6,7,9,11,12,13,15,16,17-dodecahydro-1h-cyclopenta[a]phen Chemical compound C([C@H]1O[C@H]([C@@H]([C@@H](O)[C@@H]1O)O)O[C@@](C)(CCC=C(C)C)[C@@H]1[C@@H]2[C@@]([C@@]3(C[C@@H]([C@H]4C(C)(C)[C@@H](O)CC[C@]4(C)[C@H]3C[C@H]2O)O[C@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O2)O)C)(C)CC1)O[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O YPUHYSBFIMWSEC-OEHDJSDTSA-N 0.000 claims description 20
- 102000039446 nucleic acids Human genes 0.000 claims description 20
- 108020004707 nucleic acids Proteins 0.000 claims description 20
- BKQLBIWHYYCPJF-UHFFFAOYSA-N Yesanchinoside E Natural products OC1C(O)C(O)C(C)OC1OC1C(OC2C3C(C)(C)C(O)CCC3(C)C3C(C4(CCC(C4C(O)C3)C(C)(CCC=C(C)C)OC3C(C(O)C(O)C(COC4C(C(O)C(O)C(CO)O4)O)O3)O)C)(C)C2)OC(CO)C(O)C1O BKQLBIWHYYCPJF-UHFFFAOYSA-N 0.000 claims description 16
- 239000013604 expression vector Substances 0.000 claims description 11
- YPUHYSBFIMWSEC-UHFFFAOYSA-N notoginsenoside-R3 Natural products C1CC(C2(CC(C3C(C)(C)C(O)CCC3(C)C2CC2O)OC3C(C(O)C(O)C(CO)O3)O)C)(C)C2C1C(C)(CCC=C(C)C)OC(C(C(O)C1O)O)OC1COC1OC(CO)C(O)C(O)C1O YPUHYSBFIMWSEC-UHFFFAOYSA-N 0.000 claims description 10
- 239000002243 precursor Substances 0.000 claims description 10
- 238000012546 transfer Methods 0.000 claims description 10
- 238000002744 homologous recombination Methods 0.000 claims description 7
- 230000006801 homologous recombination Effects 0.000 claims description 7
- 125000000969 xylosyl group Chemical group C1([C@H](O)[C@@H](O)[C@H](O)CO1)* 0.000 claims description 7
- 125000000089 arabinosyl group Chemical group C1([C@@H](O)[C@H](O)[C@H](O)CO1)* 0.000 claims description 6
- 108010076504 Protein Sorting Signals Proteins 0.000 claims description 4
- 238000012258 culturing Methods 0.000 claims description 4
- 230000003834 intracellular effect Effects 0.000 claims description 2
- 239000012713 reactive precursor Substances 0.000 claims description 2
- WOSQVRAUHHPIKD-DEZHIRTDSA-N OP(O)(=O)OP(O)(O)=O.C[C@H](O)[C@H](O)[C@@H](O)[C@@H](O)C=O Chemical compound OP(O)(=O)OP(O)(O)=O.C[C@H](O)[C@H](O)[C@@H](O)[C@@H](O)C=O WOSQVRAUHHPIKD-DEZHIRTDSA-N 0.000 claims 10
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 claims 10
- UHDGCWIWMRVCDJ-UHFFFAOYSA-N 1-beta-D-Xylofuranosyl-NH-Cytosine Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(CO)O1 UHDGCWIWMRVCDJ-UHFFFAOYSA-N 0.000 claims 5
- 239000002126 C01EB10 - Adenosine Substances 0.000 claims 5
- WUKAYDLASGHIDN-VQFINVRFSA-N C[C@H](O)[C@H](O)[C@@H](O)[C@@H](O)C=O.C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](COP(O)(=O)OP(O)(O)=O)[C@@H](O)[C@H]1O Chemical compound C[C@H](O)[C@H](O)[C@@H](O)[C@@H](O)C=O.C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](COP(O)(=O)OP(O)(O)=O)[C@@H](O)[C@H]1O WUKAYDLASGHIDN-VQFINVRFSA-N 0.000 claims 5
- UHDGCWIWMRVCDJ-PSQAKQOGSA-N Cytidine Natural products O=C1N=C(N)C=CN1[C@@H]1[C@@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-PSQAKQOGSA-N 0.000 claims 5
- 229960005305 adenosine Drugs 0.000 claims 5
- UHDGCWIWMRVCDJ-ZAKLUEHWSA-N cytidine Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-ZAKLUEHWSA-N 0.000 claims 5
- ZOSQFDVXNQFKBY-CGAXJHMRSA-N dTDP-beta-L-rhamnose Chemical compound O[C@@H]1[C@H](O)[C@@H](O)[C@H](C)O[C@@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)C[C@H](N2C(NC(=O)C(C)=C2)=O)O1 ZOSQFDVXNQFKBY-CGAXJHMRSA-N 0.000 claims 5
- 125000003275 alpha amino acid group Chemical group 0.000 claims 4
- 108700014210 glycosyltransferase activity proteins Proteins 0.000 abstract description 32
- 229930182494 ginsenoside Natural products 0.000 abstract description 25
- 229940089161 ginsenoside Drugs 0.000 abstract description 24
- 239000000758 substrate Substances 0.000 abstract description 19
- SHZGCJCMOBCMKK-UHFFFAOYSA-N D-mannomethylose Natural products CC1OC(O)C(O)C(O)C1O SHZGCJCMOBCMKK-UHFFFAOYSA-N 0.000 abstract description 18
- PNNNRSAQSRJVSB-UHFFFAOYSA-N L-rhamnose Natural products CC(O)C(O)C(O)C(O)C=O PNNNRSAQSRJVSB-UHFFFAOYSA-N 0.000 abstract description 18
- 230000003197 catalytic effect Effects 0.000 abstract description 14
- 230000013595 glycosylation Effects 0.000 abstract description 8
- 238000006206 glycosylation reaction Methods 0.000 abstract description 8
- 238000010276 construction Methods 0.000 abstract description 3
- 230000006172 rhamnosylation Effects 0.000 abstract description 2
- 102000045442 glycosyltransferase activity proteins Human genes 0.000 abstract 2
- 210000004027 cell Anatomy 0.000 description 74
- 108090000623 proteins and genes Proteins 0.000 description 35
- 102000004190 Enzymes Human genes 0.000 description 31
- 108090000790 Enzymes Proteins 0.000 description 31
- 244000131316 Panax pseudoginseng Species 0.000 description 31
- 239000000047 product Substances 0.000 description 29
- BBEUDPAEKGPXDG-UHFFFAOYSA-N protopanaxatriol Natural products CC(CCC=C(C)C)C1CCC2(C)C1C(O)CC3C4(C)CCC(O)C(C)(C)C4C(O)CC23C BBEUDPAEKGPXDG-UHFFFAOYSA-N 0.000 description 28
- 241000588724 Escherichia coli Species 0.000 description 27
- 235000003140 Panax quinquefolius Nutrition 0.000 description 25
- 235000005035 Panax pseudoginseng ssp. pseudoginseng Nutrition 0.000 description 24
- 235000008434 ginseng Nutrition 0.000 description 24
- 241000196324 Embryophyta Species 0.000 description 23
- 230000000694 effects Effects 0.000 description 23
- 229930182490 saponin Natural products 0.000 description 23
- 235000017709 saponins Nutrition 0.000 description 23
- 150000001413 amino acids Chemical group 0.000 description 21
- 102000004169 proteins and genes Human genes 0.000 description 21
- SHCBCKBYTHZQGZ-DLHMIPLTSA-N protopanaxatriol Chemical compound C1C[C@H](O)C(C)(C)[C@@H]2[C@@H](O)C[C@@]3(C)[C@]4(C)CC[C@H]([C@](C)(O)CCC=C(C)C)[C@H]4[C@H](O)C[C@@H]3[C@]21C SHCBCKBYTHZQGZ-DLHMIPLTSA-N 0.000 description 20
- 235000018102 proteins Nutrition 0.000 description 19
- 230000015572 biosynthetic process Effects 0.000 description 18
- 230000014509 gene expression Effects 0.000 description 18
- 238000005918 transglycosylation reaction Methods 0.000 description 18
- 108020004414 DNA Proteins 0.000 description 17
- 239000001397 quillaja saponaria molina bark Substances 0.000 description 17
- 239000012634 fragment Substances 0.000 description 14
- 239000006228 supernatant Substances 0.000 description 14
- 238000004809 thin layer chromatography Methods 0.000 description 14
- 239000006166 lysate Substances 0.000 description 13
- 150000007949 saponins Chemical class 0.000 description 13
- 239000000937 glycosyl acceptor Substances 0.000 description 12
- 238000004128 high performance liquid chromatography Methods 0.000 description 11
- 229940024606 amino acid Drugs 0.000 description 10
- 125000000539 amino acid group Chemical group 0.000 description 10
- 239000013612 plasmid Substances 0.000 description 10
- 235000000346 sugar Nutrition 0.000 description 10
- 150000003648 triterpenes Chemical class 0.000 description 10
- HSCJRCZFDFQWRP-JZMIEXBBSA-N UDP-alpha-D-glucose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-JZMIEXBBSA-N 0.000 description 9
- HSCJRCZFDFQWRP-UHFFFAOYSA-N Uridindiphosphoglukose Natural products OC1C(O)C(O)C(CO)OC1OP(O)(=O)OP(O)(=O)OCC1C(O)C(O)C(N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-UHFFFAOYSA-N 0.000 description 9
- 235000001014 amino acid Nutrition 0.000 description 9
- 238000006467 substitution reaction Methods 0.000 description 9
- XTWYTFMLZFPYCI-KQYNXXCUSA-N 5'-adenylphosphoric acid Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP(O)(=O)OP(O)(O)=O)[C@@H](O)[C@H]1O XTWYTFMLZFPYCI-KQYNXXCUSA-N 0.000 description 8
- XTWYTFMLZFPYCI-UHFFFAOYSA-N Adenosine diphosphate Natural products C1=NC=2C(N)=NC=NC=2N1C1OC(COP(O)(=O)OP(O)(O)=O)C(O)C1O XTWYTFMLZFPYCI-UHFFFAOYSA-N 0.000 description 8
- ZWIADYZPOWUWEW-XVFCMESISA-N CDP Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(O)=O)O1 ZWIADYZPOWUWEW-XVFCMESISA-N 0.000 description 8
- 108091033409 CRISPR Proteins 0.000 description 8
- QGWNDRXFNXRZMB-UUOKFMHZSA-N GDP Chemical compound C1=2NC(N)=NC(=O)C=2N=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(O)=O)[C@@H](O)[C@H]1O QGWNDRXFNXRZMB-UUOKFMHZSA-N 0.000 description 8
- XCCTYIAWTASOJW-XVFCMESISA-N Uridine-5'-Diphosphate Chemical compound O[C@@H]1[C@H](O)[C@@H](COP(O)(=O)OP(O)(O)=O)O[C@H]1N1C(=O)NC(=O)C=C1 XCCTYIAWTASOJW-XVFCMESISA-N 0.000 description 8
- UJLXYODCHAELLY-XLPZGREQSA-N dTDP Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(O)=O)[C@@H](O)C1 UJLXYODCHAELLY-XLPZGREQSA-N 0.000 description 8
- 210000003527 eukaryotic cell Anatomy 0.000 description 8
- QGWNDRXFNXRZMB-UHFFFAOYSA-N guanidine diphosphate Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1OC(COP(O)(=O)OP(O)(O)=O)C(O)C1O QGWNDRXFNXRZMB-UHFFFAOYSA-N 0.000 description 8
- XCCTYIAWTASOJW-UHFFFAOYSA-N UDP-Glc Natural products OC1C(O)C(COP(O)(=O)OP(O)(O)=O)OC1N1C(=O)NC(=O)C=C1 XCCTYIAWTASOJW-UHFFFAOYSA-N 0.000 description 7
- 230000004071 biological effect Effects 0.000 description 7
- 239000013592 cell lysate Substances 0.000 description 7
- 238000000338 in vitro Methods 0.000 description 7
- 238000003786 synthesis reaction Methods 0.000 description 7
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 6
- 241000180649 Panax notoginseng Species 0.000 description 6
- 235000003143 Panax notoginseng Nutrition 0.000 description 6
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 6
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 6
- 230000006870 function Effects 0.000 description 6
- 230000001976 improved effect Effects 0.000 description 6
- 238000002360 preparation method Methods 0.000 description 6
- 238000012216 screening Methods 0.000 description 6
- 230000006098 transglycosylation Effects 0.000 description 6
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 5
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 5
- 235000003181 Panax pseudoginseng Nutrition 0.000 description 5
- 239000000654 additive Substances 0.000 description 5
- 230000001580 bacterial effect Effects 0.000 description 5
- PYXFVCFISTUSOO-UHFFFAOYSA-N betulafolienetriol Natural products C1CC(O)C(C)(C)C2CCC3(C)C4(C)CCC(C(C)(O)CCC=C(C)C)C4C(O)CC3C21C PYXFVCFISTUSOO-UHFFFAOYSA-N 0.000 description 5
- 238000012217 deletion Methods 0.000 description 5
- 230000037430 deletion Effects 0.000 description 5
- 239000003623 enhancer Substances 0.000 description 5
- 238000003780 insertion Methods 0.000 description 5
- 230000037431 insertion Effects 0.000 description 5
- 238000004519 manufacturing process Methods 0.000 description 5
- 230000035772 mutation Effects 0.000 description 5
- 239000002773 nucleotide Substances 0.000 description 5
- 125000003729 nucleotide group Chemical group 0.000 description 5
- 210000001236 prokaryotic cell Anatomy 0.000 description 5
- SWQINCWATANGKN-UHFFFAOYSA-N protopanaxadiol Natural products CC(CCC=C(C)C)C1CCC2(C)C1C(O)CC1C3(C)CCC(O)C(C)(C)C3CCC21C SWQINCWATANGKN-UHFFFAOYSA-N 0.000 description 5
- 239000000126 substance Substances 0.000 description 5
- 230000009261 transgenic effect Effects 0.000 description 5
- 238000001262 western blot Methods 0.000 description 5
- 108091028043 Nucleic acid sequence Proteins 0.000 description 4
- 101000932810 Panax ginseng Dammarenediol 12-hydroxylase Proteins 0.000 description 4
- 101000722816 Panax ginseng Dammarenediol II synthase Proteins 0.000 description 4
- 108020004511 Recombinant DNA Proteins 0.000 description 4
- 230000000996 additive effect Effects 0.000 description 4
- 238000000246 agarose gel electrophoresis Methods 0.000 description 4
- 238000006555 catalytic reaction Methods 0.000 description 4
- 238000005119 centrifugation Methods 0.000 description 4
- 238000010367 cloning Methods 0.000 description 4
- 239000000386 donor Substances 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 230000002255 enzymatic effect Effects 0.000 description 4
- 230000004927 fusion Effects 0.000 description 4
- 239000000499 gel Substances 0.000 description 4
- 239000008103 glucose Substances 0.000 description 4
- 238000013508 migration Methods 0.000 description 4
- 230000005012 migration Effects 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- PYXFVCFISTUSOO-HKUCOEKDSA-N (20S)-protopanaxadiol Chemical compound C1C[C@H](O)C(C)(C)[C@@H]2CC[C@@]3(C)[C@]4(C)CC[C@H]([C@@](C)(O)CCC=C(C)C)[C@H]4[C@H](O)C[C@@H]3[C@]21C PYXFVCFISTUSOO-HKUCOEKDSA-N 0.000 description 3
- 238000010354 CRISPR gene editing Methods 0.000 description 3
- NLHQJXWYMZLQJY-UHFFFAOYSA-N Dammarendiol Natural products C1CC(O)C(C)(C)C2CCC3(C)C4(C)CCC(C(C)(O)CCC=C(C)C)C4CCC3C21C NLHQJXWYMZLQJY-UHFFFAOYSA-N 0.000 description 3
- 102000004357 Transferases Human genes 0.000 description 3
- 108090000992 Transferases Proteins 0.000 description 3
- DRDCJEIZVLVWNC-SLBWPEPYSA-N UDP-beta-L-rhamnose Chemical compound O[C@@H]1[C@H](O)[C@@H](O)[C@H](C)O[C@@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(NC(=O)C=C2)=O)O1 DRDCJEIZVLVWNC-SLBWPEPYSA-N 0.000 description 3
- 238000007792 addition Methods 0.000 description 3
- 239000011543 agarose gel Substances 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 3
- 238000003556 assay Methods 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 230000008827 biological function Effects 0.000 description 3
- 230000000295 complement effect Effects 0.000 description 3
- 239000002299 complementary DNA Substances 0.000 description 3
- NLHQJXWYMZLQJY-TXNIMPHESA-N dammarenediol-II Chemical compound C1C[C@H](O)C(C)(C)[C@@H]2CC[C@@]3(C)[C@]4(C)CC[C@H]([C@@](C)(O)CCC=C(C)C)[C@H]4CC[C@@H]3[C@]21C NLHQJXWYMZLQJY-TXNIMPHESA-N 0.000 description 3
- 239000013613 expression plasmid Substances 0.000 description 3
- 238000000605 extraction Methods 0.000 description 3
- 230000002068 genetic effect Effects 0.000 description 3
- 239000000178 monomer Substances 0.000 description 3
- 239000013641 positive control Substances 0.000 description 3
- 238000010188 recombinant method Methods 0.000 description 3
- 230000001172 regenerating effect Effects 0.000 description 3
- 230000010076 replication Effects 0.000 description 3
- 230000003248 secreting effect Effects 0.000 description 3
- 238000012163 sequencing technique Methods 0.000 description 3
- 238000013518 transcription Methods 0.000 description 3
- 230000035897 transcription Effects 0.000 description 3
- 230000001131 transforming effect Effects 0.000 description 3
- PLQMEXSCSAIXGB-SAXRGWBVSA-N (+)-artemisinic acid Chemical compound C1=C(C)CC[C@H]2[C@H](C)CC[C@@H](C(=C)C(O)=O)[C@H]21 PLQMEXSCSAIXGB-SAXRGWBVSA-N 0.000 description 2
- INLFWQCRAJUDCR-IQVMEADQSA-N (1R,2S,4S,5'S,6R,7S,8R,9S,12S,13S)-5',7,9,13-tetramethylspiro[5-oxapentacyclo[10.8.0.02,9.04,8.013,18]icosane-6,2'-oxane] Chemical compound O([C@@H]1[C@@H]([C@]2(CC[C@@H]3[C@@]4(C)CCCCC4CC[C@H]3[C@@H]2C1)C)[C@@H]1C)[C@]11CC[C@H](C)CO1 INLFWQCRAJUDCR-IQVMEADQSA-N 0.000 description 2
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- 108091033380 Coding strand Proteins 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 2
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 2
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 2
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 2
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 2
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 2
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 2
- 229930189092 Notoginsenoside Natural products 0.000 description 2
- FNIRVWPHRMMRQI-PGOMJGFXSA-N Notoginsenoside R2 Chemical compound O([C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@H]2C(C)(C)[C@@H](O)CC[C@]2(C)[C@H]2C[C@@H](O)[C@H]3[C@@]([C@@]2(C1)C)(C)CC[C@@H]3[C@@](C)(O)CCC=C(C)C)[C@@H]1OC[C@@H](O)[C@H](O)[C@H]1O FNIRVWPHRMMRQI-PGOMJGFXSA-N 0.000 description 2
- 101710163270 Nuclease Proteins 0.000 description 2
- 108090000854 Oxidoreductases Proteins 0.000 description 2
- 101000644012 Panax ginseng UDP-glycosyltransferase 100 Proteins 0.000 description 2
- 108091005804 Peptidases Proteins 0.000 description 2
- 239000002202 Polyethylene glycol Substances 0.000 description 2
- 239000004365 Protease Substances 0.000 description 2
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 2
- 241000187747 Streptomyces Species 0.000 description 2
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 2
- 239000004473 Threonine Substances 0.000 description 2
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 2
- DQQDLYVHOTZLOR-UHFFFAOYSA-N UDP-alpha-D-xylose Natural products O1C(N2C(NC(=O)C=C2)=O)C(O)C(O)C1COP(O)(=O)OP(O)(=O)OC1OCC(O)C(O)C1O DQQDLYVHOTZLOR-UHFFFAOYSA-N 0.000 description 2
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- 229960000723 ampicillin Drugs 0.000 description 2
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 2
- 239000000427 antigen Substances 0.000 description 2
- 102000036639 antigens Human genes 0.000 description 2
- 108091007433 antigens Proteins 0.000 description 2
- 230000000975 bioactive effect Effects 0.000 description 2
- 238000010170 biological method Methods 0.000 description 2
- 239000008280 blood Substances 0.000 description 2
- 210000004369 blood Anatomy 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 239000013599 cloning vector Substances 0.000 description 2
- OORMXZNMRWBSTK-LGFJJATJSA-N dammarane Chemical compound C1CCC(C)(C)[C@@H]2CC[C@@]3(C)[C@]4(C)CC[C@H]([C@H](C)CCCC(C)C)[C@H]4CC[C@@H]3[C@]21C OORMXZNMRWBSTK-LGFJJATJSA-N 0.000 description 2
- 206010012601 diabetes mellitus Diseases 0.000 description 2
- WQLVFSAGQJTQCK-UHFFFAOYSA-N diosgenin Natural products CC1C(C2(CCC3C4(C)CCC(O)CC4=CCC3C2C2)C)C2OC11CCC(C)CO1 WQLVFSAGQJTQCK-UHFFFAOYSA-N 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 238000003028 enzyme activity measurement method Methods 0.000 description 2
- 238000001952 enzyme assay Methods 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 238000000855 fermentation Methods 0.000 description 2
- 230000004151 fermentation Effects 0.000 description 2
- 102000037865 fusion proteins Human genes 0.000 description 2
- 108020001507 fusion proteins Proteins 0.000 description 2
- 238000003198 gene knock in Methods 0.000 description 2
- 239000005090 green fluorescent protein Substances 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 2
- 230000001965 increasing effect Effects 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 229960000318 kanamycin Drugs 0.000 description 2
- 229930027917 kanamycin Natural products 0.000 description 2
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 2
- 229930182823 kanamycin A Natural products 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 230000037353 metabolic pathway Effects 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 238000009335 monocropping Methods 0.000 description 2
- 150000002772 monosaccharides Chemical class 0.000 description 2
- 229930014626 natural product Natural products 0.000 description 2
- FNIRVWPHRMMRQI-UHFFFAOYSA-N notoginsenoside-R2 Natural products CC(C)=CCCC(C)(O)C1CCC(C2(C3)C)(C)C1C(O)CC2C1(C)CCC(O)C(C)(C)C1C3OC1OC(CO)C(O)C(O)C1OC1OCC(O)C(O)C1O FNIRVWPHRMMRQI-UHFFFAOYSA-N 0.000 description 2
- 230000037361 pathway Effects 0.000 description 2
- 239000000575 pesticide Substances 0.000 description 2
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 2
- 229920001223 polyethylene glycol Polymers 0.000 description 2
- 230000002797 proteolythic effect Effects 0.000 description 2
- 238000002708 random mutagenesis Methods 0.000 description 2
- 239000002994 raw material Substances 0.000 description 2
- 238000003259 recombinant expression Methods 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- NWMIYTWHUDFRPL-UHFFFAOYSA-N sapogenin Natural products COC(=O)C1(CO)C(O)CCC2(C)C1CCC3(C)C2CC=C4C5C(C)(O)C(C)CCC5(CCC34C)C(=O)O NWMIYTWHUDFRPL-UHFFFAOYSA-N 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 125000001424 substituent group Chemical group 0.000 description 2
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 2
- 239000004474 valine Substances 0.000 description 2
- 210000005253 yeast cell Anatomy 0.000 description 2
- 101710135150 (+)-T-muurolol synthase ((2E,6E)-farnesyl diphosphate cyclizing) Proteins 0.000 description 1
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- DLGAUVSRZXNATA-DHYYHALDSA-N (2s,3s)-2-amino-3-methylpentanoic acid;(2s)-pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1.CC[C@H](C)[C@H](N)C(O)=O DLGAUVSRZXNATA-DHYYHALDSA-N 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- FBFMBWCLBGQEBU-GYMUUCMZSA-N 20-gluco-ginsenoside-Rf Natural products O([C@](CC/C=C(\C)/C)(C)[C@@H]1[C@H]2[C@H](O)C[C@H]3[C@](C)([C@]2(C)CC1)C[C@H](O[C@@H]1[C@H](O[C@H]2[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O2)[C@@H](O)[C@H](O)[C@@H](CO)O1)[C@H]1C(C)(C)[C@@H](O)CC[C@]31C)[C@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 FBFMBWCLBGQEBU-GYMUUCMZSA-N 0.000 description 1
- PUERAZFQIXLYDR-CQNOQWDGSA-N 3-[(4e,8e,12e,16e)-4,8,13,17,21-pentamethyldocosa-4,8,12,16,20-pentaenylidene]oxetane Chemical compound CC(C)=CCC\C(C)=C\CC\C(C)=C\CC\C=C(/C)CC\C=C(/C)CCC=C1COC1 PUERAZFQIXLYDR-CQNOQWDGSA-N 0.000 description 1
- MIJYXULNPSFWEK-GTOFXWBISA-N 3beta-hydroxyolean-12-en-28-oic acid Chemical compound C1C[C@H](O)C(C)(C)[C@@H]2CC[C@@]3(C)[C@]4(C)CC[C@@]5(C(O)=O)CCC(C)(C)C[C@H]5C4=CC[C@@H]3[C@]21C MIJYXULNPSFWEK-GTOFXWBISA-N 0.000 description 1
- BSLYZLYLUUIFGZ-JRUDBKCSSA-N 4,4,14-trimethyl-9,19-cyclo-5alpha,9beta-cholestane Chemical compound C1CCC(C)(C)[C@H]2[C@@]31C[C@@]13CC[C@]3(C)[C@@H]([C@H](C)CCCC(C)C)CC[C@@]3(C)[C@@H]1CC2 BSLYZLYLUUIFGZ-JRUDBKCSSA-N 0.000 description 1
- 208000024827 Alzheimer disease Diseases 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 108091062157 Cis-regulatory element Proteins 0.000 description 1
- RRTBTJPVUGMUNR-UHFFFAOYSA-N Cycloartanol Natural products C12CCC(C(C(O)CC3)(C)C)C3C2(CC)CCC2(C)C1(C)CCC2C(C)CCCC(C)C RRTBTJPVUGMUNR-UHFFFAOYSA-N 0.000 description 1
- 102000002004 Cytochrome P-450 Enzyme System Human genes 0.000 description 1
- 108010015742 Cytochrome P-450 Enzyme System Proteins 0.000 description 1
- 150000008574 D-amino acids Chemical class 0.000 description 1
- 230000006820 DNA synthesis Effects 0.000 description 1
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 1
- JKLISIRFYWXLQG-UHFFFAOYSA-N Epioleonolsaeure Natural products C1CC(O)C(C)(C)C2CCC3(C)C4(C)CCC5(C(O)=O)CCC(C)(C)CC5C4CCC3C21C JKLISIRFYWXLQG-UHFFFAOYSA-N 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- 101710119400 Geranylfarnesyl diphosphate synthase Proteins 0.000 description 1
- 101710107752 Geranylgeranyl diphosphate synthase Proteins 0.000 description 1
- UZIOUZHBUYLDHW-MSJHMJQNSA-N Ginsenoside Rf Natural products O([C@H]1[C@@H](O)[C@H](O)[C@H](CO)O[C@@H]1O[C@@H]1[C@H]2C(C)(C)[C@@H](O)CC[C@]2(C)[C@@H]2[C@](C)([C@@]3(C)[C@H]([C@@H](O)C2)[C@@H]([C@@](O)(CC/C=C(\C)/C)C)CC3)C1)[C@H]1[C@@H](O)[C@@H](O)[C@H](O)[C@H](CO)O1 UZIOUZHBUYLDHW-MSJHMJQNSA-N 0.000 description 1
- 102400000322 Glucagon-like peptide 1 Human genes 0.000 description 1
- DTHNMHAUYICORS-KTKZVXAJSA-N Glucagon-like peptide 1 Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(N)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC=1N=CNC=1)[C@@H](C)O)[C@@H](C)O)C(C)C)C1=CC=CC=C1 DTHNMHAUYICORS-KTKZVXAJSA-N 0.000 description 1
- 101800000224 Glucagon-like peptide 1 Proteins 0.000 description 1
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 108020005004 Guide RNA Proteins 0.000 description 1
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- 150000008575 L-amino acids Chemical class 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- 239000012880 LB liquid culture medium Substances 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 241001529936 Murinae Species 0.000 description 1
- 102000008300 Mutant Proteins Human genes 0.000 description 1
- 108010021466 Mutant Proteins Proteins 0.000 description 1
- 101710135898 Myc proto-oncogene protein Proteins 0.000 description 1
- 102100038895 Myc proto-oncogene protein Human genes 0.000 description 1
- 101710118186 Neomycin resistance protein Proteins 0.000 description 1
- YBRJHZPWOMJYKQ-UHFFFAOYSA-N Oleanolic acid Natural products CC1(C)CC2C3=CCC4C5(C)CCC(O)C(C)(C)C5CCC4(C)C3(C)CCC2(C1)C(=O)O YBRJHZPWOMJYKQ-UHFFFAOYSA-N 0.000 description 1
- MIJYXULNPSFWEK-UHFFFAOYSA-N Oleanolinsaeure Natural products C1CC(O)C(C)(C)C2CCC3(C)C4(C)CCC5(C(O)=O)CCC(C)(C)CC5C4=CCC3C21C MIJYXULNPSFWEK-UHFFFAOYSA-N 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 241000208343 Panax Species 0.000 description 1
- 101000644025 Panax ginseng UDP-glycosyltransferase 1 Proteins 0.000 description 1
- 240000005373 Panax quinquefolius Species 0.000 description 1
- QFJUYMMIBFBOJY-UXZRXANASA-N Panaxatriol Chemical compound C[C@]1([C@H]2CC[C@@]3([C@@H]2[C@H](O)C[C@H]2[C@]3(C[C@@H](O)[C@H]3C(C)(C)[C@@H](O)CC[C@@]32C)C)C)CCCC(C)(C)O1 QFJUYMMIBFBOJY-UXZRXANASA-N 0.000 description 1
- VIXIMKLMEZTTTC-UHFFFAOYSA-N Panaxatriol Natural products CC1(C)CCCC(O1)C2CCC3(C)C2C(O)CC4C5(C)CCC(O)C(C)(C)C5C(O)CC34C VIXIMKLMEZTTTC-UHFFFAOYSA-N 0.000 description 1
- 240000000220 Panda oleosa Species 0.000 description 1
- 235000016496 Panda oleosa Nutrition 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 102000018120 Recombinases Human genes 0.000 description 1
- 108010091086 Recombinases Proteins 0.000 description 1
- 241000293869 Salmonella enterica subsp. enterica serovar Typhimurium Species 0.000 description 1
- 206010039966 Senile dementia Diseases 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- 108010022394 Threonine synthase Proteins 0.000 description 1
- 102000006601 Thymidine Kinase Human genes 0.000 description 1
- 108020004440 Thymidine kinase Proteins 0.000 description 1
- 101710150448 Transcriptional regulator Myc Proteins 0.000 description 1
- DQQDLYVHOTZLOR-OCIMBMBZSA-N UDP-alpha-D-xylose Chemical compound C([C@@H]1[C@H]([C@H]([C@@H](O1)N1C(NC(=O)C=C1)=O)O)O)OP(O)(=O)OP(O)(=O)O[C@H]1OC[C@@H](O)[C@H](O)[C@H]1O DQQDLYVHOTZLOR-OCIMBMBZSA-N 0.000 description 1
- 241000607479 Yersinia pestis Species 0.000 description 1
- JCPSMIOSLWKUPV-XQQPQPTDSA-N [[(3r,4r,5s)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-[(2s,3s,4r,5r)-5-(2,4-dioxopyrimidin-1-yl)-3,4-dihydroxyoxolan-2-yl]methyl] phosphono hydrogen phosphate Chemical compound O[C@@H]1[C@@H](O)[C@H](CO)OC1C(OP(O)(=O)OP(O)(O)=O)[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(NC(=O)C=C2)=O)O1 JCPSMIOSLWKUPV-XQQPQPTDSA-N 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 238000005377 adsorption chromatography Methods 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 229930191701 arteannuin Natural products 0.000 description 1
- BLUAFEHZUWYNDE-NNWCWBAJSA-N artemisinin Chemical compound C([C@](OO1)(C)O2)C[C@H]3[C@H](C)CC[C@@H]4[C@@]31[C@@H]2OC(=O)[C@@H]4C BLUAFEHZUWYNDE-NNWCWBAJSA-N 0.000 description 1
- 229960004191 artemisinin Drugs 0.000 description 1
- PLQMEXSCSAIXGB-UHFFFAOYSA-N artemisininic acid Natural products C1=C(C)CCC2C(C)CCC(C(=C)C(O)=O)C21 PLQMEXSCSAIXGB-UHFFFAOYSA-N 0.000 description 1
- 125000003118 aryl group Chemical group 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 239000006227 byproduct Substances 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 239000003054 catalyst Substances 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- ZYZJWAJOTPNVPI-ZVBSCDOUSA-N cucurbitane Chemical compound C([C@H]1[C@]2(C)CC[C@@H]([C@]2(CC[C@]11C)C)[C@H](C)CCCC(C)C)CC2[C@H]1CCCC2(C)C ZYZJWAJOTPNVPI-ZVBSCDOUSA-N 0.000 description 1
- YNBJLDSWFGUFRT-UHFFFAOYSA-N cycloartenol Natural products CC(CCC=C(C)C)C1CCC2(C)C1(C)CCC34CC35CCC(O)C(C)(C)C5CCC24C YNBJLDSWFGUFRT-UHFFFAOYSA-N 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- JYGAZEJXUVDYHI-DGTMBMJNSA-N dihydroartemisinic acid Chemical compound C1CC(C)=C[C@@H]2[C@H]([C@@H](C)C(O)=O)CC[C@@H](C)[C@@H]21 JYGAZEJXUVDYHI-DGTMBMJNSA-N 0.000 description 1
- JYGAZEJXUVDYHI-UHFFFAOYSA-N dihydroartemisininic acid Natural products C1CC(C)=CC2C(C(C)C(O)=O)CCC(C)C21 JYGAZEJXUVDYHI-UHFFFAOYSA-N 0.000 description 1
- 102000004419 dihydrofolate reductase Human genes 0.000 description 1
- 239000001177 diphosphate Substances 0.000 description 1
- 235000011180 diphosphates Nutrition 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 230000002124 endocrine Effects 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- VAXXNBLRJUUBPR-UHFFFAOYSA-N ginsenoside F3 Natural products CC(=CCCC(C)(O)C1CCC2(C)C1C(O)CC3C4(C)CCC(O)C(C)(C)C4C(CC23C)OC5OC(COC6OCC(O)C(O)C6O)C(O)C(O)C5O)C VAXXNBLRJUUBPR-UHFFFAOYSA-N 0.000 description 1
- UZIOUZHBUYLDHW-XUBRWZAZSA-N ginsenoside Rf Chemical compound O([C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@H]2C(C)(C)[C@@H](O)CC[C@]2(C)[C@H]2C[C@@H](O)[C@H]3[C@@]([C@@]2(C1)C)(C)CC[C@@H]3[C@@](C)(O)CCC=C(C)C)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O UZIOUZHBUYLDHW-XUBRWZAZSA-N 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 229930186669 gynosaponin Natural products 0.000 description 1
- 230000004217 heart function Effects 0.000 description 1
- 230000007062 hydrolysis Effects 0.000 description 1
- 238000006460 hydrolysis reaction Methods 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- 230000014726 immortalization of host cell Effects 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 230000000968 intestinal effect Effects 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 238000004811 liquid chromatography Methods 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 201000001441 melanoma Diseases 0.000 description 1
- 230000007087 memory ability Effects 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 239000002808 molecular sieve Substances 0.000 description 1
- 239000003471 mutagenic agent Substances 0.000 description 1
- 231100000707 mutagenic chemical Toxicity 0.000 description 1
- 230000004770 neurodegeneration Effects 0.000 description 1
- 208000015122 neurodegenerative disease Diseases 0.000 description 1
- 239000002777 nucleoside Substances 0.000 description 1
- 229940100243 oleanolic acid Drugs 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 230000003204 osmotic effect Effects 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- HZLWUYJLOIAQFC-UHFFFAOYSA-N prosapogenin PS-A Natural products C12CC(C)(C)CCC2(C(O)=O)CCC(C2(CCC3C4(C)C)C)(C)C1=CCC2C3(C)CCC4OC1OCC(O)C(O)C1O HZLWUYJLOIAQFC-UHFFFAOYSA-N 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 239000012264 purified product Substances 0.000 description 1
- 238000004153 renaturation Methods 0.000 description 1
- 238000012827 research and development Methods 0.000 description 1
- 238000010839 reverse transcription Methods 0.000 description 1
- 238000005185 salting out Methods 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- URGAHOPLAPQHLN-UHFFFAOYSA-N sodium aluminosilicate Chemical compound [Na+].[Al+3].[O-][Si]([O-])=O.[O-][Si]([O-])=O URGAHOPLAPQHLN-UHFFFAOYSA-N 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 230000001954 sterilising effect Effects 0.000 description 1
- 238000004659 sterilization and disinfection Methods 0.000 description 1
- 230000004960 subcellular localization Effects 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 230000002463 transducing effect Effects 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1048—Glycosyltransferases (2.4)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N5/00—Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
- C12N5/10—Cells modified by introduction of foreign genetic material
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P33/00—Preparation of steroids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P33/00—Preparation of steroids
- C12P33/20—Preparation of steroids containing heterocyclic rings
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Medicinal Chemistry (AREA)
- Cell Biology (AREA)
- Enzymes And Modification Thereof (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
The invention provides a rhamnose highly specific glycosyltransferase and application thereof. The invention discloses a specific glycosyltransferase for the first time, which can catalyze a substrate to carry out rhamnosylation at a specific position and has high catalytic activity. In particular, the specific glycosyltransferases of the invention are capable of specifically and efficiently catalyzing glycosylation of the C-6 position of a tetracyclic triterpene compound substrate at the first glycosyl group to extend the rhamnose group. The invention also provides mutants of the specific glycosyltransferases. The specific glycosyltransferase has good specificity and high efficiency, can be applied to the construction of artificially synthesized ginsenoside, various new ginsenosides and derivatives thereof, and has good application value in the fields of pharmacy and the like.
Description
The present application claims priority from the application of chinese application No. 202110871374.0 filed at day 7 and 30 of 2021.
The invention relates to the fields of biotechnology and plant biology, in particular to a high-specificity glycosyltransferase of rhamnose and application thereof.
Ginsenoside is the general name of saponin separated from Panax plant (such as Ginseng radix, notoginseng radix, radix Panacis Quinquefolii, etc.) and herba Gynostemmatis, and is a triterpene compound. The ginsenoside can also be called ginsenoside, notoginsenoside and gynosaponin according to the isolated source. Ginsenoside is the main bioactive component in these medicinal plants. Currently, about 150 saponins have been isolated. Structurally, ginsenoside is mainly a bioactive small molecule formed by glycosylation of sapogenin. There are only a limited number of sapogenins of ginsenoside, mainly protopanaxadiol and protopanaxatriol of dammarane type tetracyclic triterpenes, and oleanolic acid. Sapogenins can increase water solubility, alter their subcellular localization, and produce different biological activities after glycosylation. Most protopanaxadiol saponins are glycosylation modified at the C3 and/or C20 hydroxyl groups, while protopanaxatriol saponins are glycosylation modified at the C6 and/or C20 hydroxyl groups. Different types of glycosyl and different degrees of glycosylation modification produce ginsenoside with various molecular structures.
Rhamnosylated modified ginsenoside has abundant bioactivity. For example, rg2 extends a molecule of rhamnose at C6-O-Glc of Rh1, and Rg2 has good effects in treating depression, improving heart function, improving learning and memory ability, resisting senile dementia and the like; ginsenoside Re extends one molecule of rhamnose at C6-O-Glc of Rg1, and can play roles in reducing blood sugar and treating diabetes by promoting secretion of glucagon-like peptide-1 in intestinal tissues.
The ginsenoside is prepared from total or abundant ginsenoside of Ginseng radix or Notoginseng radix by chemical, enzyme and microbial fermentation hydrolysis method. Because wild ginseng resources are basically exhausted, ginsenoside resources are currently derived from artificial cultivation of ginseng or pseudo-ginseng, the artificial cultivation has a long growth period (generally more than 5-7 years), is limited by regions, and is also frequently subjected to diseases and insect pests, and a large amount of pesticides are required to be applied, so that the artificial cultivation of ginseng or pseudo-ginseng has serious continuous cropping obstacles (the cultivation of ginseng or pseudo-ginseng in a place needs to be fallowed for more than 5-15 years to overcome the continuous cropping obstacles), and the yield, quality and safety of the ginsenoside are challenged.
The development of synthetic biology provides a new opportunity for heterologous synthesis of natural products of plant origin. The yeast is taken as a chassis, and the synthesis of the arteannuic acid or the dihydroarteannuic acid by fermenting with cheap monosaccharide through the assembly and optimization of metabolic pathways is realized, and then the arteannuin is produced by a one-step chemical conversion method, which shows that the synthesis biology has great potential in the aspect of the drug synthesis of natural products. The ginsenoside monomer is heterologously synthesized by utilizing yeast chassis cells through a synthetic biological method, the raw materials are cheap monosaccharides, the preparation process is a fermentation process with adjustable safety, and any external pollution (such as pesticides used during artificial planting of raw material plants) is avoided, so that the ginsenoside monomer is prepared through the synthetic biological technology, the cost advantage is realized, and the quality and safety of a finished product can be ensured. The preparation method utilizes the synthetic biological technology to prepare a sufficient amount of various natural and unnatural ginsenoside monomers with high purity, which are used for activity determination and clinical experiments and promote the research and development of innovative medicaments of rare ginsenoside.
In recent years, analysis of the synthetic pathway of ginsenoside has been greatly advanced by research on transcriptomes and functional genomes of ginseng, notoginseng and American ginseng. In 2006, japanese and korean scientists identified the terpene cyclase element da (myrdiol synthase, pgDDS) that converts epoxysqualene to dammarenediol, respectively. From 2011 to 2012, korean scientists have identified cytochrome P450 elements CYP716A4 and CYP716a53v2 that oxidize dammarenediol to protopanaxadiol and further oxidize protopanaxadiol to protopanaxatriol.
The synthetic biological method is utilized to artificially synthesize the ginsenoside with medicinal activity, so that not only the metabolic pathway of the synthesized sapogenin is required to be constructed, but also UDP-glycosyltransferase for catalyzing glycosylation of the ginsenoside is required to be identified. The function of UDP-glycosyltransferases is to transfer the glycosyl group on a glycosyl donor (nucleoside diphosphate sugars such as UDP-glucose, UDP-rhamnose, UDP-xylose and UDP-arabinose) to a different glycosyl acceptor. From the analysis of plant genomes that have been sequenced at present, plant genomes often encode more than hundred different glycosyltransferases. The national students in 2015 identified a UDP-glycosyltransferase element (UGTPG 100) capable of transferring a glucosyl group at the C6 position of the original panaxatriol. Chinese scholars disclose glycosyltransferases (gGT 29-7, etc.) that can extend sugar chains at the C6 position of protopanaxatriol saponins (PCT/CN 2015/081111), for example gGT-7 can use UDP-Xyl to catalyze the C6 position of Rh1 to extend one molecule of xylosyl to generate notoginsenoside R2, can use UDP-Glc to catalyze the C6 position of Rh1 to extend one molecule of glucosyl to generate Rf, but cannot basically use UDP-Rha; mutants gGT-7 (N343G, A359P) of gGT-7 disclosed in patent (PCT/CN 2015/081111) have the advantage of catalyzing extension of the C6 position of Rh1 by UDP-Rha to form Rg2 by one molecule of rhamnosyl, but have very low activity, only about 9% conversion. Besides, gGT-7 (N343G, A359P) can also take UDP-glc as a donor for transglycosylation reaction besides taking UDP-Rha as the donor for transglycosylation reaction, and the catalysis efficiency is higher than that of the catalytic reaction taking UDP-Rha as the glycosyl donor. Therefore, gGT-7 (N343G, A359P) has low and non-specific UDP-Rha catalysis activity, so that the synthesis of a large amount of byproducts cannot meet the application requirements.
Disclosure of Invention
Under the above background, the inventor screens glycosyltransferase URT94-1 and URT94-2 which can extend UPD-rhamnose at C6 position from ginseng, can specifically obtain UDP-Rha as glycosyl donor, and efficiently catalyzes ginsenoside Rh1, ginsenoside Rg1 or Notoginseng radix R3 to extend a molecule of rhamnose at the first glycosyl at C6 position so as to obtain ginsenoside Rg2, ginsenoside Re or Yesanchinoside E respectively. However, URT94-1 and URT94-2 cannot catalyze the above-mentioned saponin substrates using UDP-glucose as a glycosyl donor. Therefore, the glycosyltransferases provide highly specific glycosyltransferases for the efficient preparation of saponins such as ginsenoside Rg2, ginsenoside Re, yesanchinoside E and the like.
In a first aspect of the present invention there is provided a method of linking a rhamnosyl group to the first glycosyl group at the C-6 position of a tetracyclic triterpene(s) compound comprising: transfer is performed with a specific glycosyltransferase having the amino acid sequence of SEQ ID NO: 2. SEQ ID NO:4 or SEQ ID NO:14, or a conservatively variant polypeptide thereof.
In another aspect of the invention there is provided the use of a specific glycosyltransferase for the attachment of a rhamnosyl group on the first glycosyl group in the C-6 position of a tetracyclic triterpene(s) (including as a catalyst for the reaction) which specific glycosyltransferase is one having the amino acid sequence of SEQ ID NO: 2. SEQ ID NO:4 or SEQ ID NO:14, or a conservatively variant polypeptide thereof.
In one or more embodiments, the rhamnosyl is provided by a glycosyl donor; preferably, the glycosyl donor is a glycosyl donor carrying a rhamnose group; more preferably, the glycosyl donor includes (but is not limited to): uridine Diphosphate (UDP) -rhamnose, guanosine Diphosphate (GDP) -rhamnose, adenosine Diphosphate (ADP) -rhamnose, cytidine Diphosphate (CDP) -rhamnose, thymidine Diphosphate (TDP) -rhamnose.
In one or more embodiments, the tetracyclic triterpene compound is a compound of formula (I), and the compound having a glycosyl group attached to the glycosyl group at the C-6 position is a compound of formula (II);
wherein R1 and R2 are H or glycosyl, R3 is monosaccharide glycosyl, and R4 is rhamnosyl; preferably, the glycosyl or monosaccharide glycosyl (R3) is selected from: a glucosyl, xylosyl, arabinosyl or rhamnosyl group;
preferably, when R1 is H, R2 and R3 is glucosyl, the compound of formula (I) is ginsenoside Rg1 and the compound of formula (II) is ginsenoside Re; when R1 and R2 are H, R and are glucosyl, the compound of formula (I) is ginsenoside Rh1, and the compound of formula (II) is ginsenoside Rg2.
In one or more embodiments, the tetracyclic triterpene compound is a compound of formula (III), and the compound having a glycosyl group attached to the glycosyl group at the C-6 position is a compound of formula (IV);
Wherein R1 is H or glycosyl, R2, R3 and R4 are monosaccharide glycosyl, and R5 is rhamnosyl; preferably, the glycosyl (R1) or monosaccharide glycosyl (R2, R3, R4) is selected from: a glucosyl, xylosyl, arabinosyl or rhamnosyl group;
preferably, when R1 is H, R, R3 and R4 are glucosyl, and R5 is rhamnosyl, the compound of formula (III) is notoginsenoside R3, and the compound of formula (IV) is Yesanchinoside E.
In one or more embodiments, the group species, substrate, or product is as follows:
substrate(s) | R1 | R2 | R3 | R4 | Product(s) |
Ginsenoside Rg1 | H | Glc | Glc | Rha | Ginsenoside Re |
Ginsenoside Rh1 | H | H | Glc | Rha | Ginsenoside Rg2 |
In one or more embodiments, the group species, substrate, or product is as follows:
substrate(s) | R1 | R2 | R3 | R4 | R5 | Product(s) |
Notoginseng radix saponin R3 | H | Glc | Glc | Glc | Rha | Yesanchinoside E |
In one or more embodiments, the compounds of formula (I), (III) include, but are not limited to: dammarane type tetracyclic triterpene compounds of S configuration or R configuration, lanolate type tetracyclic triterpene compounds, hydrokansuine (apoirucalane) type tetracyclic triterpene compounds, kansuine type tetracyclic triterpene compounds, cycloartene (cycloartane) type tetracyclic triterpene compounds, cucurbitane type tetracyclic triterpene compounds, or chinaberrane type tetracyclic triterpene compounds.
In one or more embodiments, the compound of formula (II) or (IV) comprises ginsenoside Rg2, ginsenoside Re, yenanchinoside E.
In another aspect of the invention there is provided a method of intracellular attachment of a rhamnosyl group at the first glycosyl group at the C-6 position of a tetracyclic triterpene(s) compound comprising:
(a) Introducing a tetracyclic triterpene compound reaction precursor or a construct expressing/forming the same, and introducing a specific glycosyltransferase or a construct expressing the same into a host cell to obtain a recombinant host cell; the specific glycosyltransferase is a glycosyltransferase having the sequence of SEQ ID NO: 2. SEQ ID NO:4 or SEQ ID NO:14, or a conservatively variant polypeptide thereof; the presence of a glycosyl donor bearing a rhamnose group or the introduction of a glycosyl donor bearing a rhamnose group (including constructs/precursors capable of forming the glycosyl donor) within the host cell;
(b) Culturing the recombinant host cell of (a) to obtain a tetracyclic triterpene compound having a rhamnosyl group attached to the first glycosyl group at position C-6;
preferably, the tetracyclic triterpene compound reaction precursor comprises: ginsenoside Rg1, ginsenoside Rh1, and notoginsenoside R3; the corresponding products include: ginsenoside Re, ginsenoside Rg2, yesanchinoside E;
Preferably, the glycosyl donor includes (but is not limited to): uridine Diphosphate (UDP) -rhamnose, guanosine Diphosphate (GDP) -rhamnose, adenosine Diphosphate (ADP) -rhamnose, cytidine Diphosphate (CDP) -rhamnose, thymidine Diphosphate (TDP) -rhamnose.
In one or more embodiments, the method further comprises: an additive for regulating the activity of the enzyme is provided to the reaction system.
In one or more embodiments, the additive for modulating enzymatic activity is: additives for increasing or inhibiting the enzymatic activity.
In one or more embodiments, the additive for modulating enzymatic activity is selected from the group consisting of: ca (Ca) 2+ 、Co 2+ 、Mn 2+ 、Ba 2+ 、Al3+、Ni 2+ 、Zn 2+ Or Fe 2+ 。
In one or more embodiments, the additive for modulating enzymatic activity is: can generate Ca 2+ 、Co 2+ 、Mn 2+ 、Ba 2+ 、Al3+、Ni 2+ 、Zn 2+ Or Fe 2+ Is a substance of (a).
In one or more embodiments, the pH of the reaction system is: the pH is 4.0-10.0, preferably 5.5-9.0.
In one or more embodiments, the temperature of the reaction system is: 10 ℃ to 105 ℃, preferably 20 ℃ to 50 ℃.
In another aspect of the invention, there is provided a specific glycosyltransferase having the sequence set forth in SEQ ID NO: 2. SEQ ID NO:4 or SEQ ID NO:14, or a conservatively variant polypeptide thereof; preferably, the conservatively variant polypeptide comprises:
(1) Consists of SEQ ID NO: 2. SEQ ID NO:4 or SEQ ID NO:14 via substitution, deletion or addition of one or more (e.g., 1-20, preferably 1-10, more preferably 1-5, more preferably 1-3) amino acid residues, and having a rhamnosyl functionality linked to the first glycosyl at the C-6 position of the tetracyclic triterpene compound;
(2) Amino acid sequence identical to SEQ ID NO: 2. SEQ ID NO:4 or SEQ ID NO:14, and has a rhamnosyl function of linking the first glycosyl at the C-6 position of the tetracyclic triterpene compound to the first glycosyl of the tetracyclic triterpene compound; or (b)
(3) In SEQ ID NO: 2. SEQ ID NO:4 or SEQ ID NO:14, or a signal peptide sequence is added to the N-terminus of the polypeptide having the sequence shown in fig. 14.
In another aspect of the invention, an isolated polynucleotide encoding the specific glycosyltransferase is provided.
In one or more embodiments, the polynucleotide encoding the specific glycosyltransferase comprises a polynucleotide selected from the group consisting of: (A) the sequence shown in SEQ ID NO: 1. 3 or 13; (B) a sequence corresponding to SEQ ID NO: 1. 3 or 13, and a nucleotide sequence having at least 95% identity to the sequence set forth in seq id no; (E) the amino acid sequence set forth in SEQ ID NO: 1. 3 or 13 and/or a nucleotide sequence formed by truncating or adding 1 to 60 (preferably 1 to 30, more preferably 1 to 10) nucleotides to the 5 'and/or 3' end of the sequence shown in seq id no; (F) A complement of the nucleotide sequence of any one of (a) - (E); (G) Fragments of the sequences described in (A) - (F) 20-50 bases in length.
In one or more embodiments, the polynucleotide sequence is selected from the group consisting of SEQ ID NOs: 1. 3 or 13 or a complement thereof.
In another aspect of the invention, a nucleic acid construct (construct) is provided comprising said polynucleotide, or expressing said specific glycosyltransferase; preferably, the nucleic acid construct is an expression vector or a homologous recombination vector.
In another aspect of the invention, a recombinant host cell is provided which expresses said specific glycosyltransferase, or comprises said polynucleotide, or comprises said nucleic acid construct; preferably, the recombinant host cell further comprises a tetracyclic triterpene compound reaction precursor or a construct expressing/forming the same; preferably, a glycosyl donor bearing a rhamnose group or a glycosyl donor bearing a rhamnose group (including constructs/precursors capable of forming the glycosyl donor) is also present in the recombinant host cell;
in one or more embodiments, the tetracyclic triterpene compound reaction precursors comprise: ginsenoside Rg1, ginsenoside Rh1, and notoginsenoside R3; the corresponding products include: ginsenoside Re, ginsenoside Rg2, yesanchinoside E.
In one or more embodiments, the glycosyl donor includes (but is not limited to): uridine Diphosphate (UDP) -rhamnose, guanosine Diphosphate (GDP) -rhamnose, adenosine Diphosphate (ADP) -rhamnose, cytidine Diphosphate (CDP) -rhamnose, thymidine Diphosphate (TDP) -rhamnose.
In one or more embodiments, the host cell is a prokaryotic cell or a eukaryotic cell.
In one or more embodiments, the host cell is a eukaryotic cell, such as a yeast cell or a plant cell. In one or more embodiments, the host cell is a saccharomyces cerevisiae cell. In one or more embodiments, the host cell is a ginseng cell or a pseudo-ginseng cell.
In one or more embodiments, the host cell is a prokaryotic cell, such as E.coli.
In one or more embodiments, the host cell is not a cell that naturally produces the product formed upon treatment with the specific glycosyltransferases of the invention; for example, it is not a cell that naturally produces a compound of formula (II), (IV).
In one or more embodiments, the host cell is not a cell that naturally produces one or more of the following: ginsenoside Rh1, ginsenoside Rg1, notoginsenoside R3, ginsenoside Rg2, ginsenoside Re, and Yesanchinoside E.
In one or more embodiments, the host cell further has a feature selected from the group consisting of:
(a) A mutant expressing dammarenediol and/or protopanaxadiol saponin and/or a key enzyme in the protopanaxatriol saponin anabolic pathway having 50% sequence identity to the enzyme;
(b) Expressing a polypeptide comprising a functional fragment of the enzyme of (a) or a mutant having 50% sequence identity to the fragment;
(c) A polynucleotide comprising the enzyme of (a) or the polypeptide of (b) or a complement thereof, and/or
(d) Comprising a nucleic acid construct comprising the coding sequence of (c).
In one or more embodiments, the protopanaxatriol saponins comprise ginsenoside Rh1, ginsenoside Rg1, ginsenoside R3, ginsenoside Rg2, ginsenoside Re, yesanchinoside E.
In one or more embodiments, key genes in the ginsenoside Rh1 anabolic pathway include (but are not limited to): dammarenediol synthase gene, cytochrome P450 CYP716a47 gene and P450 CYP716a47 reductase gene and the glycosyltransferase UGTPg100 of tetracyclic triterpene C6 (Genbank accession number AKQ 76388.1), or combinations thereof.
In one or more embodiments, key genes in the anabolic pathway of ginsenoside Rg1 include (but are not limited to): dammarenediol synthase gene, cytochrome P450 CYP716A47 gene and P450 CYP716A47 reductase gene and glycosyltransferases UGTPG1 and UGTPG100 (Genbank accession number AKQ 76388.1) at C20 and C6 of tetracyclic triterpenes, or combinations thereof.
In one or more embodiments, key genes in the ginsenoside Rg2 anabolic pathway include (but are not limited to): dammarenediol synthase gene, cytochrome P450 CYP716A47 gene, and P450 CYP716A47 reductase gene, and the glycosyltransferase UGTPG100 (Genbank accession number AKQ 76388.1) of tetracyclic triterpene C6 catalyze glycosyl extension at the C6 position, glycosyltransferases URT94-1 and URT94-2 in the invention, or a combination thereof.
In one or more embodiments, key genes in the ginsenoside Re anabolic pathway include (but are not limited to): dammarenediol synthase gene, cytochrome P450 CYP716a47 gene, and P450 CYP716a47 reductase gene, and tetracyclic triterpene C20 and C6 glycosyltransferases UGTPg1 and UGTPg100 (Genbank accession number AKQ 76388.1), glycosyltransferases URT94-1 and URT94-2 herein catalyzing glycosyl extension at the C6 position, or combinations thereof.
In a further aspect of the invention there is also provided the use of a host cell according to the invention for the preparation of a glycosyltransferase, a catalytic agent, or a compound of formula (II), (IV).
In another aspect of the invention there is also provided a method of producing a glycosyltransferase or a compound of formula (II) or (IV), comprising incubating a host cell according to the invention.
In a further aspect of the invention there is also provided the use of a host cell according to the invention for the preparation of an enzyme catalytic agent, or for the production of a glycosyltransferase, or as a catalytic cell, or for the production of a compound of formula (II), (IV).
In another aspect of the invention, there is provided a method of producing a transgenic plant comprising the steps of: regenerating the host cell of the invention into a plant, wherein the host cell is a plant cell. In one or more embodiments, the host cell is a ginseng cell. In one or more embodiments, the host cell is a pseudo-ginseng cell.
In another aspect of the invention, there is provided a kit for glycosyl transfer comprising: the specific glycosyltransferase can be connected with a rhamnosyl group on the first glycosyl group at the C-6 position of a tetracyclic triterpene (class) compound, and the specific glycosyltransferase is provided with a nucleotide sequence shown in SEQ ID NO: 2. SEQ ID NO:4 or SEQ ID NO:14, or a conservatively variant polypeptide thereof.
In another aspect of the invention, there is provided a kit for glycosyl transfer comprising: the isolated polynucleotide.
In another aspect of the invention, there is provided a kit for glycosyl transfer comprising: the nucleic acid construct (construct).
In another aspect of the invention, there is provided a kit for glycosyl transfer comprising: the recombinant host cell.
In one or more embodiments, the kit further comprises: a glycosyl donor carrying a rhamnose group; more preferably, the glycosyl donor includes (but is not limited to): uridine Diphosphate (UDP) -rhamnose, guanosine Diphosphate (GDP) -rhamnose, adenosine Diphosphate (ADP) -rhamnose, cytidine Diphosphate (CDP) -rhamnose, thymidine Diphosphate (TDP) -rhamnose.
In one or more embodiments, the kit further comprises: tetracyclic triterpene compounds are reactive precursors.
It is understood that within the scope of the present invention, the above-described technical features of the present invention and technical features specifically described below (e.g., in the examples) may be combined with each other to constitute new or preferred technical solutions. And are limited to a space, and are not described in detail herein.
FIG. 1 shows the result of DNA agarose gel electrophoresis detection of amplified products obtained by amplifying 2 glycosyltransferase target bands from a single ginseng plant.
FIG. 2 shows the expression of glycosyltransferases URT94-1 and URT94-2 in E.coli by Western Blot. "1" represents the lysate supernatant of empty vector pET28a E.coli recombinants; marker, representing protein molecular weight standard; "2" represents the lysate supernatant of the glycosyltransferase BL21-URT94-1 E.coli recombinant; "3" represents the lysate supernatant of the glycosyltransferase BL21-URT94-2 E.coli recombinant; "4" represents the lysate supernatant of the glycosyltransferase BL 21-gGT-7 E.coli recombinant; "5" represents the lysate supernatant of the E.coli recombinant of glycosyltransferase BL 21-gGT-7 (N343G, A359P).
FIG. 3, panel a, show TLC patterns of glycosyltransferases URT94-1 and URT94-2 catalyzing transglycosylation reaction using protopanaxatriol type ginsenoside Rh1 as glycosyl acceptor and UDP-Rha as glycosyl donor. "1" represents the supernatant of the lysate of the pet28a empty vector recombinant as the enzyme solution; "2", "3", "4", "5" represent the lysate supernatants of BL21-URT94-1, BL21-URT94-2, BL21-gGT29-7 (N343G, A359P) and BL21-gGT29-7, respectively, as enzyme solutions. The arrow indicates the migration position of the saponin standard; panel b shows HPLC patterns of glycosyltransferases URT94-1 and URT94-2 for catalyzing transglycosylation reaction with protopanaxatriol type ginsenoside Rh1 as glycosyl acceptor and UDP-Rha as glycosyl donor.
FIG. 4, panel a, show TLC patterns of transglycosylation reactions using protopanaxatriol type ginsenoside Rg1 as a glycosyl acceptor and UDP-Rha as a glycosyl donor, catalyzed by glycosyl transferases URT94-1 and URT 94-2. "1" represents the supernatant of the lysate of the pet28a empty vector recombinant as the enzyme solution; "2", "3", "4", "5" represent BL21-gGT29-7, BL 21-gGT-7 (N343G, A359P), respectively, and the supernatants of lysates of BL21-URT94-1 and BL21-URT94-2 are used as enzyme solutions. The arrow indicates the migration position of the saponin standard; panel b shows HPLC patterns of glycosyltransferases URT94-1 and URT94-2 for catalyzing transglycosylation reaction with protopanaxatriol type ginsenoside Rg1 as glycosyl acceptor and UDP-Rha as glycosyl donor.
FIG. 5, TLC patterns of transglycosylation reactions with protopanaxatriol type ginsenoside Rh1 as the glycosyl acceptor and UDP-Glc as the glycosyl donor catalyzed by glycosyltransferases URT94-1 and URT 94-2. "1" represents the supernatant of the lysate of the pet28a empty vector recombinant as the enzyme solution; "2", "3", "4", "5" represent BL21-gGT29-7, BL 21-gGT-7 (N343G, A359P), respectively, and the supernatants of lysates of BL21-URT94-1 and BL21-URT94-2 are used as enzyme solutions. The arrow indicates the migration position of the saponin standard.
FIG. 6, TLC patterns of transglycosylation reactions with protopanaxatriol type ginsenoside Rg1 as the glycosyl acceptor and UDP-Glc as the glycosyl donor catalyzed by glycosyltransferases URT94-1 and URT 94-2. "1" represents the supernatant of the lysate of the pet28a empty vector recombinant as the enzyme solution; "2", "3", "4", "5" represent BL21-gGT29-7, BL 21-gGT-7 (N343G, A359P), respectively, and the supernatants of lysates of BL21-URT94-1 and BL21-URT94-2 are used as enzyme solutions. The arrow indicates the migration position of the saponin standard.
FIG. 7, comparison of catalytic activity of glycosyltransferase URT94-1m mutant with wild type.
FIG. 8, western blot method for detecting expression of glycosyltransferase URT94-1m mutant and wild type.
The inventor provides a specific glycosyltransferase for the first time through intensive research and screening, and the specific glycosyltransferase can catalyze a substrate to carry out rhamnosylation at a specific position and improve the catalytic activity. In particular, the specific glycosyltransferases of the invention are capable of specifically and efficiently catalyzing the hydroxyl glycosylation of the C-6 position of a tetracyclic triterpene compound substrate at the first glycosyl group to extend the rhamnose group.
Definition of the definition
As used herein, an "isolated polypeptide" or "active polypeptide" means that the polypeptide is substantially free of other proteins, lipids, carbohydrates, or other substances with which it is naturally associated. The person skilled in the art is able to purify the polypeptides using standard protein purification techniques. Substantially pure polypeptides can produce a single main band on a non-reducing polyacrylamide gel. The purity of the polypeptide can also be further analyzed by amino acid sequence.
As used herein, the terms "active polypeptide", "polypeptide of the invention and its derivatives", "enzyme of the invention", "glycosyltransferase" are used interchangeably and include URT94-1 (SEQ ID NO: 2), URT94-2 (SEQ ID NO: 4) polypeptide or its derivatives; also, they may refer to mutants of glycosyltransferases, including URT94-1m (SEQ ID NO: 14).
As used herein, the term "conservatively modified polypeptide" refers to a polypeptide that retains essentially the same biological function or activity as the polypeptide. The "conservative variant polypeptide" may be (i) a polypeptide having one or more conservative or non-conservative amino acid residues (preferably conservative amino acid residues) substituted, which may or may not be encoded by the genetic code, or (ii) a polypeptide having a substituent in one or more amino acid residues, or (iii) a polypeptide formed by fusion of a mature polypeptide with another compound (such as a compound that increases the half-life of the polypeptide, e.g., polyethylene glycol), or (iv) a polypeptide formed by fusion of an additional amino acid sequence to the polypeptide sequence (such as a leader or secretory sequence or a sequence used to purify the polypeptide or a proteolytic sequence, or a fusion protein with the formation of an antigen IgG fragment). Such fragments, derivatives and analogs are within the purview of one skilled in the art and would be well known in light of the teachings herein.
As used herein, the term "variant" or "mutant" refers to a peptide or polypeptide that has an amino acid sequence that is altered by the insertion, deletion or substitution of one or more amino acids as compared to a reference sequence, but retains at least one biological activity. The mutants described in any of the embodiments herein comprise an amino acid sequence having at least 50%, 60% or 70%, preferably at least 80%, preferably at least 85%, preferably at least 90%, preferably at least 95%, preferably at least 97% sequence identity with a reference sequence (SEQ ID NO:2, 4 or 14 as described herein) and retaining the biological activity of the reference sequence (e.g.as a glycosyltransferase). Sequence identity between two aligned sequences can be calculated using BLASTp, e.g., NCBI. Mutants also include amino acid sequences that have one or more mutations (insertions, deletions, or substitutions) in the amino acid sequence of the reference sequence, while still retaining the biological activity of the reference sequence. The plurality of mutations generally refers to within 1-20, such as 1-15, 1-10, 1-8, 1-5, or 1-3. The substitution is preferably a conservative substitution. For example, conservative substitutions with amino acids that are similar or analogous in nature typically do not alter the function of the protein or polypeptide. "similar or analogous amino acids" include, for example, families of amino acid residues with similar side chains, including amino acids with basic side chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, leucine, isoleucine proline, phenylalanine, methionine, tryptophan), beta-branched side chains (e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, histidine). Thus, substitution of one or several sites with another amino acid residue from the same side chain class in a polypeptide of the invention will not substantially affect its activity.
For URT94-1m (SEQ ID NO: 14), it is a mutant of URT 94-1. In the present invention, a conservative variant polypeptide of URT94-1m may also be included, but in a variant corresponding to SEQ ID NO:14 is conserved at amino acid residue position 55.
Active polypeptide, coding gene, vector and host thereof
The inventor discloses a novel specific glycosyltransferase through excavating genome and transcriptome information and combining a great amount of research and experimental work, the novel specific glycosyltransferase can transfer glycosyl onto the first glycosyl of C-6 of a tetracyclic triterpene compound substrate specifically and efficiently to extend a sugar chain, and a reaction product of the novel specific glycosyltransferase has good application value in the fields of pharmacy and the like.
The sequence of the specific glycosyltransferase is preferably shown in SEQ ID NO: 2. 4 or 14. The polypeptide also includes a polypeptide having the same function as the polypeptide shown in SEQ ID NO: 2. 4 or 14. The invention also includes fragments, derivatives and analogues of the polypeptides. As used herein, the terms "fragment," "derivative," and "analog" refer to a polypeptide that retains substantially the same biological function or activity as the polypeptide.
In the present invention, the term "conservatively modified polypeptide" refers to a polypeptide which retains substantially the same biological function or activity as the polypeptide. The "conservative variant polypeptide" may be (i) a polypeptide having one or more conservative or non-conservative amino acid residues (preferably conservative amino acid residues) substituted, which may or may not be encoded by the genetic code, or (ii) a polypeptide having a substituent in one or more amino acid residues, or (iii) a polypeptide formed by fusion of a mature polypeptide with another compound (such as a compound that increases the half-life of the polypeptide, e.g., polyethylene glycol), or (iv) a polypeptide formed by fusion of an additional amino acid sequence to the polypeptide sequence (such as a leader or secretory sequence or a sequence used to purify the polypeptide or a proteolytic sequence, or a fusion protein with the formation of an antigen IgG fragment). Such fragments, derivatives and analogs are within the purview of one skilled in the art and would be well known in light of the teachings herein.
The "conservatively variant polypeptide" may include (but is not limited to): deletions, insertions and/or substitutions of one or more (usually 1-50, preferably 1-30, more preferably 1-20, most preferably 1-10) amino acids, and additions or deletions of one or more (e.g., within 50, more preferably within 20 or 10, more preferably within 5) amino acids at the C-terminus and/or N-terminus. For example, in the art, substitution with amino acids of similar or similar properties does not generally alter the function of the protein. As another example, the addition of one or more amino acids at the C-terminus and/or N-terminus typically does not alter the function of the protein. The invention also provides analogs of the polypeptides. These analogs may differ from the native polypeptide by differences in amino acid sequence, by differences in modified forms that do not affect the sequence, or by both. These polypeptides include natural or induced genetic variants. Induced variants can be obtained by various techniques, such as random mutagenesis by irradiation or exposure to mutagens, by site-directed mutagenesis or other known techniques of molecular biology. Analogs also include analogs having residues other than the natural L-amino acid (e.g., D-amino acids), as well as analogs having non-naturally occurring or synthetic amino acids (e.g., beta, gamma-amino acids). It is to be understood that the polypeptides of the present invention are not limited to the representative polypeptides exemplified above.
The amino-or carboxy-terminal end of URT94-1 (SEQ ID NO: 2), URT94-2 (SEQ ID NO: 4) or URT94-1m (SEQ ID NO: 14) or a conservatively mutated polypeptide thereof of the present invention may further comprise one or more polypeptide fragments as protein tags. Any suitable label may be used with the present invention. For example, the tag may be FLAG, HA, HA, c-Myc, poly-His, poly-Arg, strep-TagII, AU1, EE, T7, 4A6, ε, B, gE, and Ty1. These tags can be used to purify proteins.
When for the purpose of producing a specific glycosyltransferase of the invention or other enzyme (e.g., an enzyme in a host cell that reacts to form a substrate for a specific glycosyltransferase of the invention, an enzyme involved in any step of the product synthesis pathway of the invention), a signal peptide sequence may also be added to the amino terminus of a polypeptide of the invention for secretory expression (e.g., to the outside of the cell) of the translated protein. The signal peptide may be cleaved off during endocrine egress of the polypeptide from the cell.
The active polypeptide of the present invention may be a recombinant polypeptide, a natural polypeptide, a synthetic polypeptide. The polypeptides of the invention may be naturally purified products, or chemically synthesized products, or produced from prokaryotic or eukaryotic hosts (e.g., bacteria, yeast, higher plants) using recombinant techniques. Depending on the host used in the recombinant production protocol, the polypeptides of the invention may be glycosylated or may be non-glycosylated. The polypeptides of the invention may or may not also include an initial methionine residue.
Polynucleotides encoding specific glycosyltransferases and other enzymes of the invention may be in the form of DNA or RNA. DNA forms include cDNA, genomic DNA, or synthetic DNA. The DNA may be single-stranded or double-stranded. The DNA may be a coding strand or a non-coding strand. The term "polynucleotide encoding a polypeptide" may include polynucleotides encoding the polypeptide, or may include additional coding and/or non-coding sequences.
The invention also relates to vectors comprising the polynucleotides of the invention, as well as host cells genetically engineered with the vectors or polypeptide coding sequences of the invention, and methods for producing the polypeptides of the invention by recombinant techniques.
The present invention relates to nucleic acid constructs comprising a polynucleotide as described herein, and one or more regulatory sequences operably linked to these sequences or sequences required for genomic homologous recombination. The polynucleotides of the invention may be manipulated in a variety of ways to ensure expression of the polypeptides or proteins. The nucleic acid construct may be manipulated according to the expression vector or requirements prior to insertion into the vector. Techniques for altering polynucleotide sequences using recombinant DNA methods are known in the art.
In certain embodiments, the nucleic acid construct is a vector. The vector may be a cloning vector, an expression vector, or a gene knock-in vector. Polynucleotides of the invention may be cloned into many types of vectors, e.g., plasmids, phagemids, phage derivatives, animal viruses and cosmids. Cloning vectors may be used to provide the coding sequence for a protein or polypeptide of the invention. The expression vector may be provided to the cell as a bacterial vector or a viral vector. Expression of the polynucleotides of the invention is typically achieved by operably linking the polynucleotides of the invention to a promoter and incorporating the construct into an expression vector. The vector may be suitable for replication and integration of eukaryotic cells. Typical expression vectors contain expression control sequences that can be used to regulate the expression of a desired nucleic acid sequence.
The knock-in vector is used to integrate the polynucleotide sequences described herein into a region of interest of the genome. Typically, the knock-in vector will contain, in addition to the polynucleotide sequence, a 5 'homology arm and a 3' homology arm required for homologous recombination of the genome. In some embodiments, the nucleic acid constructs herein contain a 5 'homology arm, a polynucleotide sequence described herein, and a 3' homology arm. When using a knock-in vector, the CRISPR/Cas9 technique can be used simultaneously to homologous recombine polynucleotide sequences to a location of interest. The CRISPR/Cas9 technology guides Cas9 nuclease to modify the genome at an insertion position by designing guide RNA for a target gene, resulting in increased homologous recombination efficiency in the modified region of the gene, and homologous recombination of a target fragment contained in a gene knock-in vector to the target site. Procedures for CRISPR/Cas9 technology are well known in the art, as well as reagents used, such as Cas9 nucleases.
Methods well known to those skilled in the art can be used to construct the nucleic acid construct. These methods include in vitro recombinant DNA techniques, DNA synthesis techniques, in vivo recombinant techniques, and the like. The DNA sequence may be operably linked to an appropriate promoter in an expression vector to direct mRNA synthesis. Representative examples of these promoters are: the lac or trp promoter of E.coli; a lambda phage PL promoter; eukaryotic promoters include the CMV immediate early promoter, the HSV thymidine kinase promoter, the early and late SV40 promoters, LTRs from retroviruses, and other known promoters that control the expression of genes in prokaryotic or eukaryotic cells or viruses thereof. The expression vector also includes a ribosome binding site for translation initiation and a transcription terminator. In addition, the expression vector preferably comprises one or more selectable marker genes to provide phenotypic traits for selection of transformed host cells, such as dihydrofolate reductase, neomycin resistance and Green Fluorescent Protein (GFP) for eukaryotic cell culture, or tetracycline or ampicillin resistance for E.coli.
When the polynucleotide of the present invention is expressed in higher eukaryotic cells, transcription will be enhanced if an enhancer sequence is inserted into the vector. Enhancers are cis-acting elements of DNA, usually about 10 to 300 base pairs, that act on a promoter to increase the transcription of a gene. Examples include the SV40 enhancer 100 to 270 base pairs on the late side of the origin of replication, the polyoma enhancer on the late side of the origin of replication, and adenovirus enhancers.
The invention also provides host cells for biosynthesis of a product of interest. The host cell may be a prokaryotic cell, such as, but not limited to, E.coli, yeast, streptomyces; more preferably E.coli cells. The cell host is a production tool, and a person skilled in the art can modify various host cells by means of some technical means, so that biosynthesis according to the invention is also achieved, and thus the host cells and the production method are also intended to be encompassed by the invention.
The polynucleotide sequences of the present invention may be used to express or produce the polypeptides described herein by conventional recombinant DNA techniques. Generally, there are the following steps: (1) Transforming or transducing a suitable host cell with a polynucleotide (or variant) encoding the specific glycosyltransferase of the invention, or with an expression vector comprising the polynucleotide; (2) host cells cultured in a suitable medium; (3) isolating and purifying the protein from the culture medium or the cells.
Vectors comprising the appropriate DNA sequences as described above, as well as appropriate promoter or control sequences, may be used to transform appropriate host cells to enable expression of the protein. The host cell may be a prokaryotic cell, such as a bacterial cell; or lower eukaryotic cells, such as yeast cells; or higher eukaryotic cells, such as mammalian cells. Representative examples are: coli, streptomyces; bacterial cells of salmonella typhimurium; fungal cells such as yeast; a plant cell; insect cells of Drosophila S2 or Sf 9; CHO, COS, 293 cells, or Bowes melanoma cells. It will be clear to a person of ordinary skill in the art how to select appropriate vectors, promoters, enhancers and host cells.
Transformation of host cells with recombinant DNA can be performed using conventional techniques well known to those skilled in the art. The recombinant polypeptide in the above method may be expressed in a cell, or on a cell membrane, or secreted outside the cell. If desired, the recombinant proteins can be isolated and purified by various separation methods using their physical, chemical and other properties. Such methods are well known to those skilled in the art. Examples of such methods include, but are not limited to: conventional renaturation treatment, treatment with a protein precipitant (salting-out method), centrifugation, osmotic sterilization, super-treatment, super-centrifugation, molecular sieve chromatography (gel filtration), adsorption chromatography, ion exchange chromatography, high Performance Liquid Chromatography (HPLC), and other various liquid chromatography techniques and combinations of these methods.
Application of
The present inventors have made an effort on the study of glycosyltransferases, however, in the previous work, an enzyme which can efficiently utilize a rhamnosyl donor, specifically linking a rhamnosyl group to the first glycosyl group at the C-6 position of a tetracyclic triterpene (type) compound has not been obtained. Among the existing enzymes, some are substantially incapable of utilizing rhamnosyl donors (e.g., UDP-Rha); some of them have very low activity and cannot fully meet the application requirements.
Under the above background, the present inventors screened from ginseng to obtain a specific glycosyltransferase (URT 94 s) capable of extending rhamnose at C6 position, which can efficiently catalyze the reaction of protopanaxatriol saponin (protopanaxatriol saponin/protopanaxatriol saponin): ginsenoside Rh1, ginsenoside Rg1 and notoginsenoside R3 extend 1 molecule of rhamnose on the first glycosyl at C-6 position; thereby obtaining ginsenoside Rg2, ginsenoside Re or Yesanchinoside E. The glycosyltransferase is a highly specific glycosyltransferase provided for the efficient preparation of ginsenoside Rg2 or ginsenoside Re or Yesanchinoside E. Preferably, the protopanaxatriol saponin comprises ginsenoside Rh1 and ginsenoside Rg1.
As a specific embodiment of the invention, the active polypeptides of the invention have glycosyltransferase activity and are capable of catalyzing one or more of the following reactions:
wherein R1 and R2 are H or glycosyl, and R3 and R4 are monosaccharide glycosyl.
In one or more embodiments, the substituted compounds of R1-R4 are as follows:
substrate(s) | R1 | R2 | R3 | R4 | Product(s) |
Rg1 | H | Glc | Glc | Rha | Ginsenoside Re |
Rh1 | H | H | Glc | Rha | Ginsenoside Rg2 |
Namely, when R1 is H, R2 and R3 are glucosyl, the compound of formula (I) is ginsenoside Rg1, and when R4 is rhamnosyl, the compound of formula (II) is notoginsenoside Re; or when R1 and R2 are H and R3 is glucosyl, the compound of formula (I) is ginsenoside Rh1, and when R4 is rhamnosyl, the compound of formula (II) is notoginsenoside Rg2.
As a further embodiment of the present invention,
wherein R1 is H or glycosyl, and R2, R3, R4 and R5 are monosaccharide glycosyl; the polypeptide is selected from SEQ ID NO: 2. 4 or 14 or a polypeptide derived therefrom.
In one or more embodiments, the substituted compounds of R1-R5 are as follows:
substrate(s) | R1 | R2 | R3 | R4 | R5 | Product(s) |
Notoginseng radix saponin R3 | H | Glc | Glc | Glc | Rha | Yesanchinoside E |
That is, when R1 is H, R2, R3 and R4 are glucosyl groups, the compound of formula (III) is notoginsenoside R3, and when R5 is rhamnosyl, the compound of formula (IV) is Yesanchinoside E.
The invention also provides a method of constructing a transgenic plant comprising regenerating a host cell containing a polypeptide or polynucleotide described herein into a plant, said host cell being a plant cell. Methods and reagents for regenerating plant cells are well known in the art.
The glycosyltransferase of the present invention is especially capable of converting ginsenoside Rh1 into ginsenoside Rg2 with other activities. The glycosyltransferase of the invention can respectively convert ginsenoside Rg1 into ginsenoside Re with other activities.
The active polypeptide or glycosyltransferase related by the invention can be used for artificially synthesizing known ginsenoside and new ginsenoside and derivatives thereof, and can convert Rh1 into active ginsenoside Rg2 and Rg1 into active ginsenoside Re.
The invention also provides a method of constructing a transgenic plant comprising transforming a plant with a polynucleotide or nucleic acid construct as described herein, and obtaining a transgenic positive plant expressing a polypeptide as described herein, comprising said polynucleotide or comprising said nucleic acid construct in the progeny of the plant by crossing, screening. Methods for transforming plants with nucleic acids, crossing plants, and screening for transgenic positive plants are well known in the art.
The invention also provides a kit for biosynthesis of a target product or an intermediate thereof, comprising: SEQ ID NO: 2. 4 or 14 or a conservative variant polypeptide thereof; preferably also included therein are glycosyl donors; preferably host cells are also included. More preferably, the kit further comprises instructions for performing the method of biosynthesis.
The invention has the main advantages that:
(1) The specific glycosyltransferase of the invention can transfer the first glycosyl of C-6 of the substrate of the tetracyclic triterpene compound into glycosyl to extend the sugar chain in a specific and efficient way;
(2) Rh1 can be efficiently converted into ginsenoside Rg2 with activity by the glycosyltransferase; rg1 can be efficiently converted into ginsenoside Re with activity by the glycosyltransferase. The activity of Rg2 in preventing and treating neurodegenerative diseases; re has the activity of reducing blood sugar and treating diabetes. Therefore, the glycosyltransferase of the invention has wide application value.
(3) The catalytic efficiency is high. The activity of URT94-1 and URT94-2 in catalyzing extension of the sugar chain at the C6 position of Rh1 with UDP-rhamnose as a glycosyl donor is improved by at least 5 times as compared with glycosyltransferase disclosed in patent PCT/CN 2015/081111.
The invention will be further illustrated with reference to specific examples. It is to be understood that these examples are illustrative of the present invention and are not intended to limit the scope of the present invention. The experimental procedures, which are not specifically noted in the examples below, are generally carried out according to conventional conditions such as those described in J.Sam Brookfield et al, molecular cloning guidelines, third edition, scientific Press, or according to the manufacturer's recommendations.
Sequence information
SEQ ID NO:1 (URT 94-1 nucleic acid)
SEQ ID NO:2 (URT 94-1 protein)
SEQ ID NO:3 (URT 94-2 nucleic acid)
SEQ ID NO:4 (URT 94-2 protein)
SEQ ID NO:5 (primer set 1-F)
SEQ ID NO:6 (primer set 1-R)
SEQ ID NO:7 (primer set 2-F)
SEQ ID NO:8 (primer set 2-R)
SEQ ID NO:9(URT94-1_Pet28a-F)
SEQ ID NO:10(URT94-1_Pet28a-R)
SEQ ID NO:11(URT94-2_Pet28a-F)
SEQ ID NO:12(URT94-2_Pet28a-R)
SEQ ID NO:13 (URT 94-1m1 nucleic acid)
SEQ ID NO:14 (URT 94-1m1 protein)
EXAMPLE 1 cloning of ginseng-derived glycosyltransferase URT94s
The inventor performs intensive research and screening to clone two glycosyltransferases from a single ginseng plant, which are named URT94-1 and URT94-2 (URT 94 s).
Cloning of the URT94 s: extracting ginseng RNA and carrying out reverse transcription to obtain ginseng cDNA. 2 pairs of primers (SEQ ID NO:5-SEQ ID NO:6 amplified URT94-1; SEQ ID NO:7-SEQ ID NO:8 amplified URT 94-2) were designed using the cDNA as a template for PCR amplification. The DNA polymerase is PrimeSTAR, a high-fidelity DNA polymerase of Takara Bio-engineering Co. The PCR products were detected by agarose gel electrophoresis (FIG. 1). The target DNA band is excised by irradiation with ultraviolet light. Then, the amplified DNA fragment was obtained by recovering DNA from agarose gel using AxyPrep DNA Gel Extraction Kit (AXYGEN Co.). The DNA fragment was ligated with the commercial cloning vector pMD18T plasmid by rTaq DNA polymerase of Takara Bio-engineering Co., ltd at the end, and recombinant plasmids URT94-1-pMD18T and URT94-2-pMD18T were obtained. The ligation product was transformed into competent cells of E.coli Top10, and the transformed E.coli broth was plated on LB plates supplemented with 100ug/mL ampicillin, and recombinant clones were further verified by PCR and restriction enzyme. And respectively selecting one clone to extract recombinant plasmid, and sequencing. It was found that URT94-1 and URT94-2 are glycosyltransferase genes whose ORFs encode the PSPG cassette of the conserved domain of glycosyltransferase family 1.
The present inventors performed expression and transglycosylation reaction analyses on URT94-1 and URT94-2, respectively. Wherein the glycosyltransferases encoded by 2 nucleic acid sequences (SEQ ID NO: i, 3, respectively) can catalyze extension of 1 rhamnosyl group at the C6 position of Rh1 to generate Rg2, and the catalytic activity is improved by at least 5 times compared with the mutant gGT-7 (N343G, A359P) of gGT29-7 disclosed in the prior patent (PCT/CN 2015/081111), and both cannot catalyze extension of 1 glucosyl group at the C6 position of Rh1 to generate Rf.
Experimental results show that the conversion rate of the ginseng-derived URT94-1 and URT94-2 for catalyzing the C6 position of Rh1 to extend one rhamnosyl to generate Rg2 is more than 50%, the conversion rate of the ginseng-derived URT94-1 and URT94-2 for catalyzing the C6 position of Rh1 to extend one rhamnosyl to generate Re is more than 50%, and the ginseng-derived URT94-1 and URT94-2 cannot catalyze the C6 position of Rh1 to extend 1 glucosyl to generate C20-O-Glc-Rf, so that the ginseng-derived URT94-2 is UDP-mouse Li Tanggao degree specific glycosyltransferase.
EXAMPLE 2 construction of recombinant expression plasmid for the Ginseng glycosyltransferase URT94s Gene
The pMD18T plasmids containing the URT94-1 and URT94-2 genes constructed in example 1 were used as examples, the forward primer comprising two parts, the 5 '-end-3' -end comprising 20bp of the sequence of the pET28a homology arm and the initiation sequence encoding URT94-1 in sequence, the reverse primer comprising two parts, the 5 '-end-3' -end comprising 20bp of the sequence of the pET28a homology arm and the end sequence encoding URT94-1 comprising 20bp (SEQ ID NO:9-SEQ ID NO:10, see Table 1), and the gene encoding URT94-1 (comprising the pET28a homology arm) was amplified by PCR using the above primers. The DNA polymerase is high-fidelity DNA polymerase PrimeSTAR of Takara Bio-engineering Co., ltd, and the PCR program is set by referring to the instruction book: 94 ℃ for 2min;94℃for 15s,57℃for 30s and 68℃for 1.5min, 33 cycles in total; 68 ℃ for 10min; preserving heat at 16 ℃. The PCR product is detected by agarose gel electrophoresis, and a band consistent with the size of the target DNA is cut off under ultraviolet light. The DNA fragment was then recovered from the agarose gel using AxyPrep DNA Gel Extraction Kit (AXYGEN Co.).
Plasmid pET28a was digested with the FD restriction enzymes NcoI and SalI from Thermo, 37℃for 50min, and then linear plasmid pET28a was recovered from agarose gel using AxyPrep DNA Gel Extraction Kit (AXYGEN). The enzyme tangential plasmids were subjected to homologous recombination with 2 UGTs such as URT94-1 obtained above, respectively, using a recombinase from assist in Shanghai, st.Johnsonian, inc., and the ligation products were transformed into E.coli BL21 (DE 3) competent cells, which were plated on LB plates supplemented with 50. Mu.g/mL kanamycin (Kana). Positive transformants were verified by colony PCR and sequenced to further verify whether the recombinant expression plasmid was constructed successfully. Positive transformants were designated as E.coli BL21-URT94-1 and BL21-URT94-2.
TABLE 1 primers for construction of Gene expression plasmids
Example 3 expression of the Ginseng glycosyltransferase URT94s in E.coli
Two kinds of colibacillus BL21-URT94-1 and BL21-URT94-2 with correct sequencing are inoculated into 50mL LB culture medium, cultured at 37 ℃ and 200rpm until OD600 is about 0.6-0.8, the bacterial liquid is cooled to 4 ℃, IPTG with the final concentration of 200 mu tM is added, and induced expression is carried out at 18 ℃ and 120rpm for 16h. The cells were collected by centrifugation at 4℃and disrupted by sonication, and the supernatant of the cell lysate was collected by centrifugation at 12000g for 10min at 4℃to thereby obtain a crude protease solution. The 6 XHis tag sequence on pET28a makes the C terminal of proteins URT94-1 and URT94-2 carry 6 XHis tag tags, respectively. Thus, western blot is carried out on the two protein crude enzyme solutions to detect the protein expression condition. Anti 6X His tag Western Blot (FIG. 2) shows that there is a distinct band between 45-55kD and that glycosyltransferases URT94-1 and URT94-2 are both expressed soluble in E.coli.
EXAMPLE 4 in vitro transglycosylation Activity and product identification by glycosyltransferase URT94s Using protopanaxatriol type saponin Rh1 as substrate
The cell lysates of recombinant E.coli BL21-URT94-1 and BL21-URT94-2 of example 4 were used as crude enzyme solutions for the transglycosylation reaction, and the cell lysates of recombinant E.coli transformed with empty vector pET28a were used as controls. The ginseng glycosyltransferase gGT-7, gGT29-7 (N343G, A359P) from patent PCT/CN2015/081111 was selected as a positive control. In vitro transglycosylation assays were performed according to the reaction system presented in table 2, and reacted overnight at 35 ℃.
The reaction results were detected by Thin Layer Chromatography (TLC), high Performance Liquid Chromatography (HPLC):
TABLE 2 enzyme Activity measurement reaction System
As shown in figures 3a-b, protopanaxatriol type ginsenoside Rh1 is taken as a glycosyl acceptor, UDP-Rha is taken as a glycosyl donor, BL21-URT94-1 and BL21-URT94-2 catalyze the protopanaxatriol type ginsenoside Rh1 to generate Rg2, and the catalytic efficiency of the protopanaxatriol type ginsenoside Rh1 and the UDP-Rha are obviously better than that of glycosyltransferase gGT-7 (N343G, A359P) disclosed before. The results of HPLC were consistent with the results of TLC.
Therefore, like URT94-1, URT94-2 and gGT29-7 (N343G, A359P), it is able to catalyze C6-O-Glc extension of Rh1 by one molecule of rhamnose to generate ginsenoside Rg2.
Example 5 in vitro transglycosylation Activity and product identification by glycosyltransferase URT94s with protopanaxatriol type saponin Rg1 as substrate
The cell lysates of recombinant E.coli BL21-URT94-1 and BL21-RT94-2 of example 4 were used as crude enzyme solutions for the transglycosylation reaction, and the cell lysates of recombinant E.coli transformed with empty vector pET28a were used as controls. The ginseng glycosyltransferase gGT-7, gGT29-7 (N343G, A359P) from patent PCT/CN2015/081111 was selected as a positive control. In vitro transglycosylation assays were performed according to the reaction system presented in table 3, at 35 ℃ overnight.
The reaction results were detected by Thin Layer Chromatography (TLC), high Performance Liquid Chromatography (HPLC):
the protopanaxatriol type ginsenoside Rg1 is taken as a glycosyl acceptor, UDP-Rha is taken as a glycosyl donor, URT94-1 and URT94-2 catalyze the protopanaxatriol type ginsenoside Rg1 to generate Re, and the catalytic efficiency is obviously better than that of glycosyltransferase gGT29-7 (N343G, A359P) (PCT/CN 2015/081111) disclosed before. The results of HPLC were consistent with the results of TLC. As shown in fig. 4 a-b.
Therefore, like URT94-1, URT94-2 and gGT29-7 (N343G, A359P), it is capable of catalyzing C6-O-Glc extension of Rg1 by one molecule of rhamnose to generate ginsenoside Re.
Example 6 in vitro transglycosylation Activity and product identification by glycosyltransferase URT94s with protopanaxatriol type saponin Rh1/Rg1 as substrate and UDP-Glc as sugar donor
The cell lysates of recombinant E.coli BL21-URT94-1 and BL21-URT94-2 of example 4 were used as crude enzyme solutions for the transglycosylation reaction, and the cell lysates of recombinant E.coli transformed with empty vector pET28a were used as controls. The ginseng glycosyltransferase gGT-7, gGT29-7 (N343G, A359P) from patent PCT/CN2015/081111 was selected as a positive control. In vitro transglycosylation assays were performed according to the reaction system presented in table 3, at 35 ℃ overnight. The reaction results were detected by Thin Layer Chromatography (TLC) and High Performance Liquid Chromatography (HPLC), respectively.
TABLE 3 enzyme activity assay reaction System
Protopanaxatriol ginsenoside Rh1 is taken as a glycosyl acceptor, UDP-Glc is taken as a glycosyl donor, URT94-1 and URT94-2 cannot catalyze the protopanaxatriol ginsenoside Rh1 to generate Rf, and the HPLC result is consistent with the TLC result. Thus, unlike gGT-29 and gGT-7 (N343G, A359P), glycosyltransferases URT94-1 and URT94-2 of the present invention are not capable of catalyzing the extension of C6-O-Glc of Rh1 by one molecule of glucose to generate ginsenoside Rf, as shown in FIG. 5.
Protopanaxatriol type ginsenoside Rg1 is taken as a glycosyl acceptor, UDP-Glc is taken as a glycosyl donor, URT94-1 and URT94-2 cannot catalyze the protopanaxatriol type ginsenoside Rg1 to generate C20-O-Glc-Rf, and the HPLC result is consistent with the TLC result. Thus, unlike gGT-29 and gGT-7 (N343G, A359P), the glycosyltransferases URT94-1 and URT94-2 of the present invention are not capable of catalyzing the extension of C6-O-Glc of Rg1 by one molecule of glucose to produce ginsenoside C20-O-Glc-Rf, as shown in FIG. 6. It was shown that URT94-1 and URT94-2 are UDP-mouse Li Tanggao degree specific glycosyltransferases.
Example 7 comparison of efficiency of URT94s catalyzing extension of C6 by one molecule of rhamnose
Glycosyltransferase gGT-7 of source patent PCT/CN2015/081111 can extend a molecule of glucose at C6, gGT-7 (N343G, A359P), can extend a molecule of glucose at C6 or can extend a molecule of rhamnose at C6. Glycosyltransferases gGT-7, gGT29-7 (N343G, A359P) and glycosyltransferases URT94-1 and URT94-2 of the present invention were expressed and crude enzyme solutions were prepared as described in example 4. The enzyme-catalyzed reaction was carried out as in example 5, using UDP-Rha as the glycosyl donor and Rh1 and/or Rg1 as the glycosyl acceptor, at 35℃for 1 hour, and the product was quantified by HPLC. The calculation of catalytic efficiency is carried out according to the following formula:
conversion efficiency (%) =product amount/(substrate amount+product amount)
As shown in Table 4, the activity of the glycosyltransferases gGT29-7, gGT29-7 (N343G, A359P), URT94-1 and URT94-2, both catalyzing the extension of the sugar chain at the C6 position of Rh1 and/or Rg1 with UDP-rhamnose as a glycosyl donor, was improved compared to those disclosed in PCT/CN 2015/081111.
Table 4 comparison of catalytic efficiency of glycosyltransferases catalyzing extended Rha at C6 position
Thus, unlike prior glycosyltransferases, URT94-1 and URT94-2 of the present invention can specifically and efficiently add a rhamnosyl group further to the first glycosyl group of C-6 of the tetracyclic triterpene compound substrate to extend the sugar chain.
Example 8 efficient murine glycosyltransferase URT94-1 mutant protein
In order to further increase the catalytic activity of rhamnosyl transferase, the inventors constructed a mutant library thereof against URT94-1 using a random mutation method.
(1) Error-prone PCR
Error-prone PCR was performed using the rhamnosyltransferase URT94-1 gene sequence (SEQ ID NO: 1) as a template, and primers URT94-1_Pet28a-F (5'-ctttaagaaggagatataccatggataccaatgaaaaaacca-3' (SEQ ID NO: 9)) and URT94-1_Pet28a-R (5'-ctcgagtgcggccgcaagcttggggcatcgcttcccctggcctg-3' (SEQ ID NO: 10)). The error-prone PCR is selected from Stratagene GeneMorph II Random Mutagenesis Kit random mutation kit. The PCR procedure was: 95 ℃ for 2min;95 ℃ for 10s,55 ℃ for 15s and 72 ℃ for 2min, 28 cycles in total; the temperature was reduced to 10℃for 10min at 72℃and the template was used in an amount of 50ng. And (3) recovering the PCR product after agarose gel electrophoresis to obtain the error-prone PCR product of the rhamnosyl transferase URT 94-1.
(2) Expression of enzymes
The PCR product described above was ligated to pET28a plasmid (one-step cloning kit, purchased from Shanghai assist in Saint.) and the ligation product transformed into laboratory-prepared competent E.coli BL21 cells, and the transformed E.coli bacterial solution was plated on LB plates supplemented with 100ug/mL kanamycin and recombinant clones were further verified by PCR. And respectively selecting a plurality of clones to extract recombinant plasmids, and sequencing.
(3) Enzyme activity assay and screening
And (3) selecting a plurality of escherichia coli expression strains of the ginseng glycosyltransferase URT94-1 mutant obtained in the step (2), respectively inoculating the escherichia coli expression strains into 50mL of LB liquid culture medium, and culturing at 37 ℃ and 200rpm to ensure that OD600 reaches 0.6-0.8, inducing with 0.2mM IPTG and culturing at 110rpm for 18h at 16 ℃. After low temperature collection, 2mL of 50mM tris-HCl pH8.0 was used for reselecting the cells, and the cells were crushed by a cell crusher to obtain a crude protease solution.
By using the enzyme activity measurement reaction system of Table 3 to compare enzyme activities, the inventor obtains 1 mutant which has high enzyme activity (the nucleic acid sequence is shown as SEQ ID NO:13; the protein sequence is shown as SEQ ID NO: 14) which is named as URT94-1M1 after analyzing and screening a large number of mutants, and the 55 th position of the mutant is mutated from L to M (L55M) corresponding to the wild type URT 94-1. The results of the comparison of the activity of rhamnosyl transferase URT94-1 and its mutant URT94-1m1 are shown in Table 5.
TABLE 5
Sequence numbering | Sequence name | Mutation site | Rg2 conversion (%) | Re conversion (%) |
SEQ ID NO:2 | PgURT94 | - | 75% | 62% |
SEQ ID NO:14 | PgURT94m1 | L55M | 92% | 99% |
According to Table 5, the catalytic activity of the mutant is obviously improved compared with URT94-1, the efficiency of catalyzing Rh1 to synthesize Rg2 is improved to 92%, and the efficiency of catalyzing Rg1 to generate Re is improved to 99%, and the specific results are shown in Table 5 and FIG. 7.
The inventor detects protein expression by using a Western blot method, and the result is shown in figure 8, and the mutant URT94-1m1 can be efficiently expressed.
All documents mentioned in this application are incorporated by reference as if each were individually incorporated by reference. Further, it will be appreciated that various changes and modifications may be made by those skilled in the art after reading the above teachings, and such equivalents are intended to fall within the scope of the claims appended hereto.
Claims (15)
- A method of linking a rhamnosyl group on the first glycosyl group at the C-6 position of a tetracyclic triterpene compound comprising: transfer is performed with a specific glycosyltransferase having the amino acid sequence of SEQ ID NO: 2. SEQ ID NO:4 or SEQ ID NO:14, or a conservatively variant polypeptide thereof.
- The method of claim 1, wherein said rhamnosyl is provided by a glycosyl donor; preferably, the glycosyl donor is a glycosyl donor carrying a rhamnose group; more preferably, the glycosyl donor comprises a glycosyl donor selected from the group consisting of: uridine diphosphate-rhamnose, guanosine diphosphate-rhamnose, adenosine diphosphate-rhamnose, cytidine diphosphate-rhamnose, thymidine diphosphate-rhamnose, or a combination thereof.
- The method of claim 1, wherein the tetracyclic triterpene compound is of formula (I), and the compound having a glycosyl group attached to the glycosyl group at the C-6 position is of formula (II);Wherein R1 and R2 are H or glycosyl, R3 is monosaccharide glycosyl, and R4 is rhamnosyl; preferably, the glycosyl or monosaccharide glycosyl is selected from: a glucosyl, xylosyl, arabinosyl or rhamnosyl group;preferably, when R1 is H, R2 and R3 is glucosyl, the compound of formula (I) is ginsenoside Rg1 and the compound of formula (II) is ginsenoside Re; when R1 and R2 are H, R and are glucosyl, the compound of formula (I) is ginsenoside Rh1, and the compound of formula (II) is ginsenoside Rg2.
- The method of claim 1, wherein the tetracyclic triterpene compound is of formula (III), and the compound having a glycosyl group attached to the glycosyl group at the C-6 position is of formula (IV);wherein R1 is H or glycosyl, R2, R3 and R4 are monosaccharide glycosyl, and R5 is rhamnosyl; preferably, the glycosyl or monosaccharide glycosyl is selected from: a glucosyl, xylosyl, arabinosyl or rhamnosyl group;preferably, when R1 is H, R, R3 and R4 are glucosyl, and R5 is rhamnosyl, the compound of formula (III) is notoginsenoside R3, and the compound of formula (IV) is Yesanchinoside E.
- Use of a specific glycosyltransferase for the attachment of a rhamnosyl group on the first glycosyl group in position C-6 of a tetracyclic triterpene compound, said specific glycosyltransferase being a glycosyl transferase having the amino acid sequence of SEQ ID NO: 2. SEQ ID NO:4 or SEQ ID NO:14, or a conservatively variant polypeptide thereof.
- The use according to claim 5, wherein the rhamnosyl group is provided by a glycosyl donor; preferably, the glycosyl donor is a glycosyl donor carrying a rhamnose group; more preferably, the glycosyl donor comprises a glycosyl donor selected from the group consisting of: uridine diphosphate-rhamnose, guanosine diphosphate-rhamnose, adenosine diphosphate-rhamnose, cytidine diphosphate-rhamnose, thymidine diphosphate-rhamnose, or a combination thereof.
- The use according to claim 5, wherein the tetracyclic triterpene compound is of formula (I), and the compound having a glycosyl group attached to the glycosyl group at the C-6 position is of formula (II);wherein R1 and R2 are H or glycosyl, R3 is monosaccharide glycosyl, and R4 is rhamnosyl; preferably, the glycosyl or monosaccharide glycosyl is selected from: a glucosyl, xylosyl, arabinosyl or rhamnosyl group;preferably, when R1 is H, R2 and R3 is glucosyl, the compound of formula (I) is ginsenoside Rg1 and the compound of formula (II) is ginsenoside Re; when R1 and R2 are H, R and are glucosyl, the compound of formula (I) is ginsenoside Rh1, and the compound of formula (II) is ginsenoside Rg2.
- The use according to claim 5, wherein the tetracyclic triterpene compound is of formula (III), and the compound having a glycosyl group attached to the glycosyl group at the C-6 position is of formula (IV);Wherein R1 is H or glycosyl, R2, R3 and R4 are monosaccharide glycosyl, and R5 is rhamnosyl; preferably, the glycosyl or monosaccharide glycosyl is selected from: a glucosyl, xylosyl, arabinosyl or rhamnosyl group;preferably, when R1 is H, R, R3 and R4 are glucosyl, and R5 is rhamnosyl, the compound of formula (III) is notoginsenoside R3, and the compound of formula (IV) is Yesanchinoside E.
- A method of intracellular attachment of a rhamnosyl group on the first glycosyl group at the C-6 position of a tetracyclic triterpene compound comprising:(a) Introducing a tetracyclic triterpene compound reaction precursor or a construct expressing/forming the same, and introducing a specific glycosyltransferase or a construct expressing the same into a host cell to obtain a recombinant host cell; the specific glycosyltransferase is a glycosyltransferase having the sequence of SEQ ID NO: 2. SEQ ID NO:4 or SEQ ID NO:14, or a conservatively variant polypeptide thereof; a glycosyl donor carrying a rhamnose group or a glycosyl donor carrying a rhamnose group is introduced into the host cell;(b) Culturing the recombinant host cell of (a) to obtain a tetracyclic triterpene compound having a rhamnosyl group attached to the first glycosyl group at position C-6.
- The method of claim 9, wherein the tetracyclic triterpene compound reaction precursor comprises: ginsenoside Rg1, ginsenoside Rh1, and notoginsenoside R3; the corresponding products include: ginsenoside Re, ginsenoside Rg2, yesanchinoside E;preferably, the glycosyl donor comprises a glycosyl donor selected from the group consisting of: uridine diphosphate-rhamnose, guanosine diphosphate-rhamnose, adenosine diphosphate-rhamnose, cytidine diphosphate-rhamnose, thymidine diphosphate-rhamnose, or a combination thereof.
- A specific glycosyltransferase having the sequence of SEQ ID NO: 2. SEQ ID NO:4 or SEQ ID NO:14, or a conservatively variant polypeptide thereof; preferably, the conservatively variant polypeptide comprises:(1) Consists of SEQ ID NO: 2. SEQ ID NO:4 or SEQ ID NO:14, and has a rhamnosyl function linked to the first glycosyl at the C-6 position of the tetracyclic triterpene compound;(2) Amino acid sequence identical to SEQ ID NO: 2. SEQ ID NO:4 or SEQ ID NO:14 and has more than 50% identity to the polypeptide of the sequence shown in figure 14 and has the function of linking a rhamnosyl group to the first glycosyl group at the C-6 position of the tetracyclic triterpene compound; or (b)(3) In SEQ ID NO: 2. SEQ ID NO:4 or SEQ ID NO:14, or a signal peptide sequence is added to the N-terminus of the polypeptide having the sequence shown in fig. 14.
- An isolated polynucleotide encoding the specific glycosyltransferase of claim 11.
- A nucleic acid construct comprising the polynucleotide of claim 11, or expressing the specific glycosyltransferase of claim 11; preferably, the nucleic acid construct is an expression vector or a homologous recombination vector.
- A recombinant host cell expressing the specific glycosyltransferase of claim 11, or comprising the polynucleotide of claim 12, or comprising the nucleic acid construct of claim 13; preferably, the recombinant host cell further comprises a tetracyclic triterpene compound reaction precursor or a construct expressing/forming the same; preferably, a glycosyl donor carrying a rhamnose group or a glycosyl donor carrying a rhamnose group is also present in the recombinant host cell;preferably, the tetracyclic triterpene compound reaction precursor comprises: ginsenoside Rg1, ginsenoside Rh1, and notoginsenoside R3; the corresponding products include: ginsenoside Re, ginsenoside Rg2, yesanchinoside E;Preferably, the glycosyl donor comprises a glycosyl donor selected from the group consisting of: uridine diphosphate-rhamnose, guanosine diphosphate-rhamnose, adenosine diphosphate-rhamnose, cytidine diphosphate-rhamnose, thymidine diphosphate-rhamnose, or a combination thereof.
- A kit for glycosyl transfer comprising:the specific glycosyltransferase of claim 7, which is capable of linking a rhamnosyl group to the first glycosyl group at position C-6 of a tetracyclic triterpene compound, said specific glycosyltransferase being a glycosyl transferase having the amino acid sequence of SEQ ID NO: 2. SEQ ID NO:4 or SEQ ID NO:14, or a conservatively variant polypeptide thereof; or (b)The isolated polynucleotide of claim 12; or (b)The nucleic acid construct of claim 13; or (b)The recombinant host cell of claim 14;preferably, the method further comprises: a glycosyl donor carrying a rhamnose group; more preferably, the glycosyl donor comprises: uridine diphosphate-rhamnose, guanosine diphosphate-rhamnose, adenosine diphosphate-rhamnose, cytidine diphosphate-rhamnose, thymidine diphosphate-rhamnose;preferably, the method further comprises: tetracyclic triterpene compounds are reactive precursors.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110871374.0A CN115678952A (en) | 2021-07-30 | 2021-07-30 | Highly specific glycosyltransferase for rhamnose and application thereof |
CN2021108713740 | 2021-07-30 | ||
PCT/CN2022/109355 WO2023006109A1 (en) | 2021-07-30 | 2022-08-01 | Highly specific glycosyltransferase for rhamnose, and use thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
CN117897480A true CN117897480A (en) | 2024-04-16 |
Family
ID=85058549
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110871374.0A Pending CN115678952A (en) | 2021-07-30 | 2021-07-30 | Highly specific glycosyltransferase for rhamnose and application thereof |
CN202280053383.1A Pending CN117897480A (en) | 2021-07-30 | 2022-08-01 | Mouse Li Tanggao degree specific glycosyltransferase and application thereof |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110871374.0A Pending CN115678952A (en) | 2021-07-30 | 2021-07-30 | Highly specific glycosyltransferase for rhamnose and application thereof |
Country Status (3)
Country | Link |
---|---|
KR (1) | KR20240032944A (en) |
CN (2) | CN115678952A (en) |
WO (1) | WO2023006109A1 (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116286712B (en) * | 2023-05-11 | 2023-08-25 | 中国中医科学院中药研究所 | Rhamnosyl transferase mutant, coding gene, preparation method and application |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
IL128193A (en) * | 1999-01-22 | 2008-03-20 | Yeda Res & Dev | Rhamnosyl-transferase gene and uses thereof |
JP6165875B2 (en) * | 2012-12-06 | 2017-07-19 | 上海 インスティテューツ フォー バイオロジカル サイエンシーズ、チャイニーズ アカデミー オブ サイエンシーズShanghai Institutes For Biological Sciences, Chinese Academy Of Sciences | A series of glycosyltransferases and their applications |
CN104232723B (en) * | 2013-06-07 | 2021-03-26 | 中国科学院分子植物科学卓越创新中心 | Group of glycosyltransferases and application thereof |
WO2015188742A2 (en) * | 2014-06-09 | 2015-12-17 | 中国科学院上海生命科学研究院 | Group of glycosyltransferases and use thereof |
CN105985938A (en) * | 2015-01-30 | 2016-10-05 | 中国科学院上海生命科学研究院 | Glycosyl transferase mutant protein and applications thereof |
CN112831481B (en) * | 2019-11-22 | 2024-01-19 | 生合万物(上海)生物科技有限公司 | Glycosyltransferase and method for catalyzing sugar chain extension |
-
2021
- 2021-07-30 CN CN202110871374.0A patent/CN115678952A/en active Pending
-
2022
- 2022-08-01 WO PCT/CN2022/109355 patent/WO2023006109A1/en active Application Filing
- 2022-08-01 KR KR1020247004280A patent/KR20240032944A/en unknown
- 2022-08-01 CN CN202280053383.1A patent/CN117897480A/en active Pending
Also Published As
Publication number | Publication date |
---|---|
CN115678952A (en) | 2023-02-03 |
KR20240032944A (en) | 2024-03-12 |
WO2023006109A1 (en) | 2023-02-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP6479084B2 (en) | A series of glycosyltransferases and their applications | |
KR102338008B1 (en) | A group of UDP-glycosyltransferases and their applications to catalyze sugar chain extension | |
CN104232723B (en) | Group of glycosyltransferases and application thereof | |
WO2015188742A2 (en) | Group of glycosyltransferases and use thereof | |
CN105087739B (en) | A kind of new catalyst system and its application for preparing rare ginsenoside | |
KR102418138B1 (en) | Glycosyltransferases, mutants and applications thereof | |
WO2021147575A1 (en) | New carbon glycoside glycosyltransferase and use thereof | |
WO2021164673A1 (en) | Bifunctional c-glycoside glycosyltransferases and application thereof | |
CN109796516B (en) | A method for synthesizing natural and unnatural protopanaxatriol type ginsenoside | |
CN108330111B (en) | Cytochrome P450 mutant protein and application thereof | |
CN117897480A (en) | Mouse Li Tanggao degree specific glycosyltransferase and application thereof | |
CN112831481B (en) | Glycosyltransferase and method for catalyzing sugar chain extension | |
CN114507646B (en) | Cytochrome P450 mutant protein and application thereof | |
CN113444703B (en) | Glycosyltransferase mutant for catalyzing sugar chain extension and application thereof | |
CN118048329A (en) | CYP98A20 protein participating in acteoside biosynthesis, and coding gene and application thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |