CN118048329A - CYP98A20 protein participating in acteoside biosynthesis, and coding gene and application thereof - Google Patents
CYP98A20 protein participating in acteoside biosynthesis, and coding gene and application thereof Download PDFInfo
- Publication number
- CN118048329A CN118048329A CN202211436961.8A CN202211436961A CN118048329A CN 118048329 A CN118048329 A CN 118048329A CN 202211436961 A CN202211436961 A CN 202211436961A CN 118048329 A CN118048329 A CN 118048329A
- Authority
- CN
- China
- Prior art keywords
- polypeptide
- cyp98a20
- equal
- acteoside
- amino acid
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 46
- QFRYQWYZSQDFOS-UHFFFAOYSA-N verbascoside Natural products CC1OC(COC2C(O)C(COC3OC(C(O)C(O)C3O)C(=O)O)OC(Oc4cc(O)cc5OC(=CC(=O)c45)c6ccc(O)c(O)c6)C2O)C(O)C(O)C1O QFRYQWYZSQDFOS-UHFFFAOYSA-N 0.000 title claims abstract description 33
- 229930185474 acteoside Natural products 0.000 title claims abstract description 29
- FBSKJMQYURKNSU-ZLSOWSIRSA-N acteoside Chemical compound O[C@@H]1[C@H](O)[C@@H](O)[C@H](C)O[C@H]1O[C@H]1[C@H](OC(=O)\C=C\C=2C=C(O)C(O)=CC=2)[C@@H](CO)O[C@@H](OCCC=2C=C(O)C(O)=CC=2)[C@@H]1O FBSKJMQYURKNSU-ZLSOWSIRSA-N 0.000 title claims abstract description 29
- FBSKJMQYURKNSU-UKQWSTALSA-N acteoside I Natural products C[C@@H]1O[C@H](O[C@@H]2[C@@H](O)[C@H](OCCc3ccc(O)c(O)c3)O[C@H](CO)[C@H]2OC(=O)C=Cc4ccc(O)c(O)c4)[C@H](O)[C@H](O)[C@H]1O FBSKJMQYURKNSU-UKQWSTALSA-N 0.000 title claims abstract description 29
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 23
- 230000015572 biosynthetic process Effects 0.000 title description 11
- 241000196324 Embryophyta Species 0.000 claims abstract description 15
- 235000003434 Sesamum indicum Nutrition 0.000 claims abstract description 13
- 229910052799 carbon Inorganic materials 0.000 claims abstract description 12
- YCCILVSKPBXVIP-UHFFFAOYSA-N 2-(4-hydroxyphenyl)ethanol Chemical compound OCCC1=CC=C(O)C=C1 YCCILVSKPBXVIP-UHFFFAOYSA-N 0.000 claims abstract description 10
- 238000005805 hydroxylation reaction Methods 0.000 claims abstract description 9
- 239000001606 7-[(2S,3R,4S,5S,6R)-4,5-dihydroxy-6-(hydroxymethyl)-3-[(2S,3R,4R,5R,6S)-3,4,5-trihydroxy-6-methyloxan-2-yl]oxyoxan-2-yl]oxy-5-hydroxy-2-(4-hydroxyphenyl)chroman-4-one Substances 0.000 claims abstract description 5
- DBLDQZASZZMNSL-QMMMGPOBSA-N L-tyrosinol Natural products OC[C@@H](N)CC1=CC=C(O)C=C1 DBLDQZASZZMNSL-QMMMGPOBSA-N 0.000 claims abstract description 5
- DFPMSGMNTNDNHN-ZPHOTFPESA-N naringin Chemical compound O[C@@H]1[C@H](O)[C@@H](O)[C@H](C)O[C@H]1O[C@H]1[C@H](OC=2C=C3O[C@@H](CC(=O)C3=C(O)C=2)C=2C=CC(O)=CC=2)O[C@H](CO)[C@@H](O)[C@@H]1O DFPMSGMNTNDNHN-ZPHOTFPESA-N 0.000 claims abstract description 5
- 229940052490 naringin Drugs 0.000 claims abstract description 5
- 229930019673 naringin Natural products 0.000 claims abstract description 5
- 235000004330 tyrosol Nutrition 0.000 claims abstract description 5
- 239000002253 acid Substances 0.000 claims abstract description 4
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 63
- 229920001184 polypeptide Polymers 0.000 claims description 60
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 60
- 210000004027 cell Anatomy 0.000 claims description 46
- 238000000034 method Methods 0.000 claims description 37
- 238000006243 chemical reaction Methods 0.000 claims description 24
- 239000013598 vector Substances 0.000 claims description 23
- 108091033319 polynucleotide Proteins 0.000 claims description 19
- 102000040430 polynucleotide Human genes 0.000 claims description 19
- 239000002157 polynucleotide Substances 0.000 claims description 19
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims description 16
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 14
- 239000013612 plasmid Substances 0.000 claims description 13
- 239000013604 expression vector Substances 0.000 claims description 12
- 230000014509 gene expression Effects 0.000 claims description 8
- 241000700605 Viruses Species 0.000 claims description 7
- 230000001580 bacterial effect Effects 0.000 claims description 7
- 229930027945 nicotinamide-adenine dinucleotide Natural products 0.000 claims description 7
- 241000588724 Escherichia coli Species 0.000 claims description 6
- 239000000654 additive Substances 0.000 claims description 6
- 230000008569 process Effects 0.000 claims description 6
- ACFIXJIJDZMPPO-NNYOXOHSSA-N NADPH Chemical compound C1=CCC(C(=O)N)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]2[C@H]([C@@H](OP(O)(O)=O)[C@@H](O2)N2C3=NC=NC(N)=C3N=C2)O)O1 ACFIXJIJDZMPPO-NNYOXOHSSA-N 0.000 claims description 5
- 244000000231 Sesamum indicum Species 0.000 claims description 5
- 125000000539 amino acid group Chemical group 0.000 claims description 5
- 230000003197 catalytic effect Effects 0.000 claims description 5
- 238000006555 catalytic reaction Methods 0.000 claims description 5
- 230000002255 enzymatic effect Effects 0.000 claims description 5
- 230000033444 hydroxylation Effects 0.000 claims description 5
- 238000006467 substitution reaction Methods 0.000 claims description 5
- 108010076504 Protein Sorting Signals Proteins 0.000 claims description 4
- 230000000996 additive effect Effects 0.000 claims description 4
- 238000012217 deletion Methods 0.000 claims description 4
- 230000037430 deletion Effects 0.000 claims description 4
- 230000000694 effects Effects 0.000 claims description 4
- 210000004962 mammalian cell Anatomy 0.000 claims description 4
- 241001430294 unidentified retrovirus Species 0.000 claims description 4
- 241000238631 Hexapoda Species 0.000 claims description 3
- 150000001413 amino acids Chemical class 0.000 claims description 3
- 230000000295 complement effect Effects 0.000 claims description 3
- 108020001507 fusion proteins Proteins 0.000 claims description 3
- 102000037865 fusion proteins Human genes 0.000 claims description 3
- 210000005253 yeast cell Anatomy 0.000 claims description 3
- 238000007792 addition Methods 0.000 claims description 2
- 210000004102 animal cell Anatomy 0.000 claims description 2
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 claims description 2
- 238000012258 culturing Methods 0.000 claims description 2
- 230000002538 fungal effect Effects 0.000 claims description 2
- 230000002401 inhibitory effect Effects 0.000 claims description 2
- 230000010354 integration Effects 0.000 claims description 2
- BOPGDPNILDQYTO-NNYOXOHSSA-N nicotinamide-adenine dinucleotide Chemical compound C1=CCC(C(=O)N)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]2[C@H]([C@@H](O)[C@@H](O2)N2C3=NC=NC(N)=C3N=C2)O)O1 BOPGDPNILDQYTO-NNYOXOHSSA-N 0.000 claims description 2
- 239000001301 oxygen Substances 0.000 claims description 2
- 229910052760 oxygen Inorganic materials 0.000 claims description 2
- 230000035484 reaction time Effects 0.000 claims description 2
- 230000028327 secretion Effects 0.000 claims description 2
- 239000013605 shuttle vector Substances 0.000 claims description 2
- 239000000758 substrate Substances 0.000 claims description 2
- 241000220485 Fabaceae Species 0.000 claims 1
- 235000021374 legumes Nutrition 0.000 claims 1
- 102000002004 Cytochrome P-450 Enzyme System Human genes 0.000 abstract description 11
- 241000207961 Sesamum Species 0.000 abstract description 8
- 101710198130 NADPH-cytochrome P450 reductase Proteins 0.000 abstract description 7
- -1 phenethyl alcohol glycoside compounds Chemical class 0.000 abstract description 6
- 229930182470 glycoside Natural products 0.000 abstract description 5
- WRMNZCZEMHIOCP-UHFFFAOYSA-N 2-Phenylethanol Natural products OCCC1=CC=CC=C1 WRMNZCZEMHIOCP-UHFFFAOYSA-N 0.000 abstract description 4
- 239000004480 active ingredient Substances 0.000 abstract description 4
- 230000001105 regulatory effect Effects 0.000 abstract description 4
- KDSWDGKIENPKLB-QJDQKFITSA-N verbascoside Chemical compound O[C@@H]1[C@H](O)[C@@H](O)[C@H](C)O[C@H]1O[C@H]1[C@H](OC(=O)CCC=2C=C(O)C(O)=CC=2)[C@@H](CO)O[C@@H](OCCC=2C=C(O)C(O)=CC=2)[C@@H]1O KDSWDGKIENPKLB-QJDQKFITSA-N 0.000 abstract description 4
- 125000004432 carbon atom Chemical group C* 0.000 abstract 3
- 239000002773 nucleotide Substances 0.000 description 16
- 125000003729 nucleotide group Chemical group 0.000 description 16
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 15
- 239000012634 fragment Substances 0.000 description 13
- 108020004414 DNA Proteins 0.000 description 12
- 102000004190 Enzymes Human genes 0.000 description 12
- 108090000790 Enzymes Proteins 0.000 description 12
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 12
- 239000000243 solution Substances 0.000 description 9
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 8
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 8
- 210000003527 eukaryotic cell Anatomy 0.000 description 8
- 230000003228 microsomal effect Effects 0.000 description 8
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 7
- 108091028043 Nucleic acid sequence Proteins 0.000 description 7
- 239000002299 complementary DNA Substances 0.000 description 7
- 239000008103 glucose Substances 0.000 description 7
- 108010015742 Cytochrome P-450 Enzyme System Proteins 0.000 description 6
- 238000007796 conventional method Methods 0.000 description 6
- 238000001514 detection method Methods 0.000 description 6
- 230000014759 maintenance of location Effects 0.000 description 6
- 239000002609 medium Substances 0.000 description 6
- YWJRHENUFPWISM-UHFFFAOYSA-N 3'-alpha-L-rhamnopyranoside Natural products OC1C(O)C(O)C(C)OC1OC1C(OC(=O)C=CC=2C=C(O)C(O)=CC=2)C(CO)OC(OCCC=2C=CC(O)=CC=2)C1O YWJRHENUFPWISM-UHFFFAOYSA-N 0.000 description 5
- 238000010367 cloning Methods 0.000 description 5
- 239000003623 enhancer Substances 0.000 description 5
- 238000000589 high-performance liquid chromatography-mass spectrometry Methods 0.000 description 5
- 239000000543 intermediate Substances 0.000 description 5
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 4
- 238000012408 PCR amplification Methods 0.000 description 4
- 238000010521 absorption reaction Methods 0.000 description 4
- 239000011324 bead Substances 0.000 description 4
- 239000000872 buffer Substances 0.000 description 4
- 230000002950 deficient Effects 0.000 description 4
- 239000011521 glass Substances 0.000 description 4
- 238000000338 in vitro Methods 0.000 description 4
- 239000007788 liquid Substances 0.000 description 4
- 150000007523 nucleic acids Chemical group 0.000 description 4
- 210000001236 prokaryotic cell Anatomy 0.000 description 4
- 238000010188 recombinant method Methods 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- 238000003786 synthesis reaction Methods 0.000 description 4
- 229940035893 uracil Drugs 0.000 description 4
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 3
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical group CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 3
- 241000219195 Arabidopsis thaliana Species 0.000 description 3
- 108091026890 Coding region Proteins 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- PXAALZXMCCZTKN-UHFFFAOYSA-N Syringalide A Natural products OC1C(O)C(OC(=O)C=CC=2C=C(O)C(O)=CC=2)C(CO)OC1OCCC1=CC=C(O)C=C1 PXAALZXMCCZTKN-UHFFFAOYSA-N 0.000 description 3
- PXAALZXMCCZTKN-WDIXTDMMSA-N [(2r,3s,4r,5r,6r)-4,5-dihydroxy-2-(hydroxymethyl)-6-[2-(4-hydroxyphenyl)ethoxy]oxan-3-yl] (e)-3-(3,4-dihydroxyphenyl)prop-2-enoate Chemical compound O([C@@H]1O[C@@H]([C@H]([C@H](O)[C@H]1O)OC(=O)\C=C\C=1C=C(O)C(O)=CC=1)CO)CCC1=CC=C(O)C=C1 PXAALZXMCCZTKN-WDIXTDMMSA-N 0.000 description 3
- 230000004071 biological effect Effects 0.000 description 3
- 150000001875 compounds Chemical class 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 238000004128 high performance liquid chromatography Methods 0.000 description 3
- 238000004519 manufacturing process Methods 0.000 description 3
- 210000001589 microsome Anatomy 0.000 description 3
- 230000010076 replication Effects 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 238000012163 sequencing technique Methods 0.000 description 3
- 238000001179 sorption measurement Methods 0.000 description 3
- 239000006228 supernatant Substances 0.000 description 3
- 238000013518 transcription Methods 0.000 description 3
- 230000035897 transcription Effects 0.000 description 3
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 108091033380 Coding strand Proteins 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- SHZGCJCMOBCMKK-UHFFFAOYSA-N D-mannomethylose Natural products CC1OC(O)C(O)C(O)C1O SHZGCJCMOBCMKK-UHFFFAOYSA-N 0.000 description 2
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 2
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 2
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
- 108010045510 NADPH-Ferrihemoprotein Reductase Proteins 0.000 description 2
- 241000207960 Pedaliaceae Species 0.000 description 2
- 108020004511 Recombinant DNA Proteins 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 230000000844 anti-bacterial effect Effects 0.000 description 2
- 230000003110 anti-inflammatory effect Effects 0.000 description 2
- 239000003963 antioxidant agent Substances 0.000 description 2
- 230000003078 antioxidant effect Effects 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 230000009089 cytolysis Effects 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 238000006911 enzymatic reaction Methods 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 235000013305 food Nutrition 0.000 description 2
- 230000004927 fusion Effects 0.000 description 2
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 2
- 239000010931 gold Substances 0.000 description 2
- 229910052737 gold Inorganic materials 0.000 description 2
- 239000005090 green fluorescent protein Substances 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 150000002500 ions Chemical class 0.000 description 2
- 238000004895 liquid chromatography mass spectrometry Methods 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 2
- 229910052757 nitrogen Inorganic materials 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 102000039446 nucleic acids Human genes 0.000 description 2
- 108020004707 nucleic acids Proteins 0.000 description 2
- 229930000223 plant secondary metabolite Natural products 0.000 description 2
- 239000000047 product Substances 0.000 description 2
- 238000003259 recombinant expression Methods 0.000 description 2
- 239000002904 solvent Substances 0.000 description 2
- 125000001424 substituent group Chemical group 0.000 description 2
- 241000701161 unidentified adenovirus Species 0.000 description 2
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 1
- 101150005771 ATR1 gene Proteins 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- 101000998379 Arabidopsis thaliana NADPH-cytochrome P450 reductase 1 Proteins 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 108091062157 Cis-regulatory element Proteins 0.000 description 1
- 241000336291 Cistanche deserticola Species 0.000 description 1
- 230000006820 DNA synthesis Effects 0.000 description 1
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- 101000579123 Homo sapiens Phosphoglycerate kinase 1 Proteins 0.000 description 1
- 241000207832 Lamiales Species 0.000 description 1
- 101710118186 Neomycin resistance protein Proteins 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- KJWZYMMLVHIVSU-IYCNHOCDSA-N PGK1 Chemical compound CCCCC[C@H](O)\C=C\[C@@H]1[C@@H](CCCCCCC(O)=O)C(=O)CC1=O KJWZYMMLVHIVSU-IYCNHOCDSA-N 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 239000001888 Peptone Substances 0.000 description 1
- 108010080698 Peptones Proteins 0.000 description 1
- 102100028251 Phosphoglycerate kinase 1 Human genes 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- 238000003559 RNA-seq method Methods 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 241000405414 Rehmannia Species 0.000 description 1
- ILRCGYURZSFMEG-UHFFFAOYSA-N Salidroside Natural products OC1C(O)C(O)C(CO)OC1OCCC1=CC=C(O)C=C1 ILRCGYURZSFMEG-UHFFFAOYSA-N 0.000 description 1
- 241000293869 Salmonella enterica subsp. enterica serovar Typhimurium Species 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- 235000009367 Sesamum alatum Nutrition 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- 229930194834 Syringalide Natural products 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- 108010022394 Threonine synthase Proteins 0.000 description 1
- 102000006601 Thymidine Kinase Human genes 0.000 description 1
- 108020004440 Thymidine kinase Proteins 0.000 description 1
- 239000007984 Tris EDTA buffer Substances 0.000 description 1
- 238000005377 adsorption chromatography Methods 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 230000003698 anagen phase Effects 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 230000001093 anti-cancer Effects 0.000 description 1
- 230000000840 anti-viral effect Effects 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 108091007433 antigens Proteins 0.000 description 1
- 102000036639 antigens Human genes 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- 238000000889 atomisation Methods 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 230000001851 biosynthetic effect Effects 0.000 description 1
- 238000010804 cDNA synthesis Methods 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 235000011148 calcium chloride Nutrition 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 229940041514 candida albicans extract Drugs 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- 238000000975 co-precipitation Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 239000002989 correction material Substances 0.000 description 1
- 230000001120 cytoprotective effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 102000004419 dihydrofolate reductase Human genes 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 239000003480 eluent Substances 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 238000005538 encapsulation Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 150000002148 esters Chemical class 0.000 description 1
- 238000010195 expression analysis Methods 0.000 description 1
- 235000019253 formic acid Nutrition 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 238000010230 functional analysis Methods 0.000 description 1
- 239000007789 gas Substances 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 125000002791 glucosyl group Chemical group C1([C@H](O)[C@@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 1
- 150000002338 glycosides Chemical class 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 230000014726 immortalization of host cell Effects 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 238000004811 liquid chromatography Methods 0.000 description 1
- 229910001629 magnesium chloride Inorganic materials 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000010297 mechanical methods and process Methods 0.000 description 1
- 201000001441 melanoma Diseases 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 238000012269 metabolic engineering Methods 0.000 description 1
- 230000037353 metabolic pathway Effects 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 239000002808 molecular sieve Substances 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 229930014626 natural product Natural products 0.000 description 1
- 230000004112 neuroprotection Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 230000003204 osmotic effect Effects 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 235000019319 peptone Nutrition 0.000 description 1
- 150000007965 phenolic acids Chemical group 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 238000004321 preservation Methods 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 239000012264 purified product Substances 0.000 description 1
- 238000003908 quality control method Methods 0.000 description 1
- 238000004153 renaturation Methods 0.000 description 1
- 238000010839 reverse transcription Methods 0.000 description 1
- ILRCGYURZSFMEG-RQICVUQASA-N salidroside Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)OC1OCCC1=CC=C(O)C=C1 ILRCGYURZSFMEG-RQICVUQASA-N 0.000 description 1
- 238000005185 salting out Methods 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- URGAHOPLAPQHLN-UHFFFAOYSA-N sodium aluminosilicate Chemical compound [Na+].[Al+3].[O-][Si]([O-])=O.[O-][Si]([O-])=O URGAHOPLAPQHLN-UHFFFAOYSA-N 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- UYCAUPASBSROMS-AWQJXPNKSA-M sodium;2,2,2-trifluoroacetate Chemical compound [Na+].[O-][13C](=O)[13C](F)(F)F UYCAUPASBSROMS-AWQJXPNKSA-M 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 230000001954 sterilising effect Effects 0.000 description 1
- 238000004659 sterilization and disinfection Methods 0.000 description 1
- 238000001308 synthesis method Methods 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 238000003151 transfection method Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 238000005199 ultracentrifugation Methods 0.000 description 1
- 239000011534 wash buffer Substances 0.000 description 1
- 239000003643 water by type Substances 0.000 description 1
- 239000012138 yeast extract Substances 0.000 description 1
- 239000007222 ypd medium Substances 0.000 description 1
Landscapes
- Enzymes And Modification Thereof (AREA)
Abstract
The invention belongs to the technical field of biology, and particularly provides cytochrome P450 monooxygenase CYP98A20 protein synthesized by acteoside, and a coding gene and application thereof. The CYP98A20 protein can catalyze the hydroxylation reaction of the 3-position carbon atom of coumaric acid or the 3-position carbon atom of tyrosol in the naringin B, and then catalyze the hydroxylation reaction of the other 3-position carbon atom, so that the acteoside is generated. The invention has important theoretical and practical significance for regulating and producing plant phenethyl alcohol glycoside compounds and improving the content of the phenethyl alcohol glycoside active ingredient verbascoside and derivatives thereof in sesame by biotechnology.
Description
Technical Field
The invention belongs to the technical field of biology, and particularly relates to CYP98A20 protein participating in acteoside biosynthesis, and a coding gene and application thereof.
Background
Phenylethanoid glycosides (PhGs) are an important class of plant natural products in the kingdom of plants. PhGs has various biological properties such as neuroprotection, anti-inflammatory, antioxidant, antibacterial, antiviral, etc. The core structure is characterized in that the hydroxyphenylethyl is connected to beta-glucopyranose. The glucose moiety is typically modified with phenolic acid and various sugar substituents via ester or glycosidic linkages. At least 572 PhGs have been identified, most of which are isolated from medicinal plants. The acteoside is a hydroxylglycoside residue formed by concentrating caffeoyl and rhamnosyl, is one of the best-known PhGs due to the potential benefit of the acteoside to human health, and confirms that the quality control standard of rehmannia and cistanche deserticola is acteoside in Chinese pharmacopoeia. Acteoside has been found in more than 200 species worldwide, most of which belong to the order cheiliales (Lamiales). A large number of drug studies have shown that acteoside has a wide range of biological activities, such as antioxidant, antibacterial, anti-inflammatory, cytoprotective, anticancer, etc. Sesame (school name: sesamum indicum) is an annual upright herb of the genus Sesamum of the family Pedaliaceae.
With the development of molecular biology, physiological biochemistry and other subjects, the research on plant secondary metabolites is also more and more advanced. However, plant secondary metabolites are of various kinds, different structures, and secondary metabolic pathways are also diverse and complex, many of which are currently unknown or are merely known about the general route of synthetic pathways (Yazaki K.2017, plant Biotechnol,34 (3): 131-142). Biosynthetic enzymes that catalyze the production of acteoside from artocarside B have not been reported. Therefore, there is an urgent need to develop a clone-related enzyme that can increase the content of a target ingredient or directly produce an active ingredient or an intermediate.
Disclosure of Invention
The invention aims to provide CYP98A20 protein participating in acteoside biosynthesis, and a coding gene and application thereof.
In a first aspect of the invention there is provided an isolated cytochrome P450 monooxygenase CYP98a20 polypeptide selected from the group consisting of:
a) A polypeptide having the amino acid sequence shown in SEQ ID NO. 1;
b) A derivative protein which is formed by substitution, deletion or addition of one or a plurality of amino acid residues, preferably 1 to 50, more preferably 1 to 30, still more preferably 1 to 10, most preferably 1 to 6, 1, 2 or 3 amino acids of the amino acid sequence shown in SEQ ID NO.1 and has catalytic activity of cinnarin A;
(c) A derivative protein comprising the protein sequence of (a) or (b) in the sequence;
c) The homology of the amino acid sequence with the amino acid sequence shown in SEQ ID NO. 1 is more than or equal to 65%, preferably more than or equal to 70%, preferably more than or equal to 75%, preferably more than or equal to 80%, preferably more than or equal to 85%, more preferably more than or equal to 90%, preferably more than or equal to 95%, preferably more than or equal to 98%, preferably more than or equal to 99%, and has the derivative protein catalyzing the activity of the cinnarin B.
In another preferred embodiment, the sequence (c) is a fusion protein formed by adding a tag sequence, a signal sequence or a secretion signal sequence to the sequence (a) or (b).
In another preferred embodiment, the CYP98A20 polypeptide is derived from the family Pedaliaceae, preferably from a plant selected from the group consisting of sesame.
In another preferred embodiment, the amino acid sequence of the CYP98A20 polypeptide is shown in SEQ ID NO. 1.
In a second aspect, the invention provides an isolated polynucleotide selected from the group consisting of:
(a) A nucleotide sequence encoding a CYP82AR2 polypeptide as set forth in SEQ ID No. 1;
(b) A nucleotide sequence as set forth in SEQ ID NO. 2;
(c) A nucleotide sequence having a homology of greater than or equal to 75% (preferably greater than or equal to 80%, more preferably greater than or equal to 90%) to the sequence shown in SEQ ID NOs.2;
(d) A nucleotide sequence of 1 to 60 (preferably 1 to 30, more preferably 1 to 10) nucleotides truncated or added at the 5 'and/or 3' end of the nucleotide sequence shown in SEQ ID nos.;
(e) A nucleotide sequence complementary (preferably fully complementary) to the nucleotide sequence of any one of (a) - (d).
In another preferred embodiment, the sequence of the nucleotide is set forth in SEQ ID NOs.2.
In another preferred embodiment, the polynucleotide having the sequence set forth in SEQ ID NOs.2 encodes a polypeptide having the amino acid sequence set forth in SEQ ID NOs.1.
In a third aspect, the invention provides a vector comprising a polynucleotide according to the second aspect of the invention.
In another preferred embodiment, the carrier is selected from the group consisting of: expression vectors, shuttle vectors, integration vectors, or combinations thereof.
In another preferred embodiment, the carrier is selected from the group consisting of: bacterial plasmids, phage, yeast plasmids, plant cell viruses, animal cell viruses, retroviruses, or combinations thereof.
In another preferred example, the vector includes a vector expressed in yeast, such as a pESC-series vector, a pYES-series vector, a pUG-series vector, a pSH-series vector, a pRS-series vector.
In a fourth aspect, the invention provides a genetically engineered host cell comprising a vector according to the third aspect of the invention, or having integrated into its genome a polynucleotide according to the second aspect of the invention.
In another preferred embodiment, the host cell is a prokaryotic cell or a eukaryotic cell. Further preferably, the host cell is a lower eukaryotic cell, such as a yeast cell. Alternatively, the host cell is a higher eukaryotic cell, such as a mammalian cell. Alternatively, the host cell is a prokaryotic cell, such as a bacterial cell, preferably E.coli. More preferably, the host cell may also be selected from higher plants, insect cells.
Particularly preferably, the host cell is selected from the group consisting of: saccharomyces cerevisiae, E.coli. Particularly preferably, the host cell is a Saccharomyces cerevisiae cell.
In a fifth aspect, the invention provides a method of producing a CYP98a20 polypeptide, said method comprising:
(a) Culturing the host cell of the fourth aspect of the invention under conditions suitable for expression;
(b) Isolating the CYP98A20 polypeptide from the culture.
In a sixth aspect, the present invention provides the use of a CYP98a20 polypeptide according to the first aspect of the present invention or a derivative thereof, a vector according to the third aspect of the present invention, or a host cell according to the fourth aspect of the present invention, for catalyzing the following reaction, or for preparing a catalytic formulation for catalyzing the following reaction, for catalyzing the hydroxylation of the 3-carbon atom of coumaric acid or the 3-carbon atom of tyrosol in euonylbumin B, followed by catalyzing the hydroxylation of the other 3-carbon atom to produce acteoside.
The seventh aspect of the present invention provides a method for preparing a acteoside, comprising:
In the presence of the polypeptide or the derivative polypeptide thereof in the first aspect of the invention, catalyzing hydroxylation reaction on the 3-carbon atom of coumaric acid or the 3-carbon atom of tyrosol in the naringin B, and then catalyzing hydroxylation reaction of the other 3-carbon atom, thereby obtaining acteoside; the catalytic reaction process is shown in figure 1 below.
Specifically, the polypeptide and the derivative polypeptide thereof are added into catalytic reaction at the same time. In another preferred embodiment, the method is performed in the presence of a cofactor, preferably NADPH and/or NADH.
In a further preferred embodiment, the cofactor is used in an amount of 0.5-6.0 mM, preferably 0.5-5.0 mM, more preferably 1.0-3.0 mM.
In a particularly preferred mode, the process is carried out in the presence of oxygen.
In a further preferred embodiment, the method further comprises: an additive for regulating the activity of the enzyme is provided to the reaction system. Further preferably, the additive for regulating the enzymatic activity is: the additives for increasing or inhibiting the enzymatic activity, more specifically the additives for modulating the enzymatic activity, are selected from the group consisting of: ca 2+、Co2+、Mn2+、Ba2+、Al3+、Ni2 +、Zn2+, or Fe 2+.
In a particularly preferred embodiment, the pH of the reaction system is: 6.5-8.5, preferably pH 7.4-7.6. In a specific embodiment, the temperature of the reaction system is: 25 ℃ to 35 ℃, preferably 28 ℃ to 30 ℃.
In a particularly preferred embodiment, the reaction time is from 0.5 h to 24: 24 h, preferably from 1h to 10: 10 h, more preferably from 3: 3h to 6: 6 h.
The invention firstly digs and separates the artocarpine B cytochrome P450 monooxygenase CYP98A20 from the sesame (Sesamum indicum) transcriptome, and verifies that the artocarpine B cytochrome P450 monooxygenase CYP98A20 is a key enzyme in the biosynthesis process of acteoside; CYP98A20 catalyzes the production of acteoside from artocarside B. Furthermore, the invention has important theoretical and practical significance for regulating and producing plant phenethyl alcohol glycoside compounds and improving the content of the phenethyl alcohol glycoside active ingredient verbascoside and derivatives thereof in sesame by biotechnology.
Drawings
FIG. 1 shows the chemical structure and catalytic process of the glycosylation of naringin B, monohydroxy intermediate, and acteoside.
FIG. 2 shows the results of the CYP98A20 crude enzyme reaction assay.
FIG. 3 shows the results of CYP98A20 microsomal assay.
FIG. 4 shows the LC-MS detection results of CYP98A20 catalyzed reactions.
Detailed Description
Through intensive research on acteoside biosynthesis, the inventor firstly separates a cytochrome P450 monooxygenase CYP98A20 protein synthesized by acteoside biosynthesis. Specifically, the CYP98A20 protein can catalyze the naringin B to generate the acteoside. The cloning and functional research of the cytochrome P450 monooxygenase gene are key to analyzing the biosynthesis path of acteoside and derivatives in sesame, and bring wide application space for improving the content of target components or directly producing active ingredients or intermediates by using biotechnology. The present invention has been completed on the basis of this finding.
Definition of the definition
As used herein, the terms "active polypeptide", "polypeptide of the invention and its derivatives", "enzyme of the invention", "CYP 98a20 of the invention" all refer to CYP98a20 (SEQ ID No.: 1) polypeptides and their derivatives.
As used herein, an "isolated polypeptide" means that the polypeptide is substantially free of other proteins, lipids, carbohydrates, or other substances with which it is naturally associated. The person skilled in the art is able to purify the polypeptides using standard protein purification techniques. Substantially pure polypeptides can produce a single main band on a non-reducing polyacrylamide gel. The purity of the polypeptide can also be further analyzed by amino acid sequence.
The active polypeptide of the present invention may be a recombinant polypeptide, a natural polypeptide, a synthetic polypeptide. The polypeptides of the invention may be naturally purified products, or chemically synthesized products, or produced from prokaryotic or eukaryotic hosts (e.g., bacteria, yeast, plants) using recombinant techniques.
The invention also includes fragments, derivatives and analogues of the polypeptides. As used herein, the terms "fragment," "derivative," and "analog" refer to a polypeptide that retains substantially the same biological function or activity as the polypeptide.
The polypeptide fragments, derivatives or analogues of the invention may be (i) polypeptides having one or more conserved or non-conserved amino acid residues, preferably conserved amino acid residues, substituted, which may or may not be encoded by the genetic code, or (ii) polypeptides having a substituent in one or more amino acid residues, or (iii) polypeptides formed by fusion of a mature polypeptide with another compound, such as a compound that extends the half-life of the polypeptide, for example polyethylene glycol, or (iv) polypeptides formed by fusion of an additional amino acid sequence to the polypeptide sequence, such as a leader or secretory sequence or a sequence used to purify the polypeptide or a proprotein sequence, or fusion proteins with the formation of an antigen IgG fragment. Such fragments, derivatives and analogs are within the purview of one skilled in the art and would be well known in light of the teachings herein.
The polynucleotides of the invention may be in the form of DNA or RNA. DNA forms include cDNA, genomic DNA, or synthetic DNA. The DNA may be single-stranded or double-stranded. The DNA may be a coding strand or a non-coding strand. The coding region sequence encoding the mature polypeptide may be identical to the coding region sequence set forth in SEQ ID NO. 1 or a degenerate variant.
The term "polynucleotide encoding a polypeptide" may include polynucleotides encoding the polypeptide, or may include additional coding and/or non-coding sequences.
The invention also relates to variants of the above polynucleotides which encode polypeptides having the same amino acid sequence as the invention or fragments, analogs and derivatives of the polypeptides. Variants of the polynucleotide may be naturally occurring allelic variants or non-naturally occurring variants. Such nucleotide variants include substitution variants, deletion variants and insertion variants. As known in the art, an allelic variant is a substitution of a polynucleotide, which may be a substitution, deletion, or insertion of one or more nucleotides, without substantially altering the function of the encoded polypeptide.
The invention also relates to polynucleotides which hybridize to the sequences described above and which have at least 50%, preferably at least 70%, more preferably at least 80% identity between the two sequences. The invention relates in particular to polynucleotides which hybridize under stringent conditions (or stringent conditions) to the polynucleotides of the invention.
The invention also relates to nucleic acid fragments which hybridize to the sequences described above. As used herein, a "nucleic acid fragment" is at least 15 nucleotides, preferably at least 30 nucleotides, more preferably at least 50 nucleotides, and most preferably at least 100 nucleotides or more in length. The nucleic acid fragments may be used in nucleic acid amplification techniques (e.g., PCR) to determine and/or isolate polynucleotides encoding the CYP98a20 protein.
The polypeptides and polynucleotides of the invention are preferably provided in isolated form, and more preferably purified to homogeneity.
The full-length nucleotide sequence or a fragment thereof of the present invention can be usually obtained by a PCR amplification method, a recombinant method or an artificial synthesis method. For the PCR amplification method, primers can be designed according to the nucleotide sequences disclosed in the present invention, particularly the open reading frame sequences, and amplified to obtain the relevant sequences using a commercially available cDNA library or a cDNA library prepared according to a conventional method known to those skilled in the art as a template. When the sequence is longer, it is often necessary to perform two or more PCR amplifications, and then splice the amplified fragments together in the correct order.
Once the relevant sequences are obtained, recombinant methods can be used to obtain the relevant sequences in large quantities. This is usually done by cloning it into a vector, transferring it into a cell, and isolating the relevant sequence from the propagated host cell by conventional methods.
Furthermore, the sequences concerned, in particular fragments of short length, can also be synthesized by artificial synthesis. In general, fragments of very long sequences are obtained by first synthesizing a plurality of small fragments and then ligating them.
At present, it is already possible to obtain the DNA sequences encoding the proteins of the invention (or fragments or derivatives thereof) entirely by chemical synthesis. The DNA sequence can then be introduced into a variety of existing DNA molecules (or vectors, for example) and cells known in the art. In addition, mutations can be introduced into the protein sequences of the invention by chemical synthesis.
Methods of amplifying DNA/RNA using PCR techniques are preferred for obtaining the genes of the present invention. In particular, when it is difficult to obtain full-length cDNA from a library, it is preferable to use RACE method (RACE-cDNA end rapid amplification method), and primers for PCR can be appropriately selected according to the sequence information of the present invention disclosed herein and synthesized by a conventional method. The amplified DNA/RNA fragments can be isolated and purified by conventional methods, such as by gel electrophoresis.
The term "recombinant expression vector" refers to bacterial plasmids, phages, yeast plasmids, plant cell viruses, mammalian cell viruses such as adenoviruses, retroviruses or other vectors well known in the art. Any plasmid or vector may be used as long as it is replicable and stable in the host. An important feature of expression vectors is that they generally contain an origin of replication, a promoter, a marker gene and translational control elements.
Methods well known to those skilled in the art can be used to construct expression vectors containing the CYP98A20 polypeptide, coding DNA sequences, and appropriate transcriptional/translational control signals. These methods include in vitro recombinant DNA techniques, DNA synthesis techniques, in vivo recombinant techniques, and the like. The DNA sequence may be operably linked to an appropriate promoter in an expression vector to direct mRNA synthesis. Representative examples of these promoters are: the lac or trp promoter of E.coli; a lambda phage PL promoter; eukaryotic promoters include the CMV immediate early promoter, the HSV thymidine kinase promoter, the early and late SV40 promoters, LTRs from retroviruses, and other known promoters that control the expression of genes in prokaryotic or eukaryotic cells or viruses thereof. The expression vector also includes a ribosome binding site for translation initiation and a transcription terminator.
In addition, the expression vector preferably comprises one or more selectable marker genes to provide phenotypic traits for selection of transformed host cells, such as dihydrofolate reductase, neomycin resistance and Green Fluorescent Protein (GFP) for eukaryotic cell culture, or tetracycline or ampicillin resistance for E.coli.
Vectors comprising the appropriate DNA sequences as described above, as well as appropriate promoter or control sequences, may be used to transform appropriate host cells to enable expression of the protein.
The host cell may be a prokaryotic cell, such as a bacterial cell; or lower eukaryotic cells, such as yeast cells; or higher eukaryotic cells, such as mammalian cells. Representative examples are: coli, streptomyces; bacterial cells of salmonella typhimurium; fungal cells such as yeast; a plant cell; insect cells of Drosophila S2 or Sf 9; CHO, COS, 293 cells, or Bowes melanoma cells.
When the polynucleotide of the present invention is expressed in higher eukaryotic cells, transcription will be enhanced if an enhancer sequence is inserted into the vector. Enhancers are cis-acting elements of DNA, usually about 10 to 300 base pairs, that act on a promoter to increase the transcription of a gene. Examples include the SV40 enhancer 100 to 270 base pairs on the late side of the origin of replication, the polyoma enhancer on the late side of the origin of replication, and adenovirus enhancers.
It will be clear to a person of ordinary skill in the art how to select appropriate vectors, promoters, enhancers and host cells.
Transformation of host cells with recombinant DNA can be performed using conventional techniques well known to those skilled in the art. When the host is a prokaryote such as E.coli, competent cells, which are capable of absorbing DNA, can be obtained after an exponential growth phase and treated by the CaCl2 method using procedures well known in the art. Another approach is to use MgCl2. Transformation can also be performed by electroporation, if desired. When the host is eukaryotic, the following DNA transfection methods may be used: calcium phosphate co-precipitation, conventional mechanical methods such as microinjection, electroporation, liposome encapsulation, etc.
The transformant obtained can be cultured by a conventional method to express the polypeptide encoded by the gene of the present invention. The medium used in the culture may be selected from various conventional media depending on the host cell used. The culture is carried out under conditions suitable for the growth of the host cell. After the host cells have grown to the appropriate cell density, the selected promoters are induced by suitable means (e.g., temperature switching or chemical induction) and the cells are cultured for an additional period of time.
The recombinant polypeptide in the above method may be expressed in a cell, or on a cell membrane, or secreted outside the cell. If desired, the recombinant proteins can be isolated and purified by various separation methods using their physical, chemical and other properties. Such methods are well known to those skilled in the art. Examples of such methods include, but are not limited to: conventional renaturation treatment, treatment with a protein precipitant (salting-out method), centrifugation, osmotic sterilization, super-treatment, super-centrifugation, molecular sieve chromatography (gel filtration), adsorption chromatography, ion exchange chromatography, high Performance Liquid Chromatography (HPLC), and other various liquid chromatography techniques and combinations of these methods.
The invention will be further illustrated with reference to specific examples. These examples are only for illustrating the present invention and are not intended to limit the scope of the present invention.
The experimental procedure, in which the specific conditions are not noted in the following examples, is generally followed by routine conditions, such as for example Sambrook et al, molecular cloning: conditions described in the laboratory Manual (New York: cold Spring Harbor Laboratory Press, 1989) or as recommended by the manufacturer. Percentages and parts are weight percentages and parts unless otherwise indicated.
Materials and reagents
Lysis (lysis) buffer: contains 50mM NaH 2PO4 and 300mM NaCl, pH7.4.
TEK wash buffer: 50 mMTris-HCl (pH 7.5) buffer containing 0.1M KCl and 1mM EDTA.
TEG preservation buffer: TE buffer containing 20% (v/v) glycerol.
YPD medium: 10 g/L yeast extract, 20 g/L peptone, 20 g/L glucose, and if solid culture medium is prepared, adding 20 g/L agar powder;
SC-Ura medium: 6.7 g/L of an amino-free yeast nitrogen source, 2 g/L,20 g/L of glucose, 0.9 g/L SC Dropoutmix-Ura;
Saccharomyces cerevisiae BY4742 was used as the expression host for candidate P450 enzymes, the expression vector pESC-URA3 was purchased from the company Invitrogen, and the expression vector pCf302 was described in the following document :Jingjie Jiang, et al. "Metabolic Engineering of Saccharomyces cerevisiae for High-Level Production of Salidroside from Glucose."J. Agric. Food Chem. 2018, 66, 4431-4438.
The cinnaringin B is purchased from Chengdu plant-labeled pure biotechnology Co., ltd; NADPH is purchased from soribao biotechnology limited; acteoside was purchased from beijing merida technologies limited; reverse transcription kit TRANSSCRIPT ONE-step gDNA Removal AND CDNA SYNTHESIS Supermix was purchased from Beijing full gold Limited; the blunt end rapid cloning kit pEASY-Blunt Cloning Kit was purchased from Beijing all gold Limited;
Codon optimized Arabidopsis thaliana source AtCPR1 was synthesized by Shanghai JieRui Bioengineering Co.
EXAMPLE 1 acquisition of CYP98A20 protein and Gene encoding same
By sequence analysis of transcriptome data of sesame (Sesamum indicum) root, stem, leaf and petiole tissues, gene expression level analysis is performed to obtain candidate genes with expression trends such as (leaf > petiole > stem > root). Based on RNA-seq second generation transcriptome sequencing, gene differential expression analysis was performed on the sequencing data to find cytochrome P450 monooxygenase candidate genes. And (3) searching candidate genes with annotation information of P450 enzyme by taking |Log2 (ratio) |1 or more and FDR < 0.001 as a standard and meeting the condition of Max (FPKM) |10 or more, and finally determining 23 candidate genes of the P450 enzyme with complete open reading and expression trend such as leaf blade leaf stalk stem root. Designing primers according to the full-length ORF sequence of the candidate genes, carrying out PCR amplification by taking sesame cDNA as a template to obtain 23 full-length P450 enzyme candidate genes, constructing the 23 full-length P450 enzyme candidate genes on a saccharomyces cerevisiae expression vector carrying an arabidopsis thaliana-derived cytochrome P450 reductase gene (ATR 1), obtaining a yeast recombinant expression vector pCf-ATR 1-CYP carrying different candidate P450 enzyme genes, and carrying out subsequent activity screening to obtain CYP98A20.
Wherein the CYP98A20 primer sequence CYP98A20-5F:5, -ATGGCTTTACCTCTACTCATCC-3,; CYP98A20-3R:5, -TCACACATTTCCGGAAGCCACA-3, using reverse transcribed cDNA as template, PCR amplifying to obtain target gene product, using flat-end quick cloning kit to clone it on pEASY-Blunt cloning vector, selecting three monoclonals, sequencing them in Jinweizhi biotechnology Co-Ltd, PCR amplifying to obtain sequence identical to that obtained by transcriptome, its DNA coding sequence and protein amino acid sequence are respectively shown as SEQ ID NO 2 and SEQ ID NO 1.
EXAMPLE 2 CYP98A20 protein functional analysis
1. Construction of recombinant strains
Recombinant plasmid pCf-ATR 1-CYP98A20 was constructed. The pCf is taken as a starting vector, and the cytochrome P450 reductase gene (ATR 1) gene derived from arabidopsis thaliana synthesized after codon optimization is constructed to the promoter P TDH3 to obtain the recombinant plasmid pCf302-ATR1. The recombinant plasmid pCf-ATR 1-CYP98A20 is obtained by inserting a DNA sequence shown by SEQ ID NO. 2 into a promoter P PGK1 by taking the plasmid pCf-ATR 1 as a starting vector. For specific procedures see (Jingjie Jiang, et al J. Agric. Food chem. 2018, 66, 4431-4438.).
Converting the recombinant plasmid pCf-ATR 1-CYP98A20 constructed in the above into Saccharomyces cerevisiae BY4742 to obtain a recombinant strain BY4742-ATR1-CYP98A20; the plasmid pCf-ATR 1 carrying only the ATR1 gene was transformed into Saccharomyces cerevisiae BY4742 to obtain recombinant strain BY4742-pCf302-ATR1, which was used as a blank control strain.
2. Crude enzyme and microsomal reaction experiments
Single colonies of the recombinant strain BY4742-ATR1-CYP98A20 and the blank strain BY4742-pCf were inoculated into 3mL of a Ura (uracil) -deficient SC-Ura liquid medium containing 20g/L glucose, and cultured at 30℃and 220rpm for 24 hours, at 1:100 were inoculated into 50mL of a Ura (uracil) -deficient SC-Ura liquid medium containing 20g/L glucose, and cultured at 30℃and 220rpm for 36-48 hours, and the cells were collected. Glass bead extraction (adding 0.5g of 0.4-0.6 μm glass beads into 1ml of thallus, vortex shaking for 1min, standing on ice for 1min, repeating for 5 times) to obtain supernatant as crude enzyme. In vitro crude enzyme reaction (100. Mu.L crude enzyme, 50. Mu.L 50mM Tris-HCl, 1mM NADPH,100. Mu.M cinnaringin B) was carried out at 30℃for 6h. After the reaction is finished, the reaction is stopped by the methanol with the same volume, and the methanol is filtered by a microporous filter membrane with the size of 0.22 mu M for HPLC-MS detection.
Single colonies of the recombinant strain BY4742-ATR1-CYP98A20 and the blank strain BY4742-pCf were inoculated into 3mL of a Ura (uracil) -deficient SC-Ura liquid medium containing 20g/L glucose, and cultured at 30℃and 220rpm for 24 hours, at 1:100 were inoculated into 50mL of a Ura (uracil) -deficient SC-Ura liquid medium containing 20g/L glucose, and cultured at 30℃and 220rpm for 36-48 hours, and the cells were collected. Glass bead extraction (adding 0.5g of 0.4-0.6 μm glass beads into 1ml of thallus, vortex shaking for 1min, standing on ice for 1min, repeating for 5 times) to obtain supernatant. Yeast microsomes were extracted using ultracentrifugation (2 h centrifuged at 150000g with ultracentrifuge at 4 ℃, supernatant discarded, TEG buffer resuspended). In vitro microsomal reactions (100. Mu.L microsomes, 50. Mu.L 50mM Tris-HCl, 1mM NADPH,100. Mu.M cinnaringin B) were performed at 30℃for 6h. After the reaction is finished, the reaction is stopped by the methanol with the same volume, and the methanol is filtered by a microporous filter membrane with the size of 0.22 mu M for HPLC-MS detection. HPLC-MS was performed using the Agilent 1200 HPLC system series a Bruker-MicrOTOF-II mass spectrometer (Bruker, germany) system.
The HPLC detection parameters were as follows: column YMC-Pack ODS-A (4.6X1250 mm, 5. Mu.M); the eluent consists of a solution A and a solution B: solution A is 0.1% (v/v) formic acid aqueous solution, solution B is acetonitrile, the sample injection amount is 50 mu L, the detection wavelength is 312 nm, the flow rate is 1 mL/min, and the column temperature is 40 ℃. The gradient elution method is adopted: 0min, 5% (v/v) solution B;5-18 min,5% -13.2% (v/v) solution B;18-43 min,18% -21.6% (v/v) solution B; 43-50 min,100% (v/v) solution B;50-57min,5% (v/v) solution B. The mass spectrometry conditions were as follows: the mode is Electrospray (ESI) negative ions, the capillary voltage is-4500V, the atomization air pressure is 1 bar, the solvent removing gas is nitrogen, the flow rate is 6.0L/min, the solvent removing temperature and the ion source temperature are 180 ℃, the scanning range m/z is 50-1000, the LC-MS data collection software is MassLynx 4.0 (Waters, USA), and sodium trifluoroacetate is used as a correction fluid for accurate molecular weight.
As shown in FIG. 2, the retention time of the adsorption peak of the blank strain BY4742-pCf and that of the standard artocarpine B (I) (t R =39.27 min) are consistent, while the adsorption peak of the recombinant strain BY4742-ATR1-CYP98A20 in the crude enzyme and microsomal reaction system is weakened, and three new adsorption peaks appear at the position with increased polarity, wherein the position peak with the maximum polarity and that of the acteoside standard (IV) are consistent (t R =27.66 min), indicating that CYP98A20 can effectively convert the substrate artocarpine B to acteoside. The exact molecular weight of this compound (IV) ([ M-H ] - = 623.2054), as determined by HPLC-MS, was consistent with the molecular weight of the acteoside, and the conversion of microsomal reactions was found to be about 88.9%. At the same time, new peaks (II) and (III) also appear in the reaction of catalytic cinnarin B of CYP98a20, the retention times of the new peaks (III) and SYRINGALIDE A3 '- α -L-rhamnopyranoside are consistent (t R =32.12 min), and the exact molecular weight of (III) ([ M-H ] - = 607.2037) is determined to be SYRINGALIDE A3' - α -L-rhamnopyranoside by HPLC-MS detection (fig. 4). In addition, (II) the precise molecular weight ([ M-H ] - = 607.2039) suggests that CYP98a20 produces two monohydroxy intermediates (II, C3 position hydroxylation of tyrosol) and SYRINGALIDE A3' - α -L-rhamnopyranoside during the reaction that catalyzes the reaction of cinnarin B.
3. SYRINGALIDE A3 crude enzyme and microsomal reaction experiments of 3' -alpha-L-rhamnopyranoside
The CYP98A20 crude enzyme and microsomes extracted in the 2 nd point are used for the SYRINGALIDE A' -alpha-L-rhamnopyranoside in-vitro catalytic reaction. As shown in FIG. 3, the retention time of the SYRINGALIDE A ' - α -L-rhamnopyranoside (III) absorption peak of the blank strain BY4742-pCf302 and the retention time of the standard absorption peak (t R =32.12 min) are consistent, while the absorption peak of SYRINGALIDE A3 ' - α -L-rhamnopyranoside (III) in the crude enzyme and microsomal reaction system of the recombinant strain BY4742-ATR1-CYP98A20 is weakened, a new absorption peak appears at the position with increased polarity, the retention time is consistent with the retention time of the verbascoside standard (t R =27.66 min), which indicates that CYP98A20 can effectively convert the monohydroxy intermediate SYRINGALIDE A ' - α -L-rhamnopyranoside to form the verbascoside, and the conversion rate of microsomal reaction is calculated to be about 93.6%.
All documents mentioned in this disclosure are incorporated by reference in this disclosure as if each were individually incorporated by reference. Further, it will be appreciated that various changes and modifications may be made by those skilled in the art after reading the above teachings, and such equivalents are intended to fall within the scope of the application as defined in the appended claims.
Claims (10)
1. An isolated CYP98a20 polypeptide, wherein said polypeptide is selected from the group consisting of:
a) A polypeptide having an amino acid sequence as shown in SEQ ID NO. 1;
b) A derivative protein which is formed by substitution, deletion or addition of one or a plurality of amino acid residues, preferably 1 to 50, more preferably 1 to 30, still more preferably 1 to 10, most preferably 1 to 6, 1, 2 or 3 amino acids of the amino acid sequence shown in SEQ ID NO.1 and has catalytic activity of cinnarin A;
c) The homology of the amino acid sequence with the amino acid sequence shown in SEQ ID NO. 1 is more than or equal to 65%, preferably more than or equal to 70%, preferably more than or equal to 75%, preferably more than or equal to 80%, preferably more than or equal to 85%, more preferably more than or equal to 90%, preferably more than or equal to 95%, preferably more than or equal to 98%, preferably more than or equal to 99%, and has the derivative protein catalyzing the activity of the cinnarin B.
2. The polypeptide of claim 1, wherein the CYP98a20 polypeptide is from the family of the general fabaceae, preferably from a member selected from the group consisting of sesame.
3. A peptide derived from the polypeptide according to claim 1 or 2, wherein the polypeptide according to claim 1 or 2 is added with a tag sequence, a signal sequence or a secretion signal sequence to form a fusion protein.
4. An isolated polynucleotide, wherein the encoded amino acid is the isolated CYP98a20 polypeptide of any one of claims 1 to 3, or the complement thereof.
5. A gene vector comprising the polynucleotide of claim 3, preferably it is an expression vector, a shuttle vector, or an integration vector; or bacterial plasmids, phages, yeast plasmids, plant cell viruses, animal cell viruses, retroviruses.
6. A genetically engineered host cell comprising the vector of claim 4, or the polynucleotide of claim 3 integrated into its genome, preferably the host cell is selected from the group consisting of: bacterial cells, fungal cells, higher plant cells, insect or mammalian cells, more preferably yeast cells, E.coli.
7. A method of producing a CYP98a20 polypeptide, said method comprising:
(a) Culturing the host cell of claim 6 under conditions suitable for expression;
(b) Isolating the CYP98A20 polypeptide from the culture.
8. Use of a CYP98a20 polypeptide according to any one of claims 1 to 2 or a derivative peptide according to claim 3 or a gene encoding the same, for catalyzing the hydroxylation of the 3-carbon atom of coumaric acid or the 3-carbon atom of tyrosol in naringin B, followed by catalyzing the hydroxylation of another 3-carbon atom to produce acteoside.
9. A process for the preparation of a acteoside, characterized in that it comprises a catalytic reaction with artocarpine B as substrate in the presence of a CYP98a20 polypeptide according to any one of claims 1 to 2 or a derivative peptide according to claim 3, thereby obtaining acteoside.
10. The method of preparation according to claim 9, wherein the method is carried out in the presence of a cofactor, preferably NADPH and/or NADH, further preferably the cofactor is used in an amount of 0.5-6.0 mM, preferably 0.5-5.0 mM, more preferably 1.0-3.0 mM; preferably, the process is carried out in the presence of oxygen; preferably, an additive for increasing or inhibiting the enzymatic activity is also provided to the reaction system, more specifically said additive for modulating the enzymatic activity is selected from the group consisting of: ca 2+、Co2+、Mn2+、Ba2+、Al3+、Ni2+、Zn2+, or Fe 2+; further preferably, the pH of the reaction system is: 6.5 to 8.5, preferably pH 7.4 to 7.6, the temperature of the reaction system is: 25 ℃ to 35 ℃, preferably 28 ℃ to 30 ℃; preferably, the reaction time is from 0.5h to 24h, preferably from 1h to 10h, more preferably from 3h to 6h.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202211436961.8A CN118048329A (en) | 2022-11-16 | 2022-11-16 | CYP98A20 protein participating in acteoside biosynthesis, and coding gene and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202211436961.8A CN118048329A (en) | 2022-11-16 | 2022-11-16 | CYP98A20 protein participating in acteoside biosynthesis, and coding gene and application thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
CN118048329A true CN118048329A (en) | 2024-05-17 |
Family
ID=91045331
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202211436961.8A Pending CN118048329A (en) | 2022-11-16 | 2022-11-16 | CYP98A20 protein participating in acteoside biosynthesis, and coding gene and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN118048329A (en) |
-
2022
- 2022-11-16 CN CN202211436961.8A patent/CN118048329A/en active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107058446B (en) | Group of glycosyltransferases and application thereof | |
CN110225971B (en) | UDP-glycosyltransferase for catalyzing sugar chain extension and application thereof | |
CN104232723B (en) | Group of glycosyltransferases and application thereof | |
CN110551701B (en) | Carbonyl reductase mutant and application thereof in reduction of cyclopentadione compounds | |
CN115094046A (en) | Group of glycosyltransferases and application thereof | |
CN113136373B (en) | Carbonoside glycosyltransferase and application thereof | |
CN113265433B (en) | Bifunctional glycosyltransferase and application thereof | |
CN108330111B (en) | Cytochrome P450 mutant protein and application thereof | |
EP2573174B1 (en) | Ketoreductase mutant | |
WO2023006109A1 (en) | Highly specific glycosyltransferase for rhamnose, and use thereof | |
CN118048329A (en) | CYP98A20 protein participating in acteoside biosynthesis, and coding gene and application thereof | |
US20240035062A1 (en) | Cytochrome P450 Mutant Protein and Use Thereof | |
CN114277024B (en) | Novel triterpene synthase and application thereof | |
CN112831481B (en) | Glycosyltransferase and method for catalyzing sugar chain extension | |
CN112553175B (en) | Preparation and application of glycosyltransferase UGT76G1 mutant | |
CN113755458B (en) | CYP82AR2 protein involved in shikonin and/or acarnine biosynthesis, and encoding gene and application thereof | |
CN113755464B (en) | LrUGT2 protein involved in biosynthesis of cinnamyl leaf glycoside B and acteoside, and encoding gene and application thereof | |
CN112322592B (en) | CYP76B100 protein involved in alkannin biosynthesis, coding gene thereof and application thereof | |
CN108866142B (en) | Cytochrome P450, nicotinamide adenine dinucleotide-cytochrome P450 reductase and application thereof | |
CN113046333B (en) | Flavone-8-hydroxylase and application thereof in synthesis of 8-hydroxyflavone compound | |
CN113444703B (en) | Glycosyltransferase mutant for catalyzing sugar chain extension and application thereof | |
CN118048335A (en) | PbAT1 protein involved in cinnamomum cassia leaf glycoside A biosynthesis, encoding gene and application thereof | |
CN116987682A (en) | O-methyltransferase for catalyzing xanthotoxin to generate xanthotoxin, and coding gene and application thereof | |
CN114045270B (en) | Method for producing ginsenoside Rg1 and special engineering bacterium thereof | |
CN116987683A (en) | O-methyltransferase for catalyzing Parsley phenol to generate osthole, and encoding gene and application thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination |