WO2024120148A1 - Nouvelle diterpène synthase et son utilisation - Google Patents
Nouvelle diterpène synthase et son utilisation Download PDFInfo
- Publication number
- WO2024120148A1 WO2024120148A1 PCT/CN2023/132096 CN2023132096W WO2024120148A1 WO 2024120148 A1 WO2024120148 A1 WO 2024120148A1 CN 2023132096 W CN2023132096 W CN 2023132096W WO 2024120148 A1 WO2024120148 A1 WO 2024120148A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- nucleic acid
- sequence
- diterpene
- ent
- cyclase
- Prior art date
Links
- 101710118490 Copalyl diphosphate synthase Proteins 0.000 title description 2
- 101710174833 Tuberculosinyl adenosine transferase Proteins 0.000 title description 2
- 101001009859 Herpetosiphon aurantiacus (strain ATCC 23779 / DSM 785 / 114-95) (+)-kolavenyl diphosphate synthase Proteins 0.000 claims abstract description 68
- 230000000694 effects Effects 0.000 claims abstract description 13
- 150000007523 nucleic acids Chemical class 0.000 claims description 65
- 102000039446 nucleic acids Human genes 0.000 claims description 54
- 108020004707 nucleic acids Proteins 0.000 claims description 54
- 238000000034 method Methods 0.000 claims description 41
- 102000004190 Enzymes Human genes 0.000 claims description 40
- 108090000790 Enzymes Proteins 0.000 claims description 40
- JCAIWDXKLCEQEO-PGHZQYBFSA-N 5beta,9alpha,10alpha-labda-8(20),13-dien-15-yl diphosphate Chemical compound CC1(C)CCC[C@@]2(C)[C@H](CCC(/C)=C/COP(O)(=O)OP(O)(O)=O)C(=C)CC[C@@H]21 JCAIWDXKLCEQEO-PGHZQYBFSA-N 0.000 claims description 30
- 241000588724 Escherichia coli Species 0.000 claims description 30
- XEKOWRVHYACXOJ-UHFFFAOYSA-N Ethyl acetate Chemical group CCOC(C)=O XEKOWRVHYACXOJ-UHFFFAOYSA-N 0.000 claims description 30
- 150000004141 diterpene derivatives Chemical group 0.000 claims description 26
- 230000015572 biosynthetic process Effects 0.000 claims description 23
- 239000012634 fragment Substances 0.000 claims description 23
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 20
- NUHSROFQTUXZQQ-UHFFFAOYSA-N isopentenyl diphosphate Chemical compound CC(=C)CCO[P@](O)(=O)OP(O)(O)=O NUHSROFQTUXZQQ-UHFFFAOYSA-N 0.000 claims description 19
- XDSYKASBVOZOAG-LFGUQSLTSA-N ent-sandaracopimara-8(14),15-diene Chemical compound C1C[C@@](C)(C=C)C=C2CC[C@@H]3C(C)(C)CCC[C@@]3(C)[C@@H]21 XDSYKASBVOZOAG-LFGUQSLTSA-N 0.000 claims description 18
- 230000037361 pathway Effects 0.000 claims description 18
- 229920001184 polypeptide Polymers 0.000 claims description 18
- CBIDRCWHNCKSTO-UHFFFAOYSA-N prenyl diphosphate Chemical compound CC(C)=CCO[P@](O)(=O)OP(O)(O)=O CBIDRCWHNCKSTO-UHFFFAOYSA-N 0.000 claims description 18
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 18
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 18
- 239000013604 expression vector Substances 0.000 claims description 15
- 102000037865 fusion proteins Human genes 0.000 claims description 15
- 108020001507 fusion proteins Proteins 0.000 claims description 15
- ZSLZBFCDCINBPY-ZSJPKINUSA-N acetyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 ZSLZBFCDCINBPY-ZSJPKINUSA-N 0.000 claims description 14
- 150000001875 compounds Chemical class 0.000 claims description 14
- 239000013598 vector Substances 0.000 claims description 13
- 101000895629 Synechococcus sp. (strain ATCC 27264 / PCC 7002 / PR-6) Geranylgeranyl pyrophosphate synthase Proteins 0.000 claims description 12
- 101000699803 Talaromyces verruculosus Geranylgeranyl diphosphate synthase Proteins 0.000 claims description 10
- 101000757195 Zea mays Ent-copalyl diphosphate synthase AN1, chloroplastic Proteins 0.000 claims description 10
- XDSYKASBVOZOAG-UHFFFAOYSA-N pimaradiene Natural products C1CC(C)(C=C)C=C2CCC3C(C)(C)CCCC3(C)C21 XDSYKASBVOZOAG-UHFFFAOYSA-N 0.000 claims description 9
- BBPXZLJCPUPNGH-CMKODMSKSA-N (-)-Abietadiene Chemical compound CC1(C)CCC[C@]2(C)[C@@H](CCC(C(C)C)=C3)C3=CC[C@H]21 BBPXZLJCPUPNGH-CMKODMSKSA-N 0.000 claims description 8
- OINNEUNVOZHBOX-QIRCYJPOSA-N 2-trans,6-trans,10-trans-geranylgeranyl diphosphate Chemical compound CC(C)=CCC\C(C)=C\CC\C(C)=C\CC\C(C)=C\COP(O)(=O)OP(O)(O)=O OINNEUNVOZHBOX-QIRCYJPOSA-N 0.000 claims description 8
- BBPXZLJCPUPNGH-UHFFFAOYSA-N Abietadien Natural products CC1(C)CCCC2(C)C(CCC(C(C)C)=C3)C3=CCC21 BBPXZLJCPUPNGH-UHFFFAOYSA-N 0.000 claims description 8
- 108700040132 Mevalonate kinases Proteins 0.000 claims description 6
- 229930014912 abieta-7,13-diene Natural products 0.000 claims description 6
- QUUCYKKMFLJLFS-AZUAARDMSA-N abietatriene Chemical compound CC1(C)CCC[C@]2(C)C3=CC=C(C(C)C)C=C3CC[C@H]21 QUUCYKKMFLJLFS-AZUAARDMSA-N 0.000 claims description 6
- 239000013599 cloning vector Substances 0.000 claims description 6
- 238000012258 culturing Methods 0.000 claims description 6
- 239000003960 organic solvent Substances 0.000 claims description 6
- 108091000116 phosphomevalonate kinase Proteins 0.000 claims description 6
- 101710158485 3-hydroxy-3-methylglutaryl-coenzyme A reductase Proteins 0.000 claims description 5
- 102100022524 Alpha-1-antichymotrypsin Human genes 0.000 claims description 5
- 101150056978 HMGS gene Proteins 0.000 claims description 5
- 101000678026 Homo sapiens Alpha-1-antichymotrypsin Proteins 0.000 claims description 5
- 101000958922 Homo sapiens Diphosphomevalonate decarboxylase Proteins 0.000 claims description 5
- KEQXEEMBFONZBL-AZUAARDMSA-N Palustradiene Chemical compound C([C@@]12C)CCC(C)(C)[C@@H]1CCC1=C2CCC(C(C)C)=C1 KEQXEEMBFONZBL-AZUAARDMSA-N 0.000 claims description 5
- 108010076504 Protein Sorting Signals Proteins 0.000 claims description 5
- 101100011891 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) ERG13 gene Proteins 0.000 claims description 5
- 101150091791 mvk gene Proteins 0.000 claims description 5
- KEQXEEMBFONZBL-UHFFFAOYSA-N palustradiene Natural products CC12CCCC(C)(C)C1CCC1=C2CCC(C(C)C)=C1 KEQXEEMBFONZBL-UHFFFAOYSA-N 0.000 claims description 5
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 4
- QUUCYKKMFLJLFS-UHFFFAOYSA-N Dehydroabietan Natural products CC1(C)CCCC2(C)C3=CC=C(C(C)C)C=C3CCC21 QUUCYKKMFLJLFS-UHFFFAOYSA-N 0.000 claims description 4
- 238000003119 immunoblot Methods 0.000 claims description 4
- 238000000746 purification Methods 0.000 claims description 4
- 238000001291 vacuum drying Methods 0.000 claims description 4
- 238000006555 catalytic reaction Methods 0.000 claims description 3
- 230000000295 complement effect Effects 0.000 claims description 3
- 230000014509 gene expression Effects 0.000 claims description 3
- 230000010354 integration Effects 0.000 claims description 3
- 230000009465 prokaryotic expression Effects 0.000 claims description 3
- 239000013603 viral vector Substances 0.000 claims description 3
- 241001646826 Isodon rubescens Species 0.000 abstract description 11
- 101710090965 Class I diterpene synthase Proteins 0.000 abstract 1
- 210000004027 cell Anatomy 0.000 description 50
- 238000003786 synthesis reaction Methods 0.000 description 22
- 108090000623 proteins and genes Proteins 0.000 description 19
- 229930004069 diterpene Natural products 0.000 description 17
- 239000013612 plasmid Substances 0.000 description 16
- 108020004414 DNA Proteins 0.000 description 13
- 238000002290 gas chromatography-mass spectrometry Methods 0.000 description 13
- 239000013615 primer Substances 0.000 description 13
- KJTLQQUUPVSXIM-ZCFIWIBFSA-N (R)-mevalonic acid Chemical compound OCC[C@](O)(C)CC(O)=O KJTLQQUUPVSXIM-ZCFIWIBFSA-N 0.000 description 12
- KJTLQQUUPVSXIM-UHFFFAOYSA-N DL-mevalonic acid Natural products OCCC(O)(C)CC(O)=O KJTLQQUUPVSXIM-UHFFFAOYSA-N 0.000 description 12
- 238000001228 spectrum Methods 0.000 description 12
- 102000004169 proteins and genes Human genes 0.000 description 11
- 108020004705 Codon Proteins 0.000 description 8
- 238000005481 NMR spectroscopy Methods 0.000 description 8
- 238000010367 cloning Methods 0.000 description 8
- 238000000855 fermentation Methods 0.000 description 8
- 230000004151 fermentation Effects 0.000 description 8
- 125000003275 alpha amino acid group Chemical group 0.000 description 7
- 238000004458 analytical method Methods 0.000 description 7
- 108091033319 polynucleotide Proteins 0.000 description 7
- 102000040430 polynucleotide Human genes 0.000 description 7
- 239000002157 polynucleotide Substances 0.000 description 7
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 6
- ONVABDHFQKWOSV-UHFFFAOYSA-N 16-Phyllocladene Natural products C1CC(C2)C(=C)CC32CCC2C(C)(C)CCCC2(C)C31 ONVABDHFQKWOSV-UHFFFAOYSA-N 0.000 description 6
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 6
- 241000191967 Staphylococcus aureus Species 0.000 description 6
- 230000004071 biological effect Effects 0.000 description 6
- 229910052799 carbon Inorganic materials 0.000 description 6
- 239000002609 medium Substances 0.000 description 6
- 239000002243 precursor Substances 0.000 description 6
- 239000000126 substance Substances 0.000 description 6
- 239000000758 substrate Substances 0.000 description 6
- 241000196324 Embryophyta Species 0.000 description 5
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 5
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 5
- UFHFLCQGNIYNRP-UHFFFAOYSA-N Hydrogen Chemical compound [H][H] UFHFLCQGNIYNRP-UHFFFAOYSA-N 0.000 description 5
- 102100024279 Phosphomevalonate kinase Human genes 0.000 description 5
- 238000007796 conventional method Methods 0.000 description 5
- ONVABDHFQKWOSV-HPUSYDDDSA-N ent-kaur-16-ene Chemical compound C1C[C@H](C2)C(=C)C[C@@]32CC[C@@H]2C(C)(C)CCC[C@@]2(C)[C@@H]31 ONVABDHFQKWOSV-HPUSYDDDSA-N 0.000 description 5
- 238000004817 gas chromatography Methods 0.000 description 5
- 239000008103 glucose Substances 0.000 description 5
- 239000001257 hydrogen Substances 0.000 description 5
- 229910052739 hydrogen Inorganic materials 0.000 description 5
- 238000003752 polymerase chain reaction Methods 0.000 description 5
- -1 tanshinone diene Chemical class 0.000 description 5
- MAKBWIUHFAVVJP-HAXARLPTSA-N (2R,3S)-pentane-1,2,3,4-tetrol phosphoric acid Chemical compound OP(O)(O)=O.CC(O)[C@H](O)[C@H](O)CO MAKBWIUHFAVVJP-HAXARLPTSA-N 0.000 description 4
- 102100038390 Diphosphomevalonate decarboxylase Human genes 0.000 description 4
- 241000194032 Enterococcus faecalis Species 0.000 description 4
- 102100022259 Mevalonate kinase Human genes 0.000 description 4
- 238000012408 PCR amplification Methods 0.000 description 4
- HEDRZPFGACZZDS-MICDWDOJSA-N Trichloro(2H)methane Chemical compound [2H]C(Cl)(Cl)Cl HEDRZPFGACZZDS-MICDWDOJSA-N 0.000 description 4
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 4
- 239000003153 chemical reaction reagent Substances 0.000 description 4
- 238000010276 construction Methods 0.000 description 4
- 229940032049 enterococcus faecalis Drugs 0.000 description 4
- 239000000543 intermediate Substances 0.000 description 4
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 4
- 238000004519 manufacturing process Methods 0.000 description 4
- VLKZOEOYAKHREP-UHFFFAOYSA-N n-Hexane Chemical compound CCCCCC VLKZOEOYAKHREP-UHFFFAOYSA-N 0.000 description 4
- 239000012044 organic layer Substances 0.000 description 4
- 241000894006 Bacteria Species 0.000 description 3
- 102000053602 DNA Human genes 0.000 description 3
- 229930000074 abietane Natural products 0.000 description 3
- 239000002299 complementary DNA Substances 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- XPPKVPWEQAFLFU-UHFFFAOYSA-J diphosphate(4-) Chemical compound [O-]P([O-])(=O)OP([O-])([O-])=O XPPKVPWEQAFLFU-UHFFFAOYSA-J 0.000 description 3
- 235000011180 diphosphates Nutrition 0.000 description 3
- ONVABDHFQKWOSV-YQXATGRUSA-N ent-Kaur-16-ene Natural products C1C[C@@H](C2)C(=C)C[C@@]32CC[C@@H]2C(C)(C)CCC[C@@]2(C)[C@@H]31 ONVABDHFQKWOSV-YQXATGRUSA-N 0.000 description 3
- UIXMIBNGPQGJJJ-UHFFFAOYSA-N ent-kaurene Natural products CC1CC23CCC4C(CCCC4(C)C)C2CCC1C3 UIXMIBNGPQGJJJ-UHFFFAOYSA-N 0.000 description 3
- 238000009396 hybridization Methods 0.000 description 3
- 238000002360 preparation method Methods 0.000 description 3
- 238000010898 silica gel chromatography Methods 0.000 description 3
- 241000894007 species Species 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- OINNEUNVOZHBOX-QIRCYJPOSA-K 2-trans,6-trans,10-trans-geranylgeranyl diphosphate(3-) Chemical compound CC(C)=CCC\C(C)=C\CC\C(C)=C\CC\C(C)=C\COP([O-])(=O)OP([O-])([O-])=O OINNEUNVOZHBOX-QIRCYJPOSA-K 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 244000063299 Bacillus subtilis Species 0.000 description 2
- 235000014469 Bacillus subtilis Nutrition 0.000 description 2
- 108091033380 Coding strand Proteins 0.000 description 2
- 241001302160 Escherichia coli str. K-12 substr. DH10B Species 0.000 description 2
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 2
- 241000233866 Fungi Species 0.000 description 2
- 108091092584 GDNA Proteins 0.000 description 2
- OINNEUNVOZHBOX-XBQSVVNOSA-N Geranylgeranyl diphosphate Natural products [P@](=O)(OP(=O)(O)O)(OC/C=C(\CC/C=C(\CC/C=C(\CC/C=C(\C)/C)/C)/C)/C)O OINNEUNVOZHBOX-XBQSVVNOSA-N 0.000 description 2
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
- 238000010806 PrimeScriptTM RT Reagent kit Methods 0.000 description 2
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 2
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 2
- 101000896804 Salvia miltiorrhiza Copalyl diphosphate synthase CPS1, chloroplastic Proteins 0.000 description 2
- 229930014549 abietadiene Natural products 0.000 description 2
- STIVVCHBLMGYSL-ZYNAIFEFSA-N abietane Chemical compound CC1(C)CCC[C@]2(C)[C@H]3CC[C@H](C(C)C)C[C@@H]3CC[C@H]21 STIVVCHBLMGYSL-ZYNAIFEFSA-N 0.000 description 2
- 238000007792 addition Methods 0.000 description 2
- 150000001413 amino acids Chemical class 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- 238000004113 cell culture Methods 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 239000000287 crude extract Substances 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- 238000012215 gene cloning Methods 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- 230000006698 induction Effects 0.000 description 2
- 238000002347 injection Methods 0.000 description 2
- 239000007924 injection Substances 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- VCOVNILQQQZROK-QGZVKYPTSA-N (13S)-pimara-7,15-diene Chemical compound C1C[C@](C)(C=C)CC2=CC[C@H]3C(C)(C)CCC[C@]3(C)[C@H]21 VCOVNILQQQZROK-QGZVKYPTSA-N 0.000 description 1
- IYDAPILQPCDHTO-UHFFFAOYSA-N 8beta-15-Pimaren-8-ol Natural products CC1(C)CCCC2(C)C1CCC3(O)CC(C)(CCC23)C=C IYDAPILQPCDHTO-UHFFFAOYSA-N 0.000 description 1
- BQACOLQNOUYJCE-FYZZASKESA-N Abietic acid Natural products CC(C)C1=CC2=CC[C@]3(C)[C@](C)(CCC[C@@]3(C)C(=O)O)[C@H]2CC1 BQACOLQNOUYJCE-FYZZASKESA-N 0.000 description 1
- RSWGJHLUYNHPMX-UHFFFAOYSA-N Abietic-Saeure Natural products C12CCC(C(C)C)=CC2=CCC2C1(C)CCCC2(C)C(O)=O RSWGJHLUYNHPMX-UHFFFAOYSA-N 0.000 description 1
- 102000005345 Acetyl-CoA C-acetyltransferase Human genes 0.000 description 1
- 108010006229 Acetyl-CoA C-acetyltransferase Proteins 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 244000025254 Cannabis sativa Species 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 239000003155 DNA primer Substances 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 102000057412 Diphosphomevalonate decarboxylases Human genes 0.000 description 1
- 229920001917 Ficoll Polymers 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 235000021506 Ipomoea Nutrition 0.000 description 1
- 241000207783 Ipomoea Species 0.000 description 1
- 241001183967 Isodon Species 0.000 description 1
- 241000207923 Lamiaceae Species 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 102000008109 Mixed Function Oxygenases Human genes 0.000 description 1
- 108010074633 Mixed Function Oxygenases Proteins 0.000 description 1
- 101000958834 Neosartorya fumigata (strain ATCC MYA-4609 / Af293 / CBS 101355 / FGSC A1100) Diphosphomevalonate decarboxylase mvd1 Proteins 0.000 description 1
- 102000004316 Oxidoreductases Human genes 0.000 description 1
- 108090000854 Oxidoreductases Proteins 0.000 description 1
- 101000958925 Panax ginseng Diphosphomevalonate decarboxylase 1 Proteins 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 108020005091 Replication Origin Proteins 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 241000304195 Salvia miltiorrhiza Species 0.000 description 1
- 235000011135 Salvia miltiorrhiza Nutrition 0.000 description 1
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 229930183118 Tanshinone Natural products 0.000 description 1
- 239000002250 absorbent Substances 0.000 description 1
- 230000002745 absorbent Effects 0.000 description 1
- 238000005377 adsorption chromatography Methods 0.000 description 1
- 229930013930 alkaloid Natural products 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 230000003698 anagen phase Effects 0.000 description 1
- 230000000202 analgesic effect Effects 0.000 description 1
- 230000000844 anti-bacterial effect Effects 0.000 description 1
- 230000000259 anti-tumor effect Effects 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 102000036639 antigens Human genes 0.000 description 1
- 108091007433 antigens Proteins 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 230000001851 biosynthetic effect Effects 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 235000011148 calcium chloride Nutrition 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 244000309466 calf Species 0.000 description 1
- 239000012159 carrier gas Substances 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 238000000975 co-precipitation Methods 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 239000003398 denaturant Substances 0.000 description 1
- 239000011903 deuterated solvents Substances 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- UYNPPIDGSVPVSW-UHFFFAOYSA-N ent-kaurane Natural products CC1(O)CC23CCC4C(CCCC4(C)C(=O)O)C2C=CC1C3 UYNPPIDGSVPVSW-UHFFFAOYSA-N 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 229930003935 flavonoid Natural products 0.000 description 1
- 150000002215 flavonoids Chemical class 0.000 description 1
- 235000017173 flavonoids Nutrition 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 230000014726 immortalization of host cell Effects 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- VCOVNILQQQZROK-UHFFFAOYSA-N isopimaradiene Natural products C1CC(C)(C=C)CC2=CCC3C(C)(C)CCCC3(C)C21 VCOVNILQQQZROK-UHFFFAOYSA-N 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 238000004811 liquid chromatography Methods 0.000 description 1
- 229910001629 magnesium chloride Inorganic materials 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 238000001819 mass spectrum Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000010297 mechanical methods and process Methods 0.000 description 1
- 102000002678 mevalonate kinase Human genes 0.000 description 1
- JFMNRCSZAUKVCF-YBAYKBPUSA-N mevalonyl-CoA Chemical compound CC(O)(CCO)CC(=O)SCCNC(=O)CCNC(=O)[C@H](O)C(C)(C)COP(O)(=O)OP(O)(=O)OC[C@H]1O[C@H]([C@H](O)[C@@H]1OP(O)(O)=O)n1cnc2c(N)ncnc12 JFMNRCSZAUKVCF-YBAYKBPUSA-N 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 238000005065 mining Methods 0.000 description 1
- 239000002808 molecular sieve Substances 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- IYDAPILQPCDHTO-HHUCQEJWSA-N nezukol Chemical compound CC1(C)CCC[C@@]2(C)[C@H]1CC[C@@]1(O)C[C@](C)(CC[C@H]21)C=C IYDAPILQPCDHTO-HHUCQEJWSA-N 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 238000000655 nuclear magnetic resonance spectrum Methods 0.000 description 1
- 239000002773 nucleotide Substances 0.000 description 1
- 125000003729 nucleotide group Chemical group 0.000 description 1
- 210000003463 organelle Anatomy 0.000 description 1
- 230000003204 osmotic effect Effects 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 230000027086 plasmid maintenance Effects 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 230000019525 primary metabolic process Effects 0.000 description 1
- 230000035755 proliferation Effects 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 238000004153 renaturation Methods 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 238000010839 reverse transcription Methods 0.000 description 1
- 230000000630 rising effect Effects 0.000 description 1
- 238000002390 rotary evaporation Methods 0.000 description 1
- 238000005185 salting out Methods 0.000 description 1
- 238000007480 sanger sequencing Methods 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 239000000741 silica gel Substances 0.000 description 1
- 229910002027 silica gel Inorganic materials 0.000 description 1
- URGAHOPLAPQHLN-UHFFFAOYSA-N sodium aluminosilicate Chemical compound [Na+].[Al+3].[O-][Si]([O-])=O.[O-][Si]([O-])=O URGAHOPLAPQHLN-UHFFFAOYSA-N 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 230000001954 sterilising effect Effects 0.000 description 1
- 238000004659 sterilization and disinfection Methods 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 239000002352 surface water Substances 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- AIGAZQPHXLWMOJ-UHFFFAOYSA-N tanshinone IIA Natural products C1=CC2=C(C)C=CC=C2C(C(=O)C2=O)=C1C1=C2C(C)=CO1 AIGAZQPHXLWMOJ-UHFFFAOYSA-N 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 238000003151 transfection method Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 150000005671 trienes Chemical class 0.000 description 1
- 150000003648 triterpenes Chemical class 0.000 description 1
- 238000005199 ultracentrifugation Methods 0.000 description 1
- 238000000870 ultraviolet spectroscopy Methods 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Definitions
- the present invention belongs to the field of biotechnology, and more specifically, the present invention relates to a group of novel diterpene synthases derived from Rubescens rubescens, which can specifically synthesize ent-sandaracopimaradiene, palustradiene, abieta-7,13-diene or abietatriene skeletons.
- Isodon rubescens also known as ice ling grass and ice ling flower, is a perennial herbaceous plant of the Labiatae family. The whole plant is used as medicine, which has good heat-clearing and detoxifying, blood-activating and analgesic, antibacterial and anti-tumor effects, and has great comprehensive development and utilization value.
- the chemical components of Isodon rubescens are mainly diterpenoids, and triterpenoids, flavonoids and alkaloids are also reported.
- the diterpenoids of Isodon rubescens include ent-kaurene type, abietane type, atane type and hemi-florane type.
- Isodon rubescens In previous studies, several type I and type II diterpene cyclases have been reported in Isodon rubescens, involving the synthesis of skeletons such as ent-kaurene, tanshinone diene, nezukol, and isopimaradiene. In addition to the above diterpene skeletons, there are still a large number of diterpene cyclases in Isodon rubescens that need to be identified. By using these new diterpene cyclases, Isodon rubescens can have the potential to synthesize more diverse diterpene skeletons.
- the present invention first provides a diterpene cyclase, which is a type II diterpene cyclase derived from the genus Isodon (preferably Isodon rubescens).
- the diterpene cyclase is N-(2-aminoethyl)-2-aminoethyl-N-(2-aminoethyl)-2-aminoethyl-N-(2-aminoethyl)-2-aminoethyl-N-(2-aminoethyl)-2-aminoethyl-N-(2-aminoethyl)-2-ase cyclase
- the present invention also provides a fusion protein having the amino acid sequence of the diterpene cyclase of the present invention and other polypeptides.
- the other polypeptides are located at the N-terminus and/or C-terminus of the diterpene cyclase.
- the other polypeptides include a signal peptide, a tag for purification, or a tag for immunoblotting.
- the first aspect of the present invention also provides a nucleic acid molecule comprising a sequence selected from the following:
- nucleic acid sequence encoding a diterpene cyclase or a fusion protein as described herein;
- the fragment is a primer.
- nucleic acid sequence is codon-optimized for E. coli.
- the nucleic acid sequence is DNA or RNA.
- the nucleic acid sequence is as shown in SEQ ID NO: 2, 4, 6 or 8.
- Another aspect of the present invention is to provide a nucleic acid construct comprising the nucleic acid molecule described herein.
- the nucleic acid construct is a vector, such as a cloning vector, an integration vector, or an expression vector.
- the nucleic acid molecule is operably linked to an expression control sequence.
- the expression vector is selected from a prokaryotic expression vector, a eukaryotic expression vector, and a viral vector.
- the nucleic acid construct further contains any one or more of the coding sequences of GGPPS, ent-CPS, and CPS.
- the present invention also provides a host cell, wherein:
- nucleic acid molecule comprising a nucleic acid molecule and/or a nucleic acid construct as described in any embodiment of the present invention.
- the host cell is an E. coli cell.
- the E. coli is a B-lineage E. coli; more preferably, it is E. coli BL21 (DE3).
- the host cell also expresses an enzyme that catalyzes the production of IPP or DMAPP from acetyl-CoA, or contains a nucleic acid construct encoding the enzyme.
- the host cell further expresses an enzyme of the MVA pathway, or contains a nucleic acid construct encoding the enzyme.
- the protein of the MVA pathway includes one or more selected from the following: MVD, PMK, MVK, HMGR, HMGS, AACT, preferably one or more selected from the following: AtoB, MvaS, MvaE, Mvk1, Mvk2, MvaD and Fni.
- the host cell also expresses an enzyme of the MEP pathway, or contains a nucleic acid construct encoding the enzyme.
- the present invention also provides a method for catalyzing copalyl pyrophosphate (CPP) or enantio-copalyl pyrophosphate (ent-CPP) to generate a product, comprising:
- diterpene cyclase or a fusion protein containing the diterpene cyclase to catalyze the enantiomeric CPP, wherein the diterpene cyclase has a sequence as shown in SEQ ID NO: 1 or a sequence having at least 70% identity thereto and retaining diterpene cyclase activity,
- the product is a diterpene core skeleton compound.
- the product of (1) is ent-sandamaradiene.
- the product of (2) is parrustadiene, abietadiene-7,13-diene or abietadiene.
- the method further comprises:
- a step of catalyzing the formation of ent-copalyl pyrophosphate or copalyl pyrophosphate from (E,E,E)-geranylgeranyl pyrophosphate preferably using ent-CPS or CPS catalysis, and/or
- step of catalyzing IPP or DMAPP to generate (E,E,E)-geranylgeranyl pyrophosphate which step is preferably catalyzed by GGPPS, and/or
- the step of catalyzing acetyl-CoA to produce IPP or DMAPP is preferably catalyzed by one or more enzymes selected from the group consisting of MVD, PMK, MVK, HMGR, HMGS, and AACT.
- the method comprises the step of culturing the host cell described in any of the embodiments herein under conditions suitable for catalyzing copalyl pyrophosphate or ent-copalyl pyrophosphate to produce a product.
- the conditions include TB medium.
- the TB medium contains an initial carbon source, preferably glucose; preferably, the concentration of glucose is 2%.
- the culturing temperature is 20-30°C, preferably 28°C.
- the culturing is for at least 24 hours, preferably at least 96 hours.
- the conditions include an inducing agent, preferably IPTG; preferably, the concentration of IPTG is at least 0.05 mM.
- the method further comprises the step of isolating ent-sandamaradiene, palustadiene, abietriene-7,13-diene and abietriene from the host cells; specifically comprising: crushing the cells, extracting with an organic solvent and vacuum drying, and the organic solvent is preferably ethyl acetate.
- FIG. 1 Biosynthetic pathway of ent-sandamaradiene.
- Module I precursor synthesis
- MVA mevalonic acid pathway
- MEP intrinsic methylerythritol phosphate pathway
- module II diterpene nucleus synthesis
- GGPPS GGPPS
- ent-CPS ent-CPS
- IrubKSL4 synthesize ent-sandamaradiene using IPP and DMAPP as substrates.
- Figure 2 Gas chromatography analysis confirms that strain sIrubDiT1 is capable of producing ent-sandaracopimaradiene.
- Figure 3 Nuclear magnetic resonance spectroscopy (A, hydrogen spectrum; B, carbon spectrum) results and gas chromatography-mass spectrometry (C) detection, confirming that the produced compound is ent-sandamaradiene.
- FIG. 4 Biosynthetic pathways of parrustadiene, abiet-7,13-diene and abiettriene.
- Module I precursor synthesis
- MVA mevalonate pathway
- MEP intrinsic methylerythritol phosphate
- IPP and DMAPP is used as substrate
- GGPPS and CPS synthesize the intermediate CPP
- IrubKSL7, IrubKSL8 and IrubKSL9 catalyze CPP to produce palustradiene, abieta-7,13-diene and abietatriene, respectively.
- Figure 5 Gas chromatography analysis confirmed that strains sIrubDiT2, sIrubDiT3, and sIrubDiT4 were able to produce parrustadiene, abietriene-7,13-diene, and abietriene, respectively.
- Figure 7 Nuclear magnetic resonance spectroscopy (A, hydrogen spectrum; B, carbon spectrum) results and gas chromatography-mass spectrometry (C) detection confirmed that the compound produced by sIrubDiT3 is arosin-7,13-diene.
- Figure 8 Nuclear magnetic resonance spectroscopy (A, hydrogen spectrum; B, carbon spectrum) results and gas chromatography-mass spectrometry (C) detection confirmed that the compound produced by sIrubDiT4 is abietriene.
- the present invention relates to a group of novel diterpene cyclases encoded by nucleotide sequences derived from Isodon rubescens.
- the inventors cloned and heterologously expressed kaurene synthase-like (KSL) diterpene cyclase encoding genes IrubKSL4, IrubKSL7, IrubKSL8 and IrubKSL9 in Escherichia coli BL21 (DE3) by mining the genome and transcriptome information of Isodon rubescens.
- KSL kaurene synthase-like
- the generated recombinant proteins can specifically cyclize the diterpene intermediates ent-copalyl pyrophosphate or copalyl pyrophosphate, thereby producing ent-sandaracopimaradiene, palustradiene, abieta-7,13-diene or abietatriene skeletons.
- the present invention first provides a group of novel diterpene cyclases, named IrubKSL4, IrubKSL7, IrubKSL8 and IrubKSL9 derived from Isodon rubescens, and their amino acid sequences are shown in SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 and SEQ ID NO: 7, respectively.
- the present invention also includes SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 and SEQ ID NO: 7 has at least 80% (e.g., at least 90%, 96%, at least 98%, at least 99%) sequence identity while retaining its biological activity.
- the "biological activity" of a diterpene cyclase generally refers to its ability to specifically catalyze the synthesis of enantio-copalyl pyrophosphate or copalyl pyrophosphate into a diterpene nucleus skeleton compound.
- Exemplary such variants include biologically active fragments of the enzyme and variants of the enzyme or its biologically active fragment.
- the meaning of a biologically active fragment of an enzyme refers to a polypeptide that still retains all or part of the functions of a full-length enzyme or protein. Typically, the biologically active fragment retains at least 98% or 99% activity.
- “diterpene core skeleton compounds” mainly include ent-pimaradiene diterpene compounds, abietane diterpene compounds, and ent-kaurane diterpene compounds.
- ent-pimaradiene includes ent-sandamaradiene
- abietane includes palustadiene, abietane-7,13-diene, and abietriene.
- a mutant of the enzyme or its biologically active fragment refers to an amino acid sequence formed by substitution, deletion or addition of one or more amino acid residues.
- Appropriate replacement of amino acids is a technique well known in the art, which can be easily implemented and ensures that the biological activity of the resulting molecule is not changed. These techniques have made those skilled in the art realize that, in general, changing a single amino acid in a non-essential region of a polypeptide will not substantially change the biological activity.
- the present invention includes a protein or enzyme whose amino acid sequence has at least 98%, at least 99% sequence identity with the enzyme, while retaining the biological activity of the enzyme.
- the variant is from the same or similar source (such as the same plant), such as the diterpene cyclase is from Rubescens rubescens, so its variant is preferably also from Rubescens rubescens or its co-genus, the genus Ipomoea.
- the present invention also provides a fusion polypeptide comprising the diterpene cyclase described herein and other polypeptides.
- polypeptides include polypeptides that localize the diterpene cyclase to different organelles or sub-organelles, tags for purification, or tags for immunoblotting.
- the present invention also provides a polynucleotide encoding a diterpene cyclase or a variant thereof as described herein.
- the polynucleotide of the present invention may be in the form of DNA or RNA.
- the DNA form includes cDNA, genomic DNA or artificially synthesized DNA.
- the DNA may be single-stranded or double-stranded.
- the DNA may be a coding strand or a non-coding strand.
- nucleic acids As will be appreciated by those skilled in the art, due to the degeneracy of the genetic code, a very large number of nucleic acids can be produced. They all encode the antibodies or antigen-binding fragments thereof of the present invention. Therefore, in the case where a specific amino acid sequence has been identified, a person skilled in the art can make any number of different nucleic acids by simply modifying the sequence of one or more codons in a manner that does not change the amino acid sequence of the encoded protein. For example, a nucleic acid sequence is optimized using a species (e.g., E. coli) preferred codon to make the sequence more easily expressed in the species, such as SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 and SEQ ID NO: 7. The E.
- E. coli preferred codon to make the sequence more easily expressed in the species
- the present invention also relates to polynucleotides that hybridize with the above-mentioned polynucleotide sequences and have at least 50%, preferably at least 70%, and more preferably at least 80% identity between the two sequences.
- the present invention particularly relates to polynucleotides that can hybridize with the polynucleotides of the present invention under stringent conditions.
- stringent conditions refer to: (1) hybridization and elution at relatively low ionic strength and relatively high temperature, such as 0.2 ⁇ SSC, 0.1% SDS, 60°C; or (2) addition of denaturants during hybridization, such as 50% (v/v) formamide, 0.1% calf serum/0.1% Ficoll, 42°C, etc.; or (3) hybridization occurs only when the identity between the two sequences is at least 90%, preferably at least 95%.
- the polypeptide encoded by the hybridizable polynucleotide has the same biological function and activity as the mature polypeptide.
- the full-length nucleotide sequence of the protein or enzyme of the present invention or its fragment can usually be obtained by PCR amplification, recombination or artificial synthesis.
- a feasible method is to synthesize the relevant sequence by artificial synthesis, especially when the fragment length is short.
- a fragment with a very long sequence can be obtained by first synthesizing multiple small fragments and then connecting them.
- the relevant sequence can be obtained in large quantities by recombinant methods. This is usually done by cloning it into a vector, then transferring it into cells, and then isolating the relevant sequence from the host cells after proliferation by conventional methods.
- the biomolecules (nucleic acids, proteins, etc.) involved in the present invention include biomolecules in isolated form.
- the DNA sequence encoding the protein of the present invention (or its fragment, or its derivative) can be obtained completely by chemical synthesis. The DNA sequence can then be introduced into various existing DNA molecules (or vectors) and cells known in the art. In addition, mutations can also be introduced into the protein sequence of the present invention by chemical synthesis.
- the present invention also relates to a nucleic acid construct comprising the above-mentioned appropriate DNA sequence and an appropriate promoter or control sequence
- the nucleic acid construct usually carries an extrachromosomal element of a gene that is not part of the central metabolism of the cell, and is often in the form of a circular double-stranded DNA molecule.
- Such elements may be autonomously replicating sequences, genome-integrating sequences, phage or nucleotide sequences, linear or circular sequences obtained from any source.
- nucleic acid constructs include expression vectors and recombinant vectors. These vectors can be used to transform appropriate host cells to enable them to express proteins.
- the vector generally contains sequences for plasmid maintenance and for cloning and expressing exogenous nucleotide sequences.
- the sequence generally includes one or more of the following nucleotide sequences: promoter, one or more enhancer sequences, replication origin, transcription termination sequence, complete intron sequence containing donor and acceptor splice sites, sequence encoding leader sequence for polypeptide secretion, ribosome binding site, polyadenylation sequence, multiple linker regions and selectable marker elements for inserting nucleic acids encoding antibodies to be expressed.
- exemplary nucleic acid constructs include pET21a.
- the nucleic acid construct may also contain any one or more of the coding sequences of GGPPS, ent-CPS, and CPS.
- Transformation of host cells with recombinant DNA can be carried out using conventional techniques well known to those skilled in the art.
- the host is a prokaryotic organism such as Escherichia coli
- competent cells that can absorb DNA can be harvested after the exponential growth phase and treated with the CaCl2 method, the steps used are well known in the art. Another method is to use MgCl2 . If necessary, transformation can also be carried out by electroporation.
- the following DNA transfection methods can be selected: calcium phosphate coprecipitation method, conventional mechanical methods such as microinjection, electroporation, liposome packaging, etc.
- the obtained transformant can be cultured by conventional methods to express the polypeptide encoded by the gene of the present invention.
- the culture medium used in the culture can be selected from various conventional culture media (e.g., LB or TB supplemented with glucose). Culture is carried out under conditions suitable for the growth of the host cells (e.g., 37° C.). When the host cells grow to an appropriate cell density, the selected promoter is induced by a suitable method (e.g., temperature conversion or chemical induction), and the cells are cultured for a period of time (e.g., 28° C., more than 96 hours).
- a suitable method e.g., temperature conversion or chemical induction
- the recombinant polypeptide in the above method can be expressed in the cell, on the cell membrane, or secreted outside the cell. If necessary, the recombinant protein can be separated and purified by various separation methods using its physical, chemical and other properties. These methods are well known to those skilled in the art. Examples of these methods include but are not limited to: conventional renaturation treatment, treatment with protein precipitants (salting out method), centrifugation, osmotic sterilization, ultra-treatment, ultracentrifugation, molecular sieve chromatography (gel filtration), adsorption chromatography, ion exchange chromatography, high performance liquid chromatography (HPLC) and other various liquid chromatography techniques and combinations of these methods.
- the protein or enzyme of the sequence of the present invention can be expressed in a heterologous host cell, such as a bacterial cell, a fungal cell, such as a yeast cell, a mammalian cell, an insect cell and a plant cell.
- a heterologous host cell for expressing the nucleic acid molecule of the present invention can be a microbial host present in a fungus or bacterial family and grown in a wide temperature, pH value and solvent tolerance range.
- any bacterium, yeast and filamentous fungi can be a suitable host for expressing the nucleic acid molecule of the present invention.
- the host cell is an Escherichia coli, such as a B-series Escherichia coli, preferably an Escherichia coli BL21 (DE3).
- Escherichia coli such as a B-series Escherichia coli, preferably an Escherichia coli BL21 (DE3).
- these novel diterpene cyclases are applied to an artificially constructed recombinant Escherichia coli system to produce enantio-sandarama-piperadiene, palustadiene, abietic acid-7,13-diene and abietic acid triene by fermentation engineering.
- a host cell which: (1) expresses a diterpene cyclase or a variant thereof described herein, or (2) contains a nucleic acid molecule and/or a nucleic acid construct described herein.
- the host cell may also express other enzymes in the diterpene nucleus synthesis pathway, or contain the coding sequence of the enzyme or its expression vector.
- other enzymes include enzymes that catalyze IPP or DMAPP to generate enantio-copalyl pyrophosphate (ent-CPP) or copalyl pyrophosphate (CPP), including: pyrophosphate synthase (e.g., ent-CPS; preferably SrCPS) that catalyzes (E, E, E)-geranylgeranyl pyrophosphate (GGPP) to generate enantio-copalyl pyrophosphate (ent-CPP), pyrophosphate synthase (e.g., CPS; preferably SmCPS1) that catalyzes (E, E, E)-geranylgeranyl pyrophosphate (GGPP) to generate copalyl pyrophosphate (
- ent-CPP e.g., ent-
- the host cell may also express an enzyme that catalyzes acetyl-CoA to generate IPP or DMAPP, including: MVD, PMK, MVK, HMGR, HMGS, AACT, such as AtoB (WP_077475940.1, from Escherichia coli, acetyl-CoA acetyltransferase), MvaS (WP_002361740.1, from Enterococcus faecalis, meglutaryl-CoA synthetase), MvaE (WP_002361740.1, from Enterococcus faecalis, mevalonyl-CoA reductase), Mvk1 (WP_000197034.1, from Staphylococcus aureus, mevalonate kinase), Mvk2 (WP_000616885.1, from Staphylococcus aureus, phosphomevalonate kinase),
- Homologous sequences or variants of these enzymes in various species are easily obtained by those skilled in the art. See, for example: CN202210947431, CN202210948412, CN202210947464.
- Host cells can express these enzymes by incorporating their coding sequences into nucleic acid constructs (e.g., expression vectors) and introducing them into cells. Nucleic acid constructs are described elsewhere herein.
- the present invention also provides a method for catalyzing the production of a diterpene core skeleton compound, comprising the step of using the diterpene cyclase described herein in a catalytic system to catalyze enantio-copalyl pyrophosphate or copalyl pyrophosphate.
- the method comprises the step of using the diterpene cyclase shown in SEQ ID NO: 1 or a sequence having at least 70% identity thereto to catalyze the enantio-CPP to produce enantio-sandamaradiene.
- the method comprises the step of using the diterpene cyclase shown in SEQ ID NO: 3, 5, 7 or a sequence having at least 70% identity thereto to catalyze the CPP to produce parrustadiene, abietadiene-7,13-diene or abietadiene.
- the method comprises incubating a host cell expressing a diterpene cyclase or a variant thereof as described herein under conditions suitable for producing a diterpene core skeleton compound. For example, culturing in TB medium containing 2% glucose and 0.1 mM IPTG at 28°C for 96 hours. Thereafter, the diterpene core skeleton compound can be enriched from the cell or cell culture by conventional methods of the present invention, such as by crushing the cells, extracting with ethyl acetate and vacuum drying.
- a diterpene cyclase which is a type II diterpene cyclase derived from the genus Isodonis, wherein the diterpene cyclase:
- Item 2 A fusion protein comprising the diterpene cyclase described in Item 1 and other polypeptides,
- the other polypeptide includes a signal peptide, a tag for purification or a tag for immunoblotting.
- Item 3 A nucleic acid molecule comprising a sequence selected from the following:
- the fragment is a primer.
- Item 4 The nucleic acid molecule according to Item 3, wherein: (1) the nucleic acid sequence is codon-optimized for Escherichia coli;
- nucleic acid sequence is as shown in SEQ ID NO: 2, 4, 6 or 8.
- nucleic acid construct comprising the nucleic acid molecule described in Item 3 or 4,
- the nucleic acid construct is a vector, such as a cloning vector, an integration vector or an expression vector.
- the expression vector is selected from a prokaryotic expression vector, a eukaryotic expression vector and a viral vector.
- the nucleic acid molecule is operably linked to an expression control sequence, and/or
- the nucleic acid construct also contains any one or more of the coding sequences of GGPPS, ent-CPS and CPS.
- Item 7 A host cell, wherein:
- the host cell is an Escherichia coli cell; more preferably, the Escherichia coli is a B strain Escherichia coli,
- the host cell also expresses an enzyme that catalyzes acetyl-CoA to produce IPP or DMAPP, or contains a nucleic acid construct encoding the enzyme.
- the host cell also expresses an enzyme of the MVA pathway, or contains a nucleic acid construct encoding the enzyme.
- the host cell also expresses an enzyme of the MEP pathway, or contains a nucleic acid construct encoding the enzyme.
- Item 8 A method for catalyzing copalyl pyrophosphate or enantio-copalyl pyrophosphate to produce a product, comprising:
- diterpene cyclase or a fusion protein comprising the same to catalyze ent-copalyl pyrophosphate, wherein the diterpene cyclase has a sequence as shown in SEQ ID NO: 1 or has at least 70% identity thereto and Sequences retaining diterpene cyclase activity,
- the product is a diterpene core skeleton compound; more preferably, the product of (1) is ent-sandaracopimaradiene, and the product of (2) is palustradiene, abieta-7,13-diene or abietatriene.
- Item 9 The method according to Item 8, characterized in that the method further comprises:
- a step of catalyzing the formation of ent-copalyl pyrophosphate or copalyl pyrophosphate from (E,E,E)-geranylgeranyl pyrophosphate preferably using ent-CPS or CPS catalysis, and/or
- step of catalyzing IPP or DMAPP to generate (E,E,E)-geranylgeranyl pyrophosphate which step is preferably catalyzed by GGPPS, and/or
- the step of catalyzing acetyl-CoA to produce IPP or DMAPP is preferably catalyzed by one or more enzymes selected from the group consisting of MVD, PMK, MVK, HMGR, HMGS, and AACT.
- Item 10 The method according to Item 8 or 9, characterized in that the method further comprises:
- the method comprises the step of culturing the host cell of item 7 under conditions suitable for catalyzing copalyl pyrophosphate or antero-copalyl pyrophosphate to produce a product,
- the method further comprises the step of isolating ent-sandamaradiene, palustadiene, abietriene-7,13-diene and abietriene from the host cell.
- the step comprises: crushing the cells, extracting with an organic solvent and vacuum drying, and the organic solvent is preferably ethyl acetate.
- Rubescens rubescens was collected from Jiyuan County, Henan province. Oligonucleotide primers were purchased from Shenggong Biotechnology (Shanghai) Co., Ltd. Sangon Biotech and GenScript Biotech. First-generation Sanger sequencing was commissioned to Sangon Biotech. GenScript Biotech was commissioned to perform full synthesis of related genes and clone them into the target vector.
- AxyPrep total RNA miniprep kit, polymerase chain reaction (PCR) gel recovery kit, and plasmid extraction kit are all products of Axygen in the United States.
- PrimeSTAR Max DNA Polymerase are products of TAKARA in Japan; restriction endonucleases are all products of NEB.
- Terrific Broth was purchased from Sangon Biotech. Seamless cloning kit was purchased from Novozyme Biotech.
- Escherichia coli DH10B was used for cloning construction, and BL21 (DE3) was used for de novo synthesis fermentation test.
- pET21a and pACYCDuet-1 vectors were used for gene cloning and tandem construction of genes required for the pathway.
- Arktik Thermal Cycler (Thermo Fisher Scientific) was used for PCR; ZXGP-A2050 constant temperature incubator (Zhicheng) and ZWY-211G constant temperature culture oscillator (Zhicheng) were used for constant temperature culture; 5418R high-speed refrigerated centrifuge and 5418 small centrifuge (Eppendorf) were used for centrifugation. Concentrator plus concentrator (Eppendorf) was used for vacuum concentration; OD600 was detected by UV-1200 ultraviolet visible spectrophotometer (Shanghai Meipuda Instrument Co., Ltd.).
- the rotary evaporation system consisted of IKA RV 10digital rotary evaporator (IKA), MZ 2C NT chemical diaphragm pump, and CVC3000 vacuum controller (vacuubrand). JY92-IIN ultrasonic cell crusher (Ningbo Xinzhi Biotechnology) was used for cell disruption.
- Thermo Trace GC ultra-ISQ gas chromatography-mass spectrometry analysis was performed using a Thermo Fisher Scientific gas chromatography-mass spectrometry instrument (Thermo Fisher Scientific). Silica gel column chromatography used 200-300 mesh silica gel (Qingdao Ocean Chemical).
- Module I specifically synthesizes ent-sandaracopimaradiene, and its biosynthetic pathway is shown in Figure 1.
- Module I precursor synthesis
- MVA mevalonic acid pathway
- MEP intrinsic methylerythritol phosphate
- module II diterpene nucleus synthesis
- IPP and DMAPP are used as substrates
- GGPPS and ent-CPS synthesize the intermediate ent-copalyl pyrophosphate (ent-CPP)
- IrubKSL4 catalyzes ent-CPP to synthesize the diterpene nucleus ent-sandaracopimaradiene.
- the plasmid pSY400 constructed in the early stage was used as a template, and the pSY400 part was linearized by PCR amplification using primer pair 358V-F/358V-R (primers are shown in Table 1).
- the genes such as TcGGPPS and SrCPS contained in the linearized pSY400-1 fragment were all optimized by E. coli codons.
- TcGGPPS truncated the N-terminal signal peptide.
- IrubKSL4 codon optimized by E.
- coli from winter rubescens was amplified (primers are shown in Table 1), and connected to the linearized pSY400 fragment using a seamless cloning method to form a plasmid pSYW541.
- the above plasmid and the plasmid pCZ153 constructed in the early stage were co-transformed into E. coli BL21 (DE3) to form the strain sIrubDiT1.
- the pCZ153 contains all the biosynthetic enzymes required for the MVA synthesis pathway: AtoB (WP_077475940.1, from Escherichia coli), MvaS (WP_002361740.1, from Enterococcus faecalis), MvaE (WP_002361740.1, from Enterococcus faecalis), Mvk1 (WP_000197034.1, from Staphylococcus aureus), Mvk2 (WP_000616885.1, from Staphylococcus aureus), MvaD (WP_000597335.1, from Staphylococcus aureus) and Fni (WP_004399098.1, from Bacillus subtilis), which are used to enhance the production of precursor DMAPP/IPP.
- AtoB WP_077475940.1, from Escherichia coli
- MvaS WP_002361740.1, from Enterococcus fa
- a single clone of the engineered strain sIrubDiT1 was picked and cultured overnight in LB medium containing appropriate resistance.
- the seeds of the overnight culture were transferred to 50 mL of TB medium (containing 2% glucose) at 1% v/v and cultured at 37°C and 200 rpm until OD 600 ⁇ 0.5-0.8.
- IPTG with a final concentration of 0.1 mM was used for induction, and the fermentation broth was collected after 96 hours of culture at 28°C. 500 ⁇ L of the fermentation broth was ultrasonically broken and extracted three times with an equal volume of ethyl acetate. The organic layers were combined and concentrated to dryness in vacuo.
- the extract was reconstituted with 100 ⁇ L of ethyl acetate and analyzed by gas chromatography-mass spectrometry (GC-MS).
- GC-MS gas chromatography-mass spectrometry
- HP-5MS glass capillary column (0.25 mm id ⁇ 30 m, 0.25 ⁇ m film thickness) (Agilent Technologies, USA) was used for gas chromatography analysis.
- the chromatographic conditions were set as follows: initial 100°C for 3 minutes, rising to 268°C at 14°C/min and then maintained for 4 minutes, and the carrier gas flow rate was 36.9 cm s -1 .
- the injection temperature was 280°C
- the injection mode was splitless mode
- the electron impact ionization was set to 70 eV
- the ion source temperature was set to 280°C
- the mass spectrum was collected in the range of m/z 30-550.
- the engineered strain sIrubDiT1 was amplified to 1L scale culture. After the culture was completed, the fermentation liquid was ultrasonically crushed, extracted three times with equal volume of ethyl acetate, and the organic layers were combined and concentrated to dryness in vacuo. The crude extract was separated and purified by silica gel column chromatography (200-300 mesh), eluted with n-hexane. The fraction containing enantio-sandamaradiene was detected by GC-MS, and the chromatographic conditions were as shown above.
- IrubKSL7, IrubKSL8 and IrubKSL9 specifically synthesize parrustadiene, abieta-7,13-diene and abietatriene, respectively.
- the biosynthetic pathway is shown in Figure 4.
- Module I (precursor synthesis) includes the heterologously introduced mevalonic acid pathway (MVA) and the intrinsic methylerythritol phosphate (MEP) pathway of Escherichia coli; in module II (diterpene nucleus synthesis), IPP and DMAPP are used as substrates, GGPPS and CPS synthesize the intermediate copalyl pyrophosphate (CPP), IrubKSL7 catalyzes the synthesis of diterpene nucleus palustadiene from CPP, IrubKSL8 catalyzes the synthesis of diterpene nucleus abietariene from CPP, and IrubKSL9 catalyzes the synthesis of diterpene nucleus abietariene from CPP.
- MVA mevalonic acid pathway
- MEP intrinsic methylerythritol phosphate pathway of Escherichia coli
- IPP and DMAPP are used as substrates
- GGPPS and CPS synthesize the
- the pSY400 part was linearized by PCR amplification using primers pGGPPS-EcoRI-revF2/pGGPPS-SpeI-revRn2 (primers are shown in Table 2).
- the TcGGPPS contained in the linearized pSY400-2 fragment was optimized by Escherichia coli codons and the N-terminal signal peptide was removed.
- the SmCPS1 (codon optimized by Escherichia coli) from the salvia miltiorrhiza source was amplified using primers pGGPPS-SynSmCPS1-F/pGGPPS-SynSmCPS1-R, and connected to the linearized pSY400-2 fragment using seamless cloning to form plasmid pSYW542.
- the pSYW542 part was linearized by PCR amplification using primers AX2-SalI-F/SynSmCPS1-R (primers are shown in Table 2).
- IrubKSL7, IrubKSL8 and IrubKSL9 (codon optimized in E. coli) from Rubescens were amplified (primers are shown in Table 2), and connected to the linearized pSYW542 fragment using seamless cloning to form plasmids pSYW543, pSYW544 and pSYW545.
- the above plasmids and the previously constructed plasmid pCZ153 were co-transformed into E. coli BL21 (DE3) to form strains sIrubDiT2-4.
- the pCZ153 was used to enhance the production of precursor DMAPP/IPP as described in Example 2.
- strain sIrubDiT2 can produce parustierene and abietriene
- strain sIrubDiT3 can more specifically produce abietriene-7,13-diene (with a trace amount of abietriene)
- strain sIrubDiT4 can produce abietriene and two unknown diterpene core products.
- the engineered strains sIrubDiT2-4 were scaled up to 1L scale culture. After the culture was completed, the fermentation broth was ultrasonically broken, extracted three times with an equal volume of ethyl acetate, and the organic layers were combined and concentrated to dryness in vacuo. The crude extract was separated and purified by silica gel column chromatography (200-300 mesh) and eluted with n-hexane. The fractions containing the target diterpene nucleus were detected by GC-MS, and the chromatographic conditions were as shown above.
Landscapes
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
L'invention concerne une diterpène cyclase de classe II dérivée d'Isodon rubescens, qui possède une séquence choisie parmi l'une quelconque de SEQ ID NO : 1, 3, 5 et 7 ou une séquence identique à au moins 70 % à celle-ci, et conservant l'activité d'une diterpène cyclase.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202211583472.5 | 2022-12-09 | ||
CN202211583472.5A CN118165968A (zh) | 2022-12-09 | 2022-12-09 | 新型二萜合成酶及其应用 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2024120148A1 true WO2024120148A1 (fr) | 2024-06-13 |
Family
ID=91347306
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2023/132096 WO2024120148A1 (fr) | 2022-12-09 | 2023-11-16 | Nouvelle diterpène synthase et son utilisation |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN118165968A (fr) |
WO (1) | WO2024120148A1 (fr) |
-
2022
- 2022-12-09 CN CN202211583472.5A patent/CN118165968A/zh active Pending
-
2023
- 2023-11-16 WO PCT/CN2023/132096 patent/WO2024120148A1/fr unknown
Also Published As
Publication number | Publication date |
---|---|
CN118165968A (zh) | 2024-06-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9297004B2 (en) | Method for producing α-santalene | |
CN104004789B (zh) | 用于生产β-檀香萜的方法 | |
US8703454B2 (en) | Method for producing (+)-zizaene | |
US9714440B2 (en) | Method for producing patchoulol and 7-epi-α-selinene | |
US11773414B2 (en) | Sesquiterpene synthases for production of drimenol and mixtures thereof | |
WO2024120148A1 (fr) | Nouvelle diterpène synthase et son utilisation | |
JP6856553B2 (ja) | マノオールの製造 | |
JP6748108B2 (ja) | 芳香性化合物の製造 | |
WO2024120509A1 (fr) | Nouvelles ent-caurène hydroxylases et leur utilisation | |
JP2019524104A (ja) | ベチバー | |
CN117586970A (zh) | 细胞色素p450氧化酶变体及其应用 | |
CN117586969A (zh) | 对映-贝壳杉烯酸13-羟基化酶变体及其应用 | |
CN117586971A (zh) | 一种新型对映-贝壳杉烯19位氧化酶及其应用 | |
AU2015201051B2 (en) | Method for producing beta-santalene | |
BR112017028183B1 (pt) | Produção de manool |