US20230416789A1 - Olivetolic acid cyclase variants with improved activity for use in production of phytocannabinoids - Google Patents
Olivetolic acid cyclase variants with improved activity for use in production of phytocannabinoids Download PDFInfo
- Publication number
- US20230416789A1 US20230416789A1 US18/253,066 US202118253066A US2023416789A1 US 20230416789 A1 US20230416789 A1 US 20230416789A1 US 202118253066 A US202118253066 A US 202118253066A US 2023416789 A1 US2023416789 A1 US 2023416789A1
- Authority
- US
- United States
- Prior art keywords
- seq
- ggg
- g43silent
- oac
- protein
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108030006655 Olivetolic acid cyclases Proteins 0.000 title claims abstract description 150
- 230000000694 effects Effects 0.000 title claims description 15
- 238000004519 manufacturing process Methods 0.000 title abstract description 34
- 230000001976 improved effect Effects 0.000 title abstract description 12
- 108091033319 polynucleotide Proteins 0.000 claims abstract description 69
- 102000040430 polynucleotide Human genes 0.000 claims abstract description 69
- 239000002157 polynucleotide Substances 0.000 claims abstract description 69
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 59
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 57
- SXFKFRRXJUJGSS-UHFFFAOYSA-N olivetolic acid Chemical compound CCCCCC1=CC(O)=CC(O)=C1C(O)=O SXFKFRRXJUJGSS-UHFFFAOYSA-N 0.000 claims abstract description 52
- 150000001413 amino acids Chemical class 0.000 claims abstract description 51
- 239000002773 nucleotide Substances 0.000 claims abstract description 51
- 125000003729 nucleotide group Chemical group 0.000 claims abstract description 51
- 238000000034 method Methods 0.000 claims abstract description 40
- 230000035772 mutation Effects 0.000 claims abstract description 40
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 26
- 229920001184 polypeptide Polymers 0.000 claims abstract description 23
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 23
- 239000013604 expression vector Substances 0.000 claims abstract description 22
- 238000012258 culturing Methods 0.000 claims abstract description 5
- 230000001131 transforming effect Effects 0.000 claims abstract description 3
- 102220421646 c.83T>C Human genes 0.000 claims description 183
- 102220509841 Transcription initiation factor TFIID subunit 11_T68R_mutation Human genes 0.000 claims description 110
- 210000004027 cell Anatomy 0.000 claims description 99
- 102220244530 rs763940329 Human genes 0.000 claims description 93
- 102200117731 rs796051888 Human genes 0.000 claims description 51
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 20
- 108010030975 Polyketide Synthases Proteins 0.000 claims description 19
- UCONUSSAWGCZMV-HZPDHXFCSA-N Delta(9)-tetrahydrocannabinolic acid Chemical compound C([C@H]1C(C)(C)O2)CC(C)=C[C@H]1C1=C2C=C(CCCCC)C(C(O)=O)=C1O UCONUSSAWGCZMV-HZPDHXFCSA-N 0.000 claims description 14
- 238000006467 substitution reaction Methods 0.000 claims description 14
- SEEZIOZEUUMJME-FOWTUZBSSA-N cannabigerolic acid Chemical compound CCCCCC1=CC(O)=C(C\C=C(/C)CCC=C(C)C)C(O)=C1C(O)=O SEEZIOZEUUMJME-FOWTUZBSSA-N 0.000 claims description 13
- CYQFCXCEBYINGO-UHFFFAOYSA-N THC Natural products C1=C(C)CCC2C(C)(C)OC3=CC(CCCCC)=CC(O)=C3C21 CYQFCXCEBYINGO-UHFFFAOYSA-N 0.000 claims description 12
- CYQFCXCEBYINGO-IAGOWNOFSA-N delta1-THC Chemical compound C1=C(C)CC[C@H]2C(C)(C)OC3=CC(CCCCC)=CC(O)=C3[C@@H]21 CYQFCXCEBYINGO-IAGOWNOFSA-N 0.000 claims description 12
- 229960004242 dronabinol Drugs 0.000 claims description 12
- 102000005454 Dimethylallyltranstransferase Human genes 0.000 claims description 11
- 108010006731 Dimethylallyltranstransferase Proteins 0.000 claims description 11
- 230000000295 complement effect Effects 0.000 claims description 11
- YJYIDZLGVYOPGU-XNTDXEJSSA-N 2-[(2e)-3,7-dimethylocta-2,6-dienyl]-5-propylbenzene-1,3-diol Chemical compound CCCC1=CC(O)=C(C\C=C(/C)CCC=C(C)C)C(O)=C1 YJYIDZLGVYOPGU-XNTDXEJSSA-N 0.000 claims description 8
- 125000000539 amino acid group Chemical group 0.000 claims description 8
- YJYIDZLGVYOPGU-UHFFFAOYSA-N cannabigeroldivarin Natural products CCCC1=CC(O)=C(CC=C(C)CCC=C(C)C)C(O)=C1 YJYIDZLGVYOPGU-UHFFFAOYSA-N 0.000 claims description 8
- FAVCTJGKHFHFHJ-GXDHUFHOSA-N 3-[(2e)-3,7-dimethylocta-2,6-dienyl]-2,4-dihydroxy-6-propylbenzoic acid Chemical compound CCCC1=CC(O)=C(C\C=C(/C)CCC=C(C)C)C(O)=C1C(O)=O FAVCTJGKHFHFHJ-GXDHUFHOSA-N 0.000 claims description 6
- 239000002253 acid Substances 0.000 claims description 5
- 230000001580 bacterial effect Effects 0.000 claims description 5
- RIVVNGIVVYEIRS-UHFFFAOYSA-N Divaric acid Chemical compound CCCC1=CC(O)=CC(O)=C1C(O)=O RIVVNGIVVYEIRS-UHFFFAOYSA-N 0.000 claims description 4
- 241000588724 Escherichia coli Species 0.000 claims description 4
- 241001099156 Komagataella phaffii Species 0.000 claims description 4
- 241000235015 Yarrowia lipolytica Species 0.000 claims description 4
- QXACEHWTBCFNSA-SFQUDFHCSA-N cannabigerol Chemical compound CCCCCC1=CC(O)=C(C\C=C(/C)CCC=C(C)C)C(O)=C1 QXACEHWTBCFNSA-SFQUDFHCSA-N 0.000 claims description 4
- SVTKBAIRFMXQQF-UHFFFAOYSA-N cannabivarin Chemical compound C1=C(C)C=C2C3=C(O)C=C(CCC)C=C3OC(C)(C)C2=C1 SVTKBAIRFMXQQF-UHFFFAOYSA-N 0.000 claims description 4
- 230000002538 fungal effect Effects 0.000 claims description 4
- ZLHQMHUXJUPEHK-UHFFFAOYSA-N Cannabivarin Natural products CCCc1cc(O)c2c(OC(C)(C)c3ccccc23)c1 ZLHQMHUXJUPEHK-UHFFFAOYSA-N 0.000 claims description 3
- 210000005253 yeast cell Anatomy 0.000 claims description 3
- 102220325697 rs781680294 Human genes 0.000 description 70
- 108020004414 DNA Proteins 0.000 description 57
- 229940024606 amino acid Drugs 0.000 description 51
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 49
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 47
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 42
- 101150084072 ERG20 gene Proteins 0.000 description 41
- 101100373202 Rattus norvegicus Cx3cl1 gene Proteins 0.000 description 41
- 101100055274 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) ALD6 gene Proteins 0.000 description 41
- 101001055100 Homo sapiens Repressor of RNA polymerase III transcription MAF1 homolog Proteins 0.000 description 40
- 102100026898 Repressor of RNA polymerase III transcription MAF1 homolog Human genes 0.000 description 40
- 229940035893 uracil Drugs 0.000 description 39
- 102000004190 Enzymes Human genes 0.000 description 34
- 108090000790 Enzymes Proteins 0.000 description 34
- 102220622699 Focadhesin_I74D_mutation Human genes 0.000 description 30
- 229930003827 cannabinoid Natural products 0.000 description 24
- 239000003557 cannabinoid Substances 0.000 description 24
- IRMPFYJSHJGOPE-UHFFFAOYSA-N olivetol Chemical compound CCCCCC1=CC(O)=CC(O)=C1 IRMPFYJSHJGOPE-UHFFFAOYSA-N 0.000 description 24
- 244000025254 Cannabis sativa Species 0.000 description 22
- 239000013598 vector Substances 0.000 description 16
- 230000014509 gene expression Effects 0.000 description 14
- 230000010354 integration Effects 0.000 description 13
- 230000004048 modification Effects 0.000 description 13
- 238000012986 modification Methods 0.000 description 13
- 150000007523 nucleic acids Chemical class 0.000 description 13
- 241000196324 Embryophyta Species 0.000 description 12
- 230000009466 transformation Effects 0.000 description 12
- 108091028043 Nucleic acid sequence Proteins 0.000 description 11
- 230000015572 biosynthetic process Effects 0.000 description 11
- 108091026890 Coding region Proteins 0.000 description 10
- SEEZIOZEUUMJME-UHFFFAOYSA-N cannabinerolic acid Natural products CCCCCC1=CC(O)=C(CC=C(C)CCC=C(C)C)C(O)=C1C(O)=O SEEZIOZEUUMJME-UHFFFAOYSA-N 0.000 description 10
- 229940065144 cannabinoids Drugs 0.000 description 10
- 150000001875 compounds Chemical class 0.000 description 10
- 239000013612 plasmid Substances 0.000 description 10
- SEEZIOZEUUMJME-VBKFSLOCSA-N Cannabigerolic acid Natural products CCCCCC1=CC(O)=C(C\C=C(\C)CCC=C(C)C)C(O)=C1C(O)=O SEEZIOZEUUMJME-VBKFSLOCSA-N 0.000 description 9
- 241000168726 Dictyostelium discoideum Species 0.000 description 9
- 102000039446 nucleic acids Human genes 0.000 description 9
- 108020004707 nucleic acids Proteins 0.000 description 9
- QHMBSVQNZZTUGM-UHFFFAOYSA-N Trans-Cannabidiol Natural products OC1=CC(CCCCC)=CC(O)=C1C1C(C(C)=C)CCC(C)=C1 QHMBSVQNZZTUGM-UHFFFAOYSA-N 0.000 description 8
- ZTGXAWYVTLUPDT-UHFFFAOYSA-N cannabidiol Natural products OC1=CC(CCCCC)=CC(O)=C1C1C(C(C)=C)CC=C(C)C1 ZTGXAWYVTLUPDT-UHFFFAOYSA-N 0.000 description 8
- QHMBSVQNZZTUGM-ZWKOTPCHSA-N cannabidiol Chemical compound OC1=CC(CCCCC)=CC(O)=C1[C@H]1[C@H](C(C)=C)CCC(C)=C1 QHMBSVQNZZTUGM-ZWKOTPCHSA-N 0.000 description 8
- 229950011318 cannabidiol Drugs 0.000 description 8
- 238000006243 chemical reaction Methods 0.000 description 8
- PCXRACLQFPRCBB-ZWKOTPCHSA-N dihydrocannabidiol Natural products OC1=CC(CCCCC)=CC(O)=C1[C@H]1[C@H](C(C)C)CCC(C)=C1 PCXRACLQFPRCBB-ZWKOTPCHSA-N 0.000 description 8
- 239000000463 material Substances 0.000 description 8
- 239000003550 marker Substances 0.000 description 7
- 230000037361 pathway Effects 0.000 description 7
- 239000000047 product Substances 0.000 description 7
- 230000001105 regulatory effect Effects 0.000 description 7
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 6
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 6
- 239000002207 metabolite Substances 0.000 description 6
- 229930001119 polyketide Natural products 0.000 description 6
- 239000002243 precursor Substances 0.000 description 6
- 238000003786 synthesis reaction Methods 0.000 description 6
- -1 Aromatic Amino Acid Chemical class 0.000 description 5
- 230000002378 acidificating effect Effects 0.000 description 5
- 230000002209 hydrophobic effect Effects 0.000 description 5
- 241000894007 species Species 0.000 description 5
- AAXZFUQLLRMVOG-UHFFFAOYSA-N 2-methyl-2-(4-methylpent-3-enyl)-7-propylchromen-5-ol Chemical compound C1=CC(C)(CCC=C(C)C)OC2=CC(CCC)=CC(O)=C21 AAXZFUQLLRMVOG-UHFFFAOYSA-N 0.000 description 4
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 4
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 4
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 4
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 4
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 4
- 101150022636 MAFB gene Proteins 0.000 description 4
- LTYOQGRJFJAKNA-KKIMTKSISA-N Malonyl CoA Natural products S(C(=O)CC(=O)O)CCNC(=O)CCNC(=O)[C@@H](O)C(CO[P@](=O)(O[P@](=O)(OC[C@H]1[C@@H](OP(=O)(O)O)[C@@H](O)[C@@H](n2c3ncnc(N)c3nc2)O1)O)O)(C)C LTYOQGRJFJAKNA-KKIMTKSISA-N 0.000 description 4
- ZSLZBFCDCINBPY-ZSJPKINUSA-N acetyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 ZSLZBFCDCINBPY-ZSJPKINUSA-N 0.000 description 4
- 125000004429 atom Chemical group 0.000 description 4
- 239000006227 byproduct Substances 0.000 description 4
- 229910052799 carbon Inorganic materials 0.000 description 4
- 238000010276 construction Methods 0.000 description 4
- 230000001747 exhibiting effect Effects 0.000 description 4
- GVVPGTZRZFNKDS-JXMROGBWSA-N geranyl diphosphate Chemical compound CC(C)=CCC\C(C)=C\CO[P@](O)(=O)OP(O)(O)=O GVVPGTZRZFNKDS-JXMROGBWSA-N 0.000 description 4
- 238000009396 hybridization Methods 0.000 description 4
- 230000001965 increasing effect Effects 0.000 description 4
- 101150039067 maf-1 gene Proteins 0.000 description 4
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 4
- 231100000350 mutagenesis Toxicity 0.000 description 4
- 238000002703 mutagenesis Methods 0.000 description 4
- 231100000219 mutagenic Toxicity 0.000 description 4
- 230000003505 mutagenic effect Effects 0.000 description 4
- 230000000869 mutational effect Effects 0.000 description 4
- 230000008569 process Effects 0.000 description 4
- 238000012216 screening Methods 0.000 description 4
- 239000000758 substrate Substances 0.000 description 4
- 238000013518 transcription Methods 0.000 description 4
- 230000035897 transcription Effects 0.000 description 4
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 3
- OKDRUMBNXIYUEO-VHJVCUAWSA-N (2s,3s)-3-hydroxy-2-[(e)-prop-1-enyl]-2,3-dihydropyran-6-one Chemical compound C\C=C\[C@@H]1OC(=O)C=C[C@@H]1O OKDRUMBNXIYUEO-VHJVCUAWSA-N 0.000 description 3
- 241000218236 Cannabis Species 0.000 description 3
- 235000008697 Cannabis sativa Nutrition 0.000 description 3
- 101000712615 Cannabis sativa Tetrahydrocannabinolic acid synthase Proteins 0.000 description 3
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 3
- 101001081533 Homo sapiens Isopentenyl-diphosphate Delta-isomerase 1 Proteins 0.000 description 3
- 102100027665 Isopentenyl-diphosphate Delta-isomerase 1 Human genes 0.000 description 3
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 3
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 3
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 3
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 3
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 3
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 3
- 101150050051 OAC gene Proteins 0.000 description 3
- 108091034117 Oligonucleotide Proteins 0.000 description 3
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 3
- 125000001931 aliphatic group Chemical group 0.000 description 3
- 125000003118 aryl group Chemical group 0.000 description 3
- 230000001419 dependent effect Effects 0.000 description 3
- 239000007789 gas Substances 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 229910052757 nitrogen Inorganic materials 0.000 description 3
- 125000000830 polyketide group Chemical group 0.000 description 3
- 230000000506 psychotropic effect Effects 0.000 description 3
- ZROLHBHDLIHEMS-HUUCEWRRSA-N (6ar,10ar)-6,6,9-trimethyl-3-propyl-6a,7,8,10a-tetrahydrobenzo[c]chromen-1-ol Chemical compound C1=C(C)CC[C@H]2C(C)(C)OC3=CC(CCC)=CC(O)=C3[C@@H]21 ZROLHBHDLIHEMS-HUUCEWRRSA-N 0.000 description 2
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 2
- KASVLYINZPAMNS-UHFFFAOYSA-N Cannabigerol monomethylether Natural products CCCCCC1=CC(O)=C(CC=C(C)CCC=C(C)C)C(OC)=C1 KASVLYINZPAMNS-UHFFFAOYSA-N 0.000 description 2
- 101150059484 CycT gene Proteins 0.000 description 2
- ZROLHBHDLIHEMS-UHFFFAOYSA-N Delta9 tetrahydrocannabivarin Natural products C1=C(C)CCC2C(C)(C)OC3=CC(CCC)=CC(O)=C3C21 ZROLHBHDLIHEMS-UHFFFAOYSA-N 0.000 description 2
- ODKSFYDXXFIFQN-BYPYZUCNSA-N L-arginine Chemical compound OC(=O)[C@@H](N)CCCN=C(N)N ODKSFYDXXFIFQN-BYPYZUCNSA-N 0.000 description 2
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 2
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 2
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 2
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 2
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 2
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 2
- 241000235648 Pichia Species 0.000 description 2
- 229930182558 Sterol Natural products 0.000 description 2
- 238000009825 accumulation Methods 0.000 description 2
- 230000004075 alteration Effects 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 108060000514 aromatic prenyltransferase Proteins 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 2
- 230000000975 bioactive effect Effects 0.000 description 2
- 230000004071 biological effect Effects 0.000 description 2
- WVOLTBSCXRRQFR-DLBZAZTESA-N cannabidiolic acid Chemical compound OC1=C(C(O)=O)C(CCCCC)=CC(O)=C1[C@H]1[C@H](C(C)=C)CCC(C)=C1 WVOLTBSCXRRQFR-DLBZAZTESA-N 0.000 description 2
- 150000001732 carboxylic acid derivatives Chemical group 0.000 description 2
- 230000002759 chromosomal effect Effects 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 238000004807 desolvation Methods 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 239000003480 eluent Substances 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 150000002185 fatty acyl-CoAs Chemical class 0.000 description 2
- 235000019253 formic acid Nutrition 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- BRZYSWJRSDMWLG-CAXSIQPQSA-N geneticin Chemical compound O1C[C@@](O)(C)[C@H](NC)[C@@H](O)[C@H]1O[C@@H]1[C@@H](O)[C@H](O[C@@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](C(C)O)O2)N)[C@@H](N)C[C@H]1N BRZYSWJRSDMWLG-CAXSIQPQSA-N 0.000 description 2
- 125000001072 heteroaryl group Chemical group 0.000 description 2
- OEXFMSFODMQEPE-HDRQGHTBSA-N hexanoyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)CCCCC)O[C@H]1N1C2=NC=NC(N)=C2N=C1 OEXFMSFODMQEPE-HDRQGHTBSA-N 0.000 description 2
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 2
- 229910052739 hydrogen Inorganic materials 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 229940124280 l-arginine Drugs 0.000 description 2
- LTYOQGRJFJAKNA-DVVLENMVSA-N malonyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)CC(O)=O)O[C@H]1N1C2=NC=NC(N)=C2N=C1 LTYOQGRJFJAKNA-DVVLENMVSA-N 0.000 description 2
- 125000005637 malonyl-CoA group Chemical group 0.000 description 2
- 230000008450 motivation Effects 0.000 description 2
- 239000013642 negative control Substances 0.000 description 2
- 230000002018 overexpression Effects 0.000 description 2
- 229910052698 phosphorus Inorganic materials 0.000 description 2
- 150000003881 polyketide derivatives Chemical class 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 238000011002 quantification Methods 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- 150000003432 sterols Chemical class 0.000 description 2
- 235000003702 sterols Nutrition 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 150000003505 terpenes Chemical class 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 210000001519 tissue Anatomy 0.000 description 2
- 230000009261 transgenic effect Effects 0.000 description 2
- 230000001052 transient effect Effects 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- 101710165761 (2E,6E)-farnesyl diphosphate synthase Proteins 0.000 description 1
- FQVLRGLGWNWPSS-BXBUPLCLSA-N (4r,7s,10s,13s,16r)-16-acetamido-13-(1h-imidazol-5-ylmethyl)-10-methyl-6,9,12,15-tetraoxo-7-propan-2-yl-1,2-dithia-5,8,11,14-tetrazacycloheptadecane-4-carboxamide Chemical compound N1C(=O)[C@@H](NC(C)=O)CSSC[C@@H](C(N)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)NC(=O)[C@@H]1CC1=CN=CN1 FQVLRGLGWNWPSS-BXBUPLCLSA-N 0.000 description 1
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 1
- GHURLXDGHMZIGJ-UHFFFAOYSA-N 4-hydroxy-6-(2-oxoheptyl)pyran-2-one Chemical compound CCCCCC(=O)CC1=CC(O)=CC(=O)O1 GHURLXDGHMZIGJ-UHFFFAOYSA-N 0.000 description 1
- YIXLDRQOKWESBI-UHFFFAOYSA-N 4-hydroxy-6-pentylpyran-2-one Chemical compound CCCCCC1=CC(O)=CC(=O)O1 YIXLDRQOKWESBI-UHFFFAOYSA-N 0.000 description 1
- UHPMCKVQTMMPCG-UHFFFAOYSA-N 5,8-dihydroxy-2-methoxy-6-methyl-7-(2-oxopropyl)naphthalene-1,4-dione Chemical compound CC1=C(CC(C)=O)C(O)=C2C(=O)C(OC)=CC(=O)C2=C1O UHPMCKVQTMMPCG-UHFFFAOYSA-N 0.000 description 1
- 108030002854 Acetoacetyl-CoA synthases Proteins 0.000 description 1
- 101150021974 Adh1 gene Proteins 0.000 description 1
- 241000219195 Arabidopsis thaliana Species 0.000 description 1
- 235000017060 Arachis glabrata Nutrition 0.000 description 1
- 244000105624 Arachis hypogaea Species 0.000 description 1
- 235000010777 Arachis hypogaea Nutrition 0.000 description 1
- 235000018262 Arachis monticola Nutrition 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- 240000002900 Arthrospira platensis Species 0.000 description 1
- 235000016425 Arthrospira platensis Nutrition 0.000 description 1
- 241000228212 Aspergillus Species 0.000 description 1
- 241000351920 Aspergillus nidulans Species 0.000 description 1
- 241000228245 Aspergillus niger Species 0.000 description 1
- 240000006439 Aspergillus oryzae Species 0.000 description 1
- 235000002247 Aspergillus oryzae Nutrition 0.000 description 1
- 235000007319 Avena orientalis Nutrition 0.000 description 1
- 244000075850 Avena orientalis Species 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 241000151861 Barnettozyma salicaria Species 0.000 description 1
- 235000014698 Brassica juncea var multisecta Nutrition 0.000 description 1
- 235000006008 Brassica napus var napus Nutrition 0.000 description 1
- 240000000385 Brassica napus var. napus Species 0.000 description 1
- 235000006618 Brassica rapa subsp oleifera Nutrition 0.000 description 1
- 235000004977 Brassica sinapistrum Nutrition 0.000 description 1
- 101100221193 Caenorhabditis elegans coa-7 gene Proteins 0.000 description 1
- UVOLYTDXHDXWJU-UHFFFAOYSA-N Cannabichromene Chemical compound C1=CC(C)(CCC=C(C)C)OC2=CC(CCCCC)=CC(O)=C21 UVOLYTDXHDXWJU-UHFFFAOYSA-N 0.000 description 1
- WVOLTBSCXRRQFR-SJORKVTESA-N Cannabidiolic acid Natural products OC1=C(C(O)=O)C(CCCCC)=CC(O)=C1[C@@H]1[C@@H](C(C)=C)CCC(C)=C1 WVOLTBSCXRRQFR-SJORKVTESA-N 0.000 description 1
- 108010075293 Cannabidiolic acid synthase Proteins 0.000 description 1
- REOZWEGFPHTFEI-JKSUJKDBSA-N Cannabidivarin Chemical compound OC1=CC(CCC)=CC(O)=C1[C@H]1[C@H](C(C)=C)CCC(C)=C1 REOZWEGFPHTFEI-JKSUJKDBSA-N 0.000 description 1
- 102000018208 Cannabinoid Receptor Human genes 0.000 description 1
- 108050007331 Cannabinoid receptor Proteins 0.000 description 1
- VBGLYOIFKLUMQG-UHFFFAOYSA-N Cannabinol Chemical compound C1=C(C)C=C2C3=C(O)C=C(CCCCC)C=C3OC(C)(C)C2=C1 VBGLYOIFKLUMQG-UHFFFAOYSA-N 0.000 description 1
- 241000195597 Chlamydomonas reinhardtii Species 0.000 description 1
- 241000195651 Chlorella sp. Species 0.000 description 1
- 206010061765 Chromosomal mutation Diseases 0.000 description 1
- 241001674013 Chrysosporium lucknowense Species 0.000 description 1
- 241000186226 Corynebacterium glutamicum Species 0.000 description 1
- 229920000742 Cotton Polymers 0.000 description 1
- 101710095468 Cyclase Proteins 0.000 description 1
- 102100028717 Cytosolic 5'-nucleotidase 3A Human genes 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 101100136092 Drosophila melanogaster peng gene Proteins 0.000 description 1
- 241001560459 Dunaliella sp. Species 0.000 description 1
- 241000620209 Escherichia coli DH5[alpha] Species 0.000 description 1
- 101710156207 Farnesyl diphosphate synthase Proteins 0.000 description 1
- VWFJDQUYCIWHTN-UHFFFAOYSA-N Farnesyl pyrophosphate Natural products CC(C)=CCCC(C)=CCCC(C)=CCOP(O)(=O)OP(O)(O)=O VWFJDQUYCIWHTN-UHFFFAOYSA-N 0.000 description 1
- 101710125754 Farnesyl pyrophosphate synthase Proteins 0.000 description 1
- 102100035111 Farnesyl pyrophosphate synthase Human genes 0.000 description 1
- 101710089428 Farnesyl pyrophosphate synthase erg20 Proteins 0.000 description 1
- 239000004606 Fillers/Extenders Substances 0.000 description 1
- 241000223218 Fusarium Species 0.000 description 1
- 241001149959 Fusarium sp. Species 0.000 description 1
- 241000567178 Fusarium venenatum Species 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 1
- 244000068988 Glycine max Species 0.000 description 1
- 235000010469 Glycine max Nutrition 0.000 description 1
- 241000219146 Gossypium Species 0.000 description 1
- 241000168517 Haematococcus lacustris Species 0.000 description 1
- 244000020551 Helianthus annuus Species 0.000 description 1
- 235000003222 Helianthus annuus Nutrition 0.000 description 1
- 240000005979 Hordeum vulgare Species 0.000 description 1
- 235000007340 Hordeum vulgare Nutrition 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 241001138401 Kluyveromyces lactis Species 0.000 description 1
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 1
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 1
- 101001110310 Lentilactobacillus kefiri NADP-dependent (R)-specific alcohol dehydrogenase Proteins 0.000 description 1
- 241000219745 Lupinus Species 0.000 description 1
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 1
- 240000005561 Musa balbisiana Species 0.000 description 1
- 235000018290 Musa x paradisiaca Nutrition 0.000 description 1
- 241000204051 Mycoplasma genitalium Species 0.000 description 1
- 241001300629 Nannochloropsis oceanica Species 0.000 description 1
- 241000221961 Neurospora crassa Species 0.000 description 1
- 241000208135 Nicotiana sp. Species 0.000 description 1
- IGHTZQUIFGUJTG-QSMXQIJUSA-N O1C2=CC(CCCCC)=CC(O)=C2[C@H]2C(C)(C)[C@@H]3[C@H]2[C@@]1(C)CC3 Chemical compound O1C2=CC(CCCCC)=CC(O)=C2[C@H]2C(C)(C)[C@@H]3[C@H]2[C@@]1(C)CC3 IGHTZQUIFGUJTG-QSMXQIJUSA-N 0.000 description 1
- 241000320412 Ogataea angusta Species 0.000 description 1
- 241001452677 Ogataea methanolica Species 0.000 description 1
- 241001099341 Ogataea polymorpha Species 0.000 description 1
- 241000489470 Ogataea trehalophila Species 0.000 description 1
- 241000826199 Ogataea wickerhamii Species 0.000 description 1
- 240000007594 Oryza sativa Species 0.000 description 1
- 235000007164 Oryza sativa Nutrition 0.000 description 1
- 241000530350 Phaffomyces opuntiae Species 0.000 description 1
- 241000529953 Phaffomyces thermotolerans Species 0.000 description 1
- ZJPGOXWRFNKIQL-JYJNAYRXSA-N Phe-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 ZJPGOXWRFNKIQL-JYJNAYRXSA-N 0.000 description 1
- 241000235062 Pichia membranifaciens Species 0.000 description 1
- 240000004713 Pisum sativum Species 0.000 description 1
- 235000010582 Pisum sativum Nutrition 0.000 description 1
- 102000019337 Prenyltransferases Human genes 0.000 description 1
- 108050006837 Prenyltransferases Proteins 0.000 description 1
- 101710150389 Probable farnesyl diphosphate synthase Proteins 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 241000589517 Pseudomonas aeruginosa Species 0.000 description 1
- 241001453299 Pseudomonas mevalonii Species 0.000 description 1
- 241000589776 Pseudomonas putida Species 0.000 description 1
- 102000009572 RNA Polymerase II Human genes 0.000 description 1
- 108010009460 RNA Polymerase II Proteins 0.000 description 1
- MUPFEKGTMRGPLJ-RMMQSMQOSA-N Raffinose Natural products O(C[C@H]1[C@@H](O)[C@H](O)[C@@H](O)[C@@H](O[C@@]2(CO)[C@H](O)[C@@H](O)[C@@H](CO)O2)O1)[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 MUPFEKGTMRGPLJ-RMMQSMQOSA-N 0.000 description 1
- 241000191023 Rhodobacter capsulatus Species 0.000 description 1
- 241000191043 Rhodobacter sphaeroides Species 0.000 description 1
- 241000187562 Rhodococcus sp. Species 0.000 description 1
- 241000190984 Rhodospirillum rubrum Species 0.000 description 1
- 241000607142 Salmonella Species 0.000 description 1
- 241000293871 Salmonella enterica subsp. enterica serovar Typhi Species 0.000 description 1
- 241000235060 Scheffersomyces stipitis Species 0.000 description 1
- 241000235347 Schizosaccharomyces pombe Species 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 241000607768 Shigella Species 0.000 description 1
- 241000607762 Shigella flexneri Species 0.000 description 1
- 241000607760 Shigella sonnei Species 0.000 description 1
- 240000003768 Solanum lycopersicum Species 0.000 description 1
- 244000061456 Solanum tuberosum Species 0.000 description 1
- 235000002595 Solanum tuberosum Nutrition 0.000 description 1
- 101001079635 Solanum tuberosum 3-hydroxy-3-methylglutaryl-coenzyme A reductase 1 Proteins 0.000 description 1
- 240000006394 Sorghum bicolor Species 0.000 description 1
- 235000011684 Sorghum saccharatum Nutrition 0.000 description 1
- 238000002105 Southern blotting Methods 0.000 description 1
- 241000187432 Streptomyces coelicolor Species 0.000 description 1
- 241000192560 Synechococcus sp. Species 0.000 description 1
- 244000299461 Theobroma cacao Species 0.000 description 1
- 235000009470 Theobroma cacao Nutrition 0.000 description 1
- 241001313536 Thermothelomyces thermophila Species 0.000 description 1
- 108091036066 Three prime untranslated region Proteins 0.000 description 1
- 102000004357 Transferases Human genes 0.000 description 1
- 108090000992 Transferases Proteins 0.000 description 1
- 241000499912 Trichoderma reesei Species 0.000 description 1
- 235000021307 Triticum Nutrition 0.000 description 1
- 244000098338 Triticum aestivum Species 0.000 description 1
- MUPFEKGTMRGPLJ-UHFFFAOYSA-N UNPD196149 Natural products OC1C(O)C(CO)OC1(CO)OC1C(O)C(O)C(O)C(COC2C(C(O)C(O)C(CO)O2)O)O1 MUPFEKGTMRGPLJ-UHFFFAOYSA-N 0.000 description 1
- 241000370136 Wickerhamomyces pijperi Species 0.000 description 1
- 241000607479 Yersinia pestis Species 0.000 description 1
- 240000008042 Zea mays Species 0.000 description 1
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 description 1
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 1
- 241000588902 Zymomonas mobilis Species 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 238000007792 addition Methods 0.000 description 1
- 108010081577 aldehyde dehydrogenase (NAD(P)+) Proteins 0.000 description 1
- 235000005550 amino acid supplement Nutrition 0.000 description 1
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 1
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 1
- 235000011130 ammonium sulphate Nutrition 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 229960003121 arginine Drugs 0.000 description 1
- 125000000637 arginyl group Chemical group N[C@@H](CCCNC(N)=N)C(=O)* 0.000 description 1
- 229940011019 arthrospira platensis Drugs 0.000 description 1
- 210000004507 artificial chromosome Anatomy 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 230000001851 biosynthetic effect Effects 0.000 description 1
- CRFNGMNYKDXRTN-CITAKDKDSA-N butyryl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)CCC)O[C@H]1N1C2=NC=NC(N)=C2N=C1 CRFNGMNYKDXRTN-CITAKDKDSA-N 0.000 description 1
- HRHJHXJQMNWQTF-UHFFFAOYSA-N cannabichromenic acid Chemical compound O1C(C)(CCC=C(C)C)C=CC2=C1C=C(CCCCC)C(C(O)=O)=C2O HRHJHXJQMNWQTF-UHFFFAOYSA-N 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 150000001768 cations Chemical class 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 235000013339 cereals Nutrition 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 238000006482 condensation reaction Methods 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 238000000354 decomposition reaction Methods 0.000 description 1
- XPPKVPWEQAFLFU-UHFFFAOYSA-J diphosphate(4-) Chemical compound [O-]P([O-])(=O)OP([O-])([O-])=O XPPKVPWEQAFLFU-UHFFFAOYSA-J 0.000 description 1
- 235000011180 diphosphates Nutrition 0.000 description 1
- 230000003828 downregulation Effects 0.000 description 1
- 238000002330 electrospray ionisation mass spectrometry Methods 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 238000000855 fermentation Methods 0.000 description 1
- 230000004151 fermentation Effects 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 102000034287 fluorescent proteins Human genes 0.000 description 1
- 108091006047 fluorescent proteins Proteins 0.000 description 1
- 238000005194 fractionation Methods 0.000 description 1
- 229930182830 galactose Natural products 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 125000002350 geranyl group Chemical group [H]C([*])([H])/C([H])=C(C([H])([H])[H])/C([H])([H])C([H])([H])C([H])=C(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 125000005842 heteroatom Chemical group 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 238000000589 high-performance liquid chromatography-mass spectrometry Methods 0.000 description 1
- 125000004435 hydrogen atom Chemical group [H]* 0.000 description 1
- GPRLSGONYQIRFK-UHFFFAOYSA-N hydron Chemical compound [H+] GPRLSGONYQIRFK-UHFFFAOYSA-N 0.000 description 1
- 125000002883 imidazolyl group Chemical group 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 238000011031 large-scale manufacturing process Methods 0.000 description 1
- MYUGVHJLXONYNC-QHTZZOMLSA-L magnesium;(2s)-2-amino-5-hydroxy-5-oxopentanoate Chemical compound [Mg+2].[O-]C(=O)[C@@H](N)CCC(O)=O.[O-]C(=O)[C@@H](N)CCC(O)=O MYUGVHJLXONYNC-QHTZZOMLSA-L 0.000 description 1
- 230000014759 maintenance of location Effects 0.000 description 1
- 235000009973 maize Nutrition 0.000 description 1
- LTYOQGRJFJAKNA-VFLPNFFSSA-N malonyl-coa Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCSC(=O)CC(O)=O)O[C@H]1N1C2=NC=NC(N)=C2N=C1 LTYOQGRJFJAKNA-VFLPNFFSSA-N 0.000 description 1
- 230000007721 medicinal effect Effects 0.000 description 1
- 238000002844 melting Methods 0.000 description 1
- 230000008018 melting Effects 0.000 description 1
- 229930003658 monoterpene Natural products 0.000 description 1
- 150000002773 monoterpene derivatives Chemical class 0.000 description 1
- 235000002577 monoterpenes Nutrition 0.000 description 1
- 238000007857 nested PCR Methods 0.000 description 1
- 230000002352 nonmutagenic effect Effects 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 238000007899 nucleic acid hybridization Methods 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-O oxonium Chemical compound [OH3+] XLYOFNOQVPJJNP-UHFFFAOYSA-O 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 235000020232 peanut Nutrition 0.000 description 1
- 101150079312 pgk1 gene Proteins 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 238000003908 quality control method Methods 0.000 description 1
- MUPFEKGTMRGPLJ-ZQSKZDJDSA-N raffinose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO[C@@H]2[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O2)O)O1 MUPFEKGTMRGPLJ-ZQSKZDJDSA-N 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 235000009566 rice Nutrition 0.000 description 1
- 125000003607 serino group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C(O[H])([H])[H] 0.000 description 1
- 229940115939 shigella sonnei Drugs 0.000 description 1
- 230000037432 silent mutation Effects 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 230000007306 turnover Effects 0.000 description 1
- 238000004704 ultra performance liquid chromatography Methods 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 239000003643 water by type Substances 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/40—Preparation of oxygen-containing organic compounds containing a carboxyl group including Peroxycarboxylic acids
- C12P7/42—Hydroxy-carboxylic acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/88—Lyases (4.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P17/00—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
- C12P17/02—Oxygen as only ring hetero atoms
- C12P17/06—Oxygen as only ring hetero atoms containing a six-membered hetero ring, e.g. fluorescein
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/02—Preparation of oxygen-containing organic compounds containing a hydroxy group
- C12P7/22—Preparation of oxygen-containing organic compounds containing a hydroxy group aromatic
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y404/00—Carbon-sulfur lyases (4.4)
- C12Y404/01—Carbon-sulfur lyases (4.4.1)
- C12Y404/01026—Olivetolic acid cyclase (4.4.1.26)
Definitions
- the present disclosure relates generally to proteins having olivetolic acid cyclase activity, useful in production of phytocannabinoids.
- Phytocannabinoids are a large class of compounds with over 100 different known structures that are produced in the Cannabis sativa plant. Phytocannabinoids are known to be biosynthesized in C. sativa , or may result from thermal or other decomposition from phytocannabinoids biosynthesized in C. sativa . These bio-active molecules, such as tetrahydrocannabinol (THC) and cannabidiol (CBD), can be extracted from plant material for medical and recreational purposes. However, the synthesis of plant material is costly, not readily scalable to large volumes, and requires lengthy growing periods to produce sufficient quantities of phytocannabinoids. While the C.
- THC tetrahydrocannabinol
- CBD cannabidiol
- sativa plant is also a valuable source of grain, fiber, and other material, growing C. sativa for phytocannabinoid production, particularly indoors, is costly in terms of energy and labour. Subsequent extraction, purification, and fractionation of phytocannabinoids from the C. sativa plant is also labour and energy intensive.
- Phytocannabinoids are pharmacologically active molecules that contribute to the medical and psychotropic effects of C. sativa .
- Biosynthesis of phytocannabinoids in the C. sativa plant scales similarly to other agricultural projects.
- large scale production of phytocannabinoids by growing C. sativa requires a variety of inputs (e.g. nutrients, light, pest control, CO, etc.). The inputs required for cultivating C. sativa must be provided.
- cultivation of C. sativa where allowed, is currently subject to heavy regulation, taxation, and rigorous quality control where products prepared from the plant are for commercial use, further increasing costs.
- Phytocannabinoid analogues are pharmacologically active molecules that are structurally similar to phytocannabinoids. Phytocannabinoid analogues are often synthesized chemically, which can be labour intensive and costly. As a result, it may be economical to produce the phytocannabinoids and phytocannabinoid analogues in a robust and scalable, fermentable organism. Saccharomyces cerevisiae is an example of a fermentable organism that has been used to produce industrial scales of similar molecules.
- Olivetolic Acid Cyclase (OAC) variants are described herein which are capable of producing olivetolic acid (OVLa). These variants are useful in the production of olivetolic acid and relevant phytocannabinoids in a heterologous host. Methods of production are described. The described OAC variants that can produce olivetolic acid and downstream metabolites in a modified yeast cell can be applied to any host and used in phytocannabinoid production.
- OAC Olivetolic Acid Cyclase
- OAC variants comprise 6 or greater non-conservative substitution amino acid mutations relative to the wild type enzyme. Certain OAC variants described have improved activity and/or show improved ratios of olivetol to olivetolic acid compared to the wild type enzyme.
- a method of producing OVLa or a phytocannabinoid derived therefrom in a heterologous host cell having OVLa-producing or phytocannabinoid-producing capacity comprises: transforming the host cell with a nucleotide encoding a variant olivetolic acid cyclase (OAC) protein having at least 6 amino acid mutations relative to the wild type OAC protein, and culturing said transformed host cell to produce olivetolic acid and/or phytocannabinoids therefrom, wherein the variant OAC protein comprises at least 85%, at least 90%, at least 95%, or at least 99% sequence identity with the wild type OAC protein sequence according to SEQ ID NO:91.
- OAC olivetolic acid cyclase
- An isolated polypeptide having olivetolic acid cyclase activity comprising an amino acid sequence of at least 85%, at least 90%, at least 95%, or at least 99% sequence identity with SEQ ID NO: 92, wherein 6 or more amino acid residues comprise mutations relative to SEQ ID NO:91, located at 6 or more of residues 28, 31, 41, 43, 44, 68, 74, 84, 100 or 102 of SEQ ID NO:91.
- An isolated polynucleotide comprising: (a) a nucleotide sequence according to SEQ ID NO:3-SEQ ID NO:39; (b) a nucleotide sequence having at least 85%, at least 90%, at least 95%, or at least 99% identity with the nucleotide sequence of (a), or (c) a nucleotide sequence that hybridizes with the complementary strand of the nucleotide having the sequence of (a).
- Expression vectors comprising the polynucleotide, and host cells transformed with such expression vectors are described.
- FIG. 1 illustrates a cannabinoid biosynthesis pathway in Cannabis sativa.
- FIG. 2 illustrates a cannabinoid biosynthesis pathway as described in Applicant's co-pending International Application No. PCT/CA2020/050687.
- FIG. 3 illustrates PCR primers used in site-saturation mutagenesis protocol.
- FIG. 4 shows an overlap-extension approach that was used to assemble mutagenic oligonucleotides for combinatorial library construction.
- the symbol x represents a point mutation.
- FIG. 5 shows olivetolic acid production with mutant OAC variants.
- a method of producing olivetolic acid (OVLa) or a phytocannabinoid produced therefrom is described herein.
- a heterologous host cell comprising OVLa-producing or phytocannabinoid-producing capacity is transformed with a nucleotide encoding a variant olivetolic acid cyclase (OAC) protein having at least 6 amino acid mutations relative to the wild type OAC protein and culturing said transformed host cell to produce olivetolic acid and/or phytocannabinoids therefrom, wherein the variant OAC protein comprises at least 85%, at least 90%, at least 95%, or at least 99% sequence identity with the wild type OAC protein sequence according to SEQ ID NO: 91.
- Exemplary variant proteins may result in improved OVLa or phytocannabinoid production over wild type according to the method described.
- At least 4 of the at least 6 amino acid mutations of the OAC protein are in residues 28, 31, 41, 43, 44, 68, 74, 84, 100 or 102 of the wild type OAC protein (SEQ ID NO:91), with other mutations being located elsewhere in the sequence.
- a mutation can be either a conservative or non-conservative amino acid substitution, but may advantageously be a non-conservative amino acid substitution.
- the method may encompass transformation of the host cell with a nucleotide encoding the variant olivetolic acid cyclase (OAC) protein, which nucleotide has a sequence comprising: (a) a nucleotide sequence according to SEQ ID NO:3-SEQ ID NO:39; (b) a nucleotide sequence having at least 85%, at least 90%, at least 95%, or at least 99% identity with the sequence of (a); or (c) a nucleotide sequence that hybridizes with the complementary strand of the nucleotide having the sequence of (a).
- the variant OAC protein may comprise a according to any one of SEQ ID NO:40 to SEQ ID NO:76.
- At least 4 of the at least 6 amino acid mutations relative to the wild type OAC protein are selected from the group consisting of: V28A; V31G; Y41T, Y41S or Y41V; K44V; T68L or T68R; 174E, 174R, 174D or 174G; V84R; R100M or R100E; and G102R, G102S, or G102STOP.
- at least 10, at least 9, at least 8, at least 7, at least 6, or at least 5 of these amino acid mutations relative to wild type may be present.
- the production of a phytocannabinoid by the transformed host cell may involve production of phytocannabinoids including but not limited to cannabigerol (CBG), cannabigerolic acid (CBGa), cannabigerovarin (CBGV), cannabigerovarinic acid (CBGVa), cannabigerocin (CBGO), cannabigerocinic acid (CBGOa), a cannabivarin, tetrahydrocannabinol (THC), or tetrahydrocannabinolic acid (THCa).
- CBD cannabigerol
- CBGa cannabigerolic acid
- CBDGV cannabigerovarin
- CBGVa cannabigerovarinic acid
- CBDGO cannabigerocin
- CBDGOa cannabigerocinic acid
- THC tetrahydrocannabinol
- THCa tetrahydrocannabinolic acid
- the host cell may be transformed with a nucleotide encoding variant olivetolic acid cyclase (OAC) protein with at least 85%, at least 90%, at least 95%, at least 99%, or 100% sequence identity of any one of the following sequences with the indicated substitutions from OAC wild type (SEQ ID NO:91) being present:
- OAC olivetolic acid cyclase
- the host cell transformed in the method described may be a bacterial cell, a fungal cell, a protist cell, or a plant cell.
- Exemplary organisms include S. cerevisiae, E. coli, Yarrowia lipolytica , or Komagataella phaffii , as well as others described herein.
- the transformed host cell may additionally comprise, or be transformed with, other enzymes useful in phytocannabinoid production.
- a polynucleotide encoding a polyketide synthase enzyme and/or a polynucleotide encoding a prenyltransferase enzyme may also be included in the host cell. Further options for polynucleotides and methods, such as described in Applicant's co-pending International Application No. PCT/CA2020/050687 (hereby incorporated by reference) are envisioned.
- An isolated polypeptide is described herein, which has olivetolic acid cyclase (OAC) activity.
- the polypeptide activity comprises an amino acid sequence according to SEQ ID NO: 92, wherein 6 or more amino acid residues comprise mutations relative to SEQ ID NO: 91 (wild type OAC) which are mutations at residues 28, 31, 41, 43, 44, 68, 74, 84, 100 or 102 of SEQ ID NO:91.
- the isolated polypeptide may have an amino acid sequence according to one of SEQ ID NO:40 to SEQ ID NO:76.
- An isolated polynucleotide which may have (a) a nucleotide sequence according to SEQ ID NO:3-SEQ ID NO:39; (b) a nucleotide sequence having at least 85% identity with the nucleotide sequence of (a), or (c) a nucleotide sequence that hybridizes with the complementary strand of the nucleotide having the sequence of (a).
- an expression vector comprising a polynucleotide encoding a variant olivetolic acid cyclase (OAC) protein having the sequence of SEQ ID NO: 92, in which 6 or more amino acid mutations are present relative to the wild type OAC protein.
- the polynucleotide encoding the variant OAC protein may have at least 85% sequence identity with any one of SEQ ID NO:3 to SEQ ID NO:39.
- a host cell transformed with the above-described expression vector is also encompassed herein.
- a host cell may additionally comprise a polynucleotide encoding other enzymes useful in synthesis of olivetolic acid and/or phytocannabinoids, such as encoding a polyketide synthase enzyme and/or a prenyltransferase enzyme.
- Such a host cell may be a bacterial cell, a fungal cell, a protist cell, or a plant cell, for example: S. cerevisiae, E. coli, Yarrowia lipolytica , or Komagataella phaffii.
- cannabinoid refers to a chemical compound that shows direct or indirect activity at a cannabinoid receptor.
- cannabinoids include tetrahydrocannabinol (THC), cannabidiol (CBD), cannabinol (CBN), cannabigerol (CBG), cannabichromene (CBC), cannabicyclol (CBL), cannabivarin (CBV), tetrahydrocannabivarin (THCV), cannabidivarin (CBDV), cannabichromevarin (CBCV), cannabigerovarin (CBGV), and cannabigerol monomethyl ether (CBGM).
- phytocannabinoid refers to a cannabinoid that typically occurs in a plant species.
- exemplary phytocannabinoids produced according to the invention include cannabigerol (CBG), cannabigerolic acid (CBGa), cannabigerovarin (CBGV), cannabigerovarinic acid (CBGVa), cannabigerocin (CBGo), or cannabigerocinic acid (CBGoa).
- Cannabinoids and phytocannabinoids may contain or may lack one or more carboxylic acid functional groups.
- Non limiting examples of such cannabinoids or phytocannabinoids containing carboxylic acid function groups or phytocannabinoids include tetrahydrocannabinolic acid (THCA), cannabidiolic acid (CBDA), and cannabichromenic acid (CBCA).
- homologue includes homologous sequences from the same and other species and orthologous sequences from the same and other species, without necessarily reference to biological origins. Different polynucleotides or polypeptides having homology may be referred to as homologues.
- compositions and methods herein may further comprise homologues to the polypeptide and polynucleotide sequences described herein.
- orthologous refers to homologous polypeptide sequences and/or polynucleotide sequences in different species that arose from a common ancestral gene during speciation.
- a “homologue” may be a sequence with specified functionality, whether referencing percent identity or percent homology, with a significant sequence identity (e.g., 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% and/or 100%) to the polynucleotide sequences herein.
- sequence identity e.g., 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% and/or 100%
- sequence identity refers to the extent to which two optimally aligned polynucleotide or peptide sequences are invariant throughout a window of alignment of components, e.g., nucleotides or amino acids. “Identity” can be readily calculated by known methods.
- percent sequence identity refers to the percentage of identical nucleotides in a linear polynucleotide sequence of a reference (“query”) polynucleotide molecule (or its complementary strand) as compared to a test (“subject”) polynucleotide molecule (or its complementary strand) when the two sequences are optimally aligned.
- percent identity can refer to the percentage of identical amino acids in an amino acid sequence.
- fatty acid-CoA may refer to compounds useful in polyketide synthesis as primer molecules which react in a condensation reaction with an extender unit (such as malonyl-CoA) to form a polyketide.
- extender unit such as malonyl-CoA
- fatty acid-CoA molecules also referred to herein as primer molecules or CoA donors
- useful in the synthetic routes described herein include but are not limited to: acetyl-CoA, butyryl-CoA, hexanoyl-CoA.
- These fatty acid-CoA molecules may be provided to host cells or may be synthesized by the host cells for biosynthesis of polyketides, as described herein.
- Two nucleotide sequences can be considered to be substantially “complementary” when the two sequences hybridize to each other under stringent conditions. In some examples, two nucleotide sequences considered to be substantially complementary hybridize to each other under highly stringent conditions.
- stringent hybridization conditions and “stringent hybridization wash conditions” in the context of nucleic acid hybridization experiments, for example in Southern hybridizations and Northern hybridizations are sequence dependent, and are different under different environmental parameters.
- highly stringent hybridization and wash conditions are selected to be about 5° C. lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH.
- polynucleotides include polynucleotides or “variants” having at least about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% sequence identity to any of the reference sequences described herein, typically where the variant maintains at least one biological activity of the reference sequence.
- polynucleotide variant and “variant” and the like refer to polynucleotides displaying substantial sequence identity with a reference polynucleotide sequence or polynucleotides that hybridize with a reference sequence under, for example, stringent conditions. These terms may include polynucleotides in which one or more nucleotides have been added or deleted, or replaced with different nucleotides compared to a reference polynucleotide. It will be understood that that certain alterations inclusive of mutations, additions, deletions and substitutions can be made to a reference polynucleotide whereby the altered polynucleotide retains the biological function or activity of the reference polynucleotide.
- polynucleotides described herein may be included within “vectors” and/or “expression cassettes”.
- the nucleotide sequences and/or nucleic acid molecules described herein may be “operably” or “operatively” linked to a variety of promoters for expression in host cells.
- the invention provides transformed host cells and transformed organisms comprising the transformed host cells, wherein the host cells and organisms are transformed with one or more nucleic acid molecules/nucleotide sequences of the invention.
- “operably linked to,” when referring to a first nucleic acid sequence that is operably linked to a second nucleic acid sequence means a situation when the first nucleic acid sequence is placed in a functional relationship with the second nucleic acid sequence.
- a promoter is operably associated with a coding sequence if the promoter effects the transcription or expression of the coding sequence.
- operably linked to when referring to a first polypeptide sequence that is operably linked to a second polypeptide sequence, refers to a situation when the first polypeptide sequence is placed in a functional relationship with the second polypeptide sequence.
- a “promoter,” as used herein, refers to a nucleotide sequence that controls or regulates the transcription of a nucleotide sequence (i.e., a coding sequence) that is operably associated with the promoter.
- a “promoter” refers to a nucleotide sequence that contains a binding site for RNA polymerase II and directs the initiation of transcription.
- promoters are found 5′, or upstream, relative to the start of the coding region of the corresponding coding sequence.
- the promoter region may comprise other elements that act as regulators of gene expression.
- Promoters can include, for example, constitutive, inducible, temporally regulated, developmentally regulated, chemically regulated, tissue-preferred and tissue-specific promoters for use in the preparation of recombinant nucleic acid molecules, i.e., chimeric genes.
- promoter will vary depending on the temporal and spatial requirements for expression, and also depending on the host cell to be transformed. Thus, for example, where expression in response to a stimulus is desired a promoter inducible by stimuli or chemicals can be used. Where continuous expression at a relatively constant level is desired throughout the cells or tissues of an organism a constitutive promoter can be chosen.
- vectors may be used.
- polynucleotide molecules and nucleotide sequences described herein can be used in connection with vectors.
- vector refers to a composition for transferring, delivering or introducing a nucleic acid or polynucleotide into a host cell.
- a vector may comprise a polynucleotide molecule comprising the nucleotide sequence(s) to be transferred, delivered or introduced.
- general classes of vectors include, but are not limited to, a viral vector, a plasmid vector, a phage vector, a phagemid vector, a cosmid, a fosmid, a bacteriophage, or an artificial chromosome. The selection of a vector will depend upon the preferred transformation technique and the target species for transformation.
- expression vectors refers to a nucleic acid molecule comprising a nucleotide sequence of interest, wherein said nucleotide sequence is operatively associated with at least a control sequence (e.g., a promoter).
- control sequence e.g., a promoter
- An expression vector comprising a polynucleotide sequence of interest may be “chimeric”, meaning that at least one of its components is heterologous with respect to at least one of its other components.
- An expression cassette may also be one that is naturally occurring but has been obtained in a recombinant form useful for heterologous expression. In some examples, however, the expression vector is heterologous with respect to the host.
- the particular polynucleotide sequence of the expression vector does not occur naturally in the host cell and must have been introduced into the host cell or an ancestor of the host cell by a transformation event.
- an expression vector may also include other regulatory sequences.
- regulatory sequences means nucleotide sequences located upstream (5′ non-coding sequences), within or downstream (3′ non-coding sequences) of a coding sequence, and which influence the transcription, RNA processing or stability, or translation of the associated coding sequence. Regulatory sequences include, but are not limited to, promoters, enhancers, introns, 5′ and 3′ untranslated regions, translation leader sequences, termination signals, and polyadenylation signal sequences.
- An expression vector may also include a nucleotide sequence for a selectable marker, which can be used to select a transformed host cell.
- selectable marker means a nucleotide sequence that when expressed imparts a distinct phenotype to the host cell expressing the marker and thus allows such transformed host cells to be distinguished from those that do not have the marker.
- Such a nucleotide sequence may encode either a selectable or screenable marker, depending on whether the marker confers a trait that can be selected for by chemical means, such as by using a selective agent (e.g., an antibiotic, a sugar, a carbon source, or the like), or on whether the marker is simply a trait that one can identify through observation or testing, such as by screening. Examples of suitable selectable markers are known in the art and can be used in the expression vectors described herein.
- the vector and/or expression vectors and/or polynucleotides may be introduced in to a cell.
- introducing in the context of a nucleotide sequence of interest (e.g., the nucleic acid molecules/constructs/expression vectors), refers to presenting the nucleotide sequence of interest to cell host in such a manner that the nucleotide sequence gains access to the interior of a cell.
- these nucleotide sequences can be assembled as part of a single polynucleotide or nucleic acid construct, or as separate polynucleotide or nucleic acid constructs, and can be located on the same or different transformation vectors. Accordingly, these polynucleotides may be introduced into host cells in a single transformation event, or in separate transformation events.
- the term “contacting” refers to a process by which, for example, a compound may be delivered to a cell.
- the compound may be administered in a number of ways, including, but not limited to, direct introduction into a cell (i.e., intracellularly) and/or extracellular introduction into a cavity, interstitial space, or into the circulation of the organism.
- transformation or “transfection” as used herein refers to the introduction of a polynucleotide or heterologous nucleic acid into a cell. Transformation of a cell may be stable or transient.
- transient transformation refers to a polynucleotide introduced into the cell and does not integrate into the genome of the cell.
- stably introducing or “stably introduced” in the context of a polynucleotide introduced into a cell is intended to represent that the introduced polynucleotide is stably incorporated into the genome of the cell, and thus the cell is stably transformed with the polynucleotide.
- host cell includes an individual cell or cell culture which can be or has been a recipient of any recombinant vector(s) or isolated polynucleotide of the invention.
- Host cells include progeny of a single host cell, and the progeny may not necessarily be completely identical (in morphology or in total DNA complement) to the original parent cell due to natural, accidental, or deliberate mutation and/or change.
- a host cell includes cells transformed in vivo or in vitro with a recombinant vector or a polynucleotide of the invention.
- a host cell which comprises a recombinant vector of the invention is a recombinant host cell.
- a host cell may be a bacterial cell, a fungal cell, a protist cell, or a plant cell. Specific examples of host cells are described below.
- “Conversion” refers to the enzymatic transformation of a substrate to the corresponding product. “Percent conversion” refers to the percent of the substrate that is converted to the product within a period of time under specified conditions. Thus, for example, the “activity” or “conversion rate” of a ketoreductase polypeptide can be expressed as “percent conversion” of the substrate to the product.
- Hydrophilic Amino Acid or Residue refers to an amino acid or residue having a side chain exhibiting a hydrophobicity of less than zero according to the normalized consensus hydrophobicity scale Eisenberg et al., 1984. Genetically encoded hydrophilic amino acids include L-Thr (T), L-Ser (S), L-His (H), L-Glu (E), L-Asn (N), L-Gln (Q), L-Asp (D), L-Lys (K) and L-Arg (R).
- Acidic Amino Acid or Residue refers to a hydrophilic amino acid or residue having a side chain exhibiting a pKa value of less than about 6 when the amino acid is included in a peptide or polypeptide. Acidic amino acids typically have negatively charged side chains at physiological pH due to loss of a hydrogen ion. Genetically encoded acidic amino acids include L-Glu (E) and L-Asp (D).
- Basic Amino Acid or Residue refers to a hydrophilic amino acid or residue having a side chain exhibiting a pKa value of greater than about 6 when the amino acid is included in a peptide or polypeptide.
- Basic amino acids typically have positively charged side chains at physiological pH due to association with hydronium ion.
- Genetically encoded basic amino acids include L-Arg (R) and L-Lys (K).
- Poly Amino Acid or Residue refers to a hydrophilic amino acid or residue having a side chain that is uncharged at physiological pH, but which has at least one bond in which the pair of electrons shared in common by two atoms is held more closely by one of the atoms.
- Genetically encoded polar amino acids include L-Asn (N), L-Gln (Q), L-Ser (S) and L-Thr (T).
- Hydrophobic Amino Acid or Residue refers to an amino acid or residue having a side chain exhibiting a hydrophobicity of greater than zero according to the normalized consensus hydrophobicity scale (Eisenberg et al., 1984). Genetically encoded hydrophobic amino acids include L-Pro (P), L-Ile (I), L-Phe (F), L-Val (V), L-Leu (L), L-Trp (W), L-Met (M), L-Ala (A) and L-Tyr (Y).
- Aromatic Amino Acid or Residue refers to a hydrophilic or hydrophobic amino acid or residue having a side chain that includes at least one aromatic or heteroaromatic ring.
- Genetically encoded aromatic amino acids include L-Phe (F), L-Tyr (Y) and L-Trp O).
- L His heteroaromatic nitrogen atom
- histidine is classified as a hydrophilic residue.
- Constrained amino acid or residue refers to an amino acid or residue that has a constrained geometry.
- constrained residues include L-Pro (P) and L-His (H).
- Histidine has a constrained geometry because it has a relatively small imidazole ring.
- Proline has a constrained geometry because it also has a five membered ring.
- Non-polar Amino Acid or Residue refers to a hydrophobic amino acid or residue having a side chain that is uncharged at physiological pH and which has bonds in which the pair of electrons shared in common by two atoms is generally held equally by each of the two atoms (i.e., the side chain is not polar).
- Genetically encoded non-polar amino acids include L-Gly (G), L-Leu (L), L-Val (V), L-Ile (I), L-Met (M) and L-Ala (A).
- “Aliphatic Amino Acid or Residue” refers to a hydrophobic amino acid or residue having an aliphatic hydrocarbon side chain. Genetically encoded aliphatic amino acids include L-Ala (A), L-Val (V), L-Leu (L) and L-Ile (I).
- “Small Amino Acid or Residue” refers to an amino acid or residue having a side chain that is composed of a total three or fewer carbon and/or heteroatoms (excluding the ⁇ -carbon and hydrogens).
- the small amino acids or residues may be further categorized as aliphatic, non-polar, polar or acidic small amino acids or residues, in accordance with the above definitions.
- Genetically-encoded small amino acids include L-Ala (A), L-Val (V), L-Cys (C), L-Asn (N), L-Ser (S), L-Thr (T) and L-Asp (D).
- a “conservative” amino acid substitution refers to the substitution of a residue with a residue having a similar side chain, and thus typically involves substitution of the amino acid in the polypeptide with amino acids within the same or similar defined class of amino acids.
- the possible conservative mutations are provided in parentheses: A, L, V, I (Other aliphatic residues: A, L, V, I); A, L, V, I, G, M (Other non-polar residues: A, L, V, I, G, M); D, E (Other acidic residues: D, E); K, R (Other basic residues: K, R); P, H (Other constrained residues: P, H); N, Q, S, T (Other polar residues: N, Q, S, T); Y, W, F (Other aromatic residues: Y, W, F); and C (none).
- Phytocannabinoids are a large class of compounds with over 100 different known structures that are produced in the Cannabis plant. These bio-active molecules, such as tetrahydrocannabinol (THC) and cannabidiol (CBD), can be extracted from plant material for medical and psychotropic purposes.
- THC tetrahydrocannabinol
- CBD cannabidiol
- a fermentable organism such as Saccharomyces cerevisiae capable of producing cannabinoids would provide an economical route to producing these compounds on an industrial scale. The extensive time, energy, and labour involved in growing C.
- sativa for phytocannabinoid production provides a motivation to produce transgenic cell lines for production of phytocannabinoids in yeast.
- One example of such efforts is provided in PCT application by Mookerjee et al WO2018/148848.
- Aromatic prenyltransferase from cannabis are described by Page et al. in U.S. Pat. No. 8,884,100.
- FIG. 1 illustrates a cannabinoid biosynthesis pathway in Cannabis sativa .
- This pathway is described in FIG. 1 and comprises a multi-enzyme system.
- DiPKS from D. discoideum and OAC from C. sativa are used to produce olivetolic acid directly from glucose.
- GPP from the yeast terpenoid pathway and OLA are subsequently converted to cannabigerolic acid catalyzed by using a prenyltransferase.
- C. sativa THCa synthase or CBDa synthase is used to further cyclize cannabigerolic acid to form THCa or CBDa respectively.
- FIG. 2 illustrates a cannabinoid biosynthesis pathway as described in Applicant's co-pending PCT Application No. CA2020/050687 (Bourgeois et al., filed May 21, 2019), which is herein incorporated by reference.
- the first committed step in the cannabinoid biosynthesis pathway is the biosynthesis of olivetolic acid.
- This is done using a polyketide synthase such as DiPKS (Ghosh et al., 2008) from D. discoideum or OAS (Taura et al., 2009) from C. sativa and also requires the use of an olivetolic acid cyclase (OAC) (Gagne et al., 2012).
- OAC olivetolic acid cyclase
- the process begins with the polyketide synthase condensing three units of malonyl-CoA to form a linear tetraketide.
- Olivetolic acid cyclase can then cyclize the tetraketide backbone to form olivetolic acid.
- Enzyme engineering is the process of improving a desired phenotype of the enzyme by making modifications to the amino acid sequence of the polypeptide.
- functionality of the enzyme is dependent on the structure of the enzyme and the structure of the enzyme is dependent, partially, on the primary amino acid sequence; modification of the amino acid sequence of the enzyme could lead to a beneficial impact on the desired phenotype.
- This principle was applied to olivetolic acid cyclase (OAC) and modifications were made to its amino acid sequence using a directed evolution approach. This allowed for the identification of amino acid residues that improved olivetolic acid production in a strain of recombinant S. cerevisiae . Beneficial mutations were then tested in conjunction to identify combinations of mutations that improve enzyme performance.
- Improvements to one or more enzyme properties as exhibited in the engineered OACs may include increases in enzyme activity, improved enzyme kinetics and turnover, higher tolerance to increased levels of substrate, and improved tolerance to increased product levels.
- modifications of the amino acid residues, as compared to the wild type OAC sequence may be conservative modifications or non-conservative modifications. Insertions or deletions may be used to modify the residues, relative to wild type OAC. Note that in the OAC described herein, the protein may end at position 101 instead of 102, as in other reports of wild type OAC sequences.
- the residues represented as X ⁇ # ⁇ may be modified, where ⁇ # ⁇ represents the sequence position in the amino acid position of the wild type OAC sequence referenced herein as (SEQ ID NO:91).
- SEQ ID NO:92 comprises the option of mutations at X ⁇ 28 ⁇ , X ⁇ 31 ⁇ , X ⁇ 41 ⁇ , X ⁇ 43 ⁇ , X ⁇ 44 ⁇ , X ⁇ 68 ⁇ , X ⁇ 74 ⁇ , X ⁇ 84 ⁇ , X ⁇ 100 ⁇ , and X ⁇ 102 ⁇ , as outlined below:
- SEQ ID NO:91 represents wild type OAC protein:
- SEQ ID NO:92 represents the generalized variant OAC protein, wherein X
- Vector VB40 was used to construct all expression plasmids encoding enzyme proteins disclosed herein, including OAC and variants.
- the OAC variants were constructed in a combinatorial library using mutations that were initially selected in a site-saturation mutagenesis library screen. Plasmid VB40_OAC was used as the template in all library construction.
- Site-saturation mutagenesis was conducted at each amino acid position by a PCR reaction using a forward degenerate NNK primer and a ‘back-to-back’ reverse non-mutagenic primer ( FIG. 3 ).
- the PCR products were then processed through in vitro kinase-ligase-Dpnl reactions and transformed into Escherichia coli DH5alpha strain for amplification.
- FIG. 3 illustrates PCR primers used in site-saturation mutagenesis protocol.
- Right-facing arrows represents forward degenerate NNK primer, symbol * denotes the mutational position, and the left-facing arrows represent reverse primer designed ‘back-to-back’ in the opposite direction of the forward primer.
- the combinatorial library was constructed by an in-house protocol. Selected mutations were combined through an overlap-extension PCR of a batch of mutagenic oligonucleotides that were generated using targeted mutagenic primers. ( FIG. 4 ). Double-stranded DNA of the assembled combinatorial mutant variants were cloned into a vector with complementary overlapping sequences, which resulted in a pool of OAC combinatorial variants. FIG. 4 shows an overlap-extension assembly of mutagenic oligonucleotides for combinatorial library construction. The symbol x represents a point mutation.
- the plasmids encoding OAC and variant proteins as disclosed herein were transformed and expressed in Saccharomyces cerevisiae , with the host strain HB1416. All DNA was transformed into background strains using the Gietz et al. transformation protocol (Gietz 2014).
- Strains were grown in yeast synthetic complete media with a composition of 1.7 g/L YNB without ammonium sulfate, 1.92 g/L URA dropout amino acid supplement, 1.5 g/L magnesium L-glutamate, with 2% w/v galactose, 2% w/v raffinose, 200 ⁇ g/l geneticin, and 200 ⁇ g/L ampicillin (Sigma-Aldrich Canada). The culture was incubated at 30° C. for four days (96 hours). Strain HB1891 and HB1892 were respectively used as wild type control and negative control in all of the screenings.
- Each variant was tested in three replicates and each replicate was clonally derived from single colonies. All strains were grown in 500 ⁇ l of media for 96 hours in 96-well deepwell plates. The 96-well deepwell plates were incubated at 30° C. and shaken at 950 rpm for 96 hrs.
- Metabolite extraction was performed by adding 30 ⁇ l of culture to 270 ⁇ l of 56% acetonitrile in a new 96-well microtiter plate. The solutions were mixed thoroughly, then centrifuged at 3750 rpm for 10 mins. 200 ⁇ l of the soluble layer was removed and stored in a 96-well v-bottom microtiter plate. Samples were stored at ⁇ 20° C. until analysis.
- DiPKSG1516R Produces Olivetol from malonyl- Prm9t::Wu3down coA 5 DiPKS-4 81 Wu site 6 Type 1 FAS fused to Type 3 Wu6up::Gal1p: integration PKS from D. discoideum .
- DiPKSG1516R Produces Olivetol from malonyl- Prm9t::Wu6down COA 6 DiPKS-5 82 Wu site 18 Type 1 FAS fused to Type 3 Wu18up::Gal1p: integration PKS from D. discoideum .
- DiPKSG1516R Produces Olivetol from malonyl- Prm9t::Wu18down COA 7 PDH 83 Flagfeldt Acetaldehyde dehydrogenase 19Up::Tdh3p: Site 19 (ALD6) from S. cerevisiae and Ald6:Adh1::Tef1p: integration acetoacetyl COA synthase seACS1 L641P : (AscL641P) from Salmonella Prm9t::19Down enterica . Will allow greater accumulation of acetyl-coA in the cell (Shiba et al., 2007).
- Maf1 84 Flagfeldt Maf1 is a regulator of tRNA Site5Up::Tef1p: Site 5 biosynthesis. Overexpression in Maf1:Prm9t: integration S. cerevisiae has demonstrated Site5Down higher monoterpene (GPP) yields (Liu et al., 2013).
- 9 Erg20K197E 85 Chromosomal Mutant of Erg20 protein that Tpi1t:ERG20K197E: modification diminishes FPP synthase Cyc1t::Tef1p: activity creating greater pool of KanMX:Tef1t GPP precursor. Negatively affects growth phenotype (Oswald et al., 2007).
- Erg1p 86 Flagfeldt Sterol responsive promoter Site18Up::Erg1p: UB14- Site 18 controlling Erg20 protein UB14deg:ERG20: Erg20: integration activity. Allows for regular FPP Adh1t:Site18down deg synthase activity and uninhibited growth phenotype until accumulation of sterols which leads to a suppression of expression of enzyme (Peng et al., 2018).
- Pgk1 ACC1 S659A, S1157A modification cerevisiae acetyl-coA ACC1 S659A, S1157A : carboxylase that removes post- Acc1t translational modification based down-regulation. Leads to greater malonyl-coA pools.
- the promoter of Acc1 was also changed to a constitutive promoter for higher expression (Shi et al., 2014).
- An OAC mutants library was constructed in a plasmid regulated by the Gall p promoter, and expressed in an olivetol-producing background strain (HB1416) harboring downstream enzymes of the cannabinoid production pathway.
- the strains expressing wild type OAC (HB1891) and mScarlet fluorescent protein (HB1892) were utilized as control in the screening to facilitate identification of OAC mutant hits with improved activity.
- FIG. 5 shows olivetolic acid production by engineered OACs strains. The measured values of each cannabinoid are shown in FIG. 5 .
- the mutants tested had the silent mutation G43G(GGG) present, which is an artifact of the plasmid construction process and not relevant to increased enzyme performance.
- Table 7 shows production of olivetol, olivetolic acid and downstream cannabinoids in OAC wild type and engineered OACs strains.
- Table 8 illustrates the ratio of OVLa or downstream metabolites (CBGa, CBDa, THCa) to OVL in OAC variants.
- Table 9 provides a summary of mutational frequency data.
- Phytocannabinoids such as tetrahydrocannabinol (THC) and cannabidiol (CBD), can be extracted from plant material for medical and psychotropic purposes.
- THC tetrahydrocannabinol
- CBD cannabidiol
- An organism capable of fermentation, such as Saccharomyces cerevisiae that is capable of producing cannabinoids would provide an economical route to producing these compounds on an industrial scale.
- the early stages of the cannabinoid pathway proceeds via the generation of olivetolic acid by the type III PKS olivetolic acid synthase (OAS) and cyclase olivetolic acid cyclase (OAC).
- This reaction uses a hexanoyl-CoA starter as well as three units of malonyl-CoA.
- Olivetolic acid is the backbone of most classical cannabinoids and can be prenylated to form CBGA, which is ultimately converted to CBDA or THCA by an oxidocyclase.
- Production of olivetolic acid in S. cerevisiae is challenging as OAS generates significant by-products such as HTAL, PDAL and olivetol. These by-products can be reduced in a recombinant organism by the introduction of olivetolic acid cyclase (OAC) but even with this enzyme by-products can account for up to 80% of the total carbon in the reaction.
- Table 10 lists specific examples of host cell organisms in which the described OAC variants may be utilized for preparation of cannabinoids in the described pathways.
- Organisms Bac- Escherichia coli , Streptomyces coelicolor and other species., teria Bacillus subtilis , Mycoplasma genitalium , Synechocytis , Zymomonas mobilis , Corynebacterium glutamicum , Synechococcus sp., Salmonella typhi , Shigella flexneri , Shigella sonnei , and Shigella disenteriae , Pseudomonas putida , Pseudomonas aeruginosa , Pseudomonas mevalonii , Rhodobacter sphaeroides , Rhodobacter capsulatus , Rhodospirillum rubrum , Rhodococcus sp.
- Phytocannabinoids may be produced in a host cell involving Dictyostelium discoideum polyketide synthase (DiPKS), olivetolic acid cyclase (OAC), prenyltransferases, and/or mutants of these, as described in Applicant's co-pending International Application No. PCT/CA2020/050687 (herein incorporated by reference).
- DIPKS Dictyostelium discoideum polyketide synthase
- OAC olivetolic acid cyclase
- prenyltransferases and/or mutants of these, as described in Applicant's co-pending International Application No. PCT/CA2020/050687 (herein incorporated by reference).
- a host cell transformed with a polyketide synthase coding sequence, an olivetolic acid cyclase coding sequence, and a prenyltransferase coding sequence may be prepared.
- the polyketide synthase and the olivetolic acid cyclase catalyze synthesis of olivetolic acid from malonyl CoA.
- the olivetolic acid cyclase may include wild type, or any of the functional mutants described herein.
- the host cell may include a yeast cell, a bacterial cell, a protest cell or a plant cell, selected from among those listed in Table 10.
- Combinations of the methods, nucleotides, and expression vectors described herein as well as in Applicant's co-pending International Application No. PCT/CA2020/050687 may be employed together to produce phytocannabinoids, phytocannabinoid precursors such as polyketides.
- selections of characteristics of the cells and methods employed may be selected to achieve production of the cannabinoid, cannabinoid precursor, or intermediate of interest.
- Methods of producing a phytocannabinoid may comprising culturing a host cell under suitable culture conditions to form a phytocannabinoid, said host cell comprising: a polynucleotide encoding a polyketide synthase (PKS) enzyme; a polynucleotide encoding an olivetolic acid cyclase (OAC) enzyme mutants as described herein; and a polynucleotide encoding a prenyltransferase (PT) enzyme; and optionally comprising: a polynucleotide encoding an acyl-CoA synthase (Alk) enzyme; a polynucleotide encoding a fatty acyl CoA activating (CsAAE) enzyme; and/or a polynucleotide encoding a THCa synthase (OXC) enzyme.
- PKS polyketide synthase
- OAC olivetolic acid cycla
- An expression vector can be prepared comprising a polynucleotide encoding a polyketide synthase (PKS) enzyme; a polynucleotide encoding an olivetolic acid cyclase (OAC) enzyme mutants as described herein; and a polynucleotide encoding a prenyltransferase (PT) enzyme.
- the expression vector can optionally comprise a polynucleotide encoding an acyl-CoA synthase (Alk) enzyme; a polynucleotide encoding CsAAE1; and/or a polynucleotide encoding a THCa synthase (OXC) enzyme.
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Health & Medical Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Genetics & Genomics (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Enzymes And Modification Thereof (AREA)
- Pharmaceuticals Containing Other Organic And Inorganic Compounds (AREA)
Abstract
The present disclosure relates generally to methods, isolated polypeptides and polynucleotides, expression vectors, and host cells for the production of olivetolic acid and phytocannabinoids. A method of producing olivetolic acid (OVLa) and/or a phytocannabinoid in a heterologous host cell having OVLa-producing or phytocannabinoid-producing capacity comprises transforming the host cell with a nucleotide encoding a variant olivetolic acid cyclase (OAC) protein having at least 6 amino acid mutations relative to the wild type OAC protein, and culturing the transformed host cell to produce OVLa and/or phytocannabinoids therefrom. The variant OAC protein (SEQ ID NO:92) has at least 85% sequence identity with the wild type OAC protein (SEQ ID NO:91). Exemplary variants having improved OVLa or phytocannabinoid production capacity are described.
Description
- This application claims the benefit of and priority to U.S. patent application Ser. No. 16/953,638 filed Nov. 20, 2020, the entirety of which is hereby incorporated by reference.
- The present disclosure relates generally to proteins having olivetolic acid cyclase activity, useful in production of phytocannabinoids.
- Phytocannabinoids are a large class of compounds with over 100 different known structures that are produced in the Cannabis sativa plant. Phytocannabinoids are known to be biosynthesized in C. sativa, or may result from thermal or other decomposition from phytocannabinoids biosynthesized in C. sativa. These bio-active molecules, such as tetrahydrocannabinol (THC) and cannabidiol (CBD), can be extracted from plant material for medical and recreational purposes. However, the synthesis of plant material is costly, not readily scalable to large volumes, and requires lengthy growing periods to produce sufficient quantities of phytocannabinoids. While the C. sativa plant is also a valuable source of grain, fiber, and other material, growing C. sativa for phytocannabinoid production, particularly indoors, is costly in terms of energy and labour. Subsequent extraction, purification, and fractionation of phytocannabinoids from the C. sativa plant is also labour and energy intensive.
- Phytocannabinoids are pharmacologically active molecules that contribute to the medical and psychotropic effects of C. sativa. Biosynthesis of phytocannabinoids in the C. sativa plant scales similarly to other agricultural projects. As with other agricultural projects, large scale production of phytocannabinoids by growing C. sativa requires a variety of inputs (e.g. nutrients, light, pest control, CO, etc.). The inputs required for cultivating C. sativa must be provided. In addition, cultivation of C. sativa, where allowed, is currently subject to heavy regulation, taxation, and rigorous quality control where products prepared from the plant are for commercial use, further increasing costs.
- Phytocannabinoid analogues are pharmacologically active molecules that are structurally similar to phytocannabinoids. Phytocannabinoid analogues are often synthesized chemically, which can be labour intensive and costly. As a result, it may be economical to produce the phytocannabinoids and phytocannabinoid analogues in a robust and scalable, fermentable organism. Saccharomyces cerevisiae is an example of a fermentable organism that has been used to produce industrial scales of similar molecules.
- The extensive time, energy, and labour involved in growing C. sativa for production of naturally-occurring phytocannabinoids provides a motivation to produce transgenic cell lines for production of phytocannabinoids by other means. Polyketides, including olivetolic acid and its analogues are valuable precursors to phytocannabinoids.
- It is desirable to find alternative enzymes and methods for the production of phytocannabinoids, and/or for the production of compounds useful in phytocannabinoid biosynthesis as intermediate or precursor compounds.
- Olivetolic Acid Cyclase (OAC) variants are described herein which are capable of producing olivetolic acid (OVLa). These variants are useful in the production of olivetolic acid and relevant phytocannabinoids in a heterologous host. Methods of production are described. The described OAC variants that can produce olivetolic acid and downstream metabolites in a modified yeast cell can be applied to any host and used in phytocannabinoid production.
- In certain aspects described, OAC variants comprise 6 or greater non-conservative substitution amino acid mutations relative to the wild type enzyme. Certain OAC variants described have improved activity and/or show improved ratios of olivetol to olivetolic acid compared to the wild type enzyme.
- A method of producing OVLa or a phytocannabinoid derived therefrom in a heterologous host cell having OVLa-producing or phytocannabinoid-producing capacity is described. The method comprises: transforming the host cell with a nucleotide encoding a variant olivetolic acid cyclase (OAC) protein having at least 6 amino acid mutations relative to the wild type OAC protein, and culturing said transformed host cell to produce olivetolic acid and/or phytocannabinoids therefrom, wherein the variant OAC protein comprises at least 85%, at least 90%, at least 95%, or at least 99% sequence identity with the wild type OAC protein sequence according to SEQ ID NO:91.
- An isolated polypeptide having olivetolic acid cyclase activity is described, comprising an amino acid sequence of at least 85%, at least 90%, at least 95%, or at least 99% sequence identity with SEQ ID NO: 92, wherein 6 or more amino acid residues comprise mutations relative to SEQ ID NO:91, located at 6 or more of
residues 28, 31, 41, 43, 44, 68, 74, 84, 100 or 102 of SEQ ID NO:91. - An isolated polynucleotide is described, comprising: (a) a nucleotide sequence according to SEQ ID NO:3-SEQ ID NO:39; (b) a nucleotide sequence having at least 85%, at least 90%, at least 95%, or at least 99% identity with the nucleotide sequence of (a), or (c) a nucleotide sequence that hybridizes with the complementary strand of the nucleotide having the sequence of (a). Expression vectors comprising the polynucleotide, and host cells transformed with such expression vectors are described.
- Other aspects and features of the present disclosure will become apparent to those ordinarily skilled in the art upon review of the following description of specific embodiments in conjunction with the accompanying figures.
- Embodiments of the present disclosure will now be described, by way of example only, with reference to the attached Figures.
-
FIG. 1 illustrates a cannabinoid biosynthesis pathway in Cannabis sativa. -
FIG. 2 illustrates a cannabinoid biosynthesis pathway as described in Applicant's co-pending International Application No. PCT/CA2020/050687. -
FIG. 3 illustrates PCR primers used in site-saturation mutagenesis protocol. -
FIG. 4 shows an overlap-extension approach that was used to assemble mutagenic oligonucleotides for combinatorial library construction. The symbol x represents a point mutation. -
FIG. 5 shows olivetolic acid production with mutant OAC variants. - A method of producing olivetolic acid (OVLa) or a phytocannabinoid produced therefrom is described herein. A heterologous host cell comprising OVLa-producing or phytocannabinoid-producing capacity is transformed with a nucleotide encoding a variant olivetolic acid cyclase (OAC) protein having at least 6 amino acid mutations relative to the wild type OAC protein and culturing said transformed host cell to produce olivetolic acid and/or phytocannabinoids therefrom, wherein the variant OAC protein comprises at least 85%, at least 90%, at least 95%, or at least 99% sequence identity with the wild type OAC protein sequence according to SEQ ID NO: 91. Exemplary variant proteins may result in improved OVLa or phytocannabinoid production over wild type according to the method described.
- In some embodiments, at least 4 of the at least 6 amino acid mutations of the OAC protein are in
residues 28, 31, 41, 43, 44, 68, 74, 84, 100 or 102 of the wild type OAC protein (SEQ ID NO:91), with other mutations being located elsewhere in the sequence. When a mutation is present atresidue 28, 31, 41, 43, 44, 68, 74, 84, 100 or 102, it can be either a conservative or non-conservative amino acid substitution, but may advantageously be a non-conservative amino acid substitution. While at least 4 of the 6 amino acid mutations are present in the specified residue locations, in exemplary embodiments, more than 4 may be present in the specified residues, such as 6, 7, 8, 9 or 10 of the amino acid mutations may be found atpositions 28, 31, 41, 43, 44, 68, 74, 84, 100 or 102, relative to the wild type sequence. In certain embodiments, mutations other than those located atresidue 28, 31, 41, 43, 44, 68, 74, 84, 100 or 102 may be limited to conservative amino acid substitutions, such that the variant OAC protein remains within at least 85%, at least 90%, at least 95%, or at least 99% sequence identity with the wild type OAC protein. - The method may encompass transformation of the host cell with a nucleotide encoding the variant olivetolic acid cyclase (OAC) protein, which nucleotide has a sequence comprising: (a) a nucleotide sequence according to SEQ ID NO:3-SEQ ID NO:39; (b) a nucleotide sequence having at least 85%, at least 90%, at least 95%, or at least 99% identity with the sequence of (a); or (c) a nucleotide sequence that hybridizes with the complementary strand of the nucleotide having the sequence of (a). For example, the variant OAC protein may comprise a according to any one of SEQ ID NO:40 to SEQ ID NO:76.
- In certain embodiments, at least 4 of the at least 6 amino acid mutations relative to the wild type OAC protein are selected from the group consisting of: V28A; V31G; Y41T, Y41S or Y41V; K44V; T68L or T68R; 174E, 174R, 174D or 174G; V84R; R100M or R100E; and G102R, G102S, or G102STOP. For example, in some embodiments at least 10, at least 9, at least 8, at least 7, at least 6, or at least 5 of these amino acid mutations relative to wild type may be present.
- In the method, the production of a phytocannabinoid by the transformed host cell may involve production of phytocannabinoids including but not limited to cannabigerol (CBG), cannabigerolic acid (CBGa), cannabigerovarin (CBGV), cannabigerovarinic acid (CBGVa), cannabigerocin (CBGO), cannabigerocinic acid (CBGOa), a cannabivarin, tetrahydrocannabinol (THC), or tetrahydrocannabinolic acid (THCa). Further, using the OAC variants described, in combination with a divarinic acid synthase within the host cell, the host cell may produce a cannabivarin, such as divarinic acid.
- The host cell may be transformed with a nucleotide encoding variant olivetolic acid cyclase (OAC) protein with at least 85%, at least 90%, at least 95%, at least 99%, or 100% sequence identity of any one of the following sequences with the indicated substitutions from OAC wild type (SEQ ID NO:91) being present:
-
- V28A/V31G/Y41S/G4351LENT(=GGG)/K44V/T68L/174R/V84R/R100E/G102R (SEQ ID N0:42),
- V28A/Y41T/G43SILENT(=GGG)/T68L/174EA/84R/R100M/G102R (SEQ ID NO:40),
- Y41S/G43SILENT(=GGG)/K44V/T68R/174R/V84R (SEQ ID NO:41),
- V28A/Y41T/G43SILENT(=GGG)/T68L/174G/V84R/R100E (SEQ ID NO:43),
- V28A/Y41T/G43SILENT(=GGG)/K44V/T68L/174D/V84R/R100M/G102R (SEQ ID NO:44),
- V28A/Y41T/G43SILENT(=GGG)/T68L/174D/V84R/G102R (SEQ ID NO:45),
- V28A/Y41T/G43SILENT(=GGG)/K44V/T68L/174R/V84R/R100E/G102R (SEQ ID N0:46),
- Y41T/G43SILENT(=GGG)/T68R/174R/V84R/R100M/G102STOP (SEQ ID NO:47),
- V28A/Y41V/G43SILENT(=GGG)/K44V/T68L/174G/V84R/R100E/G102R (SEQ ID N0:48),
- V28A/Y41T/G43SILENT(=GGG)/K44V/T68R/174G/V84R/G102STOP (SEQ ID NO:49),
- V28A/Y41T/G43SILENT(=GGG)/K44V/T68R/174G/V84R/G102STOP (SEQ ID NO:50),
- V28A/V31G/Y41T/G43SILENT(=GGG)/K44V/T68R/174E/V84R/R100E (SEQ ID NO:51),
- V28A/Y41S/G43SILENT(=GGG)/T68R/174R/V84R/R100M/G102STOP (SEQ ID NO:52),
- Y41T/G43SILENT(=GGG)/K44V/T68L/174G/V84R/G102R (SEQ ID NO:53),
- V28A/Y41T/G43SILENT(=GGG)/K44V/T68R/174R/V84R/R100E/G102R (SEQ ID NO:54),
- V28A/V31G/Y41S/G43SILENT(=GGG)/K44V/T68R/174RA/84R (SEQ ID NO:55),
- V28A/G43SILENT(=GGG)/K44V/174DA/84R/R100E/G102R(=CGC) (SEQ ID NO:56),
- V28A/Y41V/G43SILENT(=GGG)/K44V/T68LJ174G/V84R/G102R (SEQ ID NO:57),
- Y41T/G43SILENT(=GGG)/T68L/174G/V84R/R100M/G102R (SEQ ID NO:58),
- V28A/V31G/Y41T/G43SILENT(=GGG)/K44V/T68R/174RA/84R/R100E/G102R (SEQ ID NO:59),
- V31G/Y41S/G43SILENT(=GGG)/K44V/T68RA/84R/R100E (SEQ ID NO:60),
- V28A/Y41V/G43SILENT(=GGG)/T68L/174D/R100E/G102STOP (SEQ ID NO:61)
- V28A/Y41V/G43SILENT(=GGG)/T68R/174G/V84R/R100M/G102R (SEQ ID NO:62),
- V31G/G43SILENT(=GGG)/174G/V84R/R100E (SEQ ID NO:63),
- V28A/Y41S/G43SILENT(=GGG)/K44V/T68R/174R/V84R/R100M/G102R (SEQ ID NO:64),
- V28A/V31G/Y41V/G43SILENT(=GGG)/K44V/T68L/174G/V84R (SEQ ID NO:65),
- V28A/Y41V/G43SILENT(=GGG)/K44V/T68L/174G/V84R/R100M/G102R (SEQ ID NO:66),
- V28A/V31G/G43SILENT(=GGG)/T68L/174RA/84R/R100E/G102R (SEQ ID NO:67),
- V31G/Y41V/G43SILENT(=GGG)/K44V/T68L/174RA/84R/R100E/G102STOP (SEQ ID NO:68),
- V31G/Y41T/G43SILENT(=GGG)/K44V/T68R/174D/V84R/G102R (SEQ ID NO:69),
- V31G/Y41T/G43SILENT(=GGG)/K44V/T68R/174D/V84R/R100E/G102R (SEQ ID NO:70),
- V28A/Y41S/G43SILENT(=GGG)/K44V/T68R/174GA/84R/R100M/G102R (SEQ ID NO:71),
- V28A/Y41V/G43SILENT(=GGG)/K44V/174R/R100E/G102STOP (SEQ ID NO:72),
- V28A/V31G/Y41T/G43SILENT(=GGG)/K44V/174E/V84R/R100M/G102R (SEQ ID NO:73),
- V28A/G43SILENT(=GGG)/K44V/T68R/174E/V84R/R100E/G102STOP (SEQ ID NO:74),
- V31G/Y41T/G43SILENT(=GGG)/K44V/T68L/174D/V84R/G102R (SEQ ID NO:75), or
- Y41V/G43SILENT(=GGG)/K44V/T68L/174R/V84R/G102R (SEQ ID NO:76).
- The host cell transformed in the method described may be a bacterial cell, a fungal cell, a protist cell, or a plant cell. Exemplary organisms include S. cerevisiae, E. coli, Yarrowia lipolytica, or Komagataella phaffii, as well as others described herein. The transformed host cell may additionally comprise, or be transformed with, other enzymes useful in phytocannabinoid production. For example, a polynucleotide encoding a polyketide synthase enzyme and/or a polynucleotide encoding a prenyltransferase enzyme may also be included in the host cell. Further options for polynucleotides and methods, such as described in Applicant's co-pending International Application No. PCT/CA2020/050687 (hereby incorporated by reference) are envisioned.
- An isolated polypeptide is described herein, which has olivetolic acid cyclase (OAC) activity. The polypeptide activity comprises an amino acid sequence according to SEQ ID NO: 92, wherein 6 or more amino acid residues comprise mutations relative to SEQ ID NO: 91 (wild type OAC) which are mutations at
residues 28, 31, 41, 43, 44, 68, 74, 84, 100 or 102 of SEQ ID NO:91. The isolated polypeptide may have an amino acid sequence according to one of SEQ ID NO:40 to SEQ ID NO:76. - An isolated polynucleotide is described, which may have (a) a nucleotide sequence according to SEQ ID NO:3-SEQ ID NO:39; (b) a nucleotide sequence having at least 85% identity with the nucleotide sequence of (a), or (c) a nucleotide sequence that hybridizes with the complementary strand of the nucleotide having the sequence of (a).
- An expression vector is described, comprising a polynucleotide encoding a variant olivetolic acid cyclase (OAC) protein having the sequence of SEQ ID NO: 92, in which 6 or more amino acid mutations are present relative to the wild type OAC protein. In such an expression vector, the polynucleotide encoding the variant OAC protein may have at least 85% sequence identity with any one of SEQ ID NO:3 to SEQ ID NO:39.
- A host cell transformed with the above-described expression vector is also encompassed herein. Such a host cell may additionally comprise a polynucleotide encoding other enzymes useful in synthesis of olivetolic acid and/or phytocannabinoids, such as encoding a polyketide synthase enzyme and/or a prenyltransferase enzyme. Such a host cell may be a bacterial cell, a fungal cell, a protist cell, or a plant cell, for example: S. cerevisiae, E. coli, Yarrowia lipolytica, or Komagataella phaffii.
- Certain terms used herein are described below.
- The term “cannabinoid” as used herein refers to a chemical compound that shows direct or indirect activity at a cannabinoid receptor. Non limiting examples of cannabinoids include tetrahydrocannabinol (THC), cannabidiol (CBD), cannabinol (CBN), cannabigerol (CBG), cannabichromene (CBC), cannabicyclol (CBL), cannabivarin (CBV), tetrahydrocannabivarin (THCV), cannabidivarin (CBDV), cannabichromevarin (CBCV), cannabigerovarin (CBGV), and cannabigerol monomethyl ether (CBGM).
- The term “phytocannabinoid” as used herein refers to a cannabinoid that typically occurs in a plant species. Exemplary phytocannabinoids produced according to the invention include cannabigerol (CBG), cannabigerolic acid (CBGa), cannabigerovarin (CBGV), cannabigerovarinic acid (CBGVa), cannabigerocin (CBGo), or cannabigerocinic acid (CBGoa).
- Cannabinoids and phytocannabinoids may contain or may lack one or more carboxylic acid functional groups. Non limiting examples of such cannabinoids or phytocannabinoids containing carboxylic acid function groups or phytocannabinoids include tetrahydrocannabinolic acid (THCA), cannabidiolic acid (CBDA), and cannabichromenic acid (CBCA).
- The term “homologue” includes homologous sequences from the same and other species and orthologous sequences from the same and other species, without necessarily reference to biological origins. Different polynucleotides or polypeptides having homology may be referred to as homologues.
- The term “homology” may refer to the level of similarity between two or more polynucleotide and/or polypeptide sequences in terms of percent of positional identity (i.e., sequence similarity or identity), without necessarily pertaining to genetic origins. Homology also refers to the concept of similar functional properties among different polynucleotide or polypeptides. Thus, the compositions and methods herein may further comprise homologues to the polypeptide and polynucleotide sequences described herein.
- The term “orthologous,” as used herein, refers to homologous polypeptide sequences and/or polynucleotide sequences in different species that arose from a common ancestral gene during speciation.
- As used herein, a “homologue” may be a sequence with specified functionality, whether referencing percent identity or percent homology, with a significant sequence identity (e.g., 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% and/or 100%) to the polynucleotide sequences herein.
- As used herein “sequence identity” refers to the extent to which two optimally aligned polynucleotide or peptide sequences are invariant throughout a window of alignment of components, e.g., nucleotides or amino acids. “Identity” can be readily calculated by known methods.
- As used herein, the term “percent sequence identity” or “percent identity” refers to the percentage of identical nucleotides in a linear polynucleotide sequence of a reference (“query”) polynucleotide molecule (or its complementary strand) as compared to a test (“subject”) polynucleotide molecule (or its complementary strand) when the two sequences are optimally aligned. In some embodiments, “percent identity” can refer to the percentage of identical amino acids in an amino acid sequence.
- The terms “fatty acid-CoA”, “fatty acyl-CoA”, or “CoA donors” as used herein may refer to compounds useful in polyketide synthesis as primer molecules which react in a condensation reaction with an extender unit (such as malonyl-CoA) to form a polyketide. Examples of fatty acid-CoA molecules (also referred to herein as primer molecules or CoA donors), useful in the synthetic routes described herein include but are not limited to: acetyl-CoA, butyryl-CoA, hexanoyl-CoA. These fatty acid-CoA molecules may be provided to host cells or may be synthesized by the host cells for biosynthesis of polyketides, as described herein.
- Two nucleotide sequences can be considered to be substantially “complementary” when the two sequences hybridize to each other under stringent conditions. In some examples, two nucleotide sequences considered to be substantially complementary hybridize to each other under highly stringent conditions.
- The terms “stringent hybridization conditions” and “stringent hybridization wash conditions” in the context of nucleic acid hybridization experiments, for example in Southern hybridizations and Northern hybridizations are sequence dependent, and are different under different environmental parameters. In some examples, generally, highly stringent hybridization and wash conditions are selected to be about 5° C. lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH.
- In some examples, polynucleotides include polynucleotides or “variants” having at least about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% sequence identity to any of the reference sequences described herein, typically where the variant maintains at least one biological activity of the reference sequence.
- As used herein, the terms “polynucleotide variant” and “variant” and the like refer to polynucleotides displaying substantial sequence identity with a reference polynucleotide sequence or polynucleotides that hybridize with a reference sequence under, for example, stringent conditions. These terms may include polynucleotides in which one or more nucleotides have been added or deleted, or replaced with different nucleotides compared to a reference polynucleotide. It will be understood that that certain alterations inclusive of mutations, additions, deletions and substitutions can be made to a reference polynucleotide whereby the altered polynucleotide retains the biological function or activity of the reference polynucleotide.
- In some examples, the polynucleotides described herein may be included within “vectors” and/or “expression cassettes”.
- In some embodiments, the nucleotide sequences and/or nucleic acid molecules described herein may be “operably” or “operatively” linked to a variety of promoters for expression in host cells. Thus, in some examples, the invention provides transformed host cells and transformed organisms comprising the transformed host cells, wherein the host cells and organisms are transformed with one or more nucleic acid molecules/nucleotide sequences of the invention. As used herein, “operably linked to,” when referring to a first nucleic acid sequence that is operably linked to a second nucleic acid sequence, means a situation when the first nucleic acid sequence is placed in a functional relationship with the second nucleic acid sequence. For instance, a promoter is operably associated with a coding sequence if the promoter effects the transcription or expression of the coding sequence.
- In the context of a polypeptide, “operably linked to,” when referring to a first polypeptide sequence that is operably linked to a second polypeptide sequence, refers to a situation when the first polypeptide sequence is placed in a functional relationship with the second polypeptide sequence.
- The term a “promoter,” as used herein, refers to a nucleotide sequence that controls or regulates the transcription of a nucleotide sequence (i.e., a coding sequence) that is operably associated with the promoter. Typically, a “promoter” refers to a nucleotide sequence that contains a binding site for RNA polymerase II and directs the initiation of transcription. In general, promoters are found 5′, or upstream, relative to the start of the coding region of the corresponding coding sequence. The promoter region may comprise other elements that act as regulators of gene expression.
- Promoters can include, for example, constitutive, inducible, temporally regulated, developmentally regulated, chemically regulated, tissue-preferred and tissue-specific promoters for use in the preparation of recombinant nucleic acid molecules, i.e., chimeric genes.
- The choice of promoter will vary depending on the temporal and spatial requirements for expression, and also depending on the host cell to be transformed. Thus, for example, where expression in response to a stimulus is desired a promoter inducible by stimuli or chemicals can be used. Where continuous expression at a relatively constant level is desired throughout the cells or tissues of an organism a constitutive promoter can be chosen.
- In some examples, vectors may be used.
- In some examples, the polynucleotide molecules and nucleotide sequences described herein can be used in connection with vectors.
- The term “vector” refers to a composition for transferring, delivering or introducing a nucleic acid or polynucleotide into a host cell. A vector may comprise a polynucleotide molecule comprising the nucleotide sequence(s) to be transferred, delivered or introduced. Non-limiting examples of general classes of vectors include, but are not limited to, a viral vector, a plasmid vector, a phage vector, a phagemid vector, a cosmid, a fosmid, a bacteriophage, or an artificial chromosome. The selection of a vector will depend upon the preferred transformation technique and the target species for transformation.
- As used herein, “expression vectors” refers to a nucleic acid molecule comprising a nucleotide sequence of interest, wherein said nucleotide sequence is operatively associated with at least a control sequence (e.g., a promoter). Thus, some examples provide expression vectors designed to express the polynucleotide sequences of described herein.
- An expression vector comprising a polynucleotide sequence of interest may be “chimeric”, meaning that at least one of its components is heterologous with respect to at least one of its other components. An expression cassette may also be one that is naturally occurring but has been obtained in a recombinant form useful for heterologous expression. In some examples, however, the expression vector is heterologous with respect to the host. For example, the particular polynucleotide sequence of the expression vector does not occur naturally in the host cell and must have been introduced into the host cell or an ancestor of the host cell by a transformation event.
- In some examples, an expression vector may also include other regulatory sequences. As used herein, “regulatory sequences” means nucleotide sequences located upstream (5′ non-coding sequences), within or downstream (3′ non-coding sequences) of a coding sequence, and which influence the transcription, RNA processing or stability, or translation of the associated coding sequence. Regulatory sequences include, but are not limited to, promoters, enhancers, introns, 5′ and 3′ untranslated regions, translation leader sequences, termination signals, and polyadenylation signal sequences.
- An expression vector may also include a nucleotide sequence for a selectable marker, which can be used to select a transformed host cell.
- As used herein, “selectable marker” means a nucleotide sequence that when expressed imparts a distinct phenotype to the host cell expressing the marker and thus allows such transformed host cells to be distinguished from those that do not have the marker. Such a nucleotide sequence may encode either a selectable or screenable marker, depending on whether the marker confers a trait that can be selected for by chemical means, such as by using a selective agent (e.g., an antibiotic, a sugar, a carbon source, or the like), or on whether the marker is simply a trait that one can identify through observation or testing, such as by screening. Examples of suitable selectable markers are known in the art and can be used in the expression vectors described herein.
- The vector and/or expression vectors and/or polynucleotides may be introduced in to a cell.
- The term “introducing,” in the context of a nucleotide sequence of interest (e.g., the nucleic acid molecules/constructs/expression vectors), refers to presenting the nucleotide sequence of interest to cell host in such a manner that the nucleotide sequence gains access to the interior of a cell. Where more than one nucleotide sequence is to be introduced these nucleotide sequences can be assembled as part of a single polynucleotide or nucleic acid construct, or as separate polynucleotide or nucleic acid constructs, and can be located on the same or different transformation vectors. Accordingly, these polynucleotides may be introduced into host cells in a single transformation event, or in separate transformation events.
- As used herein, the term “contacting” refers to a process by which, for example, a compound may be delivered to a cell. The compound may be administered in a number of ways, including, but not limited to, direct introduction into a cell (i.e., intracellularly) and/or extracellular introduction into a cavity, interstitial space, or into the circulation of the organism.
- The term “transformation” or “transfection” as used herein refers to the introduction of a polynucleotide or heterologous nucleic acid into a cell. Transformation of a cell may be stable or transient.
- The term “transient transformation” as used herein in the context of a polynucleotide refers to a polynucleotide introduced into the cell and does not integrate into the genome of the cell.
- The terms “stably introducing” or “stably introduced” in the context of a polynucleotide introduced into a cell is intended to represent that the introduced polynucleotide is stably incorporated into the genome of the cell, and thus the cell is stably transformed with the polynucleotide.
- The term “host cell” includes an individual cell or cell culture which can be or has been a recipient of any recombinant vector(s) or isolated polynucleotide of the invention. Host cells include progeny of a single host cell, and the progeny may not necessarily be completely identical (in morphology or in total DNA complement) to the original parent cell due to natural, accidental, or deliberate mutation and/or change. A host cell includes cells transformed in vivo or in vitro with a recombinant vector or a polynucleotide of the invention. A host cell which comprises a recombinant vector of the invention is a recombinant host cell.
- In some examples, a host cell may be a bacterial cell, a fungal cell, a protist cell, or a plant cell. Specific examples of host cells are described below.
- “Conversion” refers to the enzymatic transformation of a substrate to the corresponding product. “Percent conversion” refers to the percent of the substrate that is converted to the product within a period of time under specified conditions. Thus, for example, the “activity” or “conversion rate” of a ketoreductase polypeptide can be expressed as “percent conversion” of the substrate to the product.
- “Hydrophilic Amino Acid or Residue” refers to an amino acid or residue having a side chain exhibiting a hydrophobicity of less than zero according to the normalized consensus hydrophobicity scale Eisenberg et al., 1984. Genetically encoded hydrophilic amino acids include L-Thr (T), L-Ser (S), L-His (H), L-Glu (E), L-Asn (N), L-Gln (Q), L-Asp (D), L-Lys (K) and L-Arg (R).
- “Acidic Amino Acid or Residue” refers to a hydrophilic amino acid or residue having a side chain exhibiting a pKa value of less than about 6 when the amino acid is included in a peptide or polypeptide. Acidic amino acids typically have negatively charged side chains at physiological pH due to loss of a hydrogen ion. Genetically encoded acidic amino acids include L-Glu (E) and L-Asp (D).
- “Basic Amino Acid or Residue” refers to a hydrophilic amino acid or residue having a side chain exhibiting a pKa value of greater than about 6 when the amino acid is included in a peptide or polypeptide. Basic amino acids typically have positively charged side chains at physiological pH due to association with hydronium ion. Genetically encoded basic amino acids include L-Arg (R) and L-Lys (K).
- “Polar Amino Acid or Residue” refers to a hydrophilic amino acid or residue having a side chain that is uncharged at physiological pH, but which has at least one bond in which the pair of electrons shared in common by two atoms is held more closely by one of the atoms. Genetically encoded polar amino acids include L-Asn (N), L-Gln (Q), L-Ser (S) and L-Thr (T).
- “Hydrophobic Amino Acid or Residue” refers to an amino acid or residue having a side chain exhibiting a hydrophobicity of greater than zero according to the normalized consensus hydrophobicity scale (Eisenberg et al., 1984). Genetically encoded hydrophobic amino acids include L-Pro (P), L-Ile (I), L-Phe (F), L-Val (V), L-Leu (L), L-Trp (W), L-Met (M), L-Ala (A) and L-Tyr (Y).
- “Aromatic Amino Acid or Residue” refers to a hydrophilic or hydrophobic amino acid or residue having a side chain that includes at least one aromatic or heteroaromatic ring. Genetically encoded aromatic amino acids include L-Phe (F), L-Tyr (Y) and L-Trp O). Although owing to the pKa of its heteroaromatic nitrogen atom L His (H) it is sometimes classified as a basic residue, or as an aromatic residue as its side chain includes a heteroaromatic ring, herein histidine is classified as a hydrophilic residue.
- “Constrained amino acid or residue” refers to an amino acid or residue that has a constrained geometry. Herein, constrained residues include L-Pro (P) and L-His (H). Histidine has a constrained geometry because it has a relatively small imidazole ring. Proline has a constrained geometry because it also has a five membered ring.
- “Non-polar Amino Acid or Residue” refers to a hydrophobic amino acid or residue having a side chain that is uncharged at physiological pH and which has bonds in which the pair of electrons shared in common by two atoms is generally held equally by each of the two atoms (i.e., the side chain is not polar). Genetically encoded non-polar amino acids include L-Gly (G), L-Leu (L), L-Val (V), L-Ile (I), L-Met (M) and L-Ala (A).
- “Aliphatic Amino Acid or Residue” refers to a hydrophobic amino acid or residue having an aliphatic hydrocarbon side chain. Genetically encoded aliphatic amino acids include L-Ala (A), L-Val (V), L-Leu (L) and L-Ile (I).
- “Small Amino Acid or Residue” refers to an amino acid or residue having a side chain that is composed of a total three or fewer carbon and/or heteroatoms (excluding the α-carbon and hydrogens). The small amino acids or residues may be further categorized as aliphatic, non-polar, polar or acidic small amino acids or residues, in accordance with the above definitions. Genetically-encoded small amino acids include L-Ala (A), L-Val (V), L-Cys (C), L-Asn (N), L-Ser (S), L-Thr (T) and L-Asp (D).
- A “conservative” amino acid substitution (or mutation) refers to the substitution of a residue with a residue having a similar side chain, and thus typically involves substitution of the amino acid in the polypeptide with amino acids within the same or similar defined class of amino acids. For the following residues, the possible conservative mutations are provided in parentheses: A, L, V, I (Other aliphatic residues: A, L, V, I); A, L, V, I, G, M (Other non-polar residues: A, L, V, I, G, M); D, E (Other acidic residues: D, E); K, R (Other basic residues: K, R); P, H (Other constrained residues: P, H); N, Q, S, T (Other polar residues: N, Q, S, T); Y, W, F (Other aromatic residues: Y, W, F); and C (none).
- Phytocannabinoids are a large class of compounds with over 100 different known structures that are produced in the Cannabis plant. These bio-active molecules, such as tetrahydrocannabinol (THC) and cannabidiol (CBD), can be extracted from plant material for medical and psychotropic purposes. However, the synthesis of plant material is costly, not readily scalable to large volumes, and requires lengthy growth periods to produce sufficient quantities of phytocannabinoids. A fermentable organism such as Saccharomyces cerevisiae capable of producing cannabinoids would provide an economical route to producing these compounds on an industrial scale. The extensive time, energy, and labour involved in growing C. sativa for phytocannabinoid production provides a motivation to produce transgenic cell lines for production of phytocannabinoids in yeast. One example of such efforts is provided in PCT application by Mookerjee et al WO2018/148848. Aromatic prenyltransferase from cannabis are described by Page et al. in U.S. Pat. No. 8,884,100.
-
FIG. 1 illustrates a cannabinoid biosynthesis pathway in Cannabis sativa. As expression and functionality of the C. sativa pathway in S. cerevisiae is hindered by problems of toxic precursors and poor expression, a novel biosynthetic route for cannabinoid production was developed that overcomes said issues. This pathway is described inFIG. 1 and comprises a multi-enzyme system. DiPKS from D. discoideum and OAC from C. sativa are used to produce olivetolic acid directly from glucose. GPP from the yeast terpenoid pathway and OLA are subsequently converted to cannabigerolic acid catalyzed by using a prenyltransferase. Then, C. sativa THCa synthase or CBDa synthase is used to further cyclize cannabigerolic acid to form THCa or CBDa respectively. -
FIG. 2 illustrates a cannabinoid biosynthesis pathway as described in Applicant's co-pending PCT Application No. CA2020/050687 (Bourgeois et al., filed May 21, 2019), which is herein incorporated by reference. - The first committed step in the cannabinoid biosynthesis pathway is the biosynthesis of olivetolic acid. This is done using a polyketide synthase such as DiPKS (Ghosh et al., 2008) from D. discoideum or OAS (Taura et al., 2009) from C. sativa and also requires the use of an olivetolic acid cyclase (OAC) (Gagne et al., 2012). The process begins with the polyketide synthase condensing three units of malonyl-CoA to form a linear tetraketide. Olivetolic acid cyclase can then cyclize the tetraketide backbone to form olivetolic acid. In the absence of OAC, the tetraketide can spontaneously cyclize to form olivetol, an unwanted byproduct in the cannabinoid biosynthesis pathway. In order to improve enzyme performance in a heterologous host, the authors subjected OAC to an enzyme engineering regimen.
- Enzyme engineering is the process of improving a desired phenotype of the enzyme by making modifications to the amino acid sequence of the polypeptide. As the functionality of the enzyme is dependent on the structure of the enzyme and the structure of the enzyme is dependent, partially, on the primary amino acid sequence; modification of the amino acid sequence of the enzyme could lead to a beneficial impact on the desired phenotype. This principle was applied to olivetolic acid cyclase (OAC) and modifications were made to its amino acid sequence using a directed evolution approach. This allowed for the identification of amino acid residues that improved olivetolic acid production in a strain of recombinant S. cerevisiae. Beneficial mutations were then tested in conjunction to identify combinations of mutations that improve enzyme performance.
- Sequences are described herein that have multiple residues modified as compared to the wild type OAC sequence. Certain mutations produce over 2× more olivetolic acid than the wild type OAC when expressed in S. cerevisiae. Improvements to one or more enzyme properties as exhibited in the engineered OACs may include increases in enzyme activity, improved enzyme kinetics and turnover, higher tolerance to increased levels of substrate, and improved tolerance to increased product levels.
- The modifications of the amino acid residues, as compared to the wild type OAC sequence may be conservative modifications or non-conservative modifications. Insertions or deletions may be used to modify the residues, relative to wild type OAC. Note that in the OAC described herein, the protein may end at position 101 instead of 102, as in other reports of wild type OAC sequences. In embodiments described herein, the residues represented as X{#} may be modified, where {#} represents the sequence position in the amino acid position of the wild type OAC sequence referenced herein as (SEQ ID NO:91). Thus, SEQ ID NO:92 comprises the option of mutations at X{28}, X{31}, X{41}, X{43}, X{44}, X{68}, X{74}, X{84}, X{100}, and X{102}, as outlined below:
- SEQ ID NO:91 represents wild type OAC protein:
-
- MAVKHLIVLK FKDEITEAQK EEFFKTYVNL VNIIPAMKDV YWGKDVTQKN 50
- KEEGYTHIVE VTFESVETIQ
DYIIHPAHVG FGDVYRSFWE KLLIFDYTPR 100 - KG 102
- SEQ ID NO:92 represents the generalized variant OAC protein, wherein X
- represents candidate locations for mutated residues (where X represents any amino acid):
-
MAVKHLIVLK FKDEITEAQK EEFFKTYXNL XNIIPAMKDV XWXXDVTQKN 50 KEEGYTHIVE VTFESVEXIQ DYIXHPAHVG FGDXYRSFWE KLLIFDYTPX 100KX 102 - Materials and Methods:
- Genetic Manipulations:
- Vector VB40 was used to construct all expression plasmids encoding enzyme proteins disclosed herein, including OAC and variants.
- The OAC variants were constructed in a combinatorial library using mutations that were initially selected in a site-saturation mutagenesis library screen. Plasmid VB40_OAC was used as the template in all library construction.
- Site-saturation mutagenesis was conducted at each amino acid position by a PCR reaction using a forward degenerate NNK primer and a ‘back-to-back’ reverse non-mutagenic primer (
FIG. 3 ). The PCR products were then processed through in vitro kinase-ligase-Dpnl reactions and transformed into Escherichia coli DH5alpha strain for amplification. -
FIG. 3 illustrates PCR primers used in site-saturation mutagenesis protocol. Right-facing arrows represents forward degenerate NNK primer, symbol * denotes the mutational position, and the left-facing arrows represent reverse primer designed ‘back-to-back’ in the opposite direction of the forward primer. - The combinatorial library was constructed by an in-house protocol. Selected mutations were combined through an overlap-extension PCR of a batch of mutagenic oligonucleotides that were generated using targeted mutagenic primers. (
FIG. 4 ). Double-stranded DNA of the assembled combinatorial mutant variants were cloned into a vector with complementary overlapping sequences, which resulted in a pool of OAC combinatorial variants.FIG. 4 shows an overlap-extension assembly of mutagenic oligonucleotides for combinatorial library construction. The symbol x represents a point mutation. - The plasmids encoding OAC and variant proteins as disclosed herein were transformed and expressed in Saccharomyces cerevisiae, with the host strain HB1416. All DNA was transformed into background strains using the Gietz et al. transformation protocol (Gietz 2014).
- Strain Growth and Media:
- Strains were grown in yeast synthetic complete media with a composition of 1.7 g/L YNB without ammonium sulfate, 1.92 g/L URA dropout amino acid supplement, 1.5 g/L magnesium L-glutamate, with 2% w/v galactose, 2% w/v raffinose, 200 μg/l geneticin, and 200 μg/L ampicillin (Sigma-Aldrich Canada). The culture was incubated at 30° C. for four days (96 hours). Strain HB1891 and HB1892 were respectively used as wild type control and negative control in all of the screenings.
- Variant Screening Conditions:
- Each variant was tested in three replicates and each replicate was clonally derived from single colonies. All strains were grown in 500 μl of media for 96 hours in 96-well deepwell plates. The 96-well deepwell plates were incubated at 30° C. and shaken at 950 rpm for 96 hrs.
- Metabolite extraction was performed by adding 30 μl of culture to 270 μl of 56% acetonitrile in a new 96-well microtiter plate. The solutions were mixed thoroughly, then centrifuged at 3750 rpm for 10 mins. 200 μl of the soluble layer was removed and stored in a 96-well v-bottom microtiter plate. Samples were stored at −20° C. until analysis.
- Quantification Protocol:
- The quantification of olivetolic acid was performed using HPLC-MS/MS on a Waters Acquity UPLC-TQD MS. The chromatography and MS conditions are described below.
- HPLC Conditions
- Column: ACQUITY HSS C18 UPLC 50×1 mm, 1.8 μm particle size (PN:186003529); Column temperature: 45° C.; Flow rate: 0.350 mL/min; Eluent A: Water+Formic Acid; Eluent B: Acetonitrile+0.1% Formic Acid; Gradient is shown in Table 1.
-
TABLE 1 Gradient Time (min) % B 0 20 0.60 98 1.10 98 1.11 20 1.60 20 - ESI-MS Conditions
- The following conditions were utilized: Capillary: 2.90 (kV); Source temperature: 150° C.; Desolvation gas temperature: 250° C.; Desolvation gas flow (nitrogen): 500 L/hour; Cone gas flow (nitrogen): 1 L/hour; Detection parameters are shown in Table 2.
-
TABLE 2 Detection Parameters OVLa OVL CBGa THCa Retention time 0.70 0.72 0.98 1.12 (min) Parent (m/z) 223.0 181.1 359.2 357.2 Daughter (m/z) 179.0 71.0 341.2 313.2 Mode ES−, MRM ES+, MRM ES−, MRM ES−, MRM Cone (V) 35 20 40 45 Collision (V) 20 12 25 30 - Strains used are described in Table 3.
-
TABLE 3 Strains Used Strain # Background Plasmids Genotype Notes HB1416 -URA, -LEU None Saccharomyces cerevisiae Parent strain for CEN.PK2; ΔLEU2; ΔURA3; Erg2 olivetolic acid 0K197E::KanMx; ALD6; ASC1L6 production 41P; NPGA; MAF1; PGK1p:Acc1; screen tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 HB1891 -URA, -LEU PLAS-417 Saccharomyces cerevisiae Expresses CEN.PK2; ΔLEU2; ΔURA3; Erg2 OAC; positive 0K197E::KanMx; ALD6; ASC1L6 control for 41P; NPGA; MAF1; PGK1p:Acc1; olivetolic acid tHMGR1; IDI; DiPKS_G1516R X production 5; ACC1_S659A_S1157A; UB14 screen p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 HB1892 -URA, -LEU PLAS-416 Saccharomyces cerevisiae Expresses non- CEN.PK2; ΔLEU2; ΔURA3; Erg2 catalytic 0K197E::KanMx; ALD6; ASC1L6 mScarlet; 41P; NPGA; MAF1; PGK1p:Acc1; negative control tHMGR1; IDI; DiPKS_G1516R X for olivetolic 5; ACC1_S659A_S1157A; UB14 acid production p:ERG20; PT254-R2S; Ost1- screen pro-alpha-f(I)-OXC53 PLT1577- -URA, -LEU PLAS-527 Saccharomyces cerevisiae Expresses D10 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1575- -URA, -LEU PLAS-528 Saccharomyces cerevisiae Expresses D12 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1576- -URA, -LEU PLAS-529 Saccharomyces cerevisiae Expresses B9 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1575- -URA, -LEU PLAS-530 Saccharomyces cerevisiae Expresses C12 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1575- -URA, -LEU PLAS-531 Saccharomyces cerevisiae Expresses D2 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1572- -URA, -LEU PLAS-532 Saccharomyces cerevisiae Expresses B10 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1576- -URA, -LEU PLAS-533 Saccharomyces cerevisiae Expresses A9 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1577- -URA, -LEU PLAS-534 Saccharomyces cerevisiae Expresses D1 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1577- -URA, -LEU PLAS-535 Saccharomyces cerevisiae Expresses B7 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1575- -URA, -LEU PLAS-536 Saccharomyces cerevisiae Expresses F8 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1574- -URA, -LEU PLAS-537 Saccharomyces cerevisiae Expresses H11 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1575- -URA, -LEU PLAS-538 Saccharomyces cerevisiae Expresses E1 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1575- -URA, -LEU PLAS-539 Saccharomyces cerevisiae Expresses G1 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1576- -URA, -LEU PLAS-540 Saccharomyces cerevisiae Expresses A2 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1576- -URA, -LEU PLAS-541 Saccharomyces cerevisiae Expresses B10 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1573- -URA, -LEU PLAS-542 Saccharomyces cerevisiae Expresses D7 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1575- -URA, -LEU PLAS-543 Saccharomyces cerevisiae Expresses H10 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1574- -URA, -LEU PLAS-544 Saccharomyces cerevisiae Expresses F5 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1573- -URA, -LEU PLAS-545 Saccharomyces cerevisiae Expresses D8 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1574- -URA, -LEU PLAS-546 Saccharomyces cerevisiae Expresses A11 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1574- -URA, -LEU PLAS-547 Saccharomyces cerevisiae Expresses B9 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1576- -URA, -LEU PLAS-548 Saccharomyces cerevisiae Expresses B5 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1573- -URA, -LEU PLAS-549 Saccharomyces cerevisiae Expresses F2 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1572- -URA, -LEU PLAS-550 Saccharomyces cerevisiae Expresses H10 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1573- -URA, -LEU PLAS-551 Saccharomyces cerevisiae Expresses E12 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1573- -URA, -LEU PLAS-552 Saccharomyces cerevisiae Expresses D12 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; HMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1573- -URA, -LEU PLAS-553 Saccharomyces cerevisiae Expresses B8 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1573- -URA, -LEU PLAS-554 Saccharomyces cerevisiae Expresses A2 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1576- -URA, -LEU PLAS-555 Saccharomyces cerevisiae Expresses B4 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1573- -URA, -LEU PLAS-556 Saccharomyces cerevisiae Expresses G12 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1576- -URA, -LEU PLAS-557 Saccharomyces cerevisiae Expresses G12 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1575- -URA, -LEU PLAS-558 Saccharomyces cerevisiae Expresses H5 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1576- -URA, -LEU PLAS-559 Saccharomyces cerevisiae Expresses A6 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1576- -URA, -LEU PLAS-560 Saccharomyces cerevisiae Expresses D1 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1572- -URA, -LEU PLAS-561 Saccharomyces cerevisiae Expresses E8 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1572- -URA, -LEU PLAS-562 Saccharomyces cerevisiae Expresses H9 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 PLT1575- -URA, -LEU PLAS-563 Saccharomyces cerevisiae Expresses F9 CEN.PK2; ΔLEU2; ΔURA3; Erg2 mutant OAC 0K197E::KanMx; ALD6; ASC1L6 41P; NPGA; MAF1; PGK1p:Acc1; tHMGR1; IDI; DiPKS_G1516R X 5; ACC1_S659A_S1157A; UB14 p:ERG20; PT254-R2S; Ost1- pro-alpha-f(I)-OXC53 - The following plasmids were used, as described in Table 4.
-
TABLE 4 Plasmids Plasmid SEQ # Name ID NO. Description Selection 1 PLAS-416 001 Gal1p:mScarlet:Cyc1t Uracil 2 PLAS-417 002 Gal1p:OAC:Cyc1t Uracil 3 PLAS-527 003 Gal1p:OAC- Uracil V28A/Y41T/G43SILENT(=GGG)/ T68L/174E/V84R/R100M/G102R: Cyc1t 4 PLAS-528 004 Gal1p:OAC- Uracil Y41S/G43SILENT(=GGG)/K44V/ T68R/I74R/V84R:Cyc1t 5 PLAS-529 005 Gal1p:OAC- Uracil V28A/V31G/Y41S/G43SILENT (=GGG)/K44V/T68L/I74R/V84R/ R100E/G102R:Cyc1t 6 PLAS-530 006 Gal1p:OAC- Uracil V28A/Y41T/G43SILENT(=GGG)/ T68L/174G/V84R/R100E:Cyc1t 7 PLAS-531 007 Gal1p:OAC- Uracil V28A/Y41T/G43SILENT(=GGG)/ K44V/T68L/174D/V84R/R100M/ G102R:Cyc1t 8 PLAS-532 008 Gal1p:OAC- Uracil V28A/Y41T/G43SILENT(=GGG)/ T68L/174D/V84R/G102R:Cyc1t 9 PLAS-533 009 Gal1p:OAC- Uracil V28A/Y41T/G43SILENT(=GGG)/ K44V/T68L/I74R/V84R/R100E/ G102R:Cyc1t 10 PLAS-534 010 Gal1p:OAC- Uracil Y41T/G43SILENT(=GGG)/T68R/ I74R/V84R/R100M/G102STOP: Cyc1t 11 PLAS-535 011 Gal1p:OAC- Uracil V28A/Y41V/G43SILENT(=GGG)/ K44V/T68L/174G/V84R/R100E/ G102R:Cyc1t 12 PLAS-536 012 Gal1p:OAC- Uracil V28A/Y41T/G43SILENT(=GGG)/ K44V/T68R/174G/V84R/ G102STOP:Cyc1t 13 PLAS-537 013 Gal1p:OAC- Uracil V28A/Y41V/G43SILENT(=GGG)/ K44V/T68L/I74R/V84R/R100E/ G102STOP:Cyc1t 14 PLAS-538 014 Gal1p:OAC- Uracil V28A/V31G/Y41T/G43SILENT (=GGG)/K44V/T68R/174E/V84R/ R100E:Cyc1t 15 PLAS-539 015 Gal1p:OAC- Uracil V28A/Y41S/G43SILENT(=GGG)/ T68R/I74R/V84R/R100M/ G102STOP:Cyc1t 16 PLAS-540 016 Gal1p:OAC- Uracil Y41T/G43SILENT(=GGG)/K44V/ T68L/174G/V84R/G102R:Cyc1t 17 PLAS-541 017 Gal1p:OAC- Uracil V28A/Y41T/G43SILENT(=GGG)/ K44V/T68R/I74R/V84R/R100E/ G102R:Cyc1t 18 PLAS-542 018 Gal1p:OAC- Uracil V28A/V31G/Y41S/G43SILENT (=GGG)/K44V/T68R/I74R/V84R: Cyc1t 19 PLAS-543 019 Gal1p:OAC- Uracil V28A/G43SILENT(=GGG)/K44V/ I74D/V84R/R100E/G102R(=CGC): Cyc1t 20 PLAS-544 020 Gal1p:OAC- Uracil V28A/Y41V/G43SILENT(=GGG)/ K44V/T68L/174G/V84R/G102R: Cyc1t 21 PLAS-545 021 Gal1p:OAC- Uracil Y41T/G43SILENT(=GGG)/T68L/ I74G/V84R/R100M/G102R:Cyc1t 22 PLAS-546 022 Gal1p:OAC- Uracil V28A/V31G/Y41T/G43SILENT (=GGG)/K44V/T68R/I74R/V84R/ R100E/G102R:Cyc1t 23 PLAS-547 023 Gal1p:OAC- Uracil V31G/Y41S/G43SILENT(=GGG)/ K44V/T68R/V84R/R100E:Cyc1t 24 PLAS-548 024 Gal1p:OAC- Uracil V28A/Y41V/G43SILENT(=GGG)/ T68L/174D/R100E/G102STOP: Cyc1t 25 PLAS-549 025 Gal1p:OAC- Uracil V28A/Y41V/G43SILENT(=GGG)/ T68R/174G/V84R/R100M/G102R: Cyc1t 26 PLAS-550 026 Gal1p:OAC- Uracil V31G/G43SILENT(=GGG)/174G/ V84R/R100E:Cyc1t 27 PLAS-551 027 Gal1p:OAC- Uracil V28A/Y41S/G43SILENT(=GGG)/ 4K4V/T68R/I74R/V84R/R100M/ G102R:Cyc1t 28 PLAS-552 028 Gal1p:OAC- Uracil V28A/V31G/Y41V/G43SILENT (=GGG)/K44V/T68L/174G/V84R: Cyc1t 29 PLAS-553 029 Gal1p:OAC- Uracil V28A/Y41V/G43SILENT(=GGG)/ K44V/T68L/174G/V84R/R100M/ G102R:Cyc1t 30 PLAS-554 030 Gal1p:OAC- Uracil V28A/V31G/G43SILENT(=GGG)/ T68L/I74R/V84R/R100E/G102R: Cyc1t 31 PLAS-555 031 Gal1p:OAC- Uracil V31G/Y41V/G43SILENT(=GGG)/ K44V/T68L/I74R/V84R/R100E/ G102STOP:Cyc1t 32 PLAS-556 032 Gal1p:OAC- Uracil V31G/Y41T/G43SILENT(=GGG)/ K44V/T68R/174D/V84R/G102R: Cyc1t 33 PLAS-557 033 Gal1p:OAC- Uracil V31G/Y41T/G43SILENT(=GGG)/ K44V/T68R/174D/V84R/R100E/ G102R:Cyc1t 34 PLAS-558 034 Gal1p:OAC- Uracil V28A/Y41S/G43SILENT(=GGG)/ K44V/T68R/174G/V84R/R100M/ G102R:Cyc1t 35 PLAS-559 035 Gal1p:OAC- Uracil V28A/Y41V/G43SILENT(=GGG)/ K44V/I74R/R100E/G102STOP: Cyc1t 36 PLAS-560 036 Gal1p:OAC- Uracil V28A/V31G/Y41T/G43SILENT (=GGG)/K44V/174E/V84R/R100M/ G102R:Cyc1t 37 PLAS-561 037 Gal1p:OAC- Uracil V28A/G43SILENT(=GGG)/K44V/ T68R/174E/V84R/R100E/ G102STOP:Cyc1t 38 PLAS-562 038 Gal1p:OAC- Uracil V31G/Y41T/G43SILENT(=GGG)/ K44V/T68L/174D/V84R/G102R: Cyc1t 39 PLAS-563 039 Gal1p:OAC- Uracil Y41V/G43SILENT(=GGG)/K44V/ T68L/I74R/V84R/G102R:Cyc1t - The following sequences are described herein (Table 5)
-
TABLE 5 Sequences SEQ Length Position of ID Plasmid DNA/ of se- coding NO: Name Description Protein quence sequence 001 PLAS- Gal1p:mScarlet: DNA 6114 2649 to 3347 416 Cyc1t 002 PLAS- Gal1p:OAC:Cyc1t DNA 5724 2649 to 2957 417 003 PLAS- Gal1p:OAC- DNA 5724 2648 to 2957 527 V28A/Y41T/ G43SILENT(=GGG)/ T68L/I74E/V84R/ R100M/G102R: Cyc1t 004 PLAS- Gal1p:OAC- DNA 5724 2648 to 2957 528 Y41S/G43SILENT (=GGG)/K44V/ T68R/I74R/V84R: Cyc1t 005 PLAS- Gal1p:OAC- DNA 5724 2648 to 2957 529 V28A/V31G/ Y41S/G43SILENT (=GGG)/K44V/ T68L/I74R/V84R/ R100E/G102R:Cyc1t 006 PLAS- Gal1p:OAC- DNA 5724 2648 to 2957 530 V28A/Y41T/ G43SILENT(=GGG)/ T68L/I74G/V84R/ R100E:Cyc1t 007 PLAS- Gal1p:OAC- DNA 5724 2648 to 2957 531 V28A/Y41T/ G43SILENT(=GGG)/ K44V/T68L/I74D/ V84R/R100M/ G102R:Cyc1t 008 PLAS- Gal1p:OAC- DNA 5724 2648 to 2957 532 V28A/Y41T/ G43SILENT(=GGG)/ T68L/I74D/V84R/ G102R:Cyc1t 009 PLAS- Gal1p:OAC- DNA 5724 2648 to 2957 533 V28A/Y41T/ G43SILENT(=GGG)/ K44V/T68L/I74R/ V84R/R100E/ G102R:Cyc1t 010 PLAS- Gal1p:OAC- DNA 5724 2648 to 2954 534 Y41T/G43SILENT (=GGG)/T68R/ I74R/V84R/R100M/ G102STOP: Cyc1t 011 PLAS- Gal1p:OAC- DNA 5724 2648 to 2957 535 V28A/Y41V/ G43SILENT(=GGG)/ K44V/T68L/I74G/ V84R/R100E/ G102R:Cyc1t 012 PLAS- Gal1p:OAC- DNA 5724 2648 to 2954 536 V28A/Y41T/ G43SILENT(=GGG)/ K44V/T68R/I74G/ V84R/G102STOP: Cyc1t 013 PLAS- Gal1p:OAC- DNA 5724 2648 to 2954 537 V28A/Y41V/ G43SILENT(=GGG)/ K44V/T68L/I74R/ V84R/R100E/ G102STOP:Cyc1t 014 PLAS- Gal1p:OAC- DNA 5724 2648 to 2957 538 V28A/V31G/ Y41T/G43SILENT (=GGG)/K44V/ T68R/I74E/V84R/ R100E:Cyc1t 015 PLAS- Gal1p:OAC- DNA 5724 2648 to 2954 539 V28A/Y41S/ G43SILENT(=GGG)/ T68R/I74R/V84R/ R100M/G102STOP: Cyc1t 016 PLAS- Gal1p:OAC- DNA 5724 2648 to 2957 540 Y41T/G43SILENT (=GGG)/K44V/ T68L/I74G/V84R/ G102R:Cyc1t 017 PLAS- Gal1p:OAC- DNA 5724 2648 to 2957 541 V28A/Y41T/ G43SILENT(=GGG)/ K44V/T68R/I74R/ V84R/R100E/ G102R:Cyc1t 018 PLAS- Gal1p:OAC- DNA 5724 2648 to 2957 542 V28A/V31G/Y41S/ G43SILENT (=GGG)/K44V/ T68R/I74R/V84R: Cyc1t 019 PLAS- Gal1p:OAC- DNA 5724 2648 to 2957 543 V28A/G43SILENT (=GGG)/K44V/ I74D/V84R/R100E/ G102R(=CGC): Cyc1t 020 PLAS- Gal1p:OAC- DNA 5724 2648 to 2957 544 V28A/Y41V/ G43SILENT(=GGG)/ K44V/T68L/I74G/ V84R/G102R: Cyc1t 021 PLAS- Gal1p:OAC- DNA 5724 2648 to 2957 545 Y41T/G43SILENT (=GGG)/T68L/ I74G/V84R/R100M/ G102R:Cyc1t 022 PLAS- Gal1p:OAC- DNA 5724 2648 to 2957 546 V28A/V31G/ Y41T/G43SILENT (=GGG)/K44V/ T68R/I74R/V84R/ R100E/G102R: Cyc1t 023 PLAS- Gal1p:OAC- DNA 5724 2648 to 2957 547 V31G/Y41S/ G43SILENT(=GGG)/ K44V/T68R/ V84R/R100E:Cyc1t 024 PLAS- Gal1p:OAC- DNA 5724 2648 to 2954 548 V28A/Y41V/ G43SILENT(=GGG)/ T68L/I74D/ R100E/G102STOP: Cyc1t 025 PLAS- Gal1p:OAC- DNA 5724 2648 to 2957 549 V28A/Y41V/ G43SILENT(=GGG)/ T68R/I74G/V84R/ R100M/G102R: Cyc1t 026 PLAS- Gal1p:OAC- DNA 5724 2648 to 2957 550 V31G/G43SILENT (=GGG)/I74G/ V84R/R100E: Cyc1t 027 PLAS- Gal1p:OAC- DNA 5724 2648 to 2957 551 V28A/Y41S/ G43SILENT(=GGG)/ K44V/T68R/I74R/ V84R/R100M/ G102R:Cyc1t 028 PLAS- Gal1p:OAC- DNA 5724 2648 to 2957 552 V28A/V31G/ Y41V/G43SILENT (=GGG)/K44V/ T68L/I74G/V84R: Cyc1t 029 PLAS- Gal1p:OAC- DNA 5724 2648 to 2957 553 V28A/Y41V/ G43SILENT(=GGG)/ K44V/T68L/I74G/ V84R/R100M/ G102R:Cyc1t 030 PLAS- Gal1p:OAC- DNA 5724 2648 to 2957 554 V28A/V31G/ G43SILENT(=GGG)/ T68L/I74R/V84R/ R100E/G102R: Cyc1t 031 PLAS- Gal1p:OAC- DNA 5724 2648 to 2954 555 V31G/Y41V/ G43SILENT(=GGG)/ K44V/T68L/I74R/ V84R/R100E/ G102STOP:Cyc1t 032 PLAS- Gal1p:OAC- DNA 5724 2648 to 2957 556 V31G/Y41T/ G43SILENT(=GGG)/ K44V/T68R/I74D/ V84R/G102R: Cyc1t 033 PLAS- Gal1p:OAC- DNA 5724 2648 to 2957 557 V31G/Y41T/ G43SILENT(=GGG)/ K44V/T68R/I74D/ V84R/R100E/ G102R:Cyc1t 034 PLAS- Gal1p:OAC- DNA 5724 2648 to 2957 558 V28A/Y41S/ G43SILENT(=GGG)/ K44V/T68R/I74G/ V84R/R100M/ G102R:Cyc1t 035 PLAS- Gal1p:OAC- DNA 5724 2648 to 2954 559 V28A/Y41V/ G43SILENT(=GGG)/ K44V/I74R/R100E/ G102STOP: Cyc1t 036 PLAS- Gal1p:OAC- DNA 5724 2648 to 2957 560 V28A/V31G/Y41T/ G43SILENT (=GGG)/K44V/I74E/ V84R/R100M/ G102R:Cyc1t 037 PLAS- Gal1p:OAC- DNA 5724 2648 to 2954 561 V28A/G43SILENT (=GGG)/K44V/ T68R/I74E/V84R/ R100E/G102STOP: Cyc1t 038 PLAS- Gal1p:OAC- DNA 5724 2648 to 2957 562 V31G/Y41T/ G43SILENT(=GGG)/ K44V/T68L/I74D/ V84R/G102R: Cyc1t 039 PLAS- Gal1p:OAC- DNA 5724 2648 to 2957 563 Y41V/G43SILENT (=GGG)/K44V/ T68L/I74R/V84R/ G102R:Cyc1t 40 PLAS- Gal1p:OAC- Protein 102 All 527 V28A/Y41T/ G43SILENT(=GGG)/ T68L/I74E/V84R/ R100M/G102R: Cyc1t 41 PLAS- Gal1p:OAC- Protein 102 All 528 Y41S/G43SILENT (=GGG)/K44V/ T68R/I74R/V84R: Cyc1t 42 PLAS- Gal1p:OAC- Protein 102 All 529 V28A/V31G/ Y41S/G43SILENT(= GGG)/K44V/T68L/ I74R/V84R/R100E/ G102R:Cyc1t 43 PLAS- Gal1p:OAC- Protein 102 All 530 V28A/Y41T/ G43SILENT(=GGG)/ T68L/I74G/V84R/ R100E:Cyc1t 44 PLAS- Gal1p:OAC- Protein 102 All 531 V28A/Y41T/ G43SILENT(=GGG)/ K44V/T68L/I74D/ V84R/R100M/ G102R:Cyc1t 45 PLAS- Gal1p:OAC- Protein 102 All 532 V28A/Y41T/ G43SILENT(=GGG)/ T68L/I74D/V84R/ G102R:Cyc1t 46 PLAS- Gal1p:OAC- Protein 102 All 533 V28A/Y41T/ G43SILENT(=GGG)/ K44V/T68L/I74R/ V84R/R100E/ G102R:Cyc1t 47 PLAS- Gal1p:OAC- Protein 101 All 534 Y41T/G43SILENT (=GGG)/T68R/ I74R/V84R/ R100M/G102STOP: Cyc1t 48 PLAS- Gal1p:OAC- Protein 102 All 535 V28A/Y41V/ G43SILENT(=GGG)/ K44V/T68L/I74G/ V84R/R100E/ G102R:Cyc1t 49 PLAS- Gal1p:OAC- Protein 101 All 536 V28A/Y41T/ G43SILENT(=GGG)/ K44V/T68R/I74G/ V84R/G102STOP: Cyc1t 50 PLAS- Gal1p:OAC- Protein 101 All 537 V28A/Y41T/ G43SILENT(=GGG)/ K44V/T68R/I74G/ V84R/G102STOP: Cyc1t 51 PLAS- Gal1p:OAC- Protein 102 All 538 V28A/V31G/ Y41T/G43SILENT (=GGG)/K44V/ T68R/I74E/V84R/ R100E:Cyc1t 52 PLAS- Gal1p:OAC- Protein 101 All 539 V28A/Y41S/ G43SILENT(=GGG)/ T68R/I74R/V84R/ R100M/G102STOP: Cyc1t 53 PLAS- Gal1p:OAC- Protein 102 All 540 Y41T/G43SILENT (=GGG)/K44V/ T68L/I74G/V84R/ G102R:Cyc1t 54 PLAS- Gal1p:OAC- Protein 102 All 541 V28A/Y41T/ G43SILENT(=GGG)/ K44V/T68R/I74R/ V84R/R100E/ G102R:Cyc1t 55 PLAS- Gal1p:OAC- Protein 102 All 542 V28A/V31G/ Y41S/G43SILENT (=GGG)/K44V/ T68R/I74R/V84R: Cyc1t 56 PLAS- Gal1p:OAC- Protein 102 All 543 V28A/G43SILENT (=GGG)/K44V/ I74D/V84R/R100E/ G102R(=CGC): Cyc1t 57 PLAS- Gal1p:OAC- Protein 102 All 544 V28A/Y41V/ G43SILENT(=GGG)/ K44V/T68L/I74G/ V84R/G102R: Cyc1t 58 PLAS- Gal1p:OAC- Protein 102 All 545 Y41T/G43SILENT (=GGG)/T68L/ I74G/V84R/R100M/ G102R:Cyc1t 59 PLAS- Gal1p:OAC- Protein 102 All 546 V28A/V31G/ Y41T/G43SILENT (=GGG)/K44V/ T68R/I74R/V84R/ R100E/G102R: Cyc1t 60 PLAS- Gal1p:OAC- Protein 102 All 547 V31G/Y41S/ G43SILENT(=GGG)/ K44V/T68R/ V84R/R100E:Cyc1t 61 PLAS- Gal1p:OAC- Protein 101 All 548 V28A/Y41V/ G43SILENT(=GGG)/ T68L/I74D/ R100E/G102STOP: Cyc1t 62 PLAS- Gal1p:OAC- Protein 102 All 549 V28A/Y41V/ G43SILENT(=GGG)/ T68R/I74G/V84R/ R100M/G102R: Cyc1t 63 PLAS- Gal1p:OAC- Protein 102 All 550 V31G/G43SILENT (=GGG)/I74G/ V84R/R100E: Cyc1t 64 PLAS- Gal1p:OAC- Protein 102 All 551 V28A/Y41S/ G43SILENT(=GGG)/ K44V/T68R/I74R/ V84R/R100M/ G102R:Cyc1t 65 PLAS- Gal1p:OAC- Protein 102 All 552 V28A/V31G/ Y41V/G43SILENT (=GGG)/K44V/ T68L/I74G/V84R: Cyc1t 66 PLAS- Gal1p:OAC- Protein 102 All 553 V28A/Y41V/ G43SILENT(=GGG)/ K44V/T68L/I74G/ V84R/R100M/ G102R:Cyc1t 67 PLAS- Gal1p:OAC- Protein 102 All 554 V28A/V31G/ G43SILENT(=GGG)/ T68L/I74R/V84R/ R100E/G102R: Cyc1t 68 PLAS- Gal1p:OAC- Protein 101 All 555 V31G/Y41V/ G43SILENT(=GGG)/ K44V/T68L/I74R/ V84R/R100E/ G102STOP:Cyc1t 69 PLAS- Gal1p:OAC- Protein 102 All 556 V31G/Y41T/ G43SILENT(=GGG)/ K44V/T68R/I74D/ V84R/G102R: Cyc1t 70 PLAS- Gal1p:OAC- Protein 102 All 557 V31G/Y41T/ G43SILENT(=GGG)/ K44V/T68R/I74D/ V84R/R100E/G102R: Cyc1t 71 PLAS- Gal1p:OAC- Protein 102 All 558 V28A/Y41S/ G43SILENT(=GGG)/ K44V/T68R/I74G/ V84R/R100M/G102R: Cyc1t 72 PLAS- Gal1p:OAC- Protein 101 All 559 V28A/Y41V/ G43SILENT(=GGG)/ K44V/I74R/R100E/ G102STOP: Cyc1t 73 PLAS- Gal1p:OAC- Protein 102 All 560 V28A/V31G/Y41T/ G43SILENT (=GGG)/K44V/I74E/ V84R/R100M/ G102R:Cyc1t 74 PLAS- Gal1p:OAC- Protein 101 All 561 V28A/G43SILENT (=GGG)/K44V/ T68R/I74E/V84R/ R100E/G102STOP: Cyc1t 75 PLAS- Gal1p:OAC- Protein 102 All 562 V31G/Y41T/ G43SILENT(=GGG)/ K44V/T68L/I74D/ V84R/G102R: Cyc1t 76 PLAS- Gal1p:OAC- Protein 102 All 563 Y41V/G43SILENT (=GGG)/K44V/ T68L/I74R/V84R/ G102R:Cyc1t 77 NpgA DNA 3564 1170-2201 78 DiPKS-1 DNA 11114 849-10292 79 DiPKS-2 DNA 10890 717-10160 80 DiPKS-3 DNA 11300 795-10238 81 DiPKS-4 DNA 11140 794-10237 82 DiPKS-5 DNA 11637 1172-10615 83 PDH DNA 7114 Ald6: 1444- 2949 ACS: 3888- 5843 84 Maf1 DNA 3256 936-2123 85 Erg20K197E DNA 4254 2842-3900 (4538) 86 Erg1p:UB14- DNA 3503 1364-2701 Erg20:deg 87 tHMGr-IDI DNA 4843 tHMGR1: 885-2393 IDI1: 3209- 4075 88 PGK1p: DNA 7673 Pgk1p: 222- ACC1S659A, S1157A 971 Acc1mut: 972-7673 89 PT254-R2S DNA 4707 1957-2925 90 Ost1-pro-alpha- DNA 4137 1615-3168 f(I)-OXC53 91 PLAS- Wild Type OAC Protein 102 All 417 92 PLAS- Variant OAC Protein 102 All 417 93 PLAS- mScarlet Protein 232 All 416 - Modifications to base strains used herein are outlined below in Table 6.
-
TABLE 6 Modifications to Base Strains Modifi- SEQ Integration Genetic cation ID Region/ Structure of # name NO. Plasmid Description Sequence 1 NpgA 77 Flagfeldt Phosphopantetheinyl Site14Up::Tef1p: Site 14 Transferase from Aspergillus NpgA:Prm9t:Site integration niger. Accessory Protein for 14Down DiPKS (Kim et al., 2015) 2 DiPKS-1 78 USER Site Type 1 FAS fused to Type 3XII- XII-1 PKS from D. discoideum. 1up::Gal1p: integration Produces Olivetol from malonyl- DiPKSG1516R: coA Prm9t::XII1-down 3 DiPKS-2 79 Wu site 1Type 1 FAS fused toType 3Wu1up::Gal1p: integration PKS from D. discoideum. DiPKSG1516R: Produces Olivetol from malonyl- Prm9t:: Wu1down COA 4 DiPKS-3 80 Wu site 3Type 1 FAS fused toType 3Wu3up::Gal1p: integration PKS from D. discoideum. DiPKSG1516R: Produces Olivetol from malonyl- Prm9t:: Wu3down coA 5 DiPKS-4 81 Wu site 6Type 1 FAS fused toType 3Wu6up::Gal1p: integration PKS from D. discoideum. DiPKSG1516R: Produces Olivetol from malonyl- Prm9t:: Wu6down COA 6 DiPKS-5 82 Wu site 18 Type 1 FAS fused toType 3Wu18up::Gal1p: integration PKS from D. discoideum. DiPKSG1516R: Produces Olivetol from malonyl- Prm9t::Wu18down COA 7 PDH 83 Flagfeldt Acetaldehyde dehydrogenase 19Up::Tdh3p: Site 19 (ALD6) from S. cerevisiae and Ald6:Adh1::Tef1p: integration acetoacetyl COA synthase seACS1L641P: (AscL641P) from Salmonella Prm9t::19Down enterica. Will allow greater accumulation of acetyl-coA in the cell (Shiba et al., 2007). 8 Maf1 84 Flagfeldt Maf1 is a regulator of tRNA Site5Up::Tef1p: Site 5biosynthesis. Overexpression in Maf1:Prm9t: integration S. cerevisiae has demonstrated Site5Down higher monoterpene (GPP) yields (Liu et al., 2013). 9 Erg20K197E 85 Chromosomal Mutant of Erg20 protein that Tpi1t:ERG20K197E: modification diminishes FPP synthase Cyc1t::Tef1p: activity creating greater pool of KanMX:Tef1t GPP precursor. Negatively affects growth phenotype (Oswald et al., 2007). 10 Erg1p: 86 Flagfeldt Sterol responsive promoter Site18Up::Erg1p: UB14- Site 18 controlling Erg20 protein UB14deg:ERG20: Erg20: integration activity. Allows for regular FPP Adh1t:Site18down deg synthase activity and uninhibited growth phenotype until accumulation of sterols which leads to a suppression of expression of enzyme (Peng et al., 2018). 11 tHMGr- 87 USER Site Overexpression of truncated X3up::Tdh3p: IDI X-3 HMGr1 and IDI1 proteins that tHMGR1:Adh1t:: integration have been previously identified Tef1p:IDI1:Prm9t:: to be bottlenecks in the S. X3down cerevisiae terpenoid pathway responsible for GPP production (Ro et al., 206). 12 PGK1p: 88 Chromosomal Mutations in the native S. Pgk1: ACC1S659A, S1157A modification cerevisiae acetyl-coA ACC1S659A, S1157A: carboxylase that removes post- Acc1t translational modification based down-regulation. Leads to greater malonyl-coA pools. The promoter of Acc1 was also changed to a constitutive promoter for higher expression (Shi et al., 2014). 13 PT254- 89 Flagfeldt The Cannabis sativa FgF18up::Tef1p: R2S Site 18 prenyltransferase PT254 allows R2S- integration CBGa to be produced from PT254:Cyct:: olivetolic acid and geranyl FgF20down pyrophosphate (Luo et al., 2019). The N terminal arginine of this enzyme has been replaced with a serine in order to enhance protein stability in accordance with N-end rule (Varshavsky 1996). 14 Ost1- 90 Apel-3 Δ28THCa synthase (OXC53) Apel3up::Tef1p: pro- from C. sativa. (Sirikantaramas Ost1-pro-alpha-f(I)- alpha- et al., 2005). Fused with a Ost1- OXC53t:Cyct:: f(I)- pro-alpha-f(I) tag. Produces Apel3down OXC53 THCa from CBGa - Results:
- Identification of Variants that Demonstrate Improved Production of Olivetolic Acid (OVLa) and Downstream Cannabinoids
- An OAC mutants library was constructed in a plasmid regulated by the Gall p promoter, and expressed in an olivetol-producing background strain (HB1416) harboring downstream enzymes of the cannabinoid production pathway. The strains expressing wild type OAC (HB1891) and mScarlet fluorescent protein (HB1892) were utilized as control in the screening to facilitate identification of OAC mutant hits with improved activity.
-
FIG. 5 shows olivetolic acid production by engineered OACs strains. The measured values of each cannabinoid are shown inFIG. 5 . The mutants tested had the silent mutation G43G(GGG) present, which is an artifact of the plasmid construction process and not relevant to increased enzyme performance. - Table 7 shows production of olivetol, olivetolic acid and downstream cannabinoids in OAC wild type and engineered OACs strains.
-
TABLE 7 Production of Olivetol, Olivetolic Acid and Downstream Cannabinoids Total Down- stream Meta- bolites # of Olivetolic (OVLa, non- Olivetol Acid CBGa THCa CBGa, conservative Strain OAC mutant (mg/L) (mg/L) (mg/L) (mg/L) THCa) mutations HB1891 Wild type OAC 53.367 53.850 8.283 2.367 64.500 NA HB1892 RFP negative— 84.833 5.717 0.517 0.150 6.383 NA no OAC PLT1577-D10 Gal1p:OAC- 70.633 110.800 13.933 2.700 127.433 6 V28A/Y41T/ G43SILENT(=GGG)/ T68L/I74E/V84R/ R100M/G102R:Cyc1t PLT1575-D12 Gal1p:OAC-Y41S/ 77.200 94.367 13.200 4.133 111.700 5 G43SILENT(=GGG)/ K44V/T68R/ I74R/V84R:Cyc1t PLT1576-B9 Gal1p:OAC- 52.500 78.167 14.933 3.367 96.467 7 V28A/V31G/Y41S/ G43SILENT(=GGG)/ K44V/T68L/I74R/ V84R/R100E/ G102R:Cyc1t PLT1575-C12 Gal1p:OAC- 75.333 75.400 12.600 3.500 91.500 5 V28A/Y41T/ G43SILENT(=GGG)/ T68L/I74G/V84R/ R100E:Cyc1t PLT1575-D2 Gal1p:OAC- 81.400 73.267 12.800 2.700 88.767 7 V28A/Y41T/G43 SILENT(=GGG)/ K44V/T68L/I74D/ V84R/R100M/ G102R:Cyc1t PLT1572-B10 Gal1p:OAC- 45.433 72.400 8.433 2.867 83.700 5 V28A/Y41T/G43 SILENT(=GGG)/ T68L/I74D/V84R/ G102R:Cyc1t PLT1576-A9 Gal1p:OAC- 66.633 69.767 10.800 2.733 83.300 7 V28A/Y41T/G43 SILENT(=GGG)/ K44V/T68L/I74R/ V84R/R100E/ G102R:Cyc1t PLT1577-D1 Gal1p:OAC-Y41T/ 71.567 68.833 12.933 2.967 84.733 5 G43SILENT(=GGG)/ T68R/I74R/V84R/ R100M/G102STOP: Cyc1t PLT1577-B7 Gal1p:OAC- 77.800 68.800 10.767 2.533 82.100 7 V28A/Y41V/ G43SILENT(=GGG)/ K44V/T68L/I74G/ V84R/R100E/ G102R:Cyc1t PLT1575-F8 Gal1p:OAC- 81.667 68.533 12.167 3.067 83.767 5 V28A/Y41T/ G43SILENT(=GGG)/ K44V/T68R/I74G/ V84R/G102S TOP:Cyc1t PLT1574-H11 Gal1p:OAC- 72.833 67.200 11.533 3.333 82.067 6 V28A/Y41V/ G43SILENT(=GGG)/ K44V/T68L/I74R/ V84R/R100E/ G102STOP:Cyc1t PLT1575-E1 Gal1p:OAC- 85.100 66.433 11.500 3.467 81.400 6 V28A/V31G/Y41T/ G43SILENT(=GGG)/ K44V/T68R/ I74E/V84R/ R100E:Cyc1t PLT1575-G1 Gal1p:OAC- 71.200 64.200 9.300 2.367 75.867 5 V28A/Y41S/ G43SILENT(=GGG)/ T68R/I74R/V84R/ R100M/G102 STOP:Cyc1t PLT1576-A2 Gal1p:OAC-Y41T/ 56.633 62.267 12.300 3.067 77.633 6 G43SILENT(=GGG)/ K44V/ T68L/I74G/V84R/ G102R:Cyc1t PLT1576-B10 Gal1p:OAC- 63.633 61.600 12.533 2.967 77.100 7 V28A/Y41T/ G43SILENT(=GGG)/ K44V/T68R/I74R/ V84R/R100E/ G102R:Cyc1t PLT1573-D7 Gal1p:OAC- 58.467 61.033 8.767 2.833 72.633 5 V28A/V31G/Y41S/ G43SILENT(=GGG)/ K44V/T68R/ I74R/V84R:Cyc1t PLT1575-H10 Gal1p:OAC-V28A/ 72.533 60.400 11.467 2.600 74.467 5 G43SILENT(=GGG)/ K44V/I74D/ V84R/R100E/ G102R(=CGC):Cyc1t PLT1574-F5 Gal1p:OAC- 71.300 59.400 10.767 2.867 73.033 6 V28A/Y41V/ G43SILENT(=GGG)/ K44V/T68L/I74G/ V84R/G102R:Cyc1t PLT1573-D8 Gal1p:OAC-Y41T/ 58.367 58.167 7.733 2.800 68.700 6 G43SILENT(=GGG)/ T68L/I74G/ V84R/R100M/ G102R:Cyc1t PLT1574-A11 Gal1p:OAC- 65.033 58.133 8.167 3.067 69.367 7 V28A/V31G/Y41T/ G43SILENT(=GGG)/ K44V/T68R/ I74R/V84R/ R100E/G102R:Cyc1t PLT1574-B9 Gal1p:OAC- 75.800 57.133 9.600 2.700 69.433 5 V31G/Y41S/ G43SILENT(=GGG)/ K44V/T68R/V84R/ R100E:Cyc1t PLT1576-B5 Gal1p:OAC- 72.900 55.833 9.900 2.467 68.200 4 V28A/Y41V/G43 SILENT(=GGG)/ T68L/I74D/R100E/ G102STOP:Cyc1t PLT1573-F2 Gal1p:OAC- 71.900 55.433 8.367 2.567 66.367 6 V28A/Y41V/ G43SILENT(=GGG)/ T68R/I74G/V84R/ R100M/G102R:Cyc1t PLT1572-H10 Gal1p:OAC-V31G/ 49.733 54.100 6.700 2.433 63.233 3 G43SILENT(=GGG)/ I74G/V84R/ R100E:Cyc1t PLT1573-E12 Gal1p:OAC- 56.833 54.000 7.100 2.433 63.533 7 V28A/Y41S/ G43SILENT(=GGG)/ K44V/T68R/I74R/ V84R/R100M/ G102R:Cyc1t PLT1573-D12 Gal1p:OAC- 57.633 52.500 6.967 2.467 61.933 5 V28A/V31G/Y41V/ G43SILENT(=GGG)/ K44V/T68L/ I74G/V84R:Cyc1t PLT1573-B8 Gal1p:OAC- 53.267 51.367 6.733 2.133 60.233 7 V28A/Y41V/ G43SILENT(=GGG)/ K44V/T68L/I74G/ V84R/R100M/ G102R:Cyc1t PLT1573-A2 Gal1p:OAC- 66.700 49.600 6.900 2.467 58.967 5 V28A/V31G/ G43SILENT(=GGG)/ T68L/I74R/V84R/ R100E/G102R:Cyc1t PLT1576-B4 Gal1p:OAC- 70.700 47.633 10.833 2.600 61.067 6 V31G/Y41V/ G43SILENT(=GGG)/ K44V/T68L/I74R/ V84R/R100E/ G102STOP:Cyc1t PLT1573-G12 Gal1p:OAC- 71.933 46.133 7.600 2.400 56.133 6 V31G/Y41T/ G43SILENT(=GGG)/ K44V/T68R/I74D/ V84R/G102R:Cyc1t PLT1576-G12 Gal1p:OAC- 56.133 45.800 10.667 2.467 58.933 7 V31G/Y41T/ G43SILENT(=GGG)/ K44V/T68R/I74D/ V84R/R100E/ G102R:Cyc1t PLT1575-H5 Gal1p:OAC- 59.733 44.533 10.500 2.467 57.500 7 V28A/Y41S/ G43SILENT(=GGG)/ K44V/T68R/I74G/ V84R/R100M/ G102R:Cyc1t PLT1576-A6 Gal1p:OAC- 63.500 44.400 9.533 2.367 56.300 4 V28A/Y41V/ G43SILENT(=GGG)/ K44V/I74R/R100E/ G102STOP:Cyc1t PLT1576-D1 Gal1p:OAC- 67.767 41.500 9.867 2.533 53.900 6 V28A/V31G/Y41T/ G43SILENT(=GGG)/ K44V/I74E/ V84R/R100M/ G102R:Cyc1t PLT1572-E8 Gal1p:OAC-V28A/ 29.467 32.433 2.833 1.167 36.433 5 G43SILENT(=GGG)/ K44V/ T68R/I74E/V84R/ R100E/G102 STOP:Cyc1t PLT1572-H9 Gal1p:OAC- 43.067 30.200 3.333 1.233 34.767 6 V31G/Y41T/ G43SILENT(=GGG)/ K44V/T68L/I74D/ V84R/G102R:Cyc1t PLT1575-F9 Gal1p:OAC-Y41V/ 106.267 13.633 2.033 0.533 16.200 6 G43SILENT(=GGG)/ K44V/T68L/ I74R/V84R/ G102R:Cyc1t - Table 8 illustrates the ratio of OVLa or downstream metabolites (CBGa, CBDa, THCa) to OVL in OAC variants.
-
TABLE 8 Ratio of OVLa or Downstream Metabolites (CBGa, CBDa, THCa) to OVL in OAC Variants # of non- Total conservative Downstream: Strain OAC mutant mutations OVLa:OVL ratio OVL HB1891 Wild type OAC NA 1.105 1.301 HB1892 RFP negative: NA 0.076 0.083 no OAC PLT1577-D10 Gal1p:OAC- 6 1.569 1.804 V28A/Y41T/ G43SILENT(=GGG)/ T68L/I74E/V84R/ R100M/G102R: Cyc1t PLT1575-D12 Gal1p:OAC- 5 1.227 1.451 Y41S/G43SILENT (=GGG)/K44V/ T68R/I74R/ V84R:Cyc1t PLT1576-B9 Gal1p:OAC- 7 1.485 1.839 V28A/V31G/ Y41S/G43SILENT (=GGG)/K44V/ T68L/I74R/V84R/ R100E/G102R: Cyc1t PLT1575-C12 Gal1p:OAC- 5 1.003 1.218 V28A/Y41T/ G43SILENT(=GGG)/ T68L/I74G/V84R/ R100E:Cyc1t PLT1575-D2 Gal1p:OAC- 7 0.901 1.091 V28A/Y41T/ G43SILENT(=GGG)/ K44V/T68L/ I74D/V84R/R100M/ G102R:Cyc1t PLT1572-B10 Gal1p:OAC- 5 1.609 1.856 V28A/Y41T/ G43SILENT(=GGG)/ T68L/I74D/V84R/ G102R:Cyc1t PLT1576-A9 Gal1p:OAC- 7 1.053 1.259 V28A/Y41T/ G43SILENT(=GGG)/ K44V/T68L/I74R/ V84R/R100E/ G102R:Cyc1t PLT1577-D1 Gal1p:OAC- 5 0.961 1.184 Y41T/G43SILENT (=GGG)/T68R/ I74R/V84R/ R100M/G102STOP: Cyc1t PLT1577-B7 Gal1p:OAC- 7 0.907 1.084 V28A/Y41V/ G43SILENT(=GGG)/ K44V/T68L/I74G/ V84R/R100E/G 102R:Cyc1t PLT1575-F8 Gal1p:OAC- 5 0.839 1.027 V28A/Y41T/ G43SILENT(=GGG)/ K44V/T68R/I74G/ V84R/G102STOP: Cyc1t PLT1574-H11 Gal1p:OAC- 6 0.929 1.136 V28A/Y41V/ G43SILENT(=GGG)/ K44V/T68L/I74R/ V84R/R100E/ G102STOP:Cyc1t PLT1575-E1 Gal1p:OAC- 6 0.783 0.959 V28A/V31G/ Y41T/G43SILENT (=GGG)/K44V/ T68R/I74E/V84R/ R100E:Cyc1t PLT1575-G1 Gal1p:OAC- 5 0.904 1.068 V28A/Y41S/ G43SILENT(=GGG)/ T68R/I74R/V84R/ R100M/G102 STOP:Cyc1t PLT1576-A2 Gal1p:OAC- 6 1.107 1.378 Y41T/G43SILENT (=GGG)/K44V/ T68L/I74G/V84R/ G102R:Cyc1t PLT1576-B10 Gal1p:OAC- 7 0.981 1.227 V28A/Y41T/ G43SILENT(=GGG)/ K44V/T68R/I74R/ V84R/R100E/ G102R:Cyc1t PLT1573-D7 Gal1p:OAC- 5 1.046 1.247 V28A/V31G/ Y41S/G43SILENT (=GGG)/K44V/ T68R/I74R/V84R: Cyc1t PLT1575-H10 Gal1p:OAC- 5 0.834 1.031 V28A/G43SILENT (=GGG)/K44V/ I74D/V84R/R100E/ G102R(=CGC): Cyc1t PLT1574-F5 Gal1p:OAC- 6 0.838 1.030 V28A/Y41V/ G43SILENT(=GGG)/ K44V/T68L/I74G/ V84R/G102R: Cyc1t PLT1573-D8 Gal1p:OAC- 6 0.994 1.174 Y41T/G43SILENT (=GGG)/T68L/ I74G/V84R/R100M/ G102R:Cyc1t PLT1574-A11 Gal1p:OAC- 7 0.890 1.061 V28A/V31G/ Y41T/G43SILENT (=GGG)/K44V/ T68R/I74R/V84R/ R100E/G102R: Cyc1t PLT1574-B9 Gal1p:OAC- 5 0.757 0.921 V31G/Y41S/ G43SILENT(=GGG)/ K44V/T68R/ V84R/R100E:Cyc1t PLT1576-B5 Gal1p:OAC- 4 0.767 0.937 V28A/Y41V/ G43SILENT(=GGG)/ T68L/I74D/R100E/ G102STOP: Cyc1t PLT1573-F2 Gal1p:OAC- 6 0.783 0.936 V28A/Y41V/ G43SILENT(=GGG)/ T68R/I74G/ V84R/R100M/ G102R:Cyc1t PLT1572-H10 Gal1p:OAC- 3 1.102 1.287 V31G/G43SILENT (=GGG)/I74G/ V84R/R100E: Cyc1t PLT1573-E12 Gal1p:OAC- 7 0.951 1.119 V28A/Y41S/ G43SILENT(=GGG)/ K44V/T68R/I74R/ V84R/R100M/ G102R:Cyc1t PLT1573-D12 Gal1p:OAC- 5 1.082 1.267 V28A/V31G/ Y41V/G43SILENT (=GGG)/K44V/ T68L/I74G/V84R: Cyc1t PLT1573-B8 Gal1p:OAC- 7 0.986 1.152 V28A/Y41V/ G43SILENT(=GGG)/ K44V/T68L/ I74G/V84R/R100M/ G102R:Cyc1t PLT1573-A2 Gal1p:OAC- 5 0.753 0.894 V28A/V31G/ G43SILENT(=GGG)/ T68L/I74R/V84R/ R100E/G102R: Cyc1t PLT1576-B4 Gal1p:OAC- 6 0.679 0.873 V31G/Y41V/ G43SILENT(=GGG)/ K44V/T68L/I74R/ V84R/R100E/ G102STOP:Cyc1t PLT1573-G12 Gal1p:OAC- 6 0.638 0.777 V31G/Y41T/ G43SILENT(=GGG)/ K44V/T68R/I74D/ V84R/G102R: Cyc1t PLT1576-G12 Gal1p:OAC- 7 0.824 1.064 V31G/Y41T/ G43SILENT(=GGG)/ K44V/T68R/I74D/ V84R/R100E/ G102R:Cyc1t PLT1575-H5 Gal1p:OAC- 7 0.748 0.966 V28A/Y41S/ G43SILENT(=GGG)/ K44V/T68R/I74G/ V84R/R100M/ G102R:Cyc1t PLT1576-A6 Gal1p:OAC- 4 0.703 0.891 V28A/Y41V/ G43SILENT(=GGG)/ K44V/I74R/R100E/ G102STOP: Cyc1t PLT1576-D1 Gal1p:OAC- 6 0.614 0.798 V28A/V31G/ Y41T/G43SILENT (=GGG)/K44V/ I74E/V84R/R100M/ G102R:Cyc1t PLT1572-E8 Gal1p:OAC- 5 1.101 1.236 V28A/G43SILENT (=GGG)/K44V/ T68R/I74E/V84R/ R100E/G102STOP: Cyc1t PLT1572-H9 Gal1p:OAC- 6 0.729 0.838 V31G/Y41T/ G43SILENT(=GGG)/ K44V/T68L/I74D/ V84R/G102R: Cyc1t PLT1575-F9 Gal1p:OAC- 6 0.138 0.164 Y41V/G43SILENT (=GGG)/K44V/ T68L/I74R/V84R/ G102R:Cyc1t - Table 9 provides a summary of mutational frequency data.
-
TABLE 9 Mutational Frequency Data Occurrences Occurrences in in strains with strains Occurrences improved with Total in best 10 total improved occurrences olivetolic acid downstream OVLa:OVL in sequence producers metabolites ratios vs Mutation mutational type set (/37) (/10) vs WT (/24) WT (/5) V28A Conservative 26 8 18 3 Y41T Non- 16 7 12 3 conservative T68L Non- 18 7 12 4 conservative I74E Non- 4 1 2 1 conservative V84R Non- 35 10 22 5 conservative R100M Non- 10 3 6 1 conservative G102R Non- 22 6 13 5 conservative K44V Non- 27 6 15 3 conservative T68R Non- 15 3 10 1 conservative V31G Conservative 13 2 5 1 I74R Non- 14 4 9 2 conservative R100E Non- 17 4 11 1 conservative I74D Non- 7 2 4 1 conservative G102S Non- 8 2 5 0 conservative Y41S Non- 7 1 5 2 conservative Y41V Non- 10 1 5 0 conservative I74G Conservative 11 3 7 2 G102STOP Conservative 8 2 6 0 G43SILENT Conservative 37 10 24 5 (=GGG - Use in Host Cells
- Phytocannabinoids, such as tetrahydrocannabinol (THC) and cannabidiol (CBD), can be extracted from plant material for medical and psychotropic purposes. However, the synthesis of plant material is costly, not readily scalable to large volumes, and requires a lengthy grow periods to produce sufficient quantities of phytocannabinoids. An organism capable of fermentation, such as Saccharomyces cerevisiae, that is capable of producing cannabinoids would provide an economical route to producing these compounds on an industrial scale.
- The early stages of the cannabinoid pathway proceeds via the generation of olivetolic acid by the type III PKS olivetolic acid synthase (OAS) and cyclase olivetolic acid cyclase (OAC). This reaction uses a hexanoyl-CoA starter as well as three units of malonyl-CoA. Olivetolic acid is the backbone of most classical cannabinoids and can be prenylated to form CBGA, which is ultimately converted to CBDA or THCA by an oxidocyclase. Production of olivetolic acid in S. cerevisiae is challenging as OAS generates significant by-products such as HTAL, PDAL and olivetol. These by-products can be reduced in a recombinant organism by the introduction of olivetolic acid cyclase (OAC) but even with this enzyme by-products can account for up to 80% of the total carbon in the reaction.
- Table 10 lists specific examples of host cell organisms in which the described OAC variants may be utilized for preparation of cannabinoids in the described pathways.
-
TABLE 10 List of Host Cell Organisms Type Organisms Bac- Escherichia coli, Streptomyces coelicolor and other species., teria Bacillus subtilis, Mycoplasma genitalium, Synechocytis, Zymomonas mobilis, Corynebacterium glutamicum, Synechococcus sp., Salmonella typhi, Shigella flexneri, Shigella sonnei, and Shigella disenteriae, Pseudomonas putida, Pseudomonas aeruginosa, Pseudomonas mevalonii, Rhodobacter sphaeroides, Rhodobacter capsulatus, Rhodospirillum rubrum, Rhodococcus sp. Fungi Saccharomyces cerevisiae, Ogataea polymorpha, Komagataella phaffii, Kluyveromyces lactis, Neurospora crassa, Aspergillus niger, Aspergillus nidulans, Schizosaccharomyces pombe, Yarrowia lipolytica, Myceliophthora thermophila, Aspergillus oryzae, Trichoderma reesei, Chrysosporium lucknowense, Fusarium sp., Fusarium gramineum, Fusarium venenatum, Pichia finlandica, Pichia trehalophila, Pichia koclamae, Pichia membranaefaciens, Pichia opuntiae, Pichia thermotolerans, Pichia salictaria, Pichia guercuum, Pichia pijperi, Pichia stipitis, Pichia methanolica, Hansenula polymorpha. Protists Chlamydomonas reinhardtii, Dictyostelium discoideum, Chlorella sp., Haematococcus pluvialis, Arthrospira platensis, Dunaliella sp., Nannochloropsis oceanica. Plants Cannabis sativa, Arabidopsis thaliana, Theobroma cacao, maize, banana, peanut, field peas, sunflower, Nicotiana sp., tomato, canola, wheat, barley, oats, potato, soybeans, cotton, sorghum, lupin, rice. - Phytocannabinoids may be produced in a host cell involving Dictyostelium discoideum polyketide synthase (DiPKS), olivetolic acid cyclase (OAC), prenyltransferases, and/or mutants of these, as described in Applicant's co-pending International Application No. PCT/CA2020/050687 (herein incorporated by reference). For example, a host cell transformed with a polyketide synthase coding sequence, an olivetolic acid cyclase coding sequence, and a prenyltransferase coding sequence may be prepared. The polyketide synthase and the olivetolic acid cyclase catalyze synthesis of olivetolic acid from malonyl CoA. The olivetolic acid cyclase may include wild type, or any of the functional mutants described herein. The host cell may include a yeast cell, a bacterial cell, a protest cell or a plant cell, selected from among those listed in Table 10.
- Combinations of the methods, nucleotides, and expression vectors described herein as well as in Applicant's co-pending International Application No. PCT/CA2020/050687 may be employed together to produce phytocannabinoids, phytocannabinoid precursors such as polyketides. Depending on the desired product, selections of characteristics of the cells and methods employed may be selected to achieve production of the cannabinoid, cannabinoid precursor, or intermediate of interest.
- Methods of producing a phytocannabinoid may comprising culturing a host cell under suitable culture conditions to form a phytocannabinoid, said host cell comprising: a polynucleotide encoding a polyketide synthase (PKS) enzyme; a polynucleotide encoding an olivetolic acid cyclase (OAC) enzyme mutants as described herein; and a polynucleotide encoding a prenyltransferase (PT) enzyme; and optionally comprising: a polynucleotide encoding an acyl-CoA synthase (Alk) enzyme; a polynucleotide encoding a fatty acyl CoA activating (CsAAE) enzyme; and/or a polynucleotide encoding a THCa synthase (OXC) enzyme. An expression vector can be prepared comprising a polynucleotide encoding a polyketide synthase (PKS) enzyme; a polynucleotide encoding an olivetolic acid cyclase (OAC) enzyme mutants as described herein; and a polynucleotide encoding a prenyltransferase (PT) enzyme. The expression vector can optionally comprise a polynucleotide encoding an acyl-CoA synthase (Alk) enzyme; a polynucleotide encoding CsAAE1; and/or a polynucleotide encoding a THCa synthase (OXC) enzyme.
- In the preceding description, for purposes of explanation, numerous details are set forth in order to provide a thorough understanding of the embodiments. However, it will be apparent to one skilled in the art that these specific details are not required.
- The embodiments described herein are intended to be examples only. Alterations, modifications and variations can be effected to the particular embodiments by those of skill in the art. The scope of the claims should not be limited by the particular embodiments set forth herein, but should be construed in a manner consistent with the specification as a whole.
- The invention being thus described, it will be obvious that the same may be varied in many ways. Such variations are not to be regarded as a departure from the spirit and scope of the invention, and all such modification as would be obvious to one skilled in the art are intended to be included within the scope of the following claims.
- All publications, patents and patent applications mentioned in this Specification are indicative of the level of skill those skilled in the art to which this invention pertains and are herein incorporated by reference to the same extent as if each individual publication patent, or patent application was specifically and individually indicated to be incorporated by reference.
-
- U.S. Pat. No. 7,361,482
- U.S. Pat. No. 8,884,100 (Page et al.) Aromatic Prenyltransferase from Cannabis.
- WO2018148848 (Mookerjee et al.) publication of PCT/CA2018/050189, METHOD AND CELL LINE FOR PRODUCTION OF PHYTOCANNABINOIDS AND PHYTOCANNABINOID ANALOGUES IN YEAST
- WO2018148849 (Mookerjee et al.) publication of PCT/CA2018/050190, METHOD AND CELL LINE FOR PRODUCTION OF POLYKETIDES IN YEAST
-
- Bai Flagfeldt, D., Siewers, V., Huang, L. and Nielsen, J. (2009) “Characterization of chromosomal integration sites for heterologous gene expression in Saccharomyces cerevisiae” Yeast, 26, 545-551.
- Eisenberg, D., Schwarz, E., Komaromy, M., & Wall, R. (1984). “Analysis of membrane and surface protein sequences with the hydrophobic moment plot”. Journal of Molecular Biology, 179(1), 125-142. https://doi.org/10.1016/0022-2836(84)90309-7.
- Gagne, S. J., et al. (2012) “Identification of Olivetolic Acid Cyclase from Cannabis Sativa Reveals a Unique Catalytic Route to Plant Polyketides.” Proceedings of the National Academy of Sciences, vol. 109, no. 31, 2012, pp. 12811-12816. doi:10.1073/pnas.1200330109.
- Ghosh, R., A. Chhabra, P. A. Phatale, S. K. Samrat, J. Sharma, A. Gosain, D. Mohanty, S. Saran and R. S. Gokhale (2008) “Dissecting the Functional Role of Polyketide Synthases in Dictyostelium discoideum biosynthesis of the differentiation regulating factor 4-methyl-5-pentylbenzene-1,3-diol” Journal of Biological Chemistry, 283(17), 11348-11354.
- Gietz, R. D. and Schiestl, R. H., (2007) “High-efficiency yeast transformation using the LiAc/SS carrier DNA/PEG method.” Nat. Protoc. 2, 31-34.
- Gietz R. D. (2014) Yeast Transformation by the LiAc/SS Carrier DNA/PEG Method (pp 1-12). In: Smith J., Burke D. (eds) Yeast Genetics. Methods in Molecular Biology (Methods and Protocols), vol 1205. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-1363-3_1.
- Jensen, N. B., Strucko, T., Kildegaard, K. R., David, F., et al., (2014). EasyClone: method for iterative chromosomal integration of multiple genes in Saccharomyces cerevisiae. FEMS Yeast Research, Volume 14,
Issue 2, pages 238-248; https://doi.org/10.1111/1567-1364.12118. - Kim, J.-M., Song, H.-Y., Choi, H.-J., So, K.-K., Kim, D.-H., Chae, K.-S., . . . Jahng, K.-Y. (2015). “Characterization of NpgA, a 4′-phosphopantetheinyl transferase of Aspergillus nidulans, and evidence of its involvement in fungal growth and formation of conidia and cleistothecia for development.” Journal of Microbiology, 53(1), 21-31 https://doi.org/10.1007/s12275-015-4657-8.
- Kuzuyama et al. (2005) Structural basis for the promiscuous biosynthetic prenylation of aromatic natural products, Nature, volume 435, pages 983-987; doi: 10.1038/nature03668.
- Liu, J., Zhang, W., Du, G., Chen, J., & Zhou, J. (2013). “Overproduction of geraniol by enhanced precursor supply in Saccharomyces cerevisiae.” Journal of Biotechnology, 168(4), 446-451. https://doi.org/10.1016/J.JBIOTEC.2013.10.017.
- Luo, X., Reiter, M., d'Espaux, L., Wong, J., Denby, C., Lechner, A., Zhang, Y., Grzybowski, A., Harth, S., Lin, W., Lee, H., Yu, C., Shin, J., Deng, K., Benites, V., Wang, G., Baidoo, E., Chen, Y., Dev, I., Petzold, C. and Keasling, J. (2019). “Complete biosynthesis of cannabinoids and their unnatural analogues in yeast.” Nature, 567(7746), pp. 123-126.
- Oswald, Marilyne; Marc Fischer, Nicole Dirninger, Francis Karst, (2007) “Monoterpenoid biosynthesis in Saccharomyces cerevisiae.” FEMS Yeast Research, 7(3), 413-421. https://doi.org/10.1111/j.1567-1364.2006.00172.x
- Peng, B., Nielsen, L. K., Kampranis, S. C., & Vickers, C. E. (2018). Engineered protein degradation of farnesyl pyrophosphate synthase is an effective regulatory mechanism to increase monoterpene production in Saccharomyces cerevisiae. Metabolic Engineering, 47, 83-93. https://doi.org/10.1016/J.YMBEN.2018.02.005.
- Ro, D.-K., Paradise, E. M., Ouellet, M., Fisher, K. J., Newman, K. L., Ndungu, J. M., Keasling, J. D. (2006). Production of the antimalarial drug precursor artemisinic acid in engineered yeast. Nature, 440(7086), 940-943. JOUR. https://doi.org/10.1038/nature04640.
- Ryan, O. W., Poddar, S., & Cate, J. H. D. (2016). CRISPR-Cas9 Genome Engineering in Saccharomyces cerevisiae Cells. Cold Spring Harbor Protocols, 2016(6), pdb.prot086827. https://doi.org/10.1101/pdb.prot086827.
- Saeki, H., Hara, R., Takahashi, H., lijima, M., Munakata, R., Kenmoku, H., . . . Taura, F. (2018). An Aromatic Farnesyltransferase Functions in Biosynthesis of the Anti-HIV Meroterpenoid Daurichromenic Acid. Plant Physiology, 178(2), 535-551; https://doi.org/10.1104/PP.18.00655.
- Shi, S., Chen, Y., Siewers, V., & Nielsen, J. (2014). “Improving Production of Malonyl Coenzyme A-Derived Metabolites by Abolishing Snf1-Dependent Regulation of Acc1.” mBio, 5(3), e01130-14. https://doi.org/10.1128/mBio.01130-14.
- Shiba, Y., Paradise, E. M., Kirby, J., Ro, D.-K., & Keasling, J. D. (2007). “Engineering of the pyruvate dehydrogenase bypass in Saccharomyces cerevisiae for high-level production of isoprenoids.” Metabolic Engineering, 9(2), 160-168. https://doi.org/10.1016/J.YMBEN.2006.10.005.
- Sirikantaramas, S., Taura, F., Tanaka, Y., Ishikawa, Y., Morimoto, S., & Shoyama, Y. (2005). “Tetrahydrocannabinolic acid synthase, the enzyme controlling marijuana psychoactivity, is secreted into the storage cavity of the glandular trichomes”. Plant and Cell Physiology, 46(9), 1578-1582.
- Stout, J. M., Boubakir, Z., Ambrose, S. J., Purves, R. W., & Page, J. E. (2012). The hexanoyl-CoA precursor for cannabinoid biosynthesis is formed by an acyl-activating enzyme in Cannabis sativa trichomes. The Plant Journal, 71(3), 353-365.
- Taura, Futoshi, et al. (2009) “Characterization of olivetol synthase, a polyketide synthase putatively involved in cannabinoid biosynthetic pathway.” FEBS letters, 583.12 (2009): 2061-2066.
- Varshaysky, A. (2011). The N-end rule pathway and regulation by proteolysis. Protein Science 20(8):1285-1476. https://doi.org/10.1002/pro.666.
Claims (22)
1. A method of producing olivetolic acid (OVLa) or a phytocannabinoid produced therefrom in a heterologous host cell comprising OVLa-producing or phytocannabinoid-producing capacity, said method comprising:
transforming said host cell with a nucleotide encoding a variant olivetolic acid cyclase (OAC) protein having at least 6 amino acid mutations relative to the wild type OAC protein, and
culturing said transformed host cell to produce olivetolic acid and/or phytocannabinoids therefrom,
wherein said variant OAC protein comprises at least 85%, at least 90%, at least 95%, or at least 99% sequence identity with the wild type OAC protein sequence according to SEQ ID NO:91.
2. The method of claim 1 , wherein at least 4 of the at least 6 amino acid mutations are in residues selected from the group consisting of: residues 28, 31, 41, 43, 44, 68, 74, 84, 100 and 102 of SEQ ID NO:91.
3. The method according to claim 2 , wherein at least one of the at least 4 mutations in amino acid residue 28, 31, 41, 43, 44, 68, 74, 84, 100 or 102 is a conservative amino acid substitution.
4. The method according to claim 2 , wherein at least one of the at least 4 mutations in amino acid residue 28, 31, 41, 43, 44, 68, 74, 84, 100 or 102 is a non-conservative amino acid substitution.
5. The method of claim 1 , wherein said variant OAC protein has amino acid substitutions in at least 10, at least 9, at least 8, at least 7, at least 6, or at least 5 amino acid residues at positions 28, 31, 41, 43, 44, 68, 74, 84, 100 or 102.
6. The method according to claim 1 , wherein any amino acid mutation at a residue other than 28, 31, 41, 43, 44, 68, 74, 84, 100 or 102 is a conservative amino acid substitution.
7. The method according to claim 1 , wherein the nucleotide encoding the variant olivetolic acid cyclase (OAC) protein has a sequence comprising:
(a) a nucleotide sequence according to SEQ ID NO:5, SEQ ID NO: 3, SEQ ID NO:4, or SEQ ID NO:6-SEQ ID NO:39;
(b) a nucleotide sequence having at least 85%, at least 90%, at least 95%, or at least 99%, identity with the sequence of (a); or
(c) a nucleotide sequence that hybridizes with the complementary strand of the nucleotide having the sequence of (a).
8. The method according to claim 1 , wherein the variant OAC protein comprises a sequence selected from the group consisting of SEQ ID NO:42, SEQ ID NO:40, SEQ ID NO:41, and SEQ ID NO:43-SEQ ID NO:76, and sequences having, at least 90%, at least 95%, or at least 99% sequence identity therewith.
9. The method according to claim 1 , wherein at least 4 of the at least 6 amino acid mutations relative to the wild type OAC protein are selected from the group consisting of:
V28A;
V31G;
Y41T, Y41S or Y41V;
K44V;
T68L or T68R;
174E, 174R, 174D or 174G;
V84R;
R100M or R100E; and
G102R, G102S, or G102STOP.
10. The method of claim 1 , wherein the host cell is transformed with a nucleotide encoding variant olivetolic acid cyclase (OAC) protein with at least 85%, at least 90%, at least 95%, at least 99%, or 100% sequence identity of any one of the following sequences with the indicated substitutions from OAC wild type (SEQ ID NO:91) being present:
V28A/V31G/Y41S/G43SILENT(=GGG)/K44V/T68L/174R/V84R/R100E/G102R (SEQ ID NO:42),
V28A/Y41T/G43SILENT(=GGG)/T68L/174E/V84R/R100M/G102R (SEQ ID NO:40),
Y41S/G43SILENT(=GGG)/K44V/T68R/174R/V84R (SEQ ID NO:41),
V28A/Y41T/G43SILENT(=GGG)/T68L/174G/V84R/R100E (SEQ ID NO:43),
V28A/Y41T/G43SILENT(=GGG)/K44V/T68L/174D/V84R/R100M/G102R (SEQ ID NO:44),
V28A/Y41T/G43SILENT(=GGG)/T68L/174D/V84R/G102R (SEQ ID NO:45),
V28A/Y41T/G43SILENT(=GGG)/K44V/T68L/174R/V84R/R100E/G102R (SEQ ID NO:46),
Y41T/G43SILENT(=GGG)/T68R/174R/V84R/R100M/G102STOP (SEQ ID NO:47),
V28A/Y41V/G43SILENT(=GGG)/K44V/T68L/174G/V84R/R100E/G102R (SEQ ID NO:48),
V28A/Y41T/G43SILENT(=GGG)/K44V/T68R/174G/V84R/G102STOP (SEQ ID NO:49),
V28A/Y41T/G43SILENT(=GGG)/K44V/T68R/174G/V84R/G102STOP (SEQ ID NO:50),
V28A/V31G/Y41T/G43SILENT(=GGG)/K44V/T68R/174E/V84R/R100E (SEQ ID NO:51),
V28A/Y41S/G43SILENT(=GGG)/T68R/174R/V84R/R100M/G102STOP (SEQ ID NO:52),
Y41T/G43SILENT(=GGG)/K44V/T68L/174G/V84R/G102R (SEQ ID NO:53),
V28A/Y41T/G43SILENT(=GGG)/K44V/T68R/174R/V84R/R100E/G102R (SEQ ID NO:54),
V28A/V31G/Y41S/G4351LENT(=GGG)/K44V/T68R/174R/V84R (SEQ ID NO:55),
V28A/G43SILENT(=GGG)/K44V/174D/V84R/R100E/G102R(=CGC) (SEQ ID NO:56),
V28A/Y41V/G43SILENT(=GGG)/K44V/T68L/174G/V84R/G102R (SEQ ID NO:57),
Y41T/G43SILENT(=GGG)/T68L/174G/V84R/R100M/G102R (SEQ ID NO:58),
V28A/V31G/Y41T/G43SILENT(=GGG)/K44V/T68R/174R/V84R/R100E/G102R (SEQ ID NO:59),
V31G/Y41S/G43SILENT(=GGG)/K44V/T68R/V84R/R100E (SEQ ID NO:60),
V28A/Y41V/G43SILENT(=GGG)/T68L/174D/R100E/G102STOP (SEQ ID NO:61)
V28A/Y41V/G43SILENT(=GGG)/T68R/174G/V84R/R100M/G102R (SEQ ID NO:62),
V31G/G43SILENT(=GGG)/174G/V84R/R100E (SEQ ID NO:63),
V28A/Y41S/G43SILENT(=GGG)/K44V/T68R/174R/V84R/R100M/G102R (SEQ ID NO:64),
V28A/V31G/Y41V/G43SILENT(=GGG)/K44V/T68L/174G/V84R (SEQ ID NO:65),
V28A/Y41V/G43SILENT(=GGG)/K44V/T68L/174G/V84R/R100M/G102R (SEQ ID NO:66),
V28A/V31G/G43SILENT(=GGG)/T68L/174R/V84R/R100E/G102R (SEQ ID NO:67),
V31G/Y41V/G43SILENT(=GGG)/K44V/T68L/174R/V84R/R100E/G102STOP (SEQ ID NO:68),
V31G/Y41T/G43SILENT(=GGG)/K44V/T68R/174D/V84R/G102R (SEQ ID NO:69),
V31G/Y41T/G43SILENT(=GGG)/K44V/T68R/174D/V84R/R100E/G102R (SEQ ID NO:70),
V28A/Y41S/G43SILENT(=GGG)/K44V/T68R/174G/V84R/R100M/G102R (SEQ ID NO:71),
V28A/Y41V/G43SILENT(=GGG)/K44V/174R/R100E/G102STOP (SEQ ID NO:72),
V28A/V31G/Y41T/G43SILENT(=GGG)/K44V/174E/V84R/R100M/G102R (SEQ ID NO:73),
V28A/G43SILENT(=GGG)/K44V/T68R/174E/V84R/R100E/G102STOP (SEQ ID NO:74),
V31G/Y41T/G43SILENT(=GGG)/K44V/T68L/174D/V84R/G102R (SEQ ID NO:75), or
Y41V/G43SILENT(=GGG)/K44V/T68L/174R/V84R/G102R (SEQ ID NO:76).
11. The method of claim 1 , wherein said phytocannabinoid is cannabigerol (CBG), cannabigerolic acid (CBGa), cannabigerovarin (CBGV), cannabigerovarinic acid (CBGVa), cannabigerocin (CBGO), cannabigerocinic acid (CBGOa), a cannabivarin, tetrahydrocannabinol (THC), or tetrahydrocannabinolic acid (THCa).
12. The method of claim 1 , wherein the host cell additionally comprises a divarinic acid synthase and produces divarinic acid.
13. The method of claim 1 , wherein said host cell is a yeast cell, a bacterial cell, a fungal cell, a protist cell, or a plant cell said host cell is S. cerevisiae, E. coli, Yarrowia lipolytica, or Komagataella phaffii.
14. (canceled)
15. The method of claim 1 , wherein said transformed host cell additionally comprises a polynucleotide encoding a polyketide synthase enzyme and/or a polynucleotide encoding a prenyltransferase enzyme.
16. An isolated polypeptide having olivetolic acid cyclase activity comprising an amino acid sequence of at least 85%, at least 90%, at least 95%, or at least 99% sequence identity with SEQ ID NO: 92, wherein 6 or more amino acid residues comprise mutations relative to SEQ ID NO: 91, said mutations being located at positions selected from the group consisting of: residues 28, 31, 41, 43, 44, 68, 74, 84, 100 and 102 of SEQ ID NO:91.
17. The isolated polypeptide of claim 16 , wherein the isolated polypeptide comprises an amino acid sequence according to SEQ ID NO:42, SEQ ID NO:40, SEQ ID NO:41, or SEQ ID NO:43-SEQ ID NO:76.
18. An isolated polynucleotide comprising:
(a) a nucleotide sequence according to SEQ ID NO:5, SEQ ID NO: 3, SEQ ID NO:4, or SEQ ID NO:6-SEQ ID NO:39;
(b) a nucleotide sequence having at least 85%, at least 90%, at least 95%, or at least 99% identity with the nucleotide sequence of (a), or
(c) a nucleotide sequence that hybridizes with the complementary strand of the nucleotide having the sequence of (a).
19. An expression vector comprising the polynucleotide according to claim 18 , encoding a variant olivetolic acid cyclase (OAC) protein with a sequence according to SEQ ID NO:92 having 6 or more amino acid mutations relative to the wild type OAC protein.
20. The expression vector of claim 19 , wherein the polynucleotide encoding the variant OAC protein comprises the nucleotide sequence according to SEQ ID NO:5, SEQ ID NO: 3, SEQ ID NO:4, or SEQ ID NO:6-SEQ ID NO:39.
21. A host cell transformed with the expression vector of claim 19 , optionally wherein the host cell additionally comprises a polynucleotide encoding a polyketide synthase enzyme and/or a polynucleotide encoding a prenyltransferase enzyme.
22. (canceled)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US18/253,066 US20230416789A1 (en) | 2020-11-20 | 2021-11-17 | Olivetolic acid cyclase variants with improved activity for use in production of phytocannabinoids |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US16/953,638 US11427840B2 (en) | 2020-11-20 | 2020-11-20 | Olivetolic acid cyclase variants with improved activity for use in production of phytocannabinoids |
US18/253,066 US20230416789A1 (en) | 2020-11-20 | 2021-11-17 | Olivetolic acid cyclase variants with improved activity for use in production of phytocannabinoids |
PCT/CA2021/051626 WO2022104460A1 (en) | 2020-11-20 | 2021-11-17 | Olivetolic acid cyclase variants with improved activity for use in production of phytocannabinoids |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US16/953,638 Continuation US11427840B2 (en) | 2020-11-20 | 2020-11-20 | Olivetolic acid cyclase variants with improved activity for use in production of phytocannabinoids |
Publications (1)
Publication Number | Publication Date |
---|---|
US20230416789A1 true US20230416789A1 (en) | 2023-12-28 |
Family
ID=81707893
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US16/953,638 Active US11427840B2 (en) | 2020-11-20 | 2020-11-20 | Olivetolic acid cyclase variants with improved activity for use in production of phytocannabinoids |
US18/253,066 Pending US20230416789A1 (en) | 2020-11-20 | 2021-11-17 | Olivetolic acid cyclase variants with improved activity for use in production of phytocannabinoids |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US16/953,638 Active US11427840B2 (en) | 2020-11-20 | 2020-11-20 | Olivetolic acid cyclase variants with improved activity for use in production of phytocannabinoids |
Country Status (8)
Country | Link |
---|---|
US (2) | US11427840B2 (en) |
EP (1) | EP4247956A1 (en) |
JP (1) | JP2023550500A (en) |
KR (1) | KR20230110271A (en) |
CN (1) | CN116529374A (en) |
AU (1) | AU2021381020A1 (en) |
CA (1) | CA3195078A1 (en) |
WO (1) | WO2022104460A1 (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2023023621A1 (en) * | 2021-08-19 | 2023-02-23 | Willow Biosciences, Inc. | Recombinant olivetolic acid cyclase polypeptides engineered for enhanced biosynthesis of cannabinoids |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7186519B2 (en) | 2002-11-13 | 2007-03-06 | Trustees Of Dartmouth College | Method for detecting intracellular cholesterol |
WO2011017798A1 (en) | 2009-08-12 | 2011-02-17 | National Research Council Of Canada | Aromatic prenyltransferase from cannabis |
US11046978B2 (en) | 2016-03-16 | 2021-06-29 | William Marsh Rice University | Synthesis of isoprenoids and derivatives |
MX2019009708A (en) | 2017-02-17 | 2020-02-07 | Hyasynth Biologicals Inc | Method and cell line for production of phytocannabinoids and phytocannabinoid analogues in yeast. |
EP3622059A4 (en) * | 2017-05-10 | 2021-03-10 | BayMedica, Inc. | Recombinant production systems for prenylated polyketides of the cannabinoid family |
CA3134844A1 (en) * | 2019-04-19 | 2020-10-22 | Genomatica, Inc. | Olivetol synthase variants and methods for production of olivetolic acid and its analog compounds |
-
2020
- 2020-11-20 US US16/953,638 patent/US11427840B2/en active Active
-
2021
- 2021-11-17 JP JP2023530896A patent/JP2023550500A/en active Pending
- 2021-11-17 EP EP21893173.1A patent/EP4247956A1/en active Pending
- 2021-11-17 CA CA3195078A patent/CA3195078A1/en active Pending
- 2021-11-17 CN CN202180078158.9A patent/CN116529374A/en active Pending
- 2021-11-17 US US18/253,066 patent/US20230416789A1/en active Pending
- 2021-11-17 WO PCT/CA2021/051626 patent/WO2022104460A1/en active Application Filing
- 2021-11-17 KR KR1020237017474A patent/KR20230110271A/en unknown
- 2021-11-17 AU AU2021381020A patent/AU2021381020A1/en active Pending
Also Published As
Publication number | Publication date |
---|---|
AU2021381020A9 (en) | 2024-04-18 |
KR20230110271A (en) | 2023-07-21 |
WO2022104460A1 (en) | 2022-05-27 |
AU2021381020A1 (en) | 2023-06-22 |
EP4247956A1 (en) | 2023-09-27 |
CN116529374A (en) | 2023-08-01 |
US11427840B2 (en) | 2022-08-30 |
JP2023550500A (en) | 2023-12-01 |
US20220170055A1 (en) | 2022-06-02 |
CA3195078A1 (en) | 2022-05-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Lv et al. | Dual regulation of cytoplasmic and mitochondrial acetyl-CoA utilization for improved isoprene production in Saccharomyces cerevisiae | |
US20220259603A1 (en) | Methods and cells for microbial production of phytocannabinoids and phytocannabinoid precursors | |
US20220290194A1 (en) | Cannabidiolic acid synthase variants with improved activity for use in production of phytocannabinoids | |
JP2020530308A (en) | Production of cannabinoids in yeast | |
US8679804B2 (en) | Modified yeast strain and a method for producing squalene using the same | |
CN110144300B (en) | Recombinant yeast strain and application thereof in carotenoid synthesis | |
WO2020069142A1 (en) | Optimized expression systems for expressing berberine bridge enzyme and berberine bridge enzyme-like polypeptides | |
US20210403959A1 (en) | Use of type i and type ii polyketide synthases for the production of cannabinoids and cannabinoid analogs | |
US20230416789A1 (en) | Olivetolic acid cyclase variants with improved activity for use in production of phytocannabinoids | |
US20210010035A1 (en) | Production of manool | |
WO2022040475A1 (en) | Microbial production of cannabinoids | |
US11773414B2 (en) | Sesquiterpene synthases for production of drimenol and mixtures thereof | |
WO2022241298A2 (en) | Engineered cells, enzymes, and methods for producing cannabinoids | |
US20230257787A1 (en) | Methods and cells with modifying enzymes for producing substituted cannabinoids and precursors | |
US20220127648A1 (en) | Genetically engineered yeast yarrowia lipolytica and methods for producing bio-based glycolic acid | |
US20230416759A1 (en) | Transformant and method for producing carotenoid composition using same | |
US10351868B2 (en) | Engineering cyclopropane fatty acid accumulation in plants | |
CA3237656A1 (en) | Optimized biosynthesis pathway for cannabinoid biosynthesis | |
CN114599787A (en) | Sustainable production of cannabinoids from simple precursor materials using Saccharomyces Cerevisiae |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: HYASYNTH BIOLOGICALS INC., CANADA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:LIAO, TIMOTHY S.;SONG, LETIAN;HOM, LOUIS;AND OTHERS;SIGNING DATES FROM 20220112 TO 20220301;REEL/FRAME:063662/0462 |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |