WO2024124165A2 - Procédés et compositions de purification de cannabinoïdes - Google Patents
Procédés et compositions de purification de cannabinoïdes Download PDFInfo
- Publication number
- WO2024124165A2 WO2024124165A2 PCT/US2023/083173 US2023083173W WO2024124165A2 WO 2024124165 A2 WO2024124165 A2 WO 2024124165A2 US 2023083173 W US2023083173 W US 2023083173W WO 2024124165 A2 WO2024124165 A2 WO 2024124165A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- seq
- amino acid
- acid sequence
- culture medium
- synthase
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 111
- 229930003827 cannabinoid Natural products 0.000 title claims abstract description 82
- 239000003557 cannabinoid Substances 0.000 title claims abstract description 82
- 239000000203 mixture Substances 0.000 title claims abstract description 78
- 229940065144 cannabinoids Drugs 0.000 title description 19
- 108010002861 cannabichromenic acid synthase Proteins 0.000 claims abstract description 96
- SEEZIOZEUUMJME-FOWTUZBSSA-N cannabigerolic acid Chemical compound CCCCCC1=CC(O)=C(C\C=C(/C)CCC=C(C)C)C(O)=C1C(O)=O SEEZIOZEUUMJME-FOWTUZBSSA-N 0.000 claims abstract description 61
- HRHJHXJQMNWQTF-UHFFFAOYSA-N cannabichromenic acid Chemical compound O1C(C)(CCC=C(C)C)C=CC2=C1C=C(CCCCC)C(C(O)=O)=C2O HRHJHXJQMNWQTF-UHFFFAOYSA-N 0.000 claims abstract description 58
- SEEZIOZEUUMJME-VBKFSLOCSA-N Cannabigerolic acid Natural products CCCCCC1=CC(O)=C(C\C=C(\C)CCC=C(C)C)C(O)=C1C(O)=O SEEZIOZEUUMJME-VBKFSLOCSA-N 0.000 claims abstract description 50
- SEEZIOZEUUMJME-UHFFFAOYSA-N cannabinerolic acid Natural products CCCCCC1=CC(O)=C(CC=C(C)CCC=C(C)C)C(O)=C1C(O)=O SEEZIOZEUUMJME-UHFFFAOYSA-N 0.000 claims abstract description 50
- 238000000855 fermentation Methods 0.000 claims abstract description 32
- 230000004151 fermentation Effects 0.000 claims abstract description 32
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 614
- 238000006467 substitution reaction Methods 0.000 claims description 422
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 188
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 184
- 229920001184 polypeptide Polymers 0.000 claims description 183
- 150000001413 amino acids Chemical class 0.000 claims description 162
- 210000004027 cell Anatomy 0.000 claims description 139
- 239000001963 growth medium Substances 0.000 claims description 106
- 150000007523 nucleic acids Chemical class 0.000 claims description 104
- 102000039446 nucleic acids Human genes 0.000 claims description 88
- 108020004707 nucleic acids Proteins 0.000 claims description 88
- 101001120927 Cannabis sativa 3,5,7-trioxododecanoyl-CoA synthase Proteins 0.000 claims description 73
- 239000003921 oil Substances 0.000 claims description 35
- 235000019198 oils Nutrition 0.000 claims description 35
- 238000004519 manufacturing process Methods 0.000 claims description 33
- 230000002255 enzymatic effect Effects 0.000 claims description 28
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims description 23
- 241000196324 Embryophyta Species 0.000 claims description 21
- 101000997933 Mycobacterium tuberculosis (strain ATCC 25618 / H37Rv) (2E,6E)-farnesyl diphosphate synthase Proteins 0.000 claims description 21
- 101001015102 Mycobacterium tuberculosis (strain ATCC 25618 / H37Rv) Dimethylallyltranstransferase Proteins 0.000 claims description 21
- 238000005119 centrifugation Methods 0.000 claims description 19
- 229920000858 Cyclodextrin Polymers 0.000 claims description 16
- 238000001704 evaporation Methods 0.000 claims description 15
- HFHDHCJBZVLPGP-UHFFFAOYSA-N schardinger α-dextrin Chemical compound O1C(C(C2O)O)C(CO)OC2OC(C(C2O)O)C(CO)OC2OC(C(C2O)O)C(CO)OC2OC(C(O)C2O)C(CO)OC2OC(C(C2O)O)C(CO)OC2OC2C(O)C(O)C1OC2CO HFHDHCJBZVLPGP-UHFFFAOYSA-N 0.000 claims description 14
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 claims description 12
- 238000003756 stirring Methods 0.000 claims description 11
- GVVPGTZRZFNKDS-YFHOEESVSA-N Geranyl diphosphate Natural products CC(C)=CCC\C(C)=C/COP(O)(=O)OP(O)(O)=O GVVPGTZRZFNKDS-YFHOEESVSA-N 0.000 claims description 10
- 235000019486 Sunflower oil Nutrition 0.000 claims description 10
- 239000010779 crude oil Substances 0.000 claims description 10
- GVVPGTZRZFNKDS-JXMROGBWSA-N geranyl diphosphate Chemical compound CC(C)=CCC\C(C)=C\CO[P@](O)(=O)OP(O)(O)=O GVVPGTZRZFNKDS-JXMROGBWSA-N 0.000 claims description 10
- 239000002600 sunflower oil Substances 0.000 claims description 10
- 239000004094 surface-active agent Substances 0.000 claims description 10
- 238000006114 decarboxylation reaction Methods 0.000 claims description 9
- 150000002148 esters Chemical class 0.000 claims description 9
- 150000002191 fatty alcohols Chemical class 0.000 claims description 9
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 claims description 8
- 238000012258 culturing Methods 0.000 claims description 8
- 239000000828 canola oil Substances 0.000 claims description 6
- 235000019519 canola oil Nutrition 0.000 claims description 6
- 239000008169 grapeseed oil Substances 0.000 claims description 6
- 239000003549 soybean oil Substances 0.000 claims description 6
- 235000012424 soybean oil Nutrition 0.000 claims description 6
- 210000005253 yeast cell Anatomy 0.000 claims description 6
- 235000019485 Safflower oil Nutrition 0.000 claims description 5
- 102000012479 Serine Proteases Human genes 0.000 claims description 5
- 108010022999 Serine Proteases Proteins 0.000 claims description 5
- 239000004359 castor oil Substances 0.000 claims description 5
- 235000019438 castor oil Nutrition 0.000 claims description 5
- 239000001913 cellulose Substances 0.000 claims description 5
- 229920002678 cellulose Polymers 0.000 claims description 5
- ZEMPKEQAKRGZGQ-XOQCFJPHSA-N glycerol triricinoleate Natural products CCCCCC[C@@H](O)CC=CCCCCCCCC(=O)OC[C@@H](COC(=O)CCCCCCCC=CC[C@@H](O)CCCCCC)OC(=O)CCCCCCCC=CC[C@H](O)CCCCCC ZEMPKEQAKRGZGQ-XOQCFJPHSA-N 0.000 claims description 5
- 239000007788 liquid Substances 0.000 claims description 5
- 235000005713 safflower oil Nutrition 0.000 claims description 5
- 239000003813 safflower oil Substances 0.000 claims description 5
- 239000000377 silicon dioxide Substances 0.000 claims description 4
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 3
- 101710084186 Acetyl-coenzyme A synthetase Proteins 0.000 claims description 3
- 102100035709 Acetyl-coenzyme A synthetase, cytoplasmic Human genes 0.000 claims description 3
- 101710194784 Acetyl-coenzyme A synthetase, cytoplasmic Proteins 0.000 claims description 3
- 238000004460 liquid liquid chromatography Methods 0.000 claims description 3
- 238000000622 liquid--liquid extraction Methods 0.000 claims description 3
- 238000005192 partition Methods 0.000 claims description 3
- 238000007127 saponification reaction Methods 0.000 claims description 3
- 238000000638 solvent extraction Methods 0.000 claims description 3
- 102000004190 Enzymes Human genes 0.000 abstract description 64
- 108090000790 Enzymes Proteins 0.000 abstract description 64
- 230000000694 effects Effects 0.000 abstract description 19
- 235000001014 amino acid Nutrition 0.000 description 499
- 229940024606 amino acid Drugs 0.000 description 160
- 102220635684 Inner centromere protein_V169E_mutation Human genes 0.000 description 145
- 102220359218 c.223C>G Human genes 0.000 description 141
- 102220058620 rs869025194 Human genes 0.000 description 126
- 102220502546 Mitogen-activated protein kinase-binding protein 1_S140A_mutation Human genes 0.000 description 125
- 229940088598 enzyme Drugs 0.000 description 63
- 108090000623 proteins and genes Proteins 0.000 description 53
- ZSLZBFCDCINBPY-ZSJPKINUSA-N acetyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 ZSLZBFCDCINBPY-ZSJPKINUSA-N 0.000 description 35
- 230000002209 hydrophobic effect Effects 0.000 description 35
- 230000014509 gene expression Effects 0.000 description 33
- 238000006243 chemical reaction Methods 0.000 description 32
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 26
- 102000004169 proteins and genes Human genes 0.000 description 25
- 235000018102 proteins Nutrition 0.000 description 24
- UVOLYTDXHDXWJU-UHFFFAOYSA-N Cannabichromene Chemical compound C1=CC(C)(CCC=C(C)C)OC2=CC(CCCCC)=CC(O)=C21 UVOLYTDXHDXWJU-UHFFFAOYSA-N 0.000 description 22
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 22
- 108020004705 Codon Proteins 0.000 description 22
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 22
- 229910052799 carbon Inorganic materials 0.000 description 22
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 21
- 239000000758 substrate Substances 0.000 description 21
- 108091028043 Nucleic acid sequence Proteins 0.000 description 20
- 238000007792 addition Methods 0.000 description 20
- 239000002243 precursor Substances 0.000 description 20
- 230000002068 genetic effect Effects 0.000 description 19
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 18
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 18
- 239000003795 chemical substances by application Substances 0.000 description 18
- UVOLYTDXHDXWJU-NRFANRHFSA-N Cannabichromene Natural products C1=C[C@](C)(CCC=C(C)C)OC2=CC(CCCCC)=CC(O)=C21 UVOLYTDXHDXWJU-NRFANRHFSA-N 0.000 description 17
- ORKZJYDOERTGKY-UHFFFAOYSA-N Dihydrocannabichromen Natural products C1CC(C)(CCC=C(C)C)OC2=CC(CCCCC)=CC(O)=C21 ORKZJYDOERTGKY-UHFFFAOYSA-N 0.000 description 17
- 229930182830 galactose Natural products 0.000 description 16
- FUZZWVXGSFPDMH-UHFFFAOYSA-N hexanoic acid Chemical compound CCCCCC(O)=O FUZZWVXGSFPDMH-UHFFFAOYSA-N 0.000 description 16
- 244000005700 microbiome Species 0.000 description 16
- 230000001105 regulatory effect Effects 0.000 description 16
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 15
- 108030006655 Olivetolic acid cyclases Proteins 0.000 description 15
- 125000000129 anionic group Chemical group 0.000 description 15
- 230000001965 increasing effect Effects 0.000 description 15
- 108091033319 polynucleotide Proteins 0.000 description 15
- 102000040430 polynucleotide Human genes 0.000 description 15
- 239000002157 polynucleotide Substances 0.000 description 15
- 108020004414 DNA Proteins 0.000 description 14
- 239000008103 glucose Substances 0.000 description 14
- 229910052757 nitrogen Inorganic materials 0.000 description 14
- 125000003729 nucleotide group Chemical group 0.000 description 14
- 230000037361 pathway Effects 0.000 description 14
- 230000012010 growth Effects 0.000 description 13
- 230000035772 mutation Effects 0.000 description 13
- 239000002773 nucleotide Substances 0.000 description 13
- 239000000047 product Substances 0.000 description 13
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 12
- 150000001875 compounds Chemical class 0.000 description 12
- 239000002609 medium Substances 0.000 description 12
- 229910052751 metal Inorganic materials 0.000 description 11
- 239000002184 metal Substances 0.000 description 11
- 150000002739 metals Chemical class 0.000 description 11
- 230000009466 transformation Effects 0.000 description 11
- 108091026890 Coding region Proteins 0.000 description 10
- 239000012071 phase Substances 0.000 description 10
- 102000005369 Aldehyde Dehydrogenase Human genes 0.000 description 9
- 108020002663 Aldehyde Dehydrogenase Proteins 0.000 description 9
- 229910019142 PO4 Inorganic materials 0.000 description 9
- 239000002253 acid Substances 0.000 description 9
- QXACEHWTBCFNSA-SFQUDFHCSA-N cannabigerol Chemical group CCCCCC1=CC(O)=C(C\C=C(/C)CCC=C(C)C)C(O)=C1 QXACEHWTBCFNSA-SFQUDFHCSA-N 0.000 description 9
- 238000012217 deletion Methods 0.000 description 9
- 230000037430 deletion Effects 0.000 description 9
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 9
- 239000010452 phosphate Substances 0.000 description 9
- 235000021317 phosphate Nutrition 0.000 description 9
- -1 such as an rRNA Proteins 0.000 description 9
- AAXZFUQLLRMVOG-UHFFFAOYSA-N 2-methyl-2-(4-methylpent-3-enyl)-7-propylchromen-5-ol Chemical compound C1=CC(C)(CCC=C(C)C)OC2=CC(CCC)=CC(O)=C21 AAXZFUQLLRMVOG-UHFFFAOYSA-N 0.000 description 8
- WWZKQHOCKIZLMA-UHFFFAOYSA-N Caprylic acid Natural products CCCCCCCC(O)=O WWZKQHOCKIZLMA-UHFFFAOYSA-N 0.000 description 8
- 108010011939 Pyruvate Decarboxylase Proteins 0.000 description 8
- 229930006000 Sucrose Natural products 0.000 description 8
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 8
- 125000000539 amino acid group Chemical group 0.000 description 8
- GONOPSZTUGRENK-UHFFFAOYSA-N benzyl(trichloro)silane Chemical compound Cl[Si](Cl)(Cl)CC1=CC=CC=C1 GONOPSZTUGRENK-UHFFFAOYSA-N 0.000 description 8
- 230000008569 process Effects 0.000 description 8
- 239000005720 sucrose Substances 0.000 description 8
- 244000025254 Cannabis sativa Species 0.000 description 7
- 235000008697 Cannabis sativa Nutrition 0.000 description 7
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 7
- FYYHWMGAXLPEAU-UHFFFAOYSA-N Magnesium Chemical compound [Mg] FYYHWMGAXLPEAU-UHFFFAOYSA-N 0.000 description 7
- QXACEHWTBCFNSA-UHFFFAOYSA-N cannabigerol Natural products CCCCCC1=CC(O)=C(CC=C(C)CCC=C(C)C)C(O)=C1 QXACEHWTBCFNSA-UHFFFAOYSA-N 0.000 description 7
- 239000011777 magnesium Substances 0.000 description 7
- 229910052749 magnesium Inorganic materials 0.000 description 7
- 239000000126 substance Substances 0.000 description 7
- 238000013518 transcription Methods 0.000 description 7
- 230000035897 transcription Effects 0.000 description 7
- 235000015112 vegetable and seed oil Nutrition 0.000 description 7
- 239000008158 vegetable oil Substances 0.000 description 7
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 6
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 6
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 6
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 6
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 6
- 230000007423 decrease Effects 0.000 description 6
- 235000015097 nutrients Nutrition 0.000 description 6
- SXFKFRRXJUJGSS-UHFFFAOYSA-N olivetolic acid Chemical compound CCCCCC1=CC(O)=CC(O)=C1C(O)=O SXFKFRRXJUJGSS-UHFFFAOYSA-N 0.000 description 6
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 5
- 101150101889 GAS4 gene Proteins 0.000 description 5
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 5
- 241000235648 Pichia Species 0.000 description 5
- 238000000605 extraction Methods 0.000 description 5
- 238000002703 mutagenesis Methods 0.000 description 5
- 231100000350 mutagenesis Toxicity 0.000 description 5
- 239000000243 solution Substances 0.000 description 5
- 238000013519 translation Methods 0.000 description 5
- 238000011144 upstream manufacturing Methods 0.000 description 5
- ZROLHBHDLIHEMS-HUUCEWRRSA-N (6ar,10ar)-6,6,9-trimethyl-3-propyl-6a,7,8,10a-tetrahydrobenzo[c]chromen-1-ol Chemical compound C1=C(C)CC[C@H]2C(C)(C)OC3=CC(CCC)=CC(O)=C3[C@@H]21 ZROLHBHDLIHEMS-HUUCEWRRSA-N 0.000 description 4
- KJTLQQUUPVSXIM-ZCFIWIBFSA-M (R)-mevalonate Chemical compound OCC[C@](O)(C)CC([O-])=O KJTLQQUUPVSXIM-ZCFIWIBFSA-M 0.000 description 4
- ZLYNXDIDWUWASO-UHFFFAOYSA-N 6,6,9-trimethyl-3-pentyl-8,10-dihydro-7h-benzo[c]chromene-1,9,10-triol Chemical compound CC1(C)OC2=CC(CCCCC)=CC(O)=C2C2=C1CCC(C)(O)C2O ZLYNXDIDWUWASO-UHFFFAOYSA-N 0.000 description 4
- 241000219195 Arabidopsis thaliana Species 0.000 description 4
- 241000222120 Candida <Saccharomycetales> Species 0.000 description 4
- KJTLQQUUPVSXIM-UHFFFAOYSA-N DL-mevalonic acid Natural products OCCC(O)(C)CC(O)=O KJTLQQUUPVSXIM-UHFFFAOYSA-N 0.000 description 4
- UCONUSSAWGCZMV-HZPDHXFCSA-N Delta(9)-tetrahydrocannabinolic acid Chemical compound C([C@H]1C(C)(C)O2)CC(C)=C[C@H]1C1=C2C=C(CCCCC)C(C(O)=O)=C1O UCONUSSAWGCZMV-HZPDHXFCSA-N 0.000 description 4
- ZROLHBHDLIHEMS-UHFFFAOYSA-N Delta9 tetrahydrocannabivarin Natural products C1=C(C)CCC2C(C)(C)OC3=CC(CCC)=CC(O)=C3C21 ZROLHBHDLIHEMS-UHFFFAOYSA-N 0.000 description 4
- 102000057412 Diphosphomevalonate decarboxylases Human genes 0.000 description 4
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 4
- 102000004286 Hydroxymethylglutaryl CoA Reductases Human genes 0.000 description 4
- 108090000895 Hydroxymethylglutaryl CoA Reductases Proteins 0.000 description 4
- 108010000775 Hydroxymethylglutaryl-CoA synthase Proteins 0.000 description 4
- 102100028888 Hydroxymethylglutaryl-CoA synthase, cytoplasmic Human genes 0.000 description 4
- 108010065958 Isopentenyl-diphosphate Delta-isomerase Proteins 0.000 description 4
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 4
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 4
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 4
- 108700040132 Mevalonate kinases Proteins 0.000 description 4
- 101000958834 Neosartorya fumigata (strain ATCC MYA-4609 / Af293 / CBS 101355 / FGSC A1100) Diphosphomevalonate decarboxylase mvd1 Proteins 0.000 description 4
- 101000958925 Panax ginseng Diphosphomevalonate decarboxylase 1 Proteins 0.000 description 4
- 102100024279 Phosphomevalonate kinase Human genes 0.000 description 4
- 108010076504 Protein Sorting Signals Proteins 0.000 description 4
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 4
- CYQFCXCEBYINGO-UHFFFAOYSA-N THC Natural products C1=C(C)CCC2C(C)(C)OC3=CC(CCCCC)=CC(O)=C3C21 CYQFCXCEBYINGO-UHFFFAOYSA-N 0.000 description 4
- 108060008225 Thiolase Proteins 0.000 description 4
- 102000002932 Thiolase Human genes 0.000 description 4
- QHMBSVQNZZTUGM-UHFFFAOYSA-N Trans-Cannabidiol Natural products OC1=CC(CCCCC)=CC(O)=C1C1C(C(C)=C)CCC(C)=C1 QHMBSVQNZZTUGM-UHFFFAOYSA-N 0.000 description 4
- 235000004279 alanine Nutrition 0.000 description 4
- 125000001931 aliphatic group Chemical group 0.000 description 4
- ZTGXAWYVTLUPDT-UHFFFAOYSA-N cannabidiol Natural products OC1=CC(CCCCC)=CC(O)=C1C1C(C(C)=C)CC=C(C)C1 ZTGXAWYVTLUPDT-UHFFFAOYSA-N 0.000 description 4
- QHMBSVQNZZTUGM-ZWKOTPCHSA-N cannabidiol Chemical compound OC1=CC(CCCCC)=CC(O)=C1[C@H]1[C@H](C(C)=C)CCC(C)=C1 QHMBSVQNZZTUGM-ZWKOTPCHSA-N 0.000 description 4
- 229950011318 cannabidiol Drugs 0.000 description 4
- 230000010261 cell growth Effects 0.000 description 4
- 235000010980 cellulose Nutrition 0.000 description 4
- 230000008859 change Effects 0.000 description 4
- 239000002738 chelating agent Substances 0.000 description 4
- 238000010367 cloning Methods 0.000 description 4
- 230000000295 complement effect Effects 0.000 description 4
- CYQFCXCEBYINGO-IAGOWNOFSA-N delta1-THC Chemical compound C1=C(C)CC[C@H]2C(C)(C)OC3=CC(CCCCC)=CC(O)=C3[C@@H]21 CYQFCXCEBYINGO-IAGOWNOFSA-N 0.000 description 4
- PCXRACLQFPRCBB-ZWKOTPCHSA-N dihydrocannabidiol Natural products OC1=CC(CCCCC)=CC(O)=C1[C@H]1[C@H](C(C)C)CCC(C)=C1 PCXRACLQFPRCBB-ZWKOTPCHSA-N 0.000 description 4
- 229960004242 dronabinol Drugs 0.000 description 4
- 230000002538 fungal effect Effects 0.000 description 4
- 230000010354 integration Effects 0.000 description 4
- XIXADJRWDQXREU-UHFFFAOYSA-M lithium acetate Chemical compound [Li+].CC([O-])=O XIXADJRWDQXREU-UHFFFAOYSA-M 0.000 description 4
- 239000000463 material Substances 0.000 description 4
- 102000002678 mevalonate kinase Human genes 0.000 description 4
- 108091000116 phosphomevalonate kinase Proteins 0.000 description 4
- 239000013587 production medium Substances 0.000 description 4
- 238000000746 purification Methods 0.000 description 4
- 150000003839 salts Chemical class 0.000 description 4
- 239000002904 solvent Substances 0.000 description 4
- 241000894007 species Species 0.000 description 4
- 235000013343 vitamin Nutrition 0.000 description 4
- 239000011782 vitamin Substances 0.000 description 4
- 229940088594 vitamin Drugs 0.000 description 4
- 229930003231 vitamin Natural products 0.000 description 4
- GHOKWGTUZJEAQD-ZETCQYMHSA-N (D)-(+)-Pantothenic acid Chemical compound OCC(C)(C)[C@@H](O)C(=O)NCCC(O)=O GHOKWGTUZJEAQD-ZETCQYMHSA-N 0.000 description 3
- QGZKDVFQNNGYKY-UHFFFAOYSA-N Ammonia Chemical compound N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 description 3
- 239000002028 Biomass Substances 0.000 description 3
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 3
- VBGLYOIFKLUMQG-UHFFFAOYSA-N Cannabinol Chemical compound C1=C(C)C=C2C3=C(O)C=C(CCCCC)C=C3OC(C)(C)C2=C1 VBGLYOIFKLUMQG-UHFFFAOYSA-N 0.000 description 3
- 241000218236 Cannabis Species 0.000 description 3
- 102100031780 Endonuclease Human genes 0.000 description 3
- 108010042407 Endonucleases Proteins 0.000 description 3
- 241001465321 Eremothecium Species 0.000 description 3
- 102100028501 Galanin peptides Human genes 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- 101150045458 KEX2 gene Proteins 0.000 description 3
- 239000004472 Lysine Substances 0.000 description 3
- KWYUFKZDYYNOTN-UHFFFAOYSA-M Potassium hydroxide Chemical compound [OH-].[K+] KWYUFKZDYYNOTN-UHFFFAOYSA-M 0.000 description 3
- 241000589776 Pseudomonas putida Species 0.000 description 3
- 241000235070 Saccharomyces Species 0.000 description 3
- 101100335936 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GAS4 gene Proteins 0.000 description 3
- 241000235003 Saccharomycopsis Species 0.000 description 3
- 150000007513 acids Chemical class 0.000 description 3
- 238000013019 agitation Methods 0.000 description 3
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 3
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 3
- 235000011130 ammonium sulphate Nutrition 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 3
- 239000008346 aqueous phase Substances 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 210000004899 c-terminal region Anatomy 0.000 description 3
- 239000011575 calcium Substances 0.000 description 3
- 229910052791 calcium Inorganic materials 0.000 description 3
- 235000001465 calcium Nutrition 0.000 description 3
- 125000002091 cationic group Chemical group 0.000 description 3
- 238000004113 cell culture Methods 0.000 description 3
- 230000003247 decreasing effect Effects 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 239000000543 intermediate Substances 0.000 description 3
- 125000003071 maltose group Chemical group 0.000 description 3
- 230000000813 microbial effect Effects 0.000 description 3
- 229940014662 pantothenate Drugs 0.000 description 3
- 235000019161 pantothenic acid Nutrition 0.000 description 3
- 239000011713 pantothenic acid Substances 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 102220064590 rs786205522 Human genes 0.000 description 3
- 239000007787 solid Substances 0.000 description 3
- 239000011550 stock solution Substances 0.000 description 3
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- RBEAVAMWZAJWOI-MTOHEIAKSA-N (5as,6s,9r,9ar)-6-methyl-3-pentyl-9-prop-1-en-2-yl-7,8,9,9a-tetrahydro-5ah-dibenzofuran-1,6-diol Chemical compound C1=2C(O)=CC(CCCCC)=CC=2O[C@H]2[C@@H]1[C@H](C(C)=C)CC[C@]2(C)O RBEAVAMWZAJWOI-MTOHEIAKSA-N 0.000 description 2
- ALSTYHKOOCGGFT-KTKRTIGZSA-N (9Z)-octadecen-1-ol Chemical compound CCCCCCCC\C=C/CCCCCCCCO ALSTYHKOOCGGFT-KTKRTIGZSA-N 0.000 description 2
- TWKHUZXSTKISQC-UHFFFAOYSA-N 2-(5-methyl-2-prop-1-en-2-ylphenyl)-5-pentylbenzene-1,3-diol Chemical compound OC1=CC(CCCCC)=CC(O)=C1C1=CC(C)=CC=C1C(C)=C TWKHUZXSTKISQC-UHFFFAOYSA-N 0.000 description 2
- YJYIDZLGVYOPGU-XNTDXEJSSA-N 2-[(2e)-3,7-dimethylocta-2,6-dienyl]-5-propylbenzene-1,3-diol Chemical compound CCCC1=CC(O)=C(C\C=C(/C)CCC=C(C)C)C(O)=C1 YJYIDZLGVYOPGU-XNTDXEJSSA-N 0.000 description 2
- LRFVTYWOQMYALW-UHFFFAOYSA-N 9H-xanthine Chemical compound O=C1NC(=O)NC2=C1NC=N2 LRFVTYWOQMYALW-UHFFFAOYSA-N 0.000 description 2
- 241001673062 Achromobacter xylosoxidans Species 0.000 description 2
- VHUUQVKOLVNVRT-UHFFFAOYSA-N Ammonium hydroxide Chemical compound [NH4+].[OH-] VHUUQVKOLVNVRT-UHFFFAOYSA-N 0.000 description 2
- 244000105624 Arachis hypogaea Species 0.000 description 2
- 235000010777 Arachis hypogaea Nutrition 0.000 description 2
- 239000004475 Arginine Substances 0.000 description 2
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 2
- 241000589171 Bradyrhizobium sp. Species 0.000 description 2
- 241000722885 Brettanomyces Species 0.000 description 2
- 101100351811 Caenorhabditis elegans pgal-1 gene Proteins 0.000 description 2
- REOZWEGFPHTFEI-JKSUJKDBSA-N Cannabidivarin Chemical compound OC1=CC(CCC)=CC(O)=C1[C@H]1[C@H](C(C)=C)CCC(C)=C1 REOZWEGFPHTFEI-JKSUJKDBSA-N 0.000 description 2
- 102000018208 Cannabinoid Receptor Human genes 0.000 description 2
- 108050007331 Cannabinoid receptor Proteins 0.000 description 2
- 240000006162 Chenopodium quinoa Species 0.000 description 2
- 235000015493 Chenopodium quinoa Nutrition 0.000 description 2
- 108700010070 Codon Usage Proteins 0.000 description 2
- 241001527609 Cryptococcus Species 0.000 description 2
- 102100034583 Dolichyl-diphosphooligosaccharide-protein glycosyltransferase subunit 1 Human genes 0.000 description 2
- 238000002965 ELISA Methods 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- 241001149669 Hanseniaspora Species 0.000 description 2
- 101000848781 Homo sapiens Dolichyl-diphosphooligosaccharide-protein glycosyltransferase subunit 1 Proteins 0.000 description 2
- 235000008694 Humulus lupulus Nutrition 0.000 description 2
- 244000025221 Humulus lupulus Species 0.000 description 2
- VEXZGXHMUGYJMC-UHFFFAOYSA-N Hydrochloric acid Chemical compound Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 2
- 108091092195 Intron Proteins 0.000 description 2
- 241000235649 Kluyveromyces Species 0.000 description 2
- 244000285963 Kluyveromyces fragilis Species 0.000 description 2
- 241001138401 Kluyveromyces lactis Species 0.000 description 2
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 2
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 2
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 2
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 2
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 2
- 241001149698 Lipomyces Species 0.000 description 2
- 108091005461 Nucleic proteins Chemical group 0.000 description 2
- 239000001888 Peptone Substances 0.000 description 2
- 108010080698 Peptones Proteins 0.000 description 2
- NBIIXXVUZAFLBC-UHFFFAOYSA-N Phosphoric acid Chemical compound OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 2
- 241000235645 Pichia kudriavzevii Species 0.000 description 2
- 241000203720 Pimelobacter simplex Species 0.000 description 2
- 108010009736 Protein Hydrolysates Proteins 0.000 description 2
- 241000681946 Rhododendron dauricum Species 0.000 description 2
- 241000223252 Rhodotorula Species 0.000 description 2
- 108091028664 Ribonucleotide Proteins 0.000 description 2
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 2
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 2
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 2
- 239000004473 Threonine Substances 0.000 description 2
- 108700019146 Transgenes Proteins 0.000 description 2
- 241000223230 Trichosporon Species 0.000 description 2
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 2
- 241000311098 Yamadazyma Species 0.000 description 2
- 241000588902 Zymomonas mobilis Species 0.000 description 2
- 230000003213 activating effect Effects 0.000 description 2
- 239000000654 additive Substances 0.000 description 2
- WQZGKKKJIJFFOK-PHYPRBDBSA-N alpha-D-galactose Chemical compound OC[C@H]1O[C@H](O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-PHYPRBDBSA-N 0.000 description 2
- 239000000908 ammonium hydroxide Substances 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 2
- 125000003118 aryl group Chemical group 0.000 description 2
- 235000009582 asparagine Nutrition 0.000 description 2
- 229960001230 asparagine Drugs 0.000 description 2
- 235000003704 aspartic acid Nutrition 0.000 description 2
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 2
- GUBGYTABKSRVRQ-QUYVBRFLSA-N beta-maltose Chemical compound OC[C@H]1O[C@H](O[C@H]2[C@H](O)[C@@H](O)[C@H](O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@@H]1O GUBGYTABKSRVRQ-QUYVBRFLSA-N 0.000 description 2
- 238000011088 calibration curve Methods 0.000 description 2
- 229940041514 candida albicans extract Drugs 0.000 description 2
- WVOLTBSCXRRQFR-DLBZAZTESA-N cannabidiolic acid Chemical compound OC1=C(C(O)=O)C(CCCCC)=CC(O)=C1[C@H]1[C@H](C(C)=C)CCC(C)=C1 WVOLTBSCXRRQFR-DLBZAZTESA-N 0.000 description 2
- YJYIDZLGVYOPGU-UHFFFAOYSA-N cannabigeroldivarin Natural products CCCC1=CC(O)=C(CC=C(C)CCC=C(C)C)C(O)=C1 YJYIDZLGVYOPGU-UHFFFAOYSA-N 0.000 description 2
- 238000004590 computer program Methods 0.000 description 2
- 238000002425 crystallisation Methods 0.000 description 2
- 230000008025 crystallization Effects 0.000 description 2
- 125000004122 cyclic group Chemical group 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 150000002016 disaccharides Chemical class 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- 235000013922 glutamic acid Nutrition 0.000 description 2
- 239000004220 glutamic acid Substances 0.000 description 2
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 2
- 150000004676 glycans Chemical class 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- OEXFMSFODMQEPE-HDRQGHTBSA-N hexanoyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)CCCCC)O[C@H]1N1C2=NC=NC(N)=C2N=C1 OEXFMSFODMQEPE-HDRQGHTBSA-N 0.000 description 2
- 238000004128 high performance liquid chromatography Methods 0.000 description 2
- 235000014304 histidine Nutrition 0.000 description 2
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 2
- 210000005260 human cell Anatomy 0.000 description 2
- FDGQSTZJBFJUBT-UHFFFAOYSA-N hypoxanthine Chemical compound O=C1NC=NC2=C1NC=N2 FDGQSTZJBFJUBT-UHFFFAOYSA-N 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 239000012535 impurity Substances 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 238000009655 industrial fermentation Methods 0.000 description 2
- 230000005764 inhibitory process Effects 0.000 description 2
- 229910052500 inorganic mineral Inorganic materials 0.000 description 2
- DRAVOWXCEBXPTN-UHFFFAOYSA-N isoguanine Chemical compound NC1=NC(=O)NC2=C1NC=N2 DRAVOWXCEBXPTN-UHFFFAOYSA-N 0.000 description 2
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 2
- 229960000310 isoleucine Drugs 0.000 description 2
- KWGKDLIKAYFUFQ-UHFFFAOYSA-M lithium chloride Chemical compound [Li+].[Cl-] KWGKDLIKAYFUFQ-UHFFFAOYSA-M 0.000 description 2
- 238000011068 loading method Methods 0.000 description 2
- 238000004949 mass spectrometry Methods 0.000 description 2
- 230000004060 metabolic process Effects 0.000 description 2
- 229930182817 methionine Natural products 0.000 description 2
- 235000010755 mineral Nutrition 0.000 description 2
- 239000011707 mineral Substances 0.000 description 2
- 239000002480 mineral oil Substances 0.000 description 2
- 235000010446 mineral oil Nutrition 0.000 description 2
- 150000002772 monosaccharides Chemical class 0.000 description 2
- 229940055577 oleyl alcohol Drugs 0.000 description 2
- XMLQWXUVTXCDDL-UHFFFAOYSA-N oleyl alcohol Natural products CCCCCCC=CCCCCCCCCCCO XMLQWXUVTXCDDL-UHFFFAOYSA-N 0.000 description 2
- IRMPFYJSHJGOPE-UHFFFAOYSA-N olivetol Chemical compound CCCCCC1=CC(O)=CC(O)=C1 IRMPFYJSHJGOPE-UHFFFAOYSA-N 0.000 description 2
- 210000000056 organ Anatomy 0.000 description 2
- 235000019319 peptone Nutrition 0.000 description 2
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 2
- 239000013612 plasmid Substances 0.000 description 2
- 229920000642 polymer Polymers 0.000 description 2
- 229920001282 polysaccharide Polymers 0.000 description 2
- 239000005017 polysaccharide Substances 0.000 description 2
- LWIHDJKSTIGBAC-UHFFFAOYSA-K potassium phosphate Substances [K+].[K+].[K+].[O-]P([O-])([O-])=O LWIHDJKSTIGBAC-UHFFFAOYSA-K 0.000 description 2
- 230000006861 primary carbon metabolism Effects 0.000 description 2
- 238000003753 real-time PCR Methods 0.000 description 2
- 230000010076 replication Effects 0.000 description 2
- 239000002336 ribonucleotide Substances 0.000 description 2
- 125000002652 ribonucleotide group Chemical group 0.000 description 2
- 238000007480 sanger sequencing Methods 0.000 description 2
- 229920006395 saturated elastomer Polymers 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 150000003384 small molecules Chemical class 0.000 description 2
- 239000011780 sodium chloride Substances 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 238000010189 synthetic method Methods 0.000 description 2
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 2
- 231100000331 toxic Toxicity 0.000 description 2
- 230000002588 toxic effect Effects 0.000 description 2
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 2
- 239000004474 valine Substances 0.000 description 2
- 229940054967 vanquish Drugs 0.000 description 2
- 239000012138 yeast extract Substances 0.000 description 2
- HDTRYLNUVZCQOY-UHFFFAOYSA-N α-D-glucopyranosyl-α-D-glucopyranoside Natural products OC1C(O)C(O)C(CO)OC1OC1C(O)C(O)C(O)C(CO)O1 HDTRYLNUVZCQOY-UHFFFAOYSA-N 0.000 description 1
- VUDQSRFCCHQIIU-UHFFFAOYSA-N 1-(3,5-dichloro-2,6-dihydroxy-4-methoxyphenyl)hexan-1-one Chemical compound CCCCCC(=O)C1=C(O)C(Cl)=C(OC)C(Cl)=C1O VUDQSRFCCHQIIU-UHFFFAOYSA-N 0.000 description 1
- KXKOBIRSQLNUPS-UHFFFAOYSA-N 1-hydroxy-6,6,9-trimethyl-3-pentylbenzo[c]chromene-2-carboxylic acid Chemical compound O1C(C)(C)C2=CC=C(C)C=C2C2=C1C=C(CCCCC)C(C(O)=O)=C2O KXKOBIRSQLNUPS-UHFFFAOYSA-N 0.000 description 1
- QGKBSGBYSPTPKJ-UZMKXNTCSA-N 2,6-di-o-methyl-β-cyclodextrin Chemical compound COC[C@H]([C@H]([C@@H]([C@H]1OC)O)O[C@H]2O[C@@H]([C@@H](O[C@H]3O[C@H](COC)[C@H]([C@@H]([C@H]3OC)O)O[C@H]3O[C@H](COC)[C@H]([C@@H]([C@H]3OC)O)O[C@H]3O[C@H](COC)[C@H]([C@@H]([C@H]3OC)O)O[C@H]3O[C@H](COC)[C@H]([C@@H]([C@H]3OC)O)O3)[C@H](O)[C@H]2OC)COC)O[C@@H]1O[C@H]1[C@H](O)[C@@H](OC)[C@@H]3O[C@@H]1COC QGKBSGBYSPTPKJ-UZMKXNTCSA-N 0.000 description 1
- XQCZBXHVTFVIFE-UHFFFAOYSA-N 2-amino-4-hydroxypyrimidine Chemical compound NC1=NC=CC(O)=N1 XQCZBXHVTFVIFE-UHFFFAOYSA-N 0.000 description 1
- QTBSBXVTEAMEQO-UHFFFAOYSA-M Acetate Chemical compound CC([O-])=O QTBSBXVTEAMEQO-UHFFFAOYSA-M 0.000 description 1
- 241000159572 Aciculoconidium Species 0.000 description 1
- 244000298715 Actinidia chinensis Species 0.000 description 1
- 235000009434 Actinidia chinensis Nutrition 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 1
- 241000609240 Ambelania acida Species 0.000 description 1
- 241001508809 Ambrosiozyma Species 0.000 description 1
- 239000004254 Ammonium phosphate Substances 0.000 description 1
- 241000024188 Andala Species 0.000 description 1
- 241001533085 Aquilaria sinensis Species 0.000 description 1
- 101001094837 Arabidopsis thaliana Pectinesterase 5 Proteins 0.000 description 1
- 235000001405 Artemisia annua Nutrition 0.000 description 1
- 240000000011 Artemisia annua Species 0.000 description 1
- 241001638540 Arthroascus Species 0.000 description 1
- 241001508785 Arxiozyma Species 0.000 description 1
- 241000228212 Aspergillus Species 0.000 description 1
- 241000972773 Aulopiformes Species 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 244000177578 Bacterium linens Species 0.000 description 1
- 235000012539 Bacterium linens Nutrition 0.000 description 1
- 241000235114 Bensingtonia Species 0.000 description 1
- 241000415908 Bhargavaea cecembensis DSE10 Species 0.000 description 1
- 241000680806 Blastobotrys adeninivorans Species 0.000 description 1
- 241000178289 Botryozyma Species 0.000 description 1
- 241000995051 Brenda Species 0.000 description 1
- 235000000287 Brettanomyces bruxellensis Nutrition 0.000 description 1
- 244000027711 Brettanomyces bruxellensis Species 0.000 description 1
- 241000661177 Brevibacterium yomogidense Species 0.000 description 1
- 241000235172 Bullera Species 0.000 description 1
- 241000033328 Bulleromyces Species 0.000 description 1
- 101100494773 Caenorhabditis elegans ctl-2 gene Proteins 0.000 description 1
- 244000105627 Cajanus indicus Species 0.000 description 1
- 235000010773 Cajanus indicus Nutrition 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- WVOLTBSCXRRQFR-SJORKVTESA-N Cannabidiolic acid Natural products OC1=C(C(O)=O)C(CCCCC)=CC(O)=C1[C@@H]1[C@@H](C(C)=C)CCC(C)=C1 WVOLTBSCXRRQFR-SJORKVTESA-N 0.000 description 1
- 241000223019 Caragana korshinskii Species 0.000 description 1
- 229920002101 Chitin Polymers 0.000 description 1
- 241001508787 Citeromyces Species 0.000 description 1
- 241001082424 Citreicella sp. Species 0.000 description 1
- 244000175448 Citrus madurensis Species 0.000 description 1
- 235000004332 Citrus madurensis Nutrition 0.000 description 1
- 241001508811 Clavispora Species 0.000 description 1
- 241000193403 Clostridium Species 0.000 description 1
- 241000186216 Corynebacterium Species 0.000 description 1
- 235000003405 Curcuma zedoaria Nutrition 0.000 description 1
- 240000009138 Curcuma zedoaria Species 0.000 description 1
- 241000222039 Cystofilobasidium Species 0.000 description 1
- GUBGYTABKSRVRQ-CUHNMECISA-N D-Cellobiose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-CUHNMECISA-N 0.000 description 1
- WQZGKKKJIJFFOK-QTVWNMPRSA-N D-mannopyranose Chemical compound OC[C@H]1OC(O)[C@@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-QTVWNMPRSA-N 0.000 description 1
- HMFHBZSHGGEWLO-SOOFDHNKSA-N D-ribofuranose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H]1O HMFHBZSHGGEWLO-SOOFDHNKSA-N 0.000 description 1
- 230000033616 DNA repair Effects 0.000 description 1
- 230000004543 DNA replication Effects 0.000 description 1
- 241000235035 Debaryomyces Species 0.000 description 1
- 101710088194 Dehydrogenase Proteins 0.000 description 1
- 241001642843 Deltaproteobacteria bacterium Species 0.000 description 1
- 241000859983 Dendrobium catenatum Species 0.000 description 1
- 102100037458 Dephospho-CoA kinase Human genes 0.000 description 1
- 241000224495 Dictyostelium Species 0.000 description 1
- 108010006731 Dimethylallyltranstransferase Proteins 0.000 description 1
- 102000005454 Dimethylallyltranstransferase Human genes 0.000 description 1
- 241001123630 Dipodascopsis Species 0.000 description 1
- 241001123635 Dipodascus Species 0.000 description 1
- 241000255601 Drosophila melanogaster Species 0.000 description 1
- 240000003133 Elaeis guineensis Species 0.000 description 1
- 235000001950 Elaeis guineensis Nutrition 0.000 description 1
- 241000588914 Enterobacter Species 0.000 description 1
- 241000235167 Eremascus Species 0.000 description 1
- 241000695842 Erythrobacter citreus LAMA 915 Species 0.000 description 1
- 241000222042 Erythrobasidium Species 0.000 description 1
- LYCAIKOWRPUZTN-UHFFFAOYSA-N Ethylene glycol Chemical compound OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 description 1
- 108700024394 Exon Proteins 0.000 description 1
- 101100112369 Fasciola hepatica Cat-1 gene Proteins 0.000 description 1
- 241000222840 Fellomyces Species 0.000 description 1
- 241000221207 Filobasidium Species 0.000 description 1
- 229930091371 Fructose Natural products 0.000 description 1
- 239000005715 Fructose Substances 0.000 description 1
- RFSUNEUAIZKAJO-ARQDHWQXSA-N Fructose Chemical compound OC[C@H]1O[C@](O)(CO)[C@@H](O)[C@@H]1O RFSUNEUAIZKAJO-ARQDHWQXSA-N 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 101150094690 GAL1 gene Proteins 0.000 description 1
- 101150038242 GAL10 gene Proteins 0.000 description 1
- 101150037782 GAL2 gene Proteins 0.000 description 1
- 101150103804 GAL3 gene Proteins 0.000 description 1
- 241001123633 Galactomyces Species 0.000 description 1
- 102100024637 Galectin-10 Human genes 0.000 description 1
- 102100021735 Galectin-2 Human genes 0.000 description 1
- 102100039558 Galectin-3 Human genes 0.000 description 1
- 102100039555 Galectin-7 Human genes 0.000 description 1
- 235000017048 Garcinia mangostana Nutrition 0.000 description 1
- 240000006053 Garcinia mangostana Species 0.000 description 1
- 241000159512 Geotrichum Species 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 235000010469 Glycine max Nutrition 0.000 description 1
- 244000068988 Glycine max Species 0.000 description 1
- 229920002527 Glycogen Polymers 0.000 description 1
- 108020005004 Guide RNA Proteins 0.000 description 1
- SQUHHTBVTRBESD-UHFFFAOYSA-N Hexa-Ac-myo-Inositol Natural products CC(=O)OC1C(OC(C)=O)C(OC(C)=O)C(OC(C)=O)C(OC(C)=O)C1OC(C)=O SQUHHTBVTRBESD-UHFFFAOYSA-N 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 241001236629 Holtermannia Species 0.000 description 1
- 101100121078 Homo sapiens GAL gene Proteins 0.000 description 1
- 101000608772 Homo sapiens Galectin-7 Proteins 0.000 description 1
- 101000780205 Homo sapiens Long-chain-fatty-acid-CoA ligase 5 Proteins 0.000 description 1
- 101000780202 Homo sapiens Long-chain-fatty-acid-CoA ligase 6 Proteins 0.000 description 1
- 101000878605 Homo sapiens Low affinity immunoglobulin epsilon Fc receptor Proteins 0.000 description 1
- 241000376403 Hyphopichia Species 0.000 description 1
- UGQMRVRMYYASKQ-UHFFFAOYSA-N Hypoxanthine nucleoside Natural products OC1C(O)C(CO)OC1N1C(NC=NC2=O)=C2N=C1 UGQMRVRMYYASKQ-UHFFFAOYSA-N 0.000 description 1
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 1
- 229930010555 Inosine Natural products 0.000 description 1
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 1
- 241000235644 Issatchenkia Species 0.000 description 1
- 235000014663 Kluyveromyces fragilis Nutrition 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- 241001489120 Kondoa Species 0.000 description 1
- 241001304304 Kuraishia Species 0.000 description 1
- 241000222661 Kurtzmanomyces Species 0.000 description 1
- 241000481961 Lachancea thermotolerans Species 0.000 description 1
- 241000194036 Lactococcus Species 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- 241000221479 Leucosporidium Species 0.000 description 1
- 241001508815 Lodderomyces Species 0.000 description 1
- 102100034337 Long-chain-fatty-acid-CoA ligase 6 Human genes 0.000 description 1
- 235000017617 Lonicera japonica Nutrition 0.000 description 1
- 244000167230 Lonicera japonica Species 0.000 description 1
- 102100038007 Low affinity immunoglobulin epsilon Fc receptor Human genes 0.000 description 1
- 241000555676 Malassezia Species 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 241000196329 Marchantia polymorpha Species 0.000 description 1
- 241001619555 Marchantia polymorpha subsp. ruderalis Species 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 241001123674 Metschnikowia Species 0.000 description 1
- 241000235048 Meyerozyma guilliermondii Species 0.000 description 1
- 241001633954 Microbacterium oxydans Species 0.000 description 1
- 229920000168 Microcrystalline cellulose Polymers 0.000 description 1
- 241001149967 Mrakia Species 0.000 description 1
- 241000529863 Myxozyma Species 0.000 description 1
- 150000001200 N-acyl ethanolamides Chemical class 0.000 description 1
- 241000193596 Nadsonia Species 0.000 description 1
- 241001099335 Nakazawaea Species 0.000 description 1
- 241000233892 Neocallimastix Species 0.000 description 1
- 241000221960 Neurospora Species 0.000 description 1
- 101100005271 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) cat-1 gene Proteins 0.000 description 1
- GRYLNZFGIOXLOG-UHFFFAOYSA-N Nitric acid Chemical compound O[N+]([O-])=O GRYLNZFGIOXLOG-UHFFFAOYSA-N 0.000 description 1
- 241000119779 Novosphingobium sp. Species 0.000 description 1
- IGHTZQUIFGUJTG-QSMXQIJUSA-N O1C2=CC(CCCCC)=CC(O)=C2[C@H]2C(C)(C)[C@@H]3[C@H]2[C@@]1(C)CC3 Chemical compound O1C2=CC(CCCCC)=CC(O)=C2[C@H]2C(C)(C)[C@@H]3[C@H]2[C@@]1(C)CC3 IGHTZQUIFGUJTG-QSMXQIJUSA-N 0.000 description 1
- 241001112159 Ogataea Species 0.000 description 1
- 241000320412 Ogataea angusta Species 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 241000159576 Oosporidium Species 0.000 description 1
- 239000004235 Orange GGN Substances 0.000 description 1
- 241001502335 Orpinomyces Species 0.000 description 1
- 235000007164 Oryza sativa Nutrition 0.000 description 1
- 240000007594 Oryza sativa Species 0.000 description 1
- 241000235652 Pachysolen Species 0.000 description 1
- 241001542817 Phaffia Species 0.000 description 1
- 241000195887 Physcomitrella patens Species 0.000 description 1
- 241000235379 Piromyces Species 0.000 description 1
- 240000002904 Plumbago indica Species 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- 241000869142 Pseudonocardia sp. Species 0.000 description 1
- 244000294611 Punica granatum Species 0.000 description 1
- 235000014360 Punica granatum Nutrition 0.000 description 1
- LCTONWCANYUPML-UHFFFAOYSA-M Pyruvate Chemical compound CC(=O)C([O-])=O LCTONWCANYUPML-UHFFFAOYSA-M 0.000 description 1
- MUPFEKGTMRGPLJ-RMMQSMQOSA-N Raffinose Natural products O(C[C@H]1[C@@H](O)[C@H](O)[C@@H](O)[C@@H](O[C@@]2(CO)[C@H](O)[C@@H](O)[C@@H](CO)O2)O1)[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 MUPFEKGTMRGPLJ-RMMQSMQOSA-N 0.000 description 1
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 1
- 235000009122 Rubus idaeus Nutrition 0.000 description 1
- 244000235659 Rubus idaeus Species 0.000 description 1
- 240000005746 Ruta graveolens Species 0.000 description 1
- 235000001347 Ruta graveolens Nutrition 0.000 description 1
- 101150040884 SDH3 gene Proteins 0.000 description 1
- 101100055274 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) ALD6 gene Proteins 0.000 description 1
- 101100008072 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) CWP2 gene Chemical group 0.000 description 1
- 241000582914 Saccharomyces uvarum Species 0.000 description 1
- 241001489223 Saccharomycodes Species 0.000 description 1
- 240000000111 Saccharum officinarum Species 0.000 description 1
- 235000007201 Saccharum officinarum Nutrition 0.000 description 1
- 241000222838 Saitoella Species 0.000 description 1
- 241001514651 Sakaguchia Species 0.000 description 1
- 241000607142 Salmonella Species 0.000 description 1
- 241001149673 Saturnispora Species 0.000 description 1
- 241000235060 Scheffersomyces stipitis Species 0.000 description 1
- 241000159586 Schizoblastosporion Species 0.000 description 1
- 241000235346 Schizosaccharomyces Species 0.000 description 1
- 241000235347 Schizosaccharomyces pombe Species 0.000 description 1
- 241000311088 Schwanniomyces Species 0.000 description 1
- 240000003705 Senecio vulgaris Species 0.000 description 1
- 235000009337 Spinacia oleracea Nutrition 0.000 description 1
- 244000300264 Spinacia oleracea Species 0.000 description 1
- 241000228389 Sporidiobolus Species 0.000 description 1
- 241000222068 Sporobolomyces <Sporidiobolaceae> Species 0.000 description 1
- 241000193640 Sporopachydermia Species 0.000 description 1
- 241000191967 Staphylococcus aureus Species 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 241000222665 Sterigmatomyces Species 0.000 description 1
- 241000040567 Sterigmatosporidium Species 0.000 description 1
- 241000187762 Streptomyces actuosus Species 0.000 description 1
- 241000794190 Streptomyces sp. ADI96-02 Species 0.000 description 1
- LSNNMFCWUKXFEE-UHFFFAOYSA-N Sulfurous acid Chemical compound OS(O)=O LSNNMFCWUKXFEE-UHFFFAOYSA-N 0.000 description 1
- 241000122237 Symbiotaphrina Species 0.000 description 1
- 241000159597 Sympodiomyces Species 0.000 description 1
- 241001523623 Sympodiomycopsis Species 0.000 description 1
- 241000255588 Tephritidae Species 0.000 description 1
- 241000589499 Thermus thermophilus Species 0.000 description 1
- 241000235006 Torulaspora Species 0.000 description 1
- 241001495125 Torulaspora pretoriensis Species 0.000 description 1
- 108020004566 Transfer RNA Proteins 0.000 description 1
- HDTRYLNUVZCQOY-WSWWMNSNSA-N Trehalose Natural products O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-WSWWMNSNSA-N 0.000 description 1
- 244000186513 Trema orientalis Species 0.000 description 1
- 241000400381 Trichosporiella Species 0.000 description 1
- 241001480014 Trigonopsis Species 0.000 description 1
- 241000222671 Tsuchiyaea Species 0.000 description 1
- MUPFEKGTMRGPLJ-UHFFFAOYSA-N UNPD196149 Natural products OC1C(O)C(CO)OC1(CO)OC1C(O)C(O)C(O)C(COC2C(C(O)C(O)C(CO)O2)O)O1 MUPFEKGTMRGPLJ-UHFFFAOYSA-N 0.000 description 1
- 241000145580 Udeniomyces Species 0.000 description 1
- 241000221566 Ustilago Species 0.000 description 1
- 235000016635 Vitis pseudoreticulata Nutrition 0.000 description 1
- 241000600611 Vitis pseudoreticulata Species 0.000 description 1
- 241000193620 Wickerhamia Species 0.000 description 1
- 241000193624 Wickerhamiella Species 0.000 description 1
- 241000235152 Williopsis Species 0.000 description 1
- 241000235013 Yarrowia Species 0.000 description 1
- 241000235015 Yarrowia lipolytica Species 0.000 description 1
- 240000000038 Ziziphus mauritiana Species 0.000 description 1
- 235000006545 Ziziphus mauritiana Nutrition 0.000 description 1
- 235000008529 Ziziphus vulgaris Nutrition 0.000 description 1
- 241000222676 Zygoascus Species 0.000 description 1
- 241000235017 Zygosaccharomyces Species 0.000 description 1
- 241000685534 Zygowilliopsis Species 0.000 description 1
- 241000193645 Zygozyma Species 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 125000002252 acyl group Chemical group 0.000 description 1
- 101150027964 ada gene Proteins 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 238000010564 aerobic fermentation Methods 0.000 description 1
- HDTRYLNUVZCQOY-LIZSDCNHSA-N alpha,alpha-trehalose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-LIZSDCNHSA-N 0.000 description 1
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 229910000147 aluminium phosphate Inorganic materials 0.000 description 1
- 229910021529 ammonia Inorganic materials 0.000 description 1
- 229910000148 ammonium phosphate Inorganic materials 0.000 description 1
- 235000019289 ammonium phosphates Nutrition 0.000 description 1
- 150000003863 ammonium salts Chemical class 0.000 description 1
- 229940044197 ammonium sulfate Drugs 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 230000001195 anabolic effect Effects 0.000 description 1
- 239000012491 analyte Substances 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 239000002518 antifoaming agent Substances 0.000 description 1
- 239000012736 aqueous medium Substances 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 239000008163 avocado oil Substances 0.000 description 1
- 235000021302 avocado oil Nutrition 0.000 description 1
- 239000010905 bagasse Substances 0.000 description 1
- 238000011021 bench scale process Methods 0.000 description 1
- 238000010364 biochemical engineering Methods 0.000 description 1
- 230000008238 biochemical pathway Effects 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 150000005693 branched-chain amino acids Chemical class 0.000 description 1
- 239000006227 byproduct Substances 0.000 description 1
- 229960005069 calcium Drugs 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- LLSDKQJKOVVTOJ-UHFFFAOYSA-L calcium chloride dihydrate Chemical compound O.O.[Cl-].[Cl-].[Ca+2] LLSDKQJKOVVTOJ-UHFFFAOYSA-L 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 235000013877 carbamide Nutrition 0.000 description 1
- 231100000357 carcinogen Toxicity 0.000 description 1
- 239000003183 carcinogenic agent Substances 0.000 description 1
- 230000001925 catabolic effect Effects 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 239000003054 catalyst Substances 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 238000011210 chromatographic step Methods 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 239000012141 concentrate Substances 0.000 description 1
- 230000001143 conditioned effect Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000011109 contamination Methods 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 229940097362 cyclodextrins Drugs 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 230000000911 decarboxylating effect Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 108010049285 dephospho-CoA kinase Proteins 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 239000008121 dextrose Substances 0.000 description 1
- MNNHAPBLZZVQHP-UHFFFAOYSA-N diammonium hydrogen phosphate Chemical compound [NH4+].[NH4+].OP([O-])([O-])=O MNNHAPBLZZVQHP-UHFFFAOYSA-N 0.000 description 1
- 150000004683 dihydrates Chemical class 0.000 description 1
- 238000007865 diluting Methods 0.000 description 1
- NAGJZTKCGNOGPW-UHFFFAOYSA-K dioxido-sulfanylidene-sulfido-$l^{5}-phosphane Chemical compound [O-]P([O-])([S-])=S NAGJZTKCGNOGPW-UHFFFAOYSA-K 0.000 description 1
- 238000012444 downstream purification process Methods 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 239000000839 emulsion Substances 0.000 description 1
- 239000002621 endocannabinoid Substances 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 238000001952 enzyme assay Methods 0.000 description 1
- 230000008020 evaporation Effects 0.000 description 1
- 239000011888 foil Substances 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 238000005194 fractionation Methods 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 238000010230 functional analysis Methods 0.000 description 1
- 238000012239 gene modification Methods 0.000 description 1
- 230000004077 genetic alteration Effects 0.000 description 1
- 231100000118 genetic alteration Toxicity 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 230000005017 genetic modification Effects 0.000 description 1
- 230000004034 genetic regulation Effects 0.000 description 1
- 235000013617 genetically modified food Nutrition 0.000 description 1
- 230000000762 glandular Effects 0.000 description 1
- 235000019420 glucose oxidase Nutrition 0.000 description 1
- 229940096919 glycogen Drugs 0.000 description 1
- 239000007952 growth promoter Substances 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- 239000010460 hemp oil Substances 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 230000005661 hydrophobic surface Effects 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 230000000415 inactivating effect Effects 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 239000000411 inducer Substances 0.000 description 1
- 238000009776 industrial production Methods 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 238000011081 inoculation Methods 0.000 description 1
- 239000002054 inoculum Substances 0.000 description 1
- 229960003786 inosine Drugs 0.000 description 1
- CDAISMWEOUEBRE-GPIVLXJGSA-N inositol Chemical compound O[C@H]1[C@H](O)[C@@H](O)[C@H](O)[C@H](O)[C@@H]1O CDAISMWEOUEBRE-GPIVLXJGSA-N 0.000 description 1
- 229960000367 inositol Drugs 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 229940119170 jojoba wax Drugs 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- WRUGWIBCXHJTDG-UHFFFAOYSA-L magnesium sulfate heptahydrate Chemical compound O.O.O.O.O.O.O.[Mg+2].[O-]S([O-])(=O)=O WRUGWIBCXHJTDG-UHFFFAOYSA-L 0.000 description 1
- 229940061634 magnesium sulfate heptahydrate Drugs 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 230000014759 maintenance of location Effects 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 230000007102 metabolic function Effects 0.000 description 1
- 239000002207 metabolite Substances 0.000 description 1
- 108091070501 miRNA Proteins 0.000 description 1
- 239000002679 microRNA Substances 0.000 description 1
- 239000013586 microbial product Substances 0.000 description 1
- 238000009629 microbiological culture Methods 0.000 description 1
- 235000019813 microcrystalline cellulose Nutrition 0.000 description 1
- 239000008108 microcrystalline cellulose Substances 0.000 description 1
- 229940016286 microcrystalline cellulose Drugs 0.000 description 1
- 230000003278 mimic effect Effects 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 238000001823 molecular biology technique Methods 0.000 description 1
- 239000002071 nanotube Substances 0.000 description 1
- 239000002070 nanowire Substances 0.000 description 1
- 229910017604 nitric acid Inorganic materials 0.000 description 1
- QJGQUHMNIGDVPM-UHFFFAOYSA-N nitrogen group Chemical group [N] QJGQUHMNIGDVPM-UHFFFAOYSA-N 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- ZQPPMHVWECSIRJ-KTKRTIGZSA-N oleic acid group Chemical group C(CCCCCCC\C=C/CCCCCCCC)(=O)O ZQPPMHVWECSIRJ-KTKRTIGZSA-N 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 125000001477 organic nitrogen group Chemical group 0.000 description 1
- 230000008723 osmotic stress Effects 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 125000001997 phenyl group Chemical group [H]C1=C([H])C([H])=C(*)C([H])=C1[H] 0.000 description 1
- PTMHPRAIXMAOOB-UHFFFAOYSA-L phosphoramidate Chemical compound NP([O-])([O-])=O PTMHPRAIXMAOOB-UHFFFAOYSA-L 0.000 description 1
- 150000003013 phosphoric acid derivatives Chemical class 0.000 description 1
- 238000011020 pilot scale process Methods 0.000 description 1
- 238000007747 plating Methods 0.000 description 1
- 229920002523 polyethylene Glycol 1000 Polymers 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 235000011009 potassium phosphates Nutrition 0.000 description 1
- 235000004252 protein component Nutrition 0.000 description 1
- 238000002331 protein detection Methods 0.000 description 1
- 239000003531 protein hydrolysate Substances 0.000 description 1
- MUPFEKGTMRGPLJ-ZQSKZDJDSA-N raffinose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO[C@@H]2[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O2)O)O1 MUPFEKGTMRGPLJ-ZQSKZDJDSA-N 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 238000004064 recycling Methods 0.000 description 1
- 230000014493 regulation of gene expression Effects 0.000 description 1
- 230000004202 respiratory function Effects 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 150000003290 ribose derivatives Chemical group 0.000 description 1
- 239000001229 ruta graveolens Substances 0.000 description 1
- 235000019515 salmon Nutrition 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- CDAISMWEOUEBRE-UHFFFAOYSA-N scyllo-inosotol Natural products OC1C(O)C(O)C(O)C(O)C1O CDAISMWEOUEBRE-UHFFFAOYSA-N 0.000 description 1
- 238000005204 segregation Methods 0.000 description 1
- 239000006152 selective media Substances 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 235000021309 simple sugar Nutrition 0.000 description 1
- 229910052708 sodium Inorganic materials 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- 239000001488 sodium phosphate Substances 0.000 description 1
- 235000011008 sodium phosphates Nutrition 0.000 description 1
- 230000002269 spontaneous effect Effects 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 210000000130 stem cell Anatomy 0.000 description 1
- 239000008223 sterile water Substances 0.000 description 1
- 230000035882 stress Effects 0.000 description 1
- 235000000346 sugar Nutrition 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
- 238000011191 terminal modification Methods 0.000 description 1
- 150000003505 terpenes Chemical class 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 229960003495 thiamine Drugs 0.000 description 1
- DPJRMOMPQZCRJU-UHFFFAOYSA-M thiamine hydrochloride Chemical compound Cl.[Cl-].CC1=C(CCO)SC=[N+]1CC1=CN=C(C)N=C1N DPJRMOMPQZCRJU-UHFFFAOYSA-M 0.000 description 1
- 235000019190 thiamine hydrochloride Nutrition 0.000 description 1
- 239000011747 thiamine hydrochloride Substances 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-K thiophosphate Chemical compound [O-]P([O-])([O-])=S RYYWUUFWQRZTIU-UHFFFAOYSA-K 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- 231100000816 toxic dose Toxicity 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 238000000844 transformation Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 230000005945 translocation Effects 0.000 description 1
- HRXKRNGNAMMEHJ-UHFFFAOYSA-K trisodium citrate Chemical compound [Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O HRXKRNGNAMMEHJ-UHFFFAOYSA-K 0.000 description 1
- 229940038773 trisodium citrate Drugs 0.000 description 1
- 238000001195 ultra high performance liquid chromatography Methods 0.000 description 1
- 238000000825 ultraviolet detection Methods 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- 229940045136 urea Drugs 0.000 description 1
- 235000013311 vegetables Nutrition 0.000 description 1
- 230000035899 viability Effects 0.000 description 1
- 238000011179 visual inspection Methods 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 229940075420 xanthine Drugs 0.000 description 1
- 239000007222 ypd medium Substances 0.000 description 1
Definitions
- Processes developed for the production of minor cannabinoids typically fall into two categories: 1 ) the extraction and purification from the cannabis plant, and 2) the conversion of a cannabinoid precursor into various products, either chemically or biochemically.
- Processes typically involved in the purification of plant-derived cannabinoids include extraction from the cannabis plant, decarboxylation to convert the acidic cannabinoids by heating the extract, evaporation to concentrate the product, crystallization, and a variety of chromatography steps to remove residual impurities.
- the present disclosure features methods of isolating and purifying one or more cannabinoids from a fermentation composition.
- the methods described herein include for the first time using an oil overlay in combination with a cannabichromenic acid (CBCa) synthase. It has been presently discovered that this combination results in near complete conversion of cannabigerolic acid (CBGa) to CBCa.
- CBCa cannabichromenic acid
- CBGa cannabigerolic acid
- the present disclosure provides variant CBCa synthase polypeptides having one or more amino acid substitutions.
- the variant CBCa synthases described herein show improved enzymatic activity in comparison to the wild-type enzyme.
- the disclosure provides a method of making a cannabichromenic acid (CBCa).
- the method includes (a) culturing a population of host cells capable of producing cannabigerolic acid (CBGa) in a culture medium comprising a fermentation broth and an overlay, under conditions suitable for the host cells to produce CBGa, and wherein the CBGa partitions into the overlay; (b) separating the overlay from the fermentation broth; (c) combining the separated overlay of step (b), with a CBCa synthase, thereby producing a bioconversion mixture; and (d) purifying the CBCa from the bioconversion mixture.
- CBGa cannabigerolic acid
- the overlay comprises a plant-based oil.
- the plant-based oil is selected from soybean oil, sunflower oil, safflower oil, canola oil, grapeseed oil, or castor oil.
- the plant-based oil is sunflower oil.
- the overlay comprises a synthetic ester or a fatty alcohol.
- the CBCa synthase is produced by culturing a population of host cells capable of producing a CBCa synthase in a culture medium and under conditions suitable for the host cells to produce the CBCa synthase, thereby producing the CBCa synthase.
- the overlay and the fermentation broth are separated by centrifugation. In some embodiments, the overlay and the fermentation broth are separated by demulsification.
- the demulsification includes contacting the fermentation broth with an oil.
- the oil includes a mineral oil, a vegetable oil, a synthetic ester, or a fatty alcohol.
- the oil includes a vegetable oil.
- the vegetable oil is soybean oil, sunflower oil, safflower oil, canola oil, grapeseed oil, or castor oil.
- the oil includes a synthetic ester, optionally wherein the synthetic ester is ESTEREXTM A51 .
- the oil includes a fatty alcohol, optionally wherein the fatty alcohol is oleyl alcohol or JARCOLTM 1-16.
- the oil has a concentration of between about 1% (w/v) and about 10% (w/v) (e.g., about 1% (w/v), 2% (w/v), 3% (w/v), 4% (w/v), 5% (w/v), 6% (w/v), 7% (w/v), 8% (w/v), 9% (w/v), or 10% (w/v)). In some embodiments, the oil has a concentration of about 5% (w/v).
- the overlay includes CBGa.
- the CBGa has a concentration of between about 0.1% (w/v) or and 10% (w/v) (e.g., between 0.1% (w/v) and 8% (w/v), 0.1% (w/v) and 6% (w/v), 0.1% (w/v) and 4% (w/v), 0.1% (w/v) and 2% (w/v), 2% (w/v) and 10% (w/v), 4% (w/v) and 10% (w/v), 6% (w/v) and 10% (w/v), or 8% (w/v) and 10% (w/v)).
- the CBGa has a concentration of between about 0.5% (w/v) and 5% (w/v) (e.g., between 0.5% (w/v) and 4% (w/v), 0.5% (w/v) and 3% (w/v), 0.5% (w/v) and 2% (w/v), 0.5% (w/v) and 1% (w/v), 1% (w/v) and 5% (w/v), 2% (w/v) and 5% (w/v), 3% (w/v) and 5% (w/v), or 4% (w/v) and 5% (w/v)).
- 0.5% (w/v) and 5% (w/v) e.g., between 0.5% (w/v) and 4% (w/v), 0.5% (w/v) and 3% (w/v), 0.5% (w/v) and 2% (w/v), 0.5% (w/v), 1% (w/v) and 5% (w/v), or 4% (w/v)
- the method further includes stirring the bioconversion mixture for between 12 hours and 144 hours before performing step (d) (e.g., between 12 hours 120 hours, 12 hours and 96 hours, 12 hours and 72 hours, 12 hours and 48 hours, 12 hours and 24 hours, 24 hours and 144 hours, 48 hours and 144 hours, 72 hours and 144 hours, 96 hours and 144 hours, or 120 hours and 144 hours).
- the method includes stirring the bioconversion mixture for between 24 hours and 96 hours (e.g., between 24 hours and 84 hours, 24 hours and 72 hours, 24 hours and 60 hours, 24 hours and 48 hours, 48 hours and 96 hours, 60 hours and 96 hours, 72 hours and 96 hours, or 84 hours and 96 hours).
- the method includes stirring the bioconversion mixture for about 48 hours.
- the bioconversion mixture is at a temperature of between 4 °C and 50 °C.
- the bioconversion mixture is at a temperature of between 20 °C and 40 °C.
- the bioconversion mixture is at a temperature of about 35 °C.
- the second mixture further includes one or more amphiphilic moieties.
- the one or more amphiphilic moieties includes a cyclodextrin, plant derived silica, cellulose, or a combination thereof.
- the cyclodextrin includes randomly methylated cyclodextrin, 2, 6-Di-O-methyl-p-cyclodextrin, or a combination thereof.
- the demulsification includes one or more of: (i) contacting the culture medium with an enzymatic composition comprising a serine protease, (ii)contacting the culture medium with a surfactant; and (iii) contacting the culture medium with NaOH to adjust the culture medium to a pH of between pH 7 and pH 9.
- the demulsification includes contacting the culture medium with the enzymatic composition including the serine protease.
- the purifying comprises contacting the culture medium with the surfactant.
- the culture medium is contacted with the enzymatic composition or surfactant after the mixture is adjusted to a pH of between about pH 7 and pH 9.
- the culture medium is contacted with the enzymatic composition or surfactant after the culture medium is adjusted to a pH of pH 8.
- the final concentration of the enzymatic composition is between about 0.5% (w/v) to about 3% (w/v) (e.g., between about 0.5% (w/v) and 2% (w/v), 0.5% (w/v) and 1 % (w/v), 1 % (w/v) and 3% (w/v), or 2% (w/v) and 3% (w/v)) after contacting the culture medium including a cannabinoid with the enzymatic composition.
- the culture medium is contacted with the enzymatic composition at a final concentration of about 1 % (w/v). In some embodiments, the culture medium is mixed with the enzymatic composition for between 0.5 hours and 2 hours (e.g., between 0.5 hours and 1 .5 hours, 0.5 hours and 1 hour, 1 hour and 2 hours, or 1 .5 hours and 2 hours).
- the demulsification includes centrifugation of the culture medium. In some embodiments, the centrifugation includes liquid-liquid centrifugation. In some embodiments, the centrifugation results in a crude oil light phase and an aqueous heavy phase. In some embodiments, the demulsification further includes a decarboxylation step comprising evaporating the culture medium. In some embodiments, the decarboxylation includes evaporating the crude oil light phase. In some embodiments, evaporating includes one or more passes. In some embodiments, evaporating includes a first pass and a second pass.
- the first pass is performed at a temperature of between about 100 °C and about 500 °C (e.g., between about 100 °C and 400 °C, 100 °C and 300 °C, 100 °C and 200 °C, 200 °C and 500 °C, 300 °C and 500 °C, or 400 °C and 500 °C). In some embodiments, the first pass is performed at a temperature of about 180 °C.
- the first pass is performed at a pressure of between about 0.5 torr and 760 torr (e.g., between 0.5 torr and 700 torr, 0.5 torr and 500 torr, 0.5 torr and 200 torr, 0.5 torr and 50 torr, 50 torr and 760 torr, 200 torr and 760 torr, 400 torr and 760 torr, or 600 torr and 760 torr). In some embodiments, the first pass is performed at a pressure of about 1 torr.
- the second pass is performed at a temperature of between 150 °C and 300 °C (e.g., between 150 °C and 250 °C, 150 °C and 200 °C, 200 °C and 300 °C, or 250 °C and 300 °C). In some embodiments, the second pass is performed at a temperature of about 240 C.
- the second pass is performed at a pressure of between about 0.5 torr and 760 torr (e.g., between 0.5 torr and 700 torr, 0.5 torr and 500 torr, 0.5 torr and 200 torr, 0.5 torr and 50 torr, 50 torr and 760 torr, 200 torr and 760 torr, 400 torr and 760 torr, or 600 torr and 760 torr). In some embodiments, the second pass is performed at a pressure of about 1 torr.
- purifying includes one or more of a liquid-liquid extraction, chromatography, or saponification.
- the host cells include a nucleic acid sequence encoding a cannabichromenic acid (CBCa) synthase.
- CBCa synthase includes one or more amino acid substitutions relative to the amino acid sequence of SEQ ID NO: 1 .
- the CBCa synthase includes an amino acid substitution at a residue selected from Q75, F82, T130, S140, V169, N240, V294, A299, K305, T335, R340, H354, L435, Y461 , K535, S540, and T545 of SEQ ID NO: 1.
- the one or more amino acid substitutions include an amino acid substitution at residue Q75 of SEQ ID NO: 1 .
- the amino acid substitution at residue Q75 of SEQ ID NO: 1 substitutes Q75 with an amino acid including a hydrophobic, uncharged side chain at physiological pH.
- the amino acid substitution at residue Q75 of SEQ ID NO: 1 is a Q75L substitution.
- the amino acid substitution at residue Q75 of SEQ ID NO: 1 substitutes Q75 with an amino acid including an anionic side chain at physiological pH.
- the amino acid substitution at residue Q75 of SEQ ID NO: 1 is a Q75E substitution.
- the one or more amino acid substitutions include an amino acid substitution at residue S140 of SEQ ID NO: 1 .
- the amino acid substitution at residue S140 of SEQ ID NO: 1 substitutes S140 with an amino acid including a hydrophobic, uncharged side chain at physiological pH.
- the amino acid substitution at residue S140 of SEQ ID NO: 1 is a S140A substitution.
- the amino acid substitution at residue S140 of SEQ ID NO: 1 is a S140T substitution.
- the one or more amino acid substitutions include an amino acid substitution at residue V169 of SEQ ID NO: 1 .
- the amino acid substitution at residue V169 of SEQ ID NO: 1 substitutes V169 with an amino acid including an anionic side chain at physiological pH.
- the amino acid substitution at residue V169 of SEQ ID NO: 1 is a V169E substitution.
- the one or more amino acid substitutions include an amino acid substitution at residue N240 of SEQ ID NO: 1 .
- the amino acid substitution at residue N240 of SEQ ID NO: 1 substitutes N240 with an amino acid including a polar, uncharged side chain at physiological pH.
- the amino acid substitution at residue N240 of SEQ ID NO: 1 is a N240Q substitution.
- the amino acid substitution at residue N240 of SEQ ID NO: 1 substitutes N240 with an amino acid including a hydrophobic, uncharged side chain at physiological pH.
- the amino acid substitution at residue N240 of SEQ ID NO: 1 is a N240M substitution.
- the one or more amino acid substitutions include an amino acid substitution at residue V294 of SEQ ID NO: 1 .
- the amino acid substitution at residue V294 of SEQ ID NO: 1 substitutes V294 with an amino acid including a polar, uncharged side chain at physiological pH.
- the amino acid substitution at residue V294 of SEQ ID NO: 1 is a V294S substitution.
- the amino acid substitution at residue V294 of SEQ ID NO: 1 substitutes V294 with an amino acid including an anionic side chain at physiological pH.
- the amino acid substitution at residue V294 of SEQ ID NO: 1 is a V294E substitution.
- the amino acid substitution at residue V294 of SEQ ID NO: 1 substitutes V294 with an amino acid including a cationic side chain at physiological pH. In some embodiments, the amino acid substitution at residue V294 of SEQ ID NO: 1 is a V294R substitution.
- the one or more amino acid substitutions include an amino acid substitution at residue A299 of SEQ ID NO: 1 .
- the amino acid substitution at residue A299 of SEQ ID NO: 1 substitutes A299 with an amino acid including a hydrophobic, uncharged side chain at physiological pH.
- the amino acid substitution at residue A299 of SEQ ID NO: 1 is an A299V substitution.
- the one or more amino acid substitutions include an amino acid substitution at residue K305 of SEQ ID NO: 1 .
- the amino acid substitution at residue K305 of SEQ ID NO: 1 substitutes K305 with an amino acid including a hydrophobic, uncharged side chain at physiological pH.
- the amino acid substitution at residue K305 of SEQ ID NO: 1 is a K305C substitution.
- the one or more amino acid substitutions include an amino acid substitution at residue D328 of SEQ ID NO: 1 .
- the amino acid substitution at residue D328 of SEQ ID NO: 1 is a D328P substitution.
- the one or more amino acid substitutions include an amino acid substitution at residue T335 of SEQ ID NO: 1 .
- the amino acid substitution at residue T335 of SEQ ID NO: 1 substitutes T335 with an amino acid including a hydrophobic, uncharged side chain at physiological pH.
- the amino acid substitution at residue T335 of SEQ ID NO: 1 is a T335L substitution.
- the one or more amino acid substitutions include an amino acid substitution at residue R340 of SEQ ID NO: 1 .
- the amino acid substitution at residue R340 of SEQ ID NO: 1 substitutes R340 with an amino acid including a hydrophobic, uncharged side chain at physiological pH.
- the amino acid substitution at residue R340 of SEQ ID NO: 1 is a R340M substitution.
- the amino acid substitution at residue R340 of SEQ ID NO: 1 is a R340G substitution.
- the one or more amino acid substitutions include an amino acid substitution at residue H354 of SEQ ID NO: 1 .
- the amino acid substitution at residue H354 of SEQ ID NO: 1 substitutes H354 with an amino acid including a hydrophobic, uncharged side chain at physiological pH.
- the amino acid substitution at residue R340 of SEQ ID NO: 1 is a H354V substitution.
- the one or more amino acid substitutions include an amino acid substitution at residue L435 of SEQ ID NO: 1 .
- the amino acid substitution at residue L435 of SEQ ID NO: 1 substitutes L435 with an amino acid including a hydrophobic, uncharged side chain at physiological pH.
- the amino acid substitution at residue L435 of SEQ ID NO: 1 is a L435A substitution.
- the one or more amino acid substitutions include an amino acid substitution at residue Y461 of SEQ ID NO: 1 .
- the amino acid substitution at residue Y461 of SEQ ID NO: 1 substitutes Y461 with an amino acid including a hydrophobic, uncharged side chain at physiological pH.
- the amino acid substitution at residue Y461 of SEQ ID NO: 1 is a Y4611 substitution.
- the one or more amino acid substitutions include an amino acid substitution at residue K535 of SEQ ID NO: 1 .
- the amino acid substitution at residue K535 of SEQ ID NO: 1 substitutes K535 with an amino acid including a hydrophobic, uncharged side chain at physiological pH.
- the amino acid substitution at residue Y461 of SEQ ID NO: 1 is a K535M substitution.
- the one or more amino acid substitutions include an amino acid substitution at residue S540 of SEQ ID NO: 1 .
- the amino acid substitution at residue S540 of SEQ ID NO: 1 substitutes S540 with an amino acid including an anionic side chain at physiological pH.
- the amino acid substitution at residue S540 of SEQ ID NO: 1 is a S540D substitution.
- the one or more amino acid substitutions include an amino acid substitution at residue T545 of SEQ ID NO: 1 .
- the amino acid substitution at residue T545 of SEQ ID NO: 1 substitutes T545 with an amino acid including an anionic side chain at physiological pH.
- the amino acid substitution at residue T545 of SEQ ID NO: 1 is a T545E substitution.
- the polypeptide has amino acid substitutions including Q75E, F82I, T130L, S140A, V169E, N240M, V294S, A299V, K305C, T335L, R340M, H354V, L435A Y461 I, K535, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, T130L, S140A, V169E, V294S, A299V, K305C, R340M, H354V, L435A, Y461 I, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, T130L, S140A, V169E, N240M, V294S, A299V, T335L, R340M, H354V, L435A, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, T130L, S140A, V169E, N240M, V294S, A299V, T335L, H354V, L435A, Y4611, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, T130L, S140A, V169E, N240M, V294S, A299V, T335L, R340M, H354V, L435A, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, T130L, V169E, N240M, V294S, A299V, K305C, T335L, R340M, L435A, Y4611, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, T130L, S140A, V169E, T335L, R340M, H354V, L435A, Y4611, K535M, and S540D of SEQ ID NO: 1 . In some embodiments, the polypeptide has amino acid substitutions including Q75E, F82I, T130L, S140A, V294S, A299V, K305C, H354V, L435A, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, S140A, V169E, N240M, K305C, T335L, R340M, H354V, L435A, Y4611, and K535M of SEQ ID NO: 1 . In some embodiments, the polypeptide has amino acid substitutions including Q75E, V169E, V294S, A299V, R340M, H354V, L435A, K535M, or S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, T130L, S140A, V169E, V294S, A299V, K305C, T335L, R340M, H354V, L435A, Y4611, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, T130L, V169E, N240M, V294S, A299V, K305C, T335L, R340M, H354V, L435A, Y4611, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, T130L, V169E, V294S, A299V, T335L, H354V, L435A, and S540D of SEQ ID NO: 1 . In some embodiments, the polypeptide has amino acid substitutions including Q75E, F82I, T130L, S140A, V169E, N240M, V294S, A299V, K305C, R340M, H354V, L435A, Y4611, and K535M of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, T130L, S140A, V169E, V294S, A299V, K305C, R340M, H354V, L435A, K535M, and S540D of SEQ ID NO: 1 . In some embodiments, the polypeptide has amino acid substitutions including Q75E, F82I, T130L, V169E, A299V, K305C, R340M, H354V, L435A, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, T130L, V169E, N240M, V294S, T335L, R340M, H354V, L435A, and K535M of SEQ ID NO: 1 . In some embodiments, the polypeptide has amino acid substitutions including Q75E, F82I, S140A, V294S, K305C, T335L, R340M, H354V, L435A, Y461 I, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, T130L, S140A, V169E, N240M, V294S, A299V, T335L, R340M, L435A, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including F82I, T130L, S140A, V169E, N240M, V294S, A299V, K305C, H354V, L435A, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, S140A, V169E, N240M, V294S, A299V, K305C, T335L, R340M, H354V, L435A, Y4611, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, T130L, S140A, V169E, V294S, A299V, K305C, T335L, R340M, H354V, L435A, Y4611, and K535M of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including T130L, S140A, V169E, V294S, A299V, R340M, H354V, L435A, Y4611, K535M, and S540D of SEQ ID NO: 1 . In some embodiments, the polypeptide has amino acid substitutions including Q75E, F82I, T130L, N240M, V294S, A299V, T335L, H354V, L435A, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, S140A, V169E, N240M, V294S, A299V, K305C, T335L, R340M, H354V, L435A, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, T130L, S140A, V169E, N240M, V294S, A299V, K305C, R340M, L435A, Y4611, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, S140A, V169E, N240M, A299V, K305C, R340M, H354V, L435A, Y4611, and K535M of SEQ ID NO: 1 . In some embodiments, the polypeptide has amino acid substitutions including Q75E, F82I, T130L, S140A, V169E, N240M, V294S, A299V, K305C, R340M, H354V, L435A, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, T130L, S140A, V169E, V294S, A299V, K305C, H354V, L435A, Y461 1, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, V169E, V294S, A299V, K305C, T335L, H354V, L435A, Y461 1, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, T130L, S140A, V169E, N240M, V294S, K305C, T335L, R340M, L435A, Y461 1, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including F82I, T130L, S140A, V169E, N240M, V294S, A299V, K305C, T335L, R340M, H354V, L435A, Y461 1, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including F82I, T130L, S140A, V169E, N240M, A299V, T335L, R340M, H354V, L435A, Y461 1, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, S140A, V169E, N240M, V294S, A299V, K305C, T335L, L435A, Y461 1, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, T130L, S140A, V169E, N240M, V294S, T335L, H354V, L435A, Y461 1, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including F82I, T130L, S140A, V169E, N240M, V294S, A299V, K305C, T335L, H354V, L435A, Y461 1, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, T130L, S140A, V169E, N240M, A299V, T335L, R340M, H354V, L435A, Y461 1, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, T130L, S140A, N240M, V294S, A299V, K305C, T335L, R340M, H354V, L435A, Y461 I, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, V169E, N240M, V294S, A299V, K305C, T335L, L435A, Y461 1, and K535M of SEQ ID NO: 1 .
- the CBCa synthase has an amino acid sequence that is from about 85% to about 99.7% identical to the amino acid sequence of SEQ ID NO: 1 (e.g., about 85%, 96%, 87%, 88%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 99.5%, or 99.7%).
- the CBCa synthase has an amino acid sequence that is at least 80% identical to the amino acid sequence of SEQ ID NO: 1 .
- the CBCa synthase has an amino acid sequence that is at least 90% identical to the amino acid sequence of SEQ ID NO: 1 .
- the CBCa synthase has an amino acid sequence that is from about 90% to about 99.7% (e.g., about 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 99.5%, or 99.7%) identical to the amino acid sequence of SEQ ID NO: 1 .
- the CBCa synthase has an amino acid sequence that is from about 95% to about 99.7% (e.g., about 95%, 95.5%, 96%, 96.5%, 97%, 97.5%, 98%, 98.5%, 99%, 99.5%, or 99.7%) identical to the amino acid sequence of SEQ ID NO: 1 .
- the CBCa synthase has an amino acid sequence that differs from the amino acid sequence of SEQ ID NO: 1 only by way of (i) the one or more amino acid substitutions or deletions and, optionally, (ii) one or more additional, conservative amino acid substitutions. In some embodiments, the CBCa synthase has an amino acid sequence that differs from the amino acid sequence of SEQ ID NO: 1 only by way of the one or more amino acid substitutions or deletions. In some embodiments, the CBCa synthase has an amino acid sequence that is at least 85% identical to the amino acid sequence of any one of SEQ ID NO: 2-67.
- the CBCa synthase has an amino acid sequence that is at least 90% identical to the amino acid sequence of any one of SEQ ID NO: 2-67. In some embodiments, the CBCa synthase has an amino acid sequence that is at least 95% identical to the amino acid sequence of any one of SEQ ID NO: 2-67. In some embodiments, the CBCa synthase has the amino acid sequence of any one of SEQ ID NO: 2-67. In some embodiments, the CBCa synthase has an amino acid sequence that is at least 85% identical to the amino acid sequence of any one of SEQ ID NO: 29-67.
- the CBCa synthase has an amino acid sequence that is at least 90% identical to the amino acid sequence of any one of SEQ ID NO: 29-67. In some embodiments, the CBCa synthase has an amino acid sequence that is at least 95% identical to the amino acid sequence of any one of SEQ ID NO: 29-67. In some embodiments, the CBCa synthase has the amino acid sequence of any one of SEQ
- the host cell includes one or more heterologous nucleic acids that each, independently, encode (a) an acyl activating enzyme (AAE), and/or (b) a tetraketide synthase (TKS), and/or (c) a cannabigerolic acid synthase (CBGaS), and/or (d) a geranyl pyrophosphate (GPP) synthase.
- AAE acyl activating enzyme
- TKS tetraketide synthase
- CBGaS cannabigerolic acid synthase
- GPP geranyl pyrophosphate
- the host cell includes heterologous nucleic acids that independently encode (a) an AAE, (b) a TKS, (c) a CBGaS, and (d) a GPP synthase.
- the host cell includes a heterologous nucleic acid that encodes an AAE having an amino acid sequence that is at least 90% identical to the amino acid sequence of any one of SEQ ID NO: 68-91 .
- the AAE has an amino acid sequence that is at least 95% identical to the amino acid sequence of any one of SEQ ID NO: 68-91 .
- the AAE has the amino acid sequence of any one of SEQ ID NO: 68-91 .
- the host cell includes a heterologous nucleic acid that encodes an AAE having an amino acid sequence that is at least 90% identical to the amino acid sequence of any one of SEQ ID NO: 68-80.
- the AAE has an amino acid sequence that is at least 95% identical to the amino acid sequence of any one of SEQ ID NO: 68-80. In some embodiments, the AAE has the amino acid sequence of any one of SEQ ID NO: 68-80. In some embodiments, the host cell includes a heterologous nucleic acid that encodes an AAE having an amino acid sequence that is at least 90% identical to the amino acid sequence of any one of SEQ ID NO: 68-72. In some embodiments, the AAE has an amino acid sequence that is at least 95% identical to the amino acid sequence of any one of SEQ ID NO: 68-72. In some embodiments, the AAE has the amino acid sequence of any one of SEQ ID NO: 68-72.
- the host cell includes a heterologous nucleic acid that encodes a TKS having an amino acid sequence that is at least 90% identical to the amino acid sequence of any one of SEQ ID NO: 92-126. In some embodiments, the TKS has an amino acid sequence that is at least 95% identical to the amino acid sequence of any one of SEQ ID NO: 92-126. In some embodiments, the TKS has the amino acid sequence of any one of SEQ ID NO: 92-126. In some embodiments, the host cell includes a heterologous nucleic acid that encodes a TKS having an amino acid sequence that is at least 90% identical to the amino acid sequence of any one of SEQ ID NO: 92-95.
- the TKS has an amino acid sequence that is at least 95% identical to the amino acid sequence of any one of SEQ ID NO: 92-95. In some embodiments, the TKS has the amino acid sequence of any one of SEQ ID NO: 92-95. In some embodiments, the host cell includes a heterologous nucleic acid that encodes a TKS having an amino acid sequence that is at least 90% identical to the amino acid sequence of SEQ ID NO: 92. In some embodiments, the TKS has an amino acid sequence that is at least 95% identical to the amino acid sequence of SEQ ID NO: 92, optionally wherein the TKS has the amino acid sequence of SEQ ID NO: 92.
- the host cell includes a heterologous nucleic acid that encodes a CBGaS having an amino acid sequence that is at least 90% identical to the amino acid sequence of any one of SEQ ID NO: 127-131 .
- the CBGaS has an amino acid sequence that is at least 95% identical to the amino acid sequence of any one of SEQ ID NO: 127-131 .
- the CBGaS has the amino acid sequence of any one of SEQ ID NO: 127-131 .
- the host cell includes a heterologous nucleic acid encoding a GPP synthase having an amino acid sequence that is at least 90% identical to the amino acid sequence of any one of SEQ ID NO: 132-137. In some embodiments, the GPP synthase has an amino acid sequence that is at least 95% identical to the amino acid sequence of any one of SEQ ID NO: 132- 137. In some embodiments, the GPP has the amino acid sequence of any one of SEQ ID NO: 132- 137. In some embodiments, the host cell includes a heterologous nucleic acid encoding a GPP synthase having an amino acid sequence that is at least 90% identical to the amino acid sequence of SEQ ID NO: 132. In some embodiments, the GPP synthase has an amino acid sequence that is at least 95% identical to the amino acid sequence of SEQ ID NO: 132. In some embodiments, the GPP has the amino acid sequence of SEQ ID NO: 132.
- the host cell includes heterologous nucleic acids that independently encode an AAE having the amino acid sequence of any one of SEQ ID NO: 68-91 , a TKS having the amino acid sequence of any one of SEQ ID NO: 92-126, a CBGaS having the amino acid sequences of any one of SEQ ID NO: 127-131 , and a GPP synthase having the amino acid sequence of any one of SEQ ID NO: 132-137.
- the host cell further includes one or more heterologous nucleic acids that each, independently, encode an enzyme of the mevalonate biosynthetic pathway, wherein the enzyme is selected from an acetyl-CoA thiolase, an HMG-CoA synthase, an HMG-CoA reductase, a mevalonate kinase, a phosphomevalonate kinase, a mevalonate pyrophosphate decarboxylase, and an IPP:DMAPP isomerase.
- the enzyme is selected from an acetyl-CoA thiolase, an HMG-CoA synthase, an HMG-CoA reductase, a mevalonate kinase, a phosphomevalonate kinase, a mevalonate pyrophosphate decarboxylase, and an IPP:DMAPP isomerase.
- the host cell includes heterologous nucleic acids that independently encode an acetyl-CoA thiolase, an HMG-CoA synthase, an HMG-CoA reductase, a mevalonate kinase, a phosphomevalonate kinase, a mevalonate pyrophosphate decarboxylase, and an IPP:DMAPP isomerase.
- the host cell further includes a heterologous nucleic acid that encodes an olivetolic acid cyclase (OAC).
- OAC olivetolic acid cyclase
- the OAC has an amino acid sequence that is at least 90% identical to the amino acid sequence of SEQ ID NO: 138.
- the OAC has an amino acid sequence that is at least 95% identical to the amino acid sequence of SEQ ID NO: 138.
- the OAC has the amino acid sequence of SEQ ID NO: 138.
- the host cell further includes one or more heterologous nucleic acids that each, independently, encode an acetyl-CoA synthase, and/or an aldehyde dehydrogenase, and/or a pyruvate decarboxylase.
- the acetyl-CoA synthase has an amino acid sequence that is at least 90% identical to the amino acid sequence of SEQ ID NO: 139.
- the acetyl-CoA synthase has an amino acid sequence that is at least 95% identical to the amino acid sequence of SEQ ID NO: 139.
- the acetyl-CoA synthase has the amino acid sequence of SEQ ID NO: 139.
- the acetyl-CoA synthase has an amino acid sequence that is at least 90% identical to the amino acid sequence of SEQ ID NO: 140. In some embodiments, the acetyl-CoA synthase has an amino acid sequence that is at least 95% identical to the amino acid sequence of SEQ ID NO: 140. In some embodiments, the acetyl-CoA synthase has the amino acid sequence of SEQ ID NO: 140. In some embodiments, the aldehyde dehydrogenase has an amino acid sequence that is at least 90% identical to the amino acid sequence of SEQ ID NO: 141 .
- the aldehyde dehydrogenase has an amino acid sequence that is at least 95% identical to the amino acid sequence of SEQ ID NO: 141 . In some embodiments, the aldehyde dehydrogenase synthase has the amino acid sequence of SEQ ID NO: 141 . In some embodiments, the pyruvate decarboxylase has an amino acid sequence that is at least 90% identical to the amino acid sequence of SEQ ID NO: 142. In some embodiments, the pyruvate decarboxylase has an amino acid sequence that is at least 95% identical to the amino acid sequence of SEQ ID NO: 142. In some embodiments, the pyruvate decarboxylase has the amino acid sequence of SEQ ID NO: 142.
- expression of the one or more heterologous nucleic acids is regulated by an exogenous agent.
- the exogenous agent includes a regulator of gene expression.
- the exogenous agent decreases production of the cannabinoid.
- the exogenous agent is maltose.
- the exogenous agent increases production of the cannabinoid.
- the exogenous agent is galactose.
- the exogenous agent is galactose and expression of one or more heterologous nucleic acids encoding the AAE, TKS, and CBGaS enzymes is under the control of a GAL promoter.
- expression of one or more heterologous nucleic acids encoding the AAE, TKS, and CBGaS enzymes is under the control of a galactose-responsive promoter, a maltose-responsive promoter, or a combination of both.
- the method further includes culturing the host cell with a precursor required to make the cannabinoid.
- the precursor required to make the cannabinoid is hexanoate.
- the cannabinoid is cannabichromene (CBC), cannabinol (CBN), cannabidivarin (CBDV), tetrahydrocannabivarin (THCV), cannabichromevarin (CBCV), cannabidiolic acid (CBDA), cannabidiol (CBD) or an acid form thereof, cannabigerolic acid (CBGA), cannabigerol (CBG) or an acid form thereof, tetrahydrocannabinol (THC) or an acid form thereof, or tetrahydrocannabinolic acid (THCa).
- the cannabinoid is CBC.
- the CBC has a purity of at least about 50% (w/v) following purifying. In some embodiments, the CBC has a purity of between about 50% (w/w) and 100% (w/w) (e.g., between 50% (w/w) and 80 % (w/w), 50% (w/w) and 60% (w/w), 60% (w/w) and 100% (w/w), or 80% (w/w) and 100% (w/w)) following purifying. In some embodiments, the CBC has a purity of about 70% (w/w).
- the host cell is a yeast cell or yeast strain.
- the yeast cell is S. cerevisiae.
- the disclosure provides a composition including CBC produced by any one of the methods described herein.
- the CBC has a purity of at least about 50% (w/w).
- the CBC has a purity of between about 50% (w/w) to about 100% (w/w) (e.g., between 50% (w/w) and 80 % (w/w), 50% (w/w) and 60% (w/w), 60% (w/w) and 100% (w/w), or 80% (w/w) and 100% (w/w)).
- the CBC has a purity of between about 70% (w/w) to about 100% (w/w) (e.g., between 70% (w/w) and 90% (w/w), 70% (w/w) and 80% (w/w), 80% (w/w) and 100% (w/w), or 90% (w/w) and 100% (w/w)). In some embodiments, the CBC has a purity of from about 99.5% (w/w) to about 100% (w/w) (e.g., about 99.5% (w/w), 99.6% (w/w), 99.7% (w/w), 99.8% (w/w), or 99.9% (w/w)).
- the disclosure provides a variant CBCa synthase polypeptide including one or more amino acid substitutions relative to the amino acid sequence of SEQ ID NO: 1 .
- the CBCa synthase includes an amino acid substitution at a residue selected from Q75, F82, T130, S140, V169, N240, V294, A299, K305, T335, R340, H354, L435, Y461 , K535, S540, of SEQ ID NO: 1 .
- the one or more amino acid substitutions include an amino acid substitution at residue Q75 of SEQ ID NO: 1 .
- the amino acid substitution at residue Q75 of SEQ ID NO: 1 substitutes Q75 with an amino acid including a hydrophobic, uncharged side chain at physiological pH.
- the amino acid substitution at residue Q75 of SEQ ID NO: 1 is a Q75L substitution.
- the amino acid substitution at residue Q75 of SEQ ID NO: 1 substitutes Q75 with an amino acid including an anionic side chain at physiological pH.
- the amino acid substitution at residue Q75 of SEQ ID NO: 1 is a Q75E substitution.
- the one or more amino acid substitutions include an amino acid substitution at residue S140 of SEQ ID NO: 1 .
- the amino acid substitution at residue S140 of SEQ ID NO: 1 substitutes S140 with an amino acid including a hydrophobic, uncharged side chain at physiological pH.
- the amino acid substitution at residue S140 of SEQ ID NO: 1 is a S140A substitution.
- the amino acid substitution at residue S140 of SEQ ID NO: 1 is a S140T substitution.
- the one or more amino acid substitutions include an amino acid substitution at residue V169 of SEQ ID NO: 1 .
- the amino acid substitution at residue V169 of SEQ ID NO: 1 substitutes V169 with an amino acid including an anionic side chain at physiological pH.
- the amino acid substitution at residue V169 of SEQ ID NO: 1 is a V169E substitution.
- the one or more amino acid substitutions include an amino acid substitution at residue N240 of SEQ ID NO: 1 .
- the amino acid substitution at residue N240 of SEQ ID NO: 1 substitutes N240 with an amino acid including a polar, uncharged side chain at physiological pH.
- the amino acid substitution at residue N240 of SEQ ID NO: 1 is a N240Q substitution.
- the amino acid substitution at residue N240 of SEQ ID NO: 1 substitutes N240 with an amino acid including a hydrophobic, uncharged side chain at physiological pH.
- the amino acid substitution at residue N240 of SEQ ID NO: 1 is a N240M substitution.
- the one or more amino acid substitutions include an amino acid substitution at residue V294 of SEQ ID NO: 1 .
- the amino acid substitution at residue V294 of SEQ ID NO: 1 substitutes V294 with an amino acid including a polar, uncharged side chain at physiological pH.
- the amino acid substitution at residue V294 of SEQ ID NO: 1 is a V294S substitution.
- the amino acid substitution at residue V294 of SEQ ID NO: 1 substitutes V294 with an amino acid including an anionic side chain at physiological pH.
- the amino acid substitution at residue V294 of SEQ ID NO: 1 is a V294E substitution.
- the amino acid substitution at residue V294 of SEQ ID NO: 1 substitutes V294 with an amino acid including a cationic side chain at physiological pH. In some embodiments, the amino acid substitution at residue V294 of SEQ ID NO: 1 is a V294R substitution.
- the one or more amino acid substitutions include an amino acid substitution at residue A299 of SEQ ID NO: 1 .
- the amino acid substitution at residue A299 of SEQ ID NO: 1 substitutes A299 with an amino acid including a hydrophobic, uncharged side chain at physiological pH.
- the amino acid substitution at residue A299 of SEQ ID NO: 1 is an A299V substitution.
- the one or more amino acid substitutions include an amino acid substitution at residue K305 of SEQ ID NO: 1 .
- the amino acid substitution at residue K305 of SEQ ID NO: 1 substitutes K305 with an amino acid including a hydrophobic, uncharged side chain at physiological pH.
- the amino acid substitution at residue K305 of SEQ ID NO: 1 is a K305C substitution.
- the one or more amino acid substitutions include an amino acid substitution at residue D328 of SEQ ID NO: 1 .
- the amino acid substitution at residue D328 of SEQ ID NO: 1 is a D328P substitution.
- the one or more amino acid substitutions include an amino acid substitution at residue T335 of SEQ ID NO: 1 .
- the amino acid substitution at residue T335 of SEQ ID NO: 1 substitutes T335 with an amino acid including a hydrophobic, uncharged side chain at physiological pH.
- the amino acid substitution at residue T335 of SEQ ID NO: 1 is a T335L substitution.
- the one or more amino acid substitutions include an amino acid substitution at residue R340 of SEQ ID NO: 1 .
- the amino acid substitution at residue R340 of SEQ ID NO: 1 substitutes R340 with an amino acid including a hydrophobic, uncharged side chain at physiological pH.
- the amino acid substitution at residue R340 of SEQ ID NO: 1 is a R340M substitution.
- the amino acid substitution at residue R340 of SEQ ID NO: 1 is a R340G substitution.
- the one or more amino acid substitutions include an amino acid substitution at residue H354 of SEQ ID NO: 1 .
- the amino acid substitution at residue H354 of SEQ ID NO: 1 substitutes H354 with an amino acid including a hydrophobic, uncharged side chain at physiological pH.
- the amino acid substitution at residue R340 of SEQ ID NO: 1 is a H354V substitution.
- the one or more amino acid substitutions include an amino acid substitution at residue L435 of SEQ ID NO: 1 .
- the amino acid substitution at residue L435 of SEQ ID NO: 1 substitutes L435 with an amino acid including a hydrophobic, uncharged side chain at physiological pH.
- the amino acid substitution at residue L435 of SEQ ID NO: 1 is a L435A substitution.
- the one or more amino acid substitutions include an amino acid substitution at residue Y461 of SEQ ID NO: 1 .
- the amino acid substitution at residue Y461 of SEQ ID NO: 1 substitutes Y461 with an amino acid including a hydrophobic, uncharged side chain at physiological pH.
- the amino acid substitution at residue Y461 of SEQ ID NO: 1 is a Y4611 substitution.
- the one or more amino acid substitutions include an amino acid substitution at residue K535 of SEQ ID NO: 1 .
- the amino acid substitution at residue K535 of SEQ ID NO: 1 substitutes K535 with an amino acid including a hydrophobic, uncharged side chain at physiological pH.
- the amino acid substitution at residue Y461 of SEQ ID NO: 1 is a K535M substitution.
- the one or more amino acid substitutions include an amino acid substitution at residue S540 of SEQ ID NO: 1 .
- the amino acid substitution at residue S540 of SEQ ID NO: 1 substitutes S540 with an amino acid including an anionic side chain at physiological pH.
- the amino acid substitution at residue S540 of SEQ ID NO: 1 is a S540D substitution.
- the one or more amino acid substitutions include an amino acid substitution at residue T545 of SEQ ID NO: 1 .
- the amino acid substitution at residue T545 of SEQ ID NO: 1 substitutes T545 with an amino acid including an anionic side chain at physiological pH.
- the amino acid substitution at residue T545 of SEQ ID NO: 1 is a T545E substitution.
- the polypeptide has amino acid substitutions including Q75E, F82I, T130L, S140A, V169E, N240M, V294S, A299V, K305C, T335L, R340M, H354V, L435A Y461 I, K535, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, T130L, S140A, V169E, V294S, A299V, K305C, R340M, H354V, L435A, Y461 I, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, T130L, S140A, V169E, N240M, V294S, A299V, T335L, R340M, H354V, L435A, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, T130L, S140A, V169E, N240M, V294S, A299V, T335L, H354V, L435A, Y4611, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, T130L, S140A, V169E, N240M, V294S, A299V, T335L, R340M, H354V, L435A, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, T130L, V169E, N240M, V294S, A299V, K305C, T335L, R340M, L435A, Y4611, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, T130L, S140A, V169E, T335L, R340M, H354V, L435A, Y4611, K535M, and S540D of SEQ ID NO: 1 . In some embodiments, the polypeptide has amino acid substitutions including Q75E, F82I, T130L, S140A, V294S, A299V, K305C, H354V, L435A, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, S140A, V169E, N240M, K305C, T335L, R340M, H354V, L435A, Y4611, and K535M of SEQ ID NO: 1 . In some embodiments, the polypeptide has amino acid substitutions including Q75E, V169E, V294S, A299V, R340M, H354V, L435A, K535M, or S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, T130L, S140A, V169E, V294S, A299V, K305C, T335L, R340M, H354V, L435A, Y4611, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, T130L, V169E, N240M, V294S, A299V, K305C, T335L, R340M, H354V, L435A, Y4611, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, T130L, V169E, V294S, A299V, T335L, H354V, L435A, and S540D of SEQ ID NO: 1 . In some embodiments, the polypeptide has amino acid substitutions including Q75E, F82I, T130L, S140A, V169E, N240M, V294S, A299V, K305C, R340M, H354V, L435A, Y4611, and K535M of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, T130L, S140A, V169E, V294S, A299V, K305C, R340M, H354V, L435A, K535M, and S540D of SEQ ID NO: 1 . In some embodiments, the polypeptide has amino acid substitutions including Q75E, F82I, T130L, V169E, A299V, K305C, R340M, H354V, L435A, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, T130L, V169E, N240M, V294S, T335L, R340M, H354V, L435A, and K535M of SEQ ID NO: 1 . In some embodiments, the polypeptide has amino acid substitutions including Q75E, F82I, S140A, V294S, K305C, T335L, R340M, H354V, L435A, Y461 I, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, T130L, S140A, V169E, N240M, V294S, A299V, T335L, R340M, L435A, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including F82I, T130L, S140A, V169E, N240M, V294S, A299V, K305C, H354V, L435A, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, S140A, V169E, N240M, V294S, A299V, K305C, T335L, R340M, H354V, L435A, Y4611, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, T130L, S140A, V169E, V294S, A299V, K305C, T335L, R340M, H354V, L435A, Y4611, and K535M of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including T130L, S140A, V169E, V294S, A299V, R340M, H354V, L435A, Y4611, K535M, and S540D of SEQ ID NO: 1 . In some embodiments, the polypeptide has amino acid substitutions including Q75E, F82I, T130L, N240M, V294S, A299V, T335L, H354V, L435A, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, S140A, V169E, N240M, V294S, A299V, K305C, T335L, R340M, H354V, L435A, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, T130L, S140A, V169E, N240M, V294S, A299V, K305C, R340M, L435A, Y461 1, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, S140A, V169E, N240M, A299V, K305C, R340M, H354V, L435A, Y461 1, and K535M of SEQ ID NO: 1 . In some embodiments, the polypeptide has amino acid substitutions including Q75E, F82I, T130L, S140A, V169E, N240M, V294S, A299V, K305C, R340M, H354V, L435A, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, T130L, S140A, V169E, V294S, A299V, K305C, H354V, L435A, Y461 1, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, V169E, V294S, A299V, K305C, T335L, H354V, L435A, Y461 1, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, T130L, S140A, V169E, N240M, V294S, K305C, T335L, R340M, L435A, Y461 1, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including F82I, T130L, S140A, V169E, N240M, V294S, A299V, K305C, T335L, R340M, H354V, L435A, Y461 1, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including F82I, T130L, S140A, V169E, N240M, A299V, T335L, R340M, H354V, L435A, Y461 1, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, S140A, V169E, N240M, V294S, A299V, K305C, T335L, L435A, Y461 1, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, T130L, S140A, V169E, N240M, V294S, T335L, H354V, L435A, Y461 1, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including F82I, T130L, S140A, V169E, N240M, V294S, A299V, K305C, T335L, H354V, L435A, Y461 1, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, T130L, S140A, V169E, N240M, A299V, T335L, R340M, H354V, L435A, Y461 1, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, F82I, T130L, S140A, N240M, V294S, A299V, K305C, T335L, R340M, H354V, L435A, Y461 I, K535M, and S540D of SEQ ID NO: 1 .
- the polypeptide has amino acid substitutions including Q75E, V169E, N240M, V294S, A299V, K305C, T335L, L435A, Y461 1, and K535M of SEQ ID NO: 1 .
- the CBCa synthase comprises an amino acid substitution at a residue selected from Q75, F82, T130, S140, V169, N240, V294, A299, K305, T335, R340, H354, L435, Y461 , K535, S540, and T545 of SEQ ID NO: 1
- the CBCa synthase has an amino acid sequence that is from about 85% to about 99.7% identical to the amino acid sequence of SEQ ID NO: 1 (e.g., about 85%, 96%, 87%, 88%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 99.5%, or 99.7%).
- the CBCa synthase has an amino acid sequence that is at least 80% identical to the amino acid sequence of SEQ ID NO: 1 .
- the CBCa synthase has an amino acid sequence that is at least 90% identical to the amino acid sequence of SEQ ID NO: 1 .
- the CBCa synthase has an amino acid sequence that is from about 90% to about 99.7% (e.g., about 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 99.5%, or 99.7%) identical to the amino acid sequence of SEQ ID NO: 1 .
- the CBCa synthase has an amino acid sequence that is from about 95% to about 99.7% (e.g., about 95%, 95.5%, 96%, 96.5%, 97%, 97.5%, 98%, 98.5%, 99%, 99.5%, or 99.7%) identical to the amino acid sequence of SEQ ID NO: 1.
- the CBCa synthase has an amino acid sequence that differs from the amino acid sequence of SEQ ID NO: 1 only by way of (i) the one or more amino acid substitutions or deletions and, optionally, (ii) one or more additional, conservative amino acid substitutions. In some embodiments, the CBCa synthase has an amino acid sequence that differs from the amino acid sequence of SEQ ID NO: 1 only by way of the one or more amino acid substitutions or deletions.
- the CBCa synthase includes an amino acid sequence selected from SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11 , SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21 , SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, and SEQ ID NO: 28.
- the CBCa synthase has an amino acid sequence that is at least 85% identical to the amino acid sequence of any one of SEQ ID NO: 2-67. In some embodiments, the CBCa synthase has an amino acid sequence that is at least 90% identical to the amino acid sequence of any one of SEQ ID NO: 2-67. In some embodiments, the CBCa synthase has an amino acid sequence that is at least 95% identical to the amino acid sequence of any one of SEQ ID NO: 2-67. In some embodiments, the CBCa synthase has the amino acid sequence of any one of SEQ ID NO: 2-67.
- the CBCa synthase has an amino acid sequence that is at least 85% identical to the amino acid sequence of any one of SEQ ID NO: 29-67. In some embodiments, the CBCa synthase has an amino acid sequence that is at least 90% identical to the amino acid sequence of any one of SEQ ID NO: 29-67. In some embodiments, the CBCa synthase has an amino acid sequence that is at least 95% identical to the amino acid sequence of any one of SEQ ID NO: 29-67.
- the CBCa synthase has the amino acid sequence selected from SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31 , SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 40, SEQ ID NO: 41 , SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51 , SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 57, SEQ ID NO: 58, SEQ ID NO: 59, SEQ ID NO: 60, SEQ ID NO: 61
- the disclosure provides a variant CBCa synthase polypeptide, wherein the polypeptide is capable producing a yield of CBCa concentration of at least 50 mg/L.
- the disclosure provides a nucleic acid encoding any one of the variant polypeptides described herein.
- the disclosure provides a host cell including any one of the variant polypeptides or nucleic acids described herein.
- FIG. 1 is a graph showing a schematic of the Production process for generating cannabichromene (CBC) by combining cannabigerolic acid (CBGa) in high oleic sunflower oil (HOSUN) overlay with aqueous CBCa synthase, chemically or mechanically demulsifying and decarboxylating CBCa, distilling the separated overlay to remove HOSUN, and purifying the resulting material.
- CBDa cannabigerolic acid
- HOSUN high oleic sunflower oil
- FIG. 2 is a graph showing the bioconversion time course titers of cannabichromenic acid (CBCa), CBGa, and cannabigerol (CBG).
- CBCa cannabichromenic acid
- CBGa cannabigerol
- FIG. 3 is a graph showing the reaction rate as a function of CBGa concentration in the reaction and in the overlay for a given CBGa reaction, as the overlay % was decreased as the CBGa concentration in the overlay increased.
- FIG. 4 is a graph showing the changeover base case bioconversion for BiosilicaTM (silica derived from sugar cane bagasse) (purple square), microcrystalline cellulose, randomly methylated cyclodextrin, and 2,6-Di-O-methyl-beta-cyclodextrin.
- FIG. 5 is a graph showing the reaction rates of the recycle experiments described in Example 1 without cyclodextrin and with cyclodextrin.
- FIG. 6 is a graph showing the concentration of CBCa produced by CBGa synthase enzymes having a point mutation in comparison to the wild-type CBCa synthase.
- FIG. 7 is a graph showing the concentration of CBCa produced by CBGa synthase enzymes having a combination of point mutations in comparison to the wild-type CBCa synthase.
- bioconversion mixture refers to a composition including a substrate, for example, cannabigerolic acid (CBGa), and an enzyme capable of transforming the substrate, for example, cannabichromenic acid (CBCa) synthase, to generate a product, for example, cannabichromenic acid (CBCa).
- CBDa cannabigerolic acid
- CBCa cannabichromenic acid synthase
- cannabinoid refers to a chemical substance that binds or interacts with a cannabinoid receptor (for example, a human cannabinoid receptor) and includes, without limitation, chemical compounds such endocannabinoids, phytocannabinoids, and synthetic cannabinoids. Synthetic compounds are chemicals made to mimic phytocannabinoids which are naturally found in the cannabis plant (e.g., Cannabis sativa).
- Cannabinoids includes but not limited to cannabigerols (CBG), cannabichromens (CBC), cannabidiol (CBD), tetrahydrocannabinol (THC), cannabinol (CBN), cannabinodiol (CBDL), cannabicyclol (CBL), cannabielsoin (CBE), cannabitriol (CBT), cannabinol (CBN), cannabichromene (CBC), cannabidioloic acid (CBDA), cannabigerolic acid (CBGA), tetrahydrocannabinolic acid (THCA), cannabinolic acid (CBNA), cannabidivarin (CBDV), tetrahydrocannabivarin (THCV), cannabigerovarin (CBGV), cannabichromevarin (CBCV), and others.
- CBD cannabigerols
- CBD cannabidiol
- the term “capable of producing” refers to a host cell that is genetically modified to express the enzyme(s) necessary for the production of a given compound in accordance with a biochemical pathway that produces the compound.
- a host cell e.g., a yeast cell
- a cannabinoid is one that expresses the enzymes necessary for production of the cannabinoid according to the cannabinoid biosynthetic pathway.
- endogenous describes a molecule (e.g., a polypeptide, nucleic acid, or cofactor) that is found naturally in a particular organism (e.g., a human) or in a particular location within an organism (e.g., an organ, a tissue, or a cell, such as a human cell).
- a particular organism e.g., a human
- a particular location within an organism e.g., an organ, a tissue, or a cell, such as a human cell.
- exogenous describes a molecule (e.g., a polypeptide, nucleic acid, or cofactor) that is not found naturally in a particular organism (e.g., a human) or in a particular location within an organism (e.g., an organ, a tissue, or a cell, such as a human cell).
- Exogenous materials include those that are provided from an external source to an organism or to cultured matter extracted there from.
- the term "express” refers to any one or more of the following events: (1 ) production of an RNA template from a DNA sequence (e.g., by transcription); (2) processing of an RNA transcript (e.g., by splicing, editing, 5' cap formation, and/or 3' end processing); (3) translation of an RNA into a polypeptide or protein; and (4) post-translational modification of a polypeptide or protein.
- Expression of a gene of interest in a cell, tissue sample, or subject can manifest, for example, as: an increase in the quantity or concentration of mRNA encoding a corresponding protein (as assessed, e.g., using RNA detection procedures described herein or known in the art, such as quantitative polymerase chain reaction (qPCR) and RNA seq techniques), an increase in the quantity or concentration of a corresponding protein (as assessed, e.g., using protein detection methods described herein or known in the art, such as enzyme-linked immunosorbent assays (ELISA), among others), and/or an increase in the activity of a corresponding protein (e.g., in the case of an enzyme, as assessed using an enzymatic activity assay described herein or known in the art).
- RNA detection procedures described herein or known in the art such as quantitative polymerase chain reaction (qPCR) and RNA seq techniques
- qPCR quantitative polymerase chain reaction
- RNA seq techniques an increase in the quantity or concentration of a corresponding protein (
- expression cassette or “expression construct” refers to a nucleic acid construct that, when introduced into a host cell, results in transcription and/or translation of an RNA or polypeptide, respectively.
- expression of transgenes one of skill will recognize that the inserted polynucleotide sequence need not be identical but may be only substantially identical to a sequence of the gene from which it was derived. As explained herein, these substantially identical variants are specifically covered by reference to a specific nucleic acid sequence.
- an expression cassette is a polynucleotide construct that includes a polynucleotide sequence encoding a polypeptide for use in the invention operably linked to a promoter, e.g., its native promoter, where the expression cassette is introduced into a heterologous microorganism.
- an expression cassette includes a polynucleotide sequence encoding a polypeptide of the invention where the polynucleotide that is targeted to a position in the genome of a microorganism such that expression of the polynucleotide sequence is driven by a promoter that is present in the microorganism.
- the term “gene” refers to the segment of DNA involved in producing or encoding a polypeptide chain. It may include regions preceding and following the coding region (leader and trailer) as well as intervening sequences (introns) between individual coding segments (exons). Alternatively, the term “gene” can refer to the segment of DNA involved in producing or encoding a non-translated RNA, such as an rRNA, tRNA, gRNA, or micro RNA.
- the term “fermentation broth” refers to a composition which contains host cells and products, or metabolites produced by the host cells.
- An example of a fermentation broth is a whole cell broth, which may be the entire contents of a vessel, including cells, aqueous phase, and compounds produced from the host cells.
- a “genetic pathway” or “biosynthetic pathway” as used herein refers to a set of at least two different coding sequences, where the coding sequences encode enzymes that catalyze different parts of a synthetic pathway to form a desired product (e.g., a cannabinoid).
- a first encoded enzyme uses a substrate to make a first product which in turn is used as a substrate for a second encoded enzyme to make a second product.
- the genetic pathway includes 3 or more members (e.g., 3, 4, 5, 6, 7, 8, 9, etc.), wherein the product of one encoded enzyme is the substrate for the next enzyme in the synthetic pathway.
- host cell refers to a microorganism, such as yeast, and includes an individual cell or cell culture including a heterologous vector or heterologous polynucleotide as described herein.
- Host cells include progeny of a single host cell, and the progeny may not necessarily be completely identical (in morphology or in total DNA complement) to the original parent cell due to natural, accidental, or deliberate mutation and/or change.
- a host cell includes cells into which a recombinant vector or a heterologous polynucleotide of the invention has been introduced, including by transformation, transfection, and the like.
- Percent (%) sequence identity with respect to a reference polynucleotide or polypeptide sequence is defined as the percentage of nucleic acids or amino acids in a candidate sequence that are identical to the nucleic acids or amino acids in the reference polynucleotide or polypeptide sequence, after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent sequence identity. Alignment for purposes of determining percent nucleic acid or amino acid sequence identity can be achieved in various ways that are within the capabilities of one of skill in the art, for example, using publicly available computer software such as BLAST, BLAST-2, or Megalign software.
- percent sequence identity values may be generated using the sequence comparison computer program BLAST.
- percent sequence identity of a given nucleic acid or amino acid sequence, A, to, with, or against a given nucleic acid or amino acid sequence, B, (which can alternatively be phrased as a given nucleic acid or amino acid sequence, A that has a certain percent sequence identity to, with, or against a given nucleic acid or amino acid sequence, B) is calculated as follows:
- nucleic acid or amino acid sequence A is not equal to the length of nucleic acid or amino acid.
- polynucleotide and nucleic acid are used interchangeably and refer to a single or double-stranded polymer of deoxyribonucleotide or ribonucleotide bases read from the 5' to the 3' end.
- a nucleic acid as used in the present disclosure will generally contain phosphodiester bonds, although in some cases, nucleic acid analogs may be used that may have alternate backbones, including, e.g., phosphoramidate, phosphorothioate, phosphorodithioate, or O- methylphosphoroamidite linkages (see Eckstein, Oligonucleotides and Analogues: A Practical Approach, Oxford University Press); positive backbones; non-ionic backbones, and non-ribose backbones. Nucleic acids or polynucleotides may also include modified nucleotides that permit correct read-through by a polymerase.
- Polynucleotide sequence or “nucleic acid sequence” includes both the sense and antisense strands of a nucleic acid as either individual single strands or in a duplex. As will be appreciated by those in the art, the depiction of a single strand also defines the sequence of the complementary strand; thus, the sequences described herein also provide the complement of the sequence. Unless otherwise indicated, a particular nucleic acid sequence also implicitly encompasses variants thereof (e.g., degenerate codon substitutions) and complementary sequences, as well as the sequence explicitly indicated.
- the nucleic acid may be DNA, both genomic and cDNA, RNA, or a hybrid, where the nucleic acid may contain combinations of deoxyribo- and ribonucleotides, and combinations of bases, including uracil, adenine, thymine, cytosine, guanine, inosine, xanthine hypoxanthine, isocytosine, isoguanine, etc. Nucleic acid sequences are presented in the 5’ to 3’ direction unless otherwise specified.
- polypeptide As used herein, the terms “polypeptide,” “peptide,” and “protein” are used interchangeably to refer to a polymer of amino acid residues. The terms encompass amino acid chains of any length, including full-length proteins, wherein the amino acid residues are linked by covalent peptide bonds.
- precursor cannabinoid refers to a small molecule which by way of an enzymatic reaction is transformed to generate the cannabinoid of interest.
- the small molecule may be considered to be a cannabinoid itself.
- Two sequences are "substantially identical” if two sequences have a specified percentage of amino acid residues or nucleotides that are the same (i.e. , 60% identity, optionally 65%, 70%, 75%, 80%, 85%, 90%, 95%, or 99% identity over a specified region, or, when not specified, over the entire sequence), when compared and aligned for maximum correspondence over a comparison window, or designated region as measured using a sequence comparison algorithm or by manual alignment and visual inspection as described above.
- the identity exists over a region that is at least about 50 nucleotides (or 20 amino acids) in length, or more preferably over a region that is 100 to 500 or 1000 or more nucleotides (or 50, 100, or 200 or more amino acids) in length.
- Nucleic acid or protein sequences that are substantially identical to a reference sequence include “conservatively modified variants.” With respect to particular nucleic acid sequences, conservatively modified variants refer to those nucleic acids which encode identical or essentially identical amino acid sequences, or where the nucleic acid does not encode an amino acid sequence, to essentially identical sequences. Because of the degeneracy of the genetic code, a large number of functionally identical nucleic acids encode any given protein. For instance, the codons GCA, GCC, GCG and GCU all encode the amino acid alanine. Thus, at every position where an alanine is specified by a codon, the codon can be altered to any of the corresponding codons described without altering the encoded polypeptide.
- nucleic acid variations are "silent variations," which are one species of conservatively modified variations. Every nucleic acid sequence herein which encodes a polypeptide also describes every possible silent variation of the nucleic acid.
- each codon in a nucleic acid except AUG, which is ordinarily the only codon for methionine
- each silent variation of a nucleic acid which encodes a polypeptide is implicit in each described sequence.
- amino acid sequences one of skill will recognize that individual substitutions in a nucleic acid, peptide, polypeptide, or protein sequence which alters a single amino acid or a small percentage of amino acids in the encoded sequence is a "conservatively modified variant" where the alteration results in the substitution of an amino acid with a chemically similar amino acid. Conservative substitution tables providing functionally similar amino acids are well known in the art.
- amino acid groups defined in this manner can include: a "charged/polar group” including Glu (Glutamic acid or E), Asp (Aspartic acid or D), Asn (Asparagine or N), Gin (Glutamine or Q), Lys (Lysine or K), Arg (Arginine or R) and His (Histidine or H); an "aromatic or cyclic group” including Pro (Proline or P), Phe (Phenylalanine or F), Tyr (Tyrosine or Y) and Trp (Tryptophan or W); and an "aliphatic group” including Gly (Glycine or G), Ala (Alanine or A), Vai (Valine or V), Leu (Leucine or L), lie (Isoleucine or I), Met (Methionine or M), Ser (Serine or S), Thr (Threonine or T) and Cys (Cysteine or C).
- a "charged/polar group” including Glu (Glutamic acid
- subgroups can also be identified.
- the group of charged/polar amino acids can be sub-divided into sub-groups including: the "positively-charged subgroup” including Lys, Arg and His; the "negatively-charged sub-group” comprising Glu and Asp; and the "polar sub-group” comprising Asn and Gin.
- the aromatic or cyclic group can be sub-divided into sub-groups including: the "nitrogen ring sub-group” comprising Pro, His and Trp; and the "phenyl sub-group” comprising Phe and Tyr.
- the aliphatic group can be sub-divided into sub-groups including: the "large aliphatic non-polar sub-group” comprising Vai, Leu, and lie; the "aliphatic slightly-polar sub-group” comprising Met, Ser, Thr and Cys; and the "small-residue sub-group” comprising Gly and Ala.
- conservative mutations include amino acid substitutions of amino acids within the sub-groups above, such as, but not limited to: Lys for Arg or vice versa, such that a positive charge can be maintained; Glu for Asp or vice versa, such that a negative charge can be maintained; Ser for Thr or vice versa, such that a free -OH can be maintained; and Gin for Asn or vice versa, such that a free -NH2 can be maintained.
- the following six groups each contain amino acids that further provide illustrative conservative substitutions for one another.
- the terms “conservative mutation,” “conservative substitution,” and “conservative amino acid substitution” refer to a substitution of one or more amino acids for one or more different amino acids that exhibit similar physicochemical properties, such as polarity, electrostatic charge, and steric volume. These properties are summarized for each of the twenty naturally occurring amino acids in Table 1 , below. Table 1. Representative physicochemical properties of naturally occurring amino acids
- production generally refers to an amount of compound produced by a genetically modified host cell provided herein. In some embodiments, production is expressed as a yield of the compound by the host cell. In other embodiments, production is expressed as a productivity of the host cell in producing the compound.
- promoter refers to a synthetic or naturally derived nucleic acid that is capable of activating, increasing, or enhancing expression of a DNA coding sequence, or inactivating, decreasing, or inhibiting expression of a DNA coding sequence.
- a promoter may contain one or more specific transcriptional regulatory sequences to further enhance or repress expression and/or to alter the spatial expression and/or temporal expression of the coding sequence.
- a promoter may be positioned 5' (upstream) of the coding sequence under its control.
- a promoter may also initiate transcription in the downstream (3’) direction, the upstream (5’) direction, or be designed to initiate transcription in both the downstream (3’) and upstream (5’) directions.
- the distance between the promoter and a coding sequence to be expressed may be approximately the same as the distance between that promoter and the native nucleic acid sequence it controls. As is known in the art, variation in this distance may be accommodated without loss of promoter function.
- the term also includes a regulated promoter, which generally allows transcription of the nucleic acid sequence while in a permissive environment (e.g., microaerobic fermentation conditions, or the presence of maltose), but ceases transcription of the nucleic acid sequence while in a non-permissive environment (e.g., aerobic fermentation conditions, or in the absence of maltose). Promoters used herein can be constitutive, inducible, or repressible.
- heterologous refers to what is not normally found in nature.
- heterologous nucleic acid refers to a nucleic acid not normally found in a given cell in nature.
- a heterologous nucleic acid can be: (a) foreign to its host cell, i.e., exogenous to the host cell such that a host cell does not naturally contain the nucleic acid; (b) naturally found in the host cell, i.e., endogenous or native to the host cell, but present at an unnatural quantity in the cell (i.e., greater or lesser quantity than naturally found in the host cell); (c) be naturally found in the host cell but positioned outside of its natural locus.
- a “heterologous” polypeptide refers to a polypeptide that is encoded by a “heterologous nucleic acid.”
- a “heterologous” polypeptide may be naturally produced by a host cell but is encoded by a heterologous nucleic acid that has been introduced into the host cell by genetic engineering.
- a “heterologous” polypeptide can include embodiments in which an endogenous polypeptide is produced by an expression construct and is overexpressed in the host cell compared to native levels of the polypeptide produced by the host cell.
- the term “introducing” in the context of a nucleic acid or protein in a host cell refers to any process that results in the presence of a heterologous nucleic acid or polypeptide inside the host cell.
- the term encompasses introducing a nucleic acid molecule (e.g., a plasmid or a linear nucleic acid) that encodes the nucleic acid of interest (e.g., an RNA molecule) or polypeptide of interest and results in the transcription of the RNA molecules and translation of the polypeptides.
- the term also encompasses integrating the nucleic acid encoding the RNA molecules or polypeptides into the genome of a progenitor cell.
- nucleic acid is then passed through subsequent generations to the host cell, so that, for example, a nucleic acid encoding an RNA-guided endonuclease is “pre-integrated” into the host cell genome.
- introducing refers to translocation of a nucleic acid or polypeptide from outside the host cell to inside the host cell.
- Various methods of introducing nucleic acids, polypeptides and other biomolecules into host cells are contemplated, including but not limited to, electroporation, contact with nanowires or nanotubes, spheroplasting, PEG 1000-mediated transformation, biolistics, lithium acetate transformation, lithium chloride transformation, and the like.
- transformation refers to a genetic alteration of a host cell resulting from the introduction of exogenous genetic material, e.g., nucleic acids, into the host cell.
- mutation refers to a change in the nucleotide sequence of a gene. Mutations in a gene may occur naturally as a result of, for example, errors in DNA replication, DNA repair, irradiation, and exposure to carcinogens or mutations may be induced as a result of administration of a transgene expressing a mutant gene. Mutations may result from a single nucleotide substitution or deletion.
- oil refers to a biologically compatible hydrophobic, lipophilic, carbon-containing substance including but not limited to geologically-derived crude oil, distillate fractions of geologically-derived crude oil, vegetable oil, algal oil, microbial lipids, or synthetic oils.
- the oil is neither itself toxic to a biological molecule, a cell, a tissue, or a subject, nor does it degrade (if the oil degrades) at a rate that produces byproducts at toxic concentrations to a biological molecule, a cell, a tissue or a subject.
- Preferred examples of oils include but are not limited to avocado oil, canola oil, grapeseed oil, hemp oil, soybean oil, jojoba oil, and sunflower oil.
- operably linked refers to a functional linkage between nucleic acid sequences such that the sequences encode a desired function.
- a coding sequence for a gene of interest is in operable linkage with its promoter and/or regulatory sequences when the linked promoter and/or regulatory region functionally controls expression of the coding sequence. It also refers to the linkage between coding sequences such that they may be controlled by the same linked promoter and/or regulatory region; such linkage between coding sequences may also be referred to as being linked in frame or in the same coding frame.
- “Operably linked” also refers to a linkage of functional but non-coding sequences, such as an autonomous propagation sequence or origin of replication. Such sequences are in operable linkage when they are able to perform their normal function, e.g., enabling the replication, propagation, and/or segregation of a vector bearing the sequence in a host cell.
- the term “about” is used herein to mean a value that is ⁇ 10% of the recited value.
- the present disclosure features methods of isolating and purifying one or more cannabinoids from a fermentation composition.
- the fermentation composition may include host cells capable of producing one or more cannabinoids, such as for example, cannabichromene (CBC).
- CBC cannabichromene
- the methods described herein include using an oil overlay in combination with a cannabichromenic acid (CBCa) synthase to carry out near complete conversion of cannabigerolic acid (CBGa) to CBCa.
- CBCa cannabichromenic acid
- CBGa cannabigerolic acid
- the present disclosure additionally features variant CBCa synthase polypeptides having one or more amino acid substitutions.
- the variant CBCa synthase may have increased enzymatic activity in comparison to the wild-type enzyme.
- the disclosure provides a method of making a cannabichromenic acid (CBCa).
- the method may include (a) culturing a population of host cells capable of producing cannabigerolic acid (CBGa) in a culture medium comprising a fermentation broth and an overlay, under conditions suitable for the host cells to produce CBGa, and wherein the CBGa partitions into the overlay; (b) separating the overlay from the fermentation broth; (c) combining the separated overlay of step (b), with a CBCa synthase, thereby producing a bioconversion mixture; and (d) purifying the CBCa from the bioconversion mixture.
- CBGa cannabigerolic acid
- the demulsification may include contacting the second mixture containing the cannabinoid with an oil.
- the oil may be a mineral oil, a vegetable oil, a synthetic ester, or a fatty alcohol.
- the oil may be a vegetable oil.
- the vegetable oil is soybean oil, sunflower oil, safflower oil, canola oil, grapeseed oil, or castor oil.
- the oil is a synthetic ester, optionally wherein the synthetic ester is ESTEREXTM A51 .
- the oil includes a fatty alcohol, optionally wherein the fatty alcohol is oleyl alcohol or JARCOLTM 1-16.
- the oil may have a concentration of between about 1% (w/v) and about 10% (w/v) (e.g., about 1% (w/v), 2% (w/v), 3% (w/v), 4% (w/v), 5% (w/v), 6% (w/v), 7% (w/v), 8% (w/v), 9% (w/v), or 10% (w/v)).
- the oil may have a concentration of is about 5% (w/v).
- Th overlay further includes CBGa.
- the CBGa has a concentration of between about 0.1% (w/v) and 10% (w/v) (e.g., between about 0.1 % (w/v) and 8% (w/v), 0.1% (w/v) and 6% (w/v), 0.1% (w/v) and 4% (w/v), 0.1% (w/v) and 2% (w/v), 0.1% (w/v) and 1% (w/v), 1% (w/v) and 10% (w/v), 2% (w/v) and 10% (w/v), 4% (w/v) and 10% (w/v), 6% (w/v) and 10% (w/v), or 8% (w/v) and 10% (w/v)).
- the CBGa may have a concentration of between about 0.5% (w/v) or and 5% (w/v) (e.g., between about 0.5% (w/v) and 4% (w/v), 0.5% (w/v) and 3% (w/v), 0.5% (w/v) and 2% (w/v), 0.5% (w/v) and 1% (w/v), 1% (w/v) and 5% (w/v), 2% (w/v) and 5% (w/v), 3% (w/v) and 5% (w/v), or 4% (w/v) and 5% (w/v)).
- the method may include stirring the bioconversion mixture.
- the bioconversion mixture may be stirred may include stirring the second mixture for between 12 hours and 144 hours (e.g., between 12 hours 120 hours, 12 hours and 96 hours, 12 hours and 72 hours, 12 hours and 48 hours, 12 hours and 24 hours, 24 hours and 144 hours, 48 hours and 144 hours, 72 hours and 144 hours, 96 hours and 144 hours, or 120 hours and 144 hours) prior to purifying the CBCa from the bioconversion mixture.
- 12 hours and 144 hours e.g., between 12 hours 120 hours, 12 hours and 96 hours, 12 hours and 72 hours, 12 hours and 48 hours, 12 hours and 24 hours, 24 hours and 144 hours, 48 hours and 144 hours, 72 hours and 144 hours, 96 hours and 144 hours, or 120 hours and 144 hours
- the bioconversion mixture may be stirred for between 24 hours and 96 hours (e.g., between 24 hours and 84 hours, 24 hours and 72 hours, 24 hours and 60 hours, 24 hours and 48 hours, 48 hours and 96 hours, 60 hours and 96 hours, 72 hours and 96 hours, or 84 hours and 96 hours). In some embodiments, the bioconversion mixture may be stirred for about 48 hours.
- the bioconversion mixture may be stirred at a rate of between about 50 rotations per minute (rpm) and 300 rpm (e.g., between about 50 rpm and 250 rpm, 50 rpm and 200 rpm, 50 rpm and 150 rpm, 50 rpm and 100 rpm, 100 rpm and 250 rpm, 150 rpm and 250 rpm, or 200 rpm and 250 rpm).
- rpm rotations per minute
- 300 rpm e.g., between about 50 rpm and 250 rpm, 50 rpm and 200 rpm, 50 rpm and 150 rpm, 50 rpm and 100 rpm, 100 rpm and 250 rpm, 150 rpm and 250 rpm, or 200 rpm and 250 rpm.
- the bioconversion mixture may be stirred at a rate of about 150 rpm.
- the second mixture may include one or more amphiphilic moieties in addition to the cannabinoid.
- the one or more amphiphilic moieties may include a cyclodextrin, plant derived silica, cellulose, or a combination thereof.
- the cyclodextrin may include randomly methylated cyclodextrin, 2, 6-Di-O-methyl-p-cyclodextrin, or a combination thereof.
- the final concentration of the enzymatic composition may be from about 0.5% (w/v) to about 3% (w/v) (e.g., about 0.5% (w/v) and 2.5% (w/v), 0.5% (w/v) and 2% (w/v), 0.5% (w/v) and 1 .5% (w/v), 0.5% (w/v) and 1 % (w/v), 1 % (w/v) and 3% (w/v), 1 .5% (w/v) and 3% (w/v), 2% (w/v) and 3% (w/v), or 2.5% (w/v) and 3% (w/v)) after contacting the culture medium including a cannabinoid with the enzymatic composition.
- the enzymatic composition may have a final concentration of about 1 % (w/v).
- the demulsification may include centrifugation of the culture medium including a cannabinoid.
- the centrifugation may include liquid-liquid centrifugation, which may result in a crude oil light phase and an aqueous heavy phase.
- the variant polypeptide may include an amino acid substitution at residue V294 of SEQ ID NO: 1 .
- This substitution may include an amino acid including a polar, uncharged side chain at physiological pH.
- the amino acid substitution at residue V294 of SEQ ID NO: 1 substitutes V294 with an amino acid including an anionic side chain at physiological pH.
- the amino acid substitution at residue V294 of SEQ ID NO: 1 substitutes V294 with an amino acid including a cationic side chain at physiological pH.
- the amino acid substitution at residue V294 of SEQ ID NO: 1 may be a V294S substitution, a V294E substitution, or V294R substitution.
- the variant polypeptide may have an amino acid substitution at residue H354 of SEQ ID NO: 1 .
- the amino acid substitution at residue H354 of SEQ ID NO: 1 includes an amino acid including a hydrophobic, uncharged side chain at physiological pH.
- the amino acid substitution at residue R340 of SEQ ID NO: 1 may be a H354V substitution.
- the variant CBCa synthase may include an amino acid substitution at residue L435 of SEQ ID NO: 1 .
- the amino acid substitution at residue L435 of SEQ ID NO: 1 may substitute L435 with an amino acid including a hydrophobic, uncharged side chain at physiological pH.
- the amino acid substitution at residue L435 of SEQ ID NO: 1 may be a L435A substitution.
- the variant polypeptide may have amino acid substitutions including Q75E, T130L, S140A, V169E, N240M, V294S, A299V, T335L, R340M, H354V, L435A, K535M, and S540D of SEQ ID NO: 1 .
- the variant polypeptide may have amino acid substitutions including Q75E, F82I, T130L, V169E, N240M, V294S, A299V, K305C, T335L, R340M, L435A, Y4611, and S540D of SEQ ID NO: 1 .
- the variant polypeptide may have amino acid substitutions including Q75E, F82I, T130L, S140A, V169E, N240M, V294S, A299V, T335L, R340M, L435A, K535M, and S540D of SEQ ID NO: 1 .
- the variant polypeptide may have amino acid substitutions including F82I, T130L, S140A, V169E, N240M, V294S, A299V, K305C, H354V, L435A, K535M, and S540D of SEQ ID NO: 1 .
- the variant polypeptide may have amino acid substitutions including T130L, S140A, V169E, V294S, A299V, R340M, H354V, L435A, Y461 I, K535M, and S540D of SEQ ID NO: 1 .
- the variant polypeptide may have amino acid substitutions including Q75E, F82I, T130L, N240M, V294S, A299V, T335L, H354V, L435A, K535M, and S540D of SEQ ID NO: 1 .
- the variant polypeptide may have amino acid substitutions including Q75E, F82I, S140A, V169E, N240M, V294S, A299V, K305C, T335L, R340M, H354V, L435A, and S540D of SEQ ID NO: 1 .
- the variant polypeptide may have amino acid substitutions including Q75E, F82I, T130L, S140A, V169E, N240M, V294S, A299V, K305C, R340M, L435A, Y461 1, and S540D of SEQ ID NO: 1 .
- the variant polypeptide may have amino acid substitutions including Q75E, F82I, S140A, V169E, N240M, A299V, K305C, R340M, H354V, L435A, Y461 1, and K535M of SEQ ID NO: 1 .
- the variant polypeptide may have amino acid substitutions including Q75E, F82I, T130L, S140A, V169E, N240M, V294S, A299V, K305C, R340M, H354V, L435A, K535M, and S540D of SEQ ID NO: 1 .
- the variant polypeptide may have amino acid substitutions including Q75E, F82I, T130L, S140A, V169E, N240M, V294S, K305C, T335L, R340M, L435A, Y461 1, and S540D of SEQ ID NO: 1 .
- the variant polypeptide may have amino acid substitutions including F82I, T130L, S140A, V169E, N240M, V294S, A299V, K305C, T335L, R340M, H354V, L435A, Y461 1, and S540D of SEQ ID NO: 1 .
- the variant polypeptide may have amino acid substitutions including F82I, T130L, S140A, V169E, N240M, A299V, T335L, R340M, H354V, L435A, Y461 1, K535M, and S540D of SEQ ID NO: 1 .
- the variant polypeptide may have amino acid substitutions including Q75E, F82I, S140A, V169E, N240M, V294S, A299V, K305C, T335L, L435A, Y461 1, K535M, and S540D of SEQ ID NO: 1 .
- the CBCa synthase may have an amino acid sequence that is from about 90% to about 99.7% (e.g., about 95%, 96%, 97%, 98%, 99%, 99.5%, or 99.7%) identical to the amino acid sequence of SEQ ID NO: 1 .
- the CBCa synthase may have an amino acid sequence that is from about 95% to about 99.7% (e.g., about 95%, 95.5%, 96%, 96.5%, 97%, 97.5%, 98%, 98.5%, 99%, 99.1%, 99.2%, 99.3%, 99.4%, 99.5%, 99.6%, or 99.7%) identical to the amino acid sequence of SEQ ID NO: 1 .
- the AAE has an amino acid sequence that is at least 95% identical to the amino acid sequence of any one of SEQ ID NO: 68-91 (e.g., an amino acid sequence that is 95%, 96%, 97%, 98%, 99%, or 100% identical to any one of SEQ ID NO: 68-91 ). In some embodiments, the AAE has the amino acid sequence of any one of SEQ ID NO: 68-91 .
- the AAE has an amino acid sequence that is at least 95% identical to the amino acid sequence of any one of SEQ ID NO: 68-80 (e.g., an amino acid sequence that is 95%, 96%, 97%, 98%, 99%, or 100% identical to any one of SEQ ID NO: 68-80). In some embodiments, the AAE has the amino acid sequence of any one of SEQ ID NO: 68-80.
- the host cell contains a heterologous nucleic acid that encodes a TKS having an amino acid sequence that is at least 90% identical to the amino acid sequence of any one of SEQ ID NO: 92-95 (e.g., an amino acid sequence that is 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to any one of SEQ ID NO: 92-95).
- a heterologous nucleic acid that encodes a TKS having an amino acid sequence that is at least 90% identical to the amino acid sequence of any one of SEQ ID NO: 92-95 (e.g., an amino acid sequence that is 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to any one of SEQ ID NO: 92-95).
- Some embodiments concern a host cell that includes a heterologous CBGaS such that the host cell is capable of producing a cannabinoid.
- a CBGaS uses the olivetolic acid precursor and GPP precursor to generate cannabigerolic acid.
- the CBGaS may be from Cannabis sativa or may be an enzyme from another plant or fungal source which has been shown to have CBGaS activity in the cannabinoid biosynthetic pathway, resulting in the production of the cannabinoid cannabigerolic acid.
- the host cell contains a heterologous nucleic acid that encodes a CBGaS having an amino acid sequence that is at least 90% identical to the amino acid sequence of any one of SEQ ID NO: 127-131 (e.g., an amino acid sequence that is 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to any one of SEQ ID NO: 127-131 ).
- a heterologous nucleic acid that encodes a CBGaS having an amino acid sequence that is at least 90% identical to the amino acid sequence of any one of SEQ ID NO: 127-131 (e.g., an amino acid sequence that is 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to any one of SEQ ID NO: 127-131 ).
- the CBGaS has an amino acid sequence that is at least 95% identical to the amino acid sequence of any one of SEQ ID NO: 127-131 (e.g., an amino acid sequence that is 95%, 96%, 97%, 98%, 99%, or 100% identical to any one of SEQ ID NO: 127-131 ). In some embodiments, the CBGaS has the amino acid sequence of any one of SEQ ID NO: 127-131 .
- Some embodiments concern a host cell that includes a heterologous GPP synthase such that the host cell is capable of producing a cannabinoid.
- a GPP synthase uses the product of the isoprenoid biosynthesis pathway precursor to generate cannabigerolic acid together with a prenyltransferase enzyme.
- the GPP synthase may be from Cannabis sativa or may be an enzyme from another plant or bacterial source which has been shown to have GPP synthase activity in the cannabinoid biosynthetic pathway, resulting in the production of the cannabinoid cannabigerolic acid.
- the GPP synthase has an amino acid sequence that is at least 95% identical to the amino acid sequence of any one of SEQ ID NO: 132-137 (e.g., an amino acid sequence that is 95%, 96%, 97%, 98%, 99%, or 100% identical to any one of SEQ ID NO: 132-137). In some embodiments, the GPP synthase has the amino acid sequence of any one of SEQ ID NO: 132-137.
- the host cell contains a heterologous nucleic acid that encodes a GPP synthase having an amino acid sequence that is at least 90% identical to the amino acid sequence of SEQ ID NO: 132 (e.g., an amino acid sequence that is 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to SEQ ID NO: 132).
- the GPP synthase has an amino acid sequence that is at least 95% identical to the amino acid sequence of SEQ ID NO: 132 (e.g., an amino acid sequence that is 95%, 96%, 97%, 98%, 99%, or 100% identical to SEQ ID NO: 132).
- the GPP synthase has the amino acid sequence of SEQ ID NO: 132.
- the host cell may further express other heterologous enzymes in addition to the AAE, TKS, CBGaS, and/or GPP synthase.
- the host cell may include an olivetolic acid cyclase (OAC) as part of the cannabinoid biosynthetic pathway.
- OAC olivetolic acid cyclase
- the OAC may have an amino acid sequence that is at least 90% (e.g., at least 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%) identical to SEQ ID NO: 138.
- the OAC has an amino acid sequence of SEQ ID NO: 138.
- the host cell may include a heterologous nucleic acid that encodes at least one enzyme from the mevalonate biosynthetic pathway.
- Enzymes which make up the mevalonate biosynthetic pathway may include but are not limited to an acetyl-CoA thiolase, an HMG-CoA synthase, an HMG-CoA reductase, a mevalonate kinase, a phosphomevalonate kinase, a mevalonate pyrophosphate decarboxylase, and an IPP:DMAPP isomerase.
- the host cell expresses a heterologous acetyl-CoA synthase having an amino acid sequence that is at least 90% (e.g., at least 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%) identical to the amino acid sequence of SEQ ID NO: 140.
- the acetyl-CoA synthase has the amino acid sequence of SEQ ID NO: 140.
- the aldehyde dehydrogenase has an amino acid sequence that is at least 90% (e.g., at least 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%) identical to the amino acid sequence of SEQ ID NO: 141 . In some embodiments, the aldehyde dehydrogenase has the amino acid sequence of SEQ ID NO: 141 . In some embodiments, the pyruvate dehydrogenase has an amino acid sequence that is at least 90% (e.g., at least 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%) identical to the amino acid sequence of SEQ ID NO: 142. In some embodiments, the pyruvate decarboxylase has an amino acid sequence of SEQ ID NO: 142.
- any one of the polypeptide sequences disclosed herein may be encoded by DNA molecules of any sequence that encode the amino acid sequences of the polypeptides and proteins of the enzymes utilized in the methods of the disclosure.
- a polypeptide can typically tolerate one or more amino acid substitutions, deletions, and insertions in its amino acid sequence without loss or significant loss of a desired activity.
- the disclosure includes such polypeptides with different amino acid sequences than the specific proteins described herein so long as the modified or variant polypeptides have the enzymatic anabolic or catabolic activity of the reference polypeptide.
- the amino acid sequences encoded by the DNA sequences shown herein merely illustrate embodiments of the disclosure.
- homologs of enzymes useful for the compositions and methods provided herein are encompassed by the disclosure.
- two proteins are substantially homologous when the amino acid sequences have at least about 30%, 40%, 50% 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity.
- Sequence homology for polypeptides is typically measured using sequence analysis software.
- a typical algorithm used for comparing a molecule sequence to a database containing a large number of sequences from different organisms is the computer program BLAST. When searching a database containing sequences from a large number of different organisms, it is typical to compare amino acid sequences.
- any of the genes encoding the foregoing enzymes may be optimized by genetic/protein engineering techniques, such as directed evolution or rational mutagenesis, which are known to those of ordinary skill in the art. Such action allows those of ordinary skill in the art to optimize the enzymes for expression and activity in a host cell, for example, a yeast.
- the genetic regulatory element is a nucleic acid sequence, such as a promoter.
- yeast strains useful in the present methods include yeasts that have been deposited with microorganism depositories (e.g. IFO, ATCC, etc.) and belong to the genera Aciculoconidium, Ambrosiozyma, Arthroascus, Arxiozyma, Ashbya, Babjevia, Bensingtonia, Botryoascus, Botryozyma, Brettanomyces, Bullera, Bulleromyces, Candida, Citeromyces, Clavispora, Cryptococcus, Cystofilobasidium, Debaryomyces, Dekkara, Dipodascopsis, Dipodascus, Eeniella, Endomycopsella, Eremascus, Eremothecium, Erythrobasidium, Fellomyces, Filobasidium, Galactomyces, Geotrichum, Guilliermondella, Hanseniaspora, Hansenula, Hasegawaea, Holtermannia
- the concentration of a carbon source, such as glucose or sucrose, in the culture medium is typically less than about 100 g/L, preferably less than about 50 g/L, and more preferably less than about 20 g/L. It should be noted that references to culture component concentrations can refer to both initial and/or ongoing component concentrations. In some cases, it may be desirable to allow the culture medium to become depleted of a carbon source during culture.
- Sources of assimilable nitrogen that can be used in a suitable culture medium include, but are not limited to, simple nitrogen sources, organic nitrogen sources and complex nitrogen sources. Such nitrogen sources include anhydrous ammonia, ammonium salts and substances of animal, vegetable and/or microbial origin. Suitable nitrogen sources include, but are not limited to, protein hydrolysates, microbial biomass hydrolysates, peptone, yeast extract, ammonium sulfate, urea, and amino acids. Typically, the concentration of the nitrogen sources in the culture medium is greater than about 0.1 g/L, preferably greater than about 0.25 g/L, and more preferably greater than about 1 .0 g/L.
- the culture medium can also include a biologically acceptable calcium source, including, but not limited to, calcium chloride.
- a biologically acceptable calcium source including, but not limited to, calcium chloride.
- concentration of the calcium source, such as calcium chloride, dihydrate, in the culture medium is within the range of from about 5 mg/L to about 2000 mg/L, preferably within the range of from about 20 mg/L to about 1000 mg/L, and more preferably in the range of from about 50 mg/L to about 500 mg/L.
- the culture medium can also include sodium chloride.
- the concentration of sodium chloride in the culture medium is within the range of from about 0.1 g/L to about 5 g/L, preferably within the range of from about 1 g/L to about 4 g/L, and more preferably in the range of from about 2 g/L to about 4 g/L.
- Example 1 Large scale bioconversion of cannabigerolic acid to cannabichromenic acid
- WGEKYFGKNFNRLVKVKTKADPNNFFRNEQSIPPLPPHHH* SEQ ID NO: 51 - T130L/S140A/V169E/V294S/A299V/R340M/H354V/L435A/Y4611/K535M/S540D
Landscapes
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
L'invention concerne des procédés de production et d'isolement d'un cannabinoïde à partir d'une composition de fermentation comprenant des cellules hôtes capables de produire un cannabinoïde tel que par combinaison d'un mélange comprenant un acide cannabigérolique avec une synthase d'acide cannabichroménique (CBCa), éventuellement dans un mélange d'huile. L'invention concerne également des synthases CBCa ayant une activité améliorée par rapport à l'enzyme de type sauvage.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202263431560P | 2022-12-09 | 2022-12-09 | |
US63/431,560 | 2022-12-09 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2024124165A2 true WO2024124165A2 (fr) | 2024-06-13 |
Family
ID=91380247
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2023/083173 WO2024124165A2 (fr) | 2022-12-09 | 2023-12-08 | Procédés et compositions de purification de cannabinoïdes |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2024124165A2 (fr) |
-
2023
- 2023-12-08 WO PCT/US2023/083173 patent/WO2024124165A2/fr unknown
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11939613B2 (en) | Production of cannabinoids in yeast | |
US11767533B2 (en) | Compositions and methods for production of myrcene | |
EP4200426A1 (fr) | Production microbienne de cannabinoïdes | |
WO2023288187A9 (fr) | Production à haut rendement d'acide cannabidiolique | |
JP7487099B2 (ja) | レバウジオシドの高効率生成のためのエンドウ(pisum sativum)カウレンオキシダーゼ | |
WO2023288188A1 (fr) | Production à haut rendement d'acide cannabigérolique et d'acide cannabidiolique | |
US20220325313A1 (en) | Biosynthesis of alpha-ionone and beta-ionone | |
US20220290194A1 (en) | Cannabidiolic acid synthase variants with improved activity for use in production of phytocannabinoids | |
WO2021034847A1 (fr) | Production de cannabinoïdes | |
EP3947658A1 (fr) | Production de cannabinoïdes | |
EP4370684A2 (fr) | Nouvelles enzymes pour la production d'acétate de gamma-ambryle | |
WO2024124165A2 (fr) | Procédés et compositions de purification de cannabinoïdes | |
US20230051453A1 (en) | Biosynthetic platform for the production of olivetolic acid and analogues of olivetolic acid | |
US20220127620A1 (en) | Microbial production of compounds | |
WO2022256697A1 (fr) | Procédés de purification de cannabinoïdes | |
WO2022256691A1 (fr) | Procédés de purification de cannabinoïde | |
EP4370671A1 (fr) | Nouvelles enzymes pour la production d'e-copalol | |
CA3237656A1 (fr) | Voie de biosynthese optimisee pour la biosynthese des cannabinoides |