WO2024033303A1 - Synthetic formolase pathway - Google Patents
Synthetic formolase pathway Download PDFInfo
- Publication number
- WO2024033303A1 WO2024033303A1 PCT/EP2023/071821 EP2023071821W WO2024033303A1 WO 2024033303 A1 WO2024033303 A1 WO 2024033303A1 EP 2023071821 W EP2023071821 W EP 2023071821W WO 2024033303 A1 WO2024033303 A1 WO 2024033303A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- cell
- seq
- fls
- acid
- protein
- Prior art date
Links
- 230000037361 pathway Effects 0.000 title claims abstract description 65
- WSFSSNUMVMOOMR-UHFFFAOYSA-N Formaldehyde Chemical compound O=C WSFSSNUMVMOOMR-UHFFFAOYSA-N 0.000 claims abstract description 107
- RXKJFZQQPQGTFL-UHFFFAOYSA-N dihydroxyacetone Chemical compound OCC(=O)CO RXKJFZQQPQGTFL-UHFFFAOYSA-N 0.000 claims abstract description 65
- 229940120503 dihydroxyacetone Drugs 0.000 claims abstract description 31
- 210000003527 eukaryotic cell Anatomy 0.000 claims abstract description 14
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 claims description 182
- 210000004027 cell Anatomy 0.000 claims description 173
- 230000014509 gene expression Effects 0.000 claims description 105
- 108090000623 proteins and genes Proteins 0.000 claims description 98
- 102000004169 proteins and genes Human genes 0.000 claims description 58
- 235000018102 proteins Nutrition 0.000 claims description 57
- 230000035772 mutation Effects 0.000 claims description 46
- 150000007523 nucleic acids Chemical class 0.000 claims description 45
- 241000235058 Komagataella pastoris Species 0.000 claims description 41
- 229910052799 carbon Inorganic materials 0.000 claims description 41
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 claims description 37
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims description 37
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 claims description 37
- 102000004190 Enzymes Human genes 0.000 claims description 36
- 108090000790 Enzymes Proteins 0.000 claims description 36
- 108010085186 Peroxisomal Targeting Signals Proteins 0.000 claims description 36
- 102000039446 nucleic acids Human genes 0.000 claims description 34
- 108020004707 nucleic acids Proteins 0.000 claims description 34
- 150000001875 compounds Chemical class 0.000 claims description 33
- 150000001413 amino acids Chemical group 0.000 claims description 31
- 210000002824 peroxisome Anatomy 0.000 claims description 26
- 238000000034 method Methods 0.000 claims description 25
- 102220085897 rs140581841 Human genes 0.000 claims description 24
- 102000037865 fusion proteins Human genes 0.000 claims description 22
- 108020001507 fusion proteins Proteins 0.000 claims description 22
- 238000004113 cell culture Methods 0.000 claims description 21
- CURLTUGMZLYLDI-UHFFFAOYSA-N Carbon dioxide Chemical compound O=C=O CURLTUGMZLYLDI-UHFFFAOYSA-N 0.000 claims description 20
- 239000002028 Biomass Substances 0.000 claims description 17
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 17
- 239000002207 metabolite Substances 0.000 claims description 14
- 108010025188 Alcohol oxidase Proteins 0.000 claims description 12
- 235000001014 amino acid Nutrition 0.000 claims description 11
- 241001099156 Komagataella phaffii Species 0.000 claims description 10
- 229910002092 carbon dioxide Inorganic materials 0.000 claims description 10
- 230000001105 regulatory effect Effects 0.000 claims description 10
- 101150067325 DAS1 gene Proteins 0.000 claims description 8
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 claims description 8
- 101100008874 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) DAS2 gene Proteins 0.000 claims description 8
- 101100516268 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) NDT80 gene Proteins 0.000 claims description 8
- 241000196324 Embryophyta Species 0.000 claims description 7
- 241000235648 Pichia Species 0.000 claims description 7
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 claims description 6
- 241000282414 Homo sapiens Species 0.000 claims description 6
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 claims description 6
- 238000012258 culturing Methods 0.000 claims description 6
- VNWKTOKETHGBQD-UHFFFAOYSA-N methane Chemical compound C VNWKTOKETHGBQD-UHFFFAOYSA-N 0.000 claims description 6
- 210000005253 yeast cell Anatomy 0.000 claims description 6
- 108010048916 alcohol dehydrogenase (acceptor) Proteins 0.000 claims description 5
- 230000003115 biocidal effect Effects 0.000 claims description 5
- 150000001720 carbohydrates Chemical class 0.000 claims description 5
- 235000014633 carbohydrates Nutrition 0.000 claims description 5
- 238000012217 deletion Methods 0.000 claims description 5
- 230000037430 deletion Effects 0.000 claims description 5
- 210000003463 organelle Anatomy 0.000 claims description 5
- GHOKWGTUZJEAQD-ZETCQYMHSA-N (D)-(+)-Pantothenic acid Chemical compound OCC(C)(C)[C@@H](O)C(=O)NCCC(O)=O GHOKWGTUZJEAQD-ZETCQYMHSA-N 0.000 claims description 4
- KBPLFHHGFOOTCA-UHFFFAOYSA-N 1-Octanol Chemical compound CCCCCCCCO KBPLFHHGFOOTCA-UHFFFAOYSA-N 0.000 claims description 4
- BBMCTIGTTCKYKF-UHFFFAOYSA-N 1-heptanol Chemical compound CCCCCCCO BBMCTIGTTCKYKF-UHFFFAOYSA-N 0.000 claims description 4
- ALRHLSYJTWAHJZ-UHFFFAOYSA-N 3-hydroxypropionic acid Chemical compound OCCC(O)=O ALRHLSYJTWAHJZ-UHFFFAOYSA-N 0.000 claims description 4
- JOOXCMJARBKPKM-UHFFFAOYSA-N 4-oxopentanoic acid Chemical compound CC(=O)CCC(O)=O JOOXCMJARBKPKM-UHFFFAOYSA-N 0.000 claims description 4
- CIWBSHSKHKDKBQ-JLAZNSOCSA-N Ascorbic acid Chemical compound OC[C@H](O)[C@H]1OC(=O)C(O)=C1O CIWBSHSKHKDKBQ-JLAZNSOCSA-N 0.000 claims description 4
- FERIUCNNQQJTOY-UHFFFAOYSA-N Butyric acid Chemical compound CCCC(O)=O FERIUCNNQQJTOY-UHFFFAOYSA-N 0.000 claims description 4
- 241000222120 Candida <Saccharomycetales> Species 0.000 claims description 4
- UGFAIRIUMAVXCW-UHFFFAOYSA-N Carbon monoxide Chemical compound [O+]#[C-] UGFAIRIUMAVXCW-UHFFFAOYSA-N 0.000 claims description 4
- 241000192700 Cyanobacteria Species 0.000 claims description 4
- RGHNJXZEOKUKBD-SQOUGZDYSA-N D-gluconic acid Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@@H](O)C(O)=O RGHNJXZEOKUKBD-SQOUGZDYSA-N 0.000 claims description 4
- VZCYOOQTPOCHFL-OWOJBTEDSA-N Fumaric acid Chemical compound OC(=O)\C=C\C(O)=O VZCYOOQTPOCHFL-OWOJBTEDSA-N 0.000 claims description 4
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 claims description 4
- LRHPLDYGYMQRHN-UHFFFAOYSA-N N-Butanol Chemical compound CCCCO LRHPLDYGYMQRHN-UHFFFAOYSA-N 0.000 claims description 4
- AMQJEAYHLZJPGS-UHFFFAOYSA-N N-Pentanol Chemical compound CCCCCO AMQJEAYHLZJPGS-UHFFFAOYSA-N 0.000 claims description 4
- 241000320412 Ogataea angusta Species 0.000 claims description 4
- AUNGANRZJHBGPY-SCRDCRAPSA-N Riboflavin Chemical compound OC[C@@H](O)[C@@H](O)[C@@H](O)CN1C=2C=C(C)C(C)=CC=2N=C2C1=NC(=O)NC2=O AUNGANRZJHBGPY-SCRDCRAPSA-N 0.000 claims description 4
- 241000223259 Trichoderma Species 0.000 claims description 4
- WNLRTRBMVRJNCN-UHFFFAOYSA-N adipic acid Chemical compound OC(=O)CCCCC(O)=O WNLRTRBMVRJNCN-UHFFFAOYSA-N 0.000 claims description 4
- 239000001569 carbon dioxide Substances 0.000 claims description 4
- 229910002091 carbon monoxide Inorganic materials 0.000 claims description 4
- XBDQKXXYIPTUBI-UHFFFAOYSA-N dimethylselenoniopropionate Natural products CCC(O)=O XBDQKXXYIPTUBI-UHFFFAOYSA-N 0.000 claims description 4
- OVBPIULPVIDEAO-LBPRGKRZSA-N folic acid Chemical compound C=1N=C2NC(N)=NC(=O)C2=NC=1CNC1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 OVBPIULPVIDEAO-LBPRGKRZSA-N 0.000 claims description 4
- CHTHALBTIRVDBM-UHFFFAOYSA-N furan-2,5-dicarboxylic acid Chemical compound OC(=O)C1=CC=C(C(O)=O)O1 CHTHALBTIRVDBM-UHFFFAOYSA-N 0.000 claims description 4
- ZSIAUFGUXNUGDI-UHFFFAOYSA-N hexan-1-ol Chemical compound CCCCCCO ZSIAUFGUXNUGDI-UHFFFAOYSA-N 0.000 claims description 4
- JVTAAEKCZFNVCJ-UHFFFAOYSA-N lactic acid Chemical compound CC(O)C(O)=O JVTAAEKCZFNVCJ-UHFFFAOYSA-N 0.000 claims description 4
- HEBKCHPVOIAQTA-UHFFFAOYSA-N meso ribitol Natural products OCC(O)C(O)C(O)CO HEBKCHPVOIAQTA-UHFFFAOYSA-N 0.000 claims description 4
- DNIAPMSPPWPWGF-UHFFFAOYSA-N monopropylene glycol Natural products CC(O)CO DNIAPMSPPWPWGF-UHFFFAOYSA-N 0.000 claims description 4
- LXNHXLLTXMVWPM-UHFFFAOYSA-N pyridoxine Chemical compound CC1=NC=C(CO)C(CO)=C1O LXNHXLLTXMVWPM-UHFFFAOYSA-N 0.000 claims description 4
- NQPDZGIKBAWPEJ-UHFFFAOYSA-N valeric acid Chemical compound CCCCC(O)=O NQPDZGIKBAWPEJ-UHFFFAOYSA-N 0.000 claims description 4
- 241000195493 Cryptophyta Species 0.000 claims description 3
- BDAGIHXWWSANSR-UHFFFAOYSA-M Formate Chemical compound [O-]C=O BDAGIHXWWSANSR-UHFFFAOYSA-M 0.000 claims description 3
- 241000238631 Hexapoda Species 0.000 claims description 3
- 241001099157 Komagataella Species 0.000 claims description 3
- 241000428705 Komagataella pseudopastoris Species 0.000 claims description 3
- 241001112159 Ogataea Species 0.000 claims description 3
- 229960002685 biotin Drugs 0.000 claims description 3
- 235000020958 biotin Nutrition 0.000 claims description 3
- 239000011616 biotin Substances 0.000 claims description 3
- 210000003470 mitochondria Anatomy 0.000 claims description 3
- 150000007524 organic acids Chemical class 0.000 claims description 3
- 229960004063 propylene glycol Drugs 0.000 claims description 3
- DNIAPMSPPWPWGF-VKHMYHEASA-N (+)-propylene glycol Chemical compound C[C@H](O)CO DNIAPMSPPWPWGF-VKHMYHEASA-N 0.000 claims description 2
- DNIAPMSPPWPWGF-GSVOUGTGSA-N (R)-(-)-Propylene glycol Chemical compound C[C@@H](O)CO DNIAPMSPPWPWGF-GSVOUGTGSA-N 0.000 claims description 2
- BJEPYKJPYRNKOW-REOHCLBHSA-N (S)-malic acid Chemical compound OC(=O)[C@@H](O)CC(O)=O BJEPYKJPYRNKOW-REOHCLBHSA-N 0.000 claims description 2
- 229940035437 1,3-propanediol Drugs 0.000 claims description 2
- YPFDHNVEDLHUCE-UHFFFAOYSA-N 1,3-propanediol Substances OCCCO YPFDHNVEDLHUCE-UHFFFAOYSA-N 0.000 claims description 2
- SMZOUWXMTYCWNB-UHFFFAOYSA-N 2-(2-methoxy-5-methylphenyl)ethanamine Chemical compound COC1=CC=C(C)C=C1CCN SMZOUWXMTYCWNB-UHFFFAOYSA-N 0.000 claims description 2
- JAHNSTQSQJOJLO-UHFFFAOYSA-N 2-(3-fluorophenyl)-1h-imidazole Chemical compound FC1=CC=CC(C=2NC=CN=2)=C1 JAHNSTQSQJOJLO-UHFFFAOYSA-N 0.000 claims description 2
- NIXOWILDQLNWCW-UHFFFAOYSA-N 2-Propenoic acid Natural products OC(=O)C=C NIXOWILDQLNWCW-UHFFFAOYSA-N 0.000 claims description 2
- KJJPLEZQSCZCKE-UHFFFAOYSA-N 2-aminopropane-1,3-diol Chemical compound OCC(N)CO KJJPLEZQSCZCKE-UHFFFAOYSA-N 0.000 claims description 2
- AKXKFZDCRYJKTF-UHFFFAOYSA-N 3-Hydroxypropionaldehyde Chemical compound OCCC=O AKXKFZDCRYJKTF-UHFFFAOYSA-N 0.000 claims description 2
- WHBMMWSBFZVSSR-UHFFFAOYSA-M 3-hydroxybutyrate Chemical compound CC(O)CC([O-])=O WHBMMWSBFZVSSR-UHFFFAOYSA-M 0.000 claims description 2
- FUDDLSHBRSNCBV-UHFFFAOYSA-N 4-hydroxyoxolan-2-one Chemical compound OC1COC(=O)C1 FUDDLSHBRSNCBV-UHFFFAOYSA-N 0.000 claims description 2
- 241000611184 Amphora Species 0.000 claims description 2
- 241000219194 Arabidopsis Species 0.000 claims description 2
- 240000002900 Arthrospira platensis Species 0.000 claims description 2
- 235000016425 Arthrospira platensis Nutrition 0.000 claims description 2
- 241000228212 Aspergillus Species 0.000 claims description 2
- 241001513093 Aspergillus awamori Species 0.000 claims description 2
- 241000351920 Aspergillus nidulans Species 0.000 claims description 2
- 241000228245 Aspergillus niger Species 0.000 claims description 2
- 240000006439 Aspergillus oryzae Species 0.000 claims description 2
- 235000002247 Aspergillus oryzae Nutrition 0.000 claims description 2
- 241001465318 Aspergillus terreus Species 0.000 claims description 2
- 241001467606 Bacillariophyceae Species 0.000 claims description 2
- 241000283690 Bos taurus Species 0.000 claims description 2
- WWZKQHOCKIZLMA-UHFFFAOYSA-N Caprylic acid Natural products CCCCCCCC(O)=O WWZKQHOCKIZLMA-UHFFFAOYSA-N 0.000 claims description 2
- GHOKWGTUZJEAQD-UHFFFAOYSA-N Chick antidermatitis factor Natural products OCC(C)(C)C(O)C(=O)NCCC(O)=O GHOKWGTUZJEAQD-UHFFFAOYSA-N 0.000 claims description 2
- 241000195585 Chlamydomonas Species 0.000 claims description 2
- 241000195649 Chlorella <Chlorellales> Species 0.000 claims description 2
- 241000699800 Cricetinae Species 0.000 claims description 2
- 241000699802 Cricetulus griseus Species 0.000 claims description 2
- 102000004127 Cytokines Human genes 0.000 claims description 2
- 108090000695 Cytokines Proteins 0.000 claims description 2
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 claims description 2
- AUNGANRZJHBGPY-UHFFFAOYSA-N D-Lyxoflavin Natural products OCC(O)C(O)C(O)CN1C=2C=C(C)C(C)=CC=2N=C2C1=NC(=O)NC2=O AUNGANRZJHBGPY-UHFFFAOYSA-N 0.000 claims description 2
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 claims description 2
- HEBKCHPVOIAQTA-QWWZWVQMSA-N D-arabinitol Chemical compound OC[C@@H](O)C(O)[C@H](O)CO HEBKCHPVOIAQTA-QWWZWVQMSA-N 0.000 claims description 2
- ZZZCUOFIHGPKAK-UHFFFAOYSA-N D-erythro-ascorbic acid Natural products OCC1OC(=O)C(O)=C1O ZZZCUOFIHGPKAK-UHFFFAOYSA-N 0.000 claims description 2
- DSLZVSRJTYRBFB-LLEIAEIESA-N D-glucaric acid Chemical compound OC(=O)[C@@H](O)[C@@H](O)[C@H](O)[C@@H](O)C(O)=O DSLZVSRJTYRBFB-LLEIAEIESA-N 0.000 claims description 2
- FBPFZTCFMRRESA-JGWLITMVSA-N D-glucitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-JGWLITMVSA-N 0.000 claims description 2
- RGHNJXZEOKUKBD-UHFFFAOYSA-N D-gluconic acid Natural products OCC(O)C(O)C(O)C(O)C(O)=O RGHNJXZEOKUKBD-UHFFFAOYSA-N 0.000 claims description 2
- 241000195634 Dunaliella Species 0.000 claims description 2
- 241000233866 Fungi Species 0.000 claims description 2
- 241000159512 Geotrichum Species 0.000 claims description 2
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 claims description 2
- 244000068988 Glycine max Species 0.000 claims description 2
- 235000010469 Glycine max Nutrition 0.000 claims description 2
- 241000235649 Kluyveromyces Species 0.000 claims description 2
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 claims description 2
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 claims description 2
- 235000007688 Lycopersicon esculentum Nutrition 0.000 claims description 2
- 240000003183 Manihot esculenta Species 0.000 claims description 2
- 235000016735 Manihot esculenta subsp esculenta Nutrition 0.000 claims description 2
- 229930195725 Mannitol Natural products 0.000 claims description 2
- 240000004658 Medicago sativa Species 0.000 claims description 2
- 235000017587 Medicago sativa ssp. sativa Nutrition 0.000 claims description 2
- OVBPIULPVIDEAO-UHFFFAOYSA-N N-Pteroyl-L-glutaminsaeure Natural products C=1N=C2NC(N)=NC(=O)C2=NC=1CNC1=CC=C(C(=O)NC(CCC(O)=O)C(O)=O)C=C1 OVBPIULPVIDEAO-UHFFFAOYSA-N 0.000 claims description 2
- 241000224474 Nannochloropsis Species 0.000 claims description 2
- 241000221960 Neurospora Species 0.000 claims description 2
- 241000221961 Neurospora crassa Species 0.000 claims description 2
- PVNIIMVLHYAWGP-UHFFFAOYSA-N Niacin Chemical compound OC(=O)C1=CC=CN=C1 PVNIIMVLHYAWGP-UHFFFAOYSA-N 0.000 claims description 2
- 244000061176 Nicotiana tabacum Species 0.000 claims description 2
- 235000002637 Nicotiana tabacum Nutrition 0.000 claims description 2
- 241000199478 Ochromonas Species 0.000 claims description 2
- 241001489174 Ogataea minuta Species 0.000 claims description 2
- 240000007594 Oryza sativa Species 0.000 claims description 2
- 235000007164 Oryza sativa Nutrition 0.000 claims description 2
- 241000228143 Penicillium Species 0.000 claims description 2
- 241000228150 Penicillium chrysogenum Species 0.000 claims description 2
- 241000195887 Physcomitrella patens Species 0.000 claims description 2
- 206010035226 Plasma cell myeloma Diseases 0.000 claims description 2
- WHBMMWSBFZVSSR-UHFFFAOYSA-N R3HBA Natural products CC(O)CC(O)=O WHBMMWSBFZVSSR-UHFFFAOYSA-N 0.000 claims description 2
- 241000235527 Rhizopus Species 0.000 claims description 2
- 240000005384 Rhizopus oryzae Species 0.000 claims description 2
- 235000013752 Rhizopus oryzae Nutrition 0.000 claims description 2
- 241000283984 Rodentia Species 0.000 claims description 2
- 241000235070 Saccharomyces Species 0.000 claims description 2
- 235000005775 Setaria Nutrition 0.000 claims description 2
- 241000232088 Setaria <nematode> Species 0.000 claims description 2
- 240000003768 Solanum lycopersicum Species 0.000 claims description 2
- 244000061456 Solanum tuberosum Species 0.000 claims description 2
- 235000002595 Solanum tuberosum Nutrition 0.000 claims description 2
- 101710172711 Structural protein Proteins 0.000 claims description 2
- KDYFGRWQOYBRFD-UHFFFAOYSA-N Succinic acid Natural products OC(=O)CCC(O)=O KDYFGRWQOYBRFD-UHFFFAOYSA-N 0.000 claims description 2
- JZRWCGZRTZMZEH-UHFFFAOYSA-N Thiamine Natural products CC1=C(CCO)SC=[N+]1CC1=CN=C(C)N=C1N JZRWCGZRTZMZEH-UHFFFAOYSA-N 0.000 claims description 2
- 235000021307 Triticum Nutrition 0.000 claims description 2
- 244000098338 Triticum aestivum Species 0.000 claims description 2
- 229930003268 Vitamin C Natural products 0.000 claims description 2
- TVXBFESIOXBWNM-UHFFFAOYSA-N Xylitol Natural products OCCC(O)C(O)C(O)CCO TVXBFESIOXBWNM-UHFFFAOYSA-N 0.000 claims description 2
- 241000235013 Yarrowia Species 0.000 claims description 2
- 241000235015 Yarrowia lipolytica Species 0.000 claims description 2
- 240000008042 Zea mays Species 0.000 claims description 2
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 claims description 2
- 235000002017 Zea mays subsp mays Nutrition 0.000 claims description 2
- 239000001361 adipic acid Substances 0.000 claims description 2
- 235000011037 adipic acid Nutrition 0.000 claims description 2
- 150000001299 aldehydes Chemical class 0.000 claims description 2
- BJEPYKJPYRNKOW-UHFFFAOYSA-N alpha-hydroxysuccinic acid Natural products OC(=O)C(O)CC(O)=O BJEPYKJPYRNKOW-UHFFFAOYSA-N 0.000 claims description 2
- 150000001412 amines Chemical class 0.000 claims description 2
- 239000000427 antigen Substances 0.000 claims description 2
- 102000025171 antigen binding proteins Human genes 0.000 claims description 2
- 108091000831 antigen binding proteins Proteins 0.000 claims description 2
- 102000036639 antigens Human genes 0.000 claims description 2
- 108091007433 antigens Proteins 0.000 claims description 2
- 235000003704 aspartic acid Nutrition 0.000 claims description 2
- GONOPSZTUGRENK-UHFFFAOYSA-N benzyl(trichloro)silane Chemical compound Cl[Si](Cl)(Cl)CC1=CC=CC=C1 GONOPSZTUGRENK-UHFFFAOYSA-N 0.000 claims description 2
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 claims description 2
- CDQSJQSWAWPGKG-UHFFFAOYSA-N butane-1,1-diol Chemical compound CCCC(O)O CDQSJQSWAWPGKG-UHFFFAOYSA-N 0.000 claims description 2
- KDYFGRWQOYBRFD-NUQCWPJISA-N butanedioic acid Chemical compound O[14C](=O)CC[14C](O)=O KDYFGRWQOYBRFD-NUQCWPJISA-N 0.000 claims description 2
- 235000015165 citric acid Nutrition 0.000 claims description 2
- GVJHHUAWPYXKBD-UHFFFAOYSA-N d-alpha-tocopherol Natural products OC1=C(C)C(C)=C2OC(CCCC(C)CCCC(C)CCCC(C)C)(C)CCC2=C1C GVJHHUAWPYXKBD-UHFFFAOYSA-N 0.000 claims description 2
- 229960000304 folic acid Drugs 0.000 claims description 2
- 235000019152 folic acid Nutrition 0.000 claims description 2
- 239000011724 folic acid Substances 0.000 claims description 2
- 239000001530 fumaric acid Substances 0.000 claims description 2
- 235000011087 fumaric acid Nutrition 0.000 claims description 2
- 102000034356 gene-regulatory proteins Human genes 0.000 claims description 2
- 108091006104 gene-regulatory proteins Proteins 0.000 claims description 2
- 239000000174 gluconic acid Substances 0.000 claims description 2
- 235000012208 gluconic acid Nutrition 0.000 claims description 2
- 239000004220 glutamic acid Substances 0.000 claims description 2
- 235000013922 glutamic acid Nutrition 0.000 claims description 2
- 239000003102 growth factor Substances 0.000 claims description 2
- NAQMVNRVTILPCV-UHFFFAOYSA-N hexane-1,6-diamine Chemical compound NCCCCCCN NAQMVNRVTILPCV-UHFFFAOYSA-N 0.000 claims description 2
- 239000005556 hormone Substances 0.000 claims description 2
- 229940088597 hormone Drugs 0.000 claims description 2
- 150000002576 ketones Chemical class 0.000 claims description 2
- 210000003292 kidney cell Anatomy 0.000 claims description 2
- 239000004310 lactic acid Substances 0.000 claims description 2
- 235000014655 lactic acid Nutrition 0.000 claims description 2
- 229940040102 levulinic acid Drugs 0.000 claims description 2
- 235000009973 maize Nutrition 0.000 claims description 2
- 239000001630 malic acid Substances 0.000 claims description 2
- 235000011090 malic acid Nutrition 0.000 claims description 2
- 239000000594 mannitol Substances 0.000 claims description 2
- 235000010355 mannitol Nutrition 0.000 claims description 2
- WSFSSNUMVMOOMR-NJFSPNSNSA-N methanone Chemical compound O=[14CH2] WSFSSNUMVMOOMR-NJFSPNSNSA-N 0.000 claims description 2
- LVHBHZANLOWSRM-UHFFFAOYSA-N methylenebutanedioic acid Natural products OC(=O)CC(=C)C(O)=O LVHBHZANLOWSRM-UHFFFAOYSA-N 0.000 claims description 2
- 201000000050 myeloid neoplasm Diseases 0.000 claims description 2
- FUZZWVXGSFPDMH-UHFFFAOYSA-N n-hexanoic acid Natural products CCCCCC(O)=O FUZZWVXGSFPDMH-UHFFFAOYSA-N 0.000 claims description 2
- 235000001968 nicotinic acid Nutrition 0.000 claims description 2
- 229960003512 nicotinic acid Drugs 0.000 claims description 2
- 239000011664 nicotinic acid Substances 0.000 claims description 2
- 230000000174 oncolytic effect Effects 0.000 claims description 2
- 210000001672 ovary Anatomy 0.000 claims description 2
- 229940055726 pantothenic acid Drugs 0.000 claims description 2
- 235000019161 pantothenic acid Nutrition 0.000 claims description 2
- 239000011713 pantothenic acid Substances 0.000 claims description 2
- GPCKFIWBUTWTDH-UHFFFAOYSA-N pentane-3,3-diamine Chemical compound CCC(N)(N)CC GPCKFIWBUTWTDH-UHFFFAOYSA-N 0.000 claims description 2
- 229920000070 poly-3-hydroxybutyrate Polymers 0.000 claims description 2
- 229920000166 polytrimethylene carbonate Polymers 0.000 claims description 2
- 230000008569 process Effects 0.000 claims description 2
- BDERNNFJNOPAEC-UHFFFAOYSA-N propan-1-ol Chemical compound CCCO BDERNNFJNOPAEC-UHFFFAOYSA-N 0.000 claims description 2
- 235000019260 propionic acid Nutrition 0.000 claims description 2
- 235000013772 propylene glycol Nutrition 0.000 claims description 2
- 235000008160 pyridoxine Nutrition 0.000 claims description 2
- 239000011677 pyridoxine Substances 0.000 claims description 2
- IUVKMZGDUIUOCP-BTNSXGMBSA-N quinbolone Chemical compound O([C@H]1CC[C@H]2[C@H]3[C@@H]([C@]4(C=CC(=O)C=C4CC3)C)CC[C@@]21C)C1=CCCC1 IUVKMZGDUIUOCP-BTNSXGMBSA-N 0.000 claims description 2
- 235000019192 riboflavin Nutrition 0.000 claims description 2
- 229960002477 riboflavin Drugs 0.000 claims description 2
- 239000002151 riboflavin Substances 0.000 claims description 2
- 235000009566 rice Nutrition 0.000 claims description 2
- 239000000600 sorbitol Substances 0.000 claims description 2
- 235000010356 sorbitol Nutrition 0.000 claims description 2
- 229940082787 spirulina Drugs 0.000 claims description 2
- 150000005846 sugar alcohols Chemical class 0.000 claims description 2
- 230000001225 therapeutic effect Effects 0.000 claims description 2
- 239000011721 thiamine Substances 0.000 claims description 2
- 235000019157 thiamine Nutrition 0.000 claims description 2
- 229960003495 thiamine Drugs 0.000 claims description 2
- KYMBYSLLVAOCFI-UHFFFAOYSA-N thiamine Chemical compound CC1=C(CCO)SCN1CC1=CN=C(C)N=C1N KYMBYSLLVAOCFI-UHFFFAOYSA-N 0.000 claims description 2
- 229960001295 tocopherol Drugs 0.000 claims description 2
- 239000011732 tocopherol Substances 0.000 claims description 2
- 235000010384 tocopherol Nutrition 0.000 claims description 2
- 229930003799 tocopherol Natural products 0.000 claims description 2
- 239000003053 toxin Substances 0.000 claims description 2
- 231100000765 toxin Toxicity 0.000 claims description 2
- VZCYOOQTPOCHFL-UHFFFAOYSA-N trans-butenedioic acid Natural products OC(=O)C=CC(O)=O VZCYOOQTPOCHFL-UHFFFAOYSA-N 0.000 claims description 2
- 229960005486 vaccine Drugs 0.000 claims description 2
- 229940005605 valeric acid Drugs 0.000 claims description 2
- 229940088594 vitamin Drugs 0.000 claims description 2
- 229930003231 vitamin Natural products 0.000 claims description 2
- 235000013343 vitamin Nutrition 0.000 claims description 2
- 239000011782 vitamin Substances 0.000 claims description 2
- 239000011718 vitamin C Substances 0.000 claims description 2
- 235000019154 vitamin C Nutrition 0.000 claims description 2
- 229940011671 vitamin b6 Drugs 0.000 claims description 2
- 150000003722 vitamin derivatives Chemical class 0.000 claims description 2
- 239000000811 xylitol Substances 0.000 claims description 2
- HEBKCHPVOIAQTA-SCDXWVJYSA-N xylitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)CO HEBKCHPVOIAQTA-SCDXWVJYSA-N 0.000 claims description 2
- 235000010447 xylitol Nutrition 0.000 claims description 2
- 229960002675 xylitol Drugs 0.000 claims description 2
- GVJHHUAWPYXKBD-IEOSBIPESA-N α-tocopherol Chemical compound OC1=C(C)C(C)=C2O[C@@](CCC[C@H](C)CCC[C@H](C)CCCC(C)C)(C)CCC2=C1C GVJHHUAWPYXKBD-IEOSBIPESA-N 0.000 claims description 2
- XLSMFKSTNGKWQX-UHFFFAOYSA-N hydroxyacetone Chemical compound CC(=O)CO XLSMFKSTNGKWQX-UHFFFAOYSA-N 0.000 claims 2
- FDJOLVPMNUYSCM-WZHZPDAFSA-L cobalt(3+);[(2r,3s,4r,5s)-5-(5,6-dimethylbenzimidazol-1-yl)-4-hydroxy-2-(hydroxymethyl)oxolan-3-yl] [(2r)-1-[3-[(1r,2r,3r,4z,7s,9z,12s,13s,14z,17s,18s,19r)-2,13,18-tris(2-amino-2-oxoethyl)-7,12,17-tris(3-amino-3-oxopropyl)-3,5,8,8,13,15,18,19-octamethyl-2 Chemical compound [Co+3].N#[C-].N([C@@H]([C@]1(C)[N-]\C([C@H]([C@@]1(CC(N)=O)C)CCC(N)=O)=C(\C)/C1=N/C([C@H]([C@@]1(CC(N)=O)C)CCC(N)=O)=C\C1=N\C([C@H](C1(C)C)CCC(N)=O)=C/1C)[C@@H]2CC(N)=O)=C\1[C@]2(C)CCC(=O)NC[C@@H](C)OP([O-])(=O)O[C@H]1[C@@H](O)[C@@H](N2C3=CC(C)=C(C)C=C3N=C2)O[C@@H]1CO FDJOLVPMNUYSCM-WZHZPDAFSA-L 0.000 claims 1
- 239000000047 product Substances 0.000 description 39
- 239000002773 nucleotide Substances 0.000 description 33
- 125000003729 nucleotide group Chemical group 0.000 description 33
- 108091028043 Nucleic acid sequence Proteins 0.000 description 29
- GUGNSJAORJLKGP-UHFFFAOYSA-K sodium 8-methoxypyrene-1,3,6-trisulfonate Chemical group [Na+].[Na+].[Na+].C1=C2C(OC)=CC(S([O-])(=O)=O)=C(C=C3)C2=C2C3=C(S([O-])(=O)=O)C=C(S([O-])(=O)=O)C2=C1 GUGNSJAORJLKGP-UHFFFAOYSA-K 0.000 description 28
- 108020004414 DNA Proteins 0.000 description 22
- 101100465559 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) PRE7 gene Proteins 0.000 description 22
- 238000006243 chemical reaction Methods 0.000 description 22
- GNGACRATGGDKBX-UHFFFAOYSA-N dihydroxyacetone phosphate Chemical compound OCC(=O)COP(O)(O)=O GNGACRATGGDKBX-UHFFFAOYSA-N 0.000 description 22
- 101150076896 pts1 gene Proteins 0.000 description 22
- 108091026890 Coding region Proteins 0.000 description 18
- 102000040430 polynucleotide Human genes 0.000 description 17
- 108091033319 polynucleotide Proteins 0.000 description 17
- 239000002157 polynucleotide Substances 0.000 description 17
- 239000013598 vector Substances 0.000 description 17
- 230000037353 metabolic pathway Effects 0.000 description 14
- 108091035707 Consensus sequence Proteins 0.000 description 13
- 230000012010 growth Effects 0.000 description 12
- 229920001184 polypeptide Polymers 0.000 description 12
- 102000004196 processed proteins & peptides Human genes 0.000 description 12
- 238000013518 transcription Methods 0.000 description 11
- 230000000694 effects Effects 0.000 description 10
- 230000008685 targeting Effects 0.000 description 10
- 230000035897 transcription Effects 0.000 description 10
- 230000015572 biosynthetic process Effects 0.000 description 8
- 239000013604 expression vector Substances 0.000 description 8
- BRZYSWJRSDMWLG-CAXSIQPQSA-N geneticin Natural products O1C[C@@](O)(C)[C@H](NC)[C@@H](O)[C@H]1O[C@@H]1[C@@H](O)[C@H](O[C@@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](C(C)O)O2)N)[C@@H](N)C[C@H]1N BRZYSWJRSDMWLG-CAXSIQPQSA-N 0.000 description 8
- 239000000284 extract Substances 0.000 description 7
- 239000013612 plasmid Substances 0.000 description 7
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 6
- 102100036826 Aldehyde oxidase Human genes 0.000 description 6
- 101000928314 Homo sapiens Aldehyde oxidase Proteins 0.000 description 6
- 241000421270 Komagataella phaffii CBS 7435 Species 0.000 description 6
- 241000589540 Pseudomonas fluorescens Species 0.000 description 6
- 230000003321 amplification Effects 0.000 description 6
- 230000001580 bacterial effect Effects 0.000 description 6
- 108010024302 benzaldehyde lyase Proteins 0.000 description 6
- 210000004899 c-terminal region Anatomy 0.000 description 6
- 230000010354 integration Effects 0.000 description 6
- 238000004519 manufacturing process Methods 0.000 description 6
- 238000003199 nucleic acid amplification method Methods 0.000 description 6
- 238000003259 recombinant expression Methods 0.000 description 6
- 238000010367 cloning Methods 0.000 description 5
- 238000000855 fermentation Methods 0.000 description 5
- 230000004151 fermentation Effects 0.000 description 5
- 239000000543 intermediate Substances 0.000 description 5
- 108020004999 messenger RNA Proteins 0.000 description 5
- 244000005700 microbiome Species 0.000 description 5
- 239000000203 mixture Substances 0.000 description 5
- 230000000858 peroxisomal effect Effects 0.000 description 5
- 102000005962 receptors Human genes 0.000 description 5
- 108020003175 receptors Proteins 0.000 description 5
- 230000028327 secretion Effects 0.000 description 5
- 239000000758 substrate Substances 0.000 description 5
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 4
- 241000588724 Escherichia coli Species 0.000 description 4
- 108010067193 Formaldehyde transketolase Proteins 0.000 description 4
- 108091005461 Nucleic proteins Proteins 0.000 description 4
- 239000011324 bead Substances 0.000 description 4
- 230000001413 cellular effect Effects 0.000 description 4
- 238000006911 enzymatic reaction Methods 0.000 description 4
- 230000001939 inductive effect Effects 0.000 description 4
- 230000004108 pentose phosphate pathway Effects 0.000 description 4
- 238000012216 screening Methods 0.000 description 4
- 238000003786 synthesis reaction Methods 0.000 description 4
- 101150006240 AOX2 gene Proteins 0.000 description 3
- 229920001817 Agar Polymers 0.000 description 3
- 108010021809 Alcohol dehydrogenase Proteins 0.000 description 3
- 102000007698 Alcohol dehydrogenase Human genes 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- 108090000856 Lyases Proteins 0.000 description 3
- 102000004317 Lyases Human genes 0.000 description 3
- KWYUFKZDYYNOTN-UHFFFAOYSA-M Potassium hydroxide Chemical compound [OH-].[K+] KWYUFKZDYYNOTN-UHFFFAOYSA-M 0.000 description 3
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 3
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 3
- -1 Ribulose Monophosphate Chemical class 0.000 description 3
- 239000008272 agar Substances 0.000 description 3
- 125000000539 amino acid group Chemical group 0.000 description 3
- 150000001722 carbon compounds Chemical class 0.000 description 3
- 230000001276 controlling effect Effects 0.000 description 3
- 239000005547 deoxyribonucleotide Substances 0.000 description 3
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 3
- 238000007824 enzymatic assay Methods 0.000 description 3
- 101150096282 fls gene Proteins 0.000 description 3
- 239000012634 fragment Substances 0.000 description 3
- 230000004927 fusion Effects 0.000 description 3
- 238000000338 in vitro Methods 0.000 description 3
- 238000011534 incubation Methods 0.000 description 3
- 239000011159 matrix material Substances 0.000 description 3
- 239000002609 medium Substances 0.000 description 3
- 239000000243 solution Substances 0.000 description 3
- 238000006467 substitution reaction Methods 0.000 description 3
- 238000011144 upstream manufacturing Methods 0.000 description 3
- GFAZHVHNLUBROE-UHFFFAOYSA-N 1-hydroxybutan-2-one Chemical compound CCC(=O)CO GFAZHVHNLUBROE-UHFFFAOYSA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 241000282461 Canis lupus Species 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- 102100038385 Coiled-coil domain-containing protein R3HCC1L Human genes 0.000 description 2
- ZAQJHHRNXZUBTE-UHFFFAOYSA-N D-threo-2-Pentulose Natural products OCC(O)C(O)C(=O)CO ZAQJHHRNXZUBTE-UHFFFAOYSA-N 0.000 description 2
- FNZLKVNUWIIPSJ-RFZPGFLSSA-N D-xylulose 5-phosphate Chemical compound OCC(=O)[C@@H](O)[C@H](O)COP(O)(O)=O FNZLKVNUWIIPSJ-RFZPGFLSSA-N 0.000 description 2
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 2
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 2
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 2
- 101000743767 Homo sapiens Coiled-coil domain-containing protein R3HCC1L Proteins 0.000 description 2
- 101001073025 Homo sapiens Peroxisomal targeting signal 1 receptor Proteins 0.000 description 2
- UFHFLCQGNIYNRP-UHFFFAOYSA-N Hydrogen Chemical compound [H][H] UFHFLCQGNIYNRP-UHFFFAOYSA-N 0.000 description 2
- 108091034117 Oligonucleotide Proteins 0.000 description 2
- 101150056463 PEX5 gene Proteins 0.000 description 2
- 102100036598 Peroxisomal targeting signal 1 receptor Human genes 0.000 description 2
- NBBJYMSMWIIQGU-UHFFFAOYSA-N Propionic aldehyde Chemical compound CCC=O NBBJYMSMWIIQGU-UHFFFAOYSA-N 0.000 description 2
- 238000003559 RNA-seq method Methods 0.000 description 2
- 238000011529 RT qPCR Methods 0.000 description 2
- 108091028664 Ribonucleotide Proteins 0.000 description 2
- 108010003581 Ribulose-bisphosphate carboxylase Proteins 0.000 description 2
- 210000004436 artificial bacterial chromosome Anatomy 0.000 description 2
- 210000004507 artificial chromosome Anatomy 0.000 description 2
- 210000001106 artificial yeast chromosome Anatomy 0.000 description 2
- 230000004888 barrier function Effects 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 239000006143 cell culture medium Substances 0.000 description 2
- 238000007398 colorimetric assay Methods 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 230000002255 enzymatic effect Effects 0.000 description 2
- 230000002538 fungal effect Effects 0.000 description 2
- 238000010353 genetic engineering Methods 0.000 description 2
- 239000011521 glass Substances 0.000 description 2
- 239000008103 glucose Substances 0.000 description 2
- 230000034659 glycolysis Effects 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 230000001976 improved effect Effects 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 150000002632 lipids Chemical class 0.000 description 2
- 230000004807 localization Effects 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 238000002493 microarray Methods 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 229930027945 nicotinamide-adenine dinucleotide Natural products 0.000 description 2
- BOPGDPNILDQYTO-NNYOXOHSSA-N nicotinamide-adenine dinucleotide Chemical compound C1=CCC(C(=O)N)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]2[C@H]([C@@H](O)[C@@H](O2)N2C3=NC=NC(N)=C3N=C2)O)O1 BOPGDPNILDQYTO-NNYOXOHSSA-N 0.000 description 2
- 239000008188 pellet Substances 0.000 description 2
- 229920000642 polymer Polymers 0.000 description 2
- 239000002243 precursor Substances 0.000 description 2
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 2
- 230000006861 primary carbon metabolism Effects 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 230000035806 respiratory chain Effects 0.000 description 2
- 239000002336 ribonucleotide Substances 0.000 description 2
- 125000002652 ribonucleotide group Chemical group 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 229960002363 thiamine pyrophosphate Drugs 0.000 description 2
- 235000008170 thiamine pyrophosphate Nutrition 0.000 description 2
- 239000011678 thiamine pyrophosphate Substances 0.000 description 2
- YXVCLPJQTZXJLH-UHFFFAOYSA-N thiamine(1+) diphosphate chloride Chemical compound [Cl-].CC1=C(CCOP(O)(=O)OP(O)(O)=O)SC=[N+]1CC1=CN=C(C)N=C1N YXVCLPJQTZXJLH-UHFFFAOYSA-N 0.000 description 2
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 230000014621 translational initiation Effects 0.000 description 2
- 230000032258 transport Effects 0.000 description 2
- LXJXRIRHZLFYRP-VKHMYHEASA-L (R)-2-Hydroxy-3-(phosphonooxy)-propanal Natural products O=C[C@H](O)COP([O-])([O-])=O LXJXRIRHZLFYRP-VKHMYHEASA-L 0.000 description 1
- ASJSAQIRZKANQN-CRCLSJGQSA-N 2-deoxy-D-ribose Chemical compound OC[C@@H](O)[C@@H](O)CC=O ASJSAQIRZKANQN-CRCLSJGQSA-N 0.000 description 1
- 101150061183 AOX1 gene Proteins 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 102100038779 Arfaptin-2 Human genes 0.000 description 1
- 101100434663 Bacillus subtilis (strain 168) fbaA gene Proteins 0.000 description 1
- 101800001415 Bri23 peptide Proteins 0.000 description 1
- 101800000655 C-terminal peptide Proteins 0.000 description 1
- 102400000107 C-terminal peptide Human genes 0.000 description 1
- 101150027801 CTA1 gene Proteins 0.000 description 1
- 101100273295 Candida albicans (strain SC5314 / ATCC MYA-2876) CAT1 gene Proteins 0.000 description 1
- 101100351264 Candida albicans (strain SC5314 / ATCC MYA-2876) PDC11 gene Proteins 0.000 description 1
- 101100055370 Candida boidinii AOD1 gene Proteins 0.000 description 1
- KRKNYBCHXYNGOX-UHFFFAOYSA-K Citrate Chemical compound [O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O KRKNYBCHXYNGOX-UHFFFAOYSA-K 0.000 description 1
- YASYEJJMZJALEJ-UHFFFAOYSA-N Citric acid monohydrate Chemical compound O.OC(=O)CC(O)(C(O)=O)CC(O)=O YASYEJJMZJALEJ-UHFFFAOYSA-N 0.000 description 1
- 229920000742 Cotton Polymers 0.000 description 1
- LXJXRIRHZLFYRP-VKHMYHEASA-N D-glyceraldehyde 3-phosphate Chemical compound O=C[C@H](O)COP(O)(O)=O LXJXRIRHZLFYRP-VKHMYHEASA-N 0.000 description 1
- FNZLKVNUWIIPSJ-UHNVWZDZSA-N D-ribulose 5-phosphate Chemical compound OCC(=O)[C@H](O)[C@H](O)COP(O)(O)=O FNZLKVNUWIIPSJ-UHNVWZDZSA-N 0.000 description 1
- 101710088194 Dehydrogenase Proteins 0.000 description 1
- 238000002965 ELISA Methods 0.000 description 1
- 101150095274 FBA1 gene Proteins 0.000 description 1
- 108010015133 Galactose oxidase Proteins 0.000 description 1
- 101000809446 Homo sapiens Arfaptin-2 Proteins 0.000 description 1
- 108010001336 Horseradish Peroxidase Proteins 0.000 description 1
- 108010044467 Isoenzymes Proteins 0.000 description 1
- 241001043155 Komagataella sp. Species 0.000 description 1
- 101150084262 MDH3 gene Proteins 0.000 description 1
- 229910004616 Na2MoO4.2H2 O Inorganic materials 0.000 description 1
- 238000000636 Northern blotting Methods 0.000 description 1
- 102000004316 Oxidoreductases Human genes 0.000 description 1
- 108090000854 Oxidoreductases Proteins 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 101150050255 PDC1 gene Proteins 0.000 description 1
- 239000001888 Peptone Substances 0.000 description 1
- 108010080698 Peptones Proteins 0.000 description 1
- 108010077056 Peroxisomal Targeting Signal 2 Receptor Proteins 0.000 description 1
- 102100032924 Peroxisomal targeting signal 2 receptor Human genes 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 101100313649 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) POT1 gene Proteins 0.000 description 1
- 241000235343 Saccharomycetales Species 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 1
- 102000014701 Transketolase Human genes 0.000 description 1
- 108010043652 Transketolase Proteins 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 101100161758 Yarrowia lipolytica (strain CLIB 122 / E 150) POX3 gene Proteins 0.000 description 1
- 108010084455 Zeocin Proteins 0.000 description 1
- NRAUADCLPJTGSF-ZPGVOIKOSA-N [(2r,3s,4r,5r,6r)-6-[[(3as,7r,7as)-7-hydroxy-4-oxo-1,3a,5,6,7,7a-hexahydroimidazo[4,5-c]pyridin-2-yl]amino]-5-[[(3s)-3,6-diaminohexanoyl]amino]-4-hydroxy-2-(hydroxymethyl)oxan-3-yl] carbamate Chemical compound NCCC[C@H](N)CC(=O)N[C@@H]1[C@@H](O)[C@H](OC(N)=O)[C@@H](CO)O[C@H]1\N=C/1N[C@H](C(=O)NC[C@H]2O)[C@@H]2N\1 NRAUADCLPJTGSF-ZPGVOIKOSA-N 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 241000222124 [Candida] boidinii Species 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 230000003044 adaptive effect Effects 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 235000006708 antioxidants Nutrition 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- OHDRQQURAXLVGJ-HLVWOLMTSA-N azane;(2e)-3-ethyl-2-[(e)-(3-ethyl-6-sulfo-1,3-benzothiazol-2-ylidene)hydrazinylidene]-1,3-benzothiazole-6-sulfonic acid Chemical compound [NH4+].[NH4+].S/1C2=CC(S([O-])(=O)=O)=CC=C2N(CC)C\1=N/N=C1/SC2=CC(S([O-])(=O)=O)=CC=C2N1CC OHDRQQURAXLVGJ-HLVWOLMTSA-N 0.000 description 1
- 238000005842 biochemical reaction Methods 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 230000036983 biotransformation Effects 0.000 description 1
- KGBXLFKZBHKPEV-UHFFFAOYSA-N boric acid Chemical compound OB(O)O KGBXLFKZBHKPEV-UHFFFAOYSA-N 0.000 description 1
- ZTQSAGDEMFDKMZ-UHFFFAOYSA-N butyric aldehyde Natural products CCCC=O ZTQSAGDEMFDKMZ-UHFFFAOYSA-N 0.000 description 1
- LLSDKQJKOVVTOJ-UHFFFAOYSA-L calcium chloride dihydrate Chemical compound O.O.[Cl-].[Cl-].[Ca+2] LLSDKQJKOVVTOJ-UHFFFAOYSA-L 0.000 description 1
- 229940041514 candida albicans extract Drugs 0.000 description 1
- 238000005251 capillar electrophoresis Methods 0.000 description 1
- 238000004850 capillary HPLC Methods 0.000 description 1
- 150000001721 carbon Chemical group 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 238000007036 catalytic synthesis reaction Methods 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 229960004106 citric acid Drugs 0.000 description 1
- 229960002303 citric acid monohydrate Drugs 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- ASARMUCNOOHMLO-WLORSUFZSA-L cobalt(2+);[(2r,3s,4r,5s)-5-(5,6-dimethylbenzimidazol-1-yl)-4-hydroxy-2-(hydroxymethyl)oxolan-3-yl] [(2s)-1-[3-[(1r,2r,3r,4z,7s,9z,12s,13s,14z,17s,18s,19r)-2,13,18-tris(2-amino-2-oxoethyl)-7,12,17-tris(3-amino-3-oxopropyl)-3,5,8,8,13,15,18,19-octamethyl-2 Chemical compound [Co+2].[N-]([C@@H]1[C@H](CC(N)=O)[C@@]2(C)CCC(=O)NC[C@H](C)OP([O-])(=O)O[C@H]3[C@H]([C@H](O[C@@H]3CO)N3C4=CC(C)=C(C)C=C4N=C3)O)\C2=C(C)/C([C@H](C\2(C)C)CCC(N)=O)=N/C/2=C\C([C@H]([C@@]/2(CC(N)=O)C)CCC(N)=O)=N\C\2=C(C)/C2=N[C@]1(C)[C@@](C)(CC(N)=O)[C@@H]2CCC(N)=O ASARMUCNOOHMLO-WLORSUFZSA-L 0.000 description 1
- 238000006482 condensation reaction Methods 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 238000001816 cooling Methods 0.000 description 1
- JZCCFEFSEZPSOG-UHFFFAOYSA-L copper(II) sulfate pentahydrate Chemical compound O.O.O.O.O.[Cu+2].[O-]S([O-])(=O)=O JZCCFEFSEZPSOG-UHFFFAOYSA-L 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- 231100000433 cytotoxic Toxicity 0.000 description 1
- 230000001472 cytotoxic effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- MNNHAPBLZZVQHP-UHFFFAOYSA-N diammonium hydrogen phosphate Chemical compound [NH4+].[NH4+].OP([O-])([O-])=O MNNHAPBLZZVQHP-UHFFFAOYSA-N 0.000 description 1
- 229910000388 diammonium phosphate Inorganic materials 0.000 description 1
- 235000019838 diammonium phosphate Nutrition 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000003937 drug carrier Substances 0.000 description 1
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 1
- 238000005265 energy consumption Methods 0.000 description 1
- 238000004146 energy storage Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 238000001952 enzyme assay Methods 0.000 description 1
- 238000012869 ethanol precipitation Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 230000004907 flux Effects 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 239000008098 formaldehyde solution Substances 0.000 description 1
- 239000000446 fuel Substances 0.000 description 1
- 239000007789 gas Substances 0.000 description 1
- 238000012239 gene modification Methods 0.000 description 1
- 230000005017 genetic modification Effects 0.000 description 1
- 235000013617 genetically modified food Nutrition 0.000 description 1
- 238000002873 global sequence alignment Methods 0.000 description 1
- 230000004110 gluconeogenesis Effects 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 229910052739 hydrogen Inorganic materials 0.000 description 1
- 239000012535 impurity Substances 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 230000002458 infectious effect Effects 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 150000002484 inorganic compounds Chemical class 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- SURQXAFEQWPFPV-UHFFFAOYSA-L iron(2+) sulfate heptahydrate Chemical compound O.O.O.O.O.O.O.[Fe+2].[O-]S([O-])(=O)=O SURQXAFEQWPFPV-UHFFFAOYSA-L 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- ISPYRSDWRDQNSW-UHFFFAOYSA-L manganese(II) sulfate monohydrate Chemical compound O.[Mn+2].[O-]S([O-])(=O)=O ISPYRSDWRDQNSW-UHFFFAOYSA-L 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 230000035800 maturation Effects 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 238000006241 metabolic reaction Methods 0.000 description 1
- ONCZDRURRATYFI-QTCHDTBASA-N methyl (2z)-2-methoxyimino-2-[2-[[(e)-1-[3-(trifluoromethyl)phenyl]ethylideneamino]oxymethyl]phenyl]acetate Chemical compound CO\N=C(/C(=O)OC)C1=CC=CC=C1CO\N=C(/C)C1=CC=CC(C(F)(F)F)=C1 ONCZDRURRATYFI-QTCHDTBASA-N 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 102000035118 modified proteins Human genes 0.000 description 1
- 108091005573 modified proteins Proteins 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- QJGQUHMNIGDVPM-UHFFFAOYSA-N nitrogen group Chemical group [N] QJGQUHMNIGDVPM-UHFFFAOYSA-N 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 235000005985 organic acids Nutrition 0.000 description 1
- 150000002894 organic compounds Chemical class 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 230000003647 oxidation Effects 0.000 description 1
- 238000007254 oxidation reaction Methods 0.000 description 1
- 150000002972 pentoses Chemical class 0.000 description 1
- 235000019319 peptone Nutrition 0.000 description 1
- 239000000546 pharmaceutical excipient Substances 0.000 description 1
- CWCMIVBLVUHDHK-ZSNHEYEWSA-N phleomycin D1 Chemical compound N([C@H](C(=O)N[C@H](C)[C@@H](O)[C@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)NCCC=1SC[C@@H](N=1)C=1SC=C(N=1)C(=O)NCCCCNC(N)=N)[C@@H](O[C@H]1[C@H]([C@@H](O)[C@H](O)[C@H](CO)O1)O[C@@H]1[C@H]([C@@H](OC(N)=O)[C@H](O)[C@@H](CO)O1)O)C=1N=CNC=1)C(=O)C1=NC([C@H](CC(N)=O)NC[C@H](N)C(N)=O)=NC(N)=C1C CWCMIVBLVUHDHK-ZSNHEYEWSA-N 0.000 description 1
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 1
- 150000003904 phospholipids Chemical class 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 230000019525 primary metabolic process Effects 0.000 description 1
- 230000002035 prolonged effect Effects 0.000 description 1
- ULWHHBHJGPPBCO-UHFFFAOYSA-N propane-1,1-diol Chemical compound CCC(O)O ULWHHBHJGPPBCO-UHFFFAOYSA-N 0.000 description 1
- 230000012846 protein folding Effects 0.000 description 1
- 230000026447 protein localization Effects 0.000 description 1
- 238000001273 protein sequence alignment Methods 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 239000002994 raw material Substances 0.000 description 1
- 239000000376 reactant Substances 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- 239000012266 salt solution Substances 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 230000009919 sequestration Effects 0.000 description 1
- 108010027322 single cell proteins Proteins 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- FDEIWTXVNPKYDL-UHFFFAOYSA-N sodium molybdate dihydrate Chemical compound O.O.[Na+].[Na+].[O-][Mo]([O-])(=O)=O FDEIWTXVNPKYDL-UHFFFAOYSA-N 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 239000001117 sulphuric acid Substances 0.000 description 1
- 235000011149 sulphuric acid Nutrition 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- 229960000344 thiamine hydrochloride Drugs 0.000 description 1
- DPJRMOMPQZCRJU-UHFFFAOYSA-M thiamine hydrochloride Chemical compound Cl.[Cl-].CC1=C(CCO)SC=[N+]1CC1=CN=C(C)N=C1N DPJRMOMPQZCRJU-UHFFFAOYSA-M 0.000 description 1
- 235000019190 thiamine hydrochloride Nutrition 0.000 description 1
- 239000011747 thiamine hydrochloride Substances 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- 230000005026 transcription initiation Effects 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 210000003934 vacuole Anatomy 0.000 description 1
- 230000035899 viability Effects 0.000 description 1
- 108700026220 vif Genes Proteins 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
- 239000012138 yeast extract Substances 0.000 description 1
- 239000007222 ypd medium Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/88—Lyases (4.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/24—Preparation of oxygen-containing organic compounds containing a carbonyl group
- C12P7/26—Ketones
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/80—Vectors or expression systems specially adapted for eukaryotic hosts for fungi
- C12N15/81—Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts
- C12N15/815—Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts for yeasts other than Saccharomyces
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y401/00—Carbon-carbon lyases (4.1)
- C12Y401/02—Aldehyde-lyases (4.1.2)
- C12Y401/02038—Benzoin aldolase (4.1.2.38)
Definitions
- the invention relates to eukaryotic cells engineered to express a synthetic formolase (FLS) pathway, in particular wherein the FLS pathway is comprised in an organelle of the eukaryotic cell, methods of producing biomass and/or expression products of the eukaryotic cell in a cell culture using a Ci (one carbon) compound as a carbon source, and fusion proteins comprising an FLS that is fused to a peroxisomal targeting signal (PTS), to target expression of the FLS into the cell’s peroxisome.
- FLS synthetic formolase
- Methanol plays a crucial role in a future methanol economy.
- methanol is produced from fossil resources but “green-methanol” or “bio-methanol” can be directly produced from CO2.
- methanol has the advantage to be more easily transported and stored than molecular hydrogen. It is already used as a bulk chemical and multiple processing routes are available. It can serve as fuel and in addition it can be utilized as a carbon source by eukaryotic cells, Ribulose Monophosphate cycle, Dihydroxyacetone cycle, ribulose bisphosphate cycle or the Xylulose Monophosphate cycle. All natural pathways consume high energy (in form of ATP or NAD(P)H).
- Komagataella phaffii (herein also referred to as Pichia pastoris) is an industrially used methylotrophic yeast and it can grow on methanol as a single carbon source to high biomass densities.
- the methanol assimilation pathway (Xylulose-Monophosphate cycle, XuMP) has similarities to the Calvin-Benson-Bassham cycle and the pentose-phosphate pathway. However, this pathway requires at least 10 enzymes to enable the conversion from methanol to dihydroxyacetone (DHA). In addition, three (3) moles of ATP are required for the conversion of three moles of (3) formaldehyde to one (1) mol of Dihydroxyacetone phosphate (DHAP). In order to produce ATP a significant amount of methanol has to be dissimilated by P. pastoris to CO2 obtaining NADH and thus ATP via the respiratory chain.
- DHA dihydroxyacetone
- DHAP Dihydroxyacetone phosphate
- Cai et al. (2021) and WO2022042427A1 disclose cell-free chemoenzymatic starch synthesis from carbon dioxide.
- a mutant of the formolase enzyme (FLS-M3) was described.
- CN113151230A discloses FLS mutants.
- CN113122525A discloses mutant formaldehyde-converting enzymes and expression of said mutants in a bacterium.
- CN110438169A discloses a method for catalytic synthesis of 1-hydroxy-2- butanone in bacterial cells using formaldehyde and propionaldehyde as raw materials and an engineered E. coli strain containing formaldehyde lyase (FLS).
- CN107475281 A discloses E. coli engineered to comprise a methanol metabolism pathway including methanol dehydrogenase and formaldehyde lyase FLS.
- US20200181629A1 discloses a yeast comprising a synthetic Calvin cycle wherein a ribulose-bisphosphate carboxylase (RuBisCO) gene and ribulose phosphate kinase (PRK) gene are fused with a peroxisomal targeting signal (PTS).
- RuBisCO ribulose-bisphosphate carboxylase
- PRK ribulose phosphate kinase
- RuRmayer et al. (2015) describe systems-level organization of yeast methylotrophic lifestyle.
- the authors analyzed the regulation patterns of 5,354 genes, 575 proteins, 141 metabolites, and fluxes through 39 reactions of P. pastoris comparing growth on glucose and on a methanol/glycerol mixed medium, respectively. It was found that the entire methanol assimilation pathway is localized to peroxisomes rather than employing part of the cytosolic pentose phosphate pathway for xylulose-5-phosphate regeneration.
- KR20210038805A discloses methylotrophic bacteria with enhanced 1 ,2- propylene glycol (propanediol; PDO) productivity and a 1 ,2-PDO production method using the same.
- WO201 2037413A2 discloses systems, compounds and methods for the conversion of C1 carbon compounds to higher carbon compounds useful for the generation of commodity compounds.
- WO2012170292A1 discloses modulation of metabolic pathways for improving bioprocess performance and secreted protein productivity of yeast.
- An isolated fungal host cell such as Pichia pastoris, lacking full wild-type levels of dihydroxyacetone synthase activity due to a knocked- out DAS1 or DAS2, is provided.
- the host cell further comprises a heterologous polynucleotide encoding a heterologous polypeptide. Methods for expressing the heterologous polypeptide in such cells, and purifying the expressed polypeptide are also disclosed.
- Novel strategies are needed to address current challenges in energy storage and carbon sequestration.
- the invention provides for a eukaryotic cell which is engineered to express a synthetic formolase (FLS) pathway comprising a recombinant FLS to biotransform formaldehyde into dihydroxyacetone (DHA).
- FLS synthetic formolase
- DHA is herein understood to refer to DHA in its unphosphorylated form or its phosphorylated form, dihydroxyacetone phosphate (DHAP). DHA is also known as 1 ,3- dihydroxypropan-2-one, 1 ,3-dihydroxypropanone, or glycerone.
- DHAP dihydroxyacetone phosphate
- the biotransformation comprises biosynthesis of dihydroxyacetone from formaldehyde through enzymatic reaction with the FLS which converts formaldehyde to dihydroxyacetone.
- the FLS is a lyase which catalyzes the carboligation of three one- carbon formaldehyde molecules into one three-carbon dihydroxyacetone molecule.
- an FLS gene that was introduced into eukaryotic cells, in particular into peroxisomes of a methylotrophic yeast, to replace the natural Xylulose Monophosphate pathway of Pichia pastoris ( Komagataella phaffii).
- a peroxisome targeting signal was fused to the FLS. Cells containing this pathway were able to grow on methanol as carbon source.
- a eukaryotic cell that expresses a recombinant FLS comprises the new FLS pathway.
- the new FLS pathway comprises assimilating a Ci compound like methanol to produce formaldehyde in an enzymatic reaction prior to the FLS enzymatic reaction. Therefore, the invention provides for engineered cells comprising a new carbon fixation pathway including the FLS pathway, which has the advantage of only a small number of thermodynamically favorable chemical transformations that convert a Ci compound into a three-carbon sugar in central metabolism.
- the new FLS pathway has a lower energy consumption compared to the natural pathway.
- FLS enzyme formolase
- dihydroxyacetone phosphate can be produced in an energy efficient way, since the formose reaction catalyzed by the enzyme formolase (FLS) can directly form DHA from three (3) moles of formaldehyde. Only one (1) mol of ATP is required (instead of three mol ATP as in the Xylulose- Monophosphate cycle) to form dihydroxyacetone phosphate, which is an intermediate of glycolysis and thus of the central carbon metabolism.
- FLS constructs proteins and nucleic acid molecules which can be expressed in eukaryotic cells, in particular in cellular compartments like organelles or peroxisomes, and respective expression cassettes.
- the FLS pathway comprises or consists of the recombinant FLS to convert formaldehyde into dihydroxyacetone (DHA).
- DHA dihydroxyacetone
- recombinant FLS is expressed under the control of a methanolinducible promoter, preferably pDAS2 or pAOX1.
- the invention provides for a yeast cell comprising a peroxisome, wherein the yeast cell is engineered to express a synthetic formolase (FLS) pathway in the peroxisome, comprising a recombinant FLS to biotransform formaldehyde into dihydroxyacetone (DHA), wherein the FLS is expressed as an FLS fusion protein that comprises the FLS fused to a peroxisomal targeting signal (PTS).
- FLS synthetic formolase
- the FLS pathway further comprises one or more enzymes to biotransform a Ci compound into formaldehyde, preferably wherein the Ci compound is any one of methanol, formate, methane, carbon monoxide, or carbon dioxide.
- the FLS pathway further comprises a methanol to formaldehyde converting enzyme (“MFCE”), such as an alcohol oxidase or a methanol dehydrogenase, to biotransform methanol into formaldehyde.
- MFCE methanol to formaldehyde converting enzyme
- the alcohol oxidase is a methanol oxidase.
- the FLS pathway is a linear pathway.
- the FLS pathway is comprised in a metabolic pathway which further comprises one or more enzymes of gluconeogenesis using DHA (or DHAP) as a precursor.
- the FLS pathway is comprised in an organelle of the cell, preferably a peroxisome or mitochondria.
- the FLS is a heterologous protein, in particular an artificial enzyme originating from or comprising a wild-type (such as naturally-occurring in any organism) or a mutant FLS.
- the FLS is fused to one or more heterologous sequences.
- the FLS is expressed as an FLS fusion protein that comprises the FLS fused to a peroxisomal targeting signal (PTS), preferably wherein the PTS is selected from the group consisting of the Minimal PTS1 sequence (“SKL”), the “PTS1 consensus sequence” (SEQ ID NO:17), the “PTS2 consensus sequence” (SEQ ID NO:18), SEQ ID NO:19, 20, 21 , 22, 23, 24, 25, and 26.
- PTS peroxisomal targeting signal
- the PTS comprises or consists of an amino acid sequence selected from the group consisting of SKL, SEQ ID NO:17, 18, 19, 20, 21 , 22, 23, 24, 25, and 26.
- SEQ ID NO:17 comprises variables such that the amino acid sequence can have a length of 3-12 amino acids.
- the respective embodiments of SEQ ID NO:17 are provided as SEQ ID NO:35-44.
- the PTS can be fused at the C-terminus or the N-terminus of the FLS amino acid sequence, with or without a linking sequence.
- the PTS1 sequence comprises or consists of the sequence motif of a PTS1 sequence having a consensus sequence of (S/A/C/T)-(K/R/H)-(L/M/F) (i.e., the “PTS1 consensus sequence”), which is optionally extended at the N-terminus by one or more, up to nine amino acids, (X)o-9. Therefore, the PTS1 consensus sequence is characterized by the following formula: (X)o-9-(S/A/C/T)-(K/R/H)-(L/M/F) (see SEQ ID NO:17).
- a PTS of the PTS1 consensus sequence is fused to the C-terminus.
- the PTS2 sequence comprises or consists of the sequence motif of a PTS2 sequence having a consensus sequence of (R/K)-(L/I/V)-(X)5-(H/Q)-(L/A) (i.e., the “PTS2 consensus sequence”, see SEQ ID NO: 18), which is preferably fused to the N-terminus.
- the FLS pathway is expressed into the cell’s peroxisome, in particular wherein the cell is a yeast, such as e.g., a methylotrophic yeast.
- the FLS pathway expressed into the peroxisome comprises an MFCE and a FLS to convert methanol to dihydroxyacetone.
- the FLS pathway comprises an alcohol oxidase and FLS, wherein: a) the MFCE is an endogenously expressed peroxisomal enzyme; and b) the FLS is an enzyme that is targeted to the peroxisome by fusing a PTS.
- the FLS is characterized by one or more of the following features: a) the FLS is a bacterial FLS or a benzaldehyde lyase of bacterial origin, such as originating from Pseudomonas fluorescens e.g., Pseudomonas fluorescens biovar I, preferably wherein the FLS is identified as SEQ ID NO:33 or by the NCBI accession number: 2AG0_A (Chain A, benzaldehyde lyase), or a naturally-occurring FLS homolog or ortholog thereof; b) the FLS is a recombinant FLS which is a mutant of a), preferably wherein the recombinant FLS comprises SEQ ID NO:33 which is engineered to incorporate one or more point mutations, preferably up to 15, 14, 13, 12, 11 , 10, 9, 8, or 7 point mutations, wherein the point mutations comprise any one or more point mutations, preferably comprising
- the FLS comprises or consists of SEQ ID NO:31 or SEQ ID NO:32; d) the FLS comprises at least any one of 90%, 95%, 96%, 97%, 98%, 99% or 100% sequence identity to any one of the FLS of a), b), or c); e) the FLS is functional as determined in a colorimetric enzymatic assay, such as described herein; f) the nucleotide sequence encoding the FLS is codon-optimized for expression in the cell.
- FLS described herein are: a) a wild-type FLS of Pseudomonas fluorescens, wherein the FLS is identified as SEQ ID NO:33 or by the NCBI accession number: 2AG0 A (Chain A, benzaldehyde lyase); b) a recombinant FLS which comprises or consists of SEQ ID NO:32, which recombinant FLS comprises the point mutations A28I, W89R, L90T, R188H, A394G, G419N, and A480W as compared to SEQ ID NO:33; c) a recombinant FLS which comprises or consists of SEQ ID NO:31 , which recombinant FLS comprises the point mutations A28L, W89R, R188H, N283H, A394G, G419N, and A480W as compared to SEQ ID NO:33.
- the FLS identified as SEQ ID NO:31 or SEQ ID NO:32 is a functional FLS mutant of a wild-type FLS.
- the wild-type FLS identified by SEQ ID NO:33 comprises an amino acid (aa) sequence of 563 aa length.
- the mutant FLS can be longer than SEQ ID NO:33 e.g., by a C-terminal extension of the aa sequence.
- the C-terminal extension of SEQ ID NO:33 can be an extension of 1 to 12 consecutive aa of “GSTENLYFQSGA 1 (SEQ ID NO:34) wherein the extension starts with the first aa indicated in SEQ ID NO:34.
- a recombinant or mutant FLS originating from a wild-type FLS such as comprising or consisting of SEQ ID NO:33 may or may not comprise such C-terminal extension.
- any of the recombinant or mutant FLS comprising or consisting of SEQ ID NO:31 or SEQ ID NO:32, or a functional variant of any of the foregoing may or may not comprise such C-terminal extension.
- SEQ ID NO:31 or SEQ ID NO:32 comprises an aa sequence of 575 aa length.
- the FLS comprises of consists of SEQ ID NO:31 or SEQ ID NO:32, or a functional variant of any of the foregoing, preferably wherein a) the functional variant of SEQ ID NO:31 comprises at least 1 , 2, 3, 4, 5, 6, or 7 (all) of the point mutations A28L, W89R, R188H, N283H, A394G, G419N, and A480W, preferably at least 3, 4, or 5 mutations selected from W89R, R188H, A394G, G419N, or A480W, point mutations as compared to SEQ ID NO:33, and at least at least any one of 90%, 95%, 96%, 97%, 98%, 99% or 100% sequence identity to SEQ ID NO:31 ; b) the functional variant of SEQ ID NO:32 comprises at least 1 , 2, 3, 4, 5, 6, or 7 (all) of the point mutations A28I, W89R, L90T, R188H, A394G, G419N, and
- the cell is a microbial cell, a mammalian, insect, plant or algae cell.
- the cell is: a) a yeast cell, preferably of a genus selected from the group consisting of Pichia, Komagataella, Hansenula, Saccharomyces, Kluyveromyces, Candida, Ogataea, Yarrowia, and Geotrichum, preferably Pichia pastoris, Komagataella phaffii, Komagataella pastoris, Komagataella pseudopastoris, Saccharomyces cerevisiae, Ogataea minuta, Kluyveromces lactis, Kluyveromyes marxianus, Yarrowia lipolytica or Hansenula polymorpha, preferably a methylotrophic yeast; b) a cell of filamentous fungi, preferably of a genus selected from the group consisting of Aspergillus, Penicillium, Trichoderma, Neurospora, Rhizopus preferably Aspergillus niger, Aspergillus awamori Asperg
- the preferred yeast host cells are derived from methylotrophic yeast, such as from Pichia or Komagataella e.g., Pichia pastoris, or Komagataella pastoris, or K. phaffii, or K. pseudopastoris.
- yeasts such as P. pastoris. Examples of P.
- a preferred yeast host cell may be a P. pastoris or S. cerevisiae host cell, specifically wherein such host cell contains heterologous or recombinant promoter sequences, which may be derived from a P. pastoris or S. cerevisiae strain, different from the production host.
- the host cell described herein comprises a recombinant expression construct described herein comprising the promoter originating from the same genus, species or strain as the host cell.
- the cell is an engineered Mut' yeast, specifically, wherein the Mut' yeast is characterized by a phenotype with reduced growth on methanol.
- the Mut' yeast is engineered to reduce expression of a dihydroxyacetone synthase (DAS) compared to an endogenous expression thereof, in particular as compared to endogenous expression in the yeast without such engineering, preferably wherein said engineering is a deletion, knock-out or disruption of any one or both of the DAS1 and DAS2 genes.
- DAS dihydroxyacetone synthase
- the cell can be cultured in a cell culture using a cell culture medium comprising a Ci compound as a carbon source, in particular wherein the carbon source is used as a source of energy and/or as a source of producing fermentation products including e.g., expression products.
- a cell culture medium comprising a Ci compound as a carbon source, in particular wherein the carbon source is used as a source of energy and/or as a source of producing fermentation products including e.g., expression products.
- the cell is capable of utilizing a Ci compound as the sole carbon source for producing biomass and/or an expression product, such as e.g., in a cell culture.
- the invention further provides for a method of culturing the cell described herein in a cell culture using a Ci compound as a carbon source, thereby obtaining biomass and/or an expression product in the cell culture, preferably wherein the Ci compound is the sole carbon source.
- the expression product is a protein of interest (POI) or a metabolite, preferably wherein: a) the POI is a peptide or protein selected from the group consisting of an antigen-binding protein, a therapeutic protein, an enzyme, a peptide, a protein antibiotic, a toxin fusion protein, a carbohydrate - protein conjugate, a structural protein, a regulatory protein, a vaccine antigen, a growth factor, a hormone, a cytokine, a process enzyme; b) the metabolite is an amino acid, organic acid, alcohol, sugar alcohol, carbohydrate, vitamin, amine, aldehyde, ketone, or a polyhydroxyketone, preferably wherein the metabolite is selected from the group consisting of lactic acid, citric acid, propionic acid, butyric acid, valeric acid, hexanoic acid, adipic acid, succinic acid, fumaric acid, malic acid, 2,5-furan
- a recombinant expression cassette which comprises one or more heterologous nucleic acid sequence(s).
- the cell is a yest such as e.g., a methylotrophic yeast
- the expression cassette comprises a heterologous coding sequence (or gene) which is operably linked to a methanol-inducible promoter.
- a method of producing biomass and/or an expression product in a cell culture is provided, using a cell described herein and a Ci compound, preferably wherein the Ci compound is the sole carbon source in the cell culture.
- the methods described herein employ the cell as described herein. Therefore, the methods described herein are specifically characterized by the features of the cell as described herein.
- the invention further provides for a fusion protein comprising a formolase (FLS) that is fused to a peroxisomal targeting signal (PTS).
- FLS formolase
- PTS peroxisomal targeting signal
- the fusion protein is characterized by the features described as features of the FLS as used in the FLS pathway described herein.
- the fusion protein is characterized by one or more of the following features: a) the PTS is selected from the group consisting of the Minimal PTS1 sequence (“SKL”), the “PTS1 consensus sequence” (as defined herein), the “PTS2 consensus sequence” (SEQ ID NO:17), SEQ ID NO:18, 19, 20, 21 , 22, 23, 24, 25, and 26; and b) the FLS is as further described herein.
- the fusion protein comprises an FLS which is a recombinant FLS such as a functional FLS mutant of a wild-type bacterial FLS, such as an FLS of bacterial origin e.g., originating from Pseudomonas fluorescens, in particular Pseudomonas fluorescens biovarl, wherein the recombinant FLS comprises the amino acid sequence SEQ ID NO:33 wherein the amino acid sequence is engineered to incorporate one or more point mutations, preferably up to 15, 14, 13, 12, 11 , 10, 9, 8, or 7 point mutations, wherein the point mutations comprise any one or more point mutations, preferably comprising i) at least 1 , 2, 3, 4, 5, 6, or 7 (all) of any of the following mutations: A28I, W89R, L90T, R188H, A394G, G419N, or A480W; or ii) at least 1 , 2, 3, 4, 5, 6, or 7 (all) of any of the following mutations: A28
- the fusion protein comprises an FLS which comprises of consists of SEQ ID NO:31 or SEQ ID NO:32, or a functional variant of any of the foregoing, preferably wherein a) the functional variant of SEQ ID NO:31 comprises at least 1 , 2, 3, 4, 5, 6, or 7 (all) of the point mutations A28L, W89R, R188H, N283H, A394G, G419N, and A480W, preferably at least 3, 4, or 5 mutations selected from W89R, R188H, A394G, G419N, or A480W, point mutations as compared to SEQ ID NO:33, and at least at least any one of 90%, 95%, 96%, 97%, 98%, 99% or 100% sequence identity to SEQ ID NO:31 ; b) the functional variant of SEQ ID NO:32 comprises at least 1 , 2, 3, 4, 5, 6, or 7 (all) of the point mutations A28I, W89R, L90T, R188H, A394G,
- the invention further provides for a nucleic acid molecule encoding the fusion protein described herein, preferably wherein the nucleic acid molecule is codon- optimized for expression in a eukaryotic host cell.
- the nucleic acid molecule is an isolated nucleic acid molecule.
- the isolated nucleic acid molecule may be incorporated into an expression cassette or an expression construct comprising the expression cassette, such as a vector, plasmid, or artificial chromosome.
- a preferred yeast expression vector is for expression in a yeast host cell, such as selected from the group consisting of methylotrophic yeasts represented by the genera Ogataea, Hansenula, Pichia, Candida and Torulopsis.
- plasmids derived from pPICZ, pGAPZ, pPIC9, pPICZalfa, pGAPZalfa, pPIC9K, pGAPHis or pPUZZLE can be used as a vector.
- the invention further provides for an expression cassette comprising the nucleic acid molecule described herein and regulatory sequences to express the fusion protein in a eukaryotic host cell.
- the FLS fusion protein is expressed under the control of a methanolinducible promoter, preferably pDAS2 or pAOX1.
- a eukaryotic cell comprising the expression cassette described herein.
- the recombinant FLS described herein is a functional enzyme, wherein the enzymatic activity is determined by a standard colorimetric assay such as described in Example 4, or by Cai et al. (Cai et al. 2021), or by Siegel et al. (Siegel et al. 2015).
- a FLS is considered functional if a catalytic efficiency higher than 0,01 M' 1 *s' 1 can be detected.
- Figure 1 Schematic representation of the constructed and integrated gene cassettes for generation of the P. pastoris strain (A) ADAS1 , ADAS2, AAOX1 , (pDAS2::FLS::HpH r ) + and (B) strain ADAS1 , ADAS2, AAOX1 , (pAOX1 ::FLS::G418 r ).
- Figure 2 Confirmation of the FLS gene expression cassettes by PCR amplification using genomic DNA as template.
- Amplified gene fragments (1-6) SEQ3 from ADAS1 , ADAS2.AA0X1, (pDAS2::FLS::HpH r ) + , transformants, and (7-12) SEQ4 from ADAS1 , ADAS2, AA0X1 , (pAOX1 ::FLS::G418 r ) + , (M) Quick-Load 1 kb Plus DNA Ladder (New England Biolabs).
- Figure 3 (A) The growth curves of the FLS expressing P. pastoris transformants compared to the parental strain and an empty vector strain in synthetic M2 media with methanol as carbon source. (B) The logarithmic representation of the same growth curves presenting the linear fitted regions used to calculate the growth rate.
- Figure 4 Microscopic images after 40 hours of incubation in M2/methanol media (A) FLS expressing P. pastoris strain ADAS1 , ADAS2, AAOX1 , (pAOX1 ::FLS::G418 r ) + and of (B) the parental P. pastoris MUT' strain ADAS1 , ADAS2, AAOX1 at a 1000x magnification with a total field of view diameter of 200 pm.
- A FLS expressing P. pastoris strain ADAS1 , ADAS2, AAOX1 , (pAOX1 ::FLS::G418 r ) + and of (B) the parental P. pastoris MUT' strain ADAS1 , ADAS2, AAOX1 at a 1000x magnification with a total field of view diameter of 200 pm.
- SEQ ID NO:1 DNA sequence of FLS::PTS1 coding sequence
- SEQ ID NO:2 Protein sequence of FLS::PTS1 (fusion protein)
- SEQ ID NO:3 pDAS2::FLS::HpH r cassette sequenced integrated into genome of P. pastoris CBS7435 ADAS1, ADAS2, AAOX1
- SEQ ID NO:4 pAOX1 ::FLS::HpH r cassette sequenced integrated into genome of P. pastoris CBS7435 ADAS1, ADAS2, AAOX1
- SEQ ID NO:5 PCR Primer Amplification DAS1 locus, (DAS1-F)
- SEQ ID NO:6 PCR Primer Amplification DAS1 locus, (DAS1-R)
- SEQ ID NO:7 PCR Primer Amplification DAS2 locus
- DAS2-F PCR Primer Amplification DAS2 locus
- DAS2-R PCR Primer Amplification DAS2 locus
- SEQ ID NO:9 PCR Primer Amplification AOX1 locus
- AOX1-F SEQ ID NQ:10: PCR Primer Amplification AOX1 locus
- AOX1-R SEQ ID NO:11 : PCR Screening Primer for FLS cassette integration (Screening PCR-F)
- SEQ ID NO:12 PCR Screening Primer for FLS cassette integration (Screening PCR-R)
- SEQ ID NO: 13 DNA sequence of genomic locus of AAOX1 of P. pastoris CBS7435 ADAS1, ADAS2, AAOX1
- SEQ ID NO:14 DNA sequence of genomic locus of ADAS1 of P. pastoris CBS7435 ADAS1, ADAS2, AAOX1
- SEQ ID NO: 15 DNA sequence of genomic locus of ADAS2 of P. pastoris CBS7435
- SEQ ID NO:16 Fusion protein comprising a FLS (from Siegel et al. 2015) and a PTS
- SEQ ID NO:28 AOX2 from Komagataella phaffii strain ATCC 76273 / CBS 7435,
- SEQ ID NO:29 AOD1 from Candida boidinii
- SEQ ID NO:30 MOX Hansenula polymorpha
- SEQ ID NO:31 recombinant FLS of SEQ ID NO:2
- SEQ ID NO:32 recombinant FLS of SEQ ID NO:16
- SEQ ID NO:33 FLS of Pseudomonas fluorescens, NCBI accession number:
- 2AG0_A Choin A, benzaldehyde lyase
- SEQ ID NO:34 exemplary C-terminal extension, to extend SEQ ID NO:33
- SEQ ID NO:35-SEQ ID NO:44 embodiments of SEQ ID NO:17
- biomass refers to the substance obtained by isolating whole cells, fractions or extracts thereof which are obtained from a cell culture, in particular where the biomass includes carbohydrates and/or proteins.
- carbon source is herein understood as “carbon substrate” and shall mean a fermentable carbon substrate which can be used to produce organic carbon compounds, suitable as an energy source for microorganisms.
- Ci compounds can be used as a carbon source, which are inorganic or organic compounds that comprise only one carbon atom per molecule or ion.
- Exemplary Ci carbon molecules used as substrates for biomass production and other fermentation processes described herein include methane, carbon dioxide (in the gaseous or solubilized form), carbon monoxide, methanol and synthesis gas (a mixture of carbon monoxide and hydrogen), but may also be formate, or mixtures of any of the foregoing.
- a compound may be used as a sole carbon source, or else a mixture of different compounds can be used as a carbon source.
- the term “cell” with respect to an engineered cell as used herein, which comprises a metabolic pathway, which is herein also referred to as a “host cell”, shall refer to a single cell, a single cell clone, or a cell line
- cell line refers to an established clone of a particular cell type that has acquired the ability to proliferate over a prolonged period of time.
- a cell line is typically used for expressing an endogenous or recombinant nucleic acid molecule or gene, a metabolic pathway or products of a metabolic pathway, to produce fermentation products, such as biomass or expression products e.g., polypeptides (including e.g., a protein of interest, POI), nucleic acid products (for example DNA or RNA), cells and/or viruses, or cell metabolites.
- polypeptides including e.g., a protein of interest, POI
- nucleic acid products for example DNA or RNA
- viruses for example DNA or RNA
- production host cell line or “production cell line” is commonly understood to be a cell line ready-to-use for cell culture in a bioreactor to obtain fermentation products. It is well understood that the term “host cell” refers to recombinant host cells and does not include human beings. Specifically, recombinant host cells as described herein are artificial organisms and derivatives of native (wild-type) host cells.
- a host cell which is engineered as described herein, such as to incorporate and to express a FLS pathway as described herein.
- Such recombinant host cells are provided as isolated host cell and respective host cell lines or cultures. Recombinant host cells can be cultured ex vivo to produce fermentation products.
- cell culture or “culturing” as used herein with respect to a host cell refers to the maintenance of cells in an artificial, e.g., an in vitro environment, under conditions favoring growth, differentiation or continued viability, in an active or quiescent state, of the cells, specifically in a controlled bioreactor according to methods known in the industry.
- an artificial e.g., an in vitro environment
- the cells are brought into contact with the media in a culture vessel or with substrate under conditions suitable to support culturing the cells in the cell culture.
- Standard cell culture media and techniques are well-known in the art.
- Recombinant proteins or cellular metabolites can be produced using the host cell and respective cell lines described herein, by culturing in an appropriate medium, isolating the expressed product or metabolite from the culture, and optionally purifying it by a suitable method.
- the host cell described herein can be tested for its capacity to produce expression products by any of the following tests: ELISA, activity assay, capillary electrophoresis, HPLC, or other suitable tests, such as SDS-PAGE and Western Blotting techniques, or mass spectrometry.
- expression refers to expression of a polynucleotide or gene from an expression cassette, or to the expression of the respective polypeptide or protein.
- expression cassette is herein understood to refer to nucleic acid molecules (herein also referred to as polynucleotides), which contain a desired coding sequence (herein referred to as a gene), and control sequences in operable linkage, so that hosts transformed or transfected with these molecules incorporate the respective sequences and are capable of producing the expression products.
- An expression cassette may comprise one or more nucleic acid molecules (e.g., a gene) that are endogenous or heterologous to a host cell.
- the expression cassette may comprise polynucleotides encoding a metabolic pathway, such as a pathway comprising the FLS pathway described herein.
- the metabolic pathway described herein may be encoded by polynucleotides comprised in one or more expression cassettes, e.g., in one or more expression constructs comprised in the host cell.
- An expression cassette can be engineered ex vivo or in vivo.
- an expression cassette can be engineered as part of a vector or an artificial expression construct that can be provided as isolated expression construct that is engineered using in vitro techniques, and optionally incorporated in a cell culture.
- the recombinant FLS expression cassette described herein comprises the FLS encoding nucleic acid molecule operably linked to one or more regulatory elements that control expression of the FLS.
- the expression cassette expressing the recombinant FLS described herein comprises one or more regulatory elements not natively associated with the gene encoding the FLS.
- Such expression cassettes comprising one or more elements that are not natively associated with a gene, are herein understood as artificial or recombinant expression cassettes.
- An element of an expression cassette that is “not natively associated with” such gene is typically not operably linked to such gene in a wild-type host cell, whereas the wild-type host cell is endogenously expressing such gene with regulatory elements that are “natively associated with” such gene.
- a wild-type host cell is herein understood to be a naturally-occurring host cell that is not recombined by any artificial means.
- One or more expression cassettes are herein also understood as “expression system”.
- the expression system may be included in an expression construct, such as an artificial heterologous expression cassette, a vector and in particular a plasmid.
- the relevant DNA of an expression cassette or construct may also be integrated into a host cell chromosome. Expression may refer to secreted or non-secreted expression products, including polypeptides or metabolites.
- Expression cassettes are conveniently provided as expression constructs e.g., in the form of “vectors” or “plasmids”, which are typically DNA sequences that are required for the transcription of cloned recombinant nucleotide sequences i.e., of recombinant genes and the translation of their mRNA in a suitable host organism.
- Expression vectors or plasmids usually comprise an origin for autonomous replication or a locus for genome integration in the host cells, selectable markers (e.g., an amino acid synthesis gene or a gene conferring resistance to antibiotics such as zeocin, kanamycin, G418 or hygromycin, nourseothricin), a number of restriction enzyme cleavage sites, a suitable promoter sequence and a transcription terminator, which components are operably linked together.
- selectable markers e.g., an amino acid synthesis gene or a gene conferring resistance to antibiotics such as zeocin, kanamycin, G418 or hygromycin, nourseothricin
- Plasmid and vector as used herein include autonomously replicating nucleotide sequences as well as genome integrating nucleotide sequences, such as artificial chromosomes e.g., a bacterial artificial chromosome (BAC) or yeast artificial chromosome (YAC).
- artificial chromosomes e.g., a bacterial artificial chromosome (BAC) or yeast artificial chromosome (YAC).
- Expression vectors may include but are not limited to cloning vectors, modified cloning vectors and specifically designed plasmids.
- Preferred expression vectors described herein are expression vectors suitable for expressing a recombinant gene in a eukaryotic host cell and are selected depending on the host organism.
- Appropriate expression vectors typically comprise regulatory sequences suitable for expressing DNA encoding polypeptide or protein of interest in a eukaryotic host cell. Examples of regulatory sequences include promoter, operators, enhancers, ribosomal binding sites, and sequences that control transcription and translation initiation and termination. The regulatory sequences are typically operably linked to the DNA sequence to be expressed.
- a promoter sequence is typically regulating and initiating transcription of the downstream nucleotide sequence, with which it is operably linked.
- An expression cassette or vector typically comprises a promoter nucleotide sequence which is adjacent to the 5’ end of a coding sequence, e.g., upstream from and adjacent to the coding sequence (e.g., gene of interest) or if a signal or leader sequence is used, upstream from and adjacent to said signal and leader sequence, respectively, to facilitate translation initiation and expression of coding sequences to obtain the expression product.
- adjacent is meant to refer to a distance between elements of a nucleic acid sequence which is “within about 10bp” or “within about 5 bp” of the nucleic acid sequence.
- Specific expression cassettes described herein comprise a promoter operably linked to a coding nucleotide sequence.
- the promoter can be used to control expression of a coding sequence, wherein the promoter is not natively associated with the coding sequence.
- Specific expression constructs described herein comprise a coding sequence linked with a leader sequence (e.g., a secretion signal peptide sequence (presequence), a pro-sequence, or a pre-pro-sequence), which causes transport of the expressed polypeptide or protein into the secretory pathway and/or secretion from the host cell.
- a leader sequence e.g., a secretion signal peptide sequence (presequence), a pro-sequence, or a pre-pro-sequence
- the presence of such a secretion leader sequence in an expression cassette or construct is typically required when the polypeptide or protein intended for recombinant expression is not naturally secreted by the host cell and therefore lacks a natural secretion leader sequence, or its nucleotide sequence has been cloned without its natural secretion leader sequence.
- multicloning vectors may be used, which are vectors having a multicloning site.
- one or more desired polynucleotides can be integrated or incorporated at a multicloning site to prepare an expression vector.
- a promoter is typically placed upstream of the multicloning site to control expression of said one or more desired polynucleotides.
- gene expression means to encompass at least one step selected from the group consisting of DNA transcription into mRNA, mRNA translation and processing, mRNA maturation, mRNA export, protein folding and/or protein transport, thereby obtaining expression products, precursors or intermediates thereof.
- nucleotide refers to nucleotides, either ribonucleotides or deoxyribonucleotides or a combination of both, in a polymeric unbranched form of any length.
- a polynucleotide refers to deoxyribonucleotides in a polymeric unbranched form of any length.
- nucleotides consist of a pentose sugar (deoxyribose), a nitrogenous base (adenine, guanine, cytosine or thymine) and a phosphate group.
- endogenous as used herein is meant to include those molecules and sequences, in particular genes or proteins, which are present in a wild-type (native, not recombinant) host cell or expressed by such wild-type host cell, thereby “endogenous” to said wild-type host cell.
- an endogenous nucleic acid molecule e.g., a gene
- protein that does occur in (and originates from, or can be obtained from) a particular host cell as it is found in nature, is understood to be “host cell endogenous” or “endogenous to the host cell”.
- a cell “endogenously expressing” a nucleic acid or protein expresses that nucleic acid or protein as does a host of the same particular type as it is found in nature.
- a host cell “endogenously producing” or that “endogenously produces” a nucleic acid, protein, or other compound produces that nucleic acid, protein, or compound as does a host cell of the same particular type as it is found in nature.
- An endogenous protein can be overexpressed such as e.g., in an engineered host cell comprising an artificial expression cassette that expresses the protein at a higher level as compared to the level expressed by a respective wild-type host cell without such engineering. Even if a protein is no more produced by a host cell, such as e.g., a knockout mutant of the host cell, where the protein encoding gene is inactivated or deleted, the protein is herein still referred to as “endogenous”.
- heterologous refers to a compound which is either foreign to a given host cell, i.e. “exogenous”, such as not found in nature in said host cell; or that is naturally found in a given host cell, e.g., is “endogenous”, however, in the context of a heterologous construct or integrated in such heterologous construct, e.g., employing a heterologous nucleic acid fused or in conjunction with an endogenous nucleic acid, thereby rendering the construct heterologous.
- heterologous nucleotide sequence as found endogenously may be produced in an unnatural, e.g., greater than expected or greater than naturally found, amount in the cell.
- the heterologous nucleotide sequence, or a nucleic acid comprising the heterologous nucleotide sequence possibly differs in sequence from the endogenous nucleotide sequence but may still encode the same protein as found endogenously.
- heterologous nucleotide sequences are those not found in the same relationship to a host cell in nature. Any recombinant or artificial nucleotide sequence is understood to be heterologous.
- heterologous polynucleotide is a fusion product of a nucleotide sequence encoding an FLS that is fused to a PTS coding nucleotide sequence to which the respective endogenous, naturally-occurring FLS coding sequence is not normally fused, or a nucleotide sequence operably linked to a promoter to which the respective endogenous, naturally-occurring nucleotide sequence is not normally operably linked.
- FLS formaldehyde
- An FLS may originate from a natural source, or can be a functional mutant thereof.
- an FLS can be a computationally-designed enzyme derived from benzaldehyde lyase (BAL).
- the FLS may comprise or consist of any one of SEQ ID NO:31 or SEQ ID NO:32, or be a functional variant of any of the foregoing.
- the FLS may be provided as a derivative of any of the foregoing e.g., a fusion protein comprising the FLS sequence and a PTS.
- methanol to formaldehyde converting enzyme abbreviated “MFCE”, as used herein shall refer to an “alcohol oxidase”, abbreviated “AOX”, or “methanol oxidase”, or “methanol dehydrogenase”, and shall refer to an enzyme which catalyzes conversion of methanol into formaldehyde. Unlike alcohol dehydrogenases, alcohol oxidases are unable to catalyze the reverse reaction. Reference is made to EC 1.1.3.13 (alcohol oxidase) and EC 1.1.1.1 (alcohol dehydrogenase).
- An MFCE may originate from a natural source, such as originating from a methylotrophic yeast, or can be a functional mutant thereof.
- an MFCE can be a computationally-designed enzyme derived from a functional oxidoreductase or dehydrogenase.
- the MFCE may comprise or consist of any one of SEQ ID NO:27, 28, 29 or 30, or be a functional variant of any of the foregoing.
- a “functional mutant” or “functional variant” of an enzyme like FLS or MFCE, as used herein shall mean a variant in which at least one amino acid among wild-type enzyme amino acids is substituted, inserted, removed or modified, preferably wherein the mutation is a limited number of point mutations (a point mutation being the substitution, insertion, removal or modification of only one amino acid in a sequence), such as up to 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 point mutation(s), and/or wherein the enzyme mutant comprises a certain sequence identity to the wild-type sequence, such as at least any one of 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% sequence identity.
- isolated as used herein with respect to a polynucleotide, nucleic acid molecule, expression cassettes or constructs, or host cells shall refer to such compound that has been sufficiently separated from the environment with which it would naturally be associated. Yet, “isolated” does not necessarily mean the exclusion of artificial or synthetic mixtures with other compounds or materials, or the presence of impurities that do not interfere with the fundamental activity, and that may be present, for example, due to incomplete purification. Isolated compounds can be further formulated to produce preparations thereof, and still for practical purposes be isolated. For example, an isolated expression product can be mixed with pharmaceutically acceptable carriers or excipients when used in diagnosis or therapy.
- metabolic refers to a product of metabolic reactions catalyzed by one or more enzymes of a metabolic pathway of a host cell incorporating the pathway, and may include reactant, product and cofactor molecules of said enzymes. Metabolites may arise in the same pathway(s) as the cell metabolic pathway or pathways encoding an enzyme which catalyzes the synthesis of the cell growth and/or productivity inhibitor or intermediate thereof or may be synthesized in a branching pathway.
- operably linked refers to the association of nucleotide sequences on a single nucleic acid molecule, e.g., a vector, an expression cassette or an expression construct, in a way such that the function of one or more nucleotide sequences is affected by at least one other nucleotide sequence present on said nucleic acid molecule.
- a nucleic acid sequence is placed into a functional relationship with another nucleic acid sequence on the same nucleic acid molecule.
- a promoter and/or a gene switch is operably linked with a coding sequence of a recombinant gene, when it is capable of effecting the expression of that coding sequence.
- nucleic acids operably linked to each other may be directly linked i.e., immediately linked, meaning without additional elements or nucleic acid sequences in between the nucleic acids.
- a suitable linking sequence can be used such as e.g., a cloning site positioned between the promoter and/or gene switch and a gene of interest e.g., a target gene.
- a promoter sequence is typically understood to be operably linked to a coding sequence, if the promoter controls the transcription of the coding sequence. If a promoter sequence is not natively associated with the coding sequence, its transcription is either not controlled by the promoter in native (wild-type) cells or the sequences are recombined with different contiguous sequences.
- promoter as used herein is meant to refer to a nucleic acid molecule that initiates, regulates, or otherwise mediates or controls the expression of a protein coding polynucleotide (DNA).
- Promoter DNA and/or coding DNA may be native DNA such as occurring or originating from a wild-type cell e.g., from the same gene or from different genes, or may be from the same or different organisms, but can be recombinant or artificial.
- a heterologous promoter may be heterologous to a polynucleotide to be expressed and/or an artificial promoter, or a promoter that is originating from the wild-type host cell, but positioned in the host cell genome within a heterologous expression cassette or positioned at a location where it is not naturally- occurring in the wild-type host cell.
- promoters are regulatable e.g., inducible or repressible by a compound, or constitutive.
- Exemplary promoters which can be used for the purpose described herein to express the recombinant FLS in the eukaryotic host cell are selected from the group consisting of methanol-inducible promoters, such as the A0X1 or A0X2 promoter, or a constitutive promoter such as MDH3, POR1 , PDC1 , FBA1-1 , or GPMI (Prielhofer et al. 2017, BMC Sys Biol. 11 : 123), or a functional variant of any of the foregoing.
- methanol-inducible promoters such as the A0X1 or A0X2 promoter
- a constitutive promoter such as MDH3, POR1 , PDC1 , FBA1-1 , or GPMI (Prielhofer et al. 2017, BMC Sys Biol. 11 : 123), or a functional variant of any of the foregoing.
- native or wild-type promoter sequences functional variants of such native or wild-type promoter sequences (herein understood as parent promoters) can be used, which have at least any one of 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% sequence identity and are functional in controlling the expression of a gene in substantially similar way, e.g. being an inducible promoter or constitutive promoter as the parent promoter, and preferably having about the same or increased promoter strength.
- a functional variant of a promoter described herein comprises a certain sequence identity to the promoter from which it is derived, over the full-length or the part at the 3’-end of the promoter sequence which part has a length of at least 300, 400, or 500 bp, and which is functional to operatively control expression of the polynucleotide to be expressed, in particular with about the same promoter activity (e.g. +/- any one of 50%, 40%, 30%, 20%, or 10%), although the promoter activity may be increased to higher than 100% as compared to the promoter from which it is derived.
- the promoter activity can be determined by measuring the strength of the promoter.
- the promoter strength is typically understood as the transcription strength.
- the transcription strength of a promoter such as by measuring the quantity of transcripts, e.g., employing a microarray, or else in a cell culture, such as by measuring the quantity of respective gene expression products in recombinant cells.
- the transcription rate may be determined by the transcription strength on a microarray, Northern blot or with quantitative real time PCR (qRT-PCR) or with RNA sequencing (RNA-seq).
- a “methanol-inducible promoter” is herein understood as a naturally occurring or wild-type promoter controlling expression of genes of the methanol dissimilatory pathway of organisms, in particular methylotrophic microorganisms.
- expression of the recombinant FLS may be driven by a methanol-inducible promoter, such as the A0X1 or A0X2 promoter, or a functional variant of any of the foregoing.
- methanol passively diffuses into the yeast peroxisome. There it is converted to formaldehyde by one of two different alcohol oxidase isozymes (Aox1 , Aox2).
- formaldehyde can be further oxidized in several steps to CO2 via the methanol dissimilatory pathway.
- formaldehyde is incorporated into the pentose phosphate pathway via a condensation reaction with xylulose 5-phosphate, a reaction catalyzed by a specialized transketolase enzyme called DiHydroxyAcetone Synthase (Das).
- This reaction yields a molecule of dihydroxyacetone (DHA) and a molecule of glyceraldehyde 3-phosphate. Each of these reactions occurs in peroxisomes in methylotrophic yeasts.
- formaldehyde is directly converted to DHA by the recombinant FLS in an engineered cell as described herein.
- nucleic acid sequence refers to either DNA or RNA.
- Nucleic acid sequence or “polynucleotide sequence” or simply “polynucleotide” refers to a single or double-stranded polymer of deoxyribonucleotide or ribonucleotide bases read from the 5' to the 3' end. It includes expression cassettes, self-replicating plasmids, infectious polymers of DNA or RNA, and non-functional DNA or RNA.
- PTS peroxisomal targeting signal
- a functional PTS is an amino acid sequence which is specifically recognized by any of the receptors Pex5 (PTS1) or Pex7 (PTS2), thereby activating the receptor and directing expression of the gene that is fused with such PTS to the host cell peroxisome.
- a nucleotide sequence encoding the PTS1 is typically linked to a gene at the 3’- end, such that the PTS is fused at the carboxy terminus of the respective gene expression product.
- the C-terminus of the amino acid sequence of the gene expression product is directly linked to the N-terminus of the PTS.
- a nucleotide sequence encoding the PTS2 is typically linked to a gene at the 5’- end or integrated in proximity to the 5’-end, such that the PTS is fused at the amino terminus or close to the amino terminus of the respective gene expression product.
- the N-terminus of the amino acid sequence of the gene expression product is directly linked to the C-terminus of the PTS2.
- PTS1 predictor Nemt al.
- Eisenhaber B Hartig A
- Eisenhaber F Motif refinement of the peroxisomal targeting signal 1 and evaluation of taxon-specific differences. J Mol Biol. 2003 May 2;328(3):567-79.
- PTS prediction tool WoLF PSORT Horton P, Park K-J, Obayashi T et al. WoLF PSORT: protein localization predictor. Nucleic Acids Res 2007;35:W585-7.
- protein of interest refers to a polypeptide or a protein that is produced by means of recombinant technology in a host cell. More specifically, the protein may either be a polypeptide not naturally-occurring in the host cell, i.e. a heterologous protein, or else may be native to the host cell, i.e.
- sequence identity of a variant, homologue or orthologue as compared to a parent nucleotide or amino acid sequence indicates the degree of identity of two or more sequences.
- Two or more amino acid sequences may have the same or conserved amino acid residues at a corresponding position, to a certain degree, up to 100%.
- Two or more nucleotide sequences may have the same or conserved base pairs at a corresponding position, to a certain degree, up to 100%.
- Sequence similarity searching is an effective and reliable strategy for identifying homologs with excess (e.g., at least 50%) sequence identity. Sequence similarity search tools frequently used are e.g., BLAST, FASTA, and HMMER.
- Sequence similarity searches can identify such homologous proteins or genes by detecting excess similarity, and statistically significant similarity that reflects common ancestry.
- Homologues may encompass orthologues, which are herein understood as the same protein in different organisms, e.g., variants of such protein in different different organisms or species.
- Percent (%) amino acid sequence identity with respect to an amino acid sequence, homologs and orthologues described herein is defined as the percentage of amino acid residues in a candidate sequence that are identical with the amino acid residues in the specific polypeptide sequence, after aligning the sequence and introducing gaps, if necessary, to achieve the maximum percent sequence identity, and not considering any conservative substitutions as part of the sequence identity.
- Those skilled in the art can determine appropriate parameters for measuring alignment, including any algorithms needed to achieve maximal alignment over the full length of the sequences being compared.
- sequence identity between two amino acid sequences can be determined using the NCBI BLAST program version BLASTP 2.8.1 with the following exemplary parameters: Program: blastp, Word size: 6, Expect value: 10, Hitlist size: 100, Gapcosts: 11.1 , Matrix: BLOSUM62, Filter string: F, Compositional adjustment: Conditional compositional score matrix adjustment.
- EMBOSS Needle webserver For pairwise protein sequence alignment of two amino acid sequences along their entire length the EMBOSS Needle webserver can be used with default settings (Matrix: EBLOSUM62; Gap open:10; Gap extend: 0.5; End Gap Penalty: false; End Gap Open: 10; End Gap Extend: 0.5). EMBOSS Needle uses the Needleman-Wunsch alignment algorithm to find the optimum alignment (including gaps) of the two input sequences and writes their optimal global sequence alignment to file.
- Percent (%) identity with respect to a nucleotide sequence e.g., of a coding or non-coding nucleotide sequence, is defined as the percentage of nucleotides in a candidate nucleotide sequence that is identical with the nucleotides in the nucleotide sequence, after aligning the sequence and introducing gaps, if necessary, to achieve the maximum percent sequence identity, and not considering any conservative substitutions as part of the sequence identity. Alignment for purposes of determining percent nucleotide sequence identity can be achieved in various ways that are within the skill in the art, for instance, using publicly available computer software. Those skilled in the art can determine appropriate parameters for measuring alignment, including any algorithms needed to achieve maximal alignment over the full length of the sequences being compared.
- sequence identity between two amino acid sequences can be determined using the NCBI BLAST program version BLASTN 2.8.1 with the following exemplary parameters: Program: blastn, Word size: 11 , Expect threshold: 10, Hitlist size: 100, Gap Costs: 5.2, Match/Mismatch Scores: 2,-3, Filter string: Low complexity regions, Mark for lookup table only.
- a “recombinant” as used herein shall mean “being prepared by or the result of genetic engineering.
- a “recombinant cell” or “recombinant host cell” is herein understood as a cell or host cell that has been genetically engineered or modified to comprise a nucleic acid sequence which was not native to said cell.
- a recombinant host may be engineered to delete and/or inactivate one or more nucleotides or nucleotide sequences, and may specifically comprise an expression vector or cloning vector containing a recombinant nucleic acid sequence, in particular employing nucleotide sequence foreign to the host.
- a recombinant protein is produced by expressing a respective recombinant nucleic acid in a host.
- the term “recombinant” as used herein with respect to expression products includes those compounds that are prepared, expressed, created or isolated by recombinant means, such as isolated from a host cell transformed or transfected to express the expression products.
- Certain recombinant host cells are “engineered” host cells which are understood as host cells which have been manipulated using genetic engineering, i.e. by human intervention.
- a host cell is engineered to express, co-express or overexpress a certain pathway, gene or the respective protein
- the host cell is manipulated such that the host cell has the capability to express such pathway, gene and protein, respectively, to a different extent compared to the host cell under the same condition prior to manipulation, or compared to the host cells which are not engineered such that said gene or protein is expressed, co-expressed or overexpressed.
- synthetic as used herein in the context with a synthetic pathway shall mean a pathway that is recombinant.
- methanol assimilation pathway Xylulose-Monophosphate cycle, XuMP
- Xylulose-Monophosphate cycle XuMP
- this pathway requires at least 10 enzymes to enable the conversion from methanol to DHA.
- three (3) moles of ATP are required for the conversion of three moles of (3) formaldehyde to one (1) mol of Dihydroxyacetone phosphate (DHAP).
- DHAP Dihydroxyacetone phosphate
- dihydroxyacetone phosphate is thus not as energy efficient as the new FLS pathway described herein which can directly form dihydroxyacetone from three (3) moles of formaldehyde. Then only one (1) mol of ATP is required (instead of three mol ATP) to form dihydroxyacetone phosphate, which is an intermediate of glycolysis and thus of the central carbon metabolism. Therefore, a recombinant pathway was implemented into eukaryotic cells, in particular microorganisms, to enable a formaldehyde conversion into biomass via the formose reaction. This can enable an energy efficient conversion of Ci building blocks into carbon molecules with multiple C-C bonds.
- an FLS gene (version FLS-M3) as published by (Cai et al. 2021) was codon optimized for Komagataella phaffii and a PTS1 sequence was added to the C-terminus to enable targeting to peroxisomes of K. phaffii.
- the coding sequence was placed under the control of a strong methanol inducible promoter (either pAOX1 or pDAS2) and expression cassettes were integrated into a K. phaffii strain which has a deletion in DAS1 and DAS2 (Mut- strain).
- a Mut' strain is not capable to grow on methanol as carbon source.
- the presence of the expression cassette can be checked by for instance PCR approaches. Obtained strains can grow on medium with methanol as single carbon source and show formolase activity in a cell-free extract.
- the formolase pathway described herein enables a more efficient utilization of a Ci (one carbon) carbon source like methanol.
- Methanol is an important carbon source to grow methylotrophic yeasts like Pichia pastoris ( Komagataella sp.) to produce mainly proteins and enzymes.
- the production of single cell proteins for food and feed applications is in the industrial focus again.
- With a more efficient methanol assimilation a higher yield of the product formed can be achieved (e.g., proteins, organic acids, DNA, RNA, lipids, biomass, ).
- Cooling bioreactors to e.g., 30°C is cost intensive when an exothermic reaction is taking place.
- Formaldehyde is a reactive molecule, which is in many cases cytotoxic to microorganism when formed intracellularly at higher concentrations or without specialized compartmentalization in the cell.
- an intracellular compartment of the cell is utilized (or generated), where both formaldehyde is formed and converted by formolase to dihydroxyacetone.
- biochemical reactions leading to the formation of formaldehyde are placed into the same compartment as the formose reaction carried out by the enzyme formolase (FLS) to form dihydroxyacetone from formaldehyde.
- a compartment is a subcellular container, which is spatially separating its interior from the surrounding environment by a biochemical barrier.
- This biochemical barrier can be a lipid membrane consisting of phospholipids (e.g., organelles like peroxisomes, mitochondria, vacuole, endoplasmic reticulum) or an artificial shell formed by protein structures like in bacterial microcompartments.
- the FLS pathway as described herein is using formaldehyde as a substrate or intermediate.
- the FLS pathway is spatially localized to the same subcellular compartment like the peroxisome. Doing so other cellular compounds like DNA, RNA or proteins are not modified by formaldehyde in an untargeted way.
- Example 1 Construction of FLS coding sequence with a peroxisomal targeting sequence
- FLS-M3 (Cai et al., 2021) was targeted to the peroxisome of Pichia pastoris by fusing a peroxisomal targeting sequence to the C-terminus of the protein.
- the targeting sequence derived from the C-terminus of the native AOX1 of P. pastoris (represented in the protein dodecamer: LGTYEKTGLARF, SEQ ID NO:19, termed PTS1 here) was fused to the C- terminus of FLS (protein sequence: SEQ ID NO:2) yielding FLS::PTS1.
- the respective protein sequence was codon optimized for P.
- DNA sequence was ordered at TWIST Bioscience (South San Francisco, CA, USA) (DNA sequence: SEQ ID NO:1).
- other targeting signals for peroxisomal localization can be added to the protein, which targets the PEX5 receptor (mainly PTS1) or like PTS2 (via PEX5 or PEX7 receptor) or PTS3 (Kempinski et al., 2020). Proteins showing a peroxisomal localization were reported previously together with respective targeting signals (RuRmayer et al., 2015).
- the coding sequence was ordered with flanking Bpil (Bbsl) sites and respective fusion sites (FS2, FS3) to be compatible with a Golden Gate cloning pipeline GoldenMOCS described previously (Sarkari et al., 2017).
- the sequencing results (SEQ ID NO:13, SEQ ID NO:14, and SEQ ID NO:15) confirmed the deletion of the mentioned genes in the genome of the parental Pichia pastoris MUT' strain. This deletion strain cannot grow on methanol.
- the methanol assimilation pathway (Xylulose- Monophosphate cycle) is blocked when both DAS1/DAS2 are deleted (Gassier et al., 2020).
- the fusion protein FLS::PTS1 was expressed using a strong inducible methanol promoter using pDAS2 or pAOX1 by integrating respective expression cassettes into the genome.
- the integration cassettes ( Figure 1) were constructed using a Golden gate cloning strategy GoldenMOCS and transformed to electro-competent P. pastoris ADAS1 , ADAS2, AA0X1 cells. 5 pg DNA was transformed to each competent cell. The final incubated cells were plated on YPD-Agar plates having 500 pg mL' 1 G418 or 200 pg mL -1 Hygromycin as antibiotic selection markers.
- the transformants that grew on both antibiotic (Hygromycin or G418) and methanol plates were selected.
- a cell bank of the selected strain was prepared, in 1 mL aliquots and stored at -80°C.
- the extraction of the genomic DNA in order to check the integration of the genes of interest in the genome was done by breaking the cells down using glass beads in a Bead ruptor device (FastPrep-24, MP Biomedical, 6 m/s) and genomic DNA was isolated using an ethanol precipitation protocol (Promega kit).
- the SEQ ID NO:3 and SEQ ID NO:4 were amplified from yeast genomic DNA in PCR reactions using OneTaq DNA polymerase (NEB) with SEQ ID NO:11 and SEQ ID NO:12 as the primer oligonucleotides.
- the DNA fragments sized 4.6 kb for SEQ ID NO:3 (pDAS2::FLS::HpH r ) and 4.3 kb for SEQ ID NO:4 (pAOX1 ::FLS::G418 r ) were expected.
- the presence of the mentioned fragments was checked on 1 % Agarose gel ( Figure 2).
- the positive P. pastoris transformants having SEQ ID NO:4 ( Figure 2, Lane 1-6) or SEQ ID NO:5 ( Figure 2, Lane 7,9-12) integrated into their genomes were selected and used for further experiments.
- Example 3 Cultivation of FLS expressing yeast strains on synthetic media containing methanol
- Strains expressing FLS can grow on methanol as sole carbon source. No growth is obtained for an empty vector control and the parental MUT' strain on methanol. From Figure 4 it can be seen that strains expressing FLS show a dense and budding yeast culture on methanol under the microscope compared to the not growing parental strain.
- YPD Media 1 % yeast extract, 2% peptone, 2% glucose.
- Synthetic Media (M2 Citrate buffered): Per liter; 3.15 g (NH4)2HPO4, 0.49 g MgSO4.7H20 , 0.80 g KCI, 0.0268 g CaCl2.2H2O, 22.0 g citric acid monohydrate, 1.47 mL Trace salt (PTM0), 4 mL biotin (0.1 g L' 1 ), 4 mL thiamine-hydrochloride (5 g L' 1 ). The pH of the media was adjusted at 6 by adding potassium hydroxide pellets.
- Trace salt solution PTM0: Per liter; 5 mL sulphuric acid (95%-98%), 65 g FeSO4.7H 2 O, 20 g ZnCI 2 , 6 g CuSO4.5H 2 O, 3.36 g MnSO4.H 2 O, 0.82 g COCI2.6H2O, 0.2 g Na 2 MoO4.2H 2 O, 0.08 g Nal, 0.02 g H3BO3.
- the enzymatic assay was done by a colorimetric assay protocol (Cai et al., 2021). After cultivation in synthetic media, the yeast cell pellets were collected and resuspended in (HEPES-NaCI buffer, 100 mM pH 7.5). The cells were broken down by adding glass beads in a bead ruptor device (4 m/s). The cell free extract containing FLS enzyme was separated from the cell debris. The enzyme activity assay was carried out in microtiter plates. 50 mL of formaldehyde solution (ranging from 20 to 100 mM) containing 1 mM thiamine pyrophosphate (TPP) was added to 50 pL of cell free extract dilutions.
- formaldehyde solution ranging from 20 to 100 mM
- TPP 1 mM thiamine pyrophosphate
- RuRmayer H., Buchetics, M., Gruber, C., Valli, M., Grillitsch, K., Modarres, G., Guerrasio, R., Klavins, K., Neubauer, S., Drexler, H., Steiger, M., Troyer, C., Al Chalabi, A., Krebiehl, G., Stanford, D., Zellnig, G., Daum, G., Graf, A.B., Altmann, F., Koellensperger, G., Hann, S., Sauer, M., Mattanovich, D., Gasser, B., 2015. Systems-level organization of yeast methylotrophic lifestyle. BMC Biology, Vol. 13(80), 1-25.
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Genetics & Genomics (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Mycology (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Medicinal Chemistry (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
A eukaryotic cell which is engineered to express a synthetic formolase (FLS) pathway comprising a recombinant FLS to biotransform formaldehyde into dihydroxyacetone.
Description
SYNTHETIC FQRMQLASE PATHWAY
FIELD OF THE INVENTION
The invention relates to eukaryotic cells engineered to express a synthetic formolase (FLS) pathway, in particular wherein the FLS pathway is comprised in an organelle of the eukaryotic cell, methods of producing biomass and/or expression products of the eukaryotic cell in a cell culture using a Ci (one carbon) compound as a carbon source, and fusion proteins comprising an FLS that is fused to a peroxisomal targeting signal (PTS), to target expression of the FLS into the cell’s peroxisome.
BACKGROUND OF THE INVENTION
Methanol plays a crucial role in a future methanol economy. Currently methanol is produced from fossil resources but “green-methanol” or “bio-methanol” can be directly produced from CO2. As a liquid under ambient conditions, methanol has the advantage to be more easily transported and stored than molecular hydrogen. It is already used as a bulk chemical and multiple processing routes are available. It can serve as fuel and in addition it can be utilized as a carbon source by eukaryotic cells, Ribulose Monophosphate cycle, Dihydroxyacetone cycle, ribulose bisphosphate cycle or the Xylulose Monophosphate cycle. All natural pathways consume high energy (in form of ATP or NAD(P)H). Komagataella phaffii (herein also referred to as Pichia pastoris) is an industrially used methylotrophic yeast and it can grow on methanol as a single carbon source to high biomass densities.
The methanol assimilation pathway (Xylulose-Monophosphate cycle, XuMP) has similarities to the Calvin-Benson-Bassham cycle and the pentose-phosphate pathway. However, this pathway requires at least 10 enzymes to enable the conversion from methanol to dihydroxyacetone (DHA). In addition, three (3) moles of ATP are required for the conversion of three moles of (3) formaldehyde to one (1) mol of Dihydroxyacetone phosphate (DHAP). In order to produce ATP a significant amount of methanol has to be dissimilated by P. pastoris to CO2 obtaining NADH and thus ATP via the respiratory chain.
Siegel et al. (2015) describe a computationally designed FLS for use in a metabolic pathway using purified enzymes.
US2013196359A1 discloses an in vitro method for converting formaldehyde to dihydroxyacetone by a formolase.
Wang et al. (Bioresour. Bioprocess. 2017, 4:41) describe biological conversion of methanol by evolved Escherichia coli carrying a linear methanol assimilation pathway. The linear pathway consists of two enzymatic reactions: oxidation of methanol into formaldehyde by methanol dehydrogenase and carboligation of formaldehyde into DHA by formolase (FLS). E. coli expressing recombinant FLS could not initiate growth with methanol as the sole carbon source. Methanol assimilation was improved by adaptive evolution to provide for FLS mutants.
Cai et al. (2021) and WO2022042427A1 disclose cell-free chemoenzymatic starch synthesis from carbon dioxide. A mutant of the formolase enzyme (FLS-M3) was described.
CN113151230A discloses FLS mutants.
CN113122525A discloses mutant formaldehyde-converting enzymes and expression of said mutants in a bacterium.
CN110438169A discloses a method for catalytic synthesis of 1-hydroxy-2- butanone in bacterial cells using formaldehyde and propionaldehyde as raw materials and an engineered E. coli strain containing formaldehyde lyase (FLS).
Van der Klei et al. (2006) review the role of peroxisomes in methanol metabolism of methylotrophic yeasts.
CN107475281 A discloses E. coli engineered to comprise a methanol metabolism pathway including methanol dehydrogenase and formaldehyde lyase FLS.
US20200181629A1 discloses a yeast comprising a synthetic Calvin cycle wherein a ribulose-bisphosphate carboxylase (RuBisCO) gene and ribulose phosphate kinase (PRK) gene are fused with a peroxisomal targeting signal (PTS).
Gassier et al. (2020) describe a Pichia pastoris strain which is an autotroph capable of growth on CO2. A CO2 assimilation pathway was engineered into P. pastoris when targeted to the peroxisome of the yeast.
RuRmayer et al. (2015) describe systems-level organization of yeast methylotrophic lifestyle. The authors analyzed the regulation patterns of 5,354 genes, 575 proteins, 141 metabolites, and fluxes through 39 reactions of P. pastoris comparing growth on glucose and on a methanol/glycerol mixed medium, respectively. It was found that the entire methanol assimilation pathway is localized to peroxisomes rather than
employing part of the cytosolic pentose phosphate pathway for xylulose-5-phosphate regeneration.
KR20210038805A discloses methylotrophic bacteria with enhanced 1 ,2- propylene glycol (propanediol; PDO) productivity and a 1 ,2-PDO production method using the same.
WO201 2037413A2 discloses systems, compounds and methods for the conversion of C1 carbon compounds to higher carbon compounds useful for the generation of commodity compounds.
WO2012170292A1 discloses modulation of metabolic pathways for improving bioprocess performance and secreted protein productivity of yeast. An isolated fungal host cell, such as Pichia pastoris, lacking full wild-type levels of dihydroxyacetone synthase activity due to a knocked- out DAS1 or DAS2, is provided. The host cell further comprises a heterologous polynucleotide encoding a heterologous polypeptide. Methods for expressing the heterologous polypeptide in such cells, and purifying the expressed polypeptide are also disclosed.
Novel strategies are needed to address current challenges in energy storage and carbon sequestration. In particular, there is a need for new biosynthesis methods to produce biomass or expression products in a eukaryotic cell culture.
SUMMARY OF THE INVENTION
It is the objective to provide a new metabolic pathway for carbon fixation in eukaryotic cells. It is a particular objective to provide for cell cultures capable of using a Ci compound as a carbon source to effectively produce biomass or expression products including e.g., recombinant proteins or cellular metabolites. It is a further objective to engineer cells for producing higher yields of biomass or expression products.
The objective is solved by the subject matter as claimed and as further described herein.
The invention provides for a eukaryotic cell which is engineered to express a synthetic formolase (FLS) pathway comprising a recombinant FLS to biotransform formaldehyde into dihydroxyacetone (DHA).
DHA is herein understood to refer to DHA in its unphosphorylated form or its phosphorylated form, dihydroxyacetone phosphate (DHAP). DHA is also known as 1 ,3- dihydroxypropan-2-one, 1 ,3-dihydroxypropanone, or glycerone.
Specifically, the biotransformation comprises biosynthesis of dihydroxyacetone from formaldehyde through enzymatic reaction with the FLS which converts formaldehyde to dihydroxyacetone.
Specifically, the FLS is a lyase which catalyzes the carboligation of three one- carbon formaldehyde molecules into one three-carbon dihydroxyacetone molecule. Herein described is the recombinant expression of an FLS gene that was introduced into eukaryotic cells, in particular into peroxisomes of a methylotrophic yeast, to replace the natural Xylulose Monophosphate pathway of Pichia pastoris (Komagataella phaffii). To achieve targeting to the peroxisomes, a peroxisome targeting signal (PTS) was fused to the FLS. Cells containing this pathway were able to grow on methanol as carbon source.
It was found that a eukaryotic cell that expresses a recombinant FLS comprises the new FLS pathway. Specifically, the new FLS pathway comprises assimilating a Ci compound like methanol to produce formaldehyde in an enzymatic reaction prior to the FLS enzymatic reaction. Therefore, the invention provides for engineered cells comprising a new carbon fixation pathway including the FLS pathway, which has the advantage of only a small number of thermodynamically favorable chemical
transformations that convert a Ci compound into a three-carbon sugar in central metabolism.
The new FLS pathway has a lower energy consumption compared to the natural pathway. By the FLS pathway described herein, dihydroxyacetone phosphate can be produced in an energy efficient way, since the formose reaction catalyzed by the enzyme formolase (FLS) can directly form DHA from three (3) moles of formaldehyde. Only one (1) mol of ATP is required (instead of three mol ATP as in the Xylulose- Monophosphate cycle) to form dihydroxyacetone phosphate, which is an intermediate of glycolysis and thus of the central carbon metabolism.
Herein described are also recombinant FLS constructs (proteins and nucleic acid molecules) which can be expressed in eukaryotic cells, in particular in cellular compartments like organelles or peroxisomes, and respective expression cassettes.
According to a specific aspect, the FLS pathway comprises or consists of the recombinant FLS to convert formaldehyde into dihydroxyacetone (DHA).
Specifically, recombinant FLS is expressed under the control of a methanolinducible promoter, preferably pDAS2 or pAOX1.
Specifically, the invention provides for a yeast cell comprising a peroxisome, wherein the yeast cell is engineered to express a synthetic formolase (FLS) pathway in the peroxisome, comprising a recombinant FLS to biotransform formaldehyde into dihydroxyacetone (DHA), wherein the FLS is expressed as an FLS fusion protein that comprises the FLS fused to a peroxisomal targeting signal (PTS).
Specifically, the FLS pathway further comprises one or more enzymes to biotransform a Ci compound into formaldehyde, preferably wherein the Ci compound is any one of methanol, formate, methane, carbon monoxide, or carbon dioxide.
According to a specific aspect, the FLS pathway further comprises a methanol to formaldehyde converting enzyme (“MFCE”), such as an alcohol oxidase or a methanol dehydrogenase, to biotransform methanol into formaldehyde. Specifically, the alcohol oxidase is a methanol oxidase.
Specifically, the FLS pathway is a linear pathway.
Specifically, the FLS pathway is comprised in a metabolic pathway which further comprises one or more enzymes of gluconeogenesis using DHA (or DHAP) as a precursor.
According to a specific aspect, the FLS pathway is comprised in an organelle of the cell, preferably a peroxisome or mitochondria.
Specifically, the FLS is a heterologous protein, in particular an artificial enzyme originating from or comprising a wild-type (such as naturally-occurring in any organism) or a mutant FLS. Optionally, the FLS is fused to one or more heterologous sequences.
Specifically, the FLS is expressed as an FLS fusion protein that comprises the FLS fused to a peroxisomal targeting signal (PTS), preferably wherein the PTS is selected from the group consisting of the Minimal PTS1 sequence (“SKL”), the “PTS1 consensus sequence” (SEQ ID NO:17), the “PTS2 consensus sequence” (SEQ ID NO:18), SEQ ID NO:19, 20, 21 , 22, 23, 24, 25, and 26.
Specifically, the PTS comprises or consists of an amino acid sequence selected from the group consisting of SKL, SEQ ID NO:17, 18, 19, 20, 21 , 22, 23, 24, 25, and 26.
SEQ ID NO:17 comprises variables such that the amino acid sequence can have a length of 3-12 amino acids. The respective embodiments of SEQ ID NO:17 are provided as SEQ ID NO:35-44.
Specifically, the PTS can be fused at the C-terminus or the N-terminus of the FLS amino acid sequence, with or without a linking sequence.
Specifically, the PTS1 sequence comprises or consists of the sequence motif of a PTS1 sequence having a consensus sequence of (S/A/C/T)-(K/R/H)-(L/M/F) (i.e., the “PTS1 consensus sequence”), which is optionally extended at the N-terminus by one or more, up to nine amino acids, (X)o-9. Therefore, the PTS1 consensus sequence is characterized by the following formula: (X)o-9-(S/A/C/T)-(K/R/H)-(L/M/F) (see SEQ ID NO:17).
Preferably a PTS of the PTS1 consensus sequence is fused to the C-terminus.
Specifically, the PTS2 sequence comprises or consists of the sequence motif of a PTS2 sequence having a consensus sequence of (R/K)-(L/I/V)-(X)5-(H/Q)-(L/A) (i.e., the “PTS2 consensus sequence”, see SEQ ID NO: 18), which is preferably fused to the N-terminus.
In the consensus sequences, the “forward slash” indicates alternative amino acids in the sequence, and the “hyphen” indicates a peptidic linkage.
Specifically, the FLS pathway is expressed into the cell’s peroxisome, in particular wherein the cell is a yeast, such as e.g., a methylotrophic yeast. Preferably, the FLS pathway expressed into the peroxisome comprises an MFCE and a FLS to convert methanol to dihydroxyacetone.
According to a specific aspect, the FLS pathway comprises an alcohol oxidase and FLS, wherein: a) the MFCE is an endogenously expressed peroxisomal enzyme; and b) the FLS is an enzyme that is targeted to the peroxisome by fusing a PTS.
Specifically, the FLS is characterized by one or more of the following features: a) the FLS is a bacterial FLS or a benzaldehyde lyase of bacterial origin, such as originating from Pseudomonas fluorescens e.g., Pseudomonas fluorescens biovar I, preferably wherein the FLS is identified as SEQ ID NO:33 or by the NCBI accession number: 2AG0_A (Chain A, benzaldehyde lyase), or a naturally-occurring FLS homolog or ortholog thereof; b) the FLS is a recombinant FLS which is a mutant of a), preferably wherein the recombinant FLS comprises SEQ ID NO:33 which is engineered to incorporate one or more point mutations, preferably up to 15, 14, 13, 12, 11 , 10, 9, 8, or 7 point mutations, wherein the point mutations comprise any one or more point mutations, preferably comprising i) at least 1 , 2, 3, 4, 5, 6, or 7 (all) of any of the following mutations: A28I, W89R, L90T, R188H, A394G, G419N, or A480W; or ii) at least 1 , 2, 3, 4, 5, 6, or 7 (all) of any of the following mutations: A28L, W89R, R188H, N283H, A394G, G419N, and A480W; or iii) at least 1 , 2, 3, 4, or 5 of any of the following mutations:
W89R, R188H, A394G, G419N, orA480W, and optionally one, two or three additional mutations selected from: A28I, A28L, L90T, N283H; c) the FLS comprises or consists of SEQ ID NO:31 or SEQ ID NO:32; d) the FLS comprises at least any one of 90%, 95%, 96%, 97%, 98%, 99% or 100% sequence identity to any one of the FLS of a), b), or c); e) the FLS is functional as determined in a colorimetric enzymatic assay, such as described herein; f) the nucleotide sequence encoding the FLS is codon-optimized for expression in the cell.
Specific FLS described herein are:
a) a wild-type FLS of Pseudomonas fluorescens, wherein the FLS is identified as SEQ ID NO:33 or by the NCBI accession number: 2AG0 A (Chain A, benzaldehyde lyase); b) a recombinant FLS which comprises or consists of SEQ ID NO:32, which recombinant FLS comprises the point mutations A28I, W89R, L90T, R188H, A394G, G419N, and A480W as compared to SEQ ID NO:33; c) a recombinant FLS which comprises or consists of SEQ ID NO:31 , which recombinant FLS comprises the point mutations A28L, W89R, R188H, N283H, A394G, G419N, and A480W as compared to SEQ ID NO:33.
The FLS identified as SEQ ID NO:31 or SEQ ID NO:32 is a functional FLS mutant of a wild-type FLS.
The wild-type FLS identified by SEQ ID NO:33 comprises an amino acid (aa) sequence of 563 aa length. The mutant FLS can be longer than SEQ ID NO:33 e.g., by a C-terminal extension of the aa sequence. For example, the C-terminal extension of SEQ ID NO:33 can be an extension of 1 to 12 consecutive aa of “GSTENLYFQSGA1 (SEQ ID NO:34) wherein the extension starts with the first aa indicated in SEQ ID NO:34.
It is understood that a recombinant or mutant FLS originating from a wild-type FLS such as comprising or consisting of SEQ ID NO:33, may or may not comprise such C-terminal extension. In particular, it is understood that any of the recombinant or mutant FLS comprising or consisting of SEQ ID NO:31 or SEQ ID NO:32, or a functional variant of any of the foregoing, may or may not comprise such C-terminal extension.
SEQ ID NO:31 or SEQ ID NO:32 comprises an aa sequence of 575 aa length.
Specifically, the FLS comprises of consists of SEQ ID NO:31 or SEQ ID NO:32, or a functional variant of any of the foregoing, preferably wherein a) the functional variant of SEQ ID NO:31 comprises at least 1 , 2, 3, 4, 5, 6, or 7 (all) of the point mutations A28L, W89R, R188H, N283H, A394G, G419N, and A480W, preferably at least 3, 4, or 5 mutations selected from W89R, R188H, A394G, G419N, or A480W, point mutations as compared to SEQ ID NO:33, and at least at least any one of 90%, 95%, 96%, 97%, 98%, 99% or 100% sequence identity to SEQ ID NO:31 ; b) the functional variant of SEQ ID NO:32 comprises at least 1 , 2, 3, 4, 5, 6, or 7 (all) of the point mutations A28I, W89R, L90T, R188H, A394G, G419N, and A480W, preferably at least 3, 4, or 5 mutations selected from W89R, R188H,
A394G, G419N, or A480W, point mutations as compared to SEQ ID NO:33, and at least at least any one of 90%, 95%, 96%, 97%, 98%, 99% or 100% sequence identity to SEQ ID NO:32.
According to a specific aspect, the cell is a microbial cell, a mammalian, insect, plant or algae cell.
Specifically, the cell is: a) a yeast cell, preferably of a genus selected from the group consisting of Pichia, Komagataella, Hansenula, Saccharomyces, Kluyveromyces, Candida, Ogataea, Yarrowia, and Geotrichum, preferably Pichia pastoris, Komagataella phaffii, Komagataella pastoris, Komagataella pseudopastoris, Saccharomyces cerevisiae, Ogataea minuta, Kluyveromces lactis, Kluyveromyes marxianus, Yarrowia lipolytica or Hansenula polymorpha, preferably a methylotrophic yeast; b) a cell of filamentous fungi, preferably of a genus selected from the group consisting of Aspergillus, Penicillium, Trichoderma, Neurospora, Rhizopus preferably Aspergillus niger, Aspergillus awamori Aspergillus terreus, Aspergillus oryzae, Aspergillus nidulans, Neurospora crassa, Rhizopus oryzae, Penicillium chrysogenum or Trichoderma reeser, c) a non-human primate, human, rodent, or bovine cell, preferably a mouse myeloma (NSO)-cell line, a Chinese hamster ovary (CHO)-cell line, HT1080, H9, HepG2, MCF7, MDBK Jurkat, MDCK, NIH3T3, PC12, BHK (baby hamster kidney cell), VERO, SP2/0, YB2/0, Y0, C127, L cell, COS, QC1-3, HEK-293, PER.C6, HeLA, EBI, EB2, EB3, oncolytic or hybridoma-cell line; d) an insect cell, preferably a Sf9, Mimic™ Sf9, Sf21 , High Five (BT1-TN-5B1- 4), or BT1-Ea88 cell; e) an algae cell, preferably of the genus Amphora, Bacillariophyceae, Dunaliella, Chlorella, Chlamydomonas, Cyanophyta (cyanobacteria), Nannochloropsis, Spirulina, or Ochromonas); or f) a plant cell, preferably a cell from monocotyledonous plants, preferably maize, rice, wheat, or Setaria, or from a dicotyledonous plant, preferably cassava, potato, soybean, tomato, tobacco, alfalfa, Physcomitrella patens or Arabidopsis.
The preferred yeast host cells are derived from methylotrophic yeast, such as from Pichia or Komagataella e.g., Pichia pastoris, or Komagataella pastoris, or K. phaffii, or K. pseudopastoris. Examples of the host include yeasts such as P. pastoris. Examples of P. pastoris strains include CBS 704 (=NRRL Y-1603 = DSMZ 70382), CBS
2612 (=NRRL Y-7556), CBS 7435 (=NRRL Y-11430), CBS 9173-9189 (CBS strains: CBS-KNAW Fungal Biodiversity Centre, Centraalbureau voor Schimmelcultures, Utrecht, The Netherlands), and DSMZ 70877 (German Collection of Microorganisms and Cell Cultures), but also strains from Invitrogen, such as X-33, GS115, KM71 and SMD1168. Examples of S. cerevisiae strains include W303, CEN.PK and the BY-series (EUROSCARF collection). All of the strains described above have been successfully used to produce transformants and express heterologous genes.
A preferred yeast host cell may be a P. pastoris or S. cerevisiae host cell, specifically wherein such host cell contains heterologous or recombinant promoter sequences, which may be derived from a P. pastoris or S. cerevisiae strain, different from the production host. In another specific embodiment the host cell described herein comprises a recombinant expression construct described herein comprising the promoter originating from the same genus, species or strain as the host cell.
According to a specific aspect, the cell is an engineered Mut' yeast, specifically, wherein the Mut' yeast is characterized by a phenotype with reduced growth on methanol.
Specifically, the Mut' yeast is engineered to reduce expression of a dihydroxyacetone synthase (DAS) compared to an endogenous expression thereof, in particular as compared to endogenous expression in the yeast without such engineering, preferably wherein said engineering is a deletion, knock-out or disruption of any one or both of the DAS1 and DAS2 genes.
According to a specific aspect, the cell can be cultured in a cell culture using a cell culture medium comprising a Ci compound as a carbon source, in particular wherein the carbon source is used as a source of energy and/or as a source of producing fermentation products including e.g., expression products.
Specifically, the cell is capable of utilizing a Ci compound as the sole carbon source for producing biomass and/or an expression product, such as e.g., in a cell culture.
The invention further provides for a method of culturing the cell described herein in a cell culture using a Ci compound as a carbon source, thereby obtaining biomass and/or an expression product in the cell culture, preferably wherein the Ci compound is the sole carbon source.
Specifically, the expression product is a protein of interest (POI) or a metabolite, preferably wherein:
a) the POI is a peptide or protein selected from the group consisting of an antigen-binding protein, a therapeutic protein, an enzyme, a peptide, a protein antibiotic, a toxin fusion protein, a carbohydrate - protein conjugate, a structural protein, a regulatory protein, a vaccine antigen, a growth factor, a hormone, a cytokine, a process enzyme; b) the metabolite is an amino acid, organic acid, alcohol, sugar alcohol, carbohydrate, vitamin, amine, aldehyde, ketone, or a polyhydroxyketone, preferably wherein the metabolite is selected from the group consisting of lactic acid, citric acid, propionic acid, butyric acid, valeric acid, hexanoic acid, adipic acid, succinic acid, fumaric acid, malic acid, 2,5-furan dicarboxylic acid, aspartic acid, glucaric acid, gluconic acid, glutamic acid, itaconic acid, levulinic acid, acrylic acid, 3-hydroxy propionic acid, ethanol, propanol, isopropanol, butanol, pentanol, hexanol, heptanol, octanol, butanediol, 1 ,3-propanediol, 1 ,2-propanediol, 2-amino-1 ,3-propanediol, 3- hydroxybutyrate, poly-3-hydroxybutyrate, 3-hydroxy propionaldehyde, 3- hydroxybutyrolactone, xylitol, arabinitol, sorbitol, mannitol, vitamin C, riboflavin, thiamine, tocopherol, cobalamin, pantothenic acid, biotin, pyridoxine, niacin, folic acid, diamino pentane, diamino hexane and dihydroxyacetone.
Specifically, for the purpose of expressing an expression product, a recombinant expression cassette is used, which comprises one or more heterologous nucleic acid sequence(s).
According to a preferred aspect, the cell is a yest such as e.g., a methylotrophic yeast, and the expression cassette comprises a heterologous coding sequence (or gene) which is operably linked to a methanol-inducible promoter.
According to a specific aspect, a method of producing biomass and/or an expression product in a cell culture is provided, using a cell described herein and a Ci compound, preferably wherein the Ci compound is the sole carbon source in the cell culture.
The methods described herein employ the cell as described herein. Therefore, the methods described herein are specifically characterized by the features of the cell as described herein.
The invention further provides for a fusion protein comprising a formolase (FLS) that is fused to a peroxisomal targeting signal (PTS).
Specifically, the fusion protein is characterized by the features described as features of the FLS as used in the FLS pathway described herein.
Specifically, the fusion protein is characterized by one or more of the following features: a) the PTS is selected from the group consisting of the Minimal PTS1 sequence (“SKL”), the “PTS1 consensus sequence” (as defined herein), the “PTS2 consensus sequence” (SEQ ID NO:17), SEQ ID NO:18, 19, 20, 21 , 22, 23, 24, 25, and 26; and b) the FLS is as further described herein.
According to a specific aspect, the fusion protein comprises an FLS which is a recombinant FLS such as a functional FLS mutant of a wild-type bacterial FLS, such as an FLS of bacterial origin e.g., originating from Pseudomonas fluorescens, in particular Pseudomonas fluorescens biovarl, wherein the recombinant FLS comprises the amino acid sequence SEQ ID NO:33 wherein the amino acid sequence is engineered to incorporate one or more point mutations, preferably up to 15, 14, 13, 12, 11 , 10, 9, 8, or 7 point mutations, wherein the point mutations comprise any one or more point mutations, preferably comprising i) at least 1 , 2, 3, 4, 5, 6, or 7 (all) of any of the following mutations: A28I, W89R, L90T, R188H, A394G, G419N, or A480W; or ii) at least 1 , 2, 3, 4, 5, 6, or 7 (all) of any of the following mutations: A28L, W89R, R188H, N283H, A394G, G419N, and A480W; or iii) at least 1 , 2, 3, 4, or 5 of any of the following mutations:
W89R, R188H, A394G, G419N, or A480W, and optionally one, two or three additional mutations selected from: A28I, A28L, L90T, N283H; preferably wherein the recombinant FLS comprises or consists of SEQ ID NO:31 or SEQ ID NO:32.
Specifically, the fusion protein comprises an FLS which comprises of consists of SEQ ID NO:31 or SEQ ID NO:32, or a functional variant of any of the foregoing, preferably wherein a) the functional variant of SEQ ID NO:31 comprises at least 1 , 2, 3, 4, 5, 6, or 7 (all) of the point mutations A28L, W89R, R188H, N283H, A394G, G419N, and A480W, preferably at least 3, 4, or 5 mutations selected from W89R, R188H, A394G, G419N, or A480W, point mutations as compared to SEQ ID NO:33, and at least at least any one of 90%, 95%, 96%, 97%, 98%, 99% or 100% sequence identity to SEQ ID NO:31 ; b) the functional variant of SEQ ID NO:32 comprises at least 1 , 2, 3, 4, 5, 6, or 7 (all) of the point mutations A28I, W89R, L90T, R188H, A394G, G419N, and
A480W, preferably at least 3, 4, or 5 mutations selected from W89R, R188H, A394G, G419N, or A480W, point mutations as compared to SEQ ID NO:33, and at least at least any one of 90%, 95%, 96%, 97%, 98%, 99% or 100% sequence identity to SEQ ID NO:32.
The invention further provides for a nucleic acid molecule encoding the fusion protein described herein, preferably wherein the nucleic acid molecule is codon- optimized for expression in a eukaryotic host cell.
Specifically, the nucleic acid molecule is an isolated nucleic acid molecule. The isolated nucleic acid molecule may be incorporated into an expression cassette or an expression construct comprising the expression cassette, such as a vector, plasmid, or artificial chromosome. A preferred yeast expression vector is for expression in a yeast host cell, such as selected from the group consisting of methylotrophic yeasts represented by the genera Ogataea, Hansenula, Pichia, Candida and Torulopsis. Specifically, plasmids derived from pPICZ, pGAPZ, pPIC9, pPICZalfa, pGAPZalfa, pPIC9K, pGAPHis or pPUZZLE can be used as a vector.
The invention further provides for an expression cassette comprising the nucleic acid molecule described herein and regulatory sequences to express the fusion protein in a eukaryotic host cell.
Specifically, the FLS fusion protein is expressed under the control of a methanolinducible promoter, preferably pDAS2 or pAOX1.
Specifically provided herein is a eukaryotic cell comprising the expression cassette described herein.
Specifically, the recombinant FLS described herein is a functional enzyme, wherein the enzymatic activity is determined by a standard colorimetric assay such as described in Example 4, or by Cai et al. (Cai et al. 2021), or by Siegel et al. (Siegel et al. 2015).
Specifically, a FLS is considered functional if a catalytic efficiency higher than 0,01 M'1*s'1 can be detected.
FIGURES
Figure 1 : Schematic representation of the constructed and integrated gene cassettes for generation of the P. pastoris strain (A) ADAS1 , ADAS2, AAOX1 , (pDAS2::FLS::HpHr)+ and (B) strain ADAS1 , ADAS2, AAOX1 , (pAOX1 ::FLS::G418r).
Figure 2: Confirmation of the FLS gene expression cassettes by PCR amplification using genomic DNA as template. Amplified gene fragments (1-6) SEQ3 from ADAS1 , ADAS2.AA0X1, (pDAS2::FLS::HpHr)+, transformants, and (7-12) SEQ4 from ADAS1 , ADAS2, AA0X1 , (pAOX1 ::FLS::G418r)+, (M) Quick-Load 1 kb Plus DNA Ladder (New England Biolabs).
Figure 3: (A) The growth curves of the FLS expressing P. pastoris transformants compared to the parental strain and an empty vector strain in synthetic M2 media with methanol as carbon source. (B) The logarithmic representation of the same growth curves presenting the linear fitted regions used to calculate the growth rate.
Figure 4: Microscopic images after 40 hours of incubation in M2/methanol media (A) FLS expressing P. pastoris strain ADAS1 , ADAS2, AAOX1 , (pAOX1 ::FLS::G418r)+ and of (B) the parental P. pastoris MUT' strain ADAS1 , ADAS2, AAOX1 at a 1000x magnification with a total field of view diameter of 200 pm.
Figure 5:
SEQ ID NO:1 : DNA sequence of FLS::PTS1 coding sequence
SEQ ID NO:2: Protein sequence of FLS::PTS1 (fusion protein)
SEQ ID NO:3: pDAS2::FLS::HpHr cassette sequenced integrated into genome of P. pastoris CBS7435 ADAS1, ADAS2, AAOX1
SEQ ID NO:4: pAOX1 ::FLS::HpHr cassette sequenced integrated into genome of P. pastoris CBS7435 ADAS1, ADAS2, AAOX1
SEQ ID NO:5: PCR Primer Amplification DAS1 locus, (DAS1-F)
SEQ ID NO:6: PCR Primer Amplification DAS1 locus, (DAS1-R)
SEQ ID NO:7: PCR Primer Amplification DAS2 locus, (DAS2-F) SEQ ID NO:8: PCR Primer Amplification DAS2 locus, (DAS2-R) SEQ ID NO:9: PCR Primer Amplification AOX1 locus, (AOX1-F) SEQ ID NQ:10: PCR Primer Amplification AOX1 locus, (AOX1-R) SEQ ID NO:11 : PCR Screening Primer for FLS cassette integration (Screening PCR-F)
SEQ ID NO:12: PCR Screening Primer for FLS cassette integration (Screening PCR-R)
SEQ ID NO: 13: DNA sequence of genomic locus of AAOX1 of P. pastoris CBS7435 ADAS1, ADAS2, AAOX1
SEQ ID NO:14: DNA sequence of genomic locus of ADAS1 of P. pastoris CBS7435 ADAS1, ADAS2, AAOX1
SEQ ID NO: 15: DNA sequence of genomic locus of ADAS2 of P. pastoris CBS7435
ADAS1, ADAS2, AA0X1
SEQ ID NO:16: Fusion protein comprising a FLS (from Siegel et al. 2015) and a PTS
Minimal PTS1 sequence: SKL
SEQ ID NO:17 PTS1 consensus sequence: (X)o-9-(S/A/C/T)-(K/R/H)-(L/M/F)
SEQ ID NO:18 PTS2 consensus sequence: (R/K)-(L/IA/)-(X)s-(H/Q)-(L/A)
SEQ ID NO:19 PTS1 from AOX1 (Pichia pastoris)
SEQ ID NO:20 PTS1 from AOX2 Pichia pastoris)
SEQ ID NO:21 PTS1 from FBA1 Pichia pastoris)
SEQ ID NO:22 PTS1 from RKI1-2 (Pichia pastoris)
SEQ ID NO:23 PTS1 from TAL1-2 (Pichia pastoris)
SEQ ID NO:24 PTS1 from CTA1 (Pichia pastoris)
SEQ ID NO:25 PTS2 from POX3 (Saccharomyces cerevisiae)
SEQ ID NO:26 PTS2 from AMO (Hansenula polymorpha)
SEQ ID NO:27 AOX1 from Komagataella phaffii strain ATCC 76273 / CBS 7435
SEQ ID NO:28: AOX2 from Komagataella phaffii strain ATCC 76273 / CBS 7435,
SEQ ID NO:29: AOD1 from Candida boidinii
SEQ ID NO:30: MOX Hansenula polymorpha
SEQ ID NO:31 : recombinant FLS of SEQ ID NO:2
SEQ ID NO:32: recombinant FLS of SEQ ID NO:16
SEQ ID NO:33: FLS of Pseudomonas fluorescens, NCBI accession number:
2AG0_A (Chain A, benzaldehyde lyase)
SEQ ID NO:34: exemplary C-terminal extension, to extend SEQ ID NO:33
SEQ ID NO:35-SEQ ID NO:44: embodiments of SEQ ID NO:17
DETAILED DESCRIPTION OF THE INVENTION
Unless indicated or defined otherwise, all terms used herein have their usual meaning in the art, which will be clear to the skilled person. Reference is for example made to the standard handbooks. Genetic modifications described herein may employ tools, methods and techniques known in the art, such as described by J. Sambrook et al., Molecular Cloning: A Laboratory Manual (3rd edition), Cold Spring Harbor
Laboratory, Cold Spring Harbor Laboratory Press, New York (2001), Lewin "Genes ”V", Oxford University Press, New York (1990), and Janeway et al. "Immunobiology" (5th Ed., or more recent editions), Garland Science, New York (2001).
The terms “comprise”, “contain”, “have” and “include” as used herein can be used synonymously and shall be understood as an open definition, allowing further members or parts or elements. “Consisting” is considered as a closest definition without further elements of the “consisting” definition feature. Thus “comprising” is broader and contains the “consisting” definition.
The term “about” as used herein refers to the same value or a value differing by +/-10% or +/-5% of the given value.
The subject matter of the claims specifically refers to artificial products or methods employing or producing such artificial products, which may be variants of native (wild-type) products. Though there can be a certain degree of sequence identity to the native structure, it is well understood that the materials, methods and uses of the invention, e.g., specifically referring to isolated nucleic acid sequences, amino acid sequences, fusion constructs, expression constructs, engineered cells such as host cells incorporating recombinant expression constructs or synthetic metabolic pathways, and modified proteins, are “man-made” or synthetic, and are therefore not considered as a result of “laws of nature”.
Specific terms as used throughout the specification have the following meaning: The term “biomass” as used herein refers to the substance obtained by isolating whole cells, fractions or extracts thereof which are obtained from a cell culture, in particular where the biomass includes carbohydrates and/or proteins.
The term “carbon source” is herein understood as “carbon substrate” and shall mean a fermentable carbon substrate which can be used to produce organic carbon compounds, suitable as an energy source for microorganisms. Ci compounds can be used as a carbon source, which are inorganic or organic compounds that comprise only one carbon atom per molecule or ion. Exemplary Ci carbon molecules used as substrates for biomass production and other fermentation processes described herein include methane, carbon dioxide (in the gaseous or solubilized form), carbon monoxide, methanol and synthesis gas (a mixture of carbon monoxide and hydrogen), but may also be formate, or mixtures of any of the foregoing. A compound may be used as a sole carbon source, or else a mixture of different compounds can be used as a carbon source.
The term “cell” with respect to an engineered cell as used herein, which comprises a metabolic pathway, which is herein also referred to as a “host cell”, shall refer to a single cell, a single cell clone, or a cell line.
The term “cell line” as used herein refers to an established clone of a particular cell type that has acquired the ability to proliferate over a prolonged period of time. A cell line is typically used for expressing an endogenous or recombinant nucleic acid molecule or gene, a metabolic pathway or products of a metabolic pathway, to produce fermentation products, such as biomass or expression products e.g., polypeptides (including e.g., a protein of interest, POI), nucleic acid products (for example DNA or RNA), cells and/or viruses, or cell metabolites.
A “production host cell line” or “production cell line” is commonly understood to be a cell line ready-to-use for cell culture in a bioreactor to obtain fermentation products. It is well understood that the term “host cell” refers to recombinant host cells and does not include human beings. Specifically, recombinant host cells as described herein are artificial organisms and derivatives of native (wild-type) host cells.
Specific embodiments described herein refer to a host cell which is engineered as described herein, such as to incorporate and to express a FLS pathway as described herein. Such recombinant host cells are provided as isolated host cell and respective host cell lines or cultures. Recombinant host cells can be cultured ex vivo to produce fermentation products.
The term “cell culture” or “culturing” as used herein with respect to a host cell refers to the maintenance of cells in an artificial, e.g., an in vitro environment, under conditions favoring growth, differentiation or continued viability, in an active or quiescent state, of the cells, specifically in a controlled bioreactor according to methods known in the industry. When culturing a cell culture using appropriate culture media, the cells are brought into contact with the media in a culture vessel or with substrate under conditions suitable to support culturing the cells in the cell culture. Standard cell culture media and techniques are well-known in the art.
Recombinant proteins or cellular metabolites can be produced using the host cell and respective cell lines described herein, by culturing in an appropriate medium, isolating the expressed product or metabolite from the culture, and optionally purifying it by a suitable method.
The host cell described herein can be tested for its capacity to produce expression products by any of the following tests: ELISA, activity assay, capillary
electrophoresis, HPLC, or other suitable tests, such as SDS-PAGE and Western Blotting techniques, or mass spectrometry.
The term “expression” as used herein refers to expression of a polynucleotide or gene from an expression cassette, or to the expression of the respective polypeptide or protein.
The term "expression cassette” is herein understood to refer to nucleic acid molecules (herein also referred to as polynucleotides), which contain a desired coding sequence (herein referred to as a gene), and control sequences in operable linkage, so that hosts transformed or transfected with these molecules incorporate the respective sequences and are capable of producing the expression products.
An expression cassette may comprise one or more nucleic acid molecules (e.g., a gene) that are endogenous or heterologous to a host cell. Specifically, the expression cassette may comprise polynucleotides encoding a metabolic pathway, such as a pathway comprising the FLS pathway described herein. Specifically, the metabolic pathway described herein may be encoded by polynucleotides comprised in one or more expression cassettes, e.g., in one or more expression constructs comprised in the host cell.
An expression cassette can be engineered ex vivo or in vivo. For example, an expression cassette can be engineered as part of a vector or an artificial expression construct that can be provided as isolated expression construct that is engineered using in vitro techniques, and optionally incorporated in a cell culture.
Specifically, the recombinant FLS expression cassette described herein comprises the FLS encoding nucleic acid molecule operably linked to one or more regulatory elements that control expression of the FLS.
Specifically, the expression cassette expressing the recombinant FLS described herein comprises one or more regulatory elements not natively associated with the gene encoding the FLS. Such expression cassettes comprising one or more elements that are not natively associated with a gene, are herein understood as artificial or recombinant expression cassettes.
An element of an expression cassette that is “not natively associated with” such gene is typically not operably linked to such gene in a wild-type host cell, whereas the wild-type host cell is endogenously expressing such gene with regulatory elements that are “natively associated with” such gene. A wild-type host cell is herein understood to be a naturally-occurring host cell that is not recombined by any artificial means.
One or more expression cassettes are herein also understood as “expression system”. The expression system may be included in an expression construct, such as an artificial heterologous expression cassette, a vector and in particular a plasmid. The relevant DNA of an expression cassette or construct may also be integrated into a host cell chromosome. Expression may refer to secreted or non-secreted expression products, including polypeptides or metabolites.
Expression cassettes are conveniently provided as expression constructs e.g., in the form of “vectors” or “plasmids”, which are typically DNA sequences that are required for the transcription of cloned recombinant nucleotide sequences i.e., of recombinant genes and the translation of their mRNA in a suitable host organism. Expression vectors or plasmids usually comprise an origin for autonomous replication or a locus for genome integration in the host cells, selectable markers (e.g., an amino acid synthesis gene or a gene conferring resistance to antibiotics such as zeocin, kanamycin, G418 or hygromycin, nourseothricin), a number of restriction enzyme cleavage sites, a suitable promoter sequence and a transcription terminator, which components are operably linked together. The terms “plasmid” and “vector” as used herein include autonomously replicating nucleotide sequences as well as genome integrating nucleotide sequences, such as artificial chromosomes e.g., a bacterial artificial chromosome (BAC) or yeast artificial chromosome (YAC).
Expression vectors may include but are not limited to cloning vectors, modified cloning vectors and specifically designed plasmids. Preferred expression vectors described herein are expression vectors suitable for expressing a recombinant gene in a eukaryotic host cell and are selected depending on the host organism. Appropriate expression vectors typically comprise regulatory sequences suitable for expressing DNA encoding polypeptide or protein of interest in a eukaryotic host cell. Examples of regulatory sequences include promoter, operators, enhancers, ribosomal binding sites, and sequences that control transcription and translation initiation and termination. The regulatory sequences are typically operably linked to the DNA sequence to be expressed.
To allow expression of a recombinant nucleotide sequence in a host cell, a promoter sequence is typically regulating and initiating transcription of the downstream nucleotide sequence, with which it is operably linked. An expression cassette or vector typically comprises a promoter nucleotide sequence which is adjacent to the 5’ end of a coding sequence, e.g., upstream from and adjacent to the coding sequence (e.g.,
gene of interest) or if a signal or leader sequence is used, upstream from and adjacent to said signal and leader sequence, respectively, to facilitate translation initiation and expression of coding sequences to obtain the expression product.
As used herein, the term “adjacent” is meant to refer to a distance between elements of a nucleic acid sequence which is “within about 10bp” or “within about 5 bp” of the nucleic acid sequence.
Specific expression cassettes described herein comprise a promoter operably linked to a coding nucleotide sequence. Specifically, the promoter can be used to control expression of a coding sequence, wherein the promoter is not natively associated with the coding sequence.
Specific expression constructs described herein comprise a coding sequence linked with a leader sequence (e.g., a secretion signal peptide sequence (presequence), a pro-sequence, or a pre-pro-sequence), which causes transport of the expressed polypeptide or protein into the secretory pathway and/or secretion from the host cell. The presence of such a secretion leader sequence in an expression cassette or construct is typically required when the polypeptide or protein intended for recombinant expression is not naturally secreted by the host cell and therefore lacks a natural secretion leader sequence, or its nucleotide sequence has been cloned without its natural secretion leader sequence.
In specific embodiments, multicloning vectors may be used, which are vectors having a multicloning site. Specifically, one or more desired polynucleotides can be integrated or incorporated at a multicloning site to prepare an expression vector. In the case of multicloning vectors, a promoter is typically placed upstream of the multicloning site to control expression of said one or more desired polynucleotides.
The term "gene expression", “expressing a polynucleotide”, “expressing a nucleic acid molecule” or “expressing a pathway” as used herein, is meant to encompass at least one step selected from the group consisting of DNA transcription into mRNA, mRNA translation and processing, mRNA maturation, mRNA export, protein folding and/or protein transport, thereby obtaining expression products, precursors or intermediates thereof.
The terms “polynucleotide”, “nucleic acid molecule(s)” or “nucleic acid sequence(s)” are interchangeably used herein and refer to nucleotides, either ribonucleotides or deoxyribonucleotides or a combination of both, in a polymeric unbranched form of any length. Preferably, a polynucleotide refers to
deoxyribonucleotides in a polymeric unbranched form of any length. Here, nucleotides consist of a pentose sugar (deoxyribose), a nitrogenous base (adenine, guanine, cytosine or thymine) and a phosphate group.
The term “endogenous” as used herein is meant to include those molecules and sequences, in particular genes or proteins, which are present in a wild-type (native, not recombinant) host cell or expressed by such wild-type host cell, thereby “endogenous” to said wild-type host cell. In particular, an endogenous nucleic acid molecule (e.g., a gene) or protein that does occur in (and originates from, or can be obtained from) a particular host cell as it is found in nature, is understood to be “host cell endogenous” or “endogenous to the host cell”. Moreover, a cell “endogenously expressing” a nucleic acid or protein expresses that nucleic acid or protein as does a host of the same particular type as it is found in nature. Moreover, a host cell “endogenously producing” or that “endogenously produces” a nucleic acid, protein, or other compound produces that nucleic acid, protein, or compound as does a host cell of the same particular type as it is found in nature. An endogenous protein can be overexpressed such as e.g., in an engineered host cell comprising an artificial expression cassette that expresses the protein at a higher level as compared to the level expressed by a respective wild-type host cell without such engineering. Even if a protein is no more produced by a host cell, such as e.g., a knockout mutant of the host cell, where the protein encoding gene is inactivated or deleted, the protein is herein still referred to as “endogenous”.
The term “heterologous” as used herein with respect to a nucleotide sequence, an expression cassette or construct, or any element or part of any one of the foregoing, an amino acid sequence or protein, refers to a compound which is either foreign to a given host cell, i.e. “exogenous”, such as not found in nature in said host cell; or that is naturally found in a given host cell, e.g., is “endogenous”, however, in the context of a heterologous construct or integrated in such heterologous construct, e.g., employing a heterologous nucleic acid fused or in conjunction with an endogenous nucleic acid, thereby rendering the construct heterologous. The heterologous nucleotide sequence as found endogenously may be produced in an unnatural, e.g., greater than expected or greater than naturally found, amount in the cell. The heterologous nucleotide sequence, or a nucleic acid comprising the heterologous nucleotide sequence, possibly differs in sequence from the endogenous nucleotide sequence but may still encode the same protein as found endogenously. Specifically, heterologous nucleotide sequences are those not found in the same relationship to a host cell in nature. Any recombinant
or artificial nucleotide sequence is understood to be heterologous. An example of a heterologous polynucleotide is a fusion product of a nucleotide sequence encoding an FLS that is fused to a PTS coding nucleotide sequence to which the respective endogenous, naturally-occurring FLS coding sequence is not normally fused, or a nucleotide sequence operably linked to a promoter to which the respective endogenous, naturally-occurring nucleotide sequence is not normally operably linked.
The term “formolase”, abbreviated FLS, as used herein shall refer to an enzyme which catalyzes carboligation of three 1 -carbon formaldehyde molecules into one 3- carbon dihydroxyacetone molecule.
An FLS may originate from a natural source, or can be a functional mutant thereof. Alternatively, an FLS can be a computationally-designed enzyme derived from benzaldehyde lyase (BAL).
The FLS may comprise or consist of any one of SEQ ID NO:31 or SEQ ID NO:32, or be a functional variant of any of the foregoing. The FLS may be provided as a derivative of any of the foregoing e.g., a fusion protein comprising the FLS sequence and a PTS.
The term “methanol to formaldehyde converting enzyme”, abbreviated “MFCE”, as used herein shall refer to an “alcohol oxidase”, abbreviated “AOX”, or “methanol oxidase”, or “methanol dehydrogenase”, and shall refer to an enzyme which catalyzes conversion of methanol into formaldehyde. Unlike alcohol dehydrogenases, alcohol oxidases are unable to catalyze the reverse reaction. Reference is made to EC 1.1.3.13 (alcohol oxidase) and EC 1.1.1.1 (alcohol dehydrogenase).
An MFCE may originate from a natural source, such as originating from a methylotrophic yeast, or can be a functional mutant thereof. Alternatively, an MFCE can be a computationally-designed enzyme derived from a functional oxidoreductase or dehydrogenase.
The MFCE may comprise or consist of any one of SEQ ID NO:27, 28, 29 or 30, or be a functional variant of any of the foregoing. A “functional mutant” or “functional variant” of an enzyme like FLS or MFCE, as used herein shall mean a variant in which at least one amino acid among wild-type enzyme amino acids is substituted, inserted, removed or modified, preferably wherein the mutation is a limited number of point mutations (a point mutation being the substitution, insertion, removal or modification of only one amino acid in a sequence), such as up to 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 point mutation(s), and/or wherein the enzyme mutant comprises a certain sequence identity
to the wild-type sequence, such as at least any one of 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% sequence identity.
The term “isolated” as used herein with respect to a polynucleotide, nucleic acid molecule, expression cassettes or constructs, or host cells shall refer to such compound that has been sufficiently separated from the environment with which it would naturally be associated. Yet, “isolated” does not necessarily mean the exclusion of artificial or synthetic mixtures with other compounds or materials, or the presence of impurities that do not interfere with the fundamental activity, and that may be present, for example, due to incomplete purification. Isolated compounds can be further formulated to produce preparations thereof, and still for practical purposes be isolated. For example, an isolated expression product can be mixed with pharmaceutically acceptable carriers or excipients when used in diagnosis or therapy.
The term "metabolite” as used herein refers to a product of metabolic reactions catalyzed by one or more enzymes of a metabolic pathway of a host cell incorporating the pathway, and may include reactant, product and cofactor molecules of said enzymes. Metabolites may arise in the same pathway(s) as the cell metabolic pathway or pathways encoding an enzyme which catalyzes the synthesis of the cell growth and/or productivity inhibitor or intermediate thereof or may be synthesized in a branching pathway.
The term "operably linked" as used herein refers to the association of nucleotide sequences on a single nucleic acid molecule, e.g., a vector, an expression cassette or an expression construct, in a way such that the function of one or more nucleotide sequences is affected by at least one other nucleotide sequence present on said nucleic acid molecule. By operably linking, a nucleic acid sequence is placed into a functional relationship with another nucleic acid sequence on the same nucleic acid molecule. For example, a promoter and/or a gene switch is operably linked with a coding sequence of a recombinant gene, when it is capable of effecting the expression of that coding sequence. Specifically, such nucleic acids operably linked to each other may be directly linked i.e., immediately linked, meaning without additional elements or nucleic acid sequences in between the nucleic acids. Alternatively, a suitable linking sequence can be used such as e.g., a cloning site positioned between the promoter and/or gene switch and a gene of interest e.g., a target gene.
A promoter sequence is typically understood to be operably linked to a coding sequence, if the promoter controls the transcription of the coding sequence. If a
promoter sequence is not natively associated with the coding sequence, its transcription is either not controlled by the promoter in native (wild-type) cells or the sequences are recombined with different contiguous sequences.
The term “promoter” as used herein is meant to refer to a nucleic acid molecule that initiates, regulates, or otherwise mediates or controls the expression of a protein coding polynucleotide (DNA). Promoter DNA and/or coding DNA may be native DNA such as occurring or originating from a wild-type cell e.g., from the same gene or from different genes, or may be from the same or different organisms, but can be recombinant or artificial. A heterologous promoter may be heterologous to a polynucleotide to be expressed and/or an artificial promoter, or a promoter that is originating from the wild-type host cell, but positioned in the host cell genome within a heterologous expression cassette or positioned at a location where it is not naturally- occurring in the wild-type host cell.
Specific promoters are regulatable e.g., inducible or repressible by a compound, or constitutive.
Exemplary promoters which can be used for the purpose described herein to express the recombinant FLS in the eukaryotic host cell are selected from the group consisting of methanol-inducible promoters, such as the A0X1 or A0X2 promoter, or a constitutive promoter such as MDH3, POR1 , PDC1 , FBA1-1 , or GPMI (Prielhofer et al. 2017, BMC Sys Biol. 11 : 123), or a functional variant of any of the foregoing.
As an alternative to native or wild-type promoter sequences, functional variants of such native or wild-type promoter sequences (herein understood as parent promoters) can be used, which have at least any one of 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% sequence identity and are functional in controlling the expression of a gene in substantially similar way, e.g. being an inducible promoter or constitutive promoter as the parent promoter, and preferably having about the same or increased promoter strength.
Specifically, a functional variant of a promoter described herein comprises a certain sequence identity to the promoter from which it is derived, over the full-length or the part at the 3’-end of the promoter sequence which part has a length of at least 300, 400, or 500 bp, and which is functional to operatively control expression of the polynucleotide to be expressed, in particular with about the same promoter activity (e.g. +/- any one of 50%, 40%, 30%, 20%, or 10%), although the promoter activity may be increased to higher than 100% as compared to the promoter from which it is derived.
The promoter activity can be determined by measuring the strength of the promoter. The promoter strength is typically understood as the transcription strength. There are standard methods to determine the transcription strength of a promoter, such as by measuring the quantity of transcripts, e.g., employing a microarray, or else in a cell culture, such as by measuring the quantity of respective gene expression products in recombinant cells. In particular, the transcription rate may be determined by the transcription strength on a microarray, Northern blot or with quantitative real time PCR (qRT-PCR) or with RNA sequencing (RNA-seq).
A “methanol-inducible promoter” is herein understood as a naturally occurring or wild-type promoter controlling expression of genes of the methanol dissimilatory pathway of organisms, in particular methylotrophic microorganisms.
For the purpose described herein, expression of the recombinant FLS may be driven by a methanol-inducible promoter, such as the A0X1 or A0X2 promoter, or a functional variant of any of the foregoing.
According to the methanol dissimilatory pathway in methylotrophic yeast, such as P. pastoris, methanol passively diffuses into the yeast peroxisome. There it is converted to formaldehyde by one of two different alcohol oxidase isozymes (Aox1 , Aox2). In a wild-type methylotrophic yeast, formaldehyde can be further oxidized in several steps to CO2 via the methanol dissimilatory pathway. Alternatively, formaldehyde is incorporated into the pentose phosphate pathway via a condensation reaction with xylulose 5-phosphate, a reaction catalyzed by a specialized transketolase enzyme called DiHydroxyAcetone Synthase (Das). This reaction yields a molecule of dihydroxyacetone (DHA) and a molecule of glyceraldehyde 3-phosphate. Each of these reactions occurs in peroxisomes in methylotrophic yeasts. As an alternative to such wild-type pathways, formaldehyde is directly converted to DHA by the recombinant FLS in an engineered cell as described herein.
The term "nucleotide sequence" or “nucleic acid sequence” as used herein refers to either DNA or RNA. "Nucleic acid sequence" or "polynucleotide sequence" or simply “polynucleotide” refers to a single or double-stranded polymer of deoxyribonucleotide or ribonucleotide bases read from the 5' to the 3' end. It includes expression cassettes, self-replicating plasmids, infectious polymers of DNA or RNA, and non-functional DNA or RNA.
The term “peroxisomal targeting signal” (PTS) as used herein shall refer to short nucleic acid sequences which when linked to or positioned within a coding sequence,
e.g. as a nucleotide sequence encoding a C-terminal peptide or an N-terminal peptide of typically 3-9 amino acids, directs the expression of the expression product to the peroxisome of the host cell. By such a functional PTS, an enzyme can be relocated to the peroxisome. Most organisms including Pichia pastoris have two different targeting systems. The first one (PTS1) uses the receptor Pex5 to achieve targeting to the peroxisome. The second one (PTS2) uses Pex7 as receptor. A functional PTS is an amino acid sequence which is specifically recognized by any of the receptors Pex5 (PTS1) or Pex7 (PTS2), thereby activating the receptor and directing expression of the gene that is fused with such PTS to the host cell peroxisome.
A nucleotide sequence encoding the PTS1 is typically linked to a gene at the 3’- end, such that the PTS is fused at the carboxy terminus of the respective gene expression product. Thereby, the C-terminus of the amino acid sequence of the gene expression product is directly linked to the N-terminus of the PTS.
A nucleotide sequence encoding the PTS2 is typically linked to a gene at the 5’- end or integrated in proximity to the 5’-end, such that the PTS is fused at the amino terminus or close to the amino terminus of the respective gene expression product. Thereby, the N-terminus of the amino acid sequence of the gene expression product is directly linked to the C-terminus of the PTS2.
The following tools can be used to determine targeting signals in a given protein sequence: PTS1 predictor (Neuberger G, Maurer-Stroh S, Eisenhaber B, Hartig A, Eisenhaber F. Motif refinement of the peroxisomal targeting signal 1 and evaluation of taxon-specific differences. J Mol Biol. 2003 May 2;328(3):567-79.), or PTS prediction tool WoLF PSORT (Horton P, Park K-J, Obayashi T et al. WoLF PSORT: protein localization predictor. Nucleic Acids Res 2007;35:W585-7.).
The term "protein of interest” or “(POI)" as used herein refers to a polypeptide or a protein that is produced by means of recombinant technology in a host cell. More specifically, the protein may either be a polypeptide not naturally-occurring in the host cell, i.e. a heterologous protein, or else may be native to the host cell, i.e. a endogenous protein to the host cell, but is produced, for example, by transformation or transfection with a self-replicating vector containing the nucleic acid sequence encoding the POI, or upon integration by recombinant techniques of one or more copies of the nucleic acid sequence encoding the POI into the genome of the host cell, or by recombinant modification of one or more regulatory sequences controlling the expression of the gene encoding the POI, e.g., of the promoter sequence.
The term “sequence identity” of a variant, homologue or orthologue as compared to a parent nucleotide or amino acid sequence indicates the degree of identity of two or more sequences. Two or more amino acid sequences may have the same or conserved amino acid residues at a corresponding position, to a certain degree, up to 100%. Two or more nucleotide sequences may have the same or conserved base pairs at a corresponding position, to a certain degree, up to 100%.
Sequence similarity searching is an effective and reliable strategy for identifying homologs with excess (e.g., at least 50%) sequence identity. Sequence similarity search tools frequently used are e.g., BLAST, FASTA, and HMMER.
Sequence similarity searches can identify such homologous proteins or genes by detecting excess similarity, and statistically significant similarity that reflects common ancestry. Homologues may encompass orthologues, which are herein understood as the same protein in different organisms, e.g., variants of such protein in different different organisms or species.
“Percent (%) amino acid sequence identity” with respect to an amino acid sequence, homologs and orthologues described herein is defined as the percentage of amino acid residues in a candidate sequence that are identical with the amino acid residues in the specific polypeptide sequence, after aligning the sequence and introducing gaps, if necessary, to achieve the maximum percent sequence identity, and not considering any conservative substitutions as part of the sequence identity. Those skilled in the art can determine appropriate parameters for measuring alignment, including any algorithms needed to achieve maximal alignment over the full length of the sequences being compared.
For purposes described herein, the sequence identity between two amino acid sequences can be determined using the NCBI BLAST program version BLASTP 2.8.1 with the following exemplary parameters: Program: blastp, Word size: 6, Expect value: 10, Hitlist size: 100, Gapcosts: 11.1 , Matrix: BLOSUM62, Filter string: F, Compositional adjustment: Conditional compositional score matrix adjustment.
For pairwise protein sequence alignment of two amino acid sequences along their entire length the EMBOSS Needle webserver can be used with default settings (Matrix: EBLOSUM62; Gap open:10; Gap extend: 0.5; End Gap Penalty: false; End Gap Open: 10; End Gap Extend: 0.5). EMBOSS Needle uses the Needleman-Wunsch alignment algorithm to find the optimum alignment (including gaps) of the two input sequences and writes their optimal global sequence alignment to file.
"Percent (%) identity" with respect to a nucleotide sequence e.g., of a coding or non-coding nucleotide sequence, is defined as the percentage of nucleotides in a candidate nucleotide sequence that is identical with the nucleotides in the nucleotide sequence, after aligning the sequence and introducing gaps, if necessary, to achieve the maximum percent sequence identity, and not considering any conservative substitutions as part of the sequence identity. Alignment for purposes of determining percent nucleotide sequence identity can be achieved in various ways that are within the skill in the art, for instance, using publicly available computer software. Those skilled in the art can determine appropriate parameters for measuring alignment, including any algorithms needed to achieve maximal alignment over the full length of the sequences being compared.
For purposes described herein (unless indicated otherwise), the sequence identity between two amino acid sequences can be determined using the NCBI BLAST program version BLASTN 2.8.1 with the following exemplary parameters: Program: blastn, Word size: 11 , Expect threshold: 10, Hitlist size: 100, Gap Costs: 5.2, Match/Mismatch Scores: 2,-3, Filter string: Low complexity regions, Mark for lookup table only.
The term “recombinant” as used herein shall mean “being prepared by or the result of genetic engineering. A “recombinant cell” or “recombinant host cell” is herein understood as a cell or host cell that has been genetically engineered or modified to comprise a nucleic acid sequence which was not native to said cell. A recombinant host may be engineered to delete and/or inactivate one or more nucleotides or nucleotide sequences, and may specifically comprise an expression vector or cloning vector containing a recombinant nucleic acid sequence, in particular employing nucleotide sequence foreign to the host. A recombinant protein is produced by expressing a respective recombinant nucleic acid in a host. The term “recombinant” as used herein with respect to expression products, includes those compounds that are prepared, expressed, created or isolated by recombinant means, such as isolated from a host cell transformed or transfected to express the expression products.
Certain recombinant host cells are “engineered” host cells which are understood as host cells which have been manipulated using genetic engineering, i.e. by human intervention. When a host cell is engineered to express, co-express or overexpress a certain pathway, gene or the respective protein, the host cell is manipulated such that the host cell has the capability to express such pathway, gene and protein, respectively,
to a different extent compared to the host cell under the same condition prior to manipulation, or compared to the host cells which are not engineered such that said gene or protein is expressed, co-expressed or overexpressed.
The term “synthetic” as used herein in the context with a synthetic pathway shall mean a pathway that is recombinant.
As herein described, it was found that the methanol assimilation pathway (Xylulose-Monophosphate cycle, XuMP) is compartmentalized in peroxisomes and has similarities to the Calvin-Benson-Bassham cycle and the pentose-phosphate pathway (RuRmayer et al., 2015). However, this pathway requires at least 10 enzymes to enable the conversion from methanol to DHA. In addition, three (3) moles of ATP are required for the conversion of three moles of (3) formaldehyde to one (1) mol of Dihydroxyacetone phosphate (DHAP). In order to produce ATP a significant amount of methanol has to be dissimilated by a cell such as P. pastoris to CO2 obtaining NADH and thus ATP via the respiratory chain. The formation of dihydroxyacetone phosphate is thus not as energy efficient as the new FLS pathway described herein which can directly form dihydroxyacetone from three (3) moles of formaldehyde. Then only one (1) mol of ATP is required (instead of three mol ATP) to form dihydroxyacetone phosphate, which is an intermediate of glycolysis and thus of the central carbon metabolism. Therefore, a recombinant pathway was implemented into eukaryotic cells, in particular microorganisms, to enable a formaldehyde conversion into biomass via the formose reaction. This can enable an energy efficient conversion of Ci building blocks into carbon molecules with multiple C-C bonds.
According to a specific example, an FLS gene (version FLS-M3) as published by (Cai et al. 2021) was codon optimized for Komagataella phaffii and a PTS1 sequence was added to the C-terminus to enable targeting to peroxisomes of K. phaffii. The coding sequence was placed under the control of a strong methanol inducible promoter (either pAOX1 or pDAS2) and expression cassettes were integrated into a K. phaffii strain which has a deletion in DAS1 and DAS2 (Mut- strain). A Mut' strain is not capable to grow on methanol as carbon source. The presence of the expression cassette can be checked by for instance PCR approaches. Obtained strains can grow on medium with methanol as single carbon source and show formolase activity in a cell-free extract.
The formolase pathway described herein enables a more efficient utilization of a Ci (one carbon) carbon source like methanol. Methanol is an important carbon source to grow methylotrophic yeasts like Pichia pastoris (Komagataella sp.) to produce mainly
proteins and enzymes. Furthermore, the production of single cell proteins for food and feed applications is in the industrial focus again. With a more efficient methanol assimilation a higher yield of the product formed can be achieved (e.g., proteins, organic acids, DNA, RNA, lipids, biomass, ...). Furthermore, as the pathway is more efficient less heat development is expected to occur when this pathway is operated. Cooling bioreactors to e.g., 30°C is cost intensive when an exothermic reaction is taking place.
Formaldehyde is a reactive molecule, which is in many cases cytotoxic to microorganism when formed intracellularly at higher concentrations or without specialized compartmentalization in the cell. According to a specific example, to establish a formolase pathway in vivo, an intracellular compartment of the cell is utilized (or generated), where both formaldehyde is formed and converted by formolase to dihydroxyacetone.
According to a specific aspect, biochemical reactions leading to the formation of formaldehyde are placed into the same compartment as the formose reaction carried out by the enzyme formolase (FLS) to form dihydroxyacetone from formaldehyde. Hereby, a compartment is a subcellular container, which is spatially separating its interior from the surrounding environment by a biochemical barrier. This biochemical barrier can be a lipid membrane consisting of phospholipids (e.g., organelles like peroxisomes, mitochondria, vacuole, endoplasmic reticulum) or an artificial shell formed by protein structures like in bacterial microcompartments.
The FLS pathway as described herein is using formaldehyde as a substrate or intermediate. Preferably, the FLS pathway is spatially localized to the same subcellular compartment like the peroxisome. Doing so other cellular compounds like DNA, RNA or proteins are not modified by formaldehyde in an untargeted way.
The foregoing description will be more fully understood with reference to the following examples. Such examples are, however, merely representative of methods of practicing one or more embodiments of the present invention and should not be read as limiting the scope of invention.
EXAMPLES
Example 1 : Construction of FLS coding sequence with a peroxisomal targeting sequence
An improved version of formolase (FLS), which was described as FLS-M3 (Cai et al., 2021) was targeted to the peroxisome of Pichia pastoris by fusing a peroxisomal targeting sequence to the C-terminus of the protein. The targeting sequence, derived from the C-terminus of the native AOX1 of P. pastoris (represented in the protein dodecamer: LGTYEKTGLARF, SEQ ID NO:19, termed PTS1 here) was fused to the C- terminus of FLS (protein sequence: SEQ ID NO:2) yielding FLS::PTS1. The respective protein sequence was codon optimized for P. pastoris and the resulting DNA sequence was ordered at TWIST Bioscience (South San Francisco, CA, USA) (DNA sequence: SEQ ID NO:1). Alternatively other targeting signals for peroxisomal localization can be added to the protein, which targets the PEX5 receptor (mainly PTS1) or like PTS2 (via PEX5 or PEX7 receptor) or PTS3 (Kempinski et al., 2020). Proteins showing a peroxisomal localization were reported previously together with respective targeting signals (RuRmayer et al., 2015).
The coding sequence was ordered with flanking Bpil (Bbsl) sites and respective fusion sites (FS2, FS3) to be compatible with a Golden Gate cloning pipeline GoldenMOCS described previously (Sarkari et al., 2017).
Example 2: Construction of DNA cassettes and generation of FLS expressing yeast strains in the peroxisome
The genotype of the parental strain (Komagataella phaffii CBS7435 ADAS1, ADAS2, AAOX1 = MUT was determined by amplifying the CDS of DAS1, DAS2, and AOX1 from isolated genomic DNA in a PCR reaction using Q5 High-Fidelity DNA Polymerase (NEB) using SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NQ:10 as the primer oligonucleotide pairs. The sequencing results (SEQ ID NO:13, SEQ ID NO:14, and SEQ ID NO:15) confirmed the deletion of the mentioned genes in the genome of the parental Pichia pastoris MUT' strain. This deletion strain cannot grow on methanol. The methanol assimilation pathway (Xylulose- Monophosphate cycle) is blocked when both DAS1/DAS2 are deleted (Gassier et al., 2020).
In this strain background the fusion protein FLS::PTS1 was expressed using a strong inducible methanol promoter using pDAS2 or pAOX1 by integrating respective expression cassettes into the genome.
The integration cassettes (Figure 1) were constructed using a Golden gate cloning strategy GoldenMOCS and transformed to electro-competent P. pastoris ADAS1 , ADAS2, AA0X1 cells. 5 pg DNA was transformed to each competent cell. The final incubated cells were plated on YPD-Agar plates having 500 pg mL'1 G418 or 200 pg mL-1 Hygromycin as antibiotic selection markers.
After 48 hours of incubation at 28°C the single colonies were picked from plates and were streaked out twice on antibiotic plates (Hygromycin, G418) and once on M2 Media + 0.5% Methanol + 1.5 % Agar.
The transformants that grew on both antibiotic (Hygromycin or G418) and methanol plates were selected. A cell bank of the selected strain was prepared, in 1 mL aliquots and stored at -80°C. The extraction of the genomic DNA in order to check the integration of the genes of interest in the genome was done by breaking the cells down using glass beads in a Bead ruptor device (FastPrep-24, MP Biomedical, 6 m/s) and genomic DNA was isolated using an ethanol precipitation protocol (Promega kit).
The SEQ ID NO:3 and SEQ ID NO:4 were amplified from yeast genomic DNA in PCR reactions using OneTaq DNA polymerase (NEB) with SEQ ID NO:11 and SEQ ID NO:12 as the primer oligonucleotides. The DNA fragments sized 4.6 kb for SEQ ID NO:3 (pDAS2::FLS::HpHr) and 4.3 kb for SEQ ID NO:4 (pAOX1 ::FLS::G418r) were expected. The presence of the mentioned fragments was checked on 1 % Agarose gel (Figure 2). The positive P. pastoris transformants having SEQ ID NO:4 (Figure 2, Lane 1-6) or SEQ ID NO:5 (Figure 2, Lane 7,9-12) integrated into their genomes were selected and used for further experiments.
Example 3: Cultivation of FLS expressing yeast strains on synthetic media containing methanol
One single colony from each of the described P. pastoris transformants from YPD agar plates was taken and inoculated in shake flasks containing 40 mL of YPD medium, cultivated overnight at 28°C in a shaking incubator at 200 rpm. The next day the yeast cells were collected at 5000g, for 5 min and resuspended in 20 mL M2 Media without a carbon source. The optical density (OD) was measured at 600 nm and the cells were inoculated to the 50 mL M2 media (in wide-neck 500 mL flasks with cotton- wools) containing 5 g L'1 methanol to reach the starting OD of 2.5. The flasks were incubated at 28°C at 200 rpm. The growth of the yeast cultures on methanol as the sole
carbon source were monitored for the period of 40 hours by measuring OD at the time points shown in Figure 3. The methanol feeding strategy is shown in Table 1. The obtained growth rate is shown in Table 2.
Strains expressing FLS can grow on methanol as sole carbon source. No growth is obtained for an empty vector control and the parental MUT' strain on methanol. From Figure 4 it can be seen that strains expressing FLS show a dense and budding yeast culture on methanol under the microscope compared to the not growing parental strain.
Media compositions:
YPD Media: 1 % yeast extract, 2% peptone, 2% glucose.
Synthetic Media (M2 Citrate buffered): Per liter; 3.15 g (NH4)2HPO4, 0.49 g MgSO4.7H20 , 0.80 g KCI, 0.0268 g CaCl2.2H2O, 22.0 g citric acid monohydrate, 1.47 mL Trace salt (PTM0), 4 mL biotin (0.1 g L'1), 4 mL thiamine-hydrochloride (5 g L'1). The pH of the media was adjusted at 6 by adding potassium hydroxide pellets.
Trace salt solution (PTM0): Per liter; 5 mL sulphuric acid (95%-98%), 65 g FeSO4.7H2O, 20 g ZnCI2, 6 g CuSO4.5H2O, 3.36 g MnSO4.H2O, 0.82 g COCI2.6H2O, 0.2 g Na2MoO4.2H2O, 0.08 g Nal, 0.02 g H3BO3.
Table 1. Methanol feeding strategy in 40 hours cultivation of P. pastoris strains in synthetic M2/methanol media
Table 2. Comparison of the growth and enzyme activity of P. pastoris MUT- parental strain compared to P. pastoris FLS expressing transformants
Example 4: Enzymatic assay to measure FLS activity in cell free extracts
The enzymatic assay was done by a colorimetric assay protocol (Cai et al., 2021). After cultivation in synthetic media, the yeast cell pellets were collected and resuspended in (HEPES-NaCI buffer, 100 mM pH 7.5). The cells were broken down by adding glass beads in a bead ruptor device (4 m/s). The cell free extract containing FLS enzyme was separated from the cell debris. The enzyme activity assay was carried out in microtiter plates. 50 mL of formaldehyde solution (ranging from 20 to 100 mM) containing 1 mM thiamine pyrophosphate (TPP) was added to 50 pL of cell free extract dilutions. After incubation at 30°C, 60 pL of enzyme solution (0.3 mg mL-1 galactose oxidase, 35 U mL-1 horseradish peroxidase) and 50 pL of 8 mM ABTS solution was added to each well. The plate was subjected to UVA/is measurement at 410 nm using a microtiter plate reader. The blank had the same above-mentioned reaction solutions but 50 pL buffer was added instead of cell free extract. The cell free extract from the parental strain (ADAS1 , ADAS2, AAOX1) was measured as a negative control.
Strains expressing FLS show enzymatic activity for FLS when grown on methanol. No FLS activity is obtained for an empty vector control and the parental MUT' strain.
REFERENCES (non-patent literature)
Cai, T., Sun, H., Qiao, J., Zhu, L., Zhang, F., Zhang, J., Tang, Z., Wei, X., Yang, J., Yuan, Q., Wang, W., Yang, X., Chu, H., Wang, Q., You, C., Ma, H., Sun, Y., Li, Y., Li, C., Jiang, H., Wang, Q., Ma, Y., 2021. Cell-free chemoenzymatic starch synthesis from carbon dioxide. Science, Vol. 373, 1523-1527.
Gassier, T., Sauer, M., Gasser, B., Egermeier, M., Troyer, C., Causon, T., Hann, S., Mattanovich, D., Steiger, M.G., 2020. The industrial yeast Pichia pastoris is converted from a heterotroph into an autotroph capable of growth on CO2. Nature Biotechnology, Vol. 38(2), 210-216.
Kempinski, B., Chelstowska, A., Poznahski, J., Krol, K., Rymer, L., Frydzihska, Z., Girzalsky, W., Skoneczna, A., Erdmann, R., Skoneczny, M., 2020. The Peroxisomal Targeting Signal 3 (PTS3) of the Budding Yeast Acyl-CoA Oxidase Is a Signal Patch. Frontiers in Cell and Developmental Biology, Vol. 8(198), 1-8.
RuRmayer, H., Buchetics, M., Gruber, C., Valli, M., Grillitsch, K., Modarres, G., Guerrasio, R., Klavins, K., Neubauer, S., Drexler, H., Steiger, M., Troyer, C., Al Chalabi, A., Krebiehl, G., Sonntag, D., Zellnig, G., Daum, G., Graf, A.B., Altmann, F., Koellensperger, G., Hann, S., Sauer, M., Mattanovich, D., Gasser, B., 2015. Systems-level organization of yeast methylotrophic lifestyle. BMC Biology, Vol. 13(80), 1-25.
Sarkari, P., Marx, H., Blumhoff, M.L., Mattanovich, D., Sauer, M., Steiger, M.G., 2017. An efficient tool for metabolic pathway construction and gene integration for Aspergillus niger. Bioresource Technology, Vol. 245, 1327- 1333.
Siegel, J.B., Smith, A.L., Poust, S., Wargacki, A.J., Bar-Even, A., Louw, C., Shen, B.W., Eiben, C.B., Tran, H.M., Noor, E., Gallaher, J.L., Bale, J., Yoshikuni, Y., Gelb, M.H., Keasling, J.D., Stoddard, B.L., Lidstrom, M.E., Baker, D., 2015. Computational protein design enables a novel one-carbon assimilation pathway. Proceedings of the National Academy of Sciences U.S.A., Vol. 112(12), 3704-3709.
Van der Klei, I. J., Yurimoto, H., Sakai, Y., Veenhuis, M. 2006. Biochimica et Biophysica Acta, Vol. 1763(12), 1453-1462.
Wang, X., Wang, Y., Liu, J., Li, Q., Zhang, Z., Zheng, P., Lu, F., Sun, J., 2017. Biological conversion of methanol by evolved Escherichia coli carrying a linear methanol assimilation pathway. Bioresources and Bioprocessing, Vol. 4(41).
Claims
1. A eukaryotic cell which is engineered to express a synthetic formolase (FLS) pathway comprising a recombinant FLS to biotransform formaldehyde into dihydroxyacetone.
2. The cell of claim 1 , wherein the FLS pathway further comprises one or more enzymes to biotransform a Ci compound into formaldehyde, preferably wherein the Ci compound is any one of methanol, formate, methane, carbon monoxide, or carbon dioxide.
3. The cell of claim 1 or 2, wherein the FLS pathway further comprises a methanol to formaldehyde converting enzyme (“MFCE”), preferably an alcohol oxidase or a methanol dehydrogenase, to biotransform methanol into formaldehyde.
4. The cell of any one of claims 1 to 3, which is: a) a yeast cell, preferably of a genus selected from the group consisting of Pichia, Komagataella, Hansenula, Saccharomyces, Kluyveromyces, Candida, Ogataea, Yarrowia, and Geotrichum, preferably Pichia pastoris, Komagataella phaffii, Komagataella pastoris, Komagataella pseudopastoris, Saccharomyces cerevisiae, Ogataea minuta, Kluyveromces lactis, Kluyveromyes marxianus, Yarrowia lipolytica or Hansenula polymorpha, preferably a methylotrophic yeast; b) a cell of filamentous fungi, preferably of a genus selected from the group consisting of Aspergillus, Penicillium, Trichoderma, Neurospora, Rhizopus preferably Aspergillus niger, Aspergillus awamori Aspergillus terreus, Aspergillus oryzae, Aspergillus nidulans, Neurospora crassa, Rhizopus oryzae, Penicillium chrysogenum or Trichoderma reeser, c) a non-human primate, human, rodent, or bovine cell, preferably a mouse myeloma (NSO)-cell line, a Chinese hamster ovary (CHO)-cell line, HT1080, H9, HepG2, MCF7, MDBK Jurkat, MDCK, NIH3T3, PC12, BHK (baby hamster kidney cell), VERO, SP2/0, YB2/0, YO, C127, L cell, COS, QC1-3, HEK-293, PER.C6, HeLA, EBI, EB2, EB3, oncolytic or hybridoma-cell line; d) an insect cell, preferably a Sf9, Mimic™ Sf9, Sf21 , High Five (BT1-TN-5B1- 4), or BT1-Ea88 cell;
e) an algae cell, preferably of the genus Amphora, Bacillariophyceae, Dunaliella, Chlorella, Chlamydomonas, Cyanophyta (cyanobacteria), Nannochloropsis, Spirulina, or Ochromonas); or f) a plant cell, preferably a cell from monocotyledonous plants, preferably maize, rice, wheat, or Setaria, or from a dicotyledonous plant, preferably cassava, potato, soybean, tomato, tobacco, alfalfa, Physcomitrella patens or Arabidopsis.
5. The cell of any one of claims 1 to 4, wherein the FLS pathway is comprised in an organelle of the cell, preferably a peroxisome or mitochondria.
6. The cell of any one of claims 1 to 5, wherein the FLS is expressed as an FLS fusion protein that comprises the FLS fused to a peroxisomal targeting signal (PTS), preferably wherein the PTS comprises or consists of an amino acid sequence selected from the group consisting of SKL, SEQ ID NO:17, 18, 19, 20, 21 , 22, 23, 24, 25, and 26.
7. The cell of any one of claims 1 to 6, wherein the FLS comprises of consists of SEQ ID NO:31 or SEQ ID NO:32, or a functional variant of any of the foregoing, preferably wherein a) the functional variant of SEQ ID NO:31 comprises at least 1 , 2, 3, 4, 5, 6, or 7 (all) of the point mutations A28L, W89R, R188H, N283H, A394G, G419N, and A480W, preferably at least 3, 4, or 5 mutations selected from W89R, R188H, A394G, G419N, or A480W, point mutations as compared to SEQ ID NO:33, and at least at least any one of 90%, 95%, 96%, 97%, 98%, 99% or 100% sequence identity to SEQ ID NO:31 ; b) the functional variant of SEQ ID NO:32 comprises at least 1 , 2, 3, 4, 5, 6, or 7 (all) of the point mutations A28I, W89R, L90T, R188H, A394G, G419N, and A480W, preferably at least 3, 4, or 5 mutations selected from W89R, R188H, A394G, G419N, or A480W, point mutations as compared to SEQ ID NO:33, and at least at least any one of 90%, 95%, 96%, 97%, 98%, 99% or 100% sequence identity to SEQ ID NO:32.
8. The cell of any one of claims 1 to 7, which is an engineered Mut' yeast, preferably engineered to reduce expression of a di hydroxyacetone synthase (DAS)
compared to an endogenous expression thereof, preferably by a deletion of any one or both of the DAS1 and DAS2 genes.
9. The cell of any one of claims 1 to 8, which is capable of utilizing a Ci compound as the sole carbon source for producing biomass and/or an expression product.
10. A method of culturing the cell of any of claims 1 to 9 in a cell culture using a Ci compound as a carbon source, thereby obtaining biomass and/or an expression product in the cell culture, preferably wherein the Ci compound is the sole carbon source.
11. The method of claim 10, wherein the expression product is a protein of interest (POI) or a metabolite, preferably wherein: a) the POI is a peptide or protein selected from the group consisting of an antigen-binding protein, a therapeutic protein, an enzyme, a peptide, a protein antibiotic, a toxin fusion protein, a carbohydrate - protein conjugate, a structural protein, a regulatory protein, a vaccine antigen, a growth factor, a hormone, a cytokine, a process enzyme; b) the metabolite is an amino acid, organic acid, alcohol, sugar alcohol, carbohydrate, vitamin, amine, aldehyde, ketone, or a polyhydroxyketone, preferably wherein the metabolite is selected from the group consisting of lactic acid, citric acid, propionic acid, butyric acid, valeric acid, hexanoic acid, adipic acid, succinic acid, fumaric acid, malic acid, 2,5-furan dicarboxylic acid, aspartic acid, glucaric acid, gluconic acid, glutamic acid, itaconic acid, levulinic acid, acrylic acid, 3-hydroxy propionic acid, ethanol, propanol, isopropanol, butanol, pentanol, hexanol, heptanol, octanol, butanediol, 1 ,3-propanediol, 1 ,2-propanediol, 2-amino-1 ,3-propanediol, 3- hydroxybutyrate, poly-3-hydroxybutyrate, 3-hydroxy propionaldehyde, 3- hydroxybutyrolactone, xylitol, arabinitol, sorbitol, mannitol, vitamin C, riboflavin, thiamine, tocopherol, cobalamin, pantothenic acid, biotin, pyridoxine, niacin, folic acid, diamino pentane, diamino hexane and dihydroxyacetone.
12. A method of producing biomass and/or an expression product in a cell culture, using a cell of any one of claims 1 to 9 and a Ci compound, preferably wherein the Ci compound is the sole carbon source in the cell culture.
13. A fusion protein comprising a formolase (FLS) that is fused to a peroxisomal targeting signal (PTS), preferably wherein the fusion protein is characterized by one or more of the following features: a) the PTS comprises or consists of an amino acid sequence selected from the group consisting of SKL, SEQ ID NO:17, 18, 19, 20, 21 , 22, 23, 24, 25, and 26; b) the FLS comprises of consists of SEQ ID NO:31 or SEQ ID NO:32, or a functional variant of any of the foregoing, preferably wherein i) the functional variant of SEQ ID NO:31 comprises at least 1 , 2, 3, 4, 5, 6, or 7 (all) of the point mutations A28L, W89R, R188H, N283H, A394G, G419N, and A480W, preferably at least 3, 4, or 5 mutations selected from W89R, R188H, A394G, G419N, or A480W, point mutations as compared to SEQ ID NO:33, and at least at least any one of 90%, 95%, 96%, 97%, 98%, 99% or 100% sequence identity to SEQ ID NO:31 ; ii) the functional variant of SEQ ID NO:32 comprises at least 1 , 2, 3, 4, 5, 6, or 7 (all) of the point mutations A28I, W89R, L90T, R188H, A394G, G419N, and A480W, preferably at least 3, 4, or 5 mutations selected from W89R, R188H, A394G, G419N, or A480W, point mutations as compared to SEQ ID NO:33, and at least at least any one of 90%, 95%, 96%, 97%, 98%, 99% or 100% sequence identity to SEQ ID NO:32.
14. A nucleic acid molecule encoding the fusion protein of claim 13, preferably wherein the nucleic acid molecule is codon-optimized for expression in a eukaryotic host cell.
15. An expression cassette comprising the nucleic acid molecule of claim 14 and regulatory sequences to express the fusion protein in a eukaryotic host cell.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
ATA50597/2022A AT526405A1 (en) | 2022-08-08 | 2022-08-08 | Synthetic formolase pathway |
ATA50597/2022 | 2022-08-08 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2024033303A1 true WO2024033303A1 (en) | 2024-02-15 |
Family
ID=87747792
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/EP2023/071821 WO2024033303A1 (en) | 2022-08-08 | 2023-08-07 | Synthetic formolase pathway |
Country Status (2)
Country | Link |
---|---|
AT (1) | AT526405A1 (en) |
WO (1) | WO2024033303A1 (en) |
Citations (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2012037413A2 (en) | 2010-09-15 | 2012-03-22 | University Of Washington Through Its Center For Commercialization | Systems and methods for biotransformation of carbon dioxide into higher carbon compounds |
WO2012170292A1 (en) | 2011-06-07 | 2012-12-13 | Merck Sharp & Dohme Corp. | Modulation of metabolic pathways for improving bioprocess performance and secreted protein productivity of yeast |
CN107475281A (en) | 2017-09-06 | 2017-12-15 | 福建农林大学 | A kind of bioconversion methanol metabolic pathway |
KR20190074917A (en) * | 2017-12-20 | 2019-06-28 | 건국대학교 산학협력단 | A method for producing acetoin from ethanol |
KR20190088648A (en) * | 2018-01-19 | 2019-07-29 | 건국대학교 산학협력단 | A method for producing butandiol from ethanol |
CN110438169A (en) | 2019-08-21 | 2019-11-12 | 福建农林大学 | A kind of method of whole-cell catalytic synthesis 1- hydroxy-2-butanone |
US20200181629A1 (en) | 2017-05-31 | 2020-06-11 | Universität Für Bodenkultur Wien | Yeast expressing a synthetic calvin cycle |
KR20210038805A (en) | 2019-09-30 | 2021-04-08 | 서강대학교산학협력단 | Methylotroph with enhanced 1,2-propylene glycol productivity and method for producing 1,2-propylene glycol using the same |
CN113122525A (en) | 2020-01-16 | 2021-07-16 | 中国科学院天津工业生物技术研究所 | Formaldehyde conversion protein and application thereof |
CN113136377A (en) * | 2020-01-19 | 2021-07-20 | 中国科学院天津工业生物技术研究所 | Glycanase and application thereof in ligustrazine biosynthesis |
CN113151230A (en) | 2021-03-26 | 2021-07-23 | 北京化工大学 | Mutant protein of formaldehyde lyase and application thereof |
WO2022042427A1 (en) | 2020-08-24 | 2022-03-03 | 中国科学院天津工业生物技术研究所 | Starch biosynthesis method |
-
2022
- 2022-08-08 AT ATA50597/2022A patent/AT526405A1/en unknown
-
2023
- 2023-08-07 WO PCT/EP2023/071821 patent/WO2024033303A1/en unknown
Patent Citations (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20130196359A1 (en) | 2010-09-15 | 2013-08-01 | University Of Washington Through Its Center For Commercialization | Systems and methods for biotransformation of carbon dioxide into higher carbon compounds |
WO2012037413A2 (en) | 2010-09-15 | 2012-03-22 | University Of Washington Through Its Center For Commercialization | Systems and methods for biotransformation of carbon dioxide into higher carbon compounds |
WO2012170292A1 (en) | 2011-06-07 | 2012-12-13 | Merck Sharp & Dohme Corp. | Modulation of metabolic pathways for improving bioprocess performance and secreted protein productivity of yeast |
US20200181629A1 (en) | 2017-05-31 | 2020-06-11 | Universität Für Bodenkultur Wien | Yeast expressing a synthetic calvin cycle |
CN107475281A (en) | 2017-09-06 | 2017-12-15 | 福建农林大学 | A kind of bioconversion methanol metabolic pathway |
KR20190074917A (en) * | 2017-12-20 | 2019-06-28 | 건국대학교 산학협력단 | A method for producing acetoin from ethanol |
KR20190088648A (en) * | 2018-01-19 | 2019-07-29 | 건국대학교 산학협력단 | A method for producing butandiol from ethanol |
CN110438169A (en) | 2019-08-21 | 2019-11-12 | 福建农林大学 | A kind of method of whole-cell catalytic synthesis 1- hydroxy-2-butanone |
KR20210038805A (en) | 2019-09-30 | 2021-04-08 | 서강대학교산학협력단 | Methylotroph with enhanced 1,2-propylene glycol productivity and method for producing 1,2-propylene glycol using the same |
CN113122525A (en) | 2020-01-16 | 2021-07-16 | 中国科学院天津工业生物技术研究所 | Formaldehyde conversion protein and application thereof |
CN113136377A (en) * | 2020-01-19 | 2021-07-20 | 中国科学院天津工业生物技术研究所 | Glycanase and application thereof in ligustrazine biosynthesis |
WO2022042427A1 (en) | 2020-08-24 | 2022-03-03 | 中国科学院天津工业生物技术研究所 | Starch biosynthesis method |
CN113151230A (en) | 2021-03-26 | 2021-07-23 | 北京化工大学 | Mutant protein of formaldehyde lyase and application thereof |
Non-Patent Citations (16)
Title |
---|
"Lewin ''Genes ''V", 1990, OXFORD UNIVERSITY PRESS |
B.W., EIBENC.B., TRANH.M., NOORE., GALLAHERJ.L., BALEJ., YOSHIKUNIY., GELBM.H., KEASLINGJ.D., STODDARDB.L., LIDSTROM: "Computational protein design enables a novel one-carbon assimilation pathway", PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES U.S.A., vol. 112, no. 12, 2015, pages 3704 - 3709, XP093071857, DOI: 10.1073/pnas.1500545112 |
CAI, T., SUN, H., QIAO, J., ZHU, L., ZHANG, F., ZHANG, J., TANG, Z., WEI, X., YANG, J. ,YUAN, Q., WANG, W., YANG, X., CHU, H., WAN: "Cell-free chemoenzymatic starch synthesis from carbon dioxide.", SCIENCE, vol. 373, 2021, pages 1523 - 1527, XP055903515, DOI: 10.1126/science.abh4049 |
GASSIER, T., SAUER, M., GASSER, B., EGERMEIER, M., TROYER, C., CAUSON, T., HANN, S., MATTANOVICH, D., STEIGER, M.G.: "The industrial yeast Pichia pastoris is converted from a heterotroph into an autotroph capable of growth on CO2.", NATURE BIOTECHNOLOGY, vol. 38, no. 2, 2020, pages 210 - 216 |
GIRZALSKY, W.SKONECZNA, AERDMANN, R.SKONECZNY, M.: "The Peroxisomal Targeting Signal 3 (PTS3) of the Budding Yeast Acyl-CoA Oxidase Is a Signal Patch", FRONTIERS IN CELL AND DEVELOPMENTAL BIOLOGY, vol. 8, no. 198, 2020, pages 1 - 8 |
GUERRASIO, R.KLAVINS, K.NEUBAUER, SDREXLER, H.STEIGER, M.TROYER, C.AL CHALABIA., KREBIEHLG., SONNTAGD., ZELLNIG: "Systems-level organization of yeast methylotrophic lifestyle", BMC BIOLOGY, vol. 13, no. 80, 2015, pages 1 - 25 |
HANNES RUSSMAYER ET AL: "Systems-level organization of yeast methylotrophic lifestyle", BMC BIOLOGY, vol. 13, no. 1, 23 September 2015 (2015-09-23), XP055589013, DOI: 10.1186/s12915-015-0186-5 * |
HORTON PPARK K-JOBAYASHI T ET AL.: "WoLF PSORT: protein localization predictor", NUCLEIC ACIDS RES, vol. 35, 2007, pages 585 - 7 |
J. SAMBROOK ET AL.: "Molecular Cloning: A Laboratory Manual", 2001, COLD SPRING HARBOR LABORATORY |
NEUBERGER GMAURER-STROH SEISENHABER BHARTIG AEISENHABER F: "Motif refinement of the peroxisomal targeting signal 1 and evaluation of taxon-specific differences", J MOL BIOL., vol. 328, no. 3, 2 May 2003 (2003-05-02), pages 567 - 79, XP004454193, DOI: 10.1016/S0022-2836(03)00318-8 |
PRIELHOFER ET AL., BMC SYS BIOL, vol. 11, 2017, pages 123 |
SARKARI, P., MARX, H., BLUMHOFF, M.L., MATTANOVICH, D., SAUER, M., STEIGER, M.G.: "An efficient tool for metabolic pathway construction and gene integration for Aspergillus niger", BIORESOURCE TECHNOLOGY, vol. 245, 2017, pages 1327 - 1333, XP085238051, DOI: 10.1016/j.biortech.2017.05.004 |
SIEGEL JUSTIN B. ET AL: "Computational protein design enables a novel one-carbon assimilation pathway", PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES, vol. 112, no. 12, 9 March 2015 (2015-03-09), pages 3704 - 3709, XP093071857, ISSN: 0027-8424, DOI: 10.1073/pnas.1500545112 * |
VAN DER KLEII.J., YURIMOTOH., SAKAIY., VEENHUIS, M, BIOCHIMICA ET BIOPHYSICA ACTA, vol. 1763, no. 12, 2006, pages 1453 - 1462 |
WANG ET AL., BIORESOUR. BIOPROCESS, vol. 4, 2017, pages 41 |
WANG, X., WANG, Y., LIU, J., LI, Q., ZHANG, Z., ZHENG, P., LU, F., SUN, J.: "Biological conversion of methanol by evolved Escherichia coli carrying a linear methanol assimilation pathway", BIORESOURCES AND BIOPROCESSING, no. 41, pages 4 |
Also Published As
Publication number | Publication date |
---|---|
AT526405A1 (en) | 2024-02-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20080090273A1 (en) | Malic Acid Production in Recombinant Yeast | |
US10947571B2 (en) | Method for producing lactic acid in recombinant yeast cell culture | |
US20110045559A1 (en) | Malic acid production in recombinant yeast | |
JP4963488B2 (en) | Mutant yeast and substance production method using the same | |
JP2022025108A (en) | Fungal production of fdca | |
FI125136B (en) | Eukaryotic cells and process for the production of glycolic acid | |
WO2010111344A2 (en) | Methods and microorganisms for production of c4-dicarboxylic acids | |
KR101616171B1 (en) | Use of monascus in organic acid production | |
EP2807262A2 (en) | Methods for succinate production | |
CN112204146A (en) | Acid-tolerant yeast having inhibited ethanol production pathway and method for producing lactic acid using the same | |
CN110651034A (en) | FDCA-decarboxylated monooxygenase-deficient host cells for the production of FDCA | |
CN112789353A (en) | Recombinant acid-resistant yeast inhibiting ethanol production and method for preparing lactic acid using same | |
WO2009065777A1 (en) | Dicarboxylic acid production in a filamentous fungus | |
CA2931591C (en) | Fumarate reductases | |
WO2024033303A1 (en) | Synthetic formolase pathway | |
US9598709B2 (en) | Genetically engineered and stress resistant yeast cell with enhanced MSN2 activity and method of producing lactate using the same | |
US20130309736A1 (en) | Compositions and methods for malate and fumarate production | |
WO2012103263A2 (en) | Compositions and methods for malate and fumarate production | |
JP2009268407A (en) | Highly efficient method for producing lactic acid by candida boidinii | |
KR101392968B1 (en) | Novel autonomously replicating sequence and uses thereof | |
CA3008963A1 (en) | Host cells for dicarboxylic acid production |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 23757520 Country of ref document: EP Kind code of ref document: A1 |