CN117897490A - Recombinant yeast cells - Google Patents
Recombinant yeast cells Download PDFInfo
- Publication number
- CN117897490A CN117897490A CN202280059116.5A CN202280059116A CN117897490A CN 117897490 A CN117897490 A CN 117897490A CN 202280059116 A CN202280059116 A CN 202280059116A CN 117897490 A CN117897490 A CN 117897490A
- Authority
- CN
- China
- Prior art keywords
- seq
- acid sequence
- nucleic acid
- protein
- activity
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 210000005253 yeast cell Anatomy 0.000 title claims abstract description 264
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 412
- 150000007523 nucleic acids Chemical group 0.000 claims abstract description 385
- 108091028043 Nucleic acid sequence Proteins 0.000 claims abstract description 279
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 270
- 230000000694 effects Effects 0.000 claims abstract description 223
- 108010043652 Transketolase Proteins 0.000 claims abstract description 193
- 102000014701 Transketolase Human genes 0.000 claims abstract description 184
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 224
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 claims description 125
- 102000004190 Enzymes Human genes 0.000 claims description 115
- 108090000790 Enzymes Proteins 0.000 claims description 115
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 claims description 106
- 108090000818 Nitrite reductase (NAD(P)H) Proteins 0.000 claims description 97
- 230000037430 deletion Effects 0.000 claims description 89
- 238000012217 deletion Methods 0.000 claims description 89
- 238000006467 substitution reaction Methods 0.000 claims description 86
- 238000003780 insertion Methods 0.000 claims description 83
- 230000037431 insertion Effects 0.000 claims description 83
- 108010015895 Glycerone kinase Proteins 0.000 claims description 72
- 230000001419 dependent effect Effects 0.000 claims description 70
- 230000014509 gene expression Effects 0.000 claims description 66
- 108010003581 Ribulose-bisphosphate carboxylase Proteins 0.000 claims description 62
- 238000000034 method Methods 0.000 claims description 58
- 230000035772 mutation Effects 0.000 claims description 57
- 101000892220 Geobacillus thermodenitrificans (strain NG80-2) Long-chain-alcohol dehydrogenase 1 Proteins 0.000 claims description 55
- 108010006519 Molecular Chaperones Proteins 0.000 claims description 47
- 108010081577 aldehyde dehydrogenase (NAD(P)+) Proteins 0.000 claims description 42
- 150000001413 amino acids Chemical class 0.000 claims description 42
- 108010004621 phosphoketolase Proteins 0.000 claims description 41
- 230000000397 acetylating effect Effects 0.000 claims description 40
- 108090000849 Nitrate reductase (NADH) Proteins 0.000 claims description 39
- 102000005431 Molecular Chaperones Human genes 0.000 claims description 36
- IOVCWXUNBOPUCH-UHFFFAOYSA-M Nitrite anion Chemical compound [O-]N=O IOVCWXUNBOPUCH-UHFFFAOYSA-M 0.000 claims description 32
- 108700023175 Phosphate acetyltransferases Proteins 0.000 claims description 32
- 108010080971 phosphoribulokinase Proteins 0.000 claims description 29
- 229910052799 carbon Inorganic materials 0.000 claims description 28
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 claims description 27
- 229910002651 NO3 Inorganic materials 0.000 claims description 26
- NHNBFGGVMKEFGY-UHFFFAOYSA-N Nitrate Chemical compound [O-][N+]([O-])=O NHNBFGGVMKEFGY-UHFFFAOYSA-N 0.000 claims description 26
- 238000012239 gene modification Methods 0.000 claims description 26
- 230000005017 genetic modification Effects 0.000 claims description 26
- 235000013617 genetically modified food Nutrition 0.000 claims description 26
- 108010092060 Acetate kinase Proteins 0.000 claims description 25
- 108090000417 Oxygenases Proteins 0.000 claims description 25
- 102000004020 Oxygenases Human genes 0.000 claims description 25
- YAHZABJORDUQGO-NQXXGFSBSA-N D-ribulose 1,5-bisphosphate Chemical compound OP(=O)(O)OC[C@@H](O)[C@@H](O)C(=O)COP(O)(O)=O YAHZABJORDUQGO-NQXXGFSBSA-N 0.000 claims description 23
- 108010073178 Glucan 1,4-alpha-Glucosidase Proteins 0.000 claims description 22
- 102100022624 Glucoamylase Human genes 0.000 claims description 22
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 claims description 22
- 239000008103 glucose Substances 0.000 claims description 21
- 238000004519 manufacturing process Methods 0.000 claims description 19
- 150000001720 carbohydrates Chemical class 0.000 claims description 16
- 108010041921 Glycerolphosphate Dehydrogenase Proteins 0.000 claims description 14
- 102000000587 Glycerolphosphate Dehydrogenase Human genes 0.000 claims description 13
- 230000037353 metabolic pathway Effects 0.000 claims description 12
- AWUCVROLDVIAJX-GSVOUGTGSA-N sn-glycerol 3-phosphate Chemical compound OC[C@@H](O)COP(O)(O)=O AWUCVROLDVIAJX-GSVOUGTGSA-N 0.000 claims description 8
- 101150059556 ANB1 gene Proteins 0.000 claims description 5
- 108090000608 Phosphoric Monoester Hydrolases Proteins 0.000 claims description 5
- 102000004160 Phosphoric Monoester Hydrolases Human genes 0.000 claims description 4
- AWUCVROLDVIAJX-UHFFFAOYSA-N alpha-glycerophosphate Natural products OCC(O)COP(O)(O)=O AWUCVROLDVIAJX-UHFFFAOYSA-N 0.000 claims description 4
- 230000001131 transforming effect Effects 0.000 claims description 4
- 102000040811 transporter activity Human genes 0.000 claims description 4
- 108091092194 transporter activity Proteins 0.000 claims description 4
- 102100033448 Lysosomal alpha-glucosidase Human genes 0.000 claims description 2
- 235000018102 proteins Nutrition 0.000 description 244
- 210000004027 cell Anatomy 0.000 description 158
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 132
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 126
- 229940088598 enzyme Drugs 0.000 description 111
- 239000002773 nucleotide Substances 0.000 description 72
- 125000003729 nucleotide group Chemical group 0.000 description 72
- 229940081969 saccharomyces cerevisiae Drugs 0.000 description 69
- 235000001014 amino acid Nutrition 0.000 description 59
- 108010021809 Alcohol dehydrogenase Proteins 0.000 description 44
- BAWFJGJZGIEFAR-NNYOXOHSSA-N NAD zwitterion Chemical compound NC(=O)C1=CC=C[N+]([C@H]2[C@@H]([C@H](O)[C@@H](COP([O-])(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 BAWFJGJZGIEFAR-NNYOXOHSSA-N 0.000 description 43
- 229950006238 nadide Drugs 0.000 description 43
- 102000007698 Alcohol dehydrogenase Human genes 0.000 description 42
- 229940024606 amino acid Drugs 0.000 description 42
- 238000000855 fermentation Methods 0.000 description 41
- 108090000765 processed proteins & peptides Proteins 0.000 description 41
- 241000588724 Escherichia coli Species 0.000 description 39
- 229920001184 polypeptide Polymers 0.000 description 39
- 102000004196 processed proteins & peptides Human genes 0.000 description 39
- 230000004151 fermentation Effects 0.000 description 38
- 102000039446 nucleic acids Human genes 0.000 description 38
- 108020004707 nucleic acids Proteins 0.000 description 38
- XJLXINKUBYWONI-DQQFMEOOSA-N [[(2r,3r,4r,5r)-5-(6-aminopurin-9-yl)-3-hydroxy-4-phosphonooxyoxolan-2-yl]methoxy-hydroxyphosphoryl] [(2s,3r,4s,5s)-5-(3-carbamoylpyridin-1-ium-1-yl)-3,4-dihydroxyoxolan-2-yl]methyl phosphate Chemical compound NC(=O)C1=CC=C[N+]([C@@H]2[C@H]([C@@H](O)[C@H](COP([O-])(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](OP(O)(O)=O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 XJLXINKUBYWONI-DQQFMEOOSA-N 0.000 description 35
- BAWFJGJZGIEFAR-NNYOXOHSSA-O NAD(+) Chemical compound NC(=O)C1=CC=C[N+]([C@H]2[C@@H]([C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 BAWFJGJZGIEFAR-NNYOXOHSSA-O 0.000 description 34
- 108010049926 Acetate-CoA ligase Proteins 0.000 description 30
- 238000006243 chemical reaction Methods 0.000 description 30
- 108010078791 Carrier Proteins Proteins 0.000 description 29
- 102100035709 Acetyl-coenzyme A synthetase, cytoplasmic Human genes 0.000 description 28
- SRBFZHDQGSBBOR-IOVATXLUSA-N D-xylopyranose Chemical compound O[C@@H]1COC(O)[C@H](O)[C@H]1O SRBFZHDQGSBBOR-IOVATXLUSA-N 0.000 description 26
- 102000004195 Isomerases Human genes 0.000 description 23
- 108090000769 Isomerases Proteins 0.000 description 23
- FNZLKVNUWIIPSJ-UHFFFAOYSA-N Rbl5P Natural products OCC(=O)C(O)C(O)COP(O)(O)=O FNZLKVNUWIIPSJ-UHFFFAOYSA-N 0.000 description 22
- 108020004530 Transaldolase Proteins 0.000 description 22
- 102100028601 Transaldolase Human genes 0.000 description 22
- 101000689035 Mus musculus Ribulose-phosphate 3-epimerase Proteins 0.000 description 21
- 101000729343 Oryza sativa subsp. japonica Ribulose-phosphate 3-epimerase, cytoplasmic isoform Proteins 0.000 description 21
- IKHGUXGNUITLKF-UHFFFAOYSA-N Acetaldehyde Chemical compound CC=O IKHGUXGNUITLKF-UHFFFAOYSA-N 0.000 description 20
- 108020004705 Codon Proteins 0.000 description 20
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 20
- 241000282326 Felis catus Species 0.000 description 18
- 241000894007 species Species 0.000 description 18
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 17
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 17
- 230000004108 pentose phosphate pathway Effects 0.000 description 17
- 108091033319 polynucleotide Proteins 0.000 description 17
- 102000040430 polynucleotide Human genes 0.000 description 17
- 239000002157 polynucleotide Substances 0.000 description 17
- 102100024341 10 kDa heat shock protein, mitochondrial Human genes 0.000 description 16
- 240000008042 Zea mays Species 0.000 description 16
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 16
- 102100038222 60 kDa heat shock protein, mitochondrial Human genes 0.000 description 15
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 15
- 235000014633 carbohydrates Nutrition 0.000 description 15
- 235000005822 corn Nutrition 0.000 description 15
- 229930027945 nicotinamide-adenine dinucleotide Natural products 0.000 description 15
- RXKJFZQQPQGTFL-UHFFFAOYSA-N Dihydroxyacetone Natural products OCC(=O)CO RXKJFZQQPQGTFL-UHFFFAOYSA-N 0.000 description 14
- -1 RKI Proteins 0.000 description 14
- 241000235070 Saccharomyces Species 0.000 description 14
- ZSLZBFCDCINBPY-ZSJPKINUSA-N acetyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 ZSLZBFCDCINBPY-ZSJPKINUSA-N 0.000 description 14
- 230000015572 biosynthetic process Effects 0.000 description 14
- 230000003197 catalytic effect Effects 0.000 description 14
- 230000037361 pathway Effects 0.000 description 14
- 230000008569 process Effects 0.000 description 14
- 235000000346 sugar Nutrition 0.000 description 14
- 238000013518 transcription Methods 0.000 description 14
- 108010059013 Chaperonin 10 Proteins 0.000 description 13
- 108010058432 Chaperonin 60 Proteins 0.000 description 13
- 108700040097 Glycerol dehydrogenases Proteins 0.000 description 13
- 108090000913 Nitrate Reductases Proteins 0.000 description 13
- 230000001588 bifunctional effect Effects 0.000 description 13
- 230000002018 overexpression Effects 0.000 description 13
- 239000000047 product Substances 0.000 description 13
- 230000035897 transcription Effects 0.000 description 13
- 230000004907 flux Effects 0.000 description 12
- 230000009467 reduction Effects 0.000 description 12
- 239000013598 vector Substances 0.000 description 12
- 102100026859 FAD-AMP lyase (cyclizing) Human genes 0.000 description 11
- 108010025915 Nitrite Reductases Proteins 0.000 description 11
- 229910019142 PO4 Inorganic materials 0.000 description 11
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 11
- 230000002068 genetic effect Effects 0.000 description 11
- 244000005700 microbiome Species 0.000 description 11
- 229910052760 oxygen Inorganic materials 0.000 description 11
- 239000001301 oxygen Substances 0.000 description 11
- 239000010452 phosphate Substances 0.000 description 11
- QGZKDVFQNNGYKY-UHFFFAOYSA-N Ammonia Chemical compound N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 description 10
- 101100356529 Candida albicans (strain SC5314 / ATCC MYA-2876) RFG1 gene Proteins 0.000 description 10
- 101100361174 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) ROX1 gene Proteins 0.000 description 10
- 241000235346 Schizosaccharomyces Species 0.000 description 10
- 150000001875 compounds Chemical class 0.000 description 10
- 239000000243 solution Substances 0.000 description 10
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 9
- 241000722885 Brettanomyces Species 0.000 description 9
- 101710088194 Dehydrogenase Proteins 0.000 description 9
- 241000589516 Pseudomonas Species 0.000 description 9
- 238000009396 hybridization Methods 0.000 description 9
- 230000001965 increasing effect Effects 0.000 description 9
- 229920001282 polysaccharide Polymers 0.000 description 9
- 239000005017 polysaccharide Substances 0.000 description 9
- 150000004804 polysaccharides Chemical class 0.000 description 9
- 230000002829 reductive effect Effects 0.000 description 9
- 238000003786 synthesis reaction Methods 0.000 description 9
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 8
- ZKHQWZAMYRWXGA-KQYNXXCUSA-J ATP(4-) Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)[C@H]1O ZKHQWZAMYRWXGA-KQYNXXCUSA-J 0.000 description 8
- ZKHQWZAMYRWXGA-UHFFFAOYSA-N Adenosine triphosphate Natural products C1=NC=2C(N)=NC=NC=2N1C1OC(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)C(O)C1O ZKHQWZAMYRWXGA-UHFFFAOYSA-N 0.000 description 8
- 239000002028 Biomass Substances 0.000 description 8
- 241000195649 Chlorella <Chlorellales> Species 0.000 description 8
- 108700010070 Codon Usage Proteins 0.000 description 8
- FNZLKVNUWIIPSJ-UHNVWZDZSA-N D-ribulose 5-phosphate Chemical compound OCC(=O)[C@H](O)[C@H](O)COP(O)(O)=O FNZLKVNUWIIPSJ-UHNVWZDZSA-N 0.000 description 8
- 108020004414 DNA Proteins 0.000 description 8
- 102000004316 Oxidoreductases Human genes 0.000 description 8
- 108090000854 Oxidoreductases Proteins 0.000 description 8
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 8
- 101100298818 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) RPT3 gene Proteins 0.000 description 8
- 230000001105 regulatory effect Effects 0.000 description 8
- 239000010902 straw Substances 0.000 description 8
- 108700001448 Aldo-keto reductase family 1 member A1 Proteins 0.000 description 7
- 241000351920 Aspergillus nidulans Species 0.000 description 7
- CURLTUGMZLYLDI-UHFFFAOYSA-N Carbon dioxide Chemical compound O=C=O CURLTUGMZLYLDI-UHFFFAOYSA-N 0.000 description 7
- 108700035271 EC 1.1.1.2 Proteins 0.000 description 7
- 102100036669 Glycerol-3-phosphate dehydrogenase [NAD(+)], cytoplasmic Human genes 0.000 description 7
- 240000006024 Lactobacillus plantarum Species 0.000 description 7
- 241000320412 Ogataea angusta Species 0.000 description 7
- 108091000080 Phosphotransferase Proteins 0.000 description 7
- 102000001253 Protein Kinase Human genes 0.000 description 7
- 108010076504 Protein Sorting Signals Proteins 0.000 description 7
- 108020004511 Recombinant DNA Proteins 0.000 description 7
- 101100330447 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) DAN1 gene Proteins 0.000 description 7
- 239000006227 byproduct Substances 0.000 description 7
- 230000006652 catabolic pathway Effects 0.000 description 7
- 230000006870 function Effects 0.000 description 7
- 239000000413 hydrolysate Substances 0.000 description 7
- 230000007062 hydrolysis Effects 0.000 description 7
- 238000006460 hydrolysis reaction Methods 0.000 description 7
- 239000000463 material Substances 0.000 description 7
- 238000002703 mutagenesis Methods 0.000 description 7
- 231100000350 mutagenesis Toxicity 0.000 description 7
- 102000020233 phosphotransferase Human genes 0.000 description 7
- 230000035755 proliferation Effects 0.000 description 7
- 101100259716 Arabidopsis thaliana TAA1 gene Proteins 0.000 description 6
- 241000234653 Cyperus Species 0.000 description 6
- 241000196324 Embryophyta Species 0.000 description 6
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 6
- 235000013965 Lactobacillus plantarum Nutrition 0.000 description 6
- 240000007594 Oryza sativa Species 0.000 description 6
- 235000007164 Oryza sativa Nutrition 0.000 description 6
- 241000235648 Pichia Species 0.000 description 6
- 241000191023 Rhodobacter capsulatus Species 0.000 description 6
- 101100206899 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) TIR2 gene Proteins 0.000 description 6
- 241000219315 Spinacia Species 0.000 description 6
- 101150052008 TKL-1 gene Proteins 0.000 description 6
- 239000002253 acid Chemical class 0.000 description 6
- 230000002255 enzymatic effect Effects 0.000 description 6
- 239000000835 fiber Substances 0.000 description 6
- 150000003278 haem Chemical class 0.000 description 6
- 230000001976 improved effect Effects 0.000 description 6
- 239000003112 inhibitor Substances 0.000 description 6
- 229940072205 lactobacillus plantarum Drugs 0.000 description 6
- 108020004999 messenger RNA Proteins 0.000 description 6
- 230000004048 modification Effects 0.000 description 6
- 238000012986 modification Methods 0.000 description 6
- 101150044129 nirB gene Proteins 0.000 description 6
- 150000003839 salts Chemical class 0.000 description 6
- 239000000126 substance Substances 0.000 description 6
- 230000009466 transformation Effects 0.000 description 6
- 238000013519 translation Methods 0.000 description 6
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 5
- LXJXRIRHZLFYRP-VKHMYHEASA-L (R)-2-Hydroxy-3-(phosphonooxy)-propanal Natural products O=C[C@H](O)COP([O-])([O-])=O LXJXRIRHZLFYRP-VKHMYHEASA-L 0.000 description 5
- 241000894006 Bacteria Species 0.000 description 5
- 108050001186 Chaperonin Cpn60 Proteins 0.000 description 5
- 102000052603 Chaperonins Human genes 0.000 description 5
- LXJXRIRHZLFYRP-VKHMYHEASA-N D-glyceraldehyde 3-phosphate Chemical compound O=C[C@H](O)COP(O)(O)=O LXJXRIRHZLFYRP-VKHMYHEASA-N 0.000 description 5
- 101150090270 DAK1 gene Proteins 0.000 description 5
- 102100030395 Glycerol-3-phosphate dehydrogenase, mitochondrial Human genes 0.000 description 5
- 229920002488 Hemicellulose Polymers 0.000 description 5
- 101001072574 Homo sapiens Glycerol-3-phosphate dehydrogenase [NAD(+)], cytoplasmic Proteins 0.000 description 5
- 241000588748 Klebsiella Species 0.000 description 5
- 241000588747 Klebsiella pneumoniae Species 0.000 description 5
- 241000235649 Kluyveromyces Species 0.000 description 5
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 5
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 5
- 244000061176 Nicotiana tabacum Species 0.000 description 5
- 101100115804 Schizosaccharomyces pombe (strain 972 / ATCC 24843) dak2 gene Proteins 0.000 description 5
- 101150014383 adhE gene Proteins 0.000 description 5
- 229910021529 ammonia Inorganic materials 0.000 description 5
- 230000009604 anaerobic growth Effects 0.000 description 5
- 238000004422 calculation algorithm Methods 0.000 description 5
- 101150068366 cbbM gene Proteins 0.000 description 5
- 229940120503 dihydroxyacetone Drugs 0.000 description 5
- 230000007613 environmental effect Effects 0.000 description 5
- 101150033931 gldA gene Proteins 0.000 description 5
- 239000012978 lignocellulosic material Substances 0.000 description 5
- 239000002609 medium Substances 0.000 description 5
- 101150045642 nirD gene Proteins 0.000 description 5
- 229920001277 pectin Polymers 0.000 description 5
- 229920000642 polymer Polymers 0.000 description 5
- 235000009566 rice Nutrition 0.000 description 5
- 210000000582 semen Anatomy 0.000 description 5
- 239000000758 substrate Substances 0.000 description 5
- 150000008163 sugars Chemical class 0.000 description 5
- 239000002699 waste material Substances 0.000 description 5
- 101710122378 10 kDa heat shock protein, mitochondrial Proteins 0.000 description 4
- YCOXTKKNXUZSKD-UHFFFAOYSA-N 3,4-xylenol Chemical compound CC1=CC=C(O)C=C1C YCOXTKKNXUZSKD-UHFFFAOYSA-N 0.000 description 4
- 101710154868 60 kDa heat shock protein, mitochondrial Proteins 0.000 description 4
- 108020002663 Aldehyde Dehydrogenase Proteins 0.000 description 4
- 102000005369 Aldehyde Dehydrogenase Human genes 0.000 description 4
- 235000015911 Amorpha canescens Nutrition 0.000 description 4
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 4
- 101100404144 Bacillus subtilis (strain 168) nasD gene Proteins 0.000 description 4
- 101100404147 Bacillus subtilis (strain 168) nasE gene Proteins 0.000 description 4
- 241001474374 Blennius Species 0.000 description 4
- 235000011332 Brassica juncea Nutrition 0.000 description 4
- 244000178993 Brassica juncea Species 0.000 description 4
- 235000014700 Brassica juncea var napiformis Nutrition 0.000 description 4
- 241001453380 Burkholderia Species 0.000 description 4
- 241000222120 Candida <Saccharomycetales> Species 0.000 description 4
- 240000008574 Capsicum frutescens Species 0.000 description 4
- 241000193454 Clostridium beijerinckii Species 0.000 description 4
- 241000186570 Clostridium kluyveri Species 0.000 description 4
- 241000235646 Cyberlindnera jadinii Species 0.000 description 4
- KTVPXOYAKDPRHY-MBMOQRBOSA-N D-Ribose 5-phosphate Natural products O[C@H]1O[C@H](COP(O)(O)=O)[C@@H](O)[C@@H]1O KTVPXOYAKDPRHY-MBMOQRBOSA-N 0.000 description 4
- FNZLKVNUWIIPSJ-RFZPGFLSSA-N D-xylulose 5-phosphate Chemical compound OCC(=O)[C@@H](O)[C@H](O)COP(O)(O)=O FNZLKVNUWIIPSJ-RFZPGFLSSA-N 0.000 description 4
- 241000223221 Fusarium oxysporum Species 0.000 description 4
- 101150106451 HEM13 gene Proteins 0.000 description 4
- 101001009678 Homo sapiens Glycerol-3-phosphate dehydrogenase, mitochondrial Proteins 0.000 description 4
- 241000235058 Komagataella pastoris Species 0.000 description 4
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 4
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 4
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 4
- 235000013629 Lippia javanica Nutrition 0.000 description 4
- HDAJUGGARUFROU-JSUDGWJLSA-L MoO2-molybdopterin cofactor Chemical compound O([C@H]1NC=2N=C(NC(=O)C=2N[C@H]11)N)[C@H](COP(O)(O)=O)C2=C1S[Mo](=O)(=O)S2 HDAJUGGARUFROU-JSUDGWJLSA-L 0.000 description 4
- 241000187479 Mycobacterium tuberculosis Species 0.000 description 4
- LCTONWCANYUPML-UHFFFAOYSA-N Pyruvic acid Chemical compound CC(=O)C(O)=O LCTONWCANYUPML-UHFFFAOYSA-N 0.000 description 4
- 108090001066 Racemases and epimerases Proteins 0.000 description 4
- 102000004879 Racemases and epimerases Human genes 0.000 description 4
- 108060007030 Ribulose-phosphate 3-epimerase Proteins 0.000 description 4
- 101100099697 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) TKL2 gene Proteins 0.000 description 4
- 241000235347 Schizosaccharomyces pombe Species 0.000 description 4
- 244000039265 Symplocos fasciculata Species 0.000 description 4
- 101710094544 Transketolase 1 Proteins 0.000 description 4
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 4
- 241000235015 Yarrowia lipolytica Species 0.000 description 4
- 241000222124 [Candida] boidinii Species 0.000 description 4
- PYMYPHUHKUWMLA-WDCZJNDASA-N arabinose Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)C=O PYMYPHUHKUWMLA-WDCZJNDASA-N 0.000 description 4
- 230000001580 bacterial effect Effects 0.000 description 4
- 230000015556 catabolic process Effects 0.000 description 4
- 238000012512 characterization method Methods 0.000 description 4
- GNGACRATGGDKBX-UHFFFAOYSA-N dihydroxyacetone phosphate Chemical compound OCC(=O)COP(O)(O)=O GNGACRATGGDKBX-UHFFFAOYSA-N 0.000 description 4
- 239000013604 expression vector Substances 0.000 description 4
- VWWQXMAJTJZDQX-UYBVJOGSSA-N flavin adenine dinucleotide Chemical compound C1=NC2=C(N)N=CN=C2N1[C@@H]([C@H](O)[C@@H]1O)O[C@@H]1CO[P@](O)(=O)O[P@@](O)(=O)OC[C@@H](O)[C@@H](O)[C@@H](O)CN1C2=NC(=O)NC(=O)C2=NC2=C1C=C(C)C(C)=C2 VWWQXMAJTJZDQX-UYBVJOGSSA-N 0.000 description 4
- 235000019162 flavin adenine dinucleotide Nutrition 0.000 description 4
- 239000011714 flavin adenine dinucleotide Substances 0.000 description 4
- 229940093632 flavin-adenine dinucleotide Drugs 0.000 description 4
- 101150087371 gpd1 gene Proteins 0.000 description 4
- 230000012010 growth Effects 0.000 description 4
- 230000010354 integration Effects 0.000 description 4
- 229920002521 macromolecule Polymers 0.000 description 4
- 230000007246 mechanism Effects 0.000 description 4
- 239000000203 mixture Substances 0.000 description 4
- 108010046778 molybdenum cofactor Proteins 0.000 description 4
- 239000001814 pectin Substances 0.000 description 4
- 235000010987 pectin Nutrition 0.000 description 4
- 239000013612 plasmid Substances 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- 239000007787 solid Substances 0.000 description 4
- 239000010907 stover Substances 0.000 description 4
- 238000005406 washing Methods 0.000 description 4
- GHOKWGTUZJEAQD-ZETCQYMHSA-N (D)-(+)-Pantothenic acid Chemical compound OCC(C)(C)[C@@H](O)C(=O)NCCC(O)=O GHOKWGTUZJEAQD-ZETCQYMHSA-N 0.000 description 3
- 101150039109 AAC3 gene Proteins 0.000 description 3
- 102100026397 ADP/ATP translocase 3 Human genes 0.000 description 3
- 235000014748 Amaranthus tricolor Nutrition 0.000 description 3
- 244000024893 Amaranthus tricolor Species 0.000 description 3
- 101100004408 Arabidopsis thaliana BIG gene Proteins 0.000 description 3
- 241000228245 Aspergillus niger Species 0.000 description 3
- 241000589151 Azotobacter Species 0.000 description 3
- 244000063299 Bacillus subtilis Species 0.000 description 3
- 235000014469 Bacillus subtilis Nutrition 0.000 description 3
- 241000186018 Bifidobacterium adolescentis Species 0.000 description 3
- 241000186016 Bifidobacterium bifidum Species 0.000 description 3
- 241000123650 Botrytis cinerea Species 0.000 description 3
- 235000011303 Brassica alboglabra Nutrition 0.000 description 3
- 240000007124 Brassica oleracea Species 0.000 description 3
- 235000011302 Brassica oleracea Nutrition 0.000 description 3
- 235000002566 Capsicum Nutrition 0.000 description 3
- 108010059892 Cellulase Proteins 0.000 description 3
- 241000193403 Clostridium Species 0.000 description 3
- 108091026890 Coding region Proteins 0.000 description 3
- 241000195493 Cryptophyta Species 0.000 description 3
- WQZGKKKJIJFFOK-QTVWNMPRSA-N D-mannopyranose Chemical compound OC[C@H]1OC(O)[C@@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-QTVWNMPRSA-N 0.000 description 3
- 101150035424 DAK2 gene Proteins 0.000 description 3
- 101100019554 Drosophila melanogaster Adk2 gene Proteins 0.000 description 3
- 108700035019 EC 1.1.1.72 Proteins 0.000 description 3
- 241000588722 Escherichia Species 0.000 description 3
- 101150015217 FET4 gene Proteins 0.000 description 3
- 229930091371 Fructose Natural products 0.000 description 3
- 239000005715 Fructose Substances 0.000 description 3
- RFSUNEUAIZKAJO-ARQDHWQXSA-N Fructose Chemical compound OC[C@H]1O[C@](O)(CO)[C@@H](O)[C@@H]1O RFSUNEUAIZKAJO-ARQDHWQXSA-N 0.000 description 3
- 241000233866 Fungi Species 0.000 description 3
- 101150004714 GPP1 gene Proteins 0.000 description 3
- 101150059691 GPP2 gene Proteins 0.000 description 3
- 108050008938 Glucoamylases Proteins 0.000 description 3
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 3
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 3
- 101000780205 Homo sapiens Long-chain-fatty-acid-CoA ligase 5 Proteins 0.000 description 3
- 101000780202 Homo sapiens Long-chain-fatty-acid-CoA ligase 6 Proteins 0.000 description 3
- 206010021143 Hypoxia Diseases 0.000 description 3
- 108010044467 Isoenzymes Proteins 0.000 description 3
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 3
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 3
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 3
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 3
- 244000165082 Lavanda vera Species 0.000 description 3
- 235000002997 Lavandula Nutrition 0.000 description 3
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 3
- 102100024295 Maltase-glucoamylase Human genes 0.000 description 3
- 101100492388 Mus musculus Nat3 gene Proteins 0.000 description 3
- 108091034117 Oligonucleotide Proteins 0.000 description 3
- 235000014676 Phragmites communis Nutrition 0.000 description 3
- 241000195887 Physcomitrella patens Species 0.000 description 3
- 241000209504 Poaceae Species 0.000 description 3
- 241000206613 Pyropia yezoensis Species 0.000 description 3
- 241000232299 Ralstonia Species 0.000 description 3
- 102000007382 Ribose-5-phosphate isomerase Human genes 0.000 description 3
- 101150102498 SLC25A6 gene Proteins 0.000 description 3
- 101100387347 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) DIP5 gene Proteins 0.000 description 3
- 101100296458 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) PAU1 gene Proteins 0.000 description 3
- 101100242851 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) PAU10 gene Proteins 0.000 description 3
- 101100242852 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) PAU11 gene Proteins 0.000 description 3
- 101100296452 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) PAU14 gene Proteins 0.000 description 3
- 101100296453 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) PAU15 gene Proteins 0.000 description 3
- 101100296459 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) PAU20 gene Proteins 0.000 description 3
- 101100296462 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) PAU23 gene Proteins 0.000 description 3
- 101100296463 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) PAU24 gene Proteins 0.000 description 3
- 101100296465 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) PAU3 gene Proteins 0.000 description 3
- 101100296467 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) PAU5 gene Proteins 0.000 description 3
- 101100296468 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) PAU6 gene Proteins 0.000 description 3
- 101100296469 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) PAU7 gene Proteins 0.000 description 3
- 101100206901 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) TIR3 gene Proteins 0.000 description 3
- 101100206902 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) TIR4 gene Proteins 0.000 description 3
- 101100213465 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) YHK8 gene Proteins 0.000 description 3
- 235000009337 Spinacia oleracea Nutrition 0.000 description 3
- 241001509286 Thiobacillus denitrificans Species 0.000 description 3
- 102000004357 Transferases Human genes 0.000 description 3
- 108090000992 Transferases Proteins 0.000 description 3
- 239000000370 acceptor Substances 0.000 description 3
- 238000009825 accumulation Methods 0.000 description 3
- 235000011054 acetic acid Nutrition 0.000 description 3
- LIPOUNRJVLNBCD-UHFFFAOYSA-N acetyl dihydrogen phosphate Chemical compound CC(=O)OP(O)(O)=O LIPOUNRJVLNBCD-UHFFFAOYSA-N 0.000 description 3
- 230000006978 adaptation Effects 0.000 description 3
- 235000004279 alanine Nutrition 0.000 description 3
- 230000001476 alcoholic effect Effects 0.000 description 3
- PPQRONHOSHZGFQ-LMVFSUKVSA-N aldehydo-D-ribose 5-phosphate Chemical compound OP(=O)(O)OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PPQRONHOSHZGFQ-LMVFSUKVSA-N 0.000 description 3
- WQZGKKKJIJFFOK-PHYPRBDBSA-N alpha-D-galactose Chemical compound OC[C@H]1O[C@H](O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-PHYPRBDBSA-N 0.000 description 3
- 108010028144 alpha-Glucosidases Proteins 0.000 description 3
- 239000001390 capsicum minimum Substances 0.000 description 3
- 239000001569 carbon dioxide Substances 0.000 description 3
- 229910002092 carbon dioxide Inorganic materials 0.000 description 3
- 150000001735 carboxylic acids Chemical class 0.000 description 3
- 239000001913 cellulose Substances 0.000 description 3
- 229920002678 cellulose Polymers 0.000 description 3
- 235000013339 cereals Nutrition 0.000 description 3
- 238000006047 enzymatic hydrolysis reaction Methods 0.000 description 3
- 239000000284 extract Substances 0.000 description 3
- 230000002538 fungal effect Effects 0.000 description 3
- 229930182830 galactose Natural products 0.000 description 3
- 230000013595 glycosylation Effects 0.000 description 3
- 238000006206 glycosylation reaction Methods 0.000 description 3
- 239000010903 husk Substances 0.000 description 3
- 239000003550 marker Substances 0.000 description 3
- 230000004060 metabolic process Effects 0.000 description 3
- 150000002772 monosaccharides Chemical class 0.000 description 3
- 235000015097 nutrients Nutrition 0.000 description 3
- 230000036284 oxygen consumption Effects 0.000 description 3
- 150000002972 pentoses Chemical class 0.000 description 3
- 150000002989 phenols Chemical class 0.000 description 3
- 108060006633 protein kinase Proteins 0.000 description 3
- 239000002994 raw material Substances 0.000 description 3
- 230000010076 replication Effects 0.000 description 3
- 230000002441 reversible effect Effects 0.000 description 3
- 108020005610 ribose 5-phosphate isomerase Proteins 0.000 description 3
- 102000004688 ribulosephosphate 3-epimerase Human genes 0.000 description 3
- JDTUMPKOJBQPKX-GBNDHIKLSA-N sedoheptulose 7-phosphate Chemical compound OCC(=O)[C@@H](O)[C@H](O)[C@H](O)[C@H](O)COP(O)(O)=O JDTUMPKOJBQPKX-GBNDHIKLSA-N 0.000 description 3
- 230000002459 sustained effect Effects 0.000 description 3
- 230000009261 transgenic effect Effects 0.000 description 3
- QVWAEZJXDYOKEH-UHFFFAOYSA-N 3-(3-hydroxyphenyl)propanoic acid Chemical compound OC(=O)CCC1=CC=CC(O)=C1 QVWAEZJXDYOKEH-UHFFFAOYSA-N 0.000 description 2
- PKQIDSVLSKFZQC-UHFFFAOYSA-N 3-oxobutanal Chemical compound CC(=O)CC=O PKQIDSVLSKFZQC-UHFFFAOYSA-N 0.000 description 2
- HFKQINMYQUXOCH-UHFFFAOYSA-N 4-hydroxy-2-oxopentanoic acid Chemical compound CC(O)CC(=O)C(O)=O HFKQINMYQUXOCH-UHFFFAOYSA-N 0.000 description 2
- HBAQYPYDRFILMT-UHFFFAOYSA-N 8-[3-(1-cyclopropylpyrazol-4-yl)-1H-pyrazolo[4,3-d]pyrimidin-5-yl]-3-methyl-3,8-diazabicyclo[3.2.1]octan-2-one Chemical class C1(CC1)N1N=CC(=C1)C1=NNC2=C1N=C(N=C2)N1C2C(N(CC1CC2)C)=O HBAQYPYDRFILMT-UHFFFAOYSA-N 0.000 description 2
- 102100024642 ATP-binding cassette sub-family C member 9 Human genes 0.000 description 2
- QTBSBXVTEAMEQO-UHFFFAOYSA-M Acetate Chemical compound CC([O-])=O QTBSBXVTEAMEQO-UHFFFAOYSA-M 0.000 description 2
- 102000008146 Acetate-CoA ligase Human genes 0.000 description 2
- 241000219317 Amaranthaceae Species 0.000 description 2
- 235000004135 Amaranthus viridis Nutrition 0.000 description 2
- 244000055702 Amaranthus viridis Species 0.000 description 2
- VHUUQVKOLVNVRT-UHFFFAOYSA-N Ammonium hydroxide Chemical compound [NH4+].[OH-] VHUUQVKOLVNVRT-UHFFFAOYSA-N 0.000 description 2
- 241000219195 Arabidopsis thaliana Species 0.000 description 2
- 239000004475 Arginine Substances 0.000 description 2
- 241000235349 Ascomycota Species 0.000 description 2
- 241000228193 Aspergillus clavatus Species 0.000 description 2
- 241000223651 Aureobasidium Species 0.000 description 2
- 241000223678 Aureobasidium pullulans Species 0.000 description 2
- 241000195652 Auxenochlorella pyrenoidosa Species 0.000 description 2
- 241000726110 Azoarcus Species 0.000 description 2
- 101100092567 Bacillus subtilis (strain 168) rpsL gene Proteins 0.000 description 2
- 241000221198 Basidiomycota Species 0.000 description 2
- 241001134770 Bifidobacterium animalis Species 0.000 description 2
- 241000901050 Bifidobacterium animalis subsp. lactis Species 0.000 description 2
- 241000589173 Bradyrhizobium Species 0.000 description 2
- 241001522017 Brettanomyces anomalus Species 0.000 description 2
- 235000000287 Brettanomyces bruxellensis Nutrition 0.000 description 2
- 244000027711 Brettanomyces bruxellensis Species 0.000 description 2
- 102100027194 CDP-diacylglycerol-inositol 3-phosphatidyltransferase Human genes 0.000 description 2
- 101100120909 Caenorhabditis briggsae gpd-3.2 gene Proteins 0.000 description 2
- 101100120910 Caenorhabditis elegans gpd-2 gene Proteins 0.000 description 2
- 101100327692 Caenorhabditis elegans hsp-60 gene Proteins 0.000 description 2
- 235000002568 Capsicum frutescens Nutrition 0.000 description 2
- 241000219504 Caryophyllales Species 0.000 description 2
- 241000219312 Chenopodium Species 0.000 description 2
- 240000006122 Chenopodium album Species 0.000 description 2
- 235000009344 Chenopodium album Nutrition 0.000 description 2
- 241000195597 Chlamydomonas reinhardtii Species 0.000 description 2
- 240000009108 Chlorella vulgaris Species 0.000 description 2
- 235000007089 Chlorella vulgaris Nutrition 0.000 description 2
- 108020004638 Circular DNA Proteins 0.000 description 2
- 241000193401 Clostridium acetobutylicum Species 0.000 description 2
- RGJOEKWQDUBAIZ-IBOSZNHHSA-N CoASH Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCS)O[C@H]1N1C2=NC=NC(N)=C2N=C1 RGJOEKWQDUBAIZ-IBOSZNHHSA-N 0.000 description 2
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 description 2
- 241000219112 Cucumis Species 0.000 description 2
- 240000008067 Cucumis sativus Species 0.000 description 2
- 241000366859 Cupriavidus taiwanensis Species 0.000 description 2
- 102100024638 Cytochrome c oxidase subunit 5B, mitochondrial Human genes 0.000 description 2
- SHZGCJCMOBCMKK-UHFFFAOYSA-N D-mannomethylose Natural products CC1OC(O)C(O)C(O)C1O SHZGCJCMOBCMKK-UHFFFAOYSA-N 0.000 description 2
- 102000053602 DNA Human genes 0.000 description 2
- 102100037373 DNA-(apurinic or apyrimidinic site) endonuclease Human genes 0.000 description 2
- 108700035486 EC 1.7.1.15 Proteins 0.000 description 2
- 101150100477 ERG26 gene Proteins 0.000 description 2
- 101100404840 Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) niiA gene Proteins 0.000 description 2
- 241000224432 Entamoeba histolytica Species 0.000 description 2
- 241000194033 Enterococcus Species 0.000 description 2
- 241001646716 Escherichia coli K-12 Species 0.000 description 2
- 241000206602 Eukaryota Species 0.000 description 2
- 101710088570 Flagellar hook-associated protein 1 Proteins 0.000 description 2
- 108010067193 Formaldehyde transketolase Proteins 0.000 description 2
- YLQBMQCUIZJEEH-UHFFFAOYSA-N Furan Chemical compound C=1C=COC=1 YLQBMQCUIZJEEH-UHFFFAOYSA-N 0.000 description 2
- IAJILQKETJEXLJ-UHFFFAOYSA-N Galacturonsaeure Natural products O=CC(O)C(O)C(O)C(O)C(O)=O IAJILQKETJEXLJ-UHFFFAOYSA-N 0.000 description 2
- 101000946191 Galerina sp Laccase-1 Proteins 0.000 description 2
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 2
- 101710086812 Glycerol-3-phosphate dehydrogenase 1 Proteins 0.000 description 2
- 101710086809 Glycerol-3-phosphate dehydrogenase 2 Proteins 0.000 description 2
- 235000010469 Glycine max Nutrition 0.000 description 2
- 244000068988 Glycine max Species 0.000 description 2
- 244000299507 Gossypium hirsutum Species 0.000 description 2
- 241000206581 Gracilaria Species 0.000 description 2
- 101150009243 HAP1 gene Proteins 0.000 description 2
- 101150045879 HEM14 gene Proteins 0.000 description 2
- 101000760581 Homo sapiens ATP-binding cassette sub-family C member 9 Proteins 0.000 description 2
- 101000914522 Homo sapiens CDP-diacylglycerol-inositol 3-phosphatidyltransferase Proteins 0.000 description 2
- 101000908835 Homo sapiens Cytochrome c oxidase subunit 5B, mitochondrial Proteins 0.000 description 2
- 101001032502 Homo sapiens Iron-sulfur cluster assembly enzyme ISCU, mitochondrial Proteins 0.000 description 2
- 101001019117 Homo sapiens Mediator of RNA polymerase II transcription subunit 23 Proteins 0.000 description 2
- 101000889450 Homo sapiens Trefoil factor 2 Proteins 0.000 description 2
- 240000005979 Hordeum vulgare Species 0.000 description 2
- 235000007340 Hordeum vulgare Nutrition 0.000 description 2
- 102100038096 Iron-sulfur cluster assembly enzyme ISCU, mitochondrial Human genes 0.000 description 2
- 244000285963 Kluyveromyces fragilis Species 0.000 description 2
- 235000014663 Kluyveromyces fragilis Nutrition 0.000 description 2
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 2
- SHZGCJCMOBCMKK-JFNONXLTSA-N L-rhamnopyranose Chemical compound C[C@@H]1OC(O)[C@H](O)[C@H](O)[C@H]1O SHZGCJCMOBCMKK-JFNONXLTSA-N 0.000 description 2
- PNNNRSAQSRJVSB-UHFFFAOYSA-N L-rhamnose Natural products CC(O)C(O)C(O)C(O)C=O PNNNRSAQSRJVSB-UHFFFAOYSA-N 0.000 description 2
- 240000001046 Lactobacillus acidophilus Species 0.000 description 2
- 244000199866 Lactobacillus casei Species 0.000 description 2
- 241000192130 Leuconostoc mesenteroides Species 0.000 description 2
- 241000186781 Listeria Species 0.000 description 2
- 102100034337 Long-chain-fatty-acid-CoA ligase 6 Human genes 0.000 description 2
- 241000186359 Mycobacterium Species 0.000 description 2
- 241000187492 Mycobacterium marinum Species 0.000 description 2
- 241000187917 Mycobacterium ulcerans Species 0.000 description 2
- UFWIBTONFRDIAS-UHFFFAOYSA-N Naphthalene Chemical compound C1=CC=CC2=CC=CC=C21 UFWIBTONFRDIAS-UHFFFAOYSA-N 0.000 description 2
- MWUXSHHQAYIFBG-UHFFFAOYSA-N Nitric oxide Chemical compound O=[N] MWUXSHHQAYIFBG-UHFFFAOYSA-N 0.000 description 2
- CBENFWSGALASAD-UHFFFAOYSA-N Ozone Chemical compound [O-][O+]=O CBENFWSGALASAD-UHFFFAOYSA-N 0.000 description 2
- 241001520808 Panicum virgatum Species 0.000 description 2
- 241000228150 Penicillium chrysogenum Species 0.000 description 2
- 244000273256 Phragmites communis Species 0.000 description 2
- 241000589776 Pseudomonas putida Species 0.000 description 2
- 241000228453 Pyrenophora Species 0.000 description 2
- 241000589771 Ralstonia solanacearum Species 0.000 description 2
- 241000589180 Rhizobium Species 0.000 description 2
- 235000003846 Ricinus Nutrition 0.000 description 2
- 241000322381 Ricinus <louse> Species 0.000 description 2
- 241000235072 Saccharomyces bayanus Species 0.000 description 2
- 101100280133 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) EUG1 gene Proteins 0.000 description 2
- 101100231696 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) FRT2 gene Proteins 0.000 description 2
- 101000861374 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) Fumarate reductase 1 Proteins 0.000 description 2
- 101100028327 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OYE2 gene Proteins 0.000 description 2
- 101100296450 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) PAU12 gene Proteins 0.000 description 2
- 101100296451 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) PAU13 gene Proteins 0.000 description 2
- 101100296454 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) PAU16 gene Proteins 0.000 description 2
- 101100296457 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) PAU19 gene Proteins 0.000 description 2
- 101100296460 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) PAU21 gene Proteins 0.000 description 2
- 101100296461 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) PAU22 gene Proteins 0.000 description 2
- 101100296466 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) PAU4 gene Proteins 0.000 description 2
- 101100518980 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) PAU8 gene Proteins 0.000 description 2
- 101100375638 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) YAR028W gene Proteins 0.000 description 2
- 101100376208 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) YGR035C gene Proteins 0.000 description 2
- 101100320840 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) YMR252C gene Proteins 0.000 description 2
- 101100376711 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) YNR014W gene Proteins 0.000 description 2
- 235000018368 Saccharomyces fragilis Nutrition 0.000 description 2
- 241001123227 Saccharomyces pastorianus Species 0.000 description 2
- 241000235343 Saccharomycetales Species 0.000 description 2
- 240000000111 Saccharum officinarum Species 0.000 description 2
- 235000007201 Saccharum officinarum Nutrition 0.000 description 2
- 241000235060 Scheffersomyces stipitis Species 0.000 description 2
- 238000012300 Sequence Analysis Methods 0.000 description 2
- 102100026209 Serine/threonine-protein kinase PLK3 Human genes 0.000 description 2
- 241000607768 Shigella Species 0.000 description 2
- 241000206732 Skeletonema costatum Species 0.000 description 2
- 240000003768 Solanum lycopersicum Species 0.000 description 2
- 241000191967 Staphylococcus aureus Species 0.000 description 2
- 229920002472 Starch Polymers 0.000 description 2
- 241001313536 Thermothelomyces thermophila Species 0.000 description 2
- 241000605118 Thiobacillus Species 0.000 description 2
- 244000288561 Torulaspora delbrueckii Species 0.000 description 2
- 235000014681 Torulaspora delbrueckii Nutrition 0.000 description 2
- 108700009124 Transcription Initiation Site Proteins 0.000 description 2
- 102100039172 Trefoil factor 2 Human genes 0.000 description 2
- 235000021307 Triticum Nutrition 0.000 description 2
- 244000098338 Triticum aestivum Species 0.000 description 2
- TVXBFESIOXBWNM-UHFFFAOYSA-N Xylitol Natural products OCCC(O)C(O)C(O)CCO TVXBFESIOXBWNM-UHFFFAOYSA-N 0.000 description 2
- 241000607734 Yersinia <bacteria> Species 0.000 description 2
- 230000009471 action Effects 0.000 description 2
- 238000007792 addition Methods 0.000 description 2
- 101150049512 ald gene Proteins 0.000 description 2
- 125000000539 amino acid group Chemical group 0.000 description 2
- 239000000908 ammonium hydroxide Substances 0.000 description 2
- 230000001651 autotrophic effect Effects 0.000 description 2
- 239000002585 base Substances 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 229940118852 bifidobacterium animalis Drugs 0.000 description 2
- 229940002008 bifidobacterium bifidum Drugs 0.000 description 2
- 230000027455 binding Effects 0.000 description 2
- YCIMNLLNPGFGHC-UHFFFAOYSA-N catechol Chemical compound OC1=CC=CC=C1O YCIMNLLNPGFGHC-UHFFFAOYSA-N 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 2
- RGJOEKWQDUBAIZ-UHFFFAOYSA-N coenzime A Natural products OC1C(OP(O)(O)=O)C(COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCS)OC1N1C2=NC=NC(N)=C2N=C1 RGJOEKWQDUBAIZ-UHFFFAOYSA-N 0.000 description 2
- 239000005515 coenzyme Substances 0.000 description 2
- 239000005516 coenzyme A Substances 0.000 description 2
- 229940093530 coenzyme a Drugs 0.000 description 2
- 230000000295 complement effect Effects 0.000 description 2
- 238000004590 computer program Methods 0.000 description 2
- 229910052802 copper Inorganic materials 0.000 description 2
- 239000010949 copper Substances 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 230000007423 decrease Effects 0.000 description 2
- 230000003247 decreasing effect Effects 0.000 description 2
- 238000006731 degradation reaction Methods 0.000 description 2
- KDTSHFARGAKYJN-UHFFFAOYSA-N dephosphocoenzyme A Natural products OC1C(O)C(COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCS)OC1N1C2=NC=NC(N)=C2N=C1 KDTSHFARGAKYJN-UHFFFAOYSA-N 0.000 description 2
- 238000001784 detoxification Methods 0.000 description 2
- ZUOUZKKEUPVFJK-UHFFFAOYSA-N diphenyl Chemical compound C1=CC=CC=C1C1=CC=CC=C1 ZUOUZKKEUPVFJK-UHFFFAOYSA-N 0.000 description 2
- 150000002016 disaccharides Chemical class 0.000 description 2
- 101150032129 egsA gene Proteins 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 229940007078 entamoeba histolytica Drugs 0.000 description 2
- 230000009088 enzymatic function Effects 0.000 description 2
- 230000007071 enzymatic hydrolysis Effects 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- 239000012634 fragment Substances 0.000 description 2
- 238000010230 functional analysis Methods 0.000 description 2
- 150000002240 furans Chemical class 0.000 description 2
- HYBBIBNJHNGZAN-UHFFFAOYSA-N furfural Chemical compound O=CC1=CC=CO1 HYBBIBNJHNGZAN-UHFFFAOYSA-N 0.000 description 2
- 238000007429 general method Methods 0.000 description 2
- 238000012252 genetic analysis Methods 0.000 description 2
- 238000011331 genomic analysis Methods 0.000 description 2
- 238000000227 grinding Methods 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 239000011121 hardwood Substances 0.000 description 2
- 230000007954 hypoxia Effects 0.000 description 2
- 230000002779 inactivation Effects 0.000 description 2
- 238000010348 incorporation Methods 0.000 description 2
- 230000006698 induction Effects 0.000 description 2
- 238000011090 industrial biotechnology method and process Methods 0.000 description 2
- 238000009776 industrial production Methods 0.000 description 2
- 230000002401 inhibitory effect Effects 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 229960000310 isoleucine Drugs 0.000 description 2
- 229940031154 kluyveromyces marxianus Drugs 0.000 description 2
- JVTAAEKCZFNVCJ-UHFFFAOYSA-N lactic acid Chemical compound CC(O)C(O)=O JVTAAEKCZFNVCJ-UHFFFAOYSA-N 0.000 description 2
- HEBKCHPVOIAQTA-UHFFFAOYSA-N meso ribitol Natural products OCC(O)C(O)C(O)CO HEBKCHPVOIAQTA-UHFFFAOYSA-N 0.000 description 2
- 238000012269 metabolic engineering Methods 0.000 description 2
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 2
- 101150024975 mhpF gene Proteins 0.000 description 2
- 238000002493 microarray Methods 0.000 description 2
- 238000003801 milling Methods 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 239000010813 municipal solid waste Substances 0.000 description 2
- 229910052757 nitrogen Inorganic materials 0.000 description 2
- 238000002515 oligonucleotide synthesis Methods 0.000 description 2
- 229920001542 oligosaccharide Polymers 0.000 description 2
- 150000002482 oligosaccharides Chemical class 0.000 description 2
- 230000036542 oxidative stress Effects 0.000 description 2
- 239000010893 paper waste Substances 0.000 description 2
- 230000036961 partial effect Effects 0.000 description 2
- 150000002978 peroxides Chemical class 0.000 description 2
- 230000035479 physiological effects, processes and functions Effects 0.000 description 2
- 229930001119 polyketide Natural products 0.000 description 2
- 150000003881 polyketide derivatives Chemical class 0.000 description 2
- 239000002243 precursor Substances 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 229940107700 pyruvic acid Drugs 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 238000002741 site-directed mutagenesis Methods 0.000 description 2
- 239000011122 softwood Substances 0.000 description 2
- 239000002904 solvent Substances 0.000 description 2
- 239000008107 starch Substances 0.000 description 2
- 235000019698 starch Nutrition 0.000 description 2
- 230000002194 synthesizing effect Effects 0.000 description 2
- JMSVCTWVEWCHDZ-UHFFFAOYSA-N syringic acid Chemical compound COC1=CC(C(O)=O)=CC(OC)=C1O JMSVCTWVEWCHDZ-UHFFFAOYSA-N 0.000 description 2
- 230000009897 systematic effect Effects 0.000 description 2
- 238000009997 thermal pre-treatment Methods 0.000 description 2
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 2
- 210000001519 tissue Anatomy 0.000 description 2
- 238000010361 transduction Methods 0.000 description 2
- 230000026683 transduction Effects 0.000 description 2
- 239000004474 valine Substances 0.000 description 2
- MWOOGOJBHIARFG-UHFFFAOYSA-N vanillin Chemical compound COC1=CC(C=O)=CC=C1O MWOOGOJBHIARFG-UHFFFAOYSA-N 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- 239000002023 wood Substances 0.000 description 2
- 239000002916 wood waste Substances 0.000 description 2
- 239000000811 xylitol Substances 0.000 description 2
- HEBKCHPVOIAQTA-SCDXWVJYSA-N xylitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)CO HEBKCHPVOIAQTA-SCDXWVJYSA-N 0.000 description 2
- 235000010447 xylitol Nutrition 0.000 description 2
- 229960002675 xylitol Drugs 0.000 description 2
- HDTRYLNUVZCQOY-UHFFFAOYSA-N α-D-glucopyranosyl-α-D-glucopyranoside Natural products OC1C(O)C(O)C(CO)OC1OC1C(O)C(O)C(O)C(CO)O1 HDTRYLNUVZCQOY-UHFFFAOYSA-N 0.000 description 1
- BEJKOYIMCGMNRB-GRHHLOCNSA-N (2s)-2-amino-3-(4-hydroxyphenyl)propanoic acid;(2s)-2-amino-3-phenylpropanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1.OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BEJKOYIMCGMNRB-GRHHLOCNSA-N 0.000 description 1
- FQVLRGLGWNWPSS-BXBUPLCLSA-N (4r,7s,10s,13s,16r)-16-acetamido-13-(1h-imidazol-5-ylmethyl)-10-methyl-6,9,12,15-tetraoxo-7-propan-2-yl-1,2-dithia-5,8,11,14-tetrazacycloheptadecane-4-carboxamide Chemical compound N1C(=O)[C@@H](NC(C)=O)CSSC[C@@H](C(N)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)NC(=O)[C@@H]1CC1=CN=CN1 FQVLRGLGWNWPSS-BXBUPLCLSA-N 0.000 description 1
- KSEBMYQBYZTDHS-HWKANZROSA-M (E)-Ferulic acid Natural products COC1=CC(\C=C\C([O-])=O)=CC=C1O KSEBMYQBYZTDHS-HWKANZROSA-M 0.000 description 1
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 1
- OSJPPGNTCRNQQC-UWTATZPHSA-N 3-phospho-D-glyceric acid Chemical compound OC(=O)[C@H](O)COP(O)(O)=O OSJPPGNTCRNQQC-UWTATZPHSA-N 0.000 description 1
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 1
- 108010029348 4-hydroxy-2-oxovalerate aldolase Proteins 0.000 description 1
- NOEGNKMFWQHSLB-UHFFFAOYSA-N 5-hydroxymethylfurfural Chemical compound OCC1=CC=C(C=O)O1 NOEGNKMFWQHSLB-UHFFFAOYSA-N 0.000 description 1
- 230000005730 ADP ribosylation Effects 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 235000009899 Agrostemma githago Nutrition 0.000 description 1
- 240000000254 Agrostemma githago Species 0.000 description 1
- 102100034035 Alcohol dehydrogenase 1A Human genes 0.000 description 1
- 102100034042 Alcohol dehydrogenase 1C Human genes 0.000 description 1
- 102100039702 Alcohol dehydrogenase class-3 Human genes 0.000 description 1
- 108010025188 Alcohol oxidase Proteins 0.000 description 1
- 108010053754 Aldehyde reductase Proteins 0.000 description 1
- 102100027265 Aldo-keto reductase family 1 member B1 Human genes 0.000 description 1
- 102100034044 All-trans-retinol dehydrogenase [NAD(+)] ADH1B Human genes 0.000 description 1
- 101710193111 All-trans-retinol dehydrogenase [NAD(+)] ADH4 Proteins 0.000 description 1
- 102100031795 All-trans-retinol dehydrogenase [NAD(+)] ADH4 Human genes 0.000 description 1
- 241000223600 Alternaria Species 0.000 description 1
- 241000219318 Amaranthus Species 0.000 description 1
- 240000001592 Amaranthus caudatus Species 0.000 description 1
- 235000009328 Amaranthus caudatus Nutrition 0.000 description 1
- 244000300297 Amaranthus hybridus Species 0.000 description 1
- 241000609240 Ambelania acida Species 0.000 description 1
- 229920000856 Amylose Polymers 0.000 description 1
- 241000196169 Ankistrodesmus Species 0.000 description 1
- 101100517196 Arabidopsis thaliana NRPE1 gene Proteins 0.000 description 1
- 241001183432 Arcobacter ellisii Species 0.000 description 1
- 241001062687 Arcobacter pacificus Species 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 241001225321 Aspergillus fumigatus Species 0.000 description 1
- 241000589149 Azotobacter vinelandii Species 0.000 description 1
- 241000288015 Bambusicola <bird> Species 0.000 description 1
- 235000016068 Berberis vulgaris Nutrition 0.000 description 1
- 241000335053 Beta vulgaris Species 0.000 description 1
- 241000219310 Beta vulgaris subsp. vulgaris Species 0.000 description 1
- BVKZGUZCCUSVTD-UHFFFAOYSA-M Bicarbonate Chemical compound OC([O-])=O BVKZGUZCCUSVTD-UHFFFAOYSA-M 0.000 description 1
- 241000186000 Bifidobacterium Species 0.000 description 1
- 241001312346 Bifidobacterium gallicum Species 0.000 description 1
- 241000680806 Blastobotrys adeninivorans Species 0.000 description 1
- 101100190825 Bos taurus PMEL gene Proteins 0.000 description 1
- 235000006463 Brassica alba Nutrition 0.000 description 1
- 244000140786 Brassica hirta Species 0.000 description 1
- 235000014698 Brassica juncea var multisecta Nutrition 0.000 description 1
- 235000006008 Brassica napus var napus Nutrition 0.000 description 1
- 235000006618 Brassica rapa subsp oleifera Nutrition 0.000 description 1
- 235000004977 Brassica sinapistrum Nutrition 0.000 description 1
- 241000722883 Brettanomyces custersianus Species 0.000 description 1
- 241000722860 Brettanomyces naardenensis Species 0.000 description 1
- 241000735514 Brettanomyces nanus Species 0.000 description 1
- 241001136175 Burkholderia pseudomallei Species 0.000 description 1
- 241000863292 Cabera Species 0.000 description 1
- 101100150553 Caenorhabditis elegans ssu-1 gene Proteins 0.000 description 1
- 241000620141 Carboxydothermus Species 0.000 description 1
- 241000620137 Carboxydothermus hydrogenoformans Species 0.000 description 1
- 102000014914 Carrier Proteins Human genes 0.000 description 1
- 108010008885 Cellulose 1,4-beta-Cellobiosidase Proteins 0.000 description 1
- 241000195585 Chlamydomonas Species 0.000 description 1
- 241000195628 Chlorophyta Species 0.000 description 1
- DBPRUZCKPFOVDV-UHFFFAOYSA-N Clorprenaline hydrochloride Chemical compound O.Cl.CC(C)NCC(O)C1=CC=CC=C1Cl DBPRUZCKPFOVDV-UHFFFAOYSA-N 0.000 description 1
- 241001112696 Clostridia Species 0.000 description 1
- 241001478240 Coccus Species 0.000 description 1
- 241001040868 Conticribra weissflogii Species 0.000 description 1
- 229920002261 Corn starch Polymers 0.000 description 1
- 229920000742 Cotton Polymers 0.000 description 1
- 101000796894 Coturnix japonica Alcohol dehydrogenase 1 Proteins 0.000 description 1
- 235000015510 Cucumis melo subsp melo Nutrition 0.000 description 1
- 235000010071 Cucumis prophetarum Nutrition 0.000 description 1
- 235000009849 Cucumis sativus Nutrition 0.000 description 1
- 235000010799 Cucumis sativus var sativus Nutrition 0.000 description 1
- 240000001980 Cucurbita pepo Species 0.000 description 1
- 241000219130 Cucurbita pepo subsp. pepo Species 0.000 description 1
- 235000003954 Cucurbita pepo var melopepo Nutrition 0.000 description 1
- 241000510672 Cuminum Species 0.000 description 1
- 241001528480 Cupriavidus Species 0.000 description 1
- 241000252231 Cyprinus Species 0.000 description 1
- 102100028717 Cytosolic 5'-nucleotidase 3A Human genes 0.000 description 1
- GUBGYTABKSRVRQ-CUHNMECISA-N D-Cellobiose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-CUHNMECISA-N 0.000 description 1
- GSXOAOHZAIYLCY-UHFFFAOYSA-N D-F6P Natural products OCC(=O)C(O)C(O)C(O)COP(O)(O)=O GSXOAOHZAIYLCY-UHFFFAOYSA-N 0.000 description 1
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 1
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 1
- AEMOLEFTQBMNLQ-YMDCURPLSA-N D-galactopyranuronic acid Chemical compound OC1O[C@H](C(O)=O)[C@H](O)[C@H](O)[C@H]1O AEMOLEFTQBMNLQ-YMDCURPLSA-N 0.000 description 1
- AEMOLEFTQBMNLQ-AQKNRBDQSA-N D-glucopyranuronic acid Chemical compound OC1O[C@H](C(O)=O)[C@@H](O)[C@H](O)[C@H]1O AEMOLEFTQBMNLQ-AQKNRBDQSA-N 0.000 description 1
- MNQZXJOMYWMBOU-VKHMYHEASA-N D-glyceraldehyde Chemical compound OC[C@@H](O)C=O MNQZXJOMYWMBOU-VKHMYHEASA-N 0.000 description 1
- HMFHBZSHGGEWLO-SOOFDHNKSA-N D-ribofuranose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H]1O HMFHBZSHGGEWLO-SOOFDHNKSA-N 0.000 description 1
- ZAQJHHRNXZUBTE-NQXXGFSBSA-N D-ribulose Chemical compound OC[C@@H](O)[C@@H](O)C(=O)CO ZAQJHHRNXZUBTE-NQXXGFSBSA-N 0.000 description 1
- ZAQJHHRNXZUBTE-WUJLRWPWSA-N D-xylulose Chemical compound OC[C@@H](O)[C@H](O)C(=O)CO ZAQJHHRNXZUBTE-WUJLRWPWSA-N 0.000 description 1
- 230000004568 DNA-binding Effects 0.000 description 1
- 201000004624 Dermatitis Diseases 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- 241000195633 Dunaliella salina Species 0.000 description 1
- 241000224431 Entamoeba Species 0.000 description 1
- 241000223682 Exophiala Species 0.000 description 1
- 241000248325 Exophiala dermatitidis Species 0.000 description 1
- 108091092566 Extrachromosomal DNA Proteins 0.000 description 1
- 102100034545 FAD synthase region Human genes 0.000 description 1
- 101710151841 Farnesyl pyrophosphate synthase 1 Proteins 0.000 description 1
- 101150002721 GPD2 gene Proteins 0.000 description 1
- 206010017711 Gangrene Diseases 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- 229920002581 Glucomannan Polymers 0.000 description 1
- 108010068370 Glutens Proteins 0.000 description 1
- 102100038261 Glycerol-3-phosphate phosphatase Human genes 0.000 description 1
- 101710171812 Glycerol-3-phosphate phosphatase Proteins 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 235000009429 Gossypium barbadense Nutrition 0.000 description 1
- 241001149669 Hanseniaspora Species 0.000 description 1
- 241001149671 Hanseniaspora uvarum Species 0.000 description 1
- 101710116987 Heat shock protein 60, mitochondrial Proteins 0.000 description 1
- 244000020551 Helianthus annuus Species 0.000 description 1
- 235000003222 Helianthus annuus Nutrition 0.000 description 1
- 101000780443 Homo sapiens Alcohol dehydrogenase 1A Proteins 0.000 description 1
- 101000780463 Homo sapiens Alcohol dehydrogenase 1C Proteins 0.000 description 1
- 101000959452 Homo sapiens Alcohol dehydrogenase class-3 Proteins 0.000 description 1
- 101000775437 Homo sapiens All-trans-retinol dehydrogenase [NAD(+)] ADH4 Proteins 0.000 description 1
- 101000891113 Homo sapiens T-cell acute lymphocytic leukemia protein 1 Proteins 0.000 description 1
- UFHFLCQGNIYNRP-UHFFFAOYSA-N Hydrogen Chemical compound [H][H] UFHFLCQGNIYNRP-UHFFFAOYSA-N 0.000 description 1
- 102000004157 Hydrolases Human genes 0.000 description 1
- 108090000604 Hydrolases Proteins 0.000 description 1
- 206010020852 Hypertonia Diseases 0.000 description 1
- 229930010555 Inosine Natural products 0.000 description 1
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 1
- 208000007976 Ketosis Diseases 0.000 description 1
- 241000588915 Klebsiella aerogenes Species 0.000 description 1
- 241001138401 Kluyveromyces lactis Species 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- 125000000393 L-methionino group Chemical group [H]OC(=O)[C@@]([H])(N([H])[*])C([H])([H])C(SC([H])([H])[H])([H])[H] 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- 241000933069 Lachnoclostridium phytofermentans Species 0.000 description 1
- 235000013956 Lactobacillus acidophilus Nutrition 0.000 description 1
- 235000013958 Lactobacillus casei Nutrition 0.000 description 1
- 241000186684 Lactobacillus pentosus Species 0.000 description 1
- 235000003228 Lactuca sativa Nutrition 0.000 description 1
- 240000008415 Lactuca sativa Species 0.000 description 1
- 244000207740 Lemna minor Species 0.000 description 1
- 235000006439 Lemna minor Nutrition 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 241000634997 Leucon Species 0.000 description 1
- 235000004431 Linum usitatissimum Nutrition 0.000 description 1
- 240000006240 Linum usitatissimum Species 0.000 description 1
- 241000219745 Lupinus Species 0.000 description 1
- 235000010649 Lupinus albus Nutrition 0.000 description 1
- 240000000894 Lupinus albus Species 0.000 description 1
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 1
- 229930195725 Mannitol Natural products 0.000 description 1
- 241000219470 Mirabilis Species 0.000 description 1
- 240000003433 Miscanthus floridulus Species 0.000 description 1
- 241000588771 Morganella <proteobacterium> Species 0.000 description 1
- 101100478600 Mus musculus Ssuh2 gene Proteins 0.000 description 1
- 241000221961 Neurospora crassa Species 0.000 description 1
- 241000208133 Nicotiana plumbaginifolia Species 0.000 description 1
- 108090000836 Nitrate Transporters Proteins 0.000 description 1
- 238000000636 Northern blotting Methods 0.000 description 1
- 241000998584 Nuda Species 0.000 description 1
- 241000202223 Oenococcus Species 0.000 description 1
- 241001099341 Ogataea polymorpha Species 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 101100073341 Oryza sativa subsp. japonica KAO gene Proteins 0.000 description 1
- 241000795247 Paraburkholderia ribeironis Species 0.000 description 1
- 241001668545 Pascopyrum Species 0.000 description 1
- 241000142651 Pelotomaculum thermopropionicum Species 0.000 description 1
- 108091093037 Peptide nucleic acid Proteins 0.000 description 1
- 241001466488 Phaeocystis antarctica Species 0.000 description 1
- 241000199919 Phaeophyceae Species 0.000 description 1
- 235000010627 Phaseolus vulgaris Nutrition 0.000 description 1
- 244000046052 Phaseolus vulgaris Species 0.000 description 1
- OAICVXFJPJFONN-UHFFFAOYSA-N Phosphorus Chemical compound [P] OAICVXFJPJFONN-UHFFFAOYSA-N 0.000 description 1
- 241000235645 Pichia kudriavzevii Species 0.000 description 1
- 235000008331 Pinus X rigitaeda Nutrition 0.000 description 1
- 241000018646 Pinus brutia Species 0.000 description 1
- 235000011613 Pinus brutia Nutrition 0.000 description 1
- 241000235379 Piromyces Species 0.000 description 1
- 244000058602 Pisum arvense Species 0.000 description 1
- 240000004713 Pisum sativum Species 0.000 description 1
- 235000010582 Pisum sativum Nutrition 0.000 description 1
- 235000016815 Pisum sativum var arvense Nutrition 0.000 description 1
- 241000195877 Polytrichum commune Species 0.000 description 1
- 241000219000 Populus Species 0.000 description 1
- 235000001855 Portulaca oleracea Nutrition 0.000 description 1
- 108010009736 Protein Hydrolysates Proteins 0.000 description 1
- 102100037719 Protein SSUH2 homolog Human genes 0.000 description 1
- 241000589774 Pseudomonas sp. Species 0.000 description 1
- 241001327108 Pseudomonas sp. CF600 Species 0.000 description 1
- 241001038563 Pseudostellaria Species 0.000 description 1
- 238000001190 Q-PCR Methods 0.000 description 1
- 101150012255 RKI1 gene Proteins 0.000 description 1
- 238000003559 RNA-seq method Methods 0.000 description 1
- 241001135508 Ralstonia syzygii Species 0.000 description 1
- 238000001069 Raman spectroscopy Methods 0.000 description 1
- 244000088415 Raphanus sativus Species 0.000 description 1
- 235000006140 Raphanus sativus var sativus Nutrition 0.000 description 1
- 108700005075 Regulator Genes Proteins 0.000 description 1
- 241000921453 Rhodobryum Species 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 1
- 102100039270 Ribulose-phosphate 3-epimerase Human genes 0.000 description 1
- 235000004443 Ricinus communis Nutrition 0.000 description 1
- 241000192031 Ruminococcus Species 0.000 description 1
- 101100428737 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) VPS54 gene Proteins 0.000 description 1
- 235000018370 Saccharomyces delbrueckii Nutrition 0.000 description 1
- 241001063879 Saccharomyces eubayanus Species 0.000 description 1
- 241000213556 Saccharomyces jurei Species 0.000 description 1
- 241001123228 Saccharomyces paradoxus Species 0.000 description 1
- 241000582914 Saccharomyces uvarum Species 0.000 description 1
- 241001326564 Saccharomycotina Species 0.000 description 1
- 241000736026 Sarcandra Species 0.000 description 1
- 235000010842 Sarcandra glabra Nutrition 0.000 description 1
- 240000004274 Sarcandra glabra Species 0.000 description 1
- 101000702553 Schistosoma mansoni Antigen Sm21.7 Proteins 0.000 description 1
- 101000714192 Schistosoma mansoni Tegument antigen Proteins 0.000 description 1
- 241000025833 Schizosaccharomyces cryophilus Species 0.000 description 1
- 241000235348 Schizosaccharomyces japonicus Species 0.000 description 1
- 241000235350 Schizosaccharomyces octosporus Species 0.000 description 1
- 241001610470 Selaginella kraussiana Species 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 241000607760 Shigella sonnei Species 0.000 description 1
- 241000220261 Sinapis Species 0.000 description 1
- 241000108511 Skeletonema tropicum Species 0.000 description 1
- 241000208292 Solanaceae Species 0.000 description 1
- 235000002560 Solanum lycopersicum Nutrition 0.000 description 1
- 244000138286 Sorghum saccharatum Species 0.000 description 1
- 235000011684 Sorghum saccharatum Nutrition 0.000 description 1
- 241000029117 Sterolibacterium denitrificans Species 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- 235000021536 Sugar beet Nutrition 0.000 description 1
- NINIDFKCEFEMDL-UHFFFAOYSA-N Sulfur Chemical compound [S] NINIDFKCEFEMDL-UHFFFAOYSA-N 0.000 description 1
- LSNNMFCWUKXFEE-UHFFFAOYSA-N Sulfurous acid Chemical compound OS(O)=O LSNNMFCWUKXFEE-UHFFFAOYSA-N 0.000 description 1
- 102100040365 T-cell acute lymphocytic leukemia protein 1 Human genes 0.000 description 1
- 241000544016 Thalassia testudinum Species 0.000 description 1
- 241000323202 Thalassiosira antarctica Species 0.000 description 1
- 244000269722 Thea sinensis Species 0.000 description 1
- 235000006468 Thea sinensis Nutrition 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- 108700019146 Transgenes Proteins 0.000 description 1
- HDTRYLNUVZCQOY-WSWWMNSNSA-N Trehalose Natural products O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-WSWWMNSNSA-N 0.000 description 1
- 241000218989 Trichosanthes Species 0.000 description 1
- 235000009818 Trichosanthes kirilowii Nutrition 0.000 description 1
- 240000006023 Trichosanthes kirilowii Species 0.000 description 1
- 108010071199 Triokinase Proteins 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- 241000196252 Ulva Species 0.000 description 1
- 108020005202 Viral DNA Proteins 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 235000009392 Vitis Nutrition 0.000 description 1
- 241000219095 Vitis Species 0.000 description 1
- 229920002000 Xyloglucan Polymers 0.000 description 1
- 102100029089 Xylulose kinase Human genes 0.000 description 1
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 description 1
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 1
- PTFCDOFLOPIGGS-UHFFFAOYSA-N Zinc dication Chemical compound [Zn+2] PTFCDOFLOPIGGS-UHFFFAOYSA-N 0.000 description 1
- 241000235029 Zygosaccharomyces bailii Species 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- FJJCIZWZNKZHII-UHFFFAOYSA-N [4,6-bis(cyanoamino)-1,3,5-triazin-2-yl]cyanamide Chemical compound N#CNC1=NC(NC#N)=NC(NC#N)=N1 FJJCIZWZNKZHII-UHFFFAOYSA-N 0.000 description 1
- IKHGUXGNUITLKF-XPULMUKRSA-N acetaldehyde Chemical compound [14CH]([14CH3])=O IKHGUXGNUITLKF-XPULMUKRSA-N 0.000 description 1
- 125000000218 acetic acid group Chemical group C(C)(=O)* 0.000 description 1
- 238000005903 acid hydrolysis reaction Methods 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 239000012190 activator Substances 0.000 description 1
- 239000008186 active pharmaceutical agent Substances 0.000 description 1
- 230000003044 adaptive effect Effects 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 230000000996 additive effect Effects 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 238000005273 aeration Methods 0.000 description 1
- 230000002776 aggregation Effects 0.000 description 1
- 238000004220 aggregation Methods 0.000 description 1
- 125000003158 alcohol group Chemical group 0.000 description 1
- 150000001299 aldehydes Chemical class 0.000 description 1
- 125000001931 aliphatic group Chemical group 0.000 description 1
- HDTRYLNUVZCQOY-LIZSDCNHSA-N alpha,alpha-trehalose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-LIZSDCNHSA-N 0.000 description 1
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 1
- WQZGKKKJIJFFOK-DVKNGEFBSA-N alpha-D-glucose Chemical group OC[C@H]1O[C@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-DVKNGEFBSA-N 0.000 description 1
- 150000001408 amides Chemical class 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 239000010828 animal waste Substances 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 150000001491 aromatic compounds Chemical class 0.000 description 1
- 125000003118 aryl group Chemical group 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 229940091771 aspergillus fumigatus Drugs 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 239000010905 bagasse Substances 0.000 description 1
- 238000012365 batch cultivation Methods 0.000 description 1
- 108010047754 beta-Glucosidase Proteins 0.000 description 1
- 102000006995 beta-Glucosidase Human genes 0.000 description 1
- GUBGYTABKSRVRQ-QUYVBRFLSA-N beta-maltose Chemical compound OC[C@H]1O[C@H](O[C@H]2[C@H](O)[C@@H](O)[C@H](O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@@H]1O GUBGYTABKSRVRQ-QUYVBRFLSA-N 0.000 description 1
- 229940009289 bifidobacterium lactis Drugs 0.000 description 1
- 238000005842 biochemical reaction Methods 0.000 description 1
- 238000006065 biodegradation reaction Methods 0.000 description 1
- 239000002551 biofuel Substances 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 230000001851 biosynthetic effect Effects 0.000 description 1
- 239000004305 biphenyl Substances 0.000 description 1
- 235000010290 biphenyl Nutrition 0.000 description 1
- 244000275904 brauner Senf Species 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 239000001728 capsicum frutescens Substances 0.000 description 1
- 229910002090 carbon oxide Inorganic materials 0.000 description 1
- 230000007910 cell fusion Effects 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 210000002421 cell wall Anatomy 0.000 description 1
- 229940106157 cellulase Drugs 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000001311 chemical methods and process Methods 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 235000020971 citrus fruits Nutrition 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 239000008120 corn starch Substances 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- 230000021953 cytokinesis Effects 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 210000000172 cytosol Anatomy 0.000 description 1
- 230000034994 death Effects 0.000 description 1
- 238000006114 decarboxylation reaction Methods 0.000 description 1
- 238000000354 decomposition reaction Methods 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 230000000593 degrading effect Effects 0.000 description 1
- 238000007357 dehydrogenase reaction Methods 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 239000001177 diphosphate Substances 0.000 description 1
- XPPKVPWEQAFLFU-UHFFFAOYSA-J diphosphate(4-) Chemical compound [O-]P([O-])(=O)OP([O-])([O-])=O XPPKVPWEQAFLFU-UHFFFAOYSA-J 0.000 description 1
- 235000011180 diphosphates Nutrition 0.000 description 1
- 101150056052 dmpF gene Proteins 0.000 description 1
- 238000009837 dry grinding Methods 0.000 description 1
- 235000013399 edible fruits Nutrition 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 229940092559 enterobacter aerogenes Drugs 0.000 description 1
- 238000001952 enzyme assay Methods 0.000 description 1
- 238000006345 epimerization reaction Methods 0.000 description 1
- ZINJLDJMHCUBIP-UHFFFAOYSA-N ethametsulfuron-methyl Chemical compound CCOC1=NC(NC)=NC(NC(=O)NS(=O)(=O)C=2C(=CC=CC=2)C(=O)OC)=N1 ZINJLDJMHCUBIP-UHFFFAOYSA-N 0.000 description 1
- 101150041588 eutE gene Proteins 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- KSEBMYQBYZTDHS-HWKANZROSA-N ferulic acid Chemical compound COC1=CC(\C=C\C(O)=O)=CC=C1O KSEBMYQBYZTDHS-HWKANZROSA-N 0.000 description 1
- 229940114124 ferulic acid Drugs 0.000 description 1
- KSEBMYQBYZTDHS-UHFFFAOYSA-N ferulic acid Natural products COC1=CC(C=CC(O)=O)=CC=C1O KSEBMYQBYZTDHS-UHFFFAOYSA-N 0.000 description 1
- 235000001785 ferulic acid Nutrition 0.000 description 1
- 235000019253 formic acid Nutrition 0.000 description 1
- 239000000446 fuel Substances 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 230000006251 gamma-carboxylation Effects 0.000 description 1
- 238000003209 gene knockout Methods 0.000 description 1
- 229940097043 glucuronic acid Drugs 0.000 description 1
- 125000000291 glutamic acid group Chemical group N[C@@H](CCC(O)=O)C(=O)* 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 235000021312 gluten Nutrition 0.000 description 1
- 102000006602 glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 1
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 1
- 108010032776 glycerol-1-phosphatase Proteins 0.000 description 1
- 150000002313 glycerolipids Chemical class 0.000 description 1
- 150000002314 glycerols Chemical class 0.000 description 1
- 150000002333 glycines Chemical class 0.000 description 1
- 230000034659 glycolysis Effects 0.000 description 1
- 230000002414 glycolytic effect Effects 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 150000002402 hexoses Chemical class 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 101150003679 hsaG gene Proteins 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 229910052739 hydrogen Inorganic materials 0.000 description 1
- UQVDQSWZQXDUJB-UHFFFAOYSA-N hydron;7h-purin-6-amine;chloride Chemical compound Cl.NC1=NC=NC2=C1NC=N2 UQVDQSWZQXDUJB-UHFFFAOYSA-N 0.000 description 1
- XLSMFKSTNGKWQX-UHFFFAOYSA-N hydroxyacetone Chemical compound CC(=O)CO XLSMFKSTNGKWQX-UHFFFAOYSA-N 0.000 description 1
- 230000033444 hydroxylation Effects 0.000 description 1
- 238000005805 hydroxylation reaction Methods 0.000 description 1
- RJGBSYZFOCAGQY-UHFFFAOYSA-N hydroxymethylfurfural Natural products COC1=CC=C(C=O)O1 RJGBSYZFOCAGQY-UHFFFAOYSA-N 0.000 description 1
- 230000001771 impaired effect Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000000415 inactivating effect Effects 0.000 description 1
- 239000000411 inducer Substances 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 239000002440 industrial waste Substances 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 229910052816 inorganic phosphate Inorganic materials 0.000 description 1
- 229960003786 inosine Drugs 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 238000006317 isomerization reaction Methods 0.000 description 1
- GSXOAOHZAIYLCY-HSUXUTPPSA-N keto-D-fructose 6-phosphate Chemical compound OCC(=O)[C@@H](O)[C@H](O)[C@H](O)COP(O)(O)=O GSXOAOHZAIYLCY-HSUXUTPPSA-N 0.000 description 1
- 229910052743 krypton Inorganic materials 0.000 description 1
- DNNSSWSSYDEUBZ-UHFFFAOYSA-N krypton atom Chemical compound [Kr] DNNSSWSSYDEUBZ-UHFFFAOYSA-N 0.000 description 1
- 239000004310 lactic acid Substances 0.000 description 1
- 235000014655 lactic acid Nutrition 0.000 description 1
- 229940039695 lactobacillus acidophilus Drugs 0.000 description 1
- 229940017800 lactobacillus casei Drugs 0.000 description 1
- 239000002029 lignocellulosic biomass Substances 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 235000009973 maize Nutrition 0.000 description 1
- 239000000594 mannitol Substances 0.000 description 1
- 235000010355 mannitol Nutrition 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 230000002503 metabolic effect Effects 0.000 description 1
- 239000002207 metabolite Substances 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 210000003470 mitochondria Anatomy 0.000 description 1
- 238000001823 molecular biology technique Methods 0.000 description 1
- 125000000896 monocarboxylic acid group Chemical group 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 108010008217 nidogen Proteins 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 239000002420 orchard Substances 0.000 description 1
- 210000003463 organelle Anatomy 0.000 description 1
- 150000002894 organic compounds Chemical class 0.000 description 1
- 239000011368 organic material Substances 0.000 description 1
- RXCVUXLCNLVYIA-UHFFFAOYSA-N orthocarbonic acid Chemical compound OC(O)(O)O RXCVUXLCNLVYIA-UHFFFAOYSA-N 0.000 description 1
- 230000008723 osmotic stress Effects 0.000 description 1
- 230000003647 oxidation Effects 0.000 description 1
- 238000007254 oxidation reaction Methods 0.000 description 1
- 230000004792 oxidative damage Effects 0.000 description 1
- FCJSHPDYVMKCHI-UHFFFAOYSA-N phenyl benzoate Chemical compound C=1C=CC=CC=1C(=O)OC1=CC=CC=C1 FCJSHPDYVMKCHI-UHFFFAOYSA-N 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 229910052698 phosphorus Inorganic materials 0.000 description 1
- 239000011574 phosphorus Substances 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 239000005648 plant growth regulator Substances 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 238000003752 polymerase chain reaction Methods 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 230000003334 potential effect Effects 0.000 description 1
- 238000002203 pretreatment Methods 0.000 description 1
- 101150074035 prk gene Proteins 0.000 description 1
- 230000012846 protein folding Effects 0.000 description 1
- 230000005855 radiation Effects 0.000 description 1
- 238000002708 random mutagenesis Methods 0.000 description 1
- 230000035484 reaction time Effects 0.000 description 1
- 238000003753 real-time PCR Methods 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 238000010992 reflux Methods 0.000 description 1
- 102000037983 regulatory factors Human genes 0.000 description 1
- 108091008025 regulatory factors Proteins 0.000 description 1
- 238000010405 reoxidation reaction Methods 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 230000001850 reproductive effect Effects 0.000 description 1
- 230000029058 respiratory gaseous exchange Effects 0.000 description 1
- 230000011506 response to oxidative stress Effects 0.000 description 1
- 230000003938 response to stress Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 108091000042 riboflavin kinase Proteins 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 108020002667 ribulokinase Proteins 0.000 description 1
- 101150005492 rpe1 gene Proteins 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 229940115939 shigella sonnei Drugs 0.000 description 1
- 239000002002 slurry Substances 0.000 description 1
- 239000000600 sorbitol Substances 0.000 description 1
- 230000002269 spontaneous effect Effects 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 230000035882 stress Effects 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 150000005846 sugar alcohols Chemical class 0.000 description 1
- 230000019635 sulfation Effects 0.000 description 1
- 238000005670 sulfation reaction Methods 0.000 description 1
- 229910052717 sulfur Inorganic materials 0.000 description 1
- 239000011593 sulfur Substances 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 230000002195 synergetic effect Effects 0.000 description 1
- YIBXWXOYFGZLRU-UHFFFAOYSA-N syringic aldehyde Natural products CC12CCC(C3(CCC(=O)C(C)(C)C3CC=3)C)C=3C1(C)CCC2C1COC(C)(C)C(O)C(O)C1 YIBXWXOYFGZLRU-UHFFFAOYSA-N 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- QURCVMIEKCOAJU-UHFFFAOYSA-N trans-isoferulic acid Natural products COC1=CC=C(C=CC(O)=O)C=C1O QURCVMIEKCOAJU-UHFFFAOYSA-N 0.000 description 1
- 230000005026 transcription initiation Effects 0.000 description 1
- 230000037426 transcriptional repression Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 238000000844 transformation Methods 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 238000009281 ultraviolet germicidal irradiation Methods 0.000 description 1
- 235000018322 upland cotton Nutrition 0.000 description 1
- 230000003827 upregulation Effects 0.000 description 1
- FGQOOHJZONJGDT-UHFFFAOYSA-N vanillin Natural products COC1=CC(O)=CC(C=O)=C1 FGQOOHJZONJGDT-UHFFFAOYSA-N 0.000 description 1
- 235000012141 vanillin Nutrition 0.000 description 1
- 235000013311 vegetables Nutrition 0.000 description 1
- 238000004065 wastewater treatment Methods 0.000 description 1
- 238000001238 wet grinding Methods 0.000 description 1
- 235000015099 wheat brans Nutrition 0.000 description 1
- 229920001221 xylan Polymers 0.000 description 1
- 150000004823 xylans Chemical class 0.000 description 1
- 108091022915 xylulokinase Proteins 0.000 description 1
- 239000010925 yard waste Substances 0.000 description 1
- 150000003751 zinc Chemical class 0.000 description 1
- 239000011701 zinc Substances 0.000 description 1
- 229910052725 zinc Inorganic materials 0.000 description 1
Landscapes
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
Disclosed is a recombinant yeast cell that functionally expresses: -a nucleic acid sequence encoding a native protein having transketolase activity (EC 2.2.1.1); and-a nucleic acid sequence encoding a heterologous protein (EC 2.2.1.1) having transketolase activity.
Description
Technical Field
The present invention relates to a recombinant yeast cell and a method for producing ethanol, wherein the recombinant yeast cell is used.
Background
Microbial fermentation processes are suitable for industrial production of a wide and rapidly expanding range of chemical compounds from renewable carbohydrate feedstocks. In particular in anaerobic fermentation processes, the cofactor pair NADH/NAD + The redox balance of (c) may impose significant limitations on product yield. An example of such a challenge is the formation of glycerol as a major byproduct in the industrial production of, for example, fuel ethanol from saccharomyces cerevisiae (Saccharomyces cerevisiae), which is a direct consequence of the need to reoxidize NADH formed in the biosynthetic reaction.
Ethanol production from Saccharomyces cerevisiae is currently the largest single fermentation process in industrial biotechnology on a volumetric basis. Various methods have been proposed to improve the fermentation properties of organisms used in industrial biotechnology by genetic modification. A significant challenge associated with the stoichiometry of yeast-based ethanol production is that large amounts of NADH-dependent byproducts (such as glycerol) are typically formed as byproducts, especially under anaerobic and oxygen-limited conditions or under conditions where respiration is otherwise limited or absent. It is estimated that in a typical industrial ethanol process, up to about 4wt.% of the sugar feedstock is converted to glycerol (Nissen et al, "Anaerobic and aerobic batch cultivations of Saccharomyces cerevisiae mutants impaired in glycerol Synthesis" [ "effect of anaerobic and aerobic batch culture of Saccharomyces cerevisiae mutants on glycerol synthesis" ], (2000), yeast [ Yeast ], volume 16, pages 463-474). Under conditions ideal for anaerobic growth, the conversion to glycerol may be even higher, up to about 10%.
Glycerol production under anaerobic conditions is mainly related to redox metabolism. During anaerobic growth of Saccharomyces cerevisiae (S. Cerevisiae), glycosylation occurs via alcoholic fermentation. In this process, NADH formed in the glycolytic glyceraldehyde-3-phosphate dehydrogenase reaction is produced by passing NAD + The dependent alcohol dehydrogenase converts acetaldehyde formed by decarboxylation of pyruvic acid into ethanol to be reoxidized. When NAD + The fixed stoichiometry of this redox-neutral catabolic pathway causes problems when the net reduction to NADH occurs elsewhere in the metabolism. Under anaerobic conditions, NADH reoxidation in Saccharomyces cerevisiae is strictly dependent on the reduction of sugar to glycerol. Glycerol formation is initiated by the reduction of dihydroxyacetone phosphate (DHAP), an intermediate of glycolysis, to glycerol 3-phosphate (glycerol-3P), a reaction which is produced by NAD + Dependent glycerol 3-phosphate dehydrogenase catalyzed. Subsequently, the glycerol 3-phosphate formed in this reaction is hydrolyzed by glycerol-3-phosphatase to produce glycerol and inorganic phosphate. Thus, glycerol is a major byproduct in the anaerobic production of ethanol from saccharomyces cerevisiae, which is undesirable because it reduces the overall conversion of sugar to ethanol. Furthermore, the presence of glycerol in the effluent of an ethanol production plant may increase the cost of wastewater treatment.
In the literature, several different methods have been reported that can help reduce the amount of by-product glycerol and shift the carbon to ethanol, resulting in an increase in ethanol yield per gram of fermented carbohydrate.
In WO 2011/010923, WO 2014/081803, WO 2014/129898, WO 2015/107496, WO 2015/148272, WO 2018/172328 and WO 2018/228836 several alternative reduction pathways have been proposed to reoxidize the generated NADH, also called alternative "redox sink", and have been implemented into (recombinant) yeast cells.
For example, in WO 2011/010923, NADH related by-product (glycerol) formation in a process for producing ethanol from a carbohydrate-containing feedstock is addressed by providing a recombinant yeast cell comprising one or more recombinant nucleic acid sequences encoding nad+ dependent acetylating acetaldehyde dehydrogenase (EC 1.2.1.10) activity. The cell may, for example, lack the enzymatic activity required for NADH dependent glycerol synthesis, or the cell may have reduced enzymatic activity in NADH dependent glycerol synthesis compared to its corresponding wild-type yeast cell.
WO 2014/129898 describes recombinant cells functionally expressing heterologous nucleic acid sequences encoding ribulose-1, 5-phosphate carboxylase/oxygenase (EC 4.1.1.39; abbreviated herein as "Rubisco") and optionally a chaperone of Rubisco and phosphoribulokinase (EC 2.7.1.19; abbreviated herein as "PRK"). Furthermore, the use of carbon dioxide as an electron acceptor in recombinant autotrophic microorganisms is mentioned.
Although the described methods and yeast cells are advantageous, continued improvement is desirable. In an industrial setting, the above reductions in glycerol production by recombinant yeast cells can potentially affect their tolerance to hypertonicity and their stress response to the external environment. Especially under challenging process conditions, for example when a fermentation medium with a high dry solids content and/or a high fermentation temperature is applied, this may lead to a decrease of the cell population and/or the cell activity at the end of the fermentation period. It would be an advance in the art to provide such methods and yeast cells for use therein, wherein the yeast cells have improved robustness under high dry solids/high dry matter conditions and/or high temperatures. Furthermore, it would be an advance in the art to provide yeast cells having reduced glucose accumulation and/or total sugar content within the yeast cells. That is, it would be an advance in the art to achieve sustained performance of yeast cells and/or low concentrations of residual glucose at the end of fermentation, even in the presence of high concentrations of glucose at the beginning of fermentation and/or throughout the fermentation process.
Disclosure of Invention
The inventors have now unexpectedly found that the methods and yeast cells of the prior art can be improved by using specific combinations of transketolase proteins.
Accordingly, the present invention provides a recombinant yeast cell that functionally expresses:
-a nucleic acid sequence encoding a native protein having transketolase activity; and
-a nucleic acid sequence encoding a heterologous protein having transketolase activity.
In addition, the present invention provides a method for producing ethanol, comprising transforming a carbon source (such as a carbohydrate or another organic carbon source) using the above recombinant yeast cells, thereby suitably forming ethanol.
Advantageously, the use of the above recombinant yeast cells and/or the above methods results in improved robustness. This is particularly advantageous when media with a high dry solids content are used and/or if high fermentation temperatures are used.
The process of producing ethanol from a carbon source, such as a carbohydrate, may advantageously be performed in the presence of a glycosylase, such as a glucoamylase, to convert polysaccharides and/or oligosaccharides to glucose. When the process is performed in a medium with a high dry matter content, for example after starting the process with a high concentration of corn mash, the concentration of glucose in the medium may become very high. Without wishing to be bound by any type of theory, it is believed that high concentrations of glucose may cause osmotic stress in the yeast cells, causing the yeast cells to cease to exhibit performance, even death.
Without wishing to be bound by any type of theory, it is believed that the above recombinant yeast cells allow for a reduced accumulation of glucose and/or other sugars within the yeast cells, thereby suitably allowing for improved robustness, compared to yeast cells that do not comprise a combination of the claimed transketolase proteins.
The advantages are illustrated by way of example. In the examples, the fermentation is carried out at a high dry matter content of 36% w/w. As demonstrated by the examples, the recombinant yeast cells according to the invention and the methods according to the invention allow for continuous performance of the yeast cells and/or continuous conversion of glucose. Recombinant yeast cells survived after 66 hours and converted carbohydrates to ethanol even in media containing glucose at concentrations up to 36% w/w and/or at temperatures up to 32 ℃. Thus, even in the case where a high concentration of glucose is present at the beginning of the fermentation and/or throughout the fermentation, a low concentration of residual glucose can be obtained at the end of the fermentation.
Description of sequence Listing
The present application contains a sequence listing in computer readable form, which is incorporated herein by reference. Table 1 below provides an overview.
Table 1: overview of the sequence listing:
in the context of the present patent application, each of the above protein/amino acid sequences is preferably encoded by a DNA/nucleic acid sequence optimized for expression in yeast, more preferably for expression in saccharomyces cerevisiae.
Detailed Description
Definition of the definition
Unless defined otherwise or clearly indicated by context, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art.
Throughout this specification and the claims which follow, the words "comprise" and "include" and variations such as "comprises", "comprising", "including" and "including" are to be interpreted as being inclusive. That is, where the context permits, these words are intended to convey that other elements or integers not specifically enumerated may be included.
The articles "a" and "an" are used herein to refer to the grammatical object of the article (i.e., one/one or at least one/at least one). For example, "an element" may mean one element/one element (one element) or more than one element/more than one element (more than one element). When referring to a noun (e.g., a compound, additive, etc.) in the singular, the plural is intended to be included. Thus, when referring to a particular portion (e.g., "a gene"), unless otherwise specified, this means "at least one" in the gene, e.g., "at least one gene".
When referring to a compound in which several isomers (e.g., D and L enantiomers) are present, the compound in principle includes all enantiomers, diastereomers and cis/trans isomers of the compound that may be used in certain aspects of the invention; in particular when referring to such a compound, it includes one or more of the natural isomers.
The various embodiments of the invention described herein may be cross-combined unless explicitly indicated otherwise.
The term "carbon source" refers to a source of carbon, preferably a compound or molecule comprising carbon. Preferably, the carbon source is a carbohydrate. Carbohydrates are understood herein as organic compounds consisting of carbon, oxygen and hydrogen. Suitably, the carbon source may be selected from the group consisting of: monosaccharides, disaccharides and/or polysaccharides, acids and acid salts. More preferably, the carbon source is a compound selected from the group consisting of: glucose, arabinose, xylose, galactose, mannose, rhamnose, fructose, glycerol and acetic acid or salts thereof.
The terms "dry matter" and "dry solids" (abbreviated as "DM" and "DS", respectively) are used interchangeably herein and refer to the material remaining after removal of water. Thus, the dry matter content may be determined by any method known to a person skilled in the art.
The term "fermentation" and variants thereof such as "fermentation" and/or "fermentation" are used herein in a classical sense, i.e. to indicate that the process is or has been performed under anaerobic conditions. Anaerobic fermentation is defined herein as fermentation performed under anaerobic conditions. Anaerobic conditions are defined herein as conditions that do not have any oxygen or that the yeast cells do not substantially consume oxygen. Conditions that do not substantially consume oxygen suitably correspond to less than 5mmol/l.h -1 Oxygen consumption of in particular less than 2.5mmol/l.h -1 Or less than 1mmol/l.h -1 Oxygen consumption of (2). More preferably, 0mmol/L/h is consumed (i.e., oxygen consumption is undetectable). This suitably corresponds to a dissolved oxygen concentration in the culture broth of less than 5% of the air saturation, more suitably less than 1% of the air saturation or less than 0.2% of the air saturation.
The term "fermentation process" refers to a process for preparing or producing a fermentation product.
The term "cell" refers to a eukaryotic organism or a prokaryotic organism, preferably present as a single cell. In the present invention, the cells are recombinant yeast cells. That is, the recombinant cell is selected from the group of genera consisting of yeasts.
The terms "yeast" and "yeast cell" are used interchangeably herein and refer to a group of phylogenetically diverse single-cell fungi, most of which belong to the Ascomycota (Ascomycota) and Basidiomycota (Basidiomycota). Budding yeasts ("true yeasts") are classified in Saccharomyces (Saccharomyces cerevisiae). The yeast cell according to the invention is preferably a yeast cell derived from Saccharomyces (Saccharomyces). More preferably, the yeast cell is a yeast cell of the species Saccharomyces cerevisiae.
As used herein, the term "recombinant" (e.g., references to "recombinant yeast," "recombinant cell," "recombinant microorganism," and/or "recombinant strain") refers to a yeast, cell, microorganism, or strain, respectively, that contains a nucleic acid as a result of one or more genetic modifications. Briefly, a yeast, cell, microorganism or strain contains different combinations of nucleic acids from one or more of its parents (any of them). To construct a recombinant yeast, cell, microorganism or strain, one or more recombinant DNA techniques and/or another one or more mutagenesis techniques may be used. For example, a recombinant yeast and/or recombinant yeast cell may comprise a nucleic acid that is not present in the corresponding wild-type yeast and/or cell into which the nucleic acid has been introduced using recombinant DNA techniques (i.e., a transgenic yeast and/or cell), or which is not present in the wild-type yeast and/or cell as a result of one or more mutations (e.g., using recombinant DNA techniques or another mutagenesis technique such as UV irradiation) in a nucleic acid sequence (such as a gene encoding a wild-type polypeptide) present in the wild-type yeast and/or yeast cell, or wherein the nucleic acid sequence of the gene has been modified to target the polypeptide product (encoding it) to another cellular compartment. Furthermore, the term "recombinant" may suitably relate to, for example, yeasts, cells, microorganisms or strains from which nucleic acid sequences have been removed using recombinant DNA techniques.
Recombinant yeast comprising or having some activity is understood herein as recombinant yeast may comprise one or more nucleic acid sequences encoding a protein having such activity. Thus, recombinant yeast are allowed to functionally express such proteins or enzymes.
The term "functionally express" means that there is functional transcription of the relevant nucleic acid sequence, allowing the nucleic acid sequence to be actually transcribed, for example resulting in the synthesis of a protein.
As used herein, the term "transgene" (e.g., reference to "transgenic yeast" and/or "transgenic cell") refers to a yeast and/or cell, respectively, that contains nucleic acids that do not naturally occur in the yeast and/or cell and that have been introduced into the yeast and/or cell using, for example, recombinant DNA techniques, such as recombinant yeast and/or cells.
The term "mutation" as used herein with respect to a protein or polypeptide means that at least one amino acid has been replaced with, inserted into, or deleted from a different amino acid sequence than the wild-type or naturally occurring protein or polypeptide sequence. Amino acid substitutions, insertions or deletions may be made, for example, by mutagenesis of the nucleic acid encoding the amino acid. Mutagenesis is a method well known in the art and includes site-directed mutagenesis, e.g., by means of PCR or via oligonucleotide-mediated mutagenesis, as described in: sambrook et al, molecular Cloning-ALaboratory Manual, molecular cloning-laboratory Manual, 2 nd edition, volumes 1-3 (1989), published by Cold Spring Harbor Publishing, cold spring harbor publication Co.).
The term "mutation" as used herein with respect to a gene means that at least one nucleotide in the nucleic acid sequence of the gene or its regulatory sequence has been replaced by a different nucleotide, inserted into the nucleic acid sequence or deleted from the nucleic acid sequence, as compared to the wild-type or naturally occurring nucleic acid sequence. Amino acid substitutions, insertions or deletions may be effected, for example, via mutagenesis, resulting in, for example, transcription of a protein sequence with qualitatively or quantitatively altered function or a knockout of the gene. In the context of the present invention, "altered gene" has the same meaning as a mutated gene.
As used herein, the term "gene" or "gene" refers to a nucleic acid sequence of an mRNA that can be transcribed into and then translated into a protein. A gene encoding a protein refers to one or more nucleic acid sequences encoding such a protein.
As used herein, the term "nucleic acid" or "nucleotide" refers to a monomeric unit in a deoxyribonucleotide or ribonucleotide polymer (i.e., polynucleotide) in either single-or double-stranded form, and unless otherwise limited, encompasses known analogs having the essential properties of natural nucleotides, as they hybridize to single-stranded nucleic acids (e.g., peptide nucleic acids) in a manner similar to naturally occurring nucleotides. For example, an enzyme defined by a nucleotide sequence encoding an enzyme includes (unless otherwise limited) a nucleotide sequence that hybridizes to a reference nucleotide sequence encoding the enzyme. The polynucleotide may be the full length or a subsequence of a native or heterologous structure or regulatory gene. Unless otherwise indicated, the term includes references to a specified sequence and its complement. Thus, DNA or RNA having a backbone modified for stability or other reasons is the term "polynucleotide" as contemplated herein. In addition, DNA or RNA comprising rare bases (such as inosine) or modified bases (such as tritylated bases), to name just two examples, is the term polynucleotide as used herein. It will be appreciated that a wide variety of modifications have been made to DNA and RNA for many useful purposes known to those skilled in the art. The term polynucleotide as used herein includes such chemically, enzymatically or metabolically modified forms of polynucleotides, as well as chemical forms of DNA and RNA that are characteristic of viruses and cells (including, inter alia, simple and complex cells).
The terms "nucleotide sequence" and "nucleic acid sequence" are used interchangeably herein. An example of a nucleic acid sequence is a DNA sequence.
The terms "polypeptide", "peptide" and "protein" are used interchangeably herein to refer to a polymer of amino acid residues, for example, as displayed by an amino acid sequence. These terms apply to amino acid polymers in which one or more amino acid residues are artificial chemical analogues of the corresponding naturally occurring amino acid, as well as naturally occurring amino acid polymers. An essential attribute of such analogs of naturally occurring amino acids is that when incorporated into a protein, the protein is specifically reactive to antibodies raised by proteins that are identical but consist entirely of naturally occurring amino acids. The terms "polypeptide", "peptide" and "protein" also include modifications including, but not limited to, glycosylation, lipid attachment, sulfation, gamma-carboxylation of glutamic acid residues, hydroxylation and ADP-ribosylation.
The term "enzyme" refers herein to a protein having a catalytic function. The terms "protein" and "enzyme" may be used interchangeably herein in the context of a protein catalyzing a biological reaction of some sort. When referring to Enzymes (EC), enzymes are a class in which enzymes are classified or may be classified according to enzyme nomenclature provided by the International Union of biochemistry and molecular biology Commission (the Nomenclature Committee of the International Union of Biochemistry and Molecular Biology, NC-IUBMB), which nomenclature may be found in http:// www.chem.qmul.ac.uk/IUBMB/enzyme. It is intended to include other suitable enzymes that have not been (yet) classified in a given class but may be so classified.
If a protein or nucleic acid sequence (such as a gene) is referred to herein by reference to an accession number, this number is used specifically to refer to a protein or nucleic acid sequence (gene) having a sequence that can be found via www.ncbi.nlm.nih.gov/(available 10/1/2020), unless otherwise specified.
Each nucleic acid sequence encoding a polypeptide herein also includes any conservatively modified variant thereof. By reference to the genetic code, this includes that it describes every possible silent variation of the nucleic acid. The term "conservatively modified variants" applies to both amino acid and nucleic acid sequences. With respect to specific nucleic acid sequences, conservatively modified variants refers to those nucleic acids which encode identical amino acid sequences or conservatively modified amino acid sequence variants due to the degeneracy of the genetic code. The term "degeneracy of the genetic code" refers to the fact that a large number of functionally identical nucleic acids encode any given protein. For example, both codons GCA, GCC, GCG and GCU encode the amino acid alanine. Thus, at each position where the codon specifies an alanine, the codon can be changed to any of the described corresponding codons without changing the encoded polypeptide. Such nucleic acid variations are "silent variations" and represent a conservatively modified variation.
As used herein, the term "functional homolog" (or simply "homolog") of a polypeptide and/or amino acid sequence having a particular sequence (e.g., "SEQ ID NO: X") refers to a polypeptide and/or amino acid sequence comprising said particular sequence, provided that one or more amino acids are mutated, substituted, deleted, added and/or inserted, and that the polypeptide has (qualitatively) the same enzymatic function for substrate conversion.
As used herein, the term "functional homolog" (or simply "homolog") of a polynucleotide and/or nucleic acid sequence having a particular sequence (e.g., "SEQ ID NO: X") refers to a polynucleotide and/or nucleic acid sequence comprising said particular sequence, provided that one or more nucleic acids are mutated, substituted, deleted, added and/or inserted, and that the polynucleotide encodes a polypeptide sequence having (qualitatively) the same enzymatic function for substrate conversion. With respect to nucleic acid sequences, the term functional homolog is intended to include nucleic acid sequences that differ from another nucleic acid sequence due to the degeneracy of the genetic code and that encode the same polypeptide sequence.
Sequence identity is defined herein as the relationship between two or more amino acid (polypeptide or protein) sequences or two or more nucleic acid (polynucleotide) sequences, as determined by comparing the sequences. Typically, sequence identity or similarity is compared over the entire length of the sequences being compared. "identity" also means in the art the degree of sequence relatedness between amino acid or nucleic acid sequences, as the case may be, as determined by the match between strings of such sequences.
Amino acid or nucleotide sequences are said to be homologous when they exhibit a certain level of similarity. The two sequences are homologous indicating a common evolutionary origin. Whether two homologous sequences are closely related or more distant related is indicated by a "percent identity" or a "percent similarity", which are high or low, respectively. Although controversial, to indicate "percent identity" or "percent similarity", "level of homology" or "percent homology" are often used interchangeably. Comparison of sequences and determination of percent identity between two sequences may be accomplished using a mathematical algorithm. The skilled artisan will appreciate the fact that several different computer programs are available for aligning two sequences and determining homology between the two sequences (Kruskal et al, "An overview of sequence comparison: time warp, string editions, and macromolecules" [ "overview of sequence comparisons: time warp, string edit and macromolecule" ], (1983), society for Industrial and Applied Mathematics (SIAM) [ Society of Industry and Application Mathematics (SIAM) ], volume 25, phase 2, pages 201-237 and the handbook edited by D.Sankoff and J.B.Kruskal, "Time warp, string edits and macromolecules: the theory and practice of sequence comparison" [ "Time warp, string edit and macromolecule: theory and practice of sequence comparison" ], (1983), pages 1-44, published by Addison-Wesley Publishing Company, massachusetts USA [ Aison-Disli, mass.).
The percentage identity between two amino acid sequences can be determined by aligning the two sequences using the niman (Needleman) and the Wunsch algorithm. (Needleman et al, "A General Method Applicable to the Search for Similarities in the Amino Acid Sequence of Two Proteins" [ "a general method suitable for finding similarity of amino acid sequences of two proteins" ] (1970) J.mol.biol. [ J. Mol. Biol. ] volume 48, pages 443-453). The algorithm aligns amino acid sequences and nucleotide sequences. The nidman-tumbler algorithm has been implemented in the computer program NEEDLE. For the purposes of the present invention, NEEDLE program from EMBOSS package (version 2.8.0 or higher, see Rice et al, "EMBOSS: the European Molecular Biology Open Software Suite" [ EMBOSS: european molecular biology open software suite ] (2000), trends in Genetics [ genetics trend ] (6) pages 276-277, http:// EMBOSS. Bioinformation. Nl /). For protein sequences, EBLOSUM62 was used as a substitution matrix. For the nucleotide sequence, EDNAFULL was used. Other matrices may be specified. The optional parameters for amino acid sequence alignment are a gap opening penalty of 10 and a gap expansion penalty of 0.5. The skilled person will appreciate that all of these different parameters will produce slightly different results, but that the overall percentage of identity of the two sequences does not change significantly when different algorithms are used.
Homology or identity is the percentage of identical matches between two complete sequences over the total alignment region including any gaps or extensions. Homology or identity between two aligned sequences is calculated as follows: the number of corresponding positions showing the same amino acid in both sequences in the alignment is divided by the total length of the alignment including gaps. IDENTITY as defined herein can be obtained from NEEDLE and is labeled "IDENTITY" in the output of the program.
Homology or identity between two aligned sequences is calculated as follows: the number of corresponding positions showing the same amino acid in both sequences in the alignment is divided by the total length of the alignment after subtracting the total number of gaps in the alignment. Identity as defined herein may be obtained from NEEDLE by using the NOBRIEF option and is marked as "longest identity" in the output of the program.
Variants of a nucleotide or amino acid sequence disclosed herein may also be defined as having one or more substitutions, insertions, and/or deletions compared to the nucleotide or amino acid sequence specifically disclosed herein (e.g., in the sequence listing).
Optionally, the skilled artisan may also consider so-called "conservative" amino acid substitutions in determining the degree of amino acid similarity, as will be clear to the skilled artisan. Conservative amino acid substitutions refer to the interchangeability of residues having similar side chains. For example, a group of amino acids having aliphatic side chains is glycine, alanine, valine, leucine and isoleucine; a group of amino acids having aliphatic-hydroxyl side chains are serine and threonine; a group of amino acids having amide-containing side chains are asparagine and glutamine; a group of amino acids having aromatic side chains are phenylalanine, tyrosine and tryptophan; a group of amino acids with basic side chains are lysine, arginine and histidine; and a group of amino acids having sulfur-containing side chains are cysteine and methionine. In one embodiment, the conservative amino acid substitution sets are: valine-leucine-isoleucine, phenylalanine-tyrosine, lysine-arginine, alanine-valine and asparagine-glutamine. A substitution variant of an amino acid sequence disclosed herein is a variant in which at least one residue in the disclosed sequence has been removed and a different residue inserted at its position. Preferably, the amino acid changes are conservative. In one embodiment, conservative substitutions for each naturally occurring amino acid are as follows: ala to Ser; arg to Lys; asn to gin or His; asp to Glu; cys to Ser or Ala; gln to Asn; glu to Asp; gly to Pro; his to Asn or Gln; ile to Leu or Val; leu to Ile or Val; lys to Arg; gln or Glu; met to Leu or Ile; phe to Met, leu, or Tyr; ser to Thr; thr to Ser; trp to Tyr; tyr to Trp or Phe; and Val to Ile or Leu.
The nucleotide sequences of the present invention may also be defined by their ability to hybridize under moderate hybridization conditions or, preferably, under stringent hybridization conditions, respectively, to portions of the specific nucleotide sequences disclosed herein. Stringent hybridization conditions are defined herein as conditions that allow nucleic acid sequences of at least about 25 nucleotides, preferably about 50, 75 or 100 nucleotides, most preferably about 200 or more nucleotides to hybridize at a temperature of about 65 ℃ in a solution comprising about 1M salt (preferably 6xSSC or any other solution having comparable ionic strength), and to wash at 65 ℃ in a solution comprising about 0.1M or less salt (preferably 0.2 xSSC or any other solution having comparable ionic strength). Preferably, hybridization is performed overnight, i.e., for at least 10 hours; and preferably the washing is carried out for at least one hour, wherein the washing solution is replaced at least twice. These conditions will typically allow specific hybridization of sequences having about 90% or greater sequence identity. Moderate conditions are defined herein as conditions that allow nucleic acid sequences of at least 50 nucleotides, preferably about 200 or more nucleotides, to hybridize in a solution comprising about 1M salt (preferably 6x SSC or any other solution having comparable ionic strength) at a temperature of about 45 ℃ and to wash in a solution comprising about 1M salt (preferably 6x SSC or any other solution having comparable ionic strength) at room temperature. Preferably, hybridization is performed overnight, i.e., for at least 10 hours; and preferably the washing is carried out for at least one hour, wherein the washing solution is replaced at least twice. These conditions will typically allow specific hybridization of sequences with up to 50% sequence identity. Those skilled in the art will be able to modify these hybridization conditions in order to specifically identify sequences that vary in identity between 50% and 90%.
"expression" refers to the transcription of a gene into structural RNA (rRNA, tRNA) or messenger RNA (mRNA) followed by translation into a protein.
By "overexpression" is meant that the expression of a gene (and correspondingly nucleic acid sequence) by a recombinant cell exceeds its expression in a corresponding wild-type cell. Such overexpression may be arranged, for example, by: increasing the frequency of transcription of one or more nucleic acid sequences, for example, by operably linking the nucleic acid sequences to a promoter functional in a recombinant cell; and/or by increasing the copy number of a nucleic acid sequence.
The terms "up", "up" and "up" refer to the process by which a cell increases the amount of a cellular component, such as RNA or a protein. Such upregulation may be responsive to or caused by a genetic modification.
The term "pathway" or "metabolic pathway" is understood herein as a series of chemical reactions in a cell that build and break down molecules.
The nucleic acid sequence (i.e., polynucleotide) or protein (i.e., polypeptide) may be native or heterologous to the genome of the host cell.
"native", "homologous" or "endogenous" with respect to a host cell means that the nucleic acid sequence does naturally occur in the genome of the host cell, or that the protein is naturally produced by the cell. The terms "natural," "homologous," and "endogenous" are used interchangeably herein.
As used herein, "heterologous" may refer to a nucleic acid sequence or a protein. For example, with respect to a host cell, "heterologous" may refer to a polynucleotide that does not naturally occur in the genome of the host cell in this manner, or a polypeptide or protein is not naturally produced by the cell in this manner. Heterologous nucleic acid sequences are nucleic acids derived from a foreign species or, if from the same species, have been substantially modified in composition and/or genomic locus relative to their native form by deliberate human intervention. For example, a promoter operably linked to a native structural gene is from a different species than the species from which the structural gene was derived, or if from the same species, one or both are substantially modified relative to their original form. Heterologous proteins may be derived from foreign species or, if from the same species, substantially modified with respect to their original form by deliberate human intervention. That is, heterologous protein expression relates to the expression of proteins that are not naturally expressed in the host cell in this manner. The term "heterologous expression" refers to expression of a heterologous nucleic acid in a host cell. Expression of heterologous proteins in eukaryotic host cell systems, such as yeast, is well known to those skilled in the art. Polynucleotides comprising a nucleic acid sequence encoding a gene for a protein or enzyme having a particular activity may be expressed in such eukaryotic systems. In some embodiments, the transformed/transfected cells may be used as an expression system for expressing enzymes. Expression of heterologous proteins in yeast is well known. Sherman, F.et al, methods in Yeast Genetics [ Yeast genetics methods ], (1986), published by Cold Spring Harbor Laboratory [ Cold spring harbor laboratory ] are well-known works describing a variety of methods that can be used to express proteins in yeast. Two widely used yeasts are Saccharomyces cerevisiae and Pichia pastoris. Vectors, strains and protocols for expression in Saccharomyces and Pichia (Pichia) are known in the art and available from commercial suppliers such as, for example, invitrogen. Suitable vectors typically have expression control sequences such as promoters (including 3-phosphoglycerate kinase or alcohol oxidase promoters), origins of replication, termination sequences, and the like, as desired.
As used herein, a "promoter" refers to a DNA sequence that directs transcription of a (structural) gene or other (partial) nucleic acid sequence. Suitably, the promoter is located in the 5' region of the gene, close to the transcription start site of the (structural) gene. The promoter sequence may be constitutive, inducible or repressible. In one embodiment, no (external) inducer is required.
As used herein, the term "vector" includes reference to an autosomal expression vector and an integration vector for integration into a chromosome.
The term "expression vector" refers to a linear or circular DNA molecule comprising a segment encoding a polypeptide of interest under the control of (i.e., operably linked to) an additional nucleic acid segment that provides for its transcription. Such additional segments may include promoter and terminator sequences, and may optionally include one or more origins of replication, one or more selectable markers, an enhancer, a polyadenylation signal, and the like. Expression vectors are typically derived from plasmid or viral DNA, or may contain elements of both. In particular, the expression vector comprises a nucleic acid sequence comprising and operably linked in the 5 'to 3' direction: (a) a yeast-recognized transcription and translation initiation region, (b) a coding sequence for a polypeptide of interest, and (c) a yeast-recognized transcription and translation termination region.
"plasmid" refers to autonomously replicating extra-chromosomal DNA that does not integrate into the genome of a microorganism and is typically circular in nature.
An "integrative vector" refers to a linear or circular DNA molecule that can be incorporated into the genome of a microorganism and provide stable inheritance of a gene encoding a polypeptide of interest. An integrative vector typically comprises one or more segments containing a gene sequence encoding the polypeptide of interest under the control of (i.e., operably linked to) an additional nucleic acid segment that provides for its transcription. Such additional segments may include promoter and terminator sequences, as well as one or more segments that drive the incorporation of the gene of interest into the genome of the target cell (typically by methods of homologous recombination). Typically, an integrative vector will be a vector that can be transferred into a target cell but has a replicon that is not functional in the organism. If appropriate markers are included in the segment, integration of the segment comprising the gene of interest may be selected.
A "host cell" is herein understood to be a cell, such as a yeast cell, which is transformed with one or more nucleic acid sequences encoding one or more heterologous proteins to construct a transformed cell (also referred to as a recombinant cell). For example, the transformed cells may contain a vector and may support replication and/or expression of the vector.
As used herein, "transformation" and "transformation" refer to insertion of an exogenous polynucleotide into a host cell, regardless of the method used for insertion, such as direct uptake, transduction, f-ligation, or electroporation. The exogenous polynucleotide may be maintained as a non-integrating vector (e.g., a plasmid), or alternatively may be integrated into the host cell genome. As used herein, "transformation" and "transformation" refer to the insertion of an exogenous polynucleotide (i.e., an exogenous nucleic acid sequence) into a host cell, regardless of the method used for insertion, such as direct uptake, transduction, f-ligation, or electroporation. The exogenous polynucleotide may be maintained as a non-integrating vector (e.g., a plasmid), or alternatively may be integrated into the host cell genome.
"constitutive expression (constitutive expression)" and "constitutive expression (constitutively expressing)" are understood herein to mean that there is a continuous transcription of the nucleic acid sequence. That is, the nucleic acid sequence is transcribed in a sustained manner. The constitutively expressed genes are always "on".
"anaerobic constitutive expression" is understood herein to mean that the nucleic acid sequence is constitutively expressed in the organism under anaerobic conditions. That is, under anaerobic conditions, the nucleic acid sequence is transcribed in a sustained manner, i.e., under such anaerobic conditions, the gene is always "on".
"disruption" is understood herein to mean any disruption of activity, including but not limited to deletion, mutation, and reduction of the affinity of disrupted genes and expression of RNAs complementary to such disrupted genes. It includes all nucleic acid modifications such as nucleotide deletions or substitutions, gene knockouts and other actions affecting translation or transcription of the corresponding polypeptide and/or affecting the (specific) activity of the enzyme, its substrate specificity and/or stability. It also includes modifications of the coding sequence or promoter of the gene that can be targeted. A gene disruption strain (disruptant) is a cell that has one or more disruptions of the corresponding gene. Naturally occurring in yeast is understood herein to mean that the gene is present in the yeast cell prior to disruption.
The term "encoding" has the same meaning as "encoding for". Thus, for example, the "gene or genes encoding a protein having activity X (one or more genes encoding a protein having activity X)" has the same meaning as the "gene or genes encoding a protein having activity X (one or more genes coding for a protein having activity X)".
In the case of a gene or nucleic acid sequence encoding a protein or enzyme, the phrase "nucleic acid sequence encoding X" (correspondingly "one or more nucleic acid sequences encoding X") (where X denotes a certain protein or (enzyme) activity) has the same meaning as "nucleic acid sequence encoding a protein with X activity" (correspondingly "one or more nucleic acid sequences encoding a protein with X activity"). Thus, for example, a "nucleic acid sequence or sequences encoding a transketolase" has the same meaning as "nucleic acid sequence or sequences encoding a protein having transketolase activity". As indicated above, the article "a" means "one or more".
By "redox sink" is herein understood a metabolic pathway that generally consumes or oxidizes NADH to NAD+ and/or prevents or reduces NAD+ consumption or reduction to NADH. The non-native metabolic pathway is a metabolic pathway that does not occur in the corresponding wild-type cell. Thus, the non-native metabolic pathway forming the redox sink is preferably a non-native metabolic pathway that increases NADH consumption and/or decreases NAD+ consumption compared to a corresponding wild-type yeast cell. By increasing NADH consumption and/or reducing NAD+ consumption, an (additional) unnatural redox sink can advantageously be produced within the cell.
The abbreviation "NADH" refers to the reduced hydrogenated form of nicotinamide adenine dinucleotide. The abbreviation "NAD+" refers to the oxidized form of nicotinamide adenine dinucleotide. Nicotinamide adenine dinucleotide can act as a so-called cofactor, assisting biochemical reactions and/or transformations in cells.
"NADH dependency" or "NAD+ dependency" is herein equivalent to NADH specificity, and "NADH dependency" or "NAD+ dependency" is herein equivalent to NADH specificity.
An "NADH-dependent" or "NAD+ -dependent" enzyme is herein understood to be an enzyme that, compared to other types of cofactors, depends only on NADH/NAD+ as cofactor or mainly on NADH/NAD+ as cofactor. An "NADH/NAD+ -only dependent" enzyme is herein understood to be an enzyme which has an absolute requirement for NADH/NAD+ relative to NADPH/NADP+. That is, it is active only when NADH/NAD+ is used as a cofactor. A "primary NADH/NDA+ dependent" enzyme is herein understood to be an enzyme having a higher specificity and/or a higher catalytic efficiency for NADH/NAD+ as cofactor than for NADPH/NADP+ as cofactor.
The specificity of an enzyme can be described by the following formula:
1<K m NADP + /K m NAD + <infinity (infinity)
Wherein K is m Is a so-called Mie constant.
For the primary NADH dependent enzyme, preferably K m NADP + /K m NAD + Between 1 and 1000, between 1 and 500, between 1 and 200, between 1 and 100, between 1 and 50, between 1 and 10, between 5 and 100, between 5 and 50, between 5 and 20, or between 5 and 10.
The K of the enzymes herein can be determined using known analytical techniques, calculations and protocols m Determined to be respectively directed to NAD + And enzyme specificity of NADP+. These are described, for example, in the following documents: loish et al Molecular Cell Biology [ molecular cell biology ] ]Edition 6, freeman, pages 80 and 81, e.g. FIGS. 3-22. For the primary NADH dependent enzymePreferably, the catalytic efficiency (k) for NADPH/NADP+ as cofactor cat /K m ) NADP+ And catalytic efficiency (k) for NADH/NAD+ as cofactor cat /K m ) NAD+ Is the ratio of (i.e., the catalytic efficiency ratio (k cat /K m ) NADP+ :(k cat /K m ) NAD+ ) Greater than 1:1, more preferably equal to or greater than 2:1, still more preferably equal to or greater than 5:1, even more preferably equal to or greater than 10:1, yet even more preferably equal to or greater than 20:1, even more preferably equal to or greater than 100:1, and most preferably equal to or greater than 1000:1. There is no upper limit, but for practical reasons the catalytic efficiency ratio (k cat /K m ) NADP+ :(k cat /K m ) NAD+ May be equal to or less than 1.000.000.000:1 (i.e., 1.10) 9 :1)。
Yeast cells
The recombinant yeast cell is preferably a yeast cell or is derived from a yeast cell from the genus Saccharomyces (Saccharomyces cerevisiae) or the genus Schizosaccharomyces (Schizosaccharomyces cerevisiae). That is, preferably, the host cell from which the recombinant yeast cell is derived is a yeast cell from the genus Saccharomyces or Schizosaccharomyces.
Examples of suitable yeast cells include Saccharomyces, such as Saccharomyces cerevisiae, saccharomyces cerevisiae (Saccharomyces eubayanus), saccharomyces jurei, saccharomyces pastorianus (Saccharomyces pastorianus), saccharomyces beticus, fermenting yeast (Saccharomyces fermentati), saccharomyces mirabilis (Saccharomyces paradoxus), saccharomyces vitis (Saccharomyces uvarum), and Saccharomyces bayanus (Saccharomyces bayanus).
Examples of suitable yeast cells further include Schizosaccharomyces (Schizosaccharomyces), such as Schizosaccharomyces pombe, schizosaccharomyces japan (Schizosaccharomyces japonicus), schizosaccharomyces octaspore (Schizosaccharomyces octosporus), and Schizosaccharomyces psychrophilum (Schizosaccharomyces cryophilus).
Other exemplary yeasts include the genus torulopsis (torularia), such as torulopsis delbrueckii (Torulaspora delbrueckii); kluyveromyces (Kluyveromyces) such as Kluyveromyces marxianus (Kluyveromyces marxianus); pichia, such as Pichia stipitis (Pichia stipitis), pichia pastoris, or Pichia angustifolia; bound yeasts such as Saccharomyces cerevisiae (Zygosaccharomyces bailii); brettanomyces, such as Brettanomyces (Brettanomyces inter medius); brettanomyces brucei (Brettanomyces bruxellensis), brettanomyces iso (Brettanomyces anomalus), brettanomyces bambusicola (Brettanomyces custersianus), brettanomyces naughty (Brettanomyces naardenensis), brettanomyces nanensis (Brettanomyces nanus), brettanomyces brucei (Dekkera bruxellensis) and Dekkera anomala; genus mergilmyces (Metschmkowia), genus isatchenkia (isatchenkia), such as isasatchen orientalis (Issatchenkia orientalis), genus klebsiella (Kloeckera), such as klebsiella citrifolia (Kloeckera apiculata); and Aureobasidium (Aureobasidium), such as Aureobasidium pullulans (Aureobasidium pullulans).
The yeast cell is preferably a yeast cell of the genus schizosaccharomyces (also referred to herein as a schizosaccharomyces yeast cell), or a yeast cell of the genus saccharomyces (also referred to herein as a saccharomyces yeast cell). More preferably, the yeast cell is a yeast cell derived from a Saccharomyces cerevisiae species (also referred to herein as a Saccharomyces cerevisiae cell). That is, preferably, the host cell from which the recombinant yeast cell is derived is a yeast cell from the species Saccharomyces cerevisiae.
Preferably, the yeast cell is an industrial yeast cell. The survival environment of yeast cells in industrial processes is significantly different from that in the laboratory. Industrial yeast cells must be capable of performing well under a variety of environmental conditions, which may vary in the process. Such changes include changes in nutrient sources, pH, ethanol concentration, temperature, oxygen concentration, etc., which together have potential effects on cell growth and ethanol production by yeast cells. Industrial yeast cells can be understood to refer to yeast cells having more robust properties when compared to laboratory counterparts. That is, when combined withIndustrial yeast cells exhibit less performance variation when one or more environmental conditions selected from the group of nutrient source, pH, ethanol concentration, temperature, oxygen concentration vary during fermentation than laboratory counterparts. Preferably, the yeast cells are constructed on the basis of industrial yeast cells as hosts, wherein the construction is performed as described below. An example of an industrial yeast cell is Ethanol (Fermentis, fremantis corporation)),>(Dissmann Co., ltd. (DSM)) and +.>(Raman company (Lallemand)).
The recombinant yeast cells described herein can be derived from any host cell capable of producing a fermentation product. Preferably, the host cell is a yeast cell, more preferably an industrial yeast cell as described above. Preferably, the yeast cells described herein are derived from host cells having the ability to produce ethanol.
Thus, the yeast cells described herein may be derived from host cells by any technique known to be suitable to those skilled in the art. Such techniques may include any one or more of mutagenesis, recombinant DNA techniques (including but not limited to CRISPR-CAS techniques), selective and/or adaptive evolution, conjugation, cell fusion, and/or cytokinesis between yeast strains. Suitably, one or more desired genes are incorporated into the yeast cell by a combination of one or more of the above techniques.
The recombinant yeast cells according to the invention are preferably inhibitor tolerant, i.e. they can withstand the common inhibitors at the level of common pretreatment and hydrolysis conditions they typically have, so that the recombinant yeast cells can be used in a wide range of applications, i.e. it has a high adaptability to different raw materials, different pretreatment methods and different hydrolysis conditions. In one embodiment, the recombinant yeast cell is inhibitor tolerant. Inhibitor tolerance is resistance to an inhibitory compound. The presence and level of inhibitory compounds in lignocellulose can vary widely with the feedstock, pretreatment process, hydrolysis process. Examples of inhibitor classes are carboxylic acids, furans and/or phenolic compounds. Examples of carboxylic acids are lactic acid, acetic acid or formic acid. Examples of furans are furfural and hydroxy-methylfurfural. Examples of phenolic compounds are vanillin (vanilin), syringic acid, ferulic acid and coumaric acid. Typical amounts of inhibitors, for carboxylic acids: up to 20 g/liter or more, depending on the feedstock, pretreatment and hydrolysis conditions. For furan: hundreds of milligrams per liter, up to several grams per liter, depending on the feedstock, pretreatment, and hydrolysis conditions. For phenols: up to a gram per liter, tens of milligrams per liter, depending on the starting materials, pretreatment and hydrolysis conditions.
In one embodiment, the recombinant yeast cell is a cell that is naturally capable of alcoholic fermentation, preferably anaerobic alcoholic fermentation. Recombinant yeast cells preferably have high tolerance to ethanol, low pH (i.e., capable of growing at a pH of less than about 5, about 4, about 3, or about 2.5), and organics, and/or high tolerance to elevated temperatures.
Natural transketolase
Recombinant yeast cells suitably express functionally:
-a nucleic acid sequence encoding a native protein having transketolase activity; and
-a nucleic acid sequence encoding a heterologous protein having transketolase activity.
Proteins having transketolase activity are also referred to herein as "transketolase proteins", "transketolase (transketolase enzyme)" or simply as "transketolase". "transketolase" is abbreviated herein as "TKL".
Similarly, a native protein having a transketolase activity is also referred to herein as a "native transketolase protein," "native transketolase (native transketolase enzyme)" or simply as a "native transketolase (native transketolase)".
Transketolase is an enzyme active in the pentose phosphate pathway of yeast cells. The gene encoding the pentose phosphate pathway is also referred to herein as the "PPP" gene. Preferably, references to the pentose phosphate pathway in this specification are to be understood as references to the non-oxidized part of the pentose phosphate pathway. Enzymes active in the pentose phosphate pathway include ribulose-5-phosphate isomerase (RKI), ribulose-5-phosphate epimerase (RPE), transketolase (TKL) and Transaldolase (TAL).
"transketolase" (belonging to the enzyme class EC 2.2.1.1) is defined herein as an enzyme that catalyzes the following reaction: d-ribose 5-phosphate +D-xylulose 5-sedoheptulose 7-phosphate +D-glyceraldehyde 3-phosphate;
and vice versa.
This enzyme is also known as trans-glycolaldehyde enzyme or sedoheptulose-7-phosphate D-glyceraldehyde-3-phosphate trans-glycolaldehyde enzyme. A certain transketolase may be further defined by its amino acid sequence. Likewise, a transketolase may be further defined by a nucleotide sequence encoding a transketolase. As explained in detail below under the definition above, a certain transketolase defined by a nucleotide sequence encoding an enzyme includes (unless otherwise limited) a nucleotide sequence that hybridizes to such a nucleotide sequence encoding a transketolase.
The nucleic acid sequence encoding a native protein having transketolase activity may itself be a homologous nucleic acid sequence, a heterologous nucleic acid sequence, or a mixture of homologous and heterologous nucleic acid sequences, provided that the protein encoded by such nucleic acid sequence is a native (i.e., endogenous) transketolase protein to the host cell. Saccharomyces cerevisiae is a preferred host cell. Thus, preferably, the recombinant yeast cell is a recombinant s.cerevisiae cell functionally expressing a nucleic acid sequence encoding a protein having transketolase activity, which is native to s.cerevisiae cell.
More preferably, the nucleic acid sequence encoding a native protein having transketolase activity is native to the host cell.
Thus, the recombinant yeast cell preferably expresses functionally:
-a native nucleic acid sequence encoding a native protein having transketolase activity; and
-a heterologous nucleic acid sequence encoding a heterologous protein having transketolase activity.
The wild-type yeast may comprise one or two native transketolase genes. In addition to the first polyketide gene "TKL1", some yeasts (such as, for example, saccharomyces cerevisiae) also comprise a paralogous gene "TKL2" (the second polyketide gene).
Suitably, the recombinant yeast cell according to the invention may comprise the native TKL1 gene and/or the native TKL2 gene.
That is, suitably, the recombinant yeast cell may comprise:
-a nucleic acid sequence encoding a natural TKL1 (e.g., gene "TKL 1"); or alternatively
-a nucleic acid sequence encoding a natural TKL2 (e.g., gene "TKL 2"); or alternatively
Both a nucleic acid sequence encoding a natural TKL1 (e.g., gene "TKL 1") and a nucleic acid sequence encoding a natural TKL2 (e.g., gene "TKL 2").
Preferably, the recombinant yeast cell comprises a natural nucleotide sequence encoding a natural transketolase TKL 1. That is, preferably, the recombinant yeast cell comprises a native TKL1 gene.
The recombinant yeast cell may comprise one or more copies (suitably in the range of from equal to or greater than 1 to equal to or less than 30 copies, preferably in the range of from equal to or greater than 1 to equal to or less than 20 copies) of the native gene encoding the transketolase. More preferably, the recombinant yeast cell comprises one, two, three, four, five, six, seven, eight, nine, ten, eleven or twelve copies of the native gene encoding the transketolase.
The recombinant yeast cell may be one in which the native nucleic acid sequence encoding the native protein having transketolase activity is under the control of a TKL promoter, as described in detail below.
As indicated above, host cells from saccharomyces cerevisiae species are preferred. The amino acid sequence of Saccharomyces cerevisiae's native transketolase 1 is shown by SEQ ID NO. 1. The natural nucleic acid sequence encoding transketolase 1 in Saccharomyces cerevisiae is shown by SEQ ID NO. 2.
Preferably, the native protein having transketolase activity comprises or consists of:
the amino acid sequence of SEQ ID NO. 1; or a functional homolog of SEQ ID NO. 1, which is suitably native to the host cell, having at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% sequence identity with the amino acid sequence of SEQ ID NO. 1.
Thus, preferably, the preferably natural nucleic acid sequence encoding the natural transketolase protein comprises or consists of:
-the nucleic acid sequence of SEQ ID No. 2; or alternatively
-a functional homolog of SEQ ID No. 2 encoding a native transketolase having at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% sequence identity to the nucleic acid sequence of SEQ ID No. 2.
More preferably, the recombinant yeast cell is a recombinant s.cerevisiae cell comprising (or consisting of, respectively, functionally expressing) a native protein having a transketolase activity, which protein comprises or consists of the amino acid sequence of SEQ ID NO. 1. Most preferably, the recombinant yeast cell is a recombinant Saccharomyces cerevisiae cell comprising (functionally expressed by, respectively) a nucleic acid sequence comprising or consisting of SEQ ID NO. 2.
Heterologous transketolase
As indicated above, the recombinant yeast cells suitably functionally express:
-a nucleic acid sequence encoding a native protein having transketolase activity; and
-a nucleic acid sequence encoding a heterologous protein having transketolase activity.
Heterologous proteins having transketolase activity are also referred to herein as "heterologous transketolase proteins", "heterologous transketolase (heterologous transketolase enzyme)" or simply as "heterologous transketolase (heterologous transketolase)".
The recombinant yeast cell suitably functionally expresses a heterologous nucleic acid sequence encoding a heterologous protein having transketolase activity.
In addition to the preferably natural nucleic acid sequence encoding a natural protein having a transketolase activity (correspondingly natural transketolase), such a heterologous nucleic acid sequence encoding a heterologous protein having a transketolase activity (correspondingly heterologous transketolase) is suitably present.
Preferably, the recombinant yeast cell comprises a heterologous nucleic acid sequence encoding a heterologous transketolase in addition to the native nucleic acid sequence encoding the native transketolase.
Such heterologous nucleic acid sequences encoding a transketolase are preferably under the control of a TKL promoter, as detailed below.
The recombinant yeast cell can comprise (and correspondingly functionally express) one or more heterologous nucleic acid sequences encoding one or more heterologous transketolase proteins. That is, the recombinant yeast cell can comprise one or more heterologous transketolase enzymes.
Preferably, the one or more heterologous transketolase comprises or consists of:
-an amino acid sequence of SEQ ID NO. 3, SEQ ID NO. 4, SEQ ID NO. 5, SEQ ID NO. 6, SEQ ID NO. 7, SEQ ID NO. 8, SEQ ID NO. 9, SEQ ID NO. 10, SEQ ID NO. 11, SEQ ID NO. 12, SEQ ID NO. 13, SEQ ID NO. 14, SEQ ID NO. 15, SEQ ID NO. 16, SEQ ID NO. 17 or SEQ ID NO. 19; or alternatively
-a functional homolog of SEQ ID NO 3, SEQ ID NO 4, SEQ ID NO 5, SEQ ID NO 6, SEQ ID NO 7, SEQ ID NO 8, SEQ ID NO 9, SEQ ID NO 10, SEQ ID NO 11, SEQ ID NO 12, SEQ ID NO 13, SEQ ID NO 14, SEQ ID NO 15, SEQ ID NO 16, SEQ ID NO 17 or SEQ ID NO 19 having at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 95% identity to the amino acid sequence of SEQ ID NO 3, SEQ ID NO 4, SEQ ID NO 5, SEQ ID NO 6, SEQ ID NO 7, SEQ ID NO 8, SEQ ID NO 9, SEQ ID NO 10, SEQ ID NO 11, SEQ ID NO 12, SEQ ID NO 13, SEQ ID NO 14, SEQ ID NO 15, SEQ ID NO 16, SEQ ID NO 17 or SEQ ID NO 19; or alternatively
-SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO 6, SEQ ID NO 7, SEQ ID NO 8, SEQ ID NO 9, SEQ ID NO 10, SEQ ID NO 11, SEQ ID NO 12, SEQ ID NO 13, SEQ ID NO 14, SEQ ID NO 15, SEQ ID NO 16, SEQ ID NO 17 or a functional homolog of SEQ ID NO 19 comprising an amino acid sequence having one or more mutations, substitutions, insertions and/or deletions when compared to the amino acid sequence of SEQ ID NO 3, SEQ ID NO 4, SEQ ID NO 5, SEQ ID NO 6, SEQ ID NO 7, SEQ ID NO 8, SEQ ID NO 9, SEQ ID NO 10, SEQ ID NO 11, SEQ ID NO 12, SEQ ID NO 13, SEQ ID NO 14, SEQ ID NO 15, SEQ ID NO 16, SEQ ID NO 17 or SEQ ID NO 19, wherein more preferably the amino acid sequence of SEQ ID NO 3, SEQ ID NO 4, SEQ ID NO 5, SEQ ID NO 6, SEQ ID NO 7, SEQ ID NO 8, SEQ ID NO 9, SEQ ID NO 10, SEQ ID NO 11, SEQ ID NO 12, SEQ ID NO 13, SEQ ID NO 14, SEQ ID NO 15, SEQ ID NO 17 or SEQ ID NO 19 does not exceed the amino acid sequence of No. 3, 4, 5, 10, 150, 10, 15, 16, 15, 17 or 19, 17, or 19, which does not exceed the amino acid sequence of any of the sequence, no more than 40, no more than 30, no more than 20, no more than 10, or no more than 5 amino acid mutations, substitutions, insertions, and/or deletions.
Preferably, the recombinant yeast cell comprises (is functionally expressed, respectively):
-one or more heterologous nucleic acid sequences encoding one or more amino acid sequences selected from the group consisting of: SEQ ID NO. 3, SEQ ID NO. 4, SEQ ID NO. 5, SEQ ID NO. 6, SEQ ID NO. 7, SEQ ID NO. 8, SEQ ID NO. 9, SEQ ID NO. 10, SEQ ID NO. 11, SEQ ID NO. 12, SEQ ID NO. 13, SEQ ID NO. 14, SEQ ID NO. 15, SEQ ID NO. 16, SEQ ID NO. 17 or SEQ ID NO. 19; and/or
-functional homologs thereof comprising a nucleic acid sequence having at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% sequence identity to any of those nucleic acid sequences; and/or
Functional homologues thereof comprising a nucleic acid sequence having one or more mutations, substitutions, insertions and/or deletions when compared to any of those nucleic acid sequences.
More preferably, the nucleic acid sequence of any such functional homolog has no more than 300, no more than 250, no more than 200, no more than 150, no more than 100, no more than 75, no more than 50, no more than 40, no more than 30, no more than 20, no more than 10, or no more than 5 nucleic acid mutations, substitutions, insertions, and/or deletions as compared to such nucleic acid sequence.
More preferably, the heterologous transketolase is derived from F.falciparum (a yeast species also known as "Pichia pastoris"). For example, preferably, the heterologous transketolase comprises or consists of: the amino acid sequence of SEQ ID NO. 3, SEQ ID NO. 4, SEQ ID NO. 16 or SEQ ID NO. 17; or a functional homolog thereof comprising an amino acid sequence that has at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% sequence identity to the amino acid sequence of SEQ ID NO. 3, SEQ ID NO. 4, SEQ ID NO. 16 or SEQ ID NO. 17; or a functional homolog thereof comprising an amino acid sequence having one or more mutations, substitutions, insertions and/or deletions when compared to the amino acid sequence of SEQ ID NO. 3, SEQ ID NO. 4, SEQ ID NO. 16 or SEQ ID NO. 17, wherein more preferably the amino acid sequence of any such functional homolog has NO more than 300, NO more than 250, NO more than 200, NO more than 150, NO more than 100, NO more than 75, NO more than 50, NO more than 40, NO more than 30, NO more than 20, NO more than 10 or NO more than 5 amino acid mutations, substitutions, insertions and/or deletions when compared to the amino acid sequence of SEQ ID NO. 3, SEQ ID NO. 4, SEQ ID NO. 16 or SEQ ID NO. 17.
In order to allow good expression of the heterologous transketolase in the host cell, it may be advantageous to use a heterologous transketolase that may have an amino acid sequence with equal to or greater than 30%, equal to or greater than 35%, equal to or greater than 40%, equal to or greater than 45%, equal to or greater than 50%, equal to or greater than 55%, equal to or greater than 60%, equal to or greater than 65%, equal to or greater than 70%, equal to or greater than 75%, equal to or greater than 80%, equal to or greater than 85%, equal to or greater than 90%, equal to or greater than 95%, equal to or greater than 98% or equal to or greater than 99% sequence identity to the amino acid sequence of the native transketolase of the host cell.
However, the heterologous transketolase may also preferably be a heterologous transketolase that is not regulated by a natural (i.e., endogenous) regulatory factor of the host cell. That is, preferably, the heterologous transketolase is one whose activity cannot be increased or decreased by a molecule naturally produced by the host cell. To avoid native regulatory factors, it may be advantageous to use a heterologous transketolase in the host cell, which heterologous transketolase may have an amino acid sequence having equal to or less than 99%, equal to or less than 98%, equal to or less than 95%, equal to or less than 90%, equal to or less than 85%, equal to or less than 80%, equal to or less than 75%, equal to or less than 70% or equal to or less than 65% sequence identity to the amino acid sequence of the native transketolase of the host cell.
Thus, more preferably, the heterologous transketolase comprises or consists of an amino acid sequence having a percentage of identity with the amino acid sequence of the native transketolase of the host cell within the following ranges: in the range of equal to or greater than 30% to equal to or less than 80%, more preferably in the range of equal to or greater than 30% or equal to or greater than 35% to equal to or less than 75%, and most preferably in the range of equal to or greater than 35% to equal to or less than 70% or even equal to or less than 65%.
More preferably, any heterologous nucleic acid sequence encoding a heterologous transketolase comprises or consists of a nucleic acid sequence having a percentage of identity with a nucleic acid sequence encoding a native transketolase of a host cell within the following ranges: in the range of equal to or greater than 30% to equal to or less than 80%, more preferably in the range of equal to or greater than 30% or equal to or greater than 35% to equal to or less than 75%, and most preferably in the range of equal to or greater than 35% to equal to or less than 70% or even equal to or less than 65%.
Host cells from Saccharomyces cerevisiae species are preferred. As indicated above, the amino acid sequence of the native transketolase 1 of Saccharomyces cerevisiae is shown by SEQ ID NO. 1 and the native nucleic acid sequence encoding transketolase 1 in Saccharomyces cerevisiae is shown by SEQ ID NO. 2.
Thus, the recombinant yeast cell is preferably a recombinant s.cerevisiae cell functionally expressing:
-a nucleic acid sequence encoding a native transketolase, wherein the native transketolase preferably comprises or consists of the amino acid sequence of SEQ ID No. 1; and
a nucleic acid sequence encoding a heterologous transketolase, wherein the heterologous transketolase preferably comprises or consists of an amino acid sequence having a sequence identity in the range of from equal to or more than 30% to equal to or less than 80%, more preferably in the range of from equal to or more than 30% or equal to or more than 35% to equal to or less than 75%, most preferably in the range of from equal to or more than 35% to equal to or less than 70% or even equal to or less than 65% to the amino acid sequence of SEQ ID NO 1.
Similarly, the recombinant yeast cell is preferably a recombinant s.cerevisiae cell functionally expressing:
-a native nucleic acid sequence encoding a native transketolase, wherein the native nucleic acid preferably comprises or consists of the nucleic acid sequence of SEQ ID No. 2; and
a heterologous nucleic acid sequence encoding a heterologous transketolase, wherein the heterologous nucleic acid sequence preferably comprises or consists of a nucleic acid sequence having a sequence identity in the range of from equal to or more than 30% to equal to or less than 80%, more preferably in the range of from equal to or more than 35% or equal to or more than 35% to equal to or less than 75%, most preferably in the range of from equal to or more than 35% to equal to or less than 70% or even equal to or less than 65% to the nucleic acid sequence of SEQ ID NO 2.
Most preferably, the recombinant yeast cell comprises a suitably heterologous nucleic acid sequence encoding a heterologous protein having transketolase activity, wherein the nucleic acid sequence comprises or consists of:
-the nucleic acid sequence of SEQ ID NO. 18 or SEQ ID NO. 20; or alternatively
-a functional homolog of SEQ ID No. 18 or SEQ ID No. 20 having at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% sequence identity with the nucleic acid sequence of SEQ ID No. 18 or SEQ ID No. 20; or alternatively
Functional homologs of SEQ ID No. 18 or SEQ ID No. 20 which have one or more mutations, substitutions, insertions and/or deletions when compared to the nucleic acid sequence of SEQ ID No. 20 or SEQ ID No. 22, more preferably NO more than 300, NO more than 250, NO more than 200, NO more than 150, NO more than 100, NO more than 75, NO more than 50, NO more than 40, NO more than 30, NO more than 20, NO more than 10 or NO more than 5 nucleic acid mutations, substitutions, insertions and/or deletions when compared to the nucleic acid sequence of SEQ ID No. 18 or SEQ ID No. 20.
The recombinant yeast cell can comprise one, two or more copies of a heterologous nucleic acid sequence (e.g., a heterologous gene) encoding a heterologous transketolase and/or one, two or more copies of a native nucleic acid sequence (e.g., a native gene) encoding a native transketolase.
Most preferably, the recombinant yeast cell can comprise one, two, three, four, five, six, seven, eight, nine, ten, eleven or twelve copies of a heterologous nucleic acid sequence (e.g., a heterologous gene) encoding a heterologous transketolase and/or one, two, three, four, five, six, seven, eight, nine, ten, eleven or twelve copies of a native nucleic acid sequence (e.g., a native gene) encoding a native transketolase.
Preferably, the recombinant yeast cell is a recombinant yeast cell comprising (functionally expressed in correspondence with):
-one, two or more copies of a nucleic acid sequence comprising or consisting of a nucleic acid sequence having a sequence identity in the range of from equal to or more than 30% to equal to or less than 80%, more preferably in the range of from equal to or more than 35% to equal to or less than 75%, most preferably in the range of from equal to or more than 35% to equal to or less than 70% or even equal to or less than 65% to a nucleic acid sequence encoding a native transketolase; and/or
-one, two or more copies of a nucleic acid sequence encoding any of the heterologous transketolase described above; and/or
-one, two or more copies of the nucleic acid sequence of SEQ ID NO. 18 and/or SEQ ID NO. 20; and/or
-one, two or more copies of a nucleic acid sequence having at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% sequence identity with the nucleic acid sequence of SEQ ID No. 18 and/or SEQ ID No. 20; and/or
One, two or more copies of a nucleic acid sequence having one or more mutations, substitutions, insertions and/or deletions compared to the nucleic acid sequence of SEQ ID NO. 18 and/or SEQ ID NO. 20, respectively, wherein more preferably the nucleic acid sequence has NO more than 300, NO more than 250, NO more than 200, NO more than 150, NO more than 100, NO more than 75, NO more than 50, NO more than 40, NO more than 30, NO more than 20, NO more than 10 or NO more than 5 nucleic acid mutations, substitutions, insertions and/or deletions compared to the nucleic acid sequence of SEQ ID NO. 18 and/or SEQ ID NO. 20, respectively.
Most preferably, the recombinant yeast cell is a recombinant saccharomyces cerevisiae cell comprising (functionally expressed by) the following:
i) One, two or more copies of the nucleic acid sequence of SEQ ID NO. 10; and
ii) one, two or more copies of the nucleic acid sequence of SEQ ID NO. 18 and/or SEQ ID NO. 20; and/or one, two or more copies of a nucleic acid sequence having at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% sequence identity to the nucleic acid sequence of SEQ ID NO. 18 and/or SEQ ID NO. 20; and/or one, two or more copies of a nucleic acid sequence having one or more mutations, substitutions, insertions and/or deletions compared to the nucleic acid sequence of SEQ ID NO. 18 and/or SEQ ID NO. 20, respectively, wherein more preferably the nucleic acid sequence has NO more than 300, NO more than 250, NO more than 200, NO more than 150, NO more than 100, NO more than 75, NO more than 50, NO more than 40, NO more than 30, NO more than 20, NO more than 10 or NO more than 5 nucleic acid mutations, substitutions, insertions and/or deletions compared to the nucleic acid sequence of SEQ ID NO. 18 and/or SEQ ID NO. 20, respectively.
Optional overexpression of one or more other enzymes of the PPP pathway
The recombinant yeast cell may further optionally comprise one or more genetic modifications in other PPP genes (i.e., RKI, RPE, and TAL) that increase the flux of the pentose phosphate pathway. Advantageously, this or such genetic modification may allow for a further increase in flux through the non-oxidized part of the pentose phosphate pathway.
Thus, the recombinant yeast cell may optionally comprise one or more additional genetic modifications to overexpress (the non-oxidized part of) one or more other enzymes of the pentose phosphate pathway. For example, a recombinant yeast cell can comprise one or more nucleic acid sequences to overexpress one or more enzymes selected from the group consisting of: ribulose-5-phosphate isomerase, ribulose-5-phosphate epimerase and transaldolase.
"ribulose 5-phosphate epimerase" (EC 5.1.3.1) is defined herein as an enzyme that catalyzes the epimerization of D-xylulose 5-phosphate to D-ribulose 5-phosphate (and vice versa). This enzyme is also known as ribulose phosphate epimerase; erythrose-4-phosphate isomerase; pentose phosphate 3-epimerase; xylulose phosphate 3-epimerase; pentose phosphate epimerase; ribulose 5-phosphate 3-epimerase; d-ribulose phosphate-3-epimerase; d-ribulose 5-phosphate epimerase; D-ribulose-5-P3-epimerase; d-xylulose-5-phosphate 3-epimerase; pentose-5-phosphate 3-epimerase; or D-ribulose-5-phosphate 3-epimerase. Ribulose 5-phosphate epimerase can be further defined by its amino acid sequence. Likewise, a ribulose 5-phosphate epimerase can be defined by a nucleotide sequence encoding the enzyme and a nucleotide sequence hybridizing to a reference nucleotide sequence encoding the ribulose 5-phosphate epimerase. The nucleotide sequence encoding ribulose 5-phosphate epimerase is referred to herein as RPE or RPE1.
"ribulose 5-phosphate isomerase" (EC 5.3.1.6) is defined herein as an enzyme that catalyzes the direct isomerisation of D-ribose 5-phosphate to D-ribulose 5-phosphate (and vice versa). This enzyme is also known as pentose phosphate isomerase; phosphoribosyl isomerase; ribose phosphate isomerase; 5-phosphoribosyl isomerase; d-ribose 5-phosphate isomerase; d-ribose-5-phosphate ketol-isomerase; or D-ribose-5-phosphate aldose-ketose-isomerase. Ribulose 5-phosphate isomerase may be further defined by its amino acid sequence. Likewise, a ribulose 5-phosphate isomerase may be defined by a nucleotide sequence encoding the enzyme and a nucleotide sequence hybridizing to a reference nucleotide sequence encoding the ribulose 5-phosphate isomerase. The nucleotide sequence encoding ribulose 5-phosphate isomerase is referred to herein as RKI or RKI1.
"transaldolase" (EC 2.2.1.2) is defined herein as an enzyme that catalyzes the reaction: sedoheptulose 7-phosphate + D-glyceraldehyde 3-phosphate < - > -D-erythrose 4-phosphate + D-fructose 6-phosphate and vice versa. This enzyme is also known as dihydroxyacetone transferase; dihydroxyacetone synthase; formaldehyde transketolase; or sedoheptulose-7-phosphate, D-glyceraldehyde-3-phosphoglyceromulotransferase. Transaldolase may be further defined by its amino acid sequence. Likewise, a transaldolase may be defined by a nucleotide sequence encoding the enzyme and a nucleotide sequence hybridizing to a reference nucleotide sequence encoding the transaldolase. The nucleotide sequence encoding a transketolase is referred to herein as TAL or TAL1.
TKL promoter
The recombinant yeast cell is preferably a recombinant yeast cell in which the nucleic acid sequence encoding the native protein having transketolase activity and/or the nucleic acid sequence encoding the heterologous protein having transketolase activity is under the control of a promoter ("TKL promoter") having an anaerobic/aerobic expression ratio of 2 or more for transketolase. By this is meant appropriately that the expression of heterologous and/or native transketolase ("TKL") under anaerobic conditions is at least 2-fold higher than under aerobic conditions. The above may alternatively be expressed as functionally expressing for a recombinant yeast cell a nucleic acid sequence encoding a native transketolase (corresponding to a heterologous transketolase) under the control of a promoter ("TKL promoter") whose TKL expression ratio Anaerobic/aerobic 2 or higher.
The TKL promoter may suitably be operably linked to a nucleic acid sequence encoding a protein having transketolase activity. Preferably, the TKL promoter is located in the 5' region of the TKL gene; more preferably, it is located close to the transcription initiation site of the TKL gene.
Preferably, the TKL promoter is ROX 1-inhibited. ROX1 is herein a heme-dependent repressor of one or more hypoxia genes; which mediate aerobic transcriptional repression of hypoxia-inducible genes such as COX5b and CYC 7; repressor function is regulated by decreasing promoter occupancy in response to oxidative stress; and contains HMG domains responsible for DNA binding activity; is involved in hypertonic stress resistance. ROX1 is regulated by oxygen.
Without wishing to be bound by any type of theory, it is believed that the regulation of ROX1 may function as follows: genomic analysis of anaerobic induction genes in Saccharomyces cerevisiae according to Kwat et al, "Genomic Analysis of Anaerobically induced genes in Saccharomyces cerevisiae: functional roles of ROX1 and other factors in mediating the anoxic response" [ ": the functional role of ROX1 and other factors in mediating hypoxia responses "] (2002), journal of bacteriology [ journal of bacteriology ], volume 184, phase 1, pages 250-265, incorporated herein by reference: "although Rox1 functions in an O2-dependent manner, its expression is oxygen (heme) -dependent, activated by heme-dependent transcription factor Hap1 [19]. Thus, as oxygen levels drop to a level that limits heme biosynthesis [20], ROX1 no longer transcribes [21], its protein level drops [22], and the gene it regulates de-represses.
Further details and suitable motifs are provided by: keng, T. (1992), "HAP1 and ROX1 form a regulatory pathway in the repression of HEM transcription in Saccharomyces cerevisiae" [ "HAP1 and ROX1 form a regulatory pathway in Saccharomyces cerevisiae that inhibits HEM13 transcription" ], mol.cell.biol. [ molecular and cell biology ] 12:2616-2623, and Ter Kinde and de Steensma, "A microarray-assisted screen for potential Hap1 and Rox1 target genes in Saccharomyces cerevisiae" [ "microarray-assisted screening of potential handle 1 and Rox1 target genes in Saccharomyces cerevisiae" ], (2002), yeast [ Yeast ] 19:825-840, are incorporated herein by reference.
Preferably, the TKL promoter comprises a ROX1 binding motif. The TKL promoter may suitably comprise one or more ROX1 binding motifs.
More preferably, the TKL promoter may comprise one or more copies of motif NNNATTGTTNNN in its nucleic acid sequence. In this context "N" represents a nucleic acid selected from the group consisting of: adenine (A), guanine (G), cytosine (C) and thymine (T). Such a motif is shown by SEQ ID NO. 21.
More preferably, the TKL promoter comprises or consists of: a nucleic acid sequence identical to the nucleic acid sequence of a preferably natural promoter of a gene selected from the list consisting of: FET4, ANB1, YHR048W, DAN1, AAC3, TIR2, DIP5, HEM13, YNR014W, YAR028W, FUN, COX5B, OYE2, SUR2, FRDS1, PIS1, LAC1, YGR035C, YAL028W, EUG1, HEM14, ISU2, ERG26, YMR252C and SML1, more preferably FET4, ANB1, YHR048W, DAN1, AAC3, TIR2, DIP5 and HEM13, or functional homologs thereof comprising a nucleic acid sequence having at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% sequence identity to the above. Reference herein to a native promoter refers to a promoter that is native to the host cell.
Preferably, the recombinant yeast cell is a recombinant saccharomyces cerevisiae cell; and preferably the TKL promoter is a natural promoter of a saccharomyces cerevisiae gene selected from the list consisting of: FET4, ANB1, YHR048W, DAN1, AAC3, TIR2, DIP5, HEM13, YNR014W, YAR028W, FUN, COX5B, OYE2, SUR2, FRDS1, PIS1, LAC1, YGR035C, YAL028W, EUG1, HEM14, ISU2, ERG26, YMR252C and SML1.
Additionally or in the alternative, the TKL promoter preferably comprises one or more copies of the following motifs in its nucleic acid sequence: TCGTTYAG and/or AAAAATTGTTGA. Herein, "Y" represents C or T. The AAAAATTGTTGA motif is shown by SEQ ID NO. 22.
The TKL promoter may also comprise or consist of a nucleic acid sequence which is identical to the nucleic acid sequence of a preferably natural promoter of a DAN, TIR or PAU gene. For example, the TKL promoter may suitably comprise or consist of: a nucleic acid sequence of a preferably natural promoter of a gene selected from the list consisting of: TIR2, DAN1, TIR4, TIR3, PAU7, PAU5, yl 064C, YGR W, DAN3, YIL176C, YGL261C, YOL161C, PAU1, PAU6, DAN2, YDR542W, YIR041W, YKL224C, PAU3, yl 025W, YOR394W, YHL046C, YMR325W, YAL068C, YPL282C, PAU, and PAU4, or functional homologs thereof, comprising a nucleic acid sequence having at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% sequence identity to the above. Reference herein to a native promoter refers to a promoter that is native to the host cell.
Preferably, the recombinant yeast cell is a recombinant saccharomyces cerevisiae cell; and preferably the TKL promoter is a natural promoter of a saccharomyces cerevisiae gene selected from the list consisting of: TIR2, DAN1, TIR4, TIR3, PAU7, PAU5, yl 064C, YGR294W, DAN3, YIL176C, YGL261C, YOL161C, PAU1, PAU6, DAN2, YDR542W, YIR041W, YKL 35224C, PAU3, yl 025W, YOR394W, YHL046C, YMR325W, YAL068C, YPL282C, PAU and PAU4.
More preferably, the TKL promoter may comprise or consist of: a sequence identical to the nucleic acid sequence of a preferably natural promoter of a gene selected from the list consisting of: TIR2, DAN1, TIR4, TIR3, PAU7, PAU5, yl 064C, YGR294W, DAN3, YIL176C, YGL261C, YOL161C, PAU1, PAU6, DAN2, YDR542W, YIR041W, YKL224C, PAU3, and yl 025W, or a functional homolog thereof comprising a nucleic acid sequence having at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% sequence identity to the above.
The nucleic acid sequence of the Saccharomyces cerevisiae ANB1 promoter is shown in SEQ ID NO. 23. The nucleic acid sequence of the Saccharomyces cerevisiae DAN1 promoter is shown in SEQ ID NO. 24.
Thus, a preferred TKL promoter may comprise or consist of:
-the nucleic acid sequence of SEQ ID NO. 23 or SEQ ID NO. 24; or alternatively
-a functional homolog of the nucleic acid sequence of SEQ ID No. 23 or SEQ ID No. 24, which functional homolog has at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% sequence identity with the nucleic acid sequence of SEQ ID No. 23 or SEQ ID No. 24; or alternatively
A functional homolog of the nucleic acid sequence of SEQ ID NO. 23 or SEQ ID NO. 24, which has one or more mutations, substitutions, insertions and/or deletions compared to the nucleic acid sequence of SEQ ID NO. 23 or SEQ ID NO. 24, wherein more preferably the nucleic acid sequence has NO more than 300, NO more than 250, NO more than 200, NO more than 150, NO more than 100, NO more than 75, NO more than 50, NO more than 40, NO more than 30, NO more than 20, NO more than 10 or NO more than 5 nucleic acid mutations, substitutions, insertions and/or deletions compared to the nucleic acid sequence of SEQ ID NO. 23 or SEQ ID NO. 24.
The TKL promoter may also be a synthetic oligonucleotide. That is, the TKL promoter may be a product of artificial oligonucleotide synthesis. Artificial oligonucleotide synthesis is a method in synthetic biology for the production of artificial oligonucleotides (such as genes) in the laboratory. Commercial gene synthesis services are now available from many companies around the world, some of which have established their business models around this task. Current methods of gene synthesis are most often based on a combination of organic chemistry and molecular biology techniques, and can synthesize the entire gene "de novo" without the need for a precursor template DNA.
TKL expression ratio of TKL promoter Anaerobic/aerobic Is 2 or higher, preferably 3 or higher, 4 or higher, 5 or higher, 6 or higher, 7 or higher, 8 or higher, 9 or higher, 10 or higher, 20 or higher, or 50 or higher. 2 or higher TKL expression ratio Anaerobic/aerobic Suitably it means that under further identical expression conditions, the expression of the transketolase ("TKL") under anaerobic conditions is at least 2 times higher than under aerobic conditions.
There is no upper limit, and the TKL promoter may be a TKL promoter that allows promotion of expression of the transketolase gene only under anaerobic conditions, not under aerobic conditions.
For practical reasons, it is considered that the ratio of the index 10 (i.e., 10) is equal to or greater than 2 to equal to or less than 10 10 ) Or to or below 10 index 4 (i.e., 10 4 ) TKL expression ratio in the range of (2) Anaerobic/aerobic 。
As indicated above, "expression" herein refers to transcription of a gene into structural RNA (rRNA, tRNA) or messenger RNA (mRNA) followed by translation into a protein.
The TKL expression ratio can be determined, for example, by measuring the amount of the Transketolase (TKL) protein of cells grown under aerobic and anaerobic conditions. The amount of TKL protein may be determined proteomic or any other method known to quantify the amount of protein.
The level or ratio of Transketolase (TKL) expression can also be determined by measuring the Transketolase (TKL) activity of cells grown under aerobic and anaerobic conditions (e.g., in cell-free extracts).
Additionally or alternatively to the above, the level or TKL expression ratio may be determined by measuring the transcript level of the TKL gene (e.g., as an amount of mRNA) in cells grown under aerobic and anaerobic conditions. The skilled artisan knows how to determine the level of translation using methods generally known in the art (e.g., Q-PCR, real-time PCR, northern blotting, RNA-seq).
The TKL promoter advantageously enables higher expression of the transketolase under anaerobic conditions than under aerobic conditions. In the method according to the invention, the recombinant yeast cell preferably expresses a transketolase, wherein the amount of transketolase expressed under anaerobic conditions is a multiple of the amount of transketolase expressed under aerobic conditions, and wherein the multiple is preferably 2 or more, more preferably 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, 9 or more, 10 or more, 20 or more or 50 or more.
Increased flux
Preferably, one or more genetic modifications to the PPP gene (i.e., with respect to TKL1 and optionally RKI, RPE, and TAL) such that the flux of the non-oxidized portion of the pentose phosphate pathway is increased are understood herein to mean a modification that increases the flux by at least about 1.1-fold, about 1.2-fold, about 1.5-fold, about 2-fold, about 5-fold, about 10-fold, or about 20-fold as compared to the flux in a genetically identical strain except for the genetic modification that increases the flux. The flux of the non-oxidized part of the pentose phosphate pathway can be measured by: xylose as sole carbonThe source grows the modified host, determines the specific xylose consumption rate, and if any xylitol is produced, subtracts the specific xylitol production rate from the specific xylose consumption rate. However, the flux of the non-oxidized portion of the pentose phosphate pathway is proportional to the growth rate with xylose as the sole carbon source, preferably proportional to the anaerobic growth rate with xylose as the sole carbon source. At a growth rate (. Mu.) with xylose as sole carbon source max ) There is a linear relationship with the flux of the non-oxidized part of the pentose phosphate pathway. Specific xylose consumption Rate (Q) s ) Equal to the growth rate (μ) divided by the biomass yield by sugar (Y xs ) Because biomass yield against sugar is constant (under a given set of conditions: anaerobic, growth medium, pH, genetic background of strain, etc.; namely Q s =μ/Y xs ). Thus, an increase in flux of the non-oxidized part of the pentose phosphate pathway can be deduced from an increase in maximum growth rate under these conditions, except for transport (uptake is limiting).
One or more genetic modifications that increase the flux of the pentose phosphate pathway can be introduced into a host cell in a variety of ways. These include, for example, achieving higher steady-state activity levels of one or more enzymes of the xylulokinase and/or non-oxidized partial pentose phosphate pathway and/or reduced steady-state levels of non-specific aldose reductase activity. These changes in steady state activity levels can be achieved by selection of mutants (spontaneous or induced by chemicals or radiation) and/or by recombinant DNA techniques (e.g. by overexpression or inactivation of genes encoding enzymes or factors regulating these genes, respectively).
In preferred host cells, the genetic modification comprises overexpression of at least one enzyme of the pentose phosphate pathway (a non-oxidized moiety). Preferably, the enzyme is selected from the group consisting of: enzymes encoding ribulose-5-phosphate isomerase, ribulose-5-phosphate epimerase, transketolase and transaldolase. Various combinations of enzymes of the pentose phosphate pathway can be overexpressed (non-oxidized moieties). For example, the overexpressed enzyme may be at least a ribulose-5-phosphate isomerase and a ribulose-5-phosphate epimerase; or at least a ribulose-5-phosphate isomerase and a transketolase; or at least ribulose-5-phosphate isomerase and transaldolase; or at least ribulose-5-phosphate epimerase and transketolase; or at least ribulose-5-phosphate epimerase and transaldolase; or at least a transketolase and a transaldolase; or at least ribulose-5-phosphate epimerase, transketolase and transaldolase; or at least ribulose-5-phosphate isomerase, transketolase and transaldolase; or at least a ribulose-5-phosphate isomerase, a ribulose-5-phosphate epimerase and a transaldolase; or at least ribulose-5-phosphate isomerase, ribulose-5-phosphate epimerase and transketolase. In one embodiment of the invention, each of the ribulose-5-phosphate isomerase, ribulose-5-phosphate epimerase, transketolase and transaldolase is overexpressed in a host cell. More preferred are host cells wherein the genetic modification comprises at least overexpression of both a transketolase and a transaldolase, as such host cells are already capable of anaerobic growth on xylose. In fact, under some conditions, host cells that overexpress only transketolase and transaldolase already have the same rate of anaerobic growth by xylose as host cells that overexpress all four enzymes (i.e., ribulose-5-phosphate isomerase, ribulose-5-phosphate epimerase, transketolase and transaldolase). Furthermore, host cells that overexpress both ribulose-5-phosphate isomerase and ribulose-5-phosphate epimerase are preferred over host cells that overexpress only isomerase or only epimerase, as overexpression of only one of these enzymes may create a metabolic imbalance.
Redox sink
Preferably, the recombinant yeast cell may further comprise one or more genetic modifications for functionally expressing proteins that play a role in metabolic pathways that form the unnatural redox sink.
For example, the one or more genetic modifications may be one or more genetic modifications for functional expression of one or more optionally heterologous nucleic acid sequences encoding one or more nad+/NADH dependent proteins, which one or more proteins play a role in the metabolic pathway that converts NADH to nad+. There are several examples of such metabolic pathways, as further shown below.
For example, "one or more genetic modifications for functionally expressing a protein that functions in a metabolic pathway that forms a non-native redox sink" may be selected from the group consisting of:
a) Comprising or consisting of one or more of the following genetic modifications:
-a preferably heterologous nucleic acid sequence encoding a protein comprising phosphoketolase activity (EC 4.1.2.9 or EC 4.1.2.22, pkl); and/or
-a preferably heterologous nucleic acid sequence encoding a protein (EC 2.3.1.8) having Phosphotransacetylase (PTA) activity; and/or
Preferably heterologous nucleic acid sequence encoding a protein with acetate kinase (ACK) activity (EC 2.7.2.12).
And/or
b) Comprising or consisting of one or more of the following genetic modifications:
-a preferably heterologous nucleic acid sequence encoding a protein having ribulose-1, 5-bisphosphate carboxylase oxygenase (Rubisco) activity; and/or
-a preferably heterologous nucleic acid sequence encoding a protein having Phosphoribulokinase (PRK) activity; and
-optionally, a preferably heterologous nucleic acid sequence encoding one or more chaperones of a protein having ribulose-1, 5-bisphosphate carboxylase oxygenase (Rubisco) activity;
and/or
c) Comprising or consisting of one or more of the following genetic modifications:
a preferably heterologous nucleic acid sequence encoding a protein comprising NADH dependent acetylating acetaldehyde dehydrogenase activity.
For example, WO 2014/081803 describes recombinant microorganisms expressing heterologous phosphoketolase, phosphotransacetylase or acetate kinase and bifunctional acetaldehyde-alcohol dehydrogenase, incorporated herein by reference; and WO 2015/148272 describes recombinant saccharomyces cerevisiae strains expressing heterologous phosphoketolase, phosphotransacetylase and acetylacetaldehyde dehydrogenase enzymes, incorporated herein by reference. Furthermore, WO 2018172328A1 describes recombinant cells that may comprise one or more (heterologous) genes encoding enzymes having phosphoketolase activity. The Phosphoketolase (PKL) pathway described in WO 2014/081803, WO 2015/148272 and WO 2018172328A1 (all incorporated herein by reference) provides a preferred metabolic pathway for converting NADH to nad+, and the NADH-dependent phosphoketolase described therein is a preferred NADH-dependent protein for use in the present invention.
Rubisco
As indicated above, the recombinant yeast cell may advantageously functionally express a nucleic acid sequence encoding ribulose-1, 5-phosphate carboxylase/oxygenase (EC 4.1.1.39; rubisco) and optionally one or more preferably heterologous chaperones of Rubisco.
More preferably, the recombinant yeast cell functionally expresses:
-a heterologous nucleic acid sequence encoding a protein having ribulose-1, 5-bisphosphate carboxylase oxygenase (Rubisco) activity; and/or
-a heterologous nucleic acid sequence encoding a protein having Phosphoribulokinase (PRK) activity; and/or
-optionally one or more heterologous nucleic acid sequences encoding one or more chaperones of a protein having ribulose-1, 5-bisphosphate carboxylase oxygenase (Rubisco) activity.
Proteins having ribulose-1, 5-bisphosphate carboxylase oxygenase (Rubisco) activity are also referred to herein as "ribulose-1, 5-bisphosphate carboxylase oxygenase (ribulose-1, 5-biphosphate carboxylase oxygenase)", "ribulose-1, 5-bisphosphate carboxylase oxygenase protein", "ribulose-1, 5-bisphosphate carboxylase oxygenase (ribulose-1, 5-biphosphate carboxylase oxygenase enzyme)", "Rubisco enzyme", "Rubisco protein" or simply "Rubisco". The ribulose-1, 5-bisphosphate carboxylase oxygenase can be further defined by its amino acid sequence. Likewise, the ribulose-1, 5-bisphosphate carboxylase oxygenase may be further defined by a nucleotide sequence encoding the ribulose-1, 5-bisphosphate carboxylase oxygenase. As explained in detail below in the definitions above, a certain ribulose-1, 5-bisphosphate carboxylase oxygenase defined by a nucleotide sequence encoding an enzyme includes (unless otherwise limited) a nucleotide sequence that hybridizes to such a nucleotide sequence encoding a ribulose-1, 5-bisphosphate carboxylase oxygenase. Preferred Rubisco proteins and nucleic acid sequences encoding such Rubisco proteins are as described in WO 2014/129898 (incorporated herein by reference).
The Rubisco protein may be suitably selected from the group of eukaryotic and prokaryotic Rubisco proteins. The Rubisco proteins are preferably from non-phototrophic organisms. For example, the Rubisco protein may be from a chemolithoautotrophic microorganism. Good results have been obtained with bacterial Rubisco proteins. Preferably, the Rubisco protein is derived from an inorganic autotrophic Thiobacillus (Thiobacillus), in particular Thiobacillus denitrificans.
The Rubisco protein may be a single subunit Rubisco protein or a Rubisco protein having more than one subunit. Preferably, the Rubisco protein is a single subunit Rubisco protein. Good results have been obtained with the use of the Rubisco protein as a so-called type II Rubisco protein. Particularly good results were obtained with the Rubisco protein encoded by the cbbM gene (also known as cbbM).
Preferred Rubisco proteins are the Rubisco proteins encoded by the cbbM gene from thiobacillus denitrificans. SEQ ID NO. 25 shows the amino acid sequence of a suitable Rubisco protein encoded by the cbbM gene from Thiobacillus denitrificans. SEQ ID NO. 26 shows the nucleic acid sequence of the cbbM gene from S.denitrificans, codon optimized for Saccharomyces cerevisiae.
Thus, preferably, the protein having ribulose-1, 5-bisphosphate carboxylase oxygenase (Rubisco) activity comprises or consists of:
-the amino acid sequence of SEQ ID No. 25; or alternatively
-a functional homolog of SEQ ID No. 25 having at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% sequence identity to the amino acid sequence of SEQ ID No. 25; or alternatively
Functional homologs of SEQ ID NO. 25 which have one or more mutations, substitutions, insertions and/or deletions when compared to the amino acid sequence of SEQ ID NO. 25, more preferably functional homologs having NO more than 300, NO more than 250, NO more than 200, NO more than 150, NO more than 100, NO more than 75, NO more than 50, NO more than 40, NO more than 30, NO more than 20, NO more than 10 or NO more than 5 amino acid mutations, substitutions, insertions and/or deletions when compared to the amino acid sequence of SEQ ID NO. 25.
Preferably, the nucleic acid sequence encoding a protein having ribulose-1, 5-bisphosphate carboxylase oxygenase (Rubisco) activity comprises or consists of:
-the nucleic acid sequence of SEQ ID NO. 26; or alternatively
-a functional homolog of SEQ ID No. 26 having at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% sequence identity to the nucleic acid sequence of SEQ ID No. 26; or alternatively
Functional homologs of SEQ ID NO. 26 which have one or more mutations, substitutions, insertions and/or deletions when compared to the nucleic acid sequence of SEQ ID NO. 26, more preferably functional homologs which have not more than 300, not more than 250, not more than 200, not more than 150, not more than 100, not more than 75, not more than 50, not more than 40, not more than 30, not more than 20, not more than 10 or not more than 5 nucleic acid mutations, substitutions, insertions and/or deletions when compared to the nucleic acid sequence of SEQ ID NO. 26.
Examples of other suitable Rubisco polypeptides and sources thereof are given in table 1 and table 2 below of WO 2014/129898 (incorporated herein by reference), with reference to sequence identity to the amino acid sequence of SEQ ID No. 25.
The nucleic acid sequence (e.g. gene) encoding a ribulose-1, 5-bisphosphate carboxylase oxygenase (Rubisco) protein can be suitably incorporated into the genome of a recombinant yeast cell, for example as described in the examples of WO 2014/129898 and the following articles: guadalupe-Medina et al, "Carbon dioxide fixation by Calvin-Cycle enzymes improves ethanol yield in yeast" [ "carbon dioxide fixation by Calvin cycle enzymes improves ethanol yield in yeast" ], published in Biotechnol, biofields [ biotech for biofuel ],2013, volume 6, page 125, both incorporated herein by reference.
Table 2: natural Rubisco polypeptides suitable for expression
As indicated above, the Rubisco protein is suitably functionally expressed in the recombinant yeast cell, at least during use in a fermentation process.
The nucleic acid sequence encoding the Rubisco protein may be present in one, two or more copies in a recombinant yeast cell. Without wishing to be bound by any type of theory, it is believed that the robustness of recombinant yeast cells is best when the nucleic acid sequence (e.g., gene) encoding the Rubisco protein is present in the recombinant yeast cells in less than 12 copies, more preferably less than 8 copies. Thus, preferably, the recombinant yeast cell comprises a nucleic acid sequence (e.g., a gene) encoding a Rubisco protein that ranges from equal to or greater than 1 copy, more preferably equal to or greater than 2 copies to equal to or less than 7 copies, more preferably equal to or less than 6 copies. The recombinant yeast cell can, for example, comprise one, two, three, four, five, six or seven copies of a nucleic acid sequence encoding a ribulose-1, 5-bisphosphate carboxylase oxygenase (Rubisco).
To increase the likelihood that the Rubisco protein is expressed in a transformed (recombinant) host cell of the invention at a sufficient level and in an active form, the nucleic acid sequences encoding the Rubisco protein and other proteins as described herein (see below) are preferably adapted to optimize their codon usage to that of the host cell in question. The adaptability of the nucleic acid sequence encoding the enzyme to the codon usage of the host cell may be expressed as a Codon Adaptation Index (CAI). Codon usage index is defined herein as a measure of the relative fitness of the codon usage of a gene to that of a gene highly expressed in a particular host cell or organism. The relative fitness (w) of each codon is the ratio of the use of each codon to the use of the highest abundance codon for the same amino acid. The CAI index is defined as the geometric mean of these relative fitness values. Non-synonymous codons and stop codons (depending on the genetic code) are excluded. CAI values range from 0 to 1, with higher values indicating higher ratios of most abundant codons (see Sharp and Li, "The codon adaptation index-a measure of directional synonymous codon usage bias, and its potential applications" [ "codon usage index-measure of directional synonymous codon usage bias and its potential use" ], (1987), published in Nucleic Acids Research [ nucleic acids research ] volume 15, pages 1281-1295; see also Jansen et al, "Revisiting the codon adaptation index from a whole-genome perspective: analyzing the relationship between gene expression and codon occurrence in yeast using a variety of models" [ "re-examine codon usage index from a genome-wide perspective" ], (2003), nucleic Acids Res "[ nucleic acids research ] volume 31 (8), pages 2242-51). The CAI of the adapted nucleic acid sequence is preferably at least 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8 or 0.9. Preferably, the sequence has been codon optimized for expression in the fungal host cell in question (e.g. like a s.cerevisiae cell).
Preferably, the functionally expressed Rubisco protein is dependent on ribulose-1, 5-bisphosphate 14 The rate of incorporation of C bicarbonate into the cell extract defines an activity of at least 1nmol.min -1 (mg protein) -1 In particular an activity of at least 2nmol. Min -1 (mg protein) -1 More particularly an activity of at least 4nmol. Min -1 (mg protein) -1 . The upper limit of activity is not critical. In practice, the activity may be about 200nmol.min -1 (mg protein) -1 Or lower, in particular 25nmol. Min -1 (mg protein) -1 More particularly 15nmol. Min -1 (mg protein) -1 Or less, e.g., about 10nmol. Min -1 (mg protein) -1 Or lower. The conditions for determining the assay of this Rubisco activity are as seen in the examples of WO 2014/129898 (incorporated herein by reference) (e.g., example 4).
Ribulokinase phosphate
Preferably, the recombinant yeast cell also functionally expresses a heterologous nucleic acid sequence encoding a protein having ribulose phosphate kinase (PRK) activity (EC 2.7.1.19; PRK).
Proteins having Phosphoribulokinase (PRK) activity are also referred to herein as "phosphoribulokinase proteins", "phosphoribulokinase (phosphoribulokinase enzyme)", "phosphoribulokinase (phosphoribulokinase)", "PRK enzymes", "PRK proteins" or simply "PRK". Preferred PRK proteins and nucleic acid sequences encoding such PRK proteins are as described in WO 2014/129898 (incorporated herein by reference).
Functionally expressed phosphoribulokinase (PRK, (EC 2.7.1.19)) according to the invention is capable of catalyzing the following chemical reactions:
thus, the two substrates of the enzyme are ATP and D-ribulose 5-phosphate; the two products are ADP and D-ribulose 1, 5-bisphosphate.
PRK proteins belong to the family of transferases, in particular transferases (phosphotransferases) which transfer phosphorus-containing groups with alcohol groups as acceptors. The systematic name of this enzyme is ATP, D-ribulose-5-phosphate 1-phosphotransferase. Other names commonly used include pentose phosphate kinase, ribulose-5-phosphate kinase, pentose phosphate kinase, ribulophosphate kinase (phosphorylated), ribulo5-phosphate kinase, ribulophosphate kinase, PKK, PRuK and PRK. PRK enzymes are involved in carbon fixation. The Phosphoribulokinase (PRK) protein may be further defined by its amino acid sequence. Likewise, a Phosphoribulokinase (PRK) protein may be further defined by a nucleotide sequence encoding a Phosphoribulokinase (PRK). As explained in detail below under the definition above, a certain Phosphoribulokinase (PRK) defined by a nucleotide sequence encoding an enzyme includes (unless otherwise limited) a nucleotide sequence that hybridizes to such a nucleotide sequence encoding a Phosphoribulokinase (PRK).
PRKs may be from prokaryotes or eukaryotes. Good results have been obtained with PRKs derived from eukaryotic organisms. Preferably, the PRK protein is derived from a plant selected from the order Caryophyllales (Caryophyllales), in particular from the family Amaranthaceae (Amaranthaceae), more in particular from the genus Spinacia (spindia).
Preferred PRK proteins are PRK proteins from the genus spinacia. SEQ ID NO. 27 shows the amino acid sequence of this PRK protein from the genus spinacia. SEQ ID NO. 28 shows the nucleic acid sequence of the prk gene from spinach-codon optimization for Saccharomyces cerevisiae.
Thus, preferably, the protein having Phosphoribulokinase (PRK) activity comprises or consists of:
-the amino acid sequence of SEQ ID No. 27; or alternatively
-a functional homolog of SEQ ID No. 27 having at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% sequence identity to the amino acid sequence of SEQ ID No. 27; or alternatively
Functional homologs of SEQ ID NO. 27 which have one or more mutations, substitutions, insertions and/or deletions when compared to the amino acid sequence of SEQ ID NO. 27, more preferably functional homologs having NO more than 300, NO more than 250, NO more than 200, NO more than 150, NO more than 100, NO more than 75, NO more than 50, NO more than 40, NO more than 30, NO more than 20, NO more than 10 or NO more than 5 amino acid mutations, substitutions, insertions and/or deletions when compared to the amino acid sequence of SEQ ID NO. 27.
Preferably, the nucleic acid sequence encoding a protein having Phosphoribulokinase (PRK) activity comprises or consists of:
-the nucleic acid sequence of SEQ ID NO. 28; or alternatively
-a functional homolog of SEQ ID No. 28 having at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% sequence identity to the nucleic acid sequence of SEQ ID No. 28; or alternatively
Functional homologs of SEQ ID NO. 28 which have one or more mutations, substitutions, insertions and/or deletions when compared to the nucleic acid sequence of SEQ ID NO. 28, more preferably functional homologs which have not more than 300, not more than 250, not more than 200, not more than 150, not more than 100, not more than 75, not more than 50, not more than 40, not more than 30, not more than 20, not more than 10 or not more than 5 nucleic acid mutations, substitutions, insertions and/or deletions when compared to the nucleic acid sequence of SEQ ID NO. 28.
The nucleic acid sequence (e.g. gene) encoding a protein having ribulose phosphate kinase (PRK) activity may be suitably incorporated into the genome of a recombinant yeast cell, for example as described in the examples of WO 2014/129898 (incorporated herein by reference).
Examples of suitable PRK polypeptides and sources thereof are given in Table 2 and Table 3 below of WO 2014/129898 (incorporated herein by reference), with reference to sequence identity to the amino acid sequence of SEQ ID NO. 27.
Table 3: natural PRK polypeptide suitable for expression and identity with PRK from spinach genus
The nucleic acid sequence encoding the PRK protein may be under the control of a promoter ("PRK promoter") that enables expression under anaerobic conditions to be higher than expression under aerobic conditions. Examples of such promoters are described in WO 2017/216136A1 and WO 2018/228836 (both incorporated herein by reference). More preferably, the PRK expression ratio of such a promoter is anaerobic/aerobic preferably 2 or higher, 3 or higher, 4 or higher, 5 or higher, 6 or higher, 7 or higher, 8 or higher, 9 or higher, 10 or higher, 20 or higher or 50 or higher. Other preferences are as described in WO 2018/228836 (incorporated herein by reference).
Rubiosco chaperones
Optionally, the recombinant yeast cell further comprises one or more, preferably heterologous, nucleic acid sequences encoding one or more chaperones for a protein having ribulose-1, 5-bisphosphate carboxylase oxygenase (Rubisco) activity.
Suitably, such chaperones are also referred to herein as "chaperones", "chaperones" or simply as "chaperones". Preferred chaperones and nucleic acid sequences encoding such chaperones are as described in WO 2014/129898 (incorporated herein by reference).
Preferably, the recombinant yeast cell comprises one or more heterologous nucleic acid sequences encoding one or more chaperones for a protein having ribulose-1, 5-bisphosphate carboxylase oxygenase (Rubisco) activity.
Chaperones are proteins that provide a favorable condition for proper folding of other proteins to prevent aggregation. The newly produced protein must typically be folded from a linear chain of amino acids into a three-dimensional form. Chaperones belong to a large class of molecules that assist in protein folding, called chaperones. The energy to fold the protein is provided by Adenosine Triphosphate (ATP). The following documents write a review article about chaperones that can be used herein: y beenes et al, "Chaperonins: two rings for folding" [ "chaperonin: two loops "] (2011) for folding, trends in Biochemical Sciences [ trends in biochemical science ], volume 36, phase 8, pages 424-432, incorporated herein by reference.
The one or more chaperones may be prokaryotic chaperones or eukaryotic chaperones. Furthermore, the chaperones may be homologous or heterologous. For example, the recombinant yeast cell may comprise one or more nucleic acid sequences encoding one or more homologous or heterologous prokaryotic or eukaryotic chaperones that, when expressed, are capable of functionally interacting with enzymes in the recombinant yeast cell, in particular with at least one of Rubisco and PRK.
Suitably, the one or more molecular chaperones are derived from bacteria, more preferably from Escherichia, in particular E.coli. Preferred molecular chaperones are GroEL and GroES from E.coli. Other preferred chaperones are chaperones from the genus Saccharomyces, in particular Saccharomyces cerevisiae Hsp10 and Hsp60.
If chaperones are naturally expressed in organelles such as mitochondria (examples are Hsp60 and Hsp10 of saccharomyces cerevisiae), repositioning to the cytosol may be achieved, for example, by modifying the natural signal sequence of chaperones. In eukaryotes, the proteins Hsp60 and Hsp10 are almost identical in structure and function to GroEL and GroES, respectively. Thus, it is contemplated that Hsp60 and Hsp10 from any recombinant yeast cell may be used as chaperones for Rubisco. This is described, for example, in the following documents: zeilstra-Ryalls et al, "The universally conserved GroE (Hsp 60) chaperonins" [ "GroE (Hsp 60) chaperonin, which is commonly conserved" ] (1991), annu Rev Microbiol [ annual. Microbiology ] volume 45, pages 301-325; horwire et al, "Two Families of Chaperonin: physiology and Mechanism" [ "two chaperone families: physiology and mechanisms "] (2007), annu.Rev.cell.Dev.biol [ annual assessment of cell and developmental biology ] volume 23, pages 115-145, both of which are incorporated herein by reference.
Good results have been obtained with recombinant yeast cells comprising both heterologous chaperones GroEL and GroES.
As an alternative to GroES, there may be functional homologs of GroES, in particular functional homologs comprising an amino acid sequence having at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% sequence identity with the amino acid sequence of GroES (corresponding to the amino acid sequence of SEQ ID NO: 31).
SEQ ID NO. 31 provides a preferred translated protein sequence for GroES based on E.coli. SEQ ID NO. 32 provides a synthetic nucleic acid sequence based on GroES from E.coli, codon optimized for expression in Saccharomyces cerevisiae.
Examples of suitable natural chaperone polypeptides homologous to GroES are given in Table 4.
Table 4: natural molecular chaperones homologous to GroES polypeptide suitable for expression
As alternatives to GroEL, functional homologs of GroEL may be present, in particular functional homologs comprising an amino acid sequence having at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% sequence identity with the amino acid sequence of GroEL (corresponding amino acid sequence of SEQ ID NO: 29).
SEQ ID NO. 29 provides a preferred translated protein sequence for GroEL based on E.coli. SEQ ID NO. 30 provides a synthetic nucleic acid sequence based on GroEL from E.coli, codon optimized for expression in Saccharomyces cerevisiae.
Suitable natural chaperone polypeptides homologous to GroEL are given in Table 5.
Table 5: natural molecular chaperones homologous to GroEL polypeptides suitable for expression
The recombinant yeast cell preferably comprises (respectively functionally expresses) a GroES chaperone and a GroEL chaperone. Preferably, the 10kDa chaperone ("GroES") from Table 4 is combined with a matched 60kDa chaperone ("GroEL") from the same organism genus or species of Table 5 for expression in recombinant yeast cells.
For example: 71-168 10kDa chaperone [ Pyrenophora repentis ] are expressed with matched > gi|189189366|ref|XP_00193155.1| heat shock protein 60, mitochondrial, precursor [ Pyrenophora repentis Pt-1C-BFP ]. All other combinations from tables 4 and 5, similarly produced from the same organism source, are also available to the skilled artisan for expression. Furthermore, chaperones from one organism from table 4 may be combined with chaperones from another organism from table 5, or GroES may be combined with chaperones from table 5, or GroEL may be combined with chaperones from table 4.
Thus, preferably, the one or more chaperones comprise or consist of:
-the amino acid sequence of SEQ ID NO. 29 and/or SEQ ID NO. 31; or alternatively
-one or more functional homologs of SEQ ID No. 29 and/or SEQ ID No. 31 having at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% sequence identity with the amino acid sequence of SEQ ID No. 29 and/or SEQ ID No. 31, respectively; or alternatively
One or more functional homologs of SEQ ID NO. 29 and/or SEQ ID NO. 31 which have one or more mutations, substitutions, insertions and/or deletions when compared to the amino acid sequence of SEQ ID NO. 29 and/or SEQ ID NO. 31, respectively, more preferably one or more functional homologs which have not more than 300, not more than 250, not more than 200, not more than 150, not more than 100, not more than 75, not more than 50, not more than 40, not more than 30, not more than 20, not more than 10 or not more than 5 amino acid mutations, substitutions, insertions and/or deletions when compared to the amino acid sequence of SEQ ID NO. 29 and/or SEQ ID NO. 31, respectively.
Preferably, the one or more nucleic acid sequences encoding a chaperone comprise or consist of:
-the nucleic acid sequence of SEQ ID NO. 30 and/or SEQ ID NO. 32; or alternatively
-one or more functional homologs of SEQ ID No. 30 and/or SEQ ID No. 32 having at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% sequence identity with the nucleic acid sequence of SEQ ID No. 30 and/or SEQ ID No. 32, respectively; or alternatively
One or more functional homologs of SEQ ID NO. 30 and/or SEQ ID NO. 32 which have one or more mutations, substitutions, insertions and/or deletions when compared to the nucleic acid sequence of SEQ ID NO. 30 and/or SEQ ID NO. 32, respectively, more preferably one or more functional homologs of not more than 300, not more than 250, not more than 200, not more than 150, not more than 100, not more than 75, not more than 50, not more than 40, not more than 30, not more than 20, not more than 10 or not more than 5 nucleic acid mutations, substitutions, insertions and/or deletions when compared to the nucleic acid sequence of SEQ ID NO. 30 and/or SEQ ID NO. 32, respectively.
One or more nucleic acid sequences encoding a chaperone may be suitably incorporated into the genome of a recombinant yeast cell, for example as described in the examples of WO 2014/129898 (incorporated herein by reference).
Phosphoketolase
As indicated above, the recombinant yeast cell may advantageously comprise a preferably heterologous nucleic acid sequence encoding a protein (EC 4.1.2.9 or EC 4.1.2.22) comprising Phosphoketolase (PKL) activity and/or a preferably heterologous nucleic acid sequence encoding a protein (EC 2.3.1.8) having Phosphotransacetylase (PTA) activity and/or a preferably heterologous nucleic acid sequence encoding a protein (EC 2.7.2.12) having acetate kinase (ACK) activity.
The recombinant cells may comprise one or more heterologous genes encoding a protein having phosphoketolase activity. Such proteins having phosphoketolase activity are also referred to herein as "phosphoketolase proteins", "phosphoketolase (phosphoketolase enzyme)" or simply as "phosphoketolase (phosphoketolase)". Phosphoketolase is further abbreviated herein as "PKL" or "XFP".
As used herein, phosphoketolase catalyzes at least the conversion of D-xylulose 5-phosphate to D-glyceraldehyde 3-phosphate and acetyl phosphate. Phosphoketolase participates in at least one of the following reactions:
EC 4.1.2.9:
EC 4.1.2.22:
Suitable enzyme assays for measuring phosphoketolase activity are described, for example, in the following documents: sondereger et al, "Metabolic Engineering of a Phosphoketolase Pathway for Pentose Catabolismin Saccharomyces cerevisiae" [ "metabolic engineering of the phosphoketolase pathway of pentose catabolism in Saccharomyces cerevisiae" ] (2004), applied & Environmental Microbiology [ application and environmental microbiology ], volume 70 (5), pages 2892-2897, incorporated herein by reference.
Preferably, the protein having Phosphoketolase (PKL) activity comprises or consists of:
-the amino acid sequence of SEQ ID NO. 33, SEQ ID NO. 34, SEQ ID NO. 35 or SEQ ID NO. 36; or alternatively
-a functional homolog of SEQ ID No. 33, SEQ ID No. 34, SEQ ID No. 35 or SEQ ID No. 36 having at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% sequence identity to the amino acid sequence of SEQ ID No. 33, SEQ ID No. 34, SEQ ID No. 35 or SEQ ID No. 36; or alternatively
A functional homolog of SEQ ID NO. 33, SEQ ID NO. 34, SEQ ID NO. 35 or SEQ ID NO. 36 having one or more mutations, substitutions, insertions and/or deletions when compared to the amino acid sequence of SEQ ID NO. 33, SEQ ID NO. 34, SEQ ID NO. 35 or SEQ ID NO. 36, more preferably NO more than 300, NO more than 250, NO more than 200, NO more than 150, NO more than 100, NO more than 75, NO more than 50, NO more than 40, NO more than 30, NO more than 20, NO more than 10 or NO more than 5 amino acid mutations, substitutions, insertions and/or deletions when compared to the amino acid sequence of SEQ ID NO. 33, SEQ ID NO. 34, SEQ ID NO. 35 or SEQ ID NO. 36.
Suitable nucleic acid sequences encoding phosphoketolase proteins can be found in organisms selected from the group consisting of: aspergillus niger, neurospora crassa, lactobacillus casei (L.casei), lactobacillus plantarum (L.plantarum), lactobacillus plantarum, bifidobacterium adolescentis (B.adolescentis), bifidobacterium bifidum (B.bifidum), bifidobacterium homolupulum (B.gallicum), bifidobacterium animalis (B.analalis), bifidobacterium lactis (B.lactis), lactobacillus pentosus (L.pentosum), lactobacillus acidophilus (L.acidophilus), penicillium chrysogenum (P.chrysogenum), aspergillus nidulans (A.nidulans), aspergillus clavatus (A.clavatus), leuconostoc mesenteroides (L.messenteroides) and wine coccus (O.oenii).
The nucleic acid sequence (e.g., gene) encoding a protein having Phosphoketolase (PKL) activity may be suitably incorporated into the genome of a recombinant yeast cell.
The recombinant cells may comprise one or more (heterologous) genes encoding enzymes having phosphoketolase activity.
Phosphotransacetylase
As indicated above, the recombinant yeast cell may advantageously comprise a preferably heterologous nucleic acid sequence encoding a protein (EC 4.1.2.9 or EC 4.1.2.22) comprising Phosphoketolase (PKL) activity and/or a preferably heterologous nucleic acid sequence encoding a protein (EC 2.3.1.8) having Phosphotransacetylase (PTA) activity and/or a preferably heterologous nucleic acid sequence encoding a protein (EC 2.7.2.12) having acetate kinase (ACK) activity.
As used herein, phosphotransacetylase catalyzes at least the conversion of acetyl phosphate to acetyl-coa.
The recombinant cells may comprise one or more heterologous genes encoding a protein having phosphotransacetylase activity. Such a protein having phosphotransacetylase activity is also referred to herein as "phosphotransacetylase protein", "phosphotransacetylase (phosphotransacetylase enzyme)" or simply "phosphotransacetylase". Phosphotransacetylase is further abbreviated herein as "PTA".
Preferably, the protein having Phosphotransacetylase (PTA) activity comprises or consists of:
-the amino acid sequence of SEQ ID NO. 37, SEQ ID NO. 38, SEQ ID NO. 39 or SEQ ID NO. 40; or alternatively
-a functional homolog of SEQ ID No. 37, SEQ ID No. 38, SEQ ID No. 39 or SEQ ID No. 40 having at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% sequence identity to the amino acid sequence of SEQ ID No. 37, SEQ ID No. 38, SEQ ID No. 39 or SEQ ID No. 40; or alternatively
-a functional homolog of SEQ ID No. 37, SEQ ID No. 38, SEQ ID No. 39 or SEQ ID No. 40 having one or more mutations, substitutions, insertions and/or deletions when compared to the amino acid sequence of SEQ ID No. 37, SEQ ID No. 38, SEQ ID No. 39 or SEQ ID No. 40, more preferably NO more than 300, NO more than 250, NO more than 200, NO more than 150, NO more than 100, NO more than 75, NO more than 50, NO more than 40, NO more than 30, NO more than 20, NO more than 10 or NO more than 5 amino acid mutations, substitutions, insertions and/or deletions when compared to the amino acid sequence of SEQ ID No. 37, SEQ ID No. 38, SEQ ID No. 39 or SEQ ID No. 40.
Suitable nucleic acid sequences encoding enzymes having phosphotransacetylase can be found in organisms selected from the group consisting of: bifidobacterium adolescentis, bacillus subtilis, clostridium defibricum, clostridium phytofermentans, bifidobacterium bifidum, bifidobacterium animalis, leuconostoc mesenteroides, lactobacillus plantarum, myceliophthora thermophila (m. Thermophila) and oenococcus.
The nucleic acid sequence (e.g., gene) encoding a protein having Phosphotransacetylase (PTA) activity may be suitably incorporated into the genome of a recombinant yeast cell.
Acetate kinase
As indicated above, the recombinant yeast cell may comprise a preferably heterologous nucleic acid sequence encoding a protein (EC 4.1.2.9 or EC 4.1.2.22) comprising Phosphoketolase (PKL) activity and/or a preferably heterologous nucleic acid sequence encoding a protein (EC 2.3.1.8) having Phosphotransacetylase (PTA) activity and/or a preferably heterologous nucleic acid sequence encoding a protein (EC 2.7.2.12) having acetate kinase (ACK) activity.
As used herein, acetate kinase catalyzes at least the conversion of acetic acid to acetyl phosphate.
The recombinant cell may comprise one or more preferably heterologous genes encoding a protein having acetate kinase activity (EC 2.7.2.12). Such proteins having acetate kinase activity are also referred to herein as "acetate kinase proteins", "acetate kinase (acetate kinase enzyme)" or simply "acetate kinase". Acetate kinase is further abbreviated herein as "ACK".
Preferably, the protein having acetate kinase (ACK) activity comprises or consists of:
-the amino acid sequence of SEQ ID NO. 41 or SEQ ID NO. 42; or alternatively
-a functional homolog of SEQ ID No. 41 or SEQ ID No. 42 having at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% sequence identity with the amino acid sequence of SEQ ID No. 41 or SEQ ID No. 42; or alternatively
Functional homologs of SEQ ID No. 41 or SEQ ID No. 42 which have one or more mutations, substitutions, insertions and/or deletions when compared to the amino acid sequence of SEQ ID No. 41 or SEQ ID No. 42, more preferably functional homologs which have NO more than 300, NO more than 250, NO more than 200, NO more than 150, NO more than 100, NO more than 75, NO more than 50, NO more than 40, NO more than 30, NO more than 20, NO more than 10 or NO more than 5 amino acid mutations, substitutions, insertions and/or deletions when compared to the amino acid sequence of SEQ ID No. 41 or SEQ ID No. 42.
The nucleic acid sequence (e.g., gene) encoding a protein having acetate kinase (ACK) activity may be suitably incorporated into the genome of a recombinant yeast cell.
Acetylaldehyde dehydrogenase
As indicated above, the recombinant yeast cell may advantageously comprise and functionally express a preferably heterologous nucleic acid sequence encoding a protein (EC 1.2.1.10) comprising nad+ dependent acetylaldehyde dehydrogenase activity.
More preferably, the recombinant yeast cell functionally expresses, if an acetylating acetaldehyde dehydrogenase is present:
-a preferably heterologous nucleic acid sequence encoding a protein comprising nad+ -dependent acetylating acetaldehyde dehydrogenase activity (EC 1.2.1.10); and
Coding with NAD + Protein dependent alcohol dehydrogenase activity (EC 1).1.1.1 or EC 1.1.1.2) suitably endogenous or heterologous nucleic acid sequences; and
a suitably endogenous or heterologous nucleic acid sequence encoding a protein having acetyl-coa synthetase activity (EC 6.2.1.1).
The acetylating acetaldehyde dehydrogenase is an enzyme that catalyzes the conversion of acetyl-coa to acetaldehyde (EC 1.2.1.10). This conversion can be represented by the following equilibrium equation:
acetyl-CoA+NADH+H + <->acetaldehyde+NAD + +coenzyme A
The protein having an acetylating acetaldehyde dehydrogenase activity is also referred to herein as "acetylating acetaldehyde dehydrogenase protein", "acetylating acetaldehyde dehydrogenase (acetylating acetaldehyde dehydrogenase enzyme)", or simply "acetylating acetaldehyde dehydrogenase (acetylating acetaldehyde dehydrogenase)". Preferred acetylating acetaldehyde dehydrogenases and nucleic acid sequences encoding such acetylating acetaldehyde dehydrogenases are described in WO 2011/010923 and WO 2019/0635507 (incorporated herein by reference).
Encoding a polypeptide having NAD + The nucleic acid sequence of the protein dependent on the activity of the acetylating acetaldehyde dehydrogenase (EC 1.2.1.10) is preferably a heterologous nucleic acid sequence. Thus, the encoded NAD + The dependent acetylating acetaldehyde dehydrogenase may preferably be a heterologous NAD + A dependent acetylating acetaldehyde dehydrogenase.
Proteins having acetylating acetaldehyde dehydrogenase activity may be monofunctional or bifunctional.
Encoding NAD + The nucleic acid sequence of the dependent acetylating acetaldehyde dehydrogenase may in principle originate from any organism comprising a nucleic acid sequence encoding said dehydrogenase. Known acetylating acetaldehyde dehydrogenases that catalyze the NADH-dependent reduction of acetyl-CoA to acetaldehyde can be generally divided into the following three types of NAD+ -dependent acetylating acetaldehyde dehydrogenase functional homologs:
1) Bifunctional proteins that catalyze the reversible conversion of acetyl-coa to acetaldehyde, followed by the reversible conversion of acetaldehyde to ethanol. These types of proteins advantageously have both acetylating acetaldehyde dehydrogenase activity and alcohol dehydrogenase activity. This kind ofAn example of a type of protein is the AdhE protein in E.coli (GenBank accession number: NP-415757). AdhE appears to be an evolutionary product of gene fusion. NH of AdhE protein 2 The terminal region is highly homologous to the aldehyde NAD+ oxidoreductase, while the COOH terminal region is Fe 2+ Dependent ethanol NAD+ oxidoreductase family homology (see Membrillo-Hernandez et al, "Evolution of the adhE Gene Product of Escherichia coli from a Functional Reductase to a Dehydrogenase" [ "evolution of the adhE Gene product of E.coli from functional reductase to dehydrogenase" ](2000) J.biol.chem. [ journal of biochemistry ]]275, pages 33869-33875, incorporated herein by reference). Coli AdhE is subject to metal-catalyzed oxidation and is therefore sensitive to oxygen (see Tamarit et al, "Identification of the Major Oxidatively Damaged Proteins in Escherichia coli Cells Exposed to Oxidative Stress" [ "identification of major oxidative damage proteins in E.coli cells exposed to oxidative stress") "](1998) J.biol.chem. [ journal of biochemistry ]]273, pages 3027-3032, incorporated herein by reference).
2) Proteins that catalyze the reversible conversion of acetyl-CoA to acetaldehyde in a strictly or facultative anaerobic microorganism, but do not possess alcohol dehydrogenase activity. An example of this type of protein has been reported in Clostridium kluyveri (Clostridium kluyveri) (see Smith et al, "Purification, properties, and Kinetic Mechanism of Coenzyme A-Linked Aldehyde Dehydrogenase from Clostridium kluyveri" [ "Purification, nature and kinetic mechanism of coenzyme A-linked aldehyde dehydrogenase from Clostridium kluyveri" ] (1980) arch. Biochem. Biophys. [ Biochem. Biophysics, volume 203: pages 663-675, incorporated herein by reference). The acetylating acetaldehyde dehydrogenase (GenBank accession number: EDK 33116) has been annotated in the genome of Clostridium kluyveromyces DSM 555. A homologous protein AcdH (GenBank accession number: NP-784141) was identified in the genome of Lactobacillus plantarum. Another example of this type of protein is The Gene product described in Clostridium beijerinckii (Clostridium beijerinckii) NRRL B593 (see Toth et al, "The ald Gene, encoding a Coenzyme A-Acylating Aldehyde Dehydrogenase, distinguishes Clostridium beijerinckii and Two Other Solvent-Producing Clostridia from Clostridium acetobutylicum" [ "The ald Gene encoding a CoA-acylated aldehyde dehydrogenase distinguishes Clostridium beijerinckii from The other two solvolyte-producing Clostridium acetobutylicum" ], (1999), appl. Environ. Microbiol. [ applied and environmental microbiology ] volume 65:4973-4980, genBank accession number: AAD31841, incorporated herein by reference).
3) A protein that is part of a bifunctional aldolase-dehydrogenase complex involved in catabolism of 4-hydroxy-2-oxopentanoate. Such bifunctional enzymes catalyze the last two steps of the meta-cleavage pathway of catechol, an intermediate in the degradation of phenol, benzoate, naphthalene, biphenyl, and other aromatic compounds in many bacterial species (Powlowski and Shingler "Genetics and biochemistry of phenol degradation by Pseudomonas sp.cf600" [ "genetics and biochemistry of pseudomonas species CF600 degrading phenol" ] (1994) biodegration [ Biodegradation ] volume 5, pages 219-236, incorporated herein by reference). 4-hydroxy-2-oxopentanoic acid is first converted into pyruvic acid and acetaldehyde by 4-hydroxy-2-oxopentanoic acid aldolase, followed by conversion of acetaldehyde into acetyl-coa by acetylacetaldehyde dehydrogenase. An example of this type of acetylating acetaldehyde dehydrogenase is the DmpF protein in Pseudomonas (Pseudomonas) species CF600 (GenBank accession number: CAA 43226) (Shingler et al, "Nucleotide Sequence and Functional Analysis of the Complete Phenol/3,4-Dimethylphenol Catabolic Pathway of Pseudomonas sp.Strain CF600" [ "nucleotide sequence and functional analysis of the phenol/3,4-dimethylphenol catabolic pathway of Pseudomonas species strain CF600" ] (1992), J.Bacteriol. [ journal of bacteriology ], volume 174, pages 711-724, incorporated herein by reference). The E.coli MphF protein is homologous to the DmpF protein in Pseudomonas species CF600 (Ferrandez et al, "Genetic Characterization and Expression in Heterologous Hosts of the 3- (3-Hydroxyphenyl) Propionate Catabolic Pathway of Escherichia coli K-12" [ "genetic characterization of the 3- (3-Hydroxyphenyl) propionic acid catabolic pathway of E.coli K-12" ] (1997) J.Bacteriol. [ J.Bacteriol. ] 179:p.2573-2581, genBank accession number: NP-414885, incorporated herein by reference).
In a preferred embodiment, the protein having acetylating acetaldehyde dehydrogenase activity is bifunctional and comprises NAD + Dependent acetylating acetaldehyde dehydrogenase (EC 1.2.1.10) activity and NAD + Both dependent alcohol dehydrogenase (EC 1.1.1.1 or EC 1.1.1.2) activities.
Suitable nucleic acid sequences can be found in particular in organisms selected from the group: escherichia, in particular E.coli; mycobacterium (Mycobacterium), in particular Mycobacterium marinum (Mycobacterium marinum), mycobacterium ulcerans (Mycobacterium ulcerans), mycobacterium tuberculosis (Mycobacterium tuberculosis); thermophilic carbon oxide bacteria (Carboxydothermus), in particular thermophilic carbon hydroxide bacteria (Carboxydothermus hydrogenoformans); entamoeba, especially Entamoeba histolytica (Entamoeba histolytica); shigella (Shigella), in particular Shigella sonnei; burkholderia (Burkholderia), particularly Burkholderia-like gangrene (Burkholderia pseudo mallei), klebsiella (Klebsiella), particularly Klebsiella pneumoniae; azotobacter (Azotobacter), in particular Azotobacter brown (Azotobacter vinelandii); a Azoarcus (Azoarcus) species; cupriacus (cupravidus), in particular cupriacus taiwanensis (Cupriavidus taiwanensis); pseudomonas, in particular Pseudomonas species CF600; pelomacuum, in particular Pelotomaculum thermopropionicum. Preferably, NAD is encoded + The nucleic acid sequence of the dependent acetylating acetaldehyde dehydrogenase is derived from the genus Escherichia, more preferably from Escherichia coli.
Particularly suitable are the mhpF genes from E.coli or functional homologs thereof. The gene is described in the following documents: ferrandez et al, "Genetic Characterization and Expression in Heterologous Hosts of the 3- (3-Hydroxyphenyl) Propionate Catabolic Pathway of Escherichia coli K-12" [ "genetic characterization of the 3- (3-Hydroxyphenyl) propionic acid catabolic pathway of E.coli K-12" ] (1997) J.Bacteriol. [ J.bacteriology ] 179:2573-2581. Good results have been obtained with Saccharomyces cerevisiae, in which the mhpF gene from E.coli has been incorporated. In a further advantageous embodiment, the nucleic acid sequence encoding (acetylated) acetaldehyde dehydrogenase is derived from Pseudomonas, in particular dmpF, for example from Pseudomonas species CF 600.
Furthermore, the acetylating acetaldehyde dehydrogenase (or the nucleic acid sequence encoding such an activity) may be, for example, selected from the group of: coli adhE, amoebaadh 2 within the tissue, staphylococcus aureus adhE, ruminococcus (Piromyces) species E2 adhE, clostridium krypton EDK33116, lactobacillus plantarum acdH, escherichia coli eutE, listeria innocuous acdH, and pseudomonas putida (Pseudomonas putida) YP 001268189.
Preferably, have NAD + The protein dependent on the activity of the acetylating acetaldehyde dehydrogenase comprises or consists of:
-the amino acid sequence of SEQ ID NO. 43, SEQ ID NO. 44, SEQ ID NO. 45, SEQ ID NO. 46, SEQ ID NO. 47 or SEQ ID NO. 48; or alternatively
-a functional homolog of SEQ ID No. 43, SEQ ID No. 44, SEQ ID No. 45, SEQ ID No. 46, SEQ ID No. 47 or SEQ ID No. 48 having at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% sequence identity to the amino acid sequence of SEQ ID No. 43, SEQ ID No. 44, SEQ ID No. 45, SEQ ID No. 46, SEQ ID No. 47 or SEQ ID No. 48; or alternatively
-a functional homolog of SEQ ID No. 43, SEQ ID No. 44, SEQ ID No. 45, SEQ ID No. 46, SEQ ID No. 47 or SEQ ID No. 48 having one or more mutations, substitutions, insertions and/or deletions when compared to the amino acid sequence of SEQ ID No. 43, SEQ ID No. 44, SEQ ID No. 45, SEQ ID No. 46, SEQ ID No. 47 or SEQ ID No. 48, more preferably NO more than 300, NO more than 250, NO more than 200, NO more than 150, NO more than 100, NO more than 75, NO more than 50, NO more than 40, NO more than 30, NO more than 20, NO more than 10 or NO more than 5 amino acid mutations, substitutions, insertions and/or deletions when compared to the amino acid sequence of SEQ ID No. 43, SEQ ID No. 44, SEQ ID No. 45, SEQ ID No. 46, SEQ ID No. 47 or SEQ ID No. 48.
Most preferably, the acetylating acetaldehyde dehydrogenase protein is a bifunctional protein having both acetylating acetaldehyde dehydrogenase activity and alcohol dehydrogenase activity.
The nucleic acid sequence (e.g., gene) encoding a protein having acetylating acetaldehyde dehydrogenase activity may be suitably incorporated into the genome of a recombinant yeast cell.
Examples of suitable enzymes for BLAST of the listed enzymes are further shown in tables 6 (a) to 6 (e) below, giving suitable alternative alcohol/acetaldehyde dehydrogenases.
Table 6 (a)BLAST query-adhE from E.coli
Table 6 (b)BLAST query-acdH from Lactobacillus plantarum
Watch 6 (c)BLAST query-eutE from E.coli
Watch 6 (d)BLAST query-Lin 1129 from Listeria innocens
Watch 6 (e)BLAST query-adhE from Staphylococcus aureus
Acetyl-coa synthetase
If the recombinant yeast cell functionally expresses a protein having acetylating acetaldehyde dehydrogenase activity, then preferably the recombinant yeast cell further functionally expresses:
coding with NAD + Nucleic acid sequences of proteins which depend on alcohol dehydrogenase activity (EC 1.1.1.1 or EC 1.1.1.2); and/or
Nucleic acid sequences encoding proteins with acetyl-CoA synthetase activity (EC 6.2.1.1).
Proteins having acetyl-coa synthetase activity may also be referred to herein as "acetyl-coa synthetase proteins", "acetyl-coa synthetases (acetyl-Coenzyme A synthetase enzyme)", or simply "acetyl-coa synthetases (acetyl-Coenzyme A synthetase)", or even "acetyl-coa synthetases (acetyl CoA synthetase)". This protein is further abbreviated herein as "ACS".
acetyl-CoA synthase (also known as acetate-CoA ligase or acetyl-activator) catalyzes the formation of acetyl-CoA from acetate, coA (coenzee a/CoA) and ATP as follows:
atp+acetate+coa=amp+diphosphate+acetyl coa
It will be appreciated that the recombinant yeast cell may naturally contain an endogenous gene encoding an acetyl-coa synthetase protein. Alternatively or in addition thereto, the recombinant yeast cell may comprise a heterologous nucleic acid sequence encoding a protein having acetyl-coa synthetase activity (EC 6.2.1.1).
For example, a recombinant yeast cell according to the invention may comprise an acetyl-CoA synthetase that may be present in wild-type cells, as is the case, for example, with Saccharomyces cerevisiae, which contains two acetyl-CoA synthetase isozymes encoded by ACS1 (amino acid sequence shown as SEQ ID NO: 49) and ACS2 (amino acid sequence shown as SEQ ID NO: 50) genes (van den Berg et al (1996) J.biol. Chem. [ J. Biochemistry ] 271:pages 28953-28959), or the host cell may be provided with one or more heterologous genes encoding such an activity, for example, ACS1 and/or ACS2 genes or functional homologues thereof of Saccharomyces cerevisiae may be incorporated into cells lacking acetyl-CoA synthetase isozymes activity.
Preferably, have NAD + The protein dependent on acetyl-coa synthetase activity comprises or consists of:
-the amino acid sequence of SEQ ID NO. 49 or SEQ ID NO. 50; or alternatively
-a functional homolog of SEQ ID No. 49 or SEQ ID No. 50 having at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% sequence identity with the amino acid sequence of SEQ ID No. 49 or SEQ ID No. 50; or alternatively
A functional homolog of SEQ ID NO. 49 or SEQ ID NO. 50 having one or more mutations, substitutions, insertions and/or deletions when compared to the amino acid sequence of SEQ ID NO. 49 or SEQ ID NO. 50, more preferably NO more than 300, NO more than 250, NO more than 200, NO more than 150, NO more than 100, NO more than 75, NO more than 50, NO more than 40, NO more than 30, NO more than 20, NO more than 10 or NO more than 5 amino acid mutations, substitutions, insertions and/or deletions when compared to the amino acid sequence of SEQ ID NO. 49 or SEQ ID NO. 50.
Preferably, the recombinant yeast cell is one in which the endogenous or heterologous acetyl-coa synthetase protein is overexpressed, most preferably by using a suitable promoter as described, for example, in WO 2011/010923 (incorporated herein by reference). Any heterologous nucleic acid sequence (e.g., a gene) encoding a protein having acetyl-coa synthetase activity may be suitably incorporated into the genome of a recombinant yeast cell.
Examples of suitable proteins having acetyl-coa synthetase activity are listed in table 7. At the top of table 7, ACS2 used in the examples and BLAST is mentioned.
Table 7:BLAST query-ACS 2 from Saccharomyces cerevisiae
Alcohol dehydrogenase
If the recombinant yeast cell functionally expresses a protein having acetylating acetaldehyde dehydrogenase activity, then preferably the recombinant yeast cell further functionally expresses:
coding with NAD + Nucleic acid sequences of proteins which depend on alcohol dehydrogenase activity (EC 1.1.1.1 or EC 1.1.1.2); and/or
Nucleic acid sequences encoding proteins with acetyl-CoA synthetase activity (EC 6.2.1.1).
The protein having alcohol dehydrogenase activity is also referred to herein as "alcohol dehydrogenase protein", "alcohol dehydrogenase (alcohol dehydrogenase enzyme)" or simply "alcohol dehydrogenase (alcohol dehydrogenase)". This protein is further abbreviated herein as "ADH".
Alcohol dehydrogenase catalyzes the conversion of acetaldehyde to ethanol.
It will be appreciated that the recombinant yeast cell may naturally comprise an endogenous nucleic acid sequence encoding an alcohol dehydrogenase protein. Alternatively or additionally, the recombinant yeast cell may comprise a heterologous nucleic acid sequence encoding a protein having alcohol dehydrogenase activity
For example, the recombinant yeast cell may naturally comprise a gene encoding an alcohol dehydrogenase as in the case of Saccharomyces cerevisiae (the amino acid sequences of natural Saccharomyces cerevisiae alcohol dehydrogenases ADH1, ADH2, ADH3, ADH4 and ADH5 are shown as SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54 and SEQ ID NO:55, respectively), see Lutstorf and Megnet, "Multiple Forms of Alcohol Dehydrogenase in Saccharomyces Cerevisiae" [ "various forms of alcohol dehydrogenases in Saccharomyces cerevisiae" ] (1968), arch. Biochem. Biophys. [ biochemical and biophysical archives ], vol.126, pp.933-944, respectively; or Ciriacy, "Genetics of Alcohol Dehydrogenase in Saccharomyces cerevisiae I.isolation and genetic analysis of adh mutants" [ "isolation and genetic analysis of an alcohol dehydrogenase genetic I.adh mutant in Saccharomyces cerevisiae" ] (1975), mutat.Res. [ mutation research ]29, pages 315-326, incorporated herein by reference).
Preferably, however, the recombinant yeast cell comprises an alcohol dehydrogenase activity within a suitably heterologous bifunctional enzyme having both an acetylating acetaldehyde dehydrogenase activity as well as an alcohol dehydrogenase activity, as described above. That is, most preferably, the alcohol dehydrogenase protein is a bifunctional protein having both an acetylating acetaldehyde dehydrogenase activity and an alcohol dehydrogenase activity. When the recombinant yeast cell comprises a heterologous nucleic acid sequence encoding a bifunctional protein having both an acetylating acetaldehyde dehydrogenase activity and an alcohol dehydrogenase activity, any native nucleic acid sequence encoding any native protein encoding an alcohol dehydrogenase activity may or may not be disrupted and/or deleted.
Thus, the recombinant yeast cell may advantageously be a recombinant yeast cell functionally expressing:
-one or more heterologous nucleic acid sequences encoding a bifunctional protein having the following activities: NAD (NAD) + Dependent acetylating acetaldehyde dehydrogenase activity (EC 1.2.1.10); and NAD + A dependent alcohol dehydrogenase activity (EC 1.1.1.1 or EC 1.1.1.2); and
one or more native or heterologous nucleic acid sequences encoding a protein having acetyl-CoA synthetase activity (EC 6.2.1.1),
Wherein optionally the code has NAD + One or more native nucleic acid sequences of the protein (EC 1.1.1.1 or EC 1.1.1.2) which are dependent on alcohol dehydrogenase activity are disrupted or deleted.
Alternatively, the recombinant yeast cell may advantageously be a recombinant yeast cell functionally expressing:
coding with NAD + One or more native or heterologous nucleic acid sequences of a monofunctional protein (EC 1.2.1.10) that is dependent on the activity of an acetylating acetaldehyde dehydrogenase; and
-one or more native or heterologous nucleic acid sequences encoding a protein having acetyl-coa synthetase activity (EC 6.2.1.1); and
coding with NAD + One or more native or heterologous nucleic acid sequences of a protein dependent on alcohol dehydrogenase activity (EC 1.1.1.1 or EC 1.1.1.2).
Preferences for bifunctional proteins are provided above, and as listed for the acetylating acetaldehyde dehydrogenase protein. If the protein is not bifunctional, NAD + The dependent alcohol dehydrogenase protein preferably has NAD + A protein dependent on alcohol dehydrogenase activity comprising or consisting of:
-the amino acid sequence of SEQ ID NO. 51, SEQ ID NO. 52, SEQ ID NO. 53, SEQ ID NO. 54 or SEQ ID NO. 55; or alternatively
-a functional homolog of SEQ ID No. 51, SEQ ID No. 52, SEQ ID No. 53, SEQ ID No. 54 or SEQ ID No. 55 having at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% sequence identity to the amino acid sequence of SEQ ID No. 51, SEQ ID No. 52, SEQ ID No. 53, SEQ ID No. 54 or SEQ ID No. 55; or alternatively
-a functional homolog of SEQ ID No. 51, SEQ ID No. 52, SEQ ID No. 53, SEQ ID No. 54 or SEQ ID No. 55 having one or more mutations, substitutions, insertions and/or deletions when compared to the amino acid sequence of SEQ ID No. 51, SEQ ID No. 52, SEQ ID No. 53, SEQ ID No. 54 or SEQ ID No. 55, more preferably NO more than 300, NO more than 250, NO more than 200, NO more than 150, NO more than 100, NO more than 75, NO more than 50, NO more than 40, NO more than 30, NO more than 20, NO more than 10 or NO more than 5 amino acid mutations, substitutions, insertions and/or deletions when compared to the amino acid sequence of SEQ ID No. 51, SEQ ID No. 52, SEQ ID No. 53, SEQ ID No. 54 or SEQ ID No. 55.
Can be encoded with NAD + Any heterologous nucleic acid sequence (e.g., gene) of a protein that is dependent on alcohol dehydrogenase activity) Suitably incorporated into the genome of a recombinant yeast cell.
Deletion or disruption of glycerol 3-phosphate phosphohydrolase and/or glycerol 3-phosphate dehydrogenase
The recombinant yeast cell further may or may not comprise a deletion or disruption of one or more endogenous nucleotide sequences encoding a glycerol 3-phosphate phosphohydrolase gene and/or encoding a glycerol 3-phosphate dehydrogenase gene.
Preferably, the enzymatic activity required for NADH dependent glycerol synthesis in the yeast cells is reduced or deleted. The reduction or deletion of the enzymatic activity of glycerol 3-phosphate hydrolase and/or glycerol 3-phosphate dehydrogenase may be achieved by: one or more genes encoding NAD-dependent glycerol 3-phosphate dehydrogenase (GPD) and/or one or more genes encoding glycerophosphate phosphatase (GPP) are modified such that the expression of the enzyme is substantially lower than wild-type or such that the gene encodes a polypeptide with reduced activity. Such modifications may be made using commonly known biotechnology, and may in particular include one or more knockout mutations or site-directed mutagenesis of the promoter region or coding region of structural genes encoding GPD and/or GPP. Alternatively, yeast strains deficient in glycerol production may be obtained by random mutagenesis followed by selection of strains with reduced or absent GPD and/or GPP activity. The Saccharomyces cerevisiae GPD1, GPD2, GPP1 and GPP2 genes are shown in WO 2011010923 and are disclosed in SEQ ID NOS: 24-27 of this application.
Preferably, the recombinant yeast is a recombinant yeast further comprising a deletion or disruption of the glycerol-3-phosphate dehydrogenase (GPD) gene. One or more of the glycerophosphate phosphatase (GPP) genes may or may not be deleted or disrupted.
More preferably, the recombinant yeast is a recombinant yeast comprising a deletion or disruption of the glycerol-3-phosphate dehydrogenase 1 (GPD 1) gene. The glycerol-3-phosphate dehydrogenase 2 (GPD 2) gene may or may not be deleted or disrupted.
Most preferably, the recombinant yeast is a recombinant yeast comprising a deletion or disruption of the glycerol-3-phosphate dehydrogenase 1 (GPD 1) gene, while the glycerol-3-phosphate dehydrogenase 2 (GPD 2) gene remains active and/or intact. Thus, preferably only one of the Saccharomyces cerevisiae GPD1, GPD2, GPP1 and GPP2 genes is disrupted and deleted, and most preferably only GPD1 selected from the group consisting of GPD1, GPD2, GPP1 and GPP2 genes is disrupted or deleted.
Without wishing to be bound by any type of theory, it is believed that recombinant yeasts according to the present invention, wherein the GPD1 gene is not deleted or disrupted, may be advantageous when applied in a fermentation process in which glucose is preferably equal to or greater than 80g/L, more preferably equal to or greater than 90g/L, even more preferably equal to or greater than 100g/L, still more preferably equal to or greater than 110g/L, yet even more preferably equal to or greater than 120g/L, equal to or greater than 130g/L, equal to or greater than 140g/L, equal to or greater than 150g/L, equal to or greater than 160g/L, equal to or greater than 170g/L, or equal to or greater than 180g/L at the beginning of fermentation or during fermentation.
Preferably, at least one gene encoding GPD and/or at least one gene encoding GPP is deleted entirely or at least a part of a gene encoding a part of an enzyme essential for its activity is deleted. Good results can be obtained with s.cerevisiae cells in which the open reading frames of the GPD1 gene and/or the GPD2 gene have been inactivated. Inactivation of a structural gene (target gene) can be accomplished by one of skill in the art by synthetically synthesizing or otherwise constructing DNA fragments consisting of selectable marker genes flanked by DNA sequences that are identical to sequences flanking the genomic region of the host cell to be deleted. Suitably, good results are obtained by inactivating the GPD1 and GPD2 genes in Saccharomyces cerevisiae by integration of the marker genes kanMX and hphMX 4. Subsequently, the DNA fragment is transformed into a host cell. For example by diagnostic polymerase chain reaction or DNA hybridization, it is checked whether transformed cells expressing the dominant marker gene correctly replace the region designed to be deleted.
Thus, in the recombinant yeast cells of the invention, glycerol 3-phosphate phosphatase activity in the cells and/or glycerol 3-phosphate dehydrogenase activity in the cells may be advantageously reduced.
Glucoamylase enzyme
Preferably, the recombinant yeast cell further functionally expresses a nucleic acid sequence encoding a glucoamylase (EC 3.2.1.20 or 3.2.1.3).
A protein having glucoamylase activity is also referred to herein as "glucoamylase (glucoamylase enzyme)", "glucoamylase protein", or simply "glucoamylase (glucoamylase)". Glucoamylases have been abbreviated herein as "GA".
Glucoamylases (also known as amyloglucosidase, alpha-glucosidase, glucan 1, 4-alpha-glucosidase, maltase glucoamylase and maltase-glucoamylase) catalyze the hydrolysis of at least the terminal 1, 4-linked alpha-D-glucose residues from the non-reducing end of the amylose chain to release free D-glucose. Glucoamylases may be further defined by their amino acid sequence. Likewise, a glucoamylase may be further defined by a nucleotide sequence encoding a glucoamylase. As explained in detail below under the definition above, a certain glucoamylase defined by a nucleotide sequence encoding an enzyme includes (unless otherwise limited) a nucleotide sequence that hybridizes to such a nucleotide sequence encoding a glucoamylase.
Preferably, the protein having glucoamylase activity comprises or consists of:
-the amino acid sequence of SEQ ID NO. 56, SEQ ID NO. 57 or SEQ ID NO. 58; or alternatively
-a functional homolog of SEQ ID No. 56, SEQ ID No. 57 or SEQ ID No. 58 having at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% sequence identity to the amino acid sequence of SEQ ID No. 56, SEQ ID No. 57 or SEQ ID No. 58; or alternatively
A functional homolog of SEQ ID NO. 56, SEQ ID NO. 57 or SEQ ID NO. 58 having one or more mutations, substitutions, insertions and/or deletions when compared to the amino acid sequence of SEQ ID NO. 56, SEQ ID NO. 57 or SEQ ID NO. 58, more preferably NO more than 300, NO more than 250, NO more than 200, NO more than 150, NO more than 100, NO more than 75, NO more than 50, NO more than 40, NO more than 30, NO more than 20, NO more than 10 or NO more than 5 amino acid mutations, substitutions, insertions and/or deletions when compared to the amino acid sequence of SEQ ID NO. 56, SEQ ID NO. 57 or SEQ ID NO. 58.
The polypeptide of SEQ ID NO. 56 encodes a "mature glucoamylase" which refers to an enzyme in its final form after translation and any post-translational modifications such as N-terminal treatment, C-terminal truncation, glycosylation, phosphorylation, etc.
In one embodiment, the nucleotide sequence encodes a polypeptide having the amino acid sequence of SEQ ID NO. 57 or a variant thereof having at least 50%, preferably at least 60%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or 99% amino acid sequence identity with the amino acid sequence of SEQ ID NO. 57. Amino acids 1-17 of SEQ ID NO. 57 may encode a natural signal sequence.
In another embodiment, the nucleotide sequence that allows expression of the glucoamylase encodes a polypeptide having the amino acid sequence of SEQ ID NO. 58 or a variant thereof having at least 50%, preferably at least 60%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or 99% amino acid sequence identity with the amino acid sequence of SEQ ID NO. 58. Amino acids 1-19 of SEQ ID NO. 58 may encode a signal sequence.
A signal sequence (also known as a signal peptide, targeting signal, localization sequence, transit peptide, leader sequence or leader peptide) may be present at the N-terminus of the polypeptide (here, glucoamylase) where it signals that the polypeptide is to be secreted (e.g., secreted into the extracellular and medium).
The nucleic acid sequence (e.g., gene) encoding a protein having glucoamylase activity may be suitably incorporated into the genome of a recombinant yeast cell.
Glycerol reuptake
The recombinant yeast cell may or may not further comprise one or more additional nucleic acid sequences as part of the glycerol reuptake pathway. That is, the recombinant yeast cell may or may not further comprise:
-one or more heterologous nucleic acid sequences encoding glycerol dehydrogenase; and/or
-a nucleic acid sequence encoding one or more homologs or heterogenies of dihydroxyacetone kinase; and/or
-one or more heterologous nucleic acid sequences encoding glycerol transporters.
Thus, in a preferred embodiment, the recombinant yeast cell is a recombinant yeast cell that functionally expresses:
-one or more heterologous nucleic acid sequences encoding ribulose-1, 5-phosphate carboxylase/oxygenase (EC 4.1.1.39; rubisco) and optionally one or more nucleic acid sequences encoding a chaperone of Rubisco;
-one or more heterologous nucleic acid sequences encoding a phosphoribulokinase (EC 2.7.1.19; prk);
one or more nucleic acid sequences encoding a transketolase (EC 2.2.1.1), wherein the transketolase is under the control of a promoter ("TKL promoter") whose TKL expression ratio is Anaerobic/aerobic 2 or higher;
-one or more heterologous nucleic acid sequences encoding glycerol dehydrogenase;
-a nucleic acid sequence encoding one or more homologs or heterogenies of dihydroxyacetone kinase; and
-optionally one or more heterologous nucleic acid sequences encoding glycerol transporters.
Without wishing to be bound by any type of theory, it is believed that recombinant yeast cells further comprising a combination of glycerol dehydrogenase, dihydroxyacetone kinase, and optionally glycerol transporter have improved overall performance in the form of higher ethanol yields.
In an alternative preferred embodiment, the recombinant yeast cell isDoes not takeFunctionally expressing recombinant yeast cells that:
-one or more heterologous nucleic acid sequences encoding glycerol dehydrogenase; and/or
-one or more heterologous nucleic acid sequences encoding dihydroxyacetone kinase; and/or
-one or more heterologous nucleic acid sequences encoding glycerol transporters.
Without wishing to be bound by any type of theory, it is believed that in the absence of one or more of these features of this glycerol re-uptake pathway, the resulting recombinant yeast cells have very low glucose and/or other sugar accumulation and improved robustness when applied in media containing large amounts of sugar. Therefore, when applied to the following fermentation process, Does not takeThe use of recombinant yeast cells comprising one or more of the following may be advantageous: heterologous and/or homologous glycerol dehydrogenases; heterologous and/or homologous dihydroxyacetone kinase; and/or heterologous and/or homologous glycerol transporters, preferably equal to or greater than 80g/L, more preferably equal to or greater than 90g/L, even more preferably equal to or greater than 100g/L, still more preferably equal to or greater than 110g/L, yet even more preferably equal to or greater than 120g/L, equal to or greater than 130g/L, equal to or greater than 140g/L, equal to or greater than 150g/L, equal to or greater than 160g/L, equal to or greater than 170g/L, or equal to or greater than 180g/L at the beginning of or during fermentation.
Thus, most preferably, the recombinant yeast is one that functionally expresses:
-one or more heterologous nucleic acid sequences encoding ribulose-1, 5-phosphate carboxylase/oxygenase (EC 4.1.1.39; rubisco) and optionally one or more nucleic acid sequences encoding a chaperone of Rubisco;
-one or more heterologous nucleic acid sequences encoding a phosphoribulokinase (EC 2.7.1.19; prk);
one or more nucleic acid sequences encoding a transketolase (EC 2.2.1.1), wherein the transketolase is under the control of a promoter ("TKL promoter") whose TKL expression ratio is Anaerobic/aerobic 2 or higher;
wherein the recombinant yeast cellDoes not takeFunctional expression
-one or more heterologous nucleic acid sequences encoding glycerol dehydrogenase; and/or
-one or more heterologous nucleic acid sequences encoding dihydroxyacetone kinase; and/or
-one or more heterologous nucleic acid sequences encoding glycerol transporters.
Glycerol dehydrogenase
As indicated above, the recombinant yeast cells may or may not be functionally expressed
-a nucleic acid sequence encoding a protein having glycerol dehydrogenase activity (e.c. 1.1.1.6);
-a nucleic acid sequence encoding a protein having dihydroxyacetone kinase activity (e.c. 2.7.1.28 or e.c. 2.7.1.29); and
-optionally a nucleic acid sequence encoding a protein having glycerol transporter activity.
Thus, the recombinant yeast cell may or may not functionally express one or more, preferably heterologous, nucleic acid sequences encoding glycerol dehydrogenase.
If glycerol dehydrogenase is present, the recombinant yeast cell can comprise NAD + Linked glycerol dehydrogenase (EC 1.1.1.6) and/or NADP + Linked glycerol dehydrogenase (EC 1.1.1.72). That is, the recombinant yeast cell may or may not contain a nucleic acid encoding a nucleic acid having NAD + Nucleic acid sequences and/or encoding proteins (EC 1.1.1.6) with glycerol dehydrogenase-dependent activity and/or NADP-dependent proteins + Nucleic acid sequence of a protein dependent on glycerol dehydrogenase activity (EC 1.1.1.72).
In one embodiment, the protein having glycerol dehydrogenase activity is preferably a protein having nad+ -dependent glycerol dehydrogenase activity (EC 1.1.1.6); and preferably, the recombinant yeast cell functionally expresses the encoding gene with NAD + Nucleic acid sequence of a protein dependent on glycerol dehydrogenase activity (EC 1.1.1.6). Such proteins may be derived from bacterial sources or, for example, from fungal sources. One example is gldA from E.coli (E.coli).
In an alternative or additional embodiment, NADP can be present + Dependent glycerol dehydrogenase (EC 1.1.1.72).
NAD, if glycerol dehydrogenase is present + Linked glycerol dehydrogenases are preferred.
The protein having glycerol dehydrogenase activity is also referred to herein as "glycerol dehydrogenase protein", "glycerol dehydrogenase (glycerol dehydrogenase enzyme)" or simply as "glycerol dehydrogenase (glycerol dehydrogenase)". Similarly, proteins having nad+ dependent glycerol dehydrogenase activity are also referred to herein as "nad+ dependent glycerol dehydrogenase proteins", "nad+ dependent glycerol dehydrogenases (nad+ dependent glycerol dehydrogenase enzyme)", or simply "nad+ dependent glycerol dehydrogenases (nad+ dependent glycerol dehydrogenase)". Glycerol dehydrogenase is abbreviated GLD.
Preferred glycerol dehydrogenases and nucleic acid sequences encoding such glycerol dehydrogenases are described in WO 2015028582 (incorporated herein by reference).
Nad+ dependent glycerol dehydrogenase (EC 1.1.1.6) is an enzyme that catalyzes the following chemical reaction:
thus, the two substrates of the enzyme are glycerol and NAD + The three products are glycerone, NADH and H + . Glycerone and dihydroxyacetone are synonymous herein.
Glycerol dehydrogenases belong to the family of oxidoreductases, in particular in NAD + Or NADP + Oxidoreductases act as acceptors on CH-OH groups of the donor. The systematic name of this enzyme is glycerol: NAD + 2-oxidoreductase. Other names commonly used include glycerol (dehydrogenase) and NAD + Linked glycerol dehydrogenase. The enzyme is involved in glycerolipid metabolism. The glycerol dehydrogenase protein may be further defined by its amino acid sequence. Likewise, the glycerol dehydrogenase protein may be further defined by a nucleotide sequence encoding the glycerol dehydrogenase protein. As explained in detail under the definition above, a certain glycerol dehydrogenase protein defined by the nucleotide sequence encoding the enzyme includes (unless otherwise specifiedRestriction) to a nucleotide sequence that hybridizes with such a nucleotide sequence encoding a glycerol dehydrogenase protein.
The nucleic acid sequence encoding a protein having glycerol dehydrogenase activity may be a heterologous nucleic acid sequence. The protein having glycerol dehydrogenase activity may be a heterologous protein having NAD+ -dependent glycerol dehydrogenase activity.
If the recombinant yeast cell comprises one or more heterologous nucleic acid sequences encoding a glycerol dehydrogenase, the recombinant yeast cell preferably further comprises a suitable cofactor to enhance the activity of the glycerol dehydrogenase. For example, recombinant yeast cells can comprise zinc, zinc ions, or zinc salts and/or one or more pathways that include these in the cell.
Suitable examples of heterologous proteins having glycerol dehydrogenase activity include glycerol dehydrogenase proteins of klebsiella pneumoniae, enterococcus aerogenes, yersinia arvensis and escherichia coli, respectively. The amino acid sequences of such proteins have been shown by SEQ ID NO 59, SEQ ID NO 60, SEQ ID NO 61 and SEQ ID NO 62, respectively.
Thus, the recombinant yeast cell may or may not comprise one or more suitably heterologous glycerol dehydrogenase proteins having the amino acid sequences of SEQ ID NO 59, SEQ ID NO 60, SEQ ID NO 61 and/or SEQ ID NO 62; and/or functional homologs thereof comprising an amino acid sequence that has at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% sequence identity to the amino acid sequence of SEQ ID NO. 59, SEQ ID NO. 60, SEQ ID NO. 61 and/or SEQ ID NO. 62; and/or functional homologues thereof comprising an amino acid sequence having one or more mutations, substitutions, insertions and/or deletions compared to the amino acid sequence of SEQ ID NO. 59, SEQ ID NO. 60, SEQ ID NO. 61 and/or SEQ ID NO. 62, wherein more preferably the amino acid sequence of such functional homologues has NO more than 300, NO more than 250, NO more than 200, NO more than 150, NO more than 100, NO more than 75, NO more than 50, NO more than 40, NO more than 30, NO more than 20, NO more than 10 or NO more than 5 amino acid mutations, substitutions, insertions and/or deletions compared to the amino acid sequence of SEQ ID NO. 59, SEQ ID NO. 60, SEQ ID NO. 61 and/or SEQ ID NO. 62.
A preferred glycerol dehydrogenase protein is one encoded by the gldA gene from E.coli. SEQ ID NO. 62 shows the amino acid sequence of this preferred NAD+ -dependent glycerol dehydrogenase protein encoded by the gldA gene from E.coli. The nucleic acid sequence of the gldA gene of E.coli is shown by SEQ ID NO. 63.
If the recombinant yeast cell comprises one or more heterologous nucleic acid sequences encoding glycerol dehydrogenase, the recombinant yeast cell therefore most preferably comprises a heterologous nucleotide sequence derived from E.coli encoding a protein having NAD+ -dependent glycerol dehydrogenase activity (E.C.1.1.1.6), optionally codon optimized for the host cell, as exemplified by the nucleic acid sequence shown in SEQ ID NO: 63.
Thus, preferably, the nucleic acid sequence encoding a protein having glycerol dehydrogenase activity comprises or consists of:
-the nucleic acid sequence of SEQ ID NO. 63; or alternatively
-a functional homolog of SEQ ID No. 63 having at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% sequence identity to the nucleic acid sequence of SEQ ID No. 63; or alternatively
Functional homologs of SEQ ID NO. 63 which have one or more mutations, substitutions, insertions and/or deletions when compared to the nucleic acid sequence of SEQ ID NO. 63, more preferably functional homologs which have not more than 300, not more than 250, not more than 200, not more than 150, not more than 100, not more than 75, not more than 50, not more than 40, not more than 30, not more than 20, not more than 10 or not more than 5 nucleic acid mutations, substitutions, insertions and/or deletions when compared to the nucleic acid sequence of SEQ ID NO. 63.
If the recombinant yeast cell comprises one or more heterologous nucleic acid sequences encoding a glycerol dehydrogenase, the recombinant yeast cell therefore most preferably comprises one or more nucleotide sequences encoding a glycerol dehydrogenase (e.c. 1.1.1.6) derived from e.coli, optionally codon optimized for the host cell. Such a nucleic acid sequence (e.g. a gene) encoding a glycerol dehydrogenase protein may be suitably incorporated into the genome of a recombinant yeast cell, for example as described in the examples of WO 2015/028583 (incorporated herein by reference).
Other examples of suitable glycerol dehydrogenases are listed in tables 8 (a) to 8 (d). At the top of each table, gldA is mentioned and BLAST.
Table 8 (a):BLAST query-gldA from E.coli
Table 8 (b):BLAST query-gldA from Klebsiella pneumoniae
Table 8 (c):BLAST query-gldA from enterococcus aerogenes
Table 8 (d):BLAST query-gldA from Yersinia arvensis
Dihydroxyacetone kinase
As indicated above, the recombinant yeast cells may or may not be functionally expressed
-a nucleic acid sequence encoding a protein having glycerol dehydrogenase activity (e.c. 1.1.1.6);
-a nucleic acid sequence encoding a protein having dihydroxyacetone kinase activity (e.c. 2.7.1.28 or e.c. 2.7.1.29); and
-optionally a nucleic acid sequence encoding a protein having glycerol transporter activity.
That is, the recombinant yeast cell may or may not functionally express one or more homologous or heterologous nucleic acid sequences encoding dihydroxyacetone kinase (e.c. 2.7.1.28 or e.c. 2.7.1.29).
Proteins having dihydroxyacetone kinase activity are also referred to herein as "dihydroxyacetone kinase proteins", "dihydroxyacetone kinase (dihydroxyacetone kinase enzyme)" or simply as "dihydroxyacetone kinase (dihydroxyacetone kinase)". Dihydroxyacetone kinase is abbreviated herein as DAK.
Preferred dihydroxyacetone kinases and nucleic acid sequences encoding such dihydroxyacetone kinases are as described in WO 2015028582 (incorporated herein by reference).
Proteins having dihydroxykinase activity may suitably belong to the enzyme classes e.c.2.7.1.28 and/or e.c. 2.7.1.29. Thus, the recombinant yeast cell suitably functionally expresses a nucleic acid sequence encoding a protein having dihydroxyacetone kinase activity (e.c. 2.7.1.28 and/or e.c. 2.7.1.29).
Dihydroxyacetone kinase is herein preferably understood to be an enzyme (EC 2.7.1.29) that catalyzes the following chemical reaction:
ATP+glycerone ADP+glycerophosphate
And/or enzymes (EC 2.7.1.28) that catalyze the following chemical reactions:
ATP+D-glyceraldehyde ADP+D-glyceraldehyde 3-phosphate.
Other names commonly used for dihydroxyacetone kinase include glycerone kinase, ATP, glycerone phosphotransferase and (phosphorylated) acetol kinase. It is further understood that glycerone and dihydroxyacetone are the same molecule. Dihydroxyacetone kinase proteins may be further defined by their amino acid sequence. Likewise, dihydroxyacetone kinase proteins may be further defined by a nucleotide sequence encoding dihydroxyacetone kinase protein. As explained in detail below under the definition above, a certain dihydroxyacetone kinase protein defined by a nucleotide sequence encoding an enzyme includes (unless otherwise limited) a nucleotide sequence that hybridizes to such a nucleotide sequence encoding dihydroxyacetone kinase protein.
The recombinant yeast cell, if present, preferably functionally expresses a nucleic acid sequence encoding a native protein having dihydroxyacetone kinase activity. More preferably, the nucleic acid sequence encoding a protein having dihydroxyacetone kinase activity is a native nucleic acid sequence.
The yeast contains two natural isoenzymes of dihydroxyacetone kinase (DAK 1 and DAK 2). According to the invention, these natural dihydroxyacetone kinases are preferred. Preferably, the host cell is a Saccharomyces cerevisiae cell; and preferably, the above natural dihydroxyacetone kinase is a natural dihydroxyacetone kinase of s.cerevisiae cells. The amino acid sequences of the natural dihydroxyacetone kinase proteins DAK1 and DAK2 of Saccharomyces cerevisiae have been shown by SEQ ID NO. 64 and SEQ ID NO. 65, respectively. The nucleic acid sequences encoding these native dihydroxyacetone kinase proteins DAK1 and DAK2 have been shown by SEQ ID NO 69 and SEQ ID NO 70, respectively.
Recombinant yeast cells can also functionally express a nucleic acid sequence encoding a protein having dihydroxyacetone kinase activity, wherein the nucleic acid sequence is a heterologous nucleic acid sequence, and correspondingly wherein the protein is a heterologous protein. In one embodiment, the recombinant yeast cell comprises a heterologous gene encoding dihydroxyacetone kinase. Suitable heterologous genes include genes encoding dihydroxyacetone kinases from the following: kluyveromyces kudrii, kluyveromyces bailii, kluyveromyces lactis, candida glabra, yarrowia lipolytica, klebsiella pneumoniae, enterobacter aerogenes, escherichia coli, yarrowia lipolytica, schizosaccharomyces pombe, botrytis cinerea (Botryotinia fuckeliana) and Exophiala dermatitis (Exophiala dermatitidis). Preferred heterologous proteins having dihydroxyacetone kinase activity include those derived from Klebsiella pneumoniae, yarrowia lipolytica and Schizosaccharomyces pombe, respectively, as shown by SEQ ID NO:66, SEQ ID NO:67 and SEQ ID NO:68, respectively.
The recombinant yeast cell may or may not contain a genetic modification that causes dihydroxyacetone kinase to be overexpressed (e.g., by overexpressing a nucleic acid sequence encoding a protein having dihydroxyacetone kinase activity). The nucleotide sequence encoding dihydroxyacetone kinase may be native or heterologous to the cell. Nucleic acid sequences which can be used for the overexpression of dihydroxyacetone kinase in the cells of the invention are, for example, the dihydroxyacetone kinase genes (DAK 1) and (DAK 2) from saccharomyces cerevisiae, as described, for example, in the following documents: molin et al, "Dihydroxy-acetone kinases in Saccharomyces cerevisiae are involved in detoxification of dihydroxyacetone" [ "dihydroxyacetone kinase in Saccharomyces cerevisiae is involved in detoxification of dihydroxyacetone" ] (2003), J.biol. Chem. [ journal of biochemistry ], volume 278:pages 1415-1423, incorporated herein by reference. In a preferred embodiment, the codon-optimized (see above) nucleotide sequence encoding a dihydroxyacetone kinase is overexpressed, such as, for example, the codon-optimized nucleotide sequence encoding a dihydroxyacetone kinase of SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ ID NO:67 or SEQ ID NO: 68.
As indicated above, the native nucleic acid sequences encoding dihydroxyacetone kinase proteins DAK1 and DAK2 in Saccharomyces cerevisiae have been shown by SEQ ID NO:69 and SEQ ID NO:70, respectively.
Preferably, the recombinant yeast cell comprises a genetic modification that increases the specific activity of any dihydroxyacetone kinase in the cell. For example, the recombinant yeast cell can comprise one or more native and/or heterologous nucleic acid sequences encoding one or more native and/or heterologous dihydroxyacetone kinase proteins that are overexpressed (such as DAK1 and/or DAK 2). Natural dihydroxyacetone kinases (such as DAK1 and/or DAK 2) may be overexpressed, for example, via one or more genetic modifications such that the gene encoding the dihydroxyacetone kinase is more copied than is present in non-genetically modified cells, and/or a non-natural promoter may be employed.
Preferably, the recombinant yeast cell is a recombinant yeast cell in which expression of the nucleic acid sequence encoding the protein having dihydroxyacetone kinase activity is under the control of a promoter. The promoter may, for example, be a promoter native to another gene in the host cell.
In order to overexpress a nucleotide sequence encoding dihydroxyacetone kinase, the nucleotide sequence (to be overexpressed) may be placed in an expression construct, wherein it is operably linked to suitable expression control regions/sequences to ensure overexpression of dihydroxyacetone kinase after transformation of the expression construct into a host cell of the invention (see above). Suitable promoters for (over) expression of nucleotide sequences encoding enzymes having dihydroxyacetone kinase activity include promoters which are preferably insensitive to inhibition by the decomposition metabolite (glucose), active under anaerobic conditions and/or preferably do not require xylose or arabinose for induction. Examples of such promoters are given above. The dihydroxyacetone kinase that is overexpressed is preferably overexpressed at least 1.1, 1.2, 1.5, 2, 5, 10 or 20-fold compared to a genetically identical strain except for the genetic modification that causes the overexpression. Preferably, dihydroxyacetone kinase is overexpressed at least 1.1, 1.2, 1.5, 2, 5, 10 or 20-fold under anaerobic conditions as compared to a genetically identical strain except for the genetic modification causing the overexpression. It will be appreciated that these levels of overexpression may be applicable to steady-state levels of enzyme activity (specific activity in a cell), steady-state levels of enzyme protein, and steady-state levels of transcripts encoding enzymes in a cell. Overexpression of the nucleotide sequence in the host cell results in a specific dihydroxyacetone kinase activity of at least 0.002, 0.005, 0.01, 0.02 or 0.05U min-1 (mg protein) -1, as determined in cell extracts of transformed host cells at 30 ℃, as described in examples of e.g. WO 2013/081456.
The most preferred dihydroxyacetone kinase protein is the dihydroxyacetone kinase protein encoded by the Dak1 gene from Saccharomyces cerevisiae. SEQ ID NO. 64 shows the amino acid sequence of a suitable dihydroxyacetone kinase protein encoded by the Dak1 gene from Saccharomyces cerevisiae. SEQ ID NO. 69 shows the nucleic acid sequence of the Dak1 gene itself.
If the recombinant yeast cell comprises one or more overexpressed nucleic acid sequences encoding dihydroxyacetone kinase, the recombinant yeast cell thus most preferably comprises one or more overexpressed nucleotide sequences encoding dihydroxyacetone kinase derived from Saccharomyces cerevisiae, as exemplified by the nucleic acid sequence shown in SEQ ID NO: 69.
Thus, preferably, the protein having dihydroxyacetone kinase activity comprises or consists of:
-the amino acid sequence of SEQ ID NO. 64, SEQ ID NO. 65, SEQ ID NO. 66, SEQ ID NO. 67 or SEQ ID NO. 68; or alternatively
-a functional homolog of SEQ ID No. 64, SEQ ID No. 65, SEQ ID No. 66, SEQ ID No. 67 or SEQ ID No. 68 having at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% sequence identity to the amino acid sequence of SEQ ID No. 64, SEQ ID No. 65, SEQ ID No. 66, SEQ ID No. 67 or SEQ ID No. 68; or alternatively
-a functional homolog of SEQ ID No. 64, SEQ ID No. 65, SEQ ID No. 66, SEQ ID No. 67 or SEQ ID No. 68 having one or more mutations, substitutions, insertions and/or deletions when compared to the amino acid sequence of SEQ ID No. 64, SEQ ID No. 65, SEQ ID No. 66, SEQ ID No. 67 or SEQ ID No. 68, more preferably NO more than 300, NO more than 250, NO more than 200, NO more than 150, NO more than 100, NO more than 75, NO more than 50, NO more than 40, NO more than 30, NO more than 20, NO more than 10 or NO more than 5 amino acid mutations, substitutions, insertions and/or deletions when compared to the amino acid sequence of SEQ ID No. 64, SEQ ID No. 65, SEQ ID No. 66, SEQ ID No. 67 or SEQ ID No. 68.
Proteins having the amino acid sequence of SEQ ID NO. 64 and functional homologs thereof are most preferred.
Preferably, the nucleic acid sequence encoding a protein having dihydroxyacetone kinase activity comprises or consists of:
-the nucleic acid sequence of SEQ ID NO. 69 or SEQ ID NO. 70; or alternatively
-SEQ ID No. 69 or a functional homolog of SEQ ID No. 70 having at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% sequence identity to the nucleic acid sequence of SEQ ID No. 69 or SEQ ID No. 70; or alternatively
-a functional homolog of SEQ ID No. 69 or SEQ ID No. 70 having one or more mutations, substitutions, insertions and/or deletions when compared to the nucleic acid sequence of SEQ ID No. 69 or SEQ ID No. 70; more preferably NO more than 300, NO more than 250, NO more than 200, NO more than 150, NO more than 100, NO more than 75, NO more than 50, NO more than 40, NO more than 30, NO more than 20, NO more than 10 or NO more than 5 nucleic acid mutations, substitutions, insertions and/or deletions when compared to the nucleic acid sequence of SEQ ID NO. 69 or SEQ ID NO. 70.
The nucleic acid sequence (e.g., gene) encoding dihydroxyacetone kinase protein can be suitably incorporated into the genome of a recombinant yeast cell.
Examples of suitable dihydroxyacetone kinases are listed in tables 9 (a) to 9 (d). At the top of each table, the BLAST DAK used in the examples is mentioned.
Table 9 (a):BLAST query-DAK 1 from Saccharomyces cerevisiae
Table 9 (b):BLAST query-from Klebsiella pneumoniaedhaK
Table 9 (c):BLAST query-DAK 1 from yarrowia lipolytica
Table 9 (d):BLAST query-DAK 1 from Schizosaccharomyces pombe
Glycerol transporter
The recombinant yeast cell may optionally comprise (i.e., may or may not comprise) a nucleotide sequence encoding a glycerol transporter. Such glycerol transporters may allow any glycerol to be transported into the cells and converted to ethanol, which is externally available in the medium (e.g., from reflux in corn mash) or secreted after synthesis by internal cells.
If glycerol transporters are present, the recombinant yeast preferably comprises one or more nucleic acid sequences encoding a heterologous glycerol transporter represented by the amino acid sequence SEQ ID NO:71, SEQ ID NO:72 or a functional homologue thereof having at least 50%, preferably at least 60%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or 99% amino acid sequence identity with the amino acid sequence of SEQ ID NO:71 and/or SEQ ID NO: 72.
In one embodiment, the recombinant yeast may further comprise a deletion or disruption of one or more endogenous nucleotide sequences encoding a glycerol export protein (e.g., FPS 1).
Nitrate reductase
The recombinant yeast cell may also advantageously comprise (correspondingly functionally expressed) a nucleic acid sequence encoding an enzyme having NADH-dependent nitrate reductase activity and/or a nucleic acid sequence encoding an enzyme having NADH-dependent nitrite reductase activity. Details of expressing this alternative redox sink have been described in non-prepublished U.S. patent application US 63087642 filed on 5 months 10 in 2020 to the U.S. patent office, the contents of which are incorporated herein by reference.
Nitrate Reductase (NR) catalyzes nitrate (NO 3 - ) Reduction to Nitrite (NO) 2 - ). Nitrite reductase catalyzes the reduction of nitrite to ammonia (NH) 3 ). Nitrate reductase and/or nitrite reductase may be part of the so-called nitrogen assimilation pathway in certain cells. Cells comprising nitrate reductase activity and/or nitrite reductase activity include certain plant cells and bacterial cells and a few yeast cells. As indicated by Linder, the ability to assimilate inorganic nitrogen sources other than ammonia in budding yeast is considered rare. Few fungi that are naturally capable of assimilating nitrate or nitrite include Botrytis cinerea (Blastobotrys adeninivorans) (Mao Gong Aspergillus fumigatus), candida boidinii (Pichia pastoris), saccharomyces cerevisiae (Cyberlindnera jadinii) (Phaffiamycolaceae) and Hansenula polymorpha (Ogataea polymorpha) (Pichia) which are naturally capable of assimilating nitrate or nitrite.
Preferably, the recombinant yeast cell as described herein comprises at least one or more genes encoding an NADH-dependent nitrate reductase.
NADH-dependent nitrate reductase is understood herein to be a nitrate reductase which depends solely on NADH as cofactor or primarily on NADH as cofactor. Preferably, the catalytic efficiency (k) of NADH-dependent nitrate reductase on NADPH/NADP+ as cofactor cat /K m ) NADP+ And catalytic efficiency (k) for NADH/NAD+ as cofactor cat /K m ) NAD+ Is the ratio of (i.e., the catalytic efficiency ratio (k cat /K m ) NADP+ :(k cat /K m ) NAD+ ) Greater than 1:1, more preferably equal to or greater than 2:1, still more preferably equal to or greater than 5:1, even more preferably equal to or greater than 10:1, yet even more preferably equal to or greater than 20:1, even more preferably equal to or greater than 100:1, and most preferably equal to or greater than 1000:1. There is no upper limit, but for practical reasons the catalytic efficiency ratio (k cat /K m ) NADP+ :(k cat /K m ) NAD+ May be equal to or less than 1.000.000.000:1 (i.e., 1.10) 9 ). Most preferably, the NADH dependent nitrate reductase is dependent only on NADH/NAD+ as cofactor. That is, most preferably, NADH-dependent nitrate reductase absolutely requires NADH/NAD+ as a cofactor, not NADPH/NADP+ as a cofactor.
Preferably, the NADH-dependent nitrate reductase is an NADH-dependent nitrate reductase of enzyme class EC 1.7.1.1 (i.e., having EC number EC 1.7.1.1) or enzyme class EC.1.6.6.1 (i.e., having EC number 1.6.6.1). Suitably, the NADH-dependent nitrate reductase (also referred to as NADH-dependent nitrate oxidoreductase) is an enzyme that catalyzes at least the following chemical reactions:
nitrate+NADH+H + nitrite+NAD + +H 2 O
Suitable NADH-dependent nitrate reductases may include, for example, one or more NADH-dependent nitrate reductases obtained or derived from: the plant species may be selected from the group consisting of Amaranthus (Agrostemma githago), amaranthus viridis (Amaranthus hybridus), amaranthus tricolor (Amaranthus tricolor), brown fiber alga (Ankistrodesmus braunii), arabidopsis thaliana, aspergillus niger, aspergillus nidulans, auxenochlorella pyrenoidosa, rhizobium (Bradyrhizobium) species, rhizobium species 750, brassica Juncea, brassica oleracea (Brassica oleracea), wild tea tree (Camellia sinensis), candida boidinii, candida utilis (Candida), capsici (Capsicum frutescens), chenopodia (Chenopodium album), saccharomyces jikunnii, brassica juncea, wild tea tree, capsicum, chenopodium, chlamydomonas (Chlamydomonas reinhardtii), chlorella (Chlorella furcata), chlorella (Fusca), chlorella (Chlorella) species, chlorella (Cyprinus sp. Berlin), chlorella (Chlorella vulgaris), cyperus (Conticribra weissflogii), cucumis sativus (Cumiq. Sativus), cyperus (Cumiq. Cucumis) species, cumiq. Pastoris (35), leucor (35, leucopia (35), leucopia (Lavandula angusta (35), leucopia (Lavandula) and Cyperus (35), leucopia (Lavandula (35), leucopia (Laurensis) and Cyperus (35, leucor) are the plant species, leucon (Laurena) and Cyperus) that are the plant species Bai Yushan bean (Lupinus albus), mycobacterium tuberculosis, tobacco leaf (Nicotiana plumbaginifolia), tobacco (Nicotiana tabacum), hansenula angusta (Ogataea angusta), hansenula polymorpha, rice (Oryza sativa), antarctica (Phaeocystis Antarctica), reed (Phragmites australis), physcomitrella patens (Physcomitrella patens), purple pea (Pisum arvense), rhodobryum (Polytrichum commune), porphyra yezoensis (Pyropia yezoensis), radish (rapanus sativus), rhodobacter capsulatus (Rhodobacter capsulatus), rhodobacter capsulatus E1F1, castor (Ricinus commu s), small green algae (Selaginella kraussiana), sinapis alba (Skeletonema costatum), ribbonisatus (Skeletonema tropicum), tomato (Solanum lycopersicum), spinach, nuda salsa maritima (3795), tetrandra (Thalassia Testudinum), sarca, sarcandra (Thalassiosira Antarctica), sarcandra glabra (86), uvalsa (4572), or Uvalsa species (4572) of the genus Uvalis; and/or functional homologs of such NADH-dependent nitrate reductase enzymes comprising an amino acid sequence that has at least 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or at least 99% amino acid sequence identity to one or more of such NADH-dependent nitrate reductase enzymes described above; and/or functional homologues of such NADH-dependent nitrate reductase enzymes comprising an amino acid sequence having one or several substitutions, insertions and/or deletions compared to the amino acid sequence of one or more of the above-mentioned such NADH-dependent nitrate reductase enzymes, wherein preferably the amino acid sequence of any of the above functional homologues has no more than 300, 250, 200, 150, 100, 75, 50, 40, 30, 20, 10 or 5 amino acid substitutions, insertions and/or deletions compared to the above-mentioned such NADH-dependent nitrate reductase enzymes.
Preferred NADH-dependent nitrate reductases include NADH-dependent nitrate reductases as obtained or derived from: candida boidinii (nitrate reductase capable of utilizing both NADH and NADPH as electron donors), candida utilis (nitrate reductase capable of utilizing both NADH and NADPH as electron donors), fusarium oxysporum (as described by Fujii et al in the article entitled "Denitrification by the Fungus Fusarium oxysporum Involves NADH-Nitrate Reductase" [ "denitrification of the fungus fusarium oxysporum involves NADH-nitrate reductase" ], published in biosci. Biotechnol. Biochem [ bioscience, biotechnology and biochemistry ],72 (2), pages 412-420, 2008, incorporated herein by reference), spinach and maize.
Thus, preferred NADH-dependent nitrate reductases include: such NADH-dependent nitrate reductase comprising a polypeptide having the amino acid sequence of SEQ ID NO. 89 and/or SEQ ID NO. 90 as described herein; and/or functional homologs of SEQ ID NO. 89 and/or SEQ ID NO. 90 comprising an amino acid sequence having at least 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or at least 99% amino acid sequence identity to one or more of SEQ ID NO. 89 and/or SEQ ID NO. 90, respectively; and/or functional homologs of SEQ ID NO. 89 and/or SEQ ID NO. 90 comprising an amino acid sequence having one or several substitutions, insertions and/or deletions compared to the amino acid sequence of one or more of SEQ ID NO. 89 and/or SEQ ID NO. 90, respectively. Preferably, the amino acid sequence of any of the above functional homologs has NO more than 300, 250, 200, 150, 100, 75, 50, 40, 30, 20, 10 or 5 amino acid substitutions, insertions and/or deletions compared to SEQ ID NO 89 and/or SEQ ID NO 90, respectively.
Preferably, the recombinant yeast cell comprises an exogenous gene encoding an enzyme having NADH dependent nitrate reductase activity. More preferably, the recombinant yeast cell comprises an exogenous gene encoding an enzyme having NADH dependent nitrate reductase activity selected from the group consisting of NADH dependent nitrate reductases as obtained or derived from: the plant growth regulator comprises a wheat-grass, amaranthus viridis, amaranthus tricolor, brown fiber alga, arabidopsis thaliana, aspergillus niger, aspergillus nidulans, auxenochlorella pyrenoidosa, bradyrhizobium species 750, brassica juncea, wild tea tree, candida boidinii, candida utilis, capsicum, chenopodium album, jieskioshima, brassica juncea, brassica oleracea, wild tea tree, capsicum, chenopodium, chlamydomonas reinhardtii, blackish chlorella species, chlorella vulgaris, hawkthorn, cucumber, melon, zucchini, pumpkin species, dunaliella salina, round-grained algae, trichosanthes kirilowii, trichosanthes acum, fusarium oxysporum JCM 11502 rhizoma et radix Pogostemonis, semen glycines, upland cotton, gracilaria chili, gracilaria tenuifolia, sunflower, barley, lettuce, duckweed, lupin, mycobacterium tuberculosis, nicotiana tabacum, tobacco, hansenula polymorpha, rice, antarctic brown algae, phragmites communis, physcomitrella patens, semen Pisi Sativi, david moss, porphyra yezoensis, radix Raphani, rhodobacter capsulatus E1F1, ricinus, cuminum parviflora, semen Sinapis Albae, skeletonema costatum, trogopyrum tropicalis, fructus Lycopersici Esculenti, herba Spinaciae, cyperus nudus, thailand Teslae crassifolium, alternaria lunata, pseudostellaria pseudominis, semen Tritici Lepidii, leucopia species, ulva and semen Maydis; and functional homologs of such NADH-dependent nitrate reductases that comprise an amino acid sequence that has at least 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or at least 99% amino acid sequence identity to one or more of such NADH-dependent nitrate reductases described above; and functional homologues of such NADH dependent nitrate reductase enzymes comprising an amino acid sequence having one or more substitutions, insertions and/or deletions compared to the amino acid sequence of one or more of such NADH dependent nitrate reductase enzymes described above, wherein preferably the amino acid sequence of any of the above functional homologues has no more than 300, 250, 200, 150, 100, 75, 50, 40, 30, 20, 10 or 5 amino acid substitutions, insertions and/or deletions compared to such NADH dependent nitrate reductase enzymes described above.
Suitably, the recombinant yeast cell may comprise a nucleotide sequence encoding the amino acid sequence of any of SEQ ID NO. 89 and/or SEQ ID NO. 90 or having one or several substitutions, insertions and/or deletions compared to the amino acid sequence of any of SEQ ID NO. 89 and/or SEQ ID NO. 90. Preferably, the amino acid sequence has NO more than 300, 250, 200, 150, 100, 75, 50, 40, 30, 20, 10 or 5 amino acid substitutions, insertions and/or deletions compared to SEQ ID NO. 89 and/or SEQ ID NO. 90, respectively.
The recombinant yeast cell can combine one or more genes encoding the above NADH-dependent nitrate reductase with one or more genes encoding NADPH-dependent nitrite reductase. Preferably, however, the recombinant yeast cell combines one or more genes encoding the above NADH-dependent nitrate reductase with one or more genes encoding NADH-dependent nitrite reductase.
Examples of suitable NADH-dependent nitrate reductases, their UniProt database accession numbers (which can be found on the Uniprot website (www.uniprot.org/, 10 th.4 th 2020), their descriptions, organisms from which they can be derived and their amino acid sequence identity with SEQ ID NO:89 are listed in Table 10 below.
Table 10:examples of suitable NADH-dependent nitrate reductases, their UniProt database accession numbers (which can be found on the Uniprot website (www.uniprot.org/, 4 th 10 th 2020), their descriptions, organisms from which they can be derived and their amino groups with SEQ ID NO: 89)The acid sequence identity is listed in table 10 below.
Nitrite reductase
As indicated above, nitrite reductase catalyzes the reduction of nitrite to ammonia (NH 3 )。
Preferably, the recombinant yeast cell as described herein comprises at least one or more genes encoding an NADH-dependent nitrite reductase.
NADH-dependent nitrite reductase is understood herein to be a nitrite reductase which depends solely on NADH as cofactor or primarily on NADH as cofactor. Preferably, the catalytic efficiency (k) of NADH-dependent nitrite reductase on NADPH/NADP+ as cofactor cat /K m ) NADP+ And catalytic efficiency (k) for NADH/NAD+ as cofactor cat /K m ) NAD+ Is the ratio of (i.e., the catalytic efficiency ratio (k cat /K m ) NADP+ :(k cat /K m ) NAD+ ) Greater than 1:1, more preferably equal to or greater than 2:1, still more preferably equal to or greater than 5:1, even more preferably equal to or greater than 10:1, yet even more preferably equal to or greater than 20:1, even more preferably equal to or greater than 100:1, and most preferably equal to or greater than 1000:1. There is no upper limit, but for practical reasons the catalytic efficiency ratio (k cat /K m ) NADP+ :(k cat /K m ) NAD+ May be equal to or less than 1.000.000.000:1 (i.e., 1.10) 9 ). Most preferably, the NADH-dependent nitrite reductase is dependent only on NADH/NAD+ as cofactor. That is, most preferably, the NADH-dependent nitrite reductase absolutely requires NADH/NAD+ as a cofactor, not NADPH/NADP+ as a cofactor.
Preferably, the NADH-dependent nitrite reductase is an NADH-dependent nitrite reductase of enzyme class EC 1.7.1.15 (i.e., having EC number EC 1.7.1.15). Suitably, the NADH-dependent nitrite reductase (also referred to as NADH-dependent nitrite oxidoreductase) is an enzyme which catalyzes at least the following chemical reaction:
nitrite +3NADH+5H + Ammonia+3 NAD + +2H 2 O
Those skilled in the art will appreciate that ammonia may also be used as the so-called ammonium hydroxide NH 4 OH is present and/or is referred to as so-called ammonium hydroxide NH 4 OH
Suitable NADH-dependent nitrite reductases may include, for example, NADH-dependent nitrite reductases derived from one or more of the following: aspergillus nidulans (also known as naked shell of nidulans), toxobacter reniforme (Arcobacter ellisii), toxobacter pacific (Arcobacter pacificus), bacillus subtilis JH642, copper species of Pacific fungus of Taiwan, escherichia coli, rockwell of Taiwan (Ralstonia taiwanensis), morganella syringiensis (Ralstonia syzygii), ralstonia solanacearum (Ralstonia solanacearum), rhodobacter capsulatus, burkholderia of Li Bei Luofu (Paraburkholderia ribeironis); and/or functional homologs of such NADH-dependent nitrite reductase comprising an amino acid sequence that has at least 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or at least 99% amino acid sequence identity to one or more of such NADH-dependent nitrite reductase as described above; and/or functional homologues of such NADH-dependent nitrite reductase comprising an amino acid sequence having one or several substitutions, insertions and/or deletions compared to the amino acid sequence of one or more of such NADH-dependent nitrite reductase as described above, wherein preferably the amino acid sequence of any of the above functional homologues has no more than 300, 250, 200, 150, 100, 75, 50, 40, 30, 20, 10 or 5 amino acid substitutions, insertions and/or deletions compared to such NADH-dependent nitrite reductase as described above.
Coli utilizes several different enzymes in its nitrite assimilation pathway. The nirD gene encodes the NADH-dependent nitrite reductase (NADH) small subunit, while the nirB gene encodes the NADH-dependent nitrite reductase (NADH) large subunit.
Preferred NADH-dependent nitrite reductases include NADH-dependent nitrite reductases derived from Aspergillus nidulans (also known as naked nidulans), which are nitrite reductases capable of utilizing both NADH and NADPH as electron donors, and/or E.coli. Nitrite reductase encoded by the nirB gene of E.coli is particularly preferred at high nitrate and/or nitrite concentrations.
Thus, preferred NADH-dependent nitrite reductases include: such NADH-dependent nitrite reductase comprising a polypeptide having the amino acid sequence of SEQ ID NO. 91 (E.coli nitrite reductase small subunit encoded by nirD) and/or SEQ ID NO. 92 (E.coli nitrite reductase large subunit encoded by nirB) and/or SEQ ID NO. 93 (nidogen nucleocapsid nitrate reductase encoded by niiA) as described herein; and/or functional homologs of SEQ ID NO. 91 and/or SEQ ID NO. 92 and/or SEQ ID NO. 93 comprising an amino acid sequence having at least 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or at least 99% amino acid sequence identity to one or more of SEQ ID NO. 91 and/or SEQ ID NO. 92 and/or SEQ ID NO. 93, respectively; and/or functional homologs of SEQ ID NO. 91 and/or SEQ ID NO. 92 and/or SEQ ID NO. 93 comprising an amino acid sequence having one or several substitutions, insertions and/or deletions compared to the amino acid sequence of one or more of SEQ ID NO. 91 and/or SEQ ID NO. 92 and/or SEQ ID NO. 93, respectively. Preferably, the amino acid sequence of any of the above functional homologs has NO more than 300, 250, 200, 150, 100, 75, 50, 40, 30, 20, 10 or 5 amino acid substitutions, insertions and/or deletions compared to SEQ ID NO 91 and/or SEQ ID NO 92 and/or SEQ ID NO 93, respectively.
Preferably, the recombinant yeast cell comprises an exogenous gene encoding an enzyme having NADH dependent nitrite reductase activity. More preferably, the recombinant yeast cell comprises an exogenous gene encoding an enzyme having NADH-dependent nitrite reductase activity selected from the group consisting of NADH-dependent nitrite reductases as derived from: aspergillus nidulans (also known as naked shell of nidulans), toxobacter renifolius, toxobacter pacific, bacillus subtilis JH642, copper bacterium, escherichia coli, ralstonia taiwan, ralstonia nectaricum, ralstonia solanaceae, rhodobacter capsulatus, and Burkholderia of Ri Bei Luofu; and/or functional homologs of such NADH-dependent nitrite reductase comprising an amino acid sequence that has at least 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or at least 99% amino acid sequence identity to one or more of such NADH-dependent nitrite reductase as described above; and/or functional homologues of such NADH-dependent nitrite reductase comprising an amino acid sequence having one or several substitutions, insertions and/or deletions compared to the amino acid sequence of one or more of such NADH-dependent nitrite reductase as described above, wherein preferably the amino acid sequence of any of the above functional homologues has no more than 300, 250, 200, 150, 100, 75, 50, 40, 30, 20, 10 or 5 amino acid substitutions, insertions and/or deletions compared to such NADH-dependent nitrite reductase as described above.
Suitably, the recombinant yeast cell may comprise an amino acid sequence encoding any of SEQ ID NO. 91 (small subunit of E.coli nitrate reductase encoded by nirD) and/or SEQ ID NO. 92 (large subunit of E.coli nitrate reductase encoded by nirB) and/or SEQ ID NO. 93 (naked nido-capsid nitrate reductase encoded by niiA) or a nucleotide sequence having one or several substitutions, insertions and/or deletions compared to the amino acid sequence of any of SEQ ID NO. 91 and/or SEQ ID NO. 92 and/or SEQ ID NO. 93. Preferably, the amino acid sequence has NO more than 300, 250, 200, 150, 100, 75, 50, 40, 30, 20, 10 or 5 amino acid substitutions, insertions and/or deletions compared to SEQ ID NO. 91 and/or SEQ ID NO. 92 and/or SEQ ID NO. 93, respectively.
The recombinant yeast cell can combine one or more genes encoding one or more of the above NADH-dependent nitrite reductase with one or more genes encoding NADPH-dependent nitrate reductase. Preferably, however, the recombinant yeast cell combines one or more genes encoding one or more of the above NADH-dependent nitrite reductase with one or more genes encoding NADH-dependent nitrate reductase.
Examples of suitable NADH-dependent nitrite reductases, their UniProt database accession numbers (which can be found on the Uniprot website (www.uniprot.org/, 10 th.4 th.2020), their descriptions, organisms from which they can be derived and their amino acid sequence identity with SEQ ID NO:91 (the small subunit encoded by nirD) are listed in Table 11 below.
Examples of suitable NADH-dependent nitrite reductases, their UniProt database accession numbers (which can be found on the Uniprot website (www.uniprot.org/, 10 th.4 th.2020), their descriptions, organisms from which they can be derived and their amino acid sequence identity with SEQ ID NO:92 (the large subunit encoded by nirB) are listed in Table 12 below.
Table 11:examples of suitable NADH-dependent nitrite reductases, their UniProt database accession numbers (which can be found on the Uniprot website (www.uniprot.org/, 10 th.4 th.2020)), their descriptions, organisms from which they can be derived, and their amino acid sequence identity with SEQ ID NO:91 (the small subunit encoded by nirD).
Table 12:examples of suitable NADH-dependent nitrite reductases, their UniProt database accession numbers (which can be found on the Uniprot website (www.uniprot.org/, 10 th.4 th.2020)), their descriptions, organisms from which they can be derived and their use with SEQ ID NO:92 (large subunit encoded by nirB) Group) amino acid sequence identity.
Nitrate/nitrite transporter
Preferably, the recombinant yeast cell further comprises one or more genetic modifications that result in increased transport of a nitric oxide source (such as nitrate or nitrite) into the yeast cell. More preferably, the recombinant yeast cell further comprises one or more genes encoding nitrate and/or nitrite transport proteins.
Suitable transporters may include sulfite transporters Ssu1 and Ssu2 (as described by cabera et al in the article entitled "Molecular Components of Nitrate and Nitrite Efflux in Yeast" [ "molecular components of nitrate and nitrite effluents in yeast" ], published on pages 267-278 of the volume 13, 2 nd Eukaryotic Cell [ Eukaryotic Cell ], 2014, incorporated herein by reference); and nitrate/nitrite transporter YNT1 derived from pichia angusta (also known as hansenula polymorpha); and/or a functional homolog of one or more of such nitrate/nitrite transporters comprising an amino acid sequence that has at least 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or at least 99% amino acid sequence identity to one or more of the nitrate/nitrite transporters described above; and/or functional homologs of one or more of such nitrate/nitrite transporters comprising an amino acid sequence having one or more substitutions, insertions and/or deletions compared to the amino acid sequence of one or more of such nitrate/nitrite transporters described above, wherein preferably the amino acid sequence of any of the above functional homologs has no more than 300, 250, 200, 150, 100, 75, 50, 40, 30, 20, 10 or 5 amino acid substitutions, insertions and/or deletions compared to such nitrate/nitrite transporter YNT1 described above.
Preferably, the recombinant yeast cell comprises a nucleic acid sequence encoding: nitrate/nitrite transporter YNT1 derived from pichia angustifolia; and/or a functional homolog of the nitrate/nitrite transporter YNT1 comprising an amino acid sequence that has at least 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or at least 99% amino acid sequence identity to nitrate/nitrite transporter YNT1; and/or functional homologs of such nitrate/nitrite transporter YNT1 comprising an amino acid sequence having one or more substitutions, insertions and/or deletions compared to the amino acid sequence of one or more of such nitrate/nitrite transporter YNT1 described above, wherein preferably the amino acid sequence of any of the above functional homologs has no more than 300, 250, 200, 150, 100, 75, 50, 40, 30, 20, 10 or 5 amino acid substitutions, insertions and/or deletions compared to such nitrate/nitrite transporter YNT1 described above.
Thus, preferred nitrate/nitrite transporters include: such nitrate/nitrite transporter comprising a polypeptide having the amino acid sequence of SEQ ID NO. 94 as described herein; and/or functional homologs of SEQ ID NO. 94 comprising an amino acid sequence having at least 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or at least 99% amino acid sequence identity with SEQ ID NO. 94; and/or functional homologues of SEQ ID NO. 94 comprising an amino acid sequence having one or more substitutions, insertions and/or deletions compared to the amino acid sequence of SEQ ID NO. 94. Preferably, the amino acid sequence of any of the above functional homologs has NO more than 300, 250, 200, 150, 100, 75, 50, 40, 30, 20, 10 or 5 amino acid substitutions, insertions and/or deletions compared to SEQ ID NO. 94.
Suitably, the recombinant yeast cell may comprise a nucleotide sequence encoding the amino acid sequence of SEQ ID NO. 94 or an amino acid sequence having one or several substitutions, insertions and/or deletions compared to the amino acid sequence of any of SEQ ID NO. 94. Preferably, the amino acid sequence has NO more than 300, 250, 200, 150, 100, 75, 50, 40, 30, 20, 10 or 5 amino acid substitutions, insertions and/or deletions compared to SEQ ID NO. 94, respectively.
Examples of suitable nitrite/nitrate transporters, their UniProt database accession numbers (which can be found on Uniprot website (www.uniprot.org/, 10 th.4 th.2020), descriptions thereof, organisms from which they can be derived and their amino acid sequence identity with SEQ ID NO. 94 are listed in Table 13 below.
Table 13:suitable nitrite/nitrate transportExamples of proteins, their UniProt database accession numbers (which can be found on the UniProt website (www.uniprot.org/, 10 th.4 th 2020)), descriptions thereof, organisms from which they can be derived, and their amino acid sequence identity with SEQ ID No. 94.
Cofactor(s)
Preferably, the recombinant yeast cell further comprises a suitable cofactor to enhance the activity of the above-mentioned NADH-dependent nitrate reductase and/or NADH-dependent nitrite reductase. Preferred cofactors include Flavin Adenine Dinucleotide (FAD), heme cofactor and/or molybdenum cofactor (MoCo). Thus, preferably, the recombinant yeast cell may further comprise one or more genes encoding enzymes for synthesizing one or more of Flavin Adenine Dinucleotide (FAD), heme prosthetic group, and/or molybdenum cofactor (MoCo). For example, a recombinant yeast cell can comprise one or more genes encoding an enzyme having FAD synthase activity. Preferred cofactors are as exemplified in non-prepublished U.S. patent application US 63087642 filed on 5 th 10 th 2020 to the U.S. patent office, the contents of which are incorporated herein by reference.
Recombinant expression
Recombinant yeast cells are such recombinant cells. That is, the recombinant yeast cell comprises a nucleotide sequence that does not naturally occur in the cell in question, or is transformed with or genetically modified with the nucleotide sequence. Techniques for recombinantly expressing enzymes in cells and for making additional genetic modifications to recombinant yeast cells are well known to those skilled in the art. Typically, such techniques involve transforming cells with a nucleic acid construct comprising the relevant sequence. Such methods are known, for example, from standard handbooks, such as Sambrook and Russel (2001) "Molecular Cloning: ALaboratory Manual" [ "molecular cloning: laboratory Manual (3 rd edition), published by Cold Spring Harbor Laboratory Press [ Cold spring harbor laboratory Press ], or edited by F.Ausubel et al, "Current protocols in molecular biology" [ "guidelines for molecular biology experiments" ], green Publishing and Wiley Interscience [ Green publishing company and American Wei Lip, new York (1987). Methods for transforming and genetically modifying fungal host cells are known, for example, from EP-A-06355574, WO 98/46772, WO 99/60102, WO 00/37671, WO 90/14423, EP-A-0481008, EP-A-0635554 and US 6265186.
Fermentation process
The invention further provides a method for producing ethanol, comprising converting a carbon source, preferably a carbohydrate or another organic carbon source, using a recombinant yeast cell as described in the specification, thereby forming ethanol.
The feed for the fermentation process suitably comprises one or more fermentable carbon sources. The fermentable carbon source preferably comprises or consists of one or more fermentable carbohydrates. More preferably, the fermentable carbon source comprises one or more monosaccharides, disaccharides and/or polysaccharides. For example, the fermentable carbon source may comprise one or more carbohydrates selected from the group consisting of: glucose, fructose, sucrose, maltose, xylose, arabinose, galactose, mannose and trehalose. The fermentable carbon source preferably comprising or consisting of one or more carbohydrates may suitably be obtained from starch, cellulose, hemicellulose, lignocellulose and/or pectin. Suitably, the fermentable carbon source may be in the form of a slurry, suspension or liquid, preferably aqueous.
The concentration of fermentable carbohydrates (such as, for example, glucose) during fermentation is preferably equal to or greater than 80g/L. That is, the initial concentration of glucose at the beginning of fermentation is preferably 80g/L or more, more preferably 90g/L or more, even more preferably 100g/L or more, still more preferably 110g/L or more, still even more preferably 120g/L or more, 130g/L or more, 140g/L or more, 150g/L or more, 160g/L or more, 170g/L or 180g/L or more. The initiation of fermentation may be at the time of contacting the fermentable carbohydrate with the recombinant cells of the invention.
The fermentable carbon source may be prepared by contacting starch, lignocellulose and/or pectin with an enzyme composition wherein one or more mono-, di-, and/or polysaccharides are produced and wherein the produced mono-, di-, and/or polysaccharides are subsequently fermented to obtain a fermentation product.
The lignocellulosic material may be pretreated prior to the enzymatic treatment. Pretreatment may include exposing the lignocellulosic material to an acid, base, solvent, heat, peroxide, ozone, mechanical comminution, grinding, milling or rapid depressurization, or a combination of any two or more thereof. Such chemical pretreatment is typically combined with thermal pretreatment (e.g., between 150 ℃ and 220 ℃ for 1 to 30 minutes). The pretreated material may then be subjected to enzymatic hydrolysis to release sugars that may be fermented according to the invention. This can be done in a conventional manner, for example, by contacting with a cellulase (e.g., one or more cellobiohydrolases, one or more endoglucanases, one or more beta-glucosidase enzymes, and optionally other enzymes). The conversion with cellulase enzymes may be carried out at ambient temperature or higher for a reaction time that releases a sufficient amount of one or more sugars. The result of enzymatic hydrolysis is a hydrolysate comprising C5/C6 sugars, referred to herein as a sugar composition.
In one embodiment, the fermentable carbohydrate is or consists of a biomass hydrolysate such as corn stover or corn fiber hydrolysate. Such biomass hydrolysate, in turn, may comprise or be derived from corn stover and/or corn fiber.
By "hydrolysate" is herein understood a material comprising polysaccharides (such as corn stover, corn starch, corn fiber or lignocellulose material) which have been hydrolyzed by the addition of water to form mono-and oligosaccharides. The hydrolysate can be produced by enzymatic or acid hydrolysis of the polysaccharide containing material.
The biomass hydrolysate may be a lignocellulosic biomass hydrolysate. Lignocellulose herein includes hemicellulose and hemicellulose fractions of biomass. Lignocellulose also includes the lignocellulose fraction of biomass. Suitable lignocellulosic materials can be found in the following list: orchard bottom materials, chalcona communities, mill waste, municipal wood waste, municipal waste, felling waste, forest raising waste (forest thining), short-term rotation woody crops, industrial waste, wheat straw, oat straw, rice straw, barley straw, rye straw, flax straw, soybean hulls, rice straw, corn gluten feed, oat hulls, sugarcane, corn stover, corn cobs, corn husks, switchgrass, miscanthus, sweet sorghum, canola stems, soybean stems, grassland grasses, duck-cogongrass, foxtail; beet pulp, citrus fruit pulp, seed hulls, cellulose animal waste, lawn-trim waste (including macroalgae and microalgae), cotton, seaweed, algae (including macroalgae and microalgae), trees, softwood, hard wood, poplar, pine, shrub (shrub), grasses, wheat straw, bagasse, corn husks, corncob, corn kernels, fibers from grain, products and byproducts from wet or dry milling of grain, municipal solid waste, waste paper, yard waste, herbaceous material, agricultural residues, forestry residues, municipal solid waste, waste paper, pulp, paper mill residues, branches, shrubs, sugarcane, corn husks, energy crops, forests, fruits, flowers, grains, grasses, herbaceous crops, leaves, bark, needle leaves, raw wood, roots, saplings, shrub (shrub), switchgrass, trees, vegetables, pericarp, vine, sugar beet pulp, wheat bran, oat hulls, hard wood or softwood, organic materials resulting from an agricultural process, wood waste, or a combination of any two or more thereof. Algae (such as macroalgae and microalgae) have the following advantages: they may contain a large amount of sugar alcohols (such as sorbitol and/or mannitol). Lignocellulose, which can be considered as a potentially renewable raw material, generally comprises the polysaccharides cellulose (dextran) and hemicellulose (xylan, heteroxylan and xyloglucan). In addition, some hemicellulose may be present as glucomannans in, for example, wood derived raw materials. These polysaccharides are enzymatically hydrolyzed to soluble sugars (including both monomers and polymers, such as glucose, cellobiose, xylose, arabinose, galactose, fructose, mannose, rhamnose, ribose, galacturonic acid, glucuronic acid, and other hexoses and pentoses) by the action of synergistic diverse enzymes. In addition, pectins and other pectic substances (such as arabinans) may account for a substantial proportion of the typical cell wall dry mass from non-woody plant tissue (about one-fourth to one-half of the dry mass may be pectin). The lignocellulosic material may be pretreated. Pretreatment may include exposing the lignocellulosic material to an acid, base, solvent, heat, peroxide, ozone, mechanical comminution, grinding, milling or rapid depressurization, or a combination of any two or more thereof. Such chemical pretreatment is typically combined with thermal pretreatment (e.g., between 150 ℃ and 220 ℃ for 1 to 30 minutes).
The method for producing ethanol may include an aerobic proliferation step and an anaerobic fermentation step. More preferably, the method according to the invention is a method comprising the steps of: an aerobic proliferation step in which the population of recombinant yeast cells is increased; and an anaerobic fermentation step in which the carbon source is converted into ethanol by using a recombinant yeast cell population.
Proliferation is understood herein as the process of growing recombinant yeast cells that results in an increased initial population of recombinant yeast cells. The main purpose of proliferation is to increase the population of recombinant yeast cells using the recombinant yeast cells as the natural reproductive capacity of living organisms. That is, proliferation is for biomass production, not for ethanol production. Proliferation conditions may include appropriate carbon sources, aeration, temperature and nutrient addition. Proliferation is an aerobic process, so the proliferation vessel must be properly aerated to maintain a certain level of dissolved oxygen. Proper aeration is typically achieved by an air inductor mounted on the pipe into the propagation tank that introduces air into the propagation mixture as the tank fills and during recirculation. The ability of the propagation mixture to retain dissolved oxygen varies with the amount of air added and the consistency of the mixture, which is why water is typically added in a mash to water ratio of between 50:50 and 90:10. "viscous" proliferation mixtures (80:20 and higher mash to water ratios) typically require the addition of compressed air to compensate for the reduced capacity to retain dissolved oxygen. The amount of dissolved oxygen in the propagation mixture also varies with the bubble size, so some ethanol plants add air through spargers that produce smaller bubbles compared to air inductors. Proper aeration and lower glucose are important to promote aerobic respiration during proliferation, so that the environment during proliferation is different from the anaerobic environment during fermentation.
Anaerobic fermentation process is understood herein to be a fermentation step operating under anaerobic conditions.
Anaerobic fermentation is preferably run at a temperature optimal for the cells. Thus, for most recombinant yeast cells, the fermentation process is conducted at a temperature of less than about 50 ℃, less than about 42 ℃, or less than about 38 ℃. For recombinant yeast cells or filamentous fungal host cells, the fermentation process is preferably conducted at a temperature of less than about 35 ℃, about 33 ℃, about 30 ℃, or about 28 ℃ and at a temperature of greater than about 20 ℃, about 22 ℃, or about 25 ℃.
In the process according to the invention, the ethanol yield based on xylose and/or glucose is preferably at least about 50%, about 60%, about 70%, about 80%, about 90%, about 95% or about 98%. Ethanol yield is defined herein as a percentage of the theoretical maximum yield.
The process according to the invention and the propagation step and/or fermentation step suitably included therein may be carried out in batch, fed-batch or continuous mode. A stepwise hydrolysis and fermentation (separate hydrolysis and fermentation, SHF) process or a simultaneous saccharification and fermentation (simultaneous saccharification and fermentation, SSF) process may also be applied.
The recombinant yeasts and methods according to the invention advantageously allow a more robust method. Advantageously, the process or any anaerobic fermentation during the process may be carried out in the presence of a high concentration of carbon source. Thus, the process (and correspondingly any anaerobic fermentation step therein) is preferably carried out in the presence of glucose at the following concentrations: 25g/L or higher, 30g/L or higher, 35g/L or higher, 40g/L or higher, 45g/L or higher, 50g/L or higher, 55g/L or higher, 60g/L or higher, 65g/L or higher, 70g/L or higher, 75g/L or higher, 80g/L or higher, 85g/L or higher, 90g/L or higher, 95g/L or higher, 100g/L or higher, 110g/L or higher, 120g/L or higher, or may be, for example, in the range of 25g/L-250g/L, 30g/L-200g/L, 40g/L-200g/L, 50g/L-200g/L, 60g/L-200g/L, 70g/L-200g/L, 80g/L-200g/L, or 90g/L-200 g/L.
For recovery of the fermentation product, the prior art is used. Different recovery methods are appropriate for different fermentation products. Existing processes for recovering ethanol from aqueous mixtures typically use fractionation and adsorption techniques. For example, beer distillers can be used to process fermentation products containing ethanol in an aqueous mixture to produce an ethanol-enriched mixture, which is then fractionated (e.g., by fractional distillation or other similar techniques). Next, the fraction containing the highest concentration of ethanol may be passed through an adsorbent to remove most, if not all, of the remaining water from the ethanol. In one embodiment, in addition to recovering the fermentation product, yeast may be recovered.
All patents and references cited in this specification are incorporated herein by reference in their entirety.
The following examples are provided for illustrative purposes only and are not intended to limit the scope of the present invention in any way
Examples
General molecular biology techniques
Unless indicated otherwise, the methods used are standard biochemical techniques. Examples of suitable general method textbooks include Sambrook et al, molecular Cloning, a Laboratory Manual [ molecular cloning, A laboratory Manual ] (1989) and Ausubel et al, current Protocols in Molecular Biology [ guidelines for molecular biology experiments ] (1995), john Wiley & Sons, inc. [ John Willi parent ].
HPLC analysis
HPLC analysis is typically performed as described in the following documents: "Determination of sugars, byproducts and degradation products in liquid fraction in process sample" [ "determination of sugar, by-products and degradation products in liquid fraction in process sample" ]; laboratory analysis procedures (Laboratory Analytical Procedure, LAP, release date: 12/08/2006; A.Sluicter, B.Hames, R.Ruiz, C.Scarlata, J.Sluicter and D.Templeton; technical Report [ Technical Report ] (NREL/TP-51042623); month 1 in 2008; national Renewable Energy Laboratory [ national renewable energy laboratory ].
After fermentation, the sample for HPLC analysis was separated from the yeast biomass and insoluble components (corn mash) by passing the clarified supernatant after centrifugation through a 0.2 μm pore size filter
Initial strain
Using EthanolAs a starting strain, a strain was prepared. Ethanol->Is a commercial strain of Saccharomyces cerevisiae available from Le Sifu company (Lesafre).
The strain construction methods that can be followed are described in WO 2013/144257A1 and WO 2015/028582 (incorporated herein by reference).
Expression cassettes from a variety of genes of interest can be recombined in vivo into the pathway at a specific locus after transformation of the yeast (US 9738890 B2). The promoter, ORF and terminator sequences were assembled into expression cassettes using Golden Gate technology as described by Engler et al (2011) and ligated into Bsal digested backbone vectors that modified the expression cassettes with linkers for in vivo recombination procedures. The expression cassette including the linker was amplified by PCR. In addition, PCR was used to amplify 5 'and 3' dna fragments of the upstream and downstream portions of the integration locus and modified with linker sequences. After transformation of yeast cells with these DNA fragments, in vivo recombination and integration into the genome is performed at the desired location. CRISPR-Cas9 technology is used to create unique double strand breaks at the integration locus to target the pathway to this particular locus (DiCarlo et al, 2013,Nucleic Acids Res [ nucleic acids research ] 41:4336-4343) as well as WO 16110512 and US 2019309268. The gRNA is expressed from a multicopy yeast shuttle vector containing a natMX marker that confers resistance to the yeast cell against the biomass Nociceptin (NTC). The backbone of this plasmid is based on pRS305 (Sikorski and Hieter, genetics [ Genetics ]1989, vol.122, pages 19-27) and comprises a functional 2 μm ORI sequence. Streptococcus pyogenes (Streptococcus pyogenes) CRISPR-associated protein 9 (Cas 9) was expressed from the pRS414 plasmid with kanMX markers (Sikorski and Hieter, 1989), which confer resistance to biomass geneticin (G418) on yeast cells. The guide RNA and protospacer sequences were designed using the gRNA design tool (https:// www.atum.bio/eCommerce/cas 9/input).
Example 1: construction of the "Rubisco" Strain (intermediate Strain IX 1)
In this example, the starting strain was transformed with the cbbM gene from thiobacillus denitrificans encoding the single subunit of ribulose-1, 5-bisphosphate carboxylase (RuBisCO) and the genes from escherichia coli encoding the chaperones GroEL and GroES (to help the correct folding of the RuBisCO protein in the cytosol of saccharomyces cerevisiae).
Expression cassettes with the Saccharomyces cerevisiae TDH3 promoter, the cbbM gene and the Saccharomyces cerevisiae CYC1 terminator were obtained from plasmid pBTWW002, expression cassettes with the Saccharomyces cerevisiae TEF1 promoter, groEL and Saccharomyces cerevisiae ACT1 terminator were obtained from plasmid pUD232, and expression cassettes with the Saccharomyces cerevisiae TPI1 promoter, groES gene and Saccharomyces cerevisiae PGI1 terminator were obtained from plasmid pUD233, as described in the following documents: guadalube-Medina et al, biotechnol, biofuels [ biotech of biofuel ],2013, vol.6, p.125 and US2019309268.
The three cassettes were integrated with CRISPR-Cas9 using the gRNA expression cassette into the INT1 locus located between NTR1 (YOR 071 c) and GYP1 (YOR 070 c) on the XV chromosome of Saccharomyces cerevisiae (as described in Di Carlo et al, nucleic Acids Res. [ nucleic acids Ind. 2013; pages 1-8). This cassette was ordered as a synthetic DNA cassette (gBLOCK) at integrated DNA technologies company (Integrated DNA Technologies) (belgium), as INT1 gBLOCK and by homologous recombination with: PCR fragments (5 '-INT 1) generated using genomic DNA of strain CEN.PK113-7D as template with primers BoZ-783 and DBC-18463, and PCR fragments (3' -INT 1) generated using genomic DNA of strain CEN.PK113-7D as template with primers DBC-18464 and BoZ-788, as described in WO 2018/114762.
Diagnostic PCR was performed to confirm proper assembly and integration of the GroES-cbbM-GroEL cassette at the INT1 locus. Cas9 and gRNA plasmids were removed by two overnight liquid incubations with YEPhD, dilutions were plated on YEPhD plates, after which single colonies were picked. Cells prepared from single colonies of the YEPhD plates were streaked and re-streaked on YEPhD and YEPhD+G418 (400 mg/ml) and YEPhD+NTC (200 mg/ml). Plasmid-free colonies that did not grow on the antibiotic plates were selected.
The above example gives an intermediate strain IX1 which contains two copies of GroES, cbbM and GroEL (see Table 14 for detailed genotypes).
Example 2: reference "PRK-Rubisco" strainConstruction of reference Strain RX2
Intermediate strain IX1 obtained as in example 1 was transformed with ribulose-phosphate kinase (prk) from S.oleracera (as described in Guadalube-Medina et al, biotechnol, biofuels [ biotech of biofuel ],2013, volume 6, page 125) and ribulose-1, 5-bisphosphate carboxylase (RubisCO) from Thiobacillus denitrificans encoded by the cbbM gene. The prk gene was expressed by the Saccharomyces cerevisiae ANB1 promoter (shown by SEQ ID NO:26 (cbbM), SEQ ID NO:28 (prk) and SEQ ID NO:23 (ANB 1)) and the Saccharomyces cerevisiae EBO1 terminator (Sc_EBO1t) (shown in SEQ ID NO: 79) as described in US 10689670.
The cbbM gene was cloned between the saccharomyces cerevisiae ENO2 promoter (shown by SEQ ID No. 80) and the saccharomyces cerevisiae YOX1 terminator (shown by SEQ ID No. 81).
The cassette was assembled into the "prk-cbbM" pathway and the following two sequences for homologous integration were integrated into the INT14.02 locus as a non-coding region between ORF YNL179c and RPS3 (YNL 178W) on chromosome XIV from saccharomyces cerevisiae using CRISPR-Cas9 with the INT14.02 proto-spacer (shown by SEQ ID No. 82): sc_INt14.02_flanking 5 (shown by SEQ ID NO: 83) and Sc_INt14.02_flanking 3 (shown by SEQ ID NO: 84).
Diagnostic PCR was performed to confirm proper assembly and integration of the prk and cbbM cassettes at the INT14.02 locus, and the gRNA and Cas9 plasmids were removed as described above. This gave a reference strain RX2 which contained two copies of prk, groEL and groES and 4 copies of cbbM (see Table 14 for detailed genotypes).
Example 3: construction of novel strains NX3 and NX4
New strains NX3 and NX4 were constructed by transforming the reference strain RX2 obtained in example 2 as follows:
the DNA fragments were compiled using Golden Gate clones (as described, for example, in Engler et al, "Generation of Families of Construct Variants Using Golden Gate Shuffling" [ "construct variant family generated using Golden Gate shuffling" ], (2011), published in Chaofu Lu et al (eds.), cDNA Libraies: methods and Applications, methods in Molecular Biology [ cDNA library: methods and applications, methods of molecular biology ], vol.729, chapter 11 at pages 167-180, incorporated herein by reference), and contained the Saccharomyces cerevisiae ANB1 promoter (shown by SEQ ID NO: 23), pichia pastoris TKL1 gene (shown by SEQ ID NO: 18), and Saccharomyces cerevisiae TDH1 terminator. This DNA fragment was designated "fragment A" (shown by SEQ ID NO: 73). Fragment A was integrated into the INT95 locus located between SOD1 (YJR 104C) and ADO1 (YJR 105W) on chromosome X of Saccharomyces cerevisiae reference strain RX2 using CRISPR-Cas9 and INT95 protospacers (shown by SEQ ID NO: 85) and the following two sequences for homologous integration: sc_INT95B_flanking 5 (shown by SEQ ID NO: 86) and Sc_INT95B_flanking 3 (shown by SEQ ID NO: 87).
Diagnostic PCR was performed to confirm proper assembly and integration of the facilitation pichia TKL1 expression cassette at the INT95 locus. Plasmid-free colonies were selected and this resulted in new strains NX3 and NX4 containing two copies of the promoted pichia TKL1 expression cassette (see table 14 for detailed genotypes).
Example 4: construction of novel strains NX5 and NX6
New strains NX5 and NX6 were constructed by transforming the reference strain RX2 obtained in example 2 with the following four expression cassettes:
the first cassette contains a DNA fragment named "fragment a" comprising the saccharomyces cerevisiae ANB1 promoter, pichia pastoris TKL1 orf and saccharomyces cerevisiae TDH1 terminator, as mentioned in example 3. (shown by SEQ ID NO: 73)
The second cassette contains a DNA fragment named "fragment B" comprising the saccharomyces cerevisiae MYO4 promoter (sc_myo4. Pro), saccharomyces cerevisiae DAK1 orf (sc_dak1. Orf) and saccharomyces cerevisiae GPM1 terminator (sc_gpm1. Ter). The nucleic acid sequence of this DNA fragment is shown in SEQ ID NO. 74.
The third cassette contains a DNA fragment designated "fragment C" comprising the Saccharomyces cerevisiae HHF2 promoter (Sc_HH2. Pro), E.coli gldA orf (ec_gldA. Orf) and the Saccharomyces cerevisiae EFM1 terminator (Sc_EFM1. Ter). The nucleic acid sequence of this DNA fragment is shown in SEQ ID NO. 75.
The fourth cassette contains a DNA fragment named "fragment D" comprising the saccharomyces cerevisiae ANB1 promoter (sc_anb1. Pro_0001), the reuptake yeast orf (zrou_t5. Orf) encoding the glycerol transporter GLYT (ZYRO 0E 01210) and the saccharomyces cerevisiae terminator (sc_tef1. Ter_0001). The nucleic acid sequence of this DNA fragment is shown in SEQ ID NO. 76.
These four cassettes were integrated into the INT95 locus located between SOD1 (YJR 104C) and ADO1 (YJR 105W) on chromosome X of s.cerevisiae reference strain RX2 using CRISPR-Cas9 using the following sequences for homologous integration as described above, i.e., with Sc_INT95B_flanking 5 (shown by SEQ ID NO: 86) and Sc_INT95B_flanking 3 (shown by SEQ ID NO: 87).
Diagnostic PCR was performed to confirm that the four expression cassettes were assembled and integrated correctly at the INT95 locus. Plasmid-free colonies were selected, which resulted in new strains NX5 and NX6 (see table 14 for detailed genotypes).
Example 5: construction of novel strains NX7 and NX8
New strains NX7 and NX8 were constructed by transforming the reference strain RX2 obtained in example 2 with a cassette containing the DNA fragment comprising the Saccharomyces cerevisiae ANB1 promoter, pichia pastoris TKL1 orf and Saccharomyces cerevisiae TDH1 terminator as mentioned in example 3 (i.e. "fragment A") (shown by SEQ ID NO: 73) in the INT95 locus using CRISPR-Cas9 with Sc_INT95B_flanking 5.Acc (SEQ ID NO: 86) and Sc_INT95B_flanking 3.Acc (SEQ ID NO: 87) also shown in example 3 for homologous integration.
GPD2 was deleted using CRISPR-Cas9 and GPD2 proto-spacers (shown by SEQ ID NO: 88) and repair DNA fragments for homologous recombination and deletion of GPD2 (shown by SEQ ID NO: 78). INT95 integration and GPD2 loss were checked by diagnostic PCR. Plasmid-free colonies were selected, which resulted in new strains NX7 and NX8 (see table 14 for detailed genotypes).
Example 6: construction of novel strains NX9 and NX10
New strains NX9 and NX10 were constructed by transforming reference strain RX2 with the following three expression cassettes:
the first cassette contains a DNA fragment named "fragment a" comprising the saccharomyces cerevisiae ANB1 promoter, pichia pastoris TKL1 orf and saccharomyces cerevisiae TDH1 terminator, as mentioned in example 3. (shown by SEQ ID NO: 73)
The second cassette contains a DNA fragment designated "fragment E" which comprises the Saccharomyces cerevisiae PFY1 promoter (Sc_PFY1. Pro), E.coli gldA orf (ec_gldA. Orf) and the Saccharomyces cerevisiae EFM1 terminator (Sc_EFM1. Ter). The nucleic acid sequence of this DNA fragment is shown in SEQ ID NO. 77.
The third cassette contains a DNA fragment named "fragment D" comprising the saccharomyces cerevisiae ANB1 promoter (sc_anb1. Pro_0001), the reuptake yeast orf (zrou_t5. Orf) encoding the glycerol transporter GLYT (ZYRO 0E 01210) and the saccharomyces cerevisiae terminator (sc_tef1. Ter_0001). The nucleic acid sequence of this DNA fragment is shown in SEQ ID NO. 76.
The three cassettes were integrated into the INT95 locus using CRISPR-Cas9 using INT95B_flanking 5 (SEQ ID NO: 86) and INT95B_flanking 3 (SEQ ID NO: 87) for homologous integration. Diagnostic PCR was performed to confirm that the three expression cassettes were assembled and integrated correctly at the INT95 locus. Plasmid-free colonies were selected, which resulted in strains NX9 and NX10 (see table 14 for detailed genotypes).
Table 14: saccharomyces cerevisiae strains used in this study
Example 7: fermentation
Precultures of the above new "NX3-NX10" strains were prepared as follows: glycerol stock (-80 ℃) was thawed at room temperature and used to inoculate 0.2L mineral medium supplemented with 2% (w/v) glucose at pH 6.0 (adjusted with 2m h2so4/4N KOH) in a baffle-free 0.5L shake flask [ as described in: luttik, MLH et al (2000) "The Saccharomyces cerevisiae ICL2 Gene Encodes a Mitochondrial 2-Methylisocitrate Lyase Involved in Propionyl-Coenzyme A Metabolism" [ "Saccharomyces cerevisiae ICL2 gene encodes mitochondrial 2-methyl isocitrate lyase involved in propionyl-CoA metabolism" ].J.Bacteriol. [ J.bacteriology ]182:7007-13]. The preculture was incubated at 32℃for 18 hours and shaken at 200 RPM. After estimating the CDW by OD600 measurement (using the existing yeast Cell Dry Weight (CDW) versus OD600 calibration curve), the preculture was centrifuged (3 min,530 x g) in an amount corresponding to the 0.5g CDW/L inoculum concentration required for proliferation, washed once with sterile demineralized water of a sample volume, centrifuged once again, and resuspended in proliferation medium.
Proliferation of the above "NX3-NX10" strain was performed as follows: the propagation step was performed in 500mL shake flasks using 100mL of filtered and diluted corn mash (70% v/v corn mash: 30% v/v water) supplemented with 1.25g/L urea and the following antibiotics: the final concentrations were 50. Mu.g/mL and 100. Mu.g/mL neomycin and penicillin G, respectively. After all additions, the pH was adjusted to 5.0 using 2M H2SO4/4N KOH. Glucoamylase was administered at a concentration of 0.1mL/L at the beginning of proliferationT, novelimes (Novozymes)). All strains were allowed to proliferate for 6h at 32℃and were shaken at 200 RPM.
The main fermentation of the above "NX3-NX10" strain was performed as follows: the main fermentation step was performed using 200ml of medium in a 500ml Schott bottle equipped with a pressure recording/release cap (Ankom Technology, ma Xideng, new york, usa) while shaking at 140rpm and 32 ℃. The pH was not controlled during fermentation. Fermentation was performed with corn mash with an increased dry solids content of 36% w/w DS. Subsequently, the corn mash was supplemented with 1.0g/L urea and the following antibiotics: neomycin and penicillin G at final concentrations of 50 μg/mL and 100 μg/mL, respectively; antifoam (Basildon, inc., approximately 0.5 mL/L). After all additions, the pH was adjusted to 5.0 using 2M H2SO4/4N KOH. Glucoamylase was administered at a concentration of 0.24mL/L at the beginning of fermentation T, novelin). The amount of yeast added (pitch) required from propagation to fermentation was 1.5% of the fermentation volume. All strains were tested at high solids (i.e. 36% w/wDS).
Sampling of the fermentation was performed as follows: samples were taken from the primary fermentation only. Samples for HPLC analysis were collected at 18, 24, 42, 48 and 66 hours. Table 15 summarizes the ethanol production (g/l) at each time point. Table 16 summarizes the remaining glucose concentrations (g/l) at each time point.
The conclusion is as follows: the remaining glucose concentration is an indicator of the robustness of the yeast strain. Glucose is continuously produced due to the presence of glucoamylase. Without wishing to be bound by any type of theory, it is believed that less robust strains will become more inhibited near the end of fermentation, and therefore will identify higher concentrations of unconverted glucose in the sample. More robust strains will become less inhibited near the end of the fermentation, and therefore will identify lower concentrations of unconverted glucose in the sample.
As shown in table 16, the concentration of unconverted glucose obtained after 48 hours and 66 hours of fermentation by yeasts NX3, NX4, NX5, NX6, NX7, NX8, NX9 and NX10 according to the present invention was lower than the reference strain.
Example 8 construction of reference Strain FGG1-pPATH1 (i.e., reference Strain RX 11) expressing the phosphoketolase pathway
WO 2018/172328 describes the construction of several strains of saccharomyces cerevisiae (including FGG1-pPATH1 strains) that express the phosphoketolase pathway. The strain FGG1-pPATH1 has a related genotype comprising PKL, PTA and AADH. A summary of relevant strains for the following examples is provided in table 17 below. The strain may be constructed as described in WO 2018/172328 (incorporated herein by reference).
As explained in WO 2018/172328, in an industrial environment, the tolerance to hypertonicity and stress response to the external environment of strains such as FGG1-pPATH1 strains may be affected.
Table 17: saccharomyces cerevisiae strain expressing phosphoketolase pathway
Example 9: construction of New Strain NX12 (prophetic according to the invention)
New strain NX12 can be constructed by transforming reference strain RX11 (FGG 1-pPATH1 as described in WO 2018/172328) as follows:
a DNA fragment was compiled comprising the Saccharomyces cerevisiae ANB1 promoter (shown by SEQ ID NO: 23), the Pichia pastoris TKL1 gene (shown by SEQ ID NO: 18) and the Saccharomyces cerevisiae TDH1 terminator. This DNA fragment was designated "fragment A" (shown by SEQ ID NO: 73). DNA fragment A was assembled using Golden Gate clones (as described, for example, in Engler et al, "Generation of Families of Construct Variants Using Golden Gate Shuffling" [ "use Golden Gate shuffling to generate construct variant family" ], (2011), published in Chaofu Lu et al (eds.), cDNA Libraies: methods and Applications, methods in Molecular Biology [ cDNA library: methods and applications, methods of molecular biology ], volume 729, chapter 11, pages 167-180, incorporated herein by reference). Using CRISPR-Cas9 and INT95 protospacers (shown by SEQ ID NO: 85) the following two sequences for homologous integration, the expression cassette can be integrated into the INT95 locus located between SOD1 (YJR 104C) and ADO1 (YJR 105W) on chromosome X of s.cerevisiae reference strain RX 11: sc_INT95B_flanking 5 (shown by SEQ ID NO: 86) and Sc_INT95B_flanking 3 (shown by SEQ ID NO: 87).
Diagnostic PCR can be performed to confirm proper assembly and integration of the facilitation TKL1 expression cassette at the INT95 locus. Plasmid-free colonies were then selected and this resulted in a new strain NX12 containing two copies of the facilitation TKL1 expression cassette (see table 17 for detailed genotypes).
Example 10: fermentation (prophetic)
Precultures of the above new "NX12" strain were prepared as follows: glycerol stock (-80 ℃) was thawed at room temperature and used to inoculate 0.2L mineral medium supplemented with 2% (w/v) glucose at pH 6.0 (adjusted with 2MH2SO4/4N KOH) in a baffle-free 0.5L shake flask [ as described in: luttik, MLH et al (2000) "The Saccharomyces cerevisiae ICL2 Gene Encodes a Mitochondrial 2-Methylisocitrate Lyase Involved in Propionyl-Coenzyme A Metabolism" [ "Saccharomyces cerevisiae ICL2 gene encodes mitochondrial 2-methyl isocitrate lyase involved in propionyl-CoA metabolism" ].J.Bacteriol. [ J.bacteriology ]182:7007-13]. The preculture was incubated at 32℃for 18 hours and shaken at 200 RPM. After estimating the CDW by OD600 measurement (using the existing yeast Cell Dry Weight (CDW) versus OD600 calibration curve), the preculture was centrifuged (3 min,530 x g) in an amount corresponding to the 0.5g CDW/L inoculum concentration required for proliferation, washed once with sterile demineralized water of a sample volume, centrifuged once again, and resuspended in proliferation medium.
Proliferation of the above "NX12" strain can be performed as follows: the propagation step was performed in 500mL shake flasks using 100mL of filtered and diluted corn mash (70% v/v corn mash: 30% v/v water) supplemented with 1.25g/L urea and the following antibiotics: the final concentrations were 50. Mu.g/mL and 100. Mu.g/mL neomycin and penicillin G, respectively. After all additions, the pH was adjusted to 5.0 using 2M H2SO4/4N KOH. Glucoamylase was administered at a concentration of 0.1mL/L at the beginning of proliferationT, novelin). All strains were allowed to proliferate at 32℃for 6 hours and were shaken at 200 RPM.
The main fermentation of the above "NX12" strain can be performed as follows: while shaking at 140rpm and 32 ℃, a pressure recording/releasing cap (angon technologies, U.S. new york) was equippedAbout state Ma Xideng) in a 500ml Schott bottle using 200ml of medium. The pH was not controlled during fermentation. Fermentation was performed with corn mash with an increased dry solids content of 36% w/w DS. Subsequently, the corn mash was supplemented with 1.0g/L urea and the following antibiotics: neomycin and penicillin G at final concentrations of 50 μg/mL and 100 μg/mL, respectively; antifoam (bassinet, approximately 0.5 mL/L). After all additions, the pH was adjusted to 5.0 using 2M H2SO4/4N KOH. Glucoamylase was administered at a concentration of 0.24mL/L at the beginning of fermentation T, novelin). The amount of yeast added required from propagation to fermentation was 1.5% of the fermentation volume. All strains were tested under high solids (i.e. 36% w/w DS).
Sampling of the fermentation can be performed as follows: samples were taken from the primary fermentation only. Samples for HPLC analysis were collected at 18, 24, 42, 48 and 66 hours. The ethanol yield (g/l) at each time point and the remaining glucose concentration (g/l) at each time point can be analyzed.
The conclusion may be as follows: the remaining glucose concentration is an indicator of the robustness of the yeast strain. Glucose is continuously produced due to the presence of glucoamylase. Without wishing to be bound by any type of theory, it is believed that less robust strains (such as reference strain RX 11) will become more inhibited near the end of the fermentation, and therefore will identify higher concentrations of unconverted glucose in the sample. More robust strains (such as NX 12) will become less inhibited near the end of the fermentation, and therefore will identify lower concentrations of unconverted glucose in the sample.
EXAMPLE 11 construction of reference Strain IMZ132 (i.e., reference Strain RX 13) expressing an Acetylaldehyde dehydrogenase
WO 2011/010923 describes a strain IMZ132 expressing an acetylating acetaldehyde dehydrogenase, further referred to herein as reference strain RX13. Strain IMZ132 can be constructed as described in WO 2011/010923 (incorporated herein by reference). In addition, strain IMZ132 was deposited at the netherlands collection of microorganisms and cell cultures (Centraalbureau voor Schimmelcultures) at 7.16, 2009 under accession number CBS125049.
Table 18: saccharomyces cerevisiae strain expressing acetylating acetaldehyde dehydrogenase
Example 12: construction of New Strain NX14 (prophetic according to the invention)
The new strain NX14 can be constructed by transforming the reference strain RX13 (IMZ 132 as described in WO 2011/010923) as follows:
a DNA fragment was compiled comprising the Saccharomyces cerevisiae ANB1 promoter (shown by SEQ ID NO: 23), the Pichia pastoris TKL1 gene (shown by SEQ ID NO: 18) and the Saccharomyces cerevisiae TDH1 terminator. This DNA fragment was designated "fragment A" (shown by SEQ ID NO: 73). DNA fragment A was assembled using Golden Gate clones (as described, for example, in Engler et al, "Generation of Families of Construct Variants Using Golden Gate Shuffling" [ "use Golden Gate shuffling to generate construct variant family" ], (2011), published in Chaofu Lu et al (eds.), cDNA Libraies: methods and Applications, methods in Molecular Biology [ cDNA library: methods and applications, methods of molecular biology ], volume 729, chapter 11, pages 167-180, incorporated herein by reference). Using CRISPR-Cas9 and INT95 protospacers (shown by SEQ ID NO: 85) the following two sequences for homologous integration, the expression cassette can be integrated into the INT95 locus located between SOD1 (YJR 104C) and ADO1 (YJR 105W) on chromosome X of saccharomyces cerevisiae reference strain RX 13: sc_INT95B_flanking 5 (shown by SEQ ID NO: 86) and Sc_INT95B_flanking 3 (shown by SEQ ID NO: 87).
Diagnostic PCR can be performed to confirm proper assembly and integration of the facilitation TKL1 expression cassette at the INT95 locus. Plasmid-free colonies were then selected and this resulted in a new strain NX14 containing two copies of the facilitation TKL1 expression cassette (see table 18 for detailed genotypes).
Example 13: fermentation(prophetic)
Precultures of the above new "NX14" strain were prepared as follows: glycerol stock (-80 ℃) was thawed at room temperature and used to inoculate 0.2L mineral medium supplemented with 2% (w/v) glucose at pH 6.0 (regulated with 2MH2SO4/4N KOH) in a baffle-free 0.5L shake flask (as described in Luttik, mlh et al (2000) "The Saccharomyces cerevisiae ICL2 Gene Encodes a Mitochondrial 2-Methylisocitrate Lyase Involved in Propionyl-Coenzyme A Metabolism" [ "saccharomyces cerevisiae ICL2 gene encodes mitochondrial 2-methyl isocitrate lyase involved in propionyl coa metabolism" ].j.bacteriol. 182:7007-13). The preculture was incubated at 32℃for 18 hours and shaken at 200 RPM. After estimating the CDW by OD600 measurement (using the existing yeast Cell Dry Weight (CDW) versus OD600 calibration curve), the preculture was centrifuged (3 min,530 x g) in an amount corresponding to the 0.5g CDW/L inoculum concentration required for proliferation, washed once with sterile demineralized water of a sample volume, centrifuged once again, and resuspended in proliferation medium.
Proliferation of the above "NX14" strain can be performed as follows: the propagation step was performed in 500mL shake flasks using 100mL of filtered and diluted corn mash (70% v/v corn mash: 30% v/v water) supplemented with 1.25g/L urea and the following antibiotics: the final concentrations were 50. Mu.g/mL and 100. Mu.g/mL neomycin and penicillin G, respectively. After all additions, the pH was adjusted to 5.0 using 2M H2SO4/4N KOH. Glucoamylase was administered at a concentration of 0.1mL/L at the beginning of proliferationT, novelin). All strains were allowed to proliferate at 32℃for 6 hours and were shaken at 200 RPM.
The main fermentation of the above "NX14" strain can be performed as follows: the main fermentation step was performed using 200ml of medium in a 500ml Schott bottle equipped with a pressure recording/release cap (anka technologies, ma Xideng, new york, usa) while shaking at 140rpm and applying a temperature of 32 ℃. The pH was not controlled during fermentation. Hair with corn mash with increased dry solids content of 36% w/w DSAnd (5) fermenting. Subsequently, the corn mash was supplemented with 1.0g/L urea and the following antibiotics: neomycin and penicillin G at final concentrations of 50 μg/mL and 100 μg/mL, respectively; antifoam (bassinet, approximately 0.5 mL/L). After all additions, the pH was adjusted to 5.0 using 2M H2SO4/4N KOH. Glucoamylase was administered at a concentration of 0.24mL/L at the beginning of fermentation T, novelin). The amount of yeast added required from propagation to fermentation was 1.5% of the fermentation volume. All strains were tested under high solids (i.e. 36% w/w DS).
Sampling of the fermentation can be performed as follows: samples were taken from the primary fermentation only. Samples for HPLC analysis were collected at 18, 24, 42, 48 and 66 hours. The ethanol yield (g/l) at each time point and the remaining glucose concentration (g/l) at each time point can be analyzed.
The conclusion may be as follows: the remaining glucose concentration is an indicator of the robustness of the yeast strain. Glucose is continuously produced due to the presence of glucoamylase. Without wishing to be bound by any type of theory, it is believed that less robust strains (such as reference strain RX 13) will become more inhibited near the end of the fermentation, and therefore will identify higher concentrations of unconverted glucose in the sample. More robust strains (such as NX 14) will become less inhibited near the end of the fermentation, and therefore will identify lower concentrations of unconverted glucose in the sample.
Reference to the literature
Entian KD,P.Yeast genetic strain and plasmid collections.Method Microbiol.2007;629-66.
Nijkamp JF,van den Broek M,Datema E,de Kok S,Bosman L,Luttik MA,Daran-Lapujade P,Vongsangnak W,Nielsen J,Heijne WHM,Klaassen P,Paddon CJ,Platt D,P,van Ham RC,Reinders MJT,Pronk JT,de Ridder D,Daran J-M.De novo sequencing,assembly and analysis of the genome of the laboratory strain Saccharomyces cerevisiae CEN.PK113-7D,a model for modern industrial biotechnology.Microb Cell Fact.2012;11:36.
Verduyn C,Postma E,Scheffers WA,van Dijken JP.Effect of benzoic acid on metabolic fluxes in yeasts:A continuous-culture study on the regulation of respiration and alcoholic fermentation.Yeast.1992;8:501-17.
Mans R,van Rossum HM,Wijsman M,Backx A,Kuijpers NG,van denBroek M,Daran-Lapujade P,Pronk JT,van Maris AJA,Daran J-M.CRISPR/Cas9:a molecular Swiss army knife for simultaneous introductionof multiple genetic modifications in Saccharomyces cerevisiae.FEMS YeastRes.2015;15:fov004.
DiCarlo JE,Norville JE,Mali P,Rios X,Aach J,Church GM.Genomeengineering in Saccharomyces cerevisiae using CRISPR-Cassystems.Nucleic Acids Res.2013;1-8.
Mikkelsen MD,Buron LD,Salomonsen B,Olsen CE,Hansen BG,Mortensen UH,Halkier BA.Microbial production of indolylglucosinolatethrough engineering of a multi-gene pathway in a versatile yeast expressionplatform.Metab Eng.2012;14:104-11.
Knijnenburg TA,Daran JM,van den Broek MA,Daran-Lapujade PA,de Winde JH,Pronk JT,Reinders MJ,Wessels LF.Combinatorial effects ofenvironmental parameters on transcriptional regulation in Saccharomycescerevisiae:A quantitative analysis of a compendium of chemostat-basedtranscriptome data.BMC Genomics.2009;10:53.
Mumberg D,Müller R,Funk M.Yeast vectors for the controlledexpression of heterologous proteins in different geneticbackgrounds.Gene.1995;156:119-22.
Gueldener U,Heinisch J,Koehler GJ,Voss D,Hegemann JH.A secondset of loxP marker cassettes for Cre-mediated multiple gene knockouts inbudding yeast.Nucleic Acids Res.2002;30:e23.
Guadalupe-Medina V,Wisselink H,Luttik M,de Hulster E,Daran J-M,Pronk JT,van Maris AJA.Carbon dioxide fixation by Calvin-Cycle enzymesimproves ethanol yield in yeast.Biotechnol Biofuels.2013;6:125.
Daniel Gietz R,Woods RA:Transformation of yeast by lithiumacetate/single-stranded carrier DNA/polyethylene glycol method.MethodsEnzymol.2002:87-96.
Solis-Escalante D,Kuijpers NGA,Bongaerts N,Bolat I,Bosman L,Pronk JT,Daran J-M,Daran-Lapujade P.amdSYM,a new dominantrecyclable marker cassette for Saccharomyces cerevisiae.FEMS Yeast Res.2013;13:126-39.
Guadalupe-Medina V,Almering MJH,van Maris AJA,PronkJT.Elimination of glycerol production in anaerobic cultures of aSaccharomyces cerevisiae strain engineered to use acetic acid as an electronacceptor.Appl Environ Microb.2010;76:190-5.
Papapetridis I,van Dijk M,Dobbe AP,Metz B,Pronk JT,van MarisAJA.Improving ethanol yield in acetate-reducing Saccharomyces cerevisiaeby cofactor engineering of 6-phosphogluconate dehydrogenase and deletionof ALD6.Microb Cell Fact.2016;15:1-16.
Heijnen JJ,van Dijken JP.In search of a thermodynamic description ofbiomass yields for the chemotrophic growth of microorganisms.BiotechnolBioeng.1992;39:833-58.
Postma E,Verduyn C,Scheffers WA,van Dijken JP.Enzymic analysisof the crabtree effect in glucose-limited chemostat cultures ofSaccharomyces cerevisiae.Appl Environ Microbiol.1989;55:468-77.
Verduyn C,Postma E,Scheffers WA,van Dijken JP.Physiology ofSaccharomyces cerevisiae in anaerobic glucose-limited chemostat cultures.JGen Microbiol.1990;136:395-403.
Kwast et al.Genomic Analysis of Anaerobically induced genes inSaccharomyces cerevisiae:Functional roles of ROX1 and other factors inmediating the anoxic response,2002,Journal of bacteriology vol 184,no1p250-265.
Keng,T.1992.HAP1 and ROX1 form a regulatory pathway in therepression of HEM13 transcription in Saccharomyces cerevisiae.Mol.Cell.Biol.12:2616-2623.
Labbe-Bois,R.,and P.Labbe.1990.Tetrapyrrole and heme biosynthesisin the yeast Saccharomyces cerevisiae,p.235-285.In H.A.Dailey(ed.),Biosynthesis of heme and chlorophylls.McGraw-Hill,New York,N.Y.
Zitomer,R.S.,and C.V.Lowry.1992.Regulation of gene expression byoxygen in Saccharomyces cerevisiae.Microbiol.Rev.56:1-11.
Zitomer,R.S.,P.Carrico,and J.Deckert.1997.Regulation of hypoxicgene expression in yeast.Kidney Int.51:507-513.
Cohen et al.,Induction and repression of DAN1 and the family ofanaerobic mannoprotein genes in Saccharomyces cerevisiae occurs through acomplex array of regulatory sites.Nucleic Acid Research,2001 Vol.29,No3,799-808
Ter Kinde and de Steensma,A microarray-assisted screen for potentialHap1 and Rox1 target genes in Saccharomyces cerevisiae,2002,Yeast 19:825-840.
Sertil et al.The DAN1 gene of S cerevisiae is regulated in parallel withthe hypoxic gene,but by a different mechanism,1997,Gene Vol 192,pag199-205.
Nissen et al.,"Anaerobic and aerobic batch cultivations ofSaccharomyces cerevisiae mutants impaired in glycerol Synthesis",(2000),Yeast,vol.16,pages 463-474.
Sambrook et al.,Molecular Cloning-A Laboratory Manual,2nd ed.,Vol.1-3(1989),published by Cold Spring Harbor Publishing.
Kruskal et al,"An overview of sequence comparison:Time warps,stringedits,and macromolecules",(1983),Society for Industrial and AppliedMathematics(SIAM),Vol 25,No.2,pages 201-237.
D.Sankoff and J.B.Kruskal,(ed.),Time warps,string edits andmacromolecules:the theory and practice of sequence comparison,pp.1-44Addison-Wesley Publishing Company.
Needleman et al"A General Method Applicable to the Search for Similarities in the Amino Acid Sequence of Two Proteins"(1970)J.Mol.Biol.Vol.48,pages 443-453.
Sherman,F.,et al.,Methods in Yeast Genetics,Cold Spring Harbor Laboratory(1986)
Rice et al,"EMBOSS:The European Molecular Biology Open Software Suite"(2000),Trends in Genetics vol.16,(6)pages 276-277,http:// emboss.bioinformatics.nl/.
Neves et al.,"Yeast orthologues associated with glycerol transport and metabolism",(2004),FEMS Yeast Res.Vol.5,pages 51-62.
Neves et al"New insights on glycerol transport in Saccharomyces cerevisiae",(2004),FEBS Letters 565(2004)160-162.
Kwast et al.,"Genomic Analysis of Anaerobically induced genes in Saccharomyces cerevisiae:Functional roles of ROX1 and other factors in mediating the anoxic response",(2002),Journal of bacteriology vol 184,no1 pages 250-265.
Molin et al(2003)"Dihydroxy-acetone kinases in Saccharomyces cerevisiae are involved in detoxification of dihydroxyacetone"(2003),J.Biol.Chem.,vol.278:pages 1415-1423.
Guadalupe-Medina et al.,"Carbon dioxide fixation by Calvin-Cycle enzymes improves ethanol yield in yeast",published in Biotechnol,Biofuels,2013,vol.6,p.125 onwards.
Yébenes et al.,“Chaperonins:two rings for folding”(2011),Trends in Biochemical Sciences,Vol.36,No.8,pages 424-432.
Zeilstra-Ryalls et al.,"The universally conserved GroE(Hsp60)chaperonins",published in Annu Rev Microbiol.(1991)vol.45,pages301-25.
Horwich et al.,"Two Families of Chaperonin:Physiology and Mechanism",(2007),Annu.Rev.Cell.Dev.Biol.Vol.23,pages 115-45.
Sonderegger et al.,"Metabolic Engineering of a PhosphoketolasePathway for Pentose Catabolism in Saccharomyces cerevisiae",(2004),Applied&Environmental Microbiology,vol.70(5),pages 2892-2897.
Membrillo-Hernandez et al.,"Evolution of the adhE Gene Product ofEscherichia coli from a Functional Reductase to a Dehydrogenase",(2000)J.Biol.Chem.275:pages 33869-33875.
Tamarit et al."Identification of the Major Oxidatively DamagedProteins in Escherichia coli Cells Exposed to Oxidative Stress"(1998)J.Biol.Chem.273:pages 3027-3032.
Smith et al."Purification,Properties,and Kinetic Mechanism ofCoenzyme A-Linked Aldehyde Dehydrogenase from Clostridium kluyveri"(1980)Arch.Biochem.Biophys.203:pages 663-675.
Toth et al."The ald Gene,Encoding a Coenzyme A-Acylating AldehydeDehydrogenase,Distinguishes Clostridium beijerinckii and Two OtherSolvent-Producing Clostridia from Clostridium acetobutylicum",(1999),Appl.Environ.Microbiol.65:pages 4973-4980.
Powlowski and Shingler"Genetics and biochemistry of phenoldegradation by Pseudomonas sp.CF600",(1994),Biodegradation vol.5,pages 219-236.
Shingler et al.,"Nucleotide Sequence and Functional Analysis of theComplete Phenol/3,4-Dimethylphenol Catabolic Pathway of Pseudomonassp.Strain CF600",(1992),J.Bacteriol.,Vol.174,pages 711-724.
Ferrandez et al.,"Genetic Characterization and Expression inHeterologous Hosts ofthe 3-(3-Hydroxyphenyl)Propionate CatabolicPathway of Escherichia coli K-12"(1997)J.Bacteriol.179:pages2573-2581.
Lutstorf and Megnet,"Multiple Forms of Alcohol Dehydrogenase inSaccharomyces Cerevisiae",(1968),Arch.Biochem.Biophys.,vol.126,pages 933-944.
Ciriacy,"Genetics of Alcohol Dehydrogenase in Saccharomycescerevisiae I.Isolation and genetic analysis of adh mutants",(1975),Mutat.Res.29,pages 315-326.
Engler et al.,"Generation of Families of Construct Variants UsingGolden Gate Shuffling",(2011),published in chapter 11 of Chaofu Lu et al.(eds.),cDNA Libraries:Methods and Applications,Methods in MolecularBiology,vol.729,pages 167-180.
DiCarlo et al.,"Genome engineering in Saccharomyces cerevisiaeusing CRISPR-Cas systems",(2013),Nucleic Acids Res Vol 41,pages4336-4343.
Sikorski and Hieter,"A System of Shuttle Vectors and Yeast HostStrains Designed for Efficient Manipulation of DNA in Saccharomycescerevisiae",(1989),Genetics,vol.122,pages 19-27
Claims (19)
1. A recombinant yeast cell that functionally expresses:
-a nucleic acid sequence encoding a native protein having transketolase activity (EC 2.2.1.1); and
-a nucleic acid sequence encoding a heterologous protein (EC 2.2.1.1) having transketolase activity.
2. The recombinant yeast cell of claim 1, wherein the heterologous protein having transketolase activity comprises or consists of an amino acid sequence having a percent identity with the amino acid sequence of the native protein having transketolase activity within the following ranges: in the range of equal to or greater than 30% to equal to or less than 80%, more preferably in the range of equal to or greater than 30% or equal to or greater than 35% to equal to or less than 75%, and most preferably in the range of equal to or greater than 35% to equal to or less than 70% or even equal to or less than 65%.
3. The recombinant yeast cell of claim 1 or claim 2,
wherein the heterologous protein having transketolase activity comprises or consists of:
-an amino acid sequence of SEQ ID NO. 3, SEQ ID NO. 4, SEQ ID NO. 5, SEQ ID NO. 6, SEQ ID NO. 7, SEQ ID NO. 8, SEQ ID NO. 9, SEQ ID NO. 10, SEQ ID NO. 11, SEQ ID NO. 12, SEQ ID NO. 13, SEQ ID NO. 14, SEQ ID NO. 15, SEQ ID NO. 16, SEQ ID NO. 17 or SEQ ID NO. 19; or alternatively
-a functional homolog of SEQ ID No. 3, SEQ ID No. 4, SEQ ID No. 5, SEQ ID No. 6, SEQ ID No. 7, SEQ ID No. 8, SEQ ID No. 9, SEQ ID No. 10, SEQ ID No. 11, SEQ ID No. 12, SEQ ID No. 13, SEQ ID No. 14, SEQ ID No. 15, SEQ ID No. 16, SEQ ID No. 17 or SEQ ID No. 19 having at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 75%, at least 80%, at least 85%, at least 95% identity to the amino acid sequence of SEQ ID No. 3, SEQ ID No. 4, SEQ ID No. 5, SEQ ID No. 6, SEQ ID No. 7, SEQ ID No. 8, SEQ ID No. 9, SEQ ID No. 10, SEQ ID No. 11, SEQ ID No. 12, SEQ ID No. 13, SEQ ID No. 14, SEQ ID No. 15, SEQ ID No. 16, SEQ ID No. 17 or SEQ ID No. 19; or alternatively
-SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, 6, 7, 8, 9, 10, 11, 12, 13, 15, 16, 17 or 19, which has one or more mutations, substitutions, insertions and/or deletions when compared to the amino acid sequence of SEQ ID NO:16, 17 or 19, more preferably when compared to SEQ ID NO:3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17 or 19, the amino acid sequence of No. 3, 5, 6, 10, 11, 14, 10, 15, 10, or 15, 10, or 15, or NO to the at which is at a non or a or a 5 a or a 5, or 5, 5, functional homologs of substitutions, insertions and/or deletions.
4. The recombinant yeast cell according to any one of claim 1 to 3,
wherein the native protein having transketolase activity comprises or consists of:
-the amino acid sequence of SEQ ID No. 1; or alternatively
-a functional homolog of SEQ ID No. 1 having at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% sequence identity to the amino acid sequence of SEQ ID No. 1; or alternatively
Functional homologs of SEQ ID NO. 1 having one or more mutations, substitutions, insertions and/or deletions when compared to the amino acid sequence of SEQ ID NO. 1, more preferably functional homologs having NO more than 300, NO more than 250, NO more than 200, NO more than 150, NO more than 100, NO more than 75, NO more than 50, NO more than 40, NO more than 30, NO more than 20, NO more than 10 or NO more than 5 amino acid mutations, substitutions, insertions and/or deletions when compared to the amino acid sequence of SEQ ID NO. 1.
5. The recombinant yeast cell according to any one of claim 1 to 4,
Wherein expression of the nucleic acid sequence encoding the heterologous protein having transketolase activity is under the control of a promoter ("TKL promoter") having an anaerobic/aerobic expression ratio of 2 or more for transketolase.
6. The recombinant yeast cell according to any one of claim 1 to 5,
wherein expression of the nucleic acid sequence encoding the native protein having transketolase activity is under the control of a promoter ("TKL promoter") having an anaerobic/aerobic expression ratio of 2 or more for transketolase.
7. The recombinant yeast cell of claim 5 or claim 6,
wherein the TKL promoter is a promoter of ANB1 and/or DAN 1.
8. The recombinant yeast cell according to any one of claim 1 to 7,
wherein the recombinant yeast cell comprises one or more genetic modifications for functionally expressing a protein that plays a role in a metabolic pathway that forms a non-native redox sink.
9. The recombinant yeast cell according to any one of claim 1 to 8,
wherein the recombinant yeast cell functionally expresses:
-a nucleic acid sequence encoding a protein having ribulose-1, 5-bisphosphate carboxylase oxygenase (Rubisco) activity; and/or
-a nucleic acid sequence encoding a protein having Phosphoribulokinase (PRK) activity; and/or
-optionally a nucleic acid sequence encoding one or more chaperones of said protein having ribulose-1, 5-bisphosphate carboxylase oxygenase (Rubisco) activity.
10. The recombinant yeast cell according to any one of claim 1 to 8,
wherein the recombinant yeast cell functionally expresses:
-a nucleic acid sequence encoding a protein comprising phosphoketolase activity (EC 4.1.2.9 or EC 4.1.2.22, pkl); and/or
-a nucleic acid sequence encoding a protein (EC 2.3.1.8) having Phosphotransacetylase (PTA) activity; and/or
-a nucleic acid sequence encoding a protein having acetate kinase (ACK) activity (EC 2.7.2.12).
11. The recombinant yeast cell according to any one of claim 1 to 8,
wherein the recombinant yeast cell functionally expresses a nucleic acid sequence encoding a protein comprising nad+ -dependent acetylating acetaldehyde dehydrogenase activity (EC 1.2.1.10).
12. The recombinant yeast cell according to any one of claim 1 to 8,
wherein the recombinant yeast cell functionally expresses a nucleic acid sequence encoding an enzyme having NADH-dependent nitrate reductase activity and/or a nucleic acid sequence encoding an enzyme having NADH-dependent nitrite reductase activity.
13. The recombinant yeast cell of claim 12, wherein the recombinant yeast cell further functionally expresses a nucleic acid sequence encoding an enzyme having nitrate and/or nitrite transporter activity.
14. The recombinant yeast cell according to any one of claim 1 to 13,
wherein the recombinant yeast cell further comprises a deletion or disruption of a nucleic acid sequence encoding a protein having glycerol-3-phosphate dehydrogenase (GPD) activity and/or a nucleic acid sequence encoding a protein having glycerophosphate phosphatase (GPP) activity.
15. The recombinant yeast cell according to any one of claim 1 to 14,
wherein the recombinant yeast cell further functionally expresses:
-a nucleic acid sequence encoding a protein having glycerol dehydrogenase activity (e.c. 1.1.1.6);
-a nucleic acid sequence encoding a protein having dihydroxyacetone kinase activity (e.c. 2.7.1.28 or e.c. 2.7.1.29); and/or
-optionally a nucleic acid sequence encoding a protein having glycerol transporter activity.
16. The recombinant yeast cell according to any one of claim 1 to 15,
wherein the recombinant yeast cell further functionally expresses a nucleic acid sequence encoding a protein having glucoamylase activity (EC 3.2.1.20 or 3.2.1.3).
17. A method for producing ethanol, the method comprising transforming a carbon source, preferably a carbohydrate, using a recombinant yeast cell according to any one of claims 1 to 16.
18. The method of claim 17, wherein the method is performed at least in part in a medium comprising glucose at the following glucose concentrations: 25g/L or higher, 30g/L or higher, 35g/L or higher, 40g/L or higher, 45g/L or higher, 50g/L or higher, 55g/L or higher, 60g/L or higher, 65g/L or higher, 70g/L or higher, 75g/L or higher, 80g/L or higher, 85g/L or higher, 90g/L or higher, 95g/L or higher, 100g/L or higher, 110g/L or higher, or 120g/L or higher.
19. The method of claim 17 or claim 18, wherein the method is performed at least in part in the presence of a glycosylase such as a glucoamylase.
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202163220896P | 2021-07-12 | 2021-07-12 | |
EP21185148.0 | 2021-07-12 | ||
US63/220,896 | 2021-07-12 | ||
PCT/EP2022/068917 WO2023285280A1 (en) | 2021-07-12 | 2022-07-07 | Recombinant yeast cell |
Publications (1)
Publication Number | Publication Date |
---|---|
CN117897490A true CN117897490A (en) | 2024-04-16 |
Family
ID=90643281
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202280059116.5A Pending CN117897490A (en) | 2021-07-12 | 2022-07-07 | Recombinant yeast cells |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN117897490A (en) |
-
2022
- 2022-07-07 CN CN202280059116.5A patent/CN117897490A/en active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP3638770B1 (en) | Recombinant yeast cell | |
WO2018172328A1 (en) | Improved glycerol free ethanol production | |
EP3298133A1 (en) | Acetate consuming yeast cell | |
EP3359655B1 (en) | Eukaryotic cell with increased production of fermentation product | |
EP4370651A1 (en) | Recombinant yeast cell | |
WO2021089877A1 (en) | Process for producing ethanol | |
US11414683B2 (en) | Acetic acid consuming strain | |
CN117897490A (en) | Recombinant yeast cells | |
CN117940571A (en) | Recombinant yeast cells | |
CN117881773A (en) | Recombinant yeast cells | |
EP4370689A1 (en) | Recombinant yeast cell | |
EP4370692A1 (en) | Recombinant yeast cell | |
CN117916381A (en) | Recombinant yeast cells | |
CN117940570A (en) | Recombinant yeast cells | |
US20230374443A1 (en) | Saccharomyces yeast cell and fermentation process using such | |
CN118056011A (en) | Recombinant yeast cells | |
CN118176296A (en) | Recombinant yeast cells | |
WO2023208762A2 (en) | Mutant yeast cell and process for the production of ethanol |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |