CN117396608A - 通过在酵母中过表达kgd2增加乙醇生产 - Google Patents
通过在酵母中过表达kgd2增加乙醇生产 Download PDFInfo
- Publication number
- CN117396608A CN117396608A CN202280033093.0A CN202280033093A CN117396608A CN 117396608 A CN117396608 A CN 117396608A CN 202280033093 A CN202280033093 A CN 202280033093A CN 117396608 A CN117396608 A CN 117396608A
- Authority
- CN
- China
- Prior art keywords
- cell
- kgd2
- modified
- fold
- polypeptide
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 title claims abstract description 122
- 102100031920 Dihydrolipoyllysine-residue succinyltransferase component of 2-oxoglutarate dehydrogenase complex, mitochondrial Human genes 0.000 title claims description 78
- 101000992065 Homo sapiens Dihydrolipoyllysine-residue succinyltransferase component of 2-oxoglutarate dehydrogenase complex, mitochondrial Proteins 0.000 title claims description 78
- 238000004519 manufacturing process Methods 0.000 title claims description 54
- 240000004808 Saccharomyces cerevisiae Species 0.000 title description 60
- 230000002018 overexpression Effects 0.000 title description 22
- 238000000034 method Methods 0.000 claims abstract description 48
- 210000005253 yeast cell Anatomy 0.000 claims abstract description 39
- 239000000758 substrate Substances 0.000 claims abstract description 12
- 102000006589 Alpha-ketoglutarate dehydrogenase Human genes 0.000 claims abstract description 8
- 108020004306 Alpha-ketoglutarate dehydrogenase Proteins 0.000 claims abstract description 8
- 210000004027 cell Anatomy 0.000 claims description 114
- 108090000623 proteins and genes Proteins 0.000 claims description 91
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 83
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 83
- 229920001184 polypeptide Polymers 0.000 claims description 82
- 230000014509 gene expression Effects 0.000 claims description 71
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 claims description 54
- 101000878916 Homo sapiens Uncharacterized protein C17orf80 Proteins 0.000 claims description 42
- 102100037950 Uncharacterized protein C17orf80 Human genes 0.000 claims description 42
- 238000000855 fermentation Methods 0.000 claims description 35
- 230000004151 fermentation Effects 0.000 claims description 33
- 150000007523 nucleic acids Chemical class 0.000 claims description 21
- 230000037361 pathway Effects 0.000 claims description 20
- 102000004190 Enzymes Human genes 0.000 claims description 17
- 108090000790 Enzymes Proteins 0.000 claims description 17
- 108020004999 messenger RNA Proteins 0.000 claims description 16
- 102000039446 nucleic acids Human genes 0.000 claims description 16
- 108020004707 nucleic acids Proteins 0.000 claims description 16
- 101150022075 ADR1 gene Proteins 0.000 claims description 14
- 230000004077 genetic alteration Effects 0.000 claims description 14
- 231100000118 genetic alteration Toxicity 0.000 claims description 14
- 108091026890 Coding region Proteins 0.000 claims description 13
- ZSLZBFCDCINBPY-ZSJPKINUSA-N acetyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 ZSLZBFCDCINBPY-ZSJPKINUSA-N 0.000 claims description 11
- 241000235070 Saccharomyces Species 0.000 claims description 7
- 150000001720 carbohydrates Chemical class 0.000 claims description 7
- 238000012545 processing Methods 0.000 claims description 5
- 101100518180 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) KGD2 gene Proteins 0.000 claims description 3
- 230000008859 change Effects 0.000 claims description 2
- 230000002103 transcriptional effect Effects 0.000 claims description 2
- 239000000203 mixture Substances 0.000 abstract description 13
- 229920002472 Starch Polymers 0.000 abstract description 2
- 238000011031 large-scale manufacturing process Methods 0.000 abstract description 2
- 235000019698 starch Nutrition 0.000 abstract description 2
- 239000008107 starch Substances 0.000 abstract description 2
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 59
- 102000004169 proteins and genes Human genes 0.000 description 41
- QTBSBXVTEAMEQO-UHFFFAOYSA-N acetic acid Substances CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 19
- 229940088598 enzyme Drugs 0.000 description 16
- 101100508247 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) SUI3 gene Proteins 0.000 description 15
- 108020004414 DNA Proteins 0.000 description 14
- 230000000694 effects Effects 0.000 description 14
- 230000037430 deletion Effects 0.000 description 12
- 238000012217 deletion Methods 0.000 description 12
- 125000003275 alpha amino acid group Chemical group 0.000 description 11
- LRHPLDYGYMQRHN-UHFFFAOYSA-N N-Butanol Chemical compound CCCCO LRHPLDYGYMQRHN-UHFFFAOYSA-N 0.000 description 10
- ZXEKIIBDNHEJCQ-UHFFFAOYSA-N isobutanol Chemical compound CC(C)CO ZXEKIIBDNHEJCQ-UHFFFAOYSA-N 0.000 description 10
- 108010004621 phosphoketolase Proteins 0.000 description 10
- 230000004048 modification Effects 0.000 description 9
- 238000012986 modification Methods 0.000 description 9
- 239000000047 product Substances 0.000 description 9
- 108091028043 Nucleic acid sequence Proteins 0.000 description 8
- 150000001413 amino acids Chemical class 0.000 description 8
- 238000004458 analytical method Methods 0.000 description 8
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 8
- 238000003559 RNA-seq method Methods 0.000 description 7
- 150000001298 alcohols Chemical class 0.000 description 7
- 229910052799 carbon Inorganic materials 0.000 description 7
- 230000037353 metabolic pathway Effects 0.000 description 7
- 238000006467 substitution reaction Methods 0.000 description 7
- QTBSBXVTEAMEQO-UHFFFAOYSA-M Acetate Chemical compound CC([O-])=O QTBSBXVTEAMEQO-UHFFFAOYSA-M 0.000 description 6
- 101001072574 Homo sapiens Glycerol-3-phosphate dehydrogenase [NAD(+)], cytoplasmic Proteins 0.000 description 6
- 101100459905 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) NCP1 gene Proteins 0.000 description 6
- 101150046305 cpr-1 gene Proteins 0.000 description 6
- 238000004128 high performance liquid chromatography Methods 0.000 description 6
- 230000035772 mutation Effects 0.000 description 6
- 108091033319 polynucleotide Proteins 0.000 description 6
- 102000040430 polynucleotide Human genes 0.000 description 6
- 239000002157 polynucleotide Substances 0.000 description 6
- 239000000126 substance Substances 0.000 description 6
- 108700028369 Alleles Proteins 0.000 description 5
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 5
- 108020004705 Codon Proteins 0.000 description 5
- 102100036669 Glycerol-3-phosphate dehydrogenase [NAD(+)], cytoplasmic Human genes 0.000 description 5
- -1 MAL23C Proteins 0.000 description 5
- 108090000637 alpha-Amylases Proteins 0.000 description 5
- 230000009286 beneficial effect Effects 0.000 description 5
- 235000014633 carbohydrates Nutrition 0.000 description 5
- 230000001413 cellular effect Effects 0.000 description 5
- 230000004907 flux Effects 0.000 description 5
- 239000012634 fragment Substances 0.000 description 5
- 239000000446 fuel Substances 0.000 description 5
- 230000006870 function Effects 0.000 description 5
- 238000010353 genetic engineering Methods 0.000 description 5
- 238000003780 insertion Methods 0.000 description 5
- 230000037431 insertion Effects 0.000 description 5
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 4
- 101150020386 EFB1 gene Proteins 0.000 description 4
- 102100022624 Glucoamylase Human genes 0.000 description 4
- AMIMRNSIRUDHCM-UHFFFAOYSA-N Isopropylaldehyde Chemical compound CC(C)C=O AMIMRNSIRUDHCM-UHFFFAOYSA-N 0.000 description 4
- 101100500655 Methanopyrus kandleri (strain AV19 / DSM 6324 / JCM 9639 / NBRC 100938) ef1b gene Proteins 0.000 description 4
- 102000004139 alpha-Amylases Human genes 0.000 description 4
- 238000003556 assay Methods 0.000 description 4
- GNGACRATGGDKBX-UHFFFAOYSA-N dihydroxyacetone phosphate Chemical compound OCC(=O)COP(O)(O)=O GNGACRATGGDKBX-UHFFFAOYSA-N 0.000 description 4
- 230000012010 growth Effects 0.000 description 4
- 239000011159 matrix material Substances 0.000 description 4
- 238000002703 mutagenesis Methods 0.000 description 4
- 231100000350 mutagenesis Toxicity 0.000 description 4
- 229920000642 polymer Polymers 0.000 description 4
- 238000013518 transcription Methods 0.000 description 4
- 230000035897 transcription Effects 0.000 description 4
- 230000004102 tricarboxylic acid cycle Effects 0.000 description 4
- QHKABHOOEWYVLI-UHFFFAOYSA-N 3-methyl-2-oxobutanoic acid Chemical compound CC(C)C(=O)C(O)=O QHKABHOOEWYVLI-UHFFFAOYSA-N 0.000 description 3
- 241000235349 Ascomycota Species 0.000 description 3
- 101100178170 Caenorhabditis elegans unc-39 gene Proteins 0.000 description 3
- 101100284219 Candida albicans (strain SC5314 / ATCC MYA-2876) GZF3 gene Proteins 0.000 description 3
- 102100031593 DNA-directed RNA polymerase I subunit RPA1 Human genes 0.000 description 3
- 241000233866 Fungi Species 0.000 description 3
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 3
- 101000892220 Geobacillus thermodenitrificans (strain NG80-2) Long-chain-alcohol dehydrogenase 1 Proteins 0.000 description 3
- 108010073178 Glucan 1,4-alpha-Glucosidase Proteins 0.000 description 3
- 101000729474 Homo sapiens DNA-directed RNA polymerase I subunit RPA1 Proteins 0.000 description 3
- 101150046187 MRS6 gene Proteins 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- 102000004316 Oxidoreductases Human genes 0.000 description 3
- 108090000854 Oxidoreductases Proteins 0.000 description 3
- 108700023175 Phosphate acetyltransferases Proteins 0.000 description 3
- 108010011939 Pyruvate Decarboxylase Proteins 0.000 description 3
- 101100442138 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) DAL80 gene Proteins 0.000 description 3
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 3
- 108010081577 aldehyde dehydrogenase (NAD(P)+) Proteins 0.000 description 3
- 229940024171 alpha-amylase Drugs 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 230000001276 controlling effect Effects 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 238000012224 gene deletion Methods 0.000 description 3
- 238000012239 gene modification Methods 0.000 description 3
- 230000002068 genetic effect Effects 0.000 description 3
- 230000005017 genetic modification Effects 0.000 description 3
- 235000013617 genetically modified food Nutrition 0.000 description 3
- 238000002360 preparation method Methods 0.000 description 3
- 230000001105 regulatory effect Effects 0.000 description 3
- 108010026333 seryl-proline Proteins 0.000 description 3
- JTEYKUFKXGDTEU-UHFFFAOYSA-N 2,3-dihydroxy-3-methylbutanoic acid Chemical compound CC(C)(O)C(O)C(O)=O JTEYKUFKXGDTEU-UHFFFAOYSA-N 0.000 description 2
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 2
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 2
- 241000122821 Aspergillus kawachii Species 0.000 description 2
- 101100448223 Caenorhabditis elegans gcy-1 gene Proteins 0.000 description 2
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 2
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 2
- 102100030395 Glycerol-3-phosphate dehydrogenase, mitochondrial Human genes 0.000 description 2
- 102000000587 Glycerolphosphate Dehydrogenase Human genes 0.000 description 2
- 108010041921 Glycerolphosphate Dehydrogenase Proteins 0.000 description 2
- 108010015895 Glycerone kinase Proteins 0.000 description 2
- 101001009678 Homo sapiens Glycerol-3-phosphate dehydrogenase, mitochondrial Proteins 0.000 description 2
- 102000004195 Isomerases Human genes 0.000 description 2
- 108090000769 Isomerases Proteins 0.000 description 2
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 2
- 241000235649 Kluyveromyces Species 0.000 description 2
- 241000880493 Leptailurus serval Species 0.000 description 2
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 2
- PLDJDCJLRCYPJB-VOAKCMCISA-N Lys-Lys-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PLDJDCJLRCYPJB-VOAKCMCISA-N 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 2
- AMQJEAYHLZJPGS-UHFFFAOYSA-N N-Pentanol Chemical compound CCCCCO AMQJEAYHLZJPGS-UHFFFAOYSA-N 0.000 description 2
- 108091005804 Peptidases Proteins 0.000 description 2
- 108010059820 Polygalacturonase Proteins 0.000 description 2
- 239000004365 Protease Substances 0.000 description 2
- LCTONWCANYUPML-UHFFFAOYSA-N Pyruvic acid Chemical compound CC(=O)C(O)=O LCTONWCANYUPML-UHFFFAOYSA-N 0.000 description 2
- 241000235346 Schizosaccharomyces Species 0.000 description 2
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 2
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 2
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 2
- KZURUCDWKDEAFZ-XVSYOHENSA-N Thr-Phe-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O KZURUCDWKDEAFZ-XVSYOHENSA-N 0.000 description 2
- 241000499912 Trichoderma reesei Species 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 2
- 230000000397 acetylating effect Effects 0.000 description 2
- 239000008186 active pharmaceutical agent Substances 0.000 description 2
- 230000004075 alteration Effects 0.000 description 2
- 125000000539 amino acid group Chemical group 0.000 description 2
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 2
- 108010038633 aspartylglutamate Proteins 0.000 description 2
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 2
- 235000013405 beer Nutrition 0.000 description 2
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 2
- 230000033228 biological regulation Effects 0.000 description 2
- 150000001721 carbon Chemical group 0.000 description 2
- 239000002299 complementary DNA Substances 0.000 description 2
- 230000009260 cross reactivity Effects 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 230000002255 enzymatic effect Effects 0.000 description 2
- 108010093305 exopolygalacturonase Proteins 0.000 description 2
- 108010008221 formate C-acetyltransferase Proteins 0.000 description 2
- 108020001507 fusion proteins Proteins 0.000 description 2
- 102000037865 fusion proteins Human genes 0.000 description 2
- GVVPGTZRZFNKDS-JXMROGBWSA-N geranyl diphosphate Chemical compound CC(C)=CCC\C(C)=C\CO[P@](O)(=O)OP(O)(O)=O GVVPGTZRZFNKDS-JXMROGBWSA-N 0.000 description 2
- 239000008103 glucose Substances 0.000 description 2
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 2
- 230000001900 immune effect Effects 0.000 description 2
- 239000000543 intermediate Substances 0.000 description 2
- 108010005942 methionylglycine Proteins 0.000 description 2
- 150000002894 organic compounds Chemical class 0.000 description 2
- 229910052760 oxygen Inorganic materials 0.000 description 2
- 239000001301 oxygen Substances 0.000 description 2
- JYVLIDXNZAXMDK-UHFFFAOYSA-N pentan-2-ol Chemical compound CCCC(C)O JYVLIDXNZAXMDK-UHFFFAOYSA-N 0.000 description 2
- 230000000750 progressive effect Effects 0.000 description 2
- BDERNNFJNOPAEC-UHFFFAOYSA-N propan-1-ol Chemical compound CCCO BDERNNFJNOPAEC-UHFFFAOYSA-N 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 235000000346 sugar Nutrition 0.000 description 2
- 108010061238 threonyl-glycine Proteins 0.000 description 2
- 235000014101 wine Nutrition 0.000 description 2
- NMDWGEGFJUBKLB-YFKPBYRVSA-N (2S)-2-hydroxy-2-methyl-3-oxobutanoic acid Chemical compound CC(=O)[C@](C)(O)C(O)=O NMDWGEGFJUBKLB-YFKPBYRVSA-N 0.000 description 1
- NTUPOKHATNSWCY-PMPSAXMXSA-N (2s)-2-[[(2s)-1-[(2r)-2-amino-3-phenylpropanoyl]pyrrolidine-2-carbonyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound C([C@@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=CC=C1 NTUPOKHATNSWCY-PMPSAXMXSA-N 0.000 description 1
- FQVLRGLGWNWPSS-BXBUPLCLSA-N (4r,7s,10s,13s,16r)-16-acetamido-13-(1h-imidazol-5-ylmethyl)-10-methyl-6,9,12,15-tetraoxo-7-propan-2-yl-1,2-dithia-5,8,11,14-tetrazacycloheptadecane-4-carboxamide Chemical compound N1C(=O)[C@@H](NC(C)=O)CSSC[C@@H](C(N)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)NC(=O)[C@@H]1CC1=CN=CN1 FQVLRGLGWNWPSS-BXBUPLCLSA-N 0.000 description 1
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 1
- PKAUICCNAWQPAU-UHFFFAOYSA-N 2-(4-chloro-2-methylphenoxy)acetic acid;n-methylmethanamine Chemical compound CNC.CC1=CC(Cl)=CC=C1OCC(O)=O PKAUICCNAWQPAU-UHFFFAOYSA-N 0.000 description 1
- WTLNOANVTIKPEE-UHFFFAOYSA-N 2-acetyloxypropanoic acid Chemical compound OC(=O)C(C)OC(C)=O WTLNOANVTIKPEE-UHFFFAOYSA-N 0.000 description 1
- KPGXRSRHYNQIFN-UHFFFAOYSA-N 2-oxoglutaric acid Chemical compound OC(=O)CCC(=O)C(O)=O KPGXRSRHYNQIFN-UHFFFAOYSA-N 0.000 description 1
- 108010011619 6-Phytase Proteins 0.000 description 1
- 108010000700 Acetolactate synthase Proteins 0.000 description 1
- 102100026026 Acyl-CoA synthetase short-chain family member 3, mitochondrial Human genes 0.000 description 1
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 1
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 1
- CSAHOYQKNHGDHX-ACZMJKKPSA-N Ala-Gln-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CSAHOYQKNHGDHX-ACZMJKKPSA-N 0.000 description 1
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 1
- JPGBXANAQYHTLA-DRZSPHRISA-N Ala-Gln-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JPGBXANAQYHTLA-DRZSPHRISA-N 0.000 description 1
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 1
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 1
- AWNAEZICPNGAJK-FXQIFTODSA-N Ala-Met-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O AWNAEZICPNGAJK-FXQIFTODSA-N 0.000 description 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 1
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- 102000007698 Alcohol dehydrogenase Human genes 0.000 description 1
- 108010021809 Alcohol dehydrogenase Proteins 0.000 description 1
- 102100034035 Alcohol dehydrogenase 1A Human genes 0.000 description 1
- 102100034044 All-trans-retinol dehydrogenase [NAD(+)] ADH1B Human genes 0.000 description 1
- 101710193111 All-trans-retinol dehydrogenase [NAD(+)] ADH4 Proteins 0.000 description 1
- 101100492681 Arabidopsis thaliana ATE1 gene Proteins 0.000 description 1
- 101100423825 Arabidopsis thaliana SYP61 gene Proteins 0.000 description 1
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 1
- XVLLUZMFSAYKJV-GUBZILKMSA-N Arg-Asp-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XVLLUZMFSAYKJV-GUBZILKMSA-N 0.000 description 1
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 1
- RKRSYHCNPFGMTA-CIUDSAMLSA-N Arg-Glu-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O RKRSYHCNPFGMTA-CIUDSAMLSA-N 0.000 description 1
- LKDHUGLXOHYINY-XUXIUFHCSA-N Arg-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LKDHUGLXOHYINY-XUXIUFHCSA-N 0.000 description 1
- OKKMBOSPBDASEP-CYDGBPFRSA-N Arg-Ile-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O OKKMBOSPBDASEP-CYDGBPFRSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 1
- MTYLORHAQXVQOW-AVGNSLFASA-N Arg-Lys-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O MTYLORHAQXVQOW-AVGNSLFASA-N 0.000 description 1
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 description 1
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 1
- AMIQZQAAYGYKOP-FXQIFTODSA-N Arg-Ser-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O AMIQZQAAYGYKOP-FXQIFTODSA-N 0.000 description 1
- ADPACBMPYWJJCE-FXQIFTODSA-N Arg-Ser-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O ADPACBMPYWJJCE-FXQIFTODSA-N 0.000 description 1
- WOZDCBHUGJVJPL-AVGNSLFASA-N Arg-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WOZDCBHUGJVJPL-AVGNSLFASA-N 0.000 description 1
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 1
- MSBDSTRUMZFSEU-PEFMBERDSA-N Asn-Glu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MSBDSTRUMZFSEU-PEFMBERDSA-N 0.000 description 1
- OLISTMZJGQUOGS-GMOBBJLQSA-N Asn-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OLISTMZJGQUOGS-GMOBBJLQSA-N 0.000 description 1
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 1
- NLDNNZKUSLAYFW-NHCYSSNCSA-N Asn-Lys-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLDNNZKUSLAYFW-NHCYSSNCSA-N 0.000 description 1
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 1
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 description 1
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 1
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 1
- DBWYWXNMZZYIRY-LPEHRKFASA-N Asp-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O DBWYWXNMZZYIRY-LPEHRKFASA-N 0.000 description 1
- QRULNKJGYQQZMW-ZLUOBGJFSA-N Asp-Asn-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QRULNKJGYQQZMW-ZLUOBGJFSA-N 0.000 description 1
- CSEJMKNZDCJYGJ-XHNCKOQMSA-N Asp-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O CSEJMKNZDCJYGJ-XHNCKOQMSA-N 0.000 description 1
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 1
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 1
- HJCGDIGVVWETRO-ZPFDUUQYSA-N Asp-Lys-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O)C(O)=O HJCGDIGVVWETRO-ZPFDUUQYSA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- QTIZKMMLNUMHHU-DCAQKATOSA-N Asp-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QTIZKMMLNUMHHU-DCAQKATOSA-N 0.000 description 1
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 1
- BPAUXFVCSYQDQX-JRQIVUDYSA-N Asp-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)O)N)O BPAUXFVCSYQDQX-JRQIVUDYSA-N 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 241000221198 Basidiomycota Species 0.000 description 1
- 108700038091 Beta-glucanases Proteins 0.000 description 1
- 102100032487 Beta-mannosidase Human genes 0.000 description 1
- 108091033409 CRISPR Proteins 0.000 description 1
- 238000010354 CRISPR gene editing Methods 0.000 description 1
- 101150011071 CRZ1 gene Proteins 0.000 description 1
- 101100351264 Candida albicans (strain SC5314 / ATCC MYA-2876) PDC11 gene Proteins 0.000 description 1
- 108090000489 Carboxy-Lyases Proteins 0.000 description 1
- 102000004031 Carboxy-Lyases Human genes 0.000 description 1
- 108010084185 Cellulases Proteins 0.000 description 1
- 102000005575 Cellulases Human genes 0.000 description 1
- 229920002261 Corn starch Polymers 0.000 description 1
- KIHRUISMQZVCNO-ZLUOBGJFSA-N Cys-Asp-Asp Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KIHRUISMQZVCNO-ZLUOBGJFSA-N 0.000 description 1
- HMWBPUDETPKSSS-DCAQKATOSA-N Cys-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CCCCN)C(=O)O HMWBPUDETPKSSS-DCAQKATOSA-N 0.000 description 1
- SAEVTQWAYDPXMU-KATARQTJSA-N Cys-Thr-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O SAEVTQWAYDPXMU-KATARQTJSA-N 0.000 description 1
- 102100022768 D-beta-hydroxybutyrate dehydrogenase, mitochondrial Human genes 0.000 description 1
- 101150030825 DPB3 gene Proteins 0.000 description 1
- 101710088194 Dehydrogenase Proteins 0.000 description 1
- 108020005199 Dehydrogenases Proteins 0.000 description 1
- 108700016168 Dihydroxy-acid dehydratases Proteins 0.000 description 1
- 101710121765 Endo-1,4-beta-xylanase Proteins 0.000 description 1
- 108090000371 Esterases Proteins 0.000 description 1
- 102100028121 Fos-related antigen 2 Human genes 0.000 description 1
- 101150004714 GPP1 gene Proteins 0.000 description 1
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 1
- KWUSGAIFNHQCBY-DCAQKATOSA-N Gln-Arg-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O KWUSGAIFNHQCBY-DCAQKATOSA-N 0.000 description 1
- XOKGKOQWADCLFQ-GARJFASQSA-N Gln-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XOKGKOQWADCLFQ-GARJFASQSA-N 0.000 description 1
- ZNZPKVQURDQFFS-FXQIFTODSA-N Gln-Glu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZNZPKVQURDQFFS-FXQIFTODSA-N 0.000 description 1
- FFVXLVGUJBCKRX-UKJIMTQDSA-N Gln-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N FFVXLVGUJBCKRX-UKJIMTQDSA-N 0.000 description 1
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 1
- PAOHIZNRJNIXQY-XQXXSGGOSA-N Gln-Thr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PAOHIZNRJNIXQY-XQXXSGGOSA-N 0.000 description 1
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 1
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 1
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 1
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 1
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 1
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 1
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 1
- YVYVMJNUENBOOL-KBIXCLLPSA-N Glu-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N YVYVMJNUENBOOL-KBIXCLLPSA-N 0.000 description 1
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 1
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 1
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 1
- FMBWLLMUPXTXFC-SDDRHHMPSA-N Glu-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N)C(=O)O FMBWLLMUPXTXFC-SDDRHHMPSA-N 0.000 description 1
- CQAHWYDHKUWYIX-YUMQZZPRSA-N Glu-Pro-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O CQAHWYDHKUWYIX-YUMQZZPRSA-N 0.000 description 1
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 1
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 1
- MWTGQXBHVRTCOR-GLLZPBPUSA-N Glu-Thr-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MWTGQXBHVRTCOR-GLLZPBPUSA-N 0.000 description 1
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 1
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 1
- 108050008938 Glucoamylases Proteins 0.000 description 1
- RQZGFWKQLPJOEQ-YUMQZZPRSA-N Gly-Arg-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)CN)CN=C(N)N RQZGFWKQLPJOEQ-YUMQZZPRSA-N 0.000 description 1
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 1
- XEJTYSCIXKYSHR-WDSKDSINSA-N Gly-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN XEJTYSCIXKYSHR-WDSKDSINSA-N 0.000 description 1
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 1
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 1
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 1
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- HHRODZSXDXMUHS-LURJTMIESA-N Gly-Met-Gly Chemical compound CSCC[C@H](NC(=O)C[NH3+])C(=O)NCC([O-])=O HHRODZSXDXMUHS-LURJTMIESA-N 0.000 description 1
- MXIULRKNFSCJHT-STQMWFEESA-N Gly-Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 MXIULRKNFSCJHT-STQMWFEESA-N 0.000 description 1
- IEGFSKKANYKBDU-QWHCGFSZSA-N Gly-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)CN)C(=O)O IEGFSKKANYKBDU-QWHCGFSZSA-N 0.000 description 1
- WDXLKVQATNEAJQ-BQBZGAKWSA-N Gly-Pro-Asp Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WDXLKVQATNEAJQ-BQBZGAKWSA-N 0.000 description 1
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 1
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 1
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- 101150096276 HOG1 gene Proteins 0.000 description 1
- IMCHNUANCIGUKS-SRVKXCTJSA-N His-Glu-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IMCHNUANCIGUKS-SRVKXCTJSA-N 0.000 description 1
- ORERHHPZDDEMSC-VGDYDELISA-N His-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ORERHHPZDDEMSC-VGDYDELISA-N 0.000 description 1
- SKYULSWNBYAQMG-IHRRRGAJSA-N His-Leu-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SKYULSWNBYAQMG-IHRRRGAJSA-N 0.000 description 1
- PGRPSOUCWRBWKZ-DLOVCJGASA-N His-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 PGRPSOUCWRBWKZ-DLOVCJGASA-N 0.000 description 1
- LYDKQVYYCMYNMC-SRVKXCTJSA-N His-Lys-Cys Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CS)C(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 LYDKQVYYCMYNMC-SRVKXCTJSA-N 0.000 description 1
- BCZFOHDMCDXPDA-BZSNNMDCSA-N His-Lys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CN=CN2)N)O BCZFOHDMCDXPDA-BZSNNMDCSA-N 0.000 description 1
- MDOBWSFNSNPENN-PMVVWTBXSA-N His-Thr-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O MDOBWSFNSNPENN-PMVVWTBXSA-N 0.000 description 1
- 101000780443 Homo sapiens Alcohol dehydrogenase 1A Proteins 0.000 description 1
- 101000903373 Homo sapiens D-beta-hydroxybutyrate dehydrogenase, mitochondrial Proteins 0.000 description 1
- 101001059934 Homo sapiens Fos-related antigen 2 Proteins 0.000 description 1
- 101001120260 Homo sapiens Polyadenylate-binding protein 1 Proteins 0.000 description 1
- 101000664600 Homo sapiens Tripartite motif-containing protein 3 Proteins 0.000 description 1
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 1
- HDODQNPMSHDXJT-GHCJXIJMSA-N Ile-Asn-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O HDODQNPMSHDXJT-GHCJXIJMSA-N 0.000 description 1
- NBJAAWYRLGCJOF-UGYAYLCHSA-N Ile-Asp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NBJAAWYRLGCJOF-UGYAYLCHSA-N 0.000 description 1
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 1
- IGJWJGIHUFQANP-LAEOZQHASA-N Ile-Gly-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N IGJWJGIHUFQANP-LAEOZQHASA-N 0.000 description 1
- AKOYRLRUFBZOSP-BJDJZHNGSA-N Ile-Lys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N AKOYRLRUFBZOSP-BJDJZHNGSA-N 0.000 description 1
- CIDLJWVDMNDKPT-FIRPJDEBSA-N Ile-Phe-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N CIDLJWVDMNDKPT-FIRPJDEBSA-N 0.000 description 1
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 1
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 1
- XHBYEMIUENPZLY-GMOBBJLQSA-N Ile-Pro-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O XHBYEMIUENPZLY-GMOBBJLQSA-N 0.000 description 1
- CAHCWMVNBZJVAW-NAKRPEOUSA-N Ile-Pro-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)O)N CAHCWMVNBZJVAW-NAKRPEOUSA-N 0.000 description 1
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 1
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 1
- 102100039929 Intraflagellar transport protein 172 homolog Human genes 0.000 description 1
- 108010028688 Isoamylase Proteins 0.000 description 1
- 101150066291 JID1 gene Proteins 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- 108010029541 Laccase Proteins 0.000 description 1
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 1
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 1
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 1
- IWTBYNQNAPECCS-AVGNSLFASA-N Leu-Glu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IWTBYNQNAPECCS-AVGNSLFASA-N 0.000 description 1
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 1
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 1
- FOBUGKUBUJOWAD-IHPCNDPISA-N Leu-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FOBUGKUBUJOWAD-IHPCNDPISA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 1
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 1
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 1
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 108090001060 Lipase Proteins 0.000 description 1
- 102000004882 Lipase Human genes 0.000 description 1
- 239000004367 Lipase Substances 0.000 description 1
- VHFFQUSNFFIZBT-CIUDSAMLSA-N Lys-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N VHFFQUSNFFIZBT-CIUDSAMLSA-N 0.000 description 1
- BRSGXFITDXFMFF-IHRRRGAJSA-N Lys-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N BRSGXFITDXFMFF-IHRRRGAJSA-N 0.000 description 1
- SSJBMGCZZXCGJJ-DCAQKATOSA-N Lys-Asp-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O SSJBMGCZZXCGJJ-DCAQKATOSA-N 0.000 description 1
- DFXQCCBKGUNYGG-GUBZILKMSA-N Lys-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN DFXQCCBKGUNYGG-GUBZILKMSA-N 0.000 description 1
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 1
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 1
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 1
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 1
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 1
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 1
- WQDKIVRHTQYJSN-DCAQKATOSA-N Lys-Ser-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WQDKIVRHTQYJSN-DCAQKATOSA-N 0.000 description 1
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 1
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 1
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 1
- 240000003183 Manihot esculenta Species 0.000 description 1
- 235000016735 Manihot esculenta subsp esculenta Nutrition 0.000 description 1
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 1
- HKRYNJSKVLZIFP-IHRRRGAJSA-N Met-Asn-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HKRYNJSKVLZIFP-IHRRRGAJSA-N 0.000 description 1
- CHQWUYSNAOABIP-ZPFDUUQYSA-N Met-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N CHQWUYSNAOABIP-ZPFDUUQYSA-N 0.000 description 1
- RAAVFTFEAUAVIY-DCAQKATOSA-N Met-Glu-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N RAAVFTFEAUAVIY-DCAQKATOSA-N 0.000 description 1
- QTMIXEQWGNIPBL-JYJNAYRXSA-N Met-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N QTMIXEQWGNIPBL-JYJNAYRXSA-N 0.000 description 1
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 1
- CIIJWIAORKTXAH-FJXKBIBVSA-N Met-Thr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O CIIJWIAORKTXAH-FJXKBIBVSA-N 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- 241000123069 Ocyurus chrysurus Species 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 101150050255 PDC1 gene Proteins 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- 108700020962 Peroxidase Proteins 0.000 description 1
- 102000003992 Peroxidases Human genes 0.000 description 1
- YMORXCKTSSGYIG-IHRRRGAJSA-N Phe-Arg-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N YMORXCKTSSGYIG-IHRRRGAJSA-N 0.000 description 1
- CDNPIRSCAFMMBE-SRVKXCTJSA-N Phe-Asn-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CDNPIRSCAFMMBE-SRVKXCTJSA-N 0.000 description 1
- ABQFNJAFONNUTH-FHWLQOOXSA-N Phe-Gln-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N ABQFNJAFONNUTH-FHWLQOOXSA-N 0.000 description 1
- BIYWZVCPZIFGPY-QWRGUYRKSA-N Phe-Gly-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BIYWZVCPZIFGPY-QWRGUYRKSA-N 0.000 description 1
- SWCOXQLDICUYOL-ULQDDVLXSA-N Phe-His-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SWCOXQLDICUYOL-ULQDDVLXSA-N 0.000 description 1
- WEMYTDDMDBLPMI-DKIMLUQUSA-N Phe-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N WEMYTDDMDBLPMI-DKIMLUQUSA-N 0.000 description 1
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 description 1
- SCKXGHWQPPURGT-KKUMJFAQSA-N Phe-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O SCKXGHWQPPURGT-KKUMJFAQSA-N 0.000 description 1
- ZJPGOXWRFNKIQL-JYJNAYRXSA-N Phe-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 ZJPGOXWRFNKIQL-JYJNAYRXSA-N 0.000 description 1
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 1
- SHUFSZDAIPLZLF-BEAPCOKYSA-N Phe-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O SHUFSZDAIPLZLF-BEAPCOKYSA-N 0.000 description 1
- 108700019535 Phosphoprotein Phosphatases Proteins 0.000 description 1
- 102000045595 Phosphoprotein Phosphatases Human genes 0.000 description 1
- 102100026090 Polyadenylate-binding protein 1 Human genes 0.000 description 1
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 1
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 1
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 1
- DWGFLKQSGRUQTI-IHRRRGAJSA-N Pro-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 DWGFLKQSGRUQTI-IHRRRGAJSA-N 0.000 description 1
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 1
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 1
- SEZGGSHLMROBFX-CIUDSAMLSA-N Pro-Ser-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O SEZGGSHLMROBFX-CIUDSAMLSA-N 0.000 description 1
- SXJOPONICMGFCR-DCAQKATOSA-N Pro-Ser-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O SXJOPONICMGFCR-DCAQKATOSA-N 0.000 description 1
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 1
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 1
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 1
- 102100036894 Protein patched homolog 2 Human genes 0.000 description 1
- 101710161395 Protein patched homolog 2 Proteins 0.000 description 1
- 108091030071 RNAI Proteins 0.000 description 1
- 238000011530 RNeasy Mini Kit Methods 0.000 description 1
- 102000004879 Racemases and epimerases Human genes 0.000 description 1
- 108090001066 Racemases and epimerases Proteins 0.000 description 1
- 108091007187 Reductases Proteins 0.000 description 1
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 1
- 101100055274 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) ALD6 gene Proteins 0.000 description 1
- 101100223955 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) DLS1 gene Proteins 0.000 description 1
- 101100336452 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GDS1 gene Proteins 0.000 description 1
- 101100282741 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GIC1 gene Proteins 0.000 description 1
- 101100282742 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GIC2 gene Proteins 0.000 description 1
- 101100392439 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GIS1 gene Proteins 0.000 description 1
- 101100395959 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) HUG1 gene Proteins 0.000 description 1
- 101100184479 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) MNN4 gene Proteins 0.000 description 1
- 101100132334 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) MVB12 gene Proteins 0.000 description 1
- 101100082596 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) PDC5 gene Proteins 0.000 description 1
- 101100519200 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) PDC6 gene Proteins 0.000 description 1
- 101100209558 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) VMA9 gene Proteins 0.000 description 1
- 101100320836 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) YMR226C gene Proteins 0.000 description 1
- 241000235343 Saccharomycetales Species 0.000 description 1
- 240000000111 Saccharum officinarum Species 0.000 description 1
- 235000007201 Saccharum officinarum Nutrition 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 1
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 1
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 1
- OBXVZEAMXFSGPU-FXQIFTODSA-N Ser-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)CN=C(N)N OBXVZEAMXFSGPU-FXQIFTODSA-N 0.000 description 1
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 1
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 1
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 1
- MQQBBLVOUUJKLH-HJPIBITLSA-N Ser-Ile-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQQBBLVOUUJKLH-HJPIBITLSA-N 0.000 description 1
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 1
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 1
- CRJZZXMAADSBBQ-SRVKXCTJSA-N Ser-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO CRJZZXMAADSBBQ-SRVKXCTJSA-N 0.000 description 1
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 1
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 1
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 1
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 1
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- PCMZJFMUYWIERL-ZKWXMUAHSA-N Ser-Val-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMZJFMUYWIERL-ZKWXMUAHSA-N 0.000 description 1
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 1
- 101710172711 Structural protein Proteins 0.000 description 1
- 101150052312 TMN2 gene Proteins 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 1
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 1
- SXAGUVRFGJSFKC-ZEILLAHLSA-N Thr-His-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SXAGUVRFGJSFKC-ZEILLAHLSA-N 0.000 description 1
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 1
- QNCFWHZVRNXAKW-OEAJRASXSA-N Thr-Lys-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNCFWHZVRNXAKW-OEAJRASXSA-N 0.000 description 1
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 1
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 1
- BCYUHPXBHCUYBA-CUJWVEQBSA-N Thr-Ser-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BCYUHPXBHCUYBA-CUJWVEQBSA-N 0.000 description 1
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 1
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 1
- 108020004530 Transaldolase Proteins 0.000 description 1
- 102100028601 Transaldolase Human genes 0.000 description 1
- 102000004357 Transferases Human genes 0.000 description 1
- 108090000992 Transferases Proteins 0.000 description 1
- 108010043652 Transketolase Proteins 0.000 description 1
- 102000014701 Transketolase Human genes 0.000 description 1
- 102100029677 Trehalase Human genes 0.000 description 1
- 108010087472 Trehalase Proteins 0.000 description 1
- 102100038798 Tripartite motif-containing protein 3 Human genes 0.000 description 1
- BEIGSKUPTIFYRZ-SRVKXCTJSA-N Tyr-Asp-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O BEIGSKUPTIFYRZ-SRVKXCTJSA-N 0.000 description 1
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 1
- NVZVJIUDICCMHZ-BZSNNMDCSA-N Tyr-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O NVZVJIUDICCMHZ-BZSNNMDCSA-N 0.000 description 1
- YYLHVUCSTXXKBS-IHRRRGAJSA-N Tyr-Pro-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YYLHVUCSTXXKBS-IHRRRGAJSA-N 0.000 description 1
- CLEGSEJVGBYZBJ-MEYUZBJRSA-N Tyr-Thr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CLEGSEJVGBYZBJ-MEYUZBJRSA-N 0.000 description 1
- NXPDPYYCIRDUHO-ULQDDVLXSA-N Tyr-Val-His Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=C(O)C=C1 NXPDPYYCIRDUHO-ULQDDVLXSA-N 0.000 description 1
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 1
- UUYCNAXCCDNULB-QXEWZRGKSA-N Val-Arg-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O UUYCNAXCCDNULB-QXEWZRGKSA-N 0.000 description 1
- DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 1
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 1
- XEYUMGGWQCIWAR-XVKPBYJWSA-N Val-Gln-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N XEYUMGGWQCIWAR-XVKPBYJWSA-N 0.000 description 1
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 1
- OPGWZDIYEYJVRX-AVGNSLFASA-N Val-His-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N OPGWZDIYEYJVRX-AVGNSLFASA-N 0.000 description 1
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 1
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 1
- GVJUTBOZZBTBIG-AVGNSLFASA-N Val-Lys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N GVJUTBOZZBTBIG-AVGNSLFASA-N 0.000 description 1
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 1
- MBGFDZDWMDLXHQ-GUBZILKMSA-N Val-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MBGFDZDWMDLXHQ-GUBZILKMSA-N 0.000 description 1
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 1
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 1
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 1
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 1
- JXCOEPXCBVCTRD-JYJNAYRXSA-N Val-Tyr-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JXCOEPXCBVCTRD-JYJNAYRXSA-N 0.000 description 1
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 1
- 240000008042 Zea mays Species 0.000 description 1
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 1
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 1
- 125000000218 acetic acid group Chemical group C(C)(=O)* 0.000 description 1
- LIPOUNRJVLNBCD-UHFFFAOYSA-N acetyl dihydrogen phosphate Chemical compound CC(=O)OP(O)(O)=O LIPOUNRJVLNBCD-UHFFFAOYSA-N 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 238000010564 aerobic fermentation Methods 0.000 description 1
- 230000009603 aerobic growth Effects 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- 230000001476 alcoholic effect Effects 0.000 description 1
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 108010019077 beta-Amylase Proteins 0.000 description 1
- 108010005774 beta-Galactosidase Proteins 0.000 description 1
- 102000005936 beta-Galactosidase Human genes 0.000 description 1
- 108010055059 beta-Mannosidase Proteins 0.000 description 1
- 230000027455 binding Effects 0.000 description 1
- 102000023732 binding proteins Human genes 0.000 description 1
- 108091008324 binding proteins Proteins 0.000 description 1
- 230000008238 biochemical pathway Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 238000006664 bond formation reaction Methods 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 230000033077 cellular process Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 230000007073 chemical hydrolysis Effects 0.000 description 1
- 238000007385 chemical modification Methods 0.000 description 1
- 238000004883 computer application Methods 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 235000005822 corn Nutrition 0.000 description 1
- 239000008120 corn starch Substances 0.000 description 1
- 108010005400 cutinase Proteins 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 230000007071 enzymatic hydrolysis Effects 0.000 description 1
- 238000006047 enzymatic hydrolysis reaction Methods 0.000 description 1
- OCLXJTCGWSSVOE-UHFFFAOYSA-N ethanol etoh Chemical compound CCO.CCO OCLXJTCGWSSVOE-UHFFFAOYSA-N 0.000 description 1
- 235000019985 fermented beverage Nutrition 0.000 description 1
- 230000005714 functional activity Effects 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 230000009368 gene silencing by RNA Effects 0.000 description 1
- 108010078144 glutaminyl-glycine Proteins 0.000 description 1
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 1
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 1
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 1
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010050848 glycylleucine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 108010002430 hemicellulase Proteins 0.000 description 1
- 238000010842 high-capacity cDNA reverse transcription kit Methods 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 238000006460 hydrolysis reaction Methods 0.000 description 1
- 230000028993 immune response Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- PHTQWCKDNZKARW-UHFFFAOYSA-N isoamylol Chemical compound CC(C)CCO PHTQWCKDNZKARW-UHFFFAOYSA-N 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 150000004715 keto acids Chemical class 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 1
- 235000019421 lipase Nutrition 0.000 description 1
- 230000029226 lipidation Effects 0.000 description 1
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000010297 mechanical methods and process Methods 0.000 description 1
- MYWUZJCMWCOHBA-VIFPVBQESA-N methamphetamine Chemical compound CN[C@@H](C)CC1=CC=CC=C1 MYWUZJCMWCOHBA-VIFPVBQESA-N 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 230000002438 mitochondrial effect Effects 0.000 description 1
- 235000013379 molasses Nutrition 0.000 description 1
- 238000001823 molecular biology technique Methods 0.000 description 1
- 238000002887 multiple sequence alignment Methods 0.000 description 1
- 101150078678 osm-1 gene Proteins 0.000 description 1
- 238000005895 oxidative decarboxylation reaction Methods 0.000 description 1
- 108010089198 phenylalanyl-prolyl-arginine Proteins 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- 108010045857 phosphoglycerate phosphatase Proteins 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 238000003752 polymerase chain reaction Methods 0.000 description 1
- 239000004810 polytetrafluoroethylene Substances 0.000 description 1
- 229920001343 polytetrafluoroethylene Polymers 0.000 description 1
- 230000001323 posttranslational effect Effects 0.000 description 1
- 108010020432 prolyl-prolylisoleucine Proteins 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 229940107700 pyruvic acid Drugs 0.000 description 1
- 239000002994 raw material Substances 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- 229920006395 saturated elastomer Polymers 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 101150031442 sfc1 gene Proteins 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- VNOYUJKHFWYWIR-FZEDXVDRSA-N succinyl-coa Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCSC(=O)CCC(O)=O)O[C@H]1N1C2=NC=NC(N)=C2N=C1 VNOYUJKHFWYWIR-FZEDXVDRSA-N 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 150000003628 tricarboxylic acids Chemical class 0.000 description 1
- 230000007306 turnover Effects 0.000 description 1
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 1
- 108700026220 vif Genes Proteins 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/02—Preparation of oxygen-containing organic compounds containing a hydroxy group
- C12P7/04—Preparation of oxygen-containing organic compounds containing a hydroxy group acyclic
- C12P7/06—Ethanol, i.e. non-beverage
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N1/00—Microorganisms, e.g. protozoa; Compositions thereof; Processes of propagating, maintaining or preserving microorganisms or compositions thereof; Processes of preparing or isolating a composition containing a microorganism; Culture media therefor
- C12N1/14—Fungi; Culture media therefor
- C12N1/16—Yeasts; Culture media therefor
- C12N1/18—Baker's yeast; Brewer's yeast
- C12N1/185—Saccharomyces isolates
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/80—Vectors or expression systems specially adapted for eukaryotic hosts for fungi
- C12N15/81—Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0008—Oxidoreductases (1.) acting on the aldehyde or oxo group of donors (1.2)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12R—INDEXING SCHEME ASSOCIATED WITH SUBCLASSES C12C - C12Q, RELATING TO MICROORGANISMS
- C12R2001/00—Microorganisms ; Processes using microorganisms
- C12R2001/645—Fungi ; Processes using fungi
- C12R2001/85—Saccharomyces
- C12R2001/865—Saccharomyces cerevisiae
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y102/00—Oxidoreductases acting on the aldehyde or oxo group of donors (1.2)
- C12Y102/04—Oxidoreductases acting on the aldehyde or oxo group of donors (1.2) with a disulfide as acceptor (1.2.4)
- C12Y102/04002—Oxoglutarate dehydrogenase (succinyl-transferring) (1.2.4.2), i.e. alpha-ketoglutarat dehydrogenase
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02E—REDUCTION OF GREENHOUSE GAS [GHG] EMISSIONS, RELATED TO ENERGY GENERATION, TRANSMISSION OR DISTRIBUTION
- Y02E50/00—Technologies for the production of fuel of non-fossil origin
- Y02E50/10—Biofuels, e.g. bio-diesel
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Mycology (AREA)
- Microbiology (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Medicinal Chemistry (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Botany (AREA)
- Tropical Medicine & Parasitology (AREA)
- Virology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
描述了涉及过表达α‑酮戊二酸脱氢酶(KGD2)的经修饰的酵母细胞的组合物和方法。与在其他方面相同的亲本酵母细胞相比,这些经修饰的酵母细胞产生增加量的乙醇。这样的酵母细胞对于从淀粉底物大规模生产乙醇特别有用。
Description
交叉引用
本申请要求于2021年5月10日提交的美国临时申请号63/186,332的权益,该临时申请通过引用以其全文并入。
通过引用并入序列表
本申请连同电子格式的序列表一起提交。序列表以标题为NB41694WOPCTSeqList.txt的文件提供,该文件创建于2022年5月9日,大小是14,936字节。电子格式的序列表中的信息通过引用以其全文并入。
技术领域
本发明的组合物和方法涉及过表达α-酮戊二酸脱氢酶(KGD2)的经修饰的酵母。与其他方面相同的亲本细胞相比,该酵母产生增加量的乙醇。这样的酵母对于从淀粉底物大规模生产乙醇特别有用。
背景技术
第一代基于酵母的乙醇生产将糖转化为燃料乙醇。全世界酵母的年燃料乙醇生产量为约900亿升(Gombert,A.K.和van Maris.A.J.(2015)Curr.Opin.Biotechnol.[生物技术新见]33:81-86)。据估计,乙醇生产成本的约70%是原料。由于生产量如此之大,即使产率的微小提升也会对整个行业产生巨大的经济影响。因此,需要与传统酵母和工程化的酵母两者相比,制备更多乙醇的稳健的酵母。
发明内容
本发明的组合物和方法涉及过表达α-酮戊二酸脱氢酶(KGD2)的经修饰的酵母。这些组合物和方法的各方面和实施例描述于以下独立编号的段落中。
1.在一方面,提供了一种衍生自亲本酵母细胞的经修饰的酵母细胞,这些经修饰的细胞包含遗传改变,该遗传改变引起这些经修饰的细胞与这些亲本细胞相比产生增加量的α-酮戊二酸脱氢酶(KGD2)多肽,其中与在其他方面相同的亲本酵母细胞产生的乙醇量相比,这些经修饰的细胞在发酵期间产生增加量的乙醇。
2.在如段落1所述的经修饰的细胞的一些实施例中,该遗传改变包括将核酸引入亲本细胞中,该核酸能够指导KGD2多肽以高于等同条件下生长的亲本细胞的水平表达。
3.在如段落2所述的经修饰的细胞的一些实施例中,该遗传改变包括引入用于表达KGD2多肽的表达盒。
4.在如段落3所述的经修饰的细胞的一些实施例中,该表达盒包含外源KGD2基因。
5.在如段落2所述的经修饰的细胞的一些实施例中,该核酸包含导致发酵后期KGD2多肽的表达增加的启动子。
6.在如段落2所述的经修饰的细胞的一些实施例中,该核酸包含可操作地连接到该KGD2多肽的编码序列的ADR1启动子。
7.在如段落1-6中任一项所述的经修饰的细胞的一些实施例中,与在等同条件下生长的这些亲本细胞中的表达水平相比,KGD2多肽的表达增加量是至少20%、至少30%、至少40%、至少50%、至少70%、至少100%、至少150%、至少200%、或至少500%或更多。
8.在如段落1-6中任一项所述的经修饰的细胞的一些实施例中,与在等同条件下生长的这些亲本细胞产生的KGD2 mRNA的量相比,这些经修饰的细胞产生的KGD2 mRNA的量增加至少2倍、至少5倍、至少10倍、至少20倍、至少50倍、至少100倍或更多。
9.在如段落1-8中任一项所述的经修饰的细胞的一些实施例中,这些细胞进一步包含遗传改变,该遗传改变引起这些经修饰的细胞与这些亲本细胞相比产生增加量的转录调节因子MIG3多肽。
10.在如段落1-9中任一项所述的经修饰的细胞的一些实施例中,这些细胞进一步包含编码碳水化合物加工酶的外源基因。
11.在一些实施例中,如段落1-10中任一项所述的经修饰的细胞进一步包含PKL途径。
12.在一些实施例中,如段落1-11中任一项所述的经修饰的细胞进一步包含甘油途径和/或乙酰辅酶A途径中的改变。
13.在一些实施例中,如段落1-12中任一项所述的经修饰的细胞进一步包含用于制备乙醇的替代途径。
14.在如段落1-13中任一项所述的经修饰的细胞的一些实施例中,这些细胞属于酵母属物种(Saccharomyces spp)。
15.在另一方面,提供了一种用于增加生长于碳水化合物底物上的酵母细胞的醇生产的方法,该方法包括:向亲本酵母细胞中引入遗传改变,与在其他方面相同的亲本细胞中产生的量相比,所述遗传改变使KGD2多肽的产生增加。
16.在如段落15所述的方法的一些实施例中,这些具有引入的遗传改变的经修饰的细胞是经修饰的细胞,这些经修饰的细胞是如段落1-14中任一项所述的细胞。
17.在如段落15或16所述的方法的一些实施例中,在等同发酵条件下,醇的生产增加至少0.5%。
18.在如段落15-17中任一项所述的方法的一些实施例中,与在等同条件下生长的其他方面相同的亲本细胞产生的KGD2的量相比,KGD2的生产的增加是至少20%、至少30%、至少40%、至少50%、至少70%、至少100%、至少150%、至少200%、或至少500%或更多的增加。
19.在如段落15-18中任一项所述的方法的一些实施例中,与在等同条件下生长的其他方面相同的亲本细胞产生的KGD2 mRNA的量相比,这些经修饰的细胞产生的KGD2 mRNA的量增加至少2倍、至少5倍、至少10倍、至少20倍、至少50倍、至少100倍或更多。
根据包括任何附图/图的说明书,本发明的经修饰的细胞和方法的这些和其他方面以及实施例将是清楚的。
具体实施方式
I.定义
在详细地描述本发明的酵母和方法之前,为了清楚起见定义以下术语。未定义的术语应当符合相关领域中所使用的常规含义。
如本文所使用的,术语“醇”是指其中羟基官能团(-OH)与饱和碳原子键合的有机化合物。
如本文所使用的,术语“酵母细胞”、“酵母菌株”、或简称“酵母”是指来自子囊菌门(Ascomycota)和担子菌门(Basidiomycota)的生物。示例性酵母是来自酵母目(Saccharomycetales)的芽殖酵母。酵母的特定实例是酵母属物种,包括但不限于酿酒酵母(S.cerevisiae)。酵母包括用于生产燃料醇的生物以及用于生产可饮用醇的生物,包括用于制备独特味道的啤酒、葡萄酒和其他发酵饮料的特种和专有酵母菌株。
如本文所使用的,短语“工程化的酵母细胞”、“变体酵母细胞”、“经修饰的酵母细胞”、或相似短语是指包括本文所述的遗传修饰和特征的酵母。变体/经修饰的酵母不包括天然存在的酵母。
如本文所使用的,术语“多肽”和“蛋白质”(以及它们各自的复数形式)可互换地使用,是指包含通过肽键连接的氨基酸残基的任何长度的聚合物。本文使用氨基酸残基的常规一字母或三字母代码,并且所有序列均从N末端到C末端方向呈现。聚合物可以包含经修饰的氨基酸,并且它可以被非氨基酸中断。这些术语还包括天然地修饰的或通过干预而修饰的氨基酸聚合物;例如,二硫键形成、糖基化、脂化、乙酰化、磷酸化或任何其他操作或修饰,如与标记组分缀合而修饰的氨基酸聚合物。这些定义内还包括例如含有一种或多种氨基酸类似物(包括例如非天然氨基酸等)以及本领域已知的其他修饰的多肽。
如本文所使用的,功能上和/或结构上相似的蛋白质被认为是“相关蛋白”或“同源物”。这样的蛋白质可以衍生自不同属和/或物种的生物,或不同纲的生物(例如,细菌和真菌),或者是人工设计的蛋白质。相关蛋白还涵盖通过一级序列分析确定的、通过二级或三级结构分析确定的、或者通过免疫交叉反应性确定的、或通过他们的功能确定的同源物。
如本文所使用的,术语“同源蛋白”是指与参考蛋白具有相似活性和/或结构的蛋白。这并不旨在意味着同源物必定是进化上相关的。因此,该术语旨在涵盖从不同生物获得的相同、相似、或相应(即,在结构和功能方面)的一种或多种酶。在一些实施例中,希望鉴定与参考蛋白具有相似的四级、三级和/或一级结构的同源物。在一些实施例中,同源蛋白作为参考蛋白诱导一种或多种相似的免疫应答。在一些实施例中,同源蛋白经过工程化以产生具有一种或多种希望活性的酶。
序列之间的同源性程度可使用本领域已知的任何适合方法确定(参见例如,Smith和Waterman(1981)Adv.Appl.Math.[应用数学进展]2:482;Needleman和Wunsch(1970)J.Mol.Biol.[分子生物学杂志],48:443;Pearson和Lipman(1988)Proc.Natl.Acad.Sci.USA[美国科学院院报]85:2444;威斯康星遗传学软件包(WisconsinGenetics Software Package)(遗传学计算机组公司(Genetics Computer Group),麦迪逊,威斯康星州)中的程序,如GAP、BESTFIT、FASTA、和TFASTA;和Devereux等人(1984)Nucleic Acids Res.[核酸研究]12:387-95)。
例如,PILEUP是确定序列同源性水平的有用程序。PILEUP使用渐进的、两两比对创建了来自一组相关序列的多重序列比对。它还可以绘制显示用于创建该比对的聚类关系的树。PILEUP使用Feng和Doolittle的渐进比对方法的简化版(Feng和Doolittle(1987)J.Mol.Evol.[分子进化杂志]35:351-60)。该方法类似于Higgins和Sharp描述的方法((1989)CABIOS[生物科学中的计算机应用]5:151-53)。有用的PILEUP参数包括3.00的默认空位权重,0.10的默认空位长度权重,以及加权的末端空位。有用算法的另一个实例是BLAST算法,由以下描述:Altschul等人((1990)J.Mol.Biol.[分子生物学杂志]215:403-10)和Karlin等人((1993)Proc.Natl.Acad.Sci.USA[美国科学院院报]90:5873-87)。一个特别有用的BLAST程序是WU-BLAST-2程序(参见,例如,Altschul等人(1996)Meth.Enzymol.[酶学方法]266:460-80)。参数“W”、“T”、以及“X”确定了该比对的灵敏度与速度。该BLAST程序使用字长(W)为11、BLOSUM62得分矩阵(参见,例如,Henikoff和Henikoff(1989)Proc.Natl.Acad.Sci.USA[美国科学院院报]89:10915)比对(B)为50、期望值(E)为10、M'5、N'-4、以及两条链的比较作为默认值。
如本文所使用的,在至少两个核酸或多肽的上下文中,短语“基本上相似”和“基本上相同”典型地意指多核苷酸或多肽包含与参考(即,野生型)序列相比具有至少约70%同一性、至少约75%同一性、至少约80%同一性、至少约85%同一性、至少约90%同一性、至少约91%同一性、至少约92%同一性、至少约93%同一性、至少约94%同一性、至少约95%同一性、至少约96%同一性、至少约97%同一性、至少约98%同一性、或甚至至少约99%同一性、或更高同一性的序列。使用具有默认参数的CLUSTAL W算法计算序列同一性百分比。参见Thompson等人(1994)Nucleic Acids Res.[核酸研究]22:4673-4680。CLUSTAL W算法的默认参数是:
空位开放罚分: 10.0
空位延伸罚分: 0.05
蛋白质权重矩阵: BLOSUM系列
DNA权重矩阵: IUB
延迟发散序列%: 40
空位分隔距离: 8
DNA转换权重: 0.50
列表亲水残基: GPSNDQEKR
使用负性矩阵: 关
切换特殊残基罚分: 开
切换亲水罚分: 开
切换结束空位分隔罚分 关
两种多肽基本上相同的另一个指示是第一多肽与第二多肽具有免疫交叉反应性。典型地,差别在于保守氨基酸取代的多肽具有免疫交叉反应性。因此,多肽与第二多肽基本上相同,例如,其中两个肽的区别仅在于保守取代。两个核酸序列基本上相同的另一个指示是两个分子在严格条件下(例如,在中等至高严格的范围内)彼此杂交。
如本文所使用的,术语“基因”与术语“等位基因”同义,是指编码和指导蛋白质或RNA表达的核酸。丝状真菌的营养体形式通常是单倍体,因此指定基因的单拷贝(即单个等位基因)足以赋予指定表型。当生物含有多于一个相似基因时,术语“等位基因”通常是优选的,在这种情况下,每个不同的相似基因被称为不同的“等位基因”。
如本文所使用的,“组成型”表达是指在基本上所有典型生长条件下,由特定基因编码的多肽的产生,而不是“条件”表达,该“条件”表达需要特定的底物的存在、温度等来诱导或激活表达。
如本文所使用的,术语“表达多肽”和相似术语是指使用细胞的翻译机器(例如,核糖体)产生多肽的细胞过程。
如本文所使用的,“过表达多肽”、“增加多肽的表达”和相似术语是指与不包括指定遗传修饰的亲本或“野生型”细胞所观察到的相比,以高于正常水平的水平表达多肽。
如本文所使用的,“表达盒”是指包括启动子、和氨基酸编码区以及终止子(即,启动子::氨基酸编码区::终止子)以及允许该编码多肽在细胞中生产所需的其他核酸序列的DNA片段。表达盒可以是外源的(即,引入细胞中)或内源的(即,存在于细胞中)。
如本文所使用的,关于两个DNA片段的术语“融合的”和“融合”(如启动子和多肽的编码区)是指导致两个DNA片段变成单个分子的物理键合。
如本文所使用的,术语“野生型”和“天然”可互换地使用,并且是指在自然界中发现的基因、蛋白质或菌株,或者不是为了目前所述酵母的优势而有意修饰的基因、蛋白质或菌株。
如本文所使用的,术语“目的蛋白”是指希望在经修饰的酵母中表达的多肽。这样的蛋白质可以是酶、底物结合蛋白、表面活性蛋白、结构蛋白、可选择标记等,并且能被表达。目的蛋白由相对于亲本菌株的内源基因或异源基因(即,目的基因)编码。目的蛋白可以在细胞内表达或作为分泌的蛋白表达,或者在细胞表面上展示。
如本文所使用的,“基因破坏”泛指基本上阻止细胞在宿主细胞中产生功能性基因产物,例如蛋白质的任何遗传或化学操作,即突变。示例性破坏方法包括基因的任何部分的完全或部分(包括多肽编码序列、启动子、增强子、或另外的调节元件)缺失、或其诱变,其中诱变涵盖取代、插入、缺失、倒位、及其组合和变化,任何这些突变基本上阻止功能基因产物的产生。也可以使用CRISPR、RNAi、反义、或任何其他消除基因表达的方法破坏基因。可以通过非相邻控制元件的缺失或遗传操作来破坏基因。如本文所使用的,“基因缺失”是指该基因从宿主细胞的基因组中去除。当基因包括与基因编码序列不紧邻的控制元件(例如,增强子元件)时,基因缺失是指编码序列、以及任选地相邻增强子元件(例如,包括但不限于启动子和/或终止子序列)的缺失,但未要求非相邻控制元件的缺失。基因缺失也指编码序列的一部分、或与编码序列紧邻或不紧邻的启动子的一部分的缺失,其中工程化的细胞中不存在目的基因的功能活性。
如本文所使用的,术语“遗传操作”、“遗传改变”、“遗传工程化”以及相似术语可互换地使用,并且是指核酸序列的改变/变化。改变可包括但不限于核酸序列中至少一种核酸的取代、缺失、插入或化学修饰。
如本文所使用的,“功能性多肽/蛋白”是具有活性(如酶活性、结合活性、表面活性特性等)的蛋白质,并且其未被诱变、截短、或以其他方式修饰以消除或降低此活性。如所指出的,功能性多肽可以是热稳定的或不耐热的。
如本文所使用的,术语“融合的蛋白”和“融合蛋白”与两个多肽(如两个不同的酶)有关,其在有或没有一个或多个接头的情况下物理连接在一起,导致两个多肽变成单个分子。
如本文所使用的,“功能性基因”是能够被细胞组分用于产生活性基因产物(典型地蛋白质)的基因。功能性基因是破坏的基因的对立体,这些破坏的基因被修饰使得它们不能被细胞组分用于产生活性基因产物,或者具有降低的被细胞组分用于产生活性基因产物的能力。
如本文所使用的,如果已对酵母细胞进行遗传或化学改变以防止产生呈现出野生型蛋白的活性特征的功能性蛋白/多肽,则对这些酵母细胞已经进行了“修饰以防止产生指定蛋白”。这样的修饰包括但不限于编码蛋白质(如本文所述)的基因的缺失或破坏、使得编码的多肽缺乏前述活性的基因的修饰、影响翻译后加工或稳定性的基因的修饰、及其组合。
如本文所使用的,“途径的减弱”或“通过途径的通量的减弱”(即,生物化学途径)泛指通过代谢途径减少或完全阻止生物化学底物或中间体的通量的任何遗传的或化学的操作。可以通过各种众所周知的方法实现途径的减弱。这样的方法包括,但不限于:完全或部分缺失一个或多个基因、用编码具有降低的催化活性或增加的Km值的酶的突变体形式替代这些基因的野生型等位基因、修饰启动子或控制一个或多个基因表达的其他调节元件、针对减少的稳定性工程化这些酶或编码这些酶的mRNA、将酶错误指导进不太可能与底物和中间体相互作用的细胞区室、使用干扰RNA等。
如本文所使用的,“需氧发酵”是指在氧存在下的生长以及生产过程。
如本文所使用的,“厌氧发酵”是指在不存在氧的情况下的生长以及生产。
如本文所使用的,表述“发酵结束”是指就固定和可变成本而言,当连续发酵产生少量另外的醇的经济优势被连续发酵的成本超过时的发酵阶段。在更一般的意义上,“发酵结束”是指发酵不再产生大量另外的醇,即不超过约1%的另外的醇的点。
如本文所使用的,表达“碳通量”是指碳分子通过代谢途径的周转率。碳通量由代谢途径(如葡萄糖代谢途径和麦芽糖代谢途径)中涉及的酶调节。
如本文所使用的,单数冠词“一个/一种(a/an)”以及“该/所述(the)”涵盖复数个指示物,除非上下文中另外明确指明。本文引用的所有参考文献均通过引用以其全文特此并入。除非另外说明,否则以下缩写/首字母缩略词具有以下含义:
除非另外说明,以下含义:
℃ 摄氏度
AA α-淀粉酶
AADH 乙醛脱氢酶
TCA 三羧酸
bp 碱基对
DNA 脱氧核糖核酸
ds或DS 干固体
EC 酶学委员会
EtOH 乙醇
g或gm 克
g/L 克/升
GA 葡糖淀粉酶
H2O 水
HPLC 高效液相色谱法
hr或h 小时
KGD2 α-酮戊二酸脱氢酶
kg 千克
M 摩尔
mg 毫克
min 分钟
mL或ml 毫升
mM 毫摩尔
mRNA 信使RNA
N 当量浓度
nm 纳米
PCR 聚合酶链式反应
PKL 磷酸转酮酶
ppm 百万分率
PTA 磷酸转乙酰酶
Δ 与缺失有关
μg 微克
μL和μl 微升
μM 微摩尔
II.具有增加的KGD2表达的经修饰的酵母细胞
本发明的组合物和方法涉及具有遗传改变的经修饰的酵母细胞,该遗传改变引起这些细胞与其他方面相同的亲本细胞相比产生增加量的α-酮戊二酸脱氢酶(KGD2)多肽,其中在等同发酵条件下与其他方面相同的亲本细胞产生的乙醇量相比,这些经修饰的细胞在发酵期间产生增加量的乙醇。
KGD2是线粒体α-酮戊二酸脱氢酶复合物的组分,其在三羧酸循环(TCA)循环中催化α-酮戊二酸氧化脱羧为琥珀酰辅酶A。KGD2对于酵母的需氧生长至关重要,但其在厌氧发酵中的作用迄今为止尚不清楚。
基于对代谢途径的分析和先前具有工程化代谢途径的酵母的经验,推测发酵后期KGD2的过表达可能导致厌氧条件下乙醇生产增加,如在商业乙醇生产设施中发现的那些。事实上,如所附实例所证明的,可商购的酵母中KGD2的后期过表达导致乙醇生产显著增加。
示例性酿酒酵母KGD2多肽的氨基酸序列如以下SEQ ID NO:1所示:
NCBI数据库包括与SEQ ID NO:1具有不同程度同一性的来自许多生物体的多肽条目。当引入酵母时,这些多肽预期具有相似的功能,特别是考虑到TCA循环在自然界中基本上普遍存在的事实。
在本发明的组合物和方法的特定实施例中,在经修饰的酵母细胞中过表达的KGD2多肽的氨基酸序列与SEQ ID NO:1具有至少约60%、至少约65%、至少约70%、至少约75%、至少约80%、至少约85%、至少约90%、至少约95%、至少约97%、至少约98%、或甚至至少约99%。
预计氨基酸序列中的天然变异不会影响功能。此外,功能和/或结构相似的蛋白质、同源蛋白质和/或基本上相似或相同的蛋白质的过表达预计会产生相似的有益结果。
在一些实施例中,与在相同条件下生长的其他方面相同的亲本细胞产生的KGD2多肽量相比,经修饰的细胞产生的KGD2多肽量的增加是至少20%、至少30%、至少40%、至少50%、至少70%、至少100%、至少150%、至少200%、至少500%、至少1,000%、至少2,000%、或更多的增加。
在一些实施例中,与在相同条件下生长的其他方面相同的亲本细胞产生的KGD2mRNA的量相比,经修饰的细胞产生的KGD2 mRNA的量增加至少2倍、至少5倍、至少10倍、至少20倍、至少50倍、至少100倍或更多。
在一些实施例中,基于产生的mRNA的量,与控制KGD2表达的天然启动子的强度相比,用于控制由经修饰的细胞产生的KGD2多肽的表达的启动子的强度增加至少2倍、至少5倍、至少10倍、至少20倍、至少50倍、至少100倍或更多。
在一些实施例中,启动子在发酵后期(例如,在典型的工业酒精发酵过程,例如48小时的发酵过程的后半段或最后三分之一或四分之一)指导KGD2多肽的最大表达。在一些实施例中,KGD2多肽的表达在48小时时高于在24小时时。在一些实施例中,KGD2多肽的表达在48小时时高于在36小时时。
在一些实施例中,与在相同条件下生长的其他方面相同的细胞产生的乙醇的量相比,经修饰的细胞产生的乙醇的增加是至少约0.5%、至少约1.0%、至少约1.5%、至少约2.0%、至少2.5%或更多的增加。
优选地,通过使用序列特异性分子生物学技术的遗传操作实现KGD2表达增加,该遗传操作与化学诱变相反,化学诱变通常不靶向特定核酸序列。然而,不排除化学诱变作为制备经修饰的酵母细胞的方法。
在一些实施例中,本发明的组合物和方法涉及向酵母细胞中引入能够指导KGD2多肽过表达或表达增加的核酸。特定方法包括但不限于(i)将用于增加多肽生产的外源表达盒的另外的拷贝引入宿主细胞中,(ii)将用于增加多肽生产的一个或多个外源表达盒引入宿主细胞中,(iii)用允许产生增加量的多肽的外源表达盒取代内源表达盒,(iv)修饰或替换内源表达盒的启动子以增加表达,和/或(v)修饰宿主细胞的任何方面以增加该多肽在宿主细胞中的半衰期。
在一些实施例中,经修饰的亲本细胞已包括目的基因,如编码可选择标记、碳水化合物加工酶或其他多肽的基因。在一些实施例中,随后将目的基因引入经修饰的细胞中。
III.具有增加的KGD2和MIG3表达的经修饰的酵母细胞
KGD2的过表达可以有利地与MIG3的过表达组合,MIG3是减少乙酸和甘油产生的转录调节因子。
在一些实施例中,与在相同条件下生长的其他方面相同的亲本细胞产生的MIG3多肽的量相比,经修饰的细胞产生的MIG3多肽量的增加是至少约20%、至少约30%、至少约40%、至少约50%、至少约70%、至少约100%、至少约150%、至少约200%、至少约500%、至少约1,000%、至少约2,000%、或更多的增加。
在一些实施例中,与在相同条件下生长的其他方面相同的亲本细胞产生的MIG3多肽的量相比,经修饰的细胞产生的MIG3 mRNA的量增加至少约2倍、至少约5倍、至少约10倍、至少约20倍、至少约50倍、至少约100倍或更多。
在一些实施例中,基于产生的mRNA的量,与控制MIG3表达的天然启动子的强度相比,用于控制由经修饰的细胞产生的MIG3多肽的表达的启动子的强度增加至少约2倍、至少约5倍、至少约10倍、至少约20倍、至少约50倍、至少约100倍或更多。在一些实施例中,启动子比EFB1启动子弱。在特定实施例中,启动子是SUI3启动子。
在一些实施例中,与在相同条件下生长的其他方面相同的亲本细胞产生的乙酸的量相比,经修饰的细胞的乙酸生产的减少是至少约5%、至少约10%、至少约15%、至少约20%、至少约25%、至少约30%、至少约35%、至少约40%、或更多的减少。
在一些实施例中,与其他方面相同的亲本细胞产生的乙酸的量相比,表达KGD2和MIG3的酵母中甘油减少至少约5%、至少约8%、至少约10%、至少约12%或更多。
用于将能够指导MIG3多肽的过表达或表达增加的核酸引入酵母细胞的方法包括上文对于KGD2多肽的过表达所描述的那些。
示例性酿酒酵母MIG3多肽的氨基酸序列如以下SEQ ID NO:5所示:
NCBI数据库包括酿酒酵母MIG3多肽的超过40个条目,预计其适合引入酵母。预计氨基酸序列中的天然变异不会影响其功能。基于BLAST和Clustal W数据,显然示例性酿酒酵母MIG3多肽与来自其他生物体的多肽具有序列同一性。功能和/或结构相似的蛋白质、同源蛋白质和/或基本上相似或相同的蛋白质的过表达预计会产生相似的有益结果。
在本发明的组合物和方法的特定实施例中,在经修饰的酵母细胞中过表达的MIG3多肽的氨基酸序列与SEQ ID NO:5具有至少约50%、至少约60%、至少约70%、至少约80%、至少约90%、至少约91%、至少约92%、至少约93%、至少约94%、至少约95%、至少约96%、至少约97%、至少约98%、或甚至至少约99%同一性。
IV.具有与外源PKL途径的组合的增加的KGD2表达的经修饰的酵母细胞
KGD2的表达增加(任选地与MIG3的表达增加组合)还可以与PKL途径中基因的表达组合,以进一步增加乙醇生产,其与将外源PKL途径引入酵母相关。
WO 2015148272中先前已描述了具有异源PKL途径的工程化的酵母细胞(Miasnikov等人)。这些细胞表达异源磷酸转酮酶(PKL)、磷酸转乙酰酶(PTA)、和乙酰化乙酰基脱氢酶(AADH)(任选地以及其他酶),以使碳流(carbon flux)远离甘油途径并且朝向乙酰辅酶A的合成,然后该乙酰辅酶A转化为乙醇。与在其他方面相同的亲本酵母细胞相比,这样的经修饰的细胞能够在发酵过程中增加乙醇生产。
V.增加的KGD2生产与影响醇生产的其他突变的组合
在一些实施例中,除了表达增加量的KGD2多肽(任选地与增加的MIG3表达组合,以及任选地与异源PKL途径组合)之外,本发明的经修饰的酵母细胞包括另外的有益修饰。
这些经修饰的细胞可以进一步包括导致天然甘油生物合成途径减弱的突变,已知所述突变增加醇生产。用于减弱酵母中甘油生物合成途径的方法是已知的,并包括例如通过破坏基因GPD1、GPD2、GPP1和/或GPP2中的一个或多个来降低或消除内源性NAD依赖性甘油3-磷酸脱氢酶(GPD)或磷酸甘油磷酸酶(GPP)活性。参见例如美国专利号9,175,270(Elke等人)、8,795,998(Pronk等人)以及8,956,851(Argyros等人)。Methods to enhance thereuse glycerol pathway by over expression of glycerol dehydrogenase(GCY1)anddihydroxyacetone kinase(DAK1)to convert glycerol to dihydroxyacetonephosphate[通过过表达甘油脱氢酶(GCY1)和二羟基丙酮激酶(DAK1)将甘油转化为磷酸二羟基丙酮以增强再利用甘油途径的方法](Zhang等人(2013)J.Ind.Microbiol.Biotechnol.[工业微生物学和生物技术杂志]40:1153-60)。
经修饰的酵母的特征可以进一步在于增加的乙酰辅酶A合酶(也称为乙酰辅酶A连接酶)活性(EC 6.2.1.1)以清除(即捕获)通过化学或酶水解乙酰-磷酸产生(或出于任何其他原因存在于酵母的培养基中)的乙酸并将其转化为乙酰辅酶A。这部分地降低了乙酸对酵母细胞生长的不希望的影响,并且可以进一步有助于醇产率的提升。增加乙酰辅酶A合酶活性可以通过将异源乙酰辅酶A合酶基因引入细胞、增加内源乙酰辅酶A合酶基因的表达等来实现。
在一些实施例中,经修饰的细胞可进一步包括编码具有NAD+依赖性乙酰化乙醛脱氢酶活性的蛋白质的异源基因和/或编码丙酮酸甲酸裂解酶的异源基因。例如,在美国专利号8,795,998(Pronk等人)中描述了与甘油途径减弱组合的这样的基因的引入。在本发明的组合物和方法的一些实施例中,酵母特意缺乏编码乙酰化乙醛脱氢酶、丙酮酸甲酸裂解酶或两者的一种或多种异源基因。
在一些实施例中,本发明的经修饰的酵母细胞可以进一步过表达糖转运蛋白样(STL1)多肽以增加甘油的摄入(参见,例如,Ferreira等人(2005)Mol.Biol.Cell.[细胞分子生物学]16:2068-76;等人(2015)Mol.Microbiol.[分子微生物学]97:541-59以及WO 2015023989A1)以增加乙醇生产和减少乙酸。
在一些实施例中,本发明的经修饰的酵母细胞进一步包括丁醇生物合成途径。在一些实施例中,丁醇生物合成途径是异丁醇生物合成途径。在一些实施例中,异丁醇生物合成途径包含编码如下多肽的多核苷酸,该多肽催化选自由以下组成的组的底物至产物的转化:(a)丙酮酸至乙酰乳酸;(b)乙酰乳酸至2,3-二羟基异戊酸盐;(c)2,3-二羟基异戊酸盐至2-酮异戊酸盐;(d)2-酮异戊酸盐至异丁醛;和(e)异丁醛至异丁醇。在一些实施例中,异丁醇生物合成途径包含编码具有乙酰乳酸合酶、酮酸还原异构酶、二羟酸脱水酶、酮异戊酸盐脱羧酶、和醇脱氢酶活性的多肽的多核苷酸。
在一些实施例中,包含丁醇生物合成途径的经修饰的酵母细胞进一步包含编码具有丙酮酸脱羧酶活性的多肽的多核苷酸中的修饰。在一些实施例中,酵母细胞在编码具有丙酮酸脱羧酶活性的多肽的内源多核苷酸中包含缺失、突变和/或取代。在一些实施例中,具有丙酮酸脱羧酶活性的多肽选自由以下组成的组:PDC1、PDC5、PDC6、及其组合。
在一些实施例中,酵母细胞在一个或多个内源多核苷酸中进一步包含缺失、突变、过表达和/或取代,这些内源多核苷酸编码FRA2、ALD6、ADH1、ADH2、GPD2、BDH1、DLS1、DPB3、CPR1、MAL23C、MNN4、PAB1、TMN2、HAC1、PTC1、PTC2、OSM1、GIS1、CRZ1、HUG1、GDS1、CYB2P、SFC1、MVB12、LDB10、C5SD、GIC1、GIC2、JID1和/或YMR226C。
VI.增加的KGD2表达与其他有益突变的组合
在一些实施例中,除了KGD2多肽的表达增加(任选地与有益于醇生产、或乙酸和/或甘油减少的其他遗传修饰组合)之外,本发明的经修饰的酵母细胞进一步包括任何数量的编码目的蛋白的另外的目的基因。可以在遗传操作之前、期间或之后引入另外的目的基因,这些遗传操作导致活性KGD2多肽的生产增加。目的蛋白包括可选择标记、碳水化合物加工酶以及其他商业上相关的多肽,包括但不限于选自由以下组成的组的酶:脱氢酶、转酮醇酶、磷酸转酮酶、转醛醇酶(transladolase)、差向异构酶、植酸酶、木聚糖酶、β-葡聚糖酶、磷酸酶、蛋白酶、α-淀粉酶、β-淀粉酶、葡糖淀粉酶、支链淀粉酶、异淀粉酶、纤维素酶、海藻糖酶、脂肪酶、果胶酶、聚酯酶、角质酶、氧化酶、转移酶、还原酶、半纤维素酶、甘露聚糖酶、酯酶、异构酶、果胶酶、乳糖酶、过氧化物酶和漆酶。目的蛋白可以被分泌、糖基化并以其他方式修饰。
VII.经修饰的酵母用于增加醇生产的用途
本发明的组合物和方法包括在发酵反应中用于增加醇生产的方法。这样的方法不限于特定的发酵工艺。预期本发明的工程化的酵母是任何醇发酵设施中常规酵母的“普适性(drop-in)”替代品。虽然主要用于燃料醇生产,但本发明的酵母还可用于生产可饮用醇,包括葡萄酒和啤酒。
VIII.适合修饰的酵母细胞
酵母是被归类为真菌界成员的单细胞真核微生物,并且包括来自子囊菌门和担子菌门的生物。可以用于醇生产的酵母包括但不限于酵母属物种(包括酿酒酵母)以及克鲁维酵母属(Kluyveromyces)、拉茜斯酵母属(Lachancea)和裂殖酵母属(Schizosaccharomyces)物种。许多酵母菌株是可商购的,其中许多已被选择或基因工程化以获得所希望的特征,如高醇生产、快速生长速率等。一些酵母已被基因工程化以产生异源酶,如葡糖淀粉酶或α-淀粉酶。
IX.底物和产物
从许多碳水化合物底物(包括但不限于玉米淀粉、甘蔗、木薯和糖蜜)中生产醇是众所周知的,酶条件和化学条件以及机械方法的无数变化和改善也是众所周知的。据信本发明的组合物和方法与这样的底物和条件完全相容。
醇发酵产物包括具有与碳原子键合的羟基官能团(-OH)的有机化合物。示例性醇包括但不限于甲醇、乙醇、正丙醇、异丙醇、正丁醇、异丁醇、正戊醇、2-戊醇、异戊醇、和高级醇。最常制备的燃料醇是乙醇和丁醇。
鉴于本说明书,本发明的酵母菌株和方法的这些和其他方面以及实施例对于技术人员将是清楚的。以下实例旨在进一步说明但不限制组合物和方法。
实例
实例1
材料与方法
液化物制备:
液化物(玉米醪浆料)通过添加600ppm尿素、0.124SAPU/g ds酸性真菌蛋白酶、0.33GAU/g ds变体里氏木霉(Trichoderma reesei)葡糖淀粉酶以及1.46SSCU/g ds白曲霉(Aspergillus kawachii)α-淀粉酶,用硫酸调节至pH 4.8来制备。
AnKom测定:
将300μL浓缩的酵母过夜培养物添加至填充有50g制备的液化物(参见上述)的多个ANKOM瓶中的每个中,以达到0.3的最终OD。然后将这些瓶伴随150RPM振荡在32℃孵育55小时。
HPLC分析:
通过在14,000RPM离心12分钟来将来自AnKom测定的培养物的样品收集在Eppendorf管中。将上清液用0.2μM PTFE过滤器过滤,并且然后用于以下条件下的HPLC(安捷伦科技公司(Agilent Technologies)1200系列)分析:Bio-Rad Aminex HPX-87H柱,运行温度为55℃。0.6ml/min等度流速,0.01N H2SO4,2.5μl注射体积。使用校准标准物定量乙酸、乙醇、甘油、葡萄糖和其他所需分子。所有值以g/L报告。
RNA-Seq分析:
根据TRIzol法(生命技术公司(Life-Tech),罗克韦尔,马里兰州)从单个样品中制备RNA。然后用Qiagen RNeasy迷你试剂盒(凯杰公司(Qiagen),日耳曼敦,马里兰州)清洁RNA。使用应用生物系统公司(Applied Biosystems)的大容量cDNA逆转录试剂盒(赛默飞世尔科技公司(Thermo Fisher Scientific),威明顿市,特拉华州)由单个样品的总mRNA生成cDNA。使用鸟枪法(shotgun method)对制备的每个样品的cDNA进行测序,然后相对于单个基因进行定量。结果报告为每千碱基千万转录物的读数(RPK10M),并用于对样品中每个转录物的量进行定量。
实例2
酵母中KGD2、ADR1、DAL80和MRS6的表达
为了理解酵母中KGD2(YDR148C)的调节,在酿酒酵母菌株FERMAXTMGold(玛翠公司(Martrex Inc.),明尼苏达州,美国;本文中缩写为“FG”)(一种用于乙醇生产的标准菌株)上进行了RNA-Seq分析。如实例1中所述进行RNA-Seq。表达水平报告为每千碱基千万转录物的读数(RPK10M)。
总体而言,KGD2的转录处于低水平,并在发酵期间在24小时时达到最高表达。为了选择在发酵后期用于增加KGD2表达的启动子,分析了ADR1(YDR216W)、DAL80(YKR034W)和MRS6(YOR370C)基因的转录并与KGD2进行比较(表1)。DAL80的转录在32小时时达到最高水平,然后几乎保持恒定。MRS6在36小时时达到最高表达,然后在48小时时下降。只有ADR1(YDR216W)逐渐升高,并在48小时时达到其最高水平。为了增加发酵后期KGD2的表达,选择了ADR1启动子。
表1.发酵期间FG菌株中KGD2和ADR1(YDR216W)基因的RNA-Seq分析
实例3
KGD2表达盒的制备
酿酒酵母的KGD2基因经过密码子优化并合成以生成KGD2s。合成的KGD2s的氨基酸序列与野生型KGD2相同。将ADR1启动子和CPR1终止子(YDR155C基因座)功能性地连接到经密码子优化的KGD2s的编码序列上,以生成ADR1::KGD2s::CPR1表达盒。然后将KGD2表达盒引入FG菌株的RPA190基因座(YOR341W)下游。亲本菌株中KGD2s表达盒的预期插入通过PCR证实。
KGD2的氨基酸序列如以下SEQ ID NO:1所示:
经密码子优化的KGD2s编码区的DNA序列如以下SEQ ID NO:2所示:
ADR1启动子的DNA序列如以下SEQ ID NO:3所示:
CPR1终止子区的DNA序列如以下SEQ ID NO:4所示:
实例5
酵母中MIG3和SUI3的表达
为了了解酵母中MIG3(YER028C)和SUI3(YPL237W)的调节,对菌株FG进行了RNA-Seq分析,如上所述。如实例1中所述进行RNA-Seq并且结果总结于表2中。表达水平表示为每千碱基千万转录物的读数(RPK10M)。结果表明,在FG菌株中,在发酵期间MIG3以低水平表达。SUI3的表达比MIG3高约10倍。选择SUI3启动子来驱动MIG3的表达。
表2.发酵期间FG中MIG表达的RNA-Seq分析
实例6
MIG3表达盒的制备
酿酒酵母MIG3经过密码子优化并合成以生成MIG3s。合成的MIG3s的氨基酸序列与野生型MIG3的相同。将SUI3启动子和GPD1终止子可操作地连接到经编码优化的MIG3的编码序列上,以生成SUI3::MIG3s::GPD1表达盒。然后将MIG3表达盒引入FG菌株的RPA190基因座(YOR341W)下游。亲本菌株中MIG3s表达盒的预期插入通过PCR证实。
MIG3多肽的氨基酸序列如以下SEQ ID NO:5所示:
经密码子优化的MIG3编码区的DNA序列如以下SEQ ID NO:6所示:
SUI3启动子的DNA序列如以下SEQ ID NO:7所示:
GPD1终止子的DNA序列如以下SEQ ID NO:8所示:
实例6
用于表达KGD2和MIG3两者的表达盒的生成
构建体pZK90-D2G3含有实例3和6中描述的KGD2和MIG3过表达盒两者。将含有KGD2和MIG3过表达盒两者的DNA片段引入FG菌株的RPA190基因座(YOR341W)下游。亲本菌株中KGD2s和MIG3s表达盒的预期插入通过PCR证实。构建体的特征总结在表3中。
表3.具有KGD2和MIG3过表达盒的DNA片段的组分
基因 | 启动子 | 终止子 |
KGD2 | ADR1 | CPR1 |
MIG3 | SUI3 | GPD1 |
实例7
使用过表达KGD2、MIG3或KGD2和MIG3一起的酵母的醇生产
过表达KGD2(FG-KGD2)、MIG3(FG-MIG3)或KGD2和MIG3一起(FG-KGD2+MIG3)的酵母菌株以及相应的亲本菌株FG通过含有50g液化物的Ankom测定测试,如实例1中所述。在32℃进行发酵55小时。通过HPLC分析发酵结束的样品。结果总结在表4中。
表4.来自小瓶测定的HPLC结果
与亲本菌株FG相比,使用ADR1启动子的KGD2过表达导致乙醇生产增加大约2%,而没有明显影响乙酸和甘油的生产。使用SUI3启动子的MIG3过表达导致乙酸和甘油生产分别减少约28%和12%。
出人意料的是,与使用EFB1启动子的MIG3过表达不同,使用SUI3启动子的MIG3过表达还导致乙醇生产增加0.7%(数据未显示)。值得注意的是,EFB1启动子比SUI3启动子强得多。然而,EFB1启动子在发酵的早期就驱动峰值表达水平,而不是像SUI3启动子那样在发酵中几乎恒定。使用ADR1启动子的KGD2过表达和使用SUI3启动子的MIG3过表达一起导致乙醇生产增加大约2.5%,乙酸和甘油生产分别减少27%和10.6%。
序列表
<110> 美国丹尼斯科公司(DANISCO US INC)
<120> 通过过表达增加乙醇生产
<130> NB41694-WO-PCT
<140> 尚未指定
<141> 一并同时提交
<150> US 63/186,332
<151> 2021-05-10
<160> 8
<170> PatentIn 3.5版
<210> 1
<211> 463
<212> PRT
<213> 酿酒酵母(S. cerevisiae)
<400> 1
Met Leu Ser Arg Ala Thr Arg Thr Ala Ala Ala Lys Ser Leu Val Lys
1 5 10 15
Ser Lys Val Ala Arg Asn Val Met Ala Ala Ser Phe Val Lys Arg His
20 25 30
Ala Ser Thr Ser Leu Phe Lys Gln Ala Asn Lys Val Glu Ser Leu Gly
35 40 45
Ser Ile Tyr Leu Ser Gly Lys Lys Ile Ser Val Ala Ala Asn Pro Phe
50 55 60
Ser Ile Thr Ser Asn Arg Phe Lys Ser Thr Ser Ile Glu Val Pro Pro
65 70 75 80
Met Ala Glu Ser Leu Thr Glu Gly Ser Leu Lys Glu Tyr Thr Lys Asn
85 90 95
Val Gly Asp Phe Ile Lys Glu Asp Glu Leu Leu Ala Thr Ile Glu Thr
100 105 110
Asp Lys Ile Asp Ile Glu Val Asn Ser Pro Val Ser Gly Thr Val Thr
115 120 125
Lys Leu Asn Phe Lys Pro Glu Asp Thr Val Thr Val Gly Glu Glu Leu
130 135 140
Ala Gln Val Glu Pro Gly Glu Ala Pro Ala Glu Gly Ser Gly Glu Ser
145 150 155 160
Lys Pro Glu Pro Thr Glu Gln Ala Glu Pro Ser Gln Gly Val Ala Ala
165 170 175
Arg Glu Asn Ser Ser Glu Glu Thr Ala Ser Lys Lys Glu Ala Ala Pro
180 185 190
Lys Lys Glu Ala Ala Pro Lys Lys Glu Val Thr Glu Pro Lys Lys Ala
195 200 205
Asp Gln Pro Lys Lys Thr Val Ser Lys Ala Gln Glu Pro Pro Val Ala
210 215 220
Ser Asn Ser Phe Thr Pro Phe Pro Arg Thr Glu Thr Arg Val Lys Met
225 230 235 240
Asn Arg Met Arg Leu Arg Ile Ala Glu Arg Leu Lys Glu Ser Gln Asn
245 250 255
Thr Ala Ala Ser Leu Thr Thr Phe Asn Glu Val Asp Met Ser Ala Leu
260 265 270
Met Glu Met Arg Lys Leu Tyr Lys Asp Glu Ile Ile Lys Lys Thr Gly
275 280 285
Thr Lys Phe Gly Phe Met Gly Leu Phe Ser Lys Ala Cys Thr Leu Ala
290 295 300
Ala Lys Asp Ile Pro Ala Val Asn Gly Ala Ile Glu Gly Asp Gln Ile
305 310 315 320
Val Tyr Arg Asp Tyr Thr Asp Ile Ser Val Ala Val Ala Thr Pro Lys
325 330 335
Gly Leu Val Thr Pro Val Val Arg Asn Ala Glu Ser Leu Ser Val Leu
340 345 350
Asp Ile Glu Asn Glu Ile Val Arg Leu Ser His Lys Ala Arg Asp Gly
355 360 365
Lys Leu Thr Leu Glu Asp Met Thr Gly Gly Thr Phe Thr Ile Ser Asn
370 375 380
Gly Gly Val Phe Gly Ser Leu Tyr Gly Thr Pro Ile Ile Asn Ser Pro
385 390 395 400
Gln Thr Ala Val Leu Gly Leu His Gly Val Lys Glu Arg Pro Val Thr
405 410 415
Val Asn Gly Gln Ile Val Ser Arg Pro Met Met Tyr Leu Ala Leu Thr
420 425 430
Tyr Asp His Arg Leu Leu Asp Gly Arg Glu Ala Val Thr Phe Leu Lys
435 440 445
Thr Val Lys Glu Leu Ile Glu Asp Pro Arg Lys Met Leu Leu Trp
450 455 460
<210> 2
<211> 1392
<212> DNA
<213> 人工序列
<220>
<223> KGD2
<400> 2
atgttgtcca gagctaccag aactgctgct gccaagtctt tggtcaagtc caaagttgcc 60
agaaacgtca tggctgcctc ttttgtcaag agacacgctt ctaccagctt gttcaagcaa 120
gccaacaagg tcgaatcctt aggttctatc tacttgtcag gcaagaaaat ctctgttgct 180
gccaatccat tttccattac ttccaacaga ttcaagtcta ccagtatcga agttcctcca 240
atggccgaat ccttgaccga aggttctttg aaggaataca ccaagaacgt cggtgacttt 300
atcaaggaag acgagctatt agctaccatt gaaaccgaca agatcgatat cgaagtcaac 360
tctccagttt ctggtactgt caccaagttg aacttcaagc cagaagacac tgttaccgtt 420
ggtgaagagt tggctcaagt cgaaccaggt gaagctccag ctgaaggttc tggcgaatcc 480
aagccagaac ctaccgaaca agctgaacca tctcaaggtg ttgctgccag agaaaactcc 540
agcgaggaaa ctgcttccaa gaaagaagct gcaccaaaga aagaagccgc tccaaagaaa 600
gaagttaccg aacccaaaaa ggctgatcaa ccaaagaaaa ccgtctccaa ggctcaagaa 660
cctccagttg cttccaactc ttttactcca tttccacgta ccgaaaccag agtcaagatg 720
aacagaatga ggttgagaat tgccgaacgt ttgaaggaat ctcaaaacac tgctgcctcc 780
ttaactacct tcaacgaagt tgacatgtct gctttgatgg aaatgagaaa gctatacaag 840
gacgaaatta tcaagaaaac tggtaccaag tttggcttca tgggtttgtt ctccaaggct 900
tgtaccttgg ctgccaagga cattccagct gtcaacggtg ccatcgaagg tgaccaaatc 960
gtttacagag attacaccga catttctgtc gctgttgcca ctccaaaggg tttggtcact 1020
ccagttgtca gaaatgccga atccttgtct gtcttagaca tcgagaacga aattgttcgt 1080
ttgtctcaca aggccagaga tggcaagtta actctagaag acatgactgg tggcaccttt 1140
actatttcca atggtggcgt ctttggttcc ttgtacggta ctcccattat caactctcca 1200
caaactgctg ttctaggttt gcacggtgtc aaggaaagac cagttaccgt caacggtcaa 1260
atcgtttcca gaccaatgat gtacttagct ttgacttacg atcacagatt gctagatggc 1320
agagaagctg ttaccttctt gaagactgtc aaggaattga tcgaagatcc aagaaagatg 1380
ttgctatggt aa 1392
<210> 3
<211> 800
<212> DNA
<213> 人工序列
<220>
<223> ADR1启动子
<400> 3
aaacgccgga ggcgacggca aggatgacca tggcgtcaag gaacatgcgg aacatgctag 60
aatgattgcc gaaagggcgg gccctagtca atggcatcat gcgggaacgt gcagtaaagc 120
tcaatgtaac ttgatgccta tcggcgttgt tcgagctaag gacgatgtga tagactgagg 180
aagggtgaac gtcgatagtt agccgttgtg aactaaataa aagtgatatt ttttcgttgt 240
tcttttcttg gtttggcggt gttttcattt gttttcatgt tttacttcgc cacaggtttt 300
caaagaacaa cgccttaaaa ataggaaaac gttttcgcta caggtgttgt tattattgtt 360
gttgtgctgt tgtttattgt gctatacttg tggtatttat tctggacttc cgatcggaaa 420
ttttcttccc ttgaagacct tttgaagaca acagttatat atcattgatc tgaatttctc 480
aggctatttt caaaattcca tacctcctta ttccaacatt tgctcgacta ctatagaaaa 540
gccttattct tttatctttg aaagaaagaa aaggtgtcat agcaaaagtt tattgttact 600
ctgttttgat atactccctc ttattcgttg gaagtataag attgatttgc ataaattaac 660
caatcatttt gctactttcc cggttctccc tttattataa acacttcaga aaaatattct 720
gctactattc cttactttac tataagaatt ttgttttcca aaaaaaaaaa atataaaaaa 780
aataatcata ctctattact 800
<210> 4
<211> 402
<212> DNA
<213> 人工序列
<220>
<223> CPR1终止子区
<400> 4
ttccgctctg cctggaacaa tacagcaaaa attgaaacga actattctct cttaaattat 60
atgtatatgt ataaggtatg tgtatgtatg acaatcaatt cttataacta actcttccta 120
cctatatcgg agtatgttgc caggcttgcc aactagccaa ctaccgccta ggttatgcct 180
gttaagttga cgccgccggg taccgccccc tgtttttccc acgtttcttt ctttcctcta 240
ggtaacggat tttcggctta ttttccacag cgagaaaatt ttctttctcc tgcacttttt 300
ttcatattca ggtcctttga tccattcatt cattcttgtt ttaatttcaa tctagatttt 360
agttttcggt tgaatttatc tttagttttt tatcaataag aa 402
<210> 5
<211> 394
<212> PRT
<213> 酿酒酵母(S. cerevisiae)
<400> 5
Met Asn Tyr Leu Arg Asp Arg Phe Pro Pro Asp Asn Asp Gln Arg Pro
1 5 10 15
Phe Arg Cys Glu Ile Cys Ser Arg Gly Phe His Arg Leu Glu His Lys
20 25 30
Lys Arg His Val Arg Thr His Thr Gly Glu Lys Pro His Lys Cys Thr
35 40 45
Val Gln Gly Cys Pro Lys Ser Phe Ser Arg Ser Asp Glu Leu Lys Arg
50 55 60
His Leu Arg Thr His Thr Lys Gly Val Gln Arg Arg Arg Ile Lys Ser
65 70 75 80
Lys Gly Ser Arg Lys Thr Val Val Asn Thr Ala Thr Ala Ala Pro Thr
85 90 95
Thr Phe Asn Glu Asn Thr Gly Val Ser Leu Thr Gly Ile Gly Gln Ser
100 105 110
Lys Val Pro Pro Ile Leu Ile Ser Val Ala Gln Asn Cys Asp Asp Val
115 120 125
Asn Ile Arg Asn Thr Gly Asn Asn Asn Gly Ile Val Glu Thr Gln Ala
130 135 140
Pro Ala Ile Leu Val Pro Val Ile Asn Ile Pro Asn Asp Pro His Pro
145 150 155 160
Ile Pro Ser Ser Leu Ser Thr Thr Ser Ile Thr Ser Ile Ala Ser Val
165 170 175
Tyr Pro Ser Thr Ser Pro Phe Gln Tyr Leu Lys Ser Gly Phe Pro Glu
180 185 190
Asp Pro Ala Ser Thr Pro Tyr Val His Ser Ser Gly Ser Ser Leu Ala
195 200 205
Leu Gly Glu Leu Ser Ser Asn Ser Ser Ile Phe Ser Lys Ser Arg Arg
210 215 220
Asn Leu Ala Ala Met Ser Gly Pro Asp Ser Leu Ser Ser Ser Lys Asn
225 230 235 240
Gln Ser Ser Ala Ser Leu Leu Ser Gln Thr Ser His Pro Ser Lys Ser
245 250 255
Phe Ser Arg Pro Pro Thr Asp Leu Ser Pro Leu Arg Arg Ile Met Pro
260 265 270
Ser Val Asn Thr Gly Asp Met Glu Ile Ser Arg Thr Val Ser Val Ser
275 280 285
Ser Ser Ser Ser Ser Leu Thr Ser Val Thr Tyr Asp Asp Thr Ala Ala
290 295 300
Lys Asp Met Gly Met Gly Ile Phe Phe Asp Arg Pro Pro Val Thr Gln
305 310 315 320
Lys Ala Cys Arg Ser Asn His Lys Tyr Lys Val Asn Ala Val Ser Arg
325 330 335
Gly Arg Gln His Glu Arg Ala Gln Phe His Ile Ser Gly Asp Asp Glu
340 345 350
Asp Ser Asn Val His Arg Gln Glu Ser Arg Ala Ser Asn Thr Ser Pro
355 360 365
Asn Val Ser Leu Pro Pro Ile Lys Ser Ile Leu Arg Gln Ile Asp Asn
370 375 380
Phe Asn Ser Ala Pro Ser Tyr Phe Ser Lys
385 390
<210> 6
<211> 1185
<212> DNA
<213> 人工序列
<220>
<223> MIG3编码区
<400> 6
atgaattacc tgcgagatag atttcctccg gataatgacc aaagaccctt tagatgtgaa 60
atttgttcac gaggtttcca cagacttgaa cataaaaaaa ggcacgtaag aacgcacact 120
ggcgagaagc ctcacaaatg taccgttcag ggctgtccga aaagcttcag ccgaagcgat 180
gaactaaaaa gacatttgag gacacatact aaaggcgtcc aaaggcgcag aataaaatcc 240
aagggctcgc gaaaaaccgt tgtgaatact gctaccgccg cccctaccac cttcaatgaa 300
aacactggtg tttcgctcac ggggataggt caatctaaag tgccacctat tcttatctcc 360
gttgctcaga attgcgatga cgtgaatata cgaaatactg gaaataataa tggcattgtg 420
gagacacagg cacctgcaat tttagtgcct gtgataaata ttccaaatga ccctcatccg 480
attccaagta gcctctccac tacttctatc acctccattg catcagtata tccctctact 540
tctccattcc agtacctgaa aagcgggttt cctgaagatc ctgcatctac accgtatgta 600
cattcgtccg gaagttcttt agccctgggt gaattgtctt caaactcctc tatattttcg 660
aaatctagga ggaatttggc cgccatgagt ggtcctgatt ctttgagtag ttctaaaaac 720
caatccagtg cttcgcttct ttctcaaact tcacatccat caaagagctt ttcaagaccg 780
ccaacagact taagtcctct gcgaagaatc atgccttctg taaacacagg agacatggaa 840
atttcaagga cagtatccgt ttcgagcagt tcatcatcac tcacttctgt tacgtatgat 900
gacaccgcgg ctaaagacat gggcatggga atattttttg ataggccacc tgtaacacag 960
aaagcttgca ggagcaatca taagtacaag gttaatgctg ttagcagagg gagacaacat 1020
gaaagggcac aatttcatat atctggagat gatgaggaca gtaacgttca ccgccaagaa 1080
tcaagagcat ccaacacaag tcccaatgta tcattgcctc cgataaagag cattttgcga 1140
caaattgata atttcaacag tgctccttct tacttcagta aataa 1185
<210> 7
<211> 762
<212> DNA
<213> 人工序列
<220>
<223> SUI3启动子
<400> 7
cttttgttaa tgaggtgaac aaaacaggca acacggcttt acattgggcg tcgttgaatg 60
gcaaattaga cgtggtcaag ctactgtgtg atgaatatga ggcagacccc tttattagaa 120
acaaattcgg ccacgatgct atctttgagg ccgagaacag cgggaaggaa gaagtggaaa 180
catacttttt gaagaagtat gatgtcgaac ctgaagatga tgaagaagac acacaaactg 240
agggcaagaa ttcggtccaa atcacaaagg gtacagaaat tgaacaagtc accaaggaag 300
ccaccgaggc tttaagagaa gaaaccgaga aactgaatat aaataaagac taaagtaaag 360
agcttgtttt cttcgtggaa taaaagccgt aatatccatt gagcatgata ttttatttct 420
tggtaccgga gaagataaaa ggatggaacc agcagaaatg cattaataaa tacaaaaaat 480
ttagaaaaga gttacagtaa taatgtatat ttctctcaca aactaagtaa ccgcttcaaa 540
agcgtatatt ttgaatgcat tgaactttcc atttcattac ccgcagagcc ggagtcctca 600
tcaacgacgg agtgaaaaat tttgtaattg cgagaaaaag tgaaattgat ggaaaaaaag 660
aaaaagaaag tagaagaaga cattataaga gagactagga aacttcttgc acatcaaccg 720
aaaagcgcct aggcaaccag tcatataata agcacgcacg ag 762
<210> 8
<211> 352
<212> DNA
<213> 人工序列
<220>
<223> GPD1终止子
<400> 8
atttattgga gaaagataac atatcatact ttcccccact tttttcgagg ctcttctata 60
tcatattcat aaattagcat tatgtcattt ctcataacta ctttatcacg ttagaaatta 120
cttattatta ttaaattaat acaaaattta gtaaccaaat aaatataaat aaatatgtat 180
atttgaattt taaaaaaaaa atcctataga gcaaaaggat tttccattat aatattagct 240
gtacacctct tccgcatttt ttgagggtgg ttacaacacc actcattcag aggctgtcgg 300
cacagttgct tctagcatct ggcgtccgta tgtatgggtg tattttaaat aa 352
Claims (19)
1.一种衍生自亲本酵母细胞的经修饰的酵母细胞,所述经修饰的细胞包含遗传改变,所述遗传改变引起所述经修饰的细胞与所述亲本细胞相比产生增加量的α-酮戊二酸脱氢酶(KGD2)多肽,其中与在其他方面相同的亲本酵母细胞产生的乙醇量相比,所述经修饰的细胞在发酵期间产生增加量的乙醇。
2.如权利要求1所述的经修饰的细胞,其中所述遗传改变包括将核酸引入所述亲本细胞中,所述核酸能够指导KGD2多肽以高于等同条件下生长的亲本细胞的水平表达。
3.如权利要求2所述的经修饰的细胞,其中所述遗传改变包括引入用于表达KGD2多肽的表达盒。
4.如权利要求3所述的经修饰的细胞,其中所述表达盒包含外源KGD2基因。
5.如权利要求2所述的经修饰的细胞,其中所述核酸包含导致发酵后期KGD2多肽的表达增加的启动子。
6.如权利要求2所述的经修饰的细胞,其中所述核酸包含可操作地连接到所述KGD2多肽的编码序列的ADR1启动子。
7.如权利要求1-6中任一项所述的经修饰的细胞,其中与在等同条件下生长的所述亲本细胞中的表达水平相比,KGD2多肽的表达增加量是至少20%、至少30%、至少40%、至少50%、至少70%、至少100%、至少150%、至少200%、或至少500%或更多。
8.如权利要求1-6中任一项所述的经修饰的细胞,其中与在等同条件下生长的所述亲本细胞产生的KGD2 mRNA的量相比,所述经修饰的细胞产生的KGD2 mRNA的量增加至少2倍、至少5倍、至少10倍、至少20倍、至少50倍、至少100倍或更多。
9.如权利要求1-8中任一项所述的经修饰的细胞,其中所述细胞进一步包含遗传改变,所述遗传改变引起所述经修饰的细胞与所述亲本细胞相比产生增加量的转录调节因子MIG3多肽。
10.如权利要求1-9中任一项所述的经修饰的细胞,其中所述细胞进一步包含编码碳水化合物加工酶的外源基因。
11.如权利要求1-10中任一项所述的经修饰的细胞,其进一步包含PKL途径。
12.如权利要求1-11中任一项所述的经修饰的细胞,其进一步包含甘油途径和/或乙酰辅酶A途径中的改变。
13.如权利要求1-12中任一项所述的经修饰的细胞,其进一步包含用于制备乙醇的替代途径。
14.如权利要求1-13中任一项所述的经修饰的细胞,其中所述细胞属于酵母属物种(Saccharomyces spp)。
15.一种用于增加生长于碳水化合物底物上的酵母细胞的醇生产的方法,所述方法包括:向亲本酵母细胞中引入遗传改变,与在其他方面相同的亲本细胞中产生的量相比,所述遗传改变使KGD2多肽的产生增加。
16.如权利要求15所述的方法,其中所述具有引入的遗传改变的经修饰的细胞是经修饰的细胞,所述经修饰的细胞是如权利要求1-14中任一项所述的细胞。
17.如权利要求15或16所述的方法,其中在等同发酵条件下,醇的生产增加至少0.5%。
18.如权利要求15-17中任一项所述的方法,其中与在等同条件下生长的其他方面相同的亲本细胞产生的KGD2的量相比,KGD2的生产的增加是至少20%、至少30%、至少40%、至少50%、至少70%、至少100%、至少150%、至少200%、或至少500%或更多的增加。
19.如权利要求15-18中任一项所述的方法,其中与在等同条件下生长的其他方面相同的亲本细胞产生的KGD2 mRNA的量相比,所述经修饰的细胞产生的KGD2 mRNA的量增加至少2倍、至少5倍、至少10倍、至少20倍、至少50倍、至少100倍或更多。
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202163186332P | 2021-05-10 | 2021-05-10 | |
US63/186,332 | 2021-05-10 | ||
PCT/US2022/028530 WO2022240839A1 (en) | 2021-05-10 | 2022-05-10 | Increased ethanol production by over-expression of kgd2 in yeast |
Publications (1)
Publication Number | Publication Date |
---|---|
CN117396608A true CN117396608A (zh) | 2024-01-12 |
Family
ID=81854523
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202280033093.0A Pending CN117396608A (zh) | 2021-05-10 | 2022-05-10 | 通过在酵母中过表达kgd2增加乙醇生产 |
Country Status (5)
Country | Link |
---|---|
US (1) | US20240318207A1 (zh) |
EP (1) | EP4355881A1 (zh) |
CN (1) | CN117396608A (zh) |
AR (1) | AR125830A1 (zh) |
WO (1) | WO2022240839A1 (zh) |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2060632A1 (en) | 2007-10-29 | 2009-05-20 | Technische Universität Berlin | Method of modifying a yeast cell for the production of ethanol |
EP2277989A1 (en) | 2009-07-24 | 2011-01-26 | Technische Universiteit Delft | Fermentative glycerol-free ethanol production |
US8956851B2 (en) | 2011-04-05 | 2015-02-17 | Lallemand Hungary Liquidity Management, LLC | Methods for the improvement of product yield and production in a microorganism through the addition of alternate electron acceptors |
CA2920114A1 (en) | 2013-08-15 | 2015-02-19 | Lallemand Hungary Liquidity Management Llc | Methods for the improvement of product yield and production in a microorganism through glycerol recycling |
EP3122876B1 (en) | 2014-03-28 | 2020-11-25 | Danisco US Inc. | Altered host cell pathway for improved ethanol production |
WO2015167043A1 (ko) * | 2014-04-30 | 2015-11-05 | 삼성전자 주식회사 | 증가된 알파-케토글루타레이트 데카르복실라제 활성을 갖는 미생물 및 이를 이용한 1,4-부탄디올 생산방법 |
-
2022
- 2022-05-10 EP EP22726915.6A patent/EP4355881A1/en active Pending
- 2022-05-10 US US18/290,026 patent/US20240318207A1/en active Pending
- 2022-05-10 CN CN202280033093.0A patent/CN117396608A/zh active Pending
- 2022-05-10 AR ARP220101239A patent/AR125830A1/es unknown
- 2022-05-10 WO PCT/US2022/028530 patent/WO2022240839A1/en active Application Filing
Also Published As
Publication number | Publication date |
---|---|
WO2022240839A1 (en) | 2022-11-17 |
EP4355881A1 (en) | 2024-04-24 |
AR125830A1 (es) | 2023-08-16 |
US20240318207A1 (en) | 2024-09-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110177801B (zh) | 具有改善的醇生产的酵母 | |
CN110741014B (zh) | 具有改善的醇生产的酵母 | |
CN112041449B (zh) | 通过过表达pab1的酵母来降低乙酸生产 | |
CN110366594B (zh) | 双功能磷酸转酮酶-磷酸转乙酰酶融合多肽 | |
CN112384609B (zh) | 转录激活因子/阻遏因子gis1在酵母中的过表达用于增加乙醇生产 | |
CN115552016A (zh) | 通过过表达mig多肽的酵母来降低乙酸生产 | |
WO2020069067A1 (en) | Over expression of ribonucleotide reductase inhibitor in yeast for increased ethanol production | |
CN117396608A (zh) | 通过在酵母中过表达kgd2增加乙醇生产 | |
CN112513261B (zh) | 富马酸还原酶过表达导致酵母中增加的发酵速率 | |
CN111201313B (zh) | 通过具有组成型转录激活因子mal等位基因的酵母增加乙醇生产 | |
CN113795503A (zh) | 细胞色素b2在酵母中过表达用于增加乙醇生产 | |
CN118159649A (zh) | 通过rsf2或tda9表达降低的酵母减少产生的乙酸 | |
WO2020186254A1 (en) | Over-expression of fumarate-succinate transporter in yeast for increased ethanol and reduced acetate production | |
US20230116556A1 (en) | Increased ethanol production by overexpression of jid1 in yeast | |
US20230331789A1 (en) | Over-expression of gds1 in yeast for increased ethanol and decreased acetate production | |
WO2021022140A1 (en) | Over-expression of pho13 for increased ethanol production by yeast | |
WO2021022097A1 (en) | Over-expression of adh5p for increased ethanol production by yeast | |
WO2020068907A1 (en) | Selected phosphotransacetylase genes for increased ethanol production in engineered yeast | |
CN113646421A (zh) | 具有增加的乙醇生产的杂交酵母 | |
WO2019195382A1 (en) | Increased alcohol production from yeast producing an increased amount of active hac1 protein | |
EP3571218A1 (en) | Modified yeast cells that overexpress a dna polymerase subunit |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |