OA18211A - Aglycosylated enzyme and uses thereof. - Google Patents
Aglycosylated enzyme and uses thereof. Download PDFInfo
- Publication number
- OA18211A OA18211A OA1201600479 OA18211A OA 18211 A OA18211 A OA 18211A OA 1201600479 OA1201600479 OA 1201600479 OA 18211 A OA18211 A OA 18211A
- Authority
- OA
- OAPI
- Prior art keywords
- polypeptide
- amino acid
- enzyme
- biomass
- acid position
- Prior art date
Links
- 102000004190 Enzymes Human genes 0.000 title claims description 189
- 108090000790 Enzymes Proteins 0.000 title claims description 189
- 229920001184 polypeptide Polymers 0.000 claims abstract description 248
- 108010047754 beta-Glucosidase Proteins 0.000 claims abstract description 128
- 102000006995 beta-Glucosidase Human genes 0.000 claims abstract description 128
- 239000000203 mixture Substances 0.000 claims abstract description 109
- 230000000694 effects Effects 0.000 claims abstract description 102
- 210000004027 cells Anatomy 0.000 claims description 155
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 119
- 150000007523 nucleic acids Chemical group 0.000 claims description 97
- 239000002028 Biomass Substances 0.000 claims description 80
- 239000000047 product Substances 0.000 claims description 80
- 229920001850 Nucleic acid sequence Polymers 0.000 claims description 73
- 230000003899 glycosylation Effects 0.000 claims description 61
- 238000006206 glycosylation reaction Methods 0.000 claims description 61
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 claims description 57
- 239000004473 Threonine Substances 0.000 claims description 56
- 241000499912 Trichoderma reesei Species 0.000 claims description 55
- 235000000346 sugar Nutrition 0.000 claims description 52
- MTCFGRXMJLQNBG-UWTATZPHSA-N D-serine Chemical compound OC[C@@H](N)C(O)=O MTCFGRXMJLQNBG-UWTATZPHSA-N 0.000 claims description 51
- 239000000123 paper Substances 0.000 claims description 51
- 150000001413 amino acids Chemical class 0.000 claims description 48
- WQZGKKKJIJFFOK-GASJEMHNSA-N D-Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 claims description 47
- 239000008103 glucose Substances 0.000 claims description 47
- 244000005700 microbiome Species 0.000 claims description 43
- 241000588724 Escherichia coli Species 0.000 claims description 39
- 230000035772 mutation Effects 0.000 claims description 33
- 239000002699 waste material Substances 0.000 claims description 33
- -1 cardstock Substances 0.000 claims description 32
- 239000000463 material Substances 0.000 claims description 30
- GUBGYTABKSRVRQ-CUHNMECISA-N D-Cellobiose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-CUHNMECISA-N 0.000 claims description 27
- 238000004519 manufacturing process Methods 0.000 claims description 26
- 108020004707 nucleic acids Proteins 0.000 claims description 25
- 239000010902 straw Substances 0.000 claims description 25
- 230000001580 bacterial Effects 0.000 claims description 23
- SRBFZHDQGSBBOR-SQOUGZDYSA-N Xylose Natural products O[C@@H]1CO[C@@H](O)[C@@H](O)[C@@H]1O SRBFZHDQGSBBOR-SQOUGZDYSA-N 0.000 claims description 21
- 239000000758 substrate Substances 0.000 claims description 18
- 230000001965 increased Effects 0.000 claims description 17
- 240000005979 Hordeum vulgare Species 0.000 claims description 15
- 235000007340 Hordeum vulgare Nutrition 0.000 claims description 15
- 239000002023 wood Substances 0.000 claims description 15
- 229920002678 cellulose Polymers 0.000 claims description 14
- 239000001913 cellulose Substances 0.000 claims description 14
- 238000000746 purification Methods 0.000 claims description 14
- 241001019659 Acremonium <Plectosphaerellaceae> Species 0.000 claims description 12
- 108010008885 Cellulose 1,4-beta-Cellobiosidase Proteins 0.000 claims description 12
- 241000223259 Trichoderma Species 0.000 claims description 12
- 230000001939 inductive effect Effects 0.000 claims description 12
- 239000010907 stover Substances 0.000 claims description 12
- 229960003487 Xylose Drugs 0.000 claims description 11
- 241000209149 Zea Species 0.000 claims description 11
- 235000002017 Zea mays subsp mays Nutrition 0.000 claims description 11
- 235000005822 corn Nutrition 0.000 claims description 11
- 235000005824 corn Nutrition 0.000 claims description 11
- 244000075850 Avena orientalis Species 0.000 claims description 10
- 235000007319 Avena orientalis Nutrition 0.000 claims description 10
- 235000007558 Avena sp Nutrition 0.000 claims description 10
- 108010059892 Cellulase Proteins 0.000 claims description 10
- 240000007594 Oryza sativa Species 0.000 claims description 10
- 235000007164 Oryza sativa Nutrition 0.000 claims description 10
- 240000008529 Triticum aestivum Species 0.000 claims description 10
- 101700006119 XYL1 Proteins 0.000 claims description 10
- 101700047052 XYLA Proteins 0.000 claims description 10
- 101700051122 XYLD Proteins 0.000 claims description 10
- 101700065756 XYN4 Proteins 0.000 claims description 10
- 101700001256 Xyn Proteins 0.000 claims description 10
- 230000002401 inhibitory effect Effects 0.000 claims description 10
- 108010062085 ligninase Proteins 0.000 claims description 10
- 239000003550 marker Substances 0.000 claims description 10
- 235000009566 rice Nutrition 0.000 claims description 10
- 235000021307 wheat Nutrition 0.000 claims description 10
- 101700065693 xlnA Proteins 0.000 claims description 10
- 101700006979 xyl2 Proteins 0.000 claims description 10
- 101710017636 xynS20E Proteins 0.000 claims description 10
- 235000013339 cereals Nutrition 0.000 claims description 9
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 claims description 9
- 239000003112 inhibitor Substances 0.000 claims description 9
- 241000609240 Ambelania acida Species 0.000 claims description 8
- 241000228212 Aspergillus Species 0.000 claims description 8
- 239000010905 bagasse Substances 0.000 claims description 8
- 150000001720 carbohydrates Chemical class 0.000 claims description 8
- 235000014633 carbohydrates Nutrition 0.000 claims description 8
- 238000006460 hydrolysis reaction Methods 0.000 claims description 8
- AVKUERGKIZMTKX-NJBDSQKTSA-N Ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 claims description 7
- 229960000723 ampicillin Drugs 0.000 claims description 7
- 239000000126 substance Substances 0.000 claims description 7
- 241000209504 Poaceae Species 0.000 claims description 6
- 238000005119 centrifugation Methods 0.000 claims description 6
- 238000004587 chromatography analysis Methods 0.000 claims description 6
- 239000010865 sewage Substances 0.000 claims description 6
- 240000005337 Agave sisalana Species 0.000 claims description 5
- 240000000116 Alocasia Species 0.000 claims description 5
- 241000082175 Arracacia xanthorrhiza Species 0.000 claims description 5
- 235000017166 Bambusa arundinacea Nutrition 0.000 claims description 5
- 240000003917 Bambusa tulda Species 0.000 claims description 5
- 235000017491 Bambusa tulda Nutrition 0.000 claims description 5
- 235000016068 Berberis vulgaris Nutrition 0.000 claims description 5
- 241000335053 Beta vulgaris Species 0.000 claims description 5
- 235000006008 Brassica napus var napus Nutrition 0.000 claims description 5
- 240000000385 Brassica napus var. napus Species 0.000 claims description 5
- 235000006618 Brassica rapa subsp oleifera Nutrition 0.000 claims description 5
- 235000004977 Brassica sinapistrum Nutrition 0.000 claims description 5
- 240000000218 Cannabis sativa Species 0.000 claims description 5
- 241000123346 Chrysosporium Species 0.000 claims description 5
- 235000013162 Cocos nucifera Nutrition 0.000 claims description 5
- 240000007170 Cocos nucifera Species 0.000 claims description 5
- 235000006481 Colocasia esculenta Nutrition 0.000 claims description 5
- 240000000491 Corchorus aestuans Species 0.000 claims description 5
- 235000011777 Corchorus aestuans Nutrition 0.000 claims description 5
- 235000010862 Corchorus capsularis Nutrition 0.000 claims description 5
- 235000009419 Fagopyrum esculentum Nutrition 0.000 claims description 5
- 240000008620 Fagopyrum esculentum Species 0.000 claims description 5
- 210000003608 Feces Anatomy 0.000 claims description 5
- 101710019333 GLU8 Proteins 0.000 claims description 5
- 235000010469 Glycine max Nutrition 0.000 claims description 5
- 240000007842 Glycine max Species 0.000 claims description 5
- 235000002678 Ipomoea batatas Nutrition 0.000 claims description 5
- 240000003613 Ipomoea batatas Species 0.000 claims description 5
- 240000004322 Lens culinaris Species 0.000 claims description 5
- 235000014647 Lens culinaris subsp culinaris Nutrition 0.000 claims description 5
- 235000004431 Linum usitatissimum Nutrition 0.000 claims description 5
- 240000006240 Linum usitatissimum Species 0.000 claims description 5
- 240000003183 Manihot esculenta Species 0.000 claims description 5
- 235000016735 Manihot esculenta subsp esculenta Nutrition 0.000 claims description 5
- 240000004658 Medicago sativa Species 0.000 claims description 5
- 240000003433 Miscanthus floridulus Species 0.000 claims description 5
- 240000005561 Musa balbisiana Species 0.000 claims description 5
- 235000018290 Musa x paradisiaca Nutrition 0.000 claims description 5
- 235000008469 Oxalis tuberosa Nutrition 0.000 claims description 5
- 240000000645 Oxalis tuberosa Species 0.000 claims description 5
- 241001520808 Panicum virgatum Species 0.000 claims description 5
- 240000006169 Phalaris arundinacea Species 0.000 claims description 5
- 235000010627 Phaseolus vulgaris Nutrition 0.000 claims description 5
- 240000005158 Phaseolus vulgaris Species 0.000 claims description 5
- 235000015334 Phyllostachys viridis Nutrition 0.000 claims description 5
- 240000004713 Pisum sativum Species 0.000 claims description 5
- 235000010582 Pisum sativum Nutrition 0.000 claims description 5
- 241000183024 Populus tremula Species 0.000 claims description 5
- 235000010575 Pueraria lobata Nutrition 0.000 claims description 5
- 244000046146 Pueraria lobata Species 0.000 claims description 5
- 229920001131 Pulp (paper) Polymers 0.000 claims description 5
- 235000002595 Solanum tuberosum Nutrition 0.000 claims description 5
- 240000001016 Solanum tuberosum Species 0.000 claims description 5
- 240000003829 Sorghum propinquum Species 0.000 claims description 5
- 235000011684 Sorghum saccharatum Nutrition 0.000 claims description 5
- 241000746413 Spartina Species 0.000 claims description 5
- 235000017585 alfalfa Nutrition 0.000 claims description 5
- 235000017587 alfalfa Nutrition 0.000 claims description 5
- 239000011425 bamboo Substances 0.000 claims description 5
- 235000008984 brauner Senf Nutrition 0.000 claims description 5
- 235000009120 camo Nutrition 0.000 claims description 5
- 239000011111 cardboard Substances 0.000 claims description 5
- 235000005607 chanvre indien Nutrition 0.000 claims description 5
- 235000004879 dioscorea Nutrition 0.000 claims description 5
- 238000000605 extraction Methods 0.000 claims description 5
- 239000011487 hemp Substances 0.000 claims description 5
- 235000012765 hemp Nutrition 0.000 claims description 5
- 239000001257 hydrogen Substances 0.000 claims description 5
- 229910052739 hydrogen Inorganic materials 0.000 claims description 5
- 125000004435 hydrogen atoms Chemical class [H]* 0.000 claims description 5
- 239000010871 livestock manure Substances 0.000 claims description 5
- 235000012766 marijuana Nutrition 0.000 claims description 5
- 239000011087 paperboard Substances 0.000 claims description 5
- 239000002245 particle Substances 0.000 claims description 5
- 210000001236 prokaryotic cell Anatomy 0.000 claims description 5
- 238000000527 sonication Methods 0.000 claims description 5
- 241000894007 species Species 0.000 claims description 5
- 241001438635 Acremonium brachypenium Species 0.000 claims description 4
- 241001438625 Acremonium dichromosporum Species 0.000 claims description 4
- 241000228209 Acremonium persicinum Species 0.000 claims description 4
- 241001019292 Acremonium pinkertoniae Species 0.000 claims description 4
- 241000193830 Bacillus <bacterium> Species 0.000 claims description 4
- 102100002974 CDKN1A Human genes 0.000 claims description 4
- 102100011820 CDKN3 Human genes 0.000 claims description 4
- 101700087043 CDKN3 Proteins 0.000 claims description 4
- 241000223218 Fusarium Species 0.000 claims description 4
- 241001019284 Gliomastix roseogrisea Species 0.000 claims description 4
- 241000223198 Humicola Species 0.000 claims description 4
- 241001480714 Humicola insolens Species 0.000 claims description 4
- SBUJHOSQTJFQJX-NOAMYHISSA-N Kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 claims description 4
- 241000226677 Myceliophthora Species 0.000 claims description 4
- 108091006542 SLC12A9 Proteins 0.000 claims description 4
- 241000190542 Sarocladium kiliense Species 0.000 claims description 4
- 241000906075 Simplicillium obclavatum Species 0.000 claims description 4
- 241001313536 Thermothelomyces thermophila Species 0.000 claims description 4
- 241001494489 Thielavia Species 0.000 claims description 4
- 239000002154 agricultural waste Substances 0.000 claims description 4
- 239000000835 fiber Substances 0.000 claims description 4
- 229960000318 kanamycin Drugs 0.000 claims description 4
- 238000007254 oxidation reaction Methods 0.000 claims description 4
- 101710022044 xeg74 Proteins 0.000 claims description 4
- 241001619326 Cephalosporium Species 0.000 claims description 3
- 241001674013 Chrysosporium lucknowense Species 0.000 claims description 3
- 241000222511 Coprinus Species 0.000 claims description 3
- 244000251987 Coprinus macrorhizus Species 0.000 claims description 3
- 235000001673 Coprinus macrorhizus Nutrition 0.000 claims description 3
- 241000195493 Cryptophyta Species 0.000 claims description 3
- 241000223221 Fusarium oxysporum Species 0.000 claims description 3
- 241000123318 Meripilus giganteus Species 0.000 claims description 3
- 241001674208 Mycothermus thermophilus Species 0.000 claims description 3
- 241000589516 Pseudomonas Species 0.000 claims description 3
- 240000000111 Saccharum officinarum Species 0.000 claims description 3
- 235000007201 Saccharum officinarum Nutrition 0.000 claims description 3
- 241000223255 Scytalidium Species 0.000 claims description 3
- 241001495429 Thielavia terrestris Species 0.000 claims description 3
- 241000378866 Trichoderma koningii Species 0.000 claims description 3
- 241000223261 Trichoderma viride Species 0.000 claims description 3
- ZHSGGJXRNHWHRS-VIDYELAYSA-N Tunicamycin Chemical compound O([C@H]1[C@@H]([C@H]([C@@H](O)[C@@H](CC(O)[C@@H]2[C@H]([C@@H](O)[C@@H](O2)N2C(NC(=O)C=C2)=O)O)O1)O)NC(=O)/C=C/CC(C)C)[C@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1NC(C)=O ZHSGGJXRNHWHRS-VIDYELAYSA-N 0.000 claims description 3
- 238000002425 crystallisation Methods 0.000 claims description 3
- 230000005712 crystallization Effects 0.000 claims description 3
- 238000004880 explosion Methods 0.000 claims description 3
- 238000000227 grinding Methods 0.000 claims description 3
- 230000001771 impaired Effects 0.000 claims description 3
- 230000003647 oxidation Effects 0.000 claims description 3
- 239000010893 paper waste Substances 0.000 claims description 3
- 238000000197 pyrolysis Methods 0.000 claims description 3
- 230000002829 reduced Effects 0.000 claims description 3
- 240000009030 Agave Species 0.000 claims description 2
- 241001474374 Blennius Species 0.000 claims description 2
- 229920000742 Cotton Polymers 0.000 claims description 2
- 240000000907 Musa textilis Species 0.000 claims description 2
- 125000002791 glucosyl group Chemical group C1([C@H](O)[C@@H](O)[C@H](O)[C@H](O1)CO)* 0.000 claims description 2
- 239000002440 industrial waste Substances 0.000 claims description 2
- 150000002772 monosaccharides Chemical class 0.000 claims description 2
- 239000004460 silage Substances 0.000 claims description 2
- 235000001014 amino acid Nutrition 0.000 description 127
- 229940088598 Enzyme Drugs 0.000 description 88
- WQZGKKKJIJFFOK-VFUOTHLCSA-N β-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 45
- 238000006243 chemical reaction Methods 0.000 description 42
- 102000004169 proteins and genes Human genes 0.000 description 35
- 108090000623 proteins and genes Proteins 0.000 description 35
- 239000000523 sample Substances 0.000 description 33
- 238000000034 method Methods 0.000 description 29
- 235000018102 proteins Nutrition 0.000 description 29
- 238000000855 fermentation Methods 0.000 description 22
- 230000004151 fermentation Effects 0.000 description 22
- 230000004927 fusion Effects 0.000 description 17
- 238000004166 bioassay Methods 0.000 description 16
- RAXXELZNTBOGNW-UHFFFAOYSA-N Imidazole Chemical compound C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 14
- 150000004676 glycans Chemical class 0.000 description 14
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 13
- 150000001298 alcohols Chemical class 0.000 description 13
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 12
- 238000007792 addition Methods 0.000 description 11
- 239000000243 solution Substances 0.000 description 11
- 150000008163 sugars Chemical class 0.000 description 11
- UIIMBOGNXHQVGW-UHFFFAOYSA-M buffer Substances [Na+].OC([O-])=O UIIMBOGNXHQVGW-UHFFFAOYSA-M 0.000 description 10
- 239000000543 intermediate Substances 0.000 description 10
- 239000008188 pellet Substances 0.000 description 10
- PXHVJJICTQNCMI-UHFFFAOYSA-N nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 description 9
- 241000894006 Bacteria Species 0.000 description 8
- 125000000539 amino acid group Chemical group 0.000 description 8
- 229920003013 deoxyribonucleic acid Polymers 0.000 description 8
- 230000002255 enzymatic Effects 0.000 description 8
- 230000002538 fungal Effects 0.000 description 8
- 230000001279 glycosylating Effects 0.000 description 8
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 8
- UNXHWFMMPAWVPI-QWWZWVQMSA-N D-Threitol Natural products OC[C@@H](O)[C@H](O)CO UNXHWFMMPAWVPI-QWWZWVQMSA-N 0.000 description 7
- 241000221946 Podospora anserina Species 0.000 description 7
- QTBSBXVTEAMEQO-UHFFFAOYSA-N acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 7
- 239000002253 acid Substances 0.000 description 7
- 238000001042 affinity chromatography Methods 0.000 description 7
- 230000003115 biocidal Effects 0.000 description 7
- 238000010367 cloning Methods 0.000 description 7
- 239000012139 lysis buffer Substances 0.000 description 7
- 150000007524 organic acids Chemical class 0.000 description 7
- 235000005985 organic acids Nutrition 0.000 description 7
- 230000001105 regulatory Effects 0.000 description 7
- 241000196324 Embryophyta Species 0.000 description 6
- BJHIKXHVCXFQLS-UYFOZJQFSA-N Fructose Natural products OC[C@@H](O)[C@@H](O)[C@H](O)C(=O)CO BJHIKXHVCXFQLS-UYFOZJQFSA-N 0.000 description 6
- CSCPPACGZOOCGX-UHFFFAOYSA-N acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 description 6
- 239000003242 anti bacterial agent Substances 0.000 description 6
- 230000001461 cytolytic Effects 0.000 description 6
- BDAGIHXWWSANSR-UHFFFAOYSA-N formic acid Chemical compound OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 6
- PEDCQBHIVMGVHV-UHFFFAOYSA-N glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 6
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl β-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 6
- LRHPLDYGYMQRHN-UHFFFAOYSA-N n-butanol Chemical compound CCCCO LRHPLDYGYMQRHN-UHFFFAOYSA-N 0.000 description 6
- 239000011541 reaction mixture Substances 0.000 description 6
- 239000011780 sodium chloride Substances 0.000 description 6
- 230000035897 transcription Effects 0.000 description 6
- 102000005575 Cellulases Human genes 0.000 description 5
- 108010084185 Cellulases Proteins 0.000 description 5
- 108020004705 Codon Proteins 0.000 description 5
- 238000004458 analytical method Methods 0.000 description 5
- 108010003152 bacteriophage T7 RNA polymerase Proteins 0.000 description 5
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 5
- 230000000593 degrading Effects 0.000 description 5
- KCXVZYZYPLLWCC-UHFFFAOYSA-N edta Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 5
- 239000000446 fuel Substances 0.000 description 5
- 238000002955 isolation Methods 0.000 description 5
- 229910052757 nitrogen Inorganic materials 0.000 description 5
- 229920000642 polymer Polymers 0.000 description 5
- 229920000023 polynucleotide Polymers 0.000 description 5
- 239000002157 polynucleotide Substances 0.000 description 5
- 150000003839 salts Chemical class 0.000 description 5
- 238000006467 substitution reaction Methods 0.000 description 5
- 229940064005 Antibiotic throat preparations Drugs 0.000 description 4
- 229940083879 Antibiotics FOR TREATMENT OF HEMORRHOIDS AND ANAL FISSURES FOR TOPICAL USE Drugs 0.000 description 4
- 229940042052 Antibiotics for systemic use Drugs 0.000 description 4
- 229940042786 Antitubercular Antibiotics Drugs 0.000 description 4
- 238000009010 Bradford assay Methods 0.000 description 4
- FERIUCNNQQJTOY-UHFFFAOYSA-N Butanoic acid Chemical compound CCCC(O)=O FERIUCNNQQJTOY-UHFFFAOYSA-N 0.000 description 4
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 4
- FBPFZTCFMRRESA-JGWLITMVSA-N D-glucitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-JGWLITMVSA-N 0.000 description 4
- 241000233866 Fungi Species 0.000 description 4
- 229940093922 Gynecological Antibiotics Drugs 0.000 description 4
- ZXEKIIBDNHEJCQ-UHFFFAOYSA-N Isobutanol Chemical compound CC(C)CO ZXEKIIBDNHEJCQ-UHFFFAOYSA-N 0.000 description 4
- 239000006137 Luria-Bertani broth Substances 0.000 description 4
- 108010014251 Muramidase Proteins 0.000 description 4
- 102000016943 Muramidase Human genes 0.000 description 4
- 241000235648 Pichia Species 0.000 description 4
- KDYFGRWQOYBRFD-UHFFFAOYSA-N Succinic acid Chemical compound OC(=O)CCC(O)=O KDYFGRWQOYBRFD-UHFFFAOYSA-N 0.000 description 4
- 229940024982 Topical Antifungal Antibiotics Drugs 0.000 description 4
- HEBKCHPVOIAQTA-SCDXWVJYSA-N Xylitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)CO HEBKCHPVOIAQTA-SCDXWVJYSA-N 0.000 description 4
- 229960002675 Xylitol Drugs 0.000 description 4
- 150000007513 acids Chemical class 0.000 description 4
- 239000000654 additive Substances 0.000 description 4
- 244000052616 bacterial pathogens Species 0.000 description 4
- 239000006227 byproduct Substances 0.000 description 4
- 150000001735 carboxylic acids Chemical class 0.000 description 4
- 239000000539 dimer Substances 0.000 description 4
- 235000013305 food Nutrition 0.000 description 4
- 238000004128 high performance liquid chromatography Methods 0.000 description 4
- MHAJPDPJQMAIIY-UHFFFAOYSA-N hydrogen peroxide Chemical compound OO MHAJPDPJQMAIIY-UHFFFAOYSA-N 0.000 description 4
- 229940079866 intestinal antibiotics Drugs 0.000 description 4
- 235000014655 lactic acid Nutrition 0.000 description 4
- 229920005610 lignin Polymers 0.000 description 4
- 239000004325 lysozyme Substances 0.000 description 4
- 229960000274 lysozyme Drugs 0.000 description 4
- 235000010335 lysozyme Nutrition 0.000 description 4
- 239000002609 media Substances 0.000 description 4
- 229910052759 nickel Inorganic materials 0.000 description 4
- IJGRMHOSHXDMSA-UHFFFAOYSA-N nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 4
- 229940005935 ophthalmologic Antibiotics Drugs 0.000 description 4
- 238000003752 polymerase chain reaction Methods 0.000 description 4
- 108091007521 restriction endonucleases Proteins 0.000 description 4
- 229920002477 rna polymer Polymers 0.000 description 4
- 239000000600 sorbitol Substances 0.000 description 4
- 235000010356 sorbitol Nutrition 0.000 description 4
- 239000006228 supernatant Substances 0.000 description 4
- 239000004094 surface-active agent Substances 0.000 description 4
- 238000004704 ultra performance liquid chromatography Methods 0.000 description 4
- 239000000811 xylitol Substances 0.000 description 4
- 235000010447 xylitol Nutrition 0.000 description 4
- LENZDBCJOHFCAS-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 3
- 229920002498 Beta-glucan Polymers 0.000 description 3
- 102100008431 CCL26 Human genes 0.000 description 3
- 101700040437 CCL26 Proteins 0.000 description 3
- GZCGUPFRVQAUEE-KCDKBNATSA-N D-(+)-Galactose Natural products OC[C@@H](O)[C@H](O)[C@H](O)[C@@H](O)C=O GZCGUPFRVQAUEE-KCDKBNATSA-N 0.000 description 3
- QIVBCDIJIAJPQS-SECBINFHSA-N D-tryptophane Chemical compound C1=CC=C2C(C[C@@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-SECBINFHSA-N 0.000 description 3
- 239000004386 Erythritol Substances 0.000 description 3
- 229940009714 Erythritol Drugs 0.000 description 3
- 241000206602 Eukaryota Species 0.000 description 3
- 239000005715 Fructose Substances 0.000 description 3
- 244000285963 Kluyveromyces fragilis Species 0.000 description 3
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 3
- HRXKRNGNAMMEHJ-UHFFFAOYSA-K Trisodium citrate Chemical compound [Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O HRXKRNGNAMMEHJ-UHFFFAOYSA-K 0.000 description 3
- 229920004890 Triton X-100 Polymers 0.000 description 3
- 235000004279 alanine Nutrition 0.000 description 3
- 230000003321 amplification Effects 0.000 description 3
- 230000000845 anti-microbial Effects 0.000 description 3
- 150000001479 arabinose derivatives Chemical class 0.000 description 3
- 235000013405 beer Nutrition 0.000 description 3
- 238000009835 boiling Methods 0.000 description 3
- 150000001875 compounds Chemical class 0.000 description 3
- 230000000875 corresponding Effects 0.000 description 3
- 235000019414 erythritol Nutrition 0.000 description 3
- 238000003306 harvesting Methods 0.000 description 3
- 239000011261 inert gas Substances 0.000 description 3
- 238000011081 inoculation Methods 0.000 description 3
- JVTAAEKCZFNVCJ-UHFFFAOYSA-N lactic acid Chemical compound CC(O)C(O)=O JVTAAEKCZFNVCJ-UHFFFAOYSA-N 0.000 description 3
- 239000004310 lactic acid Substances 0.000 description 3
- 230000000670 limiting Effects 0.000 description 3
- 238000004895 liquid chromatography mass spectrometry Methods 0.000 description 3
- 239000006166 lysate Substances 0.000 description 3
- 230000002934 lysing Effects 0.000 description 3
- 238000004949 mass spectrometry Methods 0.000 description 3
- 230000000813 microbial Effects 0.000 description 3
- 238000003199 nucleic acid amplification method Methods 0.000 description 3
- 235000015097 nutrients Nutrition 0.000 description 3
- 229920001542 oligosaccharide Polymers 0.000 description 3
- 150000002482 oligosaccharides Polymers 0.000 description 3
- MUBZPKHOEPUJKR-UHFFFAOYSA-N oxalic acid Chemical compound OC(=O)C(O)=O MUBZPKHOEPUJKR-UHFFFAOYSA-N 0.000 description 3
- 230000037361 pathway Effects 0.000 description 3
- 229920001282 polysaccharide Polymers 0.000 description 3
- 239000005017 polysaccharide Substances 0.000 description 3
- 150000004804 polysaccharides Polymers 0.000 description 3
- XBDQKXXYIPTUBI-UHFFFAOYSA-N propionic acid Chemical compound CCC(O)=O XBDQKXXYIPTUBI-UHFFFAOYSA-N 0.000 description 3
- 239000001509 sodium citrate Substances 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- 230000001131 transforming Effects 0.000 description 3
- 239000011778 trisodium citrate Substances 0.000 description 3
- 238000011144 upstream manufacturing Methods 0.000 description 3
- WQZGKKKJIJFFOK-PHYPRBDBSA-N α-D-galactose Chemical compound OC[C@H]1O[C@H](O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-PHYPRBDBSA-N 0.000 description 3
- YPFDHNVEDLHUCE-UHFFFAOYSA-N 1,3-Propanediol Chemical compound OCCCO YPFDHNVEDLHUCE-UHFFFAOYSA-N 0.000 description 2
- WERYXYBDKMZEQL-UHFFFAOYSA-N 1,4-Butanediol Chemical compound OCCCCO WERYXYBDKMZEQL-UHFFFAOYSA-N 0.000 description 2
- UCSJYZPVAKXKNQ-HZYVHMACSA-N 1-[(1S,2R,3R,4S,5R,6R)-3-carbamimidamido-6-{[(2R,3R,4R,5S)-3-{[(2S,3S,4S,5R,6S)-4,5-dihydroxy-6-(hydroxymethyl)-3-(methylamino)oxan-2-yl]oxy}-4-formyl-4-hydroxy-5-methyloxolan-2-yl]oxy}-2,4,5-trihydroxycyclohexyl]guanidine Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 2
- BTANRVKWQNVYAZ-UHFFFAOYSA-N 2-Butanol Chemical compound CCC(C)O BTANRVKWQNVYAZ-UHFFFAOYSA-N 0.000 description 2
- HEBKCHPVOIAQTA-QWWZWVQMSA-N Arabitol Chemical compound OC[C@@H](O)C(O)[C@H](O)CO HEBKCHPVOIAQTA-QWWZWVQMSA-N 0.000 description 2
- 239000004475 Arginine Substances 0.000 description 2
- 102220371723 BVES S28A Human genes 0.000 description 2
- 101710016919 CELA3A Proteins 0.000 description 2
- 101710016948 CELA3B Proteins 0.000 description 2
- 210000002421 Cell Wall Anatomy 0.000 description 2
- 241000135254 Cephalosporium sp. Species 0.000 description 2
- 229960005091 Chloramphenicol Drugs 0.000 description 2
- WIIZWVCIJKGZOK-RKDXNWHRSA-N Chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 2
- 241000193403 Clostridium Species 0.000 description 2
- 229920002676 Complementary DNA Polymers 0.000 description 2
- WQZGKKKJIJFFOK-QTVWNMPRSA-N D-mannopyranose Chemical compound OC[C@H]1OC(O)[C@@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-QTVWNMPRSA-N 0.000 description 2
- 101710035145 DHODH Proteins 0.000 description 2
- HYBBIBNJHNGZAN-UHFFFAOYSA-N Furfural Chemical compound O=CC1=CC=CO1 HYBBIBNJHNGZAN-UHFFFAOYSA-N 0.000 description 2
- 102000016876 Galactokinase Human genes 0.000 description 2
- 108091000074 Galactokinase Proteins 0.000 description 2
- RGHNJXZEOKUKBD-SQOUGZDYSA-N Gluconic acid Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@@H](O)C(O)=O RGHNJXZEOKUKBD-SQOUGZDYSA-N 0.000 description 2
- JFCQEDHGNNZCLN-UHFFFAOYSA-N Glutaric acid Chemical compound OC(=O)CCCC(O)=O JFCQEDHGNNZCLN-UHFFFAOYSA-N 0.000 description 2
- FUZZWVXGSFPDMH-UHFFFAOYSA-N Hexanoic acid Chemical compound CCCCCC(O)=O FUZZWVXGSFPDMH-UHFFFAOYSA-N 0.000 description 2
- 229960000310 ISOLEUCINE Drugs 0.000 description 2
- 102200006063 KCNC4 S21A Human genes 0.000 description 2
- 241000235649 Kluyveromyces Species 0.000 description 2
- 241000235058 Komagataella pastoris Species 0.000 description 2
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 2
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 2
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 2
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 2
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 2
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 2
- 241000186660 Lactobacillus Species 0.000 description 2
- 229940039696 Lactobacillus Drugs 0.000 description 2
- OYHQOLUKZRVURQ-IXWMQOLASA-N Linoleic acid Natural products CCCCC\C=C/C\C=C\CCCCCCCC(O)=O OYHQOLUKZRVURQ-IXWMQOLASA-N 0.000 description 2
- VQHSOMBJVWLPSR-WUJBLJFYSA-N Maltitol Chemical compound OC[C@H](O)[C@@H](O)[C@@H]([C@H](O)CO)O[C@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O VQHSOMBJVWLPSR-WUJBLJFYSA-N 0.000 description 2
- ZQPPMHVWECSIRJ-KTKRTIGZSA-N Oleic acid Chemical compound CCCCCCCC\C=C/CCCCCCCC(O)=O ZQPPMHVWECSIRJ-KTKRTIGZSA-N 0.000 description 2
- IPCSVZSSVZVIGE-UHFFFAOYSA-N Palmitic acid Chemical compound CCCCCCCCCCCCCCCC(O)=O IPCSVZSSVZVIGE-UHFFFAOYSA-N 0.000 description 2
- 229940072417 Peroxidase Drugs 0.000 description 2
- 102000003992 Peroxidases Human genes 0.000 description 2
- 108090000437 Peroxidases Proteins 0.000 description 2
- 229960005190 Phenylalanine Drugs 0.000 description 2
- 102220382767 RARG T42A Human genes 0.000 description 2
- 108020004511 Recombinant DNA Proteins 0.000 description 2
- 241000235070 Saccharomyces Species 0.000 description 2
- 235000003534 Saccharomyces carlsbergensis Nutrition 0.000 description 2
- 229940081969 Saccharomyces cerevisiae Drugs 0.000 description 2
- 101710043164 Segment-4 Proteins 0.000 description 2
- QIQXTHQIDYTFRH-UHFFFAOYSA-N Stearic acid Chemical compound CCCCCCCCCCCCCCCCCC(O)=O QIQXTHQIDYTFRH-UHFFFAOYSA-N 0.000 description 2
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 2
- 102200036005 TP53BP1 S25A Human genes 0.000 description 2
- 101700038759 VP1 Proteins 0.000 description 2
- NQPDZGIKBAWPEJ-UHFFFAOYSA-N Valeric acid Chemical compound CCCCC(O)=O NQPDZGIKBAWPEJ-UHFFFAOYSA-N 0.000 description 2
- 241000588901 Zymomonas Species 0.000 description 2
- IKHGUXGNUITLKF-UHFFFAOYSA-N acetaldehyde Chemical compound CC=O IKHGUXGNUITLKF-UHFFFAOYSA-N 0.000 description 2
- 235000011054 acetic acid Nutrition 0.000 description 2
- 230000000996 additive Effects 0.000 description 2
- 102000004965 antibodies Human genes 0.000 description 2
- 108090001123 antibodies Proteins 0.000 description 2
- 239000004599 antimicrobial Substances 0.000 description 2
- 239000007864 aqueous solution Substances 0.000 description 2
- 239000003225 biodiesel Substances 0.000 description 2
- 239000002551 biofuel Substances 0.000 description 2
- 102220358029 c.232A>G Human genes 0.000 description 2
- 229910052799 carbon Inorganic materials 0.000 description 2
- CURLTUGMZLYLDI-UHFFFAOYSA-N carbon dioxide Chemical compound O=C=O CURLTUGMZLYLDI-UHFFFAOYSA-N 0.000 description 2
- 229910002092 carbon dioxide Inorganic materials 0.000 description 2
- 239000003054 catalyst Substances 0.000 description 2
- 230000024881 catalytic activity Effects 0.000 description 2
- 235000015165 citric acid Nutrition 0.000 description 2
- 238000004140 cleaning Methods 0.000 description 2
- 239000002299 complementary DNA Substances 0.000 description 2
- 239000008367 deionised water Substances 0.000 description 2
- 239000012149 elution buffer Substances 0.000 description 2
- 230000002708 enhancing Effects 0.000 description 2
- 238000006911 enzymatic reaction Methods 0.000 description 2
- 239000012530 fluid Substances 0.000 description 2
- 235000019253 formic acid Nutrition 0.000 description 2
- 230000002068 genetic Effects 0.000 description 2
- 235000013922 glutamic acid Nutrition 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- AEMRFAOFKBGASW-UHFFFAOYSA-N glycolic acid Chemical compound OCC(O)=O AEMRFAOFKBGASW-UHFFFAOYSA-N 0.000 description 2
- 101700005460 hemA Proteins 0.000 description 2
- 239000000185 hemagglutinin Substances 0.000 description 2
- 239000002054 inoculum Substances 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 238000007834 ligase chain reaction Methods 0.000 description 2
- OFOBLEOULBTSOW-UHFFFAOYSA-N malonic acid Chemical compound OC(=O)CC(O)=O OFOBLEOULBTSOW-UHFFFAOYSA-N 0.000 description 2
- 230000001404 mediated Effects 0.000 description 2
- 238000003801 milling Methods 0.000 description 2
- 238000002156 mixing Methods 0.000 description 2
- 239000001301 oxygen Substances 0.000 description 2
- 229910052760 oxygen Inorganic materials 0.000 description 2
- MYMOFIZGZYHOMD-UHFFFAOYSA-N oxygen Chemical compound O=O MYMOFIZGZYHOMD-UHFFFAOYSA-N 0.000 description 2
- 229920001223 polyethylene glycol Polymers 0.000 description 2
- 229920005862 polyol Polymers 0.000 description 2
- 150000003077 polyols Chemical class 0.000 description 2
- 229920000136 polysorbate Polymers 0.000 description 2
- BDERNNFJNOPAEC-UHFFFAOYSA-N propanol Chemical compound CCCO BDERNNFJNOPAEC-UHFFFAOYSA-N 0.000 description 2
- DNIAPMSPPWPWGF-UHFFFAOYSA-N propylene glycol Chemical compound CC(O)CO DNIAPMSPPWPWGF-UHFFFAOYSA-N 0.000 description 2
- 238000000751 protein extraction Methods 0.000 description 2
- 238000011002 quantification Methods 0.000 description 2
- 239000011347 resin Substances 0.000 description 2
- 229920005989 resin Polymers 0.000 description 2
- 102220201531 rs754365645 Human genes 0.000 description 2
- 102220061223 rs786203442 Human genes 0.000 description 2
- 230000003248 secreting Effects 0.000 description 2
- 108010027322 single cell proteins Proteins 0.000 description 2
- 238000002741 site-directed mutagenesis Methods 0.000 description 2
- 239000002002 slurry Substances 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 230000003595 spectral Effects 0.000 description 2
- 239000007858 starting material Substances 0.000 description 2
- 239000001384 succinic acid Substances 0.000 description 2
- DKGAVHZHDRPRBM-UHFFFAOYSA-N t-BuOH Chemical compound CC(C)(C)O DKGAVHZHDRPRBM-UHFFFAOYSA-N 0.000 description 2
- 101710035147 ura3 Proteins 0.000 description 2
- 239000004474 valine Substances 0.000 description 2
- 230000003612 virological Effects 0.000 description 2
- 238000011179 visual inspection Methods 0.000 description 2
- 238000001262 western blot Methods 0.000 description 2
- 229920001221 xylan Polymers 0.000 description 2
- 150000004823 xylans Chemical class 0.000 description 2
- BRZYSWJRSDMWLG-DJWUNRQOSA-N (2R,3R,4R,5R)-2-[(1S,2S,3R,4S,6R)-4,6-diamino-3-[(2S,3R,4R,5S,6R)-3-amino-4,5-dihydroxy-6-[(1R)-1-hydroxyethyl]oxan-2-yl]oxy-2-hydroxycyclohexyl]oxy-5-methyl-4-(methylamino)oxane-3,5-diol Chemical compound O1C[C@@](O)(C)[C@H](NC)[C@@H](O)[C@H]1O[C@@H]1[C@@H](O)[C@H](O[C@@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H]([C@@H](C)O)O2)N)[C@@H](N)C[C@H]1N BRZYSWJRSDMWLG-DJWUNRQOSA-N 0.000 description 1
- XJCCHWKNFMUJFE-CGQAXDJHSA-N (2S,3R,4R,5R)-4-[(2R,3R,4R,5S,6R)-3,4-dihydroxy-6-(hydroxymethyl)-5-[(2R,3R,4S,5S,6R)-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl]oxyoxan-2-yl]oxyhexane-1,2,3,5,6-pentol Chemical compound O[C@@H]1[C@@H](O)[C@@H](O[C@@H]([C@H](O)[C@@H](O)CO)[C@H](O)CO)O[C@H](CO)[C@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 XJCCHWKNFMUJFE-CGQAXDJHSA-N 0.000 description 1
- LUAHEUHBAZYUOI-KVXMBEGHSA-N (2S,3R,4R,5R)-4-[(2R,3R,4R,5S,6R)-5-[(2R,3R,4R,5S,6R)-3,4-dihydroxy-6-(hydroxymethyl)-5-[(2R,3R,4S,5S,6R)-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl]oxyoxan-2-yl]oxy-3,4-dihydroxy-6-(hydroxymethyl)oxan-2-yl]oxyhexane-1,2,3,5,6-pentol Chemical compound O[C@@H]1[C@@H](O)[C@@H](O[C@@H]([C@H](O)[C@@H](O)CO)[C@H](O)CO)O[C@H](CO)[C@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@H](O[C@@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O2)O)[C@@H](CO)O1 LUAHEUHBAZYUOI-KVXMBEGHSA-N 0.000 description 1
- FYGDTMLNYKFZSV-WFYNLLPOSA-N (2S,3R,4S,5S,6R)-2-[(2R,4R,5R,6S)-4,5-dihydroxy-2-(hydroxymethyl)-6-[(2R,3S,4R,5R,6S)-4,5,6-trihydroxy-2-(hydroxymethyl)oxan-3-yl]oxyoxan-3-yl]oxy-6-(hydroxymethyl)oxane-3,4,5-triol Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1OC1[C@@H](CO)O[C@@H](O[C@@H]2[C@H](O[C@H](O)[C@H](O)[C@H]2O)CO)[C@H](O)[C@H]1O FYGDTMLNYKFZSV-WFYNLLPOSA-N 0.000 description 1
- ZCXUVYAZINUVJD-LLSCIWEVSA-N (3R,4S,5S)-3-fluoranyl-6-(hydroxymethyl)oxane-2,4,5-triol Chemical compound OCC1OC(O)[C@H]([18F])[C@@H](O)[C@@H]1O ZCXUVYAZINUVJD-LLSCIWEVSA-N 0.000 description 1
- 125000004066 1-hydroxyethyl group Chemical group [H]OC([H])([*])C([H])([H])[H] 0.000 description 1
- VGONTNSXDCQUGY-RRKCRQDMSA-N 2'-deoxyinosine Chemical group C1[C@H](O)[C@@H](CO)O[C@H]1N1C(N=CNC2=O)=C2N=C1 VGONTNSXDCQUGY-RRKCRQDMSA-N 0.000 description 1
- QRBLKGHRWFGINE-UGWAGOLRSA-N 2-[2-[2-[[2-[[4-[[2-[[6-amino-2-[3-amino-1-[(2,3-diamino-3-oxopropyl)amino]-3-oxopropyl]-5-methylpyrimidine-4-carbonyl]amino]-3-[(2R,3S,4S,5S,6S)-3-[(2S,3R,4R,5S)-4-carbamoyl-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl]oxy-4,5-dihydroxy-6-(hydroxymethyl)- Chemical compound N=1C(C=2SC=C(N=2)C(N)=O)CSC=1CCNC(=O)C(C(C)=O)NC(=O)C(C)C(O)C(C)NC(=O)C(C(O[C@H]1[C@@]([C@@H](O)[C@H](O)[C@H](CO)O1)(C)O[C@H]1[C@@H]([C@](O)([C@@H](O)C(CO)O1)C(N)=O)O)C=1NC=NC=1)NC(=O)C1=NC(C(CC(N)=O)NCC(N)C(N)=O)=NC(N)=C1C QRBLKGHRWFGINE-UGWAGOLRSA-N 0.000 description 1
- VVQNEPGJFQJSBK-UHFFFAOYSA-N 2-methyl-2-propenoic acid methyl ester Chemical compound COC(=O)C(C)=C VVQNEPGJFQJSBK-UHFFFAOYSA-N 0.000 description 1
- JRBJSXQPQWSCCF-UHFFFAOYSA-N 3,3'-Dimethoxybenzidine Chemical compound C1=C(N)C(OC)=CC(C=2C=C(OC)C(N)=CC=2)=C1 JRBJSXQPQWSCCF-UHFFFAOYSA-N 0.000 description 1
- 125000003143 4-hydroxybenzyl group Chemical group [H]C([*])([H])C1=C([H])C([H])=C(O[H])C([H])=C1[H] 0.000 description 1
- 241001114518 Acaulium acremonium Species 0.000 description 1
- 108010065511 Amylases Proteins 0.000 description 1
- 229940025131 Amylases Drugs 0.000 description 1
- 102000013142 Amylases Human genes 0.000 description 1
- 108020005098 Anticodon Proteins 0.000 description 1
- 241001523626 Arxula Species 0.000 description 1
- 241000235349 Ascomycota Species 0.000 description 1
- 229960001230 Asparagine Drugs 0.000 description 1
- 229960005261 Aspartic Acid Drugs 0.000 description 1
- 241001513093 Aspergillus awamori Species 0.000 description 1
- 241000879125 Aureobasidium sp. Species 0.000 description 1
- APKFDSVGJQXUKY-INPOYWNPSA-N BRL-49594 Chemical compound O[C@H]1[C@@H](N)[C@H](O)[C@@H](C)O[C@H]1O[C@H]1/C=C/C=C/C=C/C=C/C=C/C=C/C=C/[C@H](C)[C@@H](O)[C@@H](C)[C@H](C)OC(=O)C[C@H](O)C[C@H](O)CC[C@@H](O)[C@H](O)C[C@H](O)C[C@](O)(C[C@H](O)[C@H]2C(O)=O)O[C@H]2C1 APKFDSVGJQXUKY-INPOYWNPSA-N 0.000 description 1
- 108020004513 Bacterial RNA Proteins 0.000 description 1
- 229940106157 CELLULASE Drugs 0.000 description 1
- 241000222120 Candida <Saccharomycetales> Species 0.000 description 1
- BVKZGUZCCUSVTD-UHFFFAOYSA-N Carbonic acid Chemical compound OC(O)=O BVKZGUZCCUSVTD-UHFFFAOYSA-N 0.000 description 1
- 229920002134 Carboxymethyl cellulose Polymers 0.000 description 1
- 210000000170 Cell Membrane Anatomy 0.000 description 1
- MYSWGUAQZAJSOK-UHFFFAOYSA-N Ciprofloxacin Chemical compound C12=CC(N3CCNCC3)=C(F)C=C2C(=O)C(C(=O)O)=CN1C1CC1 MYSWGUAQZAJSOK-UHFFFAOYSA-N 0.000 description 1
- 241001508811 Clavispora Species 0.000 description 1
- 241001508813 Clavispora lusitaniae Species 0.000 description 1
- 241000193401 Clostridium acetobutylicum Species 0.000 description 1
- 241001147704 Clostridium puniceum Species 0.000 description 1
- 241000429427 Clostridium saccharobutylicum Species 0.000 description 1
- 229920001405 Coding region Polymers 0.000 description 1
- IERHLVCPSMICTF-XVFCMESISA-N Cytidine monophosphate Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(O)=O)O1 IERHLVCPSMICTF-XVFCMESISA-N 0.000 description 1
- FBPFZTCFMRRESA-KAZBKCHUSA-N D-Mannitol Natural products OC[C@@H](O)[C@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KAZBKCHUSA-N 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 241000235035 Debaryomyces Species 0.000 description 1
- GRRNUXAQVGOGFE-UHFFFAOYSA-N Destomysin Chemical compound OC1C(NC)CC(N)C(O)C1OC1C2OC3(C(C(O)C(O)C(C(N)CO)O3)O)OC2C(O)C(CO)O1 GRRNUXAQVGOGFE-UHFFFAOYSA-N 0.000 description 1
- 101710020319 E. coli tetA Proteins 0.000 description 1
- 101700007082 EBAG Proteins 0.000 description 1
- 108010015776 EC 1.1.3.4 Proteins 0.000 description 1
- 101700047696 EXT2 Proteins 0.000 description 1
- 108010001817 Endo-1,4-beta Xylanases Proteins 0.000 description 1
- 239000005977 Ethylene Substances 0.000 description 1
- 101710028121 GLU1 Proteins 0.000 description 1
- 229940116332 GLUCOSE OXIDASE Drugs 0.000 description 1
- FBPFZTCFMRRESA-GUCUJZIJSA-N Galactitol Chemical compound OC[C@H](O)[C@@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-GUCUJZIJSA-N 0.000 description 1
- CEAZRRDELHUEMR-URQXQFDESA-N Gentamicin Chemical compound O1[C@H](C(C)NC)CC[C@@H](N)[C@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](NC)[C@@](C)(O)CO2)O)[C@H](N)C[C@@H]1N CEAZRRDELHUEMR-URQXQFDESA-N 0.000 description 1
- 108010073178 Glucan 1,4-alpha-Glucosidase Proteins 0.000 description 1
- 239000004366 Glucose oxidase Substances 0.000 description 1
- 229960002989 Glutamic Acid Drugs 0.000 description 1
- BCCRXDTUTZHDEU-VKHMYHEASA-N Gly-Ser Chemical compound NCC(=O)N[C@@H](CO)C(O)=O BCCRXDTUTZHDEU-VKHMYHEASA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 1
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 1
- VEXZGXHMUGYJMC-UHFFFAOYSA-N HCl Chemical compound Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 1
- 229920002488 Hemicellulose Polymers 0.000 description 1
- OUUQCZGPVNCOIJ-UHFFFAOYSA-N Hydroperoxyl Chemical compound O[O] OUUQCZGPVNCOIJ-UHFFFAOYSA-N 0.000 description 1
- 229940097277 Hygromycin B Drugs 0.000 description 1
- FBPFZTCFMRRESA-ZXXMMSQZSA-N Iditol Chemical compound OC[C@@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-ZXXMMSQZSA-N 0.000 description 1
- CDAISMWEOUEBRE-GPIVLXJGSA-N Inositol Chemical compound O[C@H]1[C@H](O)[C@@H](O)[C@H](O)[C@H](O)[C@@H]1O CDAISMWEOUEBRE-GPIVLXJGSA-N 0.000 description 1
- 229960000367 Inositol Drugs 0.000 description 1
- SERLAGPUMNYUCK-DCUALPFSSA-N Isomalt Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO[C@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O SERLAGPUMNYUCK-DCUALPFSSA-N 0.000 description 1
- 241001138401 Kluyveromyces lactis Species 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-N L-arginine Chemical compound OC(=O)[C@@H](N)CCCN=C(N)N ODKSFYDXXFIFQN-BYPYZUCNSA-N 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- XUJNEKJLAYXESH-REOHCLBHSA-N L-cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- 125000002842 L-seryl group Chemical group O=C([*])[C@](N([H])[H])([H])C([H])([H])O[H] 0.000 description 1
- 125000000769 L-threonyl group Chemical group [H]N([H])[C@]([H])(C(=O)[*])[C@](O[H])(C([H])([H])[H])[H] 0.000 description 1
- 125000002707 L-tryptophyl group Chemical group [H]C1=C([H])C([H])=C2C(C([C@](N([H])[H])(C(=O)[*])[H])([H])[H])=C([H])N([H])C2=C1[H] 0.000 description 1
- 108010054278 Lac Repressors Proteins 0.000 description 1
- GUBGYTABKSRVRQ-UUNJERMWSA-N Lactose Natural products O([C@@H]1[C@H](O)[C@H](O)[C@H](O)O[C@@H]1CO)[C@H]1[C@@H](O)[C@@H](O)[C@H](O)[C@H](CO)O1 GUBGYTABKSRVRQ-UUNJERMWSA-N 0.000 description 1
- 229920002097 Lichenin Polymers 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- FBPFZTCFMRRESA-KVTDHHQDSA-N Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 1
- 108020004999 Messenger RNA Proteins 0.000 description 1
- 230000035633 Metabolized Effects 0.000 description 1
- 241001675980 Moniliella acetoabutens Species 0.000 description 1
- 241001182779 Moniliella megachiliensis Species 0.000 description 1
- 241000723128 Moniliella pollinis Species 0.000 description 1
- 101710017470 NT5C3B Proteins 0.000 description 1
- 241000221960 Neurospora Species 0.000 description 1
- 239000005642 Oleic acid Substances 0.000 description 1
- 102000015636 Oligopeptides Human genes 0.000 description 1
- 108010038807 Oligopeptides Proteins 0.000 description 1
- 101700057261 PNAA Proteins 0.000 description 1
- 229950010131 PUROMYCIN Drugs 0.000 description 1
- RXWNCPJZOCPEPQ-NVWDDTSBSA-N PUROMYCIN Chemical compound C1=CC(OC)=CC=C1C[C@H](N)C(=O)N[C@H]1[C@@H](O)[C@H](N2C3=NC=NC(=C3N=C2)N(C)C)O[C@@H]1CO RXWNCPJZOCPEPQ-NVWDDTSBSA-N 0.000 description 1
- 241000235652 Pachysolen Species 0.000 description 1
- 241000235647 Pachysolen tannophilus Species 0.000 description 1
- 235000021314 Palmitic acid Nutrition 0.000 description 1
- 229940049954 Penicillin Drugs 0.000 description 1
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 1
- 241000364057 Peoria Species 0.000 description 1
- 102000007079 Peptide Fragments Human genes 0.000 description 1
- 108010033276 Peptide Fragments Proteins 0.000 description 1
- 108010035235 Phleomycins Proteins 0.000 description 1
- 229920002480 Polybenzimidazole fiber Polymers 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- 241000222180 Pseudozyma tsukubaensis Species 0.000 description 1
- XPPKVPWEQAFLFU-UHFFFAOYSA-J Pyrophosphate Chemical compound [O-]P([O-])(=O)OP([O-])([O-])=O XPPKVPWEQAFLFU-UHFFFAOYSA-J 0.000 description 1
- 239000007868 Raney catalyst Substances 0.000 description 1
- 229910000564 Raney nickel Inorganic materials 0.000 description 1
- 102000001218 Rec A Recombinases Human genes 0.000 description 1
- 108010055016 Rec A Recombinases Proteins 0.000 description 1
- 102000009661 Repressor Proteins Human genes 0.000 description 1
- HEBKCHPVOIAQTA-ZXFHETKHSA-N Ribitol Chemical compound OC[C@H](O)[C@H](O)[C@H](O)CO HEBKCHPVOIAQTA-ZXFHETKHSA-N 0.000 description 1
- 210000003705 Ribosomes Anatomy 0.000 description 1
- 241000193448 Ruminiclostridium thermocellum Species 0.000 description 1
- 241000192263 Scheffersomyces shehatae Species 0.000 description 1
- 241000235060 Scheffersomyces stipitis Species 0.000 description 1
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 1
- 241001085826 Sporotrichum Species 0.000 description 1
- 235000021355 Stearic acid Nutrition 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- 229960005322 Streptomycin Drugs 0.000 description 1
- 241000006364 Torula Species 0.000 description 1
- 229920001949 Transfer RNA Polymers 0.000 description 1
- 241001079965 Trichosporon sp. Species 0.000 description 1
- 241000908249 Trichosporonoides Species 0.000 description 1
- 241001480015 Trigonopsis variabilis Species 0.000 description 1
- FPKOPBFLPLFWAD-UHFFFAOYSA-N Trinitrotoluene Chemical compound CC1=CC=C([N+]([O-])=O)C([N+]([O-])=O)=C1[N+]([O-])=O FPKOPBFLPLFWAD-UHFFFAOYSA-N 0.000 description 1
- GSEJCLTVZPLZKY-UHFFFAOYSA-N Tris Chemical compound OCCN(CCO)CCO GSEJCLTVZPLZKY-UHFFFAOYSA-N 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- 101710042194 Trpgamma Proteins 0.000 description 1
- 241000918129 Typhula variabilis Species 0.000 description 1
- 241000221533 Ustilaginomycetes Species 0.000 description 1
- OXQKEKGBFMQTML-KVTDHHQDSA-N Volemitol Chemical compound OC[C@@H](O)[C@@H](O)C(O)[C@H](O)[C@H](O)CO OXQKEKGBFMQTML-KVTDHHQDSA-N 0.000 description 1
- 229920002000 Xyloglucan Polymers 0.000 description 1
- 241000235013 Yarrowia Species 0.000 description 1
- 241000235015 Yarrowia lipolytica Species 0.000 description 1
- 241000235017 Zygosaccharomyces Species 0.000 description 1
- 241000588902 Zymomonas mobilis Species 0.000 description 1
- 241000222292 [Candida] magnoliae Species 0.000 description 1
- DLFVBJFMPXGRIB-UHFFFAOYSA-N acetamide Chemical compound CC(N)=O DLFVBJFMPXGRIB-UHFFFAOYSA-N 0.000 description 1
- 108010093941 acetylxylan esterase Proteins 0.000 description 1
- 230000002378 acidificating Effects 0.000 description 1
- NIXOWILDQLNWCW-UHFFFAOYSA-N acrylic acid Chemical compound OC(=O)C=C NIXOWILDQLNWCW-UHFFFAOYSA-N 0.000 description 1
- 108010025188 alcohol oxidase Proteins 0.000 description 1
- 150000001336 alkenes Chemical class 0.000 description 1
- 108090000637 alpha-Amylases Proteins 0.000 description 1
- 108010061261 alpha-glucuronidase Proteins 0.000 description 1
- 239000002280 amphoteric surfactant Substances 0.000 description 1
- 229960003942 amphotericin B Drugs 0.000 description 1
- 235000019418 amylase Nutrition 0.000 description 1
- 150000008064 anhydrides Chemical class 0.000 description 1
- 235000006708 antioxidants Nutrition 0.000 description 1
- 125000003118 aryl group Chemical group 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 239000002585 base Substances 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 125000001797 benzyl group Chemical group [H]C1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])* 0.000 description 1
- 229960000626 benzylpenicillin Drugs 0.000 description 1
- 108010019077 beta-Amylase Proteins 0.000 description 1
- 125000000188 beta-D-glucosyl group Chemical group C1([C@H](O)[C@@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 1
- 102000006635 beta-Lactamases Human genes 0.000 description 1
- 108020004256 beta-Lactamases Proteins 0.000 description 1
- 238000007068 beta-elimination reaction Methods 0.000 description 1
- 239000012148 binding buffer Substances 0.000 description 1
- 230000037348 biosynthesis Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 239000001569 carbon dioxide Substances 0.000 description 1
- 239000001768 carboxy methyl cellulose Substances 0.000 description 1
- 235000010948 carboxy methyl cellulose Nutrition 0.000 description 1
- 150000001732 carboxylic acid derivatives Chemical class 0.000 description 1
- 150000001733 carboxylic acid esters Chemical class 0.000 description 1
- 229920003090 carboxymethyl hydroxyethyl cellulose Polymers 0.000 description 1
- 239000008112 carboxymethyl-cellulose Substances 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 230000002759 chromosomal Effects 0.000 description 1
- 229960003405 ciprofloxacin Drugs 0.000 description 1
- 238000007398 colorimetric assay Methods 0.000 description 1
- 238000002485 combustion reaction Methods 0.000 description 1
- 230000000295 complement Effects 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 230000003013 cytotoxicity Effects 0.000 description 1
- 231100000135 cytotoxicity Toxicity 0.000 description 1
- 230000003247 decreasing Effects 0.000 description 1
- 238000004807 desolvation Methods 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000001177 diphosphate Substances 0.000 description 1
- 235000011180 diphosphates Nutrition 0.000 description 1
- 150000002016 disaccharides Chemical class 0.000 description 1
- 238000004821 distillation Methods 0.000 description 1
- 238000011143 downstream manufacturing Methods 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 101710004143 endOF2 Proteins 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 150000002148 esters Chemical class 0.000 description 1
- OTMSDBZUPAUEDD-UHFFFAOYSA-N ethane Chemical compound CC OTMSDBZUPAUEDD-UHFFFAOYSA-N 0.000 description 1
- 235000019441 ethanol Nutrition 0.000 description 1
- VGGSQFUCUMXWEO-UHFFFAOYSA-N ethene Chemical compound C=C VGGSQFUCUMXWEO-UHFFFAOYSA-N 0.000 description 1
- 125000004494 ethyl ester group Chemical group 0.000 description 1
- 125000001495 ethyl group Chemical group [H]C([H])([H])C([H])([H])* 0.000 description 1
- 108010038658 exo-1,4-beta-D-xylosidase Proteins 0.000 description 1
- 108060000072 faeA Proteins 0.000 description 1
- 239000012467 final product Substances 0.000 description 1
- 239000010419 fine particle Substances 0.000 description 1
- 102000034387 fluorescent proteins Human genes 0.000 description 1
- 108091006031 fluorescent proteins Proteins 0.000 description 1
- 238000007421 fluorometric assay Methods 0.000 description 1
- 239000002816 fuel additive Substances 0.000 description 1
- 230000005714 functional activity Effects 0.000 description 1
- 238000002825 functional assay Methods 0.000 description 1
- 125000000524 functional group Chemical group 0.000 description 1
- 102000037240 fusion proteins Human genes 0.000 description 1
- 108020001507 fusion proteins Proteins 0.000 description 1
- 239000003502 gasoline Substances 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 229960002518 gentamicin Drugs 0.000 description 1
- 101700036661 glaA Proteins 0.000 description 1
- 239000000174 gluconic acid Substances 0.000 description 1
- 235000012208 gluconic acid Nutrition 0.000 description 1
- 229950006191 gluconic acid Drugs 0.000 description 1
- 235000019420 glucose oxidase Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- 150000002307 glutamic acids Chemical class 0.000 description 1
- LYCAIKOWRPUZTN-UHFFFAOYSA-N glycol Chemical compound OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 description 1
- 239000000937 glycosyl acceptor Substances 0.000 description 1
- 239000005090 green fluorescent protein Substances 0.000 description 1
- 108010002430 hemicellulase Proteins 0.000 description 1
- VLKZOEOYAKHREP-UHFFFAOYSA-N hexane Chemical compound CCCCCC VLKZOEOYAKHREP-UHFFFAOYSA-N 0.000 description 1
- 238000000265 homogenisation Methods 0.000 description 1
- 238000006698 hydrazinolysis reaction Methods 0.000 description 1
- 150000002430 hydrocarbons Chemical class 0.000 description 1
- 229910000041 hydrogen chloride Inorganic materials 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- 125000004029 hydroxymethyl group Chemical group [H]OC([H])([H])* 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 239000000411 inducer Substances 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000003834 intracellular Effects 0.000 description 1
- 239000002563 ionic surfactant Substances 0.000 description 1
- VQTUBCCKSQIDNK-UHFFFAOYSA-N isobutene Chemical compound CC(C)=C VQTUBCCKSQIDNK-UHFFFAOYSA-N 0.000 description 1
- 239000000905 isomalt Substances 0.000 description 1
- 235000010439 isomalt Nutrition 0.000 description 1
- 125000001449 isopropyl group Chemical group [H]C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 1
- 150000002576 ketones Chemical class 0.000 description 1
- 239000002655 kraft paper Substances 0.000 description 1
- 239000000832 lactitol Substances 0.000 description 1
- 235000010448 lactitol Nutrition 0.000 description 1
- 229960003451 lactitol Drugs 0.000 description 1
- GUBGYTABKSRVRQ-XLOQQCSPSA-N lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 101710030587 ligN Proteins 0.000 description 1
- 101700077585 ligd Proteins 0.000 description 1
- 235000020778 linoleic acid Nutrition 0.000 description 1
- 238000001638 lipofection Methods 0.000 description 1
- 238000011068 load Methods 0.000 description 1
- 239000000845 maltitol Substances 0.000 description 1
- 235000010449 maltitol Nutrition 0.000 description 1
- 229940035436 maltitol Drugs 0.000 description 1
- 239000000594 mannitol Substances 0.000 description 1
- 235000010355 mannitol Nutrition 0.000 description 1
- 238000001819 mass spectrum Methods 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 229920002106 messenger RNA Polymers 0.000 description 1
- CERQOIWHTDAKMF-UHFFFAOYSA-M methacrylate Chemical compound CC(=C)C([O-])=O CERQOIWHTDAKMF-UHFFFAOYSA-M 0.000 description 1
- VNWKTOKETHGBQD-UHFFFAOYSA-N methane Chemical compound C VNWKTOKETHGBQD-UHFFFAOYSA-N 0.000 description 1
- 125000004492 methyl ester group Chemical group 0.000 description 1
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000006011 modification reaction Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 239000002808 molecular sieve Substances 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 229960004927 neomycin Drugs 0.000 description 1
- 101710015766 ngl Proteins 0.000 description 1
- 229910001453 nickel ion Inorganic materials 0.000 description 1
- 239000002736 nonionic surfactant Substances 0.000 description 1
- 229920002113 octoxynol Polymers 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 239000005416 organic matter Substances 0.000 description 1
- 235000006408 oxalic acid Nutrition 0.000 description 1
- OFBQJSOFQDEBGM-UHFFFAOYSA-N pentane Chemical compound CCCCC OFBQJSOFQDEBGM-UHFFFAOYSA-N 0.000 description 1
- 230000004962 physiological condition Effects 0.000 description 1
- 230000001402 polyadenylating Effects 0.000 description 1
- 229920002704 polyhistidine Polymers 0.000 description 1
- 229920000166 polytrimethylene carbonate Polymers 0.000 description 1
- 230000002335 preservative Effects 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- ATUOYWHBWRKTHZ-UHFFFAOYSA-N propane Chemical compound CCC ATUOYWHBWRKTHZ-UHFFFAOYSA-N 0.000 description 1
- 239000001294 propane Substances 0.000 description 1
- 235000019260 propionic acid Nutrition 0.000 description 1
- 230000004952 protein activity Effects 0.000 description 1
- 230000022811 protein deglycosylation Effects 0.000 description 1
- 239000012474 protein marker Substances 0.000 description 1
- 210000001938 protoplasts Anatomy 0.000 description 1
- 238000010926 purge Methods 0.000 description 1
- 101700043218 pyr4 Proteins 0.000 description 1
- 239000003638 reducing agent Substances 0.000 description 1
- 238000010992 reflux Methods 0.000 description 1
- 230000001718 repressive Effects 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 229960002920 sorbitol Drugs 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 229910001220 stainless steel Inorganic materials 0.000 description 1
- 239000010935 stainless steel Substances 0.000 description 1
- 239000008117 stearic acid Substances 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 230000002103 transcriptional Effects 0.000 description 1
- 238000004450 types of analysis Methods 0.000 description 1
- 101710035146 ura5 Proteins 0.000 description 1
- 229940005605 valeric acid Drugs 0.000 description 1
- 238000004065 wastewater treatment Methods 0.000 description 1
- 125000000969 xylosyl group Chemical group C1([C@H](O)[C@@H](O)[C@H](O)CO1)* 0.000 description 1
- SJZRECIVHVDYJC-UHFFFAOYSA-N γ-Hydroxybutyric acid Chemical compound OCCCC(O)=O SJZRECIVHVDYJC-UHFFFAOYSA-N 0.000 description 1
Abstract
The present invention relates to compositions comprising an aglycosylated polypeptide having cellobiase activity, and methods for producing and using the same.
Description
AGLYCOSYLATED ENZYME AND USES THEREOF
Related Applications [001] This application claims the benefit of U.S. Provisional Application No. 62/035,346, filed August 8, 2014; the entire contents of which are hereby incorporated by reference.
Sequence Listing [002] The instant application contains a Sequence Listing which has been submitted electronically in ASCII format and is hereby incorporated by reference in its entirety. Said ASCII copy, created on August 5, 2015, is named X2002-7000WO_SL.txt and is 76,949 bytes in size.
Field of the Invention [003] The présent invention relates generally to compositions having cellobiase activity and methods for producing the compositions described herein. The présent invention also provides methods for using such compositions, e.g., to process biomass materials.
Background ofthe Invention [004] Biomass-degrading enzymes, such as cellulases, xylanases, and ligninases, are important for the dégradation of biomass, such as feedstock. Cellulosic and lignocellulosic materials are produced, processed, and used in large quantifies in a number of applications. Often such materials are used once, and then discarded as waste, or are simply considered to be wasted materials, e.g., sewage, bagasse, sawdust, and stover
SüMMARY OF THE INVENTION [005] The présent invention is based, at least in part, on the suiprising discovery that a cellobiase from T. reesei that was expressed in a non-fungal cell line and isolated fforn a host cell that does not significantly glycosylate the enzyme had higher spécifie activity on pure substrate than the endogenous cellobiase (glycosylated and secreted) from T. reesei. Furthermore, when the aglycosylated cellobiase was used in a saccharification reaction with an enzyme mixture containing other saccharifying enzymes, a substantial increase in yield of sugar products was observed compared to the reaction without the aglycosylated cellobiase.
Therefore, provided herein are, an aglycosylated polypeptide having enzymatic activity, e.g., cellobiase activity, compositions comprising the aglycosylated polypeptide and methods for producing and using the compositions described herein.
[006] Accordingly, in one aspect, the disclosure features an aglycosylated polypeptide having cellobiase activity. In one embodiment, the aglycosylated polypeptide has increased cellobiase activity as compared to glycosylated Cel3A enzyme from wild-type T. reesei or a mutant thereof, such as T. reesei RUTC30. For example, the aglycosylated polypeptide can hâve increased substrate récognition or a more active substrate récognition site as compared to glycosylated Cel3A enzyme from wild-type T. reesei. In one embodiment, the aglycosylated polypeptide has reduced steric hindrance as compared to glycosylated Cel3A enzyme from wild-type T. reesei.
[007] In one embodiment, the aglycosylated polypeptide comprises at least 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% identity to SEQ ID NO: l, or a functional fragment thereof. In one embodiment, the aglycosylated polypeptide comprises (e.g., consists of) the amino acid sequence SEQ ID NO: 1. In another embodiment, the aglycosylated polypeptide differs by no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, or 20 amino acids from the amino acid sequence of SEQ ID NO: 1.
[008] In one embodiment, the polypeptide comprises a Cel3 A enzyme from wild-type T. reesei, or a mutant thereof, such as T. reesei RUTC30, or a functional variant or fragment thereof.
[009] In one embodiment, the aglycosylated polypeptide is encoded by a nucleic acid sequence, wherein the nucleic acid sequence comprises (e.g., consists of) at least 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% identity to SEQ ID NO: 2 or SEQ ID NO: 3. In one embodiment, the aglycosylated polypeptide is encoded by a nucleic acid sequence comprising (e.g., consisting of) SEQ ID NO: 2 or SEQ ID NO: 3.
[0010] In one embodiment, the aglcyosylated polypeptide is expressed by a gene that comprises a mutation proximal to or a mutation at one or more glycosylation site, wherein the mutation prevents glycosylation at the glycosylation site. In one embodiment, the mutation is at one or more of the threonine at amino acid position 78, the threonine at amino acid position 241, the serine at amino acid position 343, the serine at amino acid position 450, the threonine at amino acid position 599, the serine at amino acid position 616, the threonine at amino acid position 691, the serine at amino acid position 21, the threonine at amino acid position 24, the serine at amino acid position 25, the serine at amino acid position 28, the threonine at amino acid position 38, the threonine at amino acid position 42, the threonine at amino acid position 303, the serine at amino acid position at 398, the at serine amino acid position 435, the serine at amino acid position 436, the threonine at amino acid position 439, threonine at amino acid position 442, the threonine at amino acid position 446, the serine at amino acid position 451, the serine at amino acid position 619, the serine at amino acid position 622, the threonine at amino acid position 623, the serine at amino acid position 626, or the threonine at amino acid position 630 of SEQ ID NO: 1. A mutation proximal to one or more glycosylation site can prevent glycosylation at that site, e.g., by changing the conformation of the polypeptide or changing the consensus site recognized by the glycosylating enzyme such that glycosylation would not occur at the glycosylation site.
[0011] In one embodiment, the aglycosylated polypeptide hydrolyzes a carbohydrate, such as a dimer, a trimer, a tetramer, a pentamer, a hexamer, a heptamer, an octamer, or an oligomer of glucose; or an oligomer of glucose and xylose, into one or more monosaccharide, e.g., glucose. In one embodiment, the cellobiase activity comprises hydrolysis of a beta 1,4 glycosidic linkage of cellobiose.
[0012] In another aspect, the disclosure features a nucleic acid sequence encoding an aglycosylated polypeptide described herein.
[0013] In one embodiment, the nucleic acid sequence encodes a Cel3A enzyme or functional fragment thereof, wherein the nucleic acid sequence comprises (e.g., consists of) at least 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% identity to SEQ ID NO: 2 or SEQ ID NO: 3. In one embodiment, the nucleic acid sequence encodes a Cel3A enzyme, wherein the nucleic acid sequence comprises (e.g., consists of) SEQ ID NO: 2 or SEQ ID NO: 3.
[0014] In one aspect, the disclosure features a nucleic acid molécule that includes a nucleic acid sequence described herein.
[0015] In one embodiment, the nucleic acid molécule further includes a promoter, e.g., a promoter for prokaryotic cell expression, e.g., bacterial cell expression, e.g., expression in E. coli. In one embodiment, the promoter sequence is a constitutive promoter sequence, inducible promoter sequence, or a repressible promoter sequence. In one embodiment, the promoter is a T7 promoter.
[0016] In one embodiment, the nucleic acid molécule further comprises a nucleic acid sequence encoding a tag, e.g., a tag for détection and/or purification and/or for linkage to another molécule, e.g., a His tag.
[0017] In one embodiment, the nucleic acid molécule further comprises a nucleic acid encoding one or more signal sequences, e.g., a sécrétion signal sequence.
[0018] In one aspect, the disclosure features an expression vector comprising the nucleic acid sequence described herein or a nucleic acid molécule described herein.
[0019] In one embodiment, the vector further comprises a nucleic acid sequence encoding a sélection marker, e.g., a kanamycin or an ampicillin marker.
[0020] In one aspect, the disclosure features a cell comprising a vector described herein.
[0021] In one embodiment, the cell is a prokaryotic cell/bacterial cell, e.g., an E. coli cell,
e.g., an Origami E. coli cell.
[0022] In one embodiment, the cell expresses an aglycosylated polypeptide described herein.
[0023] In one embodiment, the cell is impaired for glycosylation.
[0024] In one aspect, the disclosure features a method for producing the aglycosylated polypeptide described herein, comprising culturing a cell expressing a polypeptide described herein, under conditions suitable for the expression of the polypeptide, wherein the cell does not glycosylate the polypeptide, e.g., a bacterial cell, e.g., an/7 coli cell, e.g., an origami E. coli cell.
[0025] In one aspect, the disclosure features a method for producing the aglycosylated polypeptide described herein, comprising treating a polypeptide comprising an amino acid sequence of SEQ ID NO: 1, or an amino acid sequence with at least 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% identity to SEQ ID NO: 1 with a deglycosylating enzyme. For example, a deglycosylating enzyme can be PGNase or EndoH.
[0026] In one aspect, the disclosure features a method for producing the aglycosylated polypeptide described herein, comprising culturing a cell that comprises a nucleic acid sequence encoding a polypeptide having at least 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% identity to SEQ ID NO: 1, wherein the nucleic acid has one or more mutations which prevent glycosylation of the encoded polypeptide, and optionally obtaining the aglycosylated polypeptide from the culture.
[0027] In one aspect, the disclosure features a method for culturing a cell expressing the aglycosylated polypeptide described herein in the presence of a glycosylation inhibitor. For example, the glycosylation inhibitor is tunicamycin.
[0028] In one aspect, the disclosure features an enzyme mixture comprising an aglycosylated polypeptide described herein and one or more additional enzyme, such as one or more glycosylated enzymes, e.g., cellulases from fungal cells. In one embodiment, the enzyme mixture comprises a glycosylated polypeptide comprising an amino acid sequence with at least 75%, 80%, 95%, 90%, 95%, 96%, 97%, 98%, 99% identity to SEQ ID NO: 1 and an aglycosylated polypeptide described herein, wherein both of the glycosylated polypeptide and the aglycosylated peptide hâve cellobiase activity.
[0029] In one embodiment, the enzyme mixture comprises a glycosylated polypeptide and an aglycosylated polypeptide and both the glycosylated polypeptide and the aglycosylated polypeptide both comprise Cel3 A enzyme from wild-type T. reesei or a mutant thereof, such as T. reesei RUTC30.
[0030] In one embodiment, the enzyme mixture further comprises at least one additional enzyme derived from a microorganism, wherein the additional enzyme has a biomassdegrading activity. In one embodiment, the additional enzyme is selected from a ligninase, an endoglucanase, a cellobiohydrolase, xylanase, and a cellobiase.
[0031] In one embodiment, the reaction mixture has a ratio between the aglycosylated polypeptide and the remaining enzymes of at least 1:32, e.g., 1:32 to 1:300. In one embodiment, the reaction mixture has a ratio of the aglycosylated polypeptide to a glycosylated polypeptide is at least 1:32, e.g., 1:32 to 1:300.
[0032] In one aspect, the disclosure features a method of producing a product (e.g., hydrogen, a sugar, an alcohol, etc.) from a biomass (or converting a biomass to a product) comprising contacting a biomass, e.g., a biomass that has been treated with radiation, e.g., radiation from an électron beam, with an aglycosylated polypeptide described herein, under condiaitions suitable for the production of a sugar product. In one embodiment, the method further comprises contacting the biomass with a microorganism that produces one or more biomass-degrading enzyme or an enzyme mixture comprising biomass-degrading enzymes, e.g., an enzyme mixture described herein.
[0033] In one aspect, the disclosure features a method of producing a product (e.g., hydrogen, sugar, alcohol, etc.) from a biomass (or converting a biomass to a product) comprising contacting a biomass with an enzyme mixture described herein under conditions suitable for the production of the product.
[0034] In one embodiment, the product is a sugar product, e.g., a sugar product described herein. In one embodiment, the sugar product is glucose and/or xylose, or other sugar products, such as fructose, arabinose, galactose, and cellobiose.
[0035] In one embodiment, the method further comprises isolating the sugar product. In one embodiment, the isolating of the sugar product comprises précipitation, crystallization, chromatography, centrifugation, and/or extraction.
[0036] In one embodiment, the enzyme mixture comprises at least two of the enzymes selected from B2AF03, CIP1, CIP2, Cella, Cel3a, Cel5a, Cel6a, Cel7a, Cel7b, Cell2a, Cel45a, Cel74a, paMan5a, paMan26a, Swollenin, or any of the enzymes listed in Table 1. In an embodiment, the enzymes listed above are isolated from a cell that expresses the enzyme heterologously or endogenously, e.g., Trichoderma reesei or Podospora anserina.
[0037] In one embodiment, the biomass comprises a starchy material or a starchy material that includes a cellulosic component. In some embodiments, the biomass comprises one or more of an agricultural product or waste, a paper product or waste, a forestry product, or a general waste, or any combination thereof; wherein: a) an agricultural product or waste comprises sugar cane jute, hemp, flax, bamboo, sisal, alfalfa, hay, arracacha, buckwheat, banana, barley, cassava, kudzu, oca, sago, sorghum, potato, sweet potato, taro, yams, beans, favas, lentils, peas, grasses, switchgrass, miscanthus, cord grass, reed canary grass, grain residues, canola straw, wheat straw, barley straw, oat straw, rice straw, corn cobs, corn stover, corn fiber, coconut hair, beet pulp, bagasse, soybean stover, grain residues, rice hulls, oat hulls, wheat chaff, barley hulls, or beeswing, or a combination thereof; b) a paper product or waste comprises paper, pigmented papers, loaded papers, coated papers, filled papers, magazines, printed matter, printer paper, polycoated paper, cardstock, cardboard, paperboard, or paper pulp, or a combination thereof; c) a forestry product comprises aspen wood, particle board, wood chips, or sawdust, or a combination thereof; and d) a general waste comprises manure, sewage, or offal, or a combination thereof. In one embodiment, the method further comprises a step of treating the biomass prior to introducing the microorganism or the enzyme mixture to reduce the recalcitrance of the biomass, e.g., by treating the biomass with bombardment with électrons, sonication, oxidation, pyrolysis, steam explosion, chemical treatment, mechanical treatment, and/or freeze grinding.
[0038] In one embodiment, the microorganism that produces a biomass-degrading enzyme is from species in the généra selected from Bacillus, Coprinus, Myceliophthora, Cephalosporium, Scytalidium, Pénicillium, Aspergillus, Pseudomonas, Humicola, Fusarium, Thielavia, Acremonium, Chrysosporium or Trichoderma. In one embodiment, the microorganism that produces a biomass-degrading enzyme is selected from Aspergillus, Humicola insolens (Scytalidium thermophilum), Coprinus cinereus, Fusarium oxysporum, Myceliophthora thermophila, Meripilus giganteus, Thielavia terrestris, Acremonium persicinum, Acremonium acremonium, Acremonium brachypenium, Acremonium dichromosporum, Acremonium obclavatum, Acremonium pinkertoniae, Acremonium roseogriseum, Acremonium incoloratum, Acremonium furatum, Chrysosporium lucknowense, Trichoderma viride, Trichoderma reesei, or Trichoderma koningii.
[0039] In one embodiment, the microorganism has been induced to produce biomassdegrading enzymes by combining the microorganism with an induction biomass sample under conditions suitable for increasing production of biomass-degrading enzymes compared to an uninduced microorganism. In one embodiment, the induction biomass sample comprises paper, paper products, paper waste, paper pulp, pigmented papers, loaded papers, coated papers, filled papers, magazines, printed matter, printer paper, polycoated paper, card stock, cardboard, paperboard, cotton, wood, particle board, forestry wastes, sawdust, aspen wood, wood chips, grasses, switchgrass, miscanthus, cord grass, reed canary grass, grain residues, rice hulls, oat hulls, wheat chaff, barley hulls, agricultural waste, silage, canola straw, wheat straw, barley straw, oat straw, rice straw, jute, hemp, flax, bamboo, sisal, abaca, corn cobs, corn stover, soybean stover, corn fiber, alfalfa, hay, coconut haïr, sugar processing residues, bagasse, beet pulp, agave bagasse, algae, seaweed, manure, sewage, offal, agricultural or industrial waste, arracacha, buckwheat, banana, barley, cassava, kudzu, oca, sago, sorghum, potato, sweet potato, taro, yams, beans, favas, lentils, peas, or any combination thereof.
Brief Description of the Drawings [0040] FIG. 1 is a picture of purified Cel3a that was expressed in E. coli. Lane 1 represents molecular weight marker (Précision Plus Ail Blue Protein marker, Biorad); lane 2 represents purified Cel3a-His protein.
[0041] FIG. 2 is a chromatographie profile of the Cel3a-N’His sample, showing the détection of Cel3a-N’His at 31 minutes.
[0042] FIG. 3 is a profile showing no evidence of glycosylation in the mass spectral région for the peak in the chromatographie profile of the Cel3a-N’His at 31 minutes.
[0043] FIG. 4 is a profile showing the deconvolution of the charge state envelope, identifying the molecular weight of the major component (aglycosylated Cel3a-N’His) and the minor components (modified Cel3a-N’His).
[0044] FIG. 5 is a graph showing the cellobiase activity, as detennined by cellobiase assay, of a purified Cel3a-N’His.
[0045] FIG. 6 is a graph showing a standard curve for cellobiase activity generated for a known concentration of Cel3a-N’His that can be used to détermine the concentration of Cel3a of a sample with an unknown concentration of Cel3a.
[0046] FIG. 7 is a graph showing that spécifie activity of recombinant Cel3a compared to endogenous cellobiase from T. Reesei (L4196).
FIG. 8 is a graph showing the yield of glucose product from a saccharification reaction performed with a standard enzyme mix compared to the standard enzyme mix (L331 control) with the addition of aglycosylated cellobiase Cel3a (L331 (0.8mg/ml Cel3a).
Detailed Description
Définitions [0047] Unless defined otherwise, ail technical and scientifîc terms used herein hâve the same meaning as commonly understood by one of ordinary skill in the art to which the invention pertains.
[0048] The tenu “a” and “an” refers to one or to more than one (i.e., to at least one) of the grammatical object of the article. By way of example, “an element” means one element or more than one element.
[0049] The term “aglycosylated”, as used herein, refers to a molécule, e.g., a polypeptide, that is not glycosylated (i.e., it comprises a hydroxyl group or other functional group that is not attached to a glycosylate group) at one or more sites which has a glycan attached when the molécule is produced in its native environment. For example, a Cel3A enzyme is aglycosylated when one or more site in the protein that normally has a glycan group attached to it when the Cel3 A enzyme is produced in T. reesei does not hâve a glycan attached at that site. In some embodiments, the aglycosylated molécule does not hâve any attached glycans. In one embodiment, the molécule has been altered or mutated such that the molécule cannot be glycosylated, e.g., one or more glycosylation site is mutated such that a glycan cannot be attached to the glycosylation site. In another embodiment, an attached glycan can be removed from the molécule, e.g., by an enzymatic process, e.g., by incubating with enzymes that remove glycans or hâve deglycosylating activity. In yet another embodiment, glycosylation of the molécule can be inhibited, e.g., by use of a glycosylation inhibitor (that inhibits a glycosylating enzyme). In another embodiment, the molécule, e.g., the polypeptide, can be produced by a host cell that does not glycosylate, e.g., E. coli.
[0050] The term “biomass”, as used herein, refers to any non-fossilized, organic matter. Biomass can be a starchy material and/or a cellulosic, hemicellulosic, or lignocellulosic material. For example, the biomass can be an agricultural product, a paper product, forestry product, or any intermediate, byproduct, residue or waste thereof, or a general waste. The biomass may be a combination of such materials. In an embodiment, the biomass is processed, e.g., by a saccharification and/or a fermentation reaction described herein, to produce products, such as sugars, alcohols, organic acids, or biofuels.
[0051] The terni “biomass degrading enzymes”, as used herein, refers to enzymes that break down components of the biomass matter described herein into inteimediates or final products. For example, biomass-degrading enzymes include at least amylases, e.g., alpha, beta or gamma amylases, cellulases, hemicellulases, ligninases, endoglucancases, cellobiases, xylanases, and cellobiohydrolases. Biomass-degrading enzymes are produced by a wide variety of microorganisms, and can be isolated from the microorganisms, such as T. reesei. The biomass degrading enzyme can be endogenously expressed or heterologously expressed.
[0052] The term “cellobiase”, as used herein, refers to an enzyme that catalyzes the hydrolysis of a dimer, trimer, tetramer, pentamer, hexamer, heptamer, octamer, or an oligomer of glucose, or an oligomer of glucose and xylose, to glucose and/or xylose. For example, the cellobiase is beta-glucosidase, which catalyzes beta-1,4 bonds in cellobiose to release two glucose molécules.
[0053] The term “cellobiase activity”, as used herein, refers to activity of a category of cellulases that catalyze the hydrolysis of cellobiose to glucose, e.g., catalyzes the hydrolysis of beta-D-glucose residues to release beta-D-glucose. Cellobiase activity can be determined according to the assays described herein, e.g., in Example 6. One unit of cellobiase activity can be defined as [glucose] g/L / [Cel3a] g/L / 30 minutes.
[0054] The term “cellobiohydrolase” as used herein, refers to an enzyme that hydrolyzes glycosidic bonds in cellulose. For example, the cellobiohydrolase is 1 ,4-beta-D-glucan cellobiohydrolase, which catalyzes the hydrolysis of 1,4-beta-D-glucosidic linkages in cellulose, cellooligosaccharides, or any beta- 1,4-linked glucose containing polymer, releasing oligosaccharides from the polymer chain.
[0055] The term “endoglucanase” as used herein, refers to an enzyme that catalyzes the hydrolysis of internai β-1,4 glucosidic bonds of cellulose. For example, the endoglucanase is endo-1,4-(1,3; 1,4)-beta-D-glucan 4-glucanohydrolase, which catalyses endohydrolysis of 1,4beta-D-glycosidic linkages in cellulose, cellulose dérivatives (such as carboxymethyl cellulose and hydroxyethyl cellulose), lichenan, beta- 1,4 bonds in mixed beta- 1, 3 glucans such as cereal beta-D-glucans or xyloglucans, and other plant material containing cellulosic components.
[0056] The term “enzyme mixture” as used herein, refers to a combination of at least two different enzymes, or at least two different variants of an enzyme (e.g., a glycosylated and an aglycosylated version of an enzyme). The enzyme mixture referred to herein includes at least the aglycosylated polypeptide having cellobiase activity described herein. In one embodiment, the enzyme mixture includes one or more of a cellobiase, an endoglucanase, a cellobiohydrolase, a ligninase, and/or a xylanase. In some embodiments, the enzyme mixture includes a cell, e.g., a microorganism, which expresses and, e.g., sécrétés, one or more of the enzymes. For example, the enzyme mixture can include an aglycosylated polypeptide described herein and a cell, e.g., a microorganism, which expresses and, e.g., sécrétés, one or more additional enzymes and/or variants of the polypeptide.
[0057] The tenu “ligninase” as used herein, refers to an enzyme that catalyzes the breakdown of lignin, commonly found in the cell walls of plants, such as by an oxidation reaction.
[0058] The tenus “nucleic acid” or “polynucleotide” are used interchangeable, and refer to deoxyribonucleic acids (DNA) or ribonucleic acids (RNA) and polymers thereof in either single- or double-stranded form. Unless specifically limited, the term encompasses nucleic acids containing known analogues of natural nucléotides that hâve similar binding properties as the reference nucleic acid and are metabolized in a manner similar to naturally occumng nucléotides. Unless otherwise indicated, a particular nucleic acid sequence also implicitly encompasses conservatively modified variants thereof (e.g., degenerate codon substitutions), alleles, orthologs, SNPs, and complementary sequences as well as the sequence explicitly indicated. Specifically, degenerate codon substitutions may be achieved by generating sequences in which the third position of one or more selected (or ail) codons is substituted with mixed-base and/or deoxyinosine residues (Batzer et al., Nucleic Acid Res. 19:5081 (1991); Ohtsuka et al., J. Biol. Chem. 260:2605-2608 (1985); and Rossolini et al., Mol. Cell. Probes 8:91-98 (1994)).
[0059] The term “operably linked”, as used herein, refers to a configuration in which a control or regulatory sequence is placed at a position relative to a nucleic acid sequence that encodes a polypeptide, such that the control sequence influences the expression of a polypeptide (encoded by the DNA sequence). In an embodiment, the control or regulatory sequence is upstream of a nucleic acid sequence that encodes a polypeptide with cellobiase activity. In an embodiment, the control or regulatory sequence is downstream of a nucleic acid sequence that encodes a polypeptide with cellobiase activity.
[0060] The terms “peptide,” “polypeptide,” and “protein” are used interchangeably, and refer to a compound comprised of amino acid residues covalently linked by peptide bonds. A protein or peptide must contain at least two amino acids, and no limitation is placed on the maximum number of amino acids that can comprise a protein’s or peptide’s sequence. Polypeptides include any peptide or protein comprising two or more amino acids joined to each other by peptide bonds. “Polypeptides” include, for example, biologically active fragments, substantially homologous polypeptides, oligopeptides, homodimers, heterodimers, variants of polypeptides, modifîed polypeptides, dérivatives, analogs, fusion proteins, among others. A polypeptide includes a naturel peptide, a recombinant peptide, or a combination thereof.
[0061] The term “promoter”, as used herein, refers to a DNA sequence recognized by the synthetic machinery of the cell, or introduced synthetic machinery, required to initiate the spécifie transcription of a polynucleotide sequence.
[0062] The term “regulatory sequence” or “control sequence”, as used interchangeably herein, refers to a nucleic acid sequence which is required for expression of a nucleic acid product. In some instances, this sequence may be a promoter sequence and in other instances, this sequence may also include an enhancer sequence and other regulatory éléments which are required for expression ofthe gene product. The regulatory/control sequence may, for example, be one which expresses the nucleic acid product in a regulated manner, e.g., inducible manner.
[0063] The term “constitutive” promoter refers to a nucléotide sequence which, when operably linked with a polynucleotide which encodes a polypeptide, causes the polypeptide to be produced in a cell under most or ali physiological conditions of the cell. In an embodiment, the polypeptide is a polypeptide having cellobiase activity.
[0064] The term “inducible” promoter refers to a nucléotide sequence which, when operably linked with a polynucleotide which encodes a polypeptide, causes the polypeptide to be produced in a cell substantially only when an inducer which corresponds to the promoter is présent in the cell. In an embodiment, the polypeptide is a polypeptide having cellobiase activity.
[0065] The term “repressible” promoter refers to a nucléotide sequence, which when operably linked with a polynucleotide which encodes a polypeptide, causes the polypeptide to be produced in a cell substantially only until a repressor which corresponds to the promoter is présent in the cell. In an embodiment, the polypeptide is a polypeptide having cellobiase activity.
[0066] The term “xylanase” as used herein, refers to enzymes that hydrolyze xylancontaining material. Xylan is polysaccharide comprising units of xylose. A xylanase can be an endoxylanase, a beta-xylosidase, an arabinofuranosidase, an alpha-glucuronidase, an acetylxylan esterase, a feruloyl esterase, or an alpha-glucuronyl esterase.
DESCRIPTION [0067] Glycosylation is thought to play a critical rôle in enzyme structure and function, such as enzyme activity, solubility, stability, folding, and/or sécrétion. Accordingly, processes for converting biomass into biofuels and other products hâve focused on producing and utilizing glycosylated enzymes, e.g., cellobiases, for use in saccharification of cellulosic and/or lignocellulosic materials. Enzymes for use in saccharification are typically produced in eukaryote host cell lines that properly glycosylate, fold, and secrete the proteins, such as Pichia pastoris.
[0068] The présent invention is based, at least in part, on the suiprising discovery that a cellobiase from T. reesei that was expressed in a non-fungal cell line and isolated from a host cell that does not significantly glycosylate the enzyme had higher spécifie activity on pure substrate than the endogenous cellobiase (glycosylated and secreted) from T. reesei. Furthermore, when the aglycosylated cellobiase was used in a saccharification reaction with an enzyme mixture containing other saccharifying enzymes, a substantial increase in yield of sugar products was observed compared to the reaction without the aglycosylated cellobiase. Therefore, provided herein are, an aglycosylated polypeptide having enzymatic activity, e.g., cellobiase activity, compositions comprising the aglycosylated polypeptide and methods for producing and using the compositions described herein.
Polypeptides and Variants [0069] The présent disclosure provides an aglycosylated polypeptide with cellobiase activity. In an embodiment, the aglycosylated polypeptide is a cellobiase. A cellobiase is an enzyme that hydrolyzes beta-1,4 bonds in its substrate, cellobiose, to release two glucose molécules. Cellobiose is a water soluble 1,4-linked dimer of glucose.
[0070] Cel3a (also known as Bgll) is a cellobiase that was identified in Trichoderma reesei. The amino acid sequence for Cel3a (GenBank Accession No. NW_006711153) is provided below :
MGDSHSTSGASAEAVVPPAGTPWGTAYDKAKAALAKLNLQDKVGIVSGVGWNGGPC VGNTSPASKISYPSLCLQDGPLGVRYSTGSTAFTPGVQAASTWDVNLIRERGQFIGEEV KASGIHVILGPVAGPLGKTPQGGRNWEGFGVDPYLTGIAMGQTINGIQSVGVQATAKH YILNEQELNRETISSNPDDRTLHELYTWPFADAVQANVASVMCSYNKVNTTWACEDQ YTLQTVLKDQLGFPGYVMTDWNAQHTTVQSANSGLDMSMPGTDFNGNNRLWGPAL TNAVNSNQVPTSRVDDMVTRILAAWYLTGQDQAGYPSFNISRNVQGNHKTNVRAIAR DGIVLLKNDANILPLKKPASIAVVGSAAIIGNHARNSPSCNDKGCDDGALGMGWGSGA VNYPYFVAPYDAINTRASSQGTQVTLSNTDNTSSGASAARGKDVAIVFITADSGEGYIT VEGNAGDRNNLDPWHNGNALVQAVAGANSNVIVVVHSVGAIILEQILALPQVKAVV WAGLPSQESGNALVDVLWGDVSPSGKLVYTIAKSPNDYNTRIVSGGSDSFSEGLFIDY KHFDDANITPRYEFGYGLSYTKFNYSRLSVLSTAKSGPATGAWPGGPSDLFQNVATV TVDIANSGQVTGAEVAQLYITYPSSAPRTPPKQLRGFAKLNLTPGQSGTATFNIRRRDL SYWDTASQKWVVPSGSFGISVGASSRDIRLTSTLSVAGSGS (SEQ ID NO: 1) [0071] The présent disclosure also provides functional variants of an aglycosylated polypeptide having cellobiase activity described herein, e.g., Cel3a. In an embodiment, a functional variant has an amino acid sequence with at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identity to a Cel3a described herein, or a functional fragment thereof, e.g., at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identity to a Cel3a described herein, or a functional fragment thereof. In an embodiment, a functional variant has an amino acid sequence with at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91% identity, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identity to SEQ ID NO: 1, or a functional fragment thereof.
[0072] Percent identity in the context of two or more amino acid or nucleic acid sequences, refers to two or more sequences that are the same. Two sequences are substantially identical if two sequences hâve a specified percentage of amino acid residues or nucléotides that are the same (e.g., 60% identity, optionally 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity over a specified région, or, when not specified, over the entire sequence), when compared and aligned for maximum correspondence over a comparison window, or designated région as measured using one of the following sequence comparison algorithms or by manual alignment and visual inspection. Optionally, the identity exists over a région that is at least about 50 nucléotides, 100 nucléotides, 150 nucléotides, in length. More preferably, the identity exists over a région that is at least about 200 or more amino acids, or at least about 500 or 1000 or more nucléotides, in length.
[0073] For sequence comparison, one sequence typically acts as a reference sequence, to which one or more test sequences are compared. When using a sequence comparison algorithm, test and reference sequences are entered into a computer, subsequence coordinates are designated, if necessaiy, and sequence algorithm program parameters are designated. Default program parameters can be used, or alternative parameters can be designated. The sequence comparison algorithm then calculâtes the percent sequence identities for the test sequences relative to the reference sequence, based on the program parameters. Methods of alignment of sequences for comparison are well known in the art. Optimal alignment of sequences for comparison can be conducted, e.g., by the local homology algorithm of Smith and Waterman, (1970) Adv. Appl. Math. 2:482c, by the homology alignment algorithm of Needleman and Wunsch, (1970) J. Mol. Biol. 48:443, by the search for similarity method of Pearson and Lipman, (1988) Proc. Nat’l. Acad. Sci. USA 85:2444, by computerized implémentations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, WI), or by manual alignment and visual inspection (see, e.g., Brent et al., (2003) Current Protocols in Molecular Biology).
[0074] Two examples of algorithms that are suitable for determining percent sequence identity and sequence similarity are the BLAST and BLAST 2.0 algorithms, which are described in Altschul et al., (1977) Nue. Acids Res. 25:3389-3402; and Altschul et al., (1990) J.
Mol. Biol. 215:403-410, respectively. Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information.
[0075] Functional variants may comprise one or more mutations, such that the variant retains cellobiase activity that is better, e.g., increased in comparison to, than the cellobiase activity of an ezyme of SEQ ID NO: 1 produced by T. reesei. In an embodiment, the functional variant has at least 10%, at least 20%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, or at least 99% (e.g., at least 80%, at least 85%, at least 90%, at least 95%, or at least 99%) of the cellobiase activity as an aglycosylated version of SEQ ID NO: 1 as produced by E. coli. Cellobiase activity can be tested using the functional assays described herein.
[0076] In another embodiment, the aglycosylated polypeptide differs by no more than 1, no more than 2, no more than 3, no more than 4, no more than 5, no more than 6, no more than 7, no more than 8, no more than 9, no more than 10, no more than 15, no more than 20, no more than 30, no more than 40, or no more than 50 amino acids from a reference amino acid sequence, e.g., the amino acid sequence of SEQ ID NO: 1.
[0077] The mutations présent in a fùnctional variant include amino acid substitutions, additions, and délétions. Mutations can be introduced by standard techniques known in the art, such as site-directed mutagenesis and PCR-mediated mutagenesis. The mutation may be a conservative amino acid substitution, in which the amino acid residue is replaced with an amino acid residue having a similar side chain. Families of amino acid residues having similar side chains hâve been defined in the art. These families include amino acids with basic side chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine, tryptophan), nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, méthionine), beta-branched side chains (e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, histidine). Thus, one or more amino acid residues within the polypeptide having cellobiase activity of the disclosure can be replaced with other amino acids from the same side chain family, and the résultant polypeptide retains cellobiase activity comparable (e.g., at least 80%, 85%, 90%, 95%, or 99% ofthe cellobiase activity) to that of the wild-type polypeptide. Alternatively, the mutation may be an amino acid substitution in which an amino acid residue is replaced with an amino acid residue having a different side chain.
[0078] Such mutations may alter or affect various enzymatic characteristics of the cellobiase. For example, such mutations may alter or affect the cellobiase activity, thermostability, optimal pH for reaction, enzyme kinetics, or substrate récognition of the cellobiase. In some embodiments, a mutation increases the cellobiase activity of the variant in comparison to the cellobiase produced by T. reesei and/or SEQ ID NO: 1 produced in E.coli. In some embodiments, a mutation increases or decreases the thermostability of the variant in comparison to wild-type cellobiase and/or SEQ ID NO: 1 produced in E.coli. In an embodiment, a mutation changes the pH range at which the variant optimally performs the cellobiase reaction in comparison to wild-type cellobiase and/or SEQ ID NO:1 produced in E.coli. In an embodiment, a mutation increases or decreases the kinetics of the cellobiase reaction (e.g., kcat, Km or Ko) in comparison to wild-type cellobiase and/or SEQ ID NO: 1 produced in E.coli. In an embodiment, a mutation increases or decreases the ability of the cellobiase to recognize or bind to the substrate (e.g., cellobiose) in comparison to wild-type cellobiase and/or SEQ ID NO:1 produced in E.coli.
[0079] The présent invention also provides functional fragments of a polypeptide having cellobiase activity as described herein, e.g., Cel3a or SEQ ID NO: 1. One of ordinary skill in the art could readily envision that a fragment of a polypeptide having cellobiase activity as described herein that contains the functional domains responsible for enzymatic activity would retain functional activity, e.g., cellobiase activity, and therefore, such fragments are encompassed in the présent invention. In an embodiment, the functional fragment is at least 700 amino acids, at least 650 amino acids, at least 600 amino acids, at least 550 amino acids, at least 500 amino acids, at least 450 amino acids, at least 400 amino acids, at least 350 amino acids, at least 300 amino acids, at least 250 amino acids, at least 200 amino acids, at least 150 amino acids, at least 100 amino acids, or at least 50 amino acids in length. In an embodiment, the functional fragment is 700 to744 amino acids, 650 to 699 amino acids, 600 to 649 amino acids, 550 to 599 amino acids, 500 to 549 amino acids, 450 to 499 amino acids, 400 to 449 amino acids, 350 to 399 amino acids, 300 to 349 amino acids, 250 to 299 amino acids, 200 to 249 amino acids, 150 to 199 amino acids, 100 to 149 amino acids, or 50 to 99 amino acids. With regard to the ranges of amino acid length described above, the lowest and highest values of amino acid length are included within each disclosed range. In an embodiment, the functional fragment has at least 10%, at least 20%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, or at least 99% of the cellobiase activity as wild-type Cel3a or the polypeptide comprising SEQ ID NO: 1 produced in E.coli.
[0080] Assays for detecting cellobiase activity are known in the art. For example, détection of the amount of glucose released from cellobiose can be detennined by incubating purified cellobiase with substrate, e.g., cellobiose, D-(+)-cellobiose, and detecting the résultant amount of free glucose after completion of the reaction. The amount of free glucose can be determined using a variety of methods known in the art. For example, dilutions of purified cellobiase are prepared in a buffer containing 50mM sodium citrate monobasic, pH 5.0 NaOH. The cellobiose substrate is added to the purified cellobiase in an amount such that the final concentration of cellobiose in the reaction mixture is 30mM. The reaction mixture is incubated under conditions suitable for the reaction to occur, e.g., in a shaker (700 rpm) at 48°C for 30 minutes. To stop the reaction, the reaction mixture is heated for 5 minutes at 100°C. The reaction mixture is filtered through a 0.45 pm filter and the fîltrate is analyzed to quantify the amount of glucose and/or cellobiase. A YSI instrument that measures analytes such as glucose can be used to détermine the concentration of glucose produced from the reaction.
Altematively, UPLC (Ultra Performance Liquid Chromatography) can be used to détermine the concentration of glucose and cellobiose from the reaction. This assay can be formatted in a single reaction or in multiple reaction formats, e.g., 96 well format. In some embodiments, the multiple reaction format may be preferred to generate an activity curve representing cellobiase activity with respect to different concentrations of the purified cellobiase. The concentration of the purified cellobiase can be determined using a standard Bradford assay. Dilutions of the purified cellobiase assay are prepared, e.g., 2-fold dilutions, and are aliquoted into a 96 well plate, e.g., 12 wells of 2-fold dilutions. Cellobiose substrate is added as previously described, such that the final concentration of cellobiase in the reaction is 30mM. The plate is sealed and treated under conditions sufficient for the cellobiase reaction to occur, and then under conditions to stop the reaction. The reaction is then filtered through a 96 well format 0.45 pm membrane (e.g., Durapore) and analyzed by YSI and/or HPLC methods, e.g., UPLC.
[0081] This activity assay can also be used to détermine the concentration, or titer, of a Cel3a in a sample by generating a standard curve of activity of dilutions of a Cel3a sample with a known concentration. The activity of dilutions of the sample with unknown concentration is determined and compared with the standard curve to identify an approximate concentration based on the standard curve. This method is described in further detail in Example 6.
[0082] In other embodiments, a colorimetric/fluorometric assay can be used. The purified cellobiase is incubated with substrate cellobiose under conditions for the reaction to occur. Détection of the product glucose is as follows. Glucose oxidase is added to the mixture, which oxidizes glucose (the product) to gluconic acid and hydrogen peroxide. Peroxidase and odianisidine is then added. O-dianisidine reacts with the hydrogen peroxide in the presence of peroxidase to form a colored product. Sulfuric acid is added, which reacts with the oxidized odianisidine reacts to form a more stable colored product. The intensity of the color when measured, e.g., by spectrophotometer or colorimeter, e.g., at 540nm, is directly proportional to the glucose concentration. Such colorimetric/fluorometric glucose assays are commercially available, for example from Sigma Aldrich, Catalog No. GAGO-20.
[0083] For ail of the polypeptides having cellobiase activity described above, the polypeptides are aglycosylated using the methods for producing aglycosylated polypeptides described herein.
[0084] Glycosylation is the enzymatic process by which a carbohydrate is attached to a glycosyl acceptor, e.g., the nitrogen of arginine or asparginine side chains or the hydroxyl oxygen of serine, threonine, or tyrosine side chains. There are two types of glycosylation: Nlinked and O-linked glycosylation. N-linked glycosylation occurs at consensus site Asn-XSer/Thr, wherein the X can be any amino acid except a praline. O-linked glycosylation occurs at Ser/Thr residues. Glycosylation sites can be predicted using various algorithme known in the art, such as Prosite, publicly available by the Swiss Institute of Bioinformatics, and NetNGlyc 1.0 or NetOGlyc 4.0, publicly available by the Center for Biological Sequence Analysis.
[0085] In embodiments, the functional variant contains one or more mutations wherein one or more glycoslyation sites présent in the Cel3a polypeptide expressed by a nucleic acid sequence described herein that has been mutated such that a glycan can no longer be attached or linked to the glycosylation site. In another embodiment, the functional variant contains one or more mutations proximal to one or more glycosylation sites présent in the Cel3a polypeptide expressed by a nucleic acid sequence described herein that has been mutated such that a glycan can no longer be attached or linked to the glycosylation site. For example, the mutation proximal to a glycosylation site mutâtes the consensus motif recognized by the glycosylating enzyme, or changes the conformation of the polypeptide such that the polypeptide cannot be glycosylated, e.g., the glycoslation site is hidden or steric hindrance due to the new conformation prevents the glycosylating enzymes from accessing the glycosylation site. A mutation proximal to a glycosylation site in the Cel3a polypeptide is directly adjacent to, or at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 15, at least 20, at least 30 or at least 40 amino acids from the glycosylation site that, as a resuit of the proximal mutation, will not be glycosylated.
[0086] In an embodiment, one or more of the following glycosylation sites of Cel3a, or SEQ ID NO: 1, are mutated: the threonine at amino acid position 78, the threonine at amino acid position 241, the serine at amino acid position 343, the serine at amino acid position 450, the threonine at amino acid position 599, the serine at amino acid position 616, the threonine at amino acid position 691, the serine at amino acid position 21, the threonine at amino acid position 24, the serine at amino acid position 25, the serine at amino acid position 28, the threonine at amino acid position 38, the threonine at amino acid position 42, the threonine at amino acid position 303, the serine at amino acid position at 398, the at serine amino acid position 435, the serine at amino acid position 436, the threonine at amino acid position 439, threonine at amino acid position 442, the threonine at amino acid position 446, the serine at amino acid position 451, the serine at amino acid position 619, the serine at amino acid position 622, the threonine at amino acid position 623, the serine at amino acid position 626, or the threonine at amino acid position 630, or any combination thereof. In embodiments, the glycosylation site is mutated from a serine or threonine to an alanine. For example, the aglycosylated polypeptide described herein has one or more of the following mutations: T78A, T241A, S343A, S450A, T599A, S616A, T691A, S21A, T24A, S25A, S28A, T38A, T42A, T303A, T398A, S435A, S436A, T439A, T442A, T446A, S451A, S619A, S622A, T623A, S626A, or T630A, or any combination thereof. Alternatively, one or more amino acids proximal to the glycosylation sites described above is mutated.
[0087] Assays to detect whether a polypeptide is modified by a glycan (e.g., whether the polypeptide is glycosylated or aglycosylated) are known in the art. The polypeptide can be purified or isolated and can be stained for détection and quantification of glycan moieties, or the polypeptide can be analyzed by mass spectrometry, and compared to a corresponding reference polypeptide. The reference polypeptide has the same primary sequence as the test polypeptide (of which the glycosylation state is to be determined), but is either glycosylated or aglycosylated.
[0088] The aglycosylated polypeptides described herein hâve increased cellobiase activity compared to a corresponding glycosylated polypeptide, e.g., glycosylated Cel3a polypeptide. For example, the aglycosylated polypeptide having cellobiase activity has at least 1%, 2%, 5%, 10%, 15%, 20%, 25%, 30%, 40%, 50%, 60%, 70%, 80%, 90% , 100%, or 200% cellobiase activity compared to the glyocyosylated polypeptide.
Nucleic Acids [0089] The présent invention also provides a nucleic acid sequence encoding a polypeptide having cellobiase activity of the présent invention. In an embodiment, the nucleic acid sequence encodes a Cel3a enzyme or a functional fragment thereof with the amino acid sequence described herein.
[0090] In an embodiment, the nucleic acid sequence that encodes Cel3a is provided below:
ATGCGTTACCGAACAGCAGCTGCGCTGGCACTTGCCACTGGGCCCTTTGCTAGGGCAGACAGTCACTCAACATCGG GGGCCTCGGCTGAGGCAGTTGTACCTCCTGCAGGGACTCCATGGGGAACCGCGTACGACAAGGCGAAGGCCGCATT GGCAAAGCTCAATCTCCAAGATAAGGTCGGCATCGTGAGCGGTGTCGGCTGGAACGGCGGTCCTTGCGTTGGAAAC ACATCTCCGGCCTCCAAGATCAGCTATCCATCGCTATGCCTTCAAGACGGACCCCTCGGTGTTCGATACTCGACAG GCAGCACAGCCTTTACGCCGGGCGTTCAAGCGGCCTCGACGTGGGATGTCAATTTGATCCGCGAACGTGGACAGTT CATCGGTGAGGAGGTGAAGGCCTCGGGGATTCATGTCATACTTGGTCCTGTGGCTGGGCCGCTGGGAAAGACTCCG CAGGGCGGTCGCAACTGGGAGGGCTTCGGTGTCGATCCATATCTCACGGGCATTGCCATGGGTCAAACCATCAACG GCATCCAGTCGGTAGGCGTGCAGGCGACAGCGAAGCACTATATCCTCAACGAGCAGGAGCTCAATCGAGAAACCAT TTCGAGCAACCCAGATGACCGAACTCTCCATGAGCTGTATACTTGGCCATTTGCCGACGCGGTTCAGGCCAATGTC GCTTCTGTCATGTGCTCGTACAACAAGGTCAATACCACCTGGGCCTGCGAGGATCAGTACACGCTGCAGACTGTGC TGAAAGACCAGCTGGGGTTCCCAGGCTATGTCATGACGGACTGGAACGCACAGCACACGACTGTCCAAAGCGCGAA TTCTGGGCTTGACATGTCAATGCCTGGCACAGACTTCAACGGTAACAATCGGCTCTGGGGTCCAGCTCTCACCAAT GCGGTAAATAGCAATCAGGTCCCCACGAGCAGAGTCGACGATATGGTGACTCGTATCCTCGCCGCATGGTACTTGA CAGGCCAGGACCAGGCAGGCTATCCGTCGTTCAACATCAGCAGAAATGTTCAAGGAAACCACAAGACCAATGTCAG GGCAATTGCCAGGGACGGCATCGTTCTGCTCAAGAATGACGCCAACATCCTGCCGCTCAAGAAGCCCGCTAGCATT GCCGTCGTTGGATCTGCCGCAATCATTGGTAACCACGCCAGAAACTCGCCCTCGTGCAACGACAAAGGCTGCGACG ACGGGGCCTTGGGCATGGGTTGGGGTTCCGGCGCCGTCAACTATCCGTACTTCGTCGCGCCCTACGATGCCATCAA TACCAGAGCGTCTTCGCAGGGCACCCAGGTTACCTTGAGCAACACCGACAACACGTCCTCAGGCGCATCTGCAGCA AGAGGAAAGGACGTCGCCATCGTCTTCATCACCGCCGACTCGGGTGAAGGCTACATCACCGTGGAGGGCAACGCGG
GCGATCGCAACAACCTGGATCCGTGGCACAACGGCAATGCCCTGGTCCAGGCGGTGGCCGGTGCCAACAGCAACGT CATTGTTGTTGTCCACTCCGTTGGCGCCATCATTCTGGAGCAGATTCTTGCTCTTCCGCAGGTCAAGGCCGTTGTC TGGGCGGGTCTTCCTTCTCAGGAGAGCGGCAATGCGCTCGTCGACGTGCTGTGGGGAGATGTCAGCCCTTCTGGCA AGCTGGTGTACACCATTGCGAAGAGCCCCAATGACTATAACACTCGCATCGTTTCCGGCGGCAGTGACAGCTTCAG CGAGGGACTGTTCATCGACTATAAGCACTTCGACGACGCCAATATCACGCCGCGGTACGAGTTCGGCTATGGACTG TCTTACACCAAGTTCAACTACTCACGCCTCTCCGTCTTGTCGACCGCCAAGTCTGGTCCTGCGACTGGGGCCGTTG TGCCGGGAGGCCCGAGTGATCTGTTCCAGAATGTCGCGACAGTCACCGTTGACATCGCAAACTCTGGCCAAGTGAC TGGTGCCGAGGTAGCCCAGCTGTACATCACCTACCCATCTTCAGCACCCAGGACCCCTCCGAAGCAGCTGCGAGGC TTTGCCAAGCTGAACCTCACGCCTGGTCAGAGCGGAACAGCAACGTTCAACATCCGACGACGAGATCTCAGCTACT GGGACACGGCTTCGCAGAAATGGGTGGTGCCGTCGGGGTCGTTTGGCATCAGCGTGGGAGCGAGCAGCCGGGATAT CAGGCTGACGAGCACTCTGTCGGTAGCG (SEQ ID NO: 2) [0091] The nucleic acid sequence encoding the polypeptide with cellobiase activity described herein can be codon-optimized for increased expression in host cells. Codon optimization includes changing the nucleic acid sequence to take into considération factors including codon usage bias, cryptic splicing sites, mRNA secondary structure, prématuré polyA sites, interaction of codon and anti-codon, and RNA instability motifs, to increase expression of the encoded polypeptide in the host. Various algorithms and commercial services for codonoptimization are known and available in the art.
[0092] The codon-optimized nucleic acid sequence that encodes Cel3a is provided below:
ATGCGTTATCGTACAGCCGCAGCCCTGGCACTGGCCACAGGTCCGTTCGCACGTGCCGATAGTCACAGTACCAGCG GTGCCAGCGCAGAAGCCGTGGTTCCGCCGGCAGGCACACCGTGGGGCACAGCCTATGATAAAGCCAAAGCCGCCCT GGCCAAGCTGAATCTGCAGGATAAAGTGGGCATCGTGAGTGGCGTGGGCTGGAACGGTGGTCCGTGCGTTGGCAAC ACCAGCCCGGCAAGCAAGATCAGCTATCCGAGCTTATGCCTGCAGGATGGTCCGCTGGGCGTGCGCTATAGCACCG GTAGTACCGCCTTTACACCTGGTGTGCAGGCCGCCAGTACCTGGGACGTTAACCTGATCCGCGAACGTGGCCAATT TATCGGCGAAGAAGTTAAAGCCAGCGGCATTCATGTTATTCTGGGTCCGGTGGCCGGTCCTCTGGGTAAAACCCCG CAGGGCGGCCGTAATTGGGAAGGCTTCGGCGTTGATCCGTATTTAACCGGCATCGCAATGGGCCAGACCATTAATG GCATCCAGAGCGTGGGTGTTCAAGCCACCGCCAAACACTACATATTAAACGAACAGGAACTGAATCGTGAAACCAT CAGCAGCAATCCGGATGATCGCACCCTGCATGAGCTGTATACATGGCCTTTTGCCGACGCAGTTCAGGCCAACGTG GCAAGTGTGATGTGTAGCTATAACAAGGTGAACACCACCTGGGCCTGCGAAGACCAGTACACCCTGCAGACCGTTT TAAAAGACCAACTGGGCTTCCCTGGTTACGTGATGACAGATTGGAATGCCCAGCACACAACCGTTCAGAGCGCAAA CAGTGGCCTGGATATGAGCATGCCGGGCACCGACTTCAACGGCAATAATCGTCTGTGGGGTCCGGCACTGACCAAT GCCGTTAACAGCAACCAGGTGCCGACCAGTCGTGTGGACGATATGGTTACCCGTATTCTGGCCGCCTGGTACCTGA CAGGTCAAGACCAGGCCGGCTACCCGAGCTTCAACATCAGCCGCAACGTGCAGGGTAATCACAAGACCAACGTTCG CGCAATCGCACGCGATGGTATCGTGCTGTTAAAGAACGATGCCAACATTCTGCCGCTGAAAAAACCGGCCAGCATC GCCGTTGTTGGTAGCGCAGCCATCATTGGCAACCACGCCCGTAACAGTCCGAGCTGCAATGATAAAGGCTGTGACG ACGGTGCCCTGGGCATGGGTTGGGGTAGTGGTGCCGTGAACTACCCGTATTTCGTGGCCCCGTACGACGCCATTAA CACCCGTGCAAGTAGCCAGGGTACCCAGGTTACCCTGAGCAACACCGACAACACAAGCAGCGGTGCCAGTGCAGCA CGTGGTAAGGATGTGGCCATCGTGTTCATCACCGCCGACAGCGGCGAAGGCTACATTACCGTGGAGGGTAATGCCG GTGATCGCAATAATCTGGACCCGTGGCATAACGGCAACGCCCTGGTTCAGGCAGTGGCAGGCGCAAATAGCAACGT GATCGTTGTGGTGCATAGCGTGGGTGCCATCATTCTGGAGCAGATCCTGGCCCTGCCGCAAGTTAAGGCAGTTGTG TGGGCAGGTCTGCCGAGCCAAGAAAGTGGCAATGCCCTGGTGGACGTTCTGTGGGGCGATGTTAGTCCGAGCGGCA AGCTGGTGTATACAATCGCCAAGAGCCCGAACGACTATAACACCCGCATCGTTAGCGGCGGCAGTGATAGCTTCAG CGAGGGCCTGTTTATCGACTACAAGCATTTCGATGATGCCAATATTACCCCGCGCTACGAATTTGGTTATGGCCTG AGCTATACCAAGTTCAACTACAGCCGCCTGAGCGTTTTAAGTACCGCCAAGAGTGGTCCGGCAACAGGTGCCGTGG TTCCTGGTGGTCCGAGTGATCTGTTTCAGAATGTGGCCACCGTGACCGTGGATATCGCCAACAGTGGTCAGGTTAC CGGCGCCGAAGTGGCACAGCTGTACATCACCTATCCGAGCAGTGCACCGCGCACCCCGCCGAAACAGCTGCGTGGC TTCGCCAAATTAAACCTGACCCCGGGCCAGAGCGGTACAGCAACCTTCAATATTCGCCGCCGTGATCTGAGCTATT
GGGACACCGCCAGCCAAAAATGGGTGGTGCCGAGCGGCAGCTTTGGCATTAGTGTGGGTGCAAGTAGCCGCGACAT TCGCTTAACAAGCACCCTGAGTGTTGCC (SEQ ID NO: 3) [0093] In an embodiment, the nucleic acid sequence encoding a Cel3a enzyme or functional variant thereof comprises at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identity to SEQ ID NO: 2. In an embodiment, the nucleic acid sequence encoding a Cel3a enzyme or functional variant thereof comprises at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identity to SEQ ID NO:2 or SEQ ID NO: 3.
[0094] Provided herein is a nucleic acid sequence encoding an aglycosylated polypeptide, e.g., Cel3a polypeptide, as described above, in which one or more glycoslyation sites présent in the polypeptide has been mutated such that a glycan can no longer be attached or linked to the glycosylation site. In another embodiment, the nucleic acid sequence described herein encoding an aglycosylated polypeptide, e.g., a Cel3a polypeptide, as described above, in which one or more mutations proximal to one or more glycosylation sites présent in the polypeptide has been mutated such that a glycan can no longer be attached or linked to the glycosylation site, as previously described. In an embodiment, the nucleic acid sequence encodes a polypeptide comprising one or more mutations at one or more of the following glycosylation sites of Cel3a, or SEQ ID NO: 1 : the threonine at amino acid position 78, the threonine at amino acid position 241, the serine at amino acid position 343, the serine at amino acid position 450, the threonine at amino acid position 599, the serine at amino acid position 616, the threonine at amino acid position 691, the serine at amino acid position 21, the threonine at amino acid position 24, the serine at amino acid position 25, the serine at amino acid position 28, the threonine at amino acid position 38, the threonine at amino acid position 42, the threonine at amino acid position 303, the serine at amino acid position at 398, the at serine amino acid position 435, the serine at amino acid position 436, the threonine at amino acid position 439, threonine at amino acid position 442, the threonine at amino acid position 446, the serine at amino acid position 451, the serine at amino acid position 619, the serine at amino acid position 622, the threonine at amino acid position 623, the serine at amino acid position
626, or the threonine at amino acid position 630, or any combination thereof. In embodiments, the glycosylation site is mutated from a serine or threonine to an alanine. For example, the nucleic acid sequence of the invention encodes an aglycosylated polypeptide comprising one or more of the following mutations: T78A, T241A, S343A, S450A, T599A, S616A, T691A, S21A, T24A, S25A, S28A, T38A, T42A, T303A, T398A, S435A, S436A, T439A, T442A, T446A, S451 A, S619A, S622A, T623A, S626A, or T630A, or any combination thereof. The ordinarily skilled artisan could readily modify the nucleic acid sequence of wild-type Cel3a (SEQ ID NO: 2) to encode a polypeptide with one or more glycosylation site mutation by using methods known in the art, such as site-directed mutagenesis.
[0095] The techniques used to isolate or clone a nucleic acid sequence encoding a polypeptide are known in the art and include isolation from genomic DNA, préparation from cDNA, or a combination thereof. The cloning of the nucleic acid sequences of the présent invention from such genomic DNA can be effected, e.g., by using the well known polymerase chain reaction (PCR) or antibody screening of expression libraries to detect cloned DNA fragments with shared structural features. See, e.g., Innis et al., 1990, PCR: A Guide to Methods and Application, Academie Press, New York. Other amplification procedures such as ligase chain reaction (LCR), ligated activated transcription (LAT) and nucléotide sequencebased amplification (NASBA) may be used. The nucleic acid sequence may be cloned from a strain of Trichoderma reesei, e.g., wild-type T. reesei, or T. reesei RUTC30, or another or related organism and thus, for example, may be an allelic or species variant of the polypeptide encoding région of the nucleic acid sequence.
[0096] The nucleic acid sequence may be obtained by standard cloning procedures used in genetic engineering to relocate the nucleic acid sequence from its natural location to a different site where it will be reproduced. The cloning procedures may involve excision and isolation of a desired fragment comprising the nucléotide sequence encoding the polypeptide, insertion of the fragment into a vector molécule, and incorporation of the recombinant vector into a host cell where multiple copies or clones of the nucléotide sequence will be replicated. The nucléotide sequence may be of genomic, cDNA, RNA, semisynthetic, synthetic origin, or any combinations thereof.
Expression Vectors and Host Cells [0097] The présent invention also provides nucleic acid constructs comprising a nucleic acid sequence encoding the polypeptide having cellobiase activity described herein operably linked to one or more control sequences that direct the expression, sécrétion, and/or isolation of the expressed polypeptide.
[0098] As used herein, an “expression vector” is a nucleic acid construct for introducing and expressing a nucleic acid sequence of interest into a host cell. In some embodiments, the vector comprises a suitable control sequence operably linked to and capable of effecting the expression of the polypeptide encoded in the nucleic acid sequence of interest. The control sequence may be an appropriate promoter sequence, recognized by a host cell for expression of the nucleic acid sequence. In an embodiment, the nucleic acid sequence of interest is a nucleic acid sequence encoding a polypeptide having cellobiase activity as described herein.
[0099] A promoter in the expression vector of the invention can include promoters obtained from genes encoding extracellular or intracellular polypeptides either homologous or heterologous to the host cell, mutant promoters, truncated promoters, and hybrid promoters.
[00100] Examples of suitable promoters for directing transcription of the nucleic acid constructs of the présent invention in a bacterial host cell are the promoters obtained from the E. coli lac operon, E. coli tac promoter (hybrid promoter, DeBoer et al, PNAS, 1983, 80:2125), E. coli rec A, E. coli araBAD, E. coli tetA, and prokaryotic beta-lactamase. Other examples of suitable promoters include viral promoters, such as promoters from bactériophages, including a T7 promoter, a T5 promoter, a T3 promoter, an Ml3 promoter, and a SP6 promoter. In some embodiments, more than one promoter controls the expression of the nucleic acid sequence of interest, e.g., an E. coli lac promoter and a T7 promoter. Further promoters that may be suitable for use in the présent invention are described in “Useful proteins from recombinant bacteria” in Scientific American, 1980, 242:74-94, and Sambrook et al., Molecular Cloning: A Laboratory Manual, 1989. In some preferred embodiments, the promoter is inducible, where the addition of a molécule stimulâtes the transcription and expression of the downstream reading frame.
[00101] Examples of suitable promoters for directing transcription of the nucleic acid constructs of the présent invention in a eukaryotic host cell, e.g., in a fungal or yeast cell are promoters obtained from the genes of Trichoderma Reesei, methanol-inducible alcohol oxidase (AOX promoter), Aspergillus nididans tryptophan biosynthesis (trpC promoter), Aspergillus nigervar. awamori flucoamylase (glaA), Saccharomyces cerevisiae galactokinase (GAL1), or Kluyveromyces lactis Plac4-PBI promoter.
[00102] A control sequence présent in the expression vector of the présent invention may also be a signal sequence that codes for an amino acid sequence linked to the amino terminus of a polypeptide and directs the encoded polypeptide into the cell’s secretory pathway, e.g., a sécrétion signal sequence. The signal sequence may be an endogenous signal sequence, e.g., where the signal sequence is présent at the N-terminus of the wild-type polypeptide when endogenously expressed by the organism from which the polypeptide of interest originates from. The signal sequence may be a foreign, or heterologous, signal peptide, in which the signal sequence is from a different organism or a different polypeptide than that of the polypeptide of interest being expressed. Any signal sequence which directs the expressed polypeptide into the secretory pathway of a host cell may be used in the présent invention. Typically, signal sequences are composed ofbetween 6 and 136 basic and/or hycrophobic amino acids.
[00103] Examples of signal sequences suitable for the présent invention include the signal sequence from Saccharomyces cerevisiae alpha-factor.
[00104] Fusion tags may also be used in the expression vector of the présent invention to facilitate the détection and purification of the expressed polypeptide. Examples of suitable fusion tags include His-tag (e.g., 3xHis, 6x His (SEQ ID NO: 6), or 8xHis (SEQ ID NO: 7)), GST-tag, HSV-tag, S-tag, T7 tag. Other suitable fusion tags include myc tag, hemagglutinin (HA) tag, and fluorescent protein tags (e.g., green fluorescent protein). The fusion tag is typically operably linked to the N or C terminus of the polypeptide to be expressed. In some embodiments, there may be a linker région between the fusion tag sequence and the N-terminus or C-terminus of the polypeptide to be expressed. In an embodiment, the linker région comprises a sequence between 1 to 20 amino acids, that does not affect or alter the expression or function of the expressed polypeptide.
[00105] Utilization of the fusion tags described herein allows détection of the expressed protein, e.g., by western blot by using antibodies that specifically recognize the tag. The tags also allows for purification of the expressed polypeptide from the host cell, e.g., by affinity chromatography. For example, an expressed polypeptide fused to a His-tag can be purified by using nickel affinity chromatography. The His tag has affinity for the Nickel ions, and a nickel column will retain the his-tagged polypeptide, while allowing ail other proteins and cell débris to flow through the column. Elution of the His-tagged polypeptide using an elution buffer, e.g., containing imidazole, releases the His-tagged polypeptide from the column, resulting in substantially purified polypeptide.
[00106] The expression vector of the invention may further comprise a selectable marker gene to enable isolation of a genetically modified microbe transformed with the construct as is commonly known to those of skill in the art. The selectable marker gene may confer résistance to an antibiotic or the ability to grow on medium lacking a spécifie nutrient to the host organism that otherwise could not grow under these conditions. The présent invention is not limited by the choice of selectable marker gene, and one of skill in the art may readily détermine an appropriate gene. For example, the selectable marker gene may confer résistance to ampicillin, chloramphenicol, tétracycline, kanamycin, hygromycin, phleomycin, geneticin, or G418, or may complément a deficiency of the host microbe in one of the trp, arg, leu, pyr4, pyr, ura3, ura5, his, or ade genes or may confer the ability to grow on acetamide as a sole nitrogen source.
[00107] The expression vector of the invention may further comprise other nucleic acid sequences, e.g., additional control sequences, as is commonly known to those of skill in the art, for example, transcriptional terminators, synthetic sequences to link the various other nucleic acid sequences together, origins of réplication, ribosome binding sites, a multiple cloning site (or polylinker site), a polyadenylation signal and the like. The ribosomal binding site suitable for the expression vector dépends on the host cell used, for example, for expression in a prokaryotic host cell, a prokaryotic RBS, e.g., a T7 phage RBS can be used. A multiple cloning site, or polylinker site, contains one or more restriction enzyme sites that are preferably not présent in the remaining sequence of the expression vector. The restriction enzyme sites are utilized for the insertion of a nucleic acid sequence encoding a polypeptide having cellobiase activity or other desired control sequences. The practice of the présent invention is not limited by the presence of any one or more of these other nucleic acid sequences, e.g., other control sequences.
[00108] Examples of suitable expression vectors for use in the présent invention include vectors for expression in prokaryotes, e.g., bacterial expression vectors. A bacterial expression vector suitable for use in the présent invention in the pET vector (Novagen), which contains the following: a viral T7 promoter which is spécifie to only T7 RNA polymerase (not bacterial RNA polymerase) and also does not occur anywhere in the prokaryotic genome, a lac operator comprising a lac promoter and coding sequence for the lac repressor protein (lacl gene), a polylinker, an fl origin of réplication (so that a single-stranded plasmid can be produced when co-infected with Ml3 helper phage), an ampicillin résistance gene, and a ColEl origin of réplication (Blaber, 1998). Both the promoter and the lac operator are located 5’, or upstream, of the polylinker in which the nucleic acid sequence encoding a polypeptide described herein is inserted. The lac operator confers inducible expression of the nucleic acid sequence encoding a polypeptide having cellobiase activity. Addition of IPTG (Isopropyl β-D-lthiogalactopyranoside), a lactose métabolite, triggers transcription of the lac operon and induces protein expression of the nucleic acid sequence under control of the lac operator. Use of this system requires the addition of T7 RNA polymerase to the host cell for vector expression. The T7 RNA polymerase can be introduced via a second expression vector, or a host cell strain that is genetically engineered to express T7 RNA polymerase can be used.
[00109] An exemplary expression vector for use with the invention is a pET vector, commercially available from Novagen. The pET expression system is described in U.S. Patent Nos. 4,952,496; 5,693,489; and 5,869,320. In one embodiment, the pET vector is a pETDUET vector, e.g., pET-Duetl, commercially available from Novagen. Other vectors suitable for use in the présent invention include vectors containing His-tag sequences, such as those described in U.S. Patent Nos. 5,310,663 and 5,284,933; and European Patent No. 282042.
[00110] The présent invention also relates to a host cell comprising the nucleic acid sequence or expression vector of the invention, which are used in the recombinant production of the polypeptides having cellobiase activity.
[00111] An expression vector comprising a nucleic acid sequence of the présent invention is sintroduced into a host cell so that the vector is maintained (e.g., by chromosomal intégration or as a self-replicating extra-chromosomal vector) such that the polypeptide is expressed.
[00112] The host cell may be a prokaryote or a eukaryote. The host cell may be a bacteria, such as an E. coli strain, e.g., K12 strains NovaBlue, NovaBlue TIR, JM109, and DH5a. Preferably, the bacteria cell has the capability to fold, or partially fold, exogenously expressed proteins, such as E. coli Origami strains, e.g., Origami B, Origami B (DE3), Origami 2, and Origami 2(DE3) strains. In some embodiments, it may be preferred to use a host cell that is déficient for glycosylation, or has an impaired glycosylation pathway such that proteins expressed by the host cell are not signifîcantly glycosylated.
[00113] The host cell may be a yeast or a filamentous fungus, particularly those classified as Ascomycota. Généra of yeasts useful as host microbes for the expression of modified TrCel3 A beta-glucosidases of the présent invention include Saccharomyces, Pichia, Hansenula, Kluyveromyces, Yarrowia, and Arxula. Généra of fungi useful as microbes for the expression of the polypeptides of the présent invention include Trichoderma, Hypocrea, Aspergillus, Fusarium, Humicola, Neurospora, Chrysosporium, Myceliophthora, Thielavia, Sporotrichum and Pénicillium. For example, the host cell may be Pichiapastoris. For example, the host cell may be an industrial strain of Trichoderma reesei,or a mutant thereof, e.g., T. reesei RUTC30. Typically, the host cell is one which does not express a parental cellobiase or Cel3a.
[00114] The sélection of the particular host cell, e.g., bacterial cell or a fungal cell, dépends on the expression vector (e.g., the control sequences) and/or the method utilized for producing an aglycosylated polypeptide of the invention, as described in further detail below.
[00115] The expression vector of the invention may be introduced into the host cell by any number of methods known by one skilled in the art of microbial transformation, including but not limited to, transformation, treatment of cells with CaCb, electroporation, biolistic bombardment, lipofection, and PEG-mediated fusion of protoplasts (e.g. White et al., WO 2005/093072, which is incorporated herein by reference). After selecting the recombinant host cells containing the expression vector (e.g., by sélection utilizing the selectable marker of the expression vector), the recombinant host cells may be cultured under conditions that induce the expression of the polypeptide having cellobiase activity of the invention.
Methods for Producing Aglycosylated Polypeptides [00116] The présent invention further provides methods for producing an aglycosylated polypeptide having cellobiase activity, as described herein. The method comprises culturing the recombinant host cell expressing the polypeptide of the présent invention under conditions suitable for the expression of the polypeptide. The method may also comprise recovering the aglycosylated polypeptide from the recombinant host cell.
[00117] Methods for recovering polypeptides expressed from prokaryote and eukaryote cells are known in the art. In embodiments, the method for recovering the polypeptide comprises lysing the cells, e.g., by mechanical, chemical, or enzymatic means. For example, cells can be physically broken apart, e.g., by sonication, milling (shaking with beads), or shear forces. Cell membranes can be treated such that they are permeabilized such that the contents of the cells are released, such as treatment with détergents, e.g., Triton, NP-40, or SDS. Cells with cell walls, e.g., bacterial cells, can be permeabilized using enzymes, such as a lysozyme or lysonase. Any combination of the mechanical, chemical, and enzymatic techniques described above are also suitable for recovering expressed polypeptides of interest from the host cell in the context of this invention. For example, when expressing an aglycosylated polypeptide having cellobiase activity described herein in a bacterial cell, e.g., an E. coli cell, the cell is typically lysed by centrifuging and pelleting the cell culture, and resuspending in a lysis buffer containing lysozyme. To ensure complété lysis, the resuspended cells are subjected to one of the following methods: sonication, milling, or homogenization.
[00118] In one embodiment, the expressed aglycosylated polypeptides having cellobiase activity described herein are not lysed before addition to the biomass for the saccharification reaction. In some instances, the methods for lysing host cells can resuit in protein dénaturation and/or decreased enzyme activity, which leads to increased cost of downstream processing. Thus, the présent invention also provides methods for directly adding the host cells expressing an aglycosylated polypeptide having cellobiase activity described herein to the biomass prior to the saccharification step.
[00119] In an embodiment, the host cell, e.g., the E. coli cell, expressing the aglycosylated polypeptide having cellobiase activity described herein is isolated, e.g., by centrifugation, and added to the saccharification reaction, e.g., the saccharification reactor containing biomass. The cells are lysed by a combination of shear from the biomass, the impellers, and the increased température. In an embodiment, the culture of host cell, e.g., the E. coli cell, expressing the aglycosylated polypeptide having cellobiase activity described herein is added directly from the fermentation tank directly to the saccharification tank and eliminating the need to pellet cells by centrifugation.
Using a host cell déficient for glycosylation [00120] In embodiments, the expression vector comprises a nucleic acid sequence encoding a polypeptide having cellobiase activity described herein operably linked to a fusion tag is introduced to and expressed in a cell that does not significantly glycosylate proteins expressed in the cell, e.g., a bacterial host cell. The recombinant host cell is cultured under conditions for expression of the polypeptide, resulting in the production of an aglycosylated polypeptide having cellobiase activity. The aglycosylated polypeptide can be purified or isolated from the host cell using affinity chromatography methods for the fusion tag as described herein.
[00121] For example, in this embodiment, the expression vector contains a lac operator and a T7 promoter upstream of the nucleic acid sequence encoding a polypeptide having cellobiase activity, and the host cell has the capacity to express T7 RNA polymerase. Expression of the polypeptide having cellobiase activity is induced by addition of IPTG. Preferably, the host cell is an E. coli cell, preferably an E. coli Origami cell. In this embodiment, the fusion tag is a His-tag, and the purification of the expressed aglycosylated polypeptide comprises nickel affinity chromatography.
Using a host cell with the capacity for glycosylation [00122] In another embodiment, an expression vector comprising a nucleic acid sequence encoding a polypeptide comprising one or more glycosylation site mutations such that the polypeptide is not glyscosylated, as described herein, is expressed in a host cell, wherein the host cell is capable of glycosylating proteins expressed within the cell, e.g., a yeast or fungal host cell. Alternatively, the host cell is not capable of glycosylating proteins expressed within the cell, e.g., a bacterial host cell. In this embodiment, the polypeptide is operably linked to a fusion tag. The aglycosylated polypeptide can be purified or isolated from the bacterial host cell using affinity chromatography methods for the fusion tag as described herein.
[00123] In yet another embodiment, an expression vector comprising a nucleic acid sequence encoding a polypeptide having cellobiase activity described herein is expressed in a host cell, wherein the host cell is capable of glycosylating proteins expressed within the cell. The cells are cultured under conditions sufficient for expression and glycosylation of the polypeptide. In this embodiment, the polypeptide is operably linked to a fusion tag. The glycosylated polypeptide can be purified or isolated from the bacterial host cell using affinity chromatography methods for the fusion tag as described herein. After purification from the host cells and other endogenous host enzymes, e.g., glycosylation enzymes, the glycans of the isolated glycosylated polypeptide can be removed by incubation with deglycosylating enzymes. Deglycosylating enzymes include PNGase F, PNGase A, EndoH (endoglycosidase H), EndoS (endoglycosidase S), EndoD (endoglycosidase D), EndoF (endoglycosidase F), EndoFl (endoglycosidase Fl), or EndoF2 (endoglycosidase F2). Protein deglycosylation mixes containing enzymes sufficient for the complété removal of glycans are commercially available, e.g., from New England Biolabs. The isolated polypeptide is incubated with one or more deglycosylating enzyme under conditions sufficient for the removal of ail of the glycans from the polypeptide. Other methods are known in the art for removing glycans from a polypeptide, e.g., β-elimination with mild alkali or mild hydrazinolysis. Assessment of the glycosylation state of the polypeptide can be determined using methods for staining and visualization of glycans known in the art, or mass spectrometry.
[00124] In yet another embodiment, an expression vector comprising a nucleic acid sequence encoding a polypeptide having cellobiase activity described herein is expressed in a host cell, wherein the host cell is capable of glycosylating proteins expressed within the cell. The cells are cultured under conditions sufficient for expression of the polypeptide, but in the presence of glycosylation inhibitors. The glycosylation inhibitors are présent at a concentration and for a sufficient time such that the expressed polypeptides are not glycosylated. In this embodiment, the polypeptide is operably linked to a fusion tag. The resulting aglycosylated polypeptide can be purified or isolated from the bacterial host cell using affinity chromatography methods for the fusion tag as described herein.
[00125] Examples of suitable glycosylation inhibitors for use in this embodiment include tunicamycin, Benzyl-GalNAc (Benzyl 2-acetamido-2-deoxy-a-D-galactopyranoside), 2Fluoro-2-deoxy-D-glucose, and 5’CDP (5’ cytidylate diphosphate). In some embodiments, a combination of glycosylation inhibitors is used. Preferably, the concentration of glycosylation inhibitors used in this embodiment is suffïcient to inhibit glycosylation of the polypeptide, but do not cause cytotoxicity or inhibition of protein expression of the host cell.
Methods of Producing Products Using Aglycosylated Polypeptides [00126] The présent invention provides methods and compositions for converting or processing a biomass into products, using an aglycosylated polypeptide having cellobiase activity, as described herein. Methods for converting a biomass to products, such as sugar products, are known in the art, for example, as described in US Patent Application 2014/0011258, the contents of which are incorporated by reference in its entirety. Briefly, a biomass is optimally pretreated, e.g., to reduce the recalcitrance, and saccharifîed by a saccharification process that involves incubating the treated biomass with biomass-degrading, or cellulolytic, enzymes to produce sugars (e.g., glucose and/or xylose). The sugar products can then be fùrther processed to produce a final product, e.g., by fermentation or distillation. Final products include alcohols (e.g., éthanol, isobutanol, or n-butanol), sugar alcohols (e.g., erythritol, xylitol, or sorbitol), or organic acids (e.g., lactic acid, pyurvic acid, succinic acid).
[00127] Using the processes described herein, the biomass material can be converted to one or more products, such as energy, fuels, foods and materials. Spécifie examples of products include, but are not limited to, hydrogen, sugars (e.g., glucose, xylose, arabinose, mannose, galactose, fructose, cellobiose, disaccharides, oligosaccharides and polysaccharides), alcohols (e.g., monohydric alcohols or dihydric alcohols, such as éthanol, n-propanol, isobutanol, secbutanol, tert-butanol or n-butanol), hydrated or hydrous alcohols (e.g., containing greater than 10%, 20%, 30% or even greater than 40% water), biodiesel, organic acids, hydrocarbons (e.g., methane, ethane, propane, isobutene, pentane, n-hexane, biodiesel, bio-gasoline and mixtures thereof), co-products (e.g., proteins, such as cellulolytic proteins (enzymes) oi’ single cell proteins), and mixtures of any of these in any combination or relative concentration, and optionally in combination with any additives (e.g., fuel additives). Other examples include carboxylic acids, salts of a carboxylic acid, a mixture of carboxylic acids and salts of carboxylic acids and esters of carboxylic acids (e.g., methyl, ethyl and n-propyl esters), ketones (e.g., acetone), aldéhydes (e.g., acetaldehyde), alpha and beta unsaturated acids (e.g., acrylic acid) and olefins (e.g., ethylene). Other alcohols and alcohol dérivatives include propanol, propylene glycol, 1,4-butanediol, 1,3-propanediol, sugar alcohols and polyols (e.g., glycol, glycerol, erythritol, threitol, arabitol, xylitol, ribitol, mannitol, sorbitol, galactitol, iditol, inositol, volemitol, isomalt, maltitol, lactitol, maltotriitol, maltotetraitol, and polyglycitol and other polyols), and methyl or ethyl esters of any of these alcohols. Other products include methyl acrylate, methylmethacrylate, lactic acid, citric acid, formic acid, acetic acid, propionic acid, butyric acid, succinic acid, valeric acid, caproic acid, 3-hydroxypiOpionic acid, palmitic acid, stearic acid, oxalic acid, malonic acid, glutaric acid, oleic acid, linoleic acid, glycolic acid, gamma-hydroxybutyric acid, and mixtures thereof, salts of any of these acids, mixtures of any of the acids and their respective salts.
Biomass [00128] The biomass to be processed using the methods described herein is a starchy material and/or a cellulosic material comprising cellulose, e.g., a lignocellulosic material. The biomass may also comprise hemicellulose and/or lignin. The biomass can comprise one or more of an agricultural product or waste, a paper product or waste, a forestry product, or a general waste, or any combination thereof. An agricultural product or waste comprises material that can be cultivated, harvested, or processed for use or consumption, e.g., by humans or animais, or any intermediate, byproduct, or waste that is generated from the cultivation, harvest, or processing methods. Agricultural products or waste include, but are not limited to, sugar cane, jute, hemp, flax, bamboo, sisal, alfalfa, hay, arracacha, buckwheat, banana, barley, cassava, kudzu, oca, sago, sorghum, potato, sweet potato, taro, yams, beans, favas, lentils, peas, grasses, switchgrass, miscanthus, cord grass, reed canary grass, grain residues, canola straw, wheat straw, barley straw, oat straw, rice straw, com cobs, com stover, com fïber, coconut haïr, beet pulp, bagasse, soybean stover, grain residues, rice hulls, oat hulls, wheat chaff, barley hulls, or beeswing, or a combination thereof. A paper product or waste comprises material that is used to make a paper product, any paper product, or any intermediate, byproduct or waste that is generated from making or breaking down the paper product. Paper products or waste include, but are not limited to, paper, pigmented papers, loaded papers, coated papers, corrugated paper, filled papers, magazines, printed matter, printer paper, polycoated paper, cardstock, cardboard, paperboard, or paper pulp, or a combination thereof. A forestry product or waste comprises material that is produced by cultivating, harvesting, or processing of wood, or any intermediate, byproduct, or waste that is generated from the cultivation, harvest, or processing of the wood. Forestry products or waste include, but are not limited to, aspen wood, wood from any genus or species of tree, particle board, wood chips, or sawdust, or a combination thereof. A general waste includes, but is not limited to, manure, sewage, or offal, or a combination thereof.
[00129] In an embodiment, the biomass comprises agriculture waste, such as corn cobs, e.g., corn stover. In another embodiment, the biomass comprises grasses.
[00130] In one embodiment, the biomass is treated prior to contact with the compositions described herein. For example, the biomass is treated to reduce the recalcitrance of the biomass, to reduce its bulk density, and/or increase its surface area. Suitable biomass treatment process may include, but are not limited to: bombardment with électrons, sonication, oxidation, pyrolysis, steam explosion, chemical treatment, mechanical treatment, and freeze grinding. Preferably, the treatment method is bombardment with électrons.
[00131] In some embodiments, électron bombardment is performed until the biomass receives a total dose of at least 0.5 Mrad, e.g. at least 5, 10, 20, 30, or at least 40 Mrad. In some embodiments, the treatment is performed until the biomass receives a dose a of from about 0.5 Mrad to about 150 Mrad, about 1 Mrad to about 100 Mrad, about 5 Mrad to about 75 Mrad, about 2 Mrad to about 75 Mrad, about 10 Mrad to about 50 Mrad, e.g., about 5 Mrad to about 50 Mrad, about 20 Mrad to about 40 Mrad, about 10 Mrad to about 35 Mrad, or from about 20 Mrad to about 30 Mrad. In some implémentations, a total dose of 25 to 35 Mrad is preferred, applied ideally over a couple of seconds, e.g., at 5 Mrad/pass with each pass being applied for about one second. Applying a dose of greater than 7 to 9 Mrad/pass can in some cases cause thermal dégradation of the feedstock material.
[00132] The biomass material (e.g., plant biomass, animal biomass, paper, and municipal waste biomass) can be used as feedstock to produce useful intermediates and products such as organic acids, salts of organic acids, anhydrides, esters of organic acids and fuels, e.g., fuels for internai combustion engines or feedstocks for fuel cells. Systems and processes are described herein that can use as feedstock cellulosic and/or lignocellulosic materials that are readily available, but often can be diffîcult to process, e.g., municipal waste streams and waste paper streams, such as streams that include newspaper, kraft paper, corrugated paper or mixtures of these.
[00133] In order to couvert the biomass to a form that can be readily processed, the glucanor xylan-containing cellulose in the feedstock can be hydrolyzed to low molecular weight carbohydrates, such as sugars, by a saccharifying agent, e.g., an enzyme or acid, a process referred to as saccharification. The low molecular weight carbohydrates can then be used, for example, in an existing manufacturing plant, such as a single cell protein plant, an enzyme manufacturing plant, or a fuel plant, e.g., an éthanol manufacturing facility.
[00134] The biomass can be hydrolyzed using an enzyme, e.g., a biomass degrading enzyme, by combining the materials and the enzyme in a solvent, e.g., in an aqueous solution. The enzymes can be made/induced according to the methods described herein.
[00135] Specifically, the biomass degrading enzyme can be supplied by organisms, e.g., a microorganism, that are capable of breaking down biomass (such as the cellulose and/or the lignin portions of the biomass), or that contain or manufacture various cellulolytic enzymes (cellulases), ligninases or various small molécule biomass-degrading métabolites. These enzymes may be a complex of enzymes that act synergistically to dégradé crystalline cellulose or the lignin portions of biomass. Examples of cellulolytic enzymes include: endoglucanases, cellobiohydrolases, and cellobiases (beta-glucosidases).
[00136] During saccharification a cellulosic substrate can be initially hydrolyzed by endoglucanases at random locations producing oligomeric intermediates. These intermediates are then substrates for exo-splitting glucanases such as cellobiohydrolase to produce cellobiose from the ends of the cellulose polymer. Cellobiose is a water-soluble 1,4-linked dimer of glucose. Finally, cellobiase cleaves cellobiose to yield glucose. The efficiency (e.g., time to hydrolyze and/or completeness of hydrolysis) of this process dépends on the recalcitrance of the cellulosic material.
Saccharification [00137] The reduced-recalcitrance biomass is treated with the biomass-degrading enzymes discussed above, generally by combining the reduced-recalcitrance biomass and the biomassdegrading enzymes in a fluid medium, e.g., an aqueous solution. In some cases, the feedstock is boiled, steeped, or cooked in hot water prior to saccharification, as described in U.S. Pat. App.
Pub. 2012/0100577 Al by Medoff and Masterman, published on Apr. 26, 2012, the entire contents of which are incorporated herein.
[00138] Provided herein are mixtures of enzymes that are capable of degrading the biomass, e.g., an enzyme mixture of biomass-degrading enzymes, for use in the saccharification process described herein.
[00139] The saccharification process can be partially or completely performed in a tank (e.g., a tank having a volume of at least 4000, 40,000, or 500,000 L) in a manufacturing plant, and/or can be partially or completely performed in transit, e.g., in a rail car, tanker truck, or in a supertanker or the hold of a ship. The time required for complété saccharification will dépend on the process conditions and the biomass material and enzyme used. If saccharification is performed in a manufacturing plant under controlled conditions, the cellulose may be substantially entirely converted to sugar, e.g., glucose in about 12-96 hours. If saccharification is performed partially or completely in transit, saccharification may take longer.
[00140] In a preferred embodiment, the saccharification reaction occurs at a pH optimal for the enzymatic reactions to occur, e.g., at the pH optimal for the activity of the biomassdegrading enzymes. Preferably, the pH of the saccharification reaction is at pH 4-4.5. In a preferred embodiment, the saccharification reaction occurs at a température optimal for the enzymatic reactions to occur, e.g., at the température optimal for the activity of the biomassdegrading enzymes. Preferably, the température of the saccharification reaction is at 42°C — 52°C.
[00141] It is generally preferred that the tank contents be mixed during saccharification, e.g., usingjet mixing as described in International App. No. PCT/US2010/035331, filed May 18, 2010, which was published in English as WO 2010/135380 and designated the United States, the full disclosure of which is incorporated by reference herein.
[00142] The addition of surfactants can enhance the rate of saccharification. Examples of surfactants include non-ionic surfactants, such as a Tween® 20 or Tween® 80 polyethylene glycol surfactants, ionic surfactants, or amphoteric surfactants.
[00143] It is generally preferred that the concentration of the sugar solution resulting from saccharification be relatively high, e.g., greater than 40%, or greater than 50, 60, 70, 80, 90 or even greater than 95% by weight. Water may be removed, e.g., by évaporation, to increase the concentration of the sugar solution. This reduces the volume to be shipped, and also inhibits microbial growth in the solution.
[00144] Alternatively, sugar solutions of lower concentrations may be used, in which case it may be désirable to add an antimicrobial additive, e.g., a broad spectrum antibiotic, in a low concentration, e.g., 50 to 150 ppm. Other suitable antibiotics include amphotericin B, ampicillin, chloramphenicol, ciprofloxacin, gentamicin, hygromycin B, kanamycin, neomycin, penicillin, puromycin, streptomycin. Antibiotics will inhibit growth of microorganisms during transport and storage, and can be used at appropriate concentrations, e.g., between 15 and 1000 ppm by weight, e.g., between 25 and 500 ppm, or between 50 and 150 ppm. If desired, an antibiotic can be included even if the sugar concentration is relatively high. Alternatively, other additives with anti-microbial of preservative properties may be used. Preferably the antimicrobial additive(s) are food-grade.
[00145] A relatively high concentration solution can be obtained by limiting the amount of water added to the biomass material with the enzyme. The concentration can be controlled, e.g., by controlling how much saccharification takes place. For example, concentration can be increased by adding more biomass material to the solution. In order to keep the sugar that is being produced in solution, a surfactant can be added, e.g., one of those discussed above. Solubility can also be increased by increasing the température of the solution. For example, the solution can be maintained at a température of 40-50° C., 60-80° C., or even higher.
[00146] In the processes described herein, for example after saccharification, sugars (e.g., glucose and xylose) can be isolated. For example, sugars can be isolated by précipitation, crystallization, chromatography (e.g., simulated moving bed chromatography, high pressure chromatography), centrifugation, extraction, any other isolation method known in the art, and combinations thereof.
Enzyme Mixtures for Saccharification [00147] The présent invention provides an enzyme mixture comprising a glycosylated polypeptide comprising at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO: 1, and an aglycosylated polypeptide comprising at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO: 1, wherein both the glycosylated polypeptide and the aglyscosylated polypeptide hâve cellobiase activity. The aglycosylated polypeptide having cellobiase activity is any of the aglycosylated polypeptides described herein, e.g., produced using the methods described herein. The glycosylated polypeptide comprising at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO: 1 can be isolated or obtained from a microorganism that endogenously expresses the polypeptide.
[00148] In an embodiment, the glysocylated polypeptide and the aglycosylated polypeptide are both the Cel3A enzyme from wild-type T. reesei, e.g., comprising SEQ ID NO: 1.
[00149] In embodiments, the enzyme mixture further comprises at least one additional enzyme derived from a microorganism, wherein the additional enzyme has biomass or cellulose-based material-degrading activity, e.g., the additional enzyme is a cellulolytic enzyme, e.g., a cellulase. For example, the additional enzyme is a ligninase, an endoglucanase, a cellobiohydrolase, a xylanase, and a cellobiase. In an embodiment, the mixture further comprises one or more ligninase, one or more endogluconase, one or more cellobiohydrolase, and one or more xylanase. In embodiments, the additional biomass-degrading enzyme is glycosylated. In embodiments, the enzyme mixture further comprises at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, or at least 10 additional biomassdegrading enzymes described herein. For example, the enzyme mixture fùrther comprises at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, at least 15 or all ofthe enzymes listed in Table
1.
[00150] For example, the enzyme mixture further comprises a mixture of additional biomass-degrading enzymes produced by a microorganism, e.g., a fungal cell, such as wildtype T. reesei, or a mutant thereof, e.g., T. Reesei RUTC30. In an embodiment, the additional biomass-degrading enzymes are isolated from the microorganisms. In an embodiment, the mixture comprises one or more of the following biomass-degrading enzymes: B2AF03, CIP1, CIP2, Cella, Cel3a, Cel5a, Cel6a, Cel7a, Cel7b, Cell2a, Cel45a, Cel74a, paMan5a, paMan26a, or Swollenin, or any combination thereof. The additional biomass-degrading enzymes, e.g., listed above, can be endogenously expressed and isolated from the microorganism, e.g., fungal cell, from which the enzyme originates from (listed below in Table 1). Alternatively, the additional biomass-degrading enzymes, e.g., listed above, can be heterologously expressed using similar methods of expression in a host cell described herein, and isolated from the host cells. In an embodiment, the heterologously expressed additional biomass-degrading enzymes are tagged with a His tag at the C or N terminus of the enzyme and are isolated using nickel affïnity chromatography techniques known in the art. For example, the additional biomassdegrading enzymes can be selected from Table 1 below.
[00151] Table 1. Examples of Additional Biomass-Degrading Enzymes
Protein | MW, kDa | no AA’s | th. pi | no. Cvsteines | Orqanism |
B2AF03-C’His | 88.6 | 813 | 6.3 | 10 | Podospora anserina |
CIP1-C’His | 32.5 | 311 | 5.6 | 8 | Trichoderma reesei |
CIP2-C'His | 48.0 | 457 | 7.0 | 12 | Trichoderma reesei |
Cella-C’His | 53.6 | 478 | 5.8 | 5 | Trichoderma reesei |
Cel3a-C’His | 78.0 | 739 | 6.3 | 6 | Trichoderma reesei |
Cel3a-N’His | 78.0 | 739 | 6.3 | 6 | Trichoderma reesei |
Cel5a-N’His | 43.7 | 411 | 5.7 | 12 | Trichoderma reesei |
Cel6a-C’His | 48.8 | 461 | 6.0 | 12 | Trichoderma reesei |
Cel7a-C’His | 53.8 | 511 | 4.8 | 24 | Trichoderma reesei |
Cel7b-C’His | 47.6 | 451 | 5.3 | 22 | Trichoderma reesei |
Cel12a-C’His | 25.1 | 232 | 6.9 | 2 | Trichoderma reesei |
Cel45a-C’His | 24.4 | 239 | 5.4 | 16 | Trichoderma reesei |
Cel74a-C’His | 86.7 | 832 | 5.7 | 4 | Trichoderma reesei |
paMan5a-C’His 41.0 | 370 | 7.0 | 6 | Podospora anserina | |
paMan26a-C’His51.4 | 463 | 5.2 | 1 | Podospora anserina | |
Swollenin-N’His 51.5 | 491 | 5.3 | 28 | Trichoderma reesei |
[00152] The amino acid sequences for the biomass-degrading enzymes listed in Table 1 are provided below.
[00153] B2AF03 (Podospora anserina) [00154] MKSSVFWGASLTSAVVRAIDLPFQFYPNCVDDLLSTNQVCNTTLSPPERAAALVAALTPEE
KLQNIVSKSLGAPRIGLPAYNWWSEALHGVAYAPGTQFWQGDGPFNSSTSFPMPLLMAATFDDELLEKI
AEVIGIEGRAFGNAGFSGLDYWTPNVNPFKDPRWGRGSETPGEDVLLVKRYAAAMIKGLEGPVPEKERR VVATCKHYAANDFEDWNGATRHNFNAKISLQDMAEYYFMPFQQCVRDSRVGSIMCAYNAVNGVPSCASP YLLQTILREHWNWTEHNNYITSDCEAVLDVSLNHKYAATNAEGTAISFEAGMDTSCEYEGSSDIPGAWS QGLLKESTVDRALLRLYEGIVRAGYFDGKQSLYSSLGWADVNKPSAQKLSLQAAVDGTVLLKNDGTLPL SDLLDKSRPKKVAMIGFWSDAKDKLRGGYSGTAAYLHTPAYAASQLGIPFSTASGPILHSDLASNQSWT DNAMAAAKDADYILYFGGIDTSAAGETKDRYDLDWPGAQLSLINLLTTLSKPLIVLQMGDQLDNTPLLS NPKINAILWANWPGQDGGTAVMELVTGLKSPAGRLPVTQYPSNFTELVPMTDMALRPSAGNSQLGRTYR WYKTPVQAFGFGLHYTTFSPKFGKKFPAVIDVDEVLEGCDDKYLDTCPLPDLPVVVENRGNRTSDYVAL AFVSAPGVGPGPWPIKTLGAFTRLRGVKGGEKREGGLKWNLGNLARHDEEGNTVVYPGKYEVSLDEPPK
ARLRFEIVRGGKGKGKVKGKGKAAQKGGVVLDRWPKPPKGQEPPAIERV (SEQ ID NO: 9) [00155] CIP1 (Trichoderma reeseï) [00156] MVRRTALLALGALSTLSMAQISDDFESGWDQTKWPISAPDCNQGGTVSLDTTVAHSGSNSM KVVGGPNGYCGHIFFGTTQVPTGDVYVRAWIRLQTALGSNHVTFIIMPDTAQGGKHLRIGGQSQVLDYN RESDDATLPDLSPNGIASTVTLPTGAFQCFEYHLGTDGTIETWLNGSLIPGMTVGPGVDNPNDAGWTRA SYIPEITGVNFGWEAYSGDVNTVWFDDISIASTRVGCGPGSPGGPGSSTTGRSSTSGPTSTSRPSTTIP PPTSRTTTATGPTQTHYGQCGGIGYSGPTVCASGTTCQVLNPYYSQCL (SEQ ID NO: 10) [00157] CIP2 (Trichoderma reeseï) [00158] MASRFFALLLLAIPIQAQSPVWGQCGGIGWSGPTTCVGGATCVSYNPYYSQCIPSTQASSS IASTTLVTSFTTTTATRTSASTPPASSTGAGGATCSALPGSITLRSNAKLNDLFTMFNGDKVTTKDKFS CRQAEMSELIQRYELGTLPGRPSTLTASFSGNTLTINCGEAGKSISFTVTITYPSSGTAPYPAIIGYGG GSLPAPAGVAMINFNNDNIAAQVNTGSRGQGKFYDLYGSSHSAGAMTAWAWGVSRVIDALELVPGARID TTKIGVTGCSRNGKGAMVAGAFEKRIVLTLPQESGAGGSACWRISDYLKSQGANIQTASEIIGEDPWFS TTFNSYVNQVPVLPFDHHSLAALIAPRGLFVIDNNIDWLGPQSCFGCMTAAHMAWQALGVSDHMGYSQI GAHAHCAFPSNQQSQLTAFVQKFLLGQSTNTAIFQSDFSANQSQWIDWTTPTLS (SEQ ID NO:
11) [00159] Cella (Trichoderma reeseï) [00160] MLPKDFQWGFATAAYQIEGAVDQDGRGPSIWDTFCAQPGKIADGSSGVTACDSYNRTAEDI ALLKSLGAKSYRFSISWSRIIPEGGRGDAVNQAGIDHYVKFVDDLLDAGITPFITLFHWDLPEGLHQRY GGLLNRTEFPLDFENYARVMFRALPKVRNWITFNEPLCSAIPGYGSGTFAPGRQSTSEPWTVGHNILVA HGRAVKAYRDDFKPASGDGQIGIVLNGDFTYPWDAADPADKEAAERRLEFFTAWFADPIYLGDYPASMR KQLGDRLPTFTPEERALVHGSNDFYGMNHYTSNYIRHRSSPASADDTVGNVDVLFTNKQGNCIGPETQS PWLRPCAAGFRDFLVWISKRYGYPPIYVTENGTSIKGESDLPKEKILEDDFRVKYYNEYIRAMVTAVEL DGVNVKGYFAWSLMDNFEWADGYVTRFGVTYVDYENGQKRFPKKSAKSLKPLFDELIAAA (SEQ ID NO: 12) [00161] Cel3a (Trichoderma reeseï) [00162 ] MRYRTAAALALATGPFARADSHSTSGASAEAVVPPAGTPWGTAYDKAKAALAKLNLQDKVG IVSGVGWNGGPCVGNTSPASKISYPSLCLQDGPLGVRYSTGSTAFTPGVQAASTWDVNLIRERGQFIGE EVKASGIHVILGPVAGPLGKTPQGGRNWEGFGVDPYLTGIAMGQTINGIQSVGVQATAKHYILNEQELN RETISSNPDDRTLHELYTWPFADAVQANVASVMCSYNKVNTTWACEDQYTLQTVLKDQLGFPGYVMTDW
NAQHTTVQSANSGLDMSMPGTDFNGNNRLWGPALTNAVNSNQVPTSRVDDMVTRILAAWYLTGQDQAGY PSFNISRNVQGNHKTNVRAIARDGIVLLKNDANILPLKKPASIAVVGSAAIIGNHARNSPSCNDKGCDD GALGMGWGSGAVNYPYFVAPYDAINTRASSQGTQVTLSNTDNTSSGASAARGKDVAIVFITADSGEGYI TVEGNAGDRNNLDPWHNGNALVQAVAGANSNVIVVVHSVGAIILEQILALPQVKAVVWAGLPSQESGNA LVDVLWGDVSPSGKLVYTIAKSPNDYNTRIVSGGSDSFSEGLFIDYKHFDDANITPRYEFGYGLSYTKF NYSRLSVLSTAKSGPATGAVVPGGPSDLFQNVATVTVDIANSGQVTGAEVAQLYITYPSSAPRTPPKQL RGFAKLNLTPGQSGTATFNIRRRDLSYWDTASQKWVVPSGSFGISVGASSRDIRLTSTLSVA (SEQ ID NO: 13) [00163] Cel5a (Trichoderma reesei) [00164] MNKSVAPLLLAASILYGGAAAQQTVWGQCGGIGWSGPTNCAPGSACSTLNPYYAQCIPGAT TITTSTRPPSGPTTTTRATSTSSSTPPTSSGVRFAGVNIAGFDFGCTTDGTCVTSKVYPPLKNFTGSNN YPDGIGQMQHFVNDDGMTIFRLPVGWQYLVNNNLGGNLDSTSISKYDQLVQGCLSLGAYCIVDIHNYAR WNGGIIGQGGPTNAQFTSLWSQLASKYASQSRVWFGIMNEPHDVNINTWAATVQEVVTAIRNAGATSQF ISLPGNDWQSAGAFISDGSAAALSQVTNPDGSTTNLIFDVHKYLDSDNSGTHAECTTNNIDGAFSPLAT WLRQNNRQAILTETGGGNVQSCIQDMCQQIQYLNQNSDVYLGYVGWGAGSFDSTYVLTETPTGSGNSWT DTSLVSSCLARK (SEQ ID NO: 14) [00165] Cel6a (Trichoderma reeseï) [00166] MIVGILTTLATLATLAASVPLEERQACSSVWGQCGGQNWSGPTCCASGSTCVYSNDYYSQC LPGAASSSSSTRAASTTSRVSPTTSRSSSATPPPGSTTTRVPPVGSGTATYSGNPFVGVTPWANAYYAS EVSSLAIPSLTGAMATAAAAVAKVPSFMWLDTLDKTPLMEQTLADIRTANKNGGNYAGQFVVYDLPDRD CAALASNGEYSIADGGVAKYKNYIDTIRQIVVEYSDIRTLLVIEPDSLANLVTNLGTPKCANAQSAYLE CINYAVTQLNLPNVAMYLDAGHAGWLGWPANQDPAAQLFANVYKNASSPRALRGLATNVANYNGWNITS PPSYTQGNAVYNEKLYIHAIGPLLANHGWSNAFFITDQGRSGKQPTGQQQWGDWCNVIGTGFGIRPSAN TGDSLLDSFVWVKPGGECDGTSDSSAPRFDSHCALPDALQPAPQAGAWFQAYFVQLLTNANPSFL (SEQ ID NO: 15) [00167] CeI7a (Trichoderma reesei) [00168] MYRKLAVISAFLATARAQSACTLQSETHPPLTWQKCSSGGTCTQQTGSVVIDANWRWTHAT NSSTNCYDGNTWSSTLCPDNETCAKNCCLDGAAYASTYGVTTSGNSLSIGFVTQSAQKNVGARLYLMAS DTTYQEFTLLGNEFSFDVDVSQLPCGLNGALYFVSMDADGGVSKYPTNTAGAKYGTGYCDSQCPRDLKF INGQANVEGWEPSSNNANTGIGGHGSCCSEMDIWEANSISEALTPHPCTTVGQEICEGDGCGGTYSDNR YGGTCDPDGCDWNPYRLGNTSFYGPGSSFTLDTTKKLTVVTQFETSGAINRYYVQNGVTFQQPNAELGS YSGNELNDDYCTAEEAEFGGSSFSDKGGLTQFKKATSGGMVLVMSLWDDYYANMLWLDSTYPTNETSST PGAVRGSCSTSSGVPAQVESQSPNAKVTFSNIKFGPIGSTGNPSGGNPPGGNPPGTTTTRRPATTTGSS PGPTQSHYGQCGGIGYSGPTVCASGTTCQVLNPYYSQCL (SEQ ID NO: 16) [00169] Cel7b (Trichoderma reesei) [00170] MAPSVTLPLTTAILAIARLVAAQQPGTSTPEVHPKLTTYKCTKSGGCVAQDTSVVLDWNYR
WMHDANYNSCTVNGGVNTTLCPDEATCGKNCFIEGVDYAASGVTTSGSSLTMNQYMPSSSGGYSSVSPR
LYLLDSDGEYVMLKLNGQELSFDVDLSALPCGENGSLYLSQMDENGGANQYNTAGANYGSGYCDAQCPV QTWRNGTLNTSHQGFCCNEMDILEGNSRANALTPHSCTATACDSAGCGFNPYGSGYKSYYGPGDTVDTS KTFTIITQFNTDNGSPSGNLVSITRKYQQNGVDIPSAQPGGDTISSCPSASAYGGLATMGKALSSGMVL VFSIWNDNSQYMNWLDSGNAGPCSSTEGNPSNILANNPNTHVVFSNIRWGDIGSTTNSTAPPPPPASST TFSTTRRSSTTSSSPSCTQTHWGQCGGIGYSGCKTCTSGTTCQYSNDYYSQCL (SEQ ID NO:
17) [00171] CeI12a (Trichoderma reesei) [00172] MKFLQVLPALIPAALAQTSCDQWATFTGNGYTVSNNLWGASAGSGFGCVTAVSLSGGASWH ADWQWSGGQNNVKSYQNSQIAIPQKRTVNSISSMPTTASWSYSGSNIRANVAYDLFTAANPNHVTYSGD YELMIWLGKYGDIGPIGSSQGTVNVGGQSWTLYYGYNGAMQVYSFVAQTNTTNYSGDVKNFFNYLRDNK GYNAAGQYVLSYQFGTEPFTGSGTLNVASWTASIN (SEQ ID NO: 18) [00173] Cel45a (Trichoderma reesei) [00174] MKATLVLGSLIVGAVSAYKATTTRYYDGQEGACGCGSSSGAFPWQLGIGNGVYTAAGSQAL FDTAGASWCGAGCGKCYQLTSTGQAPCSSCGTGGAAGQSIIVMVTNLCPNNGNAQWCPVVGGTNQYGYS YHFDIMAQNEIFGDNVVVDFEPIACPGQAASDWGTCLCVGQQETDPTPVLGNDTGSTPPGSSPPATSSS PPSGGGQQTLYGQCGGAGWTGPTTCQAPGTCKVQNQWYSQCLP (SEQ ID NO: 19) [00175] Cel74a (Trichoderma reesei) [00176] MKVSRVLALVLGAVIPAHAAFSWKNVKLGGGGGFVPGIIFHPKTKGVAYARTDIGGLYRLN ADDSWTAVTDGIADNAGWHNWGIDAVALDPQDDQKVYAAVGMYTNSWDPSNGAIIRSSDRGATWSFTNL PFKVGGNMPGRGAGERLAVDPANSNIIYFGARSGNGLWKSTDGGVTFSKVSSFTATGTYIPDPSDSNGY NSDKQGLMWVTFDSTSSTTGGATSRIFVGTADNITASVYVSTNAGSTWSAVPGQPGKYFPHKAKLQPAE KALYLTYSDGTGPYDGTLGSVWRYDIAGGTWKDITPVSGSDLYFGFGGLGLDLQKPGTLVVASLNSWWP DAQLFRSTDSGTTWSPIWAWASYPTETYYYSISTPKAPWIKNNFIDVTSESPSDGLIKRLGWMIESLEI DPTDSNHWLYGTGMTIFGGHDLTNWDTRHNVSIQSLADGIEEFSVQDLASAPGGSELLAAVGDDNGFTF ASRNDLGTSPQTVWATPTWATSTSVDYAGNSVKSVVRVGNTAGTQQVAISSDGGATWSIDYAADTSMNG GTVAYSADGDTILWSTASSGVQRSQFQGSFASVSSLPAGAVIASDKKTNSVFYAGSGSTFYVSKDTGSS FTRGPKLGSAGTIRDIAAHPTTAGTLYVSTDVGIFRSTDSGTTFGQVSTALTNTYQIALGVGSGSNWNL YAFGTGPSGARLYASGDSGASWTDIQGSQGFGSIDSTKVAGSGSTAGQVYVGTNGRGVFYAQGTVGGGT GGTSSSTKQSSSSTSSASSSTTLRSSVVSTTRASTVTSSRTSSAAGPTGSGVAGHYAQCGGIGWTGPTQ CVAPYVCQKQNDYYYQCV (SEQ ID NO: 20) [00177] paMan5a (Podospora anserina) [00178] MKGLFAFGLGLLSLVNALPQAQGGGAAASAKVSGTRFVIDGKTGYFAGTNSYWIGFLTNNR DVDTTLDHIASSGLKILRVWGFNDVNNQPSGNTVWFQRLASSGSQINTGPNGLQRLDYLVRSAETRGIK LIIALVNYWDDFGGMKAYVNAFGGTKESWYTNARAQEQYKRYIQAVVSRYVNSPAIFAWELANEPRCKG CNTNVIFNWATQISDYIRSLDKDHLITLGDEGFGLPGQTTYPYQYGEGTDFVKNLQIKNLDFGTFHMYP GHWGVPTSFGPGWIKDHAAACRAAGKPCLLEEYGYESDRCNVQKGWQQASRELSRDGMSGDLFWQWGDQ LSTGQTHNDGFTIYYGSSLATCLVTDHVRAINALPA (SEQ ID NO: 21) [00179] paMan26a (Podospora anserina) [00180] MVKLLDIGLFALALASSAVAKPCKPRDGPVTYEAEDAILTGTTVDTAQVGYTGRGYVTGFD EGSDKITFQISSATTKLYDLSIRYAAIYGDKRTNVVLNNGAVSEVFFPAGDSFTSVAAGQVLLNAGQNT IDIVNNWGWYLIDSITLTPSAPRPPHDINPNLNNPNADTNAKKLYSYLRSVYGNKIISGQQELHHAEWI RQQTGKTPALVAVDLMDYSPSRVERGTTSHAVEDAIAHHNAGGIVSVLWHWNAPVGLYDTEENKWWSGF YTRATDFDIAATLANPQGANYTLLIRDIDAIAVQLKRLEAAGVPVLWRPLHEAEGGWFWWGAKGPEPAK QLWDILYERLTVHHGLDNLIWVWNSILEDWYPGDDTVDILSADVYAQGNGPMSTQYNELIALGRDKKMI AAAEVGAAPLPGLLQAYQANWLWFAVWGDDFINNPSWNTVAVLNEIYNSDYVLTLDEIQGWRS (SEQ ID NO: 22) [00181] Swollenin (Trichoderma reeseî)
MAGKLILVALASLVSLSIQQNCAALFGQCGGIGWSGTTCCVAGAQCSFVNDWYSQCLASTGGNPPNGTT SSSLVSRTSSASSSVGSSSPGGNSPTGSASTYTTTDTATVAPHSQSPYPSIAASSCGSWTLVDNVCCPS YCANDDTSESCSGCGTCTTPPSADCKSGTMYPEVHHVSSNESWHYSRSTHFGLTSGGACGFGLYGLCTK GSVTASWTDPMLGATCDAFCTAYPLLCKDPTGTTLRGNFAAPNGDYYTQFWSSLPGALDNYLSCGECIE LIQTKPDGTDYAVGEAGYTDPITLEIVDSCPCSANSKWCCGPGADHCGEIDFKYGCPLPADSIHLDLSD IAMGRLQGNGSLTNGVIPTRYRRVQCPKVGNAYIWLRNGGGPYYFALTAVNTNGPGSVTKIEIKGADTD NWVALVHDPNYTSSRPQERYGSWVIPQGSGPFNLPVGIRLTSPTGEQIVNEQAIKTFTPPATGDPNFYY IDIGVQFSQN (SEQ ID NO: 23) [00182] The ratio between the aglycosylated polypeptide described herein to the other biomass-degrading enzymes in the mixture can be, for example, at least 1:1 ; 1:2; 1:4; 1:8; 1:16; 1:32; 1:50; 1:75; 1:100, 1:150; 1:200; 1:300, 1:400 or 1:500. In an embodiment, the ratio of aglycosylated polypeptide described herein to the other enzymes in the mixture is 1:32. The ratio between the aglycosylated polypeptide described herein to the glycosylated polypeptide is, for example, at least 1:1; 1:2; 1:4; 1:8; 1:16; 1:32; 1:50; 1:75; 1:100, 1:150; 1:200; 1:300, 1:400, or 1:500.
[00183] Other examples of suitable biomass-degrading enzymes for use in the enzyme mixture of the présent invention include the enzymes from species in the généra Bacillus, Coprinus, Myceliophthora, Cephalosporium, Scytalidium, Pénicillium, Aspergillus, Pseudomonas, Humicola, Fusarium, Thielavia, Acremonium, Chrysosporium and Trichoderma, especially those produced by a strain selected from the species Aspergillus (see, e.g., EP Pub. No. 0 458 162), Humicola insolens (reclassified as Scytalidium thermophilum, see, e.g., U.S. Pat. No. 4,435,307), Coprinus cinereus, Fusarium oxysporum, Myceliophthora thermophila, Meripilus giganteus, Thielavia terrestris, Acremonium sp. (including, but not limited to, A. persicinum, A. acremonium, A. brachypenium, A. dichromosporum, A. obclavatum, A. pinkertoniae, A. roseogriseum, A. incoloratum, and A. furatum). Preferred strains include
Humicola insolens DSM 1800, Fusariiim oxysporum DSM 2672, Myceliophthora thermophila CBS 117.65, Cephalosporium sp. RYM-202, Acremonium sp. CBS 478.94, Acremonium sp. CBS 265.95, Acremoniumpersicinum CBS 169.65, Acremonium acremonium AHU 9519, Cephalosporium sp. CBS 535.71, Acremonium brachypenium CBS 866.73, Acremonium dichromosporum CBS 683.73, Acremonium obclavatum CBS 311.74, Acremonium pinkertoniae CBS 157.70, Acremonium roseogriseum CBS 134.56, Acremonium incoloratum CBS 146.62, and Acremonium furatum CBS 299.70H. Biomass-degrading enzymes may also be obtained from Chrysosporium, preferably a strain of Chrysosporium lucknowense. Additional strains that can be used include, but are not limited to, Trichoderma (particularly T. viride, T. reesei, and T. koningii), alkalophilic Bacillus (see, for example, U.S. Pat. No. 3,844,890 and EP Pub. No. 0 458 162), and Streptomyces (see, e.g., EP Pub. No. 0 458 162). [00184] In embodiments, the microorganism is induced to produce the biomass-degrading enzymes described herein under conditions suitable for increasing production of biomassdegrading enzymes compared to an uninduced microorganism. For example, an induction biomass sample comprising biomass as described herein is incubated with the microorganism to increase production of the biomass-degrading enzymes. Further description of the induction process can be found in US 2014/0011258, the contents of which are hereby incorporated by reference in its entirety.
[00185] The biomass-degrading enzymes produced and/or secreted by the aforementioned microorganisms can be isolated and added to the enzyme mixture of the présent invention. Altematively, in one embodiment, the aforementioned microorganisms or host cells expressing the biomass-degrading enzymes described herein and above are not lysed before addition to the saccharification reaction.
[00186] In an embodiment, the enzyme mixture comprises the host cell expressing an aglycosylated polypeptide having cellobiase activity as described herein, and one or more additional biomass-degrading enzymes described herein. In an embodiment, the enzyme mixture comprises a host cell expressing an aglycosylated polypeptide having cellobiase activity as described herein, and one or more host cells expressing one or more additional biomass-degrading enzymes described herein. For example, [00187] Use of the enzyme mixture described herein comprising an aglycosylated polypeptide having cellobiase activity results in increased yield of sugar products from saccharification compared to the yield of sugar products from saccharification using the standard mixture of biomass-degrading enzymes (e.g., RUTC30 cocktail, e.g., without addition of an aglycosylated polypeptide having cellobiase activity). The yield of sugar products increases at least 5%, at least 10%, at least 15%, at least 20%, at least 25%, at least 30%, at least 35%, at least 40%, at least 45%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 100% when an aglycosylated polypeptide having cellobiase activity is added to the saccharification process.
Further Processing [00188] Further processing steps may be performed on the sugars produced by saccharification to produce alternative products. For example, the sugars can be hydrogenated, fermented, or treated with other chemicals to produce other products.
[00189] Glucose can be hydrogenated to sorbitol. Xylose can be hydrogenated to xylitol. Hydrogénation can be accomplished by use of a catalyst (e.g., Pt/gamma-AbOs, Ru/C, Raney Nickel, or other catalysts know in the art) in combination with IL under high pressure (e.g., 10 to 12000 psi). The sorbitol and/or xylitol products can be isolated and purified using methods known in the art.
[00190] Sugar products from saccharification can also be fermented to produce alcohols, sugar alcohols, such as erythritol, or organic acids, e.g., lactic, glutamic or citric acids or amino acids.
[00191] Yeast and Zymomonas bacteria, for example, can be used for fermentation or conversion of sugar(s) to alcohol(s). Other microorganisms are discussed below. The optimum pH for fermentations is about pH 4 to 7. For example, the optimum pH for yeast is from about pH 4 to 5, while the optimum pH for Zymomonas is from about pH 5 to 6. Typical fermentation times are about 24 to 168 hours (e.g., 24 to 96 hrs) with températures in the range of 20° C. to 40° C. (e.g., 26° C. to 40° C.), however thermophilic microorganisms prefer higher températures.
[00192] In some embodiments, e.g., when anaérobie organisms are used, at least a portion of the fermentation is conducted in the absence of oxygen, e.g., under a blanket of an inert gas such as N2, Ar, He, CO2 or mixtures thereof. Additionally, the mixture may hâve a constant purge of an inert gas flowing through the tank during part of or ail of the fermentation. In some cases, anaérobie conditions can be achieved or maintained by carbon dioxide production during the fermentation and no additional inert gas is needed.
[00193] In some embodiments, ail or a portion of the fermentation process can be interrupted before the low molecular weight sugar is completely converted to a product (e.g., éthanol). The intermediate fermentation products include sugar and carbohydrates in high concentrations.
The sugars and carbohydrates can be isolated via any means known in the art. These intermediate fermentation products can be used in préparation of food for human or animal consumption. Additionally or alternatively, the intermediate fermentation products can be ground to a fine particle size in a stainless-steel laboratory mill to produce a flour-like substance.
[00194] Jet mixing may be used during fermentation, and in some cases saccharification and fermentation are performed in the same tank.
[00195] Nutrients for the microorganisms may be added during saccharification and/or fermentation, for example the food-based nutrient packages described in U.S. Pat. App. Pub. 2012/0052536, filed Jul. 15, 2011, the complété disclosure of which is incorporated herein by reference.
[00196] “Fermentation” includes the methods and products that are disclosed in U.S. Prov. App. No. 61/579,559, filed Dec. 22, 2012, and U.S. Prov. App. No. 61/579,576, filed Dec. 22, 2012, the contents of both of which are incorporated by reference herein in their entirety.
[00197] Mobile fermenters can be utilized, as described in International App. No. PCT/US2007/074028 (which was filed Jul. 20, 2007, was published in English as WO 2008/011598 and designated the United States), the contents of which is incorporated herein in its entirety. Similarly, the saccharification equipment can be mobile. Further, saccharification and/or fermentation may be performed in part or entirely during transit.
[00198] The microorganism(s) used in fermentation can be naturally-occurring microorganisms and/or engineered microorganisms. For example, the microorganism can be a bacterium (including, but not limited to, e.g., a cellulolytic bacterium), a fungus, (including, but not limited to, e.g., a yeast), a plant, a protist, e.g., a protozoa or a fungus-like protest (including, but not limited to, e.g., a slime mold), or an algae. When the organisms are compatible, mixtures of organisms can be utilized.
[00199] Suitable fermenting microorganisms hâve the ability to convert carbohydrates, such as glucose, fructose, xylose, arabinose, mannose, galactose, oligosaccharides or polysaccharides into fermentation products. Fermenting microorganisms include strains of the genus Saccharomyces spp. (including, but not limited to, S. cerevisiae (baker's yeast), S. distaticus, S. iivarum), the genus Kluyveromyces, (including, but not limited to, K. marxianus, K. fragilis), the genus Candida (including, but not limited to, C. pseudotropicalis, and C. brassicaé), Pichia stipitis (a relative of Candida shehatae), the genus Clavispora (including, but not limited to, C. lusitaniae and C. opuntiaë), the genus Pachysolen (including, but not limited to, P. tannophilus), the genus Bretannomyces (including, but not limited to, e.g., B. clausenii (Philippidis, G. P., 1996, Cellulose bioconversion technology, in Handbook on Bioethanol: Production and Utilization, Wyman, C. E., ed., Taylor & Francis, Washington,
D.C., 179-212) ). Other suitable microorganisms include, for example, Zymomonas mobilis, Clostridium spp. (including, but not limited to, C. thermocellum (Philippidis, 1996, supra), C. saccharobutylacetonicum, C. saccharobutylicum, C. Puniceum, C. beijernckii, and C. acetobutylicum), Moniliellapollinis, Moniliella megachiliensis, Lactobacillus spp. Yarrowia lipolytica, Aureobasidium sp., Trichosporonoides sp., Trigonopsis variabilis, Trichosporon sp., Moniliellaacetoabutans sp., Typhula variabilis, Candida magnoliae, Ustilaginomycetes sp., Pseudozyma tsukubaensis, yeast species of généra Zygosaccharomyces, Debaryomyces, Hansenula and Pichia, and fungi of the dematioid genus Torula.
[00200] For instance, Clostridium spp. can be used to produce éthanol, butanol, butyric acid, acetic acid, and acetone. Lactobacillus spp. can be used to produce lactic acid.
[00201] Many such microbial strains are publicly available, either commercially or through depositories such as the ATCC (American Type Culture Collection, Manassas, Va., USA), the NRRL (Agricultural Research Sevice Culture Collection, Peoria, 111., USA), or the DSMZ (Deutsche Sammlung von Mikroorganismen and Zellkulturen GmbH, Braunschweig, Germany), to name a few.
[00202] Commercially available yeasts include, for example, Red Star®/Lesaffre Ethanol Red (available from Red Star/Lesaffre, USA), FALI® (available from Fleischmann's Yeast, a division of Bums Philip Food Inc., USA), SUPERSTART® (available from Alltech, now Lalemand), GERT STRAND® (available from Gert Strand AB, Sweden) and FERMOL® (available from DSM Specialties).
[00203] Many microorganisms that can be used to saccharify biomass material and produce sugars can also be used to ferment and convert those sugars to useful products.
[00204] After fermentation, the resulting fluids can be distilled using, for example, a “beer column” to separate éthanol and other alcohols from the majority of water and residual solids. The vapor exiting the beer column can be, e.g., 35% by weight éthanol and can be fed to a rectification column. A mixture of nearly azeotropic (92.5%) éthanol and water from the rectification column can be purified to pure (99.5%) éthanol using vapor-phase molecular sieves. The beer column bottoms can be sent to the first effect of a three-effect evaporator. The rectification column reflux condenser can provide heat for this first effect. After the first effect, solids can be separated using a centrifuge and dried in a rotary dryer. A portion (25%) of the centrifuge effluent can be recycled to fermentation and the rest sent to the second and third evaporator effects. Most of the evaporator condensate can be retumed to the process as fairly clean condensate with a small portion split off to waste water treatment to prevent build-up of low-boiling compounds.
[00205] Other types of chemical transformation of the products from the processes described herein can be used, for example, production of organic sugar derived products such (e.g., furfural and furfural-derived products). Chemical transformations of sugar derived products are described in U.S. Prov. App. No. 61/667,481, filed Jul. 3, 2012, the disclosure of which is incorporated herein by reference in its entirety.
Examples [00206] The invention is further described in detail by reference to the following experimental examples. These examples are provided for purposes of illustration only, and are not intended to be limiting unless otherwise specified. Thus, the invention should in no way be construed as being limited to the following examples, but rather, should be construed to encompass any and ail variations which become évident as a resuit of the teaching provided herein.
[00207] Without fùrther description, it is believed that one of ordinary skill in the art can, using the preceding description and the following illustrative examples, make and utilize the compounds of the présent invention and practice the claimed methods. The following working examples specifîcally point out various aspects of the présent invention, and are not to be construed as limiting in any way the remainder of the disclosure.
Example 1: Cloning of Cel3a-C’His into an expression vector [00208] The mature sequence for Cel3a (amino acids 20-744) was synthesized and codonoptimized for E.coli expression by Genewiz. The Cel3a-C’His referred to in the following examples refers to the codon-optimized mature sequence for Cel3a (aas 20-744) with an 8xHis (SEQ ID NO: 7) tag at the C-terminus. The below primers were used to clone the Cel3a-C’His into pET-Duet (Novagen, CatalogNo. 71146):
Forward 5’ -CATGCCATGGGCGATAGTCACAGTACCAGC (SEQ ID NO: 4) Reverse 3’ CCCAAGCTTTCAinA.GTGATGATGATGATGATGATGATGGCTGCCGCTGCCGGCAACA CTCAGGGTGC (SEQ ID NO: 5) (Ncol and HindlII sites are underlined; start and stop codons are in bold; the polyhistidine (8His (SEQ ID NO: 7)) tag; and glycine-serine (GSGS) linker (SEQ ID NO: 8) are italized.) The Amplification reaction was performed using PfuUltra II Fusion HS Polymerase (Agilent, CatalogNo. 600672).
[00209] The amplified DNA was cloned by restriction digestion using Ncol restriction enzyme (New England Biolabs, R3193) and HindlII restriction enzyme (New England Biolabs, R3104) under conditions suggested by the manufacturer. The digested amplified DNA was ligated into the NcoI-HindlII sites in the pETDuet vector using T4 DNA ligase (New England Biolabs, ΜΌ202), followed by transformation of E. coli cloning host Top 10 One Shot (Invitrogen). Plasmid purification was carried out using Qiagen’s plasmid purification kit.
Example 2: Expression and purification of Cel3a-C’His [00210] The Cel3A-C’His constructs were transformed into the E. coli expression host Origami B (DE3) (EMD Millipore, Catalog No. 70837) and streaked on plates containing LB medium and lOOpg/ml ampicillin (Fisher Scientific, CatalogNo. BP1760), 15 pg/ml kanamysin (Fisher Scientific, Catalog No. BP906) and 12.5pg/ml tétracycline (Fisher
Scientified, Catalog No. BP912). Colonies carrying the recombinant DNA were picked from plates for the inoculation of 2 ml starter cultures, and grown ovemight at 37°C, then subsequently used to inoculate 100ml of LB media containing the appropriate antibiotics. Cultures were grown at 37°C until OD600 reached 0.8.
[00211] To induce protein expression, 500μΜ IPTG (Isopropyl-b-D-thiogalactopyranoside; Fisher Scientific, Catalog No. BP1755) was added. The expression culture was further grown for another 4 hours at 37°C. The cells were harvested by centrifugation at 4000g at room température (RT) for 20 minutes using the Sorvall St 16 rotor TX400; and the pellet was stored at -80°C.
[00212] For protein extraction, the cell pellet was thawed on ice and resuspended in 2ml of native lysis buffer containing 50mM Tris-HCl pH 7.5, 0.1% Triton X-100, 5mM imidazole (Fisher Scientific, Catalog No. 03196), and lmg/ml lysozyme (Fisher Scientific, Catalog No. BP535), then incubated on ice for 20 minutes. To digest the DNA in the sample, 2μ1 of lysonase (EMD Millipore, Catalog No. 71230) was added and incubated for another 10 minutes. The sample was then spun in a microcentrifuge at maximum speed for 10 minutes at 4°C. The clarified lysate was transferred to a fresh tube containing 100μ1 of pre-equilibrated Profinity (Biorad) Ni-charged IMAC resin slurry (Biorad, Catalog No. 732-4614). The native binding buffer consisted of 50mM Tris HCl pH 7.5, 150mM NaCl, 0.1% Triton X-100, and 5μΜ imidazole. The protein was batch-bound for 1 hour at RT, and then washed with native buffer containing 25 μΜ imidazole. The protein was eluted in 300ml of native buffer containing 200μΜ imidazole. Purified Cel3a protein from the expression and purification process described above is shown in Figure 1.
Example 3: Expression of Cel3a-C’His in Bioreactors [00213] In this example, the expression and purification of Cel3a is scaled up to 3 liter and 5 liter bioreactors. The skilled artisan could readily use the process described herein to scale the production further, for example, to 3L and 7L bioreactors.
[00214] For each bioreactor run, the pET Duet-1 Cel3a-C’His construct is transformed into
E. Coli Origami B (DE3) host cell line, and streaked on plates containing LB medium and 100pg/ml ampicillin (Fisher Scientific, Catalog No. BP1760), 15 pg/ml kanamysin (Fisher
Scientific, CatalogNo. BP906) and 12.5pg/ml tétracycline (Fisher Scientific, CatalogNo. BP912).
[00215] On Day 1, colonies carrying the recombinant DNA were picked from plates for the inoculation of 200 ml starter cultures, and grown ovemight at 37°C. The bioreactors were prepared, e.g., 1.5 L of LB media was added to each 3L reactor, and 4L of LB media was added to each 5L reactor, with the appropriate antibiotics. Each reactor has a pH probe, a dissolved oxygen (DO) probe, and a condenser.
[00216] On Day 2, the OD of the ovemight inoculums was measured by a spectrophotometer at 600 nM. The spectrophotometer can be blanked by using LB media. The bioreactors were set to 37°C, 300ipm, and 2.5wm. Once the bioreactors reach 37°C, the appropriate mis of ovemight inoculums was added to each 1.5L and to each 4L culture for a target starting OD value of 0.05. The OD of the reactors were measured occasionally by sampling out of the top with éthanol and a stérile pipet. As the OD approaches 0.8, samples were taken more frequently.
[00217] When the OD of the reactors reached 0.7-0.8 (approximately 2-4 hours after inoculation), the température of the bioreactors was reduced to 20°C, and the cultures were induced with IPTG for Cel3a-C’His protein expression. 750ml of IPTG was added to each 1.5L reactor and 1.5 ml of IPTG was added to each 5L reactor. The reactors were run ovemight.
[00218] On Day 3, in the moming, the cells were harvested into clean 2L centrifuge bottles and spun at 4200 rpm for 45 minutes. The supematant was discarded, and the pellet was saved and the weight of the pellet was recorded. Using PBS buffer, the pellet was transferred to 50 mL conical centrifuge tubes. The 50 mL tubes were spun at 4500 rpm for 45 minutes in the Sorvall St-16 tabletop centrifuge. Supematant was discarded and the pellet was frozen at -20°C until ready for purification, or processed immediately for purification.
Example 4: Protein Extraction from Cells [00219] Lysis Buffer with EDTA was prepared, containing the following: 50mM Tris-HCl pH 7.5, 0.1% Triton X-100, 1% Glycerol, 5mM imidazole, 1 mg/mL Lysozyme, and 1 mM EDTA in de-ionized water. The cell pellet was resuspended in 5 mL volume of Lysis Buffer with EDTA. 10 μΐ of Lysonase (DNAse) was added to the sample. Samples were then incubated for 1 hour at room température.
[00220] Then, samples were sonicated on ice for three 1 minute intervals (e.g., for 3 minutes total, Sorvall sonicator at setting 17).
[00221] At this point, ΙΟΟΟμΙ aliquots were e reserved from each fraction of the extraction process to analyze the yield and activity of the Cel3a-C’His protein from the extraction and purification process. An aliquot was taken from sonicated sample (sample A) to represent the total protein isolated. The sample was then centrifuged at 13000 ipm for 5 minutes, and the supernatant was removed. An aliquot was taken from the supernatant (sample D), representing the soluble fraction. The remaining pellet was resuspended in ΙΟΟΟμΙ of Lysis Buffer + EDTA, and then centrifuged at 13000 rpm for 5 minutes. The supernatant was removed, and an aliquot was taken from the supernatant (sample E), representing the soluble wash fraction. The remaining pellet was reserved (sample C), representing the insoluble fraction. All samples were stored in the refrigerator (e.g., 4°C) until analysis.
[00222] A cellobiase activity assay was performed using the total protein aliquot (sample A), for example, as described in Example 6.
[00223] The samples A, C, D, and E were also analyzed by SDS-PAGE and western blotting. Samples were diluted to 2.00 OD to normalize. 17.5μ1 of LDS sample buffer (375μ1 LDS buffer and 150μ1 of NuPAGE reducing agent were mixed to préparé 525μ1 of LDS sample buffer) was added to 32.5μ1 of each sample. Samples were boiled for 5 minutes and 20μ1 of each sample was loaded into each gel well. A standard (molecular weight ladder) was also loaded into a gel well.
Example 5: Mass Spectrometry Analysis of Purified Cel3a-N’His [00224] Cel3a with a His-tag at the N-terminus (Cel3a-N’His) was cloned, expressed in E. coli, and harvested using methods similar to those described above in Examples 1-4. Cel3aN’His was cloned into pET-Duetl vector. The expressed Cel3a-N’His was purified with an affinity column containing Profmity Ni-charge IMAC resin slurry and eluted using elution buffer containing 200mM imidazole in the lysis buffer.
[00225] The purified Cel3a-N’His was dialyzed in buffer containing 50mM Tris-HCl and 0.5M EDTA. The dialyzed protein was then submitted for liquid chromatography-mass spectrometry (LCMS) intact mass analysis.
[00226] Sample concentration was determined, e.g., by Bradford assay and by Nanodrop assay. Sample concentration was at about 4 mg/mL. A 10X and 100X dilution sample was prepared for LCMS, with 0.1% formic acid in de-ionized water to provide 0.4 mg/mL and 0.04 mg/mL samples.
[00227] LC conditions were as follows:
Acquity UPLC H-Class Bio System: BEH300 C4 column, 1.7pm, 2.1x150 mm (p/n: 186004497)
Column Temp: 45°C
- MPA: 0.1% FA in Water
- MPB: 0.1%FAinACN [00228] Mass Spectrometer conditions were as follows:
Source Temp: 125°C
Desolvation Temp: 450°C
Cône Voltage: 40V
Calibration: Csl (500-5000 Da)
Lockmass: Csl [00229] The chromatographie profile of the sample is shown in Figure 2. The profile shows three areas of interest: a large peak at 31 minutes, a cluster of peaks between 38 and 40 minutes, and a smaller peak at about 45 minutes. Manual deconvolution of the mass spectrum of peak at about 45 minutes and the peak cluster at about 39 minutes shows that the main peaks are not part of a charge state envelope, but rather a polymer sériés (with 44 amu as the repeating unit), and were thus attributed to possibly being from the Triton-X used in the lysis buffer.
[00230] Analysis of the mass spectral région for the peak detected at 31 minutes indicates a protein charge state envelope, and importantly, with no evidence of glycosylation (Figure 3). Expansion of the charge state envelope indicates the presence of 2-3 minor proteins slightly larger than the major component. Deconvolution of the charge state envelope indicated that the major component had a molecular weight of 78,052 (Fig. 4), which corresponds to the expected molecular weight of Cel3a-N’His. The minor components had the following molecular weights: 78,100, 78,182, and 78,229 (Fig. 4), therefore indicating minor modifications to the Cel3a-N’His protein.
[00231] Among the charge states for the main component were several signais corresponding to smaller peptide fragments. These are distinguished by the more widely spaced isotope peaks. Some of the fragments identified hâve molecular weights of 8,491, 9,261, and 11,629. These fragments may be generated in the MS source, or may originate from the sample.
[00232] The results from this experiment demonstrate that an aglycosylated cellobiase, Cel3a-N’His was cloned, expressed, and isolated using the methods encompassed by the présent invention.
Example 6: Cellobiase Activity Assay [00233] His-tagged Cel3a (e.g., Cel3a with a His-tag at the N or C-terminus) were purified using IMAC techniques. The amount (titer) of purified His-tagged Cel3a was determined using Bradford assay and/or the nanodrop. For nanodrop quantification, the molar extinction coefficient was estimated by inserting the amino acid sequence of the target form of Cel3a into the ExPASy ProtParam online tool.
[00234] For the activity assay, two fold serial dilutions of samples containing Cel3a-N’His were performed in a 96 well plate format. Dilutions were incubated with a D-(+)-Cellobiose (Fluka) substrate solution in 50mM sodium citrate monobasic buffer at pH 5.0, at 48°C for 30 minutes. After 30 minutes, the samples were heated to 100°C for 10 minutes to stop the reaction. Samples were analysed for glucose and cellobiose using the YSI Biochemistry analyser (YSI Life Sciences) and/or HPLC methods. The protein activity of the purified Cel3aN’His was recorded as percent conversion of cellobiose to glucose per 30 minutes (FIG. 5). [00235] For samples where the amount of Cel3a cannot be assayed by using a Bradford assay and/or the nanodrop, e.g., crude lysate sample, the activity assay can be used to détermine the titer of Cel3a. A standard curve of cellobiase activity of a sample of purified CeI3a-N’His with a known concentration is generated. Two-fold serial dilutions of Cel3a-N’His with a known concentration were prepared in one row of a 96 well plate, e.g., 12 two-fold serial dilutions. The other rows contained two-fold serial dilutions of other remaining samples whose titer is to be determined, e.g., the crude lysate sample. The dilutions were incubated with a D (+)-Cellobiose (Fluka) substrate solution in 50mM sodium citrate monobasic buffer at pH 5.0, at 48°C for 30 minutes. After 30 minutes, the samples were heated to 100°C for 10 minutes to stop the reaction. Samples were analysed for glucose and cellobiose using the YSI Biochemistry analyser (YSI Life Sciences) and/or HPLC methods. Using Cel3a-N’His samples of known concentration, a standard curve is generated using the data points within the linear range of the assay (FIG. 6). The cellobiase activity detected from the samples with unknown Cel3a titer can be compared to the standard curve to détermine the titer of Cel3a in the sample. [00236] Units of activity are only relative if calculated using values within the linear range of the assay. The linear range of the assay is defined as using glucose values that are less than 30% of the original soluble substrate load. In addition, glucose values lower than 0.05g/L are omitted due to instrumentation reporting levels. One unit of cellobiase activity is defined as the amount of glucose per the amount of Cel3a per 30 minutes: [Glucose]g/L / [Cel3a]g/L / 30min.
[00237] A comparison of the activity between aglycosylated Cel3a purified from E. coli and endogenous (glycosylated) cellobiase Cel3a from T. reesei was performed, using the cellobiase activity described herein. The results are shown in Figure 7, which show that the recombinant Cel3a (aglycosylated Cel3a purified from E. coli', labeled Cel3a in Fig. 7) demonstrated higher spécifie activity for converting cellobiose to glucose compared to the endogenous Cel3a (labeled L4196 in Fig. 7). Specifically, at concentrations lower than 0.2 mg/mL of cellobiase, the recombinant Cel3a was able to produce a larger amount of glucose after 30 minutes than the endogenous Cel3a.
Example 7: Addition of Aglycosylated Cellobiase Increases Product From Saccharification Process [00238] In this example, the glucose yield was compared between two saccharification reactions. In one saccharification reaction, an enzyme mixture comprising the enzymes described in Table 1 was added to a biomass. In a second saccharification reaction, 0.8 mg/ml of aglycosylated Cel3a purified from E. coli was added to the enzyme mixture comprising the enzymes described in Table 1, and was added to a biomass. The saccharification reactions were run under the conditions as described herein, with samples taken at multiple timepoints between 0 and 80 minutes of the reaction. The resulting glucose was measured and quantified using methods in the art, such as by YSI instrument or HPLC, and plotted over time, as shown in Figure 8. These results show that addition of aglycosylated Cel3a increased the yield of sugar product, e.g., glucose, in a saccharification reaction.
EQUIVALENTS [00239] The disclosures of each and every patent, patent application, and publication cited herein are hereby incorporated herein by reference in their entirety. While this invention has been disclosed with reference to spécifie aspects, it is apparent that other aspects and variations of this invention may be devised by others skilled in the art without departing from the true spirit and scope of the invention. The appended claims are intended to be construed to 10 include ail such aspects and équivalent variations.
Claims (53)
- What is claimed is:1. An aglycosylated polypeptide having cellobiase activity comprising at least 90% identity to SEQ ID NO: 1, or a functional fragment thereof.
- 2. The aglycosylated polypeptide of claim 1, wherein the polypeptide comprises a CeI3A enzyme from wild-type T. reesei, or a functional variant or fragment thereof.
- 3. The aglycosylated polypeptide of claim 2, wherein the Cel3A enzyme comprises (e.g., consists of) the amino acid sequence SEQ ID NO: 1.
- 4. The aglycosylated polypeptide of any one of claims 1-3, wherein the polypeptide is encoded by a nucleic acid sequence comprising (e.g., consisting of) SEQ ID NO: 2.
- 5. The aglycosylated polypeptide of any one of claims 1-4, wherein the polypeptide comprises a mutation proximal to or at one or more glycosylation sites, wherein the mutation prevents glycosylation at the one or more glycosylation sites.
- 6. The aglycosylated polypeptide of claim 5, wherein the mutation is at one or more of the threonine at amino acid position 78, the threonine at amino acid position 241, the serine at amino acid position 343, the serine at amino acid position 450, the threonine at amino acid position 599, the serine at amino acid position 616, the threonine at amino acid position 691, the serine at amino acid position 21, the threonine at amino acid position 24, the serine at amino acid position 25, the serine at amino acid position 28, the threonine at amino acid position 38, the threonine at amino acid position 42, the threonine at amino acid position 303, the serine at amino acid position at 398, the at serine amino acid position 435, the serine at amino acid position 436, the threonine at amino acid position 439, threonine at amino acid position 442, the threonine at amino acid position 446, the serine at amino acid position 451, the serine at amino acid position 619, the serine at amino acid position 622, the threonine at amino acid position 623, the serine at amino acid position 626, or the threonine at amino acid position 630 ofSEQ IDNO: 1.
- 7. The aglycosylated polypeptide of any one of the preceding claims, wherein the aglycosylated polypeptide has increased cellobiase activity as compared to glycosylated Cel3A enzyme from wild-type T. reesei.
- 8. The aglycosylated polypeptide of any one of the preceding claims, wherein the aglycosylated polypeptide has increased substrate récognition, a more active substrate récognition site, or reduced steric hindrance as compared to glycosylated Cel3 A enzyme from wild-type T. reesei.
- 9. The aglycosylated polypeptide of any one of the preceding claims, wherein the aglycosylated polypeptide hydrolyzes a carbohydrate such as cellobiose into one or more monosaccharides, e.g., glucose.
- 10. The aglycosylated polypeptide of any one of the preceding claims, wherein the cellobiase activity comprises hydrolysis of a beta 1,4 glycosidic linkage of cellobiose.
- 11. A nucleic acid sequence encoding the polypeptide of claims 1-10.
- 12. A nucleic acid sequence encoding a Cel3A enzyme or functional variant thereof, wherein the nucleic acid sequence comprises (e.g., consists of) at least 90% identity to SEQ ID NO: 2 or SEQ ID NO: 3.
- 13. A nucleic acid molécule comprising a nucleic acid sequence of any of claims 11 and 12.
- 14. The nucleic acid molécule of claim 13, further comprising a promoter, e.g., a promoter for prokaryotic cell expression, e.g., bacterial cell expression, e.g., expression in E. coli.
- 15. The nucleic acid molécule of claim 14, wherein the promoter sequence is a constitutive promoter sequence, inducible promoter sequence, or a repressible promoter sequence.
- 16. The nucleic acid molécule of claim 14 or 15, whereinthe promoter is a T7 promoter.
- 17. The nucleic acid molécule of any one of claims 13-16, further comprising a nucleic acid sequence encoding a tag, e.g., a tag for détection and/or purification and/or for linkage to another molécule, e.g., a His tag.
- 18. The nucleic acid molécule of any one of claims 13-17, further comprising a nucleic acid encoding one or more signal sequences, e.g., a sécrétion signal sequence.
- 19. An expression vector comprising the nucleic acid sequence of any of claims 11-18.
- 20. The expression vector of claim 18, further comprising a nucleic acid sequence encoding a sélection marker, e.g., a kanamycin or an ampicillin marker.
- 21. A cell comprising the vector of any one of claims 13-20.
- 22. A prokaryotic cell or bacterial cell comprising the vector of any one of claims 13-20.
- 23. A cell expressing a polypeptide of any one of claims 1-10.
- 24. A prokaryotic cell or bacterial cell expressing a polypeptide of any one of claims 1-10.
- 25. The bacterial cell of claim 22 or 24 wherein the bacterial cell is impaired for glycosylation.
- 26. The bacterial cell of claim 25, wherein the bacterial cell is an E. coli cell.
- 27. The bacterial cell of claim 26, wherein the E. coli cell is an origami E. coli cell.
- 28. A method for producing the aglycosylated polypeptide of any one of claims 1, 2, 3, 4, 7, 8, 9 or 10, comprising culturing a cell expressing a polypeptide of any one of claims 1, 2, 3, 4, 7, 8, 9 or 10, under conditions suitable for the expression of the polypeptide, wherein the cell does not glycosylate the polypeptide, e.g., a bacterial cell, e.g., an E. coli cell, e.g., an origami E. coli cell.
- 29. A method for producing the aglycosylated polypeptide of any of claims 1, 2, 3, 4, 7, 8, 95 or 10, comprising treating a polypeptide comprising an amino acid with at least 90% identity to SEQ ID NO: 1 with a deglycosylating enzyme.
- 30. The method of claim 29, wherein the deglycosylating enzyme is selected from PGNase and EndoH.
- 31. A method for producing the aglycosylated polypeptide of any one of claims 1, 2, 3, 4, 7, 8, 9 or 10, comprising culturing a cell that comprises a nucleic acid sequence encoding a polypeptide of any one of claims 1, 2, 3, 4, 7, 8, 9 or 10, wherein the nucleic acid sequence has one or more mutations which prevent glycosylation of the encoded15 polypeptide.
- 32. A method for culturing a cell expressing the aglycosylated polypeptide of any one of claims 1, 2, 3, 4, 7, 8, 9 or 10 in the presence of a glycosylation inhibitor, e.g., tunicamycin.
- 33. An enzyme mixture comprising a glycosylated polypeptide comprising an amino acid sequence with at least 90% identity to SEQ ID NO: 1 and an aglycosylated polypeptide comprising an amino acid sequence with at least 90% identity to SEQ ID NO: 1, wherein both of the glycosylated polypeptide and the aglycosylated peptide hâve25 cellobiase activity.
- 34. The enzyme mixture of claim 33, wherein the aglycosylated polypeptide is the aglycosylated polypeptide of any one of claims 1-10.30
- 35. The enzyme mixture of claim 33 or 34, wherein the glycosylated polypeptide and the aglycosylated polypeptide both comprise Cel3 A enzyme from wild-type T. reesei.
- 36. The enzyme mixture of any one of claims 33-35, further comprising at least one additional enzyme derived from a microorganism, wherein the additional enzyme has a biomass-degrading activity of a cellulose based material.
- 37. The enzyme mixture of claim 36, wherein the additional enzyme is selected from a ligninase, an endoglucanase, a cellobiohydrolase, xylanase, and a cellobiase.
- 38. The enzyme mixture of claim 33 or 34, wherein the mixture further comprises one or more ligninase, one or more endoglucanase, one or more cellobiohydrolase, one or more xylanase.
- 39. The enzyme mixture of any one of claims 33-38, wherein the ratio between the aglycosylated polypeptide to the remaining enzymes in the mixture is at least 1:32, e.g., 1:32 to 1:300.
- 40. The enzyme mixture of any of claims 33-38, wherein the ratio ofthe aglycosylated polypeptide to a glycosylated polypeptide is at least 1:32, e.g., 1:32 to 1:300.
- 41. A method of producing a product (e.g., a hydrogen, sugar, alcohol, etc.) from a biomass (or converting a biomass to a product) comprising contacting a biomass, e.g., by treatment with an électron beam, with an aglycosylated polypeptide of any one of claims 1-10 and a microorganism (mixture) that produces one or more biomassdegrading enzyme or an enzyme mixture comprising biomass-degrading enzymes under conditions suitable for the production of the sugar product.
- 42. A method of producing a product (e.g., a hydrogen, a sugar, an alcohol) from a biomass comprising contacting a biomass with an enzyme mixture of any one of claims 33-40 under conditions suitable for the production of the product.
- 43. The method of claim 41 or 42, wherein said product is a sugar product.
- 44. The method of any one of claims 41-43, further comprising isolating the sugar product.
- 45. The method of claim 44, wherein the isolating of the sugar product comprises précipitation, crystallization, chromatography, centrifugation, and/or extraction.
- 46. The method of any one of claims 43-45, wherein the sugar product is glucose and/or xylose.
- 47. The method of any one of claims 41-46, wherein the enzyme mixture comprises at least two of the enzymes selected from the group consisting of B2AF03, CIP1, CIP2, Cella, Cel3a, Cel5a, Cel6a, Cel7a, Cel7b, Cell2a, Cel45a, Cel74a, paMan5a, paMan26a, Swollenin, and the enzymes listed in Table 1.
- 48. The method of any one of claims 41-47, wherein the biomass comprises one or more of an agricultural product or waste, a paper product or waste, a forestry product, or a general waste, or any combination thereof; wherein:a) an agricultural product or waste comprises sugar cane jute, hemp, flax, bamboo, sisal, alfalfa, hay, arracacha, buckwheat, banana, barley, cassava, kudzu, oca, sago, sorghum, potato, sweet potato, taro, yams, beans, favas, lentils, peas, grasses, switchgrass, miscanthus, cord grass, reed canary grass, grain residues, canola straw, wheat straw, barley straw, oat straw, rice straw, corn cobs, corn stover, corn fiber, coconut haïr, beet pulp, bagasse, soybean stover, grain residues, rice hulls, oat hulls, wheat chaff, barley hulls, or beeswing, or a combination thereof;b) a paper product or waste comprises paper, pigmented papers, loaded papers, coated papers, filled papers, magazines, printed matter, printer paper, polycoated paper, cardstock, cardboard, paperboard, or paper pulp, or a combination thereof;c) a forestry product comprises aspen wood, particle board, wood chips, or sawdust, or a combination thereof; andd) a general waste comprises manure, sewage, or offal, or a combination thereof.
- 49. The method of any one of claims 41-48, further comprising a step of treating the biomass prior to introducing the microorganism or the enzyme mixture to reduce the recalcitrance of the biomass, wherein the treating comprises bombardment with électrons, sonication, oxidation, pyrolysis, steam explosion, chemical treatment, mechanical treatment, or freeze grinding.
- 50. The method of any one of claims 41-49 wherein the microorganism that produces a biomass-degrading enzyme is from species in the généra selected from Bacillus, Coprinus, Myceliophthora, Cephalosporium, Scytalidium, Pénicillium, Aspergillus, Pseudomonas, Humicola, Fusarium, Thielavia, Acremonium, Chrysosporium or Trichoderma.
- 51. The method of any one of claims 41-50, wherein the microorganism that produces a biomass-degrading enzyme is selected from Aspergillus, Humicola insolens (Scytalidium thermophilum) Coprinus cinereus, Fusarium oxysporum, Myceliophthora thermophila, Meripilus giganteus, Thielavia terrestris, Acremonium persicinum, Acremonium acremonium, Acremonium brachypenium, Acremonium dichromosporum, Acremonium obclavatum, Acremonium pinkertoniae, Acremonium roseogriseum, Acremonium incoloratum, Acremonium furatum, Chrysosporium luckno-wense, Trichoderma viride, Trichoderma reesei, or Trichoderma koningii.
- 52. The method of any one of claims 41-51, wherein the microorganism has been induced to produce a biomass-degrading enzyme by combining the microorganism with an induction biomass sample under conditions suitable for increasing production of a biomass-degrading enzyme compared to an uninduced microorganism.
- 53. The method of any of claim 52, wherein said induction biomass sample comprises paper, paper products, paper waste, paper pulp, pigmented papers, loaded papers, coated papers, filled papers, magazines, printed matter, printer paper, polycoated paper, card stock, cardboard, paperboard, cotton, wood, particle board, forestry wastes, sawdust, aspen wood, wood chips, grasses, switchgrass, miscanthus, cord grass, reed canary grass, grain residues, rice hulls, oat hulls, wheat chaff, barley hulls, agricultural waste, silage, canola straw, wheat straw, barley straw, oat straw, rice straw, jute, hemp, flax, bamboo, sisal, abaca, com cobs, com stover, soybean stover, com fiber, alfalfa, hay, coconut hair, sugar processing residues, bagasse, beet pulp, agave bagasse, algae,5 seaweed, manure, sewage, offal, agricultural or industrial waste, arracacha, buckwheat, banana, barley, cassava, kudzu, oca, sago, sorghum, potato, sweet potato, taro, yams, beans, favas, lentils, peas, or any combination thereof.
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US62/035346 | 2014-08-08 |
Publications (1)
Publication Number | Publication Date |
---|---|
OA18211A true OA18211A (en) | 2018-09-04 |
Family
ID=
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20190161772A1 (en) | Methods for improving the efficiency of simultaneous saccharification and fermentation reactions | |
US8298795B2 (en) | Use of glycoside hydrolase 61 family proteins in processing of cellulose | |
US9080163B2 (en) | Cellobiohydrolase variants | |
US9260705B2 (en) | Cellobiohydrolase variants | |
US20200190462A1 (en) | Compositions for enhanced enzyme production | |
US20190316108A1 (en) | Solubilized enzyme and uses thereof | |
US20190024068A1 (en) | Aglycosylated enzyme and uses thereof | |
EP2794871A1 (en) | Endoglucanase 1b (eg1b) variants | |
OA18211A (en) | Aglycosylated enzyme and uses thereof. | |
WO2018069499A1 (en) | Method for processing lignocellulosic biomass | |
OA18252A (en) | Solubilized enzyme and uses thereof. | |
MEDOFF et al. | Patent 2981059 Summary | |
WO2013043317A1 (en) | Endoglucanase 1b |