DK2855673T3 - IMPROVED ENDOGLUCANASES FOR TREATMENT OF CELLULOS MATERIAL - Google Patents

IMPROVED ENDOGLUCANASES FOR TREATMENT OF CELLULOS MATERIAL Download PDF

Info

Publication number
DK2855673T3
DK2855673T3 DK13793173.9T DK13793173T DK2855673T3 DK 2855673 T3 DK2855673 T3 DK 2855673T3 DK 13793173 T DK13793173 T DK 13793173T DK 2855673 T3 DK2855673 T3 DK 2855673T3
Authority
DK
Denmark
Prior art keywords
gly
ala
ser
thr
cbm
Prior art date
Application number
DK13793173.9T
Other languages
Danish (da)
Inventor
Alexandra Komander
Kim Langfelder
Taija Leinonen
Jari Vehmaanperä
Terhi Puranen
Original Assignee
Roal Oy
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Roal Oy filed Critical Roal Oy
Application granted granted Critical
Publication of DK2855673T3 publication Critical patent/DK2855673T3/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P19/00Preparation of compounds containing saccharide radicals
    • C12P19/02Monosaccharides
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/24Hydrolases (3) acting on glycosyl compounds (3.2)
    • C12N9/2402Hydrolases (3) acting on glycosyl compounds (3.2) hydrolysing O- and S- glycosyl compounds (3.2.1)
    • C12N9/2405Glucanases
    • C12N9/2434Glucanases acting on beta-1,4-glucosidic bonds
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/24Hydrolases (3) acting on glycosyl compounds (3.2)
    • C12N9/2402Hydrolases (3) acting on glycosyl compounds (3.2) hydrolysing O- and S- glycosyl compounds (3.2.1)
    • C12N9/2405Glucanases
    • C12N9/2434Glucanases acting on beta-1,4-glucosidic bonds
    • C12N9/2437Cellulases (3.2.1.4; 3.2.1.74; 3.2.1.91; 3.2.1.150)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P19/00Preparation of compounds containing saccharide radicals
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P19/00Preparation of compounds containing saccharide radicals
    • C12P19/14Preparation of compounds containing saccharide radicals produced by the action of a carbohydrase (EC 3.2.x), e.g. by alpha-amylase, e.g. by cellulase, hemicellulase
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P7/00Preparation of oxygen-containing organic compounds
    • C12P7/02Preparation of oxygen-containing organic compounds containing a hydroxy group
    • C12P7/04Preparation of oxygen-containing organic compounds containing a hydroxy group acyclic
    • C12P7/06Ethanol, i.e. non-beverage
    • C12P7/08Ethanol, i.e. non-beverage produced as by-product or from waste or cellulosic material substrate
    • C12P7/10Ethanol, i.e. non-beverage produced as by-product or from waste or cellulosic material substrate substrate containing cellulosic material
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide
    • C07K2319/20Fusion polypeptide containing a tag with affinity for a non-protein ligand
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P2203/00Fermentation products obtained from optionally pretreated or hydrolyzed cellulosic or lignocellulosic material as the carbon source
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02EREDUCTION OF GREENHOUSE GAS [GHG] EMISSIONS, RELATED TO ENERGY GENERATION, TRANSMISSION OR DISTRIBUTION
    • Y02E50/00Technologies for the production of fuel of non-fossil origin
    • Y02E50/10Biofuels, e.g. bio-diesel

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Wood Science & Technology (AREA)
  • Zoology (AREA)
  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Genetics & Genomics (AREA)
  • General Engineering & Computer Science (AREA)
  • Microbiology (AREA)
  • Biotechnology (AREA)
  • General Health & Medical Sciences (AREA)
  • Biochemistry (AREA)
  • Medicinal Chemistry (AREA)
  • Molecular Biology (AREA)
  • Biomedical Technology (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • General Chemical & Material Sciences (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)
  • Enzymes And Modification Thereof (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Paper (AREA)
  • Chemical Or Physical Treatment Of Fibers (AREA)

Description

DESCRIPTION
[0001] The work leading to this invention has received funding from the European Community's Seventh Framework Programme FP7I2007-2013 under grant agreement n° 239341.
FIELD OF THE INVENTION
[0002] The present invention relates to production of fermentable sugars from lignocellulosic material by enzymatic conversion. The fermentable sugars are useful e.g. in the production of bioethanol, or for other purposes. In particular the invention relates to novel polypeptides, polynucleotides encoding them, and to vectors and host cells containing the polynucleotides. The invention is further directed to a method for treating cellulosic material with fungal endoglucanase or an enzyme preparation containing said enzyme. Still further the invention is directed to uses of the polypeptides or enzyme preparations containing said polypeptides and to a method of preparing them.
BACKGROUND OF THE INVENTION
[0003] Limited resources of fossil fuels, and increasing amounts of CO2 released from them and causing the greenhouse phenomenon have raised a need for using biomass as a renewable and clean source of energy. Biomass resources can be broadly categorized as agricultural or forestry-based, including secondary sources derived from agro and wood industries, waste sources and municipal solid wastes. One promising, alternative technology is the production of biofuels i.e. (bio)ethanol from lignocellulosic materials. In the transportation sector biofuels are for the time being the only option, which could reduce the CO2 emissions by an order of magnitude. The ethanol can be used in existing vehicles and distribution systems and thus it does not require expensive infrastructure investments. Sugars derived from lignocellulosic renewable raw materials can also be used as raw materials for a variety of chemical products that can replace oil-based chemicals.
[0004] Lignocellulosic raw material comprises an abundant source of carbohydrates for a variety of biofuels, including bioethanol. Most of the carbohydrates in plants are in the form of lignocellulose, which essentially consists of cellulose, hemicellulose, and lignin. Lignocellulose can be converted into bioethanol and other chemical products via fermentation following hydrolysis to fermentable sugars. In a conventional lignocellulose-to-ethanol process the lignocellulosic material is first pretreated either chemically or physically to make the cellulose fraction more accessible to hydrolysis. The cellulose fraction is then hydrolysed to obtain sugars that can be fermented by yeast into ethanol and distilled to obtain pure ethanol. Lignin is obtained as a main coproduct that may be used as a solid fuel.
[0005] One barrier of production of biofuels from cellulosic and lignocellulosic biomass is the robustness of the cell walls and the presence of sugar monomers in the form of inaccessible polymers that require a great amount of processing to make sugar monomers available to the micro-organisms that are typically used to produce alcohol by fermentation. Enzymatic hydrolysis is considered the most promising technology for converting cellulosic biomass into fermentable sugars. However, enzymatic hydrolysis is used only to a limited amount at industrial scale, and especially when using strongly lignified material such as wood or agricultural waste the technology is not satisfactory. The cost of the enzymatic step is one of the major economic factors of the process. Efforts have been made to improve the efficiency of the enzymatic hydrolysis of the cellulosic material (Badger 2002).
[0006] WO2001060752 describes a continuous process for converting solid lignocellulosic biomass into combustible fuel products. After pretreatment by wet oxidation or steam explosion the biomass is partially separated into cellulose, hemicellulose and lignin, and is then subjected to partial hydrolysis using one or more carbohydrase enzymes (EC 3.2).
[0007] W02002024882 concerns a method of converting cellulose to glucose by treating a pretreated lignocellulosic substrate with an enzyme mixture comprising cellulase and a modified cellobiohydrolase I (CBHI) obtained by inactivating its cellulose binding domain (CBD).
[0008] US 20040005674 A1 describes novel enzyme mixtures that can be used directly on lignocellulose substrate, whereby toxic waste products formed during pretreatment processes may be avoided, and energy may be saved. The synergistic enzyme mixture contains a cellulase and an auxiliary enzyme such as xylanase, ligninase, amylase, protease, lipidase or glucuronidase, or any combination thereof. Cellulase is considered to include endoglucanase, beta-glucosidase and cellobiohydrolase. US 20050164355 describes a method for degrading lignocellulosic material with one or more cellulolytic enzymes selected from endoglucanase, beta-glucosidase and cellobiohydrolase and in the presence of at least one surfactant. Additional enzymes such as hemicellulases, esterase, peroxidase, protease, laccase or mixture thereof may also be used. The presence of surfactant increases the degradation of lignocellulosic material compared to the absence of surfactant.
[0009] WO2011080317 describes a method of treating cellulosic material with fungal CBHIl/Cel6A cellobiohydrolase enzyme. The enzyme is useful in various industrial applications, particularly in production of biofuels, where production of fermentable sugars from lignocellulosic material at moderate to elevated temperature is advantageous.
[0010] Cellulases from a number of bacterial and fungal sources have been purified and characterized. The best investigated and most widely applied cellulolytic enzymes of fungal origin have been derived from Trichoderma reesei (the anamorph of Hypocrea jecorina). Cellulases from less known fungi have also been disclosed. Hong et al. (2003a and 2003b) characterize EG and CBHI of Thermoascus aurantiacus produced in yeast. Tuohy etal. (2002) describe three forms of cellobiohydrolases from Talaromyces emersonii, a moderately thermophilic fungus. The sequence and detailed biochemical characterization of these T. emersonii cellobiohydrolases have shown comparable properties with the cellobiohydrolases of T. reesei and P. chrysosporium. The cellulase enzymes of another thermophilic fungus, Melanocarpus albomyces, include at least two endoglucanases (Cel45A and Cel7A) and one cellobiohydrolase (Cel7B). These enzymes have been cloned and characterized for their pH and temperature behavior (Miettinen-Oinonen et ai, 2004). W02007071818 describes enzymatic conversion of lignocellulosic material by enzymes including cellobiohydrolase, endoglucanase, beta-glucosidase and optionally xylanase derived from Thermoascus auranticus, Acremonium thermophilium or Chaetomium thermophilium. US 7,892,812 describes cellulose compositions comprising endoglucanase and their use in industrial applications, for example in saccharification of lignocellulose biomass. The cellulases are from fungi Chrysosporium lucknowense, which has been identified as Myceliophthora thermophila (Visser et al., 2011).
[0011] Endoglucanases of the Cel7 family (EGs fam 7) are disclosed e.g. in US 5,912,157, which pertains Myceliphthora endoglucanase and its homologues and applications thereof in detergent, textile, and pulp. US 6,071,735 describes cellulases exhibiting high endoglucanase activity in alkaline conditions. Uses as detergent, in pulp and paper, and textile applications are discussed. US 5,763,254 discloses enzymes from strains of Humicola, Fusarium and Myceliopthora degrading cellulose/hemicellulose and having a carbohydrate binding module homologous to the region A of T.reesei.
[0012] W02004078919 discloses purified glycosyl hydrolase family 7 (Cel7A) enzymes from Penicillium funiculosum, which demonstrate a high level of specific performance when formulated with an endoglucanase and tested on pretreated corn stover.
[0013] Haakana etal., (2004) describes the cloning and sequencing of three genes encoding cellulases Cel45A, Cel7A and Cel7B from Melanocarpus albomyces. These cellulases work well in biostoning, with lower backstaining compared to T. reesei. WO9714804 discloses Cel7A family enzymes from Melanocarpus albomyces and its applications in textile and detergent industry. Voutilainen et a!., (2008) describes novel GH7 family cellobiohydrolases from the thermophilic fungi Acremonium thermophilum, Thermoascus auranticus and Chaetomium thermophilum active on insoluble polymeric substrates and participating in the rate limiting step in the hydrolysis of cellulose.
[0014] US 5,393,670 describes the DNA, vectors and transformed host encoding Trichoderma reesei endoglucanase I.
[0015] There is a continuous need for new methods of degrading cellulosic substrates, in particular lignocellulosic substrates, and for new enzymes and enzyme mixtures, which enhance the efficiency of the degradation. There is also a need for enzymes and processes, which are versatile and which work not only at moderate temperatures but also at high temperatures, thus increasing the reaction rates and enabling the use of high biomass consistency leading to high sugar and ethanol concentrations. This approach may lead to significant savings in energy and investment costs. The high temperature also decreases the risk of contamination during hydrolysis. The present invention aims to meet at least part of these needs.
BRIEF DESCRIPTION OF THE INVENTION
[0016] It is an object of the present invention to provide novel polypeptides having endoglucanase activity and polynucleotides encoding the polypeptides. The novel polypeptides may have improved specific activity and/or improved thermostability. The novel polypeptides may also have versatile applications. A further object of the present invention is to provide new enzymes and enzyme preparations, which enhance the efficiency of the cellulosic degradation. Especially the object of the invention is to provide new enzymes having endoglucanase activity. Another object of the present invention is to provide a method for treating cellulose material with an improved enzyme or enzyme preparation.
[0017] The objects of the invention are achieved by novel polypeptides of GH family 7 (Cel7) obtained from Acremonium thermophilium ALK04245.
[0018] The present invention provides a polypeptide having endoglucanase activity and comprising an amino acid sequence having at least 57% identity to SEQ ID NO:7 (EG_A) or a fragment or variant thereof having endoglucanase activity.
[0019] The invention further provides an isolated polynucleotide selected from the group consisting of: 1. a) a polynucleotide comprising the coding sequence as shown in SEQ ID NO: 5; 2. b) a polynucleotide encoding a polypeptide of claim 1; 3. c) a polynucleotide encoding a fragment of a polypeptide encoded by a polynucleotide of a) or b), wherein said fragment has endoglucanase activity; and 4. d) a polynucleotide comprising a nucleotide sequence which is degenerate to the nucleotide sequence of a polynucleotide of a) or b); or the complementary strand of such a polynucleotide.
[0020] The invention is also directed to a vector, which comprises said polynucleotide and a host cell comprising said vector. Escherichia coli strains having accession number DSM 25492, DSM 25657, and DSM 25655 are also included in the invention.
[0021] A further object of the invention is to provide a method of producing said polypeptide having endoglucanase activity, the method comprising the steps of transforming a host cell with an expression vector encoding said polypeptide, and culturing said host cell under conditions enabling expression of said polypeptide, and optionally recovering and purifying said polypeptide.
[0022] Other objects of the invention are the enzyme preparations comprising at least one of the novel polypeptides and the use of said enzyme preparations and polypeptides in biofuel, biomass hydrolysis, starch, textile, detergent, pulp and paper, food, feed or beverage industry.
[0023] The invention also provides a method for treating cellulosic material with an endoglucanase or an enzyme preparation comprising said endoglucanase, wherein the method comprises the following steps: 1. i) reacting the cellulosic material with said endoglucanase or the enzyme preparation comprising said endoglucanase 2. ii) obtaining at least partially hydrolyzed cellulosic material.
[0024] Specific embodiments of the invention are set forth in the dependent claims. Other objects, details and advantages of the present invention will become apparent from the following drawings, detailed description and examples.
[0025] The novel endoglucanase applicable in the method is capable of hydrolysing cellulosic materials at moderate to elevated temperatures, particularly in combination with other enzymes used in hydrolysis of cellulosic or lignocellulosic materials.
[0026] Endoglucanases obtainable from Acremonium thermophilum ALK04245 are particularly useful in hydrolysing and degrading cellulosic material. The enzymes are kinetically very effective over a broad range of temperatures, and although they have high activity at standard hydrolysis temperatures, they are also very efficient at high temperatures. This makes them extremely well suited for varying cellulosic substrate hydrolysis processes carried out both at conventional temperatures and at elevated temperatures. In the conventional separate hydrolysis and fermentation process (SHF) the temperature of enzymatic hydrolysis is typically higher than that of fermentation. The use of thermostable enzymes in the hydrolysis offer potential benefits, such as higher reaction rates at elevated temperatures, reduction of enzyme load due to higher specific activity and stability of enzymes, increased flexibility with respect to process configuration and decreased contamination risk. The general robustness of thermostable enzymes compared to mesophilic ones also increases the recyclability of enzymes in the industrial process.
BRIEF DESCRIPTION OF THE DRAWINGS
[0027] In the following the invention will be described in greater detail by means of preferred embodiments with reference to the attached drawings, in which
Figure 1 schematically shows the expression cassettes used in the transformation of Trichoderma reesei protoplasts for overproducing the recombinant Acremonium thermophilum ALK04245 EG/Cel7 proteins (EG_A and EG_B), Acremonium thermophilum ALK04245 EG/Cel7 + Trichoderma reesei EGI/Cel7B_linker-CBM (EG_A+EGI-CBM and EG_B+EGI-CBM) and Acremonium thermophilum ALK04245 EG/Cel7 + Trichoderma reesei CBHI/Cel7A_linker-CBM (EG_A+CBHI-CBM and EG_B+CBHI-CBM) fusion proteins. The Acremonium thermophilum ALK04245 cel7legl genes (egl_A and egl_B) and cef7legl-CBM fusion genes Acremonium thermophilum ALK04245 cel7/egl + Trichoderma reesei cel7S/eg/fJinker-CBM (eg/_A+eg/f-CBM and eg/-B+eg/1-CBM) and Acremonium thermophilum ALK04245 cel7legl + Trichoderma reesei cel7A/cbh1_linker-CBM {egl_A+cbh1 -CBM and egl_B+cbh1 -CBM) were under the control of T. reesei cel7Alcbh1 promoter (p cbh1) and the termination of the transcription was ensured by using T. reesei cel7Alcbh1 terminator sequence (t cbh1). The amdS gene was included as a transformation marker.
Figure 2 shows results from hydrolysis of steam exploded corn fibre performed with enzyme mixtures comprising the EG/Cel7 endoglucanases of the invention. The corn fibre substrate was hydrolyzed using different enzyme mixtures at a dosage of 0.5 mg of protein per g of total solids all at 37°C and 55°C in high dry matter conditions. The compositions of the enzyme mixtures; basis enzyme mixture (MIXTURE 1) and compositions comprising the EG_A and EG_B, are described in more detail in Example 5. Samples from five different tubes were taken after 48 hours hydrolysis time and quantified by HPLC, in which the concentration of glucose was determined. The concentration of glucose is presented.
Figure 2A shows the hydrolysis results of steam exploded corn fibre performed at 37°C with a basis enzyme mixture (MIXTURE 1) supplemented with the EG_A (MIXTURE 1_EG_A) or EG_B (MIXTURE 1_EG_B).
Figure 2B shows the hydrolysis results of steam exploded corn fibre performed at 55°C with a basis enzyme mixture (MIXTURE 1) supplemented with the EG_A (MIXTURE 1_EG_A).
Figure 3 shows results from hydrolysis of steam exploded corn fibre performed with enzyme mixtures comprising the EG_A+EGI-CBM and EG_B+EGI-CBM fusion protein of the invention. The corn fibre substrate was hydrolyzed using different enzyme mixtures at a dosage of 0.5 mg of protein per g of total solids all at 37°C with both low and high dry matter conditions and at 55°C with high dry matter conditions. The compositions of the enzyme mixtures, a basis enzyme mixture (MIXTURE 1) and compositions comprising the EG_A+EGI-CBM and EG_B+EGI-CBM, are described in more detail in Example 5. Samples from five different tubes were taken after 48 hours hydrolysis time and quantified by HPLC, in which the concentration of glucose was determined. The concentration of glucose is presented.
Figure 3A shows the hydrolysis results of steam exploded corn fibre performed at 37°C with low dry matter conditions and with a basis enzyme mixture (MIXTURE 1) supplemented with the EG_A+EGI-CBM or EG_B+EGI-CBM.
Figure 3B shows the hydrolysis results of steam exploded corn fibre performed at 37°C with high dry matter conditions and with a basis enzyme mixture (MIXTURE 1) supplemented with the EG_B+EGI-CBM.
Figure 3C shows the hydrolysis results of steam exploded corn fibre performed at 55°C with high dry matter conditions and with a basis enzyme mixture (MIXTURE 1) supplemented with the EG_B+EGI-CBM.
Figure 4 shows results from hydrolysis of steam exploded corn fibre performed with enzyme mixtures comprising the EG_A+CBHI-CBM and EG_B+CBHI-CBM fusion protein of the invention. The corn fibre substrate was hydrolyzed using different enzyme mixtures at a dosage of 0.5 mg of protein per g of total solids all at 37°C with both low and high dry matter conditions and at 55°C with high dry matter conditions. The compositions of the enzyme mixtures; a basis enzyme mixture (MIXTURE 1) and compositions comprising the EG_A+CBHI-CBM and EG_B+CBHI-CBM, are described in more detail in Example 5. Samples from five different tubes were taken after 48 hours hydrolysis time and quantified by HPLC, in which the concentration of glucose was determined. The concentration of glucose is presented.
Figure 4A shows the hydrolysis results of steam exploded corn fibre performed at 37°C with low dry matter conditions and with a basis enzyme mixture (MIXTURE 1) supplemented with the EGA+CBHI-CBM or EG_B+CBHI-CBM.
Figure 4B shows the hydrolysis results of steam exploded corn fibre performed at 37°C with high dry matter conditions and with a basis enzyme mixture (MIXTURE 1) supplemented with the EG A+CBHI-CBM or EG_B+CBHI-CBM.
Figure 4C shows the hydrolysis results of steam exploded corn fibre performed at 55°C with high dry matter conditions and with a basis enzyme mixture (MIXTURE 1) supplemented with the EG_B+CBHI-CBM.
SEQUENCE LISTING
[0028] SEQ ID NO: 1 Sequence of the oligonucleotide primer egl9 SEQ ID NO: 2 Sequence of the oligonucleotide primer egl11 SEQ ID NO: 3 Sequence of the PCR fragment obtained from Acremonium thermophilum ALK04245 (CBS 116240) using the primers egl9 and egl11. SEQ ID NO: 4 Sequence of the PCR fragment obtained from Acremonium thermophilum ALK04245 (CBS 116240) using primers egl9 and egl11. SEQ ID NO: 5 The nucleotide sequence of the Acremonium thermophilum ALK04245 (CBS 116240) egl_A gene.
SEQ ID NO: 6 The nucleotide sequence of the Acremonium thermophilum ALK04245 (CBS 116240) egl_B gene. SEQ ID NO: 7 The deduced amino acid sequence of the Acremonium thermophilum ALK04245 (CBS 116240) EG A. SEQ ID NO: 8 The deduced amino acid sequence of the Acremonium thermophilum ALK04245 (CBS 116240) EG_B. SEQ ID NO: 9 Sequence of the oligonucleotide primer egl50 SEQ ID NO: 10 Sequence of the oligonucleotide primer CBM_1 SEQ ID NO: 11 Sequence of the oligonucleotide primer CBM_2 SEQ ID NO: 12 Sequence of the oligonucleotide primer CBM_17 SEQ ID NO: 13 Sequence of the oligonucleotide primer egl64 SEQ ID NO: 14 Sequence of the oligonucleotide primer CBM_4 SEQ ID NO: 15 Sequence of the oligonucleotide primer CBM_5 SEQ ID NO: 16 Sequence of the oligonucleotide primer CBM_18 SEQ ID NO: 17 Sequence of the PCR fragment obtained from a plasmid containing the full-length Acremonium thermophilum ALK04245 egl_A gene using primers egl50 and CBM_1. SEQ ID NO: 18 Sequence of the PCR fragment obtained from a plasmid containing the full-length Acremonium thermophilum ALK04245 egl_B gene using primers egl64 and CBM 4. SEQ ID NO: 19 Sequence of the PCR fragment obtained from plasmid containing the Trichoderma reesei egl1 gene using primers CBM 2 and CBM_17. SEQ ID NO: 20 Sequence of the PCR fragment obtained from plasmid containing the Trichoderma reesei egl1 gene using primers CBM 5 and CBM18. SEQ ID NO: 21 The nucleotide sequence of the Acremonium thermophilum ALK04245 (CBS 116240) egl_A + Trichoderma reesei eg 11-CBM fusion gene. SEQ ID NO: 22 The nucleotide sequence of the Acremonium thermophilum ALK04245 (CBS 116240) eg IB + Trichoderma reesei eg 11-CBM fusion gene. SEQ ID NO: 23 The deduced amino acid sequence of the Acremonium thermophilum ALK04245 (CBS 116240) EG_A+ Trichoderma reesei EGI-CBM fusion protein. SEQ ID NO: 24 The deduced amino acid sequence of the Acremonium thermophilum ALK04245 (CBS 116240) EG_B + Trichoderma reesei EGI-CBM fusion protein. SEQ ID NO: 25 The nucleotide sequence of the Acremonium thermophilum ALK04245 (CBS 116240) egl_A + Trichoderma reesei cbh1-CBM fusion gene. SEQ ID NO: 26 The nucleotide sequence of the Acremonium thermophilum ALK04245 (CBS 116240) eg/_B + Trichoderma reesei cbh1-CBM fusion gene. SEQ ID NO: 27 The deduced amino acid sequence of the Acremonium thermophilum ALK04245 (CBS 116240) EG_A + Trichoderma reese/'CBHI-CBM fusion protein. SEQ ID NO: 28 The deduced amino acid sequence of the Acremonium thermophilum ALK04245 (CBS 116240) EG_B + Trichoderma reeseiCBHI-CBM fusion protein.
DETAILED DESCRIPTION OF THE INVENTION
[0029] Cellulose is the major structural component of higher plants. It provides plant cells with high tensile strength helping them to resist mechanical stress and osmotic pressure. Cellulose is a β-1,4^Ιυο8η composed of linear chains of glucose residues joined by P-1,4-glycosidic linkages. Cellobiose is the smallest repeating unit of cellulose. In cell walls cellulose is packed in variously oriented sheets, which are embedded in a matrix of hemicellulose and lignin. Hemicellulose is a heterogeneous group of carbohydrate polymers containing mainly different glucans, xylans and mannans. Hemicellulose consists of a linear backbone with β-1,4-linked residues substituted with short side chains usually containing acetyl, glucuronyl, arabinosyl and galactosyl. Hemicellulose can be chemically cross-linked to lignin. Lignin is a complex cross-linked polymer of variously substituted p-hydroxyphenylpropane units that provides strength to the cell wall to withstand mechanical stress, and it also protects cellulose from enzymatic hydrolysis.
[0030] "Cellulose" or "cellulosic material" as used herein, relates to any material comprising cellulose, hemicellulose and/or lignocellulose as a significant component. Examples of cellulosic material include textile fibers derived e.g. from cotton, flax, hemp, jute and the manmade cellulosic fibers as modal, viscose and lyocel.
[0031] "Lignocellulose" is a combination of cellulose and hemicellulose and polymers of phenol propanol units and lignin. It is physically hard, dense, and inaccessible and the most abundant biochemical material in the biosphere. "Lignocellulosic material" means any material comprising lignocellulose. Such materials are for example: hardwood and softwood chips, wood pulp, sawdust and forestry and wood industrial waste; agricultural biomass as cereal straws, sugar beet pulp, corn fibre, corn stover and corn cobs, sugar cane bagasse, stems, leaves, hulls, husks, and the like; waste products as municipal solid waste, newspaper and waste office paper, milling waste of e.g. grains; dedicated energy crops (e.g., willow, poplar, swithcgrass or reed canarygrass, and the like). Preferred examples are corn stover, switchgrass, cereal straw, sugarcane bagasse and wood derived materials.
[0032] Cellulosic material is degraded in nature by a number of various organisms including bacteria and fungi which produce enzymes capable of hydrolyzing carbohydrate polymers.
Degradation usually requires different cellulases acting sequentially or simultaneously. Degradation of more complex cellulose containing substrates requires a broad range of various enzymes. For example hemicellulose is degraded by hemicellulases, like xylanases and mannanases. Hemicellulase is an enzyme hydrolysing hemicellulose.
[0033] "Cellulolytic enzymes" are enzymes having "cellulolytic activity", which means that they are capable of hydrolysing cellulosic substrates or derivatives thereof into smaller saccharides. Cellulolytic enzymes thus include both cellulases and hemicellulases. Cellulases as used herein include (1) endoglucanases (EG, EC 3.2.1.4) which cut internal beta-1,4-glucosidic bonds; (2) exoglucanases or cellobiohydrolases (CBH, EC 3.2.1.91) that cut the disaccharide cellobiose from the reducing or non-reducing end of the crystalline cellulose polymer chain; (3) beta-1,4-glucosidases (BG, EC 3.2.1.21) which hydrolyze the cellobiose and other short cello-oligosaccharides to glucose. The CAZY (carbohydrate active enzymes) classification system collates glycosyl hydrolase (GH) enzymes into families according to sequence similarity, which have been shown to reflect shared structural features. In addition to this cellulases can be classified to various glycosyl hydrolase families according their primary sequence, supported by analysis of the three dimensional structure of some members of the family (Henrissat 1991, Henrissatand Bairoch 1993, 1996).
[0034] T. reesei has a well-known and effective cellulase system containing two CBH's, two major and several minor EG's and several BG's. T. reesei CBHI (Cel7A) cuts sugar from the reducing end of the cellulose chain, has a C-terminal cellulose binding module (CBM) and may constitute up to 60% of the total secreted protein. T. reesei CBHII (Cel6A) cuts sugar from the non-reducing end of the cellulose chain, has an N-terminal cellulose binding module and may constitute up to 20% of the total secreted protein. Endoglucanases EGI (Cel7B), and EGV (Cel45A) have a cellulose binding module (CBM) in their C-terminus, EGII (Cel5A) has an N-terminal CBM and EGIII (Cel12A) does not have a cellulose binding module at all. CBHI, CBHII, EGI and EGII are so called "major cellulases" of Trichoderma comprising together 80-90% of total secreted proteins. It is known to a man skilled in the art that an enzyme may be active on several substrates and enzymatic activities can be measured using different substrates, methods and conditions. Identifying different cellulolytic activities is discussed for example in van Tilbeurgh et al. 1988.
[0035] Many fungal hydrolases are modular proteins, and all of them contain a catalytic domain (CD)/core expressing cellulolytic activity. In addition to the CD, hydrolases may contain a carbohydrate binding module, also named as cellulose binding domain (CBD), which can be located either at the N- or C-terminus of the catalytic domain. CBM mediates the binding of the cellulase to crystalline cellulose but has little or no effect on cellulase hydrolytic activity of the enzyme on soluble substrates. These two domains are typically connected via a flexible and highly glycosylated linker region.
[0036] Glycoside hydrolase family 7 (GH7) comprises enzymes with several known activities, especially endoglucanase and cellobiohydrolase. "Endoglucanases (EG)" are enzymes that cut internal glycosidic bonds of the cellulose chain. They are 1,4-beta-D-glucan 4- glucanohydrolases and catalyze endohydrolysis of 1,4-beta-D-glycosidic linkages in polymers of glucose such as cellulose and derivatives thereof. Some endoglucanases have a naturally occurring cellulose binding domain, while others do not. Some endoglucanases have also xylanase activity (Bailey et al., 1993).
[0037] The present invention is based on studies, which attempted to find novel GH7 family endoglucanases which would improve hydrolysis efficiency of cellulosic substrates and which could be used for versatile applications. The identification of the novel enzymes was done using known molecular biology methods. The basic methods are described, for example, in Sambrook and Russel, 2001. Two GH family 7 (Cel7) endoglucanases referred as EG_Aand EG_B were obtained (Table 1).
Table 1. The EG/Cel7 endoglucanases of the invention
[0038] The novel EG/Cel7 endoglucanases according to the present invention are obtainable from Acremonium sp. preferably from Acremonium thermophilium and more preferably from strain having the characteristics of strain ALK04245 deposited as CBS 116240. "Obtainable from" means that they can be obtained from said species, but it does not exclude the possibility of obtaining them from other sources. In other words they may originate from any organism including plants. Preferably they originate from microorganisms e.g. bacteria or fungi. The bacteria may be for example from a genus selected from Bacillus, Azospirillum and Streptomyces. More preferably the enzyme originates from fungi (including filamentous fungi and yeasts), for example from a genus selected from the group consisting of Thermoascus, Acremonium, Chaetomium, Achaetomium, Thielavia, Aspergillus, Botrytis, Chrysosporium, Collybia, Fomes, Fusarium, Humicola, Hypocrea, Lentinus, Melanocarpus, Myceliophthora, Myriococcum, Neurospora, Penicillium, Phanerochaete, Phlebia, Pleurotus, Podospora, Polyporus, Rhizoctonia, Scytalidium, Pycnoporus, Talaromyces, Trametes and Trichoderma.
[0039] The novel EG/Cel7 polypeptides of the invention having endoglucanase activity preferably comprise an amino acid sequence having at least 57% identity to SEQ ID NO:7 (EG_A) or at least 58% identity to SEQ ID NO:8 (EG_B) or a fragment or variant thereof having endoglucanase activity. According to one embodiment of the invention, the polypeptide has at least 60, 65, 70, 75, 80, 85, 90, 95, 98 or 99% identity to SEQ ID NO: 7 or 8 or to its enzymatically active fragment. The EG/Cel7 polypeptides having endoglucanase activity are also herein simply called endoglucanases.
[0040] By the term "identity" is here meant the global identity between two amino acid sequences compared to each other from the first amino acid encoded by the corresponding gene to the last amino acid. The identity of the full-length sequences is measured by using EMBOSS Needle Needleman-Wunsch global alignment program at EBI (European Bioinformatics Institute) http://www.ebi.ac.uk/Tools/Dsa/emboss needle/ with the following parameters: BLOSUM50, Gap open 10.0, Gap extend 0.5. The algorithm is described in Needleman and Wunsch (1970). The man skilled in the art is aware of the fact that results using Needleman-Wunsch algorithm are comparable only when aligning corresponding domains of the sequence and using the same parameters in each comparison. Consequently comparison of e.g. cellulase sequences including CBM or signal sequences with sequences lacking those elements cannot be done.
[0041] By the term "fragment having endoglucanase activity" is meant any fragment of a defined sequence that has endoglucanase activity. In other words a fragment having endoglucanase activity may be the mature protein part of the defined sequence, or it may be only a fragment of the mature protein part, provided that it still has endoglucanase activity.
[0042] The novel polypeptides may also be variants of said polypeptides. A "variant" may be a polypeptide that occurs naturally e.g. as an allelic variant within the same strain, species or genus, or it may have been generated by mutagenesis. It may comprise amino acid substitutions, deletions or insertions, but it still functions in a substantially similar manner to the enzymes defined above i.e. it comprises a fragment having endoglucanase activity.
[0043] The cellulolytic polypeptides are usually produced in the cell as pre-polypeptides comprising a signal sequence that is cleaved off during secretion of the protein. They may also be further processed during secretion both at the N-terminal and/or C-terminal end to give a mature, enzymatically active protein. "A polypeptide having endoglucanase activity" thus denotes that the polypeptide may be either in immature or mature form, preferably it is in mature form, i.e. the processing has taken place. In addition, the "mature form" means an enzyme which has been cleaved from its carrier protein in fusion constructions.
[0044] The EG/Cel7 endoglucanases of the present invention are preferably recombinant enzymes, which may be produced in a generally known manner. A polynucleotide fragment comprising the endoglucanase gene is isolated, the gene is inserted under a strong promoter into an expression vector, the vector is transformed into suitable host cells and the host cells are cultivated under conditions provoking production of the enzyme. Methods for protein production by recombinant technology in different host systems are well known in the art (Sambrook et al„ 1989; Coen, 2001; Gellissen, 2005). Preferably the enzymes are produced as extracellular enzymes that are secreted into the culture medium, from which they can easily be recovered and isolated.
[0045] The recombinant polypeptide may be a fused polypeptide in which another polypeptide is fused at the N-terminus or the C-terminus of the polypeptide of the present invention. A fused polypeptide is produced by fusing a polynucleotide encoding another polypeptide to a polynucleotide of the present invention. Techniques for producing fusion polypeptides are known in the art, and include ligating the coding sequences encoding the polypeptides so that they are in frame and that expression of the fused polypeptide is under control of the same promoter(s) and terminator.
[0046] The polypeptide of the invention disclosed in SEQ ID NO: 7 naturally contains a C-terminal CBM and a linker. "A linker" is a flexible and highly glycosylated region which connects the catalytic domain and the CBM. As used herein the CBM includes also the linker region. In one embodiment of the invention this native linker and CBM region may be replaced by, e.g. a linker and a CBM from a Trichoderma or Chaetomium species, preferably from Trichoderma reesei. In a preferred embodiment the natural CBM of endoglucanase EG_A has been replaced with a CBM of T. reesei endoglucanase I (EGI/Cel7B) or a CBM of T. reesei cellobiohydrolase I (CHBI/Cel7A) and preferably the resulting fusion protein comprises an amino acid sequence having SEQ ID NO: 23 (EG A+EGI-CBM) or SEQ ID NO:27 (EGA+CHBI-CBM) (Table 2).
[0047] The polypeptide disclosed in SEQ ID NO: 8 does not naturally contain a CBM and a linker. In one embodiment this polypeptide may be attached by, e.g. a linker and a CBM regions from a Trichoderma or Chaetomium species, preferably from Trichoderma reesei. In a preferred embodiment the linker and CBM of T. reesei EGI/Cel7B or T. reesei CHBI/Cel7A has been genetically attached to the endoglucanase EG_B and preferably the resulting fusion protein comprises an amino acid sequence having SEQ ID NO: 24 (EG_B+EGI-CBM) or SEQ ID NO: 28 (EG_B+CHBI-CBM) (Table 2).
Table 2. EG+CBM recombinant fusion proteins of the invention
[0048] Further, within the scope of the invention are recombinant fusion proteins comprising an amino acid sequence having at least 55% sequence identity to SEQ ID NO: 23 (EGA+EGI-CBM) or SEQ ID NO: 27 (EG_A+CHBI-CMB). According to one embodiment the fusion protein comprises an amino acid sequence having at least 60, 65, 70, 75, 80, 85, 90, 95, 98 or 99% identity to SEQ ID NO: 23 or 27 or to its enzymatically active fragment.
[0049] The EG/Cel7 endoglucanases of the invention may be used without a signal sequence and/or CBM or the signal sequence and/or CBM may derive from different enzymes of the above mentioned microorganisms or different microorganism or be synthetically or recombinantly incorporated to the catalytic domain of the above enzymes.
[0050] The invention relates to novel polynucleotides which may comprise a nucleotide sequence of SEQ ID NO: 5 or a sequence encoding a novel polypeptide as defined above, including complementary strands thereof. "Polynucleotide" as used herein refers to both RNA and DNA, and it may be single stranded or double stranded. Further the polynucleotide may be degenerate as a result of the genetic code to any one of the sequences as defined above. This means that different codons may code for the same amino acid.
[0051] One embodiment is an EG/Cel7 endoglucanase which is encoded by a polynucleotide sequence included in SEQ ID NO: 21,22, 25 or 26.
[0052] The polynucleotide may also be a fragment of said polynucleotides comprising at least 17 nucleotides, preferably at least 20, 30, 40 or 50 nucleotides. According to one embodiment the polynucleotide is having a sequence set forth as SEQ ID NO 1,2, 9, 10, 13 or 14.
[0053] According to another embodiment of the invention, the polynucleotide comprises a gene similar to that included in a microorganism having accession number DSM 25492, DSM 25657, DSM 25655 (Table 1, Table 2).
[0054] The EG/Cel7 endoglucanase of the invention may be produced from a recombinant expression "vector" comprising the nucleic acid molecule, which encodes the endoglucanase as characterized above, operably linked to regulatory sequences capable of directing the expression of a gene encoding said endoglucanase in a suitable host. Said regulatory sequences may be homologous or heterologous to the production organism or they may originate from the organism, from which the gene encoding the endoglucanase polypeptide of the invention is isolated. The expression vector may further comprise marker genes for selection of the transformant strains or the selection marker may be introduced to the host in another vector construct by co- transformation.
[0055] The production "host" can be any homologous or heterologous organism capable of expressing the cellulolytic enzyme. Preferably the host is a microbial cell, more preferably a fungus. Most preferably the host is a filamentous fungus. Preferred hosts for producing the cellulolytic enzymes are in particular strains from the genus Trichoderma or Aspergillus. Preferably the recombinant host is modified to express and secrete cellulolytic enzymes as its main activity or one of its main activities. This can be done by deleting genes encoding major homologous secreted enzymes e.g. the four major cellulases of Trichoderma and by integrating heterologous genes to a locus with high expression and production levels.
[0056] The present invention relates also to a method for producing a polypeptide having endoglucanase activity, said method comprising the steps of transforming a host cell with an expression vector encoding said polypeptide, and culturing said host cell under conditions enabling expression of said polypeptide, and optionally recovering and purifying said polypeptide. The production medium may be a medium suitable for growing the host organism and containing inducers for efficient expression.
[0057] The polypeptides of the present invention may be isolated, which in the present context may simply mean that the cells and cell debris have been removed from the culture medium containing the polypeptide. Conveniently the polypeptides are isolated e.g. by adding anionic and/or cationic polymers (flocculants) to the spent culture medium to enhance precipitation of cells and cell debris. The medium is then filtrated using an inorganic filtering agent and a filter to remove the precipitants formed. After this the filtrate is further processed using a semipermeable membrane to remove excess of salts, sugars and metabolic products. The polypeptides can also be purified or concentrated by crystallization.
[0058] The novel EG/Cel7 polypeptides obtained by the method of the present invention may be components of an enzyme preparation. The term "enzyme preparation" denotes to a composition comprising at least one of the novel polypeptides described herein. The polypeptide in the enzyme preparation may be a recombinant protein having endoglucanase activity and comprising an amino acid sequence having at least 57% sequence identity to EG A having SEQ ID NO: 7. In one embodiment the enzyme preparation comprises a polypeptide which is a recombinant fusion protein having at least 55% sequence identity to SEQ ID NO: 23 (EG_A+EGI-CBM) or SEQ ID NO: 27 (EG_A+CHBI-CBM). According to one embodiment the enzyme preparation comprises a polypeptide having at least 60, 65, 70, 75, 80, 85, 90, 95, 98 or 99% identity to SEQ ID NO: 23 or 27, or at least 65, 70, 75, 80, 85, 90, 95, 98 or 99% identity SEQ ID NO: 24 or 28 or to its enzymatically active fragment. Preferably the enzyme preparation comprises at least cellobiohydrolase, endoglucanase, beta-glucosidase and optionally xylanase.
[0059] The enzyme preparation may also comprise at least one further enzyme selected from a group of cellobiohydrolase, endoglucanase, beta-glucosidase, beta-glucanase, xyloglucanase, xylanase, beta-xylosidase, mannanase, beta-mannosidase, oglucuronidase, acetyl xylan esterase, a-arabinofuranosidase, α-galactosidase, pectinase, involving endo- and exo-a-L-arabinases, endo- and exo-galactoronase, endopectinlyase, pectate lyase, and pectinesterase, phenol esterase, ligninase involving lignin peroxidase, manganese-dependent peroxidase, H202-generating enzyme and laccase with or without a mediator. The enzyme preparation may contain any combination of these enzymes and EG/Cel7 endoglucanases of the invention, but the enzymes are not limited to those described herein. They can for example also be commercially available enzyme preparations.
[0060] The enzyme preparation may be in the form of liquid, powder or granulate. It may be a filtrate containing one or more cellulolytic enzymes. Preferably the enzyme preparation is a spent culture medium. "Spent culture medium" refers to the culture medium of the host comprising the produced enzymes. Preferably the host cells are separated from the said medium after the production. The enzyme preparation or composition may also be a "whole culture broth" obtained, optionally after inactivating the production host(s) or microorganism(s) without any biomass separation, down-stream processing or purification of the desired cellulolytic enzyme(s). In the consolidated bioprocess the enzyme composition or at least some of the enzymes of the enzyme composition may be produced by the fermentative microorganism.
[0061] The enzyme preparation may contain the enzymes in at least partially purified and isolated form. It may even essentially consist of the desired enzyme or enzymes. The culture medium with or without host cells may be utilized as an enzyme preparation as such without further purification, because the endoglucanase proteins can be secreted into the culture medium, and they display activity in the ambient conditions of the spent culture medium.
[0062] In addition to the endoglucanase proteins, the enzyme preparation of the invention may contain additives, such as mediators, stabilizers, buffers, preservatives, surfactants and/or culture medium components. Preferred additives are such, which are commonly used in enzyme preparations intended for a particular application.
[0063] In the method of the present application for treating cellulosic material the cellulosic material is reacted with the EG/Cel7 endoglucanase of the invention or the enzyme preparation comprising said endoglucanase, whereby at least partially hydrolyzed cellulosic material is obtained. The enzymes are added in an enzymatically effective amount either simultaneously e.g. in the form of an enzyme mixture, or sequentially, or are produced by the fermented micro-organisms or as combination of these methods [0064] The EG/Cel7 endoglucanases of the invention are capable of hydrolyzing cellulosic material at moderate to elevated temperatures. The term "moderate temperature" or "conventional temperature" in context of the present invention means temperatures commonly used in cellulose hydrolysis and corresponding to the optimal temperatures or thermal stabilities of the enzymes used in such processes. Thus, the terms refer to temperature ranges from 30°C to 45°C. The term "elevated temperature" or "high temperature" refers to temperature ranges from 45°C to 70°C. In short term hydrolysis processes the enzymes may be effective even up to 80°C. Enzymes active or stable at such elevated temperature ranges are also called "thermostable" or "thermophilic" enzymes. The endoglucanases of the invention are used preferably at temperatures between 35°C and 60°C. More preferably they are used at temperatures between 37°C and 55°C, most preferably at temperatures between 45°C and 55°C.
[0065] The EG/Cel7 endoglucanases of the invention show improved hydrolysis results both at moderate and elevated temperatures when compared to the enzyme mixtures containing conventional T.reesei endoglucanase EGI/Cel7B. Different enzyme mixtures and combinations may be used to suit different process conditions. Elevated temperatures are known to enhance the hydrolysis of crystalline cellulose present in cellulosic or lignocellulosic materials, thus reducing the total amount of enzymes needed in hydrolysis or reducing the required hydrolysis time. Also, since at elevated temperatures the viscosity of the lignocellulosic substrate is decreased, thermostable enzymes make it possible to work at higher solid loadings and save in investment costs.
[0066] Particularly improved results at elevated temperatures may be obtained when using an enzyme preparation comprising recombinant endoglucanase EG_A having at least 57% sequence identity to SEQ ID NO: 7. In one preferred embodiment of the invention the enzyme preparation comprises Acremonium thermophilum cellobiohydrolase CBHI/Cel7A, Acremonium thermophilum cellobiohydrolase CBHII/Cel6A, Thermoascus aurantiacus endoglucanase EGII/Cel5A, Acremonium thermophilum beta-glucosidase pG/Cel3A, Thermoascus aurantiacus xylanase XynlOAand endoglucanase EG_Aof the present invention.
[0067] In one embodiment of the invention the enzyme preparation comprises recombinant EG_A fusion proteins having SEQ ID NOs: 23 or 27. With these endoglucanases improved hydrolysis results also at moderate and elevated temperatures with high dry matter conditions were obtained.
[0068] In one preferred embodiment of the invention the enzyme preparation comprises cellobiohydrolase CBHI/Cel 7A, cellobiohydrolase CBHII/Cel6A, endoglucanase EGII/Cel5A, beta-glucosidase 3G/Cel3A, xylanase XynlOA and endoglucanase fusion protein comprising an amino acid sequence having SEQ ID NO: 24.
[0069] In another preferred embodiment of the invention the enzyme preparation comprises cellobiohydrolase CBHI/Cel 7A, cellobiohydrolase CBHII/Cel6A, endoglucanase EGII/Cel5A, beta-glucosidase 3G/Cel3A, xylanase XynlOA and endoglucanase fusion protein comprising an amino acid sequence having SEQ ID NO: 27.
[0070] As will be understood by one skilled in the art, any quantity of the cellulosic material may be used in the hydrolysis. The term "dry matter" as used herein refers to total solids, both soluble and insoluble, of cellulosic material. The hydrolysis of cellulosic material may be conducted at low dry matter conditions, whereby by low dry matter is <15%. In other embodiments enzymatic hydrolysis may be conducted at high dry matter content, preferably > 15% dry matter.
[0071] The method for treating cellulosic material with the endoglucanases of the invention is especially suitable for producing fermentable sugars from lignocellulosic material. The fermentable sugars may then be fermented by yeast into ethanol, and used as fuel. They can also be used as intermediates or raw materials for the production of various chemicals or building blocks for the processes of chemical industry, e.g. in so called biorefinery. The lignocellulosic material may be pretreated before the enzymatic hydrolysis to disrupt the fiber structure of cellulosic substrates and make the cellulose fraction more accessible to the cellulolytic enzymes. Current pretreatments include mechanical, chemical or thermal processes and combinations thereof. The material may for example be pretreated by steam explosion or acid hydrolysis.
[0072] The novel EG/Cel7 endoglucanases may be applied in any process involving cellulolytic enzymes, such as in biofuel, biomass hydrolysis, starch, textile, detergent, pulp and paper, food, feed or beverage industry, and especially in hydrolysing cellulosic material for the production of biofuel comprising ethanol. In the pulp and paper industry they may be used to modify cellulosic fibre for example in treating kraft pulp, mechanical pulp, or recycled paper.
[0073] The invention is described by the following non-limiting examples. It will be obvious to a person skilled in the art that, as the technology advances, the inventive concept can be implemented in various ways. The invention and its embodiments are not limited to the examples described but may vary within the scope of the claims.
EXAMPLES
Example 1. Cloning of the endoglucanase (cel7legl) genes [0074] Standard molecular biology methods were used in the isolation and enzyme treatments of DNA(e.g. isolation of plasmid DNA, digestion of DNAto produce DNA fragments), in E. coli transformations, sequencing etc. The basic methods used were either as described by the enzyme, reagent or kit manufacturer or as described in the standard molecular biology handbook, e. g. Sambrook and Russell (2001). Isolation of genomic DNA was performed as described in detail by Raeder and Broda (1985).
[0075] After screening of several strains from Roal Oy culture collection one thermophilic fungal strain was selected for cloning. The probes for cloning the cefflegl genes (egl_A and egl_B) from Acremonium thermophilum ALK04245 were synthesized by PCR. Degenerate oligos were planned basing on the alignment of the previously published amino acid sequences of GH family 7 endoglucanase (EGI) proteins. The sequences of the primers are shown in Table 3 (SEQ ID NOs: 1-2).
Table 3. The oligonucleotides used as PCR primers to amplify probes for screening of egl genes from Acremonium thermophilum ALK04245
[0076] The probes were amplified by PCR with primers described in Table 3 using the genomic DNA as a template in the reactions. The PCR mixtures of Acremonium thermophilum ALK04245 contained 1x F-511 Buffer for Dynazyme DNA Polymerase (Finnzymes, Finland), 0.2 mM dNTP Mix (Fermentas, Finland), 1 pM each primer, 3% DMSO (Finnzymes, Finland) 2-4 units of F-501L Dynazyme II DNA Polymerase (Finnzymes, Finland) and 1-2 pg of the corresponding genomic DNA. The conditions for the PCR reactions were the following: 5 min initial denaturation at 95°C, followed by 28 cycles of 1 min at 95°C, 30 sec annealing at 50°C, 30 sec extension at 72°C and a final extension at 72°C for 5 min.
[0077] Primer combinations described in Table 3 produced specific DNA products having the expected sizes (according to calculations basing on published cel7legl sequences). The DNA products were isolated and purified from the PCR reaction mixtures and cloned into pCR®4Blunt-TOPO® vectors according to the manufacturer's instructions (Invitrogen, USA). The inserts were characterized by sequencing and by performing Southern blot hybridizations to the genomic DNA digested with several restriction enzymes. The PCR fragments which were chosen to be used as probes for gene cloning from Acremonium thermophilum ALK04245 strain are presented in Table 4.
Table 4. The primers used in the PCR reactions, probes chosen for screening of the egl genes from Acremonium thermophilum ALK04245. The genomic template DNA and the name of the plasmid containing the probe fragment are shown.
[0078] The deduced amino acid sequences from both of these PCR fragments had similarity to the published EG/Cel7B sequences (BLAST program, version 2.2.9 at NCBI, National Center for Biotechnology Information).
[0079] Acremonium thermophilum ALK04245 genomic DNA was digested with several restriction enzymes for Southern blot analysis. The probes for the hybridization were the 566 bp (SEQ ID NO: 3 for gene egl_A) and 431 bp (SEQ ID NO: 4 for gene egl_B) EcoRI fragments, cut from the plasmids pALK2698 and pALK2699, respectively. The above probes were labeled by using digoxigenin according to supplier's instructions (Roche, Germany). Hybridizations were performed over night at 65°C. After hybridization the filters were washed 2 x 5 min at RT using 2 x SSC - 0.1% SDS followed by 2 x 15 min at 65°C using 0.1 x SSC - 0.1% SDS. The E. coli strains RF8831 including the plasmid pALK2698, and RF8832 including the plasmid pALK2699, were deposited to the DSM collection under the accession numbers DSM 25490 and DSM 25491, respectively.
[0080] From the genomic DNA of Acremonium thermophilum ALK04245, approximate 6.6 kb H/nc/lll-digested fragment was hybridized using dioxigenin-labeled 566 bp EcoRI fragment from the pALK2698 as a probe. Correspondingly, about 9.5 kb EcoRI-digested fragment was hybridized with dioxigeninlabeled 431 bp EcoRI fragment of the pALK2699 also from the genomic DNA of the Acremonium thermophilum ALK04245. The hybridizing genomic DNA fragments were isolated from the pool of the digested genomic fragments based on their size. The genomic fragments were isolated from agarose gel and were cloned into pBluescript II KS+ (Stratagene, USA) vectors cleaved with either Hind\\\ (gene A) or EcoRI (gene B). Ligation mixtures were transformed into Escherichia coli XL10-Gold cells (Stratagene) and plated on LB (Luria-Bertani) plates containing 50-100 pg/ml ampicillin. The E. coli colonies were screened for positive clones using colonial hybridization with the pALK2698 and pALK2699 inserts as probes in the hybridization conditions correspondingly to that described above for Southern blot analyses. Several positive clones were collected from the plates. They were shown by restriction digestion to contain inserts of expected sizes. The full-length gene encoding the Acremonium thermophilum ALK04245 EG_A (egl_A, SEQ ID NO: 5) was sequenced from the 6.6 kb Hind III insert and the plasmid containing this insert was named pALK3152. The E. coli strain RF8939 including the plasmid pALK3152 was deposited to the DSM collection under the accession number DSM 25492. The gene encoding the Acremonium thermophilum ALK04245 protein A was named as egl_A. Correspondingly, the full-length gene encoding the another Acremonium thermophilum ALK04245 EG_B (eg/_B. SEQ ID NO: 6) was sequenced from the 9.5 kb EcoRI insert and the plasmid containing this insert was named pALK3153. The E. coli strain RF8974 including the plasmid pALK3153 was deposited to the DSM collection under the accession number DSM 25493. The gene encoding the Acremonium thermophilum ALK04245 protein B was named as eg/_B. The relevant information on the genes and the deduced protein sequences (SEQ ID NOs: 5-8) are summarized in Table 5 and Table 6, respectively. Table 5. The summary on the eg/ genes isolated from Acremonium thermophilum ALK04245
Table 6. The summary of the amino acid sequences deduced from the egl genes sequences from Acremonium thermophilum ALK04245
[0081] The comparison of the deduced EG sequences from Acremonium thermophilum ALK04245 to the databases are shown in Table 7.
Table 7. The highest identity sequences to the deduced EG_A and EG_B amino acid sequences from Acremonium thermophilum ALK04245. The full-length amino acid sequences including the signal sequences were aligned. The database searches were performed at htt://www.ebi.ac.uk/Tools/sss/fasta/ and http://www.ebi.ac.uk/Tools/psa/emboss needle/ using FASTA (EMBL-EBI, FASTA - Protein Similarity Search, UniProt Knowledgebase + NR Patent Proteins Level-1, BLOSUM50, Gap open -10, Gap extend -2), and EMBOSS Needle (EMBL-EBI, EMBOSS-Needle -Pairwise Sequence Alignment, BLOSUM50, Gap open 10, Gap extend 0.5) for determining the degree of identity.
Example 2. Production of recombinant EG/Cel7 proteins in Trichoderma reesei [0082] Expression plasmids were constructed for production of recombinant EG/Cel7 (EG A and EG_B) proteins from Acremonium thermophilum ALK04245 in Trichoderma reesei. The expression plasmids constructed are listed in Table 8. The recombinant cel7legl genes (egl_A and eg/_B), including their own signal sequences, were fused to the T. reesei cel7Alcbh1 promoter by PCR. The transcription termination was ensured by the T. reesei ce!7Aicbh1 terminator and the A. nidulans amdS marker gene was used for selection of the transformants as described in Paloheimo etal. (2003). The linear expression cassettes (Fig. 1) were isolated from the vector backbones after Not\ digestions and were transformed into T. reesei protoplasts. The host strain used does not produce any of the four major T. reesei cellulases (CBHI, CBHII, EGI, EGII). The transformations were performed as in Penttilå etal. (1987) with the modifications described in Karhunen et al. (1993), selecting acetamide as a sole nitrogen source (amdS marker gene). The transformants were purified on selection plates through single conidia prior to sporulating them on PD.
Table 8. The expression cassettes constructed to produce EG_A and EG_B recombinant proteins from Acremonium thermophilum ALK04245 in Trichoderma reesei. The overall structure of the expression cassettes was as described in Fig. 1.
[0083] The EG/Cel7 production of the transformants was analyzed from the culture supernatants of the shake flask cultivations. The transformants were inoculated from the PD slants to shake flasks containing 50 ml of complex lactose-based cellulase inducing medium (Joutsjoki et al. 1993) buffered with 5% KH2PO4. The EG/Cel7 protein production of the transformants was analyzed from the culture supernatants after growing them for 7 days at 30°C, 250 rpm. Heterologous production of recombinant proteins was analyzed by SDS-PAGE with subsequent Coomassie staining. The genotypes of the chosen transformants were confirmed by using Southern blot analyses in which several genomic digests were included and the respective expression cassette was used as a probe.
[0084] The best producing transformants were chosen to be cultivated in laboratory scale bioreactors. The transformants were cultivated in lab bioreactors at 28°C in cellulase inducing complex medium 3-4 days with pH control 4.2 ± 0.2 (NH3/H3PO4) to obtain material for the application tests. The supernatants were recovered by centrifugation and filtering through EK filters (Pall SeitzSchenk Filtersystems GmbH, Bad Kreuznach, Germany).
Example 3. Production of the recombinant Acremonium thermophilum ALK04245 EG + Trichoderma reesei CBM fusion proteins in T. reesei [0085] The atypical linker and CBM regions of Acremonium thermophilum ALK04245 EG/Cel7_A (EG_A) were removed and the core region of the protein was fused to linker and CBM of Trichoderma reesei EGI/Cel7B (= EGI-CBM). Acremonium thermophilum ALK04245 EG_B was also fused to linker and CBM of Trichoderma reesei EGI/Cel7B. For that purpose, the coding sequence of the core regions of EG_A and EG_B and the coding sequence of the linker and the CBM of Trichoderma reesei EGI/Cel7B were synthesized by PCR using following primers: SEQ ID NO: 9 (forward sequence for EG_A core primer) SEQ ID NO: 10 (reverse sequence for EG_A core primer) SEQ ID NO: 11 (forward sequence of EGI-CBM primer) SEQ ID NO: 12 (reverse sequence of EGI-CBM primer) SEQ ID NO: 13 (forward sequence of EG_B primer) SEQ ID NO: 14 (reverse sequence of EG_B primer) SEQ ID NO: 15 (forward sequence of EGI-CBM primer) SEQ ID NO: 16 (reverse sequence of EGI-CBM primer).
[0086] The PCR reaction mixture for synthesizing the DNA sequence encoding for EG_Acore contained 1x Phusion HF Reaction Buffer (Finnzymes, Finland), 7.5 mM MgCl2, 0.2 mM dNTPs, 1 μΜ of each primer, 3% DMSO, 4 units of Phusion DNA Polymerase (Finnzymes, Finland), and approximately 50 ng / 200 pi of template DNA, containing full-length egl_A gene from Acremonium thermophilum ALK04245. The conditions for the PCR reaction were the following: 30 sec initial denaturation at 98°C, followed by 24 cycles of 10 sec at 98°C, 30 sec annealing at 52.5°C (± 7.5°C gradient), 30 sec extension at 72°C and final extension at 72°C for 7 min. The specific DNA fragment in PCR reaction was obtained at annealing temperature range from 45°C to 60°C. The compatible restriction sites were created to the synthesized core fragment of Acremonium thermophilum ALK04245 egl_A for the fusion to Trichoderma reesei cel7legl1 linker-CBM region and ligation to expression vector.
[0087] The PCR reaction mixtures for synthesizing the DNA sequences encoding for EG_B and EGI_CBMs contained 1x F-511 Buffer for Dynazyme DNA Polymerase (Finnzymes, Finland), 0.2 mM dNTP Mix (Fermentas, Finland), 1 μΜ each primer, 3% DMSO (Finnzymes, Finland) 2-4 units of F-501L Dynazyme II DNA Polymerase (Finnzymes, Finland) and approximately 50 ng / 200 μΙ of template DNA, containing the full-length egl_B gene from Acremonium thermophilum ALK04245 and the full-length cel7legl1 gene from Trichoderma reesei. The conditions for the PCR reactions were the following: 5 min initial denaturation at 95°C, followed by 28 cycles of 1 min at 95°C, 30 sec annealing at 52.5°C (±7.5°C gradient), 30 sec extension at 72°C and a final extension at 72°C for 5 min. The specific DNA fragments in PCR reactions were obtained at annealing temperature range from 45°C to 60°C. The fragments created by primer combinations described above were then digested with compatible restriction enzymes and ligated together. The fragments amplified by PCR, primer combinations and compatible restriction enzymes are described in Table 9.
Table 9. PCR fragments amplified from Acremonium thermophilum ALK04245 egl_A and eg/ B and from Trichoderma reesei egl1 full length genes
[0088] The newly created fragments were then further ligated into expression vectors. The PCR amplified fragments in the expression plasmids were confirmed by sequencing (SEQ ID NO: 17 for eg/_A core, SEQ ID NO: 18 for eg/_B core, SEQ ID NO: 19 for eg/f-CBM, SEQ ID NO: 20 for eg/f-CBM, SEQ ID NO: 21 for eg/_A+eg/f-CBM and SEQ ID NO: 23 for EG_A+EGI-CBM, SEQ ID NO: 22 for egl_B+egl1-CBM and SEQ ID NO: 24 for EGB+EGI-CBM). The fusion genes were also further cloned into Bluescript II KS+ vector for the patent deposition to DSM collection under following accession numbers: DSM 25657 = E. coli strain RF10076 including the plasmid pALK3179 which contains the fusion gene egl_A+egl1 -CBM. DSM 25658 = E. coli strain RF10077 including the plasmid pALK3180 which contains the fusion gene eg/_B+eg/f-CBM.
[0089] Acremonium thermophilum ALK04245 EG_A and EG_B were also fused to linker and CBM of Trichoderma reesei CBHI/Cel7A (= CBHI-CBM). The egl + cbh1-CBM fusion genes were designed such a way that the atypical linker and CBM regions of Acremonium thermophilum ALK04245 eg/_A would be removed and the remaining core region will be fused to linker and CBM region of Trichoderma reesei cel7Alcbh1 (= cbh1-CBM). Trichoderma reesei cel7Alcbh1 linker and CBM regions will be fused straight to Acremonium thermophilum cel7/egl_B, since it did not contain natural CBM. The fusion genes were ordered as synthetic constructs (from GenScript, USA). The synthetic fusion genes in plasmids were confirmed by sequencing (SEQ ID NO: 25 for egl_A+cbh1 -CBM, SEQ ID NO: 26 for eg/_B+cbh7-CBM, SEQ ID NO: 27 for EG_A+CBHI-CBM and SEQ ID NO: 28 for EG_B+CBHI-CBM). The E. coli strains RF9587 including the plasmid pALK3167, which contains the fusion gene egl_A+cbh1 -CBM in pUC57, and RF9588 including the plasmid pALK3168 which contains the fusion gene egl_B+cbh1-CBM in pUC57, were deposited to the DSM collection under the accession numbers DSM 25655 and DSM 25656, respectively.
[0090] The expression plasmids were constructed for production of recombinant EG+EGI-CBM (EG_A+EGI-CBM and EG B+EGI-CBM) and EG+CBHI-CBM (EGA+CBHI-CBM and EG_B+CBHI-CBM) fusion proteins (SEQ ID NO: 23-24 and 27-28 corresponding nucleic acid SEQ ID NO: 21-22 and 25-26). The expression plasmids constructed are listed in Table 10. The constructed eg/_A+eg/f-CBM, egl_B+egl1-CBM, egl_A+cbh1-CBM and egl_B+cbh1-CBM fusion genes were fused to the T. reesei cbh1 (cel7A) promoter in the expression vector. The transcription termination was ensured by the T. reesei cel7A terminator and A. nidulans amdS marker gene was used for selection of the transformants as described in Paloheimo et al. (2003). The linear expression cassettes (Fig. 1) were isolated from the vector backbones after Not\ digestions and were transformed into T. reesei protoplasts. The host strain used does not produce any of the four major T. reesei cellulases (CBHI, CBHII, EGI, EGII). The transformations were performed as in Penttilå et al. (1987) with the modifications described in Karhunen et al. (1993), selecting acetamide as a sole nitrogen source (amdS marker gene). The transformants were purified on selection plates through single conidia prior to sporulating them on PD.
Table 10. The expression cassettes constructed to produce EG+CBM fusion proteins in Trichoderma reesei. The overall structure of the expression cassettes was as described in Fig. 1.
[0091] The EG+CBM fusion protein production of the transformants was analyzed from the culture supernatant of the shake flask cultivations. The transformants were inoculated from the PD slants to shake flasks containing 50 ml of complex lactose-based cellulase inducing medium Joutsjoki et al. 1993) buffered with 5% KH2PO4. The EG+CBM fusion protein production of the transformants was analyzed from the culture supernatants after growing them for 7 days at 30°C, 250 rpm. Heterologous production of recombinant proteins was analyzed by SDS-PAGE with subsequent Coomassie staining. The genotypes of the chosen transformants were confirmed by using Southern blot analyses in which several genomic digests were included and the respective expression cassette was used as a probe.
[0092] The best producing transformants were chosen to be cultivated in laboratory scale bioreactors. The transformants were cultivated in lab bioreactors at 28°C in cellulase inducing complex medium 3-4 days with pH control 4.2 ± 0.2 (NH3/H3PO4) to obtain material for the application tests. The supernatants were recovered by centrifugation and filtering through EK filters (Pall SeitzSchenk Filtersystems GmbH, Bad Kreuznach, Germany).
Example 4. Hydrolysis of corn fibre substrate with enzyme preparations comprising a recombinant EG/Cel7 and EG/Cel7+CBM endoglucanases [0093] Steam exploded corn fibre was suspended in 0.05 M sodium citrate buffer, pH 4.8. The final weight of the hydrolysis mixture was 1 g of which the total solids concentration was either 5% (w/w) or 17% (w/w). The substrate was hydrolyzed using different enzyme mixtures at a dosage of 0.5 mg of protein per g of total solids in 2 ml reaction tubes. The protein contents of the enzyme components were determined using the Pierce BCA assay kit (Thermo Scientific) with Bovine Serum Albumin (Thermo Scientific) as standard. The reaction tubes were agitated in a linear-shaking water bath 1086 from GFL, adjusted in different temperatures. For each sample point, a sample of 0.5 ml was taken from duplicate reaction tubes, and centrifuged, the supernatant was boiled for 20 minutes to terminate the enzymatic hydrolysis, and analyzed for reaction products from the hydrolysis. Seven separate mixture combinations were prepared (a basis mixture MIXTURE 1, MIXTURE 1_EG_A and MIXTURE 1_EG_B, MIXTURE 1_EG_A+EGI-CBM, MIXTURE 1_EG_B+EGI-CBM, MIXTURE 1_EG_A+CBHI-CBM and MIXTURE 1_EG_B+CBHI-CBM) with different EG/Cel7 replacements.
[0094] A basis mixture of different cellulases was prepared using the following components:
Mesophilic EGI/Cel7B preparation containing recombinant Trichoderma reesei EGI/Cel7B. CBHI/Cel7A preparation containing recombinant Acremonium thermophilum ALK04245 CBHI/Cel7A (W02007071818). CBHII/Cel6A preparation containing recombinant Acremonium thermophilum ALK04245 CBHII/Cel6A (WO2011080317). EGII/Cel5A preparation containing recombinant Thermoascus aurantiacus ALK04242 EGII/Cel5A (W02007071818) with genetically attached CBM of Trichoderma reesei EGII/Cel5A. β-glucosidase preparation containing Acremonium thermophilum ALK04245 β-glucosidase/Cel3A (W02007071818).
Xylanase preparation containing Thermoascus aurantiacus ALK04242 Xyn10A xylanase (W02007071818).
[0095] All cellulases were heterologously produced as monocomponents in Trichoderma reesei host strain having cellulase-free background (the genes encoding the four major cellulases CBHI/Cel7A, CBHII/Cel6A, EGI/Cel7B and EGII/Cel5A were deleted). Crude culture supernatants were used in the mixture. The enzyme components were combined as follows to prepare a basis mixture: cellobiohydrolase CBHI/Cel7A preparation 60%, cellobiohydrolase CBHII/Cel6A preparation 15%, endoglucanase EGII/Cel5A preparation 10%, endoglucanase EGI/Cel7B preparation 8%, β-glucosidase ββ/ΰβΙ3Α preparation 4% and xylanase Xyn10A preparation 3%. This enzyme mixture was designated as MIXTURE 1.
[0096] For testing EG endoglucanase performance in the hydrolysis with MIXTURE 1, 8% of the EGI/Cel7B endoglucanase component of MIXTURE 1 was replaced by: EG/Cel7 preparation containing recombinant Acremonium thermophilum ALK04245 EG_Aor EG/Cel7 preparation containing recombinant Acremonium thermophilum ALK04245 EG_B. The mixtures containing Acremonium thermophilum ALK04245 EG/Cel7 proteins EG_A or EG_B were designated as MIXTURE 1_EG_A and MIXTURE 1_EG_B, respectively.
[0097] For testing EG+CBM fusion protein performance in the hydrolysis with MIXTURE 1, 8% of the EGI/Cel7B endoglucanase component of MIXTURE 1 was replaced by: EG+CBM preparation containing recombinant Acremonium thermophilum ALK04245 EG_A genetically fused to Trichoderma reese/EGI-CBM (EGA+EGI-CBM) or EG+CBM preparation containing recombinant Acremonium thermophilum ALK04245 EG_B genetically fused to Trichoderma reesei EGI-CBM (EG B+EGI-CBM) or EG+CBM preparation containing recombinant Acremonium thermophilum ALK04245 EG_A genetically fused to Trichoderma reesei CBHI-CBM (EG_A+CBHI-CBM) or EG+CBM preparation containing recombinant Acremonium thermophilum ALK04245 EG_B genetically fused to Trichoderma reesei CBHI-CBM (EG_B+CBHI-CBM).
[0098] The mixtures containing the fusion proteins EG_A+EGI-CBM, EG_B+EGI-CBM, EG_A+CBHI-CBM or EG_B+CBHI-CBM were designated as MIXTURE 1_EG_A+EGI-CBM, MIXTURE 1_EG_B+EGI-CBM, MIXTURE 1_EG_A+CBHI-CBM and MIXTURE 1_EG_B+CBHI-CBM, respectively.
[0099] For all the mixtures the hydrolysis was performed at 37°C and 55°C. Samples were taken from the hydrolysis after 48 h, quantified by HPLC and the concentration of glucose was determined (Fig. 2).
[0100] The results show better performance of the MIXTURE 1_EG_Aand MIXTURE 1_EG_B at 37°C with high (17%) dry matter content. The amount of glucose released from corn fibre substrate was found to increase 9% for the MIXTURE 1_EG_A and 7% for the MIXTURE 1_EG_B compared to the MIXTURE 1 (Fig. 2A). At 55°C and with high dry matter content MIXTURE 1_EG_A was found to increase glucose yield 34% compared to the enzyme mix MIXTURE 1 (Fig. 2B).
[0101] For the EG+EGI-CBM (EG_A+EGI-CBM and EG_B+EGI-CBM) fusion proteins the results show better performance of the MIXTURE 1_EG_A+EGI-CBM and MIXTURE 1_EG_B+EGI-CBM at 37°C in low (5%) dry matter conditions. The amount of glucose released from corn fibre substrate was found to increase 30% for the MIXTURE 1_EG_A+EGI-CBM and 35% for the MIXTURE 1_EG_B+EGI-CBM compared to the MIXTURE 1 (Fig. 3A). In high dry matter conditions MIXTURE 1_EG_B+EGI-CBM mix was found to increase glucose yield 7% at both temperatures 37°C and 55°C compared to the enzyme mix MIXTURE 1 (Fig. 3B and 3C).
[0102] For the EG+CBHI-CBM (EG_A+CBHI-CBM and EG_B+CBHI-CBM) fusion proteins the results show better performance of the MIXTURE 1_EG_A+CBHI-CBM and MIXTURE 1_EG_B+CBHI-CBM at 37°C in both low and high dry matter conditions. The amount of glucose released from corn fibre substrate was found to increase 22% in low dry matter conditions and 5% in high dry matter conditions for the MIXTURE 1_EG_A+CBHI-CBM, and 8% in low dry matter conditions and 5% in high dry matter conditions for the MIXTURE 1_EG_B+CBHI-CBM compared to the MIXTURE 1 (Fig. 4A and 4B). In high dry matter conditions at 55°C MIXTURE 1_EG_B+CBHI-CBM was found to increase glucose yield 4% compared to the MIXTURE 1 (Fig. 4C).
REFERENCES
[0103]
Badger, PC. (2002) Ethanol from cellulose: a general review. In Trends in new crops and new uses. J. Janick and A. Whipkey (eds.). ASHS Press, Alexandria, VA, USA, pp. 17-21.
Bailey M., Siika-aho M., Valkeajårvi A. and Penttilå M. (1993) Hydrolytic properties of two cellulases of Trichoderma reesei expressed in yeast. Biotehnol. Appl. Biochem 17: 65-76
Bendtsen J.D., Nielsen H., von Heijne G. and Brunak S. (2004) Improved prediction of signal peptides: SignalP3.0. J. Mol. Biol. 340: 783-795.
Coen, D.M. (2001) The polymerase chain reaction. In: Ausubel, F.M., Brent, R., Kingston, R.E., More, D.D., Seidman, J.G., Smith, K. and Struhl, K. (eds.) Current protocols in molecular biology. John Wiley &amp; Sons. Inc., Hoboken, USA.
Gellissen, G. (ed.) (2005) Production of recombinant proteins. Novel microbial and eukaryotic expression systems. Wiley-VCH Verlag Gmbh&amp;Co. Weinheim, Germany.
Haakana H., Miettinen-Oinonen A., Joutsjoki V., Måntylå A., Suominen P, and Vehmaanperå J. (2004) Cloning of cellulase genes from Melanocarpus albomyces and their efficient expression in Trichoderma reesei. Enz. Microbiol. Technol. 34: 159-167.
Henrissat B. (1991) A classification of glycosyl hydrolases based on amino acid sequence similarities. Biochem. J. 280: 309-316.
Henrissat B. and Bairoch A. (1993) New families in the classification of glycosyl hydrolases based on amino acid sequence similarities. Biochem. J. 293: 781-788.
Henrissat B. and Bairoch A. (1996).Updating the sequence-based classification of glycosyl hydrolases. Biochem. J. 316: 695-696
Hong J., H. Tamaki, K. Yamamoto, and Kumagai H. (2003a) Cloning of a gene encoding a thermo-stabile endo-β-1,4-glucanase from Thermoascus aurantiacus and its expression in yeast. Biotech. Letters 25: 657-661.
Hong J., Tamaki H., Yamamoto K. and Kumagai H. (2003b) Cloning of a gene encoding thermostable cellobiohydrolase from Thermoascus aurantiacus and its expression in yeast. Appl. Microbiol. Biotechnol. 63: 42-50.
Joutsjoki, V. V., Torkkeli T.K. and Nevalainen K. Μ. H. (1993) Transformation of Trichoderma reesei with the Hormoconis resinae glucoamylase P (gamP) gene: production of a heterologous glucoamylase by Trichoderma reesei. Curr. Genet. 24: 223-228.
Karhunen T, Måntylå A., Nevalainen K. Μ. H. and Suominen P L. (1993) High frequency one-step gene replacement in Trichoderma reesei. I. Endoglucanase I overproduction. Mol. Gen. Genet. 241: 515-522.
Miettinen-Oinonen A, J Londesborough, V Joutsjoki, R Lantto, J Vehmaanpera. 2004. Three cellulases from Melanocarpus albomyces with application in textile industry. Enzyme Microb. Technol. 34: 332-341.
Needleman S. and Wunsch C. (1970) A general method applicable to the search for similarities in the amino acid sequence of two proteins. Journal of Molecular Biology 48, 443-453.
Nielsen H., Engelbrecht J., Brunak S. and von Heijne G. (1997) Identification of prokaryotic and eykaryotic signal peptides and prediction of their cleavage sites. Protein Engineering 10: 1-6.
Nielsen H and A Krogh. 1998. Prediction of signal peptides and signal anchors by a hidden Markov model. In: Proceedings of the Sixth international Conference of Intelligent Systems for Molecular Biology (ISMB 6), AAAI Press, Menlo Park, California, pp. 122-130.
Paloheimo M., Måntylå A., Kallio J., and Suominen R (2003) High-yield production of a bacterial xylanase in the filamentous fungus Trichoderma reesei requires a carrier polypeptide with an intact domain structure. Appl. Env. Microbiol. 69: 7073-7082.
Penttilå M., Nevalainen H., Råtto M., Salminen E. and Knowles J. (1987) A versatile transformation system for the cellulolytic filamentous fungus Trichoderma reesei. Gene 61:155-164.
Raeder U. and Broda P. (1985) Rapid preparation of DNAfrom filamentous fungi. Lett. Appl. Microbiol. 1: 17-20.
Sambrook J., Fritsch E. F. and Maniatis T. (1989) Molecular cloning, a laboratory manual. Cold Spring Harbor Laboratory, New York, US.
Sambrook J. and Russell D. W. (2001) Molecular cloning, a laboratory manual. Cold Spring Harbor Laboratory, New York, US.
Tuohy M., Walsh J., Murray P, Claeyssens M., Cuffe M., Savage A. and Coughan M. (2002) Kinetic parameters and mode of action of cellobiohydrolases produced by Talaromyces emersonii. Biochem. Biophys. Acta 1596: 366-380 (abstract).
Visser H., Joosten V, Punt P.J., Gusakov A.V., Olson P.T., Joosten R., Bartels J., Visser J., Sinitsyn A.R, Emalfarb M.A., Verdoes J.C. and Wery J. (2011) Development of a mature fungal technology and production platform for industrial enzymes based on a Myceliophthora thermophila isolate, previously known as Chrysosporium lucknowense C1. Industrial Biotechnology. June 7(3): 214-223.
Voutilainen S., Puranen T., Siika-aho, M., Lappalainen A., Alapuranen M., and Kallio J., (2008) Cloning, expression, and characterization of novel thermostable family 7 cellobiohydrolases. Biotechnology and Bioengineering Vol.101 Nr.3, 515-528.
Van Tilbeurgh, H., Loonties, R, de Bruyne, C. and Claeyssens, M. (1988) Fluorogenic and chromogenic glycosides as substrates and ligands of carbohydrases. Methods Enzymol. 160:45-59.
SEQUENCE LISTING
[0104] <110> Roal Oy <120> Improved endoglucanases for treatment of cellulosic material
<130> 2112028PC <160> 28 <170> BiSSAP 1.0
<210> 1 <211 >20 <212> DNA <213> Artificial Sequence <220> <221 > source <222> 1..20 <223> /mol_type="DNA" /note="primer" /organism="Artificial Sequence" <400> 1 tgytgyaayg aratggayat 20
<210> 2 <211 > 17 <212> DNA <213> Artificial Sequence <220> <221 > source <222> 1..17 <223> /mol_type="DNA" /note="primer" /organism="Artificial Sequence" <400>2 swrtcn a rcc a rttcat 17
<210> 3 <211 >566 <212> DNA <213> Acremonium thermophilum <220> <221 > source <222> 1..566 <223> /mol_type="DNA" /organism="Acremonium thermophilum" <400>3 aatfccgccct tgtatcgagc cagttcatga agtcgcccgc cgagttccag atgctgaaga 60 cgagfeaccat gccgcgctcc agcgaggcgc ccatgccatc catgcctccc aggcggaggt 120 actgcgaggc attggcggcg gtgcagaagg cttgcgtcac ggatccgccg ctggogtcgt 180 tcacgttggg cgacgtagag tcgagctggt tcgtgttcga ggggacgggc tcgttetgga 240 tgaccttgcc gtcctggacg tacaggcgac ggatctcggc gagcgtcccc gaaccggtag 300 aggcattgcc agagtcggtc aaaaactgag taaacaccgt gaagggtttg gtggtgtooa 360 cggtcttgcc ggggccgtaa aagtctttga ggcqcgcgcg gaatgggttg acgccgqaae: 420 cgtcgtggte gcatacgccg gcgtcggtct tgccaoactc gtcgccagag cacaagaagg 480 agoGcaegec gctgeaggta tgaggcgtca tggoggtagc catggcattt gcttcccaga 540 tgteeafccte attgcagcaa agggcg 566
<210> 4 <211 > 431 <212> DNA <213> Acremonium thermophilum <220> <221 > source <222> 1..431 <223> /mol_type="DNA" /organism-'Acremonium thermophilum" <400>4 aattcgccct ttgctgcaat gagatggaca tcctggaggc caactcgcag gccaacgcct 60 tcaccccgca ccoctgcagg aaoggcaact gcgacaagag cggctgcgge tacaacccct 120 acgccaaggg cttccgcaac tactggggcc ccggtatgac cgtcgacacg tccaagccct 180 tcaccatcac cacccagttc gtcgagagtg gcgggtccct cagccagatc gtgcgcaggt 240 acgcccaggg cgggcggccg atccccagcg cggcgtcggg cggcgacgtc atccaggcca 300 gcgcctgcgg cacggcgcag agcttcggcg ggctggccga gatgggcgcg gcgctggacc 360 gcggcatggt cctcgtcttc agcatctgga acgacccggc ccagaacatg aactggctcg 420 actgaagggc g 431
<210> 5 <211> 1520 <212> DNA <213> Acremonium thermophilum <220> <221 > source <222> 1..1520 <223> /mol_type="genomic DNA" /organism="Acremonium thermophilum" <400> 5 atggctttca gaetggcagc tgcgctgcct ttccttcttc tcctggcaca cgcacaaaag 60 attggeaetg eggtteeega ggteeatceg aagetggtga eecagaectg caecactteg 120 ggaggttgtg ttgaccgcca aacattgctc gtgacggatg ccctctcgcg tccgttccac 180 aaggttggag atccttcggt gccgtgcacg ccgttgaaca ccaccctctg cceggaegcc 240 gccacttgtg cggcaaactg cgagctegaa ggggtggagt atggtagcct cggcgtgctt 300 accgccggce gagctgtcac tctgcggcag tacctcttcg atggcgacgc cttcacctct 360 gtcagcccgc gccrctacct gctggccgag gacggcatga aetatgagct gcLcaagcLg 420 aacggccaag agccctccta tgacgtcgac gtgtcgcagc tgggctgcgg catgaatggc 480 gocttgtacc tctccgagat ggacgcctcg gggtctagga gcgacctgaa cccagcaggt 540 gccoagtatg gcacgggota ctgcgatgcg cagtgcttta atgtgtcgtt tatcaacgga 600 ctggtatgtg ctcgtggaat tgaacgtatc cgtcatagcc aagtcaaaca tgaactaact 660 cttcacacat cttcctccga cagccgaatc tcaacaactc gggtgcctgc tgcaacgaga 720 tggatatctg ggaagcaaafc gccatggcta ccgccatgac gcctcataco tgcagcggcg 780 tgggctcctt cttgtgctct ggcgacgagt gtggcaagac cgacgccggc gtatgcgacc 840 acgacggccg cggcgccaac ccarxccgcg cgggccccaa agacttctac ggccccggca »uu agaccgtgga caccaccaaa cccttcacgg tggtgactca gtttttgacc gactctggca 960 atgcctctac cggttcgggg acgctcgccg agateegtcg cctgtacgtc caggacggca 1020 aggtcatcca gaacgagccc gtcccctcga acacgaacca gctogactct acgtcgccca 1080 acgtgaacga cgccagcggc ggatccgtga cgcaagcctt ctgcaccgcc gccaatgcct 1140 cgcagtacct ccgcctggga ggcatggatg gcatgggcgc ctcgctggag cgcggcatgg 1200 taetcgtctt cagcatctgg aactcggcgg gcgacttcat gaactggctg gacagcggcg 1260 aggccggccc gtgcagcaac acctccggcg acccggcccg catcgtggcc gagacgcccg 1320 aagtctccgt caccttctcc aacatccgct ggggagatat tgggtccacc fctcaacgcct 1380 cgtattcgcc gctgcgcaao gggacgaatg ccgctttctt gagctcgaat gcgaagagcg 1440 gtacggcgcc gcacactgct gctgcgtctg taggtgtctg ggccatgcta gccgcagtct 1500 ccgcttatct ccttctctga 1520 <210> 6 <211 > 1206
<212> DNA <213> Acremonium thermophilum <220> <221 > source <222> 1..1206 <223> /mol_type="genomic DNA" /organism="Acremonium thermophilum" <400> 6 atggcgcgcc agactcattc ctacggcatc ttggocgtcg cccttgtctt ctggcgccag 60 ctggctcacg ctcagaaccc agccgcgccc gaggtgcatc cgtcgctgcc cacctggaag 120 tgcaceacgg cgggeggttg pgtgcagcag aacacgteeg: tcgtgctcga ctgggacgcg 180 cgcgccatcg tgccgtctgg cggcggcggc ggcggcggtg gcgggtcctg caagacggcc 240 accggcggcg tcagtgccag octetgtccc gacgaggcca cctgcgocaa gaactgcgtc 300 gtccagccca tcgccaacta cacggagcac ggcgtgtcca ccgcgggcga cgccctgacc 360 ctgtaccact acgrtgcgcat caacggccaq ctgtcggccc agtcgccccg cqtctacctg 420 ctgggcccgg acggcgacta egagatgetg cacctcctgg gccgcgagct cagcttcgac 480 gtcgacctgt ccacgctgcc ctgcggogag aacggcgccc tctacctctc egagatgage 540 gccggcggcg ggcgcaaccc cgacaacacg ggeggegega gctacggctc gggctactgc 600 gacgcccagt gcceoaccct ggcgtggcgc aacggcacca tcaacaccgg ccgccagggc 660 tactgctgcg acqagatgga oatcctggag gccaactcgc aggccaacqc cttcaccccq 720 cacccctgca ggaacggcaa ctgcgacaag agcggctgcg gctacaaccc ctacgccaag 780 ggcttccgca actactgggg ccccggtatg accgtcgaca cgtccaagcc cttcaccato 840 accacccagt tegtegagag tggcgggtoc ctcagccaga tegtgogoag gtacgcccag 900 ggcgggcggc cgatccccag egeggcgtcg ggcggfigang tc.atnciaggc cagcgcctgc 96Ώ ggcacggcgc agagcttcgg cgggctggcc gagatgggcg cggcgctgga ccgcggcatg 1020 gtcctcgtct toagcatctg gaacgacccg geccagaaca tgaactggct cgaeagcggc 1080 accgcggggc cctgcagcag ogcggcgggc agtccggcca cgatccaggc gcagcatccg 1140 gatacgcacg ttgtcttctc gaacatccgc tggggcgaca ttggctcgac gacccagacg 1200 tcgtag 1206
<210> 7 <211 >479 <212> PRT <213> Acremonium thermophilum <220> <221 > SOURCE <222> 1..479 <223> /mol_type="protein" /organism="Acremonium thermophilum" <400>7
Mat Ala Phe Arg Leu Ala Ala Ala leu Pro Phe Leu Leu Leu Leu Ala 1 5 10 15
His Ala Gin Lys lie Gly Thr Ala Val Pro Glu Val His Pro Lys Leu 20 25 30
Val Thr Gin Thr Cys Thr Thr Ser Gly Gly Cys Val Asp Arg Gin Thr 35 40 45
Leu Leu Val Thr Asp Ala Leu Ser Arg Pro Phe His Lys Val Gly Asp 50 55 60
Pro Ser Val Pro Cys Thr Pro Leu Asn Thr Thr Leu Cys Pro Asp Ala 65 70 75 80
Ala Thr Cys Ala Ala Asn Cys Glu Leu Glu Gly Val Glu Tyr Gly Ser 85 90 95
Leu Gly Val Leu Thr Ala Gly Arg Ala Val Thr Leu Arg Gin Tyr Leu 100 105 110
Phe Asp Gly Asp Ala Phe Thr Ser Val Ser Pro Arg Leu Tyr Leu Leu 115 120 125
Ala Glu Asp Gly Met Asn Tyr Glu Leu Leu Lys Leu Asn Gly Gin Glu 130 135 140
Leu Ser Tyr Asp Val Asp val Ser Gin Leu Gly Cys Gly Met Asn Gly 145 150 155 160
Ala Leu Tyr Leu Ser Glu Met Asp Ala Ser Gly Ser Arg Ser Asp Leu 165 170 175
Asn Pro Ala Gly Ala Gin Tyr Gly Thr Gly Tyr Cys Asp Ala Gin Cys 180 185 190
Phe Asn Val Ser Phe lie Asn Gly Leu Pro Asn Leu Asn Asn Ser Gly 195 200 205
Ala Cys Cys Asn Glu Met Asp lie Trp Glu Ala Asn Ala Met Ala Thr 210 215 220
Ala Met Thr Pro His Thr Cys Ser Gly Val Gly Ser Phe Leu Cys Ser 225 230 235 240
Gly Asp Glu Cys Gly Lys Thr Asp Ala Gly Val Cys Asp His Asp Gly 245 250 255
Cys Gly Val Asn Pro Phé Arg Ala Gly Leu Lys Asp Phe Tyr Gly Pro 260 265 270
Gly Lys Thr Val Asp Thr Thr Lys Pro Phe Thr Val Val Thr Gin Phe 275 280 285
Leu Thr Asp Ser Gly Asn Ala Ser Thr Gly Ser Gly Thr Leu Ala Glu 290 295 300 lie Arg Arg Leu Tyr Val Gin Asp Gly Lys Val lie Gin Asn Glu Pro 305 310 315 320
Val Pro Ser Asn Thr Asn Gin Leu Asp Ser Thr Ser Pro Asn Val Asn .323 JJU 3.33
Asp Ala Ser Gly Gly Ser Val Thr Gin Ala Phe Cys Thr Ala Ala Asn 340 345 350
Ala Ser Gin Tyr Leu Arg Leu Gly Gly Met Asp Gly Met Gly Ala Ser 355 360 365
Leu Glu Arg Gly Met Val Leu Val Phe Ser Ile Trp Asn Ser Ala Gly 370 375 3180
Asp Phe Met Asn Trp Leu Asp Ser Gly Glu Ala Gly Pro Cys Ser Asn 385 390 395 400
Thr Ser Gly Asp Pro Ala Arg Ile Val Ala Glu Thr Pro Glu Val Ser 405 410 415
Val Thr Phe Ser Asn Ile Arg Trp Gly Asp Ile Gly Ser Thr Phe Asn 420 425 430
Ala Ser Ser Ser Pro Leu Arg Asn Gly Thr Asn Ala Ala Phe Leu Ser 435 440 445
Ser Asn Ala Lys Ser Gly Thr Ala Pro His Thr Ala Ala Ala Ser Val 450 455 460
Gly Val Trp Ala Met Leu Ala Ala Val Ser Ala Tyr Leu Leu Leu 465 470 475
<210>8 <211 > 401 <212> PRT <213> Acremonium thermophilum <220> <221 > SOURCE <222> 1..401 <223> /moltype-'protein" /organism-'Acremonium thermophilum" <400> 8
Met Ala Arg Gin Thr His Ser Tyr Gly Ile Leu Ala Val Ala Leu Val 1 5 10 15
Phe Trp Arg Gin Leu Ala His Ala Gin Åsn Pro Ala. Ala Pro Glu Val, 20 25 30
His Pro Ser Leu Pro Thr Trp Lys Cys Thr Thr Ala Gly Gly Cys val 35 40 45
Gin Gin Asn Thr Ser Val Val Leu Asp Trp Asp Ala Arg Ala Ile Val 50 55 60
Pro Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser Cys Lys Thr Ala 65 70 75 80
Thr Gly Gly Val Ser Ala Ser Leu Cys Pro Asp Glu Ala Thr Cys Ala 85 90 95
Lys Asn Gys Val Val Gin Pro Ile Ala AHA Tyr Thr Glu His Gly Val 100 105 110
Ser Thr Ala Gly Asp Ala Leu Thr Leu Tyr His Tyr Val Arg Ile Asn 115 120 125
Gly Gin Leu Ser Ala Gin Ser Pro Arg val Tyr Leu Leu Gly Pro Asp 130 135 140
Gly Asp Tyr Glu Met Leu His Leu Leu Gly Arg Glu Leu Ser Phe Asp 145 150 155 160
Val Asp Lou Ser Thr Leu Pro Cys Gly Glu Asn Gly Ala Lou Tyr Lou 165 170 175
Ser Glu Met Ser Ala Gly Gly Gly Arg Ash Pro Asp Asn Thr Gly Gly 180 185 190
Ala Ser Tyr Gly Ser Gly Tyr Cys Asp Ala Gin Cys Pro Thr Leu Ala 195 200 205
Trp Arg Asn Gly Thr Ile Asn Thr Gly Arg Gin Gly Tyr Cys Cys Asp 210 215 220
Glu Met Asp Ile Leu Glu Ala Asn Ser Gin Ala Asn Ala Phe Thr Pro 225 230 235 240
His Pro Cys Arg Asn Gly Asn Cys Asp Lys Ser Gly Cys Gly Tyr Asn 245 250 255
Pro Tyr Ala Lys Gly Phe Arg Asn Tyr Trp Gly Pro Gly Met Thr Val Φ-ΤΛ
Asp Thr Ser Lys Pro Phe Thr ile Thr Thr Gin Phe Val Glu Ser Gly 275 280 285
Gly Ser Leu Ser Gin Ile Val Arg Arg Tyr Ala Gin Gly Gly Arg Pro 290 295 300
He Pro Ser Ala Ala Ser Gly Gly Asp Val lie Gin Ala Ser Ala Cys 305 310 315 320
Gly Thr Ala Gin Ser Phe Gly Gly Leu Ala Glu Met Gly Ala Ala Leu 325 330 335
Asp Arg Gly Met Val Leu Val Phe Ser lie Trp Asn Asp Pro Ala Gin 340 345 350
Asn Met Asn Trp Leu Asp Ser Gly Thr Ala Gly Pro Cys Ser Ser Ala 355 360 365
Ala Gly Ser Pro Ala Thr He Gin Ala Gin His Pro Asp Thr His Val 370 375 380
Val Phe Ser Asn He Arg Trp Gly Asp He Gly Ser Thr Thr Gin Thr 385 390 395 400
Ser
<210> 9 <211 >50 <212> DNA <213> Artificial Sequence <220> <221 > source <222> 1..50 <223> /mol_type="DNA" /note="primer" /organism="Artificial Sequence" <400>9 cgagccgcgg atggctttca gactggcagc tgcgctgcct ttccttcttc 50
<210> 10 <211 >40 <212> DNA <213> Artificial Sequence <220> <221 > source <222> 1..40 <223> /mol_type="DNA" /note="primer" /organism="Artificial Sequence" <400> 10 cgggcgtctc ggccacgatg cgggccgggt cgccggaggt 40
<210> 11 <211 > 120 <212> DNA <213> Artificial Sequence <220> <221 > source <222> 1..120 <223> /mol_type="DNA" /note="primer" /organism="Artificial Sequence" <400> 11 catcgtggcc gagacgcdcg aagtctccgt caccttctcc aacatccgct ggggagatat 60 tgggtccacc ttcaacgccg qgcccccgcc cccgcctgcg tccagcacga cgttttcgac 120
<210> 12 <211> 51 <212> DNA <213> Artificial Sequence <220> <221 > source <222> 1..51 <223> /mol_type="DNA" /note="primer" /organism="Artificial Sequence" <400> 12 cgtcctcgag ctaaaggcat tgcgagtagt agtcgttgct atactggcac g 51
<210> 13 <211> 50 <212> DNA <213> Artificial Sequence <220> <221 > source <222> 1..50 <223> /mol_type="DNA" /note="primer" /organism="Artificial Sequence" <400> 13 cgagggatcc atggcgcgcc agactcattc ctacggcatc ttggccgtcg 50
<210> 14 <211 > 95 <212> DNA <213> Artificial Sequence <220> <221 > source <222> 1..95 <223> /mol_type="DNA" /note="primer" /organism="Artificial Sequence" <400> 14 gtcgagctcc tccgtgtagt cgaaaacgtc gtgctggacg caggcggggg cgggggcgcc 60 gacgtctggg tcgtcgagcc åatgtcgccd cagcg 95
<210> 15 <211 >40 <212> DNA <213> Artificial Sequence <220> <221 > source <222> 1..40 <223> /mol_type="DNA" /note="primer" /organism="Artificial Sequence" <400> 15 gaggagctcg acgacttcga gcagcccgag ctgcacgcag 40
<210> 16 <211 > 55 <212> DNA <213> Artificial Sequence <220> <221 > source <222> 1..55 <223> /mol_type="DNA" /note="primer" /organism="Artificial Sequence" <400> 16 ctcgaccggt ctaaaggcat tgcgagtagt agtcgttgct atactggcac gtagt 55
<210> 17 <211 > 1330 <212> DNA <213> Acremonium thermophilum <220> <221 > source <222> 1..1330 <223> /mol_type="DNA" /organism="Acremonium thermophilum" <400> 17 cgagccgcgg atggctttca gactggcagc tgcgctgcct ttccttctto tcctggcaca 60 cgcacaaaag attggcactg cggttcccga ggtccatccg aagctggtga cccagacctg 120 caccacLtcg ggaggLtgLg LLgaccgcca aacattgcLc glgacggatg ccctctcgcg 180 tccgttccac aaggttggag atccttcggt gocgtgcacg ccgttgaaca ccaccctotg 240 cccggacgcc gccacttgtg cggcaaactg cgagctcgaa ggggtggagt atggtagcct 300 cggcgtgctt accgccggcc gagctgtcac tctgcggcag tacctcttcg atggcgacgc 360 cttcacctct gtcagcccgc gcctctacct gctggccgag gacggcatga actatgagct 420 gctcaagctg aaeggccaag agctctccta tgacgtcgac gtgtcgcagc tgggctgcgg 480 catgaatggc gccttgtacc tctccgagat ggacgcctcg gggtctagga gcgacctgaa 540 cccagcaggt gcccagtatg gcacgggcta ctgcgatgcg cagtgcttta atgtgtcgtt 600 tatcaacgga ctggtatgtg ctcgtggaat tgaacgtatc cgtoatagcc aagtcaaaca 660 tgaactaact cttcacacat cttcctccga cagccgaatc tcaacaactc gggtgcctgc 720 tgcaacgaga tggatatctg ggaagcaaat gccatggcta ccgccatgac gcctcatacc 780 tgcagcggcg tgggctcctt cttgtgctct ggcgacgagt gtggcaagac cgacgccggc 840 gtatgcgacc acgacggttg cggcgtcaac ccattccgcg cgggcctcaa agacttctac 900 ggcecoggea agaccgtgga caceaccaaa cccttcasgg tggtgactca gtttttgacc 960 gactctggca atgcctctac eggttcgggg acgctcgccg agatccgtcg cctgtaGgto 1020 caggacggca aggtcatcca gaacgagccc gtcccctcga acacgaacca gctcgactct 1080 aegtcgccca acgtgaacga cgccagcggc ggatcegtga cgcaagcctt ctgcaccgcc 1140 gccaatgcct cgcagtacct ocgcctggga ggcatggatg gcatgggcgc ctcgctggag 1200 cgqggcatgg tacpcgtctt cagcatctgg aactcggogg gcgacttcat gaactggctg 1260 gacagcggcg aggccggccc gtgcagcaac accfcccggcg acccggcccg catcgtggcc 1320 gagaegcceg 1330
<210> 18 <211 > 1272 <212> DNA <213> Acremonium thermophilum <220> <221 > source <222> 1..1272 <223> /mol_type="DNA" /organism-'Acremonium thermophilum" <400> 18 cgagggatcc atggcgcgcc agactcattc ctacggcatc ttggcogtcg cccttgtctt 60 ctggcgccag ctggctcacg ctcagaaccc agccgcgccc gaggfcgcatc cgtcgctgcc 120 cacctggaag tgcaccacgg cgggcggttg cgtgcagcag aacacgtccg tcgtgctcga 180 ctgggacgcg cgcgccatcg tgccgtctgg cggcggcggc ggcggcggtg gcgggtcctg 240 eaagacggec aocggoggog tcagtgccag eetctgteee: gacgaggooa ectgbgBcaa 300 gaactgcgtc gtccagccca tcgccaacta cacggagcac ggegtgtcca ccgcgggcga 360 cgccctgacc ctgtaccact acgtgcgcat caacggccag etgtcggeGc agtcgccccg 420 cgtctacctg ctgggcccgg acggcgacta cgagatgctg cacctcctgg gccgcgagct 480 cagcttcgac gtcgacctgt ccacgctgcc ctgcggcgag aacggcgccc tctacctctc 540 egagafcgago gccggcggog ggcgcaaccc ogaoaaoacg ggcggcgtjga gcfeaeggefcc 600 gggctactgc gacgcccagt gccccaccct ggcgtggcgc aacggcacca tcaacaccgg 660 ccgccagggc tactgctgcg acgagatgga catcctggag gccaactcgc aggccaacgc 720 cttcaccccg cacccctgca ggaacggcaa ctgcgacaag agcggctgcg gctacaaccc 780 ctacgccaag ggcttccgca actactgggg ccccggtatg accgtcgaca cgtccaagcc 840 cttcaccatc accacceagt tcgtcgagag tggcgggtcc ctcagccaga tcgtgcgcag 900 gtacgcccag ggcgggcggc cgatccccag cgcggcgtcg ggcggcgacg tcatccaggc 960 cagcgcctgc ggcacggcgc agagcttcgg cgggctggcc gagatgggcg cggcgctgga 1020 ccgcggcatg gtcctcgtct tcagcatctg gaacgacccg gcccagaaca tgaactggct 1080 cgacagcggc accgcggggc cctgcagcag cgcggcgggc agtccggcca cgatccaggc 1140 gcagcatccg gatacgcacg ttgf.cr.tr.tr. gaaeatccgc tggggcgaca ttggctcgac 1200 gacccagacg toggcgcccc cgcccccgcc tgcgtceagc acgacgtttt cgactacacg 1260 gaggagctcg ac 1272 <210> 19 <211 > 281 <212> DNA <213> null <220> <221 > source <222> 1..281 <223> /mol_type="DNA" /organism=null <400> 19 catcgtggcp gagacgcccg aagtctccgt caccttctcc aacatcegct ggggagatat 60 tgggtccacc ttcaacgccg egccoccgcc cccgcctgcg teeagcacga ogttttcgac 120 tacacggagg agctcgacga cttcgagcag cccgagctgc acgcagactc actgggggca 180 gtgcggtgge attgggtaca gcgggtgcaa gacgtgcacg tcgggcacta cgtgccagta 240 tagcaacgac tactactcgc aatgccttta gctcgaggac g 281 <210> 20 <211>212 <212> DNA <213> null <220> <221 > source <222> 1..212 <223> /mol_type="DNA" /organism=null <400> 20 gaggagctcg acgacttcga gcagcccgag ctgcacgcag actcactggg ggcagtgcgg 60 tggcattggg tacagcgggt gcaagacgtg cacgtcgggc actacgtgcc agtatagcaa 120 cgactgttcg tatccccatg cctgacggga gtgattttga gatgctaacc gctaaaatac 180 agactactcg caatgeettt agaccggtcg ag 212
<210> 21 <211 > 1571 <212> DNA <213> Acremonium thermophilum <220> <221 > source <222> 1..1571 <223> /mol_type="DNA" /organism-'Acremonium thermophilum" <400> 21 atggctttca gactggcagc tgcgctgcct ttcattcttc tcctggcaaa cgcacaaaag 60 attggcactg cggttcccga ggtccatccg aagctggtga cccagacctg caccacttcg 120 ggaggttgtg ttgaccgcca aacattgctc gtgacggatg ccctctcgcg tccgttccac 180 aaggttggag atccttcggt gccgtgcacg ccgttgaaca ccaccctctg cccggacgcc 240 gccacttgtg cggcaaactg cgagctcgaa ggggtggagt atggtagcct cggcgtgctt 300 aocgccggec gagctgtcac tctgcggcag tacctcttcg atggcgacgc cttcacctct 360 gtcagcccgc gcctctacct gctggccgag gacggcatga actatgagct gctcaagctg 420 aacggccaag agctctccta tgacgtcgac gtgtcgcagc tgggctgcgg catgaatggc 480 gccttgtacc tctccgagat ggacgcctcg gggtctagga gcgacctgaa eccagcaggt 540 gcccagtatg gcacgggcta ctgcgatgcg cagtgcttta atgtgtcgtt tatcaacgga 600 ctggtatgtg ctcgtggaat tgaacgtatc cgtcatagcc aagtcaaaca tgaactaact 660 cttcacacat cttcctccga cagccgaatc tcaacaactc gggtgcctgc tgcaacgaga 720 tggatatctg ggaagcaaat gccatggcta ccgccatgac gcctcatacc tgcagcggcg 780 tgggctcctt cttgtgctet ggogacgagt gtggcaagac cgacgccggo gtatgcgacc 840 acgacggttg cggcgtcaac ccattccgcg cgggcctcaa agacttctac ggccccggca 900 agaccgtgga caccacoaaa cccttcacgg tggtgactca gtttttgacc gactctggca 960 atgcctctac cggttcgggg acgctcgcog agatcogtcg cctgtacgtc caggacggca 1020 aggtcatcca gaacgagccc gtcccctcga acacgaacca gctcgactct acgtcgccca 10 80 acgtgaacga cgccagcggc ggatccgtga cgcaagcctt ctgcaccgccj gccaatgcct 1140 cgcagtacct ccgcctggga ggcatggatg gcatgggcgc ctcgctggag cgcggoa~gg 1200 tactcgtctt cagcatctgg aactcggcgg gcgacttcat gaactggctg gacagcggcg 1260 aggccggccc gtgcagoaac acctccggcg acccggcccg catcgtggcc gagacgcccg 1320 aagtctccgt caccttctce aacatccgct ggggagatat tgggtecaec ttcaacgccg 1380 cgcccccgcc cccgcctgcg tccagcacga cgttttcgac tacacggagg agctcgacga 1440 cttcgagcag cccgagctgc acgcagactc actgggggca gtgcggtggc attgggtaca 1500 gcgggtgcaa gacgtgcacg tcgggcacta cgtgccagta tagoaacgac tactactcgc 1560: aatgccttta g 1571
<210> 22 <211 > 1395 <212> DNA <213> Acremonium thermophilum <220> <221 > source <222> 1..1395 <223> /mol_type="DNA" /organism-'Acremonium thermophilum" <400> 22 atggcgcgcc agactcattc ctaoggcatc ttggccgtcg cccttgtctt ctggcgccag 60 ctggctcacg ctcagaaccc agccgcgccc gaggtgcatc cgtcgctgcc cacctggaag 120 tgcaccacgg cgggcggttg cgtgcagcag aacacgtccg tcgtgctcga ctgggacgcg 180 cgcgccatcg tgccgtctgg cggcggcgge ggcggcggtg gcgggtcctg caagacggcc 240 accggcggcg tcagtgccag cctctgtccc gacgaggcca cctgcgccaa gaactgcgtc 300 gtccagccca tcgccaacta cacggagcac ggcgtgtcca ccgcgggcga cgccctgacc 360 ctgtacoact aogtgcgcat caacggccag ctgtcggccc agtcgccccg cgtctacctg 420 ctgggcccgg acggcgacta egagatgdtg cacctcctgg gccgcgagct cagcttcgac 480 gtcgacctgt ccacgetgoa ctgcggogag aacggcgece tefcacatctn, egagatgagc 540 gpcggeggcg ggegcaacec egacaaeaGg ggcggegega gctacggctc gggctactgc 600 gacgcccagt gccccaccct ggcgtggcgc aacggcacca tcaacacogg ccgccagggc 660 tactgctgcg acgagatgga catccrggag gccaactagc aggccaacge cttcaccccg 720 cacccctgca ggaacggcaa ctgcgacaag agcggctgcg gctacaaccc ctacgceaag 780 ggcttocgea actactgggg ccccggtatg accgtcgaca ggtccaagac cttcaccatc 840 accacccagt tcgtcgagag tggcgggtec ctcagccaga tcgtgcgcag gtacgcccag 900 ggcgggcggc cgarccocag cgcggcgtcg ggcggcgacg tcatccagge cagcgcctgc 960 ggcacggcgo agagcttcgg cgggctggCG gagatgggcg cggcgctgga ccgcggcatg 1020 gtcctcgtct tcagcatotg gaacgacccg gcccagaaca tgaactggct egacagcggc 1080
acogcggggc octgcagcag cgcggcgggc agtccggcca cgatccaggc gcagcatccg 1X4Q gatacgcacg ttgtcttotc gaacatccgc tggggcgaca ttggctcgao gacccagacg 1200 tcggcgccco cgcccccgoc tgcgtccagc acgacgtttt cgactacacg gaggagoteg 1260 acgacttcga gcagcccgag ctgcacgcag actcactggg ggcagtgcgg tggeattggg 1320 tacagcgggt gcaagacgtg cacgtcgggc actacgtgcc agtatagcaa cgactactac 1380 tcgcaatgcc tttag 1395
<210> 23 <211 >496 <212> PRT <213> Acremonium thermophilum <220> <221 > SOURCE <222> 1..496 <223> /mol_type="protein" /organism-'Acremonium thermophilum" <400> 23
Met Ala Phe Arg Leu Ala Ala Ala Leu Pro Phe Leu Leu Leu Leu Ala 15 10 15
His Ala Gin Lys Ile Gly Thr Ala Val Pro Glu Val His Pro Lys Leu 20 25 30
Val Thr Gin Thr Cys Thr Thr Ser Gly Gly Cys Val Asp Arg Gin Thr 35 40 45
Leu Leu Val Thr Asp Ala Leu Ser Arg Pro Phe His Lys Val Gly Asp 50 55 60
Pro Ser Val Pro Cys Thr Pro Leu Asn Thr Thr Leu Cys Pro Asp Ala 65 70 75 80
Ala Thr Cys Ala Ala Asn Cys Glu Leu Glu Gly Val Glu Tyr Gly Ser 85 30 95
Leu Gly Val Leu Thr Ala Gly Arg Ala Val Thr Leu Arg Gin Tyr Leu 100 105 110
Phe Asp Gly Asp Ala Phe Thr Ser Val Ser Pro Arg Leu Tyr Leu Leu 115 120 125
Ala Glu Asp Gly Met Asn Tyr Glu Leu Leu Lys Leu Asn Gly Gin Glu 130 135 140
Leu Ser Tyr Asp Val Asp Val Ser Gin Leu Gly Cys Gly Met Asn Gly 145 150 155 160
Ala Leu Tyr Leu Ser Glu Met Asp Ala Ser Gly Ser Arg Ser Asp Leu 165 170 175
Asn Pro Ala Gly Ala Gin Tyr Gly Thr Gly Tyr Cys Asp Ala Gin Cys 180 185 190
Phe Asn Val Ser Phe lie Asn Gly Leu Pro Asn Leu Asn Asn Ser Gly 195 200 205
Ala Cys Cys Asn Glu Met Asp lie Trp Glu Ala Asn Ala Met Ala Thr 210 215 220
Ala Met Thr Pro His Thr Cys Ser Gly Val Gly Set Phe Leu Cys Ser 225 230 235 240
Gly Asp Glu Cys Gly Lys Thr Asp Ala Gly Val Cys Asp His Asp Gly 245 250 255
Cys Gly Val Asn Pro Phe Arg Ala Gly Leu Lys Asp Phe Tyr Gly Pro 26Q 265 270
Gly Lys Thr Val Asp Thr Thr Lys Pro Phe Thr Val Val Thr Gin Phe 275 280 285
Leu Thr Asp Ser Gly Asn Ala Ser Thr Gly Ser Gly Thr Leu Ala Glu 290 295 300
Ile Arg Arg Leu Tyr Val Gin Asp Gly Lys Val Ile Gin Asn Glu Pro 305 310 315 320
Val Pro Ser Asn Thr Asn Gin Leu Asp Ser Thr Ser Pro Asn Val Asn 325 330 335
Asp Ala Ser Gly Gly Ser Val Thr Gin Ala Phe Cys Thr Ala Ala Asn 340 345 350
Ala Ser Gin Tyr Leu Arg Len Gly Gly Met Asp Gly Met Gly Ala Ser 355 360 365
Leu Glu Arg Gly Met Val Leu Val Phe Ser Ile Trp Asn Ser Ala Gly 370 375 380
Asp Phe Met Asn Trp Leu Asp Ser Gly Glu Ala Gly Pro Cys Ser Asn 385 390 395 400
Thr Ser Gly ASp Pro Ala Arg Ile Val Ala GlU Thr Pro Glu Val Ser 405 410 415
Val Thr Phe Ser Asn Ile Arg Trp Gly Asp Ile Gly Ser Thr Phe Asn 420 425 430
Ala Ala Pro Pro Pro Pro Pro Ala Ser Ser Thr Thr Phe Ser Thr Thr 435 440 445
Arg Arg Ser Ser Thr Thr Ser Ser Ser Pro Ser Cys Thr Gin Thr His 450 455 460
Trp Gly Gin Cys Gly Gly Ile Gly Tyr Ser Gly Cys Lys Thr Cys Thr 465 470 475 480
Ser Gly Thr Thr Gys Gin Tyr Ser Asn Asp Tyr Tyr Ser Gin Cys Leu 485 490 495
<210> 24 <211 >464 <212> PRT <213> Acremonium thermophilum <220> <221 > SOURCE <222> 1..464 <223> /mol_type="protein" /organism="Acremonium thermophilum" <400> 24
Met Ala Arg Gin Thr His Ser Tyr Gly Ile Léu Ala val Ala Leu val 15 10 15
Phe Trp Arg Gin Leu Ala His Ala Gin Asn Pro Ala Ala Pro Glu Val 20 25 30
His Pro Ser Leu Pro Thr Trp Lys Cys Thr Thr Ala Gly Gly Cys Val 35 40 45
Gin Gin Asn Thr Ser Val Val leu Asp Trp Asp Ala Arg Ala Ile Val 50 55 60
Pro Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser Cys Lys Thr Ala 65 70 75 80
Thr Gly Gly Val Ser Ala Ser Leu Cys Pro Asp Glu Ala Thr Cys Ala 85 90 95
Lys Asn Cys Val Val Gin Pro Ile Ala Asn Tyr Thr Glu His Gly Val 100 105 110
Ser Thr Ala Gly Asp Ala Leu Thr Leu Tyr His Tyr Val Arg Ile Asn 115 120 125
Gly Gin Leu Ser Ala Gin Sér Pro Arg Val Tyr Leu Leu Gly Pro Asp 130 135 140
Gly Asp Tyr Glu Met Leu His Leu Leu Gly Arg Glu Leu Ser Phe Asp 145 150 155 160
Val Asp Leu Ser Thr Leu Pro Cys Gly Glu Asn Gly Ala Leu Tyr Leu 165 170 175
Ser Glu Met Ser Ala Gly Gly Gly Arg Asn Pro Asp Asn Thr Gly Gly 180 185 190
Ala Ser Tyr Gly Ser Gly Tyr Cys Asp Ala Gin Cys Pro Thr Leu Ala 195 200 205
Trp Arg Asn Gly Thr Ile Asn Thr Gly Arg Gin Gly Tyr Cys Cys Asp 210 215 220
Glu Met Asp Ile lieu Glu Ala Asn Ser Gin Ala Asn Ala Phe Thr Pro 225 230 23:5 240
His Pro Cys Arg Asn Gly Asn Cys Asp Lys Ser Gly Cys Gly Tyr Asn 245 250 255
Pro Tyr Ala Lys Gly Phe Arg Asn Tyr Trp Gly Pro Gly Met Thr Val 260 265 270
Asp Thr Ser Lys Pro Phe Thr Ile Thr Thr Gin Phe Val Glu Ser Gly 275 280 285
Gly Ser Leu Ser Gin Ile Val Arg Arg Tyr Ala Gin Gly Gly Arg Pro 290 295 300
Ile Pro Ser Ala Ala Ser Gly Gly Asp Val Ile Gin Ala Ser Ala Gys 305 310 315 320
Gly Thr Ala Gin Ser Phe Gly Gly Leu Ala Glu Met Gly Ala Ala Leu 325 330 335
Asp Arg Gly Met Val Leu Val Phe Ser Ile Trp Asn Asp Pro Ala Gin 340 345 350
Asn Met Asn Trp Leu Asp Ser Gly Thr Ala Gly Pro Gys Ser Ser Ala 355 360 365
Ala Gly Ser Pro Ala Thr Ile Gin Ala Gin His Pro Asp Thr His Val 370 375 380
Val Phe Ser Asn Ile Arg Trp Gly Asp Ile Gly Ser Thr Thr Gin Thr 385 390 395 400
Ser Ala Pro Pro Pro Pro Pro Ala Ser Ser Thr Thr Phe Ser Thr Thr 405 410 415
Arg Arg Ser Ser Thr Thr Ser Ser Ser Pro Ser Cys Thr Gin Thr His 420 425 430
Trp Gly Gin Cys Gly Gly Ile Gly Tyr Ser Gly Cys Lys Thr Cys Thr 435 440 445
Ser Gly Thr Thr Cys Gin Tyr Ser Asn Asp Tyr Tyr Ser Gin Cys Leu 450 455 460
<210> 25 <211 > 1568 <212> DNA <213> Acremonium thermophilum <220> <221 > source <222> 1..1568 <223> /mol_type="DNA" /organism-'Acremonium thermophilum" <400> 25 atggctttca gactggcagc tgcgctgcct ttccttcttc tcctggcaca cgcacaaaag 60 attggoactg cggttcccga ggtccatccg aagctggtga cccagacctg caccacttcg 120 ggaggttgtg ttgaccgcca aacattgctc gtgacggatg ccctctcgcg tccgttccac 180 aaggttggag atccttcggt gccgtgcacg ccgttgaaca ccaccctctg cccggacgcc 240 gccacttgtg cggcaaactg cgagctcgaa ggggtggagt atggtagcct cggcgtgctt 300 accgccggcc gagctgtcac tctgcggcag tacctcttcg atggcgacgc cttcacctct 360 gtcagcccgc gcctctacct gctggccgag gacggcatga actatgagct gctcaagctg 420 aacggccaag agctctccta tgacgtcgac gtgtcgcagc tgggctgcgg catgaatggc 480 gccttgtacc tctccgagat ggacgcctcg gggtctagga gcgacctgaa cccagcaggt 540 gcccagtatg gcacgggcta ctgcgatgcg cagtgcttta atgtgtcgtt tatcaacgga 600 ctggtatgtg ctcgtggaat tgaacgtatc cgtcatagcc aagtcaaaca tgaactaact 660 cttcacacat cttcctccoa caaccaaatc tcaacaactc aootocctac tacaacaaaa 720 tggatatctg ggaagcaaat gc.catggc.ta cogccatgao gcctoataoc fcgoagcggcg 780 tgggctcctt cttgtgctct ggcgacgagt gtggcaagac cgacgccggc gtatgcgacc 840 acgacggttg cggcgtcaac ccattccgcg cgggcctcaa agacttctac ggccccggca 900 agaccgtgga caccaccaaa cccttcacgg tggtgactca gtttttgacc gactctggca 960 atgcctctac cggttcgggg acgctcgccg agatccgtcg cctgtacgtc caggacggca 1020 aggtcatcca gaacgagccc gtcccctcga acacgaacca gctcgactct acgtcgccca 1080 acgtgaacga cgccagcggc ggatccgtga cgcaagcctt ctgcaccgcc gccaatgcct 1140 cgcagtacct ccgcctggga ggcatggatg gcatgggcgc ctcgctggag cgcggcatgg 1200 tactcgtctt cagcaLctgg aactcggcgg gcgacttcat. gaacLggctg gacagcggcg 1260 aggccggccc gtgcagcaac acctccggcg acccggcccg catcgtggcc gagacgcccg 1320 aagtctccgt caccttctco aacatccgct ggggagatat tgggtccacc ttcaacggcg 1380 gcaaccctcc cggcggaaac cgtggcacca ccaccacccg ccgcccagcc actaccactg 1440 gaagctctcc cggacctacc cagtctcact acggccagtg cggcggtatt ggctacagcg 1500 gccccacggt ctgcgccagc ggcacaactt gccaggtcct gaacccttac tactctcagt 1560 gcctgtaa 1568
<210> 26 <211 > 1395 <212> DNA <213> Acremonium thermophilum <220> <221 > source <222> 1..1395 <223> /mol_type="DNA" /organism="Acremonium thermophilum" <400> 26 atggcgcgcc agactcattc ctacggcatc ttggcogtcg cccttgtctt ctggcgccag 60 ctggctcacg ctcagaaccc agccgcgcec gaggtgcatc cgtcgctgcc cacctggaag 120 tgcaccacgg cgggcggttg cgtgcagcag aacacgtccg tcgtgdtcga ctgggacgcg 180 cgcgccatcg tgccgtctgg cggcggcggc ggcggcggtg gcgggtcctg caagacggcc 240 accggcggcg tcagtgccag cctctgtccc gacgaggcca cctgcgccaa gaactgcgtc 300 gtccagccca tcgccaacta cacggagcac ggcgtgtcca ccgcgggcga cgccctgacc 360 ctgtaccact acgtgcgcat caacggccag ctgtcggocc agtcgccccg cgtctacctg 420
Gtgggaccgg acggcgacta cgagatgctg cacctcctgg gccgcgagct cagcttcgac 480 gtcgacctgt ccacgctgcc ctgcggcgag aacggcgccc tctacototc cgagatgagc 540 gccggcggcg ggcgcaaccc cgacaacacg ggcggcgcga gctacggctc gggctactgc 600 gacgcccagt gceecacqct ggcgtggcgc aacggcacca tcaacaccgg ccgccagggc 660 tactgctgcg acgagatgga catcctggag gccaactcgc aggccaacgo cttcaccccg 720 cacccetgca ggaacggcaa ctgcgacaag agcggctgcg gctacaaccc ctacgccaag 780 ggcttcogca actactgggg ccccggtatg accgtcgaca cgtccaagcc cttcaccatc 840 accacccagt tcgtcgagag tggcgggtcc ctcagccaga tcgtgcgcag gtacgcccag 900 ggcgggcggc cgatccccag cgcggcgtcg ggcggcgacg tcatccaggc cagcgcctgc 960 ggcacggcgc agagcttcgg cgggctggcc gagatgggcg cggcgctgga ccgcggcatg 1020 gtcctcgtct tcagcatctg gaacgacccg gcccagaaca tgaactggct cgacagcggc 1080 accgcggggc cctgcagcag cgcggcgggc agtccggcca cgatccaggc gcagcatccg 1140 gatacgcacg ttgtcttctc gaacatccgc tggggcgaca ttggctcgao gacccagacg 1200 tcgggcggca accctcccgg cggaaaccgt ggcaccaoca ccacccgccg cccagccact 1260 accactggaa gctctcccgg acctacccag tctcactacg gccagtgcgg cggtattggc 1320 tacagcggcc ocacggtctg cgecagcggc acaacttgcc aggtcctgaa cccttactac 1380 tctcagtgcc tgtaa 1395
<210> 27 <211 >495 <212> PRT <213> Acremonium thermophilum <220> <221 > SOURCE <222> 1..495 <223> /mol_type="protein" /organism="Acremonium thermophilum" <400> 27
Met Ala She Arg leu Ala Ala Ala Leu Pro Phe Leu Leu Leu Leu Ala 1 5 10 15
His Ala Gin lys lie Gly Thr Ala Val Pro Glu Val His Pro Lys Leu 20 25 30
Val Thr Gin Thr Cys Thr Thr Ser Gly Gly Cys Val Asp Arg Gin Thr 35 40 45
Leu Leu val Thr Asp Ala Leu ser Arg Pro Phe His Lys val Gly Asp 50 55 60
Pro Ser Val Pro Cys Thr Pro Leu Asn Thr Thr Leu Cys Pro Asp Ala 65 70 75 80
Ala Thr Cys Ala Ala Ash Cys Glu Leu Glu Gly Val Glu Tyr Gly Ser 85 90 95
Leu Gly Val Leu Thr Ala Gly Arg Ala Val Thr Leu Arg Gin Tyr Leu 100 10b 110
Phe Asp Gly Asp Ala Phe Thr Ser Val Ser Pro Arg Leu Tyr Leu Leu 115 120 125
Ala Glu Asp Gly Met Asn Tyr Glu Leu Leu Lys Leu Asn Gly Gin Glu 130 135 140
Leu Ser Tyr Asp Val Asp Val Ser Gin Leu Gly Cys Gly Met Asn Gly 145 150 155 160
Ala Leu Tyr Leu Ser Glu Met Asp Ala :Ser Gly Ser Arg Ser Asp Leu 165 170 175
Asn Pro Ala Gly Ala Gin Tyr Gly Thr Gly Tyr Cys Asp Ala Gin Cys 180 185 190
Phe Ash Val Ser Phe lie Asn Gly Leu Pro Asn Léu Asn Asn Ser Gly 195 200 205
Ala Cys Cvs Asn Glu Met Asp lie Trp Glu Ala Asn Ala Met Ala Thr 210 * 215 * 220
Ala Met Thr Pro His Thr Cys Ser Gly Val Gly Ser Phe Leu Cys Ser 225 230 235 240
Gly Asp Glu Gys Gly Lys Thr Asp Ala Gly Val Cys Asp His Asp Gly 245 250 255
Cys Gly Val Asn Pro Phe Arg Ala Gly Leu Lys Asp Phe Tyr Gly Pro 260 265 270
Gly Lys Thr Val Asp Thr Thr Lys Pro Phe Thr Val Val Thr Gin Phe 275 280 285
Leu Thr Asp Ser Gly Asn Ala Ser Thr Gly Ser Gly Thr Leu Ala Glu 290 295 300 lie Arg Arg Leu Tyr Val Gin Asp Gly Lys Val lie Gin Asn Glu Pro 305 310 315 320
Val Pro Ser Asn Thr Asn Gin Leu Asp Ser Thr Ser Pro Asn Val Asn 325 330 335
Asp Ala Ser Gly Gly Ser Val Thr Gin Ala Phe Cys Thr Ala Ala Asn 340 345 350
Ala Ser Gin Tyr Leu Arg Leu Gly Gly Met Asp Gly Met Gly Ala Ser 355 360 365
Leu Glu Arg Gly Met Val Leu Val Phe Ser He Trp Asn Ser Ala Gly 370 375 380
Asp Phe Met Asn Trp Leu Asp Ser Gly Glu Ala Gly Pro Cys Ser Asn 385 390 395 400
Thr Ser Gly Asp Pro Ala Arg lie Val Ala Glu Thr Pro Glu Val Ser 405 410 415
Val Thr Phe Ser Asn He Arg Trp Gly Asp He Gly Ser Thr Phe Asn 420 425 430
Gly Gly Asn Pro Pro Gly Gly Asn Arg Gly Thr Thr Thr Thr Arg Arg 435 440 445
Pro Ala Thr Thr Thr Gly Ser Ser Pro Gly Pro Thr Gin Ser His Tyr 450 455 460
Gly Gin Cys Gly Gly lie Gly Tyr Ser Gly Pro Thr Val Cys Ala Ser 465 470 475 480
Gly Thr Thr Cys Gin Val Leu Asn Pro Tyr Tyr Ser Gin Cys Leu 485 490 495
<210> 28 <211 >464 <212> PRT <213> Acremonium thermophilum <220> <221 > SOURCE <222> 1..464 <223> /mol_type-'protein" /organism-'Acremonium thermophilum" <400> 28
Met Ala Arg Gin Thr His Ser Tyr Gly lie Leu Ala Val Ala Leu Val 1 5 10 15
Phe Trp Arg Gin Leu Ala His Ala Gin Asn Pro Ala Ala Pro Glu Val 20 25 30
His Pro Ser Leu Pro: Thr Trp Lys Cys Thr Thr Ala Gly Gly Cys Val 35 40 45
Gin Gin Asn Thr Ser Val Val Leu Asp Trp Asp Ala Arg Ala He Val 50 55 60
Pro Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser Cys Lys Thr Ala 65 70 75 80
Thr Gly Gly Val Ser Ala Ser Leu Cys Pro Asp Glu Ala Thr Cys Ala 85 90 95
Lys Asn Cys Val Val Gin Pro He Ala Asn Tyr Thr Glu His Gly Val 100 105 110
Ser Thr Ala Gly Asp Ala Leu Thr Leu Tyr His Tyr Val Arg lie Asn 115 120 125
Gly Gin Leu Ser Ala Gin Ser Pro Arg Val Tyr Leu Leu Gly Pro Asp 130 135 140
Gly Asp Tyr Glu Met Leu His Leu Leu Gly Arg Glu Leu Ser Phe Asp
145 150 155 ISO
Val Asp Leu Ser Thr Leu Pro Gys Gly Glu Asn Gly Ala Leu Tyr Leu 165 170 175
Ser Glu Met Ser Ala Gly Gly Gly Arg Asn Pro Asp Asn Thr Gly Gly 180 185 190
Ala Ser Tyr Gly Ser Gly Tyr Cys Asp Ala Gin Cys Pro Thr Leu Ala 195 200 205
Trp Arg Asn Gly Thr lie Asn Thr Gly Arg Gin Gly Tyr Gys Cys Asp 210 215 220
Glu Met Asp lie Leu Glu Ala Asn Ser Gin Ala Asn Ala Phe Thr Pro 225 230 235 240
His Pro Cys Arg Asn Gly Asn Cys Asp Lys Ser Gly Cys Gly Tyr Asn 245; 250 255
Pro Tyr Ala Lys Gly Phe Arg Asn Tyr Trp Gly Pro Gly Met Thr Val 260 265 270
Asp Thr Ser Lys Pro Phe Thr lie Thr Thr Gin Phe Val Glu Ser Gly 275 280 285
Gly Ser Leu Ser Gin lie Val Arg Arg Tyr Ala Gin Gly Gly Arg Pro 290 295 300
He Pro Ser Ala Ala Ser Gly Gly Asp Val lie Gin Ala Ser Ala Cys 305 310 315 320
Gly Thr Ala Gin Ser Phe Gly Gly Leu Ala Glu Met Gly Ala Ala Leu 325 330 335
Asp Arg Gly Met Val Leu Val Phe Ser lie Trp Asn Asp Pro Ala Gin 340 345 350
Asn Met Asn Trp Leu Asp Ser Gly Thr Ala Gly Pro Cys Ser Ser Ala 355 360 365
Ala Gly Ser Pro Ala Thr Tie Gin Ala Gin His Pro Asp Thr His Val 370 375 380
Val Phe Ser Asn lie Arg Trp Gly Asp lie Gly Ser Thr Thr Gin Thr 385 390 395 400
Ser Gly Gly Asn Pro Pro Gly Gly Asn Arg Gly Thr Thr Thr Thr Arg 405 410 415
Arg Pro Ala Thr Thr Thr Gly Ser Ser Pro Gly Pro Thr Gin Ser His 420 425 430
Tyr Gly Gin Cys Gly Gly lie Gly Tyr Ser Gly Pro Thr Val Cys Ala 435 440 445
Ser Gly Thr Thr Cys Gin Val Leu Asn Pro Tyr Tyr Ser Gin Cys Leu 450 455 460
REFERENCES CITED IN THE DESCRIPTION
This list of references cited by the applicant is for the reader's convenience only. It does not form part of the European patent document. Even though great care has been taken in compiling the references, errors or omissions cannot be excluded and the EPO disclaims all liability in this regard.
Patent documents cited in the description • WO20Q106Q752A f0D081 • WQ2002Q24882A Γ00071 • US2QQ40Q05674A1 [00081 • u^oo5.aiMas5A mmi • WQ2011080317A Γ0009] Γ00841 • WO2QQ7071818A Γ00101 Γ00941 Γ00941 Γ00941 Γ00Θ41 • US7992812B ίΟ0101 • .U.S.6.QZ1USA [0011,1 • US5783254A iOQ111 • W02004078919A Γ00121 • WQ9714804A Ι0Μ.3]
Non-patent literature cited in the description • Ethanol from cellulose: a general reviewBADGER, P.C.In Trends in new crops and new usesASHS Press2002000017-21 ί0103Ί • BAILEY M.SIIKA-AHO M.VALKEAJÅRVI A.PENTTILÅ M.Hydrolytic properties of two cellulases of Trichoderma reesei expressed in yeastBiotehnol. Appl. Biochem, 1993, vol. 17, 65-76 [01031 • BENDTSEN J.D.NIELSEN H.VON HEIJNE G.BRUNAK S.lmproved prediction of signal peptides: SignalP 3.0J. Mol. Biol., 2004, vol. 340, 783-795 i01Q3l • The polymerase chain reactionCOEN, D.M.Current protocols in molecular biologyJohn Wiley &amp; Sons. Inc.20010000 Γ01Θ31 • Novel microbial and eukaryotic expression systemsProduction of recombinant proteinsWiley-VCH Verlag Gmbh&amp;Co.20050000 f0103| • HAAKANA H.MIETTINEN-OINONEN A.JOUTSJOKI V.MÅNTYLA A.SUOMINEN PVEHMAANPERA J.Cloning of cellulase genes from Melanocarpus albomyces and their efficient expression in Trichoderma reeseiEnz. Microbiol. Technol., 2004, vol. 34, 159-16710103] • HENRISSAT B.A classification of glycosyl hydrolases based on amino acid sequence similaritiesBiochem. J., 1991, vol. 280, 309-316 f01031 • HENRISSAT B.BAIROCH A.New families in the classification of glycosyl hydrolases based on amino acid sequence similaritiesBiochem. J., 1993, vol. 293, 781-788 [0103] • HENRISSAT B.BAIROCH A.Updating the sequence-based classification of glycosyl hydrolasesBiochem. J., 1996, vol. 316, 695-696 i01031 • HONG J.H. TAMAKIK. YAMAMOTOKUMAGAI H.CIoning of a gene encoding a thermostabile endo-β-1,4-glucanase from Thermoascus aurantiacus and its expression in yeastBiotech. Letters, 2003, vol. 25, 657-661 f0103l • HONG J.TAMAKI H.YAMAMOTO K.KUMAGAI H.CIoning of a gene encoding thermostable cellobiohydrolase from Thermoascus aurantiacus and its expression in yeastAppl. Microbiol. Biotechnol., 2003, vol. 63, 42-50 |0103^ • JOUTSJOKI, V. V.TORKKELI T.K.NEVALAINEN K. M. ^Transformation of Trichoderma reesei with the Hormoconis resinae glucoamylase P (gamP) gene: production of a heterologous glucoamylase by Trichoderma reeseiCurr. Genet., 1993, vol. 24, 223-228 191031 • KARHUNEN T.MÅNTYLÅ A.NEVALAINEN K. M. H.SUOMINEN P. L.High frequency one-step gene replacement in Trichoderma reesei. I. Endoglucanase I overproductionMol. Gen. Genet., 1993, vol. 241, 515-522 £01031
• MIETTINEN-OINONEN AJ LONDESBOROUGHV JOUTSJOKIR LANTTOJ VEHMAANPERÅThree cellulases from Melanocarpus albomyces with application in textile industryEnzyme Microb. Technol., 2004, vol. 34, 332-341 £01031 • NEEDLEMAN S.WUNSCH C.A general method applicable to the search for similarities in the amino acid sequence of two proteinsjournal of Molecular Biology, 1970, vol. 48, 443- 453£01031 . NIELSEN H.ENGELBRECHT J.BRUNAK S.VON HEIJNE G.ldentification of prokaryotic and eykaryotic signal peptides and prediction of their cleavage sitesProtein Engineering, 1997, vol. 10, 1-610103] • Prediction of signal peptides and signal anchors by a hidden Markov modelNIELSEN HA KROGHProceedings of the Sixth international Conference of Intelligent Systems for Molecular Biology (ISMB 6)AAAI Pressl9980000122-130 Γ01031 • PALOHEIMO M.MÅNTYLÅ A.KALLIO J.SUOMINEN P.High-yield production of a bacterial xylanase in the filamentous fungus Trichoderma reesei requires a carrier polypeptide with an intact domain structureAppl. Env. Microbiol., 2003, vol. 69, 7073-708210103] • PENTTILÅ M.NEVALAINEN H.RÅTTO M.SALMINEN E.KNOWLES J.A versatile transformation system for the cellulolytic filamentous fungus Trichoderma reeseiGene, 1987, vol. 61, 155-164 £01031 • Rapid preparation of DNA from filamentous fungiRAEDER U.BRODA P.Lett. Appl. Microbiol. 19850000vol. 1, 17-20 Γ01031 • SAMBROOK J.FRITSCH E. F.MANIATIS T.Molecular cloning, a laboratory manualCold
Spring Harbor Laboratoryl 9890000 Γ01031 • SAMBROOK J.RUSSELL D. W.Molecular cloning, a laboratory manualCold Spring Harbor Laboratory20010000 £01031
• TUOHY M.WALSH J.MURRAY P.CLAEYSSENS M.CUFFE M.SAVAGE A.COUGHAN M.Kinetic parameters and mode of action of cellobiohydrolases produced by Talaromyces emersoniiBiochem. Biophys. Acta, 2002, vol. 1596, 366-380 101031 • VISSER H.JOOSTEN V.PUNT P.J.GUSAKOV A.V.OLSON P.T.JOOSTEN R.BARTELS J.VISSER J.SINITSYN A.P.EMALFARB M.A.Development of a mature fungal technology and production platform for industrial enzymes based on a Myceliophthora thermophila isolate, previously known as Chrysosporium lucknowense C1 Industrial Biotechnology, 2011, vol. 7, 3214-223 101031 • VOUTILAINEN S.PURANEN T.SIIKA-AHO, M.L APPAL AINEN A.ALAPURANEN M.KALLIO J.Cloning, expression, and characterization of novel thermostable family 7 cellobiohydrolasesBiotechnology and Bioengineering, 2008, vol. 101, 3515-528 101031 • VAN TILBEURGH, H.LOONTIES, F.DE BRUYNE, C.CLAEYSSENS, M.Fluorogenic and chromogenic glycosides as substrates and ligands of carbohydrasesMethods Enzymol., 1988, vol. 160, 45-59 Γ01031

Claims (20)

1. Polypeptid, der har endoglucanaseaktivitet og omfatter en aminosyresekvens med mindst 57 % sekvensidentitet med EG_A der har SEQ ID NO: 7 eller et fragment deraf med endoglucanaseaktivitet.A polypeptide having endoglucanase activity and comprising an amino acid sequence of at least 57% sequence identity with EG_A having SEQ ID NO: 7 or a fragment thereof having endoglucanase activity. 2. Polypeptidet ifølge krav 1, hvor polypeptidet er opnåeligt fra Acremonium thermophilum, fortrinsvis fra A. thermophilum CBS 116240.The polypeptide of claim 1, wherein the polypeptide is obtainable from Acremonium thermophilum, preferably from A. thermophilum CBS 116240. 3. Polypeptidet ifølge krav 1 eller 2, hvor polypeptidet er et rekombinant fusionsprotein yderligere omfattende et cellulosebindingsmodul (CBM) af Trichoderma reesei.The polypeptide of claim 1 or 2, wherein the polypeptide is a recombinant fusion protein further comprising a cellulose binding module (CBM) of Trichoderma reesei. 4. Polypeptidet ifølge krav 3, hvor CBM er et CBM afledt fra T.reesei EGI/Cel7B eller T.reesei CHBI/Cel7A.The polypeptide of claim 3, wherein CBM is a CBM derived from T.reesei EGI / Cel7B or T.reesei CHBI / Cel7A. 5. Polypeptidet ifølge krav 4, hvor det naturlige CBM af endoglucanase EG_A er blevet erstattet med et CBM af T. reesei EGI/Cel7B eller T. reesei CHBI/Cel7A og det resulterende fusionsprotein fortrinsvis omfatter en aminosyresekvens med mindst 55 % sekvensidentitet med henholdsvis SEQ ID NO: 23 eller 27.The polypeptide of claim 4, wherein the natural CBM of endoglucanase EG_A has been replaced by a CBM of T. reesei EGI / Cel7B or T. reesei CHBI / Cel7A and the resulting fusion protein preferably comprises an amino acid sequence having at least 55% sequence identity with SEQ, respectively. ID NO: 23 or 27. 6. Polypeptidet ifølge krav 5, hvor fusionsproteinet omfatter en aminosyresekvens med SEQ ID NO: 23 eller SEQ ID NO: 27.The polypeptide of claim 5, wherein the fusion protein comprises an amino acid sequence having SEQ ID NO: 23 or SEQ ID NO: 27. 7. Isoleret polynukleotid valgt fra gruppen bestående af: a) et polynukleotid omfattende den kodende sekvens som vist i SEQ ID NO: 5; b) et polynukleotid, der koder for et polypeptid ifølge krav 1; c) et polynukleotid, der koder for et fragment af et polypeptid kodet af et polynukleotid af a) eller b), hvor fragmentet har endoglucanaseaktivitet; og d) et polynukleotid omfattende en nukleotidsekvens, der er degenereret i relation til nukleotidsekvensen af et polynukleotid af a) eller b); eller den komplementære streng af sådan et polynukleotid.An isolated polynucleotide selected from the group consisting of: a) a polynucleotide comprising the coding sequence as shown in SEQ ID NO: 5; b) a polynucleotide encoding a polypeptide according to claim 1; c) a polynucleotide encoding a fragment of a polypeptide encoded by a polynucleotide of a) or b), wherein the fragment has endoglucanase activity; and d) a polynucleotide comprising a nucleotide sequence degenerated in relation to the nucleotide sequence of a polynucleotide of a) or b); or the complementary strand of such a polynucleotide. 8. Polynukleotidet ifølge krav 7, omfattende et gen inkluderet i en mikroorganisme med accessionsnummer valgt fra gruppen bestående af DSM 25492, DSM 25657, og DSM 25655.The polynucleotide of claim 7, comprising a gene included in a microorganism with accession number selected from the group consisting of DSM 25492, DSM 25657, and DSM 25655. 9. Vektor, der omfatter et polynukleotid ifølge kravene 7 eller 8 funktionelt forbundet til regulatoriske sekvenser i stand til at styre ekspression af polypeptidet ifølge krav 1.A vector comprising a polynucleotide according to claims 7 or 8 functionally linked to regulatory sequences capable of controlling expression of the polypeptide of claim 1. 10. Værtscelle omfattende vektoren ifølge krav 9.A host cell comprising the vector of claim 9. 11. Fremgangsmåde til fremstilling af et polypeptid ifølge krav 1, hvilken fremgangsmåde omfatter trinnene at transformere en værtscelle med en ekspressionsvektor, der koder for polypeptidet, og at dyrke værtscellen under forhold, der muliggør ekspression af polypeptidet, og eventuelt at genvinde og oprense polypeptidet.A method of producing a polypeptide according to claim 1, comprising the steps of transforming a host cell with an expression vector encoding the polypeptide and culturing the host cell under conditions which allow expression of the polypeptide and optionally recovering and purifying the polypeptide. 12. Enzympræparat omfattende polypeptidet ifølge et hvilket som helst af kravene 1-6.An enzyme composition comprising the polypeptide of any one of claims 1-6. 13. Enzympræparatet ifølge krav 12, omfattende mindst et yderligere enzym valgt fra en gruppe af cellobiohydrolase, endoglucanase, beta-glucosidase, beta-glucanase, xyloglucanase, xylanase, beta-xylosidase, mannanase, beta-mannosidase, α-glucuronidase, acetylxylanesterase, α-arabinofuranosidase, a-galactosidase, pectinase, involverende endo- og exo-a-L-arabinaser, endo- og exo-galactoronase, endopectinlyase, pectatlyase, og pectinesterase, phenolesterase, ligninase involverende ligninperoxidase, mangan-afhængig peroxidase, H202-genererende enzym og laccase med eller uden en mediator.The enzyme composition of claim 12, comprising at least one additional enzyme selected from a group of cellobiohydrolase, endoglucanase, beta-glucosidase, beta-glucanase, xyloglucanase, xylanase, beta-xylosidase, mannanase, beta-mannosidase, α-glucuronidase, acetylxylan esterase, α -arabinofuranosidase, α-galactosidase, pectinase, involving endo- and exo-αL-arabinases, endo- and exo-galactoronase, endopectin lyase, pectatlyase, and pectin esterase, phenol esterase, ligninase involving lignin peroxidase, manganese-dependent peroxidase, manganese-dependent peroxidase, with or without a mediator. 14. Enzympræparatet ifølge krav 13 omfattende cellobiohydrolase CBHI/Cel 7A, cellobiohydrolase CBHII/Cel6A, endoglucanase EGII/Cel5A, beta-glucosidase PG/Cel3A, xylanase XynlOA og endoglucanase EG_A ifølge krav 1.The enzyme composition of claim 13 comprising cellobiohydrolase CBHI / Cel 7A, cellobiohydrolase CBHII / Cel6A, endoglucanase EGII / Cel5A, beta-glucosidase PG / Cel3A, xylanase Xyn10A and endoglucanase EG_A of claim 1. 15. Enzympræparatet ifølge krav 13 omfattende cellobiohydrolase CBHI/Cel 7A, cellobiohydrolase CBHII/Cel6A, endoglucanase EGII/Cel5A, beta-glucosidase PG/Cel3A, xylanase XynlOA og endoglucanase fusionsprotein omfattende en aminosyresekvens med SEQ ID NO: 24.The enzyme preparation of claim 13 comprising cellobiohydrolase CBHI / Cel 7A, cellobiohydrolase CBHII / Cel6A, endoglucanase EGII / Cel5A, beta-glucosidase PG / Cel3A, xylanase Xyn10A and endoglucanase fusion protein comprising an amino acid sequence of SEQ ID NO: 24. 16. Enzympræparatet ifølge krav 13 omfattende cellobiohydrolase CBHI/Cel 7A, cellobiohydrolase CBHII/Cel6A, endoglucanase EGII/Cel5A, beta-glucosidase βΟ/ΟεΙ3Α, xylanase XynlOA og endoglucanase fusionsprotein omfattende en aminosyresekvens med SEQ ID NO: 27.The enzyme preparation of claim 13 comprising cellobiohydrolase CBHI / Cel 7A, cellobiohydrolase CBHII / Cel6A, endoglucanase EGII / Cel5A, beta-glucosidase βΟ / ΟεΙ3Α, xylanase Xyn10OA and endoglucanase fusion protein comprising an amino acid sequence 27: 17. Enzympræparatet ifølge krav 13 omfattende cellobiohydrolase CBHI/Cel 7A, cellobiohydrolase CBHII/Cel6A, endoglucanase EGII/Cel5A, beta-glucosidase PG/Cel3A, xylanase XynlOA og endoglucanase fusionsprotein omfattende en aminosyresekvens med SEQ ID NO: 28.The enzyme composition of claim 13 comprising cellobiohydrolase CBHI / Cel 7A, cellobiohydrolase CBHII / Cel6A, endoglucanase EGII / Cel5A, beta-glucosidase PG / Cel3A, xylanase Xyn10A and endoglucanase fusion protein comprising an amino acid sequence of SEQ ID NO: 28. 18. Fremgangsmåde til behandling af cellulosemateriale med en endoglucanase ifølge et hvilket som helst af kravene 1-6 eller et enzympræparat ifølge et hvilket som helst af kravene 12-17, hvor fremgangsmåden omfatter de følgende trin: i) at reagere cellulosematerialet med endoglucanasen eller enzympræparatet omfattende endoglucanasen ii) at opnå mindst delvist hydrolyseret cellulosemateriale.A method of treating cellulose material with an endoglucanase according to any one of claims 1-6 or an enzyme preparation according to any one of claims 12-17, wherein the method comprises the following steps: i) reacting the cellulose material with the endoglucanase or enzyme preparation comprising the endoglucanase ii) obtaining at least partially hydrolyzed cellulose material. 19. Anvendelse af et polypeptid ifølge et hvilket som helst af kravene 1-6, eller et enzympræparat ifølge et hvilket som helst af kravene 12-17 i biobrændstof-, biomassehydrolyse-, stivelse-, tekstil-, detergent-, pulp- og papir-, fødevare-, foder- eller drikkevareindustri.Use of a polypeptide according to any one of claims 1-6, or an enzyme preparation according to any of claims 12-17 in biofuel, biomass hydrolysis, starch, textile, detergent, pulp and paper -, food, feed or beverage industry. 20. Escherichia coli-stamme med accessionsnummer DSM 25492, DSM 25657, eller DSM 25655.20. Escherichia coli strain with accession number DSM 25492, DSM 25657, or DSM 25655.
DK13793173.9T 2012-05-24 2013-05-23 IMPROVED ENDOGLUCANASES FOR TREATMENT OF CELLULOS MATERIAL DK2855673T3 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
FI20125550A FI124476B (en) 2012-05-24 2012-05-24 Enhanced endoglucanases for the treatment of cellulosic material
PCT/FI2013/050564 WO2013175074A1 (en) 2012-05-24 2013-05-23 Improved endoglucanases for treatment of cellulosic material

Publications (1)

Publication Number Publication Date
DK2855673T3 true DK2855673T3 (en) 2017-06-19

Family

ID=49623218

Family Applications (1)

Application Number Title Priority Date Filing Date
DK13793173.9T DK2855673T3 (en) 2012-05-24 2013-05-23 IMPROVED ENDOGLUCANASES FOR TREATMENT OF CELLULOS MATERIAL

Country Status (9)

Country Link
EP (1) EP2855673B1 (en)
CN (1) CN104995296B (en)
AP (1) AP2014008135A0 (en)
BR (1) BR112014028716A2 (en)
CA (1) CA2872670C (en)
DK (1) DK2855673T3 (en)
FI (1) FI124476B (en)
WO (1) WO2013175074A1 (en)
ZA (1) ZA201408383B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106011198A (en) * 2016-06-16 2016-10-12 桐庐唐赢生物科技有限公司 Method for preparing artificial starch, cellulosic ethanol and single-cell protein by taking agroforestry lignocellulose biomass as raw material
EP3653706A1 (en) * 2018-11-13 2020-05-20 AB Enzymes Oy Fusion protein
CN110923182B (en) * 2019-12-06 2022-09-13 四川轻化工大学 High-efficiency acid-resistant endoglucanase engineering bacteria based on vinasse and application thereof
CN114574370B (en) * 2022-03-18 2023-09-15 贵州医科大学 Screening method of degradation biomass strain, trichoderma and application

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
PL209206B1 (en) * 2000-03-20 2011-08-31 Basf Se Polypeptide constituting β-glucanase from the genus Talaromyces fungus, polynucleotide, vector, host cell, method for producing the polypeptide, method for identifying compound modulating β-glucanase activity, composition suitable for use as animal feed, method for plant material processing, method for reducing plant material viscosity, use of an isolated polypeptide and animal feed
FI120045B (en) * 2005-12-22 2009-06-15 Roal Oy Treatment of cellulose materials and enzymes useful therein
WO2007115201A2 (en) * 2006-03-30 2007-10-11 Novozymes, Inc. Polypeptides having endoglucanase activity and polynucleotides encoding same
EP2007884B1 (en) * 2006-04-13 2012-05-16 AB Enzymes Oy Cellulase fusion proteins and their use
BRPI0812276A2 (en) * 2007-05-31 2014-10-14 Novozymes Inc METHODS FOR INCREASING THE ACTIVITY OF A POLYPEPTIDE HAVING INTENSIFYING CELLULolytic ACTIVITY, DEGRADING OR CONVINCING A CELLULOSE CONTAINING MATERIAL, AND FOR MAKING A FERMENTATION PRODUCT, AND, COMPOSITION
EP2496694B1 (en) * 2009-11-06 2017-04-19 Novozymes, Inc. Compositions for saccharification of cellulosic material
CA2801577A1 (en) * 2010-06-03 2011-12-08 Mascoma Corporation Yeast expressing saccharolytic enzymes for consolidated bioprocessing using starch and cellulose
US9139823B2 (en) * 2010-11-12 2015-09-22 Novozymes, Inc. Polypeptides having endoglucanase activity and polynucleotides encoding same

Also Published As

Publication number Publication date
CA2872670A1 (en) 2013-11-28
CN104995296B (en) 2019-01-15
ZA201408383B (en) 2016-07-27
BR112014028716A2 (en) 2017-07-25
EP2855673B1 (en) 2017-05-10
WO2013175074A1 (en) 2013-11-28
AP2014008135A0 (en) 2014-12-31
EP2855673A4 (en) 2016-05-11
FI124476B (en) 2014-09-15
CN104995296A (en) 2015-10-21
FI20125550A (en) 2013-11-25
CA2872670C (en) 2021-07-27
EP2855673A1 (en) 2015-04-08

Similar Documents

Publication Publication Date Title
DK2519630T3 (en) METHOD OF TREATING CELLULOS MATERIAL AND CBHII / CEL6A ENZYMES THAT CAN BE USED THEREOF
US20180044656A1 (en) Treatment of Cellulosic Material and Enzymes Useful Therein
US8975058B2 (en) Endoglucanases for treatment of cellulosic material
DK2855673T3 (en) IMPROVED ENDOGLUCANASES FOR TREATMENT OF CELLULOS MATERIAL
EP2859095B1 (en) Novel proteins for the treatment of cellulosic material
US9458440B2 (en) Proteins for the treatment of cellulosic material
CA2994320C (en) Treatment of cellulosic material and enzymes useful therein
WO2019074828A1 (en) Cellobiose dehydrogenase variants and methods of use thereof
Class et al. Patent application title: Treatment of Cellulosic Material and Enzymes Useful Therein Inventors: Jari Vehmaanperä (Klaukkala, FI) Jari Vehmaanperä (Klaukkala, FI) Marika Alapuranen (Rajamaki, FI) Terhi Puranen (Nurmijarvi, FI) Terhi Puranen (Nurmijarvi, FI) Matti Siika-Aho (Helsinki, FI) Jarno Kallio (Jarvenpaa, FI) Jarno Kallio (Jarvenpaa, FI) Satu Hooman (Espoo, FI) Sanni Voutilainen (Lohja, FI) Teemu Halonen (Espoo, FI) Liisa Viikari (Helsinki, FI) Assignees: Roal OY