DK2576796T3 - Filamentous fungal host strains and DNA constructs, as well as procedures for their use


Publication number
DK2576796T3
Authority
DK
Denmark
Prior art keywords
gene
interest
fungal host
host cell
filamentous fungal
Application number
DK11727041.3T
Other languages
Danish (da)
Inventor
Benjamin S Bower
Thijs Kaper
Bradley R Kelemen
Original Assignee
Danisco Us Inc

Classifications

    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12N MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00 Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09 Recombinant DNA-technology
    • C12N15/63 Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79 Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/80 Vectors or expression systems specially adapted for eukaryotic hosts for fungi
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12N MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00 Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14 Hydrolases (3)
    • C12N9/24 Hydrolases (3) acting on glycosyl compounds (3.2)
    • C12N9/2402 Hydrolases (3) acting on glycosyl compounds (3.2) hydrolysing O- and S- glycosyl compounds (3.2.1)
    • C12N9/2405 Glucanases
    • C12N9/2434 Glucanases acting on beta-1,4-glucosidic bonds
    • C12N9/2437 Cellulases (3.2.1.4; 3.2.1.74; 3.2.1.91; 3.2.1.150)
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12P FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P21/00 Preparation of peptides or proteins
    • C12P21/02 Preparation of peptides or proteins having a known sequence of two or more amino acids, e.g. glutathione
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12Y ENZYMES
    • C12Y302/00 Hydrolases acting on glycosyl compounds, i.e. glycosylases (3.2)
    • C12Y302/01 Glycosidases, i.e. enzymes hydrolysing O- and S-glycosyl compounds (3.2.1)
    • C12Y302/01091 Cellulose 1,4-beta-cellobiosidase (3.2.1.91)
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02 TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02P CLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P20/00 Technologies relating to chemical industry
    • Y02P20/50 Improvements relating to the production of bulk chemicals
    • Y02P20/52 Improvements relating to the production of bulk chemicals using catalysts, e.g. selective catalysts

Description

II. FIELD
[0001] The present disclosure relates to filamentous fungal host strains and recombinant DNA constructs for creation and use thereof. The filamentous fungal host strains are particularly useful for expressing proteins of interest in a reliable or less variable fashion, and for efficiently screening DNA libraries encoding recombinant proteins.
III. BACKGROUND
[0002] Filamentous fungal host cell strains have been engineered to express various proteins. These proteins can then be used, optionally after being purified, in various industrial, academic or other applications. The expression process can often be unpredictable. It is not a rare occasion when only a very small number, if any, of the transformants prepared actually produce the enzyme of interest. Variability in expression of heterologous genes (e.g., non-native genes, or native genes existing in a form that is different from the native form) can occur as a consequence of factors unrelated to their nucleic acid and/or amino acid sequences. For instance, non-homologous integration predominates in filamentous fungi. Thus, expression vectors integrate into the genome at random, possibly resulting in positional effects on expression levels between transformants. In addition, unstable transformants may be generated, necessitating further screening of transformants to obtain stable transformants. Variability may also occur by generation of heterokaryons as a result of transformation of a multinucleate protoplast. Therefore, reliable means of producing enzymes of interest, with reduced variability, provide clear advantages.
[0003] For certain industrial applications, these proteins produced from fungal host strains are often engineered to obtain new, desirable characteristics, or a different level of certain characteristics. In these cases, existing filamentous fungal host cell strains are often used for screening DNA libraries encoding variant proteins. Variability in expression efficacy and/or levels makes it difficult to compare the characteristics of a given variant with those of another. Therefore, there is a clear advantage if variants that the particular host cell is capable of expressing are expressed reliably, and at less variable levels, such that their characteristics can be more readily assessed and compared.
[0004] While telomeric, extrachromosomal replicating vectors can be used as an alternative to genomic integration, this method does not eliminate variability in expression levels between transformants. Thus, the art would benefit from tools to reduce sequence-independent differences in gene expression from filamentous fungal host strains.
IV. SUMMARY
[0005] The present disclosure relates to filamentous fungal host strains and recombinant DNA constructs for creation and use thereof. The filamentous fungal host strains reliably produce transformants and express enzymes with reduced variability in expression levels. The filamentous fungal host strains are useful for efficiently screening DNA libraries encoding recombinant proteins.
[0006] In particular, the present disclosure provides a filamentous fungal host cell expression system, comprising:
a. a fungal host cell containing in its chromosomal DNA a disruption in one or more components of the non-homologous recombination (NHR) pathway, a part of a first selectable marker that lacks a first selectable function, and a second selectable marker that is operative to confer a second selectable function; and
b. a nucleic acid molecule containing a sequence that, when introduced into said fungal host cell, confers said first selectable function to said first selectable marker, a sequence operable to express one or more genes of interest, and sequences with substantial homology to sequences that flank said chromosomal selectable markers;
wherein said homologous sequences cause a homologous recombination event that results in a functional first selectable marker, removal of said second selectable marker, and expression of said gene of interest, and wherein said first selectable marker and said second selectable marker are different markers. In some embodiments, the one or more components of the NHR pathway comprise one or more of the group consisting of ku80, ku70, rad50, mre11, xrs2, and lig4. In certain embodiments, the nucleic acid molecule introduced into the fungal host cell in b) can be either a non-native or a native molecule existing in a non-native form to the fungal host cell.
[0007] Gene deletion may be accomplished by the use of a deletion plasmid. For example, the desired gene to be deleted or disrupted can be inserted into a plasmid. The deletion plasmid is then cut at an appropriate restriction enzyme site(s), internal to the desired gene coding region, and the gene coding sequence or part thereof replaced with a selectable marker. Flanking DNA sequences from the locus of the gene to be deleted or disrupted, preferably between about 0.5 and about 2.0 kb, remain on either side of the selectable marker gene. A suitable deletion plasmid will generally have unique restriction enzyme sites present therein to enable the fragment containing the deleted gene, including flanking DNA sequences, and the selectable marker gene to be removed as a single linear piece. A deletion plasmid may also be constructed by the use of PCR to amplify the desired flanking regions and selectable markers with restriction enzyme sites at the ends of the amplified fragments to facilitate the joining of fragments. Alternatively, a deletion plasmid can be synthesized de novo by specifying the appropriate flanking DNA and selectable marker sequences.
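The layout of the linear deletion fragment described above (flanking DNA, selectable marker, flanking DNA) can be sketched in a few lines of Python. This is only an illustrative sketch: the function name, the constants, and the strict enforcement of the 0.5 to 2.0 kb range are assumptions made here, since the text states that range as a preference rather than a requirement.

```python
# Hypothetical helper for illustration; not part of the disclosure.
MIN_FLANK_BP = 500    # "about 0.5 kb" preferred lower bound from the text
MAX_FLANK_BP = 2000   # "about 2.0 kb" preferred upper bound from the text


def build_deletion_cassette(upstream_flank: str, marker: str, downstream_flank: str) -> str:
    """Return the linear fragment: upstream flank + selectable marker + downstream flank."""
    flanks = (("upstream", upstream_flank), ("downstream", downstream_flank))
    for name, flank in flanks:
        # Assumption: treat the preferred range as a hard limit for this sketch.
        if not MIN_FLANK_BP <= len(flank) <= MAX_FLANK_BP:
            raise ValueError(
                f"{name} flank is {len(flank)} bp; expected {MIN_FLANK_BP}-{MAX_FLANK_BP} bp"
            )
    if not marker:
        raise ValueError("selectable marker sequence is empty")
    return upstream_flank + marker + downstream_flank


# Toy placeholder sequences; real flanks would come from the locus being deleted.
cassette = build_deletion_cassette("A" * 1000, "ATGGCC" * 200, "T" * 1500)
print(len(cassette))  # 1000 + 1200 + 1500 = 3700
```

In practice the flanks would be taken from the sequence on either side of the coding region being replaced, and the marker would be a complete marker expression unit.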
[0008] The first and second selectable markers are different markers. In some embodiments, the first and second selectable markers are independently selected from the group consisting of alsR, amdS, hygR, pyr2, pyr4, pyrG, sucA, a bleomycin resistance marker, a blasticidin resistance marker, a pyrithiamine resistance marker, a chlorimuron ethyl resistance marker, a neomycin resistance marker, an adenine pathway gene, a tryptophan pathway gene, and thymidine kinase. In some embodiments, at least one of the homologous sequences is upstream or downstream from the pyr2 sequence. In some embodiments, the homologous sequences are upstream and downstream from the pyr2 sequences. In other embodiments, the homologous sequences comprise the sequence(s) operable to express one or more genes of interest or one or more variant genes of interest, and the sequence that confers the first selectable function to the first selectable marker. In some embodiments, the filamentous fungal host cell is a species of a genus selected from the group consisting of Trichoderma, Penicillium, Aspergillus, Humicola, Chrysosporium, Fusarium, and Emericella. In some embodiments, the Trichoderma is T. reesei, while in other embodiments, the Aspergillus is A. niger.
[0009] The present disclosure provides filamentous fungal host cell expression systems wherein the gene of interest or the variant gene of interest is selected from the group consisting of hemicellulases, peroxidases, proteases, cellulases, xylanases, lipases, phospholipases, esterases, cutinases, pectinases, keratinases, reductases, oxidases, phenol oxidases, lipoxygenases, ligninases, pullulanases, tannases, pentosanases, malanases, beta-glucanases, arabinosidases, hyaluronidase, chondroitinase, laccase, amylases, glucoamylases, and mixtures thereof. Non-limiting examples of genes of interest or variant genes encode: proteins or enzymes involved in starch metabolism, proteins or enzymes involved in glycogen metabolism, acetyl esterases, aminopeptidases, amylases, arabinases, arabinofuranosidases, carboxypeptidases, catalases, cellulases, chitinases, chymosin, cutinase, deoxyribonucleases, epimerases, esterases, α-galactosidases, β-galactosidases, α-glucanases, glucan lysases, endo-β-glucanases, glucoamylases, glucose oxidases, α-glucosidases, β-glucosidases, glucuronidases, hemicellulases, hexose oxidases, hydrolases, invertases, isomerases, laccases, lipases, lyases, mannosidases, oxidases, oxidoreductases, pectate lyases, pectin acetyl esterases, pectin depolymerases, pectin methyl esterases, pectinolytic enzymes, peroxidases, phenoloxidases, phytases, polygalacturonases, proteases, rhamno-galacturonases, ribonucleases, thaumatin, transferases, transport proteins, transglutaminases, xylanases, hexose oxidase (D-hexose: O2-oxidoreductase, EC 1.1.3.5), variants thereof, and combinations thereof. In some embodiments, the gene of interest or the variant gene of interest encodes a polypeptide selected from the group consisting of peptide hormones, growth factors, clotting factors, chemokines, cytokines, lymphokines, antibodies, receptors, adhesion molecules, and microbial antigens (e.g., HBV surface antigen, HPV E7, etc.), and variants (e.g., fragments) thereof.
[0010] In addition, the present disclosure provides methods of expressing a gene of interest in the filamentous fungal host cell system as set out in the claims, comprising introducing into said filamentous fungal host cell said nucleic acid molecule, growing said host cells, and selecting for host cells that have said first selectable function but lack said second selectable function. In some embodiments, the expression so achieved is more reliable than that achieved using conventional methods in the art. In some embodiments, the methods further comprise assaying for the expression of the gene of interest or the variant gene of interest, and/or for a biochemical function of a polypeptide encoded by the gene of interest or by the variant gene of interest.
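The selection step of this method keeps only host cells that have gained the first selectable function and lost the second (for instance, amdS activated and pyr2 inactivated, as Figure 8 illustrates). It can be expressed as a simple filter. The following Python sketch is illustrative only; the `Transformant` record and its field names are hypothetical and merely stand in for phenotype scores obtained on selective media.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class Transformant:
    """Hypothetical record of the marker phenotypes scored for one transformant."""
    name: str
    has_first_marker_function: bool    # first selectable function restored
    has_second_marker_function: bool   # second selectable function still present


def select_targeted_integrants(candidates: Iterable[Transformant]) -> List[Transformant]:
    """Keep transformants that gained the first function and lost the second."""
    return [
        t for t in candidates
        if t.has_first_marker_function and not t.has_second_marker_function
    ]


# Toy data: only "t1" shows the phenotype expected after the homologous recombination event.
pool = [
    Transformant("t1", True, False),
    Transformant("t2", True, True),    # first function gained, second marker not removed
    Transformant("t3", False, True),   # untransformed phenotype
]
print([t.name for t in select_targeted_integrants(pool)])
```

Requiring both conditions at once is what distinguishes the intended recombination event from integration elsewhere, which would leave the second marker in place.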
V. BRIEF DESCRIPTION OF THE DRAWINGS
[0011] The following figures and tables are meant to be illustrative without limiting the scope and content of the instant disclosure or the claims herein.
Figure 1 provides a schematic illustrating the derivation of the MAD6 host strain, from the quad-deleted derivative strain.
Figure 2 provides a schematic of the T. reesei ku80 deletion cassette.
Figure 3 provides a schematic of the pyr2 deletion cassette used to create the Archy2 strain.
Figure 4 provides a schematic of the hygR deletion cassette used to create the Archy3 strain.
Figure 5 provides a schematic of the T. reesei bgl1 deletion cassette.
Figure 6 provides a schematic of the T. reesei egl3 deletion cassette.
Figure 7 provides a schematic of the T. reesei telomeric plasmid vector used for expression of cre recombinase.
Figure 8 illustrates the inactivation of the pyr2 selectable marker and activation of the amdS selectable marker as a consequence of introduction of a polynucleotide of a gene of interest or variant gene of interest (GOI) cassette, wherein the GOI in this example encodes a CBH2 variant.
Figure 9 illustrates the pENTR/D-TOPO vector, as described in Example 2.
Figure 10 illustrates the pTrex3gM vector, as described in Example 2.
Figure 11 illustrates the Fv43B expression vector, pTrex3gM-Fv43B, as described in Example 2.
Figure 12 illustrates the Fv43C expression vector, pTrex3gM-Fv43C, as described in Example 2.
Figure 13 is a picture of an SDS-PAGE characterizing Fv43B expressed using T. reesei quad deleted clones transformed with fv43B. The percent protein relative to the total proteins was quantitatively determined in accordance with Example 2, and listed below the corresponding lane.
Figure 14 is a picture of an SDS-PAGE characterizing Fv43C expressed using T. reesei quad deleted clones transformed with fv43C. The percent protein relative to the total proteins loaded was quantitatively determined in accordance with Example 2, and listed below the corresponding lane.
Figure 15 is a picture of an SDS-PAGE characterizing Fv43B and Fv43C expressed using the MAD6 construct. The percent protein relative to the total proteins was quantitatively determined in accordance with Example 2, and listed below the corresponding lane.
Figure 16A is a picture of 4 SDS-PAGE gels examining the CBH2 variants as described in Example 3. Figure 16B depicts the average expression of CBH2 variants as described in Example 3.
VI. DETAILED DESCRIPTION OF VARIOUS EMBODIMENTS
[0012] The present disclosure relates to filamentous fungal host strains and recombinant DNA constructs for creation and use thereof. The filamentous fungal host strains can be used to provide expression of genes or variants of interest in these hosts with higher reliability and/or lower variability in expression levels, as compared to other expression methods known in the art. The filamentous fungal host strains are, in a particular embodiment, useful for efficiently screening DNA libraries encoding recombinant proteins.
[0013] The methods described herein express proteins of interest or variants of interest with improved reliability. In this sense, the term "improved reliability" is reflected in that (1) at least 60% (e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, or at least 99%) of the transformants are stable transformants; or that at least 60% (e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, or at least 99%) of the transformants express the protein or variant of interest as intended over the background expression level; and (2) that the proteins or variants of interest are expressed with expression levels varying less than 60% (e.g., less than 55%, less than 50%, less than 45%, less than 40%, less than 35%, less than 30%, less than 25%, less than 20%, less than 15%, less than 10%, less than 5%, or less than 2%), wherein the term "expression level variation" is defined as the difference between the highest and the lowest expression levels divided by the difference between the highest expression level and the background expression level, all determined with the same construct and the same gene or variant of interest.
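The "expression level variation" defined above is the ratio (highest - lowest) / (highest - background). As a minimal sketch, assuming expression levels are available as plain numbers in a common unit, it can be computed as follows. The function name and the example values are illustrative and do not come from the disclosure.

```python
from typing import Sequence


def expression_level_variation(levels: Sequence[float], background: float) -> float:
    """(highest - lowest) / (highest - background) for one construct and one gene."""
    if not levels:
        raise ValueError("at least one expression level is required")
    highest = max(levels)
    lowest = min(levels)
    denominator = highest - background
    if denominator <= 0:
        # The definition only makes sense when expression exceeds background.
        raise ValueError("highest expression level must exceed the background level")
    return (highest - lowest) / denominator


# Made-up numbers: three transformants at 80, 90 and 100 units, background 20 units.
variation = expression_level_variation([80.0, 90.0, 100.0], background=20.0)
print(f"{variation:.0%}")        # (100 - 80) / (100 - 20) = 25%
print(variation < 0.60)          # satisfies the "less than 60%" criterion
```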
[0014] It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the compositions and methods described herein. In this application, the use of the singular includes the plural unless specifically stated otherwise. The use of "or" means "and/or" unless stated otherwise. Likewise, the terms "comprise," "comprising," "comprises," "include," "including" and "includes" are not intended to be limiting. All patents and publications, including all amino acid and nucleotide sequences disclosed within such patents and publications, referred to herein are expressly incorporated by reference. The headings provided herein are not limitations of the various aspects or embodiments of the disclosure, which can be had by reference to the specification as a whole. Accordingly, the terms herein are more fully defined by reference to the specification as a whole.
[0015] Unless defined otherwise herein, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs. Singleton, et al., DICTIONARY OF MICROBIOLOGY AND MOLECULAR BIOLOGY, 2D ED., John Wiley and Sons, New York (1994), and Hale & Markham, THE HARPER COLLINS DICTIONARY OF BIOLOGY, Harper Perennial, NY (1991) provide one of skill with a general dictionary of many of the terms used in this disclosure. Although any methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present disclosure, the preferred methods and materials are described. Numeric ranges are inclusive of the numbers defining the range. Unless otherwise indicated, nucleic acids are written left to right in 5' to 3' orientation; amino acid sequences are written left to right in amino to carboxyl orientation, respectively. Practitioners are particularly directed to Sambrook et al., MOLECULAR CLONING: A LABORATORY MANUAL (Second Edition), Cold Spring Harbor Press, Plainview, N.Y., 1989, and Ausubel FM et al., Current Protocols in Molecular Biology, John Wiley & Sons, New York, N.Y., 1993, for definitions and terms of the art. It is to be understood that this disclosure is not limited to the particular methodology, protocols, and reagents described, as these may vary.
1. Definitions
[0016] The terms below are more fully defined by reference to the specification as a whole.
[0017] The term "polypeptide" as used herein refers to a compound or a molecule made up of a single chain of amino acid residues linked by peptide bonds. The term "protein" as used herein may be synonymous with the term "polypeptide".
[0018] "Variant" means a protein, which is derived from a precursor protein (e.g., the native protein) by addition of one or more amino acids to either or both the C- and N-terminal ends, substitution of one or more amino acids at one or a number of different sites in the amino acid sequence, or deletion of one or more amino acids at either or both ends of the protein, or at one or more sites in the amino acid sequence. The preparation of a variant of a protein of interest (e.g., encoded by a "gene of interest"), or a "variant of interest" (e.g., encoded by a "variant gene of interest"), can be performed by any means known in the art. For example, a variant of interest is prepared by modifying a DNA sequence which encodes the native protein (e.g., the gene of interest), transformation of the modified DNA sequence into a suitable host, and expression of the modified DNA sequence to form the variant of interest. In a non-limiting example, a variant of a cellulase of interest may be prepared by any means known in the art. For instance, a cellulase variant is prepared by modifying a DNA sequence which encodes its native (or naturally-occurring) counterpart, transformation of the modified DNA sequence into a suitable host, and expression of the modified DNA sequence to form the variant cellulase. The variant enzyme of interest of the disclosure includes polypeptides comprising altered amino acid sequences in comparison to that of the native enzyme of interest. The variant enzyme of interest, in certain embodiments, may retain some characteristics of the native enzyme of interest, but at the same time, have certain altered characteristics from the native enzyme of interest.
For example, variant cellulase of the disclosure includes peptides comprising altered amino acid sequences in comparison with a precursor enzyme amino acid sequence wherein the variant cellulase retains the characteristic cellulolytic nature of the precursor enzyme but which may have altered properties in some specific aspect. For example, a variant cellulase may have an increased pH optimum or increased temperature or oxidative stability or decreased affinity or binding to non-cellulosic materials but will retain its characteristic cellulolytic activity.
[0019] In a non-limiting example, it is contemplated that the variants of interest according to the present disclosure may be derived from a nucleotide sequence encoding a variant wherein the functional activity of the expressed variant is retained. For example, a cellulase variant may be derived from a DNA fragment encoding a cellulase variant wherein the cellulase activity of the expressed variant is retained. The DNA fragment encoding a cellulase may, in some embodiments, further include a DNA sequence or portion thereof encoding a hinge or linker attached to the cellulase DNA sequence at either the 5' or 3' end wherein the functional activity of the encoded cellulase domain is retained. The terms "variant" and "derivative" may be used interchangeably herein.
[0020] "Equivalent residues" may be defined by determining homology at the level of tertiary structure for a precursor or reference enzyme whose tertiary structure has been determined by x-ray crystallography. For example, equivalent residues are defined as those for which the atomic coordinates of two or more of the main chain atoms of a particular amino acid residue of a cellulase and Hypocrea jecorina CBH2 (N on N, CA on CA, C on C and O on O) are within 0.13 nm and preferably 0.1 nm after alignment. Alignment is achieved after the best model has been oriented and positioned to give the maximum overlap of atomic coordinates of non-hydrogen protein atoms of the enzyme and the precursor/reference enzyme in question. For example, a suitable model includes a crystallographic model giving the lowest R factor for experimental diffraction data at the highest resolution available. See, e.g., U.S. Patent Application Publication No. 2006/0205042.
[0021] Equivalent residues which are functionally analogous to a specific residue of a precursor or reference enzyme are defined as those residues that may adopt a conformation such that they alter, modify, or contribute to the structure of the enzyme, to the substrate binding, or to the catalysis in a predefined manner. For example, equivalent residues of H. jecorina CBH2 are those amino acids of a cellulase which may adopt a conformation such that they alter, modify or contribute to protein structure, substrate binding or catalysis in a manner defined. In some embodiments, equivalent residues can be those that occupy an analogous position to the extent that, although the main chain atoms of the given residue may not satisfy the criteria of equivalence on the basis of occupying a homologous position, the atomic coordinates of more than one (e.g., 2, 3, or more) of the side chain atoms of the residue lie within a short distance (e.g., within about 0.02 nm, within about 0.05 nm, within about 0.08 nm, within about 0.10 nm, within about 0.12 nm, within about 0.13 nm, within about 0.14 nm, within about 0.15 nm, within about 0.17 nm, within about 0.18 nm, within about 0.20 nm, within about 0.25 nm, etc.) of the corresponding side chain atom of the precursor/reference enzyme. For example, a cellulase, for which a tertiary structure has been obtained by X-ray crystallography, may suitably comprise equivalent residues, wherein the atomic coordinates of at least two of the side chain atoms of the residue lie within 0.13 nm of the corresponding side chain atoms of H. jecorina CBH2, even though the main chain atoms of the given residue do not satisfy the criteria of equivalence on the basis of occupying a homologous position. The crystal structure of H. jecorina CBH2 is shown in Zou et al. (1999) Structure 7(9): 1035-45.
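The distance criterion for equivalent residues can be illustrated with a short Python sketch. It assumes that the two structures have already been superimposed and that each residue is supplied as a mapping from atom name to (x, y, z) coordinates in nanometres. The function names, the data layout, and the toy coordinates are hypothetical; the 0.13 nm cutoff and the "two or more atoms" threshold are taken from the text.

```python
import math
from typing import Dict, Iterable, Tuple

Coordinate = Tuple[float, float, float]
MAIN_CHAIN_ATOMS = ("N", "CA", "C", "O")


def atoms_within_cutoff(
    query: Dict[str, Coordinate],
    reference: Dict[str, Coordinate],
    atom_names: Iterable[str],
    cutoff_nm: float = 0.13,
) -> int:
    """Count matching atoms whose aligned coordinates lie within cutoff_nm of each other."""
    count = 0
    for atom in atom_names:
        if atom in query and atom in reference:
            if math.dist(query[atom], reference[atom]) <= cutoff_nm:
                count += 1
    return count


def is_equivalent_by_main_chain(
    query: Dict[str, Coordinate],
    reference: Dict[str, Coordinate],
    cutoff_nm: float = 0.13,
    min_atoms: int = 2,
) -> bool:
    """True if at least min_atoms main chain atoms (N, CA, C, O) are within the cutoff."""
    return atoms_within_cutoff(query, reference, MAIN_CHAIN_ATOMS, cutoff_nm) >= min_atoms


# Toy coordinates in nm, already aligned; N and CA are close, C and O are not.
query_residue = {"N": (0.0, 0.0, 0.0), "CA": (0.10, 0.0, 0.0), "C": (0.9, 0.0, 0.0), "O": (0.3, 0.3, 0.0)}
reference_residue = {"N": (0.05, 0.0, 0.0), "CA": (0.10, 0.10, 0.0), "C": (0.2, 0.0, 0.0), "O": (0.3, 0.0, 0.0)}
print(is_equivalent_by_main_chain(query_residue, reference_residue))
```

The same counting function can be reused for the side chain criterion by passing the side chain atom names of the residue in place of `MAIN_CHAIN_ATOMS`.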
[0022] The term "nucleic acid molecule" includes RNA, DNA, and cDNA molecules. It will be understood that, as a result of the degeneracy of the genetic code, a multitude of nucleotide sequences encoding a given protein and/or variants thereof may be produced. The present disclosure contemplates every possible variant nucleotide sequence encoding the variant enzyme of interest, all of which are possible given the degeneracy of the genetic code. For example, because of the degeneracy of the genetic code, a plurality of nucleotide sequences can encode a cellulase, such as a CBH2 and/or variants thereof, which can be produced by a method or process described herein.
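To give a sense of how large this "multitude" is, the number of distinct coding sequences for a peptide equals the product of the number of synonymous codons for each residue. The Python sketch below assumes the standard genetic code and ignores the stop codon; the function name and the three-residue example are illustrative only.

```python
import math

# Number of synonymous codons per amino acid in the standard genetic code
# (assumption: standard code; one-letter amino acid symbols).
CODON_DEGENERACY = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4, "H": 2, "I": 3,
    "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6, "T": 4, "W": 1, "Y": 2, "V": 4,
}


def count_encoding_sequences(peptide: str) -> int:
    """Number of distinct nucleotide sequences that encode the given peptide."""
    try:
        return math.prod(CODON_DEGENERACY[residue] for residue in peptide.upper())
    except KeyError as err:
        raise ValueError(f"unknown amino acid symbol: {err.args[0]!r}") from None


# Met-Lys-Leu: 1 x 2 x 6 possible codon combinations.
print(count_encoding_sequences("MKL"))  # 12
```

Because the count grows multiplicatively with length, even a short protein domain has an enormous number of possible coding sequences.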
[0023] A "heterologous" nucleic acid construct or sequence has a portion of the sequence which is not native or existing in a native form to the cell in which it is expressed. Heterologous, with respect to a control sequence refers to a control sequence (i.e. promoter or enhancer) that does not function in nature to regulate the same gene the expression of which it is currently regulating. Generally, heterologous nucleic acid sequences are not endogenous to the cell or part of the genome in which they are present, and have been added to the cell, by infection, transfection, transformation, microinjection, electroporation, or the like. A "heterologous" nucleic acid construct may contain a control sequence/DNA coding sequence combination that is the same as, or different from a control sequence/DNA coding sequence combination found in the native cell.
[0024] As used herein, the term "vector" refers to a nucleic acid construct designed for transfer between different host cells. An "expression vector" refers to a vector that has the ability to incorporate and express heterologous DNA fragments in a foreign cell. Many prokaryotic and eukaryotic expression vectors are commercially available. Selection of appropriate expression vectors is within the knowledge of those having skill in the art.
[0025] Accordingly, an "expression cassette" or "expression vector" is a nucleic acid construct generated recombinantly or synthetically, with a series of specified nucleic acid elements that permit transcription of a particular nucleic acid in a target cell. The recombinant expression cassette can be incorporated into a plasmid, chromosome, mitochondrial DNA, plastid DNA, virus, or nucleic acid fragment. Typically, the recombinant expression cassette portion of an expression vector includes, among other sequences, a nucleic acid sequence to be transcribed and a promoter.
[0026] As used herein, the term "plasmid" refers to a circular double-stranded (ds) DNA construct used as a cloning vector, and which forms an extrachromosomal self-replicating genetic element in many bacteria and some eukaryotes.
[0027] As used herein, the term "selectable marker-encoding nucleotide sequence" refers to a nucleotide sequence which is capable of expression in cells and where expression of the selectable marker confers to cells containing the expressed gene the ability to grow in the presence of a corresponding selective agent, or under corresponding selective growth conditions.
[0028] As used herein, the term "promoter" refers to a nucleic acid sequence that functions to direct transcription of a downstream gene. The promoter will generally be appropriate to the host cell in which the target gene is being expressed. The promoter, together with other transcriptional and translational regulatory nucleic acid sequences (also termed "control sequences"), are often used to express a given gene. In general, the transcriptional and translational regulatory sequences include, but are not limited to, promoter sequences, ribosomal binding sites, transcriptional start and stop sequences, translational start and stop sequences, and enhancer or activator sequences.
[0029] A "chimeric gene construct," as defined herein, refers to a non-native gene (i.e., one that has been introduced into a host or one that does not exist in its native form in the host) that may be composed of parts of different genes, including regulatory elements. A chimeric gene construct for transformation of a host cell is typically composed of a transcriptional regulatory region (promoter) operably linked to a protein coding sequence, or, in a selectable marker chimeric gene, to a selectable marker gene encoding a protein conferring, for example, antibiotic resistance to transformed cells. A typical chimeric gene of the present disclosure, for transformation into a host cell, includes a transcriptional regulatory region that is constitutive or inducible, a protein coding sequence, and a terminator sequence. A chimeric gene construct may also include a second DNA sequence encoding a signal peptide if secretion of the target protein is desired. For example, certain of the constructs described herein, e.g., the Archy3 T. reesei strain, are chimeric gene constructs.
[0030] A nucleic acid is "operably linked" when it is placed into a functional relationship with another nucleic acid sequence. For example, the DNA encoding a secretory leader is operably linked to the DNA encoding a polypeptide if it is expressed as a preprotein that participates in the secretion of the polypeptide; a promoter or enhancer is operably linked to a coding sequence if it affects the transcription of the sequence; or a ribosome binding site is operably linked to a coding sequence if it is positioned so as to facilitate translation. Generally, "operably linked" means that the DNA sequences being linked are contiguous, and, in the case of a secretory leader, contiguous and in reading frame. However, enhancers do not have to be contiguous. Linking is accomplished by ligation at convenient restriction sites. If such sites do not exist, synthetic oligonucleotide adaptors, linkers or primers for PCR are used in accordance with conventional practice.
[0031] A selection marker herein is said to be "operative" when it has full selection function.
[0032] As used herein, the term "gene" refers to the segment of DNA involved in producing a polypeptide chain, that may or may not include regions preceding and following the coding region, e.g. 5' untranslated (5' UTR) or "leader" sequences and 3' UTR or "trailer" sequences, as well as intervening sequences (introns) between individual coding segments (exons).
[0033] In general, nucleic acid molecules encoding a variant of interest will hybridize, under moderate to high stringency conditions, to a wild type precursor/reference sequence. For example, a nucleic acid molecule encoding a variant cellulase such as CBH2 will hybridize, under moderate to high stringency conditions, to the wild type sequence such as provided herein as SEQ ID NO:7. However, in certain embodiments, a nucleotide sequence encoding an enzyme of interest may reflect a substantially different codon usage, but continue to encode the same enzyme of interest. For example, the coding sequence may be modified to facilitate a more robust expression of the enzyme or variant of interest in a particular prokaryotic or eukaryotic expression system, in accordance with the frequency with which a particular codon is utilized by the host (see, e.g., Te'o et al., FEMS Microbiology Letters, 190: 13-19, 2000, describing the optimization of genes for expression in filamentous fungi).
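The point that a different codon usage can still encode the same enzyme can be illustrated with a short sketch. The two 12-nucleotide sequences below are invented for illustration and are not fragments of SEQ ID NO:7 or any other sequence of the disclosure; the codon table is the standard genetic code, built from the conventional TCAG ordering.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence (length divisible by 3) to one-letter amino acids."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Hypothetical example: two codon choices for the same four-residue peptide.
seq_a = "ATGGCTAAAGGT"
seq_b = "ATGGCCAAGGGC"
same_protein = translate(seq_a) == translate(seq_b)
nucleotide_differences = sum(a != b for a, b in zip(seq_a, seq_b))
```

Here the two sequences differ at several third-codon positions yet translate to the same peptide, which is the property exploited when a coding sequence is adapted to the codon preferences of a particular host.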
[0034] A nucleic acid sequence is deemed "selectively hybridizable" to a reference nucleic acid sequence if the two sequences specifically hybridize to one another under moderate to high stringency hybridization and wash conditions. Hybridization conditions are based on the melting temperature (Tm) of the nucleic acid binding complex or probe. For example, "maximum stringency" typically occurs at about Tm -5°C (5°C below the Tm of the probe); "high stringency" at about 5- about 10°C below the Tm; "moderate" or "intermediate stringency" at about 10- about 20°C below the Tm of the probe; and "low stringency" at about 20- about 25°C below the Tm. Functionally, maximum stringency conditions may be used to identify sequences having strict identity or near-strict identity with the hybridization probe; while high stringency conditions are used to identify sequences having about 80% or more sequence identity with the probe.
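The stringency tiers defined above can be sketched as a simple lookup on the difference between the probe Tm and the hybridization temperature. The function name and the hard cut-offs are assumptions made for illustration; the text gives each boundary only as "about", so the tiers overlap in practice.

```python
def classify_stringency(tm_c: float, hyb_temp_c: float) -> str:
    """Map a hybridization temperature to a stringency tier relative to probe Tm.

    Cut-offs follow the approximate ranges in the text (degrees C below Tm):
    up to 5 -> maximum, 5-10 -> high, 10-20 -> moderate, 20-25 -> low.
    """
    degrees_below_tm = tm_c - hyb_temp_c
    if degrees_below_tm < 0:
        raise ValueError("hybridization temperature is above the probe Tm")
    if degrees_below_tm <= 5:
        return "maximum"
    if degrees_below_tm <= 10:
        return "high"
    if degrees_below_tm <= 20:
        return "moderate"
    if degrees_below_tm <= 25:
        return "low"
    return "below low stringency"
```

For a probe with a Tm of 70°C, a run at 62°C would fall in the "high" tier under these assumed cut-offs and a run at 55°C in the "moderate" tier.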
[0035] Moderate and high stringency hybridization conditions are well known in the art (see, e.g., Sambrook, et al, 1989, Chapters 9 and 11, and in Ausubel, F. M., et al., 1993).
[0036] An example of high stringency conditions includes hybridization at about 42°C in 50% formamide, 5xSSC, 5xDenhardt's solution, 0.5% SDS and 100 µg/mL denatured carrier DNA, followed by washing two times in 2xSSC and 0.5% SDS at room temperature and two additional times in 0.1xSSC and 0.5% SDS at 42°C.
[0037] The term "recombinant" when used with reference, e.g., to a cell, or nucleic acid, protein, or vector, indicates that the cell, nucleic acid, protein or vector, has been modified by the introduction of a heterologous nucleic acid or protein or the alteration of a native nucleic acid or protein, or that the cell is derived from a cell so modified. Thus, for example, recombinant cells express genes that are not found within the native (non-recombinant) form of the cell or express native genes that are otherwise abnormally expressed, under expressed or not expressed at all.
[0038] As used herein, the terms "transformed", "stably transformed" or "transgenic" with reference to a cell means the cell has a non-native (or not existing in its native form) nucleic acid sequence integrated into its genome or as an episomal plasmid that is maintained through multiple generations.
[0039] As used herein, the term "expression" refers to the process by which a polypeptide is produced based on the nucleic acid sequence of a gene. The process includes both transcription and translation.
[0040] The term "introduced" in the context of inserting a nucleic acid sequence into a cell, means "transfection" or "transformation" or "transduction" and includes reference to the incorporation of a nucleic acid sequence into a eukaryotic or prokaryotic cell where the nucleic acid sequence may be incorporated into the genome of the cell (for example, chromosome, plasmid, plastid, or mitochondrial DNA), converted into an autonomous replicon, or transiently expressed (for example, transfected mRNA).
[0041] The term "expression of a protein or variant of interest" refers to transcription and translation of the gene of interest or the variant of interest, the products of which include precursor RNA, mRNA, polypeptide, post-translationally processed polypeptides, and derivatives thereof. For example, "CBH2 expression" refers to transcription and translation of the cbh2 gene or variants thereof, the products of which include precursor RNA, mRNA, polypeptide, post-translationally processed polypeptides, and derivatives thereof, including CBH2 from related species such as Trichoderma koningii, Hypocrea jecorina (also known as Trichoderma longibrachiatum, Trichoderma reesei or Trichoderma viride) and Hypocrea schweinitzii. The level of expression can be determined by various known methods, including, for example, Western blot for the protein or variant of interest, Northern blot analysis and reverse transcriptase polymerase chain reaction (RT-PCR) assays for the mRNA of the gene or variant gene of interest, and enzymatic activity assays on suitable substrates. By way of example, assays for CBH2 expression include Western blot for CBH2 protein, Northern blot analysis and reverse transcriptase polymerase chain reaction (RT-PCR) assays for cbh2 mRNA, and Phosphoric Acid Swollen Cellulose (PASC) and p-hydroxybenzoic acid hydrazide (PAHBAH) assays as described in the following: (a) PASC: (Karlsson, J. et al. (2001), Eur. J. Biochem, 268, 6498-6507, Wood, T. (1988) in Methods in Enzymology, Vol. 160. Biomass Part A: Cellulose and Hemicellulose (Wood, W. & Kellogg, S. Eds.), pp. 19-25, Academic Press, San Diego, Calif., USA) and (b) PAHBAH: (Lever, M. (1972) Anal. Biochem., 47, 273, Blakeney, A. B. & Mutton, L. L. (1980) J. Sci. Food & Agriculture, 31, 889, Henry, R. J. (1984) J. of the Institute of Brewing, 90, 37).
[0042] The term "host cell" refers to a cell that contains a vector and supports the replication, and/or transcription or transcription and translation (expression) of the expression construct. Host cells for use in the present disclosure can be prokaryotic cells, such as an E. coli cell, or eukaryotic cells such as yeast, plant, insect, amphibian, or mammalian cells. In certain embodiments, host cells are suitably filamentous fungal cells.
[0043] The term "filamentous fungi" means any and all filamentous fungi recognized by those of skill in the art. A preferred fungus is selected from the group consisting of Aspergillus, Trichoderma, Fusarium, Chrysosporium, Penicillium, Humicola, Neurospora, or alternative sexual forms thereof such as Emericella, Hypocrea. It has now been demonstrated that the asexual industrial fungus Trichoderma reesei is a clonal derivative of the ascomycete Hypocrea jecorina (See, Kuhls et al., PNAS, 93:7755-7760, 1996).
[0044] Many microbes make enzymes that hydrolyze cellulose, including the wood rotting fungus Trichoderma, the compost bacteria Thermomonospora, Bacillus, and Cellulomonas; Streptomyces; and the fungi Humicola, Aspergillus and Fusarium.
[0045] The term "isolated" or "purified" as used herein refers to a nucleic acid or amino acid that is removed from at least one component with which it is naturally associated.
[0046] "Filamentous fungi" include all filamentous forms of the subdivision Eumycota and Oomycota. For example, filamentous fungi include, without limitation, Acremonium, Aspergillus, Emericella, Fusarium, Humicola, Mucor, Myceliophthora, Neurospora, Scytalidium, Thielavia, Tolypocladium, or Trichoderma species. In some embodiments, the filamentous fungus may be an Aspergillus aculeatus, Aspergillus awamori, Aspergillus foetidus, Aspergillus japonicus, Aspergillus nidulans, Aspergillus niger, or Aspergillus oryzae. In some embodiments, the filamentous fungus is a Fusarium bactridioides, Fusarium cerealis, Fusarium crookwellense, Fusarium culmorum, Fusarium graminearum, Fusarium graminum, Fusarium heterosporum, Fusarium negundi, Fusarium oxysporum, Fusarium reticulatum, Fusarium roseum, Fusarium sambucinum, Fusarium sarcochroum, Fusarium sporotrichioides, Fusarium sulphureum, Fusarium torulosum, Fusarium trichothecioides, or Fusarium venenatum. In some embodiments, the filamentous fungus is a Humicola insolens, Humicola lanuginosa, Mucor miehei, Myceliophthora thermophila, Neurospora crassa, Scytalidium thermophilum, or Thielavia terrestris. In some embodiments, the filamentous fungus is a Trichoderma harzianum, Trichoderma koningii, Trichoderma longibrachiatum, Trichoderma reesei, e.g., RL-P37 (Sheir-Neiss et al., Appl. Microbiol. Biotechnology, 20 (1984) pp. 46-53; Montenecourt B.S., Can., 1-20, 1987), QM9414 (ATCC No. 26921), NRRL 15709, ATCC 13631, 56764, 56466, 56767, or Trichoderma viride, e.g., ATCC 32098 and 32086. In some embodiments, the filamentous fungus is a Trichoderma reesei RutC30, which is available from the American Type Culture Collection as Trichoderma reesei ATCC 56765. Related to this, in some embodiments, the disclosure provides a whole cell broth preparation of any one of the filamentous fungi described herein.
[0047] Generally, the microorganism is cultivated in a cell culture medium suitable for production of enzymes. The cultivation takes place in a suitable nutrient medium comprising carbon and nitrogen sources and inorganic salts, using procedures known in the art. Suitable culture media, temperature ranges and other conditions suitable for growth and enzymatic production are known in the art. As a non-limiting example, the normal temperature range for the production of cellulases by Trichoderma reesei is 24°C to 28°C.
[0048] Generally, a "whole cell broth preparation" is used as it is produced by fermentation with no or minimal recovery and/or purification. For example, once an enzyme or variant of interest (or more than one enzyme or variant of interest) is secreted by a cell into the cell culture medium, the cell culture medium containing the enzyme or variant of interest can be used. In some embodiments the whole cell broth preparation comprises the unfractionated contents of fermentation material, including cell culture medium, extracellular enzymes and cells. Alternatively, the whole cell broth preparation can be processed by any convenient method, e.g., by precipitation, centrifugation, affinity, filtration or any other method known in the art. In some embodiments, the whole cell broth preparation can be concentrated, for example, and then used without further purification. In some embodiments the whole cell broth preparation comprises chemical agents that decrease cell viability or kill the cells. In some embodiments, the cells are lysed or permeabilized using methods known in the art. For example, a cellulase or variant of interest (e.g., CBH2 or a variant thereof, Fv43B, Fv43A) can be secreted by a cell into the cell culture medium, so that the cell culture medium contains the cellulases or variants. The cell culture medium can be used as a whole cell broth preparation.
2. Molecular Biology
[0049] In certain embodiments, the present disclosure provides for the expression of an enzyme or variant of interest. In certain embodiments, the gene encoding the enzyme or variant of interest is placed under the control of a promoter functional in a filamentous fungus. In an example of this embodiment, provided herein is a method of expressing variant cbh2 genes under the control of a suitable promoter functional in a filamentous fungus.
Known techniques in the field of recombinant genetics can be applied (See, e.g., Sambrook et al., Molecular Cloning, A Laboratory Manual, 2nd ed., 1989; Kriegler, Gene Transfer and Expression: A Laboratory Manual, 1990; and Ausubel et al., eds., CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, Greene Publishing and Wiley-Interscience, New York, 1994).
3. Expression Of Recombinant Proteins
[0050] The methods of the disclosure pertain to host cells engineered to express recombinant proteins, without limiting the method of expression to any particular method. The recombinant protein or variant of interest is preferably secreted from the cells. The disclosure provides host cells, which have been transduced, transformed or transfected with an expression vector comprising a protein-encoding nucleic acid sequence. The culture conditions, such as temperature, pH and the like, are those previously used for the parental host cell prior to transduction, transformation or transfection and will be apparent to those skilled in the art.
[0051] In one approach, a filamentous fungal cell or yeast cell is transfected with an expression vector having a promoter or a biologically active promoter fragment, or one or more (e.g., a series) of enhancers, which function in the host cell line, operably linked to a DNA segment encoding a protein or variant of interest, such that the protein or variant of interest is expressed in the cell line.
A. Nucleic Acid Constructs/Expression Vectors
[0052] Natural or synthetic polynucleotide fragments encoding a protein or variant of interest may be incorporated into chimeric constructs or vectors, capable of being introduced into, and of replication in, a filamentous fungal or yeast cell. The vectors and methods disclosed herein are suitable for use in host cells for the expression of the protein or the variant. Any vector may be used as long as it is replicable and viable in the cells into which it is introduced. Large numbers of suitable vectors and promoters are known to those of skill in the art, and many are commercially available. Cloning and expression vectors are also described in Sambrook et al., 1989, Ausubel F M et al., 1989, and Strathern et al., The Molecular Biology of the Yeast Saccharomyces, 1981. Suitable expression vectors for fungi are described in van den Hondel, C. A. M. J. J. et al. (1991) In: Bennett, J. W. and Lasure, L. L. (eds.) More Gene Manipulations in Fungi. Academic Press, pp. 396-428. The appropriate DNA sequence may be inserted into a plasmid or vector (collectively referred to herein as "vectors") by a variety of procedures. In some instances, the DNA sequence is inserted into suitable restriction endonuclease site(s) using known procedures. In other instances, methods of vector construction that do not involve restriction digestion and/or ligation may be suitably applied. Such procedures and related subcloning procedures are deemed to be within the scope of knowledge of those skilled in the art.
[0053] Recombinant filamentous fungi comprising the coding sequence for a protein or variant of interest may be produced by introducing a chimeric construct comprising the coding region of the protein or variant of interest into the cells of a selected strain of the filamentous fungi.
[0054] Once the desired form of a nucleic acid sequence is obtained, it may be modified in a variety of ways. For example, where the sequence involves non-coding flanking regions, the flanking regions may be subjected to resection, mutagenesis, etc. Thus, transitions, transversions, deletions, and insertions may be performed on the naturally occurring sequence.
[0055] A selected coding sequence may be inserted into a suitable vector according to known recombinant techniques and used to transform filamentous fungi. Due to the inherent degeneracy of the genetic code, other nucleic acid sequences, which encode substantially the same or a functionally equivalent amino acid sequence, may be used to clone and express the protein of interest or the variant. Therefore such substitutions in the coding region fall within the sequence variants covered by the present disclosure. Any and all of these sequence variants can be utilized in the same way as described herein. For example, sequence variants of a cellobiohydrolase, such as CBH2, can be used when the protein or variant of interest is a cellulase.
[0056] The terms "cellulase" "cellulolytic enzymes" or "cellulase enzymes" refer to a category of enzymes capable of hydrolyzing cellulose polymers to shorter cello-oligosaccharide oligomers, cellobiose and/or glucose. Numerous examples of cellulases, such as exoglucanases, exocellobiohydrolases, endoglucanases, and glucosidases have been obtained from cellulolytic organisms, particularly including fungi, plants and bacteria. The enzymes made by these microbes are mixtures of proteins with three types of actions useful in the conversion of cellulose to glucose: endoglucanases (EG), cellobiohydrolases (CBH), and beta-glucosidase. These three different types of cellulase enzymes act synergistically to convert cellulose and its derivatives to glucose.
[0057] CBH2 from Hypocrea jecorina is a member of the Glycosyl Hydrolase Family 6 (hence Cel6) and, specifically, was the first member of that family identified in Hypocrea jecorina (hence Cel6A). The Glycosyl Hydrolase Family 6 contains both Endoglucanases and Cellobiohydrolases/exoglucanases, and CBH2 is a cellobiohydrolase/exoglucanase. Thus, the phrases CBH2, CBH2-type protein and Cel6 cellobiohydrolases are often used interchangeably herein. Thus, the term "variant cbh2 gene" means that the nucleic acid sequence of the cbh2 gene from H. jecorina has been altered by removing, adding, and/or manipulating the coding sequence.
[0058] The present disclosure also includes recombinant nucleic acid constructs comprising one or more protein-encoding nucleic acid sequences as described above. The constructs comprise a vector, such as a plasmid or viral vector, into which a sequence of the disclosure has been inserted, in a forward or reverse orientation.
[0059] Chimeric constructs may include the coding sequence for a protein or variant of interest. In some embodiments, the coding sequence can be present: (i) in isolation; (ii) in combination with additional coding sequences, such as, for example, fusion protein or signal peptide coding sequences, where the coding sequence is the dominant coding sequence; (iii) in combination with non-coding sequences, such as, for example, introns and control elements, which include, for example, promoter and terminator elements or 5' and/or 3' untranslated regions, effective for expression of the coding sequence in a suitable host; and/or (iv) in a vector or host environment in which the coding sequence is a native or a non-native gene.
[0060] In certain aspects, a chimeric construct is employed to transfer a protein-encoding nucleic acid sequence into a cell in vitro. Preferably, the cell into which the protein-encoding nucleic acid sequence is transferred is an established filamentous fungal or yeast line. For long-term production of a protein or variant of interest, stable expression is preferred. Various known methods effective to generate stable transformants may be used to practice this disclosure.
[0061] Suitable vectors are typically equipped with a selectable marker-encoding nucleic acid sequence(s), insertion sites, and suitable control elements, such as promoter and termination sequences. The vector may comprise regulatory sequences, including, for example, non-coding sequences, such as introns and control elements, e.g., promoter and terminator elements or 5' and/or 3' untranslated regions, effective for expression of the coding sequence in host cells (and/or in a vector or host cell environment in which a modified soluble protein coding sequence is not normally expressed), operably linked to the coding sequence. Many suitable vectors and promoters are known to those of skill in the art, and many are commercially available and/or are described in Sambrook, et al. (supra).
[0062] Examples of suitable promoters include constitutive promoters and inducible promoters, including a CMV promoter, an SV40 early promoter, an RSV promoter, an EF-1α promoter, a promoter containing the tet responsive element (TRE) in the tet-on or tet-off system as described (ClonTech and BASF), the beta actin promoter and the metallothionein promoter that can be upregulated by addition of certain metal salts. A promoter sequence is a DNA sequence which is recognized by the particular filamentous fungus for expression purposes. It is operably linked to a DNA sequence encoding a protein or variant of interest. Such linkage comprises positioning of the promoter with respect to the initiation codon of the DNA sequence encoding the protein of interest. The promoter sequence contains transcription or translation control sequences, which mediate the expression of the proteins or variants of interest. Non-limiting examples include promoters from Aspergillus niger, A. awamori or A. oryzae glucoamylase-, alpha-amylase-, or alpha-glucosidase-encoding genes; the A. nidulans gpdA or trpC genes; the Neurospora crassa cbh1 or trp1 genes; the A. niger or Rhizomucor miehei aspartic proteinase encoding genes; the H. jecorina (T. reesei) cbh1, cbh2, egl1, egl2, or other cellulase encoding genes.
[0063] The choice of the proper selectable marker will depend on the host cell, and appropriate markers for different hosts are well known in the art. Examples of suitable selectable marker genes include argB from A. nidulans or T. reesei, amdS from A. nidulans, pyr4 from Neurospora crassa or T. reesei, pyrG from Aspergillus niger or A. nidulans. Additional examples of suitable selectable markers include, but are not limited to, trpC, trp1, oliC31, niaD or leu2, which are included in chimeric constructs used to transform a mutant strain such as trp-, pyr-, leu-, and the like.
[0064] Such selectable markers confer to transformants the ability to utilize a metabolite that is usually not metabolized by filamentous fungi. For example, the amdS gene from H. jecorina, which encodes the enzyme acetamidase, allows transformant cells to grow on acetamide as a nitrogen source. The selectable marker (e.g., pyrG) may restore the ability of an auxotrophic mutant strain to grow on a selective minimal medium, or the selectable marker (e.g., oliC31) may confer to transformants the ability to grow in the presence of an inhibitory drug or antibiotic.
[0065] The selectable marker coding sequence is cloned into any suitable plasmid using methods generally employed in the art. Examples of suitable plasmids include pUC18, pBR322, pRAX and pUC100. The pRAX plasmid contains AMA1 sequences from A. nidulans, which make it possible to replicate in A. niger.
[0066] The practice of the present disclosure will employ, unless otherwise indicated, conventional techniques of molecular biology, microbiology, recombinant DNA, and immunology, which are within the skill of the art. Such techniques are explained fully in the literature. See, e.g., Sambrook et al., 1989; Freshney, Animal Cell Culture, 1987; Ausubel, et al., 1993; and Coligan et al., Current Protocols in Immunology, 1991.
B. Filamentous Fungi and Culture Conditions for Recombinant Protein Production
[0067] Examples of species of parental filamentous fungi that may be treated and/or modified for recombinant protein expression include, but are not limited to, Trichoderma, e.g., Trichoderma reesei, Trichoderma longibrachiatum, Trichoderma viride, Trichoderma koningii; Penicillium sp., Humicola sp., including Humicola insolens, Aspergillus sp., Chrysosporium sp., Fusarium sp., Hypocrea sp., and Emericella sp.
[0068] Transformed cells are cultured under conditions typically employed to culture the parental fungal line. For example, cells can be cultured in a standard medium containing physiological salts and nutrients, such as described in Pourquie, J. et al., Biochemistry and Genetics of Cellulose Degradation, eds. Aubert, J. P. et al., Academic Press, pp. 71-86, 1988 and Ilmen, M. et al., Appl. Environ. Microbiol. 63:1298-1306, 1997. Various common culture conditions can be suitable, e.g., cultures are incubated at 28°C in shaker cultures or fermenters until desired levels of recombinant protein expression are achieved.
[0069] Suitable culture conditions for a given filamentous fungus may be found in the scientific literature and/or from the source of the fungi such as the American Type Culture Collection (ATCC; www.atcc.org/). After fungal growth has been established, the cells are exposed to conditions effective to cause or permit the expression of the recombinant protein.
[0070] In cases where a coding sequence is under the control of an inducible promoter, the inducing agent, e.g., a sugar, metal salt, or antibiotics, is added to the medium at a concentration effective to induce recombinant protein expression.
[0071] In some embodiments, the filamentous fungus is Aspergillus niger, which is a useful strain for obtaining overexpressed proteins of interest. For example A. niger var awamori dgr246 is known to secrete elevated amounts of cellulases (Goedegebuur et al., Curr. Genet (2002) 41: 89-98). Other strains of Aspergillus niger var awamori such as GCDAP3, GCDAP4 and GAP3-4 are also known. See, e.g., Ward et al, Appl. Microbiol. Biotechnol. 39:738-743.
[0072] In some embodiments, the filamentous fungus is Trichoderma reesei, which is another useful strain for obtaining overexpressed proteins of interest. In some embodiments, such a filamentous fungal host cell can have certain genes (or "detrimental genes" herein) that are linked to detrimental activities or traits (e.g., detrimental to expression, stability, confounding activities that would make queries or assays of certain properties difficult, etc) deleted or reduced. In some embodiments, such a fungal host cell can be modified such that it gains or enhances genes (or "favorable genes" herein) that are linked to certain favorable activities or traits, for example, increased secretion, increased stability, increased solubility, etc.
[0073] For example, a Trichoderma reesei strain RL-P37, described by Sheir-Neiss, et al., Appl. Microbiol. Biotechnol. 20:46-53 (1984) is known to secrete elevated amounts of cellulase enzymes. Functional equivalents of RL-P37 include Trichoderma reesei strain RUT-C30 (ATCC No. 56765) and strain QM9414 (ATCC No. 26921). It is contemplated that these strains would also be useful in overexpressing proteins and variants thereof, including, without limitation, certain cellobiohydrolases such as CBH1 or CBH2, or certain endoglucanases.
[0074] By way of example, when the recombinant protein is a variant CBH2, it is preferable to produce the variant in the absence of potentially detrimental native cellulolytic activity. Thus, it is useful to obtain a Trichoderma host strain, which has had one or more cellulase genes deleted prior to introduction of a DNA construct or plasmid containing the DNA fragment encoding the variant CBH2. Suitable multiple-deletion strains as such may be prepared by the method disclosed in, for example, U.S. Pat. No. 5,246,853 and PCT publication WO 92/06209. By expressing a variant CBH2 cellulase in a host microorganism that is missing one or more cellulase genes, the identification and subsequent purification procedures are simplified. Any gene from Trichoderma sp. which has been cloned can be thus deleted, for example, the cbh1, cbh2, egl1, and egl2 genes as well as those encoding EG III and/or EGV protein can be deleted from a Trichoderma host strain (see e.g., U.S. Pat. No. 5,475,101 and PCT publication WO 94/28117, respectively).
[0075] Gene deletions may be accomplished by inserting a form of the desired gene to be deleted or disrupted into a plasmid. The deletion plasmid can be then digested at an appropriate restriction enzyme site(s), internal to the desired gene coding region, and the gene coding sequence or a part thereof is replaced by a selectable marker. Flanking DNA sequences from the locus of the gene to be deleted or disrupted, preferably having a size of between about 0.5 to about 2.0 kb, can remain on either side of the selectable marker gene. A suitable deletion plasmid will generally have unique restriction enzyme sites present therein to enable the fragment containing the deleted gene, including flanking DNA sequences, and the selectable marker gene to be removed as a single linear piece.
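The layout of the linear deletion fragment described above (flanking sequence, selectable marker, flanking sequence) can be sketched as a small data structure. The class name, the placeholder sequences, and the treatment of "about 0.5 to about 2.0 kb" as hard limits of 500 and 2000 bp are all assumptions made for illustration; the text states the flank size only as a preference.

```python
from dataclasses import dataclass

# Assumed hard limits for the preferred flank size of about 0.5 to about 2.0 kb.
MIN_FLANK_BP = 500
MAX_FLANK_BP = 2000


@dataclass(frozen=True)
class DeletionCassette:
    """Linear fragment: 5' flank + selectable marker + 3' flank (illustrative)."""

    five_prime_flank: str
    marker: str
    three_prime_flank: str

    def flank_lengths_ok(self) -> bool:
        """True if both flanks fall within the assumed preferred size range."""
        flanks = (self.five_prime_flank, self.three_prime_flank)
        return all(MIN_FLANK_BP <= len(flank) <= MAX_FLANK_BP for flank in flanks)

    def linear_fragment(self) -> str:
        """Sequence of the single linear piece released from the deletion plasmid."""
        return self.five_prime_flank + self.marker + self.three_prime_flank


# Placeholder sequences only; real flanks come from the locus being deleted.
cassette = DeletionCassette("A" * 800, "ATGC" * 300, "T" * 1200)
```

With these placeholder inputs, `cassette.flank_lengths_ok()` is true and `cassette.linear_fragment()` is the concatenation that would be excised as one piece using the unique restriction sites of the deletion plasmid.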
[0076] In some embodiments, a selectable marker is chosen so as to enable detection of the transformed microorganism. Any selectable marker gene that is expressed in the selected microorganism may be suitable. For example, with Aspergillus sp., a selectable marker can be chosen so that the presence of the selectable marker in the transformants will not significantly alter the properties of the microorganism. An example of a suitable selectable marker is a gene that encodes an assayable product. For example, a functional copy of an Aspergillus sp. gene may be used, which, if lacking in the host strain, results in the host strain displaying an auxotrophic phenotype. Selectable markers also exist for Trichoderma sp.
[0077] In some embodiments, a pyrG- derivative strain of Aspergillus sp. is transformed with a functional pyrG gene, which provides a selectable marker for transformation. The pyrG- derivative strain may be obtained by selection of Aspergillus sp. strains that are resistant to fluoroorotic acid (FOA). The pyrG gene encodes orotidine-5'-monophosphate decarboxylase, an enzyme required for the biosynthesis of uridine. Strains with an intact pyrG gene grow in a medium lacking uridine but are sensitive to fluoroorotic acid. Accordingly, FOA resistance selection can be used to select pyrG- derivative strains that lack a functional orotidine monophosphate decarboxylase enzyme, and thus require uridine for growth. Using the FOA selection technique, it is also possible to obtain uridine-requiring strains, which lack a functional orotate pyrophosphoribosyl transferase. These cells can be transformed with a functional copy of the gene encoding this enzyme (Berges & Barreau, Curr. Genet. 19:359-365 (1991), and van Hartingsveldt et al., (1986) Mol. Gen. Genet. 206:71-75). The selection of derivative strains is performed using the FOA resistance technique described above. In some embodiments, the pyrG gene is employed as a selectable marker.
[0078] In some embodiments, a pyr4- derivative strain of Hypocrea sp. (Trichoderma sp.) is transformed with a functional pyr4 gene, which provides a selectable marker for transformation. The pyr4- derivative strain may be obtained by selection of Hypocrea sp. (Trichoderma sp.) strains that are resistant to fluoroorotic acid (FOA). The pyr4 gene encodes orotidine-5'-monophosphate decarboxylase, an enzyme required for the biosynthesis of uridine. Strains with an intact pyr4 gene grow in a medium lacking uridine but are sensitive to fluoroorotic acid. Accordingly, FOA resistance can be used to select pyr4- derivative strains that lack a functional orotidine monophosphate decarboxylase enzyme, and thus require uridine for growth. Using the FOA selection technique, it is also possible to obtain uridine-requiring strains, which lack a functional orotate pyrophosphoribosyl transferase. These cells can be transformed with a functional copy of the gene encoding this enzyme (Berges & Barreau, 1991). The selection of derivative strains is performed using the FOA resistance technique as described above. In some embodiments, the pyr4 gene is employed as a selectable marker.
[0079] A single DNA fragment comprising a disrupted or deleted detrimental gene, for example one exemplified above, is then isolated from the deletion plasmid and used to transform an appropriate pyrG- Aspergillus or pyr4- Trichoderma host. Transformants are identified and selected based on their ability to express the pyrG or pyr4 gene product, respectively, and thus complement the uridine auxotrophy of the host strain. Southern blot analysis can be suitably carried out on the resultant transformants to identify and confirm a double crossover integration event, during which part or all of the coding regions of the genomic copy of the gene are deleted and replaced with the appropriate pyr selectable markers.
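The two-way selection logic described in the preceding paragraphs (FOA resistance to obtain uridine-requiring strains, then complementation of the uridine auxotrophy to recover transformants) can be summarized in a minimal sketch. The function name and the reduction of each condition to a yes/no flag are simplifying assumptions; real growth depends on concentrations, leakiness, and other factors not modeled here.

```python
def grows(pyr_gene_functional: bool, uridine_in_medium: bool, foa_in_medium: bool) -> bool:
    """Predict growth from the rules stated in the text (simplified to booleans).

    - A strain with an intact pyrG/pyr4 gene grows without uridine but is
      sensitive to fluoroorotic acid (FOA).
    - A strain lacking a functional gene is FOA resistant but requires uridine.
    """
    if pyr_gene_functional:
        return not foa_in_medium
    return uridine_in_medium


# Step 1: FOA plus uridine selects strains lacking a functional pyr gene.
mutant_selected = grows(False, uridine_in_medium=True, foa_in_medium=True)
parent_killed = not grows(True, uridine_in_medium=True, foa_in_medium=True)

# Step 2: medium lacking uridine selects transformants that regained the gene.
transformant_selected = grows(True, uridine_in_medium=False, foa_in_medium=False)
untransformed_fails = not grows(False, uridine_in_medium=False, foa_in_medium=False)
```

All four flags evaluate to true under these rules, which is why the same pyr gene can serve both for counter-selection of the recipient strain and for positive selection of transformants.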
[0080] Although the specific plasmid vectors described above relate to preparation of pyr-transformants, the present disclosure is not limited to these vectors. Various genes can be deleted and replaced in the Aspergillus sp. or Hypocrea sp. (Trichoderma sp.) strain using the techniques described above. In addition, a number of selectable markers are suitable, as discussed herein. In fact, any gene that has been identified can suitably be deleted from the genome of any host, e.g., Aspergillus sp. or Hypocrea sp., using the above-described strategy.
[0081] In certain embodiments, the host strains used may be derivatives of Hypocrea sp. (Trichoderma sp.) that lack or have a nonfunctional gene or genes corresponding to the chosen selectable marker. For example, if the selectable marker of pyrG is chosen for Aspergillus sp., then a specific pyrG- derivative strain is used as a recipient in the transformation procedure. In another example, if the selectable marker of pyr4 is chosen for a Hypocrea sp., then a specific pyr4- derivative strain is used as a recipient in the transformation procedure. In some embodiments, selectable markers comprising Hypocrea sp. (Trichoderma sp.) genes similar to the Aspergillus nidulans genes, including, for example, amdS, argB, trpC, or niaD, may be used. The corresponding recipient strain is accordingly a derivative strain such as an amdS-, argB-, trpC-, or niaD- strain, respectively.
[0082] DNA encoding the protein or variant of interest can then be prepared for insertion into an appropriate microorganism. According to the present disclosure, DNA encoding a protein or variant of interest may comprise the DNA encoding a protein or variant that has an activity of the wild type protein. The DNA fragment encoding the protein or variant of interest may be functionally attached to a fungal promoter sequence, for example, the promoter of the glaA gene in Aspergillus or the promoter of the cbh1 or egl1 genes in Trichoderma.
[0083] The DNA encoding the protein of interest or the variant of interest may be prepared by constructing an expression vector carrying the DNA encoding the protein or the variant. The expression vector carrying the inserted DNA fragment encoding the protein or variant of interest can, for example, be any vector capable of replicating autonomously in a given host organism, or of integrating into the DNA of the host, typically in the form of a plasmid. In certain embodiments two types of expression vectors for obtaining expression of genes are contemplated. The first type contains DNA sequences wherein the promoter, gene-coding region, and terminator sequence all originate from the gene to be expressed. Gene truncation may be obtained where desired by deleting undesired DNA sequences (e.g., coding for unwanted domains), leaving the domain to be expressed under control of its own transcriptional and translational regulatory sequences. A selectable marker may also be contained as a part of the vector allowing the selection for integration into the host of multiple copies of the desired gene sequences.
[0084] The second type of expression vector is preassembled and contains sequences useful for high-level transcription and a selectable marker. It is contemplated that the coding region for a gene or a part thereof can be inserted into such a general-purpose expression vector, placing it under the transcriptional control of the expression cassette's promoter and terminator sequences. A non-limiting example of such a general-purpose expression vector is pRAX in Aspergillus. The gene or variant gene of interest, or a part thereof, can be inserted downstream of the strong glaA promoter. Another non-limiting example of such a general-purpose expression vector is pTEX in Hypocrea. The gene or variant gene of interest, or a part thereof, can be inserted downstream of the strong cbh1 promoter.
[0085] In certain embodiments, in the vector, the DNA sequence encoding the protein or variant of interest is operably linked to transcriptional and translational sequences, for example, a suitable promoter sequence and signal sequence, in reading frame to the structural gene. The promoter is suitably any DNA sequence that shows transcriptional activity in the particular host cell and may be derived from genes encoding proteins either homologous or heterologous to the host cell. An optional signal peptide may provide for extracellular production of the protein or variant of interest. The DNA encoding the signal sequence is preferably that which is naturally associated with the gene to be expressed. However, signal sequences from any suitable sources, for example from an exo-cellobiohydrolase or from an endoglucanase of Trichoderma, are contemplated.
[0086] Protocols that can be used to ligate the DNA sequences coding for the protein or variant of interest to a promoter, and insertion of such a construct into suitable vectors are known in the art.
[0087] The DNA vector or construct described herein may be introduced into the host cell in accordance with known techniques such as transformation, transfection, microinjection, microporation, biolistic bombardment and the like.
[0088] For example, when a DNA vector or construct described herein is used to transform a fungal host cell, the permeability of the cell wall of Hypocrea sp. (Trichoderma sp.) to DNA can be low. Accordingly, uptake of the desired DNA sequence, gene or gene fragment is often minimal. A number of methods can be used to increase the permeability of the Hypocrea sp. (Trichoderma sp.) cell wall in the derivative strain (e.g., one lacking a functional gene corresponding to the used selectable marker) prior to the transformation process.
[0089] In certain embodiments, to prepare Aspergillus sp. or Hypocrea sp. (Trichoderma sp.) for transformation, protoplasts from fungal mycelium are prepared. See Campbell et al., Curr. Genet. 16:53-56, 1989. Mycelium can be obtained from germinated vegetative spores. The mycelium is treated with an enzyme that digests the cell wall, resulting in protoplasts. The protoplasts are then protected by the presence of an osmotic stabilizer in the suspending medium. Suitable stabilizers include, for example, sorbitol, mannitol, potassium chloride, magnesium sulfate and the like. Usually the concentration of the stabilizer(s) can vary between 0.8 M and 1.2 M (e.g., between 0.9 M and 1.2 M, between 1.0 M and 1.2 M, between 1.1 M and 1.2 M, etc.). In a particular embodiment, 1.2 M sorbitol is used as the stabilizer in the suspension medium.
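As an illustration of the stabilizer concentrations given above, the following minimal Python sketch converts a target molarity into the mass of stabilizer to weigh out and checks whether a concentration falls within the usual 0.8 M to 1.2 M range. The molar masses are standard reference values and are not taken from the disclosure; the function names `grams_needed` and `in_usual_range` are hypothetical and chosen only for this example.

```python
# Molar masses in g/mol. These are standard reference values, NOT figures
# from the disclosure (assumption made for this illustration).
MOLAR_MASS = {
    "sorbitol": 182.17,
    "mannitol": 182.17,
    "potassium chloride": 74.55,
}


def grams_needed(stabilizer: str, molarity: float, volume_l: float) -> float:
    """Mass of stabilizer (g) needed for `volume_l` liters at `molarity` mol/L."""
    return MOLAR_MASS[stabilizer] * molarity * volume_l


def in_usual_range(molarity: float, low: float = 0.8, high: float = 1.2) -> bool:
    """True if the concentration lies within the 0.8 M to 1.2 M range of [0089]."""
    return low <= molarity <= high


if __name__ == "__main__":
    # Particular embodiment of [0089]: 1.2 M sorbitol, here for 1 liter.
    mass = grams_needed("sorbitol", 1.2, 1.0)
    print(f"sorbitol for 1 L at 1.2 M: {mass:.1f} g")
    print("within usual range:", in_usual_range(1.2))
```

Magnesium sulfate is omitted from the table because its molar mass depends on the hydrate used, which the text does not specify.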
[0090] Uptake of the DNA into the host strain (e.g., Aspergillus sp. or Hypocrea sp. (Trichoderma sp.)) can often be dependent upon the calcium ion concentration. Generally, between about 10 mM CaCl2 and about 50 mM CaCl2 (e.g., between about 15 mM and about 45 mM, between about 20 mM and about 40 mM, between about 25 mM and about 35 mM) is used in an uptake solution. Aside from calcium ion, the uptake solution often includes a buffering system, such as a TE buffer (10 mM Tris, pH 7.4; 1 mM EDTA) or a 10 mM MOPS, pH 6.0 buffer (morpholinepropanesulfonic acid), and polyethylene glycol (PEG). It is believed that the polyethylene glycol in this buffer acts to fuse the cell membranes, thus permitting the contents of the medium to be delivered into the cytoplasm of the host strain (e.g., Aspergillus sp. or Hypocrea sp.), and the plasmid DNA is transferred to the nucleus. In certain embodiments, this fusion process leaves multiple copies of the plasmid DNA integrated into the host chromosome.
[0091] Usually, a suspension containing the Aspergillus sp. protoplasts or cells that have been subjected to a permeability treatment at a density of 10^5 to 10^6/mL, preferably 2 x 10^5/mL, is used in transformation. Similarly, a suspension containing the Hypocrea sp. (Trichoderma sp.) protoplasts or cells that have been subjected to a permeability treatment at a density of 10^ to 10^/mL, preferably 2 x 10^/mL, is used in transformation. A volume of 100 µL of these protoplasts or cells in an appropriate solution (e.g., 1.2 M sorbitol; 50 mM CaCl2) is mixed with the desired DNA. In some embodiments, a substantial amount of PEG is added to the uptake solution. For example, from about 0.1 to about 1 volume of 25% PEG 4000 can be added to the protoplast suspension. In a particular example, about 0.25 volume of 25% PEG 4000 is added to the protoplast suspension. Additives such as dimethyl sulfoxide, heparin, spermidine, potassium chloride and the like may also be added to the uptake solution and aid in transformation.
[0092] In certain embodiments, the mixture is incubated at about 0°C for a period of about 10 to about 30 minutes. Additional PEG can be added to the mixture to further enhance the uptake of the desired gene or DNA sequence. In certain embodiments, the 25% PEG 4000 can be added in volumes that are 5 to 15 times that of the transformation mixture; however, greater and lesser volumes may also be suitable. For example, the 25% PEG 4000 is added at 10 times the volume of the transformation mixture in some embodiments. After the PEG is added, the transformation mixture is then incubated either at room temperature or on ice before the addition of a sorbitol and CaCl2 solution. The protoplast suspension is then further added to molten aliquots of a growth medium. This growth medium permits the growth of transformants. Many growth media can be suitably used to grow the desired transformants in the present disclosure. In certain embodiments, for example, if Pyr+ transformants are being selected, it is preferable to use a growth medium that contains no uridine. For example, the colonies are transferred and purified on a growth medium depleted of uridine.
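The two PEG additions described in paragraphs [0091] and [0092] can be sketched as a small volume calculation. This is only an illustration under stated assumptions: the function name `peg_volumes` and its parameters are hypothetical, volumes are in whatever unit the protoplast volume is given in, and the "transformation mixture" is assumed here to consist of the protoplast suspension, the DNA, and the first PEG addition, which the text does not spell out.

```python
def peg_volumes(
    protoplast_volume: float = 100.0,  # volume of protoplast suspension
    dna_volume: float = 0.0,           # hypothetical: DNA volume is not given in the text
    first_fraction: float = 0.25,      # about 0.1 to about 1 volume of 25% PEG 4000
    second_fold: float = 10.0,         # 5 to 15 times the transformation mixture
) -> dict:
    """Return the two 25% PEG 4000 additions, in the same unit as the inputs.

    Assumption: the transformation mixture is protoplasts + DNA + first PEG.
    """
    first_peg = protoplast_volume * first_fraction
    mixture = protoplast_volume + dna_volume + first_peg
    second_peg = mixture * second_fold
    return {
        "first_peg": first_peg,
        "mixture": mixture,
        "second_peg": second_peg,
        # Flags only report whether the inputs match the stated ranges;
        # the text notes that greater and lesser volumes may also be suitable.
        "first_in_stated_range": 0.1 <= first_fraction <= 1.0,
        "second_in_stated_range": 5.0 <= second_fold <= 15.0,
    }


if __name__ == "__main__":
    for key, value in peg_volumes().items():
        print(f"{key}: {value}")
```

The range checks are returned as flags rather than enforced, since the disclosure allows volumes outside the stated ranges.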
[0093] At this stage, stable transformants may be distinguished from unstable transformants by their faster growth rate. Also, with a number of filamentous fungal hosts, such as, for example, Trichoderma, the formation of circular colonies with a smooth, as opposed to a ragged outline on solid culture medium lacking uridine can be used as a distinguishing feature. In some embodiments, further tests and selections of stability may be made by growing the transformants on solid non-selective medium (e.g., containing uridine), harvesting spores from this culture medium, and determining the percentage of these spores. The selected spores are allowed to germinate and grow on selective medium lacking uridine.
C. Introduction of a Recombinant Protein-Encoding Nucleic Acid Sequence into Host Cells.
[0094] The disclosure further provides cells and cell compositions which have been genetically modified to comprise a recombinant protein-encoding nucleic acid sequence. A parental cell or cell line may be genetically modified (i.e., transduced, transformed or transfected) with a cloning vector or an expression vector. The vector may be, for example, in the form of a plasmid, a viral particle, a phage, etc., as further described above.
[0095] The methods of transformation of the present disclosure may result in the stable integration of all or part of the transformation vector into the genome of the filamentous fungus. Transformation resulting in the maintenance of a self-replicating extra-chromosomal transformation vector is also contemplated.
[0096] Many standard transfection methods can be used to produce filamentous fungal, e.g., Trichoderma reesei, cell lines that express substantial quantities of the non-native or native protein. For example, published methods for introducing DNA constructs into enzyme-producing strains of Trichoderma include Lorito et al., 1993, Curr. Genet. 24: 349-356; Goldman et al., 1990, Curr. Genet. 17:169-174; and Penttila et al., 1987, Gene 61: 155-164. For introducing DNA constructs into enzyme-producing strains of Aspergillus, see Yelton et al., 1984, Proc. Natl. Acad. Sci. USA 81: 1470-1474; for Fusarium, see Bajar et al., 1991, Proc. Natl. Acad. Sci. USA 88: 8202-8212; for Streptomyces, see Hopwood et al., 1985, The John Innes Foundation, Norwich, UK; and for Bacillus, see Brigidi et al., 1990, FEMS Microbiol. Lett. 55: 135-138.
[0097] Other methods for introducing a chimeric construct (expression vector) into filamentous fungi (e.g., H. jecorina) include, but are not limited to, the use of a particle or gene gun, permeabilization of filamentous fungi cell walls prior to the transformation process (e.g., by use of high concentrations of alkali, e.g., 0.05 M to 0.4 M CaCl2 or lithium acetate), protoplast fusion, or Agrobacterium-mediated transformation. An example of such a method for transforming filamentous fungi by treatment of protoplasts or spheroplasts with polyethylene glycol and CaCl2 is described in Campbell, E. I. et al., Curr. Genet. 16:53-56, 1989; and Penttila, M. et al., Gene, 63:11-22, 1988.
[0098] Any of the known procedures for introducing foreign nucleotide sequences into host cells may be used. These include, for example, the use of calcium phosphate transfection, polybrene, protoplast fusion, electroporation, biolistics, liposomes, microinjection, plasma vectors, viral vectors and any of the other known methods for introducing cloned genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell (see, e.g., Sambrook et al., supra). Also useful is the Agrobacterium-mediated transfection method described in U.S. Pat. No. 6,255,115. It is important that the particular genetic engineering procedure used be capable of successfully introducing at least one gene into the host cell capable of expressing the non-native or endogenous (but in a non-native form) gene.
[0099] In some embodiments, chimeric constructs comprising a recombinant protein-encoding nucleic acid sequence can be transcribed in vitro, and the resulting RNA can be introduced into the host cell by known methods, e.g., injection.
[0100] The disclosure further includes novel and useful transformants of filamentous fungi such as H. jecorina and A. niger for use in producing fungal enzymes, variants thereof, and compositions comprising these molecules. The disclosure includes transformants of filamentous fungi especially fungi comprising certain recombinant protein coding sequence(s), or deletion of certain endogenous coding sequences.
[0101] Following introduction of a chimeric construct comprising the coding sequence for a protein of interest or a variant thereof, the genetically modified cells can be cultured in conventional nutrient media modified as appropriate for activating promoters, selecting transformants or amplifying expression of a recombinant protein-encoding nucleic acid sequence. The culture conditions, such as temperature, pH and the like, are those previously used for the host cell selected for expression, and will be apparent to those skilled in the art.
[0102] The progeny of cells into which such chimeric constructs have been introduced are generally considered to comprise the protein-encoding nucleic acid sequence found in the chimeric construct.
[0103] The disclosure further includes novel and useful transformants of filamentous fungi such as H. jecorina for use in producing fungal enzymes, variants thereof, or compositions comprising such molecules. For example, Aspergillus niger may also be used in producing the recombinant proteins and variants thereof. The disclosure includes transformants of filamentous fungi especially fungi comprising the coding sequence of a protein of interest or of a variant thereof, or deletion of certain endogenous protein coding sequence(s).
EXAMPLES
[0104] The present disclosure is described in further detail in the following examples, which are not in any way intended to limit the scope of the disclosure as claimed. The attached figures are meant to be considered as integral parts of the specification and description of the disclosure. The following examples are offered to illustrate, but not to limit the claimed disclosure.
[0105] In the experimental disclosure which follows, the following abbreviations apply: M (molar); mM (millimolar); µM (micromolar); nM (nanomolar); mol (moles); mmol (millimoles); µmol (micromoles); nmol (nanomoles); gm (grams); mg (milligrams); µg (micrograms); pg (picograms); L (liters); ml or mL (milliliters); µl or µL (microliters); cm (centimeters); mm (millimeters); µm (micrometers); nm (nanometers); U (units); V (volts); MW (molecular weight); sec (seconds); min(s) (minute/minutes); h(s) or hr(s) (hour/hours); °C (degrees Centigrade); QS (quantity sufficient); ND (not done); NA (not applicable); rpm (revolutions per minute); H2O (water); dH2O (deionized water); HCl (hydrochloric acid); aa (amino acid); bp (base pair); kb (kilobase pair); kD (kilodaltons); cDNA (copy or complementary DNA); DNA (deoxyribonucleic acid); ssDNA (single stranded DNA); dsDNA (double stranded DNA); dNTP (deoxyribonucleotide triphosphate); RNA (ribonucleic acid); MgCl2 (magnesium chloride); NaCl (sodium chloride); w/v (weight to volume); v/v (volume to volume); g (gravity); OD (optical density); HPLC (high pressure liquid chromatography); PAGE (polyacrylamide gel electrophoresis); PCR (polymerase chain reaction); RT-PCR (reverse transcription PCR); and SEL (site evaluation library).
EXAMPLE 1
Creation of Trichoderma reesei expression strains
[0106] Improved strains were created to increase the expression consistency of variants of interest, in this instance, CBH2 variants, such that the expression level is less variable across variants of the same amino acid sequences. In particular, T. reesei strains were developed in combination with a targeting vector to force integration of cbh2 variant genes (e.g., coding region in operable combination with a regulatory sequence). The new strains prepared during development of the present disclosure combine several mutations that are advantageous for screening variant libraries. A schematic of the genetic engineering steps is shown in Figure 1.
Deletion of ku80 from the T. reesei quad deleted derivative strain.
[0107] The quad deleted derivative strain is described in PCT Publication WO 2005/001036. A single orthologue of MUS52, the N. crassa orthologue of the human KU80, was identified by TBLASTN search in the genome sequence of H. jecorina QM6a (Trichoderma reesei) and was consequently named T. reesei ku80, protein id 58213, available at the U.S. Department of Energy Joint Genome Institute. The nucleotide sequence of the T. reesei ku80 gene is provided as SEQ ID NO: 13:
ATGGCGGACAAGGAAGCAACCGTCTTCATCATCGACCTCGGCGCGTCCATGC-CAGCTGTCAATG GGGGTCGAGAAGAATCCGACCTTGATTGGAGCATGAGCTACGTCTGGGACAAGATCAGCAACGT CGTGGCCTCGAATCGCAAGACGCTGTGCGTTGGCGTCGTGGGGTTCAGAACCGACGAGACAAAC CACACGCTGAGCGAGGATGGGTACGAGAACATCTCCATATTGCAGCCCCTGGGGCCGATGAGCA TGTCCAGCCTCAAGGCTCTTCAGCCCAAGGTGAAGCCGAGCAGGACGGTGGAAGGCGATGCCAT CTCGGCGATTG7CATTGCCGTCGACATGATTGACAAGTACACGAAGAAGAACAAATGGAAGCGG CAGATTGTTCTCATTACCGACGGCCAAGGCGAGATTGATCCAGATGATATTC-GCGACATTGCTA GAAAGA'rGCGCGAC'lCGAATA'I'lGAAT'lGACAGlC'riGlGAGTTGGCGAGACCGl'riGGCGGAC GGTAATGGTGCTGACGGTGATGCAAGGGGCGTCGACTTTGATGCTCCCGATTACGGCTTCAAAG AGGAGGACAAACCTTCAGTCAAGGTACTCCATATGTTCACTTCTTTTCTTTTTCTTCTTTATTT TCTTTTCTTTTGAAGCTTTCATTAACCTCTTCGTTAGAAGCAAAACGAAGAGACCCTAAAAAAG CTCGTGGATGGCTGTGGCGACGACTCAAGGTTCGCCTCCATGGTCGAGGCCATTGACGACTTGA ATGAGCCACGAGCAAAGTCGGTCAAGCCTTACAAAACGTACGAAGGTCTCTTGACCTTGGGAGA TCCGAAAAACGCTCCCGCAGTGGTGGAAATCCGCGTCGAGAGATACTTCAAC-ACCCATCTAGCC AGGCCACCTGCCGCCAGCACCGTGGTGGTCAAGGAGGAGCAAGCTGGGCCGTCTCAGGCAGACG AGGACGAACAGAIGGACGGAGCGGAAC'ITACAGCTGIGAGGCAGGCCAGGACAIACAAGGTCAA TGATCCAGATGCCCCTGGCGGTAAGCGTGACGTTGAGTTTGAGTCTCTGGCCAAAGGGTACGAG IACGGCAGGACGGCAGTCCACATCAGCGAGTCTGATCAAAACGTCACCAAGCTCGCGACAGAAA AGAGCTTCAAGATCATCGGCTTCGTCCAGAAAGAAAAGGTATTGGCTTGGCTCTCAGCATTTGA CCCGTTGCTCTTGGCTAACCCTTGTTTAGTATGAAATGCTCCTTAATCTTGGCGAAACCTGCGT TACCGTTGCATCCAAGTACGATGAAAAGTCTGAGCTGGCTTTTAGCTCTCTGGTGTGGGCGCTC TCGGAGCTCGACGCCTACGCCGTGGCCCGCCTAGTAACTAAGGACCAAAAGGACCCCATGCTGG TGTTACTGATGCCGTATATGGAGCCTGATTATGTTTGTCTCTATGATGTGCCTCTGCCTTTCGC AGAGGACATCAGGACGTACCAGTTICCTCCCTTGGACAGAGTCGTTACCGTCAGTGGCCAAACG CTCACCAACCATCGCCTATTGCCATCCGACGAGCTCAACCAAGCGATGAGCGACTACGTAGATG CCATGGACATT7CAAGTTATGGTATCGATGAAGATGGGTGAGTATAGAAGATGATTGTTCAAAT CTTTCACTTCTAAGCATTGCTTCTGATCTAGGCAACCGGCTGAATATGCCACCATCGATGAGTT ATACAACCCTGCGATACATCGCATAGGCCATGCGATCAAACAACGAGCGATCCACCCAGAGAAA CCCGTGCCCGAGATCCCCCCAGTCTTGCTTAGATTCGCAGCACCCCCGACAGAACTCGTCGAGA CTGTGCAGCCTCATATCGATGCACTGATTCACGCTGCAGACGTGAAGAAAGC-TACTGATTCCAT 
TACATATGCTTCTCTGCACACTGATGTTTGATTTC-TGCTAACGCCCCCCTTAGTGCCGCCCAAG GCCAAGGGCAAGCGCCAAAGAGAAACAGTTAAACCCATCTCGGGACTGGATC-TGGATGCCCTTC TGGGAGAAGAGCAGAAAGGTTCCATTAGTCCGGAC-AATGCCATTCCGGACTTCAAACGAGCCCT CAACTCGTCCGAAGAAGTCGAGCAGATTGCCGACGCCACAAAACAAATGGGGGCCATTGTGCGG TCTCTCATTACGGACAGCTTCGGGGATAGCAAATATGCCCAGGCAATGGAAC-GCATTGGTGCGA
TGCGTGAGGAGCTGATCAACCTGGAAGAGCCTGGCCTGTACAACGACTTTGTGCGCGACTTGAA
CAAAAGTTTGCTATCTGCAGCCTTGCGTGGTGACACGCGACATTTCTGCTTCAAGATGAGCTCG
GCGAAGCTGGGCCTGATTGACAAGAAACAGTCGGAGGTGTCTTCGGTCACTCTTGAGGAGGCGG
ACGAGGTGAGTGGTGCAGCATGCTGTCGGATTATACGGAGGTTGTTTGCTAACTTGTGGGATAG
TTTTACAAGTCGAGGTGAGGTATCTACG7TGACCAAGAATGGGACCATGTATATGAGCGGTG7A
ACAACAGAATCCTGTGCTTTGAGCATTG7ATGA
[0108] To delete the T. reesei ku80 gene from the quad deleted derivative strain, standard methods as generally described in, for example, PCT publication WO 2005/001036, were adapted. Briefly, a ku80 deletion cassette was utilized that employed a selectable marker flanked by 1.3 kb of 5' ku80 sequence and 2.3 kb of 3' ku80 sequence, as schematically shown in Figure 2. The variant T. reesei als, which confers resistance to the herbicide chlorimuron ethyl, was used as the selectable marker. See, e.g., PCT Publication WO 2008/039370. The nucleotide sequence of the ku80 knockout cassette is 7685 base pairs in length: bases 1-1271 correspond to the 5' ku80 homologous region; bases 1280-7685 correspond to the als chlorimuron ethyl resistant variant (A190D); and bases 5381-7685 correspond to the 3' ku80 homologous region. The nucleotide sequence of the ku80 knockout cassette is provided as SEQ ID NO:1:
GGCCGCCTCAACACCCACACTCGAGGCACACGAGTTCATCGGCGGCTTCCCCCACAAGCTCTCG
GCCAACCTGCTACCGGCTCTCTCGCGAGACTTCCCAAAGCCTACAAACGAGGTCGACGTCAAGG
AGGCCCTCCtACtCGCCAGCCCGGCAGATGGACtCCTCCAGGGCCAGATCAAGGCCAACAACATGAG
AGCCCAGAGCGCCGCACTCCGGCTCGACGACAAGGAGGGCAAGGCGAGAGCCTTTGAGGAGGCC
AAGCGCGAGCTACTGGCGTATCACCACAGCGCCCTGCGGAAGCCTTCCGGCGCAAGATAATGAG
CTTGATCGCAATGACGAGTTCACGTACGCTTTGCCATATTGTTGTTGCTTTTTGTTTGGTCCTA
CATGTACGGCGCATTGGTTGGGAGGATATACCCACGGAGAGTGTCCGAGTGGCTTCTGGGATTT
AGAGCGTCA'J.TAGCAGGATAGAGATGGTTGGCCAGGGGAATGGAA'iTGACTT'J.TCACTACAAGG
AACTTGTTCAC7CTGGTGTTGATTCCCATTGCGTGACTGGTAGTAGGGAGGAATGCTTTTACTT
TGTGCCACTAGACCGCAGAGAAGGGTTGGTTGCAAGCGGGGTCCGTGTATACCGACCAAGAGTG
ATGGGCATACAGCAACGTTTCTGAACGACTTCATTTTGTCCGAGTCTACTGGATGCGAGATGCC
AGCGTGAAGCCGTACGCCACCAGGGCGACGAACTCGACAAGGTTGACGAGGGAGGAGATGCCGT
GCAGCATGCCAAACTTCTTGTTGAGGGCACGCATCTCATCCGACTGTGCATCCTTGTCATACCA
CTCCTTTCCGTCTCGCTTGGCTGGTGGGAGGGTTCAACAAATCCATCGTCAC-CCATCCGGGGTC
TCAAATCAATGGCGTGCATGCGGAGTCGGGCTTGAGGCTAACCTTGTCCATC-GCGGTCCTTCAT
GGTCTTGACAGGGGCGGGAAGCAGCACGGCGAGGTTGACGAGGCCGCTGACGAACATGGTTGCG
ATGGGCACCAAGGAGCTCCACTTGTTGGGAGCGTCGACGAGGCCGCCGATGCCGCCCTTGATGC
CCAAGAGGGCGTTTCCGGGGAACGTGAGGGCGAGCAGCGCGGGGATGGCCGTCTGCATGCCAAA
GTAGATGGGGAACAGCTTGCTCTGGATGGCGGAGAAGGAGGGCCGGCTGACC-GTGCGGAACATG
ACGATGCCGTTGACGAAGGACTGCAGTAGCGTAGTGTGATGGTAAGCAGCTGGCCGGCGCGCCT
GAGACAATGGCCGGCAATGGTAAAAAGGACCAAGATGTACTAGGTAGTTGCAATGTGGCTTATT
ACCTACCTACTACCTGGTAGGCACCTACTAGGTACTTGGGTAGACGGACAATGAAATTTGAAGT
CGGGGTTGCAGGAAAGCAGGGCGCTGGACACATTC-TGCTTCAGGCGGTACCCGTCGTCATCGTC
AGCCAATGTCGAGGCCCGGCAGCCGGAGGAGCGAGACAACCTTGGCCGGAGGAGCCCGCAGGTA
CCTGCCAAAGCGCGGCTGGTACCTCTCAACCCTCTCAGGCCTGTTGGATGCCCTATGACATGCC
CTGGGGGATGCAGCTGTTGCCCCGGCCCCGCACTTTCGGGTGACCGCGAGGCTGCTGATTGGCT
GGTTGCCACGGGCTGGGCGGTCCCTGAAGTTGTTC-CCATCTGAACTCTGTCC-GCGCTGGCGTCG
GCTGCGCCCAA7GGGAGGCGAGACAACTCAGGGTACTAGAATCACTGACAGAAGAAGAGAATCG
AAACTACGTACACACCCAATTCCTTCCATCCCACCCAACCCCACACCACAAAAATTCACTACCC
CACAATCAGGCACAGTAAGTAGGGCACAGTACGTATGTACAGACAAGGCGCAAGCGATACTGCG
CGACCCGGTACCTCGCCGGCTTGACACGTGCGACAGGCTACTTTACTAGTATTCGCAGCGGCGG
GTCGOGOATTATTACATGTACTGTGCOGCCATTTGATGACTGGGCTGCTGOAGTATTAGTAGAT
CTGCCCGGCATCGCCCTTCCATGGGCGCGACCCGGGACTGGACCCTCTGACTCTACCTACATGT
ACCTAGGCCGGGCCGGGCTTGGTGACTTTTGTCCGATCAGGTCGTTCGCCTC-GCTACCTATTAT
TTCTCTTTCTTCTTCTCCATCCTGCTTCTGGCCTTGCAATTCTTCTTCGCCACTCCTCCCTCTT
CCCCCCGCGATACCCTTGAATTCGTCAGAGAGGAAAAGACGAGAAAAAAAAGGGCAGCAGAGAC
GTCGGTCTGGCTCZlCGTGCTGCATCTCTGCGCACTCTCATTTTTTTTZGTTGTCCGACCCCTCCC
TCAACCTTCTCCTTCGTTGACAGGCTAAGCCTTGCTTCGACGCTCTCTCTTTGAATTTTTCTAC
TTCTACCTTCTTTTCTTGCGTGTTACCCACCATAGCTCGATTCACGATGCTCCGAAGTCGCCAA
GTCACAGCCAGGGCCGTCCGGGCTCTGGGCCAGGCGCGCGCCTTTACCTCGACGACCAAGCCTG
TCATGATCCAGAGCAGCCAGAGGAAACAGGCCAACGCCAGCGCTGCTCCGTAAGTCGCCCATTG
CCATTGCATCTTCTGTTTGATATATACTTCCTGCTGCTTGCGTGGCGTCGTCTCTCGGTTATGC
GTGTCAAGGACCAGGTGTGTTCGCATCGTGGTTTTCCAGCGCCGATTACCGGGGGACGAATTTT
TGGCTGCTCAACTCGCGCGCGCGCATTCTGATTCTTCGTTTTCAATCTTGAC-CGACAACTGGCT
AACATAATGGCCATTGGCAATTGCTTCACACAGACAAGTCCGCCCTGTACCGAGCCCTGCTTTC
AACGCTGAAGACAAAGACCGCAGCCATGTGCAGCCTCTGGTCAACCCGTCGAAGCCCGACATGG
ATGAATCGTATGTCCACGTCCCCTCGTCCCGCCCTACAAAATGAACACGATTACACCAGAATTT
TTGCAACAATCGACACTTCTATAACAGACCAATTGAGCTTTGTTCTGACCAATCATGTTGCTCT
AGATTCATTGGCAAAACCGGAGGCGAAATCTTCCACGAGATGATGCTGCGACAGGGTGTCAAGC
ACATTTGTAGG7TCCGATGCCGGCCGCCCACACGGGCTCCATCCTTGCTCCATCTCTCCAGCTA
GGCAAATCTCGCTAACCTTGAGTCACCATCCAGTCGGATACCCTGGCGGCGCTATCCTGCCCGT
CTTCGACGCCA7CTACAACTCAAAACACTTCGACTTCATCCTGCCCCGTCATGAGCAGGGAGCT
GGCCATATGGCCGAGGGCTATGCCCGTGCCTCGGGCAAACCCGGTGTCGTCCTGGTGACTTCCG
GCCCCGGTGCTACCAATGTCATCACGCCCATGCAGGATGCCCTGTCGGACGGAACGCCCTTGGT
CGTCTTCTGCGGCCAGGTCCCCACCACGGCCATCGGCAGCGATGACTTCCAAGAGGCCGACGTC
GTGGGCATCTCGCGGGCCTGCACCAAGTGGAACGTCATGGTCAAGAGCGTTC-CTGAGCTGCCGC
GGAGAATCAACGAGGCCTTTGAGATTGCCACCAGCGGCCGCCCTGGCCCCGTCCTCGTCGACCT
GCCCAAGGATGTCACGGCTGGTATCCTGAGGAGAC-CCATCCCTACGGAGACTGCTCTGCCGTCT
CTGCCCAGTGCCGCCTCCCGCGCCGCCATGGAGCTGAGCTCCAAGCAGCTCAACGCCTCCATCA
AGCGTGCCGCCGACCTCATCAACATCGCCAAGAAGCCCGTCATCTACGCCGGTCAGGGTGTCAT
CCAGTCCGAGGGCGGCGTTGAGCTCCTGAAGCAGCTGGCGGACAAGGCCTCCATCCCCGTCACC
ACCACCCTCCATGGCCTGGGTGCCTTTGATGAGCTGGACGAGAAGTCGCTGCACATGCTGGGCA
TGCACGGCTCGGCGTATGCCAACATGGCCATGCAGCAGGCCGACCTCATCATCGCCCTCGGCAG
CCGATTCGACGACCGTGTTACTCTGAATGTCTCCAAATTTGCGCCTGCAGCCAGGCAAGCTGCT
GCCGAGGGCCGCGGCGGCATCATTCACTTTGAGATCATGCCCAAGAACATCAACAAGGTCATCC
AGGCGACCGAGGCCGTCGAGGGCGACGTCGCCACCAACCTGAAGCACCTCATTCCCCAGATTGC
CGAAAAGTCCATGGCGGACCGAGGAGAGTGGTTCGGCCTCATCAATGAGTGGAAGAAGAAGTGG
CCCCTGTCAAACTACCAGCGCGCGGAGCGGGCTGGCCTCATCAAGCCGCAGACGGTCATGGAGG
AGATTAGCAACCTGACGGCCAACCGAAAGGACAAGACGTACATTGCCACGGC-TGTCGGCCAGCA
CCAGATGTGGGTTGCCCAGCACTTCCGCTGGAGGCACCCTCGATCCATGATTACCTCTGGTGGT
CTGGGCACCATGGGCTACGGTGTCCCCGCGGCCATTGGCGCGAAGGTGGCCCAGCCCGACGCTC
TCGTAATTGACGTTGATGGCGATGCCTCGTTTAACATGACGCTGACGGAGCTGTCGACTGCTGC
ACAGTTCAACA7TGGCGTCAAGGTGGTTGTGCTCAACAACGAGGAGCAGGGCATGGTGACGCAG
TGGCAGAACCTCTTTTACGAGGACCGATATGCCCACACGCACCAGAAGAACCCCGACTTCATGA
AGCTGGCCGACGCCATGGGCGTTCAGCACCAGCGCGTGACGGAGCCGGAGAAGCTGGTCGATGC
CCTGACGTGGC7GATCAACACCGATGGCCCGGCCCTGTTGGAGGTTGTCACGGACAAGAAGGTG
CCTGTCCTGCCCATGGTGCCCGCCGGATCGGCCCTGCACGAGTTCCTCGTCTTTGAACCTGGTG
AGTCTACTTCAGACATATTGCTTGCGCATTGCAGATACTAACACTCTCACAGAAAAGGATAAGC
AGCGCCGTGAGCTGATGAAGGAGAGAACAAAGGGTGTGCACl'CCTAAAGCGATGATG'l'CTGCGA GGGGTTCTTCGTTGAACCCTAGTTCAGGCACCATCTTACCCICTTATTTTTTCCCGTGGGCTTT CATTTTGTGTCATCCGAGCATGACGTTGTAGGGTTGGAGTTTCTTCCTTTTTATCTTGTCATTT ACTGGTACCCAGAGGCGCGAGACTAGGCTTCCATC-TTTTGTTTTGCGACTTTCAAAAAGTACTT TTAGTGGTTTGGGGCACGACGAGGGGGGGCAACCTCTTCTGTCGAAAAAGGTGGCTGGATGGAT GAGATGAGATGAGATGAGGGTGAAGATAGATACCTGCAGTGTTTTTGACGCGACGGGATGGCGA TCGCAGCACCCCCGACAC-AACTCGTCGAGACTGTC-CAGCCTCATATCGATGCACTGATTCACGC TGCAGACGTGAAGAAAGGTACTGATTCCATTACATATGCTTCTCTGCACACTGATGTTTGATTT GTGCTAACGCCCCCCT'IAGTGCCGCCCAAGGCCAAGGGCAAGCGCCAAAGAGAAACAGTIAAAC CCATCTCGGGACTGGATGTGGATGCCCTTCTGGGAGAAGAGCAGAAAGGTTCCATTAGTCCGGA GAATGCCATTCCGGACTTCAAACGAGCCCTCAACTCGTCCGAAGAAGTCGAC-CAGATTGCCGAC GCCACAAAACAAATGGGGGCCATT3TGCGGTCTCTCATTACGGACAGCTTCC-GGGATAGCAAAT ATGCCCAGGCAATGGAAGGCATTGGTGCGATGCGTGAGGAGCTGATCAACCTGGAAGAGCCTGG CCTGTACAACGACTTTGTGCGCGACTTGAAGAAAAGTTTGCTATCTGGAGCCTTGGGTGGTGAC AGGCGAGATTTCTGGTTCAAGATGAGGTGGGCGAAGCTGGGCCTGATTGACAAGAAACAGTCGG AGGTGTCTTCGGTCACTCTTGAGGAGGCGGACGAC-GTGAGTGGTGCAGCATC-CTGTCGGATTAT ACGGACGTTGTrTGCTAACTTGTGGGATAGTTTTACAAGTCGAGGTGAGGTATCTACGTTGACC AAGAATGGGACCATGTATATGAGCGGTGTAACAACAGAATCCTGTGCTTTGAGCATTGTATGAT ATGATTATTGA7GAACCGGACAAAAGGGGGTAGGGGATTGATGCCATCACGACCGATTGACCAG ACCTGGATTCTCGCACAGCATGGCTGCTGATTTTC-TTGACCTTGCGACGTAACATCCCTGAAGA ACAACCTACTATTAACCTATCATTTAGCAGAAGCTCTGTAACCTTCTTGATTCTTGTATTCAGC TTCTGAGTCTCtTCAAATGTAATCATTTCGACtGTTC-TGTAATTCCGGCCAAGCAGGCGGCCGTCT GCCAGCGCCTGCCTAGGCTGCACCGCAATCTGCCCAATCAGCTGCCCTTCAC-TTTCGTTTGACC TTGCAGCTGCCCTTCATCCTTTATCTGCACACAATTCTTTTTCCTCTGCTCTGCGCATTCTTCT CTCTCTCGTCTCCCTTCTCAAGCTCAACTTCACCTCATCCGCTCCACTACAAGCCCTCCCGTCG TCGTCTCGCATCCTCATCTCGACTGCGGCCAGCAAAACAAGCAAAGCCGTGATCGATCCTCAGC ATGGCTACCTTCAACCTCACCGTCCGCCTGGAGATGCTCAAAGAAATTGGAATCACCGTCCAAT ACCtGCGAGCATGTAGCGAAAGAAGCAGCCACtCAACGAAGCAGCGATGGCATTCGAAGAAGAAGA AGAGTTCCCCGCCGTTGTGCCGCCCAAGGCAGAACAGCACGCCTCTGAACACGACGCTGGCCAC GATGCTTGGGACGCGGCTGCCCACATCTCGACTTCGGCGCAAGAACAGCAGAAGCCCCAGGAGA 
TGGACGACTCG7CTATCGTGATGCCGCTGGACTACTCCAAGTTTGTCGTTGC-AGAGCCTGCGGA CGAATCCATCAGCTTTTGCTCGTGGAAGGTCGTCGAGGCTTATCCTGACCAC-TTTATCGGCAAG GCAAACAGGCCrCGTGTATGTAGCGATTGCTTTCTCTGCATTATGGGAATCTCAAGAGAGTATG GTAGAAGATAACTGACAACTTGCA5GCCAAGCCGTACTTTGACAAGATTTTC-GAAGACAGAGTC TGGGATTTGTGAGGATCTTGATTGATGTGCATATGGCGACATGCCTGCTAATATCATTGTAGCT TCTATCTCTACAACCCCGAGAAGCCTTCAGAGAAC-CCTCGCGTGCTGGTGCCCACTGTTCAGCT CGAAGGCTTTC7CAAAAGCATCAACAGAGCGCTCGGTACTTCTCTCACCATTCCAGGAGGGGCA AACCAGGACCGTTTTTATCTGAGGTTCGGCCAGGGAGACACCCCAAGGCCTCGATATCTACAGA GGTCGAGAGACCAGAAATCCCTAAAGATTGAAACGTTCCCCGATTTTCAACAGGCGGACTACGA CAGCTTTAGGAACGCGCATGGCGCCATCCAGGAGC-ACTGGTTGAAGAACTGGCAGATGCTGGTA CCTCGGCCGAGTTTCGACAAGAAGAAAAATGCAGACAAAAGAGCAGCCAAGAGAAGGCTCGAGC GAGAGCGAATGCTTCACAATACGCAGGAATTTCTTCATTTGGCAGGTAAGGGCAAAGGGGCTGA CGTGG.
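The coordinate layout stated for the ku80 knockout cassette in paragraph [0108] can be tabulated with a short Python sketch. It records the three base ranges exactly as printed, computes each region's length using 1-based inclusive coordinates (an assumption about the numbering convention), and reports any ranges that overlap. The names `Feature`, `KU80_CASSETTE`, and `overlapping_pairs` are hypothetical and exist only for this illustration; the sketch does not correct or reinterpret the printed figures.

```python
from itertools import combinations
from typing import List, NamedTuple, Tuple


class Feature(NamedTuple):
    name: str
    start: int  # 1-based, inclusive (assumed convention)
    end: int    # 1-based, inclusive

    @property
    def length(self) -> int:
        return self.end - self.start + 1


CASSETTE_LENGTH = 7685  # base pairs, as stated in [0108]

# Ranges copied as printed in [0108].
KU80_CASSETTE = [
    Feature("5' ku80 homologous region", 1, 1271),
    Feature("als chlorimuron ethyl resistant variant (A190D)", 1280, 7685),
    Feature("3' ku80 homologous region", 5381, 7685),
]


def overlapping_pairs(features: List[Feature]) -> List[Tuple[str, str]]:
    """Return name pairs of features whose base ranges share at least one base."""
    return [
        (a.name, b.name)
        for a, b in combinations(features, 2)
        if a.start <= b.end and b.start <= a.end
    ]


if __name__ == "__main__":
    for f in KU80_CASSETTE:
        print(f"{f.name}: {f.start}-{f.end} ({f.length} bp)")
    print("overlaps:", overlapping_pairs(KU80_CASSETTE))
```

The computed flank lengths (1271 bp and 2305 bp) agree with the approximately 1.3 kb and 2.3 kb flanks described in the text. The marker range as printed extends to base 7685 and therefore spans the 3' homologous region; the sketch simply reports this.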
Creation of the Archv2 strain from the T. reesei Δku80 quad deleted derivative strain.
[0109] First, the pyr2 gene was deleted from the ku80 knockout strain. The pyr2 deletion cassette contained the T. reesei cbh1 promoter, a hygromycin resistance gene, and a partial amdS selectable marker flanked by 5' and 3' pyr2 sequences, schematically depicted in Figure 3. This permitted screening for resistance to hygromycin and fluoroorotic acid among pyr2 knockout transformants. The partial amdS gene contained the 3' portion of the gene, but lacked a promoter and the amino-terminal portion of the coding region, and was consequently nonfunctional. The nucleotide sequence of the pyr2 knockout cassette is 9259 base pairs in length: bases 1-1994 correspond to the pyr2 3' homologous region; bases 2002-3497 correspond to the T. reesei cbh1 promoter; bases 3563-5449 correspond to the hygromycin resistance selectable marker; bases 5450-7440 correspond to the A. nidulans amdS 3' partial marker; and bases 7441-9259 correspond to the pyr2 5' homologous region. The nucleotide sequence of the pyr2 knockout cassette is provided as SEQ ID NO:2:
ATCACGCCCTCGCATAAAAGACCCTCAAGAGTCCATGTGCCCTATCTGCCTGATCTTCCTAACC
CTTATTTAACATTGGCCCTATCACAACCTAGTTCTTCTTCAGCCTGCTTTGTCAACACTTGTCA
CGGTTCAACTCAACGTAATCAGCAGGTAGCAGGACGAGGATAGGGAGAGAAACGAAGAGAAGAA
GAGGAGAGAGGAAGAAAAAAAAAAGAAAAGAAAGAAAAAGGGAAAAGGAAAC-AAGGAGGAAAAG
AGAAGAAAGTCAGATGAAGAAGCAAGAAGACGCCATGGTAGCCACCGCTTGTCAGGGCTCCTTA
GCAACGAACAACTCTAGCTTGGGGACTTGTCGATGTGTCGTTTCCTTCCTACCCATCAGCACCA
ACGATGAGTTCGATATAGACGAGGACCTCATGGAAGTAGAGACCATTGGGTTCGACAGGATCTC
TCAGTTTCACTTCTATGAGGTCTGTCGCTCGGATGACTTTTTGAGGAGCTTCCCCTTCTGCTTC
AACCCCAAACTCTCTTTCCTGAAACCGCAGCACGTTGGCACGGCCGTGTTGCTGGAGCAGTTTG
CTTTCGAGCACTCTCAGCGTGGTTTCAGCAGCCCACTGGTGAGTGGCCTCCTTTGACGTCCACA
CCTTGCTCCTGTCGCATGCGTATCTGGTGGGAACGACTGCTCCAAGGAGGATTGCTAACGAGGT
TGTAGGCCGAA7ATCGCATCAGATTCTCCGGTAACCTTAGCTACGGCCTCTTCAACATCTGTGA
CATGACGGAGCGCAAGTACTGGTGGTTGGCGACCAAGATGCGCGGCTGGAACATCGACGGCTGC
CCCGAAGACGTCAGGAGACTCATGTTTGTTCACATCATCGCCACCCTGGGATGCAGCCCCGTCG
TGACGGATGAAGACATGGACTACCCCAAGAACTGGGCGGCAATTCTCCACGGTAGAGACAGATA
TCCGAGTGAACCTGTGGGCCACCGGCCTCATGGGCGCACCATCTGCCTCCACTCGGTGGCCGTC
TGCCCTCGTCTCCAGGGCTTGGGTCTCGGTACTGCGACTCTGAAGTCGTATGTGCAGCGCATGA
ACAGCCTCGGCGCCGCGGACCGTGTTGCTCTCGTTTGCCGCAAGCCCGAGACGAGATTTTTTGA
AAGATGCGGCTTCAGGAACAGCGGCCGGAGTAGTATCAAGACTCTGGTCGGCGAATACTACAAC
ATGGTGTGTGCTTCCACATCGACTTGGCCAGACTCTATACGATTTTCAAACCTCGCTATACGTC
ATATTGACTTGrTTCTTTAGGTCTTCGATTTGCCCGGGCCCAAAGACTTTATCGACTGGAATAG
CATTGCCGACGCTGCCAAGAAGATGTGAACCATTTGACTGATACGATGTGTGCTACGCATGTCG
ACCTTCTTTGTTTGTTTCTTTGGCGGCTCTTTGTATACCTTGGGACACGGCAGACGCATGTCTA
TGTGAAGAAAACGTTCACGGCGCTGTTTGCATCAC-GAATATGATCATTAAACATGGAGCGTAAT
GGTATTAATGATCAACTAGAAAAATGGTATGGAAC-GGCGAGAGGGCGATCAACAAAGCAGCCCG
GGGCATAGTCTGGAAGCAGCAGGAATTGGAAGGGAAAAGGAAGCTGCACAATGAAGGGATATCG
TGAGCGGAGTGGCTCACGAGAGTATCAACAGACTC-GCGAAAGCAAGCAATTC-CCAACGCCGGCT
ATTAGGCCATAAGATGGCCTGTTGTGAGTCCCAGTTGCACGTATCCCCATATGACTGCTCTGTC
GCTGACTTGAAAAAAAATAGGGAGGATAAAGGAGAAAGAAAGTGAGACAACCCGTGAGGGACTT
GGGGTAGTAGGAGAACACATGGGCAACCGGGCAATACACGCGATGTGAGACGAGTTCAACGGCG
AATGGAAAATC7TGAAAAACAAAATAAAATAACTGCCCTCCATACGGGTATCAAATTCAAGCAG
TTGTACGGAGGCTAGCTAGAGTTGTGAAGTCGGTAATCCCGCTGTATAGTAATACGAGTCGCAT
CTAAATACTCCGAAGCTGCTGCGAACCCGGAGAATCGAGATGTGCTGGAAAC-CTTCTAGCGAGC
GGCTAAATTAGCATGAAAGGCTATGAGAAATTCTGGAGACGGCTTGTTGAATCATGGCGTTCCA
TTCTTCGACAAGCAAAGCGTTCCGTCGCAGTAGCAGGCACTCATTCCCGAAAAAACTCGGAGAT
TCCTAAGTAGCGATGGAACCGGAATAATATAATAC-GCAATACATTGAGTTGCCTCGACGGTTGC
AATGCAGGGGTACTGAGCTTGGACATAACTGTTCCGTACCCCACCTCTTCTCAACCTTTGGCGT TTCCCTGATTCAGCGTACCCGTACAAGTCGTAATCACTATTAACCCAGACTGACCGGACGTGTT TTGCCCTTCAT7TGGAGAAATAATGTCATTGCGATGTGTAATTTGCCTGCTTGACCGACTGGGG CTGTTCGAAGCCCGAATGTAGGATTGTTATCCGAACTCTGCTCGTAGAGGCATGTTGTGAATCT GTGTCGGGCAGGACACGCCTCGAAGGTTCACGGCAAGGGAAACCACCGATAGCAGTGTCTAGTA GCAACCTGTAAAGCCGCAATGCAGCATCACTGGAAAATACAAACCAATGGCTAAAAGTACATAA GTTAATGCCTAAAGAAGTCATATACCAGCGGCTAATAATTGTACAATCAAGTGGCTAAACGTAC CGTAATTTGCCAACGGCTTGTGGGGTTGCAGAAGCAACGGCAAAGCCCCACTTCCCCACGTTTG TTTCTTCACTCAGTCCAATCTCAGCTGGTGATCCCCCAATTGGGTCGCTTGTTTGTTCCGGTGA AGTGAAAGAAGACAGAGGTAAGAATGTCTGACTCGGAGCGTTTTGCATACAACCAAGGGCAGTG ATCtGAAGACACtTCtAAATGTTGACATTCAAGGAGTATTTAGCCAGGGATGCTTGAGTGTATCGTG TAAGGAGGTTTGTCTGCCGATACGACGAATACTGTATAGTCACTTCTGATGAAGTGGTCCATAT TGAAATGTAAAGTCGGCACTGAACAGGCAAAAGATTGAGTTGAAACTGCCTAAGATCTCGGGCC CTCGGGCCTTCGGCCTTTGGGTGTACATGTTTGTC-CTCCGGGCAAATGCAAAGTGTGGTAGGAT CGAACACACTGCTGCCTTTACCAAGCAGCTGAGGC-TATGTGATAGGCAAATC-TTCAGGGGCCAC TGCATGGTTTCGAATAGAAAGAGAAGCTTAGCCAAGAACAATAGCCGATAAAGATAGCCTCATT AAACGGAATGAGCTAGTAGGCAAAGTCAGCGAATGTGTATATATAAAGGTTCGAGGTCCGTGCC TCCCTCATGCTCTCCCCATCTACTCATCAACTCAC-ATCCTCCAGGAGZiCTTC-TACACCATCTTT TGAGGCACAGAAACCCAATAGTCAACCGCGGACTGCGCATCATGTATCGGAAGTTGGCCGTCAT CTCGGCCTTCT7GGCCACACCTCGTGCTAGACTAGGCGCGCCAGGAAGCCCGGAAGGTAAGTGG AITCITCGCCG-'GGCTGGAGCAACCGGTGGATTCCAGCGTCl'CCGACTTGGACTGAGCAATTCA GCGTCACGGAT^CACGATAGACAGCTCAGACCGCTCCACGGCTGGCGGCATTATTGGTTAACCC GGAAACTCAGTCTCCTTGGCCCCGTCCCGAAGGGACCCGACTTACCAGGCTGGGAAAGCCAGGG ATAGAATACACTGTACGGGCTTCGTACGGGAGGTTCGGCGTAGGGTTGTTCCCAAGTTTTACAC ACCCCCCAAGACAGCTAGCGCACGAAAGACGCGGAGGGTTTGGTGAAAAAAGGGCGAAAATTAA GCGGGAGACGTATTTAGGTGCTAGGGCCGGTTTCCTCCCCATTTTTCTTCGGTTCCCTTTCTCT CCTGGAAGACT7TCTCTCTCTCTCTTCTTCTCTTCTTCCATCCTCAGTCCATCTTCCTTTCCCA TCATCCATCTCCTCACCTCCATCTCAACTCCATCACATCACAATCGATATGAAAAAGCCTGAAC TCACCGCGACG7CTGTCGAGAAGTTTCTGATCGAAAAGTTCGACAGCGTCTCCGACCTGATGCA GCTCTCGGAGGGCGAAGAATCTCGTGCTTTCAGCTTCGATGTAGGAGGGCGTGGATATGTCCTG 
CGGGTAAATAGCTGCGCCGATGGTTTCTACAAAGATCGTTATGTTTATCGGCACTTTGCATCGG CCGCGCTCCCGATTCCGGAAGTGCTTGACATTGGGGAATTCAGCGAGAGCCTGACCTATTGCAT CTCCCGCCGTGCACAGGGTGTCACGTTGCAAGACCTGCCTGAAACCGAACTC-CCCGCTGTTCTG CAGCCGGTCGCGGAGGCCATGGATGCGATCGCTGCGGCCGATCTTAGCCAGACGAGCGGGTTCG GCCCATTCGGACCGCAAGGAATCGGTCAATACACTACATGGCGTGATTTCATATGCGCGATTGC TGATCCCCATG7GTATCACTGGCAAACTGTGATGGACGACACCGTCAGTGCGTCCGTCGCGCAG GCTCTCGATGAGCTGATGCTTTGGGCCGAGGACTC-CCCCGAAGTCCGGCACCTCGTGCACGCGG ATTTCGGCTCCAACAATGTCCTGACGGACAATGGCCGCATAACAGCGGTCATTGACTGGAGCGA GGCGATGTTCGGGGATTCCCAATACGAGGTCGCCAACATCTTCTTCTGGAGGCCGTGGTTGGCT TGTATGGAGCAGCAGACGCGCTACTTCGAGCGGAGGCATCCGGAGCTTGCAGGATCGCCGCGGC TCCGGGCGTATATGCTCCGCATTGGTCTTGACCAACTCTATCAGAGCTTGGTTGACGGCAATTT CGATGATGCAGCTTGGGCGCAGGGTCGATGCGACC-CAATCGTCCGATCCGGAGCCGGGACTGTC GGGCGTACACAAATCGCCCGCAGAAGCGCGGCCGTCTGGACCGATGGCTGTGTAGAAGTACTCG CCGATAGTGGAAACCGACGCCCCAGCACTCGTCCGAGGGCAAAGGAATAGAC-TAGATGCCGACC GGGATCCACTTAACGTTACTGAAATCATCAAACAC-CTTGACGAATCTGGATATAAGATCGTTGG
TGTCGATGTCAGCTCCGGAGTTGAGACAAATGGTGTTCAGGATCTCGATAAGATACGTTCATTT
GTCCAAGCAGCAAAGAGTGCCTTCTAGTGATTTAATAGCTCCATGTCAACAAGAATAAAACGCG
TTTCGGGTTTACCTCTTCCAGATACAGCTCATCTGCAATGCATTAATGCATTGGACCTCGCAAC
CCTAGTACGCCCTTCAGGCTCCGGCGAAGCAGAAGAATAGCTTAGCAGAGTCTATTTTCATTTT
CGGGAGACTAGCATTCTGTAAACGGGCAGCAATCGCCCAGCAGTTAGTAGGGTCCCCTCTACCT
CTCAGGGAGATGTAACAACGCCACCTTATGGGACTATCAAGCTGACGCTGGCTTCTGTGCAGAC
AAACTGCGCCCACGAGTTCTTCCCTGACGCCGCTCTCGCGCAGGCAAGGGAACTCGATGAATAC
TACGCAAAGCACAAGAGACCCGTTGGTCCACTCCATGGCCTCCCCATCTCTCTCAAAGACCAGC
TTCGAGTCAAGGTACACCGTTGCCCCTAAGTCGTTAGATGTCCCTTTTTGTCAGCTAACATATG
CCACCAGGGCTACGAAACATCAATGGGCTACATCTCATGGCTAAACAAGTACGACGAAGGGGAC
TCGGTTCTGACAACCATGCTCCGCAAAGCCGGTGCCGTCTTCTACGTCAAGACCTCTGTCCCGC
AGACCCTGATGGTCTGCGAGACAGTCAACAACATCATCGGGCGCACCGTCAACCCACGCAACAA
GAACTGGTCGTGCGGCGGCAGTTCTGGTGGTGAGGGTGCGATCGTTGGGATTCGTGGTGGCGTC
ATCGGTGTAGGAACGGATATCGGTGGCTCGATTCGAGTGCCGGCCGCGTTCAACTTCCTGTACG
GTCTAAGGCCGAGTCATGGGCGGCTGCCGTATGCAAAGATGGCGAACAGCATGGAGGGTCAGGA
GACGGTGCACAGCGTTGTCGGGCCGATTACGCACTCTGTTGAGGGTGAGTCCTTCGCCTCTTCC
TTCTTTTCCTGCTCTATACCAGGCCTCCACTGTCCTCCTTTCTTGCTTTTTATACTATATACGA
GACCGGCAGTCACTGATGAAGTATGTTAGACCTCCGCCTCTTCACCAAATCCGTCCTCGGTCAG
GAGCCATGGAAATACGACTCCAAGGTCATCCCCATGCCCTGGCGCCAGTCCGAGTCGGACATTA
TTGCCTCCAAGATCAAGAACGGCGGGCTCAATATCGGCTACTACAACTTCGACGGCAATGTCCT
ICCACACCCTCCTATCCIGCGCGGCGTGGAAACCACCGTCGCCGCACTCGCCAAAGCCGGTCAC
ACCGTGACCCCGTGGACGCCATAC^AGCACGATTTCGGCCACGATCTCATCTCCCATATCTACG
CGGCTGACGGCAGCGCCGACGTAATGCGCGATATCAGTGCATCCGGCGAGCCGGCGATTCCAAA
TATCAAAGACCTACTGAACCCGAACATCAAAGCTGTTAACATGAACGAGCTCTGGGACACGCAT
CTCCAGAAGTGGAATTACCAGATGGAGTACCTTGAGAAATGGCGGGAGGCTGAAGAAAAGGCCG
GGAAGGAACTGGACGCCATCATCGCGCCGATTACGCCTACCGCTGCGGTACGGCATGACCAGTT
CCGGTACTATGGGTATGCCTCTGTGATCAACCTGCTGGATTTCACGAGCGTC-GTTGTTCCGGTT
ACCTTTGCGGA7AAGAACATCGATAAGAAGAATGAGAGTTTCAAGGCGGTTAGTGAGCTTGATG
CCCTCGTGCAGGAAGAGTATGATCCGGAGGCGTACCATGGGGCACCGGTTGCAGTGCAGGTTAT
CGGACGGAGACTCAGTGAAGAGAGGACGTTGGCGATTGCAGAGGAAGTGGGGAAGTTGCTGGGA
AATGTGGTGACTCCATAGCTAATAAGTGTCAGATAGCAATTTGCACAAGAAATCAATACCAGCA
ACTGTAAATAAGCGCTGAAGTGACCATGCCATGCTACGAAAGAGCAGAAAAAAACCTGCCGTAG
AACCGAAGAGAGATGACACGCTTCCATCTCTCAAAGGAAGAATCCCTTCAGC-GTTGCGTTTCCA
GTAGTGATTTTACCGCTGATGAAATGACTGGACTCCCTCCTCCTGCTCTTATACGAAAAATTGC
CTGACTCTGCAAAGGTTGTTTGTCTTGGAAGATGATGTGCCCCCCCATCGCTCTTATCTCATAC
CCCGCCATCTTTCTAGATTCTCATCTTCAACAAGAGGGGCAATCCATGATCTGCGATCCAGATG
TGCTTCTGGCC7CATACTCTGCCTTCAGGTTGATC-TTCACTTAATTGGTGACGAATTCAGCTGA
TTTGCTGCAGTATGCTTTGTGTTGGTTCTTTCCAGGCTTGTGCCAGCCATGAGCGCTTTGAGAG
CATGTTGTCAC7TATAAACTCGAGTAACGGCCACATATTGTTCACTACTTGAATCACATACCTA
ATTTTGATAGAATTGACATGTTTAAAGAGCTGAGGTAGCTTTAATGCCTCTGAAGTATTGTGAC
ACAGCTTCTCACAGAGTGAGAATGAAAAGTTGGACTCCCCCTAATGAAGTAAAAGTTTCGTCTC
TGAACGGTGAAGAGCATAGATCCGGCATCAACTACCTGGCTAGACTACGACGTCAATTCTGCGG
CCTTTTGACCTTTATATATGTCCATTAATGCAATAGATTCTTTTTTTTTTTTTTTTTTTTTTTT
TTTTTTTTTTT7TTTTTGCCCAATTTCGCACtATCAAAGTGGACGTTATAGCATCATAACTAAGC
TCAGTTGCTGAGGGAAGCCGTCTACTACCTTAGCCCATCCATCCAGCTCCATACCTTGATACTT
TAGACGTGAAGCAATTCACACTGTACGTCTCGCAGCTCTCCTTCCCGCTCTTGCTTCCCCACTG
GGGTCCATGGTGCGTGTATCGTCCCCTCCACAATTCTATGCCATGGTACCTCCAGCTTATCAAT
GCCCCGCTAACAAGTCGCCTCTTTGCCTTGATAGCTTATCGATAAAACTTTTTTTCCGCCAGAA
AGGCTCCGCCCACAGACAAGAAAAAAAATTCACCC-CCTAGCCTTTGGCCCCC-GCATTTGGCTAA
ACCTCGAGCCTCTCTCCCGTCTTGGGGTATCAGGAAGAAAAGAAAAAAATCCATCGCCAAGGGC
TGTTTTGGCATCACCACCCGAAAACAGCACTTCCTCGATCAAAAGTTGCCCGCCATGAAGACCA
CGTGGAAGGACATCCCTCCGGTGCCTACGCACCAC-GAGTTTCTGGACATTGTGCTGAGCAGGAC
CCAGCGCAAACGGCCCACTCAGATCCGTGCCGGCTTCAAGATTAGCAGAATTCGAGGTACGTCG
CATTGCCCATCGCAGGATGTCTCATTATCGGGGTCCTTGGAGAACGATCATGATTGCATGGCGA
TGCTAACACATAGACAGCCTTCTACACTCGAAAGGTCAAGTTCACCCAGGAGACGTTTTCCGAA
AAGTTCGCCTCCATCCTCGACAGCTTCCCTCGCCTCCAGGACATCCACCCCTTCCACAAGGACC
TTCTCAACACCCTCTACGATGCCGACCACTTCAAGATTGCCCTTGGCCAGATGTCCACTGCCAA
GCACCTGGTCGAGACCATCTCGCGCGACTACGTCCGTCTCTTGAAATACGCCCAGTCGCTCTAC
CAGTGCAAGCAGCTCAAGCGGGCCGCTCTCGGTCGCATGGCCACGCTGGTCAAGCGCCTCAAGG
ACCCGCTGCTGTACCTGGACCAGGTCCGCCAGCATCTCGGCCGTCTTCCCTCCATCGACCCCAA
CACCAGGACCCrGCTCATCTGCGGTTACCCCAATGTTGGCAAGTCCAGCTTCCTGCGAAGTATC
ACCCGCGCCGA7GTGGACGTCCAGCCCTATGCTTTCACCACCAAGAGTCTGTTTGTCGGCCACT
TTGACTACAAGTACCTGCGATTCCAGGCCATTGATACCCCCGGTATTCTGGACCACCCTCTTGxA
GGAGATGAACACTATCGAAATGCAGAGGTATGTGC-CGCGGCTA
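The segment coordinates stated in paragraph [0109] for the pyr2 knockout cassette (SEQ ID NO:2) can be checked for internal consistency. The following is a minimal sketch, not part of the original disclosure: it tabulates the stated ranges, computes each segment length, and lists any bases that the paragraph leaves unannotated. The names `SEGMENTS`, `segment_lengths` and `unannotated_gaps` are hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Tuple

Segment = Tuple[str, int, int]  # (label, first base, last base), 1-based inclusive

# Coordinates exactly as stated in paragraph [0109].
CASSETTE_LENGTH = 9259
SEGMENTS: List[Segment] = [
    ("pyr2 3' homologous region", 1, 1994),
    ("T. reesei cbh1 promoter", 2002, 3497),
    ("hygromycin resistance marker", 3563, 5449),
    ("A. nidulans amdS 3' partial marker", 5450, 7440),
    ("pyr2 5' homologous region", 7441, 9259),
]


def segment_lengths(segments: List[Segment]) -> Dict[str, int]:
    """Length in base pairs of each annotated segment."""
    return {name: end - start + 1 for name, start, end in segments}


def unannotated_gaps(segments: List[Segment], total: int) -> List[Tuple[int, int]]:
    """Base ranges of the cassette that no annotated segment covers."""
    gaps = []
    expected_start = 1
    for _, start, end in sorted(segments, key=lambda s: s[1]):
        if start > expected_start:
            gaps.append((expected_start, start - 1))
        expected_start = end + 1
    if expected_start <= total:
        gaps.append((expected_start, total))
    return gaps


lengths = segment_lengths(SEGMENTS)
gaps = unannotated_gaps(SEGMENTS, CASSETTE_LENGTH)

for name, n in lengths.items():
    print(f"{name}: {n} bp")
for first, last in gaps:
    print(f"unannotated: bases {first}-{last} ({last - first + 1} bp)")
```

Under these coordinates the annotated segments plus the two short unannotated stretches (presumably linker or junction sequence, which the text does not describe) account for the full stated length of 9259 base pairs.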
Creation of the Archy 3 strain from the Archy 2 T. reesei strain.
[0110] The Archy 2 strain was transformed with the hygromycin deletion cassette to integrate at the same pyr2 locus and to replace the hygromycin resistance gene with the coding region of the pyr2 gene. The hygromycin deletion cassette is depicted in Figure 4. Re-introduction of the pyr2 gene into the pyr2 locus placed the pyr2 gene between the T. reesei cbh1 promoter and the partial amdS selectable marker. This strain was selected by uridine prototrophy and sensitivity to hygromycin. The nucleotide sequence of the hygR knockout cassette is 9088 base pairs in length: bases 1-1994 correspond to the pyr2 3' homologous region; bases 1995-3497 correspond to the T. reesei cbh1 promoter; bases 3564-5137 correspond to the pyr2 selectable marker; bases 5280-7270 correspond to the A. nidulans amdS 3' partial marker; and bases 7271-9088 correspond to the pyr2 5' homologous region. The nucleotide sequence of the hygR knockout cassette is provided as SEQ ID NO:3:
ATCACGCCCTCGCATAAAAGACCCTCAAGAGTCCATGTGCCCTATCTGCCTGATCTTCCTAACC
CTTATTTAACA7TGGCCCTATCACAACCTAGTTCTTCTTCAGCCTGCTTTGTCAACACTTGTCA
CGGTTCAACTCAACGTAATCAGCAGGTAGCAGGACGAGGATAGGGAGAGAAACGAAGAGAAGAA
GAGGAGAGAGGAAGAAAAAAAAAAGAAAAGAAAGAAAAAGGGAAAAGGAAAGAAGGAGGAAAAG
AGAAGAAAGTCAGATGAAGAAGCAAGAAGACGCCATGGTAGCCACCGCTTGTCAGGGCTCCTTA
GCAACGAACAACTCTAGCTTGGGGACTTGTCGATC-TGTCGTTTCCTTCCTACCCATCAGCACCA
ACGATGAGTTCGATATAGACGAGGACCTCATGGAAGTAGAGACCATTGGGTTCGACAGGATCTC
ICAGTTTCACTGCTATGAGGTCTGTCGCTCGGATCACTTTTTGAGGAGCTTCCCCTTCTGCTTC
AACCCCAAACTCTCTTTCCTGAAACCGCAGCACGTTGGCACGGCCGTGTTGCTGGAGCAGTTTG
CTTTCGAGCAC7CTCAGCGTGGTTTCAGCAGCCCACTGGTGAGTGGCCTCCTTTGACGTCCACA
CCTTGCTCCTG7CGCATGCGTATCTGGTGGGAACGACTGCTCCAAGGAGGATTGCTAACGAGGT TGTAGGCCGAA7ATCGCATCAGATTCTCCGGTAACCTTAGCTACGGCCTCTTCAACATCTGTGA CATGACGGAGCGCAAGTACTGGTGGTTGGCGACCAAGATGCGCGGCTGGAACATCGACGGCTGC CCCGAAGACGTCAGGAGACTCATGTTTGTTCACATCATCGCCACCCTGGGATGCAGCCCCGTCG TGACGGATGAAGACATGGACTACCCCAAGAACTGC-GCGGCAATTCTCCACGC-TAGAGACAGATA TCCGAGTGAACCTGTGGC-CCACCGGCCTCATGGGCGCACCATCTGCCTCCACTCGGTGGCCGTC TGCCCTCGTCTCCAGGGCTTGGGTCTCGGTACTGCGACTCTGAAGTCGTATGTGCAGCGCATGA ACAGCCTCGGCGCCGCGGACCGTGTTGCTCTCGTTTGCCGCAAGCCCGAGACGAGAT'ITTTTGA AAGATGCGGCT7CAGGAACAGCGGCCGGAGTAGTATCAAGACTCTGGTCGGCGAATACTACAAC ATGGTGTGTGCTTCCACATCGACTTGGCCAGACTCTATACGATTTTCAAACCTCGCTATACGTC ATATTGACTTGTTTCTTTAGGTCTTCGATTTGCCCGGGCCCAAAGACTTTATCGACTGGAATAG CATTGCCGACGCTGCCAAGAAGATGTGAACCATTTGACTGATACGATGTGTGCTACGCATGTCG ACCTTCTTTGTTTGTTTCTTTGGCGGCTCTTTGTATACCTTGGGACACGGCAGACGCATGTCTA TGTGAAGAAAACGTTCACGGCGCTGTTTGCATCAC-GAATATGATCATTAAACATGGAGCGTAAT GGTATTAATGA7CAACTAGAAAAATGGTATGGAAGGGCGAGAGGGCGATCAACAAAGCAGCCCG GGGCATAGTCTGGAAGCAGCAGGAATTGGAAGGGAAAAGGAAGCTGCACAATGAAGGGATATCG TGAGCGGAGTGGCTCACGAGAGTATCAACAGACTGGCGAAAGCAAGCAATTGCCAACGCCGGCT ATTAGGCCATAAGATGGCCTGTTGTGAGTCCCAGTTGCACGTATCCCCATATGACTGCTCTGTC GCTGACTTGAAAAAAAATAGGGAGGATAAAGGAGAAAGAAAGTGAGACAACCCGTGAGGGACTT GGGGTAGTAGGAGAACACATGGGCAACCGGGCAATACACGCGATGTGAGACGAGTTCAACGGCG AATGGAAAATC7TGAAAAACAAAATAAAATAACTGCCCTCCATACGGGTATCAAATTCAAGCAG TTGTACGGAGGCTAGATAGAGTTGTGAAGTCGGTAATCCCGCTGTATAGTAATACGAGTCGCAT CTAAATACTCCGAAGCTGCTGCGAACCCGGAGAATCGAGATGTGCTGGAAAGCTTCTAGCGAGC GGCTAAATTAGCATGAAAGGCTATGAGAAATTCTGGAGACGGCTTGTTGAATCATGGCGTTCCA TTCTTCGACAAGCAAAGCGTTCCGTCGCAGTAGCAGGCACTCATTCCCGAAAAAACTCGGAGAT TCCTAAGTAGCGATGGAACCGGAATAATATAATAGGCAATACATTGAGTTGCCTCGACGGTTGC AATGCAGGGGTACTGAGCTTGGACATAACTGTTCCGTACCCCACCTCTTCTCAACCTTTGGCGT TTCCCTGATTCAGCGTACCCGTACAAGTCGTAATCACTATTAACCCAGACTGACCGGACGTGTT TTGCCCTTCAT7TGGAGAAATAATGTCATTGCGATGTGTAATTTGCCTGCTTGACCGACTGGGG CTGTTCGAAGCCCGAATGTAGGATTGTTATCCGAACTCTGCTCGTAGAGGCATGTTGTGAATCT 
GTGTCGGGCAGGACACGCCTCGAAGGTTCACGGCAAGGGAAACCACCGATAGCAGTGTCTAGTA GCAACCTGTAAAGCCGCAATGCAGCATCACTGGAAAATACAAACCAATGGCTAAAAGTACATAA GTTAATGCCTAAAGAAGTCATATACCAGCGGCTAATAATTGTACAATCAAGTGGCTAAACGTAC CGTAATTTGCCAACGGCTTGTGGGGTTGCAGAAGCAACGGCAAAGCCCCACTTCCCCACGTTTG TTTCTTCACTCACTCCAATCTCAGCTGCTGATCCCCCAATTCCCTCGCTTGTTTGTTCCCGTCA AGTGAAAGAAGACAGAGGTAAGAATGTCTGACTCGGAGCGTTTTGCATACAACCAAGGGCAGTG ATGGAAGACAG7GAAATGTTGACATTCAAGGAGTATTTAGCCAGGGATGCTTGAGTGTATCGTG TAAGGAGGTTTGTCTGCCGATACGACGAATACTGTATAGTCACTTCTGATGAAGTGGTCCATAT TGAAATGTAAAGTCGGCACTGAACAGGCAAAAGATTGAGTTGAAACTGCCTAAGATCTCGGGCC CTCGGGCCTTCGGCCTTTGGGTGTACATGTTTGTGCTCCGGGCAAATGCAAAGTGTGGTAGGAT CGAACACACTGCTGCCTTTACCAAGCAGCTGAGGGTATGTGATAGGCAAATGTTCAGGGGCCAC TGCATGGTTTCGAATAGAAAGAGAAGCTTAGCCAAGAACAATAGCCGATAAAGATAGCCTCATT AAACGGAATGAGCTAGTAGGCAAAGTCAGCGAATC-TGTATATATAAAGGTTCGAGGTCCGTGCC TCCCTCATGCTCTCCCCATCTACTCATCAACTCAGATCCTCCAGGAGACTTGTACACCATCTTT TGAGGCACAGAAACCCAATAGTCAACCGCGGACTGCGCATCATGTATCGGAAGTTGGCCGTCAT CTCGGCCTTCT7GGCCACACCTCGTGCTAGACTAGGCGCGTCAATATGTGGCCGTTACTCGAGT TTATAAGTGACAACATGCTCTCAAAGCGCTCATGGCTGGCACAAGCCTGGAAAGAACCAACACA AAGCATACTGCAGCAAATCAGCTGAATTCGTCACCAATTAAGTGAACATCAACCTGAAGGCAGA GTATGAGGCCAGAAGCACATCTGGATCGCAGATCATGGATTGCCCCTCTTGTTGAAGATGAGAA TCTAGAAAGATGGCGGGGTATGAGATAAGAGCGATGGGGGGGCACATCATCTTCCAAGACAAAC AACCTTTGCAGAGTCAGGCAATTTTTCGTATAAGAGCAGGAGGAGGGAGTCCAGTCATTTCATC AGCGGTAAAATCACTCTAGACAATCTTCAAGATGAGTTCTGCCTTGGGTGACTTATAGCCATCA
TCATACCTAGACAGAAGCTTGTGGGATACTAAGACCAACGTACAAGCTCGCACTGTACGCTTTG ACTTCCATGTGAAAACTCGATACGGCGCGCCTCTAAATTTTATAGCTCAACCACTCCAATCCAA CCTCTGCATCCCTCTCACTCGTCCTCATCTACTCTTCAAATCACACAATAACCACACTATCCAA ATCCAACAGAA7GGCTACCACCTCCCAGCTGCCTGCCTACAAGCAGGACTTCCTCAAATCCGCC ATCGACGGCGGCGTCCTCAAGTTTGGCAGCTTCGAGCTCAAGTCCAAGCGGATATCCCCCTACT TOTTOAACGOGGGCGAATTOCACACGGCGCGCCTCGCOGGCGOCATCGCCTCOGCOTTTGOAAA GACCATCATCGAGGCCCAGGAGAAGGCCGGCCTAGAGTTCGACATCGTCTTCGGCCCGGCCTAC AAGGGCATCCCGCT GTGCTCCGCCATCACCATCAAGCTCGGCGAGCTGGCGCCCCAGAACCTGG ACCGCGTCTCCTACTCGTTTGACCGCAAGGAGGCCAAGGACCACGGCGAGGGCGGCAACATCGT CGGCGCTTCGCTCAAGGGCAAGAGGGTCCTGATTC-TCGACGACGTCATCACCGCCGGCACCGCC AAGAGGGACGCCATTGAGAAGATCACCAAGGAGGGCGGCATCGTCGCCGGCATCGTCGTGGCCC TGGACCGCATGGAGAAGCTCCCCGCTGCGGATGGCGACGACTCCAAGCCTGGACCGAGTGCCAT TGGCGAGCTGAGGAAGGAGTACGGCATCCCCATCTTTGCCATCCTCACTCTGGATGACATTATC GATGGCATGAAGGGCTTTGCTACCCCTGAGGATATCAAGAACACGGAGGATTACCGTGCCAAGT ACAAGGCGACTGACTGATTGAGGCGTTCAATGTCAGAAGGGAGAGAAAGACTGAAAAGGTGGAA AGAAGAGGCAAATTGTTGTTATTATTATTATTCTATCTCGAATCTTCTAGATCTTGTCGTAAAT AAACAAGCGTAACTAGCTAGCCTCCGTACAACTGCTTGAATTTGATACCCGTATGGAGGGCAGT TATTTTATTTTGTTTTTCAAGATTTTCCATTCGCCGTTGAACTCGTCTCACATCGCGTGTATTG CCCGGTTGCCCATGTGTACGCGTTTCGGGTTTACCTCTTCCAGATACAGCTCATCTGCAATGCA TTAATGCATTGGACCTCGCAACCCTAGTACGCCCTTCAGGCTCCGGCGAAGCAGAAGAATAGCT TAGCAGAGTCTATTTTCATTTTCGGGAGACTAGCATTCTGTAAACGGGCAGCAATCGCCCAGCA GTTAGTAGGGTCCCCTCTACCTCTCAGGGAGATGTAACAACGCCACCTTATGGGACTATCAAGC TGACGCTGGCTTCTGTGCAGACAAACTGCGCCCACGAGTTCTTCCCTGACGCCGCTCTCGCGCA GGCAAGGGAACTCGATGAATACTACGCAAACtCACAAGAGACCCGTTGGTCCACTCCATGGCCTC CCCATCTCTCTCAAAGACCAGCTTCGAGTCAAGGTACACCGTTGCCCCTAAGTCGTTAGATGTC CCTTTTTGTCAGCTAACATATGCCACCAGGGCTACGAAACATCAATGGGCTACATCTCATGGCT AAACAAGTACGACGAAGGGGACTCGGTTCTGACAACCATGCTCCGCAAAGCCGGTGCCGTCTTC TACGTCAAGACCTCTGTCCCGCAGACCCTGATGGTCTGCGAGACAGTCAACAACATCATCGGGC GCACCGTCAACCCACGCAACAAGAACTGGTCGTGCGGCGGCAGTTCTGGTGGTGAGGGTGCGAT CGTTGGGATTCGTGGTGGCGTCATCGGTGTAGGAACGGATATCGGTGGCTCGATTCGAGTGCCG 
GCCGCGTTCAACTTCCTGTACGGTCTAAGGCCGAGTCATGGGCGGCTGCCGTATGCAAAGATGG CGAACAGCATGGAGGGTCAGGAGACGGTGCACAGCGTTGTCGGGCCGATTACGCACTCTGTTGA GGGTGAGTCCTTCGCCTCTTCCTTCTTTTCCTGCTCTATACCAGGCCTCCACTGTCCTCCTTTC TTGCTTTTTATACTATATACGAGACCGGCAGTCACTGATGAAGTATGTTAGACCTCCGCCTCTT CACCAAATCCGTCCTCGGTCAGGAGCCATGGAAATACGACTCCAAGGTCATCCCCATGCCCTGG CGCCAGTCCGAGTCGGACATTATTGCCTCCAAGATCAAGAACGGCGGGCTCAATATCGGCTACT ACAACTTCGACGGCAATGTCCTTCCACACCCTCCTATCCTGCGCGGCGTGGAAACCACCGTCGC CGCACTCGCCAAAGCCGGTCACACCGTGACCCCGTGGACGCCATACAAGCACGATTTCGGCCAC GATCTCATCTCCCATATCTACGCGGCTGACGGCAGCGCCGACGTAATGCGCGATATCAGTGCAT CCGGCGAGCCGGCGATTCCAAATATCAAAGACCTACTGAACCCGAACATCAAAGCTGTTAACAT GAACGAGCTCTGGGACACGCATCTCCAGAAGTGGAATTACCAGATGGAGTACCTTGAGAAATGG CGGGAGGCTGAAGAAAAGGCCGGGAAGGAACTGGACGCCATCATCGCGCCGATTACGCCTACCG CTGCGGTACGGCATGACCAGTTCCGGTACTATGGGTATGCCTCTGTGATCAACCTGCTGGATTT CACGAGCGTGGTTGTTCCGGTTACCTTTGCGGATAAGAACATCGATAAGAAC-AATGAGAGTTTC AAGGCGGTTAGrGAGCTTGATGCGGTCGTGCAGGAAGAGTATGATCGGGAGGCGTAGCATGGGG CACCGGTTGCAGTGCAGGTTATCGGACGGAGACTCAGTGAAGAGAGGACGTTGGCGATTGCAGA GGAAGTGGGGAAGTTGCTGGGAAATGTGGTGACTCCATAGCTAATAAGTGTCAGATAGCAATTT GCACAAGAAATCAATACCAGCAACTGTAAATAAGCGCTGAAGTGACCATGCCATGCTACGAAAG AGCAGAAAAAAACCTGCCGTAGAACCGAAGAGATATGACACGCTTCCATCTCTCAAAGGAAGAA TCCCTTCAGGG7TGCGTTTCCAGTAGTGATTTTACCGCTGATGAAATGACTGGACTCCCTCCTC CTGCTCTTATACGAAAAATTGCCTGACTCTGCAAAGGTTGTTTGTCTTGGAAGATGATGTGCCC CCCCATCGCTCTTATCTCATACCCCGCCATCTTTCTAGATTCTCATCTTCAACAAGAGGGGCAA
TCCATGATCTGCGATCCAGATGTGGTTCTGGCCTCATACTCTGCCTTCAGGTTGATGTTCACTT
AATTGGTGACGAATTCAGCTGATTTGCTGCAGTATGCTTTGTGTTGGTTCTTTCCAGGCTTGTG
CCAGCCATGAGCGCTTTGAGAGCATGTTGTCACTTATAAACTCGAGTAACGC-CCACATATTGTT
CACTACTTGAA7CACATACCTAATTTTGATAGAATTGACATGTTTAAAGAGCTGAGGTAGCTTT
AATGCCTCTGAAGTATTGTGACACAGCTTCTCACAGAGTGAGAATGAAAAGTTGGACTCCCCCT
AATGAAGTAAAAGTTTCC-TCTCTGAACGGTGAAGAGCATAGATCCGGCATCAACTACCTGGCTA
GACTACGACGTCAATTCTGCGGCCTTTTGACCTTTATATATGTCCATTAATC-CAATAGATTCTT
TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGCCCAATTTCGCAGATCAAAGTGGA
CGTTATAGCATCATAACTAAGCTCAGTTGCTGAGGGAAGCCGTCTACTACCTTAGCCCATCCAT
CCAGCTCCATACCTTGATACTTTAGACGTGAAGCAATTCACACTGTACGTCTCGCAGCTCTCCT
TCCCGCTCTTGCTTCCCCACTGGGGTCCATGGTGCGTGTATCGTCCCCTCCACAATTCTATGCC
ATGGTACCTCCAGCTTATCAATGCCCCGCTAACAAGTCGCCTCTTTGCCTTGATAGCTTATCGA
TAAAACTTTTT7TCCGCCAGAAAGGCTCCGCCCACAGACAAGAAAAAAAATTCACCGCCTAGCC
TTTGGCCCCGGCATTTGGCTAAACCTCGAGCCTCTCTCCCGTCTTGGGGTATCAGGAAGAAAAG
AAAAAAATCCATCGCCAAGGGCTGTTTTGGCATCACCACCCGAAAACAGCACTTCCTCGATCAA
AAGTTGCCCGCCATGAAGACCACGTGGAAGGACATCCCTCCGGTGCCTACGCACCAGGAGTTTC
TGGACATTGTGCTGAGCAGGACCCAGCGCAAACTGCCCACTCAGATCCGTGCCGGCTTCAAGAT
TAGCAGAATTCGAGGTACGTCGCATTGCCCATCGCAGGATGTCTCATTATCGGGGTCCTTGGAG
AACGATCATGATTGCATGGCGATGCTAACACATAGACAGCCITCTACACTCGAAAGGTCAAGTT
CACCCAGGAGACGTTTTCCGAAAAGTTCGCCTCCATCCTCGACAGCTTCCCTCGCCTCCAGGAC
ATCCACCCCTTCCACAAGGACCTTCTCAACACCCTCTACGATGCCGACCACTTCAAGATTGCCC
TTGGCCAGATGTCCACTGCCAAGCACCTGGTCGAGACCATCTCGCGCGACTACGTCCGTCTCTT
GAAATACGCCCAGTCGCTCTACCA3TGCAAGCAGCTCAAGCGGGCCGCTCTCGGTCGCATGGCC
ACGCTGGTCAAGCGCCTCAAGGACCCCCTGCTGTACCTGGACCAGGTCCGCCAGCATCTCGGCC
GTCTTCCCTCCATCGACCCCAACACCAGGACCCTGCTCATCTGCGGTTACCCCAATGTTGGCAA
GTCCAGCTTCCTGCGAAGTATCACCCGCGCCGATC-TGGACGTCCAGCCCTATGCTTTCACCACC
AAGAGTCTGTT7GTCGGCCACTTTGACTACAAGTACCTGCGATTCCAGGCCATTGATACCCCCG
GTATTCTGGACCACCCTCTTGAGGAGATGAACACTATCGAAATGCAGAGGTATGTGGCGCGGCT
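The marker swap described in paragraphs [0109] and [0110] can be summarized as a small phenotype model. The sketch below is illustrative only and makes simplifying assumptions: a functional pyr2 gene is taken to confer uridine prototrophy and fluoroorotic acid sensitivity, a hygromycin resistance gene is taken to confer hygromycin resistance, and the partial amdS marker is treated as nonfunctional, as the text states. The class `Strain` and the functions `phenotype` and `passes_archy3_selection` are hypothetical names.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass(frozen=True)
class Strain:
    name: str
    functional_pyr2: bool   # intact pyr2 coding region at the pyr2 locus
    hyg_marker: bool        # hygromycin resistance gene present
    functional_amds: bool   # complete, expressed amdS (the partial marker is not)


def phenotype(strain: Strain) -> Dict[str, bool]:
    """Expected plate phenotypes under the simplifying assumptions above."""
    return {
        "hygromycin_resistant": strain.hyg_marker,
        "uridine_prototroph": strain.functional_pyr2,
        "foa_resistant": not strain.functional_pyr2,
        "grows_on_acetamide": strain.functional_amds,
    }


def passes_archy3_selection(strain: Strain) -> bool:
    """Selection stated in [0110]: uridine prototrophy and hygromycin sensitivity."""
    p = phenotype(strain)
    return p["uridine_prototroph"] and not p["hygromycin_resistant"]


# Archy 2: pyr2 replaced by the hygromycin marker plus partial amdS.
archy2 = Strain("Archy 2", functional_pyr2=False, hyg_marker=True, functional_amds=False)
# Archy 3: hygromycin marker replaced by the pyr2 coding region.
archy3 = Strain("Archy 3", functional_pyr2=True, hyg_marker=False, functional_amds=False)

for s in (archy2, archy3):
    print(s.name, phenotype(s), "selected:", passes_archy3_selection(s))
```

In this model only the Archy 3 genotype satisfies both selection criteria, which is why the two phenotypes together distinguish correct integrants from the untransformed Archy 2 background.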
Creation of the A5D strain from the Archy 3 T. reesei strain.
[0111] Native T. reesei bgl1 was deleted from the Archy 3 strain using a double crossover recombination gene replacement vector known in the art, e.g. M. Ward, et al. (1990). Gene 86(2): 153-62. Hygromycin resistance was used as the selectable marker for bgl1 deletion. In addition, the hygromycin resistance marker was flanked by loxP sites. The bgl1 deletion cassette is depicted in Figure 5. Subsequent hygromycin resistant transformants were analyzed for bgl1 deletion. A strain confirmed for deletion of bgl1 was then transformed with a telomeric vector encoding cre recombinase and a functional amdS for selection of transformants, to facilitate removal of the hygromycin resistance marker by cre recombinase expression and loop out of the loxP sites. The telomeric vector encoding the cre recombinase is schematically depicted in Figure 7. Transformants were first obtained on acetamide media, then transferred to potato dextrose agar and replica plated onto hygromycin media to screen for hygromycin sensitivity. Strains sensitive to hygromycin were again transferred to potato dextrose media and replica plated to acetamide media. A strain was selected that had lost its capacity to grow on acetamide, indicating loss of the telomeric vector. The nucleotide sequence of the bgl1 knockout cassette is provided as SEQ ID NO:4:
.aatggtaggaa^gctggc-atataggctctgtgctc-gcaagttgatggatcctcgaatgaggccg
CCCTGCAAGGGGAACATCAGAGATCTACCATTGCCTCCTTGGCCCAATCCACTATCATACCTAC
CTCATGATCAT7CCTGCGAAGGTCTACCAGTAAATATTTCCTCGTCCCGTGTTTCATCATGTCC
AGAACCTCATC7CGCCAAATTGACTTTGCCACAGTGTCTGGAGCTGGGTAAGCAGCGTGCCAAG
GAATTGTTGTCGAGTCTGTGCCAGGCATTGTGCCCGACATTGTGAACTTCAGCCAGGAGAACTT
TTCCATCCCACCTATCCTCACCACCCTCCCCATCCCATCCCTTCAATAATCCACTTCCACACCC
AGTGTGTCATGCCCTAAAGCTCATTGGCCACCTCCACAGGCTAGCTCTACCTGCATCTGTAGAT
GGACTTTCCTTGTCCTCCTCCTTCAGAAAACCTCTTGGTCGCTCGCAGGTAACTGTTGTTGCCG
TCATTGTTTGACAGTGGATAGCCAAGGCAAAACCC-TCTGCTTTCAACGGAAC-CATTCGGCGGTT
GTTTGTCATCGGGTTATCGATCGACCAGGAGAACCCAGACGAGTGTTGTTCC-AGAGAAICATCG
ACGATGTGAAGAGGCGACGACTAGTATCTAGAAGATTATAATCGAACAAATCAGCGTTTGTCTG
TCGGGCGTTTGAGGGCGCAGTTGCCCGCCAAAGCAGCGTCGCAATATATAGGCAGCGAGAGACT
GTCAACAGCCAGCCGCCATGTGATCGATCGTAGCCGTCTTCCCGATCTTCCCTAAACCCCTTTC
TTTCCCCCCCCCCCCCACCCCCCTTCTAATATTTCCTCCCTCTCTCCATAACCTCAATCCTACA
CATGGTAATGT7CGGTCTGCGAAACATTTGTACAATTGGAGTTTACGATCGAGATGGAAGGAAA
CGCTCCACAAACTCGGTGACTGGGTTGCCATCAGGTGCTCAGGGCATAGCGTTCTCTGCAAATA
GAGGAAAGAGAATAGCACTAGTGAAAGTGTGAATCACAATGAAGAGGAGGTTGTTGCCGGAATG
CTTTGAGCAGCGTCAAAGTTGAACTTGAAGCTATCACAAATTGCAGGGTAAAGTACATGTTGGT
GCCAGTTTGACAGCACAGTGCGCGGAGCGGAGGATGTCGCGGAAGAGGCGCGACGCTAACCCGG
GCCTTCTTCTCAGTGAGCAGAACTCCTGCTGCAAGAGTTCCTTCTCTCTGCGAGATGACGTGAG
GCCCAATTTGCAGCTTCCCTCGAACAAGGTGATTC-AACATCTCTCTTCCCTCACATTTCATCAT
CACTACCTCCTCAZiTTCACTTCTGCTTCGGCCGTCTTCATCATTCATGTTACTGCTCTGATGCC
TATCCTGAAGA7TGTATTCCTGCAGTATTCACGCCATCCCACCTTCGGTCCTCACTCACAGTCA
CAGGTCAACCGCCTTCACCCTCCTCGCGATGATGTCGGCAATCTGGTGGATCAATGTGCGGTTG
AGGGCCGCCGTAGTGAGGATGGGCATGGGGAACGAGGTCGCCC.ATTCGCCCAC.AGATAACTTCG
TATAGCATACATTATACGAAGTTATCCTGGGCTTGTGACTGGTCGCGAGCTGCCACTAAGTGGG
GCAGTACCATTTTATCGGACCCATCCAGCTATGGGACCCACICGCAAATTTTTACATCATTTTC
TTTTTGCTCAG7AACGGCCACCTTTTGTAAAGCGTAACCAGCAAACAAATTC-CAATTGGCCCGT
AGCAAGGTAGTCAGGGCTTATCGTGATGGAGGAGAAGGCTATATCAGCCTCAAAAATATGTTGC
CAGCTGGCGGAAGCCCGGAAGGTAAGTGGATTCTTCGCCGTGGCTGGZiGCAACCGGTGGATTCC
AGCGTCTCCGACTTGGACTGAGCAATTCAGCGTCACGGATTCACGATAGACAGCTCAGACCGCT
CCACGGCTGGCGGCATTATTGGTTAACCCGGAAACTCAGTCTCCTTGGCCCCGTCCCGAAGGGA
COCGACTTAOCAGGCTGGGAAAGCCAGGGATAGAATAOACTGTACGGGCTTCGTAOGGGAGGTT
CGGCGTAGGGT7GTTCCCAAGTTTTACACACCCCCCAAGACAGCTAGCGCACGAAAGACGCGGA
GGGTTTGGTGAAAAAAGGGCGAAAATTAAGCGGGAGACGTATTTAGGTGCTAGGGCCGGTTTCC
TCCCCATTTTTCTTCGGTTCCCTTTCTCTCCTGGAAGACTTTCTCTCTCTCTCTTCTTCTCTTC
TTCCATCCTCAGTCCATCTTCCTTTCCCATCATCCATCTCCTCACCTCCATCTCAACTCCATCA
CATCACAATCGATZlTGAAAZYAGCCTGAACTCACCC-CGACGTCTGTCGZGAAAC-TTTCTGATCGAA
AACtTTCGACACtCCtTCTCCGACCTGATGCAGCTCTCGGAGGGCGAAGAATCTCGTGCTTTCAGCT
TCGATGTAGGAGGGCGTGGATATGTCCTGCGGGTAAATAGCTGCGCCGATGGTTTCTACAAAGA
TCGTTATGTTTATCGGCACTTTGCATCGGCCGCGCTCCCGATTCCGGAAGTGCTTGACATTGGG gaattcagcgagagcctgacctattgcatctcccc-ccgtgcacagggtgtcacgttgcaagacc tgcctgaaaccgaactgcccgctgttctgcagccc-gtcgcggaggccatggatgcgatcgctgc
GGCCGATCTTAGCCAGACGAGCGGGTTCGGCCCATTCGGACCGCAAGGAATCGGTCAATACACT acatggcgtgatttcatatgcgcgattgctgatccccatgtgtatcactggcaaactgtgatgg accacaccctcactccctccctcccccaccctctccatcacctcatcctttccccccaccactc
CCCCGAAGTCCGGCACCTCGTGCACGCGGATTTCGGCTCCAACAATGTCCTGACGGACAATGGC
CGCATAACAGCGGTCATTGACTGGAGCGAGGCGATGTTCGGGGATTCCCAATACGAGGTCGCCA
ACATCTTCTTCTGGAGGCCGTGGTTGGCTTGTATGGAGCAGCAGACGCGCTACTTCGAGCGGAG
GCATCCGGAGC77GCAGGA7CGCCGCGGC7CCGGGCG7A7AIGC7CCGCA7IGG7CITGACCAA CTCTATCAGAGCTTGGTTGACGGCAATTTCGATGATGCAGCTTGGGCGCAGGGTCGATGCGACG CAATCGTCCGA7CCGGAC-CCGGGACTG7CGGGCGTACACAAATCGCCCGCAGAAGCGCGGCCGT CTGGACCGATGGCTGTGTAGAAGTACTCGCCGATA.GTGGAAACCGACGCCCCAGCACTCGTCCG AGGGCAAAGGAA7AGAG7AGA7GCCGACCGGGA7CCAC77AACG77AC7GAAA7CA7CAAACAG C77GACGAA7C7GGA7A7AAGA7CG77GG7G7CGA7G7CAGC7CCGGAG77GAGACAAA7GG7G TTCAGGATCTCGATAAGATACGTTCATT7GTCCAAGCAGCAAAGAGTGCCTICTAGTGATTTAA TAGC7CCA7GTCAACAAGAATAAAACGCG7T7CGGG777ACC7C77CCAGATACAGC7CA7C7G CAA7GCAT7AA7GCAT7GGACCTCGCAACCC7AG7ACGCCCT7CAGGCTCCGGCGAAGCAGAAG AATAGC7TAGCAGAGTCTA7T7TCA7777CGGGAGACGA3ATCAAGCAGA7CAACG37CGTCAA GAGACC7ACGAGAC7GAGGAA7CCGC7C77GGCTCCACGCGACTA7ATAT7TGTCTC7AATTGT AC777GACA7GCTCC7C77CT7TAC7C7GATAGC77GAC7ATGAAAATTCCG7CACGAGCCCC7 GGG77CGCAAAGA7AA77GCAC7G777C77CC77GAAC7C7CAAGCC7ACAGGACACACA77CA TCG7AGGTA7AAACCTCGAAAATCA77CC7ACTAAGA7G3GTATACAATAGTAACCA7GGTTGC C7AG7GAA7GC7CCG7AACACCCAA7ACGCCGGCCGA7AAC77CG7A7AGCA7ACA77A7ACGA AG77A7AC77GGCGCGCC7AG7GGAACACGAGCACA7AAGCT7T7ACC7A7GG77ATCGC77GC ATCTACGCGCCGTTGATGGTGGAGGATGGTGGACC-TTCCCGAGACCCCTACGAGCTGTGGCA7C G7CAAACTG7GCCCACAGACC7T7G7C77GC77TCA7AAGCTCGAGGAGTGI77CCAGAC7CA7 CATCCATACACAAGCAGTA7TAA7CAAAGAAACTCGGTCGCAATGGCAAAAATGGTTTGCAAAC AGAAAACTA7GGCC7C7TCCTAT7CCA7CAT7AAC7AC7C7ACCCG7TTG7CATAACAACATCA T7AAAACCC77A7GCG7CAGG7G7AGCA7CC77GA7C7G77GCTCC7CCAACGGCCAGT7C7CA A7CG7TACC7C7TC7CCCACCAAC7CAAACTCAAGC77CACAGAC7CGTCGG7G77GAAGGC7A GCTCATACTTGCCGGGGTA7ACAATCCGG7T7CCGTGAGAATCAACACGGGCGAGA3CACTGAC AGGGA1GGGGA _ GC1GAGC11GGAAGAG _GACCAGGC11oAlG1CGGCAAG1CGG1^GAA1CCG ACGAGCCAC7TGTTCGGG7ACGGGGC7GGGCCAGCG77GC7TGTGCGAACAAACAGCATGGCCG TATA7GGGGAC7CCGTCT7GCCCGAG77C7TGATG77GGCCTCGAAGGTGAAGACG3GAATC7G C7CGCTGTAAG7GTATCCGGGGTGAGGAGCAGAGAGGA7CGA7GAGGTGT7GAACTTGAGGC7C T7GGGG7GGCTGGCGAGAG7C7CC77GAAGG7GG7G7AGAAGAGACCACTGCCAAAC7CGTAGA CGGG777GCCGG7G7ACCAGA7G7AAG7C7G7CCAGGG777GAC777CCA7CGGG7CGGAGG77 CA7G7CAT7C7GGGGGAA77GG7GAACA7AC7CAC-CCGG37ACTGAGTGG7GACCAG7CGGCCG 
GCAGGAGCACGCTTGCCAGAGAGAATGTCGAAGAC-GGCAACGCC7CCCGACTGGCCGGGATA7C CGCCCCAGACGAGGGAG77GACC77C77G7TGC7C77GAGCGAGGA7GAG7C7ACC7GACCACC GCCCA777GCAGGACGACAAGGGG777GCCGACC7CGC7GAGC7GC77GA7GAGA7GCAGC7GA TTACCGGGCCAAGCAATGTCCGTGCGGTCAGCGCCCTCCTGTTCAATGGTGTTGTCAATTCCAC CGAGG7AGA7GA7GGCA7CCGAC77C77GGCGGCAGCAA7GGCC77GGCAAAGCCA37GG7GC7 G77GCCGGCGA7C7C7G7GCCGAG77CAAAG77GACG7GA7AGCCGGCCT7C77AGCAGC77CC AGAGGGCTGATGAGGTATGGGGCAGGGCCATAGTAG77GCCT7GCA7TTGGG7TGTGGCATTGG CCCA7GGTCCGATCAGAGCAA7GC7GCGCACCTTC77GGACAGAGGGAGAGTGCCATCGTTC7T GAGCAGGACGA7GCCC7CAACAGCAGCC7CG7ACGAGA7G7T
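The loop-out step described in paragraph [0111] can be illustrated on a toy sequence. The sketch below assumes two loxP sites in the same orientation flanking a marker and models recombination as a simple string operation that deletes the intervening DNA together with one loxP copy, leaving a single site behind. The flanking and marker strings are invented placeholders, not taken from SEQ ID NO:4, and the function name `loop_out` is hypothetical.

```python
# Canonical 34 bp loxP site: 13 bp inverted repeat, 8 bp spacer, 13 bp inverted repeat.
LOXP = "ATAACTTCGTATAGCATACATTATACGAAGTTAT"


def loop_out(seq: str, site: str = LOXP) -> str:
    """Excise the DNA between the first two direct repeats of `site`.

    One copy of the site is retained. If fewer than two sites are found,
    the sequence is returned unchanged.
    """
    first = seq.find(site)
    if first == -1:
        return seq
    second = seq.find(site, first + len(site))
    if second == -1:
        return seq
    # Keep everything through the first site, then resume after the second site.
    return seq[: first + len(site)] + seq[second + len(site):]


# Toy construct: 5' flank, loxP, placeholder marker, loxP, 3' flank.
left_flank = "GGATCC"
toy_marker = "ATGAAAAAGCCTGAACTCACC"
right_flank = "CTGCAG"

floxed = left_flank + LOXP + toy_marker + LOXP + right_flank
excised = loop_out(floxed)

print(len(floxed), "->", len(excised))
```

A strain carrying the excised form no longer contains the marker, which corresponds to the hygromycin sensitive phenotype screened for by replica plating; the real reaction is enzymatic and reversible, so this is only a bookkeeping model of the end product.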
[0112] The nucleotide sequence of the telomeric vector, pTTT-cre, is provided as SEQ ID NO:5:
T7G7ACAAAGTGGTGA7CGCGCCGCGCGCCAGCTCCG7GCGAAAGCCTGACGCACCGGTAGA7T
C7TGGTGAGCCCGTA7CA7GACGGCGGCGGGAGC7ACATGGCCCCGGGTGAT7TA7TTTT7T7G TA7C7AC77CTGACCC7777CAAA7A7ACGGTCAACTCATCT77CAC7GGAGA7GCGGCC7GC7 TGG7AT7GCGA7G77G7CAGCT7GGCAAAT7G7GGCT7TCGAAAACACAAAACGA7TCC77AG7
AGCCATGCATT7TAAGATAACGGAATAGAAGAAAC-AGGAAATTAAAAAAAAAAAAAAAACAAAC
A7CCCGTTCATAACCCGTAGAATCGCCGCTCTTCGTG7ATCCCAGTACCAGT7TATTTTGAATA GC7CGCCCGC7GGAGAGCA7CC7GAA7GCAAG7AACAACCG7AGAGGC7GACACGGCAGG7G77
GC7AGGGAGCG7CG7G77C7ACAAGGCCAGACG7C77CGCGG77GA7A7A7A7G7A7G777GAC
TGCAGGCTGCTCAGCGACGACAGTCAAGTTCGCCCTCGCTGCTTGTGCAATAATCGCAGTGGGG
AAGCCACACCGTGACTCCCATCTTTCAGTAAAGCTCTGTTGGTGTTTATCAGCAATACACGTAA
TTTAAACTCCT7ACCATCCCGCTCATACCTTAATTACCGTTTACCAGTGCCATCGTTCTCCACC
TTTCCTTGGCCCGTAAAATTCGGCGAAGCCAGCCAATCACCAGCTAGGCACCAGCTAAACCCTA
TAATTAGTCTCTTATCAACACCATCCGCTCCCCCGGGATCAATGAGGAGAATGAGGGGGATGCG
GGGCTAAAGAAGCCTACA.TAACCCTCATGCCAACTOCOAGTTTACACTCGTCGAGOCAACATCC
TGACTATAAGCGAACACAGAATGCCTCAATCCTGC-GAAGAACTGGCCGCTGATAAGCGCGCCCG
CCTCGCAAAAACCATCCCTGATGAATGGAAAGTCCAGACGCTGCCTGCGGAAGACAGCGTTATT
GATTTCCCAAAGAAATCGGGGATCCTTTCAGAGGCCGAACTGAAGATCACAC-AGGCCTCCGCTG
CAGATCTTGTG7CCAAGCTGGCGGCCGGAGAGTTGACCTCGGTGGAAGTTACGCTAGCATTCTG
TAAACGGGCAGCAATCGCCCAGCAGTTAGTAGGGTCCCCTCTACCTCTCAGGGAGATGTAACAA
CGCCACCTTATGGGACTATCAAGCTGACGCTGGCTTCTGTGCAGACAAACTGCGCCCACGAGTT
CTTCCCTGACGCCGCTCTCGCGCAGGCAAGGGAACTCGATGAATACTACGCAAAGCACAAGAGA
CCCGTTGGTCCACTCCATGGCCTCCCCATCTCTCTCAAAGACCAGCTTCGAGTCAAGGTACACC
GTTGCCCCTAAGTCGTTAGATGTCCCTTTTTGTCAGCTAACATATGCCACCAGGGCTACGAAAC
ATCAATGGGCTACATCTCATGGCTAAACAAGTACGACGAAGGGGACTCGGTTCTGACAACCATG
CTCCGCAAAGCCGGTGCCGTCTTCTACGTCAAGACCTCTGTCCCGCAGACCCTGATGGTCTGCG
AGACAGTCAACAACATCATCGGGCGCACCGTCAACCCACGCAACAAGAACTC-GTCGTGCGGCGG
CAGTTCTGGTGGTGAGGGTGCGATCGTTGGGATTCGTGGTGGCGTCATCGGTGTAGGAACGGAT
ATCGGTGGCTCGATTCGAGTGCCGGCCGCGTTCAACTTCCTGTACGGTCTAAGGCCGAGTCATG
GGCGGCTGCCG7ATGCAAAGATGGCGAACAGCATGGAGGGTCAGGAGACGGTGCACAGCGTTGT
CGGGCCGATTACGCACTCTGTTGAGGGTGAGTCCTTCGCCTCTTCCTTCTTTTCCTGCTCTATA
CCAGGCCTCCACTGTCCTCCTTTCTTGCTTTTTATACTATATACGAGACCGGCAGTCACTGATG
AAGTATGTTAGACCTCCGCCTCTTCACCAAATCCGTCCTCGGTCAGGAGCCATGGAAATACGAC
TCCAAGGTCATCCCCATGCCCTGGCGCCAGTCCGAGTCGGACATTATTGCCTCCAAGATCAAGA
ACGGCGGGCTCAATATCGGCTACTACAACTTCGACGGCAATGTCCTTCCACACCCTCCTATCCT
GCGCGGCGTGGAAACCACCGTCGCCGCACTCGCCAAAGCCGGTCACACCGTGACCCCGTGGACG
CCATACAAGCACGATTTCGGCCACGATCTCATCTCCCATATCTACGCGGCTGACGGCAGCGCCG
ACGTAATGCGCGATATCAGTGCATCCGGCGAGCCC-GCGATTCCAAATATCAAAGACCTACTGAA
CCCGAACATCAAAGCTGTTAACATGAACGAGCTCTGGGACACGCATCTCCAGAAGTGGAATTAC
CAGATGGAGTACCTTGAGAAATGGCGGGAGGCTGAAGAAAAGGCCGGGAAGGAACTGGACGCCA
TCATCGCGCCGATTACGCCTACCGCTGCGGTACGGCATGACCAGTTCCGGTACTATGGGTATGC
CTCTGTGATCAACCTGCTGGATTTCACGAGCGTGGTTGTTCCGGTTACCTTTGCGGATAAGAAC
ATCGATAAGAAGAATGAGAGTTTCAAGGCGGTTAC-TGAGCTTGATGCCCTCC-TGCAGGAAGAGT
ATGATCCGGAGGCGTACCATGGGGCACCGGTTGCAGTGCAGGTTATCGGACGGAGACTCAGTGA agagaggacgttggcgattgcagaggaagtggggaagttgctgggaaatgtggtgactccatag
CTAATAAGTGTCAGATAGCAATTTGCACAAGAAATCAATACCAGCAACTGTAAATAAGCGCTGA
AGTGACCATGCCATGCTACGAAAGAGCAGAAAAAAACCTGCCGTAGAACCGAAGAGATATGACA
CGCTTCCATCTCTCAAAGGAAGAATCCCTTCAGGGTTGCGTTTCCAGTCTAGACACGTATAACG
GCACAAGTGTCTCTCACCAAATGGGTTATATCTCAAATGTGATCTAAGGATC-GAAAGCCCAGAA
TATCGATCGCGCGCAGATCCATATATAGGGCCCGGGTTATAATTACCTCAGGAAATAGCTTTAA
GTAGCTTATTAAGTATTAAAATTATATATATTTTTAATATAACTATATTTCTTTAATAAATAGG
TATTTTAAGCTTTATATATAAATATAATAATAAAATAATATATTATATAGCTTTTTATTAATAA
ATAAAATAGCTAAAAATATAAAAAAAATAGCTTTAAAATACTTATTTTTAATTAGAATTTTATA
IATTTTTAATArATAAGATCTTTTACTTTTTTATAAGCTTCGTACGTTAAATTAAATTTTTAGT
TTTTTTTACTArTTTACTATATCTTAAATA^GGCTTTAAAAATATAAAAAAAATCTTCTTATA
TATTATAAGCTATAAGGATTATATATATATTTTTTTTTAATTTTTAAAGTAAGTATTAAAGCTA
GAATTAAAGTTTTAATTTTTTAAGGCTTTATTTAAAAAAAGGCAGTAATAGCTTATAAAAGAAA
TTTCTTTTTCTTTTATACTAAAAGTACTTTTTTTTTAATAAGGTTAGGGTTAGGGTTAGGGTTA
GGGTTAGGGTTAGGGTTAGGGTTAGGGTTAGGGTTAGGGTTAGGGTTAGGGTTAGGGTTAGGGT
TAGGGTAAGGGTTTAAACAAAGCCACGTTGTGTCTCAAAATCTCTGATGTTACATTGCACAAGA
TAAAAATATATCATCATGAACAATAAAACTGTCTGCTTACATAAACAGTAATACAAGGGGTGTT
A7GAGCCATAT7CAACGGGAAACG7C77GCTCGAGGCCGCGA77AAAT7CCAACA7GGATGCTG
ATTTATATGGGTATAAATGGGCTCGCGATAATGTCGGGCAATCAGGTGCGACAATCTATCGATT
CTATCGCAACCCCGATCCCCCACACTTCTTTCTGAAACATGGCAAAGCTACCCTTCCCAATCAT
GTTACAGATGAGATGGTCAGACTAAACTGGCTGACGGAATTTATGCCTCTTCCGACCATCAAGC
ATTTTATCCGTACTCCTGATGATGCATGGTTACTCACCACTGCGATCCCCGGGAAAACAGCATT
CCAGGTATTAGAAGAATATCCTGATTCAGGTGAAAATATTGTTGATGCGCTGGCAGTGTTCCTG
CGCCGGTTGCA7TCGATTCCTGTTTGTAATTGTCCTTTTAACAGCGATCGCC-TATTTCGTCTCG
CTCAGGCGCAA7CACGAATGAATAACGGTTTGGTTGATGCGAGTGATTTTGATGACGAGCGTAA
TGGC7GGCC7G77GAACAAG7C7GGAAAGAAA7GCA7AAGCT777GCCAT7C7CACCGGA77CA G7CG7CAC7CA7GG7GA777C7CAC77GA7AACC77A7777TGACGAGGGGAAA77AA7AGG77
GTATTGATGTTGG7i.CGAGTCGGAATCGCAGACCGATACCAGGATCTTGCCATCCTATGGAxACTG CC7CGG7GAG7777C7CC77CA77ACAGAAACGGC77777CAAAAA7A7GG7A77GA7AA7CC7 GA7A7GAA7AAA77GCAG777CA777GA7GC7CGA7GAG77777C7AA7CAGAA77GG77AA77
GGTTGTAACAC7GGCAGAGCATTACGCTGACTTGACGGGACGGCGGCTTTGTTGAATAAATCGA
ACT77TGC7GAGTTGAAGGATCAGA7CACGCATC7TCCCGACAACGCAGACCGTTCCGTGGCAA AGCAAAAG77CAAAA7CACCAAC7GG7CCACC7ACAACAAAGC7C7CA7CAACCG7GGC7CCC7
CAC77TC7GGC7GGA7GA7GGGGCGA77CAGGCC7GG7ATGAG7CAGCAACACCT7CT7CACGA
GGCAGACC7CAGCGG777AAACCTAACCCTAACCC7AACCCTAACCC7AACCC7AACCC7AACC
C7AACCCTAACCCTAACCC7AACCCTAACCCTAACCC7AACCCTAACCTAACCCTAATGGGGTC
GA7C7GAACCGAGGA7GAGGGT7C7A7AGACTAA7CTACAGGCCG7ACATGC-7GTGAT7GCAGA
TGCGACGGGCAAGGTGTACAGTGTCCAGAAGGAGC-AGAGCGGCATAGGTATTGTAATAGACCAG C777ACA7AA7AA7CGCC7G77GC7AC7GAC7GA7GACC77C77CCC7AACCAG777CC7AA77
ACCAC7GCAG7GAGGA7AACCC7AAC7CGC7C7GC-GG77A7TA77A7ACTGA77AGCAGG7GGC
T7A7ATAG7GC7GAAG7AC7A7AAGAG77TCTGCC-GGAGGAGG7GGAAGGAC7ATAAAC7GGAC
ACAG7TAGGGA7AGAG7GA7GACAAGACCTGAATGTTATCCTCCGGTGTGGTATAGCGAA7TGG
C7GACC77GCAGA7GG7AA7GG777AGGCAGGG77777GCAGAGGGGGACGAGAACGCG77C7G
CGA777AACGGC7GC7GCCGCCAAGC777ACGG77C7C7AATGGGCGGCCGCC7CAGG7CGACG
TCCCATGGCCA7TCGAAT7CGTAA7CA7GGTCATAGC7GT7TCCTGTGTGAAATTGTTATCCGC TCACAA77CCACACAACA7ACGAGCCGGAAGCATAAAGTG7AAAGCC7GGGC-7GCCTAA7GAG7
GAGC7AAC7CACA77AA77GCG7TGCGC7CAC7GCCCGCT7TCCAGTCGGGAAACCTG7CG7GC
CAGCTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTC-GGCGCTCTTCCG
C77CC7CGC7CAC7GAC7CGC7GCGC7CGG7CG77CGGC7GCGGCGAGCGG7A7CAGC7CAC7C
AAAGGCGG7AA7ACGG77A7CCACAGAA7CAGGGGATAACGCAGGAAAGAACA7G7GAGCAAAA
GGCCAGCAAAAGGCCAGGAACCG7AAAAAGGCCGCG77GC7GGCG7777TCCA7AGGC7CCGCC
CCCC7GACGAGCA7CACAAAAA7CGACGC7CAAG7CAGAGGTGGCGAAACCCGACAGGAC7A7A AAGA7ACCAGGCG777CCCCC7GGAAGC7CCC7CG7GCGC7C7CC7G77CCC-ACCC7GCCGC77
ACCGGA7ACCTGTCCGCC77TC7CCC77CGGGAAGCG7GGCGCTT7CTCA7AGCTCACGC7G7A
GGTATCTCAGT7CGGTGTAGGTCGTTCGCTCCAAC-CTGGGCTGTGTGCACGAACCCCCCGTTCA
GCCCGACCGCTGCGCC7TA7CCGG7AAC7ATCGTCTTGAG7CCAACCCGG7AAGACACGACT7A
TCGCCAC7GGCAGCAGCCAC7GG7AACAGGA77AGCAGAGCGAGG7A7GTAC-GCGG7GC7ACAG
AGT7CT7GAAG7GG7GGCC7AACTACGGCTACAC7AGAAGAACAG7ATTTGG7ATCTGCGCTCT GC7GAAGCCAG7TACC77CGGAAAAAGAGTTGGTAGC7CT7GA7CCGGCAAACAAACCACCGC7
GG7AGCGG7GG7777777G777GCAAGCAGCAGA77ACGCGCAGAAAAAAAC-GA7C7CAAGAAG A7CC7T7GA7C7T77C7ACGGGGTC7GACGCTCAGTGGAACGAAAAC7CACG77AAGGGA7777
GGTCATGAGAT7A7CAAAAAGGATG77CACCTAGATCCTT7TAAA7TAAAAA7GAAGT777AAA
TCAA7C7AAAG7A7A7A7GAG7AAAC77GG7C7GACAG77ACCAA7GC7TAA7CAG7GAGGCAC
C7A7C7CAGCGA7C7G7C7A777CG77CA7CCA7AG77GCCTGAC7CCCCG7CG7G7AGA7AAC
TACGA7ACGGGAGGGC77ACCA7C7GGCCCCAG7C-C7GCAATGA7ACCGCGAGACCCACGC7CA
CCGGCTCCAGA7TTATCAGCAA7AAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCC7G
CAAC7T7A7CCGCC7CCA7CCAGTC7A77AATTG7TGCCGGGAAGCTAGAGTAAG7AGTTCGCC AG77AA7AG777GCGCAACG77G77GCCA77GC7ACAGGCATCG7GG7GTCACGC7CG7CG777
GGTA7GGC77CATTCAGC7CCGGT7CCCAACGATCAAGGCGAGTTACATGATCCCCCATG7TGT
GCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTT ATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTT TCTCTCACTCCTCACTACTCAACCAACTCATTCTCACAATACTCTATCCCCCCACCCACTTCCT CTTGCCCGGCGTCAATACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCAT TGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATG TAACOCAOTOGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACOAGOGTTTOTGGGTGAG CAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACT CATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATAC ATATTTGAATG7ATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGC CACCTGACGTC7AAGAAACCATTATTATCATGACATTAACCTATAAAAATAGGCGTATCACGAG GCCCTTTCGTCTCGCGCGTTTCGGTGATGACGGTGAAAACCTCTGAC7i.CATGCAGCTCCCGGAG ACGGTCACAGC7TGTCTGTAAGCGGATGCCGGGAGCAGACAAGCCCGTCAGGGCGCGTCAGCGG GTGTTGGCGGGTGTCGGGGCTGGCTTAACTATGCGGCATCAGAGCAGATTGTACTGAGAGTGCA CCATAAAATTG7AAACGTTAATATTTTGTTAAAATTCGCGTTAAATTTTTGTTAAATCAGCTCA TTTTTTAACCAATAGGCCGAAATCGGCAAAATCCCTTATAAATCAAAAGAATAGCCCGAGATAG GGTTGAGTGTTGTTCCAGTTTGGAACAAGAGTCCACTATTAAAGAACGTGGACTCCAACGTCAA AGGGCGAAAAACCGTCTATCAGGGCGATGGCCCACTACGTGAACCATCACCCAAATCAAGTTTT TTGGGGTCGAGGTGCCGTAAAGCACTAAATCGGAACCCTAAAGGGAGCCCCCGATTTAGAGCTT GACGGGGAAAGCCGGCGAACGTGGCGAGAAAGGAAGGGAAGAAAGCGAAAGC-AGCGGGCGCTAG GGCGCTGGCAAGTGTAGCGGTCACGCTGCGCGTAACCACCACACCCGCCGCC-CTTAATGCGCCG CTACAGGGCGCGTACTATGGTTGCTTTGACGTATC-CGGTGTGAAATACCGCACAGATGCGTAAG GAGAAAATACCGCATCAGGCGCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCG GTGCGGGCCTC7TCGCTATTACGCCAGCTGGCGAAAGGGGGATGTGCTGCAAGGCGATTAAGTT GGCtTAACGCCAGCtGTTTTCCCAGTCACGACGTTGTAAAACGACGGCCAGTGCCCAAGCTTACTA GATGCATGCTCGAGCGGCCGCCAGTGTGATGGATATCTGCAGAATTCGCCCTTGACTAGTGCTC TCTATCCTGGTGGCAGGCGTCAAGTACCCAGAGGCAGCAGCGGGCTTAGGAGCGGCCTGGGTTG TTCTCCGCACCCTCTACATGCTGGGCTATATTTATAGCGACAAGCCGAACGC-CACCGGCAGGTA CAATGGTTCGCTGTACTTGCTTGCGCAAGCGGGTCTTTGGGGATTGAGCGCATTTGGTGTTGCA AAGGATTTGATGTAAATGTAGTCGACATCTTAGCACAGAGGGGAGAGTTGATAAAATGTGGTCT GTTTGAATGATAGTCGGGTTCGTGACCTATATTCC-TGATAGTGGAGATAGGTCTGCGCCTATCT 
TATCGGGCCGGAGCAAAAATTCCACCGCAGCGGGC-TGAGTTTTCGTTATACAGCCATCCCACTT CCAGCTTCAAATTGTCAGTTTAATCCAGCCCAATTCAATCATTGGAGAACCC-CCATCATGTCTT CGAAGTCCCACCTCCCCTACGCAATTCGCGCAACCAACCATCCCAACCCTTTAACATCTAAACT CTTCTCCATCGCCGAGGAGAAGAAAACCAACGTCACCGTCTCCGCAGACGTTACTACTTCCGCC GAGCTCCTCGA7CTTGCTGACCGTACATCCTGCACCAATGCCCCTCCAGGATAACAAATAGCTG ATGCGTAGTGAGTACAGGCCTAGGCCCCTATATCGCAGTTCTGAAAACCCACATCGACATCCTC ACCGATCTCACCCCGTCGACCCTTTCCTCGCTCCAATCCCTCGCGACAAAGCACAACTTCCTCA TCTTTGAGGACCGCAAGTTCATCGACATCGGCAACACCGTGCAAAAGCAGTACCACGGTGGCGC TCTCCGCATCTCCGAATGGGCACACATCATCAACTGCGCCATCCTGCCGGGCGAAGGGATCGTC GAGGCCCTCGCACAGACAACCAAGTCTCCTGACTTTAAAGACGCGAATCAACGAGGTCTCCTGA TTCTTGCCGAGATGACGAGTAAGGGATCTCTTGCGACAGGGGAGTCACAGGCACGCTCGGTTGA GTACGCGCGGAAGTATAAGGGGTTTGTGATGGGATTCGTGAGTACAAGGGCC-TTGAGTGAGGTG CTGCCCGAACAGAAAGAGGAGAGCGAGGATTTTGTCGTCTTTACGACTGGGGTGAATCTGTCGG ATAAGGGGGATAAGCTGGGGCAGCAGTATCAGACACCTGGGTCGGCGGTTGC-GCGAGGTGCGGA GTTTATCATTGCGGGTAGGGGCATGTATAAGGCGGAGGATCGAGTCGAGGCGGTTCAGAGGTAC CGGGAGGAAGGCTGGAAAGCTTACGAGAAAAGAGTTGGACTTTGAGTGTGAC-TGGAAATGTGTA ACGGTATTGAC7AAAAGGGATCCATATGTTTATTGCAGCCAGCATAGTATTACCAGAAAGAGCC TCACTGACGGC7CTAGTAGTATTCGAACAGATATTATTGTGACCAGCTCTGAACGATATGCTCC CTAATCTGGTAGACAAGCACTGATCTACCCCTTGGAACGCAGCATCTAGGCTCTGGCTGTGCTC TAACCCTAACTAGACGATTGATCGCAGACCATCCAATACTGAAAAGTCTCTATCAGAGGAAATC CCCAACATTGTAGTAGTCAGGTTCCTTTGTGGCTC-GGAGAGAATTGGTTCGCTCCACTGATTCC AGTTGAGAAAGTGGGCTAGAAAAAAGTCTTGAAGATTGGAGTTGGGCTGTGGTTATCTAGTACT
ICTCGAGCTCTGTACATGTCCGGTCGCGACGTACGCGTATCGATGGCGCCAC-CTGCAGGCGGCC
GCCTGCAGCCACTTGCAGTCCCGTGGAATTCTCACGGTGAAIGTAGGCCTTTTGTAGGGTAGGA
ATTGTCACTCAAGCACCCCCAACCTCCATTACGCCTCCCCCATAGAGTTCCCAATCAGTGAGTC
ATGGCACTGTTCTCAAATAGATTGGGGAGAAGTTGACTTCCGCCCAGAGCTGAAGGTCGCACAA
CCGCATGATATAGGGTCGGCAACG5CAAAAAAGCACGTGGCTCACCGAAAAGCAAGATGTTTGC
GATCTAACATCCAGGAACCTGGATZ\CATCCATCATCACGCACGACCACTTTC-ATCTGCTGGTAA
ACTCGTATTCCtCCCTAAACCGAAGTGCGTGCtTAAATCTACACGTGGGCCCCTTTCGGTATACTG
CGTGTGTCTTCrCTAGGTGCCATTCTTTTCCCTTCCTCTAGTGTTGAATTGTTTGTGTTGGAGT
CCGAGCTGTAACTACCTCTGAATCTCTGGAGAATC-GTGGACTAACGACTACCGTGCACCTGCAT
CATGTATATAATAGTGATCCTGAGAAGGGGGGTTTGGAGCAATGTGGGACTTTGATGGTCATCA
AACAAAGAACGAAGACGCCTCTTTTGCAAAGTTTTGTTTCGGCTACGGTGAAGAACTGGATACT
TGTTGTGTCTTCTGTGTATTTTTGTGGCAACAAGAGGCCAGAGACAATCTATTCAAACACCAAG
CTTGCTCTTTTGAGCTACAAGAACCTGTGGGGTATATATCTAGAGTTGTGAAGTCGGTAATCCC gctgtatagtaatacgagtcgcatctaaatactccgaagctgctgcgaaccx.ggagaatcgaga
TGTGCTGGAAAGCTTCTAGCGAGCGGCTAAATTAC-CATGAAAGGCTATGAGAAATTCTGGAGAC
GGCTTGTTGAATCATGGCGTTCCATTCTTCGACAAGCAAAGCGTTCCGTCGCAGTAGCAGGCAC
TCATTCCCGAAAAAACTCGGAGATTCCTAAGTAGCGATGGAACCGGAATAATATAATAGGCAAT
ACATTGAGTTGCCTCGACGGTTGCAATGCAGGGGTACTGAGCTTGGACATAACTGTTCCGTACC
CCACCTCTTCTCAACCTTTGGCGTTTCCCTGATTCAGCGTACCCGTACAAGTCGTAATCACTAT
TAACCCAGACTGACCGGACGTGTTTTGCCCTTCATTTGGAGAAATAATGTCATTGCGATGTGTA
ATTTGCCTGCTTGACCGACTGGGGCTGTTCGAAGCCCGAATGTAGGATTGTTATCCGAACTCTG ctcgtagaggcatgttgtgaatctgtgtcgggcaggacacgcctcgaaggttcacggcaaggga
AACCACCGATAGCAGTGTCTAGTAGCAACCTGTAAAGCCGCAATGCAGCATCACTGGAAAATAC
AAACCAATGGCrAAAAGTACATAAGTTAATGCCTAAAGAAGTCATATACCAC-CGGCTAATAATT
GTACAATCAAGGGGCTAAACGTACCGTAATTTGCCAACGGCTTGTGGGGTTGCAGAAGCAACGG
CAAAGCCCCACGTCCCCACGTTTGTTTCTTCACTCAGTCCAATCTCAGCTGC-TGATCCCCCAAT
TGGGTCGCTTGTTTGTTCCGGTGAAGTGAAAGAAGACAGAGGTAAGAATGTCTGACTCGGAGCG
TTTTGCATACAACCAAGGGCAGTGATGGAAGACAC-TGAAATGTTGACATTCAAGGAGTATTTAG
CCAGGGATGCTTGAGTGTATCGTGTAAGGAGGTTTGTCTGCCGATACGACGAATACTGTATAGT cacttctgatgaagtCtGtccatattgaaatgtaaagtcggcactgaacaggc.aaaagattgagt
TGAAACTGCCTAAGATCTCGGGCCCTCGGGCCTTCGGCCTTTGGGTGTACATGTTTGTGCTCCG
GGCAAATGCAAACtTGTGGTAGGATCGAACACACTC-CTGCCTTTACCAAGCAC-CTGAGGGTATGT GATAGGCAAATGTTCAGGGGCCACTGCATGGTTTCGAATAGAAAGAGAAGCTTAGCCAAGAACA ATAGCCGATAAAGATAGCCTCATTAAACGGAATGAGCTAGTAGGCAAAGTCAGCGAATGTGTAT ATATAAAGGTTCGAGGTCCGTGCCTCCCTCATGCTCTCCCCATCTACTCATCAACTCAGATCCT CCAGGAGACTTGTACACCATCTTTTGAGGCACAGAAACCCAATAGTCAACCATCACAAGTTTGT ACAAAAAAGCAGGCTCACCATGAGCAACCTGCTCACCGTCCACCAGAACCTCCCTGCCCTCCCT GTCGACGCCACCTCTGACGAGGTCGGCAAGAACCTCATGGAGATGTTCCGCr-AGCGCCAGGCCT TTAGCGAGCACACCTGGAAGATGCTCCTCAGCGTCTGCCGATCTTGGGCCGCCTGGTGCAAGCT CAACAACCGCAAGT GGTTCCCCGCCGAGCCGGAGGACGTCCGCGACTACCTCCTCTACCTGCAG GCCCGAGGCCTGGCCGTCAAGACCATCCAGCAGCACCTCGGCCAGCTCAACATGCTCCACCGAC GCTCTGGCCTGCCTCGCCCTAGCGACTCTAACGCCGTCAGCCTGGTCATGCGCCGCATCCGCAA CCACAACCTCCACCCTCCCCAGCGACCCAACCAGCCCCTCCCCTTCGAGCCCACCCACTTCCAC CAGGTCCGCAGCCTCATGGAGAACAGCGACCGCTGCCAGGATATCCGCAACCTCGCCTTTCTCG GCATTGCCTACAACACCCTGCTCCGCATTGCCGAGATCGCCCGCATCCGCGTCAAGGACATCTC tcCtCaccgacggcggccgcatgctcattcacatcggccgcaccaagaccctcgtgtctaccgcc GGCGTCGAGAAGGCCCTCAGCCTCGGCGTCACCAAGCTCGTCGAGCGCTGGATTTCTGTCTCCG GCGTCGCTGACGACCCCAACAACTACCTCTTCTGCCGCGTCCGAAAGAACGC-CGTCGCCGCCCC TTCTGCCACCTCTCAGCTCAGCACCCGAGCCCTGGAGGGCATCTTTGAGGCCACCCACCGCCTC ATCTACGGCGCCAAGGACGACTCTGGCCAGCGCTACCTCGCCTGGTCTGGCCACTCTGCCCGAG TCGGCGCTGCCCGAGACATGGCCCGAGCCGGCGTCAGCATCCCCGAGATTATGCAGGCCGGCGG
CTGGACCAACGTCAACATCGTCATGAACTACATCCGCACCCTCGACTCTGAGACCGGCGCCATG
GTCCGACTCCTCGAGGACGGCGACTAAACCCAGCTTTC
Creation of the MAD6 strain from the A5D T. reesei strain.
[0113] Native egl3 was deleted from the A5D strain (above) using a method similar to the one previously described for the bgl1 deletion (see supra). A schematic of the egl3 deletion cassette is shown in Figure 6. Hygromycin resistance was used as the selectable marker for the egl3 deletion. The hygromycin resistance marker was flanked by loxP sites. A transformant was confirmed to have a deletion of egl3. The hygromycin marker was removed from this strain as described for the creation of the A5D strain. The telomeric vector encoding the cre recombinase was then removed from this strain, also as described for the creation of the A5D strain. The nucleotide sequence of the egl3 deletion cassette is provided as SEQ ID NO:6:
GGGAGGTAGGCGCAGATACGGTGCATGGGACCCGAACCCGTAACCGGAACACGACCTTATCAGC CCTCCAACTCACACCCTCTCGCCTATCACTATCCTAGATAGTTCATCGGCCAACTCATGTAACC TAGCTACCTACCTACCTGGTAAGAATGCGGGCTATCATGTCTCACGGCGCGGTACATGTCGGTA TCTCGCTGCTTCCCCGCAGGTTGACGTCGGAATCCATGCAAGTACTCCCTGAAATCGAGACGAC AGAGAGAACAACCAACGCGCTTAAACGCTTCATGTTCATCTAAGAGGCACATTCGAAGAACTAG CTTAACACACTAGACCTGGCTTTTCGACCCCCTCCGCAGAAAGCCGTTTTCTCCTCAATCCTCC CGGGCTTGGCT7TTGTCAGTCCGTACTTGCTGCGCTAACAGAGTCTTGGACC-CAGCGTTTGCGC ATCAGTCTTGCAGGCGGTTCACGGGACTAGGACAACAGGGGATGTGACAGGCCGGATAGTAATT ATGGGTTATCCGGGGTAAGCAGGGAATTTACGAGGCCGCTTTACGTGGGGGAACAGCCACTTGC GGGGGGAAGAGGAGTAGTAGGCGACTCGGTCGATGAGCTCGAGGTGTCTGGTTGACTTGGACTG CAGAGCGTAGG7AATTGAGATCGGGCAACATTATCGGTGTTCGGCTCGGTATGGCCGAGTTGCG ACTGCTTGGTCATTCGGCGAAGCTGATGTCGTGGTATCCTGAAGCATCGATATCGGAAACCATG ATGGTCAGTCTATCTGACGTGTGCGGTGACAAGCGAGTCCGGATTTTGTGACATGACGTTCAAC TTCAGTCAATGCCTTAGGGCTCGATAAGATTAAGATTGGGTTCTGGCAGCGGTCTAGAACACCG CCACAAATTCTGTCCATTGAGGAGCGTGATGTCTAGGCGCATCACTAACACGGAGCTGTATGAC GGGCAGCTCAACGGACTICTCTTCGTTCAACGGCAGTCTATTTGCGGTACAGGAATGGATCTTT CTTCCTGGTCTTGAAGTGCCGCAGTGGCGTGCGAATGTATAGATGTCTCGCTACCTAGAAAAGC TGGCTTTTCTGACAGGGTCCCTTCCACCTCTCCTACCAACGACAAACTGAACAAGTATCTGGCG GTTTCCCAACGCCGAATAGGCCAGTCGCCAATACTCCCTCCAGCCCTGATTC-GGCCCCTCGAAG TATCGCCATGTCTGTGTGTTGAGATTATTCGATGGACGTCACTCCCCCAACCTACAGGAAGAGC AAAATGGGAGCAGTGTTCTGCAATGAGCTATATAATAGATCGCTCGATCTCATACAAATTGTAT GCTCAGTCAATACAACGAGCGGTTCCAAGATCCCTTCTCCAACGACCCTCGAAACATTGCAACC CGGTGCAGCCTGAACTTGTTCGTATAGCCTAGAAAGCGACGCCATCTTCATCTTTTACGCGATT AGCCTCATGGCGATTTGIGCCGAAGTGGGAGTTGTATGGTAGCAGTGAGGAGATTGTGGCTACG ACACAGGCGGGTTCTCTTGAGCGGCTTACATCTCCGCATTAGGCCTGCGTACGATCCAGATCAT GGGAAACTTTACAATGGCTTACTCGTTTTATCTCAACACTGAGCTTCCAATTCACTCTATGCAT TGATTAACACGTTTGGTCATGTGGTTCTTCAGCTC-TAAATCTTCAGCTTCCCAAGAATTGCAAC CTCGCTGATTGCTAATAGTGTTGCATGCGTTGCATCCTGGTGCGGCAGTGCAAAGGAGAGTCAA AGTAGCCGGCAGATTAATTTAAGCTTATATCACTCAGGGGTAAACAGCCGTAAAGGACCTTTTG ATCTAACATGCCGATGTGTATGTAGATCACGCAATGCCCACCATATCTTGGCAGTCAGATTTGT 
CCGTGGCGCGCCAAGTATAACTTCGTATAATGTATGCTATACGAAGTTATCC-GCCGGCGTATTG GGTGTTACGGAGCATTCACTAGGCAACCATGGTTACTATTGTATACCCATCTTAGTAGGAATGA TTTTCGAGGTTTATACCTACGATGAATGTGTGTCCTGTAGGCTTGAGAGTTCAAGGAAGAAACA GTGCAATTATC7TTGCGAACCCAGGGGCTGGTGACGGAATTTTCATAGTCAAGCTATCAGAGTA AAGAAGAGGAGCATGTCAAAGTACAATTAGAGACAAATATATAGTCGCGTGGAGCCAAGAGCGG
ATTCCTCAGTC7CGTAGGTCTCTTGACGACCGTTGATCTSCTTGATCTCGTCTCCC3AAAATGA
AAATAGACTCTGCTAAGCTATTCTTCTGCTTCGCCGGAGCCTGAAGGGCGTACTAG3GTTGCGA
CCTCCAATCCATTAATCCATTCCACATCACCTCTATCTCCAACACCTAAACCCGAAACCCCITT
TATTCTTGTTGACATGGAGCTATTAAATCACTAGAAGGCACTCTTrGCTGCTTGGACAAATGAA
CGTATCTTATCGAGATCCTGAACACCATTTGTCTCAACTCCGGAGCTGACAICGACACCAACGA
TOTTATATCOAGATTOGTCAAGCTGTTTGATGATTTOAGTAACGTrAAGTGGATOCGGGTCGGC
ATCTACTCTAT7CCTTTGCCCTCGGACGAGTGCTCGGGCGTCGGT7TCCACTATCGGCGAGTAC
ITCTACACAGCCATCGGTCCAGACGGCCGCGCTICTGCGGGCGATrTGTGTACGCCCGACAGTC
CCGGCTCCGGA7CGGACGATTGCGTCGCATCGACCCTGC3CCCAAGCTGCAICATC3AAATTGC
CGTCAACCAAGCTCTGATAGAGTTGGTCAAGACCAATGCGGAGCA7ATACGCCCGGAGCCGCGG
CGATCCTGCAAGCTCCGGATGCCTCCGC7CGAAGTAGCGCGTCTGCTGCTCCATACAAGCCAAC
CACGGCCTCCAGAAGAAGATGTTGGCGACCTCGTATTGGGAATCCCCGAACATCGC3TCGCTCC
AGTCAATGACCGCTGTTATGCGGCCATTGTCCGTCAGGACATTGT7GGAGCCGAAATCCGCG7G
CACGAGGTGCCGGACTTCGGGGCAGTCC7CGGCCCAAAGCATCAGCTCATCGAGAGCCTGCGCG
ACGGACGCACTGACGGTGTCGTCCATCACAGTTTGCCAGTGATACACATGGGGATCAGCAATCG
CGCATATGAAATCACGCCATGTAGTGTATTGACCGATTCCTTGCGGTCCGAATGGGCCGAACCC
GCTCGTCTGGCTAAGATCGGCCGCAGCGATCGCATCCATGGCCTCCGCGACCGGCTGCAGAACA
GCGGGCAGTTCGGTTTCAGGCAGGTCTTGCAACGTGACAGCCTGTGCACGGCGGGAGATGCAAI
AGGTCAGGCTC7CGCTGAATTCCCCAATGTCAAGCACTTCCGGAA7CGGGAGCGCGGCCGATGC
AAAGTGCCGATAAACATAACGATCTTTG7AGAAACCATCGGCGCAGCTATTIACCCGCAGGACA
TATCCACGCCCTCCTACATCGAAGCTGAAAGCACC-AGATTCTTCGCCCTCCGAGAGCTGCATCA
GGTCGGAGACGCTGTCGAACTTTTCGATCAGAAACTTCTGGACAGACGTCGCGGTGAGTTCAGG
CTTTTTCATATCGATTGTGATGTGATGGAGTTGAC-ATGGAGGTGAGGAGATGGATGATGGGAAA
GGAAGATGGAC7GAGGATGGAAGAAGAGAAGAAGAGAGA3AGAGAAAGTCTICCAG3AGAGAAA
GGGAACCGAAGAAAAATGGGGAGGAAACCGGCCCTAGCACCTAAA7ACGTCICCCGCTTAAT7T
TCGCCCTTTTT7CACCAAACCCTCCGCG7CTTTCGTGCGCTAGCTGTCTTGGGGGGTGTGTAAA
ACTTGGGAACAACCCTACGCCGAACCTCCCGTACGAAGCCCGTACAGTGTATTCTATCCCTGGC
TTTCCCAGCCTGGTAAGTCGGGTCCCTTCGGGACGGGGCCAAGGAGACTGAGTTTCCGGGTTAA
CCAATAATGCCGCCAGCCGTGGAGCGGTCTGAGCTGTCTATCGTGAATCCG1GACGCTGAAT7G
CTCAGTCCAAG7CGGAGACGCTGGAATCCACCGGTTGCTGCAGCCACGGCGAAGAATCCACT7A
CCTTCCGGGCT7CCGCCAGCTGGCAACA7ATTTTTGAGGCTGATA7AGCCT1CTCCTCCATCAC
GATAAGCCCTGACTACCTTGCTACGGGCCAATTGCAATTTGTTTGCTGGTTACGCTTTACAAAA
GGTGGCCGTTACTGAGCAAAAAGAAAATGATGTAAAAATTTGCGAGTGGGTCCCATAGCTGGAT
GGGTCCGATAAAATGGTACTGCCCCACT7AGTGGCAGCTCGCGACCAGTCACAAGCCCAGGA7A
ACTTCGTATAA7GTATGCTATACGAAGT7ATCTGTGGGCGTTATGAATAATAGACT3GAACCGG
GCCCTTTGATTGACGACTCCATATTTTGTAGATGTAGCAACTCGGCAAGAGCATTATGTGCAAT
ACATTTGTTACCATACAAAGGCAGCTGCCAGACGACTTGTATTGCGTACAATTCTCACGGCAAG
CTTTCCAGGTG7TATGCATTATGCGCAAATGCTTGATGCTTACCGCAGGAT1AATCTCGGAAGA
AGCGCTGCAAGCTATATGGGTGTAGTAGATATGTAGATGTACCAACCAATGAAGAACATTTATG
GTCTAGAACGTAGTGATGAAGGTTTTGAGTAATTTGTATCAAGTAAGACGATATTATTGATA7A
ATACCAAGCATATATTCATGATAAATTACTTGGAACCACCCTTGCGTCCGGCCTCACGAGCC7T
CTCACTGCCGGGCTCGAAGGAGCCACTGGAGGCCTGTCCACCCTTGGATGCGATTTGCTGCACC
TTTTCCTTGGGCCTGCACGTCGATTAGACATGATTCAAATCGAGA7CTTGGAATAT7TTACA7G
CTGGCGAAGCCACCGGTGTGGCTGGACTGTCCGCCCTTCTGCGCAATGCTTTGAACGTCCTCCT
TGGGGCTGTGTAGAAAGGTTTGTTAGCAACATTAGTACAACTCTCAGGAGTGGGTGGTGGTACC
GGTTGGCGAAG7TTCCGGGGTTATCGTTGCCAGACATTGTGTGAT7ATTTGGTGTGCAAATG7G
TGCTATGTGTGTTGTTGCTGTTGGTGATGATGCTGAAGCTGTTGAAAGCAGGCTGGTTCTGTGG
GAGAGACTTGGGATATTTATATCCAAAG7TCGGTCGTGTTCCTTC7GGAAGCTCTTCTCTAC7C
CATACAATCATCCAAAGTTGTCGTCATTGAGCGTTGATCAGTAGTAGCCTCTGAGGTCATCACC
ATGATCCTTCCGGCCAACAGTCGGCACTCATCAACAGCAACAATCAGCCGCCACAAACATAGGT
ACAGTAAGGAG7TAGATATCATGTAGTCGTCGAGTACTCSACATCATGACGTACAA3CTTTGCC
AGTGTCGGTAGGTGCAAGTATGATGATCGTATCCGCCGTTGTTCGATCGAACAGAGTGCGGTCA
GATTCACGGTTTCTCTCACCTTGAACAT7GGATGCAATTGGATTGATCCACAATCCTGGAGAAT
GGCTTCAAGCTCACTGCTCCAGTCGCAAGCTTCAGAGCCTATTAC7AAGGGTAGAGCTACCTAT
GTCAAGAGTTTTCAAGGTACCTAAGCTACATGTGATAGTCGGCAAGCCATTITGAACGCAGACC
GTGAACGGTGATGTAAATCCGGGATAGACGCCCAAGCGTGCCGTG7CAATGACGCTAGATACAC
CTCGATTTACGTAGAGTGAATGCCAGCCAATGGAGTCATGCACATAACCCGCTTAGACTCTGCT
CGGGGCGATACCCGATCGCAGAGGCAGAGCCGCTTAAACGCGATCGCGGTAACCTGTAATCAGA
GCCAGCGCTCGATGAATTGCATCATGGAAGCCATTGATGTGGAATGTTGAGCGTATAACAACAC
GAATTGAAGACGACATTGACTT3CTTCAAGTGAGTGGAGAATTGCCGGGCAGACAAGATAGGTA
GGCTCTTGGTGCGCTGTCACATCAATCCATTCCTTTTCCTCTGTTCAATCTCTATGTTGACATT
CTGATAGGGATCATTGGATGCCAATGCAAAGAACATGAGAGTGTGGTCTGCATTCAAGTATCCT
GGTCGTAAGCTGTGGCCATGGGGGCTGCGGTCAAGGTCAATCGCGATGACTAATCAGTCTCGGT
GACTCTGGGGCGGTAGAGGCAGTGTCGTGAACCAAAGCTTGAGCCGAGGGCAAAAACAACGGCG
CATCAAACAATCAACGAAAGCATCGTCAACAGTGTCTCTTCCCAG7CAATTACTTCGCAAAACC
TTCTCGATAGAACCCTTCAGACGATGAACAGGCCACGCAACCGTCAGCCGCGCCCCCCAGGACA
GACTCAGCGCCCGGGAGGCAGATCGTCACACCTTGGTCGACGAGCrC
EXAMPLE 2
Enzyme Expression Comparison: Quad Deleted Strain vs. MAD6 Strain
1. Preparation of the Vectors
[0114] Two glycosyl hydrolase family 43 proteins were expressed in the T. reesei strains, Archy3 and the quad deleted strain. The genes, M3B and M3C, were cloned from Fusarium verticillioides genomic DNA and assembled into expression vector pTrex3gM using the Gateway® cloning system (Invitrogen). Both genes were initially cloned into the pENTR/D-TOPO vector (Invitrogen), as depicted in Figure 9.
[0115] The genes were subsequently recombined into vector pTrex3gM, in which the cbh1 promoter is upstream of the coding sequence of the gene of interest, and the cbh1 terminator is downstream of the stop codon of the gene of interest. The vector additionally contains the Aspergillus nidulans acetamidase (amdS) selectable marker 3' of the cbh1 terminator. The vector is depicted in Figure 10.
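The 5' to 3' arrangement of elements described for pTrex3gM can be written out as a small data model. The following Python sketch is illustrative only: the names `Feature`, `build_cassette`, `EXPECTED_ORDER` and `has_expected_order` are hypothetical and not part of the disclosure, and the model records only the order of elements stated above, not any sequence coordinates.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Feature:
    """One element of an expression cassette (hypothetical helper type)."""
    name: str
    role: str  # one of: "promoter", "cds", "terminator", "marker"


# Order of roles, 5' to 3', as stated in the description of pTrex3gM.
EXPECTED_ORDER = ["promoter", "cds", "terminator", "marker"]


def build_cassette(gene_name: str) -> List[Feature]:
    """Return the cassette elements in 5' to 3' order for a gene of interest."""
    return [
        Feature("cbh1 promoter", "promoter"),      # upstream of the coding sequence
        Feature(gene_name, "cds"),                 # gene of interest
        Feature("cbh1 terminator", "terminator"),  # downstream of the stop codon
        Feature("amdS", "marker"),                 # selectable marker, 3' of the terminator
    ]


def has_expected_order(cassette: List[Feature]) -> bool:
    """Check that the element roles appear in the expected 5' to 3' order."""
    return [feature.role for feature in cassette] == EXPECTED_ORDER
```

For example, `has_expected_order(build_cassette("Fv43B"))` evaluates to `True`, whereas a cassette with the marker placed ahead of the promoter would fail the check.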
[0116] The resulting Fv43B expression vector, pTrex3gM-Fv43B, is shown schematically in Figure 11.
[0117] The full nucleotide sequence of the Fv43B glycosyl hydrolase in expression vector pTrex3gM is provided below as SEQ ID NO:14.
TTGTACAAAGTGGTGATCGCGCCGCGCGCCAGCTCCGTGCGAAAGCCTGACC-CACCGGTAGATT
CTTGGTGAGCCCGTATCATGACGGCGGCGGGAGCTACATGGCCCCGGGTGATTTATTTTTTTTG
TATCTACTTCTGACCCTTTTCAAATATACGGTCAACTCATCTTTCACTGGAGATGCGGCCTGCT
TGGTATTGCGATGTTGTCAGCTTGGCAAATTGTGGCTTTCGAAAACACAAAACGATTCCTTAGT
AGCCATGCATT7TAAGATAACGGAATAGAAGAAAGAGGAAATTAAAAAAAAAAAAAAAACAAAC
ATCCCGTTCATAACCCGTAGAATCGCCGCTCTTCGTGTATCCCAGTACCAGTTTATTTTGAATA
GCTCGCCCGCTGGAGAGCATCCTGAATGCAAGTAACAACCGTAGAGGCTGACACGGCAGGTGTT
GCTAGGGAGCG7CGTGTTCTACAAGGCCAGACGTCTTCGCGGTTGATATATATGTATGTTTGAC
TGCAGGCTGCTCAGCGACGACAGTCAAGTTCGCCCTCGCTGCTTGTGCAATAATCGCAGTGGGG
AAGCCACACCGTGACTCCCATCTTTCAGTAAAGCTCTGTTGGTGTTTATCAGCAATACACGTAA
TTTAAACTCCT7ACCATCGCGCTGATACCTTAATTACCGTTTACCAGTGCCATCGTTCTCCACC
TTTCCTTGGCCCGTAAAATTCGGCGAAGCCAGCCAATCACCAGCTAGGCACCAGCTAAACCCTA
TAATTAGTCTCTTATCAACACCATCCGCTCCCCCGGGATCAATGAGGAGAATGAGGGGGATGCG
GGGCTAAAGAAGGCTACATAACCCTCATGCCAACTOCOAGTTTACACTCGTCGAGOCAACATOC
TGACTATAAGCGAACACAGAATGCCTCAATCCTGC-GAAGAACTGGCCGCTGATAAGCGCGCCCG
CCTCGCAAAAACCATCCCTGATGAATGGAAAGTCCAGACGCTGCCTGCGGAAGACAGCGTTATT
GATTTCCCAAAGAAATCGGGGATCCTTTCAGAGGCCGAACTGAAGATCACAC-AGGCCTCCGCTG
CAGATCTTGTG7CCAAGCTGGCGGCCGGAGAGTTGACCTCGGTGGAAGTTACGCTAGCATTCTG
TAAACGGGCAGCAATCGCCCAGCAGTTAGTAGGGTCCCCTCTACCTCTCAGGGAGATGTAACAA
CGCCACCTTATGGGACTATCAAGCTGACGCTGGCTTCTGTGCAGACAAACTGCGCCCACGAGTT
CTTCCCTGACGCCGCTCTCGCGCAGGCAAGGGAACTCGATGAATACTACGCAAAGCACAAGAGA
CCCGTTGGTCCACTCCATGGCCTCCCCATCTCTCTCAAAGACCAGCTTCGAC-TCAAGGTACACC
GTTGCCCCTAAGTCGTTAGATGTCCCTTTTTGTCAGCTAACATATGCCACCAGGGCTACGAAAC
ATCAATGGGCTACATCTCATGGCTAAACAAGTACGACGAAGGGGACTCGGTTCTGACAACCATG
CTCCGCAAAGCCGGTGCCGTCTTCTACGTCAAGACCTCTGTCCCGCAGACCCTGATGGTCTGCG
AGACAGTCAACAACATCATCGGGCGCACCGTCAACCCACGCAACAAGAACTC-GTCGTGCGGCGG
CAGTTCTGGTGGTGAGGGTGCGATCGTTGGGATTCGTGGTGGCGTCATCGGTGTAGGAACGGAT
ATCGGTGGCTCGATTCGAGTGCCGGCCGCGTTCAACTTCCTGTACGGTCTAAGGCCGAGTCATG
GGCGGCTGCCGGATGCAAAGATGGCGAACAGCATC-GAGGGTCAGGAGACGGTGCACAGCGTTGT
CGGGCCGATTACGCACTCTGTTGAGGGTGAGTCCTTCGCCTCTTCCTTCTTTTCCTGCTCTATA
CCAGGCCTCCACTGTCCTCCTTTCTTGCTTTTTATACTATATACGAGACCGGCAGTCACTGATG
AAGTATGTTAGACCTCCGCCTCTTCACCAAATCCGTCCTCGGTCAGGAGCCATGGAAATACGAC
TCCAAGGTCATCCCCATGCCCTGGCGCCAGTCCGAGTCGGACATTATTGCCTCCAAGATCAAGA
ACGGCGGGCTCAATATCGGCTACTACAACTTCGACGGCAATGTCCTTCCACACCCTCCTATCCT
GCGCGGCGTGGAAACCACCGTCGCCGCACTCGCCAAAGCCGGTCACACCGTGACCCCGTGGACG
CCATACAAGCACGATTTCGGCCACGATCTCATCTCCCATATCTACGCGGCTGACGGCAGCGCCG
ACGTAATGCGCGATATCAGTGCATCCGGCGAGCCC-GCGATTCCAAATATCAAAGACCTACTGAA
CCCGAACATCAAAGCTGTTAACATGAACGAGCTCTGGGACACGCATCTCCAGAAGTGGAATTAC
CAGATGGAGTACCTTGAGAAATGGCGGGAGGCTGAAGAAAAGGCCGGGAAGGAACTGGACGCCA
TCATCGCGCCGATTACGCCTACCGCTGCGGTACGGCATGACCAGTTCCGGTACTATGGGTATGC
CTCTGTGATCAACCTGCTGGATTTCACGAGCGTGGTTGTTCCGGTTACCTTTGCGGATAAGAAC
ATCGATAAGAAGAATGAGAGTTTCAAGGCGGTTAC-TGAGCTTGATGCCCTCC-TGCAGGAAGAGT
ATGATCCGGAGGCGTACCATGGGGCACCGGTTGCAGTGCAGGTTATCGGACGGAGACTCAGTGA agagaggacgttggcgattgcagaggaagtggggaagttgctgggaaatgtggtgactccatag
CTAATAAGTGTCAGATAGCAATTTGCACAAGAAATCAATACCAGCAACTGTAAATAAGCGCTGA
AGTGACCATGCCATGCTACGAAAGAGCAGAAAAAAACCTGCCGTAGAACCGAAGAGATATGACA
CGCTTCCATCTCTCAAAGGAAGAATCCCTTCAGGGTTGCGTTTCCAGTCTAGAGGCCATTTAGG
CCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACGCTCAA
GTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCT
CGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGA
AGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGC-TCGTTCGCTCCA
AGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTATCG
TCTTGAGTCCAACCCGGTAAGACAGGACTTATCGGCACTGGGAGGAGCGACTGGTAACAGGATT
AGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACA
CTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGG
TAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAG
ATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTC
AGTGGAACGAAAACTCACGTTAAGGCCTGCAGGGCCGATTTTGGTCATGAGATTATCAAAAAGG
ATCTTCACCTAGATCCTTTTAAATTAAAAATGAAC-TTTTAAATCAATCTAAAGTATATATGAGT
AAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATT
TCGTTCATCCA7AGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGG3CTTACCA TCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAA TAAACCACCCACCCGGAACCGCCGAGCCCAGAAGTGCTCCTGCAACTTTATCCGCCTCCATCCA GTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGT7AATAGITTGC3CAACG7T GTTGCCATTGCTACAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTICATTGAGCTCCG GTTCCCAACGATCAAGGCGAGTTAGATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGGTCCTT CGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCC-CAGTGTTATCACTCATGGTTATGGCAGCA CTGCATAATTCTCTTACTGTCATGCCATCCGTAAC-ATGCTTTTCTGTGACTGGTGAGTACTCAA CCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAATACGGGA TAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATGATTGGAAAACGTTCTTGGGGGCGA AAACTCTCAAGGATCTTACCGCTGTTGAGATCCAC-TTCGATGTAACCCACTCGTGCACCCAACT GATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGT3AGCAAAAACAGGAAGGGAAAATGC CGCAAAAAAGGGAAT.AAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATAT TATTGAAGCAT7TATCAGGGTTATTGTC7CATGGCCATTTAGGCC7CTAGAGTTGTSAAGTCGG TAATCCCGCTG7ATAGTAATACGAGTCGCATCTAAATACTCCGAAGCTGCTGCGAACCCGGAGA ATCGAGATGTGCTGGAAAGCTTCTAGCGAGCGGCTAAATTAGCATGAAAGGCTATGAGAAAT7C IGGAGACGGCT7GTTGAATCATGGCGTTCCATTCTTCGACAAGCAAAGCGTICCGTCGCAGTAG CAGGCACTCATTCCCGAAAAAACTCGGAGATTCCTAAGTAGCGATGGAACCGGAATAATATAAT AGGCAATACAT7GAGTTGCCTCGACGGT7GCAATGCAGGGGTACTGAGCTTGGACATAACTG7T CCGTACCCCACCTCTTCTCAACCTTTGGCGTTTCCCTGATTCAGCGTACCCGTACAAGTCGTAA TCACTATTAACCCAGACTGACCGGACGTGTTTTGCCCTTGATTTGGAGAAATAATGTCATTGCG ATGTGTAATTTGCCTGCTTGACCGACTGGGGCTGTTCGAAGCCCGAATGTAGGATTGTTATCCG AACTCTGCTCCtTAGACtGCATGTTGTGAATCTGTGTCGGGGAGGACACCtCCTCGAAGGTTCACGG CAAGGGAAACCACCGATAGCAGTGTCTAGTAGCAACCTGTAAAGCCGCAATGCAGCATCACTGG AAAATACAAACCAATGGCTAAAAGTACA7AAGTTAATGCCTAAAGAAGTCAIATACCAGCGGCT AATAATTGTACAATCAAGTGGCTAAACGTACCGTAATTT3CCAACGGCTTGTGGGGTTGCAGAA GCAACGGCAAAGCCCCACTTCCCCACGTTTGTTTCTTCACTCAGTCCAATCTCAGCTGGTGATC CCCCAATTGGGTCGCTTGTTTGTTCCGGTGAAGTGAAAGAAGACAGAGGTAAGAAT3TCTGACT CGGAGCGTTTTGCATACAACCAAGGGCAGTGATGGAAGACAGTGAAATGTTGACATTCAAGGAG TATTTAGCCAGGGATGCTTGAGTGTATCGTGTAAC-GAGGTTTGTCTGCCGATACGAGGAATACT 
GTATAGTCACT7CTGATGAAGTGGTCCA7ATTGAAATGTAAAGTCGGCACTGAACAGGCAAAAG ATTGAGTTGAAACTGCCTAAGATCTCGGGCCCTCGGGCCTTCGGCCTTTGGGTGTACATGTT7G TGCTCCGGGCAAATGCAAAGTGTGGTAGGATCGAACACAGTGCTGCCTTTACCAAGGAGCTGAG GGTATGTGATAGGCAAATGTTCAGGGGCCACTGCATGGTTTCGAA7AGAAAGAGAAGCTTAGCC AAGAACAATAGCCGATAAAGATAGCCTCATTAAACGGAATGAGCTAGTAGGCAAAGTCAGCGAA TGTGTATATATAAAGGTTCGAGGTCCGTGCCTCCCTCAT3CTCTCCCCATCTACTCATCAAC7C AGATCCTCCAGGAGACTTGTACACCATC7TTTGAC-GCACAGAAACCCAATAGTCAAGCATCACA AGTTTGTACAAAAAAGCAGGCTCCGCGGCCGCCCCCTTCACCATGCGCTTCICTTG3CTATTGT GCCCCCTTCTAGCGATGGGAAGTGCTCT7CCTGAAACGAAGACGGATGTTTCGACATACACCAA CCCTGTCCTTCCAGGATGGCACTCGGATCCATCGTGTATCCAGAAAGATGGCCTCTTTCTCTGC GTCACTTCAACATTCATCTCCTTCCCAGGTCTTCCCGTCTATGCC7CAAGGGATCTAGTCAACT GGCGTCTCATCAGCCATGTCTGGAACCGCGAGAAACAGTTGCCTGGCATTAGCTGGAAGACGGC AGGACAGCAACAGGGAATGTATGCACCAACCATTCGATAGCACAAGGGAACATACTACGTCA7C TGCGAATACCTGGGCGTTGGAGATATTA7TGGTGTCATCTTCAAGACCACCAATCCGTGGGACG AGAGTAGCTGGAGTGACGCTGTTAGGTTCAAGGGAAATCAGATGGACCCGGATCTGTTGTGGGA TGATGACGGAAAGGTTTATTGTGCTACCCATGGCATCACTCTGCAGGAGATTGATTTGGAAACT GGAGAGCTTAGCCCGGAGCTTAATATCTGGAACGGCACAGGAGGTGTATGGCCTGAGGGTCCCC ATATCTACAAGCGCGACGGTTACTACTA7CTCATGATTGCCGAGGGTGGAACTGCC3AAGACCA CGCTATCACAA7CGCTCGGGCCCGCAAGATCACCGGCCCCTATGAAGCCTACAATAACAACCCA ATCTTGACCAACCGCGGGACATCTGAGTACTTCCAGACT3TCGGTCACGGTGATCT3TTCCAAG ATACCAAGGGCAACTGGTGGGGTCTTTG7CTTGCTACTCGCATCACAGCACAGGGAGTTTCACC CATGGGCCGTGAAGCTGTTTTGTTCAATGGCACATGGAACAAGGGCGAATGGCCCAAGTTGCAA
COAGTAOGAGGTOGCATGCOTGGAAAOOTCCTOCCAAAGCCGAOGCGAAAOr-TTCOCGGAGATG
GGCCCTTCAACGCTGACCCAGACAACTACAACTTGAAGAAGACTAAGAAGATCCCTCCTCACTT
TGTGCACCATAGAGTCCCAAGAGACGGTGCCTTCTCTTTGTCTTCCAAGGGTCTGCACATCGTG
CCTAGTCGAAACAACGTTACCGGTAGTGTGTTGCCAGGAGATGAGATTGAGCTATCAGGACAGC
GAGGTCTAGCT7TCATCGGACGCCGCCAAACTCACACTCTGTTCAAATATAGTGTTGATATCGA
CTTCAAGCCCAAGTCCGATGATCAGGAAGCTGGAATCACCGTTTTCCGCACGCAGTTCGACCAT
ATCGATCTTGGCATTGTTCGTCTTCCTACAAACCAAGGCAGCAACAAGAAATCTAAGCTTGCCT
TCCGATTCCGGGCCACAGGAGCTCAGAATGTTCCTGCACCGAAGGTAGTACCGGTCCCCGATGG
CTGGGAGAAGGGCGTAATCAGTCT^CATATCGAGC-CAGCCAACGCGACGCACTACAACCTTGGA
GCTTCGAGCCACAGAGGCAAGACTCTCGACATCGCGACAGCATCAGCAAGTCTTGTGAGTGGAG
GCACGGGTTCATTTGTTGGTAGTTTGCTTGCtACCTTATGCTACCTGCAACGGCAAAGGATCTGG
AGTGGAATGTCCCAAGGGAGGTGATGTCTATGTGACCCAATGGACTTATAAGCCCGTGGCACAA
GAGATTGATCA7GGTGTTTTTGTGAAATCAGAATTGTAGAAGGGTGGGCGCGCCGACCCAGCTT
TC
[0118] The resulting Fv43C expression vector, pTrex3gM-Fv43C, is shown schematically in Figure 12.
[0119] The full nucleotide sequence of the Fv43C glycosyl hydrolase in expression vector pTrex3gM is provided below as SEQ ID NO:15.
TTGTACAAAGTGGTGATCGCGCCGCGCGCCAGCTCCGTGCGAAAGCCTGACC-CACCGGTAGATT
CTTGGTGAGCCCGTATCATGACGGCGGCGGGAGCTACATGGCCCCGGGTGATTTATTTTTTTTG
TATCTACTTCTGACCCTTTTCAAATATACGGTCAACTCATCTTTCACTGGAGATGCGGCCTGCT
TGGTATTGCGATGTTGTCAGCTTGGCAAATTGTGGCTTTCGAAAACACAAAACGATTCCTTAGT
AGCCATGCATTTTAAGATAACGGAATAGAAGAAAGAGGAAATTAAAAAAAAAAAAAAAACAAAC
ATCCCGTTCATAACCCGTAGAATCGCCGCTCTTCGTGTATCCCAGTACCAGTTTATTTTGAATA
GCTCGCCCGCTGGAGAGCATCCTGAATGCAAGTAACAACCGTAGAGGCTGACACGGCAGGTGTT
GCTAGGGAGCGrCGTGTTCTACAAGGCCAGACGTCTTCGCGGTTGATATATATGTATGTTTGAC
TGCAGGCTGCTCAGCGACGACAGTCAAGTTCGCCCTCGCTGCTTGTGCAATAATCGCAGTGGGG
AAGCCACACCGTGACTCCCATCTTTCAGTAAAGCTCTGTTGGTGTTTATCAGCAATACACGTAA
TTTAAACTCGTGAGCATGGGGCTGATAGCTTAATTACCGTTTACCAGTGCCATGGTTCTGCAGC
TTTCCTTGGCCCGTAAAATTCGGCGAAGCCAGCCAATCACCAGCTAGGCACCAGCTAAACCCTA
TAATTAGTCTCGTATCAACACCATCCGCTCCCCCC-GGATCAATGAGGAGAATGAGGGGGATGCG
GGGCTAAAGAAGCCTACATAACCCTCATGCCAACTCCCAGTTTACACTCGTCGAGCCAACATCC
TGACTATAAGCTAACACAGAATGCCTCAATCCTGGGAAGAACTGGCCGCTGATAAGCGCGCCCG
CCTCGCAAAAACCATCCCTGATGAATGGAAAGTCCAGACGCTGCCTGCGGAAGACAGCGTTATT
GATTTCCCAAAGAAATCGGGGATCCTTTCAGAGGCCGAACTGAAGATCACAGAGGCCTCCGCTG
CAGATCTTGTGGCCAAGCTGGCGGCCGGAGAGTTGACCTCGGTGGAAGTTACGCTAGCATTCTG
TAAACGGGCAGCAATCGCCCAGCAGTTAGTAGGGTCCCCTCTACCTCTCAGC-GAGATGTAACAA
CGCCACCTTATGGGACTATCAAGCTGACGCTGGCTTCTGTGCAGACAAACTGCGCCCACGAGTT
CTTCCCTGACGCCGCTCTCGCGCAGGCAAGGGAACTCGATGAATACTACGCAAAGCACAAGAGA
CCCGTTGGTCCACTCCATGGCCTCCCCATCTCTCTCAAAGACCAGCTTCGAGTCAAGGTACACC
GTTGCCCCTAAGTCGTTAGATGTCCCTTTTTGTCAGCTAACATATGCCACCAGGGCTACGAAAC
ATCAATGGGCTACATCTCATGGCTAAACAAGTACGACGAAGGGGACTCGGTTCTGACAACCATG
CTCCGCAAAGCCGGTGCCGTCTTCTACGTCAAGACCTCTGTCCCGCAGACCCTGATGGTCTGCG
AGACAGTCAACAACATCATCGGGCGCACCGTCAACCCACGCAACAAGAACTC-GTCGTGCGGCGG
CAGTTCTGGTGGTGAGGGTGCGATCGTTGGGATTCGTGGTGGCGTCATCGGTGTAGGAACGGAT
ATCGGTGGCTCGATTCGAGTGCCGGCCGCGTTCAACTTCCTGTACGGTCTAAGGCCGAGTCATG
GGCGGCTGCCGTATGCAAAGATGGCGAACAGCATGGAGGGTCAGGAGACGGTGCACAGCGTTGT
CGGGCCGATTACGCACTCTGTTGAGGGTGAGTCCTTCGCCTCTTCCTTCTTTTCCTGCTCTATA
CCAGGCCTCCACTGTCCTCCTTTCTTGCTTTTTATACTATATACGAGACCGGCAGTCACTGATG
AAGTATGTTAGACCTCCGCCTCTTCACCAAATCCGTCCTCGGTCAGGAGCCATGGAAATACGAC
TCCAAGCTCATCCCCATCCCCTCCCCCCAGTCCCACTCGGACATTATTGCCTCCAAGATCAACA
ACGGCGGGCTCAATATCGGCTACTACAACTTCGACGGCAATGTCCTTCCACACCCTCCTATCCT
GCGCGGCGTGGAAACCACCGTCGCCGCACTCGCCAAAGCCGGTCACACCGTGACCCCGTGGACG
COATACAAGOAOGATTTCGGCCACGATCTCATCTCOCATATCTACGCGGCTGACGGCAGCGCOG
ACGTAATGCGCGATATCAGTGCATCCGGCGAGCCC-GCGATTCCAAATATCAAAGACCTACTGAA
CCCGAACATCAAAGCTGTTAACATGAACGAGCTCTGGGACACGCATCTCCAC-AAGTGGAATTAC
CAGATGGAGTACCTTGAGAAATGGCGGGAGGCTGAAGAAAAGGCCGGGAAGGAACTGGACGCCA
TCATCGCGCCGATTACGCCTACCGCTGCGGTACGGCATGACCAGTTCCGGTACTATGGGTATGC
CTCTGTGATCAACCTGCTGGATTTCACGAGCGTGC-TTGTTCCGGTTACCTTTGCGGATAAGAAC
ATCGATAAGAAGAATGAGAGTTTCAAGGCGGTTAC-TGAGCTTGATGCCCTCC-TGCAGGAAGAGT
ATGATCCGGAGGCGTACCATGGGGCACCGGTTGCAGTGCAGGTTATCGGACGGAGACTCAGTGA
AGAGAGGACGT7GGCGATTGCAGAGGAAGTGGGGAAGTTGCTGGGAAATGTC-GTGACTCCATAG
CTAATAAGTGTCAGATAGCAATTTGCACAAGAAATCAATACCAGCAACTGTAAATAAGCGCTGA
AGTGACCATGCCATGCTACGAAAGAGCAGAAAAAAACCTGCCGTAGAACCGAAGAGATATGACA
CGCTTCCATCTCTCAAAGGAAGAATCCCTTCAGGGTTGCGTTTCCAGTCTAGAGGCCATTTAGG
CCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACGCTCAA
GTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCT
CGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGA
AGCGTGGCGCTGTCTCATAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGC-TCGTTCGCTCCA
AGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTATCG
TCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATT
AGCAGAGCGACtGTATGTAGGCGGTGCTACACtAGTTCTTGAAGTGGTGGCCTAACTACGGCTACA
CTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGG tagctcttgatccggcaaacaaaccaccgctggtagcggtggtttttttgtttgcaagcagcag
ATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTC
AGTGGAACGAAAACTCACGTTAAGGCCTGCAGGGCCGATTTTGGTCATGAGATTATCAAAAAGG
ATCTTCACCTAGATCCTTTTAAATTAAAAATGAAC-TTTTAAATCAATCTAAAGTATATATGAGT
AAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATT
TCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCA
TCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAA
TAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCA
GTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTT
GTTGCCATTGCTACAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCG
GTTCCCAACGA7CAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTT
CGGTCCTCCGA7CGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCA
CTGCATAATTCGCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTC-GTGAGTACTCAA
CCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAATACGGGA
TAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGA
AAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACT
GATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGC
CGCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATAT
TATTGAAGCAT7TATCAGGGTTATTGTCTCATGGCCATTTAGGCCTCTAGAGTTGTGAAGTCGG
TAATCCCGCTGTATAGTAATACGAGTCGCATCTAAATACTCGGAAGCTGCTGCGAACCCGGAGA
ATCGAGATGTGCTGGAAAGCTTCTAGCGAGCGGCTAAATTAGCATGAAAGGCTATGAGAAATTC
TGGAGACGGCT7GTTGAATCATGGCGTTCCATTCTTCGACAAGCAAAGCGTTCCGTCGCAGTAG
CAGGCACTCAT7CCCGAAAAAACTCGGAGATTCCTAAGTAGCGATGGAACCC-GAATAATATAAT
AGGCAATACAT7GAGTTGCCTCGACGGTTGCAATGCAGGGGTACTGAGCTTGGACATAACTGTT
CCGTACCCCACCTCTTCTCAACCTTTGGCGTTTCCCTGATTCAGCGTACCCGTACAAGTCGTAA
TCACTATTAACCCAGACTGACCGGACGTGTTTTGCCCTTCATTTGGAGAAATAATGTCATTGCG
ATGTGTAATTTGCCTGCTTGACCGACTGGGGCTGTTCGAAGCCCGAATGTAGGATTGTTATCCG
AAC7C7GC7CG7AGAGGCA7G77G7GAA7C7G7G7CGGGCAGGACACGCC7CGAAGG77CACGG CAAGGGAAACCACCGATAGCAGTGTCTAGTAGCAACCTGTAAAGCCGCAATGCAGCATCACTGG AAAATACAAACCAATGGCTAAAAGTACATAAGTTAATGCCTAAAGAAGTCATATACCAGCGGCT AATAATTGTACAATCAAGTGGCTAAACGTACCGTAATTTGCCAACGGCTTGTGGGGTTGCAGAA GCAACGGCAAAGCCCCACTTCCCCACGTTTGTTTCTTCACTCAGTCCAATCTCAGCTGGTGATC CCCCAATTGGG7CGCTTGTTTGTTCCGGTGAAGTGAAAGAAGACAGAGGTAAGAATGTCTGACT CGGAGCGTTTTGCATACAACCAAGGGCAGTGATGGAAGACAGTGAAATGTTC-ACATTCAAGGAG TATTTAGCCAGGGATGCTTGAGTGTATCGTGTAAC-GAGGTTTGTCTGCCGATACGACGAATACT GTATAGTCACTTCTGATGAAGTGGTCCATATTGAAATGTAAAGTCGGCACTGAACAGGCAAAAG ATTGAGTTGAAACTGCCTAAGATCTCGGGCCCTCGGGCCTTCGGCCTTTGGGTGTACATGTTTG TGCTCCGGGCAAATGCAAAGTGTGGTAGGATCGAACACACTGCTGCCTTTACCAAGCAGCTGAG GGTATGTGATAGGCAAATGTTCAGGGGCCACTGCATGGTTTCGAATAGAAAGAGAAGCTTAGCC AAGAACAATAGCCGATAAAGATAGCCTCATTAAACGGAATGAGCTAGTAGGCAAAGTCAGCGAA TGTGTATATATAAAGGTTCGAGGTCCGTGCCTCCCTCATGCTCTCCCCATCTACTCATCAACTC AGATCCTCCAGGAGACTTGTACACCATCTTTTGAGGCACAGAAACCCAATAGTCAACCATCACA AGTTTGTACAAAAAAGCAGGCTCCGCGGCCGCCCCCTTCACCATGCGTCTTCTATCGTTTCCCA GCCATCTCCTCGTGGCCTTCCTAACCCTCAAAGAGGCTTCATCCCTCGCCCTCAGCAAACGGGA TAGCCCTGTCCECCCCGGCCTCTGGGCGGACCCCAACATCGCCATCGTCGACAAGACATACTAC ATCTTCCCTACCACCGACGGTTTCGAAGGCTGGGGCGGCAACGTCTTCTACTGGTGGAAATCAA AAGATCTCGTAECATGGACAAAGAGCGACAAGCCATTCCTTACTCTCAATGGTACGAATGGCAA CGTTCCCTGGGCTACAGGTAATGCCTGGGCTCCTC-CTTTCGCTGCTCGCGGAGGCAAGTATTAC TTCTACCATAG^GGGAATAATCCCTCTGTGAGTGATGGGCATAAGAGTATTC-GTGCGGCGGTGG CTGATCATCCTGAGGGGCCGTGGAAGGCACAGGATAAGCCGATGATCAAGGGAACTTCTGATGA GGAGATTGTCAGCAACCAGGCTATCGATCCCGCTC-CCTTTGAAGACCCTGAC-ACTGGAAAGTGG TATATCTACTGGGGAAACGGTGTCCCCATTGTCGCAGAGCTCAACGACGACATGGTCTCTCTCA AAGCAGGCTGGCACAAAATCACAGGTCTTCAGAATTTCCGCGAGGGTCTTTTCGTCAACTATCG CGATGGAACATATCATCTGACATACTCTATCGACGATACGGGCTCAGAGAACTATCGCGTTGGG TACGCTACGGCGGATAACCCCATTGGACCTTGGACATATCGTGGTGTTCTTCTGGAGAAGGACG AATCGAAGGGCATTCTTGCTACGGGACATAACTCCATCATCAACATTCCTGGAACGGATGAGTG GTATATCGCGTATCATCGCTTCCATATTCCCGATGGAAATGGGTATΑΆΤAGGGAGACTACGATT 
GATAGGGTACCCATCGACAAGGATACGGGTTTGTTTGGAAAGGTTACGCCGACTTTGCAGAGTG TTGATCCTAGGCCTTTGTAGAAGGGTGGGCGCGCCGACCCAGCTTTC
[0120] The nucleotide sequence for Fv43B, a GH43 family enzyme from Fusarium verticillioides, is provided below as SEQ ID NO: 16:
A7GCGC77C7C77GGC7A77G7GCCCCC77C7AGCGA7GGGAAG7GC7C77CC7GAAACGAAGA CGGATGTTTCGACATACACCAACCCTGTCCTTCCAGGATGGCACTCGGATCCATCGTGTATCCA GAAAGATGGCC7CTTTCTCTGCGTCACTTCAACATTCATCTCCTTCCCAGGTCTTCCCGTCTAT CCCTCAACCCA7CTACTCAACTCCCCTCTCATCACCCATCTCTCCAACCCCCACAAACACTTCC CTGGCATTAGCEGGAAGACGGCAGGACAGCAACAGGGAATGTATGCACCAACCATTCGATACCA CAAGGGAACATACTACGTCATCTGCGAATACCTGGGCGTTGGAGATATTATTGGTGTCATCTTC AAGACCACCAA^CCGTGGGACGAGAGTAGCTGGAGTGACCCTGTTACCTTCAAGCCAAATCACA TCGACCCCGATCTGTTCTGGGATGATGACGGAAAGGTTTATTGTGCTACCCATGGCATCACTCT GCAGGAGA7 7GA77 7GGAAAC7GGAGAGC 77AGC CCGGAGCT7AA7A7C T GGAACGGCACAGGA GGTGTATGGCCTGAGGGTCCCCATATCTACAAGCGCGACGGTTACTACTATCTCATGATTGCCG AGGG7GGAACTGCCGAAGACCACGC7A7CACAATCGC7CGGGCCCGCAAGATCACCGGCCCC7A 7GAAGCC7ACAA7AACAACCCAA7C77GACCAACCGCGGGACA7C7GAG7AC77CCAGAC7G7C GGTCACGGTGAECTGTTCCAAGATACCAAGGGCAACTGGTGGGGTCTTTGTCTTGCTACTCGCA TCACAGCACAGGGAGTTTCACCCATGGGCCGTGAAGCTGTTTTGTTCAATGC-CACATGGAACAA GGGCGAATGGCCCAAG7TGCAACCAGTACGAGGTCGCATGCCTGGAAACC7CCTCCCAAAGCCG ACGCGAAACGTTCCCGGAGATGGGCCCTTCAACGCTGACCCAGACAACTACAACTTGAAGAAGA
C7AAGAAGA7CCCTCC7CAC777G7GCACCA7AGAG7CCCAAGAGACGGTGCC77C7C777GTC
TTCCAAGGGTC7GCACATCGTGCCTAGTCGAAACAACGTTACCGGTAGTGTGTTGCCAGGAGAT
GAGATTGAGCTATCAGGACAGCGAGGTCTAGCTTTCATCGGACGCCGCCAAACTCACACTCTGT
TCAAATATAGTGTTGATATCGACTTCAAGCCCAAGTCCGATGATCAGGAAGCTGGAATCACCGT
TTTCCGCACGCAGTTCGACCATATCGATCTTGGCATTGTTCGTCTTCCTACAAACCAAGGCAGC
AACAAGAAA7C7AAGC77GCCT7CCGA77CCGGGCCACAGGAGCTCAGAA7G77CCTGCACCGA
AGGTAGTACCGGTCCCCGATGGCTGGGAGAAGGGCGTAATCAGTCTACATATCGAGGCAGCCAA
CGCGACGCACTACAACCTTGGAGCTTCGAGCCACAGAGGCAAGACTCTCGACATCGCGACAGCA
TCAGCAAG7CT7G7GAG7GGAGGCACGGGTTCAT7TG7TGGTAGT7TGCT7GGACCTTA7GC7A
CC7GCAACGGCAAAGGA7C7GGAG7GGAATGTCCCAAGGGAGG7GATGTC7A7GTGACCCAA7G
GAG77A7AAGGCGG7GGGACAAGAGATI'GA'1'GATGGTGTTI'I'I'GTGAAAI'CAGAA7TGI'AG
[0121] The protein sequence of Fv43B is provided below as SEQ ID NO: 17:
KRESWLLCPELAMGSALPETKTDVSTYTNPVLPGWHSDPSCIQKDGLFLCVTSTFISFPGLPVY ASRDLVNWRLISHVWNREKQLPGISWK7AGQQQGMYAP7IRYHKG7YYVICEYLGVGDIIGVIE KTTKPWDESSWSDPVTFKPKHIDPDLFWDDDGKVYCATHGITLQEIDLETGELSPELNIWKGTG GVWPEGPHIYKRDGYYYLMIAEGGTAEDHAITIARARKITGPYEAYNKNPILTNRGTSEYFQTV GHGDLFQDTKGNWWGLCIATRITAQGVSPMGREAVLFNGTWNKGEWPKLOPVRGRMPGNLLPKP TRNVPGDGPFNADPDNYNLKKTKKIPPHFVHHRVPRDGxAFSLSSKGLHIVPSRNNVTGSVLPGD EIELSGQRGLAFIGRRQTH7LFKYSVDIDFKPKSEDQEAGITVFRTQFDHIELGIVRLPTNQGS NKKSKLAFRFRATGAQNVPAPKWPVPDGWEKGVISLHIEAANATHYNLGASSHRGKTLDIATA SASLVSGGTGSFVGSLLGPYATCNGKGSGVECPKGGDVYVTQWTYKPVAQEIDHGVFVKSEL
[0122] The nucleotide sequence for Fv43C, a GH43 family enzyme from Fusarium verticillioides, is provided below as SEQ ID NO: 18:
ATGCGTCTTCTATCGTTTCCCAGCCATCTCCTCGTGGCCTTCCTAACCCTCAAAGAGGCTTCAT
CCCTCGCCCTCAGCAAACGGGATAGCCCTGTCCTCCCCGGCCTCTGGGCGGACCCCAACATCGC
CATCGTCGACAAGACATACTACATCTTCCCTACCACCGACGGTTTCGAAGGCTGGGGCGGCAAC
GTCTTCTACTGGTGGAAATCAAAAGATCTCGTATCATGGACAAAGAGCGACAAGCCATTCCTTA
CTCTCAATGGTACGAATGGCAACGTTCCCTGGGCTACAGGTAATGCCTGGGCTCCTGCTTTCGC
TGCTCGCGGAGGCAAGTATTACTTCTACCATAGTC-GGAATAATCCCTCTGTC-AGTGATGGGCAT
AAGAGTATTGGTGCGGCGGTGGCTGATCATCCTGAGGGGCCGTGGAAGGCACAGGATAAGCCGA
TGATCAAGGGAACTTCTGATGAGGAGATTGTCAGCAACCAGGCTATCGATCCCGCTGCCTTTGA
AGACCCTGAGACTGGAAAGTGGTATATCTACTGGC-GAAACGGTGTCCCCATTGTCGCAGAGCTC
AACGACGACATGGTCTCTCTCAAAGCAGGCTGGCACAAAATCACAGGTCTTCAGAATTTCCGCG
AGGGTCTTTTCGTCAACTATCGCGATGGAACATATCATCTGACATACTCTATCGACGATACGGG
CTCAGAGAACTATCGCGTTGGGTACGCTACGGCGGATAACCCCATTGGACCTTGGACATATCGT
GGTGTTCTTCTGGAGAAGGACGAATCGAAGGGCATTCTTGCTACGGGACATAACTCCATCATCA
ACATTCCTGGAACGGATGAGTGGTATATCGCGTATCATCGCTTCCATATTCCCGATGGAAATGG
GTATAATAGGGAGACTACGATTGATAGGGTACCCATCGACAAGGATACGGGTTTGTTTGGAAAG
GTTACGCCGAC7TTGCAGAGTGTTGATCCTAGGCCTTTGTAG
[0123] The protein sequence for Fv43C is provided below as SEQ ID NO: 19:
KRLLSFP 5HLLVAFLTLKEASSLALSKRDSPVLPGLWADPNIAIVDKTYYIFPTTDGFEGWGGN VFYWWKSKDGVSWTKSDKPFLTLNGTKGNVPWATGNAWAPAFAARGGKYYFYHSGKNP3VSDGH KSIGAAVADHPEGPWKAQDKPMIKGTSDEEIVSNQAIDPAAFEDPETGKWYIYWGNGVPIVAEL NDULVIVSLKAGWHKI TGLQEii'REGLEVJv YRDGT YH.LT YSIDUI'GSEK YRVG YA.TADKP IGPW'l'YR GVLLEKDESKGILATGHNSIINIPGTDEWYIAYHRFHIPDGNGYNRETTIDRVPIDKDTGLFGK VTPTLQSVDPRPL
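The relationship between a coding nucleotide sequence and its protein sequence (for example SEQ ID NO: 18 and SEQ ID NO: 19) can be illustrated with a minimal Python sketch. It assumes the standard genetic code and an intron-free reading frame; the helper name `translate` is hypothetical, and only the first 30 nucleotides of SEQ ID NO: 18 as printed above are used. Codons containing unreadable characters are reported as `X` rather than guessed.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding DNA sequence in frame 1, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE.get(dna[pos:pos + 3], "X")  # "X" marks an unreadable codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nucleotides of the Fv43C coding sequence (SEQ ID NO: 18) as printed above.
fv43c_start = "ATGCGTCTTCTATCGTTTCCCAGCCATCTC"
print(translate(fv43c_start))  # MRLLSFPSHL
```

Under these assumptions the translated residues begin with methionine, which is what a full-length protein sequence encoded from an ATG start codon would be expected to show.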
[0124] The pTrex3gM-Fv43B and pTrex3gM-Fv43C vectors were each independently transformed into the MAD6 strain by PEG mediated protoplast fusion and into the quad deleted strain by particle bombardment.
2. Transformation of the quad deleted T. reesei strain
[0125] The vector pTrex3gM-Fv43B and the vector pTrex3gM-Fv43C were transformed independently into the T. reesei quad deleted strain using biolistic particle bombardment by the PDS-1000/Helium System (Biorad, Hercules, CA) according to the manufacturer's instructions and as described in U.S. patent application publication US 2006/0003408, Example 2.
3. SDS-PAGE of T. reesei quad deleted clones transformed with fv43B and fv43C
[0126] Stable transformants were grown in 96-well microtiter plates as described in PCT publication WO 2011/038019. Culture supernatant was run on SDS-PAGE followed by Coomassie blue staining with Simply Blue stain (Invitrogen). The gel was scanned and analyzed by densitometry. Image processing and band intensity quantitation were done using ImageJ (National Institutes of Health), employing the Analyze Gel submenu function as described in subsection 27.13 of the user guide. The band corresponding to the Fv43B protein was quantified and reported as a percentage of the total protein. Figure 13 provides a picture of the SDS-PAGE of proteins expressed from T. reesei quad deleted clones transformed with fv43B.
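The "percentage of total protein" figure used in this densitometry analysis can be sketched in Python as follows. This is only an illustration of the arithmetic, not the ImageJ procedure itself: the function name, the band labels and the peak areas are hypothetical, and the sketch assumes that each lane has already been reduced to a set of integrated band areas. A protein that runs as more than one band is handled by summing its bands before dividing by the lane total.

```python
def percent_of_total(lane_band_areas, target_bands):
    """Return the summed target band area as a percentage of all band area in the lane."""
    total = sum(lane_band_areas.values())
    if total <= 0:
        raise ValueError("lane has no measurable signal")
    target = sum(lane_band_areas[name] for name in target_bands)
    return 100.0 * target / total


# Hypothetical integrated peak areas (arbitrary units) for one gel lane.
lane = {
    "target_upper": 3000.0,
    "target_lower": 1000.0,
    "background_1": 2500.0,
    "background_2": 1500.0,
}

# Protein of interest running as two bands: both areas are summed.
print(percent_of_total(lane, ["target_upper", "target_lower"]))  # 50.0
```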
[0127] The bands corresponding to the Fv43C protein were quantified and reported as a percentage of the total protein. Fv43C ran as two bands on the gel, representing different glycoforms, and these were summed in the densitometry analysis. The SDS-PAGE of proteins expressed from T. reesei quad deleted clones transformed with fv43C is shown in Figure 14.
4. PEG Mediated Protoplast Fusion Transformation of the MAD6 T. reesei strain
[0128] The expression cassette portions of vectors pTrex3gM-Fv43B and pTrex3gM-Fv43C were each amplified by PCR using primers 1061F and 1085R to generate linear DNA fragments of 5.1 kb and 4.4 kb, respectively, which were used for PEG mediated protoplast fusion transformation (see, e.g., Pentilla, M., et al. (1987) Gene 61(2):155-164) of the MAD6 strain.
1061F: 5'-GACCGGACGTGTTTTGCCCTTCAT-3' (SEQ ID NO:20)
1085R: 5'-GTGTGACCGGCTTTGGCGAGTG-3' (SEQ ID NO:21)
[0129] To make protoplasts, Lysing Enzymes from Trichoderma harzianum (Sigma catalog #L1412) were used at 10 mg/mL. After incubation with the transforming DNA and PEG, protoplasts were added to cooled molten sorbitol/acetamide agar with 0.5% uridine. The plates were incubated at 30°C. After 24 hours, an equal volume of the same medium supplemented with 0.5% uridine and 1.2 g/L 5-fluoroorotic acid (FOA) was added to the plates in the form of an overlay. The plates were incubated at 30°C for a week. The molten sorbitol/acetamide agar was prepared using the following recipe:
Sorbitol/acetamide agar
[0130]
PART I
Bring to 300 mL with milliQ H2O
PART II
Autoclave Part I and Part II separately, then combine.
1000x Salts (per L)
Filter sterilize (0.22 micron)
5. SDS-PAGE of T. reesei MAD6 clones transformed with fv43B or fv43C
[0131] Three transformants of Fv43B and two transformants of Fv43C were grown in 96-well microtiter plates as described in PCT publication WO 2011/038019. Culture supernatant was run on SDS-PAGE followed by Coomassie blue staining with Simply Blue stain (Invitrogen). The gel was scanned and analyzed by densitometry. Image processing and band intensity quantitation were done using ImageJ (National Institutes of Health), employing the Analyze Gel submenu function as described in subsection 27.13 of the user guide. Figure 15 shows SDS-PAGE of T. reesei MAD6 clones transformed with fv43B and fv43C. The bands corresponding to the Fv43C protein were quantified and reported as a percentage of the total protein. Fv43C protein ran as two bands on the gel, representing different glycoforms, and these were summed in the densitometry analysis.
6. Quantitative measurements of amounts of proteins expressed
[0132] The amounts of the proteins expressed by the quad deleted strain were compared with those achieved by the MAD6 strain. As described above, the relevant gels (Figures 13, 14 and 15) were scanned and analyzed by densitometry. Image processing and band intensity quantitation were done using ImageJ (National Institutes of Health), employing the Analyze Gel submenu function as described in subsection 27.13 of the user guide. The bands corresponding to each of the proteins of interest were quantified and reported as a percentage of the total protein. Fv43C ran as two bands on the gel, representing glycoforms, and these were summed in the densitometry analysis. The results of this analysis are summarized below in Table 2-1:
[0133] This comparison indicates that expression using the MAD6 strain was considerably more reliable, with low variability in expression levels (e.g., less than 20% variability). In contrast, with the quad deleted strain, a substantial portion of the transformants failed to express the protein of interest, and the variability of expression was substantial (e.g., greater than 50% variability).
EXAMPLE 3
Generation of Hypocrea jecorina CBH2 DNA Libraries
[0134] The pTTTpyrG-cbh2 plasmid (see, e.g., PCT publication WO 2010/141779) containing the Hypocrea jecorina CBH2 protein encoding sequence was used as the reference sequence for the production of a DNA library encoding CBH2 variant enzymes. SEQ ID NO:7 sets forth the reference Hypocrea jecorina CBH2 coding DNA sequence:
ATGATTGTCGGCATTCTCACCACGCTGGCTACGCTGGCCACACTCGCAGCTAGTGTGCCTCTAG
AGGAGCGGCAAGCTTGCTCAAGCGTCTGGGGCCAATGTGGTGGCCAGAATTGGTCGGGICCGAC
ITGCTGTGCTTCCGGAAC-CACATGCGTCTACTCCAACGACTATTACTCCCAGTGTCTTCCCGGC
GCTGCAAGCTCAAGCTCGTCCACGCGCGCCGCGTCGACGACTTCTCGAGTATCCCCCACAACAT
CCCGGTCGAGCTCCGCGACGCCTCCACCTGGTTCTACTACTACCAGAGTACCTCCAGTCGGATC
GGGAACCGCTACGTATTCAGGCAACCCTTTTGTTGGGGTCACTCCTTGGGCCAATGCAIATTAC
GCCTCTGAAGT7AGCAGCCTCGCTATTCCTAGCTTGACTGGAGCCATGGCCACTGCTGCAGCAG CTGTCGCAAAGGTTCCCTCTTTTATGTGGCTAGATACTCITGACAAGACCCCTCTCATGGAGCA AACCTTGGCCGACATCCCCACCGCCAACAAGAATCGCGGTAACTAGGCCGGACAGTTTGTGGGG TATGACTTGCCGGATCGCGATTGCGCTGCCCTTGCCTCGAATGGCGAATACTCTATTGCCGA7G GTGGCGTCGCCAAATATAAGAACTATATCGACACCATTCGTCAAAGTGTCGIGGAATATTCCGA TATCCGGACCC7CCTGGTTATTGAGCCTGACTCTCTTGCCAACCTGGTGACCAACCTCGGTACT CCAAAGTGTGCCAATGCTCAGTCAGCCTACCTTGAGTGCATCAACGACGCCGTCACACAGCTGA ACCTTCCAAATGTTGCGATGTATTTGGACGCTGGCCATGGAGGATGGCTTGGCTGGCCGGCAAA CCAAGACCCGGCCGCTCAGCTATTTGCAAATGTTTACAAGAATGCATCGTCTCCGAGAGCTCTT CGCGGATTGGCAACCAATGTCGCCAACTACAACGGGTGGAACATTACCAGCCCCCCATCGTACA CGCAAGGCAACGCTGTCTACAACGAGAAGCTGTACATCCACGCTA7TGGACCTCTTCTTGCCAA TCACGGCTGGTCCAACGCCTTCTTCATCACTGATCAAGGTCGATCGGGAAAGCAGCCTACCGGA CAGCAACAGTGGGGAGACTGGTGCAATG7GATCGGCACCGGATTTGGTATTCGCCCATCCGCAA ACACTGGGGACTCGTTGCTGGATTCGTTTGTCTGGGTCAAGCCAGGCGGCGAGTGTGACGGCAC CAGCGACAGCAGTGCGCCACGATTTGACTCCCACTGTGCGCTCCCAGATGCCTTGCAACCGGCG CCTCAAGCTGGTGCTTGGTTCCAAGCCTACTTTGTGCAGCTTCTCACAAACGCAAACCCATCGT TCCTGTAA. SEQ ID NO:8 is the Hypocrea jecorina CBH2 full length protein sequence:
KIVGILTTLATLATLAASVPLEERQACSSVWGQCGGQKWSGPTCCASGSTCVYSNDYYSQCLPG AASSSSSTRAASTTSRVSPTTSRSSSATPPPGSTTTRVPPVGSGTATYSGKPFVGVTPWAKAYY ASEVSSLAIPSLTGAMATAAAAVAKVPSFMWLDTLDKTPLMEQTLADIRTAN'KNGGNYAGQFW YDLPDRDCAALASKGEYSIADGGVAKYKNYIDTIRQIWEY3DIRTLLVIEPDSLANLVTKLGT PKCAKAQ3AYLECIKYAVTQLNLPNVAMYLDAGHAGWLGWPANQDPAAQLFAKVYKNA3SPRAL RGLATNVANYNGWKITSPP SYTQGKAVYNEKLYIHAIGPLLANHGWSKAFFITDQGRSGKQPTG QQQWGDWCNVIGTGFGIRP SANTGDSELDSFVWVKP GGECDGTSDSSAPRFESHCALPDALQPA PQAGAWFQAYFVQLLTKAKPSFL
SEQ ID NO:9 is the Hypocrea jecorina CBH2 mature protein sequence:
QACSSVWGQCGGQKWSGPTCCASGSTCVYSNDYYSQCLPGAASSSSSTRAASTTSRVSPTTSRS
3SATPPPGSTTERVPPVGSGTATY3GNPFVGVTPWANAYYASEVSSLAIPSLTGAMATAAAAVA
KVPSFMWLDTLDKTPLMEQTLADIRTANKNGGNYAGQFWYDLPDRDCAALASNGEYSIADGGV
AKYKNYIDTIRQIVVEYSDIRTLLVIEPDSLANLVTNLGTPKCANAQSAYLECINYAVTQLNLP
NVAMYLDAGHAGWLGWPANQDPAAQLFANVYKNASSPRALRGLATKVANYNGWNITSPPSYTQG
NAVYKEKLYIHAIGPLLANHGWSNAFFITDQGRSC-KQPTGQQQWGDWCNVIC-TGFGIRPSANTG
DSLLDSFVWVKPGGECDGTSDSGAPRFDSEICALPEALQPAPQAGAWrQAYFVQLLTNANPSFL
[0135] A synthetic CBH2 combinatorial library was prepared by GeneOracle (Mountain View, CA). A number of amino acid residues of CBH2 were substituted with a plurality of other amino acid residues. Table 3-1 lists the substitutions of members of the CBH2 combinatorial library (numbered according to the CBH2 mature amino acid sequence).
[0136] The library was provided as purified PCR products in which primers GACCGGACGTGTTTTGCCCTTCAT (SEQ ID NO:10) and GTGTGACCGGCTTTGGCGAGTG (SEQ ID NO: 11) were used to amplify the cbh2 gene flanked upstream by about 1.1 kb of the cbh1 promoter and downstream by about 1.85 kb of the amdS marker for forced integration in the pyr2 locus of the H. jecorina host strain. A schematic of the homologous recombination of the expression cassette into the screening strain is depicted in Figure 8. The nucleotide sequence of a PCR fragment (partial cbh1 promoter, cbh2 gene, and partial amdS gene) amplified from pTTTpyrG-CBH2 using the primers above, is provided below as SEQ ID NO: 12:
GACCGGACGTGYTTTGCCCTTCATTTGGAGAAATAATGTCATTGCGATGTGIAATTTGCCTGCT
TGACCGACTGGGGCTGTTCGAAGCC.CGAATGTAGGATTGTTATCCGAACTCTGCTCGTAGAGGC
ATGTTGTGAATCTGTGTCGGGCAGGACACGCCTCGAAGGTTCACGGCAAGGGAAACGACCGATA
GCAGTGTCTAGTAGCAACCTGTAAAGCCGCAATGCAGCATCACTG&AAAATACAAACCAATGGC
TAAAAGTACATAAGTTAATGCCTAAAGAAGTCATATACCAGCGGCrAATAATTGTAGAATCAAG
TGGCTAAACGTACCGTAATTTGCCAACGGCTTGTC-GGGTTGCAGAAGCAACGGCAAAGCCCCAC
TTCCCCACGTTFGTTTCTTCACTCAGTCCAATCTCAGCTGGTGATCCCCCAATTGGGTCGCTFG
TTTGTTCCGGTGAAGTGAAAGAAGACAGAGGTAAGAATGTCTGACECGGAGCGTTTTGCATACA
ACCAAGGGCAGTGATGGAAGACAGTGAAATGTTGACATTCAAGGAGTATTTAGCCA3GGATGCT
TGAGTGTATCGFGTAAGGAGGTTTGTCTGCCGATACGACGAATACFGTATAGTCACTTCTGAFG
AAGTGGTCCATATTGAAATGTAAAGTCGGCACTGAACAG3CAAAAGATTGAGTTGAAACTGCCT
AAGATCTCGGGCCCTCGGGCCTTCGGCCETTGGGTGTACATGTTTGTGCTCCGGGCAAATGCAA
AGTGTGGTAGGATCGAACACACTGCTGCCTTTACCAAGCAGCTGAGGGTATGTGATAGGCAAAT
GTTCAGGGGCCACTGCATGGTTTCGAATAGAAAGAGAAGCTTAGCCAAGAACAATA3CCGATAA
AGATAGCCTCAFTAAACGGAATGAGCTAGTAGGCAAAGTCAGCGAATGTGTATATATAAAGGFT
CGAGGTCCGTGCCTCCCTCATGCTCTCCCCATCTACTCATCAACTCAGATCCTCCAGGAGACYT
GTACACCATCTETTGAGGCACAGAAACCCAATAGTCAACCATCACAAGTTTGTACAAAAAAGCA
GGCTCCGCGGCCGCCCCCTTCACCCACCATGATTGTCGGGATTCTCACCACGCTGGCTACGCFG
GCCACACTCGCAGCTAGTGTGCCTCTAGAGGAGCGGCAAGCTTGCYCAAGCGTCTGGTAATTAT
GTGAACCCTCTCAAGAGACCCAAATACTGAGATATGTCAAGGGGCCAATGTGGTGGGCAGAAYT
GGTCGGGTCCGACTTGCTGTGCTTCCGGAAGCACATGCGICTACTCCAACGACTATTACTCCCA
GTGTCTTCCCGGCGCTGCAAGCTCAAGCFCGTCCACGCGGGCCGCGTCGACGACTTCTCGAGFA
TCCCCCACAACATCCCGGTCGAGCTCCGCGACGCCTCCACCTGGTFCTACTACTACCAGAGTAC
CTCCAGTCGGATCGGGAACCGCTACGTATTCAGGCAACCCTTTTGTTGGGGTCACTCCTTGGGC
CAATGCATATTACGCCTCTGAAGTTAGCAGCCTCGCTATTCCTAGCTTGACTGGAGCCATGGCC
ACTGCTGCAGCAGCTGTCGCAZVAGGTTCCCTCTTTTATGTGGCTGFACtGTCCTCCCGGAACCxYA GGCAATCTGTTACTGAAGGCTCATCATTCACTGCAGAGATACTCTTGACAAGACCCCTCTCATG GAGCAAACCTTGGCCGACATCCGCACCGCCAACAAGAATGGCGGTAACTATGCCGGACAGTTTG TCCTCTATCACTTGCCCCATCGCCATTCCGCTCCCCTTGCCTCCAATCGCCAATACTCTATTCC CGATGGTGGCGTCGCCAAATATAAGAACTATATCGACACCATTCGTCAAATTGTCGTGGAATAT TCCGATATCCGGACCCTCCTGGTTATTGGTATGAGTTTAAACACCTGCCTCCCCCCCCCCTTCC CTTCCTTTCCCGCCGGCATCTTGTCGTTGTGCTAACTATTGTTCCCTCTTCCAGAGCCTGACTC TCTTGCCAACCTGGTGACCAACCTCGGTACTCCAAAGTGTGCCAATGCTCAGTCAGCCTACCTT GAGTGCATCAACTACGCCGTCACACAGCTGAACCTTCCAAATGTTGCGATGTATTTGGACGCTG GCCATGCAGGA7GGCTTGGCTGGCCGGCAAACCAAGACCCGGCCGCTCAGCTATTTGCAAATGT TTACAAGAATGCATCGTCTCCGAGAGCTCTTCGCC-GATTGGCAACCAATGTCGCCAACTACAAC GGGTGGAACAT7ACCAGCCCCCCATCGTACACGCAAGGCAACGCTGTCTACAACGAGAAGCTGT ACATCCACGCTATTGGACCTCTTCTTGCCAATCACGGCTGGTCCAACGCCTTCTTCATCACTGA TCAAGGTCGATCGGGAAAGCAGCCTACCGGACAGCAACAGTGGGGAGACTGGTGCAATGTGATC GGCACCGGATT7GGTATTCGCCCATCCGCAAACACTGGGGACTCGTTGCTGC-ATTCGTTTGTCT GGGTCAAGCCAGGCGGCGAGTGTGACGGCACCAGCGACAGCAGTGCGCCACGATTTGACTCCCA CTGTGCGCTCCCAGATGCCTTGCAACCGGCGCCTCAAGCTGGTGCTTGGTTCCAAGCCTACTTT GTGCAGCTTCTCACAAACGCAAACCCATCGTTCCTGTAAAAGGGTGGGCGCC-CCGACCCAGCTT TCTTGTACAAAGTGGTGATCGCGCCGCGCGCCAGCTCCGTGCGAAAGCCTGACGCACCGGTAGA TTCTTGGTGAGCCCGTATCATGACGGCGGCGGGAC-CTACATGGCCCCGGGTC-ATTTATTTTTTT TGTATCTACTTCTGACCCTTTTCAAATATACGGTCAACTCATCTTTCACTGGAGATGCGGCCTG CTTGGTATTGCGATGTTGTCAGCTTGGCAAATTGTGGCTTTCGAAAACACAAAACGATTCCTTA GTAGCCATGCA7TTTAAGATAACGGAATAGAAGAAAGAGGAAATTAAAAAAAAAAAAAAAACAA ACATCCCGTTCATAACCCGTAGAATCGCCGCTCTTCGTGTATCCCAGTACCAGTTTATTTTGAA TAGCTCGCCCGCTGGAGAGCATCCTGAATGCAAGTAACAACCGTAGAGGCTGACACGGCAGGTG TTGCTAGGGAGCGTCGTGTTCTACAAGGCCAGACGTCTTCGCGGTTGATATATATGTATGTTTG ACTGCAGGCTGCTCAGCGACGACAGTCAAGTTCGCCCTCGCTGCTTGTGCAATAATCGCAGTGG GGAAGCCACACCGTGACTCCCATCTTTCAGTAAAGCTCTGTTGGTGTTTATCAGCAATACACGT AATTTAAACTCGTTAGCATGGGGCTGATAGCTTAATTACCGTTTACCAGTGCCATGGTTCTGCA GCTTTCCTTGGCCCGTAAAATTCG5CGAAGCCAGCCAATCACCAGCTAGGCACCAGCTAAACCC 
TATAATTAGTC7CTTATCAACACCATCCGCTCCCCCGGGATCAATGAGGAGAATGAGGGGGATG CGGGGCTAAAGAAGCCTACATAACCCTCATGCCAACTCCCAGTTTACACTCC-TCGAGCCAACAT CCTGACTATAAGCTAACACAGAATGCCTCAATCCTGGGAAGAACTGGCCGCTGATAAGCGCGCC CGCCTCGCAAAAACCATCCCTGATGAATGGAAAGTCCAGACGCTGCCTGCGGAAGACAGCGTTA TTGATTTCCCAAAGAAATCGGGGATCCTTTCAGAC-GCCGAACTGAAGATCACAGAGGCCTCCGC TGCAGATCTTG7GTCCAAGCTGGCGGCCGGAGAGTTGACCTCGGTGGAAGTTACGCTAGCATTC TGTAAACGGGCAGCAATCGCCCAGCAGTTAGTAGC-GTCCCCTCTACCTCTCAGGGAGATGTAAC AACGCCACCTTATGGGACTATCAAGCTGACGCTGGCTTCTGTGCAGACAAACT&CGCCCACGAG TTCTTCCCTGACGCCGCTCTCGCGCAGGCAAGGGAACTCGATGAATACTACC-CAAAGCACAAGA GACCCGTTGGTCCACTCCATGGCCTCCCCATCTCTCTCAAAGACCAGCTTCC-AGTCAAGGTACA CCGTTGCCCCTAAGTCGTTAGATGTCCCTTTTTGTCAGCTAACATATGCCACCAGGGCTACGAA ACATCAATGGGCTACATCTCATGGCTAAACAAGTACGACGAAGGGGACTCGGTTCTGACAACCA TGCTCCGCAAAGCCGGTGCCGTCTTCTACGTCAAGACCTCTGTCCCGCAGACCCTGATGGTCTG CGAGACAGTCAACAACATCATCGGGCGCACCGTCAACCCACGCAACAAGAACTGGTCGTGCGGC GGCAGTTCTGG7GGTGAGGGTGCGATCGTTGGGATTCGTGGTGGCGTCATCGGTGTAGGAACGG ATATCGGTGGC7CGATTCGAGTGCCGGCCGCGTTCAACTTCCTGTACGGTCTAAGGCCGAGTCA TGGGCGGCTGCCGTATGCAAAGATGGCGAACAGCATGGAGGGTCAGGAGACC-GTGCACAGCGTT GTCGGGCCGAT7ACGCACTCTGTTGAGGGTGAGTCCTTCGCCTCTTCCTTCTTTTCCTGCTCTA TACCACCCCTCCACTCTCCTCCTTTCTTCCTTTTTATACTATATACCACACCCCCACTCACTCA TGAAGTATGTTAGACCTCCGCCTCTTCACCAAATCCGTCCTCGGTCAGGAGCCATGGAAATACG ACTCCAAGGTCATCCCCATGCCCTGGCGCCAGTCCGAGTCGGACATTATTGCCTCCAAGATCAA GAA0GGCGGGC7CAATATCGGCTACTACAACTTCGACGGCAATGTCCTTC0A0ACCCT0CTAT0 CTGCGCGGCGTGGAAACCACCGTCGCCGCACTCGCCAAAGCCGGTCACACC.
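Because the PCR fragment is defined by its two primers, the 3' end of the fragment should contain the reverse complement of the reverse primer. A minimal Python sketch of that check follows; the function name is hypothetical, and `AMPLICON_TAIL` is simply the last stretch of SEQ ID NO: 12 as printed above, so any character misread in the printed sequence would carry over into the comparison.

```python
# Translation table mapping each base to its complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


REVERSE_PRIMER = "GTGTGACCGGCTTTGGCGAGTG"  # SEQ ID NO:11
# Final residues of SEQ ID NO:12 as printed in the text above.
AMPLICON_TAIL = "CTGCGCGGCGTGGAAACCACCGTCGCCGCACTCGCCAAAGCCGGTCACACC"

site = reverse_complement(REVERSE_PRIMER)
print(site)                   # CACTCGCCAAAGCCGGTCACAC
print(site in AMPLICON_TAIL)  # True
```

The same function can be applied to any primer pair to locate the expected binding sites in a template sequence.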
[0137] Protoplasts of the AD5 H. jecorina strain (Δegl1, Δegl2, Δcbh1, Δcbh2, Δbgl1) described in Example 1 were transformed with the linear DNA library as described (U.S. patent application publication US 2006/0094080) and grown on selective agar containing acetamide at 28°C for 7 days (0.6 g/L acetamide, 1.68 g/L CsCl, 20 g/L glucose, 6 g/L KH2PO4, 0.6 g/L MgSO4·7H2O, 0.6 g/L CaCl2·2H2O, 0.5 g/L uridine, trace element salts, 10 g/L low melting point agarose). After 24 hours the agar was overlaid with selective agar supplemented with 1.2 g/L fluoroorotic acid (FOA). A total of 380 colonies were transferred to potato dextrose agar plates containing 1.2 g/L FOA and incubated at 28°C for 4-5 days. Spores were transferred to fresh potato dextrose agar plates, which were incubated at 28°C for 3 days.
[0138] Alternatively, protoplasts of the MAD6 strain described in Example 1 can be employed instead of AD5 for expression of variant library members. Likewise, protoplasts of derivatives of the MAD6 strain in which additional cellulases have been inactivated can be used for this purpose. Such derivatives would exhibit even less background cellulase activity.
[0139] For CBH2 variant protein production, spores were transferred using a 96-pin replicator to 200 µL glycine minimal medium supplemented with 2% glucose/sophorose mixture in a PVDF filter plate: 6.0 g/L glycine; 4.7 g/L (NH4)2SO4; 5.0 g/L KH2PO4; 1.0 g/L MgSO4·7H2O; 33.0 g/L PIPPS; pH 5.5; with sterile addition of a 2% glucose/sophorose mixture as the carbon source, 10 mL/L of 100 g/L CaCl2, and 2.5 mL/L of T. reesei trace elements (400X): 175 g/L citric acid anhydrous; 200 g/L FeSO4·7H2O; 16 g/L ZnSO4·7H2O; 3.2 g/L CuSO4·5H2O; 1.4 g/L MnSO4·H2O; 0.8 g/L H3BO3. Each CBH2 variant was grown in quadruplicate. After sealing with an oxygen permeable membrane, the plates were incubated at 28°C for 6 days while shaking at 200 rpm. Supernatant was harvested by transferring the culture medium to a microtiter plate under low pressure.
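The per-litre recipe above scales linearly with batch volume, and the trace element addition can be checked against its stated stock strength. The following Python sketch shows both calculations; the dictionary and function names are hypothetical, and the 0.5 L batch size is chosen only for illustration.

```python
# Dry components of the glycine minimal medium, in grams per litre (from the recipe above).
GLYCINE_MINIMAL_MEDIUM_G_PER_L = {
    "glycine": 6.0,
    "(NH4)2SO4": 4.7,
    "KH2PO4": 5.0,
    "MgSO4.7H2O": 1.0,
    "PIPPS": 33.0,
}


def grams_for_batch(recipe_g_per_l, batch_volume_l):
    """Scale a grams-per-litre recipe to the requested batch volume in litres."""
    return {name: round(g * batch_volume_l, 3) for name, g in recipe_g_per_l.items()}


def final_fold(stock_fold, ml_stock_per_l):
    """Final strength (in X) after adding ml_stock_per_l of a concentrated stock per litre."""
    return stock_fold * ml_stock_per_l / 1000.0


print(grams_for_batch(GLYCINE_MINIMAL_MEDIUM_G_PER_L, 0.5))
print(final_fold(400, 2.5))  # 1.0, i.e. 2.5 mL/L of a 400X stock gives 1X
```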
[0140] The CBH2 variants were tested for properties of interest. Expression of individual variants was examined using SDS-PAGE. Figure 16A is a picture of four SDS-PAGE gels showing the expression of a number of variants; Figure 16B depicts the average production levels for these variants, with the error bars indicating variability of expression. The specific activities for washed dilute acid pretreated corn stover (PCS 50°C), for corncob at 50°C (CC 50°C), and for corncob at 57°C (CC 57°C) were determined. A total of ten variants that showed improved activity on corncob at 57°C were isolated. Genomic DNA of these strains was isolated and their cbh2 gene sequences determined. The substitutions and performance indices for specific activities of combinatorial library variants on corncob and corn stover are shown in Table 3-2. The performance indices for specific activities were determined based on normalized protein expression levels of the variants.
[0141] PCS 50°C. Corn stover was pretreated with 2% w/w H2SO4 as described (Schell et al., J Appl Biochem Biotechnol, 105:69-86, 2003), followed by multiple washes with deionized water to obtain a paste having a pH of 4.5. Sodium acetate buffer (pH 5.0) was then added to a final concentration of 50 mM sodium acetate, and this mixture was then titrated to pH 5.0 using 1 N NaOH as appropriate. The cellulose concentration in the reaction mixture was approximately 7%. Sixty-five (65) µL of this cellulose suspension was added per well to a 96-well microtiter plate (Nunc Flat Bottom PS). To each well, 10 µL of enzyme sample was added, containing 49 µg protein in supernatant from a quad deleted strain (Δegl1, Δegl2, Δcbh1, Δcbh2).
[0142] Up to 20 µL of culture supernatant from H. jecorina cells expressing either wild-type CBH2 or CBH2 variants was added. Compensating volumes of acetate buffer were added to make up for differences in total volume. After sealing, the plates were incubated at 50°C while shaking at 200 rpm. After 2 days the plate was put on ice for 5 min and 100 µL of 100 mM glycine pH 10.0 was added. After mixing, the plate was centrifuged at 3000 rpm for 5 min. A volume of 10 µL supernatant was diluted in 190 µL water. Ten (10) µL of the diluted solution was transferred to a new 96-well microtiter plate (Costar Flat Bottom PS) containing, in each well, 100 µL ABTS glucose assay mixture (2.74 mg/mL 2,2'-azino-bis(3-ethylbenzothiazoline-6-sulfonic acid), 1 U/mL horseradish peroxidase type VI, 1 U/mL glucose oxidase), and the increase in A420 was recorded in a microtiter plate spectrophotometer (Spectramax Plus 384, Molecular Devices). A range of glucose concentrations was included as a standard on each plate (0; 0.008; 0.016; 0.031; 0.063; 0.125; 0.25; 0.5; 1 mg/mL). Assays were performed in duplicate. A dose response curve was generated for the wild-type CBH2 by fitting the data with a Temkin isotherm equation (y = a + b(ln(1 + c*x))), and the activities of the CBH2 variants were divided by a calculated activity of wild-type CBH2 of the same plate to yield a performance index.
[0143] Corncob 50°C. Corn cob was ground to pass through a 0.9 mm screen, followed by pretreatment in accordance with the method described in Example 4 of PCT Publication WO 2006/110901. Pretreated corn cob was used as a 7% cellulose suspension in 50 mM sodium acetate pH 5.0. Seventy (70) µL of the suspension was added per well to a 96-well microtiter plate (Nunc Flat Bottom PS). To each well, 10 µL of solution was added containing 46.55 µg protein of supernatant from a quad deleted strain (Δegl1, Δegl2, Δcbh1, Δcbh2), supplemented with 4.90 µg T. reesei CBH1, 6.84 µg T. reesei Xyn2 Y5 (Xiong et al, Extremophiles 8:393-400, 2004), 2.28 µg Fusarium verticillioides (Fv) 51A, 5.32 µg Fv3A, 0.76 µg Fv43D, and 2.45 µg T. reesei BGL1. The Fusarium verticillioides enzymes have been described in PCT publication WO 2011/038019.
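The background enzyme mixture described for the corncob 50°C assay can be totalled and related to the amount of cellulose in each well. The Python sketch below does this under two labeled assumptions that are not stated in the text: that the 7% cellulose suspension is weight per volume (0.07 mg/µL), and that the CBH2-containing supernatant added afterwards is excluded from the total. Variable names are hypothetical.

```python
# Background enzyme mixture per well, in micrograms (values from the paragraph above).
ENZYME_MIX_UG = {
    "quad deleted strain supernatant": 46.55,
    "T. reesei CBH1": 4.90,
    "T. reesei Xyn2 Y5": 6.84,
    "Fv51A": 2.28,
    "Fv3A": 5.32,
    "Fv43D": 0.76,
    "T. reesei BGL1": 2.45,
}

total_ug = round(sum(ENZYME_MIX_UG.values()), 2)

# Assumption: "7% cellulose" means 7% w/v, so 70 µL carries 70 * 0.07 mg of cellulose.
cellulose_mg = round(70 * 0.07, 3)

# µg protein per mg cellulose is numerically equal to mg protein per g cellulose.
loading_mg_per_g = total_ug / cellulose_mg

print(total_ug)                    # 69.1
print(cellulose_mg)                # 4.9
print(round(loading_mg_per_g, 1))  # 14.1
```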
[0144] Up to 20 µL of supernatant from H. jecorina cells expressing either wild-type CBH2 or a CBH2 variant was added to each well. Compensating volumes of acetate buffer were added to make up for differences in total volume. The plate was incubated at 50°C while shaking at 200 rpm. After 2 days the plate was put on ice for 5 min and 100 µL of 100 mM glycine pH 10.0 was added. After mixing, the plate was centrifuged at 3000 rpm for 5 min. A volume of 10 µL supernatant was diluted in 190 µL water. Ten (10) µL of the diluted solution was transferred to a new 96-well microtiter plate (Costar Flat Bottom PS) containing 100 µL ABTS glucose assay mixture and assayed and analyzed as described above.
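Converting an ABTS reading into a glucose concentration relies on the on-plate glucose standards and on the 10 µL into 190 µL dilution (20-fold) made before the assay. The Python sketch below illustrates one way to do this. It assumes a linear relation between signal and glucose over the standard range and assumes the standards are assayed at the listed concentrations without further dilution; neither assumption is stated in the text. The readings are synthetic values generated for illustration, not measured data, and the function names are hypothetical.

```python
# Glucose standards included on each plate, in mg/mL (from the assay description).
STANDARDS_MG_PER_ML = [0, 0.008, 0.016, 0.031, 0.063, 0.125, 0.25, 0.5, 1]


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def glucose_in_supernatant(a420, slope, intercept, dilution_factor=20.0):
    """Back-calculate glucose (mg/mL) in the undiluted supernatant from a reading."""
    return dilution_factor * (a420 - intercept) / slope


# Synthetic standard readings (illustrative only): signal = 0.05 + 1.2 * concentration.
readings = [0.05 + 1.2 * c for c in STANDARDS_MG_PER_ML]
slope, intercept = fit_line(STANDARDS_MG_PER_ML, readings)

# A sample reading of 0.65 corresponds to 0.5 mg/mL in the diluted sample,
# i.e. about 10 mg/mL before the 20-fold dilution.
print(round(glucose_in_supernatant(0.65, slope, intercept), 3))
```

If the assay response is not linear over the full standard range, a restricted range or a different calibration model would be needed.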
[0145] Corncob 57°C. Corn cob was ground to pass through a 0.9 mm screen, followed by pretreatment in accordance with the method described in Example 4 of PCT Publication WO 2006/110901. Pretreated corn cob was used as a 7% cellulose suspension in 50 mM sodium acetate pH 5.0. Seventy (70) µL of the suspension was added per well to a 96-well microtiter plate (Nunc Flat Bottom PS). To each well, 10 µL of solution containing 46.55 µg protein of supernatant from a quad deleted strain (Δegl1, Δegl2, Δcbh1, Δcbh2), 4.90 µg CBH1 variant (S8P/T41I/N89D/S92T/S113N/S196T/P227L/D249K/T255P/S278P/E295K/T296P/T332Y/V403D/S411F/T462I), 6.84 µg T. reesei Xyn2 Y5 (Xiong et al, Extremophiles 8:393-400, 2004), 2.28 µg Fv51A, 5.32 µg Fv3A, 0.76 µg Fv43D, and 2.45 µg Talaromyces emersonii beta-glucosidase was added. Up to 20 µL of supernatant from H. jecorina cells expressing either wild-type CBH2 or a CBH2 variant was added. Compensating volumes of acetate buffer were added to make up for differences in total volume. The plate was incubated at 57°C while shaking at 200 rpm. After 2 days the plate was put on ice for 5 min and 100 µL of 100 mM glycine pH 10.0 was added. After mixing, the plate was centrifuged at 3000 rpm for 5 min. A volume of 10 µL supernatant was diluted in 190 µL water. Ten (10) µL of the diluted solution was transferred to a new 96-well microtiter plate (Costar Flat Bottom PS) containing 100 µL ABTS glucose assay mixture and assayed and analyzed as described above.
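The performance index calculation used in these assays (a Temkin isotherm fitted to the wild-type dose response, y = a + b(ln(1 + c*x)), followed by division of each variant activity by the calculated wild-type activity) can be sketched in Python as below. This is a minimal illustration rather than the fitting routine actually used: the grid search over c, the function names and the synthetic wild-type data are all assumptions, and x is taken to be the wild-type dose in whatever units were used for the variants. For a fixed c the model is linear in a and b, so each grid point is solved by ordinary least squares.

```python
import math


def temkin(x, a, b, c):
    """Temkin isotherm: y = a + b * ln(1 + c * x)."""
    return a + b * math.log(1.0 + c * x)


def fit_temkin(doses, responses, c_grid=None):
    """Fit (a, b, c) by scanning c on a log-spaced grid and solving a, b linearly."""
    if c_grid is None:
        c_grid = [10 ** (k / 10.0) for k in range(-30, 31)]  # c from 0.001 to 1000
    n = len(doses)
    best = None
    for c in c_grid:
        zs = [math.log(1.0 + c * x) for x in doses]
        mean_z = sum(zs) / n
        mean_y = sum(responses) / n
        szz = sum((z - mean_z) ** 2 for z in zs)
        if szz == 0:
            continue
        b = sum((z - mean_z) * (y - mean_y) for z, y in zip(zs, responses)) / szz
        a = mean_y - b * mean_z
        sse = sum((y - (a + b * z)) ** 2 for z, y in zip(zs, responses))
        if best is None or sse < best[0]:
            best = (sse, a, b, c)
    return best[1], best[2], best[3]


def performance_index(variant_response, variant_dose, wild_type_params):
    """Variant activity divided by the calculated wild-type activity at the same dose."""
    return variant_response / temkin(variant_dose, *wild_type_params)


# Synthetic wild-type dose response (illustrative only), generated with a=0.1, b=0.5, c=1.
wt_doses = [0.0, 2.5, 5.0, 10.0, 15.0, 20.0]
wt_responses = [temkin(x, 0.1, 0.5, 1.0) for x in wt_doses]
params = fit_temkin(wt_doses, wt_responses)

# A hypothetical variant giving 20% more signal than wild type at the same dose.
variant_signal = 1.2 * temkin(10.0, 0.1, 0.5, 1.0)
print(round(performance_index(variant_signal, 10.0, params), 3))  # 1.2
```

A performance index above 1 then indicates a variant that outperforms wild-type CBH2 at the same dose on the same plate.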
REFERENCES CITED IN THE DESCRIPTION
This list of references cited by the applicant is for the reader's convenience only. It does not form part of the European patent document. Even though great care has been taken in compiling the references, errors or omissions cannot be excluded and the EPO disclaims all liability in this regard.
Patent documents cited in the description
• US20060205042A [0020]
• US5246853A [0074]
• WO9208209A [0074]
• US5475101A [0074]
• WO9428117A [0074]
• US6255115B [09081
• WO2005001036A [0107] [0108]
• WO2008039370A [0108]
• US20060003408A [0125]
• WO2011038019A [0126] [0131] [0143]
• WO2010141779A [0134]
• US20060094080A [0137]
• WO2006110901A [0143] [0145]
Non-patent literature cited in the description
• SINGLETON et al. DICTIONARY OF MICROBIOLOGY AND MOLECULAR BIOLOGY, John Wiley and Sons, 1994 [0015]
• HALE; MARHAM. THE HARPER COLLINS DICTIONARY OF BIOLOGY, Harper Perennial, 1991 [0015]
• SAMBROOK et al. MOLECULAR CLONING: A LABORATORY MANUAL, Cold Spring Harbor Press, 1989 [0015]
• AUSUBEL FM et al. Current Protocols in Molecular Biology, John Wiley & Sons, 1993 [0015]
• ZOU et al. Structure, 1999, vol. 7 (9), 1035-45 [0021]
• TE'O et al. FEMS Microbiology Letters, 2000, vol. 190, 13-19 [0033]
• KARLSSON, J. et al. Eur. J. Biochem, 2001, vol. 268, 6498-6507 [0041]
• WOOD, T. Methods in Enzymology, 1988, vol. 160 [0041]
• Biomass Part A Cellulose and Hemicellulose, Academic Press, 19-25 [0041]
• LEVER, M. Anal. Biochem., 1972, vol. 47, 273- [0041]
• BLAKENEY, A. B.; MUTTON, L. L. J. Sci. Food & Agriculture, 1980, vol. 31, 889- [0041]
• HENRY, R. J. J. of the Institute of Brewing, 1984, vol. 90, 37- [0041]
• KUHLS et al. PNAS, 1996, vol. 93, 7755-7760 [0043]
• SHEIR-NEISS et al. Appl. Microbiol. Biotechnology, 1984, vol. 20, 46-53 [004¾
• MONTENECOURT B.S. Can., 1987, 1-20 [0046]
• SAMBROOK et al. Molecular Cloning, A Laboratory Manual, 1989 [0049]
• KRIEGLER. Gene Transfer and Expression: A Laboratory Manual, 1990 [0049]
• CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, Greene Publishing and Wiley-Interscience, 1994 TOMS]
• STRATHERN et al. The Molecular Biology of the Yeast Saccharomyces, 1981 [0052]
• VAN DEN HONDEL, C. A. M. J. J. et al. More Gene Manipulations in Fungi, Academic Press, 1991, 396-428 [0052]
• FRESHNEY. Animal Cell Culture, 1987 [0066]
• COLIGAN et al. Current Protocols in Immunology, 1991 [0066]
• POURQUIE, J. et al. Biochemistry and Genetics of Cellulose Degradation, Academic Press, 1988, 71-86 jOOMI
• ILMEN, M. et al. Appl. Environ. Microbiol., 1997, vol. 63, 1298-1306 [0068]
• GOEDEGEBUUR et al. Curr. Genet, 2002, vol. 41, 89-98 [0071]
• WARD et al. Appl. Microbiol. Biotechnol., vol. 39, 738-743 [0071]
• SHEIR-NEISS et al. Appl. Microbiol. Biotechnol., 1984, vol. 20, 46-53 [0073]
• BERGES; BARREAU. Curr. Genet., 1991, vol. 19, 359-365 [0077]
• VAN HARTINGSVELDT et al. Mol. Gen. Genet., 1986, vol. 206, 71-75 [0077]
• CAMPBELL et al. Curr. Genet., 1989, vol. 16, 53-56 [008S1
• LORITO et al. Curr. Genet., 1993, vol. 24, 349-356 [06981
• GOLDMAN et al. Curr. Genet., 1990, vol. 17, 169-174 [0096]
• PENTTILA et al. Gene, 1987, vol. 6, 155-164 [0096]
• YELTON et al. Proc. Natl. Acad. Sci. USA, 1984, vol. 81, 1470-1474 [0096]
• BAJAR et al. Proc. Natl. Acad. Sci. USA, 1991, vol. 88, 8202-8212 [0096]
• HOPWOOD et al. The John Innes Foundation, 1985 [0096]
• BRIGIDI et al. FEMS Microbiol. Lett., 1990, vol. 55, 135-138 [0096]
• CAMPBELL, E. I. et al. Curr. Genet., 1989, vol. 16, 53-56 [0097]
• PENTTILA M. et al. Gene, 1988, vol. 63, 11-22 [0097]
• M. WARD et al. Gene, 1990, vol. 86 (2), 153-62 [0111]
• PENTILLA M. et al. Gene, 1987, vol. 61 (2), 155-164 [0128]
• SCHELL et al. J Appl Biochem Biotechnol, 2003, vol. 105, 69-86 [0141]
• XIONG et al. Extremophiles, 2004, vol. 8, 393-400 [0143] [0145]

Claims (12)

Filamentous fungal host strains and DNA constructs, as well as procedures for their use
1. A filamentous fungal host cell expression system comprising: a. a fungal host cell containing in its chromosomal DNA a disruption in one or more components of the non-homologous recombination (NHR) pathway, part of a first selectable marker that lacks a first selectable function, and a second selectable marker that can be used to confer a second selectable function; and b. a nucleic acid molecule containing a sequence which, when introduced into the fungal host cell, confers the first selectable function on the first selectable marker, a sequence that can be used to express one or more genes of interest, and sequences having substantial homology to sequences flanking the chromosomal selectable markers; wherein the homologous sequences cause a homologous recombination event that results in a first functional selectable marker, removal of the second selectable marker and expression of the gene of interest, and wherein the first selectable marker and the second selectable marker are different markers.
2. The filamentous fungal host cell expression system according to claim 1, wherein the one or more components of the NHR pathway are selected from the group consisting of ku80, ku70, rad50, mre11, xrs2, lig4 and xrs.
3. The filamentous fungal host cell expression system according to claim 1, wherein the first selectable marker and the second selectable marker are different markers selected from the group consisting of alsR, amdS, hygR, pyr2, pyr4, pyrG, sucA, a bleomycin resistance marker, a blasticidin resistance marker, a pyrithiamine resistance marker, a chlorimuron ethyl resistance marker, a neomycin resistance marker, an adenine pathway gene, a tryptophan pathway gene, and thymidine kinase.
4. The filamentous fungal host cell expression system according to claim 1, wherein the fungal host cell is from a species selected from the group consisting of Trichoderma, Penicillium, Aspergillus, Humicola, Chrysosporium, Fusarium, Neurospora and Emericella.
5. The filamentous fungal host cell expression system according to claim 4, wherein Trichoderma is T. reesei.
6. The filamentous fungal host cell expression system according to claim 4, wherein Aspergillus is A. niger.
7. The filamentous fungal host cell expression system according to claim 1, wherein the gene of interest is selected from the group consisting of hemicellulases, peroxidases, proteases, cellulases, xylanases, lipases, phospholipases, esterases, cutinases, pectinases, keratinases, reductases, oxidases, phenoloxidases, lipoxygenases, ligninases, pullulanases, tannases, pentosanases, malanases, beta-glucanases, arabinosidases, hyaluronidase, chondroitinase, laccase, amylases, glucoamylases and mixtures thereof.
8. The filamentous fungal host cell expression system according to claim 1, wherein the gene of interest is selected from the group consisting of acetyl esterases, aminopeptidases, amylases, arabinases, arabinofuranosidases, carboxypeptidases, catalases, cellulases, chitinases, chymosin, cutinase, deoxyribonucleases, epimerases, esterases, α-galactosidases, β-galactosidases, α-glucanases, glucan lysases, endo-β-glucanases, glucoamylases, glucose oxidases, α-glucosidases, β-glucosidases, glucuronidases, hemicellulases, hexose oxidases, hydrolases, invertases, isomerases, laccases, lipases, lyases, mannosidases, oxidases, oxidoreductases, pectate lyases, pectin acetyl esterases, pectin depolymerases, pectin methyl esterases, pectinolytic enzymes, peroxidases, phenoloxidases, phytases, polygalacturonases, proteases, rhamno-galacturonases, ribonucleases, thaumatin, transferases, transport proteins, transglutaminases, xylanases, hexose oxidases and combinations thereof.
9.
Filamentøs svampeværtscelle-ekspressionssystem ifølge krav 1, hvor genet af interesse er udvalgt fra gruppen bestående af peptidhormoner, vækstfaktorer, koaguleringsfaktorer, chemokiner, cytokiner, lymfokiner, antistoffer, receptorer, adhæsionsmolekyler, mikrobielle antigener og fragmenter deraf.The filamentous fungal host cell expression system according to claim 1, wherein the gene of interest is selected from the group consisting of peptide hormones, growth factors, coagulation factors, chemokines, cytokines, lymphokines, antibodies, receptors, adhesion molecules, microbial antigens and fragments thereof. 10. Fremgangsmåde til ekspression af et gen af interesse i det filamentøse svampeværtscellesystem ifølge et hvilket som helst af kravene 1 til 9, hvilken fremgangsmåde omfatter introduktion i den filamentøse svampeværtscelle af nukleinsyremolekylet, dyrkning af værtscellerne og udvælgelse af de værtsceller, der har den første valgbare funktion, men mangler den anden valgbare funktion.A method for expressing a gene of interest in the filamentous fungal host cell system according to any one of claims 1 to 9, which comprises introducing into the filamentous fungal host cell of the nucleic acid molecule, culturing the host cells, and selecting the host cells having the first selectable one. feature, but lacks the other selectable feature. 11. Fremgangsmåde ifølge krav 10, hvilken fremgangsmåde endvidere omfatter udførelse af assay for ekspression af genet af interesse.The method of claim 10, further comprising performing assay for expression of the gene of interest. 12. Fremgangsmåde ifølge krav 11, hvilken fremgangsmåde endvidere omfatter udførelse af assay for en biokemisk funktion af ct polypeptid, der cr kodet for af genet af interesse.The method of claim 11, further comprising performing assay for a biochemical function of ct polypeptide that is encoded by the gene of interest.
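The selection step of claim 10 keeps only host cells that have gained the first selectable function and lost the second. The sketch below models that rule as a simple filter. It is an illustration only, not part of the patent: the class name, field names and the three example clones are hypothetical, and real screening would rely on growth on selective media rather than boolean flags.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Transformant:
    """Hypothetical record of one candidate clone after transformation."""
    name: str
    first_function: bool   # first selectable function present (marker restored)
    second_function: bool  # second selectable function still present


def passes_selection(clone: Transformant) -> bool:
    # Claim 10 rule: has the first selectable function, lacks the second.
    return clone.first_function and not clone.second_function


def select(clones: list[Transformant]) -> list[Transformant]:
    # Keep only the clones that satisfy both conditions.
    return [c for c in clones if passes_selection(c)]


# Three illustrative (made-up) outcomes of a transformation.
candidates = [
    Transformant("no_change", first_function=False, second_function=True),
    Transformant("gain_only", first_function=True, second_function=True),
    Transformant("gain_and_loss", first_function=True, second_function=False),
]

selected = select(candidates)
print([c.name for c in selected])
```

Requiring both conditions at once is what distinguishes the claimed outcome (first marker functional, second marker removed) from clones that merely acquired the first function.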
DK11727041.3T 2010-06-03 2011-06-03 Filamentous fungal host strains and DNA constructs, as well as procedures for their use DK2576796T3 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US35128610P 2010-06-03 2010-06-03
PCT/US2011/039092 WO2011153449A1 (en) 2010-06-03 2011-06-03 Filamentous fungal host strains and dna constructs, and methods of use thereof

Publications (1)

Publication Number Publication Date
DK2576796T3 (en) 2017-06-19

Family

ID=44511803

Family Applications (1)

Application Number Title Priority Date Filing Date
DK11727041.3T DK2576796T3 (en) 2010-06-03 2011-06-03 Filamentous fungal host strains and DNA constructs, as well as procedures for their use

Country Status (8)

Country Link
US (1) US9701969B2 (en)
EP (1) EP2576796B1 (en)
JP (1) JP2013533737A (en)
CN (1) CN102939382B (en)
BR (1) BR112012030746A2 (en)
CA (1) CA2801799C (en)
DK (1) DK2576796T3 (en)
WO (1) WO2011153449A1 (en)

Families Citing this family (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103184211B (en) * 2011-12-27 2014-09-24 东北农业大学 Chlorimuron-ethyl resistance-associated protein as well as encoding gene and application thereof
US20160304905A1 (en) * 2013-12-03 2016-10-20 Novozymes A/S Fungal Gene Library By Double Split-Marker Integration
EP3180354A1 (en) * 2014-08-15 2017-06-21 Danisco US Inc. Compositions and methods for improved protein production
EP3183343B1 (en) * 2014-08-20 2020-01-01 Novozymes A/S Recombinase-mediated integration of a polynucleotide library
CN107109424A (en) * 2014-12-01 2017-08-29 丹尼斯科美国公司 Fungal host strain, DNA construct and application method
KR102350405B1 (en) 2014-12-16 2022-01-11 다니스코 유에스 인크. Fungal genome modification systems and methods of use
CN108064267A (en) * 2015-02-09 2018-05-22 丹尼斯科美国公司 Fungal bacterial strain and application method
CN104894165B (en) * 2015-05-26 2019-01-25 中国科学院青岛生物能源与过程研究所 Method for improving gene targeting efficiency in Aspergillus terreus, and application thereof
CN104894115B (en) * 2015-05-26 2018-11-13 中国科学院青岛生物能源与过程研究所 Aspergillus terreus with efficient homologous recombination ability, and construction method and application thereof
BR112018011503A2 (en) 2015-12-07 2018-12-04 Zymergen Inc corynebacterium glutamicum promoters
US9988624B2 (en) 2015-12-07 2018-06-05 Zymergen Inc. Microbial strain improvement by a HTP genomic engineering platform
US11208649B2 (en) 2015-12-07 2021-12-28 Zymergen Inc. HTP genomic engineering platform
US20180362946A1 (en) 2015-12-18 2018-12-20 Danisco Us Inc. Polypeptides with endoglucanase activity and uses thereof
EP3239898A1 (en) * 2016-04-27 2017-11-01 Mercarista, S.L. Method for quantifying proteins in a sample
US10544411B2 (en) 2016-06-30 2020-01-28 Zymergen Inc. Methods for generating a glucose permease library and uses thereof
EP3478833A4 (en) 2016-06-30 2019-10-02 Zymergen, Inc. Methods for generating a bacterial hemoglobin library and uses thereof
KR102593668B1 (en) 2016-10-04 2023-10-24 다니스코 유에스 인크. Protein production in filamentous fungal cells in the absence of inducing substrates
KR20190098213A (en) * 2016-12-30 2019-08-21 지머젠 인코포레이티드 Method for preparing fungal production strains using automated steps for genetic engineering and strain purification
CN108588060B (en) * 2017-03-07 2020-12-01 武汉康复得生物科技股份有限公司 Recombinant oxalate decarboxylase expressed by filamentous fungus host cell
KR20200026878A (en) 2017-06-06 2020-03-11 지머젠 인코포레이티드 HTP Genome Engineering Platform to Improve Fungal Strains
CN108004153B (en) * 2017-12-07 2021-05-28 潍坊康地恩生物科技有限公司 Trichoderma reesei strain capable of producing pectin lyase in high yield and application thereof
CN112166180A (en) 2018-06-06 2021-01-01 齐默尔根公司 Manipulation of genes involved in signal transduction to control fungal morphology during fermentation and production
US20240102070A1 (en) 2019-11-08 2024-03-28 Danisco Us Inc. Fungal strains comprising enhanced protein productivity phenotypes and methods thereof
KR20230004495A (en) 2020-04-22 2023-01-06 다니스코 유에스 인크. Compositions and methods for improved protein production in filamentous fungal cells
US11479779B2 (en) 2020-07-31 2022-10-25 Zymergen Inc. Systems and methods for high-throughput automated strain generation for non-sporulating fungi
CN117716039A (en) 2021-07-19 2024-03-15 丹尼斯科美国公司 Compositions and methods for enhancing protein production in fungal cells
CN114717268A (en) * 2021-11-01 2022-07-08 天津农学院 Preparation of traceless genetic transformation strain
WO2023102315A1 (en) 2021-12-01 2023-06-08 Danisco Us Inc. Compositions and methods for enhanced protein production in fungal cells

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5246853A (en) 1990-10-05 1993-09-21 Genencor International, Inc. Method for treating cotton-containing fabric with a cellulase composition containing endoglucanase components and which composition is free of exo-cellobiohydrolase I
ATE257511T1 (en) 1990-10-05 2004-01-15 Genencor Int METHODS FOR TREATING COTTON CONTAINING FIBERS WITH CELLULASE
US5475101A (en) 1990-10-05 1995-12-12 Genencor International, Inc. DNA sequence encoding endoglucanase III cellulase
FI932521A0 (en) 1993-06-02 1993-06-02 Alko Ab Oy Now endoglucan enzyme
CN1151264C (en) 1997-04-07 2004-05-26 尤尼利弗公司 Agrobacterium mediated transformation of moulds, in particular those belonging to the genus aspergillus
US7666648B2 (en) 2003-05-29 2010-02-23 Danisco Us Inc. Isolated polypeptide having arabinofuranosidase activity
US7413887B2 (en) 2004-05-27 2008-08-19 Genecor International, Inc. Trichoderma reesei glucoamylase and homologs thereof
EP1751281A2 (en) 2004-05-27 2007-02-14 Genencor International, Inc. Aspergillus kawachi acid-stable alpha amylase and applications in granular starch hydrolysis
CA2592550C (en) 2004-12-30 2015-05-19 Genencor International, Inc. Novel variant hypocrea jecorina cbh2 cellulases
CN101160388B (en) 2005-04-12 2013-05-01 纳幕尔杜邦公司 System and process for biomass treatment
DE602007005237D1 (en) * 2006-04-08 2010-04-22 Dsm Ip Assets Bv IMPROVED METHOD FOR HOMOLOGOUS RECOMBINATION IN FUNGAL CELLS
JP5322308B2 (en) 2006-09-22 2013-10-23 ダニスコ・ユーエス・インク、ジェネンコー・ディビジョン Acetolactate synthase (ALS) selection marker from Trichoderma reesei
EP2126100B1 (en) 2007-03-21 2012-11-14 DSM IP Assets B.V. Improved method for homologous recombination
AU2010256519B2 (en) 2009-06-03 2016-03-17 Danisco Us Inc. Cellulase variants with improved expression, activity and/or stability, and use thereof
CA2774776A1 (en) 2009-09-23 2011-03-31 Danisco Us Inc. Novel glycosyl hydrolase enzymes and uses thereof

Also Published As

Publication number Publication date
US9701969B2 (en) 2017-07-11
CN102939382A (en) 2013-02-20
EP2576796A1 (en) 2013-04-10
CA2801799A1 (en) 2011-12-08
JP2013533737A (en) 2013-08-29
CA2801799C (en) 2018-11-20
EP2576796B1 (en) 2017-03-29
CN102939382B (en) 2017-02-15
WO2011153449A1 (en) 2011-12-08
US20130149742A1 (en) 2013-06-13
BR112012030746A2 (en) 2016-02-16

Similar Documents

Publication Publication Date Title
DK2576796T3 (en) Filamentous fungal host strains and DNA constructs, as well as procedures for their use
US9850501B2 (en) Simultaneous site-specific integrations of multiple gene-copies
US10100344B2 (en) AgsE-deficient strain
JP7285780B2 (en) Production of proteins in filamentous fungal cells in the absence of inducing substrates
US9631197B2 (en) Rasamsonia transformants
US20140113304A1 (en) Bi-Directional Cytosine Deaminase-Encoding Selection Marker
EP3227430B1 (en) Fungal host strains, dna constructs, and methods of use
JP2019122386A (en) Filamentous fungal cell with inactivated component of selective autophagy pathway and method of using the same