WO2009079376A2

WO2009079376A2 - Lepidopteran insect n-acetylglucosaminidase genes and their use in glycoengineering

Info

Publication number: WO2009079376A2
Application number: PCT/US2008/086606
Authority: WO
Inventors: Donald L. Jarvis; Christoph Geisler
Original assignee: The University Of Wyoming
Priority date: 2007-12-14
Filing date: 2008-12-12
Publication date: 2009-06-25
Also published as: US20150191732A1; US20100279415A1; US8846886B2; WO2009079376A3

Abstract

A transgenic insect cell line for production of elevated levels of recombinant glycoproteins comprising mammalian-like N-glycans is provided. Also disclosed are nucleic acid sequences encoding β-N-acetylglucosaminidases.

Description

LEPIDOPTERAN INSECT N-ACETYLGLUCOSAMINIDASE GENES AND THEIR USE IN GLYCOENGINEERING

Inventors: Donald L. Jarvis

Christoph Geisler

CROSS-REFERENCE TO RELATED APPLICATIONS

The present application claims the benefit of U.S. Provisional Patent Application No. 61/013,815, filed December 14, 2007, the entire disclosure of which is incorporated by reference herein.

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT

Pursuant to 35 U.S.C. §202(c), it is acknowledged that the U.S. Government has certain rights in the invention described, which was made in part with funds from NIH grant number GM49734.

FIELD OF THE INVENTION

This invention relates to the fields of molecular biology and production of proteins possessing complex type oligosaccharide side chains. More specifically, the invention provides novel nucleic acid sequences encoding β-N-acetylglucosaminidase enzymes and recombinant insect cell lines comprising the same for the production of therapeutic and commercially valuable glycoproteins.

BACKGROUND OF THE INVENTION

Several publications and patent documents are cited throughout the specification in order to describe the state of the art to which this invention pertains. Each of these citations is incorporated herein by reference as though set forth in full.

Insects and other lower eukaryotes, such as nematodes and plants, occupy an interesting evolutionary niche in glycobiology because they produce N-glycoproteins, but they typically process their N-linked glycans less extensively than mammals (1,2). This difference between lower and higher eukaryotic protein N-glycosylation pathways is biotechnologically significant because insects and plants are used to produce recombinant mammalian glycoproteins for many different biomedical research applications (3-7). Insect and mammalian protein N-glycosylation pathways each begin with the co-translational transfer of N-glycan precursors to nascent proteins (1,8). These precursors are subsequently trimmed and elongated by enzymes localized in the endoplasmic reticulum and Golgi apparatus of insect and mammalian cells to produce a common intermediate with the structure Manα6(GlcNAcβ2Manα3)Manβ4GlcNAcβ4GlcNAc-R. In mammalian cells, this intermediate is elongated by various glycosyltransferases to produce complex N-glycans, which often have terminal sialic acid residues. In contrast, insect cells usually fail to elongate this same intermediate and convert it, instead, to paucimannose N-glycans with the core structure Manα6(Manα3)Manβ4GlcNAcβ4GlcNAc-R. An unusual β-N-acetylglucosaminidase is responsible for the production of these structures (9). This enzyme specifically removes the terminal N-acetylglucosamine residue from the α3 branch of Manα6(GlcNAcβ2Manα3)Manβ4GlcNAcβ4GlcNAc-R, simultaneously eliminating the intermediate required for N-glycan elongation and producing the core paucimannose glycan typically found on insect cell-derived N-glycoproteins.
This same enzyme is also responsible for the production of core paucimannose N-glycans in nematodes (10,11) and plants (12,13). Thus, the presence of a processing β-N-acetylglucosaminidase is a key difference between the protein N-glycosylation pathways of lower and higher eukaryotes. In the seminal insect study on this topic, Altmann and coworkers (9) demonstrated that IPLB-Sf21AE, a cell line derived from the lepidopteran insect S. frugiperda (14), has a membrane-associated β-N-acetylglucosaminidase activity that can specifically cleave the terminal N-acetylglucosamine residue from the α3 branch of a biantennary N-glycan in vitro. Subsequently, it was shown that cell lines derived from E. acrea, another lepidopteran insect, produced hybrid and complex N-glycans containing terminal N-acetylglucosamine or galactose residues because they lack this intracellular β-N-acetylglucosaminidase activity (15). Together, these studies strongly supported the idea that the N-glycosylation pathway of at least some insect cells includes a processing β-N-acetylglucosaminidase, as described above. However, unequivocal proof of this concept awaited the isolation of an insect gene encoding this enzyme, together with evidence that the gene product had the substrate specificity of the N-glycan processing enzyme.

The first proof of this kind was provided by a more recent study from Altmann's group, in which they demonstrated that the D. melanogaster fused lobes (Dm-fdl) gene encodes the specific, processing β-N-acetylglucosaminidase in this organism (16). Importantly, this study demonstrated that the Dm-fdl gene product has several features distinguishing it from degradative hexosaminidases and chitinases, which also have β-N-acetylglucosaminidase activities. These features included its specificity for the terminal N-acetylglucosamine residue linked to the α3 branch of N-glycan substrates and its inability to degrade chito-oligosaccharides. Furthermore, it was shown that flies lacking a functional fdl gene produced a higher proportion of N-glycans with terminal N-acetylglucosamine residues linked to the α3 branch than wild type. These findings, together with the finding that the D. melanogaster hexosaminidase genes (hexo1 and hexo2) encode enzymes that can cleave chito-oligosaccharides, but not N-glycans, strongly suggested that Dm-FDL is the β-N-acetylglucosaminidase responsible for N-glycan processing in this fly. These properties also were consistent with the idea that Dm-FDL is an ortholog of the lepidopteran insect N-glycan processing enzyme first detected by Altmann and coworkers (1995) in microsomal membranes from IPLB-Sf21AE cells. Subsequently, two lab groups independently reported molecular cloning of genes encoding β-N-acetylglucosaminidases from Sf9 cells, which are a clonal derivative of the IPLB-Sf21AE cell line (17,18). Our group described the isolation of three β-N-acetylglucosaminidase genes from Sf9 cells, which were designated SfGlcNAcase-1, -2, and -3 (18). SfGlcNAcase-1 was clearly distinct from the other two, which were nearly identical to each other and appeared to be allelic variants of the same gene.
Further analysis of the SfGlcNAcase-1 and SfGlcNAcase-3 gene products showed that they had high sequence homology to known hexosaminidases and that each also had β-N-acetylglucosaminidase activity when assayed against relevant substrates. However, neither had the tight α3 branch specificity of the processing enzyme activity originally described by Altmann and coworkers (1995). In fact, each could remove the terminal N-acetylglucosamine residues from either the α3 or the α6 branch of various N-glycan substrates and each also was able to release N-acetylglucosamine monomers from a chito-oligosaccharide substrate. Accordingly, we concluded that none of these S. frugiperda genes encoded the N-glycan processing enzyme, but rather, that they encoded broad-spectrum β-N-acetylglucosaminidases that are more likely to be involved in N-glycan and chitin degradation. In a similar study, Tomiya and coworkers (2006) also molecularly cloned two allelic variants of an Sf9 cell β-N-acetylglucosaminidase gene, which they termed Sfhex. Further analysis of the Sfhex gene product, which is identical to the gene product we designated SfGlcNAcase-3, confirmed that the SfGlcNAcase-3/Sfhex gene product lacks the α3 branch specificity of the processing enzyme activity originally described by Altmann and coworkers. However, because this enzyme had a 2- to 5-fold higher preference for the terminal N-acetylglucosamine residue on the α3 branch of an N-glycan substrate, Tomiya and coworkers (2006) concluded that the SfGlcNAcase-3/Sfhex gene encodes the processing β-N-acetylglucosaminidase of Sf9 cells.

SUMMARY OF THE INVENTION

In accordance with the present invention, an isolated nucleic acid encoding an N-acetylglucosaminidase is provided. In one embodiment, the nucleic acid encodes a protein of SEQ ID NO: 2. In a preferred embodiment, the nucleic acid is SEQ ID NO: 1. The nucleic acid molecules of the invention may be DNA, RNA, or cDNA and they may be single or double stranded. Additional embodiments of the invention include nucleic acids of SEQ ID NOS: 3, 5, and 7 and their encoded proteins SEQ ID NOS: 4, 6, and 8.

In another aspect, expression vectors comprising the nucleic acid molecules described above are provided. Also within the scope of the invention are recombinant insect cells transformed with such expression vectors. In a particularly preferred embodiment, the RNA molecule is a fragment of SEQ ID NO: 1, having SEQ ID NO: 9, which is double stranded and, when expressed in a cell, down regulates production of the protein of SEQ ID NO: 2.

In still another aspect, the present invention provides isolated proteins comprising SEQ ID NOS: 2, 4, 6, and 8. The isolated proteins of this invention may be used for the production of specific glycans for use as standards, or substrates, e.g., in remodeling recombinant glycoprotein glycans.

In yet another aspect, a method for enhancing production of mammalian-like N-glycans in insect cells is provided. An exemplary method entails providing recombinant insect cell lines comprising the double stranded RNA molecule described above, and either transforming the cells with an expression vector or infecting the cells with a recombinant baculovirus comprising a nucleic acid encoding a heterologous glycoprotein of interest, wherein glycoprotein(s) expressed in the recombinant cells comprise elevated levels of mammalian-like N-glycans when compared to levels observed in wild type cells. In an alternative embodiment, the cells described above may optionally contain additional enzymes involved in the production and synthesis of mammalian-like N-glycans. Such enzymes include, without limitation, N-acetylglucosaminyltransferases, galactosyltransferases, sialyltransferases, sulfotransferases, sialic acid synthases, CMP-sialic acid synthetases, UDP-N-acetylglucosamine-2-epimerases/N-acetylmannosamine kinases, and CMP-sialic acid transporters.

Presented hereinbelow are data that will resolve the apparent discrepancy in the conclusions drawn from the two previous reports referred to above (17,18). In short, the present inventors molecularly cloned a β-N-acetylglucosaminidase cDNA from Sf9 cells, which turned out to be the S. frugiperda ortholog of the Dm-fdl gene. This gene, designated Sf-fdl, encodes a membrane-associated product that specifically cleaves the terminal N-acetylglucosamine residue from the α3 branch of N-glycan substrates, that has little or no activity against chito-oligosaccharide substrates, and that has precisely the same pH profile as the activity originally identified by Altmann and coworkers (1995) in IPLB-Sf21AE cell microsomes. Furthermore, Sf9 cells engineered to express an Sf-fdl-specific double-stranded RNA had lower levels of specific, processing β-N-acetylglucosaminidase activity.
These results indicate that the specific, processing β-N-acetylglucosaminidase activity originally detected by Altmann and coworkers is encoded by the Sf-fdl gene in this lepidopteran insect cell line. The definitive identification of this new gene sets the stage for an effort to create a transformed Sf9 cell variant lacking this key N-glycan processing activity, which would be an improved host for recombinant glycoprotein production by baculovirus expression vectors.

BRIEF DESCRIPTION OF THE DRAWINGS

Fig. 1. Nucleotide sequence of the Sf-fdl gene (SEQ ID NO: 1) and amino acid sequence of the gene product (SEQ ID NO: 2). The putative N-terminal transmembrane domain is underlined and the two consensus N-glycosylation sites are boxed.

Fig. 2. Phylogenetic relationships between the Sf-FDL protein and known hexosaminidases. This Figure shows the phylogenetic relationships between the Sf-FDL protein and Dm-FDL (Acc. No. NM_165909; 16), SfGlcNAcase-3/SfHex (Acc. No. DQ249309; 17,18), SfGlcNAcase-1 (DQ249307; 18), and the human hexosaminidases A (Acc. No. NM_000520; 33) and B (NM_000521; 34). The amino acid sequences of these proteins were aligned using CLUSTALX version 1.83 (21) with the default settings, and the alignment was then exported in the PHYLIP format (36) and used to generate a distance matrix by protdist in PHYLIP version 3.66 with the Jones-Taylor-Thornton model. Neighbor in PHYLIP version 3.66 was used to generate an unrooted tree from the distance matrix with the neighbor-joining method and, finally, the Neighbor output was used to draw an unrooted tree with the PHYLIP postscript generator. The Sf-FDL amino acid sequence is 44% and 29% identical to the sequences of Dm-FDL and SfGlcNAcase-3/SfHex, respectively.

Fig. 3. Substrate specificity of Sf-FDL. Various glycan substrates, including GnGn (A), MGn (B), GnM (C), and chitotriose (D) were incubated for 16 h with microsomal fractions containing 10 µg of total protein from Sf9 cells infected with AcMNPV, AcDm-FDL, AcSf-FDL, or AcGlcNAcase-3. The reaction products were then recovered and analyzed by reverse-phase HPLC, as described in Experimental Procedures. The arrows show the elution times for each of the relevant glycans.

Fig. 4. pH optimum of Sf-FDL. Microsomal fractions containing 10 µg of total protein from AcSf-FDL-infected Sf9 cells were incubated for 16 h with GnGn at pH values between 4.0 and 8.0, and then the reaction products were recovered and analyzed by reverse-phase HPLC, as described in Experimental Procedures. The plot shows the relative amount of GnM produced at each pH, expressed as a percentage: the area under the GnM peak divided by the sum of the areas under the GnGn and GnM peaks.
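The peak-area ratio described for Fig. 4 can be sketched as a short calculation. This is only an illustration: the function name and the example peak areas are hypothetical and are not taken from the figure or from the Experimental Procedures.

```python
def percent_gnm(area_gngn: float, area_gnm: float) -> float:
    """Return the GnM peak area as a percentage of the summed GnGn and GnM peak areas."""
    total = area_gngn + area_gnm
    if total <= 0:
        raise ValueError("summed peak area must be positive")
    return 100.0 * area_gnm / total


# Hypothetical peak areas (arbitrary units) keyed by assay pH: (GnGn area, GnM area).
# These numbers are placeholders for illustration, not measured data.
example_areas = {5.0: (75.0, 25.0), 6.0: (40.0, 60.0), 7.0: (90.0, 10.0)}

# Relative GnM production at each pH, as plotted in a pH profile.
profile = {ph: percent_gnm(gngn, gnm) for ph, (gngn, gnm) in example_areas.items()}
```

Because the ratio uses only the substrate and product peaks from the same chromatogram, it does not depend on the absolute amount of material injected.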

Fig. 5. Expression and purification of GST-tagged β-N-acetylglucosaminidase ectodomains. The GST-tagged ectodomains of Sf-FDL (lanes 1), Dm-FDL (lanes 2), and SfGlcNAcase-3/Sfhex (lanes 3) were expressed in recombinant baculovirus-infected Sf9 cells and purified from the extracellular fraction by glutathione affinity chromatography, as described in Experimental Procedures. Equal amounts of the purified products were then analyzed by (A) SDS-PAGE with Coomassie Blue staining or (B) SDS-PAGE with immunoblotting using a GST-specific antiserum.

Fig. 6. Substrate specificity of the GST-tagged ectodomains of Sf-FDL, Dm-FDL, and SfGlcNAcase-3/Sfhex. Equal amounts of each enzyme were incubated for 2 h with GnGn (A), MGn (B), GnM (C), or chitotriose (D) and the reaction products were recovered and analyzed by reverse-phase HPLC, as described in Experimental Procedures. The arrows show the elution times for each of the relevant glycans.

Fig. 7. Overdigestion of glycan substrates with the GST-tagged ectodomains of Sf-FDL, Dm-FDL, and SfGlcNAcase-3/Sfhex. Equal amounts of each enzyme were incubated for 20 h with GnGn (A), GnM (B), or chitotriose (C) and the reaction products were recovered and analyzed by reverse-phase HPLC, as described in Experimental Procedures. The arrows show the elution times for each of the relevant glycans.

Fig. 8. Nucleotide sequence of the Tn-fdl gene (SEQ ID NO: 3) and amino acid sequence of the gene product (SEQ ID NO:4). The putative N-terminal transmembrane domain is underlined and the two consensus N-glycosylation sites are boxed.

Fig. 9. Nucleotide sequence of one allele of the Bm-fdl gene (SEQ ID NO:5) and amino acid sequence of the gene product (SEQ ID NO:6). The putative N-terminal transmembrane domain is underlined and the three consensus N-glycosylation sites are boxed.

Fig. 10. Nucleotide sequence of another allele of the Bm-fdl gene (SEQ ID NO:7) and amino acid sequence of the gene product (SEQ ID NO:8). The putative N-terminal transmembrane domain is underlined and the three consensus N-glycosylation sites are boxed.

Fig. 11. Endogenous levels of specific, processing β-N-acetylglucosaminidase activity in parental Sf9 cells and an Sf9-derived clone expressing an Sf-fdl-specific double-stranded RNA. Microsomal membrane preparations from Sf9 or SfFDL RNAi cells were incubated for 16 h with GnGn, and the reaction products were analyzed by HPLC to compare the relative amounts of GnM produced. The plot shows the average results obtained in five replicate assays, with the average percentage of GnM produced by microsomes from the Sf9 controls set to 100%. The error bars show the standard deviations, and a one-way ANOVA showed that the two datasets are significantly different (P < 0.01).
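The normalization and comparison described for Fig. 11 can be sketched as follows. This is a minimal illustration under stated assumptions: the function names are hypothetical, no replicate values from the figure are reproduced, and the sketch stops at the F statistic, since converting F to a P value requires the F distribution (from a statistics package or table), which is not included here.

```python
from statistics import mean, stdev


def normalize_to_control(control, treated):
    """Rescale both datasets so that the control mean equals 100%."""
    ref = mean(control)
    if ref == 0:
        raise ValueError("control mean must be non-zero")
    scaled_control = [100.0 * v / ref for v in control]
    scaled_treated = [100.0 * v / ref for v in treated]
    return scaled_control, scaled_treated


def one_way_anova_f(*groups):
    """Return the one-way ANOVA F statistic for two or more groups of replicates."""
    all_values = [v for g in groups for v in g]
    grand_mean = mean(all_values)
    # Between-group and within-group sums of squares.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    ss_within = sum((v - mean(g)) ** 2 for g in groups for v in g)
    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    return (ss_between / df_between) / (ss_within / df_within)


def summarize(values):
    """Return (mean, sample standard deviation), e.g. for a bar with an error bar."""
    return mean(values), stdev(values)
```

With two groups, as in this figure, the one-way ANOVA is equivalent to a two-sample t-test with pooled variance (F equals t squared).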

Fig. 12. The sequence utilized in the RNAi experiment is shown (SEQ ID NO:9).

DETAILED DESCRIPTION OF THE INVENTION

Manα6(Manα3)Manβ4GlcNAcβ4GlcNAc-R is the core structure of the major processed protein N-glycans produced by insect cells. Ultimately, this paucimannose type structure is produced by an unusual β-N-acetylglucosaminidase, which removes the terminal N-acetylglucosamine residue from the upstream intermediate, Manα6(GlcNAcβ2Manα3)Manβ4GlcNAcβ4GlcNAc-R. Because the N-glycan processing pathways leading to the production of this intermediate are probably identical in insects and higher eukaryotes, the presence or absence of this specific, processing β-N-acetylglucosaminidase is a key factor distinguishing the processing pathways in these two different types of organisms. Recent studies have shown that the fused lobes (fdl) gene encodes the specific, processing β-N-acetylglucosaminidase of D. melanogaster. However, there are conflicting reports on the identity of the gene encoding this enzyme in the lepidopteran insect, S. frugiperda. One has suggested that a gene alternatively designated SfGlcNAcase-3 or SfHex encodes this function, while another has suggested that this gene encodes a broad-spectrum β-N-acetylglucosaminidase that functions in glycan and chitin degradation. In the present invention, this conflict is resolved by demonstrating that an S. frugiperda fdl ortholog (Sf-fdl) encodes a product with the substrate specificity expected of a processing β-N-acetylglucosaminidase. It is also shown that the endogenous levels of specific, processing β-N-acetylglucosaminidase activity are significantly reduced in S. frugiperda cells engineered to express a double-stranded RNA derived from the Sf-fdl gene. These results indicate that Sf-fdl encodes the specific, processing β-N-acetylglucosaminidase of S. frugiperda.
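The distinction drawn above between a specific, processing β-N-acetylglucosaminidase and a broad-spectrum one can be illustrated with a toy model. This is a deliberately simplified sketch: the class and function names are hypothetical, each glycan is reduced to the terminal residue on its α6 and α3 branches, and enzyme kinetics are ignored.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Glycan:
    """Trimannosyl core described only by the terminal residue on each branch.

    Each field is either "GlcNAc" (terminal N-acetylglucosamine present)
    or "Man" (branch mannose exposed).
    """
    alpha6: str
    alpha3: str


def trim_processing(glycan: Glycan) -> Glycan:
    """Model a branch-specific enzyme: remove terminal GlcNAc from the alpha3 branch only."""
    if glycan.alpha3 == "GlcNAc":
        return replace(glycan, alpha3="Man")
    return glycan


def trim_broad_spectrum(glycan: Glycan) -> Glycan:
    """Model a broad-spectrum enzyme: remove terminal GlcNAc from either branch."""
    return Glycan(
        alpha6="Man" if glycan.alpha6 == "GlcNAc" else glycan.alpha6,
        alpha3="Man" if glycan.alpha3 == "GlcNAc" else glycan.alpha3,
    )


# The upstream intermediate Manα6(GlcNAcβ2Manα3)Manβ4GlcNAcβ4GlcNAc-R in this toy model.
intermediate = Glycan(alpha6="Man", alpha3="GlcNAc")
paucimannose = trim_processing(intermediate)
```

In this model the two enzyme types give the same product from the intermediate, and differ only on a substrate that carries terminal N-acetylglucosamine on both branches, which is why substrates of that kind are informative in specificity assays.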

Definitions:

A "cell line" refers to cells which can be cultured in the lab for an indefinite period and are useful for producing large amounts of a protein of interest. Ideally, such cells are immortalized and do not exhibit senescence in culture.

As used herein, the term "insect" includes any stage of development of an insect, including a one-celled germ line cell, a fertilized egg, an early embryo, a larva, including any of a first through final instar larva, a pupa, or an adult insect. For the production of mammalianized glycoproteins of interest, a large larva, such as a fourth or fifth instar larva is preferred. It will be evident to a skilled worker which insect stage is suitable for a particular purpose, such as for direct production of a glycosylated polypeptide of interest, for storage or transport of an insect to a different location, for generation of progeny, for further genetic crosses, or the like.

With reference to nucleic acids of the invention, the term "isolated nucleic acid" is sometimes used. This term, when applied to DNA, refers to a DNA molecule that is separated from sequences with which it is immediately contiguous (in the 5' and 3' directions) in the naturally occurring genome of the organism from which it originates. For example, the "isolated nucleic acid" may comprise a DNA or cDNA molecule inserted into a vector, such as a plasmid or virus vector, or integrated into the DNA of a prokaryote or eukaryote. With respect to RNA molecules of the invention, the term "isolated nucleic acid" primarily refers to an RNA molecule encoded by an isolated DNA molecule as defined above. Alternatively, the term may refer to an RNA molecule that has been sufficiently separated from RNA molecules with which it would be associated in its natural state (i.e., in cells or tissues), such that it exists in a "substantially pure" form (the term "substantially pure" is defined below).

With respect to protein, the term "isolated protein" or "isolated and purified protein" is sometimes used herein. This term refers primarily to a protein produced by expression of an isolated nucleic acid molecule of the invention. Alternatively, this term may refer to a protein which has been sufficiently separated from other proteins with which it would naturally be associated, so as to exist in "substantially pure" form.

The term "promoter region" refers to the transcriptional regulatory regions of a gene, which may be found at the 5' or 3' side of the coding region, or within the coding region, or within introns.

The term "vector" refers to a small carrier DNA molecule into which a DNA sequence can be inserted for introduction into a host cell where it will be replicated. An "expression vector" is a specialized vector that contains a gene or nucleic acid sequence with the necessary regulatory regions needed for expression in a host cell.

The term "operably linked" means that the regulatory sequences necessary for expression of a coding sequence are placed in the DNA molecule in the appropriate positions relative to the coding sequence so as to effect expression of the coding sequence. This same definition is sometimes applied to the arrangement of coding sequences and transcription control elements (e.g. promoters, enhancers, and termination elements) in an expression vector. This definition is also sometimes applied to the arrangement of nucleic acid sequences of a first and a second nucleic acid molecule wherein a hybrid nucleic acid molecule is generated.

The term "substantially pure" refers to a preparation comprising at least 50-60% by weight the compound of interest (e.g., nucleic acid, oligonucleotide, protein, etc.). More preferably, the preparation comprises at least 75% by weight, and most preferably 90-99% by weight, of the compound of interest. Purity is measured by methods appropriate for the compound of interest (e.g. chromatographic methods, agarose or polyacrylamide gel electrophoresis, HPLC analysis, and the like).

The phrase "consisting essentially of" when referring to a particular nucleotide sequence or amino acid sequence means a sequence having the properties of a given SEQ ID NO. For example, when used in reference to an amino acid sequence, the phrase includes the sequence per se and molecular modifications that would not affect the basic and novel characteristics of the sequence.

The term "oligonucleotide," as used herein refers to primers and probes of the present invention, and is defined as a nucleic acid molecule comprised of two or more ribo- or deoxyribonucleotides, preferably more than three. The exact size of the oligonucleotide will depend on various factors and on the particular application for which the oligonucleotide is used.

The term "probe" as used herein refers to an oligonucleotide, polynucleotide or nucleic acid, either RNA or DNA, whether occurring naturally as in a purified restriction enzyme digest or produced synthetically, which is capable of annealing with or specifically hybridizing to a nucleic acid with sequences complementary to the probe. A probe may be either single-stranded or double-stranded. The exact length of the probe will depend upon many factors, including temperature, source of probe and method of use. For example, for diagnostic applications, depending on the complexity of the target sequence, the oligonucleotide probe typically contains 15-25 or more nucleotides, although it may contain fewer nucleotides.

The probes herein are selected to be "substantially" complementary to different strands of a particular target nucleic acid sequence. This means that the probes must be sufficiently complementary so as to be able to "specifically hybridize" or anneal with their respective target strands under a set of pre-determined conditions. Therefore, the probe sequence need not reflect the exact complementary sequence of the target. For example, a non-complementary nucleotide fragment may be attached to the 5' or 3' end of the probe, with the remainder of the probe sequence being complementary to the target strand. Alternatively, non-complementary bases or longer sequences can be interspersed into the probe, provided that the probe sequence has sufficient complementarity with the sequence of the target nucleic acid to anneal therewith specifically.

The term "specifically hybridize" refers to the association between two single-stranded nucleic acid molecules of sufficiently complementary sequence to permit such hybridization under pre-determined conditions generally used in the art (sometimes termed "substantially complementary"). In particular, the term refers to hybridization of an oligonucleotide with a substantially complementary sequence contained within a single-stranded DNA or RNA molecule of the invention, to the substantial exclusion of hybridization of the oligonucleotide with single-stranded nucleic acids of non-complementary sequence.

The term "primer" as used herein refers to an oligonucleotide, either RNA or DNA, either single-stranded or double-stranded, either derived from a biological system, generated by restriction enzyme digestion, or produced synthetically which, when placed in the proper environment, is able to act functionally as an initiator of template-dependent nucleic acid synthesis. When presented with an appropriate nucleic acid template, suitable nucleoside triphosphate precursors of nucleic acids, a polymerase enzyme, suitable cofactors and conditions such as a suitable temperature and pH, the primer may be extended at its 3' terminus by the addition of nucleotides by the action of a polymerase or similar activity to yield a primer extension product. The primer may vary in length depending on the particular conditions and requirements of the application. For example, in diagnostic applications, the oligonucleotide primer is typically 15-25 or more nucleotides in length. The primer must be of sufficient complementarity to the desired template to prime the synthesis of the desired extension product, that is, to be able to anneal with the desired template strand in a manner sufficient to provide the 3' hydroxyl moiety of the primer in appropriate juxtaposition for use in the initiation of synthesis by a polymerase or similar enzyme. It is not required that the primer sequence represent an exact complement of the desired template. For example, a non-complementary nucleotide sequence may be attached to the 5' end of an otherwise complementary primer. Alternatively, non-complementary bases may be interspersed within the oligonucleotide primer sequence, provided that the primer sequence has sufficient complementarity with the sequence of the desired template strand to functionally provide a template-primer complex for the synthesis of the extension product.

The term "percent identical" is used herein with reference to comparisons among nucleic acid or amino acid sequences. Nucleic acid and amino acid sequences are often compared using computer programs that align sequences of nucleic or amino acids, thus defining the differences between the two. For purposes of this invention, comparisons of nucleic acid sequences are performed using the GCG Wisconsin Package version 9.1, available from the Genetics Computer Group in Madison, Wisconsin. For convenience, the default parameters (gap creation penalty = 12, gap extension penalty = 4) specified by that program are intended for use herein to compare sequence identity. Alternately, the Blastn 2.0 program provided by the National Center for Biotechnology Information (at http://www.ncbi.nlm.nih.gov/blast/; Altschul et al., 1990, J Mol Biol 215:403-410) using a gapped alignment with default parameters, may be used to determine the level of identity and similarity between nucleic acid sequences and amino acid sequences.
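To make the notion of percent identity concrete, the following sketch computes it for two sequences that have already been aligned. It is not the procedure used by the GCG Wisconsin Package or Blastn: the function name is hypothetical, the alignment step itself is omitted, and the choice to exclude gapped columns from the denominator is an assumption (programs differ on this convention).

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of identical positions among the ungapped columns of a pairwise alignment."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        # Skip any column in which either sequence has a gap.
        if a == gap or b == gap:
            continue
        compared += 1
        if a.upper() == b.upper():
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Toy nucleotide alignment: 4 identical positions out of 5 ungapped columns.
example = percent_identity("ACGT-A", "ACCTTA")
```

Reported identity values can shift depending on the alignment parameters and on whether gaps count toward the denominator, so figures from different programs are not always directly comparable.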

The term "expression control sequence", as used herein, refers to a polynucleotide sequence that regulates expression of a polypeptide coded for by a polynucleotide to which it is functionally ("operably") linked. Expression can be regulated at the level of the mRNA or polypeptide. Thus, the term expression control sequence includes mRNA-related elements and protein-related elements. Such elements include promoters, domains within promoters, upstream elements, enhancers, elements that confer tissue or cell specificity, response elements, ribosome binding sequences, transcriptional terminators, etc. Suitable expression control sequences that can function in insect cells will be evident to the skilled worker. In some embodiments, it is desirable that the expression control sequence comprises a constitutive promoter. Among the many suitable "strong" promoters which can be used are the baculovirus promoters for the p10, polyhedrin (polh), p6.9, capsid, and cathepsin-like genes. Among the many "weak" promoters which are suitable are the baculovirus promoters for the ie1, ie2, ie0, et1, 39K (aka pp31), and gp64 genes. Other suitable strong constitutive promoters include the B. mori actin gene promoter; D. melanogaster hsp70, actin, α-1-tubulin or ubiquitin gene promoters; RSV or MMTV promoters; copia promoter; gypsy promoter; and the cytomegalovirus IE gene promoter. If it is desired to increase the amount of gene expression from a weak promoter, enhancer elements, such as the baculovirus enhancer element, hr5, may be used in conjunction with the promoter.

In some embodiments, the expression control sequence comprises a tissue- or organ-specific promoter. Many such expression control sequences will be evident to the skilled worker. In general, the enzymes involved in N-glycan processing of the invention are required in catalytic amounts. Therefore, in one embodiment of the invention, much lower amounts of these enzymes are present than of the heterologous polypeptides of interest, which are generated in large amounts, glycosylated, and harvested for further use. For example, a suitable molar ratio of heterologous protein produced to enzyme involved in N-glycan processing may be greater than about 100:1.

Alternatively, the enzymes involved in N-glycan processing may be in comparable (e.g., approximately stoichiometric) amounts to the heterologous glycoprotein to be processed. A skilled worker can readily select suitable promoters and/or conditions to express suitable amounts of the enzymes involved in N-glycan processing (e.g., amounts which are sufficient, i.e., effective, to process the N-glycans of relatively high amounts of a protein of interest to the desired extent). Furthermore, a skilled worker can readily ensure that the enzymes involved in N-glycan processing are present in sufficient local concentrations, and at an optimal time during insect propagation.

In some embodiments of the invention, as is discussed in more detail elsewhere herein, it is desirable that an expression control sequence is regulatable (e.g., comprises an inducible promoter and/or enhancer element). Suitable regulatable promoters include, e.g., Drosophila or other hsp70 promoters, the Drosophila metallothionein promoter, an ecdysone-regulated promoter, the Saccharomyces cerevisiae Gal4/UAS system, and other well-known inducible systems. A Tet-regulatable molecular switch may be used in conjunction with any constitutive promoter, such as those described elsewhere herein (e.g., in conjunction with the CMV-IE promoter, or baculovirus promoters). Another type of inducible promoter is a baculovirus late or very late promoter that is only activated following infection by a baculovirus.

Methods for designing and preparing constructs suitable for generating transgenic insect cell lines or insects (or vectors for infection of an insect) are conventional. For these methods, as well as other molecular biology procedures related to the invention, see, e.g., Sambrook et al., Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, N.Y. (1989); Wu et al., Methods in Gene Biotechnology (CRC Press, New York, NY, 1997); Recombinant Gene Expression Protocols, in Methods in Molecular Biology, Vol. 62 (Tuan, Ed., Humana Press, Totowa, NJ, 1997); and Current Protocols in Molecular Biology (Ausubel et al., Eds.), John Wiley & Sons, NY (1994-1999). Some suitable methods are described elsewhere herein.

A variety of immortalized lepidopteran insect cell lines are suitable for transformation by the vectors/constructs of the invention. Among these are Sf9 (Vaughn et al. (1977) In Vitro 13, 213-217), Tn 5B1-4 (High Five; Wickham et al. (1992) Biotech. Progr. 8, 391-6), expresSf+ (Protein Sciences Corporation), and BmN (Bm-N4; Maeda et al. (1985) Nature 315, 592-594) cells.

Methods for generating transgenic insect cell lines are conventional. For example, in one embodiment, one or more genes to be introduced are placed under the control of a suitable expression control sequence and are cloned into one or more plasmid vectors. These vectors are then mixed with a vector encoding a selectable marker under the control of a suitable expression control sequence. The DNA mixture is then introduced into the parental insect cell line (e.g., by calcium phosphate-mediated transfection), and the transgene(s) will integrate by non-homologous recombination into the insect cell genome. Transformed cells are selected using an appropriate antibiotic (e.g., neomycin, hygromycin, or zeocin, among others), cloned by colony formation or limiting dilution, and clones expressing the unselected genes of interest are identified using various methods, including RNA dot blot assays, lectin staining assays, or functional assays. This general approach was first described in 1990 (Jarvis et al. (1990) Bio/Technology 8, 950-955) and has been reviewed recently (Harrison, R.L. and Jarvis, D.L. (2007) Transforming lepidopteran insect cells for improved protein processing. In D.W. Murhammer (Ed.), Methods in Molecular Biology: Baculovirus Expression Protocols. Humana Press, Clifton, NJ. Methods Mol. Biol. 388, 3-22).

Methods for generating transgenic insects are conventional. For example, in one embodiment, one or more genes to be introduced are placed under the control of a suitable expression control sequence, and are cloned into a vector, such as a viral vector (e.g., an attenuated baculovirus vector, or a non-permissive viral vector that is not infective for the particular insect of interest). The sequences to be introduced into the insect are flanked by genomic sequences from the insect. The construct is then introduced into an insect egg (e.g., by microinjection), and the transgene(s) then integrate by homologous recombination of the flanking sequences into comparable sequences in the insect genome.

In another embodiment, the vector is a transposon-based vector. One form of such transposon-based vectors is a viral vector (such as those described above) that further comprises inverted terminal repeats of a suitable transposon, between which the transgene of interest is cloned. One or more genes of interest, under the control of suitable expression control sequence(s), are cloned into the transposon-based vector. In some systems, the transposon-based vector carries its own transposase. However, generally, the transposon-based vector does not encode a suitable transposase. In this case, the vector is co-transfected into an insect (e.g., an insect larva) with a helper virus or plasmid that provides a transposase. The recombinant vector (along with, generally, a helper) is introduced by conventional methods (such as microinjection) into an egg or early embryo, and the transgene(s) become integrated at a transposon site (such as sequences corresponding to the inverted terminal repeat of the transposon) in the insect genome.

Suitable types of transposon-based vectors will be evident to the skilled worker. These include, e.g., Minos, mariner, Hermes, Sleeping Beauty, and piggyBac.

In a preferred embodiment, the vector is a piggyBac vector. TTAA-specific, short repeat elements feature in a group of transposons (Class II mobile elements) that have similar structures and movement properties. A typical piggyBac vector (formerly IFP2) is the most extensively studied of these insertion elements. piggyBac is 2.4 kb long and terminates in 13 bp perfect inverted repeats, with additional internal 19 bp inverted repeats located asymmetrically with respect to the ends (Cary et al. (1989) Virology 172, 156-69). A piggyBac vector may encode a trans-acting transposase that facilitates its own movement; alternatively, these sequences can be deleted and this function can be supplied by a helper plasmid or virus. Non-essential genes have been deleted from piggyBac, allowing for the cloning of inserts as large as about 15 kb into certain piggyBac vectors. This allows, for example, for the insertion of about six or seven genes with their expression control sequences. Thus, a collection of enzymes involved in N-glycan processing, marker proteins, or the like, can be introduced together via a single transposon vector, into a single site in an insect genome. Several piggyBac vectors have been developed for insect transgenesis. Two particularly useful constructs, defined as minimal constructs for the movement of piggyBac vectored sequences, were developed by analysis of deletion mutations both within and outside of the boundaries of the transposon (Li et al. (2001) Mol. Genet. Genomics 266, 190-8). Using constructs such as these, it is possible to increase the amount of genetic material mobilized by the piggyBac transposase by minimizing the size of the vector. The minimal requirements for movement include the 5' and 3' terminal repeat domains and attendant TTAA target sequences.

Nearly all of the internal domain may be removed, although more recent data indicate that some of this region may be required for efficient translocation of the mobilized sequences into the genome of the insect. In addition, a minimum of 50 bases separating the TTAA target sites of the element is required for efficient mobilization (Li et al. (2001), supra). piggyBac can transpose in insect cells while carrying a marker gene, and movement of the piggyBac element can occur in cells from lepidopteran species distantly related to the species from which it was originally isolated. piggyBac has been shown to be capable of transforming Drosophila melanogaster, Anastrepha suspensa, Bactrocera dorsalis, Bombyx mori, Pectinophora gossypiella, Tribolium castaneum, and several mosquito species. At least three lepidopteran species, Pectinophora gossypiella, Trichoplusia ni and Bombyx mori, have been successfully transformed by the piggyBac element.
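Because piggyBac integrates at TTAA target sequences, candidate target sites in a nucleotide sequence can be located with a simple scan. The following Python sketch is provided for illustration only; the function name and the example sequence are hypothetical and do not correspond to any sequence disclosed herein.

```python
def find_ttaa_sites(sequence: str) -> list[int]:
    """Return the 0-based start position of every TTAA tetranucleotide."""
    seq = sequence.upper()
    sites = []
    start = seq.find("TTAA")
    while start != -1:
        sites.append(start)
        # Resume the search one base downstream of the previous hit.
        start = seq.find("TTAA", start + 1)
    return sites


# Hypothetical sequence used only to exercise the function.
example = "GCGTTAACCGATTAAGGC"
print(find_ttaa_sites(example))  # [3, 11]
```

TTAA is its own reverse complement, so scanning one strand is sufficient to find sites on both strands.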

Generally, a helper virus or plasmid that expresses a transposase is co-introduced with the transposon-based vector as above. Expression of the transposase is determined by the choice of promoter for the insect system being tested. Toward that end, several promoter-driven helper constructs that are useful for lepidopteran transformation have been constructed, including constructs driven by the Drosophila hsp70 promoter, the baculovirus ie1 promoter, and the Drosophila Actin 5C promoter.

For further guidance on the use of baculovirus-based vectors, see, e.g., WO01/29204 and US Patent 6,551,825. Other recent references that discuss piggyBac vectors and methods for generating transgenic insects using them include, e.g., Handler et al. (1998) Proc Natl Acad Sci 95, 7520-7525; Fraser, MJ (2001) The TTAA-specific family of transposable elements. In: Insect transgenesis: Methods and Applications. James AA and Handler AH, Eds. CRC Press, Orlando, FL; Lobo et al. (1999) Mol. Gen. Genetics 261, 803-810; Grossman et al. (2000) Insect Biochem. Mol. Biol. 30, 909-914; Lobo et al. (2001) Mol. Gen. Genom. 265, 66-71; Lorenzen et al. (2003) Insect Mol Biol. 12, 433-40; Hacker et al. (2003) Proc Natl Acad Sci U S A. 100, 7720-5; Sumitani et al. (2003) Insect Biochem Mol Biol. 33, 449-58; Horn et al. (2003) Genetics 163, 647-61; and Tomita et al. (2003) Nat Biotechnol. 21, 52-6.

Methods for introducing constructs into an embryo to generate a transgenic insect (e.g., by microinjection) are conventional. Survivorship is usually quite high (up to 75%) for microinjected embryos. In general, preblastoderm eggs are pierced with a fine glass capillary holding a solution of the plasmid DNA and/or the recombinant virus. G0 larvae hatched from the virus-injected eggs are then screened for expression of the gene of interest. Breeding transgenic G1s with normal insects yields transgenic offspring according to the rules of Mendelian inheritance. Once the transgene(s) are stably integrated into the genome of an insect egg or early embryo, conventional methods can be used to generate a transgenic insect in which the transgene(s) are present in all of the insect's somatic and germ cells. When a subset of the complete set of enzymes involved in N-glycan processing is present in a transgenic insect, other transposon-based vectors, which express different subsets of the genes encoding enzymes involved in N-glycan processing, can be introduced sequentially into the insect genome, and transgenic insects can then be generated. In another embodiment, when different subsets of the complete set of enzymes involved in N-glycan processing are present in two or more individual transgenic insects, these insects can be genetically crossed to produce a transgenic insect that expresses a larger subset, or a complete set, of the genes encoding enzymes involved in N-glycan processing.

In some embodiments, the transgenic insects are heterozygous for the modifying enzyme genes. For example, when potentially toxic genes are expressed constitutively, it may be advantageous for the insects to be heterozygous, to limit the amount of the enzyme that is produced. In other embodiments, the insects are homozygous for the transgenes. Methods for producing homozygous transgenic insects (e.g., using suitable backcrosses) are conventional.
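The expected genotype frequencies from crossing heterozygous, homozygous, or non-transgenic insects can be tabulated for a transgene integrated at a single locus. The following Python sketch is illustrative only and assumes a single autosomal insertion that segregates in simple Mendelian fashion; the allele symbols ("T" for the transgene, "+" for the wild-type site) and the function name are hypothetical.

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def cross(parent_a: str, parent_b: str) -> dict[str, Fraction]:
    """Genotype frequencies among offspring of a single-locus cross.

    Each parent is a two-character string of alleles, e.g. "T+" for a
    heterozygote, "TT" for a homozygote, and "++" for a normal insect.
    """
    # Each offspring inherits one allele from each parent; sorting makes
    # "T+" and "+T" count as the same genotype (written "+T").
    counts = Counter("".join(sorted(pair)) for pair in product(parent_a, parent_b))
    total = sum(counts.values())
    return {genotype: Fraction(n, total) for genotype, n in counts.items()}


# Heterozygous transgenic insect bred with a normal insect.
print(cross("T+", "++"))
# Two heterozygotes bred together to obtain homozygous offspring.
print(cross("T+", "T+"))
```

Under these assumptions, half of the offspring of a heterozygote bred with a normal insect carry the transgene, and one quarter of the offspring of two heterozygotes are homozygous for it.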

Another embodiment of the invention is an isolated cell, or progeny thereof, derived from a transgenic insect of the invention. Suitable cells include isolated germ line cells, and cells that can be used for the in vitro production of a glycoprotein exhibiting a partial or complete pattern of mammalian glycosylation. Methods for obtaining and propagating cells from a transgenic insect, and using them (e.g., to generate more insects, or to generate glycosylated proteins) are conventional.

The transgenic insects discussed above can be used to produce glycoproteins of interest that exhibit partial or complete patterns of mammalian glycosylation. For example, the insects can be used in methods for glycosylating polypeptides in a mammalian (human) glycosylation pattern.

The coding sequences described herein may be operably linked to an expression control sequence from the virus, itself, or to another suitable expression control sequence. Suitable virus-based vectors include, e.g., baculovirus vectors (such as vectors based on Autographa californica NPV, Orgyia pseudotsugata NPV, Lymantria dispar NPV, Bombyx mori NPV, Trichoplusia ni NPV, Spodoptera exigua NPV, Heliothis zea NPV, Galleria mellonella NPV, Anagrapha falcifera NPV, Trichoplusia ni SNPV); retroviral vectors; and viral vectors that comprise transposon recognition sequences (e.g., piggyBac vectors); etc. As discussed above, baculovirus-based vectors have been generated (or can be generated without undue experimentation) that allow the cloning of large numbers of inserts, at any of a variety of cloning sites in the viral vector. Thus, more than one heterologous polypeptide may be introduced together into a transgenic insect cell or insect of the invention. The viral vector can be introduced into an insect cell or insect by conventional methods, such as by in vitro inoculation (insect cells) or oral ingestion (insect larvae).

In one embodiment, the baculovirus replicates until the host insect is killed. The insect cell or insect lives long enough to produce large amounts of the glycosylated polypeptide of interest. In another embodiment, a baculovirus is used that is attenuated or non-permissive for the host. In this case, the host is not killed by replication of the baculovirus, itself (although the host may be damaged by the expression of the enzymes involved in N-glycan processing and/or the heterologous protein of interest).

In another embodiment, sequences encoding one or more recombinant proteins of interest, operably linked to an expression control sequence, are cloned into a suitable transposon-based vector (such as a piggyBac vector). Like the baculovirus vectors discussed above, transposon-based vectors can carry large inserts, so more than one heterologous polypeptide may be introduced together into a transgenic insect of the invention. Transposon-based vectors may on occasion insert into the DNA of somatic cells, and thus be stably expressed for relatively long periods of time.

In another embodiment, sequences encoding one or more recombinant proteins of interest, operably linked to an expression control sequence, are cloned into a retrovirus vector, or any other suitable virus vector. Such a construct may insert into the DNA of somatic cells, and thus be stably expressed for relatively long periods of time.

Finally, in certain instances it may be desirable to down-regulate expression and synthesis of the N-acetylglucosaminidase encoded by the genes described in this invention. Accordingly, the invention also provides short double-stranded RNA sequences which hybridize to SEQ ID NO: 1 and function to down-regulate the expression of the same in insect cells by an RNAi-dependent mechanism.

The following materials and methods are provided to facilitate the practice of the present invention.

Cells and cell culture — Sf9 cells, which are a subclone of the IPLB-Sf21-AE cell line derived from S. frugiperda ovaries (14), were routinely maintained as shake flask cultures in either TNM-FH medium containing 10% fetal bovine serum (HyClone, Logan, UT) or ESF 921 serum-free medium (Expression Systems, CA), as described previously (18).

Molecular cloning of an fdl gene homolog from Sf9 cells — The A. aegypti, A. gambiae, A. mellifera, B. mori, D. pseudoobscura and T. castaneum genomic databases were searched through the NCBI website using tBLASTn (19) with the derived amino acid sequence of Dm-FDL isoform C (Accession No. NM_165909) as the query. These searches identified exons from each species that encoded fragments of putative processing β-N-acetylglucosaminidases. These were joined in silico using an online splice site prediction algorithm available through the NetGene2 Server hosted by the Technical University of Denmark (20) to obtain contiguous open reading frames from each species. The predicted amino acid sequences were then aligned using CLUSTALX version 1.83 (21) with the default settings. Highly conserved amino acid sequences were visually identified and used to design degenerate oligonucleotide primers (Table 1), which were then used for polymerase chain reactions (PCRs; 22) with both cDNA and genomic DNA prepared from Sf9 cells as the templates. Genomic DNA was isolated from log phase cultures of uninfected Sf9 cells by a standard method (23). Total RNA was isolated from a log phase culture of uninfected Sf9 cells using the TriReagent (Molecular Research Center, Cincinnati, OH) according to the manufacturer's protocol. cDNA was prepared from 5 μg of GeneRacer™ oligo-dT-primed total RNA using Superscript™ III reverse transcriptase with the commercial GeneRacer™ kit (Invitrogen, Carlsbad, CA) according to the manufacturer's protocol and diluted to a final volume of 50 μL.

The PCRs were performed in a total volume of 50 μL containing the manufacturer's high fidelity (HF) buffer plus 0.2 mM of each dNTP, 2 U of Phusion™ DNA polymerase (Promega, Madison, WI), 1 μM of each degenerate primer, and either approximately 100 ng of Sf9 genomic DNA or 2 μL of the cDNA preparation described above. The reactions were incubated for 2 min at 98°C, then cycled 14 times using (i) 20 sec at 98°C, (ii) 20 sec at 76 to 62°C (with a decreasing temperature gradient of 1°C per cycle), and (iii) 30 sec at 72°C. The reactions were cycled another 30 times using (i) 20 sec at 98°C, (ii) 20 sec at 62°C, and (iii) 20 sec at 72°C, and finally incubated for 5 min at 72°C in a GeneAmp Model 2400 thermal cycler (Eppendorf, Foster City, CA). The spent reactions were separated on 1.2% agarose gels and specific amplification products of about the expected size (420 bp) were recovered from the gel, purified using the QiaQuick™ Gel Extraction Kit (Qiagen, Valencia, CA), and directly sequenced using the degenerate PCR primers specified above. The resulting nucleotide sequences were assembled using ContigExpress, a component of Vector NTI Advance 10.3.0 (Invitrogen). These data were used to design gene-specific primers for primary and nested 5'- and 3'-RACE reactions, which were performed to determine the full-length, putative Sf-fdl gene sequence.
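When degenerate primers are designed from conserved amino acid sequences, the number of distinct oligonucleotides in the primer pool depends on how many codons encode each residue. The following Python sketch, which is illustrative only, computes that degeneracy under the standard genetic code; the function name and the example peptides are hypothetical and are not the conserved sequences used to design the primers of Table 1.

```python
from math import prod

# Number of codons per amino acid in the standard genetic code
# (61 sense codons in total).
CODONS_PER_AA = {
    "M": 1, "W": 1,
    "C": 2, "D": 2, "E": 2, "F": 2, "H": 2, "K": 2, "N": 2, "Q": 2, "Y": 2,
    "I": 3,
    "A": 4, "G": 4, "P": 4, "T": 4, "V": 4,
    "L": 6, "R": 6, "S": 6,
}


def primer_degeneracy(peptide: str) -> int:
    """Number of distinct coding sequences for a peptide (one-letter code)."""
    return prod(CODONS_PER_AA[residue] for residue in peptide.upper())


# Hypothetical peptides: one rich in low-degeneracy residues, one not.
print(primer_degeneracy("WMDFKH"))  # 16
print(primer_degeneracy("LRS"))     # 216
```

Conserved stretches rich in methionine, tryptophan, and two-codon residues yield less degenerate primer pools than stretches containing leucine, arginine, or serine.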

TABLE 1 Primer sequences

5'-RACE — Total RNA was isolated as described above, used for first-strand cDNA synthesis according to the GeneRacer™ protocol, and the resulting 3'- and 5'-anchored first strand cDNA was diluted to 50 μL. Five μL of this cDNA were then used as the template for 5'-RACE reactions with 1.25 U of GoTaq® (Promega) and 200 nM of the SFFDLASP1 (Table 1) and GeneRacer™ 5' primers in a final volume of 50 μL of GoTaq® buffer. The reactions were incubated for 4 min at 95°C, cycled 12 times using (i) 30 sec at 95°C, (ii) 30 sec at 72 to 61°C (with a decreasing temperature gradient of 1°C per cycle), and (iii) 120 sec at 72°C. The reactions were cycled another 30 times using (i) 30 sec at 95°C, (ii) 20 sec at 61°C, and (iii) 120 sec at 72°C, and finally incubated for 5 min at 72°C. One μL of the spent 5'-RACE reaction was used as the template for a nested PCR with Promega's GoTaq® Green Mastermix and 200 nM of the SFFDLASP2 (Table 1) and GeneRacer™ 5'-nested primers in a total volume of 50 μL. These reactions were incubated for 90 sec at 95°C, cycled 25 times using (i) 30 sec at 95°C, (ii) 30 sec at 63°C, and (iii) 120 sec at 72°C, and finally incubated for 5 min at 72°C. The spent reactions were analyzed on a 1% agarose gel and an amplification product of approximately 1.4 kb in size was purified and used as the template for nested PCRs under the same conditions used for the primary PCRs, except the nested reactions included the SFFDLASP3 (Table 1) and GeneRacer™ 5'-nested primers and the annealing temperature was 65°C. The spent reactions were analyzed on a 0.9% agarose gel and the 1.4 kb amplification product was purified and directly sequenced using the SFFDLASP3, SFFDLASP4 (Table 1), and GeneRacer™ 5'-nested primers.
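The touchdown cycling used in the primary 5'-RACE reaction (annealing at 72°C decreasing by 1°C per cycle to 61°C over 12 cycles, followed by 30 cycles at 61°C) can be written out cycle by cycle. The following Python sketch is illustrative only; the function name and parameters are hypothetical and do not represent thermal cycler software.

```python
def touchdown_schedule(start_c: int, end_c: int, plateau_cycles: int,
                       step_c: int = 1) -> list[int]:
    """Annealing temperature (°C) for each cycle of a touchdown PCR.

    The temperature falls by step_c per cycle from start_c to end_c
    (inclusive), then is held at end_c for plateau_cycles further cycles.
    """
    if step_c <= 0 or start_c < end_c:
        raise ValueError("need step_c > 0 and start_c >= end_c")
    ramp = list(range(start_c, end_c - 1, -step_c))
    return ramp + [end_c] * plateau_cycles


# Values taken from the primary 5'-RACE reaction described above.
schedule = touchdown_schedule(72, 61, plateau_cycles=30)
print(len(schedule))   # 42 cycles in total
print(schedule[:12])   # the 12 touchdown cycles, 72 down to 61
```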

3'-RACE — The cDNA used for the 3'-RACE reactions was prepared from GeneRacer™ Oligo dT-primed total RNA, as described previously (24) and diluted to a final volume of 100 μL. Two μL of this cDNA preparation were then used as the template for a PCR with 1 U of Phusion™ DNA polymerase, 200 nM of the SFFDLSP3 and GeneRacer™ 3' primers, 1 M betaine, and 5% DMSO in a final volume of 50 μL of Phusion™ GC buffer. These reactions were incubated for 3 min at 98°C, cycled 45 times using (i) 30 sec at 98°C, (ii) 30 sec at 68°C for the first five cycles, 63°C for the next five cycles, 58°C for the next five cycles, and 52°C for the final 30 cycles, (iii) 60 sec at 72°C, and 20 sec at 75°C, and finally incubated for 2 min at 72°C. The spent reaction was analyzed on a 1% agarose gel, and the 1.2 kb amplification product was purified and used as the template for nested PCRs with 0.8 U of Phusion™ DNA polymerase, 200 nM of the SFFDLSP4 and GeneRacer™ 3'-nested primers, and 1 M betaine in a total final volume of 50 μL of Phusion™ GC buffer. These reactions were incubated for 2 min at 98°C, cycled 45 times using (i) 20 sec at 98°C, (ii) 20 sec at 70°C for the first five cycles, 65°C for the next five cycles, 60°C for the next five cycles, and 57°C for the final 30 cycles, (iii) 40 sec at 72°C, and finally incubated for 2 min at 72°C. The 1.0 kb amplification product was purified and directly sequenced using the primer SFFDLSP4.

Amplification of the full-length ORF from cDNA and genomic DNA — Sf9 cDNA was simultaneously produced and primed with the Sf-fdl gene-specific primer SFFDLCDNAASP (Table 1) using Superscript™ III reverse transcriptase (Invitrogen) according to the method of Shi et al. (24). Either 1.0 μL of this cDNA preparation or approximately 100 ng of Sf9 genomic DNA was then used as the template for PCRs containing 0.5 U of Phusion™ DNA polymerase, 1 M betaine, 0.2 mM of each dNTP and 200 nM of the SFFDLCDNAASP and SFFDLFL50SP primers (Table 1) in Phusion™ GC buffer. These reactions were incubated for 2 min at 98°C, cycled 40 times using (i) 20 sec at 98°C, (ii) 20 sec at 63°C for the first five cycles, 58°C for the next five cycles, and 55°C for the final 30 cycles, (iii) 60 sec at 72°C, and finally incubated for 2 min at 72°C. The amplification products were purified on 1% agarose gels, recovered, and directly sequenced using internal primers.
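The step-down annealing programs described above hold each annealing temperature for a block of cycles rather than lowering it every cycle. The following Python sketch, which is illustrative only, expands such blocks into a per-cycle list and sums the programmed hold times for the full-length ORF amplification; the function names are hypothetical, and the time estimate ignores the ramping time between temperatures, which depends on the instrument.

```python
def expand_blocks(blocks: list[tuple[int, int]]) -> list[int]:
    """Expand (annealing temperature in °C, number of cycles) blocks
    into one annealing temperature per cycle."""
    return [temp for temp, cycles in blocks for _ in range(cycles)]


def programmed_seconds(initial_s: int, per_cycle_s: int, cycles: int,
                       final_s: int) -> int:
    """Total programmed hold time, excluding temperature ramping."""
    return initial_s + per_cycle_s * cycles + final_s


# Full-length ORF amplification: 63°C x 5, 58°C x 5, 55°C x 30 cycles.
orf_annealing = expand_blocks([(63, 5), (58, 5), (55, 30)])
print(len(orf_annealing))  # 40 cycles

# 2 min at 98°C, then 20 s + 20 s + 60 s per cycle, then 2 min at 72°C.
total = programmed_seconds(120, 20 + 20 + 60, len(orf_annealing), 120)
print(total)  # 4240 seconds of programmed hold time
```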

Construction of baculovirus transfer plasmids encoding native β-N-acetylglucosaminidases — Baculovirus transfer plasmids encoding full-length, untagged Dm-FDL or Sf-FDL were produced by using PCR to amplify the appropriate nucleotide sequences. The Sf-FDL coding sequence was assembled by producing two PCR amplimers with partially overlapping sequences, isolating the products, and then using them as templates for a third PCR designed to produce an amplimer encoding the full-length Sf-FDL protein. Briefly, the 3'-end of the Sf-fdl open reading frame was amplified from Sf9 cDNA prepared as described above in a PCR with 0.3 U of Phusion™ DNA polymerase, 0.2 mM of each dNTP, 1 M betaine, and 0.67 μM of the SFFDLSP1 and SFFDLFL31ASP primers (Table 1) in Phusion™ GC buffer. The reactions were incubated for 2 min at 98°C, cycled 45 times using (i) 20 sec at 98°C, (ii) 20 sec at 67°C for the first five cycles, 62°C for the next five cycles, 57°C for the next five cycles, and 54°C for the final 30 cycles, (iii) 40 sec at 72°C, and finally incubated for 2 min at 72°C. One μL of the spent reaction was used as the template for a nested PCR under essentially the same conditions, except the primers were SFFDLSP2 and SFFDLFL31ASP (Table 1). The spent secondary PCR was analyzed on a 1.2% agarose gel and the amplification product with the expected size was excised and purified as described above. The 5'-end of the Sf-fdl open reading frame was amplified using 1.0 μL of the spent nested 5'-RACE reaction described above as the template for a PCR with 0.5 U of Phusion™ DNA polymerase, 0.2 mM of each dNTP, 1 M betaine, and 1 μM of the SFFDLASP3 and SFFDLFL51SP primers (Table 1) in Phusion™ GC buffer. This reaction was incubated for 1 min at 98°C, cycled 13 times using (i) 30 sec at 98°C, (ii) 20 sec at 65 to 53°C (with a decreasing temperature gradient of 1°C per cycle), and (iii) 60 sec at 72°C, cycled another 30 times using (i) 30 sec at 98°C, (ii) 20 sec at 52°C, and (iii) 60 sec at 72°C, and finally incubated for 2 min at 72°C. The spent reaction was analyzed on a 1.0% agarose gel and the amplification product with the expected size was excised and purified as described above. Finally, the purified 3'- and 5'-ends of the predicted Sf-fdl ORF were combined in a PCR with 0.5 U of Phusion™ DNA polymerase, 0.2 mM of each dNTP, 1 M betaine, and 1 μM of the SFFDLFL5N2SP and SFFDLFL3N2ASP primers (Table 1) in Phusion™ GC buffer. This reaction was incubated for 1 min at 98°C, cycled four times using (i) 30 sec at 98°C and (ii) 90 sec at 72°C, cycled another 25 times using (i) 30 sec at 98°C, (ii) 20 sec at 52°C, and (iii) 80 sec at 72°C, and finally incubated for 2 min at 72°C. The spent reaction was analyzed on a 1.0% agarose gel, and the amplification product of the expected size was excised, purified and cloned into pENTR™/D-TOPO® according to the manufacturer's protocol. Sequencing revealed two clones that each had single, but different, non-synonymous mutations, and these were used to assemble a plasmid designated pENTR™/D-TOPO®-Sf-fdl-FL encoding the full-length, wild type Sf-FDL protein.
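The assembly of the full-length coding sequence from two partially overlapping amplimers can be modeled in silico by joining the fragments at their shared sequence. The following Python sketch is illustrative only; the function name, the minimum overlap length, and the example fragments are hypothetical and are not derived from the Sf-fdl sequence.

```python
def join_by_overlap(five_prime: str, three_prime: str,
                    min_overlap: int = 8) -> str:
    """Join two same-strand fragments at the longest sequence shared by
    the end of the first and the start of the second."""
    longest = min(len(five_prime), len(three_prime))
    for size in range(longest, min_overlap - 1, -1):
        if five_prime.endswith(three_prime[:size]):
            # Keep the overlap once, then append the rest of the 3' fragment.
            return five_prime + three_prime[size:]
    raise ValueError("fragments share no overlap of the required length")


# Hypothetical fragments sharing the 10-base overlap GCTTGGATCC.
upstream = "ATGGCCAAGCTTGGATCC"
downstream = "GCTTGGATCCTTTAAGTGA"
print(join_by_overlap(upstream, downstream))
# ATGGCCAAGCTTGGATCCTTTAAGTGA
```

This models only the expected product sequence; it does not predict whether a given overlap is long enough to prime extension under particular reaction conditions.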

The Dm-fdl open reading frame was amplified from 50 ng of a plasmid designated pIEBac-CG8824Myc in a PCR with 2 U of Phusion™ DNA polymerase, 0.2 mM of each dNTP, and 0.1 μg of the FDLFLSP and FDLFLASP primers (Table 1) in Phusion™ HF buffer. This plasmid encodes the Drosophila melanogaster fdl gene open reading frame with a c-Myc epitope tag under the transcriptional control of a baculovirus IE1 promoter. See Geisler et al. (2008) J. Biol. Chem. 283, 11330-11339. These reactions were incubated for 1 min at 98°C, cycled 30 times using (i) 20 sec at 98°C, (ii) 20 sec at 55°C, and (iii) 90 sec at 72°C, and finally incubated for 2 min at 72°C. The spent reaction was analyzed on a 0.8% agarose gel, and the amplification product of the expected size was excised, purified and cloned into pENTR™/D-TOPO® according to the manufacturer's protocol. An error-free clone was identified by sequencing and designated pENTR™/D-TOPO®-Dm-fdl-FL.

Construction of baculovirus transfer plasmids encoding GST-tagged β-N-acetylglucosaminidases — Transfer plasmids encoding N-terminally GST-tagged ectodomains of the various β-N-acetylglucosaminidases examined in this study were also produced using PCR-based approaches. Generally, TMpred (25) was used to predict the sequences encoding the ectodomain of each protein, and then these sequences were amplified using primers designed to introduce SmaI and EcoRI sites on their 5'- and 3'-ends, respectively. Thus, each of the resulting PCR products was designed for subsequent directional cloning into the SmaI and EcoRI sites of the baculovirus transfer plasmid pAcSecG2T (BD Biosciences, San Jose, CA), to position the relevant coding sequences downstream and in-frame with the GST coding sequence in this vector.
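Directional cloning with SmaI and EcoRI requires that the coding sequence to be inserted contain no internal recognition site for either enzyme. The following Python sketch is illustrative only; it uses the known recognition sequences of SmaI (CCCGGG) and EcoRI (GAATTC), while the function name and the example sequence are hypothetical.

```python
# Recognition sequences; both are palindromic, so one strand suffices.
RECOGNITION_SITES = {"SmaI": "CCCGGG", "EcoRI": "GAATTC"}


def internal_sites(insert: str) -> dict[str, list[int]]:
    """Return 0-based positions of each recognition site within an insert."""
    seq = insert.upper()
    return {
        enzyme: [i for i in range(len(seq) - len(site) + 1)
                 if seq.startswith(site, i)]
        for enzyme, site in RECOGNITION_SITES.items()
    }


# Hypothetical sequence containing one site for each enzyme.
print(internal_sites("ATGGAATTCAAACCCGGGTTT"))
# {'SmaI': [12], 'EcoRI': [3]}
```

An insert for which every list is empty can be excised intact with both enzymes after the flanking sites have been introduced by the primers.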

The predicted Sf-fdl ectodomain coding sequence was amplified using pENTR™/D-TOPO®-Sf-fdl-FL as the template for a PCR with 0.5 U of Phusion™ DNA polymerase, 0.2 mM of each dNTP, 1 M betaine, 0.2 μM of the SFFDLFL3N2ASP and 10 nM of the SFFDLGST51SP primers (Table 1) in Phusion™ GC buffer. The reaction was incubated for 1 min at 98°C, cycled four times using (i) 20 sec at 98°C, (ii) 20 sec at 58°C, and (iii) 90 sec at 72°C, after which primer SFFDLGST5N2SP was added to 0.2 μM, incubated for 1 min at 98°C, cycled another 30 times using (i) 20 sec at 98°C, (ii) 20 sec at 60°C, and (iii) 90 sec at 72°C, and finally incubated for 2 min at 72°C. The spent reaction was analyzed on a 1.0% agarose gel, and the amplification product of the expected size was excised and purified. The purified amplimer was then treated with 5 U of Taq DNA polymerase (New England Biolabs, Ipswich, MA) for 15 minutes in the presence of 0.2 mM dATP and the manufacturer's standard Taq buffer. The reaction product was cloned into pCR®2.1-TOPO® (Invitrogen) according to the manufacturer's instructions, yielding pCR®2.1-TOPO®-Sf-fdl-SOL. An error-free clone was identified by sequencing and the insert was excised with SmaI and EcoRI, gel-purified, and subcloned into the corresponding sites of pAcSecG2T to produce the transfer plasmid designated pAcSecG2T-Sf-fdl-SOL.

The predicted Dm-fdl ectodomain coding sequence was amplified using pENTR™/D-TOPO®-Dm-fdl-FL as the template for a PCR with 2 U of Phusion™ DNA polymerase, 0.2 mM of each dNTP, and 1 μM of the DMFDLGST3ASP and DMFDLGST5SP primers (Table 1) in Phusion™ HF buffer. The reaction was incubated for 1 min at 98°C, cycled five times using (i) 15 sec at 98°C, (ii) 20 sec at 50°C, and (iii) 75 sec at 72°C, cycled another 30 times using (i) 15 sec at 98°C, (ii) 20 sec at 64°C, and (iii) 75 sec at 72°C, and finally incubated for 2 min at 72°C. The amplimer was subsequently purified, Taq-treated, cloned, sequence-verified and subcloned as described above to produce the intermediate plasmid pCR®2.1-TOPO®-Dm-fdl-SOL and the final baculovirus transfer plasmid, pAcSecG2T-Dm-fdl-SOL.

The predicted Sf-GlcNAcase3/SfHex ectodomain coding sequence was amplified using pENTR™/D-TOPO®-Sf-GlcNAcase3 as the template for a PCR with 2 U of Phusion™ DNA polymerase, 0.2 mM of each dNTP, and 1 μM of the SFGN3GST3ASPB and GN3GST5SPB primers (Table 1) in Phusion™ HF buffer, with cycling conditions identical to those used to generate the Dm-fdl ectodomain amplimer. The resulting product was purified, Taq-treated, cloned into pCR®4-TOPO®, sequence-verified, and subcloned as described above to produce the intermediate plasmid pCR®4-TOPO®-Sf-GlcNAcase3-SOL and the final baculovirus transfer plasmid, pAcSecG2T-Sf-GlcNAcase3-SOL.

Isolation of baculovirus expression vectors — Each of the baculovirus transfer plasmids described in the preceding sections was extracted from large-scale E. coli cultures and purified by isopycnic ultracentrifugation on ethidium bromide-cesium chloride gradients, as described previously (23). The pENTR plasmids were then used to produce recombinant baculoviruses by the BaculoDirect™ method (Invitrogen), according to the manufacturer's protocol. The transfer plasmids encoding GST-tagged β-N-acetylglucosaminidases were used to produce viruses by a standard allelic transplacement method (3,4) with Bsu36I-digested BacPAK6 viral DNA (26) as the target for homologous recombination. Each recombinant baculovirus vector was plaque-purified, amplified in Sf9 cells, and titered by plaque assay on Sf9 cells, as described previously (4). The recombinant viruses encoding various full-length, untagged β-N-acetylglucosaminidase genes were designated AcSfGlcNAcase-3 (18), AcDm-FDL, and AcSf-FDL and those encoding N-terminally GST-tagged ectodomains of the various β-N-acetylglucosaminidases were designated AcGSTSfGlcNAcase-3, AcGSTDm-FDL, and AcGSTSf-FDL, respectively. The parental virus used to produce these viruses, which also served as a negative control for some of the experiments included in this study, was Autographa californica nucleopolyhedrovirus (AcMNPV).

Expression of recombinant proteins in insect cells — Sf9 cells were seeded into 100 mL of ESF 921 medium in 250 mL DeLong flasks (Corning Glass Works, Corning, NY) and allowed to grow to a density of about 1.5-2.0 × 10⁶ cells/mL at 28°C and 125 rpm in a Forma Model 4580 rotary platform shaker-incubator (Forma Scientific, Inc., Marietta, OH). The cells were then infected with the appropriate baculovirus at a multiplicity of infection of about 1 plaque-forming unit/cell and incubated for another 72 h under the same conditions.
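The volume of virus stock needed to infect a culture at a given multiplicity of infection follows from the cell density, the culture volume, and the titer of the stock. The following Python sketch is illustrative only; the function name is hypothetical, and the titer of 1 × 10⁸ plaque-forming units/mL is an assumed value, since no titers are stated herein.

```python
def virus_volume_ml(cells_per_ml: float, culture_ml: float,
                    moi: float, titer_pfu_per_ml: float) -> float:
    """Volume of virus stock (mL) that delivers `moi` pfu per cell."""
    if titer_pfu_per_ml <= 0:
        raise ValueError("titer must be positive")
    total_cells = cells_per_ml * culture_ml
    return moi * total_cells / titer_pfu_per_ml


# 100 mL culture at 2.0 x 10^6 cells/mL infected at 1 pfu/cell,
# with an ASSUMED stock titer of 1 x 10^8 pfu/mL.
print(virus_volume_ml(2.0e6, 100, moi=1, titer_pfu_per_ml=1e8))  # 2.0
```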

Isolation of purified microsomal fractions — The isolation of microsomal fractions from baculovirus-infected Sf9 cells has been described previously (18). Briefly, the cells were Dounce-homogenized and microsomes were isolated by ultracentrifugation onto sucrose cushions. The microsomes were solubilized in β-N-acetylglucosaminidase assay buffer (100 mM citrate-phosphate buffer, pH 6.0) containing 0.5% (v/v) Triton-X-100, total protein concentrations were determined using a commercial bicinchoninic acid assay (Pierce Biotechnology Inc., Rockford, IL), and samples containing equal amounts of total protein were assayed for β-N-acetylglucosaminidase activity, as described below.

For a subset of these experiments, which was designed to examine the nature of the association between the enzyme activity and membranes, freshly prepared microsomes were either held or sonicated on ice with ten pulses from a Branson Model 450 Sonifier (Danbury, CT) adjusted to 50% output. The microsomes were then pelleted by centrifugation for 10 min at top speed in a microcentrifuge (Hermle Model Z180M) and the pellets were resuspended in β-N-acetylglucosaminidase assay buffer. The sonication and centrifugation steps were repeated, the final pellets were resuspended in β-N-acetylglucosaminidase assay buffer containing 0.5% (v/v) Triton-X-100 (Sigma Chemical Company, St. Louis, MO), and then the solubilized microsomes were assayed for β-N-acetylglucosaminidase activity, as described below.

Glutathione affinity chromatography — The GST-tagged ectodomains of the various β-N-acetylglucosaminidases examined in this study were purified from the extracellular fraction of Sf9 cells infected with AcGSTSfGlcNAcase-3, AcGSTDm-FDL, or AcGSTSf-FDL. Briefly, the cells were removed from each infected cell culture at 72 h postinfection by centrifugation for 5 min at 1000 x g, and the supernatant was harvested and ultracentrifuged for 30 min in a Ti45 rotor at 30,000 rpm and 4°C in a Beckman Optima 100XL ultracentrifuge (Beckman Coulter; Fullerton, CA). The resulting supernatant was diluted with an equal volume of ice-cold GST purification buffer (25 mM Tris, 150 mM NaCl, 1 mM EDTA, pH 8.0), solid ammonium sulfate was added to 90% saturation, and the samples were stirred on ice until the salt was fully dissolved. The samples were subsequently ultracentrifuged for 20 min in a Ti45 rotor at 30,000 rpm and 4°C and the resulting pellet was re-dissolved in a minimal volume of GST purification buffer. The samples were then transferred to dialysis tubing with a 50 kDa molecular weight cutoff (Spectrum Medical Industries Inc.; Laguna Hills, CA) and dialyzed overnight at 4°C against 100 volumes of GST purification buffer supplemented with 1 mM phenylmethylsulfonyl fluoride (PMSF). Each GST-tagged protein was then adsorbed to a 1.5 mL bed volume of Glutathione Sepharose 4 Fast Flow (GE Healthcare; Uppsala, Sweden) pre-equilibrated with GST purification buffer in a plugged 20 mL Econo-Pac column (BioRad; Hercules, CA) for one hour at 4°C on a shaking platform. Subsequently, the fluid was drained from the column, the affinity matrix was washed twice with 10 mL of GST purification buffer, and the GST-tagged proteins were eluted with GST purification buffer supplemented with 5 mM reduced glutathione. Fractions were collected and purity was assessed by SDS-PAGE with Coomassie Blue staining, the presence of the GST-tagged proteins was assessed by SDS-PAGE and immunoblotting with a GST-specific antiserum, and enzymatic activity was assessed using p-nitrophenyl-β-N-acetylglucosaminide as the substrate, as described previously (18).

β-N-acetylglucosaminidase activity assays — Enzyme activity assays were performed using either solubilized microsomal fractions or affinity-purified recombinant proteins isolated from baculovirus-infected Sf9 cells. For the microsomal membrane assays, microsomes were prepared and extracted as described above and samples containing equal amounts of total protein were assayed in a total volume of 0.050 mL containing 25 pmol of various pyridylamine (PA)-tagged glycan substrates. The enzymatic activity of the affinity-purified recombinant proteins was assayed under identical conditions, except the amounts of purified protein used for these assays were equalized by immunoblotting, rather than by total protein assays. The substrates used in this study included GlcNAcβ2Manα6(GlcNAcβ2Manα3)Manβ4GlcNAcβ4GlcNAc-PA (GnGn; CalBiochem, La Jolla, CA), GlcNAcβ2Manα6(Manα3)Manβ4GlcNAcβ4GlcNAc-PA (GnM), and Manα6(GlcNAcβ2Manα3)Manβ4GlcNAcβ4GlcNAc-PA (MGn). After being incubated for various times at 37°C, each reaction was diluted to 0.150 mL with β-N-acetylglucosaminidase reaction buffer and the products were analyzed by reverse phase high performance liquid chromatography, as described previously (27). GnGn, GnM, MGn, and Manα6(Manα3)Manβ4GlcNAcβ4GlcNAc-PA (MM) were used as standards for the chromatographic analyses.
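As a simple worked illustration of the assay conditions stated above, 25 pmol of PA-tagged glycan in 0.050 mL corresponds to a substrate concentration of 0.5 μM, which falls three-fold when the reaction is diluted to 0.150 mL for chromatography. The sketch below only restates that unit conversion. The function name is hypothetical.

```python
def concentration_nM(amount_pmol, volume_ml):
    """Convert an amount in pmol and a volume in mL to nM (1 pmol/mL = 1 nM)."""
    return amount_pmol / volume_ml


# Values from the text: 25 pmol substrate, 0.050 mL assay, 0.150 mL after dilution.
assay_nM = concentration_nM(25, 0.050)
diluted_nM = concentration_nM(25, 0.150)

print(f"substrate in assay: {assay_nM / 1000:.2f} uM")
print(f"after dilution for HPLC: {diluted_nM:.0f} nM")
```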

RNA interference — In general, the RNA interference approach used in this study involved transforming Sf9 cells with an immediate early expression plasmid encoding an inverted repeat derived from a portion of the Sf-fdl coding sequence, with the inverted repeat separated by a Drosophila melanogaster white gene intron, as originally described by Lee and Carthew (26). Briefly, the Sf-fdl coding sequence from nucleotides 355 to 855 was amplified using pENTR™/D-TOPO®-Sf-fdl-FL as the template for a PCR with 0.5 U of Phusion™ DNA polymerase, 0.2 mM of each dNTP, and 1 mM each of the SFFDLRNAIASP and SFFDLRNAISP primers (Table 1), which introduced XbaI sites onto both ends, in Phusion™ HF buffer. The reaction was incubated for 30 sec at 98°C, cycled five times using (i) 20 sec at 98°C, (ii) 20 sec at 54°C, and (iii) 30 sec at 72°C, cycled another 30 times using (i) 20 sec at 98°C, (ii) 20 sec at 64°C, and (iii) 30 sec at 72°C, and finally incubated for 2 min at 72°C. The spent reaction was analyzed on a 1.2% agarose gel and the amplification product was excised, purified, Taq-treated, and cloned into pCR4®-TOPO® to produce pCR4®-TOPO®-SfFdlRNAi. An error-free clone was identified by sequencing and the insert was excised with XbaI and gel-purified. One copy of the insert was subcloned in antisense orientation into the AvrII site and a second copy was subcloned in sense orientation into the NheI site of pGEM-WIZ (27; obtained from the Drosophila Genomics Resource Center) to produce pGEM-WIZ-SfFdlRNAi. Finally, the Sf-fdl inverted repeat and white gene intron cassette was excised with SacII and NotI and subcloned into the corresponding sites of pIE1HR3 (29) to produce pIE1HR3SfFdlRNAi. This plasmid was used along with pIE1Neo to co-transfect Sf9 cells using a modified calcium phosphate method, as described previously (4). The transfected cells were then selected and neomycin-resistant clones were isolated by limiting dilution, as described previously (29). The levels of specific, processing β-N-acetylglucosaminidase activity in the parental and transformed cells were finally compared by HPLC analysis of the products obtained by reacting microsomal membrane preparations with GnGn, as described above.

The following examples are provided to illustrate certain embodiments of the invention. These examples are not intended to limit the invention in any way.

EXAMPLE I

CLONING OF A NOVEL PROCESSING β-N-ACETYLGLUCOSAMINIDASE FROM SF9 CELLS AND CHARACTERIZATION THEREOF

Isolation and characterization of an fdl gene homolog from Sf9 cells

Our effort to isolate an fdl gene homolog from Sf9 cells was informed and facilitated by the availability of genome sequence data from several insect species and also by our previous efforts to isolate the gene encoding the processing β-N-acetylglucosaminidase activity from this cell line. tBLASTn analysis of the A. aegypti, A. gambiae, A. mellifera, B. mori, D. pseudoobscura and T. castaneum genomes with the Dm-fdl gene as the query yielded exons from each species encoding peptides phylogenetically related to Dm-fdl (data not shown). We subsequently used a splice site prediction algorithm to join the relevant exons and identify open reading frames encoding at least partial, putative β-N-acetylglucosaminidases from each insect species. Importantly, a CLUSTAL-W alignment of the predicted products of these open reading frames with the Dm-fdl gene product revealed conserved amino acid sequences that were not conserved in the SfGlcNAcase-1 or SfGlcNAcase-3 gene products identified in our previous study (18). These were used to design degenerate oligonucleotides for high fidelity PCRs with Sf9 cDNA or genomic DNA as the templates. These PCRs yielded an amplification product of about the expected size (420 bp), which appeared to be specific because it was not observed in control reactions in which either one of the degenerate oligonucleotides was excluded (data not shown). This product was directly sequenced and the translation product was found to be highly similar to a fragment of the D. melanogaster and putative B. mori FDL proteins (data not shown). Accordingly, we used this sequence to design gene-specific primers for 5'- and 3'-RACE reactions, which yielded the nucleotide sequence of the full length, putative Sf-fdl open reading frame, as detailed above.

The 5'-RACE reactions yielded a specific 1.4 kb amplification product, which overlapped with the sequence of the original degenerate PCR product, extended it by 1161 bp in the 5' direction, and included a potential translational initiation site (data not shown). The 3'-RACE reactions yielded a specific 1.0 kb amplification product, which also overlapped with the sequence of the original degenerate PCR product, extended it by 734 bp in the 3' direction, and encoded a translational termination site. A contiguous nucleotide sequence of 2319 bp was assembled by joining the sequences of the degenerate amplimer, the 5'-RACE product, and the 3'-RACE product. The accuracy of this sequence was confirmed by PCR with gene-specific primers using both Sf9 cDNA and genomic DNA as the templates, followed by direct sequencing of the products, as described in Experimental Procedures.

In silico analysis of the Sf9 cell fdl gene homolog — The full-length Sf-fdl nucleotide sequence and theoretical amino acid sequence of the Sf-FDL polypeptide are shown in Fig. 1. The nucleotide sequence includes a single long open reading frame of 1896 bp, which has a GC content of 69%. The theoretical product of this open reading frame is a polypeptide consisting of 631 amino acids, which has a calculated molecular mass of 70,530 Da and a calculated isoelectric point of 7.18. The theoretical protein also has an N-terminal transmembrane domain (underlined in Fig. 1), which extends from amino acids 25 to 42 with in/out topology, according to the TMHMM and TopPred2 algorithms (29). Thus, the putative S. frugiperda FDL polypeptide appears to be a type II transmembrane protein with a short cytoplasmic tail. This is consistent with the idea that the Sf-fdl gene encodes an N-glycan processing enzyme because all N-glycan processing enzymes characterized to date have been predicted or shown to be transmembrane proteins with type II topology (30-32). The putative Sf9 cell enzyme also includes two potential N-glycosylation sites, which are boxed in the amino acid sequence shown in Fig. 1.
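The relationship between the stated open reading frame length (1896 bp) and polypeptide length (631 amino acids) can be checked with a short calculation. The sketch below assumes, as the two figures imply, that the 1896 bp count includes the termination codon. The function name is hypothetical.

```python
def residues_from_orf(orf_bp, includes_stop=True):
    """Number of encoded amino acids for an open reading frame of orf_bp base pairs."""
    if orf_bp % 3 != 0:
        raise ValueError("an open reading frame must be a multiple of 3 bp")
    codons = orf_bp // 3
    # ASSUMPTION: the stated ORF length counts the stop codon, which encodes no residue.
    return codons - 1 if includes_stop else codons


n_residues = residues_from_orf(1896)
mean_residue_mass = 70530 / n_residues  # calculated mass from the text, in Da

print(f"{n_residues} amino acids, mean residue mass about {mean_residue_mass:.1f} Da")
```

The mean residue mass of roughly 112 Da is in the range usually expected for proteins, which is consistent with the calculated molecular mass reported above.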

A phylogenetic analysis of the predicted Sf-fdl gene product showed that it is related to known hexosaminidases, including the human alpha (Acc. No. NM_000520; 33) and beta (Acc. No. NM_000521; 34) hexosaminidases, as well as SfGlcNAcase-1 (Acc. No. DQ249307; 18) and SfGlcNAcase-3/Sfhex (Acc. No. DQ249309; 17,18), as expected (Fig. 2). Strikingly, however, this analysis also revealed that the predicted Sf-fdl gene product is much more closely related to the Dm-fdl gene product (Acc. No. NM_165909; 16) than to either of the S. frugiperda hexosaminidase gene products, despite the fact that Spodoptera and Drosophila belong to distinct insect Orders, which diverged well over 300 million years ago (35). Together, these results indicated that we had successfully isolated a Dm-fdl gene homolog from Sf9 cells. In addition, the much closer relationship between these two genes and the more distant relationship between the Dm-fdl and SfGlcNAcase-3/Sfhex genes supports the conclusion that this newly-isolated gene encodes the specific, processing β-N-acetylglucosaminidase in Sf9 cells.

Expression and biochemical analysis of the native Sf-fdl gene product — The full-length Sf-FDL coding sequence was subcloned into a baculovirus transfer plasmid and the resulting construct was used to isolate a recombinant baculovirus, AcSf-FDL, that was used, in turn, for high-level expression of the native cDNA product in insect cells. The parental baculovirus (AcMNPV) was used as a negative control and recombinant baculoviruses encoding Dm-FDL (AcDm-FDL) or SfGlcNAcase-3/SfHex (AcSfGlcNAcase-3) were used to directly compare the enzymatic activities of the Sf-fdl, Dm-fdl, and SfGlcNAcase-3/SfHex gene products. Individual Sf9 cell cultures were infected with the appropriate baculoviruses and then crude microsomal membrane fractions were prepared and assayed for enzymatic activity with various PA-tagged glycans as substrates, as described above. The results showed that negative control microsomes from AcMNPV-infected cells had very little effect on GnGn, while microsomes isolated from either AcDm-FDL- or AcSf-FDL-infected cells converted this substrate to GnM (Fig. 3A, top three panels). In contrast, microsomes from AcSfGlcNAcase-3-infected cells converted GnGn to both GnM and MM in parallel assays (Fig. 3A, bottom panel). These results indicated that Sf-FDL and Dm-FDL specifically removed only the terminal N-acetylglucosamine residue from the α3-branch of GnGn, whereas SfGlcNAcase-3/SfHex had a broader spectrum of activity and removed the terminal N-acetylglucosamine residues from both branches of GnGn in these assays.

This was supported by the results of additional assays in which other glycans were used as substrates. Microsomes from cells infected with AcDm-FDL, AcSf-FDL, or AcSfGlcNAcase-3 all removed the terminal N-acetylglucosamine residue from the α3-branch of MGn to produce MM, as expected (Fig. 3B, lower three panels). Significantly, however, microsomes from AcDm-FDL- and AcSf-FDL-infected cells (Fig. 3C, middle two panels) failed to remove the terminal N-acetylglucosamine from the α6-branch of GnM, while those from AcSfGlcNAcase-3-infected cells (Fig. 3C, bottom panel) clearly converted GnM to MM. These results confirmed that Sf-FDL and Dm-FDL are more highly specific enzymes that remove only the terminal N-acetylglucosamine residue from the α3-branch of glycan substrates in these assays. This substrate specificity distinguishes these enzymes from the SfGlcNAcase-3/SfHex gene product, as the latter readily removed the terminal N-acetylglucosamine residues from both branches of biantennary glycan substrates. The relatively broader spectrum of activity observed with the SfGlcNAcase-3/SfHex gene product was underscored by the ability of microsomes from AcSfGlcNAcase-3-infected cells to efficiently convert chitotriose to chitobiose and chitobiose to a PA-tagged N-acetylglucosamine residue (Fig. 3D, bottom panel). In contrast, microsomes from AcDm-FDL- or AcSf-FDL-infected cells (Fig. 3D, middle two panels) converted only small amounts of chitotriose to chitobiose and produced no PA-tagged N-acetylglucosamine. In fact, microsomes from AcMNPV-infected cells (Fig. 3D, top panel) produced nearly as much chitobiose as the microsomes from AcDm-FDL- or AcSf-FDL-infected cells, suggesting that the apparent ability of these latter two enzymes to hydrolyze chitotriose was an artifact resulting from contaminating chitinase activity in the crude microsomal preparations.
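The branch specificities described above can be summarized in a small, purely illustrative model. In this sketch each biantennary substrate is reduced to two flags (terminal N-acetylglucosamine present on the α6-branch, present on the α3-branch), and each enzyme is reduced to the set of branches it can trim. The data structures and function name are hypothetical simplifications, and the model ignores reaction rates, chitooligosaccharide substrates, and the trace side products seen only after prolonged incubation.

```python
# Glycan names follow the text: first letter pair = alpha6 branch, second = alpha3 branch.
NAMES = {
    (True, True): "GnGn",
    (True, False): "GnM",
    (False, True): "MGn",
    (False, False): "MM",
}

# Branches from which each enzyme removed terminal GlcNAc in the assays described.
ENZYMES = {
    "Sf-FDL": {"a3"},
    "Dm-FDL": {"a3"},
    "SfGlcNAcase-3": {"a3", "a6"},
}


def reachable_products(substrate, enzyme):
    """Return the sorted names of all glycans the enzyme can generate from the substrate."""
    start = next(state for state, name in NAMES.items() if name == substrate)
    branches = ENZYMES[enzyme]
    seen = set()
    stack = [start]
    while stack:
        a6, a3 = stack.pop()
        candidates = []
        if a3 and "a3" in branches:
            candidates.append((a6, False))  # trim the alpha3 branch
        if a6 and "a6" in branches:
            candidates.append((False, a3))  # trim the alpha6 branch
        for glycan in candidates:
            if glycan not in seen:
                seen.add(glycan)
                stack.append(glycan)
    return sorted(NAMES[g] for g in seen)


for enzyme in ENZYMES:
    for substrate in ("GnGn", "MGn", "GnM"):
        print(enzyme, substrate, "->", reachable_products(substrate, enzyme) or "no product")
```

In this model, Sf-FDL and Dm-FDL yield only GnM from GnGn and leave GnM untouched, whereas the broader enzyme can reach MM from every substrate.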

The results of the experiments described in this specification show that the Sf-fdl gene product is orthologous to the Dm-fdl gene product, which is responsible for N-glycan processing in D. melanogaster (16), and paralogous to the SfGlcNAcase-3/SfHex gene product, which we previously concluded was more likely to be responsible for N-glycan and/or chitin degradation in S. frugiperda (18). Furthermore, the isolation of a Dm-fdl ortholog from Sf9 cells, its substrate specificity, and the relatively non-specific nature of the SfGlcNAcase-3/SfHex gene product provide compelling evidence that the former, not the latter, is the N-glycan processing enzyme in Sf9 cells.

pH optimum of the Sf-fdl gene product — In their seminal study on the endogenous processing β-N-acetylglucosaminidase activity in microsomal membranes isolated from Sf21 cells, Altmann and coworkers (9) found that it had a pH optimum for GnGn hydrolysis of 6.0. This was consistent with the idea that the activity measured in these assays was involved in N-glycan processing, rather than degradation, because a processing enzyme would be expected to reside in the Golgi apparatus and have a pH optimum around 6.0-6.5, whereas a degradative enzyme would be expected to reside in the lysosomal compartment and have a more acidic optimal pH. We found that the SfGlcNAcase-3/SfHex gene product had a pH optimum of 5.5 and took this as one line of evidence that this enzyme could not account for the processing activity identified by Altmann and coworkers (1995) and was more likely to be involved in N-glycan or chitin degradation (18). Tomiya and coworkers also found that the SfGlcNAcase-3/SfHex gene product had a pH optimum of 5.5, but concluded that it is involved in N-glycan processing in Sf9 cells because it has the same pH optimum as Dm-FDL (16) and it is at least partially active at the higher pH of secretory compartments, such as the trans-Golgi network (17).

Hence, it was of interest to examine the optimal pH of the Sf-fdl gene product. Microsomal membranes were isolated from AcSf-FDL-infected Sf9 cells and assayed for GnGn hydrolysis at various pH values. The results showed that the pH optimum of the Sf-fdl gene product is 6.0 and that it has nearly optimal activity at pH 6.5, as well (Fig. 4). Thus, the pH optimum of the Sf-fdl gene product is identical to that of the processing activity originally identified in microsomal fractions from Sf21 cells by Altmann and coworkers (9). Furthermore, the range of optimal or near-optimal pH values for this enzyme more clearly encompasses the range of pH values found within late secretory pathway compartments, such as the trans-Golgi network, than does that of the SfGlcNAcase-3/SfHex gene product. Thus, these results also support the idea that the Sf-fdl gene product, not the SfGlcNAcase-3/SfHex gene product, is the specific, processing β-N-acetylglucosaminidase in Sf9 cells.

Biochemical analysis of the purified ectodomain of Sf-FDL

Each of the biochemical assays performed to this point in our study had involved the use of crude microsomes isolated from Sf9 cells infected with recombinant baculoviruses encoding the relevant β-N-acetylglucosaminidases. These assays were relevant because they mimicked the original assays of the endogenous processing β-N-acetylglucosaminidase activity in Sf21 cells and provided data on the substrate specificities of full-length, untagged forms of each of the enzymes of interest. However, one criticism of these assays is that they did not involve the use of purified enzymes. To address this issue, we isolated recombinant baculoviruses encoding N-terminally GST-tagged ectodomains of Sf-FDL, Dm-FDL, and SfGlcNAcase-3/Sfhex, as described in Experimental Procedures. Each was expressed in Sf9 cells and purified from the extracellular fraction using glutathione affinity chromatography, as described in Experimental Procedures. Analysis of the purified products by SDS-PAGE with Coomassie Blue staining (Fig. 5A) or immunoblotting with anti-GST (Fig. 5B) established that each had been effectively purified and normalized. Subsequently, equivalent amounts of these purified protein preparations were assayed for β-N-acetylglucosaminidase activity using various glycan substrates.

The results of these assays showed that the GST-tagged ectodomains of both Dm-FDL and Sf-FDL converted GnGn to GnM (Fig. 6A; middle two panels), while the GST-tagged ectodomain of the SfGlcNAcase-3/Sfhex protein converted this substrate to GnM, MGn, and MM (Fig. 6A, bottom panel). All three enzymes removed the terminal N-acetylglucosamine residue from the α3-branch of MGn to produce MM (Fig. 6B), as expected, but only the SfGlcNAcase-3/Sfhex protein removed the terminal N-acetylglucosamine from the α6-branch of GnM to produce MM (Fig. 6C, bottom panel). Neither Dm-FDL nor Sf-FDL had any detectable effect on this glycan (Fig. 6C, middle two panels). Similarly, only the SfGlcNAcase-3/Sfhex protein hydrolyzed chitotriose to produce chitobiose and PA-tagged N-acetylglucosamine monomers (Fig. 6D; bottom panel), while Dm-FDL and Sf-FDL had virtually no effect on this glycan (Fig. 6D, middle two panels).

These data supported the major conclusion drawn from the experiments performed with the full-length, untagged forms of these enzymes, which was that Dm-FDL and Sf-FDL are specific for the terminal N-acetylglucosamine on the α3-branch of biantennary N-glycan substrates, while SfGlcNAcase-3/SfHex has a much broader spectrum of β-N-acetylglucosaminidase activity. Again, the specificities of Sf-FDL and Dm-FDL are consistent with their proposed function in N-glycan processing and with the conclusion that the Sf-fdl gene encodes the membrane bound, processing β-N-acetylglucosaminidase activity originally identified in Sf21 cells by Altmann and coworkers (1995).

To examine their substrate specificities more stringently, we incubated the purified, GST-tagged ectodomains of Dm-FDL, Sf-FDL, and SfGlcNAcase-3/SfHex with the various synthetic glycan substrates for 20 h to achieve a ten-fold increase in the enzyme assay times (Fig. 7). The results of these assays verified that Dm-FDL and Sf-FDL are highly specific, even under this extreme condition. Sf-FDL produced trace amounts of MGn from GnGn (Fig. 7A, middle panel) and trace amounts of MM from GnM (Fig. 7C, middle panel). In addition, both Sf-FDL and Dm-FDL produced trace amounts of chitobiose from chitotriose, and none of the aforementioned products were observed when the relevant glycan substrates were mock-digested with elution buffer alone (data not shown). Nevertheless, Sf-FDL is clearly much more specific than SfGlcNAcase-3/SfHex and one would reasonably question the physiological relevance of the small amounts of conversion obtained under these extreme in vitro reaction conditions.

EXAMPLE II

CLONING OF THE TnFDL AND BmFDL GENES

Common materials and methods

All PCRs were carried out in a final volume of 50 μL in 1X Phusion™ GC buffer with 0.2 mM of each dNTP, 1 μM of each primer, 1 M betaine, 0.6 U of Phusion™ DNA polymerase (NEB, Ipswich, MA) and 1 μL of template, except where indicated otherwise. All PCRs were carried out in a GeneAmp Model 2400 thermal cycler (Eppendorf, Foster City, CA). DNA was extracted from agarose gel fragments using the QiaQuick™ Gel Extraction Kit (Qiagen, Valencia, CA) according to the manufacturer's instructions and eluted into 50 μL.
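For illustration, the final concentrations listed above can be converted to pipetting volumes for one 50 μL reaction using the dilution relation C1·V1 = C2·V2. The final concentrations in this sketch are those stated in the text. Every stock concentration (5X buffer, 10 mM dNTP mix, 10 μM primers, 5 M betaine, 2 U/μL polymerase) is an assumption chosen only to make the example concrete, and the function name is hypothetical.

```python
def volume_ul(final_conc, stock_conc, total_ul):
    """Solve C1*V1 = C2*V2 for the stock volume V1 (both concentrations in the same unit)."""
    return final_conc * total_ul / stock_conc


TOTAL_UL = 50.0  # final reaction volume from the text

# Final concentrations are from the text; all stock concentrations are ASSUMED.
components = {
    "Phusion GC buffer": volume_ul(1.0, 5.0, TOTAL_UL),   # 1X from an assumed 5X stock
    "dNTP mix": volume_ul(0.2, 10.0, TOTAL_UL),           # 0.2 mM each from assumed 10 mM
    "forward primer": volume_ul(1.0, 10.0, TOTAL_UL),     # 1 uM from assumed 10 uM
    "reverse primer": volume_ul(1.0, 10.0, TOTAL_UL),     # 1 uM from assumed 10 uM
    "betaine": volume_ul(1.0, 5.0, TOTAL_UL),             # 1 M from assumed 5 M
    "Phusion polymerase": 0.6 / 2.0,                      # 0.6 U at an assumed 2 U/uL
    "template": 1.0,                                      # 1 uL, from the text
}
components["water"] = TOTAL_UL - sum(components.values())

for name, vol in components.items():
    print(f"{name:20s}{vol:6.2f} uL")
```

Changing any assumed stock concentration changes only the corresponding volume and the water needed to reach 50 μL.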

Sequences obtained from degenerate and semidegenerate PCRs, TOPO® clones and RACE reactions were analyzed and assembled into full-length mRNA and genomic DNA sequences using ContigExpress, a component of Invitrogen Vector NTI 10.3.0.

T. ni fdl

Degenerate PCR

Genomic DNA was isolated from T. ni cells (Tn-4h cell line) according to the method of Laird et al. (Laird et al., 1991, Nucleic Acids Res. 19:4293). Degenerate PCRs were carried out using T. ni genomic DNA with the primers ASPDEG and SPDEG as described previously (Geisler et al., 2008, J. Biol. Chem. 283:11330-11339). The spent reactions were separated on a 1.2% agarose gel and specific amplification products of the expected size (420 bp) were recovered from the gel, purified and directly sequenced with the same primers as used in the PCR.

Semi-degenerate PCR

Semi-degenerate PCRs were carried out using T. ni genomic DNA to extend the sequence of the degenerate fragment towards both the 3' and the 5' end. Degenerate primers were designed against regions that are highly conserved between the SfFDL and the BmFDL conceptual translations. To obtain part of the TnFDL 5' end, a semi-degenerate PCR was carried out using primers TnFDL ASP2 and TnFDL SP4DEG. The PCR was incubated for 20 sec at 98°C, then cycled 25 times using (i) 10 sec at 98°C, (ii) 15 sec at 72 to 60°C (with a decreasing temperature gradient of 0.5°C per cycle), and (iii) 60 sec at 72°C. The reaction was cycled another 30 times using (i) 10 sec at 98°C, (ii) 15 sec at 60°C, and (iii) 60 sec at 72°C, and finally incubated for 2 min at 72°C. The spent reaction was separated on a 1.4% agarose gel and a specific amplification product of about the expected size (1100 bp) was recovered from the gel and purified. This DNA fragment was re-amplified using the TnFDL ASP3 and TnFDL SP4DEG primers using the same conditions, gel purified and directly sequenced using the same primers as used in the PCR.
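The first cycling block described above is a touchdown program in which the annealing temperature falls from 72 to 60°C in steps of 0.5°C per cycle. The following minimal sketch generates the per-cycle annealing temperatures and confirms that such a gradient spans 25 cycles, as stated. The function name is hypothetical, and the sketch models only the annealing step, not the thermal cycler itself.

```python
def touchdown_schedule(start_c, end_c, step_c):
    """Annealing temperature for each cycle of a touchdown block, inclusive of both ends."""
    n_cycles = round((start_c - end_c) / step_c) + 1
    return [start_c - i * step_c for i in range(n_cycles)]


# Semi-degenerate PCR from the text: 72 to 60 C, decreasing 0.5 C per cycle.
semi_degenerate = touchdown_schedule(72.0, 60.0, 0.5)
print(len(semi_degenerate), "cycles:", semi_degenerate[:3], "...", semi_degenerate[-1])

# The RACE programs in this example use a 1 C per cycle gradient over the same range.
race = touchdown_schedule(72.0, 60.0, 1.0)
print(len(race), "cycles:", race)
```

The same calculation gives 13 cycles for a 72 to 60°C gradient at 1°C per cycle and 12 cycles for a 72 to 61°C gradient, which matches the cycle numbers given for the RACE reactions in this example.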

To obtain part of the 3' end, a semi-degenerate PCR was carried out using primers TnFDL SP1 and TnFDL ASP6DEG with identical cycling conditions as specified above. The spent reaction was separated on a 1.4% agarose gel and a specific amplification product of about the expected size (730 bp) was recovered from the gel and purified. This fragment was cloned into pCR®2.1-TOPO® according to the manufacturer's instructions, and three clones were sequenced to yield a consensus sequence.

5'RACE

Total RNA was isolated from a mid-log culture of Tn-4h cells using the Qiagen RNeasy® Plus Mini Kit according to the manufacturer's instructions. 5' RACE-ready RNA was prepared from total RNA using the Invitrogen GeneRacer™ kit according to the manufacturer's instructions. Reverse transcription was carried out using Thermo-X™ reverse transcriptase with the TnFDL ASP1 primer. The reaction was set up according to the manufacturer's instructions and incubated for (i) 5 min at 50°C, (ii) 15 min at 55°C, (iii) 30 min at 60°C and finally for (iv) 15 min at 60°C. The reaction was diluted with 40 μL of TE buffer and stored at -20°C.

5' RACE was carried out using the TnFDL ASP6 primer and the GeneRacer™ 5' Primer. The PCR was incubated for 30 sec at 96°C, then cycled 5 times using (i) 20 sec at 96°C, (ii) 60 sec at 72°C, after which the reaction was cycled 13 times using (i) 20 sec at 96°C, (ii) 20 sec at 72 to 60°C (with a decreasing temperature gradient of 1°C per cycle), and (iii) 40 sec at 72°C. The reaction was then cycled another 30 times using (i) 20 sec at 90°C, (ii) 20 sec at 60°C, and (iii) 40 sec at 72°C. The spent reaction was separated on a 1.4% agarose gel, and a specific amplification product of about 520 bp was isolated and purified. This DNA fragment was re-amplified using the GeneRacer™ 5' Nested Primer and either the TnFDL ASP6 or ASP7 primer. Reactions were incubated for 30 sec at 96°C, then cycled 5 times using (i) 20 sec at 96°C, (ii) 45 sec at 72°C, after which the reactions were cycled 13 times using (i) 20 sec at 96°C, (ii) 15 sec at 72 to 60°C (with a decreasing temperature gradient of 1°C per cycle), and (iii) 30 sec at 72°C. The reactions were then cycled another 30 times using (i) 20 sec at 96°C, (ii) 15 sec at 60°C, and (iii) 30 sec at 72°C. The spent reactions were separated on a 1.4% agarose gel, and specific amplification products of about 520 and 500 bp were isolated and directly sequenced using the TnFDL ASP6 or ASP7 primer, respectively.

3'RACE

3' RACE-ready cDNA was prepared from total T. ni RNA isolated as described above. Reverse transcription was carried out using Thermo-X™ reverse transcriptase with the GeneRacer™ Oligo dT primer. The reaction was set up according to the manufacturer's instructions and incubated in the same fashion as for 5' RACE. The reaction was diluted with 40 μL of TE buffer and stored at -20°C.

3' RACE was carried out using the TnFDL SP4 primer and the GeneRacer™ 3' Primer. The PCR was incubated for 30 sec at 96°C, then cycled 5 times using (i) 20 sec at 96°C, (ii) 45 sec at 72°C, after which the reactions were cycled 13 times using (i) 20 sec at 96°C, (ii) 15 sec at 72 to 60°C (with a decreasing temperature gradient of 1°C per cycle), and (iii) 30 sec at 72°C. The reactions were then cycled another 30 times using (i) 20 sec at 96°C, (ii) 15 sec at 60°C, and (iii) 30 sec at 72°C. The spent reaction was separated on a 1.4% agarose gel, and a specific band of approximately 600 bp was isolated and purified. This DNA fragment was re-amplified using the TnFDL SP5 primer and the GeneRacer™ 3' Nested Primer. The PCRs were incubated for 15 sec at 96°C, then cycled 5 times using (i) 15 sec at 96°C, (ii) 35 sec at 72°C, after which reactions were cycled 13 times using (i) 15 sec at 96°C, (ii) 15 sec at 72 to 60°C (with a decreasing temperature gradient of 1°C per cycle), and (iii) 20 sec at 72°C. The reactions were then cycled another 30 times using (i) 15 sec at 96°C, (ii) 15 sec at 60°C, and (iii) 20 sec at 72°C. The spent reactions were separated on a 1.4% agarose gel, and a specific amplification product of 500 bp was isolated, purified and directly sequenced using the TnFDL SP5 primer.

Amplification of the full-length TnFDL open reading frame for baculovirus expression

The full-length open reading frame was amplified from both cDNA primed with the GeneRacer™ Oligo dT Primer and genomic DNA (including the intron) using the TnFDL FL SP2 BD and TnFDL ASP BD primers. The reactions were incubated for 20 sec at 98°C, then cycled 25 times using (i) 15 sec at 98°C, (ii) 10 sec at 72 to 60°C (with a decreasing temperature gradient of 0.5°C per cycle), and (iii) 60 sec at 72°C. The reactions were then cycled another 30 times using (i) 15 sec at 98°C, (ii) 10 sec at 60°C, and (iii) 60 sec at 72°C. The spent reactions were separated on a 1% agarose gel, and amplification products of the expected size were excised and purified. The DNA fragments from the reactions templated with cDNA and genomic DNA were cloned into the pENTR™/D-TOPO® vector according to the manufacturer's instructions, yielding pENTR-TnFDL-C and pENTR-TnFDL-G, respectively. Four clones of each were sequenced, and a consensus clone of pENTR-TnFDL-C was used with Invitrogen's BaculoDirect™ kit according to the manufacturer's instructions to yield AcTnFDL.

B. mori fdl

Bombyx mori genomic database search results

A tBLASTn search of the available Bombyx mori genomic sequences was carried out with the SfFDL conceptual translation as query using the online NCBI interface. This search yielded, amongst others, the sequences BAAB01046610, BAAB01083831 and BAAB01153187. The sequence BAAB01046610 encodes a putative 5' coding exon with a start codon (nts 25-200). The conceptual translation of this exon shows high similarity to the amino-terminal part of SfFDL. The sequences BAAB01083831 and BAAB01153187 could be joined in silico to yield a contig encoding the putative 3' coding exon, including a stop codon. The conceptual translation of this exon showed high similarity to the carboxy-terminal part of SfFDL. The 5' coding exon could be joined in silico to the 3' coding exon at splice junctions predicted with high probability by NetGene2 (Hebsgaard et al., Nucleic Acids Res. 24:3439-3452), yielding a contiguous open reading frame.

Amplification of the full-length BmFDL open reading frame for baculovirus expression

Primers designed to amplify the entire predicted open reading frame with the additional sequence CACC 5' to the initiation codon were used in PCRs to amplify the open reading frame from cDNA as well as genomic DNA (including the intron). Genomic DNA was prepared by a modification of the method of Laird et al. (supra) from a single stage 2 B. mori larva (Qiufeng/Baiyu hybrid). Briefly, the larva was homogenized in lysis buffer supplemented with RNase A, after which the homogenate was incubated at 55°C for 1 hour. The lysate was then centrifuged at 13,000 x g to remove debris, and DNA was precipitated by addition of an equal volume of isopropyl alcohol. The DNA was dissolved in 500 μL of TE buffer and cleaned once by phenol-chloroform extraction. Total RNA was isolated from a single stage 2 B. mori larva (Qiufeng/Baiyu hybrid) using the Qiagen RNeasy™ Mini Plus kit. The larva was homogenized in Buffer RLT Plus, followed by centrifugation at 13,000 x g to remove debris. Total RNA was subsequently isolated according to the manufacturer's instructions. Total B. mori RNA was used to prepare 5' RACE-ready RNA using the Invitrogen GeneRacer™ kit according to the manufacturer's instructions. In two separate reactions, 5' RACE-ready RNA and total RNA were used for reverse transcription with Invitrogen Thermoscript™ reverse transcriptase using the BmFDL ASP1 primer and the GeneRacer™ Oligo dT Primer, respectively. The reactions were set up according to the manufacturer's instructions and incubated for (i) 5 min at 50°C, (ii) 15 min at 55°C, (iii) 30 min at 60°C and finally (iv) 15 min at 65°C. The reactions were diluted with 40 μL of TE buffer and stored at -20°C.
The predicted full-length open reading frame was amplified from both cDNA primed with the GeneRacer™ Oligo dT Primer and genomic DNA (including the intron). The PCRs were set up using the BmFDL FL SP2 and BmFDL ASP1CLO primers and incubated in the same fashion as for the amplification of the full-length TnFDL open reading frame. The spent reactions were separated on a 1% agarose gel, and bands of the expected size were isolated and purified. The DNA fragments from the reactions templated with genomic DNA and cDNA were cloned into the pENTR™/D-TOPO® vector according to the manufacturer's instructions, yielding pENTR-BmFDL-G and pENTR-BmFDL-C, respectively. Four clones of each were sequenced, yielding two distinct alleles from both gDNA and cDNA. Despite a substantial number of nucleotide substitutions, the conceptual translation of one of these alleles is identical to the conceptual translation of the putative fdl gene identified from the p50 (Daizo) strain. The two alleles differ from each other at several nucleotides in the intron and in both exons. However, only three nucleotide changes are not silent, resulting in the L138I, G404E and H481Q amino acid changes. pENTR-BmFDL-C was used with Invitrogen's BaculoDirect™ kit according to the manufacturer's instructions to generate AcBmFDL.
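
The distinction drawn above between silent and non-silent differences, and the L138I-style notation, can be sketched as follows. This is a minimal Python illustration under stated assumptions: it compares two equal-length coding sequences codon by codon using the standard genetic code, the two toy alleles are invented placeholders rather than BmFDL sequence, and the function names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (NCBI table 1 layout).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Conceptual translation of a coding sequence, one letter per full codon."""
    cds = cds.upper()
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds) - 2, 3))

def classify_differences(allele_a, allele_b):
    """Compare two equal-length coding sequences codon by codon.
    Returns (number of silent codon differences, list of amino acid changes
    written as <residue in A><1-based position><residue in B>)."""
    if len(allele_a) != len(allele_b):
        raise ValueError("this sketch assumes alleles of equal length")
    a, b = allele_a.upper(), allele_b.upper()
    silent, changes = 0, []
    for i in range(0, len(a) - 2, 3):
        codon_a, codon_b = a[i:i + 3], b[i:i + 3]
        if codon_a == codon_b:
            continue
        aa_a, aa_b = CODON_TABLE[codon_a], CODON_TABLE[codon_b]
        if aa_a == aa_b:
            silent += 1
        else:
            changes.append(f"{aa_a}{i // 3 + 1}{aa_b}")
    return silent, changes

# Invented four-codon alleles for illustration only.
toy_allele_1 = "ATGCTTGGACAT"
toy_allele_2 = "ATGATTGGGCAA"
silent_count, aa_changes = classify_differences(toy_allele_1, toy_allele_2)
```

Note that the sketch counts differing codons, not individual nucleotides, and it does not handle intron sequence or insertions and deletions.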

5' RACE

5' RACE was carried out using the BmFDL ASP4 primer and the GeneRacer™ 5' Primer with 5' RACE-ready cDNA prepared as described above. Reactions were incubated for 30 sec at 96°C, then cycled 5 times using (i) 15 sec at 96°C and (ii) 45 sec at 72°C, after which the reactions were cycled 12 times using (i) 15 sec at 96°C, (ii) 15 sec at 72 to 61°C (with a decreasing temperature gradient of 1°C per cycle), and (iii) 30 sec at 72°C. The reactions were then cycled another 30 times using (i) 15 sec at 96°C, (ii) 15 sec at 61°C and (iii) 30 sec at 72°C, and finally incubated for 1 min at 72°C. The spent reactions were separated on a 1.2% agarose gel. A specific band of about 570 bp was isolated, purified and re-amplified using the BmFDL ASP5 primer and the GeneRacer™ 5' Nested Primer using the same cycling conditions. The nested 5' RACE reactions were separated on a 1.4% agarose gel, showing a specific band of the expected size of 550 bp. This band was excised, purified and sequenced using the BmFDL ASP5 primer.
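
The cycling program described above can be written out step by step, which makes the touchdown phase easier to check. The following Python sketch is illustrative only: it assumes the first touchdown cycle anneals at 72°C and the twelfth at 61°C, it ignores ramp times between temperatures, and the function name touchdown_program is hypothetical.

```python
def touchdown_program():
    """Return the 5' RACE program as a list of (label, seconds, temp_C) steps."""
    steps = [("initial denaturation", 30, 96)]
    # Phase 1: five two-step cycles with combined annealing/extension at 72 C.
    for _ in range(5):
        steps += [("denature", 15, 96), ("anneal/extend", 45, 72)]
    # Phase 2: twelve touchdown cycles, annealing drops 1 C per cycle (72 -> 61).
    for anneal_temp in range(72, 60, -1):
        steps += [("denature", 15, 96), ("anneal", 15, anneal_temp), ("extend", 30, 72)]
    # Phase 3: thirty cycles at the final annealing temperature of 61 C.
    for _ in range(30):
        steps += [("denature", 15, 96), ("anneal", 15, 61), ("extend", 30, 72)]
    steps.append(("final extension", 60, 72))
    return steps

program = touchdown_program()
# Annealing temperatures of the twelve touchdown cycles, in order.
touchdown_temps = [temp for label, _, temp in program if label == "anneal"][:12]
# Total cycle count and total hold time (excluding ramping).
total_cycles = sum(1 for label, _, _ in program if label == "denature")
hold_seconds = sum(seconds for _, seconds, _ in program)
```

Under these assumptions the program comprises 47 cycles and 2910 sec of hold time, roughly 48.5 min before ramping is taken into account.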

3' RACE

3' RACE was carried out using the BmFDL SP4 primer and the GeneRacer™ 3' Primer with 3' RACE-ready cDNA. Reactions were cycled in the same fashion as described above for 5' RACE. The spent reaction was analyzed on a 1.4% agarose gel, showing a faint, specific band at 450 bp. This band was excised, purified and used for nested 3' RACE reactions with the BmFDL SP5 primer and the GeneRacer™ 3' Nested Primer using the same cycling conditions. The spent reactions showed a strong, specific band at the expected size of 420 bp. This band was excised, purified and sequenced using the BmFDL SP5 primer.

Primer Table

EXAMPLE III

Inhibition of specific, processing β-N-acetylglucosaminidase activity by an Sf-fdl-specific double-stranded RNA

If the Sf-fdl gene encodes the specific, processing β-N-acetylglucosaminidase activity in Sf9 cells, it should be possible to reduce this activity by RNA interference with Sf-fdl-specific double-stranded RNA. Toward this end, we constructed an immediate early expression plasmid encoding an inverted repeat sequence derived from a portion of the Sf-fdl coding sequence (Fig. 12) and used it to isolate a transformed Sf9 cell subclone, as described above. Microsomal membranes were then isolated from the parental cell line or the subclone and used for β-N-acetylglucosaminidase activity assays with GnGn as the substrate. HPLC analysis of the reaction products showed that the microsomal membranes from both the parental Sf9 cells and the transformed subclone converted GnGn to GnM, but not detectably to MM or MGn (data not shown). Thus, in this examination of endogenous β-N-acetylglucosaminidase activities in microsomal membranes from uninfected Sf9 cells, we detected only the specific, processing enzyme activity. Furthermore, we found that the levels of this specific, processing β-N-acetylglucosaminidase activity were over 50% lower in the membranes from the transformed subclone, relative to the parental controls (Fig. 11). The reduced levels of specific, processing β-N-acetylglucosaminidase activity in the Sf9 subclone transformed with the constitutive expression plasmid encoding Sf-fdl-specific double-stranded RNA strongly support the conclusion that the Sf-fdl gene encodes this activity.
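
The general idea of an inverted repeat that yields double-stranded RNA can be sketched in a few lines of Python. This is a generic illustration, not the construct of Fig. 12: the fragment and spacer below are invented placeholders, the presence and sequence of a spacer between the two arms is an assumption, and the function names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Reverse complement of a DNA sequence (A, C, G and T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]

def inverted_repeat(fragment, spacer):
    """Sense arm + spacer + antisense arm. A transcript of this insert can fold
    back on itself so that the two arms pair as double-stranded RNA."""
    return fragment.upper() + spacer.upper() + reverse_complement(fragment)

# Placeholder sequences for illustration only.
toy_fragment = "ATGGCTCAGG"
toy_spacer = "TTCAAGAGA"
construct = inverted_repeat(toy_fragment, toy_spacer)

# The antisense arm is the reverse complement of the sense arm.
arms_pair = reverse_complement(construct[-len(toy_fragment):]) == construct[:len(toy_fragment)]
```

A real design would use a much longer gene-specific fragment and would be checked for unintended matches to other transcripts.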

While SEQ ID NO:9 is specific for downregulating expression of the Sf-fdl-encoding nucleic acid, provision of the sequence information for the T. ni and B. mori homologs readily enables the skilled artisan to generate additional specific RNAi molecules for inhibiting expression of the same. Indeed, computer programs are available online which can assist in the design of such molecules.

From the foregoing description, those skilled in the art will appreciate that the presence or absence of a specific, processing β-N-acetylglucosaminidase is a key difference in the protein N-glycan processing pathways of insects and higher eukaryotes. In insect systems, this function was first identified as an enzyme activity in crude microsomal membranes isolated from a cell line derived from the lepidopteran insect, S. frugiperda (9). Efforts to molecularly clone the gene encoding this enzyme in these cells yielded two recent reports describing a single gene alternatively termed SfGlcNAcase-3 (18) and SfHex (17). Biochemical assays revealed that the SfGlcNAcase-3/SfHex gene product lacked the strict substrate specificity of the enzyme activity originally described by Altmann and co-workers in 1995. Based upon this and other findings, one group concluded in their report that the SfGlcNAcase-3/SfHex gene product is more likely to be involved in glycan and chitin degradation than in N-glycan processing (18). In contrast, based upon a slight preference for the appropriate substrate, the other group concluded in their report that this gene product is involved in N-glycan processing and hypothesized that it serves multifunctional roles in both N-glycan processing and glycan degradation in Sf9 cells (17). Parallel efforts to molecularly clone the processing β-N-acetylglucosaminidase from Drosophila melanogaster yielded a report describing the Dm-fdl gene and the characteristics of the gene product (16). Based upon its substrate specificity and the presence of a higher level of N-glycans containing terminal N-acetylglucosamine residues in mutant flies lacking a functional fdl gene, this report concluded that the fdl gene encoded the β-N-acetylglucosaminidase involved in N-glycan processing in this fruit fly.
In accordance with the present invention, we isolated an fdl gene from Sf9 cells and demonstrated that it encodes a membrane-associated β-N-acetylglucosaminidase with the same, strict substrate specificity exhibited by Dm-FDL and by the enzyme activity originally detected in S. frugiperda microsomes (9). The fact that the Sf9 genome encodes a gene with a close phylogenetic relationship to Dm-fdl, the fact that the Sf-fdl gene product is membrane-associated and has the strict substrate specificity and pH optimum profile of the original activity detected in S. frugiperda microsomes, and the fact that Sf9 cells engineered to express Sf-fdl-specific double-stranded RNA have lower levels of specific, processing β-N-acetylglucosaminidase activity all tend to support the view that the Sf-fdl gene encodes the β-N-acetylglucosaminidase involved in N-glycan processing in Sf9 cells. In addition, these findings support our previous conclusion that the broad spectrum β-N-acetylglucosaminidase encoded by the SfGlcNAcase-3/SfHex gene is more likely involved in glycan and chitin degradation.

The fdl gene orthologs were isolated from the lepidopteran insect species Spodoptera frugiperda, Trichoplusia ni and Bombyx mori because cell lines derived from these species are commonly used with the baculovirus expression system.

REFERENCES

1. Marz, L., Altmann, F., Staudacher, E., and Kubelka, V. (1995) Protein glycosylation in insects. In: Montreuil, J., Vliegenthart, J. F. G., and Schachter, H. (eds). Glycoproteins, Elsevier, Amsterdam

2. Marchal, I., Jarvis, D. L., Cacan, R., and Verbert, A. (2001) Biol. Chem. 382, 151-159

3. O'Reilly, D. R., Miller, L. K., and Luckow, V. A. (1992) Baculovirus expression vectors, W.H. Freeman and Company, New York

4. Summers, M. D., and Smith, G. E. (1987) Tx. Ag. Expt. Stn. Bull. No. 1555

5. Jarvis, D. L. (1997) Baculovirus expression vectors. In: Miller, L. K. (ed). The Baculoviruses, Plenum Press, New York

6. Fischer, R., Stoger, E., Schillberg, S., Christou, P., and Twyman, R. M. (2004) Curr. Op. Plant Biol. 7(2), 152-158

7. Ma, J. K., Drake, P. M., and Christou, P. (2003) Nat. Rev. Genet. 4(10), 794-805

8. Kornfeld, R., and Kornfeld, S. (1985) Ann. Rev. Biochem. 54, 631-664

9. Altmann, F., Schwihla, H., Staudacher, E., Glossl, J., and Marz, L. (1995) J. Biol. Chem. 270, 17344-17349

10. Zhang, W., Cao, P., Chen, S., Spence, A. M., Zhu, S., Staudacher, E., and Schachter, H. (2003) Biochem. J. 372(Pt 1), 53-64

11. Gutternigg, M., Kretschmer-Lubich, D., Paschinger, K., Rendic, D., Hader, J., Geier, P., Ranftl, R., Jantsch, V., Lochnit, G., and Wilson, I. B. (2007) J. Biol. Chem. 282(38), 27825-27840

12. Vitale, A., and Chrispeels, M. J. (1984) J. Cell Biol. 99(1 Pt 1), 133-140

13. Sturm, A. (1995) N-glycosylation of proteins in plants. In: Montreuil, J., Vliegenthart, J. F. G., and Schachter, H. (eds). Glycoproteins, Elsevier, Amsterdam

14. Vaughn, J. L., Goodwin, R. H., Thompkins, G. J., and McCawley, P. (1977) In Vitro 13, 213-217

15. Wagner, R., Geyer, H., Geyer, R., and Klenk, H. D. (1996) J. Virol. 70(6), 4103-4109

16. Leonard, R., Rendic, D., Rabouille, C., Wilson, I. B., Preat, T., and Altmann, F. (2006) J. Biol. Chem. 281(8), 4867-4875

17. Tomiya, N., Narang, S., Park, J., Abdul-Rahman, B., Choi, O., Singh, S., Hiratake, J., Sakata, K., Betenbaugh, M. J., Palter, K. B., and Lee, Y. C. (2006) J. Biol. Chem. 281(28), 19545-19560

18. Aumiller, J. J., Hollister, J., and Jarvis, D. L. (2006) Prot. Expr. Purif. 47, 571-590

19. Altschul, S. F., Madden, T. L., Schaffer, A. A., Zhang, J., Zhang, Z., Miller, W., and Lipman, D. J. (1997) Nucl. Acids Res. 25, 3389-3402

20. Brunak, S., Engelbrecht, J., and Knudsen, S. (1991) J. Mol. Biol. 220(1), 49-65

21. Thompson, J. D., Gibson, T. J., Plewniak, F., Jeanmougin, F., and Higgins, D. G. (1997) Nucl. Acids Res. 25, 4876-4882

22. Innis, M. A., and Gelfand, D. H. (1990) Optimization of PCRs. In: Innis, M. A., Gelfand, D. H., Sninsky, J. J., and White, T. J. (eds). PCR Protocols: A Guide to Methods and Applications, Academic Press, San Diego

23. Sambrook, J., Fritsch, E. F., and Maniatis, T. (1989) Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Press, Cold Spring Harbor, New York

24. Shi, X., and Jarvis, D. L. (2006) Analyt. Biochem. 356(2), 222-228

25. Hofmann, K., and Stoffel, W. (1993) Biol. Chem. Hoppe-Seyler 374, 166

26. Kitts, P. A., and Possee, R. D. (1993) Biotechniques 14(5), 810-817

27. Bao and Cagan (2006) RNA 12, 2020-2024

28. Jarvis, D. L., Weinkauf, C., and Guarino, L. A. (1996) Prot. Expr. Purif. 8, 191-203

29. von Heijne, G. (1992) J. Mol. Biol. 225(2), 487-494

30. Paulson, J. C., and Colley, K. J. (1989) J. Biol. Chem. 264, 17615-17618

31. Breton, C., Mucha, J., and Jeanneau, C. (2001) Biochimie 83(8), 713-718

32. Field, M. C., and Wainwright, L. J. (1995) 5, 463-472

33. Myerowitz, R., Piekarz, R., Neufeld, E. F., Shows, T. B., and Suzuki, K. (1985) Proceedings of the National Academy of Science of the United States of America 82(23), 7830-7834

34. O'Dowd, B. F., Quan, F., Willard, H. F., Lamhonwah, A. M., Korneluk, R. G., Lowden, J. A., Gravel, R. A., and Mahuran, D. J. (1985) Proceedings of the National Academy of Science of the United States of America 82(4), 1184-1188

35. Gaunt, M. W., and Miles, M. A. (2002) Mol. Biol. Evol. 19, 748-761

36. Felsenstein, J. (1989) Cladistics 5, 164-166

While certain of the preferred embodiments of the present invention have been described and specifically exemplified above, it is not intended that the invention be limited to such embodiments. Various modifications may be made thereto without departing from the scope and spirit of the present invention, as set forth in the following claims. Furthermore, the transitional phrases "comprising", "consisting essentially of" and "consisting of" define the scope of the appended claims, in original and amended form, with respect to what unrecited additional claim elements or steps, if any, are excluded. The term "comprising" is intended to be inclusive or open-ended and does not exclude additional, unrecited elements, method steps or materials. The phrase "consisting of" excludes any element, step or material other than those specified in the claim and, in the latter instance, impurities ordinarily associated with the specified materials. The phrase "consisting essentially of" limits the scope of a claim to the specified elements, steps or materials and those that do not materially affect the basic and novel characteristic(s) of the claimed invention. All compositions or formulations identified herein can, in alternate embodiments, be more specifically defined by any of the transitional phrases "comprising", "consisting essentially of" and "consisting of".

Claims

What is claimed is:

1. An isolated nucleic acid encoding a β-N-acetylglucosaminidase having a sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6 and SEQ ID NO:8.

2. An isolated nucleic acid encoding a β-N-acetylglucosaminidase of SEQ ID NO:2.

3. An isolated nucleic acid encoding a β-N-acetylglucosaminidase of SEQ ID NO:4.

4. An isolated nucleic acid encoding a β-N-acetylglucosaminidase of SEQ ID NO:6.

5. An isolated nucleic acid encoding a β-N-acetylglucosaminidase of SEQ ID NO:8.

6. The isolated nucleic acid of claim 1, which is SEQ ID NO:1.

7. The isolated nucleic acid of claim 1, which is SEQ ID NO:3.

8. The isolated nucleic acid of claim 1, which is SEQ ID NO:5.

9. The isolated nucleic acid of claim 1, which is SEQ ID NO:7.

10. The isolated nucleic acid molecule of claim 1, which is a DNA molecule.

11. An RNA molecule encoded by at least one of the nucleic acid molecules of claim 1.

12. An expression vector comprising at least one of the nucleic acid molecules of claim 1.

13. A recombinant insect cell transformed with the expression vector of claim 12.

14. The RNA molecule of claim 11, which is a fragment of SEQ ID NO:1, having SEQ ID NO:9, said RNA being double stranded.

15. An expression vector comprising the RNA molecule of claim 14.

16. A recombinant transgenic insect cell comprising the expression vector of claim 15.

17. A method for enhancing production of mammalian-like N-glycans in insect cells, comprising a) providing the recombinant insect cells of claim 16; b) transforming said cells with an expression vector comprising a nucleic acid encoding a heterologous glycoprotein of interest, said glycoprotein expressed in said cells of a) comprising elevated levels of mammalian-like N-glycans when compared to levels observed in wild type cells.

18. The recombinant insect cells of claim 16, further comprising at least one glycosylation enzyme selected from the group consisting of N-acetylglucosaminyltransferases, galactosyltransferases, sialyltransferases, sulfotransferases, sialic acid synthases, CMP-sialic acid synthetases, UDP-N-acetylglucosamine-2-epimerases/N-acetylmannosamine kinases, and CMP-sialic acid transporters.

19. A method for inhibiting β-N-acetylglucosaminidase activity comprising contacting an Sf-fdl-expressing cell with a nucleic acid molecule comprising SEQ ID NO:9 in an amount effective to down-regulate β-N-acetylglucosaminidase endogenous to that cell.

20. An isolated protein comprising a sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6 and SEQ ID NO:8.