WO2009079376A2 - Lepidopteran insect n-acetylglucosaminidase genes and their use in glycoengineering - Google Patents
- Publication number
- WO2009079376A2 (PCT/US2008/086606)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- seq
- fdl
- nucleic acid
- cells
- sec
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/113—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
- C12N15/1137—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing against enzymes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P21/00—Preparation of peptides or proteins
- C12P21/005—Glycopeptides, glycoproteins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/24—Hydrolases (3) acting on glycosyl compounds (3.2)
- C12N9/2402—Hydrolases (3) acting on glycosyl compounds (3.2) hydrolysing O- and S- glycosyl compounds (3.2.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y302/00—Hydrolases acting on glycosyl compounds, i.e. glycosylases (3.2)
- C12Y302/01—Glycosidases, i.e. enzymes hydrolysing O- and S-glycosyl compounds (3.2.1)
- C12Y302/01096—Mannosyl-glycoprotein endo-beta-N-acetylglucosaminidase (3.2.1.96)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
- C12N2310/14—Type of nucleic acid interfering N.A.
Definitions
- This invention relates to the fields of molecular biology and production of proteins possessing complex type oligosaccharide side chains. More specifically, the invention provides novel nucleic acid sequences encoding β-N-acetylglucosaminidase enzymes and recombinant insect cell lines comprising the same for the production of therapeutic and commercially valuable glycoproteins.
- Insect and mammalian protein N-glycosylation pathways each begin with the co-translational transfer of N-glycan precursors to nascent proteins (1,8). These precursors are subsequently trimmed and elongated by enzymes localized in the endoplasmic reticulum and Golgi apparatus of insect and mammalian cells to produce a common intermediate with the structure Manα6(GlcNAcβ2Manα3)Manβ4GlcNAcβ4GlcNAc-R. In mammalian cells, this intermediate is elongated by various glycosyltransferases to produce complex N-glycans, which often have terminal sialic acid residues.
- Insect cells usually fail to elongate this same intermediate and convert it, instead, to paucimannose N-glycans with the core structure Manα6(Manα3)Manβ4GlcNAcβ4GlcNAc-R.
- An unusual β-N-acetylglucosaminidase is responsible for the production of these structures (9).
- This enzyme specifically removes the terminal N-acetylglucosamine residue from the α3 branch of Manα6(GlcNAcβ2Manα3)Manβ4GlcNAcβ4GlcNAc-R, simultaneously eliminating the intermediate required for N-glycan elongation and producing the core paucimannose glycan typically found on insect cell-derived N-glycoproteins.
- IPLB-Sf21AE, a cell line derived from the lepidopteran insect S. frugiperda (14), has a membrane-associated β-N-acetylglucosaminidase activity that can specifically cleave the terminal N-acetylglucosamine residue from the α3 branch of a biantennary N-glycan in vitro. Subsequently, it was shown that cell lines derived from E.
- N-glycosylation pathway of at least some insect cells includes a processing β-N-acetylglucosaminidase, as described above.
- unequivocal proof of this concept awaited the isolation of an insect gene encoding this enzyme, together with evidence that the gene product had the substrate specificity of the N-glycan processing enzyme.
- Dm-FDL melanogaster hexosaminidase genes (hexo-1 and hexo-2) encode enzymes that can cleave chito-oligosaccharides, but not N-glycans, strongly suggested that Dm-FDL is the β-N-acetylglucosaminidase responsible for N-glycan processing in this fly. These properties also were consistent with the idea that Dm-FDL is an ortholog of the lepidopteran insect N-glycan processing enzyme first detected by Altmann and coworkers (1995) in microsomal membranes from IPLB-Sf21AE cells.
- SfGlcNAcase-1 and SfGlcNAcase-3 gene products showed that they had high sequence homology to known hexosaminidases and that each also had β-N-acetylglucosaminidase activity when assayed against relevant substrates. However, neither had the tight α3 branch specificity of the processing enzyme activity originally described by Altmann and coworkers (1995). In fact, each could remove the terminal N-acetylglucosamine residues from either the α3 or the α6 branch of various N-glycan substrates and each also was able to release N-acetylglucosamine monomers from a chito-oligosaccharide substrate.
- SfGlcNAcase-3 Further analysis of the Sfhex gene product, which is identical to the gene product we designated SfGlcNAcase-3, confirmed that the SfGlcNAcase-3/Sfhex gene product lacks the α3 branch specificity of the processing enzyme activity originally described by Altmann and coworkers. However, because this enzyme had a 2- to 5-fold higher preference for the terminal N-acetylglucosamine residue on the α3 branch of an N-glycan substrate, Tomiya and coworkers (2006) concluded that the SfGlcNAcase-3/Sfhex gene encodes the processing β-N-acetylglucosaminidase of Sf9 cells.
- nucleic acid encoding an N-acetylglucosaminidase
- the nucleic acid encodes a protein of SEQ ID NO: 2.
- nucleic acid is SEQ ID NO: 1.
- the nucleic acid molecules of the invention may be DNA, RNA, or cDNA and they may be single or double stranded. Additional embodiments of the invention include nucleic acids of SEQ ID NOS: 3, 5, and 7 and their encoded proteins SEQ ID NOS: 4, 6, and 8.
- expression vectors comprising the nucleic acid molecules described above are provided. Also within the scope of the invention are recombinant insect cells transformed with such expression vectors.
- the RNA molecule is a fragment of SEQ ID NO: 1 having SEQ ID NO: 9, which is double stranded and, when expressed in a cell, down regulates production of the protein of SEQ ID NO: 2.
- the present invention provides isolated proteins comprising SEQ ID NOS: 2, 4, 6 and 8.
- the isolated proteins of this invention may be used for the production of specific glycans for use as standards, or substrates, e.g., in remodeling recombinant glycoprotein glycans.
- a method for enhancing production of mammalian-like N-glycans in insect cells entails providing recombinant insect cell lines comprising the double stranded RNA molecule described above, either transforming the cells with an expression vector or infecting the cells with a recombinant baculovirus comprising a nucleic acid encoding a heterologous glycoprotein of interest, wherein glycoprotein(s) expressed in the recombinant cells comprise elevated levels of mammalian-like N-glycans when compared to levels observed in wild type cells.
- the cells described above may optionally contain additional enzymes involved in the production and synthesis of mammalian-like N glycans.
- enzymes include, without limitation, N-acetylglucosaminyltransferases, galactosyltransferases, sialyltransferases, sulfotransferases, sialic acid synthases, CMP-sialic acid synthetases, UDP-N-acetylglucosamine 2-epimerases/N-acetylmannosamine kinases, and CMP-sialic acid transporters.
- Sf-fdl encodes a membrane-associated product that specifically cleaves the terminal N-acetylglucosamine residue from the α3 branch of N-glycan substrates, that has little or no activity against chito-oligosaccharide substrates, and that has precisely the same pH profile as the activity originally identified by Altmann and coworkers (1995) in IPLB-Sf21AE cell microsomes. Furthermore, Sf9 cells engineered to express a Sf-fdl-specific double-stranded RNA had lower levels of specific, processing β-N-acetylglucosaminidase activity.
- Fig. 1 Nucleotide sequence of the Sf-fdl gene (SEQ ID NO: 1) and amino acid sequence of the gene product (SEQ ID NO: 2). The putative N-terminal transmembrane domain is underlined and the two consensus N-glycosylation sites are boxed.
- FIG. 2 Phylogenetic relationships between the Sf-FDL protein and known hexosaminidases.
- This Figure shows the phylogenetic relationships between the Sf-FDL protein and Dm-FDL (Acc. No. NM_165909; 16), SfGlcNAcase-3/SfHex (Acc. No. DQ249309; 17,18), SfGlcNAcase-1 (DQ249307; 18), and the human hexosaminidases A (Acc. No. NM_000520; 33) and B (NM_000521; 34).
- the amino acid sequences of these proteins were aligned using CLUSTALX version 1.83 (21) using the default settings and then the alignment was exported in the PHYLIP format (36) and used to generate a distance matrix by protdist in PHYLIP version 3.66 with the Jones-Taylor-Thornton model. Neighbor in PHYLIP version 3.66 was used to generate an unrooted tree from the distance matrix with the neighbor-joining method and, finally, the Neighbor output was used to draw an unrooted tree with the PHYLIP postscript generator.
- the Sf-FDL amino acid sequence is 44% and 29% identical to the sequences of Dm-FDL and SfGlcNAcase-3/SfHex, respectively.
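- As a rough illustration of how a pairwise percent identity of the kind quoted above could be computed from two rows of a multiple alignment, a minimal Python sketch follows. The function name, the gap character, and the choice to count only columns in which both rows are ungapped are assumptions made for illustration; the text does not state which denominator was used, and the sequences in the usage note are toy strings, not the Sf-FDL or Dm-FDL sequences.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over columns where neither aligned row has a gap.

    Both inputs must be rows taken from the same alignment, so they must
    have equal length. The denominator (ungapped columns only) is an
    assumption; other conventions exist.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")

    compared = 0  # columns with a residue in both rows
    matches = 0   # columns where the two residues are identical
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue
        compared += 1
        if a.upper() == b.upper():
            matches += 1

    if compared == 0:
        return 0.0
    return 100.0 * matches / compared
```

For example, calling `percent_identity("MKT-LV", "MRTALV")` compares five ungapped columns, four of which match.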
- Fig. 3 Substrate specificity of Sf-FDL.
- Various glycan substrates including GnGn (A), MGn (B), GnM (C), and chitotriose (D) were incubated for 16 h with microsomal fractions containing 10 µg of total protein from Sf9 cells infected with AcMNPV, AcDm-FDL, AcSf-FDL, or AcGlcNAcase-3.
- the reaction products were then recovered and analyzed by reverse-phase HPLC, as described in Experimental Procedures.
- the arrows show the elution times for each of the relevant glycans.
- Fig. 4 pH optimum of Sf-FDL.
- Microsomal fractions containing 10 µg of total protein from AcSf-FDL-infected Sf9 cells were incubated for 16 h with GnGn at pH values between 4.0 and 8.0, and then the reaction products were recovered and analyzed by reverse-phase HPLC, as described in Experimental Procedures.
- the plot shows the relative percentages of GnM produced at each pH as a percentage of the area under the GnM peak divided by the sum of the area under the GnGn and GnM peaks.
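- The quantity plotted in Fig. 4, as described above, is the GnM peak area divided by the sum of the GnGn and GnM peak areas, expressed as a percentage. A minimal Python sketch of that calculation follows; the function and parameter names are hypothetical, and no real peak areas are given in the text.

```python
def relative_gnm_percent(gngn_area: float, gnm_area: float) -> float:
    """Return the GnM peak area as a percentage of (GnGn + GnM) peak area.

    gngn_area: integrated area under the GnGn (substrate) peak.
    gnm_area:  integrated area under the GnM (product) peak.
    """
    if gngn_area < 0 or gnm_area < 0:
        raise ValueError("peak areas must be non-negative")
    total = gngn_area + gnm_area
    if total == 0:
        raise ValueError("at least one peak must have non-zero area")
    return 100.0 * gnm_area / total
```

Applying this function to the peak areas measured at each pH value would yield the series of points for a pH profile of the kind described for Fig. 4.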
- Fig. 5 Expression and purification of GST-tagged β-N-acetylglucosaminidase ectodomains.
- the GST-tagged ectodomains of Sf-FDL (lanes 1), Dm-FDL (lanes 2), and SfGlcNAcase-3/Sfhex (lanes 3) were expressed in recombinant baculovirus-infected Sf9 cells and purified from the extracellular fraction by glutathione affinity chromatography, as described in Experimental Procedures. Equal amounts of the purified products were then analyzed by (A) SDS-PAGE with Coomassie Blue staining or (B) SDS-PAGE with immunoblotting using a GST-specific antiserum.
- Fig. 6 Substrate specificity of the GST-tagged ectodomains of Sf-FDL, Dm-FDL, and SfGlcNAcase-3/Sfhex. Equal amounts of each enzyme were incubated for 2 h with GnGn (A), MGn (B), GnM (C), or chitotriose (D) and the reaction products were recovered and analyzed by reverse-phase HPLC, as described in Experimental Procedures. The arrows show the elution times for each of the relevant glycans.
- Fig. 7 Overdigestion of glycan substrates with the GST-tagged ectodomains of Sf-FDL, Dm-FDL, and SfGlcNAcase-3/Sfhex. Equal amounts of each enzyme were incubated for 20 h with GnGn (A), GnM (B), or chitotriose (C) and the reaction products were recovered and analyzed by reverse-phase HPLC, as described in Experimental Procedures. The arrows show the elution times for each of the relevant glycans.
- Fig. 8 Nucleotide sequence of the Tn-fdl gene (SEQ ID NO: 3) and amino acid sequence of the gene product (SEQ ID NO:4). The putative N-terminal transmembrane domain is underlined and the two consensus N-glycosylation sites are boxed.
- Fig. 9 Nucleotide sequence of one allele of the Bm-fdl gene (SEQ ID NO:5) and amino acid sequence of the gene product (SEQ ID NO:6). The putative N-terminal transmembrane domain is underlined and the three consensus N-glycosylation sites are boxed.
- Fig. 10 Nucleotide sequence of another allele of the Bm-fdl gene (SEQ ID NO:7) and amino acid sequence of the gene product (SEQ ID NO:8). The putative N-terminal transmembrane domain is underlined and the three consensus N-glycosylation sites are boxed.
- Fig. 11 Endogenous levels of specific, processing β-N-acetylglucosaminidase activity in parental Sf9 cells and an Sf9-derived clone expressing an Sf-fdl-specific double-stranded RNA.
- Microsomal membrane preparations from Sf9 or SfFDL RNAi cells were incubated for 16 h with GnGn, and the reaction products were analyzed by HPLC to compare the relative amounts of GnM produced.
- the plot shows the average results obtained in five replicate assays, with the average percentage of GnM produced by microsomes from the Sf9 controls set to 100%.
- the error bars show the standard deviations and a one-way ANOVA analysis showed that the two datasets are significantly different (P < 0.01).
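- The analysis described for Fig. 11 (scaling so that the control mean equals 100%, reporting standard deviations, and comparing the two groups by one-way ANOVA) can be sketched in Python as follows. This is only an illustrative sketch: the function names are hypothetical, the numbers in the example are invented placeholders and not data from the application, and the code computes only the F statistic; converting F to a P value would require the F distribution (for example from a statistics package), which is not shown.

```python
from statistics import mean, stdev


def normalize_to_control(control, treated):
    """Scale both groups so that the mean of the control group is 100%."""
    control_mean = mean(control)
    if control_mean == 0:
        raise ValueError("control mean must be non-zero")
    scale = 100.0 / control_mean
    return [x * scale for x in control], [x * scale for x in treated]


def one_way_anova_f(*groups):
    """Return the one-way ANOVA F statistic for two or more groups."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    if k < 2 or n_total <= k:
        raise ValueError("need at least two groups and more values than groups")

    grand_mean = sum(sum(g) for g in groups) / n_total
    # Between-group and within-group sums of squares.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)
    if ss_within == 0:
        raise ValueError("within-group variance is zero; F is undefined")

    return (ss_between / (k - 1)) / (ss_within / (n_total - k))


if __name__ == "__main__":
    # Placeholder percentages of GnM produced (five replicates each);
    # these values are made up purely to exercise the functions.
    sf9_control = [40.0, 42.0, 38.0, 41.0, 39.0]
    rnai_clone = [20.0, 22.0, 19.0, 21.0, 18.0]

    ctrl_norm, rnai_norm = normalize_to_control(sf9_control, rnai_clone)
    print("control mean/SD:", mean(ctrl_norm), stdev(ctrl_norm))
    print("RNAi mean/SD:", mean(rnai_norm), stdev(rnai_norm))
    print("F statistic:", one_way_anova_f(ctrl_norm, rnai_norm))
```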
- Fig. 12 The sequence utilized in the RNAi experiment is shown (SEQ ID NO:9).
- Manα6(Manα3)Manβ4GlcNAcβ4GlcNAc-R is the core structure of the major processed protein N-glycans produced by insect cells. Ultimately, this paucimannose type structure is produced by an unusual β-N-acetylglucosaminidase, which removes the terminal N-acetylglucosamine residue from the upstream intermediate, Manα6(GlcNAcβ2Manα3)Manβ4GlcNAcβ4GlcNAc-R.
- SfGlcNAcase-3 or SfHex encodes this function
- this gene encodes a broad-spectrum β-N-acetylglucosaminidase that functions in glycan and chitin degradation.
- Sf-fdl, an S. frugiperda fdl ortholog, encodes a product with the substrate specificity expected of a processing β-N-acetylglucosaminidase. It is also shown that the endogenous levels of specific, processing β-N-acetylglucosaminidase activity are significantly reduced in S. frugiperda cells engineered to express a double-stranded RNA derived from the Sf-fdl gene. These results indicate that Sf-fdl encodes the specific, processing β-N-acetylglucosaminidase of S. frugiperda.
- a "cell line” refers to cells which can be cultured in the lab for an indefinite period and are useful for producing large amounts of a protein of interest. Ideally, such cells are immortalized and do not exhibit senescence in culture.
- insect includes any stage of development of an insect, including a one-celled germ line cell, a fertilized egg, an early embryo, a larva, including any of a first through final instar larva, a pupa, or an adult insect.
- a large larva such as a fourth or fifth instar larva is preferred.
- insect stage is suitable for a particular purpose, such as for direct production of a glycosylated polypeptide of interest, for storage or transport of an insect to a different location, for generation of progeny, for further genetic crosses, or the like.
- isolated nucleic acid refers to a DNA molecule that is separated from sequences with which it is immediately contiguous (in the 5' and 3' directions) in the naturally occurring genome of the organism from which it originates.
- isolated nucleic acid may comprise a DNA or cDNA molecule inserted into a vector, such as a plasmid or virus vector, or integrated into the DNA of a prokaryote or eukaryote.
- isolated nucleic acid primarily refers to an RNA molecule encoded by an isolated DNA molecule as defined above.
- the term may refer to an RNA molecule that has been sufficiently separated from RNA molecules with which it would be associated in its natural state (i.e., in cells or tissues), such that it exists in a “substantially pure” form (the term “substantially pure” is defined below).
- isolated protein or “isolated and purified protein” is sometimes used herein. This term refers primarily to a protein produced by expression of an isolated nucleic acid molecule of the invention. Alternatively, this term may refer to a protein which has been sufficiently separated from other proteins with which it would naturally be associated, so as to exist in "substantially pure” form.
- promoter region refers to the transcriptional regulatory regions of a gene, which may be found at the 5' or 3' side of the coding region, or within the coding region, or within introns.
- vector refers to a small carrier DNA molecule into which a DNA sequence can be inserted for introduction into a host cell where it will be replicated.
- expression vector is a specialized vector that contains a gene or nucleic acid sequence with the necessary regulatory regions needed for expression in a host cell.
- operably linked means that the regulatory sequences necessary for expression of a coding sequence are placed in the DNA molecule in the appropriate positions relative to the coding sequence so as to effect expression of the coding sequence.
- This same definition is sometimes applied to the arrangement of coding sequences and transcription control elements (e.g. promoters, enhancers, and termination elements) in an expression vector.
- This definition is also sometimes applied to the arrangement of nucleic acid sequences of a first and a second nucleic acid molecule wherein a hybrid nucleic acid molecule is generated.
- substantially pure refers to a preparation comprising at least 50-60% by weight the compound of interest (e.g., nucleic acid, oligonucleotide, protein, etc.). More preferably, the preparation comprises at least 75% by weight, and most preferably 90-99% by weight, of the compound of interest. Purity is measured by methods appropriate for the compound of interest (e.g. chromatographic methods, agarose or polyacrylamide gel electrophoresis, HPLC analysis, and the like).
- the phrase "consisting essentially of" when referring to a particular nucleotide sequence or amino acid sequence means a sequence having the properties of a given SEQ ID NO:.
- when used in reference to an amino acid sequence, the phrase includes the sequence per se and molecular modifications that would not affect the basic and novel characteristics of the sequence.
- oligonucleotide refers to primers and probes of the present invention, and is defined as a nucleic acid molecule comprised of two or more ribo- or deoxyribonucleotides, preferably more than three. The exact size of the oligonucleotide will depend on various factors and on the particular application for which the oligonucleotide is used.
- probe refers to an oligonucleotide, polynucleotide or nucleic acid, either RNA or DNA, whether occurring naturally as in a purified restriction enzyme digest or produced synthetically, which is capable of annealing with or specifically hybridizing to a nucleic acid with sequences complementary to the probe.
- a probe may be either single-stranded or double-stranded. The exact length of the probe will depend upon many factors, including temperature, source of probe and method of use. For example, for diagnostic applications, depending on the complexity of the target sequence, the oligonucleotide probe typically contains 15-25 or more nucleotides, although it may contain fewer nucleotides.
- the probes herein are selected to be “substantially” complementary to different strands of a particular target nucleic acid sequence. This means that the probes must be sufficiently complementary so as to be able to "specifically hybridize” or anneal with their respective target strands under a set of pre-determined conditions. Therefore, the probe sequence need not reflect the exact complementary sequence of the target. For example, a non-complementary nucleotide fragment may be attached to the 5' or 3' end of the probe, with the remainder of the probe sequence being complementary to the target strand. Alternatively, non-complementary bases or longer sequences can be interspersed into the probe, provided that the probe sequence has sufficient complementarity with the sequence of the target nucleic acid to anneal therewith specifically.
- the term "specifically hybridize" refers to the association between two single-stranded nucleic acid molecules of sufficiently complementary sequence to permit such hybridization under pre-determined conditions generally used in the art (sometimes termed "substantially complementary").
- the term refers to hybridization of an oligonucleotide with a substantially complementary sequence contained within a single-stranded DNA or RNA molecule of the invention, to the substantial exclusion of hybridization of the oligonucleotide with single-stranded nucleic acids of non-complementary sequence.
- primer refers to an oligonucleotide, either RNA or DNA, either single-stranded or double-stranded, either derived from a biological system, generated by restriction enzyme digestion, or produced synthetically which, when placed in the proper environment, is able to act functionally as an initiator of template-dependent nucleic acid synthesis.
- suitable nucleoside triphosphate precursors of nucleic acids, a polymerase enzyme, suitable cofactors and conditions such as a suitable temperature and pH
- the primer may be extended at its 3' terminus by the addition of nucleotides by the action of a polymerase or similar activity to yield a primer extension product.
- the primer may vary in length depending on the particular conditions and requirements of the application.
- the oligonucleotide primer is typically 15-25 or more nucleotides in length.
- the primer must be of sufficient complementarity to the desired template to prime the synthesis of the desired extension product, that is, to be able to anneal with the desired template strand in a manner sufficient to provide the 3' hydroxyl moiety of the primer in appropriate juxtaposition for use in the initiation of synthesis by a polymerase or similar enzyme. It is not required that the primer sequence represent an exact complement of the desired template.
- a non-complementary nucleotide sequence may be attached to the 5' end of an otherwise complementary primer.
- non-complementary bases may be interspersed within the oligonucleotide primer sequence, provided that the primer sequence has sufficient complementarity with the sequence of the desired template strand to functionally provide a template-primer complex for the synthesis of the extension product.
- the Blastn 2.0 program provided by the National Center for Biotechnology Information (at http://www.ncbi.nlm.nih.gov/blast/; Altschul et al., 1990, J Mol Biol 215:403-410), using a gapped alignment with default parameters, may be used to determine the level of identity and similarity between nucleic acid sequences and amino acid sequences.
- expression control sequence refers to a polynucleotide sequence that regulates expression of a polypeptide coded for by a polynucleotide to which it is functionally ("operably") linked. Expression can be regulated at the level of the mRNA or polypeptide.
- expression control sequence includes mRNA-related elements and protein-related elements. Such elements include promoters, domains within promoters, upstream elements, enhancers, elements that confer tissue or cell specificity, response elements, ribosome binding sequences, transcriptional terminators, etc. Suitable expression control sequences that can function in insect cells will be evident to the skilled worker. In some embodiments, it is desirable that the expression control sequence comprises a constitutive promoter.
- baculovirus promoters for the p10, polyhedrin (polh), p6.9, capsid, and cathepsin-like genes.
- baculovirus promoters for the ie1, ie2, ie0, et1, 39K (aka pp31), and gp64 genes.
- Other suitable strong constitutive promoters include the B. mori actin gene promoter; D.
- enhancer elements such as the baculovirus enhancer element, hr5, may be used in conjunction with the promoter.
- the expression control sequence comprises a tissue-or organ-specific promoter. Many such expression control sequences will be evident to the skilled worker.
- the enzymes involved in N-glycan processing of the invention are required in catalytic amounts. Therefore, in one embodiment of the invention, much lower amounts of these enzymes are present than of the heterologous polypeptides of interest, which are generated in large amounts, glycosylated, and harvested for further use.
- a suitable molar ratio of heterologous protein produced to enzyme involved in N-glycan processing may be greater than about 100:1.
- the enzymes involved in N-glycan processing may be in comparable (e.g., approximately stoichiometric) amounts to the heterologous glycoprotein to be processed.
- suitable promoters and/or conditions to express suitable amounts of the enzymes involved in N-glycan processing (e.g., amounts which are sufficient to (effective to) process the N-glycans of relatively high amounts of a protein of interest to the desired extent).
- a skilled worker can readily ensure that the enzymes involved in N-glycan processing are present in sufficient local concentrations, and at an optimal time during insect propagation.
- an expression control sequence is regulatable (e.g., comprises an inducible promoter and/or enhancer element).
- Suitable regulatable promoters include, e.g., Drosophila or other hsp70 promoters, the Drosophila metallothionein promoter, an ecdysone-regulated promoter, the Saccharomyces cerevisiae Gal4/UAS system, and other well-known inducible systems.
- a Tet-regulatable molecular switch may be used in conjunction with any constitutive promoter, such as those described elsewhere herein (e.g., in conjunction with the CMV-IE promoter, or baculovirus promoters).
- Another type of inducible promoter is a baculovirus late or very late promoter that is only activated following infection by a baculovirus.
- a variety of immortalized lepidopteran insect cell lines are suitable for transformation by the vectors/constructs of the invention. Among these are Sf9 (Vaughn et al. (1977) In Vitro 13, 213-217), Tn 5B1-4 (High Five; Wickham et al. (1992) Biotech. Progr. 8, 391-6), expresSf+ (Protein Sciences Corporation), and BmN (Bm-N4; Maeda et al.
- transgenic insect cell lines are conventional.
- one or more genes to be introduced are placed under the control of a suitable expression control sequence and are cloned into one or more plasmid vectors. These vectors are then mixed with a vector encoding a selectable marker under the control of a suitable expression control sequence.
- the DNA mixture is then introduced into the parental insect cell line (e.g., by calcium phosphate-mediated transfection), and the transgene(s) will integrate by non-homologous recombination into the insect cell genome.
- Transformed cells are selected using an appropriate antibiotic (e.g.
- RNA dot blot assays, lectin staining assays, or functional assays.
- This general approach was first described in 1990 (Jarvis et al., 1990. Bio/Technology 8, 950-955) and has been reviewed recently (Harrison, R.L. and Jarvis, D.L. 2007. Transforming lepidopteran insect cells for improved protein processing. In D.W. Murhammer (Ed.), Methods in Molecular Biology: Baculovirus Expression Protocols. Humana Press, Clifton, NJ. Methods Mol Biol. (2007) 388:3-22).
- one or more genes to be introduced are placed under the control of a suitable expression control sequence, and are cloned into a vector, such as a viral vector (e.g., an attenuated baculovirus vector, or a non-permissive viral vector that is not infective for the particular insect of interest).
- the sequences to be introduced into the insect are flanked by genomic sequences from the insect.
- the construct is then introduced into an insect egg (e.g., by microinjection), and the transgene(s) then integrate by homologous recombination of the flanking sequences into comparable sequences in the insect genome.
- the vector is a transposon-based vector.
- a transposon-based vector may be a viral vector (such as those described above) that further comprises inverted terminal repeats of a suitable transposon, between which the transgene of interest is cloned.
- a suitable expression control sequence s
- the transposon-based vector carries its own transposase.
- the transposon-based vector does not encode a suitable transposase. In this case, the vector is co-transfected into an insect (e.
- the recombinant vector (along with, generally, a helper) is introduced by conventional methods (such as microinjection) into an egg or early embryo; and the transgene(s) become integrated at a transposon site (such as sequences corresponding to the inverted terminal repeat of the transposon) in the insect genome.
- transposon-based vectors include, e.g., Minos, mariner, Hermes, sleeping beauty, and piggyBac.
- the vector is a piggyBac vector.
- TTAA-specific, short repeat elements feature in a group of transposons (Class II mobile elements) that have similar structures and movement properties.
- a typical piggyBac vector (formerly IFP2) is the most extensively studied of these insertion elements.
- piggyBac is 2.4 kb long and terminates in 13 bp perfect inverted repeats, with additional internal 19 bp inverted repeats located asymmetrically with respect to the ends (Cary et al. (1989) Virology. 172,156-69).
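- A "perfect inverted repeat" at the termini of an element means that the last n bases read as the reverse complement of the first n bases. The following Python sketch checks that property for a given sequence; the function names are hypothetical, the default of 13 bp simply mirrors the terminal repeat length mentioned above, and any sequence passed in for testing would be a toy string rather than the actual piggyBac sequence, which is not reproduced here.

```python
# Translation table mapping each DNA base to its complement.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return seq.translate(_COMPLEMENT)[::-1]


def has_terminal_inverted_repeats(seq: str, repeat_len: int = 13) -> bool:
    """True if the last repeat_len bases are the reverse complement of the
    first repeat_len bases, i.e. the termini form a perfect inverted repeat.
    """
    if repeat_len <= 0:
        raise ValueError("repeat_len must be positive")
    if len(seq) < 2 * repeat_len:
        return False
    left = seq[:repeat_len].upper()
    right = seq[-repeat_len:].upper()
    return right == reverse_complement(left)
```

A toy element can be built as `left + spacer + reverse_complement(left)` for any 13-base `left`, and it will satisfy the check regardless of the spacer.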
- a piggyBac vector may encode a trans-acting transposase that facilitates its own movement; alternatively, these sequences can be deleted and this function can be supplied by a helper plasmid or virus.
- Non-essential genes have been deleted from piggyBac, allowing for the cloning of inserts as large as about 15 kb into certain piggyBac vectors. This allows, for example, for the insertion of about six or seven genes with their expression control sequences.
- a collection of enzymes involved in N-glycan processing, marker proteins, or the like can be introduced together via a single transposon vector, into a single site in an insect genome.
- Several piggyBac vectors have been developed for insect transgenesis.
- constructs defined as minimal constructs for the movement of piggyBac vectored sequences were developed by analysis of deletion mutations both within and outside of the boundaries of the transposon (Li et al. (2001) Mol. Genet. Genomics 266, 190-8). Using constructs such as these, it is possible to increase the amount of genetic material mobilized by the piggyBac transposase by minimizing the size of the vector.
- the minimal requirements for movement include the 5' and 3' terminal repeat domains and attendant TTAA target sequences.
- piggyBac can transpose in insect cells while carrying a marker gene, and movement of the piggyBac element can occur in cells from lepidopteran species distantly related to the species from which it was originally isolated.
- piggyBac has been shown to be capable of transforming Drosophila melanogaster, Anastrepha suspensa, Bactrocera dorsalis, Bombyx mori, Pectinophora gossypiella, Tribolium castaneum, and several mosquito species. At least three lepidopteran species, Pectinophora gossypiella, Trichoplusia ni and Bombyx mori, have been successfully transformed by the piggyBac element.
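Because the minimal requirements noted above include TTAA target sequences, candidate piggyBac insertion sites in a sequence can be listed by scanning for that tetranucleotide. The following is a minimal Python sketch; the function name `find_ttaa_sites` and the example fragment are hypothetical illustrations and are not taken from this disclosure.

```python
def find_ttaa_sites(sequence: str) -> list[int]:
    """Return 0-based start positions of every TTAA tetranucleotide.

    Overlapping occurrences are reported. TTAA is its own reverse
    complement, so scanning one strand covers both strands.
    """
    seq = sequence.upper()
    sites = []
    start = seq.find("TTAA")
    while start != -1:
        sites.append(start)
        start = seq.find("TTAA", start + 1)
    return sites


# Hypothetical genomic fragment used only for illustration.
example_fragment = "GCGTTAACCGATTAATTAAGC"
print(find_ttaa_sites(example_fragment))  # [3, 11, 15]
```

The scan only identifies sequence motifs; it makes no prediction about which sites a transposase would actually use.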
- a helper virus or plasmid that expresses a transposase is co-introduced with the transposon-based vector as above. Expression of the transposase is determined by the choice of promoter for the insect system being tested. Toward that end, several promoter-driven helper constructs that are useful for lepidopteran transformation, including constructs driven by the Drosophila hsp70 promoter, the baculovirus ie1 promoter, and the Drosophila Actin 5C promoter, have been constructed.
- Methods for introducing constructs into an embryo to generate a transgenic insect are conventional. Survivorship is usually quite high (up to 75%) for microinjected embryos.
- preblastoderm eggs are stuck with a fine glass capillary holding a solution of the plasmid DNA and/or the recombinant virus.
- G0 larvae hatched from the virus-injected eggs are then screened for expression of the gene of interest. Breeding transgenic G1s with normal insects yields transgenic offspring according to the rules of Mendelian inheritance.
- Once a transgene(s) is stably integrated into the genome of an insect egg or early embryo, conventional methods can be used to generate a transgenic insect, in which the transgene(s) is present in all of the insect's somatic and germ cells.
- When a subset of the complete set of enzymes involved in N-glycan processing is present in a transgenic insect, other transposon-based vectors, which express different subsets of the genes encoding enzymes involved in N-glycan processing, can be introduced sequentially into the insect genome, and transgenic insects can then be generated.
- these insects can be genetically crossed to produce a transgenic insect that expresses a larger subset, or a complete set, of the genes encoding enzymes involved in N-glycan processing.
- the transgenic insects are heterozygous for the modifying enzyme genes.
- when potentially toxic genes are expressed constitutively, it may be advantageous for the insects to be heterozygous, to limit the amount of the enzyme that is produced.
- the insects are homozygous for the transgenes. Methods for producing homozygous transgenic insects (e.g., using suitable back-crosses) are conventional.
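The Mendelian expectations behind the heterozygous and homozygous lines described above can be sketched for a single transgene locus. This is a simplified illustration that assumes one autosomal insertion; the genotype notation ("T" for the transgene allele, "+" for the wild-type allele) and the function name `cross` are hypothetical.

```python
from collections import Counter
from itertools import product


def cross(parent_a: str, parent_b: str) -> dict[str, float]:
    """Expected offspring genotype frequencies for a one-locus cross.

    Each parent is a two-character genotype such as "T+" (heterozygous),
    "TT" (homozygous transgenic) or "++" (wild type).
    """
    offspring = Counter(
        "".join(sorted(allele_a + allele_b))
        for allele_a, allele_b in product(parent_a, parent_b)
    )
    total = sum(offspring.values())
    return {genotype: count / total for genotype, count in offspring.items()}


# Heterozygous transgenic x wild type: half the offspring carry the transgene.
print(cross("T+", "++"))
# Heterozygous x heterozygous: one quarter are expected to be homozygous.
print(cross("T+", "T+"))
```

Genotypes are stored with alleles sorted, so a heterozygote always appears as "+T".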
- Another embodiment of the invention is an isolated cell, or progeny thereof, derived from a transgenic insect of the invention.
- Suitable cells include isolated germ line cells, and cells that can be used for the in vitro production of a glycoprotein exhibiting a partial or complete pattern of mammalian glycosylation. Methods for obtaining and propagating cells from a transgenic insect, and using them (e.g., to generate more insects, or to generate glycosylated proteins) are conventional.
- transgenic insects discussed above can be used to produce glycoproteins of interest that exhibit partial or complete patterns of mammalian glycosylation.
- the insects can be used in methods for glycosylating polypeptides in a mammalian (human) glycosylation pattern.
- Suitable virus-based vectors include, e.g., baculovirus vectors (such as vectors based on Autographa californica NPV, Orgyia pseudotsugata NPV, Lymantria dispar NPV, Bombyx mori NPV, Trichoplusia ni NPV, Spodoptera exigua NPV, Heliothis zea NPV, Galleria mellonella NPV, Anagrapha falcifera NPV, Trichoplusia ni SNPV); retroviral vectors; and viral vectors that comprise transposon recognition sequences (e.
- baculovirus-based vectors have been generated (or can be generated without undue experimentation) that allow the cloning of large numbers of inserts, at any of a variety of cloning sites in the viral vector.
- more than one heterologous polypeptide may be introduced together into a transgenic insect cell or insect of the invention.
- the viral vector can be introduced into an insect cell or insect by conventional methods, such as by in vitro inoculation (insect cells) or oral ingestion (insect larvae).
- the baculovirus replicates until the host insect is killed.
- the insect cell or insect lives long enough to produce large amounts of the glycosylated polypeptide of interest.
- a baculovirus is used that is attenuated or non-permissive for the host. In this case, the host is not killed by replication of the baculovirus, itself (although the host may be damaged by the expression of the enzymes involved in N-glycan processing and/or the heterologous protein of interest).
- sequences encoding one or more recombinant proteins of interest, operably linked to an expression control sequence, are cloned into a suitable transposon-based vector (such as a piggyBac vector).
- transposon-based vectors can carry large inserts, so more than one heterologous polypeptide may be introduced together into a transgenic insect of the invention.
- Transposon-based vectors may on occasion insert into the DNA of somatic cells, and thus be stably expressed for relatively long periods of time.
- sequences encoding one or more recombinant proteins of interest, operably linked to an expression control sequence, are cloned into a retrovirus vector, or any other suitable virus vector.
- Such a construct may insert into the DNA of somatic cells, and thus be stably expressed for relatively long periods of time.
- the invention also provides short double-stranded RNA sequences which hybridize to SEQ ID NO: 1 and function to downregulate the expression of the same in insect cells by an RNAi-dependent mechanism.
- Sf9 cells which are a subclone of the IPLB-Sf21-AE cell line derived from S. frugiperda ovaries (14), were routinely maintained as shake flask cultures in either TNM-FH medium containing 10% fetal bovine serum (HyClone, Logan, UT) or ESF 921 serum-free medium (Expression Systems, CA), as described previously (18).
- Molecular cloning of an fdl gene homolog from Sf9 cells: The A. aegypti, A. gambiae, A. mellifera, B. mori, D. pseudoobscura and T. castaneum genomic databases were searched through the NCBI website using tBLASTn (19) with the derived amino acid sequence of Dm-FDL isoform C (Accession No. NM_165909) as the query. These searches identified exons from each species that encoded fragments of putative processing β-N-acetylglucosaminidases. These were joined in silico using an online splice site prediction algorithm available through the NetGene2 Server hosted by the Technical University of Denmark (20) to obtain contiguous open reading frames from each species. The predicted amino acid sequences were then aligned using CLUSTALX version 1.83 (21) with the default settings.
- oligonucleotide primers were then used for polymerase chain reactions (PCRs; 22) with both cDNA and genomic DNA prepared from Sf9 cells as the templates.
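Degenerate primers of the kind used for these PCRs are mixtures of related sequences written with IUPAC ambiguity codes. The sketch below shows how such a primer expands into its individual sequences and how its degeneracy is counted. The primer shown is a made-up example, not one of the primers in Table 1, and the function names are hypothetical.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def degeneracy(primer: str) -> int:
    """Number of distinct sequences represented by a degenerate primer."""
    count = 1
    for base in primer.upper():
        count *= len(IUPAC[base])
    return count


def expand_degenerate(primer: str) -> list[str]:
    """List every unambiguous sequence encoded by a degenerate primer."""
    choices = (IUPAC[base] for base in primer.upper())
    return ["".join(combo) for combo in product(*choices)]


# Hypothetical primer, for illustration only.
example_primer = "GGNTAYRA"
print(degeneracy(example_primer))             # 16
print(expand_degenerate(example_primer)[:4])
```

Higher degeneracy lowers the effective concentration of any single primer species, which is one reason such reactions are often run at relatively high total primer concentrations.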
- Genomic DNA was isolated from log phase cultures of uninfected Sf9 cells by a standard method (23).
- cDNA was prepared from 5 µg of GeneRacer™ oligo-dT-primed total RNA using Superscript™ III reverse transcriptase with the commercial GeneRacer™ kit (Invitrogen, Carlsbad, CA) according to the manufacturer's protocol and diluted to a final volume of 50 µL.
- the PCRs were performed in a total volume of 50 µL containing the manufacturer's high fidelity (HF) buffer plus 0.2 mM of each dNTP, 2 U of Phusion™ DNA polymerase (Promega, Madison, WI), 1 µM of each degenerate primer, and either ~100 ng of Sf9 genomic DNA or 2 µL of the cDNA preparation described above.
- the reactions were incubated for 2 min at 98°C, then cycled 14 times using (i) 20 sec at 98°C, (ii) 20 sec at 76 to 62°C (with a decreasing temperature gradient of 1°C per cycle), and (iii) 30 sec at 72°C.
- the reactions were cycled another 30 times using (i) 20 sec at 98°C, (ii) 20 sec at 62°C, and (iii) 20 sec at 72°C, and finally incubated for 5 min at 72°C in a GeneAmp Model 2400 thermal cycler (Eppendorf, Foster City, CA).
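The touchdown program described above can be written out as a per-cycle list of annealing temperatures. This is a minimal sketch under one reading of the text, namely that the 14 touchdown cycles start at 76°C and drop 1°C per cycle (ending at 63°C), after which the 30 plateau cycles anneal at 62°C. The function and parameter names are hypothetical.

```python
def touchdown_annealing_temps(start_c, step_c, touchdown_cycles,
                              plateau_c, plateau_cycles):
    """Return the annealing temperature (in °C) used in each cycle, in order."""
    touchdown = [start_c - step_c * i for i in range(touchdown_cycles)]
    plateau = [plateau_c] * plateau_cycles
    return touchdown + plateau


# Assumed reading of the program above: 14 touchdown cycles, then 30 at 62°C.
temps = touchdown_annealing_temps(76, 1, 14, 62, 30)
print(len(temps))           # 44 cycles in total
print(temps[0], temps[13])  # 76 63 (first and last touchdown cycle)
print(temps[14])            # 62 (first plateau cycle)
```

The same helper accepts fractional steps, so a program with a 0.5°C decrement per cycle can be listed the same way.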
- the spent reactions were separated on 1.2% agarose gels and specific amplification products of about the expected size (420 bp) were recovered from the gel, purified using the QiaQuick™ Gel Extraction Kit (Qiagen,
- the reactions were incubated for 4 min at 95°C, cycled 12 times using (i) 30 sec at 95°C, (ii) 30 sec at 72 to 61°C (with a decreasing temperature gradient of 1°C per cycle), and (iii) 120 sec at 72°C.
- the reactions were cycled another 30 times using (i) 30 sec at 95°C, (ii) 20 sec at 61°C, and (iii) 120 sec at 72°C, and finally incubated for 5 min at 72°C.
- the spent reactions were analyzed on a 1% agarose gel and an amplification product of approximately 1.4 kb in size was purified and used as the template for nested PCRs under the same conditions used for the primary PCRs, except the nested reactions included the SFFDLASP3 (Table 1) and GeneRacer™ 5'-nested primers and the annealing temperature was 65°C.
- the spent reactions were analyzed on a 0.9% agarose gel and the 1.4 kb amplification product was purified and directly sequenced using the SFFDLASP3, SFFDLASP4 (Table 1), and GeneRacer™ 5'-nested primers.
- the spent reaction was analyzed on a 1% agarose gel, and the 1.2 kb amplification product was purified and used as the template for nested PCRs with 0.8 U of Phusion™ DNA polymerase, 200 nM of the SFFDLSP4 and GeneRacer™ 3'-nested primers, and 1 M betaine in a total final volume of 50 µL of Phusion™ GC buffer.
- Sf9 cDNA was simultaneously produced and primed with the Sf-fdl gene-specific primer SFFDLCDNAASP (Table 1) using Superscript™ III reverse transcriptase (Invitrogen) according to the method of Shi et al. (24). Either 1.0 µL of this cDNA preparation or approximately 100 ng of Sf9 genomic DNA was then used as the template for PCRs containing 0.5 U of Phusion™ DNA polymerase, 1 M betaine, 0.2 mM of each dNTP and 200 nM of the SFFDLCDNAASP and SFFDLFL50SP primers (Table 1) in Phusion™ GC buffer.
- Baculovirus transfer plasmids encoding full-length, untagged Dm-FDL or Sf-FDL were produced by using PCR to amplify the appropriate nucleotide sequences.
- the Sf-FDL coding sequence was assembled by producing two PCR amplimers with partially overlapping sequences, isolating the products, and then using them as templates for a third PCR designed to produce an amplimer encoding the full-length Sf-FDL protein.
- the 3'-end of the Sf-fdl open reading frame was amplified from Sf9 cDNA prepared as described above in a PCR with 0.3 U of Phusion™ DNA polymerase, 0.2 mM of each dNTP, 1 M betaine, and 0.67 µM of the SFFDLSP1 and SFFDLFL31ASP primers (Table 1) in Phusion™ GC buffer.
- the reactions were incubated for 2 min at 98°C, cycled 45 times using (i) 20 sec at 98°C, (ii) 20 sec at 67°C for the first five cycles, 62°C for the next five cycles, 57°C for the next five cycles, and 54°C for the final 30 cycles, (iii) 40 sec at 72°C, and finally incubated for 2 min at 72°C.
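As a consistency check on the step-down program above, the annealing blocks can be tabulated to confirm that they sum to 45 cycles and to estimate the total programmed hold time. This is a rough sketch; it ignores ramp times between temperatures, which depend on the instrument, and the helper name `hold_time_seconds` is hypothetical.

```python
def hold_time_seconds(initial_s, blocks, final_s):
    """Sum programmed hold times; ramping between temperatures is ignored.

    blocks is a list of (cycles, (denature_s, anneal_s, extend_s)) tuples.
    """
    cycling = sum(cycles * sum(step_seconds) for cycles, step_seconds in blocks)
    return initial_s + cycling + final_s


# Blocks differ only in annealing temperature: 67, 62, 57, then 54°C.
step_down_blocks = [
    (5, (20, 20, 40)),   # anneal at 67°C
    (5, (20, 20, 40)),   # anneal at 62°C
    (5, (20, 20, 40)),   # anneal at 57°C
    (30, (20, 20, 40)),  # anneal at 54°C
]

total_cycles = sum(cycles for cycles, _ in step_down_blocks)
total_s = hold_time_seconds(120, step_down_blocks, 120)
print(total_cycles)   # 45
print(total_s / 60)   # 64.0 minutes of programmed holds
```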
- One µL of the spent reaction was used as the template for a nested PCR under essentially the same conditions, except the primers were SFFDLSP2 and SFFDLFL31ASP (Table 1).
- the spent secondary PCR was analyzed on a 1.2% agarose gel and the amplification product with the expected size was excised and purified as described above.
- the 5'-end of the Sf-fdl open reading frame was amplified using 1.0 µL of the spent nested 5'-RACE reaction described above as the template for a PCR with 0.5 U of Phusion™ DNA polymerase, 0.2 mM of each dNTP, 1 M betaine, and 1 µM of the SFFDLASP3 and SFFDLFL51SP primers (Table 1) in Phusion™ GC buffer.
- This reaction was incubated for 1 min at 98°C, cycled 13 times using (i) 30 sec at 98°C, (ii) 20 sec at 65 to 53°C (with a decreasing temperature gradient of 1°C per cycle), and (iii) 60 sec at 72°C, cycled another 30 times using (i) 30 sec at 98°C, (ii) 20 sec at 52°C, and (iii) 60 sec at 72°C, and finally incubated for 2 min at 72°C.
- the spent reaction was analyzed on a 1.0% agarose gel and the amplification product with the expected size was excised and purified as described above.
- This reaction was incubated for 1 min at 98°C, cycled four times using (i) 30 sec at 98°C and (ii) 90 sec at 72°C, cycled another 25 times using (i) 30 sec at 98°C, (ii) 20 sec at 52°C, and (iii) 80 sec at 72°C, and finally incubated for 2 min at 72°C.
- the spent reaction was analyzed on a 1.0% agarose gel, and the amplification product of the expected size was excised, purified and cloned into pENTR™/D-TOPO® according to the manufacturer's protocol.
- Sequencing revealed two clones that each had single, but different, non-synonymous mutations, and these were used to assemble a plasmid designated pENTR™/D-TOPO®-Sf-fdl-FL encoding the full-length, wild type Sf-FDL protein.
- the Dm-fdl open reading frame was amplified from 50 ng of a plasmid designated pIEBac-CG8824Myc in a PCR with 2 U of Phusion™ DNA polymerase, 0.2 mM of each dNTP, and 0.1 µg of the FDLFLSP and FDLFLASP primers (Table 1) in Phusion™ HF buffer.
- This plasmid encodes the Drosophila melanogaster fdl gene open reading frame with a c-Myc epitope tag under the transcriptional control of a baculovirus IE1 promoter. See Geisler et al. (2008) J. Biol. Chem., 283: 11330-11339. These reactions were incubated for 1 min at 98°C, cycled 30 times using (i) 20 sec at
- Transfer plasmids encoding N-terminally GST-tagged ectodomains of the various β-N-acetylglucosaminidases examined in this study were also produced using PCR-based approaches. Generally, TMpred (25) was used to predict the sequences encoding the ectodomain of each protein, and then these sequences were amplified using primers designed to introduce SmaI and EcoRI sites on their 5'- and 3'-ends, respectively.
- each of the resulting PCR products was designed for subsequent directional cloning into the SmaI and EcoRI sites of the baculovirus transfer plasmid pAcSecG2T (BD Biosciences, San Jose, CA), to position the relevant coding sequences downstream and in-frame with the GST coding sequence in this vector.
- the predicted Sf-fdl ectodomain coding sequence was amplified using pENTR™/D-TOPO®-Sf-fdl-FL as the template for a PCR with 0.5 U of Phusion™ DNA polymerase, 0.2 mM of each dNTP, 1 M betaine, 0.2 µM of the SFFDLFL3N2ASP and 10 nM of the SFFDLGST51SP primers (Table 1) in Phusion™ GC buffer.
- the reaction was incubated for 1 min at 98°C, cycled four times using (i) 20 sec at 98°C, (ii) 20 sec at 58°C, and (iii) 90 sec at 72°C, after which primer SFFDLGST5N2SP was added to 0.2 µM, incubated for 1 min at 98°C, cycled another 30 times using (i) 20 sec at 98°C, (ii) 20 sec at 60°C, and (iii) 90 sec at 72°C, and finally incubated for 2 min at 72°C.
- the spent reaction was analyzed on a 1.0% agarose gel, and the amplification product of the expected size was excised and purified.
- the purified amplimer was then treated with 5 U of Taq DNA polymerase (New England Biolabs, Ipswich, MA) for 15 minutes in the presence of 0.2 mM dATP and the manufacturer's standard Taq buffer.
- the reaction product was cloned into pCR®2.1-TOPO® (Invitrogen) according to the manufacturer's instructions, yielding pCR®2.1-TOPO®-Sf-fdl-SOL.
- the predicted Dm-fdl ectodomain coding sequence was amplified using pENTR™/D-TOPO®-Dm-fdl-FL as the template for a PCR with 2 U of Phusion™ DNA polymerase, 0.2 mM of each dNTP, and 1 µM of the DMFDLGST3ASP and DMFDLGST5SP primers (Table 1) in Phusion™ HF buffer.
- the reaction was incubated for 1 min at 98°C, cycled five times using (i) 15 sec at 98°C, (ii) 20 sec at 50°C, and (iii) 75 sec at 72°C, cycled another 30 times using (i) 15 sec at 98°C, (ii) 20 sec at 64°C, and (iii) 75 sec at 72°C, and finally incubated for 2 min at 72°C.
- the amplimer was subsequently purified, Taq-treated, cloned, sequence-verified and subcloned as described above to produce the intermediate plasmid pCR®2.1-TOPO®-Dm-fdl-SOL and the final baculovirus transfer plasmid, pAcSecG2T-Dm-fdl-SOL.
- the predicted Sf-GlcNAcase3/SfHex ectodomain coding sequence was amplified using pENTR™/D-TOPO®-Sf-GlcNAcase3 as the template for a PCR with 2 U of Phusion™ DNA polymerase, 0.2 mM of each dNTP, and 1 µM of the SFGN3GST3ASPB and GN3GST5SPB primers (Table 1) in Phusion™ HF buffer, with cycling conditions identical to those used to generate the Dm-fdl ectodomain amplimer.
- the resulting product was purified, Taq-treated, cloned into pCR®4-TOPO®, sequence-verified, and subcloned as described above to produce the intermediate plasmid pCR®4-TOPO®-Sf-GlcNAcase3-SOL and the final baculovirus transfer plasmid, pAcSecG2T-Sf-GlcNAcase3-SOL.
- the transfer plasmids encoding GST-tagged β-N-acetylglucosaminidases were used to produce viruses by a standard allelic transplacement method (3,4) with Bsu36I-digested BacPAK6 viral DNA (26) as the target for homologous recombination.
- Each recombinant baculovirus vector was plaque-purified, amplified in Sf9 cells, and titered by plaque assay on Sf9 cells, as described previously (4).
- the recombinant viruses encoding various full-length, untagged β-N-acetylglucosaminidase genes were designated AcSfGlcNAcase-3 (18), AcDm-FDL, and AcSf-FDL and those encoding N-terminally GST-tagged ectodomains of the various β-N-acetylglucosaminidases were designated AcGSTSfGlcNAcase-3, AcGSTDm-FDL, and AcGSTSf-FDL, respectively.
- Sf9 cells were seeded into 100 mL of ESF 921 medium in 250 mL DeLong flasks (Corning Glass Works, Corning, NY) and allowed to grow to a density of about 1.5-2.0 × 10^6 cells/mL at 28°C and 125 rpm in a Forma Model 4580 rotary platform shaker-incubator (Forma Scientific, Inc., Marietta, OH). The cells were then infected with the appropriate baculovirus at a multiplicity of infection of about 1 plaque-forming unit/cell and incubated for another 72 h under the same conditions.
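The volume of virus stock needed to reach the stated multiplicity of infection follows directly from the cell density, the culture volume and the stock titer. The sketch below illustrates that arithmetic; the titer of 1 × 10^8 pfu/mL is an assumed value chosen only for the example, since the titers of the stocks are not given here, and the function name is hypothetical.

```python
def inoculum_volume_ml(cells_per_ml, culture_ml, moi, titer_pfu_per_ml):
    """Volume of virus stock (mL) that delivers `moi` pfu per cell."""
    total_cells = cells_per_ml * culture_ml
    return total_cells * moi / titer_pfu_per_ml


# Assumed stock titer, for illustration only.
ASSUMED_TITER = 1.0e8  # pfu/mL

# 100 mL cultures at the low and high ends of the stated density range.
for density in (1.5e6, 2.0e6):
    volume = inoculum_volume_ml(density, 100, 1, ASSUMED_TITER)
    print(f"{density:.1e} cells/mL -> {volume:.2f} mL of stock")
```

With the assumed titer, the two densities correspond to about 1.5 mL and 2.0 mL of stock, respectively.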
- Isolation of purified microsomal fractions: The isolation of microsomal fractions from baculovirus-infected Sf9 cells has been described previously (18). Briefly, the cells were Dounce-homogenized and microsomes were isolated by ultracentrifugation onto sucrose cushions.
- microsomes were solubilized in β-N-acetylglucosaminidase assay buffer (100 mM citrate-phosphate buffer, pH 6.0) containing 0.5% (v/v) Triton-X-100, total protein concentrations were determined using a commercial bicinchoninic acid assay (Pierce Biotechnology Inc., Rockford, IL), and samples containing equal amounts of total protein were assayed for β-N-acetylglucosaminidase activity, as described below.
- microsomes were either held or sonicated on ice with ten pulses from a Branson Model 450 Sonifier (Danbury, CT) adjusted to 50% output. The microsomes were then pelleted by centrifugation for 10 min at top speed in a microcentrifuge (Hermle Model Z180M) and the pellets were resuspended in β-N-acetylglucosaminidase assay buffer.
- the resulting supernatant was diluted with an equal volume of ice-cold GST purification buffer (25 mM Tris, 150 mM NaCl, 1 mM EDTA, pH 8.0), solid ammonium sulfate was added to 90% saturation, and the samples were stirred on ice until the salt was fully dissolved.
- the samples were subsequently ultracentrifuged for 20 minutes in a Ti45 rotor at 30,000 rpm and 4°C and the resulting pellet was re-dissolved in a minimal volume of GST purification buffer.
- the samples were then transferred to dialysis tubing with a 50 kDa molecular weight cutoff (Spectrum Medical Industries Inc.; Laguna Hills, CA) and dialyzed overnight at 4°C against 100 volumes of GST purification buffer supplemented with 1 mM phenylmethylsulfonyl fluoride (PMSF).
- Each GST-tagged protein was then adsorbed to a 1.5 mL bed volume of Glutathione Sepharose 4 Fast Flow (GE Healthcare; Uppsala, Sweden) pre-equilibrated with GST purification buffer in a plugged 20 mL Econo-Pac column (BioRad; Hercules, CA) for one hour at 4°C on a shaking platform.
- the fluid was drained from the column, the affinity matrix was washed twice with 10 mL of GST purification buffer, and the GST-tagged proteins were eluted with GST purification buffer supplemented with 5 mM reduced glutathione.
- Fractions were collected and purity was assessed by SDS-PAGE with Coomassie Blue staining; the presence of the GST-tagged proteins was assessed by SDS-PAGE and immunoblotting with a GST-specific antiserum; and enzymatic activity was assessed using p-nitrophenyl-β-N-acetylglucosaminide as the substrate, as described previously (18).
- β-N-acetylglucosaminidase activity assays: Enzyme activity assays were performed using either solubilized microsomal fractions or affinity-purified recombinant proteins isolated from baculovirus-infected Sf9 cells. For the microsomal membrane assays, microsomes were prepared and extracted as described above and samples containing equal amounts of total protein were assayed in a total volume of 0.050 mL containing 25 pmol of various pyridylamine (PA)-tagged glycan substrates.
- the enzymatic activity of the affinity-purified recombinant proteins was assayed under identical conditions, except the amounts of purified protein used for these assays were equalized by immunoblotting, rather than by total protein assays.
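The substrate concentration implied by the assay conditions above (25 pmol of PA-tagged glycan in a total volume of 0.050 mL) can be obtained with a simple unit conversion, since 1 pmol/µL equals 1 µM. The helper below is a hypothetical illustration of that conversion.

```python
def micromolar(amount_pmol: float, volume_ml: float) -> float:
    """Concentration in µM for an amount in pmol and a volume in mL.

    1 pmol per µL is 1 µmol per L, i.e. 1 µM.
    """
    volume_ul = volume_ml * 1000.0
    return amount_pmol / volume_ul


substrate_um = micromolar(25, 0.050)
print(f"{substrate_um:.2f} µM")  # 0.50 µM
```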
- the substrates used in this study included GlcNAcβ2Manα6(GlcNAcβ2Manα3)Manβ4GlcNAcβ4GlcNAc-PA (GnGn;
- RNA interference: In general, the RNA interference approach used in this study involved transforming Sf9 cells with an immediate early expression plasmid encoding an inverted repeat derived from a portion of the Sf-fdl coding sequence, with the inverted repeat separated by a Drosophila melanogaster white gene intron, as originally described by Lee and Carthew (26).
- the Sf-fdl coding sequence from nucleotides 355 to 855 was amplified using pENTR™/D-TOPO®-Sf-fdl-FL as the template for a PCR with 0.5 U of Phusion™ DNA polymerase, 0.2 mM of each dNTP, and 1 mM each of the SFFDLRNAIASP and SFFDLRNAISP primers (Table 1), which introduced XbaI sites onto both ends, in Phusion™ HF buffer.
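The layout of an inverted-repeat RNAi cassette of the kind described above (a coding-sequence fragment, a spacer intron, and the reverse complement of the same fragment) can be sketched as follows. The sequences used here are placeholders, not the Sf-fdl or white intron sequences, and the arm order (sense, spacer, antisense) is an assumption made for illustration; the function names are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence (A/C/G/T only)."""
    return seq.translate(_COMPLEMENT)[::-1]


def inverted_repeat_cassette(cds: str, start: int, end: int, spacer: str) -> str:
    """Build arm + spacer + reverse-complemented arm.

    start and end are 1-based, inclusive nucleotide positions in cds.
    """
    arm = cds[start - 1:end]
    return arm + spacer + reverse_complement(arm)


# Toy example with placeholder sequences.
toy = inverted_repeat_cassette("ATGGCCAAGTTT", 4, 9, "NNNN")
print(toy)  # GCCAAGNNNNCTTGGC

# Nucleotides 355 to 855, inclusive, give a 501-nt arm.
placeholder_cds = "ACGT" * 300
arm_length = len(placeholder_cds[355 - 1:855])
print(arm_length)  # 501
```

When transcribed, the two arms of such a cassette can base-pair to form a hairpin, with the spacer forming the loop until it is spliced out.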
- the reaction was incubated for 30 seconds at 98°C, cycled five times using (i) 20 sec at 98°C, (ii) 20 sec at 54°C, and (iii) 30 sec at
- transfected cells were then selected and neomycin-resistant clones were isolated by limiting dilution, as described previously (29).
- the levels of specific, processing β-N-acetylglucosaminidase activity in the parental and transformed cells were finally compared by HPLC analysis of the products obtained by reacting microsomal membrane preparations with GnGn, as described above.
- PCRs yielded an amplification product of about the expected size (420 bp), which appeared to be specific because it was not observed in control reactions in which either one of the degenerate oligonucleotides was excluded (data not shown).
- This product was directly sequenced and the translation product was found to be highly similar to a fragment of the D. melanogaster and putative B. mori FDL proteins (data not shown). Accordingly, we used this sequence to design gene-specific primers for 5'- and 3'-RACE reactions, which yielded the nucleotide sequence of the full length, putative Sf-fdl open reading frame, as detailed above.
- the 5'-RACE reactions yielded a specific 1.4 kb amplification product, which overlapped with the sequence of the original degenerate PCR product, extended it by 1161 bp in the 5' direction, and included a potential translational initiation site (data not shown).
- the 3'-RACE reactions yielded a specific 1.0 kb amplification product, which also overlapped with the sequence of the original degenerate PCR product, extended it by 734 bp in the 3' direction, and encoded a translational termination site.
- a contiguous nucleotide sequence of 2319 bp was assembled by joining the sequences of the degenerate amplimer, the 5'-RACE product, and the 3'-RACE product. The accuracy of this sequence was confirmed by PCR with gene specific primers using both Sf9 cDNA and genomic DNA as the templates, followed by direct sequencing of the products, as described in Experimental Procedures.
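Joining overlapping products into one contiguous sequence, as described above for the degenerate amplimer and the two RACE products, amounts to merging sequences at their shared ends. The following is a simplified sketch that requires exact overlaps and uses toy sequences; it is not the procedure actually used to assemble the Sf-fdl sequence, and the function name `merge_by_overlap` is hypothetical.

```python
def merge_by_overlap(left: str, right: str, min_overlap: int = 3) -> str:
    """Merge two sequences at the longest exact suffix/prefix overlap."""
    longest = min(len(left), len(right))
    for k in range(longest, min_overlap - 1, -1):
        if left.endswith(right[:k]):
            return left + right[k:]
    raise ValueError("no overlap of the required length was found")


# Toy stand-ins for a 5' product, a central amplimer and a 3' product.
five_prime = "ATGGCCAAG"
central = "CAAGTTTGACC"
three_prime = "GACCTAA"

contig = merge_by_overlap(merge_by_overlap(five_prime, central), three_prime)
print(contig)  # ATGGCCAAGTTTGACCTAA
```

Real assemblies must also tolerate sequencing errors and ambiguous bases, which this exact-match sketch does not attempt.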
- the full-length Sf-fdl nucleotide sequence and theoretical amino acid sequence of the Sf-FDL polypeptide are shown in Fig. 1.
- the nucleotide sequence includes a single long open reading frame of 1896 bp, which has a GC content of 69%.
- the theoretical product of this open reading frame is a polypeptide consisting of 631 amino acids, which has a calculated molecular mass of 70,530 Da and a calculated isoelectric point of 7.18.
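The stated open reading frame length and polypeptide length are mutually consistent if the 1896 bp count includes the termination codon: 1896 / 3 = 632 codons, one of which is the stop codon. The short sketch below performs that check; treating the stop codon as part of the 1896 bp is an assumption, and the function name is hypothetical.

```python
def orf_protein_length(orf_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an open reading frame."""
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of three")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons


residues = orf_protein_length(1896)
print(residues)  # 631

# Average residue mass implied by the stated molecular mass of 70,530 Da.
print(round(70530 / residues, 1))
```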
- the theoretical protein also has an N-terminal transmembrane domain (underlined in Fig.
- the putative S. frugiperda FDL polypeptide appears to be a type II transmembrane protein with a short cytoplasmic tail.
- Sf-fdl gene encodes an N-glycan processing enzyme because all N-glycan processing enzymes characterized to date have been predicted or shown to be transmembrane proteins with type II topology (30-32).
- the putative Sf9 cell enzyme also includes two potential N-glycosylation sites, which are boxed in the amino acid sequence shown in Fig. 1.
- a phylogenetic analysis of the predicted Sf-fdl gene product showed that it is related to known hexosaminidases, including the human alpha (Acc. No. NM_000520; 33) and beta (Acc. No. NM_000521; 34) hexosaminidases, as well as SfGlcNAcase-1 (Acc. No. DQ249307; 18) and SfGlcNAcase-3/SfHex (Acc. No. DQ249309; 17,18), as expected (Fig. 2). Strikingly, however, this analysis also revealed that the predicted Sf-fdl gene product is much more closely related to the Dm-fdl gene product (Acc. No.
- the parental baculovirus (AcMNPV) was used as a negative control and recombinant baculoviruses encoding Dm-FDL (AcDm-FDL) or Sf-GlcNAcase-3/SfHex (AcSfGlcNAcase-3) were used to directly compare the enzymatic activities of the Sf-fdl, Dm-fdl, and Sf-GlcNAcase-3/SfHex gene products.
- Individual Sf9 cell cultures were infected with the appropriate baculoviruses and then crude microsomal membrane fractions were prepared and assayed for enzymatic activity with various PA-tagged glycans as substrates, as described above.
- 3D, top panel produced nearly as much chitobiose as the microsomes from AcDm-FDL- or AcSf-FDL-infected cells, suggesting that the apparent ability of these latter two enzymes to hydrolyze chitotriose was an artifact resulting from contaminating chitinase activity in the crude microsomal preparations.
- the pH optimum of the Sf-fdl gene product is 6.0, and it has nearly optimal activity at pH 6.5 as well (Fig. 4).
- the pH optimum of the Sf-fdl gene product is identical to that of the processing activity originally identified in microsomal fractions from Sf21 cells by Altmann and coworkers (9).
- the range of optimal or near-optimal pH values for this enzyme more clearly encompasses the range of pH values found within late secretory pathway compartments, such as the trans-Golgi network, than does that of the SfGlcNAcase-3/SfHex gene product.
- Sf-FDL and Dm-FDL are consistent with their proposed function in N-glycan processing and with the conclusion that the Sf-fdl gene encodes the membrane bound, processing β-N-acetylglucosaminidase activity originally identified in Sf21 cells by Altmann and coworkers (1995).
- PCRs were carried out in a final volume of 50 µL in 1X Phusion™ GC buffer with 0.2 mM of each dNTP, 1 µM of each primer, 1 M betaine, 0.6 U of Phusion™ DNA polymerase (NEB, Ipswich, MA) and 1 µL of template, except where indicated otherwise. All PCRs were carried out in a GeneAmp Model 2400 thermal cycler (Eppendorf, Foster City, CA). DNA extraction from agarose gel fragments was carried out using the QiaQuick™ Gel Extraction Kit (Qiagen, Valencia, CA) according to the manufacturer's instructions, with elution into 50 µL.
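The reagent volumes for a 50 µL reaction at the final concentrations listed above follow from the dilution relation C1·V1 = C2·V2. The sketch below works through that arithmetic. All stock concentrations (5X buffer, 10 mM dNTP mix, 10 µM primers, 5 M betaine) are assumed values chosen only for illustration and are not stated in this document; the names used are hypothetical.

```python
def stock_volume_ul(final_conc: float, stock_conc: float,
                    reaction_ul: float = 50.0) -> float:
    """Volume of stock (µL) giving final_conc in reaction_ul (same units)."""
    if stock_conc <= 0 or final_conc > stock_conc:
        raise ValueError("stock must be positive and at least the final concentration")
    return final_conc / stock_conc * reaction_ul


# reagent: (final concentration, ASSUMED stock concentration)
ASSUMED_RECIPE = {
    "GC buffer (X)": (1.0, 5.0),
    "each dNTP (mM)": (0.2, 10.0),
    "forward primer (µM)": (1.0, 10.0),
    "reverse primer (µM)": (1.0, 10.0),
    "betaine (M)": (1.0, 5.0),
}

volumes = {name: stock_volume_ul(final, stock)
           for name, (final, stock) in ASSUMED_RECIPE.items()}
for name, volume in volumes.items():
    print(f"{name}: {volume:.1f} µL")

# Whatever remains is available for polymerase, template and water.
remaining_ul = 50.0 - sum(volumes.values())
print(f"remaining: {remaining_ul:.1f} µL")
```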
- Genomic DNA was isolated from T. ni cells (Tn-4h cell line) according to the method of Laird et al. (Laird et al., 1991, Nucleic Acids Res. 19:4293). Degenerate PCRs were carried out using T. ni genomic DNA with the primers ASPDEG and SPDEG as described previously (Geisler et al., 2008, J. Biol. Chem. 283: 11330-11339). The spent reactions were separated on a 1.2% agarose gel and specific amplification products of the expected size (420 bp) were recovered from the gel, purified and directly sequenced with the same primers as used in the PCR.
- the PCR was incubated for 20 sec at 98°C, then cycled 25 times using (i) 10 sec at 98°C, (ii) 15 sec at 72 to 60°C (with a decreasing temperature gradient of 0.5°C per cycle), and (iii) 60 sec at 72°C.
- the reaction was cycled another 30 times using (i) 10 sec at 98°C, (ii) 15 sec at 60°C, and (iii) 60 sec at 72°C, and finally incubated for 2 min at 72°C.
- the spent reaction was separated on a 1.4% agarose gel and a specific amplification product of about the expected size (1100 bp) was recovered from the gel and purified.
- This DNA fragment was re-amplified using the TnFDL ASP3 and TnFDL SP4DEG primers using the same conditions, gel purified and directly sequenced using the same primers as used in the PCR.
- a semi-degenerate PCR was carried out using primers TnFDL SP1 and TnFDL ASP6DEG with identical cycling conditions as specified above.
- the spent reaction was separated on a 1.4% agarose gel and a specific amplification product of about the expected size (730 bp) was recovered from the gel and purified. This fragment was cloned into pCR®2.1-TOPO® according to the manufacturer's instructions, and three clones were sequenced to yield a consensus sequence.
- RNA was isolated from a mid-log culture of Tn-4h cells using the Qiagen RNeasy® Plus Mini Kit according to the manufacturer's instructions. 5' RACE-ready RNA was prepared from total RNA using the Invitrogen GeneRacer™ kit according to the manufacturer's instructions. Reverse transcription was carried out using Thermo-X™ reverse transcriptase with the TnFDL ASP1 primer. The reaction was set up according to the manufacturer's instructions and incubated for (i) 5 min at 50°C, (ii) 15 min at 55°C, (iii) 30 min at 60°C and finally for (iv) 15 min at 60°C. The reaction was diluted with 40 µL of TE buffer and stored at -20°C.
- 5' RACE was carried out using the TnFDL ASP6 primer and the GeneRacerTM 5' Primer.
- the PCR was incubated for 30 sec at 96°C, then cycled 5 times using (i) 20 sec at 96 0 C, (ii) 60 sec at 72°C, after which the reaction was cycled 13 times using (i) 20 sec at 96 0 C, (ii) 20 sec at 72 to 6O 0 C (with a decreasing temperature gradient of I 0 C per cycle), and (iii) 40 sec at 72 0 C.
- the reaction was then cycled another 30 times using (i) 20 sec at 90 0 C, (ii) 20 sec at 6O 0 C, and (iii) 40 sec at 72°C.
- the spent reaction was separated on a 1.4% agarose gel, and a specific amplification product of about 520 bps was isolated and purified.
- This DNA fragment was re-amplified using the GeneRacer™ 5' Nested Primer and either the TnFDL ASP6 or ASP7 primer. Reactions were incubated for 30 sec at 96°C, then cycled 5 times using (i) 20 sec at 96°C, (ii) 45 sec at 72°C, after which the reactions were cycled 13 times using (i) 20 sec at 96°C, (ii) 15 sec at 72 to 60°C (with a decreasing temperature gradient of 1°C per cycle), and (iii) 30 sec at 72°C.
- the reactions were then cycled another 30 times using (i) 20 sec at 96°C, (ii) 15 sec at 60°C, and (iii) 30 sec at 72°C.
- the spent reactions were separated on a 1.4% agarose gel, and specific amplification products of about 520 and 500 bps were isolated and directly sequenced using the TnFDL ASP6 or ASP7, respectively.
- 3' RACE-ready cDNA was prepared from total T. ni RNA isolated as described above. Reverse transcription was carried out using Thermo-X™ reverse transcriptase with the GeneRacer™ Oligo dT primer. The reaction was set up according to the manufacturer's instructions and incubated in the same fashion as for 5' RACE. The reaction was diluted with 40 µL of TE buffer and stored at -20°C.
- 3' RACE was carried out using the TnFDL SP4 primer and the GeneRacer™ 3' Primer.
- the PCR was incubated for 30 sec at 96°C, then cycled 5 times using (i) 20 sec at 96°C, (ii) 45 sec at 72°C, after which the reactions were cycled 13 times using (i) 20 sec at 96°C, (ii) 15 sec at 72 to 60°C (with a decreasing temperature gradient of 1°C per cycle), and (iii) 30 sec at 72°C.
- the reactions were then cycled another 30 times using (i) 20 sec at 96°C, (ii) 15 sec at 60°C, and (iii) 30 sec at 72°C.
- the spent reaction was separated on a 1.4% agarose gel, and a specific band of approximately 600 bps was isolated and purified.
- This DNA fragment was re-amplified using the TnFDL SP5 primer and the GeneRacer™ 3' Nested Primer.
- the PCRs were incubated for 15 sec at 96°C, then cycled 5 times using (i) 15 sec at 96°C, (ii) 35 sec at 72°C, after which reactions were cycled 13 times using (i) 15 sec at 96°C, (ii) 15 sec at 72 to 60°C (with a decreasing temperature gradient of 1°C per cycle), and (iii) 20 sec at 72°C.
- the reactions were then cycled another 30 times using (i) 15 sec at 96°C, (ii) 15 sec at 60°C, and (iii) 20 sec at 72°C.
- the spent reactions were separated on a 1.4% agarose gel, and a specific amplification product of 500 bps was isolated, purified and directly sequenced using the TnFDL SP5 primer.
- the full-length open reading frame was amplified from both cDNA primed with the GeneRacer™ Oligo dT Primer and genomic DNA (including the intron) using the TnFDL FL SP2 BD and TnFDL ASP BD primers.
- the reactions were incubated for 20 sec at 98°C, then cycled 25 times using (i) 15 sec at 98°C, (ii) 10 sec at 72 to 60°C (with a decreasing temperature gradient of 0.5°C per cycle), and (iii) 60 sec at 72°C.
- the reactions were then cycled another 30 times using (i) 15 sec at 98°C, (ii) 10 sec at 60°C, and (iii) 60 sec at 72°C.
- the spent reactions were separated on a 1% agarose gel, and amplification products of the expected size were excised and purified.
- These DNA fragments from the reactions templated with cDNA and gDNA were cloned into the pENTR™/D-TOPO® vector according to the manufacturer's instructions, yielding pENTR-TnFDL-C and pENTR-TnFDL-G, respectively.
- Four clones of each were sequenced, and a consensus clone of pENTR-TnFDL-C was used with Invitrogen's BaculoDirect™ kit according to the manufacturer's instructions to yield AcTnFDL.
- Bombyx mori genomic database search results: A tBLASTn search of the available Bombyx mori genomic sequences was carried out with the SfFDL conceptual translation as query using the online NCBI interface. This search yielded, amongst others, the sequences BAAB01046610, BAAB01083831 and BAAB01153187.
- the sequence BAAB01046610 encodes a putative 5' coding exon with a start codon (nts 25-200). The conceptual translation of this exon shows high similarity to the amino-terminal part of SfFDL.
- BAAB01153187 could be joined in silico to yield a contig encoding the putative 3' coding exon, including a stop codon.
- the conceptual translation of this exon showed high similarity to the carboxy-terminal part of SfFDL.
- the 5' coding exon could be joined in silico to the 3' coding exon at splice junctions predicted with high probability by NetGene2 (Hebsgaard et al., Nucleic Acids Res. 24:3439-3452), yielding a contiguous open reading frame.
- Genomic DNA was prepared by a modification of the method of Laird et al. (supra) from a single stage 2 B. mori larva (Qiufeng/Baiyu hybrid). Briefly, the larva was homogenized in lysis buffer supplemented with RNAse A, after which the homogenate was incubated at 55°C for 1 hour. The lysate was then centrifuged at 13,000 x g to remove debris, and DNA was precipitated by addition of an equal volume of isopropyl alcohol.
- the predicted full-length open reading frame was amplified from both cDNA primed with GeneRacer™ Oligo dT Primer and genomic DNA (including the intron).
- the PCRs were set up using the BmFDL FL SP2 and BmFDL ASP1CLO primers and incubated in the same fashion as for the amplification of the full-length TnFDL open reading frame.
- the spent reactions were separated on a 1% agarose gel, and bands of the expected size were isolated and purified.
- the DNA fragments from the reactions templated with genomic DNA and cDNA were cloned into the pENTR™/D-TOPO® vector according to the manufacturer's instructions, yielding pENTR-BmFDL-G and pENTR-BmFDL-C, respectively.
- Four clones of each were sequenced, yielding two distinct alleles from both gDNA and cDNA.
- the conceptual translation of one of these alleles is identical to the conceptual translation of the putative fdl gene identified from the p50 (Daizo) strain.
- the two alleles differ from each other at several nucleotides in the intron and both exons.
- 5' RACE was carried out using the BmFDL ASP4 primer and the GeneRacer™ 5' Primer with 5' RACE-ready cDNA prepared as described above. Reactions were incubated for 30 sec at 96°C, then cycled 5 times using (i) 15 sec at 96°C, (ii) 45 sec at 72°C, after which the reactions were cycled 12 times using (i) 15 sec at 96°C, (ii) 15 sec at 72 to 61°C (with a decreasing temperature gradient of 1°C per cycle), and (iii) 30 sec at 72°C.
- the reactions were then cycled another 30 times using (i) 15 sec at 96°C, (ii) 15 sec at 61°C and (iii) 30 sec at 72°C, and finally incubated for 1 min at 72°C.
- the spent reactions were separated on a 1.2% agarose gel.
- a specific band of about 570 bps was isolated, purified and re-amplified using the BmFDL ASP5 primer and the GeneRacer™ 5' Nested Primer using the same cycling conditions.
- the nested 5' RACE reactions were separated on a 1.4% agarose gel, showing a specific band of the expected 550 bps. This band was excised, purified and sequenced using the BmFDL ASP5 primer.
- 3' RACE was carried out using the BmFDL SP4 primer and the GeneRacer™ 3' Primer with 3' RACE-ready cDNA. Reactions were cycled in the same fashion as described above for 5' RACE. The spent reaction was analyzed on a 1.4% agarose gel, showing a specific faint band at 450 bps. This band was excised, purified and used for nested 3' RACE reactions with the BmFDL SP5 primer and the GeneRacer™ 3' Nested Primer using the same cycling conditions. The spent reactions showed a strong, specific band at the expected size of 420 bps. This band was excised, purified and sequenced using the BmFDL SP5 primer.
- Microsomal membranes were then isolated from the parental cell line or the subclone and used for β-N-acetylglucosaminidase activity assays with GnGn as the substrate. HPLC analysis of the reaction products showed that the microsomal membranes from both the parental Sf9 cells and the transformed subclone converted GnGn to GnM, but not detectably to MM or MGn (data not shown). Thus, in this examination of endogenous β-N-acetylglucosaminidase activities in microsomal membranes from uninfected Sf9 cells, we detected only the specific, processing enzyme activity.
- SEQ ID NO:9 is specific for downregulating expression of the Sf-fdl-encoding nucleic acid
- provision of the sequence information for T. ni and B. mori homologs readily enables the skilled artisan to generate additional specific RNAi for inhibiting expression of the same. Indeed, computer programs are available online which can assist in the design of such molecules.
- an fdl gene from Sf9 cells and demonstrated that it encodes a membrane-associated β-N-acetylglucosaminidase with the same, strict substrate specificity exhibited by Dm-FDL and by the enzyme activity originally detected in S. frugiperda microsomes (9).
- Sf9 genome encodes a gene with a close phylogenetic relationship to Dm-fdl
- the fact that the Sf-fdl gene product is membrane-associated and has the strict substrate specificity and pH optimum profile of the original activity detected in S.
- the fdl gene orthologs were isolated from the lepidopteran insect cell species, Spodoptera frugiperda, Trichoplusia ni and Bombyx mori, as cell lines derived from these insect species are commonly used with the baculovirus expression system.
- compositions or formulations identified herein can, in alternate embodiments, be more specifically defined by any of the transitional phrases “comprising”, “consisting essentially of” and “consisting of”.
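The touchdown cycling programs in the PCR excerpts above lower the annealing temperature by a fixed decrement on each cycle before holding it constant for the remaining cycles. As a minimal sketch (the function name `touchdown_annealing` and its parameters are illustrative assumptions, not part of the specification), the per-cycle annealing temperatures can be enumerated as follows:

```python
from typing import List


def touchdown_annealing(start_c: float, step_c: float, cycles: int) -> List[float]:
    """Return the annealing temperature (in °C) used in each touchdown cycle.

    start_c: annealing temperature of the first cycle
    step_c:  decrement applied after every cycle
    cycles:  number of touchdown cycles
    """
    if cycles < 1:
        raise ValueError("cycles must be at least 1")
    if step_c < 0:
        raise ValueError("step_c must be non-negative")
    # Cycle i (0-based) anneals at start_c minus i decrements.
    return [start_c - step_c * i for i in range(cycles)]


if __name__ == "__main__":
    # 25 cycles stepping down by 0.5 °C per cycle, starting at 72 °C.
    schedule = touchdown_annealing(72.0, 0.5, 25)
    print(schedule[0], schedule[-1])
```

Under these assumptions, 25 cycles with a 0.5°C decrement starting at 72°C end at 60°C, and 13 cycles with a 1°C decrement cover the same 72 to 60°C range, which agrees with the cycle counts given in the excerpts.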
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Genetics & Genomics (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Microbiology (AREA)
- Medicinal Chemistry (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Virology (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Enzymes And Modification Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
A transgenic insect cell line for production of elevated levels of recombinant glycoproteins comprising mammalian-like N-glycans is provided. Also disclosed are nucleic acid sequences encoding β-N-acetylglucosaminidases.
Description
LEPIDOPTERAN INSECT N-ACETYLGLUCOSAMINIDASE GENES AND THEIR USE IN GLYCOENGINEERING
Inventors: Donald L. Jarvis
Christoph Geisler
CROSS-REFERENCE TO RELATED APPLICATIONS
The present application claims the benefit of U.S. Provisional Patent Application No. 61/013,815, filed December 14, 2007, the entire disclosure of which is incorporated by reference herein.
STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT
Pursuant to 35 U.S.C. §202(c), it is acknowledged that the U.S. Government has certain rights in the invention described, which was made in part with funds from NIH grant number GM49734.
FIELD OF THE INVENTION
This invention relates to the fields of molecular biology and production of proteins possessing complex type oligosaccharide side chains. More specifically, the invention provides novel nucleic acid sequences encoding β-N-acetylglucosaminidase enzymes and recombinant insect cell lines comprising the same for the production of therapeutic and commercially valuable glycoproteins.
BACKGROUND OF THE INVENTION
Several publications and patent documents are cited throughout the specification in order to describe the state of the art to which this invention pertains. Each of these citations is incorporated herein by reference as though set forth in full.
Insects and other lower eukaryotes, such as nematodes and plants, occupy an interesting evolutionary niche in glycobiology because they produce N-glycoproteins, but they typically process their N-linked glycans less extensively than mammals (1,2). This difference between lower and higher eukaryotic protein N-glycosylation pathways is biotechnologically significant because insects and plants are used to produce recombinant mammalian glycoproteins for many different biomedical research applications (3-7).
Insect and mammalian protein N-glycosylation pathways each begin with the co-translational transfer of N-glycan precursors to nascent proteins (1,8). These precursors are subsequently trimmed and elongated by enzymes localized in the endoplasmic reticulum and Golgi apparatus of insect and mammalian cells to produce a common intermediate with the structure Manα6(GlcNAcβ2Manα3)Manβ4GlcNAcβ4GlcNAc-R. In mammalian cells, this intermediate is elongated by various glycosyltransferases to produce complex N-glycans, which often have terminal sialic acid residues. In contrast, insect cells usually fail to elongate this same intermediate and convert it, instead, to paucimannose N-glycans with the core structure Manα6(Manα3)Manβ4GlcNAcβ4GlcNAc-R. An unusual β-N-acetylglucosaminidase is responsible for the production of these structures (9). This enzyme specifically removes the terminal N-acetylglucosamine residue from the α3 branch of Manα6(GlcNAcβ2Manα3)Manβ4GlcNAcβ4GlcNAc-R, simultaneously eliminating the intermediate required for N-glycan elongation and producing the core paucimannose glycan typically found on insect cell-derived N-glycoproteins. This same enzyme is also responsible for the production of core paucimannose N-glycans in nematodes (10,11) and plants (12,13). Thus, the presence of a processing β-N-acetylglucosaminidase is a key difference between the protein N-glycosylation pathways of lower and higher eukaryotes.
In the seminal insect study on this topic, Altmann and coworkers (9) demonstrated that IPLB-Sf21AE, a cell line derived from the lepidopteran insect S. frugiperda (14), has a membrane-associated β-N-acetylglucosaminidase activity that can specifically cleave the terminal N-acetylglucosamine residue from the α3 branch of a biantennary N-glycan in vitro. Subsequently, it was shown that cell lines derived from E. acrea, another lepidopteran insect, produced hybrid and complex N-glycans containing terminal N-acetylglucosamine or galactose residues because they lack this intracellular β-N-acetylglucosaminidase activity (15).
Together, these studies strongly supported the idea that the N-glycosylation pathway of at least some insect cells includes a processing β-N-acetylglucosaminidase, as described above. However, unequivocal proof of this concept awaited the isolation of an insect gene encoding this enzyme, together with evidence that the gene product had the substrate specificity of the N-glycan processing enzyme.
The first proof of this kind was provided by a more recent study from Altmann's group, in which they demonstrated that the D. melanogaster fused lobes (Dm-fdl) gene encodes the specific, processing β-N-acetylglucosaminidase in this organism (16). Importantly, this study demonstrated that the Dm-fdl gene product has several features distinguishing it from degradative hexosaminidases and chitinases, which also have β-N-acetylglucosaminidase activities. These features included its specificity for the terminal N-acetylglucosamine residue linked to the α3 branch of N-glycan substrates and its inability to degrade chito-oligosaccharides. Furthermore, it was shown that flies lacking a functional fdl gene produced a higher proportion of N-glycans with terminal N-acetylglucosamine residues linked to the α3 branch than wild type. These findings, together with the finding that the D. melanogaster hexosaminidase genes (hexo1 and hexo2) encode enzymes that can cleave chito-oligosaccharides, but not N-glycans, strongly suggested that Dm-FDL is the β-N-acetylglucosaminidase responsible for N-glycan processing in this fly. These properties also were consistent with the idea that Dm-FDL is an ortholog of the lepidopteran insect N-glycan processing enzyme first detected by Altmann and coworkers (1995) in microsomal membranes from IPLB-Sf21AE cells.
Subsequently, two lab groups independently reported molecular cloning of genes encoding β-N-acetylglucosaminidases from Sf9 cells, which are a clonal derivative of the IPLB-Sf21AE cell line (17,18). Our group described the isolation of three β-N-acetylglucosaminidase genes from Sf9 cells, which were designated SfGlcNAcase-1, -2, and -3 (18). SfGlcNAcase-1 was clearly distinct from the other two, which were nearly identical to each other and appeared to be allelic variants of the same gene. Further analysis of the SfGlcNAcase-1 and SfGlcNAcase-3 gene products showed that they had high sequence homology to known hexosaminidases and that each also had β-N-acetylglucosaminidase activity when assayed against relevant substrates. However, neither had the tight α3 branch specificity of the processing enzyme activity originally described by Altmann and coworkers (1995).
In fact, each could remove the terminal N-acetylglucosamine residues from either the α3 or the α6 branch of various N-glycan substrates and each also was able to release N-acetylglucosamine monomers from a chito-oligosaccharide substrate. Accordingly, we concluded that none of these S. frugiperda genes encoded the N-glycan processing enzyme, but rather, that they encoded broad-spectrum β-N-acetylglucosaminidases that are more likely to be involved in N-glycan and chitin degradation. In a similar study, Tomiya and coworkers (2006) also molecularly cloned two allelic variants of an Sf9 cell β-N-acetylglucosaminidase gene, which they termed Sfhex. Further analysis of the Sfhex gene product, which is identical to the gene product we designated SfGlcNAcase-3, confirmed that the SfGlcNAcase-3/Sfhex gene product lacks the α3 branch specificity of the processing enzyme activity originally described by Altmann and coworkers. However, because this enzyme had a 2- to 5-fold higher preference for the terminal N-acetylglucosamine residue on the α3 branch of an N-glycan substrate, Tomiya and coworkers (2006) concluded that the SfGlcNAcase-3/Sfhex gene encodes the processing β-N-acetylglucosaminidase of Sf9 cells.
SUMMARY OF THE INVENTION
In accordance with the present invention, an isolated nucleic acid encoding an N-acetylglucosaminidase is provided. In one embodiment the nucleic acid encodes a protein of SEQ ID NO: 2. In a preferred embodiment, the nucleic acid is SEQ ID NO: 1. The nucleic acid molecules of the invention may be DNA, RNA, or cDNA and they may be single or double stranded. Additional embodiments of the invention include nucleic acids of SEQ ID NOS: 3, 5, and 7 and their encoded proteins SEQ ID NOS: 4, 6, and 8.
In another aspect, expression vectors comprising the nucleic acid molecules described above are provided. Also within the scope of the invention are recombinant insect cells transformed with such expression vectors.
In a particularly preferred embodiment, the RNA molecule is a fragment of SEQ ID NO: 1, having SEQ ID NO: 9, which is double stranded and, when expressed in a cell, down regulates production of the protein of SEQ ID NO: 2.
In still another aspect, the present invention provides isolated proteins comprising SEQ ID NOS: 2, 4, 6 and 8. The isolated proteins of this invention may be used for the production of specific glycans for use as standards, or substrates, e.g., in remodeling recombinant glycoprotein glycans.
In yet another aspect, a method for enhancing production of mammalian-like N-glycans in insect cells is provided. An exemplary method entails providing recombinant insect cell lines comprising the double stranded RNA molecule described above, and either transforming the cells with an expression vector or infecting the cells with a recombinant baculovirus comprising a nucleic acid encoding a heterologous glycoprotein of interest, wherein glycoprotein(s) expressed in the recombinant cells comprise elevated levels of mammalian-like N-glycans when compared to levels observed in wild type cells. In an alternative embodiment, the cells described above may optionally contain additional enzymes involved in the production and synthesis of mammalian-like N-glycans. Such enzymes include, without limitation, N-acetylglucosaminyltransferases, galactosyltransferases, sialyltransferases, sulfotransferases, sialic acid synthases, CMP-sialic acid synthetases, UDP-N-acetylglucosamine-2-epimerases/N-acetylmannosamine kinases, and CMP-sialic acid transporters.
Presented hereinbelow are data that will resolve the apparent discrepancy in the conclusions drawn from the two previous reports referred to above (17,18). In short, the present inventors molecularly cloned a β-N-acetylglucosaminidase cDNA from Sf9 cells, which turned out to be the S. frugiperda ortholog of the Dm-fdl gene. This gene, designated Sf-fdl, encodes a membrane-associated product that specifically cleaves the terminal N-acetylglucosamine residue from the α3 branch of N-glycan substrates, that has little or no activity against chito-oligosaccharide substrates, and that has precisely the same pH profile as the activity originally identified by Altmann and coworkers (1995) in IPLB-Sf21AE cell microsomes. Furthermore, Sf9 cells engineered to express a Sf-fdl-specific double-stranded RNA had lower levels of specific, processing β-N-acetylglucosaminidase activity. These results indicate that the specific, processing β-N-acetylglucosaminidase activity originally detected by Altmann and coworkers is encoded by the Sf-fdl gene in this lepidopteran insect cell line. The definitive identification of this new gene sets the stage for an effort to create a transformed Sf9 cell variant lacking this key N-glycan processing activity, which would be an improved host for recombinant glycoprotein production by baculovirus expression vectors.
BRIEF DESCRIPTION OF THE DRAWINGS
Fig. 1. Nucleotide sequence of the Sf-fdl gene (SEQ ID NO: 1) and amino acid sequence of the gene product (SEQ ID NO: 2). The putative N-terminal transmembrane domain is underlined and the two consensus N-glycosylation sites are boxed.
Fig. 2. Phylogenetic relationships between the Sf-FDL protein and known hexosaminidases. This Figure shows the phylogenetic relationships between the Sf-FDL protein and Dm-FDL (Acc. No. NM_165909; 16), SfGlcNAcase-3/SfHex (Acc. No. DQ249309; 17,18), SfGlcNAcase-1 (DQ249307; 18), and the human hexosaminidases A (Acc. No. NM_000520; 33) and B (NM_000521; 34). The amino acid sequences of these proteins were aligned using CLUSTALX version 1.83 (21) using the default settings and then the alignment was exported in the PHYLIP format (36) and used to generate a distance matrix by protdist in PHYLIP version 3.66 with the Jones-Taylor-Thornton model. Neighbor in PHYLIP version 3.66 was used to generate an unrooted tree from the distance matrix with the neighbor-joining method and, finally, the Neighbor output was used to draw an unrooted tree with the PHYLIP postscript generator. The Sf-FDL amino acid sequence is 44% and 29% identical to the sequences of Dm-FDL and SfGlcNAcase-3/SfHex, respectively.
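The percent identity values quoted for Fig. 2 are derived from pairwise comparison of aligned amino acid sequences. The specification does not state which denominator was used, so the following is only a minimal sketch under one common convention (identical residues divided by the number of alignment columns in which neither sequence has a gap); the function name `percent_identity` is a hypothetical helper introduced for illustration.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity between two rows of a pairwise or multiple alignment.

    Assumption: columns containing a gap in either sequence are ignored,
    and identity is reported relative to the remaining columns.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # skip gapped columns
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared
```

Other conventions (for example, dividing by the length of the shorter sequence or by the full alignment length) give different percentages for the same alignment, so values computed this way need not reproduce the figures reported above.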
Fig. 3. Substrate specificity of Sf-FDL. Various glycan substrates, including GnGn (A), MGn (B), GnM (C), and chitotriose (D) were incubated for 16 h with microsomal fractions containing 10 µg of total protein from Sf9 cells infected with AcMNPV, AcDm-FDL, AcSf-FDL, or AcGlcNAcase-3. The reaction products were then recovered and analyzed by reverse-phase HPLC, as described in Experimental Procedures. The arrows show the elution times for each of the relevant glycans.
Fig. 4. pH optimum of Sf-FDL. Microsomal fractions containing 10 µg of total protein from AcSf-FDL-infected Sf9 cells were incubated for 16 h with GnGn at pH values between 4.0 and 8.0, and then the reaction products were recovered and analyzed by reverse-phase HPLC, as described in Experimental Procedures. The plot shows the relative amount of GnM produced at each pH, expressed as a percentage: the area under the GnM peak divided by the sum of the areas under the GnGn and GnM peaks.
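The quantity plotted in Fig. 4 is a simple ratio of HPLC peak areas. A minimal sketch of that calculation is shown here; the function and parameter names are illustrative assumptions, and the peak areas are taken as already integrated by whatever chromatography software is in use.

```python
def percent_gnm(area_gngn: float, area_gnm: float) -> float:
    """Relative GnM production as a percentage of total GnGn + GnM peak area."""
    if area_gngn < 0 or area_gnm < 0:
        raise ValueError("peak areas must be non-negative")
    total = area_gngn + area_gnm
    if total == 0:
        raise ValueError("at least one peak area must be positive")
    # Product peak area divided by the summed substrate and product peak areas.
    return 100.0 * area_gnm / total
```

For instance, a GnGn peak area of 75 and a GnM peak area of 25 (in any consistent unit) would correspond to 25% conversion under this definition.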
Fig. 5. Expression and purification of GST-tagged β-N-acetylglucosaminidase ectodomains. The GST-tagged ectodomains of Sf-FDL (lanes 1), Dm-FDL (lanes 2), and SfGlcNAcase-3/Sfhex (lanes 3) were expressed in recombinant baculovirus-infected Sf9 cells and purified from the extracellular fraction by glutathione affinity chromatography, as described in Experimental Procedures. Equal amounts of the purified products were then analyzed by (A) SDS-PAGE with Coomassie Blue staining or (B) SDS-PAGE with immunoblotting using a GST-specific antiserum.
Fig. 6. Substrate specificity of the GST-tagged ectodomains of Sf-FDL, Dm-FDL, and SfGlcNAcase-3/Sfhex. Equal amounts of each enzyme were incubated for 2 h with GnGn (A), MGn (B), GnM (C), or chitotriose (D) and the reaction products were recovered and analyzed by reverse-phase HPLC, as described in Experimental Procedures. The arrows show the elution times for each of the relevant glycans.
Fig. 7. Overdigestion of glycan substrates with the GST-tagged ectodomains of Sf-FDL, Dm-FDL, and SfGlcNAcase-3/Sfhex. Equal amounts of each enzyme were incubated for 20 h with GnGn (A), GnM (B), or chitotriose (C) and the reaction products were recovered and analyzed by reverse-phase HPLC, as described in Experimental Procedures. The arrows show the elution times for each of the relevant glycans.
Fig. 8. Nucleotide sequence of the Tn-fdl gene (SEQ ID NO: 3) and amino acid sequence of the gene product (SEQ ID NO:4). The putative N-terminal transmembrane domain is underlined and the two consensus N-glycosylation sites are boxed.
Fig. 9. Nucleotide sequence of one allele of the Bm-fdl gene (SEQ ID NO:5) and amino acid sequence of the gene product (SEQ ID NO:6). The putative N-terminal transmembrane domain is underlined and the three consensus N-glycosylation sites are boxed.
Fig. 10. Nucleotide sequence of another allele of the Bm-fdl gene (SEQ ID NO:7) and amino acid sequence of the gene product (SEQ ID NO:8). The putative N-terminal transmembrane domain is underlined and the three consensus N-glycosylation sites are boxed.
Fig. 11. Endogenous levels of specific, processing β-N-acetylglucosaminidase activity in parental Sf9 cells and an Sf9-derived clone expressing an Sf-fdl-specific double-stranded RNA. Microsomal membrane preparations from Sf9 or SfFDL RNAi cells were incubated for 16 hr with GnGn, and the reaction products were analyzed by HPLC to compare the relative amounts of GnM produced. The plot shows the average results obtained in five replicate assays, with the average percentage of GnM produced by microsomes from the Sf9 controls set to 100%. The error bars show the standard deviations and a one-way ANOVA analysis showed that the two datasets are significantly different (P < 0.01).
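The Fig. 11 analysis combines two steps: scaling replicate measurements so that the control mean equals 100%, and comparing the groups by one-way ANOVA. The sketch below illustrates both steps with the standard library only; the function names are hypothetical, no data from the specification are reproduced, and converting the F statistic to a P value (which requires the F distribution) is left to a statistics package.

```python
from statistics import mean
from typing import List, Sequence, Tuple


def normalize_to_control(
    control: Sequence[float], treated: Sequence[float]
) -> Tuple[List[float], List[float]]:
    """Scale both groups so that the mean of the control group is 100%."""
    ref = mean(control)
    if ref == 0:
        raise ValueError("control mean must be non-zero")
    return ([100.0 * x / ref for x in control],
            [100.0 * x / ref for x in treated])


def one_way_anova_f(*groups: Sequence[float]) -> float:
    """F statistic for a one-way ANOVA across two or more groups."""
    k = len(groups)
    n = sum(len(g) for g in groups)
    if k < 2 or n <= k:
        raise ValueError("need at least two groups and more observations than groups")
    grand = sum(sum(g) for g in groups) / n
    # Between-group and within-group sums of squares.
    ss_between = sum(len(g) * (mean(g) - grand) ** 2 for g in groups)
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)
    if ss_within == 0:
        raise ValueError("within-group variance is zero; F is undefined")
    return (ss_between / (k - 1)) / (ss_within / (n - k))
```

With two groups of five replicates each, the resulting F statistic has 1 and 8 degrees of freedom.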
Fig. 12. The sequence utilized in the RNAi experiment is shown (SEQ ID NO:9).
DETAILED DESCRIPTION OF THE INVENTION
Manα6(Manα3)Manβ4GlcNAcβ4GlcNAc-R is the core structure of the major processed protein N-glycans produced by insect cells. Ultimately, this paucimannose type structure is produced by an unusual β-N-acetylglucosaminidase, which removes the terminal N-acetylglucosamine residue from the upstream intermediate, Manα6(GlcNAcβ2Manα3)Manβ4GlcNAcβ4GlcNAc-R. Because the N-glycan processing pathways leading to the production of this intermediate are probably identical in insects and higher eukaryotes, the presence or absence of this specific, processing β-N-acetylglucosaminidase is a key factor distinguishing the processing pathways in these two different types of organisms. Recent studies have shown that the fused lobes (fdl) gene encodes the specific, processing β-N-acetylglucosaminidase of D. melanogaster. However, there are conflicting reports on the identity of the gene encoding this enzyme in the lepidopteran insect, S. frugiperda. One has suggested that a gene alternatively designated SfGlcNAcase-3 or SfHex encodes this function, while another has suggested that this gene encodes a broad-spectrum β-N-acetylglucosaminidase that functions in glycan and chitin degradation. In the present invention, this conflict is resolved by demonstrating that an S. frugiperda fdl ortholog (Sf-fdl) encodes a product with the substrate specificity expected of a processing β-N-acetylglucosaminidase. It is also shown that the endogenous levels of specific, processing β-N-acetylglucosaminidase activity are significantly reduced in S. frugiperda cells engineered to express a double-stranded RNA derived from the Sf-fdl gene. These results indicate that Sf-fdl encodes the specific, processing β-N-acetylglucosaminidase of S. frugiperda.
Definitions:
A "cell line" refers to cells which can be cultured in the lab for an indefinite period and are useful for producing large amounts of a protein of interest. Ideally, such cells are immortalized and do not exhibit senescence in culture.
As used herein, the term "insect" includes any stage of development of an insect, including a one-celled germ line cell, a fertilized egg, an early embryo, a larva, including any of a first through final instar larva, a pupa, or an adult insect. For the production of mammalianized glycoproteins of interest, a large larva, such as a fourth or fifth instar larva is preferred. It will be evident to a skilled worker which insect stage is suitable for a particular purpose, such as for direct production of a glycosylated polypeptide of interest, for storage or transport of an insect to a different location, for generation of progeny, for further genetic crosses, or the like.
With reference to nucleic acids of the invention, the term "isolated nucleic acid" is sometimes used. This term, when applied to DNA, refers to a DNA molecule that is separated from sequences with which it is immediately contiguous (in the 5' and 3' directions) in the naturally occurring genome of the organism from which it originates. For example, the "isolated nucleic acid" may comprise a DNA or cDNA molecule inserted into a vector, such as a plasmid or virus vector, or integrated into the DNA of a prokaryote or eukaryote. With respect to RNA molecules of the invention, the term "isolated nucleic acid" primarily refers to an RNA molecule encoded by an isolated DNA molecule as defined above. Alternatively, the term may refer to an RNA molecule that has been sufficiently separated from RNA molecules with which it would be associated in its natural state (i.e., in cells or tissues), such that it exists in a "substantially pure" form (the term "substantially pure" is defined below).
With respect to protein, the term "isolated protein" or "isolated and purified protein" is sometimes used herein. This term refers primarily to a protein produced by expression of an isolated nucleic acid molecule of the invention. Alternatively, this term may refer to a protein which has been sufficiently separated from other proteins with which it would naturally be associated, so as to exist in "substantially pure" form.
The term "promoter region" refers to the transcriptional regulatory regions of a gene, which may be found at the 5' or 3' side of the coding region, or within the coding region, or within introns.
The term "vector" refers to a small carrier DNA molecule into which a DNA sequence can be inserted for introduction into a host cell where it will be replicated. An "expression vector" is a specialized vector that contains a gene or nucleic acid sequence with the necessary regulatory regions needed for expression in a host cell.
The term "operably linked" means that the regulatory sequences necessary for expression of a coding sequence are placed in the DNA molecule in the appropriate positions relative to the coding sequence so as to effect expression of the coding sequence. This same definition is sometimes applied to the arrangement of coding sequences and transcription control elements (e.g. promoters, enhancers, and termination elements) in an expression vector. This definition is also sometimes applied to the arrangement of nucleic acid sequences of a first and a second nucleic acid molecule wherein a hybrid nucleic acid molecule is generated.
The term "substantially pure" refers to a preparation comprising at least 50-60% by weight the compound of interest (e.g., nucleic acid, oligonucleotide, protein, etc.). More preferably, the preparation comprises at least 75% by weight, and most preferably 90-99% by weight, of the compound of interest. Purity is measured by methods appropriate for the compound of interest (e.g., chromatographic methods, agarose or polyacrylamide gel electrophoresis, HPLC analysis, and the like).
The phrase "consisting essentially of," when referring to a particular nucleotide sequence or amino acid sequence, means a sequence having the properties of a given SEQ ID NO:. For example, when used in reference to an amino acid sequence, the phrase includes the sequence per se and molecular modifications that would not affect the basic and novel characteristics of the sequence.
The term "oligonucleotide," as used herein refers to primers and probes of the present invention, and is defined as a nucleic acid molecule comprised of two or more ribo- or deoxyribonucleotides, preferably more than three. The exact size of the oligonucleotide will depend on various factors and on the particular application for which the oligonucleotide is used.
The term "probe" as used herein refers to an oligonucleotide, polynucleotide or nucleic acid, either RNA or DNA, whether occurring naturally as in a purified restriction enzyme digest or produced synthetically, which is capable of annealing with or specifically hybridizing to a nucleic acid with sequences complementary to the probe. A probe may be either single-stranded or double-stranded. The exact length of the probe will depend upon many factors, including temperature, source of probe and method of use. For example, for diagnostic applications, depending on the complexity of the target sequence, the oligonucleotide probe typically contains 15-25 or more nucleotides, although it may contain fewer nucleotides.
The probes herein are selected to be "substantially" complementary to different strands of a particular target nucleic acid sequence. This means that the probes must be sufficiently complementary so as to be able to "specifically hybridize" or anneal with their respective target strands under a set of pre-determined conditions. Therefore, the probe sequence need not reflect the exact complementary sequence of the target. For example, a non-complementary nucleotide fragment may be attached to the 5' or 3' end of the probe, with the remainder of the probe sequence being complementary to the target strand. Alternatively, non-complementary bases or longer sequences can be interspersed into the probe, provided that the probe sequence has sufficient complementarity with the sequence of the target nucleic acid to anneal therewith specifically.
The term "specifically hybridize" refers to the association between two single-stranded nucleic acid molecules of sufficiently complementary sequence to permit such hybridization under pre-determined conditions generally used in the art (sometimes termed "substantially complementary"). In particular, the term refers to hybridization of an oligonucleotide with a substantially complementary sequence contained within a single-stranded DNA or RNA molecule of the invention, to the substantial exclusion of hybridization of the oligonucleotide with single-stranded nucleic acids of non-complementary sequence.
The term "primer" as used herein refers to an oligonucleotide, either RNA or DNA, either single-stranded or double-stranded, either derived from a biological system, generated by restriction enzyme digestion, or produced synthetically which, when placed in the proper environment, is able to act functionally as an initiator of template-dependent nucleic acid synthesis. When presented with an appropriate nucleic acid template, suitable nucleoside triphosphate precursors of nucleic acids, a polymerase enzyme, suitable cofactors and conditions such as a suitable temperature and pH, the primer may be extended at its 3' terminus by the addition of nucleotides by the action of a polymerase or similar activity to yield a primer extension product. The primer may vary in length depending on the particular conditions and requirements of the application. For example, in diagnostic applications, the oligonucleotide primer is typically 15-25 or more nucleotides in length. The primer must be of sufficient complementarity to the desired template to prime the synthesis of the desired extension product, that is, to be able to anneal with the desired template strand in a manner sufficient to provide the 3' hydroxyl moiety of the primer in appropriate juxtaposition for use in the initiation of synthesis by a polymerase or similar enzyme. It is not required that the primer sequence represent an exact complement of the desired template. For example, a non-complementary nucleotide sequence may be attached to the 5' end of an otherwise complementary primer. Alternatively, non-complementary bases may be interspersed within the oligonucleotide primer sequence, provided that the primer sequence has sufficient complementarity with the sequence of the desired template strand to functionally provide a template-primer complex for the synthesis of the extension product.
The term "percent identical" is used herein with reference to comparisons among nucleic acid or amino acid sequences. Nucleic acid and amino acid sequences are often compared using computer programs that align sequences of nucleic or amino acids, thus defining the differences between the two. For purposes of this invention, comparisons of nucleic acid sequences are performed using the GCG Wisconsin Package version 9.1, available from the Genetics Computer Group in Madison, Wisconsin. For convenience, the default parameters (gap creation penalty = 12, gap extension penalty = 4) specified by that program are intended for use herein to compare sequence identity. Alternatively, the Blastn 2.0 program provided by the National Center for Biotechnology Information (at http://www.ncbi.nlm.nih.gov/blast/; Altschul et al., 1990, J Mol Biol 215:403-410), using a gapped alignment with default parameters, may be used to determine the level of identity and similarity between nucleic acid sequences and amino acid sequences.
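As a rough illustration of what a percent-identity figure measures, the following minimal Python sketch scores two sequences that have already been aligned (gaps written as "-"). It is not the GCG or Blastn algorithm and does not apply the gap penalties quoted above. The function name, the convention of counting gapped columns in the denominator, and the toy sequences are all assumptions made for illustration.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns in which both sequences carry the same residue.

    Assumption: a column containing a gap ('-') counts toward the alignment
    length but never counts as a match. Other conventions exist.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(
        1
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != "-"
    )
    return 100.0 * matches / len(aligned_a)


# Toy alignment (hypothetical sequences, not taken from any SEQ ID NO)
print(percent_identity("ATGC-TAGGA", "ATGCATAGCA"))
```

Here eight of the ten columns match, so the sketch reports 80.0. A real comparison would first produce the alignment with one of the programs named above.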
The term "expression control sequence", as used herein, refers to a polynucleotide sequence that regulates expression of a polypeptide coded for by a polynucleotide to which it is functionally ("operably") linked. Expression can be regulated at the level of the mRNA or polypeptide. Thus, the term expression control sequence includes mRNA-related elements and protein-related elements. Such elements include promoters, domains within promoters, upstream elements, enhancers, elements that confer tissue or cell specificity, response elements, ribosome binding sequences, transcriptional terminators, etc. Suitable expression control sequences that can function in insect cells will be evident to the skilled worker. In some embodiments, it is desirable that the expression control sequence comprises a constitutive promoter. Among the many suitable "strong" promoters which can be used are the baculovirus promoters for the p10, polyhedrin (polh), p6.9, capsid, and cathepsin-like genes. Among the many "weak" promoters which are suitable are the baculovirus promoters for the ie1, ie2, ie0, etl, 39K (aka pp31), and gp64 genes. Other suitable strong constitutive promoters include the B. mori actin gene promoter; D. melanogaster hsp70, actin, α-1-tubulin or ubiquitin gene promoters; RSV or MMTV promoters; copia promoter; gypsy promoter; and the cytomegalovirus IE gene promoter. If it is desired to increase the amount of gene expression from a weak promoter, enhancer elements, such as the baculovirus enhancer element, hr5, may be used in conjunction with the promoter.
In some embodiments, the expression control sequence comprises a tissue- or organ-specific promoter. Many such expression control sequences will be evident to the skilled worker. In general, the enzymes involved in N-glycan processing of the invention are required in catalytic amounts. Therefore, in one embodiment of the invention, much lower amounts of these enzymes are present than of the heterologous polypeptides of interest, which are generated in large amounts, glycosylated, and harvested for further use. For example, a suitable molar ratio of heterologous protein produced to enzyme involved in N-glycan processing may be greater than about 100:1.
Alternatively, the enzymes involved in N-glycan processing may be in comparable (e.g., approximately stoichiometric) amounts to the heterologous glycoprotein to be processed. A skilled worker can readily select suitable promoters and/or conditions to express suitable amounts of the enzymes involved in N-glycan processing (e.g., amounts which are sufficient to process the N-glycans of relatively high amounts of a protein of interest to the desired extent). Furthermore, a skilled worker can readily ensure that the enzymes involved in N-glycan processing are present in sufficient local concentrations, and at an optimal time during insect propagation.
In some embodiments of the invention, as is discussed in more detail elsewhere herein, it is desirable that an expression control sequence is regulatable (e.g., comprises an inducible promoter and/or enhancer element). Suitable regulatable promoters include, e.g., Drosophila or other hsp70 promoters, the Drosophila metallothionein promoter, an ecdysone-regulated promoter, the Saccharomyces cerevisiae Gal4/UAS system, and other well-known inducible systems. A Tet-regulatable molecular switch may be used in conjunction with any constitutive promoter, such as those described elsewhere herein (e.g., in conjunction with the CMV-IE promoter, or baculovirus promoters). Another type of inducible promoter is a baculovirus late or very late promoter that is only activated following infection by a baculovirus.
Methods for designing and preparing constructs suitable for generating transgenic insect cell lines or insects (or vectors for infection of an insect) are conventional. For these methods, as well as other molecular biology procedures related to the invention, see, e.g., Sambrook et al., Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, N.Y. (1989); Wu et al., Methods in Gene Biotechnology (CRC Press, New York, NY, 1997); Recombinant Gene Expression Protocols, in Methods in Molecular Biology, Vol. 62 (Tuan, Ed., Humana Press, Totowa, NJ, 1997); and Current Protocols in Molecular Biology (Ausubel et al., Eds.), John Wiley & Sons, NY (1994-1999). Some suitable methods are described elsewhere herein. A variety of immortalized lepidopteran insect cell lines are suitable for transformation by the vectors/constructs of the invention. Among these are Sf9 (Vaughn et al. (1977) In Vitro 13, 213-217), Tn 5B1-4 (High Five; Wickham et al. (1992) Biotech. Progr. 8, 391-6), expresSf+ (Protein Sciences Corporation), and BmN (Bm-N4; Maeda et al. (1985) Nature 315, 592-594) cells.
Methods for generating transgenic insect cell lines are conventional. For example, in one embodiment, one or more genes to be introduced are placed under the control of a suitable expression control sequence and are cloned into one or more plasmid vectors. These vectors are then mixed with a vector encoding a selectable marker under the control of a suitable expression control sequence. The DNA mixture is then introduced into the parental insect cell line (e.g., by calcium phosphate-mediated transfection), and the transgene(s) will integrate by non-homologous recombination into the insect cell genome. Transformed cells are selected using an appropriate antibiotic (e.g., neomycin, hygromycin, or zeocin, among others), cloned by colony formation or limiting dilution, and clones expressing the unselected genes of interest are identified using various methods, including RNA dot blot assays, lectin staining assays, or functional assays. This general approach was first described in 1990 (Jarvis et al., 1990. Bio/Technology 8, 950-955) and has been reviewed recently (Harrison, R.L. and Jarvis, D.L. 2007. Transforming lepidopteran insect cells for improved protein processing. In D.W. Murhammer (Ed.), Methods in Molecular Biology: Baculovirus Expression Protocols. Humana Press, Clifton, NJ. Methods Mol Biol. (2007) 388:3-22).
Methods for generating transgenic insects are conventional. For example, in one embodiment, one or more genes to be introduced are placed under the control of a suitable expression control sequence, and are cloned into a vector, such as a viral vector (e.g., an attenuated baculovirus vector, or a non-permissive viral vector that is not infective for the particular insect of interest). The sequences to be introduced into the insect are flanked by genomic sequences from the insect. The construct is then introduced into an insect egg (e.g., by microinjection), and the transgene(s) then integrate by homologous recombination of the flanking sequences into comparable sequences in the insect genome.
In another embodiment, the vector is a transposon-based vector. One form of such transposon-based vectors is a viral vector (such as those described above) that further comprises inverted terminal repeats of a suitable transposon, between which the transgene of interest is cloned. One or more genes of interest, under the control of suitable expression control sequence(s), are cloned into the transposon-based vector. In some systems, the transposon-based vector carries its own transposase. However, generally, the transposon-based vector does not encode a suitable transposase. In this case, the vector is co-transfected into an insect (e.g., an insect larva) with a helper virus or plasmid that provides a transposase. The recombinant vector (along with, generally, a helper) is introduced by conventional methods (such as microinjection) into an egg or early embryo; and the transgene(s) become integrated at a transposon site (such as sequences corresponding to the inverted terminal repeat of the transposon) in the insect genome.
Suitable types of transposon-based vectors will be evident to the skilled worker. These include, e.g., Minos, mariner, Hermes, Sleeping Beauty, and piggyBac.
In a preferred embodiment, the vector is a piggyBac vector. TTAA-specific, short repeat elements feature in a group of transposons (Class II mobile elements) that have similar structures and movement properties. A typical piggyBac vector (formerly IFP2) is the most extensively studied of these insertion elements. piggyBac is 2.4 kb long and terminates in 13 bp perfect inverted repeats, with additional internal 19 bp inverted repeats located asymmetrically with respect to the ends (Cary et al. (1989) Virology 172, 156-69). A piggyBac vector may encode a trans-acting transposase that facilitates its own movement; alternatively, these sequences can be deleted and this function can be supplied by a helper plasmid or virus. Non-essential genes have been deleted from piggyBac, allowing for the cloning of inserts as large as about 15 kb into certain piggyBac vectors. This allows, for example, for the insertion of about six or seven genes with their expression control sequences. Thus, a collection of enzymes involved in N-glycan processing, marker proteins, or the like, can be introduced together via a single transposon vector, into a single site in an insect genome. Several piggyBac vectors have been developed for insect transgenesis. Two particularly useful constructs, defined as minimal constructs for the movement of piggyBac vectored sequences, were developed by analysis of deletion mutations both within and outside of the boundaries of the transposon (Li et al. (2001) Mol. Genet. Genomics 266, 190-8). Using constructs such as these it is possible to increase the amount of genetic material mobilized by the piggyBac transposase by minimizing the size of the vector. The minimal requirements for movement include the 5' and 3' terminal repeat domains and attendant TTAA target sequences.
Nearly all of the internal domain may be removed, although more recent data indicate that some of this region may be required for efficient translocation of the mobilized sequences into the genome of the insect. In addition, a minimum of 50 bases separating the TTAA target sites of the element is required for efficient mobilization (Li et al. (2001), supra). piggyBac can transpose in insect cells while carrying a marker gene, and movement of the piggyBac element can occur in cells from lepidopteran species distantly related to the species from which it was originally isolated. piggyBac has been shown to be capable of transforming Drosophila melanogaster, Anastrepha suspensa, Bactrocera dorsalis, Bombyx mori, Pectinophora gossypiella, Tribolium castaneum, and several mosquito species. At least three lepidopteran species, Pectinophora gossypiella, Trichoplusia ni and Bombyx mori, have been successfully transformed by the piggyBac element.
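The TTAA target sites and the 50-base spacing figure mentioned above lend themselves to a small sequence check. The following Python sketch locates TTAA tetranucleotides in a construct and measures the bases lying between two of them. It is only an illustration: the function names, the way "separating" is counted (bases strictly between the two sites), and the toy construct are assumptions, not part of the described method. Because TTAA is its own reverse complement, scanning one strand is sufficient.

```python
def find_ttaa_sites(seq: str) -> list[int]:
    """Return 0-based start positions of every TTAA tetranucleotide (overlaps allowed)."""
    seq = seq.upper()
    return [i for i in range(len(seq) - 3) if seq[i:i + 4] == "TTAA"]


def bases_between(first_start: int, second_start: int) -> int:
    """Number of bases lying strictly between two TTAA sites (assumed convention)."""
    return second_start - (first_start + 4)


MIN_SPACER = 50  # spacing figure quoted in the text (Li et al. (2001))

# Hypothetical toy construct: TTAA, a 60-base spacer, TTAA
toy = "TTAA" + "GC" * 30 + "TTAA"
sites = find_ttaa_sites(toy)
spacer = bases_between(sites[0], sites[-1])
print(sites, spacer, spacer >= MIN_SPACER)
```

For the toy construct the two sites are found at positions 0 and 64, with 60 bases between them, so the check passes.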
Generally, a helper virus or plasmid that expresses a transposase is co-introduced with the transposon-based vector as above. Expression of the transposase is determined by the choice of promoter for the insect system being tested. Toward that end, several promoter-driven helper constructs that are useful for lepidopteran transformation, including the Drosophila hsp70, baculovirus ie1 promoter, and Drosophila Actin 5C promoter, have been constructed.
For further guidance on the use of baculovirus-based vectors, see, e.g., WO01/29204 and US Patent 6,551,825. Other recent references that discuss piggyBac vectors and methods for generating transgenic insects using them include, e.g., Handler et al. (1998) Proc Natl Acad Sci 95, 7520-7525; Fraser, MJ (2001) The TTAA-specific family of transposable elements. In: Insect Transgenesis: Methods and Applications. James AA and Handler AH, Eds. CRC Press, Orlando, FL; Lobo et al. (1999) Mol. Gen. Genetics 261, 803-810; Grossman et al. (2000) Insect Biochem. Mol. Biol. 30, 909-914; Lobo et al. (2001) Mol. Gen. Genom. 265, 66-71; Lorenzen et al. (2003) Insect Mol Biol. 12, 433-40; Hacker et al. (2003) Proc Natl Acad Sci U S A. 100, 7720-5; Sumitani et al. (2003) Insect Biochem Mol Biol. 33, 449-58; Horn et al. (2003) Genetics 163, 647-61; and Tomita et al. (2003) Nat Biotechnol. 21, 52-6.
Methods for introducing constructs into an embryo to generate a transgenic insect (e.g., by microinjection) are conventional. Survivorship is usually quite high (up to 75%) for microinjected embryos. In general, preblastoderm eggs are pierced with a fine glass capillary holding a solution of the plasmid DNA and/or the recombinant virus. G0 larvae hatched from the virus-injected eggs are then screened for expression of the gene of interest. Breeding transgenic G1s with normal insects yields transgenic offspring according to the rules of Mendelian inheritance. Once a transgene(s) is stably integrated into the genome of an insect egg or early embryo, conventional methods can be used to generate a transgenic insect, in which the transgene(s) is present in all of the insect somatic and germ cells. When a subset of the complete set of enzymes involved in N-glycan processing is present in a transgenic insect, other transposon-based vectors, which express different subsets of the genes encoding enzymes involved in N-glycan processing, can be introduced sequentially into the insect genome, and transgenic insects can then be generated. In another embodiment, when different subsets of the complete set of enzymes involved in N-glycan processing are present in two or more individual transgenic insects, these insects can be genetically crossed to produce a transgenic insect that expresses a larger subset, or a complete set, of the genes encoding enzymes involved in N-glycan processing.
In some embodiments, the transgenic insects are heterozygous for the modifying enzyme genes. For example, when potentially toxic genes are expressed constitutively, it may be advantageous for the insects to be heterozygous, to limit the amount of the enzyme that is produced. In other embodiments, the insects are homozygous for the transgenes. Methods for producing homozygous transgenic insects (e.g., using suitable back-crosses) are conventional.
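The expected outcome of the crosses mentioned above can be worked through with a simple single-locus model. The Python sketch below is illustrative only: it assumes one transgene insertion at one autosomal locus, ordinary Mendelian segregation, and no effect of the transgene on viability. The allele symbols ("T" for the insertion, "+" for the unmodified chromosome) and the function name are hypothetical.

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def cross(parent_a: str, parent_b: str) -> dict[str, Fraction]:
    """Expected offspring genotype frequencies for a single autosomal locus.

    Genotypes are two-character strings: 'T' = transgene-bearing allele,
    '+' = allele without the insertion. Each parent passes on one allele
    with equal probability.
    """
    counts = Counter(
        "".join(sorted(pair, reverse=True))  # normalise so a heterozygote is always 'T+'
        for pair in product(parent_a, parent_b)
    )
    total = sum(counts.values())
    return {genotype: Fraction(n, total) for genotype, n in counts.items()}


# Heterozygous transgenic insect crossed with a non-transgenic insect
print(cross("T+", "++"))
# Two heterozygotes crossed to recover homozygous transgenic offspring
print(cross("T+", "T+"))
```

Under these assumptions, the first cross gives half heterozygous transgenic and half non-transgenic offspring, and the second gives the familiar 1:2:1 ratio, with one quarter of the offspring homozygous for the transgene.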
Another embodiment of the invention is an isolated cell, or progeny thereof, derived from a transgenic insect of the invention. Suitable cells include isolated germ line cells, and cells that can be used for the in vitro production of a glycoprotein exhibiting a partial or complete pattern of mammalian glycosylation. Methods for obtaining and propagating cells from a transgenic insect, and using them (e.g., to generate more insects, or to generate glycosylated proteins) are conventional.
The transgenic insects discussed above can be used to produce glycoproteins of interest that exhibit partial or complete patterns of mammalian glycosylation. For example, the insects can be used in methods for glycosylating polypeptides in a mammalian (human) glycosylation pattern.
The coding sequences described herein may be operably linked to an expression control sequence from the virus itself, or to another suitable expression control sequence. Suitable virus-based vectors include, e.g., baculovirus vectors (such as vectors based on Autographa californica NPV, Orgyia pseudotsugata NPV, Lymantria dispar NPV, Bombyx mori NPV, Trichoplusia ni NPV, Spodoptera exigua NPV, Heliothis zea NPV, Galleria mellonella NPV, Anagrapha falcifera NPV, Trichoplusia ni sNPV); retroviral vectors; and viral vectors that comprise transposon recognition sequences (e.g., piggyBac vectors); etc. As discussed above, baculovirus-based vectors have been generated (or can be generated without undue experimentation) that allow the cloning of large numbers of inserts, at any of a variety of cloning sites in the viral vector. Thus, more than one heterologous polypeptide may be introduced together into a transgenic insect cell or insect of the invention. The viral vector can be introduced into an insect cell or insect by conventional methods, such as by in vitro inoculation (insect cells) or oral ingestion (insect larvae).
In one embodiment, the baculovirus replicates until the host insect is killed. The insect cell or insect lives long enough to produce large amounts of the glycosylated polypeptide of interest. In another embodiment, a baculovirus is used that is attenuated or non-permissive for the host. In this case, the host is not killed by replication of the baculovirus, itself (although the host may be damaged by the expression of the enzymes involved in N-glycan processing and/or the heterologous protein of interest).
In another embodiment, sequences encoding one or more recombinant proteins of interest, operably linked to an expression control sequence, are cloned into a suitable transposon-based vector (such as a piggyBac vector). Like the baculovirus vectors discussed above, transposon-based vectors can carry large inserts, so more than one heterologous polypeptide may be introduced together into a transgenic insect of the invention. Transposon-based vectors may on occasion insert into the DNA of somatic cells, and thus be stably expressed for relatively long periods of time.
In another embodiment, sequences encoding one or more recombinant proteins of interest, operably linked to an expression control sequence, are cloned into a retrovirus vector, or any other suitable virus vector. Such a construct may insert into the DNA of somatic cells, and thus be stably expressed for relatively long periods of time. Finally, in certain instances it may be desirable to downregulate expression and synthesis of the N-acetylglucosaminidase encoded by genes described in this invention. Accordingly, the invention also provides short double-stranded RNA sequences which hybridize to SEQ ID NO: 1 and function to downregulate the expression of the same in insect cells by an RNAi-dependent mechanism.
The following materials and methods are provided to facilitate the practice of the present invention.
Cells and cell culture — Sf9 cells, which are a subclone of the IPLB-Sf21-AE cell line derived from S. frugiperda ovaries (14), were routinely maintained as shake flask cultures in either TNM-FH medium containing 10% fetal bovine serum (HyClone, Logan, UT) or ESF 921 serum-free medium (Expression Systems, CA), as described previously (18).
Molecular cloning of an fdl gene homolog from Sf9 cells — The A. aegypti, A. gambiae, A. mellifera, B. mori, D. pseudoobscura and T. castaneum genomic databases were searched through the NCBI website using tBLASTn (19) with the derived amino acid sequence of Dm-FDL isoform C (Accession No. NM_165909) as the query. These searches identified exons from each species that encoded fragments of putative processing β-N-acetylglucosaminidases. These were joined in silico using an online splice site prediction algorithm available through the NetGene2 Server hosted by the Technical University of Denmark (20) to obtain contiguous open reading frames from each species. The predicted amino acid sequences were then aligned using CLUSTALX version 1.83 (21) with the default settings. Highly conserved amino acid sequences were visually identified and used to design degenerate oligonucleotide primers (Table 1), which were then used for polymerase chain reactions (PCRs; 22) with both cDNA and genomic DNA prepared from Sf9 cells as the templates. Genomic DNA was isolated from log phase cultures of uninfected Sf9 cells by a standard method (23). Total RNA was isolated from a log phase culture of uninfected Sf9 cells using the TriReagent (Molecular Research Center, Cincinnati, OH) according to the manufacturer's protocol. cDNA was prepared from 5 μg of GeneRacer™ oligo-dT-primed total RNA using Superscript™ III reverse transcriptase with the commercial GeneRacer™ kit (Invitrogen, Carlsbad, CA) according to the manufacturer's protocol and diluted to a final volume of 50 μL. The PCRs were performed in a total volume of 50 μL containing the manufacturer's high fidelity (HF) buffer plus 0.2 mM of each dNTP, 2 U of Phusion™ DNA polymerase (Promega, Madison, WI), 1 μM of each degenerate primer, and either ~100 ng of Sf9 genomic DNA or 2 μL of the cDNA preparation described above.
The reactions were incubated for 2 min at 98°C, then cycled 14 times using (i) 20 sec at 98°C, (ii) 20 sec at 76 to 62°C (with a decreasing temperature gradient of 1°C per cycle), and (iii) 30 sec at 72°C. The reactions were cycled another 30 times using (i) 20 sec at 98°C, (ii) 20 sec at 62°C, and (iii) 20 sec at 72°C, and finally incubated for 5 min at 72°C in a GeneAmp Model 2400 thermal cycler (Eppendorf, Foster City, CA). The spent reactions were separated on 1.2% agarose gels and specific amplification products of about the expected size (420 bp) were recovered from the gel, purified using the QiaQuick™ Gel Extraction Kit (Qiagen, Valencia, CA), and directly sequenced using the degenerate PCR primers specified above. The resulting nucleotide sequences were assembled using ContigExpress, a component of Vector NTI Advance 10.3.0 (Invitrogen). These data were used to design gene-specific primers for primary and nested 5'- and 3'-RACE reactions, which were performed to determine the full-length, putative Sf-fdl gene sequence.
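The touchdown cycling used for the degenerate-primer PCR above can be written out cycle by cycle. The Python sketch below is a minimal illustration, assuming that the first touchdown cycle anneals at the starting temperature and that each later cycle drops by the stated step; the function name and parameter names are hypothetical and do not correspond to any thermal cycler software.

```python
def touchdown_annealing(start_c: float, step_c: float, touchdown_cycles: int,
                        hold_c: float, hold_cycles: int) -> list[float]:
    """Annealing temperature (in °C) for every cycle of a touchdown program.

    Assumption: cycle 1 anneals at start_c and each following touchdown
    cycle is step_c lower; the remaining cycles all anneal at hold_c.
    """
    ramp = [start_c - step_c * i for i in range(touchdown_cycles)]
    return ramp + [hold_c] * hold_cycles


# Values from the text: 14 touchdown cycles starting at 76 °C, 1 °C lower
# per cycle, followed by 30 cycles at 62 °C.
temps = touchdown_annealing(76, 1, 14, 62, 30)
print(len(temps), temps[0], temps[13], temps[14])
```

Under this reading, the fourteenth touchdown cycle anneals at 63°C and the 62°C annealing temperature is first reached in the constant phase, for 44 cycles in total.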
TABLE 1 Primer sequences
5'-RACE — Total RNA was isolated as described above, used for first-strand cDNA synthesis according to the GeneRacer™ protocol, and the resulting 3'- and 5'-anchored first strand cDNA was diluted to 50 μL. Five μL of this cDNA were then used as the template for 5'-RACE reactions with 1.25 U of GoTaq® (Promega) and 200 nM of the SFFDLASP1 (Table 1) and GeneRacer™ 5' primers in a final volume of 50 μL of GoTaq® buffer. The reactions were incubated for 4 min at 95°C, cycled 12 times using (i) 30 sec at 95°C, (ii) 30 sec at 72 to 61°C (with a decreasing temperature gradient of 1°C per cycle), and (iii) 120 sec at 72°C. The reactions were cycled another 30 times using (i) 30 sec at 95°C, (ii) 20 sec at 61°C, and (iii) 120 sec at 72°C, and finally incubated for 5 min at 72°C. One μL of the spent 5'-RACE reaction was used as the template for a nested PCR with Promega's GoTaq® Green Mastermix and 200 nM of the SFFDLASP2 (Table 1) and GeneRacer™ 5'-nested primers in a total volume of 50 μL. These reactions were incubated for 90 sec at 95°C, cycled 25 times using (i) 30 sec at 95°C, (ii) 30 sec at 63°C, and (iii) 120 sec at 72°C, and finally incubated for 5 min at 72°C. The spent reactions were analyzed on a 1% agarose gel and an amplification product of approximately 1.4 kb in size was purified and used as the template for nested PCRs under the same conditions used for the primary PCRs, except the nested reactions included the SFFDLASP3 (Table 1) and GeneRacer™ 5'-nested primers and the annealing temperature was 65°C. The spent reactions were analyzed on a 0.9% agarose gel and the 1.4 kb amplification product was purified and directly sequenced using the SFFDLASP3, SFFDLASP4 (Table 1), and GeneRacer™ 5'-nested primers.
3'-RACE — The cDNA used for the 3'-RACE reactions was prepared from GeneRacer™ Oligo dT-primed total RNA, as described previously (24) and diluted to a final volume of 100 μL. Two μL of this cDNA preparation were then used as the template for a PCR with 1 U of Phusion™ DNA polymerase, 200 nM of the SFFDLSP3 and GeneRacer™ 3' primers, 1 M betaine, and 5% DMSO in a final volume of 50 μL of Phusion™ GC buffer. These reactions were incubated for 3 min at 98°C, cycled 45 times using (i) 30 sec at 98°C, (ii) 30 sec at 68°C for the first five cycles, 63°C for the next five cycles, 58°C for the next five cycles, and 52°C for the final 30 cycles, (iii) 60 sec at 72°C, and 20 sec at 75°C, and finally incubated for 2 min at 72°C. The spent reaction was analyzed on a 1% agarose gel, and the 1.2 kb amplification product was purified and used as the template for nested PCRs with 0.8 U of Phusion™ DNA polymerase, 200 nM of the SFFDLSP4 and GeneRacer™ 3'-nested primers, and 1 M betaine in a total final volume of 50 μL of Phusion™ GC buffer. These reactions were incubated for 2 min at 98°C, cycled 45 times using (i) 20 sec at 98°C, (ii) 20 sec at 70°C for the first five cycles, 65°C for the next five cycles, 60°C for the next five cycles, and 57°C for the final 30 cycles, (iii) 40 sec at 72°C, and finally incubated for 2 min at 72°C. The 1.0 kb amplification product was purified and directly sequenced using the primer SFFDLSP4.
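The stepped annealing program of the primary 3'-RACE PCR can be checked for internal consistency by expanding its stages and confirming that they add up to the stated number of cycles. The Python sketch below does this for the primary reaction only; the function name and the (temperature, cycles) tuple layout are assumptions chosen for illustration.

```python
def expand_stages(stages: list[tuple[float, int]]) -> list[float]:
    """Expand (annealing °C, number of cycles) stages into one temperature per cycle."""
    temps: list[float] = []
    for temp_c, n_cycles in stages:
        if n_cycles <= 0:
            raise ValueError("each stage needs at least one cycle")
        temps.extend([temp_c] * n_cycles)
    return temps


# Primary 3'-RACE PCR as described above: 68 °C x 5, 63 °C x 5, 58 °C x 5, 52 °C x 30
primary = expand_stages([(68, 5), (63, 5), (58, 5), (52, 30)])

# The four stages should account for all 45 cycles stated in the text.
assert len(primary) == 45
print(primary[4], primary[5], primary[15])
```

The same helper could be applied to the other stepped programs in this section by substituting their stage lists.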
Amplification of the full-length ORF from cDNA and genomic DNA — Sf9 cDNA was simultaneously produced and primed with the Sf-fdl gene-specific primer SFFDLCDNAASP (Table 1) using Superscript™ III reverse transcriptase (Invitrogen) according to the method of Shi et al. (24). Either 1.0 μL of this cDNA preparation or approximately 100 ng of Sf9 genomic DNA was then used as the template for PCRs containing 0.5 U of Phusion™ DNA polymerase, 1 M betaine, 0.2 mM of each dNTP and 200 nM of the SFFDLCDNAASP and SFFDLFL50SP primers (Table 1) in Phusion™ GC buffer. These reactions were incubated for 2 min at 98°C, cycled 40 times using (i) 20 sec at 98°C, (ii) 20 sec at 63°C for the first five cycles, 58°C for the next five cycles, and 55°C for the final 30 cycles, (iii) 60 sec at 72°C, and finally incubated for 2 min at 72°C. The amplification products were purified on 1% agarose gels, recovered, and directly sequenced using internal primers.
Construction of baculovirus transfer plasmids encoding native β-N-acetylglucosaminidases — Baculovirus transfer plasmids encoding full-length, untagged Dm-FDL or Sf-FDL were produced by using PCR to amplify the appropriate nucleotide sequences. The Sf-FDL coding sequence was assembled by producing two PCR amplimers with partially overlapping sequences, isolating the products, and then using them as templates for a third PCR designed to produce an amplimer encoding the full-length Sf-FDL protein. Briefly, the 3'-end of the Sf-fdl open reading frame was amplified from Sf9 cDNA prepared as described above in a PCR with 0.3 U of Phusion™ DNA polymerase, 0.2 mM of each dNTP, 1 M betaine, and 0.67 μM of the SFFDLSP1 and SFFDLFL31ASP primers (Table 1) in Phusion™ GC buffer. The reactions were incubated for 2 min at 98°C, cycled 45 times using (i) 20 sec at 98°C, (ii) 20 sec at 67°C for the first five cycles, 62°C for the next five cycles, 57°C for the next five cycles, and 54°C for the final 30 cycles, (iii) 40 sec at 72°C, and finally incubated for 2 min at 72°C. One μL of the spent reaction was used as the template for a nested PCR under essentially the same conditions, except the primers were SFFDLSP2 and SFFDLFL31ASP (Table 1). The spent secondary PCR was analyzed on a 1.2% agarose gel and the amplification product with the expected size was excised and purified as described above. The 5'-end of the Sf-fdl open reading frame was amplified using 1.0 μL of the spent nested 5'-RACE reaction described above as the template for a PCR with 0.5 U of Phusion™ DNA polymerase, 0.2 mM of each dNTP, 1 M betaine, and 1 μM of the SFFDLASP3 and SFFDLFL51SP primers (Table 1) in Phusion™ GC buffer. This reaction was incubated for 1 min at 98°C, cycled 13 times using (i) 30 sec at 98°C, (ii) 20 sec at 65 to 53°C (with a decreasing temperature gradient of 1°C per cycle), and (iii) 60 sec at 72°C, cycled another 30 times using (i) 30 sec at 98°C, (ii) 20 sec at 52°C, and (iii) 60 sec at 72°C, and finally incubated for 2 min at 72°C. The spent reaction was analyzed on a 1.0% agarose gel and the amplification product with the expected size was excised and purified as described above. Finally, the purified 3'- and 5'-ends of the predicted Sf-fdl ORF were combined in a PCR with 0.5 U of Phusion™ DNA polymerase, 0.2 mM of each dNTP, 1 M betaine, and 1 μM of the SFFDLFL5N2SP and SFFDLFL3N2ASP primers (Table 1) in Phusion™ GC buffer. This reaction was incubated for 1 min at 98°C, cycled four times using (i) 30 sec at 98°C and (ii) 90 sec at 72°C, cycled another 25 times using (i) 30 sec at 98°C, (ii) 20 sec at 52°C, and (iii) 80 sec at 72°C, and finally incubated for 2 min at 72°C. The spent reaction was analyzed on a 1.0% agarose gel, and the amplification product of the expected size was excised, purified and cloned into pENTR™/D-TOPO® according to the manufacturer's protocol. Sequencing revealed two clones that each had single, but different, non-synonymous mutations, and these were used to assemble a plasmid designated pENTR™/D-TOPO®-Sf-fdl-FL encoding the full-length, wild type Sf-FDL protein.
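As an illustration only, the touchdown profile described above for the 5'-end amplification (annealing at 65 to 53°C with a 1°C decrease per cycle, followed by 30 cycles at 52°C) can be written out as a per-cycle list of annealing temperatures. The Python sketch below is not part of the original procedure; the function name touchdown_schedule and its parameters are hypothetical and were chosen for this example.

```python
def touchdown_schedule(start_c, end_c, step_c, plateau_c, plateau_cycles):
    """Return the annealing temperature (in °C) for every cycle.

    The ramp starts at start_c and drops by step_c per cycle until it
    reaches end_c; plateau_cycles further cycles then run at plateau_c.
    """
    n_steps = round((start_c - end_c) / step_c)
    ramp = [start_c - i * step_c for i in range(n_steps + 1)]
    return ramp + [plateau_c] * plateau_cycles


# Values taken from the 5'-end amplification described above.
schedule = touchdown_schedule(65.0, 53.0, 1.0, 52.0, 30)
print(len(schedule), schedule[:3], schedule[12:15])
```

With these inputs the ramp covers 13 cycles, which agrees with the "cycled 13 times" stated in the text, and the complete program runs 43 cycles.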
The Dm-fdl open reading frame was amplified from 50 ng of a plasmid designated pIEBac-CG8824Myc in a PCR with 2 U of Phusion™ DNA polymerase, 0.2 mM of each dNTP, 0.1 μg of the FDLFLSP and FDLFLASP primers (Table 1) in Phusion™ HF buffer. This plasmid encodes the Drosophila melanogaster fdl gene open reading frame with a c-Myc epitope tag under the transcriptional control of a baculovirus IE1 promoter. See Geisler et al. (2008) J. Biol. Chem., 283: 11330-11339. These reactions were incubated for 1 min at 98°C, cycled 30 times using (i) 20 sec at 98°C, (ii) 20 sec at 55°C, and (iii) 90 sec at 72°C, and finally incubated for 2 min at 72°C. The spent reaction was analyzed on a 0.8% agarose gel, and the amplification product of the expected size was excised, purified and cloned into pENTR™/D-TOPO® according to the manufacturer's protocol. An error-free clone was identified by sequencing and designated pENTR™/D-TOPO®-Dm-fdl-FL.
Construction of baculovirus transfer plasmids encoding GST-tagged β-N-acetylglucosaminidases — Transfer plasmids encoding N-terminally GST-tagged ectodomains of the various β-N-acetylglucosaminidases examined in this study were also produced using PCR-based approaches. Generally, TMpred (25) was used to predict the sequences encoding the ectodomain of each protein, and then these sequences were amplified using primers designed to introduce SmaI and EcoRI sites on their 5'- and 3'-ends, respectively. Thus, each of the resulting PCR products was designed for subsequent directional cloning into the SmaI and EcoRI sites of the baculovirus transfer plasmid pAcSecG2T (BD Biosciences, San Jose, CA), to position the relevant coding sequences downstream and in-frame with the GST coding sequence in this vector.
The predicted Sf-fdl ectodomain coding sequence was amplified using pENTR™/D-TOPO®-Sf-fdl-FL as the template for a PCR with 0.5 U of Phusion™ DNA polymerase, 0.2 mM of each dNTP, 1 M betaine, 0.2 μM of the SFFDLFL3N2ASP and 10 nM of the SFFDLGST51SP primers (Table 1) in Phusion™ GC buffer. The reaction was incubated for 1 min at 98°C, cycled four times using (i) 20 sec at 98°C, (ii) 20 sec at 58°C, and (iii) 90 sec at 72°C, after which primer SFFDLGST5N2SP was added to 0.2 μM, incubated for 1 min at 98°C, cycled another 30 times using (i) 20 sec at 98°C, (ii) 20 sec at 60°C, and (iii) 90 sec at 72°C, and finally incubated for 2 min at 72°C. The spent reaction was analyzed on a 1.0% agarose gel, and the amplification product of the expected size was excised and purified. The purified amplimer was then treated with 5 U of Taq DNA polymerase (New England Biolabs, Ipswich, MA) for 15 minutes in the presence of 0.2 mM dATP and the manufacturer's standard Taq buffer. The reaction product was cloned into pCR®2.1-TOPO® (Invitrogen) according to the manufacturer's instructions, yielding pCR®2.1-TOPO®-Sf-fdl-SOL. An error-free clone was identified by sequencing and the insert was excised with SmaI and EcoRI, gel-purified, and subcloned into the corresponding sites of pAcSecG2T to produce the transfer plasmid designated pAcSecG2T-Sf-fdl-SOL.
The predicted Dm-fdl ectodomain coding sequence was amplified using pENTR™/D-TOPO®-Dm-fdl-FL as the template for a PCR with 2 U of Phusion™ DNA polymerase, 0.2 mM of each dNTP, and 1 μM of the DMFDLGST3ASP and DMFDLGST5SP primers (Table 1) in Phusion™ HF buffer. The reaction was incubated for 1 min at 98°C, cycled five times using (i) 15 sec at 98°C, (ii) 20 sec at 50°C, and (iii) 75 sec at 72°C, cycled another 30 times using (i) 15 sec at 98°C, (ii) 20 sec at 64°C, and (iii) 75 sec at 72°C, and finally incubated for 2 min at 72°C. The amplimer was subsequently purified, Taq-treated, cloned, sequence-verified and subcloned as described above to produce the intermediate plasmid pCR®2.1-TOPO®-Dm-fdl-SOL and the final baculovirus transfer plasmid, pAcSecG2T-Dm-fdl-SOL. The predicted Sf-GlcNAcase-3/SfHex ectodomain coding sequence was amplified using pENTR™/D-TOPO®-Sf-GlcNAcase3 as the template for a PCR with 2 U of Phusion™ DNA polymerase, 0.2 mM of each dNTP, and 1 μM of the SFGN3GST3ASPB and GN3GST5SPB primers (Table 1) in Phusion™ HF buffer, with cycling conditions identical to those used to generate the Dm-fdl ectodomain amplimer. The resulting product was purified, Taq-treated, cloned into pCR®4-TOPO®, sequence-verified, and subcloned as described above to produce the intermediate plasmid pCR®4-TOPO®-Sf-GlcNAcase3-SOL and the final baculovirus transfer plasmid, pAcSecG2T-Sf-GlcNAcase3-SOL.
Isolation of baculovirus expression vectors — Each of the baculovirus transfer plasmids described in the preceding sections was extracted from large-scale E. coli cultures and purified by isopycnic ultracentrifugation on ethidium bromide-cesium chloride gradients, as described previously (23). The pENTR plasmids were then used to produce recombinant baculoviruses by the BaculoDirect™ method (Invitrogen), according to the manufacturer's protocol. The transfer plasmids encoding GST-tagged β-N-acetylglucosaminidases were used to produce viruses by a standard allelic transplacement method (3,4) with Bsu36I-digested BacPAK6 viral DNA (26) as the target for homologous recombination. Each recombinant baculovirus vector was plaque-purified, amplified in Sf9 cells, and titered by plaque assay on Sf9 cells, as described previously (4). The recombinant viruses encoding various full-length, untagged β-N-acetylglucosaminidase genes were designated AcSfGlcNAcase-3 (18), AcDm-FDL, and AcSf-FDL and those encoding N-terminally GST-tagged ectodomains of the various β-N-acetylglucosaminidases were designated AcGSTSfGlcNAcase-3, AcGSTDm-FDL, and AcGSTSf-FDL, respectively. The parental virus used to produce these viruses, which also served as a negative control for some of the experiments included in this study, was Autographa californica nucleopolyhedrovirus (AcMNPV).
Expression of recombinant proteins in insect cells — Sf9 cells were seeded into 100 mL of ESF 921 medium in 250 mL DeLong flasks (Corning Glass Works, Corning, NY) and allowed to grow to a density of about 1.5-2.0 × 10⁶ cells/mL at 28°C and 125 rpm in a Forma Model 4580 rotary platform shaker-incubator (Forma Scientific, Inc., Marietta, OH). The cells were then infected with the appropriate baculovirus at a multiplicity of infection of about 1 plaque-forming unit/cell and incubated for another 72 h under the same conditions.
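The infection step above fixes the cell density, culture volume, and multiplicity of infection, so the required virus inoculum follows by simple arithmetic once a stock titer is known. The Python sketch below works through that calculation. The titer of 1 × 10⁸ plaque-forming units/mL is a hypothetical value assumed for illustration and does not come from this specification, and the function name is likewise invented for the example.

```python
def virus_volume_ml(cells_per_ml, culture_ml, moi, titer_pfu_per_ml):
    """Volume of virus stock (mL) needed to reach the requested MOI."""
    total_cells = cells_per_ml * culture_ml
    pfu_needed = total_cells * moi
    return pfu_needed / titer_pfu_per_ml


ASSUMED_TITER = 1e8  # pfu/mL; hypothetical stock titer, not from the text

# 100 mL cultures at the two ends of the stated density range, MOI of 1.
for density in (1.5e6, 2.0e6):
    volume = virus_volume_ml(density, 100, 1, ASSUMED_TITER)
    print(f"{density:.1e} cells/mL -> {volume:.2f} mL of virus stock")
```

Under the assumed titer, a 100 mL culture would need roughly 1.5 to 2.0 mL of stock; a different titer scales the volume inversely.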
Isolation of purified microsomal fractions — The isolation of microsomal fractions from baculovirus-infected Sf9 cells has been described previously (18). Briefly, the cells were Dounce-homogenized and microsomes were isolated by ultracentrifugation onto sucrose cushions. The microsomes were solubilized in β-N-acetylglucosaminidase assay buffer (100 mM citrate-phosphate buffer, pH 6.0) containing 0.5% (v/v) Triton-X-100, total protein concentrations were determined using a commercial bicinchoninic acid assay (Pierce Biotechnology Inc., Rockford, IL), and samples containing equal amounts of total protein were assayed for β-N-acetylglucosaminidase activity, as described below.
For a subset of these experiments, which was designed to examine the nature of the association between the enzyme activity and membranes, freshly prepared microsomes were either held or sonicated on ice with ten pulses from a Branson Model 450 Sonifier (Danbury, CT) adjusted to 50% output. The microsomes were then pelleted by centrifugation for 10 min at top speed in a microcentrifuge (Hermle Model Z180M) and the pellets were resuspended in β-N-acetylglucosaminidase assay buffer. The sonication and centrifugation steps were repeated, the final pellets were resuspended in β-N-acetylglucosaminidase assay buffer containing 0.5% (v/v) Triton-X-100 (Sigma Chemical Company, St. Louis, MO), and then the solubilized microsomes were assayed for β-N-acetylglucosaminidase activity, as described below.
Glutathione affinity chromatography — The GST-tagged ectodomains of the various β-N-acetylglucosaminidases examined in this study were purified from the extracellular fraction of Sf9 cells infected with AcGSTSfGlcNAcase-3, AcGSTDm-FDL, or AcGSTSf-FDL. Briefly, the cells were removed from each infected cell culture at 72 h postinfection by centrifugation for 5 min at 1000 × g, and the supernatant was harvested and ultracentrifuged for 30 min in a Ti45 rotor at 30,000 rpm and 4°C in a Beckman Optima 100XL ultracentrifuge (Beckman Coulter; Fullerton, CA). The resulting supernatant was diluted with an equal volume of ice-cold GST purification buffer (25 mM Tris, 150 mM NaCl, 1 mM EDTA, pH 8.0), solid ammonium sulfate was added to 90% saturation, and the samples were stirred on ice until the salt was fully dissolved. The samples were subsequently ultracentrifuged for 20 minutes in a Ti45 rotor at 30,000 rpm and 4°C and the resulting pellet was re-dissolved in a minimal volume of GST purification buffer. The samples were then transferred to dialysis tubing with a 50 kDa molecular weight cutoff (Spectrum Medical Industries Inc.; Laguna Hills, CA) and dialyzed overnight at 4°C against 100 volumes of GST purification buffer supplemented with 1 mM phenylmethylsulfonyl fluoride (PMSF). Each GST-tagged protein was then adsorbed to a 1.5 mL bed volume of Glutathione Sepharose 4 Fast Flow (GE Healthcare; Uppsala, Sweden) pre-equilibrated with GST purification buffer in a plugged 20 mL Econo-Pac column (BioRad; Hercules, CA) for one hour at 4°C on a shaking platform. Subsequently, the fluid was drained from the column, the affinity matrix was washed twice with 10 mL of GST purification buffer, and the GST-tagged proteins were eluted with GST purification buffer supplemented with 5 mM reduced glutathione. Fractions were collected and purity was assessed by SDS-PAGE with Coomassie Blue staining, the presence of the GST-tagged proteins was assessed by SDS-PAGE and immunoblotting with a GST-specific antiserum, and enzymatic activity was assessed using p-nitrophenyl-β-N-acetylglucosaminide as the substrate, as described previously (18).
β-N-acetylglucosaminidase activity assays — Enzyme activity assays were performed using either solubilized microsomal fractions or affinity-purified recombinant proteins isolated from baculovirus-infected Sf9 cells. For the microsomal membrane assays, microsomes were prepared and extracted as described above and samples containing equal amounts of total protein were assayed in a total volume of 0.050 mL containing 25 pmol of various pyridylamine (PA)-tagged glycan substrates. The enzymatic activity of the affinity-purified recombinant proteins was assayed under identical conditions, except the amounts of purified protein used for these assays were equalized by immunoblotting, rather than by total protein assays. The substrates used in this study included GlcNAcβ2Manα6(GlcNAcβ2Manα3)Manβ4GlcNAcβ4GlcNAc-PA (GnGn; CalBiochem, La Jolla, CA), GlcNAcβ2Manα6(Manα3)Manβ4GlcNAcβ4GlcNAc-PA (GnM), and Manα6(GlcNAcβ2Manα3)Manβ4GlcNAcβ4GlcNAc-PA (MGn). After being incubated for various times at 37°C, each reaction was diluted to 0.150 mL with β-N-acetylglucosaminidase reaction buffer and the products were analyzed by reverse phase high performance liquid chromatography, as described previously (27). GnGn, GnM, MGn, and Manα6(Manα3)Manβ4GlcNAcβ4GlcNAc-PA (MM) were used as standards for the chromatographic analyses.
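The assay description above gives substrate amounts in pmol and volumes in mL, so the working concentrations are implicit. The Python sketch below simply converts those stated figures to micromolar values, using the identity that 1 pmol per μL equals 1 μM. The function name is a hypothetical helper, and the diluted value assumes that no PA-tagged glycan is lost before chromatography, which the text does not state.

```python
def micromolar(amount_pmol, volume_ml):
    """Concentration in µM; 1 pmol per µL is the same as 1 µM."""
    volume_ul = volume_ml * 1000.0
    return amount_pmol / volume_ul


# 25 pmol of PA-tagged glycan in the 0.050 mL assay volume.
assay_um = micromolar(25, 0.050)

# Total PA-tagged glycan (substrate plus products) after dilution to 0.150 mL,
# assuming nothing is lost on dilution.
diluted_um = micromolar(25, 0.150)

print(f"assay: {assay_um:.3f} uM, after dilution: {diluted_um:.3f} uM")
```

On these figures the substrate is present at about 0.5 μM during the reaction and the labeled glycan pool is at about 0.17 μM when loaded for HPLC.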
RNA interference — In general, the RNA interference approach used in this study involved transforming Sf9 cells with an immediate early expression plasmid encoding an inverted repeat derived from a portion of the Sf-fdl coding sequence, with the inverted repeat separated by a Drosophila melanogaster white gene intron, as originally described by Lee and Carthew (26). Briefly, the Sf-fdl coding sequence from nucleotides 355 to 855 was amplified using pENTR™/D-TOPO®-Sf-fdl-FL as the template for a PCR with 0.5 U of Phusion™ DNA polymerase, 0.2 mM of each dNTP, and 1 mM each of the SFFDLRNAIASP and SFFDLRNAISP primers (Table 1), which introduced XbaI sites onto both ends, in Phusion™ HF buffer. The reaction was incubated for 30 seconds at 98°C, cycled five times using (i) 20 sec at 98°C, (ii) 20 sec at 54°C, and (iii) 30 sec at 72°C, cycled another 30 times using (i) 20 sec at 98°C, (ii) 20 sec at 64°C, and (iii) 30 sec at 72°C, and finally incubated for 2 min at 72°C. The spent reaction was analyzed on a 1.2% agarose gel and the amplification product was excised, purified, Taq-treated, and cloned into pCR4®-TOPO® to produce pCR4®-TOPO®-SfFdlRNAi. An error-free clone was identified by sequencing and the insert was excised with XbaI and gel-purified. One copy of the insert was subcloned in antisense orientation into the AvrII site and a second copy was subcloned in sense orientation into the NheI site of pGEM-WIZ (27) (obtained from the Drosophila Genomics Resource Center) to produce pGEM-WIZ-SfFdlRNAi. Finally, the Sf-fdl inverted repeat and white gene intron cassette was excised with SacII and NotI and subcloned into the corresponding sites of pIE1HR3 (29) to produce pIE1HR3SfFdlRNAi. This plasmid was used along with pIE1Neo to co-transfect Sf9 cells using a modified calcium phosphate method, as described previously (4). The transfected cells were then selected and neomycin-resistant clones were isolated by limiting dilution, as described previously (29). The levels of specific, processing β-N-acetylglucosaminidase activity in the parental and transformed cells were finally compared by HPLC analysis of the products obtained by reacting microsomal membrane preparations with GnGn, as described above.
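The inverted-repeat design described above can be pictured as an antisense copy of the fragment, a spacer, and a sense copy joined in one strand. The Python sketch below shows only that layout on a toy sequence. The six-base fragment and the "NNNN" spacer are placeholders, not Sf-fdl or white intron sequence, and placing the antisense arm upstream of the spacer is an assumption made for the example.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA string (other symbols pass through)."""
    return seq.translate(COMPLEMENT)[::-1]


def inverted_repeat(fragment, spacer):
    """Antisense arm + spacer + sense arm (arm order is an assumption)."""
    return reverse_complement(fragment) + spacer + fragment


# Length of the amplified region, nucleotides 355 to 855 inclusive.
fragment_len = 855 - 355 + 1

toy_fragment = "ATGGCC"  # placeholder, not real Sf-fdl sequence
toy_cassette = inverted_repeat(toy_fragment, "NNNN")
print(fragment_len, toy_cassette)
```

Counting both endpoints, the stated coordinates correspond to a 501-nucleotide arm; a transcript of such a cassette can fold back on itself because the two arms are complementary.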
The following examples are provided to illustrate certain embodiments of the invention. These examples are not intended to limit the invention in any way.
EXAMPLE I
CLONING OF A NOVEL PROCESSING β-N-ACETYLGLUCOSAMINIDASE FROM SF9 CELLS AND CHARACTERIZATION THEREOF
Isolation and characterization of an fdl gene homolog from Sf9 cells
Our effort to isolate an fdl gene homolog from Sf9 cells was informed and facilitated by the availability of genome sequence data from several insect species and also by our previous efforts to isolate the gene encoding the processing β-N-acetylglucosaminidase activity from this cell line. tBLASTn analysis of the A. aegypti, A. gambiae, A. mellifera, B. mori, D. pseudoobscura and T. castaneum genomes with the Dm-fdl gene as the query yielded exons from each species encoding peptides phylogenetically related to Dm-fdl (data not shown). We subsequently used a splice site prediction algorithm to join the relevant exons and identify open reading frames encoding at least partial, putative β-N-acetylglucosaminidases from each insect species. Importantly, a CLUSTAL-W alignment of the predicted products of these open reading frames with the Dm-fdl gene product revealed conserved amino acid sequences that were not conserved in the SfGlcNAcase-1 or SfGlcNAcase-3 gene products identified in our previous study (18). These were used to design degenerate oligonucleotides for high fidelity PCRs with Sf9 cDNA or genomic DNA as the templates. These PCRs yielded an amplification product of about the expected size (420 bp), which appeared to be specific because it was not observed in control reactions in which either one of the degenerate oligonucleotides was excluded (data not shown). This product was directly sequenced and the translation product was found to be highly similar to a fragment of the D. melanogaster and putative B. mori FDL proteins (data not shown). Accordingly, we used this sequence to design gene-specific primers for 5'- and 3'-RACE reactions, which yielded the nucleotide sequence of the full length, putative Sf-fdl open reading frame, as detailed above.
The 5'-RACE reactions yielded a specific 1.4 Kb amplification product, which overlapped with the sequence of the original degenerate PCR product, extended it by 1161 bp in the 5' direction, and included a potential translational initiation site (data not shown). The 3'-RACE reactions yielded a specific 1.0 Kb amplification product, which
also overlapped with the sequence of the original degenerate PCR product, extended it by 734 bp in the 3' direction, and encoded a translational termination site. A contiguous nucleotide sequence of 2319 bp was assembled by joining the sequences of the degenerate amplimer, the 5'-RACE product, and the 3'-RACE product. The accuracy of this sequence was confirmed by PCR with gene specific primers using both Sf9 cDNA and genomic DNA as the templates, followed by direct sequencing of the products, as described in Experimental Procedures.
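The lengths quoted above allow a quick consistency check. If the 2319 bp contig is taken to consist only of the original degenerate amplimer plus the two RACE extensions (an assumption; the text does not break the contig down this way), the size of the central amplimer follows by subtraction. The variable names in this Python sketch are illustrative.

```python
CONTIG_BP = 2319      # assembled contiguous sequence
EXTENSION_5P = 1161   # added in the 5' direction by 5'-RACE
EXTENSION_3P = 734    # added in the 3' direction by 3'-RACE

# Implied length of the original degenerate PCR product, assuming the
# contig holds nothing beyond the amplimer and the two extensions.
core_amplimer_bp = CONTIG_BP - EXTENSION_5P - EXTENSION_3P
print(core_amplimer_bp)
```

The result, 424 bp, is in line with the amplification product "of about the expected size (420 bp)" reported for the degenerate PCR.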
In silico analysis of the Sf9 cell fdl gene homolog — The full-length Sf-fdl nucleotide sequence and theoretical amino acid sequence of the Sf-FDL polypeptide are shown in Fig. 1. The nucleotide sequence includes a single long open reading frame of 1896 bp, which has a GC content of 69%. The theoretical product of this open reading frame is a polypeptide consisting of 631 amino acids, which has a calculated molecular mass of 70,530 Da and a calculated isoelectric point of 7.18. The theoretical protein also has an N-terminal transmembrane domain (underlined in Fig. 1), which extends from amino acids 25 to 42 with in/out topology, according to the TMHMM and TopPred2 algorithms (29). Thus, the putative S. frugiperda FDL polypeptide appears to be a type II transmembrane protein with a short cytoplasmic tail. This is consistent with the idea that the Sf-fdl gene encodes an N-glycan processing enzyme because all N-glycan processing enzymes characterized to date have been predicted or shown to be transmembrane proteins with type II topology (30-32). The putative Sf9 cell enzyme also includes two potential N-glycosylation sites, which are boxed in the amino acid sequence shown in Fig. 1.
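The figures in the preceding paragraph are mutually consistent, as the short Python sketch below shows. It assumes that the 1896 bp open reading frame length includes the stop codon, which is not stated explicitly but reproduces the 631-residue count, and it derives the domain lengths directly from the stated transmembrane boundaries. The variable names are illustrative.

```python
ORF_BP = 1896
MASS_DA = 70530
TM_START, TM_END = 25, 42  # transmembrane domain, residue numbers

codons, remainder = divmod(ORF_BP, 3)
residues = codons - 1  # assumes the ORF length counts the stop codon

cytoplasmic_tail = TM_START - 1          # residues before the TM domain
tm_length = TM_END - TM_START + 1        # residues within the TM domain
lumenal_domain = residues - TM_END       # residues after the TM domain
mean_residue_mass = MASS_DA / residues   # average mass per residue, Da

print(residues, cytoplasmic_tail, tm_length, lumenal_domain)
print(round(mean_residue_mass, 1))
```

On these numbers the protein has a 24-residue N-terminal tail, an 18-residue membrane span, and a 589-residue C-terminal domain, which fits the description of a type II transmembrane protein with a short cytoplasmic tail.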
A phylogenetic analysis of the predicted Sf-fdl gene product showed that it is related to known hexosaminidases, including the human alpha (Acc. No. NM_000520; 33) and beta (Acc. No. NM_000521; 34) hexosaminidases, as well as SfGlcNAcase-1 (Acc. No. DQ249307; 18) and SfGlcNAcase-3/Sfhex (Acc. No. DQ249309; 17,18), as expected (Fig. 2). Strikingly, however, this analysis also revealed that the predicted Sf-fdl gene product is much more closely related to the Dm-fdl gene product (Acc. No. NM_165909; 16) than to either of the S. frugiperda hexosaminidase gene products, despite the fact that Spodoptera and Drosophila belong to distinct insect Orders, which diverged well over 300 million years ago (35). Together, these results indicated that we had successfully isolated a Dm-fdl gene homolog from Sf9 cells. In addition, the much closer relationship between these two genes and the more distant relationship between the Dm-fdl and SfGlcNAcase-3/Sfhex genes supports the conclusion that this newly-isolated gene encodes the specific, processing β-N-acetylglucosaminidase in Sf9 cells.
Expression and biochemical analysis of the native Sf-fdl gene product — The full-length Sf-FDL coding sequence was subcloned into a baculovirus transfer plasmid and the resulting construct was used to isolate a recombinant baculovirus, AcSf-FDL, that was used, in turn, for high-level expression of the native cDNA product in insect cells. The parental baculovirus (AcMNPV) was used as a negative control and recombinant baculoviruses encoding Dm-FDL (AcDm-FDL) or Sf-GlcNAcase-3/SfHex (AcSfGlcNAcase-3) were used to directly compare the enzymatic activities of the Sf-fdl, Dm-fdl, and Sf-GlcNAcase-3/SfHex gene products. Individual Sf9 cell cultures were infected with the appropriate baculoviruses and then crude microsomal membrane fractions were prepared and assayed for enzymatic activity with various PA-tagged glycans as substrates, as described above. The results showed that negative control microsomes from AcMNPV-infected cells had very little effect on GnGn, while microsomes isolated from either AcDm-FDL- or AcSf-FDL-infected cells converted this substrate to GnM (Fig. 3A, top three panels). In contrast, microsomes from AcSfGlcNAcase-3-infected cells converted GnGn to both GnM and MM in parallel assays (Fig. 3A, bottom panel). These results indicated that Sf-FDL and Dm-FDL specifically removed only the terminal N-acetylglucosamine residue from the α3-branch of GnGn, whereas SfGlcNAcase-3/SfHex had a broader spectrum of activity and removed the terminal N-acetylglucosamine residues from both branches of GnGn in these assays.
This was supported by the results of additional assays in which other glycans were used as substrates. Microsomes from cells infected with AcDm-FDL, AcSf-FDL, or AcSfGlcNAcase-3 all removed the terminal N-acetylglucosamine residue from the α3-branch of MGn to produce MM, as expected (Fig. 3B, lower three panels). Significantly, however, microsomes from AcDm-FDL- and AcSf-FDL-infected cells (Fig. 3C, middle two panels) failed to remove the terminal N-acetylglucosamine from the α6-branch of GnM, while those from AcSfGlcNAcase-3-infected cells (Fig. 3C, bottom panel) clearly converted GnM to MM. These results confirmed that Sf-FDL and Dm-FDL are more highly specific enzymes that remove only the terminal N-acetylglucosamine residue from the α3-branch of glycan substrates in these assays. This substrate specificity distinguishes these enzymes from the SfGlcNAcase-3/SfHex gene product, as the latter readily removed the terminal N-acetylglucosamine residues from both branches of biantennary glycan substrates.
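The branch specificities described above can be summarized with a small model in which each glycan is reduced to two flags, one per branch, and each enzyme to the set of branches it can trim. The Python sketch below is an idealized illustration of that logic, not a description of the assay: it lists every structure reachable by repeated trimming and ignores kinetics, so it says nothing about relative product amounts. All names in the code are hypothetical.

```python
from typing import NamedTuple


class Glycan(NamedTuple):
    a6_glcnac: bool  # terminal GlcNAc present on the α6 branch
    a3_glcnac: bool  # terminal GlcNAc present on the α3 branch


NAMES = {
    (True, True): "GnGn",
    (True, False): "GnM",
    (False, True): "MGn",
    (False, False): "MM",
}


def products(substrate, removes_a3, removes_a6):
    """Names of all structures reachable from substrate by trimming."""
    seen = set()
    frontier = [substrate]
    while frontier:
        glycan = frontier.pop()
        trimmed = []
        if removes_a3 and glycan.a3_glcnac:
            trimmed.append(Glycan(glycan.a6_glcnac, False))
        if removes_a6 and glycan.a6_glcnac:
            trimmed.append(Glycan(False, glycan.a3_glcnac))
        for nxt in trimmed:
            if nxt not in seen:
                seen.add(nxt)
                frontier.append(nxt)
    return {NAMES[tuple(g)] for g in seen}


GNGN = Glycan(True, True)
GNM = Glycan(True, False)
MGN = Glycan(False, True)

fdl_like = dict(removes_a3=True, removes_a6=False)    # α3-specific enzyme
broad_like = dict(removes_a3=True, removes_a6=True)   # trims both branches

print(products(GNGN, **fdl_like), products(GNM, **fdl_like))
print(products(GNGN, **broad_like), products(GNM, **broad_like))
```

In this model an α3-specific enzyme turns GnGn into GnM, turns MGn into MM, and leaves GnM untouched, whereas an enzyme acting on both branches can reach MM from any of the three substrates.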
The relatively broader spectrum of activity observed with the SfGlcNAcase-3/SfHex gene product was underscored by the ability of microsomes from AcSfGlcNAcase-3-infected cells to efficiently convert chitotriose to chitobiose and chitobiose to a PA-tagged N-acetylglucosamine residue (Fig. 3D, bottom panel). In contrast, microsomes from AcDm-FDL- or AcSf-FDL-infected cells (Fig. 3D, middle two panels) converted only small amounts of chitotriose to chitobiose and produced no PA-tagged N-acetylglucosamine. In fact, microsomes from AcMNPV-infected cells (Fig. 3D, top panel) produced nearly as much chitobiose as the microsomes from AcDm-FDL- or AcSf-FDL-infected cells, suggesting that the apparent ability of these latter two enzymes to hydrolyze chitotriose was an artifact resulting from contaminating chitinase activity in the crude microsomal preparations.
The results of the experiments described in this specification show that the Sf-fdl gene product is orthologous to the Dm-fdl gene product, which is responsible for N-glycan processing in D. melanogaster (16), and paralogous to the SfGlcNAcase-3/SfHex gene product, which we previously concluded was more likely to be responsible for N-glycan and/or chitin degradation in S. frugiperda (18). Furthermore, the isolation of a Dm-fdl ortholog from Sf9 cells, its substrate specificity, and the relatively non-specific nature of the SfGlcNAcase-3/SfHex gene product provide compelling evidence to suggest that the former, not the latter, is the N-glycan processing enzyme in Sf9 cells.
pH optimum of the Sf-fdl gene product — In their seminal study on the endogenous processing β-N-acetylglucosaminidase activity in microsomal membranes isolated from Sf21 cells, Altmann and coworkers (9) found that it had a pH optimum for GnGn hydrolysis of 6.0. This was consistent with the idea that the activity measured in these assays was involved in N-glycan processing, rather than degradation, because a processing enzyme would be expected to reside in the Golgi apparatus and have a pH optimum around 6.0-6.5, whereas a degradative enzyme would be expected to reside in the lysosomal compartment and have a more acidic optimal pH. We found that the SfGlcNAcase-3/SfHex gene product had a pH optimum of 5.5 and took this as one line of evidence that this enzyme could not account for the processing activity identified by Altmann and coworkers (1995) and was more likely to be involved in N-glycan or chitin degradation (18). Tomiya and coworkers also found that the SfGlcNAcase-3/SfHex gene product had a pH optimum of 5.5, but concluded that it is involved in N-glycan processing in Sf9 cells because it has the same pH optimum as Dm-FDL (16) and it is at least partially active at the higher pH of secretory compartments, such as the trans-Golgi network (17). Hence, it was of interest to examine the optimal pH of the Sf-fdl gene product. Microsomal membranes were isolated from AcSf-FDL-infected Sf9 cells and assayed for GnGn hydrolysis at various pH values. The results showed that the pH optimum of the Sf-fdl gene product is 6.0 and that it has nearly optimal activity at pH 6.5, as well (Fig. 4). Thus, the pH optimum of the Sf-fdl gene product is identical to that of the processing activity originally identified in microsomal fractions from Sf21 cells by Altmann and coworkers (9). Furthermore, the range of optimal or near-optimal pH values for this enzyme more clearly encompasses the range of pH values found within late secretory pathway compartments, such as the trans-Golgi network, than does that of the SfGlcNAcase-3/SfHex gene product. Thus, these results also support the idea that the Sf-fdl gene product, not the SfGlcNAcase-3/SfHex gene product, is the specific, processing β-N-acetylglucosaminidase in Sf9 cells.
Biochemical analysis of the purified ectodomain of Sf-FDL
Each of the biochemical assays performed to this point in our study had involved the use of crude microsomes isolated from Sf9 cells infected with recombinant baculoviruses encoding the relevant β-N-acetylglucosaminidases. These assays were relevant because they mimicked the original assays of the endogenous processing β-N-acetylglucosaminidase activity in Sf21 cells and provided data on the substrate specificities of full-length, untagged forms of each of the enzymes of interest. However, one criticism of these assays is that they did not involve the use of purified enzymes. To address this issue, we isolated recombinant baculoviruses encoding N-terminally GST-tagged ectodomains of Sf-FDL, Dm-FDL, and SfGlcNAcase-3/Sfhex, as described in Experimental Procedures. Each was expressed in Sf9 cells and purified from the extracellular fraction using glutathione affinity chromatography, as described in Experimental Procedures. Analysis of the purified products by SDS-PAGE with Coomassie Blue staining (Fig. 5A) or immunoblotting with anti-GST (Fig. 5B) established that each had been effectively purified and normalized. Subsequently, equivalent amounts of these purified protein preparations were assayed for β-N-acetylglucosaminidase activity using various glycan substrates.
The results of these assays showed that the GST-tagged ectodomains of both Dm-FDL and Sf-FDL converted GnGn to GnM (Fig. 6A; middle two panels), while the GST-tagged ectodomain of the SfGlcNAcase-3/Sfhex protein converted this substrate to GnM, MGn, and MM (Fig. 6A, bottom panel). All three enzymes removed the terminal N-acetylglucosamine residue from the α3-branch of MGn to produce MM (Fig. 6B), as expected, but only the SfGlcNAcase-3/Sfhex protein removed the terminal N-acetylglucosamine from the α6-branch of GnM to produce MM (Fig. 6C, bottom panel). Neither Dm-FDL nor Sf-FDL had any detectable effect on this glycan (Fig. 6C, middle two panels). Similarly, only the SfGlcNAcase-3/Sfhex protein hydrolyzed chitotriose to produce chitobiose and PA-tagged N-acetylglucosamine monomers (Fig. 6D; bottom panel), while Dm-FDL and Sf-FDL had virtually no effect on this glycan (Fig. 6D, middle two panels).
These data supported the major conclusion drawn from the experiments performed with the full-length, untagged forms of these enzymes, which was that Dm-FDL and Sf-FDL are specific for the terminal N-acetylglucosamine on the α3-branch of biantennary N-glycan substrates, while SfGlcNAcase-3/SfHex has a much broader spectrum of β-N-acetylglucosaminidase activity. Again, the specificities of Sf-FDL and Dm-FDL are consistent with their proposed function in N-glycan processing and with the conclusion that the Sf-fdl gene encodes the membrane bound, processing β-N-acetylglucosaminidase activity originally identified in Sf21 cells by Altmann and coworkers (1995).
To examine their substrate specificities more stringently, we incubated the purified, GST-tagged ectodomains of Dm-FDL, Sf-FDL, and SfGlcNAcase-3/SfHex with the various synthetic glycan substrates for 20 h to achieve a ten-fold increase in the enzyme assay times (Fig. 7). The results of these assays verified that Dm-FDL and Sf-FDL are highly specific, even under this extreme condition. It can be seen that Sf-FDL produced tiny amounts of MGn from GnGn (Fig. 7A, middle panel) and tiny amounts of MM from GnM (Fig. 7C, middle panel). In addition, both Sf-FDL and Dm-FDL produced tiny amounts of chitobiose from chitotriose, and none of the aforementioned products were observed when the relevant glycan substrates were mock-digested with elution buffer alone (data not shown). Nevertheless, Sf-FDL is clearly much more specific than SfGlcNAcase-3/SfHex and one would reasonably question the physiological relevance of the small amounts of conversion obtained under these extreme in vitro reaction conditions.
EXAMPLE II CLONING OF THE TnFDL AND BmFDL GENES
Common materials and methods
All PCRs were carried out in a final volume of 50 μL in 1X Phusion™ GC buffer with 0.2 mM of each dNTP, 1 μM of each primer, 1 M betaine, 0.6 U of Phusion™ DNA polymerase (NEB, Ipswich, MA) and 1 μL of template, except where indicated otherwise. All PCRs were carried out in a GeneAmp Model 2400 thermal cycler (Eppendorf, Foster City, CA). DNA extraction from agarose gel fragments was carried out using the QiaQuick™ Gel Extraction Kit (Qiagen, Valencia, CA) according to the manufacturer's instructions, and the DNA was eluted in 50 μL.
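As a quick check on the reaction set-up above, the absolute amount of each component in one 50 μL reaction can be worked out from the stated final concentrations. The following is a minimal sketch for illustration only; the function and variable names are hypothetical and are not part of the original protocol.

```python
# Illustrative sketch: amounts per reaction derived from the stated final
# concentrations. Names are hypothetical; only the numbers come from the text.
REACTION_VOLUME_UL = 50.0
POLYMERASE_UNITS = 0.6

# Final concentrations from the text, expressed in mol/L.
FINAL_CONC_M = {
    "each dNTP": 0.2e-3,   # 0.2 mM
    "each primer": 1e-6,   # 1 uM
    "betaine": 1.0,        # 1 M
}


def amount_in_mol(conc_molar, volume_ul):
    """Return moles of solute contained in volume_ul microlitres."""
    return conc_molar * volume_ul * 1e-6


amounts = {name: amount_in_mol(conc, REACTION_VOLUME_UL)
           for name, conc in FINAL_CONC_M.items()}
units_per_ul = POLYMERASE_UNITS / REACTION_VOLUME_UL

for name, mol in amounts.items():
    print(f"{name}: {mol:.3g} mol per reaction")
print(f"polymerase: {units_per_ul:.3g} U per uL")
```

Under these figures each reaction contains about 10 nmol of each dNTP, 50 pmol of each primer and 50 μmol of betaine.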
Sequences obtained from degenerate and semidegenerate PCRs, TOPO® clones and RACE reactions were analyzed and assembled into full-length mRNA and genomic DNA sequences using ContigExpress, a component of Invitrogen Vector NTI 10.3.0.
T. ni fdl
Degenerate PCR
Genomic DNA was isolated from T. ni cells (Tn-4h cell line) according to the method of Laird et al. (Laird et al., 1991, Nucleic Acids Res. 19:4293). Degenerate PCRs were carried out using T. ni genomic DNA with the primers ASPDEG and SPDEG as described previously (Geisler et al., 2008, J. Biol. Chem. 283:11330-11339). The spent reactions were separated on a 1.2% agarose gel and specific amplification products of the expected size (420 bp) were recovered from the gel, purified and directly sequenced with the same primers as used in the PCR.
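For readers unfamiliar with degenerate primers, the sketch below shows how a primer written with IUPAC ambiguity codes corresponds to a pool of distinct oligonucleotides. The example sequence is hypothetical and is not the ASPDEG or SPDEG primer, whose sequences are not reproduced in this section.

```python
from itertools import product

# IUPAC nucleotide ambiguity codes mapped to the bases they represent.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def degeneracy(primer):
    """Number of distinct sequences encoded by a degenerate primer."""
    count = 1
    for base in primer.upper():
        count *= len(IUPAC[base])
    return count


def expand(primer):
    """List every concrete sequence encoded by a degenerate primer."""
    choices = (IUPAC[base] for base in primer.upper())
    return ["".join(combo) for combo in product(*choices)]


# Hypothetical degenerate primer, for illustration only.
example = "GAYTGGACNCAR"
print(degeneracy(example))
print(expand(example)[:3])
```

A primer pool with high degeneracy dilutes each individual sequence, which is one reason semi-degenerate reactions (one specific primer, one degenerate primer) are often used to extend a fragment once some sequence is known.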
Semi-degenerate PCR
Semi-degenerate PCRs were carried out using T. ni genomic DNA to extend the sequence of the degenerate fragment towards both the 3' and the 5' end. Degenerate primers were designed against regions that are highly conserved between the SfFDL and the BmFDL conceptual translations. To obtain part of the TnFDL 5' end, a semi-degenerate PCR was carried out using primers TnFDL ASP2 and TnFDL SP4DEG. The PCR was incubated for 20 sec at 98°C, then cycled 25 times using (i) 10 sec at 98°C, (ii) 15 sec at 72 to 60°C (with a decreasing temperature gradient of 0.5°C per cycle), and (iii) 60 sec at 72°C. The reaction was cycled another 30 times using (i) 10 sec at 98°C, (ii) 15 sec at 60°C, and (iii) 60 sec at 72°C, and finally incubated for 2 min at 72°C. The spent reaction was separated on a 1.4% agarose gel and a specific amplification product of about the expected size (1100 bp) was recovered from the gel and purified. This DNA fragment was re-amplified with the TnFDL ASP3 and TnFDL SP4DEG primers under the same conditions, gel purified and directly sequenced using the same primers as used in the PCR.
To obtain part of the 3' end, a semi-degenerate PCR was carried out using primers TnFDL SP1 and TnFDL ASP6DEG with the same cycling conditions as specified above. The spent reaction was separated on a 1.4% agarose gel and a specific amplification product of about the expected size (730 bp) was recovered from the gel and purified. This fragment was cloned into pCR®2.1-TOPO® according to the manufacturer's instructions, and three clones were sequenced to yield a consensus sequence.
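The touchdown stage of the semi-degenerate PCR lowers the annealing temperature by a fixed step each cycle. A minimal sketch of that schedule is given below; the function name is hypothetical, and the only inputs are the start temperature, end temperature and per-cycle step stated in the text.

```python
def touchdown_schedule(start_c, end_c, step_c):
    """Annealing temperature (deg C) for each cycle of a touchdown stage.

    The temperature starts at start_c and drops by step_c per cycle until
    end_c is reached, so the number of cycles is fixed by these three values.
    """
    n_cycles = int(round((start_c - end_c) / step_c)) + 1
    return [start_c - i * step_c for i in range(n_cycles)]


# Semi-degenerate PCR: 72 to 60 deg C in 0.5 deg C steps.
semi_degenerate = touchdown_schedule(72.0, 60.0, 0.5)
print(len(semi_degenerate), semi_degenerate[0], semi_degenerate[-1])
```

The schedule contains 25 temperatures, which agrees with the 25 touchdown cycles specified for these reactions. The same function applied to a 72 to 60°C range in 1°C steps gives 13 temperatures.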
5' RACE
Total RNA was isolated from a mid-log culture of Tn-4h cells using the Qiagen RNeasy® Plus Mini Kit according to the manufacturer's instructions. 5' RACE-ready RNA was prepared from total RNA using the Invitrogen GeneRacer™ kit according to the manufacturer's instructions. Reverse transcription was carried out using Thermo-X™ reverse transcriptase with the TnFDL ASP1 primer. The reaction was set up according to the manufacturer's instructions and incubated for (i) 5 min at 50°C, (ii) 15 min at 55°C, (iii) 30 min at 60°C and finally for (iv) 15 min at 60°C. The reaction was diluted with 40 μL TE buffer and stored at -20°C.
5' RACE was carried out using the TnFDL ASP6 primer and the GeneRacer™ 5' Primer. The PCR was incubated for 30 sec at 96°C, then cycled 5 times using (i) 20 sec at 96°C, (ii) 60 sec at 72°C, after which the reaction was cycled 13 times using (i) 20 sec at 96°C, (ii) 20 sec at 72 to 60°C (with a decreasing temperature gradient of 1°C per cycle), and (iii) 40 sec at 72°C. The reaction was then cycled another 30 times using (i) 20 sec at 90°C, (ii) 20 sec at 60°C, and (iii) 40 sec at 72°C. The spent reaction was separated on a 1.4% agarose gel, and a specific amplification product of about 520 bp was isolated and purified. This DNA fragment was re-amplified using the GeneRacer™ 5' Nested Primer and either the TnFDL ASP6 or ASP7 primer. Reactions were incubated for 30 sec at 96°C, then cycled 5 times using (i) 20 sec at 96°C, (ii) 45 sec at 72°C, after which the reactions were cycled 13 times using (i) 20 sec at 96°C, (ii) 15 sec at 72 to 60°C (with a decreasing temperature gradient of 1°C per cycle), and (iii) 30 sec at 72°C. The reactions were then cycled another 30 times using (i) 20 sec at 96°C, (ii) 15 sec at 60°C, and (iii) 30 sec at 72°C. The spent reactions were separated on a 1.4% agarose gel, and specific amplification products of about 520 and 500 bp were isolated and directly sequenced using the TnFDL ASP6 or ASP7 primer, respectively.
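The three-stage RACE programs are easier to compare when written as data. The sketch below encodes the first 5' RACE program (TnFDL ASP6 with the GeneRacer™ 5' Primer) and sums its cycles and programmed hold times. The class and function names are hypothetical, and ramp times between temperatures are ignored, so the result is a lower bound on the run time.

```python
from dataclasses import dataclass


@dataclass
class Stage:
    cycles: int            # number of times the stage is repeated
    step_seconds: tuple    # hold time of each step within one cycle


# First 5' RACE PCR as described in the text.
INITIAL_DENATURATION_S = 30
first_race_pcr = [
    Stage(cycles=5, step_seconds=(20, 60)),       # two-step cycles at 96/72 deg C
    Stage(cycles=13, step_seconds=(20, 20, 40)),  # touchdown stage, 72 to 60 deg C
    Stage(cycles=30, step_seconds=(20, 20, 40)),  # fixed annealing at 60 deg C
]


def total_cycles(stages):
    """Total number of amplification cycles across all stages."""
    return sum(stage.cycles for stage in stages)


def total_hold_seconds(initial_s, stages):
    """Sum of programmed hold times, excluding temperature ramps."""
    return initial_s + sum(s.cycles * sum(s.step_seconds) for s in stages)


print(total_cycles(first_race_pcr))
print(total_hold_seconds(INITIAL_DENATURATION_S, first_race_pcr) / 60)
```

For this program the totals are 48 cycles and 64.5 minutes of programmed hold time.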
3' RACE
3' RACE-ready cDNA was prepared from total T. ni RNA isolated as described above. Reverse transcription was carried out using Thermo-X™ reverse transcriptase with the GeneRacer™ Oligo dT primer. The reaction was set up according to the manufacturer's instructions and incubated in the same fashion as for 5' RACE. The reaction was diluted with 40 μL TE buffer and stored at -20°C.
3' RACE was carried out using the TnFDL SP4 primer and the GeneRacer™ 3' Primer. The PCR was incubated for 30 sec at 96°C, then cycled 5 times using (i) 20 sec at 96°C, (ii) 45 sec at 72°C, after which the reaction was cycled 13 times using (i) 20 sec at 96°C, (ii) 15 sec at 72 to 60°C (with a decreasing temperature gradient of 1°C per cycle), and (iii) 30 sec at 72°C. The reaction was then cycled another 30 times using (i) 20 sec at 96°C, (ii) 15 sec at 60°C, and (iii) 30 sec at 72°C. The spent reaction was separated on a 1.4% agarose gel, and a specific band of approximately 600 bp was isolated and purified. This DNA fragment was re-amplified using the TnFDL SP5 primer and the GeneRacer™ 3' Nested Primer. The PCRs were incubated for 15 sec at 96°C, then cycled 5 times using (i) 15 sec at 96°C, (ii) 35 sec at 72°C, after which the reactions were cycled 13 times using (i) 15 sec at 96°C, (ii) 15 sec at 72 to 60°C (with a decreasing temperature gradient of 1°C per cycle), and (iii) 20 sec at 72°C. The reactions were then cycled another 30 times using (i) 15 sec at 96°C, (ii) 15 sec at 60°C, and (iii) 20 sec at 72°C. The spent reactions were separated on a 1.4% agarose gel, and a specific amplification product of 500 bp was isolated, purified and directly sequenced using the TnFDL SP5 primer.
Amplification of the full-length TnFDL open reading frame for baculovirus expression
The full-length open reading frame was amplified from both cDNA primed with the GeneRacer™ Oligo dT Primer and genomic DNA (including the intron) using the TnFDL FL SP2 BD and TnFDL ASP BD primers. The reactions were incubated for 20 sec at 98°C, then cycled 25 times using (i) 15 sec at 98°C, (ii) 10 sec at 72 to 60°C (with a decreasing temperature gradient of 0.5°C per cycle), and (iii) 60 sec at 72°C. The reactions were then cycled another 30 times using (i) 15 sec at 98°C, (ii) 10 sec at 60°C, and (iii) 60 sec at 72°C. The spent reactions were separated on a 1% agarose gel, and amplification products of the expected size were excised and purified. The DNA fragments from the reactions templated with cDNA and gDNA were cloned into the pENTR™/D-TOPO® vector according to the manufacturer's instructions, yielding pENTR-TnFDL-C and pENTR-TnFDL-G, respectively. Four clones of each were sequenced, and a consensus clone of pENTR-TnFDL-C was used with Invitrogen's BaculoDirect™ kit according to the manufacturer's instructions to yield AcTnFDL.
B. mori fdl
Bombyx mori genomic database search results
A tBLASTn search of the available Bombyx mori genomic sequences was carried out with the SfFDL conceptual translation as query using the online NCBI interface. This search yielded, amongst others, the sequences BAAB01046610, BAAB01083831 and BAAB01153187. The sequence BAAB01046610 encodes a putative 5' coding exon with a start codon (nts 25-200). The conceptual translation of this exon shows high similarity to the amino-terminal part of SfFDL. The sequences BAAB01083831 and BAAB01153187 could be joined in silico to yield a contig encoding the putative 3' coding exon, including a stop codon. The conceptual translation of this exon showed high similarity to the carboxy-terminal part of SfFDL. The 5' coding exon could be joined in silico to the 3' coding exon at splice junctions predicted with high probability by NetGene2 (Hebsgaard et al., Nucleic Acids Res. 24:3439-3452), yielding a contiguous open reading frame.
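The in silico joining of two coding exons followed by conceptual translation can be sketched as follows. The exon sequences here are short hypothetical stand-ins, not the B. mori sequences, and the code assumes the standard genetic code and that the intron has already been removed at the predicted splice junctions.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(orf):
    """Translate an open reading frame, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(orf) - 2, 3):
        amino_acid = CODON_TABLE[orf[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Hypothetical exons: the junction falls inside a codon (AAA G | GT ...),
# so the reading frame is only continuous after the exons are joined.
exon_5prime = "ATGGCTAAAG"   # contains the start codon
exon_3prime = "GTTGGTAA"     # contains the stop codon
joined_orf = exon_5prime + exon_3prime

print(len(joined_orf) % 3 == 0)
print(translate(joined_orf))
```

A joined sequence whose length is a multiple of three and whose translation runs from the start codon to a single terminal stop codon is consistent with a contiguous open reading frame, which is the criterion described above.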
Amplification of the full-length BmFDL open reading frame for baculovirus expression
Primers designed to amplify the entire predicted open reading frame with the additional sequence CACC 5' to the initiation codon were used in PCRs to amplify the open reading frame from cDNA as well as genomic DNA (including the intron). Genomic DNA was prepared by a modification of the method of Laird et al. (supra) from a single stage 2 B. mori larva (Qiufeng/Baiyu hybrid). Briefly, the larva was homogenized in lysis buffer supplemented with RNAse A, after which the homogenate was incubated at 55°C for 1 hour. The lysate was then centrifuged at 13,000 x g to remove debris, and DNA was precipitated by addition of an equal volume of isopropyl alcohol. The DNA was dissolved in 500 μL of TE buffer and cleaned once by phenol-chloroform extraction.
Total RNA was isolated from a single stage 2 B. mori larva (Qiufeng/Baiyu hybrid) using the Qiagen RNeasy™ Mini Plus kit. The larva was homogenized in Buffer RLT Plus, followed by centrifugation at 13,000 x g to remove debris. Total RNA was subsequently isolated according to the manufacturer's instructions. Total B. mori RNA was used to prepare 5' RACE-ready RNA using the Invitrogen GeneRacer™ kit according to the manufacturer's instructions. In two separate reactions, 5' RACE-ready RNA and total RNA were used for reverse transcription with Invitrogen Thermoscript™ reverse transcriptase using the BmFDL ASP1 primer and the GeneRacer™ Oligo dT Primer, respectively. The reactions were set up according to the manufacturer's instructions and incubated for (i) 5 min at 50°C, (ii) 15 min at 55°C, (iii) 30 min at 60°C and finally for (iv) 15 min at 65°C. The reactions were diluted with 40 μL TE buffer and stored at -20°C.
The predicted full-length open reading frame was amplified from both cDNA primed with the GeneRacer™ Oligo dT Primer and genomic DNA (including the intron). The PCRs were set up using the BmFDL FL SP2 and BmFDL ASP1CLO primers and incubated in the same fashion as for the amplification of the full-length TnFDL open reading frame. The spent reactions were separated on a 1% agarose gel, and bands of the expected size were isolated and purified. The DNA fragments from the reactions templated with genomic DNA and cDNA were cloned into the pENTR™/D-TOPO® vector according to the manufacturer's instructions, yielding pENTR-BmFDL-G and pENTR-BmFDL-C, respectively.
Four clones of each were sequenced, yielding two distinct alleles from both gDNA and cDNA. Despite a substantial number of nucleotide substitutions, the conceptual translation of one of these alleles is identical to the conceptual translation of the putative fdl gene identified from the p50 (Daizo) strain. The two alleles differ from each other at several nucleotides in the intron and both exons. However, only three nucleotide changes are not silent, resulting in the L138I, G404E and H481Q amino acid changes.
pENTR-BmFDL-C was used with Invitrogen's BaculoDirect™ kit according to the manufacturer's instructions to generate AcBmFDL.
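The distinction drawn above between silent and non-silent nucleotide differences can be illustrated by comparing two coding sequences codon by codon. The sketch below uses two short hypothetical alleles rather than the BmFDL sequences, assumes the standard genetic code, and assumes the alleles are the same length with no insertions or deletions. Non-silent changes are reported in the same notation used in the text (for example L138I).

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def compare_alleles(cds_a, cds_b):
    """Return (silent codon positions, non-silent changes) between two CDSs."""
    silent, nonsilent = [], []
    for i in range(0, min(len(cds_a), len(cds_b)) - 2, 3):
        codon_a, codon_b = cds_a[i:i + 3], cds_b[i:i + 3]
        if codon_a == codon_b:
            continue
        aa_a, aa_b = CODON_TABLE[codon_a], CODON_TABLE[codon_b]
        position = i // 3 + 1  # 1-based residue number
        if aa_a == aa_b:
            silent.append(position)
        else:
            nonsilent.append(f"{aa_a}{position}{aa_b}")
    return silent, nonsilent


# Hypothetical alleles: codon 2 CTG->ATT (Leu->Ile), codon 3 GGT->GGC
# (Gly->Gly, silent), codon 4 CAT->CAG (His->Gln).
allele_a = "ATGCTGGGTCAT"
allele_b = "ATGATTGGCCAG"
print(compare_alleles(allele_a, allele_b))
```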
5' RACE
5' RACE was carried out using the BmFDL ASP4 primer and the GeneRacer™ 5' Primer with 5' RACE-ready cDNA prepared as described above. Reactions were incubated for 30 sec at 96°C, then cycled 5 times using (i) 15 sec at 96°C, (ii) 45 sec at 72°C, after which the reactions were cycled 12 times using (i) 15 sec at 96°C, (ii) 15 sec at 72 to 61°C (with a decreasing temperature gradient of 1°C per cycle), and (iii) 30 sec at 72°C. The reactions were then cycled another 30 times using (i) 15 sec at 96°C, (ii) 15 sec at 61°C and (iii) 30 sec at 72°C, and finally incubated for 1 min at 72°C. The spent reactions were separated on a 1.2% agarose gel. A specific band of about 570 bp was isolated, purified and re-amplified using the BmFDL ASP5 primer and the GeneRacer™ 5' Nested Primer under the same cycling conditions. The nested 5' RACE reactions were separated on a 1.4% agarose gel, showing a specific band of the expected 550 bp. This band was excised, purified and sequenced using the BmFDL ASP5 primer.
3' RACE
3' RACE was carried out using the BmFDL SP4 primer and the GeneRacer™ 3' Primer with 3' RACE-ready cDNA. Reactions were cycled in the same fashion as described above for 5' RACE. The spent reaction was analyzed on a 1.4% agarose gel, showing a specific faint band at 450 bp. This band was excised, purified and used for nested 3' RACE reactions with the BmFDL SP5 primer and the GeneRacer™ 3' Nested Primer under the same cycling conditions. The spent reactions showed a strong, specific band at the expected size of 420 bp. This band was excised, purified and sequenced using the BmFDL SP5 primer.
Primer Table
EXAMPLE III
Inhibition of specific, processing β-N-acetylglucosaminidase activity by an Sf-fdl-specific double-stranded RNA
If the Sf-fdl gene encodes the specific, processing β-N-acetylglucosaminidase activity in Sf9 cells, it should be possible to reduce this activity by RNA interference with Sf-fdl-specific double-stranded RNA. Towards this end, we constructed an immediate early expression plasmid encoding an inverted repeat sequence derived from a portion of the Sf-fdl coding sequence (Fig. 12) and used it to isolate a transformed Sf9 cell subclone, as described above. Microsomal membranes were then isolated from the parental cell line or the subclone and used for β-N-acetylglucosaminidase activity assays with GnGn as the substrate. HPLC analysis of the reaction products showed that the microsomal membranes from both the parental Sf9 cells and the transformed subclone converted GnGn to GnM, but not detectably to MM or MGn (data not shown). Thus, in this examination of endogenous β-N-acetylglucosaminidase activities in microsomal membranes from uninfected Sf9 cells, we detected only the specific, processing enzyme activity. Furthermore, we found that the levels of this specific, processing β-N-acetylglucosaminidase activity were over 50% lower in the membranes from the transformed subclone, relative to the parental controls (Fig. 11). The reduced levels of specific, processing β-N-acetylglucosaminidase activity in the Sf9 subclone transformed with the constitutive expression plasmid encoding Sf-fdl-specific double-stranded RNA strongly support the conclusion that the Sf-fdl gene encodes this activity.
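An inverted repeat of the kind described above consists of a sense fragment, a spacer, and the reverse complement of the same fragment, so that the transcript can fold back on itself to form double-stranded RNA. The sketch below shows only this general arrangement. The fragment and spacer are hypothetical placeholders and do not reproduce the construct of Fig. 12 or SEQ ID NO:9.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def inverted_repeat(fragment, spacer):
    """Sense fragment, spacer, then the antisense copy of the fragment."""
    return fragment.upper() + spacer.upper() + reverse_complement(fragment)


# Hypothetical coding fragment and loop-forming spacer, for illustration only.
fragment = "ATGGCTAAAG"
spacer = "TTCAAGAGA"
construct = inverted_repeat(fragment, spacer)
print(construct)
```

Because the second arm is the exact reverse complement of the first, the two arms of the transcript are fully base-paired and the spacer forms the loop.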
While SEQ ID NO:9 is specific for downregulating expression of the Sf-fdl-encoding nucleic acid, provision of the sequence information for the T. ni and B. mori homologs readily enables the skilled artisan to generate additional specific RNAi molecules for inhibiting expression of the same. Indeed, computer programs are available online which can assist in the design of such molecules.
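One check that such design tools commonly perform is a search for short stretches shared between a candidate RNAi fragment and an unintended transcript. A minimal sketch of that idea follows. The 21-nt default window is an assumption based on typical siRNA length, the sequences are hypothetical, and only one strand is compared; this does not represent any particular online program.

```python
def kmers(seq, k=21):
    """Set of all length-k substrings of a sequence."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def shared_kmers(candidate, off_target, k=21):
    """Length-k stretches present in both the candidate and the off-target."""
    return kmers(candidate, k) & kmers(off_target, k)


# Hypothetical sequences; a small window is used so the toy example has hits.
candidate = "ATGGCTAAAGGTTGG"
off_target = "CCCGCTAAAGCCC"
print(sorted(shared_kmers(candidate, off_target, k=5)))
```

A candidate fragment with no shared windows against the other transcripts of the host cell would be preferred under this simple criterion.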
From the foregoing description, those skilled in the art will appreciate that the presence or absence of a specific, processing β-N-acetylglucosaminidase is a key difference in the protein N-glycan processing pathways of insects and higher eukaryotes. In insect systems, this function was first identified as an enzyme activity in crude microsomal membranes isolated from a cell line derived from the lepidopteran insect, S. frugiperda (9). Efforts to molecularly clone the gene encoding this enzyme in these cells yielded two recent reports describing a single gene alternatively termed SfGlcNAcase-3 (18) and SfHex (17). Biochemical assays revealed that the SfGlcNAcase-3/SfHex gene product lacked the strict substrate specificity of the enzyme activity originally described by Altmann and co-workers in 1995. Based upon this and other findings, one group concluded in their report that the SfGlcNAcase-3/SfHex gene product is more likely to be involved in glycan and chitin degradation than in N-glycan processing (18). In contrast, based upon a slight preference for the appropriate substrate, the other group concluded in their report that this gene product is involved in N-glycan processing and hypothesized that it serves multifunctional roles in both N-glycan processing and glycan degradation in Sf9 cells (17). Parallel efforts to molecularly clone the processing β-N-acetylglucosaminidase from Drosophila melanogaster yielded a report describing the Dm-fdl gene and the characteristics of the gene product (16). Based upon its substrate specificity and the presence of a higher level of N-glycans containing terminal N-acetylglucosamine residues in mutant flies lacking a functional fdl gene, this report concluded that the fdl gene encoded the β-N-acetylglucosaminidase involved in N-glycan processing in this fruitfly.
In accordance with the present invention, we isolated an fdl gene from Sf9 cells and demonstrated that it encodes a membrane-associated β-N-acetylglucosaminidase with the same, strict substrate specificity exhibited by Dm-FDL and by the enzyme activity originally detected in S. frugiperda microsomes (9). The fact that the Sf9 genome encodes a gene with a close phylogenetic relationship to Dm-fdl, the fact that the Sf-fdl gene product is membrane-associated and has the strict substrate specificity and pH optimum profile of the original activity detected in S. frugiperda microsomes, and the fact that Sf9 cells engineered to express Sf-fdl-specific double-stranded RNA have lower levels of specific, processing β-N-acetylglucosaminidase activity all tend to support the view that the Sf-fdl gene encodes the β-N-acetylglucosaminidase involved in N-glycan processing in Sf9 cells. In addition, these findings support our previous conclusion that the broad spectrum β-N-acetylglucosaminidase encoded by the SfGlcNAcase-3/SfHex gene is more likely involved in glycan and chitin degradation.
The fdl gene orthologs were isolated from the lepidopteran insect species Spodoptera frugiperda, Trichoplusia ni and Bombyx mori, as cell lines derived from these insect species are commonly used with the baculovirus expression system.
REFERENCES
1. Marz, L., Altmann, F., Staudacher, E., and Kubelka, V. (1995) Protein glycosylation in insects. In: Montreuil, J., Vliegenthart, J. F. G., and Schachter, H. (eds). Glycoproteins, Elsevier, Amsterdam
2. Marchal, I., Jarvis, D. L., Cacan, R., and Verbert, A. (2001) Biol. Chem. 382, 151-159
3. O'Reilly, D. R., Miller, L. K., and Luckow, V. A. (1992) Baculovirus expression vectors, W.H. Freeman and Company, New York
4. Summers, M. D., and Smith, G. E. (1987) Tx. Ag. Expt. Stn. Bull. No. 1555
5. Jarvis, D. L. (1997) Baculovirus expression vectors. In: Miller, L. K. (ed). The Baculoviruses, Plenum Press, New York
6. Fischer, R., Stoger, E., Schillberg, S., Christou, P., and Twyman, R. M. (2004) Curr. Op. Plant Biol. 7(2), 152-158
7. Ma, J. K., Drake, P. M., and Christou, P. (2003) Nat. Rev. Genet. 4(10), 794-805
8. Kornfeld, R., and Kornfeld, S. (1985) Ann. Rev. Biochem. 54, 631-664
9. Altmann, F., Schwihla, H., Staudacher, E., Glossl, J., and Marz, L. (1995) J. Biol. Chem. 270, 17344-17349
10. Zhang, W., Cao, P., Chen, S., Spence, A. M., Zhu, S., Staudacher, E., and Schachter, H. (2003) Biochem. J. 372(Pt 1), 53-64
11. Gutternigg, M., Kretschmer-Lubich, D., Paschinger, K., Rendic, D., Hader, J., Geier, P., Ranftl, R., Jantsch, V., Lochnit, G., and Wilson, I. B. (2007) J. Biol. Chem. 282(38), 27825-27840
12. Vitale, A., and Chrispeels, M. J. (1984) J. Cell Biol. 99(1 Pt 1), 133-140
13. Sturm, A. (1995) N-glycosylation of proteins in plants. In: Montreuil, J., Vliegenthart, J. F. G., and Schachter, H. (eds). Glycoproteins, Elsevier, Amsterdam
14. Vaughn, J. L., Goodwin, R. H., Thompkins, G. J., and McCawley, P. (1977) In Vitro 13, 213-217
15. Wagner, R., Geyer, H., Geyer, R., and Klenk, H. D. (1996) J. Virol. 70(6), 4103-4109
16. Leonard, R., Rendic, D., Rabouille, C., Wilson, I. B., Preat, T., and Altmann, F. (2006) J. Biol. Chem. 281(8), 4867-4875
17. Tomiya, N., Narang, S., Park, J., Abdul-Rahman, B., Choi, O., Singh, S., Hiratake, J., Sakata, K., Betenbaugh, M. J., Palter, K. B., and Lee, Y. C. (2006) J. Biol. Chem. 281(28), 19545-19560
18. Aumiller, J. J., Hollister, J., and Jarvis, D. L. (2006) Prot. Expr. Purif. 47, 571-590
19. Altschul, S. F., Madden, T. L., Schaffer, A. A., Zhang, J., Zhang, Z., Miller, W., and Lipman, D. J. (1997) Nucl. Acids Res. 25, 3389-3402
20. Brunak, S., Engelbrecht, J., and Knudsen, S. (1991) J. Mol. Biol. 220(1), 49-65
21. Thompson, J. D., Gibson, T. J., Plewniak, F., Jeanmougin, F., and Higgins, D. G. (1997) Nucl. Acids Res. 25, 4876-4882
22. Innis, M. A., and Gelfand, D. H. (1990) Optimization of PCRs. In: Innis, M. A., Gelfand, D. H., Sninsky, J. J., and White, T. J. (eds). PCR Protocols: A Guide to Methods and Applications, Academic Press, San Diego
23. Sambrook, J., Fritsch, E. F., and Maniatis, T. (1989) Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Press, Cold Spring Harbor, New York
24. Shi, X., and Jarvis, D. L. (2006) Analyt Biochem 356(2), 222-228
25. Hofmann, K., and Stoffel, W. (1993) Biol. Chem. Hoppe-Seyler 374, 166
26. Kitts, P. A., and Possee, R. D. (1993) Biotechniques 14(5), 810-817
27. Bao and Cagan (2006) RNA 12, 2020-2024
28. Jarvis, D. L., Weinkauf, C., and Guarino, L. A. (1996) Prot. Expr. Purif. 8, 191-203
29. von Heijne, G. (1992) J. Mol. Biol. 225(2), 487-494
30. Paulson, J. C., and Colley, K. J. (1989) J. Biol. Chem. 264, 17615-17618
31. Breton, C., Mucha, J., and Jeanneau, C. (2001) Biochimie 83(8), 713-718
32. Field, M. C., and Wainwright, L. J. (1995) 5, 463-472
33. Myerowitz, R., Piekarz, R., Neufeld, E. F., Shows, T. B., and Suzuki, K. (1985) Proceedings of the National Academy of Science of the United States of America 82(23), 7830-7834
34. O'Dowd, B. F., Quan, F., Willard, H. F., Lamhonwah, A. M., Korneluk, R. G., Lowden, J. A., Gravel, R. A., and Mahuran, D. J. (1985) Proceedings of the National Academy of Science of the United States of America 82(4), 1184-1188
35. Gaunt, M. W., and Miles, M. A. (2002) Mol. Biol. Evol. 19, 748-761
36. Felsenstein, J. (1989) Cladistics 5, 164-166.
While certain of the preferred embodiments of the present invention have been described and specifically exemplified above, it is not intended that the invention be limited to such embodiments. Various modifications may be made thereto without departing from the scope and spirit of the present invention, as set forth in the following claims. Furthermore, the transitional phrases "comprising", "consisting essentially of" and "consisting of" define the scope of the appended claims, in original and amended form, with respect to what unrecited additional claim elements or steps. The term "comprising" is intended to be inclusive or open-ended and does not exclude additional, unrecited elements, method steps or materials. The phrase "consisting of" excludes any element, step or material other than those specified in the claim, and, in the latter instance, impurities ordinarily associated with the specified materials. The phrase "consisting essentially of" limits the scope of a claim to the specified elements, steps or materials and those that do not materially affect the basic and novel characteristic(s) of the claimed invention. All compositions or formulations identified herein can, in alternate embodiments, be more specifically defined by any of the transitional phrases "comprising", "consisting essentially of" and "consisting of".
Claims
1. An isolated nucleic acid encoding a β-N-acetylglucosaminidase having a sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6 and SEQ ID NO:8.
2. An isolated nucleic acid encoding a β-N-acetylglucosaminidase of SEQ ID NO:2.
3. An isolated nucleic acid encoding a β-N-acetylglucosaminidase of SEQ ID NO:4.
4. An isolated nucleic acid encoding a β-N-acetylglucosaminidase of SEQ ID NO:6.
5. An isolated nucleic acid encoding a β-N-acetylglucosaminidase of SEQ ID NO:8.
6. The isolated nucleic acid of claim 1, which is SEQ ID NO:1.
7. The isolated nucleic acid of claim 1, which is SEQ ID NO:3.
8. The isolated nucleic acid of claim 1, which is SEQ ID NO:5.
9. The isolated nucleic acid of claim 1, which is SEQ ID NO:7.
10. The isolated nucleic acid molecule of claim 1, which is a DNA molecule.
11. An RNA molecule encoded by at least one of the nucleic acid molecules of claim 1.
12. An expression vector comprising at least one of the nucleic acid molecules of claim 1.
13. A recombinant insect cell transformed with the expression vector of claim 12.
14. The RNA molecule of claim 11, which is a fragment of SEQ ID NO: 1, having SEQ ID NO: 9, said RNA being double stranded.
15. An expression vector comprising the RNA molecule of claim 14.
16. A recombinant transgenic insect cell comprising the expression vector of claim 15.
17. A method for enhancing production of mammalian-like N-glycans in insect cells, comprising a) providing the recombinant insect cells of claim 16; b) transforming said cells with an expression vector comprising a nucleic acid encoding a heterologous glycoprotein of interest, said glycoprotein expressed in said cells of a) comprising elevated levels of mammalian-like N-glycans when compared to levels observed in wild type cells.
18. The recombinant insect cells of claim 16, further comprising at least one glycosylation enzyme selected from the group consisting of N-acetylglucosaminyltransferases, galactosyltransferases, sialyltransferases, sulfotransferases, sialic acid synthases, CMP-sialic acid synthetases, UDP-N-acetylglucosamine-2-epimerases/N-acetylmannosamine kinases, and CMP-sialic acid transporters.
19. A method for inhibiting β-N-acetylglucosaminidase activity comprising contacting an Sf-fdl-expressing cell with a nucleic acid molecule comprising SEQ ID NO:9 in an amount effective to down-regulate β-N-acetylglucosaminidase endogenous to that cell.
20. An isolated protein comprising a sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6 and SEQ ID NO:8.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/746,808 US8846886B2 (en) | 2007-12-14 | 2008-12-12 | Lepidopteran insect N-acetylglucosaminidase genes and their use in glycoengineering |
US14/495,800 US20150191732A1 (en) | 2007-12-14 | 2014-09-24 | Lepidopteran insect n-acetylglucosaminidase genes and their use in glycoengineering |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US1381507P | 2007-12-14 | 2007-12-14 | |
US61/013,815 | 2007-12-14 |
Related Child Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US12/746,808 A-371-Of-International US8846886B2 (en) | 2007-12-14 | 2008-12-12 | Lepidopteran insect N-acetylglucosaminidase genes and their use in glycoengineering |
US14/495,800 Continuation US20150191732A1 (en) | 2007-12-14 | 2014-09-24 | Lepidopteran insect n-acetylglucosaminidase genes and their use in glycoengineering |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2009079376A2 true WO2009079376A2 (en) | 2009-06-25 |
WO2009079376A3 WO2009079376A3 (en) | 2009-08-20 |
Family
ID=40796104
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2008/086606 WO2009079376A2 (en) | 2007-12-14 | 2008-12-12 | Lepidopteran insect n-acetylglucosaminidase genes and their use in glycoengineering |
Country Status (2)
Country | Link |
---|---|
US (2) | US8846886B2 (en) |
WO (1) | WO2009079376A2 (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2012050175A1 (en) | 2010-10-15 | 2012-04-19 | 日本ケミカルリサーチ株式会社 | Method for producing glycoprotein having mannose residue as non-reducing end of sugar chain |
CN102586208A (en) * | 2012-03-13 | 2012-07-18 | 中国农业科学院饲料研究所 | Protein with Beta-N-acetamido glucosaminidase activity as well as encoding gene and application thereof |
WO2017214632A3 (en) * | 2016-06-10 | 2018-02-08 | University Of Wyoming | Recombinant insect vectors and methods of use |
CN116790638A (en) * | 2023-08-24 | 2023-09-22 | 中国农业科学院草原研究所 | Asiatic dolly locust UDP-N-acetamido glucose pyrophosphorylase gene and application thereof |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20150093782A1 (en) * | 2013-10-01 | 2015-04-02 | The University Of Wyoming | Compositions and methods for reducing fucosylation of glycoproteins in insect cells and methods of use thereof for production of recombinant glycoproteins |
CN116376875B (en) * | 2023-03-03 | 2024-02-23 | 云南师范大学 | N-acetylglucosaminidase mutant with improved heat resistance and application thereof |
CN116555229B (en) * | 2023-05-25 | 2024-09-20 | 云南师范大学 | N-acetylglucosaminidase mutant, recombinant expression vector, bacterium and application |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050208558A1 (en) * | 1999-10-19 | 2005-09-22 | Applera Corporation | Detection kits, such as nucleic acid arrays, for detecting the expression or 10,000 or more Drosophila genes and uses thereof |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6218185B1 (en) | 1996-04-19 | 2001-04-17 | The United States Of America As Represented By The Secretary Of Agriculture | Piggybac transposon-based genetic transformation system for insects |
AU779239C (en) | 1999-10-19 | 2006-04-06 | Minos Biosystems Limited | Protein production system |
-
2008
- 2008-12-12 US US12/746,808 patent/US8846886B2/en active Active
- 2008-12-12 WO PCT/US2008/086606 patent/WO2009079376A2/en active Application Filing
-
2014
- 2014-09-24 US US14/495,800 patent/US20150191732A1/en not_active Abandoned
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2012050175A1 (en) | 2010-10-15 | 2012-04-19 | 日本ケミカルリサーチ株式会社 | Method for producing glycoprotein having mannose residue as non-reducing end of sugar chain |
CN103154243A (en) * | 2010-10-15 | 2013-06-12 | 日本化学研究株式会社 | Method for producing glycoprotein having mannose residue as non-reducing end of sugar chain |
US20130224797A1 (en) * | 2010-10-15 | 2013-08-29 | Jcr Pharmaceuticals Co., Ltd. | Method for producing glycoprotein having mannose residue as non-reducing end of sugar chain |
KR20130119932A (en) | 2010-10-15 | 2013-11-01 | 니홍 케미칼 리써치 가부시키가이샤 | Method for producing glycoprotein having mannose residue as non-reducing end of sugar chain |
US11649474B2 (en) | 2010-10-15 | 2023-05-16 | Jcr Pharmaceuticals Co., Ltd. | Method for producing glycoprotein having mannose residue as non-reducing end of sugar chain |
CN102586208A (en) * | 2012-03-13 | 2012-07-18 | 中国农业科学院饲料研究所 | Protein with Beta-N-acetamido glucosaminidase activity as well as encoding gene and application thereof |
WO2017214632A3 (en) * | 2016-06-10 | 2018-02-08 | University Of Wyoming | Recombinant insect vectors and methods of use |
CN116790638A (en) * | 2023-08-24 | 2023-09-22 | 中国农业科学院草原研究所 | Asiatic dolly locust UDP-N-acetamido glucose pyrophosphorylase gene and application thereof |
CN116790638B (en) * | 2023-08-24 | 2023-11-14 | 中国农业科学院草原研究所 | Asiatic dolly locust UDP-N-acetamido glucose pyrophosphorylase gene and application thereof |
Also Published As
Publication number | Publication date |
---|---|
US20150191732A1 (en) | 2015-07-09 |
US20100279415A1 (en) | 2010-11-04 |
US8846886B2 (en) | 2014-09-30 |
WO2009079376A3 (en) | 2009-08-20 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
US20150191732A1 (en) | Lepidopteran insect n-acetylglucosaminidase genes and their use in glycoengineering | |
JP6663904B2 (en) | Materials and methods for the synthesis of nucleic acid molecules that minimize errors | |
Harrison et al. | Protein N‐glycosylation in the baculovirus–insect cell expression system and engineering of insect cells to produce “mammalianized” recombinant glycoproteins | |
US8546105B2 (en) | Engineering intracellular sialylation pathways | |
Geisler et al. | A fused lobes gene encodes the processing β-N-acetylglucosaminidase in Sf9 cells | |
US11136558B2 (en) | Compositions and methods for reducing fucosylation of glycoproteins in insect cells and methods of use thereof for production of recombinant glycoproteins | |
Mabashi-Asazuma et al. | A novel baculovirus vector for the production of nonfucosylated recombinant glycoproteins in insect cells | |
CA2363297C (en) | Engineering intracellular sialylation pathways | |
WO2005042753A1 (en) | Production of human glycosylated proteins in transgenic insects | |
Masuda et al. | Mass production of an active peptide-N-glycosidase F using silkworm-baculovirus expression system | |
US9045778B2 (en) | Insect cell line for production of recombinant glycoproteins with sulfated complex N-glycans | |
Aumiller et al. | Molecular cloning and functional characterization of β-N-acetylglucosaminidase genes from Sf9 cells | |
WO2002086119A1 (en) | Transformed silkworm producing human collagen | |
Nomura et al. | Improvement of glycosylation structure by suppression of β-N-acetylglucosaminidases in silkworm | |
CN113549560B (en) | Construction method of engineering yeast for glycoprotein preparation and strain thereof | |
Rendić et al. | Towards abolition of immunogenic structures in insect cells: characterization of a honey-bee (Apis mellifera) multi-gene family reveals both an allergy-related core α1, 3-fucosyltransferase and the first insect Lewis-histo-blood-group-related antigen-synthesizing enzyme | |
US20100186099A1 (en) | Production of Human Glycosylated Proteins in Silk Worm | |
Fan et al. | Cloning and functional expression of a chitinase cDNA from the apple leaf miner moth Lithocolletis ringoniella | |
Ihara et al. | Cloning, expression and characterization of Bombyx mori α1, 6-fucosyltransferase | |
Minagawa et al. | Identification of Core alpha 1, 3-Fucosyltransferase gene from silkworm: an insect popularly used to express mammalian proteins | |
Juliant et al. | The α1, 6-fucosyltransferase gene (fut8) from the Sf 9 lepidopteran insect cell line: insights into fut8 evolution | |
JP2019017259A (en) | Endoglycosidase that specifically cuts fucose-containing sugar chain | |
Licari et al. | Production of a discrete, heterogeneous population of β‐galactosidase polypeptides using baculovirus expression vectors | |
JP5308055B2 (en) | β-N-acetylhexosaminidase | |
CN110343721B (en) | Method for delaying pupation of silkworms | |
Legal Events
Code | Title | Description |
---|---|---|
121 | EP: the EPO has been informed by WIPO that EP was designated in this application | Ref document number: 08861290; Country of ref document: EP; Kind code of ref document: A2 |
WWE | WIPO information: entry into national phase | Ref document number: 12746808; Country of ref document: US |
NENP | Non-entry into the national phase | Ref country code: DE |
122 | EP: PCT application non-entry in European phase | Ref document number: 08861290; Country of ref document: EP; Kind code of ref document: A2 |